BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018511
(355 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449455876|ref|XP_004145676.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
gi|449492872|ref|XP_004159127.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
Length = 521
Score = 567 bits (1461), Expect = e-159, Method: Compositional matrix adjust.
Identities = 280/342 (81%), Positives = 300/342 (87%), Gaps = 10/342 (2%)
Query: 21 HHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSR-------EVVSK 73
H PL + S IS+S R +F +S +RR N S SSS+TLVAGSR E V+K
Sbjct: 24 HRPLLLLSKISVSAPRISHFSNSFSPIRRWNVCS--ASSSETLVAGSRKENGKTGEAVTK 81
Query: 74 KEED-LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
KE+D GDLK+WMH NGLPPCKVIL+EKPSH++ HRPIHYVAASEDL+ GD AFSVPNSL
Sbjct: 82 KEDDEFGDLKAWMHDNGLPPCKVILEEKPSHDKNHRPIHYVAASEDLEVGDVAFSVPNSL 141
Query: 133 VVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
VVTLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL
Sbjct: 142 VVTLERVLGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQL 201
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
AVESPLLWSE EL YL+GSPTK E+LERAEGIK+EYNELDTVWFMAGSLFQQYPYDIPTE
Sbjct: 202 AVESPLLWSEDELDYLSGSPTKKEVLERAEGIKKEYNELDTVWFMAGSLFQQYPYDIPTE 261
Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
AF+FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY S CKAML AVD AV+LV
Sbjct: 262 AFSFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLTAVDGAVELV 321
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
VDRPYKAGESI VWCGPQPNSKLL+NYGFVDEDN YDRLVVE
Sbjct: 322 VDRPYKAGESIAVWCGPQPNSKLLLNYGFVDEDNRYDRLVVE 363
>gi|225452167|ref|XP_002264334.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Vitis vinifera]
Length = 509
Score = 561 bits (1447), Expect = e-157, Method: Compositional matrix adjust.
Identities = 268/304 (88%), Positives = 284/304 (93%), Gaps = 7/304 (2%)
Query: 58 SSSDTLVAGSR-------EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI 110
S SDTLVAGSR E KKE++ GDLKSWMH+NGLPPCKV+LKE+PSH+E+H+ I
Sbjct: 48 SGSDTLVAGSRKEDGRVSEAARKKEDEFGDLKSWMHENGLPPCKVVLKERPSHHEQHKAI 107
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
HY+AASEDLQAGD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ
Sbjct: 108 HYIAASEDLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 167
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
GKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGIKREYNE
Sbjct: 168 GKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGIKREYNE 227
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
LDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFALVPLGPP
Sbjct: 228 LDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPP 287
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
LLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDEDN YDR
Sbjct: 288 LLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDEDNSYDR 347
Query: 351 LVVE 354
+VVE
Sbjct: 348 IVVE 351
>gi|359488614|ref|XP_003633789.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Vitis vinifera]
Length = 515
Score = 555 bits (1431), Expect = e-155, Method: Compositional matrix adjust.
Identities = 268/310 (86%), Positives = 284/310 (91%), Gaps = 13/310 (4%)
Query: 58 SSSDTLVAGSR-------EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI 110
S SDTLVAGSR E KKE++ GDLKSWMH+NGLPPCKV+LKE+PSH+E+H+ I
Sbjct: 48 SGSDTLVAGSRKEDGRVSEAARKKEDEFGDLKSWMHENGLPPCKVVLKERPSHHEQHKAI 107
Query: 111 HYVAASEDLQ------AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
HY+AASEDLQ AGD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYL
Sbjct: 108 HYIAASEDLQGFLLLQAGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYL 167
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
MYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYLTGSPTKAE+LERAEGI
Sbjct: 168 MYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLTGSPTKAEVLERAEGI 227
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
KREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFAL
Sbjct: 228 KREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFAL 287
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
VPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIVVWCGPQPNSKLL+NYGFVDE
Sbjct: 288 VPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIVVWCGPQPNSKLLLNYGFVDE 347
Query: 345 DNPYDRLVVE 354
DN YDR+VVE
Sbjct: 348 DNSYDRIVVE 357
>gi|224117488|ref|XP_002331687.1| SET domain protein [Populus trichocarpa]
gi|222874165|gb|EEF11296.1| SET domain protein [Populus trichocarpa]
Length = 502
Score = 550 bits (1417), Expect = e-154, Method: Compositional matrix adjust.
Identities = 279/360 (77%), Positives = 300/360 (83%), Gaps = 22/360 (6%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME +C +K ISP L++ S +SIS P S RR F+
Sbjct: 1 MEFTC--LHNKCISPS-------LTVLSRVSISFSNLPKRAVSFHRRRRNLCFA------ 45
Query: 61 DTLVAGSR--EVVSKK----EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVA 114
TLV G R EVVSK+ E++ GDLKSWMHKNGLPPCKV+LKE+PSH++K RPIHYVA
Sbjct: 46 -TLVDGKRTSEVVSKRGGEEEDEFGDLKSWMHKNGLPPCKVVLKERPSHDKKLRPIHYVA 104
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDLQA D A SVPNSLVVTLERVLGNET+AELLTTNKLSELACLALYLMYEKKQGKKS
Sbjct: 105 ASEDLQASDVAVSVPNSLVVTLERVLGNETLAELLTTNKLSELACLALYLMYEKKQGKKS 164
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
FW PYIRELDRQRGRGQLAVESPLLWSE ELAYLTGSPTKAE+L+RA+GIKREY ELDTV
Sbjct: 165 FWYPYIRELDRQRGRGQLAVESPLLWSEAELAYLTGSPTKAEVLDRADGIKREYEELDTV 224
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
WFMAGSLFQQYPYDIPTEAF FEIFKQAFVA+QSCVVHLQKVSLARRFALVPLGPPLLAY
Sbjct: 225 WFMAGSLFQQYPYDIPTEAFPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPPLLAY 284
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
SS CKAML AVD AV+LVVDRPYKAGE IVVWCGPQPNSKLL+NYGFVDEDNPYDR+ VE
Sbjct: 285 SSNCKAMLTAVDGAVELVVDRPYKAGEPIVVWCGPQPNSKLLLNYGFVDEDNPYDRIAVE 344
>gi|22326803|ref|NP_196930.2| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|30684815|ref|NP_851038.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|42573363|ref|NP_974778.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|17473570|gb|AAL38260.1| putative protein [Arabidopsis thaliana]
gi|23297671|gb|AAN13005.1| unknown protein [Arabidopsis thaliana]
gi|332004624|gb|AED92007.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004625|gb|AED92008.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004626|gb|AED92009.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
Length = 514
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 275/363 (75%), Positives = 300/363 (82%), Gaps = 16/363 (4%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 293
Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
LAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR+
Sbjct: 294 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRV 353
Query: 352 VVE 354
+VE
Sbjct: 354 IVE 356
>gi|18377718|gb|AAL67009.1| unknown protein [Arabidopsis thaliana]
Length = 514
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 275/363 (75%), Positives = 300/363 (82%), Gaps = 16/363 (4%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 293
Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
LAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR+
Sbjct: 294 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRV 353
Query: 352 VVE 354
+VE
Sbjct: 354 IVE 356
>gi|297807453|ref|XP_002871610.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317447|gb|EFH47869.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 516
Score = 535 bits (1378), Expect = e-149, Method: Compositional matrix adjust.
Identities = 272/363 (74%), Positives = 299/363 (82%), Gaps = 14/363 (3%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V ++ + SSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNTKLISSSRSVPLRS-LCVSASSS 55
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVAG +V SKKE +D DLK WM KNGLPPCKV+LKE+P+H+ K++PIH
Sbjct: 56 DTLVAGGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVLLKERPAHDLKYKPIH 115
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 116 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 175
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 176 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 235
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHLQ V LARRFALVPLGPPL
Sbjct: 236 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPL 295
Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
LAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+KLL+NYGFVDEDNPYDR+
Sbjct: 296 LAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDEDNPYDRI 355
Query: 352 VVE 354
+VE
Sbjct: 356 IVE 358
>gi|296090251|emb|CBI40070.3| unnamed protein product [Vitis vinifera]
Length = 428
Score = 528 bits (1359), Expect = e-147, Method: Compositional matrix adjust.
Identities = 248/270 (91%), Positives = 262/270 (97%)
Query: 85 MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
MH+NGLPPCKV+LKE+PSH+E+H+ IHY+AASEDLQAGD AFSVP+SLVVTLERVLGNET
Sbjct: 1 MHENGLPPCKVVLKERPSHHEQHKAIHYIAASEDLQAGDVAFSVPDSLVVTLERVLGNET 60
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
IAELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+E
Sbjct: 61 IAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESE 120
Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
LAYLTGSPTKAE+LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFV
Sbjct: 121 LAYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFV 180
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
A+QSCVVHLQKVSLARRFALVPLGPPLLAY S CKAMLAAVD +VQLVVDRPYKAGESIV
Sbjct: 181 AIQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVVDRPYKAGESIV 240
Query: 325 VWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
VWCGPQPNSKLL+NYGFVDEDN YDR+VVE
Sbjct: 241 VWCGPQPNSKLLLNYGFVDEDNSYDRIVVE 270
>gi|357497055|ref|XP_003618816.1| SET domain protein [Medicago truncatula]
gi|355493831|gb|AES75034.1| SET domain protein [Medicago truncatula]
Length = 501
Score = 528 bits (1359), Expect = e-147, Method: Compositional matrix adjust.
Identities = 256/320 (80%), Positives = 279/320 (87%), Gaps = 2/320 (0%)
Query: 36 RDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKV 95
R P+F S RR+ R S+SDTLVA + + K++ED GDLK+WMHKNGLPPCKV
Sbjct: 27 RLPSFLSLSTNHRRRRRSFCSASNSDTLVAATGK--KKRDEDDGDLKTWMHKNGLPPCKV 84
Query: 96 ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS 155
+LK+KPS ++ +PIHYVAASEDLQ GD AFSVPNSLVVTLERVLGNETIAELLTTNK S
Sbjct: 85 VLKDKPSLDDSVKPIHYVAASEDLQKGDIAFSVPNSLVVTLERVLGNETIAELLTTNKFS 144
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
ELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE+ELAYL GSP K
Sbjct: 145 ELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESELAYLEGSPLKD 204
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
EI++R EGI++EYNELDTVWFM+GSLFQQYPYD+PTEAF FEIFKQAF AVQSCVVHLQ
Sbjct: 205 EIVKRIEGIRKEYNELDTVWFMSGSLFQQYPYDLPTEAFPFEIFKQAFAAVQSCVVHLQN 264
Query: 276 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
VSLARRFALVPLGPPLLAY S CKAML AVD AVQLVVDRPYKAG+ IVVWCGPQPN+KL
Sbjct: 265 VSLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVQLVVDRPYKAGDPIVVWCGPQPNTKL 324
Query: 336 LINYGFVDEDNPYDRLVVEV 355
L NYGFVDEDN DRL+VEV
Sbjct: 325 LTNYGFVDEDNSNDRLIVEV 344
>gi|7573451|emb|CAB87765.1| putative protein [Arabidopsis thaliana]
Length = 537
Score = 526 bits (1356), Expect = e-147, Method: Compositional matrix adjust.
Identities = 275/381 (72%), Positives = 300/381 (78%), Gaps = 34/381 (8%)
Query: 1 MEASCSLRSSKFISPPIRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSS 60
ME + +K +S PIR PLS S S+ R+ SS R V + S+ VSSS
Sbjct: 1 MEGVITCFHTKCVSLPIR--SFPLSRVS--SLPRWRNNKLISSSRSVHLR---SLCVSSS 53
Query: 61 DTLVAGSR--------EVVSKKE-EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
DTLVA +V SKKE +D DLK WM KNGLPPCKVILKE+P+H++KH+PIH
Sbjct: 54 DTLVASGSPKEDERQSKVSSKKEGDDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIH 113
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
YVAASEDLQ GD AFSVP+SLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG
Sbjct: 114 YVAASEDLQKGDVAFSVPDSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 173
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PYIRELDRQRGRGQL ESPLLWSE EL YLTGSPTKAE+LERAEGIKREYNEL
Sbjct: 174 KKSVWYPYIRELDRQRGRGQLDAESPLLWSEAELDYLTGSPTKAEVLERAEGIKREYNEL 233
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL------------------ 273
DTVWFMAGSLFQQYP+DIPTEAF+FEIFKQAFVA+QSCVVHL
Sbjct: 234 DTVWFMAGSLFQQYPFDIPTEAFSFEIFKQAFVAIQSCVVHLQVVLVASSNLDCYASSCT 293
Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
Q V LARRFALVPLGPPLLAY S CKAML AVD AV+LVVDRPYKAG+ IVVWCGPQPN+
Sbjct: 294 QNVGLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVELVVDRPYKAGDPIVVWCGPQPNA 353
Query: 334 KLLINYGFVDEDNPYDRLVVE 354
KLL+NYGFVDEDNPYDR++VE
Sbjct: 354 KLLLNYGFVDEDNPYDRVIVE 374
>gi|116786810|gb|ABK24248.1| unknown [Picea sitchensis]
Length = 507
Score = 502 bits (1292), Expect = e-139, Method: Compositional matrix adjust.
Identities = 244/320 (76%), Positives = 273/320 (85%), Gaps = 7/320 (2%)
Query: 42 SSLRLVRRKNRFSIRVSSSDTLVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKV 95
S +RL R F + V S+DTL A S ++ + KEE++ DLKSWMH++GLPPC+V
Sbjct: 32 SRVRLPGRCVGFPMVVYSADTLTASSQHGEDKKDAIRGKEEEV-DLKSWMHRHGLPPCRV 90
Query: 96 ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS 155
+LKE+PS + KH+PI YVAASEDLQ GD AFS+PNSL+VTLERVLGNETIAELLTTNKLS
Sbjct: 91 MLKERPSPDGKHKPIKYVAASEDLQPGDVAFSIPNSLIVTLERVLGNETIAELLTTNKLS 150
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
ELACLALYLMYEKKQG +SFW P+IRELDRQRGRGQLAVESPLLWS EL Y TGSP K
Sbjct: 151 ELACLALYLMYEKKQGNQSFWRPFIRELDRQRGRGQLAVESPLLWSSEELKYFTGSPMKE 210
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
+LER GIKREY ELDTVWFMAGSLF+QYPYDIPTEAF FEIFKQAFVAVQSCVVHLQ
Sbjct: 211 IMLERNSGIKREYEELDTVWFMAGSLFKQYPYDIPTEAFPFEIFKQAFVAVQSCVVHLQN 270
Query: 276 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
V+LARRFALVPLGPPLL+Y S CKAML AV D+VQL VDR YKAGE IVVWCGPQPN++L
Sbjct: 271 VNLARRFALVPLGPPLLSYKSNCKAMLKAVGDSVQLEVDREYKAGEPIVVWCGPQPNARL 330
Query: 336 LINYGFVDEDNPYDRLVVEV 355
L+NYGFVDEDNP+DRL+VEV
Sbjct: 331 LLNYGFVDEDNPHDRLIVEV 350
>gi|356511552|ref|XP_003524489.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Glycine max]
Length = 503
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 231/275 (84%), Positives = 254/275 (92%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DLKSWMHK+GLPPCKV+LK+KP N+ H+PIHYVAAS+DLQ GD AFSVPNSLVVTLERV
Sbjct: 70 DLKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERV 129
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
LGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLL
Sbjct: 130 LGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLL 189
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
WS++EL YL+GSP K E+++R E I++EY ELDTVWFMAGSLFQQYPYDIPTEAF+FEIF
Sbjct: 190 WSKSELDYLSGSPIKDEVIQREEAIRKEYKELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
KQAF A+QSCVVHLQKVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKA
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKA 309
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
G+ IVVWCGPQPNSKLLINYGFVDE+N DRL+VE
Sbjct: 310 GDPIVVWCGPQPNSKLLINYGFVDENNSNDRLIVE 344
>gi|356571407|ref|XP_003553868.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 1 [Glycine max]
gi|356571409|ref|XP_003553869.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 2 [Glycine max]
Length = 502
Score = 499 bits (1286), Expect = e-139, Method: Compositional matrix adjust.
Identities = 231/275 (84%), Positives = 254/275 (92%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DLKSWMHK+GLPPCKV+LK+KP N+ H+PIHYVAAS+DLQ GD AFSVPNSLVVTLERV
Sbjct: 70 DLKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERV 129
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
LGNET+AELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQL+VESPLL
Sbjct: 130 LGNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLL 189
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W ++EL YL+GSP K E+++R E I++EYNELDTVWFMAGSLFQQYPYDIPTEAF+FEIF
Sbjct: 190 WLKSELDYLSGSPIKDEVIQREEAIRKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
KQAF A+QSCVVHLQKVSLARRFALVPLGPPLL+Y S CKAML AVD AV+L VDRPYKA
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAMLTAVDGAVELAVDRPYKA 309
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
G+ IVVWCGPQPNSKLLINYGFVDE+N DRL+VE
Sbjct: 310 GDPIVVWCGPQPNSKLLINYGFVDENNSNDRLIVE 344
>gi|326510275|dbj|BAJ87354.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525555|dbj|BAJ88824.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 523
Score = 457 bits (1175), Expect = e-126, Method: Compositional matrix adjust.
Identities = 228/362 (62%), Positives = 270/362 (74%), Gaps = 23/362 (6%)
Query: 8 RSSKFISPP-----IRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDT 62
RSS+ +PP + HH L + + R P GS R +R + +DT
Sbjct: 14 RSSEARAPPMASSALSGTHHRLLLPCFLR----RLPQPGS-----RSCSRLRLAACHADT 64
Query: 63 LVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP---SHNEKHRPIHYV 113
L++ S G W+ NGLPP K+ + E+P S + RP+H+V
Sbjct: 65 LLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFV 124
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
AA +DL+AGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+
Sbjct: 125 AAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRD 184
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGIK+EYNELDT
Sbjct: 185 SLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGIKKEYNELDT 244
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL
Sbjct: 245 LWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT 304
Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
Y S CKAML AVD +V+L+VDRPYKAGE I+VWCGPQPNS+LL+NYGFVDEDNPYDR+ +
Sbjct: 305 YKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDEDNPYDRIAI 364
Query: 354 EV 355
E
Sbjct: 365 EA 366
>gi|168044593|ref|XP_001774765.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673920|gb|EDQ60436.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 523
Score = 456 bits (1174), Expect = e-126, Method: Compositional matrix adjust.
Identities = 232/325 (71%), Positives = 255/325 (78%), Gaps = 7/325 (2%)
Query: 38 PNFGSSLRLVRRKNRFSIRVSSSDTLVAGSRE---VVSKKEEDLG--DLKSWMHKNGLPP 92
P FG+ V + R S ++ T V E SKK+E DLK WM + GLP
Sbjct: 45 PRFGTQKVAVSSEKRGSRCRNTLTTDVYKQDENDLAQSKKQEHESGIDLKQWMEEQGLPE 104
Query: 93 CKVILKE-KPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
CKV L E +PS +K +PIHYV ASEDLQ G+ A ++P SLVVTLERVLG+ETIAELLTT
Sbjct: 105 CKVSLAEHQPSEGDKGKPIHYVVASEDLQPGELALTIPKSLVVTLERVLGDETIAELLTT 164
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTG 210
NKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS EL Y TG
Sbjct: 165 NKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSREELNEYFTG 224
Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
S K +LER GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQAFVAVQSCV
Sbjct: 225 STMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFVAVQSCV 284
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
VHLQ VSLARRFALVPLGPPLLAY S CKAML AVDD V L VDR YKAG+ I VWCGPQ
Sbjct: 285 VHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVDDNVVLEVDRAYKAGDPIAVWCGPQ 344
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEV 355
PNSKLL+NYGFVDEDNPYDRL VE
Sbjct: 345 PNSKLLLNYGFVDEDNPYDRLAVEA 369
>gi|326503142|dbj|BAJ99196.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 425
Score = 456 bits (1174), Expect = e-126, Method: Compositional matrix adjust.
Identities = 228/362 (62%), Positives = 270/362 (74%), Gaps = 23/362 (6%)
Query: 8 RSSKFISPP-----IRPPHHPLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDT 62
RSS+ +PP + HH L + + R P GS R +R + +DT
Sbjct: 9 RSSEARAPPMASSALSGTHHRLLLPCFLR----RLPQPGS-----RSCSRLRLAACHADT 59
Query: 63 LVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP---SHNEKHRPIHYV 113
L++ S G W+ NGLPP K+ + E+P S + RP+H+V
Sbjct: 60 LLSSSGAQGPPSPAACLSASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFV 119
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
AA +DL+AGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+
Sbjct: 120 AAGQDLEAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRD 179
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W PYI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP + E++ R EGIK+EYNELDT
Sbjct: 180 SLWYPYIKELDRQRGRGQLAVESPLLWTESELDYLNGSPMRDEVVVRDEGIKKEYNELDT 239
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
+WFMAGSLF+QYP+D+PTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL
Sbjct: 240 LWFMAGSLFKQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLT 299
Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
Y S CKAML AVD +V+L+VDRPYKAGE I+VWCGPQPNS+LL+NYGFVDEDNPYDR+ +
Sbjct: 300 YKSNCKAMLTAVDGSVRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDEDNPYDRIAI 359
Query: 354 EV 355
E
Sbjct: 360 EA 361
>gi|125536207|gb|EAY82695.1| hypothetical protein OsI_37912 [Oryza sativa Indica Group]
Length = 505
Score = 456 bits (1172), Expect = e-125, Method: Compositional matrix adjust.
Identities = 213/277 (76%), Positives = 242/277 (87%), Gaps = 3/277 (1%)
Query: 81 LKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
W+ ++GLPP KV + ++P K P+HYVAA +DL+AGD AF VP SLVVTLE
Sbjct: 71 FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 130
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
RVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESP
Sbjct: 131 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 190
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
LLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FE
Sbjct: 191 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 250
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPY
Sbjct: 251 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPY 310
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
KAGE I+VWCGPQPNS+LL+NYGF+DEDNPYDR+V+E
Sbjct: 311 KAGEPIIVWCGPQPNSRLLLNYGFIDEDNPYDRIVIE 347
>gi|115487958|ref|NP_001066466.1| Os12g0236900 [Oryza sativa Japonica Group]
gi|77554044|gb|ABA96840.1| SET domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113648973|dbj|BAF29485.1| Os12g0236900 [Oryza sativa Japonica Group]
Length = 509
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 213/277 (76%), Positives = 242/277 (87%), Gaps = 3/277 (1%)
Query: 81 LKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
W+ ++GLPP KV + ++P K P+HYVAA +DL+AGD AF VP SLVVTLE
Sbjct: 75 FSDWLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLE 134
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
RVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW PYI+ELDRQRGRGQLAVESP
Sbjct: 135 RVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESP 194
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
LLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMAGSLFQQYP+DIPTEAF FE
Sbjct: 195 LLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFE 254
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AV D+V+LVVDRPY
Sbjct: 255 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDSVRLVVDRPY 314
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
KAGE I+VWCGPQPNS+LL+NYGF+DEDNPYDR+V+E
Sbjct: 315 KAGEPIIVWCGPQPNSRLLLNYGFIDEDNPYDRIVIE 351
>gi|357160358|ref|XP_003578740.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 516
Score = 454 bits (1167), Expect = e-125, Method: Compositional matrix adjust.
Identities = 211/274 (77%), Positives = 241/274 (87%), Gaps = 3/274 (1%)
Query: 84 WMHKNGLPPCKVILKEKP---SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
W+ +GLPP KV + E+P S K RP+H+VAA +DL+ GD AF +P SLVVTLERVL
Sbjct: 85 WLLTHGLPPGKVAILERPVPCSRGGKDRPLHFVAAGQDLEVGDVAFEMPMSLVVTLERVL 144
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G+E++AELLTTNKLSELACLALYLMYEKKQGK S W PYI+ELDRQRGRGQLAVESPLLW
Sbjct: 145 GDESVAELLTTNKLSELACLALYLMYEKKQGKDSLWYPYIKELDRQRGRGQLAVESPLLW 204
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
+E+EL YL GSP + E++ R EGI+REYNELDT+WFMAGSLF+QYP+D+PTEAF FEIFK
Sbjct: 205 TESELDYLNGSPMRDEVVVRDEGIRREYNELDTLWFMAGSLFKQYPFDVPTEAFPFEIFK 264
Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
QAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML AVDD+V+LVVDRPYKAG
Sbjct: 265 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVDDSVRLVVDRPYKAG 324
Query: 321 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
E I+VWCGPQPNS+LL+NYGFVDEDNPYDR+ +E
Sbjct: 325 EPIIVWCGPQPNSRLLLNYGFVDEDNPYDRIAIE 358
>gi|242053769|ref|XP_002456030.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
gi|241928005|gb|EES01150.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
Length = 512
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 210/275 (76%), Positives = 235/275 (85%), Gaps = 3/275 (1%)
Query: 83 SWMHKNGLPPCKVILKEKPSH---NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
W+ GLPP KV ++E+P N K P+ YVAA DLQAGD AF VP SLVVTLERV
Sbjct: 80 DWLRARGLPPGKVDIRERPVPCLLNGKDLPLRYVAAGVDLQAGDVAFEVPMSLVVTLERV 139
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
LG+E+IAELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLL
Sbjct: 140 LGDESIAELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLL 199
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W+E+EL YLTGSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIF
Sbjct: 200 WTESELDYLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIF 259
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S CKAML A D+V+LVVDRPYKA
Sbjct: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTADGDSVRLVVDRPYKA 319
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
GE I++WCGPQ NS+L++NYGFVDEDNP+DR+ +E
Sbjct: 320 GEPIIIWCGPQTNSRLVLNYGFVDEDNPFDRIAIE 354
>gi|414881266|tpg|DAA58397.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 512
Score = 439 bits (1130), Expect = e-121, Method: Compositional matrix adjust.
Identities = 213/300 (71%), Positives = 242/300 (80%), Gaps = 3/300 (1%)
Query: 58 SSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPS---HNEKHRPIHYVA 114
SSS+ A V E W+ GLPP KV ++E+P + K +P+ YV+
Sbjct: 55 SSSEARAAPGPAVEPSSESATDCFVDWLRARGLPPGKVDIRERPVPCLRDGKDQPLRYVS 114
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A DLQAGD AF V SLVVTLERVLG+E+IAELLT NKLSELACLALYLMYEKKQGK S
Sbjct: 115 AVVDLQAGDVAFEVSMSLVVTLERVLGDESIAELLTNNKLSELACLALYLMYEKKQGKDS 174
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
FW PYI+ELDR RGRGQLAVESPLLW+E+EL YLTGSP K E++ R E I+REYNELDT+
Sbjct: 175 FWYPYIKELDRHRGRGQLAVESPLLWTESELDYLTGSPLKDEVVARDEAIRREYNELDTL 234
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y
Sbjct: 235 WFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTY 294
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
S CKAML A D+V+LVVDRPYKAGE I++WCGPQ NS+L++NYGFVDEDNP+DR+ +E
Sbjct: 295 RSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGPQTNSRLVLNYGFVDEDNPFDRVAIE 354
>gi|168020073|ref|XP_001762568.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686301|gb|EDQ72691.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 427
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 210/272 (77%), Positives = 229/272 (84%), Gaps = 2/272 (0%)
Query: 85 MHKNGLPPCKVILKE-KPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
M + GLP C V L E + + +K +PIHYV AS+DLQ GD A +VP SLVVTLERVLG+E
Sbjct: 1 MEEQGLPKCNVALVEHQLAEGDKGKPIHYVVASQDLQPGDVALTVPKSLVVTLERVLGDE 60
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
TIAELLTTNKLSELACLALYLMYEKKQGK+S+W PYIRELDRQRGRGQL+V SPLLWS
Sbjct: 61 TIAELLTTNKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSPE 120
Query: 204 EL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
EL Y TGS K +LER GIKREY ELDTVWFMAGSLF+QYP+D+PTEAF+FEIFKQA
Sbjct: 121 ELNEYFTGSTMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQA 180
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
FVAVQSCVVHLQ VSLARRFALVPLGPPLLAY S CKAML AV D VQL VD YK G+
Sbjct: 181 FVAVQSCVVHLQGVSLARRFALVPLGPPLLAYKSNCKAMLKAVGDNVQLEVDHAYKTGDP 240
Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
I VWCGPQPNSKLL+NYGFVDEDNP+DRL VE
Sbjct: 241 IAVWCGPQPNSKLLLNYGFVDEDNPFDRLAVE 272
>gi|302794360|ref|XP_002978944.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
gi|300153262|gb|EFJ19901.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
Length = 432
Score = 427 bits (1099), Expect = e-117, Method: Compositional matrix adjust.
Identities = 209/273 (76%), Positives = 230/273 (84%), Gaps = 4/273 (1%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
WM + GLPPCKV LKE+ + + I YV ASEDL+ GD A SVP SLVVTLERVLGN
Sbjct: 2 DWMLEQGLPPCKVSLKERDLNG---KTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGN 58
Query: 143 ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 202
ETIAELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+
Sbjct: 59 ETIAELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTS 118
Query: 203 TEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
EL Y TGS K +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQ
Sbjct: 119 EELDEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQ 178
Query: 262 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 321
AFVAVQSCVVHLQ VSL RRFALVPLGPPLLAY S CKAML A D V+L VDR YK GE
Sbjct: 179 AFVAVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGE 238
Query: 322 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
I+VWCGPQPN++LL+NYGFVD DNP+DRL VE
Sbjct: 239 QILVWCGPQPNTRLLLNYGFVDPDNPHDRLSVE 271
>gi|302809535|ref|XP_002986460.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
gi|300145643|gb|EFJ12317.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
Length = 432
Score = 427 bits (1098), Expect = e-117, Method: Compositional matrix adjust.
Identities = 209/273 (76%), Positives = 230/273 (84%), Gaps = 4/273 (1%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
WM + GLPPCKV LKE+ + + I YV ASEDL+ GD A SVP SLVVTLERVLGN
Sbjct: 2 DWMLEQGLPPCKVSLKERDLNG---KTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGN 58
Query: 143 ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 202
ETIAELLTTNKLSELACLALYLMYEKK+GK+SFW P+IRELDRQRGRGQ+AVESPLLW+
Sbjct: 59 ETIAELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTS 118
Query: 203 TEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
EL Y TGS K +LER EGIKREY ELDTVWFMAGSLF++YP+DIPTEAF+FEIFKQ
Sbjct: 119 EELDEYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQ 178
Query: 262 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 321
AFVAVQSCVVHLQ VSL RRFALVPLGPPLLAY S CKAML A D V+L VDR YK GE
Sbjct: 179 AFVAVQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAMLKAAGDLVRLEVDRAYKKGE 238
Query: 322 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
I+VWCGPQPN++LL+NYGFVD DNP+DRL VE
Sbjct: 239 QILVWCGPQPNTRLLLNYGFVDPDNPHDRLSVE 271
>gi|125578929|gb|EAZ20075.1| hypothetical protein OsJ_35675 [Oryza sativa Japonica Group]
Length = 536
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 198/236 (83%), Positives = 218/236 (92%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLP 178
LQAGD AF VP SLVVTLERVLG+E++AELLTTNKLSELACLALYLMYEKKQG+ SFW P
Sbjct: 143 LQAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYP 202
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YI+ELDRQRGRGQLAVESPLLW+E+EL YL GSP K E++ R EGI+REYNELDT+WFMA
Sbjct: 203 YIKELDRQRGRGQLAVESPLLWTESELNYLKGSPIKDEVVARDEGIRREYNELDTLWFMA 262
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
GSLFQQYP+DIPTEAF FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL Y S C
Sbjct: 263 GSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNC 322
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
KAML AV D+V+LVVDRPYKAGE I+VWCGPQPNS+LL+NYGF+DEDNPYDR+V+E
Sbjct: 323 KAMLTAVGDSVRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGFIDEDNPYDRIVIE 378
>gi|255536985|ref|XP_002509559.1| conserved hypothetical protein [Ricinus communis]
gi|223549458|gb|EEF50946.1| conserved hypothetical protein [Ricinus communis]
Length = 348
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 174/190 (91%), Positives = 182/190 (95%)
Query: 85 MHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET 144
MHKNGLPPCKV+LKE+PSH+ K RPIHYVAASEDLQ GD AFSVPNSLVVTLERVLGNET
Sbjct: 1 MHKNGLPPCKVVLKERPSHDAKLRPIHYVAASEDLQTGDVAFSVPNSLVVTLERVLGNET 60
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
+ ELLTTNKLSELACLALYLMYEKKQGKKSFW PYIRELDRQRGRGQLAVESPLLWSE E
Sbjct: 61 VVELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAE 120
Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
LAYLTGSPTKAE+LERA+GIKREY+ELDTVWFMAGSLFQQYPYDIPTEAF FEIFKQAFV
Sbjct: 121 LAYLTGSPTKAEVLERADGIKREYDELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAFV 180
Query: 265 AVQSCVVHLQ 274
A+QSCVVHLQ
Sbjct: 181 AIQSCVVHLQ 190
>gi|384246822|gb|EIE20311.1| hypothetical protein COCSUDRAFT_48681 [Coccomyxa subellipsoidea
C-169]
Length = 539
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 151/244 (61%), Positives = 192/244 (78%), Gaps = 3/244 (1%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ DLQAG+ A +P+ LV+TL+RV +E++AELLTT+KLSELACL LYLMYEKK G++S
Sbjct: 49 AARDLQAGELALRIPDHLVITLDRVFEDESLAELLTTDKLSELACLTLYLMYEKKNGRQS 108
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDT 233
W +I+ELDR +GRGQ+ +SPLLW E ++ YL GSP AEI ER +GI++EY ELDT
Sbjct: 109 VWYEFIKELDRIQGRGQMGAKSPLLWDEGQVDEYLAGSPLVAEIKERLKGIEKEYAELDT 168
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
VWFMAGSLF+ YPYD+PTEAF+ ++F+Q F AVQ+ VVHLQ V L++RFALVPLGPPLL+
Sbjct: 169 VWFMAGSLFKSYPYDVPTEAFSLKLFRQGFAAVQASVVHLQGVPLSKRFALVPLGPPLLS 228
Query: 294 YSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
YSS KAML +A VQL VDR Y GE I WCGPQPN +LL+NYG V ++NP+D++
Sbjct: 229 YSSTAKAMLTYNREAKEVQLAVDRSYTKGEPIEAWCGPQPNRRLLLNYGIVTDNNPHDKM 288
Query: 352 VVEV 355
+ V
Sbjct: 289 ALTV 292
>gi|302847476|ref|XP_002955272.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
gi|300259344|gb|EFJ43572.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
Length = 488
Score = 320 bits (819), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 156/277 (56%), Positives = 200/277 (72%), Gaps = 8/277 (2%)
Query: 80 DLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+L W+ +NG + +V + PS RP+ V A L AG+ A SVP L +TL+
Sbjct: 43 ELVDWLRENGAKIDAVEVKTMDVPSAG---RPLDVVVAGRSLAAGEVALSVPERLCLTLD 99
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
R+ +E +AELLTT+KLSELACLALYLMYEKK KKSFW PYI+ELD+Q+ RG A ESP
Sbjct: 100 RIFESEFVAELLTTDKLSELACLALYLMYEKKLKKKSFWYPYIKELDKQQARGPQAAESP 159
Query: 198 LLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
LLW + EL + L GSP + +R GI++EY LDTVWFMAGSLF +YP+D+PTE F+F
Sbjct: 160 LLWGDQELDSLLKGSPLLPAVRQRQAGIRKEYEALDTVWFMAGSLFNKYPFDLPTETFSF 219
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVD 314
E+F+QAF VQ+ +VHLQ V +A+RFALVPLGPPL+AYSS K M+ +D +V+LVV
Sbjct: 220 ELFQQAFAVVQASIVHLQGVPIAKRFALVPLGPPLMAYSSTSKNMMTYDEDSRSVRLVVS 279
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
P +AG + WCGPQPNS+LL+NYG VDE NP+D+L
Sbjct: 280 GPVEAGRPVAAWCGPQPNSRLLLNYGVVDEHNPFDKL 316
>gi|307107385|gb|EFN55628.1| hypothetical protein CHLNCDRAFT_57818 [Chlorella variabilis]
Length = 435
Score = 310 bits (794), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 151/278 (54%), Positives = 196/278 (70%), Gaps = 6/278 (2%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
+ W+ ++G P KV L+ + + A+E LQ GD A +P L+VTL+RVL
Sbjct: 9 MMQWLTESGAPQQKVKLQTVVREGTE---VDITVAAEALQPGDVALRIPEHLIVTLDRVL 65
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ T+AEL+TT KLSELACL LYL YEKK+GK+ W +I+ELDR +GRG +SPLLW
Sbjct: 66 EDNTLAELVTTGKLSELACLTLYLAYEKKRGKEGCWYRFIKELDRMQGRGSQGAKSPLLW 125
Query: 201 SETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
E + A L GSP EI R +GI++EY ELDTVW++AGSLF + P+ PTE F+F +F
Sbjct: 126 DEGQAAELLAGSPVVGEIEARLQGIRKEYEELDTVWYLAGSLFNRQPFSPPTEQFSFPVF 185
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPY 317
+QAF AVQS VVHLQ V+L +RFALVP+GPPLL YSS KAML ++ V+L VDR Y
Sbjct: 186 RQAFTAVQSSVVHLQGVALGKRFALVPMGPPLLTYSSTAKAMLKFDPESHEVRLAVDRAY 245
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ GE+++ WCGPQPNS+LLINYG VDE NPYD+L + +
Sbjct: 246 QPGEAVLAWCGPQPNSRLLINYGIVDESNPYDKLPLSI 283
>gi|413950741|gb|AFW83390.1| hypothetical protein ZEAMMB73_201403, partial [Zea mays]
Length = 130
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 108/128 (84%), Positives = 117/128 (91%)
Query: 147 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 206
ELLT NKLSELACLALYLMYEKKQGK SFW PYI+ELDR RGRGQLAVESPLLW+E+EL
Sbjct: 1 ELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLWTESELD 60
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
YL+GSP K E++ R E I+REYNELDT+WFMAGSLFQQYP+DIPTEAF FEIFKQAFVAV
Sbjct: 61 YLSGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIFKQAFVAV 120
Query: 267 QSCVVHLQ 274
QSCVVHLQ
Sbjct: 121 QSCVVHLQ 128
>gi|212721460|ref|NP_001132025.1| uncharacterized protein LOC100193433 [Zea mays]
gi|194693232|gb|ACF80700.1| unknown [Zea mays]
gi|414881264|tpg|DAA58395.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
gi|414881265|tpg|DAA58396.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 252
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 62/85 (72%), Positives = 74/85 (87%)
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
++ QKVSLARRFALVPLGPPLL Y S CKAML A D+V+LVVDRPYKAGE I++WCGP
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDSVRLVVDRPYKAGEPIIIWCGP 69
Query: 330 QPNSKLLINYGFVDEDNPYDRLVVE 354
Q NS+L++NYGFVDEDNP+DR+ +E
Sbjct: 70 QTNSRLVLNYGFVDEDNPFDRVAIE 94
>gi|413950742|gb|AFW83391.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
gi|413950743|gb|AFW83392.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 252
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 59/85 (69%), Positives = 73/85 (85%)
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
++ QKVSLARRFALVPLGPPLL Y S CKAML ++V+LVVDRPYKAGE I++WCGP
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGP 69
Query: 330 QPNSKLLINYGFVDEDNPYDRLVVE 354
Q NS+L++NYGFVDE+NP+DR+ +E
Sbjct: 70 QTNSRLVLNYGFVDENNPFDRISIE 94
>gi|413950744|gb|AFW83393.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 281
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 59/85 (69%), Positives = 73/85 (85%)
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
++ QKVSLARRFALVPLGPPLL Y S CKAML ++V+LVVDRPYKAGE I++WCGP
Sbjct: 10 LIQEQKVSLARRFALVPLGPPLLTYKSNCKAMLTVDGESVRLVVDRPYKAGEPIIIWCGP 69
Query: 330 QPNSKLLINYGFVDEDNPYDRLVVE 354
Q NS+L++NYGFVDE+NP+DR+ +E
Sbjct: 70 QTNSRLVLNYGFVDENNPFDRISIE 94
>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 638
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 95/282 (33%), Positives = 134/282 (47%), Gaps = 30/282 (10%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L L W+ K G P VIL E+ + AA D + G+ A +P + VT
Sbjct: 82 LHQLSEWLSKQGFPTQDVILT---GFGEEGVGL---AAGRDFKEGEVALKIPENYTVTGV 135
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
V+ + +A ++ L L+LMYE+ G+KS W PY++ SP
Sbjct: 136 DVVNHPVVAAPAAGR--GDVIGLTLWLMYERSLGEKSVWYPYLQTFPS-------TTLSP 186
Query: 198 LLWS-ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
+LW+ E + L GSP E+ +R+ ++ EY +L S F + P P E F+
Sbjct: 187 ILWTAEEQQKLLKGSPALEEVQQRSAALEGEYEDLQ-------SYFTKDPQAFPQEYFSL 239
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVD 314
E FK AF + S V+L L FALVP L + + +A L + D AV VD
Sbjct: 240 EAFKSAFSVILSRAVYLPSADL---FALVPYADA-LNHRADSQAYLDYSMEDQAVVFPVD 295
Query: 315 RPYKAGESIVVWCG-PQPNSKLLINYGFVDEDNPYDRLVVEV 355
R YK GE + G + N+ LLI YGFVDE+N D L +EV
Sbjct: 296 RNYKEGEQVFTSYGRERSNADLLITYGFVDENNAMDYLDLEV 337
>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Ectocarpus siliculosus]
Length = 521
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 86/293 (29%), Positives = 137/293 (46%), Gaps = 31/293 (10%)
Query: 81 LKSWMHKNGL-----------PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
LK WM +NG+ P + + NE + A+ +++ GD F++P
Sbjct: 92 LKEWMGENGVWVYDKSDWGVGPHALSVAVDTVDENENETAGRGMIANREIKEGDELFTLP 151
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
L++T + E A+++T + LSE +AL ++EK +GK+SFW YI L
Sbjct: 152 IDLLLT-KDAAKKEFGADVITED-LSEYIAIALLAVHEKAKGKESFWSSYIGVLPTVE-- 207
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
V LW+E +LA L GSP A ++ EY ++ L ++P +
Sbjct: 208 ---EVYPTYLWAEEDLALLEGSPVIAATESMRRKLEVEYATVEN------DLLDKFPEIL 258
Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------PPLLAY-SSKCKAML 302
P E T+E F+ AF + S + L +S ALVP P +Y ++ + +
Sbjct: 259 PREVHTYEEFQWAFAMLFSRAIRLGGLSTGEAVALVPYADLFNHNPFANSYIDARQQGLF 318
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ D V + DR YK E + + GP+ NS LL+ YGF + NPY+ + V V
Sbjct: 319 FSKTDEVVVYADRSYKKMEQVYISYGPKGNSDLLLLYGFSLDRNPYNSVDVTV 371
>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 504
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 135/279 (48%), Gaps = 27/279 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNPEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWYNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P+ P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPHLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E FK +F + S +V L S+ RFALVP +L ++ + + L V
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFT 289
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
DRPY+ GE + + G + N +LL++YGFV + NP D
Sbjct: 290 TDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSD 328
>gi|21537309|gb|AAM61650.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
Length = 504
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 134/279 (48%), Gaps = 27/279 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEANLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E FK +F + S +V L S+ RFALVP +L ++ + + L V
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVIFT 289
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
DRPY+ GE + + G + N +LL++YGFV + NP D
Sbjct: 290 TDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSD 328
>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
Length = 504
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 134/279 (48%), Gaps = 27/279 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ VP SLV++
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVPPSLVIS 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E FK +F + S +V L S+ RFALVP +L ++ + + L V
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFT 289
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
DRPY+ GE + + G + N +LL++YGFV + NP D
Sbjct: 290 TDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSD 328
>gi|224129218|ref|XP_002320530.1| predicted protein [Populus trichocarpa]
gi|222861303|gb|EEE98845.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 130/279 (46%), Gaps = 27/279 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ +GLPP K+ + +K E+ + A ++++ G+ VP SLV+
Sbjct: 71 ENAEALQKWLSDSGLPPQKMAI-QKVEVGERG-----LVALKNIRKGEMLLFVPPSLVIA 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L + + LA YL+ E K S W YI L RQ
Sbjct: 125 ADSEWSCPEAGEVLKKYSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL YL S + +ER + YN+L +F +YP+ P E F
Sbjct: 178 SLLYWTRAELDTYLEASQIRERAIERITNVTGTYNDLRL------RIFSKYPHLFPEEVF 231
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLV 312
E FK +F + S +V L S+ R ALVP +L +SS+ + L V
Sbjct: 232 NMETFKWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSSEVETFLDYDKSSKGVVFT 288
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
DRPY+ GE + + G + N +LL++YGFV + NP D
Sbjct: 289 TDRPYQPGEQVFISYGRKSNGELLLSYGFVPREGTNPSD 327
>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
Length = 504
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 84/279 (30%), Positives = 132/279 (47%), Gaps = 27/279 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L++W+ +GLPP K+ + ++ E+ + AS++L+ G+ V SLV+
Sbjct: 72 ENATSLQNWLSDSGLPPQKMAI-DRVDIGERG-----LVASQNLRKGEKLLFVSPSLVIC 125
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N E++ + + LA YL+ E K S W YI L RQ
Sbjct: 126 ADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYISALPRQ-------PY 178
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TEL YL S + +ER + Y +L + +F ++P P E F
Sbjct: 179 SLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRS------RIFSKHPQLFPKEVF 232
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E FK +F + S +V L S+ RFALVP +L ++ + + L V
Sbjct: 233 NDETFKWSFGILFSRLVRLP--SMDGRFALVPWA-DMLNHNCEVETFLDYDKSSKGVVFT 289
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
DRPY+ GE + + G + N +LL++YGFV + NP D
Sbjct: 290 TDRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSD 328
>gi|356547583|ref|XP_003542190.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 499
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 84/279 (30%), Positives = 131/279 (46%), Gaps = 27/279 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLPP K+ + E+ E+ + A ++++ G+ VP SLV+T
Sbjct: 67 ENSSALQRWLSESGLPPQKMGI-ERVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 120
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E + S W YI L RQ
Sbjct: 121 PDSEWSCPEAGEVLKRNSVPDWPLLATYLISEASLMESSRWSNYISALPRQ-------PY 173
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W++ EL YL S + +ER + YN+L +F +YP P E F
Sbjct: 174 SLLYWTQAELDRYLEASQIRERAIERINNVIGTYNDLRL------RIFSKYPDLFPDEVF 227
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E FK +F + S +V L S+ ALVP +L +S + L +
Sbjct: 228 NIESFKWSFGILFSRLVRLP--SMGGNVALVPWA-DMLNHSCDVETFLDYDKTSKGIVFT 284
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
DRPY+ GE + + G + N +LL++YGFV ++ NP D
Sbjct: 285 TDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGANPSD 323
>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
Length = 485
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 88/318 (27%), Positives = 149/318 (46%), Gaps = 33/318 (10%)
Query: 49 RKNRFSIRVSSSDTLV-----AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSH 103
R R R+++ L+ AGS E+ + E + +LK+W+ + GLPP K+ P
Sbjct: 26 RHRRCRCRLAAQAGLLDLLRGAGSTEIATDAEGE--ELKAWLIERGLPPPKLAAAATPGS 83
Query: 104 NEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALY 163
+ A++ + G++ S+P LV+T L + LL L + LAL+
Sbjct: 84 GRG------LVAAQPIGKGESLLSIPQQLVLTPAAALEQSCLRPLLEEQPLPAWSVLALW 137
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
L ++ G W PY+R L + G L WSE E+ +L GS ++ LE
Sbjct: 138 LAEQRAAGSAGGWWPYVRLLPERTG-------CVLEWSEEEVEWLCGSQLHSDALEIRAA 190
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
+ + E+ V A + + + AF + AF + S +V L L + A
Sbjct: 191 AEASWAEMQAVLAAAKAQGRAPAHG----AFGRAQLQWAFAVLLSRLVRL--AGLGDQEA 244
Query: 284 LVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
L+P LL + + L +A + AV L +R Y+AGE +++ G + + +LL++YGF
Sbjct: 245 LLPWA-DLLNHDCAAASFLDWSATEAAVVLRAERRYRAGEQLLISYGQKTSGELLLSYGF 303
Query: 342 VDE--DNPYD--RLVVEV 355
+ NP+D RL++E+
Sbjct: 304 CPDLGSNPHDGCRLLLEL 321
>gi|357469947|ref|XP_003605258.1| SET domain-containing protein [Medicago truncatula]
gi|355506313|gb|AES87455.1| SET domain-containing protein [Medicago truncatula]
Length = 494
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/279 (30%), Positives = 132/279 (47%), Gaps = 27/279 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ + +K E+ + A +++ G+ VP LV+T
Sbjct: 62 ENSSSLQKWLSQSGLPSQKMSI-DKVDVGERG-----LVALNNIRKGEKLLFVPPQLVIT 115
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E K S W YI L RQ
Sbjct: 116 PDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PY 168
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L WS+ EL YL S + +ER + YN+ M +F +YP P E F
Sbjct: 169 SLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVF 222
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV-- 312
E FK +F + S +V L S+ + ALVP ++ +S + + L + +V
Sbjct: 223 NIESFKWSFGILFSRMVRLP--SMDGKNALVPWA-DMMNHSCEVETFLDYDKSSKGIVFP 279
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
DRPY+ GE + + G + N +LL++YGFV ++ NP D
Sbjct: 280 TDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTNPSD 318
>gi|357462493|ref|XP_003601528.1| SET domain-containing protein [Medicago truncatula]
gi|355490576|gb|AES71779.1| SET domain-containing protein [Medicago truncatula]
gi|388500078|gb|AFK38105.1| unknown [Medicago truncatula]
Length = 497
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/279 (30%), Positives = 132/279 (47%), Gaps = 27/279 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ + +K E+ + A +++ G+ VP LV+T
Sbjct: 65 ENSSSLQKWLSQSGLPSQKMSI-DKVDVGERG-----LVALNNIRKGEKLLFVPPQLVIT 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E K S W YI L RQ
Sbjct: 119 PDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L WS+ EL YL S + +ER + YN+ M +F +YP P E F
Sbjct: 172 SLLYWSQAELDRYLEASQIRERAIERTNNVIGTYND------MRVRIFSKYPDFFPEEVF 225
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV-- 312
E FK +F + S +V L S+ + ALVP ++ +S + + L + +V
Sbjct: 226 NIESFKWSFGILFSRMVRLP--SMDGKNALVPWA-DMMNHSCEVETFLDYDKSSKGIVFP 282
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
DRPY+ GE + + G + N +LL++YGFV ++ NP D
Sbjct: 283 TDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTNPSD 321
>gi|302823067|ref|XP_002993188.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
gi|300138958|gb|EFJ05708.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
Length = 600
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 130/278 (46%), Gaps = 28/278 (10%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D+ W+ + G P +++ S +K A+ DLQAGDAA S+P + VT V
Sbjct: 49 DMTKWLQEQGFPQQPLLV---SSFEDKGLG---CCATRDLQAGDAALSIPENFTVTAVDV 102
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ I+ EL LAL+LMYE+++ + S W PY++ + L
Sbjct: 103 ANHPVISS--AAEGRDELVGLALWLMYEQERSQDSPWYPYVKVFPAS------TLSLLLW 154
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
E + L GS A++ ++ +++ ++ L DT+ + D P E FTF
Sbjct: 155 EQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFSA 205
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
FK AF V S V+L L FALVP G + SS+ + V+L VD+ YK
Sbjct: 206 FKTAFSVVLSRAVYLPSAEL---FALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRYK 262
Query: 319 AGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
G+ + + ++ LI YGF+DE + D + +EV
Sbjct: 263 KGDQVFASYAQNLTSADFLIRYGFLDESDENDCIEIEV 300
>gi|225447500|ref|XP_002267469.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296085051|emb|CBI28466.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 85/282 (30%), Positives = 135/282 (47%), Gaps = 29/282 (10%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ +GLPP K+ + E+ E+ + A ++++ G+ VP SLV+T
Sbjct: 65 ENAALLQKWLSDSGLPPQKMGI-ERVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L N + + LA YL+ E + S W YI L RQ
Sbjct: 119 ADSEWSCTEAGEVLKRNSVPDWPLLATYLIGEASFMQSSRWSNYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL YL S + +ER + YN+L +F ++P+ P E F
Sbjct: 172 SLLYWTRAELDKYLEASQIRERAIERINDVTGTYNDLRL------RIFSKHPHLFPEEVF 225
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
E FK +F + S +V L S+ + ALVP +L +S + + L D + Q VV
Sbjct: 226 NMETFKWSFGILFSRLVRLP--SMDEKIALVPWA-DMLNHSCEVETFL-DYDKSSQGVVF 281
Query: 314 --DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYDRL 351
DR Y+ E + + G + N +LL++YGFV + NP D++
Sbjct: 282 TTDRTYQPSEQVFISYGKKSNGELLLSYGFVPREGTNPNDKV 323
>gi|302764082|ref|XP_002965462.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
gi|300166276|gb|EFJ32882.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
Length = 481
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 134/279 (48%), Gaps = 30/279 (10%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D+ W+ + G P +++ S +K A+ DLQAGDAA S+P + VT V
Sbjct: 49 DMTKWLQEQGFPQQPLLVS---SFEDKGLG---CCATRDLQAGDAALSIPENFTVTAVDV 102
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ I+ EL LAL+LMYE+++ + S W PY++ + SPLL
Sbjct: 103 ANHPVISS--AAEGRDELVGLALWLMYEQERSQDSPWYPYLKVF-------PASTLSPLL 153
Query: 200 WSETELAYLT-GSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFE 257
W + E L GS A++ ++ +++ ++ L DT+ + D P E FTF
Sbjct: 154 WEQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTL---------KDNKDFPMEKFTFS 204
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
FK AF V S V+L L FALVP G + SS+ + V+L VD+ Y
Sbjct: 205 AFKAAFSVVLSRAVYLPSAEL---FALVPFGDLINHESSRSLLDYDIEEQKVKLAVDKRY 261
Query: 318 KAGESIVV-WCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
K G+ + + ++ LI YGF+DE + D + +EV
Sbjct: 262 KKGDQVFASYAQNLTSADFLIRYGFLDESDENDFIEIEV 300
>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
Length = 575
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 73/257 (28%), Positives = 125/257 (48%), Gaps = 32/257 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK-QGKK 173
A D+ GD +P +L +T + + + + + ++++E +A +L+YE+ +G++
Sbjct: 131 ARRDINDGDELLRIPMALCMT--KSAARKAVGKDVLPSEINEYLAMACHLIYERNVRGEE 188
Query: 174 SFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
S W PY+ ++D V W + +LA+L GSP A ++REY+
Sbjct: 189 SPWKPYLDVLPDIDE--------VNPTFTWPDEDLAFLNGSPVIAATKSLQMKLRREYDA 240
Query: 231 LDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
L + G L +YP P EAF F+ ++ AF + S + L+ + ALVP
Sbjct: 241 L-----LGGEDGLLAKYPDRFPAEAFNFKAWEWAFTMLFSRAIRLRSLKQGETLALVPYA 295
Query: 289 PPLLAYSSKCKAMLAAV----------DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 338
L+ +S +A + A D+ V L DR Y+ E I + GP+ N++LL+
Sbjct: 296 D-LINHSPFSQAYIDARQNGDWLFKSGDEEVILYADRGYRRMEQIYISYGPKSNAELLLL 354
Query: 339 YGFVDEDNPYDRLVVEV 355
YGF E NP++ + V V
Sbjct: 355 YGFAVERNPFNSVDVTV 371
>gi|255582876|ref|XP_002532210.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223528106|gb|EEF30179.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 508
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/280 (30%), Positives = 132/280 (47%), Gaps = 29/280 (10%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
++ L+ W+ NGLP K+ + +K E+ + A ++++ G+ VP SLV+T
Sbjct: 76 DNAAALQRWLSNNGLPDQKMAI-DKVEVGERG-----LVALKNIRKGEKLLFVPPSLVIT 129
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ E+L + + LA+YL+ E K S W YI L RQ
Sbjct: 130 ADSEWSCPEAGEVLKQYSVPDWPLLAIYLISEANLQKSSKWSNYISALPRQ-------PY 182
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL YL S + +ER + YN+L +F +YP P E F
Sbjct: 183 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVF 236
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
E FK +F + S +V L S+ + ALVP +L +S + + L D + Q VV
Sbjct: 237 NLETFKWSFGILFSRLVRLP--SMDGKVALVPWA-DMLNHSCEVETFL-DYDKSSQGVVF 292
Query: 314 --DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
DR Y+ GE + + G + N +LL++YGFV + NP D
Sbjct: 293 TTDRQYEPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 332
>gi|242066146|ref|XP_002454362.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
gi|241934193|gb|EES07338.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
Length = 499
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 97/360 (26%), Positives = 161/360 (44%), Gaps = 59/360 (16%)
Query: 9 SSKFISPPIRPPHH----PLSIASTISISVIRDPNFGSSLRLVRRKNRFSIRVSSSDTLV 64
S+ + PP+R P H P S +S+ S R + R IR S++
Sbjct: 4 STTTLHPPLRAPRHLRPLPHSYSSSFS----------------RTRGRAPIRASAASASA 47
Query: 65 AGSREVVS--------KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAAS 116
RE + + E L+ W+ +GLP ++ + ++ E+ + A
Sbjct: 48 PAQREAAAGVPWGCEIESLESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVAL 101
Query: 117 EDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFW 176
++++ G+ VP SLV+T + G + E++ N + + +A YL+ E S W
Sbjct: 102 KNIRKGEKLLFVPPSLVITADSEWGRPEVGEVMKRNSVPDWPLIATYLISEASLEGSSRW 161
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTV 234
YI L RQ S L W+ EL AYL SP + ++R + YN+L D +
Sbjct: 162 SSYIAALPRQ-------PYSLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRI 214
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
+ LF P E + E F +F + S +V L S+ + ALVP +L +
Sbjct: 215 FSRHSDLF-------PEEVYNIETFLWSFGILFSRLVRLP--SMDEKVALVPWA-DMLNH 264
Query: 295 SSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
S + + L D + Q +V DR Y+ GE + + G + + +LL++YGFV ++ NP D
Sbjct: 265 SPEVETFL-DFDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPND 323
>gi|168003103|ref|XP_001754252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694354|gb|EDQ80702.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 431
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 75/280 (26%), Positives = 132/280 (47%), Gaps = 28/280 (10%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ K GL K++L S + A++ L+ G+ VP+ L++T +
Sbjct: 16 LQDWLMKEGLAKQKLVLDRVDSGGRG------LVATQSLRQGERLLFVPSGLLITADSEW 69
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G ++ L E LA++L+ E + + S W PY L + S L W
Sbjct: 70 GCAETGRIIKEAGLPEWPMLAIFLISEASREESSRWFPYFATLPK-------TPSSILQW 122
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+E E+ +LT SP + + LE + Y +L ++F ++P P++ +T F
Sbjct: 123 TEEEVNTWLTASPVREKALECIRDVTETYRDL------RATIFLKHPEVFPSQVYTLAAF 176
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRP 316
K AF + S +V L V + ALVP +L +S + + L + +V V DR
Sbjct: 177 KWAFGILFSRLVRLPSVG---KLALVPWA-DMLNHSPQVDSFLDFDQNNAKSVVTVTDRA 232
Query: 317 YKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEV 355
Y++GE + + G + + +L + YGF+ E N +D + +E+
Sbjct: 233 YQSGEQVFISYGKRSSGELFLAYGFIPSELNVHDSVELEM 272
>gi|449453618|ref|XP_004144553.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
gi|449511789|ref|XP_004164054.1| PREDICTED: LOW QUALITY PROTEIN: ribulose-1,5 bisphosphate
carboxylase/oxygenase large subunit N-methyltransferase,
chloroplastic-like [Cucumis sativus]
Length = 497
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 85/280 (30%), Positives = 133/280 (47%), Gaps = 29/280 (10%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E+ L+ W+ ++GLP K+ ++ N R + A ++++ G+ VP SLV++
Sbjct: 65 ENASALQKWLSESGLPDQKMSIQRV---NVGERGL---VALKNVRKGEKLLFVPPSLVIS 118
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
E E+L N + + +A YL+ E K S W YI L RQ
Sbjct: 119 AESEWSCPEAGEVLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQ-------PY 171
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL YL S + +ER + YN+L +F ++P P E F
Sbjct: 172 SLLYWTREELDRYLEASEIRERAIERITNVVGTYNDLSI------RVFSKHPELFPEEVF 225
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
E FK +F + S +V L S+ + ALVP +L ++ + + L D A Q VV
Sbjct: 226 NIETFKWSFGILFSRLVRLP--SMDGKVALVPWA-DMLNHNCEVETFL-DYDKASQGVVF 281
Query: 314 --DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
DR Y+ GE + + G + N +LL++YGFV ++ NP D
Sbjct: 282 TTDRAYQPGEQVFISYGKKSNGELLLSYGFVPKEGSNPSD 321
>gi|326495906|dbj|BAJ90575.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 507
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 80/280 (28%), Positives = 135/280 (48%), Gaps = 29/280 (10%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ L EK E+ + A ++++ G+ VP +LV+T
Sbjct: 74 ESAASLERWLTASGLPEQRLAL-EKVDIGERG-----LVALKNVRNGEKLLFVPPTLVIT 127
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ N + +++ + + LA YL+ E S W YI L RQ
Sbjct: 128 ADSEWTNREVGDVMKRYSVPDWPLLATYLISEASLEGSSRWSSYIDALPRQ-------PY 180
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ TE+ AYL SP + + R + YN+L +F ++P P + +
Sbjct: 181 SLLYWTRTEIDAYLVASPIRERAISRISDVIGTYNDLRD------RIFSKHPDLFPEKVY 234
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
E F+ +F + S +V L+ S+ + ALVP +L +S + A L D + Q +V
Sbjct: 235 NMENFRWSFGILFSRLVRLE--SMGGKVALVPWA-DMLNHSPEVDAFL-DYDKSSQGIVF 290
Query: 314 --DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
DR Y+ GE + + G + + +LL++YGFV ++ NP D
Sbjct: 291 TTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPND 330
>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
Length = 467
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 75/265 (28%), Positives = 123/265 (46%), Gaps = 25/265 (9%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ + GLP KV LK + + + L GD +P +L +T E
Sbjct: 41 LQQWLSQAGLPIQKVELKNVGAGGRG------LVSKRMLYKGDRLLFLPATLAITTESEW 94
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+++ L E LA YL+ E GK S W PYI L R+ G S LLW
Sbjct: 95 ACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLW 147
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ ++ A+L+ + K L+ ++ +N+L+ FM + P E F E F
Sbjct: 148 TALDVEAHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLESF 201
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPY 317
K AF + S +V L SL ++ AL+P G +L + ++ L + ++ +DR Y
Sbjct: 202 KWAFGILFSRLVRLP--SLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGY 258
Query: 318 KAGESIVVWCGPQPNSKLLINYGFV 342
++ + + + G + N +LL+ YGFV
Sbjct: 259 ESNKEVFISYGKRSNGELLVAYGFV 283
>gi|226501968|ref|NP_001140387.1| uncharacterized protein LOC100272441 [Zea mays]
gi|194699272|gb|ACF83720.1| unknown [Zea mays]
gi|413923744|gb|AFW63676.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 76/279 (27%), Positives = 131/279 (46%), Gaps = 27/279 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGEKLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL AYL SP + ++R + YN+L +F ++P P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVY 231
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E F +F + S +V L S+ R ALVP +L +S + + L +
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
DR Y+ GE + + G + + +LL++YGFV ++ NP D
Sbjct: 289 TDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPND 327
>gi|195651313|gb|ACG45124.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 75/279 (26%), Positives = 130/279 (46%), Gaps = 27/279 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGENLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL AYL SP + ++R + YN+L +F ++P P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRD------RIFSRHPDLFPEEVY 231
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E F +F + S +V L S+ R LVP +L +S + + L +
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVVLVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
DR Y+ GE + + G + + +LL++YGFV ++ NP D
Sbjct: 289 TDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPND 327
>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 568
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 83/286 (29%), Positives = 131/286 (45%), Gaps = 26/286 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
+DL L+ W+ KNGL + E ++ + V A +D + G+ VP L+ T
Sbjct: 66 DDLEQLRVWLLKNGLDSKWLEGIEFAANLPEGSG---VVAKKDFKKGEPFLQVPRKLMFT 122
Query: 136 LERVLGNETIAELLTTNKL---SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
+ + N + +LL +K S CLAL+L+ EK SFW PYI+ L + G
Sbjct: 123 CQ-AMQNTPLGQLLKVDKFLAQSPSLCLALHLLVEK-HNHSSFWTPYIKTLPKSYG---- 176
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
+ L ++ EL L GSPT ++ + +Y + LFQ +
Sbjct: 177 ---TCLYFTLEELEGLRGSPTFTSAIKVIATVAIQYTYIH-------DLFQIRKDILHIN 226
Query: 253 AFTFEIFKQAFVAV---QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
AFT++ F A AV Q+ V +L+ +AL+P + D+
Sbjct: 227 AFTWDEFIWAMSAVGSRQNQVPQWGHNALSE-YALIPAWDMCNHDHGDLQTFWDVNSDST 285
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ R YK GE + ++ GP+PNS LL++ GFV E+N +D L + V
Sbjct: 286 ESHAMRAYKKGEQVYIFYGPRPNSDLLLHAGFVYENNRFDALAIRV 331
>gi|302821397|ref|XP_002992361.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
gi|300139777|gb|EFJ06511.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
Length = 463
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 76/293 (25%), Positives = 125/293 (42%), Gaps = 46/293 (15%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L SW+ G +LK P + A D++AG+ V ++T +R+
Sbjct: 39 ELVSWLKIRGEHDACSLLKTGPDKRG-------LFAVRDIKAGECILRVSRDTMMTADRL 91
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+LL++ +SE A LAL L++EK+ G+ S W PYI L R + S
Sbjct: 92 --PLEFQQLLSSG-VSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAF 143
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W + ELA + S E + R I+ E+NE+ + FQ+Y + + ++ F
Sbjct: 144 WRKEELAMIQESSLSYETMSRRAAIREEFNEMQPI-------FQRYEH-VFGGPVSYASF 195
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD------------ 307
K A+V C ++ + A+VP + + AML D
Sbjct: 196 KHAYVTATVCS-RAWRIDGLEKLAMVPFAD-FMNHDWSSNAMLTYDTDNGSTEVEEVKVY 253
Query: 308 ---------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
QL D+ Y AGE + + GP N+ L +++GF NP+D++
Sbjct: 254 SDCLDIALFCAQLFADKNYAAGEQVTISFGPLCNASLALDFGFTVPYNPWDKV 306
>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
Length = 467
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 73/265 (27%), Positives = 121/265 (45%), Gaps = 25/265 (9%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ + GLP KV LK + + + L GD +P +L +T E
Sbjct: 41 LQQWLSQAGLPIQKVELKNVGAGGRG------LVSKRMLYKGDRLLFLPATLAITTESEW 94
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+++ L E LA YL+ E GK S W PYI L R+ G S LLW
Sbjct: 95 ACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPYIAALPRRPG-------SILLW 147
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ ++ +L+ + K L+ ++ +N+L+ FM + P E F + F
Sbjct: 148 TALDVETHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNR------EEFPPEVFNLKSF 201
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPY 317
K AF + S +V L SL ++ AL+P G +L + ++ L + ++ +DR Y
Sbjct: 202 KWAFGILFSRLVRLP--SLGQKLALIPFG-DMLNHDTEVTTFLDFDSGSKSITCTLDRGY 258
Query: 318 KAGESIVVWCGPQPNSKLLINYGFV 342
++ + + G + N +LL+ YGFV
Sbjct: 259 ESNREVFISYGKRSNGELLVAYGFV 283
>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
Length = 648
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/260 (28%), Positives = 124/260 (47%), Gaps = 38/260 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALYLMYEK-KQ 170
A + GD +P L +T + R LG + + E ++E +A L++EK +
Sbjct: 210 ARRSINDGDELLKIPLDLCLTRKSARRELGKDALQE-----GINEYLAVACQLIHEKFVK 264
Query: 171 GKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
G+ SF+ Y+ E+D V W + +LA+L GSP A ++RE
Sbjct: 265 GEDSFYAAYMGVLPEVDE--------VNPTFTWPDEDLAFLEGSPVVAATRSLQMKLRRE 316
Query: 228 YNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
Y++L + G L ++P P E +TFE ++ AF + S + L+ + + R A+V
Sbjct: 317 YDDL-----LGGPDGLVAKFPLRFPAEHYTFENWEWAFTMLFSRAIRLRNLQVGERLAMV 371
Query: 286 PLGPPLLAYSSKCKAMLAA----------VDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
P L+ +S+ +A + A ++ V L DR Y+ E + + G + N++L
Sbjct: 372 PYAD-LINHSAFSQAFIDARESGDWLFKSGEEEVILYADRGYRQMEQVYISYGQKSNAEL 430
Query: 336 LINYGFVDEDNPYDRLVVEV 355
L+ YGF E NPY+ + V V
Sbjct: 431 LLLYGFALERNPYNSVDVTV 450
>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Galdieria sulphuraria]
Length = 490
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 84/319 (26%), Positives = 144/319 (45%), Gaps = 40/319 (12%)
Query: 56 RVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGL----------PPCKVILKEKPSHNE 105
R S + + +G V ++ W+ +NG+ P ++++ E+ + +E
Sbjct: 58 RSSDAFSFTSGDPAVQKGWSSEISAFYDWLKENGVYLSEKASWTHAPHRLVIAEE-TKDE 116
Query: 106 KHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLM 165
+ +S + G+ +P L+ T R L ET + + E + L L+
Sbjct: 117 GEYSGRGLLSSRSVNLGEKVLEIPEKLMFT--RKLALETFPTSIIASIEDEYVSIGLLLL 174
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL-LWSETELAYLTGSPTKAEILERAEGI 224
YEK +G SF+ PY+ L L +PL LWS +L L GSPT + + + +
Sbjct: 175 YEKAKGFDSFFKPYLDILP------TLDELNPLFLWSNKDLDLLQGSPTLSACEQLRDKL 228
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
REY ++ ++ Q P + ++ F+ F+ AF + S + ++R AL
Sbjct: 229 LREYT------YLGKNIIPQIP-NFASKPIDFKQFQWAFGILFSRAICFPS---SKRIAL 278
Query: 285 VPLGPPLLAYSSKCKAML--------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 336
VP LL +S C A + V +AV + VDR Y+ E + V GP+ N +LL
Sbjct: 279 VPYAD-LLNHSPFCSAFIDEEKIPFGNGVTEAV-VYVDRLYEPYEQVYVSYGPRSNQELL 336
Query: 337 INYGFVDEDNPYDRLVVEV 355
+ YGF E NP+D + + +
Sbjct: 337 LLYGFSLERNPFDCVEITI 355
>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
Length = 389
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/239 (26%), Positives = 118/239 (49%), Gaps = 22/239 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS + G+ V + L++T E++ E + +LL+ + +S A LAL+L+ +K+ + S
Sbjct: 5 ASRPIHTGECMLHVSHDLMITPEKL--PEEVTKLLSKD-VSAWAKLALFLLAHQKKKETS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L ++ S + W++ EL YL SP E ++R + ++ E+ +
Sbjct: 62 AWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENA 116
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
+ P+ + E FK A+ V S ++ + + ALVP +
Sbjct: 117 LLLC-------PHIFGSRVSALE-FKHAYATVCSRAWGIETI---KSLALVPF-VDFFNH 164
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
+ C+AML+ +D ++V DR Y G+ +V+ G N+ L +++GF NP+D++
Sbjct: 165 DANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFALPFNPHDQV 223
>gi|357137766|ref|XP_003570470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 389
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 71/249 (28%), Positives = 120/249 (48%), Gaps = 23/249 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +++ G+ VP SLV++ + N + +++ + + + LA YL+ E
Sbjct: 13 LVALTNVRNGEKLLFVPPSLVISADSEWSNREVGDVMKSYSVPDWPLLATYLISEASLEG 72
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
S W YI L RQ S L W+ TE+ AYL SP + + R + YN+L
Sbjct: 73 SSRWSSYIDALPRQ-------PYSLLYWTRTEIDAYLVASPIRERAISRIGDVIGTYNDL 125
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+F ++P P E + E F+ +F + S +V L S+ + ALVP +
Sbjct: 126 ------RDRIFSKHPELFPEEVYNMENFRWSFGILFSRLVRLP--SMDGKVALVPWA-DM 176
Query: 292 LAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINYGFVDED--N 346
L ++ + A L D + Q +V DR Y+ GE + + G + + +LL++YGFV ++ N
Sbjct: 177 LNHNPEVDAFL-DFDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTN 235
Query: 347 PYDRLVVEV 355
P D + V
Sbjct: 236 PNDSVEFSV 244
>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
Length = 434
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 69/254 (27%), Positives = 115/254 (45%), Gaps = 26/254 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK-QGKK 173
A + GD +P L +T R + + + + ++E +A L++EK G +
Sbjct: 50 ARRSINDGDELLKIPMDLCIT--RKSARKALGKDALQDGINEYLAIACQLIHEKYVLGDE 107
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S W Y+ L V W + +LA+L GSP A ++REY+ L
Sbjct: 108 SEWDAYMGVLPEVE-----EVNPTFTWKDEDLAFLDGSPVVAATRSLQMKLRREYDAL-- 160
Query: 234 VWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ G L ++P P E FT+E + AF + S + L+ + + R A+VP L
Sbjct: 161 ---LGGQDGLIAKFPDRFPAEHFTYENWVWAFTMLFSRAIRLRNLQVGERLAMVPYAD-L 216
Query: 292 LAYSSKCKAMLAAVD----------DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
+ +S+ A + A + + V L DR Y+ E + + G + N++LL+ YGF
Sbjct: 217 INHSAFSGAFIDARESGDWLFKNGEEEVILYADRGYRQMEQVYISYGQKSNAELLLLYGF 276
Query: 342 VDEDNPYDRLVVEV 355
E NPY+ + V V
Sbjct: 277 ALERNPYNSVDVTV 290
>gi|308807993|ref|XP_003081307.1| putative methyltransferase (ISS) [Ostreococcus tauri]
gi|116059769|emb|CAL55476.1| putative methyltransferase (ISS) [Ostreococcus tauri]
Length = 505
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 79/276 (28%), Positives = 123/276 (44%), Gaps = 40/276 (14%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
DL W+ NGL K+ L+ + + A+E+++ G+A V S ++T+ER
Sbjct: 66 DLTRWLASNGLRAQKMTLESNLAEG------RGLVATEEIKRGEALLGVDASCLITVERA 119
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ + +L E + LA +L + + G + YIR L R+ G S
Sbjct: 120 IAEAKLGP--RHAELQEWSVLATFLAQQAMALESGNAGTFGEYIRALPRRTG-------S 170
Query: 197 PLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
L W E E+ L GSP++ ER E + E+ + +P DI A
Sbjct: 171 VLDWPEDEVETLLKGSPSRLAAAERQESVNAAIAEIRS----------SFP-DITEGALR 219
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
+ AF + S ++ L ++ ALVP +L + C A + AV L DR
Sbjct: 220 W-----AFDILFSRLIRLD--AMGGELALVPWAD-MLNHKPGCAAFIDLNGSAVNLTTDR 271
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYD 349
Y AGE + G +P+S+LLI+YGF E +NP D
Sbjct: 272 AYAAGEQVWASYGQRPSSELLISYGFAPEVGENPDD 307
>gi|218191491|gb|EEC73918.1| hypothetical protein OsI_08761 [Oryza sativa Indica Group]
Length = 502
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/275 (28%), Positives = 129/275 (46%), Gaps = 29/275 (10%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T +
Sbjct: 75 LERWLTDSGLPEQRLGI-QRVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 128
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G + +L N + + +A YL+ E S W YI L RQ S L W
Sbjct: 129 GCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYW 181
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
+ EL AYL SP + ++R + YN+L D ++ LF P E + E
Sbjct: 182 TRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLET 234
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRP 316
F+ +F + S +V L S+ R ALVP +L +S + + L + DR
Sbjct: 235 FRWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDYDKSSGGIVFTTDRS 291
Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
Y+ GE + + G + + +LL++YGFV ++ NP D
Sbjct: 292 YQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPND 326
>gi|115448405|ref|NP_001047982.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|45735887|dbj|BAD12920.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|45736017|dbj|BAD13045.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|113537513|dbj|BAF09896.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|215737236|dbj|BAG96165.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623589|gb|EEE57721.1| hypothetical protein OsJ_08208 [Oryza sativa Japonica Group]
Length = 502
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/275 (28%), Positives = 129/275 (46%), Gaps = 29/275 (10%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T +
Sbjct: 75 LERWLTDSGLPEQRLGI-QRVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSEW 128
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G + +L N + + +A YL+ E S W YI L RQ S L W
Sbjct: 129 GCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQ-------PYSLLYW 181
Query: 201 SETEL-AYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPYDIPTEAFTFEI 258
+ EL AYL SP + ++R + YN+L D ++ LF P E + E
Sbjct: 182 TRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLF-------PEEVYNLET 234
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRP 316
F+ +F + S +V L S+ R ALVP +L +S + + L + DR
Sbjct: 235 FRWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDYDKSSGGIVFTTDRS 291
Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
Y+ GE + + G + + +LL++YGFV ++ NP D
Sbjct: 292 YQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPND 326
>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
Length = 389
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 64/239 (26%), Positives = 118/239 (49%), Gaps = 24/239 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS + G+ V + L++T E++ E + +LL+ + +S A LAL+L+ +K+ + S
Sbjct: 5 ASRPIHTGECMLHVSHDLMITPEKL--PEEVTKLLSKD-VSAWAKLALFLLAHQKKKETS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L ++ S + W++ EL YL SP E ++R + ++ E+ + V
Sbjct: 62 AWAPYISCLPPFG-----SMHSTIFWTQDELVYLKVSPVYRETVQRKDVVRMEFAAAENV 116
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L QQ + + T ++ V S ++ + + ALVP +
Sbjct: 117 CM----LMQQVKLFVCSRILT------DYITVCSRAWGIETI---KSLALVPF-VDFFNH 162
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
+ C+AML+ +D ++V DR Y G+ +V+ G N+ L +++GF NP+D++
Sbjct: 163 DANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFALPFNPHDQV 221
>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 524
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 74/276 (26%), Positives = 122/276 (44%), Gaps = 40/276 (14%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L W+ LP K+ L+ + + A+E+++ G+A VP + ++T+ER
Sbjct: 83 ELARWLEGRRLPGQKMALEVNLAEG------RGLVATEEIKRGEALLGVPRTTLITVERA 136
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEK---KQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ + +L E + LA +L + + G + YIR L R+ G S
Sbjct: 137 IAEAKLGP--KHAELQEWSVLATFLAQQALALESGTAGTFGEYIRALPRRTG-------S 187
Query: 197 PLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
L W E E+ L GSP++ ER + + +E+ + +P T
Sbjct: 188 VLDWPEDEVDKLLKGSPSRLAAAERQDSVNAAIDEIRSY----------FPE------IT 231
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
+ AF + S ++ L ++ ALVP +L + C A + DAV L DR
Sbjct: 232 VGALRWAFDILFSRLIRLD--AMGGELALVPWAD-MLNHKPGCAAFIDLNGDAVNLTTDR 288
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDE--DNPYD 349
Y GE + G +P+S+LLI+YGF E +NP D
Sbjct: 289 SYVKGEQVWASYGQRPSSELLISYGFAPEVGENPDD 324
>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
Length = 353
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 133/291 (45%), Gaps = 32/291 (10%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
+ D +L WM +NG K L + + E R + A++ Q GD S+P L++
Sbjct: 28 DNDYIELVRWMSRNGF---KGALLKPANFKETGRGL---MATKPFQIGDQVISIPEMLLI 81
Query: 135 TLERVLGNE--TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
T + VL + + T KLS + + YL+ E+ + K SFW YI+ L +
Sbjct: 82 TTQNVLSSYLGDFIKQQTRPKLSPMQVICTYLICERSRQKDSFWYNYIKVLPK------- 134
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
+ +P+ ++ E+ +L K ++ + E I Y EL ++ + S F +
Sbjct: 135 SYSNPVYFTNEEINWLP-RRIKRKVFDECEKINTAYRELKNLFSILESTFVSFK-----G 188
Query: 253 AFTFEIFKQAFVAVQSCVVHLQK-----VSLAR-RFALVPLGPPLLAYSS--KCKAMLAA 304
F + F+ A+ V + V++ + +S+ R +AL P LL +++ + KA
Sbjct: 189 IFEYSAFRWAWCTVNTRSVYMLQEQNPHLSIERDHYALAPF-LDLLNHTNTVEVKASYNP 247
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
V ++ K + + ++ GP N KL I YGFV N ++ VVE+
Sbjct: 248 VSKCYEIFTCTACKKYDQMFIYYGPHDNVKLFIEYGFVLPQNQHN--VVEL 296
>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
max]
Length = 463
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 116/247 (46%), Gaps = 19/247 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP + +T + +L L ++ +A LA ++ EKK G+ S
Sbjct: 65 ASKIIQTGDCILKVPYRVQITADNLLPE---IRSLIGEEVGNIAKLATVILIEKKLGQGS 121
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L +Q G+L + + W+E+EL + S E +++ I++++ + +
Sbjct: 122 EWYPYISCLPQQ---GEL--HNTVFWTESELEMIRPSSVYQETIDQKSQIEKDFLAIKHI 176
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
+ + F Y A T +F V + V + AL+P L +
Sbjct: 177 FECSHQSFGDSTYKDFMHACTLVLFDHFNVELP---VGSRAWGSTNGLALIPFAD-FLNH 232
Query: 295 SSKCKAMLAAVDD-------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+A++ + DD ++Q++ DR Y GE +++ G N+ L++++GF N
Sbjct: 233 DGVSEAIVMSDDDKQCSEVQSLQIIADRDYAPGEQVLIRYGKFSNATLMLDFGFTIPYNI 292
Query: 348 YDRLVVE 354
YD++ ++
Sbjct: 293 YDQVQIQ 299
>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 458
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 60/237 (25%), Positives = 110/237 (46%), Gaps = 22/237 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLP 178
++ G+ V L++T R+ + E + ++E + LAL+ + K GK S W P
Sbjct: 70 IKRGEQVLRVSRELMITPNRL---PSCVEESLSEDVNEWSRLALFQLLHKHAGKASPWEP 126
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YIR L RG +++ + W + EL L S + R I +++ +
Sbjct: 127 YIRCLPPLRG-----LQNTVFWRDEELELLRQSNVYDQTEHRKTLISNQFD-------LV 174
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
++ +YP ++ E T E FK A+ S ++ + +VP + + S
Sbjct: 175 QAVVNKYP-ELFGETVTLESFKHAYCVASSRSWGVEALG---SITMVPF-VDMFNHDSSA 229
Query: 299 KAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+A+LA ++ ++V D+ Y G +V+ G PNS L +++GF DNP+D + +
Sbjct: 230 RALLAYYEEEGYAEVVADKDYNQGSQVVITYGTLPNSSLALDFGFTLPDNPHDEVQI 286
>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 395
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 65/241 (26%), Positives = 113/241 (46%), Gaps = 24/241 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++ G+ V L++T ++ + ELL T ++E A LAL+++ E+ G+ S
Sbjct: 5 AARPIEVGEQVLRVSGDLMITPNKL--PTEVKELLPTG-VTEWARLALFILVEQHLGQAS 61
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L A+ S + W + EL + + E ++R I E+ + V
Sbjct: 62 QWAPYINCLPTCG-----ALHSTVFWKKEELELVRFTSLHRETMQRRAVIGSEFASVLPV 116
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
Q+ P+ I E FKQA+ +S + S R VP +
Sbjct: 117 -------LQKCPH-IFGERVLHSKFKQAYATGKSL-----RRSSNTRILTVPF-VDFFNH 162
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
S C+A+L+ ++ +++ D+ Y GE +V+ G PN+ L +++GF NPYD++
Sbjct: 163 DSNCRALLSYDEERACAEVIADKNYARGEQVVISYGRLPNTTLALDFGFTISCNPYDQVE 222
Query: 353 V 353
V
Sbjct: 223 V 223
>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 629
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 81/288 (28%), Positives = 121/288 (42%), Gaps = 43/288 (14%)
Query: 66 GSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
+E + K E +L SW+ N + LK +HN + + +QA +
Sbjct: 142 ADKETLKKSE----NLLSWVQANKGEFSSIKLKYLSTHNRS------IVSKRIIQADETV 191
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSEL-----ACLALYLMYEKKQGKKSFWLPYI 180
S+P V+TL+ V + ++LT K ++L A AL+L+ E+K+ S + YI
Sbjct: 192 ISIPQEQVITLD-VASSSDFCKILT-EKNTQLVQQKHAYFALFLLQEQKKKDASHYKAYI 249
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L P L+SE EL YL G+ + E+ E IK +Y + V
Sbjct: 250 DSLPTDLSSF------PALFSEEELQYLEGTAALKLVQEQKEDIKTDYESISQV------ 297
Query: 241 LFQQYPYDIP--TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC 298
IP F+FE F+ AF+ S V + KV + +VPL L S
Sbjct: 298 --------IPEFKSEFSFEQFRWAFLCSHSRVFGI-KVKGVKTSVMVPLADMLNHKHSGQ 348
Query: 299 KAMLAAVDDAVQLVVDRPYKA---GESIVVWCGPQPNSKLLINYGFVD 343
+ DDA + K + I G + NSKL +NYGFVD
Sbjct: 349 EDSEWVFDDATNCFTVKALKKIQRNQQIHFSYGSKCNSKLFLNYGFVD 396
>gi|302755392|ref|XP_002961120.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
gi|300172059|gb|EFJ38659.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
Length = 371
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/242 (27%), Positives = 109/242 (45%), Gaps = 39/242 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ + AG +P ++T E V ++ LL+T+ L+L+L+ EK + ++S
Sbjct: 10 ATRRVPAGSRFLEIPRIAIITPENVPSQ--VSHLLSTSNPKTR--LSLFLLSEKHKAQES 65
Query: 175 FWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
W PY+R L QL +ES + W + ELA+L SPT E +E + IK E++ L+
Sbjct: 66 QWAPYLRCL------PQLGDIESTMFWKDEELAWLKHSPTYRETMECLKIIKSEFHVLEA 119
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
F + D+ E + F A+ Q +P
Sbjct: 120 NVF-------PWCRDVLGEV-SLTDFMHAYSTDQ-----------------IPFA-DFFN 153
Query: 294 YSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
+ C+ L+ D V D+ YKAG+ I + G PNS L ++YGF NP++++
Sbjct: 154 HDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILAVDYGFAVASNPHEQV 213
Query: 352 VV 353
V
Sbjct: 214 EV 215
>gi|413923745|gb|AFW63677.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
gi|413923746|gb|AFW63678.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
Length = 306
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/252 (26%), Positives = 113/252 (44%), Gaps = 25/252 (9%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L+ W+ +GLP ++ + ++ E+ + A ++++ G+ VP SLV+T
Sbjct: 71 ESAASLERWLIDSGLPEQRLAI-QRVDIGERG-----LVALKNIRKGEKLLFVPPSLVIT 124
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ G + +++ N + + +A YL+ E S W+ YI L RQ
Sbjct: 125 ADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQ-------PY 177
Query: 196 SPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
S L W+ EL AYL SP + ++R + YN+L +F ++P P E +
Sbjct: 178 SLLYWTRAELDAYLVASPIRKRAIQRITDVIGTYNDL------RDRIFSRHPDLFPEEVY 231
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
E F +F + S +V L S+ R ALVP +L +S + + L +
Sbjct: 232 NIETFLWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSPEVETFLDFDKSSRGIVFT 288
Query: 313 VDRPYKAGESIV 324
DR Y+ G I+
Sbjct: 289 TDRSYQPGIYIL 300
>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
latipes]
Length = 606
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 70/301 (23%), Positives = 136/301 (45%), Gaps = 35/301 (11%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T+ GSRE + DL SW +NG + + R + D++A
Sbjct: 69 TVFEGSRE------DSFADLMSWAQENGASCDGFTITNFGTEGYGLR------TTRDIKA 116
Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLP 178
+ VP +++T+E N + + + +++ + LAL+L+ E+ SFW P
Sbjct: 117 EELFLWVPRKMLMTVESA-QNSVLGPIYSQDRILQAMGNVTLALHLLCERGD-PASFWSP 174
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YIR L ++ ++PL + + ++ L G+ ++L + + R+Y +F
Sbjct: 175 YIRSLPQE-------YDTPLYYQQEDVQLLLGTQAVQDVLNQYKNTARQY-----AYFY- 221
Query: 239 GSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAY 294
L Q +P +P + F+F+ ++ A +V + + V +R AL+PL
Sbjct: 222 -KLVQTHPAASKLPLKDGFSFDDYRWAVSSVMTRQNQIPTVDGSRVTLALIPLWDMCNHT 280
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
+ DD + V + YK E I ++ G + N++ +I+ GF +DN +DR+ ++
Sbjct: 281 NGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFFQDNAHDRVKIK 340
Query: 355 V 355
+
Sbjct: 341 L 341
>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 463
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 115/242 (47%), Gaps = 22/242 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP + +T + + + +L +N++ + LA L+ EKK G+KS
Sbjct: 75 ASKVIYAGDCMLKVPFNAQITPDELPSD---IRVLLSNEVGNIGMLAAVLIREKKMGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W+PYI L + + S + W E EL+ + S E +++ I+++++
Sbjct: 132 RWVPYISRLPQ-----PAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS----- 181
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
F+A + Q P I TE E F A+ V S + ++R +L+P +
Sbjct: 182 -FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENSKRISLIPFADFMNHD 233
Query: 295 SSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+L D+ + ++ DR Y G+ + + G N+ L++++GF N +D + +
Sbjct: 234 GLSASIVLRDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFPYNIHDEVQI 293
Query: 354 EV 355
++
Sbjct: 294 QM 295
>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 582
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 136/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ +L W +NG L E P + A+ +++A + VP
Sbjct: 73 GKREDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 127 LLMTVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL ++E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 183 -----YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FTF+ ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + +K+GE I ++ G + N++ +I+ GF E+N +DR+ +++
Sbjct: 291 EDDRCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLHDRVKIKL 341
>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
tropicalis]
gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 581
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 136/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ +L W +NG L E P + A+ +++A + VP
Sbjct: 72 GKREDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRK 125
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 126 LLMTVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE-- 181
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL ++E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 182 -----YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNA 229
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FTF+ ++ A +V + + +R AL+PL +
Sbjct: 230 NKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 289
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + +K+GE I ++ G + N++ +I+ GF E+N +DR+ +++
Sbjct: 290 EDDRCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLHDRVKIKL 340
>gi|308802083|ref|XP_003078355.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
gi|116056807|emb|CAL53096.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
Length = 520
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 77/299 (25%), Positives = 129/299 (43%), Gaps = 41/299 (13%)
Query: 76 EDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
ED +L +W+ + G+ + KE + V D +AG A VP S V
Sbjct: 48 EDARELAAWLSYDKGVDASALAFKEDAKGGVR------VILKADAEAGATALRVPQSAAV 101
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T V + ++EL + EL LAL+L E+ +G S W PY++ L +
Sbjct: 102 TSVDVGEHPIVSELASGR--PELIGLALWLCAERIKGGASEWAPYVKTL-------RANP 152
Query: 195 ESPLLWSET-ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
++PL W++ + A L GSP A+ +ER++ + EY + V + P P EA
Sbjct: 153 DAPLFWTDAKDFALLKGSPVAADAIERSKSARTEYASITEV-------IKSDPSSYPPEA 205
Query: 254 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL----------GPPLLAYSS-----KC 298
+ F + A+ + + A+ +ALVPL P +L S+ +C
Sbjct: 206 YEFLTEARFVDALATVCAKATWLPTAQCYALVPLLDVISIGGAPVPGVLPPSASDGVVRC 265
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGP--QPNSKLLINYGFVDEDNPYDRLVVEV 355
VD A ++ A S V+ + N +L +N G+VD+ +P D + ++
Sbjct: 266 GPADYDVDTASVVLRCATKAAANSEVIQLDALQRNNGELFLNTGYVDQKHPGDYIYMKT 324
>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 116/242 (47%), Gaps = 22/242 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP ++ +T + + + ++ T+++ + LA L+ EKK+G+KS
Sbjct: 75 ASKVIHAGDCMLKVPFNVQITPDELSPDIRVS---LTDEVGNIGKLAAVLIREKKKGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W+PYI L + + S + W E E + + S E +++ I++E++
Sbjct: 132 RWVPYISRLPQ-----PAEMHSTIFWGEDEFSMIRCSAVHKETVKQKAQIEKEFS----- 181
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
F+A + Q YP I E E F A+ V S + ++ +L+P +
Sbjct: 182 -FVAQAFKQHYPMVI--ERPYLEDFMYAYALVGS-----RAWETSKGISLIPFADFMNHD 233
Query: 295 SSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+L+ D+ + ++ DR Y G+ + + G N+ L++++GF N +D + +
Sbjct: 234 GLSASIVLSDEDNQLSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTVPYNIHDEVQI 293
Query: 354 EV 355
++
Sbjct: 294 QM 295
>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
Length = 805
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/291 (23%), Positives = 135/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E E+ + A+ D++A + VP
Sbjct: 252 GKREDYFPDLMKWASENG---ASVEGFEMVDFKEEGFGLR---ATRDIKAEELFLWVPRK 305
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 306 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 361
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 362 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 409
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 410 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 469
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 470 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 520
>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
Length = 595
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 70/291 (24%), Positives = 134/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASDNG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYAQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-- 246
++PL + E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPQA 230
Query: 247 YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+ +P E+FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 HKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|303277863|ref|XP_003058225.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460882|gb|EEH58176.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 612
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 78/263 (29%), Positives = 116/263 (44%), Gaps = 33/263 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI--AELLTTNKLSELACLALYLMYEKKQ 170
AAS DL AG A ++P+S ++T L + T T L E + L+L+YEK
Sbjct: 195 AAASTDLPAGADALTIPSSALLTSRVALEDPTARGDAYRTFAGLGEDTLMTLWLVYEKYA 254
Query: 171 -GKKSFWLPYIREL---------DRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILE 219
G +S W P + L + G L + +P W +E A L G+P + ++
Sbjct: 255 LGDRSPWAPLLASLPMDDGGGDDGDRTAAGALGL-TPASWPAEVTDALLRGAPLLDDAVK 313
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
E R++ L F A L + +P PTE +T F+ A A + + +Q ++
Sbjct: 314 ARETTARQHAAL----FPA--LGEHFPEVFPTELYTLRRFRIASEAWNAYGMTVQAETVG 367
Query: 280 RRFALVPLGPPLLAYSSKCKAMLA-------AV------DDAVQLVVDRPYKAGESIVVW 326
PP A+L AV DDA+ L + R +AGE I V
Sbjct: 368 GASGGGEHHPPAPTTCLPPIALLCNHATWPHAVRYSRLRDDALHLPIARGVRAGEEIFVS 427
Query: 327 CGPQPNSKLLINYGFVDEDNPYD 349
G + N++LL+ YGF DNPYD
Sbjct: 428 YGAKSNAELLLFYGFGVRDNPYD 450
>gi|302768639|ref|XP_002967739.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
gi|300164477|gb|EFJ31086.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
Length = 421
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 67/272 (24%), Positives = 110/272 (40%), Gaps = 46/272 (16%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L SW+ G +LK P + A D++AG+ V ++T +R+
Sbjct: 39 ELVSWLKIRGEHDACSLLKTGPDK-------RGLFAVRDIKAGECILRVSRDTMMTADRL 91
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ L ++ +SE A LAL L++EK+ G+ S W PYI L R + S
Sbjct: 92 ---PLEFQQLLSSGVSEWAQLALLLLFEKRAGEASIWAPYISCLPRWG-----TIHSTAF 143
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W + EL + S E + R I+ E+NE+ +V F +D + A
Sbjct: 144 WRKEELTMIQESSLSYETMSRRAAIREEFNEMQSVPFA-----DFMNHDWSSNAMLTYDT 198
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
V+ V+ + +A A QL D+ Y A
Sbjct: 199 DNGSTEVEEVKVYSDCLYIALFCA--------------------------QLFADKNYAA 232
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
GE + + GP N+ L +++GF NP+D++
Sbjct: 233 GEQVTISFGPLCNASLALDFGFTVPYNPWDKV 264
>gi|302766942|ref|XP_002966891.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
gi|300164882|gb|EFJ31490.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
Length = 374
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/247 (28%), Positives = 110/247 (44%), Gaps = 46/247 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ + AG +P ++T E V ++ LL+T+ L+L+L+ EK + ++S
Sbjct: 10 ATRRVPAGSRFLEIPRIAIITPENVPSQ--VSHLLSTSNPKTR--LSLFLLSEKHKAQES 65
Query: 175 FWLPYIRELDRQRGRGQLA-VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
W PY+R L QL +ES + W ELA+L SPT E +E + IK E++ L
Sbjct: 66 QWAPYLRCL------PQLGDIESTMFWKAEELAWLKHSPTYRETMECLKIIKSEFHLLT- 118
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL-----ARRFALVPLG 288
+A Q +P+ C L +VSL A +P
Sbjct: 119 ---LANK--QVFPW---------------------CRDALGEVSLTDFMHAYSTDQIPFA 152
Query: 289 PPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
+ C+ L+ D V D+ YKAG+ I + G PNS L ++YGF N
Sbjct: 153 -DFFNHDHNCQTRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILAVDYGFAVASN 211
Query: 347 PYDRLVV 353
P++++ V
Sbjct: 212 PHEQVEV 218
>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
Length = 488
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 133/288 (46%), Gaps = 25/288 (8%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 81 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 134
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 135 LMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE--- 189
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
++PL + E E+ YL + ++ + + R+Y +F + + +
Sbjct: 190 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFYKVIQTHPHAHKL 240
Query: 250 P-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDD 307
P ++FT+E ++ A +V + + +R AL+PL S DD
Sbjct: 241 PLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNLEDD 300
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 301 RCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 348
>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 507
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 135/288 (46%), Gaps = 29/288 (10%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
E+ +L W +NG L E P + A+ +++A + VP L++
Sbjct: 1 EDYFPELMEWCKENGASTDGFELVEFPEEG------FGLKATREIKAEELFLWVPRKLLM 54
Query: 135 TLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
T+E G+ + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 55 TVESAKGS-VLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWLPYIKTLPNE----- 107
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY--DI 249
++PL ++E E+ YL + ++ + + R+Y +F + Q +P +
Sbjct: 108 --YDTPLYFNEDEVQYLQSTQAILDVFSQYKNTARQY-----AYFY--KVIQTHPNANKL 158
Query: 250 P-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDD 307
P ++FTF+ ++ A +V + + +R AL+PL +S DD
Sbjct: 159 PLKDSFTFDDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNSLITTGYNLEDD 218
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ V + +K+GE I ++ G + N++ +I+ GF E+N +DR+ +++
Sbjct: 219 RCECVALQDFKSGEQIYIFYGTRSNAEFVIHNGFFFENNLHDRVKIKL 266
>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
Length = 594
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/291 (23%), Positives = 136/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL S
Sbjct: 231 HKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
Length = 601
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 81 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 134
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 135 LMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE--- 189
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 190 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAH 238
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL S
Sbjct: 239 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNLE 298
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 299 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 348
>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
setd3 [Ovis aries]
Length = 596
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 81 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 134
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 135 LMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE--- 189
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 190 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--RVIQTHPHAH 238
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL S
Sbjct: 239 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTSGLITTGYNLE 298
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 299 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 348
>gi|145344456|ref|XP_001416748.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576974|gb|ABO95041.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 515
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 125/297 (42%), Gaps = 46/297 (15%)
Query: 76 EDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
ED +L +W+ + G+ ++ KE R VA D+ AG +VP V
Sbjct: 47 EDARELAAWLSYDKGVDASGLVFKEG------ARGEVEVALRGDVDAGARVLAVPQDCAV 100
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T V + ++ L EL LAL+L E+ +G S W PY++ L
Sbjct: 101 TSVDVDAHPIVSGL--AKGRPELVGLALWLCAERIKGGASDWAPYVKTLAAN-------P 151
Query: 195 ESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
++PL W+E E A L GSP + +ER+ + EY + V + P P EA
Sbjct: 152 DAPLFWTEAEDFALLKGSPIVNDAVERSRSAREEYAAIVEV-------IKGDPTAFPAEA 204
Query: 254 FTF---EIFKQAFVAVQSCVVHLQKVSLARRFALVPL-------GPPLLAY---SSKCKA 300
+ F E F A V + L S +ALVPL G P+ S+K
Sbjct: 205 YEFFTEERFVDALATVCAKATWLPTASC---YALVPLLDVITIAGSPVPGVSPPSAKDGI 261
Query: 301 MLAAVD---DAVQLVVDRPYKA-GESIVVWCGP--QPNSKLLINYGFVDEDNPYDRL 351
A D D+ +V+ KA S VV P + N +L +N G VD+ +P D L
Sbjct: 262 ARCAADYDVDSACVVLSAVVKAPANSRVVQLDPLQRNNGELFLNTGRVDQKHPGDYL 318
>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
ferrumequinum]
Length = 594
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 69/291 (23%), Positives = 136/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E S E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVSFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFGEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 291 EDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
Length = 497
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 135/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 73 KREDYFPDLXKWASENG---ASVEGFEXVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 126
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+ T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LXTVESA-KNSVLGPLYSQDRILQAXGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 181
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 182 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 230
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 KLPLKDSFTYEDYRWAVSSVXTRQNQIPTEDGSRVTLALIPLWDXCNHTNGLITTGYNLE 290
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 291 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 340
>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
Length = 492
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
caballus]
Length = 609
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 136/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 87 GKREDYFPDLMKWASENG---ASVDGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 140
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 141 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 196
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 197 -----YDTPLYFEEDEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--RVIQTHPHA 244
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 245 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTTGLITTGYNL 304
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 305 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 355
>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
Length = 516
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 91/215 (42%), Gaps = 26/215 (12%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
+ +E +AL L+ E+ +G +SFW YI L G + W ELAYL G
Sbjct: 162 NDDTNEYIAIALLLILERSKGSRSFWSEYIAILPTNEDVG-----ATFTWPAEELAYLEG 216
Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
SP + ++ E+ A L D E FTFE ++ AF + S
Sbjct: 217 SPAASATASMMAKLRAEH---------AAVLEGNSALD--PEIFTFEAWQWAFTNLFSRA 265
Query: 271 VHLQKVSLARRFALVPL-----GPPLLAYSSKCKAMLAAV-----DDAVQLVVDRPYKAG 320
+ L+ A+VP P + + + A +D V L DR YK
Sbjct: 266 IRLKASRAGELLAMVPYVDFINHSPFSSSYVDAREVPKAFPWEEKEDEVVLFADRAYKKF 325
Query: 321 ESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
E + + GP+ N+ LL+ YGF + NP++ + + V
Sbjct: 326 EQVFISYGPKSNADLLLLYGFALDRNPFNSVDLAV 360
>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
Length = 594
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
Length = 595
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|62642307|gb|AAX92711.1| SET domain-containing protein [Picea abies]
Length = 106
Score = 71.2 bits (173), Expect = 7e-10, Method: Composition-based stats.
Identities = 37/75 (49%), Positives = 52/75 (69%), Gaps = 7/75 (9%)
Query: 42 SSLRLVRRKNRFSIRVSSSDTLVAGS------REVVSKKEEDLGDLKSWMHKNGLPPCKV 95
S +RL R F + V S+DTL A S ++ + KEE++ DLKSWMH++GLPPC+V
Sbjct: 32 SRVRLPGRCVGFPMVVYSADTLTASSQHGEDKKDAIRGKEEEV-DLKSWMHRHGLPPCRV 90
Query: 96 ILKEKPSHNEKHRPI 110
+LKE+PS + KH+PI
Sbjct: 91 MLKERPSPDGKHKPI 105
>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
troglodytes]
gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
troglodytes]
gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
paniscus]
gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
paniscus]
gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
Length = 594
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
gorilla gorilla]
Length = 594
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
Length = 594
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
Length = 595
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 59/248 (23%), Positives = 118/248 (47%), Gaps = 19/248 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A++D++A + +P +++T+E N + L + +++ + LAL+L+ E+
Sbjct: 108 LKATKDIKAEELFLWIPRKMLMTVESA-KNSVLGPLYSQDRILQAMGNVTLALHLLCER- 165
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
S WLPYI+ L + ++PL + E E+ +L + ++L + + R+Y
Sbjct: 166 ANPSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+F +P +AFTF+ ++ A +V + + +R AL+PL
Sbjct: 218 ----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPL 273
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ DD + V + YK GE I ++ G + N++ +I+ GF EDN
Sbjct: 274 WDMCNHTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNA 333
Query: 348 YDRLVVEV 355
+DR+ +++
Sbjct: 334 HDRVKIKL 341
>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Callithrix jacchus]
Length = 610
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 89 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 142
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 143 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 197
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 198 ----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 246
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 247 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 306
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 307 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 356
>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
Length = 595
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 136/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Pongo abelii]
gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Pongo abelii]
Length = 595
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Nomascus leucogenys]
gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Nomascus leucogenys]
Length = 595
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
Length = 597
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 76 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 129
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 130 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 184
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 185 ----YDTPLYFEEEEVRYLQSTQAVHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 233
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 234 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 293
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 294 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 343
>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
Length = 597
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 76 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 129
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 130 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 184
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 185 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 233
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 234 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 293
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 294 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 343
>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 59/248 (23%), Positives = 118/248 (47%), Gaps = 19/248 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKK 169
+ A++D++A + +P +++T+E N + L + +++ + LAL+L+ E+
Sbjct: 108 LKATKDIKAEELFLWIPRKMLMTVESA-KNSVLGPLYSQDRILQAMGNVTLALHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
S WLPYI+ L + ++PL + E E+ +L + ++L + + R+Y
Sbjct: 167 N-PSSPWLPYIKTLPSE-------YDTPLYFEEEEVRHLLATQAIQDVLSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPYDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+F +P +AFTF+ ++ A +V + + +R AL+PL
Sbjct: 218 ----AYFYKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPL 273
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ DD + V + YK GE I ++ G + N++ +I+ GF EDN
Sbjct: 274 WDMCNHTNGLITTGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFEDNA 333
Query: 348 YDRLVVEV 355
+DR+ +++
Sbjct: 334 HDRVKIKL 341
>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
novemcinctus]
Length = 589
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 136/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSMLGPLYSQDRILQAMGNITLAFHLLCERA-NPNSFWQPYIQSLPGE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLHSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 588
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 69/291 (23%), Positives = 135/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
Length = 589
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 69/291 (23%), Positives = 135/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|168046556|ref|XP_001775739.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672891|gb|EDQ59422.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 524
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 82/310 (26%), Positives = 125/310 (40%), Gaps = 24/310 (7%)
Query: 52 RFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIH 111
RF R S+ VS + L++W+ K C + L+ P
Sbjct: 49 RFGCRWVQSNGSTHTKESNVSISNTKVERLRNWLKKLNHDDCNLKLERCPQGGSGS-GYG 107
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
A + G VP ++T E + + L+ + L+ + L+L+YE+ +G
Sbjct: 108 AFAGPGGVGNGSTIVKVPRKALMTEETARLCQDVGPLVKKSDLTPWQAMCLHLLYERARG 167
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSE-TELAYLTGSPTKAEILERAEGIKREYNE 230
+ SFW PYI L ++ +L P+LWS+ +L GSP ++ ER I RE E
Sbjct: 168 ETSFWYPYIAVLPKEL---ELIGIHPMLWSQKMRREWLEGSPM-LDVTERRLAICREDYE 223
Query: 231 LDTVWFMAGSLF----QQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL------AR 280
+ AG L + P I A + + +S ++LQ L
Sbjct: 224 A-MLLAGAGRLTPRGNEGEPISITETAVQ---WAATMLLSRSFSLNLQTQKLRPGSFAED 279
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQPN-SKLL 336
ALVP L SS + D L R Y GE + GP + S+LL
Sbjct: 280 TIALVPWADMLNHSSSAGRESCLVYDQKSGVATLQAHRTYSEGEQVFDSYGPSCSPSRLL 339
Query: 337 INYGFVDEDN 346
++YGFVDE+N
Sbjct: 340 LDYGFVDEEN 349
>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
Length = 595
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 135/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KRENYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
Length = 447
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 75/280 (26%), Positives = 117/280 (41%), Gaps = 32/280 (11%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPS-HNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
EE+ L W +NG+ V K +P+ + R + A E + + + SVP L+
Sbjct: 47 EENYISLLKWAKRNGM----VFKKIRPAIFSSTGRGM---LAIERIHSSECVISVPERLL 99
Query: 134 VT----LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+T LE +GN + K S L L+LMYEK K SFW PYIR L
Sbjct: 100 ITASSVLESAIGNYVAERMKGGAKSSNDYLLVLFLMYEKYLEKGSFWAPYIRTLPD---- 155
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
+P ++ EL +L + + E+ IK+ Y + + Q + +
Sbjct: 156 ---TFNTPCYFTRKEL-FLLPEQCREQAFEQVTQIKQSYKSFAKAY---NDVLQDFDCNF 208
Query: 250 PTEAFTFEIFKQAFVAVQS-CVVHLQKVSLAR----RFALVPLGPPLLAYSSKCK--AML 302
FE FK A+ V + V H + A+ AL PL LL + K +
Sbjct: 209 -WRTVDFESFKWAWCVVNTRSVYHDEPNRRAQPIDGNCALAPL-LDLLNHCDKAEMCGRF 266
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+ ++ V Y+ G + + GP N++L + YGFV
Sbjct: 267 NSSSKNYEINVITEYQKGTQVFINYGPHDNTRLFLEYGFV 306
>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
Length = 428
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 68/282 (24%), Positives = 122/282 (43%), Gaps = 28/282 (9%)
Query: 81 LKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
L +WM NG K+ L++ P V A E L G+ +P SL+++
Sbjct: 31 LLTWMEANGFRLHSKLGLRDFPDTGRG------VVALEKLVGGETFLKLPTSLLISTRTA 84
Query: 140 LGNETIAELLTTN---KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
L +++ T KL+ + L L+++ +K G+ S W P++ L R +
Sbjct: 85 L--QSLLHSFITRYHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTT 135
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
P+ T L + E+ R I+R + +L + + G + ++ + FT+
Sbjct: 136 PVFLRRTVFESLP-KDLREEVHTRITSIQRTFLKLKVL--LGGHVEEEPEVQSLSTGFTW 192
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
F A+ AV + + Q + + + AL P L + K A V + ++
Sbjct: 193 NNFVWAWTAVNTRCIFAQGSNSSSLWENDHCALAPF-LDCLNHHWKASIETAMVGENFEI 251
Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+ + + A E + + GP N +L ++YGFV DNP D +VV
Sbjct: 252 LSHKSHDANEQVFISYGPHSNRRLFLDYGFVLPDNPNDVVVV 293
>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
porcellus]
Length = 789
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 136/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERAN-PNSFWLPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVQCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
garnettii]
Length = 595
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 135/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KRENYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIYDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
Length = 585
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 135/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 431
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/257 (28%), Positives = 105/257 (40%), Gaps = 43/257 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT------------LERVLGNETIAELLTTNKLSELACL 160
V A+ D+ G+ SVP SLVV + R+L E N L
Sbjct: 59 VVAAHDIATGETLLSVPFSLVVDSADAPLATAAPEIRRILDEEFPLSATNENAL------ 112
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
L+ K S W YI L + L +S+ EL+YL GS +R
Sbjct: 113 ---LLLVHKNDPNSPWQRYIDVLPS-------TFSTTLFFSDDELSYLEGSSLHHFARQR 162
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV--QSCVVHLQKVSL 278
I+ +Y+ + T LF YP E F+ + +K A + +S VV K L
Sbjct: 163 RRAIESQYDTIFT------PLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVDEGKRGL 216
Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ---PNSKL 335
+ + P + + K + AVD + P K GE I V G N++L
Sbjct: 217 VPWADMFNMAPE----TEQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYGQSRQMSNAQL 272
Query: 336 LINYGFVDEDNPYDRLV 352
L++YGFV E+NP+D +V
Sbjct: 273 LMDYGFVLENNPHDAVV 289
>gi|440804743|gb|ELR25614.1| SET domain containing protein, partial [Acanthamoeba castellanii
str. Neff]
Length = 273
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/254 (28%), Positives = 104/254 (40%), Gaps = 43/254 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT------------LERVLGNETIAELLTTNKLSELACL 160
V A+ D+ AG+ SVP SLVV + R+L E N L
Sbjct: 45 VVAAHDIAAGETLLSVPFSLVVDSADALLATSAPEIRRILDEEFPLSPTNENAL------ 98
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
L+ K S W YI L + L +S+ EL+YL GS +R
Sbjct: 99 ---LLLVHKNDPNSPWQRYIDVLPS-------TFSTTLFFSDDELSYLEGSSLHYFARQR 148
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV--QSCVVHLQKVSL 278
I+ +Y+ + T LF YP E F+ + +K A + +S VV K L
Sbjct: 149 RRAIESQYDTIFT------PLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVDEGKSGL 202
Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP---QPNSKL 335
+ + P + + K + AVD + P K GE I V G N++L
Sbjct: 203 VPWADMFNMAPE----TEQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYGQSRQMSNAQL 258
Query: 336 LINYGFVDEDNPYD 349
L++YGFV E+NP+D
Sbjct: 259 LMDYGFVLENNPHD 272
>gi|225448769|ref|XP_002275729.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Vitis
vinifera]
Length = 480
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 131/286 (45%), Gaps = 35/286 (12%)
Query: 75 EEDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
++D D W+ K G+ V+ K ++ + AS+ +Q GD VP ++
Sbjct: 39 DKDCDDFLPWLEQKAGVEISSVLSIGKSTYGRS------LFASKSIQTGDCILKVPYNVQ 92
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
++ + V I LL +++ +A LA+ + E K G+ S W PYI L Q G
Sbjct: 93 ISPDNV--PSKINSLLG-DEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE---- 144
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW-FMAGSLFQQYPYDIPTE 252
+ S + WSE EL + S E + + I++++ + V + +LF+ DI +
Sbjct: 145 MHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLK 200
Query: 253 AFTFEIFKQAFVAVQSC-VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA--V 309
F + +C +V + + +L+P + + ++L +D
Sbjct: 201 EF-----------MHACALVGSRAWGSTKGLSLIPFA-DFVNHDGFSDSVLLGDEDKQLS 248
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+++ DR Y GE +++ G PN+ LL+++GF N YD++ ++V
Sbjct: 249 EVIADRNYAPGEQVLIRYGKFPNATLLLDFGFTLPYNIYDQVQIQV 294
>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
Length = 585
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/291 (23%), Positives = 135/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|297736447|emb|CBI25318.3| unnamed protein product [Vitis vinifera]
Length = 487
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/292 (23%), Positives = 133/292 (45%), Gaps = 40/292 (13%)
Query: 75 EEDLGDLKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
++D D W+ K G+ V+ K ++ + + AS+ +Q GD VP ++
Sbjct: 39 DKDCDDFLPWLEQKAGVEISSVLSIGKSTYGSRS-----LFASKSIQTGDCILKVPYNVQ 93
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
++ + V I LL +++ +A LA+ + E K G+ S W PYI L Q G
Sbjct: 94 ISPDNV--PSKINSLLG-DEVGNIAKLAIVISVEWKMGQDSEWAPYINRLP-QPGE---- 145
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW-FMAGSLFQQYPYDIPTE 252
+ S + WSE EL + S E + + I++++ + V + +LF+ DI +
Sbjct: 146 MHSTIFWSEGELKMIQQSSVYQETINQKAQIQKDFLAIKPVLHHFSENLFK----DISLK 201
Query: 253 AFTFEIFKQAFVAVQSC-VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---- 307
F + +C +V + + +L+P + + ++L +D
Sbjct: 202 EF-----------MHACALVGSRAWGSTKGLSLIPFA-DFVNHDGFSDSVLLGDEDKQLS 249
Query: 308 ----AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
++++ DR Y GE +++ G PN+ LL+++GF N YD++ ++V
Sbjct: 250 ESSSTLEVIADRNYAPGEQVLIRYGKFPNATLLLDFGFTLPYNIYDQVQIQV 301
>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
africana]
Length = 597
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/291 (23%), Positives = 136/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEVVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAN-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ +L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 291 EDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
melanoleuca]
Length = 591
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 69/291 (23%), Positives = 135/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRDLQCTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P +AFT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDAFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V R ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 291 EDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
Length = 512
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 58/248 (23%), Positives = 120/248 (48%), Gaps = 23/248 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 28 ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-S 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 86 PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY--- 135
Query: 232 DTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 136 --AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 191
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N
Sbjct: 192 WDMCNHTNGLITTGYNLEDDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNS 251
Query: 348 YDRLVVEV 355
+DR+ +++
Sbjct: 252 HDRVKIKL 259
>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
Length = 394
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 66/293 (22%), Positives = 133/293 (45%), Gaps = 33/293 (11%)
Query: 72 SKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
K+E+ +L W NG + +V+ E+ + A+ +++A + VP
Sbjct: 73 GKREDYFPELMKWASANGASVEGFEVVNFEEEGFG--------LRATREIKAEELFLWVP 124
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 125 RKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++PL + E E+ YL + ++ + + R+Y V Q +P
Sbjct: 183 -------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYRV-------IQTHP 228
Query: 247 Y--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAML 302
+ +P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 229 HANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGY 288
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V R + AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 289 NLEDDRCECVALRDFHAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|302754340|ref|XP_002960594.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
gi|300171533|gb|EFJ38133.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
Length = 403
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 70/249 (28%), Positives = 108/249 (43%), Gaps = 36/249 (14%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMY 166
HRP + AG+ +LV+T E+ L E IA L N+ + ++ L +M
Sbjct: 72 HRP---------MCAGECMIKFSQNLVLTPEK-LPCEVIALLDQANEFTRVS---LLVMA 118
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
EK++G+ S W PYI L G+ + S + W ELA L SP ER ++
Sbjct: 119 EKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECSPIHRGTGERNALLQS 173
Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
EY E+ V S Y D+ + E FK + V S S ++P
Sbjct: 174 EYREVKKV---VESCPHLYDPDV-----SLEQFKHEYATVSSRAWGQGPHS---DMTMIP 222
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVV-DRPYKAGES-----IVVWCGPQPNSKLLINYG 340
L + + + + + DD +VV R Y+ G+ + + G N+ L ++YG
Sbjct: 223 L-VDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHLKVHICYGDHSNAVLALDYG 281
Query: 341 FVDEDNPYD 349
FV DNP+D
Sbjct: 282 FVVPDNPFD 290
>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
Length = 1118
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 69/272 (25%), Positives = 112/272 (41%), Gaps = 48/272 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLM 165
+ A D+ A F++P ++ + E + EL + L + L L +M
Sbjct: 694 IVALRDIPADTTLFTIPRDAIINSDTSSLREKLPELFESQGDEDEQQALDSWSALILIMM 753
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGI 224
YE G +S W PYI L L ++P+ WSE EL+YL S T +I + AE +
Sbjct: 754 YEFFLGHQSKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEEM 806
Query: 225 KREY--------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
R +L + GS Y +D+ E + +V
Sbjct: 807 FRTRLIPAIRGNPSVFASSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDDESDGWV 866
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ + V++A +L ++ A + D+ + + R KAGE I+
Sbjct: 867 EDREGKSMMGMVAMA----------DILNADAEFNAHVNHGDEELTVTSIRDIKAGEEIL 916
Query: 325 VWCGPQPNSKLLINYGFVDEDNP-YDRLVVEV 355
+ GP PNS+LL YG++ E + YD VVE+
Sbjct: 917 NYYGPHPNSELLRRYGYITEKHSRYD--VVEI 946
>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 590
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 71/263 (26%), Positives = 126/263 (47%), Gaps = 40/263 (15%)
Query: 103 HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA--- 158
+++ +R +H A + + +P S ++TLE + ET +A+ + KL+ L+
Sbjct: 173 YSKNYRGVH---ARRKVYNKETILFIPKSHLITLE--MAKETDVAKKIIAAKLNLLSPKH 227
Query: 159 -CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
L+ +L+ E+K K+S W PY+ L + P+ +SE +L++L GSP + ++
Sbjct: 228 SFLSTFLLQERK-NKESKWKPYLDILPSDYN------QFPIFFSEDDLSWLKGSPFQNQV 280
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
E+ IKR+Y+++ +V F +Y TFE F A + S V LQ ++
Sbjct: 281 REKKADIKRDYDDICSV----APEFAEY---------TFEDFCWARMTASSRVFGLQ-IN 326
Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPN 332
+ A VPL L + K DD +Q + D P GE + G + N
Sbjct: 327 EQKTDAFVPLADML--NHRRPKQTSWQYDDQREGFVIQALEDIP--RGEQVYDSYGRKCN 382
Query: 333 SKLLINYGFVDEDNPYDRLVVEV 355
S+ +NYGF++ DN + + + +
Sbjct: 383 SRFFLNYGFINLDNDANEVALRL 405
>gi|336261436|ref|XP_003345507.1| hypothetical protein SMAC_07495 [Sordaria macrospora k-hell]
gi|380088183|emb|CCC13858.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 499
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 70/304 (23%), Positives = 122/304 (40%), Gaps = 50/304 (16%)
Query: 81 LKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L W H +G P ++ EK + + +P +A+E L + A + P S+ ++
Sbjct: 12 LLDWAHNHGASLHPSVEIYQDEKTGFSLRVKP----SAAESLHSPFKAVTCPTSITLSYL 67
Query: 138 RVLGNETIAELLT---------------TNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
L + I LT N L YL+ + +GK S W PYI
Sbjct: 68 NALTDGPITPYLTPPALDTQKHAFPERFMNSLPPHVIGRFYLIQQYLKGKSSLWAPYIST 127
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS-- 240
L + A+ P W+E ++ L G+ I E + +K EY + + GS
Sbjct: 128 LTDPSQLDKWAL--PPFWTEHDIELLRGTNAYVAIQEIQDNVKSEYKQARKILKQEGSPD 185
Query: 241 --LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS--- 295
+ Q Y+ FT F+ + + +S ++++ L+P G + +S
Sbjct: 186 YRAYTQVLYNWAYCMFTSRSFRPSLILSESAREYVER--------LLPEGAKIDDFSILQ 237
Query: 296 -----------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
++ L + A +L+ Y+ G+ + G + NS+LL+ YGFV E
Sbjct: 238 PLYDIGNHSPEAEYSWNLTSEPSACELICRNSYEPGQQVFNNYGKKTNSELLLGYGFVTE 297
Query: 345 DNPY 348
+N Y
Sbjct: 298 NNDY 301
>gi|302771638|ref|XP_002969237.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
gi|300162713|gb|EFJ29325.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
Length = 336
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 70/249 (28%), Positives = 107/249 (42%), Gaps = 36/249 (14%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMY 166
HRP + AG+ LV+T E+ L E IA L N+ + ++ L +M
Sbjct: 72 HRP---------MCAGECMIKFSQDLVLTPEK-LPCEVIALLDQANEFTRVS---LLVMA 118
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
EK++G+ S W PYI L G+ + S + W ELA L SP ER ++
Sbjct: 119 EKRKGQNSAWAPYIECLP---SFGE--IHSTIFWDPKELACLECSPIHRGTGERNALLQS 173
Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
EY E+ V S Y D+ + E FK + V S S ++P
Sbjct: 174 EYREVKKV---VESCPHLYDPDV-----SLEQFKHEYATVSSRAWGQGPHS---DMTMIP 222
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVV-DRPYKAGES-----IVVWCGPQPNSKLLINYG 340
L + + + + + DD +VV R Y+ G+ + + G N+ L ++YG
Sbjct: 223 L-VDFANHDPRSRTLFSHADDNCTVVVASRDYQTGDENFHLKVHICYGDHSNAVLALDYG 281
Query: 341 FVDEDNPYD 349
FV DNP+D
Sbjct: 282 FVVPDNPFD 290
>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
boliviensis boliviensis]
Length = 513
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 58/248 (23%), Positives = 120/248 (48%), Gaps = 23/248 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 28 ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-S 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFW PYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 86 PNSFWQPYIQTLPSE-------YDTPLYFEEEEVRYLQSTQAIHDVFSQYKNTARQY--- 135
Query: 232 DTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 136 --AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 191
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ DD + V + ++AGE I ++ G + N++ +I+ GF ++N
Sbjct: 192 WDMCNHTNGLITTGYNLEDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNS 251
Query: 348 YDRLVVEV 355
+DR+ +++
Sbjct: 252 HDRVKIKL 259
>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
Length = 591
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 66/293 (22%), Positives = 135/293 (46%), Gaps = 33/293 (11%)
Query: 72 SKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVP 129
K+E+ +L W NG + +V+ E+ + A+ +++A + VP
Sbjct: 73 GKREDYFPELMKWASANGASVEGFEVVNFEEEGFG--------LRATREIKAEELFLWVP 124
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 125 RKLLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++PL + E E+ YL + ++ + + R+Y +F + Q +P
Sbjct: 183 -------YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--RVIQTHP 228
Query: 247 Y--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAML 302
+ +P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 229 HANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGY 288
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V R + AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 289 NLEDDRCECVALRDFHAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
Length = 377
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/195 (26%), Positives = 89/195 (45%), Gaps = 26/195 (13%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LAL ++ E+ +G+ S W PYI L + +++ LW +TEL+YL SP + E
Sbjct: 114 LALIVLMERYKGQSSVWAPYISCLPQPA-----ELDNTFLWEDTELSYLKASPLYGKTRE 168
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
R E I E+ ++ + LF + + E FK + V S + +++
Sbjct: 169 RLEMITTEFGQVQNALNVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIG 214
Query: 280 RRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
LV + P+L + +S K + + + DR Y + I + G N++
Sbjct: 215 EDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVVTADRAYTENDQIWINYGDLSNAE 272
Query: 335 LLINYGFVDEDNPYD 349
L ++YGF +NPYD
Sbjct: 273 LALDYGFTVPENPYD 287
>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
gallus]
gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
Length = 593
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 121/250 (48%), Gaps = 23/250 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
PL + DD + V + +KAGE I ++ G + N++ +I+ GF ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331
Query: 346 NPYDRLVVEV 355
N +DR+ +++
Sbjct: 332 NSHDRVKIKL 341
>gi|255083899|ref|XP_002508524.1| set domain protein [Micromonas sp. RCC299]
gi|226523801|gb|ACO69782.1| set domain protein [Micromonas sp. RCC299]
Length = 425
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 107/248 (43%), Gaps = 35/248 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE----K 168
+ A+E+++ G++ +P S ++T+ER + + L E + LA +L +
Sbjct: 24 LVATEEVRRGESLLDIPESTLITVERAIAESNLGP--AHANLQEWSVLAAFLAEQALAID 81
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKRE 227
S + Y+R L R+ G L W E ++ L GSP++ +ER +
Sbjct: 82 AGADGSRFATYVRALPRRTG-------GVLDWPEEDVKELLAGSPSQRAAMERQASVDAA 134
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
+E+ +P P + AF + S ++ L A ALVP
Sbjct: 135 IDEIRA----------SFPQLTPG------ALRWAFDVLFSRLIRLPNRGGA--LALVPW 176
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--D 345
+L + C A + AV L DR YK GE + GP+P+S+LLI+YGF +
Sbjct: 177 AD-MLNHRPGCDAYIDDTGGAVCLSPDRRYKPGEQVYASYGPRPSSELLISYGFAPAVGE 235
Query: 346 NPYDRLVV 353
NP D V
Sbjct: 236 NPDDEFEV 243
>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
Length = 459
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 129/302 (42%), Gaps = 62/302 (20%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W +NG V +K++P + + + A E ++ +P LV+T ++
Sbjct: 50 WCSENGAYLGSVAIKDRPDGD------YGLVAEEKIEESMQFLGIPMKLVMTTASARKSK 103
Query: 144 TIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ LL + +S +A LA++L+ E G+ SFW PYI L + + L
Sbjct: 104 -LGPLLRDDPIMKSMSNVA-LAIFLILELSAGESSFWHPYISVLPD-------SFNTVLY 154
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
++ EL L+GS E L+ I R+Y + F L + P+ + FT++++
Sbjct: 155 FNIEELELLSGSAVLDEALKLHRSIARQYAYFHKI-FRTHPLAKSLPF---KDCFTYDLY 210
Query: 260 KQAFVAV---QSCVVHLQKVSL----------ARRFALVPL--------GPPLLAYSS-- 296
+ A AV Q+ V + L A ALVPL G L Y S
Sbjct: 211 RWAVSAVMTRQNAVPWTESDGLGGDDVEIDGTAAVTALVPLWDMCNHSDGKVLTDYDSSA 270
Query: 297 ---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+C AM R + GE + ++ G + N++ I+ GFV EDN YD + +
Sbjct: 271 SMVRCYAM-------------RDFDKGEEVTIFYGKRTNAEFFIHNGFVFEDNRYDAVDI 317
Query: 354 EV 355
++
Sbjct: 318 KL 319
>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
atroviride IMI 206040]
Length = 956
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 64/262 (24%), Positives = 108/262 (41%), Gaps = 50/262 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A +D+ A F+VP S +V +E L +T E+ + + L +
Sbjct: 531 IVALQDIPADTVLFTVPRSAIVNIETSELRAKLPDVFLNQDTAMEVDNKPQQDPWSTLII 590
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RA 221
L+YE +G +S W PY+ L + E+P+ WS+ E+ L S T+++I + A
Sbjct: 591 VLIYEYFKGDQSSWKPYLDVL-------PASFETPMFWSDAEVDELQASATRSKIGKTNA 643
Query: 222 EGI--------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
E + + EL + GS Y +D F+
Sbjct: 644 EEMFHAKILPVIRGNPDIFQTSQAKSDEELIQLAHRMGSTIMSYAFD----------FQN 693
Query: 262 AFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
+ + A+ +VP+ +L ++ A + DDA+ + R KAG
Sbjct: 694 EDEEEEDDSEEWVEDREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVATLRTIKAG 752
Query: 321 ESIVVWCGPQPNSKLLINYGFV 342
E I+ + GP PNS+LL YG+V
Sbjct: 753 EEILNYYGPHPNSELLRRYGYV 774
>gi|255947868|ref|XP_002564701.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591718|emb|CAP97961.1| Pc22g06730 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 679
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 98/200 (49%), Gaps = 15/200 (7%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
A +LM + +G + FW PY+R L + GQL +PL + E ++ ++ G+ +
Sbjct: 105 TFAFFLMGQYLRGSEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159
Query: 219 ERAEGIKREYN----ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA-VQSCVVHL 273
ER + +++Y+ +LD + F +QY +++ A T I +AF A V S V
Sbjct: 160 ERIKVWEQKYDLGYLKLDEIGFPD---CEQYTWELYLWASTI-ITSRAFSAKVLSGAVQP 215
Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
+ AL+PL L + K A D+ + L+V + AG+ I GP+ N
Sbjct: 216 DDLPEDGVSALLPL-IDLPNHRPMAKVEWRAGDEDIGLLVLEDHSAGQEISNNYGPRNNE 274
Query: 334 KLLINYGFVDEDNPYDRLVV 353
+LLINYGF NP D +V
Sbjct: 275 QLLINYGFCIAGNPTDYRIV 294
>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
Length = 400
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 67/263 (25%), Positives = 115/263 (43%), Gaps = 45/263 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A++D++ GD FS+P S++++ + ++EL ++LS + L L +MYE ++
Sbjct: 41 VTANKDIKEGDLLFSLPRSILLSQLTSSLKDQVSEL---SELSGWSPLILCMMYEIEK-P 96
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
SFW PY L R+ +P+ W++ +L L G+ ++I + E + +NEL+
Sbjct: 97 DSFWKPYFDVLPRE-------FTTPMFWNQEDLKELEGTDIISKI-GKKESEELFHNELE 148
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFK--QAFVAVQSCVVHLQKVSLARR--------- 281
+ ++YP + T E+F + + S LQK
Sbjct: 149 PI-------IKKYPNLFDEQKHTIELFHICGSLIMAYSFNDELQKAPKENNKEEEKEEEE 201
Query: 282 ---------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
++VP+ L + A L D++Q+ + K GE I
Sbjct: 202 EEEEEEEEEEEEEGLISMVPMADMLNHKTGFNNARLFHEPDSLQMRAIKDIKEGEQIYNT 261
Query: 327 CGPQPNSKLLINYGFVDEDNPYD 349
G N+ LL YGFVDE N +D
Sbjct: 262 YGDLCNADLLRKYGFVDEKNDFD 284
>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
Length = 585
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 58/248 (23%), Positives = 119/248 (47%), Gaps = 23/248 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 99 ATRDIKAEELFLWVPRKLLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD- 156
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFW PYI+ L + ++PL + E E+ L + ++ + + R+Y
Sbjct: 157 PNSFWQPYIQTLPSE-------YDTPLYFEEDEVRSLQSTQAVHDVFSQYKNTARQY--- 206
Query: 232 DTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+F + Q +P+ +P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 207 --AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 262
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ DD + V R ++AGE I ++ G + N++ +I+ GF ++N
Sbjct: 263 WDMCNHTNGLITTGYNLEDDRCECVALRDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNS 322
Query: 348 YDRLVVEV 355
+DR+ +++
Sbjct: 323 HDRVKIKL 330
>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
Length = 593
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 120/250 (48%), Gaps = 23/250 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRRLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + +PL + E E+ YL + ++ + + R+Y
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YNTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
PL + DD + V + +KAGE I ++ G + N++ +I+ GF ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331
Query: 346 NPYDRLVVEV 355
N +DR+ +++
Sbjct: 332 NSHDRVKIKL 341
>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
Length = 451
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 110/242 (45%), Gaps = 25/242 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP SL +T + + + + + +A LA L+ K G+ S
Sbjct: 66 ASKSIQTGDCILQVPYSLQLTPDNLPPE---IKPFISEDVGNIAKLATVLLIHKNLGQDS 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q + + + W+E+EL + S E + + I++++ E+ V
Sbjct: 123 EWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQIEKDFLEIKPV 177
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
FQ P+ FT++ F A V S + + +L+P L +
Sbjct: 178 -------FQ--PFCQSFGDFTWKDFMHACTLVGS-----RAWGSTKGLSLIPFAD-FLNH 222
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
+A++ + DD ++ DR Y GE +++ G N+ L++++GF N YD++
Sbjct: 223 DGISEAIVMSDDDNKCSEVFSDRDYVPGEQVLIRYGKFSNATLMLDFGFTIPYNIYDQVQ 282
Query: 353 VE 354
++
Sbjct: 283 IQ 284
>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
rubripes]
Length = 598
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/252 (23%), Positives = 121/252 (48%), Gaps = 31/252 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQG 171
A+ D++A + +P +++T+E + L +++ + LAL+L+ E+
Sbjct: 110 ATRDIKAEELFLWIPRKMLMTVESA-KKSVLGPLYNQDRILQAMDNVTLALHLLCERA-N 167
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFWLPYIR L ++ ++PL + + E+ L G+ ++L + R+Y
Sbjct: 168 PASFWLPYIRTLPQE-------YDTPLFYEQDEVQLLQGTQAVQDVLSQYRNTARQY--- 217
Query: 232 DTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFA 283
+F L Q +P +P ++FTF+ ++ A +V + + ++V+LA
Sbjct: 218 --AYFY--KLIQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTLA---- 269
Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
L+PL + DD + V + YK E I ++ G + N++ +I+ GF
Sbjct: 270 LIPLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFY 329
Query: 344 EDNPYDRLVVEV 355
++N +D++ +++
Sbjct: 330 QENAHDQVKIKL 341
>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
griseus]
gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
Length = 577
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/291 (23%), Positives = 135/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 291 EDDRCECVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=Endothelial differentiation inhibitory protein D10;
AltName: Full=SET domain-containing protein 3
gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
Length = 594
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 135/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P E+FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
Length = 357
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 79/317 (24%), Positives = 131/317 (41%), Gaps = 63/317 (19%)
Query: 78 LGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
G L W ++G P ++ L ++ + P +A+E LQ G AA S P + +
Sbjct: 10 FGALVEWAEQHGARLHPSVEIYLDPVSKYSLRVSP----SATEGLQPGFAAVSCPARITL 65
Query: 135 TLERVLGNETIAELLTTNKLSELACLA------------------------LYLMYEKKQ 170
+ L + + +++ ++ A L +L+ E +
Sbjct: 66 SYLNALVDGLLDPSALSDRSAQSARLDQETSSTGAFPPRFTRSVPPHVLGRFFLVKEYLK 125
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVES-PLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
GK SFW PYI L Q+AV + P W + ++AYL G+ I E E +KRE+
Sbjct: 126 GKDSFWWPYIATLPPPE---QVAVWALPPFWPDHDIAYLEGTNAHVAIQEIQENVKREFK 182
Query: 230 ELDTVWFMAGSLFQQYPY-DIPTEAFTFEIFKQAFVAVQSCVVH----LQKVSLARRFAL 284
+ A L ++ + D+P A+T ++K AF S L + R AL
Sbjct: 183 Q-------ARKLLKEEDFPDLP--AYTQLLYKWAFCIFTSRSFRPSLVLSDATKRRLSAL 233
Query: 285 VPLG---------PPLLAYSSKCKAM-----LAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+P G PLL ++ +V D +L+ PY+ G + G +
Sbjct: 234 LPQGVQLDDFSVLQPLLDIANHSPTARYTWDTTSVPDTCRLICHDPYQPGTQVYNNYGLK 293
Query: 331 PNSKLLINYGFVDEDNP 347
NS+LL+ YGF+ + P
Sbjct: 294 TNSELLLAYGFILPETP 310
>gi|449464220|ref|XP_004149827.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 499
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 63/249 (25%), Positives = 110/249 (44%), Gaps = 39/249 (15%)
Query: 112 YVAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
++ ASE ++AGD VP +SL + + +LGNE + +A LA+ +
Sbjct: 77 FLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNE----------IGNVAKLAVVV 126
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ E K G S W PYI L + + + + W E+EL + S E L + I
Sbjct: 127 LLEHKLGLGSEWAPYIIRLPQP-----WEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 181
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
KRE+ + + +P I + + + F A+ V S + +L
Sbjct: 182 KREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS-----RAWRSTEGVSL 227
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+P L + +AML DD ++V DR + GE +++ G N+ L++++GF
Sbjct: 228 IPFAD-FLNHDGASEAMLLNDDDKQLSEVVADRDFAPGEHVLIRYGKYSNATLMLDFGFA 286
Query: 343 DEDNPYDRL 351
N +D++
Sbjct: 287 LPYNIHDQV 295
>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
Length = 597
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 135/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 77 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 130
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 131 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 185
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 186 ----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 234
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P E+FT+E ++ A +V + + +R AL+PL +
Sbjct: 235 KLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 294
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 295 DDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 344
>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
guttata]
Length = 593
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 57/250 (22%), Positives = 121/250 (48%), Gaps = 23/250 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ +L + ++ + + R+Y
Sbjct: 167 N-PHSFWLPYIQTLPSE-------YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
PL + DD + V + +KAGE I ++ G + N++ +I+ GF ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331
Query: 346 NPYDRLVVEV 355
N +DR+ +++
Sbjct: 332 NSHDRVKIKL 341
>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
Length = 592
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/250 (23%), Positives = 119/250 (47%), Gaps = 23/250 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ D++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATRDIKAEELFLWVPRKLLMTVESA-KNSILGSLYSQDRILQAMGNITLAFHLLCER- 165
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + + L + E E+ YL + +I + + R+Y
Sbjct: 166 YNPNSFWLPYIQTLPNE-------YNTALYFEEDEVQYLQSTQAIHDIFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVVQTHPNASKLPLKDSFTYDDYRWAVSSVMARQNQIPAEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
PL + DD + V + +KAGE I ++ G + N++ +I+ GF ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLKDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331
Query: 346 NPYDRLVVEV 355
N +DR+ +++
Sbjct: 332 NSHDRVKIKL 341
>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 476
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 78/303 (25%), Positives = 123/303 (40%), Gaps = 51/303 (16%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ ++G I E + + R V AS+D+ + F +P + ++++E +L
Sbjct: 13 AWLRQSGAEISPKIKLEDLRNKDAGRG---VVASQDIAEHELLFRIPRASILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A L+ L L L ++YE G S W PY L + + + W+
Sbjct: 70 TEIPAATLSL--LGPWLSLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWT 120
Query: 202 ETELAYLTGSPTKAEIL---------------------------ERAEGIKREYNELDTV 234
E ELA L S ++ ERA+ +E L+ +
Sbjct: 121 EDELAELQASAVVGKVGKESADEAFLEQLLPVIEEFADIVFSGDERAKDKAKEMRSLENL 180
Query: 235 WFM--AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
M GSL Y +D+ T E+ ++ A + L K +VPL L
Sbjct: 181 ELMHKMGSLIMAYAFDVEPATPTKEVDEEG-FAEEEEDAALPK-------GMVPLADMLN 232
Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
A + +C A L D +++ +P +AGE I GP P S LL YG+V DN V
Sbjct: 233 ADADRCNARLFYEKDCLEMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYVT-DNYAQYDV 291
Query: 353 VEV 355
VE+
Sbjct: 292 VEI 294
>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
3-like [Meleagris gallopavo]
Length = 593
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/250 (22%), Positives = 121/250 (48%), Gaps = 23/250 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E + + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KSSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 -NPNSFWLPYIQTLPNE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
PL + DD + V + +KAGE I ++ G + N++ +I+ GF ++
Sbjct: 272 PLWDMCNHTNGLITTGYNLEDDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDN 331
Query: 346 NPYDRLVVEV 355
N +DR+ +++
Sbjct: 332 NSHDRVKIKL 341
>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
Length = 429
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 122/282 (43%), Gaps = 24/282 (8%)
Query: 79 GDLKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
G L +WM NG K+ L++ P V A E L G+ +P +L+++
Sbjct: 30 GRLLTWMEANGFRLHSKLGLRDFPDTGRG------VVALEKLVGGETFLKLPATLLISTR 83
Query: 138 RVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
L + + ++ + KL+ + L L+++ +K G+ S W P++ L R +
Sbjct: 84 TALQSRLHSFIIRHHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPR-------TFTT 136
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
P+ L + E+ I+R + +L + + G + ++ + FT+
Sbjct: 137 PVFLRRKVFESLP-KDLREEVQTGITFIQRTFLKLKVL--LGGHVEEEPEVQCLSTGFTW 193
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
F A+ AV + + Q + + + AL P L + K A V + ++
Sbjct: 194 NNFVWAWTAVNTRCIFAQGSNSSSLWEDDHCALAPF-LDCLNHHWKASIETAMVGENFEI 252
Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+ + + A E + + GP N +L ++YGFV DNP D +VV
Sbjct: 253 LSHKSHDANEQVFISYGPHSNRRLFLDYGFVLPDNPNDVVVV 294
>gi|47215092|emb|CAF98166.1| unnamed protein product [Tetraodon nigroviridis]
Length = 444
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/252 (23%), Positives = 122/252 (48%), Gaps = 31/252 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQG 171
A+ D++A + +P +++T+E + L T +++ + LAL+L+ E+
Sbjct: 28 ATRDIKAEELFLWIPRKMLMTVESA-KKSVLGPLYTQDRILQAMDNVTLALHLLCERAD- 85
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFWLPYIR L ++ ++PL + + ++ L G+ ++L + R+Y
Sbjct: 86 PASFWLPYIRTLPQE-------YDTPLFYQQQDVQLLHGTQAIQDVLSQYRNTARQY--- 135
Query: 232 DTVWFMAGSLFQQYPYD--IP-TEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFA 283
+F L Q +P +P ++FTF+ ++ A +V + + ++V+LA
Sbjct: 136 --AYFY--KLVQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPTEDGRQVTLA---- 187
Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
L+PL + DD + V + YK E I ++ G + N++ +I+ GF
Sbjct: 188 LIPLWDMCNHRNGLITTGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNAEFVIHNGFFY 247
Query: 344 EDNPYDRLVVEV 355
++N +D++ +++
Sbjct: 248 QENAHDQVKIKL 259
>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
norvegicus]
Length = 416
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/291 (22%), Positives = 133/291 (45%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 12 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 65
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 66 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 121
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y V Q +P+
Sbjct: 122 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHA 169
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 170 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 229
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 230 EDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 280
>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
Length = 591
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/290 (22%), Positives = 132/290 (45%), Gaps = 37/290 (12%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNITLAFHLLCERAD-PNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRDLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAV 305
+P +AFT+E ++ V S + + L + G P + +
Sbjct: 231 NKLPLKDAFTYEDYRLGLV---SLALGRWALGLECGVGIARCGKPQITTGYNLE------ 281
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 282 DDRCECVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 331
>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
occidentalis]
Length = 381
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/280 (23%), Positives = 119/280 (42%), Gaps = 41/280 (14%)
Query: 78 LGDLKSWMHKNGLPPCKVI-LKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
+G+L SW+ + G P V+ L P+ + +++AGD +P++L++T
Sbjct: 21 IGELYSWIQRLGFKPTSVLRLACTPASGRG------IVCLSNIEAGDVIIDLPSTLLITP 74
Query: 137 ERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ V EL ++ LS L ++++ E+ G+KS W PYI +
Sbjct: 75 DLVR-----KELNMSKENLSAEEILTIFVLSERSLGEKSKWKPYIESI------------ 117
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
P ++ + P + A+ I R E V+ F+ D+
Sbjct: 118 -PDVFDGLQCRKSVRLPRRL-----AQAIDRWNAERRNVFSRLRMFFRGRGIDL-----N 166
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
FE F A+ AV + ++++ L P LL + K + V++ + +
Sbjct: 167 FETFSWAWSAVNTRCIYVE----GHGSTLAPF-LDLLNHHWKASIETSFVNNHFIIRSNV 221
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
Y+AG + + G N L +NYGFV ++NP D + VE+
Sbjct: 222 GYEAGSEVFIGYGSHDNRTLFLNYGFVLDENPNDCITVEL 261
>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
Length = 541
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 64/290 (22%), Positives = 128/290 (44%), Gaps = 26/290 (8%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPI--HYVAASEDLQAGDAAFSVP 129
S +E++ W+ N + V ++ H + + + A+ D + G+ ++P
Sbjct: 71 SGREKNFDGFMGWLKSNSVDAEAVEIQ--------HFDVGGYGIKATRDFKEGELFLAIP 122
Query: 130 NSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+++T + N + L+ N++ + LAL+++ E SFWLPY++ L
Sbjct: 123 RSVMMTTDTA-KNSALGALIADNRILQTMPNILLALHVLCELC-SPASFWLPYLKILPH- 179
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
+ SPL ++ +L L SPT +E++ + I R+Y + F L + P
Sbjct: 180 ------SYSSPLYFNPEDLQLLKASPTLSEMINQFRNITRQYAYFFNL-FQGHELASKLP 232
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAV 305
I + ++ ++ A +V + + + R AL+PL + + +
Sbjct: 233 --IQVKNICYDDYRWAVSSVMTRQNQIPTLDGQRMISALIPLWDMCNHTNGQITTDFSLK 290
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+D + AG + ++ G + N++LLI+ GFV N DRL + +
Sbjct: 291 NDRSECFSLEGTVAGAQVFIFYGSRSNAELLIHNGFVYPQNHSDRLTIRL 340
>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
Length = 594
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 135/290 (46%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P E+FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFQAGDQIYIFYGTRSNAESVIHSGFFFDNNSHDRVKIKL 341
>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
NZE10]
Length = 445
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 116/291 (39%), Gaps = 41/291 (14%)
Query: 83 SWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG 141
+W+ NG K+ L + N + A EDL + FSVP S ++T E
Sbjct: 16 NWLRDNGASISAKITLDDLRQQNAGRG----IVAVEDLDEDEELFSVPRSTMLTTETSRN 71
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E + + + LS + +AL + G +S W PY L ++ ++ + WS
Sbjct: 72 GEAVLQEVDDPWLSLIVVMALEYL----DGSQSRWKPYFDVL-------PVSFDNLMFWS 120
Query: 202 ETELAYLTGSPTKAEI----------------LERAEGIKREYNE-LDTVWFMAGSLFQQ 244
+ EL +L GS +I +ER K NE L + GS
Sbjct: 121 DRELRHLEGSTVVGKIGKEAADATFREQLIPVIERISKAKAADNEELLRMCHRMGSTIMA 180
Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA 304
Y +D+ T + + + + L K +VPL L A + + A L
Sbjct: 181 YGFDLETSSDQAKNDGEEWEEDSDAGETLPK-------GMVPLADMLNADADRNNAKLFY 233
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD V + +P KAGE + G P + LL YG++ DN VVE+
Sbjct: 234 EDDKVVMKTIKPVKAGEELYNDFGSLPRADLLRRYGYL-TDNYAQYDVVEI 283
>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
Length = 596
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/291 (22%), Positives = 135/291 (46%), Gaps = 29/291 (9%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ L + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEEEVRCLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAA 304
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNL 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + ++AG+ I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 291 EDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
Length = 467
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 68/272 (25%), Positives = 112/272 (41%), Gaps = 48/272 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLM 165
+ A D+ A F++P ++ E + + +L + L + L L +M
Sbjct: 43 ITALRDIPADTTLFTIPRDAIINSETSSLRKKLPDLFESQGDEDEEQALDSWSALILIMM 102
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEGI 224
YE G +S W PYI L L ++P+ WSE EL+YL S T +I + AE +
Sbjct: 103 YEFFLGDESKWKPYIDVL-------PLTFDTPMFWSEEELSYLQASATVNKIGKADAEEM 155
Query: 225 KREY--------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
R +L + GS Y +D+ E + +V
Sbjct: 156 FRTRLIPAIRGNPSVFVSSGDCSDEDLIGLAHRMGSTIMAYAFDLENEEAENDEESDGWV 215
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ + V++A +L ++ A + D+ + + R KAGE I+
Sbjct: 216 EDREGKSMMGMVAMA----------DILNADAEFNAHVNHGDEELTVTSIRDIKAGEEIL 265
Query: 325 VWCGPQPNSKLLINYGFVDEDNP-YDRLVVEV 355
+ GP PNS+LL YG++ E + YD VVE+
Sbjct: 266 NYYGPHPNSELLRRYGYITEKHSRYD--VVEI 295
>gi|255568191|ref|XP_002525071.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223535652|gb|EEF37318.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 456
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 120/243 (49%), Gaps = 25/243 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP S + + +L +++LL +++ +A LA+ L+ ++K G++S
Sbjct: 51 ASKSIQTGDCILRVPYSAQIASDNLLPE--LSDLLG-DEVGSVAKLAIVLLVDQKVGQES 107
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q G + S + WS++EL + S E +++ I++++ + V
Sbjct: 108 KWAPYISRLP-QLGE----MHSTIFWSKSELDMIFQSSVYKETIKQKAQIEKDFLTIKPV 162
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
+ +P + + TF+ F A+ V+S + + +L+P L +
Sbjct: 163 -------LEHFPQ--ISRSITFQDFMHAYALVKS-----RAWGSTKGVSLIPFA-DFLNH 207
Query: 295 SSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
+A++ +D ++ DR Y E +++ G N+ LL+++GF N ++++
Sbjct: 208 DGFSEAVVLNDEDKQVSEVAADRNYAPHEEVLIRYGKFSNATLLLDFGFSLPYNIHEQVE 267
Query: 353 VEV 355
+++
Sbjct: 268 IQI 270
>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
Length = 963
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 110/251 (43%), Gaps = 30/251 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV--------LGNETIAELLTTNKLSELACLALYL 164
+ A +D+ A F+VP S +++ E + ET E+ + + L + +
Sbjct: 537 IVALQDIPAEAVLFTVPRSGILSSETSELKGKLPEIFQETAMEVDDKPQQDPWSTLIIVM 596
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
MYE +G +S W PYI L + E+P+ WS+ EL L S T++++ +A
Sbjct: 597 MYEYFKGSESKWKPYIDVLPS-------SFETPMFWSDAELDELQASATRSKV-GKASAE 648
Query: 225 KREYNELDTVWFMAGSLF---QQYPYDIPTE----------AFTFEIFKQAFVAVQSCVV 271
+ +++ V LF Q Y D + +++F+ +
Sbjct: 649 EMFQDKVLPVIRANQHLFPTSQTYSDDDLIQLAHRMGSTIMSYSFDFQNEDEEDEDETEE 708
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 331
+++ +VP+ +L ++ A + DDA+ + R KAGE I + GP P
Sbjct: 709 WVEEREAKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEIFNYYGPHP 767
Query: 332 NSKLLINYGFV 342
NS+LL YG+V
Sbjct: 768 NSELLRRYGYV 778
>gi|255080880|ref|XP_002504006.1| predicted protein [Micromonas sp. RCC299]
gi|226519273|gb|ACO65264.1| predicted protein [Micromonas sp. RCC299]
Length = 529
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 80/172 (46%), Gaps = 18/172 (10%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D++AG+ +P +L VT V + +A L EL LAL+L E+ +G S W
Sbjct: 83 DVRAGEPLIEIPQNLAVTSVDVADSPIVAGLAAGR--GELVGLALWLCLERHKGPLSEWA 140
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + PLLW+ EL L GSP + + + R E EY +
Sbjct: 141 PYVATLP------SAGSDHPLLWTAGELQTLLQGSPVREQAVSRLESADDEYASI----- 189
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHLQKVSLARRFALVPL 287
+ P D P +A+ F + + AFV A+ + + ++ A +A+VPL
Sbjct: 190 --ADQIRSNPNDFPPDAYEF-LTRDAFVDALATVLARAVWLNAANCYAMVPL 238
>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
harrisii]
Length = 602
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 66/290 (22%), Positives = 132/290 (45%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG L N K + A+ +++A + VP L
Sbjct: 81 KREDYFPDLIKWAAENGASTDGFELV-----NFKEEGFG-LRATREIKAEELFLWVPRKL 134
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 135 LMTVESA-KNSVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSE--- 189
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ +L + ++ + + R+Y +F + Q +P
Sbjct: 190 ----YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPNAN 238
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 239 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 298
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + + GE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 299 DDRCECVALQDFNVGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 348
>gi|79315114|ref|NP_001030864.1| SET domain-containing protein [Arabidopsis thaliana]
gi|51971180|dbj|BAD44282.1| unnamed protein product [Arabidopsis thaliana]
gi|332645817|gb|AEE79338.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 353
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 93/202 (46%), Gaps = 24/202 (11%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA L+ EKK G+KS W+PYI L + + S + W E EL+ + S E ++
Sbjct: 2 LAAVLIREKKMGQKSRWVPYISRLPQ-----PAEMHSSIFWGEDELSMIRCSAVHQETVK 56
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+ I+++++ F+A + Q P I TE E F A+ V S + +
Sbjct: 57 QKAQIEKDFS------FVAQAFKQHCP--IVTERPDLEDFMYAYALVGS-----RAWENS 103
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNS 333
+R +L+P + +L D+ +Q+ DR Y G+ + + G N+
Sbjct: 104 KRISLIPFADFMNHDGLSASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNA 163
Query: 334 KLLINYGFVDEDNPYDRLVVEV 355
L++++GF N +D + +++
Sbjct: 164 TLMLDFGFTFPYNIHDEVQIQM 185
>gi|296419472|ref|XP_002839331.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295635461|emb|CAZ83522.1| unnamed protein product [Tuber melanosporum]
Length = 541
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 62/249 (24%), Positives = 114/249 (45%), Gaps = 25/249 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V D+ + S P++L + + + A+ +T ++ A L ++L E +GK
Sbjct: 43 VITCTDIPSHSQLISCPHTLTINYTKARSAFS-ADFITNT--TQHAALCMFLCLEWLKGK 99
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+SFW PY+ L R+ ++PL +S+ +L +L G +A +E + I RE E
Sbjct: 100 ESFWWPYLCVLPRE-------FDTPLYFSDEDLQFLQGCNLEATEVEARKLIWREEFE-- 150
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA----LVPLG 288
A S+ Q+ YD TE +T+E++ A S + + R +P+
Sbjct: 151 ----AAVSILQREGYD--TEYYTWELYLWASTIFTSRSFPGKLMDWDRIIVHEDDTMPIL 204
Query: 289 PPLLAYSSKCKAMLAA---VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
PL+ + A + D +++++ AG + GP+ N +LL+ YGF
Sbjct: 205 FPLIDSLNHYPATIITWQPSDTSLRIISGVGVSAGAEVYNNYGPKANEELLMGYGFTLLQ 264
Query: 346 NPYDRLVVE 354
NP+D +++
Sbjct: 265 NPFDSFLLK 273
>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
Length = 1037
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 68/274 (24%), Positives = 116/274 (42%), Gaps = 50/274 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN----------KLSELACLAL 162
+ A +D+ A F++P ++ +E + + ++ + +L + L L
Sbjct: 610 IIALQDIPAETTLFTIPRKGIINVETSELPKKLPDVFDLDKPIDDDDEAPRLDSWSSLIL 669
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK-------A 215
LMYE QG+KS W PY L + ++P+ WSE+EL L S + A
Sbjct: 670 VLMYEYLQGEKSQWKPYFDVLPS-------SFDTPMFWSESELDQLQASHMRHKIGKADA 722
Query: 216 EILERAE-------------GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
E + R G R ++L + GS Y +D+ + E
Sbjct: 723 ESMFRKTLLPIIRKNSSVFGGENRSDDDLVEIAHRMGSTIMAYAFDLENDEDEEEEETDG 782
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
+V + + +VP+ +L ++ A + ++++ + RP KAGE
Sbjct: 783 WVEDREGKSMM---------GMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEE 832
Query: 323 IVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVEV 355
I + GP PNS+LL YG+V E + YD VVE+
Sbjct: 833 IFNYYGPHPNSELLRRYGYVTERHSRYD--VVEI 864
>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Monodelphis domestica]
Length = 595
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 66/290 (22%), Positives = 131/290 (45%), Gaps = 29/290 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W NG L N K + A+ +++A + VP L
Sbjct: 74 KREDYFPDLIKWAAANGASTDGFELV-----NFKEEGFG-LRATREIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L +
Sbjct: 128 LMTVESA-KNSVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ +L + ++ + + R+Y +F + Q +P
Sbjct: 183 ----YDTPLYFEEDEVQHLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPNAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAV 305
+P ++FT+E ++ A +V + + +R AL+PL +
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLE 291
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + + GE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 292 DDRCECVALQDFNVGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 341
>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
purpuratus]
Length = 455
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 123/291 (42%), Gaps = 26/291 (8%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
+E L WM ++G CK + ++ R + ++L+ GD+ +P L+V
Sbjct: 40 DEQYITLMKWMKEHGFN-CKGCCLKPAVFSDTGRGL---MTKKNLRPGDSIVEIPRHLLV 95
Query: 135 TLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
T + +L E + ++ K + + +L+ E+ +GK SFW PYI L +
Sbjct: 96 TAKDILNTE-LGPIIKRQRQKPTPYQVVCAFLLTERSKGKSSFWYPYINVLPKD------ 148
Query: 193 AVESPLLWSETELAYLTGSPT--KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
+P S T+ A PT ++ + + + I+ + ++ F QY
Sbjct: 149 -FTTPAFGS-TKQADFDVLPTIARSRAINQLQDIRAAFESASCLFEDIERTFPQYRIFFS 206
Query: 251 TEAFTFEIF----KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS--SKCKAMLAA 304
++F + F + ++ C K S FAL P LL +S ++ A
Sbjct: 207 LDSFVWAWFVINSRSVYIEPSGCEAFDPKAS--DDFALAPFLD-LLNHSPGAEVTAGFDP 263
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
V + ++ Y A + + + GP N LL+ YGFV NP+D + E+
Sbjct: 264 VSNCYRIKTLDSYHAYDQVFIHYGPHDNVNLLLEYGFVIPSNPHDAVSFEL 314
>gi|425766115|gb|EKV04742.1| hypothetical protein PDIG_87340 [Penicillium digitatum PHI26]
gi|425778867|gb|EKV16969.1| hypothetical protein PDIP_33360 [Penicillium digitatum Pd1]
Length = 679
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 96/206 (46%), Gaps = 23/206 (11%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
A +LM + +G + FW PY+R L + GQL +PL + E ++ ++ G+ +
Sbjct: 105 TFAFFLMAQYLRGPEGFWYPYLRTLPQP---GQLT--TPLFFGEEDVDWIQGTGIPEAAV 159
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
ER + + +Y D+ + G+ +P E +T+E++ A + S + +S
Sbjct: 160 ERIKIWEEKY---DSGYLQLGA--TGFP---DCETYTWELYLWASTIITSRAFSAKVLSG 211
Query: 279 ARR---------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
A + AL+PL L + K A D + L+V + AG+ I GP
Sbjct: 212 AVQPGDLPEDGVSALLPL-IDLPNHRPMAKVEWRAGDKDIGLLVLEDHSAGQEISNNYGP 270
Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEV 355
+ N +LLINYGF NP D +V +
Sbjct: 271 RNNEQLLINYGFCIAGNPTDYRIVHL 296
>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
Length = 509
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 95/208 (45%), Gaps = 24/208 (11%)
Query: 64 VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQ 120
VA V S+ + D L +W+ G+ KV ++ P R VAA ED+
Sbjct: 44 VAVDASVDSRTQADFDALWAWLGSEGVDVSKVSPALVDAAPGG----RGWGLVAA-EDIG 98
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYI 180
GDA ++P SL +T++ L + A + +AL L++E+ G+KS W Y+
Sbjct: 99 GGDAVLAIPRSLWMTVDTALASPIGAH---CGDEAGWIAVALQLLHERSIGEKSRWAAYV 155
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L Q +++PL WS E+A LTG+ ++L+ A G + T + S
Sbjct: 156 NALPAQ-------LDAPLFWSAEEVATLTGT----QLLDAAAGY--DSYARGTWARLKES 202
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
F P P++AF F AF ++S
Sbjct: 203 AFDANPDVFPSDAFDEPSFLWAFGILRS 230
>gi|302792358|ref|XP_002977945.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
gi|300154648|gb|EFJ21283.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
Length = 467
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/270 (25%), Positives = 109/270 (40%), Gaps = 57/270 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL G+ ++P + +TL + I L L + LMYE+ +GK
Sbjct: 34 VRALRDLHHGELIATIPKAACLTLLTTAARDAIERARLGGGLG----LTVALMYERSKGK 89
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
S W Y++ L RQ P LWSE E+ L G+ + E +K ++ E
Sbjct: 90 GSKWYRYLKTLPRQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEE- 141
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPP 290
L ++ P + P + FTFE +++A +S V ++ + +VPL
Sbjct: 142 -----NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEIDAEHGYGMVPLAD- 191
Query: 291 LLAYSSKCKA-----MLAA-----------VDDA---------------VQLVVDRPYKA 319
++ K A ML A +DD +++V+ + A
Sbjct: 192 --LFNHKTDAEDVHFMLNASDSDDDDNGLIIDDGLANGDCREISSDKSVLEMVMVKDVAA 249
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
G I G N+ LL YGF + +NP+D
Sbjct: 250 GSEIFNTYGQLGNAALLHRYGFTEPNNPHD 279
>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
Length = 492
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 131/288 (45%), Gaps = 36/288 (12%)
Query: 74 KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
+E+ +L SW+ +NG V +K NE + + A++DL+ + ++P +++
Sbjct: 81 REDHFSNLISWIKENGGVADNVTIKH---FNEMG---YGLEAAKDLEESELICAIPKNVM 134
Query: 134 VTLERVLGNETIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
+TL+ V + L N + +A LAL+L+ E + + SFW YI L
Sbjct: 135 MTLDNV-KVSPLKYLYENNPILKNMGNVA-LALFLILEHVKNENSFWHHYISSLPSD--- 189
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYD- 248
+ L + + + SPT + + I R+Y +F +LFQ +
Sbjct: 190 ----YNTVLYFDLNDFLEMKNSPTFEMATKHCKNIARQY-----AYF--NNLFQNSNDEA 238
Query: 249 --IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-----ALVPLGPPLLAYSSKCKAM 301
I FT+++++ A V + + S + L+PL + +++ +
Sbjct: 239 SLILRNVFTYQLYRWAVSTVMTRQNFIPSSSTSNDVENGINGLIPLWD-MCNHTNGYLST 297
Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
VD + L +P+K GE ++++ G + NS L++ GFV ++NP+D
Sbjct: 298 QYKVDRSECLAC-KPFKKGEQVLIFYGERSNSDFLVHNGFVYDENPHD 344
>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
Length = 454
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/250 (26%), Positives = 104/250 (41%), Gaps = 40/250 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V ASED++ + F +P S +++E + I ++ KL+ L LY+M K G
Sbjct: 42 VIASEDIEEDEVLFKIPRSSFLSVEN--DPDFIKQVPEAKKLNSWLQLILYMM---KAGS 96
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-- 230
+ W PY L Q ++S ++W++ EL L GS +I G + +Y E
Sbjct: 97 MTKWKPYFDVLPTQ-------LDSLMMWTDDELEGLKGSMIVKKI--GKAGAEEDYQEKL 147
Query: 231 -------------LDT---VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 274
DT + G L Y +D P ++F+ + L
Sbjct: 148 KPIIDAHPEYFKDCDTSLESFHRMGGLIMAYSFDAP-DSFS----EDEEDDEDIEHDDLY 202
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD-AVQLVVDRPYKAGESIVVWCGPQPNS 333
L + A+VPL L A++ C A L A DD + +P K GE + G PN
Sbjct: 203 NEGLVK--AMVPLADTLNAHTRFCNANLIAEDDGGFSMTAIQPIKKGEQVYNTYGELPNC 260
Query: 334 KLLINYGFVD 343
L YG+V+
Sbjct: 261 DFLRRYGYVE 270
>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
Length = 464
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 58/249 (23%), Positives = 109/249 (43%), Gaps = 26/249 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
+ A + +Q G+ +P +L+++++R + NE + L+E L ++L+ + G
Sbjct: 103 LLAFKKIQQGEKLIEIPENLILSVDRDQIKNEG------NDFLNEYDSLGIFLIQQMAMG 156
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KS W Y L R+ + W+ ++ +L GS T L E IK ++ L
Sbjct: 157 DKSKWKIYFDILPREED-----LNLGFRWNLNDIVFLRGSKTLNASLYLKEKIKIQFLRL 211
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
+ F L +YP I F ++ A + S + LQ + ++ +LVP
Sbjct: 212 EKTIFSKNRL--KYPVSI----FNLAQWEWALSILLSRAIFLQNL---KKVSLVPYADFM 262
Query: 290 ---PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
P K + + + + + D+ Y + I G + N +LL+ YGF+ E N
Sbjct: 263 NHNPFSTSYINSKKISFSKNHEIVMYADKDYNKFDQIFTTYGQKTNLELLLLYGFILERN 322
Query: 347 PYDRLVVEV 355
P+D + + +
Sbjct: 323 PFDSIELRI 331
>gi|327290197|ref|XP_003229810.1| PREDICTED: SET domain-containing protein 4-like [Anolis
carolinensis]
Length = 440
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/284 (25%), Positives = 123/284 (43%), Gaps = 30/284 (10%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK+ K LKEK + K RP + + ++ LQ G+ S+P ++T + V
Sbjct: 29 HKDEYILLKKWLKEKGCNVNKLRPAQFPETGRGLVTTKGLQVGELIISLPEKCLLTTDTV 88
Query: 140 LGNETIAELLT--TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
L N + E + T +S L L +L+ EK +KS W PY+ L + S
Sbjct: 89 L-NSYLREYIVKWTPPISPLIALCTFLIAEKWAQEKSPWKPYLDLLPE--------IYSC 139
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
+ E ++ L P + + E+ + ++ + +F LF P D+ + F ++
Sbjct: 140 PVCLEQKIVNLFPEPLRRKAHEQRKLVQELFISSQQFFFSLQPLF---PKDVAS-VFNYQ 195
Query: 258 IFKQAFVAVQSCVV---HLQKVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAVQL-- 311
FK A+ + + V H Q+ +R L P L L + + A ++ +
Sbjct: 196 AFKWAWCTINTRTVYMKHSQRDCFSRDTDTYALAPYLDLLNHNPTVQVKAGFNEKTKCYE 255
Query: 312 --VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
V + + E + + GP N +LL+ YGFV DNP+ + V
Sbjct: 256 ITTVTQCHHYNEVFICY-GPHDNQRLLLEYGFVSRDNPHSSVYV 298
>gi|260835124|ref|XP_002612559.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
gi|229297937|gb|EEN68568.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
Length = 327
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 71/293 (24%), Positives = 121/293 (41%), Gaps = 50/293 (17%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
+D L W+ +NG ++L + P V ++ +L+ GD S+P +L++T
Sbjct: 4 DDSIQLMRWLRRNGFRDSHLVLTDFPDTGRG------VMSTRNLKEGDCIVSLPENLLIT 57
Query: 136 LERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
V+ N + + + T K L+ L+LYL+ EK +GK SFW PYI+ L +
Sbjct: 58 TTTVV-NSHLGQYIKTWKPRLTPKQVLSLYLIAEKSRGKDSFWYPYIQTL-------PTS 109
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
+P +S E+ L +A + R + ++ Y L T LF +
Sbjct: 110 YTTPSYFSTAEVDALPALVREATLRHR-KVLQNSYKSLQTSLHNLEPLFPDW-----KTV 163
Query: 254 FTFEIFKQAFVAVQSCVVH-------LQKVSLARRFALVPL-----GPPLLA------YS 295
FT + ++ A+ V + V+ S +AL P PL+ S
Sbjct: 164 FTLKSYRWAWATVYTRSVYKRGPGWEFLDPSDPDVYALAPFLDMLNHSPLVQTDTDFNVS 223
Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
SKC ++ + + + + P N +LL+ YGFV NP+
Sbjct: 224 SKC----------YEVKTEGACRKYRQVFINYDPYDNGRLLMEYGFVMPRNPH 266
>gi|317144568|ref|XP_001820210.2| SET domain protein [Aspergillus oryzae RIB40]
gi|391871646|gb|EIT80803.1| N-methyltransferase [Aspergillus oryzae 3.042]
Length = 703
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/195 (28%), Positives = 90/195 (46%), Gaps = 7/195 (3%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+LM + QGK+ FW PYIR L Q G A+ +PL + +L +L G+ ++A
Sbjct: 121 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 175
Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 280
+K +Y + T AG ++Y +D+ A T + + V S V+ ++
Sbjct: 176 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 235
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
L+P +L + K A V +V AG+ I GP+ N +L++NYG
Sbjct: 236 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYG 294
Query: 341 FVDEDNPYDRLVVEV 355
F +NP D +V +
Sbjct: 295 FCLPNNPCDYRIVSL 309
>gi|238485948|ref|XP_002374212.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|83768069|dbj|BAE58208.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220699091|gb|EED55430.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 713
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/195 (28%), Positives = 90/195 (46%), Gaps = 7/195 (3%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+LM + QGK+ FW PYIR L Q G A+ +PL + +L +L G+ ++A
Sbjct: 131 FFLMGQYLQGKEGFWYPYIRTLP-QPG----ALTTPLYYEGDDLEWLEGTSLSPARQQKA 185
Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 280
+K +Y + T AG ++Y +D+ A T + + V S V+ ++
Sbjct: 186 NLLKEKYGTVYTELCKAGFDGAEKYTWDLYLWASTIFVSRAFSAKVLSGVIPDTQLPEEN 245
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
L+P +L + K A V +V AG+ I GP+ N +L++NYG
Sbjct: 246 VSVLLPF-IDILNHRPLAKVEWRAGKGNVAFLVLEDVAAGQEISNNYGPRNNEQLMMNYG 304
Query: 341 FVDEDNPYDRLVVEV 355
F +NP D +V +
Sbjct: 305 FCLPNNPCDYRIVSL 319
>gi|365989356|ref|XP_003671508.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
gi|343770281|emb|CCD26265.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
Length = 540
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/252 (25%), Positives = 105/252 (41%), Gaps = 34/252 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT-----TNKLSELA---CLALYL 164
+ AS+D+ + F +P S ++ N T ++L T KL EL+ L + +
Sbjct: 90 IIASKDIDTDELLFEIPRSSIL-------NVTTSQLCVDFPHITGKLMELSQWDSLIICM 142
Query: 165 MYEKKQGK-KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT--------GSPTKA 215
MYE K + +S W Y L L + W++ EL++LT G
Sbjct: 143 MYEMKVLQHESRWSSYFNVLPSSESLNTL-----MYWNDKELSFLTPSLVVNRVGKGDAE 197
Query: 216 EILERAEGIKREYNELDTVWFMAGSL----FQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
+ R E+NE D + GS+ F P I +F EI
Sbjct: 198 TMYRRILDTINEFNE-DILTEKLGSISWEEFLYIPSIIMAYSFDVEIKNDDDENEGDEEF 256
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 331
++ +++PL L A + KC A L D+++++ +P K GE + G P
Sbjct: 257 DEKEEEPELLKSMIPLADTLNADTHKCNANLTYDKDSLKMLAIKPIKKGEQVYNTYGELP 316
Query: 332 NSKLLINYGFVD 343
NS+LL YG+V+
Sbjct: 317 NSELLRKYGYVE 328
>gi|159490820|ref|XP_001703371.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280295|gb|EDP06053.1| predicted protein [Chlamydomonas reinhardtii]
Length = 339
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 108/250 (43%), Gaps = 18/250 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-----VLGNETIAELLTTNKLSELACLALYLMYE 167
+ AS +++ G+ VP+ V+ E VL E + + ++ E+ L + +M+E
Sbjct: 68 LVASRNIKMGEVVVEVPDDAVLMAENCGLRDVLEEEGMTKDSADEEILEVQGLVIAVMWE 127
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
+ +G +S W PY+ L PL W E L G+ ++L RA+
Sbjct: 128 RWRGPESRWAPYLALLPDD------MTHMPLYWKRREFRELRGTAAYDKMLGRAQHPSDA 181
Query: 228 YNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
++ +W + G ++P +P +E+++ A AV S L + A+V
Sbjct: 182 PTQVPLLWSEVVGPFIAEHPELGLPGGERGYELYRWATAAVASYSFILGD---DKYQAMV 238
Query: 286 PLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
P+ L + L + +Q++ R AG +V G N++LL YGFV+
Sbjct: 239 PVWDLLNHITGDVNVRLHHCSKRHVLQMIAMRDIVAGSELVNNYGELSNAELLRGYGFVE 298
Query: 344 EDNPYDRLVV 353
N Y+ + V
Sbjct: 299 RANRYNHIPV 308
>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
Length = 456
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 111/249 (44%), Gaps = 20/249 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQG 171
+ A+ DL+ G+ ++P + ++T E VL + + L +S L L +L+ E+ G
Sbjct: 62 LMATRDLKPGELIIALPETCLITTETVLQSYLGKYIRLWRPHVSPLLALCTFLIAERFAG 121
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
++S W PY+ + P+ W E E+ +L +P + + LE+ K E EL
Sbjct: 122 ERSQWKPYLDVIPS-------TYSCPVYW-ELEIVHLLPAPLRQKALEQ----KTEVQEL 169
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA---LVPLG 288
T + Q D + +T++ + A+ V + V+++ R A + L
Sbjct: 170 HTESLAFFNSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDRLLAQQDVCALA 229
Query: 289 P--PLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
P LL +S + + D ++ + + + + GP N +LL+ YGFV
Sbjct: 230 PYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEYGFVAA 289
Query: 345 DNPYDRLVV 353
+NP+ + V
Sbjct: 290 NNPHRSVYV 298
>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
Length = 500
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 117/251 (46%), Gaps = 25/251 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC---LALYLMYEKK 169
+ A +D++A + ++P L++T E ++ L+ +++ ++ LAL+++ EK
Sbjct: 121 LKAVKDIKAEELFITIPRKLMLTTETA-RESSLGPLIKKDRILQVMANVSLALHVLCEK- 178
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFW PYI +PL + E E+ +L GS +++L + + I R+Y
Sbjct: 179 YSSNSFWAPYINIFPG-------TYTTPLYFEEGEMLHLQGSLNFSDVLNQYKSIARQY- 230
Query: 230 ELDTVWFMAGSLFQQYP--YDIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF--AL 284
+F LFQ P +P E FTF+ ++ A V + + S R AL
Sbjct: 231 ----AYFY--KLFQTQPEAAGLPLKECFTFDEYRWAVSTVMTRQNQV-PTSDGRHLITAL 283
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
+P+ + + D+ + + R + + ++ G + N++ LI+ GFV
Sbjct: 284 IPMWDMCNHSNGEVSTEFNLGSDSAECLAMREFPTDSQVYIFYGMRSNAEFLIHNGFVYP 343
Query: 345 DNPYDRLVVEV 355
+N +DR+ V++
Sbjct: 344 ENVHDRVNVKL 354
>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 789
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 74/269 (27%), Positives = 121/269 (44%), Gaps = 59/269 (21%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLA 161
+R +H A + ++ G+ +P ++TLE L E I +L+ + + L+ L+
Sbjct: 373 YRGVH---ARQKIKKGECILFIPVDNMITLE--LSKELPICQLIESKNIRLLSPKHTFLS 427
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES---PLLWSETELAYLTGSPTKAEIL 218
+Y++ EKK KSFW P++ L VE P+L+++ EL +L GSP ++
Sbjct: 428 IYIIIEKK-NHKSFWKPFL---------DILPVEYTTFPILYTDEELFWLKGSPFLNQVK 477
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA--FTFEIFKQAFVAVQSCVVHLQKV 276
ER E I ++Y Q IP A T + F A + S + L +
Sbjct: 478 ERRECITQDY--------------QAIVSKIPEFAKLCTLDEFAWARMMAASRIYGL-FI 522
Query: 277 SLARRFALVPLGPPLL----AYSS------KCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
+ R A VPL AY++ K ML A +D + G+ I
Sbjct: 523 NKKRTDAFVPLADMFNHRRPAYTNWGFCEDKGGFMLKASEDI---------RRGDQIYYS 573
Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
CG + NS+ L+NYGFV ++N + + + V
Sbjct: 574 CGRKCNSRFLLNYGFVVKNNEANEIQLRV 602
>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
Length = 543
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 67/249 (26%), Positives = 105/249 (42%), Gaps = 38/249 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLM------- 165
+ A+E ++ G+ +P ++T+E L + E +L E + LA +L
Sbjct: 118 LVATESIKRGEKVLEIPQEAIITVEVALKESLLREKKKLAELQEWSILATFLAETAQNLS 177
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGI 224
E K + Y++ L R G S L W E+++ L GSP+ LER +
Sbjct: 178 TEDNSSNKYRFATYVKALPRSTG-------SVLEWPESDVRTLLAGSPSLFSALERRASV 230
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
E+ + Q+ +DI +F S ++ L+ SL AL
Sbjct: 231 AAAIAEIRVNFPELNEKTLQWAFDI--------LF--------SRLIRLE--SLGGNLAL 272
Query: 285 VPLGPPLLAYSSKCKAM--LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
VP +L + C+A L V L DR Y+ GE + G +P+S+LLI+YGF
Sbjct: 273 VPWAD-MLNHQPGCEAFIDLDRGSRKVCLTTDRSYEPGEQVWASYGQRPSSELLISYGFA 331
Query: 343 DE--DNPYD 349
DNP D
Sbjct: 332 PAVGDNPDD 340
>gi|71895277|ref|NP_001025965.1| SET domain-containing protein 4 [Gallus gallus]
gi|53134599|emb|CAG32346.1| hypothetical protein RCJMB04_23h14 [Gallus gallus]
Length = 439
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 71/284 (25%), Positives = 119/284 (41%), Gaps = 30/284 (10%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK K LK++ + RP + + ++ LQAG+ S+P +VT V
Sbjct: 28 HKLEYIKLKKWLKDRGFGDSSLRPAQFWGTGRGLMTTKALQAGELVISLPEKCLVTTTTV 87
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
L N + E + K +S L L +L+ EK G++S W PY+ L + P
Sbjct: 88 L-NSCLGEYIMKWKPPVSPLIALCPFLIAEKHAGERSLWKPYLDVLPK-------TYSCP 139
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
+ E ++ L P + + E+ + Y + SLF + I F +
Sbjct: 140 VC-LEQDVVQLLPEPLRKQAQEQRTAVHELYMSSKAFFSSLQSLFAENTATI----FNYS 194
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PLLAYS--SKCKAMLAAVDDAV 309
+ A+ + + +++ K S F+L P L P LL +S + KA
Sbjct: 195 ALEWAWCTINTRTIYM-KHSQRECFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQSRNY 253
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
++ + K E + + GP N +LL+ YGFV DNP+ + V
Sbjct: 254 EIQTNSQCKKYEEVFICYGPHDNQRLLLEYGFVAVDNPHSSVYV 297
>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 442
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 61/249 (24%), Positives = 117/249 (46%), Gaps = 20/249 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A+ DLQ G+ S+P+S ++T E VL + + T + +S L L +L+ E+
Sbjct: 62 LMATRDLQPGELIISLPDSCLITTETVLQSYLGKYIRTWSPPVSPLLALCTFLIAERVAR 121
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
++S W PY+ L + P+ W E+E+ L +P + + LE+ +K + E
Sbjct: 122 ERSPWKPYLDVLPS-------SYSCPVYW-ESEIISLLPAPLRQKALEQQTEVKELHTE- 172
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLG 288
W SL + +I T+ +T+ + A+ V + V H ++ L+ + + +
Sbjct: 173 --SWSFFVSLQPLFGGNI-TDIYTYGALRWAWCTVNTRTVYMKHPRRHGLSAQQDVYAMA 229
Query: 289 PPL-LAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
P L L S + AA ++ ++ + + + + GP N +LL+ YGF+
Sbjct: 230 PYLDLLNHSPAVQVEAAFNEERRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEYGFIAA 289
Query: 345 DNPYDRLVV 353
+NP+ + V
Sbjct: 290 NNPHRSVYV 298
>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
Length = 489
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 72/293 (24%), Positives = 121/293 (41%), Gaps = 59/293 (20%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
SW+H++G V + E + + A++D G +VP ++++ E+
Sbjct: 88 SWLHEHGAEFEGVEISEFDGYG------FGLKATKDFSEGSLILTVPGKVMMS-EKDPKA 140
Query: 143 ETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
++E + + L + LAL+L+ EK SFW PYI L + + L
Sbjct: 141 SDLSEFINIDPLLQNMPNVTLALFLLLEK-NNPNSFWKPYIDVLPEK-------YSTVLY 192
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREY----NELDTVWFMAGSLFQQYPYDIPT---- 251
++ ELA L SP L+ I R+Y N++ T+ D+P
Sbjct: 193 FNSEELAELRPSPVFESSLKLYRSIVRQYAYFYNKIHTI-------------DLPVLKNL 239
Query: 252 -EAFTFEIFKQAFVAV---QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD 307
+ FTF+ ++ A V Q+ +V +L A +PL C +
Sbjct: 240 QDIFTFDNYRWAVSTVMTRQNNIVQGTAFTLTN--AFIPLW-------DMCNHKHGKITT 290
Query: 308 AVQLVVDR-------PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
L ++R Y+ E I ++ G +PNS L ++ GFV DN YD L +
Sbjct: 291 DFNLELNRGECYALQDYRRDEQIFIFYGARPNSDLFLHNGFVYPDNDYDSLSI 343
>gi|168005531|ref|XP_001755464.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693592|gb|EDQ79944.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1033
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 69/295 (23%), Positives = 116/295 (39%), Gaps = 57/295 (19%)
Query: 70 VVSKKEEDLGD-LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
VV + D D SWM NG S +EK H +A L G
Sbjct: 520 VVHQNGTDTTDQFVSWMEGNGF-----------SISEKLSITHLLAGDGKLVRG------ 562
Query: 129 PNSLVVTLERVLGNETIAEL-----------LTTNKLSELACLALYLMYEKKQGKKSFWL 177
VV L+ + ET+ L + ++ A L+ EK +G S W
Sbjct: 563 ----VVVLKNIRRGETLCNLPLDMGLYDNETIVAGEVDSWDRAAARLLREKAKGSSSAWA 618
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
YI L + + P+L + EL + P E+++ + I+ ++ L +V +
Sbjct: 619 SYINILPQN-------MTVPILLEDHELHEVQWWPVLRELVQVRKSIRESFSLL-SVDDL 670
Query: 238 AGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
AG+ F++Y + + + AFT +F A + ++ ++ + + P+
Sbjct: 671 AGADFEEYRWAAMMVHSRAFTLPVFADDHYAPYVMMPYMDMINHHYHYQADWMSQPIWG- 729
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
V++V R K GE + GP+ N L + YGFV +DNP+D
Sbjct: 730 ------------GKVEIVARRDIKKGEELFASFGPRANDNLFLYYGFVLKDNPFD 772
>gi|396495152|ref|XP_003844476.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
gi|312221056|emb|CBY00997.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
Length = 475
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 73/276 (26%), Positives = 112/276 (40%), Gaps = 54/276 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYLMYEKKQG 171
V A+++++ + F +P S V+++E +L E T + L L L ++YE G
Sbjct: 40 VVATQEIREHEVLFRIPRSAVLSVENSILSTEIPTS--TFDLLGPWLSLILVMLYEHLNG 97
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-------------- 217
S W PY L + + + WSE ELA L S A+I
Sbjct: 98 DASNWAPYFAVLPNE-------FNTLMFWSEHELAELQASAVLAKIGREGANEAFLGQLV 150
Query: 218 -----------------LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
++AE ++ E N T+ GSL Y +DI
Sbjct: 151 PVIKEFAGIFFSGDSRAAQKAEEMRDEKN--ITLMHKMGSLIMAYAFDIEPAT------P 202
Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
+ V + + +L + ++PL L A + +C A L +++ +P KAG
Sbjct: 203 RKDVDEEGFAEEEEDEALPK--GMIPLADMLNADADRCNARLFYEQKYLEMKALKPIKAG 260
Query: 321 ESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVEV 355
E I GP P S LL YG+V E+ YD VVEV
Sbjct: 261 EEIFNDYGPLPRSDLLRRYGYVTENYAQYD--VVEV 294
>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Strongylocentrotus purpuratus]
Length = 682
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 66/296 (22%), Positives = 128/296 (43%), Gaps = 24/296 (8%)
Query: 64 VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
VAG S +E W++ NG+ V + K + + A++D++
Sbjct: 66 VAGEPMQQSDREVHFETFFKWLNTNGVTTDAVKMA-------KFDEGYGLQATQDIKMDQ 118
Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYI 180
++P +++T + + + TI +L+ ++L + LA++++ EK + SFW PY+
Sbjct: 119 ELMNIPRKVMMTDQNAVDSPTIGDLVRGDRLLKGMPNVSLAIFILSEKLK-SDSFWKPYL 177
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L + PL ++ E+ GS E L++ + I R+Y L F +
Sbjct: 178 DVLPS-------SYSLPLYFTPDEIQLFQGSTMYGECLKQHKNIARQYAYL----FKLLN 226
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-QKVSLARRFALVPLGPPLLAYSSKCK 299
L + I E FT++ ++ A V + + K +L+PL + + K
Sbjct: 227 LPENSKLHI-REYFTYDFYRWAVSTVMTRQNQIPAKDGKGMSLSLIPLWDMCNHANGEMK 285
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
D+ + R + GE I + G + ++ LL+ GFV N YD + +++
Sbjct: 286 TDFIEERDSCVNMALRDFSVGEQIFICYGRRSSADLLLYSGFVYPGNVYDGMAIQL 341
>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
heterostrophus C5]
Length = 476
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 71/273 (26%), Positives = 111/273 (40%), Gaps = 48/273 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
VAA +D+ + FS+P S ++++E + + I T L L L ++YE G
Sbjct: 40 VAAKQDIAEHELLFSIPRSSILSVENSILSTEIPPT-TFALLGPWLSLILVMLYEYHNGS 98
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-------------- 218
S W PY L ++ + W+E EL L S +I
Sbjct: 99 ASNWAPYFAVLPTD-------FDTLMFWTEDELTELQASAVVNKIGKEGANEVFIEQLLP 151
Query: 219 -------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFTFEIFKQAF 263
ERA+ + +E + + M GSL Y +D+ A + + +
Sbjct: 152 VIEEFADVIFSGDERAKDLAKEMRAPENLELMHKMGSLIMAYAFDVEP-AISDKEVDEEG 210
Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
A + L K +VPL L A + +C A L D +++ +P +AGE I
Sbjct: 211 FAEEEEDAALPK-------GMVPLADMLNADADRCNARLFYEKDGLEMKALKPIQAGEEI 263
Query: 324 VVWCGPQPNSKLLINYGFVDED-NPYDRLVVEV 355
GP P S LL YG++ E+ YD VVE+
Sbjct: 264 FNDYGPLPRSDLLRRYGYITENYAQYD--VVEI 294
>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
Length = 456
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 109/249 (43%), Gaps = 20/249 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNKLSELACLALYLMYEKKQG 171
+ A+ DL+ G+ ++P + ++T E VL + + L +S L L +L+ E+ G
Sbjct: 62 LMATRDLKPGELIIALPETCLITTETVLQSYLGKYIRLWRPHVSPLLALCTFLIAERFAG 121
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
S W PY+ + P+ W E E+ +L +P + + LE+ K E EL
Sbjct: 122 DCSQWKPYLDVIPS-------TYSCPVYW-ELEIIHLLPAPLRKKALEQ----KTEVQEL 169
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA---LVPLG 288
T S Q D + +T++ + A+ V + V+++ R A + L
Sbjct: 170 HTESLAFFSSLQPLFCDNVADIYTYDALRWAWCTVNTRTVYMKHTQQDRLLAQQDVCALA 229
Query: 289 P--PLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
P LL +S + + D ++ + + + + GP N +LL+ YGFV
Sbjct: 230 PYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAFICYGPHDNQRLLLEYGFVAA 289
Query: 345 DNPYDRLVV 353
+NP+ + V
Sbjct: 290 NNPHRSVYV 298
>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Cyanidioschyzon merolae strain 10D]
Length = 567
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 85/307 (27%), Positives = 118/307 (38%), Gaps = 75/307 (24%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALY------LM 165
A D+QAG+ F VP L T + R + EL + LA L LY
Sbjct: 120 ARRDIQAGEVLFQVPFHLCFTKDVAVRRFAALNVPELADEEEFFALATLLLYERGLDESW 179
Query: 166 YEKKQGKKSFWLPYIRELDR--QRGRGQLAVES----PL----LWSETELAYLTGSPTKA 215
+ +G SFW PY+ L +G ES PL LW+E E+ +L GSPT
Sbjct: 180 KKSGRGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDALWLWAEDEMQWLQGSPTLL 239
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQSCVVHLQ 274
++REY E L++++P+ E AF E F AF + S V L
Sbjct: 240 SARALRSKVEREYAE------ACERLYRRHPHIFDLEGAFRLERFLWAFGVLFSRAVSLP 293
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAA----------------------------VD 306
+ ALVP L +S+ C + + A D
Sbjct: 294 AENGM--LALVPYA-DLANHSAFCVSFIDARTAAFPYAFRASSKQKRGQWWQRFLAPNSD 350
Query: 307 DA------------------VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
DA V DR Y E + V G + N++LL+ YGFV + NPY
Sbjct: 351 DAGAVANTDSSHYREDAQREVVAYADRFYDKFEQVYVSYGQKSNAELLLLYGFVSDRNPY 410
Query: 349 DRLVVEV 355
+ + V V
Sbjct: 411 NSVEVCV 417
>gi|303288796|ref|XP_003063686.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454754|gb|EEH52059.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 538
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 121/297 (40%), Gaps = 39/297 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
++ + D L +W+ + G V ++ P + A+ D+ GDAA V
Sbjct: 73 ARTQADFDALWTWLEREGADVASVSPALVDATPGGRGWG-----LVATRDVGGGDAAIVV 127
Query: 129 PNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
P +L +T E ++ I L LAL L++EK G S W YIR L R
Sbjct: 128 PRALWMTKETAFASK-IGTALDPETTPPWCALALQLLHEKSLGDDSRWAAYIRCLPRVE- 185
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYPY 247
A+++PL WS ELA L G+ A ++ + L +T + +LF
Sbjct: 186 ----ALDAPLFWSSEELAELAGTQLLANAAGYDSYVRGTHAALKETTFKEHPALFGDAGD 241
Query: 248 DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC-------KA 300
D AF+ F AF ++S L V AL+P G + + C
Sbjct: 242 DDGGGAFSEREFLWAFGVLRSRA--LPPVDQGESIALIP-GIDMANHDGLCSQTWQLNNG 298
Query: 301 MLAAV---------DDAVQLVVDRP----YKAGESIVVWCGPQP-NSKLLINYGFVD 343
+AAV +V L V++ K GE I GP +S+ ++YGFVD
Sbjct: 299 GIAAVFGGRGGADGGGSVLLRVEKTKAGGAKRGEEIRCNYGPANIDSQFALDYGFVD 355
>gi|336467028|gb|EGO55192.1| hypothetical protein NEUTE1DRAFT_147775 [Neurospora tetrasperma
FGSC 2508]
gi|350288355|gb|EGZ69591.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 504
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 72/302 (23%), Positives = 123/302 (40%), Gaps = 40/302 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
S +E L W HK+G P +V E + + +P +A+E L +G A S
Sbjct: 3 SPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKP----SATELLGSGFKAVSC 58
Query: 129 PNSLVVTLERVLGNETIAELLTT---------------NKLSELACLALYLMYEKKQGKK 173
P S+ ++ L + I TT N L YL+ + +GK
Sbjct: 59 PTSITLSYLNALTDGPITPSSTTLAPNTENPAFPERFMNSLPPHVIGRFYLIQQYLKGKS 118
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
SFW PYI L + A+ P W+E ++ L G+ I E +K EY +
Sbjct: 119 SFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLKGTNAYVAIQEIQSNVKSEYKQARK 176
Query: 234 VWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV----SLARRFALV 285
+ G + Q Y+ FT F+ + V +S +++++ S F+++
Sbjct: 177 ILKKEGFPDYRDYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVERLLPEGSKIDDFSIL 236
Query: 286 PLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
PL ++ + L + A +L+ + Y G+ + G + NS+LL+ YG
Sbjct: 237 ---QPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYGFKTNSELLLGYG 293
Query: 341 FV 342
F+
Sbjct: 294 FI 295
>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
Length = 476
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 77/303 (25%), Positives = 121/303 (39%), Gaps = 51/303 (16%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ K+G I E + + R V AS+++ + F +P + ++++E +L
Sbjct: 13 AWLRKSGAEISPKIKLEDLRNKDAGRG---VVASQEIAEHELLFRIPRTSILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A L+ L L L ++YE G S W PY L + + + W+
Sbjct: 70 TEIPAATLSL--LGPWLSLILVMLYEYHNGSASNWAPYFAVLPTE-------FNTLMFWT 120
Query: 202 ETELAYLTGSPTKAEIL---------------------------ERAEGIKREYNELDTV 234
E ELA L S +I E+A+ +E +
Sbjct: 121 EDELAELQASAVVGKIGKESADEAFLEQLLPVIEEFADIVFSGDEKAKDKAKEMRSPKNL 180
Query: 235 WFM--AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
M GSL Y +D+ T E+ ++ A + L K +VPL L
Sbjct: 181 ELMHKMGSLIMAYAFDVEPATPTKEVDEEG-FAEEEEDAALPK-------GMVPLADMLN 232
Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
A + +C A L D +++ +P +AGE I GP P S LL YG+V DN V
Sbjct: 233 ADADRCNARLFYEKDCLEMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYVT-DNYAQYDV 291
Query: 353 VEV 355
VE+
Sbjct: 292 VEI 294
>gi|57899520|dbj|BAD87034.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|57899939|dbj|BAD87851.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
Length = 509
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 54/241 (22%), Positives = 92/241 (38%), Gaps = 39/241 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE +Q GD VP + +TL+++ L + + + + LA L+ E+ G +S
Sbjct: 63 ASEPIQEGDCIMQVPYHVQLTLDKL---PQKFNTLLDHAVGDTSKLAALLIMEQHLGNES 119
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI+ L + + + +LW EL + S E +E E K+E+ L
Sbjct: 120 GWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQAKKEFLALKPA 174
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
LF + A + V + QK
Sbjct: 175 LDHFPHLFGEVKLGDFMHASALDFLNHDGVFGSVLIYDEQK------------------- 215
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
D +++ DR Y GE +++ G N+ L +N+GF N YD+ ++
Sbjct: 216 ------------DVCEIIADRNYAVGEQVMIRYGKYSNATLALNFGFTLARNIYDQALIR 263
Query: 355 V 355
+
Sbjct: 264 I 264
>gi|85093434|ref|XP_959692.1| hypothetical protein NCU09581 [Neurospora crassa OR74A]
gi|28921141|gb|EAA30456.1| predicted protein [Neurospora crassa OR74A]
Length = 504
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 120/299 (40%), Gaps = 34/299 (11%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
S +E L W HK+G P +V E + + +P +A+E L +G A S
Sbjct: 3 SPHKERFEALLDWAHKHGASLHPLLEVYEDEVTGFSLRVKP----SATERLGSGFKAVSC 58
Query: 129 PNSLVVTLERVLGNETIAELLTT---------------NKLSELACLALYLMYEKKQGKK 173
P S+ ++ L + I TT N L YL+ + +GK
Sbjct: 59 PTSITLSYLNALTDGPITPSSTTPAPNTKNPAFPERFMNSLPPHVIGRFYLIQQYLKGKS 118
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
SFW PYI L + A+ P W+E ++ L G+ I E +K EY +
Sbjct: 119 SFWAPYISTLADPSQLDKWAL--PPFWAEDDIELLQGTNAYIAIQEIQNNVKSEYKQARK 176
Query: 234 VWFMAG----SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-VSLARRFALVPLG 288
+ G + Q Y+ FT F+ + V +S ++++ + + +
Sbjct: 177 ILKKEGFPDYREYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVERLLPEGTKIDDFSVL 236
Query: 289 PPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
PL ++ + L + A +L+ + Y G+ + G + NS+LL+ YGF+
Sbjct: 237 QPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYGFKTNSELLLGYGFI 295
>gi|302804448|ref|XP_002983976.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
gi|300148328|gb|EFJ14988.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
Length = 266
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/242 (25%), Positives = 103/242 (42%), Gaps = 29/242 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK-- 172
AS ++AG+ + L++ + G + T ++LA + L Y K Q K
Sbjct: 25 ASRPVRAGERVLEISLDLMIAPSDLPGELSTVLSSTVKPWTKLALIVLMERY-KGQAKLQ 83
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PYI L +++ LW +TEL+YL SP + ER E I E+ ++
Sbjct: 84 SSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEMITTEFGQVQ 138
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
+ LF + + E FK + V S + +++ LV + P+L
Sbjct: 139 NALDVWPQLFGK---------VSLEDFKHVYATVFS-----RSLAIGEDSTLVMI--PML 182
Query: 293 AY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ +S K + + + DR Y + I + G N++L ++YGF +NP
Sbjct: 183 DFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDLSNAELALDYGFAVPENP 242
Query: 348 YD 349
YD
Sbjct: 243 YD 244
>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
Length = 514
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/239 (27%), Positives = 100/239 (41%), Gaps = 18/239 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE + GD A +P SL+++ E + +E L N ++ L L+ M E+
Sbjct: 153 ASESIGVGDIALEIPESLIISDELLCQSEVFLSLKDFNNITSETMLLLWSMRERYNLGSK 212
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + LA L G+ EI++ + ++++Y+EL +
Sbjct: 213 F-KPYFDTLPANFNTG-------LSFGIDALAALEGTLLFDEIIQARQHLRQQYDELFPL 264
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L +P + T++ F A S + + S LVP+ L
Sbjct: 265 ------LCTNFPEIFRKDVCTWDDFLWACELWYSNSMMIVLSSGKLSTCLVPVAGLLNHS 318
Query: 295 SSKCKAMLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 349
S VD+A ++ + RP AGE + G P S LL YGF+ DNPYD
Sbjct: 319 VSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGSHLLTFYGFLPRGDNPYD 377
>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
Length = 467
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 67/275 (24%), Positives = 117/275 (42%), Gaps = 49/275 (17%)
Query: 110 IHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLA 161
++ A ED+ A F++P ++ +E + I + +K L + L
Sbjct: 44 VNKTVALEDIPAETTLFTIPRKGIINVETSELPKKIPDAFDLDKPDDDDAPGLDSWSSLI 103
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-R 220
L ++YE QG+ S W PY L + ++P+ WS+ EL L S + +I +
Sbjct: 104 LIMIYEYLQGENSKWKPYFDVLPS-------SFDTPMFWSDNELDQLQASHMRHKIGKAD 156
Query: 221 AEGIKRE------------YN-------ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
AE + ++ +N EL + GS Y +D+ + +
Sbjct: 157 AENMFQKTLLPIIRSNAEIFNAGNKTDAELIEIAHRMGSTIMAYAFDLENDE-------E 209
Query: 262 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 321
V S+ +VP+ +L ++ A + ++++ + RP KAGE
Sbjct: 210 EEEEADGWVEDRDGKSM---MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGE 265
Query: 322 SIVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVEV 355
I+ + GP PNS+LL YG+V E + YD VVE+
Sbjct: 266 EILNYYGPHPNSELLRRYGYVTEKHSRYD--VVEI 298
>gi|348537527|ref|XP_003456245.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Oreochromis niloticus]
Length = 607
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 66/304 (21%), Positives = 132/304 (43%), Gaps = 52/304 (17%)
Query: 74 KEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
+E+ +L SW +NG C+ + + + A+ D++A + +P ++
Sbjct: 75 REDHFPELMSWAKENG-ASCECFTVANFG-----KEGYGLRATRDIKAEELFLWIPRKML 128
Query: 134 VTLERVLGNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
+T+E N + L + +++ + LAL+L+ E+ SFWLPYIR L ++
Sbjct: 129 MTVESA-QNSILGPLYSQDRILQAMGNVTLALHLLCERA-NPASFWLPYIRSLPQE---- 182
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY----------NELDTVWF---- 236
+ PL + + ++ L G+ ++L + + R+Y L +V
Sbjct: 183 ---YDIPLYYQQEDVQLLLGTQAVQDVLSQYKNTARQYAYFYKLVQDKGMLGSVELRLFA 239
Query: 237 -----MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
M G LF Q+ A + + +Q + + +V+L AL+PL
Sbjct: 240 SLTPVMGGKLFDQW-------AVSSVMTRQNQIPTEDG----SRVTL----ALIPLWDMC 284
Query: 292 LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
+ DD + V + YK E I ++ G + N++ +I+ GF +D+ +DR+
Sbjct: 285 NHTNGLITTGYNLEDDRCECVALQDYKENEQIYIFYGTRSNAEFVIHNGFFFQDDAHDRV 344
Query: 352 VVEV 355
+++
Sbjct: 345 KIKL 348
>gi|344277088|ref|XP_003410336.1| PREDICTED: SET domain-containing protein 4 [Loxodonta africana]
Length = 440
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 117/292 (40%), Gaps = 30/292 (10%)
Query: 67 SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
SR V + + +LK W+ +I P + + LQ G
Sbjct: 22 SRGVNESYKSEFIELKKWLKDRKFEDTNLIPARFPGTGRG------LMSKTSLQVGQMII 75
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFWLPYIRELD 184
S+P S +++ + V+ + +T K S L L +L+ EK G +S W PY+ L
Sbjct: 76 SLPESCLLSTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVLEKHAGDQSSWKPYLETLP 134
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
+ P+ W E E+ L P +A+ E+ ++ + + LF +
Sbjct: 135 K-------TYTCPVCW-EPEVVNLLPRPLRAKAQEQRTRVQEFFTSFRDFFSSLQPLFSE 186
Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PLLAYSS-- 296
+I FT+ A+ V + V+L+ L R F+ P L P LL +S
Sbjct: 187 AVENI----FTYSALLWAWCTVNTRAVYLRHRQL-RCFSAEPDTCALAPYLDLLNHSPDV 241
Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
+ KA ++V + E + + GP N +LL+ YGFV NP+
Sbjct: 242 QVKAAFNEKTRCYEIVAVSSCRKHEEVFICYGPHDNHRLLLEYGFVSTRNPH 293
>gi|212542185|ref|XP_002151247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210066154|gb|EEA20247.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 709
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 95/206 (46%), Gaps = 24/206 (11%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEIL 218
A +LM + + FW PYIR L G+ + +PL + E E L +L G + A
Sbjct: 108 AFFLMGQYLLQEHGFWYPYIRSLP-----GKEELTTPLFFREEEGDLEWL-GMTSLAASR 161
Query: 219 ERAEGI-----KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA--VQSCVV 271
ER I +R Y L + F + Y +D+ A T I +AF A + S +
Sbjct: 162 ERRLAIWRGNYERGYTMLKELGFEG---VEGYTWDLYLWASTI-ISSRAFTAKVLASVIP 217
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
L+ + R L+PL + A + K K A D++ LVV AGE + GP
Sbjct: 218 ELKNAEVDRVSVLLPL---IDATNHKPLSKVEWRAGTDSIGLVVMSDVAAGEEVGNNYGP 274
Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEV 355
+ N +L++NYGF DNP + VV +
Sbjct: 275 RNNEQLMMNYGFCIPDNPCEYRVVSL 300
>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
magnipapillata]
Length = 819
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 74/266 (27%), Positives = 112/266 (42%), Gaps = 42/266 (15%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL---LTT-----NKLSELA 158
HR + + A+ED++ G+ F+VP L++ + E L T N S
Sbjct: 127 HR--YGMLATEDIKKGEVLFTVPRQLLLNQNTATLKNRLNEFEKWLDTHGKSLNDSSGWL 184
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY-LTGSPTKAEI 217
L + LM+E Q K SFW Y+ + G PL W E E G P +I
Sbjct: 185 PLLITLMWEFNQ-KDSFWASYLLLVPEISEFGH-----PLFWKEEEYNLEFQGMPLLNDI 238
Query: 218 LERAEGIKREYNELDTVWF-----MAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCV 270
+ E I+ EY E ++ + GSL E ++ E FK+ AFV S
Sbjct: 239 IVDRENIETEYAEFVLLFLRRNKDLFGSL----------ENYSLEFFKRMVAFVMAYSFT 288
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+ S+ VP+ +L + S A L +Q++ R K GE + G
Sbjct: 289 EDEESPSM------VPMAD-ILNHHSNNNAHLVFHKSNLQMISIRRIKKGEEVFNTFGKL 341
Query: 331 PNSKLLINYGFVD-EDNPYDRLVVEV 355
N++LL YG+V+ N YD L++ V
Sbjct: 342 GNTELLQMYGYVEIPSNQYDSLLLPV 367
>gi|159122413|gb|EDP47534.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 492
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 70/277 (25%), Positives = 117/277 (42%), Gaps = 55/277 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G++S W PY + L + + ++ + WS +EL L GS ++I EG +
Sbjct: 105 LGEQSAWAPYFKILPK-------SFDTLMFWSPSELRELQGSAIVSKI--GKEGAE---- 151
Query: 230 ELDTVWFMAGSLFQQYPYDIPT---------EAFTFEIFKQAFVA---VQSCVVHLQKVS 277
D++ M + + P P+ EA + + + A + + + ++KV
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVDGLASWDGEAGSHALLRLAHIMGSLIMAYAFDIEKVE 209
Query: 278 LARRF-------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
+VPL L A + + A L DD++ + +P +
Sbjct: 210 DEDDENNDEEDGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEDDSLVMKAIKPIR 269
Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
GE I G P + LL YG+V DN VVE+
Sbjct: 270 VGEEIFNDYGELPRADLLRRYGYV-TDNYAQYDVVEL 305
>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
Length = 581
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 68/257 (26%), Positives = 117/257 (45%), Gaps = 45/257 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYE 167
V A + + A + +P S ++TLE + ET +A+ + +L L+ L+ +L+ E
Sbjct: 165 VNAKQKINAKELILFIPKSHMITLE--MAKETPVAKKMIQFRLDLLSPKHSFLSTFLLQE 222
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
K + SFW PY+ L Q P+ ++ +L +L GSP +I ++ +K++
Sbjct: 223 KSRPN-SFWKPYLDIL------PQSYPSFPIFFNNYDLEWLQGSPFLKQINDKLSDLKKD 275
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
YN++ V F QY +F F A + S + + + + A VPL
Sbjct: 276 YNDICNV----APEFSQY---------SFYEFCWARMTASSRIFGI-NIKGVKTDAFVPL 321
Query: 288 G-------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 338
P L + YS + + + D+ + DR G+ I G + NS+ L+N
Sbjct: 322 ADMLNHKRPKLTSWCYSEEKQGFIIETDEKI----DR----GQMIFDSYGRKCNSRFLLN 373
Query: 339 YGFVDEDNPYDRLVVEV 355
YGFV +DN + + V V
Sbjct: 374 YGFVVDDNDANEVNVTV 390
>gi|70984218|ref|XP_747626.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66845253|gb|EAL85588.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 492
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 70/277 (25%), Positives = 117/277 (42%), Gaps = 55/277 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G++S W PY + L + + ++ + WS +EL L GS ++I EG +
Sbjct: 105 LGEQSAWAPYFKILPK-------SFDTLMFWSPSELRELQGSAIVSKI--GKEGAE---- 151
Query: 230 ELDTVWFMAGSLFQQYPYDIPT---------EAFTFEIFKQAFVA---VQSCVVHLQKVS 277
D++ M + + P P+ EA + + + A + + + ++KV
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVDGLASWDGEAGSHALLRLAHIMGSLIMAYAFDIEKVE 209
Query: 278 LARRF-------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
+VPL L A + + A L DD++ + +P +
Sbjct: 210 DEDDENNDEEDGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEDDSLVMKAIKPIR 269
Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
GE I G P + LL YG+V DN VVE+
Sbjct: 270 VGEEIFNDYGELPRADLLRRYGYV-TDNYAQYDVVEL 305
>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
Length = 445
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 70/287 (24%), Positives = 119/287 (41%), Gaps = 41/287 (14%)
Query: 68 REVVSKKEEDLGDLKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
R E+ L L WM + G L P I+++ V A D+ G+
Sbjct: 2 RTSAESSEQKLSSLLRWMEQGGALFPKMHIVRQADGERS-------VLARTDIAEGEVVL 54
Query: 127 SVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
+P + + TLER +G ++L N + LA +L+ EK +G SFW P++
Sbjct: 55 QIPTTHLFTLERAKASDIGRRIQSQLQPDN---DFLYLASWLLEEKHRGADSFWKPFVDS 111
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLF 242
L + PL +SE E A + GS LER ++R+ E + +
Sbjct: 112 LP------EAYPHVPLFYSEQERARMKGSQ-----LERLVEVQRQSFEQE---------Y 151
Query: 243 QQYPYDIPT-EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
Q +P E F FE + A +++ S + L+ + +LVPL + + +
Sbjct: 152 AQLREKLPEYERFGFEEYVWARISLYSRLFSLK--GGLQGPSLVPLS-DMFNHRQPPDVL 208
Query: 302 LAAVDDA--VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
+ +D +++ R AG I G + + L++ GFV + N
Sbjct: 209 WSTSEDGQTFRMIAQRAVPAGTEIHTHYGAKSSDVFLLHSGFVPDGN 255
>gi|356564844|ref|XP_003550657.1| PREDICTED: uncharacterized protein LOC100778605 [Glycine max]
Length = 549
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 110/247 (44%), Gaps = 22/247 (8%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A +DL+ GD A +P S++++ E V + L + +S L L+ M EK
Sbjct: 178 ARKDLKVGDIALEIPVSIIISEELVHETDMYGVLKEIDGISSETILLLWSMKEKYNCDSK 237
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F + Y L + G L +S + L G+ EI++ + + +Y+EL
Sbjct: 238 FKI-YFDTLPEKFNTG-------LSFSIQAITMLDGTLLLEEIMQARQHLHAQYDEL--- 286
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
F A L +P P E +T+E F A S + + R L+PL L
Sbjct: 287 -FPA--LCNNFPDIFPPELYTWEKFLWACELWYSNSMKIMYSDGKLRTCLIPLAGFL--N 341
Query: 295 SSKCKAML--AAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPY 348
S C ++ VD ++++ + RP ++GE + G +S L+ YGF+ + DN Y
Sbjct: 342 HSLCPHVMHYGKVDPATNSLKFCLSRPCRSGEECCLSYGNFSSSHLITFYGFLPQGDNSY 401
Query: 349 DRLVVEV 355
D + +++
Sbjct: 402 DVIPLDI 408
>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
Length = 604
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 54/218 (24%), Positives = 107/218 (49%), Gaps = 34/218 (15%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
T++ + L+L+L+ EK +GK SFW PYIR L + +P+ ++E+EL L+
Sbjct: 230 TSRFTCAQVLSLFLLLEKNKGKDSFWYPYIRSLPN-------SFTTPVYFTESELNALSP 282
Query: 211 SPTKAEILERAEGIKRE----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
S + E+A +K+E +N+L+ F+ L + FTF+ F+ A+ +
Sbjct: 283 S-----LQEKARDLKKELLHAFNDLEP--FVTSCLPEL------DSTFTFDAFRWAWSVL 329
Query: 267 QSCVVHLQKVS---LARR----FALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPY 317
++ ++ + L+ + LVP+ L+ +S KA ++ V PY
Sbjct: 330 KTRTLYQEDCRSPYLSNKEPQTSTLVPM-LDLINHSPSAKARFGYNVNTSCYEVRVLEPY 388
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ + + + G + N++L++ +GF +NP D + + +
Sbjct: 389 RKYDQVFISYGFEENTELMLKFGFFVPENPKDFMKINL 426
>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 499
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 78/191 (40%), Gaps = 12/191 (6%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E +G+ SFW PYI L + + P W E ++ +L G+ I E
Sbjct: 111 FFLIKEYLKGENSFWWPYIATLPQPEQVNSWTL--PAFWPEDDIQFLEGTNAHVAIGEIQ 168
Query: 222 EGIKREYNELDTVW----FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-V 276
IKREY + V F + Q Y FT F+ + + QS ++ +
Sbjct: 169 ANIKREYKQARKVLKEENFPNWKEYSQMLYKWAFSIFTSRSFRPSLILSQSVKDYVSTLL 228
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQP 331
AR + PL ++ D + QL+ Y+ G+ + G +
Sbjct: 229 PSAREIDDFSILQPLFDIANHSMTATYTWDTTSDPNCCQLICQDSYRPGDQVFNNYGFKT 288
Query: 332 NSKLLINYGFV 342
NS+LL+ YGF+
Sbjct: 289 NSELLLAYGFI 299
>gi|384251065|gb|EIE24543.1| hypothetical protein COCSUDRAFT_40909 [Coccomyxa subellipsoidea
C-169]
Length = 685
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 71/157 (45%), Gaps = 16/157 (10%)
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK 260
+E E++ L G+P +E + I+ +Y V +L YP DI + T + F
Sbjct: 65 TEEEVSMLEGTPAHTTFVEARQHIREQYRAAQPVL---QALTAAYPDDITPDLVTEDKFI 121
Query: 261 QAFVAVQSCVVHLQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAAVDDAVQLV 312
A S + ++ V A R LVP+ P ++ Y L A D+++L
Sbjct: 122 WACELWYSYAIEVEYVDGAVRQTLVPIAHLLNHSPWPHIVRYGR-----LDAATDSLRLR 176
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
R AGE + GP PN KLL+ YGF DNP+D
Sbjct: 177 AFRHCAAGEQCFLSYGPLPNLKLLLFYGFALPDNPHD 213
>gi|429861365|gb|ELA36056.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 471
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 60/263 (22%), Positives = 101/263 (38%), Gaps = 50/263 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A++D+ A F++P ++ +E GN+ E + L L L
Sbjct: 42 IIATKDIPAETTLFTIPRRSIINVETSELPKKIPQVFTGNDGDDEDMENEPLDSWGSLIL 101
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERA 221
++YE QG S W PY L + + + W ++L L GS ++I E A
Sbjct: 102 VMIYEFLQGAASPWKPYFEVLPEK-------FHTLMFWESSDLENLKGSAVLSKIGKEEA 154
Query: 222 EGIKREY----------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ + R EL + GS+ Y +D+ E +
Sbjct: 155 DEMFRSRILTVIAANPAIFYPEGSSPLGEAELLQLAHRMGSIIMAYAFDLDNEEEPEQEE 214
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
++ + L V +A +L ++ A + DD + + RP A
Sbjct: 215 DDEWIEDRDGKTMLGMVPMAD----------ILNADAEFNAHVNHGDDELTVTALRPIPA 264
Query: 320 GESIVVWCGPQPNSKLLINYGFV 342
GE I+ + GP PNS+LL YG+V
Sbjct: 265 GEEILNYYGPHPNSELLRRYGYV 287
>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
Length = 449
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 71/287 (24%), Positives = 121/287 (42%), Gaps = 39/287 (13%)
Query: 70 VVSKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFS 127
S + L +L W+ + G P +++ +E V A + AG+
Sbjct: 10 AASSSNQKLSNLLRWLEEGGARFPKLQLVRREDGERA--------VLAQAPISAGETVLQ 61
Query: 128 VPNSLVVTLERVLGNE---TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELD 184
VP + ++TLE ++ IAE L + +E LA +L+ EK + + SFW PYI L
Sbjct: 62 VPRTHMLTLELARESDIGRAIAEGLDPD--NEDLYLASFLLQEKHR-EGSFWKPYIDSLP 118
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
+ PL + E A L G + +A+ ++ +Y SL Q
Sbjct: 119 ESYS------QMPLFYGSEEHALLKGCFALTLLTHQAQSLREDYL----------SLCQN 162
Query: 245 YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA 304
P E FT F A ++V S + L+K + LVP+ +L + +
Sbjct: 163 VP---GYERFTPGEFVWARLSVSSRLFSLKKGGFLGQ-TLVPMAD-MLNHRRPPDVLWET 217
Query: 305 VDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+D V+ + AG+ + G + N +L+++GFV +DN +D
Sbjct: 218 TEDGESFVMKANNAVAAGDEVHDSYGAKSNDLMLLHFGFVTDDNEHD 264
>gi|297735395|emb|CBI17835.3| unnamed protein product [Vitis vinifera]
Length = 583
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 105/250 (42%), Gaps = 28/250 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+EDL+ GD A +P S+V++ E V ++ L + +S L L+ M EK
Sbjct: 211 ATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKHNSNSK 270
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F Y L A + L + + L G+ EI+E + + +Y EL
Sbjct: 271 F-NTYFNALPE-------AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEEL--- 319
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
+L + +P P E +T E F A S + + R L+P+
Sbjct: 320 ---VPALCKDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHS 376
Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
P ++ Y + + ++++ V +P GE + G +S L+ YGF+ + D
Sbjct: 377 LYPHIMHYGK-----VDSKTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIPQGD 431
Query: 346 NPYDRLVVEV 355
N YD + +E+
Sbjct: 432 NLYDTIPLEI 441
>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
Length = 459
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 69/255 (27%), Positives = 106/255 (41%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT----LERVLGNETIAELLTTNKLSELACLALYLMYEK 168
+ A + ++ + SVP ++++ L NE I E + +S A++LMY
Sbjct: 65 IVAKQSIKKNEKLISVPKLIIMSNMGGFSHHLPNE-IYEPSISIGISPTNLQAIFLMY-C 122
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
K KSFW PY+ L ++ + + +SE EL L S K + R +GI+R Y
Sbjct: 123 KLNDKSFWYPYVSVLPKE-------FTTSIYFSEEELDELQSSKLKEFTIIRKDGIERHY 175
Query: 229 NELDTVWFMAGSLFQQYPYDIPT---EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
N T G + + P T + +T E+F A V S L +V
Sbjct: 176 NSTFTRLSNRG-IAEFSPTSTQTLQQKGYTLELFTWALSCVWSRAFSLSD----SDGGMV 230
Query: 286 PLGPPLLAYS-SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG---PQPNSKLLINYGF 341
PL A SK K D + + GE I G P +S++L++YGF
Sbjct: 231 PLADMFNAEEISKSKVQPKVTDSTLDYYASDDIEIGEQIFTPYGVYKPLSSSQMLMDYGF 290
Query: 342 V-DEDNPYDRLVVEV 355
V D P D + + V
Sbjct: 291 VFDHGTPSDNVAISV 305
>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
Length = 1000
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 65/272 (23%), Positives = 112/272 (41%), Gaps = 49/272 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLALYL 164
+ A +D+ A F++P ++ E + I ++ +K L + L L +
Sbjct: 578 IIALKDIPAETTLFTIPRKGIINTETSELPKKIPDVFDLDKPDEDDVPGLDSWSSLILIM 637
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 223
+YE QG S W Y L + ++P+ WSE EL L S + +I + AE
Sbjct: 638 IYEYLQGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAED 690
Query: 224 I-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
+ R EL + GS Y +D+ + E ++
Sbjct: 691 MFKKTLVPIIRSNPSIFNAENRSDYELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 750
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ + +VP+ +L ++ A + ++++ + RP KAGE I+
Sbjct: 751 DREGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIL 799
Query: 325 VWCGPQPNSKLLINYGFVDEDNP-YDRLVVEV 355
+ GP PNS+LL YG+V E + YD VVE+
Sbjct: 800 NYYGPHPNSELLRRYGYVTEKHSRYD--VVEI 829
>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 463
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 65/292 (22%), Positives = 125/292 (42%), Gaps = 30/292 (10%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
+ ++ + L W KNG IL H ++ + + A++++ GD +VP +L
Sbjct: 81 RNDQSIEKLTKWATKNG-----AILNGVEIHQFENYA-YGMKANKNITVGDKLVTVPRAL 134
Query: 133 VVTLERV----LGNETIAELLTTNKLSELACLALYLMYEK-KQGKKSFWLPYIRELDRQR 187
++T E + L +++ N + LA++++ E ++ KKSFW Y+ L
Sbjct: 135 MMTEENIPSSPLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTL---- 188
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY 247
+ +P+ + +L L GSP L+ I R+Y ++ ++ P
Sbjct: 189 ---PVTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQYAYFKKLFQLSND-----PA 240
Query: 248 D-IPTEAFTFEIFKQA---FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 303
I + FT+E ++ A ++ Q+ V S AL+PL S +
Sbjct: 241 SVILKDTFTYEYYRWAVSTLMSRQNTVPSSDNPS-ENVSALIPLWDMFNHRSGRLSTDFV 299
Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ D Y A E + ++ G + N+ L++ GFV DN +D + + +
Sbjct: 300 KSSNVCVCYADGDYAADEQVYIFYGVRTNADFLVHNGFVYPDNEHDAVKIRL 351
>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 275
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 65/258 (25%), Positives = 109/258 (42%), Gaps = 34/258 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS----ELACLALYLMYEK 168
V A++D+ G+ VP+ V+ E +E + + TN E L L LM EK
Sbjct: 32 VVATKDISCGEVVVHVPDESVLMPENCSCSEALEDAGLTNASGDAEMESIGLILALMTEK 91
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR-- 226
K GK S W Y+ L + PL W +L L G+ ++E+ G K
Sbjct: 92 KLGKSSKWKGYLDFLPKS------IPGMPLFWDSEQLQSLEGT----SLIEKMNGCKAMP 141
Query: 227 --------EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
++N + + F+ + + P++ + + ++ A V+ S + +
Sbjct: 142 DRPLEPPCKFNSV-VLPFLQSNAHLKLPHNAASTRRLY-VWATAMVSAYSFTIGEDRFQ- 198
Query: 279 ARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 336
A+VP+ L + L A A++++ GE ++ G PNS+LL
Sbjct: 199 ----AMVPMWDALNHITGHANVRLHHCARKGALRMIATCLITKGEQVINSYGDLPNSELL 254
Query: 337 INYGFVDED-NPYDRLVV 353
YGFV+ D NP+D L V
Sbjct: 255 RRYGFVETDPNPHDCLEV 272
>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
Length = 466
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 47/201 (23%), Positives = 96/201 (47%), Gaps = 19/201 (9%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 28 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
+ + R+Y +F + Q +P+ +P E+FT+E ++ A +V + +
Sbjct: 80 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPT 132
Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
+R AL+PL + DD + V + ++AG+ I ++ G + N++
Sbjct: 133 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 192
Query: 335 LLINYGFVDEDNPYDRLVVEV 355
+I+ GF ++N +DR+ +++
Sbjct: 193 FVIHSGFFFDNNSHDRVKIKL 213
>gi|302816067|ref|XP_002989713.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
gi|300142490|gb|EFJ09190.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
Length = 400
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 57/238 (23%), Positives = 105/238 (44%), Gaps = 24/238 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++AG+ +P+ LV+T E++ ++ + +LL+T + L L ++ E+ +G+ S
Sbjct: 14 AARSIRAGEQIVRIPHELVLTAEKL--DDCVKKLLSTEY--DWCPLTLLILAEQHKGEAS 69
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PY+ L S + W + EL +L + ER E I EYN + V
Sbjct: 70 RWAPYVSCLPSFGDH-----HSTIFWGKEELKFLECTRAFRGTAERREMISDEYNSVKDV 124
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
P+ + F+ F A+ V V +L+ ++ P +
Sbjct: 125 -------ISSCPHVFGEDISLFQ-FAHAYATV---VSRAWNGALSSEISMRPF-VDFCNH 172
Query: 295 SSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
A ++ DA ++ +R Y GE + + G + N+ L ++YGFV +N D+
Sbjct: 173 DPVSHATVSHDTCKDAT-IIAERDYTKGEEVFISYGKRSNAVLAVDYGFVLPNNLSDQ 229
>gi|384483765|gb|EIE75945.1| hypothetical protein RO3G_00649 [Rhizopus delemar RA 99-880]
Length = 376
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 116/251 (46%), Gaps = 38/251 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ED++AG+ SVP + ++T NE++ +L T+ LS LAL+L+ + K
Sbjct: 1 MMATEDIEAGEVIVSVPRNFLIT------NESLTKLYGTHSLSPHQLLALHLVLLTRD-K 53
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S+W PY L + S LL ++L S K E +++ + I +Y
Sbjct: 54 QSWWKPYTDLLPMHFNTMPVNYPSELL------SHLPNS-LKQETMQQKDNIHTDY---- 102
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
V + +Q P DI T E FK A++ V + +H+ + + L P L
Sbjct: 103 -VTCLKFCKSKQLPQDI-----TAEEFKWAWLCVNTRCIHMTVPDYLAKGENIALAPMLD 156
Query: 292 -LAYSSKCKAMLAAVDDAVQLVVDR-------PYKAGESIVVWCGPQPNSKLLINYGFVD 343
L ++++ K ++ + R YK GE + + GP N +L YGFV
Sbjct: 157 FLNHTTEAK-----IESGFNIRTQRFEIKTLTAYKKGEQVYINYGPHDNLAMLKEYGFVL 211
Query: 344 EDNPYDRLVVE 354
+N Y+ ++++
Sbjct: 212 NENIYNFVLLD 222
>gi|225446052|ref|XP_002268920.1| PREDICTED: uncharacterized protein LOC100256524 [Vitis vinifera]
Length = 566
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 105/250 (42%), Gaps = 28/250 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+EDL+ GD A +P S+V++ E V ++ L + +S L L+ M EK
Sbjct: 194 ATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKHNSNSK 253
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F Y L A + L + + L G+ EI+E + + +Y EL
Sbjct: 254 F-NTYFNALPE-------AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEEL--- 302
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
+L + +P P E +T E F A S + + R L+P+
Sbjct: 303 ---VPALCKDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHS 359
Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
P ++ Y + + ++++ V +P GE + G +S L+ YGF+ + D
Sbjct: 360 LYPHIMHYGK-----VDSKTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIPQGD 414
Query: 346 NPYDRLVVEV 355
N YD + +E+
Sbjct: 415 NLYDTIPLEI 424
>gi|303275964|ref|XP_003057276.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226461628|gb|EEH58921.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 308
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 107/244 (43%), Gaps = 35/244 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK---K 169
+ A ED++ G+ +P++ ++T+ER + + +L E + LA +L + +
Sbjct: 25 LVAREDVKRGEPLLEIPDASLITVERAVKESKLGP--KHAELQEWSLLAAFLAEQALDIE 82
Query: 170 QGKKS-FWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKRE 227
G +S + Y++ L R+ G L W E ++ L GSP++ ER +
Sbjct: 83 NGDESGVFAAYVKALPRRTG-------GVLDWPEEDVKTLLAGSPSQRAAYERQASVDGA 135
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
E+ ++P P + AF + S ++ L + ALVP
Sbjct: 136 IEEIRA----------EFPQLTPG------ALRWAFDVLFSRLIRLP--NRGGELALVPW 177
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE--D 345
+L + C A + V L DR YK GE + G +P+++LLI+YGF E +
Sbjct: 178 AD-MLNHKPGCNAYIDDSGGKVCLQPDRAYKPGEQVFASYGQRPSAELLISYGFAPEVGE 236
Query: 346 NPYD 349
NP D
Sbjct: 237 NPDD 240
>gi|297836754|ref|XP_002886259.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
lyrata]
gi|297332099|gb|EFH62518.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
lyrata]
Length = 541
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 100/244 (40%), Gaps = 29/244 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L + ++ + L+ M EK
Sbjct: 173 ASEDLKFGDVALEIPISSIISEEYVFNSDMYPILEKIDGITSETMVLLWTMREKHNLDSK 232
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G + + + L G+ EI++ E ++ Y+EL
Sbjct: 233 F-KPYFDSLQENFCTG-------MSFGVNAIMELDGTLLLDEIMQAKELLRERYDEL--- 281
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
L + + P E +T+E + A S + ++ + L+P+
Sbjct: 282 ----IPLLSNHRHVFPPEHYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 337
Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
P ++ Y C +++ V RP GE + G +S LL YGF+ + D
Sbjct: 338 IYPHIVKYGKVCVET-----SSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGD 392
Query: 346 NPYD 349
NPYD
Sbjct: 393 NPYD 396
>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
Length = 287
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 63/287 (21%), Positives = 118/287 (41%), Gaps = 31/287 (10%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
EE W+H+NG C+ + + R + A++ L+ + +P L++
Sbjct: 18 EESFVRFFQWLHRNG---CRNVPLKPAVFPGTGRGM---MATKALKHEELMLVIPQRLLI 71
Query: 135 TLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
T++ ++ + + + +L+ LA++LM EK + +KSFW PYI L +
Sbjct: 72 TMDAIMDSYIAPYIERADPRLTPTQALAVFLMCEKYRREKSFWRPYIDILPEE------- 124
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
P ++E + L S + + + +EY EL + M LF +A
Sbjct: 125 YSCPTFFTEDDFRLLPNS-LRGKAKAKKYECHKEYKELAPFFKMLADLFPD-----QEDA 178
Query: 254 FTFEIFKQAFVAVQSCVV----------HLQKVSLARRFALVPLGPPL-LAYSSKCKAML 302
F F+ FK A+ A+++ + HL+ + PL + A +K +
Sbjct: 179 FNFKDFKWAWSAIKTRALDVPIGRESCRHLRDAEDTPTPTMFPLVDSINHAAQAKIRHRY 238
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
++ + Y+ ++ G N LL+ +GFV NP D
Sbjct: 239 NEKSRCLESRTETVYRRHAEVMNSYGRADNDNLLLEFGFVVPGNPED 285
>gi|307109196|gb|EFN57434.1| hypothetical protein CHLNCDRAFT_142903 [Chlorella variabilis]
Length = 1233
Score = 57.8 bits (138), Expect = 8e-06, Method: Composition-based stats.
Identities = 67/255 (26%), Positives = 106/255 (41%), Gaps = 47/255 (18%)
Query: 81 LKSW-MHKNGLPPCKV---ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
L W + + LPP V +L +P H A+ D+ G+ VP SL +T
Sbjct: 842 LVRWCIEQRQLPPLTVEPAVLDGEPGVQR-----HGFVAARDVGQGEVLLQVPGSLAITA 896
Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
V G + E+L + SEL LAL+LM E+ + A +
Sbjct: 897 VDV-GKDAQLEVLARGR-SELVGLALWLMQERAK----------------------ATLT 932
Query: 197 PLLWSETE-LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
P+LW + E L GSP E R + +++E+ ++ + + YP + E
Sbjct: 933 PILWPDEERQQLLRGSPVLEEARTREQALRQEWQDIAAIAAQTSGGPEAYPAVVYNE--- 989
Query: 256 FEIFKQAFVAVQSCVV-HLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLV 312
QAF+ S V+ H + A+ FAL+PL L S A+L +AV +V
Sbjct: 990 -----QAFLEAMSVVLAHAAYLPKAQCFALLPLVGGLCRTGSSSGALLDYDLEREAVTVV 1044
Query: 313 VDRPYKAGESIVVWC 327
R G+ + ++C
Sbjct: 1045 AQR--TPGQEVALYC 1057
>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
Length = 588
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 70/256 (27%), Positives = 111/256 (43%), Gaps = 35/256 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVL---GNETIAELLTTNKLSELACLALYLMYEKK 169
AA+ + AGD A ++P + T+ L G A + L E AL+L+ E+
Sbjct: 188 AAATTHIPAGDIAAAIPVERLFTVRHALEMPGPRGDAYRMFA-ALGEDTIAALWLIAERA 246
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVE-----SPLLWS-ETELAYLTGSPTKAEILERAEG 223
G+ S W I L G G+ + +P+ W E A L G+P A+ + +E
Sbjct: 247 LGEASPWHAVIASLPWPEG-GEGSASPCGGCTPVSWPREACDALLGGTPLLADAIAASEK 305
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 282
+ R++ L F A ++ D+ P A+T + F++A A S + +Q
Sbjct: 306 LARQHAAL----FPA---LSEHMADVFPASAYTLDNFRRAHEAWNSYGMTVQASPGEPAA 358
Query: 283 ALVP---------LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
+P L P ++ YS D ++L V R AGE + V G + N+
Sbjct: 359 TCLPPVAMLCNHALWPHVVRYSRL-------RDGTLRLPVARSVHAGEEVFVSYGAKSNA 411
Query: 334 KLLINYGFVDEDNPYD 349
+LL+ YGF NPYD
Sbjct: 412 ELLLFYGFALPGNPYD 427
>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Chroomonas mesostigmatica
CCMP1168]
Length = 464
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 57/246 (23%), Positives = 107/246 (43%), Gaps = 24/246 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A +Q G+ +P +L+ L++ L E +E L+ L+E LA+ + E+ G+KS
Sbjct: 100 AFRKIQQGEKLIEIPENLI--LKKSLK-ENRSEDLSF--LNEYDSLAIKAIQERAIGEKS 154
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W Y L +++ + W +++ +L GS E IK ++ ++
Sbjct: 155 KWKVYYEILPKEKDLNLV-----FRWKISDIVFLRGSKVLNASFYLKEKIKIQFLRIEKT 209
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP----- 289
F L P + F + ++ A + S + LQ + ++ ALVP
Sbjct: 210 IFSKNRLV------YPEKIFNLQSWEWAISLLLSRAIFLQNM---KKIALVPYADFINHN 260
Query: 290 PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
P K + + ++ + + D+ Y + I G + N +LL+ YGF+ E NP+D
Sbjct: 261 PFSTSYINSKKIAFSENNEIVMYADKDYNKFDQIFTTYGQKTNLELLVLYGFIIERNPFD 320
Query: 350 RLVVEV 355
+ + V
Sbjct: 321 SIELRV 326
>gi|367048695|ref|XP_003654727.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
gi|347001990|gb|AEO68391.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
Length = 481
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/199 (27%), Positives = 81/199 (40%), Gaps = 18/199 (9%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E +G+ SFW PYI L + A+ P W E ++AYL G+ I E
Sbjct: 107 FFLIKEYLKGRDSFWAPYIATLPQPEHVSAWAL--PAFWPEEDIAYLAGTNAHVAIAEIQ 164
Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV----AVQSCVVHL 273
+K E+ + L F A + Q Y FT F+ + V A Q L
Sbjct: 165 ANVKSEFKQARKALKAAGFPAWQDYTQMLYKWAFCIFTSRSFRPSLVLSEPAKQQMAELL 224
Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA-----VQLVVDRPYKAGESIVVWCG 328
F+++ PL ++ A D A QLV Y+ GE + G
Sbjct: 225 PPGCQLDDFSILQ---PLFDIANHSMTARYAWDVASDPASCQLVCHDAYQPGEQVYNNYG 281
Query: 329 PQPNSKLLINYGFVDEDNP 347
+ NS+LL+ YGF+ P
Sbjct: 282 LKTNSELLLAYGFILPPTP 300
>gi|145516585|ref|XP_001444181.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411592|emb|CAK76784.1| unnamed protein product [Paramecium tetraurelia]
Length = 658
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 65/143 (45%), Gaps = 26/143 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL---------SELACLALY 163
V+A ++ A ++PN L+++ +VL +E ++++ T+K +E CLALY
Sbjct: 48 VSAKMNIPANKVIIAIPNKLIISHHKVLKSE-LSDMFKTHKQFFDDQITADAEFNCLALY 106
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
+ Y K QG KSFW PY+ +++ + W +L L E +
Sbjct: 107 IFYHKLQGDKSFWYPYLNVVEQH---------TMFEWRNRDLFNLQDQSLIDEFMYIQ-- 155
Query: 224 IKREYNELDTVWFMAGSLFQQYP 246
+E+D W+ L +YP
Sbjct: 156 -----SEMDKSWYKFKGLMNKYP 173
>gi|330822500|ref|XP_003291689.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
gi|325078125|gb|EGC31794.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
Length = 540
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 117/298 (39%), Gaps = 37/298 (12%)
Query: 67 SREVVSKKEED---LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
+++ K+E D + + W+ +G K +K + E + ++ D++ G+
Sbjct: 56 GKQIAVKQETDQQLVSNFMEWLKNSGFDETKSKVKIGRNLAEGSG----LVSTCDIKEGE 111
Query: 124 AAFSVPNSL---VVTLERVLGNETIAELLTTNKLSEL--ACLALYLMYEKKQGKKSFWLP 178
+P L ++T + G LL N + + LALYL+ E S P
Sbjct: 112 EFLEIPEKLFIDIMTALKSFGQSGYDILLRDNLIRRVPNLVLALYLIKESTNPDSSI-AP 170
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
Y++ L + + W + L GSP + G R+Y +F
Sbjct: 171 YLKVLPK-------TYSTIGYWGIEDFKQLEGSPVFQTAVNYTRGSMRQY-----CYFY- 217
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP--PLLAYSS 296
LF P + T FT+E F A VQS V + AL+P ++
Sbjct: 218 -QLFDNNPGILQTSNFTYEAFIWAVATVQS---RQNPVGGGQEMALIPFWDFCNHSSHGG 273
Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF-----VDEDNPYD 349
K + V + + YK GE + ++ GP+PNS+ + GF +++D +D
Sbjct: 274 KITTFIDPVKHVLTCSAAKSYKKGEQVYMYYGPRPNSQFYLFQGFSLKTNLNDDYSFD 331
>gi|145250231|ref|XP_001396629.1| SET domain protein [Aspergillus niger CBS 513.88]
gi|134082145|emb|CAK42259.1| unnamed protein product [Aspergillus niger]
gi|350636112|gb|EHA24472.1| hypothetical protein ASPNIDRAFT_48629 [Aspergillus niger ATCC 1015]
Length = 489
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 77/305 (25%), Positives = 121/305 (39%), Gaps = 54/305 (17%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ G P K+ K + + H V A DL G+ F++P + V++++ N
Sbjct: 22 TWLA--GKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEELFTIPRAHVLSVQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ LL+ N L L + ++YE QG +S W Y R L R ++ + W
Sbjct: 76 SNLKNLLSQNLDDLGPWLSLMVVMIYEYLQGDQSAWASYFRVLPRN-------FDTLMFW 128
Query: 201 SETELAYLTGSP---------TKAEILE------RA--------EGIKREYNELDTVWFM 237
S +EL L GS + ILE RA +G+ + T +
Sbjct: 129 SASELEELQGSAIVEKIGKQGAEESILETIAPIVRANPALFPPIDGVASYDGDAGTQALL 188
Query: 238 -----AGSLFQQYPYDI--PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
GSL Y +DI P + + ++ + +VPL
Sbjct: 189 HLAHTMGSLIMAYAFDIEKPEDEEGERDGEDGYLTDEE--------EEQSSKGMVPLADL 240
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
L A + + A L ++ + + +P KAGE I G P S LL YG+V DN
Sbjct: 241 LNADADRNNARLFQEEEVLVMKAIKPIKAGEEIFNDYGEIPRSDLLRRYGYV-TDNYAQY 299
Query: 351 LVVEV 355
VVE+
Sbjct: 300 DVVEL 304
>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
Length = 468
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/201 (22%), Positives = 96/201 (47%), Gaps = 19/201 (9%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 28 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 79
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 80 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 132
Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
+R AL+PL + DD + V + ++AG+ I ++ G + N++
Sbjct: 133 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 192
Query: 335 LLINYGFVDEDNPYDRLVVEV 355
+I+ GF ++N +DR+ +++
Sbjct: 193 FVIHSGFFFDNNSHDRVKIKL 213
>gi|310799999|gb|EFQ34892.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 478
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/254 (22%), Positives = 100/254 (39%), Gaps = 33/254 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A++D+ F++P ++ +E GN+ E + L L L
Sbjct: 44 IVATKDIAPETVLFTIPRKSIINIETSELPKKIPQVFTGNDGDDEDMENEPLDSWGSLIL 103
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
++YE QG S W Y L + ++ + W +L YL GS ++I +
Sbjct: 104 VMIYEYLQGNASPWKTYFEVLPEK-------FDTLMFWESPDLEYLKGSAVLSKIGKDEA 156
Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR- 281
L + AG F Q P+E+ ++ + + + L+ +
Sbjct: 157 DEMFRSRILPVISANAGIFFPQ-GVSPPSESELLQLAHRMGSIIMAYAFDLENEEEPEQE 215
Query: 282 -------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 328
+VP+ +L ++ A + +D + + RP KAGE I+ + G
Sbjct: 216 DEEWVEDREGKTMLGMVPMAD-ILNADAEFNAHVNHGEDDLSVTALRPIKAGEEILNYYG 274
Query: 329 PQPNSKLLINYGFV 342
P PNS+LL YG+V
Sbjct: 275 PHPNSELLRRYGYV 288
>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
Length = 450
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/201 (22%), Positives = 96/201 (47%), Gaps = 19/201 (9%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
LA +L+ E+ SFW PYI+ L + ++PL + E E+ L + ++
Sbjct: 10 ALAFHLLCERA-SPNSFWQPYIQTLPSE-------YDTPLYFEEEEVRCLQSTQAIHDVF 61
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQK 275
+ + R+Y +F + Q +P+ +P ++FT+E ++ A +V + +
Sbjct: 62 SQYKNTARQY-----AYFY--KVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPT 114
Query: 276 VSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
+R AL+PL + DD + V + ++AG+ I ++ G + N++
Sbjct: 115 EDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAE 174
Query: 335 LLINYGFVDEDNPYDRLVVEV 355
+I+ GF ++N +DR+ +++
Sbjct: 175 FVIHSGFFFDNNSHDRVKIKL 195
>gi|451854554|gb|EMD67847.1| hypothetical protein COCSADRAFT_34629 [Cochliobolus sativus ND90Pr]
Length = 476
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 69/272 (25%), Positives = 106/272 (38%), Gaps = 46/272 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +D+ + FS+P S ++ +E + + I T L L L ++YE G
Sbjct: 40 VVAKQDIAEHELLFSIPRSSILGVENSILSTEIPPA-TFAHLGPWLSLILIMLYEYHNGS 98
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-------------- 218
S W PY L ++ + W+E ELA L S +I
Sbjct: 99 ASNWAPYFAVLPTD-------FDTLMFWTEDELAELQASAVVNKIGKEGANEVFIEQLLP 151
Query: 219 -------------ERAEGIKREYNELDTVWFM--AGSLFQQYPYDIPTEAFTFEIFKQAF 263
ERA+ +E + + M GSL Y +D+ A + + +
Sbjct: 152 VIEEFADVIFSGDERAKHKAKEMRAPENLELMHKMGSLIMAYAFDVEP-AISDKEVDEEG 210
Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
A + L K +VPL L A +C A L D +++ +P +AG+ I
Sbjct: 211 FAEEEEDAALPK-------GMVPLADMLNADGDRCNARLFYEKDGLEMKALKPIQAGDEI 263
Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
GP P S LL YG++ DN VVE+
Sbjct: 264 FNDYGPLPRSDLLRRYGYI-TDNYAQYDVVEI 294
>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 489
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 106/247 (42%), Gaps = 37/247 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A + AGD +VP S+ + L V N ++ + L +AL+L++E S
Sbjct: 73 AQRPVNAGDEILNVPESVWINLAAVQ-NSSLGK--ACEGLKPWVAVALFLIHESSN-PSS 128
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-DT 233
W PY+ L + +++SPL WS+ ELA L G+ + E ++ EYN L +
Sbjct: 129 KWRPYLDSLPK-------SLDSPLFWSDEELAELVGTQLLGSVTGYLEFLENEYNNLVEE 181
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL------ 287
V +F Y TF+ FK AF ++S ALVP+
Sbjct: 182 VLEPNNKIFNPAVY-------TFDGFKWAFGILRSRTFSPLT---GEDIALVPIADLVNH 231
Query: 288 ------GPP--LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP-QPNSKLLIN 338
G P + +S+ + D + + + AGE +++ G + N+ L ++
Sbjct: 232 GKGLGDGSPSWVRKGTSQFWNIGKGSSDLLTVRASANFSAGEQVLMQYGATKSNADLALD 291
Query: 339 YGFVDED 345
YGFV+ D
Sbjct: 292 YGFVERD 298
>gi|301094750|ref|XP_002896479.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109454|gb|EEY67506.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 478
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 71/291 (24%), Positives = 121/291 (41%), Gaps = 34/291 (11%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T V+ R +SK++ +L W+ NG K+ L+E + + R +H + + L
Sbjct: 18 TPVSPPRNGMSKEDVVGQELIQWLETNGADSKKLTLQE---YAPEVRGVH---SRKVLVP 71
Query: 122 GDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFW 176
G+ +P ++T+E +G +T I L + +A L ++L+ + + + SF+
Sbjct: 72 GERILVIPKKCLITVE--MGKQTDIGRKLLARNVDFVAPKHIFLMMFLLTDMEHVETSFF 129
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y L P+ WSE EL++L GS +I ER I+++Y+ + V
Sbjct: 130 RNYYSTL------PSTLSNMPIFWSEEELSWLKGSYIIQQIQERKAAIRKDYDVICRV-- 181
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 296
D F+ + F A + V S L + + ALVP L Y
Sbjct: 182 -----------DPSFARFSLDRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRP 229
Query: 297 KCKA-MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
+ + DA + G + G + N + L+NYGF EDN
Sbjct: 230 RETSWTFDQSIDAFTITSLGTIGTGAQVYDSYGKKCNHRFLLNYGFAVEDN 280
>gi|346465219|gb|AEO32454.1| hypothetical protein [Amblyomma maculatum]
Length = 353
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 70/283 (24%), Positives = 120/283 (42%), Gaps = 28/283 (9%)
Query: 80 DLKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
DL WM NG ++ ++E E R + A + + AG+ VP L++T
Sbjct: 28 DLLEWMIANGFELHVQLCVRE---FTETGRGL---ATLQKVTAGETFLRVPTCLLITTTT 81
Query: 139 VLGNETIAELLTTNK-LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
L + L+ ++ L+ + L L+L+ EK +G S W +I L ++ +P
Sbjct: 82 ALSSSLHGFLVRHHRQLTAIEVLTLFLINEKLRGLDSEWRFFIDSL-------PVSYTTP 134
Query: 198 LLWSETELAYLTGSPT-KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
+ LA L + KAE + I+R + L + + +L +E FT+
Sbjct: 135 VFLGSKLLARLPETMCRKAE--AQVSRIRRTFVRLQIL--LKRALLDDSALLNLSENFTW 190
Query: 257 EIFKQAFVAVQS-CVVHLQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVDDA--VQ 310
+F A+ AV + C+ K F L P L + KA + + +
Sbjct: 191 HLFVWAWTAVNTRCI--FSKHRTDHSFWDDDYCALAPFLDCLNHHWKADVETTVEGSYFE 248
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+V + Y+ + + + G N KLL+ YGFV DNP D + +
Sbjct: 249 IVTNNNYEPNDQVFISYGSHDNKKLLLEYGFVLADNPNDVVAI 291
>gi|367023575|ref|XP_003661072.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
gi|347008340|gb|AEO55827.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
Length = 496
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 69/270 (25%), Positives = 107/270 (39%), Gaps = 55/270 (20%)
Query: 113 VAASEDLQAGDAAFSVP-NSLVVTLERVLGNE----------------TIAELLTTNKLS 155
+ A D+ A F++P +S++ T L NE + E T++
Sbjct: 49 IVARTDIAADTVLFTIPRSSIICTATSALKNEIPGIFDLEGDEDGNSDSGGEDGTSSSQD 108
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
L L L+YE QG S W PY+ L A ++P+ WS TELA L S
Sbjct: 109 SWTLLILILIYEYLQGDASQWKPYLDVL-------PSAFDTPMFWSPTELAELQASALVT 161
Query: 216 EI-LERAEGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 269
++ E A+ + R D V+F G Q+ D FE+ + A+ +
Sbjct: 162 KVGREEADRMIRSKILPVIRGHDHVFFPHGR--QRLDDDQ-----LFELAHRMGSAIMAY 214
Query: 270 VVHLQKVSLARR-----------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
L+K A +VP+ +L ++ A + D++
Sbjct: 215 AFDLEKDDDANEEASEQDEWVDDREGRTMLGMVPMA-DMLNADAEFNAYINHGADSLTAT 273
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
R KAGE I+ + GP PN +LL YG+V
Sbjct: 274 ALRTIKAGEEILNYYGPLPNGELLRRYGYV 303
>gi|255581713|ref|XP_002531659.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
putative [Ricinus communis]
gi|223528717|gb|EEF30729.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
putative [Ricinus communis]
Length = 558
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/250 (22%), Positives = 110/250 (44%), Gaps = 28/250 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+EDL+ GD A +P S++++ E V ++ L + +S L L+ M E+
Sbjct: 190 ATEDLKVGDIALEIPVSIIISEELVRHSDMYHILEKIDGISSETMLLLWSMKERHNCNSK 249
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
+ Y L ++ G ++ S+ L + EI++ E ++ +Y+EL
Sbjct: 250 SKI-YFDTLPKEFNTGLSFGVDAIMASDGTLLF-------DEIMQAKEHLRVQYDEL--- 298
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
+L YP P E +T+E F A S + ++ + R L+P
Sbjct: 299 ---VPALCNNYPDVFPPELYTWEQFLWACELWYSNSMKIKFLDGKLRTCLIPIAGFLNHS 355
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
L P ++ Y + ++ + ++ + RP + GE + G + L+ YGF+ + D
Sbjct: 356 LHPHIIHYGK-----VDSITNTLKFPLSRPCRVGEQCCLSYGNFSGAHLITFYGFLPQGD 410
Query: 346 NPYDRLVVEV 355
N YD + +++
Sbjct: 411 NRYDIIPLDI 420
>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
Length = 605
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/256 (23%), Positives = 115/256 (44%), Gaps = 43/256 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
V A + + + + VP S ++TLE + + +A+ + +L L+ L+ +L+ EK
Sbjct: 190 VNARKAISSKEVILFVPRSHMITLE-MAKDTPVAKKIIQYRLDLLSPKHSFLSTFLLQEK 248
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
K + SFW PY+ L + P+ +++++L +L GSP ++ ++ +K++Y
Sbjct: 249 K-IQDSFWKPYLDVLPKSYSNF------PIFFNDSDLEWLKGSPFLKQVKDKITDLKKDY 301
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
++ V A Q +F+ F A + S + + + + A VPL
Sbjct: 302 CDICQV---APEFLQN----------SFDEFCWARMTASSRIFGIN-IKGVKTDAFVPLA 347
Query: 289 PPL---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
L YS + + + D+ ++ G+ I G + NS+ L+NY
Sbjct: 348 DMLNHKRPKLTSWCYSDERQGFIIETDENIE--------KGQMIFDSYGSKCNSRFLLNY 399
Query: 340 GFVDEDNPYDRLVVEV 355
GFV +DN + + V V
Sbjct: 400 GFVVDDNNANEVNVMV 415
>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
Length = 227
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 81/190 (42%), Gaps = 25/190 (13%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG-- 141
W NG + L+ P + AA +D+Q G VP +++T +G
Sbjct: 11 WCLDNGATINGITLQALPDDE------YGFAAEQDIQVGPVFLGVPLGMMMT---TIGAR 61
Query: 142 NETIAELLTTN---KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
+ LL + K E L+++L+ E G SFW PYI L R + + L
Sbjct: 62 KSKLGALLKDDPIMKSMENVALSMFLILELCAGSASFWHPYISILPR-------SFNTVL 114
Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
+S EL LTGS E L+ I R+Y + F L + PY + FT+++
Sbjct: 115 YFSVDELQLLTGSSVLDEALKLHRSIARQYAYFHKI-FRTHPLAKSLPY---KDCFTYDL 170
Query: 259 FKQAFVAVQS 268
++ A AV +
Sbjct: 171 YRWAVSAVMT 180
>gi|367036287|ref|XP_003648524.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
gi|346995785|gb|AEO62188.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
Length = 496
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 83/203 (40%), Gaps = 32/203 (15%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L L+YE QG+ S W PY+ L ++P+ WS TEL+ L S A++
Sbjct: 109 LILVLIYEHLQGEASRWRPYLDVL-------PPTFDTPMFWSPTELSELQASALVAKV-G 160
Query: 220 RAEG----------IKREYNELDTVWFMAG----------SLFQQYPYDIPTEAFTFEIF 259
RAE + R + E V+F G L + I AF E
Sbjct: 161 RAEADRMIEAKVLPVIRAHEE---VFFPPGRAKLDDAQLFELAHRMGSTIMAYAFDLEND 217
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
++ +VP+ +L ++ A + DDA+ RP +A
Sbjct: 218 DSDNDEADEDDEWVEDREGRTMLGMVPMAD-MLNADAEFNAHINHGDDALTATALRPIRA 276
Query: 320 GESIVVWCGPQPNSKLLINYGFV 342
G+ I+ + GP PN +LL YG+V
Sbjct: 277 GDEILNYYGPLPNGELLRRYGYV 299
>gi|358366345|dbj|GAA82966.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 673
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 90/205 (43%), Gaps = 7/205 (3%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 77 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTALRNAGFEGADAYTWDLYLWAASMFISRAFSARVLSGV 191
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+S + L+P+ + + K A D V VV AG+ I GP+
Sbjct: 192 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDVAFVVLEDVSAGQEISNNYGPR 250
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEV 355
N +L++NYGF NP D +V +
Sbjct: 251 NNEQLMMNYGFCIPGNPCDHRIVSL 275
>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
Length = 439
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 64/234 (27%), Positives = 100/234 (42%), Gaps = 30/234 (12%)
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+P +L + + V +E I E L +ALYL++EK + S W YIR L R
Sbjct: 37 SIPKTLWMDADTVRRSE-IGE--CCEGLRPWIAVALYLLHEKAK-PHSDWSAYIRVLPR- 91
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++SPL WSE ELA L G+ + + E +KREY+++ T + + P
Sbjct: 92 ------TLDSPLFWSEEELAELKGTQLLSSMNGFKEFLKREYDKVMT------EVIEPRP 139
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVV------HLQKVSLA----RRFALVPLGPPLLAYSS 296
+T E F AF ++S +L V LA F L P +
Sbjct: 140 DVFDRSLYTLEAFTWAFGILRSRTFPPLIGDNLALVPLADFVNHGFGLTNEDP---GWKV 196
Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
K + A + + E ++ + + N++L +YGFVD D +R
Sbjct: 197 KSAGVFARQETLTLQAAANCAEKQEVLIQYGKKKGNAQLATDYGFVDSDEKNNR 250
>gi|28393324|gb|AAO42088.1| unknown protein [Arabidopsis thaliana]
Length = 543
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 62/239 (25%), Positives = 98/239 (41%), Gaps = 19/239 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKLGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L + P E +T+E + A S + ++ + L+P+ L
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344
Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 349
VD +++ V RP GE + G +S LL YGF+ + DNPYD
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403
>gi|403215215|emb|CCK69715.1| hypothetical protein KNAG_0C06190 [Kazachstania naganishii CBS
8797]
Length = 496
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 59/254 (23%), Positives = 109/254 (42%), Gaps = 32/254 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT--TNKLSELACLALYLMYE-KK 169
V A ED++ + F VP + ++ +E ++ E+ + + L + L++E K
Sbjct: 41 VIAIEDIEKDEILFEVPRTTMLNVENCELSKRYPEIKNHLVESVGQWEGLIIALLFEWKV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
G+KS W PY++ L ++ QL + W++ EL L G+ E+ E
Sbjct: 101 VGEKSKWWPYLQVLPKKTDMNQL-----IYWADDELELLKPSLILERVGADKAKEMFENV 155
Query: 222 EGI--KREYNELDT-----VW---FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
I K E D+ W + S+ Y +D+ + + E K+ +
Sbjct: 156 VDIINKSTLKEKDSYILKVTWENFLLVASIIMSYSFDV--QDYVEE--KEGGTDEEEDDN 211
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 331
+ V + ++PL L + + KC A L + +++ + K GE I G P
Sbjct: 212 ESENVRSLK--CMIPLADTLNSNTHKCNAHLIHGSNLLEMRSIKAIKKGEQIYNIYGDHP 269
Query: 332 NSKLLINYGFVDED 345
NS++L YG+++ D
Sbjct: 270 NSEILRRYGYIEPD 283
>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
Length = 430
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 101/234 (43%), Gaps = 30/234 (12%)
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
S+P +L + ++ V +E I E L +ALYL++EK + S W YIR L R
Sbjct: 37 SIPKTLWMDVDTVRRSE-IGECCA--GLRPWIAVALYLLHEKAK-PHSDWSAYIRVLPR- 91
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
++SPL WSE ELA L G+ + I E +KREY+++ T + + P
Sbjct: 92 ------TLDSPLFWSEEELAELKGTQLLSSINGFKEFLKREYDKVMT------EVIEPRP 139
Query: 247 YDIPTEAFTFEIFKQAFVAVQSCVV------HLQKVSLA----RRFALVPLGPPLLAYSS 296
+T E F AF ++S +L V LA F L P +
Sbjct: 140 DVFDRSLYTLEAFTWAFGILRSRTFPPLIGDNLALVPLADFVNHGFGLTNEDP---YWHV 196
Query: 297 KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
K + A + + E ++ + + N++L +YGFVD D +R
Sbjct: 197 KSAGVFARQETLTLQAAANCAEKQEVLMQYGKKKGNAQLATDYGFVDSDEKNNR 250
>gi|358369683|dbj|GAA86297.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 489
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 122/305 (40%), Gaps = 54/305 (17%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ G P K+ K + + H V A DL G+ F++P + V++++ N
Sbjct: 22 TWLA--GKPGVKINPKIQIADLRSHAAGRGVVAQSDLDEGEELFTIPRAHVLSVQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ LL+ N L L + ++YE QG +S W Y R L R ++ + W
Sbjct: 76 SNLKNLLSQNLEDLGPWLSLMVVMIYEYLQGDQSAWASYFRVLPRN-------FDTLMFW 128
Query: 201 SETELAYLTGSP---------TKAEILE------RA--------EGIKREYNELDTVWFM 237
S +EL L GS + I+E RA +G+ + T +
Sbjct: 129 SASELEELQGSAIVEKIGKQGAEGSIIESIAPIVRANPALFPPIDGVASYDGDAGTQALL 188
Query: 238 -----AGSLFQQYPYDI--PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
GSL Y +DI P + + ++ + + +VPL
Sbjct: 189 HLAHTMGSLIMAYAFDIEKPEDEEGDRDGEDGYLTDEEEEQSSK--------GMVPLADL 240
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
L A + + A L ++ + + +P K+GE I G P S LL YG+V DN
Sbjct: 241 LNADADRNNARLFQEEEVLVMKAIKPIKSGEEIFNDYGEIPRSDLLRRYGYV-TDNYAQY 299
Query: 351 LVVEV 355
VVE+
Sbjct: 300 DVVEL 304
>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
Length = 536
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 63/239 (26%), Positives = 99/239 (41%), Gaps = 18/239 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASE + GD A +P L+++ E + +E L N ++ L L+ M E+
Sbjct: 184 ASESIGVGDIALEIPEFLIISDELLCQSEVFLALKDFNNITSETMLLLWSMRERYNLGSK 243
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + LA L G+ EI++ + ++++Y+EL +
Sbjct: 244 F-KPYFDTLPANFNTG-------LSFGIDALAALEGTLLFDEIIQARQHLRQQYDELFPL 295
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L +P + T++ F A S + + S LVP+ L
Sbjct: 296 ------LCTNFPEMFRKDVCTWDDFLWACELWYSNSMMIVLSSGKLSTCLVPVAGLLNHS 349
Query: 295 SSKCKAMLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 349
S VD+A ++ + RP AGE + G P S L+ YGF+ DNPYD
Sbjct: 350 VSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGSHLVTFYGFLPRGDNPYD 408
>gi|403349615|gb|EJY74245.1| hypothetical protein OXYTRI_04500 [Oxytricha trifallax]
Length = 689
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 66/131 (50%), Gaps = 19/131 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-------SELACLALYLM 165
+ A +D+ A +PNS ++++ RV + + ++L+ ++ ++ CLA++LM
Sbjct: 74 IGAKKDIGQYKAFLFIPNSCIISVTRVKKHPIVGQILSNHQELFMKHADADQLCLAVFLM 133
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETELAYLTGSPTKAEILERAEG 223
E QG++SFW PYI ++ ES LL W + E+ L + E +
Sbjct: 134 NEYLQGQQSFWWPYINVMN----------ESDLLYKWKDEEIKLLNDFEIYQQAKEYRDD 183
Query: 224 IKREYNELDTV 234
I+ E+N+L +
Sbjct: 184 IEDEWNKLSKI 194
>gi|242081035|ref|XP_002445286.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
gi|241941636|gb|EES14781.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
Length = 490
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 63/241 (26%), Positives = 101/241 (41%), Gaps = 18/241 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ ASE + G+ A +P SL+++ E + +E L N ++ L L+ M E+
Sbjct: 182 MVASESIGVGEIALEIPESLIISDELLCQSEVFLALKDFNSITSETMLLLWSMRERYNLA 241
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
F PY L G L + LA L G+ EI++ + ++++Y+EL
Sbjct: 242 SKF-KPYFDTLPANFNTG-------LSFGIDGLAALEGTLLFDEIMQAKQHLRQQYDELF 293
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
+ L +P + T++ F A S + + S LVP+ L
Sbjct: 294 PL------LCTNFPEIFRKDVCTWDNFLWACELWYSNSMMVVLSSGKLSTCLVPVAGLLN 347
Query: 293 AYSSKCKAMLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPY 348
S VD+A ++ + RP AGE + G P S L+ YGF+ DNPY
Sbjct: 348 HSVSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGSHLVTFYGFLPRGDNPY 407
Query: 349 D 349
D
Sbjct: 408 D 408
>gi|79557522|ref|NP_179475.3| SET domain-containing protein [Arabidopsis thaliana]
gi|56381987|gb|AAV85712.1| At2g18850 [Arabidopsis thaliana]
gi|330251719|gb|AEC06813.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 543
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 62/239 (25%), Positives = 98/239 (41%), Gaps = 19/239 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L + P E +T+E + A S + ++ + L+P+ L
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344
Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 349
VD +++ V RP GE + G +S LL YGF+ + DNPYD
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403
>gi|281207968|gb|EFA82146.1| hypothetical protein PPL_04566 [Polysphondylium pallidum PN500]
Length = 510
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 70/156 (44%), Gaps = 16/156 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL+ +P S ++T +I+ L K+ + ++ L+YE G
Sbjct: 59 VIALQDLKIDHTVAIIPKSCLLTPHTT----SISAYLKKYKIKDATATSIALLYEASIGS 114
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W YI+ L L+V+ P+LW++ +L L G+ + + E E + YN+
Sbjct: 115 QSKWYGYIKSL-------PLSVDLPILWNDADLKNLKGTSIETVVYENKETVDATYNK-- 165
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
++ L +P F+ + FK+A V S
Sbjct: 166 ---YIKSKLIANHPDVFNEHVFSLDNFKRASCLVSS 198
>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
Length = 455
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 58/247 (23%), Positives = 107/247 (43%), Gaps = 29/247 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++ + T+++ + + +L + LS LALYL++ K
Sbjct: 28 VKALRSFKEGERILTIPSACLWTVKKAYADPLLGPVLRAAQPPLSVEDSLALYLLFVKS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
R L + R +A + + +++ EL GS A + + +
Sbjct: 87 ----------RTLGYEGQRHHIAAMPQSYSASIFFTDDELQVCKGSSLYALTPQLEQRVH 136
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
+Y +L +L Q+ P + FT E +K A ++ S + A +
Sbjct: 137 DDYRQLLV------ALLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDFAVSETASVRLVA 190
Query: 286 PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
PL +L +S +C A D + ++ + Y+ G+ I ++ G PN++LL YGFV
Sbjct: 191 PLAD-MLNHSPDVKQCHAYDPTSGD-LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFV 248
Query: 343 DEDNPYD 349
DNP D
Sbjct: 249 LPDNPND 255
>gi|219126444|ref|XP_002183467.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405223|gb|EEC45167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 519
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 80/189 (42%), Gaps = 21/189 (11%)
Query: 160 LALYLMYEKK-QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
L +YL++++K G SF+ PY L P+ WS EL L GS ++I
Sbjct: 116 LMIYLLWDRKTHGSSSFFHPYYEILP------PTLRNMPIFWSAFELQELEGSHLLSQIA 169
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
+R + I+ +Y + V G+L T + FK A + V S LQ +
Sbjct: 170 DRGQAIQDDYEAILEVAPSLGTLC------------TLDEFKWARMCVCSRNFGLQ-IDG 216
Query: 279 ARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
R ALVP L Y + K V + + +AG + G + N + L+
Sbjct: 217 HRTSALVPHADMLNHYRPRETKWTFDEVTQCFTITSLQSIQAGAQVYDSYGQKCNHRFLL 276
Query: 338 NYGFVDEDN 346
NYGF EDN
Sbjct: 277 NYGFAVEDN 285
>gi|334184301|ref|NP_001189551.1| SET domain-containing protein [Arabidopsis thaliana]
gi|330251720|gb|AEC06814.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 536
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 62/239 (25%), Positives = 98/239 (41%), Gaps = 19/239 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 180 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 239
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 240 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDELI-- 289
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
L + P E +T+E + A S + ++ + L+P+ L
Sbjct: 290 -----PLLSNHREVFPPELYTWEHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHS 344
Query: 295 SSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYD 349
VD +++ V RP GE + G +S LL YGF+ + DNPYD
Sbjct: 345 IYPHIVKYGKVDIETSSLKFPVSRPCNKGEQCFLSYGNYSSSHLLTFYGFLPKGDNPYD 403
>gi|327295326|ref|XP_003232358.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
gi|326465530|gb|EGD90983.1| hypothetical protein TERG_07206 [Trichophyton rubrum CBS 118892]
Length = 692
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/206 (24%), Positives = 86/206 (41%), Gaps = 35/206 (16%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++++E+ +G+ S W PY+ L R + S L + +++L +L G+
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFFQDSDLEWLQGTSLYETHRA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+K EY+ +A S+ + Y + E++T++IF A+ + S + +
Sbjct: 163 YRNTVKEEYD-------LAISILRDEGY-LAIESYTWDIFCWAYTLIAS------RAFTS 208
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVD----------------DAVQLVVDRPYKAGESI 323
R P L + + ML VD ++L V P GE +
Sbjct: 209 RVLDAYLSNHPSLKQEEEFQIMLPLVDFSNHKPLAKIEWQAEATEIRLKVVEPTFTGEEV 268
Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYD 349
GP N +L+ YGF DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
Length = 465
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 64/272 (23%), Positives = 111/272 (40%), Gaps = 49/272 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--------LSELACLALYL 164
+ A D+ A F++P + +E + I ++ +K L + L L +
Sbjct: 43 IIALRDIPAETTLFTIPRKGSINIETSELPQKIPDVFDLDKPDEDDVPGLDSWSSLILIM 102
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE-RAEG 223
+YE +G S W Y L + ++P+ WSE EL L S + +I + AE
Sbjct: 103 IYEYLRGDSSQWKSYFDVLPS-------SFDTPMFWSENELDQLQASHMRHKIGKADAEN 155
Query: 224 I-------------------KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
+ R +EL + GS Y +D+ + E ++
Sbjct: 156 MFKKTLVPIIRSNPSIFNAENRSDSELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVE 215
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ +VP+ +L ++ A + ++++ + RP KAGE I+
Sbjct: 216 DRDGKSM----------MGMVPMAD-ILNADAEFNAHVNHEEESLTVTSLRPIKAGEEIL 264
Query: 325 VWCGPQPNSKLLINYGFVDEDNP-YDRLVVEV 355
+ GP PNS+LL YG+V E + YD VVE+
Sbjct: 265 NYYGPHPNSELLRRYGYVTEKHSRYD--VVEI 294
>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
Length = 915
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 88/198 (44%), Gaps = 22/198 (11%)
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
+ L + +M+E +G +S W PY+ L + E+P+ WS EL L S T+ ++
Sbjct: 543 SILIIIMMFEYFKGDESKWKPYMDVL-------PASFETPMFWSGAELDELQASATRTKV 595
Query: 218 LERAEGIKREYNELDTVWFMAGSLF---QQYPYDIPTE----------AFTFEIFKQAFV 264
+A+ + + ++ V +F Q Y D + ++ F+ +
Sbjct: 596 -GKADAEEMFHAKVLPVIRANHEIFPSSQSYSDDELVQLAHRMGSTIMSYAFDFQNEDEE 654
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
+ ++ +VP+ +L ++ A + DDA+ + R KAGE I+
Sbjct: 655 DEEDEEEWVEDRESKSTMGMVPMAD-ILNADAEYNAHVNYGDDALTVTALRTIKAGEEIL 713
Query: 325 VWCGPQPNSKLLINYGFV 342
+ GP PNS+LL YG+V
Sbjct: 714 NYYGPHPNSELLRRYGYV 731
>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
pulchellus]
Length = 485
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 66/285 (23%), Positives = 118/285 (41%), Gaps = 32/285 (11%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W NG V +K+ P + A E ++ + VP L++T
Sbjct: 80 WCSDNGAYLGSVSIKDLPDGE------YGFVADEHIEESNQFLGVPLKLMMTTAAA-KKS 132
Query: 144 TIAELLTTN----KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ LL + +S +A LA++L+ E G+ SFW PYI L + + L
Sbjct: 133 KLGPLLRDDPIMMSMSNVA-LAMFLILEFCTGESSFWHPYISTL-------PASFNTVLY 184
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+S EL L GS E L+ I R+Y+ + F L + PY + FT++++
Sbjct: 185 FSVEELELLHGSTVLDEALKLHRSIARQYSYFHKI-FRTHPLAKSLPY---KDCFTYDLY 240
Query: 260 KQAFVAVQS--CVVHLQKVSLAR-------RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
+ A AV + V L + A+VPL K + ++
Sbjct: 241 RWAVSAVMTRQNAVPLTDTAGGDDEDGTDAMTAMVPLWDMCNHSDGKVFTDYDISANMLR 300
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
R ++ G+ + ++ G + N++ I+ GFV +N +D + +++
Sbjct: 301 CYAMRDFEKGQEVTIFYGRRTNAEFFIHNGFVFPENRHDSVDIKL 345
>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
Length = 255
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 67/255 (26%), Positives = 110/255 (43%), Gaps = 34/255 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGN---ETIAELLTTNKLSELACLALYLMYEKK 169
V L GD ++P SL++T VL + I + L +LS L ++L+ E+
Sbjct: 8 VMVRRRLLTGDTIIAIPESLLITTSTVLRSYLGPVIHDFLPC-RLSPTETLVIFLLCERN 66
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+G SFW PY+ L + L W+ E+ L TK + + +N
Sbjct: 67 KGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACDLRLKAEESFN 118
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLA----RRF 282
L + L +Q P AFT+++FK A+ +V + V++ Q L+ +
Sbjct: 119 RLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQPQNSVLSPDEEDKS 174
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDA------VQLVVDRPYKAGESIVVWCGPQPNSKLL 336
AL P LL ++ + A DD+ L +PY + + + GP N KLL
Sbjct: 175 ALAPFL-DLLNHTVDVEVN-ARFDDSSKSYKITTLTACKPY---DQVFINYGPHSNEKLL 229
Query: 337 INYGFVDEDNPYDRL 351
+ YGF NP++ +
Sbjct: 230 LEYGFTLPCNPHNNI 244
>gi|121707885|ref|XP_001271968.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119400116|gb|EAW10542.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 677
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 93/205 (45%), Gaps = 27/205 (13%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ + QG+ FW PYIR L + L++ +PL + +L +L G+ +R
Sbjct: 97 FFLIGQYLQGEDGFWFPYIRTLPQP-----LSLTTPLYYEGDDLGWLKGTSLWPAREQRM 151
Query: 222 EGIKREYNELDTVWFMAGSLFQ---QYPYD--------IPTEAFTFEIFKQAFVAVQSCV 270
E +K Y + V + + FQ +Y +D I + AF+ ++ +AF +
Sbjct: 152 ELLKEAYE--NGVRELRKAGFQDVDKYTWDLYLWASSMIVSRAFSPKVLAEAFADID--- 206
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+ VS+ L+P L+ + K A V +V AG+ I GP+
Sbjct: 207 LPEDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAYLVLEDVAAGQEIANNYGPR 260
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEV 355
N +L++NYGF DNP D +V +
Sbjct: 261 NNEQLMMNYGFCLPDNPCDYRIVSL 285
>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
Length = 486
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 65/273 (23%), Positives = 112/273 (41%), Gaps = 34/273 (12%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L W+ NG K+ L+E + + R +H + + L G+ +P ++T+E
Sbjct: 44 ELIQWLEGNGADTKKLALQE---YAPEVRGVH---SRKVLAPGERILVIPKKCLITVE-- 95
Query: 140 LGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
+G +T I L + +A L ++L+ + ++ + SF+ Y L
Sbjct: 96 MGKQTDIGRKLLARNVDFVAPKHIFLMMFLLTDMERAETSFFRNYYSTLP------STLS 149
Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
P+ WS+ EL +L GS +I ER I+++Y+ + V D F
Sbjct: 150 NMPIFWSDEELGWLKGSYIIQQIQERKAAIRKDYDVICRV-------------DPAFARF 196
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA-MLAAVDDAVQLVV 313
+ + F A + V S L + + ALVP L Y + + DA +
Sbjct: 197 SLDRFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITS 255
Query: 314 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
G + G + N + L+NYGF EDN
Sbjct: 256 LGTIGTGAQVYDSYGKKCNHRFLLNYGFAVEDN 288
>gi|255945819|ref|XP_002563677.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588412|emb|CAP86520.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 487
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 109/268 (40%), Gaps = 54/268 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A ++ G+ FSVP ++V+T++ N + LL N ++ L L ++YE
Sbjct: 50 VVAQSNISEGEELFSVPRAMVLTVQ----NSELRTLLGENLEEQMGPWLSLMLVMVYEYL 105
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA---EGIKR 226
QG+KS W PY R L + ++ + WS EL L S T E + R+ E I+
Sbjct: 106 QGEKSRWAPYFRVLPSR-------FDTLMFWSPAELQELQAS-TIVEKIGRSGAEESIRN 157
Query: 227 ----------------------EYNELDT----VWFMAGSLFQQYPYDI---PTEAFTFE 257
E + D V + GSL Y +DI + E
Sbjct: 158 SIAPILAKRPDLFPPPQGLASWEGDAGDAALIQVGHIMGSLIMAYAFDIEKSEDDGDEGE 217
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
++++ L K +VPL L A + + A L + A+ + +P
Sbjct: 218 ANDESYMTDDEEEEQLPK-------GMVPLADLLNADADRNNARLYQEEGALVMKAIKPI 270
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDED 345
+ GE I G P + LL YG+V ++
Sbjct: 271 QQGEEIFNDYGEIPRADLLRRYGYVTDN 298
>gi|395518633|ref|XP_003763464.1| PREDICTED: SET domain-containing protein 4 [Sarcophilus harrisii]
Length = 440
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 65/274 (23%), Positives = 115/274 (41%), Gaps = 28/274 (10%)
Query: 97 LKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLT 150
LKE+ + RP + + A + LQ G+ S+P ++T + V+ + + + +T
Sbjct: 40 LKERKFEDHNLRPTRFSGTGRGLMAVKSLQPGELIISLPEKCLLTTDTVIKS-YLGDYIT 98
Query: 151 --TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
T +S L L +L+ E G KS W PY+ L + + L+ E ++ L
Sbjct: 99 KWTPPISPLLALCTFLISENNAGNKSPWKPYLDILPKDY--------TCLVCLEPQVVRL 150
Query: 209 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
P K + E+ ++ + + SLF + D+ F + F A+ + +
Sbjct: 151 LPKPLKIKAQEQKTQVQELFVSSRGFFSSLQSLFTE---DVK-HIFHYHAFLWAWCTINT 206
Query: 269 CVV---HLQKVSLARRFALVPLGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGE 321
V H QK L+ + L P LL +S + A ++ K E
Sbjct: 207 RTVYMKHAQKKCLSAEPDVYALAPYLDLLNHSPGVQVNAAFNEKTRCYEIRTTSSCKKYE 266
Query: 322 SIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ + GP N +LL+ YGFV +NP+ + V +
Sbjct: 267 ELFICYGPHDNHRLLLEYGFVAINNPHSAVYVSI 300
>gi|393230612|gb|EJD38215.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
Length = 381
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 69/259 (26%), Positives = 109/259 (42%), Gaps = 27/259 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT-------LERVLGNETIAELLTTNKLSE--LACLALY 163
V SE+L S P SL +T L+R+LG A+L N LSE L C L
Sbjct: 3 VHTSEELPPDAPVISAPFSLAITPTVAADALQRILGPG--ADL---NSLSERELVCTYLA 57
Query: 164 LMYEKKQ---GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
+ + K+ G + L + +D R QL +PL ++ ELA L G+ A +R
Sbjct: 58 MHWIAKEVDLGPSAASLDHGPYVDSLPSRAQL--RTPLHFTPQELALLKGTNMAAATTDR 115
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 280
+ E V G Y + + F ++ + ++ + +
Sbjct: 116 EADWRSECERCRAVLGHWGEHLTWEHYLTASTHLSSRAFPSTLLSPEPALI----PTPSS 171
Query: 281 RFALVPLGPPL-LAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLL 336
LVPL L A + ++ D+ +V P AG ++ GP+PN++L+
Sbjct: 172 HPVLVPLIDSLNHARAHPVSWSVSPADNGAHTLSIVQHAPVAAGAEVLNNYGPKPNAELV 231
Query: 337 INYGFVDEDNPYDRLVVEV 355
+ YGF DNP D LV++V
Sbjct: 232 LGYGFALPDNPDDTLVLKV 250
>gi|350595011|ref|XP_003484025.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Sus
scrofa]
Length = 326
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 62/117 (52%), Gaps = 7/117 (5%)
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS 296
M +F +YP P E F E FK +F + S +V L S+ + ALVP ++ +S
Sbjct: 73 MRVRIFSKYPDFFPEEVFNIESFKWSFGILFSRMVRLP--SMDGKNALVPWAD-MMNHSC 129
Query: 297 KCKAMLAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
+ + L + +V DRPY+ GE + + G + N +LL++YGFV ++ NP D
Sbjct: 130 EVETFLDYDKSSKGIVFPTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGTNPSD 186
>gi|302810436|ref|XP_002986909.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
gi|300145314|gb|EFJ11991.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
Length = 447
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 67/272 (24%), Positives = 109/272 (40%), Gaps = 59/272 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL G+ ++P + +TL + IA L L + +MYE+ +GK
Sbjct: 10 VRALRDLHHGELIATIPKAACLTLLTTAARDAIARARLGGGLG----LTVAVMYERSKGK 65
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREYNEL 231
S W Y++ L Q P LWSE E+ L G+ + E +K ++ E
Sbjct: 66 GSKWYRYLKTLPCQE-------SVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEE- 117
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPP 290
L ++ P + P + FTFE +++A +S V ++ + +VPL
Sbjct: 118 -----NIAPLTKEDPLEFPAQDFTFE----SYLAAKSLVSSRSFEIDAEHGYGMVPLAD- 167
Query: 291 LLAYSSKCKA-----MLAA-------------VDDA---------------VQLVVDRPY 317
++ K A ML A +DD +++V+ +
Sbjct: 168 --LFNHKTDAEDVHFMLNASDSDDDDDNNGLIIDDGLANGDCREISSDKSVLEMVMVKDV 225
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
AG I G N+ LL YGF + +NP+D
Sbjct: 226 AAGSEIFNTYGQLGNAALLHRYGFTEPNNPHD 257
>gi|358397725|gb|EHK47093.1| hypothetical protein TRIATDRAFT_298882 [Trichoderma atroviride IMI
206040]
Length = 481
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 87/193 (45%), Gaps = 14/193 (7%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L L+ E +G++SFW PYI+ L + A+ P W E E L G+ + + +
Sbjct: 90 LLLIKELLRGEESFWWPYIQALPQPEDVDDWAL--PPFWPEEEAELLEGTNVEVGLDKIR 147
Query: 222 EGIKREYNELDTVWFMA--------GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
+ +KRE+ E + + L + Y+ F+ F+ + V ++ L
Sbjct: 148 DDLKREFREAKAMLLASQKDAEDDFSELLTRELYNWAYCIFSSRSFRASLVMTEAQQQAL 207
Query: 274 -QKVSLARRFALVPL---GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
+ VS+ L+PL G +A + + A A QL V R ++ G+ I P
Sbjct: 208 PEDVSVDDFSVLLPLFDIGNHDMAVDVRWELDAANSGAACQLRVGREHQPGQQIFNNYSP 267
Query: 330 QPNSKLLINYGFV 342
+ N++LL+ YGF+
Sbjct: 268 KTNAELLLGYGFM 280
>gi|425773952|gb|EKV12277.1| hypothetical protein PDIG_46020 [Penicillium digitatum PHI26]
gi|425782378|gb|EKV20291.1| hypothetical protein PDIP_17950 [Penicillium digitatum Pd1]
Length = 487
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 66/269 (24%), Positives = 109/269 (40%), Gaps = 56/269 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A ++ G+ FS+P ++V+T++ N + LL N ++ L L ++YE
Sbjct: 50 VVAQSNIVEGEELFSIPRTMVLTVQ----NSELRTLLAENLEEQMGPWLSLMLVMVYEYL 105
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-------- 221
QG+KS W PY R L + ++ + WS EL L S T E + R+
Sbjct: 106 QGEKSRWAPYFRVLPSR-------FDTLMFWSPAELQELQAS-TIVEKIGRSNAEESIRD 157
Query: 222 ----------------------EGIKREYNELDTVWFMAGSLFQQYPYDI---PTEAFTF 256
EGI + L V + GSL Y +DI +
Sbjct: 158 SIAPILAKRPDLFPPPPGLASWEGIAGDA-ALIQVGHVMGSLIMAYAFDIEKAEDDDDEG 216
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRP 316
E+ ++++ L K +VPL L A + + A L + A+ + +P
Sbjct: 217 EVNDESYMTDDEEEEQLPK-------GMVPLADLLNADADRNNARLYQEEGALVMKAIKP 269
Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDED 345
+ G+ I G P + LL YG+V ++
Sbjct: 270 IQKGDEIFNDYGEIPRADLLRRYGYVTDN 298
>gi|453083670|gb|EMF11715.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 477
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 105/270 (38%), Gaps = 51/270 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
V A++DL + FS+P + ++T NET L N EL L L +++E
Sbjct: 45 VVATQDLSEDEELFSIPRASILT------NETTD--LPANLRKELDHPWLSLILVMVHEY 96
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS----------------- 211
+G KS W PY L +S + WS+ EL L GS
Sbjct: 97 LKGTKSSWYPYFNLLPE-------TFDSLMFWSDEELLSLKGSAVVDKIGKESADSTFTE 149
Query: 212 ---PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI--PTEAFTFEIFKQAFVAV 266
P A+ + R +EL ++ GS Y +D+ P +
Sbjct: 150 QLIPLIAQHANIFQTAGRSNDELLSLCHRMGSTIMAYAFDLEKPEPSQPPNQQDDEEWEE 209
Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
+ + L K ++PL L A + A L DD V + +AGE +
Sbjct: 210 EESAISLPK-------GMIPLADMLNANADHNNAKLFYQDDKVVMKTLHAVRAGEELFND 262
Query: 327 CGPQPNSKLLINYGFV-DEDNPYDRLVVEV 355
GP P S LL YG+V D+ YD VVE+
Sbjct: 263 FGPLPRSDLLRRYGYVTDQYAKYD--VVEI 290
>gi|400597281|gb|EJP65016.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 484
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 97/248 (39%), Gaps = 78/248 (31%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
L+++ L L L L+YE +G S W PY+ L E+P+ W+ EL L
Sbjct: 96 LSSSPLDAWGALILVLLYEHLRGAASAWRPYLDVL-------PATFETPMFWTGAELGAL 148
Query: 209 TGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
T ++ RE E DT + + + +P ++F+ +
Sbjct: 149 QAGATAGKV-------GRESAE-DTFRGILLPVVRAHP----------DVFQGSAALSDE 190
Query: 269 CVVHLQKVSLARRFALVPLGPPLLAYS---------------------SKCKAMLAAV-- 305
+V +LA R +G ++AY+ KAM+ V
Sbjct: 191 ALV-----ALAHR-----MGSTIMAYAFDLENDEEREDEEDEDGWVEDRDGKAMMGMVPM 240
Query: 306 -----------------DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP- 347
D+ + + RP KAGE I+ + GP PNS+LL YG+V E +
Sbjct: 241 ADILNADAEFNAHVNHGDNELTVTALRPIKAGEEILNYYGPHPNSELLRRYGYVTERHSR 300
Query: 348 YDRLVVEV 355
YD VVE+
Sbjct: 301 YD--VVEI 306
>gi|281201870|gb|EFA76078.1| hypothetical protein PPL_10657 [Polysphondylium pallidum PN500]
Length = 1234
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 53/244 (21%), Positives = 102/244 (41%), Gaps = 18/244 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT-NKLSELACLALYLMYEKKQG 171
+ ++ ++ + VP ++ ++ + + + + L++ L L+++YEK +
Sbjct: 767 IVTTKKVEENEVIIKVPRKFLINVQVAREHPILGRIFEEFSGLNDDTILFLFVIYEK-EN 825
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
SFW P+ L + + ++ TEL L G+ AE L+ +K +
Sbjct: 826 PNSFWRPFFDTLPS-------YFPTSIHYTSTELLELEGTNLFAETLQ----VKEHLQSI 874
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ F L +QYP P F++E F A S + L K+ LVP+ +
Sbjct: 875 RDMLF--PELSEQYPTIFPESLFSWENFLWARSLFDSRAIQL-KIDDKITNCLVPMADMI 931
Query: 292 LAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+ + + D ++V I + G N +L + YGFV +DNPYD
Sbjct: 932 NHHHNAQISQRFFDQTDQCFKMVSCCSVPPNAQIFLHYGALQNRELALYYGFVIQDNPYD 991
Query: 350 RLVV 353
+++
Sbjct: 992 SMLI 995
>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
Length = 449
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 62/282 (21%), Positives = 119/282 (42%), Gaps = 27/282 (9%)
Query: 77 DLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
D D+ W NG KV+L++ + + PI + A ED++AG+ S+P +L+
Sbjct: 25 DGSDVYEWAAANGANVSKVVLRD----DGEAGPI--LHAKEDIEAGEVILSLPANLLFP- 77
Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
RV + + ++ + + + LYL+ E+ S W P+++ L +
Sbjct: 78 TRVSDHSPVVHMIENTTIGRITAICLYLISERADS-SSHWKPWLQSLPPRFFHA------ 130
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ------QYPYDIP 250
L +SE ++ + S K + + +++EY + F P ++
Sbjct: 131 -LSYSEDDMLHFQASSFKELRDRKKKNVRQEYEQTVAPLLHKLPAFDPLLAAVDKPQNVT 189
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPPLLAYSSKCKAMLAAVD- 306
E FT+E F+ A+ V + + + R VP LGP ++ + + D
Sbjct: 190 REDFTYEAFEWAYSVVTTRGIFPGLLGEEEREGEVPLLVLGPLADSFIHGASGVKISYDA 249
Query: 307 DAVQLVVDRPYKAGES--IVVWCGPQPNSKLLINYGFVDEDN 346
+ V +K ++ I + G N +LL N GF+ ++N
Sbjct: 250 QEHRCVFSALHKVAKNSPISIGVGMSSNMELLANRGFMMQNN 291
>gi|126325439|ref|XP_001376285.1| PREDICTED: SET domain-containing protein 4-like [Monodelphis
domestica]
Length = 437
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 65/285 (22%), Positives = 121/285 (42%), Gaps = 28/285 (9%)
Query: 86 HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERV 139
HK + LK++ + RP + + A + LQ G+ S+P ++T + V
Sbjct: 30 HKQEFIELRKWLKKRKFEDHNLRPTRFSNTGRGLMAVKSLQPGELIISLPKECLLTTDTV 89
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + + + +T +S L L +L+ EK G KS W PY+ L + +
Sbjct: 90 IRS-YLGDYITKWMPPISPLLALCAFLISEKHAGNKSPWKPYLDVLPK--------AYTC 140
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
L+ E E+ L P + + E+ +++ + + SLF + D+ F +
Sbjct: 141 LVCLEPEVVRLLPRPLQMKAEEQRMQVQKLFISSRGFFSSLQSLFTE---DV-KHVFHYH 196
Query: 258 IFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAV--Q 310
F A+ + + V H QK L+ + L P LL +S + A ++ +
Sbjct: 197 AFLWAWCTINTRTVYMKHAQKQCLSAEPDVYALAPYLDLLNHSPRVWVEAAFNEETCCYE 256
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ K E + + GP N +LL+ YGFV +NP+ + + +
Sbjct: 257 IRTTSHCKKFEELFICYGPHDNHRLLLEYGFVASNNPHSAVYIAI 301
>gi|328869852|gb|EGG18227.1| hypothetical protein DFA_03714 [Dictyostelium fasciculatum]
Length = 504
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 42/176 (23%), Positives = 78/176 (44%), Gaps = 20/176 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +DL+ + +P V++ + +IA +L +L E ++ LMYE +G
Sbjct: 44 IIAKQDLKVDEIIAVIPKRCVLSPKTT----SIAPILEKYELEEAVATSIALMYETSKGV 99
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W YI+ + ++ P+LW + + YL G+ + ++E E ++ +Y E
Sbjct: 100 QSKWYSYIQSM-------PTVIDLPILWDKESIEYLVGTDLEEIVIENIETLEEQYRE-- 150
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
+ + +P FT E FK A V S ++ + +LVPL
Sbjct: 151 ----DVEPIIKNHPETFKENIFTLESFKIASTIVSSRAFNIDQYHGE---SLVPLA 199
>gi|7329638|emb|CAB82703.1| putative protein [Arabidopsis thaliana]
Length = 486
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 73/149 (48%), Gaps = 16/149 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + AGD VP + +T + + + +L +N++ + LA L+ EKK G+KS
Sbjct: 75 ASKVIYAGDCMLKVPFNAQITPDELPSD---IRVLLSNEVGNIGMLAAVLIREKKMGQKS 131
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W+PYI L + + S + W E EL+ + S E +++ I+++++
Sbjct: 132 RWVPYISRLPQPA-----EMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFS----- 181
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAF 263
F+A + Q P I TE E F A+
Sbjct: 182 -FVAQAFKQHCP--IVTERPDLEDFMYAY 207
>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
Length = 577
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 69/297 (23%), Positives = 135/297 (45%), Gaps = 46/297 (15%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
KSW+ ++G+ + ++E S E V A +D++ ++P ++T E
Sbjct: 83 FKSWLKEHGVEYSAIDIQE-VSEEEGFG----VIALQDIEIKCPLVTIPRKAMMTYEDA- 136
Query: 141 GNETIAELLTTNKLSEL---ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ +A L+ N++ + CLALYL E+ S + PYI + ++ +
Sbjct: 137 KSSYLAGLIEGNEVLSVMPNVCLALYLHCERFT-LNSKYQPYIDMIPQE-------FNTI 188
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA---F 254
L + E+ YL G+ + + + + I R++ L V+ GS ++ +P +A F
Sbjct: 189 LYFKPHEMKYLKGTAALSVAINQFKSIVRQFALLYQVF--NGSHQKEDVEKLPLQARNAF 246
Query: 255 TFEIFKQAFVAVQS----CVVH----LQKVSLARRFALVPL--------GPPLLAYSSKC 298
TF+ ++ AV + H L + AL+P+ GP AY+
Sbjct: 247 TFDTYRWCASAVTTRQNKIPTHVGDVLGDLDENSTLALIPMWDMFNHAIGPLSTAYN--- 303
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
A+ ++ + + +K GE + + G + NS LLI+ GFV +++P+D++ + +
Sbjct: 304 -----ALTRGIECLAMQDFKTGEQVKICYGARTNSDLLIHNGFVMKESPFDKVRIHL 355
>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
Length = 465
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 75/299 (25%), Positives = 123/299 (41%), Gaps = 45/299 (15%)
Query: 72 SKKEEDLGDLKSWMHKNGL---PPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
+K+ E L + K W+ N P + L +K + + A + ++ D S+
Sbjct: 34 TKEIESLKEFKEWLVNNNAYINPNIDIELLDKYGRS--------IVAKKSIKKQDKLISI 85
Query: 129 PNSLVVTLERVLGN-----ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL 183
P ++++ + G + I E + + LS A+++MY K +KSFW PY+ L
Sbjct: 86 PKDIIMS--NIGGYPKKIPKEIYEQVQSIGLSPTNLQAVFIMY-SKLNEKSFWHPYVTVL 142
Query: 184 DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ 243
+ + L +S+ EL L S K + R +GI+R Y S F
Sbjct: 143 PE-------SFSTSLYFSDNELDELQASQLKEFTIIRKDGIERHYE----------STFS 185
Query: 244 QYPYDIPTEAFTFEIFKQA-FVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYS-SKCKA 300
+ +P E ++ Q F SCV + SLA +VPL A SK K
Sbjct: 186 RLSKLVP-EFSNLALYNQELFTWALSCVWS-RAFSLAENDGGMVPLADMFNAEDRSKSKV 243
Query: 301 MLAAVDDAVQLVVDRPYKAGESIVVWCG---PQPNSKLLINYGFV-DEDNPYDRLVVEV 355
+ D + GE I G P +S++L++YGF+ DE D + + V
Sbjct: 244 LPKVTDTTLDYYASDDIAEGEQIFTPYGVYKPLSSSQMLMDYGFIFDEGTVSDNVAITV 302
>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
Length = 903
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 87/200 (43%), Gaps = 21/200 (10%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T V G+R + + +L W+H G+ I + S + V A+E ++A
Sbjct: 371 TAVIGTRPAALESRKIGDNLLQWLHNAGMTS---IAENHLSIADFEHTGRGVLANERIEA 427
Query: 122 GDAAFSVPNSLVVTLERVLG-NETIAELLTT--NKLSELACLALYLMYEK-KQGKKSFWL 177
G +P L++ + L + I +L+ ++ + L LY+++EK G S W
Sbjct: 428 GVEVLHLPQHLLINIHVALDESHPIGRVLSDLRDEYDDDTLLLLYVLHEKLVAGSASRWA 487
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
P+ L SPLL+ TEL L G+ E E +G++ + L
Sbjct: 488 PFFETL-------PATYNSPLLFHVTELLELEGTRLIDETFEIKDGLRVLHESL------ 534
Query: 238 AGSLFQQYPYDIPTEAFTFE 257
G L + YP PT+AFT+E
Sbjct: 535 -GPLAEAYPALFPTDAFTYE 553
>gi|295668911|ref|XP_002795004.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226285697|gb|EEH41263.1| SET domain-containing protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 488
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 59/263 (22%), Positives = 106/263 (40%), Gaps = 49/263 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINKEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKRE- 227
G S W PY + L ++ + W++ EL L GS I + A E R+
Sbjct: 106 GVASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSAAEEVFLRDL 158
Query: 228 -------------------YNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
YN D ++ GSL Y +D+ + +
Sbjct: 159 LPLVSKNSELFPLTSGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVKNDEAEEVEGEGG 218
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
+V ++ L + ++PL L A + + A L D + + + + GE
Sbjct: 219 YVTDD------EERQLPK--GMIPLADLLNADADRNNACLFQEDGYLAMKSIKSIRKGEE 270
Query: 323 IVVWCGPQPNSKLLINYGFVDED 345
I G P ++LL YG+V ++
Sbjct: 271 IFNDYGELPRAELLRRYGYVTDN 293
>gi|119467702|ref|XP_001257657.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119405809|gb|EAW15760.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 492
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 70/280 (25%), Positives = 116/280 (41%), Gaps = 61/280 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ G+ FS+P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVARSDIFDGEELFSIPRGLVLSAQ----NSKLKDLLSQD-LEELGPWLSLILVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G++S W PY + L + + ++ + WS +EL L GS ++I + EG +
Sbjct: 105 LGEQSAWAPYFKVLPK-------SFDTLMFWSPSELQELQGSAIVSKIGK--EGAE---- 151
Query: 230 ELDTVWFMAGSLFQQYPYDIPT----EAFTFEIFKQAFVAVQSCVVHLQKVSLARRF--- 282
D++ M + + P P+ ++ E A + + + L +A F
Sbjct: 152 --DSIMQMIAPVVRANPSLFPSVEGLASWDGEAGSHALLGLAHIMGSL---IMAYAFDIE 206
Query: 283 ---------------------------ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
+VPL L A + + A L +D++ + +
Sbjct: 207 KAEDEDDEDNDEEEGYVTDDEQDQSSKGMVPLADILNADADRNNARLFQEEDSLVMKAIK 266
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
P AGE I G P + LL YG+V DN VVE+
Sbjct: 267 PIHAGEEIFNDYGELPRADLLRRYGYV-TDNYAHYDVVEL 305
>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 561
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 67/287 (23%), Positives = 120/287 (41%), Gaps = 36/287 (12%)
Query: 68 REVVSKKEEDL--GDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAA 125
RE V+ E D+ +L W+ G K++L++ + + R +H +L G+
Sbjct: 105 REDVADLENDVVGAELIDWLQNQGAETKKLMLQQ---YAPEVRGVH---CRNELVPGERI 158
Query: 126 FSVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYI 180
+P + ++T+E +G +T I + + + + +A L LYL+ + ++ +F+ Y
Sbjct: 159 LFIPKNCLITVE--MGKQTEIGQKVLAHNIEFVAPKHIFLILYLLTDMEKKDLTFFKYYY 216
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L P+ WS+ EL++L GS +I ER I+++Y+ +
Sbjct: 217 STL------PSTLKNMPIFWSDQELSWLKGSYILHQIQERKAAIRKDYDAICRA------ 264
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 300
D F+ E F A + V S L + + ALVP L Y + +
Sbjct: 265 -------DPSFSRFSLERFSWARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETS 316
Query: 301 -MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
D + +G + G + N + L+NYGF EDN
Sbjct: 317 WTFDQKLDGFTITSLESICSGAQVYDSYGKKCNHRFLLNYGFAVEDN 363
>gi|242769547|ref|XP_002341787.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218724983|gb|EED24400.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 739
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 60/209 (28%), Positives = 95/209 (45%), Gaps = 29/209 (13%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEIL 218
+LM + + ++ FW PYI+ L G + +PLL+ E + LA+L + A
Sbjct: 138 TFFLMGQYLRREEGFWYPYIQSLP-----GPEELTTPLLFKEEDGDLAWLNMTSLAASRE 192
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQ--------YPYDIPTEAFTFEIFKQAFVA--VQS 268
R + K Y + A S+ Q Y +D+ A T I +AF A + S
Sbjct: 193 RRLQIWKVNYEK-------AYSMMQDLGVENARLYTWDLYLWASTI-ISSRAFTAKVLAS 244
Query: 269 CVVHLQKVSLARRFALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
+ LQ R ++ L P + A + K K A D++ LVV +AG+ +
Sbjct: 245 VIPKLQTAEEGDRISV--LLPLIDATNHKPLSKVEWRAGTDSIGLVVMSDLRAGDEVGNN 302
Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
GP+ N +L++NYGF DNP + VV +
Sbjct: 303 YGPRNNEQLMMNYGFCIPDNPCEYRVVSL 331
>gi|350632383|gb|EHA20751.1| hypothetical protein ASPNIDRAFT_120572 [Aspergillus niger ATCC
1015]
Length = 668
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 54/205 (26%), Positives = 90/205 (43%), Gaps = 7/205 (3%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 77 DAVGEKESTIFFLVGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 131
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 132 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 191
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+S + L+P+ + + K A D + VV AG+ I GP+
Sbjct: 192 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 250
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEV 355
N +L++NYGF NP D +V +
Sbjct: 251 NNEQLMMNYGFCIPGNPCDHRIVSL 275
>gi|317038661|ref|XP_001401929.2| SET domain protein [Aspergillus niger CBS 513.88]
Length = 699
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 54/205 (26%), Positives = 90/205 (43%), Gaps = 7/205 (3%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 103 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 157
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 158 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 217
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+S + L+P+ + + K A D + VV AG+ I GP+
Sbjct: 218 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 276
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEV 355
N +L++NYGF NP D +V +
Sbjct: 277 NNEQLMMNYGFCIPGNPCDHRIVSL 301
>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
Length = 688
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 82/191 (42%), Gaps = 26/191 (13%)
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
+K Q + S W PYI L +++ LW +TEL+YL SP + ER E I
Sbjct: 503 QKFQLQSSAWAPYISCLPEPA-----ELDNTFLWEDTELSYLRASPLYGKTRERLEIITT 557
Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
E+ ++ + LF + + E F + V S + +++ LV
Sbjct: 558 EFGQVQNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVM 603
Query: 287 LGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
+ P+L + + A A + + + DR + I + CG N++L ++YGF
Sbjct: 604 I--PMLDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGF 661
Query: 342 VDEDNPYDRLV 352
+N YD ++
Sbjct: 662 TVPENRYDEVM 672
>gi|159476096|ref|XP_001696150.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
gi|158275321|gb|EDP01099.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
Length = 474
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 97/217 (44%), Gaps = 35/217 (16%)
Query: 157 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
A + L++ K+QG +S P+I +L G PL WS+ +LA L A+
Sbjct: 138 FAKMGAMLLWHKRQGSQSPLAPWIAQLPADTG-------VPLNWSDKQLAALQYPYLVAQ 190
Query: 217 ILERAEGIKREYNEL-DTV-----------------WFMAGSLFQQYPYDIPTEAFTFEI 258
+ E+ +RE+ L DT+ W+ G + + + P T
Sbjct: 191 VKEQ----QREWTALYDTLRGSGMAAGAAPPSREEFWWAMG-VVRSRTFSGPYIGSTLSD 245
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA--VDDAVQLVVDRP 316
+ V + VV L + SL +++A+ PL L ++S ++ ++ D+ +V R
Sbjct: 246 RLRLAGLVAALVVILSR-SL-KQYAICPL-IDLFNHTSAAQSEVSYNYFGDSYSVVASRD 302
Query: 317 YKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+K GE + + G Q N L+ YGF + DNP D V+
Sbjct: 303 FKKGEQVFITYGAQSNDSLMQYYGFAEADNPQDTYVI 339
>gi|308811012|ref|XP_003082814.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
gi|116054692|emb|CAL56769.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
Length = 588
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 75/326 (23%), Positives = 126/326 (38%), Gaps = 49/326 (15%)
Query: 49 RKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSH-NEKH 107
R +R S+ V G + E L W+ + G +V+ + N+
Sbjct: 12 RASRARWTTRSTRARVRGDAQRARASREAYDGLWMWLERRGADVSRVVADAVTTDANDSE 71
Query: 108 RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL----ACLALY 163
R V A L+ G A +P + + R + + L + + +AL
Sbjct: 72 RAQFGVRAKTTLRRGTRAMVIPREVWMDATRATEDADVGAALRDARYDAVKQPWVRVALL 131
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
L+ E+++G + Y+ L + ++SPL WS EL + G+ ++L+ A G
Sbjct: 132 LLKERERGADGEFAAYVATLPK-------TLDSPLFWSADELRDIAGT----QLLDNAAG 180
Query: 224 ----IKREYNELDTVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQS-CVVHLQKVS 277
++ Y EL +F +Y + AF F+ AF ++S + L +
Sbjct: 181 YDAYVRAVYEEL------KNGVFVEYASTFDVDGAFDEASFRWAFGILRSRTMAPLDGAN 234
Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDD---------------AVQLVVDRPYKAGES 322
+A LVP G L+ +SS A A + DR Y G
Sbjct: 235 VA----LVP-GLDLINHSSLSGARWRVGGGGGMGGLFGGGSGSGVAAYVECDRDYDEGAE 289
Query: 323 IVVWCGPQP-NSKLLINYGFVDEDNP 347
I V P+ +SK ++YGF+D NP
Sbjct: 290 IFVNYDPEGIDSKFALDYGFIDVVNP 315
>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 437
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 107/245 (43%), Gaps = 37/245 (15%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+ ++ T+E + + +L T+ LS LA+Y+++ + + K +
Sbjct: 34 FKEGENILTIPSGILWTVEHAYADSILGPVLRSTSLPLSVEDTLAIYILFVRSR-KSGYD 92
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
P R +A S + + E +L G+ + + I+ +Y L
Sbjct: 93 GP----------RNHVAALPASYSSSIFFMEDQLEVCAGTSLYTITKQLEQRIEDDYRGL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVP 286
V M G QYP P + FT E +K A V S + + + L FA
Sbjct: 143 --VVRMLG----QYPDLFPLDKFTVEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFA--- 193
Query: 287 LGPPLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
+L +SS+ K + A + ++ + Y+AG+ + + GP PN++LL YGFV
Sbjct: 194 ---DMLNHSSEAKQCHVYDASSGNLSVLAGKDYEAGDQVFINYGPMPNNRLLRLYGFVVP 250
Query: 345 DNPYD 349
NP D
Sbjct: 251 GNPND 255
>gi|226294776|gb|EEH50196.1| SET domain-containing protein [Paracoccidioides brasiliensis Pb18]
Length = 488
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 59/260 (22%), Positives = 102/260 (39%), Gaps = 49/260 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINEEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP--------TKAEILERA- 221
G S W PY + L ++ + W++ EL L GS T E+ R
Sbjct: 106 GAASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSTAEEVFLRDL 158
Query: 222 -------------EGIKREYNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
G YN D ++ GSL Y +D+ + +
Sbjct: 159 LPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVENDE------AEE 212
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
V ++ L + ++PL L A + + A L D + + + + GE
Sbjct: 213 VEGEDGYVTDDEERQLPK--GMIPLADLLNADADRNNARLFQEDGYLSMKSIKSIRKGEE 270
Query: 323 IVVWCGPQPNSKLLINYGFV 342
I G P ++LL YG+V
Sbjct: 271 IFNDYGELPRAELLRRYGYV 290
>gi|225678514|gb|EEH16798.1| SET domain-containing protein RMS1 [Paracoccidioides brasiliensis
Pb03]
Length = 488
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 58/260 (22%), Positives = 101/260 (38%), Gaps = 49/260 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P LV++ + N + +L+ N+ L + CL L ++YE Q
Sbjct: 50 IVAYDDINEEEELFAIPQGLVLSFQ----NSKLKDLMEINERDLGQWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--------- 221
G S W PY + L ++ + W++ EL L GS I + A
Sbjct: 106 GAASPWAPYFKVLPTD-------FDTLMFWTDAELLELKGSAVLGRIGKSAAEEVFLRDL 158
Query: 222 -------------EGIKREYNELD------TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
G YN D ++ GSL Y +D+ + +
Sbjct: 159 LPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVENDE------AEE 212
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
V ++ L + ++PL L A + + A L D + + + + GE
Sbjct: 213 VEGEDGYVTDDEERQLPK--GMIPLADLLNADADRNNARLFQEDGYLAMKSIKSIRKGEE 270
Query: 323 IVVWCGPQPNSKLLINYGFV 342
I G P ++LL YG+V
Sbjct: 271 IFNDYGELPRAELLRRYGYV 290
>gi|320170159|gb|EFW47058.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 640
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 76/300 (25%), Positives = 118/300 (39%), Gaps = 35/300 (11%)
Query: 64 VAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGD 123
A VS L L +W+ GL ++ +P N+ Y+ AS ++A
Sbjct: 156 AAAQLAAVSTPRGALARLTAWIDNAGL---EINSNARPGLNDVDE--LYLFASNPIEAAT 210
Query: 124 AAFSVPNSLVV--TLERVLGNETIAELLTTNKLSELA----CLALYLMYEKKQGKKSFWL 177
+VP LV+ T R L N I + K + LA+ L+YE + KS W
Sbjct: 211 LVATVPAPLVMFETYLRTLENPMILAIDRRFKTMSVPDPSYALAMALLYESYE-PKSMWR 269
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP--TKAEILERAEGIKREYNELDTVW 235
+I L + ++S + WS E L P K +ILER +++ YN
Sbjct: 270 EWISSLPQ-------TLDSTVFWSAEEQDALQSLPLKRKTQILER--HLQQLYNA----- 315
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGPPLLAY 294
L +P+ +++E+FK A++ V S + + L PL L
Sbjct: 316 -TTPRLLAAFPHIFAGGNYSYEMFKWAYMIVDSRSLTFSTGPDTLPQIMLAPLVDLLHHD 374
Query: 295 SSKCKAMLAAVDDAV-----QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+ L + V L R K GE +V G PN +LL+ +G NPY+
Sbjct: 375 PVQTNIQLGVHPEEVLGFEISLKTTRAIKKGEPLVRHIGELPNHQLLLRFGLAMPRNPYE 434
>gi|134074534|emb|CAK38827.1| unnamed protein product [Aspergillus niger]
Length = 625
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 54/205 (26%), Positives = 90/205 (43%), Gaps = 7/205 (3%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ + E +L+ + +G + FW PYIR L Q G ++ +P + +L +L G+
Sbjct: 29 DAVGEKESTIFFLIGQYLRGTEGFWYPYIRTLP-QPG----SLTTPPYYEGEDLQWLDGT 83
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
A +R E +K +Y + T AG Y +D+ A + I + V S V
Sbjct: 84 SLLAAREKRLEVLKEKYEKGSTELRNAGFEGADAYTWDLYLWAASMFISRAFSAKVLSGV 143
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+S + L+P+ + + K A D + VV AG+ I GP+
Sbjct: 144 FPETDLSEEKLSVLLPI-IDMGNHRPLAKVEWRAGKDDIAFVVLEDVWAGQEISNNYGPR 202
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEV 355
N +L++NYGF NP D +V +
Sbjct: 203 NNEQLMMNYGFCIPGNPCDHRIVSL 227
>gi|224077384|ref|XP_002305239.1| SET domain protein [Populus trichocarpa]
gi|222848203|gb|EEE85750.1| SET domain protein [Populus trichocarpa]
Length = 518
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 66/151 (43%), Gaps = 21/151 (13%)
Query: 56 RVSSSDTLVAGSREVVSKKEEDLGD------LKSWMHKNGLPPCKVILKEKP-------- 101
R+ +S T++ + K+ ED G W G+ C L P
Sbjct: 9 RIWASFTVLRRNSRQTKKEMEDAGQDEGFERFLKWAANLGISDCTTNLSLHPQSPTSCLG 68
Query: 102 -SHNEKHRPI---HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL 157
S H P +AA DL+ G+ VP S+++T + +L +E + + N S L
Sbjct: 69 HSLTVSHFPDAGGRGLAAVRDLKKGELVLRVPKSVLITRDSLLKDEKLCSFVNNNTYSSL 128
Query: 158 A---CLALYLMYEKKQGKKSFWLPYIRELDR 185
+ LA+ L+YE +GK S+W PY+ L R
Sbjct: 129 SPTQILAVCLLYEMGKGKSSWWYPYLMHLPR 159
>gi|428179206|gb|EKX48078.1| hypothetical protein GUITHDRAFT_106158 [Guillardia theta CCMP2712]
Length = 410
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 97/215 (45%), Gaps = 21/215 (9%)
Query: 153 KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP 212
+L E L+L+L+ EK + ++S W +IR + + ++ WSE +A L P
Sbjct: 19 QLCERQLLSLHLLVEKWKAERSRWWRFIRSIPP-------SYDTLENWSEQSVARLQYKP 71
Query: 213 TKAEILERAEGIKREYNELDTV--------WFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
A R + E+++L + W + + + +F+ E + A
Sbjct: 72 FLAIAARRKRVVNDEFSQLQRLLSRCKKRSWNEPEAAEEAERIQLGFSSFSREDYLWAAG 131
Query: 265 AVQSCVVHLQKVS-LARRFALVPLGPPLLAYSSKCKAMLAAV---DDAV--QLVVDRPYK 318
V + H ++ S + R V P+L + + A +AA DA+ ++ R Y+
Sbjct: 132 TVSTRSCHYERKSGYSLRGETVGCLVPVLDFLNHSTAPVAACGFCKDAMVYRVTCLRSYE 191
Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
GE +++ G N+ LL +YGFV EDNP D ++
Sbjct: 192 EGEQVMIHYGNWSNAGLLEHYGFVLEDNPLDSCML 226
>gi|320168265|gb|EFW45164.1| hypothetical protein CAOG_03170 [Capsaspora owczarzaki ATCC 30864]
Length = 464
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 94/212 (44%), Gaps = 37/212 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELA---------CLA 161
V A D+ AG +VP +L++T E+ +ET +L+T+ L +EL+ L
Sbjct: 170 VIARRDIPAGQTFINVPEALMMTAEKARKSETF-QLITSGALDSTELSPAMAKLDNFLLR 228
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
++L+ E+++G S+W PYI +L QR R PL ++E EL L SP E +
Sbjct: 229 MFLIVERRRGGNSYWSPYI-DLLPQRFR------LPLYFTEAELELLKPSPALQEAFVQL 281
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
+ R+Y + ++QY E+ + A + S H + + RR
Sbjct: 282 RNVVRQY-----------AAWKQY-------LMMLELARAAELPSGSGDAHQKILDQRRR 323
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 313
+P+ L Y C A A Q+VV
Sbjct: 324 AQAMPVRYNELTYDLFCWASSAVATRQNQIVV 355
>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 455
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/247 (23%), Positives = 107/247 (43%), Gaps = 29/247 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++ + T+E+ + + +L + + LS LA+YL++ +
Sbjct: 28 VKALRSFKEGERILTIPSACLWTVEKAYADPLLGPVLRSAQPPLSVEDALAVYLLFVRS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
R + R +A + + ++E EL GS A + + ++
Sbjct: 87 ----------RTSGYEGQRHHIAAMPQSYSASIFFTEDELQVCAGSSLYALTRQLEQRVR 136
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
+Y +L L Q+ P + FT E +K A ++ S + VS LV
Sbjct: 137 DDYRQLLV------PLLSQHRDLFPLDQFTIEDYKWALCSIWSRAMDF-AVSGTTSVRLV 189
Query: 286 PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+L +S +C A D + ++ + Y+ G+ + ++ G PN++LL YGFV
Sbjct: 190 APLADMLNHSPDVKQCHAYDPTSGD-LSILAAKDYQVGDQVFIYYGSVPNNRLLRLYGFV 248
Query: 343 DEDNPYD 349
DNP D
Sbjct: 249 LPDNPND 255
>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
Length = 403
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 68/269 (25%), Positives = 109/269 (40%), Gaps = 33/269 (12%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ +NG + E S++++ R +H A+ DL+ + VP ++T+E +G
Sbjct: 37 WLTENGGKFADCV--ELRSYDDEVRGVH---ATRDLETEEILVEVPLKCLITVE--MGKA 89
Query: 144 TIAELLTTNKLSELAC-----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
T EL L L+++ +++ +F+ PY L P+
Sbjct: 90 TDVGRAVLEAELELDAPKHVFLMLFVLLDRRDSS-TFFAPYYDIL------PSTLSNMPI 142
Query: 199 LWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
W EL +L GS +I ER IK +Y + +W P I + T E
Sbjct: 143 FWQPDELEWLKGSYLLTQIEERKRAIKADYEAICGIW----------PSFI--DVCTLEE 190
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CKAMLAAVDDAVQLVVDRPY 317
FK A + V S + V+ AR A+VP L + + K A + +
Sbjct: 191 FKWARMCVCSRNFGVV-VNGARTSAMVPYADMLNHFRPRETKWTFDNSRGAFTITSLQKI 249
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDN 346
G I G + N + L+NYGF EDN
Sbjct: 250 SVGSQIYDSYGQKCNHRFLLNYGFAIEDN 278
>gi|281207217|gb|EFA81400.1| mRNA-decapping enzyme 2 [Polysphondylium pallidum PN500]
Length = 1078
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 128/303 (42%), Gaps = 54/303 (17%)
Query: 61 DTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQ 120
D + G ++ +++D ++++W + P + EK + +S D++
Sbjct: 12 DIRIGGQTVQLTFRKDDGINIQTWKQDSKQPLLSLTPNEKG-----------IFSSRDIK 60
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLA--------LYLMY--EKKQ 170
G+ S+P +++ +V + + L NK+ +L A LY Y +
Sbjct: 61 EGEELLSLPWYNSLSMNKV--QQQLPWLF--NKIQDLELTAEDGLVVALLYYRYCMDDLS 116
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
S W + E+ + S L +S+ E L GSP +++ + K +
Sbjct: 117 FDYSEWFSAMPEV----------LNSGLFFSDAEAELLNGSPAYIDLMNQRLDAKELFGR 166
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPL 287
L SLF++ + A T++ K A+ V S ++ + +L F V L
Sbjct: 167 L-------KSLFKEQQFS--KCAMTYDRLKWAYSVVDSRKIYTEAPNLDANGNPFITVVL 217
Query: 288 GPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQP-NSKLLINYGF 341
P L Y + + AA D A+++V +P K GE I + G Q NS LLI+YGF
Sbjct: 218 AP-FLDYFNHAEDAQAAYDFDYDESAIKVVALQPIKKGEQIFLNYGNQDCNSDLLIHYGF 276
Query: 342 VDE 344
+D+
Sbjct: 277 IDQ 279
>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
Length = 460
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 105/242 (43%), Gaps = 33/242 (13%)
Query: 133 VVTLERVLGNETIAEL--------------LTTNKLSELACLALYLMYEKKQGKKSFWLP 178
++ +L NE I E+ + +N + LA+ L+ E + KKSFW P
Sbjct: 102 LIASRNILKNEKIIEISENLMFDKFEHNLEINSNGSDNYSDLAIKLLVELFKNKKSFWFP 161
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YI L + L W EL ++ GS + +K +Y ++
Sbjct: 162 YIGILPEEYDLKLL-----FRWPLKELFFIKGSRLSKASDYLKKKLKAQYEMVNK----- 211
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGPPLLAYS 295
+FQ+ P++ F ++ ++ + + S + LQ KV L L+ P ++
Sbjct: 212 -EVFQRNRLLYPSKIFNYQNWEWSMSILLSRTISLQETKKVVLIPYIDLLNHNPFSSSFI 270
Query: 296 SKCKAMLAAVDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
S K L+ D+ ++VV D+ + + + G + N +LL YGF+ E NPYD +++
Sbjct: 271 SYRKIPLS---DSKEIVVYSDKNCNKFDQLYISYGQKSNLELLNLYGFIAERNPYDSVII 327
Query: 354 EV 355
+
Sbjct: 328 RI 329
>gi|326480913|gb|EGE04923.1| SET domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 692
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/206 (23%), Positives = 82/206 (39%), Gaps = 35/206 (16%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++ +E+ +G+ S W PY+ L R + S L + +++L +L G+
Sbjct: 108 LAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+K EY+ ++ G L E+++++IF A+ + S + +
Sbjct: 163 YRNTVKEEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIAS------RAFTS 208
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 323
R P L + + ML VD + + L V P GE I
Sbjct: 209 RVLDAYLSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEI 268
Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYD 349
GP N +L+ YGF DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|403414266|emb|CCM00966.1| predicted protein [Fibroporia radiculosa]
Length = 420
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 76/318 (23%), Positives = 136/318 (42%), Gaps = 54/318 (16%)
Query: 61 DTLVAGSREVVSKKEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASED 118
D +V+ + +VV+ K+W+ +NG P E+ ++ V AS+D
Sbjct: 8 DGIVSANGDVVA--------FKNWLAENGAEFHPHAAFRTERSGYS--------VIASQD 51
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELL----TTNKLSE--LAC--LALYLMYEKKQ 170
L++ S P SL +T E + + LL T SE L C + ++ + +
Sbjct: 52 LRSDTTVVSCPFSLAITPE--VSKNALTTLLGPTFTGQSWSERQLICSYICMHWILDPSA 109
Query: 171 GKKSFWLPYIREL---DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
+ PYIR L D+ R +PL +S+TEL L GS L+R + E
Sbjct: 110 SSELAHWPYIRMLPAPDKLR--------TPLHFSDTELEALKGSNLYGATLDRRRDWQSE 161
Query: 228 YNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
+ + + TV G F Y + + F ++ +V S +
Sbjct: 162 WEQCQKTIATVDLTWGEQFSWERYLSASTYLSSRAFPSMVLSPNPSLV-----STEESYP 216
Query: 284 LVPLGPPLLAYS-----SKCKAMLAAVD-DAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
++ G L +S S ++ + D + + LV+ + AG ++ GP+PN++L++
Sbjct: 217 VLLPGIDSLNHSRGQPVSWVVSIGTSSDVNRISLVLHKSTPAGSELLNNYGPKPNAELIL 276
Query: 338 NYGFVDEDNPYDRLVVEV 355
YGF +NP D +V+++
Sbjct: 277 GYGFSLPENPDDTIVLKI 294
>gi|449283795|gb|EMC90389.1| SET domain-containing protein 4 [Columba livia]
Length = 440
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 65/274 (23%), Positives = 116/274 (42%), Gaps = 31/274 (11%)
Query: 97 LKEKPSHNEKHRPIHY------VAASEDLQAG-DAAFSVPNSLVVTLERVLGNETIAELL 149
LK++ + RP + + ++ LQ D S+P ++T + VL + + E +
Sbjct: 39 LKDRGFEDSHLRPAEFWDTGRGLMTTKTLQVSRDLIISLPEKCLLTTDTVLSS-CLGEYI 97
Query: 150 TTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
K +S L L +L+ EK G+KS W PY+ L + P+ E ++
Sbjct: 98 MKWKPPVSPLTALCTFLIAEKHAGEKSLWKPYLDVLPK-------TYSCPVC-LEHDVVS 149
Query: 208 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 267
L P + + E+ + Y + LF + I F + + A+ +
Sbjct: 150 LLPEPLRKKAQEQRTKVHELYISSKAFFSSLQPLFAENTETI----FNYSALEWAWCTIN 205
Query: 268 SCVVHLQKVSLARRFALVP----LGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKA 319
+ +++ K S + F+L P L P LL +S + KA + ++ + K
Sbjct: 206 TRTIYM-KHSQRKCFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQTRSYEIRTNSLCKK 264
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
E + + GP N +LL+ YGFV DNP+ + V
Sbjct: 265 YEEVFICYGPHDNQRLLLEYGFVAMDNPHSSVYV 298
>gi|345326326|ref|XP_001512617.2| PREDICTED: SET domain-containing protein 4-like [Ornithorhynchus
anatinus]
Length = 499
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 109/250 (43%), Gaps = 22/250 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A++ L+AG+ S+P + ++T + VL + + + K +S L L +L+ EK+
Sbjct: 62 LMATKSLKAGEMIISLPEACLLTTDTVL-KSPLGDYIWKWKPPVSPLLALCTFLIAEKQA 120
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G +S W PY+ L + A P+ L+ L P E+ + RE
Sbjct: 121 GARSLWQPYLGVLPQ-------AYTCPVGLDAAVLSLLP-QPLGRRAREQRTAV-RELFA 171
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPL 287
+F SL + D+ FT + A+ V + V H Q+ + + L
Sbjct: 172 ASRAFF--SSLQPLFSEDV-ERVFTLDALGWAWCTVNTRTVYMEHAQRDCFSAEADIYAL 228
Query: 288 GP--PLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
P LL +S ++ +A ++ + E +++ GP N +LL+ YGFV
Sbjct: 229 APYLDLLNHSPGAQVEAAFNKETRCYEIRTASRCRKYEEVLICYGPHDNRRLLLEYGFVC 288
Query: 344 EDNPYDRLVV 353
+NP+ +VV
Sbjct: 289 SNNPHSNVVV 298
>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
Length = 572
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 62/256 (24%), Positives = 113/256 (44%), Gaps = 43/256 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
V A + + A + +P S ++TLE + T+A+ + +L L+ L+ +L+ EK
Sbjct: 165 VNAKQTINAKELILFIPKSHMITLE-MAKETTVAKKMMQFRLDLLSPKHSFLSTFLLQEK 223
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ SFW PYI L P+ ++ ++L +L GSP +I ++ ++++Y
Sbjct: 224 FRPN-SFWKPYIDILPSSYPSF------PIFYNNSDLEWLKGSPFLKQIKDKLADLQKDY 276
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
N++ V F QY F F A + S + + ++ + A VPL
Sbjct: 277 NDICNV----VPEFTQY---------QFHEFCWARMTASSRIFGI-NINGVKTDAFVPLA 322
Query: 289 -------PPLLA--YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
P L + YS + + + D+ ++ G+ I G + NS+ +NY
Sbjct: 323 DMLNHKRPKLTSWCYSDEKQGFIIETDEKIE--------RGQMIFDSYGRKCNSRFFLNY 374
Query: 340 GFVDEDNPYDRLVVEV 355
GFV E N + + + V
Sbjct: 375 GFVVEGNDANEVNLAV 390
>gi|326473914|gb|EGD97923.1| hypothetical protein TESG_05224 [Trichophyton tonsurans CBS 112818]
Length = 692
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/206 (23%), Positives = 82/206 (39%), Gaps = 35/206 (16%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++ +E+ +G+ S W PY+ L R + S L + +++L +L G+
Sbjct: 108 LAFFVAHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDSDLDWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+K EY+ ++ G L E+++++IF A+ + S + +
Sbjct: 163 YRNTVKEEYDSAISILRDEGCL--------AVESYSWDIFCWAYTLIAS------RAFTS 208
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 323
R P L + + ML VD + + L V P GE I
Sbjct: 209 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFTGEEI 268
Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYD 349
GP N +L+ YGF DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|432901733|ref|XP_004076920.1| PREDICTED: SET domain-containing protein 4-like [Oryzias latipes]
Length = 441
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 110/248 (44%), Gaps = 25/248 (10%)
Query: 117 EDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKS 174
+ +Q G S+P S ++T VL + + L + K S L L ++L+ E+ +G+ S
Sbjct: 66 QPIQPGGMLVSLPESCLLTTSTVL-HSYLGPFLKSWKPRPSSLVALCVFLVCERHRGEAS 124
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYL-TGSPTKAEILERAEGIKREYNELDT 233
W PYI L + P +++T +A L +G +AE E+ EG++ Y +
Sbjct: 125 DWFPYIDVLP-------CSYCCPPYFTDTVMAVLPSGVRRRAE--EQREGLQHLY-AVHQ 174
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-----VSLARRFALVPLG 288
+FM+ +P P E T+E + A+ ++ + V + + +S +AL P
Sbjct: 175 DFFMSLQPVLSHP---PEEVLTYEALRWAWCSINTRSVFMDRPSSSFLSGPDNYALAPF- 230
Query: 289 PPLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
LL + + KA ++ + + G N +LL+ YGFV N
Sbjct: 231 LDLLNHRPDVQVKAGFNRTSGCYEIRSISGVQRYHQAFINYGSHDNQRLLLEYGFVSSCN 290
Query: 347 PYDRLVVE 354
P+ + VE
Sbjct: 291 PHSVIYVE 298
>gi|224042477|ref|XP_002188626.1| PREDICTED: SET domain-containing protein 4 [Taeniopygia guttata]
Length = 457
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 122/312 (39%), Gaps = 52/312 (16%)
Query: 81 LKSWM------HKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSV 128
LKS+M HK K LKE+ + RP + + ++ LQAGD S+
Sbjct: 17 LKSFMDGVNCSHKLEYIKLKKWLKERGFEDSNLRPAEFWETGRGLMTTKALQAGDLIISL 76
Query: 129 PNSLVVT----LERVLGNE------------TIAELLTTNKLSELACLALYL---MYEKK 169
P ++T L LG + L L L C L + EK
Sbjct: 77 PEKCLLTTGTVLSSCLGGHIEKWKPPVSPLLALCTFLIGQNLELLECFQFLLVNGIAEKH 136
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G+KS W PY+ L + A P E ++ L P + + E+ I+ +
Sbjct: 137 AGQKSPWKPYLDVLPK-------AYTCPAC-LEPDIINLLPKPLQKKAQEQKMLIQELFQ 188
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP--- 286
+ LF + +I F F + A+ V + +++ K F+L P
Sbjct: 189 SSRAFFSSLQPLFAEDTGNI----FNFSALQWAWCTVNTRTIYM-KHPHRECFSLEPDVY 243
Query: 287 -LGP--PLLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
L P LL +S + KA + ++ D K + +++ GP N +LL+ YGF
Sbjct: 244 ALAPYLDLLNHSPNVQVKAGFNEQTRSYEIWTDSQCKKYQEVLICYGPHDNQRLLLEYGF 303
Query: 342 VDEDNPYDRLVV 353
V DNP+ + V
Sbjct: 304 VATDNPHSSVYV 315
>gi|118395738|ref|XP_001030215.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila]
gi|89284510|gb|EAR82552.1| hypothetical protein TTHERM_01108540 [Tetrahymena thermophila SB210]
Length = 1709
Score = 52.8 bits (125), Expect = 2e-04, Method: Composition-based stats.
Identities = 41/148 (27%), Positives = 70/148 (47%), Gaps = 37/148 (25%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLG-------NETIAELLTTNKLSEL----AC-- 159
+AA +D+ ++PN L+++ ++V G + +++ N+ EL C
Sbjct: 954 IAADQDISPQKVILAIPNKLIISEDKVYGCDLEEVLEKIQQQIIKQNRFPELFDEEKCGD 1013
Query: 160 -----LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
LALYLMYEK +G++SFW PY EL+++ + L WS ELA S
Sbjct: 1014 ADFNILALYLMYEKLKGEQSFWHPYF-ELNQKS-------YTLLDWSTEELAQFEDSY-- 1063
Query: 215 AEILERAEGIKREYNELDTVWFMAGSLF 242
I +E N+ + ++F+ S+
Sbjct: 1064 ---------ILQEVNQSNQIFFLQQSVL 1082
>gi|357145323|ref|XP_003573603.1| PREDICTED: SET domain-containing protein 4-like [Brachypodium
distachyon]
Length = 532
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/246 (23%), Positives = 105/246 (42%), Gaps = 28/246 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ ASE++ G A +P SL+++ E + ++ L N ++ L L+ M E+
Sbjct: 178 MVASENIGVGHIALEIPESLIISEELLCQSDMFLALKDLNSITTETMLLLWSMRERHNPS 237
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+F + + L G L + LA L G+ E+++ + + ++Y+EL
Sbjct: 238 SNFKM-FFETLPSNFNTG-------LNFGIGALAALEGTLLFDELMQARQHLHQQYDELF 289
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
+ L ++P + +T++ F A S + + S L+P+
Sbjct: 290 PM------LCTKFPEIFTQDIYTWDNFLWACELWYSNSMMVVLSSGKLTTCLIPVAGLLN 343
Query: 289 ----PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-D 343
P +L Y +A +++ + RP KAG+ + G S L+ YGF+
Sbjct: 344 HSVYPHILNYGRVDQAT-----KSLKFPLSRPCKAGQQCFLSYGKHSGSHLITFYGFLPR 398
Query: 344 EDNPYD 349
EDNPYD
Sbjct: 399 EDNPYD 404
>gi|260819628|ref|XP_002605138.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
gi|229290469|gb|EEN61148.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
Length = 453
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 65/296 (21%), Positives = 122/296 (41%), Gaps = 33/296 (11%)
Query: 67 SREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAF 126
+R V EE W+H+NG C+ + + E R + A++ L+ +
Sbjct: 19 TRPVSLAHEESFVRFFQWLHRNG---CRNVPLKPAVFPETGRGL---MATKALKHEELIL 72
Query: 127 SVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELD 184
+P L++T++ ++ + +A + ++L+ LA++LM EK + +KSFW PYI L
Sbjct: 73 VIPKRLLITIDAIM-DSYLAPYIERADSQLTPSQALAVFLMCEKCRREKSFWRPYIDILP 131
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQ 244
+ P ++E + L S + + + +E+ EL + M LF
Sbjct: 132 EE-------YTCPAFFTEEDFRLLPNS-LRGKAKAKKYECHKEFMELAPFFKMLADLFPD 183
Query: 245 YPYDIPTEAFTFEIFKQAFVAVQS----------CVVHLQKVSLARRFALVPLGPPL-LA 293
+AF F+ FK A+ A+++ L+ + PL + A
Sbjct: 184 -----QEDAFNFKDFKWAWSAIKTRAFDVPLGGETCYRLRDSEDTSNPTMFPLVDSINHA 238
Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+K + ++ + Y+ ++ G N LL+ +GFV NP D
Sbjct: 239 AQAKIRHRYNEKRRCLESRTETVYRRHAEVMNSYGRADNDNLLLEFGFVVPGNPAD 294
>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 813
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 63/247 (25%), Positives = 100/247 (40%), Gaps = 24/247 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A D G+ +P TL L ++ + + + + +AL++ E+ +G+K+
Sbjct: 26 ALRDCARGEVLLEIPLERGFTLAAALEDDAVKRVASCCARHD-DVVALHVCAERFRGEKA 84
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
++ L R + ++ WSE EL LTG+ E + E K +Y L
Sbjct: 85 TRAAHVATLPR-------SFDTAFFWSEEELRELTGTTCLRETMNLREETKNDYETLTKK 137
Query: 235 W--FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
G +++ D +E + A + S L R A+VP +
Sbjct: 138 MEAIGEGGWMREHEVD-------YERYAWARSNLWSRQCDLLMPDGKRTRAMVPT-FDIF 189
Query: 293 AYSSKC----KAMLAAVDDAVQLVVDRPYKAGES--IVVWCGPQPNSKLLINYGFVDEDN 346
+S+K L A + V + YKAGE I G NSKLL YGF +DN
Sbjct: 190 NHSAKAPLGKTHKLNAEKNCVTVYAADDYKAGEQAFISYGSGEAANSKLLTWYGFCIDDN 249
Query: 347 PYDRLVV 353
PY+ L V
Sbjct: 250 PYEELDV 256
>gi|396468374|ref|XP_003838159.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
gi|312214726|emb|CBX94680.1| hypothetical protein LEMA_P116830.1 [Leptosphaeria maculans JN3]
Length = 660
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 63/275 (22%), Positives = 111/275 (40%), Gaps = 58/275 (21%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLE-----RVL-----GNETIAELLTTNKLSELACLAL 162
+ A+ D+ A F++P + ++ +E R+L G AE L A L L
Sbjct: 41 IVATRDIPAETTLFTIPRNAIINVETSDLARLLPGIFDGTLNDAEDEKAEPLDPWASLIL 100
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
++ E G++S+W PYI L + ++P+ W++ EL L G+ AE + ++E
Sbjct: 101 VMLREYLHGEQSYWKPYIDIL-------PTSFDTPIFWTQDELKELEGTVLTAEKIGKSE 153
Query: 223 ------------------------GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
I +L + GS Y +D+ + +
Sbjct: 154 SDEMLRTHVLPIVTQNPTAFCPKGAIPLNEEDLLALAHRIGSTIMSYAFDLDDDKEESDA 213
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD---R 315
++ +V + + L +VP+ L A + A V+ +L V
Sbjct: 214 EEEGWVEDRDGLTML---------GMVPMADVLNANAD----FNAHVNHGEKLEVTSLRS 260
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVD-EDNPYD 349
+AG I+ + GP P+S+LL YG+V E + YD
Sbjct: 261 DIRAGTEILNYYGPLPSSELLRRYGYVTPEHHRYD 295
>gi|169626351|ref|XP_001806576.1| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
gi|160705819|gb|EAT76160.2| hypothetical protein SNOG_16462 [Phaeosphaeria nodorum SN15]
Length = 474
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 74/305 (24%), Positives = 120/305 (39%), Gaps = 55/305 (18%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER-VLG 141
+W+ ++G+ I E + + R V A++D+ + F +P + ++++E +L
Sbjct: 13 AWLRRSGVEISPKIQLEDLRNAQAGRG---VVATQDIPEHELLFRIPRTAILSVENSILS 69
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
E A T L L L ++YE G S W PY L + + + WS
Sbjct: 70 TEIPA--ATFEMLGPWLSLILVMLYEYINGDASNWAPYFSVLPTE-------FNTLMFWS 120
Query: 202 ETELAYLTGSPTKAEI-------------------------------LERAEGIKREYNE 230
E ELA L S +I +RAE ++ E N
Sbjct: 121 EDELAELQASAVLNKIGKEGANEAFMEQLLPIIKEFADIFFAGDERAKQRAEEMRDERNV 180
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
L + GSL Y +D+ A + + + A + L K ++PL
Sbjct: 181 L--LMHKMGSLIMAYAFDVEP-ATSRKDVDEEGFAEEEEDEALPK-------GMIPLADM 230
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
L A + A L + +++ +P +AGE + GP P S LL YG+V DN
Sbjct: 231 LNADADCNNARLFYEEKYLEMKALKPIRAGEEVFNDYGPLPRSDLLRRYGYVT-DNYAQY 289
Query: 351 LVVEV 355
VVE+
Sbjct: 290 DVVEI 294
>gi|119500300|ref|XP_001266907.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119415072|gb|EAW25010.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 704
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 91/205 (44%), Gaps = 27/205 (13%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ + +G + FW PYIR L + L++ +PL + +L +L G+ +R
Sbjct: 124 FFLIGQYLKGSEGFWFPYIRTLPQP-----LSLTTPLYYEGGDLRWLDGTSLAPAREQRM 178
Query: 222 EGIKREYNELDTVWFMAGSLFQ---QYPYDI--------PTEAFTFEIFKQAFVAVQSCV 270
K +Y T AG FQ QY +D+ + AF+ ++ +A V+
Sbjct: 179 GVWKEKYKNGITELRKAG--FQDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE--- 233
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+ VS+ L+P L+ + K A V VV +G+ I GP+
Sbjct: 234 LPEDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVGSGQEISNNYGPR 287
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEV 355
N +L++NYGF DNP D +V +
Sbjct: 288 NNEQLMMNYGFCLPDNPCDYRIVSL 312
>gi|115386294|ref|XP_001209688.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114190686|gb|EAU32386.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 486
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 72/277 (25%), Positives = 115/277 (41%), Gaps = 56/277 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A D+ + F++P LV++ + N + +LL+ + L EL L L +MYE
Sbjct: 50 VVAQTDIPENEELFTIPRDLVLSTQ----NSKLKDLLSQD-LEELGPWLSLMLVMMYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP---------TKAEILER 220
G +S W Y + L R+ ++ + W+ +EL L GS ILE
Sbjct: 105 LGDQSTWAAYFKVLPRK-------FDTLMFWTPSELLELQGSAVIDKIGRQGADESILEM 157
Query: 221 AEGIKREYNEL----------------DTVWFMA---GSLFQQYPYDI--PTEAFTFEIF 259
I R + L + +A GSL Y +DI P +
Sbjct: 158 IAPIVRAHPSLFPPVDGLPSYDGDAGTQALLHLAHTMGSLIMAYAFDIEKPEDEDEEGDG 217
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
+ ++ + ++ L++ +VPL L A + + A L ++A+ + +P
Sbjct: 218 EGGYMTDE------EEEQLSK--GMVPLADLLNADADRNNARLFQDENALVMKAIKPIAK 269
Query: 320 GESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVVEV 355
GE I G P + LL YG+V D PYD VVEV
Sbjct: 270 GEEIFNDYGEIPRADLLRRYGYVTDNYAPYD--VVEV 304
>gi|387197713|gb|AFJ68815.1| set domain protein, partial [Nannochloropsis gaditana CCMP526]
Length = 327
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 53/218 (24%), Positives = 96/218 (44%), Gaps = 31/218 (14%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ +G K+ E PS+ + I A +D+ + + S+P L++T + L +
Sbjct: 74 AWLRAHGARCDKI---EWPSYATGSQ-IRGAVALDDINSNEDMVSIPEPLLLTPDVALKD 129
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
I ++ N S+ L + LM+E+ +G+ SF+ PY+ L R ++ L W
Sbjct: 130 PDIGKVFEDNLEDFSDEDMLLILLMHERGKGETSFFYPYLATLPR-------LPDTLLNW 182
Query: 201 SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIP------TE 252
+E L++L E+ R + Y L AG LF + P D +
Sbjct: 183 NEEGLSWLQDEGLSLEVFLRESQLTAHYTRLVEEKLKAGWPGLFGEAPDDASDSESKGAD 242
Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRF---ALVPL 287
++ E F+ A++ +Q+ + RR AL+PL
Sbjct: 243 PYSLENFRFAWLTIQA-------RAFGRRLPYSALIPL 273
>gi|387191841|gb|AFJ68625.1| set domain-containing protein [Nannochloropsis gaditana CCMP526]
Length = 736
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/202 (20%), Positives = 80/202 (39%), Gaps = 22/202 (10%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA+ L+ E+ +G +SFW PY+R L + P+ ++ +E + +
Sbjct: 260 LAVLLVAERMKGPQSFWWPYLRNLPEK------YAHMPIFYNNSEFGSIQIPSLMRTVQS 313
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
R + ++ +Q + P E + + C + +
Sbjct: 314 RCRMLVN----------ISDGYLRQLSHGGPAENPFLDDVHANDMGWGLCAASSRALRNI 363
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNS 333
PL P++ + + + D ++QLV R + G+++ + G N
Sbjct: 364 PGLGSTPLMVPVIDFCEHAVSPTCYIKDYRKSGGSIQLVAGRDLQPGDALTISYGNLTNP 423
Query: 334 KLLINYGFVDEDNPYDRLVVEV 355
+LL++YGF DNP+DR V +
Sbjct: 424 QLLLDYGFTLSDNPHDRFEVTL 445
>gi|302658278|ref|XP_003020845.1| SET domain protein [Trichophyton verrucosum HKI 0517]
gi|291184711|gb|EFE40227.1| SET domain protein [Trichophyton verrucosum HKI 0517]
Length = 692
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/206 (22%), Positives = 81/206 (39%), Gaps = 35/206 (16%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++++E+ +G+ S W PY+ L R S L + + +L +L G+
Sbjct: 108 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----EFTSALFYQDNDLEWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
++ EY+ ++ G L E++ ++IF A+ + S + +
Sbjct: 163 YRNAVQEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIAS------RAFTS 208
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 323
R P L + + ML VD + + L V P +GE +
Sbjct: 209 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTSSGEEV 268
Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYD 349
GP N +L+ YGF DNP D
Sbjct: 269 HNNYGPLNNQQLMTTYGFCIVDNPCD 294
>gi|403350379|gb|EJY74649.1| SET domain containing protein [Oxytricha trifallax]
Length = 2165
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 66/285 (23%), Positives = 117/285 (41%), Gaps = 61/285 (21%)
Query: 97 LKEKPSHNEKHRPIHYVA------ASEDLQAGDAAFSVPNSLVVTLERVL----GNETIA 146
L++ SH EK + +Y A A+ D++ G+ VP ++TLE + G +
Sbjct: 154 LEQGGSHFEKLKIRYYTADYRGVHAARDIKKGEIILYVPKHQIITLEMAMTSPVGKKMYE 213
Query: 147 ELLTTNKLS-ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
+ L +S + + L+ Y+M EK++ + S W YI L + P+ ++E E
Sbjct: 214 KGLRQRLISPKHSFLSTYIMQEKRKPE-SQWQIYIDILPKNFSN------FPIFFTEEER 266
Query: 206 AYLTGSPTKAEILERAEGIKREYN---------------ELDTVWFMAGSLFQQYPYDIP 250
+L GSP +ILE+ E IK +Y+ E + M S + I
Sbjct: 267 IWLKGSPFLDQILEKIEDIKADYDLICKEVPEYVQFPIREYSEIRMMVSSRI----FGIQ 322
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
E + FVA + H + P K ++ A++D
Sbjct: 323 IEG----VKTDGFVAYADMLNHKR-----------PRQTSWTYTDEKQGFIIEAMEDI-- 365
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ GE + G + NS+ +NYGF++ +N + + ++V
Sbjct: 366 -------QRGEQVYDSYGKKCNSRFFLNYGFINLNNDANEVPIKV 403
>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 984
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 60/266 (22%), Positives = 109/266 (40%), Gaps = 43/266 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNET---IAELLTTNKLSEL------ACLALYLM 165
A+ED+ G+ S+P LV+T E +E +A L + L A L YL+
Sbjct: 32 ATEDILPGEELCSIPVRLVLTTEIARKSEVGRLVAAHLNAVQGERLRVSAGRAILCAYLI 91
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
+++ + +FW PY+R L + R + ++ +L G+ + E+ + I+
Sbjct: 92 HQRA-AQDAFWGPYLRSLPKHDDR-----------PDEDIQHLAGTNLFYAMQEKQQQIR 139
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV------QSCVVHLQKVSLA 279
++ + +L +P P + FT++ F F A Q+ V + A
Sbjct: 140 ESFD------LLFPALCHAHPTVFPPDLFTWDHFLWTFTACSSRSFPQTLVQQPTATTSA 193
Query: 280 RR--FALVPLGPPLL--------AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
+ L+ + LL Y K L ++ V + + G GP
Sbjct: 194 HADPYDLLEIDECLLPGLDMLNHQYRKKITWALDPSTGRLKFVTEDTVEKGTEAFNNYGP 253
Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEV 355
+ N +LL+ YGF EDN D +++ +
Sbjct: 254 KGNEELLMGYGFCIEDNEQDYVMIRL 279
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 39/70 (55%), Gaps = 3/70 (4%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
V A++ + AG A ++P L++T++ L + E L L E L L+L++EK +
Sbjct: 497 VFAAQAVPAGQALLTIPRQLLITVDTAL-ESPLGEALQYVEGGLDEDTVLTLFLVWEKGR 555
Query: 171 GKKSFWLPYI 180
G+ S W P++
Sbjct: 556 GQASPWYPFL 565
>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
74030]
Length = 483
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 74/290 (25%), Positives = 120/290 (41%), Gaps = 40/290 (13%)
Query: 84 WMHKNGLPPC-KVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL-G 141
W+ K G+ K+ LK+ S V A+ D + + F +P + V+ + V G
Sbjct: 16 WLSKIGVRINPKMTLKDLKSEGRGR----GVVAAADFEEDEVVFCIPRTAVLNVNNVFAG 71
Query: 142 NETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
++ A ++ L +M E +Q S W PY+ L ++ ++S + WS
Sbjct: 72 QDSGASKEALLQMPNWLALTATMMSEGQQSD-SRWAPYLAVLPQK-------LDSLVFWS 123
Query: 202 ETELAYLTGSPTKAEILERA--EGIKREYNELDTVWF------MAGSLFQQYPYDIPTEA 253
E ELA L S +I + E + + L F S+ Y +DIP E
Sbjct: 124 EEELAELQASSVAKKIGRSSAEEMFTKHISPLGLGEFNVELCHQVASVIMAYAFDIPEE- 182
Query: 254 FTFEIFKQAFVAVQSCVVHL-------QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD 306
E KQ + L +K L+ ++PL L A + + A + +
Sbjct: 183 ---EPAKQENGGAEGETDDLVSDDGEDEKTILS----MIPLADMLNADAERNNARIYYEN 235
Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED-NPYDRLVVEV 355
+ +++ +P AGE I G P S LL YG+V E+ YD VVE+
Sbjct: 236 EDLEMRTIKPIMAGEEIFNDYGQLPRSDLLRRYGYVTENYAQYD--VVEI 283
>gi|299115166|emb|CBN75532.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 524
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 71/304 (23%), Positives = 121/304 (39%), Gaps = 71/304 (23%)
Query: 77 DLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
+L L SW ++G K+ L++ + + L G+ S+P SL +T+
Sbjct: 27 ELDGLLSWFVEHGGSMTKLCLEDLGGEMSLS-----LLTGQALNKGEVVMSIPISLCMTV 81
Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ VL AL+LM E+++G SFW Y+R L V++
Sbjct: 82 DSVL--------------------ALHLMAERRKGDGSFWKQYLRTLPDD-------VDT 114
Query: 197 PLLW----SETELAYLTGSPTKAEILERA--EGIKREYNELDTVWFMAGSLFQQYPYDIP 250
PL W +E E L G T +L R +++++ E L + +P +
Sbjct: 115 PLRWLVEQAEEEFRLLDG--TMVGLLSRMMHSQVRKDWEEFHL------PLVEAHPEILG 166
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQK----VSLARRFALVPL------GPP-------LLA 293
TFE + A ++ S Q+ S R A+VP+ P ++
Sbjct: 167 --GVTFEDYLWAMSSIWSRSFDYQEPGPDDSPCSRRAMVPVINAANHDPSAADSLSEMIE 224
Query: 294 YSSKCKAMLAAVDD------AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ ++ + + + +++ R Y A E + G N+KLL +YGFV NP
Sbjct: 225 FQAQEGGLSMGIGEPGRARGTLRVSAGRDYAAREQFFILYGRYSNAKLLYSYGFVLASNP 284
Query: 348 YDRL 351
Y L
Sbjct: 285 YGGL 288
>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 471
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 59/248 (23%), Positives = 101/248 (40%), Gaps = 29/248 (11%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+ ++ T+E + + L + + LS LA Y+++ +
Sbjct: 34 FKEGERILTIPSDVLWTVEHAYADSLLGPTLHSARPPLSVDDTLATYILFVRS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R LA S + ++E EL TG+ A + I+ +Y L
Sbjct: 87 ----RESGYNGLRSHLAALPKSYSSSIFFTEDELEVCTGTSLYAITKQLGRCIQDDYKAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
L Q+ P FT E +K A V S + L P +
Sbjct: 143 VV------RLLIQHRDLFPLSKFTIEDYKWALCTVWSRAMDFVLPDGKSIRLLAPFAD-M 195
Query: 292 LAYSSKCKAMLAA--VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L +SS + A + + ++ + YKAG+ + ++ G PN++LL YGF+ NP D
Sbjct: 196 LNHSSDVRQCHAYDPLSGNLSILAGKDYKAGDQVFIYYGSIPNNRLLRLYGFIIPSNPND 255
Query: 350 R--LVVEV 355
LV+E
Sbjct: 256 NYELVLET 263
>gi|346324642|gb|EGX94239.1| SET domain-containing protein RMS1 [Cordyceps militaris CM01]
Length = 482
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 90/226 (39%), Gaps = 47/226 (20%)
Query: 158 ACLALYLMYEKKQ------GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+ L L L+YE Q G W PY+ L A +P+ WS EL L S
Sbjct: 102 SALILVLLYEHLQRDADATGAACRWRPYLDVL-------PAAFATPMFWSPAELGALQAS 154
Query: 212 PTKAEI-LERAEGIKREY--------------------NELDTVWFMAGSLFQQYPYDIP 250
P A++ E A+ + R ++ + GS Y +D+
Sbjct: 155 PAVAKVGRESADNMFRGILLPAVRAHAHVFAGSERLSDEQIVALAHRMGSTIMAYAFDLD 214
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
E E + +V + + V +A +L ++ + DD +
Sbjct: 215 KEEDEDEDGEDGWVEDRDGKALMGMVPMA----------DILNADAEFNVHVNHGDDDLT 264
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVEV 355
+ RP +AGE I+ + GP PNS+LL YG+V E + YD VVE+
Sbjct: 265 VTALRPIRAGEEILNYYGPHPNSELLRRYGYVTERHARYD--VVEI 308
>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
972h-]
gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
pombe]
Length = 547
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 87/203 (42%), Gaps = 29/203 (14%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L +L E +G +S W YI L + +PL ++E + A+L + + E
Sbjct: 82 LCTFLALESLKGIQSKWYGYIEYLPK-------TFNTPLYFNENDNAFLISTNAYSAAQE 134
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK-QAFVAVQSCVVHLQKVSL 278
R K EY E A SL + PTE FTF+++ A V C +
Sbjct: 135 RLHIWKHEYQE-------ALSL-----HPSPTERFTFDLYIWSATVFSSRC---FSSNLI 179
Query: 279 ARRFALVPLGPPLL-AYSSKCKAMLAAVDD-----AVQLVVDRPYKAGESIVVWCGPQPN 332
+ P+ PL+ + + K K + D +VQL+ G + GP+ N
Sbjct: 180 YKDSESTPILLPLIDSLNHKPKQPILWNSDFQDEKSVQLISQELVAKGNQLFNNYGPKGN 239
Query: 333 SKLLINYGFVDEDNPYDRLVVEV 355
+LL+ YGF DNP+D + ++V
Sbjct: 240 EELLMGYGFCLPDNPFDTVTLKV 262
>gi|70993754|ref|XP_751724.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66849358|gb|EAL89686.1| SET domain protein [Aspergillus fumigatus Af293]
gi|159125354|gb|EDP50471.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 674
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 53/203 (26%), Positives = 89/203 (43%), Gaps = 23/203 (11%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ + +G + FW PYIR L + L++ +PL + +L +L G+ +R
Sbjct: 94 FFLIGQYLRGSEGFWFPYIRTLPQP-----LSLTTPLYYEGDDLRWLDGTSLAPAREQRM 148
Query: 222 EGIKREYNELDTVWFMAG-SLFQQYPYDI--------PTEAFTFEIFKQAFVAVQSCVVH 272
K +Y T AG QY +D+ + AF+ ++ +A V+ +
Sbjct: 149 GVWKEKYENGITELRKAGFEDVDQYTWDLYLWSSSILVSRAFSAKVLAEAVTDVE---LP 205
Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 332
VS+ L+P L+ + K A V VV +G+ I GP+ N
Sbjct: 206 EDGVSV-----LLPC-IDLMNHRPLAKVEWRAGKQDVAFVVLEDVASGQEISNNYGPRNN 259
Query: 333 SKLLINYGFVDEDNPYDRLVVEV 355
+L++NYGF DNP D +V +
Sbjct: 260 EQLMMNYGFCLPDNPCDYRIVSL 282
>gi|1150596|emb|CAA86307.1| putative transcription regulator [Saccharomyces cerevisiae]
Length = 496
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 60/254 (23%), Positives = 109/254 (42%), Gaps = 22/254 (8%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
+ IK+ E T S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSTCVANCPSKFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEELE 215
Query: 278 LARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 336
R +++PL L A +SKC A L + +++V R + E + G PNS+LL
Sbjct: 216 NERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELL 275
Query: 337 INYGFVDED-NPYD 349
YG+V+ D + YD
Sbjct: 276 RRYGYVEWDGSKYD 289
>gi|302784522|ref|XP_002974033.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
gi|300158365|gb|EFJ24988.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
Length = 527
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 60/227 (26%), Positives = 90/227 (39%), Gaps = 37/227 (16%)
Query: 135 TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
T ER L + +L N + +L+ E+ +GK+SFW PYI L +L++
Sbjct: 90 TAERCL---LVGPMLRKNDFRPWLTMCAHLLVERSRGKESFWHPYISALPSVE---ELSI 143
Query: 195 ESPLLW-SETELAYLTGSPTKAEILERAEGIKREYNELDTVW---FMAGSLFQQYPYDIP 250
PLLW +ET L GSP I R + + ++ L T F+ G
Sbjct: 144 SHPLLWPAETIQELLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGG---------- 193
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLA-------RRFALVPLGPPLLAYSSKCKAMLA 303
E + V S V+ + SL LVP L SS +
Sbjct: 194 ------ETLSEGDVRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCL 247
Query: 304 AVDDAVQ---LVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDEDN 346
D + L + Y G+ + GP S+L ++YGFVD++N
Sbjct: 248 IFDQDTKTASLEAHKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDEN 294
>gi|303271159|ref|XP_003054941.1| methyltransferase [Micromonas pusilla CCMP1545]
gi|226462915|gb|EEH60193.1| methyltransferase [Micromonas pusilla CCMP1545]
Length = 544
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 55/111 (49%), Gaps = 8/111 (7%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D++AG+ +P +L VT V + +A L EL LAL+L E+ +G S W
Sbjct: 98 DVRAGEPLLEIPQNLAVTSVDVSDHPIVAGLAAGR--GELVGLALWLCCERAKGSLSDWA 155
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKRE 227
PY+ L V+ PL W E+E+ + L GSPT + + RA + E
Sbjct: 156 PYVNTLPT-----GCTVDHPLRWEESEIRSLLKGSPTCEQAVGRAVDAREE 201
>gi|303279242|ref|XP_003058914.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226460074|gb|EEH57369.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 457
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 70/308 (22%), Positives = 122/308 (39%), Gaps = 63/308 (20%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNE---KHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
K+W+ NG + ++ +E + P V A D++ G++ +P+S T E
Sbjct: 4 FKTWLRSNGFWWNEDAIELGSRIDEGGGEDAPRVGVKAKRDIEIGESVARIPSSACFTCE 63
Query: 138 RVLGNETIAEL-LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ + ++ L+ + LA L L+ E+ G S W Y+ L +
Sbjct: 64 NCAHADAVRKVKLSAGEDEWLASLGTALVLERTLGSSSRWNAYLDSLPHSE------PDV 117
Query: 197 PLLWSET--ELAYLTGSPTKAEILERAEGIKREYNE-----LDTVWFMAGSLFQQYPYDI 249
++WSE YL G+ + + + + E+ LDT+ A + +D
Sbjct: 118 VMMWSEDGERRRYLCGTDIEQSLRDERAAARTEWTRHVKPVLDTLRGAA----KDVGFD- 172
Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLL--------------AY 294
F+A +S V+ +R F + P +G L+ Y
Sbjct: 173 ------------DFLAARS-------VASSRAFTVNPRVGAGLVPIADLFNHRTGGHHVY 213
Query: 295 SSKCKAMLA-------AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
S + A + DDA+ + V + KAGE + G N+KLL +YGF DNP
Sbjct: 214 LSDARGTAAVSERDEGSDDDALFVRVVKASKAGEEVFNTYGKLGNAKLLCSYGFAQLDNP 273
Query: 348 YDRLVVEV 355
D++ + V
Sbjct: 274 ADKVTIGV 281
>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Spinacia oleracea]
Length = 491
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 64/256 (25%), Positives = 109/256 (42%), Gaps = 34/256 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP + + V +E + N L +AL+LM EKK G
Sbjct: 86 LVAQKDISRNEVVLEVPQKFWINPDTVAASEIGS---VCNGLKPWVSVALFLMREKKLGN 142
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PYI L + S + WSE EL+ L GS L E + E+ +L+
Sbjct: 143 SSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELVANEFAKLE 195
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
+ Q +P+D+ + F F F +C+ + L+PL
Sbjct: 196 EEVLVPHK--QLFPFDVTQDDF-FWAFGMLRSRAFTCLE-------GQSLVLIPLADLAN 245
Query: 289 --PPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFV 342
P + A Y+ + + A L + + L P KAG+ +++ + + N++L ++YG
Sbjct: 246 HSPDITAPKYAWEIRGAGLFSRELVFSLRNPTPVKAGDQVLIQYDLNKSNAELALDYGLT 305
Query: 343 D---EDNPYDRLVVEV 355
+ E N Y L +E+
Sbjct: 306 ESRSERNAY-TLTLEI 320
>gi|145346652|ref|XP_001417799.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578027|gb|ABO96092.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 490
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 61/259 (23%), Positives = 104/259 (40%), Gaps = 31/259 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS------ELACLALYLMY 166
V A+ DL+ G+ SVP+ V+T++ + + E + + L + +M
Sbjct: 57 VRATRDLRVGEVVVSVPDDAVLTVDACAVKKELGEFVGDGDDEAPSPRLDKELLVIAVMC 116
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP-------TKAEILE 219
E GK S W Y+ + G S L W + ++ L G+ E L+
Sbjct: 117 EMCAGKSSAWCEYLETVHEAVRVGH----SVLAWDDEQVTALFGTDAWRDAYENDDETLD 172
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHLQKVSL 278
+ + + T++F LF + + EA A VA S + ++
Sbjct: 173 LPMMTEEHFENVVTLFF---KLFPKLASGLSVEALRELHFAATAMVAGYSFTLGDDEIQ- 228
Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDD----AVQLVVDRPYKAGESIVVWCGPQPNSK 334
A+VP +L ++ C+A + D +Q++ R K GE + GP N++
Sbjct: 229 ----AMVPFWD-MLNHAPPCEASVRLHHDQKNGCLQMITVRGVKKGEEVFNTYGPLRNAE 283
Query: 335 LLINYGFVDEDNPYDRLVV 353
LL YGFV NP+ V
Sbjct: 284 LLRRYGFVLPRNPHGGTTV 302
>gi|302840199|ref|XP_002951655.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
gi|300262903|gb|EFJ47106.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
Length = 517
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 45/199 (22%), Positives = 86/199 (43%), Gaps = 13/199 (6%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+++ E+ L + +MYEK +G++S W PY+ + PL W E L G+
Sbjct: 179 DEILEVQGLIIAVMYEKSRGRQSRWAPYLNLIPDD------MTHMPLYWKHREFKELRGT 232
Query: 212 PTKAEILERAEGIKREYNELDTVWF-MAGSLFQQYP-YDIPTEAFTFEIFKQAFVAVQSC 269
+++ + + ++ +W + Q++P ++P +++++ A AV S
Sbjct: 233 AAYDKMMGKVQCPADAPTQVPVLWSEVVEPFIQEHPELELPEGKAGYDLYRWATCAVASY 292
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWC 327
L A+VP+ L + + L A + ++ R GE +V
Sbjct: 293 SFILGDDKYQ---AMVPVWDLLNHITGRVNVRLHHCAKRHVLHMIATRDILRGEELVNNY 349
Query: 328 GPQPNSKLLINYGFVDEDN 346
G N++LL YGFV+ N
Sbjct: 350 GELSNAELLRGYGFVEARN 368
>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
florea]
Length = 483
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 60/285 (21%), Positives = 116/285 (40%), Gaps = 31/285 (10%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
SK+ + +G +W+ +NG + E P ++ + A + + +P
Sbjct: 75 SKRSQGIGQFINWLKENGANVDGASVAEFPGYDLG------LKAERNFLENELILRIPRG 128
Query: 132 LVVTLERVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDR 185
L+ ++ + EL+T + ++A LA+ L+ E+ + + S W PY+ L
Sbjct: 129 LIFSI-----HNAAPELITLQNDPLIQHMPQVA-LAIALLIERHK-ENSKWKPYLDILPT 181
Query: 186 QRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQY 245
+ L + ++ L GSPT L++ I R+Y+ + V+ +
Sbjct: 182 -------TYTTVLYMTAADMIELKGSPTLEAALKQCRNIARQYSYFNKVFQNNNNAVSAI 234
Query: 246 PYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAA 304
D+ FT+E + A V + + +R AL+P+ + + A
Sbjct: 235 LRDV----FTYERYCWAVSTVMTRQNLIPSEDGSRMIHALIPMWDMCNHENGRITTDFNA 290
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+ + R +K GE I + GP+ NS ++ GFV +N D
Sbjct: 291 TSNYCECYALRDFKKGEQIFISYGPRTNSDFFVHSGFVYMENKQD 335
>gi|224125978|ref|XP_002329631.1| predicted protein [Populus trichocarpa]
gi|222870512|gb|EEF07643.1| predicted protein [Populus trichocarpa]
Length = 513
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 57/250 (22%), Positives = 107/250 (42%), Gaps = 28/250 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A++DL+ GD A +P S++++ E V ++ L + ++ L L+ M E+
Sbjct: 149 ATKDLKVGDIALEIPVSIIISEEHVHKSDMYHILEKIDGITSETMLLLWSMKERHNCSSK 208
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F + Y L + G L + + L G+ EI++ E ++ +Y+EL
Sbjct: 209 FKI-YFDTLPEEFKTG-------LSFGVDAIMALDGTLLLEEIMQAKEHLRVQYDEL--- 257
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG------ 288
L + YP E +T+E F A S + + V R L+P+
Sbjct: 258 ---VPPLCKNYPDVFLPELYTWEQFLWACELWYSNSMKVMFVDGKLRTCLIPIAGFLNHS 314
Query: 289 --PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
P ++ Y + + + ++ + RP GE + G +S L+ YGF+ + D
Sbjct: 315 LYPHIVHYGK-----VDSATNTLKFPLTRPCCFGEQCCLSYGNFSSSHLITFYGFMPQGD 369
Query: 346 NPYDRLVVEV 355
NP D + +++
Sbjct: 370 NPCDVIPLDI 379
>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
terrestris]
Length = 484
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 64/288 (22%), Positives = 115/288 (39%), Gaps = 39/288 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+ + +G +W+ +NG + E P ++ + A + + +P L
Sbjct: 77 KRSQGIGRFINWLKQNGANVYGASVAEFPGYDLG------LKAERNFLENELILRIPREL 130
Query: 133 VVTLERV------LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ ++ L N+ + +L+ LA+ L+ EK + + S W PY+ L
Sbjct: 131 IFSIHNAAPELVALQNDPLLQLMPQ------VALAIALLIEKHK-EYSKWKPYLDILPT- 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
+ L + ++ L GSPT L++ I R+Y +F LFQ+
Sbjct: 183 ------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQY-----AYF--NKLFQKNN 229
Query: 247 YDIPT---EAFTFEIFKQAF--VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
+ + FT+E + A V + ++ + SL AL+P+ SK
Sbjct: 230 NAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIH-ALIPMWDMCNHEDSKITTD 288
Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
A + + R +K E I + GP+ NS ++ GFV DN D
Sbjct: 289 FNATLNCCECYALRDFKKAEQIFISYGPRTNSDFFVHSGFVYMDNEQD 336
>gi|308806489|ref|XP_003080556.1| SET-domain transcriptional regulator-like protein (ISS)
[Ostreococcus tauri]
gi|116059016|emb|CAL54723.1| SET-domain transcriptional regulator-like protein (ISS)
[Ostreococcus tauri]
Length = 394
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 67/263 (25%), Positives = 108/263 (41%), Gaps = 46/263 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV------------------LGNETIAELLTTNKL 154
V A E ++AG+ VP ++ +E+ +G++ I + T L
Sbjct: 4 VRAVERVEAGECVARVPWDALLGVEQTVETSSPSPTSEILKQLTRMGDQIIMVIWLTAAL 63
Query: 155 SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
C YE+ W P +R L + S L W+ +L + G
Sbjct: 64 DAFEC-GDASAYEE-------WAPALRALPTR-------ASSSLAWNADDLGAVAGEDLA 108
Query: 215 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF-TFEIFKQAFVAVQSCVVHL 273
+ E +K +Y+ L F A L +Q P P AF + F++A+ S + +
Sbjct: 109 NRLREYRRSVKVQYDAL----FPA--LCEQVPEAFPARAFGDYAKFERAYDIWTSYAMKV 162
Query: 274 QK-VSLARRFALVP----LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 328
Q SL R +VP L A+S + ++ A +L + R GE+I + G
Sbjct: 163 QDPDSLQIREVIVPGVFLCNHSLSAHSVRYTSLERGTK-AFRLELSRGCVEGEAITISYG 221
Query: 329 PQPNSKLLINYGFVDEDNPYDRL 351
N+ LL+ YGF E+NPYDR+
Sbjct: 222 RLDNADLLMFYGFSLENNPYDRV 244
>gi|307108530|gb|EFN56770.1| hypothetical protein CHLNCDRAFT_8187, partial [Chlorella
variabilis]
Length = 398
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 60/238 (25%), Positives = 101/238 (42%), Gaps = 26/238 (10%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
S+ + G+ F+VP + +T + ++ + L L +AL+L++E+ G S
Sbjct: 4 SKAVNKGEQLFAVPEAAWITADTAQQSQIGSHL---TGLESWLAIALFLLHERAMGNASR 60
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PYI L G SP+ W E +LA L GS + ++ +++L
Sbjct: 61 WAPYIALLPADSG-------SPVQWEEADLAELQGSQVLGTVQGYRAYFQQRFDQLQAEV 113
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV------VHLQKVSLARRFALVPLGP 289
F S +D P F F+ F A V++ ++ V LA P P
Sbjct: 114 FGPNS----QAFD-PI-VFNFDAFLWAACTVRARAHPPLDGGNIALVPLADMVRSQPSWP 167
Query: 290 PLLA-YSSKCKAMLAAVDDAVQLVVDR--PYKAGESIVVWCGPQ-PNSKLLINYGFVD 343
P A + K L LV++ AG++I + GPQ + +LL+++G +D
Sbjct: 168 PDSAGWQLKQTGGLFGAGSTQALVMEASGSMAAGDAIAMDFGPQKSDGQLLVDHGVID 225
>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 567
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 66/304 (21%), Positives = 119/304 (39%), Gaps = 37/304 (12%)
Query: 65 AGSREVVSKKEEDL-GDLKSWMHKNGL--PPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
A S ++V E L + W+ G CKV + S + A++D++
Sbjct: 56 ANSGKIVEPTEAQLVANFIEWLKGKGFDESKCKVKIDRNTSEGTG------LVATQDIKE 109
Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLP 178
G+ +P++L +T +L ++L + L+++L+ E S W P
Sbjct: 110 GEDFVEIPSNLFITTAVAFQGLGKPPILENDRLIQSIPGILLSIFLVKELS-NPTSEWGP 168
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA 238
YI+ L +Q + W E GSP + G R+Y ++
Sbjct: 169 YIKLLPKQ-------YNTVYYWGLKEFTQFRGSPNLEYAMRYVRGAMRQY------CYLY 215
Query: 239 GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-----FALVPLGPPLLA 293
+ + +P +FT++ F A VQS Q A AL+P
Sbjct: 216 SMIDRTQSNIMPISSFTWDAFVWAISTVQS----RQNPVYAGNGNGSIMALIPFWDFCNH 271
Query: 294 YS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
S SK + + + + +K GE + ++ GP+ N++LL++ GF + N +D
Sbjct: 272 SSTGSKITSFYHMDSNCMTSGAIKDFKKGEQVYMFYGPRDNTQLLMHAGFATKTNLHDSY 331
Query: 352 VVEV 355
E+
Sbjct: 332 PFEL 335
>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 426
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 55/245 (22%), Positives = 106/245 (43%), Gaps = 26/245 (10%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
H V A + L +G +P L + +E ++ L ++ + LAL+LM+E+ +
Sbjct: 35 HGVFAKQALTSGQVTLRIPFKLTMNIESAARSDLARVLEKYPQIPDDEVLALHLMHERSK 94
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
SF+ P+I L + P+ WSE+EL L G+ + ++R++
Sbjct: 95 RSDSFFAPFIASLPT-------TFDLPVFWSESELNELKGTNVLLLTQLMKQQLQRDFEN 147
Query: 231 LDTVWFMAGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
+ ++ + +P +PT T E + A + S + + R L P
Sbjct: 148 IHQ------AVVEDFPEVFALLPT--LTLEDYTWAMSVIWSRAFGVTREKKYLR-VLCP- 197
Query: 288 GPPLLAYSSKCKAML---AAVDDAVQLV---VDRPYKAGESIVVWCGPQPNSKLLINYGF 341
+ + + +L + D+ Q++ V + AG ++ + G N+KLL +YGF
Sbjct: 198 AMDMFNHDVSLRILLDDFVSFDEETQMLTHHVPKEVAAGSALQISYGQYSNAKLLFSYGF 257
Query: 342 VDEDN 346
V ++N
Sbjct: 258 VAKEN 262
>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
Length = 470
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 47/167 (28%), Positives = 72/167 (43%), Gaps = 21/167 (12%)
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
S + +SE EL G+ + + IK +Y +L A LF Q+P P + FT
Sbjct: 107 SSIFFSEGELEVCAGTSLYTVTKQLEQRIKDDYRQL------AVRLFAQHPDLFPLQKFT 160
Query: 256 FEI----------FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAML 302
E +K A V S + + L P +L +SS +C A
Sbjct: 161 IEDVRLLRRATDPYKWALCTVWSRSMDFTLPDGSSIRLLAPFAD-MLNHSSEVKQCHAYD 219
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
D + + + Y+ G+ + ++ GP PN++LL YGFV DNP D
Sbjct: 220 VKSGD-LSVFAGKDYEIGDQVYIYYGPIPNNRLLRLYGFVIPDNPND 265
>gi|357131865|ref|XP_003567554.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Brachypodium distachyon]
Length = 316
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 74/181 (40%), Gaps = 22/181 (12%)
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KKS W PY+R L R + + + W EL + S E +ER E +E++ +
Sbjct: 38 KKSGWAPYVRSLPRND-----QMHNMMFWDLNELHMVRISSICDEAIERRERAMKEFSAV 92
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
LF + E F A S +V + +R +L+P
Sbjct: 93 KPSLECFPHLFGE---------IKLEDFMHA-----SALVSSRAWQTSRGVSLIPFAD-F 137
Query: 292 LAYSSKCKAML--AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L + ++L D +++ DR Y GE ++V G N+ L +N+GF N YD
Sbjct: 138 LNHDGVSDSILLYDGQKDIAEVISDRNYAVGEQVMVRYGKYSNAMLALNFGFTLPRNIYD 197
Query: 350 R 350
+
Sbjct: 198 Q 198
>gi|384249602|gb|EIE23083.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 306
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 70/150 (46%), Gaps = 17/150 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL G +P + V++++ N IA++L +++ L + +MYE GK
Sbjct: 4 VFAVQDLCEGQRLCEIPKTAVLSVQ----NTGIADILEQHRIRGGLGLIIAIMYELSIGK 59
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+SFW Y+ EL ++ PL W+E E + L G+ + E E + ++
Sbjct: 60 ESFWHGYLEELHKRE-------YLPLFWAEQERSLLQGTEAEHRPQEDEELTQEDFET-- 110
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
L +Q+ + ++FT E F+ A
Sbjct: 111 ----HVPPLVEQHADRLRADSFTLESFRVA 136
>gi|224012755|ref|XP_002295030.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969469|gb|EED87810.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 753
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 60/227 (26%), Positives = 99/227 (43%), Gaps = 28/227 (12%)
Query: 127 SVPNSLVVTLERVLGNET-IAELLTTNKLSELA----CLALYLMYEKK-QGKKSFWLPYI 180
S+P S ++T+E +G T I + T+ L A L +Y+++++K G+ SF+ PY
Sbjct: 141 SIPKSCLITVE--MGQATPIGRKILTSDLELDAPKHIFLMIYILWDRKVNGETSFFAPYY 198
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
+ L + P+ W+ EL L GS +I +RAE IK +Y + ++ G
Sbjct: 199 KILP------ETLRNMPIFWTREELDALEGSYLLLQIADRAEAIKEDYISICSIAPEFGD 252
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSK-CK 299
+ T E F+ A + V S L ++ R ALVP L + K
Sbjct: 253 I------------ATLEEFQWARMIVCSRNFGLL-INGHRTSALVPHADMLNHLRPRETK 299
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
+ + + + GE + G + N + L+NYGF E N
Sbjct: 300 WTFSEESQSFTITTLQEIGMGEQVFDSYGQKCNHRFLLNYGFCVERN 346
>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
Length = 495
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 61/284 (21%), Positives = 116/284 (40%), Gaps = 27/284 (9%)
Query: 88 NGLPPCKVILKEKPSHNEKHRPIHYVA---------ASEDLQAGDAAFSVPNSLVVTLER 138
+ LPP L+ +H+ K P+ V A++ + + FS+P L+++ E
Sbjct: 75 DTLPP---FLEWMTNHDVKMGPVELVELPLYGCCVRATKQVSTDELLFSIPQKLMLSNET 131
Query: 139 VLGNETIAELLTTNK-LSELACLAL-YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
+ TI + + LS++ +AL + + + KSFW PY+ L + ++
Sbjct: 132 A-NSSTIGHFINNDPILSQMPNVALAFHVLNELYDPKSFWKPYLDALPS-------SYDT 183
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DTVWFMAGSLFQQYPYDIPT 251
+ ++ E+ L GSP + L I R+Y+ V +L + Y+
Sbjct: 184 VMYFTPDEITELKGSPAFDDALRMCRNIARQYSYFYSLLQKNVDPALSNLRANFTYNDYR 243
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
A + + +Q + Q + K L AL+PL +
Sbjct: 244 WAVSTVMTRQNLIPSQEEISGNDKDQLPPVNALIPLWDFCNHQDGQFSTEFQLESRRTVC 303
Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
R + GE + ++ G + ++ I+ GFVD +N +D L ++V
Sbjct: 304 QAGRDFGPGEQVFIFYGTRTCAEQFIHNGFVDINNAHDALTLKV 347
>gi|422293007|gb|EKU20308.1| ribulose- -bisphosphate carboxylase oxygenase small subunit
n-methyltransferase i [Nannochloropsis gaditana CCMP526]
Length = 385
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 83/202 (41%), Gaps = 20/202 (9%)
Query: 78 LGDLKSWMHKN---GLPPCKVILKEKP-SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLV 133
LG+ WM G+PP ++L + E + + G+A F +P S+V
Sbjct: 115 LGENGVWMQDKSGWGVPPHPLLLSSRTIDEIELEDSGRGLICKYPINMGNALFQLPLSIV 174
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
+ E+ L A ++E +AL L+ E+ G SFW PYI L
Sbjct: 175 IDKEKSLAAFDGA---LPADINEYFAIALMLIKERALGPSSFWAPYIDVLPTTE-----E 226
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS-LFQQYPYDIPTE 252
V L+W E +LA L SP A + E+ L+ + A S +F
Sbjct: 227 VNPTLVWPEGDLALLEASPLVAATRSLKRKLAAEFALLEEQYMRARSDVFD-------PS 279
Query: 253 AFTFEIFKQAFVAVQSCVVHLQ 274
FTFE + AF+ + S + ++
Sbjct: 280 VFTFEAYLWAFINIFSRAIRVK 301
>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
Length = 355
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 61/256 (23%), Positives = 102/256 (39%), Gaps = 34/256 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-------------LGNETIAELLTTNKLSELAC 159
A +D+ G+ ++P+ +++ ERV + +++I+ LSE
Sbjct: 75 TTAKDDIADGELYIAIPDHMLMGPERVEPGSRLDKKLMKIVKSQSISMQEQRRLLSEKNK 134
Query: 160 LALYL---MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
+ +Y MY K K+SFW PY + + SP+ WSE EL L GS
Sbjct: 135 VLMYFLLQMYNPK--KESFWKPYFDIMPTN-------LTSPIFWSEDELQELAGSEVSNM 185
Query: 217 ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ ++ Y+EL +F+ +AFT + + A S V+ L +
Sbjct: 186 ARIEKKRLRAMYDELRE------RIFKHDRKTFLKQAFTLKNWFWANGLYDSRVIQLNRQ 239
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNS 333
+ +PL + S+ K + A + DR G + G + N
Sbjct: 240 TGHGNVPTFIPLIDMVNCIESQDKTFIQYDKKLRAAVMYADRAVSRGVQVFESYGNKSNY 299
Query: 334 KLLINYGFVDEDNPYD 349
+ L+ GFV EDNP D
Sbjct: 300 EYLLYNGFVMEDNPND 315
>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
Length = 437
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 68/250 (27%), Positives = 110/250 (44%), Gaps = 27/250 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS---ELACLALYLMYEK- 168
V +ED+ FS+P V++++ + N + + +L+ E LA+ L+YEK
Sbjct: 47 VFIAEDVTPHAEVFSIPLDSVLSVKSLQENAVLQSIAFFQQLTPEREDDQLAIALLYEKF 106
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G KS W +I L R + L + EL L GS + E + +Y
Sbjct: 107 VRGSKSKWAKHIELLPR-------TYHNALYFGPEELRALEGSNVYFIAQQMEEKVAHDY 159
Query: 229 NELDTVWFMAGSLFQQYP----YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-- 282
L + LF+ P D+ E F+ E +K A + S V +A++
Sbjct: 160 ARLKESVLL--ELFENVPEGINVDLFDEFFSLENYKWALSTIWS---RFGDVPVAKQSFK 214
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINY 339
A+VP+ +L + + + M D + Q LV + + AG + + GP N KLL Y
Sbjct: 215 AMVPVFD-MLNHDPEAE-MSHFFDMSTQRFKLVSHQHWNAGAQMFINYGPLSNHKLLALY 272
Query: 340 GFVDEDNPYD 349
GFV NP+D
Sbjct: 273 GFVIIGNPFD 282
>gi|323449371|gb|EGB05259.1| hypothetical protein AURANDRAFT_66448 [Aureococcus anophagefferens]
Length = 762
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 63/274 (22%), Positives = 103/274 (37%), Gaps = 52/274 (18%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
+W+ G K+ +K H R + AA E G+ SVP +L++T+++
Sbjct: 22 AWLRAGGASFDKLAIK----HTALGRGVVATAAYE---PGETLLSVPEALLLTVDKASRR 74
Query: 143 ETIAELLTTNKLSELAC------LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVES 196
+A L + + LAL+L ++ + W PY + R
Sbjct: 75 ADVAASLGAARARGVDANGGNLALALFLAGDRSEA----WRPYRNVISRS------VSHL 124
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
P W + A L GSP +++ R + I+R+ L F + + AF F
Sbjct: 125 PCFWPTADEALLAGSPLGEDVVRRRDEIRRDCRSLGLTAVEDRQAFAFAEAQVLSRAFAF 184
Query: 257 EIFKQA--FVAVQSCVVHLQK-VSLA-RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
+ F + + H ++ V A R A V + AV
Sbjct: 185 NGTRAMVPFADLMNTARHHERHVDFAFERGAFV----------------MRAV------- 221
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
R AGE + GP+ N++ L+NYGF DN
Sbjct: 222 --RRGAAGEPVTDSYGPKSNARYLLNYGFAMADN 253
>gi|400602586|gb|EJP70188.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 797
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 2/74 (2%)
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
R AL+P+ L ++ C + +A + DR Y+AGE + G N LL YG
Sbjct: 593 RLALLPVADVLNHANAGCSVAFST--EAYDITADRAYQAGEEVYTSYGAHSNDFLLAEYG 650
Query: 341 FVDEDNPYDRLVVE 354
FV DNP+D+L ++
Sbjct: 651 FVLPDNPWDQLCLD 664
>gi|50294638|ref|XP_449730.1| hypothetical protein [Candida glabrata CBS 138]
gi|49529044|emb|CAG62706.1| unnamed protein product [Candida glabrata]
Length = 510
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 74/301 (24%), Positives = 116/301 (38%), Gaps = 46/301 (15%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
SW+ NG+ K+ K K N V + D+Q + F +P ++++ E
Sbjct: 14 SWLTNNGV---KISPKLKVEDNRYKDEGRCVVTTTDIQKDELLFEIPRNVLLNCETSQLV 70
Query: 143 ETIAELLT---TNKLSE-------LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
+ I +LT T SE + CL Y MY K KS W PY L L
Sbjct: 71 KDIPAVLTELETFSGSEPLSWEPLILCL-FYEMYILKD--KSRWWPYFEVLPTLEDMNVL 127
Query: 193 AVESPLLWSETELAYLTGS-----------PTKAEILER------AEGIKREYNELDTVW 235
+LWS+ +LA L S ++L+R E +K N+
Sbjct: 128 -----VLWSDEDLAALEPSYVLSCIGKEQVENMYQLLKRFIEASDHEQLKSNLNKFSWDS 182
Query: 236 FM-AGSLFQQYPYDIPTEAF-------TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
F+ GSL Y +D+ E + + + + + ++VPL
Sbjct: 183 FIRIGSLIMSYSFDVGKEIHNEGKEGESMNENDNMTNGDEDEDEDEEDLEVEMIKSMVPL 242
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
L A + KC A L ++++ R +GE + G NS+LL YG+V+ D
Sbjct: 243 ADTLNADTKKCNANLLHSKQTLRMIAIRDIPSGEQVYNTYGELSNSELLRRYGYVEWDGS 302
Query: 348 Y 348
Y
Sbjct: 303 Y 303
>gi|358056332|dbj|GAA97699.1| hypothetical protein E5Q_04377 [Mixia osmundae IAM 14324]
Length = 347
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 51/103 (49%), Gaps = 4/103 (3%)
Query: 252 EAFTFEIFKQAFVAVQS-CV-VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA- 308
E F+ F+ A++ V S CV + L + F LVPL + +SS C D A
Sbjct: 122 EIIDFDAFRWAWLCVNSRCVWLDLDYEAHEENFTLVPL-LDMANHSSTCANATVKYDHAH 180
Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
+L + RP K GE IV G + L YGF++ NP++R+
Sbjct: 181 FELKLTRPVKRGEEIVFEYGGHDQATLWAEYGFIESSNPHERI 223
>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
[Spinacia oleracea]
gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase II [Spinacia oleracea]
Length = 495
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 61/253 (24%), Positives = 111/253 (43%), Gaps = 24/253 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A +D+ + VP + + V +E + N L +AL+LM EKK G
Sbjct: 86 LVAQKDISRNEVVLEVPQKFWINPDTVAASEIGS---VCNGLKPWVSVALFLMREKKLGN 142
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PYI L + S + WSE EL+ L GS L E + E+ +L+
Sbjct: 143 SSSWKPYIDILPD-------STNSTIYWSEEELSELQGSQLLNTTLGVKELVANEFAKLE 195
Query: 233 TVWFMAGSLFQQYPYDIPTEAF--TFEIFK-QAFVAVQS---CVVHLQKVSLARRFALVP 286
+ Q +P+D+ + F F + + +AF ++ ++ L + + +
Sbjct: 196 EEVLVPHK--QLFPFDVTQDDFFWAFGMLRSRAFTCLEGQSLVLIPLADLWVQQANHSPD 253
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVD-- 343
+ P A+ + A L + + L P KAG+ +++ + + N++L ++YG +
Sbjct: 254 ITAPKYAWEIR-GAGLFSRELVFSLRNPTPVKAGDQVLIQYDLNKSNAELALDYGLTESR 312
Query: 344 -EDNPYDRLVVEV 355
E N Y L +E+
Sbjct: 313 SERNAY-TLTLEI 324
>gi|424513480|emb|CCO66102.1| predicted protein [Bathycoccus prasinos]
Length = 571
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 66/265 (24%), Positives = 110/265 (41%), Gaps = 48/265 (18%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
S++++ GD S+P VT + +A L+ + EL LAL+L EK + K S
Sbjct: 122 SKNVEGGDVILSIPQDNCVTAVDAKEHPIVAPLI--EEKPELVQLALWLCCEKAKAKGSE 179
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELA-YLTGSPTKAEILERAEGIKREYNELDTV 234
W PY++ L+ S L ++E E L G+ E +R + K EY L
Sbjct: 180 WWPYLKTLNGNPN-------SVLRFTEEEFKELLKGTSIDKEARQRRDSAKEEYEALRAA 232
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFV-AVQSCVVHLQKVSLARRFALVPL------ 287
+ P P + + F + + AF+ A+ Q ++ A +A+VPL
Sbjct: 233 -------IAEDPGKYPLDVYAF-LTESAFIDALDIVCARAQWLNSANCYAMVPLMDAIPI 284
Query: 288 --GPPLLA-------------------YSSKCKAMLAAVDDA-VQLVVDRPYKAGESIV- 324
PP ++ + +C VD A V L + AG I+
Sbjct: 285 CGAPPPVSPEDPSFARFYEIRDIKTGLTAVRCGYADYDVDSASVVLCANTRASAGSKILQ 344
Query: 325 VWCGPQPNSKLLINYGFVDEDNPYD 349
+ + NS+L +++G VD+ +P D
Sbjct: 345 IDHSVRNNSELYLSFGDVDDQHPGD 369
>gi|225554758|gb|EEH03053.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 485
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 63/291 (21%), Positives = 113/291 (38%), Gaps = 42/291 (14%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ SW+ + P KV K K + + A +D+ + F++P +LV++ +
Sbjct: 19 EFMSWLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLSFQ-- 74
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N ++ +LL N+ CL + ++YE QG S W Y + L ++
Sbjct: 75 --NSSLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTL 125
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIP 250
+ W++ EL L+GS +L + E N L + + +P +D P
Sbjct: 126 MFWTDEELRELSGSA----VLNKIGRSDAEANILRNILPLVSGNPSHFPPMSGVASFDSP 181
Query: 251 TE----------------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
A+ F+I K + ++ +VPL L A
Sbjct: 182 EGKAALLSLAHRMGSLIMAYAFDIEKGENDGGEGQDGYVTDDEEELSKGMVPLADLLNAD 241
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
+ + A L D + + +P + GE I G P + LL YG+V ++
Sbjct: 242 TDRNNARLFQEDCYLSMRSIKPIRKGEEIFNDYGELPRADLLRRYGYVTDN 292
>gi|145356486|ref|XP_001422460.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
gi|144582703|gb|ABP00777.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
Length = 529
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 78/331 (23%), Positives = 134/331 (40%), Gaps = 53/331 (16%)
Query: 48 RRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKH 107
RR+ R+ +S + D L W+ NG V + + +E
Sbjct: 24 RRRARWGDATTSKTRRPRTRARRDAASSADHDALHEWLSANGADVASVEFYDARAGDEDD 83
Query: 108 RPIHYVA--ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL----TTNKLSELACLA 161
A+ L G A VP SL +T E + ++ + + L L+ LA
Sbjct: 84 GGDAGWGARATRALARGAKAIVVPKSLWITPEVGMNDDELGKALRDEDVAGGLARWTTLA 143
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L L+ E+++G++S + Y++ L + SPL W+ EL+ + G+ ++L+ A
Sbjct: 144 LTLLKERERGEESKYAAYVKTLPE-------VLHSPLFWNAEELSEIQGT----QLLDNA 192
Query: 222 EG----IKREYNELDTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
G ++ Y L T +F ++ +D+ AF+ + F+ AF ++S
Sbjct: 193 AGYDGYVRGVYETLRT------GMFAKHADVFDVEG-AFSEDNFRWAFGILRS---RTMA 242
Query: 276 VSLARRFALVPLGPPLLAYSSKCKA-------MLAAV---------DDAVQLVV--DRPY 317
ALVP G L+ +SS +A + AV DD V V DR
Sbjct: 243 PCDGANIALVP-GVDLVNHSSLSQARWRVSGGVAGAVAGLFGGGKGDDGVSARVECDRAL 301
Query: 318 KAGESIVVWCGPQ-PNSKLLINYGFVDEDNP 347
E + V P+ ++ +++GFVD P
Sbjct: 302 NVNEPLYVNYNPEGTDTSFALDFGFVDTITP 332
>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
2508]
gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 459
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 57/242 (23%), Positives = 103/242 (42%), Gaps = 31/242 (12%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P ++ T++ + + L + + LS LA Y+++ K
Sbjct: 34 FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S +L++E +L G+ + + I+ ++ L
Sbjct: 87 ----RESGYDGQRSHIAALPTSYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
LF Q+P P + FT E +K A V S + LA ++ L P
Sbjct: 143 VV------RLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFA 193
Query: 290 PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+L ++S+ K + + ++ + Y+AG+ + + GP PNS+LL YGFV NP
Sbjct: 194 DMLNHTSEVKQCHVYDPSSGNLSVLAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNP 253
Query: 348 YD 349
D
Sbjct: 254 ND 255
>gi|212546319|ref|XP_002153313.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210064833|gb|EEA18928.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 481
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/271 (23%), Positives = 109/271 (40%), Gaps = 47/271 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A ++Q G+ F +P+ +V+ ++ N+ +A+ L L L + ++YE G+
Sbjct: 50 VVARSNIQEGEDLFHLPHHIVLMVKTSRLNQILADDLKN--LGPWLSLVVVMIYEYSLGE 107
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP---------TKAEILERAEG 223
+S W Y + L + ++ + WSE E + L S + +I E+
Sbjct: 108 QSNWKQYFQVLPSK-------FDTLMFWSEEEFSQLQASAVVDKVGKRDAEEDIFEKVLP 160
Query: 224 IKREYNEL------------DT-------VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFV 264
+ R + +L DT + GSL Y +DI + ++
Sbjct: 161 LVRAHPDLFPPIDGVMSYDDDTGAQALLELAHRMGSLIMAYAFDIEKAEEEESEGEDGYL 220
Query: 265 AVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
L K +VPL L A + + A L + A+ + +P KAG+ I
Sbjct: 221 TDDEE--QLPK-------GMVPLADLLNADADRNNARLFQEEGALVMRAIKPIKAGDEIF 271
Query: 325 VWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
G P S LL YG+V DN VVE+
Sbjct: 272 NDYGELPRSDLLRRYGYV-TDNYAQYDVVEL 301
>gi|67540796|ref|XP_664172.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
gi|40738718|gb|EAA57908.1| hypothetical protein AN6568.2 [Aspergillus nidulans FGSC A4]
gi|259480141|tpe|CBF71002.1| TPA: SET domain protein (AFU_orthologue; AFUA_6G04520) [Aspergillus
nidulans FGSC A4]
Length = 484
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/263 (22%), Positives = 101/263 (38%), Gaps = 48/263 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN--KLSELACLALYLMYEKKQ 170
V A D+ + F++P LV++ N + +LL+ + +L L L +++E Q
Sbjct: 50 VVAQADIDEDEELFAIPRDLVLSTH----NSKLKDLLSQDLDQLGPWLSLMLVMIFEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--------- 221
G KS W PY + L + ++ + WS EL L GS +I ++
Sbjct: 106 GGKSTWAPYFKVLPQN-------FDTLMFWSPEELEELQGSAVVEKIGKQGAEESILKLI 158
Query: 222 --------------EGIKREYNELDTVWFMA-----GSLFQQYPYDIPTEAFTFEIFKQA 262
G+ ++ + GSL Y +DI T E +
Sbjct: 159 IPVVRANPALFPPINGLASYDGDVGAQALLGLAHTMGSLIMAYAFDIETPENEDEREGED 218
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
K +VPL L A + + A L ++++ + +P +AGE
Sbjct: 219 GYLTDEEEEQSSK-------GMVPLADMLNADAYRNNARLFQEEESLVMKAIKPIRAGEE 271
Query: 323 IVVWCGPQPNSKLLINYGFVDED 345
I G P S LL YG+V ++
Sbjct: 272 IFNDYGEIPRSDLLRRYGYVTDN 294
>gi|328772032|gb|EGF82071.1| hypothetical protein BATDEDRAFT_23340 [Batrachochytrium
dendrobatidis JAM81]
Length = 419
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 63/256 (24%), Positives = 110/256 (42%), Gaps = 38/256 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV--LGNETIAELLTTNKLSELACLALYLMYEKKQ 170
+ A+ D Q GD +P L++ R L N A + L + +AL++ ++K
Sbjct: 47 LMATSDFQIGDPVVRIPARLLLVPRRTHKLFNNHPAIV----ALKQHPSIALFIAWQKIH 102
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
W PYI L R L ++ LL L Y +I E A K + ++
Sbjct: 103 PTPE-WSPYIDILPRSFDTMPLCIDLKLL---AMLPY--------DIQEIA---KNQQSK 147
Query: 231 LDTVWFMAGSLFQQYPYD-IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-------RF 282
LDT + + Y+ IP + IFK A++ V + + + ++++ +
Sbjct: 148 LDTDYAFVCTALAVSGYEMIPKD-----IFKWAWIVVNTRCITMNTNAISKPQLSHIHQQ 202
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQ----LVVDRPYKAGESIVVWCGPQPNSKLLIN 338
++ L P L + A ++A D V+ + PYK G + + GP N+ LL
Sbjct: 203 PIITLAPFLDCLNHTSTARISAGYDTVEKAYIIRTLVPYKKGSQVFINYGPHDNNFLLAE 262
Query: 339 YGFVDEDNPYDRLVVE 354
YGF NP++ +V++
Sbjct: 263 YGFAILKNPFNHVVLD 278
>gi|156384284|ref|XP_001633261.1| predicted protein [Nematostella vectensis]
gi|156220328|gb|EDO41198.1| predicted protein [Nematostella vectensis]
Length = 403
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/235 (23%), Positives = 105/235 (44%), Gaps = 24/235 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL--ACLALYLMYEKKQGK 172
A+ DLQ +VP L++++ + + + L + LAL+++ E+ +
Sbjct: 26 ATADLQENQVFVAVPEKLLMSVVTAKKSSLGPLISREHGLRSMPHVVLALHVLCERLH-E 84
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L R + + L +S ++ L GSP+ E L++ GI ++Y
Sbjct: 85 DSTWAPYLNILPR-------SYSTCLYFSPDDMMALQGSPSMGEALKQFRGIVKQY---- 133
Query: 233 TVWFMAGSLFQQYPYDIPTE-AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
V+F +P + +FTF+ F+ A V + ++ S AL+ P+
Sbjct: 134 -VYFFRLVQINPEASRLPLKNSFTFDDFRWAVSTVMTRQNDVKVSSNETVKALI----PM 188
Query: 292 LAYSSKCKA-MLAAVDDA---VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+ C DD+ V+ + +P +AG+ + ++ G + N+ L + GFV
Sbjct: 189 WDMCNHCNGPFTTGFDDSTKEVKSLAFKPTRAGDQVFIFYGRRNNADRLFHNGFV 243
>gi|302793745|ref|XP_002978637.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
gi|300153446|gb|EFJ20084.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
Length = 523
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 58/259 (22%), Positives = 105/259 (40%), Gaps = 63/259 (24%)
Query: 139 VLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
VLGN + + N++ + + ++L+ E+ +G+ SFW PY+ L G +PL
Sbjct: 80 VLGN--VYREMLGNEIDDRLLVMIFLIIERARGRASFWAPYLEMLPSGFG-------TPL 130
Query: 199 LWSETELAYLTGS----PTKAEI--------------LERAEGIKREYNELDTVWFMAGS 240
+ + EL L G+ TKA++ L R + + E+ E +W A
Sbjct: 131 WFEDEELMELDGTTLFEATKAQVFFPSTFVSTCMSLYLFRPDDRELEFQEF--LW--ANC 186
Query: 241 LFQQYPYDIPTEA-----------------FTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
+F +IP A + F++ + + VS
Sbjct: 187 IFWTRALNIPCPASFVTSSSPEVAKDDGNRLVIYVLPHPFISCSA-----KDVSTIWIEG 241
Query: 284 LVPLGPPLLAYSSKCKAM---------LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
LVP G ++ + + + V ++ L+ D + G +++ G + N +
Sbjct: 242 LVP-GIDFCNHTRRASGLWEIDGSDGSTSGVPHSMYLIADVVFPPGSEVLINYGDKGNEE 300
Query: 335 LLINYGFVDEDNPYDRLVV 353
LL YGFV+EDN D ++V
Sbjct: 301 LLFLYGFVEEDNSNDYVMV 319
>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 466
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/246 (22%), Positives = 101/246 (41%), Gaps = 27/246 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++L T++ + + L + + L+ LA+Y+++ +
Sbjct: 28 VKARRRFKQGERILTIPSALHWTVQHAQADSLLGPALRSARPPLTVEDTLAVYVLFVRS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
RE R +A S + ++E EL G+ + + I+
Sbjct: 87 ----------RESGYNGPRSHVAALPTSYSSSIFFTEDELEVCAGTSLYTITKQLKQRIE 136
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
+Y +L + P P FT +K A V S + + + L
Sbjct: 137 DDYKDL------IARVLGPRPDLFPLNKFTIHHYKWALCTVWSRAMDFELYDGSSMRLLA 190
Query: 286 PLGPPLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
P +L +SS+ K + A + ++ + Y+AG+ + + G PNS+LL YGFV
Sbjct: 191 PFAD-MLNHSSESKQCHVYDASTGNLSILAGKDYEAGDQVYIHYGSIPNSRLLRLYGFVI 249
Query: 344 EDNPYD 349
DNP D
Sbjct: 250 PDNPND 255
>gi|302803412|ref|XP_002983459.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
gi|300148702|gb|EFJ15360.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
Length = 536
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 86/217 (39%), Gaps = 34/217 (15%)
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SET 203
+ +L N + +L+ E+ +GK+SFW PYI L +L++ PLLW +ET
Sbjct: 97 VGPMLRKNDFRPWLTMCAHLLVERSRGKESFWHPYIAALPSV---DELSISHPLLWPAET 153
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVW---FMAGSLFQQYPYDIPTEAFTFEIFK 260
L GSP I R + + ++ L T F+ G E
Sbjct: 154 IQELLQGSPMLDTIATRLKLCQEDHEALLTAGIEKFLPGG----------------ETLS 197
Query: 261 QAFVAVQSCVVHLQKVSLA-------RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ--- 310
+ V S V+ + SL LVP L SS + D +
Sbjct: 198 EGDVRWASAVLLSRAFSLELDVDDDFDTLCLVPWADMLNHCSSAGEESCLIFDQDTKTAS 257
Query: 311 LVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDEDN 346
L + Y G+ + GP S+L ++YGFVD++N
Sbjct: 258 LEAHKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDEN 294
>gi|171679805|ref|XP_001904849.1| hypothetical protein [Podospora anserina S mat+]
gi|170939528|emb|CAP64756.1| unnamed protein product [Podospora anserina S mat+]
Length = 468
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 80/199 (40%), Gaps = 29/199 (14%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E + K S+W PYI L + A+ P +W E ++ L + + E
Sbjct: 104 FFLVKEYLKEKDSYWWPYISTLPQPDRVDTWAL--PAVWPEDDIECLEETNAHVAVREIQ 161
Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
IK+EY L V F + Q Y FT F+ + + Q H+
Sbjct: 162 ANIKKEYKHARKLLKEVDFPGWQEYTQLLYKWAFCIFTSRSFRPSLILSQETQDHV---- 217
Query: 278 LARRFALVPLGP---------PLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
L P G PLL +S+ + L VD QL+ + Y+ G+ +
Sbjct: 218 ----LGLTPHGTKVDDFSILQPLLDIGNHDPTSQYQWNLE-VDGTCQLICNNAYQPGQQV 272
Query: 324 VVWCGPQPNSKLLINYGFV 342
G + NS+LL+ YGF+
Sbjct: 273 FNNYGLKSNSELLLGYGFI 291
>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 459
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/242 (23%), Positives = 102/242 (42%), Gaps = 31/242 (12%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P ++ T++ + + L + + LS LA Y+++ K
Sbjct: 34 FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S +L++E +L G+ + + I+ ++ L
Sbjct: 87 ----RESGYDGQRSHIAALPASYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP-- 289
LF Q+P P + FT E +K A V S + LA ++ L P
Sbjct: 143 VV------RLFVQHPDLFPLDKFTVEDYKWALCTVWSRAMDF---VLADGNSIRLLAPFA 193
Query: 290 PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+L ++S+ K + + + + Y+AG+ + + GP PNS+LL YGFV NP
Sbjct: 194 DMLNHTSEVKQCHVYDPSSGTLSVFAGKDYEAGDQVFINYGPVPNSRLLRLYGFVIPGNP 253
Query: 348 YD 349
D
Sbjct: 254 ND 255
>gi|260807503|ref|XP_002598548.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
gi|229283821|gb|EEN54560.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
Length = 448
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/250 (22%), Positives = 105/250 (42%), Gaps = 31/250 (12%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKSFWL 177
++ G +P ++++ + VL + + + +L+ + + +L+Y+K G+ SFW
Sbjct: 65 IKRGQTMIKMPQHMILSTKTVLDSVLGPYIESAEPQLTTIQAITTFLIYQKHIGETSFWK 124
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + P+ + E + YL S +A I + + + Y EL +
Sbjct: 125 PYLDILPNE-------YTHPVYFGEEDFLYLPHS-LRANIKAKKQECIKSYEELKPFFPS 176
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAF--VAVQSCVVHLQKVSLARRF--------ALVPL 287
L + FTF+ ++ A+ V +S V + ++ R +LVP+
Sbjct: 177 LEPLLPNW-----EGIFTFDAYRWAWSTVKTRSLYVDDKGSTVLRNLDKSGLGVTSLVPM 231
Query: 288 GPPLLAYSSKCKAML------AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
LL +S + L D + + YK G+ ++ N LL+NYGF
Sbjct: 232 -VDLLNHSHSARTGLLIKKSCKNGDYFYTVTAEDDYKRGDQVLFCYRRADNQTLLLNYGF 290
Query: 342 VDEDNPYDRL 351
V DN D +
Sbjct: 291 VLPDNHLDTI 300
>gi|322712432|gb|EFZ04005.1| histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF
23]
Length = 462
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/242 (22%), Positives = 106/242 (43%), Gaps = 19/242 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P+ L T++ + + L + + LS LA+++++ +
Sbjct: 28 VKARRRFKQGERILTIPSGLHWTVKHAQNDSLLGPALCSAQPPLSVEDTLAVHILFVRS- 86
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
++S + ++R + S + +++ EL G+ + + I+ +Y +
Sbjct: 87 -RESGYDGLRSHVERLPA----SYSSSIFFTDDELEVCAGASLYTITKQLQQRIEDDYRD 141
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
L + QYP P + FT +K A AV S + Q + L P
Sbjct: 142 LVV------RVLVQYPDLFPLDKFTLHHYKWALCAVWSRAMDFQLSDGSSIRLLAPFAD- 194
Query: 291 LLAYSS---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+L +SS +C A+ D + ++ + Y+AG+ + + G PN +LL YGF+ NP
Sbjct: 195 MLNHSSESKQCHVYDASSGD-LSVLAGKDYEAGDQVYIHYGSIPNHRLLRLYGFIIPGNP 253
Query: 348 YD 349
D
Sbjct: 254 ND 255
>gi|422293951|gb|EKU21251.1| hypothetical protein NGA_2061300, partial [Nannochloropsis gaditana
CCMP526]
Length = 452
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 62/274 (22%), Positives = 110/274 (40%), Gaps = 35/274 (12%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L W +KNG+ I S + A+ ++ G+ +VP +L +++ V
Sbjct: 66 LLEWCNKNGIKDASKITIGPVSQAGMGLGL---VATAPIKQGETLATVPLNLCFSMDSVR 122
Query: 141 GN---ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + I E L + + +AL L+YE G KS + YI+ L R GQ + P
Sbjct: 123 ASPLGKVIGEF--EPALGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHP 177
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
L WS E L S T+ + + +Y W + +L + + ++F
Sbjct: 178 LFWSTAEQGVLAKSSTRNLGETLIDAVAEDYG-----WIQS-ALARGGISGLQADSFDLS 231
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA---------MLAAVDDA 308
F+ A V S + R A PLL +++ + +
Sbjct: 232 DFEWAVAVVLSRSFFAEN---GLRLA------PLLDMANRGEGCTNEPQIGGLGIFGGKG 282
Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
++++ DR G+ IV+ GP+ + L ++GFV
Sbjct: 283 LKVIADRDTDKGQEIVISYGPKSGIEFLEDHGFV 316
>gi|299472213|emb|CBN77183.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloropl [Ectocarpus
siliculosus]
Length = 460
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 64/270 (23%), Positives = 111/270 (41%), Gaps = 35/270 (12%)
Query: 84 WMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLG 141
W+ K+G L V+ P E+ + A++ ++ G + ++P SL +T L
Sbjct: 18 WLTKSGVRLTDNAVLAGRSPLAGERG-----LVAAKAIETGQSVLAIPQSLGLTATG-LK 71
Query: 142 NETIAELLTT--NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
+ IA+ + E +AL +++E+ QG+ S P+I L ++ G+L E PL
Sbjct: 72 SSGIAQYVEGFEGWTGETGLIALQVLWERAQGEGSKMAPWIAVLPKE---GEL--EMPLF 126
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
W E +L S T+ G + +E D W ++ + F ++P P + F F
Sbjct: 127 WGEADLTLADASSTRG-----ISGFVADVDE-DFAW-LSENAFAKHPKVFPADKFGPGDF 179
Query: 260 KQAFVAVQSCVVHLQK-------VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
+ A S + V A +L + P + + + AV L
Sbjct: 180 RWAVGVALSRSFFVDGELRLTPLVDFANHSSLRGVSEP----TGGTTGLFGS--KAVVLR 233
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+ Y+ GE V GP+ + L GFV
Sbjct: 234 AGKNYEEGEEFFVSYGPKGAAGYLEENGFV 263
>gi|308809221|ref|XP_003081920.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116060387|emb|CAL55723.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 403
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/171 (29%), Positives = 75/171 (43%), Gaps = 40/171 (23%)
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
A LA+ LM + G + W Y L AV+S ++WS+ EL L GS +
Sbjct: 47 ATLAVALMQQTNGGASARWRAYCDAL-------PAAVDSLMMWSDEELEVLQGSALRQRA 99
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDI-PTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ R + KREY+ L F A L + P EA++F++F+ A+ V
Sbjct: 100 VFRRDLCKREYDAL----FPA--LARADPETFGDVEAYSFDVFRWAYATV---------- 143
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVD------DAVQLVVDRPYKAGE 321
+AR F L L +C A+L +D DA + VV+R A E
Sbjct: 144 -MARAFVLPDL---------QCMALLPGLDIYNSARDAEKCVVERDEGACE 184
>gi|242823770|ref|XP_002488126.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218713047|gb|EED12472.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 480
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 65/266 (24%), Positives = 106/266 (39%), Gaps = 37/266 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A D+Q G+ F +P +V+ ++ NE +A+ L L L + ++YE G+
Sbjct: 50 VVARSDIQEGEDLFHLPQRVVLMVKTSPLNEILADEL--KNLGPWLSLVVVMIYEYSLGE 107
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSP---------TKAEILERAEG 223
+S W Y + L + ++ + WS EL+ L S + +I E+
Sbjct: 108 RSNWNQYFQVLPTK-------FDTLMFWSGEELSQLQASAVIHKIGKKDAEEDIFEKIIP 160
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTE--------------AFTFEIFKQAFVAVQSC 269
+ R + +L F + Y D + A+ F+I K +
Sbjct: 161 LVRSHPDL----FPPVNGVMSYDDDAGAQALLELAHRMGSLIMAYAFDIEKGEEEESEGE 216
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGP 329
+L +VPL L A + + A L D A+ + +P K G+ I G
Sbjct: 217 DGYLTDDEEQLPKGMVPLADLLNADADRNNARLFQEDGALVMRAIKPIKTGDEIFNDYGE 276
Query: 330 QPNSKLLINYGFVDEDNPYDRLVVEV 355
P S LL YG+V DN VVE+
Sbjct: 277 LPRSDLLRRYGYV-TDNYAQYDVVEL 301
>gi|387193935|gb|AFJ68731.1| hypothetical protein NGATSA_2061300, partial [Nannochloropsis
gaditana CCMP526]
Length = 446
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 62/274 (22%), Positives = 110/274 (40%), Gaps = 35/274 (12%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L W +KNG+ I S + A+ ++ G+ +VP +L +++ V
Sbjct: 60 LLEWCNKNGIKDASKITIGPVSQAGMGLGL---VATAPIKQGETLATVPLNLCFSMDSVR 116
Query: 141 GN---ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
+ + I E L + + +AL L+YE G KS + YI+ L R GQ + P
Sbjct: 117 ASPLGKVIGEF--EPALGDASLIALQLLYEAHMGPKSKYAVYIKSLPRP---GQDGFDHP 171
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
L WS E L S T+ + + +Y W + +L + + ++F
Sbjct: 172 LFWSTAEQGVLAKSSTRNLGETLIDAVAEDYG-----WIQS-ALARGGISGLQADSFDLS 225
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA---------MLAAVDDA 308
F+ A V S + R A PLL +++ + +
Sbjct: 226 DFEWAVAVVLSRSFFAEN---GLRLA------PLLDMANRGEGCTNEPQIGGLGIFGGKG 276
Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
++++ DR G+ IV+ GP+ + L ++GFV
Sbjct: 277 LKVIADRDTDKGQEIVISYGPKSGIEFLEDHGFV 310
>gi|356553227|ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Glycine max]
Length = 475
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 60/265 (22%), Positives = 109/265 (41%), Gaps = 44/265 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A DL+ G+ VP S ++T E V+ ++ + + + + LS L + L+YE +G
Sbjct: 55 LGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKG 114
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
K S W PY+ L + ++ E E L ++ E ++ +
Sbjct: 115 KTSRWHPYLMHLPH-------TYDVLAMFGEFEKHAL-------QVDEAMWVTEKAMLKA 160
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ W A SL Q + + FTF+ + A + S +H + L P+G L
Sbjct: 161 KSEWKEAHSLMQDLMF--KPQFFTFKAWVWAAATISSRTLH---IPWDEAGCLCPVG-DL 214
Query: 292 LAYSSKC--KAMLAAVDDAVQL---------------------VVDRPYKAGESIVVWCG 328
Y + + + +D A QL YK G+ +++ G
Sbjct: 215 FNYDAPGIEPSGIEDLDHAEQLDSHSWRLTDGGFEEDANAYCFYAREHYKKGDQVLLCYG 274
Query: 329 PQPNSKLLINYGFVDEDNPYDRLVV 353
N +LL +YGF+ ++NP D++ +
Sbjct: 275 TYTNLELLEHYGFLLQENPNDKVFI 299
>gi|298715435|emb|CBJ28046.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 719
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 65/274 (23%), Positives = 115/274 (41%), Gaps = 33/274 (12%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
W+ +G + E PS +E + A D+ GD +P++L+++ +
Sbjct: 26 WLRSHG---AAIDCVEWPS-SETESGVRGAVARRDIAPGDHMVIIPHALMMSEFHAKADP 81
Query: 144 TIAEL--LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
+ L T L LALY+M E + ++SF+ PY+R L ES LL
Sbjct: 82 KYGHVHRLNTRLLGSDNGLALYIMQEILKEERSFYWPYLRMLPTPCNLRNWNRESLLLLQ 141
Query: 202 ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
+ +L T + ++ ++L + RE T+ F++ S YP + +TFE+F
Sbjct: 142 DHKLVRRTAARSR-QLL----ALYRE-----TIEFLSSS----YPELYTADRYTFELFDF 187
Query: 262 AFVAVQSCVVHLQKVSLARRF---ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DR 315
A+ +Q+ + +R ALVP L + + K + + +
Sbjct: 188 AWRTIQA-------RAFGKRLKSSALVPFADCLNHGNVQTKYDFDVGGNGTFRLFPSGNN 240
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
Y ++ G + N LL++YGF DN +D
Sbjct: 241 RYPRNSEVLNSYGRRANDNLLLDYGFAMLDNEWD 274
>gi|330800139|ref|XP_003288096.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
gi|325081857|gb|EGC35358.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
Length = 525
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 69/156 (44%), Gaps = 16/156 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ +++DL+ + +P +++++ +I+ +LT + A+ L+YE G+
Sbjct: 82 IISNKDLKVNNIVAKIPKDIILSIHT----SSISNILTKYTMERNIATAIALIYEASIGE 137
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
KS W YI L L V+ P+LW + L G+ + I + I Y ++
Sbjct: 138 KSKWYGYISSL-------PLKVDIPILWDKESQQLLNGTVMEDVIQDDNILINHAYADI- 189
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
+ L + +P E F+FE FK A V S
Sbjct: 190 ----VESLLIKNHPEYFSKEIFSFENFKIANSIVSS 221
>gi|159479580|ref|XP_001697868.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158273966|gb|EDO99751.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 475
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 70/277 (25%), Positives = 112/277 (40%), Gaps = 39/277 (14%)
Query: 83 SWMHKNGLPPCKV-----ILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+W K G K IL +KP + AS D+Q G++ VP++ V++
Sbjct: 46 AWATKQGAKLEKANLSTDILTDKP----------ILVASADVQPGESLIVVPDAAWVSVP 95
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
V T+ +L ++ L LAL L+ E+ KS Y L G +P
Sbjct: 96 NV-AKTTVGKLASSAGLEPWLQLALVLVAERFGSAKSELAGYASSLPEDLG-------TP 147
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
LLWSE E L G+ + + + +L LF P P FT
Sbjct: 148 LLWSEEETRALAGTQVAGTLNSYLTFFRSTFAQLQA------GLFTANPAAFPPAVFTLP 201
Query: 258 IFKQAFVAVQSCV---VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV- 313
F A AV+S + K++LA LV L A ++K + + Q+ V
Sbjct: 202 NFVWAVAAVRSRSHPPLEGDKIALA---PLVDLVSHRRAANTKLSVRSSGLFGRGQVAVV 258
Query: 314 --DRPYKAGESIVVWCGP-QPNSKLLINYGFVDEDNP 347
R + GE++ + P + + +L++YG +D +P
Sbjct: 259 EATRAIRKGEALGMDYAPGKLDGPVLLDYGVMDTASP 295
>gi|116206234|ref|XP_001228926.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
gi|88183007|gb|EAQ90475.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
Length = 442
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 79/195 (40%), Gaps = 20/195 (10%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ E +GK SFW PY+ L + P W E ++AYL + I E
Sbjct: 113 FFLIKEYLKGKDSFWWPYLATLPSPDQVNAWVL--PAFWPEDDIAYLECTNAHVAIQEIQ 170
Query: 222 EGIKREYNE----LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---- 273
+K E+ + L F + + Y FT F+ + + + H+
Sbjct: 171 ANVKGEFKQARKILKNENFPDVAAYTSLMYKWAFTIFTSRSFRPSLILSDTTKRHISTLL 230
Query: 274 -QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWC 327
Q V L F+++ PLL ++ + + D DA LV Y G +
Sbjct: 231 PQSVEL-DDFSILQ---PLLDIANHSPTAVYSWDTTSPADACTLVCGDRYPPGAQVFNNY 286
Query: 328 GPQPNSKLLINYGFV 342
G + NS+LL+ YGF+
Sbjct: 287 GLKTNSELLLGYGFI 301
>gi|403342378|gb|EJY70508.1| SET domain containing protein [Oxytricha trifallax]
Length = 653
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 59/124 (47%), Gaps = 14/124 (11%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGN------ETIAELLTTNKLSELACLALYLMYE 167
AA +++ D VP +++T+ER L + + A + + + L ++L+YE
Sbjct: 56 AAKLNIKNNDVIVYVPQKVLITVERALASPIGFIFDNHASIFKATEDRDYLVLLVFLIYE 115
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
++G +SFW PY +D G L P WS+ + L S K +I + + + +
Sbjct: 116 HQKGTRSFWHPYFEAID----PGLL----PCFWSDQTIEELADSELKDQIRQERDNYEED 167
Query: 228 YNEL 231
++ L
Sbjct: 168 WDML 171
>gi|449506720|ref|XP_004162829.1| PREDICTED: uncharacterized LOC101212907 [Cucumis sativus]
Length = 559
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/250 (23%), Positives = 103/250 (41%), Gaps = 28/250 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A EDL GD +P +++++ E V + L + + L+ M EK
Sbjct: 189 AKEDLDVGDTVLEIPLAIIISEELVQKSTMYPVLSKVEGMLPETMMLLWSMKEKHIVDSE 248
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F + Y L A + L + + L G+ E+++ E ++++YNEL
Sbjct: 249 FRV-YFDTLPE-------AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL--- 297
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
+L +P P E +++E F A S + + R LVP
Sbjct: 298 ---FPALCNNHPDIFPEEFYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHS 354
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
L P +L Y + + D+++ + RP +AGE + G S L+ YGF+ E D
Sbjct: 355 LHPHILHYGK-----VDSDTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGD 409
Query: 346 NPYDRLVVEV 355
N D + +++
Sbjct: 410 NVNDVIPLDI 419
>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
Length = 433
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 63/258 (24%), Positives = 103/258 (39%), Gaps = 39/258 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETI---AELLTTNKLSELACLALYLMYEKKQG 171
A+ +L++ FS+P SLV+++ +++ +E+ T + + CL MYE+
Sbjct: 39 ATSNLRSETELFSIPRSLVLSVHTSPLPKSLPDWSEISTQGWVGLILCL----MYEQID- 93
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKREYNE 230
S W Y+ + +S + WS+ EL L GS +I E AEG Y+
Sbjct: 94 PASHWKRYLNSM-------PTCFDSLMFWSDDELRELQGSSVLDKIGREEAEG--SYYSI 144
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL------------ 278
L +F+ P EA++ ++ + + S H+
Sbjct: 145 LVPYLSKHADIFK------PLEAYSLALYHRCGSLILSRSFHVSNQDDSASDASDDDDAA 198
Query: 279 ---ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
+VP+ L A S A L DA+ + + AGE I PN+ L
Sbjct: 199 YHEVETVGMVPMADVLNAKSGSANACLVYHPDALVMTTTKEIAAGEQIFNTYNDPPNADL 258
Query: 336 LINYGFVDEDNPYDRLVV 353
L YG VDE N D + +
Sbjct: 259 LRRYGHVDEVNLNDNVEI 276
>gi|384254260|gb|EIE27734.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 724
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 71/277 (25%), Positives = 110/277 (39%), Gaps = 43/277 (15%)
Query: 113 VAASEDLQAGDAAFSVP--NSLVVT-LERVLGNETIAELLTTNKLSELACLALYLMYEKK 169
+AA++++ G+ S+P +LVV+ ER T +K +AL L+YE++
Sbjct: 102 MAAADNIAKGEVLVSLPVAAALVVSPKERSQLPGTFCSSAFYSKKPWYVQMALNLLYERQ 161
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G S PY+ L + +PL WSE +L L E+ + EG+KR +
Sbjct: 162 LGPASKLAPYVAALP-------VDFSTPLSWSEAQLQALCYPQLIREVATQREGLKRLHA 214
Query: 230 EL------------DTVWFMAGSLFQQY--PYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
EL D +W + + + PY PT + F + +
Sbjct: 215 ELAVSTPGTPITEQDLIWALQAVRSRAFSGPYAGPTWRSRLKTFGALGALAAASITVAHV 274
Query: 276 VSLARRFALVPL-GPPLLAYSSKCKAMLAAVD-----DAVQLVVDRPYKA---------- 319
++ A AL L +L+ K AM VD VQ V+ Y A
Sbjct: 275 LNGAIAAALFNLLYDVVLSQKVKWYAMCPVVDFLNHKSTVQSEVEYEYFADRFSVRCQSY 334
Query: 320 ---GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
GE + + G Q N LL YGFV+ P+D +
Sbjct: 335 FSKGEQVFISYGKQSNDSLLQYYGFVEPGIPHDTYTI 371
>gi|66813084|ref|XP_640721.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
gi|60468751|gb|EAL66753.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
Length = 1339
Score = 49.3 bits (116), Expect = 0.003, Method: Composition-based stats.
Identities = 66/312 (21%), Positives = 124/312 (39%), Gaps = 29/312 (9%)
Query: 42 SSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKP 101
S ++L KN+FS S + S EV + E +W+ G+ K+ +
Sbjct: 728 SLIQLSNLKNQFSKPTIKSVPFIKPSDEVYKRFE-------NWLKAGGVQFPKLQIANFT 780
Query: 102 SHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLA 161
+ V ++ + +A VP ++ ++ + + + L++ L
Sbjct: 781 DSTGRG-----VVTTKKVDENEAVVVVPKKYLINVDVAKAHPILGPIFEELHLNDDTILF 835
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L+++YEK SFW P+ L + + +S TEL L G+ + E
Sbjct: 836 LFVIYEKGNAN-SFWRPFYDTLPSY-------FTTSIHYSATELLELEGT----NLFEET 883
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
K++ N F L +QYP P F++E F A + S + L K+ + +
Sbjct: 884 LHTKQQLNSFRDYLF--PELSKQYPDIFPESQFSWENFLWARSLLDSRAIQL-KIDGSIK 940
Query: 282 FALVPLGPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
LVP+ + +++ + + +++ A I + G N +L + Y
Sbjct: 941 SCLVPMADMINHHTNAQISERFFDHDSQSFKMISSCNIPANNQIFLHYGALQNWELALYY 1000
Query: 340 GFVDEDNPYDRL 351
GF+ +N YD L
Sbjct: 1001 GFIIPNNIYDSL 1012
>gi|378731232|gb|EHY57691.1| hypothetical protein HMPREF1120_05719 [Exophiala dermatitidis
NIH/UT8656]
Length = 714
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 87/226 (38%), Gaps = 46/226 (20%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELD--RQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
A +L+ + G KS+W PYI L Q E+ LLW L G+ KA
Sbjct: 128 AFFLLEQLVLGDKSWWAPYISSLPTVEDVSHSQFEDEADLLW-------LEGTNLKAGFA 180
Query: 219 ERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTEAFTFEIFKQAFV--AVQSCVVHLQ 274
A K Y + G L Q + A+T+E F+ A +S +
Sbjct: 181 AEAARWKEMY--------LKGMHQLKQSQWENAVNGAYTWERFRWAMTIFGSRSFTSQVL 232
Query: 275 KVSLARRFALVP------------LGP----------PLLAYSSK---CKAMLAAVDDAV 309
+L AL+ LG PL+ S+ K A V
Sbjct: 233 DATLPADKALLQQYRHDDGRDLCVLGELFAQHFGVLLPLVDISNHKPGAKVEWQARYSFV 292
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
L V PY++G+ I GP+ N LL+ YGF DNP+D +V+ +
Sbjct: 293 GLQVLEPYESGQEIFNNYGPRDNETLLVAYGFTIPDNPFDHVVISI 338
>gi|328772335|gb|EGF82373.1| hypothetical protein BATDEDRAFT_86177 [Batrachochytrium
dendrobatidis JAM81]
Length = 966
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 69/312 (22%), Positives = 123/312 (39%), Gaps = 60/312 (19%)
Query: 71 VSKKEEDLGDLKS---WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFS 127
++K + L L+S W+H NG+ + +K+ + I ++ + G+
Sbjct: 548 TAEKLDQLASLESFTQWLHANGINTDGISIKKVDDSKDVGLGIF---STRQIHKGECLVK 604
Query: 128 VPNSLVVTLERVLGNETIA-----ELLTTNKL--SELACLALYLMYEKKQGKKSFWLPYI 180
+P L+ +L N+T A ++ +N L ++ + + + + ++ S W PY
Sbjct: 605 IP------LKLILSNDTSAMPALNSIVKSNVLLKTDPSVILVIRLLQEYINPMSLWQPYF 658
Query: 181 RELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGS 240
L R P+L S +LA TG+ E++ + R+Y L +
Sbjct: 659 DLLPR-------VFTIPVLGSAQDLAAYTGTSIIDEVVHDMIALMRQYLYLQHI------ 705
Query: 241 LFQQYPYD-IPTEAFTFEIFKQAFVAVQS-----CVVHLQKVSLARRFALVPL------- 287
F+ P IP FTF F A V + C + + + L+PL
Sbjct: 706 -FKSIPEPPIPLADFTFAAFSWARAIVSTRQNEICYANPSTSEMQQFLCLIPLFDMFNHK 764
Query: 288 -GPPLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
G + +K C +A+ D GE I + G + N ++L+ GFVD
Sbjct: 765 PGNSTTQFDTKEYCSETIASCD----------VSPGEQIFIHYGKRSNQEMLLYSGFVDP 814
Query: 345 DN-PYDRLVVEV 355
N YD + + V
Sbjct: 815 TNIEYDHIKLSV 826
>gi|366987955|ref|XP_003673744.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
gi|342299607|emb|CCC67363.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
Length = 499
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 58/268 (21%), Positives = 112/268 (41%), Gaps = 43/268 (16%)
Query: 112 YVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE------LLTTNKLSELACLALYLM 165
++ A+ED++ + F +P ++ VL + ++E +L + L + ++
Sbjct: 42 FILATEDIKTDELLFEIPRESILN---VLTSSLVSEYPAWENILLDGDVGHWEGLIICML 98
Query: 166 YEKKQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAE 216
+E K K S W PY L L + W+ EL L G+ +
Sbjct: 99 FEIKVKKNMSKWAPYFDVLPESTDLNSL-----MYWTAEELEALKPSLVLDRIGNDGAHQ 153
Query: 217 ILERAEGIKREYNELDTV-----------WFMAGSLFQQYPYDI---PTEAFTFEIFKQA 262
+ E+ + R + + +V + S+ Y +D+ PT A E +
Sbjct: 154 MHEKVMELIRTFEKDHSVDLSFGTITWEDFLYVASIIMSYSFDVELPPTSADENEEDDEV 213
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
V+ V + + +++PL L + ++KC A L +D++++ KAGE
Sbjct: 214 EEDVEQTVRNEGSLK-----SMIPLADTLNSDTNKCNAHLIYDEDSLKMRAISNIKAGEQ 268
Query: 323 IVVWCGPQPNSKLLINYGFVD-EDNPYD 349
+ G PN+++L YG+V+ E + YD
Sbjct: 269 VYNIYGNHPNAEILRRYGYVEWEGSKYD 296
>gi|55953063|ref|NP_001007260.1| SET domain-containing protein 4 isoform 2 [Homo sapiens]
gi|12804091|gb|AAH02898.1| SET domain containing 4 [Homo sapiens]
gi|119630161|gb|EAX09756.1| SET domain containing 4, isoform CRA_a [Homo sapiens]
Length = 307
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 59/245 (24%), Positives = 100/245 (40%), Gaps = 22/245 (8%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSF 175
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S
Sbjct: 67 SLQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSL 125
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 126 WKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFF 177
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--P 290
LF + I F++ A+ V + V+L Q+ L+ L P
Sbjct: 178 SSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLD 233
Query: 291 LLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
LL +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 234 LLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 293
Query: 349 DRLVV 353
+ V
Sbjct: 294 ACVYV 298
>gi|114684050|ref|XP_001168792.1| PREDICTED: SET domain-containing protein 4 isoform 4 [Pan
troglodytes]
gi|410222534|gb|JAA08486.1| SET domain containing 4 [Pan troglodytes]
gi|410259178|gb|JAA17555.1| SET domain containing 4 [Pan troglodytes]
gi|410287502|gb|JAA22351.1| SET domain containing 4 [Pan troglodytes]
gi|410336607|gb|JAA37250.1| SET domain containing 4 [Pan troglodytes]
Length = 307
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 58/240 (24%), Positives = 98/240 (40%), Gaps = 22/240 (9%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSF 175
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S
Sbjct: 67 SLQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSL 125
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 126 WKPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFF 177
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--P 290
LF + I F++ A+ V + V+L Q+ L+ L P
Sbjct: 178 SSLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLD 233
Query: 291 LLAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
LL +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 234 LLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 293
>gi|329663327|ref|NP_001192753.1| SET domain-containing protein 4 [Bos taurus]
gi|296490853|tpg|DAA32966.1| TPA: SET domain containing 4 [Bos taurus]
Length = 440
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 131/324 (40%), Gaps = 47/324 (14%)
Query: 39 NFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILK 98
N G +RR+ F+ SS+ SR V + + +LK W LK
Sbjct: 3 NGGGRTSRIRRRKLFT----SSE-----SRGVNESYKPEFIELKKW------------LK 41
Query: 99 EKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN 152
++ + P H+ + + LQ G S+P S ++T + V+ + A +
Sbjct: 42 DRRFEDTTLIPAHFPGTGRGLMSKTSLQEGQTIISLPESCLLTTDTVIRSYLGAYIAKWQ 101
Query: 153 KL-SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
S L L +L+ EK G +S W PY+ L + A P+ E E+ L +
Sbjct: 102 PPPSPLLALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPN 153
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
P K + E+ + ++ + LF + I F++ + A+ AV + V
Sbjct: 154 PLKTKAWEQRSHVWEFFSSSRGFFSSLQPLFSEAVETI----FSYRALRWAWCAVNTRAV 209
Query: 272 HLQK---VSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE--SIV 324
++++ + L+ L P LL +S + A ++ + + G+ +
Sbjct: 210 YMKRPPLLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEVF 269
Query: 325 VWCGPQPNSKLLINYGFVDEDNPY 348
+ GP N +LL+ YGFV NP+
Sbjct: 270 ICYGPHDNHRLLLEYGFVCVSNPH 293
>gi|255075907|ref|XP_002501628.1| predicted protein [Micromonas sp. RCC299]
gi|226516892|gb|ACO62886.1| predicted protein [Micromonas sp. RCC299]
Length = 607
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 94/215 (43%), Gaps = 16/215 (7%)
Query: 145 IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE 204
+A + +L+ A LAL++++E +S Y+ L G+ +V PLLW+ T+
Sbjct: 137 VAASMGAPELATHAALALHVLFELGD-PRSEGFAYLATLPGLAGKASPSV--PLLWTPTQ 193
Query: 205 LAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY---DIPTEAFTFEIFKQ 261
+A L G+PT +L RA+ + + L G +++ + + + A + +
Sbjct: 194 VATLRGTPTHGRVLRRAKFVSDAHAALFGSGGGGGVPLEKFAWALSSVLSRAASGDRMPY 253
Query: 262 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 321
AF+ + H V + V L P + + D V V D P AGE
Sbjct: 254 AFLPGVDLLNH-GGVDANCELSAVKLAP------GGNEENVTWGDVEVTCVKDTP--AGE 304
Query: 322 SIVVWCGPQP-NSKLLINYGFVDEDNPYDRLVVEV 355
+ + G + N +LL YGF N +DR +E+
Sbjct: 305 QLTISYGDESDNCRLLRLYGFATRGNVHDRRTIEL 339
>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
impatiens]
Length = 484
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 63/288 (21%), Positives = 115/288 (39%), Gaps = 39/288 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+ + +G +W+ +NG + E P ++ + A + + +P L
Sbjct: 77 KRSQGIGRFINWLKQNGANVYGASVAEFPGYDLG------LKAERNFLENELILRIPREL 130
Query: 133 VVTLERV------LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ ++ L N+ + +L+ LA+ L+ EK + + S W PY+ L
Sbjct: 131 IFSIHNAAPELVALQNDPLLQLMPQ------VALAIALLIEKHK-EYSKWKPYLDILPT- 182
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP 246
+ L + ++ L GSPT L++ I R+Y +F LFQ+
Sbjct: 183 ------TYTTVLYMTAADMNELKGSPTLEAALKQCRNIARQY-----AYF--NKLFQKNN 229
Query: 247 YDIPT---EAFTFEIFKQAF--VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAM 301
+ + FT+E + A V + ++ + SL AL+P+ +SK
Sbjct: 230 NAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSKDGSLMIH-ALIPMWDMCNHENSKITTD 288
Query: 302 LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
A + + R +K E I + G + NS ++ GFV DN D
Sbjct: 289 FNATLNCCECYALRDFKKAEQIFISYGARTNSDFFVHSGFVYMDNEQD 336
>gi|121703688|ref|XP_001270108.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119398252|gb|EAW08682.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 492
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 66/269 (24%), Positives = 100/269 (37%), Gaps = 49/269 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
V A + G+ FS+P LV++ E N + LL+ + L EL L L ++YE
Sbjct: 50 VVAQSAIVEGEELFSIPRDLVLSTE----NSKLKSLLSQD-LGELGPWLSLMLVMIYEYL 104
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-------- 221
++S W PY R ++ + WS EL L GS +I +
Sbjct: 105 LREQSAWAPYYRIFPEN-------FDTLMFWSPAELQELQGSAIVDKIGRQGAEESILQM 157
Query: 222 ---------------EGIKREYNELDTVWFMA-----GSLFQQYPYDIPTEAFTFEIFKQ 261
+G+ E T + GSL Y +DI + +
Sbjct: 158 IAPVVKANPSLFPPIQGLSSWEGEAGTQALLGLAHVMGSLIMAYAFDIEKVNDEDDEDNE 217
Query: 262 AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE 321
+ Q +VPL L A + + A L +D++ + +P AG+
Sbjct: 218 GEDGYMTDEEEDQSSK-----GMVPLADILNADADRNNARLFQEEDSLVMKAIKPIAAGD 272
Query: 322 SIVVWCGPQPNSKLLINYGFV-DEDNPYD 349
I G P S LL YG+V D PYD
Sbjct: 273 EIFNDYGELPRSDLLRRYGYVTDNYAPYD 301
>gi|167521575|ref|XP_001745126.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776740|gb|EDQ90359.1| predicted protein [Monosiga brevicollis MX1]
Length = 390
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 58/282 (20%), Positives = 111/282 (39%), Gaps = 35/282 (12%)
Query: 75 EEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVV 134
EE+ +L W+ + G KV + + + A+ + G+ +P + ++
Sbjct: 24 EEEYDELVDWLKQCGATVDKVAVDHFNGMGQG------LKATAEAAPGETLLRIPEACML 77
Query: 135 --------TLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
TL + ++T+ +L+ L+ + + + SFW PYI L
Sbjct: 78 SEESARRSTLGAYMDSDTMLKLMPNVTLA-------FHLLLELHDLDSFWRPYIACL--- 127
Query: 187 RGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQY 245
++ PL W +L L GS E + + + R+Y L + + A +
Sbjct: 128 ----PVSYSVPLYWDLPDLMSLRGSSLFVEAIRLYKHVCRQYGYLHNKLSVRANPSCSCF 183
Query: 246 PY--DIPTEAFTFEIFKQAFVAVQSCVVHLQKVS----LARRFALVPLGPPLLAYSSKCK 299
P + EAFTFE ++ A V + + + + AL+PL + +
Sbjct: 184 PLTLGLSPEAFTFEDWRWAVATVMTRQNSIPQAGPDGQMKPTLALIPLWDMINHANHPMS 243
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
+ + ++ V P K G I +W G + N + L++ GF
Sbjct: 244 TQFDSERECLEFVCPAPAKPGSQITMWYGDRNNGQFLLHQGF 285
>gi|168016200|ref|XP_001760637.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687997|gb|EDQ74376.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 450
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 120/303 (39%), Gaps = 51/303 (16%)
Query: 81 LKSWMHKNGLPP--CKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
+ WM NG+ C++ +PS N ++ A ++ Q P L +T
Sbjct: 18 FRDWMQINGVQSRFCEI----RPSSNGENAGFGLFATKDNAQG--VLMVTPLLLAITPMT 71
Query: 139 VLGNETIA----ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
VL + + +L+ ++ + + L+L+ E+ +G+ SFW PY+ L + G
Sbjct: 72 VLQDPELGGHYCKLMEEGEVDDRLLIMLFLVIERARGRFSFWAPYLEILPFKFG------ 125
Query: 195 ESPLLWSETELAYLTGSPT-KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
+PL +SE EL+ L G+ +A + G+ LD A S+F +IP
Sbjct: 126 -TPLSFSEEELSELKGTHLFQATQQQSTTGLILRCPVLDR----ANSVFWTRALNIPCP- 179
Query: 254 FTFEIFKQAFVAVQSCVVH----------LQKVSLARRFALVPLGPPLLAYSSKCKAM-- 301
F F H V + + L P + + KA+
Sbjct: 180 ---HSFNNRFAVDLDSTTHKKPEESSAADTDDVKIPSSVWVEGLVPGIDFCNHDLKAVAL 236
Query: 302 ---------LAAVDDAVQLV--VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
+ V +++ LV +D G I + G + N +LL YGFV +NP D
Sbjct: 237 WEVDGPEGSVTGVPNSMYLVTGLDVVISNGSEIFISYGNKSNEELLYLYGFVLVENPDDY 296
Query: 351 LVV 353
L+V
Sbjct: 297 LMV 299
>gi|302836231|ref|XP_002949676.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
gi|300265035|gb|EFJ49228.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
Length = 484
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 68/250 (27%), Positives = 108/250 (43%), Gaps = 26/250 (10%)
Query: 105 EKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
+ RP+ + AS D Q GD FSVP+S ++ E V + +L L +AL L
Sbjct: 65 QTDRPV--LIASTDAQQGDVLFSVPDSAWLSAESVK-KAAVGKLAAAAGLEPWLQIALQL 121
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ ++ KS Y + +++PLLWSE EL L G+ ++L+ G
Sbjct: 122 VADRFGSTKSELSAYAASIPED-------LDTPLLWSEDELQELQGT----QVLQTLGGY 170
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLARR 281
+ T + LF P P FT F A AV+S + K++LA
Sbjct: 171 LTFFRS--TFQQLQSGLFTSNPAAFPPSIFTLPRFLWAVAAVRSRSHPPLDGPKIALA-- 226
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGP-QPNSKLLI 337
L L A +SK A + Q++V R + GE + + GP + + +L+
Sbjct: 227 -PLTELVSHRRAANSKLSVRSAGLFGRGQVLVLEATRAIRKGEPLSMDYGPGKLDGPVLV 285
Query: 338 NYGFVDEDNP 347
+YG +D +P
Sbjct: 286 DYGVMDVTSP 295
>gi|149742140|ref|XP_001496337.1| PREDICTED: SET domain-containing protein 4 [Equus caballus]
Length = 440
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 61/239 (25%), Positives = 100/239 (41%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQGKKSFWL 177
LQ G S+P S ++T + V+ + A + LS L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPLSPLLALCTFLVAEKHAGDRSVWK 127
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + A P+ E E+ L P KA+ E+ ++ + +
Sbjct: 128 PYLEVLPK-------AYTCPVC-LEPEVVDLLPKPLKAKAREQRTRLQAFFTSSRDFFSS 179
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PL 291
LF + I F++ F A+ V + V++ K R F+ P L P L
Sbjct: 180 LRPLFSEAVESI----FSYSAFLWAWCTVNTRAVYM-KPRRRRCFSAEPDTYALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + +A ++ + E + + GP N +LL+ YGFV NP+
Sbjct: 235 LNHSPDVQVRAGFNEETRCYEIRTVSSCRKHEEVFICYGPHDNQRLLLEYGFVSIHNPH 293
>gi|42567909|ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40
gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana]
gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana]
gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
Length = 491
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 55/264 (20%), Positives = 105/264 (39%), Gaps = 34/264 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTT-NKLSELACLALYLMYEKKQ 170
+ A+ +L+ G+ VP ++T E ++ + +++ + N LS L++ L+YE +
Sbjct: 51 LGAARELKKGELVLKVPRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSK 110
Query: 171 GKKSFWLPYIRELDRQRG---------RGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
KKSFW PY+ + R + L VE + +E A +A L +
Sbjct: 111 EKKSFWYPYLFHIPRDYDLLATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKE 170
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
+K ++ W A + +P ++ V L
Sbjct: 171 LELKPKFRSFQ-AWLWASATISSRTLHVPWDS----------AGCLCPVGDLFNYDAPGD 219
Query: 282 FALVPLGPP---------LLAYSSKCKAMLAAVDDAVQ---LVVDRPYKAGESIVVWCGP 329
++ P GP L+ + + ++ V L R Y+ GE +++ G
Sbjct: 220 YSNTPQGPESANNVEEAGLVVETHSERLTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGT 279
Query: 330 QPNSKLLINYGFVDEDNPYDRLVV 353
N +LL +YGF+ E+N D++ +
Sbjct: 280 YTNLELLEHYGFMLEENSNDKVFI 303
>gi|154272535|ref|XP_001537120.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150409107|gb|EDN04563.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 485
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 63/291 (21%), Positives = 111/291 (38%), Gaps = 42/291 (14%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ SW+ + P KV K K + + A +D+ + F++P SLV++ +
Sbjct: 19 EFMSWLKQR--PGVKVSPKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQSLVLSFQ-- 74
Query: 140 LGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N + +LL N+ CL + ++YE QG S W Y + L ++
Sbjct: 75 --NSRLKDLLDFNERDFDPWLCLIVVMIYEYLQGGASTWSRYFQLLPTN-------FDTL 125
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-------YDIP 250
+ W++ EL L+GS +L + E N + + +P +D P
Sbjct: 126 MFWTDEELRELSGSA----VLNKIGRSDAEANIFRNILPLVSGNPSLFPPMSGVASFDSP 181
Query: 251 ----------------TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
A+ F+I K + ++ +VPL L A
Sbjct: 182 EGKAALLSLAHRMGSLVMAYAFDIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNAD 241
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
+ + A L D + + +P + GE I G P + LL YG+V ++
Sbjct: 242 ADRNNARLFQEDCYLSMRSIKPIRKGEEIFNDYGELPRADLLRRYGYVTDN 292
>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
Length = 1753
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/247 (21%), Positives = 102/247 (41%), Gaps = 29/247 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A + G+ ++P++ + T E + + +L + + LS LA++L++ K
Sbjct: 912 VKALRSFKKGERILTIPSACLWTAEAARADPLLGPVLRSAQPPLSVEDTLAIHLLFVKS- 970
Query: 171 GKKSFWLPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIK 225
R + R +A + + ++E EL GS + + ++
Sbjct: 971 ----------RTAGYEGQRLHIAAMPQRHSASIFFAEDELQVCEGSSLHTLTTQLEQRVQ 1020
Query: 226 REYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
++ +L L Q+ P + FT E +K A + S + +
Sbjct: 1021 DDFRQLLV------QLLSQHRDLFPLDQFTIEDYKWALCTIWSRAMDFAVSDTTSVRLVA 1074
Query: 286 PLGPPLLAYS---SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
PL +L +S +C A D + ++ + Y+ G+ I ++ G PN++LL YGFV
Sbjct: 1075 PLAD-MLNHSLDVKQCHAYDPTSGD-LSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFV 1132
Query: 343 DEDNPYD 349
DNP D
Sbjct: 1133 LLDNPND 1139
>gi|326492674|dbj|BAJ90193.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 57/252 (22%), Positives = 103/252 (40%), Gaps = 29/252 (11%)
Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK 168
P+ + A +L G+ VP L + + V + + L ++L ++ E
Sbjct: 39 PVLGLVAERNLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREA 98
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G S W PY+ L RQ +S + WSE EL + G+ + + E ++ E+
Sbjct: 99 ARGGDSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEF 151
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
+ ++ AG + D+ TF+ F AF ++S V + + AL+P
Sbjct: 152 DNVE-----AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR---GDKLALIPFA 201
Query: 289 PPL----------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLI 337
+ + K K L D L K+GE I V + + N++L +
Sbjct: 202 DLINHNGDITSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLDKSNAELAL 260
Query: 338 NYGFVDEDNPYD 349
+YGF + ++ D
Sbjct: 261 DYGFTESNSSRD 272
>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 492
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 55/249 (22%), Positives = 106/249 (42%), Gaps = 31/249 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ +L G+ VP L + + V ++ + L ++L ++ E +G
Sbjct: 86 LVAARNLPRGEVVAEVPKKLWMDADAVAASDIGRACRSGGDLRPWVSVSLLILREAARGG 145
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L RQ +S + WSE EL + G+ + + E ++ E++ ++
Sbjct: 146 DSLWAPYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVE 198
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
+ G +P + TF+ F AF ++S V + + AL+P L+
Sbjct: 199 AK--IIGPNKDLFP-----DTITFDDFLWAFGILRSRVFPELR---GDKLALIPFA-DLI 247
Query: 293 AYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYG 340
+S+ + K L D L K+GE + V + + N++L ++YG
Sbjct: 248 NHSADITSKQSCWEIQGKGFLGR-DVVFSLRTPMEVKSGEQVYVQYDLDKSNAELALDYG 306
Query: 341 FVDEDNPYD 349
F + ++ D
Sbjct: 307 FTETNSTRD 315
>gi|145518912|ref|XP_001445328.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412772|emb|CAK77931.1| unnamed protein product [Paramecium tetraurelia]
Length = 761
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 67/141 (47%), Gaps = 23/141 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-LGNETIA-----ELLTTNKLS--ELACLALYL 164
V A++D+ A A VP L+++ E+ L + +I EL N+ S E L YL
Sbjct: 46 VVATKDIPANTAIICVPQPLIISQEKCKLSSLSIVYDKHPELFDENETSDAEFNILIFYL 105
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
EKK+G+KSF+ PY++ + + + + WS+ EL Y+ E I
Sbjct: 106 FNEKKKGEKSFYHPYVQAIQ--------SNNTLIDWSKEELNYIEDPIILDEF-----AI 152
Query: 225 KREYNELDTVWFMAGSLFQQY 245
RE +L +W A +F ++
Sbjct: 153 VRE--DLKDLWNQAKEIFNEF 171
>gi|426218421|ref|XP_004003445.1| PREDICTED: SET domain-containing protein 4 [Ovis aries]
Length = 439
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 70/318 (22%), Positives = 126/318 (39%), Gaps = 35/318 (11%)
Query: 39 NFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILK 98
N G +RR+ F SS+ SR V + + +LK W+ +I
Sbjct: 3 NGGGRTSRIRRRKLFR----SSE-----SRGVNESYKPEFIELKKWLKDRRFEDATLIPA 53
Query: 99 EKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SEL 157
P + + LQ G S+P S ++T + V+ + A + S L
Sbjct: 54 RFPGTGRG------LMSKTSLQEGQTIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPL 107
Query: 158 ACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
L +L+ EK G +S W PY+ L + A P+ E E+ L +P K +
Sbjct: 108 LALCTFLVSEKHAGDRSPWKPYLEVLPK-------AYTCPVC-LEPEVVNLLPNPLKTKA 159
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK-- 275
E+ ++ ++ + LF + I F++ + A+ V + V++++
Sbjct: 160 WEQRSHVQEFFSSSRGFFSSLQPLFSEAIETI----FSYRALRWAWCTVNTRAVYMKRPP 215
Query: 276 -VSLARRFALVPLGP--PLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGE--SIVVWCGPQ 330
+ L+ L P LL +S + A ++ + + G+ + + GP
Sbjct: 216 QLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEVFICYGPH 275
Query: 331 PNSKLLINYGFVDEDNPY 348
N +LL+ YGFV NP+
Sbjct: 276 DNHRLLLEYGFVSVSNPH 293
>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
gi|223949395|gb|ACN28781.1| unknown [Zea mays]
gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
Length = 489
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 65/251 (25%), Positives = 109/251 (43%), Gaps = 27/251 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ L +AL L+ E +G
Sbjct: 84 LVAARDLPRGEVVAEVPKKLWMDADAVAASDIGRACGGGGGLRPWVAVALLLLSEVARGA 143
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY--NE 230
S W PY+ L RQ +S + WSE EL + G+ ++L G+K EY +E
Sbjct: 144 DSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTVGVK-EYVQSE 191
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV---VHLQKVSLARRFALVPL 287
D+V S + D+ + TF+ F AF ++S V + K++L LV
Sbjct: 192 FDSVQAEIISTNK----DLFPGSITFDDFLWAFGMLRSRVFPELRGDKLALIPFADLVNH 247
Query: 288 GPPLLAYSS----KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFV 342
P + + S K K + + L K+G+ I + + + N++L ++YGFV
Sbjct: 248 SPNITSEGSSWEIKGKGLFGR-ELMFSLRTPVNVKSGQQIYIQYDLDKSNAELALDYGFV 306
Query: 343 DEDNPYDRLVV 353
+ + D V
Sbjct: 307 ESNPSRDSFTV 317
>gi|449520517|ref|XP_004167280.1| PREDICTED: LOW QUALITY PROTEIN: sulfate transporter 4.1,
chloroplastic-like, partial [Cucumis sativus]
Length = 923
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 92/219 (42%), Gaps = 39/219 (17%)
Query: 112 YVAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAELLTTNKLSELACLALYL 164
++ ASE ++AGD VP +SL + + +LGNE + +A LA+ +
Sbjct: 734 FLFASETIRAGDCILKVPFNVQISPDSLPLPIRDLLGNE----------IGNVAKLAVVV 783
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ E K G S W PYI L + + + + W E+EL + S E L + I
Sbjct: 784 LLEHKLGLGSEWAPYIIRLPQ-----PWEMHNTIFWKESELEMIRKSSLYEESLNQRSQI 838
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
KRE+ + + +P I + + + F A+ V S + +L
Sbjct: 839 KREFLAIRKA-------LEAFPEII--DRISCDDFMHAYALVTS-----RAWRSTEGVSL 884
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDA--VQLVVDRPYKAGE 321
+P L + +AML DD ++V DR + GE
Sbjct: 885 IPFA-DFLNHDGASEAMLLNDDDKQLSEVVADRDFAPGE 922
>gi|428163884|gb|EKX32933.1| hypothetical protein GUITHDRAFT_120884 [Guillardia theta CCMP2712]
Length = 320
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 55/227 (24%), Positives = 95/227 (41%), Gaps = 33/227 (14%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ + G+ VP L++ + L ++ LL +L + C+ L LM E S
Sbjct: 32 ASKRISPGETFLKVPRHLLLGPHQ-LRASSLDRLLEGWQLPD--CMLLLLMCESVN-SSS 87
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F+ PY+ L V++P+ WS+ E L GSP ++ + R + E
Sbjct: 88 FFRPYLDLLPD-------TVDTPITWSKEEAKELVGSPVLHRAVKLRHELARSFQE---- 136
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY 294
M +F +YP P F++E ++ A+ ++S + L+PL + +
Sbjct: 137 --MKDKVFDKYPDRFPPLLFSYERYQWAYSILRSRAFG--------NYTLMPLIDLMNHH 186
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC--GPQPNSKLLINY 339
A D + L+ R Y VW G + ++ LL+NY
Sbjct: 187 PDSRLAPTLLSDGSDALIARREYN------VWGFYGRKSDADLLLNY 227
>gi|426392958|ref|XP_004062802.1| PREDICTED: SET domain-containing protein 4 [Gorilla gorilla
gorilla]
Length = 440
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 99/239 (41%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G++S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGRRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 293
>gi|302804174|ref|XP_002983839.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
gi|300148191|gb|EFJ14851.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
Length = 236
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 79/186 (42%), Gaps = 29/186 (15%)
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+ S W PYI L G +++ LW +TEL+YL SP + ER E I E+ ++
Sbjct: 74 QSSAWAPYISCLPEPAG-----LDNTFLWEDTELSYLRASPLYGKTRERLEIITTEFGQV 128
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ LF + + E F + V S + +++ LV + P+
Sbjct: 129 QNALDVWPQLFGK---------VSVEDFMHVYATVFS-----RPLAIGEDSTLVMI--PM 172
Query: 292 LAYSSKCKAMLAAVD-----DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
L + + A A + + + DR + I + CG N++L ++YGF
Sbjct: 173 LDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGFT---V 229
Query: 347 PYDRLV 352
P +RL+
Sbjct: 230 PENRLI 235
>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
Length = 823
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 119/305 (39%), Gaps = 69/305 (22%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL---VVTLE 137
+ +W+ ++G V +K P + + +S++ GD S+P L + L
Sbjct: 4 ITTWVKEHGGHIDGVYVKNFPVYGNG------LCSSKEFHEGDTLLSIPYHLQLNTIELH 57
Query: 138 RVL-----GNET--IAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
V G E + E N+ E + + LYL K +K F PYI L
Sbjct: 58 NVFESMVPGFEVPRLGEG-AKNRDDENSVVYLYLAM-NKTNEKCFHFPYINTLPT----- 110
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
PL +SE EL L G+ ++L E K +L + +L QYP
Sbjct: 111 --TFSCPLSYSENELKMLKGT----KLLVTVEKTKTFLKKLSDYY---ETLTHQYP---- 157
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV---PLGP-----PLLAYSSKCKAML 302
T F+ F Q V +V +R F ++ P+G P +S+
Sbjct: 158 TRFQQFDDFYQRLVWAH-------QVFWSRAFLVIYPDPIGDVASLIPFADFSNH----- 205
Query: 303 AAVDDAVQLVVDRPYKA----GESIVVWCGPQ--------PNSKLLINYGFVDEDNPYDR 350
+ V V +R + V+ CG Q PN K+L+ YGFV +NPYD
Sbjct: 206 -NTETKVTYVSNRQTQTFSLQTNEKVLHCGEQIFNNYRIRPNEKMLLGYGFVISENPYDE 264
Query: 351 LVVEV 355
+++ +
Sbjct: 265 VLLRI 269
>gi|384246985|gb|EIE20473.1| rubisco small subunit N-methyltransferase, partial [Coccomyxa
subellipsoidea C-169]
Length = 363
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 54/107 (50%), Gaps = 10/107 (9%)
Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIREL 183
A +P +L VT V +E +A L EL LAL+LM E+++G++S W P++ L
Sbjct: 2 ALVELPGNLSVTAVDVAAHEEVAGL--AEGRGELTGLALWLMAERQKGEESRWAPFLECL 59
Query: 184 DRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILERAEGIKREYN 229
A SP+LW E + L SPT E R +++E++
Sbjct: 60 PE-------ATLSPVLWPEEVQDELLKNSPTLKECRARRAALQQEWD 99
>gi|358380690|gb|EHK18367.1| hypothetical protein TRIVIDRAFT_47382 [Trichoderma virens Gv29-8]
Length = 479
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 49/207 (23%), Positives = 85/207 (41%), Gaps = 32/207 (15%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L+L+ E +G++SFW PYI+ L + A+ P W E E L G+ + + +
Sbjct: 91 LFLIKELLRGQESFWYPYIQALPQPEDFDDWAL--PPFWPEEEAELLEGTNVEIGLEKIR 148
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-------CVVHLQ 274
E + RE+ + + + Q+ D ++ T E+++ A+ S + Q
Sbjct: 149 EDLGREFRDARNLLIAS----QKDAEDDFSDHLTRELYQWAYCIFSSRSFRPSLVLSEEQ 204
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAV-------------------DDAVQLVVDR 315
+ SL ++ L + M V AVQL V R
Sbjct: 205 QQSLPDGVSVNDFSVLLPLFDIGNHDMTVHVRWDLAAGDEAAAGAGVRGSGAAVQLKVGR 264
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFV 342
+K G+ I P+ N++LL+ YGF+
Sbjct: 265 EHKPGQQIFNNYSPKTNAELLLGYGFM 291
>gi|452824261|gb|EME31265.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Galdieria sulphuraria]
Length = 546
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 79/197 (40%), Gaps = 28/197 (14%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E +L++W+ NG+P K P + R A L+ G+ ++P +T
Sbjct: 71 EKTEELENWLFDNGVPSIKGKPVLSPHNCRTFR------AKIPLKLGEEVLAIPERFWLT 124
Query: 136 L---ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
E++LG + LS+ +A L+ E + + SFW P+I L
Sbjct: 125 KQLSEKLLG-------FHVSDLSDEEAIAALLLVETARKETSFWKPWIETLPSSDELHHF 177
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-DIPT 251
L+WS E YL S T +IL E + EL+T LF ++ Y
Sbjct: 178 -----LVWSTAETQYLESSSTFEDILSLRETASLVFEELNT------ELFPKFLYPQYDV 226
Query: 252 EAFTFEIFKQAFVAVQS 268
+ FT F A VQS
Sbjct: 227 KYFTLPYFTWALSIVQS 243
>gi|326472332|gb|EGD96341.1| SET domain-containing protein [Trichophyton tonsurans CBS 112818]
Length = 485
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 58/288 (20%), Positives = 105/288 (36%), Gaps = 56/288 (19%)
Query: 97 LKEKPSHNEKHRPIHY-----------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI 145
LK H + H IH + AS D+ + F +P+ L+++++ +
Sbjct: 24 LKRSSPHFKMHSGIHIADLRSIGAGRGICASRDIAEDEELFVIPDDLILSVQNSEARSVL 83
Query: 146 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
L +L L + ++YE QG++S W PY R L + ++ + W++ +L
Sbjct: 84 G--LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQL 134
Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
+ L GS +I + A DT+ L Q P+ P +
Sbjct: 135 SELQGSAVVGKIGKAAAD--------DTILQKVVPLIQANPHHFPPRPNMPPLNSPDSQN 186
Query: 266 VQSCVVH-LQKVSLARRF---------------------------ALVPLGPPLLAYSSK 297
C+ H + + +A F +VPL A + +
Sbjct: 187 ALLCLAHRMGSIIMAYAFDIEKADEADEDTAEDGYMTDDEDEPAKGMVPLADIFNADAQR 246
Query: 298 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
A L + + + + +GE I G P + LL YG+V ++
Sbjct: 247 NNARLFQEEGSFVMKAIKNIHSGEEIFNDYGELPRADLLRRYGYVTDN 294
>gi|281201674|gb|EFA75882.1| tryptophan 2,3-dioxygenase [Polysphondylium pallidum PN500]
Length = 732
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 82/170 (48%), Gaps = 20/170 (11%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLE---RVLGNETIAELLTTNKLSELACLALYLMYEKK 169
+ A+ +++ GD VP L ++ E + +G++ E L K+ L ++L+ E +
Sbjct: 98 LIANTEIKEGDEFIKVPLKLFMSQETAFKSIGDKVSREPLF--KMLPNMLLVIHLIQETQ 155
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+ ++SFW PYIR L + + ++ L ++ E L GSP +LE E I N
Sbjct: 156 KQQQSFWAPYIRMLPK-------SYKTALYFTLAEFQLLIGSP----VLE--ESINTYRN 202
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
L F+ F + P + T FT+E F+Q +A +V L K LA
Sbjct: 203 TLRQYCFLY-DFFGKNPGILSTSNFTWE-FEQNELAAYKSIVSLLKKRLA 250
>gi|242066082|ref|XP_002454330.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
gi|241934161|gb|EES07306.1| hypothetical protein SORBIDRAFT_04g028760 [Sorghum bicolor]
Length = 490
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 64/289 (22%), Positives = 113/289 (39%), Gaps = 44/289 (15%)
Query: 80 DLKSWMHKNG---LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTL 136
DL W+ + G P +V +H E + A D+ GD ++P L + L
Sbjct: 60 DLVRWVQREGGFVHPALRVA-----NHPEHGLGVSAAAPDGDIPPGDVLIALPGRLPLRL 114
Query: 137 ERVLGN-ETIAELLTTNKLSELACLALYL-MYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
R G + + L EL + L L + +++ SFW PYI L
Sbjct: 115 RRPTGAADDVLVQLAQQVPEELWAMKLGLRLLQERAKSDSFWWPYIANLPE-------TF 167
Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
P+ + ++ L +P ++ +R + E+ QQ + +P+
Sbjct: 168 TVPIFFPGEDIKNLQYAPLLHQVNKRCRFLLEFEKEI-----------QQKLHTVPSVDH 216
Query: 255 TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC------------KAML 302
F + Q + S + + +R F L P LL C + +
Sbjct: 217 PF--YGQDVNS--SSLGWAMSAASSRAFRLHGEIPMLLPLIDMCNHSFNPNARIVQEGSV 272
Query: 303 AAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
++D +V++V ++ + SI + G PN L++YGFV NPYD++
Sbjct: 273 NSLDMSVKVVAEKKIEQNASITLNYGCHPNDFFLLDYGFVITPNPYDQV 321
>gi|50556556|ref|XP_505686.1| YALI0F20944p [Yarrowia lipolytica]
gi|49651556|emb|CAG78495.1| YALI0F20944p [Yarrowia lipolytica CLIB122]
Length = 402
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 102/211 (48%), Gaps = 35/211 (16%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIREL-DRQRGRGQLAVESPLLWSETELAYLTGSP 212
+S LAL+L+ ++ G KS W ++ L DR+ G ++ PL WS+ + LT P
Sbjct: 79 MSAHQVLALFLVIQQSLGSKSDWKAFMGLLPDRKEG----FLDVPLQWSKEDQDSLT--P 132
Query: 213 TKAEILERA-EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CV 270
+L++ + + +Y++ T F+A +Y D P +A+ + A++ V S C+
Sbjct: 133 EGIVVLKKTLDTFEADYDKTKT--FVA-----KYDSD-PRDAYLW-----AWLCVNSRCL 179
Query: 271 VHLQKVSLARRFAL-----VPLGP--PLLAYS-----SKCKAMLAAVDDAVQLVVDRPYK 318
++ ++ A + L P L+ +S + C+ +++ + L R Y
Sbjct: 180 YFDLTLTTGKKDAQEVPDNITLAPYVDLINHSVESGPTHCQLKTSSIGFEI-LCGQRGYT 238
Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
A E I + GP+ NS LL YGF +NP+D
Sbjct: 239 ADEEIFLCYGPRSNSVLLCEYGFTVPENPWD 269
>gi|345561352|gb|EGX44442.1| hypothetical protein AOL_s00188g347 [Arthrobotrys oligospora ATCC
24927]
Length = 468
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 56/241 (23%), Positives = 99/241 (41%), Gaps = 29/241 (12%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTT--NKLSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+S++ T+E + I +L + LS LA+Y+++ +
Sbjct: 34 FKEGERILTIPSSILWTVEHAYADSIIRPVLQSMQGALSVDDTLAIYILFVRS------- 86
Query: 177 LPYIRELDRQRGRGQL-----AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R + + S + +++ EL GS + + I+ +Y L
Sbjct: 87 ----RESGYNGLRSHVEALPTSYSSSIFFTDDELEVCAGSSLYTITKQLKQQIQDDYRTL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
LF QY FT E +K A V S + + L P +
Sbjct: 143 ------VERLFGQYLDIFSLGKFTIEDYKWALCTVWSRAMDFVQPDGKSIRLLAPFAD-M 195
Query: 292 LAYSS---KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +SS KC + D + ++ + Y+ G+ + + G PN++LL YGFV +NP
Sbjct: 196 LNHSSDVKKCHVYDTSSGD-LSILAGKDYEPGDQVFINYGSIPNNRLLRLYGFVVPNNPN 254
Query: 349 D 349
D
Sbjct: 255 D 255
>gi|302834219|ref|XP_002948672.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
gi|300265863|gb|EFJ50052.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
Length = 510
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 58/241 (24%), Positives = 98/241 (40%), Gaps = 54/241 (22%)
Query: 157 LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
A +A L++ K+QG +S P+I +L G P+LW E ++A L A+
Sbjct: 135 FAKMAAMLLWHKRQGSQSPLAPWIAQLPSDTG-------VPVLWDERQIAALQYPYLIAQ 187
Query: 217 ILERAEGIKREYNEL---------------DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ 261
+ E+ ++ Y +L D W M S + + P T + +
Sbjct: 188 VKEQQREWQQLYGDLVRSGTPAGVQAPSREDFFWAM--SCVRSRTFSGPYIGSTLQDRLR 245
Query: 262 -----AFVAVQSCVVHL---QKVSLA-------------------RRFALVPLGPPLLAY 294
A +A + V+ L QK A +++A+ PL L +
Sbjct: 246 TAGLVAVLAAGNTVLGLADPQKTLSAAIAVLLFNVLYELILSRSLKQYAICPL-IDLFNH 304
Query: 295 SSKCKAMLAA--VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
SS ++ +A D+ +V R +K GE + + G Q N L+ YGF + +NP D V
Sbjct: 305 SSAVQSEVAYNYFGDSYSVVASREFKKGEQVFISYGAQSNDSLMQYYGFAEANNPQDVYV 364
Query: 353 V 353
+
Sbjct: 365 M 365
>gi|348676999|gb|EGZ16816.1| hypothetical protein PHYSODRAFT_251772 [Phytophthora sojae]
Length = 424
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 60/256 (23%), Positives = 105/256 (41%), Gaps = 42/256 (16%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK---LSELACLALYLMYEKKQG 171
A+ + +G+ +P L+++ + + + + N+ + LAL+L+ E
Sbjct: 43 AAAAVASGEPMLCIPRRLLISEDLCWRDPQLGRVFQDNRDVFTRDDPVLALFLVRELLLA 102
Query: 172 KKSFWLPYIRELDRQRGRGQLAV----ESPLLWSETELAYLTGSPTKAEILER-AEGIKR 226
+SF+ PY LAV ES W++ EL L ER + R
Sbjct: 103 DRSFFHPY------------LAVLPYPESVQDWTQAELGELHD--------ERLVDAAAR 142
Query: 227 EYNELDTVWFMAGSLFQ-QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF--- 282
+E+D + Q +YP + P +TF+ FK A+ +Q+ + RR
Sbjct: 143 RTSEIDVYYRRVMVRLQTKYPGEFPEALYTFDRFKFAWKTIQA-------RTFGRRLPWT 195
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV---DRPYKAGESIVVWCGPQPNSKLLINY 339
ALVP L + K D+ + + + G + G + N +LL++Y
Sbjct: 196 ALVPFADCLNHTNVATKYDFDVNDNGLFRLYPSGATSFAQGAEVFNSYGRRSNFQLLLDY 255
Query: 340 GFVDEDNPYDRLVVEV 355
GF DN +D + VE+
Sbjct: 256 GFALPDNEWDYVDVEI 271
>gi|449466129|ref|XP_004150779.1| PREDICTED: uncharacterized protein LOC101212907 [Cucumis sativus]
Length = 559
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 103/250 (41%), Gaps = 28/250 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A EDL GD +P +++++ E V + L + L+ M EK
Sbjct: 189 AKEDLDVGDTVLEIPLAIIISEELVQKSTMYPVLSKVEGMLPETMTLLWSMKEKHIVDSE 248
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F + Y L A + L + + L G+ E+++ E ++++YNEL
Sbjct: 249 FRV-YFDTLPE-------AFNTGLSFGVGAMTTLVGTLLFDELMQAKEHLRKQYNEL--- 297
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-------- 286
F A L +P P E +++E F A S + + R LVP
Sbjct: 298 -FPA--LCNNHPDIFPEEFYSWEEFLWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHS 354
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-D 345
L P +L Y + + D+++ + RP +AGE + G S L+ YGF+ E D
Sbjct: 355 LHPHILHYGK-----VDSDTDSLKFRLSRPCRAGEECYLSYGNYSGSHLVTFYGFLPEGD 409
Query: 346 NPYDRLVVEV 355
N D + +++
Sbjct: 410 NVNDVIPLDI 419
>gi|238494116|ref|XP_002378294.1| SET domain protein [Aspergillus flavus NRRL3357]
gi|317148877|ref|XP_001822982.2| SET domain protein [Aspergillus oryzae RIB40]
gi|220694944|gb|EED51287.1| SET domain protein [Aspergillus flavus NRRL3357]
Length = 478
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 79/303 (26%), Positives = 124/303 (40%), Gaps = 51/303 (16%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
SW+ +G P KV K + + V A D+ G+ F++P V++ + N
Sbjct: 22 SWL--SGKPGVKVNPKIRLADLRSRAAGRGVVAQSDIAEGEELFTIPREHVLSTQ----N 75
Query: 143 ETIAELLTTN--KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
+ +LL+ + +L L L ++YE G +S W Y + L R+ ++ + W
Sbjct: 76 SKLKDLLSQDVEELGPWLSLMLVMIYEYLLGDQSAWASYFKILPRK-------FDTLMFW 128
Query: 201 SETELAYLTGSPTKAEILER--AEGIKREYNELDTVWFMAG-SLF------QQYPYDIPT 251
S +EL L GS I++R EG + E+ A SLF Y D T
Sbjct: 129 SPSELQELQGSA----IVDRIGKEGAEESILEMIAPIVRANPSLFPPVDGLASYDGDAGT 184
Query: 252 EAF-------TFEIFKQAF-----------VAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
+A I AF +S V + L++ +VPL L A
Sbjct: 185 QALLNLAHVMGSLIMAYAFDIEKPEDEDDEGDDESGYVTDDEEQLSK--GMVPLADLLNA 242
Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLV 352
+ + A L + + + +P AG I G P + LL YG+V D +PYD V
Sbjct: 243 DADQNNARLFQEETGLVMKAIKPISAGAEIFNDYGEIPRADLLRRYGYVTDNYSPYD--V 300
Query: 353 VEV 355
VE+
Sbjct: 301 VEL 303
>gi|255070351|ref|XP_002507257.1| predicted protein [Micromonas sp. RCC299]
gi|226522532|gb|ACO68515.1| predicted protein [Micromonas sp. RCC299]
Length = 986
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 109/282 (38%), Gaps = 67/282 (23%)
Query: 113 VAASEDLQA----GDAAFSVPNSLVVTLERVLGNET---IAELLTTNK-LSELACLALYL 164
V A+E++ GD FS+P + ++T + T + EL ++ + + L +L
Sbjct: 45 VIAAENVNGAQDGGDTIFSIPITCLMTPAAAFADVTYGKVFELFAAHQSVEDRTVLVFFL 104
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
E+++G S W PYIREL +PL WS E L G+ R G
Sbjct: 105 AIERQRGMTSHWGPYIRELPS-------IFSNPLNWSRAETLRLAGT--------RLGGA 149
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA----- 279
+ F +L Q +P AF + Q ++ + + +SLA
Sbjct: 150 TK---------FHDCALLQLTEVCVP--AFIAILRAQLILSANTKAIASGAISLAQDALS 198
Query: 280 -----------------------RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV--- 313
R ALVPLG +L +S + D A Q ++
Sbjct: 199 PDRLAWSHSCVSSRAFSLFLNGQRTIALVPLG-DMLDHSPDAQIEWRTDDTAGQFLIISH 257
Query: 314 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DR AG + G + N +L++ YGF + + + L V +
Sbjct: 258 DR-LPAGSIMFNNYGAKSNEELILGYGFFMKSSVLETLYVRL 298
>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
Length = 430
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 57/245 (23%), Positives = 102/245 (41%), Gaps = 26/245 (10%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
H V A L +G VP L + E ++ L ++ + LAL+LM+E+ +
Sbjct: 35 HGVFAKRALTSGQVTLQVPFKLTMNTESAATSDLAPVLEKYPQIPDDEVLALHLMHERSK 94
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G +SF+ P+I + + P+ W+E EL L G+ + ++R++
Sbjct: 95 GGESFFAPFIASM-------PTTFDLPVFWTEAELNELKGTNVLLLTQLMKQHLERDFEN 147
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
+ + F +PT T + + A + S VS ++ V L P
Sbjct: 148 IHQA---VAADFPDIFASLPT--LTIDDYMWAMSVIWSRAF---GVSKGGKYLHV-LCPA 198
Query: 291 LLAYSSKCKAMLAAVDDAV------QLV---VDRPYKAGESIVVWCGPQPNSKLLINYGF 341
+ ++ + +DD V Q++ V AG ++ + G N+KLL +YGF
Sbjct: 199 MDMFNHDV-TVRKPLDDFVSFNEEKQMMTHHVPEDVAAGSAVHISYGQYSNAKLLYSYGF 257
Query: 342 VDEDN 346
V +N
Sbjct: 258 VSPEN 262
>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
Length = 593
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 56/246 (22%), Positives = 98/246 (39%), Gaps = 25/246 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ D++AG+ SVP L+ + E + + L N + L + L+ EK +G S
Sbjct: 206 ATRDIKAGEQVLSVPRKLIFSEELLPEKQ---RQLFRNFPTHLK-VTYTLIMEKLRGADS 261
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W P+I L + + L ++ ++ L G+ + + I R Y +
Sbjct: 262 PWQPFIDTLPSR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRVIARLYASMYKC 314
Query: 235 WFMA---------GSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
FM +LF Y Y++ A + +Q V Q + ++ A
Sbjct: 315 AFMQLDDSVMGGMANLFTDYGLCYELYRWAVSTVTTRQNLVPRQEIPSDAANLPIS---A 371
Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
L+P S K + ++ YK+GE ++ G + N+ L++ GFVD
Sbjct: 372 LIPYWDMANHRSGKITSFYDQAAGQMECTAQEAYKSGEQYFIYYGDRSNADRLVHNGFVD 431
Query: 344 EDNPYD 349
NP D
Sbjct: 432 MQNPKD 437
>gi|397507017|ref|XP_003824008.1| PREDICTED: SET domain-containing protein 4 [Pan paniscus]
Length = 440
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 98/239 (41%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 293
>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
Length = 511
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 52/242 (21%), Positives = 96/242 (39%), Gaps = 19/242 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A++D+ A VP + + E++ E + T + LA L+ EK +G S
Sbjct: 118 ATKDINADQQVLRVPRKKIFSEEQLSKTERESFCNFTTNFN----LANALVVEKSRGADS 173
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L + + L ++ ++ L G+ + L + I R+Y +L
Sbjct: 174 IWKPYIDVLPSR-------YNTVLYFTVEQMRRLRGTSVCSSALRQCRMIARKYAKLYAF 226
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR-------FALVPL 287
+ S + +E+++ A V + +L +A + AL+P
Sbjct: 227 AYCDSSYLRPDTGLFTQHGLCYELYRWAVSTVMT-RQNLVPREIATKDDGNSPISALIPC 285
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
K + + ++ KAG ++ G +PN+ LL++ GFVD +N
Sbjct: 286 WDMANHRPGKITSFYDSNAHQMECTAQEFCKAGNQFFIYYGDRPNADLLVHNGFVDPNNN 345
Query: 348 YD 349
D
Sbjct: 346 KD 347
>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
Length = 595
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 64/265 (24%), Positives = 120/265 (45%), Gaps = 38/265 (14%)
Query: 113 VAASEDLQAGDA------AFSVPNSLVVTLERVLG----NETIAELL-TTNKLSELACLA 161
+ A DL++ +A ++P+ LV++ E V + +LL + S +
Sbjct: 143 LVAHADLESAEADGTSKGPVTIPHDLVLSAEAVEDFAKVDHNFKQLLEAVGRQSTRGDIM 202
Query: 162 LYLMYEKKQGKK------SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
LYL+ + Q + + W YIR L R + P +W+E E L G+ +A
Sbjct: 203 LYLVSQFAQSSRPKGLSPTPWTEYIRLLPR-------PIPVPTMWTEPERLLLNGTSLEA 255
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
+ + + +E++ L V + +P+ E+ + +V V + +
Sbjct: 256 ALEAKLLSLGKEFDTLREV-------SEDFPFWNEFLWSGEEVSLEDWVLVDAWY-RSRC 307
Query: 276 VSLARR-FALVPLGPPLLAYSSKCKAMLAAVD-DAVQLVV--DRPYKAGESIVVWCG-PQ 330
+ L R A+VP G ++ +SSK A D D V L++ P ++GE + + G +
Sbjct: 308 LELPRSGTAMVP-GLDMVNHSSKATAYYEEDDHDNVVLLIRPGCPVRSGEEVTISYGDAK 366
Query: 331 PNSKLLINYGFVDEDNPYDRLVVEV 355
P S++L +YGF+D +N D+L + +
Sbjct: 367 PASEMLFSYGFIDPNNIVDKLTLRL 391
>gi|325095092|gb|EGC48402.1| SET domain-containing protein [Ajellomyces capsulatus H88]
Length = 485
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 56/258 (21%), Positives = 100/258 (38%), Gaps = 40/258 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P +LV+ + N + +LL N+ CL + ++YE Q
Sbjct: 50 IVADDDIGEDEELFAIPQNLVLGFQ----NSRLKDLLDFNERDFDPWLCLIVVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G S W Y + L ++ + W++ EL L+GS +L + E N
Sbjct: 106 GGASTWSRYFQLLPTN-------FDTLMFWTDEELRELSGSA----VLNKIGRSDAEANI 154
Query: 231 LDTVWFMAGSLFQQYP-------YDIPTE----------------AFTFEIFKQAFVAVQ 267
L + + +P +D P A+ F+I K +
Sbjct: 155 LRNILPLVSGNPSHFPPMSGVASFDSPEGKAALLSLAHRMGSLIMAYAFDIEKGENDGRE 214
Query: 268 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 327
++ +VPL L A + + A L D + + +P + GE I
Sbjct: 215 GQDGYVTDDEEELSKGMVPLADLLNADADRNNARLFQEDCYLSMRSIKPIRKGEEIFNDY 274
Query: 328 GPQPNSKLLINYGFVDED 345
G P + LL YG+V ++
Sbjct: 275 GELPRADLLRRYGYVTDN 292
>gi|332872029|ref|XP_001168891.2| PREDICTED: SET domain-containing protein 4 isoform 8 [Pan
troglodytes]
gi|410222532|gb|JAA08485.1| SET domain containing 4 [Pan troglodytes]
gi|410259176|gb|JAA17554.1| SET domain containing 4 [Pan troglodytes]
gi|410287500|gb|JAA22350.1| SET domain containing 4 [Pan troglodytes]
gi|410336605|gb|JAA37249.1| SET domain containing 4 [Pan troglodytes]
Length = 440
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 98/239 (41%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 293
>gi|8393013|ref|NP_059134.1| SET domain-containing protein 4 isoform 1 [Homo sapiens]
gi|12229715|sp|Q9NVD3.1|SETD4_HUMAN RecName: Full=SET domain-containing protein 4
gi|7023055|dbj|BAA91819.1| unnamed protein product [Homo sapiens]
gi|119630162|gb|EAX09757.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
gi|119630163|gb|EAX09758.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
gi|119630165|gb|EAX09760.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
Length = 440
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 98/239 (41%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 293
>gi|240276868|gb|EER40379.1| SET domain-containing protein [Ajellomyces capsulatus H143]
Length = 485
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 56/258 (21%), Positives = 100/258 (38%), Gaps = 40/258 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ A +D+ + F++P +LV+ + N + +LL N+ CL + ++YE Q
Sbjct: 50 IVADDDIGEDEELFAIPQNLVLGFQ----NSRLKDLLDFNERDFDPWLCLIVVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G S W Y + L ++ + W++ EL L+GS +L + E N
Sbjct: 106 GGASTWSRYFQLLPTN-------FDTLMFWTDEELRELSGSA----VLNKIGRSDAEANI 154
Query: 231 LDTVWFMAGSLFQQYP-------YDIPTE----------------AFTFEIFKQAFVAVQ 267
L + + +P +D P A+ F+I K +
Sbjct: 155 LRNILPLVSGNPSHFPPMSGVASFDSPEGKAALLSLAHRMGSLIMAYAFDIEKGENDGRE 214
Query: 268 SCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 327
++ +VPL L A + + A L D + + +P + GE I
Sbjct: 215 GQDGYVTDDEEELSKGMVPLADLLNADADRNNARLFQEDCYLSMRSIKPIRKGEEIFNDY 274
Query: 328 GPQPNSKLLINYGFVDED 345
G P + LL YG+V ++
Sbjct: 275 GELPRADLLRRYGYVTDN 292
>gi|297707870|ref|XP_002830708.1| PREDICTED: SET domain-containing protein 4 [Pongo abelii]
Length = 440
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 98/239 (41%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPQSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L+ + L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRHRECLSAELDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRRHEEVFICYGPHDNQRLFLEYGFVSVHNPH 293
>gi|354548388|emb|CCE45124.1| hypothetical protein CPAR2_701280 [Candida parapsilosis]
Length = 565
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 110/282 (39%), Gaps = 58/282 (20%)
Query: 108 RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE 167
+P ++ A S+ G A+ +P LVVT ++ G + + S L L +YL Y
Sbjct: 28 KPNYFGAISK--SNGKASIQIPRELVVTCDK--GIDLYKDTYKNANHSSL--LKIYLCYS 81
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
+ Q +SF PY+ L + A++SP +WS + A L G+ + E + E
Sbjct: 82 RTQ--QSFHQPYLDTLPSLQ-----AIDSPYIWSAEDKALLKGTNLGNSLKENISSLVEE 134
Query: 228 YNELDTVWFMAGSLFQQYPYDIP------------------TEAFTFEIFKQAFVAVQSC 269
W+ A +L P D+P T+ + F + + +
Sbjct: 135 -------WWNAINLL---PEDVPKPEQHFINLKFYYENKFYTDDDYYSYFNEVDTSNWTS 184
Query: 270 VVHLQKVSL---ARRFALVPLGPP-------------LLAYSSKCKAMLAAVDDAVQLVV 313
+ SL +R F + P LL ++ K K + D
Sbjct: 185 FPNYLWASLVLKSRAFPAYIIDPSLPKNEPMLLPVVDLLNHNPKTKVQWSGTDGGFLFQS 244
Query: 314 DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
D +GE + G + N +LL+ YGF E+NP D +++
Sbjct: 245 DDA-SSGEELFNNYGQKGNEELLLAYGFAIENNPADSAALKI 285
>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
Length = 464
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 88/205 (42%), Gaps = 32/205 (15%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L L+YE+ Q K S+W PYI L + P+ +S ++ + +P ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF---EIFKQAFVAVQSCVVHLQKV 276
R + + +EL + + P +I A F A AV S + V
Sbjct: 157 RCRFLLQFSSEL--------AKLESLPEEIHPFAGQFVDSGALGWAMAAVSSRAFRIHGV 208
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV----------QLVVDRPYKAGESIVVW 326
+ A++ PL+ + A +++ + ++V R + G +I +
Sbjct: 209 TNKLCSAMML---PLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLN 265
Query: 327 CGPQPNSKLLINYGFVDEDNPYDRL 351
GP N LL++YGFV DNP+DR+
Sbjct: 266 YGPLSNDLLLLDYGFVIPDNPHDRI 290
>gi|302754816|ref|XP_002960832.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
gi|300171771|gb|EFJ38371.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
Length = 418
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/158 (26%), Positives = 70/158 (44%), Gaps = 18/158 (11%)
Query: 200 WSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF---TF 256
W +TEL+YL SP + ER E I E+ ++ F L Q D+ + F +
Sbjct: 202 WEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFGKVSL 258
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDDAVQL 311
E K + V S + +++ LV + P+L + +S K + + +
Sbjct: 259 EDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLNYAVV 311
Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
DR Y + I + G N++L ++YGF +NPYD
Sbjct: 312 TADRDYAENDQIWINYGDLSNAELALDYGFTVPENPYD 349
>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
Length = 353
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 37/72 (51%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
++PL S+K + V++ Q++ ++P K GE I G N LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEEGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230
Query: 342 VDEDNPYDRLVV 353
++ DNP+D +
Sbjct: 231 IEFDNPHDHFIT 242
>gi|388452885|ref|NP_001253203.1| SET domain-containing protein 4 [Macaca mulatta]
gi|355560299|gb|EHH16985.1| SET domain-containing protein 4 [Macaca mulatta]
gi|387541878|gb|AFJ71566.1| SET domain-containing protein 4 isoform 1 [Macaca mulatta]
Length = 440
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 98/239 (41%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFVEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 293
>gi|256270722|gb|EEU05884.1| Set7p [Saccharomyces cerevisiae JAY291]
Length = 494
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R +++PL L A +SKC A L + +++V R + E + G PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272
Query: 336 LINYGFVDED-NPYD 349
L YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287
>gi|151942233|gb|EDN60589.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 494
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSINENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R +++PL L A +SKC A L + +++V R + E + G PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272
Query: 336 LINYGFVDED-NPYD 349
L YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287
>gi|358386801|gb|EHK24396.1| hypothetical protein TRIVIDRAFT_168260 [Trichoderma virens Gv29-8]
Length = 370
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 32/137 (23%), Positives = 57/137 (41%), Gaps = 6/137 (4%)
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
LE E +++ E W + F+ D+P E +T+ + + K
Sbjct: 113 LESREHLRKREKEFQGNW----NAFKDAFPDVPYEEYTYAWMIVNTRSFYNETPETLKYP 168
Query: 278 LARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
R AL+P+ CK +A D +V DR YK GE + + N +L+
Sbjct: 169 WEDRLALIPVADLFNHSDDGCKVYYSA--DGYHIVADREYKKGEELFISYSSHSNDYILL 226
Query: 338 NYGFVDEDNPYDRLVVE 354
YGF+ +++ D + ++
Sbjct: 227 EYGFIPDESLDDDVYID 243
>gi|323334121|gb|EGA75505.1| Set7p [Saccharomyces cerevisiae AWRI796]
Length = 515
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R +++PL L A +SKC A L + +++V R + E + G PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272
Query: 336 LINYGFVDED-NPYD 349
L YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287
>gi|22328112|gb|AAH36556.1| SETD4 protein [Homo sapiens]
gi|119630166|gb|EAX09761.1| SET domain containing 4, isoform CRA_d [Homo sapiens]
gi|167773807|gb|ABZ92338.1| SET domain containing 4 [synthetic construct]
Length = 416
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 99/239 (41%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + + +T K S L L +L+ EK G +S W
Sbjct: 44 LQEGQMIISLPESCLLTTDTVIRS-YLGAYITKWKPPPSPLLALCTFLVSEKHAGHRSLW 102
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 103 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 154
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 155 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQRECLSAEPDTCALAPYLDL 210
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 211 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 269
>gi|6320463|ref|NP_010543.1| Rkm4p [Saccharomyces cerevisiae S288c]
gi|46577338|sp|Q12504.1|RKM4_YEAST RecName: Full=Ribosomal N-lysine methyltransferase 4; AltName:
Full=SET domain-containing protein 7
gi|1136212|emb|CAA92714.1| unknown [Saccharomyces cerevisiae]
gi|1226033|emb|CAA94096.1| unknown [Saccharomyces cerevisiae]
gi|51830266|gb|AAU09704.1| YDR257C [Saccharomyces cerevisiae]
gi|190404795|gb|EDV08062.1| hypothetical protein SCRG_00269 [Saccharomyces cerevisiae RM11-1a]
gi|259145494|emb|CAY78758.1| Set7p [Saccharomyces cerevisiae EC1118]
gi|285811273|tpg|DAA12097.1| TPA: Rkm4p [Saccharomyces cerevisiae S288c]
gi|323349272|gb|EGA83501.1| Set7p [Saccharomyces cerevisiae Lalvin QA23]
gi|365766338|gb|EHN07836.1| Set7p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392300372|gb|EIW11463.1| Rkm4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 494
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R +++PL L A +SKC A L + +++V R + E + G PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272
Query: 336 LINYGFVDED-NPYD 349
L YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287
>gi|403271547|ref|XP_003927684.1| PREDICTED: SET domain-containing protein 4 [Saimiri boliviensis
boliviensis]
Length = 440
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 97/239 (40%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQQECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSAHNPH 293
>gi|348552908|ref|XP_003462269.1| PREDICTED: SET domain-containing protein 4-like [Cavia porcellus]
Length = 440
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 54/242 (22%), Positives = 99/242 (40%), Gaps = 28/242 (11%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
L+ G S+P S ++T + V+ + A ++ S L L +L+ EK G +S W
Sbjct: 68 LREGQMIISLPGSCLLTTDTVIRSSLGAYIIKWKPPPSPLLALCTFLVSEKHAGDQSVWK 127
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + + P+ E E+ L P KA+ E+ +++ + +
Sbjct: 128 PYLDILPK-------SYTCPVCL-EPEVVNLLPEPLKAKAEEQRMSVQQFFASSRDFFSS 179
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS- 296
LF++ + F++ A+ V + V+L+ RR + L P A +
Sbjct: 180 LQPLFEEATDSV----FSYSALLWAWCTVNTRAVYLR----TRRRDCLSLEPDTCALAPY 231
Query: 297 ----------KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
+ KA ++ Y+ + + + GP N +LL+ YGFV N
Sbjct: 232 LDLLNHSPNVQVKAAFNEETGCYEIRTASDYRKHKEVFICYGPHDNHRLLLEYGFVSLCN 291
Query: 347 PY 348
P+
Sbjct: 292 PH 293
>gi|302754814|ref|XP_002960831.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
gi|300171770|gb|EFJ38370.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
Length = 486
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 71/162 (43%), Gaps = 18/162 (11%)
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF- 254
S W +TEL+YL SP + ER E I E+ ++ F L Q D+ + F
Sbjct: 313 STFRWEDTELSYLRASPLYGKARERLEMITTEFGQVQND-FCTCVLEQ--ALDVWPQLFG 369
Query: 255 --TFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY-----SSKCKAMLAAVDD 307
+ E K + V S + +++ LV + P+L + +S K + +
Sbjct: 370 KVSLEDLKHVYATVFS-----RSLAIGEDSTLVMI--PMLDFFNHNATSFAKLSFNGLLN 422
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+ DR Y + I + G N++L ++YGF +NPYD
Sbjct: 423 YAVVTADRDYAENDQIWINYGDLSNAELALDYGFTVPENPYD 464
>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
Length = 436
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 69/289 (23%), Positives = 117/289 (40%), Gaps = 48/289 (16%)
Query: 81 LKSWMHK-NGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
L+ W+ + +G+ KV L+ P V A+ L+ G+ F +P S + E V
Sbjct: 29 LRIWLEEEHGVDMSKVDLQRSPLEG------LGVFANRRLEPGETLFMIPKSCCIYPELV 82
Query: 140 LGNETIAELLTTNKLS-------ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
+ + + + KL+ E+ LA +L EK +G +S + P+I L
Sbjct: 83 FEDRQLGK--SMQKLASAAGEGIEVVALATFLAREKMKGSESSYKPFIDVL-------PW 133
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
PLLW++ E+ L G+ EIL E ++ + V G ++Q+ I TE
Sbjct: 134 DSLHPLLWTDEEVDLLEGTYAHREILAFREQVEVATELFEPVLNPKG--WKQFFQTIETE 191
Query: 253 AFTFEIF----KQAFVAVQS----CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA 304
T E F + AF +V S + L R + P L ++
Sbjct: 192 KMTPEEFGFMMRGAFASVLSRAFDSKIGRGDKGLEERVVI----PLLDIFNHGSYGPSIT 247
Query: 305 VDDAVQLVVDRPY-----------KAGESIVVWCGPQPNSKLLINYGFV 342
D A++ ++ + + GE + + G +PN +L YGFV
Sbjct: 248 FDTALERDNEKGFPVRVADKGKSIEEGEELFGFYGDKPNWNMLTTYGFV 296
>gi|349577313|dbj|GAA22482.1| K7_Set7p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 494
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSINENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R +++PL L A +SKC A L + +++V R + E + G PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272
Query: 336 LINYGFVDED-NPYD 349
L YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287
>gi|148671819|gb|EDL03766.1| SET domain containing 4, isoform CRA_a [Mus musculus]
Length = 378
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 101/239 (42%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ ++ + K +S L L +L+ EK G +S W
Sbjct: 6 LQEGQVMISLPESCLLTTDTVI-RSSLGPYIKKWKPPVSPLLALCTFLVSEKHAGCRSLW 64
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E E+ L SP KA+ E+ ++ + +
Sbjct: 65 KSYLDILPK-------SYTCPVCL-EPEVVDLLPSPLKAKAEEQRARVQDLFTSARGFFS 116
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + P D F++ F A+ V + V+L+ + L+ L P L
Sbjct: 117 TLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDL 172
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA ++ + + + + GP N +LL+ YGFV NP+
Sbjct: 173 LNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQRLLLEYGFVSVRNPH 231
>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 470
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 61/248 (24%), Positives = 101/248 (40%), Gaps = 39/248 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLALYLMYEK 168
V A+ D+ + FS+P + V L + +A L + +L E L LM E
Sbjct: 42 VVATGDIDDDEIIFSIPRNAV------LNAQNVAPLPVSRRLFEKMPSWLVLTSILMTEA 95
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA------- 221
Q + S W PY+ L + ++S + WS++ELA L S +I ++
Sbjct: 96 -QMENSKWAPYLAVLPER-------LDSLVFWSDSELAELQASAVVKKIGKKDAEDMFKS 147
Query: 222 ----EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQ 274
+G+K E+ S+ Y +DIP + + A V +
Sbjct: 148 YIAPQGLKHSSTEM---CHKVASVIMAYAFDIPDPSDAPTSGGKGGEAGDDLVSDDGEDE 204
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
K L +++PL L A + + A L ++ +++ +P GE I G P S
Sbjct: 205 KTIL----SMIPLADMLNADADRNNARLICDNEELEMRAIKPISKGEEIFNDYGQLPRSD 260
Query: 335 LLINYGFV 342
LL YG+V
Sbjct: 261 LLRRYGYV 268
>gi|17865444|sp|P58467.1|SETD4_MOUSE RecName: Full=SET domain-containing protein 4
gi|17061796|gb|AAK68849.1| C21orf18 [Mus musculus]
Length = 439
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 101/239 (42%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ ++ + K +S L L +L+ EK G +S W
Sbjct: 67 LQEGQVMISLPESCLLTTDTVI-RSSLGPYIKKWKPPVSPLLALCTFLVSEKHAGCRSLW 125
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E E+ L SP KA+ E+ ++ + +
Sbjct: 126 KSYLDILPK-------SYTCPVCL-EPEVVDLLPSPLKAKAEEQRARVQDLFTSARGFFS 177
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + P D F++ F A+ V + V+L+ + L+ L P L
Sbjct: 178 TLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDL 233
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA ++ + + + + GP N +LL+ YGFV NP+
Sbjct: 234 LNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQRLLLEYGFVSVRNPH 292
>gi|294659704|ref|XP_462118.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
gi|199434171|emb|CAG90604.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
Length = 480
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 55/234 (23%), Positives = 96/234 (41%), Gaps = 31/234 (13%)
Query: 134 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLA 193
+T+E +LG LS L+ Y+ +EK++G SFW P+I L + LA
Sbjct: 135 LTMEEMLG------------LSSFQLLSFYICFEKQRGSSSFWKPFIDMLP-ETSDFDLA 181
Query: 194 VESPLLWS------ETELAYLTGSPTKAEILERAEGIKREYNEL-DTVWFMAGSLFQQYP 246
PL+W EL L + TK + + + + +YN + D + +
Sbjct: 182 ---PLVWKVLKVDHYEELLKLLPNSTKRHMDKIYDRFQTDYNVVKDLISIKLKEISDNER 238
Query: 247 YDIPTEAFT----FEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGPPLLAYSSKCK 299
+ T+A E++ +++ + S +++ Q + A F + P L +S +
Sbjct: 239 SNDLTDAIRHLVPIELYLWSWMCINSRCLYMEIPQSKNAADNFTMAPY-VDFLNHSCDDQ 297
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
L Q+ Y E + + GP N LL YGF +N ++ L V
Sbjct: 298 CGLKIDGTGFQVYTTCSYNPDEQLFLSYGPHSNEFLLCEYGFTLPENKWNDLDV 351
>gi|261190993|ref|XP_002621905.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239590949|gb|EEQ73530.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239613147|gb|EEQ90134.1| SET domain-containing protein [Ajellomyces dermatitidis ER-3]
gi|327354785|gb|EGE83642.1| SET domain-containing protein [Ajellomyces dermatitidis ATCC 18188]
Length = 481
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 62/263 (23%), Positives = 107/263 (40%), Gaps = 50/263 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
+ A ++ + F++P +LV++ + N + +LL + L CL L ++YE Q
Sbjct: 50 IVALSNINEDEELFAIPQNLVLSFQ----NSKLKDLLHISEKDLGPWLCLILVMIYEYLQ 105
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI--LERAEGIKRE- 227
G S W Y + L + ++ + W++ EL L+GS +I + I R+
Sbjct: 106 GGASPWSRYFQVLPTE-------FDTLMFWTDEELRELSGSAVLNKIGKSDAEAAILRDI 158
Query: 228 -------------------YNELD---TVWFMA---GSLFQQYPYDIPTEAFTFEIFKQA 262
Y+ D T+ +A GSL Y +DI E +
Sbjct: 159 FPIVSTNPHLFPPISGLGSYDSPDGRATLLSLAHRMGSLIMAYAFDI-------EKGEDE 211
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
VQ + + L + +VPL L A + + A L D + + +P + GE
Sbjct: 212 EGEVQDGYITDEGEELTK--GMVPLADLLNADADRNNARLFQEDGYLAMKSIKPIRNGEE 269
Query: 323 IVVWCGPQPNSKLLINYGFVDED 345
I G P + LL YG+V ++
Sbjct: 270 IFNDYGELPRADLLRRYGYVTDN 292
>gi|402076002|gb|EJT71425.1| hypothetical protein GGTG_10683 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 497
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 82/202 (40%), Gaps = 27/202 (13%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
L L +++E +G S W PY+ L + E+P+ WS ELA L SP A +
Sbjct: 104 SLILVMIHEHLRGSASPWRPYLDVLPAR-------FETPMFWSAAELAELQASPVVASV- 155
Query: 219 ERAEG-------IKREYNELDTVWFMAGS----------LFQQYPYDIPTEAFTFEIFKQ 261
RAEG I E + ++F AG L + I AF E
Sbjct: 156 GRAEGDAMIRSRILPVIRENEALFFGAGGAAMGDEELVELAHRMGSTIMAYAFDLERDDD 215
Query: 262 AFVAVQSCVVHLQKVSLARR-FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAG 320
A + + R +VP+ +L ++ A + ++A+ + R AG
Sbjct: 216 AMDEDDAEGDGWVEDRDGRTVMGMVPMA-DILNADAEFNAHINHSEEALVAISLRKIPAG 274
Query: 321 ESIVVWCGPQPNSKLLINYGFV 342
E I+ + GP PN +L YG+
Sbjct: 275 EEILNYYGPLPNGQLCRRYGYT 296
>gi|172073177|ref|NP_663457.2| SET domain-containing protein 4 [Mus musculus]
gi|148671824|gb|EDL03771.1| SET domain containing 4, isoform CRA_e [Mus musculus]
Length = 439
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 101/239 (42%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ ++ + K +S L L +L+ EK G +S W
Sbjct: 67 LQEGQVMISLPESCLLTTDTVI-RSSLGPYIKKWKPPVSPLLALCTFLVSEKHAGCRSLW 125
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E E+ L SP KA+ E+ ++ + +
Sbjct: 126 KSYLDILPK-------SYTCPVCL-EPEVVDLLPSPLKAKAEEQRARVQDLFTSARGFFS 177
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + P D F++ F A+ V + V+L+ + L+ L P L
Sbjct: 178 TLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDL 233
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA ++ + + + + GP N +LL+ YGFV NP+
Sbjct: 234 LNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQRLLLEYGFVSVRNPH 292
>gi|148671823|gb|EDL03770.1| SET domain containing 4, isoform CRA_d [Mus musculus]
Length = 397
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 101/239 (42%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ ++ + K +S L L +L+ EK G +S W
Sbjct: 25 LQEGQVMISLPESCLLTTDTVI-RSSLGPYIKKWKPPVSPLLALCTFLVSEKHAGCRSLW 83
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E E+ L SP KA+ E+ ++ + +
Sbjct: 84 KSYLDILPK-------SYTCPVCL-EPEVVDLLPSPLKAKAEEQRARVQDLFTSARGFFS 135
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + P D F++ F A+ V + V+L+ + L+ L P L
Sbjct: 136 TLQPLFAE-PVD---SVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDL 191
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA ++ + + + + GP N +LL+ YGFV NP+
Sbjct: 192 LNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQRLLLEYGFVSVRNPH 250
>gi|346980096|gb|EGY23548.1| SET domain-containing protein RMS1 [Verticillium dahliae VdLs.17]
Length = 469
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 88/217 (40%), Gaps = 25/217 (11%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
L L L ++YE QG S W PY L +Q ++P+ WS+ EL L G+
Sbjct: 91 LDSWGQLILVMLYEVLQGDASRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
AE + + E +++ + ++F PTE + + + + L
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHRMGSTIMAYAFDL 203
Query: 274 QKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
+ +VP+ L A +++ A + + + KA
Sbjct: 204 ENDDENENEEDGWVEDREGRTMLGMVPMADTLNA-NAEFNAHINHGESLEATAIRADIKA 262
Query: 320 GESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLVVEV 355
G+ I+ + GP P S+LL YG+V E + YD VVEV
Sbjct: 263 GDQILNYYGPLPTSELLRRYGYVTPEHSRYD--VVEV 297
>gi|299470104|emb|CBN78133.1| protein N-methyltransferase [Ectocarpus siliculosus]
Length = 482
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 36/74 (48%)
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
R AL+PL + YS M D A+ L V + G+ GP N LL YG
Sbjct: 215 RMALLPLIDSINHYSRMPTHMYWEADGALSLSVGAAFDPGDHAFASYGPVSNDDLLQYYG 274
Query: 341 FVDEDNPYDRLVVE 354
FV++DNP D V+E
Sbjct: 275 FVEQDNPSDTYVLE 288
>gi|254577261|ref|XP_002494617.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
gi|238937506|emb|CAR25684.1| ZYRO0A05654p [Zygosaccharomyces rouxii]
Length = 494
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 60/257 (23%), Positives = 112/257 (43%), Gaps = 34/257 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL--LTTNKLSELACLALYLMYE-KK 169
V AS+D+ + + F +P S V+ + +L + +L L L ++YE K
Sbjct: 42 VLASQDIGSDEVLFEIPRSSVLNVATSQLVRDFPQLKDVFWQELGHWEGLILCMVYEIKV 101
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT--------GSPTKAE----I 217
G++SFW Y++ L + + L + WS +LA L G+ E I
Sbjct: 102 MGQQSFWWNYLQVLPKSQDLNTL-----VYWSADQLAALEPSLVVGRLGADESQEMYRQI 156
Query: 218 LERAEGIKREYN----ELDTVWFM-AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ----S 268
L+ + E+ +L F+ S+ Y +D+ + E + + S
Sbjct: 157 LKYIQNFGPEFQSKIGQLTFEEFVHVASVIMSYSFDVDLKGEDDEDDEDEDEGEEEEGES 216
Query: 269 CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 328
V H + + ++VPL L A + + A L +++++V +P K G+ + + G
Sbjct: 217 NVAHDKYMK-----SMVPLADTLNADTKQFNAHLVYDKESLKMVSVKPIKMGQQVYNFYG 271
Query: 329 PQPNSKLLINYGFVDED 345
PN+++L YG+V+ D
Sbjct: 272 EHPNAEILRRYGYVEWD 288
>gi|189190580|ref|XP_001931629.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187973235|gb|EDU40734.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 372
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 56/242 (23%), Positives = 98/242 (40%), Gaps = 24/242 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ D+QAG+ VP L TL+ V + I+ L N +S A LA YL +K
Sbjct: 33 IIATRDIQAGETILFVPFKLFRTLKHV--PKAISRRLPRN-MSLHALLATYLSLDKTD-- 87
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+F +P D + P LW EL P ++++ KR+
Sbjct: 88 -TFAIPNKTLPDLSSFEAGM----PFLWP-AELHPFLPKPALDLLMKQQRSFKRD----- 136
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
W + + D A+ + ++F ++++ R A++P+
Sbjct: 137 --WDIVSKAYSNISQDQYLHAWLL-VNTRSFYCTTPI---MERLPHDDRLAILPVADLFN 190
Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
C+A A+ + + DR Y+ GE + + G LL YGFV +N +D +
Sbjct: 191 HADVGCEARFAS--ENYSFIADRDYRTGEELHISYGSHSTDFLLTEYGFVPTENCWDVVC 248
Query: 353 VE 354
++
Sbjct: 249 LD 250
>gi|355747383|gb|EHH51880.1| SET domain-containing protein 4 [Macaca fascicularis]
Length = 440
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 98/239 (41%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ + + V+L Q+ L+ L P L
Sbjct: 179 SLQPLFVEAVDSI----FSYSALLWAWCTINTRAVYLRPRQRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 293
>gi|323355591|gb|EGA87411.1| Set7p [Saccharomyces cerevisiae VL3]
Length = 515
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDNELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R +++PL L A +SKC A L + +++V R + E + G PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272
Query: 336 LINYGFVDED-NPYD 349
L YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287
>gi|115391295|ref|XP_001213152.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114194076|gb|EAU35776.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 691
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 87/193 (45%), Gaps = 15/193 (7%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
+L+ + +G + FW PYI L + G L +PL + +L +L G+ + A E+
Sbjct: 115 FFLIGQYLRGSEGFWYPYICTLPQP---GDLT--TPLYYEGADLRWLEGT-SLAPAREQK 168
Query: 222 EGIKRE-----YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
E + +E + EL F ++Y +++ A T + + V + VV ++
Sbjct: 169 ESLLKEKYQSTFEELRKSGFGDA---EKYTWELYLWASTIFVSRAFSAKVLAGVVPHAEL 225
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLL 336
L+P +L + K A + V VV AGE + GP+ N +L+
Sbjct: 226 PEENVSVLLPF-IDVLNHRPLAKVEWRAGERDVLFVVLEHVAAGEEVANNYGPRNNEQLM 284
Query: 337 INYGFVDEDNPYD 349
+NYGF ++NP D
Sbjct: 285 MNYGFCLQNNPCD 297
>gi|156374449|ref|XP_001629819.1| predicted protein [Nematostella vectensis]
gi|156216828|gb|EDO37756.1| predicted protein [Nematostella vectensis]
Length = 281
Score = 46.6 bits (109), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 64/278 (23%), Positives = 116/278 (41%), Gaps = 30/278 (10%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHY-VAASEDLQAGDAAFSVPNSLVVT-----LE 137
W H N L L K S +K Y + A ED+ + F VP L++ +
Sbjct: 22 WCHDNDLK-----LNNKVSSMQKGSCHRYGMVAMEDISPDECLFKVPRGLLLEPKTCGIS 76
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
++L + I +L+ ++ L L LMYE S W PY+ + G ++ P
Sbjct: 77 KILTGKVIQNMLSQHE--GWVPLLLALMYEYTN-PTSLWKPYMDIV-----PGIDILDQP 128
Query: 198 LLW-SETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
+ W ET + L G+ + ++ + + I+R+Y + +A + +++ + +
Sbjct: 129 MFWPDETRQSLLQGTGFEDDVEDDKQRIERQY------FTVAVPIMKKFKKFFDLKRHSL 182
Query: 257 EIFKQ--AFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD 314
++K AF+ S +VP+ +L + S A L ++ + +V
Sbjct: 183 SLYKHMAAFIMAYSFTEDSPSFHGNNVPVMVPMAD-ILNHHSNNNARLEFGEEELSMVST 241
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRL 351
+ G + G N LL +YGFV+ DNP D +
Sbjct: 242 QHILKGGEVFNTYGQLANCHLLQSYGFVEGPDNPNDTV 279
>gi|323309789|gb|EGA62995.1| Set7p [Saccharomyces cerevisiae FostersO]
Length = 417
Score = 46.6 bits (109), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 26/255 (10%)
Query: 113 VAASEDLQAGDAAFSVPNS--LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKK- 169
V A++ ++ + F +P S L VT +++ + + N+ L + ++YE +
Sbjct: 41 VVATQKIKKDETLFKIPRSSVLSVTTSQLIKDYPSLKDKFLNETGSWEGLIICILYEMEV 100
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA 221
++S W PY + ++ L + W + EL L G E+ ER
Sbjct: 101 LQERSRWAPYFKVWNKPSDMNAL-----IFWDDXELQLLKPSLVLERIGKKEAKEMHERI 155
Query: 222 -EGIKREYNELDTVWFMAGSL-FQQYPYD---IPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
+ IK+ E V A S F + Y I + +F E+ + + +++
Sbjct: 156 IKSIKQIGGEFSRV---ATSFEFDNFAYIASIILSYSFDLEMQDSSVNENEEEETSEEEL 212
Query: 277 SLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R +++PL L A +SKC A L + +++V R + E + G PNS+L
Sbjct: 213 ENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSEL 272
Query: 336 LINYGFVDED-NPYD 349
L YG+V+ D + YD
Sbjct: 273 LRRYGYVEWDGSKYD 287
>gi|452982650|gb|EME82409.1| hypothetical protein MYCFIDRAFT_40308 [Pseudocercospora fijiensis
CIRAD86]
Length = 449
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 99/250 (39%), Gaps = 36/250 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A+ DL + + F +P + ++T E + I + LT LS L L +++E G
Sbjct: 42 VVATSDLTSDEEIFRIPRTSILTTETTDLPQEILQQLTDPWLS----LILAMIFEYLLGT 97
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERAEGIKREY--- 228
S + PY+ L + + + W++ EL YL GS ++I E A+ E
Sbjct: 98 NSRFKPYLDILPE-------SFNTLMFWTDNELQYLQGSAILSKIGKEEADNTFSEQLLP 150
Query: 229 ----------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 272
+L + GS+ Y +D+ T +
Sbjct: 151 IITKNPEIFKIGTCNNQDLLALCHRMGSIIMSYAFDLDPPPTT--TTSSSEEWESDSDSE 208
Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 332
+K+S AL+PL L A + L D+ + +P AGE ++ GP P
Sbjct: 209 NEKISPK---ALIPLADMLNANGDLTNSKLFFSSDSFIMKTLQPVAAGEELLNDFGPLPP 265
Query: 333 SKLLINYGFV 342
+ LL YGFV
Sbjct: 266 ADLLRRYGFV 275
>gi|453087416|gb|EMF15457.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 454
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 52/231 (22%), Positives = 100/231 (43%), Gaps = 23/231 (9%)
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIREL 183
++P+ L+ T++R + + LL++ + LS LA Y+++ + + K + P
Sbjct: 41 LTIPHGLLWTVKRAYADPVLGPLLSSTRPPLSVDDTLATYILFIRAR-KSGYDGP----- 94
Query: 184 DRQRGRGQL--AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
Q L + S + +++ EL GS A I+ +Y +L L
Sbjct: 95 --QSHVAALPASYSSSIFFADAELEICAGSSLYTTTKHLARQIEVDYKDL------VARL 146
Query: 242 FQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYS---SKC 298
F ++ P++ FT + +K A V S + K+ L+ +L +S +C
Sbjct: 147 FGRHRDVFPSDKFTIDDYKWALCTVWSRAMDF-KLRDGESIRLMAPFADMLNHSPDVGQC 205
Query: 299 KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+ + ++ + Y+ G+ + + GP PN++L YGFV NP D
Sbjct: 206 HVYDPQSGN-LSILAGKSYEPGDQVFINYGPIPNNRLSRLYGFVVPGNPND 255
>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
Length = 456
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 54/240 (22%), Positives = 97/240 (40%), Gaps = 27/240 (11%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+S++ T+E + + L + + LS L YL++ +
Sbjct: 34 FKEGEKILTIPSSILWTVEHAYADPLLGPALCSVQPPLSPEDTLTTYLLFVRS------- 86
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S + ++E EL G+ + + I+ ++ L
Sbjct: 87 ----RESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL 142
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
LF Q+ P + F+ E +K A V S + Q L P +
Sbjct: 143 ------VMQLFIQHRDLFPLDKFSIEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-M 195
Query: 292 LAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L +SS+ K + + ++ + Y+ G+ + + G PNS+LL YGFV NP D
Sbjct: 196 LNHSSEAKPCHVYDVSSGNLSVLAGKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPND 255
>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 465
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 54/240 (22%), Positives = 97/240 (40%), Gaps = 27/240 (11%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+S++ T+E + + L + + LS L YL++ +
Sbjct: 43 FKEGEKILTIPSSILWTVEHAYADPLLGPALCSVQPPLSPEDTLTTYLLFVRS------- 95
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S + ++E EL G+ + + I+ ++ L
Sbjct: 96 ----RESGYDGQRSHVAALPTSYSSSIFFTEEELEVCAGTSLYTITKQLEQSIEDDHRAL 151
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
LF Q+ P + F+ E +K A V S + Q L P +
Sbjct: 152 ------VMQLFIQHRDLFPLDKFSIEDYKWALCTVWSRRMDFQLRDGKSMRLLAPFAD-M 204
Query: 292 LAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L +SS+ K + + ++ + Y+ G+ + + G PNS+LL YGFV NP D
Sbjct: 205 LNHSSEAKPCHVYDVSSGNLSVLAGKDYEPGDQVFINYGSVPNSRLLRLYGFVIPGNPND 264
>gi|380480025|emb|CCF42668.1| SET domain-containing protein RMS1 [Colletotrichum higginsianum]
Length = 318
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
+VP+ +L ++ A + +D + +V RP KAGE I+ + GP PNS+LL YG+
Sbjct: 71 LGMVPMAD-ILNADAEFNAHVNHGEDDLSVVALRPIKAGEEILNYYGPHPNSELLRRYGY 129
Query: 342 V 342
V
Sbjct: 130 V 130
>gi|367009050|ref|XP_003679026.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
gi|359746683|emb|CCE89815.1| hypothetical protein TDEL_0A04830 [Torulaspora delbrueckii]
Length = 484
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+++PL L A +SKC A L +++++ +P GE + G PNS+LL YG+V
Sbjct: 213 SMIPLADTLNANTSKCNANLVYDIESLKMCATKPIGMGEQVYNIYGDHPNSELLRRYGYV 272
Query: 343 D-EDNPYD 349
+ E + YD
Sbjct: 273 EWEGSKYD 280
>gi|367013376|ref|XP_003681188.1| hypothetical protein TDEL_0D03930 [Torulaspora delbrueckii]
gi|359748848|emb|CCE91977.1| hypothetical protein TDEL_0D03930 [Torulaspora delbrueckii]
Length = 484
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 48/100 (48%), Gaps = 6/100 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL+ G +P S + + N +IA LL +++ + L + +YE K
Sbjct: 40 VFAKQDLEEGTVLLKLPKSCLFSA----SNSSIANLLVDDEIDGVLALNIAFLYETTVFK 95
Query: 173 -KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
KS W PY++ + R L V P WSETE L GS
Sbjct: 96 EKSHWFPYLKSI-RIYNDDGLLVLPPSHWSETEKLLLKGS 134
>gi|211826273|gb|AAH09054.2| SETD3 protein [Homo sapiens]
Length = 228
Score = 46.6 bits (109), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 100/222 (45%), Gaps = 29/222 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 6 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 59
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 60 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 114
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y V Q +P+
Sbjct: 115 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYFYKV-------IQTHPHAN 163
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 164 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 205
>gi|330797452|ref|XP_003286774.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
gi|325083217|gb|EGC36675.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
Length = 1335
Score = 46.2 bits (108), Expect = 0.021, Method: Composition-based stats.
Identities = 54/228 (23%), Positives = 93/228 (40%), Gaps = 19/228 (8%)
Query: 129 PNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
P ++ ++ N + + L++ L L+++YE K+ +FW P+ L
Sbjct: 849 PRKYLINVDVAKSNPILGPIFEELHLNDETILFLFVIYE-KENPNTFWRPFYDTLPSY-- 905
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYD 248
+ + +S TEL L G+ AE L +K++ F L QYP
Sbjct: 906 -----FTTSIHYSSTELLELEGTNLFAETL----AVKQQLQAFRDYLF--PELSNQYPDI 954
Query: 249 IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVD-- 306
P F++E F A + S + L K+ + LVP+ ++ + + + D
Sbjct: 955 FPESVFSWENFLWARSLLDSRAIQL-KIDGKIKSCLVPMA-DMINHHTNAQISERHFDQD 1012
Query: 307 -DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+ ++V A I + G NS L + YGFV +N YD V
Sbjct: 1013 SNCFRMVSSCNIPANNQIFLHYGALQNSDLALYYGFVIPNNIYDSFHV 1060
>gi|44890428|gb|AAH66931.1| SETD3 protein [Homo sapiens]
Length = 292
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 102/222 (45%), Gaps = 29/222 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273
>gi|145549620|ref|XP_001460489.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428319|emb|CAK93092.1| unnamed protein product [Paramecium tetraurelia]
Length = 482
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 57/259 (22%), Positives = 105/259 (40%), Gaps = 29/259 (11%)
Query: 103 HNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV----LGNETIAELLTTNKLSELA 158
+E HR + A++ ++ G+ +P + ++LE V L N + ++ K + +
Sbjct: 63 QSEGHRTLR---ATQFIRQGEWVLFIPRTQYLSLEEVKKSCLINRKMIQI--NYKPNNIQ 117
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
+ + ++ + K SFW PYI L + P + + A L GSPT ++
Sbjct: 118 TYFVNHLLQENRRKYSFWKPYIDVLPKD------VSGFPTYFDAEQDALLKGSPTLFTVI 171
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
+ + K EY L A FQ+Y Y T++ F + + S +Q
Sbjct: 172 NQRKVFKEEYENLKE----AVKEFQKYGY-------TYDDFIKFRILTISRSFTVQIGEK 220
Query: 279 ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSKLL 336
++ LVPL + + + DA + R + GE + G N
Sbjct: 221 EQQQLLVPLAD-FINHDNNGFLKYGYSKDADGFFMQAVRNIQKGEELFYNYGQWSNKYFF 279
Query: 337 INYGFVDEDNPYDRLVVEV 355
+NYGF NP ++ +++
Sbjct: 280 MNYGFASLTNPMNQFDLDI 298
>gi|410082051|ref|XP_003958604.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
gi|372465193|emb|CCF59469.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
Length = 508
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 55/250 (22%), Positives = 101/250 (40%), Gaps = 34/250 (13%)
Query: 113 VAASEDLQAGDAAFSVP-NSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE-KKQ 170
V A +D+ G+ F +P +S++ L L ++ T + L L L+YE K +
Sbjct: 41 VIAVKDIAEGEVLFEIPRDSILNVLTSSLSSDFSDLEETLQSIGSWEGLILCLLYEWKGK 100
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
+KS W Y L A+ + W+E EL +L S I +++ K Y++
Sbjct: 101 KEKSKWWKYFNVLPSSN-----AMNGLMYWNEQELEHLRPSLVLDRIGKKSA--KNMYHK 153
Query: 231 LDTV-----------------WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
+ T+ + A S+ Y +D+ Q
Sbjct: 154 VLTLVKESKFPEVLCNVEWEDFVYAASVIMAYSFDVENGE------SQTLNEEDDDQDEE 207
Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
+ + +++PL L + + +C A L D +++ +P K GE + G PN+
Sbjct: 208 ENTGYIK--SMIPLADTLNSDTHQCNANLMYDDKFLKMYAIKPIKKGEQVFNIYGNHPNA 265
Query: 334 KLLINYGFVD 343
++L YG+V+
Sbjct: 266 EILRRYGYVE 275
>gi|320170264|gb|EFW47163.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 938
Score = 46.2 bits (108), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 66/291 (22%), Positives = 117/291 (40%), Gaps = 41/291 (14%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
S+++ + +WM NG V + + + + A L+ D ++P
Sbjct: 32 SQEQTSMSRFLTWMRGNGAEFGHVDVSQDWHQGRR------LIADNPLKPDDRIAAIPTL 85
Query: 132 LVVTLERVLGNETIAELLTT---NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L ++L+ L + TT S+ LAL+L+ EK G +S W PYI L ++
Sbjct: 86 LTISLDTAL-QVGLPRAFTTIWHESGSQDDLLALFLLREKALGARSAWAPYIEILPKK-- 142
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYD 248
+ + L +++ ELA L + E + ++ +EL F+A +Q+ D
Sbjct: 143 -----LSNLLFFNDGELAQLQNE-------QLVEQVSQQKSELQGR-FLA---LRQHEAD 186
Query: 249 IPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAA---- 304
I E+ F+ ++ V+ ARR+ L+P L + + +
Sbjct: 187 I--FGGKAELVLSDFLWARAIVLSRAFTIHARRY-LIPFADLLNHRFHPTRGLDESGEFF 243
Query: 305 ------VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+ L DRP E + G N++ L YGFV E NP++
Sbjct: 244 YRHHDFQNGMFLLTCDRPVNENEEVEDDYGNLSNAQFLQLYGFVPESNPHE 294
>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 487
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 60/258 (23%), Positives = 105/258 (40%), Gaps = 44/258 (17%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ F++P ++ T+E + + L + + LS LA+Y+++ +
Sbjct: 52 FKEGERIFTIPADVLWTVEHAYADSLLGPALRSARPPLSVDDTLAMYILFVRS------- 104
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R LA S + +++ EL GS A I+ +Y L
Sbjct: 105 ----RESGYDGPRSHLATLPKSYSSSIFFTDDELEVCAGSSLYALTKRLGRCIEDDYRAL 160
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRFALVP 286
L Q+ P + FT E +K A V S + + + L FA
Sbjct: 161 VV------RLLVQHQDLFPLDKFTIEDYKWALCTVWSRAMDFVLPGGKSIRLMAPFA--- 211
Query: 287 LGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGES-----IVVWCGPQPNSKLLINY 339
+L +SS+ + A + + ++ + Y+AG+ + ++ G PN++LL Y
Sbjct: 212 ---DMLNHSSEVRQCHAYDPLSGNLTILAGKDYEAGDQGVFFQVFIYYGSIPNNRLLRLY 268
Query: 340 GFVDEDNPYDR--LVVEV 355
GFV NP D LV+E
Sbjct: 269 GFVMPGNPNDSYDLVLET 286
>gi|40068483|ref|NP_954574.1| histone-lysine N-methyltransferase setd3 isoform b [Homo sapiens]
gi|28071060|emb|CAD61911.1| unnamed protein product [Homo sapiens]
gi|111309143|gb|AAI20968.1| SET domain containing 3 [Homo sapiens]
gi|118341365|gb|AAI27625.1| SET domain containing 3 [Homo sapiens]
gi|118341638|gb|AAI27626.1| SET domain containing 3 [Homo sapiens]
gi|119602071|gb|EAW81665.1| SET domain containing 3, isoform CRA_b [Homo sapiens]
gi|156138972|gb|AAI48252.1| SET domain containing 3 [Homo sapiens]
Length = 296
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 102/222 (45%), Gaps = 29/222 (13%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ D++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY-- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHAN 231
Query: 248 DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 232 KLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273
>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
Length = 330
Score = 46.2 bits (108), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 36/72 (50%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
++PL S+K + V+ Q++ ++P K GE I G N LL+ +GF
Sbjct: 171 LCVIPLADQFNHSSTKWHTRVREVEGGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGF 230
Query: 342 VDEDNPYDRLVV 353
++ DNP+D +
Sbjct: 231 IEFDNPHDHFIT 242
>gi|111306423|gb|AAI20969.1| SETD3 protein [Homo sapiens]
Length = 284
Score = 46.2 bits (108), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 102/223 (45%), Gaps = 29/223 (13%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K+E+ DL W +NG V E + E+ + A+ D++A + VP
Sbjct: 73 GKREDYFPDLMKWASENG---ASVEGFEMVNFKEEGFGLR---ATRDIKAEELFLWVPRK 126
Query: 132 LVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRG 188
L++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 127 LLMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSE-- 182
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPY- 247
++PL + E E+ YL + ++ + + R+Y +F + Q +P+
Sbjct: 183 -----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY-----AYFY--KVIQTHPHA 230
Query: 248 -DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPL 287
+P ++FT+E ++ A +V + + +R AL+PL
Sbjct: 231 NKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 273
>gi|308812602|ref|XP_003083608.1| unnamed protein product [Ostreococcus tauri]
gi|116055489|emb|CAL58157.1| unnamed protein product [Ostreococcus tauri]
Length = 427
Score = 46.2 bits (108), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 50/207 (24%), Positives = 87/207 (42%), Gaps = 27/207 (13%)
Query: 151 TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
+N+ +E A LA+ L E+++G S + P++ L+ T
Sbjct: 79 SNESAEWA-LAIELAMEREKGVASRYRPFV----------------DSLYERTPANSTVV 121
Query: 211 SPTKAEIL--ERAEGIKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
S E L AE + R Y+E D V W A F+ +P + FT F++A V
Sbjct: 122 SKKARERLAEHHAEKVMRRYDE-DIVRGWNAAVRTFRTFPTIFRAQDFTRSKFEEALAIV 180
Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
++ + + R LVPL L+ +S + VDD + VD ++AG+ +
Sbjct: 181 RANSFEVTRADGVRERVLVPLAHLLVHDTSSSVPCVKMVDDTFVINVD-EHRAGDELSCS 239
Query: 327 CGPQPNSKLLINYG----FVDEDNPYD 349
G +++ +G + +E+N D
Sbjct: 240 HGEYSDAETFARFGTSAVYSEENNARD 266
>gi|389629438|ref|XP_003712372.1| hypothetical protein MGG_04887 [Magnaporthe oryzae 70-15]
gi|351644704|gb|EHA52565.1| hypothetical protein MGG_04887 [Magnaporthe oryzae 70-15]
Length = 499
Score = 46.2 bits (108), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 62/278 (22%), Positives = 108/278 (38%), Gaps = 46/278 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSE---------------- 156
+ A+ D++ F++P ++ LE + + + N+ SE
Sbjct: 43 IVATADIEPDTVLFTIPRKDIICLENSQLFKEVDSSIFVNRGSEPLDEDDEDQDGGSSHQ 102
Query: 157 -----LACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
L L +MYE + S W PY+ L + E+P+ W+ E+A L S
Sbjct: 103 RGQNSWTTLILVMMYEHLRRDPSPWRPYLDVLPTE-------FETPMFWTSEEIAELQAS 155
Query: 212 PTKAEILERAEG-----------IKREYNELDTVWFMAGSLFQQYPYDIPTE--AFTFEI 258
P A I R E IK V Q + + + A++F++
Sbjct: 156 PVVASI-GREEADVMIRTKILPVIKENAAVFGGVTDADDERLVQLAHQMGSTIMAYSFDL 214
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYK 318
+ + ++ +VP+ +L ++ A + ++A+ R
Sbjct: 215 EGEGDEEDEDEDGWVEDREGRTMMGMVPMA-DILNADAEFNAHINHSEEALVATALRKIP 273
Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVEV 355
AGE I+ + GP PN +LL YG+V E + YD VVE+
Sbjct: 274 AGEEILNYYGPLPNGQLLRRYGYVTEKHSRYD--VVEL 309
>gi|315042966|ref|XP_003170859.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311344648|gb|EFR03851.1| SET domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 693
Score = 45.8 bits (107), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 84/194 (43%), Gaps = 11/194 (5%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LAL++ +++ + K S W PY+ L R + S L + +L +L G+
Sbjct: 108 LALFVAHQQLKEKGSHWWPYLATLPRAS-----ELTSALFYHGDDLEWLQGTNLYQTHQA 162
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQ-YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
+K EY+ ++ G L + Y +D+ A+T I +AF + + V+L +
Sbjct: 163 YMNAVKEEYDSAISILRDEGCLAAELYSWDLFCWAYTV-IASRAFTS-RVLSVYLSRNPA 220
Query: 279 ARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
++ + PL+ S+ K A + L V P + E I GP N +L
Sbjct: 221 LKQDEEFQILLPLVDSSNHKPLAKIEWRAEAAEIGLKVVEPIVSEEEIHNNYGPLNNQQL 280
Query: 336 LINYGFVDEDNPYD 349
+ YGF DNP D
Sbjct: 281 MTTYGFCIVDNPCD 294
>gi|260822399|ref|XP_002606589.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
gi|229291933|gb|EEN62599.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
Length = 459
Score = 45.8 bits (107), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 66/262 (25%), Positives = 108/262 (41%), Gaps = 41/262 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLE-----RVLGNETIAELLTTNKLSELACLALYLMYE 167
+ A E+L+ G+ F V S V++ E +L ET + + S L LMYE
Sbjct: 53 MVAQEELEEGECLFKVDKSAVLSTETTEIAHLLKEETSLHGDSLHGDSGWVPQILALMYE 112
Query: 168 KKQGKKSFWLPYIR------ELDRQRGRGQLAVESPLLWSETELA---YLTGSPTKAEIL 218
S W PY++ +LD+ P+ W+E E+ TG P +
Sbjct: 113 YT-NPNSRWRPYLQLVPDFSQLDQ-----------PMFWTEDEIERDLCNTGIPEASS-- 158
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKV 276
+K EY L A +++ + E +FE++K+ AF+ S +
Sbjct: 159 SDLTKMKLEYTSL------ALPFIRKHRHIFSEEVHSFELYKRMVAFIMAYSFFEPVNGR 212
Query: 277 SLARRFALVPLGPPL---LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
+ +PL P+ L + +K A L D +++V R AGE + G N
Sbjct: 213 EDEGGKSSLPLMVPMADILNHVAKNNAQLEWDADCLRMVTTRTVAAGEEVFNTFGQLANW 272
Query: 334 KLLINYGFVDE--DNPYDRLVV 353
+LL YGF + +N YD + +
Sbjct: 273 QLLHMYGFAEAWPENIYDTVDI 294
>gi|384248108|gb|EIE21593.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 229
Score = 45.8 bits (107), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 9/110 (8%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
D W+ K G + E + E R V A +++ G +VP L+++
Sbjct: 5 DFAEWLQKGGALIADI---EPGAVAEGFRG---VIAKANIEEGTLLVAVPERLLLSAHSA 58
Query: 140 LGNETIAE-LLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQ 186
+ AE LL TNK + LA +L++E +G++SFW PY+ L RQ
Sbjct: 59 KKDRAFAEALLATNKQSIGSSQVLAAHLLHEASKGQESFWRPYLATLPRQ 108
>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
Length = 442
Score = 45.8 bits (107), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 41/169 (24%), Positives = 65/169 (38%), Gaps = 28/169 (16%)
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM---------AGSLFQQ 244
+ +PL WS+ E L GS YN LD W M A L Q
Sbjct: 161 LTTPLFWSDKEREELQGSNL--------------YNMLDG-WTMNVEKLHRSTARVLGQH 205
Query: 245 YPY-DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA---RRFALVPLGPPLLAYSSKCKA 300
+ D+P ++ + FK A+ + + + S R+ + P+ K
Sbjct: 206 NVFPDLPKAIYSLKEFKWAYATIFARAFDVDGKSFGFSGRQRIMAPMADLFNHGDVKTSY 265
Query: 301 MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
A +L + + GE I + + N++ L+ YGFV E NP+D
Sbjct: 266 TFNAASGHFELFTQQFFSRGEQIFMNYDSKNNAEFLLQYGFVIESNPHD 314
>gi|296808191|ref|XP_002844434.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238843917|gb|EEQ33579.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 684
Score = 45.8 bits (107), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 51/194 (26%), Positives = 84/194 (43%), Gaps = 11/194 (5%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++ +++ + K S W PY+ L R G+L S L + +L +L +
Sbjct: 111 LAFFVAHQQLKAKDSHWWPYLATLPRA---GELT--SALFYQGEDLEWLQDTNFYHARQM 165
Query: 220 RAEGIKREYNELDTVWFMAGS-LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
+ +K EY+ ++ G L + Y ++I A+T I +AF + + ++ K
Sbjct: 166 YHDAVKTEYDAAISILRKEGCPLVESYSWNIFCWAYTV-IASRAFTS-RVLEAYISKNPA 223
Query: 279 ARRFALVPLGPPLLAYSSK---CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
R+ + PL+ S+ K A + L V P A E I GP N +L
Sbjct: 224 LRQDDEFQIMLPLVDSSNHRPLAKIEWRAEATRIGLKVIDPVSAKEEIHNNYGPLNNQQL 283
Query: 336 LINYGFVDEDNPYD 349
+ YGF DNP D
Sbjct: 284 MATYGFCIVDNPCD 297
>gi|332229557|ref|XP_003263953.1| PREDICTED: SET domain-containing protein 4 [Nomascus leucogenys]
Length = 440
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 58/239 (24%), Positives = 97/239 (40%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVI-RSYLGAYITKWKPPPSPLLALCTFLVSEKHAGDRSLW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKSLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L Q L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQWECLSAEPDTCALAPYLDL 234
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + KA + ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 235 LNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPH 293
>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 441
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 26/99 (26%), Positives = 51/99 (51%), Gaps = 7/99 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A++ LQ G+ +P L ++ + ++ L N+L + +AL+LM E+ +
Sbjct: 39 VYAAKSLQKGEITMEIPFHLTISKVTAMQSDLRQILQDKNELDQDEIVALFLMIERFKSS 98
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
SF+ P+I+ L Q + P+ W++++ A L G+
Sbjct: 99 DSFFEPFIQSLPSQ-------FDLPIFWNDSDFAELEGT 130
>gi|303275314|ref|XP_003056953.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226461305|gb|EEH58598.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 701
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 59/136 (43%), Gaps = 19/136 (13%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNE 143
WM + G+ V + P R A+ D+ GD SVP ++T E + +
Sbjct: 37 WMKRRGIVLNGVGVGRFP------RTGRGCVATRDIAPGDVLVSVPEDAIITAETSVAAD 90
Query: 144 TIAEL-LTTNKLS-------ELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ + L +++S E L L ++ E +G +S + PY+ L R A
Sbjct: 91 ALTKFGLGGDEMSAEASPRLEREALVLAVLAEMSRGHESDFAPYLAALPTLR-----ATH 145
Query: 196 SPLLWSETELAYLTGS 211
SPL WS ELA L G+
Sbjct: 146 SPLAWSGAELAELEGT 161
>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 481
Score = 45.4 bits (106), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 40/155 (25%), Positives = 67/155 (43%), Gaps = 11/155 (7%)
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
+ +S+ E+ GS + I +Y +L T M ++ P F E
Sbjct: 115 IFFSDEEMQVCKGSSLYTLTTQLRGRIGDDYKKLLTRVLM------RHRNLFPLSKFGIE 168
Query: 258 IFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS---KCKAMLAAVDDAVQLVVD 314
+K A V S + VS L+ +L +SS +C A D + ++
Sbjct: 169 HYKWALCTVWSRGMDF-TVSEGNSLRLLAPFADMLNHSSDVKQCHAYDPTTGD-LSILAS 226
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+ Y G+ + ++ GP PN++LL YGFV +NP+D
Sbjct: 227 KDYNVGDQVFIYYGPVPNNRLLRLYGFVLPENPHD 261
>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
Length = 464
Score = 45.4 bits (106), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 47/197 (23%), Positives = 91/197 (46%), Gaps = 16/197 (8%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L L L+YE+ Q K S+W PYI L + P+ +S ++ + +P ++ +
Sbjct: 105 LGLKLLYERAQ-KGSYWWPYISMLPH-------SFTLPIFFSGVDIESIDYAPVTHQVKK 156
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH--LQKVS 277
R + + EL + + + + + A + + A V+ ++ +H K+
Sbjct: 157 RCRFLLQFSAELAKLESLPEEVHPFAGQSVDSGALGWAM---AAVSSRAFRIHGVTNKLC 213
Query: 278 LARRFALVPLGPPLLAYSSKCKAMLA--AVDDA-VQLVVDRPYKAGESIVVWCGPQPNSK 334
A L+ + ++ + L+ A D + +++V R + G +I + GP N
Sbjct: 214 SAMMLPLIDMCNHSFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNYGPLSNDL 273
Query: 335 LLINYGFVDEDNPYDRL 351
LL++YGFV DNP+DR+
Sbjct: 274 LLLDYGFVIPDNPHDRI 290
>gi|302410103|ref|XP_003002885.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
gi|261357909|gb|EEY20337.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
Length = 469
Score = 45.4 bits (106), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 51/217 (23%), Positives = 88/217 (40%), Gaps = 25/217 (11%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
L L L ++YE QG S W PY L +Q ++P+ WS+ EL L G+
Sbjct: 91 LDSWGQLILVMLYEVLQGDSSRWKPYFDILPQQ-------FDTPIFWSDGELLELQGTSL 143
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL 273
AE + + E +++ + ++F PTE + + + + L
Sbjct: 144 TAEKIGKVESDAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAHRMGSTIMAYAFDL 203
Query: 274 QKVSLARR--------------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
+ +VP+ L A +++ A + + + +A
Sbjct: 204 ENDDENENEEDGWVEDREGRTMLGMVPMADTLNA-NAEFNAHINHGESLEATAIRADIRA 262
Query: 320 GESIVVWCGPQPNSKLLINYGFVD-EDNPYDRLVVEV 355
G+ ++ + GP P S+LL YG+V E + YD VVEV
Sbjct: 263 GDQVLNYYGPLPTSELLRRYGYVTPEHSRYD--VVEV 297
>gi|388579878|gb|EIM20197.1| RuBisCO-cytochrome methylase [Wallemia sebi CBS 633.66]
Length = 447
Score = 45.4 bits (106), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 61/280 (21%), Positives = 112/280 (40%), Gaps = 31/280 (11%)
Query: 84 WMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT-----LER 138
W NG K I+ + + R + VA D++A + F++P +V++ +
Sbjct: 10 WFTTNGGEFSKDIVAIGENVDGMGRGLVAVA---DIKAQTSLFTIPRDIVLSTRTSSFKE 66
Query: 139 VLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL 198
+G + +L N + L + + +E QG S W Y + L +Q S +
Sbjct: 67 KVGQDVYKQLENDN-IGSWTPLIMAMCWEYNQGGSSKWDAYFKILPKQ-------FTSLM 118
Query: 199 LWSETELAYLTGSPTKAEI--------LERAEGI-KREYNELDTVWFMAGSLFQQYPYDI 249
WS+ EL+ L G+ +I ER I K+ N + LF++ I
Sbjct: 119 FWSKEELSLLKGTTVVDKIGLEDIENEFERVRDIVKQNENVFGDIANYTLDLFKRMGSLI 178
Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQKV------SLARRFALVPLGPPLLAYSSKCKAMLA 303
+ +FT E +K + + + A+VP+ L + + A
Sbjct: 179 LSRSFTVEEWKTEEEREKEEEEEEDEDEEIDLRTSVDDVAMVPMADILNSRTDSVNAHTE 238
Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
++ ++++ + KAG+ I PN+ L+ YG VD
Sbjct: 239 YEENCLRMISLQDIKAGDQIFNTYNDPPNADLIRRYGHVD 278
>gi|452823683|gb|EME30691.1| hypothetical protein Gasu_19370 [Galdieria sulphuraria]
Length = 370
Score = 45.4 bits (106), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 60/255 (23%), Positives = 104/255 (40%), Gaps = 47/255 (18%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSE--------LACLALYLMY 166
A + + G +P+ L++T GN+ L N + + ++++L +
Sbjct: 38 AKKPITKGSILLEIPDPLLIT-----GNKVCKWLERNNWIGHQQISSVQGVLLVSIFLFF 92
Query: 167 EKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR 226
E +Q SFW PY++ L L + LL +Y+T +A+I++ E ++R
Sbjct: 93 ESRQSD-SFWKPYLQVLPTSYDLLFLYRDGLLL------SYVT----EADIMQMVESVRR 141
Query: 227 EYNELDTVWFMAGSLFQQY--PY-----DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+ FQ Y P+ D F+ F + + AV S + +L
Sbjct: 142 ----------ILRDTFQTYVIPHFSSVDDRDKWNVLFKEFVRWYCAVVSRICYLPDDIAG 191
Query: 280 RRFALVPLGPPL--LAYSSKCKAMLAAVDDAVQLV-VDRPYKAGESIVVWCGPQPNSKLL 336
ALVPLG A + + A + + R + G + V G N++L+
Sbjct: 192 ---ALVPLGDIFNHEAVDTPVDILYAKWERGYYVFRAHRNFSIGTQVFVSYGALSNTELM 248
Query: 337 INYGFVDEDNPYDRL 351
+ YGF DNP+D L
Sbjct: 249 MYYGFTLNDNPWDTL 263
>gi|395326815|gb|EJF59220.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 429
Score = 45.4 bits (106), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 69/327 (21%), Positives = 123/327 (37%), Gaps = 83/327 (25%)
Query: 72 SKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY--------VAASEDLQAGD 123
S + E++ + KSW+ + G + H +H+ VAA D+ +
Sbjct: 3 SIEPENVANFKSWIAQQG--------------GQIHAGVHFEPVEFGFNVAARSDIPSDA 48
Query: 124 AAFSVPNSLVVTLERVLGNETIAELLTTNKLS----ELAC--LALYLMYEKKQGKKSFWL 177
S+P SL +T + I +LL T + +L C + L+ + E
Sbjct: 49 TVVSIPFSLAITPN--VARHAIKQLLNTEPQNWSERQLECTYIVLHSIVEPIDPSILRHR 106
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + +PL ++E EL+ GS L+R + E+ +
Sbjct: 107 PYLDTLPSPE-----QLRTPLHFTEAELSSFRGSNLFGATLDRKHEWETEWQQCKNTVSA 161
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG--PPLLAYS 295
A + + Q +FT+E + A + S R F L P L+
Sbjct: 162 AIAGWGQ--------SFTWEKYLTAATYLSS-----------RAFPSTILSDTPSLVTTE 202
Query: 296 SKCKAMLAAVD---------------------------DAVQLVVDRPYKAGESIVVWCG 328
+ +L +D ++ LV+ P G ++ G
Sbjct: 203 TSYPVLLPGIDALNHARGHPVSWVVSAPSQTSSSQRSESSISLVIHTPTPRGSELLNNYG 262
Query: 329 PQPNSKLLINYGFVDEDNPYDRLVVEV 355
P+PNS+L++ YGF +NP D +V+++
Sbjct: 263 PKPNSELILGYGFSLPNNPDDTIVLKI 289
>gi|393217169|gb|EJD02658.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 513
Score = 45.4 bits (106), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 114/302 (37%), Gaps = 75/302 (24%)
Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSELACLALYLMYEKK 169
A +D+ G FSVP SL ++ L +++G E + L NK L L +M+E+
Sbjct: 34 ALQDIPEGHTLFSVPRSLTLSTHTSELPKLIG-EAAWKSLRLNK--GWVGLILCMMWEEC 90
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+ S W Y L R A ++P+ W+ EL L G+ +I + E +R+Y
Sbjct: 91 RWTDSKWCGYFNILPR-------AFDTPMFWTGDELKELDGTDVLGKIGK--EQAERDYY 141
Query: 230 EL-------DTVWFMAGSLFQQYPYD--------IPTEAFTFEIFKQAFVAVQS------ 268
E+ F G + Y + I + +F E +K+ QS
Sbjct: 142 EILNPAVRTRPDLFDPGHIASFYSLENYHVMGSRILSRSFHVEKWKEQTPGSQSRASSEL 201
Query: 269 -----CV------VHLQKVSLAR--------------RFALVPLGPPLLAYSSKCKAMLA 303
C+ +L V A+VP+ L A A L
Sbjct: 202 HENGDCMDIDDESSNLSAVGAENGGDDDSDDEAENPSDIAMVPMADMLNAQYGSENAKLF 261
Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD----------EDNPYDRLVV 353
+ +V +P + GE I G PNS LL YG VD E NP D VV
Sbjct: 262 YEPTHLNMVSTKPIRRGEQIYNAYGDLPNSALLREYGHVDLVPLPGVPWKEGNPAD--VV 319
Query: 354 EV 355
E+
Sbjct: 320 EI 321
>gi|408392258|gb|EKJ71616.1| hypothetical protein FPSE_08255 [Fusarium pseudograminearum CS3096]
Length = 527
Score = 45.4 bits (106), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 34/60 (56%), Gaps = 2/60 (3%)
Query: 295 SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
S CK + +A+ +VQ DR YK GE + V GP N LL YGF+ + N +D + ++
Sbjct: 194 SQGCKLVYSALGYSVQ--TDRAYKQGEEVFVSYGPHSNDFLLTEYGFILDTNRWDEVYLD 251
>gi|113930683|ref|NP_001039027.1| SET domain-containing protein 4 [Danio rerio]
gi|66911144|gb|AAH96876.1| SET domain containing 4 [Danio rerio]
Length = 440
Score = 45.4 bits (106), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 64/291 (21%), Positives = 124/291 (42%), Gaps = 42/291 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVV 134
L+ W+++ G +I P+++ + A++ ++A ++ S+P ++
Sbjct: 37 LRRWLNERGFTSQSLI------------PVNFHDTGRGLMATQTIKAKNSVISLPEECLL 84
Query: 135 TLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
T VL + +A+ + +S L L +L+ E+ G+ S W PYI L +
Sbjct: 85 TTSTVLKS-YMADYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPK------- 136
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
PL + + + L S K + ++ E + ++ T + LF Q PTE
Sbjct: 137 TYTCPLYFPDNVIELLPRSLQK-KATQQKEQFQELFSSSQTFFHSLQPLFNQ-----PTE 190
Query: 253 A-FTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLLAY--SSKCKAMLAA 304
F+ + + A+ +V + V H Q L+R + L P LL + + + +A
Sbjct: 191 ELFSQDALRWAWCSVNTRTVYMEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVEAGFNK 250
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
++ K + + GP N +LL+ YGFV NP+ + V++
Sbjct: 251 ETRCYEIRSVNGCKKFQQAFINYGPHDNHRLLLEYGFVAPCNPHSVVYVDL 301
>gi|145517214|ref|XP_001444490.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411912|emb|CAK77093.1| unnamed protein product [Paramecium tetraurelia]
Length = 748
Score = 45.4 bits (106), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 66/141 (46%), Gaps = 23/141 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERV-LGNETIA-----ELLTTNKLS--ELACLALYL 164
V A++D+ A A VP +L+++ E+ L + +I EL N+ S E L YL
Sbjct: 46 VVATQDIPANTAIICVPQTLIISQEKCKLSSLSIVYDKHPELFDENQTSDAEFNILIFYL 105
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
EKK+G++SF+ PYI+ + + + W++ EL+ + E +E +
Sbjct: 106 FNEKKKGEQSFFYPYIQAIQTNN--------TLIDWTKEELSQIEDPIVLDEFAIVSEDL 157
Query: 225 KREYNELDTVWFMAGSLFQQY 245
K +W A +F ++
Sbjct: 158 K-------VLWNYAQDIFNEF 171
>gi|407923069|gb|EKG16157.1| hypothetical protein MPH_06594 [Macrophomina phaseolina MS6]
Length = 305
Score = 45.1 bits (105), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 54/250 (21%), Positives = 99/250 (39%), Gaps = 42/250 (16%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
D+Q G+ F++P S V++ + + +L L A L + ++YE +G S W
Sbjct: 3 DIQEGEVLFTIPRSAVLSATNSSLSSILPQLF--EHLDPWASLIVTMIYEYLRGDASPWK 60
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG----------IKRE 227
PY L ++ + WS+ ELA L S +I + + ++R
Sbjct: 61 PYFDVLPAH-------FDTLMFWSDDELAELQASAVTQKIGKDSANEMFTNTIIPLVRRH 113
Query: 228 YN---------------ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVH 272
+ +L + GS Y +DI + + ++ ++ + +
Sbjct: 114 ASVFFPDPNTAQGASDGDLLALAHRMGSTIMAYAFDIEPDPASKQVDEEGYASDD----- 168
Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 332
+ +L + +VPL L A + + A L DA+ + AG+ + G P
Sbjct: 169 -EDEALPK--GMVPLADMLNADADRNNARLHYGPDALTMEAVTNISAGDEVFNDYGSLPR 225
Query: 333 SKLLINYGFV 342
S LL YG+V
Sbjct: 226 SDLLRRYGYV 235
>gi|344300819|gb|EGW31140.1| hypothetical protein SPAPADRAFT_142076 [Spathaspora passalidarum
NRRL Y-27907]
Length = 436
Score = 45.1 bits (105), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 47/206 (22%), Positives = 77/206 (37%), Gaps = 19/206 (9%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW---SETELAYLTG 210
LS L +Y+ E ++GK SFW P+ LD + PL+W ++ +L L
Sbjct: 117 LSSFQLLGMYITIETQRGKSSFWKPF---LDMLPSIADFEL-MPLVWQINNQHDLLDLLP 172
Query: 211 SPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
P + + +YN + + Q D + F A++ + S
Sbjct: 173 QPIRKTSEKVYTRFTSDYNTV--------TALLQTKIDNTEAVLPLDQFLLAWICINSRC 224
Query: 271 VHLQ---KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 327
+++ S + F + P L +S L Q+ Y E + +
Sbjct: 225 LYMNLPTSKSASDNFTMAPY-VDFLNHSPNDHCTLKIDGRGFQVFSTCAYSENEQVYLSY 283
Query: 328 GPQPNSKLLINYGFVDEDNPYDRLVV 353
GP N LL YGF DN ++ L V
Sbjct: 284 GPHSNDFLLCEYGFTISDNKWNDLDV 309
>gi|302832548|ref|XP_002947838.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
gi|300266640|gb|EFJ50826.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
Length = 508
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 71/300 (23%), Positives = 110/300 (36%), Gaps = 49/300 (16%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+ +SW+ GL ++L+ R + AS L G+ +P+ LV+T ER
Sbjct: 24 EFQSWLRSEGLSTQPLLLRHC------GREGRGLVASRSLSRGEVLVKLPDHLVITAERA 77
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL 199
G ++ LL LA ++ + W PY+ L ++ G + L
Sbjct: 78 AGEWSLLALLLAEVKGRLAA------GDRSSPAAARWGPYVAVLPQRPG-------TLLD 124
Query: 200 WSETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEI 258
W E+ L GSP + + EL+ + G P +P E
Sbjct: 125 WPAKEVQQLLRGSPLQRLADSITSAASASWRELEPL-IAQGRADGLVPEHVPLSKGDLEW 183
Query: 259 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL---LAYSSKC----------------K 299
AF + S + L S L P L ++ C
Sbjct: 184 ---AFGVLLSRCIRLP--SRGDLQVLAPWADQLNHDVSAEEGCHLDWSWDVAGPAVPGGD 238
Query: 300 AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV--DEDNPYD--RLVVEV 355
A A+ L DRPY AG+ + V GP+ + +LL++YGF NP+ RL V V
Sbjct: 239 RAGGATKGALVLRADRPYAAGQQVYVSYGPKSSGELLLSYGFCPPPASNPHQDCRLRVAV 298
>gi|410900968|ref|XP_003963968.1| PREDICTED: SET domain-containing protein 4-like [Takifugu rubripes]
Length = 386
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 57/246 (23%), Positives = 111/246 (45%), Gaps = 23/246 (9%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSF 175
+++ GD S+P S ++T VL N + + + K LS L L ++L+ E+ +G+ S
Sbjct: 65 NVKPGDMLISLPESCLLTTSTVL-NSYLGSFIKSWKPHLSPLLALCVFLVCERHRGEASD 123
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PYI L + + P +++ +A L S + + E+ E + RE + + +
Sbjct: 124 WFPYIDVLPK-------SYTCPAYFTDEVMALLPPS-VQRKAREQREAV-REIHSSNKAF 174
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALVPLGPPLLAY 294
F + P + + T+E + A+ +V + V L + R V P L
Sbjct: 175 FRSLQPVLTQPAE---DVLTYEALRWAWCSVNTRSVFMLHSSNDFLRGQDVYALAPFLDL 231
Query: 295 SSKCK--AMLAAVDDAVQLV----VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
+ C + A+ ++ + V R + ++ + + G N +L++ YGFV NP+
Sbjct: 232 LNHCPDVQVKASFNEETKCYEIRSVSRMLQYQQAFINY-GSHDNQRLMLEYGFVAPCNPH 290
Query: 349 DRLVVE 354
+ V+
Sbjct: 291 SVVYVD 296
>gi|328771298|gb|EGF81338.1| hypothetical protein BATDEDRAFT_87914 [Batrachochytrium
dendrobatidis JAM81]
Length = 607
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 71/154 (46%), Gaps = 17/154 (11%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L LK W +N + + ++ + N R V A + L+ GD ++P +++++
Sbjct: 4 LNILKQWFGENKIAYDEEKIRIEHDTNNGFR----VFAKQTLEVGDILCAIPKEAILSIK 59
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESP 197
N +A++L L L + LM+E+ G+KS W YI+ L L P
Sbjct: 60 ----NCGVADVLEEQGLGGQLGLVIALMFERSLGEKSPWYGYIQSL-------PLRENIP 108
Query: 198 LLWSETELAYLTGSPTKAEILE-RAEGIKREYNE 230
L W + + A L G+ A +LE + +K +Y E
Sbjct: 109 LFWEKDQQACLDGTAV-AHLLEPMPKDLKADYKE 141
>gi|327295769|ref|XP_003232579.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
gi|326464890|gb|EGD90343.1| hypothetical protein TERG_06571 [Trichophyton rubrum CBS 118892]
Length = 488
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 59/288 (20%), Positives = 103/288 (35%), Gaps = 56/288 (19%)
Query: 97 LKEKPSHNEKHRPIHY-----------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETI 145
LK H + H IH + AS D+ + F +P+ LV++++ +
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSVGAGRGICASRDIAEDEELFIIPDDLVLSVQNSEARSAL 83
Query: 146 AELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
L +L L + ++YE QG++S W PY R L + ++ + W++ +L
Sbjct: 84 E--LDDKQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQL 134
Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
L GS +I + A DT+ L Q P P +
Sbjct: 135 LELQGSAVVGKIGKAAAD--------DTILQKVVPLIQANPRHFPPRPNMPPLNSSDSQN 186
Query: 266 VQSCVVH-LQKVSLARRF---------------------------ALVPLGPPLLAYSSK 297
C+ H + + +A F +VPL A + +
Sbjct: 187 ALLCLAHRMGSIIMAYAFDIEKTDEVDEDTAEDGYMTDDEDEPAKGMVPLADIFNADAQR 246
Query: 298 CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
A L + + + + +GE I G P + LL YG+V ++
Sbjct: 247 NNARLFQEEGSFVMKAIKNIHSGEEIFNDYGELPRADLLRRYGYVTDN 294
>gi|296810368|ref|XP_002845522.1| SET domain-containing protein [Arthroderma otae CBS 113480]
gi|238842910|gb|EEQ32572.1| SET domain-containing protein [Arthroderma otae CBS 113480]
Length = 491
Score = 45.1 bits (105), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 58/259 (22%), Positives = 103/259 (39%), Gaps = 44/259 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + F +P L++++E E + L +L L + ++YE QG+
Sbjct: 61 LGAVRDIAEDEELFVIPEDLILSVENSKAREALG--LNETQLGPWLSLIIVMIYEYYQGE 118
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKREYNE 230
+S W PY L + ++ + W+E +L L G +I + A E I ++
Sbjct: 119 QSRWEPYFHIL-------PTSFDTLMFWTEAQLQELQGCAVVDKIGKSAADEAILQKVVP 171
Query: 231 L--------------------DTVWFMA---GSLFQQYPYDI-PTEAFTFEIFKQAFVAV 266
L D + +A GSL Y +DI TE + + ++
Sbjct: 172 LIQANPHHFPARSGMPPLDSNDALLCLAHRMGSLIMAYAFDIEKTEGADDDAAEDGYMTD 231
Query: 267 QSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVW 326
+ A+ +VPL A + + A L + + + R +AGE I
Sbjct: 232 -------DEDEPAK--GMVPLADIFNADAQRNNARLFQEEGSFVMKAIRNIQAGEEIFND 282
Query: 327 CGPQPNSKLLINYGFVDED 345
G P + LL YG+V ++
Sbjct: 283 YGELPRADLLRRYGYVTDN 301
>gi|46129354|ref|XP_389038.1| hypothetical protein FG08862.1 [Gibberella zeae PH-1]
Length = 478
Score = 45.1 bits (105), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
R +P+ L + CK + +A+ +VQ DR YK GE + V GP N LL YG
Sbjct: 178 RLVCMPVAD-LFNHDQGCKLVYSALGYSVQ--TDRVYKQGEEVYVSYGPHSNDFLLTEYG 234
Query: 341 FVDEDNPYDRLVVE 354
F+ + N +D + ++
Sbjct: 235 FILDTNRWDEVYLD 248
>gi|403158396|ref|XP_003307692.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163798|gb|EFP74686.2| hypothetical protein PGTG_00642 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 622
Score = 45.1 bits (105), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 37/74 (50%), Gaps = 2/74 (2%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
A+VPL L A + A L D +++ R K GE I G PNS LL YG
Sbjct: 345 IAMVPLADLLNAKTGSENARLFYETDCLKMKATRNIKKGEQIYNTYGDPPNSDLLRRYGH 404
Query: 342 VDEDNPYDRLVVEV 355
VD+ N +D VVE+
Sbjct: 405 VDDPNRFD--VVEI 416
>gi|219125216|ref|XP_002182882.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405676|gb|EEC45618.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 488
Score = 45.1 bits (105), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 40/161 (24%), Positives = 63/161 (39%), Gaps = 17/161 (10%)
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
P WSE+EL L GSP + + EGI+++YN + W + + P+ F
Sbjct: 138 PRQWSESELKALRGSPLLIRVRKAREGIQKDYNLIREAW------KEIHSSHSPSTTANF 191
Query: 257 ---EIFKQAFVAVQS-----CVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAV--- 305
E F A V S K ++PL + +C +
Sbjct: 192 PKLEEFSSAMAVVSSRAFSGMAGFETKGGAVNDTTMIPLLDLCNHHRGRCVTKNVSYRFK 251
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN 346
D V + G+++ + G Q N++L +NYGF DN
Sbjct: 252 DGTVMVKAVTDIAIGDTLKITYGAQGNAQLFLNYGFCVADN 292
>gi|145349891|ref|XP_001419360.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579591|gb|ABO97653.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 465
Score = 45.1 bits (105), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 50/213 (23%), Positives = 81/213 (38%), Gaps = 31/213 (14%)
Query: 150 TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLT 209
T + S L L L E+ G KS + Y R L R A W++ E +YL
Sbjct: 89 TKTEASWLCGLTAALCVERSLGLKSRYFAYDRVLPRCEANVVCA------WNDGERSYLA 142
Query: 210 GSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSC 269
G+ + + + A K E+ + +F+++ + +FE F +A V S
Sbjct: 143 GTEVETSLRDEAAAAKNEWER------VVAPVFKEHGVEC-----SFEQFIEARTVVSSR 191
Query: 270 VVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD-----------AVQLVVDRPYK 318
L S LVP+ + + D V++ ++ +
Sbjct: 192 AFTL---SPNAGVGLVPIADAFNHLTGNHHVNVGDGDAVVRSETGGEALCVKVTNEQGVR 248
Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
G+ I G N+KLL +YGF DNP D +
Sbjct: 249 RGDEIFNTYGFHGNAKLLNSYGFTQNDNPADEV 281
>gi|326496433|dbj|BAJ94678.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 453
Score = 45.1 bits (105), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 55/243 (22%), Positives = 99/243 (40%), Gaps = 29/243 (11%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL 177
+L G+ VP L + + V + + L ++L ++ E +G S W
Sbjct: 52 NLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRPWVSVSLLILREAARGGDSLWA 111
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L RQ +S + WSE EL + G+ + + E ++ E++ ++
Sbjct: 112 PYLAILPRQ-------TDSTIFWSEEELLEIQGTQLLSTTMGVKEYVQSEFDNVE----- 159
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL------ 291
AG + D+ TF+ F AF ++S V + + AL+P +
Sbjct: 160 AGII--NVNKDLFPGTITFDDFLWAFGVLRSRVFPELR---GDKLALIPFADLINHDGDI 214
Query: 292 ----LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDN 346
+ K K L D L K+GE I V + + N++L ++YGF + ++
Sbjct: 215 TSKESCWEIKGKGFLGR-DTVFSLRTPVDVKSGEQIYVQYDLDKSNAELALDYGFTESNS 273
Query: 347 PYD 349
D
Sbjct: 274 SRD 276
>gi|226505024|ref|NP_001151430.1| SET domain containing protein [Zea mays]
gi|195646778|gb|ACG42857.1| SET domain containing protein [Zea mays]
gi|413923893|gb|AFW63825.1| SET domain containing protein [Zea mays]
Length = 491
Score = 45.1 bits (105), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 55/256 (21%), Positives = 108/256 (42%), Gaps = 44/256 (17%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLER-VLGNETIAELLTTNKLSELACLALYL-MYEKKQG 171
AA D+ GD ++P+ L + L R + + L EL + L L + +++
Sbjct: 93 AAYGDIPIGDVLIALPSQLPLRLRRPTSAADDVLVQLAQQVPDELWAMKLGLRLLQERAK 152
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI----KRE 227
SFW PYI L P+ + ++ L +P ++ +R + K
Sbjct: 153 SDSFWWPYIANLPE-------TFTVPIFFPGEDIKNLQYAPILHQVNKRCRFLLEFEKEV 205
Query: 228 YNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL--- 284
+L TV + + Q D+ + + + + + +R F L
Sbjct: 206 QQKLHTVPLVDHPFYGQ---DVNSSSLGWAM----------------SAASSRAFRLHGE 246
Query: 285 VPLGPPLL-----AYSSKCKAM----LAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKL 335
VP+ PL+ +++ + + + ++D +V+++ ++ K E+I + G PN
Sbjct: 247 VPMLLPLIDMCNHSFNPNARIVQERSVNSLDMSVKVLAEKKIKQNEAITLNYGCYPNDFF 306
Query: 336 LINYGFVDEDNPYDRL 351
L++YGFV NPYD++
Sbjct: 307 LLDYGFVITQNPYDQV 322
>gi|330924929|ref|XP_003300837.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
gi|311324820|gb|EFQ91062.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
Length = 372
Score = 45.1 bits (105), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 61/277 (22%), Positives = 107/277 (38%), Gaps = 44/277 (15%)
Query: 80 DLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
+L SW + G+ + + PS + A+ D+QAG+ VP + +L+ V
Sbjct: 6 ELLSWATERGVKLSGIKPQNIPSRGTG------IIATRDIQAGETILFVPFKVFRSLKHV 59
Query: 140 LGNETIAELLTTNKLSELACLALYLMYEKKQ--GKKSFWLPYIRELDRQRGRGQLAVESP 197
+ IA L N +S A LA YL +K + LP + + P
Sbjct: 60 --PKAIARRLPRN-MSLHALLAAYLTLDKTDTFAIANQTLPDLSSFE---------AGMP 107
Query: 198 LLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFE 257
LW EL P + ++ +R++ + V+ + E
Sbjct: 108 FLWP-AELHPFLPKPALDLLKKQQRNFQRDWATVSKVY----------------SNVSHE 150
Query: 258 IFKQAFVAVQSCVVHLQKVSLAR-----RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
+ +++ V + + S+ R R A++P+ C+A A+ + +
Sbjct: 151 QYLHSWLLVNTRSFYCTTPSMERLPHDDRLAILPVADLFNHADVGCEAQFAS--ENYSFI 208
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
DR Y+AGE + + G LL YGFV +N +D
Sbjct: 209 ADRTYRAGEELYISYGTHSTDFLLAEYGFVPAENRWD 245
>gi|291000152|ref|XP_002682643.1| predicted protein [Naegleria gruberi]
gi|284096271|gb|EFC49899.1| predicted protein [Naegleria gruberi]
Length = 619
Score = 45.1 bits (105), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 85/204 (41%), Gaps = 19/204 (9%)
Query: 157 LACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKA 215
L ++L+YE + +KS PY+ L R+ + L + E E+A L +
Sbjct: 106 LIVFYMFLIYELHVEKEKSTHFPYLNLLPRE-------FTTALYFDEDEMAALRSTNLYK 158
Query: 216 EILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
+ + +K+ Y E + M +YP + F++E F AF AV S V ++
Sbjct: 159 SVQSIRQNLKQIY-ETKVEYLM-----NKYPQKFDRQVFSYENFMWAFSAVWSRVFPIEY 212
Query: 276 -VSLARRFALVP-LGPPLLAYSSKCKAMLA---AVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+VP L P + + K A + D L K+G+ + G +
Sbjct: 213 PAENGEGVEIVPTLLPTVDILNHKFNAKITYFTGSDRRFYLKTRESLKSGDYVCNNYGAK 272
Query: 331 PNSKLLINYGFVDEDNPYDRLVVE 354
N L++YGFV +N D L V+
Sbjct: 273 SNDSFLLSYGFVIPNNSEDTLYVQ 296
>gi|403350232|gb|EJY74567.1| hypothetical protein OXYTRI_04175 [Oxytricha trifallax]
Length = 766
Score = 45.1 bits (105), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 67/164 (40%), Gaps = 23/164 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--------SELACLALYL 164
V A ED++ +A VP L++T+E + I + NK E L +++
Sbjct: 27 VRAREDIEHREAFLYVPFKLLITMELAHNHPIIGHVFKENKQIFTKEHEDFEQLTLTVFM 86
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+YE ++G +SFW PY+ L VE WS++++ + E I
Sbjct: 87 LYEYQKGLESFWFPYLNLLP--------DVEFFCNWSKSDIEAIDDQELAYETKSYKRDI 138
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
+ E+ E++ L YP + +F + F V S
Sbjct: 139 EIEWKEIEL-------LLLHYPQHFSSALIDKHLFMRIFAQVCS 175
>gi|428174941|gb|EKX43834.1| hypothetical protein GUITHDRAFT_140267 [Guillardia theta CCMP2712]
Length = 805
Score = 44.7 bits (104), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 43/190 (22%), Positives = 77/190 (40%), Gaps = 35/190 (18%)
Query: 69 EVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSV 128
E +K+ EDL + W+ +NG+ KV L+ H + + A + ++ + F +
Sbjct: 540 EGSAKRNEDLIEFSKWLRRNGVDDSKVKLRADGGHGMG----NSLYARQMIKEDELLFRI 595
Query: 129 PNSLVVTLERVLGNETIAELLTTNKL-----SELACLALYLM--------YEKKQ----- 170
P + + V + T+ ++ ++ E L+L LM YE Q
Sbjct: 596 PLKIAFYSDAVRRHPTLGSVIKGARIPQGMQGETFLLSLMLMGPLTHLEQYEACQVGHME 655
Query: 171 ------GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI 224
+ SFWLPYI+ L + +P+ W+E E L GS + +
Sbjct: 656 TGCKLSNETSFWLPYIKILPK-------TFSAPIFWNEVERQELKGSQVMEMLNDDLAQA 708
Query: 225 KREYNELDTV 234
+RE+ + V
Sbjct: 709 RREWEMMKIV 718
>gi|440464611|gb|ELQ34010.1| hypothetical protein OOU_Y34scaffold00824g3 [Magnaporthe oryzae
Y34]
Length = 373
Score = 44.7 bits (104), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 70/163 (42%), Gaps = 24/163 (14%)
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYN-ELDTVWFMAGSL----FQQYPYDIPT 251
P +W + EL L PT A + E + +YN E +TV S+ FQ Y + + T
Sbjct: 107 PFMWPK-ELQKLL--PTSARVF--LENQQTKYNHEWNTVSQAMPSISEERFQYYWHIVNT 161
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
F +E V + S R ALVPL C+ ++ + + +
Sbjct: 162 RTFLYE------------VSETECYSWEDRLALVPLADIFNHADEGCR--VSYMPEHYVI 207
Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
DR Y+AGE + + G N LL YGF+ N +D + ++
Sbjct: 208 TTDRAYEAGEELFISYGDHSNDCLLTEYGFLLPKNRWDIICID 250
>gi|428171155|gb|EKX40074.1| hypothetical protein GUITHDRAFT_113813 [Guillardia theta CCMP2712]
Length = 353
Score = 44.7 bits (104), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 1/79 (1%)
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
++ L+R FA G L +S Q+V ++ +K G+S+ + G + N +
Sbjct: 183 EIVLSRAFAFSRTGGDDLVFSG-TSVKYDNSKQEFQIVAEKDFKVGQSVEISYGLKSNHE 241
Query: 335 LLINYGFVDEDNPYDRLVV 353
LL++YGF+ DNP D V+
Sbjct: 242 LLLSYGFILPDNPEDFFVI 260
>gi|358399747|gb|EHK49084.1| hypothetical protein TRIATDRAFT_213818 [Trichoderma atroviride IMI
206040]
Length = 378
Score = 44.7 bits (104), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 36/166 (21%), Positives = 71/166 (42%), Gaps = 24/166 (14%)
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEA 253
V P++W EL +L + + +R + + ++++ F + DI E
Sbjct: 104 VGMPMMWPR-ELKHLLPLEPRNLVFKREKAFQGDWSD-----------FHKAFSDISYEE 151
Query: 254 FTFEIFKQAFVAVQSCVVHLQ-----KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA 308
+T+ A++ V + + + K R AL+P+ + C+ + +
Sbjct: 152 YTY-----AWLTVNTRTFYNESPETLKYPWEDRLALIPVADLFNHADAGCRVYYSP--EG 204
Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
+V DR YK GE + + N L+ YGFV ++NP D + ++
Sbjct: 205 YHIVADRDYKRGEELYISYSSHSNDYNLVEYGFVPDENPSDDVYID 250
>gi|146180409|ref|XP_001020886.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila]
gi|146144524|gb|EAS00641.2| hypothetical protein TTHERM_00411920 [Tetrahymena thermophila
SB210]
Length = 726
Score = 44.7 bits (104), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 81/188 (43%), Gaps = 31/188 (16%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE-----TIAELLTTNKLSELA---CLALYLM 165
AA++D+ A S+PN ++++ +R +E +E L + K ++ A L ++ M
Sbjct: 67 AATKDIAPLTAFISIPNKIIISYDRARFSELKSFFKQSEDLFSEKENDEAGVNVLTVFFM 126
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
YE+ +GKKS W Y L+ E+ L W+ E+ + + +
Sbjct: 127 YERLKGKKSLWHEYFEILENN--------ETILTWTAEEINRIPDPYIQKQA-------- 170
Query: 226 REYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFA 283
REY E +D +W L P T E+F A+ V S C + QK + +
Sbjct: 171 REYKEQVDELWDELKELLHSQPNFFQKATATKELFLWAYNIVMSRCFGYTQKGT-----S 225
Query: 284 LVPLGPPL 291
+VP L
Sbjct: 226 IVPFADCL 233
>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
Length = 471
Score = 44.7 bits (104), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 57/246 (23%), Positives = 102/246 (41%), Gaps = 35/246 (14%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P ++ T++ + + L + + LS LA Y+++ K
Sbjct: 42 FKEGEKILTIPAGILWTVKHAYADPLLGPALRSAQPPLSVEDTLATYILFVKS------- 94
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S +L++E +L G+ + + I+ ++ L
Sbjct: 95 ----RESGYDGQRSHIAALPASYSSSILFAEDDLEACAGTSLYTITKQLEQSIEDDHRAL 150
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEI----FKQAFVAVQSCVVHLQKVSLARRFALVPL 287
LF Q+P P + FT E +K A V S + LA ++ L
Sbjct: 151 VV------RLFVQHPDLFPLDKFTVEDVGLHYKWALCTVWSRAMDF---VLADGNSIRLL 201
Query: 288 GP--PLLAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
P +L ++S+ K + + + + Y+AG+ + + GP PNS+LL YGFV
Sbjct: 202 APFADMLNHTSEVKQCHVYDPSSGTLSVFAGKDYEAGDQVFINYGPVPNSRLLRLYGFVI 261
Query: 344 EDNPYD 349
NP D
Sbjct: 262 PGNPND 267
>gi|345325921|ref|XP_001512684.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Ornithorhynchus anatinus]
Length = 392
Score = 44.7 bits (104), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 37/164 (22%), Positives = 73/164 (44%), Gaps = 29/164 (17%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFS 127
K+E+ DL W NG + E +++ + A+ +++A +
Sbjct: 74 KREDYFPDLMKWATANG------------ASTEGFELVNFEEGFGLRATREIKAEELFLW 121
Query: 128 VPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELD 184
VP L++T+E N + L + +++ + LA +L+ E+ SFWLPYI+ L
Sbjct: 122 VPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERAN-PSSFWLPYIQTLP 179
Query: 185 RQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ ++PL + E E+ YL + ++ + + R+Y
Sbjct: 180 SE-------YDTPLYFEEDEVQYLQSTQAIHDVFSQYKNTARQY 216
>gi|255080174|ref|XP_002503667.1| set domain protein [Micromonas sp. RCC299]
gi|226518934|gb|ACO64925.1| set domain protein [Micromonas sp. RCC299]
Length = 401
Score = 44.7 bits (104), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 54/119 (45%), Gaps = 10/119 (8%)
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
++AE L +L L + +M+E+ G+ S W Y L RG+ + P+ W+
Sbjct: 38 SVAETLREARLGGGLALNIAIMHERSLGEGSRWAGYFAVLP---ARGERTL--PMFWTSA 92
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
+L +L G+ + E AE ++ ++NE + L +P P T E + +A
Sbjct: 93 QLEHLRGTDLLRHVTEDAESMRLDFNE-----NVVDGLCVTHPVAFPPGKHTLEAYMEA 146
>gi|321462357|gb|EFX73381.1| hypothetical protein DAPPUDRAFT_58066 [Daphnia pulex]
Length = 425
Score = 44.7 bits (104), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 67/306 (21%), Positives = 118/306 (38%), Gaps = 44/306 (14%)
Query: 56 RVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPC-KVILKEKPS-HNEKHRPIHYV 113
R+ + + SR + + +L WM NG K L KP+ N R +
Sbjct: 8 RIRNRLVRIVHSRPLRIDSHSEFVELCKWMSANGWNAVSKNCLVTKPALFNSTGRGL--- 64
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKK 173
A ++ +P SL++T E+VL I++LL + ++ CL +++ K G
Sbjct: 65 MAMSNIAPNHLLVQIPQSLLITKEKVLAE--ISDLLQFS-MTTAECLTFFILNSKFNGLY 121
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDT 233
S YI L + G L E+A L S + +I+ + ++Y ++
Sbjct: 122 S---SYISTLPKSFSVGGLC-------KSQEIAALP-SFLQEKIMCNQNFVLKKYEKIFA 170
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL--------- 284
+W I + E+F+ A+ V + V Q S L
Sbjct: 171 IW-----------RKIYGSTLSLELFQWAWFCVNTRAVFYQD-SKQHSHGLNKVDGMENN 218
Query: 285 VPLGPPLLAYSSKCKAMLAA----VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
+ L P L ++ + ++ A ++ DR K + + + GP N KL + YG
Sbjct: 219 MALAPYLDMFNHDAEVVVEAGFNKTTQCYEIRSDRHIKKYQQVFINYGPHDNMKLFLEYG 278
Query: 341 FVDEDN 346
F+ N
Sbjct: 279 FLATKN 284
>gi|308802149|ref|XP_003078388.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
tauri]
gi|116056840|emb|CAL53129.1| related to histone-lysine N-methyltransferase (ISS), partial
[Ostreococcus tauri]
Length = 446
Score = 44.7 bits (104), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 62/250 (24%), Positives = 102/250 (40%), Gaps = 34/250 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL-ACLALYLMYEK-KQ 170
VA + D+ G+ +VP V+ + T+ L+ + L LA +++ E
Sbjct: 25 VATTRDVTRGELLATVPLEKCVSTSSARADATLWRGLSARPGASLDGILAAHVLREAFGL 84
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G++S + P++R L + ++ + W E EL L GS A + + EY+
Sbjct: 85 GERSAFWPWLRLLPSE-------TDAAVGWDEDELRELQGSNVVAFARAIKKSWREEYDA 137
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAF--------TFEIFKQAFVAVQSCVVHLQKVSLARRF 282
LD F D P EAF TFE F A V S + L+ S +
Sbjct: 138 LD---------FAGLGVDFP-EAFGGEHAAHYTFEKFTWARFVVWSRAIDLKTDSTSA-- 185
Query: 283 ALVPLGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
++ + P+L A S K A +AV++ +K + +P+ L+
Sbjct: 186 PVIRMLVPILDMANHAPSGKLLPRWDAKANAVKIYAGSAFKRNTELRFNYDTKPSQYFLL 245
Query: 338 NYGFVDEDNP 347
YGF+ E NP
Sbjct: 246 QYGFIPEANP 255
>gi|428175768|gb|EKX44656.1| hypothetical protein GUITHDRAFT_109433 [Guillardia theta CCMP2712]
Length = 591
Score = 44.3 bits (103), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 67/267 (25%), Positives = 113/267 (42%), Gaps = 47/267 (17%)
Query: 100 KPSHNEKH-RPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK----- 153
KP E+ R I +A E++ FS+P ++++ + + + +IA + +K
Sbjct: 35 KPHDGERGVRVISDIAPCEEM------FSIPEKILMSRKSCMAS-SIAHVFRKHKDVLFS 87
Query: 154 -LSELACLALYLMYEK-KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
ELA L L ++YEK QG SFW P I L G + WSE EL L
Sbjct: 88 SRDELA-LTLLILYEKLDQGNASFWKPMIDILPADPG-------AASKWSEEELQELQDE 139
Query: 212 PTKAEILERAEGIKREYNE-LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCV 270
KAE + +++ Y L + G +F + +T+E F+ A + V+S
Sbjct: 140 SLKAEAMIVVASMQQTYQRVLRPILVQHGDVF-------SVDRYTWEEFRWALLCVESRT 192
Query: 271 VHLQKVSLARRF----ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD----RPYKAGES 322
RF ++VP L + + + + D ++ GE
Sbjct: 193 FG--------RFLPHPSIVPFADLLNHVNVQTSYRWLPEERRAAYMCDASGEHVHRRGEE 244
Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYD 349
+ GP+ N++LL++YGF + N Y+
Sbjct: 245 AFMSYGPRSNAELLLHYGFALQSNRYE 271
>gi|403338831|gb|EJY68658.1| hypothetical protein OXYTRI_10728 [Oxytricha trifallax]
Length = 770
Score = 44.3 bits (103), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 40/164 (24%), Positives = 71/164 (43%), Gaps = 26/164 (15%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE------TIAELLTTNKLSELACLALYLMYE 167
A ED+Q +A +PN ++T+ER +E + +++ + L +++M E
Sbjct: 67 AVKEDIQHNEAFVYIPNKCLITVERARSSEIGFIFANHENVFKSSEDRDFLTLLVFMMCE 126
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
++G +SFW PY +D G+L W + L+ L +I RE
Sbjct: 127 FQKGDQSFWYPYFNAVD----PGELTC----YWDQKYLSALDDKECIEQI--------RE 170
Query: 228 YN-ELDTVWFMAGSLFQQYPYD-IPTEAFTFEIFKQ--AFVAVQ 267
Y +D +W + + Y D E +FK+ AF+A +
Sbjct: 171 YGASMDEMWEQVHKILRVYSPDYFNLEVCNEAMFKKVAAFIATR 214
>gi|320163219|gb|EFW40118.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 1188
Score = 44.3 bits (103), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 39/174 (22%), Positives = 72/174 (41%), Gaps = 21/174 (12%)
Query: 197 PLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTF 256
PL W++ EL +L G+ I ER ++ ++ + V L ++ P P + FT+
Sbjct: 245 PLWWNDAELDHLDGTNIGGYIQERRNQVRNQFLNVFPV------LSREQPALFPKDVFTY 298
Query: 257 EIFKQAFVAVQSCVVHLQ-KVSLARRFALVPLGPPLLAYSSKCKAMLAAV---------- 305
E + AF S L+ V+ +G P+ +C L +
Sbjct: 299 EAYLWAFSTCSSRAFPLRVTVNPTTGVESHAIGNPMKEPCVECLLPLLDMMNHQFGASIT 358
Query: 306 ----DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ +V+ + GE + GP+ N +LL+ YGF +N D + +++
Sbjct: 359 WFTDETSVRFFTGAKVRKGEQVYNNYGPKSNEELLMGYGFCLPNNEADHVKIQL 412
>gi|358055500|dbj|GAA98620.1| hypothetical protein E5Q_05307 [Mixia osmundae IAM 14324]
Length = 462
Score = 44.3 bits (103), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 64/287 (22%), Positives = 120/287 (41%), Gaps = 39/287 (13%)
Query: 81 LKSWM-HKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT---L 136
L SW+ H +G +I+ + ++ + A+ DL AG S P++L +T
Sbjct: 10 LGSWLRHHDGFIHEHLIVVQDELGDKS------IIATTDLPAGTCIASCPHTLAITPTSA 63
Query: 137 ERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWL--PYIRELDRQRGRGQLAV 194
LG+ +LS+ + LYL+ K L Y+ L + A+
Sbjct: 64 RAALGHHA-------TELSDHQAMVLYLVLHKHPSPAVCCLHQAYVDTLPPRS-----AM 111
Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG--SLFQQYPYDIPTE 252
+PL ++ E+ L G+ + +R + E+ TV AG LF+ E
Sbjct: 112 RTPLWFNPAEVQLLQGTNLAGAVTDRQRDWQLEWM---TVLRRAGQSGLFKASF----EE 164
Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSS----KCKAMLAAVDDA 308
+ ++ ++ ++ HL + + + L P + A++ K ++
Sbjct: 165 TWPSALWAATILSSRAFPSHL--IDGNEQASTPVLFPGVDAFNHQQARKVTWQTSSASGR 222
Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
LV D P AG+ + GP+ N + L+ YGF+ +NP D +V+++
Sbjct: 223 FNLVQDEPTAAGQQVFNNYGPKSNEEFLLGYGFIIPNNPDDHMVLKL 269
>gi|406607002|emb|CCH41620.1| SET domain-containing protein 4 [Wickerhamomyces ciferrii]
Length = 424
Score = 44.3 bits (103), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 42/208 (20%), Positives = 89/208 (42%), Gaps = 24/208 (11%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
LS ++L+L E +GK+S+W P+I+ L + SP LW + G
Sbjct: 114 LSSFQIMSLFLELESSRGKESWWDPFIQMLPTIND----FLTSPFLWQ------IQG--- 160
Query: 214 KAEILER-----AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
K E++E+ + + +N ++ + +L + ++ + + F ++ + S
Sbjct: 161 KYELIEKLPKSTQKHSLKMFNRFESDFKAVKTLLE--THNASKDIINHDKFVLYWMCINS 218
Query: 269 CVVHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 325
+++ QK + + F + P + +S+ + L ++ YK + + +
Sbjct: 219 RCLYMEIPQKKTTSDNFTMAPY-VDFINHSTNDQCKLKIDRTGFHVITTSNYKENDELYL 277
Query: 326 WCGPQPNSKLLINYGFVDEDNPYDRLVV 353
GP N LL YGF +N ++ L +
Sbjct: 278 SYGPHSNEFLLCEYGFHLSNNEWNDLDI 305
>gi|388516285|gb|AFK46204.1| unknown [Lotus japonicus]
Length = 271
Score = 44.3 bits (103), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 49/97 (50%), Gaps = 7/97 (7%)
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVD 314
E FK +F + S +V L S+ + ALVP +L +S + L + D
Sbjct: 2 ESFKWSFGILFSRMVRLP--SMDGKVALVPWAD-MLNHSCDVETFLDYDKQSKGIVFTTD 58
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDED--NPYD 349
RPY+ GE + + G + N +LL++YGFV + NP D
Sbjct: 59 RPYQPGEQVFISYGKKSNGELLLSYGFVTREGANPSD 95
>gi|303271033|ref|XP_003054878.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226462852|gb|EEH60130.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 664
Score = 44.3 bits (103), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 67/141 (47%), Gaps = 17/141 (12%)
Query: 149 LTTNKLSELACLALYLMYE-KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
+T+ +++ A +AL+L++E Q +KS W P++ L R VE+PLLW+ ELA
Sbjct: 189 ITSREVTIDAVIALHLLHELYVQREKSEWWPWVSILPRD-------VETPLLWTPRELAQ 241
Query: 208 LTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ 267
L GS I R +K + D ++ L Q++P P E F E + A V
Sbjct: 242 LEGSNL---IGFRDAVLKGWTTQRDALF---PKLTQKFPSLFPEEHFRTERWAWAMAIVW 295
Query: 268 SCVVHLQKVSLARRFALVPLG 288
S V + R A+ P G
Sbjct: 296 SRAA---DVPVPRPEAIFPSG 313
>gi|307104961|gb|EFN53212.1| hypothetical protein CHLNCDRAFT_137077 [Chlorella variabilis]
Length = 512
Score = 44.3 bits (103), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 2/73 (2%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
V A++D+ AG VP L++++E + + L ++ L+ LA++L+ E +
Sbjct: 37 VLATQDIPAGTCVLRVPRHLLMSVESARRDAELCTALRQHRAALTSDQVLAVHLLCEASK 96
Query: 171 GKKSFWLPYIREL 183
G SFW PY+R L
Sbjct: 97 GAASFWQPYLRSL 109
>gi|302754812|ref|XP_002960830.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
gi|300171769|gb|EFJ38369.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
Length = 393
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 46/192 (23%), Positives = 81/192 (42%), Gaps = 27/192 (14%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LAL ++ E+ +G+ + W PYI L + +++ W +TEL+YL SP + E
Sbjct: 167 LALIVLMERYKGQ-AIWAPYISCLPQPA-----ELDNTFRWEDTELSYLRASPLYGKARE 220
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQ-SCVVHLQKVSL 278
R E I E+ ++ F L Q +++ Q F V + H+
Sbjct: 221 RLEMITTEFGQVQND-FCTCVLEQ-----------ALDVWPQLFGKVSLEDLKHVYATVF 268
Query: 279 ARRFALVPLGP---PLLAY-----SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
+R A+ P+L + +S K + + + DR Y + I + G
Sbjct: 269 SRSLAIGEDSTTLIPMLDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDL 328
Query: 331 PNSKLLINYGFV 342
N++L ++YGF
Sbjct: 329 SNAELALDYGFT 340
>gi|258567286|ref|XP_002584387.1| predicted protein [Uncinocarpus reesii 1704]
gi|237905833|gb|EEP80234.1| predicted protein [Uncinocarpus reesii 1704]
Length = 706
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 87/206 (42%), Gaps = 22/206 (10%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIREL--DRQRGRGQLAVESPLLWSE-TELAYLTG 210
+ E LA +LM + G +SFW PYI+ L D Q R + L W E T L L
Sbjct: 120 VEEPGALAFFLMDQYLLGDESFWAPYIQSLPDDSQFTRLEYYTGDDLKWLEGTNLLKLRE 179
Query: 211 SPTKAEILERAEGIK--REYNELDT---VW--FMAGSLFQQYPYDIPTEAFTFEIFKQAF 263
+ + G++ +E+ +T W F+ S I + AF+ E+ K
Sbjct: 180 KLLERLKAKYETGLRLLKEFPNKNTPKYTWERFLWASSI------ILSRAFSSEVLKDYI 233
Query: 264 VAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESI 323
+ V L+ S+ LVPL + + + A + + L+V + GE +
Sbjct: 234 KGTPTRVKPLEDFSV-----LVPLVD-ISNHQPLAQVEWATSLEKIGLIVHKTLLPGEEV 287
Query: 324 VVWCGPQPNSKLLINYGFVDEDNPYD 349
GP+ N +L++NYGF N D
Sbjct: 288 PNNYGPRSNERLMMNYGFCIRGNVCD 313
>gi|302498903|ref|XP_003011448.1| SET domain protein [Arthroderma benhamiae CBS 112371]
gi|291174999|gb|EFE30808.1| SET domain protein [Arthroderma benhamiae CBS 112371]
Length = 689
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 46/207 (22%), Positives = 81/207 (39%), Gaps = 36/207 (17%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
LA ++++E+ +G+ S W PY+ L R + S L + + +L +L G+
Sbjct: 104 LAFFMVHEQLKGRDSHWWPYLATLPRAS-----ELTSALFYQDNDLEWLQGTNLYQTHQA 158
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+K EY+ ++ G L E++ ++IF A+ + S + +
Sbjct: 159 YRNAVKEEYDSAISILRDEGFL--------AVESYRWDIFCWAYTLIAS------RAFTS 204
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDA----------------VQLVVDRPYKAGESI 323
R P L + + ML VD + + L V P +GE +
Sbjct: 205 RVLDAYFSNHPTLKQDEEFQIMLPLVDSSNHKPLAKIEWRAEATEIGLKVIEPTFSGEEV 264
Query: 324 VVWCGPQPNSK-LLINYGFVDEDNPYD 349
G N + ++ YGF DNP D
Sbjct: 265 HNNYGSLNNQQSVMTTYGFCIVDNPCD 291
>gi|255077808|ref|XP_002502485.1| set domain protein [Micromonas sp. RCC299]
gi|226517750|gb|ACO63743.1| set domain protein [Micromonas sp. RCC299]
Length = 728
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 19/137 (13%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE-RVLG 141
+WM K G+ V + P H + A+ D++ GD VP + ++T + V G
Sbjct: 51 AWMKKKGVKLNGVSIGRFP-HTGRG-----CVATRDIKEGDVLVEVPEAAIITADGSVAG 104
Query: 142 NETIAELLTTNKL-------SELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAV 194
+ +A L L E L L +M E +G++S + PY+ L R A
Sbjct: 105 SALVAFGLGGEALLHEYSPRLEREALVLAVMAEMSRGEESEFAPYLAALPTLR-----AT 159
Query: 195 ESPLLWSETELAYLTGS 211
SPL WS EL+ L G+
Sbjct: 160 HSPLGWSGAELSELEGT 176
>gi|207346544|gb|EDZ73016.1| YDR257Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 354
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+++PL L A +SKC A L + +++V R + E + G PNS+LL YG+V
Sbjct: 80 SMIPLADMLNADTSKCNANLTYDSNCLKMVALRDIEKNEQVYNIYGEHPNSELLRRYGYV 139
Query: 343 DED-NPYD 349
+ D + YD
Sbjct: 140 EWDGSKYD 147
>gi|296232125|ref|XP_002761462.1| PREDICTED: SET domain-containing protein 4 [Callithrix jacchus]
Length = 440
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 57/238 (23%), Positives = 95/238 (39%), Gaps = 20/238 (8%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
LQ G S+P S ++T + V+ + A + S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPESCLLTTDTVIQSYLGAYIAKWKPPPSPLLALCTFLVSEKHAGDRSLWK 127
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + A P+ E E+ L KA+ E+ ++ + +
Sbjct: 128 PYLEILPK-------AYTCPVC-LEPEVVNLLPISLKAKAEEQRAHVQEFFASSRDFFSS 179
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGP--PLL 292
LF + I F++ A+ V + V+L Q L+ L P LL
Sbjct: 180 LQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRPRQWECLSAEPDTCALAPYLDLL 235
Query: 293 AYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
+S + KA ++ ++ E + + GP N +L + YGFV NP+
Sbjct: 236 NHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFICYGPHDNHRLFLEYGFVSGHNPH 293
>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
Length = 649
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 63/256 (24%), Positives = 106/256 (41%), Gaps = 46/256 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ + + L +AL L+ E +G
Sbjct: 244 LVAARDLPRGEVLAEVPKKLWLDADAVAASD-LGGAVGRGGLRPWVAVALLLLREAARGA 302
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L RQ +S + WSE EL + G+ ++L G+K EY +
Sbjct: 303 GSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGVK-EYVQ-- 348
Query: 233 TVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
S F+ +I +E TF F AF ++S V + + AL+
Sbjct: 349 -------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR---GDKLALI 398
Query: 286 PLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 333
P L+ +S K K + D L K+GE I + + + N+
Sbjct: 399 PFA-DLVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQYDLDKSNA 456
Query: 334 KLLINYGFVDEDNPYD 349
+L ++YGF + ++ D
Sbjct: 457 ELALDYGFTESNSSRD 472
>gi|320167148|gb|EFW44047.1| hypothetical protein CAOG_02072 [Capsaspora owczarzaki ATCC 30864]
Length = 533
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 60/264 (22%), Positives = 111/264 (42%), Gaps = 35/264 (13%)
Query: 113 VAASEDLQAGDAA--FSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
+ AS+ ++A SVP S +L + +A L E A L+L +YE
Sbjct: 101 IFASQAIEASTTTPLLSVPLSTFFARFTLLDSPMMAALAVRPVAREEAKLSLLFLYEYFD 160
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
SFW P+ + R+ ++ W + L L + + I + I+ EY++
Sbjct: 161 -PDSFWQPWFQLFPRE-------LDCAGFWDDLLLMELDNTSIRDAIRQLEALIEYEYDQ 212
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL--- 287
LD +L ++P + F+++ FK AF+ + S + + V+ A ++P
Sbjct: 213 LDL-----PALRLRFPDSFVADRFSYDDFKWAFMVLASRGLTM-SVNNAPCTVMIPFVDF 266
Query: 288 ----GPPLLAYSSKCKAMLAA------VDDAVQ------LVVDRPYKAGESIVVWCGPQP 331
G +A+S +A A+ DD+V+ + + + GE + +
Sbjct: 267 FNHNGAKSIAFSYTRRAGDASDVSSGNYDDSVENLNCAVISGNETFLPGEQMFLNYKAHS 326
Query: 332 NSKLLINYGFVDEDNPYDRLVVEV 355
N LL++YGF N +D +V +
Sbjct: 327 NEVLLLHYGFALPHNEHDTFLVRL 350
>gi|302510645|ref|XP_003017274.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
gi|291180845|gb|EFE36629.1| hypothetical protein ARB_04152 [Arthroderma benhamiae CBS 112371]
Length = 479
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 30/126 (23%), Positives = 55/126 (43%), Gaps = 14/126 (11%)
Query: 97 LKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
LK H + H IH A + + F +PN L+++++ + L
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPNDLILSVQNSEARSVLG--LDD 81
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+L L + ++YE QG++S W PY R L + ++ + W++ +L+ L GS
Sbjct: 82 KQLGPWLSLIITMIYEYYQGEQSKWYPYFRILPS-------SFDTLMFWTDEQLSELQGS 134
Query: 212 PTKAEI 217
+I
Sbjct: 135 AVVGKI 140
>gi|50546259|ref|XP_500648.1| YALI0B08624p [Yarrowia lipolytica]
gi|49646514|emb|CAG82890.1| YALI0B08624p [Yarrowia lipolytica CLIB122]
Length = 490
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 71/185 (38%), Gaps = 18/185 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A +DL AGD VP S ++ R G IA LL + L +A L + +YE+ G
Sbjct: 36 VFAKKDLDAGDIVLKVPKSACLS-PRTCG---IANLLDEHDLDNIAGLLVAFLYERSLGD 91
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE-YNEL 231
+S W + L E P WS E L EI G E Y EL
Sbjct: 92 QSPWHEFFESLKPVIAD---VPEIPKFWSNDEDRALLSGTEVEEIGGLETGEDEEVYQEL 148
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+F I E +F+ FK+ V + S + + R LVP G L
Sbjct: 149 IVPFFEDNGKL------INLECPSFDEFKKLVVVIASRAFEVDQF---RELCLVP-GACL 198
Query: 292 LAYSS 296
+S
Sbjct: 199 FNHSD 203
>gi|410079629|ref|XP_003957395.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
gi|372463981|emb|CCF58260.1| hypothetical protein KAFR_0E01060 [Kazachstania africana CBS 2517]
Length = 534
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 63/144 (43%), Gaps = 9/144 (6%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V A DL AG +P S + + N TI+ LL ++ + L L +YE +
Sbjct: 40 VFAKRDLPAGTTLLQLPKSAIFSA----SNSTISNLLVEEEIDGVLALNLAFIYETTVFR 95
Query: 173 -KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE-YNE 230
KS W PY++ + +G ++V P WSE L G T + L A ++E Y
Sbjct: 96 EKSHWYPYLKSIQVVDSQGNISV-PPGYWSEEAKDLLRG--TTLDTLYDALSPQQEVYEG 152
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAF 254
+ +A Q++ +P E F
Sbjct: 153 FEISLHVAKKWNQEFSLPLPEEYF 176
>gi|358395377|gb|EHK44764.1| hypothetical protein TRIATDRAFT_80097 [Trichoderma atroviride IMI
206040]
Length = 463
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 54/240 (22%), Positives = 99/240 (41%), Gaps = 27/240 (11%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
Q G+ ++P + T+E + + +L + + LS LA+YL++ +
Sbjct: 34 FQQGERILTIPGDSLWTVEHADSDPLLGPVLRSVQPPLSVEDTLAVYLLFVR-------- 85
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+RE + R +A S + ++E EL G+ + E I+ +Y
Sbjct: 86 ---LREHGYEGPRSHVAAMPARYSSSIFFNEDELEVCAGTSLYTITKQLEERIEDDYR-- 140
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL 291
+ +F Q+P +P + + +K A V S + + L P +
Sbjct: 141 ----VLVMRVFTQHPDLLPLAKISIQDYKWALCTVWSRAMDFVLPNGKPLRVLAPFAD-M 195
Query: 292 LAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+ +S + K A + L V + Y+ G+ I + G PN++LL YGFV +NP D
Sbjct: 196 INHSPEVKQCHAYDPSSGNLSVLAGKDYEIGDQIYISYGSIPNNRLLRLYGFVIPENPND 255
>gi|217074704|gb|ACJ85712.1| unknown [Medicago truncatula]
Length = 209
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 56/120 (46%), Gaps = 8/120 (6%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP SL +T + + + + + +A LA L+ K G+ S
Sbjct: 66 ASKSIQTGDCILQVPYSLQLTPDNLPPE---IKPFISEDVGNIAKLATVLLIHKNLGQDS 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PYI L Q + + + W+E+EL + S E + + I++++ E+ V
Sbjct: 123 EWHPYISCLPPQA-----EMHNTIFWNESELEMIRQSSVYQETIYQKSQIEKDFLEIKPV 177
>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Oryza sativa Japonica Group]
gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 63/256 (24%), Positives = 106/256 (41%), Gaps = 46/256 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A+ DL G+ VP L + + V ++ + + L +AL L+ E +G
Sbjct: 90 LVAARDLPRGEVLAEVPKKLWLDADAVAASD-LGGAVGRGGLRPWVAVALLLLREAARGA 148
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
S W PY+ L RQ +S + WSE EL + G+ ++L G+K EY +
Sbjct: 149 GSPWAPYLAILPRQ-------TDSTIFWSEEELLEIQGT----QLLSTTMGVK-EYVQ-- 194
Query: 233 TVWFMAGSLFQQYPYDIPTE-------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALV 285
S F+ +I +E TF F AF ++S V + + AL+
Sbjct: 195 -------SEFESVEAEIISENRELFPGTVTFNDFLWAFGILRSRVFAELR---GDKLALI 244
Query: 286 PLGPPLLAYSS-----------KCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNS 333
P L+ +S K K + D L K+GE I + + + N+
Sbjct: 245 PFA-DLVNHSDDITSKESSWEIKGKGLFGR-DVVFSLRTPVNVKSGEQIYIQYDLDKSNA 302
Query: 334 KLLINYGFVDEDNPYD 349
+L ++YGF + ++ D
Sbjct: 303 ELALDYGFTESNSSRD 318
>gi|395848935|ref|XP_003797093.1| PREDICTED: SET domain-containing protein 4 [Otolemur garnettii]
Length = 440
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 67/299 (22%), Positives = 120/299 (40%), Gaps = 40/299 (13%)
Query: 65 AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHY------VAASED 118
A SR V + + +LK W LK++ + P H+ + +
Sbjct: 20 AESRGVNESFKCEFIELKKW------------LKDRKFEDTNLMPAHFPGTGRGLMSKTS 67
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P + ++T + V+ + + +T K S L L +L+ EK G +S W
Sbjct: 68 LQEGQMIISLPENCLLTTDTVIES-YLGAYITKWKPPPSPLLALCTFLVSEKHAGDQSPW 126
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
PY+ L + A P+ E E+ L P KA+ E+ ++ + +
Sbjct: 127 KPYLEILPK-------AYTCPVC-LEPEVVNLLPKPLKAKAEEQRAHVQEFFASSRDFFS 178
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + I F++ A+ V + V+L+ + L+ L P L
Sbjct: 179 SLQPLFAEAVDSI----FSYSALLWAWCTVNTRAVYLRHRRRECLSAEPDTCALAPYLDL 234
Query: 292 LAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + A ++ ++ ++ E + + G N +LL+ YGFV NP+
Sbjct: 235 LNHSPNVQVRAAFNEETRCYEIRTASSWRKHEEVFICYGHHDNQRLLLEYGFVSIQNPH 293
>gi|308812294|ref|XP_003083454.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116055335|emb|CAL58003.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 492
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 32/56 (57%)
Query: 294 YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+S A + ++ V+LV R KAG+ I + G N +L ++YGF+ EDN +D
Sbjct: 248 HSFDASARVRECENGVELVTTRDLKAGQPIELCYGELSNDELFLDYGFIVEDNAFD 303
>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 406
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 51/196 (26%), Positives = 83/196 (42%), Gaps = 19/196 (9%)
Query: 159 CLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPL--LWSETELAYLTGSPTKAE 216
LALY+++ + +G+ + +RQ L E L +++ EL GS
Sbjct: 76 TLALYILFVRSRGEDPAYA------ERQTHVAMLPSEYTLSMYFTDEELRVCAGSSLYTL 129
Query: 217 ILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
+ +Y +L T FM LF P + F+F+ +K A ++ S +
Sbjct: 130 TTHLRGRVGDDYKKLLTGVFMRHRDLF-------PLDKFSFQHYKWALSSIWSRGMDF-T 181
Query: 276 VSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV--DRPYKAGESIVVWCGPQPNS 333
+S L+ +L ++S K A L V R Y+ G+ + ++ G NS
Sbjct: 182 ISEGNSVRLMAPFADMLNHASDAKQCHAYDPSTGSLTVLACRDYEVGDQVFIYYGNVSNS 241
Query: 334 KLLINYGFVDEDNPYD 349
+LL YGFV DNP D
Sbjct: 242 RLLRLYGFVLPDNPND 257
>gi|190402231|gb|ACE77646.1| hypothetical protein [Sorex araneus]
Length = 350
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 25/105 (23%), Positives = 54/105 (51%), Gaps = 1/105 (0%)
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
E+FT+E ++ A +V + + +R AL+PL + DD +
Sbjct: 12 ESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 71
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
V + ++AGE I ++ G + N++ +++ GF ++N +DR+ +++
Sbjct: 72 CVALQDFRAGEQIYIFYGTRSNAEFVVHSGFFFDNNSHDRVKIKL 116
>gi|357131408|ref|XP_003567330.1| PREDICTED: ribosomal N-lysine methyltransferase 3-like
[Brachypodium distachyon]
Length = 495
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 65/313 (20%), Positives = 111/313 (35%), Gaps = 63/313 (20%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
K WM K+G V+ + + YV A L+ GD ++P +T R
Sbjct: 13 FKRWMSKHG-----VVCSDALCLDASEAGGVYVRALSALREGDLVATIPRRACLT-PRTS 66
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
G +L LA+ +MYE+ +G +S W Y+R + PL+W
Sbjct: 67 GAAAAI---EAAELGGTLALAVAVMYERARGAESPWNAYLRLIPD-------CEPVPLVW 116
Query: 201 SETELA-YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ E L+G+ + + E + ++ E +G L + E F+ E +
Sbjct: 117 PDEEAERLLSGTELDKIVKQDREFLCEDWKECIEPLISSGDL------GVNPEDFSLEKY 170
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA----------- 308
A + S H+ + +VPL + V DA
Sbjct: 171 FAAKSLLSSRSFHIDSYHGS---GMVPLADLFNHKTDGEHVHFTKVSDASDSDEGEDDDD 227
Query: 309 --------------------------VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+++++ R AGE + G N+ LL YGF
Sbjct: 228 QSNAGSDEEPTVENSATNPSGYNDEDLEMIIVRDANAGEEVYNTYGTMGNAALLHRYGFT 287
Query: 343 DEDNPYDRLVVEV 355
+ DNPYD + +++
Sbjct: 288 ELDNPYDIVNIDL 300
>gi|119495234|ref|XP_001264406.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119412568|gb|EAW22509.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 492
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 62/301 (20%), Positives = 121/301 (40%), Gaps = 57/301 (18%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASE----DLQAGDAAFSVPNSLVVTL 136
L +W NG+ + ++ + + VA +E D +A D +VP+ L +TL
Sbjct: 11 LSTWAKLNGMSLEGIAFQKLHGEHGTDKGTAIVATAEKKDEDAEA-DTLLTVPSDLALTL 69
Query: 137 ERVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWL 177
E V + + + + T + + L L + + + +K G + W
Sbjct: 70 EYVHNHAKTDRHLREVLDAVGDFGRTARGAILIFLIVQITHASPDFANQRQKIGVSNPWT 129
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----D 232
YIR + ++ P +S E L G+ + + + +++E+ L D
Sbjct: 130 EYIRFM-------PASIPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFEHLRQATED 182
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
W Q++ +D T FTF+ K +S VV L + A + P +
Sbjct: 183 IHWC------QEHWWDEDTGKFTFDDLKYVDAVYRSRVVDLPRSGHA-------IVPCVD 229
Query: 293 AYSSKCKAMLAAVDD-------AVQLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDE 344
+ C+ ++ A D +QL + + GE + + G + P S+++ +YGFV+
Sbjct: 230 MANHACEDLVKARYDEDGAGNAVLQLRTGKKLRVGEEVTISYGDEKPASEMVFSYGFVEN 289
Query: 345 D 345
+
Sbjct: 290 E 290
>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 503
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 127/288 (44%), Gaps = 42/288 (14%)
Query: 76 EDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVT 135
E L L +W+ + GLP K+ ++ + + + S+ + G +VP+S +T
Sbjct: 50 ETLPPLSAWVEQRGLPLKKLNVRPEIVEGDL-----CLVVSKPTKKGQPLVAVPSSAWLT 104
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
++V+ + +I L+ L +AL+L++E+ + + W ++ + A +
Sbjct: 105 -QQVVRSSSIGSLV--EDLEPWLQIALFLLHERSKPDAA-WQGFLDSI-------PAAPD 153
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
PL WSE EL+ L G+ + + + + +Y EL+ LF + P ++
Sbjct: 154 VPLFWSEEELSQLEGTQLLSSVQGYRQFFEAKYAELEE------QLFAPHREAFPPKSHQ 207
Query: 256 FEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAY----SSKCKAMLAA--VDDAV 309
+ F A V+S V ALVPL L+ + ++ + LA A
Sbjct: 208 LDDFLWAVATVRSRV---HSPLDGEDVALVPLA-DLVQHRKLQGARWQLQLAGGLFSKAQ 263
Query: 310 QLVVD--RPYKAGESIVVWCGP--------QPNSKLLINYGFVDEDNP 347
LVV+ R Y GE + + G + +S++L++YG +D D P
Sbjct: 264 ALVVEAQRDYAEGEVVTMDFGAPLTEEDQEKLDSQVLLDYGALDADRP 311
>gi|324503528|gb|ADY41532.1| SET domain-containing protein 3 [Ascaris suum]
Length = 502
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 54/246 (21%), Positives = 95/246 (38%), Gaps = 54/246 (21%)
Query: 136 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
L++ + I + + L+ + C +K S WLPY+ L +
Sbjct: 142 LKKCFEQDMIVKTMDNVALALMVCC-------QKLSPDSSWLPYLDALPQ-------TFS 187
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG---------------- 239
+PL +S EL L+ SP E L + R++ V+F+A
Sbjct: 188 TPLYFSALELRKLSPSPAYEESLIMYRNVARQF-----VYFLAAVQRSERSRSAKKDKNH 242
Query: 240 ------SLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL----QKVSLARRFALVPLGP 289
LF P+ + FTF++++ A V + + + K S + A VP
Sbjct: 243 AAVGMEPLFLNAPFTVSN--FTFDLYRWAVACVTTRINFIPSQYAKDSNGQPVA-VPCLI 299
Query: 290 PLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV-D 343
PLL + + D + + YKAG+ + ++ G + N + ++ GFV D
Sbjct: 300 PLLDMANHEFDHPLTVHFSTEGDYASIKATKDYKAGDEVTIFYGIRTNRQFFLHNGFVPD 359
Query: 344 EDNPYD 349
+N D
Sbjct: 360 GENKND 365
>gi|33468718|emb|CAE30375.1| SI:dZ63M10.4 (novel protein similar to human chromosome 21 open
reading frame 18 (C21orf18)) [Danio rerio]
Length = 440
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 111/253 (43%), Gaps = 24/253 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQ 170
+ +++ ++A ++ S+P ++T VL + +A+ + +S L L +L+ E+
Sbjct: 63 LMSTQTIKAKNSLISLPEECLLTTSTVLKS-YMADYIKRWHPPISPLLALCCFLISERHH 121
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G+ S W PYI L + PL + + + L S K + ++ E + ++
Sbjct: 122 GEASEWNPYIDILPK-------TYTCPLYFPDNVIELLPRSLQK-KATQQKEQFQELFSS 173
Query: 231 LDTVWFMAGSLFQQYPYDIPTEA-FTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVP 286
T + LF Q PTE F+ + + A+ +V + V H Q L+R +
Sbjct: 174 SQTFFHSLQPLFNQ-----PTEELFSQDALRWAWCSVNTRTVYMEHDQSKYLSREKDVYA 228
Query: 287 LGP--PLLAY--SSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
L P LL + + + +A ++ K + + GP N +LL+ YGFV
Sbjct: 229 LAPYLDLLNHCPNVQVEAGFNKETRCYEIRSVNGCKKFQQAFINYGPHDNHRLLLEYGFV 288
Query: 343 DEDNPYDRLVVEV 355
NP+ + V++
Sbjct: 289 APCNPHSVVYVDL 301
>gi|302820198|ref|XP_002991767.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
gi|300140448|gb|EFJ07171.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
Length = 389
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 56/114 (49%), Gaps = 9/114 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ ++AG+ +P+ LV+T E++ ++ + +LL+T + L L ++ E+ +G+ S
Sbjct: 14 AARSIRAGEQIVRIPHDLVLTAEKL--DDCVKKLLSTEY--DWCPLTLLILAEQHKGEAS 69
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
W PY+ L S + W + EL +L + ER E I EY
Sbjct: 70 RWAPYVSCLPSFGDH-----HSTIFWEKEELKFLECTRAFRGTAERREMISDEY 118
>gi|322707769|gb|EFY99347.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
Length = 467
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 47/197 (23%), Positives = 81/197 (41%), Gaps = 23/197 (11%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDR--QRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L+L+ E + KSFW PYIR L + Q + Q A+ W + E L G+ + I +
Sbjct: 95 LFLIKEYLKRDKSFWWPYIRALPQPGQGNKSQWALAP--FWDDDEAELLEGTNVEVGIDK 152
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-------CVVH 272
++R+ E + + G + +A T E+++ A+ S +
Sbjct: 153 IRNDVRRDLQEAQELLRLHGDADGAF-----GKALTTELYQWAYCIFSSRSFRPSLVLSD 207
Query: 273 LQKVSLARRFALVPLGPPLLAYSSKCKAMLAAV----DDAVQ---LVVDRPYKAGESIVV 325
Q+ SL R + L + M + DD Q L V R + G+ +
Sbjct: 208 EQRRSLPRGVTMDDFSVLLPLFDIGNHDMTTEIRWDLDDDRQTCELRVGRTHMPGQQVFN 267
Query: 326 WCGPQPNSKLLINYGFV 342
+ N++LL+ YGF+
Sbjct: 268 NYSMKTNAELLLGYGFM 284
>gi|414886518|tpg|DAA62532.1| TPA: hypothetical protein ZEAMMB73_960129 [Zea mays]
Length = 483
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 62/278 (22%), Positives = 116/278 (41%), Gaps = 58/278 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
+AA+ DL+ G+ +P + ++T +RV ++ IA ++ +K LS + L + L+ E
Sbjct: 51 LAAARDLRRGELVLRLPRAALLTSDRVTADDPRIAACVSAHKPRLSSVQILIVCLLAEVG 110
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI---KR 226
+G S W PY+ +L T LA T + + E L+ + I ++
Sbjct: 111 KGSNSVWYPYLCQLPSYY---------------TILA--TFNDFEVEALQVDDAIWVAQK 153
Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
+ + + W A L ++ + + F+ + AF V S +H ++ L P
Sbjct: 154 AKSAIKSDWEDATPLMKELEF--KPKLLMFKSWLWAFATVSSRTLH---IAWDEAGCLCP 208
Query: 287 LG-------------------PPLLAYSSKCKAMLAAVD----------DAVQLVVDRPY 317
+G L Y K M + + +A L + Y
Sbjct: 209 VGDLFNYAAPDDDTLLEDEDTAELTNYQQK-NGMTNSSERLTDGGYEDCNAYCLYARKNY 267
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
K GE +++ G N +LL +YGF+ +NP ++ +E+
Sbjct: 268 KKGEQVLLAYGTYTNLELLEHYGFLLGENPNEKTFIEL 305
>gi|156849027|ref|XP_001647394.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
70294]
gi|156118080|gb|EDO19536.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
70294]
Length = 494
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 54/253 (21%), Positives = 99/253 (39%), Gaps = 36/253 (14%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK-KQG 171
+ A ED+ G+ F +P ++ + T++L E L L ++YE G
Sbjct: 43 MVAVEDVAEGETLFEIPRGSILNVNTSALTRDYPSF-GTSQLGEWEELILCMLYEMFVLG 101
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL--------TGSPTKAEILERA-E 222
+ S W PY L + S + WS+ EL L G E+ +
Sbjct: 102 ENSRWYPYFNVLP-----SSAELNSLIYWSDRELGLLKPSFVIERIGRGKSQEMFSKVLS 156
Query: 223 GIKREYNELDTV--------WFMAGSLFQQYPYDI----PTEAFTFEIFKQAFVAVQSCV 270
I+ + ++L + + S+ Y +D+ P EI + S
Sbjct: 157 YIENQDSDLSLIAKYLTWENFVYVASIIMSYSFDVEDLNPQSDEDDEIEDDDNDSEMSPD 216
Query: 271 VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQ 330
++ +++PL L + + C A L + +++ +P +AGE + G
Sbjct: 217 KSIK--------SMIPLADTLNSDTHLCNANLMYDKETLKMTAIKPIRAGEEVFNIYGEH 268
Query: 331 PNSKLLINYGFVD 343
PNS++L YG+V+
Sbjct: 269 PNSEILRRYGYVE 281
>gi|346327621|gb|EGX97217.1| SET domain-containing protein, putative [Cordyceps militaris CM01]
Length = 371
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 36/75 (48%), Gaps = 2/75 (2%)
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
R AL+P+ S C + + + DR Y+A E + G N LL YG
Sbjct: 178 RLALLPVADMFNHASVGCAVAFST--EVYDVTADRDYEADEELYTSYGAHSNDFLLAEYG 235
Query: 341 FVDEDNPYDRLVVEV 355
F+ +DNP+D+L ++
Sbjct: 236 FMLQDNPHDQLCLDA 250
>gi|440802833|gb|ELR23759.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Acanthamoeba castellanii str. Neff]
Length = 518
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 73/307 (23%), Positives = 115/307 (37%), Gaps = 56/307 (18%)
Query: 83 SWMHKNGLP-----PCKVILKEKPSHNEKHR---PIHYVAASEDLQAGDAAFSVPNSLVV 134
+W+ NGL K L P R PIH ED+ A SVP L++
Sbjct: 13 TWLATNGLDVECPDHVKFRLATFPWTGRGTRATVPIH----EEDVLA-----SVPRQLIL 63
Query: 135 TLERVLGNETIAELLTTNKLSELA----CLALYLMYEKKQGKKSFWLPYIREL------- 183
+ + + ++L + + LA LAL L+YEK G SFW PY L
Sbjct: 64 SAVTATDSRVVRDVLPAVEGTILADPMVALALLLLYEKNLGPASFWAPYFHILLLPPLCR 123
Query: 184 -------DRQRGRGQ-----LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+Q R + + P+ WS +L L + T +IL + ++ L
Sbjct: 124 FVPLLAWWQQTKRNKPVQWIVYYNLPIFWSSEDLVLLEEAHT--DILPHSRNMRTSILRL 181
Query: 232 DTVWFMA-------GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
+ + S+F+ YP T + AF + S L A+
Sbjct: 182 YFGFLLPLFHLLIFISIFKDYPDMFSPAVHTCDELMWAFATIWSRGYWLDGDDTMP--AI 239
Query: 285 VPLGPPLLAYSSKCKAMLA-----AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
VPL L + K +A A +++ Y+ G+ ++ G + N L +Y
Sbjct: 240 VPLADMLNHNTEKGGERVAHYFYDADAQIFKVISKTSYEPGQQVLTHYGNKANGNFLEDY 299
Query: 340 GFVDEDN 346
GFV +N
Sbjct: 300 GFVYMNN 306
>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 247
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 25/39 (64%)
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+ R + AGE +++ GP+ N LL YGFV++DNP D
Sbjct: 40 VTTQRGWTAGEQVLISYGPRSNDHLLRRYGFVEQDNPND 78
>gi|255584095|ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
Length = 510
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A+ DL+ G+ VP S ++T + L + + + + LS L + L+YE +G
Sbjct: 57 LGAARDLKKGELVLRVPKSALLTKDSFLKDGLLLSAINNHSALSPTQTLTVCLLYEMSKG 116
Query: 172 KKSFWLPYIRELDR 185
+ SFW PY+ L R
Sbjct: 117 QSSFWYPYLMHLPR 130
>gi|52545671|emb|CAH56365.1| hypothetical protein [Homo sapiens]
Length = 380
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 25/105 (23%), Positives = 54/105 (51%), Gaps = 1/105 (0%)
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
++FT+E ++ A +V + + +R AL+PL + DD +
Sbjct: 23 DSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 82
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 83 CVALQDFRAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 127
>gi|345795412|ref|XP_544872.3| PREDICTED: SET domain-containing protein 4 [Canis lupus familiaris]
Length = 440
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 68/316 (21%), Positives = 119/316 (37%), Gaps = 35/316 (11%)
Query: 41 GSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEK 100
G +RR+ F VS R V + + +LK W+ +I
Sbjct: 5 GGRTSRIRRRKLFRCSVS---------RGVNESYKPEFIELKKWLKDRKFEDTNLIPACF 55
Query: 101 PSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELAC 159
P + + L+ G S+P S ++T + V+ + + S L
Sbjct: 56 PGTGRG------LMSKTSLREGQMIISLPESCLITTDTVIRSYLGTYIAKWQPPPSPLLA 109
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L +L+ EK G +S W PY+ L + A P+ E E+ L P KA+ E
Sbjct: 110 LCTFLVSEKHAGDQSLWKPYLEILPQ-------AYTCPVC-LEPEVVNLFPKPLKAKAEE 161
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKV 276
+ ++ ++ + LF + I F++ A+ V + V H Q+
Sbjct: 162 QRARVQEFFSSSRDFFSSLQPLFSEAVESI----FSYRALLWAWCTVNTRAVYVKHRQRQ 217
Query: 277 SLARRFALVPLGP--PLLAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPN 332
+ L P LL +S + + A ++ ++ + E + + GP N
Sbjct: 218 CFSTEPNTYALAPYLDLLNHSPEVQVKGAFNEETRCYEIRTASNCRKHEEVFICYGPHDN 277
Query: 333 SKLLINYGFVDEDNPY 348
+LL+ YGFV NP+
Sbjct: 278 QRLLLEYGFVSIHNPH 293
>gi|359476494|ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
Length = 504
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
+AA+ DL G+ +VP S ++T + +L +E ++ + + LS L + L+ E +G
Sbjct: 51 LAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKG 110
Query: 172 KKSFWLPYIRELDR 185
K S+W PY+ +L R
Sbjct: 111 KSSWWHPYLMQLPR 124
>gi|297738159|emb|CBI27360.3| unnamed protein product [Vitis vinifera]
Length = 449
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQG 171
+AA+ DL G+ +VP S ++T + +L +E ++ + + LS L + L+ E +G
Sbjct: 51 LAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLAEMSKG 110
Query: 172 KKSFWLPYIRELDR 185
K S+W PY+ +L R
Sbjct: 111 KSSWWHPYLMQLPR 124
>gi|159131477|gb|EDP56590.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 490
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 66/297 (22%), Positives = 123/297 (41%), Gaps = 49/297 (16%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDL-QAGDAA--FSVPNSLVVTLE 137
L SW NG+ + ++ S + + VA +E + G+A +VP+ L +TLE
Sbjct: 11 LSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLALTLE 70
Query: 138 RVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWLP 178
V + + + + T + + L L + + + +K G + W
Sbjct: 71 YVHNHAKIDRHLREVLDAVGDFGRTARGAILIFLIIQITHASPDFVNKRQKIGISNPWTE 130
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DT 233
YIR + +V P +S E L G+ + + + +++E++ L +
Sbjct: 131 YIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQATEEI 183
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPP 290
W Q++ +D T FTF+ +K +S VV L + A+VP +
Sbjct: 184 PWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----HAIVPCVDMANH 233
Query: 291 LLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDED 345
S K K +AV QL + + GE + + G + P S+++ +YGFV+ +
Sbjct: 234 ACEDSVKAKYDEEGAGNAVLQLRTGKKLRVGEEVTISYGDEKPASEMVFSYGFVENE 290
>gi|443733230|gb|ELU17670.1| hypothetical protein CAPTEDRAFT_97123, partial [Capitella teleta]
Length = 199
Score = 42.7 bits (99), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 53/205 (25%), Positives = 89/205 (43%), Gaps = 30/205 (14%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L ++L+ E+ +G SFW PY+ L + L W+ E+ L TK +
Sbjct: 1 LVIFLLCERNKGCSSFWKPYVDILPS-------SYTDILHWTSKEMDLLPKF-TKRRACD 52
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL---QKV 276
+ +N L + L +Q P AFT+++FK A+ +V + V++ Q
Sbjct: 53 LRLKAEESFNRLCNGFLPL--LVRQMPQF--NGAFTWDLFKWAWSSVNTRCVYMSQPQNS 108
Query: 277 SLA----RRFALVPLGPPLLAYSSKCKAMLAAVDDA------VQLVVDRPYKAGESIVVW 326
L+ + AL P LL ++ + A DD+ L +PY + + +
Sbjct: 109 VLSPDEEDKSALAPFLD-LLNHTVDVEVN-ARFDDSSKSYKITTLTACKPY---DQVFIN 163
Query: 327 CGPQPNSKLLINYGFVDEDNPYDRL 351
GP N KLL+ YGF NP++ +
Sbjct: 164 YGPHSNEKLLLEYGFTLPCNPHNNI 188
>gi|384251962|gb|EIE25439.1| ResB-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 889
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 55/135 (40%), Gaps = 15/135 (11%)
Query: 79 GDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLER 138
G L+ W+ GLPP KV + + V E L +VP L++T +
Sbjct: 32 GSLEDWLTHRGLPPQKVAISHEIPEGRGLVATRRVRKHEKL------LNVPAQLLLTADV 85
Query: 139 VLGNETIAELLTTNKLSELACLALYLMYEKKQ--GKKSFWLPYIRELDRQRGRGQLAVES 196
L + LL + + + LA +L ++Q G K+ W Y+ L Q G
Sbjct: 86 ALQHSAYGGLLESCGVPAWSVLATFLAETRRQPEGDKNVWGQYVDALPSQTG-------C 138
Query: 197 PLLWSETELAYLTGS 211
L W+ E+ L G+
Sbjct: 139 VLEWASEEVDLLRGT 153
>gi|170588849|ref|XP_001899186.1| SET domain containing protein [Brugia malayi]
gi|158593399|gb|EDP31994.1| SET domain containing protein [Brugia malayi]
Length = 278
Score = 42.7 bits (99), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 54/250 (21%), Positives = 103/250 (41%), Gaps = 34/250 (13%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
A+ D + + S+P L++T + ++ L L + + EK+Q K
Sbjct: 35 ATTDFRENETIISIPVGLIITAGFIAEMPDYCDVFKRYCLKPFEALVYFFLVEKEQNSK- 93
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
W PY+ L P +S + + P R + ++ NEL
Sbjct: 94 -WTPYLEVL-------------PKSFSTPASLHPSLKPEDFPYCLRKQWYVQK-NELKI- 137
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKV------SLARRFALVPL 287
+++++ I + ++ F A+ V + C+ K+ + A+VPL
Sbjct: 138 ------MYEKF-VTILADNTIWDHFLWAWHIVNTRCIYRNNKLHPLIDNTEDDSLAIVPL 190
Query: 288 GPPLLAYS--SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
+L +S S+C A+ + + +++V RP + GE I + G N L I YGF +D
Sbjct: 191 -IDMLNHSNDSQCCAIWDSKFNLYKVIVTRPIRKGEQIFICYGSHTNGSLWIEYGFYLKD 249
Query: 346 NPYDRLVVEV 355
N D++ + +
Sbjct: 250 NICDKVEISL 259
>gi|363747293|ref|XP_003643967.1| PREDICTED: N-lysine methyltransferase SETD6-like [Gallus gallus]
Length = 447
Score = 42.7 bits (99), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 102/239 (42%), Gaps = 25/239 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE----RVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
A+ DL+ G+ FSVP S +++ R L ++ L + S L L L++E
Sbjct: 51 AAADLEPGELLFSVPRSALLSQHTCAIRALLHDAQESLQSQ---SGWVPLLLALLHEYTT 107
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
G S W PY + +++ P+ W E E + L G+ + + I+ EY+
Sbjct: 108 G-TSHWRPYFS-----LWQDFSSLDHPMFWPEEERVRLLQGTGIPEAVDKDLANIQLEYS 161
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKVSLARRFALVPL 287
+ + FM + +P E T E++KQ AFV S L++ + P+
Sbjct: 162 SI-ILPFM-----KSHPDIFDPELHTLELYKQLVAFVMAYSFQEPLEEEDEDEKGPNPPM 215
Query: 288 GPP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
P +L + + A L +++V +P G+ I G N +LL YGF +
Sbjct: 216 MVPVADILNHVANHNASLKYAPTCLRMVTTQPISKGQEIFNTYGQMANWQLLHMYGFAE 274
>gi|50303805|ref|XP_451849.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640981|emb|CAH02242.1| KLLA0B07161p [Kluyveromyces lactis]
Length = 553
Score = 42.7 bits (99), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 25/45 (55%)
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
D+ V +++ KAGE I G NS LL YGF EDNP+D
Sbjct: 319 TDECVDIILSNDVKAGEEIFNSYGDHSNSYLLARYGFCIEDNPHD 363
>gi|351701197|gb|EHB04116.1| SET domain-containing protein 3 [Heterocephalus glaber]
Length = 705
Score = 42.7 bits (99), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 25/105 (23%), Positives = 54/105 (51%), Gaps = 1/105 (0%)
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
++FT+E ++ A +V + + +R AL+PL + DD +
Sbjct: 346 DSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLITTGYNLEDDRCE 405
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
V + ++AGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 406 CVALQDFQAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 450
>gi|302896942|ref|XP_003047350.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
77-13-4]
gi|256728280|gb|EEU41637.1| hypothetical protein NECHADRAFT_106552 [Nectria haematococca mpVI
77-13-4]
Length = 471
Score = 42.7 bits (99), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 57/109 (52%), Gaps = 13/109 (11%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LE 219
A++L+ + G++SFW PYI+ L + A+ PLLW E++L +L G+ + + +
Sbjct: 111 AIFLVQQYLLGEQSFWYPYIQILPQPDDDKDSAI--PLLWPESDLLWLRGTHLEEAVSKQ 168
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS 268
+ + +KR W A Q+Y +D P++ FT E+ A+ S
Sbjct: 169 KVDHVKR--------WTEAMETLQKYGWD-PSQ-FTLELGLWAYYCFYS 207
>gi|400598098|gb|EJP65818.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 356
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 64/159 (40%), Gaps = 22/159 (13%)
Query: 197 PLLWSETELAYLTGSPTKAEILERAE-GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
P W L G T +LE+ + R++ L + YPY +P+E +
Sbjct: 104 PFFWPPEAQRLLPG--TARRLLEKQQSNFGRDWKHLQSA----------YPY-VPSEDYM 150
Query: 256 FEIFKQAFVAVQSCVVHLQKVSL---ARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLV 312
F V+ ++ Q+ L R A++P+ S CK A ++ +V
Sbjct: 151 HAWF---VVSSRAFYQETQQTLLYPWHDRLAMLPVADLFNHASVGCKVSYCA--ESYDIV 205
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
DR Y G+ + G N LL YGF+ ++N DR
Sbjct: 206 ADREYGTGDEVCTCYGEHSNDFLLAEYGFLLQNNTNDRF 244
>gi|116197927|ref|XP_001224775.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
gi|88178398|gb|EAQ85866.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
Length = 555
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 107/249 (42%), Gaps = 45/249 (18%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+ ++ T+E + + +L + + LS LA Y+++ +
Sbjct: 121 FKKGERILTIPSGILWTVEHAYADPLVGPVLRSARPPLSVEDTLATYILFIRS------- 173
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSP--TKAEILERAEGIKREYN 229
RE R +A S + ++E EL G+ T + L+R+ I+ +Y
Sbjct: 174 ----RESGYDGLRSHVAAFPTSYPSSIFFAEEELEVCAGTSLYTITKKLDRS--IEDDYR 227
Query: 230 ELDTVWFMAGS--LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHL-----QKVSLARRF 282
L V +A S LF P + F+ E +K A V S + + L F
Sbjct: 228 TL-VVRVLAQSRDLF-------PLDKFSIEDYKWALCTVWSRAMDFVLPDGNSIRLVAPF 279
Query: 283 ALVPLGPPLLAYSSKCK--AMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
A +L +SS+ + + A + ++ + Y+AG+ ++ G PNS+LL YG
Sbjct: 280 A------DMLNHSSEVEPCHIYDASSGNLSVLAGKDYEAGDQAFIYYGSIPNSRLLRLYG 333
Query: 341 FVDEDNPYD 349
FV NP D
Sbjct: 334 FVMPGNPND 342
>gi|347836900|emb|CCD51472.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
Length = 470
Score = 42.4 bits (98), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 57/234 (24%), Positives = 95/234 (40%), Gaps = 37/234 (15%)
Query: 126 FSVPNSLVVTLERVLGNETIAELLTTNKLSEL--ACLALY-LMYEKKQGKKSFWLPYIRE 182
FS+P S V L + L + +L+E + LAL ++ + Q S W PY+
Sbjct: 55 FSIPRSAV------LNAQNAKPLAISKRLAEKMPSWLALTSILMAEGQVDDSKWAPYLAI 108
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA-----------EGIKREYNEL 231
L Q + S + WS++ELA L S +I ++ +G++ E+
Sbjct: 109 LPEQ-------LNSLVFWSDSELAELQASAVVKKIGKQGAEDMFKTYITPQGLQHSSTEM 161
Query: 232 DTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLG 288
S+ Y +DIP + + A V +K L+ ++PL
Sbjct: 162 ---CHKVASVIMAYAFDIPDPSEGPTSGGKGEEAADDLVSDDGEDEKTILS----MIPLA 214
Query: 289 PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
L A + + A L ++ +++ +P GE I G P S LL YG+V
Sbjct: 215 DMLNADADRNNARLICDNEDLEMRAIKPIAKGEEIFNDYGQLPRSDLLRRYGYV 268
>gi|410970027|ref|XP_003991492.1| PREDICTED: SET domain-containing protein 4 [Felis catus]
Length = 440
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 57/239 (23%), Positives = 100/239 (41%), Gaps = 22/239 (9%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
LQ G S+P + ++T + V+ + A + S L L +L+ EK G +S W
Sbjct: 68 LQEGQVIISLPETCLLTTDTVIRSYLGAYIAKWRPPPSPLLALCTFLVSEKHAGDQSVWK 127
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + A P+ E E+ L P +A+ E+ ++ ++ +
Sbjct: 128 PYLEILPK-------AYTCPVC-LEPEVVNLFPKPLRAKAEEQRARVREFFSSSRGFFSS 179
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP----LGP--PL 291
LF + I F++ A+ V + V++ K R F+ P L P L
Sbjct: 180 LQPLFSEAVGSI----FSYRALLWAWCTVNTRAVYV-KPRRRRCFSAEPDTCALAPYLDL 234
Query: 292 LAYSSKCKAMLAAVDD--AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
L +S + A ++ ++ + E + + GP N +LL+ YGFV NP+
Sbjct: 235 LNHSPHVQVEAAFNEETRCYEIRTASSCRKHEEVFICYGPHDNQRLLLEYGFVSIHNPH 293
>gi|424512980|emb|CCO66564.1| predicted protein [Bathycoccus prasinos]
Length = 542
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 79/321 (24%), Positives = 121/321 (37%), Gaps = 66/321 (20%)
Query: 83 SWMHKNG-LPPCKVILKEKPSHNEKHRPIHY--VAASEDLQAGDAAFSVPNSLVVTLERV 139
+W KN L P + S EK Y V A+ D+ + D +P T+ V
Sbjct: 76 AWRVKNNILAPNVEVAYVGGSEKEKGGDDLYRGVKATSDIASEDDLVRLPRE--ATMLVV 133
Query: 140 LGNETIAELLTTNKLSELAC-------LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
G E E +N+L A +AL L+YEK G +S + YI +L +
Sbjct: 134 EGQENPHEEYISNELWAKAGDERWALRVALVLLYEKSLGSRSKFYEYIEQLPK------- 186
Query: 193 AVESPLLWSETE---LAYLTGSP-TKAEILE------------RAEGIKREYNELDTVWF 236
+ E+ W+E E L Y G K + LE R G+K E + +W
Sbjct: 187 SFENLGTWTEEEVRELQYSVGEKFAKEQRLENEKACELIQEYARDGGLKTIERE-EVIWA 245
Query: 237 M--------AGSLFQQYPYD---IPTEAFTFEIFKQAFVAVQS------CVVHL------ 273
+ +G + Q +P +F +F+ Q+ CV L
Sbjct: 246 LDVVRSRVFSGKIADQEALQRKLLPRALSVGTVFA-SFLTAQTTELKWLCVFALLALVVF 304
Query: 274 -----QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCG 328
V + L+PL + + K + L + YK GE +++ G
Sbjct: 305 DSTKENDVKTDTAYVLMPL-IDAFNHQTMLKTEFEFTNSEFALKSPKSYKKGEEVLISYG 363
Query: 329 PQPNSKLLINYGFVDEDNPYD 349
PN +LL+ YGFVD+ N D
Sbjct: 364 LMPNDELLLRYGFVDDQNVAD 384
>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
Length = 458
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 57/252 (22%), Positives = 102/252 (40%), Gaps = 49/252 (19%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
+ G+ ++P+ + T+E + + +L + + LS LA+Y+++ +
Sbjct: 33 FKQGERILTIPSGCLWTVEHAYADAVLGPVLRSAQPPLSVEDTLAIYILFVRS------- 85
Query: 177 LPYIRELDRQRGRGQLAV-----ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
RE R +A S + + + EL GS + + I+ +Y L
Sbjct: 86 ----RESGYDGLRSHVAALPASYSSSIFFEDDELEVCAGSSLYTITRQLEQRIEEDYRGL 141
Query: 232 DT-VWFMAGSLFQQYPYDIPTEAFTFEI--FKQAFVAVQSCVVHLQKVSLARRFALVPLG 288
V+ + LF P FT E +K A V S + ++P G
Sbjct: 142 VVRVFGLHLDLF-------PLNKFTIENVGYKWALCTVWSRAMDF----------VLPNG 184
Query: 289 PPL---------LAYSSKCKA--MLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
PL + +S + K + A + ++ + Y+A + + ++ GP PNS+LL
Sbjct: 185 NPLRLLAPFADMVNHSPEVKQCHVYDASSGNLSILAGKDYEAEDQVFIYYGPMPNSRLLR 244
Query: 338 NYGFVDEDNPYD 349
YGFV DNP D
Sbjct: 245 LYGFVIPDNPND 256
>gi|301763371|ref|XP_002917104.1| PREDICTED: SET domain-containing protein 4-like [Ailuropoda
melanoleuca]
Length = 440
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 57/238 (23%), Positives = 96/238 (40%), Gaps = 20/238 (8%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
L+ G S+P S ++T + V+ + A + S L L +L+ EK G +S W
Sbjct: 68 LREGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWK 127
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + A P+ E E+ L P KA+ E+ ++ ++ +
Sbjct: 128 PYLEILPK-------AYTCPVC-LEPEVVNLFPKPLKAKAEEQRARVQGFFSSSRDFFSS 179
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLL 292
LF + I F++ A+ V + V H Q+ + L P LL
Sbjct: 180 LQPLFSEAVESI----FSYSALLWAWCTVNTRAVYVKHRQEQCFSTEPNTCALAPYLDLL 235
Query: 293 AYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
+S + KA ++ + E + + GP N +LL+ YGFV NP+
Sbjct: 236 NHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFICYGPHDNQQLLLEYGFVSIQNPH 293
>gi|302835223|ref|XP_002949173.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
nagariensis]
gi|300265475|gb|EFJ49666.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
nagariensis]
Length = 593
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 65/258 (25%), Positives = 102/258 (39%), Gaps = 27/258 (10%)
Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERV----LGNETIAELLTTNKLSELACLALYL 164
P+ + A + GD VP L+++ E LG A L + S
Sbjct: 191 PLRGLRADTAVAPGDVVLHVPADLLISYETAKKSDLGKVLSALPLDLSDDSIALIWTCVE 250
Query: 165 MYEKKQGKKSFW--LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
+E + FW LP+ + + L S+ ++A L G+P + + RA
Sbjct: 251 RHEPEAPHAPFWAALPH-------------SFSTALSASQEDVALLEGTPLHGDAV-RAR 296
Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF 282
E E + F SL YP E F++E + A S + +Q S R
Sbjct: 297 QHLSEAFESSSPAFR--SLLGAYPDYFKPEWFSWESYLWAAELWYSYGIQVQFASGDIRT 354
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVD---DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
L P + + + VD +++ RP +AG + + GP N+KLL+ Y
Sbjct: 355 CLAPYLGLMNHHPLPHVVHFSKVDPETGCLRVRAFRPCEAGNQLFLSYGPYSNAKLLLFY 414
Query: 340 GFVDEDNPYD--RLVVEV 355
GF DNP D LV++V
Sbjct: 415 GFAVRDNPADEVELVLQV 432
>gi|367001244|ref|XP_003685357.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
gi|357523655|emb|CCE62923.1| hypothetical protein TPHA_0D02870 [Tetrapisispora phaffii CBS 4417]
Length = 495
Score = 42.4 bits (98), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 61/263 (23%), Positives = 103/263 (39%), Gaps = 53/263 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER-----------VLGNETIAELLTTNKLSELACLA 161
V ASE ++ + F +P ++ ++ + G I E+ L + CL
Sbjct: 43 VVASEHIEKDEVLFEIPRDSILNVDTSELFKNHYEGYIDGKTVIEEIGLWETL--ILCL- 99
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER- 220
Y M+ KK+ +SFW Y L + L + W + EL L S ILER
Sbjct: 100 FYEMFVKKE--ESFWSQYFAVLPKATDFNTL-----MYWEDRELENLKPSF----ILERI 148
Query: 221 ------------AEGIKREYNELDTVWF------MAGSLFQQYPYDIPTEAFTFEIFKQA 262
E +++ + ++T F + S+ Y +DI E +
Sbjct: 149 GKDKSVAMHEKLMEFVEKNLDVIETSSFTWDRFLLVASIIMAYSFDI-------ERGECD 201
Query: 263 FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGES 322
+ + SL + +++PL L A + +C A L +++ +P KA E
Sbjct: 202 ADEEEEEEEEDIERSLIK--SMIPLADTLNADTKRCNANLIYDSGVLKMCAIKPIKANEQ 259
Query: 323 IVVWCGPQPNSKLLINYGFVDED 345
I G N +LL YG+V+ D
Sbjct: 260 IYNTYGNHANFELLRRYGYVEVD 282
>gi|298708218|emb|CBJ30557.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 493
Score = 42.4 bits (98), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 48/199 (24%), Positives = 84/199 (42%), Gaps = 25/199 (12%)
Query: 158 ACLA-LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAE 216
ACL L L++E+ G+ S + Y+ L + PL W+E E+ L G T AE
Sbjct: 133 ACLTVLRLLHERGLGESSPFHSYLSVLPQDH-------RLPLEWTEAEVGLLQG--TSAE 183
Query: 217 ILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV 276
L A + ++ +V Q+P T F + V+S +
Sbjct: 184 PLVGAGSLDSQFEAFQSV-------VAQHPTVWEPSVCTKAAFAKGVNWVRS-----RGF 231
Query: 277 SLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSK 334
++ ++P G + + +++ D V+ +P KAGE + G N++
Sbjct: 232 TVMGDPHMIP-GADMFNHDPNKQSVQIGTDGEEHFVMKTVQPVKAGEEVFSSFGHISNAQ 290
Query: 335 LLINYGFVDEDNPYDRLVV 353
LL +YGFV N +D +++
Sbjct: 291 LLNSYGFVLPGNSFDTVLI 309
>gi|149059901|gb|EDM10784.1| hypothetical protein RDA279, isoform CRA_d [Rattus norvegicus]
Length = 399
Score = 42.4 bits (98), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 101/241 (41%), Gaps = 26/241 (10%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ ++ + K +S L L +L+ E+ G S W
Sbjct: 27 LQEGQVIISLPESCLLTTDTVI-RSSVGPYIKKWKPPVSPLLALCTFLVSERHAGSHSLW 85
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E E+ L P +A+ E+ ++ + +
Sbjct: 86 KSYLDILPK-------SYTCPVCL-EPEVVDLLPGPLRAKAEEQRARVQDLFASSRDFFS 137
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + I F++ F A+ V + V+L+ + L+ L P L
Sbjct: 138 TLQPLFAESVDSI----FSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTCALAPFLDL 193
Query: 292 LAYSSKCKAMLAAVDDAVQL----VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
L +S + AA ++ + R K E+ + + GP N +LL+ YGFV NP
Sbjct: 194 LNHSPHVQVK-AAFNEKTRCYEIRTASRCRKHQEAFICY-GPHDNQRLLLEYGFVAFGNP 251
Query: 348 Y 348
+
Sbjct: 252 H 252
>gi|222640175|gb|EEE68307.1| hypothetical protein OsJ_26571 [Oryza sativa Japonica Group]
Length = 422
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 26/45 (57%)
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
V +++ + RP KAGE + G P S L+ YGF+ DNPYD
Sbjct: 315 VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYD 359
>gi|149059902|gb|EDM10785.1| hypothetical protein RDA279, isoform CRA_e [Rattus norvegicus]
Length = 475
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 101/241 (41%), Gaps = 26/241 (10%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ ++ + K +S L L +L+ E+ G S W
Sbjct: 103 LQEGQVIISLPESCLLTTDTVI-RSSVGPYIKKWKPPVSPLLALCTFLVSERHAGSHSLW 161
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E E+ L P +A+ E+ ++ + +
Sbjct: 162 KSYLDILPK-------SYTCPVCL-EPEVVDLLPGPLRAKAEEQRARVQDLFASSRDFFS 213
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + I F++ F A+ V + V+L+ + L+ L P L
Sbjct: 214 TLQPLFAESVDSI----FSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTCALAPFLDL 269
Query: 292 LAYSSKCKAMLAAVDDAVQL----VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
L +S + AA ++ + R K E+ + + GP N +LL+ YGFV NP
Sbjct: 270 LNHSPHVQVK-AAFNEKTRCYEIRTASRCRKHQEAFICY-GPHDNQRLLLEYGFVAFGNP 327
Query: 348 Y 348
+
Sbjct: 328 H 328
>gi|326913214|ref|XP_003202935.1| PREDICTED: SET domain-containing protein 4-like, partial [Meleagris
gallopavo]
Length = 241
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 48/100 (48%), Gaps = 9/100 (9%)
Query: 94 KVILKEKPSHNEKHRPIHY------VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE 147
K LK++ + RP + + + LQAG+ S+P +VT VL N + E
Sbjct: 36 KKWLKDRGFGDSSLRPAQFWGTGRGLMTTRALQAGELVISLPEKCLVTTNTVL-NSCLGE 94
Query: 148 LLTTNK--LSELACLALYLMYEKKQGKKSFWLPYIRELDR 185
+ K +S L L +L+ EK G+KS W PY+ L +
Sbjct: 95 YIMKWKPPVSPLIALCTFLIAEKHAGEKSLWKPYLDVLPK 134
>gi|190347905|gb|EDK40262.2| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
6260]
Length = 466
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 67/284 (23%), Positives = 115/284 (40%), Gaps = 59/284 (20%)
Query: 113 VAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAEL---------LTTNKLSE 156
V A++++ A + +P N+++ + R G E++ +L TT++ +E
Sbjct: 73 VYATQNVSAKETLVRIPHSFLMNTNTIIKHISRFNGKESVPDLGYSVSLPSEYTTDQWTE 132
Query: 157 LAC---------------LALYLMYEKKQGKKSFW------LPYIRELDRQRGRGQLAVE 195
L ALY+ EKK+ + SFW LP + ELD
Sbjct: 133 LYAKIPISKWLQLTAFQRTALYICLEKKRKENSFWCAFISSLPKLEELDF---------- 182
Query: 196 SPLLWS-ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEA 253
+P++W E+E LTGS A+ E R + + +V F + ++ +E
Sbjct: 183 APIVWEVESE---LTGSKA-ADFFELLPRSSRNHAKKVSVRFNEDYTAVSEFLTAAKSEP 238
Query: 254 FTFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAV 309
F A++ + S +++ + A F L P L KC + + +V
Sbjct: 239 LNKMEFLWAWMCINSRCLYMSFPSSKAEADNFTLAPYVDFLNHDCDEKCAIKIDSRGFSV 298
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
VD + AG+ ++ GP N LL Y F E N ++ L V
Sbjct: 299 ISCVD--HAAGQELLFSYGPHSNEFLLCEYAFTMETNKWNNLDV 340
>gi|303272215|ref|XP_003055469.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463443|gb|EEH60721.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 468
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 52/119 (43%), Gaps = 10/119 (8%)
Query: 144 TIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSET 203
++A+ L +L L + +M E+ G +S W Y L RG L P+ W+E
Sbjct: 85 SVAKELRDARLGGGLALNVAVMVERALGSESRWRDYFAVLP-SRGERTL----PMFWTEA 139
Query: 204 ELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQA 262
L L G+ + E AE ++ +Y+E + L +P E TFE + +A
Sbjct: 140 RLEALKGTDLATHVREDAENLRADYDEE-----VVNGLCVAHPEKFRREELTFERYLEA 193
>gi|166091525|ref|NP_001107219.1| SET domain-containing protein 4 [Rattus norvegicus]
gi|165971256|gb|AAI58670.1| Setd4 protein [Rattus norvegicus]
Length = 439
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 101/241 (41%), Gaps = 26/241 (10%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK--LSELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T + V+ ++ + K +S L L +L+ E+ G S W
Sbjct: 67 LQEGQVIISLPESCLLTTDTVI-RSSVGPYIKKWKPPVSPLLALCTFLVSERHAGSHSLW 125
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E E+ L P +A+ E+ ++ + +
Sbjct: 126 KSYLDILPK-------SYTCPVCL-EPEVVDLLPGPLRAKAEEQRARVQDLFASSRDFFS 177
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ---KVSLARRFALVPLGP--PL 291
LF + I F++ F A+ V + V+L+ + L+ L P L
Sbjct: 178 TLQPLFAESVDSI----FSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTCALAPFLDL 233
Query: 292 LAYSSKCKAMLAAVDDAVQL----VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
L +S + AA ++ + R K E+ + + GP N +LL+ YGFV NP
Sbjct: 234 LNHSPHVQVK-AAFNEKTRCYEIRTASRCRKHQEAFICY-GPHDNQRLLLEYGFVAFGNP 291
Query: 348 Y 348
+
Sbjct: 292 H 292
>gi|336473420|gb|EGO61580.1| hypothetical protein NEUTE1DRAFT_58975 [Neurospora tetrasperma FGSC
2508]
gi|350293291|gb|EGZ74376.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 533
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 92/219 (42%), Gaps = 33/219 (15%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-L 218
L L LM+E QG S W PY+ L Q ++P+ W+E ELA L S A++
Sbjct: 131 LILILMHEYLQGSSSNWSPYLSILPHQ-------FDTPMFWTEAELAELQASALVAKVGK 183
Query: 219 ERAEGIKRE-----YNELDTVWFMAGSLFQQ-----------YPYDIPTEAFTFEIFKQA 262
+ A+ + R E + V++ AG+ Q + A+ F++ K+
Sbjct: 184 DEADKMIRTKIVKVVQENEDVFYPAGTPKTQRLDEGELLKLGHRMGSAIMAYAFDLAKEE 243
Query: 263 FVAVQSCV-----VHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPY 317
V + + +VP+ +L + A + + + R
Sbjct: 244 DDDEDEEEEEDGWVEDKIGGMNDTMGMVPMA-DMLNADAVFNAHINHGEACLTATSLREI 302
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNP-YDRLVVEV 355
K GE I+ + GP +++LL YG+V ++ YD VVEV
Sbjct: 303 KEGEEILNYYGPLSSAELLRRYGYVTPNHARYD--VVEV 339
>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
Length = 481
Score = 42.4 bits (98), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 64/313 (20%), Positives = 122/313 (38%), Gaps = 50/313 (15%)
Query: 58 SSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASE 117
S L S+ + + + +L W+ KV ++ K +E +R + AS+
Sbjct: 20 DSESELRTKSKRITYEDPDPYKNLIQWLKDGKAEVSKVSIEVK---SEGYRTLR---ASQ 73
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNETIA-ELLTTNKL-SELACLALYLMYEKKQGKKSF 175
++ G+ VP + ++LE V + I +++ N + + + + + ++ + + SF
Sbjct: 74 FIRQGEWVLFVPRTHYLSLEEVKKSCLINRKMIQLNYIPNNIQTYFVNHLLQENRRQNSF 133
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PYI L + P + + A L GSPT ++ + + + EY+ L
Sbjct: 134 WKPYIDVLPKD------VSGFPTNFDAEQDALLKGSPTLFTVMNQRKTFQEEYDNLKE-- 185
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRF-----------AL 284
A FQ+Y Y T+ F V + ++++R F L
Sbjct: 186 --AVKEFQRYGY-------TYNDF-----------VKFRTLTISRSFPVYIGENEQQQLL 225
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKAGESIVVWCGPQPNSKLLINYGFV 342
VPL + + + DA + R + GE + G N +NYGF
Sbjct: 226 VPLA-DFINHDNNGFLQYGYSPDADGFFMQAVRNIQKGEELFYNYGQWSNKYFFMNYGFA 284
Query: 343 DEDNPYDRLVVEV 355
NP ++ ++
Sbjct: 285 SLTNPMNQFDFDI 297
>gi|297608243|ref|NP_001061350.2| Os08g0244400 [Oryza sativa Japonica Group]
gi|255678277|dbj|BAF23264.2| Os08g0244400, partial [Oryza sativa Japonica Group]
Length = 195
Score = 42.0 bits (97), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 30/51 (58%)
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
V +++ + RP KAGE + G P S L+ YGF+ DNPYD + +++
Sbjct: 14 VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYDVIPLDL 64
>gi|281338852|gb|EFB14436.1| hypothetical protein PANDA_005285 [Ailuropoda melanoleuca]
Length = 415
Score = 42.0 bits (97), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 57/238 (23%), Positives = 96/238 (40%), Gaps = 20/238 (8%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWL 177
L+ G S+P S ++T + V+ + A + S L L +L+ EK G +S W
Sbjct: 44 LREGQMIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWK 103
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
PY+ L + A P+ E E+ L P KA+ E+ ++ ++ +
Sbjct: 104 PYLEILPK-------AYTCPVC-LEPEVVNLFPKPLKAKAEEQRARVQGFFSSSRDFFSS 155
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV---HLQKVSLARRFALVPLGP--PLL 292
LF + I F++ A+ V + V H Q+ + L P LL
Sbjct: 156 LQPLFSEAVESI----FSYSALLWAWCTVNTRAVYVKHRQEQCFSTEPNTCALAPYLDLL 211
Query: 293 AYSSKC--KAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
+S + KA ++ + E + + GP N +LL+ YGFV NP+
Sbjct: 212 NHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFICYGPHDNQQLLLEYGFVSIQNPH 269
>gi|403370373|gb|EJY85047.1| hypothetical protein OXYTRI_17100 [Oxytricha trifallax]
Length = 777
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 39/165 (23%), Positives = 74/165 (44%), Gaps = 31/165 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA-------ELLTTNKLSELACLALYLM 165
VAA + + +A +PN L++ +++ +E E T K S+ L ++
Sbjct: 112 VAAKKFIGPNEAYLYIPNKLIINEDKLYKSEYAQIFIDHPNEFKNTEK-SDQTSLIFFVA 170
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVES--PLLWSETELAYLTGSPTKAEILERAEG 223
E +G++S+W PY + A +S P W + + L + KAE+
Sbjct: 171 LELLKGEESYWHPYF----------ETAQDSDLPQFWEDQNIDELEDALIKAEL------ 214
Query: 224 IKREYNELDTV--WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
+ +++D + + +A + YP + E FT EI+K+A+ V
Sbjct: 215 ---QMHQVDFIGDYEIAHGIANHYPDLVHAEKFTIEIYKRAYNIV 256
>gi|156717956|ref|NP_001096520.1| N-lysine methyltransferase setd6 [Xenopus (Silurana) tropicalis]
gi|325530258|sp|A4QNG5.1|SETD6_XENTR RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
domain-containing protein 6
gi|140832737|gb|AAI35641.1| LOC100125156 protein [Xenopus (Silurana) tropicalis]
Length = 454
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 56/246 (22%), Positives = 98/246 (39%), Gaps = 37/246 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNET-IAELLTTNKLSELAC-----LALYLMYEK 168
A EDL G+ FS+P S +++ N T I +L+ + S +C L + L+YE
Sbjct: 55 AREDLSDGELLFSIPRSAILS-----QNTTRIRDLIEKEQDSLQSCSGWVPLLISLLYEA 109
Query: 169 KQGKKSFWLPYIR---ELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGI 224
S W PY ELD + P+ WSE E L G+ + + + I
Sbjct: 110 TDS-SSHWAPYFGLWPELD--------PPDMPMFWSEEEQTKLLQGTGILEAVHKDLKNI 160
Query: 225 KREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
++EYN + ++ P T +++K+ V + +
Sbjct: 161 EKEYNSI------VLPFIRRNPEKFCPMKHTLDLYKRLVAFVMAYSFQEPQEEDEEEDIE 214
Query: 285 VPLGPP-------LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLI 337
+ PP LL + ++ A L + ++++ + AG+ + G N +LL
Sbjct: 215 KDILPPMMVPVADLLNHVAQHNAHLEFTPECLRMITTKSVCAGQELFNTYGQMANWQLLH 274
Query: 338 NYGFVD 343
YGF +
Sbjct: 275 MYGFAE 280
>gi|50303389|ref|XP_451636.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640768|emb|CAH02029.1| KLLA0B02354p [Kluyveromyces lactis]
Length = 594
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 63/270 (23%), Positives = 111/270 (41%), Gaps = 53/270 (19%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK---------LSELACLALYLM 165
AS+D++ A F +P+ L++T + L N+ + + + + LS+L ++
Sbjct: 36 ASDDIEQ--AVFQLPSHLIIT--KDLSNKHFRDQVKSKEHHNTWLKLFLSKLKFSDEMII 91
Query: 166 YEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIK 225
+ + K F PYI L Q V+SPL W+ +ELA L G+ + E+ + I
Sbjct: 92 LDNENITKLF-RPYIMALPSQ-------VDSPLGWNPSELALLNGTNLYTSLKEKLQSIY 143
Query: 226 REYNELDTVW-FMAGSLFQQYPYDIPT-------EAFTFEIFKQAFVAVQSCVVHL--QK 275
E+ W + G+ FQ Y+I E T ++F + + S L
Sbjct: 144 DEW------WNVIEGTSFQTRQYNIEQLSIDEIYEQITSKVFSEKILDFFSFPAFLWSHL 197
Query: 276 VSLARRFALVPLGP-------------PLLAYSSKCKAMLAAVDDAVQLVVDR--PYKAG 320
+ +R F + P LL + ++ K + + + ++ P G
Sbjct: 198 MFTSRAFPERVINPYCDEYNVILLPVLDLLNHENRSKIQWSCSSEG-SFIFEKLEPVSKG 256
Query: 321 ESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
I G + N +LL YGFV + N +D
Sbjct: 257 TEICNNYGAKGNEELLYGYGFVVDGNEFDN 286
>gi|367016539|ref|XP_003682768.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
gi|359750431|emb|CCE93557.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
Length = 573
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 29/59 (49%)
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
LL + + K +D V V K GE + G + N LL++YGFV + NPYD
Sbjct: 229 LLNHKNDTKVKWTFTNDNVCFVSQEIMKEGEEVFNNYGEKSNEDLLLSYGFVQDQNPYD 287
>gi|70995934|ref|XP_752722.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66850357|gb|EAL90684.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 490
Score = 42.0 bits (97), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 65/297 (21%), Positives = 123/297 (41%), Gaps = 49/297 (16%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDL-QAGDAA--FSVPNSLVVTLE 137
L SW NG+ + ++ S + + VA +E + G+A +VP+ L +TLE
Sbjct: 11 LSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLALTLE 70
Query: 138 RVLGN-----------ETIAELLTTNKLSELACLALYLMY--------EKKQGKKSFWLP 178
V + + + + T + + L L + + + +K G + W
Sbjct: 71 YVHNHAKIDRHLREVLDAVGDFGRTARGAILIFLIIQITHASPDFVNKRQKIGISNPWTE 130
Query: 179 YIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DT 233
YIR + +V P +S E L G+ + + + +++E++ L +
Sbjct: 131 YIRFM-------PASVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQATEEI 183
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPP 290
W Q++ +D T FTF+ +K +S VV L + A+VP +
Sbjct: 184 PWC------QEHWWDEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----HAIVPCVDMANH 233
Query: 291 LLAYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDED 345
S K + +AV QL + + GE + + G + P S+++ +YGFV+ +
Sbjct: 234 ACEDSVKARYDEEGAGNAVLQLRTGKKLRVGEEVTISYGDEKPASEMVFSYGFVENE 290
>gi|363747032|ref|XP_003643892.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
[Gallus gallus]
Length = 283
Score = 42.0 bits (97), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 42/182 (23%), Positives = 87/182 (47%), Gaps = 23/182 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY- 217
Query: 230 ELDTVWFMAGSLFQQYPY--DIP-TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALV 285
+F + Q +P +P ++FT++ ++ A +V + + +R AL+
Sbjct: 218 ----AYFY--KVIQTHPNASKLPLKDSFTYDDYRWAVSSVMTRQNQIPTEDGSRVTLALI 271
Query: 286 PL 287
PL
Sbjct: 272 PL 273
>gi|145344497|ref|XP_001416768.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576994|gb|ABO95061.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 514
Score = 42.0 bits (97), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 59/246 (23%), Positives = 103/246 (41%), Gaps = 27/246 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK-KQG 171
VA + ++ AG+ VP + + + + + S A LA +++ E G
Sbjct: 85 VATTRNVSAGELLAEVPLEKCLCAASARMDARLWRAIGASGASGDAILAAHVLREAFDAG 144
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA--EGIKREYN 229
KS + P++R L R V+S + W+E EL+ L+GS + RA + EY+
Sbjct: 145 SKSAYWPWLRLLPRD-------VDSTVGWNEDELSELSGS--NVVVFTRAIKAQWRMEYD 195
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEA---FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
LD +L +++P E +TF+ F A + S + L S +
Sbjct: 196 ALDV-----PTLGEKFPDVFGGERAAHYTFDKFTWARFIIWSRAIDLSTESA--EAPTIR 248
Query: 287 LGPPLL-----AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
+ PLL A K + A +AV++ ++ + +P+ L+ YGF
Sbjct: 249 VLVPLLDMANHAPGGKLRPEWDARSNAVKVYAASAFREHTELRFNYDTKPSQYFLLQYGF 308
Query: 342 VDEDNP 347
+ E NP
Sbjct: 309 IPETNP 314
>gi|357122881|ref|XP_003563142.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Brachypodium
distachyon]
Length = 480
Score = 42.0 bits (97), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 64/281 (22%), Positives = 114/281 (40%), Gaps = 60/281 (21%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTN--KLSELACLALYLMYEKKQ 170
AA+ DL+ G+ VP + ++T +RV+ ++ IA + +LS + L + L+ E +
Sbjct: 45 AAARDLRRGELVLRVPRAALLTSDRVMADDPEIASCIAARHPRLSSVQRLIVCLLAEVGK 104
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI-KREYN 229
GK S W Y+ +L T LA +A ++ A I ++ +
Sbjct: 105 GKSSSWYLYLSQLPSYY---------------TVLATFNDFEIEALQVDDAIWIAQKSLS 149
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGP 289
+ + W A L Q + + F+ + AF V S +H V+ L P+G
Sbjct: 150 AIRSEWEDATPLMQGLKF--KPKLLIFKTWLWAFATVSSRTLH---VAWDDAGCLCPVG- 203
Query: 290 PLLAYS----------------SKCKA---MLAAV----------------DDAVQLVVD 314
L Y+ +KC+ ML V +A L
Sbjct: 204 DLFNYAAPDDDISSEEENREEVTKCQQKNEMLEEVKFGRSSERLSDGGYEDSEAYCLYAR 263
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ Y GE +++ G N +LL +YGF+ +NP ++ +++
Sbjct: 264 KCYTKGEQVLLGYGTYTNLELLEHYGFLLAENPNEKTYIQL 304
>gi|218189844|gb|EEC72271.1| hypothetical protein OsI_05430 [Oryza sativa Indica Group]
Length = 1243
Score = 42.0 bits (97), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 54/114 (47%), Gaps = 8/114 (7%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
AS+ +Q GD VP + +TL+++ L + + + + LA L+ E+ G +S
Sbjct: 66 ASKPIQEGDCIMQVPYHVQLTLDKLPQKFNT---LLDHAVGDTSKLAALLIMEQHLGNES 122
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
W PYI+ L + + + +LW EL + S E +E E K+E+
Sbjct: 123 GWAPYIKSLPTKD-----QMHNMVLWDLNELHAVQNSSIYDEAIEHKEQAKKEF 171
>gi|365982325|ref|XP_003667996.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
gi|343766762|emb|CCD22753.1| hypothetical protein NDAI_0A05980 [Naumovozyma dairenensis CBS 421]
Length = 573
Score = 42.0 bits (97), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 28/54 (51%)
Query: 296 SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
SK + L DD V +V R GE I + GP PN+ LL GF DNP+D
Sbjct: 335 SKPEEELNNPDDYVDIVTTRGILKGEEIFISYGPLPNAFLLAKCGFTMADNPFD 388
>gi|328864871|gb|EGG13257.1| hypothetical protein DFA_11018 [Dictyostelium fasciculatum]
Length = 1658
Score = 42.0 bits (97), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 53/245 (21%), Positives = 97/245 (39%), Gaps = 19/245 (7%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL--TTNKLSELACLALYLMYEKKQ 170
V ++ ++ + SVP ++ ++ + + +L L++ L L+++YEK +
Sbjct: 1212 VVTTKKVEENECVVSVPRKFLINVDCARKHPVLNSILFEEATGLNDDTILFLFVIYEK-E 1270
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
SFW P+ L + + ++ TEL L G+ + E IK
Sbjct: 1271 NPNSFWRPFFDTLPS-------YFPTSIHYTTTELLELEGT----NLFEETIQIKEHLES 1319
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPP 290
+ + F L QYP P FT E F A S + L K+ LVP+
Sbjct: 1320 IRELLF--PELSNQYPDVFPESLFTMENFLWARSLFDSRAIQL-KIDGRIVNCLVPMADM 1376
Query: 291 LLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
+ + + +D +++ A I + G + +L + YGFV +N Y
Sbjct: 1377 INHHDQAQISQRYFDQENDCFRMISCCNIPATSQIFLQYGALQSWELALYYGFVISNNHY 1436
Query: 349 DRLVV 353
D + +
Sbjct: 1437 DSVHI 1441
>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
dendrobatidis JAM81]
Length = 648
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 68/306 (22%), Positives = 118/306 (38%), Gaps = 51/306 (16%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
L W +G V +KE S +++ + AS+D+ +P++++++ V
Sbjct: 36 LVDWGRMHGANIENVEIKETASDDDR-KLTRGAYASKDIPPNSEICFIPSTILLSESDVR 94
Query: 141 GNETIAELLT--------TNKLSE---------LACLALYLMYEKKQ-GKKSFWLPYIRE 182
+E +LT K+S+ L +A +++++ S WLPY+
Sbjct: 95 ASEIGKAILTYIDEHQDAKQKISDKIKHPHAEILLAMAAFIVHQVSLPTADSHWLPYLAS 154
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL-ERAEGIKREYNELDTVWFMAGSL 241
L + PL+W+ + L G + ++ ER E I+ N V G
Sbjct: 155 LPKNYAL-------PLMWTRDRIQNLLGGTSLLYMMIERLEWIQ---NSTKVVENACGHY 204
Query: 242 FQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR------------FALVPLGP 289
F PT A T + + A ++ S K SL + + + L P
Sbjct: 205 F-------PTGALTVQSMQWATCSIWSRAFPKAKPSLDLQDGSHQDVQDWIGLSEICLFP 257
Query: 290 PLLAYSSK--CKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
L ++ K + + V + G ++ GP+ N LL NYGFV E+NP
Sbjct: 258 ILDMFNHKRGYRVEWRMTEKGVSFITPDGICKGSELLNNYGPKGNENLLSNYGFVIENNP 317
Query: 348 YDRLVV 353
D V
Sbjct: 318 EDYFKV 323
>gi|327259114|ref|XP_003214383.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
carolinensis]
Length = 311
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 16/50 (32%), Positives = 33/50 (66%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
DD + V + +KAGE I ++ G + N++ +I+ GF ++N +DR+ +++
Sbjct: 9 DDRCECVALQDFKAGEQIYIFYGTRSNAEFVIHSGFFFDNNSHDRVKIKL 58
>gi|195565510|ref|XP_002106342.1| GD16174 [Drosophila simulans]
gi|194203718|gb|EDX17294.1| GD16174 [Drosophila simulans]
Length = 395
Score = 42.0 bits (97), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 60/270 (22%), Positives = 102/270 (37%), Gaps = 56/270 (20%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE-----RVLGNETIAELLTTNKLSELACLALYLMYEKK 169
A+ L + SVP L+ + E R+ G T A L LA L+ EK
Sbjct: 59 ATRPLAKDELVLSVPRKLIFSEESNSDCRLFGKMTQATHLN---------LAYDLVIEKI 109
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
+G+ S W PYI L + + L ++ ++ L G+ + L + I ++Y
Sbjct: 110 RGEFSEWRPYIDVLPAK-------YSTVLYFTTKQMELLRGTAAASLALRQCRVIAKQYA 162
Query: 230 ELDTVWFMA--------------GSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQK 275
L G F Q+ +E+++ A S V+ Q
Sbjct: 163 FLYRYAHTMTEPSTGNRSHPGERGLFFTQH-------GLCYELYRWAV----STVMTRQN 211
Query: 276 VSLARR----------FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV 325
+ + + AL+P K + AAV ++ AGE +
Sbjct: 212 LVPSEKQESEDTPKLISALIPYWDMANHRPGKITSFYAAVPRQLECTAQEAVDAGEQFFI 271
Query: 326 WCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ G + N+ LL++ GFVD++N D + + V
Sbjct: 272 YYGDRSNTDLLVHNGFVDDNNLKDYVNIRV 301
>gi|159471213|ref|XP_001693751.1| transcription factor, E2F and DP-related [Chlamydomonas
reinhardtii]
gi|158283254|gb|EDP09005.1| transcription factor, E2F and DP-related [Chlamydomonas
reinhardtii]
Length = 656
Score = 42.0 bits (97), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 6/149 (4%)
Query: 206 AYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVA 265
A L GSP AE + + + + SL + YP F++E + A
Sbjct: 164 AALAGSPLAAEAGQARRHLAEAFAASQPAF---ESLLKAYPDYFQPHWFSWESYLWAAEL 220
Query: 266 VQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDA---VQLVVDRPYKAGES 322
S + +Q + R LVP + + + VD A +++ RP G
Sbjct: 221 WYSYGIQVQVAAGDIRTCLVPYLGLMNHHPLPHVVHFSKVDPASRGLRVRAFRPCARGRQ 280
Query: 323 IVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
+ + GP PNSKLL+ YGF DNP D +
Sbjct: 281 LFLSYGPYPNSKLLLFYGFALPDNPVDEV 309
>gi|401624185|gb|EJS42251.1| set7p [Saccharomyces arboricola H-6]
Length = 494
Score = 42.0 bits (97), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
+++PL L A +SKC A L +++++ R + E + G PNS++L YG+V
Sbjct: 220 SMIPLADTLNADTSKCNANLTYDSGSLKMIAVRDIEIDEQVYNIYGEHPNSEILRRYGYV 279
Query: 343 DED-NPYD 349
+ D + YD
Sbjct: 280 EWDGSKYD 287
>gi|171684553|ref|XP_001907218.1| hypothetical protein [Podospora anserina S mat+]
gi|170942237|emb|CAP67889.1| unnamed protein product [Podospora anserina S mat+]
Length = 396
Score = 42.0 bits (97), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 20/81 (24%), Positives = 38/81 (46%), Gaps = 2/81 (2%)
Query: 274 QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
++++ + AL P+ L C+ + + DR YK GE + + G N
Sbjct: 194 ERLTKDDKMALQPVADLLNHSDEGCEVVFDT--GCYTISADREYKQGEEVYICYGTHSND 251
Query: 334 KLLINYGFVDEDNPYDRLVVE 354
L++ YGF E+N +D + ++
Sbjct: 252 FLMVEYGFCPEENKWDEVCID 272
>gi|320584053|gb|EFW98265.1| Nuclear protein that contains a SET-domain [Ogataea parapolymorpha
DL-1]
Length = 499
Score = 41.6 bits (96), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 88/220 (40%), Gaps = 31/220 (14%)
Query: 141 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW 200
GN+ + E L++ L L L YE G++S W Y+ L + S + W
Sbjct: 74 GNQEVLE-----TLNQWEALILCLAYEMMLGEESRWSSYLAVLPEK-------FNSLMFW 121
Query: 201 SETELAYLTGSPTKAEI-LERAE--------------GIKR--EYNELDTVWFMAGSLFQ 243
S EL L S I E+AE G K+ EY +D + + S+
Sbjct: 122 SSEELEKLKPSNVLQRIGREQAEQMYSKLVPEYCLRLGSKKLVEYLTIDR-FHVVASIIM 180
Query: 244 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLA 303
Y +D+ E + K + ++VPL L + ++ A L+
Sbjct: 181 SYSFDVDDPEDDPEDDEDEEEDFDEIEQECIKYDGYLK-SMVPLADTLNSNTNLVNANLS 239
Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
+DA+ + + K GE I G PNS++L YG+V+
Sbjct: 240 YENDALVMTATKDIKKGEQIYNIYGELPNSEILRKYGYVE 279
>gi|302829721|ref|XP_002946427.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
nagariensis]
gi|300268173|gb|EFJ52354.1| hypothetical protein VOLCADRAFT_86703 [Volvox carteri f.
nagariensis]
Length = 658
Score = 41.6 bits (96), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 40/90 (44%), Gaps = 17/90 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-----------------LS 155
+ A+ DLQ G+A VP L++T + +A L + L
Sbjct: 32 IVATRDLQPGEAVLRVPERLLLTTRSAARDPQLAAALQRHTERSRGVAAAPSCGGGCGLG 91
Query: 156 ELACLALYLMYEKKQGKKSFWLPYIRELDR 185
LA +L+ E +G +SFW PY+++L R
Sbjct: 92 PHQVLACHLLLEVSRGPQSFWWPYLKQLPR 121
Score = 37.7 bits (86), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 29/47 (61%), Gaps = 3/47 (6%)
Query: 306 DDAVQ---LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
D+A Q +VV RPY+ GE +++ G N +LL YGFV E N +D
Sbjct: 312 DEATQQYCIVVRRPYREGEQVMLCYGRYTNLELLEYYGFVLEGNLHD 358
>gi|302790237|ref|XP_002976886.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
gi|300155364|gb|EFJ21996.1| hypothetical protein SELMODRAFT_416932 [Selaginella moellendorffii]
Length = 177
Score = 41.6 bits (96), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 36/65 (55%), Gaps = 2/65 (3%)
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
L S CK + AV +++++ R KAG + G PN LL YGFV E+NP+D
Sbjct: 54 FLWASELCK--IDAVTNSLKVYSLRSCKAGMQCFISYGALPNIDLLCFYGFVLENNPFDT 111
Query: 351 LVVEV 355
+ VE+
Sbjct: 112 IPVEL 116
>gi|397642897|gb|EJK75526.1| hypothetical protein THAOC_02751 [Thalassiosira oceanica]
Length = 395
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 60/281 (21%), Positives = 110/281 (39%), Gaps = 40/281 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVL 140
K W G+ E+ ++K R I Y + +AG A VP L+++ + +
Sbjct: 82 FKYWASTMGIEKNDCFKLEE--QDKKQREI-YAMTTRSTEAGTAVLYVPEHLILSSSKAM 138
Query: 141 ----------GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRG 190
E +A + ++L E L L ++ E ++G S W ++ L R
Sbjct: 139 AELRTDGMAEAEEYLASVGAESQLREYY-LMLKVLLEYQKGSDSEWHKWLDALPRYYSNA 197
Query: 191 QLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP 250
+ +E L L K + ER Y+ + +V F+A + + +P D+
Sbjct: 198 -------VAMTEFCLTCLPPLMKKLAVEERDAQKLLSYDSIQSVPFLADDIKEGFPRDMV 250
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-RFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
T A+ +V+ + V ++P+G ++S ++ D+A
Sbjct: 251 TWAYQ--------------IVYTRSVETEDGDLKIIPMG-DFFDHASDYAEIVPQYDEAG 295
Query: 310 QLVVDRPYK--AGESI-VVWCGPQPNSKLLINYGFVDEDNP 347
Y AG+ + ++ P+ S LL YGF+DE P
Sbjct: 296 NYYAVTAYDVPAGKKLRYIYSNPRNPSHLLARYGFIDEICP 336
>gi|449472508|ref|XP_002187588.2| PREDICTED: N-lysine methyltransferase SETD6 [Taeniopygia guttata]
Length = 383
Score = 41.6 bits (96), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 58/243 (23%), Positives = 104/243 (42%), Gaps = 26/243 (10%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT-NKLSELACLALYLMYEKKQGKK 173
A+E+L+AG+ F++P + +++ + + E + S L L L++E
Sbjct: 3 AAEELEAGEVLFTIPRTALLSQHTTSIHALLQEAQESLQSQSGWVPLLLALLHEYT-ASN 61
Query: 174 SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL---TGSPTKAEILERAEGIKREYNE 230
S W PY R +++ P+ W + E L TG P + + I+ EYN
Sbjct: 62 SHWQPYFSLWQDFR-----SLDHPMFWPQEERTRLLQGTGIPEAVD--KDLANIQLEYNS 114
Query: 231 LDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKVSLARRFALVPLG 288
+ + FM + +P + T E++K+ AFV S L++ + P+
Sbjct: 115 I-ILPFM-----ETHPDIFDPKLHTLELYKELVAFVMAYSFQEPLEEEEEDEKGPNPPMM 168
Query: 289 PP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDED 345
P +L + + A L +++V +P + G+ I G N +LL YGF +
Sbjct: 169 VPVADILNHVANHNANLEYSPQCLRMVTTQPVRKGQEIFNTYGQMANWQLLHMYGFAE-- 226
Query: 346 NPY 348
PY
Sbjct: 227 -PY 228
>gi|158508540|ref|NP_001025734.2| N-lysine methyltransferase SETD6 [Gallus gallus]
Length = 447
Score = 41.6 bits (96), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 104/244 (42%), Gaps = 28/244 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE----RVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
A+ DL+ G+ FSVP S +++ R L ++ L + S L L L++E
Sbjct: 51 AAADLEPGELLFSVPRSALLSQHTCAIRALLHDAQESLQSQ---SVWVPLLLALLHEYTT 107
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
G S W PY + +++ P+ W E E + L G+ + + I+ EY+
Sbjct: 108 G-TSRWRPYF-----SLWQDFSSLDHPMFWPEEERVRLLQGTGIPEAVDKDLANIQLEYS 161
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKVSLARRFALVPL 287
+ + FM + +P E T E++KQ AFV S L++ + P+
Sbjct: 162 SI-ILPFM-----KSHPDIFDPELHTLELYKQLVAFVMAYSFQEPLEEEDEDEKGPNPPM 215
Query: 288 GPP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
P +L + + A L +++V +P G+ I G N +LL YGF +
Sbjct: 216 MVPVADILNHVANHNASLEYAPTCLRMVTTQPISKGQEIFNTYGQMANWQLLHMYGFAE- 274
Query: 345 DNPY 348
PY
Sbjct: 275 --PY 276
>gi|169847976|ref|XP_001830696.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
gi|116508170|gb|EAU91065.1| hypothetical protein CC1G_03233 [Coprinopsis cinerea okayama7#130]
Length = 496
Score = 41.6 bits (96), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 60/121 (49%), Gaps = 19/121 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVT-----LERVLGNETIAELLTTNKLSE-LACLALYLMYEK 168
A +DL G F++P +L ++ L + G E L KL + A L L +M+E
Sbjct: 42 ALKDLPEGHVLFTIPRALTLSTRTSRLPELFGLEEWKRL----KLHQGWAGLMLCMMWEA 97
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
QGK+S W Y+ L A ++P+ W+E +L+ L G+ ++ + E +R+Y
Sbjct: 98 AQGKESRWAGYLDIL-------PAAFDTPMFWNEEDLSELAGTSIVGKLGK--EDAERDY 148
Query: 229 N 229
+
Sbjct: 149 D 149
>gi|143584415|sp|Q5ZK17.2|SETD6_CHICK RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
domain-containing protein 6
Length = 447
Score = 41.6 bits (96), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 104/244 (42%), Gaps = 28/244 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLE----RVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
A+ DL+ G+ FSVP S +++ R L ++ L + S L L L++E
Sbjct: 51 AAADLEPGELLFSVPRSALLSQHTCAIRALLHDAQESLQSQ---SVWVPLLLALLHEYTT 107
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
G S W PY + +++ P+ W E E + L G+ + + I+ EY+
Sbjct: 108 G-TSRWRPYFS-----LWQDFSSLDHPMFWPEEERVRLLQGTGIPEAVDKDLANIQLEYS 161
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHLQKVSLARRFALVPL 287
+ + FM + +P E T E++KQ AFV S L++ + P+
Sbjct: 162 SI-ILPFM-----KSHPDIFDPELHTLELYKQLVAFVMAYSFQEPLEEEDEDEKGPNPPM 215
Query: 288 GPP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
P +L + + A L +++V +P G+ I G N +LL YGF +
Sbjct: 216 MVPVADILNHVANHNASLEYAPTCLRMVTTQPISKGQEIFNTYGQMANWQLLHMYGFAE- 274
Query: 345 DNPY 348
PY
Sbjct: 275 --PY 276
>gi|367042232|ref|XP_003651496.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
gi|346998758|gb|AEO65160.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
Length = 377
Score = 41.6 bits (96), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 41/167 (24%), Positives = 65/167 (38%), Gaps = 22/167 (13%)
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
LA PL WS L P +A L RA+ K + W + F
Sbjct: 102 ADLATALPLAWSSPVLHNYLPPPARA--LLRAQQAKFARD-----WAAVSAAFP------ 148
Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR-----RFALVPLGPPLLAYSSKCKAMLAA 304
A + F+ A++ + + + AR R L P+ L +++ +A
Sbjct: 149 ---ALAPDAFRHAWLLTNTRTFYHETARTARLPHDDRMVLQPVAD-LFNHAADGGCEVAF 204
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
+ + DR Y GE +++ G N LL+ YGFV E N +D +
Sbjct: 205 TPASFAITADRAYAEGEEVLICYGRHSNDFLLVEYGFVLEQNRWDEV 251
>gi|396469509|ref|XP_003838423.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
gi|312214991|emb|CBX94944.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
Length = 415
Score = 41.6 bits (96), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 36/75 (48%), Gaps = 2/75 (2%)
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSK 334
K++ A +A+ P S C+ A ++ DR Y+AGE + V GP N
Sbjct: 190 KLTSADCYAMCPFMDYFNHSDSGCEPQHNA--HGYSVLADRAYRAGEEVYVSYGPHTNDF 247
Query: 335 LLINYGFVDEDNPYD 349
LL+ YGF+ + N D
Sbjct: 248 LLVEYGFLLDANSND 262
>gi|297807745|ref|XP_002871756.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317593|gb|EFH48015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 41.6 bits (96), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 40/75 (53%), Gaps = 2/75 (2%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAE--LLTTNKLSELACLALYLMYEKKQ 170
+ A +L+ G+ VP + ++T E ++ + ++ LS L++ L+YE +
Sbjct: 54 LGAVRELKKGELVLKVPRNALMTTESMIAKDRKLNDAVILHGSLSSTQILSVCLLYEMGK 113
Query: 171 GKKSFWLPYIRELDR 185
GK+SFW PY+ L R
Sbjct: 114 GKRSFWYPYLVHLPR 128
>gi|168014081|ref|XP_001759585.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689124|gb|EDQ75497.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 340
Score = 41.6 bits (96), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 38/61 (62%), Gaps = 5/61 (8%)
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDR 350
LL +SS+ +++ +++V ++ + GE++V+ GP N LL++YGFV NP DR
Sbjct: 123 LLQHSSESQSL-----PVLEVVAEKDLEKGENVVLNYGPLSNDILLLDYGFVMPKNPNDR 177
Query: 351 L 351
+
Sbjct: 178 V 178
>gi|255637489|gb|ACU19071.1| unknown [Glycine max]
Length = 497
Score = 41.6 bits (96), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 1/72 (1%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSELACLALYLMYEKKQG 171
+ A DL+ G+ VP S ++T E V+ ++ + + + + LS L + L+YE +G
Sbjct: 55 LGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKG 114
Query: 172 KKSFWLPYIREL 183
K S W PY+ L
Sbjct: 115 KTSRWHPYLMHL 126
>gi|307103393|gb|EFN51653.1| hypothetical protein CHLNCDRAFT_139846 [Chlorella variabilis]
Length = 712
Score = 41.6 bits (96), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 23/35 (65%)
Query: 315 RPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
RP +AG+ + + GP PN KLL YGFV NP+D
Sbjct: 444 RPCQAGQQVFISYGPVPNLKLLCYYGFVVPHNPHD 478
>gi|218200748|gb|EEC83175.1| hypothetical protein OsI_28406 [Oryza sativa Indica Group]
Length = 319
Score = 41.6 bits (96), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 26/45 (57%)
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
V +++ + RP KAGE + G P S L+ YGF+ DNPYD
Sbjct: 179 VTKSLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYD 223
>gi|67538920|ref|XP_663234.1| hypothetical protein AN5630.2 [Aspergillus nidulans FGSC A4]
gi|40743533|gb|EAA62723.1| hypothetical protein AN5630.2 [Aspergillus nidulans FGSC A4]
gi|259484901|tpe|CBF81518.1| TPA: SET domain protein (AFU_orthologue; AFUA_4G11040) [Aspergillus
nidulans FGSC A4]
Length = 707
Score = 41.6 bits (96), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 92/243 (37%), Gaps = 28/243 (11%)
Query: 118 DLQAGDAAF-----SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
D AGDA F P+S + ERV G+E +A +L+ + +G
Sbjct: 78 DFHAGDAHFPAHDVKFPSSFI---ERV-GSEEVA--------------IFFLIGQYLRGP 119
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+SFW PYIR L + L E K ++ E + NEL
Sbjct: 120 ESFWHPYIRTLPQPGSLTTLPYYEEEEDLEWLEGTSLLQARKRKVALLREKYESSSNELR 179
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
F ++Y +D+ A T + + V S V+ ++ L+P +L
Sbjct: 180 ESGFQDA---ERYSWDLYLWASTIFVSRAFSEKVLSGVIPEHEMP-ENTSVLLPF-IDIL 234
Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
+ K A V VV E I GP+ N +L++NYGF +NP D
Sbjct: 235 NHRPLAKVEWRAGLQNVDFVVLEDVSVNEEIANNYGPRNNEQLMMNYGFCLANNPCDYRT 294
Query: 353 VEV 355
V +
Sbjct: 295 VSL 297
>gi|218200744|gb|EEC83171.1| hypothetical protein OsI_28399 [Oryza sativa Indica Group]
Length = 437
Score = 41.6 bits (96), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 25/42 (59%)
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
+++ + RP KAGE + G P S L+ YGF+ DNPYD
Sbjct: 300 SLKFPLSRPCKAGEQCFLSYGKHPGSHLITFYGFLPRDNPYD 341
>gi|402077770|gb|EJT73119.1| hypothetical protein GGTG_09969 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 377
Score = 41.2 bits (95), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 58/254 (22%), Positives = 93/254 (36%), Gaps = 48/254 (18%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLE-------RVLGNET-----IAELLTTNKLSELACL 160
+AA L+ G+ VP LV +L R L ++T +A LT N ++ELA
Sbjct: 33 MAAGRHLKEGEDILYVPTGLVRSLHTVPEHVSRKLPSDTSIHALLAADLTVNGMTELA-- 90
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILER 220
W + L + P +W + +L L P + E+LE
Sbjct: 91 --------------LWRDCLPTL------ADFSTGMPFMWHK-KLQELLPKPAR-ELLE- 127
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR 280
N+L + + +P D+ E + + + ++ A
Sbjct: 128 --------NQLGNFHRDWARVTKAFP-DLQQEDYLHNWLAVSTRSFYYWTPQMELYPPAD 178
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
R ALVP+ + C A D + DR Y G+ I + G N LL YG
Sbjct: 179 RLALVPIADLFNHADTGCGASFTP--DGFVVSTDRKYHVGQEIYISYGTHTNDLLLAEYG 236
Query: 341 FVDEDNPYDRLVVE 354
FV N +D+ ++
Sbjct: 237 FVPMANRWDKTCLD 250
>gi|323452617|gb|EGB08490.1| hypothetical protein AURANDRAFT_71532 [Aureococcus anophagefferens]
Length = 1114
Score = 41.2 bits (95), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 40/89 (44%), Gaps = 6/89 (6%)
Query: 123 DAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRE 182
DA + LV + ER + L E LAL L+YE+++G KS W P+I
Sbjct: 64 DAMLHARSPLVCSGEREANDARALGALLGKVTREDDALALRLLYERRKGAKSRWGPHIAL 123
Query: 183 LDRQRGRGQLAVESPLLWSETELAYLTGS 211
L + L WSE ELA L GS
Sbjct: 124 LP------ATPPHALLRWSEAELAELAGS 146
>gi|270005261|gb|EFA01709.1| hypothetical protein TcasGA2_TC007289 [Tribolium castaneum]
Length = 230
Score = 41.2 bits (95), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 20/68 (29%), Positives = 35/68 (51%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
+AL+PL + V D + + + +KAGE + ++ G + N+ L ++ GF
Sbjct: 14 YALIPLWDMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNADLFVHNGF 73
Query: 342 VDEDNPYD 349
V E+N YD
Sbjct: 74 VFENNDYD 81
>gi|145502426|ref|XP_001437191.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124404340|emb|CAK69794.1| unnamed protein product [Paramecium tetraurelia]
Length = 637
Score = 41.2 bits (95), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 64/132 (48%), Gaps = 23/132 (17%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA------ELLT---TNKLSELACLALY 163
V ASEDL + +P SL+++ ++ + I E+ TN+ +E L Y
Sbjct: 46 VVASEDLPSDTVIICIPQSLIISPDKCKQSTLITVYNSHPEMFDEEETNE-AEFNILTFY 104
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLA--------VESPLLWSETELAY--LTGSPT 213
+ EKK+G++SF+ PYI+ + Q +A +E PL+ E +L G +
Sbjct: 105 MFNEKKKGEQSFYYPYIQAI--QTSNTLMAWSNEDLQKIEDPLILEEFQLIKQDFLGLWS 162
Query: 214 KAE-ILERAEGI 224
KA+ I + A+ I
Sbjct: 163 KAKLIFDNAQDI 174
>gi|452986759|gb|EME86515.1| hypothetical protein MYCFIDRAFT_131111 [Pseudocercospora fijiensis
CIRAD86]
Length = 391
Score = 41.2 bits (95), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 24/92 (26%), Positives = 39/92 (42%), Gaps = 1/92 (1%)
Query: 257 EIFKQAFVAVQSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDR 315
++FK + V S H + F L P + S + ++ +R
Sbjct: 164 DVFKYYWAIVNSRSFHFKPPGAKPGFMVLCPFIDYMNHGPSGTGVNVRQTAKGYEVTANR 223
Query: 316 PYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
Y AGE ++ G PN KLL++YGF++ P
Sbjct: 224 DYVAGEEVLATYGAHPNDKLLVHYGFINSSKP 255
>gi|440792461|gb|ELR13682.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Acanthamoeba castellanii str. Neff]
Length = 400
Score = 41.2 bits (95), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 47/196 (23%), Positives = 83/196 (42%), Gaps = 23/196 (11%)
Query: 161 ALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE- 219
A+ + E +SFW PY+ EL AV + W++ ELA + + E++E
Sbjct: 49 AVLWLLESVNCAQSFWQPYLSELPD-------AVATVDRWNQEELAEVGHTLMLYEMVEY 101
Query: 220 RAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLA 279
+ + I +Y + + + Q + IP+E E +++A V S + L
Sbjct: 102 KKKKIAADYAAILLPFLQENT--QLFGGSIPSE----EEYRRALSLVYSRTFDFSE--LI 153
Query: 280 RRFALVPLGPPLLAYS------SKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNS 333
+P L +S + C D +L+ Y GE + + G + +S
Sbjct: 154 GEHVFIPF-VDFLNHSINDTGKAACTYSYNHDKDCFELLAGADYDEGEEVFISYGEKTSS 212
Query: 334 KLLINYGFVDEDNPYD 349
+LL +YGF+ E+N D
Sbjct: 213 QLLASYGFMYENNAED 228
>gi|336258546|ref|XP_003344085.1| hypothetical protein SMAC_09068 [Sordaria macrospora k-hell]
gi|380093059|emb|CCC09296.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 421
Score = 41.2 bits (95), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 27/47 (57%)
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
A + RPY AGE + + G N LLI YGF+ ++N +D + ++
Sbjct: 263 AFTITTTRPYSAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCID 309
>gi|345566622|gb|EGX49564.1| hypothetical protein AOL_s00078g53 [Arthrobotrys oligospora ATCC
24927]
Length = 611
Score = 41.2 bits (95), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 89/210 (42%), Gaps = 28/210 (13%)
Query: 158 ACLAL---YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
AC L L+ E+ Q FW PYIR L + ++PL +++ E+ L G+
Sbjct: 112 ACFHLSQHLLLKEQSQ----FW-PYIRLLPK-------TFDTPLYFNDDEMERLAGTNLG 159
Query: 215 A-EILERAEGIKREYNELDTVWFMAG---SLFQQYPYDIPTEA---FTFEIFKQAFVAV- 266
A ++L R + E+ F+ G ++Y +D+ A +T F V +
Sbjct: 160 AGDVLLRKQLWMEEWEAGKQ--FLEGVGAERAREYTWDLFLRAATIYTSRSFPSKLVGIT 217
Query: 267 -QSCVVHLQKVSLARRF-ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIV 324
S + +S F L+PL +L + K + + L+ G +
Sbjct: 218 MDSSIEENTMLSDDNGFPVLIPL-VDILNHKPNTKIIWEPTQTSFSLITPETISEGSQVF 276
Query: 325 VWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
GP+ N +LL+ YGFV +NP D L ++
Sbjct: 277 NNYGPKGNEELLMGYGFVIPENPGDSLAMK 306
>gi|148237199|ref|NP_001085404.1| N-lysine methyltransferase setd6 [Xenopus laevis]
gi|82184826|sp|Q6INM2.1|SETD6_XENLA RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
domain-containing protein 6
gi|48734800|gb|AAH72257.1| MGC82362 protein [Xenopus laevis]
Length = 455
Score = 41.2 bits (95), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 95/242 (39%), Gaps = 28/242 (11%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETI-AELLTTNKLSELACLALYLMYEKKQGKK 173
A ED+ G+ F+VP S +++ E + E + S L + L+YE
Sbjct: 55 AREDIADGELLFTVPRSAILSQNTTRIQELLEKEQESLQSTSGWVPLLISLLYEATDSS- 113
Query: 174 SFWLPYIR---ELDRQRGRGQLAVESPLLWSETE-LAYLTGSPTKAEILERAEGIKREYN 229
S W PY ELD + P+ WSE E L G+ I + I+ EYN
Sbjct: 114 SLWAPYFGLWPELD--------PPDMPMFWSEEEQTKLLQGTGVLEAIRNDLKNIEEEYN 165
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFV---AVQSCVVHLQKVSLARRFAL 284
+ + P T +++K+ AFV + Q + + + L
Sbjct: 166 SI------VLPFITRNPEKFCPMKHTLDLYKRLVAFVMAYSFQEPLEENDEEDEDEKDIL 219
Query: 285 VPLGPP---LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
P+ P LL + + A L + +++V + AG+ + G N +LL YGF
Sbjct: 220 PPMMVPVADLLNHVAHHNAHLEFTPECLRMVTTKSVHAGQELFNTYGEMANWQLLHMYGF 279
Query: 342 VD 343
+
Sbjct: 280 AE 281
>gi|4185151|gb|AAD08954.1| unknown protein [Arabidopsis thaliana]
gi|20197036|gb|AAM14885.1| unknown protein [Arabidopsis thaliana]
Length = 441
Score = 41.2 bits (95), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 64/152 (42%), Gaps = 15/152 (9%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKS 174
ASEDL+ GD A +P S +++ E V ++ L T + ++ L L+ M EK
Sbjct: 173 ASEDLKFGDVALEIPVSSIISEEYVYNSDMYPILETFDGITSETMLLLWTMREKHNLDSK 232
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTV 234
F PY L G L + + L G+ EI++ E ++ Y+EL
Sbjct: 233 F-KPYFDSLQENFCTG-------LSFGVDAIMELDGTLLLDEIMQAKELLRERYDEL--- 281
Query: 235 WFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAV 266
L + P E +T+E + A+ V
Sbjct: 282 ----IPLLSNHREVFPPELYTWEHYLWAYFDV 309
>gi|428177750|gb|EKX46628.1| hypothetical protein GUITHDRAFT_107412 [Guillardia theta CCMP2712]
Length = 606
Score = 41.2 bits (95), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 16/47 (34%), Positives = 27/47 (57%)
Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+Q P K G + + GP N++LL+ YG+ ++DNPY +E+
Sbjct: 440 LQFCTMAPIKQGSQVFLNYGPLDNTQLLLYYGYAEQDNPYQTYAIEL 486
>gi|71659283|ref|XP_821365.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70886742|gb|EAN99514.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 661
Score = 41.2 bits (95), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 466 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 517
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
Y + L+RA I E L VW+ G +
Sbjct: 518 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 552
>gi|392563539|gb|EIW56718.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 441
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 16/47 (34%), Positives = 30/47 (63%)
Query: 309 VQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ LV+ P G ++ GP+PN++L++ YGF +NP D +V+++
Sbjct: 250 ISLVIHTPTTTGSELLNNYGPKPNAELILGYGFSLPNNPDDTIVLKI 296
>gi|146415322|ref|XP_001483631.1| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
6260]
Length = 466
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 64/282 (22%), Positives = 111/282 (39%), Gaps = 55/282 (19%)
Query: 113 VAASEDLQAGDAAFSVP-------NSLVVTLERVLGNETIAEL---------LTTNKLSE 156
V A++++ A + +P N+++ + R G E++ +L TT++ +E
Sbjct: 73 VYATQNVSAKETLVRIPHSFLMNTNTIIKHISRFNGKESVPDLGYSVLLPSEYTTDQWTE 132
Query: 157 LAC---------------LALYLMYEKKQGKKSFW------LPYIRELDRQRGRGQLAVE 195
L ALY+ EKK+ + SFW LP + ELD
Sbjct: 133 LYAKIPISKWLQLTAFQRTALYICLEKKRKENSFWCAFISSLPKLEELDF---------- 182
Query: 196 SPLLWS-ETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAG-SLFQQYPYDIPTEA 253
+P++W E+E LTGS A+ E R + + V F + ++ +E
Sbjct: 183 APIVWEVESE---LTGSKA-ADFFELLPRSSRNHAKKVLVRFNEDYTAVSEFLTAAKSEP 238
Query: 254 FTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVV 313
F A++ + S +++ S L P + + C A D+ +V
Sbjct: 239 LNKMEFLWAWMCINSRCLYMSFPSSKAEADNFTLAPYVDFLNHDCDEKCAIKIDSRGFLV 298
Query: 314 DR--PYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+ AG+ ++ GP N LL Y F E N ++ L V
Sbjct: 299 ISCVDHAAGQELLFSYGPHSNEFLLCEYAFTMETNKWNNLDV 340
>gi|354502761|ref|XP_003513450.1| PREDICTED: SET domain-containing protein 4 [Cricetulus griseus]
Length = 440
Score = 40.8 bits (94), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 57/246 (23%), Positives = 101/246 (41%), Gaps = 22/246 (8%)
Query: 119 LQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL--SELACLALYLMYEKKQGKKSFW 176
LQ G S+P S ++T V+ ++ + K S L L +L+ E+ G +S W
Sbjct: 66 LQEGQMIISLPESCLLTTNTVI-RSSLGPYMKKWKPPPSPLLALCTFLISERHAGGQSLW 124
Query: 177 LPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF 236
Y+ L + + P+ E ++ L P KA+ E+ ++ + +
Sbjct: 125 KSYLDILPK-------SYTCPVCL-EPDVVDLLPQPLKAKAEEQRADVQDFFASSRAFFS 176
Query: 237 MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKV---SLARRFALVPLGP--PL 291
LF + P D F++ F A+ V + V+L+ L+ L P L
Sbjct: 177 TLQPLFVE-PVD---GIFSYSAFLWAWCTVNTRAVYLRSTRQECLSAEPDTCALAPYLDL 232
Query: 292 LAYSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L +S + KA + ++ + E + + GP N +LL+ YGFV NP+
Sbjct: 233 LNHSPHVQVKAAFSEKTGCYEIRTASRCRKHEQVFICYGPYDNQRLLLEYGFVSVCNPHA 292
Query: 350 RLVVEV 355
+ V +
Sbjct: 293 CVPVNI 298
>gi|242210761|ref|XP_002471222.1| predicted protein [Postia placenta Mad-698-R]
gi|220729781|gb|EED83650.1| predicted protein [Postia placenta Mad-698-R]
Length = 264
Score = 40.8 bits (94), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 66/269 (24%), Positives = 108/269 (40%), Gaps = 41/269 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVT-------LERVLGNETIAELLTTNKLSE--LAC--LA 161
V AS DL + A S+P SL +T LE + E A L + SE L C +
Sbjct: 10 VIASSDLPSDTAVVSIPFSLAITPTVSRAALETLF--EASAAKLALDGWSERQLICSYIC 67
Query: 162 LYLMYEKKQGKKSFWL---PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEIL 218
++ + E+ + L PY+ L + +PL + ELA G+ L
Sbjct: 68 MHWVVERNTCGQPAVLAHHPYLATLPSPD-----KLLTPLHFIPVELAAFQGTNLYGATL 122
Query: 219 ERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSL 278
R + + E W S Q D + FT++ + A + S +S
Sbjct: 123 ARRDAWRAE-------WQACQSAIAQTNPDW-ADTFTWDRYLTASTYLSSRAFPSTLLSP 174
Query: 279 ARRFALVPLGPPLLA------YSSKCKAMLAAVDDA------VQLVVDRPYKAGESIVVW 326
A P P+L ++ + + AV A + LV+ + AG +
Sbjct: 175 TPSLAPSPDSHPVLLPGVDALNHARGQPVSWAVSTAPNAPSSISLVLHNAHPAGAELFNN 234
Query: 327 CGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
GP+PN++L++ YGF NP D +V+++
Sbjct: 235 YGPKPNAELILGYGFALPHNPDDTIVLKL 263
>gi|340505659|gb|EGR31971.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 705
Score = 40.8 bits (94), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 26/104 (25%), Positives = 52/104 (50%), Gaps = 10/104 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN---------KLSELACLALY 163
+AA+ED+ A +PN ++++L ++ E + +++ N +E +A+Y
Sbjct: 21 IAAAEDIPANTIIACIPNKIMISLNQIKECE-LKDIINENPSLFDEEENAEAEFNIIAMY 79
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
+++EK +G+KSF+ PY + R +E L E+ Y
Sbjct: 80 VIHEKLKGEKSFYKPYFDTIQRSYTMYDWTIEEVKLTESEEIIY 123
>gi|449301991|gb|EMC98000.1| hypothetical protein BAUCODRAFT_146595 [Baudoinia compniacensis
UAMH 10762]
Length = 633
Score = 40.8 bits (94), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 51/214 (23%), Positives = 86/214 (40%), Gaps = 27/214 (12%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLW-SETELAYLTGSPTKAEILER 220
YLM + ++SFW PY+ L +PL + + +LA+L G+ +L R
Sbjct: 87 FYLMTQYLNKEQSFWKPYLDVLPSPS-----EFSTPLWFDAPADLAWLDGTDVLHTMLAR 141
Query: 221 AEGIKREYNELDTVWFMAGSLFQQYPYDI---PTEAFTFEIFKQAFVAVQS---CVVHLQ 274
E + Y V +G Y +D+ FT F + Q+ VH
Sbjct: 142 REVYAQYYQSGLKVLSESGIDVTLYTWDLFRWAITTFTSRSFTSRVLLPQNRKYWPVHRT 201
Query: 275 KVSLARRFALVPLG-------------PPLLAYSSKCKAMLAAVDDAVQLVVD--RPYKA 319
+ R+ L+ + P L + + A + DA Q + +P +A
Sbjct: 202 STNGRRQTVLLDMSHSPAEDLDFSVLFPGLDSGNHDPNAQVDWSFDANQFSIALVQPIEA 261
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
G + GP+ N +LL+ YGF +NP D +++
Sbjct: 262 GAEVCNNYGPKANDELLMGYGFCIPNNPRDEVLL 295
>gi|380472668|emb|CCF46664.1| SET domain-containing protein RMS1, partial [Colletotrichum
higginsianum]
Length = 289
Score = 40.8 bits (94), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 49/251 (19%), Positives = 92/251 (36%), Gaps = 51/251 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLER----------VLGNETIAELLTTNKLSELACLAL 162
+ A++D+ F++P ++ E GN+ E + L L L
Sbjct: 44 IIATKDIAPETVLFTIPRKSIINTETSELPKKIPQVFTGNDGDDEDMENEPLDSWGSLIL 103
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI-LERA 221
++YE QG S W PY L + ++ + W ++L +L GS ++I + A
Sbjct: 104 VMIYEYLQGDASPWKPYFEVLPDK-------FDTLMFWESSDLEWLRGSAVLSKIGKDEA 156
Query: 222 EGIKREY----------------------NELDTVWFMAGSLFQQYPYDIPTEAFTFEIF 259
+ + R EL + GS+ Y +D+
Sbjct: 157 DEMFRSRILSVIAANPTIFFPQGVAQPSETELLQLAHRMGSIIMAYAFDL---------- 206
Query: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKA 319
+ + ++ +VP+ +L ++ A + +D + +V RP KA
Sbjct: 207 ENEEEPEEENEEWVEDRDGKTMLGMVPMA-DILNADAEFNAHVNHGEDDLSVVALRPIKA 265
Query: 320 GESIVVWCGPQ 330
GE I+ + GP
Sbjct: 266 GEEILNYYGPH 276
>gi|85113406|ref|XP_964517.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
gi|28926302|gb|EAA35281.1| hypothetical protein NCU02158 [Neurospora crassa OR74A]
Length = 504
Score = 40.8 bits (94), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 27/47 (57%)
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
A + RPY AGE + + G N LLI YGF+ ++N +D + ++
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCID 316
>gi|336463341|gb|EGO51581.1| hypothetical protein NEUTE1DRAFT_125257 [Neurospora tetrasperma
FGSC 2508]
gi|350297448|gb|EGZ78425.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 503
Score = 40.8 bits (94), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 27/47 (57%)
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
A + RPY AGE + + G N LLI YGF+ ++N +D + ++
Sbjct: 270 AFTITTTRPYAAGEEVYICYGNHSNDFLLIEYGFLFDENVWDEVCID 316
>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
Length = 667
Score = 40.8 bits (94), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 20/68 (29%), Positives = 35/68 (51%)
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
+AL+PL + V D + + + +KAGE + ++ G + N+ L ++ GF
Sbjct: 451 YALIPLWDMCNHTNGTISTAYNPVLDRSECLAVKNFKAGEQLFIFYGSRSNADLFVHNGF 510
Query: 342 VDEDNPYD 349
V E+N YD
Sbjct: 511 VFENNDYD 518
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 58/119 (48%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSEL--ACLALYLMYEKK 169
V A+ D+ +VP L++++E + +L+ +K L + L+++L+ EK
Sbjct: 116 VKANVDIAESSLVIAVPRKLMMSVENA-KESVLKDLIEKDKILGSMPNVALSIFLLLEKY 174
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G SFW PYI L + + L +S EL L GSPT L + + I R+Y
Sbjct: 175 KGD-SFWKPYIDILPK-------TYTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225
>gi|145485580|ref|XP_001428798.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395886|emb|CAK61400.1| unnamed protein product [Paramecium tetraurelia]
Length = 331
Score = 40.8 bits (94), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 21/71 (29%), Positives = 34/71 (47%), Gaps = 1/71 (1%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
LVP G +L + KC+ + + + + + E I + G NS LL YGF
Sbjct: 186 CLVPFGD-MLNHHDKCQTKQKIIGTDLVFITTKQIQENEEIYNFFGEHGNSFLLCWYGFT 244
Query: 343 DEDNPYDRLVV 353
++N YD+L +
Sbjct: 245 YDNNIYDKLYL 255
>gi|303284022|ref|XP_003061302.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226457653|gb|EEH54952.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 536
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 67/316 (21%), Positives = 110/316 (34%), Gaps = 60/316 (18%)
Query: 83 SWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGN 142
SW K+G+ P S E R V A+ + AG VP +L+++ +
Sbjct: 27 SWALKHGVEPRDCRPDFVGSAREGWRG---VVATAPIAAGATLLRVPTALLMSGRTAAAD 83
Query: 143 ETIAELLTTNK-------LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVE 195
+ +A L+ + L+ LA++L+ E +G +SFW Y+R+L R +
Sbjct: 84 DVLARALSEHHERDGEPPLTPTDRLAVHLLRELSRGAESFWHLYLRQLPR-------SYA 136
Query: 196 SPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
W+ E L P + +R+ R+ W A + ++ +F
Sbjct: 137 LTCGWTAAERNALQ-LPHAVDAADRSAAACRD------AWARATPVMEKIGLPATYRSFG 189
Query: 256 FEIFKQAFVAVQSCVVHLQKVS----LARRFALVPLGPP--------------------- 290
+ A ++ ++ V + F VP PP
Sbjct: 190 AWAWAAATISSRTVFVPFDAAGALCPVGDLFNYVPPTPPHVPKVVGTPLEGPSDERDDEE 249
Query: 291 -------LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV- 342
L DA V R Y+ GE I + G N LL +YGF
Sbjct: 250 DDENDSYFLRRGVGGDGAWHEASDAWVFVARRDYRKGEEISLCYGQHTNLGLLTHYGFTM 309
Query: 343 -DEDNPYDR--LVVEV 355
+N +D LVV+
Sbjct: 310 SHGENAHDEAPLVVDA 325
>gi|449019745|dbj|BAM83147.1| similar to protein N-methyltransferase [Cyanidioschyzon merolae
strain 10D]
Length = 576
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 36/62 (58%), Gaps = 3/62 (4%)
Query: 291 LLAYSSKCKAMLAA--VDDAVQLVV-DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
L +SS+ ++ +A DA L + +R AG+ + + G N +LL YGFV+EDNP
Sbjct: 411 LFNHSSRVQSKVAYEYFYDAFSLSISNRDTHAGDQVFISYGTLTNDELLALYGFVEEDNP 470
Query: 348 YD 349
+D
Sbjct: 471 HD 472
>gi|406606937|emb|CCH41659.1| hypothetical protein BN7_1200 [Wickerhamomyces ciferrii]
Length = 577
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 52/214 (24%), Positives = 83/214 (38%), Gaps = 56/214 (26%)
Query: 178 PYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWF- 236
P+I L R G SP W+E E + + T A++ I N+L W+
Sbjct: 110 PFIEFLPTGREIG-----SPFFWNEMERSLIKN--TDADL-----AIDVGLNKLVEEWYD 157
Query: 237 MAGSL---FQQYPYDIPTEAFT-----------FEIFKQAFVAVQSCVVHLQKVSL--AR 280
+ L FQ Y Y + F FE F V+ S +L ++ +R
Sbjct: 158 IVTKLPKKFQSYQYQKDLKFFHDFQKDRDVSKHFEFFNDDSVSWTSFAAYLWSSTIFTSR 217
Query: 281 RFALVPLGPPLLAYSSKCK----AMLAAVDDA-----------------VQLVVDRPYKA 319
F P L++ + +C+ ML + D + ++ K
Sbjct: 218 GF------PFLISSTDECRDLNEGMLVPIQDLSNHNPSVEIKWGRLDKFMTFTTEQIVKK 271
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
G+ I GP+ N +LL YGFV ++N YD+ V+
Sbjct: 272 GDEIFSNYGPKSNHELLFGYGFVMDNNIYDKAVL 305
>gi|260946533|ref|XP_002617564.1| hypothetical protein CLUG_03008 [Clavispora lusitaniae ATCC 42720]
gi|238849418|gb|EEQ38882.1| hypothetical protein CLUG_03008 [Clavispora lusitaniae ATCC 42720]
Length = 430
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 46/204 (22%), Positives = 86/204 (42%), Gaps = 22/204 (10%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPT 213
LS LA+YL+ EK++G SFW P+I D +L++ +P++W ++ P
Sbjct: 117 LSSFQLLAIYLVLEKERGAASFWKPFI---DMLPSIEELSL-APVVWKVLQV------PH 166
Query: 214 KAEILERAEGIKREYNELDTVWFMAGSLFQQYPY--DIPT-EAFTFEIFKQAFVAVQSCV 270
++ R++ E + + Y D+P+ AF F A++ + S
Sbjct: 167 CDDLWRMLSRSARKHAE-----SVVARFEKDYAVVCDLPSVPAFERSSFLWAWMCINSRC 221
Query: 271 VHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 327
+++ Q + F + P L +S++ + + ++ YK E +
Sbjct: 222 LYMSMPQAKDTSDNFTMAPY-VDFLNHSNEDQCGIKIDPHGFHVLTSSAYKPQEELYFSY 280
Query: 328 GPQPNSKLLINYGFVDEDNPYDRL 351
GP N LL YGF N ++ +
Sbjct: 281 GPHSNEFLLCEYGFTLPHNKWNYI 304
>gi|340522118|gb|EGR52351.1| predicted protein [Trichoderma reesei QM6a]
Length = 377
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 71/164 (43%), Gaps = 25/164 (15%)
Query: 197 PLLWSETELAYLTGSPTKAEI-LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFT 255
P+LW EL L P ++++ LER E E W F++ D+P + +T
Sbjct: 107 PMLWPR-ELKQLL--PLESQVTLERRE------KEFQDNW----DDFKEAFPDVPRDDYT 153
Query: 256 FEIFKQAFVAVQSCVVHLQ-----KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
+ A++ V + + + K R AL+P+ + C+ + +
Sbjct: 154 Y-----AWLVVNTRTFYHETPETLKYPWEDRLALIPVADLFNHAAGGCRVYYSP-EGCYH 207
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
+V DR YK GE + + N L+ YGF+ ++N D + ++
Sbjct: 208 VVADRAYKKGEELFISYSSHSNDYNLLEYGFIPDENSLDDVYID 251
>gi|150864441|ref|XP_001383253.2| hypothetical protein PICST_42613 [Scheffersomyces stipitis CBS
6054]
gi|149385697|gb|ABN65224.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 453
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 50/223 (22%), Positives = 86/223 (38%), Gaps = 42/223 (18%)
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWS-------ETELA 206
L+ L+LYL +E+++ SFW P++ L +PL+W E +
Sbjct: 122 LTSFQLLSLYLCFERQRIHSSFWKPFLEMLPDISDFSL----NPLIWQVLQVDQWEELIQ 177
Query: 207 YLTGSPTKAEILERAEGIKREYNE------------LDTVWFMAGSLFQQYPYDIPTEAF 254
+L S + RAE + + E LD + S + P D
Sbjct: 178 FLPESAKR-----RAEDVYERFLEDYVVVRALVSRILDDLKLSESSADEYIPVD------ 226
Query: 255 TFEIFKQAFVAVQSCVVHL---QKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
+F A++ + S +++ Q + A F + P L +S + + +
Sbjct: 227 ---LFLWAWMCINSRCLYMTIPQGKTNADNFTMAPY-VDFLNHSCNDECSILIDTTGFHV 282
Query: 312 VVDRPYKAGESIVVWCGPQPNSKLLINYGFV-DEDNPYDRLVV 353
PY G+ + + GP N LL YGFV DN ++ L +
Sbjct: 283 RTTTPYMPGDQLFLSYGPHCNEFLLCEYGFVIPHDNKWNDLDI 325
>gi|432119027|gb|ELK38252.1| SET domain-containing protein 4 [Myotis davidii]
Length = 339
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 100/241 (41%), Gaps = 26/241 (10%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKK 169
H+ A + + G S+P S ++T + V+ + A + S L L +L+ EK
Sbjct: 22 HFRAGASGAREGQVIISLPESCLLTTDTVIRSYLGAYIAKWQPPPSPLLALCTFLVAEKH 81
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYN 229
G +S W PY+ L + A P+ E E+ L P +A+ E+ ++
Sbjct: 82 AGDRSPWKPYLEVLPK-------AYTCPVCL-EPEVVALLPRPLEAKAREQRTRVR---- 129
Query: 230 ELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL----- 284
EL T S Q + F++ F+ A+ V + V++++ RR L
Sbjct: 130 ELFTSSRGRFSSLQPLLSEAAASVFSYRAFRWAWCTVNTRAVYMER---GRRQGLSAEPD 186
Query: 285 -VPLGPPL-LAYSSKCKAMLAAVDDAVQLVVDRP---YKAGESIVVWCGPQPNSKLLINY 339
L P L L +S + AA ++ + R + E + + GP + +LL+ Y
Sbjct: 187 TCALAPYLDLLNNSPAVQVKAAFNEETRCYEIRTGSGCRRHEEVFICYGPHDSRRLLLEY 246
Query: 340 G 340
G
Sbjct: 247 G 247
>gi|342875304|gb|EGU77102.1| hypothetical protein FOXB_12400 [Fusarium oxysporum Fo5176]
Length = 371
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 34/74 (45%), Gaps = 2/74 (2%)
Query: 281 RFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYG 340
R +P CK +A+ +VQ DR Y GE + V GP N LL YG
Sbjct: 178 RLVCMPTADLFNHADQGCKLAYSALGYSVQ--ADRVYHQGEEVYVSYGPHSNDFLLSEYG 235
Query: 341 FVDEDNPYDRLVVE 354
F+ + N +D + ++
Sbjct: 236 FILDTNRWDEVYLD 249
>gi|323455796|gb|EGB11664.1| hypothetical protein AURANDRAFT_61664 [Aureococcus anophagefferens]
Length = 1916
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 27/46 (58%)
Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLV 352
DA + R Y AG+ + G + N++L+ NYGF++ NP+D V
Sbjct: 294 DAFAVNAHRDYDAGDEVHASYGKKSNAQLVANYGFLEPGNPFDDYV 339
>gi|254585507|ref|XP_002498321.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
gi|238941215|emb|CAR29388.1| ZYRO0G07502p [Zygosaccharomyces rouxii]
Length = 562
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 66/299 (22%), Positives = 121/299 (40%), Gaps = 47/299 (15%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
L D W KNG + I ++ S + + + A+E+ +P+ L++T E
Sbjct: 4 LEDCIQWAVKNGSIVDERIHFKQSSISGISAVVEGILATEE-----PLIQIPSKLLITNE 58
Query: 138 RVLGNETIAELLTTNKLSELACLALYLMYEKK----QGKKSFWLPYIRELDRQRGRGQLA 193
+ E+ + ++ + + A AL +Y K +G S + PYI L L
Sbjct: 59 K--AQESFQ--VDSDVIDKNAPNALVQLYVAKLKFAKGMPSIYQPYIDLLP-------LK 107
Query: 194 VESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL--------------------DT 233
+E P W EL + G+ + +R + E+ L D
Sbjct: 108 LEQPYFWDWKELQVIKGTDLYLVMKQRLPKLLEEWTTLLKKLSLEPSDDLGQLETPGLDL 167
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS---CVVHLQKVSLARRFALVPLGPP 290
V ++A +++ +P +F ++ A ++ ++ Q +S+ F L P+
Sbjct: 168 VDYVAR--YRETNEQLPWNSFAAYVWSAGIFASRAFPKIALNDQCLSINEAF-LYPI-VD 223
Query: 291 LLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
L + + K D + V K+GE + G + N +LL+NYGFV ++N YD
Sbjct: 224 FLNHKNDTKVKWCFQDGKMCFVSKESLKSGEELFNNYGDKSNEELLLNYGFVQDNNQYD 282
>gi|66827459|ref|XP_647084.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
gi|60475269|gb|EAL73204.1| hypothetical protein DDB_G0267502 [Dictyostelium discoideum AX4]
Length = 472
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/162 (22%), Positives = 71/162 (43%), Gaps = 20/162 (12%)
Query: 195 ESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAF 254
++ L + E E+ YL GSP +I+ E + Y++L F + + +
Sbjct: 163 DTSLYFDEKEIEYLAGSPAFVDIMVEKEVATKLYDQLSQTLFKDNVILEMCQG--QSTII 220
Query: 255 TFEIFKQAFVAVQSCVVHLQK----VSLARRFALVPLGPPLLAYSSKCKAMLAAVD---- 306
++ F+ A + + +++ S ++ L P+ PP++ Y + A +D
Sbjct: 221 GWDQFRWAHSTITARKIYVTDPDSVGSDGKQMKLSPVVPPIVDYFNHGNQPSAEIDYNEE 280
Query: 307 -DAVQLVVDRPYKAGESIVV-----WCGPQPNSKLLINYGFV 342
+V + + K GE I V +CG S LL++YG++
Sbjct: 281 LGSVDVKAIKDIKKGEEIFVSYDHHYCG----SDLLVDYGYL 318
>gi|449544081|gb|EMD35055.1| hypothetical protein CERSUDRAFT_107074 [Ceriporiopsis subvermispora
B]
Length = 457
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 31/50 (62%)
Query: 306 DDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
D AV L++ P G ++ GP+PN++L++ YGF NP D +V+++
Sbjct: 275 DLAVSLLLHSPTPRGAELLNNYGPKPNAELVLGYGFALPSNPDDTIVLKI 324
>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 791
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 65/290 (22%), Positives = 111/290 (38%), Gaps = 36/290 (12%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+ D+K W+ +NG V +K + + A+++ + + S+P S + +
Sbjct: 1 MEDIKKWVIQNGGVIDGVDVKTFEGYGRG------LCANKEFKKDEVIMSIPYS--IQIN 52
Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
R+ N E+ + +L L + K K F PYI L
Sbjct: 53 RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAVNKTNPKCFHWPYINVLPE------ 106
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
+ PL ++ EL + G+ A + E+ + V + L QQ+P
Sbjct: 107 -TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-----MKVVDYYNNKLIQQFPQYF-- 157
Query: 252 EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQL 311
++F ++FK+ A QS V + F V P +S+ C Q
Sbjct: 158 QSFD-DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTQT 216
Query: 312 ------VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ K GE I + N KLL+ YGFV+E+NP D L++ +
Sbjct: 217 ETFSFQTNEELVKPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRI 266
>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Entamoeba nuttalli P19]
Length = 791
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 66/291 (22%), Positives = 111/291 (38%), Gaps = 38/291 (13%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+ D+K W+ +NG V +K + + A+++ + + S+P S + +
Sbjct: 1 MEDIKKWVIQNGGVIDGVDVKTFDGYGRG------LCANKEFKKDEIIMSIPYS--IQIN 52
Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
R+ N E+ + +L L + K K F PYI L
Sbjct: 53 RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAVNKTNPKCFHWPYINVLPE------ 106
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-YDIP 250
+ PL ++ EL + G+ A + E+ + V + L QQ+P Y P
Sbjct: 107 -TYDCPLSYTIDELNLMKGTKLYAAV-EKINAFL-----MKVVDYYNNKLIQQFPQYFQP 159
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQ 310
+ ++FK+ A QS V + F V P +S+ C Q
Sbjct: 160 FD----DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTQ 215
Query: 311 L------VVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ K GE I + N KLL+ YGFV+E+NP D L++ +
Sbjct: 216 TETFSFQTNEALVKPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRI 266
>gi|340507383|gb|EGR33354.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 165
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 50/103 (48%), Gaps = 20/103 (19%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL--------LTTNKLSELACLALYL 164
V A E++ A ++PN+L+++ V +E L L + ++ LALYL
Sbjct: 51 VIAKEEIPANKVFVAIPNNLLLSTYLVEQSELKVILEENPHLFDLDEDDDAQFNKLALYL 110
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLL--WSETEL 205
M EK +G+ SFW PY+ Q+A ES L W E E+
Sbjct: 111 MKEKIKGENSFWYPYL----------QIAPESFTLLDWKEEEV 143
>gi|363746364|ref|XP_003643627.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
[Gallus gallus]
Length = 225
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 59/119 (49%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKK 169
+ A+ +++A + VP L++T+E N + L + +++ + LA +L+ E+
Sbjct: 108 LKATREIKAEELFLWVPRKLLMTVESA-KNSVLGSLYSQDRILQAMGNITLAFHLLCERA 166
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
SFWLPYI+ L + ++PL + E E+ YL + ++ + + R+Y
Sbjct: 167 -NPNSFWLPYIQTLPSE-------YDTPLYFEEDEVQYLRSTQAIHDVFSQYKNTARQY 217
>gi|320169513|gb|EFW46412.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 495
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 104/251 (41%), Gaps = 24/251 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYE-KKQG 171
V A DL AG+ VP SL++ +E + + +L +LS+ +A +L+YE +
Sbjct: 79 VFALRDLAAGETVLRVPLSLLLNVEHASAS-PLGGILDDFRLSDAEAMAFWLIYELTRPE 137
Query: 172 KKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKR----E 227
+ S WLPY+ L QL + + E+ L SP AE R R +
Sbjct: 138 RASPWLPYLESL--PASIKQLT----MFYDPFEMKRLQASPV-AEFTSRRTVKMRNKFGK 190
Query: 228 YNELDTVWFMAGSLFQQYPYDIPT-EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
Y E + A ++P ++ T + F + + Q + V H R LVP
Sbjct: 191 YREQISKHRPAHLAEIEFPVELITVDDFLWAMAVQFTRLITVQVKHPADGEWERTKCLVP 250
Query: 287 LG-----PPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQ--PNSKLLIN 338
L P + +C L + + RP G+ ++ + G + N +L+++
Sbjct: 251 LADLLNTAPADQINVECATNLDSTH--FECATIRPVAEGQELLTPYGGAEQLSNGQLIMD 308
Query: 339 YGFVDEDNPYD 349
YG +NP D
Sbjct: 309 YGVTFRNNPSD 319
>gi|315045047|ref|XP_003171899.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
gi|311344242|gb|EFR03445.1| SET domain-containing protein 6 [Arthroderma gypseum CBS 118893]
Length = 485
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 24/105 (22%), Positives = 49/105 (46%), Gaps = 9/105 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ AS D+ + F +P L+++++ + L +L L + ++YE QG+
Sbjct: 51 ICASRDITEDEELFVIPEDLILSVQNSEARTVLG--LDDKQLGPWLSLIIAMIYEYYQGE 108
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
+S W PY L + ++ + W++ +L+ L GS +I
Sbjct: 109 QSKWYPYFGVLPS-------SFDTLMFWTDEQLSELQGSAVVGKI 146
>gi|145354549|ref|XP_001421544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581782|gb|ABO99837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 27/42 (64%)
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
V+L+ R +GE I + G N +LL++YGF+ +DNP+D
Sbjct: 276 GVELIARRALTSGEPIELSYGNLSNDELLLDYGFIVKDNPFD 317
>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
str. Neff]
Length = 408
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 50/242 (20%), Positives = 94/242 (38%), Gaps = 30/242 (12%)
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC-----LALYLMYEKKQGKKSF 175
A + VP SL++ L E + + K + A LAL++++E ++ SF
Sbjct: 4 ASERILEVPFSLLLDAGAALRAEDVGSVFAAVKPALDAVDNRLPLALFMLHELRK-PDSF 62
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W PY L + V P+ W++ ++ L GSP A +L + + + + E
Sbjct: 63 WRPYFDALPSR-------VNLPMFWADEDMQLLAGSPLHAAVLAQKKQARDWHTE----- 110
Query: 236 FMAGSLFQQY--PYDIPTE------AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
+ ++Y P+ + + +++ F+ + S + +VP+
Sbjct: 111 -HIVPIVRRYPRPFGVSDDDSSLEPSYSLARFEWVLSMIASRAFWHFDLKDTWEPHMVPM 169
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLV---VDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
+ + DD Q V +PY GE + + N +LL Y + E
Sbjct: 170 ADLINHSLTNDNVSKYTFDDKTQTFIVHVQQPYAEGEQVFITYCTDSNFELLKTYAMMVE 229
Query: 345 DN 346
DN
Sbjct: 230 DN 231
>gi|302828172|ref|XP_002945653.1| hypothetical protein VOLCADRAFT_120141 [Volvox carteri f.
nagariensis]
gi|300268468|gb|EFJ52648.1| hypothetical protein VOLCADRAFT_120141 [Volvox carteri f.
nagariensis]
Length = 163
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 41/91 (45%), Gaps = 6/91 (6%)
Query: 111 HYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQ 170
+ + A E ++ G VP L+++ + +E + L+E L L+L+ E+
Sbjct: 77 YSLVADEPVRRGQILVRVPRRLLMSQDTARASEACGRTVREAGLNEWQSLILHLLCERAL 136
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWS 201
G +SFW PY+ L Q PLLW
Sbjct: 137 GSRSFWAPYLDTLP------QDMSFHPLLWG 161
>gi|300124011|emb|CBK25282.2| unnamed protein product [Blastocystis hominis]
Length = 366
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 44/183 (24%), Positives = 74/183 (40%), Gaps = 19/183 (10%)
Query: 164 LMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEG 223
+ E + + SF+ PY L P++W+ +E+ L GS I R
Sbjct: 57 FLLEDMENEDSFYKPYYDTLPED------ISNIPVIWTNSEINQLHGSYFSICIRSRVVE 110
Query: 224 IKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFA 283
I R+Y ++ V S F +YP+D I + F + + + + V LA
Sbjct: 111 IYRDYQKMCDV----NSFFCRYPFDQYLRV-RLLIGSRNFGSFFNSLNNGILVPLADMLN 165
Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
Y K KA + + + L + G ++ G + N +LL +YGFV+
Sbjct: 166 HTRPRQTTWEYDDKEKAFV--ITSLLNL------RQGAQVMDSYGRRDNRRLLFSYGFVE 217
Query: 344 EDN 346
+DN
Sbjct: 218 DDN 220
>gi|440464432|gb|ELQ33864.1| hypothetical protein OOU_Y34scaffold00857g1 [Magnaporthe oryzae
Y34]
Length = 464
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 48/111 (43%), Gaps = 3/111 (2%)
Query: 241 LFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKA 300
L Q+ P E FT E +K A V S + L P +L +S K
Sbjct: 146 LLVQHRDLFPLEQFTIEDYKWALCTVWSRAMDFVLPGGNSIRLLAPFAD-MLNHSDNVKQ 204
Query: 301 MLA--AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYD 349
A + + ++ + Y+AG+ + ++ GP NS+LL YGFV N D
Sbjct: 205 CHAYDSSSKTLSVLAGKDYEAGDQVFIYYGPVSNSRLLRLYGFVLPGNSND 255
>gi|328854233|gb|EGG03367.1| hypothetical protein MELLADRAFT_90239 [Melampsora larici-populina
98AG31]
Length = 509
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 32/67 (47%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
A+VPL L A S A L + + + + GE I PN+ LL YG V
Sbjct: 260 AMVPLADILNAKSGCENAKLFYEPTTLNMTTTKSIRKGEQIYNTYADPPNADLLRRYGHV 319
Query: 343 DEDNPYD 349
D++NP+D
Sbjct: 320 DDENPFD 326
>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 791
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 65/291 (22%), Positives = 115/291 (39%), Gaps = 38/291 (13%)
Query: 78 LGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLE 137
+ D+K W+ +NG V +K + + A+++ + + S+P S + +
Sbjct: 1 MEDIKKWVIQNGGIIDGVDVKTFEGYGRG------LCANKEFKQDEIIMSIPYS--IQIN 52
Query: 138 RVLGNETIAELLT------TNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
R+ N E+ + +L L + K K F PYI L +
Sbjct: 53 RINLNHIWPEVKLPKFNEGDDDRDDLNGLVYLYLAINKTNPKCFHWPYINVLPK------ 106
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYP-YDIP 250
+ PL ++ EL + G+ + E+ + V + L QQ+P Y P
Sbjct: 107 -TYDCPLSYTIDELNIMKGTKLYVAV-EKINAFL-----MKVVDYYNNKLIQQFPQYFQP 159
Query: 251 TEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKC-KAMLAAVDDAV 309
+ ++FK+ A QS V + F V P +S+ C +A + + +
Sbjct: 160 FD----DLFKRLQWAHQSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTR 215
Query: 310 QLVV-----DRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ K GE I + N KLL+ YGFV+E+NP D L++ +
Sbjct: 216 TETFSFQTNEEVVKPGEQIFNNYRIRSNEKLLLGYGFVEENNPCDNLLLRI 266
>gi|71409849|ref|XP_807248.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871208|gb|EAN85397.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 544
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 349 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 400
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
Y + L+RA I E L VW+ G +
Sbjct: 401 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 435
>gi|299748031|ref|XP_002911244.1| tho2 protein [Coprinopsis cinerea okayama7#130]
gi|298407787|gb|EFI27750.1| tho2 protein [Coprinopsis cinerea okayama7#130]
Length = 2474
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 15/48 (31%), Positives = 31/48 (64%)
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
++ L+ G+ + GP+PNS+L+++YGF +DNP D +++++
Sbjct: 209 SISLIAHSAIWTGQEVFNNYGPKPNSELILSYGFSIQDNPDDSIILKL 256
>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
Length = 539
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 38/159 (23%), Positives = 73/159 (45%), Gaps = 18/159 (11%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
K+E+ DL W +NG V E + E+ + A+ +++A + VP L
Sbjct: 74 KREDYFPDLMKWASENG---ASVDGFEMVNFKEEGFGLR---ATREIKAEELFLWVPRKL 127
Query: 133 VVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQGKKSFWLPYIRELDRQRGR 189
++T+E N + L + +++ + LA +L+ E+ SFW PYI+ L +
Sbjct: 128 LMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLCERAS-PNSFWQPYIQTLPSE--- 182
Query: 190 GQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
++PL + E E+ YL + ++ + + R+Y
Sbjct: 183 ----YDTPLYFEEDEVRYLQSTQAIHDVFSQYKNTARQY 217
>gi|347967018|ref|XP_321037.5| AGAP002018-PA [Anopheles gambiae str. PEST]
gi|333469795|gb|EAA01259.5| AGAP002018-PA [Anopheles gambiae str. PEST]
Length = 493
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 58/246 (23%), Positives = 104/246 (42%), Gaps = 35/246 (14%)
Query: 121 AGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA--CLALYLMYEKKQGKKSFWLP 178
AG+ +VP S+ + + EL+ +SE LAL L+ E+ + K S W P
Sbjct: 110 AGECIITVPRSMFFYVTNEPRYRQLLELMPGAMMSEQGNIMLALALIMERFRAK-SDWKP 168
Query: 179 YIREL-DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFM 237
Y+ L DR +PL ++ ++ L + L+ + I R+Y +
Sbjct: 169 YLDLLPDR--------YTTPLYYTTEDMGELAETDAFLPALKLCKHIARQYGFIRR---- 216
Query: 238 AGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS----CVVHLQKV-SLARRFALVPL----- 287
F Q D + FT+++F+ A V + V+L + + AL+PL
Sbjct: 217 ----FVQEKVDELRDCFTYDVFRWAVSTVMTRQNKVPVNLAEFDGMDHTLALIPLWDMAN 272
Query: 288 -GPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYK--AGESIVVWCGPQPNSKLLINYGFV 342
P A ++C A A ++ ++ + R A I + G + +++ L++ GFV
Sbjct: 273 HAFPDTANETRCVAETCYNATNEQLECSLTREVSDIASVPIFIVYGTRTDAEFLVHNGFV 332
Query: 343 DEDNPY 348
NP+
Sbjct: 333 CPRNPH 338
>gi|300122775|emb|CBK23792.2| unnamed protein product [Blastocystis hominis]
Length = 854
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 56/121 (46%), Gaps = 10/121 (8%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGN----ETIAELLTTNKLSELACLALYLMYEK 168
V A E +Q G+ + N V+ L L + + + N+LSE A +AL L++EK
Sbjct: 512 VIAKEAIQKGEEVLRIHNDTVIGLHTALTHPRFGKAFSAFYHQNQLSEYALIALTLLWEK 571
Query: 169 KQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKRE 227
++ S + P++ +L P+L S+ +L +L GS E+ + RE
Sbjct: 572 FDNERWSLFAPFLAKLPSIE-----EFHHPVLLSKDDLLHLYGSALLDEVSALNATLHRE 626
Query: 228 Y 228
+
Sbjct: 627 F 627
>gi|41054567|ref|NP_955894.1| N-lysine methyltransferase setd6 [Danio rerio]
gi|82177062|sp|Q803K4.1|SETD6_DANRE RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
domain-containing protein 6
gi|27882107|gb|AAH44440.1| SET domain containing 6 [Danio rerio]
Length = 460
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 96/244 (39%), Gaps = 31/244 (12%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-----LSELACLALYLMYEKK 169
A ED++ G F++P ++ G + ++L K S L L LMYE
Sbjct: 53 AKEDIEEGHVLFTIPREALLHQ----GTTKVKKVLEEGKKCLESASGWVPLLLSLMYEYT 108
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL-AYLTGSPTKAEILERAEGIKREY 228
S W PY+ R ++ P+ WSE E L G+ ++ ++ EY
Sbjct: 109 SST-SHWKPYLSLWPDFR-----TLDQPMFWSEEECDKLLKGTGIPESVITDLRKLQDEY 162
Query: 229 NELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQS----CVVHLQKVSLARRF 282
N + + FM + +P E E++K AFV S + +
Sbjct: 163 NSV-VLPFM-----KSHPDLWDPEKHNLELYKSLVAFVMAYSFQEPVEDDDEDEEDDEKK 216
Query: 283 ALVPLGPPL---LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINY 339
+P+ P+ L + SK A L + +++V R GE + G N +LL Y
Sbjct: 217 PNLPMMVPMADMLNHISKHNANLEYTPECLKMVSIRRIGKGEEVFNTYGQMANWQLLHMY 276
Query: 340 GFVD 343
GF +
Sbjct: 277 GFAE 280
>gi|407846232|gb|EKG02467.1| hypothetical protein TCSYLVIO_006496 [Trypanosoma cruzi]
Length = 546
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 45/95 (47%), Gaps = 10/95 (10%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 351 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 402
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
Y + L+RA I E L VW+ G +
Sbjct: 403 YSSQYDIALSCLQRAVAISTEDETLADVWYNIGHI 437
>gi|392587357|gb|EIW76691.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 430
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/52 (30%), Positives = 31/52 (59%)
Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
+ + + +V + G+ I GP+PNS+L++ YGF +NP D +V+++
Sbjct: 230 STETTISIVSHDAFNEGQEIYNNYGPKPNSELILGYGFSLPNNPDDTIVLQL 281
>gi|270005260|gb|EFA01708.1| hypothetical protein TcasGA2_TC007288 [Tribolium castaneum]
Length = 253
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 58/119 (48%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNK-LSEL--ACLALYLMYEKK 169
V A+ D+ +VP L++++E + +L+ +K L + L+++L+ EK
Sbjct: 116 VKANVDIAESSLVIAVPRKLMMSVENA-KESVLKDLIEKDKILGSMPNVALSIFLLLEKY 174
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+G SFW PYI L + + L +S EL L GSPT L + + I R+Y
Sbjct: 175 KGD-SFWKPYIDILPK-------TYTTVLYFSIDELEELRGSPTLEVALRQIKSITRQY 225
>gi|159468798|ref|XP_001692561.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278274|gb|EDP04039.1| predicted protein [Chlamydomonas reinhardtii]
Length = 724
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 52/222 (23%), Positives = 86/222 (38%), Gaps = 30/222 (13%)
Query: 147 ELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA 206
++L +L L +MYE +G S W Y+R L + P+ WS +L
Sbjct: 80 DVLRAERLGGGLALVAAVMYEAARGPASKWHGYLRSLPARE-------YLPVFWSARQLQ 132
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIP--TEAFTFEIFKQAFV 264
L G+ ++ ++AE R D +A L +YP + ++ E F A
Sbjct: 133 QLAGT----DLADKAE-EDRASMAADFSTHLA-PLLSRYPGRLGHLAAGWSLEAFMHAAS 186
Query: 265 AVQSCVVHLQKVSLARRFALVPLGP---------PLLAYSSKCKAMLAAVDDAVQL---V 312
V S ++ ALVPL L S + A + V+L V
Sbjct: 187 WVASRAFYVDDT---HGDALVPLADVFNHKAARVDLGEGSGWSAGFVVAEQEGVELLDIV 243
Query: 313 VDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
+P G + G N++L+ YGF N +D ++++
Sbjct: 244 AAQPLAGGTEVYNTYGEHSNAELVNKYGFALPYNAFDEILLD 285
>gi|428167603|gb|EKX36559.1| hypothetical protein GUITHDRAFT_155193 [Guillardia theta CCMP2712]
Length = 321
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 31/133 (23%), Positives = 58/133 (43%), Gaps = 10/133 (7%)
Query: 57 VSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNG-LPPCKVILKEKPSHNEKHRPIHYVAA 115
V++ D A + ++ +ED W NG + K+ +K + V
Sbjct: 50 VAAGDQGAASGADQQAQLQEDWTAFVKWFRSNGGIISSKLTVKVRNGRQG-------VYF 102
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNET--IAELLTTNKLSELACLALYLMYEKKQGKK 173
E ++ G+ S P +L + + + + + + L +K + L++++E K GK
Sbjct: 103 KERMRRGETIVSFPRNLRLDEKTAMKGKAGHVFQRLKQDKCYPDLMVILHVVHEDKLGKD 162
Query: 174 SFWLPYIRELDRQ 186
SFW PY + L RQ
Sbjct: 163 SFWFPYFKLLRRQ 175
>gi|242045610|ref|XP_002460676.1| hypothetical protein SORBIDRAFT_02g032970 [Sorghum bicolor]
gi|241924053|gb|EER97197.1| hypothetical protein SORBIDRAFT_02g032970 [Sorghum bicolor]
Length = 489
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 60/278 (21%), Positives = 116/278 (41%), Gaps = 58/278 (20%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
+AA+ DL+ G+ P + ++T +RV ++ IA ++ ++ LS + L + L+ E
Sbjct: 57 LAAARDLRRGELVLRAPRAALLTSDRVTADDPRIAACVSAHRPRLSSVQILIVCLLAEVG 116
Query: 170 QGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI---KR 226
+G+ S W PY+ +L P + T LA T + E L+ + I ++
Sbjct: 117 KGRNSVWYPYLSQL-------------PSYY--TILA--TFDDFEVEALQVDDAIWVAQK 159
Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP 286
+ + + W L ++ + + F+ + AF V S +H ++ L P
Sbjct: 160 AKSAIKSDWEDVTPLMKELEF--KPKLLMFKSWLWAFATVSSRTLH---IAWDEAGCLCP 214
Query: 287 LGPPLLAYSSKCKAMLAAVDDAVQL---------------VVDRPY-------------- 317
+G L Y++ +D +L + D Y
Sbjct: 215 VG-DLFNYAAPDDDTSLEAEDTAELTNYQQKNEMINSSERLTDGGYEDSNAYCLYARKNY 273
Query: 318 KAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
K GE +++ G N +LL +YGF+ +NP ++ +E+
Sbjct: 274 KQGEQVLLGYGTYTNLELLEHYGFLLGENPNEKTFIEL 311
>gi|194707708|gb|ACF87938.1| unknown [Zea mays]
Length = 352
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 31/48 (64%)
Query: 304 AVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRL 351
++D +V+++ ++ K E+I + G PN L++YGFV NPYD++
Sbjct: 136 SLDMSVKVLAEKKIKQNEAITLNYGCYPNDFFLLDYGFVITQNPYDQV 183
>gi|321257099|ref|XP_003193469.1| nucleus protein [Cryptococcus gattii WM276]
gi|317459939|gb|ADV21682.1| nucleus protein, putative [Cryptococcus gattii WM276]
Length = 491
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 57/262 (21%), Positives = 100/262 (38%), Gaps = 51/262 (19%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSEL----ACLALYLMYEKKQ 170
A +D++ G F V ++L+++ + + L ++ +L A L L +M+E +
Sbjct: 46 AVKDIEEGTPLFHVTDNLILSPY----TSDLKDHLDASEWDQLNKGWAQLILVMMWETIK 101
Query: 171 GKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
G KS W Y+ + + E+P+ W+E + L+G+ + A+ I RE E
Sbjct: 102 GSKSRWAGYLTNM-------PVMFETPMFWTEQQRDQLSGT-------DIADRIGREDAE 147
Query: 231 LDTVWFMAGSLFQQ---YPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLAR------- 280
+ +A + +P D P + + + +S V L + ++
Sbjct: 148 AEYTSLLAPFIKAHPDLFPVDSPHTTIDAFHIQGSRILSRSFTVPLHRFGRSQSQSQSDG 207
Query: 281 ------------RFALVPLGPPLLAYSSKCKAML-------AAVDDAVQLVVDRPYKAGE 321
++P L A K A L D+ V + R K E
Sbjct: 208 NETESDDEEEEEVVVMIPFADMLNAAWGKDNAHLYVDEDTIEGFDEGVVMKSTRLVKQSE 267
Query: 322 SIVVWCGPQPNSKLLINYGFVD 343
I PNS+LL YG VD
Sbjct: 268 QIYNTYDSPPNSELLRKYGHVD 289
>gi|402224283|gb|EJU04346.1| hypothetical protein DACRYDRAFT_114691 [Dacryopinax sp. DJM-731 SS1]
Length = 1313
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 30/61 (49%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
A+VP+ L A A L D +Q++ +P GE I G PNS LL YG+V
Sbjct: 1060 AMVPMADMLNARCGCNNAKLFYTRDDLQMMATKPIAKGEQIWNTYGDPPNSDLLRRYGYV 1119
Query: 343 D 343
D
Sbjct: 1120 D 1120
>gi|254568640|ref|XP_002491430.1| Nuclear protein that contains a SET-domain [Komagataella pastoris
GS115]
gi|238031227|emb|CAY69150.1| Nuclear protein that contains a SET-domain [Komagataella pastoris
GS115]
Length = 538
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 32/61 (52%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
++VP L A ++ C A L + + + +P K GE + G PNS++L YG+V
Sbjct: 254 SMVPFADTLNADTNLCNANLTYQSENLVMKAIKPIKKGEQVYNTYGNHPNSEILRRYGYV 313
Query: 343 D 343
+
Sbjct: 314 E 314
>gi|407920105|gb|EKG13323.1| hypothetical protein MPH_09605 [Macrophomina phaseolina MS6]
Length = 574
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 49/224 (21%), Positives = 86/224 (38%), Gaps = 32/224 (14%)
Query: 143 ETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSE 202
E + LL N LS +A + L+ G+KS W PYI L + + +P+ ++E
Sbjct: 71 EVVQGLLPNNVLSNIALIKELLL-----GEKSLWAPYINCLPKSE-----QLNTPIYFAE 120
Query: 203 -----------TELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPT 251
+ A+L G+ R E + E+ +V G I T
Sbjct: 121 EMTQEAINGRRNDTAWLLGTNLDKSWRPRKEQWEEEWKNAVSVLKRQG---------IAT 171
Query: 252 EAFTFEIFKQA--FVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
E +T++ + A +S + + ++A++ LL + K +
Sbjct: 172 EGYTWDAYAWAATIFTSRSFISDPGLSKESSQYAVLMPVIDLLNHRFPTKVAWFFNEGNF 231
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
Q + + P G I G + N +LL YGF +N D + +
Sbjct: 232 QFITEEPVPKGHEIFNNYGGKGNEELLNGYGFCIPNNHCDEVAI 275
>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
Length = 749
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 52/200 (26%), Positives = 78/200 (39%), Gaps = 23/200 (11%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
LYL K K W PYI L + PL ++ EL + G+ A + E+
Sbjct: 42 LYLAVNKTNPKCFHW-PYINVLPE-------TYDCPLSYTIDELNLMKGTKLYAAV-EKI 92
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR 281
+ V + L QQ+P ++F ++FK+ A QS V +
Sbjct: 93 NAFL-----MKVVDYYNNKLIQQFPQYF--QSFD-DLFKRLQWAHQSFWSRAFLVIYPQP 144
Query: 282 FALVPLGPPLLAYSSKCKAMLAAVDDAVQL------VVDRPYKAGESIVVWCGPQPNSKL 335
F V P +S+ C Q + K GE I + N KL
Sbjct: 145 FGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRIRSNEKL 204
Query: 336 LINYGFVDEDNPYDRLVVEV 355
L+ YGFV+E+NP D L++ +
Sbjct: 205 LLGYGFVEENNPCDNLLLRI 224
>gi|443699166|gb|ELT98776.1| hypothetical protein CAPTEDRAFT_151537 [Capitella teleta]
Length = 413
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 104/262 (39%), Gaps = 42/262 (16%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL-----TTNKLSELACLALYLMYE 167
+ A+ D+ GD F +P SL++T + N TI LL + + S L + LMYE
Sbjct: 1 MVATSDISQGDTIFEIPRSLLLTPQ----NSTIGVLLNEEADSLQEASRWVPLLITLMYE 56
Query: 168 KKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY-LTGSPTKAEILERAEGIKR 226
S W PY D QL + P+ WS E+ L G+ + + I +
Sbjct: 57 YT-SPSSRWKPY---FDLVPDFDQLDL--PMFWSSDEVKRELKGTGIPSLVESDLLNISK 110
Query: 227 EYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQ--AFVAVQSCVVHL----------- 273
E+N+L Q++ E + +K+ AFV S
Sbjct: 111 EFNDL------VLPFIQKHSNVFSDECKCLKFYKKMVAFVMAYSFTEPPPSPDLDDSDDL 164
Query: 274 --QKVSLARRFALVPLGPPLLAYSSKCKAMLA--AVDDAVQLVVDRPYKAGESIVVWCGP 329
+ L + +VP+ +L + +K A L ++++V + + GE I G
Sbjct: 165 SGDEHDLMPQPMMVPMA-DILNHVAKNSARLDFPKGSSSLKMVATQDIQKGEEIFNTYGE 223
Query: 330 QPNSKLLINYGFVDED--NPYD 349
N LL YGF ++ N YD
Sbjct: 224 LANMNLLHMYGFAEDIGCNEYD 245
>gi|170090678|ref|XP_001876561.1| SET domain protein [Laccaria bicolor S238N-H82]
gi|164648054|gb|EDR12297.1| SET domain protein [Laccaria bicolor S238N-H82]
Length = 434
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 64/317 (20%), Positives = 120/317 (37%), Gaps = 54/317 (17%)
Query: 81 LKSWMHKNGLP-PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERV 139
L+SW+ K+G +V E S + V ASED+ S P L +T +
Sbjct: 11 LRSWLSKHGGGFNTRVRFSEATSGSR-------VVASEDIPEDTEIVSCPFDLAITRQHA 63
Query: 140 ---LGNE-TIAELLTTNKLSELACLALYLMYE-------KKQGKKSFWLPYIRELDRQRG 188
LGN +E++++ SE ++ Y+++ + K PY++ L
Sbjct: 64 QLALGNFLDTSEIISSTHWSERQWISTYIIFHWIICDESQPNLKILAHYPYLKTLPSSE- 122
Query: 189 RGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE-LDTV------WFMAGSL 241
+ +PL ++ E+ G+ L+R + E+ + LD V W + +
Sbjct: 123 ----KLLTPLHFTSEEIQLFKGTNLYGATLDRERDWRTEWADCLDRVSQANQDWGKSFTW 178
Query: 242 FQQYPYDIPTEAFTF----EIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA---- 293
+ + +F E + A + S +S PL P+L
Sbjct: 179 SATFSESTAYHSLSFFYDRERYLTAATYLSSRAFPSSLLSPTPSLKHSPLTEPVLLPGID 238
Query: 294 ---------------YSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLIN 338
Y + + + + L++ P G + GP+PNS+L++
Sbjct: 239 SLNHARGQPVSWVVRYPNNVPPDSSLQEPKISLILHTPATGGNELFNNYGPKPNSELILG 298
Query: 339 YGFVDEDNPYDRLVVEV 355
YGF NP D +++++
Sbjct: 299 YGFSLPQNPDDTILLKI 315
>gi|255071473|ref|XP_002499410.1| predicted protein [Micromonas sp. RCC299]
gi|226514673|gb|ACO60669.1| predicted protein [Micromonas sp. RCC299]
Length = 323
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 24/35 (68%)
Query: 320 GESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVE 354
GE +V+ G + N +LL +GF D DNP+D LV++
Sbjct: 199 GEEVVISYGDKTNEELLFVHGFADRDNPHDALVLQ 233
>gi|326491013|dbj|BAK05606.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 481
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 60/277 (21%), Positives = 113/277 (40%), Gaps = 60/277 (21%)
Query: 118 DLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTN--KLSELACLALYLMYEKKQGKKS 174
DL+ G+ VP + ++T +RV+ ++ IA + + +LS + L + + E +GK S
Sbjct: 53 DLRRGELVLRVPRAALLTSDRVMADDPRIASCIDAHRPRLSSIQRLIVCFLAEVGKGKSS 112
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGI-KREYNELDT 233
W Y+ +L P + T LA +A ++ A + ++ + + +
Sbjct: 113 SWYLYLSQL-------------PSYY--TILATFNDFEIEALQVDDAVWVAQKALSAIRS 157
Query: 234 VWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLA 293
W A L ++ D + F + AF V S +H V L P+G L
Sbjct: 158 EWEEATPLMRE--LDFKPKLLVFTTWLWAFATVSSRTLH---VPWDDAGCLCPIG-DLFN 211
Query: 294 YSS----------------KC---KAMLAAV----------------DDAVQLVVDRPYK 318
Y++ KC ML + +A L + Y+
Sbjct: 212 YAAPDDDTSSEEQDTEEAMKCHEINVMLGKIKLDSSSERMTDGGYEDSNAYCLYARKRYR 271
Query: 319 AGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
GE +++ G N +LL +YGF+ ++NP ++ +++
Sbjct: 272 KGEQVLLGYGTYTNLELLEHYGFLLDENPNEKTYIQL 308
>gi|121701277|ref|XP_001268903.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119397046|gb|EAW07477.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 498
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 42/175 (24%), Positives = 78/175 (44%), Gaps = 17/175 (9%)
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVW 235
W YIR + ++ P ++E EL L G+ + + + +++E+ L
Sbjct: 134 WTEYIRFM-------PPSIRLPTFYTEAELELLRGTSLRTAVFAKLASLEKEFERLRQS- 185
Query: 236 FMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPPLL 292
Q+Y +D T TF+ +K +S VV L + A+VP +
Sbjct: 186 TEGIPWCQKYWWDEDTGRLTFDDWKYVDAVYRSRVVELPESG----HAIVPCVDMANHAS 241
Query: 293 AYSSKCKAMLAAVDDAV-QLVVDRPYKAGESIVVWCGPQ-PNSKLLINYGFVDED 345
S K + ++ +DA+ QL R +GE + + G + P S+++ +YGFV+ +
Sbjct: 242 EDSVKARYDESSTEDALLQLRQGRRICSGEEVTISYGSEKPASEMVFSYGFVENE 296
>gi|219122993|ref|XP_002181819.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407095|gb|EEC47033.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 579
Score = 39.3 bits (90), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 28/47 (59%)
Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVV 353
+A L D+ +G+ + + GP+ N +LL YGFV+ +NP D V+
Sbjct: 361 NAYSLATDQAIPSGDEVYISYGPRSNDQLLQYYGFVERNNPNDVYVM 407
>gi|406978090|gb|EKE00118.1| hypothetical protein ACD_22C00090G0009 [uncultured bacterium]
Length = 478
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 44/191 (23%), Positives = 74/191 (38%), Gaps = 28/191 (14%)
Query: 109 PIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEK 168
P+HY+ ++ G FS P + T + G E + E L SE +Y MY K
Sbjct: 77 PMHYMQKEKEHVKG---FS-PELAIAT---IAGGEKLTEELAIRPTSETI---MYDMYRK 126
Query: 169 KQGKKSFWLPYIRELD----------RQRGRGQLAVE-SPLLWSETELAYLTGSPTKAEI 217
W R+L R R L + S LW E A+LT + +
Sbjct: 127 -------WTNSWRDLPVLINQWCNVVRWEKRTYLFLRTSEFLWQEGHCAHLTHEESTDTV 179
Query: 218 LERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVS 277
+ K+ YN+L +++ + G + + + +TFE ++Q+C H +
Sbjct: 180 IWAINAYKKTYNDLMSIYGIVGVKSESEKFAGAVKTYTFESLMPNGKSLQTCTSHDLGQN 239
Query: 278 LARRFALVPLG 288
++ F G
Sbjct: 240 FSKSFEWTVQG 250
>gi|409080258|gb|EKM80618.1| hypothetical protein AGABI1DRAFT_71041 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 492
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 38/81 (46%), Gaps = 10/81 (12%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
A+VP+ L A A L D +++V +P K GE I G PN++LL YG V
Sbjct: 245 AMVPMADILNARYQTENAKLFHEKDELKMVTTKPIKTGEQIWNTYGDLPNAELLRRYGHV 304
Query: 343 D--------EDNPYDRLVVEV 355
D NP D VVE+
Sbjct: 305 DFLSLPSGGHGNPGD--VVEI 323
>gi|340517549|gb|EGR47793.1| hypothetical protein TRIREDRAFT_122428 [Trichoderma reesei QM6a]
Length = 482
Score = 38.9 bits (89), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 58/247 (23%), Positives = 104/247 (42%), Gaps = 36/247 (14%)
Query: 128 VPNSLVVTLERVLG----NETIAELL-TTNKLSELACLALYLMYEKKQ------GKKSF- 175
+P LV++ E V ++ +LL S + LYL+ Q G ++F
Sbjct: 57 IPRDLVLSAEAVEEYAKVDQNFKQLLDVAGHQSTRGDIMLYLLTHLVQSKATSPGTRAFA 116
Query: 176 ---WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
W YIR L R + P +W+ E L G+ +A + + + EY++L
Sbjct: 117 STPWTEYIRFLPR-------PIPVPTMWTNDERELLKGTSLEAAVSAKLSALSSEYDKLC 169
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLL 292
A +L + + +E+ T E + A +S + L + A+VP G +
Sbjct: 170 E---EASAL--SFWSTLLSESATLEDWVLADAWYRSRCLELPRAG----HAMVP-GLDMA 219
Query: 293 AYSSKCKAMLAAVDDAVQLVVDRPYK---AGESIVVWCG-PQPNSKLLINYGFVDEDNPY 348
+S A D +++ RP AG I + G +P +++L +YGF+D+D+
Sbjct: 220 NHSQSHSAYYDESSDGDVVLLPRPGSKIPAGAEITISYGEAKPAAEMLFSYGFIDKDSTV 279
Query: 349 DRLVVEV 355
L + +
Sbjct: 280 KELTLHL 286
>gi|384490907|gb|EIE82103.1| hypothetical protein RO3G_06808 [Rhizopus delemar RA 99-880]
Length = 216
Score = 38.9 bits (89), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 28/119 (23%), Positives = 59/119 (49%), Gaps = 12/119 (10%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
V ++ ++ + +VP S+ +T E+V N T S +L+L+ +K GK
Sbjct: 22 VYTTDTVKENEKFATVPFSICIT-EKVARNA----FPTLTGFSGRVLQSLFLVQQKNLGK 76
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
KSF+ PYI L ++ + + L + E ++ Y+ + + + ER ++ ++++L
Sbjct: 77 KSFYFPYINILPKK-------IVTALHFDENDMNYIKKTNLELALRERKTALRDDFDKL 128
>gi|410082986|ref|XP_003959071.1| hypothetical protein KAFR_0I01550 [Kazachstania africana CBS 2517]
gi|372465661|emb|CCF59936.1| hypothetical protein KAFR_0I01550 [Kazachstania africana CBS 2517]
Length = 584
Score = 38.9 bits (89), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 53/251 (21%), Positives = 97/251 (38%), Gaps = 38/251 (15%)
Query: 127 SVPNSLVVTLERVLGNETIAELLTTNKLSELACLALY---LMYEKKQGKKSFWLPYIREL 183
SVP+ L++T E L ++ ++ + A + LY L + K F+ PY+ L
Sbjct: 55 SVPSKLLITNELALKEFNVSSKNLSSLFNPNALIQLYLCKLKFNATTAKSDFFKPYLDIL 114
Query: 184 DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMA----- 238
V P W++++L L G+ T +I + + + E+ EL V +
Sbjct: 115 PPN-------VPHPYFWNKSQLQLLQGTDTLIKIKQNLQNLITEWYELLNVLEITPIEKE 167
Query: 239 ---------GSLFQQYPYDIPTEA----------FTFEIFKQAFVAVQSCVVHLQKVSLA 279
S+F ++ T + ++F IF A +++ S
Sbjct: 168 GTAFDVNDIDSIFSYISENVKTTSPKWNSFIAYLWSFGIFTSR--AFPEILINPDNCSNV 225
Query: 280 RRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVD-RPYKAGESIVVWCGPQPNSKLLIN 338
+ L P+ LL + + DD + + K + G + N +LL+
Sbjct: 226 NQAFLYPI-VDLLNHKNGTSVKWTFEDDQAHFFTNEKNLKKHTELFNNYGDKSNEELLLG 284
Query: 339 YGFVDEDNPYD 349
YGFV +N +D
Sbjct: 285 YGFVQSNNAHD 295
>gi|302660547|ref|XP_003021952.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
gi|291185873|gb|EFE41334.1| hypothetical protein TRV_03939 [Trichophyton verrucosum HKI 0517]
Length = 479
Score = 38.9 bits (89), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 28/126 (22%), Positives = 54/126 (42%), Gaps = 14/126 (11%)
Query: 97 LKEKPSHNEKHRPIHY-----VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
LK H + H IH A + + F +P+ L+++++ + L
Sbjct: 24 LKRSSPHFKMHPGIHIADLRSTGAGRGISEDEELFVIPDDLILSVQNSEARSVLG--LDD 81
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
+L L + ++YE QG++S W Y R L + ++ + W++ +L+ L GS
Sbjct: 82 KQLGPWLSLIITMIYEYYQGEQSKWYSYFRILPS-------SFDTLMFWTDEQLSELQGS 134
Query: 212 PTKAEI 217
+I
Sbjct: 135 SVVGKI 140
>gi|42820762|emb|CAF32075.1| SET domain protein, putative [Aspergillus fumigatus]
Length = 530
Score = 38.9 bits (89), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 65/289 (22%), Positives = 117/289 (40%), Gaps = 41/289 (14%)
Query: 81 LKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDL-QAGDAA--FSVPNSLVVTLE 137
L SW NG+ + ++ S + + VA +E + G+A +VP+ L +TLE
Sbjct: 59 LSSWAKLNGISLEGIAFQKLYSEHGTDKGSAIVATAEKKDEEGEANTLLTVPSDLALTLE 118
Query: 138 RVLGNETIAELL-----TTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQL 192
V + I L + ++ +K G + W YIR +
Sbjct: 119 YVHNHAKIDRHLREVLDAVGDFGRVCYSPDFVNKRQKIGISNPWTEYIRFM-------PA 171
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DTVWFMAGSLFQQYPY 247
+V P +S E L G+ + + + +++E++ L + W Q++ +
Sbjct: 172 SVPLPTFYSAEERELLRGTSLQTAVDAKLGSLEKEFDHLRQATEEIPWC------QEHWW 225
Query: 248 DIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP---LGPPLLAYSSKCKAMLAA 304
D T FTF+ +K +S VV L + A+VP + S K +
Sbjct: 226 DEDTGKFTFDDWKYVDAVYRSRVVDLPRSG----HAIVPCVDMANHACEDSVKARYDEEG 281
Query: 305 VDDAV-QLVVDRPYKAGE----SIVVWC---GPQPNSKLLINYGFVDED 345
+AV QL + + GE + V C +P S+++ +YGFV+ +
Sbjct: 282 AGNAVLQLRTGKKLRVGEEKLHADAVACRYGDEKPASEMVFSYGFVENE 330
>gi|328726082|ref|XP_001952202.2| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 241
Score = 38.9 bits (89), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 73/161 (45%), Gaps = 20/161 (12%)
Query: 73 KKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSL 132
+ ++ + L W KNG IL H ++ + + A++++ GD +VP +L
Sbjct: 81 RNDQSIEKLTKWATKNG-----AILNGVEIHQFENYA-YGMKANKNITVGDKLVTVPRAL 134
Query: 133 VVTLERV----LGNETIAELLTTNKLSELACLALYLMYEK-KQGKKSFWLPYIRELDRQR 187
++T E + L +++ N + LA++++ E ++ KKSFW Y+ L
Sbjct: 135 MMTEENIPSSPLWKLHSQDMMLRNMPN--VALAIFILVESLRKDKKSFWHSYLTTLP--- 189
Query: 188 GRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
+ +P+ + +L L GSP L+ I R+Y
Sbjct: 190 ----VTYSTPVYFDVADLEALKGSPAFEAALKLNRNIARQY 226
>gi|242210759|ref|XP_002471221.1| predicted protein [Postia placenta Mad-698-R]
gi|220729780|gb|EED83649.1| predicted protein [Postia placenta Mad-698-R]
Length = 256
Score = 38.9 bits (89), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 30/48 (62%)
Query: 308 AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
++ LV+ + AG + GP+PN++L++ YGF NP D +V+++
Sbjct: 77 SISLVLHNAHPAGAELFNNYGPKPNAELILGYGFALPHNPDDTIVLKL 124
>gi|156054286|ref|XP_001593069.1| hypothetical protein SS1G_05991 [Sclerotinia sclerotiorum 1980]
gi|154703771|gb|EDO03510.1| hypothetical protein SS1G_05991 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 418
Score = 38.9 bits (89), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 50/226 (22%), Positives = 94/226 (41%), Gaps = 24/226 (10%)
Query: 147 ELLTTNKLSELACLA-LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETEL 205
E L T K + + +LM + + +KS W YIR L + L + P+ W E +
Sbjct: 77 EFLETLKQDDPNIIGHFFLMQQYLKCEKSPWWQYIRLLPQPGDPKSLGI--PIWWPEEDQ 134
Query: 206 AYLTGSPT----------------KAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDI 249
+L G+ K +L R +EY+ + W A ++F +
Sbjct: 135 KFLAGTNAGPPLQKREQMWRDQWKKGVVLLRELPNHKEYSYILYQW--AATIFDSRSFR- 191
Query: 250 PTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAV 309
P+ E ++ + + H++ + + LV +G + K L++ ++
Sbjct: 192 PSLTICPEALSESSKEMDLNLDHVRNDRFSILYPLVDIGNHNGINQVEWKKDLSS--NSF 249
Query: 310 QLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPYDRLVVEV 355
LV G+ I + G + NS+LL+ YGF+ ++ +R VV +
Sbjct: 250 DLVHSAGVSEGDQIYNYYGNKSNSELLLGYGFILPNDIVNRNVVNL 295
>gi|412987667|emb|CCO20502.1| related to histone-lysine N-methyltransferase (ISS) [Bathycoccus
prasinos]
Length = 866
Score = 38.9 bits (89), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 62/269 (23%), Positives = 111/269 (41%), Gaps = 53/269 (19%)
Query: 114 AASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC---------LALYL 164
A +ED++ GD +P S +LE +E + + + + +A+++
Sbjct: 28 AVTEDVRRGDVLLEIPLSRCFSLESAQKSEMLTKAMAKAAAAAAGTRFTPTHDQYMAMFI 87
Query: 165 MYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETE--LAYLTGSPTKAEILERAE 222
+ E+ GK+S +I + + A + PL WSE E + L G+ T AE L E
Sbjct: 88 LLEQNLGKQSSHYEHILSIPK-------AYDLPLFWSEEERQRSLLFGTTTYAETLALDE 140
Query: 223 GIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFK--QAFVAVQSC----------- 269
+ ++Y L + F++ + T + FK +A + + C
Sbjct: 141 EVIQDYELLKH--HLGEDFFRE-------QNITMDRFKWVRATLWSRQCDLLRPAPETTR 191
Query: 270 -VVHLQKVSLARRFALVPLGPPL-LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWC 327
V + + + + VPLG L YS + ++ A A + P I
Sbjct: 192 LRVLIPEFDMFNHSSKVPLGSSHKLNYS---RGLVTAFATA-----NVPKGEQAYISYGS 243
Query: 328 GPQPNSKLLINYGFV---DEDNPYDRLVV 353
G +SKLL+ YGF + +NP+++L V
Sbjct: 244 GEASSSKLLLWYGFAPLNEGENPFEQLDV 272
>gi|414886517|tpg|DAA62531.1| TPA: hypothetical protein ZEAMMB73_960129 [Zea mays]
Length = 147
Score = 38.9 bits (89), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 43/74 (58%), Gaps = 3/74 (4%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNE-TIAELLTTNK--LSELACLALYLMYEKK 169
+AA+ DL+ G+ +P + ++T +RV ++ IA ++ +K LS + L + L+ E
Sbjct: 51 LAAARDLRRGELVLRLPRAALLTSDRVTADDPRIAACVSAHKPRLSSVQILIVCLLAEVG 110
Query: 170 QGKKSFWLPYIREL 183
+G S W PY+ +L
Sbjct: 111 KGSNSVWYPYLCQL 124
>gi|351694473|gb|EHA97391.1| SET domain-containing protein 3 [Heterocephalus glaber]
Length = 297
Score = 38.9 bits (89), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 40/72 (55%), Gaps = 4/72 (5%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA---CLALYLMYEKKQG 171
A+ D++AG+ VP LV+T+E N + L + +++ + LA +L+ ++
Sbjct: 112 ATRDIKAGELFLWVPRKLVMTVESA-KNSVLGPLYSQDRILQAMGNIALAFHLLLCERAS 170
Query: 172 KKSFWLPYIREL 183
SFWLPYI+ L
Sbjct: 171 PISFWLPYIQTL 182
>gi|195396323|ref|XP_002056781.1| GJ16703 [Drosophila virilis]
gi|194146548|gb|EDW62267.1| GJ16703 [Drosophila virilis]
Length = 539
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 50/245 (20%), Positives = 96/245 (39%), Gaps = 23/245 (9%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
+ DL G+ +VP L+ + E + + + L+ + L+ EK +G S
Sbjct: 151 TRDLAEGELVLTVPRQLIFSEELLPEAQRKLFIDFPTHLN----VTYMLIIEKVRGAASN 206
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL---- 231
W P+I L + + L ++ ++ L G+ + + I R Y +
Sbjct: 207 WQPFIDTLPTR-------YNTVLYFTVEQMQRLRGTSACSAAVRHCRVIARIYASMYKCA 259
Query: 232 -----DTVWFMAGSLFQQYP--YDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFAL 284
D+V +LF +Y Y++ A + +Q V Q V + AL
Sbjct: 260 YMQPDDSVMAGMANLFTEYGLCYELYRWAVSTVTTRQNLVPRQ-LATDSDGVRNSPMSAL 318
Query: 285 VPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE 344
+P K + ++ + +KAGE ++ G + N+ L+++GF+D
Sbjct: 319 IPFWDMANHRCGKITSYYKPSAQQMECIAQEAFKAGEQFFIYYGDRCNADRLVHHGFLDM 378
Query: 345 DNPYD 349
+N D
Sbjct: 379 NNLKD 383
>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase, chloroplastic; AltName:
Full=Aldolases N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase-like; Short=AtLSMT-L;
Short=LSMT-like enzyme; Flags: Precursor
gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
Length = 482
Score = 38.5 bits (88), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 104/246 (42%), Gaps = 34/246 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L+
Sbjct: 135 ESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
+ D+ + T + F AF ++S + + L+PL +
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR---GQNLVLIPLADLIN 237
Query: 292 ---------LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGF 341
AY K A L + D L KAGE + + + + N++L ++YGF
Sbjct: 238 HNPAIKTEDYAYEIKG-AGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGF 296
Query: 342 VDEDNP 347
V E NP
Sbjct: 297 V-ESNP 301
>gi|328352061|emb|CCA38460.1| hypothetical protein PP7435_Chr2-0775 [Komagataella pastoris CBS
7435]
Length = 846
Score = 38.5 bits (88), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 32/61 (52%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
++VP L A ++ C A L + + + +P K GE + G PNS++L YG+V
Sbjct: 254 SMVPFADTLNADTNLCNANLTYQSENLVMKAIKPIKKGEQVYNTYGNHPNSEILRRYGYV 313
Query: 343 D 343
+
Sbjct: 314 E 314
>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
Length = 482
Score = 38.5 bits (88), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 106/245 (43%), Gaps = 32/245 (13%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L+
Sbjct: 135 ESSWRLYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG---- 288
+ D+ + T + F AF ++S + + L+PL
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAFGILKSRAFSRLR---GQNLVLIPLADLIN 237
Query: 289 --PPLLA--YSSKCK-AMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFV 342
P + Y+ + K A L + D L KAGE + + + + N++L ++YGFV
Sbjct: 238 HNPAITTEDYAYEIKGAGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFV 297
Query: 343 DEDNP 347
E NP
Sbjct: 298 -ESNP 301
>gi|392594054|gb|EIW83379.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 508
Score = 38.5 bits (88), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 28/109 (25%), Positives = 48/109 (44%), Gaps = 17/109 (15%)
Query: 115 ASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELAC------LALYLMYEK 168
A +D+ G FS+P L ++L T+ LL ++ E L L +M+E+
Sbjct: 43 ALQDIHEGTTLFSLPRELTLSLR----TSTLPSLLGVDRWKEFGLNKGWVGLILCMMWEE 98
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEI 217
+G +S W Y+ L ++P+ WS +L L G+ +I
Sbjct: 99 SRGVESKWDVYLSSLPS-------TFDTPMFWSAEDLEELKGTAVPDKI 140
>gi|169595142|ref|XP_001790995.1| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
gi|160701026|gb|EAT91800.2| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
Length = 391
Score = 38.5 bits (88), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 26/48 (54%), Gaps = 5/48 (10%)
Query: 311 LVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDN-----PYDRLVV 353
+ DR YKAGE + V G N LL+ YGF+ + N P D L++
Sbjct: 203 VTADREYKAGEEVFVSYGAHTNDFLLVEYGFILDSNRNDAIPLDHLIL 250
>gi|449302028|gb|EMC98037.1| hypothetical protein BAUCODRAFT_67154 [Baudoinia compniacensis UAMH
10762]
Length = 381
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 25/41 (60%)
Query: 307 DAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
+ +++ +R Y AGE ++ G N KLL++YGF+ E P
Sbjct: 204 NGYEMLANRDYDAGEEVLFTYGAHSNDKLLVHYGFICESPP 244
>gi|428183324|gb|EKX52182.1| hypothetical protein GUITHDRAFT_150712, partial [Guillardia theta
CCMP2712]
Length = 205
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 28/115 (24%), Positives = 53/115 (46%), Gaps = 13/115 (11%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSF 175
++D+++ S+P+ L L R + +KL +AL ++YEK + ++SF
Sbjct: 88 TQDVKSNSVVCSIPSKLF--LSRSTTRAAFGSM--ADKLDVRTAMALQILYEKSKKEESF 143
Query: 176 WLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNE 230
W +++ L + G +P LW E + L G+ + E E K Y++
Sbjct: 144 WCEWLKVLPDRENLG-----TPCLWPEDDQNLLKGT----SVFEEVEASKSLYSK 189
>gi|449542715|gb|EMD33693.1| hypothetical protein CERSUDRAFT_56467 [Ceriporiopsis subvermispora
B]
Length = 510
Score = 38.5 bits (88), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 38/83 (45%), Gaps = 12/83 (14%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
A+VP+ L A A L + +++V +P AGE I G PNS LL YG V
Sbjct: 260 AMVPMADMLNARFESENAKLFYEEHYLKMVATKPINAGEQIWNTYGDPPNSDLLRRYGHV 319
Query: 343 D----------EDNPYDRLVVEV 355
D E NP D VVE+
Sbjct: 320 DVVPLGEPLSGEGNPAD--VVEI 340
>gi|403375581|gb|EJY87766.1| hypothetical protein OXYTRI_23666 [Oxytricha trifallax]
Length = 789
Score = 38.5 bits (88), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 43/194 (22%), Positives = 85/194 (43%), Gaps = 26/194 (13%)
Query: 62 TLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQA 121
T + + + K++E + + W+ NG+ V + P + + +AA +D+
Sbjct: 28 TFIHHEKTNLLKQQEKYVNFQKWLEDNGVLHPGV---DYPVAFGRQGQLIGMAARKDIPP 84
Query: 122 GDAAFSVPNSLVVTLERVLGNETIAELLTTN-------KLSELACLALYLMYEKKQGKKS 174
A VP L+++ E + N IA LL+ + + +E + ++ +E +G+ S
Sbjct: 85 QKAFLFVPQRLMIS-EVTVRNSKIAPLLSKHPEIFKHHQDAEYLVIIAFVWHELMKGEAS 143
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY-NELDT 233
FW PY + ++ + P+LWS+ E+ + +I + K EY NE
Sbjct: 144 FWHPYFQIIN--------LSDLPMLWSDQEIQEFQDQVLQKDI----QDYKVEYENEWKL 191
Query: 234 VW--FMAGSLFQQY 245
V+ F + +Y
Sbjct: 192 VYEAFSKDETYDEY 205
>gi|301112144|ref|XP_002905151.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262095481|gb|EEY53533.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 510
Score = 38.5 bits (88), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 63/141 (44%), Gaps = 10/141 (7%)
Query: 92 PCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTT 151
P +L+ P R Y+ A E+++ G S+P S V+++E + LL
Sbjct: 88 PMSTVLQ--PEGFNFGRGTAYITA-ENVEVGSELLSLPMSQVMSVESA-ARGRVGLLLEV 143
Query: 152 N-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTG 210
N L L L+L+ E+ G S + ++ L A+ S L +SE E+ L G
Sbjct: 144 NPDLPSAIALGLHLLEERALGAASNFSDFVATLPTIE-----AINSTLFYSEDEMNELEG 198
Query: 211 SPTKAEILERAEGIKREYNEL 231
S + L RA+ ++ Y+ L
Sbjct: 199 SQLQRFTLGRAQAVEAFYDAL 219
>gi|301099608|ref|XP_002898895.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262104601|gb|EEY62653.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 440
Score = 38.5 bits (88), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 10/143 (6%)
Query: 90 LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELL 149
L P +L+ P R Y+ +E+++ G S+P S V+++E + LL
Sbjct: 16 LAPMSTVLQ--PEGFNFGRGTAYIT-TENVEVGSVLLSLPMSQVMSVESA-ARGRVGLLL 71
Query: 150 TTN-KLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYL 208
N L L L+L+ E+ G S + ++ L A+ S L +SE E+ L
Sbjct: 72 EVNPDLPSAIALGLHLLEERALGAASNFSDFVATLPTIE-----AINSTLFYSEDEMKGL 126
Query: 209 TGSPTKAEILERAEGIKREYNEL 231
GS + L RA+ + Y+ L
Sbjct: 127 EGSQLQRFTLGRAQAVDAFYDAL 149
>gi|392569623|gb|EIW62796.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 509
Score = 38.1 bits (87), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 31/61 (50%)
Query: 283 ALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFV 342
A+VP+ L A A L + +++V +P KAGE I G PNS LL YG V
Sbjct: 264 AMVPMADMLNARFESENAKLFYDERELKMVSTKPIKAGEQIWNTYGDPPNSDLLRRYGHV 323
Query: 343 D 343
D
Sbjct: 324 D 324
>gi|311031127|ref|ZP_07709217.1| D-alanyl-alanine synthetase A [Bacillus sp. m3-13]
Length = 356
Score = 38.1 bits (87), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 75/174 (43%), Gaps = 19/174 (10%)
Query: 35 IRDPNFGSSLRLVRRKNRFSIRVSSSDTLVAGSREVVSKKEEDLGDLKSWMHKNGLPPCK 94
++ N GSS+ + + KNR + + + + +V + E +++ + N P C
Sbjct: 176 VKPANLGSSVGISKAKNREELETAFREAFDYDRKIIVEQGLEGAREIEIGVLGNDEPECS 235
Query: 95 VILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAEL-LTTNK 153
VI + P K Y A ED GD A +P + + E + E+ +T K
Sbjct: 236 VIGEIAP----KTEFYDYKAKYED---GDTAMIIPAEISEDI-----YEQVKEMAITAFK 283
Query: 154 LSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAY 207
+ A L + K+GK YI E++ G ++ PLLW T+++Y
Sbjct: 284 AIDGAGLVRADFFLTKEGKL-----YINEVNTMPGFTPFSM-FPLLWQHTDVSY 331
>gi|301119251|ref|XP_002907353.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262105865|gb|EEY63917.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 424
Score = 38.1 bits (87), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 109/248 (43%), Gaps = 24/248 (9%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLS---ELACLALYLMYEK- 168
V +ED+ FS+P V++++ + + + +L+ E LA+ L+YEK
Sbjct: 43 VFIAEDVTPHTEVFSIPLDSVLSVKSLQDISALQSITFFQQLTPEREDDQLAIALLYEKY 102
Query: 169 KQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREY 228
QG KS W +I L + + L + E+ L GS + E + +Y
Sbjct: 103 MQGDKSKWAKHIELLPK-------TYHNALYFEAGEIKALEGSNLFFIAQQMEEKVASDY 155
Query: 229 NEL-DTVWF-MAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQS-CVVHLQKVSLARRFALV 285
L ++V F + ++ + D+ E F+ + +K A + S V+ + K S A+V
Sbjct: 156 AVLKESVLFELFENITEGITVDLFDEIFSLDNYKWALSTIWSRFVLPVAKQSFK---AMV 212
Query: 286 PLGPPLLAYSSKCKAMLAAVDD----AVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGF 341
P+ L + +A ++ D +LV + + AG + + G N KLL YGF
Sbjct: 213 PVFDML---NHDPEAEMSHFFDMETQCFKLVSHQHWNAGAQMFINYGALSNHKLLSLYGF 269
Query: 342 VDEDNPYD 349
V N +D
Sbjct: 270 VIIGNLFD 277
>gi|255723006|ref|XP_002546437.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240130954|gb|EER30516.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 578
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 70/308 (22%), Positives = 121/308 (39%), Gaps = 39/308 (12%)
Query: 74 KEEDLGDLKSWMHKNG--LPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDAAFSVPNS 131
K E + L W NG L P V KE ++N +A S++ S+P
Sbjct: 3 KLEKINSLVKWAESNGAELSP-DVQFKEITTNNIGAIYDGKIAPSDN----GYPISIPFK 57
Query: 132 LVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQ 191
L++T + + E L +T + A L YL +E+ SF+ PY++ L
Sbjct: 58 LIITTQNAI-TEFGKYLKSTEDKNSNAILKFYLCHERINAD-SFYHPYLKLL-----PSL 110
Query: 192 LAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL-----DTV-----WFMAGSL 241
A++SP WS + YL G+ + E + E+ E+ D V F+
Sbjct: 111 AAIDSPYTWSAQDKTYLKGTNLGNSLKENLGSLVEEWWEVINLLKDEVSKPEQHFINMKF 170
Query: 242 FQQYPY-------------DIPT-EAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPL 287
+ + + DI +F ++ + +S +L S + A++
Sbjct: 171 YYDFKFYTDDDLDKYLNEEDINNWTSFPNYLWASLILKSRSFPAYLIDKSCNKNDAMLLP 230
Query: 288 GPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNP 347
LL ++ + K V D + G+ I G + N +LL+ YGF E NP
Sbjct: 231 VVDLLNHNPQAKVNW-DVSDGFFRFKSESIQPGKEIFNNYGLKGNEELLLAYGFCIEGNP 289
Query: 348 YDRLVVEV 355
D + +++
Sbjct: 290 RDSVALKI 297
>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
Length = 474
Score = 38.1 bits (87), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 58/240 (24%), Positives = 104/240 (43%), Gaps = 30/240 (12%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGK 172
+ A D+ + +P L + E V ++ I L L +AL+L+ EK + +
Sbjct: 79 LVARRDIGRNEVVLEIPKRLWINPETVTASK-IGPL--CGGLKPWVSVALFLIREKYE-E 134
Query: 173 KSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELD 232
+S W Y+ L + + +S + WSE ELA L G+ + L E ++ E+ +L+
Sbjct: 135 ESSWRVYLDMLPQ-------STDSTVFWSEEELAELKGTQLLSTTLGVKEYVENEFLKLE 187
Query: 233 TVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPL- 291
+ D+ + T + F AF +++ + ++ F + P +
Sbjct: 188 QEILLPNK-------DLFSSRITLDDFIWAF-----GILNRESLTSMFEFEQINHNPAIK 235
Query: 292 ---LAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVV-WCGPQPNSKLLINYGFVDEDNP 347
AY K A L + D L KAGE + + + + N++L ++YGFV E NP
Sbjct: 236 TEDYAYEIKG-AGLFSRDLLFSLKSPVYVKAGEQVYIQYDLNKSNAELALDYGFV-ESNP 293
>gi|403412960|emb|CCL99660.1| predicted protein [Fibroporia radiculosa]
Length = 508
Score = 38.1 bits (87), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 30/60 (50%)
Query: 284 LVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVD 343
+VP+ L A A L + +++V +P KAGE I G PNS LL YG VD
Sbjct: 258 MVPMADMLNARFGSENAKLFYEEHHLKMVTTKPIKAGEQIWNTYGDPPNSDLLRRYGHVD 317
>gi|407394128|gb|EKF26797.1| hypothetical protein MOQ_009497 [Trypanosoma cruzi marinkellei]
Length = 547
Score = 37.7 bits (86), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 44/95 (46%), Gaps = 10/95 (10%)
Query: 149 LTTNKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELA-- 206
L ++ + +AC+A Y+ YEKKQ + + L Y R L Q+ V++ LW+ L
Sbjct: 352 LDSSNMESIACIAAYMFYEKKQPEIALRL-YRRLL-------QMGVQTTELWNNLGLCCF 403
Query: 207 YLTGSPTKAEILERAEGIKREYNELDTVWFMAGSL 241
Y + L+RA E L VW+ G +
Sbjct: 404 YSSQYDIALSCLQRAIATSTEDETLADVWYNIGHI 438
>gi|348679693|gb|EGZ19509.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Phytophthora sojae]
Length = 606
Score = 37.7 bits (86), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 55/117 (47%), Gaps = 7/117 (5%)
Query: 116 SEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTN-KLSELACLALYLMYEKKQGKKS 174
+ED++ G S+P S V+++ + LL N L L L+L+ E+ G KS
Sbjct: 204 TEDVEQGAELLSLPMSKVMSVASA-ARGRVGLLLEVNPDLPPAIALGLHLLEEQALGAKS 262
Query: 175 FWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNEL 231
+ ++ L G A+ S L +SE +L + GS L RA+ ++ Y+ L
Sbjct: 263 NFSEFVSSLP-----GVEAINSTLFYSENQLKEMEGSQLLRYTLGRAQAVEAFYDAL 314
>gi|169606334|ref|XP_001796587.1| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
gi|160706968|gb|EAT86035.2| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
Length = 634
Score = 37.7 bits (86), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 47/199 (23%), Positives = 86/199 (43%), Gaps = 27/199 (13%)
Query: 162 LYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERA 221
L L+ ++ +GK+S W YI L G ++ +PL + + ++A+L G+ ER
Sbjct: 105 LLLIEQRNKGKESPWHAYIACLP-----GAESMTTPLWFDDEDMAFLAGTSLAPAAKERK 159
Query: 222 EGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIF-KQAFVAVQSCVVHLQKVSLAR 280
+++ + + AG D + + IF +AF++ H
Sbjct: 160 SLYYQQWEQALGIMKDAGVALAD-EVDFESLLWAATIFTSRAFISTHILPDH-------- 210
Query: 281 RFALVPLGPPL---LAYSSKCKAM-----LAAVDDAVQLVVDRPYKAGESIVVWCGPQPN 332
VPL P+ L +S K LA+ +++L+ + AG+ + P+ N
Sbjct: 211 --ETVPLLFPIVDILNHSVSAKVEWEFQPLASF--SLKLLEGDTFTAGQELFNNYAPKQN 266
Query: 333 SKLLINYGFVDEDNPYDRL 351
+LL+ YGF E NP ++
Sbjct: 267 DELLLGYGFCLEHNPIEQF 285
>gi|412991387|emb|CCO16232.1| predicted protein [Bathycoccus prasinos]
Length = 622
Score = 37.7 bits (86), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 22/64 (34%), Positives = 37/64 (57%), Gaps = 9/64 (14%)
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILE 219
L++YL+ E ++G+ SF+ PY+ L R +ESPL WS ++ + GS ++L+
Sbjct: 204 LSVYLVCELEKGESSFYAPYLSYL-----REATVLESPLFWSTEDVNAIAGS----QLLD 254
Query: 220 RAEG 223
A G
Sbjct: 255 DAAG 258
>gi|335300684|ref|XP_003358991.1| PREDICTED: SET domain-containing protein 4 [Sus scrofa]
Length = 440
Score = 37.7 bits (86), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 66/297 (22%), Positives = 113/297 (38%), Gaps = 36/297 (12%)
Query: 65 AGSREVVSKKEEDLGDLKSWMHKNGLPPCKVILKEKPSHNEKHRPIHYVAASEDLQAGDA 124
A SR V + + +LK W+ +I P + + LQ G
Sbjct: 20 AESRGVNESYKPEFIELKKWLKDRNFEDTNLIPARFPGTGRG------LMSKTSLQEGQL 73
Query: 125 AFSVPNSLVVTLERVLGNETIAELLTTNKL-SELACLALYLMYEKKQGKKSFWLPYIREL 183
++P S ++T + VL + + S L L +L+ EK G +S W PY+ L
Sbjct: 74 VIALPESCLLTTDTVLRSYLGPYIAKWQPPPSPLLALCTFLVSEKHAGDQSPWKPYLEVL 133
Query: 184 DRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQ 243
+ P+ E E+ L P K++ E+ R + + SL
Sbjct: 134 PK-------TYTCPVC-LEPEVVNLLPGPLKSKAREQR---TRVWEFFSSSRDFFSSLQP 182
Query: 244 QYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQKVSLARR----FALVP----LGP--PLLA 293
+P + + IF + + C V+ + V + +R F+ P L P LL
Sbjct: 183 LFPEAVES------IFSYSALLWAWCTVNTRAVYMKQRPRQCFSTEPDTCALAPYLDLLN 236
Query: 294 YSS--KCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDEDNPY 348
+S + KA ++ + E + + GP + +LL+ YGFV NP+
Sbjct: 237 HSPAVQVKAAFNEESRCYEIRTGTSCRKHEEVFICYGPHGSHRLLLEYGFVSPRNPH 293
>gi|350629837|gb|EHA18210.1| hypothetical protein ASPNIDRAFT_38188 [Aspergillus niger ATCC 1015]
Length = 480
Score = 37.7 bits (86), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
A+ P +SE EL L GS + + + +++E+ L ++Y +D T
Sbjct: 125 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 183
Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 308
TF +K +S +V L + A+VP + A KA+ D
Sbjct: 184 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 239
Query: 309 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 344
+QL R +A E + + G + + S+L+ +YGF+DE
Sbjct: 240 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 276
>gi|317030555|ref|XP_001392774.2| SET domain protein [Aspergillus niger CBS 513.88]
Length = 473
Score = 37.7 bits (86), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
A+ P +SE EL L GS + + + +++E+ L ++Y +D T
Sbjct: 143 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 201
Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 308
TF +K +S +V L + A+VP + A KA+ D
Sbjct: 202 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 257
Query: 309 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 344
+QL R +A E + + G + + S+L+ +YGF+DE
Sbjct: 258 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 294
>gi|340503061|gb|EGR29686.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 286
Score = 37.7 bits (86), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 31/121 (25%), Positives = 57/121 (47%), Gaps = 15/121 (12%)
Query: 107 HRPIHYVAASEDLQAGDAAFSVPNSLVVTLERVLGNETIAELLTTNKLSELA----CLAL 162
+R IH + + + + +P ++TLE N TI + + + L+ L++
Sbjct: 149 YRGIH---SKQKINKDETILFIPQKYMITLELCKQN-TICKQIEQRNIKLLSPKHSILSI 204
Query: 163 YLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTKAEILERAE 222
Y++ EKK SFW PY+ L + P+L++E E+ +L GS +I E+ +
Sbjct: 205 YILSEKK-NPNSFWKPYLDILPCE------FTTFPILYTEEEIQWLKGSLIINQIYEKNK 257
Query: 223 G 223
Sbjct: 258 A 258
>gi|134077289|emb|CAK45629.1| unnamed protein product [Aspergillus niger]
Length = 498
Score = 37.7 bits (86), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 69/157 (43%), Gaps = 10/157 (6%)
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
A+ P +SE EL L GS + + + +++E+ L ++Y +D T
Sbjct: 143 AISLPTFYSEEELELLRGSSLRLAVHAKIASLEKEFEHLRRS-TEGLDWCEKYWWDEDTG 201
Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVP-LGPPLLAYSSKCKAMLAAVDDA--- 308
TF +K +S +V L + A+VP + A KA+ D
Sbjct: 202 KLTFNDWKYVDALYRSRMVDLPR----HGHAMVPCIDMANHASEGTVKALYDEDADGNAV 257
Query: 309 VQLVVDRPYKAGESIVVWCGPQPN-SKLLINYGFVDE 344
+QL R +A E + + G + + S+L+ +YGF+DE
Sbjct: 258 LQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDE 294
>gi|323473309|gb|ADX78230.1| CIA6 [Chlamydomonas reinhardtii]
Length = 699
Score = 37.7 bits (86), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 40/86 (46%), Gaps = 13/86 (15%)
Query: 113 VAASEDLQAGDAAFSVPNSLVVTLERVLGNETIA-------------ELLTTNKLSELAC 159
V A+ DL GD VP L++T + +A ++ L+
Sbjct: 32 VVATRDLAPGDTVLRVPGRLLMTTRSARADPALAAALSRLAASPDPDDVAAAAALTPHQL 91
Query: 160 LALYLMYEKKQGKKSFWLPYIRELDR 185
LA +L++E +G +SFW PY+++L R
Sbjct: 92 LATHLLHEVSKGPESFWHPYLQQLPR 117
>gi|320166344|gb|EFW43243.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 514
Score = 37.4 bits (85), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 80/192 (41%), Gaps = 19/192 (9%)
Query: 152 NKLSELACLALYLMYEKKQGKKSFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGS 211
N + + LAL LMYE+ + S W ++R L +ES L W++ EL +
Sbjct: 132 NAIDPMTALALGLMYERSRA-DSPWRAWLRMLPD-------PIESMLEWNDVELWPVEQL 183
Query: 212 PTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVV 271
K ER ++ Y + T Y D+ FT E F A V Q+ +
Sbjct: 184 YVKELREERIRNLEAVYESVIT------PFIDTYESDLVGVDFTIEAFVWAAVIAQTRGL 237
Query: 272 HLQKVSLARRFALVPLGPPLLAYSSKCKAMLAAVDDAVQLVVDRPYKAGESIVVWCGPQP 331
H S +L+P+ ++ + + A++ A + + KAGE I +
Sbjct: 238 H---ESEKNGLSLLPIV-DMINHHREPNAVVVASGPNILVRTKTSLKAGEEITI-DYEMS 292
Query: 332 NSKLLINYGFVD 343
+ LL+ YGFV+
Sbjct: 293 SHVLLLLYGFVE 304
>gi|147777505|emb|CAN60498.1| hypothetical protein VITISV_027869 [Vitis vinifera]
Length = 2077
Score = 37.4 bits (85), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 39/172 (22%), Positives = 72/172 (41%), Gaps = 20/172 (11%)
Query: 193 AVESPLLWSETELAYLTGSPTKAEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 252
A + L + + L G+ EI+E + + +Y EL +L + +P P E
Sbjct: 1483 AFNTGLSFEFDAIMVLAGTLLLEEIIEAKKHLNAQYEEL------VPALCKDHPDIFPPE 1536
Query: 253 AFTFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLG--------PPLLAYSSKCKAMLAA 304
+T E F A S + + R L+P+ P ++ Y + +
Sbjct: 1537 FYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHSLYPHIMHYGK-----VDS 1591
Query: 305 VDDAVQLVVDRPYKAGESIVVWCGPQPNSKLLINYGFVDE-DNPYDRLVVEV 355
++++ V +P GE + G +S L+ YGF+ + DN YD + +E+
Sbjct: 1592 KTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIPQGDNLYDTIPLEI 1643
>gi|308804211|ref|XP_003079418.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116057873|emb|CAL54076.1| N-methyltransferase (ISS), partial [Ostreococcus tauri]
Length = 305
Score = 37.4 bits (85), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 48/202 (23%), Positives = 78/202 (38%), Gaps = 9/202 (4%)
Query: 156 ELACLALYLMYEKKQGKK-SFWLPYIRELDRQRGRGQLAVESPLLWSETELAYLTGSPTK 214
E L + +M E +G+K S W Y+R + G S L W + + L G+ T
Sbjct: 47 EKELLVIAVMCEMLRGEKNSEWGAYLRVVREGAENGH----SILAWDDEQAEALEGTDTW 102
Query: 215 AEILERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFTFEIFKQAFVAVQSCVVHLQ 274
+ E + D W LF + ++ E+ + F A +
Sbjct: 103 FDAYENDDEGLDLPTMTDEHWEHVVRLFFERNPELARGMDEDELRELHFAATAMVAGYSF 162
Query: 275 KVSLARRFALVPLGPPLLAYSSKCKAMLAAVDD---AVQLVVDRPYKAGESIVVWCGPQP 331
+ +VP +L ++ C A + D +Q++ R K GE + GP
Sbjct: 163 TLGDDEIQGMVPFWD-MLNHAPPCAASVRLNHDPKRGLQMITVREVKKGEEVFNTYGPLR 221
Query: 332 NSKLLINYGFVDEDNPYDRLVV 353
N++LL YGFV NP+ V
Sbjct: 222 NAELLRRYGFVLARNPHGGTTV 243
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.395
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,498,125,528
Number of Sequences: 23463169
Number of extensions: 218928146
Number of successful extensions: 600284
Number of sequences better than 100.0: 832
Number of HSP's better than 100.0 without gapping: 268
Number of HSP's successfully gapped in prelim test: 564
Number of HSP's that attempted gapping in prelim test: 599227
Number of HSP's gapped (non-prelim): 1065
length of query: 355
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 212
effective length of database: 9,003,962,200
effective search space: 1908839986400
effective search space used: 1908839986400
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)