BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000955
(1210 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359494102|ref|XP_002276879.2| PREDICTED: uncharacterized protein LOC100243584 [Vitis vinifera]
Length = 1260
Score = 956 bits (2471), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 465/745 (62%), Positives = 574/745 (77%), Gaps = 19/745 (2%)
Query: 3 GFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEE 62
GFIVD M+ED + +T E S+EE++LFDSVCS CDNGGDLLCCEGRC+RSFHAT +AGEE
Sbjct: 196 GFIVDYMDEDGISETGEVGSDEEEDLFDSVCSMCDNGGDLLCCEGRCMRSFHATKEAGEE 255
Query: 63 SHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 122
S CA+LG++ +VEAM NF+CKNC+YKQHQCF+CGKLGSSDK +GAEVF C +ATCG FY
Sbjct: 256 SLCATLGMSVAQVEAMQNFYCKNCKYKQHQCFSCGKLGSSDKSSGAEVFLCANATCGRFY 315
Query: 123 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCP 182
HP CV+KLL R+DE AA++L K+I AGE F CP+H+C +CKQGE+K D +LQFA+CRRCP
Sbjct: 316 HPQCVAKLLHREDEAAAEELQKNIYAGELFACPIHRCHVCKQGEDKKDLELQFAICRRCP 375
Query: 183 KAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 242
K+YHRKCLPRKI+FED EEGII RAW+GLLPN RILIYCLKHEID+ +GTPIRDHI FP
Sbjct: 376 KSYHRKCLPRKISFEDLDEEGIIQRAWDGLLPN-RILIYCLKHEIDELLGTPIRDHIKFP 434
Query: 243 GIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRV---PSKAG 299
EE R+ + +S KV S K SL S+ P+ + +VKA K+V S
Sbjct: 435 NDEEKM----EKRRSELFSSRKDLDKVVSKKRSLVSEDSPRERMAVKATKQVEKLSSTVK 490
Query: 300 QGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLY 359
G++ + SE+ G D S+R K T S+KS NVKS+S +VD+SS D KTSLGE+LY
Sbjct: 491 DGDSTKKSEKRSSGPDPSKRLKVTGFSKKSLDDNVKSISKKVDKSSMADENKTSLGEQLY 550
Query: 360 AAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAAS 419
A +++ K+D + E + V K K S LPSLD DS+ R+ +++K++ S
Sbjct: 551 ALI----KNRSEPRKEDTPNSELEQKVVTK----KTSSSLPSLDRDSENRILAIIKESKS 602
Query: 420 SVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKA 479
+ +E+++K+HK+PSTHAY+SK+ VD+ IT GKVEGS+EA+R ALKKL+ G SIEDAKA
Sbjct: 603 LITLEDVMKKHKVPSTHAYSSKNTVDRTITQGKVEGSIEALRAALKKLE-GGGSIEDAKA 661
Query: 480 VCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMI 539
VCEPEVL+QI KWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKL+ IV+KLH+YV +GD I
Sbjct: 662 VCEPEVLNQIVKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLKEIVEKLHYYVKNGDTI 721
Query: 540 VDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRL 599
VDFCCGANDFSCLMK+KL+E GK C YKNYD++ KNDFNFEKRDWM+V+ KEL GS+L
Sbjct: 722 VDFCCGANDFSCLMKQKLEEMGKKCSYKNYDVIQPKNDFNFEKRDWMSVKQKELPTGSQL 781
Query: 600 IMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSG 659
IMGLNPPFGVKA LAN FINKAL+F PKLLILIVPPETERLD+K Y+L+WEDD LSG
Sbjct: 782 IMGLNPPFGVKASLANMFINKALQFKPKLLILIVPPETERLDKKRPPYDLIWEDDNELSG 841
Query: 660 KSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERN 719
KSFYLPGSVD NDKQ++QWN+ P LYLWSR D+ H+A+A+K GH+SR + + +E+
Sbjct: 842 KSFYLPGSVDVNDKQIEQWNVNPPLLYLWSRQDWTTKHRAIAQKCGHVSRRRRVSHLEKI 901
Query: 720 CYETHAVDHP--KEEGQGDASMLID 742
E +DHP + G SM++D
Sbjct: 902 QNEEPVLDHPMADQTHSGHVSMMLD 926
Score = 131 bits (329), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/253 (38%), Positives = 133/253 (52%), Gaps = 38/253 (15%)
Query: 995 SPAKVIAKSSLEDHSSKSIDIPSQTGFGSD-YQHHEPTRSSSHIGTTYYG-TQAGIPNDM 1052
SP A+S+++ H +++ S G + Y H +P S + T YG + A IP DM
Sbjct: 1009 SPRASDARSTVDIHQPEALKKSSPVEVGEEVYPHFQPGVPDSSLQRTGYGGSHASIPEDM 1068
Query: 1053 GS-YGMSS--------------LNNGLSHGA-NLDERYTGYVRNT-DSLGYRPSM----S 1091
Y + S ++ GL +G N +E +T Y+R + D+LGYR S+
Sbjct: 1069 ARRYRLDSEEPFSSTIHRWSTGVSPGLDYGIRNSEEPFTSYMRGSIDNLGYRHSIRDRDE 1128
Query: 1092 TDRELTMWPLARIYGQDFP-----------APTPGYGQMGSVPSNLYGNLGSSAEASY-R 1139
R + + YG P P +GQMGS PS YG+ GS AE+SY R
Sbjct: 1129 YGRNADIRSQVQSYGLHDPIGMSQRSNYLAGQDPRFGQMGSFPST-YGHPGSGAESSYSR 1187
Query: 1140 MSTSAMDRYAPRLHQLNNTRMNTFRSEPFMPSRFGFYDSRA-PQPGFFAD-MDFGPGFHP 1197
M+TSAM RYAP+L +LN+TRMN+F E MP R YD A P+PGF AD M F PG H
Sbjct: 1188 MNTSAMQRYAPQLDELNHTRMNSFGYERPMPIRNNIYDPLAPPRPGFQADSMGFAPGLHH 1247
Query: 1198 PFPQQGSGGWLDD 1210
PF +Q S GWL++
Sbjct: 1248 PFSKQNSSGWLNE 1260
>gi|147814981|emb|CAN61366.1| hypothetical protein VITISV_020586 [Vitis vinifera]
Length = 2238
Score = 954 bits (2466), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 511/954 (53%), Positives = 643/954 (67%), Gaps = 42/954 (4%)
Query: 3 GFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEE 62
GFIVD M+ED + +T E S+EE++LFDSVCS CDNGGDLLCCEGRC+RSFHAT +AGEE
Sbjct: 841 GFIVDYMDEDGISETGEVGSDEEEDLFDSVCSMCDNGGDLLCCEGRCMRSFHATKEAGEE 900
Query: 63 SHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 122
S CA+LG++ +VEAM NF+CKNC+YKQHQCF+CGKLGSSDK +GAEVF C +ATCG FY
Sbjct: 901 SLCATLGMSVAQVEAMQNFYCKNCKYKQHQCFSCGKLGSSDKSSGAEVFLCANATCGRFY 960
Query: 123 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCP 182
HP CV+KLL R+DE AA+ L K+I AGE F CP+H+C +CKQGE+K D +LQFA+CRRCP
Sbjct: 961 HPQCVAKLLHREDEAAAEDLQKNIYAGELFACPIHRCHVCKQGEDKKDLELQFAICRRCP 1020
Query: 183 KAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 242
K+YHRKCLPRKI+FED EEGII RAW+GLLPN RILIYCLKHEID+ +GTPIRDHI FP
Sbjct: 1021 KSYHRKCLPRKISFEDLDEEGIIQRAWDGLLPN-RILIYCLKHEIDELLGTPIRDHIKFP 1079
Query: 243 GIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRV---PSKAG 299
EE R+ + +S KV S K SL S+ P+ + +VKA K+V S
Sbjct: 1080 NDEEKM----EKRRSELFSSRKDLDKVVSKKRSLVSEDSPRERMAVKATKQVEKLSSTVK 1135
Query: 300 QGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLY 359
G++ + SE+ G D S+R K T S+KS NVKS+S +VD+SS D KTSLGE+LY
Sbjct: 1136 DGDSTKKSEKRSSGPDPSKRLKVTGFSKKSLDDNVKSISKKVDKSSMADENKTSLGEQLY 1195
Query: 360 AAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAAS 419
A + K+D + E + V K K S LPSLD DS+ R+ +++K++ S
Sbjct: 1196 ALIKNRSEPR----KEDTPNSELEQKVVTK----KTSSSLPSLDRDSENRILAIIKESKS 1247
Query: 420 SVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKA 479
+ +E+++K+HK+PSTHAY+SK+ VD+ IT GKVEGS+EA+R ALKKL+ G SIEDAKA
Sbjct: 1248 LITLEDVMKKHKVPSTHAYSSKNTVDRTITQGKVEGSIEALRAALKKLE-GGGSIEDAKA 1306
Query: 480 VCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMI 539
VCEPEVL+QI KWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKL+ IV+KLH+YV +GD I
Sbjct: 1307 VCEPEVLNQIVKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLKEIVEKLHYYVKNGDTI 1366
Query: 540 VDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRL 599
VDFCCGANDFSCLMK+KL+E GK C YKNYD++ KNDFNFEKRDWM+V+ KEL GS+L
Sbjct: 1367 VDFCCGANDFSCLMKQKLEEMGKKCSYKNYDVIQPKNDFNFEKRDWMSVKQKELPTGSQL 1426
Query: 600 IMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSG 659
IMGLNPPFGVKA LAN FINKAL+F PKLLILIVPPETERLD+K Y+L+WEDD LSG
Sbjct: 1427 IMGLNPPFGVKASLANMFINKALQFKPKLLILIVPPETERLDKKRPPYDLIWEDDNELSG 1486
Query: 660 KSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERN 719
KSFYLPGSVD NDKQ++QWN+ P LYLWSR D+ H+A+A+K GH+SR + + +E+
Sbjct: 1487 KSFYLPGSVDVNDKQIEQWNVNPPLLYLWSRQDWTTKHRAIAQKCGHVSRRRRVSHLEKI 1546
Query: 720 CYETHAVDHPK--EEGQGDASMLIDLPLQINVTKELRNEAREDDKAGFPDNATEGGGESS 777
E +DHP + G SM++D E E E + + T G ESS
Sbjct: 1547 QNEEPVLDHPMADQTHSGHVSMMLD---------EHSVENHELEHEERREIVTAGRVESS 1597
Query: 778 -HGHGDNQSGKTSRKRKRDRKKHGSG---MRENSPLDGQNRGRHLASGIHGMSKHSPANI 833
H D + + +++HG G R + D + ++ G S S
Sbjct: 1598 PHSGVDREDHGKKLLNENSKQRHGKGKHEKRTENISDDKQIMTPVSEMCKGTSCTSSPRA 1657
Query: 834 ANVSPLLEGHSSKSIDMPSHVGSGDNDCQHFSNKGMPLSSPTIVIDGTSPLGVHSSKTIE 893
++ ++ H +++ S V G+ HF G+P SS + T G H+S +
Sbjct: 1658 SDARSTVDIHQPEALKKSSPVEVGEEVYPHF-QPGVPDSS----LQRTGYGGSHASIPED 1712
Query: 894 MLSH---DDGCQCDQMPHRSPVNVSSGILEG--ESSKPIEIPALSGIGDGGYEH 942
M D HR VS G+ G S +P I + GY H
Sbjct: 1713 MARRYRLDSEEPFSSTIHRWSTGVSPGLDYGIRNSEEPFTSYMRGSIDNLGYRH 1766
Score = 120 bits (301), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 95/247 (38%), Positives = 128/247 (51%), Gaps = 38/247 (15%)
Query: 995 SPAKVIAKSSLEDHSSKSIDIPSQTGFGSD-YQHHEPTRSSSHIGTTYYG-TQAGIPNDM 1052
SP A+S+++ H +++ S G + Y H +P S + T YG + A IP DM
Sbjct: 1654 SPRASDARSTVDIHQPEALKKSSPVEVGEEVYPHFQPGVPDSSLQRTGYGGSHASIPEDM 1713
Query: 1053 GS-YGMSS--------------LNNGLSHGA-NLDERYTGYVRNT-DSLGYRPSM----S 1091
Y + S ++ GL +G N +E +T Y+R + D+LGYR S+
Sbjct: 1714 ARRYRLDSEEPFSSTIHRWSTGVSPGLDYGIRNSEEPFTSYMRGSIDNLGYRHSIRDRDE 1773
Query: 1092 TDRELTMWPLARIYGQDFP-----------APTPGYGQMGSVPSNLYGNLGSSAEASY-R 1139
R + + YG P P +GQMGS PS YG+ GS AE+SY R
Sbjct: 1774 YGRNADIRSQVQSYGLHDPIGMSQRSNYLAGQDPRFGQMGSFPST-YGHPGSGAESSYSR 1832
Query: 1140 MSTSAMDRYAPRLHQLNNTRMNTFRSEPFMPSRFGFYDSRA-PQPGFFAD-MDFGPGFHP 1197
M+TSAM RYAP+L +LN+TRMN+F E MP R YD A P+PGF AD M F PG H
Sbjct: 1833 MNTSAMQRYAPQLDELNHTRMNSFGYERPMPIRNNIYDPLAPPRPGFQADSMGFAPGLHH 1892
Query: 1198 PFPQQGS 1204
PF +Q S
Sbjct: 1893 PFSKQNS 1899
>gi|255576262|ref|XP_002529024.1| protein binding protein, putative [Ricinus communis]
gi|223531504|gb|EEF33335.1| protein binding protein, putative [Ricinus communis]
Length = 1249
Score = 911 bits (2355), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 485/846 (57%), Positives = 589/846 (69%), Gaps = 28/846 (3%)
Query: 31 SVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 90
SVC+FCDNGG+LLCC+G C+RSFHAT +AGEES C SLG T+ EVEA F+CKNCEYKQ
Sbjct: 222 SVCTFCDNGGELLCCDGSCMRSFHATKEAGEESMCVSLGFTEREVEATERFYCKNCEYKQ 281
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG- 149
HQCFACG+LGSSDK +GAEVF C +ATCG+FYHP C++KLL ++DEVAA +L K I AG
Sbjct: 282 HQCFACGELGSSDKLSGAEVFRCANATCGYFYHPSCIAKLLHQEDEVAAKELQKKIAAGK 341
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAW 209
ESFTCP+HKCC+CKQGENK +LQFAVCRRCP +YHRKC+P +I FE K E I RAW
Sbjct: 342 ESFTCPIHKCCVCKQGENKKIRELQFAVCRRCPTSYHRKCMPSEIVFEKKKGEEEI-RAW 400
Query: 210 EGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKV 269
E LLPN RILIYCLKHEI D +GTPIRD I FP IEE K K Q P +K
Sbjct: 401 EDLLPN-RILIYCLKHEIIDYLGTPIRD-IRFPDIEEKK-------KTQISDLPGSSEKD 451
Query: 270 ASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKS 329
+ K LTS+ G +K +K S A + ++ SE+L GS RR K D SRKS
Sbjct: 452 LAKKRRLTSEDLFSGDAVIKKVKDSSSGARKVTNIKKSEKLSPGSTFLRRVKERDASRKS 511
Query: 330 FKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVK 389
K +KS S+++DRS++ + KTSLG++L+ + + +EQ GK+D E + VTVK
Sbjct: 512 LKEKMKSTSIELDRSATANLNKTSLGDKLFD--IMKRSEQVHNGKKDVHTNEIDKPVTVK 569
Query: 390 PLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHK--IPSTHAYASKSAVDKA 447
KL ELPSLD D++RRL +LMK+++S + ME++ K H+ IPSTHAY+ ++ +KA
Sbjct: 570 A-STKLSDELPSLDADTERRLLALMKESSSLISMEDVRKTHQVHIPSTHAYSLRTVCEKA 628
Query: 448 ITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGM 507
IT GKVEG+VEA+RTALKKL+ DG S EDAKAVC P LSQ+FKWK+KL+VYLAPFL+GM
Sbjct: 629 ITAGKVEGAVEAVRTALKKLE-DGCSTEDAKAVCGPANLSQVFKWKSKLRVYLAPFLNGM 687
Query: 508 RYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYK 567
RYTSFGRHFTKV+KL+ I + LHWYV DGD IVDFCCGANDFSCLMKKKL++T K C YK
Sbjct: 688 RYTSFGRHFTKVEKLEEITNLLHWYVEDGDTIVDFCCGANDFSCLMKKKLEQTRKTCSYK 747
Query: 568 NYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPK 627
NYD++ KNDFNFEKRDWMTV P+EL P LIMGLNPPFGVKA LANKFINKALEF PK
Sbjct: 748 NYDVIQPKNDFNFEKRDWMTVRPEEL-PKEGLIMGLNPPFGVKAALANKFINKALEFKPK 806
Query: 628 LLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYL 687
LLILIVPPETERLD+K+S Y LVWEDD+F+SGKSFYLPGS+DENDK+MDQWN+T PPLYL
Sbjct: 807 LLILIVPPETERLDKKDSPYNLVWEDDRFVSGKSFYLPGSIDENDKRMDQWNLTTPPLYL 866
Query: 688 WSRHDYAAHHKALAEKHGHISRPQSRTQMERNCYETHAVDHPKE--EGQGDASMLIDLPL 745
WSR D+ H A+A+K GH+S + + + N ET DHP E + DAS L D
Sbjct: 867 WSRPDWHEKHLAIAQKQGHLSGQREGSSSKENYPETMTYDHPLEVYSSKADASELTDDDR 926
Query: 746 QINVTKELRN-----EAREDDKAGFPDNATEGGGESSHGHGDNQSG-KTSRKRKRDRKKH 799
+ KEL+ E K P + E S+G +QS KT RKRK K
Sbjct: 927 LVQ-NKELKEPNDNISVAEGSKECSPHDNGSRESEDSYGPERSQSKEKTLRKRKHGEDKL 985
Query: 800 GSGMRENSPLDGQNRGRHLASGIHGMSKH-SPANIANVSPLLEGHSSKSIDMPSHVGSGD 858
G G E P Q + S + +H SP + N EG +S+S +M H G
Sbjct: 986 GRGTSEKLPKTRQTGAKPPRSNTYRGIRHCSPPKMVNSRSSQEGLTSRSFEMTPHAEVGK 1045
Query: 859 NDCQHF 864
+F
Sbjct: 1046 TSSPNF 1051
Score = 123 bits (309), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 94/244 (38%), Positives = 133/244 (54%), Gaps = 26/244 (10%)
Query: 989 KGVPRCSPAKVI-AKSSLEDHSSKSIDIPSQTGFG-SDYQHHEPTRSSSHIGTTYYGTQA 1046
+G+ CSP K++ ++SS E +S+S ++ G + + E SSH+ + GT
Sbjct: 1010 RGIRHCSPPKMVNSRSSQEGLTSRSFEMTPHAEVGKTSSPNFESGMFSSHMPS---GTAC 1066
Query: 1047 G--IPNDMGSYGMSSLNN--------GLSHGANLDERYTGYVR-NTDSLGYRPSMSTDRE 1095
G N G S+N+ G SH NLDER TG +R +T+++GYR + RE
Sbjct: 1067 GNLTSNHDGVGRKFSMNSDEYLQGIHGFSH-PNLDERSTGPIRESTENIGYRSYVMGLRE 1125
Query: 1096 LTMWPLARIYGQ--DFPAPT----PGYGQMGSVPSNLYGNLGSSAEASYRMSTSAMDRYA 1149
+ + YGQ D A PGYG+MGS PS LY +LG+ ++ YRM+TSAM RYA
Sbjct: 1126 SDLRSQVQQYGQHPDSSAQRNFHDPGYGRMGSAPSMLYRHLGTPSDPLYRMNTSAMQRYA 1185
Query: 1150 PRLHQLNNTRMNTFRSEPFMPSRFGFYD--SRAPQPGFFAD-MDFGPGFHPPFPQQGSGG 1206
PRL +LN+T M F +P M R G Y+ P PG+ D M+F PG H P+ S G
Sbjct: 1186 PRLDELNHTMMGDFSPDPSMMHRNGMYNPRPPQPPPGYHIDSMNFAPGPHRPYSHHNSAG 1245
Query: 1207 WLDD 1210
WL++
Sbjct: 1246 WLNE 1249
>gi|297737514|emb|CBI26715.3| unnamed protein product [Vitis vinifera]
Length = 1201
Score = 892 bits (2304), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 486/951 (51%), Positives = 611/951 (64%), Gaps = 94/951 (9%)
Query: 3 GFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEE 62
GFIVD M+ED + +T E S+EE++LFDSVCS CDNGGDLLCCEGRC+RSFHAT +AGEE
Sbjct: 195 GFIVDYMDEDGISETGEVGSDEEEDLFDSVCSMCDNGGDLLCCEGRCMRSFHATKEAGEE 254
Query: 63 SHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 122
S CA+LG++ +VEAM NF+CKNC+YKQHQCF+CGKLGSSDK +GAEVF C +ATCG FY
Sbjct: 255 SLCATLGMSVAQVEAMQNFYCKNCKYKQHQCFSCGKLGSSDKSSGAEVFLCANATCGRFY 314
Query: 123 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCP 182
HP CV+KLL R+DE AA++L K+I AGE F CP+H+C +CKQGE+K D +LQFA+CRRCP
Sbjct: 315 HPQCVAKLLHREDEAAAEELQKNIYAGELFACPIHRCHVCKQGEDKKDLELQFAICRRCP 374
Query: 183 KAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 242
K+YHRKCLPRKI+FED EEGII RAW+GLLPN RILIYCLKHEID+ +GTPIRDHI FP
Sbjct: 375 KSYHRKCLPRKISFEDLDEEGIIQRAWDGLLPN-RILIYCLKHEIDELLGTPIRDHIKFP 433
Query: 243 GIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGE 302
EE R+ + +S KV S K SL S+ P G + K+ KR
Sbjct: 434 NDEEKM----EKRRSELFSSRKDLDKVVSKKRSLVSEDSPHGDSTKKSEKRSS------- 482
Query: 303 TMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAF 362
G D S+R K T S+KS N D+ + L +++
Sbjct: 483 ----------GPDPSKRLKVTGFSKKSLDDN--------------DTPNSELEQKV---- 514
Query: 363 VTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVR 422
+ KK S LPSLD DS+ R+ +++K++ S +
Sbjct: 515 ----------------------------VTKKTSSSLPSLDRDSENRILAIIKESKSLIT 546
Query: 423 MEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCE 482
+E+++K+HK+PSTHAY+SK+ VD+ IT GKVEGS+EA+R ALKKL+ G SIEDAKAVCE
Sbjct: 547 LEDVMKKHKVPSTHAYSSKNTVDRTITQGKVEGSIEALRAALKKLE-GGGSIEDAKAVCE 605
Query: 483 PEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDF 542
PEVL+QI KWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKL+ IV+KLH+YV +GD IVDF
Sbjct: 606 PEVLNQIVKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLKEIVEKLHYYVKNGDTIVDF 665
Query: 543 CCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMG 602
CCGANDFSCLMK+KL+E GK C YKNYD++ KNDFNFEKRDWM+V+ KEL GS+LIMG
Sbjct: 666 CCGANDFSCLMKQKLEEMGKKCSYKNYDVIQPKNDFNFEKRDWMSVKQKELPTGSQLIMG 725
Query: 603 LNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSF 662
LNPPFGVKA LAN FINKAL+F PKLLILIVPPETERLD+K Y+L+WEDD LSGKSF
Sbjct: 726 LNPPFGVKASLANMFINKALQFKPKLLILIVPPETERLDKKRPPYDLIWEDDNELSGKSF 785
Query: 663 YLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERNCYE 722
YLPGSVD NDKQ++QWN+ P LYLWSR D+ H+A+A+K GH+SR + + +E+ E
Sbjct: 786 YLPGSVDVNDKQIEQWNVNPPLLYLWSRQDWTTKHRAIAQKCGHVSRRRRVSHLEKIQNE 845
Query: 723 THAVDHP--KEEGQGDASMLIDLPLQINVTKELRNEAREDDKAGFPDNATEGGGESS-HG 779
+DHP + G SM++D E E E + + T G ESS H
Sbjct: 846 EPVLDHPMADQTHSGHVSMMLD---------EHSVENHELEHEERREIVTAGRVESSPHS 896
Query: 780 HGDNQSGKTSRKRKRDRKKHGSG---MRENSPLDGQNRGRHLASGIHGMSKHSPANIANV 836
D + + +++HG G R + D + ++ G S S ++
Sbjct: 897 GVDREDHGKKLLNENSKQRHGKGKHEKRTENISDDKQIMTPVSEMCKGTSCTSSPRASDA 956
Query: 837 SPLLEGHSSKSIDMPSHVGSGDNDCQHFSNKGMPLSSPTIVIDGTSPLGVHSSKTIEMLS 896
++ H +++ S V G+ HF G+P SS + T G H+S +M
Sbjct: 957 RSTVDIHQPEALKKSSPVEVGEEVYPHF-QPGVPDSS----LQRTGYGGSHASIPEDMAR 1011
Query: 897 H---DDGCQCDQMPHRSPVNVSSGILEG--ESSKPIEIPALSGIGDGGYEH 942
D HR VS G+ G S +P I + GY H
Sbjct: 1012 RYRLDSEEPFSSTIHRWSTGVSPGLDYGIRNSEEPFTSYMRGSIDNLGYRH 1062
Score = 130 bits (327), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 98/253 (38%), Positives = 133/253 (52%), Gaps = 38/253 (15%)
Query: 995 SPAKVIAKSSLEDHSSKSIDIPSQTGFGSD-YQHHEPTRSSSHIGTTYYG-TQAGIPNDM 1052
SP A+S+++ H +++ S G + Y H +P S + T YG + A IP DM
Sbjct: 950 SPRASDARSTVDIHQPEALKKSSPVEVGEEVYPHFQPGVPDSSLQRTGYGGSHASIPEDM 1009
Query: 1053 GS-YGMSS--------------LNNGLSHGA-NLDERYTGYVRNT-DSLGYRPSM----S 1091
Y + S ++ GL +G N +E +T Y+R + D+LGYR S+
Sbjct: 1010 ARRYRLDSEEPFSSTIHRWSTGVSPGLDYGIRNSEEPFTSYMRGSIDNLGYRHSIRDRDE 1069
Query: 1092 TDRELTMWPLARIYGQDFP-----------APTPGYGQMGSVPSNLYGNLGSSAEASY-R 1139
R + + YG P P +GQMGS PS YG+ GS AE+SY R
Sbjct: 1070 YGRNADIRSQVQSYGLHDPIGMSQRSNYLAGQDPRFGQMGSFPST-YGHPGSGAESSYSR 1128
Query: 1140 MSTSAMDRYAPRLHQLNNTRMNTFRSEPFMPSRFGFYDSRA-PQPGFFAD-MDFGPGFHP 1197
M+TSAM RYAP+L +LN+TRMN+F E MP R YD A P+PGF AD M F PG H
Sbjct: 1129 MNTSAMQRYAPQLDELNHTRMNSFGYERPMPIRNNIYDPLAPPRPGFQADSMGFAPGLHH 1188
Query: 1198 PFPQQGSGGWLDD 1210
PF +Q S GWL++
Sbjct: 1189 PFSKQNSSGWLNE 1201
>gi|356569700|ref|XP_003553034.1| PREDICTED: uncharacterized protein LOC100803073 [Glycine max]
Length = 1225
Score = 846 bits (2186), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/804 (54%), Positives = 547/804 (68%), Gaps = 34/804 (4%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML 79
++S+ EDELFDSVC+ CDNGG LLCC+G+C+RSFHA + GEES CASLG ++ EV+ +
Sbjct: 215 EDSDGEDELFDSVCAICDNGGQLLCCDGKCMRSFHANEEDGEESTCASLGFSRKEVDEIQ 274
Query: 80 NFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAA 139
NF+CKNCEY QHQCFACG LG SDK +GAEVF C SATCG FYHPHCV+KLL E A
Sbjct: 275 NFYCKNCEYNQHQCFACGTLGCSDKFSGAEVFKCASATCGFFYHPHCVAKLLHGIVEDAP 334
Query: 140 DQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDK 199
+L + I G FTCP H CC CK+ E+K D QFAVCRRCP++YHRKCLPR+IAF+D
Sbjct: 335 KELEEKIAEGGPFTCPTHYCCECKEMEDKKKHDFQFAVCRRCPRSYHRKCLPREIAFDDI 394
Query: 200 LEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQS 259
+E IITRAWE LLPN+RILIYCL+HEIDDE+GTPIRDHI FP ++ ID +S
Sbjct: 395 EDEDIITRAWEDLLPNNRILIYCLEHEIDDELGTPIRDHIKFPNVKATVREIDAEENAKS 454
Query: 260 LASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRR 319
+ K++V K+++ SK GK + + ++P K G+ + + S+ SR+
Sbjct: 455 ----ATKERVILNKNNIDSKNL-FGKKATAKVSKLPGKMSSGKVGDKKSEKISRSNISRK 509
Query: 320 AKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSD 379
K + SR F N +S + + S + SLG +L+A +E G + +D
Sbjct: 510 -KINEASR-CFNENKRSTISKETKKSDGAENRPSLGAKLFA-LKQNSSEHINSGNE--AD 564
Query: 380 RETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYA 439
T+ VKP KKL S LP+LD DSKRRL +L K+A SSV +E ++K HK +TH ++
Sbjct: 565 DVAKNTLVVKPT-KKLSSTLPALDADSKRRLLALFKEATSSVTLENVIKEHKFAATHTHS 623
Query: 440 SKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVY 499
KS V+K ITLGK+EGSVEA+RTAL+ L+ DG +I DA+AVC P+VL+QIFKWK+KLKVY
Sbjct: 624 LKSVVEKTITLGKLEGSVEAVRTALRMLE-DGHNIRDAEAVCGPDVLNQIFKWKDKLKVY 682
Query: 500 LAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDE 559
LAP L+G RYTSFGRHFT+++KL+ IVDKLHWYV +GD IVDFCCGANDFS LM KKL+E
Sbjct: 683 LAPVLYGNRYTSFGRHFTQIEKLEGIVDKLHWYVQNGDTIVDFCCGANDFSILMNKKLEE 742
Query: 560 TGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFIN 619
TGK C YKN+D+LP KNDFNFE RDWMT++ KEL GS+LIMGLNPPFG+KA LANKFI+
Sbjct: 743 TGKRCSYKNFDLLPTKNDFNFEMRDWMTIQTKELPTGSQLIMGLNPPFGLKAALANKFID 802
Query: 620 KALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWN 679
KALEF PKLLILIVPPETERLD K S Y+LVWED +FL GKSFYLPGSVD ND+Q+DQWN
Sbjct: 803 KALEFRPKLLILIVPPETERLDEKRSPYDLVWEDKRFLLGKSFYLPGSVDANDRQIDQWN 862
Query: 680 MTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERNCYETHAVDHPKEEGQGDASM 739
+ PPLYLWSR D+ HKA+A KHGH + ++E E H ++ G SM
Sbjct: 863 VKPPPLYLWSRPDWTDKHKAIARKHGHFISQRGLLRIESFDKEKSPASHTLDDSSGFNSM 922
Query: 740 -------LIDLPLQINVTKELRNEAREDDKAGFPDNATEGGGESSHGHGDNQSGKTSRKR 792
L D P+ E P + + + ++ KTS KR
Sbjct: 923 PGHDILNLTDAPIN------------EGQTGCSPHGNVDRESQERQKYMVRKADKTSWKR 970
Query: 793 KRDRKKHGSGMRENS---PLDGQN 813
KR + G + S P+DG++
Sbjct: 971 KRSEENDGRRLGVTSPPNPIDGRS 994
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 69/245 (28%), Positives = 101/245 (41%), Gaps = 46/245 (18%)
Query: 996 PAKVIAKSSLEDHSSKSIDIPSQTGFG-SDYQHHEPTRSSSH--IGTTYYGTQ--AGIPN 1050
P + +SS+E + P G Y+H EPT SS I Y GTQ + N
Sbjct: 987 PNPIDGRSSVESFQLRPDMPPPDYELGDKSYRHLEPTSSSRMGGIRAAYSGTQNWPSVAN 1046
Query: 1051 DMGSYGMSSLNNGLSHGANLDERYTGYVRN-TDSLGYRPSMSTD----RELTMWPLARIY 1105
+ G++ ++ E ++ R+ +S+GYRP + D REL R Y
Sbjct: 1047 PLYDSGIT----------DVGEHHSSLPRDIANSIGYRPYVREDENYLRELETRQQTRHY 1096
Query: 1106 GQDFPAPTPGYGQMGSVPSNLYGNLGSS------AEASYRMSTSAMDRYAPRLHQLNNTR 1159
G P P+N + ++G S A Y M+T AM RYAPRL +LN+ R
Sbjct: 1097 GIQNPNSVMSNYLSVHDPANSH-HMGPSYPALALASEPYVMNTPAMQRYAPRLDELNHAR 1155
Query: 1160 MNTFRS------------------EPFMPSRFGFYDSRAPQPGFFADM-DFGPGFHPPFP 1200
M+ S EP + R G ++ A PG+ + M F G H +
Sbjct: 1156 MDPLGSRLDELNHARMDPLGSRLDEPAIVGRNGAFERSALPPGYGSRMPGFAAGSHHMYS 1215
Query: 1201 QQGSG 1205
+Q S
Sbjct: 1216 RQNSA 1220
>gi|357459829|ref|XP_003600195.1| hypothetical protein MTR_3g055370 [Medicago truncatula]
gi|355489243|gb|AES70446.1| hypothetical protein MTR_3g055370 [Medicago truncatula]
Length = 1289
Score = 817 bits (2111), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 431/830 (51%), Positives = 551/830 (66%), Gaps = 87/830 (10%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
LFDSVCSFCDNGG+LLCCEG+C+RSFHA + GEES CASLG ++ EVE + NF+CKNCE
Sbjct: 227 LFDSVCSFCDNGGELLCCEGKCMRSFHANEEDGEESSCASLGFSRKEVEEIQNFYCKNCE 286
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
+ +HQCFACG+LG SDK GAEVF C SATCG FYHP CV+KLL A +L +I
Sbjct: 287 HNKHQCFACGELGCSDKFAGAEVFKCASATCGFFYHPQCVAKLLHLVISDAPTELVTNIA 346
Query: 148 AGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITR 207
GE FTCP H C ICK+ ENK + +L FAVCRRCPK+YHRKCLPRK+AFED +EEGI+ R
Sbjct: 347 KGEPFTCPAHYCRICKEMENKNEHELHFAVCRRCPKSYHRKCLPRKVAFEDIVEEGIVAR 406
Query: 208 AWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQ 267
AWE LLPN+RILIYCLKHEIDDE+GTPIRDHI FP +++ A P+ K+
Sbjct: 407 AWEDLLPNNRILIYCLKHEIDDELGTPIRDHIKFPYVKQK-------------AKPATKE 453
Query: 268 KVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSR 327
+ + + L K + L ++ K G+ + ++GS+ R+ KA + SR
Sbjct: 454 VINNNNAKLDDL---HVKRTSATLPKLSGKMSFGKVGIENPGKILGSNIPRK-KANEASR 509
Query: 328 KSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVT 387
+ N + + ++S + + SLG +LY+ + +G++Q G N+ + + +V
Sbjct: 510 RLLNENKRPTLKEAEKSDH-EENQPSLGLQLYSHY-QKGSKQINSGNHVNNVADNTLSVK 567
Query: 388 VKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKA 447
+P KKL S P LD DS+RRL +L+K+A+SS+ +E ++K HK STH ++ K+ V+K
Sbjct: 568 -RP--KKLSSAPPQLDADSERRLLALVKEASSSITLESVIKEHKFVSTHTHSLKNVVEKT 624
Query: 448 ITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGM 507
IT+GK+EGSVEA+RTAL+ LD +G SI DA+AVC P+V++++FKWK+KLKVYLAP L+G
Sbjct: 625 ITMGKLEGSVEAVRTALRMLD-EGHSIRDAEAVCGPDVMNRLFKWKDKLKVYLAPVLYGN 683
Query: 508 RYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYK 567
RYTSFGRHFT+V+KL+ IVDKLHWYV + DMIVDFCCGANDFS LMKKKL+ETGK+CLYK
Sbjct: 684 RYTSFGRHFTQVEKLEGIVDKLHWYVQNNDMIVDFCCGANDFSRLMKKKLEETGKSCLYK 743
Query: 568 NYDILPAK--------------------------------------------NDFNFEKR 583
N+D+LP K NDFNFE R
Sbjct: 744 NFDLLPTKAALHRLKLKSLGSVFSPTSQSPPPQKQNPTRWVFPPDVVLICLQNDFNFEMR 803
Query: 584 DWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRK 643
DW+TV+ KEL GS+LIMGLNPPFG+KA LANKFI+KALEF PKLLILIVPPETERLDRK
Sbjct: 804 DWLTVQRKELPLGSQLIMGLNPPFGLKAALANKFIDKALEFEPKLLILIVPPETERLDRK 863
Query: 644 ESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEK 703
S Y LVWED++FLSGKSFYLPGSVD NDKQM+QWN+ PPLYLWS D+A HK +A++
Sbjct: 864 RSRYVLVWEDERFLSGKSFYLPGSVDSNDKQMEQWNVKPPPLYLWSHPDWADKHKLIAQE 923
Query: 704 HGHISRPQSRTQMERNCYETHAVDHPKEEGQGDASMLIDLPLQINVTKELRNEAREDDKA 763
HGH+ R + ++ME E + H ++ D +ML D L + K + D +
Sbjct: 924 HGHLFRERDVSRMESFDKEKSSASHSMDDNYFDDTML-DRMLDHDFLK-----STSDQDS 977
Query: 764 GFPDNATEGGGESSHGHGDNQSGK-----------TSRKRKRDRKKHGSG 802
F EG SSHG+ S + TS KRKR + G G
Sbjct: 978 SFMIGQMEG---SSHGNVFRVSQERQDYLTINAENTSWKRKRTEENDGRG 1024
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 69/246 (28%), Positives = 94/246 (38%), Gaps = 72/246 (29%)
Query: 1016 PSQTGFGSDYQHHEPTRSSS-HIGTTYYGTQ--AGIPNDMGSYGMSSLNNGLSHGANLDE 1072
P + Y+H EP SS G Y GT + + + YG+ A+L E
Sbjct: 1065 PQSSIVDDSYRHLEPLSSSRVEFGQAYDGTHNWPNVADPLPDYGL----------ADLQE 1114
Query: 1073 RYTGYVRN-TDSLGYRPSMSTD------------RELTMWPLA----RIYGQDFPAPT-- 1113
+G++ + T SLGYRP + D E + PL + G D+ P
Sbjct: 1115 HNSGHLGDGTSSLGYRPHLRGDDIYPPGLPRPYLTEDVIQPLGVPRPYVMGADYIRPPRV 1174
Query: 1114 ---------------PGYGQMGSVPSNLYGNLGSSAEASY-------RMSTSAMDRYAPR 1151
P Y QMGS Y GS +E Y +MST AM RYAPR
Sbjct: 1175 PMPDPMSSSYLSGRGPAYSQMGST----YSVCGSGSELPYMMNTPAMQMSTPAMQRYAPR 1230
Query: 1152 LHQLNNTRMNTFRSEPFMPSRFGFYDSRAPQ------PGFFADMDFGPGFHP-PFPQQGS 1204
L +LN+ + N+ E + R + A PGF PG P + +Q S
Sbjct: 1231 LDELNHVKTNSLGPEHPIVDRSTTSEHSAQSGSENVPPGF-------PGGSPHLYSRQNS 1283
Query: 1205 GGWLDD 1210
W D
Sbjct: 1284 SNWFSD 1289
>gi|145359280|ref|NP_200350.2| enhanced downy mildew 2 [Arabidopsis thaliana]
gi|334188424|ref|NP_001190545.1| enhanced downy mildew 2 [Arabidopsis thaliana]
gi|332009240|gb|AED96623.1| enhanced downy mildew 2 [Arabidopsis thaliana]
gi|332009241|gb|AED96624.1| enhanced downy mildew 2 [Arabidopsis thaliana]
Length = 1297
Score = 793 bits (2047), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/730 (55%), Positives = 514/730 (70%), Gaps = 27/730 (3%)
Query: 31 SVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 90
SVC+ CDNGG++LCCEG CLRSFHAT GE+S C SLG K +VEA+ +FC NCE+K
Sbjct: 223 SVCAICDNGGEILCCEGSCLRSFHATKKDGEDSLCDSLGFNKMQVEAIQKYFCPNCEHKI 282
Query: 91 HQCFACGKLGSSDKETGA-EVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
HQCF C LGSSD +GA EVF CVSATCG+FYHPHCV++ L ++ ++ L + IIAG
Sbjct: 283 HQCFICKNLGSSDNSSGAAEVFQCVSATCGYFYHPHCVTRRLRLGNKEESEALERQIIAG 342
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAW 209
E +TCPLHKC +C+ GE K DS+LQFAVCRRCPK+YHRKCLPR+I+FED +E I+TRAW
Sbjct: 343 E-YTCPLHKCSVCENGEVKTDSNLQFAVCRRCPKSYHRKCLPREISFEDIEDEDILTRAW 401
Query: 210 EGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKV 269
+GLL N R+LIYC +HEID+E+ TP+RDH+ FP EE K + R+ L S G+ K
Sbjct: 402 DGLLHN-RVLIYCQEHEIDEELLTPVRDHVKFPFTEEQKVFVKEQRR--ILESHVGRDK- 457
Query: 270 ASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKS 329
A K + GK S + + + G + + + D SR+ K D S K
Sbjct: 458 ARLKVKDPALQDTCGKASKNSFRSSFPSSKDGFSTKKHGLVSSVPDHSRKRKDIDPSIKH 517
Query: 330 FKGNVKSLSVQVDRSSSVDSK------------KTSLGERLYAAFVTEGTEQTKFGKQDN 377
KS + D + +K K SLGERL++ T+ K G+
Sbjct: 518 KMVPQKSQKMMEDSREAGKNKLGVKEARDAGKSKISLGERLFS--YTQEPNPVKPGRVIP 575
Query: 378 SDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHA 437
D + ++T ++ K+ SE+P+LD DS+RRL ++MK A + M ILK+ KI ST +
Sbjct: 576 VDSKHNKTDSIA--SKEPGSEIPTLDNDSQRRLLAVMKKATEEITMGTILKKFKIQSTMS 633
Query: 438 -YASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKL 496
+++++ VDK IT+GKVEGSV+AIRTALKKL+ +G +IEDAKAVCEPEVLSQI KWK+KL
Sbjct: 634 THSTRNVVDKTITMGKVEGSVQAIRTALKKLE-EGGNIEDAKAVCEPEVLSQILKWKDKL 692
Query: 497 KVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKK 556
KVYLAPFLHG RYTSFGRHFT +KLQ IVD+LHWY +DGDMIVDFCCG+NDFSCLM K
Sbjct: 693 KVYLAPFLHGARYTSFGRHFTNPEKLQQIVDRLHWYADDGDMIVDFCCGSNDFSCLMNAK 752
Query: 557 LDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANK 616
L+ETGK CLYKNYD+ PAKN+FNFE++DWMTV EL PGS+LIMGLNPPFGV A LANK
Sbjct: 753 LEETGKKCLYKNYDLFPAKNNFNFERKDWMTVSKDELEPGSKLIMGLNPPFGVNASLANK 812
Query: 617 FINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMD 676
FI KALEF PK+LILIVPPETERLD+K+S+Y L+WED FLSG SFYLPGSV+E DKQ++
Sbjct: 813 FITKALEFRPKILILIVPPETERLDKKKSSYVLIWEDKTFLSGNSFYLPGSVNEEDKQLE 872
Query: 677 QWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERNCYETHAVDHPKEEGQGD 736
WN+ PPL LWSR D+AA HK +AEKH H+SR ++++ E +A HP G
Sbjct: 873 DWNLVPPPLSLWSRSDFAAKHKKIAEKHCHLSRDVGSSKLKIVEEEANASLHPLGASDG- 931
Query: 737 ASMLIDLPLQ 746
M D+P++
Sbjct: 932 --MCDDIPME 939
>gi|297796419|ref|XP_002866094.1| hypothetical protein ARALYDRAFT_495622 [Arabidopsis lyrata subsp.
lyrata]
gi|297311929|gb|EFH42353.1| hypothetical protein ARALYDRAFT_495622 [Arabidopsis lyrata subsp.
lyrata]
Length = 1294
Score = 781 bits (2018), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/740 (55%), Positives = 512/740 (69%), Gaps = 47/740 (6%)
Query: 31 SVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 90
SVC+ CDNGG+LLCCEG CLRSFHAT GE+S C SLGL K +VEA+ +FC NCE+K
Sbjct: 223 SVCAICDNGGELLCCEGSCLRSFHATKKDGEDSLCDSLGLNKMQVEAIQKYFCPNCEHKI 282
Query: 91 HQCFACGKLGSSDKETGA-EVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
HQCF C LGSSD TGA EVF CVSATCG+FYHP CV++ L ++ ++ L + IIAG
Sbjct: 283 HQCFICKNLGSSDNSTGAAEVFQCVSATCGYFYHPRCVTRRLRLGNKDDSEALERQIIAG 342
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAW 209
E +TCPLHKC +C+ GE K DS+LQFAVCRRCPK+YHRKCLPR+I+FED +E I TRAW
Sbjct: 343 E-YTCPLHKCSVCENGEVKTDSNLQFAVCRRCPKSYHRKCLPREISFEDIADEDIFTRAW 401
Query: 210 EGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKV 269
+GLL N R+LIYC +HEID+E+ TP+RDH+ FP EE K I R+ L S G+ K
Sbjct: 402 DGLLHN-RVLIYCQEHEIDEELLTPLRDHVKFPFTEEKKVFIKEQRR--ILESHVGRDKA 458
Query: 270 ---------------ASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGS 314
AS KS +S + FS K K G G ++ R
Sbjct: 459 RPKVKDPALQDTCGKASDKSFRSSFPSSKDGFSTK-------KHGLGSSVPDHSRKRKDI 511
Query: 315 DSSRRAKATDVSRKSFKGNVKS-------LSVQVDRSSSVDSKKTSLGERLYAAFVTEGT 367
D S + K V +KS K S L V+ R S K SLGERL+ ++ E
Sbjct: 512 DPSIKHKM--VPQKSHKMMENSREAGKNRLGVKETRESG--KSKVSLGERLFN-YIQE-P 565
Query: 368 EQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEIL 427
K G+ D + ++T ++ K+ SE+P+LD DS+RRL ++MK A + M +L
Sbjct: 566 NLVKPGRMIPVDSKHNKTDSIA--SKEPGSEIPTLDNDSQRRLLAVMKKATEEITMGTML 623
Query: 428 KRHKIPST-HAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVL 486
K+ K+ ST + ++++ VDK IT+GKVEGSV+A+RTALKKL+ +G +IEDAKAVCEPEVL
Sbjct: 624 KKFKVQSTMNTNSTRNVVDKTITMGKVEGSVQAVRTALKKLE-EGGNIEDAKAVCEPEVL 682
Query: 487 SQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGA 546
SQI KWK+KLKVYLAPFLHG RYTSFGRHFT +KLQ IVD+LHWY DGDMIVDFCCG+
Sbjct: 683 SQILKWKDKLKVYLAPFLHGARYTSFGRHFTNPEKLQQIVDRLHWYAEDGDMIVDFCCGS 742
Query: 547 NDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPP 606
NDFSCLM KL+ETGK CLYKNYD+ AKN+FNFE++DWMTV EL PGS+LIMGLNPP
Sbjct: 743 NDFSCLMNTKLEETGKKCLYKNYDLFQAKNNFNFERKDWMTVSKDELEPGSKLIMGLNPP 802
Query: 607 FGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPG 666
FGV A LANKFI KALEF PK+LILIVPPETERLD+K+S+Y L+WED FLSG SFYLPG
Sbjct: 803 FGVNASLANKFITKALEFRPKILILIVPPETERLDKKKSSYVLIWEDKTFLSGNSFYLPG 862
Query: 667 SVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERNCYETHAV 726
SV+E DKQ++ WN+ PPL LWSR D+AA HK +AEKH H+SR ++++ E A
Sbjct: 863 SVNEEDKQLEDWNIVPPPLSLWSRSDFAAKHKKIAEKHCHLSRDVGSSKIKIVEEEAKAS 922
Query: 727 DHPKEEGQGDASMLIDLPLQ 746
HP G M D+P++
Sbjct: 923 LHPLGPSDG---MCDDIPVE 939
>gi|225446989|ref|XP_002268363.1| PREDICTED: uncharacterized protein LOC100248222 [Vitis vinifera]
Length = 1216
Score = 781 bits (2017), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/721 (55%), Positives = 499/721 (69%), Gaps = 30/721 (4%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
LFDSVC+ CDNGG+LLCCEGRCLRSFHAT+DAG ES C SLG + +VEA+ NF CKNC+
Sbjct: 230 LFDSVCAICDNGGELLCCEGRCLRSFHATVDAGTESFCESLGFSDAQVEAIQNFLCKNCQ 289
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
Y+QHQCF CG LGSS++ +GAEVF C SATCG FYHP+CV+K L + + A L I
Sbjct: 290 YQQHQCFVCGMLGSSNESSGAEVFRCASATCGRFYHPYCVAKRLHPMNNILAKHLQNKIA 349
Query: 148 AGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITR 207
G SFTCPLHKC +CK+GENK DLQFA+CRRCPKAYHRKCLP I+FE E I+ R
Sbjct: 350 GGVSFTCPLHKCFVCKRGENKGVDDLQFALCRRCPKAYHRKCLPGNISFECIYNENIMQR 409
Query: 208 AWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQ 267
AW GLLPN RILIYC++H+I+ ++ TP R+HI FP E KK PS +
Sbjct: 410 AWIGLLPN-RILIYCMEHKINRKLRTPERNHIRFPDPESKG-------KKHVSELPSSNE 461
Query: 268 KVASTKSSLTSKAPPQGKFSVKALK-RVPSKAGQGETMEISERLLV--GSDSSRRAKATD 324
KV S K ++ S+ P +VK K V ++ + E+ G D + K D
Sbjct: 462 KVMSKKRNIVSEIFPAESTAVKMTKLEVHRVVKDVDSTKFFEKRCSSQGFDPPTKQKIND 521
Query: 325 VSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSR 384
++K + NVKS+ V++ S +V ++SL R Y KQ N + +
Sbjct: 522 ATKKFLRDNVKSVPVKICASVAVKGTQSSL--RNYNIK----------PKQQNIPSKVEK 569
Query: 385 TVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAV 444
++KP K+ S P +D + + R+ LMK SS +EE ++ K+ ++ SK+ +
Sbjct: 570 ITSLKPSMKRASSSQPLMDAELETRIVDLMKSTTSSFSLEEFREKQKVLCSY---SKNVL 626
Query: 445 DKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFL 504
D IT GKVE SV+AIRTAL+KL+ G SIEDAKAVCEPEVL+QI +WK KLKVYLAPFL
Sbjct: 627 DSTITQGKVEVSVKAIRTALEKLE-KGCSIEDAKAVCEPEVLNQIMRWKRKLKVYLAPFL 685
Query: 505 HGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNC 564
HGMRYTSFGRHFTKV+KL+ +VD+LHWYV GDMIVDFCCG+NDFSCLMK+KLD+ GK+C
Sbjct: 686 HGMRYTSFGRHFTKVEKLREVVDRLHWYVQHGDMIVDFCCGSNDFSCLMKEKLDKVGKSC 745
Query: 565 LYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEF 624
+KNYD++ KNDF+FEKRDWM++ EL GS+LIMGLNPPFGVKA LANKFI+KAL F
Sbjct: 746 SFKNYDLIQPKNDFSFEKRDWMSIHLDELPAGSQLIMGLNPPFGVKASLANKFIDKALSF 805
Query: 625 NPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPP 684
PKLLILIVP ET+RLD K+SAY+L+WED+ LSGKSFYLPGSVD +DKQ++QWN+ P
Sbjct: 806 RPKLLILIVPKETKRLDEKDSAYDLIWEDEDILSGKSFYLPGSVDMHDKQLEQWNLLPPL 865
Query: 685 LYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERNCYETHAVDHPKEEGQ---GDASMLI 741
LYLWSR D+ + HKA+A+K GHIS Q +E N E ++ EE GD S L+
Sbjct: 866 LYLWSRPDWTSRHKAVAQKCGHISIEQKDFLVEGNNVEREVSNYLMEENHDCYGDFSNLM 925
Query: 742 D 742
+
Sbjct: 926 N 926
>gi|9758171|dbj|BAB08556.1| unnamed protein product [Arabidopsis thaliana]
Length = 1332
Score = 775 bits (2000), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/792 (52%), Positives = 531/792 (67%), Gaps = 63/792 (7%)
Query: 4 FIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEES 63
FIV D E+ V E++ +E+D+ F+SVC+ CDNGG++LCCEG CLRSFHAT GE+S
Sbjct: 197 FIVGD-EDTYVASDEDELDDEDDDFFESVCAICDNGGEILCCEGSCLRSFHATKKDGEDS 255
Query: 64 HCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGA-EVFPCVSATCGHFY 122
C SLG K +VEA+ +FC NCE+K HQCF C LGSSD +GA EVF CVSATCG+FY
Sbjct: 256 LCDSLGFNKMQVEAIQKYFCPNCEHKIHQCFICKNLGSSDNSSGAAEVFQCVSATCGYFY 315
Query: 123 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCP 182
HPHCV++ L ++ ++ L + IIAGE +TCPLHKC +C+ GE K DS+LQFAVCRRCP
Sbjct: 316 HPHCVTRRLRLGNKEESEALERQIIAGE-YTCPLHKCSVCENGEVKTDSNLQFAVCRRCP 374
Query: 183 KAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 242
K+YHRKCLPR+I+FED +E I+TRAW+GLL N R+LIYC +HEID+E+ TP+RDH+ FP
Sbjct: 375 KSYHRKCLPREISFEDIEDEDILTRAWDGLLHN-RVLIYCQEHEIDEELLTPVRDHVKFP 433
Query: 243 GIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGE 302
EE K + R+ L S G+ K A K + GK S + + + G
Sbjct: 434 FTEEQKVFVKEQRR--ILESHVGRDK-ARLKVKDPALQDTCGKASKNSFRSSFPSSKDGF 490
Query: 303 TMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSK------------ 350
+ + + D SR+ K D S K KS + D + +K
Sbjct: 491 STKKHGLVSSVPDHSRKRKDIDPSIKHKMVPQKSQKMMEDSREAGKNKLGVKEARDAGKS 550
Query: 351 KTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRL 410
K SLGERL++ T+ K G+ D + ++T ++ K+ SE+P+LD DS+RRL
Sbjct: 551 KISLGERLFS--YTQEPNPVKPGRVIPVDSKHNKTDSIA--SKEPGSEIPTLDNDSQRRL 606
Query: 411 SSLMKDAASSVRMEEILKRHKIPSTHA-YASKSAVDKAITLGKVEGSVEAIRTALKKLDV 469
++MK A + M ILK+ KI ST + +++++ VDK IT+GKVEGSV+AIRTALKKL+
Sbjct: 607 LAVMKKATEEITMGTILKKFKIQSTMSTHSTRNVVDKTITMGKVEGSVQAIRTALKKLE- 665
Query: 470 DGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKL 529
+G +IEDAKAVCEPEVLSQI KWK+KLKVYLAPFLHG RYTSFGRHFT +KLQ IVD+L
Sbjct: 666 EGGNIEDAKAVCEPEVLSQILKWKDKLKVYLAPFLHGARYTSFGRHFTNPEKLQQIVDRL 725
Query: 530 HWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVE 589
HWY +DGDMIVDFCCG+NDFSCLM KL+ETGK CLYKNYD+ PAKN+FNFE++DWMTV
Sbjct: 726 HWYADDGDMIVDFCCGSNDFSCLMNAKLEETGKKCLYKNYDLFPAKNNFNFERKDWMTVS 785
Query: 590 PKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETE----------- 638
EL PGS+LIMGLNPPFGV A LANKFI KALEF PK+LILIVPPETE
Sbjct: 786 KDELEPGSKLIMGLNPPFGVNASLANKFITKALEFRPKILILIVPPETERFQFPSISSAP 845
Query: 639 ------------------------RLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQ 674
RLD+K+S+Y L+WED FLSG SFYLPGSV+E DKQ
Sbjct: 846 LYHSITLIYRLLSLSLVKSITFLNRLDKKKSSYVLIWEDKTFLSGNSFYLPGSVNEEDKQ 905
Query: 675 MDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERNCYETHAVDHPKEEGQ 734
++ WN+ PPL LWSR D+AA HK +AEKH H+SR ++++ E +A HP
Sbjct: 906 LEDWNLVPPPLSLWSRSDFAAKHKKIAEKHCHLSRDVGSSKLKIVEEEANASLHPLGASD 965
Query: 735 GDASMLIDLPLQ 746
G M D+P++
Sbjct: 966 G---MCDDIPME 974
>gi|297739138|emb|CBI28789.3| unnamed protein product [Vitis vinifera]
Length = 1025
Score = 770 bits (1989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/734 (54%), Positives = 499/734 (67%), Gaps = 43/734 (5%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
LFDSVC+ CDNGG+LLCCEGRCLRSFHAT+DAG ES C SLG + +VEA+ NF CKNC+
Sbjct: 26 LFDSVCAICDNGGELLCCEGRCLRSFHATVDAGTESFCESLGFSDAQVEAIQNFLCKNCQ 85
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
Y+QHQCF CG LGSS++ +GAEVF C SATCG FYHP+CV+K L + + A L I
Sbjct: 86 YQQHQCFVCGMLGSSNESSGAEVFRCASATCGRFYHPYCVAKRLHPMNNILAKHLQNKIA 145
Query: 148 AGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITR 207
G SFTCPLHKC +CK+GENK DLQFA+CRRCPKAYHRKCLP I+FE E I+ R
Sbjct: 146 GGVSFTCPLHKCFVCKRGENKGVDDLQFALCRRCPKAYHRKCLPGNISFECIYNENIMQR 205
Query: 208 AWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQ 267
AW GLLPN RILIYC++H+I+ ++ TP R+HI FP E KK PS +
Sbjct: 206 AWIGLLPN-RILIYCMEHKINRKLRTPERNHIRFPDPESKG-------KKHVSELPSSNE 257
Query: 268 KVASTKSSLTSKAPPQGKFSVKALK-RVPSKAGQGETMEISERLLV--GSDSSRRAKATD 324
KV S K ++ S+ P +VK K V ++ + E+ G D + K D
Sbjct: 258 KVMSKKRNIVSEIFPAESTAVKMTKLEVHRVVKDVDSTKFFEKRCSSQGFDPPTKQKIND 317
Query: 325 VSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSR 384
++K + NVKS+ V++ S +V ++SL R Y KQ N + +
Sbjct: 318 ATKKFLRDNVKSVPVKICASVAVKGTQSSL--RNYNI----------KPKQQNIPSKVEK 365
Query: 385 TVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAV 444
++KP K+ S P +D + + R+ LMK SS +EE ++ K+ ++ SK+ +
Sbjct: 366 ITSLKPSMKRASSSQPLMDAELETRIVDLMKSTTSSFSLEEFREKQKVLCSY---SKNVL 422
Query: 445 DKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFL 504
D IT GKVE SV+AIRTAL+KL+ G SIEDAKAVCEPEVL+QI +WK KLKVYLAPFL
Sbjct: 423 DSTITQGKVEVSVKAIRTALEKLE-KGCSIEDAKAVCEPEVLNQIMRWKRKLKVYLAPFL 481
Query: 505 HGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDM-------------IVDFCCGANDFSC 551
HGMRYTSFGRHFTKV+KL+ +VD+LHWYV GDM IVDFCCG+NDFSC
Sbjct: 482 HGMRYTSFGRHFTKVEKLREVVDRLHWYVQHGDMSFYNSLFPECEIRIVDFCCGSNDFSC 541
Query: 552 LMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKA 611
LMK+KLD+ GK+C +KNYD++ KNDF+FEKRDWM++ EL GS+LIMGLNPPFGVKA
Sbjct: 542 LMKEKLDKVGKSCSFKNYDLIQPKNDFSFEKRDWMSIHLDELPAGSQLIMGLNPPFGVKA 601
Query: 612 GLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDEN 671
LANKFI+KAL F PKLLILIVP ET+RLD K+SAY+L+WED+ LSGKSFYLPGSVD +
Sbjct: 602 SLANKFIDKALSFRPKLLILIVPKETKRLDEKDSAYDLIWEDEDILSGKSFYLPGSVDMH 661
Query: 672 DKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERNCYETHAVDHPKE 731
DKQ++QWN+ P LYLWSR D+ + HKA+A+K GHIS Q +E N E ++ E
Sbjct: 662 DKQLEQWNLLPPLLYLWSRPDWTSRHKAVAQKCGHISIEQKDFLVEGNNVEREVSNYLME 721
Query: 732 EGQ---GDASMLID 742
E GD S L++
Sbjct: 722 ENHDCYGDFSNLMN 735
>gi|356499417|ref|XP_003518537.1| PREDICTED: uncharacterized protein LOC100806429 [Glycine max]
Length = 1065
Score = 767 bits (1980), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/711 (53%), Positives = 507/711 (71%), Gaps = 28/711 (3%)
Query: 3 GFIVDDMEEDMVHDTEEDESNEEDELFDS--VCSFCDNGGDLLCCEGRCLRSFHATIDAG 60
G V ++ DM+ DESNEE E D VC+ CDNGG++ CC+G C+RSFHAT++AG
Sbjct: 206 GLTVVGIDSDMI-----DESNEESEERDDLDVCALCDNGGNVTCCDGVCMRSFHATVEAG 260
Query: 61 EESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGH 120
E+ C SLG T+ EV+ + +F+CKNCEY QHQCFACGKLGSSDK GAEV CVSATC
Sbjct: 261 RENSCVSLGFTQKEVDEIQSFYCKNCEYYQHQCFACGKLGSSDKVKGAEVIKCVSATCDR 320
Query: 121 FYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRR 180
FYHPHCV+KLL + + A+ L ++I F CPLH CC+CK+ ENK D +LQFAVCRR
Sbjct: 321 FYHPHCVAKLLPQLAKPVAEDLERNIADRVPFICPLHYCCVCKELENKVDPELQFAVCRR 380
Query: 181 CPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHII 240
CPK+YHRKCLPR+IA ++ + II RAWEGLLPN+RILIYCL H+ID E+GTP+RDHI
Sbjct: 381 CPKSYHRKCLPREIAPSNRGNKNIIQRAWEGLLPNNRILIYCLNHKIDRELGTPVRDHIK 440
Query: 241 FPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSL---TSKAPPQGKFSVKALKRVPSK 297
FP +E I+ +++ P+ K++V K ++ S K S K K
Sbjct: 441 FPNMEPTVQKINTTIEQKE---PATKERVILKKKNVDLDNSSGKSIAKGSKLTGKLSSHK 497
Query: 298 AGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRS-SSVDSKKTSLGE 356
G +T ++++ GS+ SR+ K+ + SR N +S+S + + S S + + ++GE
Sbjct: 498 VGSKKT----KKIISGSNISRKPKSKETSR-CLTENKRSISKKSEMSDSEQNYNQPTIGE 552
Query: 357 RLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKD 416
+YA EG ++ K + N+ + ++VKP+ + L ELP LD DS++ L +L K+
Sbjct: 553 -IYA-LQKEGLKRIKHDNKVNN--VITNPLSVKPI-ESLSVELPPLDADSEKSLLTLFKE 607
Query: 417 AASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIED 476
A SS+ +E +L++H STH + ++ V+K IT+GK+E SV A++TAL+KL+ G SI+D
Sbjct: 608 ARSSITLESVLEKHTFASTHTHPLRNVVEKTITMGKLEYSVNAVQTALRKLE-SGCSIQD 666
Query: 477 AKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDG 536
KA C+P+ L Q+FKWK++LK+YLAP L+G RYTS+GRHFT V+KL+ IVDKLHWYV +
Sbjct: 667 VKAFCDPDDLKQLFKWKDELKIYLAPVLYGNRYTSYGRHFTLVEKLEGIVDKLHWYVQNS 726
Query: 537 DMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPG 596
D IVDFCCGANDFS LMKKKL+E GK C Y+NYD+LP KNDF+FE+RDWMTV+P EL G
Sbjct: 727 DTIVDFCCGANDFSILMKKKLEENGKKCSYRNYDLLPTKNDFSFERRDWMTVQPTELPTG 786
Query: 597 SRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQF 656
S+LIMGLNPPFG KA LANKF++KALEF PKL+ILIVPPETERLD+K+S Y+L+WED+ F
Sbjct: 787 SQLIMGLNPPFGHKAALANKFVDKALEFKPKLVILIVPPETERLDKKQSPYDLIWEDENF 846
Query: 657 LSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHI 707
LSG SFYLPGSV D+QMDQ N P L LWSR D+ HK +A+++GH+
Sbjct: 847 LSGTSFYLPGSV---DRQMDQRNARPPLLSLWSRPDWTTKHKVIAQENGHV 894
>gi|222640382|gb|EEE68514.1| hypothetical protein OsJ_26949 [Oryza sativa Japonica Group]
Length = 1311
Score = 692 bits (1787), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/684 (52%), Positives = 455/684 (66%), Gaps = 52/684 (7%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
LFDSVC+ CDNGG+LLCCEG C+RSFHA I GE+S+CA+LG TK EV+A+ NF CKNC+
Sbjct: 244 LFDSVCAICDNGGELLCCEGPCMRSFHAKIRDGEDSYCATLGYTKAEVKALKNFVCKNCD 303
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
+KQHQCF CG+L SD A+VF C +ATCGHFYHP CV++LL + A ++ K I+
Sbjct: 304 HKQHQCFVCGELEPSDG-PNAKVFLCNNATCGHFYHPRCVAQLLHPNSRNEASEMEKKIM 362
Query: 148 AGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITR 207
AG SFTCP+H C CK E++ LQFAVCRRCP++YHRKCLPR+I+FED +GIITR
Sbjct: 363 AGFSFTCPVHWCFHCKGLEDRTQEPLQFAVCRRCPRSYHRKCLPREISFEDINTQGIITR 422
Query: 208 AWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEEN-----KTIIDRPRKKQSLAS 262
AWE + RILIYCL HEID +IGTP RDHI FP +E++ K + + KK+ +
Sbjct: 423 AWEL---SKRILIYCLDHEIDLDIGTPPRDHIKFPHVEKSAYSAKKKVKELAEKKRRICD 479
Query: 263 PSGKQKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKA 322
S + ++ L K +G S KA G E E L
Sbjct: 480 DSYVSEPLQKRAKLNEKFNAKGDKSKKA----------GVKSEFEEVL------------ 517
Query: 323 TDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRET 382
+S K +SL K+T E L + +RE
Sbjct: 518 -----ESEKKKTRSL-----------KKRTQPEEPLVECAAAAAANNA---NRPVKEREK 558
Query: 383 SRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKS 442
+ + K +S P +D ++++R+S+L++ SS+ + +I +R IPST+A S
Sbjct: 559 ELGTSSLDMGKIPLSSFPIVDSETEKRISALVEKEVSSLTVADISRRCVIPSTYA-CSGR 617
Query: 443 AVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAP 502
+DK + GK+E S++A++ AL+KL+ +G +++DAKAVCE EVL Q+ +W NKL+VYLAP
Sbjct: 618 QIDKIVVRGKLERSIQAVKAALQKLE-NGGAVDDAKAVCESEVLRQLTRWHNKLRVYLAP 676
Query: 503 FLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGK 562
F+HGMRYTSFGRHFTK +KL I +KLHWYV GDMIVDF CG NDFS MK+KLD+ GK
Sbjct: 677 FIHGMRYTSFGRHFTKKEKLIEIAEKLHWYVQPGDMIVDFSCGTNDFSQFMKEKLDKVGK 736
Query: 563 NCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKAL 622
C +KNYD++ KN F+FEKRDWMTV KEL GS+LIMGLNPPFG KA LANKFI+KAL
Sbjct: 737 RCNFKNYDVIQPKNSFSFEKRDWMTVRQKELPHGSKLIMGLNPPFGPKAMLANKFIDKAL 796
Query: 623 EFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTA 682
F PKL+ILIVP E ERLDRK+ Y+LVWEDDQ LSGKSFYLPGS+D +DKQ+DQWN +
Sbjct: 797 TFKPKLIILIVPKEAERLDRKQQPYDLVWEDDQRLSGKSFYLPGSLDVSDKQIDQWNKSP 856
Query: 683 PPLYLWSRHDYAAHHKALAEKHGH 706
PPLYLWSR D+ HK +AE+HGH
Sbjct: 857 PPLYLWSRPDWTQKHKRIAEQHGH 880
>gi|218200976|gb|EEC83403.1| hypothetical protein OsI_28850 [Oryza sativa Indica Group]
Length = 1283
Score = 690 bits (1781), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/683 (51%), Positives = 454/683 (66%), Gaps = 52/683 (7%)
Query: 29 FDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
FDSVC+ CDNGG+LLCCEG C+RSFHA I GE+S+CA+LG TK EV+A+ NF CKNC++
Sbjct: 217 FDSVCAICDNGGELLCCEGPCMRSFHAKIRDGEDSYCATLGYTKAEVKALKNFVCKNCDH 276
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
KQHQCF CG+L SD A+VF C +ATCGHFYHP CV++LL + A ++ K I+A
Sbjct: 277 KQHQCFVCGELEPSDG-PNAKVFLCNNATCGHFYHPRCVAQLLHPNSRNEASEMEKKIMA 335
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRA 208
G SFTCP+H C CK E++ LQFAVCRRCP++YHRKCLPR+I+FED +GIITRA
Sbjct: 336 GFSFTCPVHWCFHCKGLEDRTQEPLQFAVCRRCPRSYHRKCLPREISFEDINTQGIITRA 395
Query: 209 WEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEEN-----KTIIDRPRKKQSLASP 263
WE + RILIYCL HEID +IGTP RDHI FP +E++ K + + KK+ +
Sbjct: 396 WEL---SKRILIYCLDHEIDLDIGTPPRDHIKFPHVEKSAYSAKKKVKELAEKKRRICDD 452
Query: 264 SGKQKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKAT 323
S + ++ L K +G S KA G E E L
Sbjct: 453 SYVSEPLQKRAKLNEKFNAKGDKSKKA----------GVKSEFEEVL------------- 489
Query: 324 DVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETS 383
+S K +SL K+T E L + +RE
Sbjct: 490 ----ESEKKKTRSL-----------KKRTQPEEPLVECAAAAAANNA---NRPVKEREKE 531
Query: 384 RTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSA 443
+ + K +S P +D ++++R+S+L++ SS+ + +I +R IPST+A S
Sbjct: 532 LGTSSLDMGKIPLSSFPIVDSETEKRISALVEKEVSSLTVADISRRCVIPSTYA-CSGRQ 590
Query: 444 VDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPF 503
+DK + GK+E S++A++ AL+KL+ +G +++DAKAVCE EVL Q+ +W NKL+VYLAPF
Sbjct: 591 IDKIVVRGKLERSIQAVKAALEKLE-NGGAVDDAKAVCESEVLRQLTRWHNKLRVYLAPF 649
Query: 504 LHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKN 563
+HGMRYTSFGRHFTK +KL I +KLHWYV GDMIVDF CG NDFS MK+KLD+ GK
Sbjct: 650 IHGMRYTSFGRHFTKKEKLIEIAEKLHWYVQPGDMIVDFSCGTNDFSQFMKEKLDKVGKR 709
Query: 564 CLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALE 623
C +KNYD++ KN F+FEKRDWMTV KEL GS+LIMGLNPPFG KA LANKFI+KAL
Sbjct: 710 CNFKNYDVIQPKNSFSFEKRDWMTVRQKELPHGSKLIMGLNPPFGPKAMLANKFIDKALT 769
Query: 624 FNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAP 683
F PKL+ILIVP E ERLDRK+ Y+LVWEDDQ LSGKSFYLPGS+D +DKQ+DQWN + P
Sbjct: 770 FKPKLIILIVPKEAERLDRKQQPYDLVWEDDQRLSGKSFYLPGSLDVSDKQIDQWNKSPP 829
Query: 684 PLYLWSRHDYAAHHKALAEKHGH 706
PLYLWSR D+ HK +AE+HGH
Sbjct: 830 PLYLWSRPDWTQKHKRIAEQHGH 852
>gi|357512983|ref|XP_003626780.1| hypothetical protein MTR_8g009000 [Medicago truncatula]
gi|355520802|gb|AET01256.1| hypothetical protein MTR_8g009000 [Medicago truncatula]
Length = 1054
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/711 (48%), Positives = 471/711 (66%), Gaps = 49/711 (6%)
Query: 24 EEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFC 83
E+D +D+VC+ CDNGG++L CEGRCLRSFHAT++ G +S CASLG T+ EV A NF+C
Sbjct: 305 EQDIGYDTVCAICDNGGEILPCEGRCLRSFHATLEDGRDSLCASLGYTRTEVNAFPNFYC 364
Query: 84 KNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLA 143
+NC++K+HQCFACGKLGSSD+ + EVFPCV+A CGH+YHP CV++LL ++ +++
Sbjct: 365 ENCKHKKHQCFACGKLGSSDESSNPEVFPCVTANCGHYYHPECVARLLYPGIDIGQEEMR 424
Query: 144 KSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEG 203
K II ++F CPLH C +C++GEN+ DLQFA+CRRCPKAYHRKCLP++I+F G
Sbjct: 425 KRIIIEKTFVCPLHICSLCRKGENRNVHDLQFAMCRRCPKAYHRKCLPKEISFTYDYYTG 484
Query: 204 IITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIID-RPRKKQSLAS 262
I RAW+ LL + RIL+YC+ H+I E+GTP RDH+IFP E + II ++ A
Sbjct: 485 IEMRAWDNLL-DKRILMYCMNHKIVLELGTPARDHLIFPNKEVKRKIISTESLHREKDAI 543
Query: 263 PSGK-------QKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSD 315
P K K K ++ + Q S K ++++ SK Q M V D
Sbjct: 544 PLKKFFEDLLPNKTLKPKMTIKERVGLQMGGSSKVMEKICSK--QDTHMSTGP---VYFD 598
Query: 316 SSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQ 375
+R+ + ++ G+ +SL + + + S RL ++ ++ +Q
Sbjct: 599 RARKY----LKVETMSGSNRSLP-NYENKVPLKNVNLSCNPRL---------DEARY-QQ 643
Query: 376 DNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPST 435
S T KP KK+ + L D ++R+ +LM++A S++ ME+ K + +T
Sbjct: 644 KRSVGRIEETSWKKPPVKKVKTSLEDRKADMEKRILALMEEATSTLNMEKFKKDNHAVNT 703
Query: 436 HAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKN- 494
+ +++ K +TLGKVEGSV+A++ AL+KLD +G IE+AKA+CEPE++ Q+F W+N
Sbjct: 704 SSSLTETVFRKTLTLGKVEGSVKAVQIALQKLD-EGCGIEEAKAICEPEIIRQLFTWQNE 762
Query: 495 ------------------KLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDG 536
+LK+YLAPFLHG RYTSFGRHFTK+DKL+ IVD+LHWYV G
Sbjct: 763 VAGIKSNSRPLGYRGSDKQLKIYLAPFLHGRRYTSFGRHFTKIDKLKEIVDRLHWYVQSG 822
Query: 537 DMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPG 596
D ++DFCCGANDFSCLMK KL++TGK C +KNYD+ AKNDFNFEKRDWM+V+ +EL G
Sbjct: 823 DTVLDFCCGANDFSCLMKSKLEQTGKLCSFKNYDLFQAKNDFNFEKRDWMSVQAEELPHG 882
Query: 597 SRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQF 656
S LI+GLNPPFGV+ LANKFI+KAL F PKLLILIVP T RLDRK++ Y L+W D++
Sbjct: 883 SNLIIGLNPPFGVRGSLANKFIDKALTFKPKLLILIVPKVTRRLDRKKAGYNLIWADEEI 942
Query: 657 LSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHI 707
SGKSFYLPGSVD DKQ++ WN+ PPLYLWSR D+ A H +A H HI
Sbjct: 943 CSGKSFYLPGSVDTRDKQLEDWNLKPPPLYLWSRPDWTAWHMQIARMHRHI 993
>gi|357141809|ref|XP_003572354.1| PREDICTED: uncharacterized protein LOC100831561 [Brachypodium
distachyon]
Length = 888
Score = 639 bits (1648), Expect = e-180, Method: Compositional matrix adjust.
Identities = 352/775 (45%), Positives = 460/775 (59%), Gaps = 98/775 (12%)
Query: 18 EEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEA 77
E D+ D + D +C+ CDNGG++ CEG+CLRSFHAT AGE+ C +LG T + +A
Sbjct: 3 ENDDEVSSDSI-DLICALCDNGGEIASCEGKCLRSFHATKGAGED--CKTLGYTTVQFDA 59
Query: 78 MLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEV 137
+ F CKNCE + +QCFAC +LGS+ K EVFPC S CGHFYH CV+ LL ++E
Sbjct: 60 IKVFLCKNCEREIYQCFACHRLGSA-KTNPPEVFPCASPNCGHFYHAKCVAHLLFPENEA 118
Query: 138 AADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFE 197
A + I+ G F CP+H+C +CK GE K LQFAVCRRCPK+YHR+CLPRKI+F+
Sbjct: 119 KATEYMTKIVNGAKFGCPVHRCDVCKYGEKKEVEALQFAVCRRCPKSYHRRCLPRKISFD 178
Query: 198 DKLEEGI--ITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPR 255
D +E G+ RAW+GLLPN+RILIYC+KH+ID ++ TP+RDHI FPG
Sbjct: 179 DIIENGVCHFQRAWDGLLPNNRILIYCMKHDIDPKLRTPLRDHIKFPG------------ 226
Query: 256 KKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSD 315
P+ +K + +V ++KRV K + E E + V
Sbjct: 227 ------DPALSRKPS----------------NVNSMKRV--KIRRLEECVAEECVAVPLS 262
Query: 316 SSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQ 375
+S+R+ T S K + + ++ GE+ VT + F +
Sbjct: 263 NSKRSFGTTTCSSSSNLIAKRMKAPESGGAKACYRRPVSGEKA----VTSVIPVSSFPEV 318
Query: 376 D-NSDR-------ETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEIL 427
D N+ R +TS +T++ ++KKL+ +S MK+ +
Sbjct: 319 DINTARRIYEFAQKTSSEITIEDVQKKLV---------VSSTYTSFMKNTDT-------- 361
Query: 428 KRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLS 487
ITLGKVE SVEAIRTAL L+ +G+ IE AK VC L
Sbjct: 362 --------------------ITLGKVEKSVEAIRTALHMLE-NGAGIEAAKGVCTQHDLF 400
Query: 488 QIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGAN 547
Q+ KWKNKL +YLAPFLHGMRYTS+GRHFTK+DKL+ IVDKL WYV GD +VDFCCG+N
Sbjct: 401 QLAKWKNKLNIYLAPFLHGMRYTSYGRHFTKLDKLEKIVDKLQWYVQSGDTVVDFCCGSN 460
Query: 548 DFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPF 607
DFS L+K+KL+++ KNC YKNYD++ KNDFNFE+RDWMTV+P EL GSRLIMGLNPPF
Sbjct: 461 DFSTLLKEKLEDSEKNCFYKNYDLIQPKNDFNFERRDWMTVQPDELPAGSRLIMGLNPPF 520
Query: 608 GVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGS 667
G KA LAN+FINKAL F PKL++LIVP ETERLD+K YEL+WED LSGKSFYLPGS
Sbjct: 521 GFKASLANQFINKALSFKPKLIVLIVPRETERLDKKYPPYELIWEDSNQLSGKSFYLPGS 580
Query: 668 VDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHI-SRPQSRTQMERNCYETHAV 726
D ++KQM+QWNM+ PPL LWSR DYA H +A GH+ S+ RN + +V
Sbjct: 581 FDADNKQMEQWNMSPPPLSLWSRSDYAQRHYEIARSKGHLCSKNPCSGDSHRNIADIASV 640
Query: 727 DHPKEEGQGDA-----SMLIDLPLQINVTKELRNEAREDDKAGFPDNATEGGGES 776
+ G+A M + P +V ++L + D + D T+ G S
Sbjct: 641 STTRNLEIGNADINDEEMQGERPATASVIEQLLADTYHDTTSSPGDYWTDTNGRS 695
>gi|218201406|gb|EEC83833.1| hypothetical protein OsI_29778 [Oryza sativa Indica Group]
Length = 955
Score = 618 bits (1593), Expect = e-174, Method: Compositional matrix adjust.
Identities = 353/803 (43%), Positives = 466/803 (58%), Gaps = 135/803 (16%)
Query: 16 DTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDE- 74
D E ES+ D VC+ CDNGG++ CEG+CLRSFHA DAGE+ C +LG T+ +
Sbjct: 3 DNAEGESSAN---VDLVCALCDNGGEIASCEGKCLRSFHAVRDAGED--CQTLGYTRRQF 57
Query: 75 ---------------------------------VEAMLNFFCKNCEYKQHQCFACGKLGS 101
++A+ F CKNCE +++QCFAC +LGS
Sbjct: 58 DVSVLNLPLMNDTVFTSHVLHILDGNIHLAFPYLQALNPFLCKNCELEKYQCFACMRLGS 117
Query: 102 SDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCI 161
+ +T EVFPC SA CG+FYH CV++LL ++E A + I +G F CPLHKC +
Sbjct: 118 AKTDT-PEVFPCASANCGYFYHAKCVAQLLFTENEAKALEYTTKIASGVKFACPLHKCDV 176
Query: 162 CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGII--TRAWEGLLPNHRIL 219
CK GENK + +LQFAVCRR KIAF+D ++ G+ RAWEGLLPN+RIL
Sbjct: 177 CKYGENKDEKELQFAVCRR------------KIAFDDFVDNGVFHFQRAWEGLLPNNRIL 224
Query: 220 IYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSK 279
I+CLKH+ID ++ TP RDHI FP +N + +P + +KV K L
Sbjct: 225 IFCLKHDIDPKLRTPTRDHIKFP---DNPAVTRKPFDVNGM-----NKKV--VKIRLLED 274
Query: 280 APPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSV 339
PP + SD +KSF G V S
Sbjct: 275 CPPAP---------------------------LSSD-----------KKSF-GTVNRFS- 294
Query: 340 QVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISEL 399
SS V +KK + V+ GT+ RE + + PL S
Sbjct: 295 ----SSDVITKKRKV-------LVSGGTKHCALSA---VAREKTSVPSFIPL-----SSF 335
Query: 400 PSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEA 459
P +D+ ++RR+ + +S + +E+I K+ +PSTH SK+ DK ITLG V+ SVEA
Sbjct: 336 PVIDKSTERRIHEFAQKVSSDITIEDIQKKLVVPSTHTPVSKN-TDK-ITLGMVQRSVEA 393
Query: 460 IRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKV 519
I AL L+ +G+SIED K+VC P L Q+ +WKNKL +YLAPFLHGMRYTS+GRHFTK+
Sbjct: 394 INAALHMLE-NGASIEDVKSVCAPSDLFQLARWKNKLNIYLAPFLHGMRYTSYGRHFTKL 452
Query: 520 DKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFN 579
DKL+ IVD+L WY+ GD +VDFCCG+NDFS L+K+KL+ + K+C YKN+D++ KNDFN
Sbjct: 453 DKLEQIVDRLQWYIESGDTVVDFCCGSNDFSLLLKEKLEASEKSCFYKNFDLIQPKNDFN 512
Query: 580 FEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETER 639
FE+RDWMTV+P EL G RLIMGLNPPFG KA LAN+FINKAL F PKL+ILIVP ETER
Sbjct: 513 FERRDWMTVQPDELPTGCRLIMGLNPPFGFKASLANQFINKALTFKPKLIILIVPKETER 572
Query: 640 LDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKA 699
LDRK YEL+WED L+GKSFYLPGS+D ++K M+QWNM+ PPL LWSR D+A HK
Sbjct: 573 LDRKYPPYELIWEDSHQLAGKSFYLPGSLDADNKIMEQWNMSPPPLSLWSRSDWARKHKE 632
Query: 700 LAEKHGHISR-PQSRTQMERNCYETHAVDHPKEEGQGDASM-----LIDLPLQINVTKEL 753
+A+ GHIS+ +R+ T H + +GD + + PL +V +L
Sbjct: 633 IAKTMGHISKNVWCLDDTQRSVVNT---GHAQMANEGDDDLDNKERQEEAPLNASVIDQL 689
Query: 754 RNEAREDDKAGFPDNATEGGGES 776
++ D + D T+ G S
Sbjct: 690 LSDTYHDPTSSPGDYWTDTNGRS 712
>gi|222640820|gb|EEE68952.1| hypothetical protein OsJ_27843 [Oryza sativa Japonica Group]
Length = 846
Score = 544 bits (1401), Expect = e-151, Method: Compositional matrix adjust.
Identities = 304/675 (45%), Positives = 401/675 (59%), Gaps = 95/675 (14%)
Query: 110 VFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKA 169
VFPC SA CG+FYH CV++LL ++E A + I +G F CPLHKC +CK GENK
Sbjct: 16 VFPCASANCGYFYHAKCVAQLLFTENEAKALEYTTKIASGVKFACPLHKCDVCKYGENKD 75
Query: 170 DSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGII--TRAWEGLLPNHRILIYCLKHEI 227
+ +LQFAVCRR KIAF+D ++ G+ RAWEGLLPN+RILI+CLKH+I
Sbjct: 76 EKELQFAVCRR------------KIAFDDFVDNGVFHFQRAWEGLLPNNRILIFCLKHDI 123
Query: 228 DDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQGKFS 287
D ++ TP RDHI FP +N + +P F
Sbjct: 124 DPKLRTPTRDHIKFP---DNPAVTRKP-------------------------------FD 149
Query: 288 VKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSV 347
V + + K + +E + SD +KSF G V S SS V
Sbjct: 150 VNGMNK---KVVKIRLLEDCPPAPLSSD-----------KKSF-GTVNRFS-----SSDV 189
Query: 348 DSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSK 407
+KK + V+ GT+ RE + + PL S P +D+ ++
Sbjct: 190 ITKKRKV-------LVSGGTKHCALSA---VAREKTSVPSFIPL-----SSFPVIDKSTE 234
Query: 408 RRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKL 467
RR+ + +S + +E+I K+ +PSTH SK+ DK ITLG V+ SVEAI AL L
Sbjct: 235 RRIHEFAQKVSSDITIEDIQKKLVVPSTHTPVSKN-TDK-ITLGMVQRSVEAINAALHML 292
Query: 468 DVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVD 527
+ +G+SIED K+VC P L Q+ +WKNKL +YLAPFLHGMRYTS+GRHFTK+DKL+ IVD
Sbjct: 293 E-NGASIEDVKSVCAPSDLFQLARWKNKLNIYLAPFLHGMRYTSYGRHFTKLDKLEQIVD 351
Query: 528 KLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMT 587
+L WY+ GD +VDFCCG+NDFS L+K+KL+ + K+C YKN+D++ KNDFNFE+RDWMT
Sbjct: 352 RLQWYIESGDTVVDFCCGSNDFSLLLKEKLEASEKSCFYKNFDLIQPKNDFNFERRDWMT 411
Query: 588 VEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAY 647
V+P EL G RLIMGLNPPFG KA LAN+FINKAL F PKL+ILIVP ETERLDRK Y
Sbjct: 412 VQPDELPTGCRLIMGLNPPFGFKASLANQFINKALTFKPKLIILIVPKETERLDRKYPPY 471
Query: 648 ELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHI 707
EL+WED L+GKSFYLPGS+D ++K M+QWNM+ PPL LWSR D+A HK +A+ GHI
Sbjct: 472 ELIWEDSHQLAGKSFYLPGSLDADNKIMEQWNMSPPPLSLWSRSDWARKHKEIAKTMGHI 531
Query: 708 SR-PQSRTQMERNCYETHAVDHPKEEGQGDASM-----LIDLPLQINVTKELRNEAREDD 761
S+ +R+ T H + +GD + + PL +V +L ++ D
Sbjct: 532 SKNVWCLDDTQRSVVNT---GHAQMANEGDDDLDNKERQEEAPLNASVIDQLLSDTYHDP 588
Query: 762 KAGFPDNATEGGGES 776
+ D T+ G S
Sbjct: 589 TSSPGDYWTDTNGRS 603
>gi|224061083|ref|XP_002300346.1| predicted protein [Populus trichocarpa]
gi|222847604|gb|EEE85151.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 474 bits (1220), Expect = e-130, Method: Compositional matrix adjust.
Identities = 236/428 (55%), Positives = 320/428 (74%), Gaps = 6/428 (1%)
Query: 95 ACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTC 154
AC LG +++E EVF C +ATCGHFYHPHC + +L R+D+VAA++L K I AG+SF C
Sbjct: 57 ACESLGFTNREV--EVFRCANATCGHFYHPHCAATMLHREDKVAAEELRKKIAAGDSFAC 114
Query: 155 PLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLP 214
P+HKCCICKQ E+K SDLQFAVCRRCP +YH+KCLP++IAFE++ +E I RAW+ LLP
Sbjct: 115 PIHKCCICKQVEDKKKSDLQFAVCRRCPTSYHQKCLPKEIAFENEADEDTIARAWQNLLP 174
Query: 215 NHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKS 274
N RILIYCLKH+I ++IGTP+RDHI FP + K + +K+++ P+ +++ S K
Sbjct: 175 N-RILIYCLKHDIIEDIGTPVRDHIRFPDV-GGKNTAAKVQKRKTSELPANEEESLSKKK 232
Query: 275 SLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNV 334
LTS+ G F KA K + S A + SE++ S+S + + + SRKS + N
Sbjct: 233 RLTSEESFSGTFRTKASKEMFSSAKIVKITNDSEQISSESNSLGKMRMNNPSRKSIRENT 292
Query: 335 KSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKK 394
KS S +V+RS++ + KTSLG++LY F+T + + K KQD E +++ VK + KK
Sbjct: 293 KSASSEVERSTAANVNKTSLGDKLY-GFMTIKSGKAKLRKQDIFGSELDKSLAVKSVGKK 351
Query: 395 LISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVE 454
L SELPSLD D++RRL +L+K+AASS+ ++ ++K+H++PSTH ++SK+ VDK ITLGKVE
Sbjct: 352 LTSELPSLDADTQRRLLALVKEAASSITLDNVIKKHEVPSTHVHSSKNVVDKNITLGKVE 411
Query: 455 GSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGR 514
G+VEA+RTALKKL+ + SIEDAKAVCEP+VL+Q+FKWKNKLKVYLAPFL+GMRYTSFGR
Sbjct: 412 GTVEAVRTALKKLE-EKCSIEDAKAVCEPDVLNQVFKWKNKLKVYLAPFLYGMRYTSFGR 470
Query: 515 HFTKVDKL 522
HFTKV+KL
Sbjct: 471 HFTKVEKL 478
>gi|414588820|tpg|DAA39391.1| TPA: putative EDM2-like family protein [Zea mays]
Length = 751
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 238/542 (43%), Positives = 331/542 (61%), Gaps = 48/542 (8%)
Query: 4 FIVDD--MEEDMVHDTEEDESNEED-ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAG 60
FI DD ++E +V D + + EED +LFDS+C+ CD+GGD+LCC+G C+RSFHA +G
Sbjct: 242 FITDDEDIDEMIVEDADNESDEEEDGDLFDSICAICDDGGDILCCDGPCMRSFHAKEGSG 301
Query: 61 EESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGH 120
E+S+C +LG T+ EVEAM F CKNCEYKQHQCF CG L S+ A+VF C +ATCGH
Sbjct: 302 EDSYCDTLGYTEAEVEAMKLFLCKNCEYKQHQCFICGVLEPSNG-AAAKVFLCNNATCGH 360
Query: 121 FYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRR 180
FYHP CV++ L ++ A +L K I G SFTCP+H C CK E++ LQFAVCRR
Sbjct: 361 FYHPKCVAQQLHPNNIDEASELEKKITDGFSFTCPIHWCFHCKGLEDRTQGPLQFAVCRR 420
Query: 181 CPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHII 240
CPK+YHRKCLPR+I+F++ E+G+I RAWE + ILIYCL HEID E+ TP+RDHI
Sbjct: 421 CPKSYHRKCLPREISFDEIEEKGVIIRAWEL---SKMILIYCLNHEIDSELETPVRDHIK 477
Query: 241 FPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQ 300
FP IE+ + KK + P K++ T S + PP+
Sbjct: 478 FPRIEKPLQFV----KKGANLLPKKKKR---TYSESILEQPPK----------------- 513
Query: 301 GETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYA 360
+T+++ ++ + + +VS KSF NV VD K + Y
Sbjct: 514 -DTVKMKSKVRMQESEQNKHDTREVSVKSFTQNV------VDTPVKKKGKLKENKQLPYV 566
Query: 361 AFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASS 420
A + F S +E + + PL S P +D + ++R+ S++ + SS
Sbjct: 567 A-------KDHFVSSPKSVKEQEQELVPLPLSGIRKSSFPKVDSEIEKRVISILGNEISS 619
Query: 421 VRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAV 480
+ ++++ + +PSTH Y+ + DK + LGK+E SV+A+ ALKKL +G S+ DAKAV
Sbjct: 620 LTIKDVTRNCSLPSTHVYSGRPT-DK-MPLGKIERSVQAVEAALKKLK-NGGSVNDAKAV 676
Query: 481 CEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIV 540
CEP+VL Q+ +W KL+VY++PF+HG RY+SFGRHFTKV+KL IVD+LHWYV GDM+
Sbjct: 677 CEPDVLRQLARWHTKLRVYISPFIHGTRYSSFGRHFTKVEKLVEIVDRLHWYVESGDMVY 736
Query: 541 DF 542
F
Sbjct: 737 FF 738
>gi|115477142|ref|NP_001062167.1| Os08g0502000 [Oryza sativa Japonica Group]
gi|42407765|dbj|BAD08911.1| unknown protein [Oryza sativa Japonica Group]
gi|113624136|dbj|BAF24081.1| Os08g0502000 [Oryza sativa Japonica Group]
gi|215740510|dbj|BAG97166.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 694
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 191/314 (60%), Positives = 243/314 (77%), Gaps = 3/314 (0%)
Query: 396 ISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEG 455
+S P +D+ ++RR+ + +S + +E+I K+ +PSTH SK+ DK ITLG V+
Sbjct: 71 LSSFPVIDKSTERRIHEFAQKVSSDITIEDIQKKLVVPSTHTPVSKN-TDK-ITLGMVQR 128
Query: 456 SVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRH 515
SVEAI AL L+ +G+SIED K+VC P L Q+ +WKNKL +YLAPFLHGMRYTS+GRH
Sbjct: 129 SVEAINAALHMLE-NGASIEDVKSVCAPSDLFQLARWKNKLNIYLAPFLHGMRYTSYGRH 187
Query: 516 FTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAK 575
FTK+DKL+ IVD+L WY+ GD +VDFCCG+NDFS L+K+KL+ + K+C YKN+D++ K
Sbjct: 188 FTKLDKLEQIVDRLQWYIESGDTVVDFCCGSNDFSLLLKEKLEASEKSCFYKNFDLIQPK 247
Query: 576 NDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPP 635
NDFNFE+RDWMTV+P EL G RLIMGLNPPFG KA LAN+FINKAL F PKL+ILIVP
Sbjct: 248 NDFNFERRDWMTVQPDELPTGCRLIMGLNPPFGFKASLANQFINKALTFKPKLIILIVPK 307
Query: 636 ETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAA 695
ETERLDRK YEL+WED L+GKSFYLPGS+D ++K M+QWNM+ PPL LWSR D+A
Sbjct: 308 ETERLDRKYPPYELIWEDSHQLAGKSFYLPGSLDADNKIMEQWNMSPPPLSLWSRSDWAR 367
Query: 696 HHKALAEKHGHISR 709
HK +A+ GHIS+
Sbjct: 368 KHKEIAKTMGHISK 381
>gi|356503198|ref|XP_003520398.1| PREDICTED: uncharacterized protein LOC100810533 [Glycine max]
Length = 973
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/353 (55%), Positives = 251/353 (71%), Gaps = 26/353 (7%)
Query: 379 DRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHK-IPSTHA 437
D +T KPL KK+ + L + + + + + SLMKD S+ EE K H+ +T
Sbjct: 559 DGSIGKTGIEKPL-KKIQTSLDANNAEMENSILSLMKDTMSTFNEEEFKKNHQAFSTTSG 617
Query: 438 YASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWK---- 493
+ + K +T GKVEGSV+AI+TAL++L+ +G SIE+AKA+C+P VL Q+F W+
Sbjct: 618 FTEPVSHHKNLTQGKVEGSVKAIQTALQRLE-EGGSIEEAKAICDPGVLHQLFIWQFSDQ 676
Query: 494 -------------------NKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVN 534
+LKVYL+PFLHGMRYTSFGRHFTK+DKL+ + ++LHWYV
Sbjct: 677 SGHSKNQLISEPVRTGLKQKQLKVYLSPFLHGMRYTSFGRHFTKIDKLKEVANRLHWYVQ 736
Query: 535 DGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELA 594
+GD ++DFCCG+NDFS LMK KL+E GK+C +KNYD+ KNDFNFEKRDWM+V +EL
Sbjct: 737 NGDTVLDFCCGSNDFSRLMKSKLEEMGKSCSFKNYDLFQPKNDFNFEKRDWMSVNAEELP 796
Query: 595 PGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDD 654
GS+LI+GLNPPFGVK LANKFINKAL F PKLLILIVP T+RLDRK+ Y+L+WEDD
Sbjct: 797 NGSQLIIGLNPPFGVKGSLANKFINKALTFKPKLLILIVPKVTKRLDRKKEGYDLIWEDD 856
Query: 655 QFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHI 707
+ LSGKSFYLPGSVD DKQ++ WN+ PPLYLWSR D+ H+ +A+KH HI
Sbjct: 857 EMLSGKSFYLPGSVDTRDKQLEDWNLKPPPLYLWSRPDWTTKHREIAQKHCHI 909
Score = 304 bits (779), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 146/268 (54%), Positives = 192/268 (71%), Gaps = 13/268 (4%)
Query: 2 SGFIVD-DMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAG 60
S FIVD + EE+ D E D E++ +D+VC+ CDNGG++L CEGRCLRSFHAT +AG
Sbjct: 307 SKFIVDSEDEENDQSDGELDPDGEQNIGYDTVCAICDNGGEILPCEGRCLRSFHATKEAG 366
Query: 61 EESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGH 120
++ C SLG T +V+A NF+C+NC+YK HQCFACGKLGSSD + AEVFPCV+A CGH
Sbjct: 367 IDAVCESLGYTSAQVKAFPNFYCQNCKYKLHQCFACGKLGSSDVSSKAEVFPCVTANCGH 426
Query: 121 FYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRR 180
+YHP CV++LL + +++ K + G++F CPLH C +CK+GENK DLQFA+CRR
Sbjct: 427 YYHPECVARLLSPSIDTEQEEMRKKVAMGKAFVCPLHICSLCKKGENKNFHDLQFAICRR 486
Query: 181 CPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHII 240
CPKAYHRKCLPR+I+F ++ I RAW+GLL +H+ILIYCL H I E+GTP RDH++
Sbjct: 487 CPKAYHRKCLPREISFVFDNDKSIEQRAWDGLL-DHQILIYCLDHVIVRELGTPARDHLV 545
Query: 241 FPGIEEN-----------KTIIDRPRKK 257
FP ++ KT I++P KK
Sbjct: 546 FPDVKVRKKKKMVDGSIGKTGIEKPLKK 573
>gi|218188313|gb|EEC70740.1| hypothetical protein OsI_02147 [Oryza sativa Indica Group]
Length = 1296
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 196/369 (53%), Positives = 263/369 (71%), Gaps = 21/369 (5%)
Query: 397 SELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGS 456
S P +D ++++R+ +L++ AS + +++I ++ +PSTHAY+ + VD+ I GK+E S
Sbjct: 486 SSFPRVDSETEKRVMALVEREASYLTLKDISRKCLMPSTHAYSGRQ-VDRIIATGKLERS 544
Query: 457 VEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHF 516
V+A+ ALK L+ +G ++ DAKAVCEPEVL Q+ +W +KL+VY++PF++G RY+SFGRHF
Sbjct: 545 VQAVGAALKMLE-NGGNVNDAKAVCEPEVLKQLTRWHSKLRVYISPFIYGTRYSSFGRHF 603
Query: 517 TKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKN 576
TKV+KL IVDKLH YV GD IVDFCCGANDFS LMK+KLD+ K C +KNYD++ +N
Sbjct: 604 TKVEKLVEIVDKLHCYVEPGDTIVDFCCGANDFSRLMKEKLDKVQKKCHFKNYDLIQPQN 663
Query: 577 DFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPE 636
F FE++DWMTV PKEL GS+LIMGLNPPFGVKA LANKFI+KAL F PKL+ILIVP E
Sbjct: 664 CFAFERKDWMTVRPKELPHGSKLIMGLNPPFGVKASLANKFIDKALTFKPKLVILIVPKE 723
Query: 637 TERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAH 696
T+RLD+K++ Y+L+WED LSGK+FYLPGSVD NDK ++ WN +APPLYLWS D+
Sbjct: 724 TKRLDQKKTPYDLIWEDSDCLSGKAFYLPGSVDVNDKVVEGWNASAPPLYLWSHPDWTRK 783
Query: 697 HKALAEKHGHISRPQSRTQMERN-------------CYETHAVDHPKEEGQGDASM---- 739
H+ +AE+H H S + + +E + Y H KEE G S
Sbjct: 784 HRKVAEEHNHTSIGRIASHVEEDNLSDNLPMMKETESYGIHNSRSGKEENAGQTSCNLKG 843
Query: 740 --LIDLPLQ 746
L DLP++
Sbjct: 844 TNLSDLPVR 852
Score = 135 bits (340), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/117 (56%), Positives = 85/117 (72%), Gaps = 4/117 (3%)
Query: 4 FIVDDMEEDMVHDTEED---ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAG 60
FI DD + + + D + + + EE++LFDS+CS CDNGGDLLCC+G C+RSFHA I G
Sbjct: 204 FIADDEDIEEMADVDNNIESDEEEEEDLFDSICSICDNGGDLLCCDGPCMRSFHAKIGTG 263
Query: 61 EESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSAT 117
E+S+C +LG T+ EV+AM F CKNCE+KQHQCF CG L SD T A+VF C +AT
Sbjct: 264 EDSYCDTLGYTEAEVQAMKTFLCKNCEHKQHQCFICGALEPSDGPT-AKVFLCNNAT 319
>gi|222618537|gb|EEE54669.1| hypothetical protein OsJ_01967 [Oryza sativa Japonica Group]
Length = 1296
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 195/369 (52%), Positives = 263/369 (71%), Gaps = 21/369 (5%)
Query: 397 SELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGS 456
S P +D ++++R+ +L++ AS + +++I ++ +PSTHAY+ + VD+ I GK+E S
Sbjct: 486 SSFPRVDSETEKRVMALVEREASYLTLKDISRKCLMPSTHAYSGRQ-VDRIIATGKLERS 544
Query: 457 VEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHF 516
V+A+ ALK L+ +G ++ DAKAVCEPEVL Q+ +W +KL+VY++PF++G RY+SFGRHF
Sbjct: 545 VQAVGAALKMLE-NGGNVNDAKAVCEPEVLKQLTRWHSKLRVYISPFIYGTRYSSFGRHF 603
Query: 517 TKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKN 576
TKV+KL IVDKLH YV GD IVDFCCGANDFS LMK+KLD+ K C +KNYD++ +N
Sbjct: 604 TKVEKLVEIVDKLHCYVEPGDTIVDFCCGANDFSRLMKEKLDKVQKKCHFKNYDLIQPQN 663
Query: 577 DFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPE 636
F FE++DWMTV PKEL GS+LIMGLNPPFGVKA LANKFI+KAL F PKL+ILIVP E
Sbjct: 664 CFAFERKDWMTVRPKELPHGSKLIMGLNPPFGVKASLANKFIDKALTFKPKLVILIVPKE 723
Query: 637 TERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAH 696
T+RLD+K++ Y+L+WED LSGK+FYLPGSVD NDK ++ WN +APPL+LWS D+
Sbjct: 724 TKRLDQKKTPYDLIWEDSDCLSGKAFYLPGSVDVNDKVVEGWNASAPPLHLWSHPDWTRK 783
Query: 697 HKALAEKHGHISRPQSRTQMERN-------------CYETHAVDHPKEEGQGDASM---- 739
H+ +AE+H H S + + +E + Y H KEE G S
Sbjct: 784 HRKVAEEHNHTSIGRIASHVEEDNLSDNLPMMKETESYGIHNSRSGKEENAGQTSCNLKG 843
Query: 740 --LIDLPLQ 746
L DLP++
Sbjct: 844 TNLSDLPVR 852
Score = 134 bits (336), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 65/117 (55%), Positives = 84/117 (71%), Gaps = 4/117 (3%)
Query: 4 FIVDDMEEDMVHDTEED---ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAG 60
FI DD + + + D + + + EE++LFDS+CS CDNGGDLLCC+G C+RSFHA I G
Sbjct: 204 FIADDEDIEEMADVDNNIESDEEEEEDLFDSICSICDNGGDLLCCDGPCMRSFHAKIGTG 263
Query: 61 EESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSAT 117
E+S+C +LG T+ EV+AM F CKNCE+KQHQCF CG L SD A+VF C +AT
Sbjct: 264 EDSYCDTLGYTEAEVQAMKTFLCKNCEHKQHQCFICGALEPSDG-PAAKVFLCNNAT 319
>gi|413925249|gb|AFW65181.1| hypothetical protein ZEAMMB73_193747 [Zea mays]
Length = 584
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 187/330 (56%), Positives = 241/330 (73%), Gaps = 4/330 (1%)
Query: 410 LSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDV 469
+ + A++ + M+++ K+ +PST+A ++A + ITL VE VEA++ AL L+
Sbjct: 1 MYEFAQKASAGITMDDVKKKLVVPSTYAPNLQNADN--ITLENVEIYVEAVKGALHMLE- 57
Query: 470 DGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKL 529
+G+SI DAK+VC P VL Q+ KWKNKL V LAPFLHGMRYTS+GRHFTK+DKLQ IVDKL
Sbjct: 58 NGASIADAKSVCPPNVLFQLVKWKNKLSVVLAPFLHGMRYTSYGRHFTKLDKLQLIVDKL 117
Query: 530 HWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVE 589
WY+ GD +VDFCCG+NDFS L+++ L+ +GKNC YKNYD++ KNDFNFE+RDWMTV+
Sbjct: 118 QWYIQSGDTVVDFCCGSNDFSLLLREILESSGKNCFYKNYDLIQPKNDFNFERRDWMTVQ 177
Query: 590 PKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYEL 649
P EL G RL+MGLNPPFG KA LAN+FINKAL F PKL+ILIVP ETERLD+K YEL
Sbjct: 178 PDELPTGCRLVMGLNPPFGTKASLANQFINKALTFKPKLIILIVPKETERLDKKYPPYEL 237
Query: 650 VWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISR 709
+W+D LSGKSFYLPGS+D ++K M+QWN++ PPL LWSR D+A H +A+ GHIS
Sbjct: 238 IWQDSDQLSGKSFYLPGSLDADNKVMEQWNVSPPPLSLWSRSDWAERHSEIAKSMGHISN 297
Query: 710 PQSRTQMERNCYETHAVDHPKEEGQGDASM 739
Q + H +H + + G A M
Sbjct: 298 EIGDLQRDLAA-SIHTTEHVEADDAGVAGM 326
>gi|449453220|ref|XP_004144356.1| PREDICTED: uncharacterized protein LOC101205205 [Cucumis sativus]
Length = 707
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 207/425 (48%), Positives = 278/425 (65%), Gaps = 34/425 (8%)
Query: 108 AEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGEN 167
++VF CV+ATCG+FYHP C+S+LL +++VAA L K I +GESF+CP+HKC +C GEN
Sbjct: 8 SQVFQCVNATCGYFYHPKCISRLLHPENKVAAGDLEKKIASGESFSCPVHKCSVCALGEN 67
Query: 168 KADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEI 227
K +LQFAVCRRCPK+YHRKCLPRKI FE + TRAWE LLPN RILIYCL HEI
Sbjct: 68 KKIWELQFAVCRRCPKSYHRKCLPRKITFEGSEDGETPTRAWEKLLPN-RILIYCLDHEI 126
Query: 228 DDEIGTPIRDHIIFPGIEENKTIIDR-------PRKKQSLASPSGKQKVASTKSSLTSKA 280
D+EI TP RDHI FPG+EE++ I + R+ +++ ++ V S K S+
Sbjct: 127 DEEIETPARDHIKFPGLEESRLPIQKRKLPISDTRRGKTIVFRGSRENVVSKKGSMPDDL 186
Query: 281 PPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQ 340
QGK + K K + G+ + + S S+ K ++SR S N K SV
Sbjct: 187 --QGKSAAKVSKSFERSSSDGKLLGKVTAKSLWSSESKNVKLGNISRNSL--NQKGESVL 242
Query: 341 VDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELP 400
+D ++ KK+SL + +A T+ + +K K+D S +
Sbjct: 243 MDIDKTIKVKKSSLVGK--SAIPTKRFDPSKIYKEDRSG-------------------ML 281
Query: 401 SLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAI 460
LD +S+RRL +MK+ ASS+ +E+++K+HK+PSTHAY+ K VDK I +GK+EGSV A+
Sbjct: 282 LLDANSERRLMDMMKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSVVAV 341
Query: 461 RTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVD 520
R AL+KL+ +G IEDA+AVCEPEVL+ IFKWKNKL+VYLAPFL+GMRY+SFGRHFTKV+
Sbjct: 342 RAALRKLE-EGCCIEDAEAVCEPEVLNHIFKWKNKLRVYLAPFLYGMRYSSFGRHFTKVE 400
Query: 521 KLQAI 525
KL I
Sbjct: 401 KLVEI 405
>gi|449456251|ref|XP_004145863.1| PREDICTED: uncharacterized protein LOC101205827 [Cucumis sativus]
Length = 464
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 204/423 (48%), Positives = 273/423 (64%), Gaps = 34/423 (8%)
Query: 110 VFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKA 169
VF CV+ATCG+FYHP C+S+LL +++VAA L K I +GESF+CP+HKC +C GENK
Sbjct: 56 VFQCVNATCGYFYHPKCISRLLHPENKVAAGDLEKKIASGESFSCPVHKCSVCALGENKK 115
Query: 170 DSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDD 229
+LQFAVCRRCPK+YHRKCLPRKI FE + TRAWE LLPN RILIYCL HEID+
Sbjct: 116 IWELQFAVCRRCPKSYHRKCLPRKITFEGSEDGETPTRAWEKLLPN-RILIYCLDHEIDE 174
Query: 230 EIGTPIRDHIIFPGIEENKTIIDR-------PRKKQSLASPSGKQKVASTKSSLTSKAPP 282
EI TP RDHI FPG+EE++ I + R+ +++ ++ V S K S+
Sbjct: 175 EIETPARDHIKFPGLEESRLPIQKRKLPISDTRRGKTIVFRGSRENVVSKKGSMPDDL-- 232
Query: 283 QGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVD 342
QGK + K K + G+ + + S S K ++SR N K SV +D
Sbjct: 233 QGKSAAKVSKSFERSSSDGKLLGKVTAKSLWSSESENVKLGNISRNCL--NQKGESVLMD 290
Query: 343 RSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSL 402
++ KK+SL + +A T+ + +K K+D S + L
Sbjct: 291 IDKTIKVKKSSLVGK--SAIPTKRFDPSKIYKEDRSG-------------------MLLL 329
Query: 403 DEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRT 462
D +S+RRL +MK+ ASS+ +E+++K+HK+PSTHAY+ K VDK I +GK+EGSV +R
Sbjct: 330 DANSERRLMDMMKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSVVVVRA 389
Query: 463 ALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKL 522
AL+KL+ +G IEDA+AVCEPEVL+ IFKWKNKL+VYLAPFL+GMRY+SFGRHFTKV+KL
Sbjct: 390 ALRKLE-EGCCIEDAEAVCEPEVLNHIFKWKNKLRVYLAPFLYGMRYSSFGRHFTKVEKL 448
Query: 523 QAI 525
+
Sbjct: 449 VEV 451
>gi|240256412|ref|NP_199620.4| EDM2-like protein1 [Arabidopsis thaliana]
gi|332008234|gb|AED95617.1| EDM2-like protein1 [Arabidopsis thaliana]
Length = 677
Score = 352 bits (902), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 178/344 (51%), Positives = 234/344 (68%), Gaps = 15/344 (4%)
Query: 404 EDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTA 463
++ ++R+ ++ + SS +E +K + +Y S + K IT G V V A R A
Sbjct: 344 DEVEQRVMGIIDEVESSFNFDEFVKSRRPTHIKSYHSGNYFSKNITTGLVRTHVNAARAA 403
Query: 464 LKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQ 523
LK + +G DA+A+ +P+ L Q+ K K KL++ +PFLHGMRYTSFGRHFT +KL+
Sbjct: 404 LKMFE-EGRD-RDARALFDPDSLLQLMKHKKKLEI--SPFLHGMRYTSFGRHFTNPEKLK 459
Query: 524 AIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKR 583
IV++LHWYV +GD +VDFCCG+NDFSCLMK+KL ETGK C YKN D++P KN+FNFE R
Sbjct: 460 EIVERLHWYVENGDTVVDFCCGSNDFSCLMKEKLMETGKICFYKNLDLIPPKNNFNFEMR 519
Query: 584 DWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRK 643
DW++V+ +EL GS+LIMGLNPPFG KA LAN FI KALEF PK+LILIVP ET+R+D
Sbjct: 520 DWLSVKEEELPDGSQLIMGLNPPFGYKASLANTFIKKALEFKPKILILIVPSETKRVDAI 579
Query: 644 ESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEK 703
+ YEL+WED L+G SFYLPGSVD NDK ++QWN PPLYLWSR D + HK A +
Sbjct: 580 DD-YELIWEDRNLLAGMSFYLPGSVDVNDKTIEQWNNITPPLYLWSRRDLSRSHKTTAVQ 638
Query: 704 HGHISRPQSRTQMERNCY--ETHAVDHPKEEGQGDASMLIDLPL 745
GHI TQM Y ++H + P +G D M+ D+ +
Sbjct: 639 QGHI------TQMHHFTYNGDSHHTEIP--QGDDDTDMVQDMEI 674
Score = 192 bits (487), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 101/234 (43%), Positives = 135/234 (57%), Gaps = 28/234 (11%)
Query: 29 FDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLT-KDEVEAMLNFFCKNCE 87
FD VCS CDNGG +LCCEG CLRSFH TI G E+ C SLG T K +++A+ + C NC
Sbjct: 70 FDPVCSICDNGGYVLCCEGSCLRSFHPTIADGIETECESLGFTDKTQIQALGTYLCNNCL 129
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
YKQHQC+ACG+LGSSD+ +VFPC ++ CGHFYHP CV++LL DD+ A++L I
Sbjct: 130 YKQHQCYACGELGSSDENFSQQVFPCSASNCGHFYHPECVARLLCADDQNKAEELQAKIA 189
Query: 148 AGESFTCPLHKCCICKQGENK---------AD-------SDLQFAVCRRCPKAYHRKCLP 191
A + F CPLH C +C E+K AD ++ + R C
Sbjct: 190 ARDCFACPLHTCKLCNMSEDKNQYACILLYADVAQQLITENVYLGILLLNSTLTMRHCKE 249
Query: 192 R-KIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGI 244
K+ F +T A+ ++ H++L HEID I TP RDH++FP +
Sbjct: 250 LGKVCF--------LTTAFLYIVSFHKLLNR--AHEIDGLILTPARDHLVFPDV 293
>gi|238481509|ref|NP_001154768.1| EDM2-like protein1 [Arabidopsis thaliana]
gi|10177760|dbj|BAB11073.1| unnamed protein product [Arabidopsis thaliana]
gi|332008235|gb|AED95618.1| EDM2-like protein1 [Arabidopsis thaliana]
Length = 636
Score = 351 bits (900), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 178/344 (51%), Positives = 234/344 (68%), Gaps = 15/344 (4%)
Query: 404 EDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTA 463
++ ++R+ ++ + SS +E +K + +Y S + K IT G V V A R A
Sbjct: 303 DEVEQRVMGIIDEVESSFNFDEFVKSRRPTHIKSYHSGNYFSKNITTGLVRTHVNAARAA 362
Query: 464 LKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQ 523
LK + +G DA+A+ +P+ L Q+ K K KL++ +PFLHGMRYTSFGRHFT +KL+
Sbjct: 363 LKMFE-EGRD-RDARALFDPDSLLQLMKHKKKLEI--SPFLHGMRYTSFGRHFTNPEKLK 418
Query: 524 AIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKR 583
IV++LHWYV +GD +VDFCCG+NDFSCLMK+KL ETGK C YKN D++P KN+FNFE R
Sbjct: 419 EIVERLHWYVENGDTVVDFCCGSNDFSCLMKEKLMETGKICFYKNLDLIPPKNNFNFEMR 478
Query: 584 DWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRK 643
DW++V+ +EL GS+LIMGLNPPFG KA LAN FI KALEF PK+LILIVP ET+R+D
Sbjct: 479 DWLSVKEEELPDGSQLIMGLNPPFGYKASLANTFIKKALEFKPKILILIVPSETKRVDAI 538
Query: 644 ESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEK 703
+ YEL+WED L+G SFYLPGSVD NDK ++QWN PPLYLWSR D + HK A +
Sbjct: 539 DD-YELIWEDRNLLAGMSFYLPGSVDVNDKTIEQWNNITPPLYLWSRRDLSRSHKTTAVQ 597
Query: 704 HGHISRPQSRTQMERNCY--ETHAVDHPKEEGQGDASMLIDLPL 745
GHI TQM Y ++H + P +G D M+ D+ +
Sbjct: 598 QGHI------TQMHHFTYNGDSHHTEIP--QGDDDTDMVQDMEI 633
Score = 191 bits (485), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/234 (43%), Positives = 135/234 (57%), Gaps = 28/234 (11%)
Query: 29 FDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLT-KDEVEAMLNFFCKNCE 87
FD VCS CDNGG +LCCEG CLRSFH TI G E+ C SLG T K +++A+ + C NC
Sbjct: 29 FDPVCSICDNGGYVLCCEGSCLRSFHPTIADGIETECESLGFTDKTQIQALGTYLCNNCL 88
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
YKQHQC+ACG+LGSSD+ +VFPC ++ CGHFYHP CV++LL DD+ A++L I
Sbjct: 89 YKQHQCYACGELGSSDENFSQQVFPCSASNCGHFYHPECVARLLCADDQNKAEELQAKIA 148
Query: 148 AGESFTCPLHKCCICKQGENK---------AD-------SDLQFAVCRRCPKAYHRKCLP 191
A + F CPLH C +C E+K AD ++ + R C
Sbjct: 149 ARDCFACPLHTCKLCNMSEDKNQYACILLYADVAQQLITENVYLGILLLNSTLTMRHCKE 208
Query: 192 R-KIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGI 244
K+ F +T A+ ++ H++L HEID I TP RDH++FP +
Sbjct: 209 LGKVCF--------LTTAFLYIVSFHKLLNR--AHEIDGLILTPARDHLVFPDV 252
>gi|52354517|gb|AAU44579.1| hypothetical protein AT5G48090 [Arabidopsis thaliana]
gi|60547933|gb|AAX23930.1| hypothetical protein At5g48090 [Arabidopsis thaliana]
Length = 364
Score = 350 bits (898), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 178/344 (51%), Positives = 234/344 (68%), Gaps = 15/344 (4%)
Query: 404 EDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTA 463
++ ++R+ ++ + SS +E +K + +Y S + K IT G V V A R A
Sbjct: 31 DEVEQRVMGIIDEVESSFNFDEFVKSRRPTHIKSYHSGNYFSKNITTGLVRTHVNAARAA 90
Query: 464 LKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQ 523
LK + +G DA+A+ +P+ L Q+ K K KL++ +PFLHGMRYTSFGRHFT +KL+
Sbjct: 91 LKMFE-EGRD-RDARALFDPDSLLQLMKHKKKLEI--SPFLHGMRYTSFGRHFTNPEKLK 146
Query: 524 AIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKR 583
IV++LHWYV +GD +VDFCCG+NDFSCLMK+KL ETGK C YKN D++P KN+FNFE R
Sbjct: 147 EIVERLHWYVENGDTVVDFCCGSNDFSCLMKEKLMETGKICFYKNLDLIPPKNNFNFEMR 206
Query: 584 DWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRK 643
DW++V+ +EL GS+LIMGLNPPFG KA LAN FI KALEF PK+LILIVP ET+R+D
Sbjct: 207 DWLSVKEEELPDGSQLIMGLNPPFGYKASLANTFIKKALEFKPKILILIVPSETKRVDAI 266
Query: 644 ESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEK 703
+ YEL+WED L+G SFYLPGSVD NDK ++QWN PPLYLWSR D + HK A +
Sbjct: 267 DD-YELIWEDRNLLAGMSFYLPGSVDVNDKTIEQWNNITPPLYLWSRRDLSRSHKTTAVQ 325
Query: 704 HGHISRPQSRTQMERNCY--ETHAVDHPKEEGQGDASMLIDLPL 745
GHI TQM Y ++H + P +G D M+ D+ +
Sbjct: 326 QGHI------TQMHHFTYNGDSHHTEIP--QGDDDTDMVQDMEI 361
>gi|414588821|tpg|DAA39392.1| TPA: putative EDM2-like family protein [Zea mays]
Length = 731
Score = 350 bits (897), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 208/489 (42%), Positives = 289/489 (59%), Gaps = 48/489 (9%)
Query: 4 FIVDD--MEEDMVHDTEEDESNEED-ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAG 60
FI DD ++E +V D + + EED +LFDS+C+ CD+GGD+LCC+G C+RSFHA +G
Sbjct: 242 FITDDEDIDEMIVEDADNESDEEEDGDLFDSICAICDDGGDILCCDGPCMRSFHAKEGSG 301
Query: 61 EESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGH 120
E+S+C +LG T+ EVEAM F CKNCEYKQHQCF CG L S+ A+VF C +ATCGH
Sbjct: 302 EDSYCDTLGYTEAEVEAMKLFLCKNCEYKQHQCFICGVLEPSNG-AAAKVFLCNNATCGH 360
Query: 121 FYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRR 180
FYHP CV++ L ++ A +L K I G SFTCP+H C CK E++ LQFAVCRR
Sbjct: 361 FYHPKCVAQQLHPNNIDEASELEKKITDGFSFTCPIHWCFHCKGLEDRTQGPLQFAVCRR 420
Query: 181 CPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHII 240
CPK+YHRKCLPR+I+F++ E+G+I RAWE + ILIYCL HEID E+ TP+RDHI
Sbjct: 421 CPKSYHRKCLPREISFDEIEEKGVIIRAWEL---SKMILIYCLNHEIDSELETPVRDHIK 477
Query: 241 FPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQ 300
FP IE+ + KK + P K++ T S + PP+ +K+ R
Sbjct: 478 FPRIEKPLQFV----KKGANLLPKKKKR---TYSESILEQPPKDTVKMKSKVR------- 523
Query: 301 GETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYA 360
M+ SE+ + +VS KSF NV VD K + Y
Sbjct: 524 ---MQESEQ--------NKHDTREVSVKSFTQNV------VDTPVKKKGKLKENKQLPYV 566
Query: 361 AFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASS 420
A + F S +E + + PL S P +D + ++R+ S++ + SS
Sbjct: 567 A-------KDHFVSSPKSVKEQEQELVPLPLSGIRKSSFPKVDSEIEKRVISILGNEISS 619
Query: 421 VRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAV 480
+ ++++ + +PSTH Y+ + DK + LGK+E SV+A+ ALKKL +G S+ DAKAV
Sbjct: 620 LTIKDVTRNCSLPSTHVYSGRPT-DK-MPLGKIERSVQAVEAALKKLK-NGGSVNDAKAV 676
Query: 481 CEPEVLSQI 489
CEP+VL Q+
Sbjct: 677 CEPDVLRQL 685
>gi|297791955|ref|XP_002863862.1| hypothetical protein ARALYDRAFT_494862 [Arabidopsis lyrata subsp.
lyrata]
gi|297309697|gb|EFH40121.1| hypothetical protein ARALYDRAFT_494862 [Arabidopsis lyrata subsp.
lyrata]
Length = 625
Score = 345 bits (884), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 175/343 (51%), Positives = 234/343 (68%), Gaps = 19/343 (5%)
Query: 405 DSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTAL 464
D ++R+ ++ + SS +E +K K +Y S + + K IT G V+ V A R AL
Sbjct: 297 DVEQRVMGIIDEVESSFNFDEFVKSRKPTHIKSYHSGNDISKNITTGLVQTHVNAARAAL 356
Query: 465 KKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQA 524
K + +G +DA+A+ +P++L Q+ K K KL++YL+ RYTSFGRHFT +KL+
Sbjct: 357 KMFE-EGRD-KDARAIFDPDILLQLMKHKTKLEIYLS------RYTSFGRHFTNPEKLKE 408
Query: 525 IVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRD 584
IV++LHWYV +GD +VDFCCG+NDFSCLMK+KL ETGK C +KN+D++P KN+FNFEKRD
Sbjct: 409 IVERLHWYVENGDTVVDFCCGSNDFSCLMKEKLMETGKTCFFKNFDLIPPKNNFNFEKRD 468
Query: 585 WMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKE 644
W++V+P+EL S+LIMGLNPPFG KA LAN FI KALEF PK+LILIVP ET+R+D
Sbjct: 469 WLSVKPEELPDCSQLIMGLNPPFGHKASLANTFIKKALEFKPKILILIVPSETKRVDAI- 527
Query: 645 SAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKH 704
+ YEL+WED L+G SFYLPGSVD NDK ++QWN PPLYLWSR D++ HK A +
Sbjct: 528 ADYELIWEDRNLLAGMSFYLPGSVDVNDKTIEQWNNIPPPLYLWSRRDWSRSHKTTALQQ 587
Query: 705 GHISRPQSRTQMERNCY--ETHAVDHPKEEGQGDASMLIDLPL 745
HI TQM Y + H + P +G D M+ D+ +
Sbjct: 588 DHI------TQMHHFTYNGDCHHTEIP--QGGDDIDMVKDMEI 622
Score = 288 bits (737), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 133/243 (54%), Positives = 171/243 (70%), Gaps = 2/243 (0%)
Query: 3 GFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEE 62
FIV++M D + + D+ ++ + FD VCS CDNGG +LCCEG CLRSFH TI G E
Sbjct: 4 NFIVNNMI-DEENSSGSDDDSQTNLQFDPVCSICDNGGYVLCCEGSCLRSFHPTIADGIE 62
Query: 63 SHCASLGLT-KDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHF 121
+ C SLG T K +++A+ + C NC YKQHQC+ACG+LGSSD+ + +VFPC ++ CGHF
Sbjct: 63 TECESLGFTYKTQIQALGTYLCNNCLYKQHQCYACGELGSSDENSSQQVFPCSASNCGHF 122
Query: 122 YHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRC 181
YHP CV++LL DD++ A++L I A + F CPLH C +C E+K + L FAVCRRC
Sbjct: 123 YHPICVARLLCADDQIKAEELQAKIAARDYFACPLHICKVCNMSEDKNEYALHFAVCRRC 182
Query: 182 PKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIF 241
P AYHRKCLPR+I E ++ + RAWE LLP +RILIYCL HEID I TP RDH+IF
Sbjct: 183 PTAYHRKCLPREITSELNSDDDTLQRAWERLLPYNRILIYCLTHEIDGHILTPARDHLIF 242
Query: 242 PGI 244
P I
Sbjct: 243 PDI 245
>gi|38424061|dbj|BAD01751.1| unknown protein [Oryza sativa Japonica Group]
gi|38637135|dbj|BAD03389.1| unknown protein [Oryza sativa Japonica Group]
gi|215704583|dbj|BAG94216.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 653
Score = 342 bits (877), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 154/214 (71%), Positives = 177/214 (82%)
Query: 493 KNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCL 552
+NKL+VYLAPF+HGMRYTSFGRHFTK +KL I +KLHWYV GDMIVDF CG NDFS
Sbjct: 9 QNKLRVYLAPFIHGMRYTSFGRHFTKKEKLIEIAEKLHWYVQPGDMIVDFSCGTNDFSQF 68
Query: 553 MKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAG 612
MK+KLD+ GK C +KNYD++ KN F+FEKRDWMTV KEL GS+LIMGLNPPFG KA
Sbjct: 69 MKEKLDKVGKRCNFKNYDVIQPKNSFSFEKRDWMTVRQKELPHGSKLIMGLNPPFGPKAM 128
Query: 613 LANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDEND 672
LANKFI+KAL F PKL+ILIVP E ERLDRK+ Y+LVWEDDQ LSGKSFYLPGS+D +D
Sbjct: 129 LANKFIDKALTFKPKLIILIVPKEAERLDRKQQPYDLVWEDDQRLSGKSFYLPGSLDVSD 188
Query: 673 KQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGH 706
KQ+DQWN + PPLYLWSR D+ HK +AE+HGH
Sbjct: 189 KQIDQWNKSPPPLYLWSRPDWTQKHKRIAEQHGH 222
>gi|168059731|ref|XP_001781854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666661|gb|EDQ53309.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1275
Score = 341 bits (875), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 244/720 (33%), Positives = 360/720 (50%), Gaps = 143/720 (19%)
Query: 16 DTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEV 75
D E DE++EE L DS+C CD+GG+LLCC+G C+RSFHA D + +C +L LTK +
Sbjct: 421 DVEIDEASEE--LGDSLCIICDDGGELLCCDGPCMRSFHAIRDP--QHNCETLKLTKSAI 476
Query: 76 EAML-NFFCKNCEYKQHQCFACGKLGSSDKETGA--EVFPCVSATCGHFYHPHCVS-KLL 131
M+ + CKNC YK+HQC+ CG+LG SD++ G EVF C A CG FYHP CV+ ++
Sbjct: 477 AKMVGQWLCKNCVYKKHQCYVCGELGDSDEDLGVKREVFVCDVACCGKFYHPTCVANEIA 536
Query: 132 LRDDEVAADQLAKSIIAG-ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
L +D+ LA +I AG ESF CPLH+C C +GE+ + L A CRRCP +H+KCL
Sbjct: 537 LTEDDRKL--LATNIQAGVESFACPLHRCKKCGKGEDSTEESLLLARCRRCPATWHKKCL 594
Query: 191 PRKIAF-EDKLEEGIITRAWEGLLPNHRILIYCL-------------------------- 223
P I F E+ ++ RAW R +IYC
Sbjct: 595 PSGIRFAEETDDKHPEVRAWAFGDVYKRFIIYCTGNNGNNQLPICHYSPGPNEVVSKSYT 654
Query: 224 ------KHEIDDEIGTPIRDHIIFPGIEE---NKTIIDRPRKK---QSLASPSGKQKVAS 271
KH+I ++GTP R+H+ FP + E + + + + K ++ SPS + K +S
Sbjct: 655 LRSIYRKHKIMKDLGTPKRNHVKFPRVLERPGDPEVFKQLKSKVAAEANVSPSLEAKASS 714
Query: 272 T-KSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSF 330
+ K L K+ + V+ L A + EI E L + R + + D+ +KS
Sbjct: 715 SLKPILVDKSAKKKHSGVELL------AKEKRPEEIRELPL--ASPPRLSISADMLKKS- 765
Query: 331 KGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKP 390
+R+ + E ++ T ++V
Sbjct: 766 ---------------------EDAAKRVVLGVIAESAQKV-----------TDQSV---- 789
Query: 391 LRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITL 450
KL LPS+ + RR S MK + ILK +I
Sbjct: 790 ---KLKLALPSIYRE--RRGFSFMKQG----HKDSILKAARI------------------ 822
Query: 451 GKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYT 510
+E +R+ KK+ V E+AKA+C P +++I ++ LK +LAP L+G RY
Sbjct: 823 -----VLEKLRS--KKITV-----EEAKAMCNPSNINRIELEEDYLKTFLAPTLYGDRYV 870
Query: 511 SFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCL-YKNY 569
S+GRHFTK++KL+ +V +LH + D D IVDF CG N FS L+ L + GK L +KN+
Sbjct: 871 SYGRHFTKLEKLRKVVRRLHPLIWDHDTIVDFSCGDNSFSRLLHNALSDAGKTHLKFKNF 930
Query: 570 DILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLL 629
DI P K+ F FE++DW + E G L++GLNPPF A KF+ AL P+LL
Sbjct: 931 DIFPPKDTFEFERKDWFDTKATEFGSGENLVIGLNPPF----PFAEKFVAHALMMQPRLL 986
Query: 630 ILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSV-DENDKQMDQWNMTAPPLYLW 688
+LI PP +++ + Y + +D L SFY+PGS+ D++ + + Q N P ++
Sbjct: 987 VLITPPLKNKMEHR--GYNCLEDDHLLLDDNSFYVPGSLHDQHMESLQQVNAVVPHFCIY 1044
>gi|52354515|gb|AAU44578.1| hypothetical protein AT5G48090 [Arabidopsis thaliana]
Length = 272
Score = 327 bits (837), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 159/272 (58%), Positives = 200/272 (73%), Gaps = 13/272 (4%)
Query: 476 DAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVND 535
DA+A+ +P+ L Q+ K K KL++ +PFLHGMRYTSFGRHFT +KL+ IV++LHWYV +
Sbjct: 9 DARALFDPDSLLQLMKHKKKLEI--SPFLHGMRYTSFGRHFTNPEKLKEIVERLHWYVEN 66
Query: 536 GDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAP 595
GD +VDFCCG+NDFSCLMK+KL ETGK C YKN D++P KN+FNFE RDW++V+ +EL
Sbjct: 67 GDTVVDFCCGSNDFSCLMKEKLMETGKICFYKNLDLIPPKNNFNFEMRDWLSVKEEELPD 126
Query: 596 GSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQ 655
GS+LIMGLNPPFG KA LAN FI KALEF PK+LILIVP ET+R+D + YEL+WED
Sbjct: 127 GSQLIMGLNPPFGYKASLANTFIKKALEFKPKILILIVPSETKRVDAIDD-YELIWEDRN 185
Query: 656 FLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQ 715
L+G SFYLPGSVD NDK ++QWN PPLYLWSR D + HK A + GHI TQ
Sbjct: 186 LLAGMSFYLPGSVDVNDKTIEQWNNITPPLYLWSRRDLSRSHKTTAVQQGHI------TQ 239
Query: 716 MERNCY--ETHAVDHPKEEGQGDASMLIDLPL 745
M Y ++H + P +G D M+ D+ +
Sbjct: 240 MHHFTYNGDSHHTEIP--QGDDDTDMVQDMEI 269
>gi|449472641|ref|XP_004153655.1| PREDICTED: uncharacterized protein LOC101221422, partial [Cucumis
sativus]
Length = 364
Score = 323 bits (827), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 181/391 (46%), Positives = 246/391 (62%), Gaps = 34/391 (8%)
Query: 110 VFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKA 169
VF CV+ATCG+FYHP C+S+LL +++VAA L K I +GESF+CP+HKC +C GENK
Sbjct: 1 VFQCVNATCGYFYHPKCISRLLHPENKVAAGDLEKKIASGESFSCPVHKCSVCALGENKK 60
Query: 170 DSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDD 229
+LQFAVCRRCPK+YHRKCLPRKI FE + TRAWE LLPN RILIYCL HEID+
Sbjct: 61 IWELQFAVCRRCPKSYHRKCLPRKITFEGSEDGETPTRAWEKLLPN-RILIYCLDHEIDE 119
Query: 230 EIGTPIRDHIIFPGIEENKTIIDR-------PRKKQSLASPSGKQKVASTKSSLTSKAPP 282
EI TP RDHI FPG+EE++ I + R+ +++ ++ V S K S+
Sbjct: 120 EIETPARDHIKFPGLEESRLPIQKRKLPISDTRRGKTIVFRGSRENVVSKKGSMPDDL-- 177
Query: 283 QGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVD 342
QGK + K K + G+ + + S S+ K ++SR S N K SV +D
Sbjct: 178 QGKSAAKVSKSFERSSSDGKLLGKVTAKSLWSSESKNVKLGNISRNSL--NQKGESVLMD 235
Query: 343 RSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSL 402
++ KK+SL + +A T+ + +K K+D S + L
Sbjct: 236 IDKTIKVKKSSLVGK--SAMPTKRFDPSKIYKEDRSG-------------------MLLL 274
Query: 403 DEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRT 462
D +S+RRL +MK+ ASS+ +E+++K+HK+PSTHAY+ K VDK I +GK+EGSV A+R
Sbjct: 275 DANSERRLMDMMKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSVVAVRA 334
Query: 463 ALKKLDVDGSSIEDAKAVCEPEVLSQIFKWK 493
AL+KL+ +G IEDA+AVCEPEVL+ IFKWK
Sbjct: 335 ALRKLE-EGCCIEDAEAVCEPEVLNHIFKWK 364
>gi|357494761|ref|XP_003617669.1| hypothetical protein MTR_5g094150 [Medicago truncatula]
gi|355519004|gb|AET00628.1| hypothetical protein MTR_5g094150 [Medicago truncatula]
Length = 1169
Score = 322 bits (825), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 151/276 (54%), Positives = 192/276 (69%), Gaps = 10/276 (3%)
Query: 8 DMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCAS 67
D D++ +++ED S ED+LFDS C+ CDNGG+L+ C+G C+RSFHAT G ES C S
Sbjct: 204 DTNNDIIEESDED-SKVEDDLFDSFCALCDNGGNLIMCDGACMRSFHATKVDGRESLCDS 262
Query: 68 LGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCV 127
LG TK EV+ + F+CKNCEY QHQCFACG+LGSSDK+ AEVF C S TC FYHPHC+
Sbjct: 263 LGFTKKEVDDIETFYCKNCEYHQHQCFACGELGSSDKDKDAEVFKCASETCDRFYHPHCI 322
Query: 128 SKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHR 187
+ L+ ++ L K+ G++F+CP+H CC+C+ ENK D +LQFAVC RCPK+YHR
Sbjct: 323 APLVKH----VSEDLEKNNADGQTFSCPIHFCCVCEGLENKMDPELQFAVCNRCPKSYHR 378
Query: 188 KCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEEN 247
+CLP +IAF+D + + TRAWEGLLPN+RILIYCL HEI DE+GTP+RDHI FP +
Sbjct: 379 RCLPTEIAFDDI--DDVATRAWEGLLPNNRILIYCLNHEISDELGTPVRDHIKFPDTRAS 436
Query: 248 KTIIDRPRKKQSLASPSG---KQKVASTKSSLTSKA 280
I+ P KK + G K V KSS S A
Sbjct: 437 VRDINTPDKKMKPTTTEGVKFKSNVDFVKSSDKSTA 472
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 130/289 (44%), Gaps = 73/289 (25%)
Query: 226 EIDDEIGTPIRDHIIFPGIE--------ENKTIIDRPRKKQSLASPSGKQKVASTKSSLT 277
E D++ TP+RD IIFP NK ++ ++ L + G K + ++
Sbjct: 731 ETSDKLETPVRDDIIFPDTRATVLEINTSNKRMMPTTTERVELKNNVGYVKSSGKSTAHG 790
Query: 278 SKAP---PQGKFSVKALKRVPSKAGQGETME------------------ISERLLVGSDS 316
SK P P S K+ +++ S + T E +SE+ + SD
Sbjct: 791 SKVPGKLPSKTGSKKSTEKIVSGSDVSRTPESKEMSSKCVTEYKSSSKNLSEKAISCSDI 850
Query: 317 SRRAKATDVS--RKSFKGNVKSLSVQVDRSSSVDSKKTS--------------------- 353
S ++ +S ++ K N K LS +V S + SK S
Sbjct: 851 SWNPESNKISLTEENKKSNSK-LSGKVISGSHISSKPKSNERSETGLTKNKKPMPKCEVK 909
Query: 354 ---LGERLYAAFVTEGTEQTKFGKQ-DNSDRETSRTVTVKPLRKKLISELPSLDEDSKRR 409
G++L A+ + +G+EQ K+ Q D++D ++T+++ PLR S LP D DS++
Sbjct: 910 KHPTGQKLCAS-MNKGSEQIKYDNQVDDAD---NQTLSIMPLR----SLLPPPDADSEKG 961
Query: 410 LSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVE 458
L K+A SS+ +E ++K+HK+ DKAIT+ K++ SV+
Sbjct: 962 LVDSFKEARSSILLEGVIKKHKLGDVG--------DKAITMRKLDDSVD 1002
>gi|414884833|tpg|DAA60847.1| TPA: putative EDM2-like family protein [Zea mays]
Length = 429
Score = 310 bits (795), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 188/448 (41%), Positives = 255/448 (56%), Gaps = 45/448 (10%)
Query: 45 CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 104
C+G C+RSFHA +GE+S C +LG T+ EVEAM F CKNCEYKQHQCF CG L SD
Sbjct: 22 CDGPCMRSFHAKEGSGEDSCCVTLGYTEAEVEAMKLFLCKNCEYKQHQCFICGVLEPSDG 81
Query: 105 ETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQ 164
E A+VF C +ATCGHFYHP CV++ L + A +L K I G SFTCP+H C CK
Sbjct: 82 E-AAKVFLCNNATCGHFYHPKCVAQQLHPINIDEASELEKKITEGFSFTCPIHWCFHCKG 140
Query: 165 GENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLK 224
E++ LQFAVCRRCPK+YHRKCLP +I+F++ E+ II RAWE + RILIYCL
Sbjct: 141 LEDRTQEPLQFAVCRRCPKSYHRKCLPSEISFDENEEDDIIIRAWEL---SKRILIYCLD 197
Query: 225 HEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQG 284
HEID E+ TP+RDHI FP IE+ + +K SL K+ + T + K +
Sbjct: 198 HEIDSELETPVRDHIKFPKIEKPLQFV---KKGSSLLPKKKKRTYSETVLEQSPKDTVKR 254
Query: 285 KFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRS 344
K V L+R +K E VS KSF N+ VD
Sbjct: 255 KSKVHMLEREQNKQDTRE----------------------VSAKSFTQNL------VDTP 286
Query: 345 SSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDE 404
+K + Y A + F S +E + + PL S P +D
Sbjct: 287 VKKKAKLKEKIQLPYVA-------KDHFISSPKSVKEQEQELVPLPLSAIRKSSFPKVDS 339
Query: 405 DSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTAL 464
++++R+ +++ + SS+ ++++ + +PSTH Y+ + DK I LGK+E SV+A+ AL
Sbjct: 340 ETEKRVITILGNKVSSLTIKDVTRNCSLPSTHVYSGR-LTDK-IPLGKIERSVQAVEAAL 397
Query: 465 KKLDVDGSSIEDAKAVCEPEVLSQIFKW 492
KKL+ +G S+ DAKAVCEPEVL Q+ +W
Sbjct: 398 KKLE-NGGSVNDAKAVCEPEVLRQLARW 424
>gi|449467187|ref|XP_004151306.1| PREDICTED: uncharacterized protein LOC101207373, partial [Cucumis
sativus]
Length = 625
Score = 300 bits (769), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 140/204 (68%), Positives = 165/204 (80%), Gaps = 2/204 (0%)
Query: 539 IVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSR 598
IVDFCCGANDFS LMKKKLDETGK C ++N+D +P KNDFNFEKRDWMTV+PKEL GS+
Sbjct: 1 IVDFCCGANDFSVLMKKKLDETGKQCSFRNFDFIPPKNDFNFEKRDWMTVQPKELPKGSQ 60
Query: 599 LIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLS 658
LIMGLNPPFGVKA LANKF++KALEFNPKLLILIVPPETERLD K + Y+LVWED +FLS
Sbjct: 61 LIMGLNPPFGVKAALANKFVDKALEFNPKLLILIVPPETERLDEKMTPYDLVWEDTEFLS 120
Query: 659 GKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHIS-RPQSRTQME 717
GKSFYLPGSVD DKQMDQWN+ P LYLWSR D+ H A+A++HGH+ R Q+ ++ E
Sbjct: 121 GKSFYLPGSVDAKDKQMDQWNVRPPVLYLWSRRDWTHKHTAIAQEHGHLWLRKQAHSEKE 180
Query: 718 RNCYETHAVDHPKEEGQGDASMLI 741
N +T V +E +G +S L+
Sbjct: 181 -NASDTLRVRQLEESEKGKSSDLL 203
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 70/249 (28%), Positives = 110/249 (44%), Gaps = 44/249 (17%)
Query: 1001 AKSSLEDH-SSKSIDIPSQT---GFGSDYQHHEP-----TRSSSHIGTTYYGTQAGIPND 1051
A ++ DH SSKS+++P G G HH+ T ++++G Y Q + +D
Sbjct: 382 ASPNVSDHTSSKSVEMPPYADVDGIG----HHQQSGSTMTEPNTNLGAPYDAAQISLTDD 437
Query: 1052 MG-SYGMSSL------NNGLSHGAN---------LDERYTGYVRNTDSLGYRP-SMSTD- 1093
+ Y +++ N G S+ A+ ++ER + L Y+P S D
Sbjct: 438 ITRKYNLNAEESYPRGNTGWSNNASPMYDIGSRHVEERIVDQMGGRVGLNYKPYSTGVDT 497
Query: 1094 --RELTMWPLARIYGQDFP---------APTPGYGQMGSVPSNLYGNLGSSAEASYRMST 1142
R+ + R YG P P Y ++G++P++ YG+LG+ +E S M+T
Sbjct: 498 YMRDSEIRSHIRHYGHPDTDNLRSNYQVGPDPRYSRIGAIPAS-YGHLGTFSEPSRWMNT 556
Query: 1143 SAMDRYAPRLHQLNNTRMNTFRSEPFMPSRFGFYDSRAPQPGFFA-DMDFGPGFHPPFPQ 1201
SA RY PRL +LN+TR+ M F GF F PG P+
Sbjct: 557 SATQRYVPRLDELNHTRLGGMGVARQMNGGSTFDPRVHTSSGFRGVSQGFAPGPQYPYSN 616
Query: 1202 QGSGGWLDD 1210
Q S GWL++
Sbjct: 617 QNSAGWLNE 625
>gi|115475968|ref|NP_001061580.1| Os08g0337300 [Oryza sativa Japonica Group]
gi|38424060|dbj|BAD01750.1| unknown protein [Oryza sativa Japonica Group]
gi|38637134|dbj|BAD03388.1| unknown protein [Oryza sativa Japonica Group]
gi|113623549|dbj|BAF23494.1| Os08g0337300 [Oryza sativa Japonica Group]
Length = 585
Score = 296 bits (758), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 147/268 (54%), Positives = 187/268 (69%), Gaps = 9/268 (3%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
LFDSVC+ CDNGG+LLCCEG C+RSFHA I GE+S+CA+LG TK EV+A+ NF CKNC+
Sbjct: 244 LFDSVCAICDNGGELLCCEGPCMRSFHAKIRDGEDSYCATLGYTKAEVKALKNFVCKNCD 303
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
+KQHQCF CG+L SD A+VF C +ATCGHFYHP CV++LL + A ++ K I+
Sbjct: 304 HKQHQCFVCGELEPSDG-PNAKVFLCNNATCGHFYHPRCVAQLLHPNSRNEASEMEKKIM 362
Query: 148 AGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITR 207
AG SFTCP+H C CK E++ LQFAVCRRCP++YHRKCLPR+I+FED +GIITR
Sbjct: 363 AGFSFTCPVHWCFHCKGLEDRTQEPLQFAVCRRCPRSYHRKCLPREISFEDINTQGIITR 422
Query: 208 AWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEEN-----KTIIDRPRKKQSLAS 262
AWE + RILIYCL HEID +IGTP RDHI FP +E++ K + + KK+ +
Sbjct: 423 AWEL---SKRILIYCLDHEIDLDIGTPPRDHIKFPHVEKSAYSAKKKVKELAEKKRRICD 479
Query: 263 PSGKQKVASTKSSLTSKAPPQGKFSVKA 290
S + ++ L K +G S KA
Sbjct: 480 DSYVSEPLQKRAKLNEKFNAKGDKSKKA 507
>gi|357153232|ref|XP_003576383.1| PREDICTED: uncharacterized protein LOC100846531 [Brachypodium
distachyon]
Length = 683
Score = 269 bits (687), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 133/238 (55%), Positives = 174/238 (73%), Gaps = 5/238 (2%)
Query: 7 DDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCA 66
+D+E+ + D E + + +D+LFDS C+ CDNGGDLL C+G C+RSFHA I GE+S+C
Sbjct: 202 EDVEDIVADDNVESDEDGDDDLFDSTCAICDNGGDLLGCDGPCMRSFHAKIGTGEDSYCQ 261
Query: 67 SLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHC 126
+LG T+ EVEAM F CKNCEYKQHQCF CG L SD T A+VF C +ATCG+FYHP C
Sbjct: 262 TLGFTEAEVEAMKTFLCKNCEYKQHQCFICGVLEPSDGPT-AKVFLCNNATCGYFYHPKC 320
Query: 127 VSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYH 186
V++ L ++++ A + K I G SFTC +H C CK E++ + LQFAVCRRCPK+YH
Sbjct: 321 VAQQLHPNNKIEALEKEKKIAGGSSFTCSIHWCFCCKGLEDRTEEHLQFAVCRRCPKSYH 380
Query: 187 RKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGI 244
RKCLP +I FED +E I+TRAW+ + RILIYC++HEID +I TP+R+HI FPG+
Sbjct: 381 RKCLPSEIPFEDS-DEDIVTRAWDL---SQRILIYCMEHEIDLDIETPVRNHIKFPGL 434
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 51/81 (62%), Gaps = 3/81 (3%)
Query: 412 SLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDG 471
+L++ SS+ + +I ++ +PSTH Y+ + DK I GK+E SV+A+R AL L V
Sbjct: 601 ALVEKEVSSLTLNDISRKCLMPSTHVYSGRQT-DKIIATGKLERSVQAVRQALHLLAV-- 657
Query: 472 SSIEDAKAVCEPEVLSQIFKW 492
+ AKA CEP+VL Q+ +W
Sbjct: 658 GDVNTAKATCEPQVLKQLARW 678
>gi|357160452|ref|XP_003578769.1| PREDICTED: PHD finger-containing protein DDB_G0268158-like
[Brachypodium distachyon]
Length = 298
Score = 261 bits (668), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 123/230 (53%), Positives = 161/230 (70%), Gaps = 10/230 (4%)
Query: 18 EEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEA 77
EED SNE +D+ C+ CD+GGD +CCEG CLRSFH T GE+S C +LGLT++ +
Sbjct: 27 EEDRSNE----WDNACAICDDGGDFICCEGGCLRSFHPTKKYGEDSMCTTLGLTEEWWQT 82
Query: 78 MLN-----FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 132
+ + + CKNCEYKQHQCFACG LGSS +G+EV C CG+FYHP CV++L+
Sbjct: 83 LQSNEQEKYICKNCEYKQHQCFACGLLGSSGLTSGSEVIQCKDKMCGYFYHPKCVAELVH 142
Query: 133 RDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPR 192
D + A K + G +F CPLH+C +CK+ EN+ D D+Q AVCRRCP AYHRKCLP
Sbjct: 143 PDSKAQAIFFEKRVALGLTFHCPLHRCSLCKEAENRDDKDMQLAVCRRCPTAYHRKCLPS 202
Query: 193 KIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 242
I+FE +EG RAW+ +LP+ +ILIYC+KH+ID ++ TP RDHI+FP
Sbjct: 203 DISFEKDGKEGTQQRAWDNVLPD-QILIYCMKHKIDKDLRTPKRDHIVFP 251
>gi|218186609|gb|EEC69036.1| hypothetical protein OsI_37847 [Oryza sativa Indica Group]
Length = 345
Score = 261 bits (666), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 137/301 (45%), Positives = 177/301 (58%), Gaps = 29/301 (9%)
Query: 29 FDSVCSFCDNGGDLLCCEGRCLRSFHATID-AGEESHCASLGLTKDEVEAMLN------- 80
++ VCS CD+GGD LCCEGRCLRSFHAT E + C +LGLT+++ +
Sbjct: 42 WEDVCSICDDGGDFLCCEGRCLRSFHATEKHITEINQCTTLGLTEEQWKIFRQNDEKSEP 101
Query: 81 -FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAA 139
+ CKNC+Y QHQCF+CG LGSSD +GAEVF C CGHFYHP C+++LL D +
Sbjct: 102 LYICKNCKYNQHQCFSCGLLGSSDLSSGAEVFQCEDEKCGHFYHPKCLARLLYPDSSMQP 161
Query: 140 DQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDK 199
+ + G F CP+HKC +CK GENK D + QFAVCRRCP YHRKCLP I FED
Sbjct: 162 LNFEEEVARGLKFLCPVHKCHVCKGGENKNDMENQFAVCRRCPTVYHRKCLPSDIVFEDD 221
Query: 200 LEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP--------------GIE 245
GI+ RAW+ +L +ILIYC+KHEID E+GTP R+HIIFP G E
Sbjct: 222 ESNGIVQRAWDDVL-RDQILIYCMKHEIDPELGTPARNHIIFPDCKNLLTRGPSKPKGQE 280
Query: 246 ENKTIIDRPRKKQSL---ASPSGKQKVASTKSSLTSKAPPQG--KFSVKALKRVPSKAGQ 300
+ ++D P + S + PS + ++ T + G F+ KAL +P
Sbjct: 281 DAPAVLDIPEEDMSPDHSSEPSQPPAQPAAETDQTDQDLSNGFNSFAPKALFPLPYPGSC 340
Query: 301 G 301
G
Sbjct: 341 G 341
>gi|224061077|ref|XP_002300345.1| predicted protein [Populus trichocarpa]
gi|222847603|gb|EEE85150.1| predicted protein [Populus trichocarpa]
Length = 582
Score = 260 bits (665), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 152/297 (51%), Positives = 193/297 (64%), Gaps = 8/297 (2%)
Query: 574 AKNDFNFEKRDWMTVEPKELAP-GSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILI 632
+NDFNFEKRDWMTV P EL GS+LIMGLNPPFGVKA LANKFI+KAL+F PKLLILI
Sbjct: 100 TQNDFNFEKRDWMTVCPDELPKKGSQLIMGLNPPFGVKAALANKFIDKALQFKPKLLILI 159
Query: 633 VPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHD 692
VPPETERLD+K+ Y LVWE+D FLSGKSFYLPGSV+ENDKQMDQWN+TAPPLYLWSR +
Sbjct: 160 VPPETERLDKKK-PYNLVWENDHFLSGKSFYLPGSVNENDKQMDQWNVTAPPLYLWSRQE 218
Query: 693 YAAHHKALAEKHGHISRPQSRTQMERNCYETHAVDHPKEE-GQGDASMLID-LPLQINVT 750
++A HKA+A+KHGH + Q + +++N +ET D ++ ASML + +PLQ
Sbjct: 219 WSAKHKAIAQKHGHPFKQQEISNLDKNHFETKTPDPVNDQYNNAGASMLPNYIPLQSKEP 278
Query: 751 KELRNEAREDDKAGFPD-NATEGGGESSHGHGDNQSGKTSRKRKRDRKKHGSGMRENSPL 809
+E DD G N ++ + SH + S +TSRK+++ K G E S
Sbjct: 279 EESNCGIVNDDHKGRSQCNNSDRESQDSHCPRKSHSDETSRKKRQGEKMVERGTGEKSLE 338
Query: 810 DGQNRGRHLASGIHGMSKHSPANIANVS--PLLEGHSSKSIDMPSHVGSGDNDCQHF 864
QN G+ + H P+ N+ LL+G SS+S++ S G G N QH
Sbjct: 339 GRQNGGKKPSPSDSDKGVHRPSPPPNIDGRSLLDG-SSRSVEKQSQAGIGKNCYQHL 394
Score = 126 bits (316), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 94/244 (38%), Positives = 128/244 (52%), Gaps = 35/244 (14%)
Query: 988 DKGVPRCSPAKVIAKSSLEDHSSKSIDIPSQTGFGSD-YQHHEPTRSSSHI---GTTYYG 1043
DKGV R SP I SL D SS+S++ SQ G G + YQH +P+ S S+ T Y G
Sbjct: 353 DKGVHRPSPPPNIDGRSLLDGSSRSVEKQSQAGIGKNCYQHLDPSFSDSYSQQHATPYGG 412
Query: 1044 TQAGIPNDMGS------YGMSSLN-NGLSHGANLDERYTGYVRNTDSLGYRPSMSTDREL 1096
+ A +DM + SLN +GLS G N++E+ T + T E
Sbjct: 413 SWASNHDDMNRRHCTNIHESYSLNIHGLSSGGNMEEQSTRCMNGT-------------EF 459
Query: 1097 TMWPLARIYG--------QDFPAPT-PGYGQMGSVPSNLYGNLGSSAEASYRMSTSAMDR 1147
P +YG ++P+ GYG M P+ YG++GS++E Y M+ SAM R
Sbjct: 460 VRQPQVHLYGLQGADSARWNYPSGRYLGYGHMEPAPAIPYGHMGSASEPPYMMNMSAMQR 519
Query: 1148 YAPRLHQLNNTRMNTFRSEPFMPSRFGFYDSRAPQPGF-FADMDFGPGFHPPFPQQGSGG 1206
YAPRL +LN+TRM++ EP M +R G YD RAP G+ F M F PG P+P S G
Sbjct: 520 YAPRLDELNHTRMSSLGPEPSMLNRNGSYDPRAPGAGYRFDSMGFAPGPQHPYPHH-SAG 578
Query: 1207 WLDD 1210
WL++
Sbjct: 579 WLNE 582
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 27/48 (56%), Gaps = 16/48 (33%)
Query: 551 CLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSR 598
C+MKKKL+E GK C YKNYD+ ++PK+ AP +R
Sbjct: 12 CIMKKKLEEMGKKCSYKNYDV----------------IQPKKYAPDNR 43
>gi|115487874|ref|NP_001066424.1| Os12g0224500 [Oryza sativa Japonica Group]
gi|108862352|gb|ABA96791.2| hydroxyproline-rich glycoprotein family protein, putative,
expressed [Oryza sativa Japonica Group]
gi|113648931|dbj|BAF29443.1| Os12g0224500 [Oryza sativa Japonica Group]
gi|215693390|dbj|BAG88772.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 345
Score = 258 bits (658), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 122/223 (54%), Positives = 151/223 (67%), Gaps = 10/223 (4%)
Query: 29 FDSVCSFCDNGGDLLCCEGRCLRSFHATID-AGEESHCASLGLTKDEVEAMLN------- 80
++ VCS CD+GGD LCCEGRCLRSFHAT E + C +LGLT+++ +
Sbjct: 42 WEDVCSICDDGGDFLCCEGRCLRSFHATEKHITEINQCTTLGLTEEQWKIFRQNDEKSEP 101
Query: 81 -FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAA 139
+ CKNC+Y QHQCF+CG LGSSD +GAEVF C CGHFYHP C+++LL D +
Sbjct: 102 LYICKNCKYNQHQCFSCGLLGSSDLSSGAEVFQCEDEKCGHFYHPKCLARLLYPDSSMQP 161
Query: 140 DQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDK 199
+ + G F CP+HKC +CK GENK D + QFAVCRRCP YHRKCLP I FED
Sbjct: 162 LNFEEEVARGLKFLCPVHKCHVCKGGENKNDMENQFAVCRRCPTVYHRKCLPSDIVFEDD 221
Query: 200 LEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 242
GI+ RAW+ +L +ILIYC+KHEID E+GTP R+HIIFP
Sbjct: 222 ESNGIVQRAWDDVL-RDQILIYCMKHEIDPELGTPARNHIIFP 263
>gi|326503094|dbj|BAJ99172.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326526707|dbj|BAK00742.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 338
Score = 256 bits (654), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 118/217 (54%), Positives = 152/217 (70%), Gaps = 7/217 (3%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN------FFCKN 85
VC+FCD+GG+L+CC+G CLRSFH T + GE S C SLGL + + E +FC N
Sbjct: 53 VCAFCDDGGELICCDGGCLRSFHPTKEHGERSMCTSLGLNEAKWEKHKALGEKGLYFCNN 112
Query: 86 CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS 145
C++KQHQCFACG LGSS+ EVF C CGHFYHP CV+++L D++ A +
Sbjct: 113 CKHKQHQCFACGLLGSSNLTAEPEVFKCKHRKCGHFYHPGCVAEILYPDNKAKATCFEQC 172
Query: 146 IIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGII 205
+ G F+CP+HKC CK+ ENK D ++QFAVCRRCP AYHRKCLPR I+ + EE +
Sbjct: 173 VAGGLEFSCPVHKCKRCKEAENKHDKEMQFAVCRRCPTAYHRKCLPRDISLTEDEEEAMP 232
Query: 206 TRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 242
RAW+G+LP+ +ILIYC KH+I E+GTP RDHI+FP
Sbjct: 233 QRAWDGILPD-QILIYCRKHDIQKELGTPKRDHIVFP 268
>gi|302782778|ref|XP_002973162.1| hypothetical protein SELMODRAFT_99176 [Selaginella moellendorffii]
gi|300158915|gb|EFJ25536.1| hypothetical protein SELMODRAFT_99176 [Selaginella moellendorffii]
Length = 215
Score = 256 bits (653), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 115/217 (52%), Positives = 154/217 (70%), Gaps = 2/217 (0%)
Query: 497 KVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKK 556
+VYL+P L GMRY+SFGRHFT+ KLQ+IVD+L +YVN GDM+VD CCG+N+FS LM K
Sbjct: 1 RVYLSPMLFGMRYSSFGRHFTRTYKLQSIVDQLSYYVNPGDMVVDLCCGSNEFSRLMSDK 60
Query: 557 LDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANK 616
L K+C +KN+D + + F+FE++D+M ++P+EL G +LIMGLNPPFG +A LA+K
Sbjct: 61 LTAMKKDCRFKNFDFIQPSDTFHFERKDYMEIQPEELCSGDKLIMGLNPPFGFRAQLASK 120
Query: 617 FINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMD 676
F+ ++F PK+++LI P ET R D + YEL WED L G +FYLPGSVD K +
Sbjct: 121 FVEHCVKFQPKIIVLIAPEETRRPDGR--YYELTWEDKDLLRGDAFYLPGSVDAEGKTIK 178
Query: 677 QWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSR 713
Q N P LYLWSR D+A H+ +A +HGH+ R R
Sbjct: 179 QENNVPPVLYLWSRVDFAEAHRKIARRHGHLRRSSGR 215
>gi|357157830|ref|XP_003577927.1| PREDICTED: uncharacterized protein LOC100829433 [Brachypodium
distachyon]
Length = 632
Score = 254 bits (649), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 115/171 (67%), Positives = 137/171 (80%)
Query: 538 MIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGS 597
MIVDFCCGANDFS LMK+KLD K C +KNYD++ +N F FE+RDWMTV+ EL GS
Sbjct: 1 MIVDFCCGANDFSRLMKEKLDLVQKKCHFKNYDLIQPQNTFCFERRDWMTVQRNELPRGS 60
Query: 598 RLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFL 657
RL+MGLNPPFGVKA LANKFI+KAL FNPKL+ILIVP ET+RLD+K++ Y+LVWED L
Sbjct: 61 RLVMGLNPPFGVKAALANKFIDKALSFNPKLIILIVPKETKRLDQKKTPYDLVWEDGDCL 120
Query: 658 SGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHIS 708
+GKSFYLPGSVD N+K + WN +APPLYLWS D+ HK +AE+H H S
Sbjct: 121 AGKSFYLPGSVDVNEKIVQGWNASAPPLYLWSHPDWTKKHKKVAEEHNHTS 171
>gi|357154941|ref|XP_003576954.1| PREDICTED: uncharacterized protein LOC100825038 [Brachypodium
distachyon]
Length = 332
Score = 250 bits (638), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 119/228 (52%), Positives = 156/228 (68%), Gaps = 8/228 (3%)
Query: 22 SNEEDELFDS--VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAM- 78
+ ED L D C+ CD+GG+L+ CEG CLRSFH G +S C SLGLT++ + +
Sbjct: 59 TRREDGLSDEWDACAICDDGGELIWCEGGCLRSFHPIKICGLDSMCTSLGLTEEHWQTLH 118
Query: 79 ----LNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRD 134
+ CKNCE KQHQCFACG LGSSD +G+EVF C CGHFYHP CV++LL D
Sbjct: 119 ANKQEKYICKNCENKQHQCFACGLLGSSDLISGSEVFKCKEKNCGHFYHPKCVAELLYPD 178
Query: 135 DEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKI 194
+ A + + +G F C +H+C +CK+ EN+ D+D+Q AVCRRCP AYHRKCLP I
Sbjct: 179 SKGLAIYFEEHVASGLEFHCLMHRCSLCKEAENRDDTDMQLAVCRRCPTAYHRKCLPSDI 238
Query: 195 AFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 242
+F + +EG RAW+ +LP+ +ILIYC+KHEID ++GTP RDHI+FP
Sbjct: 239 SFIEDNDEGTQQRAWDNILPD-QILIYCMKHEIDKDLGTPKRDHIVFP 285
>gi|242044182|ref|XP_002459962.1| hypothetical protein SORBIDRAFT_02g019320 [Sorghum bicolor]
gi|241923339|gb|EER96483.1| hypothetical protein SORBIDRAFT_02g019320 [Sorghum bicolor]
Length = 919
Score = 249 bits (637), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 111/169 (65%), Positives = 136/169 (80%)
Query: 538 MIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGS 597
+IVDFCCGANDFS LMK+KLD+ K C +KNYD++ ++ F FEKRDWMTV+P EL GS
Sbjct: 301 LIVDFCCGANDFSRLMKEKLDKVQKKCNFKNYDLIQPQHRFCFEKRDWMTVQPNELPRGS 360
Query: 598 RLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFL 657
+LIMGLNPPFGVKA LANKFI+KAL F PKL+ILIVP ET+RLD+K + Y+L+WED + L
Sbjct: 361 QLIMGLNPPFGVKASLANKFIDKALTFKPKLIILIVPKETKRLDQKRTPYDLIWEDSEIL 420
Query: 658 SGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGH 706
+GKSFYLPGSVD NDK ++ WN +APPLYLWS + H +AE+ GH
Sbjct: 421 AGKSFYLPGSVDLNDKTVEGWNASAPPLYLWSHPHWTKKHMQVAEERGH 469
Score = 133 bits (334), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 68/157 (43%), Positives = 108/157 (68%), Gaps = 6/157 (3%)
Query: 372 FGKQDNSDRETSRTVTVKPL---RKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILK 428
FG S +E + PL RK +S P++D ++++R+ +++ + SS+ ++++ +
Sbjct: 54 FGSSPKSVKEEEQESPSLPLSATRKIPLSLFPTVDSETEKRVITILGNKVSSLTIKDVTR 113
Query: 429 RHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQ 488
+PSTH Y+ + DK I GK+E SV+A+ ALKKL+ +G S+ DAKAVCEP+VL Q
Sbjct: 114 NCSLPSTHVYSGR-LTDK-IPQGKLERSVQAVEAALKKLE-NGGSVNDAKAVCEPDVLRQ 170
Query: 489 IFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAI 525
+ +W +KL+VY++PF+HG RY+SFGRHFTKV+KL +
Sbjct: 171 LARWHSKLRVYISPFIHGTRYSSFGRHFTKVEKLVEV 207
>gi|302789842|ref|XP_002976689.1| hypothetical protein SELMODRAFT_175739 [Selaginella moellendorffii]
gi|300155727|gb|EFJ22358.1| hypothetical protein SELMODRAFT_175739 [Selaginella moellendorffii]
Length = 209
Score = 244 bits (622), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 110/210 (52%), Positives = 147/210 (70%), Gaps = 2/210 (0%)
Query: 504 LHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKN 563
L GMRY+SFGRHFT+ KLQ+IVD+L +YVN GDM+VD CCG+N+FS LM KL K+
Sbjct: 2 LFGMRYSSFGRHFTRTYKLQSIVDQLSYYVNPGDMVVDLCCGSNEFSRLMSDKLTAMKKD 61
Query: 564 CLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALE 623
C +KN+D + + F+FE++D+M ++P+EL +LIMGLNPPFG +A LA+KF+ ++
Sbjct: 62 CRFKNFDFIQPSDTFHFERKDYMEIQPEELCSSDKLIMGLNPPFGFRAQLASKFVEHCVK 121
Query: 624 FNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAP 683
F PK+++LI P ET R D + YEL WED L G +FYLPGSVD K + Q N P
Sbjct: 122 FQPKIIVLIAPEETRRPDGR--YYELTWEDKDLLRGDAFYLPGSVDAEGKTIKQENNVPP 179
Query: 684 PLYLWSRHDYAAHHKALAEKHGHISRPQSR 713
LYLWSR D+A H+ +A +HGH+ R R
Sbjct: 180 VLYLWSRVDFAEAHRKIARRHGHLRRSSGR 209
>gi|357154938|ref|XP_003576953.1| PREDICTED: uncharacterized protein LOC100824733 [Brachypodium
distachyon]
Length = 315
Score = 243 bits (621), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 112/233 (48%), Positives = 156/233 (66%), Gaps = 6/233 (2%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN----FFCKNCE 87
VC+ CD GG L+CC+G CLRSFH T GEES C +LGLT+++ + + + + CKNC+
Sbjct: 36 VCAICDEGGSLICCDGGCLRSFHPTRKVGEESKCTTLGLTEEQWQILESNDEKYICKNCK 95
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
KQHQCFACG LGSSD +G EVF C C FYHP CV+++L D + A + +
Sbjct: 96 SKQHQCFACGLLGSSDLTSGTEVFQCKDKQCARFYHPKCVAEVLYPDSKSRALRFGNDVA 155
Query: 148 AGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITR 207
+G F CP+H+C +CK+ N+ D D++ AVC RCP AYHR CLPR+I+F + E+GI R
Sbjct: 156 SGLEFHCPMHRCSLCKEAGNRDDKDMRLAVCGRCPTAYHRTCLPREISFRENKEKGIQKR 215
Query: 208 AWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSL 260
AW+ +LP+ IYC+KH+++ ++ TP RDHI+FP N RPR +++L
Sbjct: 216 AWDDVLPDQD-FIYCMKHKMEKDLRTPRRDHIVFPDT-RNPCAAKRPRGQETL 266
>gi|414884838|tpg|DAA60852.1| TPA: hypothetical protein ZEAMMB73_000278 [Zea mays]
Length = 589
Score = 222 bits (565), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 125/279 (44%), Positives = 167/279 (59%), Gaps = 34/279 (12%)
Query: 553 MKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAG 612
MK+KLD+ K C +KNYD++ +N F FEKRDWMTV+P EL GS+LIMGLNPPFGVKA
Sbjct: 1 MKEKLDKVQKKCHFKNYDLIQPQNRFCFEKRDWMTVQPNELPHGSQLIMGLNPPFGVKAS 60
Query: 613 LANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDEND 672
LANKFI+KAL F PKL++LIVP ET+RLD++ + Y+L+WED + LSGKSFYLPGSVD ND
Sbjct: 61 LANKFIDKALTFKPKLIVLIVPKETKRLDQRRTPYDLIWEDSECLSGKSFYLPGSVDLND 120
Query: 673 KQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERNCYETHAVDHPKEE 732
K ++ WN +APPLYLWS + H +AE+HGH + +E E++ D
Sbjct: 121 KIVEGWNASAPPLYLWSHPGWTKKHMQVAEEHGHTCMGKIACHVE----ESNLSDDLSVR 176
Query: 733 GQGDASMLI------DLPLQINVTKEL-----RNEAREDD-------------KAGFPDN 768
Q DA+ + + P ++ L R +AR ++ KA PDN
Sbjct: 177 KQSDATNKLNSRSGKETPFNDRMSNLLDDGSARKQARSEEEKRTLGKIEHHVKKANIPDN 236
Query: 769 ---ATEGGGES---SHGHGDNQSGKTSRKRKRDRKKHGS 801
E S SH D + G+ +R+R +R G+
Sbjct: 237 LPVKKEPEASSKVMSHSGKDRKIGRYNRRRHDNRSSKGT 275
>gi|414592041|tpg|DAA42612.1| TPA: putative EDM2-like family protein [Zea mays]
Length = 1006
Score = 219 bits (557), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 107/219 (48%), Positives = 138/219 (63%), Gaps = 7/219 (3%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
+ D +C+ CD+GG LL C+G C RSFH T G ES C SL T EV+ + + C NC+
Sbjct: 236 VTDQICALCDDGGHLLSCDGPCKRSFHPTKKDGRESKCESLHYTSAEVKRIGTYLCANCK 295
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
KQHQCF CG+L S A+VF C A+CG+FYHP C+++LL D +L + I+
Sbjct: 296 NKQHQCFRCGELEPSHGPN-AKVFQCNQASCGYFYHPKCIAQLL--DPNATDGELERRIM 352
Query: 148 AGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITR 207
+G SF CP+H C C ENKA LQ AVCRRCP+AYHR+CLPR ++F K ++G R
Sbjct: 353 SGMSFPCPIHWCFKCGHMENKAQRALQLAVCRRCPRAYHRECLPRDLSFGTKDKDG-NQR 411
Query: 208 AWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEE 246
AW+ + I IYCL HEID + GT R+HI FP E
Sbjct: 412 AWK---LSDTIFIYCLDHEIDKDTGTTSRNHIKFPATPE 447
>gi|414592040|tpg|DAA42611.1| TPA: putative EDM2-like family protein [Zea mays]
Length = 1010
Score = 219 bits (557), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 107/219 (48%), Positives = 138/219 (63%), Gaps = 7/219 (3%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
+ D +C+ CD+GG LL C+G C RSFH T G ES C SL T EV+ + + C NC+
Sbjct: 240 VTDQICALCDDGGHLLSCDGPCKRSFHPTKKDGRESKCESLHYTSAEVKRIGTYLCANCK 299
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
KQHQCF CG+L S A+VF C A+CG+FYHP C+++LL D +L + I+
Sbjct: 300 NKQHQCFRCGELEPSHGPN-AKVFQCNQASCGYFYHPKCIAQLL--DPNATDGELERRIM 356
Query: 148 AGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITR 207
+G SF CP+H C C ENKA LQ AVCRRCP+AYHR+CLPR ++F K ++G R
Sbjct: 357 SGMSFPCPIHWCFKCGHMENKAQRALQLAVCRRCPRAYHRECLPRDLSFGTKDKDG-NQR 415
Query: 208 AWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEE 246
AW+ + I IYCL HEID + GT R+HI FP E
Sbjct: 416 AWK---LSDTIFIYCLDHEIDKDTGTTSRNHIKFPATPE 451
>gi|414588818|tpg|DAA39389.1| TPA: hypothetical protein ZEAMMB73_973048 [Zea mays]
Length = 591
Score = 218 bits (554), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 97/155 (62%), Positives = 122/155 (78%)
Query: 553 MKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAG 612
MK+KLD+ K C +KNYD++ ++ F FEKRDWMTV+P EL GS+LIMGLNPPFGVKA
Sbjct: 1 MKEKLDKVQKKCHFKNYDLIQPQHHFCFEKRDWMTVQPNELPRGSQLIMGLNPPFGVKAS 60
Query: 613 LANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDEND 672
LANKFI+KAL F PKL++LIVP ET+RLD+K+ Y+L+WED + L+GKSFYLPGSVD ND
Sbjct: 61 LANKFIDKALTFKPKLIVLIVPKETKRLDKKKIPYDLIWEDSECLAGKSFYLPGSVDLND 120
Query: 673 KQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHI 707
++ WN +APPLYLWS + H +AE+HGH
Sbjct: 121 NTVEGWNASAPPLYLWSHPHWTKKHMQVAEEHGHT 155
>gi|356524309|ref|XP_003530772.1| PREDICTED: uncharacterized protein LOC100811715 [Glycine max]
Length = 407
Score = 218 bits (554), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 102/174 (58%), Positives = 126/174 (72%), Gaps = 4/174 (2%)
Query: 3 GFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEE 62
GFI+DD++ M+ +++S+ EDELFDSVC+ CDNGG LLCC+G+C+RSFHA + GEE
Sbjct: 202 GFIIDDIDNGMI----DEDSDGEDELFDSVCAICDNGGQLLCCDGKCMRSFHANEEDGEE 257
Query: 63 SHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 122
S CASLG ++ EV+ + NF+CKNCEY QHQCFACG LG SDK +GAEVF C SA CG FY
Sbjct: 258 STCASLGFSRKEVDEIQNFYCKNCEYNQHQCFACGTLGCSDKFSGAEVFKCASAICGFFY 317
Query: 123 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFA 176
HPHCV+KLL E +L + I G FTCP H CC CK+ E+K D FA
Sbjct: 318 HPHCVAKLLHGVVEDDLKELERKIAEGGPFTCPTHYCCECKEMEDKKKHDFHFA 371
>gi|357140293|ref|XP_003571704.1| PREDICTED: uncharacterized protein LOC100832660 [Brachypodium
distachyon]
Length = 1028
Score = 210 bits (535), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 111/276 (40%), Positives = 158/276 (57%), Gaps = 11/276 (3%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
+C+ CDNGG LL CEGRC R+FH G ES+C +LG + +++ + ++ CKNCE++QH
Sbjct: 263 LCAMCDNGGKLLSCEGRCKRAFHPREKDGRESNCDTLGFSSAQLQEIGHYLCKNCEFQQH 322
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
QCF C L SD A+VF C A CGHFYHP C++KLL D A +L K I AG +
Sbjct: 323 QCFKCEDLEPSDGPN-AKVFQCYKACCGHFYHPSCIAKLLEPHDNDGACELEKRIAAGMA 381
Query: 152 FTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEG 211
FTCP H C C + E+ + L AVCRRCP + I+FED EE ++
Sbjct: 382 FTCPAHWCSKCGKMEDHTEIPLWLAVCRRCPIS--------NISFED-FEEKDGRKSIRA 432
Query: 212 LLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPS-GKQKVA 270
+ RI+IYC +HEI+D+IGTP RDHI P I+++ TI L+S + + +A
Sbjct: 433 WHLDDRIIIYCRQHEIEDKIGTPRRDHIKLPSIKDSGTIGTHCMDHMKLSSTTESTRDLA 492
Query: 271 STKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEI 306
K+ +T K + +++ +K Q + +I
Sbjct: 493 KKKAQVTGKRKMNTDQGLTGTEKLSNKISQEKAGQI 528
>gi|222616830|gb|EEE52962.1| hypothetical protein OsJ_35606 [Oryza sativa Japonica Group]
Length = 312
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 102/215 (47%), Positives = 130/215 (60%), Gaps = 27/215 (12%)
Query: 29 FDSVCSFCDNGGDLLCCEGRCLRSFHATID-AGEESHCASLGLTKDEVEAMLNFFCKNCE 87
++ VCS CD+GGD LCCEGRCLRSFHAT E + C +LGLT+++ +
Sbjct: 42 WEDVCSICDDGGDFLCCEGRCLRSFHATEKHITEINQCTTLGLTEEQWKIF--------- 92
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
++ +VF C CGHFYHP C+++LL D + + +
Sbjct: 93 ----------------RQNDEKVFQCEDEKCGHFYHPKCLARLLYPDSSMQPLNFEEEVA 136
Query: 148 AGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITR 207
G F CP+HKC +CK GENK D + QFAVCRRCP YHRKCLP I FED GI+ R
Sbjct: 137 RGLKFLCPVHKCHVCKGGENKNDMENQFAVCRRCPTVYHRKCLPSDIVFEDDESNGIVQR 196
Query: 208 AWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 242
AW+ +L +ILIYC+KHEID E+GTP R+HIIFP
Sbjct: 197 AWDDVL-RDQILIYCMKHEIDPELGTPARNHIIFP 230
>gi|242085260|ref|XP_002443055.1| hypothetical protein SORBIDRAFT_08g007180 [Sorghum bicolor]
gi|241943748|gb|EES16893.1| hypothetical protein SORBIDRAFT_08g007180 [Sorghum bicolor]
Length = 310
Score = 197 bits (500), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 100/225 (44%), Positives = 131/225 (58%), Gaps = 36/225 (16%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCAS-LGLTKDEVEAML----NFFCK 84
D +C+ CD+GGD++CC+G C RSFH E S C L L+ ++ + +L +F CK
Sbjct: 34 DDLCAICDDGGDVICCDGVCQRSFHLADGNSERSRCREILRLSAEQAKMILAADKDFICK 93
Query: 85 NCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAK 144
NC+Y++HQCFACGKLGSSD + A+VF C CGHFYHP CV+KLL D E A
Sbjct: 94 NCKYQEHQCFACGKLGSSDLSSEAKVFQCEVDDCGHFYHPKCVAKLLYPDSEEEATLFEV 153
Query: 145 SIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGI 204
+ A E FTCP+H+C +C+ GENK D ++QFAVCRRCP YHR CLPR
Sbjct: 154 HVAAREKFTCPIHECIVCRGGENKNDRNMQFAVCRRCPTTYHRMCLPR------------ 201
Query: 205 ITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKT 249
KH I ++ TP DHIIFP +++ +
Sbjct: 202 -------------------KHPIVKKLKTPKWDHIIFPDVKKIRV 227
>gi|52354519|gb|AAU44580.1| hypothetical protein AT5G48090 [Arabidopsis thaliana]
Length = 268
Score = 187 bits (476), Expect = 2e-44, Method: Composition-based stats.
Identities = 79/141 (56%), Positives = 100/141 (70%), Gaps = 1/141 (0%)
Query: 29 FDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLT-KDEVEAMLNFFCKNCE 87
FD VCS CDNGG +LCCEG CLRSFH TI G E+ C SLG T K +++A+ + C NC
Sbjct: 70 FDPVCSICDNGGYVLCCEGSCLRSFHPTIADGIETECESLGFTDKTQIQALGTYLCNNCL 129
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
YKQHQC+ACG+LGSSD+ +VFPC ++ CGHFYHP CV++LL DD+ A++L I
Sbjct: 130 YKQHQCYACGELGSSDENFSQQVFPCSASNCGHFYHPECVARLLCADDQNKAEELQAKIA 189
Query: 148 AGESFTCPLHKCCICKQGENK 168
A + F CPLH C +C E+K
Sbjct: 190 ARDCFACPLHTCKLCNMSEDK 210
>gi|168064191|ref|XP_001784048.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664434|gb|EDQ51154.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 541
Score = 187 bits (475), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 102/224 (45%), Positives = 136/224 (60%), Gaps = 27/224 (12%)
Query: 32 VCSFCDNGGD----LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
VC+FCD+G + LLCC+G C+RSFH TID+G ++ C +L LT+ V + + C NCE
Sbjct: 272 VCAFCDDGVEGSERLLCCDGPCMRSFHPTIDSGWQNKCPTLRLTRAAVSVEI-WICPNCE 330
Query: 88 YKQHQCFACGKLGSSDKETGA----EVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLA 143
QHQCF CGKLG K TG+ +VF C C FYHP CV+KLL+ D VA + LA
Sbjct: 331 VGQHQCFVCGKLG---KSTGSADHQQVFKCEHRQCKRFYHPACVAKLLVPD--VALNNLA 385
Query: 144 KSI-IAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEE 202
I + E+FTCPLHKC C E+KAD L CRRCP A+H +CLP +
Sbjct: 386 CRIQLKFETFTCPLHKCANCNLDEDKADPTLCLIKCRRCPAAWHERCLP----------Q 435
Query: 203 GIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEE 246
++ W L + + ++YC KH +D E+ TP R+HI FP +E+
Sbjct: 436 ACRSQMWP--LEDGKCVMYCGKHRLDPELLTPERNHITFPELEK 477
>gi|222640381|gb|EEE68513.1| hypothetical protein OsJ_26947 [Oryza sativa Japonica Group]
Length = 1287
Score = 186 bits (472), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 108/278 (38%), Positives = 149/278 (53%), Gaps = 43/278 (15%)
Query: 23 NEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN-- 80
N D+ D +C+ CD+GG LL CEG C RSFHA + G +S C +L T E++ +
Sbjct: 235 NATDDDTDMICAICDDGGKLLSCEGPCKRSFHARVKHGRKSKCRTLRFTSVELKLKESGT 294
Query: 81 FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 140
F C+NCE+ +HQCF CG+L S A+VF C +CGHFYHP C++ LL +D A
Sbjct: 295 FLCENCEHNEHQCFKCGELEPS-YGPNAKVFQCNKESCGHFYHPKCIAVLLEPEDTDGAC 353
Query: 141 QLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKL 200
+L + I G SFTCP+H C C + E++ S+ QFAVCRRCP++YH +CLPR
Sbjct: 354 KLEEMIADGMSFTCPVHWCFKCAKMEDRTRSEFQFAVCRRCPRSYHTECLPRH------- 406
Query: 201 EEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSL 260
H ID GTP +HI FP I + K + +K
Sbjct: 407 ------------------------HGIDATTGTPSGEHIKFPSIPKIKKTKNLSKKD--- 439
Query: 261 ASPSGKQKVA----STKSSLTSKAPPQGKFSVKALKRV 294
A +GK+K + STKS+ + P+G+ K +RV
Sbjct: 440 AKVTGKRKKSGNKFSTKSTKLANVSPRGE--SKQTRRV 475
Score = 164 bits (415), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 81/208 (38%), Positives = 119/208 (57%), Gaps = 21/208 (10%)
Query: 522 LQAIVDKLHWYVNDGDM---------------------IVDFCCGANDFSCLMKKKLDET 560
L I DKLHWYV G+ IVD C ++FS LMK+KL++
Sbjct: 588 LVQIADKLHWYVQPGNTESCGKLPDGVLLERSIVVVREIVDLCFNMDNFSRLMKEKLEDV 647
Query: 561 GKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINK 620
K C +KNYD+ KN+ F++ +W+T++PK+L G L+M L+ P ++A ANKF++K
Sbjct: 648 SKGCNFKNYDLFQHKNNLCFDESNWVTMQPKDLPHGLNLVMVLDLPLDIQAVSANKFVDK 707
Query: 621 ALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNM 680
L F P+L+I++ P + ERLD KE Y L+WE +Q + GK Y PG VD ND + ++
Sbjct: 708 VLTFKPRLIIIVSPTDIERLDCKEEPYHLIWEYNQHIFGKPLYQPGDVDVNDDAKNGLHV 767
Query: 681 TAPPLYLWSRHDYAAHHKALAEKHGHIS 708
+ LWS D + +A KHGH++
Sbjct: 768 IPLCVSLWSFPDKTEENMRIAGKHGHLN 795
>gi|218200973|gb|EEC83400.1| hypothetical protein OsI_28843 [Oryza sativa Indica Group]
Length = 615
Score = 186 bits (471), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 109/279 (39%), Positives = 148/279 (53%), Gaps = 37/279 (13%)
Query: 7 DDMEEDMVHDTEED------ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAG 60
DD ED + +D + N D+ ++VC+ CD GG LL C+G C RSFH T++ G
Sbjct: 210 DDYTEDGSSGSSDDNGDNYTDGNASDDGTEAVCAICDQGGILLGCKGECKRSFHPTLEDG 269
Query: 61 EESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGH 120
+S C +LG T EVE + F C NC++KQHQCF CG+L SS ET +VF C +A+CG
Sbjct: 270 TKSFCKTLGYTSREVEEIPIFICSNCQHKQHQCFKCGQLDSS-HETNPKVFQCCNASCGR 328
Query: 121 FYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRR 180
FYHP CV+ LL D A L K I G +FTCP+H C CKQ E+
Sbjct: 329 FYHPKCVAGLLEPD---GACGLEKRIADGMTFTCPVHWCFECKQIED------------- 372
Query: 181 CPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHII 240
R+I+F+ K + R WE + +I IYCL H+ID TP RDHI
Sbjct: 373 -----------REISFKKK---DVTARVWEVPKGDPKIFIYCLDHDIDATFRTPCRDHIK 418
Query: 241 FPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSK 279
FP + + + D RKK + +V+ + L++K
Sbjct: 419 FPSAPQIERVKDLARKKVKVTDIRNTDEVSPESAELSTK 457
>gi|168033774|ref|XP_001769389.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679309|gb|EDQ65758.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 707
Score = 184 bits (468), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 104/233 (44%), Positives = 138/233 (59%), Gaps = 21/233 (9%)
Query: 18 EEDESNEEDELFDSVCSFCDNGGD----LLCCEGRCLRSFHATIDAGEESHCASLGLTKD 73
+ED ++E D VC FCD+G + LLCC+G C+RSFH TID+G ++ C +L LT+
Sbjct: 443 DEDWADEPD-----VCVFCDDGVEGSQRLLCCDGPCMRSFHPTIDSGLQNKCKTLRLTRA 497
Query: 74 EVEAMLNFFCKNCEYKQHQCFACGKLG-SSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 132
+ ++ + C NCE QHQCFACGKLG S++ EVF C C FYHP CV+KLL+
Sbjct: 498 AI-SVETWICPNCEVGQHQCFACGKLGKSTESADEQEVFVCDHNRCRRFYHPACVAKLLV 556
Query: 133 RDDEVAADQLAKSI-IAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 191
E A + LA I + E+FTCPLHKC C ENK D L CRRCP A+H KCLP
Sbjct: 557 --SEAAQNNLACRIQLKLETFTCPLHKCANCNLDENKIDPTLHLVKCRRCPVAWHEKCLP 614
Query: 192 RKIAFEDKLE-EGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPG 243
+ +F + I+ R+ L + + KH +D E+ TP R+HI FP
Sbjct: 615 QYGSFNHLISVRSIVCRSQIWPLADGK------KHRLDPELLTPERNHITFPA 661
>gi|222640379|gb|EEE68511.1| hypothetical protein OsJ_26944 [Oryza sativa Japonica Group]
Length = 566
Score = 182 bits (463), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 103/259 (39%), Positives = 140/259 (54%), Gaps = 31/259 (11%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN 80
+ N D+ ++VC+ CD GG LL C+G C RSFH ++ G +S C +LG T EVE +
Sbjct: 230 DGNASDDGTEAVCAICDQGGILLGCKGECKRSFHPKLEDGTKSFCKTLGYTSREVEEIPI 289
Query: 81 FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 140
F C NC++KQHQCF CG+L SS ET +VF C +A+CG FYHP CV+ LL D A
Sbjct: 290 FICSNCQHKQHQCFKCGQLDSS-HETNPKVFQCCNASCGRFYHPKCVAGLLEPD---GAC 345
Query: 141 QLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKL 200
L K I G +FTCP+H C CKQ E+ R+I+F+ K
Sbjct: 346 GLEKRIADGMTFTCPVHWCFECKQIED------------------------REISFKKK- 380
Query: 201 EEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSL 260
+ R WE + +I IYCL H+ID TP RDHI FP + + + D RKK +
Sbjct: 381 --DVTARVWEVPKGDPKIFIYCLDHDIDATFRTPCRDHIKFPSAPQIERVKDLARKKVKV 438
Query: 261 ASPSGKQKVASTKSSLTSK 279
+V+ + L++K
Sbjct: 439 TDIRNTDEVSPESAELSTK 457
>gi|218200975|gb|EEC83402.1| hypothetical protein OsI_28848 [Oryza sativa Indica Group]
Length = 1266
Score = 164 bits (415), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 82/208 (39%), Positives = 119/208 (57%), Gaps = 21/208 (10%)
Query: 522 LQAIVDKLHWYVNDGDM---------------------IVDFCCGANDFSCLMKKKLDET 560
L I DKLHWYV G+ IVD C ++FS LMK+KL++
Sbjct: 567 LVQIADKLHWYVQPGNTESCGKLPDGVLLERSIVVVREIVDLCFNMDNFSRLMKEKLEDV 626
Query: 561 GKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINK 620
K C +KNYD+ KN+ F++ +W+T++PK+L G L+M L+ P V+A ANKF++K
Sbjct: 627 SKGCNFKNYDLFQHKNNLCFDESNWVTMQPKDLPHGLNLVMVLDLPLDVQAVSANKFVDK 686
Query: 621 ALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNM 680
L F P+L+I++ P + ERLD KE Y L+WE +Q + GK Y PG VD ND + ++
Sbjct: 687 VLTFKPRLIIIVSPTDIERLDCKEEPYHLIWEYNQHIFGKPLYQPGDVDVNDDAKNGLHV 746
Query: 681 TAPPLYLWSRHDYAAHHKALAEKHGHIS 708
+ LWS D + +A KHGH++
Sbjct: 747 IPLCVSLWSFPDKTEENMRIAGKHGHLN 774
Score = 115 bits (288), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 57/133 (42%), Positives = 79/133 (59%), Gaps = 3/133 (2%)
Query: 23 NEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN-- 80
N D+ D +C+ CD+GG LL CEG C RSFHA + G +S C +L T E++ +
Sbjct: 235 NATDDDTDMICAICDDGGKLLSCEGPCKRSFHARVKHGRKSKCRTLRFTSVELKLKESGT 294
Query: 81 FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 140
F C+NCE+ +HQCF CG+L S A+VF C +CGHFYHP C++ LL +D A
Sbjct: 295 FLCENCEHNEHQCFKCGELEPS-YGPNAKVFQCNKESCGHFYHPKCIAVLLEPEDTDGAC 353
Query: 141 QLAKSIIAGESFT 153
+L + I F+
Sbjct: 354 KLEEMIADAIMFS 366
Score = 41.6 bits (96), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 71/161 (44%), Gaps = 34/161 (21%)
Query: 157 HKCCICKQGENKADSDLQFAVCRR--CPKAYHRKCLPRKIAFED-----KLEEGI---IT 206
H+C C + E + + C + C YH KC+ + ED KLEE I I
Sbjct: 305 HQCFKCGELEPSYGPNAKVFQCNKESCGHFYHPKCIAVLLEPEDTDGACKLEEMIADAIM 364
Query: 207 RAWEGLLPNHRILIYCLK---------HEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKK 257
+W H+++ + K H ID GTP +HI FP I + K + +K
Sbjct: 365 FSW------HQMVFFTAKFPLKQGQRHHGIDATTGTPSGEHIKFPSIPKIKKTKNLSKKD 418
Query: 258 QSLASPSGKQKVA----STKSSLTSKAPPQGKFSVKALKRV 294
A +GK+K + STKS+ + P+G+ K +RV
Sbjct: 419 ---AKVTGKRKKSGNKFSTKSTKLANVSPRGE--SKQTRRV 454
>gi|242085258|ref|XP_002443054.1| hypothetical protein SORBIDRAFT_08g007170 [Sorghum bicolor]
gi|241943747|gb|EES16892.1| hypothetical protein SORBIDRAFT_08g007170 [Sorghum bicolor]
Length = 279
Score = 164 bits (414), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 132/258 (51%), Gaps = 51/258 (19%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN---FFCKNC 86
D VC+ CD+GG + CC+ CLRSFH T + GE S C SLG+ +E + +++ F CKNC
Sbjct: 36 DDVCAICDDGGYVTCCDSGCLRSFHLTEEHGEGSKCPSLGINSEEAKMIIDKKDFICKNC 95
Query: 87 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 146
+YKQHQC +CG LGSSD +GAEVF C CGHFYHP C+SKLL D++ A
Sbjct: 96 KYKQHQCSSCGLLGSSDLSSGAEVFQCKEYNCGHFYHPKCISKLLYPGDKLRACHF---- 151
Query: 147 IAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIIT 206
+ D+ F + P Y I
Sbjct: 152 -----------------------EHDIPFET-KEGPNGY------------------IFQ 169
Query: 207 RAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGK 266
RAW+G+L + RILIYC+KHEI E+G P R IIFP +EN + P G+
Sbjct: 170 RAWDGILRD-RILIYCMKHEIVKELGIPRRKLIIFP-YDENLCVPQGPESAPKEQDTLGQ 227
Query: 267 QKVASTKSSLTSKAPPQG 284
+++ SS S++ P G
Sbjct: 228 EELLDHPSSEPSQSLPSG 245
>gi|242085262|ref|XP_002443056.1| hypothetical protein SORBIDRAFT_08g007190 [Sorghum bicolor]
gi|241943749|gb|EES16894.1| hypothetical protein SORBIDRAFT_08g007190 [Sorghum bicolor]
Length = 232
Score = 144 bits (364), Expect = 2e-31, Method: Composition-based stats.
Identities = 78/217 (35%), Positives = 109/217 (50%), Gaps = 50/217 (23%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN---FFCKNCEY 88
VC+ CD+GG+L CCEG C R+FH + +LGLT ++ + ++ F CKNC+Y
Sbjct: 41 VCAICDDGGNLTCCEGCCQRAFHLD---NKHDCIKTLGLTIEDAKNIIEKEAFICKNCQY 97
Query: 89 KQHQCFACGKLGSSD---KETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS 145
KQHQC+ CG LGSSD + AEVF C C FYHP CV+++L D +
Sbjct: 98 KQHQCYICGSLGSSDDTDTSSQAEVFQCEHDDCARFYHPKCVAQMLYPDSHHRQELFELQ 157
Query: 146 IIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGII 205
+ AGE F+CP+H+C +CK+ ENK + L
Sbjct: 158 VAAGERFSCPMHECIVCKEVENKTEKPLL------------------------------- 186
Query: 206 TRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 242
+ + ++ KHEI ++ TP RDHIIFP
Sbjct: 187 ----------NFVNLFYRKHEIVRKLQTPARDHIIFP 213
>gi|414877686|tpg|DAA54817.1| TPA: hypothetical protein ZEAMMB73_710478 [Zea mays]
Length = 277
Score = 144 bits (362), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 93/269 (34%), Positives = 130/269 (48%), Gaps = 64/269 (23%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML---NFFCKNC 86
++VC+ CD+GG+L CCEG C RSFH E + +LGLT E + ++ +F CKNC
Sbjct: 39 ENVCAICDDGGNLTCCEGCCQRSFHLD---NEHNCIKTLGLTIQEAKNIIEKEDFICKNC 95
Query: 87 EYKQHQCFACGKLGSSD--KETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAK 144
+YKQHQCF CG LGSSD + +VF C C FYHP CV+++L D + +
Sbjct: 96 QYKQHQCFICGSLGSSDDTDTSSQQVFQCEHDDCARFYHPKCVAQMLYPDSQ---ELFEL 152
Query: 145 SIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGI 204
+ AGESFTCP+H+C +CK+ +NK D L + F
Sbjct: 153 HVAAGESFTCPMHECIVCKEVDNKKDKTL--------------------LDF-------- 184
Query: 205 ITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP---------GIEENKTIIDRPR 255
+ ++ KHEI E+ TP R+HI+FP + + D
Sbjct: 185 -------------VNLFYRKHEIVRELQTPARNHIVFPESNVLKTFQALYAQREAPDHSS 231
Query: 256 KKQSLASP---SGKQKVASTKSSLTSKAP 281
KQ+ +SP + Q S L S AP
Sbjct: 232 SKQTHSSPPPAARDQNQCYCSSPLCSFAP 260
>gi|384251374|gb|EIE24852.1| hypothetical protein COCSUDRAFT_41130 [Coccomyxa subellipsoidea
C-169]
Length = 1073
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 149/278 (53%), Gaps = 24/278 (8%)
Query: 416 DAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIE 475
D A+S+R E R P +++++ IT +++G V+A R AL
Sbjct: 766 DLANSLRDVEANSRKPKPY------QNSLEPTITRDRLDGHVKACRGALDN--------P 811
Query: 476 DAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVND 535
+ + +++++++ ++ L++ LAP LHG YTS+GRHFT L+ + L ++
Sbjct: 812 HSAQLMMGRLVNELYRAEDNLRMVLAPVLHGASYTSYGRHFTLPYLLEEVNKYLLPFMLT 871
Query: 536 GDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAP 595
GD +VDF CG N + ++K+ G + +DI+ AK+ +F K W+ V+ +L
Sbjct: 872 GDTVVDFSCGYNVWLPMLKRMGLAEGLEIHGRAFDIITAKDTEDFVKSSWLDVDFGQLPR 931
Query: 596 GSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQ 655
G +L++GLNPPFG LA+KF+ A +F P++++LI PP T Y ++ E+ +
Sbjct: 932 GDKLVIGLNPPFGKNGCLASKFVTHAAQFQPRVIVLIAPPGT----IIPPGYIVMLEETR 987
Query: 656 FLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDY 693
++G++F+ PG VD+ WN P + R DY
Sbjct: 988 LMAGEAFFKPG-VDKAS-----WNDVPPSTRILIREDY 1019
Score = 122 bits (305), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 117/469 (24%), Positives = 177/469 (37%), Gaps = 94/469 (20%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESH-----------CASLGLTKDEVEAM 78
D++C+ CD+GG+L+ C+G C+R+FHA I EE C L + +
Sbjct: 185 DTICTKCDDGGELILCDGGCMRAFHAGIKPAEEEEEGPVYVDDPGSCNPLQIGAELFMRF 244
Query: 79 LN----FFCKNCEYKQHQCFACG--KLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 132
+ F C NC +HQCFAC EVF C CG FYH C++K
Sbjct: 245 DSTNEPFMCPNCHSGKHQCFACKVEGSVGGSSSGLPEVFKCTVGDCGRFYHARCLNK--- 301
Query: 133 RDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPR 192
D +I F CPLH C C +GEN ++ CRRCP AYH CLP+
Sbjct: 302 -------DPQEMAIF----FVCPLHTCKRCNEGENAEKGEV--ISCRRCPVAYHAGCLPK 348
Query: 193 KIAFEDKLEEGIITRAW-------EG-----LLPNHRI----------LIYCLKHEIDDE 230
KI +TR W EG LP + L+YC++H++
Sbjct: 349 KIL------NSPVTRVWITRKLDKEGNVLPPPLPRTKAVEPECDIGQSLLYCMRHDMALG 402
Query: 231 IGTPIRDHII---------------FPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSS 275
TP+ + FP + ++ I+DR + KQ + +++
Sbjct: 403 DATPVMGELFSPQLRRLWAKAQATKFPDLLWSRDIMDRFAARAERRRQQAKQNAQAERAA 462
Query: 276 LTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVK 335
++A Q + + P KA + + V R K S V
Sbjct: 463 --TQASKQAGVARAVPQAAPKKAAVSKGAAPAAGAAVAVPKRPRTKPQPAPAASAAAAVA 520
Query: 336 SLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVK--PLRK 393
V ++D + L R+ A + +Q + V + P
Sbjct: 521 RKPAPV----AIDDNEIPLAVRIAAG---------RKAQQQRPQKPAPIVVEAEYIPFED 567
Query: 394 KLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKS 442
+P++ S + + KD+AS+ R E KR P T AS +
Sbjct: 568 TPAEPVPAVRPPSPVK-PPIKKDSASAPRQSEHEKRLPAPDTRGDASAA 615
>gi|296082727|emb|CBI21732.3| unnamed protein product [Vitis vinifera]
Length = 88
Score = 139 bits (350), Expect = 9e-30, Method: Composition-based stats.
Identities = 64/88 (72%), Positives = 77/88 (87%)
Query: 586 MTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKES 645
MTV+P EL GS LIMGLNPPFGV+A LANKFI++ALEF PKLL+LIVP ET+RLD+K+
Sbjct: 1 MTVKPSELPTGSHLIMGLNPPFGVQAALANKFIDQALEFKPKLLVLIVPQETQRLDKKKL 60
Query: 646 AYELVWEDDQFLSGKSFYLPGSVDENDK 673
Y+LVW++D+ LSGKSFYLPGS+D NDK
Sbjct: 61 PYDLVWKEDEELSGKSFYLPGSIDVNDK 88
>gi|108707357|gb|ABF95152.1| hydroxyproline-rich glycoprotein family protein, putative [Oryza
sativa Japonica Group]
gi|125585711|gb|EAZ26375.1| hypothetical protein OsJ_10258 [Oryza sativa Japonica Group]
Length = 256
Score = 134 bits (337), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 74/183 (40%), Positives = 100/183 (54%), Gaps = 36/183 (19%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
+C+ CD+GG+L+ CEG C R FH TI + +C +L +++++VE+ F CKNC YKQH
Sbjct: 25 ICAICDDGGNLIRCEGACRRYFHRTISNDADFNCETLNMSQEQVESS-KFICKNCVYKQH 83
Query: 92 QCFACGKLGSSDKETG-AEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
QCF CG+LGSSD +G AEV+ C + C FYHP C+++ S
Sbjct: 84 QCFGCGELGSSDMSSGSAEVYQCSKSRCRRFYHPKCLAEF-------------DSSKNPP 130
Query: 151 SFTCPLHKCCICKQGENKADSD---------------------LQFAVCRRCPKAYHRKC 189
F CPLH+C CK K + + + A+CRRCP AYHRKC
Sbjct: 131 VFECPLHECFACKNKGEKYNEETCKNKGQESIKKKQGAENNKKMHLALCRRCPIAYHRKC 190
Query: 190 LPR 192
LPR
Sbjct: 191 LPR 193
>gi|449532244|ref|XP_004173092.1| PREDICTED: uncharacterized protein LOC101229951 [Cucumis sativus]
Length = 77
Score = 134 bits (336), Expect = 4e-28, Method: Composition-based stats.
Identities = 59/69 (85%), Positives = 64/69 (92%)
Query: 575 KNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVP 634
+NDFNFEKRDWMTV+PKE GS+LIMGLNPPFGVKA LANKF++KALEFNPKLLILIVP
Sbjct: 9 QNDFNFEKRDWMTVQPKEFPKGSQLIMGLNPPFGVKAALANKFVDKALEFNPKLLILIVP 68
Query: 635 PETERLDRK 643
PETERLD K
Sbjct: 69 PETERLDEK 77
>gi|168001433|ref|XP_001753419.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695298|gb|EDQ81642.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 593
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/254 (37%), Positives = 134/254 (52%), Gaps = 37/254 (14%)
Query: 18 EEDESNEEDELFDS-VCSFCD----NGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTK 72
E DE +++ + + VC+FCD NG LLCCEG C+RSFH TI++G ++ C +LGL+
Sbjct: 359 ETDEVGDDNWIDEPDVCAFCDDGVENGEKLLCCEGLCMRSFHPTINSGLQNKCRTLGLSP 418
Query: 73 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGA------EVFPCVSATCGHFYHPHC 126
V ++ C NCE QHQCFACGKLGSS + + EV C + C FYHP C
Sbjct: 419 AAVNVK-SWICPNCEAGQHQCFACGKLGSSSSGSSSNSMGLLEVVVCGAKYCQRFYHPDC 477
Query: 127 VSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYH 186
V KLL+ + E D + ++ E+ CPLHKC + E+ + RR Y
Sbjct: 478 VVKLLVPEAE-QKDLAMRIRLSLETLICPLHKC---RMPESSVAT-------RRWQVCYV 526
Query: 187 RKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEE 246
+ + +G IT + GL N + + +H ID E+ TP RDHI+FP +
Sbjct: 527 LR----------GIGDGGITDS--GLAFNGSLWVSS-RHRIDPELLTPERDHILFPPPKP 573
Query: 247 NKTIIDRPRKKQSL 260
+T RPR Q +
Sbjct: 574 GRT-TSRPRVTQPV 586
>gi|307105371|gb|EFN53620.1| hypothetical protein CHLNCDRAFT_58419 [Chlorella variabilis]
Length = 819
Score = 122 bits (306), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 73/219 (33%), Positives = 102/219 (46%), Gaps = 44/219 (20%)
Query: 29 FDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN------FF 82
FD+ C+ CD+GG+LL C+G C+RSFH + E+ C L L D + + + F
Sbjct: 386 FDTQCALCDDGGNLLACDGPCMRSFHVS---AEDDGCNLLCLPPDLAQTLTSAANSETFL 442
Query: 83 CKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
C NC HQC+ C + G + G EVF C++A+CG YHP C
Sbjct: 443 CPNCLAGVHQCYVCKREG----KAGTEVFKCINASCGRHYHPECAGA------------- 485
Query: 143 AKSIIAGESFTCPLHKCCICKQ-GENKADSDL-QFAVCRRCPKAYHRKCLPRKIAFEDKL 200
AG+ F C LH C C Q G+ A + CRRCP A+HR+CLP I
Sbjct: 486 ----EAGQPFVCKLHSCHACGQPGDASAPGQAGELVPCRRCPVAFHRRCLP--IELPRHR 539
Query: 201 EEGI-ITRAW---------EGLLPNHRILIYCLKHEIDD 229
+ G + R W + L + L+YC H + +
Sbjct: 540 DGGASLARVWLADYEDESGKWLDGVEQSLLYCRNHSLGE 578
Score = 47.8 bits (112), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 56/113 (49%), Gaps = 12/113 (10%)
Query: 418 ASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIE-- 475
A S+ + + + + P +AY K +D+A K++ +++ AL + + ++
Sbjct: 696 APSLTKDSLWRATRRPEPYAYPMKKTIDEA----KLQALERSVQYALARRLPEAELLQLL 751
Query: 476 DAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDK 528
DA+AV + + L + LAP+LH RY+S+GRHFT L D+
Sbjct: 752 DAQAV------RSVLSHYDDLLMVLAPYLHENRYSSYGRHFTSHRLLHTAADR 798
>gi|449473710|ref|XP_004153960.1| PREDICTED: uncharacterized protein LOC101211308, partial [Cucumis
sativus]
Length = 131
Score = 120 bits (300), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 56/92 (60%), Positives = 76/92 (82%), Gaps = 1/92 (1%)
Query: 402 LDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIR 461
LD +S+RRL +MK+ ASS+ +E+++K+HK+PSTHAY+ K VDK I +GK+EGSV A+R
Sbjct: 41 LDANSERRLMDMMKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSVVAVR 100
Query: 462 TALKKLDVDGSSIEDAKAVCEPEVLSQIFKWK 493
AL+KL+ +G IEDA+AVCEPEVL+ IFKWK
Sbjct: 101 AALRKLE-EGCCIEDAEAVCEPEVLNHIFKWK 131
>gi|242041461|ref|XP_002468125.1| hypothetical protein SORBIDRAFT_01g040025 [Sorghum bicolor]
gi|241921979|gb|EER95123.1| hypothetical protein SORBIDRAFT_01g040025 [Sorghum bicolor]
Length = 131
Score = 97.8 bits (242), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/131 (45%), Positives = 69/131 (52%), Gaps = 27/131 (20%)
Query: 73 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGA--EVFPCVSATCGHFYHPHCVSKL 130
+E NF CKNCEYK+HQCF CG+LGSS+ G+ EV C CG FYHP C+SK
Sbjct: 16 NEKSKSFNFICKNCEYKKHQCFVCGELGSSEMPPGSPEEVLKCQKKYCGRFYHPKCLSKY 75
Query: 131 LLRDDEVAADQLAKSIIAGESFTCPLHKCCICK----------QGENKADSDLQFAVCRR 180
D Q F CPLH+C CK Q E K ++ L CRR
Sbjct: 76 ----DPTKNRQ---------DFECPLHECHSCKNKGETVITSEQTEKKKETYL--VQCRR 120
Query: 181 CPKAYHRKCLP 191
CP AYHRKCLP
Sbjct: 121 CPVAYHRKCLP 131
>gi|449472830|ref|XP_004153707.1| PREDICTED: uncharacterized protein LOC101217475, partial [Cucumis
sativus]
Length = 200
Score = 97.1 bits (240), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 43/65 (66%), Positives = 54/65 (83%), Gaps = 3/65 (4%)
Query: 45 CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 104
C+GRC+RSFHAT + G+ C SLGL+K+EV+A+ F CKNCEYKQHQC+ACG LGSSD+
Sbjct: 139 CDGRCMRSFHATEEDGD---CFSLGLSKEEVDAIETFICKNCEYKQHQCYACGNLGSSDQ 195
Query: 105 ETGAE 109
+GAE
Sbjct: 196 SSGAE 200
>gi|328865207|gb|EGG13593.1| PHD zinc finger-containing protein [Dictyostelium fasciculatum]
Length = 584
Score = 94.7 bits (234), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 70/219 (31%), Positives = 96/219 (43%), Gaps = 43/219 (19%)
Query: 17 TEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVE 76
EE + + ++VC+ C+ GG+LL C+G CLRSFH LGL + V
Sbjct: 72 VEEKQLPKRQRKNEAVCTICEKGGELLMCDGACLRSFHV----------ECLGL-QSLVT 120
Query: 77 AMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDE 136
+ C +C +Q+ CF+C K G G ++ C CG FYH C+++
Sbjct: 121 PGQRWECDDCLNQQNSCFSCKKRGI----IGMDLMKCKVHQCGKFYHHSCIAEF------ 170
Query: 137 VAADQLAKSI-IAGESFTCPLHKCCICKQ-GENKADSDLQFAVCRRCPKAYHRKCLPRKI 194
LAK + F CPLH C C Q G+ K Q C RCP AYH C+P +
Sbjct: 171 ----PLAKMVNTKSPRFNCPLHYCGKCGQSGDGK-----QSVHCFRCPAAYHVTCIPPGV 221
Query: 195 AFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGT 233
K +E T LI C KH+ + I T
Sbjct: 222 KMLTKSKETRKTG-----------LILCPKHQHETVIPT 249
>gi|66828485|ref|XP_647597.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
gi|74859221|sp|Q55FD6.1|Y8158_DICDI RecName: Full=PHD finger-containing protein DDB_G0268158
gi|60475597|gb|EAL73532.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
Length = 688
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 93/192 (48%), Gaps = 23/192 (11%)
Query: 15 HDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTK- 72
++T +D+ + ++VC+FC+ G+LL C+G CLRSFH + + A + +S ++
Sbjct: 121 NETNDDDKPKRPRKNEAVCTFCEKPGELLMCDGLCLRSFHISCLKARNLYNPSSSSISPV 180
Query: 73 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 132
++ + + C +C Q+ CF+C K G G ++ C CG FYH C
Sbjct: 181 TTIDGTVRWECNDCVSSQNSCFSCKKRGI----IGIDLMKCKVHQCGKFYHYKC------ 230
Query: 133 RDDEVAADQLAKSI-IAGESFTCPLHKCCICK-QGENKADSDLQFAVCRRCPKAYHRKCL 190
VA +LAK I F CPLH C +C+ G+ K Q C RCP AYH C+
Sbjct: 231 ----VADYKLAKLINTKTPRFNCPLHYCSVCEVSGDGK-----QSVHCFRCPTAYHVICM 281
Query: 191 PRKIAFEDKLEE 202
+ K E
Sbjct: 282 QPGVKMLTKTRE 293
>gi|330833845|ref|XP_003291988.1| hypothetical protein DICPUDRAFT_156684 [Dictyostelium purpureum]
gi|325077793|gb|EGC31483.1| hypothetical protein DICPUDRAFT_156684 [Dictyostelium purpureum]
Length = 537
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 61/187 (32%), Positives = 86/187 (45%), Gaps = 27/187 (14%)
Query: 18 EEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEA 77
+E+E + +++C+ C+ GDLL C+G CLRSFH H L D
Sbjct: 74 DEEEKPKRTRKNETICTVCEKPGDLLMCDGLCLRSFHVNCVGSANIH---LNTNPD---G 127
Query: 78 MLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEV 137
+ + C +C +Q+ CF+C K G G ++ C CG FYH CVS+
Sbjct: 128 SVRWECNDCVNQQNYCFSCKKRGI----IGMDLMKCKVHQCGKFYHYKCVSEY------- 176
Query: 138 AADQLAKSI-IAGESFTCPLHKCCICK-QGENKADSDLQFAVCRRCPKAYHRKCLPRKIA 195
+LAK I F CPLH C +C+ G+ K Q C RCP AYH C+ +
Sbjct: 177 ---KLAKLINTKTPRFNCPLHYCAVCEVSGDGK-----QSVHCFRCPTAYHVICMQPGVK 228
Query: 196 FEDKLEE 202
K +E
Sbjct: 229 MLTKSKE 235
>gi|325193611|emb|CCA27889.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1008
Score = 87.4 bits (215), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 60/187 (32%), Positives = 79/187 (42%), Gaps = 51/187 (27%)
Query: 29 FDSVCSFCDNG-GDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN---FFCK 84
D +C C+ GDL C G C+ ++H HC + + A L+ + C
Sbjct: 1 MDHICLRCEKSDGDLWLCTGLCVSAYHL--------HCLQI-----DANANLDPTTWKCP 47
Query: 85 NCEYKQHQCFACGKLGSSDKETGA------------------EVFPCVSATCGHFYHPHC 126
+C+ H CF C + GSS A V C + +CG FYH C
Sbjct: 48 SCQNHLHICFYCHQTGSSFYHENAVQQEPSQTHTPPLHDNISYVSKCRALSCGKFYHLEC 107
Query: 127 VSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYH 186
++KL L + IAG F CPLH C C+Q K +S C RCP AYH
Sbjct: 108 ITKLAL------------ARIAGTHFICPLHTCASCEQSGAKKES----VRCARCPVAYH 151
Query: 187 RKCLPRK 193
CLPRK
Sbjct: 152 TSCLPRK 158
>gi|242081339|ref|XP_002445438.1| hypothetical protein SORBIDRAFT_07g019210 [Sorghum bicolor]
gi|241941788|gb|EES14933.1| hypothetical protein SORBIDRAFT_07g019210 [Sorghum bicolor]
Length = 149
Score = 87.4 bits (215), Expect = 5e-14, Method: Composition-based stats.
Identities = 49/128 (38%), Positives = 61/128 (47%), Gaps = 22/128 (17%)
Query: 27 ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNC 86
EL D VC+ C +GGDLL SLG T+ +VEAM NF C +C
Sbjct: 31 ELNDVVCALCGDGGDLLW----------------------SLGFTEAQVEAMQNFICHSC 68
Query: 87 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 146
+QH+C +CG GSS +VF C TCG FYH C+S L D A + +
Sbjct: 69 RNRQHRCASCGIRGSSGVGNHPQVFRCNHETCGRFYHASCISAELHPGDPAEAGRCRVRV 128
Query: 147 IAGESFTC 154
AG F C
Sbjct: 129 AAGRPFWC 136
>gi|298706866|emb|CBJ25830.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 810
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 84/178 (47%), Gaps = 32/178 (17%)
Query: 23 NEEDELFDSVCSFCDNG--GDLLCCEGRCLRSFHAT-IDAGEESHCASLGLTKDEVEAML 79
N D+ ++CS D D++ C+G CLRSFH +D GE++ K+E
Sbjct: 220 NSADDWICAMCSQLDTPCMSDMVTCDGPCLRSFHVVCLDLGEDA-------LKEE----- 267
Query: 80 NFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAA 139
+ C++CE +H+C+ CG G + G VF C +CG FYH HCV L ++ V A
Sbjct: 268 KWLCEDCERAEHECWQCGDYGQDNVVGG--VFRCGVPSCGRFYHRHCVE--LNKNSVVKA 323
Query: 140 DQLAKSIIAGE------SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 191
D + GE F C H C C G + L C +CP AYH C+P
Sbjct: 324 D-----VDEGEDGQPIFKFRCAYHTCDTCCSGRGGTKNHLY--KCIKCPTAYHLNCIP 374
>gi|414884834|tpg|DAA60848.1| TPA: putative EDM2-like family protein [Zea mays]
Length = 143
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/67 (61%), Positives = 49/67 (73%), Gaps = 1/67 (1%)
Query: 45 CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 104
C+G C+RSFHA +GE+S C +LG T+ EVEAM F CKNCEYKQHQCF CG L SD
Sbjct: 22 CDGPCMRSFHAKEGSGEDSCCVTLGYTEAEVEAMKLFLCKNCEYKQHQCFICGVLEPSDG 81
Query: 105 ETGAEVF 111
E A+V+
Sbjct: 82 E-AAKVY 87
>gi|348672863|gb|EGZ12683.1| hypothetical protein PHYSODRAFT_563400 [Phytophthora sojae]
Length = 1174
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 73/175 (41%), Gaps = 42/175 (24%)
Query: 32 VCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML---NFFCKNCE 87
+C C+ GD L+ C G C+ +FHA L+ D A + C +C
Sbjct: 80 ICVVCEREGDALVVCGGPCISAFHAAC------------LSPDSGAAQAQDDTWLCPSCR 127
Query: 88 YKQHQCFACGKLG------SSDKETGAE-----VFPCVSATCGHFYHPHCVSKLLLRDDE 136
K H CF C + G + GA+ V C + +CG FYH C+++ L
Sbjct: 128 SKTHACFHCKQTGVEALKDDAPASAGADPTLRPVRKCRALSCGKFYHQECIAQFPL---- 183
Query: 137 VAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 191
I F CPLH C C N++ + + C RCP AYH +CLP
Sbjct: 184 -------ARIATNTHFICPLHTCAGC----NQSGAQQEAVRCMRCPVAYHARCLP 227
>gi|414884835|tpg|DAA60849.1| TPA: putative EDM2-like family protein [Zea mays]
Length = 170
Score = 84.7 bits (208), Expect = 3e-13, Method: Composition-based stats.
Identities = 44/121 (36%), Positives = 75/121 (61%), Gaps = 3/121 (2%)
Query: 372 FGKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHK 431
F S +E + + PL S P +D ++++R+ +++ + SS+ ++++ +
Sbjct: 48 FISSPKSVKEQEQELVPLPLSAIRKSSFPKVDSETEKRVITILGNKVSSLTIKDVTRNCS 107
Query: 432 IPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFK 491
+PSTH Y+ + + I LGK+E SV+A+ ALKKL+ +G S+ DAKAVCEPEVL Q+ +
Sbjct: 108 LPSTHVYSGR--LTDKIPLGKIERSVQAVEAALKKLE-NGGSVNDAKAVCEPEVLRQLAR 164
Query: 492 W 492
W
Sbjct: 165 W 165
>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 2586
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 68/161 (42%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1436 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1482
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1483 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVV 1523
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F CPLH C C N + S + C RCP AYH
Sbjct: 1524 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1564
>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Rattus norvegicus]
gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 2381
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 68/161 (42%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1231 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1277
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1278 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVV 1318
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F CPLH C C N + S + C RCP AYH
Sbjct: 1319 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1359
>gi|224050217|ref|XP_002195834.1| PREDICTED: histone-lysine N-methyltransferase NSD2 [Taeniopygia
guttata]
Length = 1339
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 64/157 (40%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ GDLL CEG C R+FH + LGL+ F C C H
Sbjct: 673 VCQLCEKTGDLLLCEGLCYRAFHVSC----------LGLSG---RPAGKFICSECTSGVH 719
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE A+V CV + CG FYH CV K L ++
Sbjct: 720 TCFVC-------KERKADVKRCVVSHCGKFYHEACVKKFHL------------TVFENRG 760
Query: 152 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 186
F CPLH C C + S + C RCP AYH
Sbjct: 761 FRCPLHSCLSCHVSNPSHPRISKGKMMRCVRCPVAYH 797
>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Cricetulus griseus]
Length = 2690
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1544 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1590
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K ++
Sbjct: 1591 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVT 1631
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F CPLH C C N + S + C RCP AYH
Sbjct: 1632 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1672
>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Cricetulus griseus]
Length = 2318
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1172 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1218
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K ++
Sbjct: 1219 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVT 1259
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F CPLH C C N + S + C RCP AYH
Sbjct: 1260 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1300
>gi|301095030|ref|XP_002896617.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262108847|gb|EEY66899.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1089
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/179 (30%), Positives = 74/179 (41%), Gaps = 50/179 (27%)
Query: 32 VCSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 90
+C C+ G DL+ C G C+ +FH + A G++ EA + C +C K
Sbjct: 74 ICVVCEREGDDLVVCAGPCISAFHLSCLP------ARSGVSDALDEA---WLCPSCTNKT 124
Query: 91 HQCFACGKLGSSDKETGAE------------------VFPCVSATCGHFYHPHCVSKLLL 132
H CF C KETG E V C + +CG FYH C+++ L
Sbjct: 125 HACFHC-------KETGVETLKDDAPASTGIDPNKRPVRKCRALSCGKFYHQECITQFPL 177
Query: 133 RDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 191
I F CPLH C C N++ + + C RCP AYH KCLP
Sbjct: 178 -----------ARIAINTHFICPLHTCAGC----NESGAQQEAVRCMRCPVAYHAKCLP 221
>gi|354472091|ref|XP_003498274.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Cricetulus griseus]
Length = 1436
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 68/159 (42%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C+ GD L+ CEG C R FH LGLT F C+ C+
Sbjct: 701 DTVCQICETAGDCLVSCEGECCRHFHWEC----------LGLT---AVPEGRFICEECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C KE+G +V C + CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KESGEDVKRCSVSVCGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|354472093|ref|XP_003498275.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Cricetulus griseus]
Length = 1387
Score = 82.4 bits (202), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 68/159 (42%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C+ GD L+ CEG C R FH LGLT F C+ C+
Sbjct: 695 DTVCQICETAGDCLVSCEGECCRHFHWEC----------LGLT---AVPEGRFICEECKT 741
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C KE+G +V C + CG FYH CV K +I
Sbjct: 742 GQHPCFSC-------KESGEDVKRCSVSVCGKFYHEACVRK------------FPTAIFE 782
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 783 SKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 821
>gi|345315868|ref|XP_001516412.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2,
partial [Ornithorhynchus anatinus]
Length = 951
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH+ A LGL + F C C H
Sbjct: 477 VCQLCEKTGSLLLCEGPCYGAFHS----------ACLGLPR---RPEGKFICSECTSGVH 523
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE EV CV + CG FYH CV K L ++
Sbjct: 524 SCFVC-------KEKKTEVKRCVVSQCGKFYHEACVRKHHL------------TVFESRG 564
Query: 152 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 565 FRCPLHSCVSCHVSNPSNPRASKGKLMRCVRCPVAYH 601
>gi|395543169|ref|XP_003773493.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Sarcophilus harrisii]
Length = 1464
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 63/157 (40%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G+LL CEG C +FH + LGL++ F C C H
Sbjct: 670 VCQLCEKSGNLLLCEGPCYGAFHLSC----------LGLSR---RPEGKFICSECTSGIH 716
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE EV CV + CG FYH CV K L ++
Sbjct: 717 SCFVC-------KEKKTEVKRCVVSQCGKFYHEACVKKYHL------------TVFESRG 757
Query: 152 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 758 FRCPLHSCVSCHVTNPSNPRSSKGKMMRCVRCPVAYH 794
>gi|118090799|ref|XP_420839.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Gallus
gallus]
Length = 1369
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 64/157 (40%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ GDLL CEG C R+FH + LGL+ F C C H
Sbjct: 674 VCQLCEKTGDLLLCEGLCYRAFHVSC----------LGLSG---RPAGKFVCSECTSGVH 720
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE A++ CV + CG FYH CV K L ++
Sbjct: 721 TCFVC-------KERKADLKRCVVSHCGKFYHEACVKKFHL------------TVFESRG 761
Query: 152 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 186
F CPLH C C + S + C RCP AYH
Sbjct: 762 FRCPLHSCLSCHVSNPSHPRISKGKMMRCVRCPVAYH 798
>gi|348571625|ref|XP_003471596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
1 [Cavia porcellus]
Length = 1366
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/157 (35%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL + F C C H
Sbjct: 669 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLPR---RPEGKFTCMECTSGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ AEV CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KESKAEVKRCVVTQCGKFYHEACVRKFPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>gi|348571627|ref|XP_003471597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
2 [Cavia porcellus]
Length = 1367
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/157 (35%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL + F C C H
Sbjct: 670 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLPR---RPEGKFTCMECTSGIH 716
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ AEV CV CG FYH CV K L ++
Sbjct: 717 SCFVC-------KESKAEVKRCVVTQCGKFYHEACVRKFPL------------TVFESRG 757
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 758 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 794
>gi|255583040|ref|XP_002532288.1| hypothetical protein RCOM_0549780 [Ricinus communis]
gi|223528022|gb|EEF30103.1| hypothetical protein RCOM_0549780 [Ricinus communis]
Length = 164
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 42/87 (48%), Positives = 55/87 (63%), Gaps = 7/87 (8%)
Query: 637 TERLDRKESAYELVWEDDQFLSGKSFY------LPGSVDENDKQMDQWNMTAPPLYLWSR 690
+RLD K + Y+L+WED LSGK + LPGS+D NDKQ++Q N PPLYLWSR
Sbjct: 79 VKRLDEK-APYKLIWEDHMLLSGKGTWQFTGSHLPGSIDVNDKQIEQRNSKPPPLYLWSR 137
Query: 691 HDYAAHHKALAEKHGHISRPQSRTQME 717
D+ A H A+A + GH+S + ME
Sbjct: 138 LDWIAKHMAIARQCGHMSNQHEVSVME 164
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 24/42 (57%), Gaps = 3/42 (7%)
Query: 43 LCCEGRCLRSFHATIDAGEESHCASLGL---TKDEVEAMLNF 81
LC GRCLRSFH ++ G ES C SLG + EV L F
Sbjct: 9 LCWNGRCLRSFHPSVADGVESKCVSLGFRNKAEYEVWTTLQF 50
>gi|326919530|ref|XP_003206033.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Meleagris gallopavo]
Length = 1348
Score = 81.6 bits (200), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 64/157 (40%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ GDLL CEG C R+FH + LGL+ F C C H
Sbjct: 653 VCQLCEKTGDLLLCEGLCYRAFHVSC----------LGLSG---RPAGKFVCSECTSGVH 699
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE A++ CV + CG FYH CV K L ++
Sbjct: 700 TCFVC-------KERKADLKRCVVSHCGKFYHEACVKKFHL------------TVFENRG 740
Query: 152 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 186
F CPLH C C + S + C RCP AYH
Sbjct: 741 FRCPLHSCLSCHVSNPSHPRISKGKMMRCVRCPVAYH 777
>gi|380791865|gb|AFE67808.1| putative histone-lysine N-methyltransferase NSD2 isoform 1, partial
[Macaca mulatta]
Length = 949
Score = 81.6 bits (200), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>gi|432879768|ref|XP_004073538.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Oryzias latipes]
Length = 2321
Score = 81.6 bits (200), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 69/174 (39%), Gaps = 39/174 (22%)
Query: 17 TEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVE 76
++ D + L ++VC C+ GDLL CEG C +FH C L V
Sbjct: 1454 SQVDGKGKTSSLKENVCQVCERPGDLLVCEGHCYGAFHL--------QCVGLS-----VA 1500
Query: 77 AMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDE 136
FFC+ C H CF C K G +V C+ CG FYH C+
Sbjct: 1501 PKGKFFCQECNTGDHMCFVCKKSG--------DVKRCIIPLCGKFYHMDCI--------- 1543
Query: 137 VAADQLAKSIIA--GESFTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 186
LA S + F CPLH C C N S + A C RCP AYH
Sbjct: 1544 -----LAFSATQPHNKGFRCPLHVCLSCHITNPVNVFSSKGRLARCVRCPVAYH 1592
>gi|126332220|ref|XP_001374612.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
[Monodelphis domestica]
Length = 1366
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 63/157 (40%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G+LL CEG C +FH + LGL++ F C C H
Sbjct: 670 VCQLCEKSGNLLLCEGPCYGAFHLSC----------LGLSR---RPEGKFICSECTSGIH 716
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE EV CV + CG FYH CV K L ++
Sbjct: 717 SCFVC-------KEKKTEVKRCVVSQCGKFYHEACVKKYHL------------TVFESRG 757
Query: 152 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 758 FRCPLHSCVSCHVTNPSNPRSSKGKMMRCVRCPVAYH 794
>gi|118572948|sp|Q6P2L6.2|NSD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
Full=Nuclear SET domain-containing protein 3; AltName:
Full=Wolf-Hirschhorn syndrome candidate 1-like protein 1
homolog; Short=WHSC1-like protein 1
Length = 1439
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 68/159 (42%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C+ GD L+ CEG C R FH LGLT +F C+ CE
Sbjct: 701 DTVCQVCEKAGDCLVACEGECCRHFHVEC----------LGLT---AVPEGHFTCEECET 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C + CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVSVCGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|124486903|ref|NP_001074738.1| histone-lysine N-methyltransferase NSD3 isoform 2 [Mus musculus]
gi|189442807|gb|AAI67226.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (human) [synthetic
construct]
Length = 1446
Score = 80.9 bits (198), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 68/159 (42%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C+ GD L+ CEG C R FH LGLT +F C+ CE
Sbjct: 711 DTVCQVCEKAGDCLVACEGECCRHFHVEC----------LGLT---AVPEGHFTCEECET 757
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C + CG FYH CV K +I
Sbjct: 758 GQHPCFSC-------KVSGKDVKRCSVSVCGKFYHEACVRK------------FPTAIFE 798
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 799 SKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 837
>gi|119583723|gb|EAW63319.1| Wolf-Hirschhorn syndrome candidate 1-like 1, isoform CRA_a [Homo
sapiens]
Length = 992
Score = 80.9 bits (198), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|157821603|ref|NP_001099560.1| histone-lysine N-methyltransferase NSD3 [Rattus norvegicus]
gi|149057818|gb|EDM09061.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (predicted) [Rattus
norvegicus]
Length = 1396
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/159 (33%), Positives = 67/159 (42%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C+N GD L+ CEG C R FH C L + F C+ CE
Sbjct: 702 DTVCQVCENTGDSLVACEGECCRHFHP--------ECLGLAAVPEG-----RFTCEECET 748
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C + CG FYH CV K +I
Sbjct: 749 GQHPCFSC-------KVSGKDVKRCSVSVCGKFYHEACVRK------------FPTAIFE 789
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 790 SKGFRCPQHCCSSCSMEKDIHKASKGRMMKCLRCPIAYH 828
>gi|148700883|gb|EDL32830.1| mCG14519 [Mus musculus]
Length = 1381
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 68/159 (42%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C+ GD L+ CEG C R FH LGLT +F C+ CE
Sbjct: 686 DTVCQVCEKAGDCLVACEGECCRHFHVEC----------LGLT---AVPEGHFTCEECET 732
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C + CG FYH CV K +I
Sbjct: 733 GQHPCFSC-------KVSGKDVKRCSVSVCGKFYHEACVRK------------FPTAIFE 773
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 774 SKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 812
>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
Full=H4-K20-HMTase; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
Length = 2588
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 66/161 (40%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGL + F C C
Sbjct: 1439 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLPE---MPRGKFICNECH 1485
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K ++
Sbjct: 1486 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVT 1526
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F CPLH C C N + S + C RCP AYH
Sbjct: 1527 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1567
>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
musculus]
Length = 2588
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 66/161 (40%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGL + F C C
Sbjct: 1439 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLPE---MPRGKFICNECH 1485
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K ++
Sbjct: 1486 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVT 1526
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F CPLH C C N + S + C RCP AYH
Sbjct: 1527 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1567
>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Mus musculus]
Length = 2691
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 66/161 (40%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGL + F C C
Sbjct: 1542 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLPE---MPRGKFICNECH 1588
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K ++
Sbjct: 1589 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVT 1629
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F CPLH C C N + S + C RCP AYH
Sbjct: 1630 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1670
>gi|348535504|ref|XP_003455240.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Oreochromis niloticus]
Length = 2122
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 60/194 (30%), Positives = 75/194 (38%), Gaps = 47/194 (24%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L +SVC C+ GDLL CEG C +FH +GL+ F C+ C
Sbjct: 1245 LKESVCQVCERTGDLLVCEGHCYGAFHLQC----------IGLS---APPKGKFLCRECN 1291
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K G K C+ CG FYH C+ +A S
Sbjct: 1292 TGVHACFVCKKSGDGVKR-------CIIPLCGKFYHTDCI--------------MAYSAT 1330
Query: 148 A--GESFTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEG 203
+ F CPLH C C N S + A C RCP AYH A ++ + G
Sbjct: 1331 QPHNKGFRCPLHVCLSCHITNPLNICSSKGRLARCVRCPVAYH--------ANDNCMAAG 1382
Query: 204 IITRAWEGLL-PNH 216
+ A L PNH
Sbjct: 1383 SLVLANNSFLCPNH 1396
>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
musculus]
Length = 2382
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 66/161 (40%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGL + F C C
Sbjct: 1233 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLPE---MPRGKFICNECH 1279
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K ++
Sbjct: 1280 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVT 1320
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F CPLH C C N + S + C RCP AYH
Sbjct: 1321 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1361
>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
Length = 2695
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1540 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1586
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1587 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1627
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1628 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1668
>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
Length = 2695
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1540 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1586
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1587 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1627
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1628 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1668
>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Macaca mulatta]
Length = 2695
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1540 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1586
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1587 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1627
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1628 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1668
>gi|354483938|ref|XP_003504149.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
1 [Cricetulus griseus]
Length = 1365
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ EV CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KESKMEVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHATNPSNPRPSKGKMMRCVRCPVAYH 793
>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform b [Homo sapiens]
gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific; AltName: Full=Androgen receptor
coactivator 267 kDa protein; AltName: Full=Androgen
receptor-associated protein of 267 kDa; AltName:
Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
receptor-binding SET domain-containing protein 1;
Short=NR-binding SET domain-containing protein
gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
sapiens]
Length = 2696
Score = 80.1 bits (196), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1541 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1587
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1588 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1628
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1629 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1669
>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 80.1 bits (196), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1542 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1588
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1589 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1629
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1630 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1670
>gi|344244292|gb|EGW00396.1| putative histone-lysine N-methyltransferase NSD2 [Cricetulus
griseus]
Length = 1344
Score = 80.1 bits (196), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 648 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 694
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ EV CV CG FYH CV K L ++
Sbjct: 695 SCFVC-------KESKMEVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 735
Query: 152 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 736 FRCPLHSCVSCHATNPSNPRPSKGKMMRCVRCPVAYH 772
>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 8 [Pan troglodytes]
gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Pan paniscus]
gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2697
Score = 80.1 bits (196), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1542 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1588
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1589 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1629
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1630 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1670
>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
Length = 2596
Score = 80.1 bits (196), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1438 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1484
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1485 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1525
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1526 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1566
>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Macaca mulatta]
Length = 2426
Score = 80.1 bits (196), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1271 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1317
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1318 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1358
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1359 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1399
>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 1 [Pongo abelii]
Length = 2697
Score = 79.7 bits (195), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1542 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1588
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1589 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1629
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1630 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1670
>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Papio anubis]
Length = 2343
Score = 79.7 bits (195), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1185 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1231
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1232 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1272
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1273 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1313
>gi|297295821|ref|XP_001094467.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Macaca mulatta]
Length = 2329
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1185 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1231
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1232 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1272
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1273 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1313
>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 2 [Pongo abelii]
Length = 2594
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1439 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1485
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1486 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1526
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1527 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1567
>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific isoform a [Homo sapiens]
gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
sapiens]
Length = 2427
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1272 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1318
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1319 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1359
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1360 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1400
>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Otolemur garnettii]
Length = 2410
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1257 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1303
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K ++I
Sbjct: 1304 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVI 1344
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1345 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1385
>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
sapiens]
Length = 2593
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1438 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1484
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1485 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1525
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1526 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1566
>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
Length = 2427
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1272 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1318
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1319 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1359
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1360 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1400
>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Nomascus
leucogenys]
Length = 2697
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1542 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPKGKFICNECR 1588
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1589 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1629
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1630 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1670
>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Saimiri boliviensis boliviensis]
Length = 2697
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1542 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECH 1588
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1589 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1629
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1630 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1670
>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1273 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1319
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1320 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1360
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1361 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1401
>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
Length = 2428
Score = 79.7 bits (195), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1273 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1319
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1320 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1360
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1361 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1401
>gi|296222019|ref|XP_002757007.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Callithrix jacchus]
Length = 1427
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C R FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIACEGECCRHFHL--------ECLGLTSLPDS-----RFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Sus scrofa]
Length = 2392
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1236 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1282
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1283 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1323
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1324 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1364
>gi|395857586|ref|XP_003801172.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Otolemur garnettii]
Length = 1371
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH + LGL++ F C C H
Sbjct: 669 VCQLCEKTGSLLLCEGPCCGAFHLSC----------LGLSR---RPEGRFTCSECTSGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHETCVRKFPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPANPRPSKGKMMRCVRCPVAYH 793
>gi|296222021|ref|XP_002757008.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Callithrix jacchus]
Length = 1438
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C R FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIACEGECCRHFHL--------ECLGLTSLPDS-----RFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific [Callithrix
jacchus]
Length = 2692
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1541 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECH 1587
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1588 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1628
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1629 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1669
>gi|295424164|ref|NP_001074571.2| histone-lysine N-methyltransferase NSD2 isoform 1 [Mus musculus]
Length = 1366
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 670 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 716
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ EV CV CG FYH CV K L ++
Sbjct: 717 SCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFESRG 757
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 758 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 794
>gi|449534225|ref|XP_004174066.1| PREDICTED: uncharacterized protein LOC101231072, partial [Cucumis
sativus]
Length = 158
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 60/176 (34%), Positives = 97/176 (55%), Gaps = 25/176 (14%)
Query: 283 QGKFSVKALKRVPSKAGQGETM-EISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQV 341
QGK + K K + G+ + +++ + L S+S + K ++SR S N K SV +
Sbjct: 6 QGKSAAKVSKSFERSSSNGKLLGKVTAKSLWSSES-KNVKLGNISRNSL--NQKGESVLM 62
Query: 342 DRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPS 401
D ++ KK+SL + +A T+ + +K K+D S +
Sbjct: 63 DIDKTIKVKKSSLVGK--SAIPTKRFDPSKIYKEDRSG-------------------MLL 101
Query: 402 LDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSV 457
LD +S+RRL +MK+ ASS+ +E+++K+HK+PSTHAY+ K VDK I +GK+EGSV
Sbjct: 102 LDANSERRLMDMMKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSV 157
>gi|296222023|ref|XP_002757009.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3
[Callithrix jacchus]
Length = 1389
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C R FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIACEGECCRHFHL--------ECLGLTSLPDS-----RFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|295424166|ref|NP_780440.2| histone-lysine N-methyltransferase NSD2 isoform 2 [Mus musculus]
gi|118572947|sp|Q8BVE8.2|NSD2_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
Full=Multiple myeloma SET domain-containing protein;
Short=MMSET; AltName: Full=Nuclear SET domain-containing
protein 2; Short=NSD2; AltName: Full=Wolf-Hirschhorn
syndrome candidate 1 protein homolog; Short=WHSC1
Length = 1365
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ EV CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>gi|162318272|gb|AAI56161.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
gi|162318442|gb|AAI56968.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
Length = 1346
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 650 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 696
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ EV CV CG FYH CV K L ++
Sbjct: 697 SCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFESRG 737
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 738 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 774
>gi|403294386|ref|XP_003938171.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Saimiri
boliviensis boliviensis]
Length = 1438
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C R FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIACEGECCRHFHL--------ECLGLTSLPDS-----RFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|224080887|ref|XP_002197925.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Taeniopygia
guttata]
Length = 1435
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 67/159 (42%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ G+ LL CEG C FH C L +E F C C+
Sbjct: 698 DTVCQICESSGESLLACEGECCSMFHL--------ECLGLKAMPEE-----KFICTECKN 744
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+H CF+C K G +V C +TCG FYH CV K A ++
Sbjct: 745 GEHTCFSC-------KLPGKDVKRCSVSTCGKFYHEACVRK------------FATALFE 785
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
F CP H C C ++ S + A C RCP AYH
Sbjct: 786 SRGFRCPQHCCTACSMDKDMHKASKGRMARCLRCPVAYH 824
>gi|170284946|gb|AAI61082.1| whsc1 protein [Xenopus (Silurana) tropicalis]
Length = 1028
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 63/157 (40%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ GDL+ CEG C +FH + +GL+ + CK C
Sbjct: 613 VCQVCEKVGDLMLCEGVCCSAFHLSC----------IGLS---TRPAGKYLCKECTSGAR 659
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V C+ CG FYH C+ K L ++
Sbjct: 660 SCFLC-------KESNRDVKRCIVPHCGKFYHESCLRKYPL------------AVFESRG 700
Query: 152 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 186
F CPLH+C C N S + C RCP AYH
Sbjct: 701 FRCPLHRCATCYFSNPSNPRASKGKMVRCVRCPLAYH 737
>gi|325188110|emb|CCA22651.1| chromodomainhelicaseDNAbinding protein putative [Albugo laibachii
Nc14]
Length = 1883
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 71/160 (44%), Gaps = 27/160 (16%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
+C+ C +GG +L C+G C RSFH +G+T + + ++ C +C +H
Sbjct: 1595 LCTLCGDGGLILLCDGPCHRSFHL----------ECIGMTHEPQDE--HWLCPDCNAGKH 1642
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
C C K+G E G V C A CG FYH C+ D V + +
Sbjct: 1643 MCLLCKKVGEMGVEFG--VLQCSMARCGRFYHRGCLEV----DRHV-------EWVGKKR 1689
Query: 152 FTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 191
F CP H C CK+ + S + C C +A+H C+P
Sbjct: 1690 FRCPSHFCHSCKERKRPRQSTI--VSCLHCARAFHADCIP 1727
>gi|326432726|gb|EGD78296.1| hypothetical protein PTSG_09362 [Salpingoeca sp. ATCC 50818]
Length = 1279
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/197 (29%), Positives = 72/197 (36%), Gaps = 51/197 (25%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D C C G+LLCCEG CL S+H C L F C C
Sbjct: 677 DDYCGICGEAGNLLCCEGGCLSSYHL--------FCVGLSCAPQGA-----FVCDACTTG 723
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
H CFAC + G + G + C CG YH C+S ++ +
Sbjct: 724 NHLCFACEQPGGLE---GLQT--CSVRNCGKKYHRACIS------------NNPRAALKD 766
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAW 209
SF CPLHKC C + S C RCP AYH C+P
Sbjct: 767 NSFKCPLHKCANCTYPQA---STYPLVRCIRCPIAYHTCCVP------------------ 805
Query: 210 EGLLPNHRILIYCLKHE 226
G L + I + C KH+
Sbjct: 806 AGCLHENAIYLLCPKHQ 822
>gi|355744804|gb|EHH49429.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca
fascicularis]
Length = 1365
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>gi|242047476|ref|XP_002461484.1| hypothetical protein SORBIDRAFT_02g003420 [Sorghum bicolor]
gi|241924861|gb|EER98005.1| hypothetical protein SORBIDRAFT_02g003420 [Sorghum bicolor]
Length = 631
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 50/81 (61%), Gaps = 4/81 (4%)
Query: 166 ENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKH 225
ENK LQ AVCRRCP+AYH +CLPR+++ K ++G RAW+ + I YCL H
Sbjct: 2 ENKTQRALQLAVCRRCPRAYHWECLPRELSLGAKDKDG-NPRAWK---LSKTIFFYCLDH 57
Query: 226 EIDDEIGTPIRDHIIFPGIEE 246
EID + T R+HI FP E
Sbjct: 58 EIDKDTRTASRNHIKFPATPE 78
>gi|390461100|ref|XP_003732597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
3 [Callithrix jacchus]
Length = 1389
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 61/157 (38%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 667 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 713
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE +V CV CG FYH CV K L ++
Sbjct: 714 SCFVC-------KENKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 754
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 755 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 791
>gi|383421363|gb|AFH33895.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
gi|384949270|gb|AFI38240.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
gi|387540940|gb|AFJ71097.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
mulatta]
Length = 1365
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>gi|355557406|gb|EHH14186.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca mulatta]
Length = 1365
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>gi|344281620|ref|XP_003412576.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Loxodonta africana]
Length = 1438
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 68/160 (42%), Gaps = 36/160 (22%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DAVCQICESTGDSLIPCEGECCKHFHL--------ECLGLASLPDG-----KFICMECQT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C TCG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGTDVKRCSVGTCGKFYHETCVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN--KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ KA S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIYKA-SKGRMMRCLRCPVAYH 827
>gi|390461098|ref|XP_003732596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
2 [Callithrix jacchus]
Length = 1400
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 61/157 (38%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 667 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 713
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE +V CV CG FYH CV K L ++
Sbjct: 714 SCFVC-------KENKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 754
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 755 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 791
>gi|426343599|ref|XP_004038381.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
1 [Gorilla gorilla gorilla]
Length = 1365
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>gi|297672976|ref|XP_002814554.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Pongo abelii]
Length = 1365
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>gi|19913348|ref|NP_579877.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|19913350|ref|NP_579878.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|19913358|ref|NP_579890.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|109633019|ref|NP_001035889.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
gi|74706096|sp|O96028.1|NSD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
Full=Multiple myeloma SET domain-containing protein;
Short=MMSET; AltName: Full=Nuclear SET domain-containing
protein 2; Short=NSD2; AltName: Full=Protein
trithorax-5; AltName: Full=Wolf-Hirschhorn syndrome
candidate 1 protein; Short=WHSC1
gi|3249713|gb|AAC24150.1| MMSET type II [Homo sapiens]
gi|4378019|gb|AAD19343.1| putative WHSC1 protein [Homo sapiens]
gi|4521954|gb|AAD21770.1| putative WHSC1 protein [Homo sapiens]
gi|4521955|gb|AAD21771.1| putative WHSC1 protein [Homo sapiens]
gi|5123789|emb|CAB45386.1| TRX5 protein [Homo sapiens]
gi|6683809|gb|AAF23370.1| MMSET type II [Homo sapiens]
gi|119602958|gb|EAW82552.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|119602959|gb|EAW82553.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|119602962|gb|EAW82556.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
gi|168273154|dbj|BAG10416.1| histone-lysine N-methyltransferase NSD2 [synthetic construct]
gi|187252511|gb|AAI66668.1| Wolf-Hirschhorn syndrome candidate 1 [synthetic construct]
Length = 1365
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>gi|148705490|gb|EDL37437.1| mCG16344 [Mus musculus]
Length = 1298
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 602 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 648
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ EV CV CG FYH CV K L ++
Sbjct: 649 SCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFESRG 689
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 690 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 726
>gi|114592860|ref|XP_001146084.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
6 [Pan troglodytes]
gi|114592864|ref|XP_001146248.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
7 [Pan troglodytes]
gi|114592866|ref|XP_001146323.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
8 [Pan troglodytes]
gi|114592870|ref|XP_001146473.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
10 [Pan troglodytes]
gi|397483594|ref|XP_003812984.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Pan
paniscus]
gi|410227780|gb|JAA11109.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410259494|gb|JAA17713.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410299310|gb|JAA28255.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
gi|410334709|gb|JAA36301.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
Length = 1365
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>gi|301762334|ref|XP_002916587.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2-like [Ailuropoda melanoleuca]
Length = 1364
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 63/157 (40%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G L+ CEG C +FH A LGL++ F C C H
Sbjct: 668 VCQLCEKTGSLVLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 714
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V CV + CG FYH CV K L ++
Sbjct: 715 SCFVC-------KESKTDVKRCVVSQCGKFYHEACVRKYPL------------TVFESRG 755
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 756 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 792
>gi|440907576|gb|ELR57709.1| Histone-lysine N-methyltransferase NSD3 [Bos grunniens mutus]
Length = 1446
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 67/159 (42%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 702 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLSSPPDG-----KFVCVECKT 748
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +GA+V C + CG FYH CV K +I
Sbjct: 749 GQHPCFSC-------KVSGADVKRCSVSACGKFYHEACVRK------------FPTAIFE 789
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 790 SKGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPVAYH 828
>gi|296197020|ref|XP_002746091.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
1 [Callithrix jacchus]
Length = 1365
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 61/157 (38%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE +V CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KENKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>gi|403287002|ref|XP_003934751.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
[Saimiri boliviensis boliviensis]
Length = 1368
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 61/157 (38%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECTSGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE +V CV CG FYH CV K L ++
Sbjct: 716 SCFVC-------KENKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>gi|426256406|ref|XP_004021831.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Ovis
aries]
Length = 1439
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 67/159 (42%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 702 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLSSPPDG-----KFVCVECKT 748
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C +TCG FYH CV K +I
Sbjct: 749 GQHPCFSC-------KVSGMDVKRCSVSTCGKFYHEACVRK------------FPTAIFE 789
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 790 SKGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPVAYH 828
>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
lysine-20 specific [Desmodus rotundus]
Length = 2699
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1544 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1590
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1591 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1631
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1632 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1672
>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Heterocephalus glaber]
Length = 2698
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1541 LKENVCQNCEKVGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1587
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1588 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1628
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1629 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1669
>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Cavia
porcellus]
Length = 2509
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 77/189 (40%), Gaps = 41/189 (21%)
Query: 7 DDMEEDMVHDTEEDESNEED-------ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDA 59
+++EE + HD S + L ++VC C+ G+LL CE +C +FH
Sbjct: 1326 ENIEEGIEHDASMSSSKKMQGERGGGAALKENVCQNCEKVGELLLCEAQCCGAFHLEC-- 1383
Query: 60 GEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCG 119
LGLT+ F C C H CF C K++G +V C+ CG
Sbjct: 1384 --------LGLTE---MPRGKFICNECRTGIHTCFVC-------KQSGEDVKRCLLPLCG 1425
Query: 120 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGE--NKADSDLQFAV 177
FYH CV K +++ + F C LH C C + + S +
Sbjct: 1426 KFYHEECVQK------------YPPTVMQNKGFRCSLHICITCHAANPASVSASKGRLMR 1473
Query: 178 CRRCPKAYH 186
C RCP AYH
Sbjct: 1474 CVRCPVAYH 1482
>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Ailuropoda melanoleuca]
gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
Length = 2699
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1545 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1591
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1592 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1632
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1633 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1673
>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Ovis aries]
Length = 2698
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1544 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1590
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1591 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1631
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1632 QNKGFRCSLHICTTC-HAANPASVSASKGRLMRCVRCPVAYH 1672
>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Bos grunniens mutus]
Length = 2698
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1544 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1590
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1591 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1631
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1632 QNKGFRCSLHICTTC-HAANPASVSASKGRLMRCVRCPVAYH 1672
>gi|345798392|ref|XP_536224.3| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Canis lupus familiaris]
Length = 1364
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 63/157 (40%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G L+ CEG C +FH A LGL++ F C C H
Sbjct: 668 VCQLCEKTGSLVLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 714
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V CV + CG FYH CV K L ++
Sbjct: 715 SCFVC-------KESKTDVKRCVVSQCGKFYHEACVRKYPL------------AVFESRG 755
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 756 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 792
>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific isoform 5 [Canis lupus familiaris]
Length = 2698
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1542 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1588
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1589 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1629
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1630 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1670
>gi|327284319|ref|XP_003226886.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Anolis
carolinensis]
Length = 1438
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 53/164 (32%), Positives = 66/164 (40%), Gaps = 44/164 (26%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
DS+C C++ G+ LL CEG C R FH LGL+ + F C C+
Sbjct: 700 DSICQICESSGESLLSCEGECYRVFHTEC----------LGLSS---QPEGKFICIECKN 746
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K G +V C CG FYH CV K A ++
Sbjct: 747 GQHTCFSC-------KLPGKDVKRCSVNACGKFYHEACVRK------------FATTVFE 787
Query: 149 GESFTCPLHKCCICKQGENKADSDL------QFAVCRRCPKAYH 186
F CP H C C D D+ + C RCP AYH
Sbjct: 788 SRGFRCPQHCCTSC-----SVDKDIHKASKGRMVRCLRCPIAYH 826
>gi|114619713|ref|XP_519713.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 6 [Pan
troglodytes]
gi|410226716|gb|JAA10577.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
gi|410266866|gb|JAA21399.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
gi|410353855|gb|JAA43531.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
Length = 1437
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIACEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|149756942|ref|XP_001488967.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
1 [Equus caballus]
Length = 1365
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G L+ CEG C +FH A LGL++ F C C H
Sbjct: 669 VCQLCEKTGSLVLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE +V CV + CG FYH CV K L ++
Sbjct: 716 TCFVC-------KENKTDVKRCVVSQCGKFYHEACVRKYPL------------TVFESRG 756
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793
>gi|397521373|ref|XP_003830771.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Pan
paniscus]
Length = 1437
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIACEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Felis catus]
Length = 2432
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1276 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1322
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1323 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1363
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1364 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1404
>gi|397521375|ref|XP_003830772.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Pan
paniscus]
Length = 1426
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIACEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|332825882|ref|XP_003311721.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Pan
troglodytes]
Length = 1426
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIACEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|327265653|ref|XP_003217622.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Anolis carolinensis]
Length = 2106
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 65/161 (40%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGL++ F C C
Sbjct: 972 LKENVCQICEKPGELLLCEAQCCGAFHLEC----------LGLSE---MPKGKFICTECS 1018
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K +G V C+ CG +YH C+ K ++I
Sbjct: 1019 TGVHTCFVC-------KNSGENVKRCLLPLCGKYYHEECIQK------------YPPTVI 1059
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1060 QNKGFRCSLHICMTCHAANPTNVSASKGRLMRCVRCPVAYH 1100
>gi|92098122|gb|AAI15007.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Homo sapiens]
Length = 1437
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Sarcophilus harrisii]
Length = 2717
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH +GLT+ F CK C
Sbjct: 1544 LKENVCQNCEKVGELLLCEAQCCGAFHLEC----------IGLTE---MPKGKFICKECR 1590
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K +G +V C+ CG FYH C+ K +++
Sbjct: 1591 TGIHTCFVC-------KTSGEDVKRCLLPLCGKFYHEACIQK------------YPPTVL 1631
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1632 QNKGFRCSLHMCITC-HAANPASLSASKGRLMRCVRCPVAYH 1672
>gi|114619717|ref|XP_001170799.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 4 [Pan
troglodytes]
Length = 1388
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIACEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|358419376|ref|XP_003584219.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Bos
taurus]
Length = 1439
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 67/159 (42%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 702 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLSSPPDG-----KFVCVECKT 748
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +GA+V C + CG FYH CV K ++
Sbjct: 749 GQHPCFSC-------KVSGADVKRCSVSACGKFYHEACVRK------------FPTAVFE 789
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 790 SKGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPVAYH 828
>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific, partial [Bos taurus]
Length = 2389
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1235 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1281
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1282 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1322
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1323 QNKGFRCSLHICTTC-HAANPASVSASKGRLMRCVRCPVAYH 1363
>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
[Oryctolagus cuniculus]
Length = 2431
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1275 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1321
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1322 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1362
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1363 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1403
>gi|397521377|ref|XP_003830773.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3 [Pan
paniscus]
Length = 1388
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIACEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|13699811|ref|NP_075447.1| histone-lysine N-methyltransferase NSD3 isoform long [Homo sapiens]
gi|74761342|sp|Q9BZ95.1|NSD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
Full=Nuclear SET domain-containing protein 3; AltName:
Full=Protein whistle; AltName: Full=WHSC1-like 1 isoform
9 with methyltransferase activity to lysine; AltName:
Full=Wolf-Hirschhorn syndrome candidate 1-like protein
1; Short=WHSC1-like protein 1
gi|12642817|gb|AAK00355.1| putative protein WHSC1L1l [Homo sapiens]
gi|75517229|gb|AAI01718.1| WHSC1L1 protein, isoform long [Homo sapiens]
gi|109731704|gb|AAI13470.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Homo sapiens]
gi|119583726|gb|EAW63322.1| Wolf-Hirschhorn syndrome candidate 1-like 1, isoform CRA_c [Homo
sapiens]
gi|168277388|dbj|BAG10672.1| histone-lysine N-methyltransferase NSD3 [synthetic construct]
Length = 1437
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|219841942|gb|AAI43511.1| WHSC1L1 protein [Homo sapiens]
Length = 1426
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|12697312|emb|CAC28350.1| putative chromatin modulator [Homo sapiens]
Length = 1437
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|344265319|ref|XP_003404732.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Loxodonta africana]
Length = 2702
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1545 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1591
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1592 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1632
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1633 QNKGFRCSLHICITC-HAANPASVTASKGRLMRCVRCPVAYH 1673
>gi|375294191|ref|NP_001069063.2| histone-lysine N-methyltransferase NSD3 [Bos taurus]
gi|359080579|ref|XP_003588019.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Bos
taurus]
Length = 1439
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 67/159 (42%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 702 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLSSPPDG-----KFVCVECKT 748
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +GA+V C + CG FYH CV K ++
Sbjct: 749 GQHPCFSC-------KVSGADVKRCSVSACGKFYHEACVRK------------FPTAVFE 789
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 790 SKGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPVAYH 828
>gi|384247005|gb|EIE20493.1| hypothetical protein COCSUDRAFT_57640 [Coccomyxa subellipsoidea
C-169]
Length = 2740
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 68/152 (44%), Gaps = 31/152 (20%)
Query: 29 FDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
SVCS C L+ C G CLR++H S C LGL + +FC C+
Sbjct: 1862 LSSVCSAC-----LVVCGGPCLRAYH--------SGC--LGLPVLPAPGSV-WFCPECDS 1905
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+ +CF+CG+ G+ ++ + C CG FYH CV +L+ Q+AK
Sbjct: 1906 GRMRCFSCGEFGAGFEDP--TIRKCSLGVCGRFYHVRCVDRLV-------NTQMAKR--- 1953
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAVCRR 180
G F CP H C +C + + D VC R
Sbjct: 1954 GAHFRCPQHYCSVCAKSGDGVD---MVVVCAR 1982
>gi|300796853|ref|NP_001178481.1| probable histone-lysine N-methyltransferase NSD2 [Rattus
norvegicus]
Length = 1346
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 650 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSQ---RPEGRFTCTECASGIH 696
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ EV C+ CG FYH CV K L ++
Sbjct: 697 SCFVC-------KESKMEVKRCMVNQCGKFYHEACVKKYPL------------TVFESRG 737
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 738 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 774
>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
[Oryctolagus cuniculus]
Length = 2700
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 1544 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1590
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1591 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1631
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1632 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1672
>gi|76666643|ref|XP_613048.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
1 [Bos taurus]
gi|297476142|ref|XP_002688498.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Bos
taurus]
gi|296486298|tpg|DAA28411.1| TPA: Wolf-Hirschhorn syndrome candidate 1 [Bos taurus]
Length = 1365
Score = 77.4 bits (189), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 66/159 (41%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
+ VC C+ G L+ CEG C +FH A LGL++ +L C C
Sbjct: 667 EHVCQLCEEPGSLVLCEGPCCGAFHL----------ACLGLSRRPEGRLL---CGECTSG 713
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
H CF C KE+ ++V CV + CG FYH CV + L ++
Sbjct: 714 IHSCFVC-------KESKSDVKRCVVSQCGKFYHEACVRRFPL------------TVFES 754
Query: 150 ESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 755 RGFRCPLHSCLSCHASNPSNPRPSKGKMLRCVRCPVAYH 793
>gi|12697314|emb|CAC28351.1| Putative Chromatin modulator [Homo sapiens]
Length = 1388
Score = 77.4 bits (189), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|350587283|ref|XP_003128857.3| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Sus
scrofa]
Length = 1338
Score = 77.4 bits (189), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 53/159 (33%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
+ VC C+ G L+ CEG C +FH A LGL + E L C C
Sbjct: 640 EHVCQLCEKPGSLVLCEGPCCGAFHL----------ACLGLAR-RPEGRLT--CLECTSG 686
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
H CF C KE+ A+V CV + CG FYH CV + L ++
Sbjct: 687 VHSCFVC-------KESKADVRRCVVSQCGKFYHEACVRRFPL------------TVFES 727
Query: 150 ESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 728 RGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 766
>gi|292621054|ref|XP_683890.4| PREDICTED: hypothetical protein LOC556086 [Danio rerio]
Length = 2055
Score = 77.4 bits (189), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 66/161 (40%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CEG+C +FH +GLT+ F C+ C+
Sbjct: 1224 LKENVCQVCEKTGELLLCEGQCCGAFHLQC----------IGLTET---PKGRFICQECK 1270
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K+ EV C+ CG FYH C+ K + ++
Sbjct: 1271 MGVHTCFVC-------KKPDKEVRRCMIPVCGKFYHMDCILK------------YSPTVA 1311
Query: 148 AGESFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYH 186
F C +H C C N S + C RCP AYH
Sbjct: 1312 QNRGFRCSIHVCLSCYITNPNNPGISKGRLTRCVRCPVAYH 1352
>gi|363739108|ref|XP_414538.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Gallus gallus]
Length = 2412
Score = 77.4 bits (189), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 64/161 (39%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGL++ F C C
Sbjct: 1256 LKENVCQICEKPGELLLCEAQCCGAFHLQC----------LGLSEMPTG---KFICNECS 1302
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K G +V C+ CG +YH C+ K ++
Sbjct: 1303 TGVHTCFVC-------KNCGQDVKRCLLPLCGKYYHEECIQK------------YPPTVT 1343
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1344 QNKGFRCSLHICMTCHAANPTNISASKGRLMRCVRCPVAYH 1384
>gi|351699801|gb|EHB02720.1| Histone-lysine N-methyltransferase NSD3 [Heterocephalus glaber]
Length = 1439
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 53/160 (33%), Positives = 67/160 (41%), Gaps = 36/160 (22%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C+ GD L+ CEG C + FH C L D F C+ C+
Sbjct: 701 DTVCQICERSGDSLVPCEGECCKHFHL--------ECLGLTSVPDG-----KFMCRECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K G +V C CG FYH CV K + +I
Sbjct: 748 GQHPCFSC-------KVPGKDVKRCSVGACGKFYHEACVRK------------FSTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN--KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ KA S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIYKA-SKGRMMRCLRCPVAYH 827
>gi|118101386|ref|XP_001232891.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Gallus gallus]
Length = 1436
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 67/164 (40%), Gaps = 44/164 (26%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ G+ L+ CEG C +FH C L +E FFC C+
Sbjct: 699 DTVCQICESSGESLVSCEGECCSTFHM--------ECLGLKAMPEE-----KFFCTECKN 745
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+H CF+C K G +V C + CG FYH CV K A ++
Sbjct: 746 GEHTCFSC-------KLPGKDVKRCSVSACGKFYHEACVRK------------FATALFE 786
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAV------CRRCPKAYH 186
F CP H C C D D+ A C RCP AYH
Sbjct: 787 SRGFRCPQHCCSAC-----SVDKDIHKASKGRMVRCFRCPIAYH 825
>gi|414884836|tpg|DAA60850.1| TPA: putative EDM2-like family protein [Zea mays]
Length = 103
Score = 77.0 bits (188), Expect = 6e-11, Method: Composition-based stats.
Identities = 38/84 (45%), Positives = 60/84 (71%), Gaps = 3/84 (3%)
Query: 409 RLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLD 468
R+ +++ + SS+ ++++ + +PSTH Y+ + DK I LGK+E SV+A+ ALKKL+
Sbjct: 18 RVITILGNKVSSLTIKDVTRNCSLPSTHVYSGR-LTDK-IPLGKIERSVQAVEAALKKLE 75
Query: 469 VDGSSIEDAKAVCEPEVLSQIFKW 492
+G S+ DAKAVCEPEVL Q+ +W
Sbjct: 76 -NGGSVNDAKAVCEPEVLRQLARW 98
>gi|326928449|ref|XP_003210391.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like, partial [Meleagris gallopavo]
Length = 2336
Score = 77.0 bits (188), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 64/161 (39%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGL++ F C C
Sbjct: 1244 LKENVCQICEKPGELLLCEAQCCGAFHLQC----------LGLSEMPTG---KFICNECS 1290
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K G +V C+ CG +YH C+ K ++
Sbjct: 1291 TGVHTCFVC-------KSCGQDVKRCLLPLCGKYYHEECIQK------------YPPTVT 1331
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 1332 QNKGFRCSLHICMTCHAANPTNISASKGRLMRCVRCPVAYH 1372
>gi|395507428|ref|XP_003758026.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Sarcophilus harrisii]
Length = 1437
Score = 77.0 bits (188), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 70/161 (43%), Gaps = 38/161 (23%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN--FFCKNC 86
D+VC C++ GD L+ CEG C + FH +D LGL+ ++ N F C C
Sbjct: 701 DTVCQICESYGDSLVVCEGECCKHFH--LDC--------LGLS-----SLPNGRFLCTEC 745
Query: 87 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 146
+ QH CF+C K G +V C CG FYH CV K A ++
Sbjct: 746 KTGQHTCFSC-------KVPGKDVKRCSVTACGKFYHEACVRK------------FATAV 786
Query: 147 IAGESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
F CP H C C ++ S + C RCP AYH
Sbjct: 787 FESRGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPIAYH 827
>gi|326932813|ref|XP_003212507.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 1
[Meleagris gallopavo]
Length = 1436
Score = 77.0 bits (188), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 67/164 (40%), Gaps = 44/164 (26%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ G+ L+ CEG C +FH C L +E FFC C+
Sbjct: 699 DTVCQICESSGESLVSCEGECCSTFHM--------ECLGLKAMPEE-----KFFCTECKN 745
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+H CF+C K G +V C + CG FYH CV K A ++
Sbjct: 746 GEHTCFSC-------KLPGKDVKRCSVSACGKFYHEACVRK------------FATALFE 786
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAV------CRRCPKAYH 186
F CP H C C D D+ A C RCP AYH
Sbjct: 787 SRGFRCPQHCCSAC-----SVDKDIHKASKGRMVRCFRCPIAYH 825
>gi|301614673|ref|XP_002936809.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
[Xenopus (Silurana) tropicalis]
Length = 1298
Score = 77.0 bits (188), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 63/157 (40%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ GDL+ CEG C +FH + +GL+ + CK C
Sbjct: 613 VCQVCEKVGDLMLCEGVCCSAFHLSC----------IGLS---TRPAGKYLCKECTSGAR 659
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V C+ CG FYH C+ K L ++
Sbjct: 660 SCFLC-------KESNRDVKRCIVPHCGKFYHESCLRKYPL------------AVFESRG 700
Query: 152 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 186
F CPLH+C C N S + C RCP AYH
Sbjct: 701 FRCPLHRCATCYFSNPSNPRASKGKMVRCVRCPLAYH 737
>gi|291223879|ref|XP_002731935.1| PREDICTED: Wolf-Hirschhorn syndrome candidate 1 protein-like
[Saccoglossus kowalevskii]
Length = 1787
Score = 77.0 bits (188), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 62/159 (38%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
++VC C+ G+LL CEG+C +FH C L V F C C
Sbjct: 922 ENVCQVCERPGELLLCEGQCCGAFHL--------DCIGLQQMPSGV-----FKCDECISG 968
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
H CF C K EV C A CG FYH C+ K L + G
Sbjct: 969 VHSCFICTKC-------DIEVKRCSVALCGKFYHEDCLKKWTL------------TRFDG 1009
Query: 150 ESFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYH 186
+ TCPLH C C N + + C RCP AYH
Sbjct: 1010 KGVTCPLHTCQACAADNPRNPKATKGRLTRCVRCPTAYH 1048
>gi|149047443|gb|EDM00113.1| similar to Wolf-Hirschhorn syndrome candidate 1 protein isoform 3
(predicted) [Rattus norvegicus]
Length = 1298
Score = 77.0 bits (188), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 602 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSQ---RPEGRFTCTECASGIH 648
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ EV C+ CG FYH CV K L ++
Sbjct: 649 SCFVC-------KESKMEVKRCMVNQCGKFYHEACVKKYPL------------TVFESRG 689
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 690 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 726
>gi|118101388|ref|XP_424390.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Gallus gallus]
Length = 1386
Score = 77.0 bits (188), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 67/164 (40%), Gaps = 44/164 (26%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ G+ L+ CEG C +FH C L +E FFC C+
Sbjct: 699 DTVCQICESSGESLVSCEGECCSTFHM--------ECLGLKAMPEE-----KFFCTECKN 745
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+H CF+C K G +V C + CG FYH CV K A ++
Sbjct: 746 GEHTCFSC-------KLPGKDVKRCSVSACGKFYHEACVRK------------FATALFE 786
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAV------CRRCPKAYH 186
F CP H C C D D+ A C RCP AYH
Sbjct: 787 SRGFRCPQHCCSAC-----SVDKDIHKASKGRMVRCFRCPIAYH 825
>gi|431902251|gb|ELK08752.1| Histone-lysine N-methyltransferase NSD3 [Pteropus alecto]
Length = 1322
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 700 DTVCQICESSGDSLIPCEGECYKHFHL--------ECLGLTSLPDG-----KFICMECKT 746
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH C+ K +I
Sbjct: 747 GQHPCFSC-------KVSGTDVKRCSVGACGKFYHEACIRK------------FPTAIFE 787
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 788 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 826
>gi|410307858|gb|JAA32529.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
Length = 1437
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD ++ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSVIACEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|395739588|ref|XP_002819060.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Pongo
abelii]
Length = 1438
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|326932815|ref|XP_003212508.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 2
[Meleagris gallopavo]
Length = 1386
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 67/164 (40%), Gaps = 44/164 (26%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ G+ L+ CEG C +FH C L +E FFC C+
Sbjct: 699 DTVCQICESSGESLVSCEGECCSTFHM--------ECLGLKAMPEE-----KFFCTECKN 745
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+H CF+C K G +V C + CG FYH CV K A ++
Sbjct: 746 GEHTCFSC-------KLPGKDVKRCSVSACGKFYHEACVRK------------FATALFE 786
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAV------CRRCPKAYH 186
F CP H C C D D+ A C RCP AYH
Sbjct: 787 SRGFRCPQHCCSAC-----SVDKDIHKASKGRMVRCFRCPIAYH 825
>gi|380814048|gb|AFE78898.1| histone-lysine N-methyltransferase NSD3 isoform long [Macaca
mulatta]
gi|383419477|gb|AFH32952.1| histone-lysine N-methyltransferase NSD3 isoform long [Macaca
mulatta]
Length = 1438
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLTSLPDS-----KFVCMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|297299251|ref|XP_001091186.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Macaca mulatta]
Length = 1419
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLTSLPDS-----KFVCMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Tupaia chinensis]
Length = 2687
Score = 76.6 bits (187), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 66/162 (40%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGL + F C C
Sbjct: 1533 LKENVCQNCEKVGELLLCEAQCCGAFHLEC----------LGLAE---MPRGKFICNECR 1579
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1580 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1620
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1621 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1661
>gi|281208068|gb|EFA82246.1| PHD zinc finger-containing protein [Polysphondylium pallidum PN500]
Length = 533
Score = 76.6 bits (187), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 58/196 (29%), Positives = 85/196 (43%), Gaps = 36/196 (18%)
Query: 9 MEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASL 68
+ ++ HD D ++ +++C+ C+ +LL C+G CLRSFH ID +
Sbjct: 49 INNNVTHDA--DAKAKKPRKNEAICTVCEKSDNLLMCDGPCLRSFH--IDC--------I 96
Query: 69 GLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVS 128
GL + + C +C Q+ CFAC + G + C CG FYH C+S
Sbjct: 97 GL---DSMPTSRWECSDCSKNQNICFACKERGI----INMNLTKCKVHQCGKFYHNKCIS 149
Query: 129 KLLLRDDEVAADQLAKSI-IAGESFTCPLHKCCIC-KQGENKADSDLQFAVCRRCPKAYH 186
+ +L K + F CPLH C C G+ K Q C RCP AYH
Sbjct: 150 EF----------KLTKVVNTKSPRFNCPLHYCESCGVSGDGK-----QSVHCFRCPTAYH 194
Query: 187 RKCLPRKIAFEDKLEE 202
C+P + K +E
Sbjct: 195 VICMPPVVKMLTKSKE 210
>gi|384252957|gb|EIE26432.1| hypothetical protein COCSUDRAFT_58970 [Coccomyxa subellipsoidea
C-169]
Length = 707
Score = 76.6 bits (187), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 53/164 (32%), Positives = 72/164 (43%), Gaps = 24/164 (14%)
Query: 33 CSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQ 92
C C +GG+LL C+G CLRSFH E D EA ++C +C
Sbjct: 321 CGRCADGGELLECDGTCLRSFHLNCLTPSERPSPD-----DPPEA--PWYCPDCRLGIGA 373
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA---- 148
C C + G G + C +CG+++H C++ LRD + +A
Sbjct: 374 CAICKQTGV----VGQHILKCKMGSCGYYFHNQCLNG--LRDSGLLKIHREDPAVANGGN 427
Query: 149 GES-FTCPLHKCCICKQGENKADSDLQFAV-CRRCPKAYHRKCL 190
GE FTCP H C +C+ D Q V C RCP AYH + L
Sbjct: 428 GERVFTCPAHFCHVCRM-----SGDAQRMVRCWRCPTAYHSRGL 466
>gi|402878017|ref|XP_003902703.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Papio anubis]
Length = 1438
Score = 76.6 bits (187), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLTSLPDS-----KFVCMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|395507430|ref|XP_003758027.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Sarcophilus harrisii]
Length = 1389
Score = 76.6 bits (187), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 70/161 (43%), Gaps = 38/161 (23%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN--FFCKNC 86
D+VC C++ GD L+ CEG C + FH +D LGL+ ++ N F C C
Sbjct: 701 DTVCQICESYGDSLVVCEGECCKHFH--LDC--------LGLS-----SLPNGRFLCTEC 745
Query: 87 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 146
+ QH CF+C K G +V C CG FYH CV K A ++
Sbjct: 746 KTGQHTCFSC-------KVPGKDVKRCSVTACGKFYHEACVRK------------FATAV 786
Query: 147 IAGESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
F CP H C C ++ S + C RCP AYH
Sbjct: 787 FESRGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPIAYH 827
>gi|301117486|ref|XP_002906471.1| chromodomain-helicase-DNA-binding protein, putative [Phytophthora
infestans T30-4]
gi|262107820|gb|EEY65872.1| chromodomain-helicase-DNA-binding protein, putative [Phytophthora
infestans T30-4]
Length = 2158
Score = 76.6 bits (187), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 49/172 (28%), Positives = 70/172 (40%), Gaps = 30/172 (17%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D +C+ C +GG +L C+G C RSFH C + KDE + C +C
Sbjct: 1659 DDLCTLCGDGGLILLCDGPCHRSFHL--------DCVGM---KDEPNDE-QWLCPDCAEG 1706
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
+H C C ++G E G V C A CG FYH C+ A+ +
Sbjct: 1707 RHMCLICKQVGEMGVEFG--VTQCSVAKCGRFYHKGCL-----------AENSRVEWVGK 1753
Query: 150 ESFTCPLHKCCICKQ-----GENKADSDLQFAVCRRCPKAYHRKCLPRKIAF 196
+ F CP H C C + K + + C C +A+H C+P F
Sbjct: 1754 KRFRCPSHFCHACSKQATTNSNKKGEDNTGLVSCIHCSQAFHPDCIPSADKF 1805
>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Equus caballus]
Length = 2700
Score = 76.3 bits (186), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 66/162 (40%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGL + F C C
Sbjct: 1544 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLNE---MPRGKFICNECR 1590
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1591 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1631
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1632 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1672
>gi|16549858|dbj|BAB70868.1| unnamed protein product [Homo sapiens]
Length = 1059
Score = 76.3 bits (186), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 504 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 550
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 551 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 591
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 592 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 632
>gi|395847335|ref|XP_003796334.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Otolemur garnettii]
Length = 1438
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 64/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGDLLC-CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLTPCEGECCKHFHL--------ECLGLTSVPDG-----KFICTECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMLRCLRCPVAYH 827
>gi|348688432|gb|EGZ28246.1| hypothetical protein PHYSODRAFT_470076 [Phytophthora sojae]
Length = 2182
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 70/177 (39%), Gaps = 35/177 (19%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D +C+ C +GG +L C+G C RSFH C + KDE + C +C
Sbjct: 1658 DDLCTLCGDGGLILLCDGPCHRSFHL--------ECVGM---KDEPNDE-QWLCPDCAEG 1705
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
+H C C ++G E G V C A CG FYH C++K +
Sbjct: 1706 RHMCLICKQVGEMGVEFG--VTQCSVAKCGRFYHKGCLAK-----------SSRVEWVGK 1752
Query: 150 ESFTCPLHKCCICKQGENKA----------DSDLQFAVCRRCPKAYHRKCLPRKIAF 196
+ F CP H C C Q K D C C +A+H +C+P F
Sbjct: 1753 KRFRCPSHFCHACSQASKKESAAKSKKKKDDVKTHVVSCIHCSQAFHPECIPSADKF 1809
>gi|395847339|ref|XP_003796336.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3
[Otolemur garnettii]
Length = 1427
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 64/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGDLLC-CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLTPCEGECCKHFHL--------ECLGLTSVPDG-----KFICTECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMLRCLRCPVAYH 827
>gi|426359420|ref|XP_004046973.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Gorilla gorilla
gorilla]
Length = 1397
Score = 75.9 bits (185), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 661 DTVCQICESPGDSLIPCEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 707
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 708 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 748
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 749 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 787
>gi|410923178|ref|XP_003975059.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Takifugu
rubripes]
Length = 1499
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 67/164 (40%), Gaps = 43/164 (26%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C+ G+ L+ CEG C R FH LGLT + F C C+
Sbjct: 749 DTVCQICEAYGESLVVCEGDCNRQFHLEC----------LGLT---AQPEGRFICLECQN 795
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
H CF+C K G EV C + CG +YH CV KL L +
Sbjct: 796 GDHPCFSC-------KTAGQEVTRCSVSGCGCYYHEDCVRKL-----------LGTTSSP 837
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAV------CRRCPKAYH 186
G F CP H C C + DLQ A C RCP AYH
Sbjct: 838 GGGFCCPQHICSTC-----CLERDLQRASKGRLMRCIRCPVAYH 876
>gi|291409090|ref|XP_002720827.1| PREDICTED: WHSC1L1 protein [Oryctolagus cuniculus]
Length = 1435
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 64/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C FH C L D F C C+
Sbjct: 698 DTVCQICESSGDSLIACEGECCTHFHL--------ECLGLPSLPDG-----KFICVECQT 744
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 745 GQHPCFSC-------KMSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 785
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 786 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 824
>gi|395847337|ref|XP_003796335.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Otolemur garnettii]
Length = 1389
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 64/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGDLLC-CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESSGDSLTPCEGECCKHFHL--------ECLGLTSVPDG-----KFICTECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMLRCLRCPVAYH 827
>gi|431892716|gb|ELK03149.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific, partial [Pteropus alecto]
Length = 2202
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 66/162 (40%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGL + F C C
Sbjct: 990 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLNE---MPRGKFICNECR 1036
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 1037 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1077
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 1078 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1118
>gi|449474840|ref|XP_002193971.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Taeniopygia guttata]
Length = 1651
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 65/161 (40%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGL++ F C C
Sbjct: 270 LKENVCQICEKPGELLLCEAQCCGAFHLQC----------LGLSE---MPKGKFICNECS 316
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K G +V C+ CG +YH C+ K +++
Sbjct: 317 TGVHTCFVC-------KSCGEDVKRCLLPLCGKYYHEACIQK------------YPPTVM 357
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C N + S + C RCP AYH
Sbjct: 358 QNKGFRCSLHICMTCHAANPANISASKGRLMRCVRCPVAYH 398
>gi|359067302|ref|XP_002689078.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Bos taurus]
Length = 1470
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 316 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 362
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 363 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 403
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 404 QNKGFRCSLHICTTC-HAANPASVSASKGRLMRCVRCPVAYH 444
>gi|405966105|gb|EKC31425.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
specific [Crassostrea gigas]
Length = 1079
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 74/173 (42%), Gaps = 35/173 (20%)
Query: 16 DTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEV 75
D +E+E N + +C C++ GDL+ CEG C FH + C S + +
Sbjct: 298 DNQEEEKNSLCKSPRYLCQVCEDNGDLMECEGECHNYFHKS--------CVSENVEEP-- 347
Query: 76 EAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDD 135
F C+ C+ H CFAC KE C CG FYH +C++KL+
Sbjct: 348 -----FKCEECKSGVHTCFAC-------KEKDENTKKCSVHLCGKFYHENCLNKLMHSKS 395
Query: 136 EVAADQLAKSIIAGESFTCPLHKCCICKQGENKAD--SDLQFAVCRRCPKAYH 186
+ + + F CPLH+C C +K S + C RCP AYH
Sbjct: 396 D-----------SNKGFVCPLHQCATCAIESSKKTKASKGRLLRCVRCPTAYH 437
>gi|449270866|gb|EMC81514.1| Histone-lysine N-methyltransferase NSD3 [Columba livia]
Length = 1440
Score = 75.1 bits (183), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 66/159 (41%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ G+ L+ CEG C FH C L DE F C C+
Sbjct: 691 DTVCQICESSGESLVSCEGECCGVFHP--------ECLGLKSLPDE-----KFICAECKN 737
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+H CF+C K G +V C +TCG FYH CV K A ++
Sbjct: 738 GEHTCFSC-------KLPGKDVKRCSVSTCGKFYHEACVRK------------FATALFE 778
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
F CP H C C ++ S + C RCP A+H
Sbjct: 779 SRGFRCPQHCCTACSMDKDIHKASKGRMVRCLRCPVAFH 817
>gi|37360238|dbj|BAC98097.1| mKIAA1090 protein [Mus musculus]
Length = 857
Score = 75.1 bits (183), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 161 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 207
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ EV CV CG FYH CV K L ++
Sbjct: 208 SCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFESRG 248
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 249 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 285
>gi|28204960|gb|AAH46473.1| Whsc1 protein, partial [Mus musculus]
Length = 851
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 155 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 201
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ EV CV CG FYH CV K L ++
Sbjct: 202 SCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFESRG 242
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 243 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 279
>gi|402852477|ref|XP_003890948.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Papio anubis]
Length = 1013
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ F C C H
Sbjct: 317 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 363
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 364 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 404
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 405 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 441
>gi|410914796|ref|XP_003970873.1| PREDICTED: uncharacterized protein LOC101068764 [Takifugu rubripes]
Length = 2363
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 62/159 (38%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
+++C C+ G+LL CEG+C +FH C SL F C C+
Sbjct: 1549 ENICQVCEKTGELLLCEGQCCGAFHLA--------CISLAEAPKG-----KFICPECKSG 1595
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
H CF C K+ +V C+ CG FYH C++ A ++
Sbjct: 1596 IHTCFVC-------KKRSEDVRRCMIPVCGKFYHGECIA------------NFAPTVAVN 1636
Query: 150 ESFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYH 186
F C +H C C + S + C RCP AYH
Sbjct: 1637 RGFRCSIHVCLTCFIANPNSSTISKGRLVRCVRCPIAYH 1675
>gi|390360513|ref|XP_785219.3| PREDICTED: histone-lysine N-methyltransferase NSD3-like
[Strongylocentrotus purpuratus]
Length = 1736
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 66/160 (41%), Gaps = 34/160 (21%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
++VC C+ G LL CEG C +FH +D +GL +V +F C C
Sbjct: 831 ENVCQVCERTGQLLLCEGGCCGAFH--LDC--------IGL---QVAPSGSFRCDECISG 877
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
H CF C K + EV C CG +YH C+ + L +
Sbjct: 878 VHSCFVC-------KLSDQEVRRCHVPVCGKYYHEGCIRRFPL------------TRFDS 918
Query: 150 ESFTCPLHKCCICKQGENKAD--SDLQFAVCRRCPKAYHR 187
FTCPLH C C K+ S + C RCP AYH+
Sbjct: 919 RGFTCPLHACVACFADNPKSTKASRGRLMRCVRCPTAYHQ 958
>gi|348554221|ref|XP_003462924.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Cavia
porcellus]
Length = 1438
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 64/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C R FH C D F C C+
Sbjct: 701 DTVCQICESSGDSLVPCEGECYRYFHL--------ECLGWTSVPDG-----KFTCMECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K G +V C CG FYH CV K + +I
Sbjct: 748 GQHPCFSC-------KVPGEDVKRCSVGACGKFYHEACVRK------------FSTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|348527922|ref|XP_003451468.1| PREDICTED: hypothetical protein LOC100692734 [Oreochromis niloticus]
Length = 2421
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 66/162 (40%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
+ +++C C+ G+LL CEG+C +FH C SL F C C+
Sbjct: 1599 MKENICQVCEKTGELLLCEGQCCGAFHLP--------CISLAEAPKG-----KFVCPECK 1645
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K+ G +V C+ CG FYH C++ A ++
Sbjct: 1646 SGIHTCFVC-------KKRGEDVRRCMIPVCGKFYHGECIA------------NYAPTVP 1686
Query: 148 AGESFTCPLHKCCICKQGENKADSDL---QFAVCRRCPKAYH 186
F C +H C C N S++ + C RCP AYH
Sbjct: 1687 VNRGFRCSIHVCLTCFIA-NPNSSNISKGRLVRCVRCPVAYH 1727
>gi|281339990|gb|EFB15574.1| hypothetical protein PANDA_004672 [Ailuropoda melanoleuca]
Length = 1363
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 64/161 (39%), Gaps = 38/161 (23%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ- 90
VC C+ G L+ CEG C +FH A LGL++ F C C +
Sbjct: 663 VCQLCEKTGSLVLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGRF 709
Query: 91 ---HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C KE+ +V CV + CG FYH CV K L ++
Sbjct: 710 SWIHSCFVC-------KESKTDVKRCVVSQCGKFYHEACVRKYPL------------TVF 750
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 751 ESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 791
>gi|296485540|tpg|DAA27655.1| TPA: nuclear receptor binding SET domain protein 1 [Bos taurus]
Length = 1275
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 119 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 165
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 166 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 206
Query: 148 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 186
+ F C LH C C N A S + C RCP AYH
Sbjct: 207 QNKGFRCSLHICTTC-HAANPASVSASKGRLMRCVRCPVAYH 247
>gi|444722051|gb|ELW62755.1| putative histone-lysine N-methyltransferase NSD2 [Tupaia chinensis]
Length = 1421
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 60/157 (38%), Gaps = 37/157 (23%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ G LL CEG C +FH A LGL++ C H
Sbjct: 560 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSRRPEGRF------TCSEWIH 603
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 604 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 644
Query: 152 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 645 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 681
>gi|432099958|gb|ELK28852.1| Histone-lysine N-methyltransferase NSD3 [Myotis davidii]
Length = 1641
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C + D F C C+
Sbjct: 895 DTVCQICESSGDSLIPCEGECYKYFHL--------ECLGVTSLPDG-----KFICMECKT 941
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+H CF+C K +G +V C CG FYH CV K ++
Sbjct: 942 GKHPCFSC-------KVSGTDVKRCSVGACGKFYHEACVRK------------FPTAVFE 982
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 983 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 1021
>gi|417406466|gb|JAA49891.1| Putative histone-lysine n-methyltransferase nsd3-like isoform 3
[Desmodus rotundus]
Length = 1438
Score = 73.6 bits (179), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 63/159 (39%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C D F C C+
Sbjct: 701 DTVCQICESSGDSLIPCEGECYKYFHL--------ECLGSPPLPDG-----KFICVECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K G EV C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVPGTEVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|452821906|gb|EME28931.1| hypothetical protein Gasu_36670 [Galdieria sulphuraria]
Length = 1451
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 67/156 (42%), Gaps = 20/156 (12%)
Query: 37 DNGGDLLCCEGRCLRSFHATIDAGEESH---CASLGLTKDEVEAMLNFFCKNCEYKQHQC 93
D L+ C+G C ++FH G+ES S+ L + + C NC +H C
Sbjct: 1291 DASNPLIACDGPCQKTFHYECAPGDESERPPIESIELYASDERPLWQ--CHNCTSGEHIC 1348
Query: 94 FACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFT 153
F+CG+ G + + C +CG FYH C + + LA+ G F
Sbjct: 1349 FSCGRPGHI-ADVNDPLRKCSLGSCGRFYHNSCAQQ----------EPLARLASDGNWFR 1397
Query: 154 CPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKC 189
CP H C +C E DS C CP+A+H +C
Sbjct: 1398 CPQHYCVVC---EESGDSRPMIK-CIYCPRAWHVQC 1429
>gi|348527268|ref|XP_003451141.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like
[Oreochromis niloticus]
Length = 1605
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 65/159 (40%), Gaps = 33/159 (20%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C+ GD L+ CEG C R FH LGL+ F C C
Sbjct: 857 DTVCQICEVYGDGLMVCEGDCNRQFHMEC----------LGLSS---PPEGRFTCAECRT 903
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
H CF+C K G EV C + CG +YH CV KL +
Sbjct: 904 GNHPCFSC-------KTVGREVTRCSVSGCGCYYHEDCVRKL-----------PGTTSSP 945
Query: 149 GESFTCPLHKC-CICKQGENKADSDLQFAVCRRCPKAYH 186
G F+CP H C C + + + S + C RCP AYH
Sbjct: 946 GGGFSCPQHSCSTCCLERDPQRASKGRLIRCIRCPLAYH 984
>gi|350593412|ref|XP_003483678.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
NSD3-like [Sus scrofa]
Length = 1438
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 64/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ D L+ CEG C + FH C L D F C C+
Sbjct: 701 DTVCQICESPSDSLIPCEGECCKHFHL--------ECLGLTSLPDG-----KFVCVECKT 747
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 748 GQHPCFSC-------KVSGTDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827
>gi|432901504|ref|XP_004076868.1| PREDICTED: uncharacterized protein LOC101161079 [Oryzias latipes]
Length = 2214
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 66/161 (40%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
+ +++C C+ G+LL CEG+C +FH C SL D + F C C+
Sbjct: 1424 MKETICQVCEKTGELLLCEGQCCGAFHLA--------CISLA---DAPKG--KFVCPECK 1470
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K+ G +V C+ CG FYH C++ A +
Sbjct: 1471 SGVHTCFVC-------KKRGEDVRRCMIPVCGKFYHGECIT------------NHAPTAP 1511
Query: 148 AGESFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYH 186
F C +H C C + + S + C RCP AYH
Sbjct: 1512 VNRGFRCSIHVCLTCFIANPNSSSISKGRLVRCVRCPVAYH 1552
>gi|428162029|gb|EKX31238.1| hypothetical protein GUITHDRAFT_83305 [Guillardia theta CCMP2712]
Length = 171
Score = 72.4 bits (176), Expect = 2e-09, Method: Composition-based stats.
Identities = 47/166 (28%), Positives = 66/166 (39%), Gaps = 29/166 (17%)
Query: 30 DSVCSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D C C G L+CC+G C RSF H A LG+ ++E + + C C+
Sbjct: 27 DYHCDICLGWEGTLVCCDGSCRRSF----------HLACLGMDEEENDEEEEWLCNLCKV 76
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+C C SD + + C +C ++H C+ D A
Sbjct: 77 GAKRCMIC-----SDSQDSENMIHCKVESCKKYFHRDCLKTWNCEVD------------A 119
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKI 194
FTCP H C C Q + F C CP A+H KCLP ++
Sbjct: 120 AGRFTCPRHTCKACNQHSYTGKQGVMFK-CIDCPAAFHFKCLPSQV 164
>gi|432873648|ref|XP_004072321.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Oryzias
latipes]
Length = 1597
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 70/177 (39%), Gaps = 46/177 (25%)
Query: 17 TEEDESNEEDELFDSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEV 75
+ +D SNE+ D+VC C+ G+ L+ CEG C R FH C L D
Sbjct: 840 SRQDSSNEKR---DTVCQICEAYGEGLMVCEGDCSRQFHP--------ECLGLSAPPDG- 887
Query: 76 EAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDD 135
F C C+ H CF+C K TG EV C + C +YH CV K
Sbjct: 888 ----RFTCLECKNSNHPCFSC-------KTTGQEVSRCSAPGCSCYYHEDCVWK------ 930
Query: 136 EVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAV------CRRCPKAYH 186
+ G F CP H C C + DLQ A C RCP AYH
Sbjct: 931 -----HPGTTSSPGGGFCCPQHSCSTC-----CLERDLQRASKGRLIHCIRCPTAYH 977
>gi|355708043|gb|AES03146.1| nuclear receptor binding SET domain protein 1 [Mustela putorius
furo]
Length = 588
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/161 (30%), Positives = 67/161 (41%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ G+LL CE +C +FH LGLT+ F C C
Sbjct: 365 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 411
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K++G +V C+ CG FYH CV K +++
Sbjct: 412 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 452
Query: 148 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
+ F C LH C C + + S + C RCP AYH
Sbjct: 453 QNKGFRCSLHICITCHAANPASVSASKGRLMRCVRCPVAYH 493
>gi|443709486|gb|ELU04158.1| hypothetical protein CAPTEDRAFT_93578, partial [Capitella teleta]
Length = 1129
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
+S+C C+ +LL C G CL FHA A LGL+ V F C C
Sbjct: 490 ESLCGVCEKPNELLHCVGPCLGHFHA----------ACLGLS---VVPTSTFKCDECSTG 536
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
H CF C K +DKET C S CG FYH CVS + + +
Sbjct: 537 IHSCFTCKK---ADKETKK----CSSHGCGKFYHMKCVSG------------IRSAKVDN 577
Query: 150 ESFTCPLHKCCICKQGENKADSDLQ--FAVCRRCPKAYH 186
FTCPLH C C + Q C RCP AYH
Sbjct: 578 GHFTCPLHACATCAVDDVYKPKPKQNRLLRCVRCPTAYH 616
>gi|119602957|gb|EAW82551.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_d [Homo sapiens]
Length = 742
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/159 (33%), Positives = 63/159 (39%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
+ VC C+ G LL CEG C +FH A LGL++ F C C
Sbjct: 44 EYVCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASG 90
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
H CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 91 IHSCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFES 131
Query: 150 ESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 132 RGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 170
>gi|325180900|emb|CCA15310.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
Length = 1258
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 84/187 (44%), Gaps = 39/187 (20%)
Query: 25 EDELFDSVCSFCDNGGD--LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFF 82
++EL C D G ++ C+ C ++H + LGLT DE + +
Sbjct: 310 DEELCGICCELEDEEGSNLVVWCKKSCYTAYHLS----------CLGLT-DEFDDS-EWK 357
Query: 83 CKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL---------R 133
C+ CE ++ CFACG+ GS D+ G VF C S C FYH CV R
Sbjct: 358 CEQCENREQICFACGRNGSIDERGG--VFKCCSQRCHKFYHYSCVEGCRRTRFYGSKRKR 415
Query: 134 DDEV--AADQLAKSIIAGES-------FTCPLHKCCICKQGENKADSDLQFAVCRRCPKA 184
+ ++ +D L+ E F CP H C +C E+ S+L F C +CP++
Sbjct: 416 NPQIRKESDNLSMESDTNEEDFKYKFKFRCPRHICAVC---EDAKSSELMF--CIKCPES 470
Query: 185 YHRKCLP 191
YH C+P
Sbjct: 471 YHTSCVP 477
>gi|190349638|gb|ACE75882.1| multiple-myeloma-related WHSC1/MMSET isoform RE-IIBP [Homo sapiens]
Length = 704
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 63/159 (39%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
+ VC C+ G LL CEG C +FH A LGL++ F C C
Sbjct: 6 EYVCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASG 52
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
H CF C KE+ EV CV CG FYH CV K L ++
Sbjct: 53 IHSCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFES 93
Query: 150 ESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 94 RGFRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 132
>gi|444511191|gb|ELV09829.1| Histone-lysine N-methyltransferase NSD3 [Tupaia chinensis]
Length = 1235
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D++C C++ GD LL CEG C R FH LGLT F C C+
Sbjct: 555 DTLCQVCESAGDSLLPCEGDCCRHFHLEC----------LGLTS---LPDGKFTCVECQT 601
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
H CF+C K G +V C + CG FYH CV K ++
Sbjct: 602 GLHPCFSC-------KTPGKDVKRCSVSACGKFYHEACVRK------------FPTAVFE 642
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 643 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 681
>gi|38424052|dbj|BAD01742.1| hypothetical protein [Oryza sativa Japonica Group]
gi|38637126|dbj|BAD03380.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 645
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 109/475 (22%), Positives = 182/475 (38%), Gaps = 86/475 (18%)
Query: 98 KLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLH 157
K S K + VF C +A+CG FYHP CV+ LL D A L K I G +FTCP+H
Sbjct: 212 KTSSKKKYNTSLVFQCCNASCGRFYHPKCVAGLLEPD---GACGLEKRIADGMTFTCPVH 268
Query: 158 KCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHR 217
C CKQ E+ R+I+F+ K + R WE + +
Sbjct: 269 WCFECKQIED------------------------REISFKKK---DVTARVWEVPKGDPK 301
Query: 218 ILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLT 277
I IYCL I + G+ G E+ I Q+L S + +S T
Sbjct: 302 IFIYCLLVSIAGKGGS--------LGTYEDIVI-------QTLPSQRRAHRHHQIRSHCT 346
Query: 278 SKAPPQGKFSVKALKRVPSKAGQGETMEISERLLV-GSDSSRRAKATDVSRKSFKGNVKS 336
+ PP +A++ +G+ ER D + R + N KS
Sbjct: 347 QRQPPNF-LRREAMREAVESSGEAVRERTGERTPAKDDDEAARKQGKQRWHSGATANTKS 405
Query: 337 LSVQVDRSSSVDSKKTSLGERLYAAFVTEGTE------QTKFGKQDNSDRETSRTVTVK- 389
D+ +V K T + + + T +++ K+D + + + T K
Sbjct: 406 QESYEDQ-FNVSKKPTHISSQAGTCIASCQTRMNNKEHKSRRNKEDIPEMRSEKVATPKV 464
Query: 390 PLRKKLISELPSLDED----------SKRRLSSLMKDAASSVRME-----------EILK 428
+++K E LD++ +L SL+ + + + +E E+
Sbjct: 465 SIKRKRKCEFEFLDDEVATTKGMAFQRAEKLPSLIVEKETKMAIESNVFKQMEQNRELSS 524
Query: 429 RHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQ 488
HK P Y S D+ I L + E ++E I++ ++++ + + D E E LS
Sbjct: 525 PHKRPKGGVYLS----DEQIKLVEYESTLELIQSKIREVKRLKTEVID---FVEIEELSS 577
Query: 489 IFKWKNKLKVYLAPFLHGMR--YTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVD 541
F K +V + M + +H +D+ + ++H V + D
Sbjct: 578 -FHRKATAEVLITKMQSDMSLVWGKLSQHLDGLDEEAMLDREMHSLVQKCYFVAD 631
>gi|224010613|ref|XP_002294264.1| hypothetical protein THAPSDRAFT_269843 [Thalassiosira pseudonana
CCMP1335]
gi|220970281|gb|EED88619.1| hypothetical protein THAPSDRAFT_269843 [Thalassiosira pseudonana
CCMP1335]
Length = 392
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 46/151 (30%), Positives = 69/151 (45%), Gaps = 31/151 (20%)
Query: 29 FDSVCSFC-------DNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNF 81
F +C+ C D LL CEG C R FH T CA+L + E +
Sbjct: 208 FTWICTECREAECLDDPDAVLLICEGLCNRPFHPT--------CANLLSPPPDNET---W 256
Query: 82 FCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKL-----LLRDDE 136
C++CE +HQC AC G D +V+ C + CG F+H C+S ++ +++
Sbjct: 257 ICQDCEQGRHQCAACHNYGQDD----VDVYCCTAKNCGLFFHEACLSMYNVDVQVVEEEQ 312
Query: 137 VAADQLA----KSIIAGESFTCPLHKCCICK 163
A D A ++I++ F CP H C C+
Sbjct: 313 YADDDTADLQERTIVSMPMFKCPAHSCFTCQ 343
>gi|281346901|gb|EFB22485.1| hypothetical protein PANDA_005493 [Ailuropoda melanoleuca]
Length = 926
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 180 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDG-----KFICVECKT 226
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
QH CF+C K +G +V C CG FYH CV K +I
Sbjct: 227 GQHACFSC-------KVSGTDVKRCSVGACGKFYHEACVRK------------FPTAIFE 267
Query: 149 GESFTCPLHKCCICK-QGENKADSDLQFAVCRRCPKAYH 186
+ F CP H C C + + S + C RCP AYH
Sbjct: 268 SKGFRCPQHCCSACSVEKDIYKASKGRMMRCLRCPVAYH 306
>gi|410914004|ref|XP_003970478.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Takifugu rubripes]
Length = 1169
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 66/160 (41%), Gaps = 33/160 (20%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L ++VC C+ GDLL C+G C +FH +GL+ FFC+ C
Sbjct: 377 LKENVCQVCERTGDLLACDGHCYGAFHPQC----------IGLS---AAPKGKFFCRECS 423
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K +G V C+ CG FYH C++ + +
Sbjct: 424 SGVHSCFVC-------KTSGNGVKRCMIPLCGKFYHNDCIT------------AFSATQP 464
Query: 148 AGESFTCPLHKCCICKQGENKAD-SDLQFAVCRRCPKAYH 186
+ CPLH C C ++ S + A C RCP AYH
Sbjct: 465 HNKGLRCPLHVCLSCHINYPHSNCSKGRLARCVRCPVAYH 504
>gi|298715287|emb|CBJ27936.1| Chromodomain-helicase-DNA-binding protein 8 [Ectocarpus siliculosus]
Length = 3661
Score = 70.1 bits (170), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 50/168 (29%), Positives = 69/168 (41%), Gaps = 34/168 (20%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC+ C++GG + C+G C RSFH A LG+ + E + C C K
Sbjct: 2739 VCARCEDGGVTIMCDGPCQRSFHP----------ACLGMDDNPEED--PWMCNRCMNKVQ 2786
Query: 92 QCFACGKLGSS-DKETGAEVFP-----CVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS 145
+C CGK GS D A P C ++CG +YH C+ K+ + +
Sbjct: 2787 KCLECGKKGSEMDSHNRAVKIPGGVSRCQLSSCGRYYHKECLDKITPNRTSYSKEG---- 2842
Query: 146 IIAGESFTCPLHKCCICKQGENKADSDL---QFAVCRRCPKAYHRKCL 190
+F CP H C C K ++L C RC KA CL
Sbjct: 2843 -----NFKCPQHFCIDC----GKTSTNLGPRTLVKCLRCAKARCPDCL 2881
>gi|149634094|ref|XP_001506476.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Ornithorhynchus anatinus]
Length = 1437
Score = 69.7 bits (169), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 61/159 (38%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ D L+ CEG C R FH C L D F C CE
Sbjct: 700 DTVCQVCESYSDSLVACEGECFRHFHL--------ECLGLTSLPDG-----KFVCVECET 746
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
H CF+C K G +V C + CG FYH CV +I
Sbjct: 747 GHHTCFSC-------KVPGKDVKRCSVSACGKFYHEACVRI------------FPTAIFE 787
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
F CP H C C ++ S + C +CP AYH
Sbjct: 788 ARGFRCPQHCCSACSMEKDIHKASKGRMMRCLKCPIAYH 826
>gi|189534132|ref|XP_001923087.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Danio rerio]
Length = 773
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 63/161 (39%), Gaps = 36/161 (22%)
Query: 35 FCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCF 94
C+ GDLL CEG+C +FH E F C+ C H CF
Sbjct: 1 MCEKQGDLLLCEGQCCGAFHPQCTGLNEP-------------PTGKFLCQECTSGVHSCF 47
Query: 95 ACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTC 154
AC +LG +V C+ CG FYH C A A ++ +F C
Sbjct: 48 ACKRLGE-------DVRRCMVPGCGKFYHGEC------------AASHAPTVPLNRAFRC 88
Query: 155 PLHKC--CICKQGENKADSDLQFAVCRRCPKAYHRK--CLP 191
PLH C C N + + Q C RCP AYH C+P
Sbjct: 89 PLHACLSCFILNPANPSVAKGQLTRCIRCPVAYHSSDFCIP 129
>gi|242080051|ref|XP_002444794.1| hypothetical protein SORBIDRAFT_07g028156 [Sorghum bicolor]
gi|241941144|gb|EES14289.1| hypothetical protein SORBIDRAFT_07g028156 [Sorghum bicolor]
Length = 212
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 38/51 (74%)
Query: 657 LSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHI 707
LSGKSFYLPGS+ ++K M+QWNM+ PPL LWSR D+A H +A+ HI
Sbjct: 2 LSGKSFYLPGSLGVDNKVMEQWNMSPPPLSLWSRSDWAKRHSDIAKSMRHI 52
>gi|348520760|ref|XP_003447895.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Oreochromis niloticus]
Length = 1167
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 62/158 (39%), Gaps = 35/158 (22%)
Query: 32 VCSFCDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 90
VC C+ G DL+ CEG+C FH HC L D+ C+ C
Sbjct: 461 VCQMCEQAGEDLVPCEGQCCGMFHL--------HCLGQTLNPDD-----KLLCQECSTGI 507
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
H CF C KL EV C CG FYH CV L ++ +
Sbjct: 508 HSCFICKKLE-------GEVRRCHVPHCGKFYHEACVRLNTL------------TVFDNK 548
Query: 151 SFTCPLHKCCICKQG-ENKADSDL-QFAVCRRCPKAYH 186
F CPLH C C G K S+ + C RCP AYH
Sbjct: 549 GFRCPLHTCLSCHYGCRTKHKSNKGRLMRCLRCPVAYH 586
>gi|126303359|ref|XP_001372863.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Monodelphis
domestica]
Length = 1435
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 61/159 (38%), Gaps = 34/159 (21%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ D L+ CEG C + FH C L D F C C+
Sbjct: 699 DTVCQICESYSDSLVVCEGECCKHFHPD--------CLGLSSLPDG-----RFLCTECKT 745
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
CF+C K G +V C CG FYH CV K A ++
Sbjct: 746 GHRTCFSC-------KIPGKDVKRCSVPACGKFYHEACVRK------------FATAVFE 786
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
F CP H C C ++ S + C RCP AYH
Sbjct: 787 SRGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPIAYH 825
>gi|340500320|gb|EGR27208.1| myb domain protein [Ichthyophthirius multifiliis]
Length = 1057
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 76/185 (41%), Gaps = 36/185 (19%)
Query: 27 ELFDSVCSFCDNGGDLLCCEGRCLRSFH----ATIDAGEE-------SHCASLGLTKDEV 75
+++ +CS C+ C+ C RSFH TI+ + + L ++
Sbjct: 697 DIYKKLCSLCEQPKCTFFCQSFCKRSFHEQCKETIEQKQNLKEKADIYYPKELNFEDLQL 756
Query: 76 EAMLN--FFCKNCEYKQHQCFACGKLGSSD------KETGAEVFPCVSATCGHFYHPHCV 127
+ M N + C +C+ CF C G + K+ V C +A C +YH +C+
Sbjct: 757 QQMTNNKYICIDCQGNMVICFICKNYGIYNNNQKLKKQKNDSVNKCSTANCNKYYHLNCI 816
Query: 128 SKLLLRDDEVAADQLAKSIIA-GESFTCPLHKCCICKQ-GENKADSDLQFAVCRRCPKAY 185
L K I + + F C LH C C Q G++K S C +CPKAY
Sbjct: 817 KNA----------PLFKFIDSNNKRFRCSLHYCASCTQSGDSKVISQ-----CIKCPKAY 861
Query: 186 HRKCL 190
H CL
Sbjct: 862 HLGCL 866
>gi|301615056|ref|XP_002936997.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific [Xenopus (Silurana) tropicalis]
Length = 2440
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 73/198 (36%), Gaps = 51/198 (25%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
+ ++VC C+ G+LL CE +C +FH LG+ E F C C
Sbjct: 1356 MKENVCQVCEKPGELLLCEAQCCGAFHLQC----------LGM---EAMPQGKFVCTECS 1402
Query: 88 YKQ----HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLA 143
H CF C KE+ V C+ CG +YH C A +
Sbjct: 1403 SGTLQGYHTCFVC-------KESDQGVKRCMLPLCGKYYHEEC------------ALKYP 1443
Query: 144 KSIIAGESFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYHRK--CLPRKIAFEDK 199
+ F C LH C C N + S + C RCP AYH CLP
Sbjct: 1444 PATQQNRGFRCSLHICSTCYATNPSNPSASKGRLMRCVRCPIAYHANDFCLP-------- 1495
Query: 200 LEEGIITRAWEGLL-PNH 216
G +T A ++ PNH
Sbjct: 1496 --AGAVTLASNSIICPNH 1511
>gi|241998002|ref|XP_002433644.1| set domain protein, putative [Ixodes scapularis]
gi|215495403|gb|EEC05044.1| set domain protein, putative [Ixodes scapularis]
Length = 729
Score = 66.6 bits (161), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 50/178 (28%), Positives = 73/178 (41%), Gaps = 36/178 (20%)
Query: 14 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKD 73
V +E+ + E+ + VC+ C+ G L C G C +FHA LG++
Sbjct: 179 VGPLQENSAVEKPPTKEEVCTVCEKMGATLFCTGPCKLAFHAD----------CLGVS-- 226
Query: 74 EVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLR 133
F C C +H C C LG ++K C +CG FYH C++KL L
Sbjct: 227 --HVPRAFVCDECTTGEHLCLVCKDLGETEK--------CSLESCGCFYHKKCLAKLPL- 275
Query: 134 DDEVAADQLAKSIIAGESFTCPLHKCCICKQGE-NKADSDLQFAVCRRCPKAYHRKCL 190
+ + F CP H C C Q + ++ + C RCP A+H CL
Sbjct: 276 ------------PLKQDPFVCPRHFCLGCFQEKPTTLNAKGRLLRCVRCPSAFHVGCL 321
>gi|213624868|gb|AAI71696.1| Wolf-Hirschhorn syndrome candidate 1 [Danio rerio]
Length = 1461
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 64/160 (40%), Gaps = 36/160 (22%)
Query: 30 DSVCSFCDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
+SVC C+ G D++ C G+C ++H HC + + +++ C C
Sbjct: 759 ESVCLVCEQTGEDIVTCAGQCYGTYHL--------HCIGVERSAEKI------LCTACST 804
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
H CF C K++ EV C + CG FYH CV L ++
Sbjct: 805 GVHVCFTC-------KKSEGEVRRCCALHCGRFYHEACVRLSAL------------TVFE 845
Query: 149 GESFTCPLHKCCICKQGENKA--DSDLQFAVCRRCPKAYH 186
F CPLH C C A + + C RCP AYH
Sbjct: 846 NRGFRCPLHTCLSCHYSGRAACKATKGKMMRCLRCPVAYH 885
>gi|128485462|ref|NP_001076020.1| probable histone-lysine N-methyltransferase NSD2 [Danio rerio]
Length = 1461
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 64/160 (40%), Gaps = 36/160 (22%)
Query: 30 DSVCSFCDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
+SVC C+ G D++ C G+C ++H HC + + +++ C C
Sbjct: 759 ESVCLVCEQTGEDIVTCAGQCYGTYHL--------HCIGVERSAEKI------LCTACST 804
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
H CF C K++ EV C + CG FYH CV L ++
Sbjct: 805 GVHVCFTC-------KKSEGEVRRCCALHCGRFYHEACVRLSAL------------TVFE 845
Query: 149 GESFTCPLHKCCICKQGENKA--DSDLQFAVCRRCPKAYH 186
F CPLH C C A + + C RCP AYH
Sbjct: 846 NRGFRCPLHTCLSCHYSGRAACKATKGKMMRCLRCPVAYH 885
>gi|260800140|ref|XP_002594994.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
gi|229280233|gb|EEN51005.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
Length = 1541
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 64/162 (39%), Gaps = 37/162 (22%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
+++C C+ G+LL CEG C +FH +D +GL + F C C
Sbjct: 680 ENLCQVCEQVGELLLCEGSCCGAFH--LDC--------IGLQQ---MPTGTFKCDECISG 726
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
H CF C K + K C CG +YH C+ K ++
Sbjct: 727 VHTCFVCRKSEVTTKR-------CSIPICGKYYHEDCLRK------------FPNTVFEA 767
Query: 150 ESFTCPLHKC--CICKQGENKADSDLQ---FAVCRRCPKAYH 186
+ F CPLH C C+ G + + A C RCP AYH
Sbjct: 768 KGFRCPLHVCGTCVAVAGGDVKKVKSRGRILARCVRCPTAYH 809
>gi|432952957|ref|XP_004085262.1| PREDICTED: histone-lysine N-methyltransferase NSD2-like, partial
[Oryzias latipes]
Length = 1167
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 59/158 (37%), Gaps = 35/158 (22%)
Query: 32 VCSFCDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 90
VC C+ G DL+ CEG+C FH HC ++ C+ C
Sbjct: 563 VCQVCEQAGEDLVPCEGQCCGMFHL--------HCLGPSFKPED-----KLLCQQCRAGI 609
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
H CFAC K++ EV C CG FYH C+ L ++ +
Sbjct: 610 HSCFAC-------KQSEGEVRRCHVVHCGKFYHEACIRLNPL------------TVFDNK 650
Query: 151 SFTCPLHKCCICKQGENKADSDLQ--FAVCRRCPKAYH 186
CPLH C C G + C RCP AYH
Sbjct: 651 GLRCPLHACLSCHYGSRTKHKSTKGRLMRCLRCPVAYH 688
>gi|449665927|ref|XP_002164851.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Hydra
magnipapillata]
Length = 1214
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 63/159 (39%), Gaps = 31/159 (19%)
Query: 30 DSVCSFCDNG-GDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D C C+ G L+ C+G C SFH C + K +F C C
Sbjct: 450 DLYCCICEGSEGQLITCQGSCFNSFHF--------DCLGVSCIKT------SFTCDECLS 495
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
H CF C K G+ + C CG YH C+ K+ + V + K
Sbjct: 496 NNHCCFFCKKPGA--------ILKCSHNMCGKHYHQDCLIKIPVIKSNVNENNANK---- 543
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHR 187
F CPLH C +C + +K + + C RCP AYH+
Sbjct: 544 ---FICPLHNCRLCSEKASKGNLT-KLLKCIRCPTAYHQ 578
>gi|431897324|gb|ELK06586.1| Putative histone-lysine N-methyltransferase NSD2 [Pteropus alecto]
Length = 915
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 61/171 (35%), Gaps = 44/171 (25%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ- 90
VC C+ G L C G C +FH + LGL + E L C C +
Sbjct: 596 VCQLCEKTGGLALCVGPCCGAFHPSC----------LGLPRTP-EGRLT--CHECASGKC 642
Query: 91 --------HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
H CF C KE+ +V CV + CG FYH CV + L
Sbjct: 643 RVPTGPWVHSCFVC-------KESAGDVKRCVVSQCGKFYHEACVRRFPL---------- 685
Query: 143 AKSIIAGESFTCPLHKCCIC---KQGENKADSDLQFAVCRRCPKAYHRKCL 190
++ F CPLH C C + C CP A+H CL
Sbjct: 686 --AVFESRGFRCPLHSCVSCHASNPSNPRPSKGGSLLCCESCPAAFHPDCL 734
>gi|86278478|gb|ABC88477.1| Wolf-Hirschhorn syndrome candidate 1 protein [Danio rerio]
Length = 1366
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 64/160 (40%), Gaps = 36/160 (22%)
Query: 30 DSVCSFCDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
+SVC C+ G D++ C G+C ++H HC + + +++ C C
Sbjct: 664 ESVCLVCEQTGEDIVTCAGQCYGTYHL--------HCIGVERSAEKI------LCTACST 709
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
H CF C K++ EV C + CG FYH CV L ++
Sbjct: 710 GVHVCFTC-------KKSEGEVRRCCALHCGRFYHEACVRLSAL------------TVFE 750
Query: 149 GESFTCPLHKCCICKQGENKA--DSDLQFAVCRRCPKAYH 186
F CPLH C C A + + C RCP AYH
Sbjct: 751 NRGFRCPLHTCLSCHYSGRAACKATKGKMMRCLRCPVAYH 790
>gi|410898830|ref|XP_003962900.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Takifugu rubripes]
Length = 1329
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 62/158 (39%), Gaps = 36/158 (22%)
Query: 32 VCSFCDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 90
VC C+ G DL+ CEG+C FH LG+ E+E L C+ C
Sbjct: 630 VCQSCEKAGEDLVPCEGQCCGMFHLQC----------LGMM--ELEDKL--LCQECSTGV 675
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
H CF C K++ V C CG FYH C+ L ++ +
Sbjct: 676 HSCFHC-------KKSEGSVRRCHVPHCGKFYHEACIRLNPL------------TVFDNK 716
Query: 151 SFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYH 186
F CPLH C C C S + C RCP AYH
Sbjct: 717 GFRCPLHTCLGCCCSSRTKHKPSKGRLMRCLRCPVAYH 754
>gi|168031756|ref|XP_001768386.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680311|gb|EDQ66748.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 615
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 52/181 (28%), Positives = 76/181 (41%), Gaps = 30/181 (16%)
Query: 11 EDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGL 70
+DM + + +D+ F +C C + G+LLCCE RC FH + LGL
Sbjct: 12 KDMPKEDGRKKGKRDDDGFAFICDVCSDFGELLCCE-RCRSGFHLS----------CLGL 60
Query: 71 TKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKL 130
D+ + + C +C + +CF C GS + ++ C CG +YH C SK
Sbjct: 61 --DKCPDVEPWLCSSCAENKVRCFKCKAFGSLE----VDLVKCAHRNCGKYYHKDC-SKG 113
Query: 131 LLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+R D A CP H C C++ + A + C CP AYH C
Sbjct: 114 WVRIPPKKTDNGA--------MVCPRHHCDACRKCQKNA----KLHRCLYCPVAYHESCS 161
Query: 191 P 191
P
Sbjct: 162 P 162
>gi|145484966|ref|XP_001428492.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395578|emb|CAK61094.1| unnamed protein product [Paramecium tetraurelia]
Length = 1405
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 79/211 (37%), Gaps = 70/211 (33%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
+C+ C+ L C G C R FH G + + + G E + + C++CE +
Sbjct: 1166 LCAICEKQKCDLFCRGFCRRQFHKECLEGGQYNGQTQG------EINIKYVCQDCEKYKG 1219
Query: 92 QCFACGKLGS-----------SDKE------------------TG--------------- 107
CF C K G+ DK+ TG
Sbjct: 1220 TCFVCLKQGTFYPNQSKKKTQQDKQFIDDDGYYDNQGEQIIVATGKITRARAQQQLQQQL 1279
Query: 108 AEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS--IIAGE--SFTCPLHKCCICK 163
+E+ C S C YH CV Q +K+ I+ GE F C LH C CK
Sbjct: 1280 SELVKC-SLNCHKLYHFACV-------------QTSKNFKILDGERQKFKCALHFCEKCK 1325
Query: 164 QGENKADSDLQFAVCRRCPKAYHRKCLPRKI 194
N D + + C RCPK+YH KC P+ I
Sbjct: 1326 DKSN--DDNQKMIQCLRCPKSYHEKCAPKGI 1354
>gi|441664377|ref|XP_003279042.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Nomascus
leucogenys]
Length = 780
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 55/178 (30%), Positives = 69/178 (38%), Gaps = 50/178 (28%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTK-------------DEVE 76
+ VC C+ G LL CEG C +FH A LGL++ D +
Sbjct: 140 EYVCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSRRPEGRFTCSECASDRSK 189
Query: 77 AMLN---FFCKNC---EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKL 130
+L+ F +N E H CF C KE+ +V CV CG FYH CV K
Sbjct: 190 VILDGWIFLLENAWTSEQGIHSCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKY 242
Query: 131 LLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
L ++ F CPLH C C N S + C RCP AYH
Sbjct: 243 PL------------TVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 288
>gi|224129130|ref|XP_002320508.1| predicted protein [Populus trichocarpa]
gi|222861281|gb|EEE98823.1| predicted protein [Populus trichocarpa]
Length = 190
Score = 63.9 bits (154), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 43/82 (52%), Gaps = 5/82 (6%)
Query: 660 KSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERN 719
KSFYL GS N KQ++ W + PP+Y+WSR D+ A +A +HGH R + +
Sbjct: 29 KSFYLSGSAVANGKQIEDWTVNTPPIYIWSRADWTAWRIPIAREHGHGIRALVTGEQSKG 88
Query: 720 CY-----ETHAVDHPKEEGQGD 736
C+ E DH K+ G+
Sbjct: 89 CFGYVDIEGWESDHTKKHSHGN 110
>gi|47222897|emb|CAF99053.1| unnamed protein product [Tetraodon nigroviridis]
Length = 768
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 63/157 (40%), Gaps = 33/157 (21%)
Query: 31 SVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 90
++ C+ GDLL C+G C +FH C L + FFC+ C
Sbjct: 45 TILQVCERTGDLLVCDGHCYGAFH--------PQCIGLSAAPEG-----RFFCRECSSGV 91
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
H CF C K G+ K C+ CG FYH C++ + + +
Sbjct: 92 HSCFVCKKSGNGVKR-------CMIPLCGKFYHSDCIA------------TFSATQPHNK 132
Query: 151 SFTCPLHKCCICKQGENKAD-SDLQFAVCRRCPKAYH 186
F CPLH C C ++ S + A C RCP AYH
Sbjct: 133 GFRCPLHVCLSCHVAYPHSNCSKGRLARCVRCPVAYH 169
>gi|47226564|emb|CAG08580.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1404
Score = 63.5 bits (153), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 54/134 (40%), Gaps = 32/134 (23%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C+ G+ L+ CEG C R FH LGLT F C C
Sbjct: 713 DTVCQICEAYGEGLVVCEGDCSRQFHLEC----------LGLT---ALPEGRFTCLECLN 759
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+H CF+C K G EV C + CG FYH CV KL L +
Sbjct: 760 GKHPCFSC-------KTAGREVTRCSVSGCGCFYHEDCVRKL-----------LGTTSSP 801
Query: 149 GESFTCPLHKCCIC 162
G F CP H C C
Sbjct: 802 GGGFCCPQHICSTC 815
>gi|427798749|gb|JAA64826.1| Putative histone-lysine n-methyltransferase nsd2, partial
[Rhipicephalus pulchellus]
Length = 755
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/173 (28%), Positives = 69/173 (39%), Gaps = 36/173 (20%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML 79
+E+ +++ +C C+ G L C G C SFH +D +G+T+
Sbjct: 258 NENGIDEKAKGQLCLICEATGATLTCTGPCRMSFH--LDC--------IGITQPPDA--- 304
Query: 80 NFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAA 139
F C C H C C ++ K C CG FYH C+ L L E
Sbjct: 305 -FLCDECTTGSHSCLVCKGTEATQK--------CSMEQCGTFYHISCLKTLPLPIKE--- 352
Query: 140 DQLAKSIIAGESFTCPLHKCCICKQGENKADS-DLQFAVCRRCPKAYHRKCLP 191
+S CPLH C C Q + +A S + C +CP A+H CLP
Sbjct: 353 ----------DSLVCPLHFCLPCLQQKPRAVSMKGRLLRCVQCPSAFHSGCLP 395
>gi|300121581|emb|CBK22099.2| unnamed protein product [Blastocystis hominis]
Length = 213
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 74/163 (45%), Gaps = 36/163 (22%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
+C+ C+ G + C G+C R+FH A L T V++ + C +CE + H
Sbjct: 5 ICAECNKFGVDIVCFGKCHRAFHQ----------ACLPCT---VQSTRRWICPDCEKEVH 51
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
+C C + S DK+ + C + C F+HP CV + R+
Sbjct: 52 RCHQCKEFAS-DKD----LIQCQATDCYLFFHPSCVPEQFRRN---------------RP 91
Query: 152 FTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKI 194
F CP H C C+Q E++ D L+ C C K+YH CLP ++
Sbjct: 92 FLCPAHCCSFCRQWEDERDPLLK---CAYCCKSYHEYCLPLEV 131
>gi|449470764|ref|XP_004153086.1| PREDICTED: uncharacterized protein LOC101206575, partial [Cucumis
sativus]
Length = 203
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 76 EAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAE 109
+A+ F CKNCEYKQHQC+ACG LGSSD+ +GAE
Sbjct: 170 QAIETFICKNCEYKQHQCYACGNLGSSDQSSGAE 203
>gi|118367847|ref|XP_001017133.1| Type III restriction enzyme, res subunit family protein [Tetrahymena
thermophila]
gi|89298900|gb|EAR96888.1| Type III restriction enzyme, res subunit family protein [Tetrahymena
thermophila SB210]
Length = 2184
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 86/239 (35%), Gaps = 84/239 (35%)
Query: 24 EEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT----IDAGEESHCASLGLTK------- 72
++++LF +C +CD + C+G C RSFH+ ++AG + T+
Sbjct: 1561 KKEDLFKKLCYYCDRPKCTVFCQGHCKRSFHSECKERVEAGWINENGMTSETRIVPTEDH 1620
Query: 73 ---DEVEAMLN--FFCKNCEYKQHQC---------FACGKLGSSDK-------------- 104
+ ++ MLN + CK+CE C F L +K
Sbjct: 1621 YDEERLKKMLNINYTCKDCEANTAICFICKKKGSFFPPAVLNKQNKRIQQQKQKIIAEKE 1680
Query: 105 ------------------------------ETGAEVFPCVSATCGHFYHPHCVSKLLLRD 134
+ +++ C +A C FYHP C+ L
Sbjct: 1681 KQKDPSNEIVEESENELDEDYDLVEDDSVSQKTSDLTKCSTANCNKFYHPVCIKPYAL-- 1738
Query: 135 DEVAADQLAKSIIAG-ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPR 192
K I + + F CPLH C C DS + + C RCPKAYH +C +
Sbjct: 1739 --------FKYIDSNNKRFRCPLHYCAHCFIS---GDS-MAISQCVRCPKAYHLRCFEK 1785
>gi|326671180|ref|XP_694414.5| PREDICTED: histone-lysine N-methyltransferase NSD3 [Danio rerio]
Length = 1562
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 64/167 (38%), Gaps = 50/167 (29%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATI---DAGEESHCASLGLTKDEVEAMLNFFCKN 85
D+VC C+ GD L+ CEG C R FH ++G+ES C+
Sbjct: 811 DTVCHVCETFGDSLVSCEGDCNRLFHPECMGSNSGKESE----------------TVCQE 854
Query: 86 CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS 145
C+ H CF+C K T ++ C CG +YH CV K S
Sbjct: 855 CKTGSHPCFSC-------KVTEGDMKRCSVNGCGRYYHETCVRK------------YTGS 895
Query: 146 IIAGESFTCPLHKCCICKQGENKADSDLQFA------VCRRCPKAYH 186
+ CP H C C D DLQ A C RCP AYH
Sbjct: 896 ASDTKGLRCPQHSCATC-----CLDRDLQKAGKGRMMRCIRCPVAYH 937
>gi|426232375|ref|XP_004010202.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2 [Ovis aries]
Length = 1273
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 58/149 (38%), Gaps = 40/149 (26%)
Query: 40 GDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKL 99
G L+ CEG C +FH A LGL++ + H CF C
Sbjct: 627 GSLVLCEGPCCGAFHL----------ACLGLSRRP---------EGRSRGTHSCFVC--- 664
Query: 100 GSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKC 159
KE+ ++V CV + CG FYH CV + L ++ F CPLH C
Sbjct: 665 ----KESKSDVKRCVVSQCGKFYHEACVRRFPL------------TVFESRGFRCPLHSC 708
Query: 160 CICKQGE--NKADSDLQFAVCRRCPKAYH 186
C N S + C RCP AYH
Sbjct: 709 LSCHASNPSNPRPSKGKMLRCVRCPVAYH 737
>gi|47216786|emb|CAG03790.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1443
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 57/154 (37%), Gaps = 35/154 (22%)
Query: 36 CDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCF 94
C+ G DL+ CEG+C FH C + L D+ C+ C H CF
Sbjct: 676 CEEAGEDLVPCEGQCCGMFHL--------QCLGVLLEPDD-----KVLCQECSTGVHSCF 722
Query: 95 ACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTC 154
C K++ V C CG FYH C+ L ++ + F C
Sbjct: 723 HC-------KKSEGSVRRCHVPHCGKFYHEACIRLNPL------------TVFDNKGFRC 763
Query: 155 PLHKCCICKQGENKADSDLQ--FAVCRRCPKAYH 186
PLH C C G + C RCP AYH
Sbjct: 764 PLHTCLGCCYGNRTKPKSTKGRLMRCLRCPVAYH 797
>gi|47221386|emb|CAF97304.1| unnamed protein product [Tetraodon nigroviridis]
Length = 722
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 61/161 (37%), Gaps = 34/161 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L +V C+ G+LL CEG+C +FH C SL F C C+
Sbjct: 42 LLVTVVQVCEKTGELLLCEGQCCGAFHLP--------CISLAEAPKG-----KFICPECK 88
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
H CF C K+ +V C+ CG FYH C++ A ++
Sbjct: 89 SGIHTCFVC-------KKRSDDVRRCMIPVCGKFYHGECIA------------NFAPTVA 129
Query: 148 AGESFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYH 186
F C +H C C + S + C RCP AYH
Sbjct: 130 VNRGFRCSIHVCLTCFIANPNSSTISKGRLVRCVRCPIAYH 170
>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
domestica]
Length = 2705
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 66/190 (34%), Gaps = 45/190 (23%)
Query: 7 DDMEEDMVHD-------TEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDA 59
D +EE + HD + E L ++VC C+ G+LL CE +C +FH
Sbjct: 1517 DSIEESVEHDHGMPVSKKMQAERGGGAALKENVCQNCEKVGELLLCEAQCCGAFHLEC-- 1574
Query: 60 GEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCG 119
+GLT+ F CK C H CF C K G +V C+ CG
Sbjct: 1575 --------IGLTE---MPKGKFICKECRTGIHTCFVC-------KPDGEDVKRCLLPLCG 1616
Query: 120 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKAD---SDLQFA 176
F F C LH C C N A S +
Sbjct: 1617 KFT--------------XXXXXXXXXXXXXXGFRCSLHMCITC-HAANPASLSASKGRLM 1661
Query: 177 VCRRCPKAYH 186
C RCP AYH
Sbjct: 1662 RCVRCPVAYH 1671
>gi|94732456|emb|CAK03662.1| novel protein similar to vertebrate Wolf-Hirschhorn syndrome
candidate 1 (WHSC1) [Danio rerio]
Length = 728
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 64/160 (40%), Gaps = 36/160 (22%)
Query: 30 DSVCSFCDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
+SVC C+ G D++ C G+C ++H HC + + +++ C C
Sbjct: 26 ESVCLVCEQTGEDIVTCAGQCYGTYHL--------HCIGVERSAEKI------LCTACST 71
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
H CF C K++ EV C + CG FYH CV L ++
Sbjct: 72 GVHVCFTC-------KKSEGEVRRCCALHCGRFYHEACVRLSAL------------TVFE 112
Query: 149 GESFTCPLHKCCICKQGENKA--DSDLQFAVCRRCPKAYH 186
F CPLH C C A + + C RCP AYH
Sbjct: 113 NRGFRCPLHTCLSCHYSGRAACKATKGKMMRCLRCPVAYH 152
>gi|428175286|gb|EKX44177.1| hypothetical protein GUITHDRAFT_109961 [Guillardia theta CCMP2712]
Length = 707
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 57/114 (50%), Gaps = 16/114 (14%)
Query: 525 IVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCL---YKNYD-ILPAKNDFNF 580
+V+ L +N D VDF CG N F ++ + C + +YD +P D N+
Sbjct: 555 LVEALQGVINQSDHFVDFACGNNRFGSILCQ--------CFGLSWSSYDEAVP--EDRNY 604
Query: 581 EKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVP 634
+ WM P ++ + +++G+ PP + ++ AL+F P+LL+L+VP
Sbjct: 605 MGKSWMLAGPSDVP--ANVVIGIFPPLHKGPIMCEMYLQHALQFRPRLLVLLVP 656
>gi|145491053|ref|XP_001431526.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398631|emb|CAK64128.1| unnamed protein product [Paramecium tetraurelia]
Length = 1668
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/215 (22%), Positives = 84/215 (39%), Gaps = 57/215 (26%)
Query: 22 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCAS------------- 67
S ++EL +C C+ + C G C R+FH A D E + +
Sbjct: 1240 SKSKEELIKKLCYVCERPNCSVFCMGHCRRAFHIACKDLLETTEYINIEGPDQDFLNNHS 1299
Query: 68 ---LGLTKDEVEAMLN--FFCKNCEYKQHQCFACGKLGSS--DKETGAEVFP-------- 112
L ++++++ +N + C +C C C G+ +K+ EV
Sbjct: 1300 FPELNWSEEQLKENVNIRYSCPDCRNSLVVCLLCKTKGTYPPEKKQKEEVIASSDENDPL 1359
Query: 113 ---------------CVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG-ESFTCPL 156
C +A C ++H +C+ A+ L+K++ + E F CP
Sbjct: 1360 EDNVRKIKNKSAISKCSTANCNRYFHLNCIQ----------ANPLSKTLDSNVELFRCPS 1409
Query: 157 HKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 191
H C CK N +++ C RC +++H KC P
Sbjct: 1410 HVCVFCKV--NSSNTTTALIHCVRCCRSFHSKCAP 1442
>gi|357627347|gb|EHJ77076.1| hypothetical protein KGM_14526 [Danaus plexippus]
Length = 1912
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 55/125 (44%), Gaps = 10/125 (8%)
Query: 74 EVEAMLNFFCKNCE-YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 132
E+ F CKNC+ Y CF C K S KE C A C +YH C+
Sbjct: 1276 EITDYSEFKCKNCQKYDPPVCFVC-KYPISPKEKQGHRQKCQVAHCNKYYHLECLDHW-- 1332
Query: 133 RDDEVAADQLAKSIIAGESFTCPLHKC--CICK--QGENKADSDLQFAVCRRCPKAYH-- 186
+ +++++ E+ TCP H C C+C +G S + A C RCP YH
Sbjct: 1333 PQTQFNGGEISRTNKFSEALTCPRHVCHTCVCDDPRGCKTRFSGDKLARCVRCPATYHTF 1392
Query: 187 RKCLP 191
KCLP
Sbjct: 1393 TKCLP 1397
>gi|198420162|ref|XP_002123587.1| PREDICTED: similar to Wolf-Hirschhorn syndrome candidate 1-like 1
[Ciona intestinalis]
Length = 1485
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 73/163 (44%), Gaps = 27/163 (16%)
Query: 30 DSVCSFCDNGGD---LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNC 86
+++CS C++ D L+ C G C SFH C L + + ++++F CK C
Sbjct: 1331 ENICSKCESASDEEELVHCIGGCCASFHP--------RC----LEEGAITSIVSFTCKLC 1378
Query: 87 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSK---LLLRDDEVAADQLA 143
+ H CF C K S+D G E+ C CG YH C+++ LL+ D+ +
Sbjct: 1379 KEDNHPCFICKKSSSNDDSLG-EIKRCSVPKCGRSYHTTCLTERKIALLQHDDDDQKKQD 1437
Query: 144 KSIIAGE-----SFTCPLHKCCICKQGENKADSDLQFAVCRRC 181
K S C LH+C C E K DS++ A RC
Sbjct: 1438 KEKEKERQAILNSSKCTLHQCGTCTTKE-KEDSNV--ASSNRC 1477
>gi|384253423|gb|EIE26898.1| hypothetical protein COCSUDRAFT_59402 [Coccomyxa subellipsoidea
C-169]
Length = 534
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 59/166 (35%), Gaps = 43/166 (25%)
Query: 33 CSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQ 92
C C +GG+LL C+G CLRSFH A E
Sbjct: 146 CGCCGDGGELLECDGMCLRSFHQNCLAPSE------------------------------ 175
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKL----LLRDDEVAADQLAKSIIA 148
+ + + PC +CG ++H C++ L LL+ A
Sbjct: 176 -----RPNPENPPETPCILPCKMGSCGWYFHNACLNGLRDSGLLKIHREDPGVAAGGEDG 230
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKI 194
FTCP H C +C LQ C RCP AYH C+P +
Sbjct: 231 QRVFTCPAHFCHVCGN-SGAGRHTLQ---CWRCPTAYHATCVPAGV 272
>gi|167526088|ref|XP_001747378.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774213|gb|EDQ87845.1| predicted protein [Monosiga brevicollis MX1]
Length = 1048
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 47/173 (27%), Positives = 60/173 (34%), Gaps = 49/173 (28%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D C C G L+CC+G C +FH C S+ L E F C C
Sbjct: 439 DDTCRICGELGQLICCDGGCRGAFHL--------ECLSI-LQPPTGE----FRCDECSTG 485
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
H C+ C K+ GA++ C C YH C K + AD
Sbjct: 486 NHTCYTCDKV-------GADLIKCQFPHCNKLYHRGCAEK------QFKADNF------- 525
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEE 202
C +C G + VC CP AYH C+ AF K +E
Sbjct: 526 ---------CLVCGTGGD-------LVVCDGCPGAYHAACIKSTFAFTGKPDE 562
>gi|145522682|ref|XP_001447185.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414685|emb|CAK79788.1| unnamed protein product [Paramecium tetraurelia]
Length = 1508
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 50/225 (22%), Positives = 86/225 (38%), Gaps = 57/225 (25%)
Query: 12 DMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCAS--- 67
D+ D S ++EL +C C+ + C G C R+FH A D E + +
Sbjct: 1078 DLGEDARRLASKSKEELIKKLCYVCERPNCSVFCMGHCRRAFHIACKDLLETTEYINIEG 1137
Query: 68 -------------LGLTKDEVEAMLN--FFCKNCEYKQHQCFACGKLGSS--DKETGAEV 110
L ++++++ +N + C +C C C G+ +K+ EV
Sbjct: 1138 PDQDFLNNHSFPELNWSEEQLKENVNIRYSCPDCRNCLVVCLLCKSKGTYPPEKKQKEEV 1197
Query: 111 FP-----------------------CVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
C +A C ++H C+ A+ L+K++
Sbjct: 1198 IASSDENDPLEDNVKKIKNKSAISKCSTANCNRYFHLSCIQ----------ANPLSKTLD 1247
Query: 148 AG-ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 191
+ E F CP H C CK N +++ C RC +++H KC P
Sbjct: 1248 SNVELFRCPSHVCVFCKV--NSSNTTTALIHCVRCCRSFHSKCAP 1290
>gi|145512399|ref|XP_001442116.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409388|emb|CAK74719.1| unnamed protein product [Paramecium tetraurelia]
Length = 1659
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 46/217 (21%), Positives = 80/217 (36%), Gaps = 56/217 (25%)
Query: 22 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATI-----------------DAGEESH 64
S ++EL +C C+ + C G C R+FH D ++
Sbjct: 1240 SKSKEELIKKLCYVCERSNCSIFCMGHCRRAFHDACKELLETTDYINVEGPDQDFLNKNA 1299
Query: 65 CASLGLTKDEVEAMLN--FFCKNCEYKQHQCFACGKLGSS--DKETGAEVF--------- 111
+ ++++ +N + C +C C C + G+ DK+ E+
Sbjct: 1300 FPEFNWSGEQLKEKINIRYSCPDCRNNLVVCLLCKQKGTYPPDKKQKEEIVQIDEFDPSD 1359
Query: 112 -------------PCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI-IAGESFTCPLH 157
C +A C ++H C+ A+ L+KS+ + F CP H
Sbjct: 1360 DNVKKSKNKSIISKCSTANCNRYFHLSCIQ----------ANPLSKSLDTNADLFRCPSH 1409
Query: 158 KCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKI 194
C CK N ++ C RC +++H KC P +I
Sbjct: 1410 ICVFCKI--NSSNMTTALIHCIRCCRSFHSKCAPPEI 1444
>gi|351704076|gb|EHB06995.1| Putative histone-lysine N-methyltransferase NSD2 [Heterocephalus
glaber]
Length = 1372
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 39/98 (39%), Gaps = 21/98 (21%)
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
H CF C KE AEV CV CG FYH CV + L ++
Sbjct: 728 HACFVC-------KERKAEVKRCVVTQCGKFYHDACVRRYPL------------TVFESR 768
Query: 151 SFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 769 GFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 806
>gi|410958014|ref|XP_003985618.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Felis
catus]
Length = 1300
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 40/98 (40%), Gaps = 21/98 (21%)
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
H CF C KE+ +V CV + CG FYH CV K L ++
Sbjct: 650 HSCFVC-------KESKTDVRRCVVSQCGKFYHEACVRKYPL------------TVFESR 690
Query: 151 SFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 691 GFRCPLHSCVTCHASNPSNPRPSKGKMMRCVRCPVAYH 728
>gi|345781638|ref|XP_003432154.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Canis
lupus familiaris]
Length = 742
Score = 53.5 bits (127), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 58/159 (36%), Gaps = 43/159 (27%)
Query: 30 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D+VC C++ GD L+ CEG C + FH C L D F C C+
Sbjct: 41 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASVPDG-----KFICIECK- 86
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+G +V C CG FYH CV K +I
Sbjct: 87 ---------------TVSGTDVKRCSVGACGKFYHEACVRK------------FPTAIFE 119
Query: 149 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 186
+ F CP H C C ++ S + C RCP AYH
Sbjct: 120 SKGFRCPQHCCSACSMEKDIYKASKGRMMRCLRCPVAYH 158
>gi|91090902|ref|XP_973711.1| PREDICTED: similar to NSD1 [Tribolium castaneum]
Length = 1795
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 47/106 (44%), Gaps = 23/106 (21%)
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 152
C CG++G+ ++ C CG FYHP C+ KL Q S+ A +SF
Sbjct: 1062 CLVCGEMGAQGRQK------CSLHQCGRFYHPECL-KLW--------PQTQWSLNASDSF 1106
Query: 153 TCPLHKCCIC-----KQGENKADSDLQFAVCRRCPKAYHRK--CLP 191
CP H C C + ++ SD + C +CP YH C+P
Sbjct: 1107 VCPRHVCHTCISDDPRAANSRCSSD-KIVKCLKCPATYHSSNYCVP 1151
>gi|270014006|gb|EFA10454.1| hypothetical protein TcasGA2_TC012700 [Tribolium castaneum]
Length = 1740
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 47/106 (44%), Gaps = 23/106 (21%)
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 152
C CG++G+ ++ C CG FYHP C+ KL Q S+ A +SF
Sbjct: 1007 CLVCGEMGAQGRQK------CSLHQCGRFYHPECL-KLW--------PQTQWSLNASDSF 1051
Query: 153 TCPLHKCCIC-----KQGENKADSDLQFAVCRRCPKAYHRK--CLP 191
CP H C C + ++ SD + C +CP YH C+P
Sbjct: 1052 VCPRHVCHTCISDDPRAANSRCSSD-KIVKCLKCPATYHSSNYCVP 1096
>gi|340372263|ref|XP_003384664.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Amphimedon queenslandica]
Length = 1171
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 66/167 (39%), Gaps = 32/167 (19%)
Query: 31 SVCSFCDNGGD----LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNC 86
++CS CDN + ++ C+G CL SFH +D LGL F C C
Sbjct: 377 NICSICDNPSNPTQCIITCKGHCLNSFH--VDC--------LGLIS---PPSFPFVCDEC 423
Query: 87 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 146
C+AC K + + + PC +C YH C+ D ++
Sbjct: 424 TLSPSVCYACKK---PSVDPVSSLVPCSHHSCSQLYHLSCIRSC----GNFKFDSSNPNV 476
Query: 147 IAGESFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYHR-KCL 190
I +C LH C C+C G + + C +CP + H+ CL
Sbjct: 477 I----LSCGLHSCAKCVCSDGP-PVTGNQKLMQCLKCPLSLHKYSCL 518
>gi|357162868|ref|XP_003579549.1| PREDICTED: uncharacterized protein LOC100839049 [Brachypodium
distachyon]
Length = 1416
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 88/219 (40%), Gaps = 47/219 (21%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D C FC +GG+LLCC+ C ++H + +E S ++C NC
Sbjct: 1036 DDTCGFCGDGGELLCCDN-CPSTYHEACLSSQELPEGS-------------WYCHNC--- 1078
Query: 90 QHQCFACGK-LGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL-----RDDEVAADQLA 143
C +CG + + + +++ C+ CG YH C+ +++L R D +
Sbjct: 1079 --TCRSCGNPVNEKEVSSFSDILKCLQ--CGDAYHNTCIDRVMLPSDGKRSDTWFCGRYC 1134
Query: 144 KSIIAGESFTCPLHKCCICKQG-ENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEE 202
K I G LH + G EN ++DL + + RC R +KI
Sbjct: 1135 KEIFMG------LHS----QVGVENVINNDLSWTIL-RCNSDGQRLHSAQKI-------- 1175
Query: 203 GIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIF 241
G++T L +L C +D G + H+++
Sbjct: 1176 GLMTECNTKLAVALTLLEECFIRMVDPRTGVDMIPHVLY 1214
>gi|403352806|gb|EJY75922.1| Type III restriction enzyme, res subunit family protein [Oxytricha
trifallax]
Length = 2584
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 38/85 (44%), Gaps = 12/85 (14%)
Query: 110 VFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKA 169
V C C +YH CV ++ V+ K + F CPLH C IC+
Sbjct: 2299 VVRCTLGGCFKYYHVDCV----YQNKNVSCINSMKV----QRFRCPLHYCEICQAS---G 2347
Query: 170 DSDLQFAVCRRCPKAYHRKCLPRKI 194
DS L C CP AYH KCLP+ +
Sbjct: 2348 DSVL-IVQCVNCPTAYHLKCLPQDL 2371
>gi|301605820|ref|XP_002932540.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Xenopus
(Silurana) tropicalis]
Length = 5215
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/218 (24%), Positives = 88/218 (40%), Gaps = 47/218 (21%)
Query: 14 VHDTEEDESNEEDELFDS----VCSFCDN-GGDLLCCEGRCLRSFHATIDAG----EESH 64
VH EE E D+ S C++C + G + CCE C +++H AG ++ +
Sbjct: 587 VHQNEEQELVSVDKAVLSGSTERCAYCKHLGATIKCCEETCTQTYHYPCAAGAGTFQDLN 646
Query: 65 CASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHP 124
C +L L + ++ L ++ C C G + ++F TCG YH
Sbjct: 647 CLTL-LCPEHIDQALERSKEDAN-----CALCDSSG----DLLDQLF---CTTCGQHYHG 693
Query: 125 HCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKA 184
C+ ++A L ++ + CP C +C+ ++ D D Q VC C K
Sbjct: 694 MCL--------DIAVTPLKRA-----GWQCP--DCKVCQNCKHSGD-DNQMLVCDTCDKG 737
Query: 185 YHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYC 222
YH CL + + + T W+ N RI C
Sbjct: 738 YHTFCL-------QPVMDSVPTNGWK--CKNCRICTEC 766
>gi|301115892|ref|XP_002905675.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110464|gb|EEY68516.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 470
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 59/166 (35%), Gaps = 32/166 (19%)
Query: 28 LFDSVCSFCDNGG-DLLCCEGRCLRSFHAT----IDAGEESHCASLGLTKDEVEAMLNFF 82
LF S C C NGG LL C G C RS H + A SL K E N+
Sbjct: 92 LFPSHCFLCGNGGRGLLHCGGSCARSAHQNCVNQLQAPTVGEPVSLAERKQAAE---NWK 148
Query: 83 CKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
C C H+C CG +G + C CG+ +H C L DDE
Sbjct: 149 CAQCLRGLHRCQRCGFMGHEAN----GMRKCSVLDCGYHFHEQC-----LPDDET----- 194
Query: 143 AKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVC-----RRCPK 183
A F CP H C C E VC RCP+
Sbjct: 195 -----AHAGFVCPRHTCATCGTQETDMRRCKSCTVCDHMTHLRCPR 235
>gi|315639351|ref|ZP_07894513.1| putative type II DNA modification enzyme (methyltransferase)
[Campylobacter upsaliensis JV21]
gi|315480677|gb|EFU71319.1| putative type II DNA modification enzyme (methyltransferase)
[Campylobacter upsaliensis JV21]
Length = 365
Score = 51.2 bits (121), Expect = 0.004, Method: Composition-based stats.
Identities = 52/163 (31%), Positives = 74/163 (45%), Gaps = 32/163 (19%)
Query: 491 KWKNKLKV-----YLAPFLHGMRYT-------SFGRHFTKVDKLQAIVDKLHWYVNDGDM 538
+W+N KV + FL G +YT S FTK + + KL ++N +
Sbjct: 32 RWENLKKVPDEYKHDLNFLLGNKYTLEKENYRSHNEFFTKKESAKYCYKKLMDFLNQHSI 91
Query: 539 ------IVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKE 592
++ CG F LM K + G + YKN +IL +N NF PK+
Sbjct: 92 DVGEYVFIEPSCGDLSFYDLMPKN-SKIGIDLTYKNKEIL-CENFLNFT--------PKK 141
Query: 593 LAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPP 635
G LI+G NPPFG++ LA +FIN A F + I+PP
Sbjct: 142 --EGKYLILG-NPPFGLRGNLALRFINHAYRF-ADFIAFILPP 180
>gi|196013861|ref|XP_002116791.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
gi|190580769|gb|EDV20850.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
Length = 725
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 69/170 (40%), Gaps = 41/170 (24%)
Query: 32 VCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 90
VC C+ G+ L+ C+G C FH LG+T+ V+ F C+ C
Sbjct: 40 VCMLCEGIGEKLVTCQGGCFSKFHLE----------CLGITEWPVKG---FKCQECTSGT 86
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
+CF CG + V PC A CG YH C+S+ +V+ + +
Sbjct: 87 RKCFVCG-------SSEPPVLPCKVAGCGKCYHDTCISQFTC--GQVSEEGCIR------ 131
Query: 151 SFTCPLHKCCICKQ---GENKADSDL----QFAVCRRCPKAYH--RKCLP 191
CP H C C +N + D+ + A C CP +YH CLP
Sbjct: 132 ---CPRHVCTTCTAFAISKNDENFDVVIQKKLARCIYCPVSYHASEACLP 178
>gi|57506149|ref|ZP_00372070.1| putative type II DNA modification enzyme (methyltransferase)
[Campylobacter upsaliensis RM3195]
gi|57015544|gb|EAL52337.1| putative type II DNA modification enzyme (methyltransferase)
[Campylobacter upsaliensis RM3195]
Length = 377
Score = 50.8 bits (120), Expect = 0.004, Method: Composition-based stats.
Identities = 52/163 (31%), Positives = 74/163 (45%), Gaps = 32/163 (19%)
Query: 491 KWKNKLKV-----YLAPFLHGMRYT-------SFGRHFTKVDKLQAIVDKLHWYVNDGDM 538
+W+N KV + FL G +YT S FTK + + KL ++N ++
Sbjct: 32 RWENLKKVPDEYKHDLNFLLGNKYTLEKENYRSHNEFFTKKESAKYCYKKLIDFLNQHNI 91
Query: 539 ------IVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKE 592
++ CG F LM K G + YKN +IL +N NF PK+
Sbjct: 92 DVGKYVFIEPSCGDLSFYDLMPKN-SRIGIDLTYKNKEIL-CENFLNFT--------PKK 141
Query: 593 LAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPP 635
G LI+G NPPFG++ LA +FIN A F + I+PP
Sbjct: 142 --EGKYLILG-NPPFGLRGNLALRFINHAYHF-ADFIAFILPP 180
>gi|156230137|gb|AAI52413.1| WHSC1 protein [Homo sapiens]
Length = 713
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 39/98 (39%), Gaps = 21/98 (21%)
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
H CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 63 HSCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESR 103
Query: 151 SFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 104 GFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 141
>gi|297282129|ref|XP_002802212.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Macaca mulatta]
Length = 713
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 39/98 (39%), Gaps = 21/98 (21%)
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
H CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 63 HSCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESR 103
Query: 151 SFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 104 GFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 141
>gi|40789042|dbj|BAA83042.2| KIAA1090 protein [Homo sapiens]
Length = 715
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 39/98 (39%), Gaps = 21/98 (21%)
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
H CF C KE+ +V CV CG FYH CV K L ++
Sbjct: 65 HSCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESR 105
Query: 151 SFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 186
F CPLH C C N S + C RCP AYH
Sbjct: 106 GFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 143
>gi|157126101|ref|XP_001654536.1| set domain protein [Aedes aegypti]
gi|108873380|gb|EAT37605.1| AAEL010414-PA [Aedes aegypti]
Length = 1480
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 39/166 (23%), Positives = 61/166 (36%), Gaps = 23/166 (13%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNC-EYKQ 90
VC+ C DL+ C +C +H + + D F C +C K
Sbjct: 813 VCAVCTKPHDLVKC-TKCYNHYHLAC-------LTDVPIKPDPAGENKTFTCTDCVMLKA 864
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
CF C + KE E F CV CG YH +C+ + +
Sbjct: 865 PTCFVCNDQDDAVKE--EEKFRCVMNGCGKQYHLNCLRLF-------PQHKFTGTSSKSS 915
Query: 151 SFTCPLHKCCICKQGENKADSDL---QFAVCRRCPKAYHR--KCLP 191
+ CP H C C + ++++ C +CP +YH +C+P
Sbjct: 916 TLYCPTHTCHTCVSDDPRSNATTTKGHLIRCIKCPSSYHTEARCIP 961
>gi|300122245|emb|CBK22818.2| unnamed protein product [Blastocystis hominis]
Length = 171
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 74/186 (39%), Gaps = 35/186 (18%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D++C+ C+ + C G CLRSFH C L D ++ C +C +
Sbjct: 6 DNICAICERPNCPVRCSGGCLRSFHI--------QCLRLTSVPDR-----SWKCNDCVKR 52
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
H+CF C + +E +E+ C C ++H A + ++
Sbjct: 53 SHECFECKQ-----RELDSELVQCSYPDCRRYFH-----------KREACCRCNPRSLST 96
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFED--KLEEGIITR 207
F CP H+C C +D + C RC K YH CLP + D ++ E + +
Sbjct: 97 LDFVCPSHRCYAC---HGPSDMNNPLLKCFRCTKCYHYSCLPPSVQSLDSKRIRECLQPK 153
Query: 208 -AWEGL 212
W G+
Sbjct: 154 PTWRGI 159
>gi|322788177|gb|EFZ13959.1| hypothetical protein SINV_06678 [Solenopsis invicta]
Length = 1093
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 56/225 (24%), Positives = 80/225 (35%), Gaps = 58/225 (25%)
Query: 6 VDDMEEDMVHDTEEDESNEED-ELF-----DSVCSFCDNGGDLLCCEGRCLRSFH-ATID 58
+ ++D V++TE+ E+ LF + VC C+ G L C+G C FH + +
Sbjct: 432 IKRAKQDNVYNTEKSETKRRPYNLFKGMKQEKVCQICEKTGKLTRCKGPCYSYFHLSCVK 491
Query: 59 AGE-------------ESHCASLGLTKDEV--EAMLN----------FFCKNC-EYKQHQ 92
GE E + L + K + E+ N F C +C
Sbjct: 492 PGESSPEYSVDENTLDERLLSDLNIIKRSINGESENNGKSDEQEDETFKCIDCLSGVAPA 551
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 152
CF C E + C+ CG YH C+ +S G
Sbjct: 552 CFLCN-------EREGDRIRCIVPACGKHYHSKCLI------------PWPQSHWQGGRL 592
Query: 153 TCPLHKCCICK----QGENKADSDLQFAVCRRCPKAYHRK--CLP 191
TCP H C C Q + + A C RCP +YH CLP
Sbjct: 593 TCPYHVCHTCSSDNPQNNRSRAPNEKVAKCVRCPSSYHASALCLP 637
>gi|224061090|ref|XP_002300347.1| predicted protein [Populus trichocarpa]
gi|222847605|gb|EEE85152.1| predicted protein [Populus trichocarpa]
Length = 238
Score = 50.1 bits (118), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 33/43 (76%), Positives = 38/43 (88%)
Query: 1 MSGFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLL 43
MS FIVDD+E+D D EEDESN+EDELFDSVC+FCDNGG+LL
Sbjct: 196 MSAFIVDDVEDDNFEDLEEDESNDEDELFDSVCAFCDNGGNLL 238
>gi|325183066|emb|CCA17522.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1283
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 46/83 (55%), Gaps = 14/83 (16%)
Query: 15 HDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH---------ATIDAGEESHC 65
HD++ D + + + C+ C +GG+LLCC+ RC R+FH + + A + S
Sbjct: 121 HDSDHDAWTDHNRWY---CNLCKDGGELLCCD-RCPRAFHLKWYVGCFPSAVVAHQASRY 176
Query: 66 ASLGLTKDEVEAMLNFFCKNCEY 88
ASLGL K+E+ ++CK C Y
Sbjct: 177 ASLGLQKEEIPES-EWYCKFCAY 198
>gi|328716964|ref|XP_001950932.2| PREDICTED: hypothetical protein LOC100161826, partial
[Acyrthosiphon pisum]
Length = 1530
Score = 49.7 bits (117), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 46/120 (38%), Gaps = 17/120 (14%)
Query: 79 LNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVA 138
L + C C+ + CF CG D + + C ++CG ++H C+ D
Sbjct: 771 LKYMCSLCKAGKTNCFVCGL----DIDDPGQKIVCKISSCGKYFHESCLK------DWPQ 820
Query: 139 ADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAV-----CRRCPKAYHRK--CLP 191
+ S CP H C +C K+ F C +CP AYHR CLP
Sbjct: 821 CQWIQGSRNNVRCVVCPHHVCHLCISDNPKSSCTTHFPAEKITRCIKCPTAYHRSEYCLP 880
>gi|432926624|ref|XP_004080920.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
latipes]
Length = 4455
Score = 49.3 bits (116), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 69/170 (40%), Gaps = 35/170 (20%)
Query: 26 DELFDS----VCSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN 80
D+ DS VC+FC + G L C E C RS+H A+ G +D + +
Sbjct: 400 DKAIDSGSIQVCAFCRHLGASLRCQETGCTRSYHMPC-------AAAAGACQDWNQRRI- 451
Query: 81 FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 140
++ QC C G S G + C CG YH C+ D V
Sbjct: 452 LCTQHARTGSSQCRLCAGSGDSG---GLLMCSC----CGSCYHGSCL------DPPVTPS 498
Query: 141 QLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
L++ + CP +C +C+ + DS + +C RC KAYH CL
Sbjct: 499 PLSRV-----GWQCP--QCRVCRSCSLQGDSGV--LLCARCDKAYHAHCL 539
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 43/103 (41%), Gaps = 20/103 (19%)
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
KQ C CG G GAE + CG YHP+CV+ + R +I
Sbjct: 1017 LKQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVNIKITR------------VI 1059
Query: 148 AGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C+ + +D + +C C +YH CL
Sbjct: 1060 LTKGWRC--LECTVCEACGDASDPG-RLLLCDDCDISYHTYCL 1099
>gi|390596238|gb|EIN05640.1| hypothetical protein PUNSTDRAFT_145612 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1008
Score = 49.3 bits (116), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 59/134 (44%), Gaps = 26/134 (19%)
Query: 33 CSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQ 92
C +C +GG+L+CC GRC R FH C GLT +++ C +QH
Sbjct: 863 CHYCRDGGELVCC-GRCPRVFH--------KECR--GLTAAQLKRNFAIIC-----EQHS 906
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 152
C C + S + G +F C TC + C+ + D + + L + ++ G F
Sbjct: 907 CVVCNR---STGDAGGMLFRC--QTCPQAFCEDCLPE---GDIDAVGETLPEFLLLGYGF 958
Query: 153 TCPLH--KCCICKQ 164
+ + +C C+Q
Sbjct: 959 SPSAYYIRCHDCRQ 972
>gi|410953278|ref|XP_003983299.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Felis catus]
Length = 4884
Score = 49.3 bits (116), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 52/201 (25%), Positives = 75/201 (37%), Gaps = 55/201 (27%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 88
C+FC + G + CCE +C + +H AG G +D NFF E+
Sbjct: 285 CAFCKHLGATIKCCEDKCTQMYHYPCAAG-------AGTFQD----FRNFFLLCPEHIDQ 333
Query: 89 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
+ C C G + F C TCG YH C+ ++A L
Sbjct: 334 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 378
Query: 143 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 201
++ + CP K C CKQ + D + VC C K YH CL +
Sbjct: 379 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 422
Query: 202 EGIITRAWEGLLPNHRILIYC 222
+ + T W+ N RI + C
Sbjct: 423 KSVPTNGWK--CKNCRICVEC 441
Score = 44.7 bits (104), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 54/131 (41%), Gaps = 26/131 (19%)
Query: 66 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 119
A L KDE E ++ F + ++ HQ C CG G GAE + CG
Sbjct: 925 ADLSSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCG 979
Query: 120 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 179
YHP+CVS + + ++ + + C +C +C + KA + +C
Sbjct: 980 QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 1024
Query: 180 RCPKAYHRKCL 190
C +YH CL
Sbjct: 1025 DCDISYHTYCL 1035
>gi|198437220|ref|XP_002124518.1| PREDICTED: similar to Histone-lysine N-methyltransferase HRX (Zinc
finger protein HRX) (ALL-1) (Trithorax-like protein)
(Lysine N-methyltransferase 2A) (CXXC-type zinc finger
protein 7) [Ciona intestinalis]
Length = 3406
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 44/99 (44%), Gaps = 17/99 (17%)
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 152
C CG S+ + + C A C YHP C L D +A Q+ ++ ++
Sbjct: 1024 CLICGSFSSAKSISEGALVHC--ACCCEPYHPFCAEPDFLNTDVLA--QMKRN-----TW 1074
Query: 153 TCPLHKCC-ICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
C +CC +C +N VCRRC K+YH +CL
Sbjct: 1075 ICRKCQCCHVCGHPKN-------LLVCRRCKKSYHSECL 1106
>gi|359321427|ref|XP_003639590.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Canis lupus familiaris]
Length = 4874
Score = 48.9 bits (115), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 50/201 (24%), Positives = 69/201 (34%), Gaps = 56/201 (27%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAG--------------------------EESHC 65
C+FC + G + CCE +C + +H AG E+++C
Sbjct: 253 CAFCKHLGATIKCCEEKCTQIYHYPCAAGAGTFQDFRNFFLLCPEHIDQAPERSKEDANC 312
Query: 66 ASLGLTKDEVEAMLNFFCKNCEYKQH----------------QCFACGKLGSSDKETGAE 109
A D ++ FFC C H QC C K+ + KETG
Sbjct: 313 AVCDSPGDLLD---QFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPEC-KVCQNCKETGKN 368
Query: 110 VFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPL----HKCCICKQG 165
F TC Y+P CVS L+ Q I E T H C +C
Sbjct: 369 TFVLFCFTCSLNYNPFCVSPLVRIVPTNLFTQCRNCRICVECGTRSSSQWHHNCLVCDSC 428
Query: 166 ENKADSDLQFAVCRRCPKAYH 186
+ D+ +C C K YH
Sbjct: 429 YQQQDN-----LCPFCGKCYH 444
Score = 43.1 bits (100), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 26/125 (20%)
Query: 72 KDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHPH 125
KDE E ++ F + ++ HQ C CG G GAE + CG YHP+
Sbjct: 898 KDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPY 952
Query: 126 CVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAY 185
CVS + + ++ + + C +C +C + KA + +C C +Y
Sbjct: 953 CVSIKITK------------VVLSKGWRC--LECTVC-EACGKASDPGRLLLCDDCDISY 997
Query: 186 HRKCL 190
H CL
Sbjct: 998 HTYCL 1002
>gi|301759361|ref|XP_002915551.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Ailuropoda melanoleuca]
Length = 4927
Score = 48.9 bits (115), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 52/201 (25%), Positives = 75/201 (37%), Gaps = 55/201 (27%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 88
C+FC + G + CCE +C + +H AG G +D NFF E+
Sbjct: 300 CAFCKHLGATIKCCEEKCTQMYHYPCAAG-------AGTFQD----FRNFFLLCPEHIDQ 348
Query: 89 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
+ C C G + F C TCG YH C+ ++A L
Sbjct: 349 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 393
Query: 143 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 201
++ + CP K C CKQ + D + VC C K YH CL +
Sbjct: 394 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 437
Query: 202 EGIITRAWEGLLPNHRILIYC 222
+ + T W+ N RI + C
Sbjct: 438 KSVPTNGWK--CKNCRICVEC 456
Score = 43.1 bits (100), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 53/126 (42%), Gaps = 26/126 (20%)
Query: 71 TKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHP 124
+KDE E ++ F + ++ HQ C CG G GAE + CG YHP
Sbjct: 944 SKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHP 998
Query: 125 HCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKA 184
+CVS + + ++ + + C +C +C + KA + +C C +
Sbjct: 999 YCVSIKITK------------VVLSKGWRC--LECTVC-EACGKASDPGRLLLCDDCDIS 1043
Query: 185 YHRKCL 190
YH CL
Sbjct: 1044 YHTYCL 1049
>gi|281339843|gb|EFB15427.1| hypothetical protein PANDA_003530 [Ailuropoda melanoleuca]
Length = 4780
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 52/201 (25%), Positives = 75/201 (37%), Gaps = 55/201 (27%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 88
C+FC + G + CCE +C + +H AG G +D NFF E+
Sbjct: 156 CAFCKHLGATIKCCEEKCTQMYHYPCAAG-------AGTFQD----FRNFFLLCPEHIDQ 204
Query: 89 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
+ C C G + F C TCG YH C+ ++A L
Sbjct: 205 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 249
Query: 143 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 201
++ + CP K C CKQ + D + VC C K YH CL +
Sbjct: 250 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 293
Query: 202 EGIITRAWEGLLPNHRILIYC 222
+ + T W+ N RI + C
Sbjct: 294 KSVPTNGWK--CKNCRICVEC 312
Score = 43.1 bits (100), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 53/126 (42%), Gaps = 26/126 (20%)
Query: 71 TKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHP 124
+KDE E ++ F + ++ HQ C CG G GAE + CG YHP
Sbjct: 800 SKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHP 854
Query: 125 HCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKA 184
+CVS + + ++ + + C +C +C + KA + +C C +
Sbjct: 855 YCVSIKITK------------VVLSKGWRC--LECTVC-EACGKASDPGRLLLCDDCDIS 899
Query: 185 YHRKCL 190
YH CL
Sbjct: 900 YHTYCL 905
>gi|255550532|ref|XP_002516316.1| hypothetical protein RCOM_1188780 [Ricinus communis]
gi|223544546|gb|EEF46063.1| hypothetical protein RCOM_1188780 [Ricinus communis]
Length = 499
Score = 48.5 bits (114), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 49/102 (48%), Gaps = 26/102 (25%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 87
L D++CSFC GGDL+ C+ +C +FH LG + + + N+FC +C
Sbjct: 141 LSDTICSFCHYGGDLILCD-KCPSTFH-------------LGCLELKDVPLENWFCPSC- 185
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSK 129
C CGK G S T A C+ C YH HC++K
Sbjct: 186 ----CCELCGK-GDSSTSTNA----CLQ--CARAYHVHCLTK 216
>gi|310831063|ref|YP_003969706.1| putative DNA N6-adenine methyltransferase [Cafeteria roenbergensis
virus BV-PW1]
gi|309386247|gb|ADO67107.1| putative DNA N6-adenine methyltransferase [Cafeteria roenbergensis
virus BV-PW1]
Length = 297
Score = 48.5 bits (114), Expect = 0.022, Method: Composition-based stats.
Identities = 45/169 (26%), Positives = 80/169 (47%), Gaps = 18/169 (10%)
Query: 506 GMRYTSFGRHFTKVDKLQAIVD--KLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKN 563
G+ + +++TK + ++ +D K + +N D I++ G F ++E
Sbjct: 31 GLNRNTMDKYYTKTNIVKNCIDSIKSNVIINKLDTIIEPSAGNGAF-------IEEIKHL 83
Query: 564 CL-YKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKAL 622
C YK YDILP N+ K+D++ ++ I+G NPPFG ++ +A KFI K
Sbjct: 84 CNNYKFYDILPENNEI--IKQDFLKLDVNNFKNQKIHIIG-NPPFGRQSSIAIKFIKKCC 140
Query: 623 EFNPKLLILIVPPETERLDRK--ESAYELVWEDDQFLSGKSFYLPGSVD 669
F + ++ + RK Y LV+E+D K+ +L ++D
Sbjct: 141 LFAESISFILPKSFKKESMRKYFNKYYHLVFEEDLL---KNCFLVNNID 186
>gi|350420879|ref|XP_003492658.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
isoform 1 [Bombus impatiens]
Length = 1230
Score = 48.5 bits (114), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 50/196 (25%), Positives = 67/196 (34%), Gaps = 54/196 (27%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEES--HCASLGLTKDEVEAMLN------ 80
+ VC C+ G L C+G C FH + + GE S H + D++ +N
Sbjct: 402 EKVCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSVDENIMDDKILNDINEIKQSN 461
Query: 81 -----------------FFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 122
F C +C CF C E + C CG Y
Sbjct: 462 TDNDENNGKSEEQEDEFFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHY 514
Query: 123 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADS-----DLQFAV 177
H C+ +S G TCP H C C +N DS + + A
Sbjct: 515 HSSCLK------------SWPQSHWQGGRLTCPYHVCHTC-SSDNPQDSHSRAPNEKLAR 561
Query: 178 CRRCPKAYH--RKCLP 191
C RCP +YH CLP
Sbjct: 562 CVRCPSSYHTSTSCLP 577
>gi|300121090|emb|CBK21472.2| unnamed protein product [Blastocystis hominis]
Length = 1183
Score = 48.5 bits (114), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 35/87 (40%), Gaps = 12/87 (13%)
Query: 110 VFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKA 169
VF C+ +CG FYH +C+ + E +D F CP H CC C + N
Sbjct: 984 VFKCLVKSCGRFYHVNCIQNMEYGSKEHFSDL--------THFRCPAHFCCECHETGNTK 1035
Query: 170 DSDLQFAVCRRCPKAYHRKCLPRKIAF 196
C C KA H CL + F
Sbjct: 1036 ----HLVQCVMCYKAIHVSCLTLENGF 1058
>gi|350420881|ref|XP_003492659.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
isoform 2 [Bombus impatiens]
Length = 1239
Score = 48.5 bits (114), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 50/196 (25%), Positives = 67/196 (34%), Gaps = 54/196 (27%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEES--HCASLGLTKDEVEAMLN------ 80
+ VC C+ G L C+G C FH + + GE S H + D++ +N
Sbjct: 411 EKVCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSVDENIMDDKILNDINEIKQSN 470
Query: 81 -----------------FFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 122
F C +C CF C E + C CG Y
Sbjct: 471 TDNDENNGKSEEQEDEFFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHY 523
Query: 123 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADS-----DLQFAV 177
H C+ +S G TCP H C C +N DS + + A
Sbjct: 524 HSSCLK------------SWPQSHWQGGRLTCPYHVCHTC-SSDNPQDSHSRAPNEKLAR 570
Query: 178 CRRCPKAYH--RKCLP 191
C RCP +YH CLP
Sbjct: 571 CVRCPSSYHTSTSCLP 586
>gi|383864320|ref|XP_003707627.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
lysine-20 specific-like [Megachile rotundata]
Length = 1302
Score = 48.5 bits (114), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 67/195 (34%), Gaps = 52/195 (26%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASL-------------------G 69
+ VC C+ G L C+G C FH + + GE S S+
Sbjct: 474 EKVCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSIDESTMDDKILNDIKEIKGDN 533
Query: 70 LTKDEVEAMLN------FFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 122
+ +DE ++ F C +C CF C E + C CG Y
Sbjct: 534 IDEDETSGKIDEQEDELFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHY 586
Query: 123 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICK----QGENKADSDLQFAVC 178
H C+ +S G TCP H C C Q + S+ + A C
Sbjct: 587 HSSCLK------------SWPQSHWQGGRLTCPYHVCHTCSSDNPQDSHPRTSNEKLARC 634
Query: 179 RRCPKAYH--RKCLP 191
RCP +YH CLP
Sbjct: 635 VRCPSSYHTSTSCLP 649
>gi|326499283|dbj|BAK06132.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1350
Score = 48.1 bits (113), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 59/138 (42%), Gaps = 34/138 (24%)
Query: 4 FIVDDMEEDMVH------DTEEDE--------SNEEDELFDSVCSFCDNGGDLLCCEGRC 49
+VD ++ D++H D + D S E D+ D C C +GG+L+CC+G C
Sbjct: 989 ILVDGLDNDLLHCLIIAWDKQSDSERQAFFPVSTEGDDPNDDTCGICGDGGNLICCDG-C 1047
Query: 50 LRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAE 109
+FH + EE L D+ + C NC C C + + D AE
Sbjct: 1048 PSTFHMSCLELEE-------LPSDD------WRCTNC-----SCKLCHEHLNHDAPDNAE 1089
Query: 110 VFPCVSAT-CGHFYHPHC 126
+ P S + C YHP C
Sbjct: 1090 IDPLHSCSQCEKKYHPSC 1107
>gi|170050731|ref|XP_001861443.1| set domain protein [Culex quinquefasciatus]
gi|167872245|gb|EDS35628.1| set domain protein [Culex quinquefasciatus]
Length = 1181
Score = 47.8 bits (112), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 40/167 (23%), Positives = 62/167 (37%), Gaps = 31/167 (18%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ- 90
C+ C D++ C C FH + S L F C C Q
Sbjct: 690 ACAVCLEPNDVVKCSA-CHNHFHPRCIGAKTSSDPKL------------FKCVECADGQT 736
Query: 91 HQCFACGKLGSSDKETGAEV-FPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
H+CF C D +E C + CG +YH HC+ L ++ + + ++
Sbjct: 737 HRCFVCN---DQDATIASETKHRCALSGCGKYYHIHCLR--LFPQHKITSTPNSSTLF-- 789
Query: 150 ESFTCPLHKCCICKQGENKADSDL---QFAVCRRCPKAYH--RKCLP 191
CP H C C + + ++ C +CP +YH KC+P
Sbjct: 790 ----CPYHTCHTCVSDDPRTNATATRGSLVRCIKCPSSYHPDAKCVP 832
>gi|345493934|ref|XP_001600694.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
[Nasonia vitripennis]
Length = 1382
Score = 47.8 bits (112), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 51/194 (26%), Positives = 63/194 (32%), Gaps = 51/194 (26%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTKDEVEAML--------- 79
+ VC C+ G L+ C G C FH A + GE S S +E A
Sbjct: 574 ERVCQICEKTGKLIRCRGPCHSYFHLACVKPGESSPEPSEAGDIEENNAYKEDLKEIKAK 633
Query: 80 ---------------NFFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYH 123
NF C +C CF C E E C CG YH
Sbjct: 634 SKEIQEKEEAAFDYDNFKCIDCLSGVAPPCFVC-------HERDGERTKCSILACGKHYH 686
Query: 124 PHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICK----QGENKADSDLQFAVCR 179
P C+ + G TCP H C C Q + + +FA C
Sbjct: 687 PDCLK------------SWPQCQWQGGRLTCPHHICHTCASDNPQNSHPRSAGEKFAKCV 734
Query: 180 RCPKAYHR--KCLP 191
+CP YH CLP
Sbjct: 735 KCPSTYHASISCLP 748
>gi|432916836|ref|XP_004079403.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
latipes]
Length = 4802
Score = 47.8 bits (112), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 49/194 (25%), Positives = 74/194 (38%), Gaps = 41/194 (21%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATID--AGEESHCASLGL-TKDEVEAMLNFFCKNCEY 88
C++C G + CC C + +H AG SL L D +E ++ F +
Sbjct: 285 CAYCKRLGASIKCCAEGCAQLYHYPCAGAAGTFQDIRSLSLLCPDHIEVAIHKFVDDVN- 343
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
C C G + ++F C S CG YH C+ +A + +
Sbjct: 344 ----CVLCDSPG----DLLDQLF-CTS--CGLHYHGMCLD-------------MAVTPLR 379
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRA 208
+ CP +C +C+ +N D D + VC C K YH CL + E + T
Sbjct: 380 RAGWQCP--ECKVCQTCKNHGD-DTKMLVCDMCDKGYHTFCLQPAM-------ESLPTNG 429
Query: 209 WEGLLPNHRILIYC 222
W N R+ I C
Sbjct: 430 WR--CKNCRVCIQC 441
Score = 42.0 bits (97), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 35/141 (24%), Positives = 54/141 (38%), Gaps = 25/141 (17%)
Query: 55 ATIDAGEESHCASLGLTKDEVEAMLN---FFCKNCEY--KQHQCFACGKLGSSDKETGAE 109
++I+ G + + ++E AM N F N + KQ C CG G GAE
Sbjct: 923 SSINPGLTTFEMQYPVKEEEENAMHNTVVIFSSNDSFTLKQDMCVVCGSFG-----LGAE 977
Query: 110 VFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKA 169
A CG YHP CV + ++ + + C +C +C+
Sbjct: 978 GRLLACAQCGQCYHPFCVGIKI------------NKVVLSKGWRC--LECTVCEACGQAT 1023
Query: 170 DSDLQFAVCRRCPKAYHRKCL 190
D + +C C +YH CL
Sbjct: 1024 DPG-RLLLCDDCDISYHTYCL 1043
>gi|340718068|ref|XP_003397494.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
H3 lysine-36 and H4 lysine-20 specific-like [Bombus
terrestris]
Length = 1238
Score = 47.8 bits (112), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 50/196 (25%), Positives = 67/196 (34%), Gaps = 54/196 (27%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEES--HCASLGLTKDEVEAMLN------ 80
+ VC C+ G L C+G C FH + + GE S H + D++ +N
Sbjct: 410 EKVCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSVDENIMDDKILDDINEIKQSN 469
Query: 81 -----------------FFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 122
F C +C CF C E + C CG Y
Sbjct: 470 IDNDENNGKSEEQEDEFFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHY 522
Query: 123 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADS-----DLQFAV 177
H C+ +S G TCP H C C +N DS + + A
Sbjct: 523 HSSCLK------------SWPQSHWQGGRLTCPYHVCHTC-SSDNPQDSHSRAPNEKLAR 569
Query: 178 CRRCPKAYH--RKCLP 191
C RCP +YH CLP
Sbjct: 570 CVRCPSSYHTSTSCLP 585
>gi|338724475|ref|XP_001495649.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Equus caballus]
Length = 4910
Score = 47.8 bits (112), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 76/201 (37%), Gaps = 55/201 (27%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 88
C+FC + G + CCE +C +++H AG G +D +FF E+
Sbjct: 245 CAFCKHLGATIKCCEEKCTQTYHYPCAAG-------AGTFQD----FSHFFLLCPEHIDQ 293
Query: 89 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
+ C C G + F C TCG YH C+ ++A L
Sbjct: 294 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 338
Query: 143 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 201
++ + CP K C CKQ + D + VC C K YH CL +
Sbjct: 339 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 382
Query: 202 EGIITRAWEGLLPNHRILIYC 222
+ + T W+ N RI + C
Sbjct: 383 KSVPTNGWK--CKNCRICVEC 401
Score = 44.7 bits (104), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 54/131 (41%), Gaps = 26/131 (19%)
Query: 66 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 119
A L KDE E ++ F + ++ HQ C CG G GAE + CG
Sbjct: 885 ADLSSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCG 939
Query: 120 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 179
YHP+CVS + + ++ + + C +C +C + KA + +C
Sbjct: 940 QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 984
Query: 180 RCPKAYHRKCL 190
C +YH CL
Sbjct: 985 DCDISYHTYCL 995
>gi|294461989|gb|ADE76550.1| unknown [Picea sitchensis]
Length = 135
Score = 47.8 bits (112), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 39/73 (53%), Gaps = 2/73 (2%)
Query: 1140 MSTSAMDRYAPRL-HQLNNTRMNTFRSEPF-MPSRFGFYDSRAPQPGFFADMDFGPGFHP 1197
+++SAM RYAPRL Q+N TR N + E S +G Q F DF PG
Sbjct: 63 LTSSAMQRYAPRLDQQMNYTRSNLHQIEQAPRNSIYGAGQVLGSQNFPFDSRDFAPGPQN 122
Query: 1198 PFPQQGSGGWLDD 1210
P+P Q S GW+DD
Sbjct: 123 PYPHQSSSGWIDD 135
>gi|348568065|ref|XP_003469819.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Cavia
porcellus]
Length = 4878
Score = 47.8 bits (112), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 76/201 (37%), Gaps = 55/201 (27%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 88
C+FC + G + CCE +C + +H AG +G +D +FF E+
Sbjct: 241 CAFCKHLGATIKCCEEKCTQMYHYPCAAG-------VGTFQD----FSHFFLLCPEHIDQ 289
Query: 89 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
+ C C G + F C TCG YH C+ ++A L
Sbjct: 290 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 334
Query: 143 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 201
++ + CP K C CKQ + D + VC C K YH CL +
Sbjct: 335 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 378
Query: 202 EGIITRAWEGLLPNHRILIYC 222
+ + T W+ N RI + C
Sbjct: 379 KSVPTNGWK--CKNCRICVEC 397
Score = 43.9 bits (102), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 54/131 (41%), Gaps = 26/131 (19%)
Query: 66 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 119
A + KDE E ++ F + ++ HQ C CG G GAE + CG
Sbjct: 878 ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCG 932
Query: 120 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 179
YHP+CVS + + ++ + + C +C +C + KA + +C
Sbjct: 933 QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 977
Query: 180 RCPKAYHRKCL 190
C +YH CL
Sbjct: 978 DCDISYHTYCL 988
>gi|357119285|ref|XP_003561373.1| PREDICTED: uncharacterized protein LOC100845556 [Brachypodium
distachyon]
Length = 1589
Score = 47.4 bits (111), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 46/97 (47%), Gaps = 27/97 (27%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
C C +GGDLLCC+ RC +FH A LG+ E+ + ++FC+NC
Sbjct: 851 TCGICGDGGDLLCCD-RCTSTFHV----------ACLGI---EMPSG-DWFCRNC----- 890
Query: 92 QCFACGKLGSSDKETG--AEVFPCVSATCGHFYHPHC 126
C GS+++ T AE+ C+ C YH C
Sbjct: 891 ---ICKFCGSAEERTSSPAELLSCLQ--CSRKYHQVC 922
>gi|347972366|ref|XP_316738.5| AGAP004656-PA [Anopheles gambiae str. PEST]
gi|333469400|gb|EAA11974.5| AGAP004656-PA [Anopheles gambiae str. PEST]
Length = 1259
Score = 47.4 bits (111), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 35/165 (21%), Positives = 64/165 (38%), Gaps = 20/165 (12%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
VC C+ ++ C +C H A + + D+ + + C +
Sbjct: 608 VCYICNRPNNVTKC-SKCTLHLHLVCLANDPEEVVKMQELVDQ-KKLCCEKCSTTSIVEK 665
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
CF C +++ +++ CV C YH C+ +L + +V+A +
Sbjct: 666 TCFICN--DEIPEKSNEQIYRCVVGKCTQAYHISCL-QLFPQVRQVSAS----------T 712
Query: 152 FTCPLHKCCICKQGENKADSDL---QFAVCRRCPKAYHR--KCLP 191
CP H C C E ++ + + A C +CP +YH C+P
Sbjct: 713 IICPYHTCHTCVASEPRSTASMVKTTLAHCLKCPTSYHPSGNCIP 757
>gi|440895698|gb|ELR47828.1| Histone-lysine N-methyltransferase MLL3, partial [Bos grunniens
mutus]
Length = 4905
Score = 47.4 bits (111), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 76/201 (37%), Gaps = 55/201 (27%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 88
C+FC + G + CCE +C + +H AG G +D + +FF E+
Sbjct: 231 CAFCKHLGATIKCCEEKCTQMYHYPCAAG-------AGAFQD----LSHFFLLCPEHIDQ 279
Query: 89 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
+ C C G + F C TCG YH C+ ++A L
Sbjct: 280 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 324
Query: 143 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 201
++ + CP K C CKQ + D + VC C K YH CL +
Sbjct: 325 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 368
Query: 202 EGIITRAWEGLLPNHRILIYC 222
+ + T W+ N RI + C
Sbjct: 369 KSVPTNGWK--CKNCRICVEC 387
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 26/125 (20%)
Query: 72 KDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHPH 125
KDE E ++ F + ++ HQ C CG G GAE + CG YHP+
Sbjct: 872 KDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPY 926
Query: 126 CVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAY 185
CVS + + ++ + + C +C +C + KA + +C C +Y
Sbjct: 927 CVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISY 971
Query: 186 HRKCL 190
H CL
Sbjct: 972 HTYCL 976
>gi|38567828|emb|CAE05777.2| OSJNBb0020J19.6 [Oryza sativa Japonica Group]
Length = 1566
Score = 47.4 bits (111), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 22/104 (21%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D C FC +GG+LLCC+ C ++H T + +E S ++C NC
Sbjct: 1025 DDTCGFCGDGGELLCCDN-CPSTYHQTCLSDQELPEGS-------------WYCHNC--- 1067
Query: 90 QHQCFACGK-LGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 132
C +CG L + T + + C+ CG YH C+ + +L
Sbjct: 1068 --TCRSCGNPLSEKEVSTFSAILKCLQ--CGDSYHDTCIDQEML 1107
>gi|345493936|ref|XP_003427184.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
[Nasonia vitripennis]
Length = 1317
Score = 47.4 bits (111), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 51/194 (26%), Positives = 63/194 (32%), Gaps = 51/194 (26%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTKDEVEAML--------- 79
+ VC C+ G L+ C G C FH A + GE S S +E A
Sbjct: 509 ERVCQICEKTGKLIRCRGPCHSYFHLACVKPGESSPEPSEAGDIEENNAYKEDLKEIKAK 568
Query: 80 ---------------NFFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYH 123
NF C +C CF C E E C CG YH
Sbjct: 569 SKEIQEKEEAAFDYDNFKCIDCLSGVAPPCFVC-------HERDGERTKCSILACGKHYH 621
Query: 124 PHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICK----QGENKADSDLQFAVCR 179
P C+ + G TCP H C C Q + + +FA C
Sbjct: 622 PDCLK------------SWPQCQWQGGRLTCPHHICHTCASDNPQNSHPRSAGEKFAKCV 669
Query: 180 RCPKAYHR--KCLP 191
+CP YH CLP
Sbjct: 670 KCPSTYHASISCLP 683
>gi|426228657|ref|XP_004008414.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3 [Ovis aries]
Length = 4922
Score = 47.0 bits (110), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 75/201 (37%), Gaps = 55/201 (27%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 88
C+FC + G + CCE +C + +H AG G +D + +FF E+
Sbjct: 256 CAFCKHLGATIKCCEEKCTQMYHYPCAAG-------AGTFQD----LSHFFLLCPEHIDQ 304
Query: 89 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
+ C C G + F C TCG YH C+ ++A L
Sbjct: 305 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 349
Query: 143 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 201
++ + CP K C CKQ + D + VC C K YH CL +
Sbjct: 350 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 393
Query: 202 EGIITRAWEGLLPNHRILIYC 222
+ + T W N RI + C
Sbjct: 394 KSVPTNGWR--CKNCRICVEC 412
Score = 43.5 bits (101), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 54/131 (41%), Gaps = 26/131 (19%)
Query: 66 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 119
A + KDE E ++ F + ++ HQ C CG G GAE + CG
Sbjct: 891 ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCG 945
Query: 120 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 179
YHP+CVS + + ++ + + C +C +C + KA + +C
Sbjct: 946 QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 990
Query: 180 RCPKAYHRKCL 190
C +YH CL
Sbjct: 991 DCDISYHTYCL 1001
>gi|432097048|gb|ELK27546.1| Histone-lysine N-methyltransferase MLL3 [Myotis davidii]
Length = 4785
Score = 46.6 bits (109), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 50/201 (24%), Positives = 76/201 (37%), Gaps = 55/201 (27%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 88
C+FC + G + CCE +C +++H G G +D + +FF E+
Sbjct: 217 CAFCKHLGATIKCCEEKCTQTYHYPCAVGA-------GTFQD----VSHFFLLCPEHIDQ 265
Query: 89 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
+ C C G + F C TCG YH C+ ++A L
Sbjct: 266 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 310
Query: 143 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 201
++ + CP K C CKQ + D + VC C K YH CL +
Sbjct: 311 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 354
Query: 202 EGIITRAWEGLLPNHRILIYC 222
+ + T W+ N RI + C
Sbjct: 355 KSVPTNGWK--CKNCRICVEC 373
>gi|348683653|gb|EGZ23468.1| hypothetical protein PHYSODRAFT_479200 [Phytophthora sojae]
Length = 466
Score = 46.6 bits (109), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 62/163 (38%), Gaps = 23/163 (14%)
Query: 28 LFDSVCSFCDNGG-DLLCCEGRCLRSFHA-TIDAGEESHCASLGLTKDEVEAMLNFFCKN 85
LF S C C NGG LL C G C R+ H +D + ++ +A + C
Sbjct: 99 LFPSSCFLCGNGGRGLLHCGGSCARAAHQHCVDKLQAPVVGQPLSAAEKKQAAERWKCAQ 158
Query: 86 CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS 145
C H+C CG LG C CG+ +H C+ + A D+
Sbjct: 159 CLRGLHRCQRCGFLGHESNGMKK----CSVLDCGYHFHAQCLP-------DAAQDE---G 204
Query: 146 IIAGESFTCPLHKCCICKQGENKADSDLQFAVCR-----RCPK 183
+ AG F CP H C C E +VC RCP+
Sbjct: 205 VHAG--FVCPRHTCATCGAQETDMRRCKSCSVCHHMTHFRCPR 245
>gi|357115296|ref|XP_003559426.1| PREDICTED: uncharacterized protein LOC100827015 [Brachypodium
distachyon]
Length = 1344
Score = 46.6 bits (109), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 59/138 (42%), Gaps = 34/138 (24%)
Query: 4 FIVDDMEEDMVH------DTEEDE--------SNEEDELFDSVCSFCDNGGDLLCCEGRC 49
+VD +++D++H D + D S E D+ D C C +GG+L+CC+G C
Sbjct: 935 VLVDGLDKDLLHCLISAWDKQSDSERQSFFPVSTEGDDPNDDTCGICGDGGNLICCDG-C 993
Query: 50 LRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAE 109
+FH + EE L D+ + C NC C C + + D AE
Sbjct: 994 PSTFHMSCLELEE-------LPSDD------WRCANC-----CCKFCQEHSNDDAPDIAE 1035
Query: 110 VFP-CVSATCGHFYHPHC 126
V C + C YHP C
Sbjct: 1036 VDSLCTCSQCEENYHPVC 1053
>gi|313227685|emb|CBY22833.1| unnamed protein product [Oikopleura dioica]
Length = 1179
Score = 46.6 bits (109), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 50/123 (40%), Gaps = 19/123 (15%)
Query: 77 AMLNFFCKNCEYKQHQCFACG-----KLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLL 131
A +F C+ C ++ CF C + +D++ + + C CG YH HC
Sbjct: 629 ASTSFVCRICCERKFTCFKCKMEIVPETEENDQKGDSSIAWCPVEDCGKVYHKHC----- 683
Query: 132 LRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAV--------CRRCPK 183
L +DE++ K + CPLH C C ++ ++ C RCP
Sbjct: 684 LENDELSVPYKPKDSTQT-LYKCPLHNCRSCVLKKDDKKDSIKLNSSSKGKMFKCIRCPA 742
Query: 184 AYH 186
AYH
Sbjct: 743 AYH 745
>gi|270001730|gb|EEZ98177.1| hypothetical protein TcasGA2_TC000606 [Tribolium castaneum]
Length = 5215
Score = 46.6 bits (109), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 62/164 (37%), Gaps = 39/164 (23%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKN------ 85
CSFC++ G +LC C + +H CA+ +V +L FC N
Sbjct: 311 CSFCNHYGASILCKIEGCTKVYHFP--------CATAS-GAFQVLNLLALFCSNHIGQAS 361
Query: 86 CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS 145
E + C+ C LG A + C ++CG YH CV LA+
Sbjct: 362 LECENSVCYTCKTLGDI-----ANLMFC--SSCGEHYHGICVG-------------LAQL 401
Query: 146 IIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKC 189
+ C KC IC+ D + + C +C K YH C
Sbjct: 402 PGVRAGWQC--RKCRICQVCRMTGD-ETKLMTCEQCDKIYHSTC 442
>gi|313221637|emb|CBY36122.1| unnamed protein product [Oikopleura dioica]
Length = 913
Score = 46.6 bits (109), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 53/123 (43%), Gaps = 19/123 (15%)
Query: 77 AMLNFFCKNCEYKQHQCFACG-----KLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLL 131
A +F C+ C ++ CF C + +D++ + + C CG YH HC
Sbjct: 629 ASTSFVCRICCERKFTCFKCKMEIVPETEENDQKGDSSIAWCPVEDCGKVYHKHC----- 683
Query: 132 LRDDEVAADQLAKSIIAGESFTCPLHKC--CICKQGENKADSDL------QFAVCRRCPK 183
L +DE++ K + CPLH C C+ K+ + K L + C RCP
Sbjct: 684 LENDELSVPYKPKDSTQT-LYKCPLHNCRSCVLKKDDKKDSIKLNSSSKGKMFKCIRCPA 742
Query: 184 AYH 186
AYH
Sbjct: 743 AYH 745
>gi|328781326|ref|XP_003249962.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
[Apis mellifera]
Length = 1218
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 51/196 (26%), Positives = 67/196 (34%), Gaps = 54/196 (27%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEES--HCASLGLTKDEV----------- 75
+ VC C+ G L C+G C FH + + GE S H + D++
Sbjct: 405 EKVCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSIDENIMDDKILDDIKEIKQNN 464
Query: 76 -----------EAMLNFF-CKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 122
E FF C +C CF C E + C CG Y
Sbjct: 465 IDDDENNGKSEEQEDEFFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHY 517
Query: 123 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADS-----DLQFAV 177
H C+ +S G TCP H C C +N DS + + A
Sbjct: 518 HSSCLK------------SWPQSHWQGGRLTCPYHICHTC-SSDNPQDSHSRAPNEKLAR 564
Query: 178 CRRCPKAYH--RKCLP 191
C RCP +YH CLP
Sbjct: 565 CVRCPSSYHTSTSCLP 580
>gi|332028801|gb|EGI68830.1| Putative histone-lysine N-methyltransferase NSD2 [Acromyrmex
echinatior]
Length = 1304
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 48/195 (24%), Positives = 66/195 (33%), Gaps = 52/195 (26%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEES--HCASLGLTKDEVEAMLN------ 80
+ VC C+ G L C+G C FH + + GE S + D + + LN
Sbjct: 516 EKVCQICEKTGQLTRCKGPCYSYFHLSCVKPGESSPEYSVDENTLDDRLLSDLNIIKKSI 575
Query: 81 -----------------FFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 122
F C +C CF C E + C+ CG Y
Sbjct: 576 NGESENNGKSDEQEDETFKCIDCLSGVAPACFLCN-------EREGDRIRCIVPACGKHY 628
Query: 123 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICK----QGENKADSDLQFAVC 178
H +C+ +S G TCP H C C Q + + + A C
Sbjct: 629 HSNCLL------------SWPQSHWQGGRLTCPYHVCHTCSSDNPQDKRSRAPNEKMARC 676
Query: 179 RRCPKAYHRK--CLP 191
RCP +YH CLP
Sbjct: 677 VRCPSSYHASTLCLP 691
>gi|380012561|ref|XP_003690348.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
N-methyltransferase NSD2-like [Apis florea]
Length = 1208
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 51/196 (26%), Positives = 67/196 (34%), Gaps = 54/196 (27%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEES--HCASLGLTKDEV----------- 75
+ VC C+ G L C+G C FH + + GE S H + D++
Sbjct: 404 EKVCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSIDENIMDDKILDDIKEIKQNN 463
Query: 76 -----------EAMLNFF-CKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 122
E FF C +C CF C E + C CG Y
Sbjct: 464 IDDDENNGKSEEQEDEFFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHY 516
Query: 123 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADS-----DLQFAV 177
H C+ +S G TCP H C C +N DS + + A
Sbjct: 517 HSSCLK------------SWPQSHWQGGRLTCPYHICHTC-SSDNPQDSHSRAPNEKLAR 563
Query: 178 CRRCPKAYH--RKCLP 191
C RCP +YH CLP
Sbjct: 564 CVRCPSSYHTSTSCLP 579
>gi|256075776|ref|XP_002574192.1| SET domain protein [Schistosoma mansoni]
Length = 1343
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 62/173 (35%), Gaps = 30/173 (17%)
Query: 32 VCSFCD----NGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML-NFFCKNC 86
VC C+ G + C+G C R H HC + F C C
Sbjct: 287 VCPVCEVYSNAPGQMFQCQGPCGRIVHP--------HCMRYKTPPPADNSRPEKFRCPQC 338
Query: 87 EYKQHQCFACGKLGSSDKETGA-EVFPCVSATCGHFYHPHCVS---KLLLRDDEVAADQL 142
+ C CGK + + GA +++ C C +H C+S +L R +D +
Sbjct: 339 LIGEFLCSICGKPSETGSKDGAGQLYACQVINCSRHFHRDCLSGWPGILTR----PSDNV 394
Query: 143 AKSIIAGESFTCPLHKCCICKQGENKAD---------SDLQFAVCRRCPKAYH 186
A + CP H C C N+ + S+ F C RCP +H
Sbjct: 395 ATRTTTFQVLRCPAHTCNTCYLESNETNCHPVKKPCSSEGPFLECVRCPAVFH 447
>gi|156390320|ref|XP_001635219.1| predicted protein [Nematostella vectensis]
gi|156222310|gb|EDO43156.1| predicted protein [Nematostella vectensis]
Length = 690
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 40/110 (36%), Gaps = 21/110 (19%)
Query: 81 FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 140
F C C H CF C + G +V C C FYH C+
Sbjct: 25 FKCDECTSGVHSCFVCRQTG--------DVKACSQPLCSKFYHKECLQSYKC-------- 68
Query: 141 QLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
S I G+ CPLH C C + + + C RCP AYH CL
Sbjct: 69 ----SKIDGDRIYCPLHFCSTCISNKTPVNRG-RLTKCIRCPTAYHAGCL 113
>gi|108711065|gb|ABF98860.1| acetyltransferase, GNAT family protein, expressed [Oryza sativa
Japonica Group]
Length = 1169
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 59/141 (41%), Gaps = 39/141 (27%)
Query: 4 FIVDDMEEDMVH------DTEEDE--------SNEEDELFDSVCSFCDNGGDLLCCEGRC 49
+VD ++ D++H D + D S E D+ D C C +GG+L+CC+G C
Sbjct: 763 VLVDGLDTDLLHCLINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDG-C 821
Query: 50 LRSFHATIDAGEESHCASLGLTKDEVEAML--NFFCKNCEYKQHQCFACGKLGSSDKETG 107
+FH + C E+EA+ ++ C C C C + D +
Sbjct: 822 PSTFHMS--------CL-------ELEALPSDDWRCAKC-----SCKFCQEHSRQDAQDI 861
Query: 108 AEVFP--CVSATCGHFYHPHC 126
AEV C + C YHP C
Sbjct: 862 AEVDSSLCTCSQCEEKYHPGC 882
>gi|422293139|gb|EKU20439.1| phd zinc finger-containing protein, partial [Nannochloropsis
gaditana CCMP526]
Length = 174
Score = 46.2 bits (108), Expect = 0.11, Method: Composition-based stats.
Identities = 17/49 (34%), Positives = 26/49 (53%)
Query: 152 FTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKL 200
F CP+H C +C +G ++ Q C CP+AYH C+P + + L
Sbjct: 10 FVCPMHICDLCGKGRESGNNRNQLFPCWLCPRAYHLNCIPPACKYHEYL 58
>gi|14626277|gb|AAK71545.1|AC087852_5 unknown protein [Oryza sativa Japonica Group]
Length = 1324
Score = 46.2 bits (108), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 59/141 (41%), Gaps = 39/141 (27%)
Query: 4 FIVDDMEEDMVH------DTEEDE--------SNEEDELFDSVCSFCDNGGDLLCCEGRC 49
+VD ++ D++H D + D S E D+ D C C +GG+L+CC+G C
Sbjct: 918 VLVDGLDTDLLHCLINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDG-C 976
Query: 50 LRSFHATIDAGEESHCASLGLTKDEVEAML--NFFCKNCEYKQHQCFACGKLGSSDKETG 107
+FH + C E+EA+ ++ C C C C + D +
Sbjct: 977 PSTFHMS--------CL-------ELEALPSDDWRCAKC-----SCKFCQEHSRQDAQDI 1016
Query: 108 AEVFP--CVSATCGHFYHPHC 126
AEV C + C YHP C
Sbjct: 1017 AEVDSSLCTCSQCEEKYHPGC 1037
>gi|218193747|gb|EEC76174.1| hypothetical protein OsI_13499 [Oryza sativa Indica Group]
Length = 1305
Score = 46.2 bits (108), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 59/141 (41%), Gaps = 39/141 (27%)
Query: 4 FIVDDMEEDMVH------DTEEDE--------SNEEDELFDSVCSFCDNGGDLLCCEGRC 49
+VD ++ D++H D + D S E D+ D C C +GG+L+CC+G C
Sbjct: 899 VLVDGLDTDLLHCLINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDG-C 957
Query: 50 LRSFHATIDAGEESHCASLGLTKDEVEAML--NFFCKNCEYKQHQCFACGKLGSSDKETG 107
+FH + C E+EA+ ++ C C C C + D +
Sbjct: 958 PSTFHMS--------CL-------ELEALPSDDWRCAKC-----SCKFCQEHSRQDAQDI 997
Query: 108 AEVFP--CVSATCGHFYHPHC 126
AEV C + C YHP C
Sbjct: 998 AEVDSSLCTCSQCEEKYHPGC 1018
>gi|395539758|ref|XP_003771833.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Sarcophilus
harrisii]
Length = 4951
Score = 45.8 bits (107), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 48/200 (24%), Positives = 77/200 (38%), Gaps = 53/200 (26%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 88
C+FC + G + CCE +C + +H AG G +D + NF E+
Sbjct: 329 CAFCKHLGATIKCCEEKCTQMYHYPCAAG-------AGTFQD----ISNFSLLCPEHIDQ 377
Query: 89 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
++ C C G + G + F TCG YH C+ ++A L
Sbjct: 378 APERSKEEANCSVCDSPG----DLGDQFF---CTTCGQHYHGMCL--------DIAVTAL 422
Query: 143 AKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEE 202
++ + CP C +C+ ++ + D + VC C K YH CL + +
Sbjct: 423 KRA-----GWQCP--DCKVCQNCKHSGE-DSKMLVCDTCDKGYHTFCL-------QPVMD 467
Query: 203 GIITRAWEGLLPNHRILIYC 222
+ T W+ N RI C
Sbjct: 468 SVPTNGWK--CKNCRICAEC 485
Score = 43.1 bits (100), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 53/126 (42%), Gaps = 26/126 (20%)
Query: 71 TKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHP 124
+KDE E ++ F + ++ HQ C CG G GAE + CG YHP
Sbjct: 983 SKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHP 1037
Query: 125 HCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKA 184
+CVS + + ++ + + C +C +C + KA + +C C +
Sbjct: 1038 YCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCDDCDIS 1082
Query: 185 YHRKCL 190
YH CL
Sbjct: 1083 YHTYCL 1088
>gi|353231818|emb|CCD79173.1| putative set domain protein [Schistosoma mansoni]
Length = 1306
Score = 45.8 bits (107), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 62/173 (35%), Gaps = 30/173 (17%)
Query: 32 VCSFCD----NGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML-NFFCKNC 86
VC C+ G + C+G C R H HC + F C C
Sbjct: 287 VCPVCEVYSNAPGQMFQCQGPCGRIVHP--------HCMRYKTPPPADNSRPEKFRCPQC 338
Query: 87 EYKQHQCFACGKLGSSDKETGA-EVFPCVSATCGHFYHPHCVS---KLLLRDDEVAADQL 142
+ C CGK + + GA +++ C C +H C+S +L R +D +
Sbjct: 339 LIGEFLCSICGKPSETGSKDGAGQLYACQVINCSRHFHRDCLSGWPGILTR----PSDNV 394
Query: 143 AKSIIAGESFTCPLHKCCICKQGENKAD---------SDLQFAVCRRCPKAYH 186
A + CP H C C N+ + S+ F C RCP +H
Sbjct: 395 ATRTTTFQVLRCPAHTCNTCYLESNETNCHPVKKPCSSEGPFLECVRCPAVFH 447
>gi|72386767|ref|XP_843808.1| transcription activator [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62359823|gb|AAX80252.1| transcription activator, putative [Trypanosoma brucei]
gi|70800340|gb|AAZ10249.1| transcription activator, putative [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 948
Score = 45.8 bits (107), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 19/100 (19%)
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
H+CF CG+ +F C + C YH C ++ +L + + A
Sbjct: 795 HECFVCGE----GMRPLQPLFHC--SWCTKAYHAECANERVL----------PEGVSAPR 838
Query: 151 SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
++TCP H+C +C E A +D +C CP A+ CL
Sbjct: 839 NWTCPRHRCDLC---EKVATTDGALFMCYECPAAFCFDCL 875
>gi|253747453|gb|EET02131.1| Hypothetical protein GL50581_575 [Giardia intestinalis ATCC 50581]
Length = 292
Score = 45.8 bits (107), Expect = 0.15, Method: Composition-based stats.
Identities = 44/171 (25%), Positives = 53/171 (30%), Gaps = 36/171 (21%)
Query: 33 CSFCDNGGDLLCCEGRCLRSFHATIDAGEESH-----CASLGLTKDEVEAMLNFFCKNCE 87
CS C D C G C R F T E ++ + F C C
Sbjct: 7 CSICLGQADF-TCSGNCRRCFCKTCFTSPEVQSILQIASTPAQQSADSSTPATFLCLYCS 65
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 147
Q+ C+ C + T + F C CG HC L L V Q
Sbjct: 66 RGQNICYYCFR------PTSQKDFVCTYNNCG--ISAHCTCLLSLYKQNVMPAQ------ 111
Query: 148 AGESFTCPLHKCCICKQGE---------NKADSDLQFAVCRRCPKAYHRKC 189
C LH C IC G K++ L C RC K H KC
Sbjct: 112 ------CLLHTCAICNAGSPYPFSRSRGRKSNESLML-TCARCLKTAHLKC 155
>gi|195028344|ref|XP_001987036.1| GH21693 [Drosophila grimshawi]
gi|193903036|gb|EDW01903.1| GH21693 [Drosophila grimshawi]
Length = 1461
Score = 45.8 bits (107), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 43/167 (25%), Positives = 63/167 (37%), Gaps = 29/167 (17%)
Query: 27 ELFDSVCSFCDNGGDLLCCEGRC--LRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCK 84
+ + CSFCD G + C+ C + F AG S + E + + C
Sbjct: 151 DYLEQKCSFCDQYGASINCKMNCRQMHHFPCAAAAGCFLILESFTVFCTEHLSQVPLICS 210
Query: 85 NCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAK 144
+ +C C LG K + C S CG +H C+ L D
Sbjct: 211 D---NNVECLTCSSLGDLSK-----LIMCCS--CGDHFHSTCIGLANLPDTR-------- 252
Query: 145 SIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
++C KC IC+Q E +D++F C +C K YH CL
Sbjct: 253 -----SGWSCARCTKCQICRQQE---ANDIKFVKCEQCQKIYHANCL 291
>gi|412989266|emb|CCO15857.1| polybromo-1 [Bathycoccus prasinos]
Length = 2424
Score = 45.4 bits (106), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 18/88 (20%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
+CS C G L+CC+G C +FHA + AGE ++ A A +FC
Sbjct: 681 MCSVCGRSGSLVCCDG-CPSAFHA-VCAGEPTNRAL---------AETEWFCP------- 722
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCG 119
+C A + SDKE A+ FP + C
Sbjct: 723 ECVAVPEQIRSDKELRAQAFPFLRTECA 750
>gi|330805158|ref|XP_003290553.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
gi|325079299|gb|EGC32905.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
Length = 895
Score = 45.4 bits (106), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 81/221 (36%), Gaps = 67/221 (30%)
Query: 23 NEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFF 82
N+E + D CS C +GGDLLCC+ C +SFH C + L +E+ +++
Sbjct: 59 NKEGLINDDFCSSCKDGGDLLCCDS-CEKSFHLM--------CLNPPL--EEIPEG-DWY 106
Query: 83 CKNCEYKQHQCFACGKLGS--------------SDKETGAEVFPCVSATCGHFYHPHCVS 128
C +C+YK+ + + S +E P S+ G +S
Sbjct: 107 CNSCKYKKSKTNVTKSPSTTIINNKTYFKESEQSPEEMSPPYLPISSSPIG-----STMS 161
Query: 129 KLLLRDDEVAADQLAKSIIAGESFTCPLH-------------KCCICKQGENKADSDLQF 175
L+ D L S + +F P C +C++ N D
Sbjct: 162 SLV--------DNL--SSVNPSTFQLPQEYTKNVNRNSSKKLNCLVCEESSNSGD----I 207
Query: 176 AVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNH 216
C +C AYH C+ D G T AW L P H
Sbjct: 208 LQCNKCNAAYHSTCV-------DSSSLGNKTSAW--LCPKH 239
>gi|261326898|emb|CBH09871.1| transcription activator, putative [Trypanosoma brucei gambiense
DAL972]
Length = 948
Score = 45.4 bits (106), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 19/100 (19%)
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
H+CF CG+ +F C + C YH C ++ +L + + A
Sbjct: 795 HECFVCGE----GMRPLQPLFHC--SWCMKAYHAECANERVL----------PEGVSAPR 838
Query: 151 SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
++TCP H+C +C E A +D +C CP A+ CL
Sbjct: 839 NWTCPRHRCDLC---EKVATTDGALFMCYECPAAFCFDCL 875
>gi|307207985|gb|EFN85544.1| Probable histone-lysine N-methyltransferase NSD2 [Harpegnathos
saltator]
Length = 599
Score = 45.4 bits (106), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 50/195 (25%), Positives = 67/195 (34%), Gaps = 53/195 (27%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEES--HCASLGLTKDEVEAMLN------ 80
+ VC C+ G L C+G C FH + GE S H ++D++ +N
Sbjct: 232 EKVCQICEKTGKLTRCKGPCYSYFHLPCVKPGESSPEHSVDDNTSEDKIIDDVNVIKRSI 291
Query: 81 ----------------FFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYH 123
F C +C CF C E + C CG YH
Sbjct: 292 NDRDDNVKNEEQEDEMFKCIDCLSGVAPACFICN-------EREGDRIRCSVMACGKHYH 344
Query: 124 PHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADS-----DLQFAVC 178
C+ +S G TCP H C C +N DS + + A C
Sbjct: 345 SMCLK------------SWPQSHWQGGRLTCPYHLCHTC-SSDNPQDSHSRAPNEKMAKC 391
Query: 179 RRCPKAYHRK--CLP 191
RCP +YH CLP
Sbjct: 392 VRCPSSYHTSTLCLP 406
>gi|342320470|gb|EGU12410.1| Hypothetical Protein RTG_01435 [Rhodotorula glutinis ATCC 204091]
Length = 767
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 27/59 (45%), Gaps = 9/59 (15%)
Query: 33 CSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
C C G LCCEG CLRSFH + C L DEV ++FCK C H
Sbjct: 132 CDSCGGKGHFLCCEGGCLRSFHFS--------CLEPPLEIDEVPDE-SWFCKACRAAAH 181
>gi|413920095|gb|AFW60027.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1339
Score = 45.4 bits (106), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 65/163 (39%), Gaps = 50/163 (30%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D C FC +GG+LLCC+ C ++H + +E S ++C NC
Sbjct: 958 DDTCGFCGDGGELLCCDN-CPSTYHQACLSAKELPEGS-------------WYCHNC--- 1000
Query: 90 QHQCFAC-GKLGSSDKETGAEVFPCVSATCGHFYHPHCV--SKLLLRDDEVAADQLAKSI 146
C C G + T + +F C CG YH C+ KL L DQ++++
Sbjct: 1001 --TCQVCGGPFSEKEVSTFSAIFKCFQ--CGDAYHDTCIEQEKLPLE------DQISQTW 1050
Query: 147 IAGESFTCPLHKCCICKQ----------GENKADSDLQFAVCR 179
G+ CK+ +N DSDL +++ R
Sbjct: 1051 FCGK----------YCKEIFIGLRSHVGTDNILDSDLSWSILR 1083
>gi|390335528|ref|XP_003724176.1| PREDICTED: uncharacterized protein LOC591084 isoform 2
[Strongylocentrotus purpuratus]
Length = 4860
Score = 45.4 bits (106), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 69/174 (39%), Gaps = 30/174 (17%)
Query: 23 NEEDELFDSV---CSFCDN-GGDLLCCEGRCLRSFH--ATIDAGEESHCASLGLTKDEVE 76
N + +F + CS+C G + C E C R +H +G SL L E
Sbjct: 307 NVDKAVFSGITQRCSYCHRFGATIFCVEQGCNRVYHYPCAASSGSFQGIKSLILLCPE-- 364
Query: 77 AMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDE 136
L+ + E ++ +C C G+ +E C S CG YH C+ D
Sbjct: 365 -HLDLAEERVEMEELECAICDLPGNL-----SESLFCTS--CGQHYHGSCL------DPP 410
Query: 137 VAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
V+ D + ++ + CP C IC+ D D + VC C K YH CL
Sbjct: 411 VSIDPVVRA-----GWQCP--NCKICQTCRQPGD-DNKMLVCDTCDKGYHTFCL 456
Score = 43.9 bits (102), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 43/101 (42%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C +CG G G+E + CG YHP+CVS + + ++
Sbjct: 785 QDMCLSCGSFG-----LGSEGRLLTCSQCGQCYHPYCVSIKITK------------VVLS 827
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C C +C +G K+ + + +C C +YH CL
Sbjct: 828 KGWRC--LDCTVC-EGCGKSSDEARLLLCDDCDISYHTYCL 865
>gi|242017983|ref|XP_002429463.1| set domain protein, putative [Pediculus humanus corporis]
gi|212514395|gb|EEB16725.1| set domain protein, putative [Pediculus humanus corporis]
Length = 2269
Score = 45.1 bits (105), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 46/211 (21%), Positives = 79/211 (37%), Gaps = 51/211 (24%)
Query: 23 NEEDELFDSVCSFCDNGGDLLC---CEGR----CLRSFHATIDAGEESHCASLGLTK--- 72
NEE+ + F +N + C C+G C +A + +E+H + +
Sbjct: 1236 NEEEYVCQICFEFLENEKTIKCKGLCQGSFHKSCTEKLNANLPPIDETHMPVIKNKRRRK 1295
Query: 73 -------------DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCG 119
E + ++ C +C ++ CF C + + C +A CG
Sbjct: 1296 LFANGDSDSPCEAQEKDEEQDWMCGDCLRGENPCFVCNSRSGNRQR-------CCAAYCG 1348
Query: 120 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGES------------FTCPLHKCCICKQGEN 167
FYH C+ KL + + KS ++ +CPLH C C +N
Sbjct: 1349 KFYHQKCL-KLFPQTIYNQGAPIRKSHALQQNKTSNSESPHYQILSCPLHVCHTC-ASDN 1406
Query: 168 KADSDLQF-----AVCRRCPKAYH--RKCLP 191
+D + ++ C +CP AYH C+P
Sbjct: 1407 PSDCNSKYLSERLVKCIKCPSAYHVGNYCVP 1437
>gi|390335530|ref|XP_795757.3| PREDICTED: uncharacterized protein LOC591084 isoform 3
[Strongylocentrotus purpuratus]
Length = 4856
Score = 45.1 bits (105), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 69/174 (39%), Gaps = 30/174 (17%)
Query: 23 NEEDELFDSV---CSFCDN-GGDLLCCEGRCLRSFH--ATIDAGEESHCASLGLTKDEVE 76
N + +F + CS+C G + C E C R +H +G SL L E
Sbjct: 306 NVDKAVFSGITQRCSYCHRFGATIFCVEQGCNRVYHYPCAASSGSFQGIKSLILLCPE-- 363
Query: 77 AMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDE 136
L+ + E ++ +C C G+ +E C S CG YH C+ D
Sbjct: 364 -HLDLAEERVEMEELECAICDLPGNL-----SESLFCTS--CGQHYHGSCL------DPP 409
Query: 137 VAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
V+ D + ++ + CP C IC+ D D + VC C K YH CL
Sbjct: 410 VSIDPVVRA-----GWQCP--NCKICQTCRQPGD-DNKMLVCDTCDKGYHTFCL 455
Score = 43.9 bits (102), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 43/101 (42%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C +CG G G+E + CG YHP+CVS + + ++
Sbjct: 765 QDMCLSCGSFG-----LGSEGRLLTCSQCGQCYHPYCVSIKITK------------VVLS 807
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C C +C +G K+ + + +C C +YH CL
Sbjct: 808 KGWRC--LDCTVC-EGCGKSSDEARLLLCDDCDISYHTYCL 845
>gi|400599137|gb|EJP66841.1| PHD-finger domain-containing protein [Beauveria bassiana ARSEF
2860]
Length = 1226
Score = 45.1 bits (105), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 36/70 (51%), Gaps = 16/70 (22%)
Query: 17 TEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVE 76
T +D + + DE CS C N GD+LCC+G C RSFH C +L ++D +
Sbjct: 757 TPKDPATDNDEF----CSACGNAGDVLCCDG-CPRSFHF--------ECVNLAQSEDLPD 803
Query: 77 AMLNFFCKNC 86
+++C C
Sbjct: 804 ---DWYCNEC 810
>gi|390335526|ref|XP_003724175.1| PREDICTED: uncharacterized protein LOC591084 isoform 1
[Strongylocentrotus purpuratus]
Length = 4873
Score = 45.1 bits (105), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 69/174 (39%), Gaps = 30/174 (17%)
Query: 23 NEEDELFDSV---CSFCDN-GGDLLCCEGRCLRSFH--ATIDAGEESHCASLGLTKDEVE 76
N + +F + CS+C G + C E C R +H +G SL L E
Sbjct: 323 NVDKAVFSGITQRCSYCHRFGATIFCVEQGCNRVYHYPCAASSGSFQGIKSLILLCPE-- 380
Query: 77 AMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDE 136
L+ + E ++ +C C G+ +E C S CG YH C+ D
Sbjct: 381 -HLDLAEERVEMEELECAICDLPGNL-----SESLFCTS--CGQHYHGSCL------DPP 426
Query: 137 VAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
V+ D + ++ + CP C IC+ D D + VC C K YH CL
Sbjct: 427 VSIDPVVRA-----GWQCP--NCKICQTCRQPGD-DNKMLVCDTCDKGYHTFCL 472
Score = 43.9 bits (102), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 43/101 (42%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C +CG G G+E + CG YHP+CVS + + ++
Sbjct: 782 QDMCLSCGSFG-----LGSEGRLLTCSQCGQCYHPYCVSIKITK------------VVLS 824
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C C +C +G K+ + + +C C +YH CL
Sbjct: 825 KGWRC--LDCTVC-EGCGKSSDEARLLLCDDCDISYHTYCL 862
>gi|413920094|gb|AFW60026.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
Length = 1168
Score = 45.1 bits (105), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 65/163 (39%), Gaps = 50/163 (30%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D C FC +GG+LLCC+ C ++H + +E S ++C NC
Sbjct: 958 DDTCGFCGDGGELLCCDN-CPSTYHQACLSAKELPEGS-------------WYCHNC--- 1000
Query: 90 QHQCFAC-GKLGSSDKETGAEVFPCVSATCGHFYHPHCV--SKLLLRDDEVAADQLAKSI 146
C C G + T + +F C CG YH C+ KL L DQ++++
Sbjct: 1001 --TCQVCGGPFSEKEVSTFSAIFKCFQ--CGDAYHDTCIEQEKLPLE------DQISQTW 1050
Query: 147 IAGESFTCPLHKCCICKQ----------GENKADSDLQFAVCR 179
G+ CK+ +N DSDL +++ R
Sbjct: 1051 FCGK----------YCKEIFIGLRSHVGTDNILDSDLSWSILR 1083
>gi|222625793|gb|EEE59925.1| hypothetical protein OsJ_12562 [Oryza sativa Japonica Group]
Length = 777
Score = 45.1 bits (105), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 59/141 (41%), Gaps = 39/141 (27%)
Query: 4 FIVDDMEEDMVH------DTEEDE--------SNEEDELFDSVCSFCDNGGDLLCCEGRC 49
+VD ++ D++H D + D S E D+ D C C +GG+L+CC+G C
Sbjct: 371 VLVDGLDTDLLHCLINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDG-C 429
Query: 50 LRSFHATIDAGEESHCASLGLTKDEVEAML--NFFCKNCEYKQHQCFACGKLGSSDKETG 107
+FH ++ E+EA+ ++ C C C C + D +
Sbjct: 430 PSTFH---------------MSCLELEALPSDDWRCAKC-----SCKFCQEHSRQDAQDI 469
Query: 108 AEVFP--CVSATCGHFYHPHC 126
AEV C + C YHP C
Sbjct: 470 AEVDSSLCTCSQCEEKYHPGC 490
>gi|224121588|ref|XP_002330738.1| predicted protein [Populus trichocarpa]
gi|222872514|gb|EEF09645.1| predicted protein [Populus trichocarpa]
Length = 727
Score = 45.1 bits (105), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 48/103 (46%), Gaps = 20/103 (19%)
Query: 24 EEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFC 83
+ED+ D C C +GG+L+CC+ C +FH A L T+D E +++C
Sbjct: 213 DEDDKNDDSCGLCGDGGELICCD-NCPSTFHQ----------ACL-CTEDLPEG--SWYC 258
Query: 84 KNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHC 126
NC C+ CG L +DKE + V C H YH C
Sbjct: 259 PNC-----TCWICGDL-VNDKEASSSVGAYKCLQCEHKYHGAC 295
>gi|380024451|ref|XP_003696009.1| PREDICTED: uncharacterized protein LOC100866111 [Apis florea]
Length = 5713
Score = 44.7 bits (104), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 65/172 (37%), Gaps = 52/172 (30%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
C+FC + G + C C R FH CA+ + ++ L+ FC QH
Sbjct: 321 CAFCSHYGAGIPCKVASCNRYFHLP--------CAAASSCFQDTKS-LSLFCS-----QH 366
Query: 92 ------------QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAA 139
C C +G + + C + CG YH CV LL
Sbjct: 367 LGQVPLLLNGDVTCMQCCGMGDV-----SNLVMC--SICGQHYHGSCVGLALL------- 412
Query: 140 DQLAKSIIAGESFTCPLHKCC-ICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ AG + C + C +C+Q E+ + + +C RC KAYH CL
Sbjct: 413 ----PGVRAG--WQCASCRVCQVCRQPEDVS----KVMLCERCEKAYHPSCL 454
>gi|405958289|gb|EKC24431.1| Histone-lysine N-methyltransferase MLL3 [Crassostrea gigas]
Length = 4990
Score = 44.7 bits (104), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 47/166 (28%), Positives = 65/166 (39%), Gaps = 38/166 (22%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
CS+C G + C C + +H CA+ G + + L C + Q
Sbjct: 1771 CSYCRRYGATITCIYPECSKKYHYP--------CAAAGACFQD-KKKLAILCPD-HSDQA 1820
Query: 92 QCFA-----CGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 146
+ A C +DK G ++F C S CGH YH C+ + EV A
Sbjct: 1821 ETIAGEEAFCVLCCQADK-IGKQLF-CTS--CGHHYHGGCLHPSVALSPEVRA------- 1869
Query: 147 IAGESFTCPLHKCC-ICKQ-GENKADSDLQFAVCRRCPKAYHRKCL 190
+ CP K C +C+Q GE D + VC C K YH CL
Sbjct: 1870 ----GWQCPDCKVCQMCRQPGE-----DSKMLVCDTCDKGYHTFCL 1906
>gi|444724233|gb|ELW64844.1| Histone-lysine N-methyltransferase MLL3 [Tupaia chinensis]
Length = 4664
Score = 44.3 bits (103), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 36/146 (24%), Positives = 58/146 (39%), Gaps = 26/146 (17%)
Query: 51 RSFHATIDAGEESHCASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDK 104
R F +++ A KDE E ++ F + ++ HQ C CG G
Sbjct: 688 RKFSPGRPRSKQASTADTPSCKDEEENSMHNTVVLFSSSDKFTAHQDMCVVCGSFGQ--- 744
Query: 105 ETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQ 164
GAE + CG YHP+CVS + + ++ + + C +C +C +
Sbjct: 745 --GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-E 787
Query: 165 GENKADSDLQFAVCRRCPKAYHRKCL 190
KA + +C C +YH CL
Sbjct: 788 ACGKATDPGRLLLCDDCDISYHTYCL 813
>gi|81294213|gb|AAI07916.1| Trim24 protein [Rattus norvegicus]
Length = 607
Score = 44.3 bits (103), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCAS--------LGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH T +H S L+K
Sbjct: 374 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLTCHVPTLTHFPSGEWICTFCRDLSK 432
Query: 73 DEVE 76
EVE
Sbjct: 433 PEVE 436
>gi|393224442|gb|EJD32834.1| hypothetical protein AURDEDRAFT_178069 [Auricularia delicata
TFB-10046 SS5]
Length = 242
Score = 44.3 bits (103), Expect = 0.47, Method: Composition-based stats.
Identities = 49/170 (28%), Positives = 67/170 (39%), Gaps = 36/170 (21%)
Query: 31 SVCSFCDNGGD--LLCCEGRCLRSFHA----TIDAGEESHCASLGLTKDEVEAMLNFFCK 84
S C C GD LL C+ CLR H+ TI G + A + + V + C+
Sbjct: 61 SSCDVCMQPGDVLLLRCD-MCLRRVHSLCLPTI-PGTDILAAVVRAEEQGVPLPRIWRCE 118
Query: 85 NCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAK 144
C + C C D+ G EV C TC YH CV ++
Sbjct: 119 ECCPRS--CSLC------DRVVGKEVVTC--RTCKWTYHTGCVR-----------EKWGM 157
Query: 145 SIIAGES----FTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
++AG+ TCP C C + A D A+C C KA+H+ CL
Sbjct: 158 VVLAGDQGFRCKTCPSESCDACGRA---APDDFGTAICGTCGKAWHKDCL 204
>gi|126341226|ref|XP_001372106.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like
[Monodelphis domestica]
Length = 4862
Score = 44.3 bits (103), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 48/200 (24%), Positives = 75/200 (37%), Gaps = 53/200 (26%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 88
C+FC + G + CCE +C + +H AG G +D NF E+
Sbjct: 294 CAFCKHLGATIKCCEEKCTQMYHYPCAAGA-------GTFQD----FSNFSLLCPEHIDQ 342
Query: 89 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
++ C C G + F C TCG YH C+ ++A L
Sbjct: 343 APERSKEEANCAVCDSPGDL-----VDQFFCT--TCGQHYHGMCL--------DIAVTPL 387
Query: 143 AKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEE 202
++ + CP C +C+ ++ + D + VC C K YH CL + +
Sbjct: 388 KRA-----GWQCP--DCKVCQNCKHSGE-DSKMLVCDTCDKGYHTFCL-------QPIMD 432
Query: 203 GIITRAWEGLLPNHRILIYC 222
+ T W+ N RI C
Sbjct: 433 SVPTNGWK--CKNCRICAEC 450
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 26/125 (20%)
Query: 72 KDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHPH 125
KDE E ++ F + ++ HQ C CG G GAE + CG YHP+
Sbjct: 952 KDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPY 1006
Query: 126 CVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAY 185
CVS + + ++ + + C +C +C + KA + +C C +Y
Sbjct: 1007 CVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISY 1051
Query: 186 HRKCL 190
H CL
Sbjct: 1052 HTYCL 1056
>gi|344276554|ref|XP_003410073.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Loxodonta africana]
Length = 4785
Score = 43.9 bits (102), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 54/131 (41%), Gaps = 26/131 (19%)
Query: 66 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 119
A + KDE E ++ F + ++ HQ C CG G GAE + CG
Sbjct: 873 ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCG 927
Query: 120 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 179
YHP+CVS + + ++ + + C +C +C + KA + +C
Sbjct: 928 QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 972
Query: 180 RCPKAYHRKCL 190
C +YH CL
Sbjct: 973 DCDISYHTYCL 983
>gi|213625298|gb|AAI70291.1| Ectodermin [Xenopus laevis]
Length = 1091
Score = 43.9 bits (102), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D SN++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 840 DGSNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 875
>gi|213627659|gb|AAI70319.1| Ectodermin [Xenopus laevis]
Length = 1091
Score = 43.9 bits (102), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D SN++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 840 DGSNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 875
>gi|242077796|ref|XP_002448834.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
gi|241940017|gb|EES13162.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
Length = 1357
Score = 43.9 bits (102), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 66/163 (40%), Gaps = 50/163 (30%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D C FC +GG+LLCC+ C ++H + +E S ++C NC
Sbjct: 982 DDTCGFCGDGGELLCCD-NCPSTYHPACLSAKELPEGS-------------WYCHNC--- 1024
Query: 90 QHQCFAC-GKLGSSDKETGAEVFPCVSATCGHFYHPHCV--SKLLLRDDEVAADQLAKSI 146
C C G + + T + +F C CG YH C+ KL L DQ++++
Sbjct: 1025 --TCQICGGPVSEKEVSTFSAIFKCFQ--CGDAYHDTCIEQEKLPLE------DQISQTW 1074
Query: 147 IAGESFTCPLHKCCICKQ----------GENKADSDLQFAVCR 179
G+ CK+ EN DS+L +++ R
Sbjct: 1075 FCGK----------YCKEIFIGLRSHVGTENILDSELSWSILR 1107
>gi|255566581|ref|XP_002524275.1| DNA binding protein, putative [Ricinus communis]
gi|223536466|gb|EEF38114.1| DNA binding protein, putative [Ricinus communis]
Length = 1336
Score = 43.9 bits (102), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 47/104 (45%), Gaps = 24/104 (23%)
Query: 25 EDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCK 84
+D+ D C C +GG+L+CC+ C +FH + EE S ++C
Sbjct: 861 DDDENDDSCGLCGDGGELICCDN-CPSTFHQACLSTEELPEGS-------------WYCP 906
Query: 85 NCEYKQHQCFACGKLGSSDKE--TGAEVFPCVSATCGHFYHPHC 126
NC C+ CG+L +DKE + F C + C H YH C
Sbjct: 907 NC-----TCWICGEL-VNDKEDINSSNAFKC--SQCEHKYHDSC 942
>gi|395730024|ref|XP_002810443.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pongo
abelii]
Length = 1127
Score = 43.9 bits (102), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D SN++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGSNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|193785757|dbj|BAG51192.1| unnamed protein product [Homo sapiens]
Length = 759
Score = 43.9 bits (102), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 509 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 544
>gi|297664001|ref|XP_002810444.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pongo
abelii]
Length = 1110
Score = 43.9 bits (102), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D SN++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGSNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|426330876|ref|XP_004026430.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Gorilla gorilla
gorilla]
Length = 759
Score = 43.9 bits (102), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 509 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 544
>gi|189230178|ref|NP_001121413.1| uncharacterized protein LOC100158501 [Xenopus (Silurana)
tropicalis]
gi|183985616|gb|AAI66113.1| LOC100158501 protein [Xenopus (Silurana) tropicalis]
Length = 336
Score = 43.9 bits (102), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 13/80 (16%)
Query: 12 DMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLT 71
D+ D E + N+++ CS C +GGDLLCCE C RSFH+ H S
Sbjct: 181 DICTDCESLQQNDDE------CSVCGDGGDLLCCE-ECPRSFHSNC------HVPSTSEA 227
Query: 72 KDEVEAMLNFFCKNCEYKQH 91
K E+ + FCK + + H
Sbjct: 228 KSSSESWICTFCKMQQMRMH 247
>gi|301123573|ref|XP_002909513.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100275|gb|EEY58327.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1294
Score = 43.9 bits (102), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 37/74 (50%), Gaps = 18/74 (24%)
Query: 26 DELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTK-DEVEAMLNFFCK 84
D+ +D CS C GG+LLCC+G C R+FH T +GL K E E +FC
Sbjct: 1188 DDQWDVDCSVCGLGGELLCCDG-CPRAFHVTC----------IGLEKIPETE----WFCN 1232
Query: 85 NCEYKQHQCFACGK 98
C + C AC K
Sbjct: 1233 ECNLQ--TCGACKK 1244
>gi|47228227|emb|CAG07622.1| unnamed protein product [Tetraodon nigroviridis]
Length = 4527
Score = 43.9 bits (102), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 56/142 (39%), Gaps = 25/142 (17%)
Query: 54 HATIDAGEESHCASLGLTKDEVEAMLN---FFCKNCEY--KQHQCFACGKLGSSDKETGA 108
+A+I+ G + + ++E AM N F N + KQ C CG G GA
Sbjct: 628 NASINPGVITVETPYPVKEEEENAMHNTVVIFSSNDNFTLKQDMCVVCGSFG-----LGA 682
Query: 109 EVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENK 168
E A CG YHP+CV + ++ + + C +C +C+
Sbjct: 683 EGRLLACAQCGQCYHPYCVGIKI------------NKVVLSKGWRC--LECTVCEACGQA 728
Query: 169 ADSDLQFAVCRRCPKAYHRKCL 190
D + +C C +YH CL
Sbjct: 729 TDPG-RLLLCDDCDISYHTYCL 749
>gi|225461640|ref|XP_002283071.1| PREDICTED: uncharacterized protein LOC100248637 [Vitis vinifera]
Length = 1444
Score = 43.9 bits (102), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 56/127 (44%), Gaps = 26/127 (20%)
Query: 14 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKD 73
+ + + DE ++ D+ C C +GG+L+CC+ C +FH C S K+
Sbjct: 1019 IKNVQIDEIDQNDD----SCGLCGDGGELICCDN-CPSTFHQA--------CLS---AKE 1062
Query: 74 EVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLR 133
E N++C NC C CG L D+E + + C H YH C+ + ++
Sbjct: 1063 LPEG--NWYCPNC-----TCRICGDL-VKDREASSSFLALKCSQCEHKYHMPCLKEKCVK 1114
Query: 134 DDEVAAD 140
EV D
Sbjct: 1115 --EVGGD 1119
>gi|147783856|emb|CAN65752.1| hypothetical protein VITISV_026339 [Vitis vinifera]
Length = 1380
Score = 43.5 bits (101), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 48/111 (43%), Gaps = 22/111 (19%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D C C +GG+L+CC+ C +FH C S K+ E N++C NC
Sbjct: 967 DDSCGLCGDGGELICCDN-CPSTFHQA--------CLS---AKELPEG--NWYCPNC--- 1009
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 140
C CG L D+E + + C H YH C+ + ++ EV D
Sbjct: 1010 --TCRICGDL-VKDREASSSFLALKCSQCEHKYHMPCLKEKCVK--EVGGD 1055
>gi|15232453|ref|NP_188116.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
gi|332642075|gb|AEE75596.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
domain [Arabidopsis thaliana]
Length = 1189
Score = 43.5 bits (101), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 49/107 (45%), Gaps = 22/107 (20%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN 80
E +D+ D C C +GG+L+CC+ C +FH C S+ +V +
Sbjct: 717 EKASDDDPNDDSCGVCGDGGELICCDN-CPSTFHQA--------CLSM-----QVLPEGS 762
Query: 81 FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCV 127
++C +C C+ C +L S + E + F C + C H YH C+
Sbjct: 763 WYCSSC-----TCWICSELVSDNAERSQD-FKC--SQCAHKYHGTCL 801
>gi|149031399|gb|EDL86389.1| rCG56742 [Rattus norvegicus]
Length = 4499
Score = 43.5 bits (101), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 32/131 (24%), Positives = 55/131 (41%), Gaps = 26/131 (19%)
Query: 66 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 119
A + KDE E ++ F + ++ HQ C CG G ++ + C + CG
Sbjct: 439 ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQGEE---GRLLAC--SQCG 493
Query: 120 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 179
YHP+CVS + + ++ + + C +C +C + KA + +C
Sbjct: 494 QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 538
Query: 180 RCPKAYHRKCL 190
C +YH CL
Sbjct: 539 DCDISYHTYCL 549
>gi|431895735|gb|ELK05154.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
Length = 4032
Score = 43.5 bits (101), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 44/109 (40%), Gaps = 23/109 (21%)
Query: 82 FCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQ 141
C +C Q C CG G GAE + CG YHP+CVS + +
Sbjct: 3 VCSSC---QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK-------- 46
Query: 142 LAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
++ + + C +C +C + KA + +C C +YH CL
Sbjct: 47 ----VVLSKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 88
>gi|8843783|dbj|BAA97331.1| unnamed protein product [Arabidopsis thaliana]
Length = 1095
Score = 43.5 bits (101), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 54/126 (42%), Gaps = 28/126 (22%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D CS C GG L+ C+G C +FHA +C L D ++FC++C
Sbjct: 703 DVFCSVCHYGGKLILCDG-CPSAFHA--------NCLGLEDVPDG-----DWFCQSC--- 745
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSA-TCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
C ACG+ T A+ +S C YHP C LR D A D L K I
Sbjct: 746 --CCGACGQFFLKTTSTNAKEEKFISCKQCELKYHPSC-----LRYDG-ACDSLDK--IL 795
Query: 149 GESFTC 154
GE + C
Sbjct: 796 GEKWFC 801
>gi|357167602|ref|XP_003581243.1| PREDICTED: uncharacterized protein LOC100841912 [Brachypodium
distachyon]
Length = 1317
Score = 43.5 bits (101), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 48/108 (44%), Gaps = 22/108 (20%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D C C +GG+L+CC+ C S+H +E S ++C +C
Sbjct: 1001 DDTCGLCGDGGELICCDN-CPASYHVACLPSQEIPDGS-------------WYCSSC--- 1043
Query: 90 QHQCFACGKLGSS-DKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDE 136
+C CG++ SS + T F C + C YH C+S +L ++E
Sbjct: 1044 --RCDVCGEVVSSKEPRTPLHAFEC--SQCERQYHIKCISGKVLCNEE 1087
>gi|350596089|ref|XP_003125883.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1, partial
[Sus scrofa]
Length = 881
Score = 43.5 bits (101), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D SN++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 631 DGSNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 666
>gi|67971672|dbj|BAE02178.1| unnamed protein product [Macaca fascicularis]
Length = 592
Score = 43.5 bits (101), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 359 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 394
>gi|20521928|dbj|BAA96030.2| KIAA1506 protein [Homo sapiens]
Length = 3310
Score = 43.5 bits (101), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 402 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 444
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 445 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 482
>gi|222629834|gb|EEE61966.1| hypothetical protein OsJ_16739 [Oryza sativa Japonica Group]
Length = 517
Score = 43.5 bits (101), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 22/104 (21%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D C FC +GG+LLCC+ C ++H T + +E S ++C NC
Sbjct: 5 DDTCGFCGDGGELLCCD-NCPSTYHQTCLSDQELPEGS-------------WYCHNC--- 47
Query: 90 QHQCFACGK-LGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 132
C +CG L + T + + C+ CG YH C+ + +L
Sbjct: 48 --TCRSCGNPLSEKEVSTFSAILKCLQ--CGDSYHDTCIDQEML 87
>gi|218195887|gb|EEC78314.1| hypothetical protein OsI_18046 [Oryza sativa Indica Group]
Length = 517
Score = 43.5 bits (101), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 22/104 (21%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D C FC +GG+LLCC+ C ++H T + +E S ++C NC
Sbjct: 5 DDTCGFCGDGGELLCCD-NCPSTYHQTCLSDQELPEGS-------------WYCHNC--- 47
Query: 90 QHQCFACGK-LGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 132
C +CG L + T + + C+ CG YH C+ + +L
Sbjct: 48 --TCRSCGNPLSEKEVSTFSAILKCLQ--CGDSYHDTCIDQEML 87
>gi|156230798|gb|AAI51838.1| MLL3 protein [Homo sapiens]
Length = 3314
Score = 43.5 bits (101), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 402 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 444
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 445 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 482
>gi|8777481|dbj|BAA97061.1| unnamed protein product [Arabidopsis thaliana]
Length = 1145
Score = 43.5 bits (101), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 49/107 (45%), Gaps = 22/107 (20%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN 80
E +D+ D C C +GG+L+CC+ C +FH C S+ +V +
Sbjct: 673 EKASDDDPNDDSCGVCGDGGELICCDN-CPSTFHQA--------CLSM-----QVLPEGS 718
Query: 81 FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCV 127
++C +C C+ C +L S + E + F C + C H YH C+
Sbjct: 719 WYCSSC-----TCWICSELVSDNAERSQD-FKC--SQCAHKYHGTCL 757
>gi|449492124|ref|XP_002187267.2| PREDICTED: histone-lysine N-methyltransferase MLL3 [Taeniopygia
guttata]
Length = 4871
Score = 43.5 bits (101), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 51/222 (22%), Positives = 87/222 (39%), Gaps = 53/222 (23%)
Query: 9 MEEDMVHDTEEDESNEEDELFDSVCSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCAS 67
M+E ++ D ++ + E E C+FC + G + CCE +C + +H AG
Sbjct: 235 MKEQLLMDVDKSCVSGETER----CAFCKHLGATIKCCEEKCTQMYHYPCAAG------- 283
Query: 68 LGLTKDEVEAMLNFFCKN-------CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGH 120
G +D + L+ C + ++ C C G + ++F TCG
Sbjct: 284 AGTFQDF--SNLSLLCPDHIDQAPERSKEEANCAVCDSPG----DLLDQLF---CTTCGQ 334
Query: 121 FYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRR 180
YH C+ D +V + A + CP C +C+ ++ + D + VC
Sbjct: 335 HYHGMCL------DIQVTPLKRA-------GWQCP--DCKVCQNCKHSGE-DNKMLVCDT 378
Query: 181 CPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYC 222
C K YH CL + + + T W+ N R+ C
Sbjct: 379 CDKGYHTFCL-------QPVMDAVPTNGWK--CKNCRVCAEC 411
Score = 43.1 bits (100), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 26/125 (20%)
Query: 72 KDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHPH 125
KDE E ++ F + ++ HQ C CG G GAE + CG YHP+
Sbjct: 886 KDEEENSMHNTVVLFSNSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPY 940
Query: 126 CVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAY 185
CVS + + ++ + + C +C +C + KA + +C C +Y
Sbjct: 941 CVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISY 985
Query: 186 HRKCL 190
H CL
Sbjct: 986 HTYCL 990
>gi|41350061|gb|AAS00364.1| unknown [Homo sapiens]
Length = 2185
Score = 43.5 bits (101), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 18 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 60
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 61 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 98
>gi|363729903|ref|XP_418542.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gallus gallus]
Length = 4906
Score = 43.5 bits (101), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 26/125 (20%)
Query: 72 KDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHPH 125
KDE E ++ F + ++ HQ C CG G GAE + CG YHP+
Sbjct: 916 KDEEENSMHNTVVLFSNSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPY 970
Query: 126 CVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAY 185
CVS + + ++ + + C +C +C + KA + +C C +Y
Sbjct: 971 CVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISY 1015
Query: 186 HRKCL 190
H CL
Sbjct: 1016 HTYCL 1020
Score = 40.4 bits (93), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 49/198 (24%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKN------ 85
C++C + G + CCE +C + +H AG G +D + L+ C +
Sbjct: 287 CAYCKHLGATIKCCEEKCTQMYHYPCAAG-------AGTFQDF--SNLSLLCPDHIDQAP 337
Query: 86 -CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAK 144
++ C C G + ++F TCG YH C+ D +V + A
Sbjct: 338 ERSKEEANCAVCDSPG----DLLDQLF---CTTCGQHYHGMCL------DIQVTPLKRA- 383
Query: 145 SIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGI 204
+ CP C +C+ ++ + D + VC C K YH CL + + +
Sbjct: 384 ------GWQCP--DCKVCQNCKHSGE-DNKMLVCDTCDKGYHTFCL-------QPVMDSV 427
Query: 205 ITRAWEGLLPNHRILIYC 222
T W+ N R+ C
Sbjct: 428 PTNGWK--CKNCRVCAEC 443
>gi|291397406|ref|XP_002715125.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 3-like
[Oryctolagus cuniculus]
Length = 4865
Score = 43.1 bits (100), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 54/131 (41%), Gaps = 26/131 (19%)
Query: 66 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 119
A + KDE E ++ F + ++ HQ C CG G GAE + CG
Sbjct: 878 ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCG 932
Query: 120 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 179
YHP+CVS + + ++ + + C +C +C + KA + +C
Sbjct: 933 QCYHPYCVSIKISK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 977
Query: 180 RCPKAYHRKCL 190
C +YH CL
Sbjct: 978 DCDISYHTYCL 988
>gi|296132030|ref|YP_003639277.1| TDP-4-keto-6-deoxy-D-glucose transaminase [Thermincola potens JR]
gi|296030608|gb|ADG81376.1| TDP-4-keto-6-deoxy-D-glucose transaminase [Thermincola potens JR]
Length = 377
Score = 43.1 bits (100), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 59/268 (22%), Positives = 95/268 (35%), Gaps = 37/268 (13%)
Query: 176 AVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPI 235
A R K P + ++KL E +T + ++P H + C
Sbjct: 88 AFVLRGAKIVFTDIRPDTMNMDEKLIERAVTERTKAIVPVHYAGVAC------------- 134
Query: 236 RDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVAST-KSSLTSKAPPQGKFSVKALKRV 294
+ TI+ K L Q V ST K G +S K
Sbjct: 135 ----------DMDTIMKIAGKHNLLVIEDAAQAVMSTYKGKALGTIGHLGCYSFHDTKNY 184
Query: 295 PSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSL 354
G+G + I++R V R K T+ R SF L +VD+ S +D + L
Sbjct: 185 T--CGEGGALLINDRRFVDRAEYIREKGTN--RSSF------LRGEVDKYSWIDVGSSYL 234
Query: 355 GERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSL- 413
L AAF+ E+ + +D + +KPLR + + ELP + +D + S
Sbjct: 235 PSELNAAFLYAQLEEAEAINRDRLKSWERYYIGLKPLRDRGVIELPHIPDDCRHNASIFY 294
Query: 414 --MKDAASSVRMEEILKRHKIPSTHAYA 439
++D + L+ H + Y
Sbjct: 295 IKVRDLQERTEFLQYLREHGVQGAFHYV 322
>gi|342877378|gb|EGU78844.1| hypothetical protein FOXB_10633 [Fusarium oxysporum Fo5176]
Length = 1223
Score = 43.1 bits (100), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 39/88 (44%), Gaps = 24/88 (27%)
Query: 4 FIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEES 63
+ + +D V D +ED CS C GD++CC+G C RSFH
Sbjct: 775 LATNGVAKDQVSDNDED------------CSACGAAGDVVCCDG-CPRSFHF-------- 813
Query: 64 HCASLGLTKDEVEAMLNFFCKNCEYKQH 91
C + +T D + +FC C YK++
Sbjct: 814 ECVGM-ITSDHLPD--EWFCNECLYKRY 838
>gi|334323908|ref|XP_001382045.2| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 1
[Monodelphis domestica]
Length = 1143
Score = 43.1 bits (100), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 25/35 (71%), Gaps = 1/35 (2%)
Query: 22 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
+N+ED+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 895 NNKEDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 928
>gi|440893214|gb|ELR46066.1| Transcription intermediary factor 1-alpha, partial [Bos grunniens
mutus]
Length = 940
Score = 43.1 bits (100), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 11/92 (11%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 707 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 765
Query: 73 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 104
EVE + N E ++ + F KL DK
Sbjct: 766 PEVEYDCDVPVHNSEKRKTEGFV--KLTPIDK 795
>gi|355726088|gb|AES08760.1| tripartite motif-containing 33 [Mustela putorius furo]
Length = 616
Score = 43.1 bits (100), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D S+++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 432 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 467
>gi|334323910|ref|XP_003340460.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 2
[Monodelphis domestica]
Length = 1126
Score = 43.1 bits (100), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 25/35 (71%), Gaps = 1/35 (2%)
Query: 22 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
+N+ED+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 895 NNKEDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 928
>gi|395535724|ref|XP_003769871.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Sarcophilus
harrisii]
Length = 962
Score = 43.1 bits (100), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 25/35 (71%), Gaps = 1/35 (2%)
Query: 22 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
SN+E++ + C+ C NGGDLLCCE +C + FH T
Sbjct: 714 SNKEEDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 747
>gi|346324763|gb|EGX94360.1| Zinc finger domain-containing protein, PHD-finger [Cordyceps
militaris CM01]
Length = 1368
Score = 43.1 bits (100), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 30/57 (52%), Gaps = 12/57 (21%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNC 86
D CS C N GD+LCC+G C RSFH C +L ++D + +++C C
Sbjct: 896 DEFCSACGNAGDVLCCDG-CPRSFHF--------ECVNLTQSEDLPD---DWYCSEC 940
>gi|302142909|emb|CBI20204.3| unnamed protein product [Vitis vinifera]
Length = 1300
Score = 43.1 bits (100), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 48/111 (43%), Gaps = 22/111 (19%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D C C +GG+L+CC+ C +FH C S K+ E N++C NC
Sbjct: 870 DDSCGLCGDGGELICCDN-CPSTFHQA--------CLS---AKELPEG--NWYCPNC--- 912
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 140
C CG L D+E + + C H YH C+ + ++ EV D
Sbjct: 913 --TCRICGDL-VKDREASSSFLALKCSQCEHKYHMPCLKEKCVK--EVGGD 958
>gi|149411557|ref|XP_001506399.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33, partial
[Ornithorhynchus anatinus]
Length = 1050
Score = 43.1 bits (100), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 25/35 (71%), Gaps = 1/35 (2%)
Query: 22 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
+N+ED+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 802 NNKEDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 835
>gi|297601684|ref|NP_001051260.2| Os03g0747600 [Oryza sativa Japonica Group]
gi|255674895|dbj|BAF13174.2| Os03g0747600 [Oryza sativa Japonica Group]
Length = 640
Score = 43.1 bits (100), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 59/141 (41%), Gaps = 39/141 (27%)
Query: 4 FIVDDMEEDMVH------DTEEDE--------SNEEDELFDSVCSFCDNGGDLLCCEGRC 49
+VD ++ D++H D + D S E D+ D C C +GG+L+CC+G C
Sbjct: 234 VLVDGLDTDLLHCLINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDG-C 292
Query: 50 LRSFHATIDAGEESHCASLGLTKDEVEAML--NFFCKNCEYKQHQCFACGKLGSSDKETG 107
+FH ++ E+EA+ ++ C C C C + D +
Sbjct: 293 PSTFH---------------MSCLELEALPSDDWRCAKC-----SCKFCQEHSRQDAQDI 332
Query: 108 AEVFP--CVSATCGHFYHPHC 126
AEV C + C YHP C
Sbjct: 333 AEVDSSLCTCSQCEEKYHPGC 353
>gi|345324243|ref|XP_003430797.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Ornithorhynchus anatinus]
Length = 4910
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 26/125 (20%)
Query: 72 KDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHPH 125
KDE E ++ F + ++ HQ C CG G GAE + CG YHP+
Sbjct: 936 KDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPY 990
Query: 126 CVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAY 185
CVS + + ++ + + C +C +C + KA + +C C +Y
Sbjct: 991 CVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISY 1035
Query: 186 HRKCL 190
H CL
Sbjct: 1036 HTYCL 1040
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 48/196 (24%), Positives = 76/196 (38%), Gaps = 45/196 (22%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAG-----EESHCASLGLTKDEVEAMLNFFCKNC 86
C+FC + G + CCE +C + +H AG + SH + L + + NC
Sbjct: 280 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTFQDFSHSSLLCPEHIDQAPERSKEEANC 339
Query: 87 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 146
C + G L + F C TCG YH C+ ++A L ++
Sbjct: 340 AV----CDSPGDL--------LDQFYCT--TCGQHYHGMCL--------DIAITPLKRA- 376
Query: 147 IAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIIT 206
+ CP C +C+ ++ + D + VC C K YH CL I + + T
Sbjct: 377 ----GWQCP--DCKVCQNCKHSGE-DNKMLVCDTCDKGYHTFCLQPVI-------DSVPT 422
Query: 207 RAWEGLLPNHRILIYC 222
W+ N R+ C
Sbjct: 423 NGWK--CKNCRVCAEC 436
>gi|15237720|ref|NP_200669.1| putative PHD finger transcription factor [Arabidopsis thaliana]
gi|332009693|gb|AED97076.1| putative PHD finger transcription factor [Arabidopsis thaliana]
Length = 1065
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 54/126 (42%), Gaps = 28/126 (22%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D CS C GG L+ C+G C +FHA +C L D ++FC++C
Sbjct: 694 DVFCSVCHYGGKLILCDG-CPSAFHA--------NCLGLEDVPDG-----DWFCQSC--- 736
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSA-TCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
C ACG+ T A+ +S C YHP C LR D A D L K I
Sbjct: 737 --CCGACGQFFLKTTSTNAKEEKFISCKQCELKYHPSC-----LRYDG-ACDSLDK--IL 786
Query: 149 GESFTC 154
GE + C
Sbjct: 787 GEKWFC 792
>gi|354478318|ref|XP_003501362.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Cricetulus
griseus]
Length = 4871
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 54/131 (41%), Gaps = 26/131 (19%)
Query: 66 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 119
A + KDE E ++ F + ++ HQ C CG G GAE + CG
Sbjct: 903 ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCG 957
Query: 120 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 179
YHP+CVS + + ++ + + C +C +C + KA + +C
Sbjct: 958 QCYHPYCVSIKISK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 1002
Query: 180 RCPKAYHRKCL 190
C +YH CL
Sbjct: 1003 DCDISYHTYCL 1013
>gi|351697386|gb|EHB00305.1| E3 ubiquitin-protein ligase TRIM33 [Heterocephalus glaber]
Length = 980
Score = 42.7 bits (99), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 730 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 765
>gi|195122760|ref|XP_002005879.1| GI18846 [Drosophila mojavensis]
gi|193910947|gb|EDW09814.1| GI18846 [Drosophila mojavensis]
Length = 1465
Score = 42.7 bits (99), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 43/168 (25%), Positives = 62/168 (36%), Gaps = 29/168 (17%)
Query: 26 DELFDSVCSFCDNGGDLLCCEGRCLRSFH--ATIDAGEESHCASLGLTKDEVEAMLNFFC 83
+E + CSFC G + C+ C + H AG S + E + + C
Sbjct: 146 NEYLEQKCSFCGQYGASITCKMNCRQVHHWPCAAAAGCFLILESFTVFCTEHLSQVPLIC 205
Query: 84 KNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLA 143
+ C C LG K + C S CG +H C+ L D
Sbjct: 206 TD---NNVGCLTCSSLGDLSK-----LIMCCS--CGDHFHSTCIGLANLPDTR------- 248
Query: 144 KSIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
++C KC IC+Q E +D++F C +C K YH CL
Sbjct: 249 ------SGWSCARCTKCQICRQQE---ANDIKFIKCEQCQKIYHATCL 287
>gi|47228685|emb|CAG07417.1| unnamed protein product [Tetraodon nigroviridis]
Length = 4301
Score = 42.7 bits (99), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/150 (24%), Positives = 62/150 (41%), Gaps = 29/150 (19%)
Query: 50 LRSFHATIDAGEESHCASLGLTK-------DEVEAMLNFFCKNCEYK--QHQCFACGKLG 100
L++ + IDA S + G+TK D ++ + F ++ Q C CG G
Sbjct: 137 LKAMASCIDAVLLSMASDTGVTKEEEEEEDDTMQNTVVLFSNTDKFVLLQDMCVVCGSFG 196
Query: 101 SSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCC 160
K + ++ C A C YHP+CV + ++ +L K + C +C
Sbjct: 197 ---KGSEGQLLAC--AQCAQCYHPYCV------NSKITKTKLRK------GWRC--LECI 237
Query: 161 ICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+C+ KA + +C C +YH CL
Sbjct: 238 VCEMC-GKASDPSRLLLCDDCDVSYHTYCL 266
>gi|332809940|ref|XP_513668.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pan
troglodytes]
gi|410219030|gb|JAA06734.1| tripartite motif containing 33 [Pan troglodytes]
gi|410250342|gb|JAA13138.1| tripartite motif containing 33 [Pan troglodytes]
gi|410297422|gb|JAA27311.1| tripartite motif containing 33 [Pan troglodytes]
Length = 1127
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|431911670|gb|ELK13818.1| Transcription intermediary factor 1-alpha [Pteropus alecto]
Length = 1615
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 44/92 (47%), Gaps = 13/92 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 1384 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 1442
Query: 73 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 104
EVE + N E K+ GKL DK
Sbjct: 1443 PEVEYDCDAPSHNSEKKKTD----GKLTPIDK 1470
>gi|171686528|ref|XP_001908205.1| hypothetical protein [Podospora anserina S mat+]
gi|170943225|emb|CAP68878.1| unnamed protein product [Podospora anserina S mat+]
Length = 1304
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 56/239 (23%), Positives = 84/239 (35%), Gaps = 60/239 (25%)
Query: 14 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKD 73
V + +++ +N + D C+ C + G+L+CC+G C RSFH +C L +D
Sbjct: 841 VGNPQKEVTNHAQDDNDDYCASCSSNGELICCDG-CTRSFHF--------NCVDPPLARD 891
Query: 74 EVEAMLN-FFCKNCE-------YKQHQ-CFACGKLGSSDKETGAEVFPCVSATC------ 118
AM N +FC C +++H FA K + A P C
Sbjct: 892 ---AMPNEWFCNVCRMTRELQPFREHTGSFALLFEKLEAKNSTAFALPPDIRNCFEGVRT 948
Query: 119 ---GHFYH--PHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDL 173
G + P + + DE A D G + C H C + + SD
Sbjct: 949 GPEGEYEEILPVVKTARKKKSDEEAPDFFRLRDAEGNAAIC--HSC------QKHSASDR 1000
Query: 174 QFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAW--------------EGLLPNHRI 218
C C +H CL + L + R W E L P HR+
Sbjct: 1001 AIIPCSACGIFWHLDCL------DPPLANPPVLRTWKCPLHIDELLAEMPEVLAPAHRV 1053
>gi|348587088|ref|XP_003479300.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 2 [Cavia
porcellus]
Length = 1128
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 878 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 913
>gi|4325109|gb|AAD17259.1| transcriptional intermediary factor 1 gamma [Homo sapiens]
Length = 1120
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|297279627|ref|XP_002801759.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 2 [Macaca
mulatta]
Length = 1110
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|297279625|ref|XP_001099267.2| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 1 [Macaca
mulatta]
Length = 1151
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 901 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 936
>gi|148671129|gb|EDL03076.1| mCG113864 [Mus musculus]
Length = 4532
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 42/102 (41%), Gaps = 20/102 (19%)
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 527 QQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVL 569
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 570 SKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 608
>gi|348587086|ref|XP_003479299.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 1 [Cavia
porcellus]
Length = 1111
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 878 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 913
>gi|402855750|ref|XP_003892478.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Papio
anubis]
Length = 1127
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|5689563|dbj|BAA83065.1| KIAA1113 protein [Homo sapiens]
Length = 1131
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 881 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 916
>gi|383419055|gb|AFH32741.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Macaca mulatta]
Length = 1127
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|332809942|ref|XP_003308352.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pan
troglodytes]
gi|410219028|gb|JAA06733.1| tripartite motif containing 33 [Pan troglodytes]
gi|410250340|gb|JAA13137.1| tripartite motif containing 33 [Pan troglodytes]
gi|410297420|gb|JAA27310.1| tripartite motif containing 33 [Pan troglodytes]
Length = 1110
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|402855754|ref|XP_003892480.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 3 [Papio
anubis]
Length = 1151
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 901 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 936
>gi|413923002|gb|AFW62934.1| putative SET-domain containing protein family, partial [Zea mays]
Length = 248
Score = 42.7 bits (99), Expect = 1.3, Method: Composition-based stats.
Identities = 29/100 (29%), Positives = 37/100 (37%), Gaps = 25/100 (25%)
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 151
+C C K+ +E C C +H CV++ S ES
Sbjct: 147 ECRLCSKIIYVGEE-----IKCSVKKCPQMFHLKCVAE-------------DTSNFTVES 188
Query: 152 FTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 191
F CP H C +CKQ + F C RC A H KC P
Sbjct: 189 FRCPQHGCMVCKQ-------KMFFWRCGRCTVAAHTKCAP 221
>gi|12407441|gb|AAG53509.1|AF220136_1 tripartite motif protein TRIM33 alpha [Homo sapiens]
gi|119577003|gb|EAW56599.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
gi|119577005|gb|EAW56601.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
gi|168273162|dbj|BAG10420.1| tripartite motif-containing protein 33 [synthetic construct]
Length = 1127
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|441636827|ref|XP_003268060.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Nomascus leucogenys]
Length = 1041
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 791 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 826
>gi|402855752|ref|XP_003892479.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Papio
anubis]
Length = 1110
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|397468071|ref|XP_003805720.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
[Pan paniscus]
Length = 1258
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 1008 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 1043
>gi|383419057|gb|AFH32742.1| E3 ubiquitin-protein ligase TRIM33 isoform beta [Macaca mulatta]
Length = 1110
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|12407443|gb|AAG53510.1|AF220137_1 tripartite motif protein TRIM33 beta [Homo sapiens]
gi|119577004|gb|EAW56600.1| tripartite motif-containing 33, isoform CRA_b [Homo sapiens]
Length = 1110
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|355745559|gb|EHH50184.1| hypothetical protein EGM_00970, partial [Macaca fascicularis]
Length = 1012
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 762 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 797
>gi|74027251|ref|NP_148980.2| E3 ubiquitin-protein ligase TRIM33 isoform beta [Homo sapiens]
Length = 1110
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|74027249|ref|NP_056990.3| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Homo sapiens]
gi|313104270|sp|Q9UPN9.3|TRI33_HUMAN RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=RET-fused gene 7
protein; Short=Protein Rfg7; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
Length = 1127
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|380797829|gb|AFE70790.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha, partial [Macaca
mulatta]
Length = 1044
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 794 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 829
>gi|380797797|gb|AFE70774.1| E3 ubiquitin-protein ligase TRIM33 isoform beta, partial [Macaca
mulatta]
Length = 1027
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 794 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 829
>gi|334348540|ref|XP_003342075.1| PREDICTED: transcription intermediary factor 1-alpha isoform 3
[Monodelphis domestica]
Length = 1060
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
ES +ED+ + C+ C NGG+LLCCE +C + FH T +GE L+K
Sbjct: 827 ESRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLTCHVPTLMNFPSGEWICTFCRDLSK 885
Query: 73 DEVE 76
EVE
Sbjct: 886 PEVE 889
>gi|149065269|gb|EDM15345.1| rCG27932, isoform CRA_a [Rattus norvegicus]
Length = 1050
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCAS--------LGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH T +H S L+K
Sbjct: 817 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLTCHVPTLTHFPSGEWICTFCRDLSK 875
Query: 73 DEVE 76
EVE
Sbjct: 876 PEVE 879
>gi|10568112|gb|AAF74766.2| ALR-like protein [Homo sapiens]
Length = 4025
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 18 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 60
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 61 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 98
>gi|74151452|dbj|BAE38840.1| unnamed protein product [Mus musculus]
Length = 764
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
E+ +ED+ + C+ C NGG+LLCCE +C + FH T
Sbjct: 531 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 565
>gi|332870121|ref|XP_519508.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Pan
troglodytes]
Length = 4026
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 18 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 60
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 61 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 98
>gi|417413221|gb|JAA52953.1| Putative transcription intermediary factor 1-alpha, partial
[Desmodus rotundus]
Length = 951
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 11/92 (11%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 718 EARKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 776
Query: 73 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 104
EVE + N E ++ + GKL DK
Sbjct: 777 PEVEYDCDAPSHNSEKRKTE--GPGKLAPIDK 806
>gi|256084142|ref|XP_002578291.1| SET domain protein [Schistosoma mansoni]
Length = 1746
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 48/134 (35%), Gaps = 40/134 (29%)
Query: 106 TGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE--SFTCPLHKCCICK 163
+G ++ C C +YHP C+ K A + G SFTCP H C C
Sbjct: 642 SGDQMIRCSVRACRRWYHPSCLRK----------PPFAVVVREGRSGSFTCPAHTCLACS 691
Query: 164 QGE--NKADSDLQFAVCRRCPKAYH--RKCLP---RKIAFEDKLEEGIITRAWEGLLPNH 216
F C CP AYH CLP ++IA PN
Sbjct: 692 AETPGTMPRPSPHFIRCVMCPAAYHPGDWCLPAGSKEIA------------------PN- 732
Query: 217 RILIYCLKHEIDDE 230
LI C +H + DE
Sbjct: 733 --LIICPRHALQDE 744
>gi|297603635|ref|NP_001054361.2| Os04g0691700 [Oryza sativa Japonica Group]
gi|215741180|dbj|BAG97675.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675919|dbj|BAF16275.2| Os04g0691700 [Oryza sativa Japonica Group]
Length = 385
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 22/104 (21%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D C FC +GG+LLCC+ C ++H T + +E S ++C NC
Sbjct: 5 DDTCGFCGDGGELLCCD-NCPSTYHQTCLSDQELPEGS-------------WYCHNC--- 47
Query: 90 QHQCFACGK-LGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 132
C +CG L + T + + C+ CG YH C+ + +L
Sbjct: 48 --TCRSCGNPLSEKEVSTFSAILKCLQ--CGDSYHDTCIDQEML 87
>gi|126340851|ref|XP_001374515.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Monodelphis domestica]
Length = 1061
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
ES +ED+ + C+ C NGG+LLCCE +C + FH T +GE L+K
Sbjct: 828 ESRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLTCHVPTLMNFPSGEWICTFCRDLSK 886
Query: 73 DEVE 76
EVE
Sbjct: 887 PEVE 890
>gi|195382495|ref|XP_002049965.1| GJ21880 [Drosophila virilis]
gi|194144762|gb|EDW61158.1| GJ21880 [Drosophila virilis]
Length = 1458
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 63/166 (37%), Gaps = 29/166 (17%)
Query: 33 CSFCDNGGDLLCCEGRCLRSFH--ATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 90
CSFC G + C+ C + H + AG S + E + + C +
Sbjct: 150 CSFCGQYGASITCKMNCRQVHHWPCSSAAGCLLILESFTVFCTEHLSQVPIICTD---NN 206
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
+C C LG K + C S CG +H C+ L D
Sbjct: 207 VECLTCSSLGDLSK-----LIMCCS--CGDHFHSTCIGLANLPDTR-------------S 246
Query: 151 SFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIA 195
++C KC IC+Q E +D++F C +C K YH CL I+
Sbjct: 247 GWSCARCTKCQICRQHET---NDIKFIKCEQCQKMYHAMCLRPTIS 289
>gi|392339743|ref|XP_003753895.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4931
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 32/131 (24%), Positives = 55/131 (41%), Gaps = 26/131 (19%)
Query: 66 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 119
A + KDE E ++ F + ++ HQ C CG G ++ + C + CG
Sbjct: 922 ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQGEE---GRLLAC--SQCG 976
Query: 120 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 179
YHP+CVS + + ++ + + C +C +C + KA + +C
Sbjct: 977 QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 1021
Query: 180 RCPKAYHRKCL 190
C +YH CL
Sbjct: 1022 DCDISYHTYCL 1032
>gi|403276503|ref|XP_003929937.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Saimiri
boliviensis boliviensis]
Length = 4029
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 18 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 60
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 61 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 98
>gi|392347077|ref|XP_003749721.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
norvegicus]
Length = 4930
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 32/131 (24%), Positives = 55/131 (41%), Gaps = 26/131 (19%)
Query: 66 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 119
A + KDE E ++ F + ++ HQ C CG G ++ + C + CG
Sbjct: 921 ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQGEE---GRLLAC--SQCG 975
Query: 120 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 179
YHP+CVS + + ++ + + C +C +C + KA + +C
Sbjct: 976 QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 1020
Query: 180 RCPKAYHRKCL 190
C +YH CL
Sbjct: 1021 DCDISYHTYCL 1031
>gi|355561196|gb|EHH17882.1| hypothetical protein EGK_14365, partial [Macaca mulatta]
Length = 4575
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 563 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 605
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 606 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 643
>gi|313212234|emb|CBY36242.1| unnamed protein product [Oikopleura dioica]
Length = 906
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 68/164 (41%), Gaps = 35/164 (21%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFH--ATIDAG---EESHCASLGLTKDEVEAMLNFFCKNC 86
CS+C N G + C C +H + +G + + C+ L + +A L C
Sbjct: 104 CSYCQNYGASVRCKVSNCSSFYHYPCAVASGAFQDAASCSVLCAPEHRHQARL-IDPSGC 162
Query: 87 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 146
E+ C G+L +++ C TCG H C+++ ++ EV A
Sbjct: 163 EF----CSQVGEL--------SDLLFCT--TCGAHSHARCLNEGIIVTGEVRA------- 201
Query: 147 IAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ +TC IC+Q K+D D Q +C C K +H CL
Sbjct: 202 -GWQCYTCK-----ICQQCR-KSDDDAQMIICETCDKGWHTYCL 238
>gi|149065270|gb|EDM15346.1| rCG27932, isoform CRA_b [Rattus norvegicus]
Length = 1016
Score = 42.4 bits (98), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCAS--------LGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH T +H S L+K
Sbjct: 783 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLTCHVPTLTHFPSGEWICTFCRDLSK 841
Query: 73 DEVE 76
EVE
Sbjct: 842 PEVE 845
>gi|348500783|ref|XP_003437952.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Oreochromis
niloticus]
Length = 4872
Score = 42.4 bits (98), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 44/197 (22%), Positives = 74/197 (37%), Gaps = 47/197 (23%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
C++C G + CC C + +H CA T ++ ++ ++ E H
Sbjct: 285 CAYCKRLGASIKCCAEGCAQLYHYP--------CAGAAGTFQDIRSLSLLCPEHIELATH 336
Query: 92 Q------CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS 145
+ C C G + ++F C S CG YH C+ +A +
Sbjct: 337 KFVDDINCVLCDSPG----DLLDQLF-CTS--CGLHYHGICLD-------------MAVT 376
Query: 146 IIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGII 205
+ + CP +C +C+ +N + D + VC C K YH CL I + +
Sbjct: 377 PLRRAGWQCP--ECKVCQTCKNPGE-DTKMLVCDMCDKGYHTFCLQPVI-------DTLP 426
Query: 206 TRAWEGLLPNHRILIYC 222
T W N R+ + C
Sbjct: 427 TNGWRCQ--NCRVCLQC 441
Score = 42.4 bits (98), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 49/126 (38%), Gaps = 25/126 (19%)
Query: 70 LTKDEVEAMLN---FFCKNCEY--KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHP 124
+ ++E AM N F N + KQ C CG G GAE A CG YHP
Sbjct: 964 IKEEEENAMHNTVVIFSSNDSFTLKQDMCVVCGSFG-----LGAEGRLLACAQCGQCYHP 1018
Query: 125 HCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKA 184
CV + + ++ + + C +C +C+ D + +C C +
Sbjct: 1019 FCVGIKITK------------VVLSKGWRC--LECTVCEACGQATDPG-RLLLCDDCDIS 1063
Query: 185 YHRKCL 190
YH CL
Sbjct: 1064 YHTYCL 1069
>gi|355680664|gb|AER96599.1| olfactory receptor family 2 subfamily AV-like protein [Mustela
putorius furo]
Length = 529
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
E+ +ED+ + C+ C NGG+LLCCE +C + FH +
Sbjct: 485 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 519
>gi|334348538|ref|XP_003342074.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Monodelphis domestica]
Length = 1027
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
ES +ED+ + C+ C NGG+LLCCE +C + FH T +GE L+K
Sbjct: 794 ESRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLTCHVPTLMNFPSGEWICTFCRDLSK 852
Query: 73 DEVE 76
EVE
Sbjct: 853 PEVE 856
>gi|367036601|ref|XP_003648681.1| hypothetical protein THITE_2106426 [Thielavia terrestris NRRL 8126]
gi|346995942|gb|AEO62345.1| hypothetical protein THITE_2106426 [Thielavia terrestris NRRL 8126]
Length = 1698
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 46/109 (42%), Gaps = 14/109 (12%)
Query: 84 KNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLA 143
K+ ++KQ CG G +D + PC C + YH C+ RD V
Sbjct: 353 KSYDFKQAHRLVCGSCGYADDRNKGALIPCQG--CSNSYHKACIGVRSTRDHRVTK---- 406
Query: 144 KSIIAGESFTCPLHKC-CICKQGENKADSDLQFAVCRRCPKAYHRKCLP 191
+A ESF C CI K+ + +A + VC+ C A + C+P
Sbjct: 407 ---VAAESFVLQCRFCICIYKKKDIRAPA---HDVCQGC-HAKNPSCVP 448
>gi|345570767|gb|EGX53588.1| hypothetical protein AOL_s00006g454 [Arthrobotrys oligospora ATCC
24927]
Length = 1515
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 54/132 (40%), Gaps = 26/132 (19%)
Query: 33 CSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQ 92
C C +GGD++CC C RS+H D +E H K + + + C QH+
Sbjct: 1133 CQVCTDGGDIVCCSS-CPRSYH--YDCLDEEH-------KYKSNGKMQYHC-----SQHE 1177
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCV----SKLLLRDDEVAADQLAKSIIA 148
C C + S + G ++ C C Y C+ +KL+ DE+ QL
Sbjct: 1178 CHDCEQKTS---DAGNMLYRC--RFCERAYCEDCLDFDNAKLI--GDEILEYQLLDYYAI 1230
Query: 149 GESFTCPLHKCC 160
++F H C
Sbjct: 1231 PQAFYVECHNCV 1242
>gi|363743208|ref|XP_418009.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Gallus gallus]
Length = 987
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 25/35 (71%), Gaps = 1/35 (2%)
Query: 22 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
+N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 739 TNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 772
>gi|356541753|ref|XP_003539338.1| PREDICTED: uncharacterized protein LOC100814680 [Glycine max]
Length = 1120
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 45/100 (45%), Gaps = 25/100 (25%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN--FFCKNCE 87
D +CS C GG+L+ C+ +C SFH T LGL E + N +FC +C
Sbjct: 737 DYICSVCHYGGELILCD-KCPSSFHKT----------CLGL-----EDIPNGDWFCPSC- 779
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCV 127
C CG+ + ++ PC+ C H YH C+
Sbjct: 780 ----CCGICGQRKIDGDDEVGQLLPCIQ--CEHKYHVRCL 813
>gi|195431535|ref|XP_002063792.1| GK15714 [Drosophila willistoni]
gi|194159877|gb|EDW74778.1| GK15714 [Drosophila willistoni]
Length = 1503
Score = 42.0 bits (97), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 40/168 (23%), Positives = 64/168 (38%), Gaps = 31/168 (18%)
Query: 27 ELFDSVCSFCDNGGDLLCCEGRCLRSFH---ATIDAGEESHCASLGLTKDEVEAMLNFFC 83
E+ + C FC G + C+ C R+ H AG S + E + + C
Sbjct: 140 EVLEQKCVFCSQYGASINCKMNC-RTVHHYPCAAAAGCLLILESFTVFCTEHLSQVPLIC 198
Query: 84 KNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLA 143
+ +C +C LG K + ++CG +H C+ L D
Sbjct: 199 SD---NNVECLSCSSLGDLSKLI-------MCSSCGDHFHSTCIGLANLPDTR------- 241
Query: 144 KSIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ C KC IC+Q ++ +D++F C +C K YH CL
Sbjct: 242 ------SGWCCARCTKCQICRQQDS---NDIKFVKCEQCQKIYHASCL 280
>gi|119574357|gb|EAW53972.1| hCG1990594, isoform CRA_b [Homo sapiens]
Length = 4884
Score = 42.0 bits (97), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 957 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 999
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 1000 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1037
>gi|124487063|ref|NP_001074852.1| histone-lysine N-methyltransferase MLL3 [Mus musculus]
Length = 4904
Score = 42.0 bits (97), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 42/102 (41%), Gaps = 20/102 (19%)
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 950 QQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVL 992
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 993 SKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1031
>gi|37999865|sp|Q8BRH4.2|MLL3_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
homolog
Length = 4903
Score = 42.0 bits (97), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 42/102 (41%), Gaps = 20/102 (19%)
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 949 QQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVL 991
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 992 SKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1030
>gi|353232109|emb|CCD79464.1| putative set domain protein [Schistosoma mansoni]
Length = 1503
Score = 42.0 bits (97), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 48/134 (35%), Gaps = 40/134 (29%)
Query: 106 TGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE--SFTCPLHKCCICK 163
+G ++ C C +YHP C+ K A + G SFTCP H C C
Sbjct: 642 SGDQMIRCSVRACRRWYHPSCLRK----------PPFAVVVREGRSGSFTCPAHTCLACS 691
Query: 164 QGE--NKADSDLQFAVCRRCPKAYH--RKCLP---RKIAFEDKLEEGIITRAWEGLLPNH 216
F C CP AYH CLP ++IA PN
Sbjct: 692 AETPGTMPRPSPHFIRCVMCPAAYHPGDWCLPAGSKEIA------------------PN- 732
Query: 217 RILIYCLKHEIDDE 230
LI C +H + DE
Sbjct: 733 --LIICPRHALQDE 744
>gi|301756811|ref|XP_002914259.1| PREDICTED: transcription intermediary factor 1-alpha-like
[Ailuropoda melanoleuca]
Length = 1118
Score = 42.0 bits (97), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 885 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 943
Query: 73 DEVE 76
EVE
Sbjct: 944 PEVE 947
>gi|410336273|gb|JAA37083.1| myeloid/lymphoid or mixed-lineage leukemia 3 [Pan troglodytes]
Length = 4912
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 957 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 999
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 1000 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1037
>gi|410644486|ref|ZP_11354967.1| hypothetical protein GAGA_0501 [Glaciecola agarilytica NO2]
gi|410136044|dbj|GAC03366.1| hypothetical protein GAGA_0501 [Glaciecola agarilytica NO2]
Length = 366
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 54/108 (50%), Gaps = 20/108 (18%)
Query: 544 CGANDFSCLMKKKLDETGKNCLYKNYDI------LPAKNDFNFEKRDWMTVEPKELAPGS 597
G D + +K L + G++C+ YDI L + FN ++RD++TV+P+ P
Sbjct: 112 IGLCDLAMSLKDYLIQEGRDCILDGYDIDPLNVELSKQKGFNVQERDFLTVQPE---PIY 168
Query: 598 RLIMGLNPPFGVKAGL-----ANKFINKALEFNPKLLILIVPPETERL 640
L++ +NPPF A + A++F+ +L+ ++P RL
Sbjct: 169 DLVL-MNPPFDSSAFVDHVRHASRFLTPT-----GVLVSVIPVNMPRL 210
>gi|297745878|emb|CBI15934.3| unnamed protein product [Vitis vinifera]
Length = 994
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 49/113 (43%), Gaps = 28/113 (24%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN 80
+SNE D +CS C GGDL+ C+ C SFH + LGL +V +
Sbjct: 657 KSNERHHENDHICSVCHYGGDLVLCD-HCPSSFHK----------SCLGL---KVGCFGD 702
Query: 81 FFCKNCEYKQHQCFACGKLGSSDKETGAE----VFPCVSATCGHFYHPHCVSK 129
+FC +C CG G + + G+E VF C C YH C+ K
Sbjct: 703 WFCPSC--------CCGICGENKFDGGSEQDNVVFSCYQ--CERQYHVGCLRK 745
>gi|449490172|ref|XP_002194420.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Taeniopygia guttata]
Length = 912
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 25/35 (71%), Gaps = 1/35 (2%)
Query: 22 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
+N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 664 NNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 697
>gi|345782718|ref|XP_533013.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Canis lupus
familiaris]
Length = 1203
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D S+++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 953 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 988
>gi|21427632|gb|AAK00583.1| MLL3 [Homo sapiens]
Length = 4911
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 957 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 999
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 1000 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1037
>gi|281208576|gb|EFA82752.1| hypothetical protein PPL_04447 [Polysphondylium pallidum PN500]
Length = 985
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 42/109 (38%), Gaps = 22/109 (20%)
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 152
CF C K D + + C S C YH C LA + S+
Sbjct: 771 CFLCKK--EEDPKGRTSIVQCRS--CPKIYHRSCAG-------------LAHT---PRSW 810
Query: 153 TCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 201
CP H C C++ N +S F +C+ CP +Y CLP I K E
Sbjct: 811 KCPRHSCHSCRKTPN--ESGGSFFICKECPSSYCITCLPNDITILQKHE 857
>gi|329664784|ref|NP_001192947.1| E3 ubiquitin-protein ligase TRIM33 [Bos taurus]
gi|296489481|tpg|DAA31594.1| TPA: tripartite motif protein TRIM33 beta-like isoform 1 [Bos
taurus]
Length = 1126
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D S+++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 876 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 911
>gi|355748156|gb|EHH52653.1| hypothetical protein EGM_13123, partial [Macaca fascicularis]
Length = 4916
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 904 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 946
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 947 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 984
>gi|119574356|gb|EAW53971.1| hCG1990594, isoform CRA_a [Homo sapiens]
Length = 4911
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 957 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 999
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 1000 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1037
>gi|91718902|ref|NP_733751.2| histone-lysine N-methyltransferase MLL3 [Homo sapiens]
gi|221222521|sp|Q8NEZ4.3|MLL3_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
Full=Homologous to ALR protein; AltName: Full=Lysine
N-methyltransferase 2C; Short=KMT2C; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
Length = 4911
Score = 42.0 bits (97), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 957 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 999
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 1000 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1037
>gi|332243363|ref|XP_003270849.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Nomascus
leucogenys]
Length = 4856
Score = 42.0 bits (97), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 889 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 931
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 932 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 969
>gi|297474056|ref|XP_002687040.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1 [Bos
taurus]
gi|296488233|tpg|DAA30346.1| TPA: tripartite motif-containing 24 isoform 1 [Bos taurus]
Length = 1050
Score = 42.0 bits (97), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 11/92 (11%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
E +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 817 EMRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 875
Query: 73 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 104
EVE + N E ++ + F KL DK
Sbjct: 876 PEVEYDCDVPVHNSEKRKTEGFV--KLTPIDK 905
>gi|426216298|ref|XP_004002402.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Ovis
aries]
Length = 1127
Score = 42.0 bits (97), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D S+++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|194754301|ref|XP_001959434.1| GF12873 [Drosophila ananassae]
gi|190620732|gb|EDV36256.1| GF12873 [Drosophila ananassae]
Length = 1486
Score = 42.0 bits (97), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 41/173 (23%), Positives = 62/173 (35%), Gaps = 41/173 (23%)
Query: 27 ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKN- 85
E + C+FC G + C+ +C + H CA+ +E+ FC
Sbjct: 141 EFLEQKCNFCGRYGASINCKMKCRQVHHWP--------CAAAAGCLLILES-FTVFCTEH 191
Query: 86 -------CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVA 138
C +C C LG K + +TCG +H CV L D
Sbjct: 192 LSQVILICSDNNVECLTCSSLGDLSKLI-------MCSTCGDHFHSTCVGLANLPDTR-- 242
Query: 139 ADQLAKSIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ C KC IC+ ++ +DL++ C +C K YH CL
Sbjct: 243 -----------SGWNCARCTKCQICRVQDS---NDLKYVKCEQCQKIYHASCL 281
>gi|395838450|ref|XP_003792128.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Otolemur
garnettii]
Length = 4945
Score = 42.0 bits (97), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 994 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 1036
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 1037 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1074
>gi|297289715|ref|XP_001107669.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Macaca
mulatta]
Length = 4785
Score = 42.0 bits (97), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 873 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 915
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 916 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 953
>gi|296489482|tpg|DAA31595.1| TPA: tripartite motif protein TRIM33 beta-like isoform 2 [Bos
taurus]
Length = 1109
Score = 42.0 bits (97), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D S+++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 876 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 911
>gi|449299559|gb|EMC95572.1| hypothetical protein BAUCODRAFT_494698 [Baudoinia compniacensis
UAMH 10762]
Length = 1048
Score = 42.0 bits (97), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 31/67 (46%), Gaps = 9/67 (13%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML 79
D+ NEE + C+ C G LLCC+G C RSFH T C L +D E
Sbjct: 622 DDDNEEQSENNDFCAACGGSGYLLCCDG-CDRSFHFT--------CLDPPLNEDASELNE 672
Query: 80 NFFCKNC 86
++C C
Sbjct: 673 PWYCFVC 679
>gi|426216300|ref|XP_004002403.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Ovis
aries]
Length = 1110
Score = 42.0 bits (97), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D S+++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|358411974|ref|XP_617403.5| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha [Bos taurus]
Length = 1047
Score = 42.0 bits (97), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 11/92 (11%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
E +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 814 EMRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 872
Query: 73 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 104
EVE + N E ++ + F KL DK
Sbjct: 873 PEVEYDCDVPVHNSEKRKTEGFV--KLTPIDK 902
>gi|431895734|gb|ELK05153.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
Length = 921
Score = 42.0 bits (97), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 75/200 (37%), Gaps = 53/200 (26%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 88
C+FC + G L CC+ RC + +H AG G +D + +FF E+
Sbjct: 220 CAFCKHLGATLRCCDERCTQMYHYPCAAG-------AGTFQD----LSHFFLLCPEHIDQ 268
Query: 89 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 142
+ C C G + F C TCG YH C+ +VA L
Sbjct: 269 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DVAVTPL 313
Query: 143 AKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEE 202
++ + CP K C Q ++ D + VC C K YH CL + +
Sbjct: 314 KRA-----GWQCPECKVC---QNCKQSGEDSKMLVCDTCDKGYHTFCL-------QPVMK 358
Query: 203 GIITRAWEGLLPNHRILIYC 222
+ T W+ N RI + C
Sbjct: 359 SVPTNGWK--CKNCRICVAC 376
>gi|359321453|ref|XP_003639599.1| PREDICTED: transcription intermediary factor 1-alpha-like isoform 1
[Canis lupus familiaris]
Length = 961
Score = 42.0 bits (97), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 728 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 786
Query: 73 DEVE 76
EVE
Sbjct: 787 PEVE 790
>gi|149747791|ref|XP_001497035.1| PREDICTED: transcription intermediary factor 1-alpha [Equus
caballus]
Length = 942
Score = 42.0 bits (97), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 709 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 767
Query: 73 DEVE 76
EVE
Sbjct: 768 PEVE 771
>gi|296210171|ref|XP_002751860.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Callithrix
jacchus]
Length = 4909
Score = 41.6 bits (96), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 951 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 993
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 994 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1031
>gi|426228499|ref|XP_004008341.1| PREDICTED: transcription intermediary factor 1-alpha [Ovis aries]
Length = 944
Score = 41.6 bits (96), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 711 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 769
Query: 73 DEVE 76
EVE
Sbjct: 770 PEVE 773
>gi|431896518|gb|ELK05930.1| E3 ubiquitin-protein ligase TRIM33 [Pteropus alecto]
Length = 1116
Score = 41.6 bits (96), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D S+++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 823 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 858
>gi|47458930|ref|YP_015792.1| putative type II DNA methylase protein [Mycoplasma mobile 163K]
gi|47458258|gb|AAT27581.1| putative type II DNA methylase protein [Mycoplasma mobile 163K]
Length = 309
Score = 41.6 bits (96), Expect = 2.6, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 41/69 (59%), Gaps = 3/69 (4%)
Query: 567 KNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGL-NPPFGVKAGLANKFINKALEFN 625
+ +DI P KN+ EK+D+ V E+ + + + NPPFG K LA KF+NK+LE +
Sbjct: 65 RAFDIDP-KNNSIIEKKDFFDVNFNEMNLNKKKTVTIGNPPFGKKGDLALKFLNKSLE-H 122
Query: 626 PKLLILIVP 634
++ I+P
Sbjct: 123 SGIVAFILP 131
>gi|357490843|ref|XP_003615709.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
gi|355517044|gb|AES98667.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
Length = 1144
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 59/128 (46%), Gaps = 27/128 (21%)
Query: 5 IVDDMEEDMV-HDTEEDESNEEDELF----DSVCSFCDNGGDLLCCEGRCLRSFHATIDA 59
++D M E M H T E ++LF D++CS C+ GG+L+ C+ +C ++H
Sbjct: 755 LLDCMMEVMQDHRTREITEKPHNDLFEGENDNICSVCNYGGELILCD-QCPSAYH----- 808
Query: 60 GEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCG 119
+C +L D ++FC +C +C CG+ + E G C+ C
Sbjct: 809 ---KNCLNLEGIPDG-----DWFCPSC-----RCGICGQNKIEETEDG-HFLTCIQ--CE 852
Query: 120 HFYHPHCV 127
H YH C+
Sbjct: 853 HKYHVECL 860
>gi|196016261|ref|XP_002117984.1| hypothetical protein TRIADDRAFT_33351 [Trichoplax adhaerens]
gi|190579457|gb|EDV19552.1| hypothetical protein TRIADDRAFT_33351 [Trichoplax adhaerens]
Length = 183
Score = 41.6 bits (96), Expect = 2.6, Method: Composition-based stats.
Identities = 33/114 (28%), Positives = 49/114 (42%), Gaps = 23/114 (20%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG +G K + C CG YH +CVS +L +S+I
Sbjct: 29 QDMCLCCGSIG---KGIDNSLIACFQ--CGQSYHHYCVSA-----------KLTRSVIV- 71
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEG 203
+ C C +C +G KA + + +C C +YH CL ++ DK+ EG
Sbjct: 72 NGWRC--LDCAVC-EGCGKAGDEDRLLLCDECDISYHTYCLNPQL---DKVPEG 119
>gi|356546822|ref|XP_003541821.1| PREDICTED: uncharacterized protein LOC100795889 [Glycine max]
Length = 1310
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 46/98 (46%), Gaps = 18/98 (18%)
Query: 24 EEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEE-------------SHCASLGL 70
+E++ D C C GG+L+CC+ C +FH + +E C +L +
Sbjct: 792 DENDKNDDSCGLCGEGGELICCDN-CPSTFHLACLSTQEIPDGDWYCTNCTCRICGNLVI 850
Query: 71 TKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGA 108
KD ++A + C CE+K H+ C L DK+ GA
Sbjct: 851 DKDTLDAHDSLQCSQCEHKYHE--KC--LEDRDKQEGA 884
>gi|440913222|gb|ELR62702.1| E3 ubiquitin-protein ligase TRIM33, partial [Bos grunniens mutus]
Length = 1032
Score = 41.6 bits (96), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D S+++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 782 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 817
>gi|24762433|ref|NP_611847.2| lost PHDs of trr [Drosophila melanogaster]
gi|21626677|gb|AAF47094.2| lost PHDs of trr [Drosophila melanogaster]
gi|85861118|gb|ABC86508.1| HL01030p [Drosophila melanogaster]
Length = 1482
Score = 41.6 bits (96), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 40/173 (23%), Positives = 62/173 (35%), Gaps = 41/173 (23%)
Query: 27 ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKN- 85
E + C+FC G + C+ C + H CA+ +E+ FC
Sbjct: 141 EFLEQKCNFCGRYGASINCKMNCRQVHHWP--------CAAAAGCLLILES-FTVFCTEH 191
Query: 86 -------CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVA 138
C +C +C LG K + +TCG +H C+ L D
Sbjct: 192 LSQVPVICSDNNVECLSCSSLGDLSKLI-------MCSTCGDHFHSTCIGLANLPDTR-- 242
Query: 139 ADQLAKSIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ C KC IC+Q ++ +D ++ C +C K YH CL
Sbjct: 243 -----------SGWNCARCTKCQICRQQDS---NDTKYVKCEQCQKTYHASCL 281
>gi|410953041|ref|XP_003983185.1| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha [Felis catus]
Length = 1236
Score = 41.6 bits (96), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 33/62 (53%), Gaps = 1/62 (1%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN 80
E+ +ED+ + C+ C NGG+LLCCE +C + FH + ++ + + E +L
Sbjct: 1041 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPRWIIVMQKCERLLL 1099
Query: 81 FF 82
F
Sbjct: 1100 FL 1101
>gi|281340137|gb|EFB15721.1| hypothetical protein PANDA_002125 [Ailuropoda melanoleuca]
Length = 1000
Score = 41.6 bits (96), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 767 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 825
Query: 73 DEVE 76
EVE
Sbjct: 826 PEVE 829
>gi|19343832|gb|AAH25482.1| Trim24 protein [Mus musculus]
Length = 516
Score = 41.6 bits (96), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
E+ +ED+ + C+ C NGG+LLCCE +C + FH T
Sbjct: 283 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 317
>gi|349603841|gb|AEP99562.1| Transcription intermediary factor 1-alpha-like protein, partial
[Equus caballus]
Length = 412
Score = 41.6 bits (96), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 179 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 237
Query: 73 DEVE 76
EVE
Sbjct: 238 PEVE 241
>gi|449271665|gb|EMC81949.1| E3 ubiquitin-protein ligase TRIM33 [Columba livia]
Length = 878
Score = 41.6 bits (96), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 25/35 (71%), Gaps = 1/35 (2%)
Query: 22 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
+N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 647 NNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 680
>gi|355733003|gb|AES10880.1| myeloid/lymphoid or mixed-lineage leukemia 3 isoform 2 [Mustela
putorius furo]
Length = 102
Score = 41.6 bits (96), Expect = 3.0, Method: Composition-based stats.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 6 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 48
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 49 KGWRC--LECTVC-EACGKASDPGRLLLCDDCDISYHTYCL 86
>gi|124513208|ref|XP_001349960.1| SET domain protein, putative [Plasmodium falciparum 3D7]
gi|23615377|emb|CAD52368.1| SET domain protein, putative [Plasmodium falciparum 3D7]
Length = 2548
Score = 41.6 bits (96), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 21/71 (29%), Positives = 36/71 (50%), Gaps = 6/71 (8%)
Query: 128 SKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGE------NKADSDLQFAVCRRC 181
+KL +E ++ + II + F CPLH C +CK+ + +K + C +C
Sbjct: 1099 NKLFNGKEETLNNENDEKIIVLKKFICPLHICYVCKEFDINNTESSKKELKNNLFRCIKC 1158
Query: 182 PKAYHRKCLPR 192
K+ HRKC+ +
Sbjct: 1159 YKSVHRKCMNQ 1169
>gi|356541759|ref|XP_003539341.1| PREDICTED: uncharacterized protein LOC100818931 [Glycine max]
Length = 1218
Score = 41.6 bits (96), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 45/100 (45%), Gaps = 25/100 (25%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN--FFCKNCE 87
D +CS C GG+L+ C+ +C SFH T LGL E + N +FC +C
Sbjct: 641 DYICSVCHYGGELILCD-KCPSSFHKT----------CLGL-----EDIPNGDWFCPSC- 683
Query: 88 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCV 127
C CG+ + ++ PC+ C H YH C+
Sbjct: 684 ----CCGICGQRKIDRDDEVEQLLPCIQ--CEHKYHVRCL 717
>gi|255552191|ref|XP_002517140.1| hypothetical protein RCOM_0912170 [Ricinus communis]
gi|223543775|gb|EEF45303.1| hypothetical protein RCOM_0912170 [Ricinus communis]
Length = 1604
Score = 41.6 bits (96), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 47/108 (43%), Gaps = 29/108 (26%)
Query: 26 DELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML---NFF 82
D+ D C C +GGDL+CC+G C +FH + C + ML ++
Sbjct: 757 DDPNDDTCGICGDGGDLICCDG-CPSTFHQS--------CLDI--------MMLPPGDWH 799
Query: 83 CKNCEYKQHQCFACGKLGSSD--KETGAEVFPCVSAT-CGHFYHPHCV 127
C NC C CG + S D +E G V ++ + C YH C+
Sbjct: 800 CPNC-----TCKFCG-IASEDFVQEDGTNVSELLTCSLCAKKYHKSCL 841
>gi|198456152|ref|XP_001360232.2| GA18992 [Drosophila pseudoobscura pseudoobscura]
gi|198135513|gb|EAL24806.2| GA18992 [Drosophila pseudoobscura pseudoobscura]
Length = 1486
Score = 41.2 bits (95), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 39/167 (23%), Positives = 61/167 (36%), Gaps = 29/167 (17%)
Query: 27 ELFDSVCSFCDNGGDLLCCEGRCLRSFH--ATIDAGEESHCASLGLTKDEVEAMLNFFCK 84
E C+FC G + C+ C + H AG S + E + + C
Sbjct: 140 EFLKQKCNFCGRYGASINCKMNCRQVHHWPCAAAAGCLLILESFTVFCTEHLSQVPLICS 199
Query: 85 NCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAK 144
+ +C +C LG K + +TCG +H C+ L D
Sbjct: 200 D---NNVECLSCSSLGDLSKLI-------MCSTCGDHFHSTCIGLANLPDTR-------- 241
Query: 145 SIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ C KC IC+Q ++ +DL++ C +C K YH C
Sbjct: 242 -----SGWNCARCTKCQICRQQDS---NDLKYVKCEQCQKIYHASCF 280
>gi|195149375|ref|XP_002015633.1| GL11176 [Drosophila persimilis]
gi|194109480|gb|EDW31523.1| GL11176 [Drosophila persimilis]
Length = 1486
Score = 41.2 bits (95), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 39/167 (23%), Positives = 61/167 (36%), Gaps = 29/167 (17%)
Query: 27 ELFDSVCSFCDNGGDLLCCEGRCLRSFH--ATIDAGEESHCASLGLTKDEVEAMLNFFCK 84
E C+FC G + C+ C + H AG S + E + + C
Sbjct: 140 EFLKQKCNFCGRYGASINCKMNCRQVHHWPCAAAAGCLLILESFTVFCTEHLSQVPLICS 199
Query: 85 NCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAK 144
+ +C +C LG K + +TCG +H C+ L D
Sbjct: 200 D---NNVECLSCSSLGDLSKLI-------MCSTCGDHFHSTCIGLANLPDTR-------- 241
Query: 145 SIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ C KC IC+Q ++ +DL++ C +C K YH C
Sbjct: 242 -----SGWNCARCTKCQICRQQDS---NDLKYVKCEQCQKIYHASCF 280
>gi|94420998|ref|NP_659542.3| transcription intermediary factor 1-alpha isoform 1 [Mus musculus]
gi|6175058|sp|Q64127.1|TIF1A_MOUSE RecName: Full=Transcription intermediary factor 1-alpha;
Short=TIF1-alpha; AltName: Full=E3 ubiquitin-protein
ligase Trim24; AltName: Full=Tripartite motif-containing
protein 24
gi|998815|gb|AAB34290.1| TIF1 isoform [Mus sp.]
gi|34785214|gb|AAH56959.1| Tripartite motif-containing 24 [Mus musculus]
gi|148681701|gb|EDL13648.1| tripartite motif protein 24, isoform CRA_a [Mus musculus]
Length = 1051
Score = 41.2 bits (95), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
E+ +ED+ + C+ C NGG+LLCCE +C + FH T
Sbjct: 818 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 852
>gi|195489371|ref|XP_002092710.1| GE14338 [Drosophila yakuba]
gi|194178811|gb|EDW92422.1| GE14338 [Drosophila yakuba]
Length = 1481
Score = 41.2 bits (95), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 40/173 (23%), Positives = 62/173 (35%), Gaps = 41/173 (23%)
Query: 27 ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKN- 85
E + C+FC G + C+ C + H CA+ +E+ FC
Sbjct: 141 EFLEQKCNFCGRYGASINCKMNCRQVHHWP--------CAAAAGCLLILES-FTVFCTEH 191
Query: 86 -------CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVA 138
C +C +C LG K + +TCG +H C+ L D
Sbjct: 192 LSQVPVICSDNNVECLSCSSLGDLSKLI-------MCSTCGDHFHSTCIGLANLPDTR-- 242
Query: 139 ADQLAKSIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ C KC IC+Q ++ +D ++ C +C K YH CL
Sbjct: 243 -----------SGWNCARCTKCQICRQQDS---NDTKYVKCEQCQKIYHASCL 281
>gi|194885797|ref|XP_001976493.1| GG22900 [Drosophila erecta]
gi|190659680|gb|EDV56893.1| GG22900 [Drosophila erecta]
Length = 1481
Score = 41.2 bits (95), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 40/173 (23%), Positives = 62/173 (35%), Gaps = 41/173 (23%)
Query: 27 ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKN- 85
E + C+FC G + C+ C + H CA+ +E+ FC
Sbjct: 141 EFLEQKCNFCGRFGASINCKMNCRQVHHWP--------CAAAAGCLLILES-FTVFCTEH 191
Query: 86 -------CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVA 138
C +C +C LG K + +TCG +H C+ L D
Sbjct: 192 LSQVPVICSDNNVECLSCSSLGDLSKLI-------MCSTCGDHFHSTCIGLANLPDTR-- 242
Query: 139 ADQLAKSIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ C KC IC+Q ++ +D ++ C +C K YH CL
Sbjct: 243 -----------SGWNCARCTKCQICRQQDS---NDTKYVKCEQCQKIYHASCL 281
>gi|342320729|gb|EGU12668.1| ISWI chromatin-remodeling complex ATPase ISW2 [Rhodotorula glutinis
ATCC 204091]
Length = 1120
Score = 41.2 bits (95), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 31/117 (26%), Positives = 54/117 (46%), Gaps = 23/117 (19%)
Query: 33 CSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQ 92
C C +GG++ C G C R++HA C+ G++K ++ A + ++C +QH
Sbjct: 980 CLTCGDGGEIYLCSG-CPRAYHA-------GDCS--GMSKRDLAATVQYYC-----QQHT 1024
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
C C + + + G +F C TC + C L D EV + L + ++ G
Sbjct: 1025 CSVCLR---NTQAAGGLLFRC--QTCPAAF---CEDDLPTTDLEVVGEVLPEFLLLG 1073
>gi|301115432|ref|XP_002905445.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110234|gb|EEY68286.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1038
Score = 41.2 bits (95), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 8/55 (14%)
Query: 7 DDMEEDMVHDTEEDESNEEDELFDS-------VCSFCDNGGDLLCCEGRCLRSFH 54
D+ ED E+DE E+D DS C C GDL+CC+ +C RS+H
Sbjct: 828 DEETEDESFRQEDDEGQEDDSGSDSELDANLDFCEVCQGAGDLVCCD-KCPRSYH 881
>gi|440918697|ref|NP_001259005.1| transcription intermediary factor 1-alpha isoform 3 [Mus musculus]
Length = 981
Score = 41.2 bits (95), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
E+ +ED+ + C+ C NGG+LLCCE +C + FH T
Sbjct: 748 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 782
>gi|74221665|dbj|BAE21530.1| unnamed protein product [Mus musculus]
Length = 981
Score = 41.2 bits (95), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
E+ +ED+ + C+ C NGG+LLCCE +C + FH T
Sbjct: 748 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 782
>gi|359477348|ref|XP_002278432.2| PREDICTED: uncharacterized protein LOC100247619 [Vitis vinifera]
Length = 1547
Score = 41.2 bits (95), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 41/97 (42%), Gaps = 20/97 (20%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
C C +GGDL+CC+G C +FH + C ++ + ++ C NC
Sbjct: 669 TCGICGDGGDLICCDG-CPSTFHQS--------CLNIQMLPSG-----DWHCPNC----- 709
Query: 92 QCFACGKLGSSDKETGAEVFPCVSAT-CGHFYHPHCV 127
C CG S+ E V V+ + C YH C+
Sbjct: 710 TCKFCGMADGSNAEDDTTVSELVTCSLCEKKYHTSCI 746
>gi|297474058|ref|XP_002687041.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2 [Bos
taurus]
gi|296488234|tpg|DAA30347.1| TPA: tripartite motif-containing 24 isoform 2 [Bos taurus]
Length = 1016
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 11/92 (11%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
E +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 783 EMRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 841
Query: 73 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 104
EVE + N E ++ + F KL DK
Sbjct: 842 PEVEYDCDVPVHNSEKRKTEGFV--KLTPIDK 871
>gi|327274410|ref|XP_003221970.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
MLL3-like [Anolis carolinensis]
Length = 4817
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 33/129 (25%), Positives = 53/129 (41%), Gaps = 26/129 (20%)
Query: 68 LGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHF 121
+ KDE E ++ F + ++ HQ C CG G GAE + CG
Sbjct: 881 ISYIKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGK-----GAEGRLLSCSQCGQC 935
Query: 122 YHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRC 181
YHP+CVS + + ++ + + C +C +C + KA + +C C
Sbjct: 936 YHPYCVSIKITK------------VVLHKGWRC--LECTVC-EACGKATDPGRLLLCDDC 980
Query: 182 PKAYHRKCL 190
+YH CL
Sbjct: 981 DISYHTYCL 989
>gi|359324309|ref|XP_003640336.1| PREDICTED: transcription intermediary factor 1-alpha-like, partial
[Canis lupus familiaris]
Length = 626
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
E+ +ED+ + C+ C NGG+LLCCE +C + FH +
Sbjct: 485 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 519
>gi|268574556|ref|XP_002642257.1| C. briggsae CBR-SET-16 protein [Caenorhabditis briggsae]
Length = 2526
Score = 41.2 bits (95), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 49/122 (40%), Gaps = 22/122 (18%)
Query: 73 DEVE-AMLNFFC-KNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVS 128
DEVE + C KN E+ Q C CG +G G E A C YH +CV+
Sbjct: 419 DEVEYTKIPVVCRKNDEFLQKTPLCLVCGSIGK-----GPEASMVSCANCSQTYHTYCVT 473
Query: 129 KLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRK 188
L D S I G + C C IC+ N D + + +C C +YH
Sbjct: 474 ---LHDK-------MNSAILGRGWRC--LDCTICEGCGNGGDEE-KLLLCDECDVSYHVY 520
Query: 189 CL 190
C+
Sbjct: 521 CM 522
>gi|358398564|gb|EHK47915.1| hypothetical protein TRIATDRAFT_129087 [Trichoderma atroviride IMI
206040]
Length = 1065
Score = 41.2 bits (95), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 32/63 (50%), Gaps = 14/63 (22%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 88
D CS C N GD++CC+G C RSFH +D + H DE ++C C +
Sbjct: 633 DEYCSACGNTGDVVCCDG-CPRSFHFECVDMVQSDHLP------DE------WYCNECSF 679
Query: 89 KQH 91
+++
Sbjct: 680 RRY 682
>gi|410989866|ref|XP_004001479.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
[Felis catus]
Length = 1211
Score = 41.2 bits (95), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 961 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 996
>gi|449496288|ref|XP_004160094.1| PREDICTED: uncharacterized LOC101214170 [Cucumis sativus]
Length = 972
Score = 41.2 bits (95), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 17/25 (68%), Positives = 20/25 (80%), Gaps = 1/25 (4%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH 54
D +CS C +GGDLLCC+G C RSFH
Sbjct: 600 DDLCSICADGGDLLCCDG-CPRSFH 623
>gi|440918679|ref|NP_001258993.1| transcription intermediary factor 1-alpha isoform 2 [Mus musculus]
gi|998813|gb|AAB34289.1| TIF1 [Mus sp.]
gi|148681703|gb|EDL13650.1| tripartite motif protein 24, isoform CRA_c [Mus musculus]
Length = 1017
Score = 41.2 bits (95), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
E+ +ED+ + C+ C NGG+LLCCE +C + FH T
Sbjct: 784 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 818
>gi|449456166|ref|XP_004145821.1| PREDICTED: uncharacterized protein LOC101214170 [Cucumis sativus]
Length = 972
Score = 41.2 bits (95), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 17/25 (68%), Positives = 20/25 (80%), Gaps = 1/25 (4%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH 54
D +CS C +GGDLLCC+G C RSFH
Sbjct: 600 DDLCSICADGGDLLCCDG-CPRSFH 623
>gi|359321455|ref|XP_852147.3| PREDICTED: transcription intermediary factor 1-alpha-like isoform 2
[Canis lupus familiaris]
Length = 1018
Score = 41.2 bits (95), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 785 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 843
Query: 73 DEVE 76
EVE
Sbjct: 844 PEVE 847
>gi|328868864|gb|EGG17242.1| hypothetical protein DFA_08232 [Dictyostelium fasciculatum]
Length = 1177
Score = 41.2 bits (95), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 28/109 (25%), Positives = 44/109 (40%), Gaps = 21/109 (19%)
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 152
CF C D++ + C S C YH C L+ + S+
Sbjct: 967 CFTCK--AEEDEKGRTSIVQCRS--CPKIYHRSCAG-------------LSHT---PRSW 1006
Query: 153 TCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 201
CP H C C++ N++ + F +C+ CP ++ CLP I +K E
Sbjct: 1007 KCPRHSCSKCRKPSNESGTG-SFFICKGCPSSFCINCLPEDIIILEKHE 1054
>gi|443696185|gb|ELT96956.1| hypothetical protein CAPTEDRAFT_106029, partial [Capitella teleta]
Length = 175
Score = 41.2 bits (95), Expect = 4.1, Method: Composition-based stats.
Identities = 28/101 (27%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C +CG LG+++ E V + CG YHP+C + L R II
Sbjct: 24 QDLCVSCGSLGANE-----ESRLIVCSQCGQCYHPYCANVKLSR------------IILE 66
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C C +C +G + + + +C C +YH CL
Sbjct: 67 KGWRC--LDCTVC-EGCGRPHDESRLILCDECDISYHIYCL 104
>gi|47228511|emb|CAG05331.1| unnamed protein product [Tetraodon nigroviridis]
Length = 3691
Score = 40.8 bits (94), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 41/102 (40%), Gaps = 20/102 (19%)
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
KQ C CG G GAE + CG YHP CV+ + R ++
Sbjct: 148 KQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPFCVNVKMTR------------VVL 190
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C+ +D + +C C +YH CL
Sbjct: 191 TKGWRC--LECTVCEACGEASDPG-RLLLCDDCDISYHTYCL 229
>gi|442620097|ref|NP_001262766.1| bonus, isoform D [Drosophila melanogaster]
gi|440217667|gb|AGB96146.1| bonus, isoform D [Drosophila melanogaster]
Length = 1119
Score = 40.8 bits (94), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 20/72 (27%), Positives = 37/72 (51%), Gaps = 7/72 (9%)
Query: 14 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKD 73
VH + + + +ED+ + C+ C +GG+L+CC+ +C + FH + H ++ D
Sbjct: 867 VHSSTDLSNTQEDDPNEDWCAVCLDGGELMCCD-KCPKVFH------QNCHIPAISSLPD 919
Query: 74 EVEAMLNFFCKN 85
E E+ C N
Sbjct: 920 ESESWQCLLCVN 931
>gi|10432686|dbj|BAB13834.1| unnamed protein product [Homo sapiens]
Length = 120
Score = 40.8 bits (94), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 72 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 107
>gi|34610109|gb|AAN11291.1| mixed-lineage leukemia 3 protein [Mus musculus]
Length = 3396
Score = 40.8 bits (94), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 42/102 (41%), Gaps = 20/102 (19%)
Query: 89 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 148
+Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 842 QQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVL 884
Query: 149 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 885 SKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 923
>gi|432103993|gb|ELK30826.1| E3 ubiquitin-protein ligase TRIM33 [Myotis davidii]
Length = 999
Score = 40.8 bits (94), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 749 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 784
>gi|440796543|gb|ELR17652.1| AT Hook plus PHD finger transcription factor family member (athp1),
putative, partial [Acanthamoeba castellanii str. Neff]
Length = 221
Score = 40.8 bits (94), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 33/60 (55%), Gaps = 13/60 (21%)
Query: 28 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHC-ASLGLTKDEVEAMLNFFCKNC 86
++ VC CDNGGDL+CC+ + GE +C A LGLT VE + +++C C
Sbjct: 62 VWKDVCWRCDNGGDLICCD----------LCPGE--YCFACLGLTHGSVEKIEDWYCGRC 109
>gi|442620101|ref|NP_001262768.1| bonus, isoform F [Drosophila melanogaster]
gi|440217669|gb|AGB96148.1| bonus, isoform F [Drosophila melanogaster]
Length = 1122
Score = 40.8 bits (94), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 20/72 (27%), Positives = 37/72 (51%), Gaps = 7/72 (9%)
Query: 14 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKD 73
VH + + + +ED+ + C+ C +GG+L+CC+ +C + FH + H ++ D
Sbjct: 871 VHSSTDLSNTQEDDPNEDWCAVCLDGGELMCCD-KCPKVFH------QNCHIPAISSLPD 923
Query: 74 EVEAMLNFFCKN 85
E E+ C N
Sbjct: 924 ESESWQCLLCVN 935
>gi|432103232|gb|ELK30472.1| Tripartite motif-containing protein 66 [Myotis davidii]
Length = 1324
Score = 40.8 bits (94), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 41/84 (48%), Gaps = 13/84 (15%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTKDEVEAMLNF 81
+ C+ C NGG+LLCC+ RC + +H + G+ LT+ E+E +
Sbjct: 1103 EDFCAVCLNGGELLCCD-RCPKVYHLSCHLPALLSFPGGDWVCTLCRSLTQPEME----Y 1157
Query: 82 FCKNCEYKQHQCFACGKLGSSDKE 105
C+N Y Q +A LG SD++
Sbjct: 1158 DCENARYNQPGSWAPPGLGISDQK 1181
>gi|442620099|ref|NP_001262767.1| bonus, isoform E [Drosophila melanogaster]
gi|440217668|gb|AGB96147.1| bonus, isoform E [Drosophila melanogaster]
Length = 1125
Score = 40.8 bits (94), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 20/72 (27%), Positives = 37/72 (51%), Gaps = 7/72 (9%)
Query: 14 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKD 73
VH + + + +ED+ + C+ C +GG+L+CC+ +C + FH + H ++ D
Sbjct: 873 VHSSTDLSNTQEDDPNEDWCAVCLDGGELMCCD-KCPKVFH------QNCHIPAISSLPD 925
Query: 74 EVEAMLNFFCKN 85
E E+ C N
Sbjct: 926 ESESWQCLLCVN 937
>gi|426358564|ref|XP_004046577.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gorilla gorilla
gorilla]
Length = 4782
Score = 40.8 bits (94), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 43/183 (23%), Positives = 63/183 (34%), Gaps = 35/183 (19%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESH---------CASLGLTKDEVEAMLNFF 82
C+FC G + CCE +C + +H AG S C E + FF
Sbjct: 204 CAFCKRFGATIKCCEEKCTQMYHYPCAAGAGSFQDFSHIFLLCPEHIDQAPERYLLDQFF 263
Query: 83 CKNCEYKQH----------------QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHC 126
C C H QC C K+ + K++G + V TC YH C
Sbjct: 264 CTTCGQHYHGMCLDIAVTPLKRAGWQCPEC-KVCQNCKQSGEDSKMLVCDTCDKGYHTFC 322
Query: 127 VSKLL--LRDDEVAADQLAKSIIAGESFTCPL-HKCCICKQGENKADSDLQFAVCRRCPK 183
+ ++ + + I G + H C IC + D+ +C C K
Sbjct: 323 LQPVMKSVPTNGWKCKNCRICIECGTRSSSQWHHNCLICDNCYQQQDN-----LCPFCGK 377
Query: 184 AYH 186
YH
Sbjct: 378 CYH 380
>gi|332023034|gb|EGI63299.1| Histone-lysine N-methyltransferase trr [Acromyrmex echinatior]
Length = 3474
Score = 40.8 bits (94), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 61/161 (37%), Gaps = 30/161 (18%)
Query: 33 CSFCDNGGDLLCCE-GRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
C++C + G + C+ C R FH A S L FC Q
Sbjct: 308 CAYCSHYGAGISCKVTSCNRYFHFPCAAASSCFQDSKNLA---------LFCSQ-HLGQV 357
Query: 92 QCFACGKLGSSDKETGAEVFPCV-SATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG- 149
CG++ +V V + CG YH CV LL + AG
Sbjct: 358 PLLLCGEVTCVQCYGMGDVSNLVMCSVCGQHYHGSCVGLALL-----------PGVRAGW 406
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ +C + C +C+Q E+ + + +C RC KAYH CL
Sbjct: 407 QCVSCRV--CQVCRQPEDVS----KVMLCERCDKAYHPGCL 441
>gi|395539485|ref|XP_003771699.1| PREDICTED: transcription intermediary factor 1-alpha [Sarcophilus
harrisii]
Length = 880
Score = 40.8 bits (94), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
ES ++D+ + C+ C NGG+LLCCE +C + FH T
Sbjct: 647 ESRKDDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 681
>gi|338725560|ref|XP_001495926.3| PREDICTED: e3 ubiquitin-protein ligase TRIM33 [Equus caballus]
Length = 1135
Score = 40.8 bits (94), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 884 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 919
>gi|47209254|emb|CAF91993.1| unnamed protein product [Tetraodon nigroviridis]
Length = 525
Score = 40.8 bits (94), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 7/71 (9%)
Query: 18 EEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEE------SHCASLGLT 71
+E+ + +ED+ + C+ C NGGDLLCC+ RC + FH E H S+G
Sbjct: 320 KENCAGKEDDPNEDWCAVCINGGDLLCCD-RCPKVFHMKCHRCERLLLYVFCHELSVGFR 378
Query: 72 KDEVEAMLNFF 82
K ++ N++
Sbjct: 379 KPVSSSVPNYY 389
>gi|9758217|dbj|BAB08573.1| unnamed protein product [Arabidopsis thaliana]
Length = 1188
Score = 40.8 bits (94), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 45/102 (44%), Gaps = 23/102 (22%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D C C +GGDL+CC+G C +FH + L + K A ++C NC
Sbjct: 650 DDTCGICGDGGDLICCDG-CPSTFHQSC----------LDIKKFPSGA---WYCYNC--- 692
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSAT--CGHFYHPHCVSK 129
C C K ++ ET P +S+ C YH C+++
Sbjct: 693 --SCKFCEKDEAAKHETST--LPSLSSCRLCEEKYHQACINQ 730
>gi|357289563|gb|AET72876.1| type II DNA modification enzyme [Phaeocystis globosa virus 12T]
gi|357292291|gb|AET73627.1| type II methylase [Phaeocystis globosa virus 14T]
Length = 279
Score = 40.8 bits (94), Expect = 5.3, Method: Composition-based stats.
Identities = 37/125 (29%), Positives = 63/125 (50%), Gaps = 17/125 (13%)
Query: 533 VNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKE 592
+ND D+ ++ G+ F +M D K L DI P +D K D+ +P
Sbjct: 35 INDYDIHLEPSAGSGSFFNIM----DNIKKVGL----DIEPTTDDV--VKMDYFDYKP-- 82
Query: 593 LAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERL---DRKESAYEL 649
+A L++G NPPFG + LA KF NK+ EF+ + I+P +R+ ++ ++L
Sbjct: 83 IAGKKYLVVG-NPPFGRVSALAVKFFNKSAEFS-NCIAFIIPRTFKRVSIQNKLNLNFKL 140
Query: 650 VWEDD 654
++ +D
Sbjct: 141 IYNED 145
>gi|2358285|gb|AAC51734.1| ALR [Homo sapiens]
Length = 5262
Score = 40.8 bits (94), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 41/160 (25%), Positives = 61/160 (38%), Gaps = 30/160 (18%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY-KQ 90
CS C G + C C R +H S L+ ++ + + Y ++
Sbjct: 172 CSHCTRLGASIPCRSPGCPRLYHFPCATASGSF-----LSMKTLQLLCPEHSEGAAYLEE 226
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
+C C G ++F C S CGH YH C+ D + A + A
Sbjct: 227 ARCAVCEGPGEL-----CDLFFCTS--CGHHYHGACL------DTALTARKRA------- 266
Query: 151 SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ CP +C +C Q K +D + VC C K YH CL
Sbjct: 267 GWQCP--ECKVC-QACRKPGNDSKMLVCETCDKGYHTFCL 303
>gi|56404945|sp|Q99PP7.2|TRI33_MOUSE RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
Full=Ectodermin homolog; AltName: Full=Transcription
intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
Full=Tripartite motif-containing protein 33
gi|41763896|gb|AAS10352.1| transcriptional intermediary factor 1 gamma [Mus musculus]
Length = 1142
Score = 40.8 bits (94), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 892 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 927
>gi|119637828|ref|NP_444400.2| E3 ubiquitin-protein ligase TRIM33 isoform 1 [Mus musculus]
Length = 1140
Score = 40.8 bits (94), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 890 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 925
>gi|395842293|ref|XP_003793952.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Otolemur garnettii]
Length = 1138
Score = 40.8 bits (94), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 888 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 923
>gi|109467304|ref|XP_001064349.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
norvegicus]
gi|392345961|ref|XP_345267.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
norvegicus]
Length = 1144
Score = 40.4 bits (93), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 894 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 929
>gi|297737048|emb|CBI26249.3| unnamed protein product [Vitis vinifera]
Length = 1264
Score = 40.4 bits (93), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 41/97 (42%), Gaps = 20/97 (20%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
C C +GGDL+CC+G C +FH + C ++ + ++ C NC
Sbjct: 501 TCGICGDGGDLICCDG-CPSTFHQS--------CLNIQMLPSG-----DWHCPNC----- 541
Query: 92 QCFACGKLGSSDKETGAEVFPCVSAT-CGHFYHPHCV 127
C CG S+ E V V+ + C YH C+
Sbjct: 542 TCKFCGMADGSNAEDDTTVSELVTCSLCEKKYHTSCI 578
>gi|119637830|ref|NP_001073299.1| E3 ubiquitin-protein ligase TRIM33 isoform 2 [Mus musculus]
Length = 1123
Score = 40.4 bits (93), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 890 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 925
>gi|390466407|ref|XP_002807066.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Callithrix
jacchus]
Length = 1127
Score = 40.4 bits (93), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|148675650|gb|EDL07597.1| tripartite motif protein 33 [Mus musculus]
Length = 951
Score = 40.4 bits (93), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 701 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 736
>gi|395739021|ref|XP_002818554.2| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha [Pongo abelii]
Length = 1221
Score = 40.4 bits (93), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 988 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 1046
Query: 73 DEVE 76
EVE
Sbjct: 1047 PEVE 1050
>gi|119578440|gb|EAW58036.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_c [Homo
sapiens]
Length = 5265
Score = 40.4 bits (93), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 41/160 (25%), Positives = 61/160 (38%), Gaps = 30/160 (18%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY-KQ 90
CS C G + C C R +H S L+ ++ + + Y ++
Sbjct: 172 CSHCTRLGASIPCRSPGCPRLYHFPCATASGSF-----LSMKTLQLLCPEHSEGAAYLEE 226
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
+C C G ++F C S CGH YH C+ D + A + A
Sbjct: 227 ARCAVCEGPGEL-----CDLFFCTS--CGHHYHGACL------DTALTARKRA------- 266
Query: 151 SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ CP +C +C Q K +D + VC C K YH CL
Sbjct: 267 GWQCP--ECKVC-QACRKPGNDSKMLVCETCDKGYHTFCL 303
>gi|218189485|gb|EEC71912.1| hypothetical protein OsI_04692 [Oryza sativa Indica Group]
Length = 892
Score = 40.4 bits (93), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 30/72 (41%), Gaps = 20/72 (27%)
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 152
CF C + G ++ C C YHPHCV K +E D + E F
Sbjct: 17 CFVC--------KDGGDIRVCDFKNCLKGYHPHCVGK-----EESFLD-------SAEQF 56
Query: 153 TCPLHKCCICKQ 164
C LHKC CK+
Sbjct: 57 ICELHKCVSCKR 68
>gi|417413404|gb|JAA53031.1| Putative e3 ubiquitin-protein ligase trim33, partial [Desmodus
rotundus]
Length = 1056
Score = 40.4 bits (93), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 806 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 841
>gi|295657191|ref|XP_002789167.1| nucleoporin Pom152 [Paracoccidioides sp. 'lutzii' Pb01]
gi|226284511|gb|EEH40077.1| nucleoporin Pom152 [Paracoccidioides sp. 'lutzii' Pb01]
Length = 1258
Score = 40.4 bits (93), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 39/157 (24%), Positives = 71/157 (45%), Gaps = 19/157 (12%)
Query: 871 LSSPTIVIDGTSPLGVHSSKTIEMLSHDDGCQCDQMPH-RSP----------VNVSSGIL 919
LS+ T+V++GT PL + S+ + + H Q Q + RSP ++ S+ +
Sbjct: 330 LSNLTLVVEGTPPLKIKYSRKVNKIDHGVSFQSIQPENFRSPLIGQGHVDSLIDSSNPDI 389
Query: 920 EGESSKPIEIPALSGIGDGGYEHFGVGMSHHSAATVIDGASSLQGLSSKSIEMPSYTQFD 979
++ IE+P + GG + + H + V + +++ + S P QF
Sbjct: 390 SWAQAQRIEVPLNESLSTGGEWIYTIEEVHDACGNVANYSTASEDGDQFSKIAPQTQQF- 448
Query: 980 DNVHQHFEDKGVPRCSPAKVIAKSSLEDHSSKSIDIP 1016
VH+ PR S ++ A+S L+ +SID+P
Sbjct: 449 -VVHER------PRVSLSRCNAQSYLQVAKDQSIDLP 478
>gi|390466409|ref|XP_003733584.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Callithrix
jacchus]
Length = 1110
Score = 40.4 bits (93), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912
>gi|115441477|ref|NP_001045018.1| Os01g0884500 [Oryza sativa Japonica Group]
gi|56784490|dbj|BAD82583.1| zinc finger (CCCH-type) protein-like [Oryza sativa Japonica Group]
gi|113534549|dbj|BAF06932.1| Os01g0884500 [Oryza sativa Japonica Group]
Length = 892
Score = 40.4 bits (93), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 30/72 (41%), Gaps = 20/72 (27%)
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 152
CF C + G ++ C C YHPHCV K +E D + E F
Sbjct: 17 CFVC--------KDGGDIRVCDFKNCLKGYHPHCVGK-----EESFLD-------SAEQF 56
Query: 153 TCPLHKCCICKQ 164
C LHKC CK+
Sbjct: 57 ICELHKCVSCKR 68
>gi|222619635|gb|EEE55767.1| hypothetical protein OsJ_04319 [Oryza sativa Japonica Group]
Length = 893
Score = 40.4 bits (93), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 30/72 (41%), Gaps = 20/72 (27%)
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 152
CF C + G ++ C C YHPHCV K +E D + E F
Sbjct: 17 CFVC--------KDGGDIRVCDFKNCLKGYHPHCVGK-----EESFLD-------SAEQF 56
Query: 153 TCPLHKCCICKQ 164
C LHKC CK+
Sbjct: 57 ICELHKCVSCKR 68
>gi|349602974|gb|AEP98947.1| E3 ubiquitin-protein ligase TRIM33-like protein, partial [Equus
caballus]
Length = 351
Score = 40.4 bits (93), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 241 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 276
>gi|356540327|ref|XP_003538641.1| PREDICTED: uncharacterized protein LOC100801863 [Glycine max]
Length = 1301
Score = 40.4 bits (93), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 43/105 (40%), Gaps = 26/105 (24%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML---NFFCKNC 86
D C C +GGDL+CC+G C +FH + C + ML + C NC
Sbjct: 710 DDTCGICGDGGDLICCDG-CPSTFHQS--------CLDI--------QMLPPGEWRCMNC 752
Query: 87 EYKQHQCFACG-KLGSSDKETGAEVFPCVSATCGHFYHPHCVSKL 130
C CG G+S+K+ + + C YH C ++
Sbjct: 753 -----TCKFCGIASGTSEKDDASVCVLHICNLCEKKYHDSCTKEM 792
>gi|355561030|gb|EHH17716.1| hypothetical protein EGK_14177, partial [Macaca mulatta]
Length = 933
Score = 40.4 bits (93), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 700 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 758
Query: 73 DEVE 76
EVE
Sbjct: 759 PEVE 762
>gi|37360250|dbj|BAC98103.1| mKIAA1113 protein [Mus musculus]
Length = 1071
Score = 40.4 bits (93), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 838 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 873
>gi|194380288|dbj|BAG63911.1| unnamed protein product [Homo sapiens]
Length = 961
Score = 40.4 bits (93), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 728 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 786
Query: 73 DEVE 76
EVE
Sbjct: 787 PEVE 790
>gi|170050214|ref|XP_001859681.1| set domain protein [Culex quinquefasciatus]
gi|167871729|gb|EDS35112.1| set domain protein [Culex quinquefasciatus]
Length = 2934
Score = 40.4 bits (93), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 63/172 (36%), Gaps = 52/172 (30%)
Query: 33 CSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCK-------- 84
C FC G L C+ C +SFH A A+ G +V N FCK
Sbjct: 233 CYFCSRYGASLVCKMSCPKSFHYPCIA------AAGGF---QVIQSYNCFCKEHLGQVPL 283
Query: 85 ------NCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVA 138
NC QC A G +G+ + C + CG YH C V
Sbjct: 284 VCTDDINC----RQCSALGDVGN--------LMMC--SICGDHYHGTC----------VG 319
Query: 139 ADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
QL + +C KC IC+ ++ S+ + C +C K YH CL
Sbjct: 320 LAQLPGVRSGWQCGSC--KKCQICRVPDS---SEGRTVGCEQCDKIYHASCL 366
>gi|395837426|ref|XP_003791635.1| PREDICTED: transcription intermediary factor 1-alpha [Otolemur
garnettii]
Length = 1014
Score = 40.4 bits (93), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 11/92 (11%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 781 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 839
Query: 73 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 104
EVE + N E K+ + A KL DK
Sbjct: 840 PEVEYDCDTSSHNSEKKKTEGLA--KLTPIDK 869
>gi|148762969|ref|NP_003473.3| histone-lysine N-methyltransferase MLL2 [Homo sapiens]
gi|313104132|sp|O14686.2|MLL2_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
Full=ALL1-related protein; AltName: Full=Lysine
N-methyltransferase 2D; Short=KMT2D; AltName:
Full=Myeloid/lymphoid or mixed-lineage leukemia protein
2
gi|119578439|gb|EAW58035.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_b [Homo
sapiens]
Length = 5537
Score = 40.4 bits (93), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 41/160 (25%), Positives = 61/160 (38%), Gaps = 30/160 (18%)
Query: 33 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY-KQ 90
CS C G + C C R +H S L+ ++ + + Y ++
Sbjct: 172 CSHCTRLGASIPCRSPGCPRLYHFPCATASGSF-----LSMKTLQLLCPEHSEGAAYLEE 226
Query: 91 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 150
+C C G ++F C S CGH YH C+ D + A + A
Sbjct: 227 ARCAVCEGPGEL-----CDLFFCTS--CGHHYHGACL------DTALTARKRA------- 266
Query: 151 SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ CP +C +C Q K +D + VC C K YH CL
Sbjct: 267 GWQCP--ECKVC-QACRKPGNDSKMLVCETCDKGYHTFCL 303
>gi|330801763|ref|XP_003288893.1| hypothetical protein DICPUDRAFT_98170 [Dictyostelium purpureum]
gi|325081038|gb|EGC34569.1| hypothetical protein DICPUDRAFT_98170 [Dictyostelium purpureum]
Length = 1560
Score = 40.4 bits (93), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 45/100 (45%), Gaps = 23/100 (23%)
Query: 35 FCDNGGD---LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
C+N D ++ CE T D + S C + K E +F+C+ CE K
Sbjct: 355 ICNNNVDHGLMVQCE---------TCDKWQHSICYGI---KQESNLPKHFYCELCENKTM 402
Query: 92 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLL 131
C CGK KE+ ++ C+S C ++ H CVSK L
Sbjct: 403 DC-VCGK-----KESNVKIIQCLS--CNNWSHLTCVSKNL 434
>gi|355748043|gb|EHH52540.1| hypothetical protein EGM_12996, partial [Macaca fascicularis]
Length = 929
Score = 40.4 bits (93), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 696 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 754
Query: 73 DEVE 76
EVE
Sbjct: 755 PEVE 758
>gi|354487414|ref|XP_003505868.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33, partial [Cricetulus
griseus]
Length = 1008
Score = 40.4 bits (93), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 758 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 793
>gi|380798995|gb|AFE71373.1| transcription intermediary factor 1-alpha isoform a, partial
[Macaca mulatta]
Length = 955
Score = 40.4 bits (93), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 722 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 780
Query: 73 DEVE 76
EVE
Sbjct: 781 PEVE 784
>gi|221509248|gb|EEE34817.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
VEG]
Length = 742
Score = 40.4 bits (93), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 13/74 (17%)
Query: 14 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTK 72
D EE S E DS+C C GG+++CCEG C SFH +D LT+
Sbjct: 3 ASDAEEARSVSSRESNDSICFCCRGGGEVVCCEG-CPNSFHVGCLDTARRPQ-----LTE 56
Query: 73 DEVEAMLNFFCKNC 86
+E +FC C
Sbjct: 57 EE------WFCPEC 64
>gi|297796793|ref|XP_002866281.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
gi|297312116|gb|EFH42540.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
lyrata]
Length = 1047
Score = 40.4 bits (93), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 45/100 (45%), Gaps = 22/100 (22%)
Query: 30 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 89
D +CS C GG L+ C+G C +FHA +C L D ++FC++C
Sbjct: 703 DVICSVCHYGGKLILCDG-CPSAFHA--------NCLGLEEVPDG-----DWFCESC--- 745
Query: 90 QHQCFACGK--LGSSDKETGAEVFPCVSATCGHFYHPHCV 127
C ACG+ L ++ K E F C YHP C+
Sbjct: 746 --CCGACGQFFLKATSKYAKEEKF-ISCKQCELKYHPSCL 782
>gi|403165978|ref|XP_003325884.2| hypothetical protein PGTG_07086 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375165992|gb|EFP81465.2| hypothetical protein PGTG_07086 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1043
Score = 40.0 bits (92), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 40/78 (51%), Gaps = 6/78 (7%)
Query: 151 SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFE--DKLEEGIITRA 208
SFTC H+C +C++ +A L C+ CP AY +CLP + + D L E ++
Sbjct: 932 SFTCSQHRCVVCQRAGPEAGGMLY--RCQTCPDAYCEECLPHENFYSMGDTLPEFLLL-G 988
Query: 209 WEGLLPNHRI-LIYCLKH 225
+ + H I + CLKH
Sbjct: 989 YGPMSQAHYIRCVECLKH 1006
>gi|327271546|ref|XP_003220548.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33-like [Anolis
carolinensis]
Length = 947
Score = 40.0 bits (92), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 25/35 (71%), Gaps = 1/35 (2%)
Query: 22 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
+N++++ + C+ C NGGDLLCCE +C + FH T
Sbjct: 699 NNKDEDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 732
>gi|291238977|ref|XP_002739402.1| PREDICTED: rCG56742-like, partial [Saccoglossus kowalevskii]
Length = 1566
Score = 40.0 bits (92), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 44/101 (43%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C +CG G ++ + C + CG YHP+CV+ + + ++
Sbjct: 798 QDMCVSCGSFG---RDAEGRLLTC--SQCGQCYHPYCVNIKITK------------VVLS 840
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C C +C +G KA + + +C C +YH CL
Sbjct: 841 KGWRC--LDCTVC-EGCGKASDEGRLLLCDDCDISYHTYCL 878
>gi|225685102|gb|EEH23386.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 1258
Score = 40.0 bits (92), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 39/157 (24%), Positives = 71/157 (45%), Gaps = 19/157 (12%)
Query: 871 LSSPTIVIDGTSPLGVHSSKTIEMLSHDDGCQCDQMPH-RSP----------VNVSSGIL 919
LS+ T+V++GT PL + S+ + + H Q Q + RSP ++ S+ +
Sbjct: 330 LSNLTLVVEGTPPLKIKYSRKVNKIDHGVSFQSIQPENFRSPLIGQGRVDSLIDPSNPDI 389
Query: 920 EGESSKPIEIPALSGIGDGGYEHFGVGMSHHSAATVIDGASSLQGLSSKSIEMPSYTQFD 979
++ IE+P + GG + + H + V + +++ + S P QF
Sbjct: 390 SWAQAQRIEVPLNESLSTGGEWIYTIEEVHDACGNVANYSTASEDGDLLSKIAPQTQQF- 448
Query: 980 DNVHQHFEDKGVPRCSPAKVIAKSSLEDHSSKSIDIP 1016
VH+ PR S ++ A+S L+ +SID+P
Sbjct: 449 -VVHER------PRVSLSRCNAQSYLQVAKDRSIDLP 478
>gi|66815871|ref|XP_641952.1| hypothetical protein DDB_G0278771 [Dictyostelium discoideum AX4]
gi|60470006|gb|EAL67987.1| hypothetical protein DDB_G0278771 [Dictyostelium discoideum AX4]
Length = 1350
Score = 40.0 bits (92), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 43/109 (39%), Gaps = 22/109 (20%)
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 152
CF+C K D + + C S C YH C LA + S+
Sbjct: 1028 CFSCKK--EEDSKGRTSMVQCRS--CPKIYHRSCAG-------------LAHT---PRSW 1067
Query: 153 TCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 201
C H C CK+ N++ F +C+ CP ++ CLP + DK E
Sbjct: 1068 KCSRHACHQCKKTPNESGGS--FFICKGCPSSFCITCLPNDVKILDKTE 1114
>gi|47419911|ref|NP_056989.2| transcription intermediary factor 1-alpha isoform a [Homo sapiens]
gi|114616226|ref|XP_519410.2| PREDICTED: transcription intermediary factor 1-alpha isoform 5 [Pan
troglodytes]
gi|397484615|ref|XP_003813469.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1 [Pan
paniscus]
gi|12746552|sp|O15164.3|TIF1A_HUMAN RecName: Full=Transcription intermediary factor 1-alpha;
Short=TIF1-alpha; AltName: Full=E3 ubiquitin-protein
ligase TRIM24; AltName: Full=RING finger protein 82;
AltName: Full=Tripartite motif-containing protein 24
gi|21040397|gb|AAH28689.2| Tripartite motif-containing 24 [Homo sapiens]
gi|51094801|gb|EAL24047.1| transcriptional intermediary factor 1 [Homo sapiens]
gi|61363838|gb|AAX42452.1| transcriptional intermediary factor 1 [synthetic construct]
gi|119604291|gb|EAW83885.1| tripartite motif-containing 24, isoform CRA_b [Homo sapiens]
gi|193786782|dbj|BAG52105.1| unnamed protein product [Homo sapiens]
gi|261860458|dbj|BAI46751.1| tripartite motif-containing protein 24 [synthetic construct]
gi|410249256|gb|JAA12595.1| tripartite motif containing 24 [Pan troglodytes]
gi|410287764|gb|JAA22482.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1050
Score = 40.0 bits (92), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 817 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 875
Query: 73 DEVE 76
EVE
Sbjct: 876 PEVE 879
>gi|344297158|ref|XP_003420266.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Loxodonta africana]
Length = 1048
Score = 40.0 bits (92), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
++ +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 816 DTRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 874
Query: 73 DEVE 76
EVE
Sbjct: 875 PEVE 878
>gi|237837385|ref|XP_002367990.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
gi|211965654|gb|EEB00850.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
Length = 743
Score = 40.0 bits (92), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 13/74 (17%)
Query: 14 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTK 72
D EE S E DS+C C GG+++CCEG C SFH +D LT+
Sbjct: 3 ASDAEEARSVSSRESNDSICFCCRGGGEVVCCEG-CPNSFHVGCLDTARRPQ-----LTE 56
Query: 73 DEVEAMLNFFCKNC 86
+E +FC C
Sbjct: 57 EE------WFCPEC 64
>gi|403276202|ref|XP_003929796.1| PREDICTED: transcription intermediary factor 1-alpha [Saimiri
boliviensis boliviensis]
Length = 1010
Score = 40.0 bits (92), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 777 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 835
Query: 73 DEVE 76
EVE
Sbjct: 836 PEVE 839
>gi|402864953|ref|XP_003896704.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Papio anubis]
Length = 1050
Score = 40.0 bits (92), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 817 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 875
Query: 73 DEVE 76
EVE
Sbjct: 876 PEVE 879
>gi|221488757|gb|EEE26971.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
GT1]
Length = 740
Score = 40.0 bits (92), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 13/74 (17%)
Query: 14 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTK 72
D EE S E DS+C C GG+++CCEG C SFH +D LT+
Sbjct: 3 ASDAEEARSVSSRESNDSICFCCRGGGEVVCCEG-CPNSFHVGCLDTARRPQ-----LTE 56
Query: 73 DEVEAMLNFFCKNC 86
+E +FC C
Sbjct: 57 EE------WFCPEC 64
>gi|354482186|ref|XP_003503281.1| PREDICTED: transcription intermediary factor 1-alpha [Cricetulus
griseus]
Length = 954
Score = 40.0 bits (92), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 721 EARKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 779
Query: 73 DEVE 76
EVE
Sbjct: 780 PEVE 783
>gi|332224570|ref|XP_003261442.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Nomascus leucogenys]
gi|426358062|ref|XP_004046341.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
[Gorilla gorilla gorilla]
Length = 1050
Score = 40.0 bits (92), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 817 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 875
Query: 73 DEVE 76
EVE
Sbjct: 876 PEVE 879
>gi|386780660|ref|NP_001247764.1| transcription intermediary factor 1-alpha [Macaca mulatta]
gi|383422497|gb|AFH34462.1| transcription intermediary factor 1-alpha isoform a [Macaca
mulatta]
Length = 1050
Score = 40.0 bits (92), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 817 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 875
Query: 73 DEVE 76
EVE
Sbjct: 876 PEVE 879
>gi|410215746|gb|JAA05092.1| tripartite motif containing 24 [Pan troglodytes]
Length = 1050
Score = 40.0 bits (92), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 817 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 875
Query: 73 DEVE 76
EVE
Sbjct: 876 PEVE 879
>gi|296210521|ref|XP_002807106.1| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
1-alpha-like [Callithrix jacchus]
Length = 1045
Score = 40.0 bits (92), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 812 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 870
Query: 73 DEVE 76
EVE
Sbjct: 871 PEVE 874
>gi|444724699|gb|ELW65298.1| E3 ubiquitin-protein ligase TRIM33 [Tupaia chinensis]
Length = 1036
Score = 40.0 bits (92), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
D ++++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 744 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 779
>gi|343959936|dbj|BAK63825.1| transcription intermediary factor 1-gamma [Pan troglodytes]
Length = 81
Score = 40.0 bits (92), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 26/38 (68%), Gaps = 1/38 (2%)
Query: 20 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATI 57
D +N++D+ + C+ C NGGDLLCCE +C + FH T
Sbjct: 33 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLTC 69
>gi|413933083|gb|AFW67634.1| hypothetical protein ZEAMMB73_811991 [Zea mays]
Length = 1579
Score = 40.0 bits (92), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 68/162 (41%), Gaps = 40/162 (24%)
Query: 4 FIVDDMEEDMVH----------DTEEDE----SNEEDELFDSVCSFCDNGGDLLCCEGRC 49
+VD ++ D++H D E + S E D+ D C C +GG+L+CC+G C
Sbjct: 966 ILVDGLDIDLLHCLINAWNMQSDAERQDFFPVSIEGDDPNDDTCGICGDGGNLICCDG-C 1024
Query: 50 LRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAE 109
+FH + LGL E + C NC C C + S D E A+
Sbjct: 1025 PSTFHMSC----------LGL---EALPTDYWCCSNC-----SCKFCHEHSSDDAEDTAD 1066
Query: 110 VFPCVSATCGH----FYHPHCVSKLLLRDDEVAA--DQLAKS 145
V + TC +YH + + D+V ++LA++
Sbjct: 1067 VDSSLH-TCSQIHNIWYHESGILSMTSAADDVRTWREELARA 1107
>gi|356495799|ref|XP_003516760.1| PREDICTED: uncharacterized protein LOC100814247 [Glycine max]
Length = 1314
Score = 40.0 bits (92), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 26/100 (26%), Positives = 43/100 (43%), Gaps = 20/100 (20%)
Query: 32 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 91
C C +GGDL+CC+G C +FH + C + ++ + + C NC
Sbjct: 727 TCGICGDGGDLICCDG-CPSTFHQS--------CLDI-----QMLPLGEWHCPNC----- 767
Query: 92 QCFACG-KLGSSDKETGAEVFPCVSATCGHFYHPHCVSKL 130
C CG G+S+K+ + + C YH C ++
Sbjct: 768 TCKFCGIASGNSEKDDASVYVLQICNLCEKKYHDSCTKEM 807
>gi|344242940|gb|EGV99043.1| Transcription intermediary factor 1-alpha [Cricetulus griseus]
Length = 493
Score = 40.0 bits (92), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 260 EARKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 318
Query: 73 DEVE 76
EVE
Sbjct: 319 PEVE 322
>gi|344297160|ref|XP_003420267.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Loxodonta africana]
Length = 1014
Score = 40.0 bits (92), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 72
++ +ED+ + C+ C NGG+LLCCE +C + FH A +GE L+K
Sbjct: 782 DTRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 840
Query: 73 DEVE 76
EVE
Sbjct: 841 PEVE 844
>gi|326487516|dbj|BAJ89742.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 583
Score = 40.0 bits (92), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 29/71 (40%), Gaps = 20/71 (28%)
Query: 93 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 152
CF C + G + C C YHP+CV K DD + +D E F
Sbjct: 14 CFVC--------KDGGHLRLCDYRNCNKAYHPNCVEK---DDDFLNSD---------EDF 53
Query: 153 TCPLHKCCICK 163
C H CCICK
Sbjct: 54 ICGWHTCCICK 64
>gi|2267585|gb|AAB63585.1| transcription intermediary factor 1 [Homo sapiens]
Length = 1012
Score = 40.0 bits (92), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT--------IDAGEESHCASLGLTK 72
E+ +ED+ + C+ C NGG+LLCCE +C + FH + +GE L+K
Sbjct: 779 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 837
Query: 73 DEVE 76
EVE
Sbjct: 838 PEVE 841
>gi|226294417|gb|EEH49837.1| nucleoporin Pom152 [Paracoccidioides brasiliensis Pb18]
Length = 1259
Score = 40.0 bits (92), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 39/157 (24%), Positives = 71/157 (45%), Gaps = 19/157 (12%)
Query: 871 LSSPTIVIDGTSPLGVHSSKTIEMLSHDDGCQCDQMPH-RSP----------VNVSSGIL 919
LS+ T+V++GT PL + S+ + + H Q Q + RSP ++ S+ +
Sbjct: 330 LSNLTLVVEGTPPLKIKYSRKVNKIDHGVSFQSIQPENFRSPLIGQGRVDSLIDPSNPDI 389
Query: 920 EGESSKPIEIPALSGIGDGGYEHFGVGMSHHSAATVIDGASSLQGLSSKSIEMPSYTQFD 979
++ IE+P + GG + + H + V + +++ + S P QF
Sbjct: 390 SWAQAQRIEVPLNESLSTGGEWIYTIEEVHDACGNVANYSTASEDGDLLSKIAPQTQQF- 448
Query: 980 DNVHQHFEDKGVPRCSPAKVIAKSSLEDHSSKSIDIP 1016
VH+ PR S ++ A+S L+ +SID+P
Sbjct: 449 -VVHER------PRVSLSRCNAQSYLQVAKDQSIDLP 478
>gi|402865478|ref|XP_003896948.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Papio
anubis]
Length = 1431
Score = 39.7 bits (91), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)
Query: 90 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 149
Q C CG G GAE + CG YHP+CVS + + ++
Sbjct: 693 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 735
Query: 150 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 190
+ + C +C +C + KA + +C C +YH CL
Sbjct: 736 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 773
>gi|402864955|ref|XP_003896705.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Papio anubis]
Length = 1016
Score = 39.7 bits (91), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
E+ +ED+ + C+ C NGG+LLCCE +C + FH +
Sbjct: 783 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 817
>gi|332224572|ref|XP_003261443.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Nomascus leucogenys]
gi|426358064|ref|XP_004046342.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
[Gorilla gorilla gorilla]
Length = 1016
Score = 39.7 bits (91), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
E+ +ED+ + C+ C NGG+LLCCE +C + FH +
Sbjct: 783 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 817
>gi|351701489|gb|EHB04408.1| Transcription intermediary factor 1-alpha [Heterocephalus glaber]
Length = 925
Score = 39.7 bits (91), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 21 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 56
E+ +ED+ + C+ C NGG+LLCCE +C + FH +
Sbjct: 692 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 726
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.132 0.397
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 20,210,710,882
Number of Sequences: 23463169
Number of extensions: 917140388
Number of successful extensions: 2539290
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 104
Number of HSP's successfully gapped in prelim test: 976
Number of HSP's that attempted gapping in prelim test: 2533676
Number of HSP's gapped (non-prelim): 4582
length of query: 1210
length of database: 8,064,228,071
effective HSP length: 154
effective length of query: 1056
effective length of database: 8,745,867,341
effective search space: 9235635912096
effective search space used: 9235635912096
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 83 (36.6 bits)