BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 000584
         (1407 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359494102|ref|XP_002276879.2| PREDICTED: uncharacterized protein LOC100243584 [Vitis vinifera]
          Length = 1260

 Score = 1182 bits (3059), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 631/1153 (54%), Positives = 794/1153 (68%), Gaps = 46/1153 (3%)

Query: 1    MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
            MASSDDE E     VS+Y+F  +    + E ISFS LPIQW++ +     KE I+L G A
Sbjct: 1    MASSDDEGETLPGSVSNYHFVDD----KGEPISFSVLPIQWSKGDNLDSKKEPIFLDGNA 56

Query: 61   DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSY 120
            D+GLQKI+K VIAWKFDL++V PEI VLSKEN+WIKLQKPRK +E+I R+ILI V CL  
Sbjct: 57   DNGLQKIYKQVIAWKFDLSDVNPEISVLSKENNWIKLQKPRKSFEDIIRSILITVWCLHS 116

Query: 121  AKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLE 180
             K+NPE + KS+WD LSR F LY+VRPS+NDLVDH  LI EA++RD+ LAKSKFL+TFLE
Sbjct: 117  MKKNPETSGKSLWDHLSRVFSLYDVRPSENDLVDHTTLISEAVKRDEGLAKSKFLLTFLE 176

Query: 181  EKPTKRKLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLL 240
            EKP KRK  ++ V T +  GFIVD M+ED + +T E  S+EE++LFDSVCS CDNGGDLL
Sbjct: 177  EKPRKRKSFEQDVPTTSKPGFIVDYMDEDGISETGEVGSDEEEDLFDSVCSMCDNGGDLL 236

Query: 241  CCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSD 300
            CCEGRC+RSFHAT +AGEES CA+LG++  +VEAM NF+CKNC+YKQHQCF+CGKLGSSD
Sbjct: 237  CCEGRCMRSFHATKEAGEESLCATLGMSVAQVEAMQNFYCKNCKYKQHQCFSCGKLGSSD 296

Query: 301  KETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICK 360
            K +GAEVF C +ATCG FYHP CV+KLL R+DE AA++L K+I AGE F CP+H+C +CK
Sbjct: 297  KSSGAEVFLCANATCGRFYHPQCVAKLLHREDEAAAEELQKNIYAGELFACPIHRCHVCK 356

Query: 361  QGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCL 420
            QGE+K D +LQFA+CRRCPK+YHRKCLPRKI+FED  EEGII RAW+GLLPN RILIYCL
Sbjct: 357  QGEDKKDLELQFAICRRCPKSYHRKCLPRKISFEDLDEEGIIQRAWDGLLPN-RILIYCL 415

Query: 421  KHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQ 480
            KHEID+ +GTPIRDHI FP  EE        R+ +  +S     KV S K SL S+  P+
Sbjct: 416  KHEIDELLGTPIRDHIKFPNDEEKM----EKRRSELFSSRKDLDKVVSKKRSLVSEDSPR 471

Query: 481  GKFSVKALKRV---PSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQ 537
             + +VKA K+V    S    G++ + SE+   G D S+R K T  S+KS   NVKS+S +
Sbjct: 472  ERMAVKATKQVEKLSSTVKDGDSTKKSEKRSSGPDPSKRLKVTGFSKKSLDDNVKSISKK 531

Query: 538  VDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELP 597
            VD+SS  D  KTSLGE+LYA        +++  K+D  + E  + V      KK  S LP
Sbjct: 532  VDKSSMADENKTSLGEQLYALI----KNRSEPRKEDTPNSELEQKVVT----KKTSSSLP 583

Query: 598  SLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAI 657
            SLD DS+ R+ +++K++ S + +E+++K+HK+PSTHAY+SK+ VD+ IT GKVEGS+EA+
Sbjct: 584  SLDRDSENRILAIIKESKSLITLEDVMKKHKVPSTHAYSSKNTVDRTITQGKVEGSIEAL 643

Query: 658  RTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVD 717
            R ALKKL+  G SIEDAKAVCEPEVL+QI KWKNKLKVYLAPFLHGMRYTSFGRHFTKVD
Sbjct: 644  RAALKKLE-GGGSIEDAKAVCEPEVLNQIVKWKNKLKVYLAPFLHGMRYTSFGRHFTKVD 702

Query: 718  KLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNF 777
            KL+ IV+KLH+YV +GD IVDFCCGANDFSCLMK+KL+E GK C YKNYD++  KNDFNF
Sbjct: 703  KLKEIVEKLHYYVKNGDTIVDFCCGANDFSCLMKQKLEEMGKKCSYKNYDVIQPKNDFNF 762

Query: 778  EKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERL 837
            EKRDWM+V+ KEL  GS+LIMGLNPPFGVKA LAN FINKAL+F PKLLILIVPPETERL
Sbjct: 763  EKRDWMSVKQKELPTGSQLIMGLNPPFGVKASLANMFINKALQFKPKLLILIVPPETERL 822

Query: 838  DRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKAL 897
            D+K   Y+L+WEDD  LSGKSFYLPGSVD NDKQ++QWN+  P LYLWSR D+   H+A+
Sbjct: 823  DKKRPPYDLIWEDDNELSGKSFYLPGSVDVNDKQIEQWNVNPPLLYLWSRQDWTTKHRAI 882

Query: 898  AEKHGHISRPQSRTQMERNCYETHAVDHP--KEEGQGDASMLIDLPLQINVTKELRNEAR 955
            A+K GH+SR +  + +E+   E   +DHP   +   G  SM++D         E   E  
Sbjct: 883  AQKCGHVSRRRRVSHLEKIQNEEPVLDHPMADQTHSGHVSMMLD---------EHSVENH 933

Query: 956  EDDKAGFPDNATEGGGESS-HGHGDNQSGKTSRKRKRDRKKHGSG---MRENSPLDGQNR 1011
            E +     +  T G  ESS H   D +        +  +++HG G    R  +  D +  
Sbjct: 934  ELEHEERREIVTAGRVESSPHSGVDREDHGKKLLNENSKQRHGKGKHEKRTENISDDKQI 993

Query: 1012 GRHLASGIHGMSKHSPANIANVSPLLEGHSSKSIDMPSHVGSGDNDCQHFSNKGMPLSSP 1071
               ++    G S  S    ++    ++ H  +++   S V  G+    HF   G+P SS 
Sbjct: 994  MTPVSEMCKGTSCTSSPRASDARSTVDIHQPEALKKSSPVEVGEEVYPHF-QPGVPDSS- 1051

Query: 1072 TIVIDGTSPLGVHSSKTIEMLSH---DDGCQCDQMPHRSPVNVSSGILEG--ESSKPIEI 1126
               +  T   G H+S   +M      D         HR    VS G+  G   S +P   
Sbjct: 1052 ---LQRTGYGGSHASIPEDMARRYRLDSEEPFSSTIHRWSTGVSPGLDYGIRNSEEPFTS 1108

Query: 1127 PALSGIGDGGYEH 1139
                 I + GY H
Sbjct: 1109 YMRGSIDNLGYRH 1121



 Score =  132 bits (331), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 98/253 (38%), Positives = 133/253 (52%), Gaps = 38/253 (15%)

Query: 1192 SPAKVIAKSSLEDHSSKSIDIPSQTGFGSD-YQHHEPTRSSSHIGTTYYG-TQAGIPNDM 1249
            SP    A+S+++ H  +++   S    G + Y H +P    S +  T YG + A IP DM
Sbjct: 1009 SPRASDARSTVDIHQPEALKKSSPVEVGEEVYPHFQPGVPDSSLQRTGYGGSHASIPEDM 1068

Query: 1250 GS-YGMSS--------------LNNGLSHGA-NLDERYTGYVRNT-DSLGYRPSM----S 1288
               Y + S              ++ GL +G  N +E +T Y+R + D+LGYR S+     
Sbjct: 1069 ARRYRLDSEEPFSSTIHRWSTGVSPGLDYGIRNSEEPFTSYMRGSIDNLGYRHSIRDRDE 1128

Query: 1289 TDRELTMWPLARIYGQDFP-----------APTPGYGQMGSVPSNLYGNLGSSAEASY-R 1336
              R   +    + YG   P              P +GQMGS PS  YG+ GS AE+SY R
Sbjct: 1129 YGRNADIRSQVQSYGLHDPIGMSQRSNYLAGQDPRFGQMGSFPST-YGHPGSGAESSYSR 1187

Query: 1337 MSTSAMDRYAPRLHQLNNTRMNTFRSEPFMPSRFGFYDSRA-PQPGFFAD-MDFGPGFHP 1394
            M+TSAM RYAP+L +LN+TRMN+F  E  MP R   YD  A P+PGF AD M F PG H 
Sbjct: 1188 MNTSAMQRYAPQLDELNHTRMNSFGYERPMPIRNNIYDPLAPPRPGFQADSMGFAPGLHH 1247

Query: 1395 PFPQQGSGGWLDD 1407
            PF +Q S GWL++
Sbjct: 1248 PFSKQNSSGWLNE 1260


>gi|147814981|emb|CAN61366.1| hypothetical protein VITISV_020586 [Vitis vinifera]
          Length = 2238

 Score = 1177 bits (3046), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 633/1153 (54%), Positives = 791/1153 (68%), Gaps = 47/1153 (4%)

Query: 1    MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
            MASSDDE E     VS+Y+F  +    + E ISFS LPIQW++ +     KE I+L G A
Sbjct: 647  MASSDDEGETLPGSVSNYHFVDD----KGEPISFSVLPIQWSKGDNLDSKKEPIFLDGNA 702

Query: 61   DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSY 120
            D+GLQKI+K VIAWKFDL++V PEI VLSKEN+WIKLQKPRK +E+I R+ILI V CL  
Sbjct: 703  DNGLQKIYKQVIAWKFDLSDVNPEISVLSKENNWIKLQKPRKSFEDIIRSILITVWCLHS 762

Query: 121  AKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLE 180
             K+NPE + KS+WD LSR F LY+VRPS+NDLVDH  LI EA++RD+ LAKSKFL+TFLE
Sbjct: 763  MKKNPETSGKSLWDHLSRVFSLYDVRPSENDLVDHTTLISEAVKRDEGLAKSKFLLTFLE 822

Query: 181  EKPTKRKLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLL 240
            EKP KRK S E V T +  GFIVD M+ED + +T E  S+EE++LFDSVCS CDNGGDLL
Sbjct: 823  EKPRKRK-SFEDVPTTSKPGFIVDYMDEDGISETGEVGSDEEEDLFDSVCSMCDNGGDLL 881

Query: 241  CCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSD 300
            CCEGRC+RSFHAT +AGEES CA+LG++  +VEAM NF+CKNC+YKQHQCF+CGKLGSSD
Sbjct: 882  CCEGRCMRSFHATKEAGEESLCATLGMSVAQVEAMQNFYCKNCKYKQHQCFSCGKLGSSD 941

Query: 301  KETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICK 360
            K +GAEVF C +ATCG FYHP CV+KLL R+DE AA+ L K+I AGE F CP+H+C +CK
Sbjct: 942  KSSGAEVFLCANATCGRFYHPQCVAKLLHREDEAAAEDLQKNIYAGELFACPIHRCHVCK 1001

Query: 361  QGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCL 420
            QGE+K D +LQFA+CRRCPK+YHRKCLPRKI+FED  EEGII RAW+GLLPN RILIYCL
Sbjct: 1002 QGEDKKDLELQFAICRRCPKSYHRKCLPRKISFEDLDEEGIIQRAWDGLLPN-RILIYCL 1060

Query: 421  KHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQ 480
            KHEID+ +GTPIRDHI FP  EE        R+ +  +S     KV S K SL S+  P+
Sbjct: 1061 KHEIDELLGTPIRDHIKFPNDEEKM----EKRRSELFSSRKDLDKVVSKKRSLVSEDSPR 1116

Query: 481  GKFSVKALKRV---PSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQ 537
             + +VKA K+V    S    G++ + SE+   G D S+R K T  S+KS   NVKS+S +
Sbjct: 1117 ERMAVKATKQVEKLSSTVKDGDSTKKSEKRSSGPDPSKRLKVTGFSKKSLDDNVKSISKK 1176

Query: 538  VDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELP 597
            VD+SS  D  KTSLGE+LYA        +    K+D  + E  + V  K    K  S LP
Sbjct: 1177 VDKSSMADENKTSLGEQLYALIKNRSEPR----KEDTPNSELEQKVVTK----KTSSSLP 1228

Query: 598  SLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAI 657
            SLD DS+ R+ +++K++ S + +E+++K+HK+PSTHAY+SK+ VD+ IT GKVEGS+EA+
Sbjct: 1229 SLDRDSENRILAIIKESKSLITLEDVMKKHKVPSTHAYSSKNTVDRTITQGKVEGSIEAL 1288

Query: 658  RTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVD 717
            R ALKKL+  G SIEDAKAVCEPEVL+QI KWKNKLKVYLAPFLHGMRYTSFGRHFTKVD
Sbjct: 1289 RAALKKLE-GGGSIEDAKAVCEPEVLNQIVKWKNKLKVYLAPFLHGMRYTSFGRHFTKVD 1347

Query: 718  KLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNF 777
            KL+ IV+KLH+YV +GD IVDFCCGANDFSCLMK+KL+E GK C YKNYD++  KNDFNF
Sbjct: 1348 KLKEIVEKLHYYVKNGDTIVDFCCGANDFSCLMKQKLEEMGKKCSYKNYDVIQPKNDFNF 1407

Query: 778  EKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERL 837
            EKRDWM+V+ KEL  GS+LIMGLNPPFGVKA LAN FINKAL+F PKLLILIVPPETERL
Sbjct: 1408 EKRDWMSVKQKELPTGSQLIMGLNPPFGVKASLANMFINKALQFKPKLLILIVPPETERL 1467

Query: 838  DRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKAL 897
            D+K   Y+L+WEDD  LSGKSFYLPGSVD NDKQ++QWN+  P LYLWSR D+   H+A+
Sbjct: 1468 DKKRPPYDLIWEDDNELSGKSFYLPGSVDVNDKQIEQWNVNPPLLYLWSRQDWTTKHRAI 1527

Query: 898  AEKHGHISRPQSRTQMERNCYETHAVDHPK--EEGQGDASMLIDLPLQINVTKELRNEAR 955
            A+K GH+SR +  + +E+   E   +DHP   +   G  SM++D         E   E  
Sbjct: 1528 AQKCGHVSRRRRVSHLEKIQNEEPVLDHPMADQTHSGHVSMMLD---------EHSVENH 1578

Query: 956  EDDKAGFPDNATEGGGESS-HGHGDNQSGKTSRKRKRDRKKHGSG---MRENSPLDGQNR 1011
            E +     +  T G  ESS H   D +        +  +++HG G    R  +  D +  
Sbjct: 1579 ELEHEERREIVTAGRVESSPHSGVDREDHGKKLLNENSKQRHGKGKHEKRTENISDDKQI 1638

Query: 1012 GRHLASGIHGMSKHSPANIANVSPLLEGHSSKSIDMPSHVGSGDNDCQHFSNKGMPLSSP 1071
               ++    G S  S    ++    ++ H  +++   S V  G+    HF   G+P SS 
Sbjct: 1639 MTPVSEMCKGTSCTSSPRASDARSTVDIHQPEALKKSSPVEVGEEVYPHF-QPGVPDSS- 1696

Query: 1072 TIVIDGTSPLGVHSSKTIEMLSH---DDGCQCDQMPHRSPVNVSSGILEG--ESSKPIEI 1126
               +  T   G H+S   +M      D         HR    VS G+  G   S +P   
Sbjct: 1697 ---LQRTGYGGSHASIPEDMARRYRLDSEEPFSSTIHRWSTGVSPGLDYGIRNSEEPFTS 1753

Query: 1127 PALSGIGDGGYEH 1139
                 I + GY H
Sbjct: 1754 YMRGSIDNLGYRH 1766



 Score =  122 bits (305), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 95/247 (38%), Positives = 128/247 (51%), Gaps = 38/247 (15%)

Query: 1192 SPAKVIAKSSLEDHSSKSIDIPSQTGFGSD-YQHHEPTRSSSHIGTTYYG-TQAGIPNDM 1249
            SP    A+S+++ H  +++   S    G + Y H +P    S +  T YG + A IP DM
Sbjct: 1654 SPRASDARSTVDIHQPEALKKSSPVEVGEEVYPHFQPGVPDSSLQRTGYGGSHASIPEDM 1713

Query: 1250 GS-YGMSS--------------LNNGLSHGA-NLDERYTGYVRNT-DSLGYRPSM----S 1288
               Y + S              ++ GL +G  N +E +T Y+R + D+LGYR S+     
Sbjct: 1714 ARRYRLDSEEPFSSTIHRWSTGVSPGLDYGIRNSEEPFTSYMRGSIDNLGYRHSIRDRDE 1773

Query: 1289 TDRELTMWPLARIYGQDFP-----------APTPGYGQMGSVPSNLYGNLGSSAEASY-R 1336
              R   +    + YG   P              P +GQMGS PS  YG+ GS AE+SY R
Sbjct: 1774 YGRNADIRSQVQSYGLHDPIGMSQRSNYLAGQDPRFGQMGSFPST-YGHPGSGAESSYSR 1832

Query: 1337 MSTSAMDRYAPRLHQLNNTRMNTFRSEPFMPSRFGFYDSRA-PQPGFFAD-MDFGPGFHP 1394
            M+TSAM RYAP+L +LN+TRMN+F  E  MP R   YD  A P+PGF AD M F PG H 
Sbjct: 1833 MNTSAMQRYAPQLDELNHTRMNSFGYERPMPIRNNIYDPLAPPRPGFQADSMGFAPGLHH 1892

Query: 1395 PFPQQGS 1401
            PF +Q S
Sbjct: 1893 PFSKQNS 1899


>gi|255576262|ref|XP_002529024.1| protein binding protein, putative [Ricinus communis]
 gi|223531504|gb|EEF33335.1| protein binding protein, putative [Ricinus communis]
          Length = 1249

 Score = 1131 bits (2925), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 614/1074 (57%), Positives = 755/1074 (70%), Gaps = 36/1074 (3%)

Query: 1    MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTG-GCKELIYLRGA 59
            MASSDDE + G + VS+Y+F  +    +   ISFS LP QW+E+E  G   K  I+L G+
Sbjct: 1    MASSDDEADSGPQSVSNYHFVDD----EDAPISFSLLPFQWSESESVGEQNKHQIFLHGS 56

Query: 60   ADSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLS 119
             D+GL+ I   VIAWKFDL N IP I V++K+ +WIKL+KPRK +E+I RT LI VHCL 
Sbjct: 57   VDNGLRTIHMEVIAWKFDLLNAIPSISVITKDKNWIKLEKPRKSFEKIIRTELITVHCLH 116

Query: 120  YAKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFL 179
            YA++ PEA+ KS+WD LS+ F LY+VR +QNDLVDHM LI EA++RDD LAKSKFL+ FL
Sbjct: 117  YARKYPEASKKSVWDHLSKVFSLYDVRFTQNDLVDHMALISEAVKRDDSLAKSKFLLAFL 176

Query: 180  EEKPTKRKLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDL 239
            EEKP KR+ S+E +QT  MSGFIVDD+++DM  D EED   EE++   SVC+FCDNGG+L
Sbjct: 177  EEKPRKRRPSNEDIQTTDMSGFIVDDVDDDMFEDVEEDGEEEEED---SVCTFCDNGGEL 233

Query: 240  LCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSS 299
            LCC+G C+RSFHAT +AGEES C SLG T+ EVEA   F+CKNCEYKQHQCFACG+LGSS
Sbjct: 234  LCCDGSCMRSFHATKEAGEESMCVSLGFTEREVEATERFYCKNCEYKQHQCFACGELGSS 293

Query: 300  DKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG-ESFTCPLHKCCI 358
            DK +GAEVF C +ATCG+FYHP C++KLL ++DEVAA +L K I AG ESFTCP+HKCC+
Sbjct: 294  DKLSGAEVFRCANATCGYFYHPSCIAKLLHQEDEVAAKELQKKIAAGKESFTCPIHKCCV 353

Query: 359  CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIY 418
            CKQGENK   +LQFAVCRRCP +YHRKC+P +I FE K  E  I RAWE LLPN RILIY
Sbjct: 354  CKQGENKKIRELQFAVCRRCPTSYHRKCMPSEIVFEKKKGEEEI-RAWEDLLPN-RILIY 411

Query: 419  CLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAP 478
            CLKHEI D +GTPIRD I FP IEE K       K Q    P   +K  + K  LTS+  
Sbjct: 412  CLKHEIIDYLGTPIRD-IRFPDIEEKK-------KTQISDLPGSSEKDLAKKRRLTSEDL 463

Query: 479  PQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQV 538
              G   +K +K   S A +   ++ SE+L  GS   RR K  D SRKS K  +KS S+++
Sbjct: 464  FSGDAVIKKVKDSSSGARKVTNIKKSEKLSPGSTFLRRVKERDASRKSLKEKMKSTSIEL 523

Query: 539  DRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPS 598
            DRS++ +  KTSLG++L+   + + +EQ   GK+D    E  + VTVK    KL  ELPS
Sbjct: 524  DRSATANLNKTSLGDKLFD--IMKRSEQVHNGKKDVHTNEIDKPVTVKA-STKLSDELPS 580

Query: 599  LDEDSKRRLSSLMKDAASSVRMEEILKRHK--IPSTHAYASKSAVDKAITLGKVEGSVEA 656
            LD D++RRL +LMK+++S + ME++ K H+  IPSTHAY+ ++  +KAIT GKVEG+VEA
Sbjct: 581  LDADTERRLLALMKESSSLISMEDVRKTHQVHIPSTHAYSLRTVCEKAITAGKVEGAVEA 640

Query: 657  IRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKV 716
            +RTALKKL+ DG S EDAKAVC P  LSQ+FKWK+KL+VYLAPFL+GMRYTSFGRHFTKV
Sbjct: 641  VRTALKKLE-DGCSTEDAKAVCGPANLSQVFKWKSKLRVYLAPFLNGMRYTSFGRHFTKV 699

Query: 717  DKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFN 776
            +KL+ I + LHWYV DGD IVDFCCGANDFSCLMKKKL++T K C YKNYD++  KNDFN
Sbjct: 700  EKLEEITNLLHWYVEDGDTIVDFCCGANDFSCLMKKKLEQTRKTCSYKNYDVIQPKNDFN 759

Query: 777  FEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETER 836
            FEKRDWMTV P+EL P   LIMGLNPPFGVKA LANKFINKALEF PKLLILIVPPETER
Sbjct: 760  FEKRDWMTVRPEEL-PKEGLIMGLNPPFGVKAALANKFINKALEFKPKLLILIVPPETER 818

Query: 837  LDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKA 896
            LD+K+S Y LVWEDD+F+SGKSFYLPGS+DENDK+MDQWN+T PPLYLWSR D+   H A
Sbjct: 819  LDKKDSPYNLVWEDDRFVSGKSFYLPGSIDENDKRMDQWNLTTPPLYLWSRPDWHEKHLA 878

Query: 897  LAEKHGHISRPQSRTQMERNCYETHAVDHPKE--EGQGDASMLIDLPLQINVTKELRN-- 952
            +A+K GH+S  +  +  + N  ET   DHP E    + DAS L D    +   KEL+   
Sbjct: 879  IAQKQGHLSGQREGSSSKENYPETMTYDHPLEVYSSKADASELTDDDRLVQ-NKELKEPN 937

Query: 953  ---EAREDDKAGFPDNATEGGGESSHGHGDNQSG-KTSRKRKRDRKKHGSGMRENSPLDG 1008
                  E  K   P +      E S+G   +QS  KT RKRK    K G G  E  P   
Sbjct: 938  DNISVAEGSKECSPHDNGSRESEDSYGPERSQSKEKTLRKRKHGEDKLGRGTSEKLPKTR 997

Query: 1009 QNRGRHLASGIHGMSKH-SPANIANVSPLLEGHSSKSIDMPSHVGSGDNDCQHF 1061
            Q   +   S  +   +H SP  + N     EG +S+S +M  H   G     +F
Sbjct: 998  QTGAKPPRSNTYRGIRHCSPPKMVNSRSSQEGLTSRSFEMTPHAEVGKTSSPNF 1051



 Score =  125 bits (314), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 94/244 (38%), Positives = 133/244 (54%), Gaps = 26/244 (10%)

Query: 1186 KGVPRCSPAKVI-AKSSLEDHSSKSIDIPSQTGFG-SDYQHHEPTRSSSHIGTTYYGTQA 1243
            +G+  CSP K++ ++SS E  +S+S ++      G +   + E    SSH+ +   GT  
Sbjct: 1010 RGIRHCSPPKMVNSRSSQEGLTSRSFEMTPHAEVGKTSSPNFESGMFSSHMPS---GTAC 1066

Query: 1244 G--IPNDMGSYGMSSLN--------NGLSHGANLDERYTGYVR-NTDSLGYRPSMSTDRE 1292
            G    N  G     S+N        +G SH  NLDER TG +R +T+++GYR  +   RE
Sbjct: 1067 GNLTSNHDGVGRKFSMNSDEYLQGIHGFSH-PNLDERSTGPIRESTENIGYRSYVMGLRE 1125

Query: 1293 LTMWPLARIYGQ--DFPAPT----PGYGQMGSVPSNLYGNLGSSAEASYRMSTSAMDRYA 1346
              +    + YGQ  D  A      PGYG+MGS PS LY +LG+ ++  YRM+TSAM RYA
Sbjct: 1126 SDLRSQVQQYGQHPDSSAQRNFHDPGYGRMGSAPSMLYRHLGTPSDPLYRMNTSAMQRYA 1185

Query: 1347 PRLHQLNNTRMNTFRSEPFMPSRFGFYD--SRAPQPGFFAD-MDFGPGFHPPFPQQGSGG 1403
            PRL +LN+T M  F  +P M  R G Y+     P PG+  D M+F PG H P+    S G
Sbjct: 1186 PRLDELNHTMMGDFSPDPSMMHRNGMYNPRPPQPPPGYHIDSMNFAPGPHRPYSHHNSAG 1245

Query: 1404 WLDD 1407
            WL++
Sbjct: 1246 WLNE 1249


>gi|297737514|emb|CBI26715.3| unnamed protein product [Vitis vinifera]
          Length = 1201

 Score = 1116 bits (2886), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 608/1150 (52%), Positives = 759/1150 (66%), Gaps = 99/1150 (8%)

Query: 1    MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
            MASSDDE E     VS+Y+F  +    + E ISFS LPIQW++ +     KE I+L G A
Sbjct: 1    MASSDDEGETLPGSVSNYHFVDD----KGEPISFSVLPIQWSKGDNLDSKKEPIFLDGNA 56

Query: 61   DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSY 120
            D+GLQKI+K VIAWKFDL++V PEI VLSKEN+WIKLQKPRK +E+I R+ILI V CL  
Sbjct: 57   DNGLQKIYKQVIAWKFDLSDVNPEISVLSKENNWIKLQKPRKSFEDIIRSILITVWCLHS 116

Query: 121  AKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLE 180
             K+NPE + KS+WD LSR F LY+VRPS+NDLVDH  LI EA++RD+ LAKSKFL+TFLE
Sbjct: 117  MKKNPETSGKSLWDHLSRVFSLYDVRPSENDLVDHTTLISEAVKRDEGLAKSKFLLTFLE 176

Query: 181  EKPTKRKLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLL 240
            EKP KRK S E V T +  GFIVD M+ED + +T E  S+EE++LFDSVCS CDNGGDLL
Sbjct: 177  EKPRKRK-SFEDVPTTSKPGFIVDYMDEDGISETGEVGSDEEEDLFDSVCSMCDNGGDLL 235

Query: 241  CCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSD 300
            CCEGRC+RSFHAT +AGEES CA+LG++  +VEAM NF+CKNC+YKQHQCF+CGKLGSSD
Sbjct: 236  CCEGRCMRSFHATKEAGEESLCATLGMSVAQVEAMQNFYCKNCKYKQHQCFSCGKLGSSD 295

Query: 301  KETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICK 360
            K +GAEVF C +ATCG FYHP CV+KLL R+DE AA++L K+I AGE F CP+H+C +CK
Sbjct: 296  KSSGAEVFLCANATCGRFYHPQCVAKLLHREDEAAAEELQKNIYAGELFACPIHRCHVCK 355

Query: 361  QGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCL 420
            QGE+K D +LQFA+CRRCPK+YHRKCLPRKI+FED  EEGII RAW+GLLPN RILIYCL
Sbjct: 356  QGEDKKDLELQFAICRRCPKSYHRKCLPRKISFEDLDEEGIIQRAWDGLLPN-RILIYCL 414

Query: 421  KHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQ 480
            KHEID+ +GTPIRDHI FP  EE        R+ +  +S     KV S K SL S+  P 
Sbjct: 415  KHEIDELLGTPIRDHIKFPNDEEKM----EKRRSELFSSRKDLDKVVSKKRSLVSEDSPH 470

Query: 481  GKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDR 540
            G  + K+ KR                   G D S+R K T  S+KS   N          
Sbjct: 471  GDSTKKSEKRSS-----------------GPDPSKRLKVTGFSKKSLDDN---------- 503

Query: 541  SSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLD 600
                D+  + L +++                                + KK  S LPSLD
Sbjct: 504  ----DTPNSELEQKV--------------------------------VTKKTSSSLPSLD 527

Query: 601  EDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTA 660
             DS+ R+ +++K++ S + +E+++K+HK+PSTHAY+SK+ VD+ IT GKVEGS+EA+R A
Sbjct: 528  RDSENRILAIIKESKSLITLEDVMKKHKVPSTHAYSSKNTVDRTITQGKVEGSIEALRAA 587

Query: 661  LKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQ 720
            LKKL+  G SIEDAKAVCEPEVL+QI KWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKL+
Sbjct: 588  LKKLE-GGGSIEDAKAVCEPEVLNQIVKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLK 646

Query: 721  AIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKR 780
             IV+KLH+YV +GD IVDFCCGANDFSCLMK+KL+E GK C YKNYD++  KNDFNFEKR
Sbjct: 647  EIVEKLHYYVKNGDTIVDFCCGANDFSCLMKQKLEEMGKKCSYKNYDVIQPKNDFNFEKR 706

Query: 781  DWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRK 840
            DWM+V+ KEL  GS+LIMGLNPPFGVKA LAN FINKAL+F PKLLILIVPPETERLD+K
Sbjct: 707  DWMSVKQKELPTGSQLIMGLNPPFGVKASLANMFINKALQFKPKLLILIVPPETERLDKK 766

Query: 841  ESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEK 900
               Y+L+WEDD  LSGKSFYLPGSVD NDKQ++QWN+  P LYLWSR D+   H+A+A+K
Sbjct: 767  RPPYDLIWEDDNELSGKSFYLPGSVDVNDKQIEQWNVNPPLLYLWSRQDWTTKHRAIAQK 826

Query: 901  HGHISRPQSRTQMERNCYETHAVDHP--KEEGQGDASMLIDLPLQINVTKELRNEAREDD 958
             GH+SR +  + +E+   E   +DHP   +   G  SM++D         E   E  E +
Sbjct: 827  CGHVSRRRRVSHLEKIQNEEPVLDHPMADQTHSGHVSMMLD---------EHSVENHELE 877

Query: 959  KAGFPDNATEGGGESS-HGHGDNQSGKTSRKRKRDRKKHGSG---MRENSPLDGQNRGRH 1014
                 +  T G  ESS H   D +        +  +++HG G    R  +  D +     
Sbjct: 878  HEERREIVTAGRVESSPHSGVDREDHGKKLLNENSKQRHGKGKHEKRTENISDDKQIMTP 937

Query: 1015 LASGIHGMSKHSPANIANVSPLLEGHSSKSIDMPSHVGSGDNDCQHFSNKGMPLSSPTIV 1074
            ++    G S  S    ++    ++ H  +++   S V  G+    HF   G+P SS    
Sbjct: 938  VSEMCKGTSCTSSPRASDARSTVDIHQPEALKKSSPVEVGEEVYPHF-QPGVPDSS---- 992

Query: 1075 IDGTSPLGVHSSKTIEMLSH---DDGCQCDQMPHRSPVNVSSGILEG--ESSKPIEIPAL 1129
            +  T   G H+S   +M      D         HR    VS G+  G   S +P      
Sbjct: 993  LQRTGYGGSHASIPEDMARRYRLDSEEPFSSTIHRWSTGVSPGLDYGIRNSEEPFTSYMR 1052

Query: 1130 SGIGDGGYEH 1139
              I + GY H
Sbjct: 1053 GSIDNLGYRH 1062



 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 98/253 (38%), Positives = 133/253 (52%), Gaps = 38/253 (15%)

Query: 1192 SPAKVIAKSSLEDHSSKSIDIPSQTGFGSD-YQHHEPTRSSSHIGTTYYG-TQAGIPNDM 1249
            SP    A+S+++ H  +++   S    G + Y H +P    S +  T YG + A IP DM
Sbjct: 950  SPRASDARSTVDIHQPEALKKSSPVEVGEEVYPHFQPGVPDSSLQRTGYGGSHASIPEDM 1009

Query: 1250 GS-YGMSS--------------LNNGLSHGA-NLDERYTGYVRNT-DSLGYRPSM----S 1288
               Y + S              ++ GL +G  N +E +T Y+R + D+LGYR S+     
Sbjct: 1010 ARRYRLDSEEPFSSTIHRWSTGVSPGLDYGIRNSEEPFTSYMRGSIDNLGYRHSIRDRDE 1069

Query: 1289 TDRELTMWPLARIYGQDFP-----------APTPGYGQMGSVPSNLYGNLGSSAEASY-R 1336
              R   +    + YG   P              P +GQMGS PS  YG+ GS AE+SY R
Sbjct: 1070 YGRNADIRSQVQSYGLHDPIGMSQRSNYLAGQDPRFGQMGSFPST-YGHPGSGAESSYSR 1128

Query: 1337 MSTSAMDRYAPRLHQLNNTRMNTFRSEPFMPSRFGFYDSRA-PQPGFFAD-MDFGPGFHP 1394
            M+TSAM RYAP+L +LN+TRMN+F  E  MP R   YD  A P+PGF AD M F PG H 
Sbjct: 1129 MNTSAMQRYAPQLDELNHTRMNSFGYERPMPIRNNIYDPLAPPRPGFQADSMGFAPGLHH 1188

Query: 1395 PFPQQGSGGWLDD 1407
            PF +Q S GWL++
Sbjct: 1189 PFSKQNSSGWLNE 1201


>gi|356569700|ref|XP_003553034.1| PREDICTED: uncharacterized protein LOC100803073 [Glycine max]
          Length = 1225

 Score = 1015 bits (2624), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 549/1029 (53%), Positives = 697/1029 (67%), Gaps = 54/1029 (5%)

Query: 1    MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
            MASSDDE E     VS+Y+FE   +      + FS LPIQW+E++   G K  ++L G  
Sbjct: 1    MASSDDEGEAQPLSVSNYHFEDNKDAP----VCFSVLPIQWSESQSPVGKKMQVFLHGFV 56

Query: 61   DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMV-HCLS 119
            D+GLQK F  V+AW+FDL+ V PEI VLSK+  WIKL+KPRK YE+     +++  H LS
Sbjct: 57   DNGLQKFFVQVVAWRFDLSYVRPEILVLSKDGRWIKLEKPRKSYEDTIIRTILITIHFLS 116

Query: 120  YAKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFL 179
            Y K+NP+++AKS+WD LS+    YEV PSQNDL++HM L+ EA +RD  LAKSK L+  L
Sbjct: 117  YVKKNPDSSAKSVWDNLSKNKESYEVMPSQNDLLNHMTLMGEAAKRDTALAKSKLLLMVL 176

Query: 180  EEKPTKR--KLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGG 237
            E+K   +  K SD+ V+  A  GFI+DD++ DM+ +  ED S+ EDELFDSVC+ CDNGG
Sbjct: 177  EDKDKLKIKKPSDKEVKDLARPGFIIDDIDNDMIDEFGED-SDGEDELFDSVCAICDNGG 235

Query: 238  DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLG 297
             LLCC+G+C+RSFHA  + GEES CASLG ++ EV+ + NF+CKNCEY QHQCFACG LG
Sbjct: 236  QLLCCDGKCMRSFHANEEDGEESTCASLGFSRKEVDEIQNFYCKNCEYNQHQCFACGTLG 295

Query: 298  SSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCC 357
             SDK +GAEVF C SATCG FYHPHCV+KLL    E A  +L + I  G  FTCP H CC
Sbjct: 296  CSDKFSGAEVFKCASATCGFFYHPHCVAKLLHGIVEDAPKELEEKIAEGGPFTCPTHYCC 355

Query: 358  ICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILI 417
             CK+ E+K   D QFAVCRRCP++YHRKCLPR+IAF+D  +E IITRAWE LLPN+RILI
Sbjct: 356  ECKEMEDKKKHDFQFAVCRRCPRSYHRKCLPREIAFDDIEDEDIITRAWEDLLPNNRILI 415

Query: 418  YCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKA 477
            YCL+HEIDDE+GTPIRDHI FP ++     ID     +S    + K++V   K+++ SK 
Sbjct: 416  YCLEHEIDDELGTPIRDHIKFPNVKATVREIDAEENAKS----ATKERVILNKNNIDSKN 471

Query: 478  PPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQ 537
               GK +   + ++P K   G+  +     +  S+ SR+ K  + SR  F  N +S   +
Sbjct: 472  L-FGKKATAKVSKLPGKMSSGKVGDKKSEKISRSNISRK-KINEASR-CFNENKRSTISK 528

Query: 538  VDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELP 597
              + S     + SLG +L+A      +E    G +  +D     T+ VKP  KKL S LP
Sbjct: 529  ETKKSDGAENRPSLGAKLFA-LKQNSSEHINSGNE--ADDVAKNTLVVKPT-KKLSSTLP 584

Query: 598  SLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAI 657
            +LD DSKRRL +L K+A SSV +E ++K HK  +TH ++ KS V+K ITLGK+EGSVEA+
Sbjct: 585  ALDADSKRRLLALFKEATSSVTLENVIKEHKFAATHTHSLKSVVEKTITLGKLEGSVEAV 644

Query: 658  RTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVD 717
            RTAL+ L+ DG +I DA+AVC P+VL+QIFKWK+KLKVYLAP L+G RYTSFGRHFT+++
Sbjct: 645  RTALRMLE-DGHNIRDAEAVCGPDVLNQIFKWKDKLKVYLAPVLYGNRYTSFGRHFTQIE 703

Query: 718  KLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNF 777
            KL+ IVDKLHWYV +GD IVDFCCGANDFS LM KKL+ETGK C YKN+D+LP KNDFNF
Sbjct: 704  KLEGIVDKLHWYVQNGDTIVDFCCGANDFSILMNKKLEETGKRCSYKNFDLLPTKNDFNF 763

Query: 778  EKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERL 837
            E RDWMT++ KEL  GS+LIMGLNPPFG+KA LANKFI+KALEF PKLLILIVPPETERL
Sbjct: 764  EMRDWMTIQTKELPTGSQLIMGLNPPFGLKAALANKFIDKALEFRPKLLILIVPPETERL 823

Query: 838  DRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKAL 897
            D K S Y+LVWED +FL GKSFYLPGSVD ND+Q+DQWN+  PPLYLWSR D+   HKA+
Sbjct: 824  DEKRSPYDLVWEDKRFLLGKSFYLPGSVDANDRQIDQWNVKPPPLYLWSRPDWTDKHKAI 883

Query: 898  AEKHGHISRPQSRTQMERNCYETHAVDHPKEEGQGDASMLIDLPLQ--INVTKELRNEAR 955
            A KHGH    +   ++E    E     H  ++  G  SM    P    +N+T    NE +
Sbjct: 884  ARKHGHFISQRGLLRIESFDKEKSPASHTLDDSSGFNSM----PGHDILNLTDAPINEGQ 939

Query: 956  EDDKAGFPDNATEGGGESSHGHGDNQS-----------GKTSRKRKRDRKKHGSGMRENS 1004
                           G S HG+ D +S            KTS KRKR  +  G  +   S
Sbjct: 940  T--------------GCSPHGNVDRESQERQKYMVRKADKTSWKRKRSEENDGRRLGVTS 985

Query: 1005 ---PLDGQN 1010
               P+DG++
Sbjct: 986  PPNPIDGRS 994



 Score = 60.5 bits (145), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 69/245 (28%), Positives = 101/245 (41%), Gaps = 46/245 (18%)

Query: 1193 PAKVIAKSSLEDHSSKSIDIPSQTGFG-SDYQHHEPTRSSSH--IGTTYYGTQ--AGIPN 1247
            P  +  +SS+E    +    P     G   Y+H EPT SS    I   Y GTQ    + N
Sbjct: 987  PNPIDGRSSVESFQLRPDMPPPDYELGDKSYRHLEPTSSSRMGGIRAAYSGTQNWPSVAN 1046

Query: 1248 DMGSYGMSSLNNGLSHGANLDERYTGYVRN-TDSLGYRPSMSTD----RELTMWPLARIY 1302
             +   G++          ++ E ++   R+  +S+GYRP +  D    REL      R Y
Sbjct: 1047 PLYDSGIT----------DVGEHHSSLPRDIANSIGYRPYVREDENYLRELETRQQTRHY 1096

Query: 1303 GQDFPAPTPGYGQMGSVPSNLYGNLGSS------AEASYRMSTSAMDRYAPRLHQLNNTR 1356
            G   P            P+N + ++G S      A   Y M+T AM RYAPRL +LN+ R
Sbjct: 1097 GIQNPNSVMSNYLSVHDPANSH-HMGPSYPALALASEPYVMNTPAMQRYAPRLDELNHAR 1155

Query: 1357 MNTFRS------------------EPFMPSRFGFYDSRAPQPGFFADM-DFGPGFHPPFP 1397
            M+   S                  EP +  R G ++  A  PG+ + M  F  G H  + 
Sbjct: 1156 MDPLGSRLDELNHARMDPLGSRLDEPAIVGRNGAFERSALPPGYGSRMPGFAAGSHHMYS 1215

Query: 1398 QQGSG 1402
            +Q S 
Sbjct: 1216 RQNSA 1220


>gi|357459829|ref|XP_003600195.1| hypothetical protein MTR_3g055370 [Medicago truncatula]
 gi|355489243|gb|AES70446.1| hypothetical protein MTR_3g055370 [Medicago truncatula]
          Length = 1289

 Score = 1008 bits (2607), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 549/1060 (51%), Positives = 709/1060 (66%), Gaps = 97/1060 (9%)

Query: 1    MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
            MASSDDE +     VS+Y+F  + +      ++FS LPIQW+E+E   G KE ++L G A
Sbjct: 1    MASSDDEADTQPLSVSNYHFVDDKDAP----VTFSILPIQWSESESVEGKKEKVFLHGNA 56

Query: 61   DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSY 120
            D+GLQKIF  V+AW+FDL+NV PEI VL+K+  WIKLQKPRK YEEI R++LI V+ + Y
Sbjct: 57   DNGLQKIFMQVVAWRFDLSNVKPEISVLAKDKRWIKLQKPRKSYEEIVRSVLITVYFMHY 116

Query: 121  AKRNPEATAKSIWDFLSRF--FCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTF 178
             ++NPEA  KS+WD LS+   F  YEV+PS NDL++HM L+ EA  RD VLAKSK L+  
Sbjct: 117  VRKNPEALGKSVWDNLSKNKDFSHYEVKPSHNDLLNHMGLMGEAATRDAVLAKSKLLLMV 176

Query: 179  LEEKP--TKRKLSDEVVQTKAMSGFIVDDMEEDMV--HDTEEDESNEEDELFDSVCSFCD 234
            +E+K   + +KLSDE V+  A  GFI+ D  ++        E+ES+EEDELFDSVCSFCD
Sbjct: 177  MEDKDRMSIKKLSDEEVKELARPGFIIADDIDNDAIDETVAEEESDEEDELFDSVCSFCD 236

Query: 235  NGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACG 294
            NGG+LLCCEG+C+RSFHA  + GEES CASLG ++ EVE + NF+CKNCE+ +HQCFACG
Sbjct: 237  NGGELLCCEGKCMRSFHANEEDGEESSCASLGFSRKEVEEIQNFYCKNCEHNKHQCFACG 296

Query: 295  KLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLH 354
            +LG SDK  GAEVF C SATCG FYHP CV+KLL      A  +L  +I  GE FTCP H
Sbjct: 297  ELGCSDKFAGAEVFKCASATCGFFYHPQCVAKLLHLVISDAPTELVTNIAKGEPFTCPAH 356

Query: 355  KCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHR 414
             C ICK+ ENK + +L FAVCRRCPK+YHRKCLPRK+AFED +EEGI+ RAWE LLPN+R
Sbjct: 357  YCRICKEMENKNEHELHFAVCRRCPKSYHRKCLPRKVAFEDIVEEGIVARAWEDLLPNNR 416

Query: 415  ILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLT 474
            ILIYCLKHEIDDE+GTPIRDHI FP +++              A P+ K+ + +  + L 
Sbjct: 417  ILIYCLKHEIDDELGTPIRDHIKFPYVKQK-------------AKPATKEVINNNNAKLD 463

Query: 475  SKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSL 534
                   K +   L ++  K   G+    +   ++GS+  R+ KA + SR+    N +  
Sbjct: 464  DL---HVKRTSATLPKLSGKMSFGKVGIENPGKILGSNIPRK-KANEASRRLLNENKRPT 519

Query: 535  SVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLIS 594
              + ++S   +  + SLG +LY+ +  +G++Q   G   N+  + + +V  +P  KKL S
Sbjct: 520  LKEAEKSDH-EENQPSLGLQLYSHY-QKGSKQINSGNHVNNVADNTLSVK-RP--KKLSS 574

Query: 595  ELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSV 654
              P LD DS+RRL +L+K+A+SS+ +E ++K HK  STH ++ K+ V+K IT+GK+EGSV
Sbjct: 575  APPQLDADSERRLLALVKEASSSITLESVIKEHKFVSTHTHSLKNVVEKTITMGKLEGSV 634

Query: 655  EAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFT 714
            EA+RTAL+ LD +G SI DA+AVC P+V++++FKWK+KLKVYLAP L+G RYTSFGRHFT
Sbjct: 635  EAVRTALRMLD-EGHSIRDAEAVCGPDVMNRLFKWKDKLKVYLAPVLYGNRYTSFGRHFT 693

Query: 715  KVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAK-- 772
            +V+KL+ IVDKLHWYV + DMIVDFCCGANDFS LMKKKL+ETGK+CLYKN+D+LP K  
Sbjct: 694  QVEKLEGIVDKLHWYVQNNDMIVDFCCGANDFSRLMKKKLEETGKSCLYKNFDLLPTKAA 753

Query: 773  ------------------------------------------NDFNFEKRDWMTVEPKEL 790
                                                      NDFNFE RDW+TV+ KEL
Sbjct: 754  LHRLKLKSLGSVFSPTSQSPPPQKQNPTRWVFPPDVVLICLQNDFNFEMRDWLTVQRKEL 813

Query: 791  APGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWED 850
              GS+LIMGLNPPFG+KA LANKFI+KALEF PKLLILIVPPETERLDRK S Y LVWED
Sbjct: 814  PLGSQLIMGLNPPFGLKAALANKFIDKALEFEPKLLILIVPPETERLDRKRSRYVLVWED 873

Query: 851  DQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSR 910
            ++FLSGKSFYLPGSVD NDKQM+QWN+  PPLYLWS  D+A  HK +A++HGH+ R +  
Sbjct: 874  ERFLSGKSFYLPGSVDSNDKQMEQWNVKPPPLYLWSHPDWADKHKLIAQEHGHLFRERDV 933

Query: 911  TQMERNCYETHAVDHPKEEGQGDASMLIDLPLQINVTKELRNEAREDDKAGFPDNATEGG 970
            ++ME    E  +  H  ++   D +ML D  L  +  K     +  D  + F     EG 
Sbjct: 934  SRMESFDKEKSSASHSMDDNYFDDTML-DRMLDHDFLK-----STSDQDSSFMIGQMEG- 986

Query: 971  GESSHGHGDNQSGK-----------TSRKRKRDRKKHGSG 999
              SSHG+    S +           TS KRKR  +  G G
Sbjct: 987  --SSHGNVFRVSQERQDYLTINAENTSWKRKRTEENDGRG 1024



 Score = 60.1 bits (144), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 69/246 (28%), Positives = 94/246 (38%), Gaps = 72/246 (29%)

Query: 1213 PSQTGFGSDYQHHEPTRSSS-HIGTTYYGTQ--AGIPNDMGSYGMSSLNNGLSHGANLDE 1269
            P  +     Y+H EP  SS    G  Y GT     + + +  YG+          A+L E
Sbjct: 1065 PQSSIVDDSYRHLEPLSSSRVEFGQAYDGTHNWPNVADPLPDYGL----------ADLQE 1114

Query: 1270 RYTGYVRN-TDSLGYRPSMSTD------------RELTMWPLA----RIYGQDFPAPT-- 1310
              +G++ + T SLGYRP +  D             E  + PL      + G D+  P   
Sbjct: 1115 HNSGHLGDGTSSLGYRPHLRGDDIYPPGLPRPYLTEDVIQPLGVPRPYVMGADYIRPPRV 1174

Query: 1311 ---------------PGYGQMGSVPSNLYGNLGSSAEASY-------RMSTSAMDRYAPR 1348
                           P Y QMGS     Y   GS +E  Y       +MST AM RYAPR
Sbjct: 1175 PMPDPMSSSYLSGRGPAYSQMGST----YSVCGSGSELPYMMNTPAMQMSTPAMQRYAPR 1230

Query: 1349 LHQLNNTRMNTFRSEPFMPSRFGFYDSRAPQ------PGFFADMDFGPGFHPP-FPQQGS 1401
            L +LN+ + N+   E  +  R    +  A        PGF       PG  P  + +Q S
Sbjct: 1231 LDELNHVKTNSLGPEHPIVDRSTTSEHSAQSGSENVPPGF-------PGGSPHLYSRQNS 1283

Query: 1402 GGWLDD 1407
              W  D
Sbjct: 1284 SNWFSD 1289


>gi|145359280|ref|NP_200350.2| enhanced downy mildew 2 [Arabidopsis thaliana]
 gi|334188424|ref|NP_001190545.1| enhanced downy mildew 2 [Arabidopsis thaliana]
 gi|332009240|gb|AED96623.1| enhanced downy mildew 2 [Arabidopsis thaliana]
 gi|332009241|gb|AED96624.1| enhanced downy mildew 2 [Arabidopsis thaliana]
          Length = 1297

 Score =  972 bits (2513), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 509/943 (53%), Positives = 659/943 (69%), Gaps = 35/943 (3%)

Query: 15  VSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAADSGLQKIFKPVIAW 74
            S+YYFE +     KE +SF+ LPIQW+  E+  G     YLRG +D+GL  + K V AW
Sbjct: 18  ASNYYFEDD----DKEPVSFARLPIQWSVEEKVDGSGLGFYLRGRSDNGLLPLHKLVKAW 73

Query: 75  KFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSYAKRNPEATAKSIWD 134
           ++DL+N  PEI VL+K+N WIKL++PRK Y E+ RT+L+ +H + + +RNP+A+ K++W+
Sbjct: 74  RYDLSNFQPEISVLTKDNIWIKLEEPRKSYGELIRTVLVTLHSIQFLRRNPQASEKALWE 133

Query: 135 FLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLEEKPTKRKLSDEVVQ 194
            L+R    Y+V+PSQNDLVDH+ LI EA +RD  LA SKF++ FL +KPTKR+L DE   
Sbjct: 134 KLTRSLRSYDVKPSQNDLVDHIGLIAEAAKRDRNLANSKFILAFLTKKPTKRRLPDE--- 190

Query: 195 TKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATI 254
             A   FIV D E+  V   E++  +E+D+ F+SVC+ CDNGG++LCCEG CLRSFHAT 
Sbjct: 191 DNAKDDFIVGD-EDTYVASDEDELDDEDDDFFESVCAICDNGGEILCCEGSCLRSFHATK 249

Query: 255 DAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGA-EVFPCVSA 313
             GE+S C SLG  K +VEA+  +FC NCE+K HQCF C  LGSSD  +GA EVF CVSA
Sbjct: 250 KDGEDSLCDSLGFNKMQVEAIQKYFCPNCEHKIHQCFICKNLGSSDNSSGAAEVFQCVSA 309

Query: 314 TCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFA 373
           TCG+FYHPHCV++ L   ++  ++ L + IIAGE +TCPLHKC +C+ GE K DS+LQFA
Sbjct: 310 TCGYFYHPHCVTRRLRLGNKEESEALERQIIAGE-YTCPLHKCSVCENGEVKTDSNLQFA 368

Query: 374 VCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIR 433
           VCRRCPK+YHRKCLPR+I+FED  +E I+TRAW+GLL N R+LIYC +HEID+E+ TP+R
Sbjct: 369 VCRRCPKSYHRKCLPREISFEDIEDEDILTRAWDGLLHN-RVLIYCQEHEIDEELLTPVR 427

Query: 434 DHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRVPS 493
           DH+ FP  EE K  +   R+   L S  G+ K A  K    +     GK S  + +    
Sbjct: 428 DHVKFPFTEEQKVFVKEQRR--ILESHVGRDK-ARLKVKDPALQDTCGKASKNSFRSSFP 484

Query: 494 KAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSK------ 547
            +  G + +    +    D SR+ K  D S K      KS  +  D   +  +K      
Sbjct: 485 SSKDGFSTKKHGLVSSVPDHSRKRKDIDPSIKHKMVPQKSQKMMEDSREAGKNKLGVKEA 544

Query: 548 ------KTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDE 601
                 K SLGERL++   T+     K G+    D + ++T ++    K+  SE+P+LD 
Sbjct: 545 RDAGKSKISLGERLFS--YTQEPNPVKPGRVIPVDSKHNKTDSIA--SKEPGSEIPTLDN 600

Query: 602 DSKRRLSSLMKDAASSVRMEEILKRHKIPSTHA-YASKSAVDKAITLGKVEGSVEAIRTA 660
           DS+RRL ++MK A   + M  ILK+ KI ST + +++++ VDK IT+GKVEGSV+AIRTA
Sbjct: 601 DSQRRLLAVMKKATEEITMGTILKKFKIQSTMSTHSTRNVVDKTITMGKVEGSVQAIRTA 660

Query: 661 LKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQ 720
           LKKL+ +G +IEDAKAVCEPEVLSQI KWK+KLKVYLAPFLHG RYTSFGRHFT  +KLQ
Sbjct: 661 LKKLE-EGGNIEDAKAVCEPEVLSQILKWKDKLKVYLAPFLHGARYTSFGRHFTNPEKLQ 719

Query: 721 AIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKR 780
            IVD+LHWY +DGDMIVDFCCG+NDFSCLM  KL+ETGK CLYKNYD+ PAKN+FNFE++
Sbjct: 720 QIVDRLHWYADDGDMIVDFCCGSNDFSCLMNAKLEETGKKCLYKNYDLFPAKNNFNFERK 779

Query: 781 DWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRK 840
           DWMTV   EL PGS+LIMGLNPPFGV A LANKFI KALEF PK+LILIVPPETERLD+K
Sbjct: 780 DWMTVSKDELEPGSKLIMGLNPPFGVNASLANKFITKALEFRPKILILIVPPETERLDKK 839

Query: 841 ESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEK 900
           +S+Y L+WED  FLSG SFYLPGSV+E DKQ++ WN+  PPL LWSR D+AA HK +AEK
Sbjct: 840 KSSYVLIWEDKTFLSGNSFYLPGSVNEEDKQLEDWNLVPPPLSLWSRSDFAAKHKKIAEK 899

Query: 901 HGHISRPQSRTQMERNCYETHAVDHPKEEGQGDASMLIDLPLQ 943
           H H+SR    ++++    E +A  HP     G   M  D+P++
Sbjct: 900 HCHLSRDVGSSKLKIVEEEANASLHPLGASDG---MCDDIPME 939


>gi|297796419|ref|XP_002866094.1| hypothetical protein ARALYDRAFT_495622 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311929|gb|EFH42353.1| hypothetical protein ARALYDRAFT_495622 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 1294

 Score =  964 bits (2492), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 521/971 (53%), Positives = 664/971 (68%), Gaps = 60/971 (6%)

Query: 1   MASSDDEVE----VGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYL 56
           MA  DD+ E    V Q   S+YYFE +     KE +SF+ LPIQW+  ++  G     YL
Sbjct: 1   MAFVDDDEEEDFSVPQS-ASNYYFEDD----DKEPVSFARLPIQWSGKDKVDGSSLGFYL 55

Query: 57  RGAADSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVH 116
           RG +D+GL  + K V AW+FDL+N  PE+ VL+K+N WIKL++PRK Y E+ RT+ + +H
Sbjct: 56  RGRSDNGLLPLHKLVKAWRFDLSNFRPEVSVLTKDNIWIKLEEPRKSYGELIRTVFVTLH 115

Query: 117 CLSYAKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLV 176
            + + +RNP+A+ KS+W+ L++    Y+V+PSQNDLVDH+DLI EA ERD  L KSKF++
Sbjct: 116 SIQFLRRNPQASEKSLWEKLTKSLRSYDVKPSQNDLVDHIDLISEAAERDVNLGKSKFIL 175

Query: 177 TFLEEKPTKRKLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNG 236
            FL +KPTKR+L DE     A   FIV D E+  V   E++  +E+D+ F+SVC+ CDNG
Sbjct: 176 AFLTKKPTKRRLPDE---DNAKDDFIVGD-EDTYVASDEDELDDEDDDFFESVCAICDNG 231

Query: 237 GDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKL 296
           G+LLCCEG CLRSFHAT   GE+S C SLGL K +VEA+  +FC NCE+K HQCF C  L
Sbjct: 232 GELLCCEGSCLRSFHATKKDGEDSLCDSLGLNKMQVEAIQKYFCPNCEHKIHQCFICKNL 291

Query: 297 GSSDKETGA-EVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHK 355
           GSSD  TGA EVF CVSATCG+FYHP CV++ L   ++  ++ L + IIAGE +TCPLHK
Sbjct: 292 GSSDNSTGAAEVFQCVSATCGYFYHPRCVTRRLRLGNKDDSEALERQIIAGE-YTCPLHK 350

Query: 356 CCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRI 415
           C +C+ GE K DS+LQFAVCRRCPK+YHRKCLPR+I+FED  +E I TRAW+GLL N R+
Sbjct: 351 CSVCENGEVKTDSNLQFAVCRRCPKSYHRKCLPREISFEDIADEDIFTRAWDGLLHN-RV 409

Query: 416 LIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKV--------- 466
           LIYC +HEID+E+ TP+RDH+ FP  EE K  I   R+   L S  G+ K          
Sbjct: 410 LIYCQEHEIDEELLTPLRDHVKFPFTEEKKVFIKEQRR--ILESHVGRDKARPKVKDPAL 467

Query: 467 ------ASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKAT 520
                 AS KS  +S    +  FS K       K G G ++    R     D S + K  
Sbjct: 468 QDTCGKASDKSFRSSFPSSKDGFSTK-------KHGLGSSVPDHSRKRKDIDPSIKHKM- 519

Query: 521 DVSRKSFK-------GNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQD 573
            V +KS K            L V+  R S     K SLGERL+  ++ E     K G+  
Sbjct: 520 -VPQKSHKMMENSREAGKNRLGVKETRESG--KSKVSLGERLFN-YIQE-PNLVKPGRMI 574

Query: 574 NSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPST- 632
             D + ++T ++    K+  SE+P+LD DS+RRL ++MK A   + M  +LK+ K+ ST 
Sbjct: 575 PVDSKHNKTDSIA--SKEPGSEIPTLDNDSQRRLLAVMKKATEEITMGTMLKKFKVQSTM 632

Query: 633 HAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNK 692
           +  ++++ VDK IT+GKVEGSV+A+RTALKKL+ +G +IEDAKAVCEPEVLSQI KWK+K
Sbjct: 633 NTNSTRNVVDKTITMGKVEGSVQAVRTALKKLE-EGGNIEDAKAVCEPEVLSQILKWKDK 691

Query: 693 LKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKK 752
           LKVYLAPFLHG RYTSFGRHFT  +KLQ IVD+LHWY  DGDMIVDFCCG+NDFSCLM  
Sbjct: 692 LKVYLAPFLHGARYTSFGRHFTNPEKLQQIVDRLHWYAEDGDMIVDFCCGSNDFSCLMNT 751

Query: 753 KLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLAN 812
           KL+ETGK CLYKNYD+  AKN+FNFE++DWMTV   EL PGS+LIMGLNPPFGV A LAN
Sbjct: 752 KLEETGKKCLYKNYDLFQAKNNFNFERKDWMTVSKDELEPGSKLIMGLNPPFGVNASLAN 811

Query: 813 KFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQM 872
           KFI KALEF PK+LILIVPPETERLD+K+S+Y L+WED  FLSG SFYLPGSV+E DKQ+
Sbjct: 812 KFITKALEFRPKILILIVPPETERLDKKKSSYVLIWEDKTFLSGNSFYLPGSVNEEDKQL 871

Query: 873 DQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERNCYETHAVDHPKEEGQG 932
           + WN+  PPL LWSR D+AA HK +AEKH H+SR    ++++    E  A  HP     G
Sbjct: 872 EDWNIVPPPLSLWSRSDFAAKHKKIAEKHCHLSRDVGSSKIKIVEEEAKASLHPLGPSDG 931

Query: 933 DASMLIDLPLQ 943
              M  D+P++
Sbjct: 932 ---MCDDIPVE 939


>gi|225446989|ref|XP_002268363.1| PREDICTED: uncharacterized protein LOC100248222 [Vitis vinifera]
          Length = 1216

 Score =  955 bits (2468), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 502/958 (52%), Positives = 649/958 (67%), Gaps = 51/958 (5%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
           MASSD+E E+    + DYYF       + E ISFS LP+QW+++E   G K +I+LRG+A
Sbjct: 1   MASSDEEGEILPNCIRDYYFVD----YKDEPISFSILPLQWSKDENPDGLKMMIFLRGSA 56

Query: 61  DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSY 120
             GLQKI+K VIAWKF+L++V PEIFVLSK+ +W++LQ PRK ++ I RTIL+ V  L +
Sbjct: 57  YDGLQKIYKQVIAWKFELSSVEPEIFVLSKDKNWMELQSPRKSFQNIVRTILVTVSWLHF 116

Query: 121 AKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLE 180
            KRNPEA+ KS+W+ L + F  YE  PS+NDL+DHM LI+EA++R++ L KSK ++T+L 
Sbjct: 117 VKRNPEASGKSLWNHLLKSFSSYEFEPSENDLLDHMPLIQEAVKREEDLLKSKCMLTYLP 176

Query: 181 EKP-------------TKRKLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELFD 227
           EK              +   +    V T     FI    ++   +D +E+   + DELFD
Sbjct: 177 EKTGGETALYEGSLVLSSSNIPSSDVHTMRRYTFI----DDRDDNDEDEENDEDNDELFD 232

Query: 228 SVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 287
           SVC+ CDNGG+LLCCEGRCLRSFHAT+DAG ES C SLG +  +VEA+ NF CKNC+Y+Q
Sbjct: 233 SVCAICDNGGELLCCEGRCLRSFHATVDAGTESFCESLGFSDAQVEAIQNFLCKNCQYQQ 292

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
           HQCF CG LGSS++ +GAEVF C SATCG FYHP+CV+K L   + + A  L   I  G 
Sbjct: 293 HQCFVCGMLGSSNESSGAEVFRCASATCGRFYHPYCVAKRLHPMNNILAKHLQNKIAGGV 352

Query: 348 SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWE 407
           SFTCPLHKC +CK+GENK   DLQFA+CRRCPKAYHRKCLP  I+FE    E I+ RAW 
Sbjct: 353 SFTCPLHKCFVCKRGENKGVDDLQFALCRRCPKAYHRKCLPGNISFECIYNENIMQRAWI 412

Query: 408 GLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVA 467
           GLLPN RILIYC++H+I+ ++ TP R+HI FP  E          KK     PS  +KV 
Sbjct: 413 GLLPN-RILIYCMEHKINRKLRTPERNHIRFPDPESKG-------KKHVSELPSSNEKVM 464

Query: 468 STKSSLTSKAPPQGKFSVKALK-RVPSKAGQGETMEISERLLV--GSDSSRRAKATDVSR 524
           S K ++ S+  P    +VK  K  V       ++ +  E+     G D   + K  D ++
Sbjct: 465 SKKRNIVSEIFPAESTAVKMTKLEVHRVVKDVDSTKFFEKRCSSQGFDPPTKQKINDATK 524

Query: 525 KSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVT 584
           K  + NVKS+ V++  S +V   ++SL  R Y              KQ N   +  +  +
Sbjct: 525 KFLRDNVKSVPVKICASVAVKGTQSSL--RNYNIK----------PKQQNIPSKVEKITS 572

Query: 585 VKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKA 644
           +KP  K+  S  P +D + + R+  LMK   SS  +EE  ++ K+  ++   SK+ +D  
Sbjct: 573 LKPSMKRASSSQPLMDAELETRIVDLMKSTTSSFSLEEFREKQKVLCSY---SKNVLDST 629

Query: 645 ITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGM 704
           IT GKVE SV+AIRTAL+KL+  G SIEDAKAVCEPEVL+QI +WK KLKVYLAPFLHGM
Sbjct: 630 ITQGKVEVSVKAIRTALEKLE-KGCSIEDAKAVCEPEVLNQIMRWKRKLKVYLAPFLHGM 688

Query: 705 RYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYK 764
           RYTSFGRHFTKV+KL+ +VD+LHWYV  GDMIVDFCCG+NDFSCLMK+KLD+ GK+C +K
Sbjct: 689 RYTSFGRHFTKVEKLREVVDRLHWYVQHGDMIVDFCCGSNDFSCLMKEKLDKVGKSCSFK 748

Query: 765 NYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPK 824
           NYD++  KNDF+FEKRDWM++   EL  GS+LIMGLNPPFGVKA LANKFI+KAL F PK
Sbjct: 749 NYDLIQPKNDFSFEKRDWMSIHLDELPAGSQLIMGLNPPFGVKASLANKFIDKALSFRPK 808

Query: 825 LLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYL 884
           LLILIVP ET+RLD K+SAY+L+WED+  LSGKSFYLPGSVD +DKQ++QWN+  P LYL
Sbjct: 809 LLILIVPKETKRLDEKDSAYDLIWEDEDILSGKSFYLPGSVDMHDKQLEQWNLLPPLLYL 868

Query: 885 WSRHDYAAHHKALAEKHGHISRPQSRTQMERNCYETHAVDHPKEEGQ---GDASMLID 939
           WSR D+ + HKA+A+K GHIS  Q    +E N  E    ++  EE     GD S L++
Sbjct: 869 WSRPDWTSRHKAVAQKCGHISIEQKDFLVEGNNVEREVSNYLMEENHDCYGDFSNLMN 926


>gi|9758171|dbj|BAB08556.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1332

 Score =  954 bits (2467), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 509/978 (52%), Positives = 659/978 (67%), Gaps = 70/978 (7%)

Query: 15  VSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAADSGLQKIFKPVIAW 74
            S+YYFE +     KE +SF+ LPIQW+  E+  G     YLRG +D+GL  + K V AW
Sbjct: 18  ASNYYFEDD----DKEPVSFARLPIQWSVEEKVDGSGLGFYLRGRSDNGLLPLHKLVKAW 73

Query: 75  KFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSYAKRNPEATAKSIWD 134
           ++DL+N  PEI VL+K+N WIKL++PRK Y E+ RT+L+ +H + + +RNP+A+ K++W+
Sbjct: 74  RYDLSNFQPEISVLTKDNIWIKLEEPRKSYGELIRTVLVTLHSIQFLRRNPQASEKALWE 133

Query: 135 FLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLEEKPTKRKLSDEVVQ 194
            L+R    Y+V+PSQNDLVDH+ LI EA +RD  LA SKF++ FL +KPTKR+L DE   
Sbjct: 134 KLTRSLRSYDVKPSQNDLVDHIGLIAEAAKRDRNLANSKFILAFLTKKPTKRRLPDE--- 190

Query: 195 TKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATI 254
             A   FIV D E+  V   E++  +E+D+ F+SVC+ CDNGG++LCCEG CLRSFHAT 
Sbjct: 191 DNAKDDFIVGD-EDTYVASDEDELDDEDDDFFESVCAICDNGGEILCCEGSCLRSFHATK 249

Query: 255 DAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGA-EVFPCVSA 313
             GE+S C SLG  K +VEA+  +FC NCE+K HQCF C  LGSSD  +GA EVF CVSA
Sbjct: 250 KDGEDSLCDSLGFNKMQVEAIQKYFCPNCEHKIHQCFICKNLGSSDNSSGAAEVFQCVSA 309

Query: 314 TCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFA 373
           TCG+FYHPHCV++ L   ++  ++ L + IIAGE +TCPLHKC +C+ GE K DS+LQFA
Sbjct: 310 TCGYFYHPHCVTRRLRLGNKEESEALERQIIAGE-YTCPLHKCSVCENGEVKTDSNLQFA 368

Query: 374 VCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIR 433
           VCRRCPK+YHRKCLPR+I+FED  +E I+TRAW+GLL N R+LIYC +HEID+E+ TP+R
Sbjct: 369 VCRRCPKSYHRKCLPREISFEDIEDEDILTRAWDGLLHN-RVLIYCQEHEIDEELLTPVR 427

Query: 434 DHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRVPS 493
           DH+ FP  EE K  +   R+   L S  G+ K A  K    +     GK S  + +    
Sbjct: 428 DHVKFPFTEEQKVFVKEQRR--ILESHVGRDK-ARLKVKDPALQDTCGKASKNSFRSSFP 484

Query: 494 KAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSK------ 547
            +  G + +    +    D SR+ K  D S K      KS  +  D   +  +K      
Sbjct: 485 SSKDGFSTKKHGLVSSVPDHSRKRKDIDPSIKHKMVPQKSQKMMEDSREAGKNKLGVKEA 544

Query: 548 ------KTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDE 601
                 K SLGERL++   T+     K G+    D + ++T ++    K+  SE+P+LD 
Sbjct: 545 RDAGKSKISLGERLFS--YTQEPNPVKPGRVIPVDSKHNKTDSIA--SKEPGSEIPTLDN 600

Query: 602 DSKRRLSSLMKDAASSVRMEEILKRHKIPSTHA-YASKSAVDKAITLGKVEGSVEAIRTA 660
           DS+RRL ++MK A   + M  ILK+ KI ST + +++++ VDK IT+GKVEGSV+AIRTA
Sbjct: 601 DSQRRLLAVMKKATEEITMGTILKKFKIQSTMSTHSTRNVVDKTITMGKVEGSVQAIRTA 660

Query: 661 LKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQ 720
           LKKL+ +G +IEDAKAVCEPEVLSQI KWK+KLKVYLAPFLHG RYTSFGRHFT  +KLQ
Sbjct: 661 LKKLE-EGGNIEDAKAVCEPEVLSQILKWKDKLKVYLAPFLHGARYTSFGRHFTNPEKLQ 719

Query: 721 AIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKR 780
            IVD+LHWY +DGDMIVDFCCG+NDFSCLM  KL+ETGK CLYKNYD+ PAKN+FNFE++
Sbjct: 720 QIVDRLHWYADDGDMIVDFCCGSNDFSCLMNAKLEETGKKCLYKNYDLFPAKNNFNFERK 779

Query: 781 DWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETE----- 835
           DWMTV   EL PGS+LIMGLNPPFGV A LANKFI KALEF PK+LILIVPPETE     
Sbjct: 780 DWMTVSKDELEPGSKLIMGLNPPFGVNASLANKFITKALEFRPKILILIVPPETERFQFP 839

Query: 836 ------------------------------RLDRKESAYELVWEDDQFLSGKSFYLPGSV 865
                                         RLD+K+S+Y L+WED  FLSG SFYLPGSV
Sbjct: 840 SISSAPLYHSITLIYRLLSLSLVKSITFLNRLDKKKSSYVLIWEDKTFLSGNSFYLPGSV 899

Query: 866 DENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERNCYETHAVDH 925
           +E DKQ++ WN+  PPL LWSR D+AA HK +AEKH H+SR    ++++    E +A  H
Sbjct: 900 NEEDKQLEDWNLVPPPLSLWSRSDFAAKHKKIAEKHCHLSRDVGSSKLKIVEEEANASLH 959

Query: 926 PKEEGQGDASMLIDLPLQ 943
           P     G   M  D+P++
Sbjct: 960 PLGASDG---MCDDIPME 974


>gi|356499417|ref|XP_003518537.1| PREDICTED: uncharacterized protein LOC100806429 [Glycine max]
          Length = 1065

 Score =  904 bits (2335), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 469/916 (51%), Positives = 632/916 (68%), Gaps = 40/916 (4%)

Query: 2   ASSDDEVEVGQKL----VSDYYFEHEGERKQKEIISFSALPIQWNENERT-GGCKELIYL 56
           +SS+DE E   ++    VS+Y+   + +      +SF+ LPI W+++E +    K  +++
Sbjct: 6   SSSEDEAETEAEVQSLSVSNYHIVDDEDNP----VSFAVLPIHWSDSENSEAANKGKVFI 61

Query: 57  RGAADSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVH 116
            G  D+GL+KIF  V AW+FDL+NV  EI +LSK+  WIKLQKPRK ++   RT+LI +H
Sbjct: 62  DGDTDNGLKKIFMQVTAWRFDLSNVRLEISLLSKDGRWIKLQKPRKGFQNKIRTVLITLH 121

Query: 117 CLSYAKRNPEATAKSIWDFLSRFFCLYE--VRPSQNDLVDHMDLIKEALERDDVLAKSKF 174
            L   K+  + +  S+W  LS+   L    V+PSQ DL DH+ LI+EA +RD VLAKSK 
Sbjct: 122 FLHRVKKKRQMSEISVWQDLSKDTELSSSGVKPSQKDLSDHVPLIREAAKRDSVLAKSKL 181

Query: 175 LVTFLEEKPTKRKLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDS--VCSF 232
           L+  LE K   +KL D+ V   A  G  V  ++ DM+     DESNEE E  D   VC+ 
Sbjct: 182 LLMVLE-KLNSQKLLDKEVNDLAQPGLTVVGIDSDMI-----DESNEESEERDDLDVCAL 235

Query: 233 CDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFA 292
           CDNGG++ CC+G C+RSFHAT++AG E+ C SLG T+ EV+ + +F+CKNCEY QHQCFA
Sbjct: 236 CDNGGNVTCCDGVCMRSFHATVEAGRENSCVSLGFTQKEVDEIQSFYCKNCEYYQHQCFA 295

Query: 293 CGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCP 352
           CGKLGSSDK  GAEV  CVSATC  FYHPHCV+KLL +  +  A+ L ++I     F CP
Sbjct: 296 CGKLGSSDKVKGAEVIKCVSATCDRFYHPHCVAKLLPQLAKPVAEDLERNIADRVPFICP 355

Query: 353 LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPN 412
           LH CC+CK+ ENK D +LQFAVCRRCPK+YHRKCLPR+IA  ++  + II RAWEGLLPN
Sbjct: 356 LHYCCVCKELENKVDPELQFAVCRRCPKSYHRKCLPREIAPSNRGNKNIIQRAWEGLLPN 415

Query: 413 HRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSS 472
           +RILIYCL H+ID E+GTP+RDHI FP +E     I+   +++    P+ K++V   K +
Sbjct: 416 NRILIYCLNHKIDRELGTPVRDHIKFPNMEPTVQKINTTIEQKE---PATKERVILKKKN 472

Query: 473 L---TSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKG 529
           +    S      K S    K    K G  +T    ++++ GS+ SR+ K+ + SR     
Sbjct: 473 VDLDNSSGKSIAKGSKLTGKLSSHKVGSKKT----KKIISGSNISRKPKSKETSR-CLTE 527

Query: 530 NVKSLSVQVDRS-SSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPL 588
           N +S+S + + S S  +  + ++GE +YA    EG ++ K   + N+    +  ++VKP+
Sbjct: 528 NKRSISKKSEMSDSEQNYNQPTIGE-IYA-LQKEGLKRIKHDNKVNN--VITNPLSVKPI 583

Query: 589 RKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLG 648
            + L  ELP LD DS++ L +L K+A SS+ +E +L++H   STH +  ++ V+K IT+G
Sbjct: 584 -ESLSVELPPLDADSEKSLLTLFKEARSSITLESVLEKHTFASTHTHPLRNVVEKTITMG 642

Query: 649 KVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTS 708
           K+E SV A++TAL+KL+  G SI+D KA C+P+ L Q+FKWK++LK+YLAP L+G RYTS
Sbjct: 643 KLEYSVNAVQTALRKLE-SGCSIQDVKAFCDPDDLKQLFKWKDELKIYLAPVLYGNRYTS 701

Query: 709 FGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDI 768
           +GRHFT V+KL+ IVDKLHWYV + D IVDFCCGANDFS LMKKKL+E GK C Y+NYD+
Sbjct: 702 YGRHFTLVEKLEGIVDKLHWYVQNSDTIVDFCCGANDFSILMKKKLEENGKKCSYRNYDL 761

Query: 769 LPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLIL 828
           LP KNDF+FE+RDWMTV+P EL  GS+LIMGLNPPFG KA LANKF++KALEF PKL+IL
Sbjct: 762 LPTKNDFSFERRDWMTVQPTELPTGSQLIMGLNPPFGHKAALANKFVDKALEFKPKLVIL 821

Query: 829 IVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRH 888
           IVPPETERLD+K+S Y+L+WED+ FLSG SFYLPGSV   D+QMDQ N   P L LWSR 
Sbjct: 822 IVPPETERLDKKQSPYDLIWEDENFLSGTSFYLPGSV---DRQMDQRNARPPLLSLWSRP 878

Query: 889 DYAAHHKALAEKHGHI 904
           D+   HK +A+++GH+
Sbjct: 879 DWTTKHKVIAQENGHV 894


>gi|222640382|gb|EEE68514.1| hypothetical protein OsJ_26949 [Oryza sativa Japonica Group]
          Length = 1311

 Score =  823 bits (2126), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 444/934 (47%), Positives = 583/934 (62%), Gaps = 85/934 (9%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
           M SSDD++E   K V +YYF  + +      +SF  LP Q++  E     K+ +YLRG  
Sbjct: 1   MMSSDDDLEPQLKAVENYYFVDDNDVP----VSFDVLPFQFDAAEGVASFKKDVYLRGFT 56

Query: 61  DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSY 120
           D GLQK++K V+AWK  L    PEI VLS E SWI L KPR  YEE  R++LI V  L +
Sbjct: 57  DGGLQKVYKQVVAWKLVLDGDSPEIAVLSTEGSWIALLKPRPSYEETIRSVLITVEMLHF 116

Query: 121 AKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLE 180
            +R P  + K +WD L   F  + VRP ++D  +H +LIK   +RD  LA S+ L  F++
Sbjct: 117 VRRRPTDSEKDMWDHLYGVFERFVVRPLEDDFANHQNLIKLFAQRDPDLANSQVLQVFIK 176

Query: 181 EKPTKR-----------------KLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEED 223
           +K  ++                 K   ++ Q    +G   D+MEE +     +  SN++D
Sbjct: 177 DKIMEKTNEVGSNNLDNKREPDIKQEPDIKQEPVAAG---DEMEEIVEEGIPDAPSNDDD 233

Query: 224 E---------LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEA 274
           +         LFDSVC+ CDNGG+LLCCEG C+RSFHA I  GE+S+CA+LG TK EV+A
Sbjct: 234 DDEEDEEDGDLFDSVCAICDNGGELLCCEGPCMRSFHAKIRDGEDSYCATLGYTKAEVKA 293

Query: 275 MLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEV 334
           + NF CKNC++KQHQCF CG+L  SD    A+VF C +ATCGHFYHP CV++LL  +   
Sbjct: 294 LKNFVCKNCDHKQHQCFVCGELEPSDG-PNAKVFLCNNATCGHFYHPRCVAQLLHPNSRN 352

Query: 335 AADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFE 394
            A ++ K I+AG SFTCP+H C  CK  E++    LQFAVCRRCP++YHRKCLPR+I+FE
Sbjct: 353 EASEMEKKIMAGFSFTCPVHWCFHCKGLEDRTQEPLQFAVCRRCPRSYHRKCLPREISFE 412

Query: 395 DKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEEN-----KTIID 449
           D   +GIITRAWE    + RILIYCL HEID +IGTP RDHI FP +E++     K + +
Sbjct: 413 DINTQGIITRAWEL---SKRILIYCLDHEIDLDIGTPPRDHIKFPHVEKSAYSAKKKVKE 469

Query: 450 RPRKKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLV 509
              KK+ +   S   +    ++ L  K   +G  S KA          G   E  E L  
Sbjct: 470 LAEKKRRICDDSYVSEPLQKRAKLNEKFNAKGDKSKKA----------GVKSEFEEVL-- 517

Query: 510 GSDSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKF 569
                          +S K   +SL            K+T   E L              
Sbjct: 518 ---------------ESEKKKTRSL-----------KKRTQPEEPLVECAAAAAANNA-- 549

Query: 570 GKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKI 629
             +   +RE     +   + K  +S  P +D ++++R+S+L++   SS+ + +I +R  I
Sbjct: 550 -NRPVKEREKELGTSSLDMGKIPLSSFPIVDSETEKRISALVEKEVSSLTVADISRRCVI 608

Query: 630 PSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKW 689
           PST+A  S   +DK +  GK+E S++A++ AL+KL+ +G +++DAKAVCE EVL Q+ +W
Sbjct: 609 PSTYA-CSGRQIDKIVVRGKLERSIQAVKAALQKLE-NGGAVDDAKAVCESEVLRQLTRW 666

Query: 690 KNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCL 749
            NKL+VYLAPF+HGMRYTSFGRHFTK +KL  I +KLHWYV  GDMIVDF CG NDFS  
Sbjct: 667 HNKLRVYLAPFIHGMRYTSFGRHFTKKEKLIEIAEKLHWYVQPGDMIVDFSCGTNDFSQF 726

Query: 750 MKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAG 809
           MK+KLD+ GK C +KNYD++  KN F+FEKRDWMTV  KEL  GS+LIMGLNPPFG KA 
Sbjct: 727 MKEKLDKVGKRCNFKNYDVIQPKNSFSFEKRDWMTVRQKELPHGSKLIMGLNPPFGPKAM 786

Query: 810 LANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDEND 869
           LANKFI+KAL F PKL+ILIVP E ERLDRK+  Y+LVWEDDQ LSGKSFYLPGS+D +D
Sbjct: 787 LANKFIDKALTFKPKLIILIVPKEAERLDRKQQPYDLVWEDDQRLSGKSFYLPGSLDVSD 846

Query: 870 KQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGH 903
           KQ+DQWN + PPLYLWSR D+   HK +AE+HGH
Sbjct: 847 KQIDQWNKSPPPLYLWSRPDWTQKHKRIAEQHGH 880


>gi|218200976|gb|EEC83403.1| hypothetical protein OsI_28850 [Oryza sativa Indica Group]
          Length = 1283

 Score =  823 bits (2125), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 442/908 (48%), Positives = 579/908 (63%), Gaps = 61/908 (6%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
           M SSDD++E   K V +YYF  + +      +SF  LP Q++  E     K+ +YLRG  
Sbjct: 1   MMSSDDDLEPQLKAVENYYFVDDNDVP----VSFDVLPFQFDAAEGVASFKKDVYLRGFT 56

Query: 61  DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSY 120
           D GLQK++K V+AWK  L    PEI VLS E SWI L KPR  YEE  R++LI V  L +
Sbjct: 57  DGGLQKVYKQVVAWKLVLDGDSPEIAVLSTEGSWIALLKPRPSYEETIRSVLITVEMLHF 116

Query: 121 AKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLE 180
            +R P  + K +WD L   F  + VRP ++D  +H +LIK   +RD  LA S+ L  F++
Sbjct: 117 VRRRPTDSEKDMWDHLYGVFERFVVRPLEDDFANHQNLIKLFAQRDPDLANSQVLQVFIK 176

Query: 181 EKPTKRKLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLL 240
           +K  ++  ++EV Q + +   I D    D   D E++E  +    FDSVC+ CDNGG+LL
Sbjct: 177 DKIMEK--TNEVAQEEIVEEGIPDVPSNDDDDDEEDEEDGDS---FDSVCAICDNGGELL 231

Query: 241 CCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSD 300
           CCEG C+RSFHA I  GE+S+CA+LG TK EV+A+ NF CKNC++KQHQCF CG+L  SD
Sbjct: 232 CCEGPCMRSFHAKIRDGEDSYCATLGYTKAEVKALKNFVCKNCDHKQHQCFVCGELEPSD 291

Query: 301 KETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICK 360
               A+VF C +ATCGHFYHP CV++LL  +    A ++ K I+AG SFTCP+H C  CK
Sbjct: 292 G-PNAKVFLCNNATCGHFYHPRCVAQLLHPNSRNEASEMEKKIMAGFSFTCPVHWCFHCK 350

Query: 361 QGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCL 420
             E++    LQFAVCRRCP++YHRKCLPR+I+FED   +GIITRAWE    + RILIYCL
Sbjct: 351 GLEDRTQEPLQFAVCRRCPRSYHRKCLPREISFEDINTQGIITRAWEL---SKRILIYCL 407

Query: 421 KHEIDDEIGTPIRDHIIFPGIEEN-----KTIIDRPRKKQSLASPSGKQKVASTKSSLTS 475
            HEID +IGTP RDHI FP +E++     K + +   KK+ +   S   +    ++ L  
Sbjct: 408 DHEIDLDIGTPPRDHIKFPHVEKSAYSAKKKVKELAEKKRRICDDSYVSEPLQKRAKLNE 467

Query: 476 KAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLS 535
           K   +G  S KA          G   E  E L                 +S K   +SL 
Sbjct: 468 KFNAKGDKSKKA----------GVKSEFEEVL-----------------ESEKKKTRSL- 499

Query: 536 VQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISE 595
                      K+T   E L                +   +RE     +   + K  +S 
Sbjct: 500 ----------KKRTQPEEPLVECAAAAAANNA---NRPVKEREKELGTSSLDMGKIPLSS 546

Query: 596 LPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVE 655
            P +D ++++R+S+L++   SS+ + +I +R  IPST+A  S   +DK +  GK+E S++
Sbjct: 547 FPIVDSETEKRISALVEKEVSSLTVADISRRCVIPSTYA-CSGRQIDKIVVRGKLERSIQ 605

Query: 656 AIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTK 715
           A++ AL+KL+ +G +++DAKAVCE EVL Q+ +W NKL+VYLAPF+HGMRYTSFGRHFTK
Sbjct: 606 AVKAALEKLE-NGGAVDDAKAVCESEVLRQLTRWHNKLRVYLAPFIHGMRYTSFGRHFTK 664

Query: 716 VDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDF 775
            +KL  I +KLHWYV  GDMIVDF CG NDFS  MK+KLD+ GK C +KNYD++  KN F
Sbjct: 665 KEKLIEIAEKLHWYVQPGDMIVDFSCGTNDFSQFMKEKLDKVGKRCNFKNYDVIQPKNSF 724

Query: 776 NFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETE 835
           +FEKRDWMTV  KEL  GS+LIMGLNPPFG KA LANKFI+KAL F PKL+ILIVP E E
Sbjct: 725 SFEKRDWMTVRQKELPHGSKLIMGLNPPFGPKAMLANKFIDKALTFKPKLIILIVPKEAE 784

Query: 836 RLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHK 895
           RLDRK+  Y+LVWEDDQ LSGKSFYLPGS+D +DKQ+DQWN + PPLYLWSR D+   HK
Sbjct: 785 RLDRKQQPYDLVWEDDQRLSGKSFYLPGSLDVSDKQIDQWNKSPPPLYLWSRPDWTQKHK 844

Query: 896 ALAEKHGH 903
            +AE+HGH
Sbjct: 845 RIAEQHGH 852


>gi|297739138|emb|CBI28789.3| unnamed protein product [Vitis vinifera]
          Length = 1025

 Score =  769 bits (1986), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 397/734 (54%), Positives = 499/734 (67%), Gaps = 43/734 (5%)

Query: 225 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
           LFDSVC+ CDNGG+LLCCEGRCLRSFHAT+DAG ES C SLG +  +VEA+ NF CKNC+
Sbjct: 26  LFDSVCAICDNGGELLCCEGRCLRSFHATVDAGTESFCESLGFSDAQVEAIQNFLCKNCQ 85

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
           Y+QHQCF CG LGSS++ +GAEVF C SATCG FYHP+CV+K L   + + A  L   I 
Sbjct: 86  YQQHQCFVCGMLGSSNESSGAEVFRCASATCGRFYHPYCVAKRLHPMNNILAKHLQNKIA 145

Query: 345 AGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITR 404
            G SFTCPLHKC +CK+GENK   DLQFA+CRRCPKAYHRKCLP  I+FE    E I+ R
Sbjct: 146 GGVSFTCPLHKCFVCKRGENKGVDDLQFALCRRCPKAYHRKCLPGNISFECIYNENIMQR 205

Query: 405 AWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQ 464
           AW GLLPN RILIYC++H+I+ ++ TP R+HI FP  E          KK     PS  +
Sbjct: 206 AWIGLLPN-RILIYCMEHKINRKLRTPERNHIRFPDPESKG-------KKHVSELPSSNE 257

Query: 465 KVASTKSSLTSKAPPQGKFSVKALK-RVPSKAGQGETMEISERLLV--GSDSSRRAKATD 521
           KV S K ++ S+  P    +VK  K  V       ++ +  E+     G D   + K  D
Sbjct: 258 KVMSKKRNIVSEIFPAESTAVKMTKLEVHRVVKDVDSTKFFEKRCSSQGFDPPTKQKIND 317

Query: 522 VSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSR 581
            ++K  + NVKS+ V++  S +V   ++SL  R Y              KQ N   +  +
Sbjct: 318 ATKKFLRDNVKSVPVKICASVAVKGTQSSL--RNYNI----------KPKQQNIPSKVEK 365

Query: 582 TVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAV 641
             ++KP  K+  S  P +D + + R+  LMK   SS  +EE  ++ K+  ++   SK+ +
Sbjct: 366 ITSLKPSMKRASSSQPLMDAELETRIVDLMKSTTSSFSLEEFREKQKVLCSY---SKNVL 422

Query: 642 DKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFL 701
           D  IT GKVE SV+AIRTAL+KL+  G SIEDAKAVCEPEVL+QI +WK KLKVYLAPFL
Sbjct: 423 DSTITQGKVEVSVKAIRTALEKLE-KGCSIEDAKAVCEPEVLNQIMRWKRKLKVYLAPFL 481

Query: 702 HGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDM-------------IVDFCCGANDFSC 748
           HGMRYTSFGRHFTKV+KL+ +VD+LHWYV  GDM             IVDFCCG+NDFSC
Sbjct: 482 HGMRYTSFGRHFTKVEKLREVVDRLHWYVQHGDMSFYNSLFPECEIRIVDFCCGSNDFSC 541

Query: 749 LMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKA 808
           LMK+KLD+ GK+C +KNYD++  KNDF+FEKRDWM++   EL  GS+LIMGLNPPFGVKA
Sbjct: 542 LMKEKLDKVGKSCSFKNYDLIQPKNDFSFEKRDWMSIHLDELPAGSQLIMGLNPPFGVKA 601

Query: 809 GLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDEN 868
            LANKFI+KAL F PKLLILIVP ET+RLD K+SAY+L+WED+  LSGKSFYLPGSVD +
Sbjct: 602 SLANKFIDKALSFRPKLLILIVPKETKRLDEKDSAYDLIWEDEDILSGKSFYLPGSVDMH 661

Query: 869 DKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERNCYETHAVDHPKE 928
           DKQ++QWN+  P LYLWSR D+ + HKA+A+K GHIS  Q    +E N  E    ++  E
Sbjct: 662 DKQLEQWNLLPPLLYLWSRPDWTSRHKAVAQKCGHISIEQKDFLVEGNNVEREVSNYLME 721

Query: 929 EGQ---GDASMLID 939
           E     GD S L++
Sbjct: 722 ENHDCYGDFSNLMN 735


>gi|357512983|ref|XP_003626780.1| hypothetical protein MTR_8g009000 [Medicago truncatula]
 gi|355520802|gb|AET01256.1| hypothetical protein MTR_8g009000 [Medicago truncatula]
          Length = 1054

 Score =  681 bits (1756), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/711 (48%), Positives = 471/711 (66%), Gaps = 49/711 (6%)

Query: 221 EEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFC 280
           E+D  +D+VC+ CDNGG++L CEGRCLRSFHAT++ G +S CASLG T+ EV A  NF+C
Sbjct: 305 EQDIGYDTVCAICDNGGEILPCEGRCLRSFHATLEDGRDSLCASLGYTRTEVNAFPNFYC 364

Query: 281 KNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLA 340
           +NC++K+HQCFACGKLGSSD+ +  EVFPCV+A CGH+YHP CV++LL    ++  +++ 
Sbjct: 365 ENCKHKKHQCFACGKLGSSDESSNPEVFPCVTANCGHYYHPECVARLLYPGIDIGQEEMR 424

Query: 341 KSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEG 400
           K II  ++F CPLH C +C++GEN+   DLQFA+CRRCPKAYHRKCLP++I+F      G
Sbjct: 425 KRIIIEKTFVCPLHICSLCRKGENRNVHDLQFAMCRRCPKAYHRKCLPKEISFTYDYYTG 484

Query: 401 IITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIID-RPRKKQSLAS 459
           I  RAW+ LL + RIL+YC+ H+I  E+GTP RDH+IFP  E  + II      ++  A 
Sbjct: 485 IEMRAWDNLL-DKRILMYCMNHKIVLELGTPARDHLIFPNKEVKRKIISTESLHREKDAI 543

Query: 460 PSGK-------QKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSD 512
           P  K        K    K ++  +   Q   S K ++++ SK  Q   M       V  D
Sbjct: 544 PLKKFFEDLLPNKTLKPKMTIKERVGLQMGGSSKVMEKICSK--QDTHMSTGP---VYFD 598

Query: 513 SSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQ 572
            +R+     +  ++  G+ +SL    +    + +   S   RL         ++ ++ +Q
Sbjct: 599 RARKY----LKVETMSGSNRSLP-NYENKVPLKNVNLSCNPRL---------DEARY-QQ 643

Query: 573 DNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPST 632
             S      T   KP  KK+ + L     D ++R+ +LM++A S++ ME+  K +   +T
Sbjct: 644 KRSVGRIEETSWKKPPVKKVKTSLEDRKADMEKRILALMEEATSTLNMEKFKKDNHAVNT 703

Query: 633 HAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKN- 691
            +  +++   K +TLGKVEGSV+A++ AL+KLD +G  IE+AKA+CEPE++ Q+F W+N 
Sbjct: 704 SSSLTETVFRKTLTLGKVEGSVKAVQIALQKLD-EGCGIEEAKAICEPEIIRQLFTWQNE 762

Query: 692 ------------------KLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDG 733
                             +LK+YLAPFLHG RYTSFGRHFTK+DKL+ IVD+LHWYV  G
Sbjct: 763 VAGIKSNSRPLGYRGSDKQLKIYLAPFLHGRRYTSFGRHFTKIDKLKEIVDRLHWYVQSG 822

Query: 734 DMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPG 793
           D ++DFCCGANDFSCLMK KL++TGK C +KNYD+  AKNDFNFEKRDWM+V+ +EL  G
Sbjct: 823 DTVLDFCCGANDFSCLMKSKLEQTGKLCSFKNYDLFQAKNDFNFEKRDWMSVQAEELPHG 882

Query: 794 SRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQF 853
           S LI+GLNPPFGV+  LANKFI+KAL F PKLLILIVP  T RLDRK++ Y L+W D++ 
Sbjct: 883 SNLIIGLNPPFGVRGSLANKFIDKALTFKPKLLILIVPKVTRRLDRKKAGYNLIWADEEI 942

Query: 854 LSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHI 904
            SGKSFYLPGSVD  DKQ++ WN+  PPLYLWSR D+ A H  +A  H HI
Sbjct: 943 CSGKSFYLPGSVDTRDKQLEDWNLKPPPLYLWSRPDWTAWHMQIARMHRHI 993



 Score =  129 bits (325), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 76/202 (37%), Positives = 121/202 (59%), Gaps = 4/202 (1%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
           MASSD+E E+    V  Y+FE++    + E +S S+L + W+ N+ T   +  ++LRG  
Sbjct: 1   MASSDEEGEIVPDSVDGYWFEND----KAEFVSLSSLTLLWSVNDVTCNSEAKVFLRGTT 56

Query: 61  DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSY 120
           D+GLQKI K +I W+F+L+   PEI VL +E  W+ L KP KC+E   R++L+ V+ L  
Sbjct: 57  DNGLQKIHKQIIGWRFELSFEQPEISVLLREKYWMTLLKPSKCFENTIRSVLVTVYWLHC 116

Query: 121 AKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLE 180
            K  PE +  SI   + + F  +++ PS+ND++ HM LI EA++RD  L KSK+L+ F+ 
Sbjct: 117 VKWKPEESRASILVKVLKEFSSFDITPSENDVLCHMALISEAVKRDTDLTKSKYLLNFIG 176

Query: 181 EKPTKRKLSDEVVQTKAMSGFI 202
           +  +      +V  TK +   +
Sbjct: 177 KTCSNGDFHADVHTTKKLKVIV 198


>gi|357141809|ref|XP_003572354.1| PREDICTED: uncharacterized protein LOC100831561 [Brachypodium
           distachyon]
          Length = 888

 Score =  639 bits (1648), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 352/775 (45%), Positives = 460/775 (59%), Gaps = 98/775 (12%)

Query: 215 EEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEA 274
           E D+    D + D +C+ CDNGG++  CEG+CLRSFHAT  AGE+  C +LG T  + +A
Sbjct: 3   ENDDEVSSDSI-DLICALCDNGGEIASCEGKCLRSFHATKGAGED--CKTLGYTTVQFDA 59

Query: 275 MLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEV 334
           +  F CKNCE + +QCFAC +LGS+ K    EVFPC S  CGHFYH  CV+ LL  ++E 
Sbjct: 60  IKVFLCKNCEREIYQCFACHRLGSA-KTNPPEVFPCASPNCGHFYHAKCVAHLLFPENEA 118

Query: 335 AADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFE 394
            A +    I+ G  F CP+H+C +CK GE K    LQFAVCRRCPK+YHR+CLPRKI+F+
Sbjct: 119 KATEYMTKIVNGAKFGCPVHRCDVCKYGEKKEVEALQFAVCRRCPKSYHRRCLPRKISFD 178

Query: 395 DKLEEGI--ITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPR 452
           D +E G+    RAW+GLLPN+RILIYC+KH+ID ++ TP+RDHI FPG            
Sbjct: 179 DIIENGVCHFQRAWDGLLPNNRILIYCMKHDIDPKLRTPLRDHIKFPG------------ 226

Query: 453 KKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSD 512
                  P+  +K +                +V ++KRV  K  + E     E + V   
Sbjct: 227 ------DPALSRKPS----------------NVNSMKRV--KIRRLEECVAEECVAVPLS 262

Query: 513 SSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQ 572
           +S+R+  T     S     K +       +    ++   GE+     VT     + F + 
Sbjct: 263 NSKRSFGTTTCSSSSNLIAKRMKAPESGGAKACYRRPVSGEKA----VTSVIPVSSFPEV 318

Query: 573 D-NSDR-------ETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEIL 624
           D N+ R       +TS  +T++ ++KKL+              +S MK+  +        
Sbjct: 319 DINTARRIYEFAQKTSSEITIEDVQKKLV---------VSSTYTSFMKNTDT-------- 361

Query: 625 KRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLS 684
                               ITLGKVE SVEAIRTAL  L+ +G+ IE AK VC    L 
Sbjct: 362 --------------------ITLGKVEKSVEAIRTALHMLE-NGAGIEAAKGVCTQHDLF 400

Query: 685 QIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGAN 744
           Q+ KWKNKL +YLAPFLHGMRYTS+GRHFTK+DKL+ IVDKL WYV  GD +VDFCCG+N
Sbjct: 401 QLAKWKNKLNIYLAPFLHGMRYTSYGRHFTKLDKLEKIVDKLQWYVQSGDTVVDFCCGSN 460

Query: 745 DFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPF 804
           DFS L+K+KL+++ KNC YKNYD++  KNDFNFE+RDWMTV+P EL  GSRLIMGLNPPF
Sbjct: 461 DFSTLLKEKLEDSEKNCFYKNYDLIQPKNDFNFERRDWMTVQPDELPAGSRLIMGLNPPF 520

Query: 805 GVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGS 864
           G KA LAN+FINKAL F PKL++LIVP ETERLD+K   YEL+WED   LSGKSFYLPGS
Sbjct: 521 GFKASLANQFINKALSFKPKLIVLIVPRETERLDKKYPPYELIWEDSNQLSGKSFYLPGS 580

Query: 865 VDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHI-SRPQSRTQMERNCYETHAV 923
            D ++KQM+QWNM+ PPL LWSR DYA  H  +A   GH+ S+        RN  +  +V
Sbjct: 581 FDADNKQMEQWNMSPPPLSLWSRSDYAQRHYEIARSKGHLCSKNPCSGDSHRNIADIASV 640

Query: 924 DHPKEEGQGDA-----SMLIDLPLQINVTKELRNEAREDDKAGFPDNATEGGGES 973
              +    G+A      M  + P   +V ++L  +   D  +   D  T+  G S
Sbjct: 641 STTRNLEIGNADINDEEMQGERPATASVIEQLLADTYHDTTSSPGDYWTDTNGRS 695


>gi|218201406|gb|EEC83833.1| hypothetical protein OsI_29778 [Oryza sativa Indica Group]
          Length = 955

 Score =  617 bits (1591), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 353/803 (43%), Positives = 466/803 (58%), Gaps = 135/803 (16%)

Query: 213 DTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDE- 271
           D  E ES+      D VC+ CDNGG++  CEG+CLRSFHA  DAGE+  C +LG T+ + 
Sbjct: 3   DNAEGESSAN---VDLVCALCDNGGEIASCEGKCLRSFHAVRDAGED--CQTLGYTRRQF 57

Query: 272 ---------------------------------VEAMLNFFCKNCEYKQHQCFACGKLGS 298
                                            ++A+  F CKNCE +++QCFAC +LGS
Sbjct: 58  DVSVLNLPLMNDTVFTSHVLHILDGNIHLAFPYLQALNPFLCKNCELEKYQCFACMRLGS 117

Query: 299 SDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCI 358
           +  +T  EVFPC SA CG+FYH  CV++LL  ++E  A +    I +G  F CPLHKC +
Sbjct: 118 AKTDT-PEVFPCASANCGYFYHAKCVAQLLFTENEAKALEYTTKIASGVKFACPLHKCDV 176

Query: 359 CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGII--TRAWEGLLPNHRIL 416
           CK GENK + +LQFAVCRR            KIAF+D ++ G+    RAWEGLLPN+RIL
Sbjct: 177 CKYGENKDEKELQFAVCRR------------KIAFDDFVDNGVFHFQRAWEGLLPNNRIL 224

Query: 417 IYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSK 476
           I+CLKH+ID ++ TP RDHI FP   +N  +  +P     +      +KV   K  L   
Sbjct: 225 IFCLKHDIDPKLRTPTRDHIKFP---DNPAVTRKPFDVNGM-----NKKV--VKIRLLED 274

Query: 477 APPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSV 536
            PP                             + SD           +KSF G V   S 
Sbjct: 275 CPPAP---------------------------LSSD-----------KKSF-GTVNRFS- 294

Query: 537 QVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISEL 596
               SS V +KK  +        V+ GT+           RE +   +  PL     S  
Sbjct: 295 ----SSDVITKKRKV-------LVSGGTKHCALSA---VAREKTSVPSFIPL-----SSF 335

Query: 597 PSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEA 656
           P +D+ ++RR+    +  +S + +E+I K+  +PSTH   SK+  DK ITLG V+ SVEA
Sbjct: 336 PVIDKSTERRIHEFAQKVSSDITIEDIQKKLVVPSTHTPVSKN-TDK-ITLGMVQRSVEA 393

Query: 657 IRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKV 716
           I  AL  L+ +G+SIED K+VC P  L Q+ +WKNKL +YLAPFLHGMRYTS+GRHFTK+
Sbjct: 394 INAALHMLE-NGASIEDVKSVCAPSDLFQLARWKNKLNIYLAPFLHGMRYTSYGRHFTKL 452

Query: 717 DKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFN 776
           DKL+ IVD+L WY+  GD +VDFCCG+NDFS L+K+KL+ + K+C YKN+D++  KNDFN
Sbjct: 453 DKLEQIVDRLQWYIESGDTVVDFCCGSNDFSLLLKEKLEASEKSCFYKNFDLIQPKNDFN 512

Query: 777 FEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETER 836
           FE+RDWMTV+P EL  G RLIMGLNPPFG KA LAN+FINKAL F PKL+ILIVP ETER
Sbjct: 513 FERRDWMTVQPDELPTGCRLIMGLNPPFGFKASLANQFINKALTFKPKLIILIVPKETER 572

Query: 837 LDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKA 896
           LDRK   YEL+WED   L+GKSFYLPGS+D ++K M+QWNM+ PPL LWSR D+A  HK 
Sbjct: 573 LDRKYPPYELIWEDSHQLAGKSFYLPGSLDADNKIMEQWNMSPPPLSLWSRSDWARKHKE 632

Query: 897 LAEKHGHISR-PQSRTQMERNCYETHAVDHPKEEGQGDASM-----LIDLPLQINVTKEL 950
           +A+  GHIS+        +R+   T    H +   +GD  +       + PL  +V  +L
Sbjct: 633 IAKTMGHISKNVWCLDDTQRSVVNT---GHAQMANEGDDDLDNKERQEEAPLNASVIDQL 689

Query: 951 RNEAREDDKAGFPDNATEGGGES 973
            ++   D  +   D  T+  G S
Sbjct: 690 LSDTYHDPTSSPGDYWTDTNGRS 712


>gi|222640820|gb|EEE68952.1| hypothetical protein OsJ_27843 [Oryza sativa Japonica Group]
          Length = 846

 Score =  542 bits (1397), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 304/675 (45%), Positives = 401/675 (59%), Gaps = 95/675 (14%)

Query: 307 VFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKA 366
           VFPC SA CG+FYH  CV++LL  ++E  A +    I +G  F CPLHKC +CK GENK 
Sbjct: 16  VFPCASANCGYFYHAKCVAQLLFTENEAKALEYTTKIASGVKFACPLHKCDVCKYGENKD 75

Query: 367 DSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGII--TRAWEGLLPNHRILIYCLKHEI 424
           + +LQFAVCRR            KIAF+D ++ G+    RAWEGLLPN+RILI+CLKH+I
Sbjct: 76  EKELQFAVCRR------------KIAFDDFVDNGVFHFQRAWEGLLPNNRILIFCLKHDI 123

Query: 425 DDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQGKFS 484
           D ++ TP RDHI FP   +N  +  +P                               F 
Sbjct: 124 DPKLRTPTRDHIKFP---DNPAVTRKP-------------------------------FD 149

Query: 485 VKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSV 544
           V  + +   K  +   +E      + SD           +KSF G V   S     SS V
Sbjct: 150 VNGMNK---KVVKIRLLEDCPPAPLSSD-----------KKSF-GTVNRFS-----SSDV 189

Query: 545 DSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSK 604
            +KK  +        V+ GT+           RE +   +  PL     S  P +D+ ++
Sbjct: 190 ITKKRKV-------LVSGGTKHCALSA---VAREKTSVPSFIPL-----SSFPVIDKSTE 234

Query: 605 RRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKL 664
           RR+    +  +S + +E+I K+  +PSTH   SK+  DK ITLG V+ SVEAI  AL  L
Sbjct: 235 RRIHEFAQKVSSDITIEDIQKKLVVPSTHTPVSKN-TDK-ITLGMVQRSVEAINAALHML 292

Query: 665 DVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVD 724
           + +G+SIED K+VC P  L Q+ +WKNKL +YLAPFLHGMRYTS+GRHFTK+DKL+ IVD
Sbjct: 293 E-NGASIEDVKSVCAPSDLFQLARWKNKLNIYLAPFLHGMRYTSYGRHFTKLDKLEQIVD 351

Query: 725 KLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMT 784
           +L WY+  GD +VDFCCG+NDFS L+K+KL+ + K+C YKN+D++  KNDFNFE+RDWMT
Sbjct: 352 RLQWYIESGDTVVDFCCGSNDFSLLLKEKLEASEKSCFYKNFDLIQPKNDFNFERRDWMT 411

Query: 785 VEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAY 844
           V+P EL  G RLIMGLNPPFG KA LAN+FINKAL F PKL+ILIVP ETERLDRK   Y
Sbjct: 412 VQPDELPTGCRLIMGLNPPFGFKASLANQFINKALTFKPKLIILIVPKETERLDRKYPPY 471

Query: 845 ELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHI 904
           EL+WED   L+GKSFYLPGS+D ++K M+QWNM+ PPL LWSR D+A  HK +A+  GHI
Sbjct: 472 ELIWEDSHQLAGKSFYLPGSLDADNKIMEQWNMSPPPLSLWSRSDWARKHKEIAKTMGHI 531

Query: 905 SR-PQSRTQMERNCYETHAVDHPKEEGQGDASM-----LIDLPLQINVTKELRNEAREDD 958
           S+        +R+   T    H +   +GD  +       + PL  +V  +L ++   D 
Sbjct: 532 SKNVWCLDDTQRSVVNT---GHAQMANEGDDDLDNKERQEEAPLNASVIDQLLSDTYHDP 588

Query: 959 KAGFPDNATEGGGES 973
            +   D  T+  G S
Sbjct: 589 TSSPGDYWTDTNGRS 603


>gi|414588820|tpg|DAA39391.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 751

 Score =  519 bits (1336), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 298/768 (38%), Positives = 435/768 (56%), Gaps = 90/768 (11%)

Query: 15  VSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAADSGLQKIFKPVIAW 74
           V  YYF         E + FS LP  + + +    CK+ + LRG       K++  V+AW
Sbjct: 18  VESYYFV--DSTNDNEPVCFSTLPFLFGDTDDLPDCKKRLVLRGTGGDPGVKVYSEVVAW 75

Query: 75  KFDLTNVIPEIFVLSKE-NSWIKLQKPRKCYEEIYRTILIMVHCLSYAKRNPEATAKSIW 133
              L    P+  VL+ E   W++L +P   YEE+ RT+LI  H L +  R P    +++W
Sbjct: 76  SLGLEGKQPQFAVLTAEGRRWVRLVRPLNSYEEMIRTVLITAHMLHFLMRKPHEPERTLW 135

Query: 134 DFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKS-----------------KFLV 176
           + L + F  ++VRPS++D  +H+ L+K+ + +D +LAKS                 +FL+
Sbjct: 136 NHLCKVFNKFDVRPSEDDFRNHISLMKQFVAKDPILAKSEGSLPHEWGLTKGKTKAQFLL 195

Query: 177 TFLEEKPTKRKLSDEVVQT------KAMSG----------------FIVDD--MEEDMVH 212
           +++  +    K+ + V++T      +  SG                FI DD  ++E +V 
Sbjct: 196 SYVMPRAVLCKVVNIVLKTILRVFVEGKSGKIFTEAGADHIDVKQPFITDDEDIDEMIVE 255

Query: 213 DTEEDESNEED-ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDE 271
           D + +   EED +LFDS+C+ CD+GGD+LCC+G C+RSFHA   +GE+S+C +LG T+ E
Sbjct: 256 DADNESDEEEDGDLFDSICAICDDGGDILCCDGPCMRSFHAKEGSGEDSYCDTLGYTEAE 315

Query: 272 VEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRD 331
           VEAM  F CKNCEYKQHQCF CG L  S+    A+VF C +ATCGHFYHP CV++ L  +
Sbjct: 316 VEAMKLFLCKNCEYKQHQCFICGVLEPSNG-AAAKVFLCNNATCGHFYHPKCVAQQLHPN 374

Query: 332 DEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKI 391
           +   A +L K I  G SFTCP+H C  CK  E++    LQFAVCRRCPK+YHRKCLPR+I
Sbjct: 375 NIDEASELEKKITDGFSFTCPIHWCFHCKGLEDRTQGPLQFAVCRRCPKSYHRKCLPREI 434

Query: 392 AFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRP 451
           +F++  E+G+I RAWE    +  ILIYCL HEID E+ TP+RDHI FP IE+    +   
Sbjct: 435 SFDEIEEKGVIIRAWEL---SKMILIYCLNHEIDSELETPVRDHIKFPRIEKPLQFV--- 488

Query: 452 RKKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGS 511
            KK +   P  K++   T S    + PP+                  +T+++  ++ +  
Sbjct: 489 -KKGANLLPKKKKR---TYSESILEQPPK------------------DTVKMKSKVRMQE 526

Query: 512 DSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGK 571
               +    +VS KSF  NV      VD       K     +  Y A       +  F  
Sbjct: 527 SEQNKHDTREVSVKSFTQNV------VDTPVKKKGKLKENKQLPYVA-------KDHFVS 573

Query: 572 QDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPS 631
              S +E  + +   PL     S  P +D + ++R+ S++ +  SS+ ++++ +   +PS
Sbjct: 574 SPKSVKEQEQELVPLPLSGIRKSSFPKVDSEIEKRVISILGNEISSLTIKDVTRNCSLPS 633

Query: 632 THAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKN 691
           TH Y+ +   DK + LGK+E SV+A+  ALKKL  +G S+ DAKAVCEP+VL Q+ +W  
Sbjct: 634 THVYSGRPT-DK-MPLGKIERSVQAVEAALKKLK-NGGSVNDAKAVCEPDVLRQLARWHT 690

Query: 692 KLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDF 739
           KL+VY++PF+HG RY+SFGRHFTKV+KL  IVD+LHWYV  GDM+  F
Sbjct: 691 KLRVYISPFIHGTRYSSFGRHFTKVEKLVEIVDRLHWYVESGDMVYFF 738


>gi|224061083|ref|XP_002300346.1| predicted protein [Populus trichocarpa]
 gi|222847604|gb|EEE85151.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  474 bits (1220), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 236/428 (55%), Positives = 320/428 (74%), Gaps = 6/428 (1%)

Query: 292 ACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTC 351
           AC  LG +++E   EVF C +ATCGHFYHPHC + +L R+D+VAA++L K I AG+SF C
Sbjct: 57  ACESLGFTNREV--EVFRCANATCGHFYHPHCAATMLHREDKVAAEELRKKIAAGDSFAC 114

Query: 352 PLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLP 411
           P+HKCCICKQ E+K  SDLQFAVCRRCP +YH+KCLP++IAFE++ +E  I RAW+ LLP
Sbjct: 115 PIHKCCICKQVEDKKKSDLQFAVCRRCPTSYHQKCLPKEIAFENEADEDTIARAWQNLLP 174

Query: 412 NHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKS 471
           N RILIYCLKH+I ++IGTP+RDHI FP +   K    + +K+++   P+ +++  S K 
Sbjct: 175 N-RILIYCLKHDIIEDIGTPVRDHIRFPDV-GGKNTAAKVQKRKTSELPANEEESLSKKK 232

Query: 472 SLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNV 531
            LTS+    G F  KA K + S A   +    SE++   S+S  + +  + SRKS + N 
Sbjct: 233 RLTSEESFSGTFRTKASKEMFSSAKIVKITNDSEQISSESNSLGKMRMNNPSRKSIRENT 292

Query: 532 KSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKK 591
           KS S +V+RS++ +  KTSLG++LY  F+T  + + K  KQD    E  +++ VK + KK
Sbjct: 293 KSASSEVERSTAANVNKTSLGDKLY-GFMTIKSGKAKLRKQDIFGSELDKSLAVKSVGKK 351

Query: 592 LISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVE 651
           L SELPSLD D++RRL +L+K+AASS+ ++ ++K+H++PSTH ++SK+ VDK ITLGKVE
Sbjct: 352 LTSELPSLDADTQRRLLALVKEAASSITLDNVIKKHEVPSTHVHSSKNVVDKNITLGKVE 411

Query: 652 GSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGR 711
           G+VEA+RTALKKL+ +  SIEDAKAVCEP+VL+Q+FKWKNKLKVYLAPFL+GMRYTSFGR
Sbjct: 412 GTVEAVRTALKKLE-EKCSIEDAKAVCEPDVLNQVFKWKNKLKVYLAPFLYGMRYTSFGR 470

Query: 712 HFTKVDKL 719
           HFTKV+KL
Sbjct: 471 HFTKVEKL 478


>gi|357494761|ref|XP_003617669.1| hypothetical protein MTR_5g094150 [Medicago truncatula]
 gi|355519004|gb|AET00628.1| hypothetical protein MTR_5g094150 [Medicago truncatula]
          Length = 1169

 Score =  451 bits (1159), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 231/485 (47%), Positives = 306/485 (63%), Gaps = 35/485 (7%)

Query: 4   SDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNE------NERTGGCKELIYLR 57
           ++ E+E     V++Y+ E        E + FS LP +W        ++   G K  I+L 
Sbjct: 12  TESEIEFQSLYVTNYHLEDN----DGEPVCFSVLPFKWGASKVSKVDDTEKGLK--IFLH 65

Query: 58  GAADSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHC 117
           G+AD+GLQ IF  V AWKFDLT + PE+ +LSK+  WIKLQKPRK +++  +  LI ++ 
Sbjct: 66  GSADNGLQTIFMEVTAWKFDLTGLKPEVSLLSKDKRWIKLQKPRKSFQKTVKNALITLYF 125

Query: 118 LSYAKRNPEATAKSIWDFL--SRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFL 175
           L  AK+NP  + +S W  +   +    Y  +PSQ DL+DHM LI EA  RD  LA SK L
Sbjct: 126 LRCAKKNPRLSVESFWCNVCKDKDLSSYGFKPSQKDLLDHMSLIGEATTRDVGLANSKLL 185

Query: 176 VTFLEEKPTKRKLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDN 235
           ++ L +K   +K SDE             D   D++ +++ED S  ED+LFDS C+ CDN
Sbjct: 186 LSVLGDKSGDQKPSDEEF-----------DTNNDIIEESDED-SKVEDDLFDSFCALCDN 233

Query: 236 GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGK 295
           GG+L+ C+G C+RSFHAT   G ES C SLG TK EV+ +  F+CKNCEY QHQCFACG+
Sbjct: 234 GGNLIMCDGACMRSFHATKVDGRESLCDSLGFTKKEVDDIETFYCKNCEYHQHQCFACGE 293

Query: 296 LGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHK 355
           LGSSDK+  AEVF C S TC  FYHPHC++ L+       ++ L K+   G++F+CP+H 
Sbjct: 294 LGSSDKDKDAEVFKCASETCDRFYHPHCIAPLVKH----VSEDLEKNNADGQTFSCPIHF 349

Query: 356 CCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRI 415
           CC+C+  ENK D +LQFAVC RCPK+YHR+CLP +IAF+D   + + TRAWEGLLPN+RI
Sbjct: 350 CCVCEGLENKMDPELQFAVCNRCPKSYHRRCLPTEIAFDDI--DDVATRAWEGLLPNNRI 407

Query: 416 LIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSG---KQKVASTKSS 472
           LIYCL HEI DE+GTP+RDHI FP    +   I+ P KK    +  G   K  V   KSS
Sbjct: 408 LIYCLNHEISDELGTPVRDHIKFPDTRASVRDINTPDKKMKPTTTEGVKFKSNVDFVKSS 467

Query: 473 LTSKA 477
             S A
Sbjct: 468 DKSTA 472



 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 77/289 (26%), Positives = 130/289 (44%), Gaps = 73/289 (25%)

Query: 423  EIDDEIGTPIRDHIIFPGIE--------ENKTIIDRPRKKQSLASPSGKQKVASTKSSLT 474
            E  D++ TP+RD IIFP            NK ++    ++  L +  G  K +   ++  
Sbjct: 731  ETSDKLETPVRDDIIFPDTRATVLEINTSNKRMMPTTTERVELKNNVGYVKSSGKSTAHG 790

Query: 475  SKAP---PQGKFSVKALKRVPSKAGQGETME------------------ISERLLVGSDS 513
            SK P   P    S K+ +++ S +    T E                  +SE+ +  SD 
Sbjct: 791  SKVPGKLPSKTGSKKSTEKIVSGSDVSRTPESKEMSSKCVTEYKSSSKNLSEKAISCSDI 850

Query: 514  SRRAKATDVS--RKSFKGNVKSLSVQVDRSSSVDSKKTS--------------------- 550
            S   ++  +S   ++ K N K LS +V   S + SK  S                     
Sbjct: 851  SWNPESNKISLTEENKKSNSK-LSGKVISGSHISSKPKSNERSETGLTKNKKPMPKCEVK 909

Query: 551  ---LGERLYAAFVTEGTEQTKFGKQ-DNSDRETSRTVTVKPLRKKLISELPSLDEDSKRR 606
                G++L A+ + +G+EQ K+  Q D++D   ++T+++ PLR    S LP  D DS++ 
Sbjct: 910  KHPTGQKLCAS-MNKGSEQIKYDNQVDDAD---NQTLSIMPLR----SLLPPPDADSEKG 961

Query: 607  LSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVE 655
            L    K+A SS+ +E ++K+HK+            DKAIT+ K++ SV+
Sbjct: 962  LVDSFKEARSSILLEGVIKKHKLGDVG--------DKAITMRKLDDSVD 1002


>gi|414588821|tpg|DAA39392.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 731

 Score =  437 bits (1125), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 268/715 (37%), Positives = 393/715 (54%), Gaps = 90/715 (12%)

Query: 15  VSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAADSGLQKIFKPVIAW 74
           V  YYF         E + FS LP  + + +    CK+ + LRG       K++  V+AW
Sbjct: 18  VESYYFV--DSTNDNEPVCFSTLPFLFGDTDDLPDCKKRLVLRGTGGDPGVKVYSEVVAW 75

Query: 75  KFDLTNVIPEIFVLSKE-NSWIKLQKPRKCYEEIYRTILIMVHCLSYAKRNPEATAKSIW 133
              L    P+  VL+ E   W++L +P   YEE+ RT+LI  H L +  R P    +++W
Sbjct: 76  SLGLEGKQPQFAVLTAEGRRWVRLVRPLNSYEEMIRTVLITAHMLHFLMRKPHEPERTLW 135

Query: 134 DFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKS-----------------KFLV 176
           + L + F  ++VRPS++D  +H+ L+K+ + +D +LAKS                 +FL+
Sbjct: 136 NHLCKVFNKFDVRPSEDDFRNHISLMKQFVAKDPILAKSEGSLPHEWGLTKGKTKAQFLL 195

Query: 177 TFLEEKPTKRKLSDEVVQT------KAMSG----------------FIVDD--MEEDMVH 212
           +++  +    K+ + V++T      +  SG                FI DD  ++E +V 
Sbjct: 196 SYVMPRAVLCKVVNIVLKTILRVFVEGKSGKIFTEAGADHIDVKQPFITDDEDIDEMIVE 255

Query: 213 DTEEDESNEED-ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDE 271
           D + +   EED +LFDS+C+ CD+GGD+LCC+G C+RSFHA   +GE+S+C +LG T+ E
Sbjct: 256 DADNESDEEEDGDLFDSICAICDDGGDILCCDGPCMRSFHAKEGSGEDSYCDTLGYTEAE 315

Query: 272 VEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRD 331
           VEAM  F CKNCEYKQHQCF CG L  S+    A+VF C +ATCGHFYHP CV++ L  +
Sbjct: 316 VEAMKLFLCKNCEYKQHQCFICGVLEPSNG-AAAKVFLCNNATCGHFYHPKCVAQQLHPN 374

Query: 332 DEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKI 391
           +   A +L K I  G SFTCP+H C  CK  E++    LQFAVCRRCPK+YHRKCLPR+I
Sbjct: 375 NIDEASELEKKITDGFSFTCPIHWCFHCKGLEDRTQGPLQFAVCRRCPKSYHRKCLPREI 434

Query: 392 AFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRP 451
           +F++  E+G+I RAWE    +  ILIYCL HEID E+ TP+RDHI FP IE+    +   
Sbjct: 435 SFDEIEEKGVIIRAWEL---SKMILIYCLNHEIDSELETPVRDHIKFPRIEKPLQFV--- 488

Query: 452 RKKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGS 511
            KK +   P  K++   T S    + PP+    +K+  R          M+ SE+     
Sbjct: 489 -KKGANLLPKKKKR---TYSESILEQPPKDTVKMKSKVR----------MQESEQ----- 529

Query: 512 DSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGK 571
               +    +VS KSF  NV      VD       K     +  Y A       +  F  
Sbjct: 530 ---NKHDTREVSVKSFTQNV------VDTPVKKKGKLKENKQLPYVA-------KDHFVS 573

Query: 572 QDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPS 631
              S +E  + +   PL     S  P +D + ++R+ S++ +  SS+ ++++ +   +PS
Sbjct: 574 SPKSVKEQEQELVPLPLSGIRKSSFPKVDSEIEKRVISILGNEISSLTIKDVTRNCSLPS 633

Query: 632 THAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQI 686
           TH Y+ +   DK + LGK+E SV+A+  ALKKL  +G S+ DAKAVCEP+VL Q+
Sbjct: 634 THVYSGRPT-DK-MPLGKIERSVQAVEAALKKLK-NGGSVNDAKAVCEPDVLRQL 685


>gi|115475968|ref|NP_001061580.1| Os08g0337300 [Oryza sativa Japonica Group]
 gi|38424060|dbj|BAD01750.1| unknown protein [Oryza sativa Japonica Group]
 gi|38637134|dbj|BAD03388.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623549|dbj|BAF23494.1| Os08g0337300 [Oryza sativa Japonica Group]
          Length = 585

 Score =  425 bits (1092), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 235/518 (45%), Positives = 315/518 (60%), Gaps = 42/518 (8%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
           M SSDD++E   K V +YYF  + +      +SF  LP Q++  E     K+ +YLRG  
Sbjct: 1   MMSSDDDLEPQLKAVENYYFVDDNDVP----VSFDVLPFQFDAAEGVASFKKDVYLRGFT 56

Query: 61  DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSY 120
           D GLQK++K V+AWK  L    PEI VLS E SWI L KPR  YEE  R++LI V  L +
Sbjct: 57  DGGLQKVYKQVVAWKLVLDGDSPEIAVLSTEGSWIALLKPRPSYEETIRSVLITVEMLHF 116

Query: 121 AKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLE 180
            +R P  + K +WD L   F  + VRP ++D  +H +LIK   +RD  LA S+ L  F++
Sbjct: 117 VRRRPTDSEKDMWDHLYGVFERFVVRPLEDDFANHQNLIKLFAQRDPDLANSQVLQVFIK 176

Query: 181 EKPTKR-----------------KLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEED 223
           +K  ++                 K   ++ Q    +G   D+MEE +     +  SN++D
Sbjct: 177 DKIMEKTNEVGSNNLDNKREPDIKQEPDIKQEPVAAG---DEMEEIVEEGIPDAPSNDDD 233

Query: 224 E---------LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEA 274
           +         LFDSVC+ CDNGG+LLCCEG C+RSFHA I  GE+S+CA+LG TK EV+A
Sbjct: 234 DDEEDEEDGDLFDSVCAICDNGGELLCCEGPCMRSFHAKIRDGEDSYCATLGYTKAEVKA 293

Query: 275 MLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEV 334
           + NF CKNC++KQHQCF CG+L  SD    A+VF C +ATCGHFYHP CV++LL  +   
Sbjct: 294 LKNFVCKNCDHKQHQCFVCGELEPSDG-PNAKVFLCNNATCGHFYHPRCVAQLLHPNSRN 352

Query: 335 AADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFE 394
            A ++ K I+AG SFTCP+H C  CK  E++    LQFAVCRRCP++YHRKCLPR+I+FE
Sbjct: 353 EASEMEKKIMAGFSFTCPVHWCFHCKGLEDRTQEPLQFAVCRRCPRSYHRKCLPREISFE 412

Query: 395 DKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEEN-----KTIID 449
           D   +GIITRAWE    + RILIYCL HEID +IGTP RDHI FP +E++     K + +
Sbjct: 413 DINTQGIITRAWEL---SKRILIYCLDHEIDLDIGTPPRDHIKFPHVEKSAYSAKKKVKE 469

Query: 450 RPRKKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKA 487
              KK+ +   S   +    ++ L  K   +G  S KA
Sbjct: 470 LAEKKRRICDDSYVSEPLQKRAKLNEKFNAKGDKSKKA 507


>gi|115477142|ref|NP_001062167.1| Os08g0502000 [Oryza sativa Japonica Group]
 gi|42407765|dbj|BAD08911.1| unknown protein [Oryza sativa Japonica Group]
 gi|113624136|dbj|BAF24081.1| Os08g0502000 [Oryza sativa Japonica Group]
 gi|215740510|dbj|BAG97166.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 694

 Score =  411 bits (1057), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 191/314 (60%), Positives = 243/314 (77%), Gaps = 3/314 (0%)

Query: 593 ISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEG 652
           +S  P +D+ ++RR+    +  +S + +E+I K+  +PSTH   SK+  DK ITLG V+ 
Sbjct: 71  LSSFPVIDKSTERRIHEFAQKVSSDITIEDIQKKLVVPSTHTPVSKN-TDK-ITLGMVQR 128

Query: 653 SVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRH 712
           SVEAI  AL  L+ +G+SIED K+VC P  L Q+ +WKNKL +YLAPFLHGMRYTS+GRH
Sbjct: 129 SVEAINAALHMLE-NGASIEDVKSVCAPSDLFQLARWKNKLNIYLAPFLHGMRYTSYGRH 187

Query: 713 FTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAK 772
           FTK+DKL+ IVD+L WY+  GD +VDFCCG+NDFS L+K+KL+ + K+C YKN+D++  K
Sbjct: 188 FTKLDKLEQIVDRLQWYIESGDTVVDFCCGSNDFSLLLKEKLEASEKSCFYKNFDLIQPK 247

Query: 773 NDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPP 832
           NDFNFE+RDWMTV+P EL  G RLIMGLNPPFG KA LAN+FINKAL F PKL+ILIVP 
Sbjct: 248 NDFNFERRDWMTVQPDELPTGCRLIMGLNPPFGFKASLANQFINKALTFKPKLIILIVPK 307

Query: 833 ETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAA 892
           ETERLDRK   YEL+WED   L+GKSFYLPGS+D ++K M+QWNM+ PPL LWSR D+A 
Sbjct: 308 ETERLDRKYPPYELIWEDSHQLAGKSFYLPGSLDADNKIMEQWNMSPPPLSLWSRSDWAR 367

Query: 893 HHKALAEKHGHISR 906
            HK +A+  GHIS+
Sbjct: 368 KHKEIAKTMGHISK 381


>gi|356503198|ref|XP_003520398.1| PREDICTED: uncharacterized protein LOC100810533 [Glycine max]
          Length = 973

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/353 (55%), Positives = 251/353 (71%), Gaps = 26/353 (7%)

Query: 576 DRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHK-IPSTHA 634
           D    +T   KPL KK+ + L + + + +  + SLMKD  S+   EE  K H+   +T  
Sbjct: 559 DGSIGKTGIEKPL-KKIQTSLDANNAEMENSILSLMKDTMSTFNEEEFKKNHQAFSTTSG 617

Query: 635 YASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWK---- 690
           +    +  K +T GKVEGSV+AI+TAL++L+ +G SIE+AKA+C+P VL Q+F W+    
Sbjct: 618 FTEPVSHHKNLTQGKVEGSVKAIQTALQRLE-EGGSIEEAKAICDPGVLHQLFIWQFSDQ 676

Query: 691 -------------------NKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVN 731
                               +LKVYL+PFLHGMRYTSFGRHFTK+DKL+ + ++LHWYV 
Sbjct: 677 SGHSKNQLISEPVRTGLKQKQLKVYLSPFLHGMRYTSFGRHFTKIDKLKEVANRLHWYVQ 736

Query: 732 DGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELA 791
           +GD ++DFCCG+NDFS LMK KL+E GK+C +KNYD+   KNDFNFEKRDWM+V  +EL 
Sbjct: 737 NGDTVLDFCCGSNDFSRLMKSKLEEMGKSCSFKNYDLFQPKNDFNFEKRDWMSVNAEELP 796

Query: 792 PGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDD 851
            GS+LI+GLNPPFGVK  LANKFINKAL F PKLLILIVP  T+RLDRK+  Y+L+WEDD
Sbjct: 797 NGSQLIIGLNPPFGVKGSLANKFINKALTFKPKLLILIVPKVTKRLDRKKEGYDLIWEDD 856

Query: 852 QFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHI 904
           + LSGKSFYLPGSVD  DKQ++ WN+  PPLYLWSR D+   H+ +A+KH HI
Sbjct: 857 EMLSGKSFYLPGSVDTRDKQLEDWNLKPPPLYLWSRPDWTTKHREIAQKHCHI 909



 Score =  310 bits (794), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 165/327 (50%), Positives = 221/327 (67%), Gaps = 25/327 (7%)

Query: 144 EVRPSQND--LVDHMDLIKEALERDDVL--AKSKFLVTFLEEKPTKRKLSDEVVQTKAMS 199
           +V P++    +VD  D   E  ++ DVL   KSKF+  F +EK  +     +V  TK  S
Sbjct: 256 DVHPTKKSKFIVDSED---EENDQSDVLPTKKSKFIGDFEDEKNGQS----DVFPTKE-S 307

Query: 200 GFIVD-DMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGE 258
            FIVD + EE+   D E D   E++  +D+VC+ CDNGG++L CEGRCLRSFHAT +AG 
Sbjct: 308 KFIVDSEDEENDQSDGELDPDGEQNIGYDTVCAICDNGGEILPCEGRCLRSFHATKEAGI 367

Query: 259 ESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHF 318
           ++ C SLG T  +V+A  NF+C+NC+YK HQCFACGKLGSSD  + AEVFPCV+A CGH+
Sbjct: 368 DAVCESLGYTSAQVKAFPNFYCQNCKYKLHQCFACGKLGSSDVSSKAEVFPCVTANCGHY 427

Query: 319 YHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRC 378
           YHP CV++LL    +   +++ K +  G++F CPLH C +CK+GENK   DLQFA+CRRC
Sbjct: 428 YHPECVARLLSPSIDTEQEEMRKKVAMGKAFVCPLHICSLCKKGENKNFHDLQFAICRRC 487

Query: 379 PKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIF 438
           PKAYHRKCLPR+I+F    ++ I  RAW+GLL +H+ILIYCL H I  E+GTP RDH++F
Sbjct: 488 PKAYHRKCLPREISFVFDNDKSIEQRAWDGLL-DHQILIYCLDHVIVRELGTPARDHLVF 546

Query: 439 PGIEE-----------NKTIIDRPRKK 454
           P ++             KT I++P KK
Sbjct: 547 PDVKVRKKKKMVDGSIGKTGIEKPLKK 573



 Score =  157 bits (397), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 93/212 (43%), Positives = 134/212 (63%), Gaps = 8/212 (3%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
           MASSD+E E+   LV  Y+FE+E E    E +S S+L + W+ +E   G K  ++LRG  
Sbjct: 1   MASSDEEGEIIPDLVDSYWFENEKE----EFVSLSSLTLLWSISETVCGLKAQVFLRGTT 56

Query: 61  DS-GLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLS 119
           D  GLQ I K +I W+F+L +  PEI VLSK+ SWI LQ+PRKC+E   + IL+ V+CL 
Sbjct: 57  DDDGLQTIHKQIIGWRFELPSGKPEISVLSKDKSWITLQRPRKCFESTIKAILVTVYCLH 116

Query: 120 YAKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFL 179
           + K NPE +   IW  + + F  +++ PS+NDL+ ++ LI+EA+ERD  L KSK+L+ F+
Sbjct: 117 FVKWNPEESQICIWVKMLKAFSSFDIVPSENDLLSNVTLIREAVERDKDLTKSKYLLDFM 176

Query: 180 EEKPTKRKL--SDEVVQTKAMSGFIVDDMEED 209
           E   +      S  V+ TK  S FIVD  +E+
Sbjct: 177 ERACSNEGFHESSNVLPTKK-SKFIVDSEDEE 207


>gi|218188313|gb|EEC70740.1| hypothetical protein OsI_02147 [Oryza sativa Indica Group]
          Length = 1296

 Score =  398 bits (1022), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 196/369 (53%), Positives = 263/369 (71%), Gaps = 21/369 (5%)

Query: 594 SELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGS 653
           S  P +D ++++R+ +L++  AS + +++I ++  +PSTHAY+ +  VD+ I  GK+E S
Sbjct: 486 SSFPRVDSETEKRVMALVEREASYLTLKDISRKCLMPSTHAYSGRQ-VDRIIATGKLERS 544

Query: 654 VEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHF 713
           V+A+  ALK L+ +G ++ DAKAVCEPEVL Q+ +W +KL+VY++PF++G RY+SFGRHF
Sbjct: 545 VQAVGAALKMLE-NGGNVNDAKAVCEPEVLKQLTRWHSKLRVYISPFIYGTRYSSFGRHF 603

Query: 714 TKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKN 773
           TKV+KL  IVDKLH YV  GD IVDFCCGANDFS LMK+KLD+  K C +KNYD++  +N
Sbjct: 604 TKVEKLVEIVDKLHCYVEPGDTIVDFCCGANDFSRLMKEKLDKVQKKCHFKNYDLIQPQN 663

Query: 774 DFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPE 833
            F FE++DWMTV PKEL  GS+LIMGLNPPFGVKA LANKFI+KAL F PKL+ILIVP E
Sbjct: 664 CFAFERKDWMTVRPKELPHGSKLIMGLNPPFGVKASLANKFIDKALTFKPKLVILIVPKE 723

Query: 834 TERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAH 893
           T+RLD+K++ Y+L+WED   LSGK+FYLPGSVD NDK ++ WN +APPLYLWS  D+   
Sbjct: 724 TKRLDQKKTPYDLIWEDSDCLSGKAFYLPGSVDVNDKVVEGWNASAPPLYLWSHPDWTRK 783

Query: 894 HKALAEKHGHISRPQSRTQMERN-------------CYETHAVDHPKEEGQGDASM---- 936
           H+ +AE+H H S  +  + +E +              Y  H     KEE  G  S     
Sbjct: 784 HRKVAEEHNHTSIGRIASHVEEDNLSDNLPMMKETESYGIHNSRSGKEENAGQTSCNLKG 843

Query: 937 --LIDLPLQ 943
             L DLP++
Sbjct: 844 TNLSDLPVR 852



 Score =  266 bits (679), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 148/319 (46%), Positives = 193/319 (60%), Gaps = 13/319 (4%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
           M   DD+ E     V +YY     E      +  S LP Q+ + +    CK+ ++L G A
Sbjct: 9   MMFDDDDDEPQLNAVDNYYLLDAREVP----VCLSVLPFQFKDTDEVPECKKDVFLWGTA 64

Query: 61  DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSY 120
           D G+ K+++ VIAWK  L    PEI VLS E SWI L KP+  YEE  RTILI V  L +
Sbjct: 65  DPGI-KVYRKVIAWKLGLQGKQPEISVLSAEGSWISLTKPKNSYEEKIRTILITVQMLHF 123

Query: 121 AKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLE 180
            K+ PE   K++W  L + F  +EVRPS++DL +H  LIK   E+D  LAKS+ L  F +
Sbjct: 124 LKKKPEEPEKNLWSHLRKIFDKFEVRPSEDDLRNHRSLIKHFAEKDSTLAKSEILQGFTQ 183

Query: 181 EKPTKRKLSDEVVQTKA--MSGFIVDDMEEDMVHDTEED---ESNEEDELFDSVCSFCDN 235
           E  T RK   EV   K      FI DD + + + D + +   +  EE++LFDS+CS CDN
Sbjct: 184 E--TSRKKFSEVGSDKVEIKVPFIADDEDIEEMADVDNNIESDEEEEEDLFDSICSICDN 241

Query: 236 GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGK 295
           GGDLLCC+G C+RSFHA I  GE+S+C +LG T+ EV+AM  F CKNCE+KQHQCF CG 
Sbjct: 242 GGDLLCCDGPCMRSFHAKIGTGEDSYCDTLGYTEAEVQAMKTFLCKNCEHKQHQCFICGA 301

Query: 296 LGSSDKETGAEVFPCVSAT 314
           L  SD  T A+VF C +AT
Sbjct: 302 LEPSDGPT-AKVFLCNNAT 319


>gi|222618537|gb|EEE54669.1| hypothetical protein OsJ_01967 [Oryza sativa Japonica Group]
          Length = 1296

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 195/369 (52%), Positives = 263/369 (71%), Gaps = 21/369 (5%)

Query: 594 SELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGS 653
           S  P +D ++++R+ +L++  AS + +++I ++  +PSTHAY+ +  VD+ I  GK+E S
Sbjct: 486 SSFPRVDSETEKRVMALVEREASYLTLKDISRKCLMPSTHAYSGRQ-VDRIIATGKLERS 544

Query: 654 VEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHF 713
           V+A+  ALK L+ +G ++ DAKAVCEPEVL Q+ +W +KL+VY++PF++G RY+SFGRHF
Sbjct: 545 VQAVGAALKMLE-NGGNVNDAKAVCEPEVLKQLTRWHSKLRVYISPFIYGTRYSSFGRHF 603

Query: 714 TKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKN 773
           TKV+KL  IVDKLH YV  GD IVDFCCGANDFS LMK+KLD+  K C +KNYD++  +N
Sbjct: 604 TKVEKLVEIVDKLHCYVEPGDTIVDFCCGANDFSRLMKEKLDKVQKKCHFKNYDLIQPQN 663

Query: 774 DFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPE 833
            F FE++DWMTV PKEL  GS+LIMGLNPPFGVKA LANKFI+KAL F PKL+ILIVP E
Sbjct: 664 CFAFERKDWMTVRPKELPHGSKLIMGLNPPFGVKASLANKFIDKALTFKPKLVILIVPKE 723

Query: 834 TERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAH 893
           T+RLD+K++ Y+L+WED   LSGK+FYLPGSVD NDK ++ WN +APPL+LWS  D+   
Sbjct: 724 TKRLDQKKTPYDLIWEDSDCLSGKAFYLPGSVDVNDKVVEGWNASAPPLHLWSHPDWTRK 783

Query: 894 HKALAEKHGHISRPQSRTQMERN-------------CYETHAVDHPKEEGQGDASM---- 936
           H+ +AE+H H S  +  + +E +              Y  H     KEE  G  S     
Sbjct: 784 HRKVAEEHNHTSIGRIASHVEEDNLSDNLPMMKETESYGIHNSRSGKEENAGQTSCNLKG 843

Query: 937 --LIDLPLQ 943
             L DLP++
Sbjct: 844 TNLSDLPVR 852



 Score =  263 bits (672), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 147/319 (46%), Positives = 192/319 (60%), Gaps = 13/319 (4%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
           M   DD+ E     V +YY     E      +  S LP Q+ + +    CK+ ++L G A
Sbjct: 9   MMFDDDDDEPQLNAVDNYYLLDAREVP----VCLSVLPFQFKDTDEVPECKKDVFLWGTA 64

Query: 61  DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSY 120
           D G+ K+++ VIAWK  L    PEI VLS E SWI L KP+  YEE  RTILI V  L +
Sbjct: 65  DPGI-KVYRKVIAWKLGLQGKQPEISVLSAEGSWISLTKPKNSYEEKIRTILITVQMLHF 123

Query: 121 AKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLE 180
            K+ PE   K++W  L + F  +EVRPS++DL +H  LIK   E+D  LAKS+ L  F +
Sbjct: 124 LKKKPEEPEKNLWSHLRKIFDKFEVRPSEDDLRNHRLLIKHFAEKDSTLAKSEILQGFTQ 183

Query: 181 EKPTKRKLSDEVVQTKA--MSGFIVDDMEEDMVHDTEED---ESNEEDELFDSVCSFCDN 235
           E  T RK   EV   K      FI DD + + + D + +   +  EE++LFDS+CS CDN
Sbjct: 184 E--TSRKKFSEVGSDKVEIKVPFIADDEDIEEMADVDNNIESDEEEEEDLFDSICSICDN 241

Query: 236 GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGK 295
           GGDLLCC+G C+RSFHA I  GE+S+C +LG T+ EV+AM  F CKNCE+KQHQCF CG 
Sbjct: 242 GGDLLCCDGPCMRSFHAKIGTGEDSYCDTLGYTEAEVQAMKTFLCKNCEHKQHQCFICGA 301

Query: 296 LGSSDKETGAEVFPCVSAT 314
           L  SD    A+VF C +AT
Sbjct: 302 LEPSDG-PAAKVFLCNNAT 319


>gi|413925249|gb|AFW65181.1| hypothetical protein ZEAMMB73_193747 [Zea mays]
          Length = 584

 Score =  394 bits (1011), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 187/327 (57%), Positives = 240/327 (73%), Gaps = 4/327 (1%)

Query: 610 LMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGS 669
             + A++ + M+++ K+  +PST+A   ++A +  ITL  VE  VEA++ AL  L+ +G+
Sbjct: 4   FAQKASAGITMDDVKKKLVVPSTYAPNLQNADN--ITLENVEIYVEAVKGALHMLE-NGA 60

Query: 670 SIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWY 729
           SI DAK+VC P VL Q+ KWKNKL V LAPFLHGMRYTS+GRHFTK+DKLQ IVDKL WY
Sbjct: 61  SIADAKSVCPPNVLFQLVKWKNKLSVVLAPFLHGMRYTSYGRHFTKLDKLQLIVDKLQWY 120

Query: 730 VNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKE 789
           +  GD +VDFCCG+NDFS L+++ L+ +GKNC YKNYD++  KNDFNFE+RDWMTV+P E
Sbjct: 121 IQSGDTVVDFCCGSNDFSLLLREILESSGKNCFYKNYDLIQPKNDFNFERRDWMTVQPDE 180

Query: 790 LAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWE 849
           L  G RL+MGLNPPFG KA LAN+FINKAL F PKL+ILIVP ETERLD+K   YEL+W+
Sbjct: 181 LPTGCRLVMGLNPPFGTKASLANQFINKALTFKPKLIILIVPKETERLDKKYPPYELIWQ 240

Query: 850 DDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQS 909
           D   LSGKSFYLPGS+D ++K M+QWN++ PPL LWSR D+A  H  +A+  GHIS    
Sbjct: 241 DSDQLSGKSFYLPGSLDADNKVMEQWNVSPPPLSLWSRSDWAERHSEIAKSMGHISNEIG 300

Query: 910 RTQMERNCYETHAVDHPKEEGQGDASM 936
             Q +      H  +H + +  G A M
Sbjct: 301 DLQRDLAA-SIHTTEHVEADDAGVAGM 326


>gi|357153232|ref|XP_003576383.1| PREDICTED: uncharacterized protein LOC100846531 [Brachypodium
           distachyon]
          Length = 683

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 208/444 (46%), Positives = 283/444 (63%), Gaps = 13/444 (2%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENE-RTGGCKELIYLRGA 59
           M+  DD+ E    +V DYYF        K  + FS LPI + E+      CK  ++LRG 
Sbjct: 1   MSDDDDDSEPQVVVVKDYYFVD----ADKNALCFSVLPIWFKEDAVAVPECKTGVFLRGT 56

Query: 60  ADSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLS 119
            D G+  ++K V+AWK  L    P++ VLSKE  WI L KP+  YEE +RTI I V  L 
Sbjct: 57  VDPGI-PVYKQVVAWKLGLDARQPDLAVLSKEGGWINLSKPKNSYEESFRTIFITVQMLH 115

Query: 120 YAKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFL 179
           + +R PE   K +W  L + F  ++VRPS++D  +H  L+K+  E+D  LA S+ L  F+
Sbjct: 116 FLRRKPEEPEKDLWIHLRKVFDKFDVRPSKDDFRNHHTLMKQFAEKDLRLANSEILKVFI 175

Query: 180 EEKPTKRKLSDEVVQTKAMSGFIV--DDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGG 237
            E+  K+    +    +    FI   +D+E+ +  D  E + + +D+LFDS C+ CDNGG
Sbjct: 176 GERFRKQISEVDSGNFEVKESFIAADEDVEDIVADDNVESDEDGDDDLFDSTCAICDNGG 235

Query: 238 DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLG 297
           DLL C+G C+RSFHA I  GE+S+C +LG T+ EVEAM  F CKNCEYKQHQCF CG L 
Sbjct: 236 DLLGCDGPCMRSFHAKIGTGEDSYCQTLGFTEAEVEAMKTFLCKNCEYKQHQCFICGVLE 295

Query: 298 SSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCC 357
            SD  T A+VF C +ATCG+FYHP CV++ L  ++++ A +  K I  G SFTC +H C 
Sbjct: 296 PSDGPT-AKVFLCNNATCGYFYHPKCVAQQLHPNNKIEALEKEKKIAGGSSFTCSIHWCF 354

Query: 358 ICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILI 417
            CK  E++ +  LQFAVCRRCPK+YHRKCLP +I FED  +E I+TRAW+    + RILI
Sbjct: 355 CCKGLEDRTEEHLQFAVCRRCPKSYHRKCLPSEIPFEDS-DEDIVTRAWDL---SQRILI 410

Query: 418 YCLKHEIDDEIGTPIRDHIIFPGI 441
           YC++HEID +I TP+R+HI FPG+
Sbjct: 411 YCMEHEIDLDIETPVRNHIKFPGL 434



 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 51/81 (62%), Gaps = 3/81 (3%)

Query: 609 SLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDG 668
           +L++   SS+ + +I ++  +PSTH Y+ +   DK I  GK+E SV+A+R AL  L V  
Sbjct: 601 ALVEKEVSSLTLNDISRKCLMPSTHVYSGRQT-DKIIATGKLERSVQAVRQALHLLAV-- 657

Query: 669 SSIEDAKAVCEPEVLSQIFKW 689
             +  AKA CEP+VL Q+ +W
Sbjct: 658 GDVNTAKATCEPQVLKQLARW 678


>gi|449453220|ref|XP_004144356.1| PREDICTED: uncharacterized protein LOC101205205 [Cucumis sativus]
          Length = 707

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 207/425 (48%), Positives = 278/425 (65%), Gaps = 34/425 (8%)

Query: 305 AEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGEN 364
           ++VF CV+ATCG+FYHP C+S+LL  +++VAA  L K I +GESF+CP+HKC +C  GEN
Sbjct: 8   SQVFQCVNATCGYFYHPKCISRLLHPENKVAAGDLEKKIASGESFSCPVHKCSVCALGEN 67

Query: 365 KADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEI 424
           K   +LQFAVCRRCPK+YHRKCLPRKI FE   +    TRAWE LLPN RILIYCL HEI
Sbjct: 68  KKIWELQFAVCRRCPKSYHRKCLPRKITFEGSEDGETPTRAWEKLLPN-RILIYCLDHEI 126

Query: 425 DDEIGTPIRDHIIFPGIEENKTIIDR-------PRKKQSLASPSGKQKVASTKSSLTSKA 477
           D+EI TP RDHI FPG+EE++  I +        R+ +++     ++ V S K S+    
Sbjct: 127 DEEIETPARDHIKFPGLEESRLPIQKRKLPISDTRRGKTIVFRGSRENVVSKKGSMPDDL 186

Query: 478 PPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQ 537
             QGK + K  K     +  G+ +       + S  S+  K  ++SR S   N K  SV 
Sbjct: 187 --QGKSAAKVSKSFERSSSDGKLLGKVTAKSLWSSESKNVKLGNISRNSL--NQKGESVL 242

Query: 538 VDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELP 597
           +D   ++  KK+SL  +  +A  T+  + +K  K+D S                    + 
Sbjct: 243 MDIDKTIKVKKSSLVGK--SAIPTKRFDPSKIYKEDRSG-------------------ML 281

Query: 598 SLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAI 657
            LD +S+RRL  +MK+ ASS+ +E+++K+HK+PSTHAY+ K  VDK I +GK+EGSV A+
Sbjct: 282 LLDANSERRLMDMMKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSVVAV 341

Query: 658 RTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVD 717
           R AL+KL+ +G  IEDA+AVCEPEVL+ IFKWKNKL+VYLAPFL+GMRY+SFGRHFTKV+
Sbjct: 342 RAALRKLE-EGCCIEDAEAVCEPEVLNHIFKWKNKLRVYLAPFLYGMRYSSFGRHFTKVE 400

Query: 718 KLQAI 722
           KL  I
Sbjct: 401 KLVEI 405


>gi|356524309|ref|XP_003530772.1| PREDICTED: uncharacterized protein LOC100811715 [Glycine max]
          Length = 407

 Score =  381 bits (979), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 202/378 (53%), Positives = 256/378 (67%), Gaps = 12/378 (3%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
           MASSDDE E     VS+YYF  +   K  E + FS LPIQW+E+E   G K  ++L G  
Sbjct: 1   MASSDDEGEAQALSVSNYYFVDD---KDDEPVCFSVLPIQWSESESPVGKKMQLFLHGFV 57

Query: 61  DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMV-HCLS 119
           D+GLQK F  V+AW+FD + V PEI VLSK+  WI L+KPRK YE+      ++  H LS
Sbjct: 58  DNGLQKFFGQVVAWRFDFSYVRPEISVLSKDGRWIILEKPRKSYEDTIIRTTLITTHFLS 117

Query: 120 YAKRNPEATAKSIWDFLSRF--FCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVT 177
           Y K+NP+++AKS+WD LS+   F  YEV P Q DL++HM L+ EA +RD  L KSK L+ 
Sbjct: 118 YVKKNPDSSAKSVWDKLSKNKEFRSYEVMPLQKDLLNHMALMGEAAKRDTALEKSKLLLM 177

Query: 178 FLEEKPTKR--KLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDN 235
            LE+K   +  KLSDE V+  A  GFI+DD++  M+    +++S+ EDELFDSVC+ CDN
Sbjct: 178 VLEDKDMLKIKKLSDEEVKDLARPGFIIDDIDNGMI----DEDSDGEDELFDSVCAICDN 233

Query: 236 GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGK 295
           GG LLCC+G+C+RSFHA  + GEES CASLG ++ EV+ + NF+CKNCEY QHQCFACG 
Sbjct: 234 GGQLLCCDGKCMRSFHANEEDGEESTCASLGFSRKEVDEIQNFYCKNCEYNQHQCFACGT 293

Query: 296 LGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHK 355
           LG SDK +GAEVF C SA CG FYHPHCV+KLL    E    +L + I  G  FTCP H 
Sbjct: 294 LGCSDKFSGAEVFKCASAICGFFYHPHCVAKLLHGVVEDDLKELERKIAEGGPFTCPTHY 353

Query: 356 CCICKQGENKADSDLQFA 373
           CC CK+ E+K   D  FA
Sbjct: 354 CCECKEMEDKKKHDFHFA 371


>gi|449456251|ref|XP_004145863.1| PREDICTED: uncharacterized protein LOC101205827 [Cucumis sativus]
          Length = 464

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 204/423 (48%), Positives = 273/423 (64%), Gaps = 34/423 (8%)

Query: 307 VFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKA 366
           VF CV+ATCG+FYHP C+S+LL  +++VAA  L K I +GESF+CP+HKC +C  GENK 
Sbjct: 56  VFQCVNATCGYFYHPKCISRLLHPENKVAAGDLEKKIASGESFSCPVHKCSVCALGENKK 115

Query: 367 DSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDD 426
             +LQFAVCRRCPK+YHRKCLPRKI FE   +    TRAWE LLPN RILIYCL HEID+
Sbjct: 116 IWELQFAVCRRCPKSYHRKCLPRKITFEGSEDGETPTRAWEKLLPN-RILIYCLDHEIDE 174

Query: 427 EIGTPIRDHIIFPGIEENKTIIDR-------PRKKQSLASPSGKQKVASTKSSLTSKAPP 479
           EI TP RDHI FPG+EE++  I +        R+ +++     ++ V S K S+      
Sbjct: 175 EIETPARDHIKFPGLEESRLPIQKRKLPISDTRRGKTIVFRGSRENVVSKKGSMPDDL-- 232

Query: 480 QGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVD 539
           QGK + K  K     +  G+ +       + S  S   K  ++SR     N K  SV +D
Sbjct: 233 QGKSAAKVSKSFERSSSDGKLLGKVTAKSLWSSESENVKLGNISRNCL--NQKGESVLMD 290

Query: 540 RSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSL 599
              ++  KK+SL  +  +A  T+  + +K  K+D S                    +  L
Sbjct: 291 IDKTIKVKKSSLVGK--SAIPTKRFDPSKIYKEDRSG-------------------MLLL 329

Query: 600 DEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRT 659
           D +S+RRL  +MK+ ASS+ +E+++K+HK+PSTHAY+ K  VDK I +GK+EGSV  +R 
Sbjct: 330 DANSERRLMDMMKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSVVVVRA 389

Query: 660 ALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKL 719
           AL+KL+ +G  IEDA+AVCEPEVL+ IFKWKNKL+VYLAPFL+GMRY+SFGRHFTKV+KL
Sbjct: 390 ALRKLE-EGCCIEDAEAVCEPEVLNHIFKWKNKLRVYLAPFLYGMRYSSFGRHFTKVEKL 448

Query: 720 QAI 722
             +
Sbjct: 449 VEV 451


>gi|168059731|ref|XP_001781854.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666661|gb|EDQ53309.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1275

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 282/894 (31%), Positives = 440/894 (49%), Gaps = 121/894 (13%)

Query: 54   IYLRGAADSGLQKIFKPVIAWKFDLT-----NVIPEIFVLSKENSWIKLQKPRKCYEEIY 108
            +YL G   SG + I + + AWK         N++ +I     + +W+KL KP+  Y+E  
Sbjct: 210  VYLHGK--SGGEYIMRSIRAWKLAFPGNGTFNILVKINTQKGDTNWVKLGKPQFGYKETA 267

Query: 109  RTILIMVHCLSYAKRNPEATAKSI------WDF--------LSRFFCLYEVRPSQNDLVD 154
             ++  +   L+  K  P A+   +      WD         LS+ F + E   + +  + 
Sbjct: 268  ESVFCIAKFLALLKEEPHASGDLVGKRLDEWDVPKATIYGHLSKRFAVVEQYINLDSGLK 327

Query: 155  HMDL-----IKEALERDDVLAKSKFLVTFLEEKPT---KRKLSDE---------VVQTKA 197
              D+     +  A     VLA S F     ++K     KR LSD+         V + K 
Sbjct: 328  DSDVSVDLKVHSADSGVSVLALSTFRSVVFKDKAANSGKRMLSDDSSDEDGNLRVKRHKK 387

Query: 198  MSGFIVDDME-----EDMVHDT------EEDESNEE-----DELFDSVCSFCDNGGDLLC 241
              G  VD+ E     ++++HD+       EDE + E     +EL DS+C  CD+GG+LLC
Sbjct: 388  DFGNTVDNNEVGKPEDEIMHDSGDEVNFVEDEKDVEIDEASEELGDSLCIICDDGGELLC 447

Query: 242  CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML-NFFCKNCEYKQHQCFACGKLGSSD 300
            C+G C+RSFHA  D   + +C +L LTK  +  M+  + CKNC YK+HQC+ CG+LG SD
Sbjct: 448  CDGPCMRSFHAIRDP--QHNCETLKLTKSAIAKMVGQWLCKNCVYKKHQCYVCGELGDSD 505

Query: 301  KETGA--EVFPCVSATCGHFYHPHCVS-KLLLRDDEVAADQLAKSIIAG-ESFTCPLHKC 356
            ++ G   EVF C  A CG FYHP CV+ ++ L +D+     LA +I AG ESF CPLH+C
Sbjct: 506  EDLGVKREVFVCDVACCGKFYHPTCVANEIALTEDDRKL--LATNIQAGVESFACPLHRC 563

Query: 357  CICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAF-EDKLEEGIITRAWEGLLPNHRI 415
              C +GE+  +  L  A CRRCP  +H+KCLP  I F E+  ++    RAW       R 
Sbjct: 564  KKCGKGEDSTEESLLLARCRRCPATWHKKCLPSGIRFAEETDDKHPEVRAWAFGDVYKRF 623

Query: 416  LIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRP-RKKQSLASPSGKQKVASTKSSLT 474
            +IYC  +  ++++  PI  +   P    +K+   R   +K  +    G  K    K    
Sbjct: 624  IIYCTGNNGNNQL--PICHYSPGPNEVVSKSYTLRSIYRKHKIMKDLGTPKRNHVKFPRV 681

Query: 475  SKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSL 534
             + P   +   +   +V ++A    ++E              AKA+     S K      
Sbjct: 682  LERPGDPEVFKQLKSKVAAEANVSPSLE--------------AKASS----SLK------ 717

Query: 535  SVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLIS 594
             + VD+S+    KK   G  L A              ++    E        P R  + +
Sbjct: 718  PILVDKSA----KKKHSGVELLA--------------KEKRPEEIRELPLASPPRLSISA 759

Query: 595  ELPSLDEDS-KRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGS 653
            ++    ED+ KR +  ++ ++A  V  + +  +  +PS +      +  K    G  +  
Sbjct: 760  DMLKKSEDAAKRVVLGVIAESAQKVTDQSVKLKLALPSIYRERRGFSFMKQ---GHKDSI 816

Query: 654  VEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHF 713
            ++A R  L+KL     ++E+AKA+C P  +++I   ++ LK +LAP L+G RY S+GRHF
Sbjct: 817  LKAARIVLEKLRSKKITVEEAKAMCNPSNINRIELEEDYLKTFLAPTLYGDRYVSYGRHF 876

Query: 714  TKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCL-YKNYDILPAK 772
            TK++KL+ +V +LH  + D D IVDF CG N FS L+   L + GK  L +KN+DI P K
Sbjct: 877  TKLEKLRKVVRRLHPLIWDHDTIVDFSCGDNSFSRLLHNALSDAGKTHLKFKNFDIFPPK 936

Query: 773  NDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPP 832
            + F FE++DW   +  E   G  L++GLNPPF      A KF+  AL   P+LL+LI PP
Sbjct: 937  DTFEFERKDWFDTKATEFGSGENLVIGLNPPF----PFAEKFVAHALMMQPRLLVLITPP 992

Query: 833  ETERLDRKESAYELVWEDDQFLSGKSFYLPGSV-DENDKQMDQWNMTAPPLYLW 885
               +++ +   Y  + +D   L   SFY+PGS+ D++ + + Q N   P   ++
Sbjct: 993  LKNKMEHR--GYNCLEDDHLLLDDNSFYVPGSLHDQHMESLQQVNAVVPHFCIY 1044


>gi|240256412|ref|NP_199620.4| EDM2-like protein1 [Arabidopsis thaliana]
 gi|332008234|gb|AED95617.1| EDM2-like protein1 [Arabidopsis thaliana]
          Length = 677

 Score =  351 bits (901), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 178/344 (51%), Positives = 234/344 (68%), Gaps = 15/344 (4%)

Query: 601 EDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTA 660
           ++ ++R+  ++ +  SS   +E +K  +     +Y S +   K IT G V   V A R A
Sbjct: 344 DEVEQRVMGIIDEVESSFNFDEFVKSRRPTHIKSYHSGNYFSKNITTGLVRTHVNAARAA 403

Query: 661 LKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQ 720
           LK  + +G    DA+A+ +P+ L Q+ K K KL++  +PFLHGMRYTSFGRHFT  +KL+
Sbjct: 404 LKMFE-EGRD-RDARALFDPDSLLQLMKHKKKLEI--SPFLHGMRYTSFGRHFTNPEKLK 459

Query: 721 AIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKR 780
            IV++LHWYV +GD +VDFCCG+NDFSCLMK+KL ETGK C YKN D++P KN+FNFE R
Sbjct: 460 EIVERLHWYVENGDTVVDFCCGSNDFSCLMKEKLMETGKICFYKNLDLIPPKNNFNFEMR 519

Query: 781 DWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRK 840
           DW++V+ +EL  GS+LIMGLNPPFG KA LAN FI KALEF PK+LILIVP ET+R+D  
Sbjct: 520 DWLSVKEEELPDGSQLIMGLNPPFGYKASLANTFIKKALEFKPKILILIVPSETKRVDAI 579

Query: 841 ESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEK 900
           +  YEL+WED   L+G SFYLPGSVD NDK ++QWN   PPLYLWSR D +  HK  A +
Sbjct: 580 DD-YELIWEDRNLLAGMSFYLPGSVDVNDKTIEQWNNITPPLYLWSRRDLSRSHKTTAVQ 638

Query: 901 HGHISRPQSRTQMERNCY--ETHAVDHPKEEGQGDASMLIDLPL 942
            GHI      TQM    Y  ++H  + P  +G  D  M+ D+ +
Sbjct: 639 QGHI------TQMHHFTYNGDSHHTEIP--QGDDDTDMVQDMEI 674



 Score =  204 bits (518), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 121/304 (39%), Positives = 171/304 (56%), Gaps = 36/304 (11%)

Query: 163 LERDDVLAKSKFLVTFL----EEKPTKRKLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDE 218
           ++RD+ L KSKFL+TFL    +  P + +L  + +Q  A +    D + E+M  +     
Sbjct: 1   MKRDEDLTKSKFLITFLGKTSQTTPIEVELPTDHLQ-DAQTPMEQDSIVENMTDEENSSS 59

Query: 219 SNEEDEL---FDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLT-KDEVEA 274
           S+++ E    FD VCS CDNGG +LCCEG CLRSFH TI  G E+ C SLG T K +++A
Sbjct: 60  SDDDSEANLQFDPVCSICDNGGYVLCCEGSCLRSFHPTIADGIETECESLGFTDKTQIQA 119

Query: 275 MLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEV 334
           +  + C NC YKQHQC+ACG+LGSSD+    +VFPC ++ CGHFYHP CV++LL  DD+ 
Sbjct: 120 LGTYLCNNCLYKQHQCYACGELGSSDENFSQQVFPCSASNCGHFYHPECVARLLCADDQN 179

Query: 335 AADQLAKSIIAGESFTCPLHKCCICKQGENK---------AD-------SDLQFAVCRRC 378
            A++L   I A + F CPLH C +C   E+K         AD        ++   +    
Sbjct: 180 KAEELQAKIAARDCFACPLHTCKLCNMSEDKNQYACILLYADVAQQLITENVYLGILLLN 239

Query: 379 PKAYHRKCLPR-KIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHII 437
                R C    K+ F        +T A+  ++  H++L     HEID  I TP RDH++
Sbjct: 240 STLTMRHCKELGKVCF--------LTTAFLYIVSFHKLLNR--AHEIDGLILTPARDHLV 289

Query: 438 FPGI 441
           FP +
Sbjct: 290 FPDV 293


>gi|238481509|ref|NP_001154768.1| EDM2-like protein1 [Arabidopsis thaliana]
 gi|10177760|dbj|BAB11073.1| unnamed protein product [Arabidopsis thaliana]
 gi|332008235|gb|AED95618.1| EDM2-like protein1 [Arabidopsis thaliana]
          Length = 636

 Score =  350 bits (899), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 178/344 (51%), Positives = 234/344 (68%), Gaps = 15/344 (4%)

Query: 601 EDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTA 660
           ++ ++R+  ++ +  SS   +E +K  +     +Y S +   K IT G V   V A R A
Sbjct: 303 DEVEQRVMGIIDEVESSFNFDEFVKSRRPTHIKSYHSGNYFSKNITTGLVRTHVNAARAA 362

Query: 661 LKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQ 720
           LK  + +G    DA+A+ +P+ L Q+ K K KL++  +PFLHGMRYTSFGRHFT  +KL+
Sbjct: 363 LKMFE-EGRD-RDARALFDPDSLLQLMKHKKKLEI--SPFLHGMRYTSFGRHFTNPEKLK 418

Query: 721 AIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKR 780
            IV++LHWYV +GD +VDFCCG+NDFSCLMK+KL ETGK C YKN D++P KN+FNFE R
Sbjct: 419 EIVERLHWYVENGDTVVDFCCGSNDFSCLMKEKLMETGKICFYKNLDLIPPKNNFNFEMR 478

Query: 781 DWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRK 840
           DW++V+ +EL  GS+LIMGLNPPFG KA LAN FI KALEF PK+LILIVP ET+R+D  
Sbjct: 479 DWLSVKEEELPDGSQLIMGLNPPFGYKASLANTFIKKALEFKPKILILIVPSETKRVDAI 538

Query: 841 ESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEK 900
           +  YEL+WED   L+G SFYLPGSVD NDK ++QWN   PPLYLWSR D +  HK  A +
Sbjct: 539 DD-YELIWEDRNLLAGMSFYLPGSVDVNDKTIEQWNNITPPLYLWSRRDLSRSHKTTAVQ 597

Query: 901 HGHISRPQSRTQMERNCY--ETHAVDHPKEEGQGDASMLIDLPL 942
            GHI      TQM    Y  ++H  + P  +G  D  M+ D+ +
Sbjct: 598 QGHI------TQMHHFTYNGDSHHTEIP--QGDDDTDMVQDMEI 633



 Score =  191 bits (486), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 101/234 (43%), Positives = 135/234 (57%), Gaps = 28/234 (11%)

Query: 226 FDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLT-KDEVEAMLNFFCKNCE 284
           FD VCS CDNGG +LCCEG CLRSFH TI  G E+ C SLG T K +++A+  + C NC 
Sbjct: 29  FDPVCSICDNGGYVLCCEGSCLRSFHPTIADGIETECESLGFTDKTQIQALGTYLCNNCL 88

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
           YKQHQC+ACG+LGSSD+    +VFPC ++ CGHFYHP CV++LL  DD+  A++L   I 
Sbjct: 89  YKQHQCYACGELGSSDENFSQQVFPCSASNCGHFYHPECVARLLCADDQNKAEELQAKIA 148

Query: 345 AGESFTCPLHKCCICKQGENK---------AD-------SDLQFAVCRRCPKAYHRKCLP 388
           A + F CPLH C +C   E+K         AD        ++   +         R C  
Sbjct: 149 ARDCFACPLHTCKLCNMSEDKNQYACILLYADVAQQLITENVYLGILLLNSTLTMRHCKE 208

Query: 389 R-KIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGI 441
             K+ F        +T A+  ++  H++L     HEID  I TP RDH++FP +
Sbjct: 209 LGKVCF--------LTTAFLYIVSFHKLLNR--AHEIDGLILTPARDHLVFPDV 252


>gi|52354517|gb|AAU44579.1| hypothetical protein AT5G48090 [Arabidopsis thaliana]
 gi|60547933|gb|AAX23930.1| hypothetical protein At5g48090 [Arabidopsis thaliana]
          Length = 364

 Score =  350 bits (899), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 178/344 (51%), Positives = 234/344 (68%), Gaps = 15/344 (4%)

Query: 601 EDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTA 660
           ++ ++R+  ++ +  SS   +E +K  +     +Y S +   K IT G V   V A R A
Sbjct: 31  DEVEQRVMGIIDEVESSFNFDEFVKSRRPTHIKSYHSGNYFSKNITTGLVRTHVNAARAA 90

Query: 661 LKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQ 720
           LK  + +G    DA+A+ +P+ L Q+ K K KL++  +PFLHGMRYTSFGRHFT  +KL+
Sbjct: 91  LKMFE-EGRD-RDARALFDPDSLLQLMKHKKKLEI--SPFLHGMRYTSFGRHFTNPEKLK 146

Query: 721 AIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKR 780
            IV++LHWYV +GD +VDFCCG+NDFSCLMK+KL ETGK C YKN D++P KN+FNFE R
Sbjct: 147 EIVERLHWYVENGDTVVDFCCGSNDFSCLMKEKLMETGKICFYKNLDLIPPKNNFNFEMR 206

Query: 781 DWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRK 840
           DW++V+ +EL  GS+LIMGLNPPFG KA LAN FI KALEF PK+LILIVP ET+R+D  
Sbjct: 207 DWLSVKEEELPDGSQLIMGLNPPFGYKASLANTFIKKALEFKPKILILIVPSETKRVDAI 266

Query: 841 ESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEK 900
           +  YEL+WED   L+G SFYLPGSVD NDK ++QWN   PPLYLWSR D +  HK  A +
Sbjct: 267 DD-YELIWEDRNLLAGMSFYLPGSVDVNDKTIEQWNNITPPLYLWSRRDLSRSHKTTAVQ 325

Query: 901 HGHISRPQSRTQMERNCY--ETHAVDHPKEEGQGDASMLIDLPL 942
            GHI      TQM    Y  ++H  + P  +G  D  M+ D+ +
Sbjct: 326 QGHI------TQMHHFTYNGDSHHTEIP--QGDDDTDMVQDMEI 361


>gi|297791955|ref|XP_002863862.1| hypothetical protein ARALYDRAFT_494862 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309697|gb|EFH40121.1| hypothetical protein ARALYDRAFT_494862 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 625

 Score =  345 bits (884), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 175/343 (51%), Positives = 234/343 (68%), Gaps = 19/343 (5%)

Query: 602 DSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTAL 661
           D ++R+  ++ +  SS   +E +K  K     +Y S + + K IT G V+  V A R AL
Sbjct: 297 DVEQRVMGIIDEVESSFNFDEFVKSRKPTHIKSYHSGNDISKNITTGLVQTHVNAARAAL 356

Query: 662 KKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQA 721
           K  + +G   +DA+A+ +P++L Q+ K K KL++YL+      RYTSFGRHFT  +KL+ 
Sbjct: 357 KMFE-EGRD-KDARAIFDPDILLQLMKHKTKLEIYLS------RYTSFGRHFTNPEKLKE 408

Query: 722 IVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRD 781
           IV++LHWYV +GD +VDFCCG+NDFSCLMK+KL ETGK C +KN+D++P KN+FNFEKRD
Sbjct: 409 IVERLHWYVENGDTVVDFCCGSNDFSCLMKEKLMETGKTCFFKNFDLIPPKNNFNFEKRD 468

Query: 782 WMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKE 841
           W++V+P+EL   S+LIMGLNPPFG KA LAN FI KALEF PK+LILIVP ET+R+D   
Sbjct: 469 WLSVKPEELPDCSQLIMGLNPPFGHKASLANTFIKKALEFKPKILILIVPSETKRVDAI- 527

Query: 842 SAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKH 901
           + YEL+WED   L+G SFYLPGSVD NDK ++QWN   PPLYLWSR D++  HK  A + 
Sbjct: 528 ADYELIWEDRNLLAGMSFYLPGSVDVNDKTIEQWNNIPPPLYLWSRRDWSRSHKTTALQQ 587

Query: 902 GHISRPQSRTQMERNCY--ETHAVDHPKEEGQGDASMLIDLPL 942
            HI      TQM    Y  + H  + P  +G  D  M+ D+ +
Sbjct: 588 DHI------TQMHHFTYNGDCHHTEIP--QGGDDIDMVKDMEI 622



 Score =  289 bits (739), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 133/243 (54%), Positives = 171/243 (70%), Gaps = 2/243 (0%)

Query: 200 GFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEE 259
            FIV++M  D  + +  D+ ++ +  FD VCS CDNGG +LCCEG CLRSFH TI  G E
Sbjct: 4   NFIVNNMI-DEENSSGSDDDSQTNLQFDPVCSICDNGGYVLCCEGSCLRSFHPTIADGIE 62

Query: 260 SHCASLGLT-KDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHF 318
           + C SLG T K +++A+  + C NC YKQHQC+ACG+LGSSD+ +  +VFPC ++ CGHF
Sbjct: 63  TECESLGFTYKTQIQALGTYLCNNCLYKQHQCYACGELGSSDENSSQQVFPCSASNCGHF 122

Query: 319 YHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRC 378
           YHP CV++LL  DD++ A++L   I A + F CPLH C +C   E+K +  L FAVCRRC
Sbjct: 123 YHPICVARLLCADDQIKAEELQAKIAARDYFACPLHICKVCNMSEDKNEYALHFAVCRRC 182

Query: 379 PKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIF 438
           P AYHRKCLPR+I  E   ++  + RAWE LLP +RILIYCL HEID  I TP RDH+IF
Sbjct: 183 PTAYHRKCLPREITSELNSDDDTLQRAWERLLPYNRILIYCLTHEIDGHILTPARDHLIF 242

Query: 439 PGI 441
           P I
Sbjct: 243 PDI 245


>gi|38424061|dbj|BAD01751.1| unknown protein [Oryza sativa Japonica Group]
 gi|38637135|dbj|BAD03389.1| unknown protein [Oryza sativa Japonica Group]
 gi|215704583|dbj|BAG94216.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 653

 Score =  341 bits (874), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 154/214 (71%), Positives = 177/214 (82%)

Query: 690 KNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCL 749
           +NKL+VYLAPF+HGMRYTSFGRHFTK +KL  I +KLHWYV  GDMIVDF CG NDFS  
Sbjct: 9   QNKLRVYLAPFIHGMRYTSFGRHFTKKEKLIEIAEKLHWYVQPGDMIVDFSCGTNDFSQF 68

Query: 750 MKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAG 809
           MK+KLD+ GK C +KNYD++  KN F+FEKRDWMTV  KEL  GS+LIMGLNPPFG KA 
Sbjct: 69  MKEKLDKVGKRCNFKNYDVIQPKNSFSFEKRDWMTVRQKELPHGSKLIMGLNPPFGPKAM 128

Query: 810 LANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDEND 869
           LANKFI+KAL F PKL+ILIVP E ERLDRK+  Y+LVWEDDQ LSGKSFYLPGS+D +D
Sbjct: 129 LANKFIDKALTFKPKLIILIVPKEAERLDRKQQPYDLVWEDDQRLSGKSFYLPGSLDVSD 188

Query: 870 KQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGH 903
           KQ+DQWN + PPLYLWSR D+   HK +AE+HGH
Sbjct: 189 KQIDQWNKSPPPLYLWSRPDWTQKHKRIAEQHGH 222


>gi|52354515|gb|AAU44578.1| hypothetical protein AT5G48090 [Arabidopsis thaliana]
          Length = 272

 Score =  327 bits (838), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 159/272 (58%), Positives = 200/272 (73%), Gaps = 13/272 (4%)

Query: 673 DAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVND 732
           DA+A+ +P+ L Q+ K K KL++  +PFLHGMRYTSFGRHFT  +KL+ IV++LHWYV +
Sbjct: 9   DARALFDPDSLLQLMKHKKKLEI--SPFLHGMRYTSFGRHFTNPEKLKEIVERLHWYVEN 66

Query: 733 GDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAP 792
           GD +VDFCCG+NDFSCLMK+KL ETGK C YKN D++P KN+FNFE RDW++V+ +EL  
Sbjct: 67  GDTVVDFCCGSNDFSCLMKEKLMETGKICFYKNLDLIPPKNNFNFEMRDWLSVKEEELPD 126

Query: 793 GSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQ 852
           GS+LIMGLNPPFG KA LAN FI KALEF PK+LILIVP ET+R+D  +  YEL+WED  
Sbjct: 127 GSQLIMGLNPPFGYKASLANTFIKKALEFKPKILILIVPSETKRVDAIDD-YELIWEDRN 185

Query: 853 FLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQ 912
            L+G SFYLPGSVD NDK ++QWN   PPLYLWSR D +  HK  A + GHI      TQ
Sbjct: 186 LLAGMSFYLPGSVDVNDKTIEQWNNITPPLYLWSRRDLSRSHKTTAVQQGHI------TQ 239

Query: 913 MERNCY--ETHAVDHPKEEGQGDASMLIDLPL 942
           M    Y  ++H  + P  +G  D  M+ D+ +
Sbjct: 240 MHHFTYNGDSHHTEIP--QGDDDTDMVQDMEI 269


>gi|449472641|ref|XP_004153655.1| PREDICTED: uncharacterized protein LOC101221422, partial [Cucumis
           sativus]
          Length = 364

 Score =  323 bits (827), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 181/391 (46%), Positives = 246/391 (62%), Gaps = 34/391 (8%)

Query: 307 VFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKA 366
           VF CV+ATCG+FYHP C+S+LL  +++VAA  L K I +GESF+CP+HKC +C  GENK 
Sbjct: 1   VFQCVNATCGYFYHPKCISRLLHPENKVAAGDLEKKIASGESFSCPVHKCSVCALGENKK 60

Query: 367 DSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDD 426
             +LQFAVCRRCPK+YHRKCLPRKI FE   +    TRAWE LLPN RILIYCL HEID+
Sbjct: 61  IWELQFAVCRRCPKSYHRKCLPRKITFEGSEDGETPTRAWEKLLPN-RILIYCLDHEIDE 119

Query: 427 EIGTPIRDHIIFPGIEENKTIIDR-------PRKKQSLASPSGKQKVASTKSSLTSKAPP 479
           EI TP RDHI FPG+EE++  I +        R+ +++     ++ V S K S+      
Sbjct: 120 EIETPARDHIKFPGLEESRLPIQKRKLPISDTRRGKTIVFRGSRENVVSKKGSMPDDL-- 177

Query: 480 QGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVD 539
           QGK + K  K     +  G+ +       + S  S+  K  ++SR S   N K  SV +D
Sbjct: 178 QGKSAAKVSKSFERSSSDGKLLGKVTAKSLWSSESKNVKLGNISRNSL--NQKGESVLMD 235

Query: 540 RSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSL 599
              ++  KK+SL  +  +A  T+  + +K  K+D S                    +  L
Sbjct: 236 IDKTIKVKKSSLVGK--SAMPTKRFDPSKIYKEDRSG-------------------MLLL 274

Query: 600 DEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRT 659
           D +S+RRL  +MK+ ASS+ +E+++K+HK+PSTHAY+ K  VDK I +GK+EGSV A+R 
Sbjct: 275 DANSERRLMDMMKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSVVAVRA 334

Query: 660 ALKKLDVDGSSIEDAKAVCEPEVLSQIFKWK 690
           AL+KL+ +G  IEDA+AVCEPEVL+ IFKWK
Sbjct: 335 ALRKLE-EGCCIEDAEAVCEPEVLNHIFKWK 364


>gi|414592041|tpg|DAA42612.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 1006

 Score =  321 bits (823), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 182/445 (40%), Positives = 257/445 (57%), Gaps = 29/445 (6%)

Query: 15  VSDYYFEHEGERKQKEIISFSALPIQWNENERTGG-CKELIYLRGAADSGLQKIFKPVIA 73
           V+ YYFE +GE K    + FS LP Q+ E++      ++ ++L G  D  L  ++K V+A
Sbjct: 16  VNRYYFE-DGEEKP---VCFSILPFQFGEDDSEAVFLRKDVFLCGFVDKNL-PVYKEVVA 70

Query: 74  WKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSYAKRNPEATAKSIW 133
           WK  L +  P I+VLS E+ WIKL KPRKCY +I R+ LI V  L +  R  + ++  +W
Sbjct: 71  WKIRLDSEHPNIYVLSIEHKWIKLLKPRKCYGDIVRSTLITVQMLHFFGRGEQRSSNHLW 130

Query: 134 DFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLEE--KPTKRKLSDE 191
           D L   F  Y  +P ++DL+ H  LIK  +E+D  L KSK L   +E   K TK+ L  E
Sbjct: 131 DHLDEVFGKYNPKPVEDDLMKHHTLIKLFVEKDQTLMKSKILQRLIENGFKRTKKALGME 190

Query: 192 VVQTKAMSGFIVDDMEEDMVHD-------------TEEDESNEEDELFDSVCSFCDNGGD 238
             Q+    G+     +++   +             +++ + + +D++ D +C+ CD+GG 
Sbjct: 191 -AQSIVSDGWRARKNDDNNYGNKDDSGDDCDGDGSSDDGDGSSDDDVTDQICALCDDGGH 249

Query: 239 LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGS 298
           LL C+G C RSFH T   G ES C SL  T  EV+ +  + C NC+ KQHQCF CG+L  
Sbjct: 250 LLSCDGPCKRSFHPTKKDGRESKCESLHYTSAEVKRIGTYLCANCKNKQHQCFRCGELEP 309

Query: 299 SDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCI 358
           S     A+VF C  A+CG+FYHP C+++LL  D      +L + I++G SF CP+H C  
Sbjct: 310 SHGPN-AKVFQCNQASCGYFYHPKCIAQLL--DPNATDGELERRIMSGMSFPCPIHWCFK 366

Query: 359 CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIY 418
           C   ENKA   LQ AVCRRCP+AYHR+CLPR ++F  K ++G   RAW+    +  I IY
Sbjct: 367 CGHMENKAQRALQLAVCRRCPRAYHRECLPRDLSFGTKDKDG-NQRAWK---LSDTIFIY 422

Query: 419 CLKHEIDDEIGTPIRDHIIFPGIEE 443
           CL HEID + GT  R+HI FP   E
Sbjct: 423 CLDHEIDKDTGTTSRNHIKFPATPE 447


>gi|414592040|tpg|DAA42611.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 1010

 Score =  319 bits (817), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 183/455 (40%), Positives = 253/455 (55%), Gaps = 45/455 (9%)

Query: 15  VSDYYFEHEGERKQKEIISFSALPIQWNENERTGG-CKELIYLRGAADSGLQKIFKPVIA 73
           V+ YYFE +GE K    + FS LP Q+ E++      ++ ++L G  D  L  ++K V+A
Sbjct: 16  VNRYYFE-DGEEKP---VCFSILPFQFGEDDSEAVFLRKDVFLCGFVDKNL-PVYKEVVA 70

Query: 74  WKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSYAKRNPEATAKSIW 133
           WK  L +  P I+VLS E+ WIKL KPRKCY +I R+ LI V  L +  R  + ++  +W
Sbjct: 71  WKIRLDSEHPNIYVLSIEHKWIKLLKPRKCYGDIVRSTLITVQMLHFFGRGEQRSSNHLW 130

Query: 134 DFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLEE--KPTKRKLSDE 191
           D L   F  Y  +P ++DL+ H  LIK  +E+D  L KSK L   +E   K TK+ +   
Sbjct: 131 DHLDEVFGKYNPKPVEDDLMKHHTLIKLFVEKDQTLMKSKILQRLIENGFKRTKKPI--- 187

Query: 192 VVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDE-----------------------LFDS 228
               KA+ G     +  D     + D++N  ++                       + D 
Sbjct: 188 ---LKAL-GMEAQSIVSDGWRARKNDDNNYGNKDDSGDDCDGDGSSDDGDGSSDDDVTDQ 243

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           +C+ CD+GG LL C+G C RSFH T   G ES C SL  T  EV+ +  + C NC+ KQH
Sbjct: 244 ICALCDDGGHLLSCDGPCKRSFHPTKKDGRESKCESLHYTSAEVKRIGTYLCANCKNKQH 303

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
           QCF CG+L  S     A+VF C  A+CG+FYHP C+++LL  D      +L + I++G S
Sbjct: 304 QCFRCGELEPSHGPN-AKVFQCNQASCGYFYHPKCIAQLL--DPNATDGELERRIMSGMS 360

Query: 349 FTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEG 408
           F CP+H C  C   ENKA   LQ AVCRRCP+AYHR+CLPR ++F  K ++G   RAW+ 
Sbjct: 361 FPCPIHWCFKCGHMENKAQRALQLAVCRRCPRAYHRECLPRDLSFGTKDKDG-NQRAWK- 418

Query: 409 LLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEE 443
              +  I IYCL HEID + GT  R+HI FP   E
Sbjct: 419 --LSDTIFIYCLDHEIDKDTGTTSRNHIKFPATPE 451


>gi|414884833|tpg|DAA60847.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 429

 Score =  311 bits (796), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 188/448 (41%), Positives = 255/448 (56%), Gaps = 45/448 (10%)

Query: 242 CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 301
           C+G C+RSFHA   +GE+S C +LG T+ EVEAM  F CKNCEYKQHQCF CG L  SD 
Sbjct: 22  CDGPCMRSFHAKEGSGEDSCCVTLGYTEAEVEAMKLFLCKNCEYKQHQCFICGVLEPSDG 81

Query: 302 ETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQ 361
           E  A+VF C +ATCGHFYHP CV++ L   +   A +L K I  G SFTCP+H C  CK 
Sbjct: 82  E-AAKVFLCNNATCGHFYHPKCVAQQLHPINIDEASELEKKITEGFSFTCPIHWCFHCKG 140

Query: 362 GENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLK 421
            E++    LQFAVCRRCPK+YHRKCLP +I+F++  E+ II RAWE    + RILIYCL 
Sbjct: 141 LEDRTQEPLQFAVCRRCPKSYHRKCLPSEISFDENEEDDIIIRAWEL---SKRILIYCLD 197

Query: 422 HEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSKAPPQG 481
           HEID E+ TP+RDHI FP IE+    +   +K  SL     K+  + T    + K   + 
Sbjct: 198 HEIDSELETPVRDHIKFPKIEKPLQFV---KKGSSLLPKKKKRTYSETVLEQSPKDTVKR 254

Query: 482 KFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRS 541
           K  V  L+R  +K    E                      VS KSF  N+      VD  
Sbjct: 255 KSKVHMLEREQNKQDTRE----------------------VSAKSFTQNL------VDTP 286

Query: 542 SSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDE 601
               +K     +  Y A       +  F     S +E  + +   PL     S  P +D 
Sbjct: 287 VKKKAKLKEKIQLPYVA-------KDHFISSPKSVKEQEQELVPLPLSAIRKSSFPKVDS 339

Query: 602 DSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTAL 661
           ++++R+ +++ +  SS+ ++++ +   +PSTH Y+ +   DK I LGK+E SV+A+  AL
Sbjct: 340 ETEKRVITILGNKVSSLTIKDVTRNCSLPSTHVYSGR-LTDK-IPLGKIERSVQAVEAAL 397

Query: 662 KKLDVDGSSIEDAKAVCEPEVLSQIFKW 689
           KKL+ +G S+ DAKAVCEPEVL Q+ +W
Sbjct: 398 KKLE-NGGSVNDAKAVCEPEVLRQLARW 424


>gi|449467187|ref|XP_004151306.1| PREDICTED: uncharacterized protein LOC101207373, partial [Cucumis
           sativus]
          Length = 625

 Score =  299 bits (766), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 140/204 (68%), Positives = 165/204 (80%), Gaps = 2/204 (0%)

Query: 736 IVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSR 795
           IVDFCCGANDFS LMKKKLDETGK C ++N+D +P KNDFNFEKRDWMTV+PKEL  GS+
Sbjct: 1   IVDFCCGANDFSVLMKKKLDETGKQCSFRNFDFIPPKNDFNFEKRDWMTVQPKELPKGSQ 60

Query: 796 LIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLS 855
           LIMGLNPPFGVKA LANKF++KALEFNPKLLILIVPPETERLD K + Y+LVWED +FLS
Sbjct: 61  LIMGLNPPFGVKAALANKFVDKALEFNPKLLILIVPPETERLDEKMTPYDLVWEDTEFLS 120

Query: 856 GKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHIS-RPQSRTQME 914
           GKSFYLPGSVD  DKQMDQWN+  P LYLWSR D+   H A+A++HGH+  R Q+ ++ E
Sbjct: 121 GKSFYLPGSVDAKDKQMDQWNVRPPVLYLWSRRDWTHKHTAIAQEHGHLWLRKQAHSEKE 180

Query: 915 RNCYETHAVDHPKEEGQGDASMLI 938
            N  +T  V   +E  +G +S L+
Sbjct: 181 -NASDTLRVRQLEESEKGKSSDLL 203



 Score = 64.7 bits (156), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 70/249 (28%), Positives = 110/249 (44%), Gaps = 44/249 (17%)

Query: 1198 AKSSLEDH-SSKSIDIPSQT---GFGSDYQHHEP-----TRSSSHIGTTYYGTQAGIPND 1248
            A  ++ DH SSKS+++P      G G    HH+      T  ++++G  Y   Q  + +D
Sbjct: 382  ASPNVSDHTSSKSVEMPPYADVDGIG----HHQQSGSTMTEPNTNLGAPYDAAQISLTDD 437

Query: 1249 MG-SYGMSSL------NNGLSHGA---------NLDERYTGYVRNTDSLGYRP-SMSTD- 1290
            +   Y +++       N G S+ A         +++ER    +     L Y+P S   D 
Sbjct: 438  ITRKYNLNAEESYPRGNTGWSNNASPMYDIGSRHVEERIVDQMGGRVGLNYKPYSTGVDT 497

Query: 1291 --RELTMWPLARIYGQDFP---------APTPGYGQMGSVPSNLYGNLGSSAEASYRMST 1339
              R+  +    R YG              P P Y ++G++P++ YG+LG+ +E S  M+T
Sbjct: 498  YMRDSEIRSHIRHYGHPDTDNLRSNYQVGPDPRYSRIGAIPAS-YGHLGTFSEPSRWMNT 556

Query: 1340 SAMDRYAPRLHQLNNTRMNTFRSEPFMPSRFGFYDSRAPQPGFFA-DMDFGPGFHPPFPQ 1398
            SA  RY PRL +LN+TR+        M     F        GF      F PG   P+  
Sbjct: 557  SATQRYVPRLDELNHTRLGGMGVARQMNGGSTFDPRVHTSSGFRGVSQGFAPGPQYPYSN 616

Query: 1399 QGSGGWLDD 1407
            Q S GWL++
Sbjct: 617  QNSAGWLNE 625


>gi|222640381|gb|EEE68513.1| hypothetical protein OsJ_26947 [Oryza sativa Japonica Group]
          Length = 1287

 Score =  281 bits (718), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 185/507 (36%), Positives = 265/507 (52%), Gaps = 76/507 (14%)

Query: 15  VSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKEL--IYLRGAADSGLQKIFKPVI 72
           V  YYFE E E K    + FS LPI+++ENE    C +   + LRG  D+  + +FK VI
Sbjct: 15  VDIYYFE-ESEDKP---VCFSVLPIKFDENEEVS-CSDYKELNLRGFTDNN-RHVFKKVI 68

Query: 73  AWKFDLTNVIPEIFVLSKENSWIKLQKPRKCY-EEIYRTILIMVHCLSYAKRNPEATAKS 131
           AW+ DL    P+I VLS +  WI+L KPR CY E+  R+ILI V  L + ++ P+   +S
Sbjct: 69  AWRVDLDCQRPKISVLSSDGKWIELLKPRMCYYEKKVRSILITVQMLHFVRKCPKKQQRS 128

Query: 132 IWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLEEKPTKRKLSDE 191
           ++D L   F  +   P+ +D+  H  LIK  +ERD +L KSK +  F+E+   K   + E
Sbjct: 129 LFDHLGEVFNKFGPVPNGDDIKKHHHLIKLFMERDPILVKSKIIRCFIEDASRK---AIE 185

Query: 192 VVQTKAMS--GFIVDD-------------------MEEDMVHDTEEDESNEEDELFDSVC 230
            +  +A +   FIV D                     +D   +  + + N  D+  D +C
Sbjct: 186 PIPRRACTEDRFIVSDESLESSDDCSYSNSDSSDDNNDDSTDEDTDTDGNATDDDTDMIC 245

Query: 231 SFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN--FFCKNCEYKQH 288
           + CD+GG LL CEG C RSFHA +  G +S C +L  T  E++   +  F C+NCE+ +H
Sbjct: 246 AICDDGGKLLSCEGPCKRSFHARVKHGRKSKCRTLRFTSVELKLKESGTFLCENCEHNEH 305

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
           QCF CG+L  S     A+VF C   +CGHFYHP C++ LL  +D   A +L + I  G S
Sbjct: 306 QCFKCGELEPS-YGPNAKVFQCNKESCGHFYHPKCIAVLLEPEDTDGACKLEEMIADGMS 364

Query: 349 FTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEG 408
           FTCP+H C  C + E++  S+ QFAVCRRCP++YH +CLPR                   
Sbjct: 365 FTCPVHWCFKCAKMEDRTRSEFQFAVCRRCPRSYHTECLPRH------------------ 406

Query: 409 LLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVA- 467
                        H ID   GTP  +HI FP I + K   +  +K    A  +GK+K + 
Sbjct: 407 -------------HGIDATTGTPSGEHIKFPSIPKIKKTKNLSKKD---AKVTGKRKKSG 450

Query: 468 ---STKSSLTSKAPPQGKFSVKALKRV 491
              STKS+  +   P+G+   K  +RV
Sbjct: 451 NKFSTKSTKLANVSPRGE--SKQTRRV 475



 Score =  163 bits (413), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 81/208 (38%), Positives = 119/208 (57%), Gaps = 21/208 (10%)

Query: 719 LQAIVDKLHWYVNDGDM---------------------IVDFCCGANDFSCLMKKKLDET 757
           L  I DKLHWYV  G+                      IVD C   ++FS LMK+KL++ 
Sbjct: 588 LVQIADKLHWYVQPGNTESCGKLPDGVLLERSIVVVREIVDLCFNMDNFSRLMKEKLEDV 647

Query: 758 GKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINK 817
            K C +KNYD+   KN+  F++ +W+T++PK+L  G  L+M L+ P  ++A  ANKF++K
Sbjct: 648 SKGCNFKNYDLFQHKNNLCFDESNWVTMQPKDLPHGLNLVMVLDLPLDIQAVSANKFVDK 707

Query: 818 ALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNM 877
            L F P+L+I++ P + ERLD KE  Y L+WE +Q + GK  Y PG VD ND   +  ++
Sbjct: 708 VLTFKPRLIIIVSPTDIERLDCKEEPYHLIWEYNQHIFGKPLYQPGDVDVNDDAKNGLHV 767

Query: 878 TAPPLYLWSRHDYAAHHKALAEKHGHIS 905
               + LWS  D    +  +A KHGH++
Sbjct: 768 IPLCVSLWSFPDKTEENMRIAGKHGHLN 795


>gi|357160452|ref|XP_003578769.1| PREDICTED: PHD finger-containing protein DDB_G0268158-like
           [Brachypodium distachyon]
          Length = 298

 Score =  262 bits (669), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 128/250 (51%), Positives = 168/250 (67%), Gaps = 18/250 (7%)

Query: 215 EEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEA 274
           EED SNE    +D+ C+ CD+GGD +CCEG CLRSFH T   GE+S C +LGLT++  + 
Sbjct: 27  EEDRSNE----WDNACAICDDGGDFICCEGGCLRSFHPTKKYGEDSMCTTLGLTEEWWQT 82

Query: 275 MLN-----FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 329
           + +     + CKNCEYKQHQCFACG LGSS   +G+EV  C    CG+FYHP CV++L+ 
Sbjct: 83  LQSNEQEKYICKNCEYKQHQCFACGLLGSSGLTSGSEVIQCKDKMCGYFYHPKCVAELVH 142

Query: 330 RDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPR 389
            D +  A    K +  G +F CPLH+C +CK+ EN+ D D+Q AVCRRCP AYHRKCLP 
Sbjct: 143 PDSKAQAIFFEKRVALGLTFHCPLHRCSLCKEAENRDDKDMQLAVCRRCPTAYHRKCLPS 202

Query: 390 KIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIID 449
            I+FE   +EG   RAW+ +LP+ +ILIYC+KH+ID ++ TP RDHI+FP         D
Sbjct: 203 DISFEKDGKEGTQQRAWDNVLPD-QILIYCMKHKIDKDLRTPKRDHIVFPD--------D 253

Query: 450 RPRKKQSLAS 459
           RP  + S +S
Sbjct: 254 RPLSEPSRSS 263


>gi|218200973|gb|EEC83400.1| hypothetical protein OsI_28843 [Oryza sativa Indica Group]
          Length = 615

 Score =  261 bits (666), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 163/464 (35%), Positives = 240/464 (51%), Gaps = 49/464 (10%)

Query: 29  KEIISFSALPIQWNENERTGGCKELIYLRGAADSGLQKIFKPVIAWKFDLTNVIPEIFVL 88
           ++++ FS LP  ++E  R  G ++  Y+RG    G Q ++K + AW+ +L    P I VL
Sbjct: 27  EDLVCFSILPFWFDEEHRLPGSEKGAYIRGIDCEGRQPVYKRIEAWRVELDCEKPMISVL 86

Query: 89  SKENSWIKLQKPRKCY-EEIYRTILIMVHCLSYAKRNPEATAKSIWDFLSRFFCLYEVRP 147
           S E +WI+L  P   Y E+I R++LI +  + + +++P    +++   L   F  +  +P
Sbjct: 87  SSEGNWIRLLNPHPGYAEDIARSVLITIQMIHFVRKHPAKDERTLSKHLCEVFRKFFTKP 146

Query: 148 SQ-NDLVDHMDLIKEALERDDVLAKSKFLVTFLEEKPTKRKLSDEVVQTKAMSGFIV--- 203
            + +DL     LIK  ++ D  L KSK L   LE+  T   +  E+ +      FIV   
Sbjct: 147 PEVDDLRKSYSLIKYFMKMDQTLVKSKILRRPLED--TWIIIETELRKASLNESFIVSNE 204

Query: 204 -----DDMEEDMVHDTEED------ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHA 252
                DD  ED    + +D      + N  D+  ++VC+ CD GG LL C+G C RSFH 
Sbjct: 205 SSTNCDDYTEDGSSGSSDDNGDNYTDGNASDDGTEAVCAICDQGGILLGCKGECKRSFHP 264

Query: 253 TIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVS 312
           T++ G +S C +LG T  EVE +  F C NC++KQHQCF CG+L SS  ET  +VF C +
Sbjct: 265 TLEDGTKSFCKTLGYTSREVEEIPIFICSNCQHKQHQCFKCGQLDSS-HETNPKVFQCCN 323

Query: 313 ATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQF 372
           A+CG FYHP CV+ LL  D    A  L K I  G +FTCP+H C  CKQ E+        
Sbjct: 324 ASCGRFYHPKCVAGLLEPD---GACGLEKRIADGMTFTCPVHWCFECKQIED-------- 372

Query: 373 AVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPI 432
                           R+I+F+ K    +  R WE    + +I IYCL H+ID    TP 
Sbjct: 373 ----------------REISFKKK---DVTARVWEVPKGDPKIFIYCLDHDIDATFRTPC 413

Query: 433 RDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSK 476
           RDHI FP   + + + D  RKK  +       +V+   + L++K
Sbjct: 414 RDHIKFPSAPQIERVKDLARKKVKVTDIRNTDEVSPESAELSTK 457


>gi|218186609|gb|EEC69036.1| hypothetical protein OsI_37847 [Oryza sativa Indica Group]
          Length = 345

 Score =  261 bits (666), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 137/301 (45%), Positives = 177/301 (58%), Gaps = 29/301 (9%)

Query: 226 FDSVCSFCDNGGDLLCCEGRCLRSFHAT-IDAGEESHCASLGLTKDEVEAMLN------- 277
           ++ VCS CD+GGD LCCEGRCLRSFHAT     E + C +LGLT+++ +           
Sbjct: 42  WEDVCSICDDGGDFLCCEGRCLRSFHATEKHITEINQCTTLGLTEEQWKIFRQNDEKSEP 101

Query: 278 -FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAA 336
            + CKNC+Y QHQCF+CG LGSSD  +GAEVF C    CGHFYHP C+++LL  D  +  
Sbjct: 102 LYICKNCKYNQHQCFSCGLLGSSDLSSGAEVFQCEDEKCGHFYHPKCLARLLYPDSSMQP 161

Query: 337 DQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDK 396
               + +  G  F CP+HKC +CK GENK D + QFAVCRRCP  YHRKCLP  I FED 
Sbjct: 162 LNFEEEVARGLKFLCPVHKCHVCKGGENKNDMENQFAVCRRCPTVYHRKCLPSDIVFEDD 221

Query: 397 LEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP--------------GIE 442
              GI+ RAW+ +L   +ILIYC+KHEID E+GTP R+HIIFP              G E
Sbjct: 222 ESNGIVQRAWDDVL-RDQILIYCMKHEIDPELGTPARNHIIFPDCKNLLTRGPSKPKGQE 280

Query: 443 ENKTIIDRPRKKQSL---ASPSGKQKVASTKSSLTSKAPPQG--KFSVKALKRVPSKAGQ 497
           +   ++D P +  S    + PS      + ++  T +    G   F+ KAL  +P     
Sbjct: 281 DAPAVLDIPEEDMSPDHSSEPSQPPAQPAAETDQTDQDLSNGFNSFAPKALFPLPYPGSC 340

Query: 498 G 498
           G
Sbjct: 341 G 341


>gi|224061077|ref|XP_002300345.1| predicted protein [Populus trichocarpa]
 gi|222847603|gb|EEE85150.1| predicted protein [Populus trichocarpa]
          Length = 582

 Score =  260 bits (665), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 152/297 (51%), Positives = 193/297 (64%), Gaps = 8/297 (2%)

Query: 771  AKNDFNFEKRDWMTVEPKELAP-GSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILI 829
             +NDFNFEKRDWMTV P EL   GS+LIMGLNPPFGVKA LANKFI+KAL+F PKLLILI
Sbjct: 100  TQNDFNFEKRDWMTVCPDELPKKGSQLIMGLNPPFGVKAALANKFIDKALQFKPKLLILI 159

Query: 830  VPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHD 889
            VPPETERLD+K+  Y LVWE+D FLSGKSFYLPGSV+ENDKQMDQWN+TAPPLYLWSR +
Sbjct: 160  VPPETERLDKKK-PYNLVWENDHFLSGKSFYLPGSVNENDKQMDQWNVTAPPLYLWSRQE 218

Query: 890  YAAHHKALAEKHGHISRPQSRTQMERNCYETHAVDHPKEE-GQGDASMLID-LPLQINVT 947
            ++A HKA+A+KHGH  + Q  + +++N +ET   D   ++     ASML + +PLQ    
Sbjct: 219  WSAKHKAIAQKHGHPFKQQEISNLDKNHFETKTPDPVNDQYNNAGASMLPNYIPLQSKEP 278

Query: 948  KELRNEAREDDKAGFPD-NATEGGGESSHGHGDNQSGKTSRKRKRDRKKHGSGMRENSPL 1006
            +E       DD  G    N ++   + SH    + S +TSRK+++  K    G  E S  
Sbjct: 279  EESNCGIVNDDHKGRSQCNNSDRESQDSHCPRKSHSDETSRKKRQGEKMVERGTGEKSLE 338

Query: 1007 DGQNRGRHLASGIHGMSKHSPANIANVS--PLLEGHSSKSIDMPSHVGSGDNDCQHF 1061
              QN G+  +        H P+   N+    LL+G SS+S++  S  G G N  QH 
Sbjct: 339  GRQNGGKKPSPSDSDKGVHRPSPPPNIDGRSLLDG-SSRSVEKQSQAGIGKNCYQHL 394



 Score =  127 bits (319), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 94/244 (38%), Positives = 128/244 (52%), Gaps = 35/244 (14%)

Query: 1185 DKGVPRCSPAKVIAKSSLEDHSSKSIDIPSQTGFGSD-YQHHEPTRSSSHI---GTTYYG 1240
            DKGV R SP   I   SL D SS+S++  SQ G G + YQH +P+ S S+     T Y G
Sbjct: 353  DKGVHRPSPPPNIDGRSLLDGSSRSVEKQSQAGIGKNCYQHLDPSFSDSYSQQHATPYGG 412

Query: 1241 TQAGIPNDMGSYGMS------SLN-NGLSHGANLDERYTGYVRNTDSLGYRPSMSTDREL 1293
            + A   +DM     +      SLN +GLS G N++E+ T  +  T             E 
Sbjct: 413  SWASNHDDMNRRHCTNIHESYSLNIHGLSSGGNMEEQSTRCMNGT-------------EF 459

Query: 1294 TMWPLARIYG--------QDFPAPTP-GYGQMGSVPSNLYGNLGSSAEASYRMSTSAMDR 1344
               P   +YG         ++P+    GYG M   P+  YG++GS++E  Y M+ SAM R
Sbjct: 460  VRQPQVHLYGLQGADSARWNYPSGRYLGYGHMEPAPAIPYGHMGSASEPPYMMNMSAMQR 519

Query: 1345 YAPRLHQLNNTRMNTFRSEPFMPSRFGFYDSRAPQPGF-FADMDFGPGFHPPFPQQGSGG 1403
            YAPRL +LN+TRM++   EP M +R G YD RAP  G+ F  M F PG   P+P   S G
Sbjct: 520  YAPRLDELNHTRMSSLGPEPSMLNRNGSYDPRAPGAGYRFDSMGFAPGPQHPYPHH-SAG 578

Query: 1404 WLDD 1407
            WL++
Sbjct: 579  WLNE 582



 Score = 41.6 bits (96), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 27/48 (56%), Gaps = 16/48 (33%)

Query: 748 CLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSR 795
           C+MKKKL+E GK C YKNYD+                ++PK+ AP +R
Sbjct: 12  CIMKKKLEEMGKKCSYKNYDV----------------IQPKKYAPDNR 43


>gi|222640379|gb|EEE68511.1| hypothetical protein OsJ_26944 [Oryza sativa Japonica Group]
          Length = 566

 Score =  259 bits (662), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 162/464 (34%), Positives = 240/464 (51%), Gaps = 49/464 (10%)

Query: 29  KEIISFSALPIQWNENERTGGCKELIYLRGAADSGLQKIFKPVIAWKFDLTNVIPEIFVL 88
           ++++ FS LP  ++E  R  G ++ +Y+RG    G Q ++K + AW+ +L    P I VL
Sbjct: 27  EDLVCFSILPFWFDEEHRLLGSEKGVYIRGIDCEGRQPVYKRIEAWRVELDCEKPMISVL 86

Query: 89  SKENSWIKLQKPRKCY-EEIYRTILIMVHCLSYAKRNPEATAKSIWDFLSRFFCLYEVRP 147
           S E +WI+L  P   Y E+I R++LI +  + + +++P    +++   L   F  +  +P
Sbjct: 87  SSEGNWIRLLNPHPGYAEDIARSVLITIQMIHFVRKHPAKDERTLSKHLCEVFRKFFTKP 146

Query: 148 SQ-NDLVDHMDLIKEALERDDVLAKSKFLVTFLEEKPTKRKLSDEVVQTKAMSGFIV--- 203
            + +DL     LIK  ++ D  L KSK L   LE+  T   +  E+ +      FIV   
Sbjct: 147 PEVDDLRKSYSLIKYFMKMDQTLVKSKILRRPLED--TWIIIETELRKASPNESFIVSNE 204

Query: 204 -----DDMEEDMVHDTEED------ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHA 252
                DD  ED    + +D      + N  D+  ++VC+ CD GG LL C+G C RSFH 
Sbjct: 205 SSTNCDDYTEDGSSGSSDDNGDNYTDGNASDDGTEAVCAICDQGGILLGCKGECKRSFHP 264

Query: 253 TIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVS 312
            ++ G +S C +LG T  EVE +  F C NC++KQHQCF CG+L SS  ET  +VF C +
Sbjct: 265 KLEDGTKSFCKTLGYTSREVEEIPIFICSNCQHKQHQCFKCGQLDSS-HETNPKVFQCCN 323

Query: 313 ATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQF 372
           A+CG FYHP CV+ LL  D    A  L K I  G +FTCP+H C  CKQ E+        
Sbjct: 324 ASCGRFYHPKCVAGLLEPD---GACGLEKRIADGMTFTCPVHWCFECKQIED-------- 372

Query: 373 AVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPI 432
                           R+I+F+ K    +  R WE    + +I IYCL H+ID    TP 
Sbjct: 373 ----------------REISFKKK---DVTARVWEVPKGDPKIFIYCLDHDIDATFRTPC 413

Query: 433 RDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSSLTSK 476
           RDHI FP   + + + D  RKK  +       +V+   + L++K
Sbjct: 414 RDHIKFPSAPQIERVKDLARKKVKVTDIRNTDEVSPESAELSTK 457


>gi|224061090|ref|XP_002300347.1| predicted protein [Populus trichocarpa]
 gi|222847605|gb|EEE85152.1| predicted protein [Populus trichocarpa]
          Length = 238

 Score =  259 bits (661), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 141/242 (58%), Positives = 179/242 (73%), Gaps = 6/242 (2%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNE--NERTGGCKELIYLRG 58
           MASSDDE +   + VS Y+F  +    + E +SFS LPI+W E  N   G    +I+L+G
Sbjct: 1   MASSDDEADTLPESVSTYHFADD----KDEPLSFSLLPIRWRESKNFDDGKKNHMIFLKG 56

Query: 59  AADSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCL 118
           + D+GL+ I+K VIAW FDL+N  P+I VL+KE  W++L KPRK YE + RT+LI VHCL
Sbjct: 57  SVDNGLRTIYKQVIAWTFDLSNTTPQISVLTKEKCWMELGKPRKSYEIMIRTVLITVHCL 116

Query: 119 SYAKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTF 178
            +A+ NPEA+ KS+WD+LS+ F LYE RPSQNDLVDH+DLI EA+ R++ LAK KFL+ F
Sbjct: 117 HFARWNPEASGKSVWDYLSKTFSLYEHRPSQNDLVDHLDLIGEAVRRENSLAKCKFLLNF 176

Query: 179 LEEKPTKRKLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGD 238
           L EKP K+ LSDE  Q   MS FIVDD+E+D   D EEDESN+EDELFDSVC+FCDNGG+
Sbjct: 177 LGEKPRKKMLSDEDFQAATMSAFIVDDVEDDNFEDLEEDESNDEDELFDSVCAFCDNGGN 236

Query: 239 LL 240
           LL
Sbjct: 237 LL 238


>gi|115487874|ref|NP_001066424.1| Os12g0224500 [Oryza sativa Japonica Group]
 gi|108862352|gb|ABA96791.2| hydroxyproline-rich glycoprotein family protein, putative,
           expressed [Oryza sativa Japonica Group]
 gi|113648931|dbj|BAF29443.1| Os12g0224500 [Oryza sativa Japonica Group]
 gi|215693390|dbj|BAG88772.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 345

 Score =  258 bits (658), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 122/223 (54%), Positives = 151/223 (67%), Gaps = 10/223 (4%)

Query: 226 FDSVCSFCDNGGDLLCCEGRCLRSFHAT-IDAGEESHCASLGLTKDEVEAMLN------- 277
           ++ VCS CD+GGD LCCEGRCLRSFHAT     E + C +LGLT+++ +           
Sbjct: 42  WEDVCSICDDGGDFLCCEGRCLRSFHATEKHITEINQCTTLGLTEEQWKIFRQNDEKSEP 101

Query: 278 -FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAA 336
            + CKNC+Y QHQCF+CG LGSSD  +GAEVF C    CGHFYHP C+++LL  D  +  
Sbjct: 102 LYICKNCKYNQHQCFSCGLLGSSDLSSGAEVFQCEDEKCGHFYHPKCLARLLYPDSSMQP 161

Query: 337 DQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDK 396
               + +  G  F CP+HKC +CK GENK D + QFAVCRRCP  YHRKCLP  I FED 
Sbjct: 162 LNFEEEVARGLKFLCPVHKCHVCKGGENKNDMENQFAVCRRCPTVYHRKCLPSDIVFEDD 221

Query: 397 LEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 439
              GI+ RAW+ +L   +ILIYC+KHEID E+GTP R+HIIFP
Sbjct: 222 ESNGIVQRAWDDVL-RDQILIYCMKHEIDPELGTPARNHIIFP 263


>gi|302782778|ref|XP_002973162.1| hypothetical protein SELMODRAFT_99176 [Selaginella moellendorffii]
 gi|300158915|gb|EFJ25536.1| hypothetical protein SELMODRAFT_99176 [Selaginella moellendorffii]
          Length = 215

 Score =  256 bits (653), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 115/217 (52%), Positives = 154/217 (70%), Gaps = 2/217 (0%)

Query: 694 KVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKK 753
           +VYL+P L GMRY+SFGRHFT+  KLQ+IVD+L +YVN GDM+VD CCG+N+FS LM  K
Sbjct: 1   RVYLSPMLFGMRYSSFGRHFTRTYKLQSIVDQLSYYVNPGDMVVDLCCGSNEFSRLMSDK 60

Query: 754 LDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANK 813
           L    K+C +KN+D +   + F+FE++D+M ++P+EL  G +LIMGLNPPFG +A LA+K
Sbjct: 61  LTAMKKDCRFKNFDFIQPSDTFHFERKDYMEIQPEELCSGDKLIMGLNPPFGFRAQLASK 120

Query: 814 FINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMD 873
           F+   ++F PK+++LI P ET R D +   YEL WED   L G +FYLPGSVD   K + 
Sbjct: 121 FVEHCVKFQPKIIVLIAPEETRRPDGR--YYELTWEDKDLLRGDAFYLPGSVDAEGKTIK 178

Query: 874 QWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSR 910
           Q N   P LYLWSR D+A  H+ +A +HGH+ R   R
Sbjct: 179 QENNVPPVLYLWSRVDFAEAHRKIARRHGHLRRSSGR 215


>gi|326503094|dbj|BAJ99172.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326526707|dbj|BAK00742.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 338

 Score =  256 bits (653), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 118/217 (54%), Positives = 152/217 (70%), Gaps = 7/217 (3%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN------FFCKN 282
           VC+FCD+GG+L+CC+G CLRSFH T + GE S C SLGL + + E          +FC N
Sbjct: 53  VCAFCDDGGELICCDGGCLRSFHPTKEHGERSMCTSLGLNEAKWEKHKALGEKGLYFCNN 112

Query: 283 CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS 342
           C++KQHQCFACG LGSS+     EVF C    CGHFYHP CV+++L  D++  A    + 
Sbjct: 113 CKHKQHQCFACGLLGSSNLTAEPEVFKCKHRKCGHFYHPGCVAEILYPDNKAKATCFEQC 172

Query: 343 IIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGII 402
           +  G  F+CP+HKC  CK+ ENK D ++QFAVCRRCP AYHRKCLPR I+  +  EE + 
Sbjct: 173 VAGGLEFSCPVHKCKRCKEAENKHDKEMQFAVCRRCPTAYHRKCLPRDISLTEDEEEAMP 232

Query: 403 TRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 439
            RAW+G+LP+ +ILIYC KH+I  E+GTP RDHI+FP
Sbjct: 233 QRAWDGILPD-QILIYCRKHDIQKELGTPKRDHIVFP 268


>gi|357157830|ref|XP_003577927.1| PREDICTED: uncharacterized protein LOC100829433 [Brachypodium
           distachyon]
          Length = 632

 Score =  253 bits (647), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 115/171 (67%), Positives = 137/171 (80%)

Query: 735 MIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGS 794
           MIVDFCCGANDFS LMK+KLD   K C +KNYD++  +N F FE+RDWMTV+  EL  GS
Sbjct: 1   MIVDFCCGANDFSRLMKEKLDLVQKKCHFKNYDLIQPQNTFCFERRDWMTVQRNELPRGS 60

Query: 795 RLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFL 854
           RL+MGLNPPFGVKA LANKFI+KAL FNPKL+ILIVP ET+RLD+K++ Y+LVWED   L
Sbjct: 61  RLVMGLNPPFGVKAALANKFIDKALSFNPKLIILIVPKETKRLDQKKTPYDLVWEDGDCL 120

Query: 855 SGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHIS 905
           +GKSFYLPGSVD N+K +  WN +APPLYLWS  D+   HK +AE+H H S
Sbjct: 121 AGKSFYLPGSVDVNEKIVQGWNASAPPLYLWSHPDWTKKHKKVAEEHNHTS 171


>gi|449474294|ref|XP_004154131.1| PREDICTED: uncharacterized protein LOC101202815, partial [Cucumis
           sativus]
          Length = 248

 Score =  251 bits (640), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 136/254 (53%), Positives = 175/254 (68%), Gaps = 20/254 (7%)

Query: 1   MASSDDEVE-VGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGA 59
           MASSDDE E +    VS+Y+F    E    E ISFS LPI+W+E ER  G +E ++L G 
Sbjct: 1   MASSDDEAETLPVPPVSNYHFVDHKE----EPISFSILPIKWDEGERLEGSQEPVFLHGT 56

Query: 60  ADSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLS 119
            D+GLQK++K V AW+F+++   PEI VLSKEN WI LQKPRK +E+  RTILI VHCL 
Sbjct: 57  VDNGLQKVYKQVTAWRFNISGSKPEISVLSKENVWITLQKPRKSFEDTIRTILITVHCLH 116

Query: 120 YAKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFL 179
             KRNP+ ++KS+WD L++ F LYEVRPSQNDLVDHM LI EA++RDD+LAKS+FL+ FL
Sbjct: 117 ALKRNPDTSSKSLWDLLAKTFSLYEVRPSQNDLVDHMSLIGEAIKRDDILAKSQFLLAFL 176

Query: 180 EEKPTKRKLSDEV----VQTKAMSGFIVDDME---------EDMVHDTEEDESNEEDELF 226
           EEKP KR    E+     Q      FIVDD +         E+   D  E+ES+  D++F
Sbjct: 177 EEKPVKRMSCHELFFQDAQRNMQPSFIVDDSDDEDMVDDAVEEESDDAVEEESD--DDVF 234

Query: 227 DSVCSFCDNGGDLL 240
           DSVC+FCDNGG+++
Sbjct: 235 DSVCAFCDNGGNII 248


>gi|357154941|ref|XP_003576954.1| PREDICTED: uncharacterized protein LOC100825038 [Brachypodium
           distachyon]
          Length = 332

 Score =  249 bits (637), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 119/228 (52%), Positives = 156/228 (68%), Gaps = 8/228 (3%)

Query: 219 SNEEDELFDS--VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML 276
           +  ED L D    C+ CD+GG+L+ CEG CLRSFH     G +S C SLGLT++  + + 
Sbjct: 59  TRREDGLSDEWDACAICDDGGELIWCEGGCLRSFHPIKICGLDSMCTSLGLTEEHWQTLH 118

Query: 277 -----NFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRD 331
                 + CKNCE KQHQCFACG LGSSD  +G+EVF C    CGHFYHP CV++LL  D
Sbjct: 119 ANKQEKYICKNCENKQHQCFACGLLGSSDLISGSEVFKCKEKNCGHFYHPKCVAELLYPD 178

Query: 332 DEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKI 391
            +  A    + + +G  F C +H+C +CK+ EN+ D+D+Q AVCRRCP AYHRKCLP  I
Sbjct: 179 SKGLAIYFEEHVASGLEFHCLMHRCSLCKEAENRDDTDMQLAVCRRCPTAYHRKCLPSDI 238

Query: 392 AFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 439
           +F +  +EG   RAW+ +LP+ +ILIYC+KHEID ++GTP RDHI+FP
Sbjct: 239 SFIEDNDEGTQQRAWDNILPD-QILIYCMKHEIDKDLGTPKRDHIVFP 285


>gi|242044182|ref|XP_002459962.1| hypothetical protein SORBIDRAFT_02g019320 [Sorghum bicolor]
 gi|241923339|gb|EER96483.1| hypothetical protein SORBIDRAFT_02g019320 [Sorghum bicolor]
          Length = 919

 Score =  249 bits (636), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 111/169 (65%), Positives = 136/169 (80%)

Query: 735 MIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGS 794
           +IVDFCCGANDFS LMK+KLD+  K C +KNYD++  ++ F FEKRDWMTV+P EL  GS
Sbjct: 301 LIVDFCCGANDFSRLMKEKLDKVQKKCNFKNYDLIQPQHRFCFEKRDWMTVQPNELPRGS 360

Query: 795 RLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFL 854
           +LIMGLNPPFGVKA LANKFI+KAL F PKL+ILIVP ET+RLD+K + Y+L+WED + L
Sbjct: 361 QLIMGLNPPFGVKASLANKFIDKALTFKPKLIILIVPKETKRLDQKRTPYDLIWEDSEIL 420

Query: 855 SGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGH 903
           +GKSFYLPGSVD NDK ++ WN +APPLYLWS   +   H  +AE+ GH
Sbjct: 421 AGKSFYLPGSVDLNDKTVEGWNASAPPLYLWSHPHWTKKHMQVAEERGH 469



 Score =  133 bits (334), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 68/157 (43%), Positives = 108/157 (68%), Gaps = 6/157 (3%)

Query: 569 FGKQDNSDRETSRTVTVKPL---RKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILK 625
           FG    S +E  +     PL   RK  +S  P++D ++++R+ +++ +  SS+ ++++ +
Sbjct: 54  FGSSPKSVKEEEQESPSLPLSATRKIPLSLFPTVDSETEKRVITILGNKVSSLTIKDVTR 113

Query: 626 RHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQ 685
              +PSTH Y+ +   DK I  GK+E SV+A+  ALKKL+ +G S+ DAKAVCEP+VL Q
Sbjct: 114 NCSLPSTHVYSGR-LTDK-IPQGKLERSVQAVEAALKKLE-NGGSVNDAKAVCEPDVLRQ 170

Query: 686 IFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAI 722
           + +W +KL+VY++PF+HG RY+SFGRHFTKV+KL  +
Sbjct: 171 LARWHSKLRVYISPFIHGTRYSSFGRHFTKVEKLVEV 207


>gi|357154938|ref|XP_003576953.1| PREDICTED: uncharacterized protein LOC100824733 [Brachypodium
           distachyon]
          Length = 315

 Score =  243 bits (621), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 112/233 (48%), Positives = 156/233 (66%), Gaps = 6/233 (2%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN----FFCKNCE 284
           VC+ CD GG L+CC+G CLRSFH T   GEES C +LGLT+++ + + +    + CKNC+
Sbjct: 36  VCAICDEGGSLICCDGGCLRSFHPTRKVGEESKCTTLGLTEEQWQILESNDEKYICKNCK 95

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
            KQHQCFACG LGSSD  +G EVF C    C  FYHP CV+++L  D +  A +    + 
Sbjct: 96  SKQHQCFACGLLGSSDLTSGTEVFQCKDKQCARFYHPKCVAEVLYPDSKSRALRFGNDVA 155

Query: 345 AGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITR 404
           +G  F CP+H+C +CK+  N+ D D++ AVC RCP AYHR CLPR+I+F +  E+GI  R
Sbjct: 156 SGLEFHCPMHRCSLCKEAGNRDDKDMRLAVCGRCPTAYHRTCLPREISFRENKEKGIQKR 215

Query: 405 AWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSL 457
           AW+ +LP+    IYC+KH+++ ++ TP RDHI+FP    N     RPR +++L
Sbjct: 216 AWDDVLPDQD-FIYCMKHKMEKDLRTPRRDHIVFPDT-RNPCAAKRPRGQETL 266


>gi|302789842|ref|XP_002976689.1| hypothetical protein SELMODRAFT_175739 [Selaginella moellendorffii]
 gi|300155727|gb|EFJ22358.1| hypothetical protein SELMODRAFT_175739 [Selaginella moellendorffii]
          Length = 209

 Score =  243 bits (621), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 110/210 (52%), Positives = 147/210 (70%), Gaps = 2/210 (0%)

Query: 701 LHGMRYTSFGRHFTKVDKLQAIVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKN 760
           L GMRY+SFGRHFT+  KLQ+IVD+L +YVN GDM+VD CCG+N+FS LM  KL    K+
Sbjct: 2   LFGMRYSSFGRHFTRTYKLQSIVDQLSYYVNPGDMVVDLCCGSNEFSRLMSDKLTAMKKD 61

Query: 761 CLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALE 820
           C +KN+D +   + F+FE++D+M ++P+EL    +LIMGLNPPFG +A LA+KF+   ++
Sbjct: 62  CRFKNFDFIQPSDTFHFERKDYMEIQPEELCSSDKLIMGLNPPFGFRAQLASKFVEHCVK 121

Query: 821 FNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNMTAP 880
           F PK+++LI P ET R D +   YEL WED   L G +FYLPGSVD   K + Q N   P
Sbjct: 122 FQPKIIVLIAPEETRRPDGR--YYELTWEDKDLLRGDAFYLPGSVDAEGKTIKQENNVPP 179

Query: 881 PLYLWSRHDYAAHHKALAEKHGHISRPQSR 910
            LYLWSR D+A  H+ +A +HGH+ R   R
Sbjct: 180 VLYLWSRVDFAEAHRKIARRHGHLRRSSGR 209


>gi|414884838|tpg|DAA60852.1| TPA: hypothetical protein ZEAMMB73_000278 [Zea mays]
          Length = 589

 Score =  221 bits (562), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 125/279 (44%), Positives = 167/279 (59%), Gaps = 34/279 (12%)

Query: 750 MKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAG 809
           MK+KLD+  K C +KNYD++  +N F FEKRDWMTV+P EL  GS+LIMGLNPPFGVKA 
Sbjct: 1   MKEKLDKVQKKCHFKNYDLIQPQNRFCFEKRDWMTVQPNELPHGSQLIMGLNPPFGVKAS 60

Query: 810 LANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDEND 869
           LANKFI+KAL F PKL++LIVP ET+RLD++ + Y+L+WED + LSGKSFYLPGSVD ND
Sbjct: 61  LANKFIDKALTFKPKLIVLIVPKETKRLDQRRTPYDLIWEDSECLSGKSFYLPGSVDLND 120

Query: 870 KQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERNCYETHAVDHPKEE 929
           K ++ WN +APPLYLWS   +   H  +AE+HGH    +    +E    E++  D     
Sbjct: 121 KIVEGWNASAPPLYLWSHPGWTKKHMQVAEEHGHTCMGKIACHVE----ESNLSDDLSVR 176

Query: 930 GQGDASMLID------LPLQINVTKEL-----RNEAREDD-------------KAGFPDN 965
            Q DA+  ++       P    ++  L     R +AR ++             KA  PDN
Sbjct: 177 KQSDATNKLNSRSGKETPFNDRMSNLLDDGSARKQARSEEEKRTLGKIEHHVKKANIPDN 236

Query: 966 ---ATEGGGES---SHGHGDNQSGKTSRKRKRDRKKHGS 998
                E    S   SH   D + G+ +R+R  +R   G+
Sbjct: 237 LPVKKEPEASSKVMSHSGKDRKIGRYNRRRHDNRSSKGT 275


>gi|414588818|tpg|DAA39389.1| TPA: hypothetical protein ZEAMMB73_973048 [Zea mays]
          Length = 591

 Score =  217 bits (553), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 97/155 (62%), Positives = 122/155 (78%)

Query: 750 MKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAG 809
           MK+KLD+  K C +KNYD++  ++ F FEKRDWMTV+P EL  GS+LIMGLNPPFGVKA 
Sbjct: 1   MKEKLDKVQKKCHFKNYDLIQPQHHFCFEKRDWMTVQPNELPRGSQLIMGLNPPFGVKAS 60

Query: 810 LANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDEND 869
           LANKFI+KAL F PKL++LIVP ET+RLD+K+  Y+L+WED + L+GKSFYLPGSVD ND
Sbjct: 61  LANKFIDKALTFKPKLIVLIVPKETKRLDKKKIPYDLIWEDSECLAGKSFYLPGSVDLND 120

Query: 870 KQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHI 904
             ++ WN +APPLYLWS   +   H  +AE+HGH 
Sbjct: 121 NTVEGWNASAPPLYLWSHPHWTKKHMQVAEEHGHT 155


>gi|357140293|ref|XP_003571704.1| PREDICTED: uncharacterized protein LOC100832660 [Brachypodium
           distachyon]
          Length = 1028

 Score =  210 bits (535), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 111/276 (40%), Positives = 158/276 (57%), Gaps = 11/276 (3%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           +C+ CDNGG LL CEGRC R+FH     G ES+C +LG +  +++ + ++ CKNCE++QH
Sbjct: 263 LCAMCDNGGKLLSCEGRCKRAFHPREKDGRESNCDTLGFSSAQLQEIGHYLCKNCEFQQH 322

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
           QCF C  L  SD    A+VF C  A CGHFYHP C++KLL   D   A +L K I AG +
Sbjct: 323 QCFKCEDLEPSDGPN-AKVFQCYKACCGHFYHPSCIAKLLEPHDNDGACELEKRIAAGMA 381

Query: 349 FTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEG 408
           FTCP H C  C + E+  +  L  AVCRRCP +         I+FED  EE    ++   
Sbjct: 382 FTCPAHWCSKCGKMEDHTEIPLWLAVCRRCPIS--------NISFED-FEEKDGRKSIRA 432

Query: 409 LLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPS-GKQKVA 467
              + RI+IYC +HEI+D+IGTP RDHI  P I+++ TI         L+S +   + +A
Sbjct: 433 WHLDDRIIIYCRQHEIEDKIGTPRRDHIKLPSIKDSGTIGTHCMDHMKLSSTTESTRDLA 492

Query: 468 STKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEI 503
             K+ +T K        +   +++ +K  Q +  +I
Sbjct: 493 KKKAQVTGKRKMNTDQGLTGTEKLSNKISQEKAGQI 528



 Score =  116 bits (290), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 63/174 (36%), Positives = 94/174 (54%), Gaps = 5/174 (2%)

Query: 15  VSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKEL-IYLRGAADSGLQKIFKPVIA 73
           V  Y+FE+      +E +S S LP++++E +         + L G AD+GL  +   V+A
Sbjct: 15  VDKYHFEN----GMQEPVSLSVLPLKFDEIDEVACLDSTKVDLHGFADNGLHTVHAKVVA 70

Query: 74  WKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSYAKRNPEATAKSIW 133
           W+       P+I VLS E  WI+L KPRKCYE+  R++LI V  L + K+ P    +S+ 
Sbjct: 71  WRVGFDYEQPKITVLSSEGYWIELLKPRKCYEKTVRSVLISVQMLHFVKKWPRKENRSLL 130

Query: 134 DFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLEEKPTKRK 187
           + L   F  ++++PS+ DL     +IK   ERD  L KSK L   +E  P K K
Sbjct: 131 NHLCEVFNKFDIKPSERDLKKQHSIIKFFAERDPTLMKSKILQRIMENTPRKTK 184


>gi|218200975|gb|EEC83402.1| hypothetical protein OsI_28848 [Oryza sativa Indica Group]
          Length = 1266

 Score =  210 bits (534), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 134/362 (37%), Positives = 195/362 (53%), Gaps = 36/362 (9%)

Query: 15  VSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKEL--IYLRGAADSGLQKIFKPVI 72
           V  YYFE E E K    + FS LPI+++ENE    C +   + LRG  D+  + +FK VI
Sbjct: 15  VDIYYFE-ESEDKP---VCFSVLPIKFDENEEVS-CSDYKELNLRGFTDNN-RHVFKKVI 68

Query: 73  AWKFDLTNVIPEIFVLSKENSWIKLQKPRKCY-EEIYRTILIMVHCLSYAKRNPEATAKS 131
           AW+ DL    P+I VLS +  WI+L KPR CY E+  R+ILI V  L + ++ P+   +S
Sbjct: 69  AWRVDLDCQRPKISVLSSDGKWIELLKPRMCYYEKKVRSILITVQMLHFVRKCPKKQQRS 128

Query: 132 IWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLEEKPTKRKLSDE 191
           ++D L   F  +   P+ +D+  H  LIK  +ERD +L KSK +  F+E+   K   + E
Sbjct: 129 LFDHLGEVFNKFGPVPNGDDIKKHHHLIKLFMERDPILVKSKIIRCFIEDASRK---AIE 185

Query: 192 VVQTKAMS--GFIVDD-------------------MEEDMVHDTEEDESNEEDELFDSVC 230
            +  +A +   FIV D                     +D   +  + + N  D+  D +C
Sbjct: 186 PIPRRACTEDRFIVSDESLESSDDCSYSDSDSSDDNNDDSTDEDTDTDGNATDDDTDMIC 245

Query: 231 SFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN--FFCKNCEYKQH 288
           + CD+GG LL CEG C RSFHA +  G +S C +L  T  E++   +  F C+NCE+ +H
Sbjct: 246 AICDDGGKLLSCEGPCKRSFHARVKHGRKSKCRTLRFTSVELKLKESGTFLCENCEHNEH 305

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
           QCF CG+L  S     A+VF C   +CGHFYHP C++ LL  +D   A +L + I     
Sbjct: 306 QCFKCGELEPS-YGPNAKVFQCNKESCGHFYHPKCIAVLLEPEDTDGACKLEEMIADAIM 364

Query: 349 FT 350
           F+
Sbjct: 365 FS 366



 Score =  164 bits (414), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 82/208 (39%), Positives = 119/208 (57%), Gaps = 21/208 (10%)

Query: 719 LQAIVDKLHWYVNDGDM---------------------IVDFCCGANDFSCLMKKKLDET 757
           L  I DKLHWYV  G+                      IVD C   ++FS LMK+KL++ 
Sbjct: 567 LVQIADKLHWYVQPGNTESCGKLPDGVLLERSIVVVREIVDLCFNMDNFSRLMKEKLEDV 626

Query: 758 GKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINK 817
            K C +KNYD+   KN+  F++ +W+T++PK+L  G  L+M L+ P  V+A  ANKF++K
Sbjct: 627 SKGCNFKNYDLFQHKNNLCFDESNWVTMQPKDLPHGLNLVMVLDLPLDVQAVSANKFVDK 686

Query: 818 ALEFNPKLLILIVPPETERLDRKESAYELVWEDDQFLSGKSFYLPGSVDENDKQMDQWNM 877
            L F P+L+I++ P + ERLD KE  Y L+WE +Q + GK  Y PG VD ND   +  ++
Sbjct: 687 VLTFKPRLIIIVSPTDIERLDCKEEPYHLIWEYNQHIFGKPLYQPGDVDVNDDAKNGLHV 746

Query: 878 TAPPLYLWSRHDYAAHHKALAEKHGHIS 905
               + LWS  D    +  +A KHGH++
Sbjct: 747 IPLCVSLWSFPDKTEENMRIAGKHGHLN 774



 Score = 42.0 bits (97), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 47/161 (29%), Positives = 71/161 (44%), Gaps = 34/161 (21%)

Query: 354 HKCCICKQGENKADSDLQFAVCRR--CPKAYHRKCLPRKIAFED-----KLEEGI---IT 403
           H+C  C + E     + +   C +  C   YH KC+   +  ED     KLEE I   I 
Sbjct: 305 HQCFKCGELEPSYGPNAKVFQCNKESCGHFYHPKCIAVLLEPEDTDGACKLEEMIADAIM 364

Query: 404 RAWEGLLPNHRILIYCLK---------HEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKK 454
            +W      H+++ +  K         H ID   GTP  +HI FP I + K   +  +K 
Sbjct: 365 FSW------HQMVFFTAKFPLKQGQRHHGIDATTGTPSGEHIKFPSIPKIKKTKNLSKKD 418

Query: 455 QSLASPSGKQKVA----STKSSLTSKAPPQGKFSVKALKRV 491
              A  +GK+K +    STKS+  +   P+G+   K  +RV
Sbjct: 419 ---AKVTGKRKKSGNKFSTKSTKLANVSPRGE--SKQTRRV 454


>gi|449470764|ref|XP_004153086.1| PREDICTED: uncharacterized protein LOC101206575, partial [Cucumis
           sativus]
          Length = 203

 Score =  209 bits (533), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 104/180 (57%), Positives = 133/180 (73%), Gaps = 5/180 (2%)

Query: 1   MASSDDEVE-VGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGA 59
           MASSDDE E +    VS+Y+F    E    E ISFS LPI+W+E ER  G +E ++L G 
Sbjct: 1   MASSDDEAETLPVPPVSNYHFVDHKE----EPISFSILPIKWDEGERLEGSQEPVFLHGT 56

Query: 60  ADSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLS 119
            D+GLQK++K V AW+F+++   PEI VLSKEN WI LQKPRK +E+  RTILI VHCL 
Sbjct: 57  VDNGLQKVYKQVTAWRFNISGSKPEISVLSKENVWITLQKPRKSFEDTIRTILITVHCLH 116

Query: 120 YAKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFL 179
             KRNP+ ++KS+WD L++ F LYEVRPSQNDLVDHM LI EA++RDD+LAKS+ + TF+
Sbjct: 117 ALKRNPDTSSKSLWDLLAKTFSLYEVRPSQNDLVDHMSLIGEAVKRDDILAKSQAIETFI 176



 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 24/34 (70%), Positives = 29/34 (85%)

Query: 273 EAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAE 306
           +A+  F CKNCEYKQHQC+ACG LGSSD+ +GAE
Sbjct: 170 QAIETFICKNCEYKQHQCYACGNLGSSDQSSGAE 203


>gi|222616830|gb|EEE52962.1| hypothetical protein OsJ_35606 [Oryza sativa Japonica Group]
          Length = 312

 Score =  208 bits (529), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 102/215 (47%), Positives = 130/215 (60%), Gaps = 27/215 (12%)

Query: 226 FDSVCSFCDNGGDLLCCEGRCLRSFHAT-IDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
           ++ VCS CD+GGD LCCEGRCLRSFHAT     E + C +LGLT+++ +           
Sbjct: 42  WEDVCSICDDGGDFLCCEGRCLRSFHATEKHITEINQCTTLGLTEEQWKIF--------- 92

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
                           ++   +VF C    CGHFYHP C+++LL  D  +      + + 
Sbjct: 93  ----------------RQNDEKVFQCEDEKCGHFYHPKCLARLLYPDSSMQPLNFEEEVA 136

Query: 345 AGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITR 404
            G  F CP+HKC +CK GENK D + QFAVCRRCP  YHRKCLP  I FED    GI+ R
Sbjct: 137 RGLKFLCPVHKCHVCKGGENKNDMENQFAVCRRCPTVYHRKCLPSDIVFEDDESNGIVQR 196

Query: 405 AWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 439
           AW+ +L   +ILIYC+KHEID E+GTP R+HIIFP
Sbjct: 197 AWDDVL-RDQILIYCMKHEIDPELGTPARNHIIFP 230


>gi|449534314|ref|XP_004174109.1| PREDICTED: uncharacterized protein LOC101225204 [Cucumis sativus]
          Length = 189

 Score =  204 bits (519), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 102/176 (57%), Positives = 129/176 (73%), Gaps = 5/176 (2%)

Query: 1   MASSDDEVE-VGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGA 59
           MASSDDE E +    VS+Y+F    E    E ISFS LPI+W+E ER  G +E ++L G 
Sbjct: 1   MASSDDEAETLPVPPVSNYHFVDHKE----EPISFSILPIKWDEGERLEGSQEPVFLHGT 56

Query: 60  ADSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLS 119
            D+GLQK++  V AW+F+++   PEI VLSKEN WI LQKPRK +E+  RTILI VHCL 
Sbjct: 57  VDNGLQKVYNQVTAWRFNISGSKPEISVLSKENVWITLQKPRKSFEDTIRTILITVHCLH 116

Query: 120 YAKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFL 175
             KRNP+ ++KS+WD L++ F LYEVRPSQNDLVDHM LI EA++RDD+LAKS+ L
Sbjct: 117 ALKRNPDTSSKSLWDLLAKTFSLYEVRPSQNDLVDHMSLIGEAIKRDDILAKSQVL 172


>gi|52354519|gb|AAU44580.1| hypothetical protein AT5G48090 [Arabidopsis thaliana]
          Length = 268

 Score =  200 bits (508), Expect = 5e-48,   Method: Composition-based stats.
 Identities = 99/211 (46%), Positives = 136/211 (64%), Gaps = 9/211 (4%)

Query: 163 LERDDVLAKSKFLVTFL----EEKPTKRKLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDE 218
           ++RD+ L KSKFL+TFL    +  P + +L  + +Q  A +    D + E+M  +     
Sbjct: 1   MKRDEDLTKSKFLITFLGKTSQTTPIEVELPTDHLQ-DAQTPMEQDSIVENMTDEENSSS 59

Query: 219 SNEEDEL---FDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLT-KDEVEA 274
           S+++ E    FD VCS CDNGG +LCCEG CLRSFH TI  G E+ C SLG T K +++A
Sbjct: 60  SDDDSEANLQFDPVCSICDNGGYVLCCEGSCLRSFHPTIADGIETECESLGFTDKTQIQA 119

Query: 275 MLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEV 334
           +  + C NC YKQHQC+ACG+LGSSD+    +VFPC ++ CGHFYHP CV++LL  DD+ 
Sbjct: 120 LGTYLCNNCLYKQHQCYACGELGSSDENFSQQVFPCSASNCGHFYHPECVARLLCADDQN 179

Query: 335 AADQLAKSIIAGESFTCPLHKCCICKQGENK 365
            A++L   I A + F CPLH C +C   E+K
Sbjct: 180 KAEELQAKIAARDCFACPLHTCKLCNMSEDK 210


>gi|242085260|ref|XP_002443055.1| hypothetical protein SORBIDRAFT_08g007180 [Sorghum bicolor]
 gi|241943748|gb|EES16893.1| hypothetical protein SORBIDRAFT_08g007180 [Sorghum bicolor]
          Length = 310

 Score =  196 bits (499), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 100/222 (45%), Positives = 130/222 (58%), Gaps = 36/222 (16%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCAS-LGLTKDEVEAML----NFFCK 281
           D +C+ CD+GGD++CC+G C RSFH      E S C   L L+ ++ + +L    +F CK
Sbjct: 34  DDLCAICDDGGDVICCDGVCQRSFHLADGNSERSRCREILRLSAEQAKMILAADKDFICK 93

Query: 282 NCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAK 341
           NC+Y++HQCFACGKLGSSD  + A+VF C    CGHFYHP CV+KLL  D E  A     
Sbjct: 94  NCKYQEHQCFACGKLGSSDLSSEAKVFQCEVDDCGHFYHPKCVAKLLYPDSEEEATLFEV 153

Query: 342 SIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGI 401
            + A E FTCP+H+C +C+ GENK D ++QFAVCRRCP  YHR CLPR            
Sbjct: 154 HVAAREKFTCPIHECIVCRGGENKNDRNMQFAVCRRCPTTYHRMCLPR------------ 201

Query: 402 ITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEE 443
                              KH I  ++ TP  DHIIFP +++
Sbjct: 202 -------------------KHPIVKKLKTPKWDHIIFPDVKK 224


>gi|297739139|emb|CBI28790.3| unnamed protein product [Vitis vinifera]
          Length = 201

 Score =  195 bits (495), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 97/182 (53%), Positives = 134/182 (73%), Gaps = 4/182 (2%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
           MASSD+E E+    + DYYF       + E ISFS LP+QW+++E   G K +I+LRG+A
Sbjct: 1   MASSDEEGEILPNCIRDYYFVD----YKDEPISFSILPLQWSKDENPDGLKMMIFLRGSA 56

Query: 61  DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSY 120
             GLQKI+K VIAWKF+L++V PEIFVLSK+ +W++LQ PRK ++ I RTIL+ V  L +
Sbjct: 57  YDGLQKIYKQVIAWKFELSSVEPEIFVLSKDKNWMELQSPRKSFQNIVRTILVTVSWLHF 116

Query: 121 AKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLE 180
            KRNPEA+ KS+W+ L + F  YE  PS+NDL+DHM LI+EA++R++ L KSK ++T+L 
Sbjct: 117 VKRNPEASGKSLWNHLLKSFSSYEFEPSENDLLDHMPLIQEAVKREEDLLKSKCMLTYLP 176

Query: 181 EK 182
           EK
Sbjct: 177 EK 178


>gi|168064191|ref|XP_001784048.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664434|gb|EDQ51154.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 541

 Score =  187 bits (476), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 105/240 (43%), Positives = 144/240 (60%), Gaps = 32/240 (13%)

Query: 213 DTEEDESNEEDELFDSVCSFCDNGGD----LLCCEGRCLRSFHATIDAGEESHCASLGLT 268
           ++++D  +E D     VC+FCD+G +    LLCC+G C+RSFH TID+G ++ C +L LT
Sbjct: 261 ESDDDCVDEPD-----VCAFCDDGVEGSERLLCCDGPCMRSFHPTIDSGWQNKCPTLRLT 315

Query: 269 KDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGA----EVFPCVSATCGHFYHPHCV 324
           +  V   + + C NCE  QHQCF CGKLG   K TG+    +VF C    C  FYHP CV
Sbjct: 316 RAAVSVEI-WICPNCEVGQHQCFVCGKLG---KSTGSADHQQVFKCEHRQCKRFYHPACV 371

Query: 325 SKLLLRDDEVAADQLAKSI-IAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYH 383
           +KLL+ D  VA + LA  I +  E+FTCPLHKC  C   E+KAD  L    CRRCP A+H
Sbjct: 372 AKLLVPD--VALNNLACRIQLKFETFTCPLHKCANCNLDEDKADPTLCLIKCRRCPAAWH 429

Query: 384 RKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEE 443
            +CLP          +   ++ W   L + + ++YC KH +D E+ TP R+HI FP +E+
Sbjct: 430 ERCLP----------QACRSQMWP--LEDGKCVMYCGKHRLDPELLTPERNHITFPELEK 477


>gi|168033774|ref|XP_001769389.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679309|gb|EDQ65758.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 707

 Score =  184 bits (468), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 104/233 (44%), Positives = 138/233 (59%), Gaps = 21/233 (9%)

Query: 215 EEDESNEEDELFDSVCSFCDNGGD----LLCCEGRCLRSFHATIDAGEESHCASLGLTKD 270
           +ED ++E D     VC FCD+G +    LLCC+G C+RSFH TID+G ++ C +L LT+ 
Sbjct: 443 DEDWADEPD-----VCVFCDDGVEGSQRLLCCDGPCMRSFHPTIDSGLQNKCKTLRLTRA 497

Query: 271 EVEAMLNFFCKNCEYKQHQCFACGKLG-SSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 329
            + ++  + C NCE  QHQCFACGKLG S++     EVF C    C  FYHP CV+KLL+
Sbjct: 498 AI-SVETWICPNCEVGQHQCFACGKLGKSTESADEQEVFVCDHNRCRRFYHPACVAKLLV 556

Query: 330 RDDEVAADQLAKSI-IAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 388
              E A + LA  I +  E+FTCPLHKC  C   ENK D  L    CRRCP A+H KCLP
Sbjct: 557 --SEAAQNNLACRIQLKLETFTCPLHKCANCNLDENKIDPTLHLVKCRRCPVAWHEKCLP 614

Query: 389 RKIAFEDKLE-EGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPG 440
           +  +F   +    I+ R+    L + +      KH +D E+ TP R+HI FP 
Sbjct: 615 QYGSFNHLISVRSIVCRSQIWPLADGK------KHRLDPELLTPERNHITFPA 661



 Score = 76.6 bits (187), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 79/149 (53%), Gaps = 7/149 (4%)

Query: 32  ISFSALPIQWNENERTGGCKELIYLRGAADSGLQKIFKPVIAWKFDLTNV-IPEIFVLSK 90
           + F+   IQW  ++      E +Y++G  + GLQ++   +  WK +L  V  P+  +   
Sbjct: 180 LPFAIREIQWERSQ-----AEKVYVQGKFN-GLQEVSVEIDLWKLELPIVGKPKFLIHEA 233

Query: 91  ENSWIKLQKPRKCYEEIYRTILIMVHCLSYAKRNPEATAKSIWDFLSRFFCLYEVRPSQN 150
             +WI+L+KPR  Y E  + +++  H L +AK++PE+T  ++W ++ +    ++V+P+Q 
Sbjct: 234 AGAWIELRKPRDFYLETMKVVIVAAHFLIFAKQSPESTKGAVWRYIRKECSEFKVKPNQK 293

Query: 151 DLVDHMDLIKEALERDDVLAKSKFLVTFL 179
           DL     L++  +  D  L  S  +  F 
Sbjct: 294 DLSPAFLLLQAIVTSDPKLQASSVISQFF 322


>gi|242085258|ref|XP_002443054.1| hypothetical protein SORBIDRAFT_08g007170 [Sorghum bicolor]
 gi|241943747|gb|EES16892.1| hypothetical protein SORBIDRAFT_08g007170 [Sorghum bicolor]
          Length = 279

 Score =  164 bits (415), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 99/258 (38%), Positives = 132/258 (51%), Gaps = 51/258 (19%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN---FFCKNC 283
           D VC+ CD+GG + CC+  CLRSFH T + GE S C SLG+  +E + +++   F CKNC
Sbjct: 36  DDVCAICDDGGYVTCCDSGCLRSFHLTEEHGEGSKCPSLGINSEEAKMIIDKKDFICKNC 95

Query: 284 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 343
           +YKQHQC +CG LGSSD  +GAEVF C    CGHFYHP C+SKLL   D++ A       
Sbjct: 96  KYKQHQCSSCGLLGSSDLSSGAEVFQCKEYNCGHFYHPKCISKLLYPGDKLRACHF---- 151

Query: 344 IAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIIT 403
                                  + D+ F   +  P  Y                  I  
Sbjct: 152 -----------------------EHDIPFET-KEGPNGY------------------IFQ 169

Query: 404 RAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTIIDRPRKKQSLASPSGK 463
           RAW+G+L + RILIYC+KHEI  E+G P R  IIFP  +EN  +   P          G+
Sbjct: 170 RAWDGILRD-RILIYCMKHEIVKELGIPRRKLIIFP-YDENLCVPQGPESAPKEQDTLGQ 227

Query: 464 QKVASTKSSLTSKAPPQG 481
           +++    SS  S++ P G
Sbjct: 228 EELLDHPSSEPSQSLPSG 245


>gi|449472830|ref|XP_004153707.1| PREDICTED: uncharacterized protein LOC101217475, partial [Cucumis
           sativus]
          Length = 200

 Score =  153 bits (386), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 85/188 (45%), Positives = 115/188 (61%), Gaps = 14/188 (7%)

Query: 1   MASSDDEVE-VGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGA 59
           MASSDDE E +    VS+Y+F    E    E ISFS LPI+W+E ER  G +E ++L G 
Sbjct: 1   MASSDDEAETLPVPPVSNYHFVDHKE----EPISFSILPIKWDEGERLEGSQEPVFLHGT 56

Query: 60  ADSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLS 119
            D+GLQK++K V AW+F+++   PEI VLSKEN WI LQKPRK +E+  RTILI VHCL 
Sbjct: 57  VDNGLQKVYKQVTAWRFNISGSKPEISVLSKENVWITLQKPRKSFEDTIRTILITVHCLH 116

Query: 120 YAKRNPEATAKSIWDFLSRFF-----CLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKF 174
             KRNP+ ++KS+WD L++ F     C+     ++ D     D     L +++V A   F
Sbjct: 117 ALKRNPDTSSKSLWDLLAKTFSCDGRCMRSFHATEED----GDCFSLGLSKEEVDAIETF 172

Query: 175 LVTFLEEK 182
           +    E K
Sbjct: 173 ICKNCEYK 180



 Score = 97.4 bits (241), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 43/65 (66%), Positives = 54/65 (83%), Gaps = 3/65 (4%)

Query: 242 CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 301
           C+GRC+RSFHAT + G+   C SLGL+K+EV+A+  F CKNCEYKQHQC+ACG LGSSD+
Sbjct: 139 CDGRCMRSFHATEEDGD---CFSLGLSKEEVDAIETFICKNCEYKQHQCYACGNLGSSDQ 195

Query: 302 ETGAE 306
            +GAE
Sbjct: 196 SSGAE 200


>gi|38424059|dbj|BAD01749.1| unknown protein [Oryza sativa Japonica Group]
 gi|38637133|dbj|BAD03387.1| unknown protein [Oryza sativa Japonica Group]
          Length = 657

 Score =  153 bits (386), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 78/182 (42%), Positives = 107/182 (58%), Gaps = 4/182 (2%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
           M SSDD++E   K V +YYF  + +      +SF  LP Q++  E     K+ +YLRG  
Sbjct: 1   MMSSDDDLEPQLKAVENYYFVDDNDVP----VSFDVLPFQFDAAEGVASFKKDVYLRGFT 56

Query: 61  DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHCLSY 120
           D GLQK++K V+AWK  L    PEI VLS E SWI L KPR  YEE  R++LI V  L +
Sbjct: 57  DGGLQKVYKQVVAWKLVLDGDSPEIAVLSTEGSWIALLKPRPSYEETIRSVLITVEMLHF 116

Query: 121 AKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLE 180
            +R P  + K +WD L   F  + VRP ++D  +H +LIK   +RD  LA S+ L  F++
Sbjct: 117 VRRRPTDSEKDMWDHLYGVFERFVVRPLEDDFANHQNLIKLFAQRDPDLANSQVLQVFIK 176

Query: 181 EK 182
           +K
Sbjct: 177 DK 178


>gi|414877686|tpg|DAA54817.1| TPA: hypothetical protein ZEAMMB73_710478 [Zea mays]
          Length = 277

 Score =  144 bits (363), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 93/269 (34%), Positives = 130/269 (48%), Gaps = 64/269 (23%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML---NFFCKNC 283
           ++VC+ CD+GG+L CCEG C RSFH      E +   +LGLT  E + ++   +F CKNC
Sbjct: 39  ENVCAICDDGGNLTCCEGCCQRSFHLD---NEHNCIKTLGLTIQEAKNIIEKEDFICKNC 95

Query: 284 EYKQHQCFACGKLGSSD--KETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAK 341
           +YKQHQCF CG LGSSD    +  +VF C    C  FYHP CV+++L  D +   +    
Sbjct: 96  QYKQHQCFICGSLGSSDDTDTSSQQVFQCEHDDCARFYHPKCVAQMLYPDSQ---ELFEL 152

Query: 342 SIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGI 401
            + AGESFTCP+H+C +CK+ +NK D  L                    + F        
Sbjct: 153 HVAAGESFTCPMHECIVCKEVDNKKDKTL--------------------LDF-------- 184

Query: 402 ITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP---------GIEENKTIIDRPR 452
                        + ++  KHEI  E+ TP R+HI+FP          +   +   D   
Sbjct: 185 -------------VNLFYRKHEIVRELQTPARNHIVFPESNVLKTFQALYAQREAPDHSS 231

Query: 453 KKQSLASP---SGKQKVASTKSSLTSKAP 478
            KQ+ +SP   +  Q      S L S AP
Sbjct: 232 SKQTHSSPPPAARDQNQCYCSSPLCSFAP 260


>gi|384251374|gb|EIE24852.1| hypothetical protein COCSUDRAFT_41130 [Coccomyxa subellipsoidea
            C-169]
          Length = 1073

 Score =  141 bits (356), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 86/278 (30%), Positives = 149/278 (53%), Gaps = 24/278 (8%)

Query: 613  DAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIE 672
            D A+S+R  E   R   P       +++++  IT  +++G V+A R AL           
Sbjct: 766  DLANSLRDVEANSRKPKPY------QNSLEPTITRDRLDGHVKACRGALDN--------P 811

Query: 673  DAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDKLHWYVND 732
             +  +    +++++++ ++ L++ LAP LHG  YTS+GRHFT    L+ +   L  ++  
Sbjct: 812  HSAQLMMGRLVNELYRAEDNLRMVLAPVLHGASYTSYGRHFTLPYLLEEVNKYLLPFMLT 871

Query: 733  GDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAP 792
            GD +VDF CG N +  ++K+     G     + +DI+ AK+  +F K  W+ V+  +L  
Sbjct: 872  GDTVVDFSCGYNVWLPMLKRMGLAEGLEIHGRAFDIITAKDTEDFVKSSWLDVDFGQLPR 931

Query: 793  GSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKESAYELVWEDDQ 852
            G +L++GLNPPFG    LA+KF+  A +F P++++LI PP T         Y ++ E+ +
Sbjct: 932  GDKLVIGLNPPFGKNGCLASKFVTHAAQFQPRVIVLIAPPGT----IIPPGYIVMLEETR 987

Query: 853  FLSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDY 890
             ++G++F+ PG VD+       WN   P   +  R DY
Sbjct: 988  LMAGEAFFKPG-VDKA-----SWNDVPPSTRILIREDY 1019



 Score =  122 bits (306), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 117/469 (24%), Positives = 177/469 (37%), Gaps = 94/469 (20%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESH-----------CASLGLTKDEVEAM 275
           D++C+ CD+GG+L+ C+G C+R+FHA I   EE             C  L +  +     
Sbjct: 185 DTICTKCDDGGELILCDGGCMRAFHAGIKPAEEEEEGPVYVDDPGSCNPLQIGAELFMRF 244

Query: 276 LN----FFCKNCEYKQHQCFACG--KLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 329
            +    F C NC   +HQCFAC              EVF C    CG FYH  C++K   
Sbjct: 245 DSTNEPFMCPNCHSGKHQCFACKVEGSVGGSSSGLPEVFKCTVGDCGRFYHARCLNK--- 301

Query: 330 RDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPR 389
                  D    +I     F CPLH C  C +GEN    ++    CRRCP AYH  CLP+
Sbjct: 302 -------DPQEMAIF----FVCPLHTCKRCNEGENAEKGEV--ISCRRCPVAYHAGCLPK 348

Query: 390 KIAFEDKLEEGIITRAW-------EG-----LLPNHRI----------LIYCLKHEIDDE 427
           KI          +TR W       EG      LP  +           L+YC++H++   
Sbjct: 349 KIL------NSPVTRVWITRKLDKEGNVLPPPLPRTKAVEPECDIGQSLLYCMRHDMALG 402

Query: 428 IGTPIRDHII---------------FPGIEENKTIIDRPRKKQSLASPSGKQKVASTKSS 472
             TP+   +                FP +  ++ I+DR   +        KQ   + +++
Sbjct: 403 DATPVMGELFSPQLRRLWAKAQATKFPDLLWSRDIMDRFAARAERRRQQAKQNAQAERAA 462

Query: 473 LTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVK 532
             ++A  Q   +    +  P KA   +    +    V      R K       S    V 
Sbjct: 463 --TQASKQAGVARAVPQAAPKKAAVSKGAAPAAGAAVAVPKRPRTKPQPAPAASAAAAVA 520

Query: 533 SLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVK--PLRK 590
                V    ++D  +  L  R+ A          +  +Q    +     V  +  P   
Sbjct: 521 RKPAPV----AIDDNEIPLAVRIAAG---------RKAQQQRPQKPAPIVVEAEYIPFED 567

Query: 591 KLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKS 639
                +P++   S  +   + KD+AS+ R  E  KR   P T   AS +
Sbjct: 568 TPAEPVPAVRPPSPVK-PPIKKDSASAPRQSEHEKRLPAPDTRGDASAA 615


>gi|296082727|emb|CBI21732.3| unnamed protein product [Vitis vinifera]
          Length = 88

 Score =  139 bits (349), Expect = 2e-29,   Method: Composition-based stats.
 Identities = 64/88 (72%), Positives = 77/88 (87%)

Query: 783 MTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERLDRKES 842
           MTV+P EL  GS LIMGLNPPFGV+A LANKFI++ALEF PKLL+LIVP ET+RLD+K+ 
Sbjct: 1   MTVKPSELPTGSHLIMGLNPPFGVQAALANKFIDQALEFKPKLLVLIVPQETQRLDKKKL 60

Query: 843 AYELVWEDDQFLSGKSFYLPGSVDENDK 870
            Y+LVW++D+ LSGKSFYLPGS+D NDK
Sbjct: 61  PYDLVWKEDEELSGKSFYLPGSIDVNDK 88


>gi|242085262|ref|XP_002443056.1| hypothetical protein SORBIDRAFT_08g007190 [Sorghum bicolor]
 gi|241943749|gb|EES16894.1| hypothetical protein SORBIDRAFT_08g007190 [Sorghum bicolor]
          Length = 232

 Score =  137 bits (346), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 83/217 (38%), Positives = 110/217 (50%), Gaps = 50/217 (23%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN---FFCKNCEY 285
           VC+ CD+GG+L CCEG C R+FH  +D   +    +LGLT ++ + ++    F CKNC+Y
Sbjct: 41  VCAICDDGGNLTCCEGCCQRAFH--LDNKHDCI-KTLGLTIEDAKNIIEKEAFICKNCQY 97

Query: 286 KQHQCFACGKLGSSD---KETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS 342
           KQHQC+ CG LGSSD     + AEVF C    C  FYHP CV+++L  D     +     
Sbjct: 98  KQHQCYICGSLGSSDDTDTSSQAEVFQCEHDDCARFYHPKCVAQMLYPDSHHRQELFELQ 157

Query: 343 IIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGII 402
           + AGE F+CP+H+C +CK+ ENK                                     
Sbjct: 158 VAAGERFSCPMHECIVCKEVENKT------------------------------------ 181

Query: 403 TRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFP 439
               E  L N   L Y  KHEI  ++ TP RDHIIFP
Sbjct: 182 ----EKPLLNFVNLFY-RKHEIVRKLQTPARDHIIFP 213


>gi|242044180|ref|XP_002459961.1| hypothetical protein SORBIDRAFT_02g019310 [Sorghum bicolor]
 gi|241923338|gb|EER96482.1| hypothetical protein SORBIDRAFT_02g019310 [Sorghum bicolor]
          Length = 243

 Score =  135 bits (340), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 85/244 (34%), Positives = 131/244 (53%), Gaps = 7/244 (2%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
           M S DD+ E     V  YYF    +  + E + FS LP  + + +    CK+ + LRG  
Sbjct: 2   MMSDDDDAEPQLNAVESYYFVDSTD--ENEAVCFSNLPFLFGDTDDLPDCKKRLVLRGTG 59

Query: 61  DSGLQKIFKPVIAWKFDLTNVIPEIFVLSKE-NSWIKLQKPRKCYEEIYRTILIMVHCLS 119
           D G+ K++  V+AW+  L    P+  VL+ E   W++L +P K YEE+ RT+LI    L 
Sbjct: 60  DPGV-KVYNVVVAWRLGLEGKQPQFAVLTAEGRRWVRLIRPLKSYEEMIRTLLITAQMLH 118

Query: 120 YAKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFL 179
           + +R P    K++W+ L + F  ++VRPS++D  +H  L+KE   +D VL KS+ L  F+
Sbjct: 119 FLRRKPHEPEKTLWNHLCKVFNKFDVRPSEDDFRNHRSLMKEFAAKDPVLVKSEILQVFV 178

Query: 180 EEKPTKRKLSDEVVQTKAMSGFIVDDMEED---MVHDTEEDESNEEDELFDSVCSFCDNG 236
           E +  K               FI DD + D   +     E +  E+++LFDS+C+ CDNG
Sbjct: 179 EGRSRKNFTEVGADHIDVKQPFIADDEDIDEMIVEDADNESDEEEDEDLFDSICAICDNG 238

Query: 237 GDLL 240
           GD+L
Sbjct: 239 GDIL 242


>gi|108707357|gb|ABF95152.1| hydroxyproline-rich glycoprotein family protein, putative [Oryza
           sativa Japonica Group]
 gi|125585711|gb|EAZ26375.1| hypothetical protein OsJ_10258 [Oryza sativa Japonica Group]
          Length = 256

 Score =  134 bits (338), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 81/203 (39%), Positives = 111/203 (54%), Gaps = 39/203 (19%)

Query: 212 HDTEEDESNEEDELF---DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLT 268
           H ++ED  N +  +      +C+ CD+GG+L+ CEG C R FH TI    + +C +L ++
Sbjct: 5   HVSKEDSKNYKLRMGFVDGHICAICDDGGNLIRCEGACRRYFHRTISNDADFNCETLNMS 64

Query: 269 KDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETG-AEVFPCVSATCGHFYHPHCVSKL 327
           +++VE+   F CKNC YKQHQCF CG+LGSSD  +G AEV+ C  + C  FYHP C+++ 
Sbjct: 65  QEQVESS-KFICKNCVYKQHQCFGCGELGSSDMSSGSAEVYQCSKSRCRRFYHPKCLAEF 123

Query: 328 LLRDDEVAADQLAKSIIAGESFTCPLHKCCICK-QGE--------NKA------------ 366
                         S      F CPLH+C  CK +GE        NK             
Sbjct: 124 -------------DSSKNPPVFECPLHECFACKNKGEKYNEETCKNKGQESIKKKQGAEN 170

Query: 367 DSDLQFAVCRRCPKAYHRKCLPR 389
           +  +  A+CRRCP AYHRKCLPR
Sbjct: 171 NKKMHLALCRRCPIAYHRKCLPR 193


>gi|449532244|ref|XP_004173092.1| PREDICTED: uncharacterized protein LOC101229951 [Cucumis sativus]
          Length = 77

 Score =  133 bits (334), Expect = 7e-28,   Method: Composition-based stats.
 Identities = 59/69 (85%), Positives = 64/69 (92%)

Query: 772 KNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVP 831
           +NDFNFEKRDWMTV+PKE   GS+LIMGLNPPFGVKA LANKF++KALEFNPKLLILIVP
Sbjct: 9   QNDFNFEKRDWMTVQPKEFPKGSQLIMGLNPPFGVKAALANKFVDKALEFNPKLLILIVP 68

Query: 832 PETERLDRK 840
           PETERLD K
Sbjct: 69  PETERLDEK 77


>gi|414884832|tpg|DAA60846.1| TPA: hypothetical protein ZEAMMB73_559074 [Zea mays]
          Length = 243

 Score =  132 bits (333), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 83/244 (34%), Positives = 130/244 (53%), Gaps = 7/244 (2%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
           M S DD+ E     V  YYF     + +KE + FS LP  + + +     K+ + LRG  
Sbjct: 2   MMSDDDDAEPQLNAVEGYYFVDS--KNEKEPVCFSTLPFWFGDTDDLPDGKKKLVLRGTG 59

Query: 61  DSGLQKIFKPVIAWKFDLTNVIPEIFVLS-KENSWIKLQKPRKCYEEIYRTILIMVHCLS 119
           D G+ K++  V+A +  L    PE  VL+ K   WI+L +P   YEE+ RT+LI    L 
Sbjct: 60  DPGV-KVYDEVVACRLGLEGKQPEFAVLTAKGRRWIRLIRPLNSYEEMIRTVLITAQMLH 118

Query: 120 YAKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFL 179
           + +R P    K++W+ L + F  ++V PS++D  +H  L+K+   +D VLAKS+ L  F+
Sbjct: 119 FLRRKPHEPEKTLWNHLCKVFNKFDVPPSEDDFRNHCSLMKQFAAKDPVLAKSEILRVFI 178

Query: 180 EEKPTKRKLSDEVVQTKAMSGFIVDDM---EEDMVHDTEEDESNEEDELFDSVCSFCDNG 236
           E K  K+              FI DD    E ++     E + ++++ LFDS+C+ CD+G
Sbjct: 179 EGKSRKKFTEVGADHIDVKQPFITDDEGIDEMNVEDVDNESDEDDDEYLFDSICAICDDG 238

Query: 237 GDLL 240
           GD+L
Sbjct: 239 GDIL 242


>gi|168001433|ref|XP_001753419.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695298|gb|EDQ81642.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 593

 Score =  132 bits (331), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 95/254 (37%), Positives = 134/254 (52%), Gaps = 37/254 (14%)

Query: 215 EEDESNEEDELFDS-VCSFCD----NGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTK 269
           E DE  +++ + +  VC+FCD    NG  LLCCEG C+RSFH TI++G ++ C +LGL+ 
Sbjct: 359 ETDEVGDDNWIDEPDVCAFCDDGVENGEKLLCCEGLCMRSFHPTINSGLQNKCRTLGLSP 418

Query: 270 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGA------EVFPCVSATCGHFYHPHC 323
             V    ++ C NCE  QHQCFACGKLGSS   + +      EV  C +  C  FYHP C
Sbjct: 419 AAVNVK-SWICPNCEAGQHQCFACGKLGSSSSGSSSNSMGLLEVVVCGAKYCQRFYHPDC 477

Query: 324 VSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYH 383
           V KLL+ + E   D   +  ++ E+  CPLHKC   +  E+   +       RR    Y 
Sbjct: 478 VVKLLVPEAE-QKDLAMRIRLSLETLICPLHKC---RMPESSVAT-------RRWQVCYV 526

Query: 384 RKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEE 443
            +           + +G IT +  GL  N  + +   +H ID E+ TP RDHI+FP  + 
Sbjct: 527 LR----------GIGDGGITDS--GLAFNGSLWVSS-RHRIDPELLTPERDHILFPPPKP 573

Query: 444 NKTIIDRPRKKQSL 457
            +T   RPR  Q +
Sbjct: 574 GRT-TSRPRVTQPV 586



 Score = 50.4 bits (119), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 47/81 (58%)

Query: 92  NSWIKLQKPRKCYEEIYRTILIMVHCLSYAKRNPEATAKSIWDFLSRFFCLYEVRPSQND 151
            S ++L+KP   Y E  + +L+  + L++AK+NP ++   +W F+ +    ++ +PS ND
Sbjct: 128 TSRMELRKPHDFYLETMKVVLVAAYFLTFAKQNPNSSKGDVWRFVRKQCSGFKAKPSPND 187

Query: 152 LVDHMDLIKEALERDDVLAKS 172
           L     L+K  ++ D +L +S
Sbjct: 188 LQQAYLLLKSVVKSDLILQES 208


>gi|307105371|gb|EFN53620.1| hypothetical protein CHLNCDRAFT_58419 [Chlorella variabilis]
          Length = 819

 Score =  123 bits (308), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 73/219 (33%), Positives = 102/219 (46%), Gaps = 44/219 (20%)

Query: 226 FDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN------FF 279
           FD+ C+ CD+GG+LL C+G C+RSFH +    E+  C  L L  D  + + +      F 
Sbjct: 386 FDTQCALCDDGGNLLACDGPCMRSFHVS---AEDDGCNLLCLPPDLAQTLTSAANSETFL 442

Query: 280 CKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 339
           C NC    HQC+ C + G    + G EVF C++A+CG  YHP C                
Sbjct: 443 CPNCLAGVHQCYVCKREG----KAGTEVFKCINASCGRHYHPECA--------------- 483

Query: 340 AKSIIAGESFTCPLHKCCICKQ-GENKADSDL-QFAVCRRCPKAYHRKCLPRKIAFEDKL 397
                AG+ F C LH C  C Q G+  A     +   CRRCP A+HR+CLP  I      
Sbjct: 484 --GAEAGQPFVCKLHSCHACGQPGDASAPGQAGELVPCRRCPVAFHRRCLP--IELPRHR 539

Query: 398 EEGI-ITRAW---------EGLLPNHRILIYCLKHEIDD 426
           + G  + R W         + L    + L+YC  H + +
Sbjct: 540 DGGASLARVWLADYEDESGKWLDGVEQSLLYCRNHSLGE 578



 Score = 47.4 bits (111), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 56/113 (49%), Gaps = 12/113 (10%)

Query: 615 ASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIE-- 672
           A S+  + + +  + P  +AY  K  +D+A    K++    +++ AL +   +   ++  
Sbjct: 696 APSLTKDSLWRATRRPEPYAYPMKKTIDEA----KLQALERSVQYALARRLPEAELLQLL 751

Query: 673 DAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLQAIVDK 725
           DA+AV        +    + L + LAP+LH  RY+S+GRHFT    L    D+
Sbjct: 752 DAQAV------RSVLSHYDDLLMVLAPYLHENRYSSYGRHFTSHRLLHTAADR 798


>gi|449473710|ref|XP_004153960.1| PREDICTED: uncharacterized protein LOC101211308, partial [Cucumis
           sativus]
          Length = 131

 Score =  119 bits (299), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 56/92 (60%), Positives = 76/92 (82%), Gaps = 1/92 (1%)

Query: 599 LDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIR 658
           LD +S+RRL  +MK+ ASS+ +E+++K+HK+PSTHAY+ K  VDK I +GK+EGSV A+R
Sbjct: 41  LDANSERRLMDMMKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSVVAVR 100

Query: 659 TALKKLDVDGSSIEDAKAVCEPEVLSQIFKWK 690
            AL+KL+ +G  IEDA+AVCEPEVL+ IFKWK
Sbjct: 101 AALRKLE-EGCCIEDAEAVCEPEVLNHIFKWK 131


>gi|38424052|dbj|BAD01742.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|38637126|dbj|BAD03380.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 645

 Score =  109 bits (273), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 169/741 (22%), Positives = 277/741 (37%), Gaps = 167/741 (22%)

Query: 29  KEIISFSALPIQWNENERTGGCKELIYLRGAADSGLQKIFKPVIAWKFDLTNVIPEIFVL 88
           ++++ FS LP  ++E  R  G ++ +Y+RG    G Q ++K + AW+ +L    P I VL
Sbjct: 27  EDLVCFSILPFWFDEEHRLLGSEKGVYIRGIDCEGRQPVYKRIEAWRVELDCEKPMISVL 86

Query: 89  SKENSWIKLQKPRKCYEEIYRTILIMVHCLSYAKRNPEATAKSIWDFLSRFFCLYEVRPS 148
           S E +WI+L  P   Y E                           D   +FF      P 
Sbjct: 87  SSEGNWIRLLNPHPGYAE---------------------------DIARKFFT---KPPE 116

Query: 149 QNDLVDHMDLIKEALERDDVLAKSKFLVTFLEEKPTKRKLSDEVVQTKAMSGFIVDDMEE 208
            +DL     LIK  ++ D  L KSK L   LE+  T   +  E+ +      FIV     
Sbjct: 117 VDDLRKSYSLIKYFMKMDQTLVKSKILRRPLED--TWIIIETELRKASPNESFIV----- 169

Query: 209 DMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLT 268
                + E  +N +D   D      D+ GD                         + G  
Sbjct: 170 -----SNESSTNCDDYTEDGSSGSSDDNGDNY-----------------------TDGNA 201

Query: 269 KDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLL 328
            D+ +A+++                 K  S  K   + VF C +A+CG FYHP CV+ LL
Sbjct: 202 SDDEKALVS----------------EKTSSKKKYNTSLVFQCCNASCGRFYHPKCVAGLL 245

Query: 329 LRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 388
             D    A  L K I  G +FTCP+H C  CKQ E+                        
Sbjct: 246 EPD---GACGLEKRIADGMTFTCPVHWCFECKQIED------------------------ 278

Query: 389 RKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIFPGIEENKTII 448
           R+I+F+ K    +  R WE    + +I IYCL   I  + G+         G  E+  I 
Sbjct: 279 REISFKKK---DVTARVWEVPKGDPKIFIYCLLVSIAGKGGS--------LGTYEDIVI- 326

Query: 449 DRPRKKQSLASPSGKQKVASTKSSLTSKAPPQGKFSVKALKRVPSKAGQGETMEISERLL 508
                 Q+L S     +    +S  T + PP      +A++     +G+       ER  
Sbjct: 327 ------QTLPSQRRAHRHHQIRSHCTQRQPPNF-LRREAMREAVESSGEAVRERTGERTP 379

Query: 509 V-GSDSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSLGERLYAAFVTEGTE-- 565
               D + R +           N KS     D+  +V  K T +  +      +  T   
Sbjct: 380 AKDDDEAARKQGKQRWHSGATANTKSQESYEDQ-FNVSKKPTHISSQAGTCIASCQTRMN 438

Query: 566 ----QTKFGKQDNSDRETSRTVTVK-PLRKKLISELPSLDED----------SKRRLSSL 610
               +++  K+D  +  + +  T K  +++K   E   LD++             +L SL
Sbjct: 439 NKEHKSRRNKEDIPEMRSEKVATPKVSIKRKRKCEFEFLDDEVATTKGMAFQRAEKLPSL 498

Query: 611 MKDAASSVRME-----------EILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRT 659
           + +  + + +E           E+   HK P    Y S    D+ I L + E ++E I++
Sbjct: 499 IVEKETKMAIESNVFKQMEQNRELSSPHKRPKGGVYLS----DEQIKLVEYESTLELIQS 554

Query: 660 ALKKLDVDGSSIEDAKAVCEPEVLSQIFKWKNKLKVYLAPFLHGMR--YTSFGRHFTKVD 717
            ++++    + + D     E E LS  F  K   +V +      M   +    +H   +D
Sbjct: 555 KIREVKRLKTEVID---FVEIEELSS-FHRKATAEVLITKMQSDMSLVWGKLSQHLDGLD 610

Query: 718 KLQAIVDKLHWYVNDGDMIVD 738
           +   +  ++H  V     + D
Sbjct: 611 EEAMLDREMHSLVQKCYFVAD 631


>gi|242041461|ref|XP_002468125.1| hypothetical protein SORBIDRAFT_01g040025 [Sorghum bicolor]
 gi|241921979|gb|EER95123.1| hypothetical protein SORBIDRAFT_01g040025 [Sorghum bicolor]
          Length = 131

 Score = 97.8 bits (242), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 60/131 (45%), Positives = 69/131 (52%), Gaps = 27/131 (20%)

Query: 270 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGA--EVFPCVSATCGHFYHPHCVSKL 327
           +E     NF CKNCEYK+HQCF CG+LGSS+   G+  EV  C    CG FYHP C+SK 
Sbjct: 16  NEKSKSFNFICKNCEYKKHQCFVCGELGSSEMPPGSPEEVLKCQKKYCGRFYHPKCLSKY 75

Query: 328 LLRDDEVAADQLAKSIIAGESFTCPLHKCCICK----------QGENKADSDLQFAVCRR 377
               D     Q          F CPLH+C  CK          Q E K ++ L    CRR
Sbjct: 76  ----DPTKNRQ---------DFECPLHECHSCKNKGETVITSEQTEKKKETYL--VQCRR 120

Query: 378 CPKAYHRKCLP 388
           CP AYHRKCLP
Sbjct: 121 CPVAYHRKCLP 131


>gi|328865207|gb|EGG13593.1| PHD zinc finger-containing protein [Dictyostelium fasciculatum]
          Length = 584

 Score = 94.7 bits (234), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 70/219 (31%), Positives = 96/219 (43%), Gaps = 43/219 (19%)

Query: 214 TEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVE 273
            EE +  +     ++VC+ C+ GG+LL C+G CLRSFH             LGL +  V 
Sbjct: 72  VEEKQLPKRQRKNEAVCTICEKGGELLMCDGACLRSFHV----------ECLGL-QSLVT 120

Query: 274 AMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDE 333
               + C +C  +Q+ CF+C K G      G ++  C    CG FYH  C+++       
Sbjct: 121 PGQRWECDDCLNQQNSCFSCKKRGI----IGMDLMKCKVHQCGKFYHHSCIAEF------ 170

Query: 334 VAADQLAKSI-IAGESFTCPLHKCCICKQ-GENKADSDLQFAVCRRCPKAYHRKCLPRKI 391
                LAK +      F CPLH C  C Q G+ K     Q   C RCP AYH  C+P  +
Sbjct: 171 ----PLAKMVNTKSPRFNCPLHYCGKCGQSGDGK-----QSVHCFRCPAAYHVTCIPPGV 221

Query: 392 AFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGT 430
               K +E   T            LI C KH+ +  I T
Sbjct: 222 KMLTKSKETRKTG-----------LILCPKHQHETVIPT 249


>gi|66828485|ref|XP_647597.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
 gi|74859221|sp|Q55FD6.1|Y8158_DICDI RecName: Full=PHD finger-containing protein DDB_G0268158
 gi|60475597|gb|EAL73532.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
          Length = 688

 Score = 92.4 bits (228), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 93/192 (48%), Gaps = 23/192 (11%)

Query: 212 HDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTK- 269
           ++T +D+  +     ++VC+FC+  G+LL C+G CLRSFH + + A    + +S  ++  
Sbjct: 121 NETNDDDKPKRPRKNEAVCTFCEKPGELLMCDGLCLRSFHISCLKARNLYNPSSSSISPV 180

Query: 270 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 329
             ++  + + C +C   Q+ CF+C K G      G ++  C    CG FYH  C      
Sbjct: 181 TTIDGTVRWECNDCVSSQNSCFSCKKRGI----IGIDLMKCKVHQCGKFYHYKC------ 230

Query: 330 RDDEVAADQLAKSI-IAGESFTCPLHKCCICK-QGENKADSDLQFAVCRRCPKAYHRKCL 387
               VA  +LAK I      F CPLH C +C+  G+ K     Q   C RCP AYH  C+
Sbjct: 231 ----VADYKLAKLINTKTPRFNCPLHYCSVCEVSGDGK-----QSVHCFRCPTAYHVICM 281

Query: 388 PRKIAFEDKLEE 399
              +    K  E
Sbjct: 282 QPGVKMLTKTRE 293


>gi|330833845|ref|XP_003291988.1| hypothetical protein DICPUDRAFT_156684 [Dictyostelium purpureum]
 gi|325077793|gb|EGC31483.1| hypothetical protein DICPUDRAFT_156684 [Dictyostelium purpureum]
          Length = 537

 Score = 91.7 bits (226), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 61/188 (32%), Positives = 86/188 (45%), Gaps = 27/188 (14%)

Query: 214 TEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVE 273
            +E+E  +     +++C+ C+  GDLL C+G CLRSFH         H   L    D   
Sbjct: 73  NDEEEKPKRTRKNETICTVCEKPGDLLMCDGLCLRSFHVNCVGSANIH---LNTNPD--- 126

Query: 274 AMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDE 333
             + + C +C  +Q+ CF+C K G      G ++  C    CG FYH  CVS+       
Sbjct: 127 GSVRWECNDCVNQQNYCFSCKKRGI----IGMDLMKCKVHQCGKFYHYKCVSEY------ 176

Query: 334 VAADQLAKSI-IAGESFTCPLHKCCICK-QGENKADSDLQFAVCRRCPKAYHRKCLPRKI 391
               +LAK I      F CPLH C +C+  G+ K     Q   C RCP AYH  C+   +
Sbjct: 177 ----KLAKLINTKTPRFNCPLHYCAVCEVSGDGK-----QSVHCFRCPTAYHVICMQPGV 227

Query: 392 AFEDKLEE 399
               K +E
Sbjct: 228 KMLTKSKE 235


>gi|242081339|ref|XP_002445438.1| hypothetical protein SORBIDRAFT_07g019210 [Sorghum bicolor]
 gi|241941788|gb|EES14933.1| hypothetical protein SORBIDRAFT_07g019210 [Sorghum bicolor]
          Length = 149

 Score = 88.6 bits (218), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 49/128 (38%), Positives = 61/128 (47%), Gaps = 22/128 (17%)

Query: 224 ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNC 283
           EL D VC+ C +GGDLL                       SLG T+ +VEAM NF C +C
Sbjct: 31  ELNDVVCALCGDGGDLLW----------------------SLGFTEAQVEAMQNFICHSC 68

Query: 284 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 343
             +QH+C +CG  GSS      +VF C   TCG FYH  C+S  L   D   A +    +
Sbjct: 69  RNRQHRCASCGIRGSSGVGNHPQVFRCNHETCGRFYHASCISAELHPGDPAEAGRCRVRV 128

Query: 344 IAGESFTC 351
            AG  F C
Sbjct: 129 AAGRPFWC 136


>gi|10177759|dbj|BAB11072.1| unnamed protein product [Arabidopsis thaliana]
          Length = 227

 Score = 88.6 bits (218), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 50/141 (35%), Positives = 75/141 (53%), Gaps = 7/141 (4%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKEL-IYLRGA 59
           M  S++E E+    V +YYF  + E    E++ F  LP++WN  +       L ++LRG 
Sbjct: 1   MEWSEEEGEILPDFVDNYYFIDQSE----ELVKFCDLPLKWNSVDYDDDSSGLSVFLRGN 56

Query: 60  ADSGLQKIFKPVIAWKFDLTNVI--PEIFVLSKENSWIKLQKPRKCYEEIYRTILIMVHC 117
            DSG +   K V  W F L++V   P+I VL     WI LQ PRK YE + RT L+ +  
Sbjct: 57  IDSGDESFSKLVKGWNFYLSSVDEHPKIEVLLHGLHWITLQSPRKSYETLIRTSLVTLQF 116

Query: 118 LSYAKRNPEATAKSIWDFLSR 138
           L + KR+P+  +  +W  L +
Sbjct: 117 LHFVKRDPDVASGDVWSSLQK 137


>gi|325193611|emb|CCA27889.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1008

 Score = 87.8 bits (216), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 60/187 (32%), Positives = 79/187 (42%), Gaps = 51/187 (27%)

Query: 226 FDSVCSFCDNG-GDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN---FFCK 281
            D +C  C+   GDL  C G C+ ++H         HC  +     +  A L+   + C 
Sbjct: 1   MDHICLRCEKSDGDLWLCTGLCVSAYHL--------HCLQI-----DANANLDPTTWKCP 47

Query: 282 NCEYKQHQCFACGKLGSSDKETGA------------------EVFPCVSATCGHFYHPHC 323
           +C+   H CF C + GSS     A                   V  C + +CG FYH  C
Sbjct: 48  SCQNHLHICFYCHQTGSSFYHENAVQQEPSQTHTPPLHDNISYVSKCRALSCGKFYHLEC 107

Query: 324 VSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYH 383
           ++KL L            + IAG  F CPLH C  C+Q   K +S      C RCP AYH
Sbjct: 108 ITKLAL------------ARIAGTHFICPLHTCASCEQSGAKKES----VRCARCPVAYH 151

Query: 384 RKCLPRK 390
             CLPRK
Sbjct: 152 TSCLPRK 158


>gi|348672863|gb|EGZ12683.1| hypothetical protein PHYSODRAFT_563400 [Phytophthora sojae]
          Length = 1174

 Score = 85.9 bits (211), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 52/175 (29%), Positives = 73/175 (41%), Gaps = 42/175 (24%)

Query: 229 VCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML---NFFCKNCE 284
           +C  C+  GD L+ C G C+ +FHA              L+ D   A      + C +C 
Sbjct: 80  ICVVCEREGDALVVCGGPCISAFHAAC------------LSPDSGAAQAQDDTWLCPSCR 127

Query: 285 YKQHQCFACGKLG------SSDKETGAE-----VFPCVSATCGHFYHPHCVSKLLLRDDE 333
            K H CF C + G       +    GA+     V  C + +CG FYH  C+++  L    
Sbjct: 128 SKTHACFHCKQTGVEALKDDAPASAGADPTLRPVRKCRALSCGKFYHQECIAQFPL---- 183

Query: 334 VAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 388
                    I     F CPLH C  C    N++ +  +   C RCP AYH +CLP
Sbjct: 184 -------ARIATNTHFICPLHTCAGC----NQSGAQQEAVRCMRCPVAYHARCLP 227


>gi|414884834|tpg|DAA60848.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 143

 Score = 85.9 bits (211), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 39/63 (61%), Positives = 45/63 (71%)

Query: 242 CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 301
           C+G C+RSFHA   +GE+S C +LG T+ EVEAM  F CKNCEYKQHQCF CG L  SD 
Sbjct: 22  CDGPCMRSFHAKEGSGEDSCCVTLGYTEAEVEAMKLFLCKNCEYKQHQCFICGVLEPSDG 81

Query: 302 ETG 304
           E  
Sbjct: 82  EAA 84


>gi|298706866|emb|CBJ25830.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 810

 Score = 85.5 bits (210), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 60/178 (33%), Positives = 84/178 (47%), Gaps = 32/178 (17%)

Query: 220 NEEDELFDSVCSFCDNG--GDLLCCEGRCLRSFHAT-IDAGEESHCASLGLTKDEVEAML 276
           N  D+   ++CS  D     D++ C+G CLRSFH   +D GE++        K+E     
Sbjct: 220 NSADDWICAMCSQLDTPCMSDMVTCDGPCLRSFHVVCLDLGEDA-------LKEE----- 267

Query: 277 NFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAA 336
            + C++CE  +H+C+ CG  G  +   G  VF C   +CG FYH HCV   L ++  V A
Sbjct: 268 KWLCEDCERAEHECWQCGDYGQDNVVGG--VFRCGVPSCGRFYHRHCVE--LNKNSVVKA 323

Query: 337 DQLAKSIIAGE------SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 388
           D     +  GE       F C  H C  C  G     + L    C +CP AYH  C+P
Sbjct: 324 D-----VDEGEDGQPIFKFRCAYHTCDTCCSGRGGTKNHLY--KCIKCPTAYHLNCIP 374


>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
            CRA_b [Rattus norvegicus]
          Length = 2586

 Score = 84.3 bits (207), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 68/161 (42%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1436 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1482

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1483 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVV 1523

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F CPLH C  C      N + S  +   C RCP AYH
Sbjct: 1524 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1564


>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Rattus norvegicus]
 gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
            CRA_a [Rattus norvegicus]
          Length = 2381

 Score = 84.0 bits (206), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 68/161 (42%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1231 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1277

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1278 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVV 1318

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F CPLH C  C      N + S  +   C RCP AYH
Sbjct: 1319 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1359


>gi|414884835|tpg|DAA60849.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 170

 Score = 83.2 bits (204), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 46/121 (38%), Positives = 76/121 (62%), Gaps = 3/121 (2%)

Query: 569 FGKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSLMKDAASSVRMEEILKRHK 628
           F     S +E  + +   PL     S  P +D ++++R+ +++ +  SS+ ++++ +   
Sbjct: 48  FISSPKSVKEQEQELVPLPLSAIRKSSFPKVDSETEKRVITILGNKVSSLTIKDVTRNCS 107

Query: 629 IPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLDVDGSSIEDAKAVCEPEVLSQIFK 688
           +PSTH Y+ +   DK I LGK+E SV+A+  ALKKL+ +G S+ DAKAVCEPEVL Q+ +
Sbjct: 108 LPSTHVYSGR-LTDK-IPLGKIERSVQAVEAALKKLE-NGGSVNDAKAVCEPEVLRQLAR 164

Query: 689 W 689
           W
Sbjct: 165 W 165


>gi|224050217|ref|XP_002195834.1| PREDICTED: histone-lysine N-methyltransferase NSD2 [Taeniopygia
           guttata]
          Length = 1339

 Score = 83.2 bits (204), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 64/157 (40%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  GDLL CEG C R+FH +           LGL+         F C  C    H
Sbjct: 673 VCQLCEKTGDLLLCEGLCYRAFHVSC----------LGLSG---RPAGKFICSECTSGVH 719

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE  A+V  CV + CG FYH  CV K  L            ++     
Sbjct: 720 TCFVC-------KERKADVKRCVVSHCGKFYHEACVKKFHL------------TVFENRG 760

Query: 349 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      +   S  +   C RCP AYH
Sbjct: 761 FRCPLHSCLSCHVSNPSHPRISKGKMMRCVRCPVAYH 797


>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Cricetulus griseus]
          Length = 2690

 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1544 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1590

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               ++ 
Sbjct: 1591 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVT 1631

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F CPLH C  C      N + S  +   C RCP AYH
Sbjct: 1632 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1672


>gi|301095030|ref|XP_002896617.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262108847|gb|EEY66899.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1089

 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 53/172 (30%), Positives = 75/172 (43%), Gaps = 36/172 (20%)

Query: 229 VCSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 287
           +C  C+  G DL+ C G C+ +FH +         A  G++    EA   + C +C  K 
Sbjct: 74  ICVVCEREGDDLVVCAGPCISAFHLSCLP------ARSGVSDALDEA---WLCPSCTNKT 124

Query: 288 HQCFACGKLG------SSDKETGAE-----VFPCVSATCGHFYHPHCVSKLLLRDDEVAA 336
           H CF C + G       +   TG +     V  C + +CG FYH  C+++  L       
Sbjct: 125 HACFHCKETGVETLKDDAPASTGIDPNKRPVRKCRALSCGKFYHQECITQFPL------- 177

Query: 337 DQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 388
                 I     F CPLH C  C    N++ +  +   C RCP AYH KCLP
Sbjct: 178 ----ARIAINTHFICPLHTCAGC----NESGAQQEAVRCMRCPVAYHAKCLP 221


>gi|354472091|ref|XP_003498274.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Cricetulus griseus]
          Length = 1436

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 68/159 (42%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C+  GD L+ CEG C R FH             LGLT         F C+ C+ 
Sbjct: 701 DTVCQICETAGDCLVSCEGECCRHFHWEC----------LGLT---AVPEGRFICEECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       KE+G +V  C  + CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KESGEDVKRCSVSVCGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|354472093|ref|XP_003498275.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
           [Cricetulus griseus]
          Length = 1387

 Score = 82.4 bits (202), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 68/159 (42%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C+  GD L+ CEG C R FH             LGLT         F C+ C+ 
Sbjct: 695 DTVCQICETAGDCLVSCEGECCRHFHWEC----------LGLT---AVPEGRFICEECKT 741

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       KE+G +V  C  + CG FYH  CV K               +I  
Sbjct: 742 GQHPCFSC-------KESGEDVKRCSVSVCGKFYHEACVRK------------FPTAIFE 782

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 783 SKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 821


>gi|395543169|ref|XP_003773493.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2 [Sarcophilus harrisii]
          Length = 1464

 Score = 82.4 bits (202), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 63/157 (40%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G+LL CEG C  +FH +           LGL++        F C  C    H
Sbjct: 670 VCQLCEKSGNLLLCEGPCYGAFHLSC----------LGLSR---RPEGKFICSECTSGIH 716

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE   EV  CV + CG FYH  CV K  L            ++     
Sbjct: 717 SCFVC-------KEKKTEVKRCVVSQCGKFYHEACVKKYHL------------TVFESRG 757

Query: 349 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 758 FRCPLHSCVSCHVTNPSNPRSSKGKMMRCVRCPVAYH 794


>gi|345315868|ref|XP_001516412.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2,
           partial [Ornithorhynchus anatinus]
          Length = 951

 Score = 82.4 bits (202), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH+          A LGL +        F C  C    H
Sbjct: 477 VCQLCEKTGSLLLCEGPCYGAFHS----------ACLGLPR---RPEGKFICSECTSGVH 523

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE   EV  CV + CG FYH  CV K  L            ++     
Sbjct: 524 SCFVC-------KEKKTEVKRCVVSQCGKFYHEACVRKHHL------------TVFESRG 564

Query: 349 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 565 FRCPLHSCVSCHVSNPSNPRASKGKLMRCVRCPVAYH 601


>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Cricetulus griseus]
          Length = 2318

 Score = 82.4 bits (202), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1172 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1218

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               ++ 
Sbjct: 1219 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVT 1259

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F CPLH C  C      N + S  +   C RCP AYH
Sbjct: 1260 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1300


>gi|380791865|gb|AFE67808.1| putative histone-lysine N-methyltransferase NSD2 isoform 1, partial
           [Macaca mulatta]
          Length = 949

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  CV   CG FYH  CV K  L            ++     
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|348571627|ref|XP_003471597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           2 [Cavia porcellus]
          Length = 1367

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 55/157 (35%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL +        F C  C    H
Sbjct: 670 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLPR---RPEGKFTCMECTSGIH 716

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+ AEV  CV   CG FYH  CV K  L            ++     
Sbjct: 717 SCFVC-------KESKAEVKRCVVTQCGKFYHEACVRKFPL------------TVFESRG 757

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 758 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 794


>gi|118090799|ref|XP_420839.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Gallus
           gallus]
          Length = 1369

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 64/157 (40%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  GDLL CEG C R+FH +           LGL+         F C  C    H
Sbjct: 674 VCQLCEKTGDLLLCEGLCYRAFHVSC----------LGLSG---RPAGKFVCSECTSGVH 720

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE  A++  CV + CG FYH  CV K  L            ++     
Sbjct: 721 TCFVC-------KERKADLKRCVVSHCGKFYHEACVKKFHL------------TVFESRG 761

Query: 349 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      +   S  +   C RCP AYH
Sbjct: 762 FRCPLHSCLSCHVSNPSHPRISKGKMMRCVRCPVAYH 798


>gi|348571625|ref|XP_003471596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           1 [Cavia porcellus]
          Length = 1366

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 55/157 (35%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL +        F C  C    H
Sbjct: 669 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLPR---RPEGKFTCMECTSGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+ AEV  CV   CG FYH  CV K  L            ++     
Sbjct: 716 SCFVC-------KESKAEVKRCVVTQCGKFYHEACVRKFPL------------TVFESRG 756

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|126332220|ref|XP_001374612.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
           [Monodelphis domestica]
          Length = 1366

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 63/157 (40%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G+LL CEG C  +FH +           LGL++        F C  C    H
Sbjct: 670 VCQLCEKSGNLLLCEGPCYGAFHLSC----------LGLSR---RPEGKFICSECTSGIH 716

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE   EV  CV + CG FYH  CV K  L            ++     
Sbjct: 717 SCFVC-------KEKKTEVKRCVVSQCGKFYHEACVKKYHL------------TVFESRG 757

Query: 349 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 758 FRCPLHSCVSCHVTNPSNPRSSKGKMMRCVRCPVAYH 794


>gi|326919530|ref|XP_003206033.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Meleagris gallopavo]
          Length = 1348

 Score = 81.6 bits (200), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 64/157 (40%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  GDLL CEG C R+FH +           LGL+         F C  C    H
Sbjct: 653 VCQLCEKTGDLLLCEGLCYRAFHVSC----------LGLSG---RPAGKFVCSECTSGVH 699

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE  A++  CV + CG FYH  CV K  L            ++     
Sbjct: 700 TCFVC-------KERKADLKRCVVSHCGKFYHEACVKKFHL------------TVFENRG 740

Query: 349 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      +   S  +   C RCP AYH
Sbjct: 741 FRCPLHSCLSCHVSNPSHPRISKGKMMRCVRCPVAYH 777


>gi|255583040|ref|XP_002532288.1| hypothetical protein RCOM_0549780 [Ricinus communis]
 gi|223528022|gb|EEF30103.1| hypothetical protein RCOM_0549780 [Ricinus communis]
          Length = 164

 Score = 81.6 bits (200), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 42/87 (48%), Positives = 55/87 (63%), Gaps = 7/87 (8%)

Query: 834 TERLDRKESAYELVWEDDQFLSGKSFY------LPGSVDENDKQMDQWNMTAPPLYLWSR 887
            +RLD K + Y+L+WED   LSGK  +      LPGS+D NDKQ++Q N   PPLYLWSR
Sbjct: 79  VKRLDEK-APYKLIWEDHMLLSGKGTWQFTGSHLPGSIDVNDKQIEQRNSKPPPLYLWSR 137

Query: 888 HDYAAHHKALAEKHGHISRPQSRTQME 914
            D+ A H A+A + GH+S     + ME
Sbjct: 138 LDWIAKHMAIARQCGHMSNQHEVSVME 164



 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 24/42 (57%), Gaps = 3/42 (7%)

Query: 240 LCCEGRCLRSFHATIDAGEESHCASLGL---TKDEVEAMLNF 278
           LC  GRCLRSFH ++  G ES C SLG     + EV   L F
Sbjct: 9   LCWNGRCLRSFHPSVADGVESKCVSLGFRNKAEYEVWTTLQF 50


>gi|432879768|ref|XP_004073538.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Oryzias latipes]
          Length = 2321

 Score = 81.6 bits (200), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 56/174 (32%), Positives = 69/174 (39%), Gaps = 39/174 (22%)

Query: 214  TEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVE 273
            ++ D   +   L ++VC  C+  GDLL CEG C  +FH          C  L      V 
Sbjct: 1454 SQVDGKGKTSSLKENVCQVCERPGDLLVCEGHCYGAFHL--------QCVGLS-----VA 1500

Query: 274  AMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDE 333
                FFC+ C    H CF C K G        +V  C+   CG FYH  C+         
Sbjct: 1501 PKGKFFCQECNTGDHMCFVCKKSG--------DVKRCIIPLCGKFYHMDCI--------- 1543

Query: 334  VAADQLAKSIIA--GESFTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 383
                 LA S      + F CPLH C  C      N   S  + A C RCP AYH
Sbjct: 1544 -----LAFSATQPHNKGFRCPLHVCLSCHITNPVNVFSSKGRLARCVRCPVAYH 1592


>gi|124486903|ref|NP_001074738.1| histone-lysine N-methyltransferase NSD3 isoform 2 [Mus musculus]
 gi|189442807|gb|AAI67226.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (human) [synthetic
           construct]
          Length = 1446

 Score = 81.3 bits (199), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 68/159 (42%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C+  GD L+ CEG C R FH             LGLT        +F C+ CE 
Sbjct: 711 DTVCQVCEKAGDCLVACEGECCRHFHVEC----------LGLT---AVPEGHFTCEECET 757

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C  + CG FYH  CV K               +I  
Sbjct: 758 GQHPCFSC-------KVSGKDVKRCSVSVCGKFYHEACVRK------------FPTAIFE 798

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 799 SKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 837


>gi|118572948|sp|Q6P2L6.2|NSD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
           Full=Nuclear SET domain-containing protein 3; AltName:
           Full=Wolf-Hirschhorn syndrome candidate 1-like protein 1
           homolog; Short=WHSC1-like protein 1
          Length = 1439

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 68/159 (42%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C+  GD L+ CEG C R FH             LGLT        +F C+ CE 
Sbjct: 701 DTVCQVCEKAGDCLVACEGECCRHFHVEC----------LGLT---AVPEGHFTCEECET 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C  + CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVSVCGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|119583723|gb|EAW63319.1| Wolf-Hirschhorn syndrome candidate 1-like 1, isoform CRA_a [Homo
           sapiens]
          Length = 992

 Score = 81.3 bits (199), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDS-----KFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|157821603|ref|NP_001099560.1| histone-lysine N-methyltransferase NSD3 [Rattus norvegicus]
 gi|149057818|gb|EDM09061.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (predicted) [Rattus
           norvegicus]
          Length = 1396

 Score = 80.9 bits (198), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 53/159 (33%), Positives = 67/159 (42%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C+N GD L+ CEG C R FH          C  L    +       F C+ CE 
Sbjct: 702 DTVCQVCENTGDSLVACEGECCRHFHP--------ECLGLAAVPEG-----RFTCEECET 748

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C  + CG FYH  CV K               +I  
Sbjct: 749 GQHPCFSC-------KVSGKDVKRCSVSVCGKFYHEACVRK------------FPTAIFE 789

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 790 SKGFRCPQHCCSSCSMEKDIHKASKGRMMKCLRCPIAYH 828


>gi|148700883|gb|EDL32830.1| mCG14519 [Mus musculus]
          Length = 1381

 Score = 80.9 bits (198), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 68/159 (42%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C+  GD L+ CEG C R FH             LGLT        +F C+ CE 
Sbjct: 686 DTVCQVCEKAGDCLVACEGECCRHFHVEC----------LGLT---AVPEGHFTCEECET 732

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C  + CG FYH  CV K               +I  
Sbjct: 733 GQHPCFSC-------KVSGKDVKRCSVSVCGKFYHEACVRK------------FPTAIFE 773

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 774 SKGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 812


>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Otolemur garnettii]
          Length = 2410

 Score = 80.9 bits (198), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 60/205 (29%), Positives = 83/205 (40%), Gaps = 43/205 (20%)

Query: 189  SDEVVQTKAMSGFIVDDMEEDMVHDT-------EEDESNEEDELFDSVCSFCDNGGDLLC 241
            S+  + T   +G   + +EE + HD+        + E      L ++VC  C+  G+LL 
Sbjct: 1214 SEGELMTHRTTGSPKETVEEGIEHDSGMPASKKMQGERGGGAALKENVCQNCEKLGELLL 1273

Query: 242  CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 301
            CE +C  +FH             LGLT+        F C  C    H CF C       K
Sbjct: 1274 CEAQCCGAFHLEC----------LGLTE---MPRGKFICNECRTGIHTCFVC-------K 1313

Query: 302  ETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQ 361
            ++G +V  C+   CG FYH  CV K               ++I  + F C LH C  C  
Sbjct: 1314 QSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVIQNKGFRCSLHICITC-H 1360

Query: 362  GENKAD---SDLQFAVCRRCPKAYH 383
              N A    S  +   C RCP AYH
Sbjct: 1361 AANPASVSASKGRLMRCVRCPVAYH 1385


>gi|348535504|ref|XP_003455240.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Oreochromis niloticus]
          Length = 2122

 Score = 80.5 bits (197), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 60/194 (30%), Positives = 75/194 (38%), Gaps = 47/194 (24%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L +SVC  C+  GDLL CEG C  +FH             +GL+         F C+ C 
Sbjct: 1245 LKESVCQVCERTGDLLVCEGHCYGAFHLQC----------IGLS---APPKGKFLCRECN 1291

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C K G   K        C+   CG FYH  C+              +A S  
Sbjct: 1292 TGVHACFVCKKSGDGVKR-------CIIPLCGKFYHTDCI--------------MAYSAT 1330

Query: 345  A--GESFTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEG 400
                + F CPLH C  C      N   S  + A C RCP AYH        A ++ +  G
Sbjct: 1331 QPHNKGFRCPLHVCLSCHITNPLNICSSKGRLARCVRCPVAYH--------ANDNCMAAG 1382

Query: 401  IITRAWEGLL-PNH 413
             +  A    L PNH
Sbjct: 1383 SLVLANNSFLCPNH 1396


>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
            Full=H4-K20-HMTase; AltName: Full=Nuclear
            receptor-binding SET domain-containing protein 1;
            Short=NR-binding SET domain-containing protein
 gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
          Length = 2588

 Score = 80.5 bits (197), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 66/161 (40%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGL +        F C  C 
Sbjct: 1439 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLPE---MPRGKFICNECH 1485

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               ++ 
Sbjct: 1486 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVT 1526

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F CPLH C  C      N + S  +   C RCP AYH
Sbjct: 1527 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1567


>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
            musculus]
          Length = 2588

 Score = 80.5 bits (197), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 66/161 (40%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGL +        F C  C 
Sbjct: 1439 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLPE---MPRGKFICNECH 1485

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               ++ 
Sbjct: 1486 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVT 1526

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F CPLH C  C      N + S  +   C RCP AYH
Sbjct: 1527 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1567


>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Mus musculus]
          Length = 2691

 Score = 80.5 bits (197), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 66/161 (40%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGL +        F C  C 
Sbjct: 1542 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLPE---MPRGKFICNECH 1588

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               ++ 
Sbjct: 1589 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVT 1629

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F CPLH C  C      N + S  +   C RCP AYH
Sbjct: 1630 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1670


>gi|354483938|ref|XP_003504149.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           1 [Cricetulus griseus]
          Length = 1365

 Score = 80.5 bits (197), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 669 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  EV  CV   CG FYH  CV K  L            ++     
Sbjct: 716 SCFVC-------KESKMEVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756

Query: 349 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCVSCHATNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
            musculus]
          Length = 2382

 Score = 80.5 bits (197), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 66/161 (40%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGL +        F C  C 
Sbjct: 1233 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLPE---MPRGKFICNECH 1279

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               ++ 
Sbjct: 1280 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVT 1320

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F CPLH C  C      N + S  +   C RCP AYH
Sbjct: 1321 QNKGFRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1361


>gi|344244292|gb|EGW00396.1| putative histone-lysine N-methyltransferase NSD2 [Cricetulus
           griseus]
          Length = 1344

 Score = 80.1 bits (196), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 648 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 694

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  EV  CV   CG FYH  CV K  L            ++     
Sbjct: 695 SCFVC-------KESKMEVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 735

Query: 349 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 736 FRCPLHSCVSCHATNPSNPRPSKGKMMRCVRCPVAYH 772


>gi|296222021|ref|XP_002757008.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
           [Callithrix jacchus]
          Length = 1438

 Score = 80.1 bits (196), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C R FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIACEGECCRHFHL--------ECLGLTSLPDS-----RFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|395857586|ref|XP_003801172.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2 [Otolemur garnettii]
          Length = 1371

 Score = 80.1 bits (196), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 52/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH +           LGL++        F C  C    H
Sbjct: 669 VCQLCEKTGSLLLCEGPCCGAFHLSC----------LGLSR---RPEGRFTCSECTSGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  CV   CG FYH  CV K  L            ++     
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHETCVRKFPL------------TVFESRG 756

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPANPRPSKGKMMRCVRCPVAYH 793


>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
          Length = 2695

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1540 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1586

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1587 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1627

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1628 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1668


>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
          Length = 2695

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1540 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1586

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1587 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1627

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1628 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1668


>gi|326432726|gb|EGD78296.1| hypothetical protein PTSG_09362 [Salpingoeca sp. ATCC 50818]
          Length = 1279

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 63/244 (25%), Positives = 92/244 (37%), Gaps = 33/244 (13%)

Query: 154 DHMDLIKEALERDDVLAKSKFLVTFLEEKPTKRKLSDEVVQTKAMSGFIVDDMEEDMVHD 213
           D  D  K  L++ + L + +      +   T+RK +    + ++         + D    
Sbjct: 604 DVADTAKHVLDQWNALTQEEQQQWHAKAATTRRKAAKRAAERESQEDADAAAADGDASTT 663

Query: 214 TEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVE 273
              ++S+ +    D  C  C   G+LLCCEG CL S+H          C  L        
Sbjct: 664 DAGEDSSVDVTKHDDYCGICGEAGNLLCCEGGCLSSYHL--------FCVGLSCAPQGA- 714

Query: 274 AMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDE 333
               F C  C    H CFAC + G  +   G +   C    CG  YH  C+S        
Sbjct: 715 ----FVCDACTTGNHLCFACEQPGGLE---GLQT--CSVRNCGKKYHRACIS-------- 757

Query: 334 VAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAF 393
                  ++ +   SF CPLHKC  C   +    S      C RCP AYH  C+P     
Sbjct: 758 ----NNPRAALKDNSFKCPLHKCANCTYPQA---STYPLVRCIRCPIAYHTCCVPAGCLH 810

Query: 394 EDKL 397
           E+ +
Sbjct: 811 ENAI 814


>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform b [Macaca mulatta]
 gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform b [Macaca mulatta]
          Length = 2695

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1540 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1586

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1587 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1627

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1628 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1668


>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 8 [Pan troglodytes]
 gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Pan paniscus]
 gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2697

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1542 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1588

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1589 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1629

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1630 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1670


>gi|403294386|ref|XP_003938171.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Saimiri
           boliviensis boliviensis]
          Length = 1438

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C R FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIACEGECCRHFHL--------ECLGLTSLPDS-----RFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|296222023|ref|XP_002757009.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3
           [Callithrix jacchus]
          Length = 1389

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C R FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIACEGECCRHFHL--------ECLGLTSLPDS-----RFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|296222019|ref|XP_002757007.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Callithrix jacchus]
          Length = 1427

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C R FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIACEGECCRHFHL--------ECLGLTSLPDS-----RFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform a [Macaca mulatta]
 gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform a [Macaca mulatta]
          Length = 2426

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1271 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1317

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1318 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1358

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1359 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1399


>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform b [Homo sapiens]
 gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific; AltName: Full=Androgen receptor
            coactivator 267 kDa protein; AltName: Full=Androgen
            receptor-associated protein of 267 kDa; AltName:
            Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
            Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
            receptor-binding SET domain-containing protein 1;
            Short=NR-binding SET domain-containing protein
 gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
 gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
 gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
            sapiens]
          Length = 2696

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1541 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1587

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1588 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1628

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1629 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1669


>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2697

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1542 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1588

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1589 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1629

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1630 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1670


>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 1 [Pongo abelii]
          Length = 2697

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1542 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1588

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1589 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1629

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1630 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1670


>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
            sapiens]
          Length = 2593

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1438 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1484

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1485 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1525

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1526 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1566


>gi|295424164|ref|NP_001074571.2| histone-lysine N-methyltransferase NSD2 isoform 1 [Mus musculus]
          Length = 1366

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 670 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 716

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  EV  CV   CG FYH  CV K  L            ++     
Sbjct: 717 SCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFESRG 757

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 758 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 794


>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 2 [Pongo abelii]
          Length = 2594

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1439 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1485

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1486 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1526

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1527 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1567


>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific [Nomascus
            leucogenys]
          Length = 2697

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1542 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPKGKFICNECR 1588

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1589 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1629

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1630 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1670


>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
          Length = 2427

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1272 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1318

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1319 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1359

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1360 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1400


>gi|162318272|gb|AAI56161.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
 gi|162318442|gb|AAI56968.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
          Length = 1346

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 650 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 696

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  EV  CV   CG FYH  CV K  L            ++     
Sbjct: 697 SCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFESRG 737

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 738 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 774


>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Papio anubis]
          Length = 2343

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1185 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1231

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1232 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1272

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1273 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1313


>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
          Length = 2596

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1438 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1484

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1485 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1525

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1526 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1566


>gi|295424166|ref|NP_780440.2| histone-lysine N-methyltransferase NSD2 isoform 2 [Mus musculus]
 gi|118572947|sp|Q8BVE8.2|NSD2_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
           Full=Multiple myeloma SET domain-containing protein;
           Short=MMSET; AltName: Full=Nuclear SET domain-containing
           protein 2; Short=NSD2; AltName: Full=Wolf-Hirschhorn
           syndrome candidate 1 protein homolog; Short=WHSC1
          Length = 1365

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 669 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  EV  CV   CG FYH  CV K  L            ++     
Sbjct: 716 SCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFESRG 756

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform a [Homo sapiens]
 gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
 gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
            sapiens]
          Length = 2427

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1272 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1318

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1319 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1359

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1360 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1400


>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Saimiri boliviensis boliviensis]
          Length = 2697

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1542 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECH 1588

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1589 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1629

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1630 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1670


>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2428

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1273 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1319

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1320 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1360

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1361 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1401


>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2428

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1273 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1319

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1320 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1360

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1361 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1401


>gi|297295821|ref|XP_001094467.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Macaca mulatta]
          Length = 2329

 Score = 79.7 bits (195), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1185 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1231

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1232 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1272

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1273 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1313


>gi|325188110|emb|CCA22651.1| chromodomainhelicaseDNAbinding protein putative [Albugo laibachii
            Nc14]
          Length = 1883

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 71/160 (44%), Gaps = 27/160 (16%)

Query: 229  VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
            +C+ C +GG +L C+G C RSFH             +G+T +  +   ++ C +C   +H
Sbjct: 1595 LCTLCGDGGLILLCDGPCHRSFHL----------ECIGMTHEPQDE--HWLCPDCNAGKH 1642

Query: 289  QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
             C  C K+G    E G  V  C  A CG FYH  C+      D  V         +  + 
Sbjct: 1643 MCLLCKKVGEMGVEFG--VLQCSMARCGRFYHRGCLEV----DRHV-------EWVGKKR 1689

Query: 349  FTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 388
            F CP H C  CK+ +    S +    C  C +A+H  C+P
Sbjct: 1690 FRCPSHFCHSCKERKRPRQSTI--VSCLHCARAFHADCIP 1727


>gi|170284946|gb|AAI61082.1| whsc1 protein [Xenopus (Silurana) tropicalis]
          Length = 1028

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 63/157 (40%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  GDL+ CEG C  +FH +           +GL+         + CK C     
Sbjct: 613 VCQVCEKVGDLMLCEGVCCSAFHLSC----------IGLS---TRPAGKYLCKECTSGAR 659

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  C+   CG FYH  C+ K  L            ++     
Sbjct: 660 SCFLC-------KESNRDVKRCIVPHCGKFYHESCLRKYPL------------AVFESRG 700

Query: 349 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 383
           F CPLH+C  C      N   S  +   C RCP AYH
Sbjct: 701 FRCPLHRCATCYFSNPSNPRASKGKMVRCVRCPLAYH 737


>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific [Callithrix
            jacchus]
          Length = 2692

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1541 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECH 1587

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1588 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1628

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1629 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1669


>gi|383421363|gb|AFH33895.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
           mulatta]
 gi|384949270|gb|AFI38240.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
           mulatta]
 gi|387540940|gb|AFJ71097.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
           mulatta]
          Length = 1365

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  CV   CG FYH  CV K  L            ++     
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|449534225|ref|XP_004174066.1| PREDICTED: uncharacterized protein LOC101231072, partial [Cucumis
           sativus]
          Length = 158

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 60/176 (34%), Positives = 97/176 (55%), Gaps = 25/176 (14%)

Query: 480 QGKFSVKALKRVPSKAGQGETM-EISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQV 538
           QGK + K  K     +  G+ + +++ + L  S+S +  K  ++SR S   N K  SV +
Sbjct: 6   QGKSAAKVSKSFERSSSNGKLLGKVTAKSLWSSES-KNVKLGNISRNSL--NQKGESVLM 62

Query: 539 DRSSSVDSKKTSLGERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPS 598
           D   ++  KK+SL  +  +A  T+  + +K  K+D S                    +  
Sbjct: 63  DIDKTIKVKKSSLVGK--SAIPTKRFDPSKIYKEDRSG-------------------MLL 101

Query: 599 LDEDSKRRLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSV 654
           LD +S+RRL  +MK+ ASS+ +E+++K+HK+PSTHAY+ K  VDK I +GK+EGSV
Sbjct: 102 LDANSERRLMDMMKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSV 157


>gi|355744804|gb|EHH49429.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca
           fascicularis]
          Length = 1365

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  CV   CG FYH  CV K  L            ++     
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Sus scrofa]
          Length = 2392

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1236 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1282

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1283 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1323

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1324 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1364


>gi|355557406|gb|EHH14186.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca mulatta]
          Length = 1365

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  CV   CG FYH  CV K  L            ++     
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|224080887|ref|XP_002197925.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Taeniopygia
           guttata]
          Length = 1435

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 67/159 (42%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ G+ LL CEG C   FH          C  L    +E      F C  C+ 
Sbjct: 698 DTVCQICESSGESLLACEGECCSMFHL--------ECLGLKAMPEE-----KFICTECKN 744

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            +H CF+C       K  G +V  C  +TCG FYH  CV K             A ++  
Sbjct: 745 GEHTCFSC-------KLPGKDVKRCSVSTCGKFYHEACVRK------------FATALFE 785

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
              F CP H C  C   ++    S  + A C RCP AYH
Sbjct: 786 SRGFRCPQHCCTACSMDKDMHKASKGRMARCLRCPVAYH 824


>gi|426343599|ref|XP_004038381.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           1 [Gorilla gorilla gorilla]
          Length = 1365

 Score = 79.3 bits (194), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  CV   CG FYH  CV K  L            ++     
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|19913348|ref|NP_579877.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|19913350|ref|NP_579878.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|19913358|ref|NP_579890.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|109633019|ref|NP_001035889.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|74706096|sp|O96028.1|NSD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
           Full=Multiple myeloma SET domain-containing protein;
           Short=MMSET; AltName: Full=Nuclear SET domain-containing
           protein 2; Short=NSD2; AltName: Full=Protein
           trithorax-5; AltName: Full=Wolf-Hirschhorn syndrome
           candidate 1 protein; Short=WHSC1
 gi|3249713|gb|AAC24150.1| MMSET type II [Homo sapiens]
 gi|4378019|gb|AAD19343.1| putative WHSC1 protein [Homo sapiens]
 gi|4521954|gb|AAD21770.1| putative WHSC1 protein [Homo sapiens]
 gi|4521955|gb|AAD21771.1| putative WHSC1 protein [Homo sapiens]
 gi|5123789|emb|CAB45386.1| TRX5 protein [Homo sapiens]
 gi|6683809|gb|AAF23370.1| MMSET type II [Homo sapiens]
 gi|119602958|gb|EAW82552.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
 gi|119602959|gb|EAW82553.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
 gi|119602962|gb|EAW82556.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
 gi|168273154|dbj|BAG10416.1| histone-lysine N-methyltransferase NSD2 [synthetic construct]
 gi|187252511|gb|AAI66668.1| Wolf-Hirschhorn syndrome candidate 1 [synthetic construct]
          Length = 1365

 Score = 79.3 bits (194), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  CV   CG FYH  CV K  L            ++     
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|297672976|ref|XP_002814554.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2 [Pongo abelii]
          Length = 1365

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  CV   CG FYH  CV K  L            ++     
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|114592860|ref|XP_001146084.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           6 [Pan troglodytes]
 gi|114592864|ref|XP_001146248.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           7 [Pan troglodytes]
 gi|114592866|ref|XP_001146323.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           8 [Pan troglodytes]
 gi|114592870|ref|XP_001146473.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           10 [Pan troglodytes]
 gi|397483594|ref|XP_003812984.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Pan
           paniscus]
 gi|410227780|gb|JAA11109.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
 gi|410259494|gb|JAA17713.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
 gi|410299310|gb|JAA28255.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
 gi|410334709|gb|JAA36301.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
          Length = 1365

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  CV   CG FYH  CV K  L            ++     
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|148705490|gb|EDL37437.1| mCG16344 [Mus musculus]
          Length = 1298

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 602 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 648

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  EV  CV   CG FYH  CV K  L            ++     
Sbjct: 649 SCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFESRG 689

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 690 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 726


>gi|390461098|ref|XP_003732596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           2 [Callithrix jacchus]
          Length = 1400

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 61/157 (38%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 667 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 713

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE   +V  CV   CG FYH  CV K  L            ++     
Sbjct: 714 SCFVC-------KENKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 754

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 755 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 791


>gi|390461100|ref|XP_003732597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           3 [Callithrix jacchus]
          Length = 1389

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 61/157 (38%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 667 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 713

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE   +V  CV   CG FYH  CV K  L            ++     
Sbjct: 714 SCFVC-------KENKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 754

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 755 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 791


>gi|344281620|ref|XP_003412576.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Loxodonta africana]
          Length = 1438

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 54/160 (33%), Positives = 68/160 (42%), Gaps = 36/160 (22%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DAVCQICESTGDSLIPCEGECCKHFHL--------ECLGLASLPDG-----KFICMECQT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C   TCG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGTDVKRCSVGTCGKFYHETCVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN--KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++  KA S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIYKA-SKGRMMRCLRCPVAYH 827


>gi|296197020|ref|XP_002746091.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           1 [Callithrix jacchus]
          Length = 1365

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 61/157 (38%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE   +V  CV   CG FYH  CV K  L            ++     
Sbjct: 716 SCFVC-------KENKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|403287002|ref|XP_003934751.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
           [Saimiri boliviensis boliviensis]
          Length = 1368

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 61/157 (38%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECTSGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE   +V  CV   CG FYH  CV K  L            ++     
Sbjct: 716 SCFVC-------KENKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 756

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|440907576|gb|ELR57709.1| Histone-lysine N-methyltransferase NSD3 [Bos grunniens mutus]
          Length = 1446

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 67/159 (42%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 702 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLSSPPDG-----KFVCVECKT 748

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +GA+V  C  + CG FYH  CV K               +I  
Sbjct: 749 GQHPCFSC-------KVSGADVKRCSVSACGKFYHEACVRK------------FPTAIFE 789

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 790 SKGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPVAYH 828


>gi|426256406|ref|XP_004021831.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Ovis
           aries]
          Length = 1439

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 67/159 (42%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 702 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLSSPPDG-----KFVCVECKT 748

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C  +TCG FYH  CV K               +I  
Sbjct: 749 GQHPCFSC-------KVSGMDVKRCSVSTCGKFYHEACVRK------------FPTAIFE 789

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 790 SKGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPVAYH 828


>gi|301762334|ref|XP_002916587.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2-like [Ailuropoda melanoleuca]
          Length = 1364

 Score = 79.0 bits (193), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/157 (33%), Positives = 63/157 (40%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G L+ CEG C  +FH           A LGL++        F C  C    H
Sbjct: 668 VCQLCEKTGSLVLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 714

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  CV + CG FYH  CV K  L            ++     
Sbjct: 715 SCFVC-------KESKTDVKRCVVSQCGKFYHEACVRKYPL------------TVFESRG 755

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 756 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 792


>gi|242047476|ref|XP_002461484.1| hypothetical protein SORBIDRAFT_02g003420 [Sorghum bicolor]
 gi|241924861|gb|EER98005.1| hypothetical protein SORBIDRAFT_02g003420 [Sorghum bicolor]
          Length = 631

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 50/81 (61%), Gaps = 4/81 (4%)

Query: 363 ENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKH 422
           ENK    LQ AVCRRCP+AYH +CLPR+++   K ++G   RAW+    +  I  YCL H
Sbjct: 2   ENKTQRALQLAVCRRCPRAYHWECLPRELSLGAKDKDG-NPRAWK---LSKTIFFYCLDH 57

Query: 423 EIDDEIGTPIRDHIIFPGIEE 443
           EID +  T  R+HI FP   E
Sbjct: 58  EIDKDTRTASRNHIKFPATPE 78


>gi|345798392|ref|XP_536224.3| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2 [Canis lupus familiaris]
          Length = 1364

 Score = 78.6 bits (192), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 52/157 (33%), Positives = 63/157 (40%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G L+ CEG C  +FH           A LGL++        F C  C    H
Sbjct: 668 VCQLCEKTGSLVLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 714

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  CV + CG FYH  CV K  L            ++     
Sbjct: 715 SCFVC-------KESKTDVKRCVVSQCGKFYHEACVRKYPL------------AVFESRG 755

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 756 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 792


>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific-like [Cavia
            porcellus]
          Length = 2509

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 56/190 (29%), Positives = 77/190 (40%), Gaps = 43/190 (22%)

Query: 204  DDMEEDMVHDTEEDESNEED-------ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDA 256
            +++EE + HD     S +          L ++VC  C+  G+LL CE +C  +FH     
Sbjct: 1326 ENIEEGIEHDASMSSSKKMQGERGGGAALKENVCQNCEKVGELLLCEAQCCGAFHLEC-- 1383

Query: 257  GEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCG 316
                    LGLT+        F C  C    H CF C       K++G +V  C+   CG
Sbjct: 1384 --------LGLTE---MPRGKFICNECRTGIHTCFVC-------KQSGEDVKRCLLPLCG 1425

Query: 317  HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKAD---SDLQFA 373
             FYH  CV K               +++  + F C LH C  C    N A    S  +  
Sbjct: 1426 KFYHEECVQK------------YPPTVMQNKGFRCSLHICITC-HAANPASVSASKGRLM 1472

Query: 374  VCRRCPKAYH 383
             C RCP AYH
Sbjct: 1473 RCVRCPVAYH 1482


>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Heterocephalus glaber]
          Length = 2698

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1541 LKENVCQNCEKVGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1587

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1588 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1628

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1629 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1669


>gi|332825882|ref|XP_003311721.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Pan
           troglodytes]
          Length = 1426

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIACEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|327284319|ref|XP_003226886.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Anolis
           carolinensis]
          Length = 1438

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 54/164 (32%), Positives = 66/164 (40%), Gaps = 44/164 (26%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           DS+C  C++ G+ LL CEG C R FH             LGL+    +    F C  C+ 
Sbjct: 700 DSICQICESSGESLLSCEGECYRVFHTEC----------LGLSS---QPEGKFICIECKN 746

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K  G +V  C    CG FYH  CV K             A ++  
Sbjct: 747 GQHTCFSC-------KLPGKDVKRCSVNACGKFYHEACVRK------------FATTVFE 787

Query: 346 GESFTCPLHKCCICKQGENKADSDLQFAV------CRRCPKAYH 383
              F CP H C  C       D D+  A       C RCP AYH
Sbjct: 788 SRGFRCPQHCCTSC-----SVDKDIHKASKGRMVRCLRCPIAYH 826


>gi|397521375|ref|XP_003830772.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Pan
           paniscus]
          Length = 1426

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIACEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|397521373|ref|XP_003830771.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Pan
           paniscus]
          Length = 1437

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIACEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|358419376|ref|XP_003584219.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Bos
           taurus]
          Length = 1439

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 67/159 (42%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 702 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLSSPPDG-----KFVCVECKT 748

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +GA+V  C  + CG FYH  CV K               ++  
Sbjct: 749 GQHPCFSC-------KVSGADVKRCSVSACGKFYHEACVRK------------FPTAVFE 789

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 790 SKGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPVAYH 828


>gi|114619713|ref|XP_519713.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 6 [Pan
           troglodytes]
 gi|410226716|gb|JAA10577.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
 gi|410266866|gb|JAA21399.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
 gi|410353855|gb|JAA43531.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
          Length = 1437

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIACEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|375294191|ref|NP_001069063.2| histone-lysine N-methyltransferase NSD3 [Bos taurus]
 gi|359080579|ref|XP_003588019.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Bos
           taurus]
          Length = 1439

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 67/159 (42%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 702 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLSSPPDG-----KFVCVECKT 748

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +GA+V  C  + CG FYH  CV K               ++  
Sbjct: 749 GQHPCFSC-------KVSGADVKRCSVSACGKFYHEACVRK------------FPTAVFE 789

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 790 SKGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPVAYH 828


>gi|149756942|ref|XP_001488967.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           1 [Equus caballus]
          Length = 1365

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 52/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G L+ CEG C  +FH           A LGL++        F C  C    H
Sbjct: 669 VCQLCEKTGSLVLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 715

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE   +V  CV + CG FYH  CV K  L            ++     
Sbjct: 716 TCFVC-------KENKTDVKRCVVSQCGKFYHEACVRKYPL------------TVFESRG 756

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|92098122|gb|AAI15007.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Homo sapiens]
          Length = 1437

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDS-----KFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Ovis aries]
          Length = 2698

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1544 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1590

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1591 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1631

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1632 QNKGFRCSLHICTTC-HAANPASVSASKGRLMRCVRCPVAYH 1672


>gi|114619717|ref|XP_001170799.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 4 [Pan
           troglodytes]
          Length = 1388

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIACEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
            lysine-20 specific [Desmodus rotundus]
          Length = 2699

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1544 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1590

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1591 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1631

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1632 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1672


>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Ailuropoda melanoleuca]
 gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
          Length = 2699

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1545 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1591

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1592 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1632

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1633 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1673


>gi|397521377|ref|XP_003830773.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3 [Pan
           paniscus]
          Length = 1388

 Score = 78.2 bits (191), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIACEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|327265653|ref|XP_003217622.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Anolis carolinensis]
          Length = 2106

 Score = 78.2 bits (191), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 65/161 (40%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGL++        F C  C 
Sbjct: 972  LKENVCQICEKPGELLLCEAQCCGAFHLEC----------LGLSE---MPKGKFICTECS 1018

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K +G  V  C+   CG +YH  C+ K               ++I
Sbjct: 1019 TGVHTCFVC-------KNSGENVKRCLLPLCGKYYHEECIQK------------YPPTVI 1059

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1060 QNKGFRCSLHICMTCHAANPTNVSASKGRLMRCVRCPVAYH 1100


>gi|300796853|ref|NP_001178481.1| probable histone-lysine N-methyltransferase NSD2 [Rattus
           norvegicus]
          Length = 1346

 Score = 78.2 bits (191), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 650 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSQ---RPEGRFTCTECASGIH 696

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  EV  C+   CG FYH  CV K  L            ++     
Sbjct: 697 SCFVC-------KESKMEVKRCMVNQCGKFYHEACVKKYPL------------TVFESRG 737

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 738 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 774


>gi|219841942|gb|AAI43511.1| WHSC1L1 protein [Homo sapiens]
          Length = 1426

 Score = 78.2 bits (191), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDS-----KFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Bos grunniens mutus]
          Length = 2698

 Score = 77.8 bits (190), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1544 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1590

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1591 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1631

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1632 QNKGFRCSLHICTTC-HAANPASVSASKGRLMRCVRCPVAYH 1672


>gi|12697312|emb|CAC28350.1| putative chromatin modulator [Homo sapiens]
          Length = 1437

 Score = 77.8 bits (190), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDS-----KFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|13699811|ref|NP_075447.1| histone-lysine N-methyltransferase NSD3 isoform long [Homo sapiens]
 gi|74761342|sp|Q9BZ95.1|NSD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
           Full=Nuclear SET domain-containing protein 3; AltName:
           Full=Protein whistle; AltName: Full=WHSC1-like 1 isoform
           9 with methyltransferase activity to lysine; AltName:
           Full=Wolf-Hirschhorn syndrome candidate 1-like protein
           1; Short=WHSC1-like protein 1
 gi|12642817|gb|AAK00355.1| putative protein WHSC1L1l [Homo sapiens]
 gi|75517229|gb|AAI01718.1| WHSC1L1 protein, isoform long [Homo sapiens]
 gi|109731704|gb|AAI13470.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Homo sapiens]
 gi|119583726|gb|EAW63322.1| Wolf-Hirschhorn syndrome candidate 1-like 1, isoform CRA_c [Homo
           sapiens]
 gi|168277388|dbj|BAG10672.1| histone-lysine N-methyltransferase NSD3 [synthetic construct]
          Length = 1437

 Score = 77.8 bits (190), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDS-----KFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Sarcophilus harrisii]
          Length = 2717

 Score = 77.8 bits (190), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             +GLT+        F CK C 
Sbjct: 1544 LKENVCQNCEKVGELLLCEAQCCGAFHLEC----------IGLTE---MPKGKFICKECR 1590

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K +G +V  C+   CG FYH  C+ K               +++
Sbjct: 1591 TGIHTCFVC-------KTSGEDVKRCLLPLCGKFYHEACIQK------------YPPTVL 1631

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1632 QNKGFRCSLHMCITC-HAANPASLSASKGRLMRCVRCPVAYH 1672


>gi|384247005|gb|EIE20493.1| hypothetical protein COCSUDRAFT_57640 [Coccomyxa subellipsoidea
            C-169]
          Length = 2740

 Score = 77.8 bits (190), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/152 (31%), Positives = 68/152 (44%), Gaps = 31/152 (20%)

Query: 226  FDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
              SVCS C     L+ C G CLR++H        S C  LGL        + +FC  C+ 
Sbjct: 1862 LSSVCSAC-----LVVCGGPCLRAYH--------SGC--LGLPVLPAPGSV-WFCPECDS 1905

Query: 286  KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
             + +CF+CG+ G+  ++    +  C    CG FYH  CV +L+         Q+AK    
Sbjct: 1906 GRMRCFSCGEFGAGFEDP--TIRKCSLGVCGRFYHVRCVDRLV-------NTQMAKR--- 1953

Query: 346  GESFTCPLHKCCICKQGENKADSDLQFAVCRR 377
            G  F CP H C +C +  +  D      VC R
Sbjct: 1954 GAHFRCPQHYCSVCAKSGDGVD---MVVVCAR 1982


>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 5 [Canis lupus familiaris]
          Length = 2698

 Score = 77.8 bits (190), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1542 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1588

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1589 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1629

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1630 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1670


>gi|12697314|emb|CAC28351.1| Putative Chromatin modulator [Homo sapiens]
          Length = 1388

 Score = 77.8 bits (190), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDS-----KFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Bos taurus]
          Length = 2389

 Score = 77.8 bits (190), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1235 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1281

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1282 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1322

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1323 QNKGFRCSLHICTTC-HAANPASVSASKGRLMRCVRCPVAYH 1363


>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
            [Oryctolagus cuniculus]
          Length = 2431

 Score = 77.8 bits (190), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1275 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1321

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1322 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1362

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1363 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1403


>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Felis catus]
          Length = 2432

 Score = 77.4 bits (189), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1276 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1322

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1323 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1363

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1364 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1404


>gi|344265319|ref|XP_003404732.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Loxodonta africana]
          Length = 2702

 Score = 77.4 bits (189), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1545 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1591

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1592 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1632

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1633 QNKGFRCSLHICITC-HAANPASVTASKGRLMRCVRCPVAYH 1673


>gi|76666643|ref|XP_613048.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           1 [Bos taurus]
 gi|297476142|ref|XP_002688498.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Bos
           taurus]
 gi|296486298|tpg|DAA28411.1| TPA: Wolf-Hirschhorn syndrome candidate 1 [Bos taurus]
          Length = 1365

 Score = 77.4 bits (189), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 66/159 (41%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           + VC  C+  G L+ CEG C  +FH           A LGL++     +L   C  C   
Sbjct: 667 EHVCQLCEEPGSLVLCEGPCCGAFHL----------ACLGLSRRPEGRLL---CGECTSG 713

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            H CF C       KE+ ++V  CV + CG FYH  CV +  L            ++   
Sbjct: 714 IHSCFVC-------KESKSDVKRCVVSQCGKFYHEACVRRFPL------------TVFES 754

Query: 347 ESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
             F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 755 RGFRCPLHSCLSCHASNPSNPRPSKGKMLRCVRCPVAYH 793


>gi|351699801|gb|EHB02720.1| Histone-lysine N-methyltransferase NSD3 [Heterocephalus glaber]
          Length = 1439

 Score = 77.4 bits (189), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 53/160 (33%), Positives = 67/160 (41%), Gaps = 36/160 (22%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C+  GD L+ CEG C + FH          C  L    D       F C+ C+ 
Sbjct: 701 DTVCQICERSGDSLVPCEGECCKHFHL--------ECLGLTSVPDG-----KFMCRECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K  G +V  C    CG FYH  CV K             + +I  
Sbjct: 748 GQHPCFSC-------KVPGKDVKRCSVGACGKFYHEACVRK------------FSTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN--KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++  KA S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIYKA-SKGRMMRCLRCPVAYH 827


>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
            [Oryctolagus cuniculus]
          Length = 2700

 Score = 77.4 bits (189), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 1544 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 1590

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1591 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1631

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1632 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1672


>gi|363739108|ref|XP_414538.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Gallus gallus]
          Length = 2412

 Score = 77.4 bits (189), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 47/161 (29%), Positives = 64/161 (39%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGL++        F C  C 
Sbjct: 1256 LKENVCQICEKPGELLLCEAQCCGAFHLQC----------LGLSEMPTG---KFICNECS 1302

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K  G +V  C+   CG +YH  C+ K               ++ 
Sbjct: 1303 TGVHTCFVC-------KNCGQDVKRCLLPLCGKYYHEECIQK------------YPPTVT 1343

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1344 QNKGFRCSLHICMTCHAANPTNISASKGRLMRCVRCPVAYH 1384


>gi|350587283|ref|XP_003128857.3| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Sus
           scrofa]
          Length = 1338

 Score = 77.4 bits (189), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 53/159 (33%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           + VC  C+  G L+ CEG C  +FH           A LGL +   E  L   C  C   
Sbjct: 640 EHVCQLCEKPGSLVLCEGPCCGAFHL----------ACLGLAR-RPEGRLT--CLECTSG 686

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            H CF C       KE+ A+V  CV + CG FYH  CV +  L            ++   
Sbjct: 687 VHSCFVC-------KESKADVRRCVVSQCGKFYHEACVRRFPL------------TVFES 727

Query: 347 ESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
             F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 728 RGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 766


>gi|292621054|ref|XP_683890.4| PREDICTED: hypothetical protein LOC556086 [Danio rerio]
          Length = 2055

 Score = 77.4 bits (189), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 66/161 (40%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CEG+C  +FH             +GLT+        F C+ C+
Sbjct: 1224 LKENVCQVCEKTGELLLCEGQCCGAFHLQC----------IGLTET---PKGRFICQECK 1270

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K+   EV  C+   CG FYH  C+ K             + ++ 
Sbjct: 1271 MGVHTCFVC-------KKPDKEVRRCMIPVCGKFYHMDCILK------------YSPTVA 1311

Query: 345  AGESFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYH 383
                F C +H C  C      N   S  +   C RCP AYH
Sbjct: 1312 QNRGFRCSIHVCLSCYITNPNNPGISKGRLTRCVRCPVAYH 1352


>gi|281208068|gb|EFA82246.1| PHD zinc finger-containing protein [Polysphondylium pallidum PN500]
          Length = 533

 Score = 77.4 bits (189), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 69/240 (28%), Positives = 101/240 (42%), Gaps = 55/240 (22%)

Query: 185 KRKLSDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEG 244
           K K+ D++V +  ++         ++ HD   D   ++    +++C+ C+   +LL C+G
Sbjct: 36  KLKVPDQIVASPKINN--------NVTHDA--DAKAKKPRKNEAICTVCEKSDNLLMCDG 85

Query: 245 RCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETG 304
            CLRSFH  ID         +GL   +      + C +C   Q+ CFAC + G       
Sbjct: 86  PCLRSFH--IDC--------IGL---DSMPTSRWECSDCSKNQNICFACKERGI----IN 128

Query: 305 AEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI-IAGESFTCPLHKCCIC-KQG 362
             +  C    CG FYH  C+S+           +L K +      F CPLH C  C   G
Sbjct: 129 MNLTKCKVHQCGKFYHNKCISEF----------KLTKVVNTKSPRFNCPLHYCESCGVSG 178

Query: 363 ENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKH 422
           + K     Q   C RCP AYH  C+P  +    K +E   TR           LI C KH
Sbjct: 179 DGK-----QSVHCFRCPTAYHVICMPPVVKMLTKSKE---TRKTG--------LILCPKH 222


>gi|384252957|gb|EIE26432.1| hypothetical protein COCSUDRAFT_58970 [Coccomyxa subellipsoidea
           C-169]
          Length = 707

 Score = 77.4 bits (189), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 51/166 (30%), Positives = 72/166 (43%), Gaps = 28/166 (16%)

Query: 230 CSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQ 289
           C  C +GG+LL C+G CLRSFH       E          D  EA   ++C +C      
Sbjct: 321 CGRCADGGELLECDGTCLRSFHLNCLTPSERPSPD-----DPPEA--PWYCPDCRLGIGA 373

Query: 290 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKL-------LLRDDEVAADQLAKS 342
           C  C + G      G  +  C   +CG+++H  C++ L       + R+D   A+     
Sbjct: 374 CAICKQTGV----VGQHILKCKMGSCGYYFHNQCLNGLRDSGLLKIHREDPAVANGGNGE 429

Query: 343 IIAGESFTCPLHKCCICKQGENKADSDLQFAV-CRRCPKAYHRKCL 387
            +    FTCP H C +C+        D Q  V C RCP AYH + L
Sbjct: 430 RV----FTCPAHFCHVCRM-----SGDAQRMVRCWRCPTAYHSRGL 466


>gi|431902251|gb|ELK08752.1| Histone-lysine N-methyltransferase NSD3 [Pteropus alecto]
          Length = 1322

 Score = 77.0 bits (188), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 50/159 (31%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 700 DTVCQICESSGDSLIPCEGECYKHFHL--------ECLGLTSLPDG-----KFICMECKT 746

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  C+ K               +I  
Sbjct: 747 GQHPCFSC-------KVSGTDVKRCSVGACGKFYHEACIRK------------FPTAIFE 787

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 788 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 826


>gi|149047443|gb|EDM00113.1| similar to Wolf-Hirschhorn syndrome candidate 1 protein isoform 3
           (predicted) [Rattus norvegicus]
          Length = 1298

 Score = 77.0 bits (188), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 602 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSQ---RPEGRFTCTECASGIH 648

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  EV  C+   CG FYH  CV K  L            ++     
Sbjct: 649 SCFVC-------KESKMEVKRCMVNQCGKFYHEACVKKYPL------------TVFESRG 689

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 690 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 726


>gi|118101386|ref|XP_001232891.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Gallus gallus]
          Length = 1436

 Score = 77.0 bits (188), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 52/164 (31%), Positives = 67/164 (40%), Gaps = 44/164 (26%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ G+ L+ CEG C  +FH          C  L    +E      FFC  C+ 
Sbjct: 699 DTVCQICESSGESLVSCEGECCSTFHM--------ECLGLKAMPEE-----KFFCTECKN 745

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            +H CF+C       K  G +V  C  + CG FYH  CV K             A ++  
Sbjct: 746 GEHTCFSC-------KLPGKDVKRCSVSACGKFYHEACVRK------------FATALFE 786

Query: 346 GESFTCPLHKCCICKQGENKADSDLQFAV------CRRCPKAYH 383
              F CP H C  C       D D+  A       C RCP AYH
Sbjct: 787 SRGFRCPQHCCSAC-----SVDKDIHKASKGRMVRCFRCPIAYH 825


>gi|395507428|ref|XP_003758026.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Sarcophilus harrisii]
          Length = 1437

 Score = 77.0 bits (188), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 53/161 (32%), Positives = 70/161 (43%), Gaps = 38/161 (23%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN--FFCKNC 283
           D+VC  C++ GD L+ CEG C + FH  +D         LGL+     ++ N  F C  C
Sbjct: 701 DTVCQICESYGDSLVVCEGECCKHFH--LDC--------LGLS-----SLPNGRFLCTEC 745

Query: 284 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 343
           +  QH CF+C       K  G +V  C    CG FYH  CV K             A ++
Sbjct: 746 KTGQHTCFSC-------KVPGKDVKRCSVTACGKFYHEACVRK------------FATAV 786

Query: 344 IAGESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
                F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 787 FESRGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPIAYH 827


>gi|326932813|ref|XP_003212507.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 1
           [Meleagris gallopavo]
          Length = 1436

 Score = 77.0 bits (188), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 52/164 (31%), Positives = 67/164 (40%), Gaps = 44/164 (26%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ G+ L+ CEG C  +FH          C  L    +E      FFC  C+ 
Sbjct: 699 DTVCQICESSGESLVSCEGECCSTFHM--------ECLGLKAMPEE-----KFFCTECKN 745

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            +H CF+C       K  G +V  C  + CG FYH  CV K             A ++  
Sbjct: 746 GEHTCFSC-------KLPGKDVKRCSVSACGKFYHEACVRK------------FATALFE 786

Query: 346 GESFTCPLHKCCICKQGENKADSDLQFAV------CRRCPKAYH 383
              F CP H C  C       D D+  A       C RCP AYH
Sbjct: 787 SRGFRCPQHCCSAC-----SVDKDIHKASKGRMVRCFRCPIAYH 825


>gi|395739588|ref|XP_002819060.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Pongo
           abelii]
          Length = 1438

 Score = 77.0 bits (188), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|402878017|ref|XP_003902703.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Papio anubis]
          Length = 1438

 Score = 77.0 bits (188), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLTSLPDS-----KFVCMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|301614673|ref|XP_002936809.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
           [Xenopus (Silurana) tropicalis]
          Length = 1298

 Score = 77.0 bits (188), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 63/157 (40%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  GDL+ CEG C  +FH +           +GL+         + CK C     
Sbjct: 613 VCQVCEKVGDLMLCEGVCCSAFHLSC----------IGLS---TRPAGKYLCKECTSGAR 659

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  C+   CG FYH  C+ K  L            ++     
Sbjct: 660 SCFLC-------KESNRDVKRCIVPHCGKFYHESCLRKYPL------------AVFESRG 700

Query: 349 FTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYH 383
           F CPLH+C  C      N   S  +   C RCP AYH
Sbjct: 701 FRCPLHRCATCYFSNPSNPRASKGKMVRCVRCPLAYH 737


>gi|410307858|gb|JAA32529.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
          Length = 1437

 Score = 77.0 bits (188), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 50/159 (31%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD ++ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSVIACEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|118101388|ref|XP_424390.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
           [Gallus gallus]
          Length = 1386

 Score = 77.0 bits (188), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 52/164 (31%), Positives = 67/164 (40%), Gaps = 44/164 (26%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ G+ L+ CEG C  +FH          C  L    +E      FFC  C+ 
Sbjct: 699 DTVCQICESSGESLVSCEGECCSTFHM--------ECLGLKAMPEE-----KFFCTECKN 745

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            +H CF+C       K  G +V  C  + CG FYH  CV K             A ++  
Sbjct: 746 GEHTCFSC-------KLPGKDVKRCSVSACGKFYHEACVRK------------FATALFE 786

Query: 346 GESFTCPLHKCCICKQGENKADSDLQFAV------CRRCPKAYH 383
              F CP H C  C       D D+  A       C RCP AYH
Sbjct: 787 SRGFRCPQHCCSAC-----SVDKDIHKASKGRMVRCFRCPIAYH 825


>gi|326932815|ref|XP_003212508.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 2
           [Meleagris gallopavo]
          Length = 1386

 Score = 77.0 bits (188), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 52/164 (31%), Positives = 67/164 (40%), Gaps = 44/164 (26%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ G+ L+ CEG C  +FH          C  L    +E      FFC  C+ 
Sbjct: 699 DTVCQICESSGESLVSCEGECCSTFHM--------ECLGLKAMPEE-----KFFCTECKN 745

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            +H CF+C       K  G +V  C  + CG FYH  CV K             A ++  
Sbjct: 746 GEHTCFSC-------KLPGKDVKRCSVSACGKFYHEACVRK------------FATALFE 786

Query: 346 GESFTCPLHKCCICKQGENKADSDLQFAV------CRRCPKAYH 383
              F CP H C  C       D D+  A       C RCP AYH
Sbjct: 787 SRGFRCPQHCCSAC-----SVDKDIHKASKGRMVRCFRCPIAYH 825


>gi|326928449|ref|XP_003210391.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like, partial [Meleagris gallopavo]
          Length = 2336

 Score = 77.0 bits (188), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 47/161 (29%), Positives = 64/161 (39%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGL++        F C  C 
Sbjct: 1244 LKENVCQICEKPGELLLCEAQCCGAFHLQC----------LGLSEMPTG---KFICNECS 1290

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K  G +V  C+   CG +YH  C+ K               ++ 
Sbjct: 1291 TGVHTCFVC-------KSCGQDVKRCLLPLCGKYYHEECIQK------------YPPTVT 1331

Query: 345  AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
              + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 1332 QNKGFRCSLHICMTCHAANPTNISASKGRLMRCVRCPVAYH 1372


>gi|395507430|ref|XP_003758027.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
           [Sarcophilus harrisii]
          Length = 1389

 Score = 77.0 bits (188), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 53/161 (32%), Positives = 70/161 (43%), Gaps = 38/161 (23%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN--FFCKNC 283
           D+VC  C++ GD L+ CEG C + FH  +D         LGL+     ++ N  F C  C
Sbjct: 701 DTVCQICESYGDSLVVCEGECCKHFH--LDC--------LGLS-----SLPNGRFLCTEC 745

Query: 284 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 343
           +  QH CF+C       K  G +V  C    CG FYH  CV K             A ++
Sbjct: 746 KTGQHTCFSC-------KVPGKDVKRCSVTACGKFYHEACVRK------------FATAV 786

Query: 344 IAGESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
                F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 787 FESRGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPIAYH 827


>gi|301117486|ref|XP_002906471.1| chromodomain-helicase-DNA-binding protein, putative [Phytophthora
            infestans T30-4]
 gi|262107820|gb|EEY65872.1| chromodomain-helicase-DNA-binding protein, putative [Phytophthora
            infestans T30-4]
          Length = 2158

 Score = 77.0 bits (188), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 49/172 (28%), Positives = 70/172 (40%), Gaps = 30/172 (17%)

Query: 227  DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
            D +C+ C +GG +L C+G C RSFH          C  +   KDE      + C +C   
Sbjct: 1659 DDLCTLCGDGGLILLCDGPCHRSFHLD--------CVGM---KDEPNDE-QWLCPDCAEG 1706

Query: 287  QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            +H C  C ++G    E G  V  C  A CG FYH  C+           A+      +  
Sbjct: 1707 RHMCLICKQVGEMGVEFG--VTQCSVAKCGRFYHKGCL-----------AENSRVEWVGK 1753

Query: 347  ESFTCPLHKCCICKQ-----GENKADSDLQFAVCRRCPKAYHRKCLPRKIAF 393
            + F CP H C  C +        K + +     C  C +A+H  C+P    F
Sbjct: 1754 KRFRCPSHFCHACSKQATTNSNKKGEDNTGLVSCIHCSQAFHPDCIPSADKF 1805


>gi|297299251|ref|XP_001091186.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
           [Macaca mulatta]
          Length = 1419

 Score = 77.0 bits (188), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLTSLPDS-----KFVCMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|380814048|gb|AFE78898.1| histone-lysine N-methyltransferase NSD3 isoform long [Macaca
           mulatta]
 gi|383419477|gb|AFH32952.1| histone-lysine N-methyltransferase NSD3 isoform long [Macaca
           mulatta]
          Length = 1438

 Score = 77.0 bits (188), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLTSLPDS-----KFVCMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|291223879|ref|XP_002731935.1| PREDICTED: Wolf-Hirschhorn syndrome candidate 1 protein-like
            [Saccoglossus kowalevskii]
          Length = 1787

 Score = 77.0 bits (188), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 62/159 (38%), Gaps = 34/159 (21%)

Query: 227  DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
            ++VC  C+  G+LL CEG+C  +FH          C  L      V     F C  C   
Sbjct: 922  ENVCQVCERPGELLLCEGQCCGAFHL--------DCIGLQQMPSGV-----FKCDECISG 968

Query: 287  QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
             H CF C K          EV  C  A CG FYH  C+ K  L            +   G
Sbjct: 969  VHSCFICTKC-------DIEVKRCSVALCGKFYHEDCLKKWTL------------TRFDG 1009

Query: 347  ESFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYH 383
            +  TCPLH C  C      N   +  +   C RCP AYH
Sbjct: 1010 KGVTCPLHTCQACAADNPRNPKATKGRLTRCVRCPTAYH 1048


>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Tupaia chinensis]
          Length = 2687

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 66/162 (40%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGL +        F C  C 
Sbjct: 1533 LKENVCQNCEKVGELLLCEAQCCGAFHLEC----------LGLAE---MPRGKFICNECR 1579

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1580 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1620

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1621 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1661


>gi|16549858|dbj|BAB70868.1| unnamed protein product [Homo sapiens]
          Length = 1059

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 67/161 (41%), Gaps = 34/161 (21%)

Query: 225 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
           L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 504 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 550

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
              H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 551 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 591

Query: 345 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
             + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 592 QNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 632


>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Equus caballus]
          Length = 2700

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 66/162 (40%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGL +        F C  C 
Sbjct: 1544 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLNE---MPRGKFICNECR 1590

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1591 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1631

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1632 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1672


>gi|410923178|ref|XP_003975059.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Takifugu
           rubripes]
          Length = 1499

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 56/164 (34%), Positives = 67/164 (40%), Gaps = 43/164 (26%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C+  G+ L+ CEG C R FH             LGLT    +    F C  C+ 
Sbjct: 749 DTVCQICEAYGESLVVCEGDCNRQFHLEC----------LGLT---AQPEGRFICLECQN 795

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
             H CF+C       K  G EV  C  + CG +YH  CV KL           L  +   
Sbjct: 796 GDHPCFSC-------KTAGQEVTRCSVSGCGCYYHEDCVRKL-----------LGTTSSP 837

Query: 346 GESFTCPLHKCCICKQGENKADSDLQFAV------CRRCPKAYH 383
           G  F CP H C  C       + DLQ A       C RCP AYH
Sbjct: 838 GGGFCCPQHICSTC-----CLERDLQRASKGRLMRCIRCPVAYH 876


>gi|426359420|ref|XP_004046973.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Gorilla gorilla
           gorilla]
          Length = 1397

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 661 DTVCQICESPGDSLIPCEGECCKHFHL--------ECLGLTSLPDS-----KFICMECKT 707

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 708 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 748

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 749 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 787


>gi|395847335|ref|XP_003796334.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Otolemur garnettii]
          Length = 1438

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 64/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGDLLC-CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L  CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLTPCEGECCKHFHL--------ECLGLTSVPDG-----KFICTECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMLRCLRCPVAYH 827


>gi|395847339|ref|XP_003796336.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3
           [Otolemur garnettii]
          Length = 1427

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 64/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGDLLC-CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L  CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLTPCEGECCKHFHL--------ECLGLTSVPDG-----KFICTECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMLRCLRCPVAYH 827


>gi|348688432|gb|EGZ28246.1| hypothetical protein PHYSODRAFT_470076 [Phytophthora sojae]
          Length = 2182

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/177 (28%), Positives = 70/177 (39%), Gaps = 35/177 (19%)

Query: 227  DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
            D +C+ C +GG +L C+G C RSFH          C  +   KDE      + C +C   
Sbjct: 1658 DDLCTLCGDGGLILLCDGPCHRSFHL--------ECVGM---KDEPNDE-QWLCPDCAEG 1705

Query: 287  QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            +H C  C ++G    E G  V  C  A CG FYH  C++K                 +  
Sbjct: 1706 RHMCLICKQVGEMGVEFG--VTQCSVAKCGRFYHKGCLAK-----------SSRVEWVGK 1752

Query: 347  ESFTCPLHKCCICKQGENKA----------DSDLQFAVCRRCPKAYHRKCLPRKIAF 393
            + F CP H C  C Q   K           D       C  C +A+H +C+P    F
Sbjct: 1753 KRFRCPSHFCHACSQASKKESAAKSKKKKDDVKTHVVSCIHCSQAFHPECIPSADKF 1809


>gi|395847337|ref|XP_003796335.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
           [Otolemur garnettii]
          Length = 1389

 Score = 75.9 bits (185), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 64/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGDLLC-CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L  CEG C + FH          C  L    D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLTPCEGECCKHFHL--------ECLGLTSVPDG-----KFICTECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMLRCLRCPVAYH 827


>gi|291409090|ref|XP_002720827.1| PREDICTED: WHSC1L1 protein [Oryctolagus cuniculus]
          Length = 1435

 Score = 75.9 bits (185), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 64/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C   FH          C  L    D       F C  C+ 
Sbjct: 698 DTVCQICESSGDSLIACEGECCTHFHL--------ECLGLPSLPDG-----KFICVECQT 744

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 745 GQHPCFSC-------KMSGKDVKRCSVGACGKFYHEACVRK------------FPTAIFE 785

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 786 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 824


>gi|449474840|ref|XP_002193971.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20 specific [Taeniopygia guttata]
          Length = 1651

 Score = 75.1 bits (183), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 47/161 (29%), Positives = 65/161 (40%), Gaps = 34/161 (21%)

Query: 225 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
           L ++VC  C+  G+LL CE +C  +FH             LGL++        F C  C 
Sbjct: 270 LKENVCQICEKPGELLLCEAQCCGAFHLQC----------LGLSE---MPKGKFICNECS 316

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
              H CF C       K  G +V  C+   CG +YH  C+ K               +++
Sbjct: 317 TGVHTCFVC-------KSCGEDVKRCLLPLCGKYYHEACIQK------------YPPTVM 357

Query: 345 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
             + F C LH C  C      N + S  +   C RCP AYH
Sbjct: 358 QNKGFRCSLHICMTCHAANPANISASKGRLMRCVRCPVAYH 398


>gi|431892716|gb|ELK03149.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific, partial [Pteropus alecto]
          Length = 2202

 Score = 75.1 bits (183), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 66/162 (40%), Gaps = 36/162 (22%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            L ++VC  C+  G+LL CE +C  +FH             LGL +        F C  C 
Sbjct: 990  LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLNE---MPRGKFICNECR 1036

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 1037 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 1077

Query: 345  AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
              + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 1078 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 1118


>gi|405966105|gb|EKC31425.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
           specific [Crassostrea gigas]
          Length = 1079

 Score = 75.1 bits (183), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 52/173 (30%), Positives = 74/173 (42%), Gaps = 35/173 (20%)

Query: 213 DTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEV 272
           D +E+E N   +    +C  C++ GDL+ CEG C   FH +        C S  + +   
Sbjct: 298 DNQEEEKNSLCKSPRYLCQVCEDNGDLMECEGECHNYFHKS--------CVSENVEEP-- 347

Query: 273 EAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDD 332
                F C+ C+   H CFAC       KE       C    CG FYH +C++KL+    
Sbjct: 348 -----FKCEECKSGVHTCFAC-------KEKDENTKKCSVHLCGKFYHENCLNKLMHSKS 395

Query: 333 EVAADQLAKSIIAGESFTCPLHKCCICKQGENKAD--SDLQFAVCRRCPKAYH 383
           +           + + F CPLH+C  C    +K    S  +   C RCP AYH
Sbjct: 396 D-----------SNKGFVCPLHQCATCAIESSKKTKASKGRLLRCVRCPTAYH 437


>gi|402852477|ref|XP_003890948.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Papio anubis]
          Length = 1013

 Score = 75.1 bits (183), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 317 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 363

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  CV   CG FYH  CV K  L            ++     
Sbjct: 364 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 404

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 405 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 441


>gi|449270866|gb|EMC81514.1| Histone-lysine N-methyltransferase NSD3 [Columba livia]
          Length = 1440

 Score = 75.1 bits (183), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/159 (31%), Positives = 66/159 (41%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ G+ L+ CEG C   FH          C  L    DE      F C  C+ 
Sbjct: 691 DTVCQICESSGESLVSCEGECCGVFHP--------ECLGLKSLPDE-----KFICAECKN 737

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            +H CF+C       K  G +V  C  +TCG FYH  CV K             A ++  
Sbjct: 738 GEHTCFSC-------KLPGKDVKRCSVSTCGKFYHEACVRK------------FATALFE 778

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
              F CP H C  C   ++    S  +   C RCP A+H
Sbjct: 779 SRGFRCPQHCCTACSMDKDIHKASKGRMVRCLRCPVAFH 817


>gi|37360238|dbj|BAC98097.1| mKIAA1090 protein [Mus musculus]
          Length = 857

 Score = 75.1 bits (183), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 161 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 207

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  EV  CV   CG FYH  CV K  L            ++     
Sbjct: 208 SCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFESRG 248

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 249 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 285


>gi|359067302|ref|XP_002689078.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20 specific [Bos taurus]
          Length = 1470

 Score = 75.1 bits (183), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
           L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 316 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 362

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
              H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 363 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 403

Query: 345 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
             + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 404 QNKGFRCSLHICTTC-HAANPASVSASKGRLMRCVRCPVAYH 444


>gi|28204960|gb|AAH46473.1| Whsc1 protein, partial [Mus musculus]
          Length = 851

 Score = 74.7 bits (182), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 155 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASGIH 201

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  EV  CV   CG FYH  CV K  L            ++     
Sbjct: 202 SCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFESRG 242

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 243 FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 279


>gi|390360513|ref|XP_785219.3| PREDICTED: histone-lysine N-methyltransferase NSD3-like
           [Strongylocentrotus purpuratus]
          Length = 1736

 Score = 74.7 bits (182), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 66/160 (41%), Gaps = 34/160 (21%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           ++VC  C+  G LL CEG C  +FH  +D         +GL   +V    +F C  C   
Sbjct: 831 ENVCQVCERTGQLLLCEGGCCGAFH--LDC--------IGL---QVAPSGSFRCDECISG 877

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            H CF C       K +  EV  C    CG +YH  C+ +  L            +    
Sbjct: 878 VHSCFVC-------KLSDQEVRRCHVPVCGKYYHEGCIRRFPL------------TRFDS 918

Query: 347 ESFTCPLHKCCICKQGENKAD--SDLQFAVCRRCPKAYHR 384
             FTCPLH C  C     K+   S  +   C RCP AYH+
Sbjct: 919 RGFTCPLHACVACFADNPKSTKASRGRLMRCVRCPTAYHQ 958


>gi|348554221|ref|XP_003462924.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Cavia
           porcellus]
          Length = 1438

 Score = 74.7 bits (182), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 64/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C R FH          C       D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLVPCEGECYRYFHL--------ECLGWTSVPDG-----KFTCMECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K  G +V  C    CG FYH  CV K             + +I  
Sbjct: 748 GQHPCFSC-------KVPGEDVKRCSVGACGKFYHEACVRK------------FSTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|410914796|ref|XP_003970873.1| PREDICTED: uncharacterized protein LOC101068764 [Takifugu rubripes]
          Length = 2363

 Score = 74.3 bits (181), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 62/159 (38%), Gaps = 34/159 (21%)

Query: 227  DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
            +++C  C+  G+LL CEG+C  +FH          C SL            F C  C+  
Sbjct: 1549 ENICQVCEKTGELLLCEGQCCGAFHLA--------CISLAEAPKG-----KFICPECKSG 1595

Query: 287  QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
             H CF C       K+   +V  C+   CG FYH  C++              A ++   
Sbjct: 1596 IHTCFVC-------KKRSEDVRRCMIPVCGKFYHGECIA------------NFAPTVAVN 1636

Query: 347  ESFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYH 383
              F C +H C  C      +   S  +   C RCP AYH
Sbjct: 1637 RGFRCSIHVCLTCFIANPNSSTISKGRLVRCVRCPIAYH 1675


>gi|281339990|gb|EFB15574.1| hypothetical protein PANDA_004672 [Ailuropoda melanoleuca]
          Length = 1363

 Score = 74.3 bits (181), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 52/161 (32%), Positives = 64/161 (39%), Gaps = 38/161 (23%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ- 287
           VC  C+  G L+ CEG C  +FH           A LGL++        F C  C   + 
Sbjct: 663 VCQLCEKTGSLVLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGRF 709

Query: 288 ---HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
              H CF C       KE+  +V  CV + CG FYH  CV K  L            ++ 
Sbjct: 710 SWIHSCFVC-------KESKTDVKRCVVSQCGKFYHEACVRKYPL------------TVF 750

Query: 345 AGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
               F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 751 ESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 791


>gi|414884836|tpg|DAA60850.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 103

 Score = 74.3 bits (181), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 60/84 (71%), Gaps = 3/84 (3%)

Query: 606 RLSSLMKDAASSVRMEEILKRHKIPSTHAYASKSAVDKAITLGKVEGSVEAIRTALKKLD 665
           R+ +++ +  SS+ ++++ +   +PSTH Y+ +   DK I LGK+E SV+A+  ALKKL+
Sbjct: 18  RVITILGNKVSSLTIKDVTRNCSLPSTHVYSGR-LTDK-IPLGKIERSVQAVEAALKKLE 75

Query: 666 VDGSSIEDAKAVCEPEVLSQIFKW 689
            +G S+ DAKAVCEPEVL Q+ +W
Sbjct: 76  -NGGSVNDAKAVCEPEVLRQLARW 98


>gi|432099958|gb|ELK28852.1| Histone-lysine N-methyltransferase NSD3 [Myotis davidii]
          Length = 1641

 Score = 74.3 bits (181), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227  DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
            D+VC  C++ GD L+ CEG C + FH          C  +    D       F C  C+ 
Sbjct: 895  DTVCQICESSGDSLIPCEGECYKYFHL--------ECLGVTSLPDG-----KFICMECKT 941

Query: 286  KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
             +H CF+C       K +G +V  C    CG FYH  CV K               ++  
Sbjct: 942  GKHPCFSC-------KVSGTDVKRCSVGACGKFYHEACVRK------------FPTAVFE 982

Query: 346  GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
             + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 983  SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 1021


>gi|452821906|gb|EME28931.1| hypothetical protein Gasu_36670 [Galdieria sulphuraria]
          Length = 1451

 Score = 74.3 bits (181), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 67/156 (42%), Gaps = 20/156 (12%)

Query: 234  DNGGDLLCCEGRCLRSFHATIDAGEESH---CASLGLTKDEVEAMLNFFCKNCEYKQHQC 290
            D    L+ C+G C ++FH     G+ES      S+ L   +   +    C NC   +H C
Sbjct: 1291 DASNPLIACDGPCQKTFHYECAPGDESERPPIESIELYASDERPLWQ--CHNCTSGEHIC 1348

Query: 291  FACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFT 350
            F+CG+ G    +    +  C   +CG FYH  C  +          + LA+    G  F 
Sbjct: 1349 FSCGRPGHI-ADVNDPLRKCSLGSCGRFYHNSCAQQ----------EPLARLASDGNWFR 1397

Query: 351  CPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKC 386
            CP H C +C   E   DS      C  CP+A+H +C
Sbjct: 1398 CPQHYCVVC---EESGDSRPMIK-CIYCPRAWHVQC 1429


>gi|348527922|ref|XP_003451468.1| PREDICTED: hypothetical protein LOC100692734 [Oreochromis niloticus]
          Length = 2421

 Score = 74.3 bits (181), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 45/161 (27%), Positives = 64/161 (39%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            + +++C  C+  G+LL CEG+C  +FH          C SL            F C  C+
Sbjct: 1599 MKENICQVCEKTGELLLCEGQCCGAFHLP--------CISLAEAPKG-----KFVCPECK 1645

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K+ G +V  C+   CG FYH  C++              A ++ 
Sbjct: 1646 SGIHTCFVC-------KKRGEDVRRCMIPVCGKFYHGECIA------------NYAPTVP 1686

Query: 345  AGESFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYH 383
                F C +H C  C      +   S  +   C RCP AYH
Sbjct: 1687 VNRGFRCSIHVCLTCFIANPNSSNISKGRLVRCVRCPVAYH 1727


>gi|444722051|gb|ELW62755.1| putative histone-lysine N-methyltransferase NSD2 [Tupaia chinensis]
          Length = 1421

 Score = 73.9 bits (180), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 51/157 (32%), Positives = 60/157 (38%), Gaps = 37/157 (23%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++             C    H
Sbjct: 560 VCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSRRPEGRF------TCSEWIH 603

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  CV   CG FYH  CV K  L            ++     
Sbjct: 604 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 644

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 645 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 681


>gi|417406466|gb|JAA49891.1| Putative histone-lysine n-methyltransferase nsd3-like isoform 3
           [Desmodus rotundus]
          Length = 1438

 Score = 73.9 bits (180), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 63/159 (39%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C       D       F C  C+ 
Sbjct: 701 DTVCQICESSGDSLIPCEGECYKYFHL--------ECLGSPPLPDG-----KFICVECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K  G EV  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVPGTEVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|296485540|tpg|DAA27655.1| TPA: nuclear receptor binding SET domain protein 1 [Bos taurus]
          Length = 1275

 Score = 73.9 bits (180), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
           L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 119 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 165

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
              H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 166 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 206

Query: 345 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
             + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 207 QNKGFRCSLHICTTC-HAANPASVSASKGRLMRCVRCPVAYH 247


>gi|348527268|ref|XP_003451141.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like
           [Oreochromis niloticus]
          Length = 1605

 Score = 73.2 bits (178), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 65/159 (40%), Gaps = 33/159 (20%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C+  GD L+ CEG C R FH             LGL+         F C  C  
Sbjct: 857 DTVCQICEVYGDGLMVCEGDCNRQFHMEC----------LGLSS---PPEGRFTCAECRT 903

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
             H CF+C       K  G EV  C  + CG +YH  CV KL              +   
Sbjct: 904 GNHPCFSC-------KTVGREVTRCSVSGCGCYYHEDCVRKL-----------PGTTSSP 945

Query: 346 GESFTCPLHKC-CICKQGENKADSDLQFAVCRRCPKAYH 383
           G  F+CP H C   C + + +  S  +   C RCP AYH
Sbjct: 946 GGGFSCPQHSCSTCCLERDPQRASKGRLIRCIRCPLAYH 984


>gi|350593412|ref|XP_003483678.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           NSD3-like [Sus scrofa]
          Length = 1438

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++  D L+ CEG C + FH             LGLT         F C  C+ 
Sbjct: 701 DTVCQICESPSDSLIPCEGECCKHFHLEC----------LGLTS---LPDGKFVCVECKT 747

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 748 GQHPCFSC-------KVSGTDVKRCSVGACGKFYHEACVRK------------FPTAIFE 788

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 789 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|428162029|gb|EKX31238.1| hypothetical protein GUITHDRAFT_83305 [Guillardia theta CCMP2712]
          Length = 171

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 47/166 (28%), Positives = 66/166 (39%), Gaps = 29/166 (17%)

Query: 227 DSVCSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D  C  C    G L+CC+G C RSF          H A LG+ ++E +    + C  C+ 
Sbjct: 27  DYHCDICLGWEGTLVCCDGSCRRSF----------HLACLGMDEEENDEEEEWLCNLCKV 76

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
              +C  C     SD +    +  C   +C  ++H  C+       D            A
Sbjct: 77  GAKRCMIC-----SDSQDSENMIHCKVESCKKYFHRDCLKTWNCEVD------------A 119

Query: 346 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKI 391
              FTCP H C  C Q        + F  C  CP A+H KCLP ++
Sbjct: 120 AGRFTCPRHTCKACNQHSYTGKQGVMFK-CIDCPAAFHFKCLPSQV 164


>gi|432901504|ref|XP_004076868.1| PREDICTED: uncharacterized protein LOC101161079 [Oryzias latipes]
          Length = 2214

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 46/161 (28%), Positives = 66/161 (40%), Gaps = 34/161 (21%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            + +++C  C+  G+LL CEG+C  +FH          C SL    D  +    F C  C+
Sbjct: 1424 MKETICQVCEKTGELLLCEGQCCGAFHLA--------CISLA---DAPKG--KFVCPECK 1470

Query: 285  YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
               H CF C       K+ G +V  C+   CG FYH  C++              A +  
Sbjct: 1471 SGVHTCFVC-------KKRGEDVRRCMIPVCGKFYHGECIT------------NHAPTAP 1511

Query: 345  AGESFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYH 383
                F C +H C  C      + + S  +   C RCP AYH
Sbjct: 1512 VNRGFRCSIHVCLTCFIANPNSSSISKGRLVRCVRCPVAYH 1552


>gi|432873648|ref|XP_004072321.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Oryzias
           latipes]
          Length = 1597

 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 57/177 (32%), Positives = 70/177 (39%), Gaps = 46/177 (25%)

Query: 214 TEEDESNEEDELFDSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEV 272
           + +D SNE+    D+VC  C+  G+ L+ CEG C R FH          C  L    D  
Sbjct: 840 SRQDSSNEKR---DTVCQICEAYGEGLMVCEGDCSRQFHP--------ECLGLSAPPDG- 887

Query: 273 EAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDD 332
                F C  C+   H CF+C       K TG EV  C +  C  +YH  CV K      
Sbjct: 888 ----RFTCLECKNSNHPCFSC-------KTTGQEVSRCSAPGCSCYYHEDCVWK------ 930

Query: 333 EVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAV------CRRCPKAYH 383
                    +   G  F CP H C  C       + DLQ A       C RCP AYH
Sbjct: 931 -----HPGTTSSPGGGFCCPQHSCSTC-----CLERDLQRASKGRLIHCIRCPTAYH 977


>gi|443709486|gb|ELU04158.1| hypothetical protein CAPTEDRAFT_93578, partial [Capitella teleta]
          Length = 1129

 Score = 72.4 bits (176), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 55/159 (34%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           +S+C  C+   +LL C G CL  FHA          A LGL+   V     F C  C   
Sbjct: 490 ESLCGVCEKPNELLHCVGPCLGHFHA----------ACLGLS---VVPTSTFKCDECSTG 536

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            H CF C K   +DKET      C S  CG FYH  CVS             +  + +  
Sbjct: 537 IHSCFTCKK---ADKETKK----CSSHGCGKFYHMKCVSG------------IRSAKVDN 577

Query: 347 ESFTCPLHKCCICKQGENKADSDLQ--FAVCRRCPKAYH 383
             FTCPLH C  C   +       Q     C RCP AYH
Sbjct: 578 GHFTCPLHACATCAVDDVYKPKPKQNRLLRCVRCPTAYH 616


>gi|444511191|gb|ELV09829.1| Histone-lysine N-methyltransferase NSD3 [Tupaia chinensis]
          Length = 1235

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D++C  C++ GD LL CEG C R FH             LGLT         F C  C+ 
Sbjct: 555 DTLCQVCESAGDSLLPCEGDCCRHFHLEC----------LGLTS---LPDGKFTCVECQT 601

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
             H CF+C       K  G +V  C  + CG FYH  CV K               ++  
Sbjct: 602 GLHPCFSC-------KTPGKDVKRCSVSACGKFYHEACVRK------------FPTAVFE 642

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 643 SKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 681


>gi|119602957|gb|EAW82551.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_d [Homo sapiens]
          Length = 742

 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+  G LL CEG C  +FH           A LGL++        F C  C    H
Sbjct: 46  VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCSECASGIH 92

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       KE+  +V  CV   CG FYH  CV K  L            ++     
Sbjct: 93  SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESRG 133

Query: 349 FTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
           F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 134 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 170


>gi|325180900|emb|CCA15310.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
          Length = 1258

 Score = 71.6 bits (174), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 57/187 (30%), Positives = 84/187 (44%), Gaps = 39/187 (20%)

Query: 222 EDELFDSVCSFCDNGGD--LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFF 279
           ++EL    C   D  G   ++ C+  C  ++H +           LGLT DE +    + 
Sbjct: 310 DEELCGICCELEDEEGSNLVVWCKKSCYTAYHLS----------CLGLT-DEFDDS-EWK 357

Query: 280 CKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL---------R 330
           C+ CE ++  CFACG+ GS D+  G  VF C S  C  FYH  CV              R
Sbjct: 358 CEQCENREQICFACGRNGSIDERGG--VFKCCSQRCHKFYHYSCVEGCRRTRFYGSKRKR 415

Query: 331 DDEV--AADQLAKSIIAGES-------FTCPLHKCCICKQGENKADSDLQFAVCRRCPKA 381
           + ++   +D L+      E        F CP H C +C   E+   S+L F  C +CP++
Sbjct: 416 NPQIRKESDNLSMESDTNEEDFKYKFKFRCPRHICAVC---EDAKSSELMF--CIKCPES 470

Query: 382 YHRKCLP 388
           YH  C+P
Sbjct: 471 YHTSCVP 477


>gi|190349638|gb|ACE75882.1| multiple-myeloma-related WHSC1/MMSET isoform RE-IIBP [Homo sapiens]
          Length = 704

 Score = 71.6 bits (174), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 63/159 (39%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           + VC  C+  G LL CEG C  +FH           A LGL++        F C  C   
Sbjct: 6   EYVCQLCEKTGSLLLCEGPCCGAFHL----------ACLGLSR---RPEGRFTCTECASG 52

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            H CF C       KE+  EV  CV   CG FYH  CV K  L            ++   
Sbjct: 53  IHSCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPL------------TVFES 93

Query: 347 ESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
             F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 94  RGFRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 132


>gi|355708043|gb|AES03146.1| nuclear receptor binding SET domain protein 1 [Mustela putorius
           furo]
          Length = 588

 Score = 71.6 bits (174), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 67/162 (41%), Gaps = 36/162 (22%)

Query: 225 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
           L ++VC  C+  G+LL CE +C  +FH             LGLT+        F C  C 
Sbjct: 365 LKENVCQNCEKLGELLLCEAQCCGAFHLEC----------LGLTE---MPRGKFICNECR 411

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
              H CF C       K++G +V  C+   CG FYH  CV K               +++
Sbjct: 412 TGIHTCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK------------YPPTVM 452

Query: 345 AGESFTCPLHKCCICKQGENKAD---SDLQFAVCRRCPKAYH 383
             + F C LH C  C    N A    S  +   C RCP AYH
Sbjct: 453 QNKGFRCSLHICITC-HAANPASVSASKGRLMRCVRCPVAYH 493


>gi|410914004|ref|XP_003970478.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20 specific-like [Takifugu rubripes]
          Length = 1169

 Score = 71.2 bits (173), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 66/160 (41%), Gaps = 33/160 (20%)

Query: 225 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
           L ++VC  C+  GDLL C+G C  +FH             +GL+         FFC+ C 
Sbjct: 377 LKENVCQVCERTGDLLACDGHCYGAFHPQC----------IGLS---AAPKGKFFCRECS 423

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
              H CF C       K +G  V  C+   CG FYH  C++              + +  
Sbjct: 424 SGVHSCFVC-------KTSGNGVKRCMIPLCGKFYHNDCIT------------AFSATQP 464

Query: 345 AGESFTCPLHKCCICKQGENKAD-SDLQFAVCRRCPKAYH 383
             +   CPLH C  C      ++ S  + A C RCP AYH
Sbjct: 465 HNKGLRCPLHVCLSCHINYPHSNCSKGRLARCVRCPVAYH 504


>gi|281346901|gb|EFB22485.1| hypothetical protein PANDA_005493 [Ailuropoda melanoleuca]
          Length = 926

 Score = 71.2 bits (173), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 65/159 (40%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 180 DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASLPDG-----KFICVECKT 226

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            QH CF+C       K +G +V  C    CG FYH  CV K               +I  
Sbjct: 227 GQHACFSC-------KVSGTDVKRCSVGACGKFYHEACVRK------------FPTAIFE 267

Query: 346 GESFTCPLHKCCICK-QGENKADSDLQFAVCRRCPKAYH 383
            + F CP H C  C  + +    S  +   C RCP AYH
Sbjct: 268 SKGFRCPQHCCSACSVEKDIYKASKGRMMRCLRCPVAYH 306


>gi|224010613|ref|XP_002294264.1| hypothetical protein THAPSDRAFT_269843 [Thalassiosira pseudonana
           CCMP1335]
 gi|220970281|gb|EED88619.1| hypothetical protein THAPSDRAFT_269843 [Thalassiosira pseudonana
           CCMP1335]
          Length = 392

 Score = 70.9 bits (172), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 46/151 (30%), Positives = 69/151 (45%), Gaps = 31/151 (20%)

Query: 226 FDSVCSFC-------DNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNF 278
           F  +C+ C       D    LL CEG C R FH T        CA+L     + E    +
Sbjct: 208 FTWICTECREAECLDDPDAVLLICEGLCNRPFHPT--------CANLLSPPPDNET---W 256

Query: 279 FCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKL-----LLRDDE 333
            C++CE  +HQC AC   G  D     +V+ C +  CG F+H  C+S       ++ +++
Sbjct: 257 ICQDCEQGRHQCAACHNYGQDD----VDVYCCTAKNCGLFFHEACLSMYNVDVQVVEEEQ 312

Query: 334 VAADQLA----KSIIAGESFTCPLHKCCICK 360
            A D  A    ++I++   F CP H C  C+
Sbjct: 313 YADDDTADLQERTIVSMPMFKCPAHSCFTCQ 343


>gi|297791953|ref|XP_002863861.1| hypothetical protein ARALYDRAFT_331222 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309696|gb|EFH40120.1| hypothetical protein ARALYDRAFT_331222 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 193

 Score = 70.9 bits (172), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 45/138 (32%), Positives = 74/138 (53%), Gaps = 9/138 (6%)

Query: 1   MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGGCKELIYLRGAA 60
           M  S++E E+    V +YYF  + +   K    F  +P+  +  + +G     ++LRG  
Sbjct: 1   MEWSEEEGEILPDSVDEYYFVDQSDTPVK----FCDVPLNVDSTDSSGSP---VFLRGNI 53

Query: 61  DSGLQKIFKPVIAWKFDLT-NVIPEI-FVLSKENSWIKLQKPRKCYEEIYRTILIMVHCL 118
           +SG +   K V  W F+L+ +  P+I  +L     WI LQ PRK YE + RT L+ +  L
Sbjct: 54  NSGDESFSKLVKGWNFELSVDEHPKIEVLLHGMMHWITLQSPRKSYEALIRTTLVTLQFL 113

Query: 119 SYAKRNPEATAKSIWDFL 136
            + KRNP+A++  +W+ L
Sbjct: 114 HFVKRNPDASSDDVWNSL 131


>gi|298715287|emb|CBJ27936.1| Chromodomain-helicase-DNA-binding protein 8 [Ectocarpus siliculosus]
          Length = 3661

 Score = 70.5 bits (171), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 50/168 (29%), Positives = 69/168 (41%), Gaps = 34/168 (20%)

Query: 229  VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
            VC+ C++GG  + C+G C RSFH           A LG+  +  E    + C  C  K  
Sbjct: 2739 VCARCEDGGVTIMCDGPCQRSFHP----------ACLGMDDNPEED--PWMCNRCMNKVQ 2786

Query: 289  QCFACGKLGSS-DKETGAEVFP-----CVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS 342
            +C  CGK GS  D    A   P     C  ++CG +YH  C+ K+       + +     
Sbjct: 2787 KCLECGKKGSEMDSHNRAVKIPGGVSRCQLSSCGRYYHKECLDKITPNRTSYSKEG---- 2842

Query: 343  IIAGESFTCPLHKCCICKQGENKADSDL---QFAVCRRCPKAYHRKCL 387
                 +F CP H C  C     K  ++L       C RC KA    CL
Sbjct: 2843 -----NFKCPQHFCIDC----GKTSTNLGPRTLVKCLRCAKARCPDCL 2881


>gi|189534132|ref|XP_001923087.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20 specific [Danio rerio]
          Length = 773

 Score = 70.5 bits (171), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 63/161 (39%), Gaps = 36/161 (22%)

Query: 232 FCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCF 291
            C+  GDLL CEG+C  +FH       E                  F C+ C    H CF
Sbjct: 1   MCEKQGDLLLCEGQCCGAFHPQCTGLNEP-------------PTGKFLCQECTSGVHSCF 47

Query: 292 ACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTC 351
           AC +LG        +V  C+   CG FYH  C            A   A ++    +F C
Sbjct: 48  ACKRLGE-------DVRRCMVPGCGKFYHGEC------------AASHAPTVPLNRAFRC 88

Query: 352 PLHKC--CICKQGENKADSDLQFAVCRRCPKAYHRK--CLP 388
           PLH C  C      N + +  Q   C RCP AYH    C+P
Sbjct: 89  PLHACLSCFILNPANPSVAKGQLTRCIRCPVAYHSSDFCIP 129


>gi|149634094|ref|XP_001506476.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Ornithorhynchus anatinus]
          Length = 1437

 Score = 69.7 bits (169), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 49/159 (30%), Positives = 61/159 (38%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++  D L+ CEG C R FH          C  L    D       F C  CE 
Sbjct: 700 DTVCQVCESYSDSLVACEGECFRHFHL--------ECLGLTSLPDG-----KFVCVECET 746

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
             H CF+C       K  G +V  C  + CG FYH  CV                 +I  
Sbjct: 747 GHHTCFSC-------KVPGKDVKRCSVSACGKFYHEACVRI------------FPTAIFE 787

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
              F CP H C  C   ++    S  +   C +CP AYH
Sbjct: 788 ARGFRCPQHCCSACSMEKDIHKASKGRMMRCLKCPIAYH 826


>gi|348520760|ref|XP_003447895.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Oreochromis niloticus]
          Length = 1167

 Score = 68.9 bits (167), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 52/158 (32%), Positives = 62/158 (39%), Gaps = 35/158 (22%)

Query: 229 VCSFCDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 287
           VC  C+  G DL+ CEG+C   FH         HC    L  D+        C+ C    
Sbjct: 461 VCQMCEQAGEDLVPCEGQCCGMFHL--------HCLGQTLNPDD-----KLLCQECSTGI 507

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
           H CF C KL         EV  C    CG FYH  CV    L            ++   +
Sbjct: 508 HSCFICKKLE-------GEVRRCHVPHCGKFYHEACVRLNTL------------TVFDNK 548

Query: 348 SFTCPLHKCCICKQG-ENKADSDL-QFAVCRRCPKAYH 383
            F CPLH C  C  G   K  S+  +   C RCP AYH
Sbjct: 549 GFRCPLHTCLSCHYGCRTKHKSNKGRLMRCLRCPVAYH 586


>gi|242080051|ref|XP_002444794.1| hypothetical protein SORBIDRAFT_07g028156 [Sorghum bicolor]
 gi|241941144|gb|EES14289.1| hypothetical protein SORBIDRAFT_07g028156 [Sorghum bicolor]
          Length = 212

 Score = 68.9 bits (167), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 30/51 (58%), Positives = 38/51 (74%)

Query: 854 LSGKSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHI 904
           LSGKSFYLPGS+  ++K M+QWNM+ PPL LWSR D+A  H  +A+   HI
Sbjct: 2   LSGKSFYLPGSLGVDNKVMEQWNMSPPPLSLWSRSDWAKRHSDIAKSMRHI 52


>gi|340500320|gb|EGR27208.1| myb domain protein [Ichthyophthirius multifiliis]
          Length = 1057

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 50/185 (27%), Positives = 76/185 (41%), Gaps = 36/185 (19%)

Query: 224 ELFDSVCSFCDNGGDLLCCEGRCLRSFH----ATIDAGEE-------SHCASLGLTKDEV 272
           +++  +CS C+       C+  C RSFH     TI+  +         +   L     ++
Sbjct: 697 DIYKKLCSLCEQPKCTFFCQSFCKRSFHEQCKETIEQKQNLKEKADIYYPKELNFEDLQL 756

Query: 273 EAMLN--FFCKNCEYKQHQCFACGKLGSSD------KETGAEVFPCVSATCGHFYHPHCV 324
           + M N  + C +C+     CF C   G  +      K+    V  C +A C  +YH +C+
Sbjct: 757 QQMTNNKYICIDCQGNMVICFICKNYGIYNNNQKLKKQKNDSVNKCSTANCNKYYHLNCI 816

Query: 325 SKLLLRDDEVAADQLAKSIIA-GESFTCPLHKCCICKQ-GENKADSDLQFAVCRRCPKAY 382
                         L K I +  + F C LH C  C Q G++K  S      C +CPKAY
Sbjct: 817 KNA----------PLFKFIDSNNKRFRCSLHYCASCTQSGDSKVISQ-----CIKCPKAY 861

Query: 383 HRKCL 387
           H  CL
Sbjct: 862 HLGCL 866


>gi|126303359|ref|XP_001372863.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Monodelphis
           domestica]
          Length = 1435

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 61/159 (38%), Gaps = 34/159 (21%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++  D L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 699 DTVCQICESYSDSLVVCEGECCKHFHPD--------CLGLSSLPDG-----RFLCTECKT 745

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
               CF+C       K  G +V  C    CG FYH  CV K             A ++  
Sbjct: 746 GHRTCFSC-------KIPGKDVKRCSVPACGKFYHEACVRK------------FATAVFE 786

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
              F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 787 SRGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPIAYH 825


>gi|301615056|ref|XP_002936997.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Xenopus (Silurana) tropicalis]
          Length = 2440

 Score = 67.4 bits (163), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 55/198 (27%), Positives = 73/198 (36%), Gaps = 51/198 (25%)

Query: 225  LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
            + ++VC  C+  G+LL CE +C  +FH             LG+   E      F C  C 
Sbjct: 1356 MKENVCQVCEKPGELLLCEAQCCGAFHLQC----------LGM---EAMPQGKFVCTECS 1402

Query: 285  YKQ----HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLA 340
                   H CF C       KE+   V  C+   CG +YH  C            A +  
Sbjct: 1403 SGTLQGYHTCFVC-------KESDQGVKRCMLPLCGKYYHEEC------------ALKYP 1443

Query: 341  KSIIAGESFTCPLHKCCIC--KQGENKADSDLQFAVCRRCPKAYHRK--CLPRKIAFEDK 396
             +      F C LH C  C      N + S  +   C RCP AYH    CLP        
Sbjct: 1444 PATQQNRGFRCSLHICSTCYATNPSNPSASKGRLMRCVRCPIAYHANDFCLP-------- 1495

Query: 397  LEEGIITRAWEGLL-PNH 413
               G +T A   ++ PNH
Sbjct: 1496 --AGAVTLASNSIICPNH 1511


>gi|241998002|ref|XP_002433644.1| set domain protein, putative [Ixodes scapularis]
 gi|215495403|gb|EEC05044.1| set domain protein, putative [Ixodes scapularis]
          Length = 729

 Score = 67.0 bits (162), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 50/178 (28%), Positives = 73/178 (41%), Gaps = 36/178 (20%)

Query: 211 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKD 270
           V   +E+ + E+    + VC+ C+  G  L C G C  +FHA            LG++  
Sbjct: 179 VGPLQENSAVEKPPTKEEVCTVCEKMGATLFCTGPCKLAFHAD----------CLGVS-- 226

Query: 271 EVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLR 330
                  F C  C   +H C  C  LG ++K        C   +CG FYH  C++KL L 
Sbjct: 227 --HVPRAFVCDECTTGEHLCLVCKDLGETEK--------CSLESCGCFYHKKCLAKLPL- 275

Query: 331 DDEVAADQLAKSIIAGESFTCPLHKCCICKQGE-NKADSDLQFAVCRRCPKAYHRKCL 387
                        +  + F CP H C  C Q +    ++  +   C RCP A+H  CL
Sbjct: 276 ------------PLKQDPFVCPRHFCLGCFQEKPTTLNAKGRLLRCVRCPSAFHVGCL 321


>gi|431897324|gb|ELK06586.1| Putative histone-lysine N-methyltransferase NSD2 [Pteropus alecto]
          Length = 915

 Score = 66.6 bits (161), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 48/173 (27%), Positives = 62/173 (35%), Gaps = 44/173 (25%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           + VC  C+  G L  C G C  +FH +           LGL +   E  L   C  C   
Sbjct: 594 EHVCQLCEKTGGLALCVGPCCGAFHPSC----------LGLPRTP-EGRLT--CHECASG 640

Query: 287 Q---------HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 337
           +         H CF C       KE+  +V  CV + CG FYH  CV +  L        
Sbjct: 641 KCRVPTGPWVHSCFVC-------KESAGDVKRCVVSQCGKFYHEACVRRFPL-------- 685

Query: 338 QLAKSIIAGESFTCPLHKCCIC---KQGENKADSDLQFAVCRRCPKAYHRKCL 387
               ++     F CPLH C  C        +         C  CP A+H  CL
Sbjct: 686 ----AVFESRGFRCPLHSCVSCHASNPSNPRPSKGGSLLCCESCPAAFHPDCL 734


>gi|213624868|gb|AAI71696.1| Wolf-Hirschhorn syndrome candidate 1 [Danio rerio]
          Length = 1461

 Score = 66.6 bits (161), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 64/160 (40%), Gaps = 36/160 (22%)

Query: 227 DSVCSFCDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           +SVC  C+  G D++ C G+C  ++H         HC  +  + +++       C  C  
Sbjct: 759 ESVCLVCEQTGEDIVTCAGQCYGTYHL--------HCIGVERSAEKI------LCTACST 804

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
             H CF C       K++  EV  C +  CG FYH  CV    L            ++  
Sbjct: 805 GVHVCFTC-------KKSEGEVRRCCALHCGRFYHEACVRLSAL------------TVFE 845

Query: 346 GESFTCPLHKCCICKQGENKA--DSDLQFAVCRRCPKAYH 383
              F CPLH C  C      A   +  +   C RCP AYH
Sbjct: 846 NRGFRCPLHTCLSCHYSGRAACKATKGKMMRCLRCPVAYH 885


>gi|449665927|ref|XP_002164851.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Hydra
           magnipapillata]
          Length = 1214

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/159 (28%), Positives = 63/159 (39%), Gaps = 31/159 (19%)

Query: 227 DSVCSFCDNG-GDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D  C  C+   G L+ C+G C  SFH          C  +   K       +F C  C  
Sbjct: 450 DLYCCICEGSEGQLITCQGSCFNSFHF--------DCLGVSCIKT------SFTCDECLS 495

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
             H CF C K G+        +  C    CG  YH  C+ K+ +    V  +   K    
Sbjct: 496 NNHCCFFCKKPGA--------ILKCSHNMCGKHYHQDCLIKIPVIKSNVNENNANK---- 543

Query: 346 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHR 384
              F CPLH C +C +  +K +   +   C RCP AYH+
Sbjct: 544 ---FICPLHNCRLCSEKASKGNLT-KLLKCIRCPTAYHQ 578


>gi|128485462|ref|NP_001076020.1| probable histone-lysine N-methyltransferase NSD2 [Danio rerio]
          Length = 1461

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 64/160 (40%), Gaps = 36/160 (22%)

Query: 227 DSVCSFCDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           +SVC  C+  G D++ C G+C  ++H         HC  +  + +++       C  C  
Sbjct: 759 ESVCLVCEQTGEDIVTCAGQCYGTYHL--------HCIGVERSAEKI------LCTACST 804

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
             H CF C       K++  EV  C +  CG FYH  CV    L            ++  
Sbjct: 805 GVHVCFTC-------KKSEGEVRRCCALHCGRFYHEACVRLSAL------------TVFE 845

Query: 346 GESFTCPLHKCCICKQGENKA--DSDLQFAVCRRCPKAYH 383
              F CPLH C  C      A   +  +   C RCP AYH
Sbjct: 846 NRGFRCPLHTCLSCHYSGRAACKATKGKMMRCLRCPVAYH 885


>gi|145484966|ref|XP_001428492.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395578|emb|CAK61094.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1405

 Score = 65.9 bits (159), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 79/211 (37%), Gaps = 70/211 (33%)

Query: 229  VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
            +C+ C+     L C G C R FH     G + +  + G      E  + + C++CE  + 
Sbjct: 1166 LCAICEKQKCDLFCRGFCRRQFHKECLEGGQYNGQTQG------EINIKYVCQDCEKYKG 1219

Query: 289  QCFACGKLGS-----------SDKE------------------TG--------------- 304
             CF C K G+            DK+                  TG               
Sbjct: 1220 TCFVCLKQGTFYPNQSKKKTQQDKQFIDDDGYYDNQGEQIIVATGKITRARAQQQLQQQL 1279

Query: 305  AEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS--IIAGE--SFTCPLHKCCICK 360
            +E+  C S  C   YH  CV             Q +K+  I+ GE   F C LH C  CK
Sbjct: 1280 SELVKC-SLNCHKLYHFACV-------------QTSKNFKILDGERQKFKCALHFCEKCK 1325

Query: 361  QGENKADSDLQFAVCRRCPKAYHRKCLPRKI 391
               N  D + +   C RCPK+YH KC P+ I
Sbjct: 1326 DKSN--DDNQKMIQCLRCPKSYHEKCAPKGI 1354


>gi|410898830|ref|XP_003962900.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Takifugu rubripes]
          Length = 1329

 Score = 65.9 bits (159), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/158 (31%), Positives = 62/158 (39%), Gaps = 36/158 (22%)

Query: 229 VCSFCDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 287
           VC  C+  G DL+ CEG+C   FH             LG+   E+E  L   C+ C    
Sbjct: 630 VCQSCEKAGEDLVPCEGQCCGMFHLQC----------LGMM--ELEDKL--LCQECSTGV 675

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
           H CF C       K++   V  C    CG FYH  C+    L            ++   +
Sbjct: 676 HSCFHC-------KKSEGSVRRCHVPHCGKFYHEACIRLNPL------------TVFDNK 716

Query: 348 SFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYH 383
            F CPLH C  C C        S  +   C RCP AYH
Sbjct: 717 GFRCPLHTCLGCCCSSRTKHKPSKGRLMRCLRCPVAYH 754


>gi|168031756|ref|XP_001768386.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680311|gb|EDQ66748.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 615

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 52/181 (28%), Positives = 76/181 (41%), Gaps = 30/181 (16%)

Query: 208 EDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGL 267
           +DM  +    +   +D+ F  +C  C + G+LLCCE RC   FH +           LGL
Sbjct: 12  KDMPKEDGRKKGKRDDDGFAFICDVCSDFGELLCCE-RCRSGFHLS----------CLGL 60

Query: 268 TKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKL 327
             D+   +  + C +C   + +CF C   GS +     ++  C    CG +YH  C SK 
Sbjct: 61  --DKCPDVEPWLCSSCAENKVRCFKCKAFGSLE----VDLVKCAHRNCGKYYHKDC-SKG 113

Query: 328 LLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            +R      D  A          CP H C  C++ +  A    +   C  CP AYH  C 
Sbjct: 114 WVRIPPKKTDNGA--------MVCPRHHCDACRKCQKNA----KLHRCLYCPVAYHESCS 161

Query: 388 P 388
           P
Sbjct: 162 P 162


>gi|432952957|ref|XP_004085262.1| PREDICTED: histone-lysine N-methyltransferase NSD2-like, partial
           [Oryzias latipes]
          Length = 1167

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 51/182 (28%), Positives = 68/182 (37%), Gaps = 40/182 (21%)

Query: 205 DMEEDMVHDTEEDESNEEDELFDSVCSFCDNGG-DLLCCEGRCLRSFHATIDAGEESHCA 263
           D   D +  T++ E  +E      VC  C+  G DL+ CEG+C   FH         HC 
Sbjct: 544 DSPSDSLDGTKKGEKRKE-----FVCQVCEQAGEDLVPCEGQCCGMFHL--------HCL 590

Query: 264 SLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHC 323
                 ++        C+ C    H CFAC       K++  EV  C    CG FYH  C
Sbjct: 591 GPSFKPED-----KLLCQQCRAGIHSCFAC-------KQSEGEVRRCHVVHCGKFYHEAC 638

Query: 324 VSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQ--FAVCRRCPKA 381
           +    L            ++   +   CPLH C  C  G        +     C RCP A
Sbjct: 639 IRLNPL------------TVFDNKGLRCPLHACLSCHYGSRTKHKSTKGRLMRCLRCPVA 686

Query: 382 YH 383
           YH
Sbjct: 687 YH 688


>gi|86278478|gb|ABC88477.1| Wolf-Hirschhorn syndrome candidate 1 protein [Danio rerio]
          Length = 1366

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 64/160 (40%), Gaps = 36/160 (22%)

Query: 227 DSVCSFCDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           +SVC  C+  G D++ C G+C  ++H         HC  +  + +++       C  C  
Sbjct: 664 ESVCLVCEQTGEDIVTCAGQCYGTYHL--------HCIGVERSAEKI------LCTACST 709

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
             H CF C       K++  EV  C +  CG FYH  CV    L            ++  
Sbjct: 710 GVHVCFTC-------KKSEGEVRRCCALHCGRFYHEACVRLSAL------------TVFE 750

Query: 346 GESFTCPLHKCCICKQGENKA--DSDLQFAVCRRCPKAYH 383
              F CPLH C  C      A   +  +   C RCP AYH
Sbjct: 751 NRGFRCPLHTCLSCHYSGRAACKATKGKMMRCLRCPVAYH 790


>gi|260800140|ref|XP_002594994.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
 gi|229280233|gb|EEN51005.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
          Length = 1541

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 47/165 (28%), Positives = 64/165 (38%), Gaps = 43/165 (26%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           +++C  C+  G+LL CEG C  +FH  +D         +GL +        F C  C   
Sbjct: 680 ENLCQVCEQVGELLLCEGSCCGAFH--LDC--------IGLQQ---MPTGTFKCDECISG 726

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            H CF C K   + K        C    CG +YH  C+ K               ++   
Sbjct: 727 VHTCFVCRKSEVTTKR-------CSIPICGKYYHEDCLRK------------FPNTVFEA 767

Query: 347 ESFTCPLHKCCICKQGENKADSDLQ--------FAVCRRCPKAYH 383
           + F CPLH C  C      A  D++         A C RCP AYH
Sbjct: 768 KGFRCPLHVCGTCVA---VAGGDVKKVKSRGRILARCVRCPTAYH 809


>gi|441664377|ref|XP_003279042.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Nomascus
           leucogenys]
          Length = 780

 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 55/176 (31%), Positives = 68/176 (38%), Gaps = 50/176 (28%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTK-------------DEVEAM 275
           VC  C+  G LL CEG C  +FH           A LGL++             D  + +
Sbjct: 142 VCQLCEKPGSLLLCEGPCCGAFHL----------ACLGLSRRPEGRFTCSECASDRSKVI 191

Query: 276 LN---FFCKNC---EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 329
           L+   F  +N    E   H CF C       KE+  +V  CV   CG FYH  CV K  L
Sbjct: 192 LDGWIFLLENAWTSEQGIHSCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL 244

Query: 330 RDDEVAADQLAKSIIAGESFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
                       ++     F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 245 ------------TVFESRGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 288


>gi|47222897|emb|CAF99053.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 768

 Score = 64.3 bits (155), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 46/157 (29%), Positives = 63/157 (40%), Gaps = 33/157 (21%)

Query: 228 SVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 287
           ++   C+  GDLL C+G C  +FH          C  L    +       FFC+ C    
Sbjct: 45  TILQVCERTGDLLVCDGHCYGAFH--------PQCIGLSAAPEG-----RFFCRECSSGV 91

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
           H CF C K G+  K        C+   CG FYH  C++              + +    +
Sbjct: 92  HSCFVCKKSGNGVKR-------CMIPLCGKFYHSDCIA------------TFSATQPHNK 132

Query: 348 SFTCPLHKCCICKQGENKAD-SDLQFAVCRRCPKAYH 383
            F CPLH C  C      ++ S  + A C RCP AYH
Sbjct: 133 GFRCPLHVCLSCHVAYPHSNCSKGRLARCVRCPVAYH 169


>gi|47226564|emb|CAG08580.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1404

 Score = 63.9 bits (154), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 54/134 (40%), Gaps = 32/134 (23%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C+  G+ L+ CEG C R FH             LGLT         F C  C  
Sbjct: 713 DTVCQICEAYGEGLVVCEGDCSRQFHLEC----------LGLT---ALPEGRFTCLECLN 759

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            +H CF+C       K  G EV  C  + CG FYH  CV KL           L  +   
Sbjct: 760 GKHPCFSC-------KTAGREVTRCSVSGCGCFYHEDCVRKL-----------LGTTSSP 801

Query: 346 GESFTCPLHKCCIC 359
           G  F CP H C  C
Sbjct: 802 GGGFCCPQHICSTC 815


>gi|224129130|ref|XP_002320508.1| predicted protein [Populus trichocarpa]
 gi|222861281|gb|EEE98823.1| predicted protein [Populus trichocarpa]
          Length = 190

 Score = 63.5 bits (153), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 43/82 (52%), Gaps = 5/82 (6%)

Query: 857 KSFYLPGSVDENDKQMDQWNMTAPPLYLWSRHDYAAHHKALAEKHGHISRPQSRTQMERN 916
           KSFYL GS   N KQ++ W +  PP+Y+WSR D+ A    +A +HGH  R     +  + 
Sbjct: 29  KSFYLSGSAVANGKQIEDWTVNTPPIYIWSRADWTAWRIPIAREHGHGIRALVTGEQSKG 88

Query: 917 CY-----ETHAVDHPKEEGQGD 933
           C+     E    DH K+   G+
Sbjct: 89  CFGYVDIEGWESDHTKKHSHGN 110


>gi|427798749|gb|JAA64826.1| Putative histone-lysine n-methyltransferase nsd2, partial
           [Rhipicephalus pulchellus]
          Length = 755

 Score = 63.5 bits (153), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 49/173 (28%), Positives = 69/173 (39%), Gaps = 36/173 (20%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML 276
           +E+  +++    +C  C+  G  L C G C  SFH  +D         +G+T+       
Sbjct: 258 NENGIDEKAKGQLCLICEATGATLTCTGPCRMSFH--LDC--------IGITQPPDA--- 304

Query: 277 NFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAA 336
            F C  C    H C  C    ++ K        C    CG FYH  C+  L L   E   
Sbjct: 305 -FLCDECTTGSHSCLVCKGTEATQK--------CSMEQCGTFYHISCLKTLPLPIKE--- 352

Query: 337 DQLAKSIIAGESFTCPLHKCCICKQGENKADS-DLQFAVCRRCPKAYHRKCLP 388
                     +S  CPLH C  C Q + +A S   +   C +CP A+H  CLP
Sbjct: 353 ----------DSLVCPLHFCLPCLQQKPRAVSMKGRLLRCVQCPSAFHSGCLP 395


>gi|326671180|ref|XP_694414.5| PREDICTED: histone-lysine N-methyltransferase NSD3 [Danio rerio]
          Length = 1562

 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 51/167 (30%), Positives = 64/167 (38%), Gaps = 50/167 (29%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATI---DAGEESHCASLGLTKDEVEAMLNFFCKN 282
           D+VC  C+  GD L+ CEG C R FH      ++G+ES                   C+ 
Sbjct: 811 DTVCHVCETFGDSLVSCEGDCNRLFHPECMGSNSGKESE----------------TVCQE 854

Query: 283 CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS 342
           C+   H CF+C       K T  ++  C    CG +YH  CV K               S
Sbjct: 855 CKTGSHPCFSC-------KVTEGDMKRCSVNGCGRYYHETCVRK------------YTGS 895

Query: 343 IIAGESFTCPLHKCCICKQGENKADSDLQFA------VCRRCPKAYH 383
               +   CP H C  C       D DLQ A       C RCP AYH
Sbjct: 896 ASDTKGLRCPQHSCATC-----CLDRDLQKAGKGRMMRCIRCPVAYH 937


>gi|118367847|ref|XP_001017133.1| Type III restriction enzyme, res subunit family protein [Tetrahymena
            thermophila]
 gi|89298900|gb|EAR96888.1| Type III restriction enzyme, res subunit family protein [Tetrahymena
            thermophila SB210]
          Length = 2184

 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 55/239 (23%), Positives = 86/239 (35%), Gaps = 84/239 (35%)

Query: 221  EEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT----IDAGEESHCASLGLTK------- 269
            ++++LF  +C +CD     + C+G C RSFH+     ++AG  +       T+       
Sbjct: 1561 KKEDLFKKLCYYCDRPKCTVFCQGHCKRSFHSECKERVEAGWINENGMTSETRIVPTEDH 1620

Query: 270  ---DEVEAMLN--FFCKNCEYKQHQC---------FACGKLGSSDK-------------- 301
               + ++ MLN  + CK+CE     C         F    L   +K              
Sbjct: 1621 YDEERLKKMLNINYTCKDCEANTAICFICKKKGSFFPPAVLNKQNKRIQQQKQKIIAEKE 1680

Query: 302  ------------------------------ETGAEVFPCVSATCGHFYHPHCVSKLLLRD 331
                                          +  +++  C +A C  FYHP C+    L  
Sbjct: 1681 KQKDPSNEIVEESENELDEDYDLVEDDSVSQKTSDLTKCSTANCNKFYHPVCIKPYAL-- 1738

Query: 332  DEVAADQLAKSIIAG-ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPR 389
                     K I +  + F CPLH C  C       DS +  + C RCPKAYH +C  +
Sbjct: 1739 --------FKYIDSNNKRFRCPLHYCAHCFIS---GDS-MAISQCVRCPKAYHLRCFEK 1785


>gi|300121581|emb|CBK22099.2| unnamed protein product [Blastocystis hominis]
          Length = 213

 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/163 (29%), Positives = 74/163 (45%), Gaps = 36/163 (22%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           +C+ C+  G  + C G+C R+FH           A L  T   V++   + C +CE + H
Sbjct: 5   ICAECNKFGVDIVCFGKCHRAFHQ----------ACLPCT---VQSTRRWICPDCEKEVH 51

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
           +C  C +  S DK+    +  C +  C  F+HP CV +   R+                 
Sbjct: 52  RCHQCKEFAS-DKD----LIQCQATDCYLFFHPSCVPEQFRRN---------------RP 91

Query: 349 FTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKI 391
           F CP H C  C+Q E++ D  L+   C  C K+YH  CLP ++
Sbjct: 92  FLCPAHCCSFCRQWEDERDPLLK---CAYCCKSYHEYCLPLEV 131


>gi|426232375|ref|XP_004010202.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2 [Ovis aries]
          Length = 1273

 Score = 60.8 bits (146), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 58/149 (38%), Gaps = 40/149 (26%)

Query: 237 GDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKL 296
           G L+ CEG C  +FH           A LGL++           +      H CF C   
Sbjct: 627 GSLVLCEGPCCGAFHL----------ACLGLSRRP---------EGRSRGTHSCFVC--- 664

Query: 297 GSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKC 356
               KE+ ++V  CV + CG FYH  CV +  L            ++     F CPLH C
Sbjct: 665 ----KESKSDVKRCVVSQCGKFYHEACVRRFPL------------TVFESRGFRCPLHSC 708

Query: 357 CICKQGE--NKADSDLQFAVCRRCPKAYH 383
             C      N   S  +   C RCP AYH
Sbjct: 709 LSCHASNPSNPRPSKGKMLRCVRCPVAYH 737


>gi|47216786|emb|CAG03790.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1443

 Score = 60.1 bits (144), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 44/154 (28%), Positives = 57/154 (37%), Gaps = 35/154 (22%)

Query: 233 CDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCF 291
           C+  G DL+ CEG+C   FH          C  + L  D+        C+ C    H CF
Sbjct: 676 CEEAGEDLVPCEGQCCGMFHL--------QCLGVLLEPDD-----KVLCQECSTGVHSCF 722

Query: 292 ACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTC 351
            C       K++   V  C    CG FYH  C+    L            ++   + F C
Sbjct: 723 HC-------KKSEGSVRRCHVPHCGKFYHEACIRLNPL------------TVFDNKGFRC 763

Query: 352 PLHKCCICKQGENKADSDLQ--FAVCRRCPKAYH 383
           PLH C  C  G        +     C RCP AYH
Sbjct: 764 PLHTCLGCCYGNRTKPKSTKGRLMRCLRCPVAYH 797


>gi|47221386|emb|CAF97304.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 722

 Score = 60.1 bits (144), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/161 (27%), Positives = 61/161 (37%), Gaps = 34/161 (21%)

Query: 225 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
           L  +V   C+  G+LL CEG+C  +FH          C SL            F C  C+
Sbjct: 42  LLVTVVQVCEKTGELLLCEGQCCGAFHLP--------CISLAEAPKG-----KFICPECK 88

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
              H CF C       K+   +V  C+   CG FYH  C++              A ++ 
Sbjct: 89  SGIHTCFVC-------KKRSDDVRRCMIPVCGKFYHGECIA------------NFAPTVA 129

Query: 345 AGESFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYH 383
               F C +H C  C      +   S  +   C RCP AYH
Sbjct: 130 VNRGFRCSIHVCLTCFIANPNSSTISKGRLVRCVRCPIAYH 170


>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
            domestica]
          Length = 2705

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 52/190 (27%), Positives = 66/190 (34%), Gaps = 45/190 (23%)

Query: 204  DDMEEDMVHD-------TEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDA 256
            D +EE + HD         + E      L ++VC  C+  G+LL CE +C  +FH     
Sbjct: 1517 DSIEESVEHDHGMPVSKKMQAERGGGAALKENVCQNCEKVGELLLCEAQCCGAFHLEC-- 1574

Query: 257  GEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCG 316
                    +GLT+        F CK C    H CF C       K  G +V  C+   CG
Sbjct: 1575 --------IGLTE---MPKGKFICKECRTGIHTCFVC-------KPDGEDVKRCLLPLCG 1616

Query: 317  HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKAD---SDLQFA 373
             F                              F C LH C  C    N A    S  +  
Sbjct: 1617 KFT--------------XXXXXXXXXXXXXXGFRCSLHMCITC-HAANPASLSASKGRLM 1661

Query: 374  VCRRCPKAYH 383
             C RCP AYH
Sbjct: 1662 RCVRCPVAYH 1671


>gi|428175286|gb|EKX44177.1| hypothetical protein GUITHDRAFT_109961 [Guillardia theta CCMP2712]
          Length = 707

 Score = 59.3 bits (142), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 57/111 (51%), Gaps = 10/111 (9%)

Query: 722 IVDKLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYD-ILPAKNDFNFEKR 780
           +V+ L   +N  D  VDF CG N F  ++ +    +     + +YD  +P   D N+  +
Sbjct: 555 LVEALQGVINQSDHFVDFACGNNRFGSILCQCFGLS-----WSSYDEAVP--EDRNYMGK 607

Query: 781 DWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVP 831
            WM   P ++   + +++G+ PP      +   ++  AL+F P+LL+L+VP
Sbjct: 608 SWMLAGPSDVP--ANVVIGIFPPLHKGPIMCEMYLQHALQFRPRLLVLLVP 656


>gi|145491053|ref|XP_001431526.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398631|emb|CAK64128.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1668

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 48/215 (22%), Positives = 84/215 (39%), Gaps = 57/215 (26%)

Query: 219  SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCAS------------- 264
            S  ++EL   +C  C+     + C G C R+FH A  D  E +   +             
Sbjct: 1240 SKSKEELIKKLCYVCERPNCSVFCMGHCRRAFHIACKDLLETTEYINIEGPDQDFLNNHS 1299

Query: 265  ---LGLTKDEVEAMLN--FFCKNCEYKQHQCFACGKLGSS--DKETGAEVFP-------- 309
               L  ++++++  +N  + C +C      C  C   G+   +K+   EV          
Sbjct: 1300 FPELNWSEEQLKENVNIRYSCPDCRNSLVVCLLCKTKGTYPPEKKQKEEVIASSDENDPL 1359

Query: 310  ---------------CVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG-ESFTCPL 353
                           C +A C  ++H +C+           A+ L+K++ +  E F CP 
Sbjct: 1360 EDNVRKIKNKSAISKCSTANCNRYFHLNCIQ----------ANPLSKTLDSNVELFRCPS 1409

Query: 354  HKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 388
            H C  CK   N +++      C RC +++H KC P
Sbjct: 1410 HVCVFCKV--NSSNTTTALIHCVRCCRSFHSKCAP 1442


>gi|198420162|ref|XP_002123587.1| PREDICTED: similar to Wolf-Hirschhorn syndrome candidate 1-like 1
            [Ciona intestinalis]
          Length = 1485

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/163 (30%), Positives = 73/163 (44%), Gaps = 27/163 (16%)

Query: 227  DSVCSFCDNGGD---LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNC 283
            +++CS C++  D   L+ C G C  SFH          C    L +  + ++++F CK C
Sbjct: 1331 ENICSKCESASDEEELVHCIGGCCASFHP--------RC----LEEGAITSIVSFTCKLC 1378

Query: 284  EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSK---LLLRDDEVAADQLA 340
            +   H CF C K  S+D   G E+  C    CG  YH  C+++    LL+ D+    +  
Sbjct: 1379 KEDNHPCFICKKSSSNDDSLG-EIKRCSVPKCGRSYHTTCLTERKIALLQHDDDDQKKQD 1437

Query: 341  KSIIAGE-----SFTCPLHKCCICKQGENKADSDLQFAVCRRC 378
            K           S  C LH+C  C   E K DS++  A   RC
Sbjct: 1438 KEKEKERQAILNSSKCTLHQCGTCTTKE-KEDSNV--ASSNRC 1477


>gi|94732456|emb|CAK03662.1| novel protein similar to vertebrate Wolf-Hirschhorn syndrome
           candidate 1 (WHSC1) [Danio rerio]
          Length = 728

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 64/160 (40%), Gaps = 36/160 (22%)

Query: 227 DSVCSFCDNGG-DLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           +SVC  C+  G D++ C G+C  ++H         HC  +  + +++       C  C  
Sbjct: 26  ESVCLVCEQTGEDIVTCAGQCYGTYHL--------HCIGVERSAEKI------LCTACST 71

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
             H CF C       K++  EV  C +  CG FYH  CV    L            ++  
Sbjct: 72  GVHVCFTC-------KKSEGEVRRCCALHCGRFYHEACVRLSAL------------TVFE 112

Query: 346 GESFTCPLHKCCICKQGENKA--DSDLQFAVCRRCPKAYH 383
              F CPLH C  C      A   +  +   C RCP AYH
Sbjct: 113 NRGFRCPLHTCLSCHYSGRAACKATKGKMMRCLRCPVAYH 152


>gi|357627347|gb|EHJ77076.1| hypothetical protein KGM_14526 [Danaus plexippus]
          Length = 1912

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/125 (32%), Positives = 55/125 (44%), Gaps = 10/125 (8%)

Query: 271  EVEAMLNFFCKNCE-YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 329
            E+     F CKNC+ Y    CF C K   S KE       C  A C  +YH  C+     
Sbjct: 1276 EITDYSEFKCKNCQKYDPPVCFVC-KYPISPKEKQGHRQKCQVAHCNKYYHLECLDHWP- 1333

Query: 330  RDDEVAADQLAKSIIAGESFTCPLHKC--CICK--QGENKADSDLQFAVCRRCPKAYH-- 383
               +    +++++    E+ TCP H C  C+C   +G     S  + A C RCP  YH  
Sbjct: 1334 -QTQFNGGEISRTNKFSEALTCPRHVCHTCVCDDPRGCKTRFSGDKLARCVRCPATYHTF 1392

Query: 384  RKCLP 388
             KCLP
Sbjct: 1393 TKCLP 1397


>gi|145522682|ref|XP_001447185.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414685|emb|CAK79788.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1508

 Score = 58.2 bits (139), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 50/225 (22%), Positives = 86/225 (38%), Gaps = 57/225 (25%)

Query: 209  DMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCAS--- 264
            D+  D     S  ++EL   +C  C+     + C G C R+FH A  D  E +   +   
Sbjct: 1078 DLGEDARRLASKSKEELIKKLCYVCERPNCSVFCMGHCRRAFHIACKDLLETTEYINIEG 1137

Query: 265  -------------LGLTKDEVEAMLN--FFCKNCEYKQHQCFACGKLGSS--DKETGAEV 307
                         L  ++++++  +N  + C +C      C  C   G+   +K+   EV
Sbjct: 1138 PDQDFLNNHSFPELNWSEEQLKENVNIRYSCPDCRNCLVVCLLCKSKGTYPPEKKQKEEV 1197

Query: 308  FP-----------------------CVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
                                     C +A C  ++H  C+           A+ L+K++ 
Sbjct: 1198 IASSDENDPLEDNVKKIKNKSAISKCSTANCNRYFHLSCIQ----------ANPLSKTLD 1247

Query: 345  AG-ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 388
            +  E F CP H C  CK   N +++      C RC +++H KC P
Sbjct: 1248 SNVELFRCPSHVCVFCKV--NSSNTTTALIHCVRCCRSFHSKCAP 1290


>gi|167526088|ref|XP_001747378.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774213|gb|EDQ87845.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1048

 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 60/173 (34%), Gaps = 49/173 (28%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           D  C  C   G L+CC+G C  +FH          C S+ L     E    F C  C   
Sbjct: 439 DDTCRICGELGQLICCDGGCRGAFHL--------ECLSI-LQPPTGE----FRCDECSTG 485

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            H C+ C K+       GA++  C    C   YH  C  K      +  AD         
Sbjct: 486 NHTCYTCDKV-------GADLIKCQFPHCNKLYHRGCAEK------QFKADNF------- 525

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEE 399
                    C +C  G +         VC  CP AYH  C+    AF  K +E
Sbjct: 526 ---------CLVCGTGGD-------LVVCDGCPGAYHAACIKSTFAFTGKPDE 562


>gi|384253423|gb|EIE26898.1| hypothetical protein COCSUDRAFT_59402 [Coccomyxa subellipsoidea
           C-169]
          Length = 534

 Score = 57.4 bits (137), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 46/166 (27%), Positives = 60/166 (36%), Gaps = 43/166 (25%)

Query: 230 CSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQ 289
           C  C +GG+LL C+G CLRSFH    A  E         +   E      C         
Sbjct: 146 CGCCGDGGELLECDGMCLRSFHQNCLAPSE---------RPNPENPPETPC--------- 187

Query: 290 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKL----LLRDDEVAADQLAKSIIA 345
                            + PC   +CG ++H  C++ L    LL+         A     
Sbjct: 188 -----------------ILPCKMGSCGWYFHNACLNGLRDSGLLKIHREDPGVAAGGEDG 230

Query: 346 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKI 391
              FTCP H C +C          LQ   C RCP AYH  C+P  +
Sbjct: 231 QRVFTCPAHFCHVCGN-SGAGRHTLQ---CWRCPTAYHATCVPAGV 272


>gi|145512399|ref|XP_001442116.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409388|emb|CAK74719.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1659

 Score = 57.4 bits (137), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 46/217 (21%), Positives = 80/217 (36%), Gaps = 56/217 (25%)

Query: 219  SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATI-----------------DAGEESH 261
            S  ++EL   +C  C+     + C G C R+FH                    D   ++ 
Sbjct: 1240 SKSKEELIKKLCYVCERSNCSIFCMGHCRRAFHDACKELLETTDYINVEGPDQDFLNKNA 1299

Query: 262  CASLGLTKDEVEAMLN--FFCKNCEYKQHQCFACGKLGSS--DKETGAEVF--------- 308
                  + ++++  +N  + C +C      C  C + G+   DK+   E+          
Sbjct: 1300 FPEFNWSGEQLKEKINIRYSCPDCRNNLVVCLLCKQKGTYPPDKKQKEEIVQIDEFDPSD 1359

Query: 309  -------------PCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI-IAGESFTCPLH 354
                          C +A C  ++H  C+           A+ L+KS+    + F CP H
Sbjct: 1360 DNVKKSKNKSIISKCSTANCNRYFHLSCIQ----------ANPLSKSLDTNADLFRCPSH 1409

Query: 355  KCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKI 391
             C  CK   N ++       C RC +++H KC P +I
Sbjct: 1410 ICVFCKI--NSSNMTTALIHCIRCCRSFHSKCAPPEI 1444


>gi|348683653|gb|EGZ23468.1| hypothetical protein PHYSODRAFT_479200 [Phytophthora sojae]
          Length = 466

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 47/163 (28%), Positives = 65/163 (39%), Gaps = 23/163 (14%)

Query: 225 LFDSVCSFCDNGG-DLLCCEGRCLRSFHA-TIDAGEESHCASLGLTKDEVEAMLNFFCKN 282
           LF S C  C NGG  LL C G C R+ H   +D  +           ++ +A   + C  
Sbjct: 99  LFPSSCFLCGNGGRGLLHCGGSCARAAHQHCVDKLQAPVVGQPLSAAEKKQAAERWKCAQ 158

Query: 283 CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS 342
           C    H+C  CG LG   +  G +   C    CG+ +H  C+        + A D+    
Sbjct: 159 CLRGLHRCQRCGFLGH--ESNGMK--KCSVLDCGYHFHAQCLP-------DAAQDE---G 204

Query: 343 IIAGESFTCPLHKCCICKQGENKADSDLQFAVCR-----RCPK 380
           + AG  F CP H C  C   E         +VC      RCP+
Sbjct: 205 VHAG--FVCPRHTCATCGAQETDMRRCKSCSVCHHMTHFRCPR 245


>gi|410958014|ref|XP_003985618.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Felis
           catus]
          Length = 1300

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 40/98 (40%), Gaps = 21/98 (21%)

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
           H CF C       KE+  +V  CV + CG FYH  CV K  L            ++    
Sbjct: 650 HSCFVC-------KESKTDVRRCVVSQCGKFYHEACVRKYPL------------TVFESR 690

Query: 348 SFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
            F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 691 GFRCPLHSCVTCHASNPSNPRPSKGKMMRCVRCPVAYH 728


>gi|351704076|gb|EHB06995.1| Putative histone-lysine N-methyltransferase NSD2 [Heterocephalus
           glaber]
          Length = 1372

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/98 (35%), Positives = 39/98 (39%), Gaps = 21/98 (21%)

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
           H CF C       KE  AEV  CV   CG FYH  CV +  L            ++    
Sbjct: 728 HACFVC-------KERKAEVKRCVVTQCGKFYHDACVRRYPL------------TVFESR 768

Query: 348 SFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
            F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 769 GFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 806


>gi|322788177|gb|EFZ13959.1| hypothetical protein SINV_06678 [Solenopsis invicta]
          Length = 1093

 Score = 53.9 bits (128), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 68/286 (23%), Positives = 103/286 (36%), Gaps = 69/286 (24%)

Query: 144 EVRPSQNDLVDHMDLIKEALERDDVLAKSKFLVTFLEE--KPTKRKLSDEVVQTKAMSGF 201
           E+  ++ D +  ++  KE+    D         TF++E  K    K+  E   +    G 
Sbjct: 380 EINTTKEDEISEINTTKESDTSTD---------TFIKENKKIKTEKIDRENSVSSEQDGP 430

Query: 202 IVDDMEEDMVHDTEEDESNEED-ELF-----DSVCSFCDNGGDLLCCEGRCLRSFH-ATI 254
            +   ++D V++TE+ E+      LF     + VC  C+  G L  C+G C   FH + +
Sbjct: 431 DIKRAKQDNVYNTEKSETKRRPYNLFKGMKQEKVCQICEKTGKLTRCKGPCYSYFHLSCV 490

Query: 255 DAGE-------------ESHCASLGLTKDEV--EAMLN----------FFCKNC-EYKQH 288
             GE             E   + L + K  +  E+  N          F C +C      
Sbjct: 491 KPGESSPEYSVDENTLDERLLSDLNIIKRSINGESENNGKSDEQEDETFKCIDCLSGVAP 550

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C        E   +   C+   CG  YH  C+                +S   G  
Sbjct: 551 ACFLCN-------EREGDRIRCIVPACGKHYHSKCLI------------PWPQSHWQGGR 591

Query: 349 FTCPLHKCCICK----QGENKADSDLQFAVCRRCPKAYHRK--CLP 388
            TCP H C  C     Q       + + A C RCP +YH    CLP
Sbjct: 592 LTCPYHVCHTCSSDNPQNNRSRAPNEKVAKCVRCPSSYHASALCLP 637


>gi|345781638|ref|XP_003432154.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Canis
           lupus familiaris]
          Length = 742

 Score = 53.9 bits (128), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 45/159 (28%), Positives = 58/159 (36%), Gaps = 43/159 (27%)

Query: 227 DSVCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D+VC  C++ GD L+ CEG C + FH          C  L    D       F C  C+ 
Sbjct: 41  DTVCQICESSGDSLIPCEGECCKHFHL--------ECLGLASVPDG-----KFICIECK- 86

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
                            +G +V  C    CG FYH  CV K               +I  
Sbjct: 87  ---------------TVSGTDVKRCSVGACGKFYHEACVRK------------FPTAIFE 119

Query: 346 GESFTCPLHKCCICKQGEN-KADSDLQFAVCRRCPKAYH 383
            + F CP H C  C   ++    S  +   C RCP AYH
Sbjct: 120 SKGFRCPQHCCSACSMEKDIYKASKGRMMRCLRCPVAYH 158


>gi|449438600|ref|XP_004137076.1| PREDICTED: uncharacterized protein LOC101210001 [Cucumis sativus]
          Length = 541

 Score = 53.5 bits (127), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 37/92 (40%), Positives = 49/92 (53%), Gaps = 19/92 (20%)

Query: 166 DDVLAKSKFLVTFLEEKPTKRKLSDEV-------VQTKAMSGFIVDD----------MEE 208
           D     S FL+ FLEEKP KR    E+        Q      FIVDD          +EE
Sbjct: 28  DPCAVWSLFLLAFLEEKPVKRMSCHELGQLYFQDAQRNMQPSFIVDDSDDEDMVDDAVEE 87

Query: 209 DMVHDTEEDESNEEDELFDSVCSFCDNGGDLL 240
           +   D  E+ES  +D++FDSVC+FCDNGG+++
Sbjct: 88  ESDDDAVEEES--DDDVFDSVCAFCDNGGNII 117


>gi|340372263|ref|XP_003384664.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Amphimedon queenslandica]
          Length = 1171

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 44/167 (26%), Positives = 66/167 (39%), Gaps = 32/167 (19%)

Query: 228 SVCSFCDNGGD----LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNC 283
           ++CS CDN  +    ++ C+G CL SFH  +D         LGL          F C  C
Sbjct: 377 NICSICDNPSNPTQCIITCKGHCLNSFH--VDC--------LGLIS---PPSFPFVCDEC 423

Query: 284 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 343
                 C+AC K      +  + + PC   +C   YH  C+            D    ++
Sbjct: 424 TLSPSVCYACKK---PSVDPVSSLVPCSHHSCSQLYHLSCIRSC----GNFKFDSSNPNV 476

Query: 344 IAGESFTCPLHKC--CICKQGENKADSDLQFAVCRRCPKAYHR-KCL 387
           I     +C LH C  C+C  G      + +   C +CP + H+  CL
Sbjct: 477 I----LSCGLHSCAKCVCSDGP-PVTGNQKLMQCLKCPLSLHKYSCL 518


>gi|270014006|gb|EFA10454.1| hypothetical protein TcasGA2_TC012700 [Tribolium castaneum]
          Length = 1740

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 47/106 (44%), Gaps = 23/106 (21%)

Query: 290  CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 349
            C  CG++G+  ++       C    CG FYHP C+ KL          Q   S+ A +SF
Sbjct: 1007 CLVCGEMGAQGRQK------CSLHQCGRFYHPECL-KLW--------PQTQWSLNASDSF 1051

Query: 350  TCPLHKCCIC-----KQGENKADSDLQFAVCRRCPKAYHRK--CLP 388
             CP H C  C     +   ++  SD +   C +CP  YH    C+P
Sbjct: 1052 VCPRHVCHTCISDDPRAANSRCSSD-KIVKCLKCPATYHSSNYCVP 1096


>gi|91090902|ref|XP_973711.1| PREDICTED: similar to NSD1 [Tribolium castaneum]
          Length = 1795

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 47/106 (44%), Gaps = 23/106 (21%)

Query: 290  CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 349
            C  CG++G+  ++       C    CG FYHP C+ KL          Q   S+ A +SF
Sbjct: 1062 CLVCGEMGAQGRQK------CSLHQCGRFYHPECL-KLW--------PQTQWSLNASDSF 1106

Query: 350  TCPLHKCCIC-----KQGENKADSDLQFAVCRRCPKAYHRK--CLP 388
             CP H C  C     +   ++  SD +   C +CP  YH    C+P
Sbjct: 1107 VCPRHVCHTCISDDPRAANSRCSSD-KIVKCLKCPATYHSSNYCVP 1151


>gi|357162868|ref|XP_003579549.1| PREDICTED: uncharacterized protein LOC100839049 [Brachypodium
            distachyon]
          Length = 1416

 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 52/219 (23%), Positives = 88/219 (40%), Gaps = 47/219 (21%)

Query: 227  DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
            D  C FC +GG+LLCC+  C  ++H    + +E    S             ++C NC   
Sbjct: 1036 DDTCGFCGDGGELLCCDN-CPSTYHEACLSSQELPEGS-------------WYCHNC--- 1078

Query: 287  QHQCFACGK-LGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL-----RDDEVAADQLA 340
               C +CG  +   +  + +++  C+   CG  YH  C+ +++L     R D     +  
Sbjct: 1079 --TCRSCGNPVNEKEVSSFSDILKCLQ--CGDAYHNTCIDRVMLPSDGKRSDTWFCGRYC 1134

Query: 341  KSIIAGESFTCPLHKCCICKQG-ENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEE 399
            K I  G      LH     + G EN  ++DL + +  RC     R    +KI        
Sbjct: 1135 KEIFMG------LHS----QVGVENVINNDLSWTIL-RCNSDGQRLHSAQKI-------- 1175

Query: 400  GIITRAWEGLLPNHRILIYCLKHEIDDEIGTPIRDHIIF 438
            G++T     L     +L  C    +D   G  +  H+++
Sbjct: 1176 GLMTECNTKLAVALTLLEECFIRMVDPRTGVDMIPHVLY 1214


>gi|301115892|ref|XP_002905675.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110464|gb|EEY68516.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 470

 Score = 52.0 bits (123), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 51/166 (30%), Positives = 59/166 (35%), Gaps = 32/166 (19%)

Query: 225 LFDSVCSFCDNGG-DLLCCEGRCLRSFHAT----IDAGEESHCASLGLTKDEVEAMLNFF 279
           LF S C  C NGG  LL C G C RS H      + A       SL   K   E   N+ 
Sbjct: 92  LFPSHCFLCGNGGRGLLHCGGSCARSAHQNCVNQLQAPTVGEPVSLAERKQAAE---NWK 148

Query: 280 CKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 339
           C  C    H+C  CG +G         +  C    CG+ +H  C     L DDE      
Sbjct: 149 CAQCLRGLHRCQRCGFMGHEAN----GMRKCSVLDCGYHFHEQC-----LPDDET----- 194

Query: 340 AKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVC-----RRCPK 380
                A   F CP H C  C   E          VC      RCP+
Sbjct: 195 -----AHAGFVCPRHTCATCGTQETDMRRCKSCTVCDHMTHLRCPR 235


>gi|301605820|ref|XP_002932540.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Xenopus
           (Silurana) tropicalis]
          Length = 5215

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 54/218 (24%), Positives = 88/218 (40%), Gaps = 47/218 (21%)

Query: 211 VHDTEEDESNEEDELFDS----VCSFCDN-GGDLLCCEGRCLRSFHATIDAG----EESH 261
           VH  EE E    D+   S     C++C + G  + CCE  C +++H    AG    ++ +
Sbjct: 587 VHQNEEQELVSVDKAVLSGSTERCAYCKHLGATIKCCEETCTQTYHYPCAAGAGTFQDLN 646

Query: 262 CASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHP 321
           C +L L  + ++  L    ++       C  C   G    +   ++F     TCG  YH 
Sbjct: 647 CLTL-LCPEHIDQALERSKEDAN-----CALCDSSG----DLLDQLF---CTTCGQHYHG 693

Query: 322 HCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKA 381
            C+        ++A   L ++      + CP   C +C+  ++  D D Q  VC  C K 
Sbjct: 694 MCL--------DIAVTPLKRA-----GWQCP--DCKVCQNCKHSGD-DNQMLVCDTCDKG 737

Query: 382 YHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYC 419
           YH  CL         + + + T  W+    N RI   C
Sbjct: 738 YHTFCL-------QPVMDSVPTNGWK--CKNCRICTEC 766


>gi|403352806|gb|EJY75922.1| Type III restriction enzyme, res subunit family protein [Oxytricha
            trifallax]
          Length = 2584

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 38/85 (44%), Gaps = 12/85 (14%)

Query: 307  VFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKA 366
            V  C    C  +YH  CV     ++  V+     K     + F CPLH C IC+      
Sbjct: 2299 VVRCTLGGCFKYYHVDCV----YQNKNVSCINSMKV----QRFRCPLHYCEICQAS---G 2347

Query: 367  DSDLQFAVCRRCPKAYHRKCLPRKI 391
            DS L    C  CP AYH KCLP+ +
Sbjct: 2348 DSVL-IVQCVNCPTAYHLKCLPQDL 2371


>gi|294461989|gb|ADE76550.1| unknown [Picea sitchensis]
          Length = 135

 Score = 51.6 bits (122), Expect = 0.003,   Method: Composition-based stats.
 Identities = 31/73 (42%), Positives = 39/73 (53%), Gaps = 2/73 (2%)

Query: 1337 MSTSAMDRYAPRL-HQLNNTRMNTFRSEPF-MPSRFGFYDSRAPQPGFFADMDFGPGFHP 1394
            +++SAM RYAPRL  Q+N TR N  + E     S +G       Q   F   DF PG   
Sbjct: 63   LTSSAMQRYAPRLDQQMNYTRSNLHQIEQAPRNSIYGAGQVLGSQNFPFDSRDFAPGPQN 122

Query: 1395 PFPQQGSGGWLDD 1407
            P+P Q S GW+DD
Sbjct: 123  PYPHQSSSGWIDD 135


>gi|196013861|ref|XP_002116791.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
 gi|190580769|gb|EDV20850.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
          Length = 725

 Score = 51.2 bits (121), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 48/170 (28%), Positives = 69/170 (40%), Gaps = 41/170 (24%)

Query: 229 VCSFCDNGGD-LLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 287
           VC  C+  G+ L+ C+G C   FH             LG+T+  V+    F C+ C    
Sbjct: 40  VCMLCEGIGEKLVTCQGGCFSKFHLE----------CLGITEWPVKG---FKCQECTSGT 86

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
            +CF CG        +   V PC  A CG  YH  C+S+      +V+ +   +      
Sbjct: 87  RKCFVCG-------SSEPPVLPCKVAGCGKCYHDTCISQFTC--GQVSEEGCIR------ 131

Query: 348 SFTCPLHKCCICKQ---GENKADSDL----QFAVCRRCPKAYH--RKCLP 388
              CP H C  C      +N  + D+    + A C  CP +YH    CLP
Sbjct: 132 ---CPRHVCTTCTAFAISKNDENFDVVIQKKLARCIYCPVSYHASEACLP 178


>gi|157126101|ref|XP_001654536.1| set domain protein [Aedes aegypti]
 gi|108873380|gb|EAT37605.1| AAEL010414-PA [Aedes aegypti]
          Length = 1480

 Score = 51.2 bits (121), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 39/166 (23%), Positives = 61/166 (36%), Gaps = 23/166 (13%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNC-EYKQ 287
           VC+ C    DL+ C  +C   +H             + +  D       F C +C   K 
Sbjct: 813 VCAVCTKPHDLVKC-TKCYNHYHLAC-------LTDVPIKPDPAGENKTFTCTDCVMLKA 864

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
             CF C     + KE   E F CV   CG  YH +C+             +   +     
Sbjct: 865 PTCFVCNDQDDAVKE--EEKFRCVMNGCGKQYHLNCLRLF-------PQHKFTGTSSKSS 915

Query: 348 SFTCPLHKCCICKQGENKADSDL---QFAVCRRCPKAYHR--KCLP 388
           +  CP H C  C   + ++++         C +CP +YH   +C+P
Sbjct: 916 TLYCPTHTCHTCVSDDPRSNATTTKGHLIRCIKCPSSYHTEARCIP 961


>gi|40789042|dbj|BAA83042.2| KIAA1090 protein [Homo sapiens]
          Length = 715

 Score = 50.8 bits (120), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 39/98 (39%), Gaps = 21/98 (21%)

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
           H CF C       KE+  +V  CV   CG FYH  CV K  L            ++    
Sbjct: 65  HSCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESR 105

Query: 348 SFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
            F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 106 GFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 143


>gi|297282129|ref|XP_002802212.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Macaca mulatta]
          Length = 713

 Score = 50.8 bits (120), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 39/98 (39%), Gaps = 21/98 (21%)

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
           H CF C       KE+  +V  CV   CG FYH  CV K  L            ++    
Sbjct: 63  HSCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESR 103

Query: 348 SFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
            F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 104 GFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 141


>gi|300122245|emb|CBK22818.2| unnamed protein product [Blastocystis hominis]
          Length = 171

 Score = 50.8 bits (120), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 46/186 (24%), Positives = 74/186 (39%), Gaps = 35/186 (18%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           D++C+ C+     + C G CLRSFH          C  L    D      ++ C +C  +
Sbjct: 6   DNICAICERPNCPVRCSGGCLRSFHI--------QCLRLTSVPDR-----SWKCNDCVKR 52

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            H+CF C +     +E  +E+  C    C  ++H              A  +     ++ 
Sbjct: 53  SHECFECKQ-----RELDSELVQCSYPDCRRYFH-----------KREACCRCNPRSLST 96

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFED--KLEEGIITR 404
             F CP H+C  C      +D +     C RC K YH  CLP  +   D  ++ E +  +
Sbjct: 97  LDFVCPSHRCYAC---HGPSDMNNPLLKCFRCTKCYHYSCLPPSVQSLDSKRIRECLQPK 153

Query: 405 -AWEGL 409
             W G+
Sbjct: 154 PTWRGI 159


>gi|156230137|gb|AAI52413.1| WHSC1 protein [Homo sapiens]
          Length = 713

 Score = 50.8 bits (120), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 39/98 (39%), Gaps = 21/98 (21%)

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
           H CF C       KE+  +V  CV   CG FYH  CV K  L            ++    
Sbjct: 63  HSCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPL------------TVFESR 103

Query: 348 SFTCPLHKCCICKQGE--NKADSDLQFAVCRRCPKAYH 383
            F CPLH C  C      N   S  +   C RCP AYH
Sbjct: 104 GFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 141


>gi|315639351|ref|ZP_07894513.1| putative type II DNA modification enzyme (methyltransferase)
           [Campylobacter upsaliensis JV21]
 gi|315480677|gb|EFU71319.1| putative type II DNA modification enzyme (methyltransferase)
           [Campylobacter upsaliensis JV21]
          Length = 365

 Score = 50.4 bits (119), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 67/146 (45%), Gaps = 27/146 (18%)

Query: 700 FLHGMRYT-------SFGRHFTKVDKLQAIVDKLHWYVNDGDM------IVDFCCGANDF 746
           FL G +YT       S    FTK +  +    KL  ++N   +       ++  CG   F
Sbjct: 49  FLLGNKYTLEKENYRSHNEFFTKKESAKYCYKKLMDFLNQHSIDVGEYVFIEPSCGDLSF 108

Query: 747 SCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGV 806
             LM K   + G +  YKN +IL  +N  NF         PK+   G  LI+G NPPFG+
Sbjct: 109 YDLMPKN-SKIGIDLTYKNKEIL-CENFLNFT--------PKK--EGKYLILG-NPPFGL 155

Query: 807 KAGLANKFINKALEFNPKLLILIVPP 832
           +  LA +FIN A  F    +  I+PP
Sbjct: 156 RGNLALRFINHAYRF-ADFIAFILPP 180


>gi|57506149|ref|ZP_00372070.1| putative type II DNA modification enzyme (methyltransferase)
           [Campylobacter upsaliensis RM3195]
 gi|57015544|gb|EAL52337.1| putative type II DNA modification enzyme (methyltransferase)
           [Campylobacter upsaliensis RM3195]
          Length = 377

 Score = 50.1 bits (118), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 67/146 (45%), Gaps = 27/146 (18%)

Query: 700 FLHGMRYT-------SFGRHFTKVDKLQAIVDKLHWYVNDGDM------IVDFCCGANDF 746
           FL G +YT       S    FTK +  +    KL  ++N  ++       ++  CG   F
Sbjct: 49  FLLGNKYTLEKENYRSHNEFFTKKESAKYCYKKLIDFLNQHNIDVGKYVFIEPSCGDLSF 108

Query: 747 SCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGV 806
             LM K     G +  YKN +IL  +N  NF         PK+   G  LI+G NPPFG+
Sbjct: 109 YDLMPKN-SRIGIDLTYKNKEIL-CENFLNFT--------PKK--EGKYLILG-NPPFGL 155

Query: 807 KAGLANKFINKALEFNPKLLILIVPP 832
           +  LA +FIN A  F    +  I+PP
Sbjct: 156 RGNLALRFINHAYHF-ADFIAFILPP 180


>gi|328716964|ref|XP_001950932.2| PREDICTED: hypothetical protein LOC100161826, partial
           [Acyrthosiphon pisum]
          Length = 1530

 Score = 50.1 bits (118), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 32/120 (26%), Positives = 46/120 (38%), Gaps = 17/120 (14%)

Query: 276 LNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVA 335
           L + C  C+  +  CF CG     D +   +   C  ++CG ++H  C+       D   
Sbjct: 771 LKYMCSLCKAGKTNCFVCGL----DIDDPGQKIVCKISSCGKYFHESCLK------DWPQ 820

Query: 336 ADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAV-----CRRCPKAYHRK--CLP 388
              +  S        CP H C +C     K+     F       C +CP AYHR   CLP
Sbjct: 821 CQWIQGSRNNVRCVVCPHHVCHLCISDNPKSSCTTHFPAEKITRCIKCPTAYHRSEYCLP 880


>gi|325183066|emb|CCA17522.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1283

 Score = 50.1 bits (118), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 46/83 (55%), Gaps = 14/83 (16%)

Query: 212 HDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH---------ATIDAGEESHC 262
           HD++ D   + +  +   C+ C +GG+LLCC+ RC R+FH         + + A + S  
Sbjct: 121 HDSDHDAWTDHNRWY---CNLCKDGGELLCCD-RCPRAFHLKWYVGCFPSAVVAHQASRY 176

Query: 263 ASLGLTKDEVEAMLNFFCKNCEY 285
           ASLGL K+E+     ++CK C Y
Sbjct: 177 ASLGLQKEEIPES-EWYCKFCAY 198


>gi|390596238|gb|EIN05640.1| hypothetical protein PUNSTDRAFT_145612 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 1008

 Score = 49.7 bits (117), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 59/134 (44%), Gaps = 26/134 (19%)

Query: 230 CSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQ 289
           C +C +GG+L+CC GRC R FH          C   GLT  +++      C     +QH 
Sbjct: 863 CHYCRDGGELVCC-GRCPRVFH--------KECR--GLTAAQLKRNFAIIC-----EQHS 906

Query: 290 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 349
           C  C +   S  + G  +F C   TC   +   C+ +    D +   + L + ++ G  F
Sbjct: 907 CVVCNR---STGDAGGMLFRC--QTCPQAFCEDCLPE---GDIDAVGETLPEFLLLGYGF 958

Query: 350 TCPLH--KCCICKQ 361
           +   +  +C  C+Q
Sbjct: 959 SPSAYYIRCHDCRQ 972


>gi|432926624|ref|XP_004080920.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
           latipes]
          Length = 4455

 Score = 49.7 bits (117), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 48/170 (28%), Positives = 69/170 (40%), Gaps = 35/170 (20%)

Query: 223 DELFDS----VCSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN 277
           D+  DS    VC+FC + G  L C E  C RS+H           A+ G  +D  +  + 
Sbjct: 400 DKAIDSGSIQVCAFCRHLGASLRCQETGCTRSYHMPC-------AAAAGACQDWNQRRI- 451

Query: 278 FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 337
              ++      QC  C   G S    G  +  C    CG  YH  C+      D  V   
Sbjct: 452 LCTQHARTGSSQCRLCAGSGDSG---GLLMCSC----CGSCYHGSCL------DPPVTPS 498

Query: 338 QLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            L++       + CP  +C +C+    + DS +   +C RC KAYH  CL
Sbjct: 499 PLSRV-----GWQCP--QCRVCRSCSLQGDSGV--LLCARCDKAYHAHCL 539



 Score = 42.7 bits (99), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 28/102 (27%), Positives = 43/102 (42%), Gaps = 20/102 (19%)

Query: 286  KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            KQ  C  CG  G      GAE      + CG  YHP+CV+  + R            +I 
Sbjct: 1018 KQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVNIKITR------------VIL 1060

Query: 346  GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
             + + C   +C +C+   + +D   +  +C  C  +YH  CL
Sbjct: 1061 TKGWRC--LECTVCEACGDASDPG-RLLLCDDCDISYHTYCL 1099


>gi|326499283|dbj|BAK06132.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1350

 Score = 49.7 bits (117), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 61/144 (42%), Gaps = 34/144 (23%)

Query: 195  TKAMSGFIVDDMEEDMVH------DTEEDE--------SNEEDELFDSVCSFCDNGGDLL 240
            +K     +VD ++ D++H      D + D         S E D+  D  C  C +GG+L+
Sbjct: 983  SKPYRNILVDGLDNDLLHCLIIAWDKQSDSERQAFFPVSTEGDDPNDDTCGICGDGGNLI 1042

Query: 241  CCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSD 300
            CC+G C  +FH +    EE       L  D+      + C NC      C  C +  + D
Sbjct: 1043 CCDG-CPSTFHMSCLELEE-------LPSDD------WRCTNC-----SCKLCHEHLNHD 1083

Query: 301  KETGAEVFPCVSAT-CGHFYHPHC 323
                AE+ P  S + C   YHP C
Sbjct: 1084 APDNAEIDPLHSCSQCEKKYHPSC 1107


>gi|410953278|ref|XP_003983299.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Felis catus]
          Length = 4884

 Score = 49.3 bits (116), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 52/201 (25%), Positives = 75/201 (37%), Gaps = 55/201 (27%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 285
           C+FC + G  + CCE +C + +H    AG        G  +D      NFF    E+   
Sbjct: 285 CAFCKHLGATIKCCEDKCTQMYHYPCAAG-------AGTFQD----FRNFFLLCPEHIDQ 333

Query: 286 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 339
                 +   C  C   G        + F C   TCG  YH  C+        ++A   L
Sbjct: 334 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 378

Query: 340 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 398
            ++      + CP  K C  CKQ    +  D +  VC  C K YH  CL         + 
Sbjct: 379 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 422

Query: 399 EGIITRAWEGLLPNHRILIYC 419
           + + T  W+    N RI + C
Sbjct: 423 KSVPTNGWK--CKNCRICVEC 441



 Score = 44.7 bits (104), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 54/131 (41%), Gaps = 26/131 (19%)

Query: 263  ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 316
            A L   KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG
Sbjct: 925  ADLSSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCG 979

Query: 317  HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 376
              YHP+CVS  + +            ++  + + C   +C +C +   KA    +  +C 
Sbjct: 980  QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 1024

Query: 377  RCPKAYHRKCL 387
             C  +YH  CL
Sbjct: 1025 DCDISYHTYCL 1035


>gi|301759361|ref|XP_002915551.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           MLL3-like [Ailuropoda melanoleuca]
          Length = 4927

 Score = 48.9 bits (115), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 52/201 (25%), Positives = 75/201 (37%), Gaps = 55/201 (27%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 285
           C+FC + G  + CCE +C + +H    AG        G  +D      NFF    E+   
Sbjct: 300 CAFCKHLGATIKCCEEKCTQMYHYPCAAG-------AGTFQD----FRNFFLLCPEHIDQ 348

Query: 286 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 339
                 +   C  C   G        + F C   TCG  YH  C+        ++A   L
Sbjct: 349 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 393

Query: 340 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 398
            ++      + CP  K C  CKQ    +  D +  VC  C K YH  CL         + 
Sbjct: 394 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 437

Query: 399 EGIITRAWEGLLPNHRILIYC 419
           + + T  W+    N RI + C
Sbjct: 438 KSVPTNGWK--CKNCRICVEC 456



 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 53/126 (42%), Gaps = 26/126 (20%)

Query: 268  TKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHP 321
            +KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG  YHP
Sbjct: 944  SKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHP 998

Query: 322  HCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKA 381
            +CVS  + +            ++  + + C   +C +C +   KA    +  +C  C  +
Sbjct: 999  YCVSIKITK------------VVLSKGWRC--LECTVC-EACGKASDPGRLLLCDDCDIS 1043

Query: 382  YHRKCL 387
            YH  CL
Sbjct: 1044 YHTYCL 1049


>gi|350420881|ref|XP_003492659.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           isoform 2 [Bombus impatiens]
          Length = 1239

 Score = 48.9 bits (115), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 50/196 (25%), Positives = 67/196 (34%), Gaps = 54/196 (27%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEES--HCASLGLTKDEVEAMLN------ 277
           + VC  C+  G L  C+G C   FH + +  GE S  H     +  D++   +N      
Sbjct: 411 EKVCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSVDENIMDDKILNDINEIKQSN 470

Query: 278 -----------------FFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 319
                            F C +C       CF C        E   +   C    CG  Y
Sbjct: 471 TDNDENNGKSEEQEDEFFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHY 523

Query: 320 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADS-----DLQFAV 374
           H  C+                +S   G   TCP H C  C   +N  DS     + + A 
Sbjct: 524 HSSCLK------------SWPQSHWQGGRLTCPYHVCHTC-SSDNPQDSHSRAPNEKLAR 570

Query: 375 CRRCPKAYH--RKCLP 388
           C RCP +YH    CLP
Sbjct: 571 CVRCPSSYHTSTSCLP 586


>gi|255550532|ref|XP_002516316.1| hypothetical protein RCOM_1188780 [Ricinus communis]
 gi|223544546|gb|EEF46063.1| hypothetical protein RCOM_1188780 [Ricinus communis]
          Length = 499

 Score = 48.9 bits (115), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 49/102 (48%), Gaps = 26/102 (25%)

Query: 225 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
           L D++CSFC  GGDL+ C+ +C  +FH             LG  + +   + N+FC +C 
Sbjct: 141 LSDTICSFCHYGGDLILCD-KCPSTFH-------------LGCLELKDVPLENWFCPSC- 185

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSK 326
                C  CGK G S   T A    C+   C   YH HC++K
Sbjct: 186 ----CCELCGK-GDSSTSTNA----CLQ--CARAYHVHCLTK 216


>gi|198437220|ref|XP_002124518.1| PREDICTED: similar to Histone-lysine N-methyltransferase HRX (Zinc
            finger protein HRX) (ALL-1) (Trithorax-like protein)
            (Lysine N-methyltransferase 2A) (CXXC-type zinc finger
            protein 7) [Ciona intestinalis]
          Length = 3406

 Score = 48.9 bits (115), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 44/99 (44%), Gaps = 17/99 (17%)

Query: 290  CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 349
            C  CG   S+   +   +  C  A C   YHP C     L  D +A  Q+ ++     ++
Sbjct: 1024 CLICGSFSSAKSISEGALVHC--ACCCEPYHPFCAEPDFLNTDVLA--QMKRN-----TW 1074

Query: 350  TCPLHKCC-ICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
             C   +CC +C   +N         VCRRC K+YH +CL
Sbjct: 1075 ICRKCQCCHVCGHPKN-------LLVCRRCKKSYHSECL 1106


>gi|383864320|ref|XP_003707627.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20 specific-like [Megachile rotundata]
          Length = 1302

 Score = 48.9 bits (115), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 49/195 (25%), Positives = 67/195 (34%), Gaps = 52/195 (26%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASL-------------------G 266
           + VC  C+  G L  C+G C   FH + +  GE S   S+                    
Sbjct: 474 EKVCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSIDESTMDDKILNDIKEIKGDN 533

Query: 267 LTKDEVEAMLN------FFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 319
           + +DE    ++      F C +C       CF C        E   +   C    CG  Y
Sbjct: 534 IDEDETSGKIDEQEDELFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHY 586

Query: 320 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICK----QGENKADSDLQFAVC 375
           H  C+                +S   G   TCP H C  C     Q  +   S+ + A C
Sbjct: 587 HSSCLK------------SWPQSHWQGGRLTCPYHVCHTCSSDNPQDSHPRTSNEKLARC 634

Query: 376 RRCPKAYH--RKCLP 388
            RCP +YH    CLP
Sbjct: 635 VRCPSSYHTSTSCLP 649


>gi|300121090|emb|CBK21472.2| unnamed protein product [Blastocystis hominis]
          Length = 1183

 Score = 48.9 bits (115), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 35/87 (40%), Gaps = 12/87 (13%)

Query: 307  VFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKA 366
            VF C+  +CG FYH +C+  +     E  +D           F CP H CC C +  N  
Sbjct: 984  VFKCLVKSCGRFYHVNCIQNMEYGSKEHFSDL--------THFRCPAHFCCECHETGNTK 1035

Query: 367  DSDLQFAVCRRCPKAYHRKCLPRKIAF 393
                    C  C KA H  CL  +  F
Sbjct: 1036 ----HLVQCVMCYKAIHVSCLTLENGF 1058


>gi|359321427|ref|XP_003639590.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           MLL3 [Canis lupus familiaris]
          Length = 4874

 Score = 48.9 bits (115), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 50/201 (24%), Positives = 69/201 (34%), Gaps = 56/201 (27%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAG--------------------------EESHC 262
           C+FC + G  + CCE +C + +H    AG                          E+++C
Sbjct: 253 CAFCKHLGATIKCCEEKCTQIYHYPCAAGAGTFQDFRNFFLLCPEHIDQAPERSKEDANC 312

Query: 263 ASLGLTKDEVEAMLNFFCKNCEYKQH----------------QCFACGKLGSSDKETGAE 306
           A      D ++    FFC  C    H                QC  C K+  + KETG  
Sbjct: 313 AVCDSPGDLLD---QFFCTTCGQHYHGMCLDIAVTPLKRAGWQCPEC-KVCQNCKETGKN 368

Query: 307 VFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPL----HKCCICKQG 362
            F     TC   Y+P CVS L+         Q     I  E  T       H C +C   
Sbjct: 369 TFVLFCFTCSLNYNPFCVSPLVRIVPTNLFTQCRNCRICVECGTRSSSQWHHNCLVCDSC 428

Query: 363 ENKADSDLQFAVCRRCPKAYH 383
             + D+     +C  C K YH
Sbjct: 429 YQQQDN-----LCPFCGKCYH 444



 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 26/125 (20%)

Query: 269  KDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHPH 322
            KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG  YHP+
Sbjct: 898  KDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPY 952

Query: 323  CVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAY 382
            CVS  + +            ++  + + C   +C +C +   KA    +  +C  C  +Y
Sbjct: 953  CVSIKITK------------VVLSKGWRC--LECTVC-EACGKASDPGRLLLCDDCDISY 997

Query: 383  HRKCL 387
            H  CL
Sbjct: 998  HTYCL 1002


>gi|350420879|ref|XP_003492658.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           isoform 1 [Bombus impatiens]
          Length = 1230

 Score = 48.9 bits (115), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 50/196 (25%), Positives = 67/196 (34%), Gaps = 54/196 (27%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEES--HCASLGLTKDEVEAMLN------ 277
           + VC  C+  G L  C+G C   FH + +  GE S  H     +  D++   +N      
Sbjct: 402 EKVCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSVDENIMDDKILNDINEIKQSN 461

Query: 278 -----------------FFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 319
                            F C +C       CF C        E   +   C    CG  Y
Sbjct: 462 TDNDENNGKSEEQEDEFFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHY 514

Query: 320 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADS-----DLQFAV 374
           H  C+                +S   G   TCP H C  C   +N  DS     + + A 
Sbjct: 515 HSSCLK------------SWPQSHWQGGRLTCPYHVCHTC-SSDNPQDSHSRAPNEKLAR 561

Query: 375 CRRCPKAYH--RKCLP 388
           C RCP +YH    CLP
Sbjct: 562 CVRCPSSYHTSTSCLP 577


>gi|281339843|gb|EFB15427.1| hypothetical protein PANDA_003530 [Ailuropoda melanoleuca]
          Length = 4780

 Score = 48.5 bits (114), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 52/201 (25%), Positives = 75/201 (37%), Gaps = 55/201 (27%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 285
           C+FC + G  + CCE +C + +H    AG        G  +D      NFF    E+   
Sbjct: 156 CAFCKHLGATIKCCEEKCTQMYHYPCAAG-------AGTFQD----FRNFFLLCPEHIDQ 204

Query: 286 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 339
                 +   C  C   G        + F C   TCG  YH  C+        ++A   L
Sbjct: 205 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 249

Query: 340 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 398
            ++      + CP  K C  CKQ    +  D +  VC  C K YH  CL         + 
Sbjct: 250 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 293

Query: 399 EGIITRAWEGLLPNHRILIYC 419
           + + T  W+    N RI + C
Sbjct: 294 KSVPTNGWK--CKNCRICVEC 312



 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 53/126 (42%), Gaps = 26/126 (20%)

Query: 268 TKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHP 321
           +KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG  YHP
Sbjct: 800 SKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHP 854

Query: 322 HCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKA 381
           +CVS  + +            ++  + + C   +C +C +   KA    +  +C  C  +
Sbjct: 855 YCVSIKITK------------VVLSKGWRC--LECTVC-EACGKASDPGRLLLCDDCDIS 899

Query: 382 YHRKCL 387
           YH  CL
Sbjct: 900 YHTYCL 905


>gi|345493934|ref|XP_001600694.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Nasonia vitripennis]
          Length = 1382

 Score = 48.5 bits (114), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 51/194 (26%), Positives = 63/194 (32%), Gaps = 51/194 (26%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTKDEVEAML--------- 276
           + VC  C+  G L+ C G C   FH A +  GE S   S     +E  A           
Sbjct: 574 ERVCQICEKTGKLIRCRGPCHSYFHLACVKPGESSPEPSEAGDIEENNAYKEDLKEIKAK 633

Query: 277 ---------------NFFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYH 320
                          NF C +C       CF C        E   E   C    CG  YH
Sbjct: 634 SKEIQEKEEAAFDYDNFKCIDCLSGVAPPCFVC-------HERDGERTKCSILACGKHYH 686

Query: 321 PHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICK----QGENKADSDLQFAVCR 376
           P C+                +    G   TCP H C  C     Q  +   +  +FA C 
Sbjct: 687 PDCLK------------SWPQCQWQGGRLTCPHHICHTCASDNPQNSHPRSAGEKFAKCV 734

Query: 377 RCPKAYHR--KCLP 388
           +CP  YH    CLP
Sbjct: 735 KCPSTYHASISCLP 748


>gi|170050731|ref|XP_001861443.1| set domain protein [Culex quinquefasciatus]
 gi|167872245|gb|EDS35628.1| set domain protein [Culex quinquefasciatus]
          Length = 1181

 Score = 48.5 bits (114), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 40/167 (23%), Positives = 62/167 (37%), Gaps = 31/167 (18%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ- 287
            C+ C    D++ C   C   FH      + S    L            F C  C   Q 
Sbjct: 690 ACAVCLEPNDVVKCSA-CHNHFHPRCIGAKTSSDPKL------------FKCVECADGQT 736

Query: 288 HQCFACGKLGSSDKETGAEV-FPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           H+CF C      D    +E    C  + CG +YH HC+   L    ++ +   + ++   
Sbjct: 737 HRCFVCN---DQDATIASETKHRCALSGCGKYYHIHCLR--LFPQHKITSTPNSSTLF-- 789

Query: 347 ESFTCPLHKCCICKQGENKADSDL---QFAVCRRCPKAYH--RKCLP 388
               CP H C  C   + + ++         C +CP +YH   KC+P
Sbjct: 790 ----CPYHTCHTCVSDDPRTNATATRGSLVRCIKCPSSYHPDAKCVP 832


>gi|340718068|ref|XP_003397494.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
           H3 lysine-36 and H4 lysine-20 specific-like [Bombus
           terrestris]
          Length = 1238

 Score = 48.1 bits (113), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 50/196 (25%), Positives = 67/196 (34%), Gaps = 54/196 (27%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEES--HCASLGLTKDEVEAMLN------ 277
           + VC  C+  G L  C+G C   FH + +  GE S  H     +  D++   +N      
Sbjct: 410 EKVCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSVDENIMDDKILDDINEIKQSN 469

Query: 278 -----------------FFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 319
                            F C +C       CF C        E   +   C    CG  Y
Sbjct: 470 IDNDENNGKSEEQEDEFFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHY 522

Query: 320 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADS-----DLQFAV 374
           H  C+                +S   G   TCP H C  C   +N  DS     + + A 
Sbjct: 523 HSSCLK------------SWPQSHWQGGRLTCPYHVCHTC-SSDNPQDSHSRAPNEKLAR 569

Query: 375 CRRCPKAYH--RKCLP 388
           C RCP +YH    CLP
Sbjct: 570 CVRCPSSYHTSTSCLP 585


>gi|310831063|ref|YP_003969706.1| putative DNA N6-adenine methyltransferase [Cafeteria roenbergensis
           virus BV-PW1]
 gi|309386247|gb|ADO67107.1| putative DNA N6-adenine methyltransferase [Cafeteria roenbergensis
           virus BV-PW1]
          Length = 297

 Score = 48.1 bits (113), Expect = 0.037,   Method: Composition-based stats.
 Identities = 45/169 (26%), Positives = 80/169 (47%), Gaps = 18/169 (10%)

Query: 703 GMRYTSFGRHFTKVDKLQAIVD--KLHWYVNDGDMIVDFCCGANDFSCLMKKKLDETGKN 760
           G+   +  +++TK + ++  +D  K +  +N  D I++   G   F       ++E    
Sbjct: 31  GLNRNTMDKYYTKTNIVKNCIDSIKSNVIINKLDTIIEPSAGNGAF-------IEEIKHL 83

Query: 761 CL-YKNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGLNPPFGVKAGLANKFINKAL 819
           C  YK YDILP  N+    K+D++ ++          I+G NPPFG ++ +A KFI K  
Sbjct: 84  CNNYKFYDILPENNEII--KQDFLKLDVNNFKNQKIHIIG-NPPFGRQSSIAIKFIKKCC 140

Query: 820 EFNPKLLILIVPPETERLDRK--ESAYELVWEDDQFLSGKSFYLPGSVD 866
            F   +  ++     +   RK     Y LV+E+D     K+ +L  ++D
Sbjct: 141 LFAESISFILPKSFKKESMRKYFNKYYHLVFEEDLL---KNCFLVNNID 186


>gi|345493936|ref|XP_003427184.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
           [Nasonia vitripennis]
          Length = 1317

 Score = 47.8 bits (112), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 51/194 (26%), Positives = 63/194 (32%), Gaps = 51/194 (26%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTKDEVEAML--------- 276
           + VC  C+  G L+ C G C   FH A +  GE S   S     +E  A           
Sbjct: 509 ERVCQICEKTGKLIRCRGPCHSYFHLACVKPGESSPEPSEAGDIEENNAYKEDLKEIKAK 568

Query: 277 ---------------NFFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYH 320
                          NF C +C       CF C        E   E   C    CG  YH
Sbjct: 569 SKEIQEKEEAAFDYDNFKCIDCLSGVAPPCFVC-------HERDGERTKCSILACGKHYH 621

Query: 321 PHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICK----QGENKADSDLQFAVCR 376
           P C+                +    G   TCP H C  C     Q  +   +  +FA C 
Sbjct: 622 PDCLK------------SWPQCQWQGGRLTCPHHICHTCASDNPQNSHPRSAGEKFAKCV 669

Query: 377 RCPKAYHR--KCLP 388
           +CP  YH    CLP
Sbjct: 670 KCPSTYHASISCLP 683


>gi|432916836|ref|XP_004079403.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
           latipes]
          Length = 4802

 Score = 47.8 bits (112), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 49/194 (25%), Positives = 74/194 (38%), Gaps = 41/194 (21%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATID--AGEESHCASLGL-TKDEVEAMLNFFCKNCEY 285
           C++C   G  + CC   C + +H      AG      SL L   D +E  ++ F  +   
Sbjct: 285 CAYCKRLGASIKCCAEGCAQLYHYPCAGAAGTFQDIRSLSLLCPDHIEVAIHKFVDDVN- 343

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
               C  C   G    +   ++F C S  CG  YH  C+              +A + + 
Sbjct: 344 ----CVLCDSPG----DLLDQLF-CTS--CGLHYHGMCLD-------------MAVTPLR 379

Query: 346 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRA 405
              + CP  +C +C+  +N  D D +  VC  C K YH  CL   +       E + T  
Sbjct: 380 RAGWQCP--ECKVCQTCKNHGD-DTKMLVCDMCDKGYHTFCLQPAM-------ESLPTNG 429

Query: 406 WEGLLPNHRILIYC 419
           W     N R+ I C
Sbjct: 430 WR--CKNCRVCIQC 441



 Score = 42.0 bits (97), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 35/141 (24%), Positives = 54/141 (38%), Gaps = 25/141 (17%)

Query: 252  ATIDAGEESHCASLGLTKDEVEAMLN---FFCKNCEY--KQHQCFACGKLGSSDKETGAE 306
            ++I+ G  +      + ++E  AM N    F  N  +  KQ  C  CG  G      GAE
Sbjct: 923  SSINPGLTTFEMQYPVKEEEENAMHNTVVIFSSNDSFTLKQDMCVVCGSFG-----LGAE 977

Query: 307  VFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKA 366
                  A CG  YHP CV   +              ++  + + C   +C +C+      
Sbjct: 978  GRLLACAQCGQCYHPFCVGIKI------------NKVVLSKGWRC--LECTVCEACGQAT 1023

Query: 367  DSDLQFAVCRRCPKAYHRKCL 387
            D   +  +C  C  +YH  CL
Sbjct: 1024 DPG-RLLLCDDCDISYHTYCL 1043


>gi|338724475|ref|XP_001495649.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           MLL3-like [Equus caballus]
          Length = 4910

 Score = 47.8 bits (112), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 76/201 (37%), Gaps = 55/201 (27%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 285
           C+FC + G  + CCE +C +++H    AG        G  +D      +FF    E+   
Sbjct: 245 CAFCKHLGATIKCCEEKCTQTYHYPCAAG-------AGTFQD----FSHFFLLCPEHIDQ 293

Query: 286 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 339
                 +   C  C   G        + F C   TCG  YH  C+        ++A   L
Sbjct: 294 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 338

Query: 340 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 398
            ++      + CP  K C  CKQ    +  D +  VC  C K YH  CL         + 
Sbjct: 339 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 382

Query: 399 EGIITRAWEGLLPNHRILIYC 419
           + + T  W+    N RI + C
Sbjct: 383 KSVPTNGWK--CKNCRICVEC 401



 Score = 44.7 bits (104), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 54/131 (41%), Gaps = 26/131 (19%)

Query: 263 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 316
           A L   KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG
Sbjct: 885 ADLSSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCG 939

Query: 317 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 376
             YHP+CVS  + +            ++  + + C   +C +C +   KA    +  +C 
Sbjct: 940 QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 984

Query: 377 RCPKAYHRKCL 387
            C  +YH  CL
Sbjct: 985 DCDISYHTYCL 995


>gi|38567828|emb|CAE05777.2| OSJNBb0020J19.6 [Oryza sativa Japonica Group]
          Length = 1566

 Score = 47.8 bits (112), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 22/104 (21%)

Query: 227  DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
            D  C FC +GG+LLCC+  C  ++H T  + +E    S             ++C NC   
Sbjct: 1025 DDTCGFCGDGGELLCCDN-CPSTYHQTCLSDQELPEGS-------------WYCHNC--- 1067

Query: 287  QHQCFACGK-LGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 329
               C +CG  L   +  T + +  C+   CG  YH  C+ + +L
Sbjct: 1068 --TCRSCGNPLSEKEVSTFSAILKCLQ--CGDSYHDTCIDQEML 1107


>gi|440895698|gb|ELR47828.1| Histone-lysine N-methyltransferase MLL3, partial [Bos grunniens
           mutus]
          Length = 4905

 Score = 47.8 bits (112), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 76/201 (37%), Gaps = 55/201 (27%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 285
           C+FC + G  + CCE +C + +H    AG        G  +D    + +FF    E+   
Sbjct: 231 CAFCKHLGATIKCCEEKCTQMYHYPCAAG-------AGAFQD----LSHFFLLCPEHIDQ 279

Query: 286 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 339
                 +   C  C   G        + F C   TCG  YH  C+        ++A   L
Sbjct: 280 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 324

Query: 340 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 398
            ++      + CP  K C  CKQ    +  D +  VC  C K YH  CL         + 
Sbjct: 325 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 368

Query: 399 EGIITRAWEGLLPNHRILIYC 419
           + + T  W+    N RI + C
Sbjct: 369 KSVPTNGWK--CKNCRICVEC 387



 Score = 42.7 bits (99), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 26/125 (20%)

Query: 269 KDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHPH 322
           KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG  YHP+
Sbjct: 872 KDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPY 926

Query: 323 CVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAY 382
           CVS  + +            ++  + + C   +C +C +   KA    +  +C  C  +Y
Sbjct: 927 CVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISY 971

Query: 383 HRKCL 387
           H  CL
Sbjct: 972 HTYCL 976


>gi|348568065|ref|XP_003469819.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Cavia
           porcellus]
          Length = 4878

 Score = 47.8 bits (112), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 76/201 (37%), Gaps = 55/201 (27%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 285
           C+FC + G  + CCE +C + +H    AG       +G  +D      +FF    E+   
Sbjct: 241 CAFCKHLGATIKCCEEKCTQMYHYPCAAG-------VGTFQD----FSHFFLLCPEHIDQ 289

Query: 286 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 339
                 +   C  C   G        + F C   TCG  YH  C+        ++A   L
Sbjct: 290 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 334

Query: 340 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 398
            ++      + CP  K C  CKQ    +  D +  VC  C K YH  CL         + 
Sbjct: 335 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 378

Query: 399 EGIITRAWEGLLPNHRILIYC 419
           + + T  W+    N RI + C
Sbjct: 379 KSVPTNGWK--CKNCRICVEC 397



 Score = 43.9 bits (102), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 54/131 (41%), Gaps = 26/131 (19%)

Query: 263 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 316
           A +   KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG
Sbjct: 878 ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCG 932

Query: 317 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 376
             YHP+CVS  + +            ++  + + C   +C +C +   KA    +  +C 
Sbjct: 933 QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 977

Query: 377 RCPKAYHRKCL 387
            C  +YH  CL
Sbjct: 978 DCDISYHTYCL 988


>gi|357119285|ref|XP_003561373.1| PREDICTED: uncharacterized protein LOC100845556 [Brachypodium
           distachyon]
          Length = 1589

 Score = 47.4 bits (111), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 46/98 (46%), Gaps = 27/98 (27%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
            C  C +GGDLLCC+ RC  +FH           A LG+   E+ +  ++FC+NC     
Sbjct: 851 TCGICGDGGDLLCCD-RCTSTFHV----------ACLGI---EMPSG-DWFCRNC----- 890

Query: 289 QCFACGKLGSSDKETG--AEVFPCVSATCGHFYHPHCV 324
               C   GS+++ T   AE+  C+   C   YH  C 
Sbjct: 891 ---ICKFCGSAEERTSSPAELLSCLQ--CSRKYHQVCA 923


>gi|357115296|ref|XP_003559426.1| PREDICTED: uncharacterized protein LOC100827015 [Brachypodium
            distachyon]
          Length = 1344

 Score = 47.4 bits (111), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 61/144 (42%), Gaps = 34/144 (23%)

Query: 195  TKAMSGFIVDDMEEDMVH------DTEEDE--------SNEEDELFDSVCSFCDNGGDLL 240
            +K     +VD +++D++H      D + D         S E D+  D  C  C +GG+L+
Sbjct: 929  SKPYRNVLVDGLDKDLLHCLISAWDKQSDSERQSFFPVSTEGDDPNDDTCGICGDGGNLI 988

Query: 241  CCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSD 300
            CC+G C  +FH +    EE       L  D+      + C NC      C  C +  + D
Sbjct: 989  CCDG-CPSTFHMSCLELEE-------LPSDD------WRCANC-----CCKFCQEHSNDD 1029

Query: 301  KETGAEVFP-CVSATCGHFYHPHC 323
                AEV   C  + C   YHP C
Sbjct: 1030 APDIAEVDSLCTCSQCEENYHPVC 1053


>gi|332028801|gb|EGI68830.1| Putative histone-lysine N-methyltransferase NSD2 [Acromyrmex
           echinatior]
          Length = 1304

 Score = 47.4 bits (111), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 58/235 (24%), Positives = 81/235 (34%), Gaps = 62/235 (26%)

Query: 192 VVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELF-----DSVCSFCDNGGDLLCCEGRC 246
           + + K M   I  D EE+ V +T+         LF     + VC  C+  G L  C+G C
Sbjct: 481 IKENKKMKSKI--DKEENFVSETKRSRLYN---LFKGMKQEKVCQICEKTGQLTRCKGPC 535

Query: 247 LRSFH-ATIDAGEES--HCASLGLTKDEVEAMLN-----------------------FFC 280
              FH + +  GE S  +        D + + LN                       F C
Sbjct: 536 YSYFHLSCVKPGESSPEYSVDENTLDDRLLSDLNIIKKSINGESENNGKSDEQEDETFKC 595

Query: 281 KNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 339
            +C       CF C        E   +   C+   CG  YH +C+               
Sbjct: 596 IDCLSGVAPACFLCN-------EREGDRIRCIVPACGKHYHSNCLL------------SW 636

Query: 340 AKSIIAGESFTCPLHKCCICK----QGENKADSDLQFAVCRRCPKAYHRK--CLP 388
            +S   G   TCP H C  C     Q +     + + A C RCP +YH    CLP
Sbjct: 637 PQSHWQGGRLTCPYHVCHTCSSDNPQDKRSRAPNEKMARCVRCPSSYHASTLCLP 691


>gi|256075776|ref|XP_002574192.1| SET domain protein [Schistosoma mansoni]
          Length = 1343

 Score = 47.4 bits (111), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 62/173 (35%), Gaps = 30/173 (17%)

Query: 229 VCSFC----DNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML-NFFCKNC 283
           VC  C    +  G +  C+G C R  H         HC           +    F C  C
Sbjct: 287 VCPVCEVYSNAPGQMFQCQGPCGRIVHP--------HCMRYKTPPPADNSRPEKFRCPQC 338

Query: 284 EYKQHQCFACGKLGSSDKETGA-EVFPCVSATCGHFYHPHCVS---KLLLRDDEVAADQL 339
              +  C  CGK   +  + GA +++ C    C   +H  C+S    +L R     +D +
Sbjct: 339 LIGEFLCSICGKPSETGSKDGAGQLYACQVINCSRHFHRDCLSGWPGILTR----PSDNV 394

Query: 340 AKSIIAGESFTCPLHKCCICKQGENKAD---------SDLQFAVCRRCPKAYH 383
           A      +   CP H C  C    N+ +         S+  F  C RCP  +H
Sbjct: 395 ATRTTTFQVLRCPAHTCNTCYLESNETNCHPVKKPCSSEGPFLECVRCPAVFH 447


>gi|347972366|ref|XP_316738.5| AGAP004656-PA [Anopheles gambiae str. PEST]
 gi|333469400|gb|EAA11974.5| AGAP004656-PA [Anopheles gambiae str. PEST]
          Length = 1259

 Score = 47.4 bits (111), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 35/165 (21%), Positives = 64/165 (38%), Gaps = 20/165 (12%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           VC  C+   ++  C  +C    H    A +      +    D+ + +    C      + 
Sbjct: 608 VCYICNRPNNVTKC-SKCTLHLHLVCLANDPEEVVKMQELVDQ-KKLCCEKCSTTSIVEK 665

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
            CF C       +++  +++ CV   C   YH  C+ +L  +  +V+A           +
Sbjct: 666 TCFICN--DEIPEKSNEQIYRCVVGKCTQAYHISCL-QLFPQVRQVSAS----------T 712

Query: 349 FTCPLHKCCICKQGENKADSDL---QFAVCRRCPKAYHR--KCLP 388
             CP H C  C   E ++ + +     A C +CP +YH    C+P
Sbjct: 713 IICPYHTCHTCVASEPRSTASMVKTTLAHCLKCPTSYHPSGNCIP 757


>gi|108711065|gb|ABF98860.1| acetyltransferase, GNAT family protein, expressed [Oryza sativa
           Japonica Group]
          Length = 1169

 Score = 47.0 bits (110), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 40/147 (27%), Positives = 61/147 (41%), Gaps = 39/147 (26%)

Query: 195 TKAMSGFIVDDMEEDMVH------DTEEDE--------SNEEDELFDSVCSFCDNGGDLL 240
           +K     +VD ++ D++H      D + D         S E D+  D  C  C +GG+L+
Sbjct: 757 SKPYRNVLVDGLDTDLLHCLINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLI 816

Query: 241 CCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML--NFFCKNCEYKQHQCFACGKLGS 298
           CC+G C  +FH +        C        E+EA+   ++ C  C      C  C +   
Sbjct: 817 CCDG-CPSTFHMS--------CL-------ELEALPSDDWRCAKC-----SCKFCQEHSR 855

Query: 299 SDKETGAEVFP--CVSATCGHFYHPHC 323
            D +  AEV    C  + C   YHP C
Sbjct: 856 QDAQDIAEVDSSLCTCSQCEEKYHPGC 882


>gi|426228657|ref|XP_004008414.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           MLL3 [Ovis aries]
          Length = 4922

 Score = 47.0 bits (110), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 75/201 (37%), Gaps = 55/201 (27%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 285
           C+FC + G  + CCE +C + +H    AG        G  +D    + +FF    E+   
Sbjct: 256 CAFCKHLGATIKCCEEKCTQMYHYPCAAG-------AGTFQD----LSHFFLLCPEHIDQ 304

Query: 286 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 339
                 +   C  C   G        + F C   TCG  YH  C+        ++A   L
Sbjct: 305 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 349

Query: 340 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 398
            ++      + CP  K C  CKQ    +  D +  VC  C K YH  CL         + 
Sbjct: 350 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 393

Query: 399 EGIITRAWEGLLPNHRILIYC 419
           + + T  W     N RI + C
Sbjct: 394 KSVPTNGWR--CKNCRICVEC 412



 Score = 43.5 bits (101), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 54/131 (41%), Gaps = 26/131 (19%)

Query: 263  ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 316
            A +   KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG
Sbjct: 891  ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCG 945

Query: 317  HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 376
              YHP+CVS  + +            ++  + + C   +C +C +   KA    +  +C 
Sbjct: 946  QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 990

Query: 377  RCPKAYHRKCL 387
             C  +YH  CL
Sbjct: 991  DCDISYHTYCL 1001


>gi|328781326|ref|XP_003249962.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Apis mellifera]
          Length = 1218

 Score = 47.0 bits (110), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 51/196 (26%), Positives = 67/196 (34%), Gaps = 54/196 (27%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEES--HCASLGLTKDEV----------- 272
           + VC  C+  G L  C+G C   FH + +  GE S  H     +  D++           
Sbjct: 405 EKVCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSIDENIMDDKILDDIKEIKQNN 464

Query: 273 -----------EAMLNFF-CKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 319
                      E    FF C +C       CF C        E   +   C    CG  Y
Sbjct: 465 IDDDENNGKSEEQEDEFFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHY 517

Query: 320 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADS-----DLQFAV 374
           H  C+                +S   G   TCP H C  C   +N  DS     + + A 
Sbjct: 518 HSSCLK------------SWPQSHWQGGRLTCPYHICHTC-SSDNPQDSHSRAPNEKLAR 564

Query: 375 CRRCPKAYH--RKCLP 388
           C RCP +YH    CLP
Sbjct: 565 CVRCPSSYHTSTSCLP 580


>gi|353231818|emb|CCD79173.1| putative set domain protein [Schistosoma mansoni]
          Length = 1306

 Score = 47.0 bits (110), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 62/173 (35%), Gaps = 30/173 (17%)

Query: 229 VCSFC----DNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML-NFFCKNC 283
           VC  C    +  G +  C+G C R  H         HC           +    F C  C
Sbjct: 287 VCPVCEVYSNAPGQMFQCQGPCGRIVHP--------HCMRYKTPPPADNSRPEKFRCPQC 338

Query: 284 EYKQHQCFACGKLGSSDKETGA-EVFPCVSATCGHFYHPHCVS---KLLLRDDEVAADQL 339
              +  C  CGK   +  + GA +++ C    C   +H  C+S    +L R     +D +
Sbjct: 339 LIGEFLCSICGKPSETGSKDGAGQLYACQVINCSRHFHRDCLSGWPGILTR----PSDNV 394

Query: 340 AKSIIAGESFTCPLHKCCICKQGENKAD---------SDLQFAVCRRCPKAYH 383
           A      +   CP H C  C    N+ +         S+  F  C RCP  +H
Sbjct: 395 ATRTTTFQVLRCPAHTCNTCYLESNETNCHPVKKPCSSEGPFLECVRCPAVFH 447


>gi|218193747|gb|EEC76174.1| hypothetical protein OsI_13499 [Oryza sativa Indica Group]
          Length = 1305

 Score = 47.0 bits (110), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 40/147 (27%), Positives = 61/147 (41%), Gaps = 39/147 (26%)

Query: 195  TKAMSGFIVDDMEEDMVH------DTEEDE--------SNEEDELFDSVCSFCDNGGDLL 240
            +K     +VD ++ D++H      D + D         S E D+  D  C  C +GG+L+
Sbjct: 893  SKPYRNVLVDGLDTDLLHCLINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLI 952

Query: 241  CCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML--NFFCKNCEYKQHQCFACGKLGS 298
            CC+G C  +FH +        C        E+EA+   ++ C  C      C  C +   
Sbjct: 953  CCDG-CPSTFHMS--------CL-------ELEALPSDDWRCAKC-----SCKFCQEHSR 991

Query: 299  SDKETGAEVFP--CVSATCGHFYHPHC 323
             D +  AEV    C  + C   YHP C
Sbjct: 992  QDAQDIAEVDSSLCTCSQCEEKYHPGC 1018


>gi|14626277|gb|AAK71545.1|AC087852_5 unknown protein [Oryza sativa Japonica Group]
          Length = 1324

 Score = 47.0 bits (110), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 40/147 (27%), Positives = 61/147 (41%), Gaps = 39/147 (26%)

Query: 195  TKAMSGFIVDDMEEDMVH------DTEEDE--------SNEEDELFDSVCSFCDNGGDLL 240
            +K     +VD ++ D++H      D + D         S E D+  D  C  C +GG+L+
Sbjct: 912  SKPYRNVLVDGLDTDLLHCLINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLI 971

Query: 241  CCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML--NFFCKNCEYKQHQCFACGKLGS 298
            CC+G C  +FH +        C        E+EA+   ++ C  C      C  C +   
Sbjct: 972  CCDG-CPSTFHMS--------CL-------ELEALPSDDWRCAKC-----SCKFCQEHSR 1010

Query: 299  SDKETGAEVFP--CVSATCGHFYHPHC 323
             D +  AEV    C  + C   YHP C
Sbjct: 1011 QDAQDIAEVDSSLCTCSQCEEKYHPGC 1037


>gi|432097048|gb|ELK27546.1| Histone-lysine N-methyltransferase MLL3 [Myotis davidii]
          Length = 4785

 Score = 47.0 bits (110), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 50/201 (24%), Positives = 76/201 (37%), Gaps = 55/201 (27%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 285
           C+FC + G  + CCE +C +++H     G        G  +D    + +FF    E+   
Sbjct: 217 CAFCKHLGATIKCCEEKCTQTYHYPCAVG-------AGTFQD----VSHFFLLCPEHIDQ 265

Query: 286 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 339
                 +   C  C   G        + F C   TCG  YH  C+        ++A   L
Sbjct: 266 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DIAVTPL 310

Query: 340 AKSIIAGESFTCPLHKCCI-CKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 398
            ++      + CP  K C  CKQ    +  D +  VC  C K YH  CL         + 
Sbjct: 311 KRA-----GWQCPECKVCQNCKQ----SGEDSKMLVCDTCDKGYHTFCL-------QPVM 354

Query: 399 EGIITRAWEGLLPNHRILIYC 419
           + + T  W+    N RI + C
Sbjct: 355 KSVPTNGWK--CKNCRICVEC 373


>gi|156374107|ref|XP_001629650.1| predicted protein [Nematostella vectensis]
 gi|156216655|gb|EDO37587.1| predicted protein [Nematostella vectensis]
          Length = 265

 Score = 47.0 bits (110), Expect = 0.090,   Method: Composition-based stats.
 Identities = 47/163 (28%), Positives = 67/163 (41%), Gaps = 32/163 (19%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFH----ATIDAGEESHCASLGLTKDEVEAMLNFFCKNCE 284
           C+ C   G  ++C   RC R++H    A+  A +E H  ++ L  D +E           
Sbjct: 116 CAHCSRFGASVVCQVPRCGRTYHYPCAASAGAFQEIHSMTM-LCPDHLED-----AGRLG 169

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
             + QC+ CG+     KE  AE+  C S  CG  YH  C+      D  V    L +   
Sbjct: 170 GAEAQCYLCGEA----KEI-AEMLFCTS--CGRHYHGRCL------DPAVEITSLVRM-- 214

Query: 345 AGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
               + CP  K C   QG  +   D +  VC  C + YH  CL
Sbjct: 215 ---GWQCPDCKVC---QGCRQPGDDNKMLVCDVCDRGYHTFCL 251


>gi|380012561|ref|XP_003690348.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2-like [Apis florea]
          Length = 1208

 Score = 46.6 bits (109), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 51/196 (26%), Positives = 67/196 (34%), Gaps = 54/196 (27%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEES--HCASLGLTKDEV----------- 272
           + VC  C+  G L  C+G C   FH + +  GE S  H     +  D++           
Sbjct: 404 EKVCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSIDENIMDDKILDDIKEIKQNN 463

Query: 273 -----------EAMLNFF-CKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFY 319
                      E    FF C +C       CF C        E   +   C    CG  Y
Sbjct: 464 IDDDENNGKSEEQEDEFFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHY 516

Query: 320 HPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADS-----DLQFAV 374
           H  C+                +S   G   TCP H C  C   +N  DS     + + A 
Sbjct: 517 HSSCLK------------SWPQSHWQGGRLTCPYHICHTC-SSDNPQDSHSRAPNEKLAR 563

Query: 375 CRRCPKAYH--RKCLP 388
           C RCP +YH    CLP
Sbjct: 564 CVRCPSSYHTSTSCLP 579


>gi|270001730|gb|EEZ98177.1| hypothetical protein TcasGA2_TC000606 [Tribolium castaneum]
          Length = 5215

 Score = 46.6 bits (109), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 43/164 (26%), Positives = 62/164 (37%), Gaps = 39/164 (23%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKN------ 282
           CSFC++ G  +LC    C + +H          CA+      +V  +L  FC N      
Sbjct: 311 CSFCNHYGASILCKIEGCTKVYHFP--------CATAS-GAFQVLNLLALFCSNHIGQAS 361

Query: 283 CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS 342
            E +   C+ C  LG       A +  C  ++CG  YH  CV              LA+ 
Sbjct: 362 LECENSVCYTCKTLGDI-----ANLMFC--SSCGEHYHGICVG-------------LAQL 401

Query: 343 IIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKC 386
                 + C   KC IC+      D + +   C +C K YH  C
Sbjct: 402 PGVRAGWQC--RKCRICQVCRMTGD-ETKLMTCEQCDKIYHSTC 442


>gi|422293139|gb|EKU20439.1| phd zinc finger-containing protein, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 174

 Score = 46.6 bits (109), Expect = 0.11,   Method: Composition-based stats.
 Identities = 17/49 (34%), Positives = 26/49 (53%)

Query: 349 FTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKL 397
           F CP+H C +C +G    ++  Q   C  CP+AYH  C+P    + + L
Sbjct: 10  FVCPMHICDLCGKGRESGNNRNQLFPCWLCPRAYHLNCIPPACKYHEYL 58


>gi|313227685|emb|CBY22833.1| unnamed protein product [Oikopleura dioica]
          Length = 1179

 Score = 46.6 bits (109), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 50/123 (40%), Gaps = 19/123 (15%)

Query: 274 AMLNFFCKNCEYKQHQCFACG-----KLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLL 328
           A  +F C+ C  ++  CF C      +   +D++  + +  C    CG  YH HC     
Sbjct: 629 ASTSFVCRICCERKFTCFKCKMEIVPETEENDQKGDSSIAWCPVEDCGKVYHKHC----- 683

Query: 329 LRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAV--------CRRCPK 380
           L +DE++     K       + CPLH C  C   ++     ++           C RCP 
Sbjct: 684 LENDELSVPYKPKDSTQT-LYKCPLHNCRSCVLKKDDKKDSIKLNSSSKGKMFKCIRCPA 742

Query: 381 AYH 383
           AYH
Sbjct: 743 AYH 745


>gi|313221637|emb|CBY36122.1| unnamed protein product [Oikopleura dioica]
          Length = 913

 Score = 46.6 bits (109), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 50/123 (40%), Gaps = 19/123 (15%)

Query: 274 AMLNFFCKNCEYKQHQCFACG-----KLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLL 328
           A  +F C+ C  ++  CF C      +   +D++  + +  C    CG  YH HC     
Sbjct: 629 ASTSFVCRICCERKFTCFKCKMEIVPETEENDQKGDSSIAWCPVEDCGKVYHKHC----- 683

Query: 329 LRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAV--------CRRCPK 380
           L +DE++     K       + CPLH C  C   ++     ++           C RCP 
Sbjct: 684 LENDELSVPYKPKDSTQT-LYKCPLHNCRSCVLKKDDKKDSIKLNSSSKGKMFKCIRCPA 742

Query: 381 AYH 383
           AYH
Sbjct: 743 AYH 745


>gi|156390320|ref|XP_001635219.1| predicted protein [Nematostella vectensis]
 gi|156222310|gb|EDO43156.1| predicted protein [Nematostella vectensis]
          Length = 690

 Score = 46.6 bits (109), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 40/110 (36%), Gaps = 21/110 (19%)

Query: 278 FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 337
           F C  C    H CF C + G        +V  C    C  FYH  C+             
Sbjct: 25  FKCDECTSGVHSCFVCRQTG--------DVKACSQPLCSKFYHKECLQSYKC-------- 68

Query: 338 QLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
               S I G+   CPLH C  C   +   +   +   C RCP AYH  CL
Sbjct: 69  ----SKIDGDRIYCPLHFCSTCISNKTPVNRG-RLTKCIRCPTAYHAGCL 113


>gi|395539758|ref|XP_003771833.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Sarcophilus
           harrisii]
          Length = 4951

 Score = 46.2 bits (108), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 48/200 (24%), Positives = 77/200 (38%), Gaps = 53/200 (26%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 285
           C+FC + G  + CCE +C + +H    AG        G  +D    + NF     E+   
Sbjct: 329 CAFCKHLGATIKCCEEKCTQMYHYPCAAG-------AGTFQD----ISNFSLLCPEHIDQ 377

Query: 286 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 339
                 ++  C  C   G    + G + F     TCG  YH  C+        ++A   L
Sbjct: 378 APERSKEEANCSVCDSPG----DLGDQFF---CTTCGQHYHGMCL--------DIAVTAL 422

Query: 340 AKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEE 399
            ++      + CP   C +C+  ++  + D +  VC  C K YH  CL         + +
Sbjct: 423 KRA-----GWQCP--DCKVCQNCKHSGE-DSKMLVCDTCDKGYHTFCL-------QPVMD 467

Query: 400 GIITRAWEGLLPNHRILIYC 419
            + T  W+    N RI   C
Sbjct: 468 SVPTNGWK--CKNCRICAEC 485



 Score = 43.5 bits (101), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 53/126 (42%), Gaps = 26/126 (20%)

Query: 268  TKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHP 321
            +KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG  YHP
Sbjct: 983  SKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHP 1037

Query: 322  HCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKA 381
            +CVS  + +            ++  + + C   +C +C +   KA    +  +C  C  +
Sbjct: 1038 YCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCDDCDIS 1082

Query: 382  YHRKCL 387
            YH  CL
Sbjct: 1083 YHTYCL 1088


>gi|253747453|gb|EET02131.1| Hypothetical protein GL50581_575 [Giardia intestinalis ATCC 50581]
          Length = 292

 Score = 46.2 bits (108), Expect = 0.13,   Method: Composition-based stats.
 Identities = 44/171 (25%), Positives = 53/171 (30%), Gaps = 36/171 (21%)

Query: 230 CSFCDNGGDLLCCEGRCLRSFHATIDAGEESH-----CASLGLTKDEVEAMLNFFCKNCE 284
           CS C    D   C G C R F  T     E        ++      +      F C  C 
Sbjct: 7   CSICLGQADF-TCSGNCRRCFCKTCFTSPEVQSILQIASTPAQQSADSSTPATFLCLYCS 65

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSII 344
             Q+ C+ C +       T  + F C    CG     HC   L L    V   Q      
Sbjct: 66  RGQNICYYCFR------PTSQKDFVCTYNNCG--ISAHCTCLLSLYKQNVMPAQ------ 111

Query: 345 AGESFTCPLHKCCICKQGE---------NKADSDLQFAVCRRCPKAYHRKC 386
                 C LH C IC  G           K++  L    C RC K  H KC
Sbjct: 112 ------CLLHTCAICNAGSPYPFSRSRGRKSNESLML-TCARCLKTAHLKC 155


>gi|330805158|ref|XP_003290553.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
 gi|325079299|gb|EGC32905.1| hypothetical protein DICPUDRAFT_81283 [Dictyostelium purpureum]
          Length = 895

 Score = 46.2 bits (108), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 54/221 (24%), Positives = 81/221 (36%), Gaps = 67/221 (30%)

Query: 220 NEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFF 279
           N+E  + D  CS C +GGDLLCC+  C +SFH          C +  L  +E+    +++
Sbjct: 59  NKEGLINDDFCSSCKDGGDLLCCDS-CEKSFHLM--------CLNPPL--EEIPEG-DWY 106

Query: 280 CKNCEYKQHQCFACGKLGS--------------SDKETGAEVFPCVSATCGHFYHPHCVS 325
           C +C+YK+ +        +              S +E      P  S+  G       +S
Sbjct: 107 CNSCKYKKSKTNVTKSPSTTIINNKTYFKESEQSPEEMSPPYLPISSSPIG-----STMS 161

Query: 326 KLLLRDDEVAADQLAKSIIAGESFTCPLH-------------KCCICKQGENKADSDLQF 372
            L+        D L  S +   +F  P                C +C++  N  D     
Sbjct: 162 SLV--------DNL--SSVNPSTFQLPQEYTKNVNRNSSKKLNCLVCEESSNSGD----I 207

Query: 373 AVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNH 413
             C +C  AYH  C+       D    G  T AW  L P H
Sbjct: 208 LQCNKCNAAYHSTCV-------DSSSLGNKTSAW--LCPKH 239


>gi|222625793|gb|EEE59925.1| hypothetical protein OsJ_12562 [Oryza sativa Japonica Group]
          Length = 777

 Score = 46.2 bits (108), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 39/147 (26%), Positives = 61/147 (41%), Gaps = 39/147 (26%)

Query: 195 TKAMSGFIVDDMEEDMVH------DTEEDE--------SNEEDELFDSVCSFCDNGGDLL 240
           +K     +VD ++ D++H      D + D         S E D+  D  C  C +GG+L+
Sbjct: 365 SKPYRNVLVDGLDTDLLHCLINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLI 424

Query: 241 CCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML--NFFCKNCEYKQHQCFACGKLGS 298
           CC+G C  +FH               ++  E+EA+   ++ C  C      C  C +   
Sbjct: 425 CCDG-CPSTFH---------------MSCLELEALPSDDWRCAKC-----SCKFCQEHSR 463

Query: 299 SDKETGAEVFP--CVSATCGHFYHPHC 323
            D +  AEV    C  + C   YHP C
Sbjct: 464 QDAQDIAEVDSSLCTCSQCEEKYHPGC 490


>gi|72386767|ref|XP_843808.1| transcription activator [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62359823|gb|AAX80252.1| transcription activator, putative [Trypanosoma brucei]
 gi|70800340|gb|AAZ10249.1| transcription activator, putative [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 948

 Score = 45.8 bits (107), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 19/100 (19%)

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
           H+CF CG+           +F C  + C   YH  C ++ +L           + + A  
Sbjct: 795 HECFVCGE----GMRPLQPLFHC--SWCTKAYHAECANERVL----------PEGVSAPR 838

Query: 348 SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           ++TCP H+C +C   E  A +D    +C  CP A+   CL
Sbjct: 839 NWTCPRHRCDLC---EKVATTDGALFMCYECPAAFCFDCL 875


>gi|242017983|ref|XP_002429463.1| set domain protein, putative [Pediculus humanus corporis]
 gi|212514395|gb|EEB16725.1| set domain protein, putative [Pediculus humanus corporis]
          Length = 2269

 Score = 45.8 bits (107), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 48/211 (22%), Positives = 80/211 (37%), Gaps = 51/211 (24%)

Query: 220  NEEDELFDSVCSFCDNGGDLLC---CEGR----CLRSFHATIDAGEESHCASL------- 265
            NEE+ +      F +N   + C   C+G     C    +A +   +E+H   +       
Sbjct: 1236 NEEEYVCQICFEFLENEKTIKCKGLCQGSFHKSCTEKLNANLPPIDETHMPVIKNKRRRK 1295

Query: 266  ----GLTKDEVEAM-----LNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCG 316
                G +    EA       ++ C +C   ++ CF C     + +        C +A CG
Sbjct: 1296 LFANGDSDSPCEAQEKDEEQDWMCGDCLRGENPCFVCNSRSGNRQR-------CCAAYCG 1348

Query: 317  HFYHPHCVSKLLLRDDEVAADQLAKSIIAGES------------FTCPLHKCCICKQGEN 364
             FYH  C+ KL  +        + KS    ++             +CPLH C  C   +N
Sbjct: 1349 KFYHQKCL-KLFPQTIYNQGAPIRKSHALQQNKTSNSESPHYQILSCPLHVCHTC-ASDN 1406

Query: 365  KADSDLQF-----AVCRRCPKAYH--RKCLP 388
             +D + ++       C +CP AYH    C+P
Sbjct: 1407 PSDCNSKYLSERLVKCIKCPSAYHVGNYCVP 1437


>gi|261326898|emb|CBH09871.1| transcription activator, putative [Trypanosoma brucei gambiense
           DAL972]
          Length = 948

 Score = 45.8 bits (107), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 19/100 (19%)

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
           H+CF CG+           +F C  + C   YH  C ++ +L           + + A  
Sbjct: 795 HECFVCGE----GMRPLQPLFHC--SWCMKAYHAECANERVL----------PEGVSAPR 838

Query: 348 SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           ++TCP H+C +C   E  A +D    +C  CP A+   CL
Sbjct: 839 NWTCPRHRCDLC---EKVATTDGALFMCYECPAAFCFDCL 875


>gi|307207985|gb|EFN85544.1| Probable histone-lysine N-methyltransferase NSD2 [Harpegnathos
           saltator]
          Length = 599

 Score = 45.8 bits (107), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 50/195 (25%), Positives = 67/195 (34%), Gaps = 53/195 (27%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEES--HCASLGLTKDEVEAMLN------ 277
           + VC  C+  G L  C+G C   FH   +  GE S  H      ++D++   +N      
Sbjct: 232 EKVCQICEKTGKLTRCKGPCYSYFHLPCVKPGESSPEHSVDDNTSEDKIIDDVNVIKRSI 291

Query: 278 ----------------FFCKNC-EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYH 320
                           F C +C       CF C        E   +   C    CG  YH
Sbjct: 292 NDRDDNVKNEEQEDEMFKCIDCLSGVAPACFICN-------EREGDRIRCSVMACGKHYH 344

Query: 321 PHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADS-----DLQFAVC 375
             C+                +S   G   TCP H C  C   +N  DS     + + A C
Sbjct: 345 SMCLK------------SWPQSHWQGGRLTCPYHLCHTC-SSDNPQDSHSRAPNEKMAKC 391

Query: 376 RRCPKAYHRK--CLP 388
            RCP +YH    CLP
Sbjct: 392 VRCPSSYHTSTLCLP 406


>gi|195028344|ref|XP_001987036.1| GH21693 [Drosophila grimshawi]
 gi|193903036|gb|EDW01903.1| GH21693 [Drosophila grimshawi]
          Length = 1461

 Score = 45.8 bits (107), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 43/167 (25%), Positives = 63/167 (37%), Gaps = 29/167 (17%)

Query: 224 ELFDSVCSFCDNGGDLLCCEGRC--LRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCK 281
           +  +  CSFCD  G  + C+  C  +  F     AG      S  +   E  + +   C 
Sbjct: 151 DYLEQKCSFCDQYGASINCKMNCRQMHHFPCAAAAGCFLILESFTVFCTEHLSQVPLICS 210

Query: 282 NCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAK 341
           +      +C  C  LG   K     +  C S  CG  +H  C+    L D          
Sbjct: 211 D---NNVECLTCSSLGDLSK-----LIMCCS--CGDHFHSTCIGLANLPDTR-------- 252

Query: 342 SIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
                  ++C    KC IC+Q E    +D++F  C +C K YH  CL
Sbjct: 253 -----SGWSCARCTKCQICRQQE---ANDIKFVKCEQCQKIYHANCL 291


>gi|412989266|emb|CCO15857.1| polybromo-1 [Bathycoccus prasinos]
          Length = 2424

 Score = 45.4 bits (106), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 18/88 (20%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           +CS C   G L+CC+G C  +FHA + AGE ++ A          A   +FC        
Sbjct: 681 MCSVCGRSGSLVCCDG-CPSAFHA-VCAGEPTNRAL---------AETEWFCP------- 722

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCG 316
           +C A  +   SDKE  A+ FP +   C 
Sbjct: 723 ECVAVPEQIRSDKELRAQAFPFLRTECA 750


>gi|224121588|ref|XP_002330738.1| predicted protein [Populus trichocarpa]
 gi|222872514|gb|EEF09645.1| predicted protein [Populus trichocarpa]
          Length = 727

 Score = 45.4 bits (106), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 48/103 (46%), Gaps = 20/103 (19%)

Query: 221 EEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFC 280
           +ED+  D  C  C +GG+L+CC+  C  +FH           A L  T+D  E   +++C
Sbjct: 213 DEDDKNDDSCGLCGDGGELICCD-NCPSTFHQ----------ACL-CTEDLPEG--SWYC 258

Query: 281 KNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHC 323
            NC      C+ CG L  +DKE  + V       C H YH  C
Sbjct: 259 PNC-----TCWICGDL-VNDKEASSSVGAYKCLQCEHKYHGAC 295


>gi|342320470|gb|EGU12410.1| Hypothetical Protein RTG_01435 [Rhodotorula glutinis ATCC 204091]
          Length = 767

 Score = 45.4 bits (106), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 27/59 (45%), Gaps = 9/59 (15%)

Query: 230 CSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           C  C   G  LCCEG CLRSFH +        C    L  DEV    ++FCK C    H
Sbjct: 132 CDSCGGKGHFLCCEGGCLRSFHFS--------CLEPPLEIDEVPDE-SWFCKACRAAAH 181


>gi|380024451|ref|XP_003696009.1| PREDICTED: uncharacterized protein LOC100866111 [Apis florea]
          Length = 5713

 Score = 45.4 bits (106), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 44/172 (25%), Positives = 65/172 (37%), Gaps = 52/172 (30%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           C+FC + G  + C    C R FH          CA+      + ++ L+ FC      QH
Sbjct: 321 CAFCSHYGAGIPCKVASCNRYFHLP--------CAAASSCFQDTKS-LSLFCS-----QH 366

Query: 289 ------------QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAA 336
                        C  C  +G       + +  C  + CG  YH  CV   LL       
Sbjct: 367 LGQVPLLLNGDVTCMQCCGMGDV-----SNLVMC--SICGQHYHGSCVGLALL------- 412

Query: 337 DQLAKSIIAG-ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
                 + AG +  +C    C +C+Q E+ +    +  +C RC KAYH  CL
Sbjct: 413 ----PGVRAGWQCASC--RVCQVCRQPEDVS----KVMLCERCEKAYHPSCL 454


>gi|400599137|gb|EJP66841.1| PHD-finger domain-containing protein [Beauveria bassiana ARSEF
           2860]
          Length = 1226

 Score = 45.1 bits (105), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 36/70 (51%), Gaps = 16/70 (22%)

Query: 214 TEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVE 273
           T +D + + DE     CS C N GD+LCC+G C RSFH          C +L  ++D  +
Sbjct: 757 TPKDPATDNDEF----CSACGNAGDVLCCDG-CPRSFHF--------ECVNLAQSEDLPD 803

Query: 274 AMLNFFCKNC 283
              +++C  C
Sbjct: 804 ---DWYCNEC 810


>gi|390335528|ref|XP_003724176.1| PREDICTED: uncharacterized protein LOC591084 isoform 2
           [Strongylocentrotus purpuratus]
          Length = 4860

 Score = 45.1 bits (105), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 47/174 (27%), Positives = 69/174 (39%), Gaps = 30/174 (17%)

Query: 220 NEEDELFDSV---CSFCDN-GGDLLCCEGRCLRSFH--ATIDAGEESHCASLGLTKDEVE 273
           N +  +F  +   CS+C   G  + C E  C R +H      +G      SL L   E  
Sbjct: 307 NVDKAVFSGITQRCSYCHRFGATIFCVEQGCNRVYHYPCAASSGSFQGIKSLILLCPE-- 364

Query: 274 AMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDE 333
             L+   +  E ++ +C  C   G+      +E   C S  CG  YH  C+      D  
Sbjct: 365 -HLDLAEERVEMEELECAICDLPGNL-----SESLFCTS--CGQHYHGSCL------DPP 410

Query: 334 VAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           V+ D + ++      + CP   C IC+      D D +  VC  C K YH  CL
Sbjct: 411 VSIDPVVRA-----GWQCP--NCKICQTCRQPGD-DNKMLVCDTCDKGYHTFCL 456



 Score = 43.5 bits (101), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 43/101 (42%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C +CG  G      G+E      + CG  YHP+CVS  + +            ++  
Sbjct: 785 QDMCLSCGSFG-----LGSEGRLLTCSQCGQCYHPYCVSIKITK------------VVLS 827

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C    C +C +G  K+  + +  +C  C  +YH  CL
Sbjct: 828 KGWRC--LDCTVC-EGCGKSSDEARLLLCDDCDISYHTYCL 865


>gi|413920095|gb|AFW60027.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
          Length = 1339

 Score = 45.1 bits (105), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 65/163 (39%), Gaps = 50/163 (30%)

Query: 227  DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
            D  C FC +GG+LLCC+  C  ++H    + +E    S             ++C NC   
Sbjct: 958  DDTCGFCGDGGELLCCD-NCPSTYHQACLSAKELPEGS-------------WYCHNC--- 1000

Query: 287  QHQCFAC-GKLGSSDKETGAEVFPCVSATCGHFYHPHCV--SKLLLRDDEVAADQLAKSI 343
               C  C G     +  T + +F C    CG  YH  C+   KL L       DQ++++ 
Sbjct: 1001 --TCQVCGGPFSEKEVSTFSAIFKCFQ--CGDAYHDTCIEQEKLPLE------DQISQTW 1050

Query: 344  IAGESFTCPLHKCCICKQ----------GENKADSDLQFAVCR 376
              G+           CK+           +N  DSDL +++ R
Sbjct: 1051 FCGK----------YCKEIFIGLRSHVGTDNILDSDLSWSILR 1083


>gi|390335530|ref|XP_795757.3| PREDICTED: uncharacterized protein LOC591084 isoform 3
           [Strongylocentrotus purpuratus]
          Length = 4856

 Score = 45.1 bits (105), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 47/174 (27%), Positives = 69/174 (39%), Gaps = 30/174 (17%)

Query: 220 NEEDELFDSV---CSFCDN-GGDLLCCEGRCLRSFH--ATIDAGEESHCASLGLTKDEVE 273
           N +  +F  +   CS+C   G  + C E  C R +H      +G      SL L   E  
Sbjct: 306 NVDKAVFSGITQRCSYCHRFGATIFCVEQGCNRVYHYPCAASSGSFQGIKSLILLCPE-- 363

Query: 274 AMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDE 333
             L+   +  E ++ +C  C   G+      +E   C S  CG  YH  C+      D  
Sbjct: 364 -HLDLAEERVEMEELECAICDLPGNL-----SESLFCTS--CGQHYHGSCL------DPP 409

Query: 334 VAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           V+ D + ++      + CP   C IC+      D D +  VC  C K YH  CL
Sbjct: 410 VSIDPVVRA-----GWQCP--NCKICQTCRQPGD-DNKMLVCDTCDKGYHTFCL 455



 Score = 43.5 bits (101), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 43/101 (42%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C +CG  G      G+E      + CG  YHP+CVS  + +            ++  
Sbjct: 765 QDMCLSCGSFG-----LGSEGRLLTCSQCGQCYHPYCVSIKITK------------VVLS 807

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C    C +C +G  K+  + +  +C  C  +YH  CL
Sbjct: 808 KGWRC--LDCTVC-EGCGKSSDEARLLLCDDCDISYHTYCL 845


>gi|390335526|ref|XP_003724175.1| PREDICTED: uncharacterized protein LOC591084 isoform 1
           [Strongylocentrotus purpuratus]
          Length = 4873

 Score = 45.1 bits (105), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 47/174 (27%), Positives = 69/174 (39%), Gaps = 30/174 (17%)

Query: 220 NEEDELFDSV---CSFCDN-GGDLLCCEGRCLRSFH--ATIDAGEESHCASLGLTKDEVE 273
           N +  +F  +   CS+C   G  + C E  C R +H      +G      SL L   E  
Sbjct: 323 NVDKAVFSGITQRCSYCHRFGATIFCVEQGCNRVYHYPCAASSGSFQGIKSLILLCPE-- 380

Query: 274 AMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDE 333
             L+   +  E ++ +C  C   G+      +E   C S  CG  YH  C+      D  
Sbjct: 381 -HLDLAEERVEMEELECAICDLPGNL-----SESLFCTS--CGQHYHGSCL------DPP 426

Query: 334 VAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           V+ D + ++      + CP   C IC+      D D +  VC  C K YH  CL
Sbjct: 427 VSIDPVVRA-----GWQCP--NCKICQTCRQPGD-DNKMLVCDTCDKGYHTFCL 472



 Score = 43.5 bits (101), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 43/101 (42%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C +CG  G      G+E      + CG  YHP+CVS  + +            ++  
Sbjct: 782 QDMCLSCGSFG-----LGSEGRLLTCSQCGQCYHPYCVSIKITK------------VVLS 824

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C    C +C +G  K+  + +  +C  C  +YH  CL
Sbjct: 825 KGWRC--LDCTVC-EGCGKSSDEARLLLCDDCDISYHTYCL 862


>gi|393224442|gb|EJD32834.1| hypothetical protein AURDEDRAFT_178069 [Auricularia delicata
           TFB-10046 SS5]
          Length = 242

 Score = 45.1 bits (105), Expect = 0.33,   Method: Composition-based stats.
 Identities = 49/170 (28%), Positives = 67/170 (39%), Gaps = 36/170 (21%)

Query: 228 SVCSFCDNGGD--LLCCEGRCLRSFHA----TIDAGEESHCASLGLTKDEVEAMLNFFCK 281
           S C  C   GD  LL C+  CLR  H+    TI  G +   A +   +  V     + C+
Sbjct: 61  SSCDVCMQPGDVLLLRCD-MCLRRVHSLCLPTI-PGTDILAAVVRAEEQGVPLPRIWRCE 118

Query: 282 NCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAK 341
            C  +   C  C      D+  G EV  C   TC   YH  CV            ++   
Sbjct: 119 ECCPRS--CSLC------DRVVGKEVVTC--RTCKWTYHTGCVR-----------EKWGM 157

Query: 342 SIIAGES----FTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            ++AG+      TCP   C  C +    A  D   A+C  C KA+H+ CL
Sbjct: 158 VVLAGDQGFRCKTCPSESCDACGRA---APDDFGTAICGTCGKAWHKDCL 204


>gi|413920094|gb|AFW60026.1| hypothetical protein ZEAMMB73_389394 [Zea mays]
          Length = 1168

 Score = 44.7 bits (104), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 65/163 (39%), Gaps = 50/163 (30%)

Query: 227  DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
            D  C FC +GG+LLCC+  C  ++H    + +E    S             ++C NC   
Sbjct: 958  DDTCGFCGDGGELLCCD-NCPSTYHQACLSAKELPEGS-------------WYCHNC--- 1000

Query: 287  QHQCFAC-GKLGSSDKETGAEVFPCVSATCGHFYHPHCV--SKLLLRDDEVAADQLAKSI 343
               C  C G     +  T + +F C    CG  YH  C+   KL L       DQ++++ 
Sbjct: 1001 --TCQVCGGPFSEKEVSTFSAIFKCFQ--CGDAYHDTCIEQEKLPLE------DQISQTW 1050

Query: 344  IAGESFTCPLHKCCICKQ----------GENKADSDLQFAVCR 376
              G+           CK+           +N  DSDL +++ R
Sbjct: 1051 FCGK----------YCKEIFIGLRSHVGTDNILDSDLSWSILR 1083


>gi|126341226|ref|XP_001372106.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like
           [Monodelphis domestica]
          Length = 4862

 Score = 44.7 bits (104), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 48/200 (24%), Positives = 75/200 (37%), Gaps = 53/200 (26%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 285
           C+FC + G  + CCE +C + +H    AG        G  +D      NF     E+   
Sbjct: 294 CAFCKHLGATIKCCEEKCTQMYHYPCAAG-------AGTFQD----FSNFSLLCPEHIDQ 342

Query: 286 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 339
                 ++  C  C   G        + F C   TCG  YH  C+        ++A   L
Sbjct: 343 APERSKEEANCAVCDSPGDL-----VDQFFCT--TCGQHYHGMCL--------DIAVTPL 387

Query: 340 AKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEE 399
            ++      + CP   C +C+  ++  + D +  VC  C K YH  CL         + +
Sbjct: 388 KRA-----GWQCP--DCKVCQNCKHSGE-DSKMLVCDTCDKGYHTFCL-------QPIMD 432

Query: 400 GIITRAWEGLLPNHRILIYC 419
            + T  W+    N RI   C
Sbjct: 433 SVPTNGWK--CKNCRICAEC 450



 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 26/125 (20%)

Query: 269  KDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHPH 322
            KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG  YHP+
Sbjct: 952  KDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPY 1006

Query: 323  CVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAY 382
            CVS  + +            ++  + + C   +C +C +   KA    +  +C  C  +Y
Sbjct: 1007 CVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISY 1051

Query: 383  HRKCL 387
            H  CL
Sbjct: 1052 HTYCL 1056


>gi|405958289|gb|EKC24431.1| Histone-lysine N-methyltransferase MLL3 [Crassostrea gigas]
          Length = 4990

 Score = 44.7 bits (104), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 47/166 (28%), Positives = 65/166 (39%), Gaps = 38/166 (22%)

Query: 230  CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
            CS+C   G  + C    C + +H          CA+ G    + +  L   C +    Q 
Sbjct: 1771 CSYCRRYGATITCIYPECSKKYHYP--------CAAAGACFQD-KKKLAILCPD-HSDQA 1820

Query: 289  QCFA-----CGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 343
            +  A     C     +DK  G ++F C S  CGH YH  C+   +    EV A       
Sbjct: 1821 ETIAGEEAFCVLCCQADK-IGKQLF-CTS--CGHHYHGGCLHPSVALSPEVRA------- 1869

Query: 344  IAGESFTCPLHKCC-ICKQ-GENKADSDLQFAVCRRCPKAYHRKCL 387
                 + CP  K C +C+Q GE     D +  VC  C K YH  CL
Sbjct: 1870 ----GWQCPDCKVCQMCRQPGE-----DSKMLVCDTCDKGYHTFCL 1906


>gi|297601684|ref|NP_001051260.2| Os03g0747600 [Oryza sativa Japonica Group]
 gi|255674895|dbj|BAF13174.2| Os03g0747600 [Oryza sativa Japonica Group]
          Length = 640

 Score = 44.3 bits (103), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 39/147 (26%), Positives = 61/147 (41%), Gaps = 39/147 (26%)

Query: 195 TKAMSGFIVDDMEEDMVH------DTEEDE--------SNEEDELFDSVCSFCDNGGDLL 240
           +K     +VD ++ D++H      D + D         S E D+  D  C  C +GG+L+
Sbjct: 228 SKPYRNVLVDGLDTDLLHCLINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLI 287

Query: 241 CCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML--NFFCKNCEYKQHQCFACGKLGS 298
           CC+G C  +FH               ++  E+EA+   ++ C  C      C  C +   
Sbjct: 288 CCDG-CPSTFH---------------MSCLELEALPSDDWRCAKC-----SCKFCQEHSR 326

Query: 299 SDKETGAEVFP--CVSATCGHFYHPHC 323
            D +  AEV    C  + C   YHP C
Sbjct: 327 QDAQDIAEVDSSLCTCSQCEEKYHPGC 353


>gi|444724233|gb|ELW64844.1| Histone-lysine N-methyltransferase MLL3 [Tupaia chinensis]
          Length = 4664

 Score = 44.3 bits (103), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 36/146 (24%), Positives = 58/146 (39%), Gaps = 26/146 (17%)

Query: 248 RSFHATIDAGEESHCASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDK 301
           R F       +++  A     KDE E  ++     F  + ++  HQ  C  CG  G    
Sbjct: 688 RKFSPGRPRSKQASTADTPSCKDEEENSMHNTVVLFSSSDKFTAHQDMCVVCGSFGQ--- 744

Query: 302 ETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQ 361
             GAE      + CG  YHP+CVS  + +            ++  + + C   +C +C +
Sbjct: 745 --GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-E 787

Query: 362 GENKADSDLQFAVCRRCPKAYHRKCL 387
              KA    +  +C  C  +YH  CL
Sbjct: 788 ACGKATDPGRLLLCDDCDISYHTYCL 813


>gi|255566581|ref|XP_002524275.1| DNA binding protein, putative [Ricinus communis]
 gi|223536466|gb|EEF38114.1| DNA binding protein, putative [Ricinus communis]
          Length = 1336

 Score = 44.3 bits (103), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 47/104 (45%), Gaps = 24/104 (23%)

Query: 222 EDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCK 281
           +D+  D  C  C +GG+L+CC+  C  +FH    + EE    S             ++C 
Sbjct: 861 DDDENDDSCGLCGDGGELICCDN-CPSTFHQACLSTEELPEGS-------------WYCP 906

Query: 282 NCEYKQHQCFACGKLGSSDKE--TGAEVFPCVSATCGHFYHPHC 323
           NC      C+ CG+L  +DKE    +  F C  + C H YH  C
Sbjct: 907 NC-----TCWICGEL-VNDKEDINSSNAFKC--SQCEHKYHDSC 942


>gi|189230178|ref|NP_001121413.1| uncharacterized protein LOC100158501 [Xenopus (Silurana)
           tropicalis]
 gi|183985616|gb|AAI66113.1| LOC100158501 protein [Xenopus (Silurana) tropicalis]
          Length = 336

 Score = 44.3 bits (103), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 39/80 (48%), Gaps = 13/80 (16%)

Query: 209 DMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLT 268
           D+  D E  + N+++      CS C +GGDLLCCE  C RSFH+        H  S    
Sbjct: 181 DICTDCESLQQNDDE------CSVCGDGGDLLCCE-ECPRSFHSNC------HVPSTSEA 227

Query: 269 KDEVEAMLNFFCKNCEYKQH 288
           K   E+ +  FCK  + + H
Sbjct: 228 KSSSESWICTFCKMQQMRMH 247


>gi|213627659|gb|AAI70319.1| Ectodermin [Xenopus laevis]
          Length = 1091

 Score = 44.3 bits (103), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D SN++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 840 DGSNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 875


>gi|213625298|gb|AAI70291.1| Ectodermin [Xenopus laevis]
          Length = 1091

 Score = 44.3 bits (103), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D SN++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 840 DGSNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 875


>gi|225461640|ref|XP_002283071.1| PREDICTED: uncharacterized protein LOC100248637 [Vitis vinifera]
          Length = 1444

 Score = 44.3 bits (103), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 48/111 (43%), Gaps = 22/111 (19%)

Query: 227  DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
            D  C  C +GG+L+CC+  C  +FH          C S    K+  E   N++C NC   
Sbjct: 1031 DDSCGLCGDGGELICCDN-CPSTFHQA--------CLS---AKELPEG--NWYCPNC--- 1073

Query: 287  QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 337
               C  CG L   D+E  +       + C H YH  C+ +  ++  EV  D
Sbjct: 1074 --TCRICGDL-VKDREASSSFLALKCSQCEHKYHMPCLKEKCVK--EVGGD 1119


>gi|81294213|gb|AAI07916.1| Trim24 protein [Rattus norvegicus]
          Length = 607

 Score = 44.3 bits (103), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCAS--------LGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH T      +H  S          L+K
Sbjct: 374 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLTCHVPTLTHFPSGEWICTFCRDLSK 432

Query: 270 DEVE 273
            EVE
Sbjct: 433 PEVE 436


>gi|193785757|dbj|BAG51192.1| unnamed protein product [Homo sapiens]
          Length = 759

 Score = 44.3 bits (103), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 509 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 544


>gi|147783856|emb|CAN65752.1| hypothetical protein VITISV_026339 [Vitis vinifera]
          Length = 1380

 Score = 43.9 bits (102), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 48/111 (43%), Gaps = 22/111 (19%)

Query: 227  DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
            D  C  C +GG+L+CC+  C  +FH          C S    K+  E   N++C NC   
Sbjct: 967  DDSCGLCGDGGELICCDN-CPSTFHQA--------CLS---AKELPEG--NWYCPNC--- 1009

Query: 287  QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 337
               C  CG L   D+E  +       + C H YH  C+ +  ++  EV  D
Sbjct: 1010 --TCRICGDL-VKDREASSSFLALKCSQCEHKYHMPCLKEKCVK--EVGGD 1055


>gi|395730024|ref|XP_002810443.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pongo
           abelii]
          Length = 1127

 Score = 43.9 bits (102), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D SN++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGSNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|297664001|ref|XP_002810444.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pongo
           abelii]
          Length = 1110

 Score = 43.9 bits (102), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D SN++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGSNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|426330876|ref|XP_004026430.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like [Gorilla gorilla
           gorilla]
          Length = 759

 Score = 43.9 bits (102), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 509 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 544


>gi|15232453|ref|NP_188116.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
           domain [Arabidopsis thaliana]
 gi|332642075|gb|AEE75596.1| Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger
           domain [Arabidopsis thaliana]
          Length = 1189

 Score = 43.9 bits (102), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 22/113 (19%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN 277
           E   +D+  D  C  C +GG+L+CC+  C  +FH          C S+     +V    +
Sbjct: 717 EKASDDDPNDDSCGVCGDGGELICCDN-CPSTFHQA--------CLSM-----QVLPEGS 762

Query: 278 FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLR 330
           ++C +C      C+ C +L S + E   + F C  + C H YH  C+  +  R
Sbjct: 763 WYCSSC-----TCWICSELVSDNAERSQD-FKC--SQCAHKYHGTCLQGISKR 807


>gi|301123573|ref|XP_002909513.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262100275|gb|EEY58327.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1294

 Score = 43.9 bits (102), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 29/74 (39%), Positives = 37/74 (50%), Gaps = 18/74 (24%)

Query: 223  DELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTK-DEVEAMLNFFCK 281
            D+ +D  CS C  GG+LLCC+G C R+FH T           +GL K  E E    +FC 
Sbjct: 1188 DDQWDVDCSVCGLGGELLCCDG-CPRAFHVTC----------IGLEKIPETE----WFCN 1232

Query: 282  NCEYKQHQCFACGK 295
             C  +   C AC K
Sbjct: 1233 ECNLQ--TCGACKK 1244


>gi|8777481|dbj|BAA97061.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1145

 Score = 43.9 bits (102), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 22/113 (19%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN 277
           E   +D+  D  C  C +GG+L+CC+  C  +FH          C S+     +V    +
Sbjct: 673 EKASDDDPNDDSCGVCGDGGELICCDN-CPSTFHQA--------CLSM-----QVLPEGS 718

Query: 278 FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLR 330
           ++C +C      C+ C +L S + E   + F C  + C H YH  C+  +  R
Sbjct: 719 WYCSSC-----TCWICSELVSDNAERSQD-FKC--SQCAHKYHGTCLQGISKR 763


>gi|344276554|ref|XP_003410073.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           MLL3-like [Loxodonta africana]
          Length = 4785

 Score = 43.9 bits (102), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 54/131 (41%), Gaps = 26/131 (19%)

Query: 263 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 316
           A +   KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG
Sbjct: 873 ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCG 927

Query: 317 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 376
             YHP+CVS  + +            ++  + + C   +C +C +   KA    +  +C 
Sbjct: 928 QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 972

Query: 377 RCPKAYHRKCL 387
            C  +YH  CL
Sbjct: 973 DCDISYHTYCL 983


>gi|242077796|ref|XP_002448834.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
 gi|241940017|gb|EES13162.1| hypothetical protein SORBIDRAFT_06g034065 [Sorghum bicolor]
          Length = 1357

 Score = 43.9 bits (102), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 66/163 (40%), Gaps = 50/163 (30%)

Query: 227  DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
            D  C FC +GG+LLCC+  C  ++H    + +E    S             ++C NC   
Sbjct: 982  DDTCGFCGDGGELLCCD-NCPSTYHPACLSAKELPEGS-------------WYCHNC--- 1024

Query: 287  QHQCFAC-GKLGSSDKETGAEVFPCVSATCGHFYHPHCV--SKLLLRDDEVAADQLAKSI 343
               C  C G +   +  T + +F C    CG  YH  C+   KL L       DQ++++ 
Sbjct: 1025 --TCQICGGPVSEKEVSTFSAIFKCFQ--CGDAYHDTCIEQEKLPLE------DQISQTW 1074

Query: 344  IAGESFTCPLHKCCICKQ----------GENKADSDLQFAVCR 376
              G+           CK+           EN  DS+L +++ R
Sbjct: 1075 FCGK----------YCKEIFIGLRSHVGTENILDSELSWSILR 1107


>gi|218195887|gb|EEC78314.1| hypothetical protein OsI_18046 [Oryza sativa Indica Group]
          Length = 517

 Score = 43.9 bits (102), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 22/104 (21%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           D  C FC +GG+LLCC+  C  ++H T  + +E    S             ++C NC   
Sbjct: 5   DDTCGFCGDGGELLCCD-NCPSTYHQTCLSDQELPEGS-------------WYCHNC--- 47

Query: 287 QHQCFACGK-LGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 329
              C +CG  L   +  T + +  C+   CG  YH  C+ + +L
Sbjct: 48  --TCRSCGNPLSEKEVSTFSAILKCLQ--CGDSYHDTCIDQEML 87


>gi|222629834|gb|EEE61966.1| hypothetical protein OsJ_16739 [Oryza sativa Japonica Group]
          Length = 517

 Score = 43.9 bits (102), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 22/104 (21%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           D  C FC +GG+LLCC+  C  ++H T  + +E    S             ++C NC   
Sbjct: 5   DDTCGFCGDGGELLCCD-NCPSTYHQTCLSDQELPEGS-------------WYCHNC--- 47

Query: 287 QHQCFACGK-LGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 329
              C +CG  L   +  T + +  C+   CG  YH  C+ + +L
Sbjct: 48  --TCRSCGNPLSEKEVSTFSAILKCLQ--CGDSYHDTCIDQEML 87


>gi|8843783|dbj|BAA97331.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1095

 Score = 43.9 bits (102), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 54/126 (42%), Gaps = 28/126 (22%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           D  CS C  GG L+ C+G C  +FHA        +C  L    D      ++FC++C   
Sbjct: 703 DVFCSVCHYGGKLILCDG-CPSAFHA--------NCLGLEDVPDG-----DWFCQSC--- 745

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSA-TCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
              C ACG+       T A+    +S   C   YHP C     LR D  A D L K  I 
Sbjct: 746 --CCGACGQFFLKTTSTNAKEEKFISCKQCELKYHPSC-----LRYDG-ACDSLDK--IL 795

Query: 346 GESFTC 351
           GE + C
Sbjct: 796 GEKWFC 801


>gi|357167602|ref|XP_003581243.1| PREDICTED: uncharacterized protein LOC100841912 [Brachypodium
            distachyon]
          Length = 1317

 Score = 43.5 bits (101), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 48/108 (44%), Gaps = 22/108 (20%)

Query: 227  DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
            D  C  C +GG+L+CC+  C  S+H      +E    S             ++C +C   
Sbjct: 1001 DDTCGLCGDGGELICCDN-CPASYHVACLPSQEIPDGS-------------WYCSSC--- 1043

Query: 287  QHQCFACGKLGSS-DKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDE 333
              +C  CG++ SS +  T    F C  + C   YH  C+S  +L ++E
Sbjct: 1044 --RCDVCGEVVSSKEPRTPLHAFEC--SQCERQYHIKCISGKVLCNEE 1087


>gi|296132030|ref|YP_003639277.1| TDP-4-keto-6-deoxy-D-glucose transaminase [Thermincola potens JR]
 gi|296030608|gb|ADG81376.1| TDP-4-keto-6-deoxy-D-glucose transaminase [Thermincola potens JR]
          Length = 377

 Score = 43.5 bits (101), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 59/268 (22%), Positives = 95/268 (35%), Gaps = 37/268 (13%)

Query: 373 AVCRRCPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYCLKHEIDDEIGTPI 432
           A   R  K       P  +  ++KL E  +T   + ++P H   + C             
Sbjct: 88  AFVLRGAKIVFTDIRPDTMNMDEKLIERAVTERTKAIVPVHYAGVAC------------- 134

Query: 433 RDHIIFPGIEENKTIIDRPRKKQSLASPSGKQKVAST-KSSLTSKAPPQGKFSVKALKRV 491
                     +  TI+    K   L      Q V ST K          G +S    K  
Sbjct: 135 ----------DMDTIMKIAGKHNLLVIEDAAQAVMSTYKGKALGTIGHLGCYSFHDTKNY 184

Query: 492 PSKAGQGETMEISERLLVGSDSSRRAKATDVSRKSFKGNVKSLSVQVDRSSSVDSKKTSL 551
               G+G  + I++R  V      R K T+  R SF      L  +VD+ S +D   + L
Sbjct: 185 T--CGEGGALLINDRRFVDRAEYIREKGTN--RSSF------LRGEVDKYSWIDVGSSYL 234

Query: 552 GERLYAAFVTEGTEQTKFGKQDNSDRETSRTVTVKPLRKKLISELPSLDEDSKRRLSSL- 610
              L AAF+    E+ +   +D         + +KPLR + + ELP + +D +   S   
Sbjct: 235 PSELNAAFLYAQLEEAEAINRDRLKSWERYYIGLKPLRDRGVIELPHIPDDCRHNASIFY 294

Query: 611 --MKDAASSVRMEEILKRHKIPSTHAYA 636
             ++D        + L+ H +     Y 
Sbjct: 295 IKVRDLQERTEFLQYLREHGVQGAFHYV 322


>gi|302142909|emb|CBI20204.3| unnamed protein product [Vitis vinifera]
          Length = 1300

 Score = 43.5 bits (101), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 48/111 (43%), Gaps = 22/111 (19%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           D  C  C +GG+L+CC+  C  +FH          C S    K+  E   N++C NC   
Sbjct: 870 DDSCGLCGDGGELICCDN-CPSTFHQA--------CLS---AKELPEG--NWYCPNC--- 912

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 337
              C  CG L   D+E  +       + C H YH  C+ +  ++  EV  D
Sbjct: 913 --TCRICGDL-VKDREASSSFLALKCSQCEHKYHMPCLKEKCVK--EVGGD 958


>gi|350596089|ref|XP_003125883.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1, partial
           [Sus scrofa]
          Length = 881

 Score = 43.5 bits (101), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D SN++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 631 DGSNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 666


>gi|67971672|dbj|BAE02178.1| unnamed protein product [Macaca fascicularis]
          Length = 592

 Score = 43.5 bits (101), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 359 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 394


>gi|449492124|ref|XP_002187267.2| PREDICTED: histone-lysine N-methyltransferase MLL3 [Taeniopygia
           guttata]
          Length = 4871

 Score = 43.5 bits (101), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 51/222 (22%), Positives = 87/222 (39%), Gaps = 53/222 (23%)

Query: 206 MEEDMVHDTEEDESNEEDELFDSVCSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCAS 264
           M+E ++ D ++   + E E     C+FC + G  + CCE +C + +H    AG       
Sbjct: 235 MKEQLLMDVDKSCVSGETER----CAFCKHLGATIKCCEEKCTQMYHYPCAAG------- 283

Query: 265 LGLTKDEVEAMLNFFCKN-------CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGH 317
            G  +D   + L+  C +          ++  C  C   G    +   ++F     TCG 
Sbjct: 284 AGTFQDF--SNLSLLCPDHIDQAPERSKEEANCAVCDSPG----DLLDQLF---CTTCGQ 334

Query: 318 FYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRR 377
            YH  C+      D +V   + A        + CP   C +C+  ++  + D +  VC  
Sbjct: 335 HYHGMCL------DIQVTPLKRA-------GWQCP--DCKVCQNCKHSGE-DNKMLVCDT 378

Query: 378 CPKAYHRKCLPRKIAFEDKLEEGIITRAWEGLLPNHRILIYC 419
           C K YH  CL         + + + T  W+    N R+   C
Sbjct: 379 CDKGYHTFCL-------QPVMDAVPTNGWK--CKNCRVCAEC 411



 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 26/125 (20%)

Query: 269 KDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHPH 322
           KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG  YHP+
Sbjct: 886 KDEEENSMHNTVVLFSNSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPY 940

Query: 323 CVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAY 382
           CVS  + +            ++  + + C   +C +C +   KA    +  +C  C  +Y
Sbjct: 941 CVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISY 985

Query: 383 HRKCL 387
           H  CL
Sbjct: 986 HTYCL 990


>gi|342877378|gb|EGU78844.1| hypothetical protein FOXB_10633 [Fusarium oxysporum Fo5176]
          Length = 1223

 Score = 43.5 bits (101), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 39/88 (44%), Gaps = 24/88 (27%)

Query: 201 FIVDDMEEDMVHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEES 260
              + + +D V D +ED            CS C   GD++CC+G C RSFH         
Sbjct: 775 LATNGVAKDQVSDNDED------------CSACGAAGDVVCCDG-CPRSFHF-------- 813

Query: 261 HCASLGLTKDEVEAMLNFFCKNCEYKQH 288
            C  + +T D +     +FC  C YK++
Sbjct: 814 ECVGM-ITSDHLPD--EWFCNECLYKRY 838


>gi|47228227|emb|CAG07622.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 4527

 Score = 43.5 bits (101), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 36/142 (25%), Positives = 56/142 (39%), Gaps = 25/142 (17%)

Query: 251 HATIDAGEESHCASLGLTKDEVEAMLN---FFCKNCEY--KQHQCFACGKLGSSDKETGA 305
           +A+I+ G  +      + ++E  AM N    F  N  +  KQ  C  CG  G      GA
Sbjct: 628 NASINPGVITVETPYPVKEEEENAMHNTVVIFSSNDNFTLKQDMCVVCGSFG-----LGA 682

Query: 306 EVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENK 365
           E      A CG  YHP+CV   +              ++  + + C   +C +C+     
Sbjct: 683 EGRLLACAQCGQCYHPYCVGIKI------------NKVVLSKGWRC--LECTVCEACGQA 728

Query: 366 ADSDLQFAVCRRCPKAYHRKCL 387
            D   +  +C  C  +YH  CL
Sbjct: 729 TDPG-RLLLCDDCDISYHTYCL 749


>gi|149031399|gb|EDL86389.1| rCG56742 [Rattus norvegicus]
          Length = 4499

 Score = 43.5 bits (101), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 32/131 (24%), Positives = 55/131 (41%), Gaps = 26/131 (19%)

Query: 263 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 316
           A +   KDE E  ++     F  + ++  HQ  C  CG  G  ++     +  C  + CG
Sbjct: 439 ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQGEE---GRLLAC--SQCG 493

Query: 317 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 376
             YHP+CVS  + +            ++  + + C   +C +C +   KA    +  +C 
Sbjct: 494 QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 538

Query: 377 RCPKAYHRKCL 387
            C  +YH  CL
Sbjct: 539 DCDISYHTYCL 549


>gi|20521928|dbj|BAA96030.2| KIAA1506 protein [Homo sapiens]
          Length = 3310

 Score = 43.5 bits (101), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 402 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 444

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 445 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 482


>gi|355726088|gb|AES08760.1| tripartite motif-containing 33 [Mustela putorius furo]
          Length = 616

 Score = 43.1 bits (100), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D S+++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 432 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 467


>gi|291397406|ref|XP_002715125.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 3-like
           [Oryctolagus cuniculus]
          Length = 4865

 Score = 43.1 bits (100), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 54/131 (41%), Gaps = 26/131 (19%)

Query: 263 ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 316
           A +   KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG
Sbjct: 878 ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCG 932

Query: 317 HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 376
             YHP+CVS  + +            ++  + + C   +C +C +   KA    +  +C 
Sbjct: 933 QCYHPYCVSIKISK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 977

Query: 377 RCPKAYHRKCL 387
            C  +YH  CL
Sbjct: 978 DCDISYHTYCL 988


>gi|431895735|gb|ELK05154.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
          Length = 4032

 Score = 43.1 bits (100), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 44/109 (40%), Gaps = 23/109 (21%)

Query: 279 FCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQ 338
            C +C   Q  C  CG  G      GAE      + CG  YHP+CVS  + +        
Sbjct: 3   VCSSC---QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK-------- 46

Query: 339 LAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
               ++  + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 47  ----VVLSKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 88


>gi|156230798|gb|AAI51838.1| MLL3 protein [Homo sapiens]
          Length = 3314

 Score = 43.1 bits (100), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 402 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 444

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 445 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 482


>gi|334323908|ref|XP_001382045.2| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 1
           [Monodelphis domestica]
          Length = 1143

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 25/35 (71%), Gaps = 1/35 (2%)

Query: 219 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           +N+ED+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 895 NNKEDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 928


>gi|15237720|ref|NP_200669.1| putative PHD finger transcription factor [Arabidopsis thaliana]
 gi|332009693|gb|AED97076.1| putative PHD finger transcription factor [Arabidopsis thaliana]
          Length = 1065

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 54/126 (42%), Gaps = 28/126 (22%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           D  CS C  GG L+ C+G C  +FHA        +C  L    D      ++FC++C   
Sbjct: 694 DVFCSVCHYGGKLILCDG-CPSAFHA--------NCLGLEDVPDG-----DWFCQSC--- 736

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSA-TCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
              C ACG+       T A+    +S   C   YHP C     LR D  A D L K  I 
Sbjct: 737 --CCGACGQFFLKTTSTNAKEEKFISCKQCELKYHPSC-----LRYDG-ACDSLDK--IL 786

Query: 346 GESFTC 351
           GE + C
Sbjct: 787 GEKWFC 792


>gi|334323910|ref|XP_003340460.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 2
           [Monodelphis domestica]
          Length = 1126

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 25/35 (71%), Gaps = 1/35 (2%)

Query: 219 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           +N+ED+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 895 NNKEDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 928


>gi|440893214|gb|ELR46066.1| Transcription intermediary factor 1-alpha, partial [Bos grunniens
           mutus]
          Length = 940

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 11/92 (11%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH        A   +GE        L+K
Sbjct: 707 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 765

Query: 270 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 301
            EVE   +    N E ++ + F   KL   DK
Sbjct: 766 PEVEYDCDVPVHNSEKRKTEGFV--KLTPIDK 795


>gi|363729903|ref|XP_418542.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gallus gallus]
          Length = 4906

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 26/125 (20%)

Query: 269  KDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHPH 322
            KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG  YHP+
Sbjct: 916  KDEEENSMHNTVVLFSNSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPY 970

Query: 323  CVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAY 382
            CVS  + +            ++  + + C   +C +C +   KA    +  +C  C  +Y
Sbjct: 971  CVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISY 1015

Query: 383  HRKCL 387
            H  CL
Sbjct: 1016 HTYCL 1020



 Score = 40.8 bits (94), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 76/198 (38%), Gaps = 49/198 (24%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKN------ 282
           C++C + G  + CCE +C + +H    AG        G  +D   + L+  C +      
Sbjct: 287 CAYCKHLGATIKCCEEKCTQMYHYPCAAG-------AGTFQDF--SNLSLLCPDHIDQAP 337

Query: 283 -CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAK 341
               ++  C  C   G    +   ++F     TCG  YH  C+      D +V   + A 
Sbjct: 338 ERSKEEANCAVCDSPG----DLLDQLF---CTTCGQHYHGMCL------DIQVTPLKRA- 383

Query: 342 SIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGI 401
                  + CP   C +C+  ++  + D +  VC  C K YH  CL         + + +
Sbjct: 384 ------GWQCP--DCKVCQNCKHSGE-DNKMLVCDTCDKGYHTFCL-------QPVMDSV 427

Query: 402 ITRAWEGLLPNHRILIYC 419
            T  W+    N R+   C
Sbjct: 428 PTNGWK--CKNCRVCAEC 443


>gi|41350061|gb|AAS00364.1| unknown [Homo sapiens]
          Length = 2185

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 18  QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 60

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 61  KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 98


>gi|149411557|ref|XP_001506399.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33, partial
           [Ornithorhynchus anatinus]
          Length = 1050

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 25/35 (71%), Gaps = 1/35 (2%)

Query: 219 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           +N+ED+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 802 NNKEDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 835


>gi|413923002|gb|AFW62934.1| putative SET-domain containing protein family, partial [Zea mays]
          Length = 248

 Score = 43.1 bits (100), Expect = 1.1,   Method: Composition-based stats.
 Identities = 29/100 (29%), Positives = 37/100 (37%), Gaps = 25/100 (25%)

Query: 289 QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGES 348
           +C  C K+    +E       C    C   +H  CV++               S    ES
Sbjct: 147 ECRLCSKIIYVGEE-----IKCSVKKCPQMFHLKCVAE-------------DTSNFTVES 188

Query: 349 FTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLP 388
           F CP H C +CKQ        + F  C RC  A H KC P
Sbjct: 189 FRCPQHGCMVCKQ-------KMFFWRCGRCTVAAHTKCAP 221


>gi|395535724|ref|XP_003769871.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Sarcophilus
           harrisii]
          Length = 962

 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 25/35 (71%), Gaps = 1/35 (2%)

Query: 219 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           SN+E++  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 714 SNKEEDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 747


>gi|354478318|ref|XP_003501362.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Cricetulus
            griseus]
          Length = 4871

 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 54/131 (41%), Gaps = 26/131 (19%)

Query: 263  ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 316
            A +   KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG
Sbjct: 903  ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCG 957

Query: 317  HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 376
              YHP+CVS  + +            ++  + + C   +C +C +   KA    +  +C 
Sbjct: 958  QCYHPYCVSIKISK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 1002

Query: 377  RCPKAYHRKCL 387
             C  +YH  CL
Sbjct: 1003 DCDISYHTYCL 1013


>gi|351697386|gb|EHB00305.1| E3 ubiquitin-protein ligase TRIM33 [Heterocephalus glaber]
          Length = 980

 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 730 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 765


>gi|346324763|gb|EGX94360.1| Zinc finger domain-containing protein, PHD-finger [Cordyceps
           militaris CM01]
          Length = 1368

 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 30/57 (52%), Gaps = 12/57 (21%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNC 283
           D  CS C N GD+LCC+G C RSFH          C +L  ++D  +   +++C  C
Sbjct: 896 DEFCSACGNAGDVLCCDG-CPRSFHF--------ECVNLTQSEDLPD---DWYCSEC 940


>gi|4325109|gb|AAD17259.1| transcriptional intermediary factor 1 gamma [Homo sapiens]
          Length = 1120

 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|195122760|ref|XP_002005879.1| GI18846 [Drosophila mojavensis]
 gi|193910947|gb|EDW09814.1| GI18846 [Drosophila mojavensis]
          Length = 1465

 Score = 43.1 bits (100), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 43/168 (25%), Positives = 62/168 (36%), Gaps = 29/168 (17%)

Query: 223 DELFDSVCSFCDNGGDLLCCEGRCLRSFH--ATIDAGEESHCASLGLTKDEVEAMLNFFC 280
           +E  +  CSFC   G  + C+  C +  H      AG      S  +   E  + +   C
Sbjct: 146 NEYLEQKCSFCGQYGASITCKMNCRQVHHWPCAAAAGCFLILESFTVFCTEHLSQVPLIC 205

Query: 281 KNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLA 340
            +       C  C  LG   K     +  C S  CG  +H  C+    L D         
Sbjct: 206 TD---NNVGCLTCSSLGDLSK-----LIMCCS--CGDHFHSTCIGLANLPDTR------- 248

Query: 341 KSIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
                   ++C    KC IC+Q E    +D++F  C +C K YH  CL
Sbjct: 249 ------SGWSCARCTKCQICRQQE---ANDIKFIKCEQCQKIYHATCL 287


>gi|402855754|ref|XP_003892480.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 3 [Papio
           anubis]
          Length = 1151

 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 901 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 936


>gi|402855750|ref|XP_003892478.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Papio
           anubis]
          Length = 1127

 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|348587088|ref|XP_003479300.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 2 [Cavia
           porcellus]
          Length = 1128

 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 878 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 913


>gi|332809940|ref|XP_513668.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Pan
           troglodytes]
 gi|410219030|gb|JAA06734.1| tripartite motif containing 33 [Pan troglodytes]
 gi|410250342|gb|JAA13138.1| tripartite motif containing 33 [Pan troglodytes]
 gi|410297422|gb|JAA27311.1| tripartite motif containing 33 [Pan troglodytes]
          Length = 1127

 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|297279625|ref|XP_001099267.2| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 1 [Macaca
           mulatta]
          Length = 1151

 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 901 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 936


>gi|383419055|gb|AFH32741.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Macaca mulatta]
          Length = 1127

 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|348587086|ref|XP_003479299.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33-like isoform 1 [Cavia
           porcellus]
          Length = 1111

 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 878 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 913


>gi|74027249|ref|NP_056990.3| E3 ubiquitin-protein ligase TRIM33 isoform alpha [Homo sapiens]
 gi|313104270|sp|Q9UPN9.3|TRI33_HUMAN RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
           Full=Ectodermin homolog; AltName: Full=RET-fused gene 7
           protein; Short=Protein Rfg7; AltName: Full=Transcription
           intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
           Full=Tripartite motif-containing protein 33
          Length = 1127

 Score = 43.1 bits (100), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|431911670|gb|ELK13818.1| Transcription intermediary factor 1-alpha [Pteropus alecto]
          Length = 1615

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 44/92 (47%), Gaps = 13/92 (14%)

Query: 218  ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
            E+ +ED+  +  C+ C NGG+LLCCE +C + FH        A   +GE        L+K
Sbjct: 1384 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 1442

Query: 270  DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 301
             EVE   +    N E K+      GKL   DK
Sbjct: 1443 PEVEYDCDAPSHNSEKKKTD----GKLTPIDK 1470


>gi|256084142|ref|XP_002578291.1| SET domain protein [Schistosoma mansoni]
          Length = 1746

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 46/131 (35%), Gaps = 34/131 (25%)

Query: 303 TGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE--SFTCPLHKCCICK 360
           +G ++  C    C  +YHP C+ K             A  +  G   SFTCP H C  C 
Sbjct: 642 SGDQMIRCSVRACRRWYHPSCLRK----------PPFAVVVREGRSGSFTCPAHTCLACS 691

Query: 361 QGE--NKADSDLQFAVCRRCPKAYHRK--CLPRKIAFEDKLEEGIITRAWEGLLPNHRIL 416
                        F  C  CP AYH    CLP                  + + PN   L
Sbjct: 692 AETPGTMPRPSPHFIRCVMCPAAYHPGDWCLP---------------AGSKEIAPN---L 733

Query: 417 IYCLKHEIDDE 427
           I C +H + DE
Sbjct: 734 IICPRHALQDE 744


>gi|397468071|ref|XP_003805720.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
            [Pan paniscus]
          Length = 1258

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217  DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
            D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 1008 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 1043


>gi|345324243|ref|XP_003430797.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
            MLL3-like [Ornithorhynchus anatinus]
          Length = 4910

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 52/125 (41%), Gaps = 26/125 (20%)

Query: 269  KDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHPH 322
            KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG  YHP+
Sbjct: 936  KDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPY 990

Query: 323  CVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAY 382
            CVS  + +            ++  + + C   +C +C +   KA    +  +C  C  +Y
Sbjct: 991  CVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISY 1035

Query: 383  HRKCL 387
            H  CL
Sbjct: 1036 HTYCL 1040



 Score = 42.4 bits (98), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 48/196 (24%), Positives = 76/196 (38%), Gaps = 45/196 (22%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAG-----EESHCASLGLTKDEVEAMLNFFCKNC 283
           C+FC + G  + CCE +C + +H    AG     + SH + L     +     +    NC
Sbjct: 280 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTFQDFSHSSLLCPEHIDQAPERSKEEANC 339

Query: 284 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 343
                 C + G L         + F C   TCG  YH  C+        ++A   L ++ 
Sbjct: 340 AV----CDSPGDL--------LDQFYCT--TCGQHYHGMCL--------DIAITPLKRA- 376

Query: 344 IAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGIIT 403
                + CP   C +C+  ++  + D +  VC  C K YH  CL   I       + + T
Sbjct: 377 ----GWQCP--DCKVCQNCKHSGE-DNKMLVCDTCDKGYHTFCLQPVI-------DSVPT 422

Query: 404 RAWEGLLPNHRILIYC 419
             W+    N R+   C
Sbjct: 423 NGWK--CKNCRVCAEC 436


>gi|12407441|gb|AAG53509.1|AF220136_1 tripartite motif protein TRIM33 alpha [Homo sapiens]
 gi|119577003|gb|EAW56599.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
 gi|119577005|gb|EAW56601.1| tripartite motif-containing 33, isoform CRA_a [Homo sapiens]
 gi|168273162|dbj|BAG10420.1| tripartite motif-containing protein 33 [synthetic construct]
          Length = 1127

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|402855752|ref|XP_003892479.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Papio
           anubis]
          Length = 1110

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|383419057|gb|AFH32742.1| E3 ubiquitin-protein ligase TRIM33 isoform beta [Macaca mulatta]
          Length = 1110

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|5689563|dbj|BAA83065.1| KIAA1113 protein [Homo sapiens]
          Length = 1131

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 881 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 916


>gi|332809942|ref|XP_003308352.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Pan
           troglodytes]
 gi|410219028|gb|JAA06733.1| tripartite motif containing 33 [Pan troglodytes]
 gi|410250340|gb|JAA13137.1| tripartite motif containing 33 [Pan troglodytes]
 gi|410297420|gb|JAA27310.1| tripartite motif containing 33 [Pan troglodytes]
          Length = 1110

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|297603635|ref|NP_001054361.2| Os04g0691700 [Oryza sativa Japonica Group]
 gi|215741180|dbj|BAG97675.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255675919|dbj|BAF16275.2| Os04g0691700 [Oryza sativa Japonica Group]
          Length = 385

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 22/104 (21%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           D  C FC +GG+LLCC+  C  ++H T  + +E    S             ++C NC   
Sbjct: 5   DDTCGFCGDGGELLCCD-NCPSTYHQTCLSDQELPEGS-------------WYCHNC--- 47

Query: 287 QHQCFACGK-LGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLL 329
              C +CG  L   +  T + +  C+   CG  YH  C+ + +L
Sbjct: 48  --TCRSCGNPLSEKEVSTFSAILKCLQ--CGDSYHDTCIDQEML 87


>gi|195382495|ref|XP_002049965.1| GJ21880 [Drosophila virilis]
 gi|194144762|gb|EDW61158.1| GJ21880 [Drosophila virilis]
          Length = 1458

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 63/166 (37%), Gaps = 29/166 (17%)

Query: 230 CSFCDNGGDLLCCEGRCLRSFH--ATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 287
           CSFC   G  + C+  C +  H   +  AG      S  +   E  + +   C +     
Sbjct: 150 CSFCGQYGASITCKMNCRQVHHWPCSSAAGCLLILESFTVFCTEHLSQVPIICTD---NN 206

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
            +C  C  LG   K     +  C S  CG  +H  C+    L D                
Sbjct: 207 VECLTCSSLGDLSK-----LIMCCS--CGDHFHSTCIGLANLPDTR-------------S 246

Query: 348 SFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIA 392
            ++C    KC IC+Q E    +D++F  C +C K YH  CL   I+
Sbjct: 247 GWSCARCTKCQICRQHET---NDIKFIKCEQCQKMYHAMCLRPTIS 289


>gi|74027251|ref|NP_148980.2| E3 ubiquitin-protein ligase TRIM33 isoform beta [Homo sapiens]
          Length = 1110

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|297279627|ref|XP_002801759.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33 isoform 2 [Macaca
           mulatta]
          Length = 1110

 Score = 42.7 bits (99), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|171686528|ref|XP_001908205.1| hypothetical protein [Podospora anserina S mat+]
 gi|170943225|emb|CAP68878.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1304

 Score = 42.7 bits (99), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 13/74 (17%)

Query: 211 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKD 270
           V + +++ +N   +  D  C+ C + G+L+CC+G C RSFH         +C    L +D
Sbjct: 841 VGNPQKEVTNHAQDDNDDYCASCSSNGELICCDG-CTRSFHF--------NCVDPPLARD 891

Query: 271 EVEAMLN-FFCKNC 283
              AM N +FC  C
Sbjct: 892 ---AMPNEWFCNVC 902


>gi|12407443|gb|AAG53510.1|AF220137_1 tripartite motif protein TRIM33 beta [Homo sapiens]
 gi|119577004|gb|EAW56600.1| tripartite motif-containing 33, isoform CRA_b [Homo sapiens]
          Length = 1110

 Score = 42.7 bits (99), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|380797829|gb|AFE70790.1| E3 ubiquitin-protein ligase TRIM33 isoform alpha, partial [Macaca
           mulatta]
          Length = 1044

 Score = 42.7 bits (99), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 794 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 829


>gi|441636827|ref|XP_003268060.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Nomascus leucogenys]
          Length = 1041

 Score = 42.7 bits (99), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 791 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 826


>gi|380797797|gb|AFE70774.1| E3 ubiquitin-protein ligase TRIM33 isoform beta, partial [Macaca
           mulatta]
          Length = 1027

 Score = 42.7 bits (99), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 794 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 829


>gi|367036601|ref|XP_003648681.1| hypothetical protein THITE_2106426 [Thielavia terrestris NRRL 8126]
 gi|346995942|gb|AEO62345.1| hypothetical protein THITE_2106426 [Thielavia terrestris NRRL 8126]
          Length = 1698

 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 46/109 (42%), Gaps = 14/109 (12%)

Query: 281 KNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLA 340
           K+ ++KQ     CG  G +D      + PC    C + YH  C+     RD  V      
Sbjct: 353 KSYDFKQAHRLVCGSCGYADDRNKGALIPCQG--CSNSYHKACIGVRSTRDHRVTK---- 406

Query: 341 KSIIAGESFTCPLHKC-CICKQGENKADSDLQFAVCRRCPKAYHRKCLP 388
              +A ESF      C CI K+ + +A +     VC+ C  A +  C+P
Sbjct: 407 ---VAAESFVLQCRFCICIYKKKDIRAPA---HDVCQGC-HAKNPSCVP 448


>gi|297745878|emb|CBI15934.3| unnamed protein product [Vitis vinifera]
          Length = 994

 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 49/113 (43%), Gaps = 28/113 (24%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN 277
           +SNE     D +CS C  GGDL+ C+  C  SFH +           LGL   +V    +
Sbjct: 657 KSNERHHENDHICSVCHYGGDLVLCD-HCPSSFHKS----------CLGL---KVGCFGD 702

Query: 278 FFCKNCEYKQHQCFACGKLGSSDKETGAE----VFPCVSATCGHFYHPHCVSK 326
           +FC +C         CG  G +  + G+E    VF C    C   YH  C+ K
Sbjct: 703 WFCPSC--------CCGICGENKFDGGSEQDNVVFSCYQ--CERQYHVGCLRK 745


>gi|355745559|gb|EHH50184.1| hypothetical protein EGM_00970, partial [Macaca fascicularis]
          Length = 1012

 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 762 DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 797


>gi|313212234|emb|CBY36242.1| unnamed protein product [Oikopleura dioica]
          Length = 906

 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 68/164 (41%), Gaps = 35/164 (21%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFH--ATIDAG---EESHCASLGLTKDEVEAMLNFFCKNC 283
           CS+C N G  + C    C   +H    + +G   + + C+ L   +   +A L      C
Sbjct: 104 CSYCQNYGASVRCKVSNCSSFYHYPCAVASGAFQDAASCSVLCAPEHRHQARL-IDPSGC 162

Query: 284 EYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSI 343
           E+    C   G+L        +++  C   TCG   H  C+++ ++   EV A       
Sbjct: 163 EF----CSQVGEL--------SDLLFCT--TCGAHSHARCLNEGIIVTGEVRA------- 201

Query: 344 IAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
              + +TC      IC+Q   K+D D Q  +C  C K +H  CL
Sbjct: 202 -GWQCYTCK-----ICQQCR-KSDDDAQMIICETCDKGWHTYCL 238


>gi|148671129|gb|EDL03076.1| mCG113864 [Mus musculus]
          Length = 4532

 Score = 42.7 bits (99), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 42/102 (41%), Gaps = 20/102 (19%)

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
           +Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++ 
Sbjct: 527 QQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVL 569

Query: 346 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 570 SKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 608


>gi|417413221|gb|JAA52953.1| Putative transcription intermediary factor 1-alpha, partial
           [Desmodus rotundus]
          Length = 951

 Score = 42.7 bits (99), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 11/92 (11%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH        A   +GE        L+K
Sbjct: 718 EARKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 776

Query: 270 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 301
            EVE   +    N E ++ +    GKL   DK
Sbjct: 777 PEVEYDCDAPSHNSEKRKTE--GPGKLAPIDK 806


>gi|149065269|gb|EDM15345.1| rCG27932, isoform CRA_a [Rattus norvegicus]
          Length = 1050

 Score = 42.7 bits (99), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCAS--------LGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH T      +H  S          L+K
Sbjct: 817 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLTCHVPTLTHFPSGEWICTFCRDLSK 875

Query: 270 DEVE 273
            EVE
Sbjct: 876 PEVE 879


>gi|74151452|dbj|BAE38840.1| unnamed protein product [Mus musculus]
          Length = 764

 Score = 42.7 bits (99), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH T
Sbjct: 531 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 565


>gi|356541753|ref|XP_003539338.1| PREDICTED: uncharacterized protein LOC100814680 [Glycine max]
          Length = 1120

 Score = 42.7 bits (99), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 45/100 (45%), Gaps = 25/100 (25%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN--FFCKNCE 284
           D +CS C  GG+L+ C+ +C  SFH T           LGL     E + N  +FC +C 
Sbjct: 737 DYICSVCHYGGELILCD-KCPSSFHKT----------CLGL-----EDIPNGDWFCPSC- 779

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCV 324
                C  CG+      +   ++ PC+   C H YH  C+
Sbjct: 780 ----CCGICGQRKIDGDDEVGQLLPCIQ--CEHKYHVRCL 813


>gi|358375036|dbj|GAA91623.1| phosphogluconate dehydrogenase, NAD-binding [Aspergillus kawachii
           IFO 4308]
          Length = 320

 Score = 42.7 bits (99), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 9/99 (9%)

Query: 112 LIMVH---CLSYAKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDV 168
           L++ H    L+YA    EAT +  W+         E+RPS  DLVD  D +   +   D 
Sbjct: 23  LLIAHHYRVLTYAADRSEATQRRAWE------AGIELRPSMQDLVDQSDCLLSIVPPRDA 76

Query: 169 LAKSKFLVTFLEEKPTKRKLSDEVVQTKAMSGFIVDDME 207
           LA ++ +V     +PT R     V+   A+S     +ME
Sbjct: 77  LATARRIVEASSARPTVRDTPLWVIDLNAVSPKTAREME 115


>gi|334348540|ref|XP_003342075.1| PREDICTED: transcription intermediary factor 1-alpha isoform 3
           [Monodelphis domestica]
          Length = 1060

 Score = 42.4 bits (98), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           ES +ED+  +  C+ C NGG+LLCCE +C + FH T
Sbjct: 827 ESRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 861


>gi|126340851|ref|XP_001374515.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
           [Monodelphis domestica]
          Length = 1061

 Score = 42.4 bits (98), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           ES +ED+  +  C+ C NGG+LLCCE +C + FH T
Sbjct: 828 ESRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 862


>gi|10568112|gb|AAF74766.2| ALR-like protein [Homo sapiens]
          Length = 4025

 Score = 42.4 bits (98), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 18  QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 60

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 61  KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 98


>gi|332870121|ref|XP_519508.3| PREDICTED: histone-lysine N-methyltransferase MLL3 [Pan
           troglodytes]
          Length = 4026

 Score = 42.4 bits (98), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 18  QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 60

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 61  KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 98


>gi|47228685|emb|CAG07417.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 4301

 Score = 42.4 bits (98), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 37/150 (24%), Positives = 62/150 (41%), Gaps = 29/150 (19%)

Query: 247 LRSFHATIDAGEESHCASLGLTK-------DEVEAMLNFFCKNCEYK--QHQCFACGKLG 297
           L++  + IDA   S  +  G+TK       D ++  +  F    ++   Q  C  CG  G
Sbjct: 137 LKAMASCIDAVLLSMASDTGVTKEEEEEEDDTMQNTVVLFSNTDKFVLLQDMCVVCGSFG 196

Query: 298 SSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCC 357
              K +  ++  C  A C   YHP+CV      + ++   +L K       + C   +C 
Sbjct: 197 ---KGSEGQLLAC--AQCAQCYHPYCV------NSKITKTKLRK------GWRC--LECI 237

Query: 358 ICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           +C+    KA    +  +C  C  +YH  CL
Sbjct: 238 VCEMC-GKASDPSRLLLCDDCDVSYHTYCL 266


>gi|403276503|ref|XP_003929937.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Saimiri
           boliviensis boliviensis]
          Length = 4029

 Score = 42.4 bits (98), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 18  QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 60

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 61  KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 98


>gi|149065270|gb|EDM15346.1| rCG27932, isoform CRA_b [Rattus norvegicus]
          Length = 1016

 Score = 42.4 bits (98), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCAS--------LGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH T      +H  S          L+K
Sbjct: 783 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLTCHVPTLTHFPSGEWICTFCRDLSK 841

Query: 270 DEVE 273
            EVE
Sbjct: 842 PEVE 845


>gi|410644486|ref|ZP_11354967.1| hypothetical protein GAGA_0501 [Glaciecola agarilytica NO2]
 gi|410136044|dbj|GAC03366.1| hypothetical protein GAGA_0501 [Glaciecola agarilytica NO2]
          Length = 366

 Score = 42.4 bits (98), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 54/107 (50%), Gaps = 20/107 (18%)

Query: 742 GANDFSCLMKKKLDETGKNCLYKNYDI------LPAKNDFNFEKRDWMTVEPKELAPGSR 795
           G  D +  +K  L + G++C+   YDI      L  +  FN ++RD++TV+P+   P   
Sbjct: 113 GLCDLAMSLKDYLIQEGRDCILDGYDIDPLNVELSKQKGFNVQERDFLTVQPE---PIYD 169

Query: 796 LIMGLNPPFGVKAGL-----ANKFINKALEFNPKLLILIVPPETERL 837
           L++ +NPPF   A +     A++F+         +L+ ++P    RL
Sbjct: 170 LVL-MNPPFDSSAFVDHVRHASRFLTPT-----GVLVSVIPVNMPRL 210


>gi|363743208|ref|XP_418009.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Gallus gallus]
          Length = 987

 Score = 42.4 bits (98), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 25/35 (71%), Gaps = 1/35 (2%)

Query: 219 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 739 TNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 772


>gi|356546822|ref|XP_003541821.1| PREDICTED: uncharacterized protein LOC100795889 [Glycine max]
          Length = 1310

 Score = 42.4 bits (98), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 46/98 (46%), Gaps = 18/98 (18%)

Query: 221 EEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEE-------------SHCASLGL 267
           +E++  D  C  C  GG+L+CC+  C  +FH    + +E               C +L +
Sbjct: 792 DENDKNDDSCGLCGEGGELICCDN-CPSTFHLACLSTQEIPDGDWYCTNCTCRICGNLVI 850

Query: 268 TKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGA 305
            KD ++A  +  C  CE+K H+   C  L   DK+ GA
Sbjct: 851 DKDTLDAHDSLQCSQCEHKYHE--KC--LEDRDKQEGA 884


>gi|348687109|gb|EGZ26923.1| hypothetical protein PHYSODRAFT_293066 [Phytophthora sojae]
          Length = 1341

 Score = 42.4 bits (98), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 37/122 (30%), Positives = 54/122 (44%), Gaps = 28/122 (22%)

Query: 189 SDEVVQTKAMSGFIVDDMEEDMVHDTEEDESNEEDELFDS--------VCSFCDNGGDLL 240
           SDE  ++ A +G +  D     +H  EED  NE  E  D          C+ C +GG LL
Sbjct: 116 SDEDAESVATTGDVAIDTP---LHSGEED-GNESGETADDGWADHNRWYCNICKDGGQLL 171

Query: 241 CCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSD 300
           CC+ RC R+FH +           LG++ D +     ++CK C     +C    +L    
Sbjct: 172 CCD-RCPRAFHMSC----------LGMSVDMIPDS-EWYCKMC----TECLDRRRLKKES 215

Query: 301 KE 302
           KE
Sbjct: 216 KE 217


>gi|353232109|emb|CCD79464.1| putative set domain protein [Schistosoma mansoni]
          Length = 1503

 Score = 42.4 bits (98), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 38/134 (28%), Positives = 48/134 (35%), Gaps = 40/134 (29%)

Query: 303 TGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE--SFTCPLHKCCICK 360
           +G ++  C    C  +YHP C+ K             A  +  G   SFTCP H C  C 
Sbjct: 642 SGDQMIRCSVRACRRWYHPSCLRK----------PPFAVVVREGRSGSFTCPAHTCLACS 691

Query: 361 QGE--NKADSDLQFAVCRRCPKAYHRK--CLP---RKIAFEDKLEEGIITRAWEGLLPNH 413
                        F  C  CP AYH    CLP   ++IA                  PN 
Sbjct: 692 AETPGTMPRPSPHFIRCVMCPAAYHPGDWCLPAGSKEIA------------------PN- 732

Query: 414 RILIYCLKHEIDDE 427
             LI C +H + DE
Sbjct: 733 --LIICPRHALQDE 744


>gi|392347077|ref|XP_003749721.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
            norvegicus]
          Length = 4930

 Score = 42.4 bits (98), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 32/131 (24%), Positives = 55/131 (41%), Gaps = 26/131 (19%)

Query: 263  ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 316
            A +   KDE E  ++     F  + ++  HQ  C  CG  G  ++     +  C  + CG
Sbjct: 921  ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQGEE---GRLLAC--SQCG 975

Query: 317  HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 376
              YHP+CVS  + +            ++  + + C   +C +C +   KA    +  +C 
Sbjct: 976  QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 1020

Query: 377  RCPKAYHRKCL 387
             C  +YH  CL
Sbjct: 1021 DCDISYHTYCL 1031


>gi|392339743|ref|XP_003753895.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
            norvegicus]
          Length = 4931

 Score = 42.4 bits (98), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 32/131 (24%), Positives = 55/131 (41%), Gaps = 26/131 (19%)

Query: 263  ASLGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCG 316
            A +   KDE E  ++     F  + ++  HQ  C  CG  G  ++     +  C  + CG
Sbjct: 922  ADISSNKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGQGEE---GRLLAC--SQCG 976

Query: 317  HFYHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCR 376
              YHP+CVS  + +            ++  + + C   +C +C +   KA    +  +C 
Sbjct: 977  QCYHPYCVSIKITK------------VVLSKGWRC--LECTVC-EACGKATDPGRLLLCD 1021

Query: 377  RCPKAYHRKCL 387
             C  +YH  CL
Sbjct: 1022 DCDISYHTYCL 1032


>gi|345570767|gb|EGX53588.1| hypothetical protein AOL_s00006g454 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1515

 Score = 42.4 bits (98), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 35/132 (26%), Positives = 54/132 (40%), Gaps = 26/132 (19%)

Query: 230  CSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQ 289
            C  C +GGD++CC   C RS+H   D  +E H       K +    + + C      QH+
Sbjct: 1133 CQVCTDGGDIVCCSS-CPRSYH--YDCLDEEH-------KYKSNGKMQYHC-----SQHE 1177

Query: 290  CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCV----SKLLLRDDEVAADQLAKSIIA 345
            C  C +  S   + G  ++ C    C   Y   C+    +KL+   DE+   QL      
Sbjct: 1178 CHDCEQKTS---DAGNMLYRC--RFCERAYCEDCLDFDNAKLI--GDEILEYQLLDYYAI 1230

Query: 346  GESFTCPLHKCC 357
             ++F    H C 
Sbjct: 1231 PQAFYVECHNCV 1242


>gi|355561196|gb|EHH17882.1| hypothetical protein EGK_14365, partial [Macaca mulatta]
          Length = 4575

 Score = 42.4 bits (98), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 563 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 605

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 606 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 643


>gi|345782718|ref|XP_533013.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Canis lupus
           familiaris]
          Length = 1203

 Score = 42.4 bits (98), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D S+++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 953 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 988


>gi|334348538|ref|XP_003342074.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2
           [Monodelphis domestica]
          Length = 1027

 Score = 42.4 bits (98), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           ES +ED+  +  C+ C NGG+LLCCE +C + FH T
Sbjct: 794 ESRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 828


>gi|195431535|ref|XP_002063792.1| GK15714 [Drosophila willistoni]
 gi|194159877|gb|EDW74778.1| GK15714 [Drosophila willistoni]
          Length = 1503

 Score = 42.4 bits (98), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 41/173 (23%), Positives = 64/173 (36%), Gaps = 41/173 (23%)

Query: 224 ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKN- 282
           E+ +  C FC   G  + C+  C R+ H          CA+       +E+    FC   
Sbjct: 140 EVLEQKCVFCSQYGASINCKMNC-RTVH-------HYPCAAAAGCLLILES-FTVFCTEH 190

Query: 283 -------CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVA 335
                  C     +C +C  LG   K         + ++CG  +H  C+    L D    
Sbjct: 191 LSQVPLICSDNNVECLSCSSLGDLSKLI-------MCSSCGDHFHSTCIGLANLPDTR-- 241

Query: 336 ADQLAKSIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
                        + C    KC IC+Q ++   +D++F  C +C K YH  CL
Sbjct: 242 -----------SGWCCARCTKCQICRQQDS---NDIKFVKCEQCQKIYHASCL 280


>gi|357490843|ref|XP_003615709.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
 gi|355517044|gb|AES98667.1| Chromodomain helicase-DNA-binding protein [Medicago truncatula]
          Length = 1144

 Score = 42.4 bits (98), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 59/128 (46%), Gaps = 27/128 (21%)

Query: 202 IVDDMEEDMV-HDTEEDESNEEDELF----DSVCSFCDNGGDLLCCEGRCLRSFHATIDA 256
           ++D M E M  H T E      ++LF    D++CS C+ GG+L+ C+ +C  ++H     
Sbjct: 755 LLDCMMEVMQDHRTREITEKPHNDLFEGENDNICSVCNYGGELILCD-QCPSAYH----- 808

Query: 257 GEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCG 316
               +C +L    D      ++FC +C     +C  CG+    + E G     C+   C 
Sbjct: 809 ---KNCLNLEGIPDG-----DWFCPSC-----RCGICGQNKIEETEDG-HFLTCIQ--CE 852

Query: 317 HFYHPHCV 324
           H YH  C+
Sbjct: 853 HKYHVECL 860



 Score = 40.4 bits (93), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 40/160 (25%), Positives = 57/160 (35%), Gaps = 43/160 (26%)

Query: 228 SVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQ 287
           S C F  +G  LL C    ++  H T +  E+ H        D  E   +  C  C Y  
Sbjct: 744 SACIFLKDGRSLLDCMMEVMQD-HRTREITEKPH-------NDLFEGENDNICSVCNY-- 793

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
                           G E+  C    C   YH +C++               + I  G+
Sbjct: 794 ----------------GGELILC--DQCPSAYHKNCLN--------------LEGIPDGD 821

Query: 348 SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            F CP  +C IC Q + +   D  F  C +C   YH +CL
Sbjct: 822 WF-CPSCRCGICGQNKIEETEDGHFLTCIQCEHKYHVECL 860


>gi|196016261|ref|XP_002117984.1| hypothetical protein TRIADDRAFT_33351 [Trichoplax adhaerens]
 gi|190579457|gb|EDV19552.1| hypothetical protein TRIADDRAFT_33351 [Trichoplax adhaerens]
          Length = 183

 Score = 42.4 bits (98), Expect = 2.2,   Method: Composition-based stats.
 Identities = 33/114 (28%), Positives = 49/114 (42%), Gaps = 23/114 (20%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C  CG +G   K     +  C    CG  YH +CVS            +L +S+I  
Sbjct: 29  QDMCLCCGSIG---KGIDNSLIACFQ--CGQSYHHYCVSA-----------KLTRSVIV- 71

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEG 400
             + C    C +C +G  KA  + +  +C  C  +YH  CL  ++   DK+ EG
Sbjct: 72  NGWRC--LDCAVC-EGCGKAGDEDRLLLCDECDISYHTYCLNPQL---DKVPEG 119


>gi|449490172|ref|XP_002194420.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Taeniopygia guttata]
          Length = 912

 Score = 42.0 bits (97), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 25/35 (71%), Gaps = 1/35 (2%)

Query: 219 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 664 NNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 697


>gi|355680664|gb|AER96599.1| olfactory receptor family 2 subfamily AV-like protein [Mustela
           putorius furo]
          Length = 529

 Score = 42.0 bits (97), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 16/34 (47%), Positives = 24/34 (70%), Gaps = 1/34 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH 251
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH
Sbjct: 485 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFH 517


>gi|348500783|ref|XP_003437952.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Oreochromis
           niloticus]
          Length = 4872

 Score = 42.0 bits (97), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 44/197 (22%), Positives = 74/197 (37%), Gaps = 47/197 (23%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           C++C   G  + CC   C + +H          CA    T  ++ ++     ++ E   H
Sbjct: 285 CAYCKRLGASIKCCAEGCAQLYHYP--------CAGAAGTFQDIRSLSLLCPEHIELATH 336

Query: 289 Q------CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKS 342
           +      C  C   G    +   ++F C S  CG  YH  C+              +A +
Sbjct: 337 KFVDDINCVLCDSPG----DLLDQLF-CTS--CGLHYHGICLD-------------MAVT 376

Query: 343 IIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEEGII 402
            +    + CP  +C +C+  +N  + D +  VC  C K YH  CL   I       + + 
Sbjct: 377 PLRRAGWQCP--ECKVCQTCKNPGE-DTKMLVCDMCDKGYHTFCLQPVI-------DTLP 426

Query: 403 TRAWEGLLPNHRILIYC 419
           T  W     N R+ + C
Sbjct: 427 TNGWR--CQNCRVCLQC 441



 Score = 42.0 bits (97), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 49/126 (38%), Gaps = 25/126 (19%)

Query: 267  LTKDEVEAMLN---FFCKNCEY--KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHP 321
            + ++E  AM N    F  N  +  KQ  C  CG  G      GAE      A CG  YHP
Sbjct: 964  IKEEEENAMHNTVVIFSSNDSFTLKQDMCVVCGSFG-----LGAEGRLLACAQCGQCYHP 1018

Query: 322  HCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKA 381
             CV   + +            ++  + + C   +C +C+      D   +  +C  C  +
Sbjct: 1019 FCVGIKITK------------VVLSKGWRC--LECTVCEACGQATDPG-RLLLCDDCDIS 1063

Query: 382  YHRKCL 387
            YH  CL
Sbjct: 1064 YHTYCL 1069


>gi|449299559|gb|EMC95572.1| hypothetical protein BAUCODRAFT_494698 [Baudoinia compniacensis
           UAMH 10762]
          Length = 1048

 Score = 42.0 bits (97), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 31/67 (46%), Gaps = 9/67 (13%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML 276
           D+ NEE    +  C+ C   G LLCC+G C RSFH T        C    L +D  E   
Sbjct: 622 DDDNEEQSENNDFCAACGGSGYLLCCDG-CDRSFHFT--------CLDPPLNEDASELNE 672

Query: 277 NFFCKNC 283
            ++C  C
Sbjct: 673 PWYCFVC 679


>gi|119574357|gb|EAW53972.1| hCG1990594, isoform CRA_b [Homo sapiens]
          Length = 4884

 Score = 42.0 bits (97), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287  QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 957  QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 999

Query: 347  ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 1000 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1037


>gi|281208576|gb|EFA82752.1| hypothetical protein PPL_04447 [Polysphondylium pallidum PN500]
          Length = 985

 Score = 42.0 bits (97), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 42/109 (38%), Gaps = 22/109 (20%)

Query: 290 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 349
           CF C K    D +    +  C S  C   YH  C               LA +     S+
Sbjct: 771 CFLCKK--EEDPKGRTSIVQCRS--CPKIYHRSCAG-------------LAHT---PRSW 810

Query: 350 TCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 398
            CP H C  C++  N  +S   F +C+ CP +Y   CLP  I    K E
Sbjct: 811 KCPRHSCHSCRKTPN--ESGGSFFICKECPSSYCITCLPNDITILQKHE 857


>gi|426216300|ref|XP_004002403.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Ovis
           aries]
          Length = 1110

 Score = 42.0 bits (97), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D S+++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|329664784|ref|NP_001192947.1| E3 ubiquitin-protein ligase TRIM33 [Bos taurus]
 gi|296489481|tpg|DAA31594.1| TPA: tripartite motif protein TRIM33 beta-like isoform 1 [Bos
           taurus]
          Length = 1126

 Score = 42.0 bits (97), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D S+++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 876 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 911


>gi|296489482|tpg|DAA31595.1| TPA: tripartite motif protein TRIM33 beta-like isoform 2 [Bos
           taurus]
          Length = 1109

 Score = 42.0 bits (97), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D S+++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 876 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 911


>gi|426216298|ref|XP_004002402.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Ovis
           aries]
          Length = 1127

 Score = 42.0 bits (97), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D S+++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|37999865|sp|Q8BRH4.2|MLL3_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
            Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
            homolog
          Length = 4903

 Score = 42.0 bits (97), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 42/102 (41%), Gaps = 20/102 (19%)

Query: 286  KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            +Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++ 
Sbjct: 949  QQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVL 991

Query: 346  GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
             + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 992  SKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1030


>gi|124487063|ref|NP_001074852.1| histone-lysine N-methyltransferase MLL3 [Mus musculus]
          Length = 4904

 Score = 42.0 bits (97), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 42/102 (41%), Gaps = 20/102 (19%)

Query: 286  KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
            +Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++ 
Sbjct: 950  QQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVL 992

Query: 346  GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
             + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 993  SKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1031


>gi|119574356|gb|EAW53971.1| hCG1990594, isoform CRA_a [Homo sapiens]
          Length = 4911

 Score = 42.0 bits (97), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287  QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 957  QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 999

Query: 347  ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 1000 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1037


>gi|21427632|gb|AAK00583.1| MLL3 [Homo sapiens]
          Length = 4911

 Score = 42.0 bits (97), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287  QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 957  QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 999

Query: 347  ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 1000 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1037


>gi|91718902|ref|NP_733751.2| histone-lysine N-methyltransferase MLL3 [Homo sapiens]
 gi|221222521|sp|Q8NEZ4.3|MLL3_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
            Full=Homologous to ALR protein; AltName: Full=Lysine
            N-methyltransferase 2C; Short=KMT2C; AltName:
            Full=Myeloid/lymphoid or mixed-lineage leukemia protein 3
          Length = 4911

 Score = 42.0 bits (97), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287  QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 957  QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 999

Query: 347  ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 1000 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1037


>gi|410336273|gb|JAA37083.1| myeloid/lymphoid or mixed-lineage leukemia 3 [Pan troglodytes]
          Length = 4912

 Score = 42.0 bits (97), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287  QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 957  QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 999

Query: 347  ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 1000 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1037


>gi|355748156|gb|EHH52653.1| hypothetical protein EGM_13123, partial [Macaca fascicularis]
          Length = 4916

 Score = 42.0 bits (97), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 904 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 946

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 947 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 984


>gi|332243363|ref|XP_003270849.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Nomascus
           leucogenys]
          Length = 4856

 Score = 42.0 bits (97), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 889 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 931

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 932 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 969


>gi|440913222|gb|ELR62702.1| E3 ubiquitin-protein ligase TRIM33, partial [Bos grunniens mutus]
          Length = 1032

 Score = 42.0 bits (97), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D S+++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 782 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 817


>gi|431896518|gb|ELK05930.1| E3 ubiquitin-protein ligase TRIM33 [Pteropus alecto]
          Length = 1116

 Score = 42.0 bits (97), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D S+++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 823 DGSSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 858


>gi|431895734|gb|ELK05153.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
          Length = 921

 Score = 42.0 bits (97), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 51/200 (25%), Positives = 75/200 (37%), Gaps = 53/200 (26%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY--- 285
           C+FC + G  L CC+ RC + +H    AG        G  +D    + +FF    E+   
Sbjct: 220 CAFCKHLGATLRCCDERCTQMYHYPCAAG-------AGTFQD----LSHFFLLCPEHIDQ 268

Query: 286 ------KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQL 339
                 +   C  C   G        + F C   TCG  YH  C+        +VA   L
Sbjct: 269 APERSKEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCL--------DVAVTPL 313

Query: 340 AKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLEE 399
            ++      + CP  K C   Q   ++  D +  VC  C K YH  CL         + +
Sbjct: 314 KRA-----GWQCPECKVC---QNCKQSGEDSKMLVCDTCDKGYHTFCL-------QPVMK 358

Query: 400 GIITRAWEGLLPNHRILIYC 419
            + T  W+    N RI + C
Sbjct: 359 SVPTNGWK--CKNCRICVAC 376


>gi|194754301|ref|XP_001959434.1| GF12873 [Drosophila ananassae]
 gi|190620732|gb|EDV36256.1| GF12873 [Drosophila ananassae]
          Length = 1486

 Score = 42.0 bits (97), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 41/173 (23%), Positives = 62/173 (35%), Gaps = 41/173 (23%)

Query: 224 ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKN- 282
           E  +  C+FC   G  + C+ +C +  H          CA+       +E+    FC   
Sbjct: 141 EFLEQKCNFCGRYGASINCKMKCRQVHHWP--------CAAAAGCLLILES-FTVFCTEH 191

Query: 283 -------CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVA 335
                  C     +C  C  LG   K         + +TCG  +H  CV    L D    
Sbjct: 192 LSQVILICSDNNVECLTCSSLGDLSKLI-------MCSTCGDHFHSTCVGLANLPDTR-- 242

Query: 336 ADQLAKSIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
                        + C    KC IC+  ++   +DL++  C +C K YH  CL
Sbjct: 243 -----------SGWNCARCTKCQICRVQDS---NDLKYVKCEQCQKIYHASCL 281


>gi|255552191|ref|XP_002517140.1| hypothetical protein RCOM_0912170 [Ricinus communis]
 gi|223543775|gb|EEF45303.1| hypothetical protein RCOM_0912170 [Ricinus communis]
          Length = 1604

 Score = 42.0 bits (97), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 47/108 (43%), Gaps = 29/108 (26%)

Query: 223 DELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML---NFF 279
           D+  D  C  C +GGDL+CC+G C  +FH +        C  +         ML   ++ 
Sbjct: 757 DDPNDDTCGICGDGGDLICCDG-CPSTFHQS--------CLDI--------MMLPPGDWH 799

Query: 280 CKNCEYKQHQCFACGKLGSSD--KETGAEVFPCVSAT-CGHFYHPHCV 324
           C NC      C  CG + S D  +E G  V   ++ + C   YH  C+
Sbjct: 800 CPNC-----TCKFCG-IASEDFVQEDGTNVSELLTCSLCAKKYHKSCL 841


>gi|301756811|ref|XP_002914259.1| PREDICTED: transcription intermediary factor 1-alpha-like
           [Ailuropoda melanoleuca]
          Length = 1118

 Score = 42.0 bits (97), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH        A   +GE        L+K
Sbjct: 885 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 943

Query: 270 DEVE 273
            EVE
Sbjct: 944 PEVE 947


>gi|297289715|ref|XP_001107669.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Macaca
           mulatta]
          Length = 4785

 Score = 42.0 bits (97), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 873 QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 915

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 916 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 953


>gi|24762433|ref|NP_611847.2| lost PHDs of trr [Drosophila melanogaster]
 gi|21626677|gb|AAF47094.2| lost PHDs of trr [Drosophila melanogaster]
 gi|85861118|gb|ABC86508.1| HL01030p [Drosophila melanogaster]
          Length = 1482

 Score = 42.0 bits (97), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 40/173 (23%), Positives = 62/173 (35%), Gaps = 41/173 (23%)

Query: 224 ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKN- 282
           E  +  C+FC   G  + C+  C +  H          CA+       +E+    FC   
Sbjct: 141 EFLEQKCNFCGRYGASINCKMNCRQVHHWP--------CAAAAGCLLILES-FTVFCTEH 191

Query: 283 -------CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVA 335
                  C     +C +C  LG   K         + +TCG  +H  C+    L D    
Sbjct: 192 LSQVPVICSDNNVECLSCSSLGDLSKLI-------MCSTCGDHFHSTCIGLANLPDTR-- 242

Query: 336 ADQLAKSIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
                        + C    KC IC+Q ++   +D ++  C +C K YH  CL
Sbjct: 243 -----------SGWNCARCTKCQICRQQDS---NDTKYVKCEQCQKTYHASCL 281


>gi|355733003|gb|AES10880.1| myeloid/lymphoid or mixed-lineage leukemia 3 isoform 2 [Mustela
           putorius furo]
          Length = 102

 Score = 42.0 bits (97), Expect = 2.9,   Method: Composition-based stats.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 6   QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 48

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 49  KGWRC--LECTVC-EACGKASDPGRLLLCDDCDISYHTYCL 86


>gi|297474056|ref|XP_002687040.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1 [Bos
           taurus]
 gi|296488233|tpg|DAA30346.1| TPA: tripartite motif-containing 24 isoform 1 [Bos taurus]
          Length = 1050

 Score = 41.6 bits (96), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 11/92 (11%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E  +ED+  +  C+ C NGG+LLCCE +C + FH        A   +GE        L+K
Sbjct: 817 EMRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 875

Query: 270 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 301
            EVE   +    N E ++ + F   KL   DK
Sbjct: 876 PEVEYDCDVPVHNSEKRKTEGFV--KLTPIDK 905


>gi|296210171|ref|XP_002751860.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Callithrix
            jacchus]
          Length = 4909

 Score = 41.6 bits (96), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287  QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 951  QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 993

Query: 347  ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 994  KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1031


>gi|358411974|ref|XP_617403.5| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
           1-alpha [Bos taurus]
          Length = 1047

 Score = 41.6 bits (96), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 11/92 (11%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E  +ED+  +  C+ C NGG+LLCCE +C + FH        A   +GE        L+K
Sbjct: 814 EMRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 872

Query: 270 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 301
            EVE   +    N E ++ + F   KL   DK
Sbjct: 873 PEVEYDCDVPVHNSEKRKTEGFV--KLTPIDK 902


>gi|395838450|ref|XP_003792128.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Otolemur
            garnettii]
          Length = 4945

 Score = 41.6 bits (96), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 41/101 (40%), Gaps = 20/101 (19%)

Query: 287  QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
            Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++  
Sbjct: 994  QDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVLS 1036

Query: 347  ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 1037 KGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 1074


>gi|342320729|gb|EGU12668.1| ISWI chromatin-remodeling complex ATPase ISW2 [Rhodotorula glutinis
            ATCC 204091]
          Length = 1120

 Score = 41.6 bits (96), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 31/118 (26%), Positives = 54/118 (45%), Gaps = 23/118 (19%)

Query: 229  VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
             C  C +GG++  C G C R++HA         C+  G++K ++ A + ++C     +QH
Sbjct: 979  ACLTCGDGGEIYLCSG-CPRAYHA-------GDCS--GMSKRDLAATVQYYC-----QQH 1023

Query: 289  QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
             C  C +   + +  G  +F C   TC   +   C   L   D EV  + L + ++ G
Sbjct: 1024 TCSVCLR---NTQAAGGLLFRC--QTCPAAF---CEDDLPTTDLEVVGEVLPEFLLLG 1073


>gi|149747791|ref|XP_001497035.1| PREDICTED: transcription intermediary factor 1-alpha [Equus
           caballus]
          Length = 942

 Score = 41.6 bits (96), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH        A   +GE        L+K
Sbjct: 709 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 767

Query: 270 DEVE 273
            EVE
Sbjct: 768 PEVE 771


>gi|198456152|ref|XP_001360232.2| GA18992 [Drosophila pseudoobscura pseudoobscura]
 gi|198135513|gb|EAL24806.2| GA18992 [Drosophila pseudoobscura pseudoobscura]
          Length = 1486

 Score = 41.6 bits (96), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 40/171 (23%), Positives = 62/171 (36%), Gaps = 29/171 (16%)

Query: 220 NEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--ATIDAGEESHCASLGLTKDEVEAMLN 277
           N   E     C+FC   G  + C+  C +  H      AG      S  +   E  + + 
Sbjct: 136 NHIGEFLKQKCNFCGRYGASINCKMNCRQVHHWPCAAAAGCLLILESFTVFCTEHLSQVP 195

Query: 278 FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 337
             C +      +C +C  LG   K         + +TCG  +H  C+    L D      
Sbjct: 196 LICSD---NNVECLSCSSLGDLSKLI-------MCSTCGDHFHSTCIGLANLPDTR---- 241

Query: 338 QLAKSIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
                      + C    KC IC+Q ++   +DL++  C +C K YH  C 
Sbjct: 242 ---------SGWNCARCTKCQICRQQDS---NDLKYVKCEQCQKIYHASCF 280


>gi|359321453|ref|XP_003639599.1| PREDICTED: transcription intermediary factor 1-alpha-like isoform 1
           [Canis lupus familiaris]
          Length = 961

 Score = 41.6 bits (96), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH        A   +GE        L+K
Sbjct: 728 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 786

Query: 270 DEVE 273
            EVE
Sbjct: 787 PEVE 790


>gi|359477348|ref|XP_002278432.2| PREDICTED: uncharacterized protein LOC100247619 [Vitis vinifera]
          Length = 1547

 Score = 41.6 bits (96), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 41/97 (42%), Gaps = 20/97 (20%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
            C  C +GGDL+CC+G C  +FH +        C ++ +         ++ C NC     
Sbjct: 669 TCGICGDGGDLICCDG-CPSTFHQS--------CLNIQMLPSG-----DWHCPNC----- 709

Query: 289 QCFACGKLGSSDKETGAEVFPCVSAT-CGHFYHPHCV 324
            C  CG    S+ E    V   V+ + C   YH  C+
Sbjct: 710 TCKFCGMADGSNAEDDTTVSELVTCSLCEKKYHTSCI 746


>gi|356541759|ref|XP_003539341.1| PREDICTED: uncharacterized protein LOC100818931 [Glycine max]
          Length = 1218

 Score = 41.6 bits (96), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 45/100 (45%), Gaps = 25/100 (25%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN--FFCKNCE 284
           D +CS C  GG+L+ C+ +C  SFH T           LGL     E + N  +FC +C 
Sbjct: 641 DYICSVCHYGGELILCD-KCPSSFHKT----------CLGL-----EDIPNGDWFCPSC- 683

Query: 285 YKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCV 324
                C  CG+      +   ++ PC+   C H YH  C+
Sbjct: 684 ----CCGICGQRKIDRDDEVEQLLPCIQ--CEHKYHVRCL 717


>gi|449271665|gb|EMC81949.1| E3 ubiquitin-protein ligase TRIM33 [Columba livia]
          Length = 878

 Score = 41.6 bits (96), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 25/35 (71%), Gaps = 1/35 (2%)

Query: 219 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 647 NNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 680


>gi|426228499|ref|XP_004008341.1| PREDICTED: transcription intermediary factor 1-alpha [Ovis aries]
          Length = 944

 Score = 41.6 bits (96), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH        A   +GE        L+K
Sbjct: 711 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 769

Query: 270 DEVE 273
            EVE
Sbjct: 770 PEVE 773


>gi|281340137|gb|EFB15721.1| hypothetical protein PANDA_002125 [Ailuropoda melanoleuca]
          Length = 1000

 Score = 41.6 bits (96), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH        A   +GE        L+K
Sbjct: 767 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 825

Query: 270 DEVE 273
            EVE
Sbjct: 826 PEVE 829


>gi|19343832|gb|AAH25482.1| Trim24 protein [Mus musculus]
          Length = 516

 Score = 41.6 bits (96), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH T
Sbjct: 283 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 317


>gi|301115432|ref|XP_002905445.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110234|gb|EEY68286.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1038

 Score = 41.6 bits (96), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 8/55 (14%)

Query: 204 DDMEEDMVHDTEEDESNEEDELFDS-------VCSFCDNGGDLLCCEGRCLRSFH 251
           D+  ED     E+DE  E+D   DS        C  C   GDL+CC+ +C RS+H
Sbjct: 828 DEETEDESFRQEDDEGQEDDSGSDSELDANLDFCEVCQGAGDLVCCD-KCPRSYH 881


>gi|195149375|ref|XP_002015633.1| GL11176 [Drosophila persimilis]
 gi|194109480|gb|EDW31523.1| GL11176 [Drosophila persimilis]
          Length = 1486

 Score = 41.6 bits (96), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 40/171 (23%), Positives = 62/171 (36%), Gaps = 29/171 (16%)

Query: 220 NEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--ATIDAGEESHCASLGLTKDEVEAMLN 277
           N   E     C+FC   G  + C+  C +  H      AG      S  +   E  + + 
Sbjct: 136 NHIGEFLKQKCNFCGRYGASINCKMNCRQVHHWPCAAAAGCLLILESFTVFCTEHLSQVP 195

Query: 278 FFCKNCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAAD 337
             C +      +C +C  LG   K         + +TCG  +H  C+    L D      
Sbjct: 196 LICSD---NNVECLSCSSLGDLSKLI-------MCSTCGDHFHSTCIGLANLPDTR---- 241

Query: 338 QLAKSIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
                      + C    KC IC+Q ++   +DL++  C +C K YH  C 
Sbjct: 242 ---------SGWNCARCTKCQICRQQDS---NDLKYVKCEQCQKIYHASCF 280


>gi|349603841|gb|AEP99562.1| Transcription intermediary factor 1-alpha-like protein, partial
           [Equus caballus]
          Length = 412

 Score = 41.6 bits (96), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 11/92 (11%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH        A   +GE        L+K
Sbjct: 179 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 237

Query: 270 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 301
            EVE   +    N E K+ +     KL   DK
Sbjct: 238 PEVEYDCDAPSHNSEKKKTEGLV--KLTPIDK 267


>gi|449456166|ref|XP_004145821.1| PREDICTED: uncharacterized protein LOC101214170 [Cucumis sativus]
          Length = 972

 Score = 41.6 bits (96), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 17/25 (68%), Positives = 20/25 (80%), Gaps = 1/25 (4%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFH 251
           D +CS C +GGDLLCC+G C RSFH
Sbjct: 600 DDLCSICADGGDLLCCDG-CPRSFH 623


>gi|449496288|ref|XP_004160094.1| PREDICTED: uncharacterized LOC101214170 [Cucumis sativus]
          Length = 972

 Score = 41.2 bits (95), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 17/25 (68%), Positives = 20/25 (80%), Gaps = 1/25 (4%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFH 251
           D +CS C +GGDLLCC+G C RSFH
Sbjct: 600 DDLCSICADGGDLLCCDG-CPRSFH 623


>gi|94420998|ref|NP_659542.3| transcription intermediary factor 1-alpha isoform 1 [Mus musculus]
 gi|6175058|sp|Q64127.1|TIF1A_MOUSE RecName: Full=Transcription intermediary factor 1-alpha;
           Short=TIF1-alpha; AltName: Full=E3 ubiquitin-protein
           ligase Trim24; AltName: Full=Tripartite motif-containing
           protein 24
 gi|998815|gb|AAB34290.1| TIF1 isoform [Mus sp.]
 gi|34785214|gb|AAH56959.1| Tripartite motif-containing 24 [Mus musculus]
 gi|148681701|gb|EDL13648.1| tripartite motif protein 24, isoform CRA_a [Mus musculus]
          Length = 1051

 Score = 41.2 bits (95), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH T
Sbjct: 818 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 852


>gi|195489371|ref|XP_002092710.1| GE14338 [Drosophila yakuba]
 gi|194178811|gb|EDW92422.1| GE14338 [Drosophila yakuba]
          Length = 1481

 Score = 41.2 bits (95), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 40/173 (23%), Positives = 62/173 (35%), Gaps = 41/173 (23%)

Query: 224 ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKN- 282
           E  +  C+FC   G  + C+  C +  H          CA+       +E+    FC   
Sbjct: 141 EFLEQKCNFCGRYGASINCKMNCRQVHHWP--------CAAAAGCLLILES-FTVFCTEH 191

Query: 283 -------CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVA 335
                  C     +C +C  LG   K         + +TCG  +H  C+    L D    
Sbjct: 192 LSQVPVICSDNNVECLSCSSLGDLSKLI-------MCSTCGDHFHSTCIGLANLPDTR-- 242

Query: 336 ADQLAKSIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
                        + C    KC IC+Q ++   +D ++  C +C K YH  CL
Sbjct: 243 -----------SGWNCARCTKCQICRQQDS---NDTKYVKCEQCQKIYHASCL 281


>gi|443696185|gb|ELT96956.1| hypothetical protein CAPTEDRAFT_106029, partial [Capitella teleta]
          Length = 175

 Score = 41.2 bits (95), Expect = 4.0,   Method: Composition-based stats.
 Identities = 28/101 (27%), Positives = 44/101 (43%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C +CG LG+++     E    V + CG  YHP+C +  L R            II  
Sbjct: 24  QDLCVSCGSLGANE-----ESRLIVCSQCGQCYHPYCANVKLSR------------IILE 66

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C    C +C +G  +   + +  +C  C  +YH  CL
Sbjct: 67  KGWRC--LDCTVC-EGCGRPHDESRLILCDECDISYHIYCL 104


>gi|410989866|ref|XP_004001479.1| PREDICTED: LOW QUALITY PROTEIN: E3 ubiquitin-protein ligase TRIM33
           [Felis catus]
          Length = 1211

 Score = 41.2 bits (95), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 961 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 996


>gi|328868864|gb|EGG17242.1| hypothetical protein DFA_08232 [Dictyostelium fasciculatum]
          Length = 1177

 Score = 41.2 bits (95), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 28/109 (25%), Positives = 44/109 (40%), Gaps = 21/109 (19%)

Query: 290  CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 349
            CF C      D++    +  C S  C   YH  C               L+ +     S+
Sbjct: 967  CFTCK--AEEDEKGRTSIVQCRS--CPKIYHRSCAG-------------LSHT---PRSW 1006

Query: 350  TCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 398
             CP H C  C++  N++ +   F +C+ CP ++   CLP  I   +K E
Sbjct: 1007 KCPRHSCSKCRKPSNESGTG-SFFICKGCPSSFCINCLPEDIIILEKHE 1054


>gi|74221665|dbj|BAE21530.1| unnamed protein product [Mus musculus]
          Length = 981

 Score = 41.2 bits (95), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH T
Sbjct: 748 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 782


>gi|290999391|ref|XP_002682263.1| cytosine-5 DNA methyltransferase [Naegleria gruberi]
 gi|284095890|gb|EFC49519.1| cytosine-5 DNA methyltransferase [Naegleria gruberi]
          Length = 1527

 Score = 41.2 bits (95), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 57/121 (47%), Gaps = 16/121 (13%)

Query: 63  GLQKIFKPVIAWKFDLTNVIPEIFVL----SKENSWIKLQKPRKCYEEI----YRTILIM 114
           G + +F P+  W  +    +P + +L    SK+  WI+L  P K Y  I    Y+TI + 
Sbjct: 429 GSRSVFGPIKEWAIEY-RFLPVVSILVETSSKDTEWIELISPTKEYFHIFNSTYQTIAVC 487

Query: 115 VHCLSYAKRNPEATAKSIWDFLSRFFCLYEVRPSQNDLVDHMDLIKEALERDDVLAKSKF 174
              ++  +R+P+ + +SI + L +    +   P+  +        K  +  D +L KSKF
Sbjct: 488 HALINLVQRHPKESYESIIEMLKKAQLAFIYHPTIRN-------TKPIVSEDYILEKSKF 540

Query: 175 L 175
           +
Sbjct: 541 I 541


>gi|124513208|ref|XP_001349960.1| SET domain protein, putative [Plasmodium falciparum 3D7]
 gi|23615377|emb|CAD52368.1| SET domain protein, putative [Plasmodium falciparum 3D7]
          Length = 2548

 Score = 41.2 bits (95), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 21/71 (29%), Positives = 36/71 (50%), Gaps = 6/71 (8%)

Query: 325  SKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGE------NKADSDLQFAVCRRC 378
            +KL    +E   ++  + II  + F CPLH C +CK+ +      +K +       C +C
Sbjct: 1099 NKLFNGKEETLNNENDEKIIVLKKFICPLHICYVCKEFDINNTESSKKELKNNLFRCIKC 1158

Query: 379  PKAYHRKCLPR 389
             K+ HRKC+ +
Sbjct: 1159 YKSVHRKCMNQ 1169


>gi|297474058|ref|XP_002687041.1| PREDICTED: transcription intermediary factor 1-alpha isoform 2 [Bos
           taurus]
 gi|296488234|tpg|DAA30347.1| TPA: tripartite motif-containing 24 isoform 2 [Bos taurus]
          Length = 1016

 Score = 41.2 bits (95), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 11/92 (11%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E  +ED+  +  C+ C NGG+LLCCE +C + FH        A   +GE        L+K
Sbjct: 783 EMRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 841

Query: 270 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 301
            EVE   +    N E ++ + F   KL   DK
Sbjct: 842 PEVEYDCDVPVHNSEKRKTEGFV--KLTPIDK 871


>gi|47458930|ref|YP_015792.1| putative type II DNA methylase protein [Mycoplasma mobile 163K]
 gi|47458258|gb|AAT27581.1| putative type II DNA methylase protein [Mycoplasma mobile 163K]
          Length = 309

 Score = 41.2 bits (95), Expect = 4.3,   Method: Composition-based stats.
 Identities = 26/69 (37%), Positives = 41/69 (59%), Gaps = 3/69 (4%)

Query: 764 KNYDILPAKNDFNFEKRDWMTVEPKELAPGSRLIMGL-NPPFGVKAGLANKFINKALEFN 822
           + +DI P KN+   EK+D+  V   E+    +  + + NPPFG K  LA KF+NK+LE +
Sbjct: 65  RAFDIDP-KNNSIIEKKDFFDVNFNEMNLNKKKTVTIGNPPFGKKGDLALKFLNKSLE-H 122

Query: 823 PKLLILIVP 831
             ++  I+P
Sbjct: 123 SGIVAFILP 131


>gi|194885797|ref|XP_001976493.1| GG22900 [Drosophila erecta]
 gi|190659680|gb|EDV56893.1| GG22900 [Drosophila erecta]
          Length = 1481

 Score = 41.2 bits (95), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 40/173 (23%), Positives = 62/173 (35%), Gaps = 41/173 (23%)

Query: 224 ELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKN- 282
           E  +  C+FC   G  + C+  C +  H          CA+       +E+    FC   
Sbjct: 141 EFLEQKCNFCGRFGASINCKMNCRQVHHWP--------CAAAAGCLLILES-FTVFCTEH 191

Query: 283 -------CEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVA 335
                  C     +C +C  LG   K         + +TCG  +H  C+    L D    
Sbjct: 192 LSQVPVICSDNNVECLSCSSLGDLSKLI-------MCSTCGDHFHSTCIGLANLPDTR-- 242

Query: 336 ADQLAKSIIAGESFTCP-LHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
                        + C    KC IC+Q ++   +D ++  C +C K YH  CL
Sbjct: 243 -----------SGWNCARCTKCQICRQQDS---NDTKYVKCEQCQKIYHASCL 281


>gi|410953041|ref|XP_003983185.1| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
            1-alpha [Felis catus]
          Length = 1236

 Score = 41.2 bits (95), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 33/62 (53%), Gaps = 1/62 (1%)

Query: 218  ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLN 277
            E+ +ED+  +  C+ C NGG+LLCCE +C + FH +      ++     +   + E +L 
Sbjct: 1041 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPRWIIVMQKCERLLL 1099

Query: 278  FF 279
            F 
Sbjct: 1100 FL 1101


>gi|359324309|ref|XP_003640336.1| PREDICTED: transcription intermediary factor 1-alpha-like, partial
           [Canis lupus familiaris]
          Length = 626

 Score = 41.2 bits (95), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH +
Sbjct: 485 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 519


>gi|440918697|ref|NP_001259005.1| transcription intermediary factor 1-alpha isoform 3 [Mus musculus]
          Length = 981

 Score = 41.2 bits (95), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH T
Sbjct: 748 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 782


>gi|440918679|ref|NP_001258993.1| transcription intermediary factor 1-alpha isoform 2 [Mus musculus]
 gi|998813|gb|AAB34289.1| TIF1 [Mus sp.]
 gi|148681703|gb|EDL13650.1| tripartite motif protein 24, isoform CRA_c [Mus musculus]
          Length = 1017

 Score = 41.2 bits (95), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH T
Sbjct: 784 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 818


>gi|358398564|gb|EHK47915.1| hypothetical protein TRIATDRAFT_129087 [Trichoderma atroviride IMI
           206040]
          Length = 1065

 Score = 41.2 bits (95), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 32/63 (50%), Gaps = 14/63 (22%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY 285
           D  CS C N GD++CC+G C RSFH   +D  +  H        DE      ++C  C +
Sbjct: 633 DEYCSACGNTGDVVCCDG-CPRSFHFECVDMVQSDHLP------DE------WYCNECSF 679

Query: 286 KQH 288
           +++
Sbjct: 680 RRY 682


>gi|268574556|ref|XP_002642257.1| C. briggsae CBR-SET-16 protein [Caenorhabditis briggsae]
          Length = 2526

 Score = 41.2 bits (95), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 37/122 (30%), Positives = 49/122 (40%), Gaps = 22/122 (18%)

Query: 270 DEVE-AMLNFFC-KNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVS 325
           DEVE   +   C KN E+ Q    C  CG +G      G E      A C   YH +CV+
Sbjct: 419 DEVEYTKIPVVCRKNDEFLQKTPLCLVCGSIGK-----GPEASMVSCANCSQTYHTYCVT 473

Query: 326 KLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRK 385
              L D          S I G  + C    C IC+   N  D + +  +C  C  +YH  
Sbjct: 474 ---LHDK-------MNSAILGRGWRC--LDCTICEGCGNGGDEE-KLLLCDECDVSYHVY 520

Query: 386 CL 387
           C+
Sbjct: 521 CM 522


>gi|297737048|emb|CBI26249.3| unnamed protein product [Vitis vinifera]
          Length = 1264

 Score = 41.2 bits (95), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 41/97 (42%), Gaps = 20/97 (20%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
            C  C +GGDL+CC+G C  +FH +        C ++ +         ++ C NC     
Sbjct: 501 TCGICGDGGDLICCDG-CPSTFHQS--------CLNIQMLPSG-----DWHCPNC----- 541

Query: 289 QCFACGKLGSSDKETGAEVFPCVSAT-CGHFYHPHCV 324
            C  CG    S+ E    V   V+ + C   YH  C+
Sbjct: 542 TCKFCGMADGSNAEDDTTVSELVTCSLCEKKYHTSCI 578


>gi|432103993|gb|ELK30826.1| E3 ubiquitin-protein ligase TRIM33 [Myotis davidii]
          Length = 999

 Score = 41.2 bits (95), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 749 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 784


>gi|327274410|ref|XP_003221970.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           MLL3-like [Anolis carolinensis]
          Length = 4817

 Score = 41.2 bits (95), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 33/129 (25%), Positives = 53/129 (41%), Gaps = 26/129 (20%)

Query: 265 LGLTKDEVEAMLN----FFCKNCEYKQHQ--CFACGKLGSSDKETGAEVFPCVSATCGHF 318
           +   KDE E  ++     F  + ++  HQ  C  CG  G      GAE      + CG  
Sbjct: 881 ISYIKDEEENSMHNTVVLFSSSDKFTLHQDMCVVCGSFGK-----GAEGRLLSCSQCGQC 935

Query: 319 YHPHCVSKLLLRDDEVAADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRC 378
           YHP+CVS  + +            ++  + + C   +C +C +   KA    +  +C  C
Sbjct: 936 YHPYCVSIKITK------------VVLHKGWRC--LECTVC-EACGKATDPGRLLLCDDC 980

Query: 379 PKAYHRKCL 387
             +YH  CL
Sbjct: 981 DISYHTYCL 989


>gi|10432686|dbj|BAB13834.1| unnamed protein product [Homo sapiens]
          Length = 120

 Score = 40.8 bits (94), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 72  DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 107


>gi|9758217|dbj|BAB08573.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1188

 Score = 40.8 bits (94), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 45/102 (44%), Gaps = 23/102 (22%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           D  C  C +GGDL+CC+G C  +FH +           L + K    A   ++C NC   
Sbjct: 650 DDTCGICGDGGDLICCDG-CPSTFHQSC----------LDIKKFPSGA---WYCYNC--- 692

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSAT--CGHFYHPHCVSK 326
              C  C K  ++  ET     P +S+   C   YH  C+++
Sbjct: 693 --SCKFCEKDEAAKHETS--TLPSLSSCRLCEEKYHQACINQ 730


>gi|332023034|gb|EGI63299.1| Histone-lysine N-methyltransferase trr [Acromyrmex echinatior]
          Length = 3474

 Score = 40.8 bits (94), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 42/161 (26%), Positives = 61/161 (37%), Gaps = 30/161 (18%)

Query: 230 CSFCDNGGDLLCCE-GRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
           C++C + G  + C+   C R FH    A       S  L           FC      Q 
Sbjct: 308 CAYCSHYGAGISCKVTSCNRYFHFPCAAASSCFQDSKNLA---------LFCSQ-HLGQV 357

Query: 289 QCFACGKLGSSDKETGAEVFPCV-SATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG- 346
               CG++         +V   V  + CG  YH  CV   LL             + AG 
Sbjct: 358 PLLLCGEVTCVQCYGMGDVSNLVMCSVCGQHYHGSCVGLALL-----------PGVRAGW 406

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           +  +C +  C +C+Q E+ +    +  +C RC KAYH  CL
Sbjct: 407 QCVSCRV--CQVCRQPEDVS----KVMLCERCDKAYHPGCL 441


>gi|359321455|ref|XP_852147.3| PREDICTED: transcription intermediary factor 1-alpha-like isoform 2
           [Canis lupus familiaris]
          Length = 1018

 Score = 40.8 bits (94), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 9/64 (14%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH        A   +GE        L+K
Sbjct: 785 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 843

Query: 270 DEVE 273
            EVE
Sbjct: 844 PEVE 847


>gi|34610109|gb|AAN11291.1| mixed-lineage leukemia 3 protein [Mus musculus]
          Length = 3396

 Score = 40.8 bits (94), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 42/102 (41%), Gaps = 20/102 (19%)

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
           +Q  C  CG  G      GAE      + CG  YHP+CVS  + +            ++ 
Sbjct: 842 QQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPYCVSIKITK------------VVL 884

Query: 346 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            + + C   +C +C +   KA    +  +C  C  +YH  CL
Sbjct: 885 SKGWRC--LECTVC-EACGKATDPGRLLLCDDCDISYHTYCL 923


>gi|47209254|emb|CAF91993.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 525

 Score = 40.8 bits (94), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 7/71 (9%)

Query: 215 EEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEE------SHCASLGLT 268
           +E+ + +ED+  +  C+ C NGGDLLCC+ RC + FH      E        H  S+G  
Sbjct: 320 KENCAGKEDDPNEDWCAVCINGGDLLCCD-RCPKVFHMKCHRCERLLLYVFCHELSVGFR 378

Query: 269 KDEVEAMLNFF 279
           K    ++ N++
Sbjct: 379 KPVSSSVPNYY 389


>gi|440796543|gb|ELR17652.1| AT Hook plus PHD finger transcription factor family member (athp1),
           putative, partial [Acanthamoeba castellanii str. Neff]
          Length = 221

 Score = 40.8 bits (94), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 33/60 (55%), Gaps = 13/60 (21%)

Query: 225 LFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHC-ASLGLTKDEVEAMLNFFCKNC 283
           ++  VC  CDNGGDL+CC+          +  GE  +C A LGLT   VE + +++C  C
Sbjct: 62  VWKDVCWRCDNGGDLICCD----------LCPGE--YCFACLGLTHGSVEKIEDWYCGRC 109


>gi|338725560|ref|XP_001495926.3| PREDICTED: e3 ubiquitin-protein ligase TRIM33 [Equus caballus]
          Length = 1135

 Score = 40.8 bits (94), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 884 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 919


>gi|47228511|emb|CAG05331.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 3691

 Score = 40.8 bits (94), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 41/102 (40%), Gaps = 20/102 (19%)

Query: 286 KQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIA 345
           KQ  C  CG  G      GAE      + CG  YHP CV+  + R            ++ 
Sbjct: 148 KQDMCVVCGSFGQ-----GAEGRLLACSQCGQCYHPFCVNVKMTR------------VVL 190

Query: 346 GESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            + + C   +C +C+     +D   +  +C  C  +YH  CL
Sbjct: 191 TKGWRC--LECTVCEACGEASDPG-RLLLCDDCDISYHTYCL 229


>gi|56404945|sp|Q99PP7.2|TRI33_MOUSE RecName: Full=E3 ubiquitin-protein ligase TRIM33; AltName:
           Full=Ectodermin homolog; AltName: Full=Transcription
           intermediary factor 1-gamma; Short=TIF1-gamma; AltName:
           Full=Tripartite motif-containing protein 33
 gi|41763896|gb|AAS10352.1| transcriptional intermediary factor 1 gamma [Mus musculus]
          Length = 1142

 Score = 40.8 bits (94), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 892 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 927


>gi|119637828|ref|NP_444400.2| E3 ubiquitin-protein ligase TRIM33 isoform 1 [Mus musculus]
          Length = 1140

 Score = 40.8 bits (94), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 890 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 925


>gi|109467304|ref|XP_001064349.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
           norvegicus]
 gi|392345961|ref|XP_345267.3| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Rattus
           norvegicus]
          Length = 1144

 Score = 40.8 bits (94), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 894 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 929


>gi|395842293|ref|XP_003793952.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 [Otolemur garnettii]
          Length = 1138

 Score = 40.8 bits (94), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 888 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 923


>gi|395539485|ref|XP_003771699.1| PREDICTED: transcription intermediary factor 1-alpha [Sarcophilus
           harrisii]
          Length = 880

 Score = 40.8 bits (94), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           ES ++D+  +  C+ C NGG+LLCCE +C + FH T
Sbjct: 647 ESRKDDDPNEDWCAVCQNGGELLCCE-KCPKVFHLT 681


>gi|119637830|ref|NP_001073299.1| E3 ubiquitin-protein ligase TRIM33 isoform 2 [Mus musculus]
          Length = 1123

 Score = 40.8 bits (94), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 890 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 925


>gi|148675650|gb|EDL07597.1| tripartite motif protein 33 [Mus musculus]
          Length = 951

 Score = 40.8 bits (94), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 701 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 736


>gi|442620101|ref|NP_001262768.1| bonus, isoform F [Drosophila melanogaster]
 gi|440217669|gb|AGB96148.1| bonus, isoform F [Drosophila melanogaster]
          Length = 1122

 Score = 40.8 bits (94), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 37/72 (51%), Gaps = 7/72 (9%)

Query: 211 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKD 270
           VH + +  + +ED+  +  C+ C +GG+L+CC+ +C + FH      +  H  ++    D
Sbjct: 871 VHSSTDLSNTQEDDPNEDWCAVCLDGGELMCCD-KCPKVFH------QNCHIPAISSLPD 923

Query: 271 EVEAMLNFFCKN 282
           E E+     C N
Sbjct: 924 ESESWQCLLCVN 935


>gi|432103232|gb|ELK30472.1| Tripartite motif-containing protein 66 [Myotis davidii]
          Length = 1324

 Score = 40.8 bits (94), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 41/84 (48%), Gaps = 13/84 (15%)

Query: 227  DSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTKDEVEAMLNF 278
            +  C+ C NGG+LLCC+ RC + +H         +   G+        LT+ E+E    +
Sbjct: 1103 EDFCAVCLNGGELLCCD-RCPKVYHLSCHLPALLSFPGGDWVCTLCRSLTQPEME----Y 1157

Query: 279  FCKNCEYKQHQCFACGKLGSSDKE 302
             C+N  Y Q   +A   LG SD++
Sbjct: 1158 DCENARYNQPGSWAPPGLGISDQK 1181


>gi|442620099|ref|NP_001262767.1| bonus, isoform E [Drosophila melanogaster]
 gi|440217668|gb|AGB96147.1| bonus, isoform E [Drosophila melanogaster]
          Length = 1125

 Score = 40.8 bits (94), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 37/72 (51%), Gaps = 7/72 (9%)

Query: 211 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKD 270
           VH + +  + +ED+  +  C+ C +GG+L+CC+ +C + FH      +  H  ++    D
Sbjct: 873 VHSSTDLSNTQEDDPNEDWCAVCLDGGELMCCD-KCPKVFH------QNCHIPAISSLPD 925

Query: 271 EVEAMLNFFCKN 282
           E E+     C N
Sbjct: 926 ESESWQCLLCVN 937


>gi|442620097|ref|NP_001262766.1| bonus, isoform D [Drosophila melanogaster]
 gi|440217667|gb|AGB96146.1| bonus, isoform D [Drosophila melanogaster]
          Length = 1119

 Score = 40.8 bits (94), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 37/72 (51%), Gaps = 7/72 (9%)

Query: 211 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKD 270
           VH + +  + +ED+  +  C+ C +GG+L+CC+ +C + FH      +  H  ++    D
Sbjct: 867 VHSSTDLSNTQEDDPNEDWCAVCLDGGELMCCD-KCPKVFH------QNCHIPAISSLPD 919

Query: 271 EVEAMLNFFCKN 282
           E E+     C N
Sbjct: 920 ESESWQCLLCVN 931


>gi|390466407|ref|XP_002807066.2| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 1 [Callithrix
           jacchus]
          Length = 1127

 Score = 40.8 bits (94), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|119578440|gb|EAW58036.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_c [Homo
           sapiens]
          Length = 5265

 Score = 40.8 bits (94), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 41/160 (25%), Positives = 61/160 (38%), Gaps = 30/160 (18%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY-KQ 287
           CS C   G  + C    C R +H        S      L+   ++ +     +   Y ++
Sbjct: 172 CSHCTRLGASIPCRSPGCPRLYHFPCATASGSF-----LSMKTLQLLCPEHSEGAAYLEE 226

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
            +C  C   G        ++F C S  CGH YH  C+      D  + A + A       
Sbjct: 227 ARCAVCEGPGEL-----CDLFFCTS--CGHHYHGACL------DTALTARKRA------- 266

Query: 348 SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            + CP  +C +C Q   K  +D +  VC  C K YH  CL
Sbjct: 267 GWQCP--ECKVC-QACRKPGNDSKMLVCETCDKGYHTFCL 303


>gi|390466409|ref|XP_003733584.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33 isoform 2 [Callithrix
           jacchus]
          Length = 1110

 Score = 40.8 bits (94), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 877 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 912


>gi|403165978|ref|XP_003325884.2| hypothetical protein PGTG_07086 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
 gi|375165992|gb|EFP81465.2| hypothetical protein PGTG_07086 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 1043

 Score = 40.8 bits (94), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 40/78 (51%), Gaps = 6/78 (7%)

Query: 348  SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRK--IAFEDKLEEGIITRA 405
            SFTC  H+C +C++   +A   L    C+ CP AY  +CLP +   +  D L E ++   
Sbjct: 932  SFTCSQHRCVVCQRAGPEAGGML--YRCQTCPDAYCEECLPHENFYSMGDTLPEFLLL-G 988

Query: 406  WEGLLPNHRI-LIYCLKH 422
            +  +   H I  + CLKH
Sbjct: 989  YGPMSQAHYIRCVECLKH 1006


>gi|2358285|gb|AAC51734.1| ALR [Homo sapiens]
          Length = 5262

 Score = 40.8 bits (94), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 41/160 (25%), Positives = 61/160 (38%), Gaps = 30/160 (18%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY-KQ 287
           CS C   G  + C    C R +H        S      L+   ++ +     +   Y ++
Sbjct: 172 CSHCTRLGASIPCRSPGCPRLYHFPCATASGSF-----LSMKTLQLLCPEHSEGAAYLEE 226

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
            +C  C   G        ++F C S  CGH YH  C+      D  + A + A       
Sbjct: 227 ARCAVCEGPGEL-----CDLFFCTS--CGHHYHGACL------DTALTARKRA------- 266

Query: 348 SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            + CP  +C +C Q   K  +D +  VC  C K YH  CL
Sbjct: 267 GWQCP--ECKVC-QACRKPGNDSKMLVCETCDKGYHTFCL 303


>gi|417413404|gb|JAA53031.1| Putative e3 ubiquitin-protein ligase trim33, partial [Desmodus
           rotundus]
          Length = 1056

 Score = 40.8 bits (94), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 806 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 841


>gi|413933083|gb|AFW67634.1| hypothetical protein ZEAMMB73_811991 [Zea mays]
          Length = 1579

 Score = 40.8 bits (94), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 42/157 (26%), Positives = 63/157 (40%), Gaps = 38/157 (24%)

Query: 196  KAMSGFIVDDMEEDMVH----------DTEEDE----SNEEDELFDSVCSFCDNGGDLLC 241
            K     +VD ++ D++H          D E  +    S E D+  D  C  C +GG+L+C
Sbjct: 961  KPYRNILVDGLDIDLLHCLINAWNMQSDAERQDFFPVSIEGDDPNDDTCGICGDGGNLIC 1020

Query: 242  CEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 301
            C+G C  +FH +           LGL   E      + C NC      C  C +  S D 
Sbjct: 1021 CDG-CPSTFHMSC----------LGL---EALPTDYWCCSNC-----SCKFCHEHSSDDA 1061

Query: 302  ETGAEVFPCVSATCGH----FYHPHCVSKLLLRDDEV 334
            E  A+V   +  TC      +YH   +  +    D+V
Sbjct: 1062 EDTADVDSSLH-TCSQIHNIWYHESGILSMTSAADDV 1097


>gi|426358564|ref|XP_004046577.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Gorilla gorilla
           gorilla]
          Length = 4782

 Score = 40.4 bits (93), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 43/183 (23%), Positives = 63/183 (34%), Gaps = 35/183 (19%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESH---------CASLGLTKDEVEAMLNFF 279
           C+FC   G  + CCE +C + +H    AG  S          C        E   +  FF
Sbjct: 204 CAFCKRFGATIKCCEEKCTQMYHYPCAAGAGSFQDFSHIFLLCPEHIDQAPERYLLDQFF 263

Query: 280 CKNCEYKQH----------------QCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHC 323
           C  C    H                QC  C K+  + K++G +    V  TC   YH  C
Sbjct: 264 CTTCGQHYHGMCLDIAVTPLKRAGWQCPEC-KVCQNCKQSGEDSKMLVCDTCDKGYHTFC 322

Query: 324 VSKLL--LRDDEVAADQLAKSIIAGESFTCPL-HKCCICKQGENKADSDLQFAVCRRCPK 380
           +  ++  +  +          I  G   +    H C IC     + D+     +C  C K
Sbjct: 323 LQPVMKSVPTNGWKCKNCRICIECGTRSSSQWHHNCLICDNCYQQQDN-----LCPFCGK 377

Query: 381 AYH 383
            YH
Sbjct: 378 CYH 380


>gi|170050214|ref|XP_001859681.1| set domain protein [Culex quinquefasciatus]
 gi|167871729|gb|EDS35112.1| set domain protein [Culex quinquefasciatus]
          Length = 2934

 Score = 40.4 bits (93), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 47/172 (27%), Positives = 63/172 (36%), Gaps = 52/172 (30%)

Query: 230 CSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCK-------- 281
           C FC   G  L C+  C +SFH    A      A+ G    +V    N FCK        
Sbjct: 233 CYFCSRYGASLVCKMSCPKSFHYPCIA------AAGGF---QVIQSYNCFCKEHLGQVPL 283

Query: 282 ------NCEYKQHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVA 335
                 NC     QC A G +G+        +  C  + CG  YH  C          V 
Sbjct: 284 VCTDDINC----RQCSALGDVGN--------LMMC--SICGDHYHGTC----------VG 319

Query: 336 ADQLAKSIIAGESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
             QL       +  +C   KC IC+  ++   S+ +   C +C K YH  CL
Sbjct: 320 LAQLPGVRSGWQCGSC--KKCQICRVPDS---SEGRTVGCEQCDKIYHASCL 366


>gi|37360250|dbj|BAC98103.1| mKIAA1113 protein [Mus musculus]
          Length = 1071

 Score = 40.4 bits (93), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 838 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 873


>gi|356540327|ref|XP_003538641.1| PREDICTED: uncharacterized protein LOC100801863 [Glycine max]
          Length = 1301

 Score = 40.4 bits (93), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 43/105 (40%), Gaps = 26/105 (24%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAML---NFFCKNC 283
           D  C  C +GGDL+CC+G C  +FH +        C  +         ML    + C NC
Sbjct: 710 DDTCGICGDGGDLICCDG-CPSTFHQS--------CLDI--------QMLPPGEWRCMNC 752

Query: 284 EYKQHQCFACG-KLGSSDKETGAEVFPCVSATCGHFYHPHCVSKL 327
                 C  CG   G+S+K+  +     +   C   YH  C  ++
Sbjct: 753 -----TCKFCGIASGTSEKDDASVCVLHICNLCEKKYHDSCTKEM 792


>gi|295657191|ref|XP_002789167.1| nucleoporin Pom152 [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226284511|gb|EEH40077.1| nucleoporin Pom152 [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1258

 Score = 40.4 bits (93), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 42/158 (26%), Positives = 73/158 (46%), Gaps = 21/158 (13%)

Query: 1068 LSSPTIVIDGTSPLGVHSSKTIEMLSHDDGCQCDQMPH-RSPVNVSSGILEG--ESSKP- 1123
            LS+ T+V++GT PL +  S+ +  + H    Q  Q  + RSP+ +  G ++   +SS P 
Sbjct: 330  LSNLTLVVEGTPPLKIKYSRKVNKIDHGVSFQSIQPENFRSPL-IGQGHVDSLIDSSNPD 388

Query: 1124 --------IEIPALSGIGDGGYEHFGVGMSHHSAATVIDGASSLQGLSSKSIEMPSYTQF 1175
                    IE+P    +  GG   + +   H +   V + +++ +     S   P   QF
Sbjct: 389  ISWAQAQRIEVPLNESLSTGGEWIYTIEEVHDACGNVANYSTASEDGDQFSKIAPQTQQF 448

Query: 1176 DDNVHQHFEDKGVPRCSPAKVIAKSSLEDHSSKSIDIP 1213
               VH+       PR S ++  A+S L+    +SID+P
Sbjct: 449  --VVHER------PRVSLSRCNAQSYLQVAKDQSIDLP 478


>gi|66815871|ref|XP_641952.1| hypothetical protein DDB_G0278771 [Dictyostelium discoideum AX4]
 gi|60470006|gb|EAL67987.1| hypothetical protein DDB_G0278771 [Dictyostelium discoideum AX4]
          Length = 1350

 Score = 40.4 bits (93), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 43/109 (39%), Gaps = 22/109 (20%)

Query: 290  CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 349
            CF+C K    D +    +  C S  C   YH  C               LA +     S+
Sbjct: 1028 CFSCKK--EEDSKGRTSMVQCRS--CPKIYHRSCAG-------------LAHT---PRSW 1067

Query: 350  TCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCLPRKIAFEDKLE 398
             C  H C  CK+  N++     F +C+ CP ++   CLP  +   DK E
Sbjct: 1068 KCSRHACHQCKKTPNESGGS--FFICKGCPSSFCITCLPNDVKILDKTE 1114


>gi|297796793|ref|XP_002866281.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297312116|gb|EFH42540.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 1047

 Score = 40.4 bits (93), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 45/100 (45%), Gaps = 22/100 (22%)

Query: 227 DSVCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYK 286
           D +CS C  GG L+ C+G C  +FHA        +C  L    D      ++FC++C   
Sbjct: 703 DVICSVCHYGGKLILCDG-CPSAFHA--------NCLGLEEVPDG-----DWFCESC--- 745

Query: 287 QHQCFACGK--LGSSDKETGAEVFPCVSATCGHFYHPHCV 324
              C ACG+  L ++ K    E F      C   YHP C+
Sbjct: 746 --CCGACGQFFLKATSKYAKEEKF-ISCKQCELKYHPSCL 782


>gi|148762969|ref|NP_003473.3| histone-lysine N-methyltransferase MLL2 [Homo sapiens]
 gi|313104132|sp|O14686.2|MLL2_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL2; AltName:
           Full=ALL1-related protein; AltName: Full=Lysine
           N-methyltransferase 2D; Short=KMT2D; AltName:
           Full=Myeloid/lymphoid or mixed-lineage leukemia protein
           2
 gi|119578439|gb|EAW58035.1| myeloid/lymphoid or mixed-lineage leukemia 2, isoform CRA_b [Homo
           sapiens]
          Length = 5537

 Score = 40.4 bits (93), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 41/160 (25%), Positives = 61/160 (38%), Gaps = 30/160 (18%)

Query: 230 CSFCDN-GGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEY-KQ 287
           CS C   G  + C    C R +H        S      L+   ++ +     +   Y ++
Sbjct: 172 CSHCTRLGASIPCRSPGCPRLYHFPCATASGSF-----LSMKTLQLLCPEHSEGAAYLEE 226

Query: 288 HQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGE 347
            +C  C   G        ++F C S  CGH YH  C+      D  + A + A       
Sbjct: 227 ARCAVCEGPGEL-----CDLFFCTS--CGHHYHGACL------DTALTARKRA------- 266

Query: 348 SFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
            + CP  +C +C Q   K  +D +  VC  C K YH  CL
Sbjct: 267 GWQCP--ECKVC-QACRKPGNDSKMLVCETCDKGYHTFCL 303


>gi|115441477|ref|NP_001045018.1| Os01g0884500 [Oryza sativa Japonica Group]
 gi|56784490|dbj|BAD82583.1| zinc finger (CCCH-type) protein-like [Oryza sativa Japonica Group]
 gi|113534549|dbj|BAF06932.1| Os01g0884500 [Oryza sativa Japonica Group]
          Length = 892

 Score = 40.4 bits (93), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 30/72 (41%), Gaps = 20/72 (27%)

Query: 290 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 349
           CF C        + G ++  C    C   YHPHCV K     +E   D       + E F
Sbjct: 17  CFVC--------KDGGDIRVCDFKNCLKGYHPHCVGK-----EESFLD-------SAEQF 56

Query: 350 TCPLHKCCICKQ 361
            C LHKC  CK+
Sbjct: 57  ICELHKCVSCKR 68


>gi|354487414|ref|XP_003505868.1| PREDICTED: E3 ubiquitin-protein ligase TRIM33, partial [Cricetulus
           griseus]
          Length = 1008

 Score = 40.4 bits (93), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 758 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 793


>gi|349602974|gb|AEP98947.1| E3 ubiquitin-protein ligase TRIM33-like protein, partial [Equus
           caballus]
          Length = 351

 Score = 40.4 bits (93), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 241 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 276


>gi|395739021|ref|XP_002818554.2| PREDICTED: LOW QUALITY PROTEIN: transcription intermediary factor
            1-alpha [Pongo abelii]
          Length = 1221

 Score = 40.4 bits (93), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218  ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
            E+ +ED+  +  C+ C NGG+LLCCE +C + FH +
Sbjct: 988  ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 1022


>gi|395837426|ref|XP_003791635.1| PREDICTED: transcription intermediary factor 1-alpha [Otolemur
           garnettii]
          Length = 1014

 Score = 40.4 bits (93), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 11/92 (11%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH            +GE        L+K
Sbjct: 781 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 839

Query: 270 DEVEAMLNFFCKNCEYKQHQCFACGKLGSSDK 301
            EVE   +    N E K+ +  A  KL   DK
Sbjct: 840 PEVEYDCDTSSHNSEKKKTEGLA--KLTPIDK 869


>gi|221509248|gb|EEE34817.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
           VEG]
          Length = 742

 Score = 40.4 bits (93), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 13/74 (17%)

Query: 211 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTK 269
             D EE  S    E  DS+C  C  GG+++CCEG C  SFH   +D           LT+
Sbjct: 3   ASDAEEARSVSSRESNDSICFCCRGGGEVVCCEG-CPNSFHVGCLDTARRPQ-----LTE 56

Query: 270 DEVEAMLNFFCKNC 283
           +E      +FC  C
Sbjct: 57  EE------WFCPEC 64


>gi|218189485|gb|EEC71912.1| hypothetical protein OsI_04692 [Oryza sativa Indica Group]
          Length = 892

 Score = 40.4 bits (93), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 30/72 (41%), Gaps = 20/72 (27%)

Query: 290 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 349
           CF C        + G ++  C    C   YHPHCV K     +E   D       + E F
Sbjct: 17  CFVC--------KDGGDIRVCDFKNCLKGYHPHCVGK-----EESFLD-------SAEQF 56

Query: 350 TCPLHKCCICKQ 361
            C LHKC  CK+
Sbjct: 57  ICELHKCVSCKR 68


>gi|222619635|gb|EEE55767.1| hypothetical protein OsJ_04319 [Oryza sativa Japonica Group]
          Length = 893

 Score = 40.4 bits (93), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 30/72 (41%), Gaps = 20/72 (27%)

Query: 290 CFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAGESF 349
           CF C        + G ++  C    C   YHPHCV K     +E   D       + E F
Sbjct: 17  CFVC--------KDGGDIRVCDFKNCLKGYHPHCVGK-----EESFLD-------SAEQF 56

Query: 350 TCPLHKCCICKQ 361
            C LHKC  CK+
Sbjct: 57  ICELHKCVSCKR 68


>gi|194380288|dbj|BAG63911.1| unnamed protein product [Homo sapiens]
          Length = 961

 Score = 40.4 bits (93), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH +
Sbjct: 728 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 762


>gi|327271546|ref|XP_003220548.1| PREDICTED: e3 ubiquitin-protein ligase TRIM33-like [Anolis
           carolinensis]
          Length = 947

 Score = 40.4 bits (93), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 25/35 (71%), Gaps = 1/35 (2%)

Query: 219 SNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           +N++++  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 699 NNKDEDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 732


>gi|355748043|gb|EHH52540.1| hypothetical protein EGM_12996, partial [Macaca fascicularis]
          Length = 929

 Score = 40.4 bits (93), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 33/64 (51%), Gaps = 9/64 (14%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH            +GE        L+K
Sbjct: 696 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 754

Query: 270 DEVE 273
            EVE
Sbjct: 755 PEVE 758


>gi|242047478|ref|XP_002461485.1| hypothetical protein SORBIDRAFT_02g003430 [Sorghum bicolor]
 gi|241924862|gb|EER98006.1| hypothetical protein SORBIDRAFT_02g003430 [Sorghum bicolor]
          Length = 88

 Score = 40.4 bits (93), Expect = 8.3,   Method: Composition-based stats.
 Identities = 27/76 (35%), Positives = 42/76 (55%), Gaps = 6/76 (7%)

Query: 1  MASSDDEVEVGQKLVSDYYFEHEGERKQKEIISFSALPIQWNENERTGG-CKELIYLRGA 59
          M+  DD V+   + V+ YYFE +GE    E + FS LP Q+ EN+      ++ ++L G 
Sbjct: 1  MSDDDDGVDPEIEDVNGYYFE-DGE---GEPVCFSILPFQFGENDNEADFSRKNVFLHGF 56

Query: 60 ADSGLQKIFKPVIAWK 75
           D     ++K V+AWK
Sbjct: 57 VDQS-PHVYKEVVAWK 71


>gi|444724699|gb|ELW65298.1| E3 ubiquitin-protein ligase TRIM33 [Tupaia chinensis]
          Length = 1036

 Score = 40.4 bits (93), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           D ++++D+  +  C+ C NGGDLLCCE +C + FH T
Sbjct: 744 DGNSKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLT 779


>gi|355561030|gb|EHH17716.1| hypothetical protein EGK_14177, partial [Macaca mulatta]
          Length = 933

 Score = 40.0 bits (92), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 33/64 (51%), Gaps = 9/64 (14%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH            +GE        L+K
Sbjct: 700 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLTNFPSGEWICTFCRDLSK 758

Query: 270 DEVE 273
            EVE
Sbjct: 759 PEVE 762


>gi|380798995|gb|AFE71373.1| transcription intermediary factor 1-alpha isoform a, partial
           [Macaca mulatta]
          Length = 955

 Score = 40.0 bits (92), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH +
Sbjct: 722 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 756


>gi|356495799|ref|XP_003516760.1| PREDICTED: uncharacterized protein LOC100814247 [Glycine max]
          Length = 1314

 Score = 40.0 bits (92), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 26/100 (26%), Positives = 43/100 (43%), Gaps = 20/100 (20%)

Query: 229 VCSFCDNGGDLLCCEGRCLRSFHATIDAGEESHCASLGLTKDEVEAMLNFFCKNCEYKQH 288
            C  C +GGDL+CC+G C  +FH +        C  +     ++  +  + C NC     
Sbjct: 727 TCGICGDGGDLICCDG-CPSTFHQS--------CLDI-----QMLPLGEWHCPNC----- 767

Query: 289 QCFACG-KLGSSDKETGAEVFPCVSATCGHFYHPHCVSKL 327
            C  CG   G+S+K+  +     +   C   YH  C  ++
Sbjct: 768 TCKFCGIASGNSEKDDASVYVLQICNLCEKKYHDSCTKEM 807


>gi|225685102|gb|EEH23386.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 1258

 Score = 40.0 bits (92), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 39/157 (24%), Positives = 71/157 (45%), Gaps = 19/157 (12%)

Query: 1068 LSSPTIVIDGTSPLGVHSSKTIEMLSHDDGCQCDQMPH-RSP----------VNVSSGIL 1116
            LS+ T+V++GT PL +  S+ +  + H    Q  Q  + RSP          ++ S+  +
Sbjct: 330  LSNLTLVVEGTPPLKIKYSRKVNKIDHGVSFQSIQPENFRSPLIGQGRVDSLIDPSNPDI 389

Query: 1117 EGESSKPIEIPALSGIGDGGYEHFGVGMSHHSAATVIDGASSLQGLSSKSIEMPSYTQFD 1176
                ++ IE+P    +  GG   + +   H +   V + +++ +     S   P   QF 
Sbjct: 390  SWAQAQRIEVPLNESLSTGGEWIYTIEEVHDACGNVANYSTASEDGDLLSKIAPQTQQF- 448

Query: 1177 DNVHQHFEDKGVPRCSPAKVIAKSSLEDHSSKSIDIP 1213
              VH+       PR S ++  A+S L+    +SID+P
Sbjct: 449  -VVHER------PRVSLSRCNAQSYLQVAKDRSIDLP 478


>gi|237837385|ref|XP_002367990.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
 gi|211965654|gb|EEB00850.1| PHD-finger domain-containing protein [Toxoplasma gondii ME49]
          Length = 743

 Score = 40.0 bits (92), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 13/74 (17%)

Query: 211 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTK 269
             D EE  S    E  DS+C  C  GG+++CCEG C  SFH   +D           LT+
Sbjct: 3   ASDAEEARSVSSRESNDSICFCCRGGGEVVCCEG-CPNSFHVGCLDTARRPQ-----LTE 56

Query: 270 DEVEAMLNFFCKNC 283
           +E      +FC  C
Sbjct: 57  EE------WFCPEC 64


>gi|291238977|ref|XP_002739402.1| PREDICTED: rCG56742-like, partial [Saccoglossus kowalevskii]
          Length = 1566

 Score = 40.0 bits (92), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 25/101 (24%), Positives = 44/101 (43%), Gaps = 20/101 (19%)

Query: 287 QHQCFACGKLGSSDKETGAEVFPCVSATCGHFYHPHCVSKLLLRDDEVAADQLAKSIIAG 346
           Q  C +CG  G   ++    +  C  + CG  YHP+CV+  + +            ++  
Sbjct: 798 QDMCVSCGSFG---RDAEGRLLTC--SQCGQCYHPYCVNIKITK------------VVLS 840

Query: 347 ESFTCPLHKCCICKQGENKADSDLQFAVCRRCPKAYHRKCL 387
           + + C    C +C +G  KA  + +  +C  C  +YH  CL
Sbjct: 841 KGWRC--LDCTVC-EGCGKASDEGRLLLCDDCDISYHTYCL 878


>gi|221488757|gb|EEE26971.1| PHD-finger domain-containing protein, putative [Toxoplasma gondii
           GT1]
          Length = 740

 Score = 40.0 bits (92), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 34/74 (45%), Gaps = 13/74 (17%)

Query: 211 VHDTEEDESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH-ATIDAGEESHCASLGLTK 269
             D EE  S    E  DS+C  C  GG+++CCEG C  SFH   +D           LT+
Sbjct: 3   ASDAEEARSVSSRESNDSICFCCRGGGEVVCCEG-CPNSFHVGCLDTARRPQ-----LTE 56

Query: 270 DEVEAMLNFFCKNC 283
           +E      +FC  C
Sbjct: 57  EE------WFCPEC 64


>gi|357289563|gb|AET72876.1| type II DNA modification enzyme [Phaeocystis globosa virus 12T]
 gi|357292291|gb|AET73627.1| type II methylase [Phaeocystis globosa virus 14T]
          Length = 279

 Score = 40.0 bits (92), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 63/125 (50%), Gaps = 17/125 (13%)

Query: 730 VNDGDMIVDFCCGANDFSCLMKKKLDETGKNCLYKNYDILPAKNDFNFEKRDWMTVEPKE 789
           +ND D+ ++   G+  F  +M    D   K  L    DI P  +D    K D+   +P  
Sbjct: 35  INDYDIHLEPSAGSGSFFNIM----DNIKKVGL----DIEPTTDDV--VKMDYFDYKP-- 82

Query: 790 LAPGSRLIMGLNPPFGVKAGLANKFINKALEFNPKLLILIVPPETERL---DRKESAYEL 846
           +A    L++G NPPFG  + LA KF NK+ EF+   +  I+P   +R+   ++    ++L
Sbjct: 83  IAGKKYLVVG-NPPFGRVSALAVKFFNKSAEFS-NCIAFIIPRTFKRVSIQNKLNLNFKL 140

Query: 847 VWEDD 851
           ++ +D
Sbjct: 141 IYNED 145


>gi|354482186|ref|XP_003503281.1| PREDICTED: transcription intermediary factor 1-alpha [Cricetulus
           griseus]
          Length = 954

 Score = 40.0 bits (92), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH +
Sbjct: 721 EARKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 755


>gi|343959936|dbj|BAK63825.1| transcription intermediary factor 1-gamma [Pan troglodytes]
          Length = 81

 Score = 40.0 bits (92), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 26/38 (68%), Gaps = 1/38 (2%)

Query: 217 DESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHATI 254
           D +N++D+  +  C+ C NGGDLLCCE +C + FH T 
Sbjct: 33  DGNNKDDDPNEDWCAVCQNGGDLLCCE-KCPKVFHLTC 69


>gi|47419911|ref|NP_056989.2| transcription intermediary factor 1-alpha isoform a [Homo sapiens]
 gi|114616226|ref|XP_519410.2| PREDICTED: transcription intermediary factor 1-alpha isoform 5 [Pan
           troglodytes]
 gi|397484615|ref|XP_003813469.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1 [Pan
           paniscus]
 gi|12746552|sp|O15164.3|TIF1A_HUMAN RecName: Full=Transcription intermediary factor 1-alpha;
           Short=TIF1-alpha; AltName: Full=E3 ubiquitin-protein
           ligase TRIM24; AltName: Full=RING finger protein 82;
           AltName: Full=Tripartite motif-containing protein 24
 gi|21040397|gb|AAH28689.2| Tripartite motif-containing 24 [Homo sapiens]
 gi|51094801|gb|EAL24047.1| transcriptional intermediary factor 1 [Homo sapiens]
 gi|61363838|gb|AAX42452.1| transcriptional intermediary factor 1 [synthetic construct]
 gi|119604291|gb|EAW83885.1| tripartite motif-containing 24, isoform CRA_b [Homo sapiens]
 gi|193786782|dbj|BAG52105.1| unnamed protein product [Homo sapiens]
 gi|261860458|dbj|BAI46751.1| tripartite motif-containing protein 24 [synthetic construct]
 gi|410249256|gb|JAA12595.1| tripartite motif containing 24 [Pan troglodytes]
 gi|410287764|gb|JAA22482.1| tripartite motif containing 24 [Pan troglodytes]
          Length = 1050

 Score = 40.0 bits (92), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH +
Sbjct: 817 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 851


>gi|344297158|ref|XP_003420266.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
           [Loxodonta africana]
          Length = 1048

 Score = 40.0 bits (92), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 9/64 (14%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFH--------ATIDAGEESHCASLGLTK 269
           ++ +ED+  +  C+ C NGG+LLCCE +C + FH        A   +GE        L+K
Sbjct: 816 DTRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLSCHVPTLANFPSGEWICTFCRDLSK 874

Query: 270 DEVE 273
            EVE
Sbjct: 875 PEVE 878


>gi|410215746|gb|JAA05092.1| tripartite motif containing 24 [Pan troglodytes]
          Length = 1050

 Score = 40.0 bits (92), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH +
Sbjct: 817 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 851


>gi|332224570|ref|XP_003261442.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
           [Nomascus leucogenys]
 gi|426358062|ref|XP_004046341.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
           [Gorilla gorilla gorilla]
          Length = 1050

 Score = 40.0 bits (92), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH +
Sbjct: 817 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 851


>gi|402864953|ref|XP_003896704.1| PREDICTED: transcription intermediary factor 1-alpha isoform 1
           [Papio anubis]
          Length = 1050

 Score = 40.0 bits (92), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH +
Sbjct: 817 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 851


>gi|386780660|ref|NP_001247764.1| transcription intermediary factor 1-alpha [Macaca mulatta]
 gi|383422497|gb|AFH34462.1| transcription intermediary factor 1-alpha isoform a [Macaca
           mulatta]
          Length = 1050

 Score = 40.0 bits (92), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 218 ESNEEDELFDSVCSFCDNGGDLLCCEGRCLRSFHAT 253
           E+ +ED+  +  C+ C NGG+LLCCE +C + FH +
Sbjct: 817 ETRKEDDPNEDWCAVCQNGGELLCCE-KCPKVFHLS 851


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.316    0.133    0.397 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 23,349,904,297
Number of Sequences: 23463169
Number of extensions: 1057528469
Number of successful extensions: 2890783
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 112
Number of HSP's successfully gapped in prelim test: 993
Number of HSP's that attempted gapping in prelim test: 2884917
Number of HSP's gapped (non-prelim): 4714
length of query: 1407
length of database: 8,064,228,071
effective HSP length: 156
effective length of query: 1251
effective length of database: 8,698,941,003
effective search space: 10882375194753
effective search space used: 10882375194753
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 84 (37.0 bits)