BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 002824
         (876 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225446989|ref|XP_002268363.1| PREDICTED: uncharacterized protein LOC100248222 [Vitis vinifera]
          Length = 1216

 Score =  986 bits (2548), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 505/899 (56%), Positives = 650/899 (72%), Gaps = 30/899 (3%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGP 60
           M  S+E+ + L   + DY F+D  +E ISFS+LPLQW   +     +M +FL G+  DG 
Sbjct: 1   MASSDEEGEILPNCIRDYYFVDYKDEPISFSILPLQWSKDENPDGLKMMIFLRGSAYDGL 60

Query: 61  WNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQN 120
               +QVIAWKFELS  +PEI+VLSK  NW+ LQSP+KSF+NIVRTIL+TV WL+ VK+N
Sbjct: 61  QKIYKQVIAWKFELSSVEPEIFVLSKDKNWMELQSPRKSFQNIVRTILVTVSWLHFVKRN 120

Query: 121 PEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANH-- 178
           PEAS K +   +LK F SYE EPSENDLLDHMP+ +EA +R++DL  SK +L+++     
Sbjct: 121 PEASGKSLWNHLLKSFSSYEFEPSENDLLDHMPLIQEAVKREEDLLKSKCMLTYLPEKTG 180

Query: 179 ------------PETNTFHEDVRTTEKPKVIVDWNEDEDSERVDKDEN--YFAVCAICDD 224
                         +N    DV T  +   I D +++++ E  D+D +  + +VCAICD+
Sbjct: 181 GETALYEGSLVLSSSNIPSSDVHTMRRYTFIDDRDDNDEDEENDEDNDELFDSVCAICDN 240

Query: 225 GGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGM 284
           GG++  C+GRCLRSFHAT+ AG  + C+SLG++ AQ++A+ NFLC+NC YQ+HQCF CGM
Sbjct: 241 GGELLCCEGRCLRSFHATVDAGTESFCESLGFSDAQVEAIQNFLCKNCQYQQHQCFVCGM 300

Query: 285 LGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHK 344
           LGSS++SS  EVF C SATCG+FYHP CV+K LHP N  LA+ L+ +IA G SFTCP+HK
Sbjct: 301 LGSSNESSGAEVFRCASATCGRFYHPYCVAKRLHPMNNILAKHLQNKIAGGVSFTCPLHK 360

Query: 345 CFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFS-DADENNFQRAWVDLLPNNRI 403
           CFVC++ E+  V+DLQ A+CRRCPKAYHRKCLP  I+F    +EN  QRAW+ LLPN RI
Sbjct: 361 CFVCKRGENKGVDDLQFALCRRCPKAYHRKCLPGNISFECIYNENIMQRAWIGLLPN-RI 419

Query: 404 LIYCLEHKIISELKTPARDHLKFPGVEGKRKKEDLELLLTEEKDVASKRNIVSESFVADK 463
           LIYC+EHKI  +L+TP R+H++FP  E K KK   EL  + EK ++ KRNIVSE F A+ 
Sbjct: 420 LIYCMEHKINRKLRTPERNHIRFPDPESKGKKHVSELPSSNEKVMSKKRNIVSEIFPAES 479

Query: 464 TVVKKLKLAEVYSGADVGMSNSEIKKRWPRQDVHSLKKPNITDTGRKSLKDIADKSKP-- 521
           T VK  KL EV+       S    +KR   Q      K  I D  +K L+D   KS P  
Sbjct: 480 TAVKMTKL-EVHRVVKDVDSTKFFEKRCSSQGFDPPTKQKINDATKKFLRDNV-KSVPVK 537

Query: 522 ---SLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKELL 578
              S+    T    R++ +KP  +NI  SK++   S +  MK+ + SQP +DAE+E  ++
Sbjct: 538 ICASVAVKGTQSSLRNYNIKPKQQNIP-SKVEKITSLKPSMKRASSSQPLMDAELETRIV 596

Query: 579 ALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDGGC 638
            L+K +TSSF+ EEF +   V  ++   SK++L+ +IT G VE SVKA+R ALE L+ GC
Sbjct: 597 DLMKSTTSSFSLEEFREKQKVLCSY---SKNVLDSTITQGKVEVSVKAIRTALEKLEKGC 653

Query: 639 DIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWY 698
            IEDAKAVC PE+L QI +WKRKL VYLAPFLHGMRYTSFGRHFTKVEKL+E+VDRLHWY
Sbjct: 654 SIEDAKAVCEPEVLNQIMRWKRKLKVYLAPFLHGMRYTSFGRHFTKVEKLREVVDRLHWY 713

Query: 699 VRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEE 758
           V+ GD IVDFCCG+NDFSC+MK KL+++GKSCSF+NYDLIQPKNDFSFEKRDWM++  +E
Sbjct: 714 VQHGDMIVDFCCGSNDFSCLMKEKLDKVGKSCSFKNYDLIQPKNDFSFEKRDWMSIHLDE 773

Query: 759 LPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQK-ASYNLIWE 817
           LP GSQLIMGLNPPFGVKASLANKFI +AL F+PKL++LIVP+ET+RLD+K ++Y+LIWE
Sbjct: 774 LPAGSQLIMGLNPPFGVKASLANKFIDKALSFRPKLLILIVPKETKRLDEKDSAYDLIWE 833

Query: 818 DNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHITVEE 876
           D ++LSGKSFYLPGS+D+HD QLEQWN  PP LYLWSR DWT+ HK +A   GHI++E+
Sbjct: 834 DEDILSGKSFYLPGSVDMHDKQLEQWNLLPPLLYLWSRPDWTSRHKAVAQKCGHISIEQ 892


>gi|359494102|ref|XP_002276879.2| PREDICTED: uncharacterized protein LOC100243584 [Vitis vinifera]
          Length = 1260

 Score =  959 bits (2479), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 493/900 (54%), Positives = 646/900 (71%), Gaps = 37/900 (4%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGP 60
           M  S+++ +TL   VS+Y F+D   E ISFSVLP+QW   D + S +  +FL GN  +G 
Sbjct: 1   MASSDDEGETLPGSVSNYHFVDDKGEPISFSVLPIQWSKGDNLDSKKEPIFLDGNADNGL 60

Query: 61  WNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQN 120
               +QVIAWKF+LS   PEI VLSK+NNWI LQ P+KSF++I+R+ILITV  L+ +K+N
Sbjct: 61  QKIYKQVIAWKFDLSDVNPEISVLSKENNWIKLQKPRKSFEDIIRSILITVWCLHSMKKN 120

Query: 121 PEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHP- 179
           PE S K + + + +VF  Y+V PSENDL+DH  +  EA +RD+ LA SKFLL+F+   P 
Sbjct: 121 PETSGKSLWDHLSRVFSLYDVRPSENDLVDHTTLISEAVKRDEGLAKSKFLLTFLEEKPR 180

Query: 180 ETNTFHEDVRTTEKPKVIVDW-NEDEDSER----VDKDENYF-AVCAICDDGGDVTFCDG 233
           +  +F +DV TT KP  IVD+ +ED  SE      D++E+ F +VC++CD+GGD+  C+G
Sbjct: 181 KRKSFEQDVPTTSKPGFIVDYMDEDGISETGEVGSDEEEDLFDSVCSMCDNGGDLLCCEG 240

Query: 234 RCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSS 293
           RC+RSFHAT  AG+ +LC +LG + AQ++A+ NF C+NC Y++HQCF+CG LGSSDKSS 
Sbjct: 241 RCMRSFHATKEAGEESLCATLGMSVAQVEAMQNFYCKNCKYKQHQCFSCGKLGSSDKSSG 300

Query: 294 QEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSED 353
            EVF C +ATCG+FYHP+CV+KLLH ++E+ AEEL++ I AGE F CP+H+C VC+Q ED
Sbjct: 301 AEVFLCANATCGRFYHPQCVAKLLHREDEAAAEELQKNIYAGELFACPIHRCHVCKQGED 360

Query: 354 MNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNF-QRAWVDLLPNNRILIYCLEHKI 412
               +LQ AICRRCPK+YHRKCLP +I+F D DE    QRAW  LLPN RILIYCL+H+I
Sbjct: 361 KKDLELQFAICRRCPKSYHRKCLPRKISFEDLDEEGIIQRAWDGLLPN-RILIYCLKHEI 419

Query: 413 ISELKTPARDHLKFPGVEGKRKKEDLELLLTE---EKDVASKRNIVSESFVADKTVVKKL 469
              L TP RDH+KFP  E K +K   EL  +    +K V+ KR++VSE    ++  VK  
Sbjct: 420 DELLGTPIRDHIKFPNDEEKMEKRRSELFSSRKDLDKVVSKKRSLVSEDSPRERMAVKAT 479

Query: 470 KLAE-VYSGADVGMSNSEIKKRWPRQDVHSLKKPNITDTGRKSLKD-------------I 515
           K  E + S    G S  + +KR    D    K+  +T   +KSL D             +
Sbjct: 480 KQVEKLSSTVKDGDSTKKSEKRSSGPDPS--KRLKVTGFSKKSLDDNVKSISKKVDKSSM 537

Query: 516 ADKSKPSLRKD-STLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEME 574
           AD++K SL +    L+K+RS   +P   +   S+++  +      KK + S PS+D + E
Sbjct: 538 ADENKTSLGEQLYALIKNRS---EPRKEDTPNSELEQKVV----TKKTSSSLPSLDRDSE 590

Query: 575 KELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEML 634
             +LA+IK+S S    E+ MK H VP THA+ SK+ ++++IT G VEGS++A+RAAL+ L
Sbjct: 591 NRILAIIKESKSLITLEDVMKKHKVPSTHAYSSKNTVDRTITQGKVEGSIEALRAALKKL 650

Query: 635 DGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDR 694
           +GG  IEDAKAVC PE+L QI +WK KL VYLAPFLHGMRYTSFGRHFTKV+KLKEIV++
Sbjct: 651 EGGGSIEDAKAVCEPEVLNQIVKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLKEIVEK 710

Query: 695 LHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTV 754
           LH+YV++GDTIVDFCCGANDFSC+MK KLE+MGK CS++NYD+IQPKNDF+FEKRDWM+V
Sbjct: 711 LHYYVKNGDTIVDFCCGANDFSCLMKQKLEEMGKKCSYKNYDVIQPKNDFNFEKRDWMSV 770

Query: 755 RPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQK-ASYN 813
           + +ELP GSQLIMGLNPPFGVKASLAN FI++AL+FKPKL++LIVP ET RLD+K   Y+
Sbjct: 771 KQKELPTGSQLIMGLNPPFGVKASLANMFINKALQFKPKLLILIVPPETERLDKKRPPYD 830

Query: 814 LIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHIT 873
           LIWED+  LSGKSFYLPGS+DV+D Q+EQWN  PP LYLWSR DWT  H+ IA   GH++
Sbjct: 831 LIWEDDNELSGKSFYLPGSVDVNDKQIEQWNVNPPLLYLWSRQDWTTKHRAIAQKCGHVS 890


>gi|147814981|emb|CAN61366.1| hypothetical protein VITISV_020586 [Vitis vinifera]
          Length = 2238

 Score =  954 bits (2465), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 492/899 (54%), Positives = 643/899 (71%), Gaps = 36/899 (4%)

Query: 1    MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGP 60
            M  S+++ +TL   VS+Y F+D   E ISFSVLP+QW   D + S +  +FL GN  +G 
Sbjct: 647  MASSDDEGETLPGSVSNYHFVDDKGEPISFSVLPIQWSKGDNLDSKKEPIFLDGNADNGL 706

Query: 61   WNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQN 120
                +QVIAWKF+LS   PEI VLSK+NNWI LQ P+KSF++I+R+ILITV  L+ +K+N
Sbjct: 707  QKIYKQVIAWKFDLSDVNPEISVLSKENNWIKLQKPRKSFEDIIRSILITVWCLHSMKKN 766

Query: 121  PEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPE 180
            PE S K + + + +VF  Y+V PSENDL+DH  +  EA +RD+ LA SKFLL+F+   P 
Sbjct: 767  PETSGKSLWDHLSRVFSLYDVRPSENDLVDHTTLISEAVKRDEGLAKSKFLLTFLEEKPR 826

Query: 181  TNTFHEDVRTTEKPKVIVDW-NEDEDSER----VDKDENYF-AVCAICDDGGDVTFCDGR 234
                 EDV TT KP  IVD+ +ED  SE      D++E+ F +VC++CD+GGD+  C+GR
Sbjct: 827  KRKSFEDVPTTSKPGFIVDYMDEDGISETGEVGSDEEEDLFDSVCSMCDNGGDLLCCEGR 886

Query: 235  CLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQ 294
            C+RSFHAT  AG+ +LC +LG + AQ++A+ NF C+NC Y++HQCF+CG LGSSDKSS  
Sbjct: 887  CMRSFHATKEAGEESLCATLGMSVAQVEAMQNFYCKNCKYKQHQCFSCGKLGSSDKSSGA 946

Query: 295  EVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDM 354
            EVF C +ATCG+FYHP+CV+KLLH ++E+ AE+L++ I AGE F CP+H+C VC+Q ED 
Sbjct: 947  EVFLCANATCGRFYHPQCVAKLLHREDEAAAEDLQKNIYAGELFACPIHRCHVCKQGEDK 1006

Query: 355  NVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNF-QRAWVDLLPNNRILIYCLEHKII 413
               +LQ AICRRCPK+YHRKCLP +I+F D DE    QRAW  LLPN RILIYCL+H+I 
Sbjct: 1007 KDLELQFAICRRCPKSYHRKCLPRKISFEDLDEEGIIQRAWDGLLPN-RILIYCLKHEID 1065

Query: 414  SELKTPARDHLKFPGVEGKRKKEDLELLLTE---EKDVASKRNIVSESFVADKTVVKKLK 470
              L TP RDH+KFP  E K +K   EL  +    +K V+ KR++VSE    ++  VK  K
Sbjct: 1066 ELLGTPIRDHIKFPNDEEKMEKRRSELFSSRKDLDKVVSKKRSLVSEDSPRERMAVKATK 1125

Query: 471  LAE-VYSGADVGMSNSEIKKRWPRQDVHSLKKPNITDTGRKSLKD-------------IA 516
              E + S    G S  + +KR    D    K+  +T   +KSL D             +A
Sbjct: 1126 QVEKLSSTVKDGDSTKKSEKRSSGPDPS--KRLKVTGFSKKSLDDNVKSISKKVDKSSMA 1183

Query: 517  DKSKPSLRKD-STLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEK 575
            D++K SL +    L+K+RS   +P   +   S+++  +      KK + S PS+D + E 
Sbjct: 1184 DENKTSLGEQLYALIKNRS---EPRKEDTPNSELEQKVV----TKKTSSSLPSLDRDSEN 1236

Query: 576  ELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLD 635
             +LA+IK+S S    E+ MK H VP THA+ SK+ ++++IT G VEGS++A+RAAL+ L+
Sbjct: 1237 RILAIIKESKSLITLEDVMKKHKVPSTHAYSSKNTVDRTITQGKVEGSIEALRAALKKLE 1296

Query: 636  GGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRL 695
            GG  IEDAKAVC PE+L QI +WK KL VYLAPFLHGMRYTSFGRHFTKV+KLKEIV++L
Sbjct: 1297 GGGSIEDAKAVCEPEVLNQIVKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLKEIVEKL 1356

Query: 696  HWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVR 755
            H+YV++GDTIVDFCCGANDFSC+MK KLE+MGK CS++NYD+IQPKNDF+FEKRDWM+V+
Sbjct: 1357 HYYVKNGDTIVDFCCGANDFSCLMKQKLEEMGKKCSYKNYDVIQPKNDFNFEKRDWMSVK 1416

Query: 756  PEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQK-ASYNL 814
             +ELP GSQLIMGLNPPFGVKASLAN FI++AL+FKPKL++LIVP ET RLD+K   Y+L
Sbjct: 1417 QKELPTGSQLIMGLNPPFGVKASLANMFINKALQFKPKLLILIVPPETERLDKKRPPYDL 1476

Query: 815  IWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHIT 873
            IWED+  LSGKSFYLPGS+DV+D Q+EQWN  PP LYLWSR DWT  H+ IA   GH++
Sbjct: 1477 IWEDDNELSGKSFYLPGSVDVNDKQIEQWNVNPPLLYLWSRQDWTTKHRAIAQKCGHVS 1535


>gi|297737514|emb|CBI26715.3| unnamed protein product [Vitis vinifera]
          Length = 1201

 Score =  934 bits (2415), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 478/884 (54%), Positives = 618/884 (69%), Gaps = 64/884 (7%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGP 60
           M  S+++ +TL   VS+Y F+D   E ISFSVLP+QW   D + S +  +FL GN  +G 
Sbjct: 1   MASSDDEGETLPGSVSNYHFVDDKGEPISFSVLPIQWSKGDNLDSKKEPIFLDGNADNGL 60

Query: 61  WNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQN 120
               +QVIAWKF+LS   PEI VLSK+NNWI LQ P+KSF++I+R+ILITV  L+ +K+N
Sbjct: 61  QKIYKQVIAWKFDLSDVNPEISVLSKENNWIKLQKPRKSFEDIIRSILITVWCLHSMKKN 120

Query: 121 PEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPE 180
           PE S K + + + +VF  Y+V PSENDL+DH  +  EA +RD+ LA SKFLL+F+   P 
Sbjct: 121 PETSGKSLWDHLSRVFSLYDVRPSENDLVDHTTLISEAVKRDEGLAKSKFLLTFLEEKPR 180

Query: 181 TNTFHEDVRTTEKPKVIVDW-NEDEDSER----VDKDENYF-AVCAICDDGGDVTFCDGR 234
                EDV TT KP  IVD+ +ED  SE      D++E+ F +VC++CD+GGD+  C+GR
Sbjct: 181 KRKSFEDVPTTSKPGFIVDYMDEDGISETGEVGSDEEEDLFDSVCSMCDNGGDLLCCEGR 240

Query: 235 CLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQ 294
           C+RSFHAT  AG+ +LC +LG + AQ++A+ NF C+NC Y++HQCF+CG LGSSDKSS  
Sbjct: 241 CMRSFHATKEAGEESLCATLGMSVAQVEAMQNFYCKNCKYKQHQCFSCGKLGSSDKSSGA 300

Query: 295 EVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDM 354
           EVF C +ATCG+FYHP+CV+KLLH ++E+ AEEL++ I AGE F CP+H+C VC+Q ED 
Sbjct: 301 EVFLCANATCGRFYHPQCVAKLLHREDEAAAEELQKNIYAGELFACPIHRCHVCKQGEDK 360

Query: 355 NVEDLQLAICRRCPKAYHRKCLPTEITFSDADENN-FQRAWVDLLPNNRILIYCLEHKII 413
              +LQ AICRRCPK+YHRKCLP +I+F D DE    QRAW  LLP NRILIYCL+H+I 
Sbjct: 361 KDLELQFAICRRCPKSYHRKCLPRKISFEDLDEEGIIQRAWDGLLP-NRILIYCLKHEID 419

Query: 414 SELKTPARDHLKFPGVEGKRKKEDLELLLTE---EKDVASKRNIVSESFVADKTVVKKLK 470
             L TP RDH+KFP  E K +K   EL  +    +K V+ KR++VSE      +  K  K
Sbjct: 420 ELLGTPIRDHIKFPNDEEKMEKRRSELFSSRKDLDKVVSKKRSLVSEDSPHGDSTKKSEK 479

Query: 471 LAEVYSGADVGMSNSEIKKRWPRQDVHSLKKPNITDTGRKSLKDIADKSKPSLRKDSTLL 530
            +   SG D                    K+  +T   +KSL D                
Sbjct: 480 RS---SGPDPS------------------KRLKVTGFSKKSLDD---------------- 502

Query: 531 KSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKELLALIKDSTSSFNE 590
                       +   S+++  +      KK + S PS+D + E  +LA+IK+S S    
Sbjct: 503 -----------NDTPNSELEQKVV----TKKTSSSLPSLDRDSENRILAIIKESKSLITL 547

Query: 591 EEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDGGCDIEDAKAVCPPE 650
           E+ MK H VP THA+ SK+ ++++IT G VEGS++A+RAAL+ L+GG  IEDAKAVC PE
Sbjct: 548 EDVMKKHKVPSTHAYSSKNTVDRTITQGKVEGSIEALRAALKKLEGGGSIEDAKAVCEPE 607

Query: 651 ILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWYVRSGDTIVDFCC 710
           +L QI +WK KL VYLAPFLHGMRYTSFGRHFTKV+KLKEIV++LH+YV++GDTIVDFCC
Sbjct: 608 VLNQIVKWKNKLKVYLAPFLHGMRYTSFGRHFTKVDKLKEIVEKLHYYVKNGDTIVDFCC 667

Query: 711 GANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLN 770
           GANDFSC+MK KLE+MGK CS++NYD+IQPKNDF+FEKRDWM+V+ +ELP GSQLIMGLN
Sbjct: 668 GANDFSCLMKQKLEEMGKKCSYKNYDVIQPKNDFNFEKRDWMSVKQKELPTGSQLIMGLN 727

Query: 771 PPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQK-ASYNLIWEDNEVLSGKSFYL 829
           PPFGVKASLAN FI++AL+FKPKL++LIVP ET RLD+K   Y+LIWED+  LSGKSFYL
Sbjct: 728 PPFGVKASLANMFINKALQFKPKLLILIVPPETERLDKKRPPYDLIWEDDNELSGKSFYL 787

Query: 830 PGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHIT 873
           PGS+DV+D Q+EQWN  PP LYLWSR DWT  H+ IA   GH++
Sbjct: 788 PGSVDVNDKQIEQWNVNPPLLYLWSRQDWTTKHRAIAQKCGHVS 831


>gi|255576262|ref|XP_002529024.1| protein binding protein, putative [Ricinus communis]
 gi|223531504|gb|EEF33335.1| protein binding protein, putative [Ricinus communis]
          Length = 1249

 Score =  883 bits (2282), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 459/895 (51%), Positives = 622/895 (69%), Gaps = 30/895 (3%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVG-SSEMQVFLHGNIGDG 59
           M  S+++ D+    VS+Y F+D  +  ISFS+LP QW   + VG  ++ Q+FLHG++ +G
Sbjct: 1   MASSDDEADSGPQSVSNYHFVDDEDAPISFSLLPFQWSESESVGEQNKHQIFLHGSVDNG 60

Query: 60  PWNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQ 119
               + +VIAWKF+L  A P I V++K  NWI L+ P+KSF+ I+RT LITVH L+  ++
Sbjct: 61  LRTIHMEVIAWKFDLLNAIPSISVITKDKNWIKLEKPRKSFEKIIRTELITVHCLHYARK 120

Query: 120 NPEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHP 179
            PEAS+K + + + KVF  Y+V  ++NDL+DHM +  EA +RD  LA SKFLL+F+   P
Sbjct: 121 YPEASKKSVWDHLSKVFSLYDVRFTQNDLVDHMALISEAVKRDDSLAKSKFLLAFLEEKP 180

Query: 180 -ETNTFHEDVRTTEKPKVIVDWNEDE---DSERVDKDENYFAVCAICDDGGDVTFCDGRC 235
            +    +ED++TT+    IVD  +D+   D E   ++E   +VC  CD+GG++  CDG C
Sbjct: 181 RKRRPSNEDIQTTDMSGFIVDDVDDDMFEDVEEDGEEEEEDSVCTFCDNGGELLCCDGSC 240

Query: 236 LRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQE 295
           +RSFHAT  AG+ ++C SLG+T+ +++A   F C+NC Y++HQCFACG LGSSDK S  E
Sbjct: 241 MRSFHATKEAGEESMCVSLGFTEREVEATERFYCKNCEYKQHQCFACGELGSSDKLSGAE 300

Query: 296 VFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAG-ESFTCPVHKCFVCQQSEDM 354
           VF C +ATCG FYHP C++KLLH ++E  A+EL+++IAAG ESFTCP+HKC VC+Q E+ 
Sbjct: 301 VFRCANATCGYFYHPSCIAKLLHQEDEVAAKELQKKIAAGKESFTCPIHKCCVCKQGENK 360

Query: 355 NVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRAWVDLLPNNRILIYCLEHKIIS 414
            + +LQ A+CRRCP +YHRKC+P+EI F         RAW DLLPN RILIYCL+H+II 
Sbjct: 361 KIRELQFAVCRRCPTSYHRKCMPSEIVFEKKKGEEEIRAWEDLLPN-RILIYCLKHEIID 419

Query: 415 ELKTPARDHLKFPGVEGKRKKEDLELLLTEEKDVASKRNIVSESFVADKTVVKKLKLAEV 474
            L TP RD ++FP +E K+K +  +L  + EKD+A KR + SE   +   V+KK+K  + 
Sbjct: 420 YLGTPIRD-IRFPDIEEKKKTQISDLPGSSEKDLAKKRRLTSEDLFSGDAVIKKVK--DS 476

Query: 475 YSGADVGMSNSEIKKRWPRQDVHSLKKPNITDTGRKSLKD-------------IADKSKP 521
            SGA    +  + +K  P      L++    D  RKSLK+              A+ +K 
Sbjct: 477 SSGARKVTNIKKSEKLSPGSTF--LRRVKERDASRKSLKEKMKSTSIELDRSATANLNKT 534

Query: 522 SLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKELLALI 581
           SL      +  RS  V  G +++  +++   ++ +   K ++   PS+DA+ E+ LLAL+
Sbjct: 535 SLGDKLFDIMKRSEQVHNGKKDVHTNEIDKPVTVKASTK-LSDELPSLDADTERRLLALM 593

Query: 582 KDSTSSFNEEEFMKSHIV--PITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDGGCD 639
           K+S+S  + E+  K+H V  P THA+  + + EK+IT G VEG+V+AVR AL+ L+ GC 
Sbjct: 594 KESSSLISMEDVRKTHQVHIPSTHAYSLRTVCEKAITAGKVEGAVEAVRTALKKLEDGCS 653

Query: 640 IEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWYV 699
            EDAKAVC P  L Q+F+WK KL VYLAPFL+GMRYTSFGRHFTKVEKL+EI + LHWYV
Sbjct: 654 TEDAKAVCGPANLSQVFKWKSKLRVYLAPFLNGMRYTSFGRHFTKVEKLEEITNLLHWYV 713

Query: 700 RSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEEL 759
             GDTIVDFCCGANDFSC+MK KLEQ  K+CS++NYD+IQPKNDF+FEKRDWMTVRPEEL
Sbjct: 714 EDGDTIVDFCCGANDFSCLMKKKLEQTRKTCSYKNYDVIQPKNDFNFEKRDWMTVRPEEL 773

Query: 760 PDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKAS-YNLIWED 818
           P    LIMGLNPPFGVKA+LANKFI++AL+FKPKL++LIVP ET RLD+K S YNL+WED
Sbjct: 774 PK-EGLIMGLNPPFGVKAALANKFINKALEFKPKLLILIVPPETERLDKKDSPYNLVWED 832

Query: 819 NEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHIT 873
           +  +SGKSFYLPGS+D +D +++QWN   PPLYLWSR DW   H  IA  +GH++
Sbjct: 833 DRFVSGKSFYLPGSIDENDKRMDQWNLTTPPLYLWSRPDWHEKHLAIAQKQGHLS 887


>gi|356569700|ref|XP_003553034.1| PREDICTED: uncharacterized protein LOC100803073 [Glycine max]
          Length = 1225

 Score =  855 bits (2208), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 441/897 (49%), Positives = 591/897 (65%), Gaps = 32/897 (3%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGP 60
           M  S+++ +   L VS+Y F D  +  + FSVLP+QW         +MQVFLHG + +G 
Sbjct: 1   MASSDDEGEAQPLSVSNYHFEDNKDAPVCFSVLPIQWSESQSPVGKKMQVFLHGFVDNGL 60

Query: 61  WNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITV-HWLYCVKQ 119
             F  QV+AW+F+LSY +PEI VLSK   WI L+ P+KS+++ +   ++   H+L  VK+
Sbjct: 61  QKFFVQVVAWRFDLSYVRPEILVLSKDGRWIKLEKPRKSYEDTIIRTILITIHFLSYVKK 120

Query: 120 NPEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHP 179
           NP++S K + + + K   SYEV PS+NDLL+HM +  EAA+RD  LA SK LL  + +  
Sbjct: 121 NPDSSAKSVWDNLSKNKESYEVMPSQNDLLNHMTLMGEAAKRDTALAKSKLLLMVLEDKD 180

Query: 180 E---TNTFHEDVRTTEKPKVIV---------DWNEDEDSERVDKDENYFAVCAICDDGGD 227
           +        ++V+   +P  I+         ++ ED D E    DE + +VCAICD+GG 
Sbjct: 181 KLKIKKPSDKEVKDLARPGFIIDDIDNDMIDEFGEDSDGE----DELFDSVCAICDNGGQ 236

Query: 228 VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGS 287
           +  CDG+C+RSFHA    G+ + C SLG+++ ++D + NF C+NC Y +HQCFACG LG 
Sbjct: 237 LLCCDGKCMRSFHANEEDGEESTCASLGFSRKEVDEIQNFYCKNCEYNQHQCFACGTLGC 296

Query: 288 SDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFV 347
           SDK S  EVF C SATCG FYHP CV+KLLH   E   +EL E+IA G  FTCP H C  
Sbjct: 297 SDKFSGAEVFKCASATCGFFYHPHCVAKLLHGIVEDAPKELEEKIAEGGPFTCPTHYCCE 356

Query: 348 CQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSD-ADENNFQRAWVDLLPNNRILIY 406
           C++ ED    D Q A+CRRCP++YHRKCLP EI F D  DE+   RAW DLLPNNRILIY
Sbjct: 357 CKEMEDKKKHDFQFAVCRRCPRSYHRKCLPREIAFDDIEDEDIITRAWEDLLPNNRILIY 416

Query: 407 CLEHKIISELKTPARDHLKFPGVEGKRKKEDLE---LLLTEEKDVASKRNIVSESFVADK 463
           CLEH+I  EL TP RDH+KFP V+   ++ D E      T+E+ + +K NI S++    K
Sbjct: 417 CLEHEIDDELGTPIRDHIKFPNVKATVREIDAEENAKSATKERVILNKNNIDSKNLFGKK 476

Query: 464 TVVKKLKLAEVYSGADVG------MSNSEIKKRWPRQDVHSLKKPNITDTGRKSLKDIAD 517
              K  KL    S   VG      +S S I ++   +      +   +   +++ K    
Sbjct: 477 ATAKVSKLPGKMSSGKVGDKKSEKISRSNISRKKINEASRCFNENKRSTISKETKKSDGA 536

Query: 518 KSKPSLRKDSTLLKSRSFV-VKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKE 576
           +++PSL      LK  S   +  G+   D +K    +   +  KK++ + P++DA+ ++ 
Sbjct: 537 ENRPSLGAKLFALKQNSSEHINSGNEADDVAKNTLVV---KPTKKLSSTLPALDADSKRR 593

Query: 577 LLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDG 636
           LLAL K++TSS   E  +K H    TH H  K ++EK+ITLG +EGSV+AVR AL ML+ 
Sbjct: 594 LLALFKEATSSVTLENVIKEHKFAATHTHSLKSVVEKTITLGKLEGSVEAVRTALRMLED 653

Query: 637 GCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLH 696
           G +I DA+AVC P++L QIF+WK KL VYLAP L+G RYTSFGRHFT++EKL+ IVD+LH
Sbjct: 654 GHNIRDAEAVCGPDVLNQIFKWKDKLKVYLAPVLYGNRYTSFGRHFTQIEKLEGIVDKLH 713

Query: 697 WYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRP 756
           WYV++GDTIVDFCCGANDFS +M  KLE+ GK CS++N+DL+  KNDF+FE RDWMT++ 
Sbjct: 714 WYVQNGDTIVDFCCGANDFSILMNKKLEETGKRCSYKNFDLLPTKNDFNFEMRDWMTIQT 773

Query: 757 EELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKAS-YNLI 815
           +ELP GSQLIMGLNPPFG+KA+LANKFI +AL+F+PKL++LIVP ET RLD+K S Y+L+
Sbjct: 774 KELPTGSQLIMGLNPPFGLKAALANKFIDKALEFRPKLLILIVPPETERLDEKRSPYDLV 833

Query: 816 WEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHI 872
           WED   L GKSFYLPGS+D +D Q++QWN KPPPLYLWSR DWT  HK IA   GH 
Sbjct: 834 WEDKRFLLGKSFYLPGSVDANDRQIDQWNVKPPPLYLWSRPDWTDKHKAIARKHGHF 890


>gi|357459829|ref|XP_003600195.1| hypothetical protein MTR_3g055370 [Medicago truncatula]
 gi|355489243|gb|AES70446.1| hypothetical protein MTR_3g055370 [Medicago truncatula]
          Length = 1289

 Score =  810 bits (2092), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 434/940 (46%), Positives = 596/940 (63%), Gaps = 75/940 (7%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGP 60
           M  S+++ DT  L VS+Y F+D  +  ++FS+LP+QW   + V   + +VFLHGN  +G 
Sbjct: 1   MASSDDEADTQPLSVSNYHFVDDKDAPVTFSILPIQWSESESVEGKKEKVFLHGNADNGL 60

Query: 61  WNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQN 120
                QV+AW+F+LS  +PEI VL+K   WI LQ P+KS++ IVR++LITV++++ V++N
Sbjct: 61  QKIFMQVVAWRFDLSNVKPEISVLAKDKRWIKLQKPRKSYEEIVRSVLITVYFMHYVRKN 120

Query: 121 PEASRKYIREQMLKV--FCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANH 178
           PEA  K + + + K   F  YEV+PS NDLL+HM +  EAA RD  LA SK LL  M + 
Sbjct: 121 PEALGKSVWDNLSKNKDFSHYEVKPSHNDLLNHMGLMGEAATRDAVLAKSKLLLMVMEDK 180

Query: 179 PETNT---FHEDVRTTEKPKVIVDWNEDEDS--ERVDKDEN------YFAVCAICDDGGD 227
              +      E+V+   +P  I+  + D D+  E V ++E+      + +VC+ CD+GG+
Sbjct: 181 DRMSIKKLSDEEVKELARPGFIIADDIDNDAIDETVAEEESDEEDELFDSVCSFCDNGGE 240

Query: 228 VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGS 287
           +  C+G+C+RSFHA    G+ + C SLG+++ +++ + NF C+NC + +HQCFACG LG 
Sbjct: 241 LLCCEGKCMRSFHANEEDGEESSCASLGFSRKEVEEIQNFYCKNCEHNKHQCFACGELGC 300

Query: 288 SDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFV 347
           SDK +  EVF C SATCG FYHP+CV+KLLH        EL   IA GE FTCP H C +
Sbjct: 301 SDKFAGAEVFKCASATCGFFYHPQCVAKLLHLVISDAPTELVTNIAKGEPFTCPAHYCRI 360

Query: 348 CQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSD-ADENNFQRAWVDLLPNNRILIY 406
           C++ E+ N  +L  A+CRRCPK+YHRKCLP ++ F D  +E    RAW DLLPNNRILIY
Sbjct: 361 CKEMENKNEHELHFAVCRRCPKSYHRKCLPRKVAFEDIVEEGIVARAWEDLLPNNRILIY 420

Query: 407 CLEHKIISELKTPARDHLKFPGVEGKRKKEDLELLLTEEKDVASKRNIVSESFVADKTVV 466
           CL+H+I  EL TP RDH+KFP V+ K K           K+V +  N   +     +T  
Sbjct: 421 CLKHEIDDELGTPIRDHIKFPYVKQKAK--------PATKEVINNNNAKLDDLHVKRTSA 472

Query: 467 KKLKLAEVYSGADVGMSNSE--IKKRWPRQDVHSLKKPNITDTGRKSLKDIADKSKPSLR 524
              KL+   S   VG+ N    +    PR+  +   +  + +  R +LK+ A+KS     
Sbjct: 473 TLPKLSGKMSFGKVGIENPGKILGSNIPRKKANEASRRLLNENKRPTLKE-AEKSDHEEN 531

Query: 525 KDSTLLKSRSFVVKPGHRNIDGSKMKNSISDR----RRMKKVNISQPSVDAEMEKELLAL 580
           + S  L+  S   K G + I+     N+++D     +R KK++ + P +DA+ E+ LLAL
Sbjct: 532 QPSLGLQLYSHYQK-GSKQINSGNHVNNVADNTLSVKRPKKLSSAPPQLDADSERRLLAL 590

Query: 581 IKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDGGCDI 640
           +K+++SS   E  +K H    TH H  K+++EK+IT+G +EGSV+AVR AL MLD G  I
Sbjct: 591 VKEASSSITLESVIKEHKFVSTHTHSLKNVVEKTITMGKLEGSVEAVRTALRMLDEGHSI 650

Query: 641 EDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWYVR 700
            DA+AVC P+++ ++F+WK KL VYLAP L+G RYTSFGRHFT+VEKL+ IVD+LHWYV+
Sbjct: 651 RDAEAVCGPDVMNRLFKWKDKLKVYLAPVLYGNRYTSFGRHFTQVEKLEGIVDKLHWYVQ 710

Query: 701 SGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPK------------------- 741
           + D IVDFCCGANDFS +MK KLE+ GKSC ++N+DL+  K                   
Sbjct: 711 NNDMIVDFCCGANDFSRLMKKKLEETGKSCLYKNFDLLPTKAALHRLKLKSLGSVFSPTS 770

Query: 742 -------------------------NDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVK 776
                                    NDF+FE RDW+TV+ +ELP GSQLIMGLNPPFG+K
Sbjct: 771 QSPPPQKQNPTRWVFPPDVVLICLQNDFNFEMRDWLTVQRKELPLGSQLIMGLNPPFGLK 830

Query: 777 ASLANKFISQALKFKPKLIVLIVPQETRRLDQKAS-YNLIWEDNEVLSGKSFYLPGSLDV 835
           A+LANKFI +AL+F+PKL++LIVP ET RLD+K S Y L+WED   LSGKSFYLPGS+D 
Sbjct: 831 AALANKFIDKALEFEPKLLILIVPPETERLDRKRSRYVLVWEDERFLSGKSFYLPGSVDS 890

Query: 836 HDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHITVE 875
           +D Q+EQWN KPPPLYLWS  DW   HK IA   GH+  E
Sbjct: 891 NDKQMEQWNVKPPPLYLWSHPDWADKHKLIAQEHGHLFRE 930


>gi|297739138|emb|CBI28789.3| unnamed protein product [Vitis vinifera]
          Length = 1025

 Score =  804 bits (2077), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 401/680 (58%), Positives = 505/680 (74%), Gaps = 27/680 (3%)

Query: 217 AVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQE 276
           +VCAICD+GG++  C+GRCLRSFHAT+ AG  + C+SLG++ AQ++A+ NFLC+NC YQ+
Sbjct: 29  SVCAICDNGGELLCCEGRCLRSFHATVDAGTESFCESLGFSDAQVEAIQNFLCKNCQYQQ 88

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           HQCF CGMLGSS++SS  EVF C SATCG+FYHP CV+K LHP N  LA+ L+ +IA G 
Sbjct: 89  HQCFVCGMLGSSNESSGAEVFRCASATCGRFYHPYCVAKRLHPMNNILAKHLQNKIAGGV 148

Query: 337 SFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFS-DADENNFQRAWV 395
           SFTCP+HKCFVC++ E+  V+DLQ A+CRRCPKAYHRKCLP  I+F    +EN  QRAW+
Sbjct: 149 SFTCPLHKCFVCKRGENKGVDDLQFALCRRCPKAYHRKCLPGNISFECIYNENIMQRAWI 208

Query: 396 DLLPNNRILIYCLEHKIISELKTPARDHLKFPGVEGKRKKEDLELLLTEEKDVASKRNIV 455
            LLP NRILIYC+EHKI  +L+TP R+H++FP  E K KK   EL  + EK ++ KRNIV
Sbjct: 209 GLLP-NRILIYCMEHKINRKLRTPERNHIRFPDPESKGKKHVSELPSSNEKVMSKKRNIV 267

Query: 456 SESFVADKTVVKKLKLAEVYSGADVGMSNSEIKKRWPRQDVHSLKKPNITDTGRKSLKDI 515
           SE F A+ T VK  KL EV+       S    +KR   Q      K  I D  +K L+D 
Sbjct: 268 SEIFPAESTAVKMTKL-EVHRVVKDVDSTKFFEKRCSSQGFDPPTKQKINDATKKFLRDN 326

Query: 516 ADKSKP-----SLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVD 570
             KS P     S+    T    R++ +KP  +NI  SK++   S +  MK+ + SQP +D
Sbjct: 327 V-KSVPVKICASVAVKGTQSSLRNYNIKPKQQNIP-SKVEKITSLKPSMKRASSSQPLMD 384

Query: 571 AEMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAA 630
           AE+E  ++ L+K +TSSF+ EEF +   V  +   +SK++L+ +IT G VE SVKA+R A
Sbjct: 385 AELETRIVDLMKSTTSSFSLEEFREKQKVLCS---YSKNVLDSTITQGKVEVSVKAIRTA 441

Query: 631 LEMLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKE 690
           LE L+ GC IEDAKAVC PE+L QI +WKRKL VYLAPFLHGMRYTSFGRHFTKVEKL+E
Sbjct: 442 LEKLEKGCSIEDAKAVCEPEVLNQIMRWKRKLKVYLAPFLHGMRYTSFGRHFTKVEKLRE 501

Query: 691 IVDRLHWYVRSGDT-------------IVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDL 737
           +VDRLHWYV+ GD              IVDFCCG+NDFSC+MK KL+++GKSCSF+NYDL
Sbjct: 502 VVDRLHWYVQHGDMSFYNSLFPECEIRIVDFCCGSNDFSCLMKEKLDKVGKSCSFKNYDL 561

Query: 738 IQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVL 797
           IQPKNDFSFEKRDWM++  +ELP GSQLIMGLNPPFGVKASLANKFI +AL F+PKL++L
Sbjct: 562 IQPKNDFSFEKRDWMSIHLDELPAGSQLIMGLNPPFGVKASLANKFIDKALSFRPKLLIL 621

Query: 798 IVPQETRRLDQK-ASYNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRA 856
           IVP+ET+RLD+K ++Y+LIWED ++LSGKSFYLPGS+D+HD QLEQWN  PP LYLWSR 
Sbjct: 622 IVPKETKRLDEKDSAYDLIWEDEDILSGKSFYLPGSVDMHDKQLEQWNLLPPLLYLWSRP 681

Query: 857 DWTASHKKIALGRGHITVEE 876
           DWT+ HK +A   GHI++E+
Sbjct: 682 DWTSRHKAVAQKCGHISIEQ 701


>gi|356499417|ref|XP_003518537.1| PREDICTED: uncharacterized protein LOC100806429 [Glycine max]
          Length = 1065

 Score =  790 bits (2039), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 419/897 (46%), Positives = 578/897 (64%), Gaps = 42/897 (4%)

Query: 4   SEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQW-DGCDIVGSSEMQVFLHGNIGDGPWN 62
           +E + +  +L VS+Y  +D  +  +SF+VLP+ W D  +   +++ +VF+ G+  +G   
Sbjct: 12  AETEAEVQSLSVSNYHIVDDEDNPVSFAVLPIHWSDSENSEAANKGKVFIDGDTDNGLKK 71

Query: 63  FNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQNPE 122
              QV AW+F+LS  + EI +LSK   WI LQ P+K F+N +RT+LIT+H+L+ VK+  +
Sbjct: 72  IFMQVTAWRFDLSNVRLEISLLSKDGRWIKLQKPRKGFQNKIRTVLITLHFLHRVKKKRQ 131

Query: 123 ASRKYIREQMLK--VFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPE 180
            S   + + + K     S  V+PS+ DL DH+P+ REAA+RD  LA SK LL  +     
Sbjct: 132 MSEISVWQDLSKDTELSSSGVKPSQKDLSDHVPLIREAAKRDSVLAKSKLLLMVLEKLNS 191

Query: 181 TNTFHEDVRTTEKPKV--------IVDWNEDEDSERVDKDENYFAVCAICDDGGDVTFCD 232
                ++V    +P +        ++D + +E  ER D D     VCA+CD+GG+VT CD
Sbjct: 192 QKLLDKEVNDLAQPGLTVVGIDSDMIDESNEESEERDDLD-----VCALCDNGGNVTCCD 246

Query: 233 GRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSS 292
           G C+RSFHAT+ AG+   C SLG+TQ ++D + +F C+NC Y +HQCFACG LGSSDK  
Sbjct: 247 GVCMRSFHATVEAGRENSCVSLGFTQKEVDEIQSFYCKNCEYYQHQCFACGKLGSSDKVK 306

Query: 293 SQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSE 352
             EV  CVSATC +FYHP CV+KLL    + +AE+L   IA    F CP+H C VC++ E
Sbjct: 307 GAEVIKCVSATCDRFYHPHCVAKLLPQLAKPVAEDLERNIADRVPFICPLHYCCVCKELE 366

Query: 353 DMNVEDLQLAICRRCPKAYHRKCLPTEITFSD-ADENNFQRAWVDLLPNNRILIYCLEHK 411
           +    +LQ A+CRRCPK+YHRKCLP EI  S+  ++N  QRAW  LLPNNRILIYCL HK
Sbjct: 367 NKVDPELQFAVCRRCPKSYHRKCLPREIAPSNRGNKNIIQRAWEGLLPNNRILIYCLNHK 426

Query: 412 IISELKTPARDHLKFPGVEGKRKKEDLELLLTEEKDVASKRNIVSESFVAD------KTV 465
           I  EL TP RDH+KFP +E   +K +  +   E+K+ A+K  ++ +    D      K++
Sbjct: 427 IDRELGTPVRDHIKFPNMEPTVQKINTTI---EQKEPATKERVILKKKNVDLDNSSGKSI 483

Query: 466 VKKLKLAEVYSGADVG-------MSNSEIKKRWPRQDVHSLKKPNITDTGRKSLKDIADK 518
            K  KL    S   VG       +S S I ++   ++       N     +KS    +++
Sbjct: 484 AKGSKLTGKLSSHKVGSKKTKKIISGSNISRKPKSKETSRCLTENKRSISKKSEMSDSEQ 543

Query: 519 S--KPSLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKE 576
           +  +P++ +   L K     +K  H N   + + N +S  + ++ +++  P +DA+ EK 
Sbjct: 544 NYNQPTIGEIYALQKEGLKRIK--HDNKVNNVITNPLS-VKPIESLSVELPPLDADSEKS 600

Query: 577 LLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDG 636
           LL L K++ SS   E  ++ H    TH H  ++++EK+IT+G +E SV AV+ AL  L+ 
Sbjct: 601 LLTLFKEARSSITLESVLEKHTFASTHTHPLRNVVEKTITMGKLEYSVNAVQTALRKLES 660

Query: 637 GCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLH 696
           GC I+D KA C P+ L Q+F+WK +L +YLAP L+G RYTS+GRHFT VEKL+ IVD+LH
Sbjct: 661 GCSIQDVKAFCDPDDLKQLFKWKDELKIYLAPVLYGNRYTSYGRHFTLVEKLEGIVDKLH 720

Query: 697 WYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRP 756
           WYV++ DTIVDFCCGANDFS +MK KLE+ GK CS+RNYDL+  KNDFSFE+RDWMTV+P
Sbjct: 721 WYVQNSDTIVDFCCGANDFSILMKKKLEENGKKCSYRNYDLLPTKNDFSFERRDWMTVQP 780

Query: 757 EELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKAS-YNLI 815
            ELP GSQLIMGLNPPFG KA+LANKF+ +AL+FKPKL++LIVP ET RLD+K S Y+LI
Sbjct: 781 TELPTGSQLIMGLNPPFGHKAALANKFVDKALEFKPKLVILIVPPETERLDKKQSPYDLI 840

Query: 816 WEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHI 872
           WED   LSG SFYLPGS+   D Q++Q N +PP L LWSR DWT  HK IA   GH+
Sbjct: 841 WEDENFLSGTSFYLPGSV---DRQMDQRNARPPLLSLWSRPDWTTKHKVIAQENGHV 894


>gi|145359280|ref|NP_200350.2| enhanced downy mildew 2 [Arabidopsis thaliana]
 gi|334188424|ref|NP_001190545.1| enhanced downy mildew 2 [Arabidopsis thaliana]
 gi|332009240|gb|AED96623.1| enhanced downy mildew 2 [Arabidopsis thaliana]
 gi|332009241|gb|AED96624.1| enhanced downy mildew 2 [Arabidopsis thaliana]
          Length = 1297

 Score =  773 bits (1997), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 418/907 (46%), Positives = 577/907 (63%), Gaps = 48/907 (5%)

Query: 5   EEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGPWNFN 64
           EE+D ++    S+Y F D ++E +SF+ LP+QW   + V  S +  +L G   +G    +
Sbjct: 8   EEEDFSVPQSASNYYFEDDDKEPVSFARLPIQWSVEEKVDGSGLGFYLRGRSDNGLLPLH 67

Query: 65  EQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQNPEAS 124
           + V AW+++LS  QPEI VL+K N WI L+ P+KS+  ++RT+L+T+H +  +++NP+AS
Sbjct: 68  KLVKAWRYDLSNFQPEISVLTKDNIWIKLEEPRKSYGELIRTVLVTLHSIQFLRRNPQAS 127

Query: 125 RKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPETNTF 184
            K + E++ +   SY+V+PS+NDL+DH+ +  EAA+RD++LANSKF+L+F+   P     
Sbjct: 128 EKALWEKLTRSLRSYDVKPSQNDLVDHIGLIAEAAKRDRNLANSKFILAFLTKKPTKRRL 187

Query: 185 HEDVRTTEKPKVIVDWNEDEDS--------ERVDKDENYFAVCAICDDGGDVTFCDGRCL 236
            ++     K   IV    DED+           + D+ + +VCAICD+GG++  C+G CL
Sbjct: 188 PDE--DNAKDDFIVG---DEDTYVASDEDELDDEDDDFFESVCAICDNGGEILCCEGSCL 242

Query: 237 RSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSS-SQE 295
           RSFHAT   G+++LC SLG+ + Q++A+  + C NC ++ HQCF C  LGSSD SS + E
Sbjct: 243 RSFHATKKDGEDSLCDSLGFNKMQVEAIQKYFCPNCEHKIHQCFICKNLGSSDNSSGAAE 302

Query: 296 VFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMN 355
           VF CVSATCG FYHP CV++ L   N+  +E L  +I AGE +TCP+HKC VC+  E   
Sbjct: 303 VFQCVSATCGYFYHPHCVTRRLRLGNKEESEALERQIIAGE-YTCPLHKCSVCENGEVKT 361

Query: 356 VEDLQLAICRRCPKAYHRKCLPTEITFSD-ADENNFQRAWVDLLPNNRILIYCLEHKIIS 414
             +LQ A+CRRCPK+YHRKCLP EI+F D  DE+   RAW  LL +NR+LIYC EH+I  
Sbjct: 362 DSNLQFAVCRRCPKSYHRKCLPREISFEDIEDEDILTRAWDGLL-HNRVLIYCQEHEIDE 420

Query: 415 ELKTPARDHLKFPGVEGK----------------RKKEDLELLLTEEKDVASK--RNIVS 456
           EL TP RDH+KFP  E +                R K  L++     +D   K  +N   
Sbjct: 421 ELLTPVRDHVKFPFTEEQKVFVKEQRRILESHVGRDKARLKVKDPALQDTCGKASKNSFR 480

Query: 457 ESFVADKTVVKKLKLAEVYSGADVGMSNSEIKKRWPRQDVHSLKKPNITDT--------G 508
            SF + K      K   V S  D      +I      + V    +  + D+        G
Sbjct: 481 SSFPSSKDGFSTKKHGLVSSVPDHSRKRKDIDPSIKHKMVPQKSQKMMEDSREAGKNKLG 540

Query: 509 RKSLKDIADKSKPSLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPS 568
            K  +D A KSK SL +           VKPG   +     K++ +D    K+     P+
Sbjct: 541 VKEARD-AGKSKISLGERLFSYTQEPNPVKPGR--VIPVDSKHNKTDSIASKEPGSEIPT 597

Query: 569 VDAEMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHS-KHLLEKSITLGLVEGSVKAV 627
           +D + ++ LLA++K +T        +K   +  T + HS +++++K+IT+G VEGSV+A+
Sbjct: 598 LDNDSQRRLLAVMKKATEEITMGTILKKFKIQSTMSTHSTRNVVDKTITMGKVEGSVQAI 657

Query: 628 RAALEMLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEK 687
           R AL+ L+ G +IEDAKAVC PE+L QI +WK KL VYLAPFLHG RYTSFGRHFT  EK
Sbjct: 658 RTALKKLEEGGNIEDAKAVCEPEVLSQILKWKDKLKVYLAPFLHGARYTSFGRHFTNPEK 717

Query: 688 LKEIVDRLHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFE 747
           L++IVDRLHWY   GD IVDFCCG+NDFSC+M  KLE+ GK C ++NYDL   KN+F+FE
Sbjct: 718 LQQIVDRLHWYADDGDMIVDFCCGSNDFSCLMNAKLEETGKKCLYKNYDLFPAKNNFNFE 777

Query: 748 KRDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLD 807
           ++DWMTV  +EL  GS+LIMGLNPPFGV ASLANKFI++AL+F+PK+++LIVP ET RLD
Sbjct: 778 RKDWMTVSKDELEPGSKLIMGLNPPFGVNASLANKFITKALEFRPKILILIVPPETERLD 837

Query: 808 -QKASYNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIA 866
            +K+SY LIWED   LSG SFYLPGS++  D QLE WN  PPPL LWSR+D+ A HKKIA
Sbjct: 838 KKKSSYVLIWEDKTFLSGNSFYLPGSVNEEDKQLEDWNLVPPPLSLWSRSDFAAKHKKIA 897

Query: 867 LGRGHIT 873
               H++
Sbjct: 898 EKHCHLS 904


>gi|297796419|ref|XP_002866094.1| hypothetical protein ARALYDRAFT_495622 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311929|gb|EFH42353.1| hypothetical protein ARALYDRAFT_495622 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 1294

 Score =  773 bits (1997), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 416/907 (45%), Positives = 582/907 (64%), Gaps = 48/907 (5%)

Query: 5   EEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGPWNFN 64
           EE+D ++    S+Y F D ++E +SF+ LP+QW G D V  S +  +L G   +G    +
Sbjct: 8   EEEDFSVPQSASNYYFEDDDKEPVSFARLPIQWSGKDKVDGSSLGFYLRGRSDNGLLPLH 67

Query: 65  EQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQNPEAS 124
           + V AW+F+LS  +PE+ VL+K N WI L+ P+KS+  ++RT+ +T+H +  +++NP+AS
Sbjct: 68  KLVKAWRFDLSNFRPEVSVLTKDNIWIKLEEPRKSYGELIRTVFVTLHSIQFLRRNPQAS 127

Query: 125 RKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPETNTF 184
            K + E++ K   SY+V+PS+NDL+DH+ +  EAAERD +L  SKF+L+F+   P     
Sbjct: 128 EKSLWEKLTKSLRSYDVKPSQNDLVDHIDLISEAAERDVNLGKSKFILAFLTKKPTKRRL 187

Query: 185 HEDVRTTEKPKVIVDWNEDEDS--------ERVDKDENYFAVCAICDDGGDVTFCDGRCL 236
            ++     K   IV    DED+           + D+ + +VCAICD+GG++  C+G CL
Sbjct: 188 PDE--DNAKDDFIVG---DEDTYVASDEDELDDEDDDFFESVCAICDNGGELLCCEGSCL 242

Query: 237 RSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSS-SQE 295
           RSFHAT   G+++LC SLG  + Q++A+  + C NC ++ HQCF C  LGSSD S+ + E
Sbjct: 243 RSFHATKKDGEDSLCDSLGLNKMQVEAIQKYFCPNCEHKIHQCFICKNLGSSDNSTGAAE 302

Query: 296 VFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMN 355
           VF CVSATCG FYHP CV++ L   N+  +E L  +I AGE +TCP+HKC VC+  E   
Sbjct: 303 VFQCVSATCGYFYHPRCVTRRLRLGNKDDSEALERQIIAGE-YTCPLHKCSVCENGEVKT 361

Query: 356 VEDLQLAICRRCPKAYHRKCLPTEITFSD-ADENNFQRAWVDLLPNNRILIYCLEHKIIS 414
             +LQ A+CRRCPK+YHRKCLP EI+F D ADE+ F RAW  LL +NR+LIYC EH+I  
Sbjct: 362 DSNLQFAVCRRCPKSYHRKCLPREISFEDIADEDIFTRAWDGLL-HNRVLIYCQEHEIDE 420

Query: 415 ELKTPARDHLKFPGVEGKRK--KEDLELLLTEEKDVASKRNIVSESF------VADKTVV 466
           EL TP RDH+KFP  E K+   KE   +L +      ++  +   +        +DK+  
Sbjct: 421 ELLTPLRDHVKFPFTEEKKVFIKEQRRILESHVGRDKARPKVKDPALQDTCGKASDKSFR 480

Query: 467 KKLKLA-EVYSGADVGMSNSEIKKRWPRQDVH-SLKKPNITDTGRKSLKDIADKSKPSL- 523
                + + +S    G+ +S       R+D+  S+K   +     K +++  +  K  L 
Sbjct: 481 SSFPSSKDGFSTKKHGLGSSVPDHSRKRKDIDPSIKHKMVPQKSHKMMENSREAGKNRLG 540

Query: 524 -------RKDSTLLKSRSF-------VVKPGHR-NIDGSKMKNSISDRRRMKKVNISQPS 568
                   K    L  R F       +VKPG    +D    K++ +D    K+     P+
Sbjct: 541 VKETRESGKSKVSLGERLFNYIQEPNLVKPGRMIPVDS---KHNKTDSIASKEPGSEIPT 597

Query: 569 VDAEMEKELLALIKDSTSSFNEEEFMKSHIVPIT-HAHHSKHLLEKSITLGLVEGSVKAV 627
           +D + ++ LLA++K +T        +K   V  T + + ++++++K+IT+G VEGSV+AV
Sbjct: 598 LDNDSQRRLLAVMKKATEEITMGTMLKKFKVQSTMNTNSTRNVVDKTITMGKVEGSVQAV 657

Query: 628 RAALEMLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEK 687
           R AL+ L+ G +IEDAKAVC PE+L QI +WK KL VYLAPFLHG RYTSFGRHFT  EK
Sbjct: 658 RTALKKLEEGGNIEDAKAVCEPEVLSQILKWKDKLKVYLAPFLHGARYTSFGRHFTNPEK 717

Query: 688 LKEIVDRLHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFE 747
           L++IVDRLHWY   GD IVDFCCG+NDFSC+M  KLE+ GK C ++NYDL Q KN+F+FE
Sbjct: 718 LQQIVDRLHWYAEDGDMIVDFCCGSNDFSCLMNTKLEETGKKCLYKNYDLFQAKNNFNFE 777

Query: 748 KRDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLD 807
           ++DWMTV  +EL  GS+LIMGLNPPFGV ASLANKFI++AL+F+PK+++LIVP ET RLD
Sbjct: 778 RKDWMTVSKDELEPGSKLIMGLNPPFGVNASLANKFITKALEFRPKILILIVPPETERLD 837

Query: 808 -QKASYNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIA 866
            +K+SY LIWED   LSG SFYLPGS++  D QLE WN  PPPL LWSR+D+ A HKKIA
Sbjct: 838 KKKSSYVLIWEDKTFLSGNSFYLPGSVNEEDKQLEDWNIVPPPLSLWSRSDFAAKHKKIA 897

Query: 867 LGRGHIT 873
               H++
Sbjct: 898 EKHCHLS 904


>gi|9758171|dbj|BAB08556.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1332

 Score =  757 bits (1955), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 416/942 (44%), Positives = 575/942 (61%), Gaps = 83/942 (8%)

Query: 5   EEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGPWNFN 64
           EE+D ++    S+Y F D ++E +SF+ LP+QW   + V  S +  +L G   +G    +
Sbjct: 8   EEEDFSVPQSASNYYFEDDDKEPVSFARLPIQWSVEEKVDGSGLGFYLRGRSDNGLLPLH 67

Query: 65  EQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQNPEAS 124
           + V AW+++LS  QPEI VL+K N WI L+ P+KS+  ++RT+L+T+H +  +++NP+AS
Sbjct: 68  KLVKAWRYDLSNFQPEISVLTKDNIWIKLEEPRKSYGELIRTVLVTLHSIQFLRRNPQAS 127

Query: 125 RKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPETNTF 184
            K + E++ +   SY+V+PS+NDL+DH+ +  EAA+RD++LANSKF+L+F+   P     
Sbjct: 128 EKALWEKLTRSLRSYDVKPSQNDLVDHIGLIAEAAKRDRNLANSKFILAFLTKKPTKRRL 187

Query: 185 HEDVRTTEKPKVIVDWNEDEDS--------ERVDKDENYFAVCAICDDGGDVTFCDGRCL 236
            ++     K   IV    DED+           + D+ + +VCAICD+GG++  C+G CL
Sbjct: 188 PDE--DNAKDDFIVG---DEDTYVASDEDELDDEDDDFFESVCAICDNGGEILCCEGSCL 242

Query: 237 RSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSS-SQE 295
           RSFHAT   G+++LC SLG+ + Q++A+  + C NC ++ HQCF C  LGSSD SS + E
Sbjct: 243 RSFHATKKDGEDSLCDSLGFNKMQVEAIQKYFCPNCEHKIHQCFICKNLGSSDNSSGAAE 302

Query: 296 VFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMN 355
           VF CVSATCG FYHP CV++ L   N+  +E L  +I AGE +TCP+HKC VC+  E   
Sbjct: 303 VFQCVSATCGYFYHPHCVTRRLRLGNKEESEALERQIIAGE-YTCPLHKCSVCENGEVKT 361

Query: 356 VEDLQLAICRRCPKAYHRKCLPTEITFSD-ADENNFQRAWVDLLPNNRILIYCLEHKIIS 414
             +LQ A+CRRCPK+YHRKCLP EI+F D  DE+   RAW  LL +NR+LIYC EH+I  
Sbjct: 362 DSNLQFAVCRRCPKSYHRKCLPREISFEDIEDEDILTRAWDGLL-HNRVLIYCQEHEIDE 420

Query: 415 ELKTPARDHLKFPGVEGK----------------RKKEDLELLLTEEKDVASK--RNIVS 456
           EL TP RDH+KFP  E +                R K  L++     +D   K  +N   
Sbjct: 421 ELLTPVRDHVKFPFTEEQKVFVKEQRRILESHVGRDKARLKVKDPALQDTCGKASKNSFR 480

Query: 457 ESFVADKTVVKKLKLAEVYSGADVGMSNSEIKKRWPRQDVHSLKKPNITDT--------G 508
            SF + K      K   V S  D      +I      + V    +  + D+        G
Sbjct: 481 SSFPSSKDGFSTKKHGLVSSVPDHSRKRKDIDPSIKHKMVPQKSQKMMEDSREAGKNKLG 540

Query: 509 RKSLKDIADKSKPSLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPS 568
            K  +D A KSK SL +           VKPG   +     K++ +D    K+     P+
Sbjct: 541 VKEARD-AGKSKISLGERLFSYTQEPNPVKPGR--VIPVDSKHNKTDSIASKEPGSEIPT 597

Query: 569 VDAEMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHS-KHLLEKSITLGLVEGSVKAV 627
           +D + ++ LLA++K +T        +K   +  T + HS +++++K+IT+G VEGSV+A+
Sbjct: 598 LDNDSQRRLLAVMKKATEEITMGTILKKFKIQSTMSTHSTRNVVDKTITMGKVEGSVQAI 657

Query: 628 RAALEMLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEK 687
           R AL+ L+ G +IEDAKAVC PE+L QI +WK KL VYLAPFLHG RYTSFGRHFT  EK
Sbjct: 658 RTALKKLEEGGNIEDAKAVCEPEVLSQILKWKDKLKVYLAPFLHGARYTSFGRHFTNPEK 717

Query: 688 LKEIVDRLHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFE 747
           L++IVDRLHWY   GD IVDFCCG+NDFSC+M  KLE+ GK C ++NYDL   KN+F+FE
Sbjct: 718 LQQIVDRLHWYADDGDMIVDFCCGSNDFSCLMNAKLEETGKKCLYKNYDLFPAKNNFNFE 777

Query: 748 KRDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLD 807
           ++DWMTV  +EL  GS+LIMGLNPPFGV ASLANKFI++AL+F+PK+++LIVP ET R  
Sbjct: 778 RKDWMTVSKDELEPGSKLIMGLNPPFGVNASLANKFITKALEFRPKILILIVPPETERFQ 837

Query: 808 ------------------------------------QKASYNLIWEDNEVLSGKSFYLPG 831
                                               +K+SY LIWED   LSG SFYLPG
Sbjct: 838 FPSISSAPLYHSITLIYRLLSLSLVKSITFLNRLDKKKSSYVLIWEDKTFLSGNSFYLPG 897

Query: 832 SLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHIT 873
           S++  D QLE WN  PPPL LWSR+D+ A HKKIA    H++
Sbjct: 898 SVNEEDKQLEDWNLVPPPLSLWSRSDFAAKHKKIAEKHCHLS 939


>gi|218200976|gb|EEC83403.1| hypothetical protein OsI_28850 [Oryza sativa Indica Group]
          Length = 1283

 Score =  748 bits (1930), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 395/876 (45%), Positives = 562/876 (64%), Gaps = 29/876 (3%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGP 60
           M  S++D +     V +Y F+D N+  +SF VLP Q+D  + V S +  V+L G    G 
Sbjct: 1   MMSSDDDLEPQLKAVENYYFVDDNDVPVSFDVLPFQFDAAEGVASFKKDVYLRGFTDGGL 60

Query: 61  WNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQN 120
               +QV+AWK  L    PEI VLS + +WI L  P+ S++  +R++LITV  L+ V++ 
Sbjct: 61  QKVYKQVVAWKLVLDGDSPEIAVLSTEGSWIALLKPRPSYEETIRSVLITVEMLHFVRRR 120

Query: 121 PEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANH-- 178
           P  S K + + +  VF  + V P E+D  +H  + +  A+RD DLANS+ L  F+ +   
Sbjct: 121 PTDSEKDMWDHLYGVFERFVVRPLEDDFANHQNLIKLFAQRDPDLANSQVLQVFIKDKIM 180

Query: 179 PETNTFHEDVRTTEKPKVIVDWNEDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRS 238
            +TN   ++    E    +   ++D+D E  +  +++ +VCAICD+GG++  C+G C+RS
Sbjct: 181 EKTNEVAQEEIVEEGIPDVPSNDDDDDEEDEEDGDSFDSVCAICDNGGELLCCEGPCMRS 240

Query: 239 FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
           FHA I  G+++ C +LGYT+A++ A+ NF+C+NC +++HQCF CG L  SD  +++ VF 
Sbjct: 241 FHAKIRDGEDSYCATLGYTKAEVKALKNFVCKNCDHKQHQCFVCGELEPSDGPNAK-VFL 299

Query: 299 CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVED 358
           C +ATCG FYHP CV++LLHP++ + A E+ ++I AG SFTCPVH CF C+  ED   E 
Sbjct: 300 CNNATCGHFYHPRCVAQLLHPNSRNEASEMEKKIMAGFSFTCPVHWCFHCKGLEDRTQEP 359

Query: 359 LQLAICRRCPKAYHRKCLPTEITFSDADENNF-QRAWVDLLPNNRILIYCLEHKIISELK 417
           LQ A+CRRCP++YHRKCLP EI+F D +      RAW +L  + RILIYCL+H+I  ++ 
Sbjct: 360 LQFAVCRRCPRSYHRKCLPREISFEDINTQGIITRAW-EL--SKRILIYCLDHEIDLDIG 416

Query: 418 TPARDHLKFPGVEGKRKKEDLELLLTEEKDVASKRNIVSESFVADKTVVKKLKLAEVYSG 477
           TP RDH+KFP VE        ++    EK    KR I  +S+V++  + K+ KL E ++ 
Sbjct: 417 TPPRDHIKFPHVEKSAYSAKKKVKELAEK----KRRICDDSYVSE-PLQKRAKLNEKFNA 471

Query: 478 ADVGMSNSEIKKRWPRQDVHSLKKPNITDTGRKSLKDIADKSKPSLRKDSTLLKSRSFVV 537
                  + +K  +            + ++ +K  + +  +++P    +  L++  +   
Sbjct: 472 KGDKSKKAGVKSEFEE----------VLESEKKKTRSLKKRTQP----EEPLVECAAAAA 517

Query: 538 KPGHRNIDGSKMKNSISDRRRMKKVNISQ-PSVDAEMEKELLALIKDSTSSFNEEEFMKS 596
                     + K   +    M K+ +S  P VD+E EK + AL++   SS    +  + 
Sbjct: 518 ANNANRPVKEREKELGTSSLDMGKIPLSSFPIVDSETEKRISALVEKEVSSLTVADISRR 577

Query: 597 HIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDGGCDIEDAKAVCPPEILCQIF 656
            ++P T+A   + + +K +  G +E S++AV+AALE L+ G  ++DAKAVC  E+L Q+ 
Sbjct: 578 CVIPSTYACSGRQI-DKIVVRGKLERSIQAVKAALEKLENGGAVDDAKAVCESEVLRQLT 636

Query: 657 QWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWYVRSGDTIVDFCCGANDFS 716
           +W  KL VYLAPF+HGMRYTSFGRHFTK EKL EI ++LHWYV+ GD IVDF CG NDFS
Sbjct: 637 RWHNKLRVYLAPFIHGMRYTSFGRHFTKKEKLIEIAEKLHWYVQPGDMIVDFSCGTNDFS 696

Query: 717 CMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVK 776
             MK KL+++GK C+F+NYD+IQPKN FSFEKRDWMTVR +ELP GS+LIMGLNPPFG K
Sbjct: 697 QFMKEKLDKVGKRCNFKNYDVIQPKNSFSFEKRDWMTVRQKELPHGSKLIMGLNPPFGPK 756

Query: 777 ASLANKFISQALKFKPKLIVLIVPQETRRLDQKAS-YNLIWEDNEVLSGKSFYLPGSLDV 835
           A LANKFI +AL FKPKLI+LIVP+E  RLD+K   Y+L+WED++ LSGKSFYLPGSLDV
Sbjct: 757 AMLANKFIDKALTFKPKLIILIVPKEAERLDRKQQPYDLVWEDDQRLSGKSFYLPGSLDV 816

Query: 836 HDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGH 871
            D Q++QWN  PPPLYLWSR DWT  HK+IA   GH
Sbjct: 817 SDKQIDQWNKSPPPLYLWSRPDWTQKHKRIAEQHGH 852


>gi|222640382|gb|EEE68514.1| hypothetical protein OsJ_26949 [Oryza sativa Japonica Group]
          Length = 1311

 Score =  738 bits (1904), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 397/904 (43%), Positives = 561/904 (62%), Gaps = 57/904 (6%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGP 60
           M  S++D +     V +Y F+D N+  +SF VLP Q+D  + V S +  V+L G    G 
Sbjct: 1   MMSSDDDLEPQLKAVENYYFVDDNDVPVSFDVLPFQFDAAEGVASFKKDVYLRGFTDGGL 60

Query: 61  WNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQN 120
               +QV+AWK  L    PEI VLS + +WI L  P+ S++  +R++LITV  L+ V++ 
Sbjct: 61  QKVYKQVVAWKLVLDGDSPEIAVLSTEGSWIALLKPRPSYEETIRSVLITVEMLHFVRRR 120

Query: 121 PEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFM----- 175
           P  S K + + +  VF  + V P E+D  +H  + +  A+RD DLANS+ L  F+     
Sbjct: 121 PTDSEKDMWDHLYGVFERFVVRPLEDDFANHQNLIKLFAQRDPDLANSQVLQVFIKDKIM 180

Query: 176 --ANHPETNTF----HEDVR----TTEKPKVIVDWNEDEDSERV-DKDEN---------- 214
              N   +N        D++      ++P    D  E+   E + D   N          
Sbjct: 181 EKTNEVGSNNLDNKREPDIKQEPDIKQEPVAAGDEMEEIVEEGIPDAPSNDDDDDEEDEE 240

Query: 215 ----YFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQ 270
               + +VCAICD+GG++  C+G C+RSFHA I  G+++ C +LGYT+A++ A+ NF+C+
Sbjct: 241 DGDLFDSVCAICDNGGELLCCEGPCMRSFHAKIRDGEDSYCATLGYTKAEVKALKNFVCK 300

Query: 271 NCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRE 330
           NC +++HQCF CG L  SD  +++ VF C +ATCG FYHP CV++LLHP++ + A E+ +
Sbjct: 301 NCDHKQHQCFVCGELEPSDGPNAK-VFLCNNATCGHFYHPRCVAQLLHPNSRNEASEMEK 359

Query: 331 RIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENN- 389
           +I AG SFTCPVH CF C+  ED   E LQ A+CRRCP++YHRKCLP EI+F D +    
Sbjct: 360 KIMAGFSFTCPVHWCFHCKGLEDRTQEPLQFAVCRRCPRSYHRKCLPREISFEDINTQGI 419

Query: 390 FQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFPGVEGKRKKEDLELLLTEEKDVA 449
             RAW +L  + RILIYCL+H+I  ++ TP RDH+KFP VE        ++    EK   
Sbjct: 420 ITRAW-EL--SKRILIYCLDHEIDLDIGTPPRDHIKFPHVEKSAYSAKKKVKELAEK--- 473

Query: 450 SKRNIVSESFVADKTVVKKLKLAEVYSGADVGMSNSEIKKRWPRQDVHSLKKPNITDTGR 509
            KR I  +S+V++  + K+ KL E ++        + +K  +            + ++ +
Sbjct: 474 -KRRICDDSYVSE-PLQKRAKLNEKFNAKGDKSKKAGVKSEF----------EEVLESEK 521

Query: 510 KSLKDIADKSKPSLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQ-PS 568
           K  + +  +++P    +  L++  +             + K   +    M K+ +S  P 
Sbjct: 522 KKTRSLKKRTQP----EEPLVECAAAAAANNANRPVKEREKELGTSSLDMGKIPLSSFPI 577

Query: 569 VDAEMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVR 628
           VD+E EK + AL++   SS    +  +  ++P T+A   +  ++K +  G +E S++AV+
Sbjct: 578 VDSETEKRISALVEKEVSSLTVADISRRCVIPSTYACSGRQ-IDKIVVRGKLERSIQAVK 636

Query: 629 AALEMLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKL 688
           AAL+ L+ G  ++DAKAVC  E+L Q+ +W  KL VYLAPF+HGMRYTSFGRHFTK EKL
Sbjct: 637 AALQKLENGGAVDDAKAVCESEVLRQLTRWHNKLRVYLAPFIHGMRYTSFGRHFTKKEKL 696

Query: 689 KEIVDRLHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEK 748
            EI ++LHWYV+ GD IVDF CG NDFS  MK KL+++GK C+F+NYD+IQPKN FSFEK
Sbjct: 697 IEIAEKLHWYVQPGDMIVDFSCGTNDFSQFMKEKLDKVGKRCNFKNYDVIQPKNSFSFEK 756

Query: 749 RDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQ 808
           RDWMTVR +ELP GS+LIMGLNPPFG KA LANKFI +AL FKPKLI+LIVP+E  RLD+
Sbjct: 757 RDWMTVRQKELPHGSKLIMGLNPPFGPKAMLANKFIDKALTFKPKLIILIVPKEAERLDR 816

Query: 809 KAS-YNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIAL 867
           K   Y+L+WED++ LSGKSFYLPGSLDV D Q++QWN  PPPLYLWSR DWT  HK+IA 
Sbjct: 817 KQQPYDLVWEDDQRLSGKSFYLPGSLDVSDKQIDQWNKSPPPLYLWSRPDWTQKHKRIAE 876

Query: 868 GRGH 871
             GH
Sbjct: 877 QHGH 880


>gi|357512983|ref|XP_003626780.1| hypothetical protein MTR_8g009000 [Medicago truncatula]
 gi|355520802|gb|AET01256.1| hypothetical protein MTR_8g009000 [Medicago truncatula]
          Length = 1054

 Score =  723 bits (1866), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/711 (52%), Positives = 489/711 (68%), Gaps = 47/711 (6%)

Query: 201 NEDEDSERV-----DKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLG 255
           N+  D E V     ++D  Y  VCAICD+GG++  C+GRCLRSFHAT+  G+++LC SLG
Sbjct: 291 NDKSDGEGVLNLDEEQDIGYDTVCAICDNGGEILPCEGRCLRSFHATLEDGRDSLCASLG 350

Query: 256 YTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSK 315
           YT+ +++A PNF C+NC +++HQCFACG LGSSD+SS+ EVFPCV+A CG +YHPECV++
Sbjct: 351 YTRTEVNAFPNFYCENCKHKKHQCFACGKLGSSDESSNPEVFPCVTANCGHYYHPECVAR 410

Query: 316 LLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKC 375
           LL+P  +   EE+R+RI   ++F CP+H C +C++ E+ NV DLQ A+CRRCPKAYHRKC
Sbjct: 411 LLYPGIDIGQEEMRKRIIIEKTFVCPLHICSLCRKGENRNVHDLQFAMCRRCPKAYHRKC 470

Query: 376 LPTEITFS-DADENNFQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFPGVEGKRK 434
           LP EI+F+ D       RAW +LL + RIL+YC+ HKI+ EL TPARDHL FP  E KRK
Sbjct: 471 LPKEISFTYDYYTGIEMRAWDNLL-DKRILMYCMNHKIVLELGTPARDHLIFPNKEVKRK 529

Query: 435 KEDLELLLTEEKDVASKRNIVSESFVADKTVVKKLKLAEVYSGADVGMSNSEIKKRWPRQ 494
               E L  E+  +  K+    E  + +KT+  K+ + E   G  +G S+  ++K   +Q
Sbjct: 530 IISTESLHREKDAIPLKK--FFEDLLPNKTLKPKMTIKE-RVGLQMGGSSKVMEKICSKQ 586

Query: 495 DVHSLKKPNITDTGRKSLK----DIADKSKPSLRKDSTLLKSRSFVVKPGHRNIDGSKMK 550
           D H    P   D  RK LK      +++S P+  ++   LK+ +    P    +D ++ +
Sbjct: 587 DTHMSTGPVYFDRARKYLKVETMSGSNRSLPNY-ENKVPLKNVNLSCNP---RLDEARYQ 642

Query: 551 NSISDRR---------RMKKVNISQPSVDAEMEKELLALIKDSTSSFNEEEFMKSHIVPI 601
              S  R          +KKV  S     A+MEK +LAL++++TS+ N E+F K +    
Sbjct: 643 QKRSVGRIEETSWKKPPVKKVKTSLEDRKADMEKRILALMEEATSTLNMEKFKKDNHAVN 702

Query: 602 THAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDGGCDIEDAKAVCPPEILCQIFQWK-- 659
           T +  ++ +  K++TLG VEGSVKAV+ AL+ LD GC IE+AKA+C PEI+ Q+F W+  
Sbjct: 703 TSSSLTETVFRKTLTLGKVEGSVKAVQIALQKLDEGCGIEEAKAICEPEIIRQLFTWQNE 762

Query: 660 -----------------RKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWYVRSG 702
                            ++L +YLAPFLHG RYTSFGRHFTK++KLKEIVDRLHWYV+SG
Sbjct: 763 VAGIKSNSRPLGYRGSDKQLKIYLAPFLHGRRYTSFGRHFTKIDKLKEIVDRLHWYVQSG 822

Query: 703 DTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDG 762
           DT++DFCCGANDFSC+MK KLEQ GK CSF+NYDL Q KNDF+FEKRDWM+V+ EELP G
Sbjct: 823 DTVLDFCCGANDFSCLMKSKLEQTGKLCSFKNYDLFQAKNDFNFEKRDWMSVQAEELPHG 882

Query: 763 SQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLD-QKASYNLIWEDNEV 821
           S LI+GLNPPFGV+ SLANKFI +AL FKPKL++LIVP+ TRRLD +KA YNLIW D E+
Sbjct: 883 SNLIIGLNPPFGVRGSLANKFIDKALTFKPKLLILIVPKVTRRLDRKKAGYNLIWADEEI 942

Query: 822 LSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHI 872
            SGKSFYLPGS+D  D QLE WN KPPPLYLWSR DWTA H +IA    HI
Sbjct: 943 CSGKSFYLPGSVDTRDKQLEDWNLKPPPLYLWSRPDWTAWHMQIARMHRHI 993



 Score =  163 bits (412), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 89/207 (42%), Positives = 129/207 (62%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGP 60
           M  S+E+ + +   V  Y F +   E +S S L L W   D+  +SE +VFL G   +G 
Sbjct: 1   MASSDEEGEIVPDSVDGYWFENDKAEFVSLSSLTLLWSVNDVTCNSEAKVFLRGTTDNGL 60

Query: 61  WNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQN 120
              ++Q+I W+FELS+ QPEI VL ++  W+ L  P K F+N +R++L+TV+WL+CVK  
Sbjct: 61  QKIHKQIIGWRFELSFEQPEISVLLREKYWMTLLKPSKCFENTIRSVLVTVYWLHCVKWK 120

Query: 121 PEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPE 180
           PE SR  I  ++LK F S+++ PSEND+L HM +  EA +RD DL  SK+LL+F+     
Sbjct: 121 PEESRASILVKVLKEFSSFDITPSENDVLCHMALISEAVKRDTDLTKSKYLLNFIGKTCS 180

Query: 181 TNTFHEDVRTTEKPKVIVDWNEDEDSE 207
              FH DV TT+K KVIV+  E+ + E
Sbjct: 181 NGDFHADVHTTKKLKVIVESEENPEQE 207


>gi|357141809|ref|XP_003572354.1| PREDICTED: uncharacterized protein LOC100831561 [Brachypodium
           distachyon]
          Length = 888

 Score =  628 bits (1620), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 329/662 (49%), Positives = 430/662 (64%), Gaps = 64/662 (9%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           +CA+CD+GG++  C+G+CLRSFHAT  AG++  C++LGYT  Q DA+  FLC+NC  + +
Sbjct: 16  ICALCDNGGEIASCEGKCLRSFHATKGAGED--CKTLGYTTVQFDAIKVFLCKNCEREIY 73

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
           QCFAC  LGS+ K++  EVFPC S  CG FYH +CV+ LL P+NE+ A E   +I  G  
Sbjct: 74  QCFACHRLGSA-KTNPPEVFPCASPNCGHFYHAKCVAHLLFPENEAKATEYMTKIVNGAK 132

Query: 338 FTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADEN---NFQRAW 394
           F CPVH+C VC+  E   VE LQ A+CRRCPK+YHR+CLP +I+F D  EN   +FQRAW
Sbjct: 133 FGCPVHRCDVCKYGEKKEVEALQFAVCRRCPKSYHRRCLPRKISFDDIIENGVCHFQRAW 192

Query: 395 VDLLPNNRILIYCLEHKIISELKTPARDHLKFPGVEG-KRKKEDLELLLTEEKDVASKRN 453
             LLPNNRILIYC++H I  +L+TP RDH+KFPG     RK  ++  +    K V  +R 
Sbjct: 193 DGLLPNNRILIYCMKHDIDPKLRTPLRDHIKFPGDPALSRKPSNVNSM----KRVKIRR- 247

Query: 454 IVSESFVADKTVVKKLKLAEVYSGADVGMSNSE-IKKRWPRQDVHSLKKPNITDTGRKSL 512
              E  VA++ V   L  ++   G     S+S  I KR        +K P          
Sbjct: 248 --LEECVAEECVAVPLSNSKRSFGTTTCSSSSNLIAKR--------MKAPE--------- 288

Query: 513 KDIADKSKPSLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQ-PSVDA 571
              +  +K   R                 R + G K   S+        + +S  P VD 
Sbjct: 289 ---SGGAKACYR-----------------RPVSGEKAVTSV--------IPVSSFPEVDI 320

Query: 572 EMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAAL 631
              + +    + ++S    E+  K  +V  T+    K+    +ITLG VE SV+A+R AL
Sbjct: 321 NTARRIYEFAQKTSSEITIEDVQKKLVVSSTYTSFMKNT--DTITLGKVEKSVEAIRTAL 378

Query: 632 EMLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEI 691
            ML+ G  IE AK VC    L Q+ +WK KL++YLAPFLHGMRYTS+GRHFTK++KL++I
Sbjct: 379 HMLENGAGIEAAKGVCTQHDLFQLAKWKNKLNIYLAPFLHGMRYTSYGRHFTKLDKLEKI 438

Query: 692 VDRLHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDW 751
           VD+L WYV+SGDT+VDFCCG+NDFS ++K KLE   K+C ++NYDLIQPKNDF+FE+RDW
Sbjct: 439 VDKLQWYVQSGDTVVDFCCGSNDFSTLLKEKLEDSEKNCFYKNYDLIQPKNDFNFERRDW 498

Query: 752 MTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQK-A 810
           MTV+P+ELP GS+LIMGLNPPFG KASLAN+FI++AL FKPKLIVLIVP+ET RLD+K  
Sbjct: 499 MTVQPDELPAGSRLIMGLNPPFGFKASLANQFINKALSFKPKLIVLIVPRETERLDKKYP 558

Query: 811 SYNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRG 870
            Y LIWED+  LSGKSFYLPGS D  + Q+EQWN  PPPL LWSR+D+   H +IA  +G
Sbjct: 559 PYELIWEDSNQLSGKSFYLPGSFDADNKQMEQWNMSPPPLSLWSRSDYAQRHYEIARSKG 618

Query: 871 HI 872
           H+
Sbjct: 619 HL 620


>gi|218201406|gb|EEC83833.1| hypothetical protein OsI_29778 [Oryza sativa Indica Group]
          Length = 955

 Score =  581 bits (1497), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 326/718 (45%), Positives = 430/718 (59%), Gaps = 118/718 (16%)

Query: 197 IVDWNEDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGY 256
           + D  E E S  VD       VCA+CD+GG++  C+G+CLRSFHA   AG++  CQ+LGY
Sbjct: 1   MTDNAEGESSANVD------LVCALCDNGGEIASCEGKCLRSFHAVRDAGED--CQTLGY 52

Query: 257 TQAQID----------------------------------AVPNFLCQNCVYQEHQCFAC 282
           T+ Q D                                  A+  FLC+NC  +++QCFAC
Sbjct: 53  TRRQFDVSVLNLPLMNDTVFTSHVLHILDGNIHLAFPYLQALNPFLCKNCELEKYQCFAC 112

Query: 283 GMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPV 342
             LGS+ K+ + EVFPC SA CG FYH +CV++LL  +NE+ A E   +IA+G  F CP+
Sbjct: 113 MRLGSA-KTDTPEVFPCASANCGYFYHAKCVAQLLFTENEAKALEYTTKIASGVKFACPL 171

Query: 343 HKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADEN---NFQRAWVDLLP 399
           HKC VC+  E+ + ++LQ A+CRR            +I F D  +N   +FQRAW  LLP
Sbjct: 172 HKCDVCKYGENKDEKELQFAVCRR------------KIAFDDFVDNGVFHFQRAWEGLLP 219

Query: 400 NNRILIYCLEHKIISELKTPARDHLKF---PGVEGKRKKEDLELLLTEEKDVASKRNIVS 456
           NNRILI+CL+H I  +L+TP RDH+KF   P V   RK  D+  +               
Sbjct: 220 NNRILIFCLKHDIDPKLRTPTRDHIKFPDNPAV--TRKPFDVNGM--------------- 262

Query: 457 ESFVADKTVVKKLKLAEVYSGADVGMSNSEIKKRWPRQDVHSLKKPNITDTGRKSLKDIA 516
                +K VVK ++L E    A +                 S  K +     R S  D+ 
Sbjct: 263 -----NKKVVK-IRLLEDCPPAPL-----------------SSDKKSFGTVNRFSSSDVI 299

Query: 517 DKSKPSLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKE 576
            K +  L    T   + S V          ++ K S+     +     S P +D   E+ 
Sbjct: 300 TKKRKVLVSGGTKHCALSAV----------AREKTSVPSFIPLS----SFPVIDKSTERR 345

Query: 577 LLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDG 636
           +    +  +S    E+  K  +VP TH   SK+     ITLG+V+ SV+A+ AAL ML+ 
Sbjct: 346 IHEFAQKVSSDITIEDIQKKLVVPSTHTPVSKNT--DKITLGMVQRSVEAINAALHMLEN 403

Query: 637 GCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLH 696
           G  IED K+VC P  L Q+ +WK KL++YLAPFLHGMRYTS+GRHFTK++KL++IVDRL 
Sbjct: 404 GASIEDVKSVCAPSDLFQLARWKNKLNIYLAPFLHGMRYTSYGRHFTKLDKLEQIVDRLQ 463

Query: 697 WYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRP 756
           WY+ SGDT+VDFCCG+NDFS ++K KLE   KSC ++N+DLIQPKNDF+FE+RDWMTV+P
Sbjct: 464 WYIESGDTVVDFCCGSNDFSLLLKEKLEASEKSCFYKNFDLIQPKNDFNFERRDWMTVQP 523

Query: 757 EELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQK-ASYNLI 815
           +ELP G +LIMGLNPPFG KASLAN+FI++AL FKPKLI+LIVP+ET RLD+K   Y LI
Sbjct: 524 DELPTGCRLIMGLNPPFGFKASLANQFINKALTFKPKLIILIVPKETERLDRKYPPYELI 583

Query: 816 WEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHIT 873
           WED+  L+GKSFYLPGSLD  +  +EQWN  PPPL LWSR+DW   HK+IA   GHI+
Sbjct: 584 WEDSHQLAGKSFYLPGSLDADNKIMEQWNMSPPPLSLWSRSDWARKHKEIAKTMGHIS 641


>gi|222640820|gb|EEE68952.1| hypothetical protein OsJ_27843 [Oryza sativa Japonica Group]
          Length = 846

 Score =  471 bits (1213), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 268/593 (45%), Positives = 350/593 (59%), Gaps = 91/593 (15%)

Query: 296 VFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMN 355
           VFPC SA CG FYH +CV++LL  +NE+ A E   +IA+G  F CP+HKC VC+  E+ +
Sbjct: 16  VFPCASANCGYFYHAKCVAQLLFTENEAKALEYTTKIASGVKFACPLHKCDVCKYGENKD 75

Query: 356 VEDLQLAICRRCPKAYHRKCLPTEITFSDADEN---NFQRAWVDLLPNNRILIYCLEHKI 412
            ++LQ A+CRR            +I F D  +N   +FQRAW  LLPNNRILI+CL+H I
Sbjct: 76  EKELQFAVCRR------------KIAFDDFVDNGVFHFQRAWEGLLPNNRILIFCLKHDI 123

Query: 413 ISELKTPARDHLKFPGVEG-KRKKEDLELLLTEEKDVASKRNIVSESFVADKTVVKKLKL 471
             +L+TP RDH+KFP      RK  D+  +                    +K VV K++L
Sbjct: 124 DPKLRTPTRDHIKFPDNPAVTRKPFDVNGM--------------------NKKVV-KIRL 162

Query: 472 AEVYSGADVGMSNSEIK--KRWPRQDVHSLKKPNITDTGRK--SLKDIADK--SKPSLRK 525
            E    A +           R+   DV + K+  +   G K  +L  +A +  S PS   
Sbjct: 163 LEDCPPAPLSSDKKSFGTVNRFSSSDVITKKRKVLVSGGTKHCALSAVAREKTSVPSF-- 220

Query: 526 DSTLLKSRSFVVKPGHRNIDGSKMKNS----ISDRRRMKKVNISQPSVDAEMEKELLALI 581
               L S   + K   R I     K S    I D ++   V  +   V    +K  L ++
Sbjct: 221 --IPLSSFPVIDKSTERRIHEFAQKVSSDITIEDIQKKLVVPSTHTPVSKNTDKITLGMV 278

Query: 582 KDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDGGCDIE 641
           + S  + N                 + H+LE                        G  IE
Sbjct: 279 QRSVEAINA----------------ALHMLEN-----------------------GASIE 299

Query: 642 DAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWYVRS 701
           D K+VC P  L Q+ +WK KL++YLAPFLHGMRYTS+GRHFTK++KL++IVDRL WY+ S
Sbjct: 300 DVKSVCAPSDLFQLARWKNKLNIYLAPFLHGMRYTSYGRHFTKLDKLEQIVDRLQWYIES 359

Query: 702 GDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPD 761
           GDT+VDFCCG+NDFS ++K KLE   KSC ++N+DLIQPKNDF+FE+RDWMTV+P+ELP 
Sbjct: 360 GDTVVDFCCGSNDFSLLLKEKLEASEKSCFYKNFDLIQPKNDFNFERRDWMTVQPDELPT 419

Query: 762 GSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQK-ASYNLIWEDNE 820
           G +LIMGLNPPFG KASLAN+FI++AL FKPKLI+LIVP+ET RLD+K   Y LIWED+ 
Sbjct: 420 GCRLIMGLNPPFGFKASLANQFINKALTFKPKLIILIVPKETERLDRKYPPYELIWEDSH 479

Query: 821 VLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHIT 873
            L+GKSFYLPGSLD  +  +EQWN  PPPL LWSR+DW   HK+IA   GHI+
Sbjct: 480 QLAGKSFYLPGSLDADNKIMEQWNMSPPPLSLWSRSDWARKHKEIAKTMGHIS 532


>gi|414588820|tpg|DAA39391.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 751

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 274/752 (36%), Positives = 416/752 (55%), Gaps = 89/752 (11%)

Query: 15  VSDYEFLD--QNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGPWNFNEQVIAWKF 72
           V  Y F+D   + E + FS LP  +   D +   + ++ L G  GD       +V+AW  
Sbjct: 18  VESYYFVDSTNDNEPVCFSTLPFLFGDTDDLPDCKKRLVLRGTGGDPGVKVYSEVVAWSL 77

Query: 73  ELSYAQPEIWVLSKQNN-WIMLQSPKKSFKNIVRTILITVHWLYCVKQNPEASRKYIREQ 131
            L   QP+  VL+ +   W+ L  P  S++ ++RT+LIT H L+ + + P    + +   
Sbjct: 78  GLEGKQPQFAVLTAEGRRWVRLVRPLNSYEEMIRTVLITAHMLHFLMRKPHEPERTLWNH 137

Query: 132 MLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANS-----------------KFLLSF 174
           + KVF  ++V PSE+D  +H+ + ++   +D  LA S                 +FLLS+
Sbjct: 138 LCKVFNKFDVRPSEDDFRNHISLMKQFVAKDPILAKSEGSLPHEWGLTKGKTKAQFLLSY 197

Query: 175 MANHP---------------------------ETNTFHEDVRTT------EKPKVIVDWN 201
           +                               E    H DV+        +  ++IV+  
Sbjct: 198 VMPRAVLCKVVNIVLKTILRVFVEGKSGKIFTEAGADHIDVKQPFITDDEDIDEMIVEDA 257

Query: 202 EDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQI 261
           ++E  E  D D  + ++CAICDDGGD+  CDG C+RSFHA   +G+++ C +LGYT+A++
Sbjct: 258 DNESDEEEDGDL-FDSICAICDDGGDILCCDGPCMRSFHAKEGSGEDSYCDTLGYTEAEV 316

Query: 262 DAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDN 321
           +A+  FLC+NC Y++HQCF CG+L  S+ ++++ VF C +ATCG FYHP+CV++ LHP+N
Sbjct: 317 EAMKLFLCKNCEYKQHQCFICGVLEPSNGAAAK-VFLCNNATCGHFYHPKCVAQQLHPNN 375

Query: 322 ESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEIT 381
              A EL ++I  G SFTCP+H CF C+  ED     LQ A+CRRCPK+YHRKCLP EI+
Sbjct: 376 IDEASELEKKITDGFSFTCPIHWCFHCKGLEDRTQGPLQFAVCRRCPKSYHRKCLPREIS 435

Query: 382 FSDADENN-FQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFPGVEGKRK--KEDL 438
           F + +E     RAW +L  +  ILIYCL H+I SEL+TP RDH+KFP +E   +  K+  
Sbjct: 436 FDEIEEKGVIIRAW-EL--SKMILIYCLNHEIDSELETPVRDHIKFPRIEKPLQFVKKGA 492

Query: 439 ELLLTEEKDVASKRNIVSESFVADKTVVKKLKLAEVYSGADVGMSNSEIKKRWPRQ-DVH 497
            LL  ++K   S+            +++++     V   + V M  SE  K   R+  V 
Sbjct: 493 NLLPKKKKRTYSE------------SILEQPPKDTVKMKSKVRMQESEQNKHDTREVSVK 540

Query: 498 SLKKPNITDTGRKSLKDIADKSK-PSLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDR 556
           S  + N+ DT  K    + +  + P + KD  +   +S  VK   + +    +   +S  
Sbjct: 541 SFTQ-NVVDTPVKKKGKLKENKQLPYVAKDHFVSSPKS--VKEQEQEL----VPLPLSGI 593

Query: 557 RRMKKVNISQPSVDAEMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSIT 616
           R+      S P VD+E+EK +++++ +  SS   ++  ++  +P TH +  +      + 
Sbjct: 594 RKS-----SFPKVDSEIEKRVISILGNEISSLTIKDVTRNCSLPSTHVYSGRPT--DKMP 646

Query: 617 LGLVEGSVKAVRAALEMLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYT 676
           LG +E SV+AV AAL+ L  G  + DAKAVC P++L Q+ +W  KL VY++PF+HG RY+
Sbjct: 647 LGKIERSVQAVEAALKKLKNGGSVNDAKAVCEPDVLRQLARWHTKLRVYISPFIHGTRYS 706

Query: 677 SFGRHFTKVEKLKEIVDRLHWYVRSGDTIVDF 708
           SFGRHFTKVEKL EIVDRLHWYV SGD +  F
Sbjct: 707 SFGRHFTKVEKLVEIVDRLHWYVESGDMVYFF 738


>gi|218188313|gb|EEC70740.1| hypothetical protein OsI_02147 [Oryza sativa Indica Group]
          Length = 1296

 Score =  443 bits (1140), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 248/535 (46%), Positives = 341/535 (63%), Gaps = 35/535 (6%)

Query: 343 HKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDA-DENNFQRAWVDLLPNN 401
           H+CF+C   E  +    ++ +C             TEI+F D+ +E+   RAW +L  + 
Sbjct: 294 HQCFICGALEPSDGPTAKVFLCNNA----------TEISFEDSENEDAITRAW-EL--SK 340

Query: 402 RILIYCLEHKIISELKTPARDHLKFPGVEGKRKKEDLELLLTEEKDVASKRNIVSESFVA 461
           RILIYCL+H+I +++ TP RDH+KFP      K      LL +   V  K+   SES V 
Sbjct: 341 RILIYCLDHEIDTDIDTPIRDHIKFPRTPNIEKPA---RLLKKSTKVVKKKRPFSES-VP 396

Query: 462 DKTVVKKLKLAEVYSGADVGMSNSEIKKRWPRQDVHSLKKPNITDTGRKSLKDIADKSKP 521
           D+   +  KL ++     V +  S+   R  +    S  +  +    +K  + + D  +P
Sbjct: 397 DQLSTELRKLPDM-----VCIQESD---RARKISARSSSEQFVVKPDKKKARFLKDTPQP 448

Query: 522 SLRKDSTLLKSRSF-VVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKELLAL 580
               D  LLK  S  + KP      G  +    S     K    S P VD+E EK ++AL
Sbjct: 449 ----DPCLLKDASTGITKPAKEQ--GKLLVTMPSSSTSTKIPQSSFPRVDSETEKRVMAL 502

Query: 581 IKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDGGCDI 640
           ++   S    ++  +  ++P THA+  + + ++ I  G +E SV+AV AAL+ML+ G ++
Sbjct: 503 VEREASYLTLKDISRKCLMPSTHAYSGRQV-DRIIATGKLERSVQAVGAALKMLENGGNV 561

Query: 641 EDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWYVR 700
            DAKAVC PE+L Q+ +W  KL VY++PF++G RY+SFGRHFTKVEKL EIVD+LH YV 
Sbjct: 562 NDAKAVCEPEVLKQLTRWHSKLRVYISPFIYGTRYSSFGRHFTKVEKLVEIVDKLHCYVE 621

Query: 701 SGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELP 760
            GDTIVDFCCGANDFS +MK KL+++ K C F+NYDLIQP+N F+FE++DWMTVRP+ELP
Sbjct: 622 PGDTIVDFCCGANDFSRLMKEKLDKVQKKCHFKNYDLIQPQNCFAFERKDWMTVRPKELP 681

Query: 761 DGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKAS-YNLIWEDN 819
            GS+LIMGLNPPFGVKASLANKFI +AL FKPKL++LIVP+ET+RLDQK + Y+LIWED+
Sbjct: 682 HGSKLIMGLNPPFGVKASLANKFIDKALTFKPKLVILIVPKETKRLDQKKTPYDLIWEDS 741

Query: 820 EVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHITV 874
           + LSGK+FYLPGS+DV+D  +E WN   PPLYLWS  DWT  H+K+A    H ++
Sbjct: 742 DCLSGKAFYLPGSVDVNDKVVEGWNASAPPLYLWSHPDWTRKHRKVAEEHNHTSI 796



 Score =  217 bits (552), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 124/316 (39%), Positives = 181/316 (57%), Gaps = 18/316 (5%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGP 60
           M + ++DD+     V +Y  LD  E  +  SVLP Q+   D V   +  VFL G    G 
Sbjct: 9   MMFDDDDDEPQLNAVDNYYLLDAREVPVCLSVLPFQFKDTDEVPECKKDVFLWGTADPG- 67

Query: 61  WNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQN 120
                +VIAWK  L   QPEI VLS + +WI L  PK S++  +RTILITV  L+ +K+ 
Sbjct: 68  IKVYRKVIAWKLGLQGKQPEISVLSAEGSWISLTKPKNSYEEKIRTILITVQMLHFLKKK 127

Query: 121 PEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPE 180
           PE   K +   + K+F  +EV PSE+DL +H  + +  AE+D  LA S+ L  F     E
Sbjct: 128 PEEPEKNLWSHLRKIFDKFEVRPSEDDLRNHRSLIKHFAEKDSTLAKSEILQGFTQ---E 184

Query: 181 TNTFHEDVRTTEKPKVIVDW-NEDED-SERVDKDEN-----------YFAVCAICDDGGD 227
           T+        ++K ++ V +  +DED  E  D D N           + ++C+ICD+GGD
Sbjct: 185 TSRKKFSEVGSDKVEIKVPFIADDEDIEEMADVDNNIESDEEEEEDLFDSICSICDNGGD 244

Query: 228 VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGS 287
           +  CDG C+RSFHA I  G+++ C +LGYT+A++ A+  FLC+NC +++HQCF CG L  
Sbjct: 245 LLCCDGPCMRSFHAKIGTGEDSYCDTLGYTEAEVQAMKTFLCKNCEHKQHQCFICGALEP 304

Query: 288 SDKSSSQEVFPCVSAT 303
           SD  +++ VF C +AT
Sbjct: 305 SDGPTAK-VFLCNNAT 319


>gi|222618537|gb|EEE54669.1| hypothetical protein OsJ_01967 [Oryza sativa Japonica Group]
          Length = 1296

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 247/535 (46%), Positives = 341/535 (63%), Gaps = 35/535 (6%)

Query: 343 HKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDAD-ENNFQRAWVDLLPNN 401
           H+CF+C   E  +    ++ +C             TEI+F D++ E+   RAW +L  + 
Sbjct: 294 HQCFICGALEPSDGPAAKVFLCNNA----------TEISFEDSENEDAITRAW-EL--SK 340

Query: 402 RILIYCLEHKIISELKTPARDHLKFPGVEGKRKKEDLELLLTEEKDVASKRNIVSESFVA 461
           RILIYCL+H+I +++ TP RDH+KFP      K      LL +   V  K+   SES V 
Sbjct: 341 RILIYCLDHEIDTDIDTPIRDHIKFPRTPNIEKPA---RLLKKSTKVVKKKRPFSES-VP 396

Query: 462 DKTVVKKLKLAEVYSGADVGMSNSEIKKRWPRQDVHSLKKPNITDTGRKSLKDIADKSKP 521
           D+   +  KL ++     V +  S+   R  +    S  +  +    +K  + + D  +P
Sbjct: 397 DQLSTELRKLPDM-----VCIQESD---RARKISARSSSEQFVVKPDKKKARFLKDTPQP 448

Query: 522 SLRKDSTLLKSRSF-VVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKELLAL 580
               D  LLK  S  + KP      G  +    S     K    S P VD+E EK ++AL
Sbjct: 449 ----DPCLLKDASTGITKPAKEQ--GKLLVTMPSSSTSTKIPQSSFPRVDSETEKRVMAL 502

Query: 581 IKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDGGCDI 640
           ++   S    ++  +  ++P THA+  + + ++ I  G +E SV+AV AAL+ML+ G ++
Sbjct: 503 VEREASYLTLKDISRKCLMPSTHAYSGRQV-DRIIATGKLERSVQAVGAALKMLENGGNV 561

Query: 641 EDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWYVR 700
            DAKAVC PE+L Q+ +W  KL VY++PF++G RY+SFGRHFTKVEKL EIVD+LH YV 
Sbjct: 562 NDAKAVCEPEVLKQLTRWHSKLRVYISPFIYGTRYSSFGRHFTKVEKLVEIVDKLHCYVE 621

Query: 701 SGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELP 760
            GDTIVDFCCGANDFS +MK KL+++ K C F+NYDLIQP+N F+FE++DWMTVRP+ELP
Sbjct: 622 PGDTIVDFCCGANDFSRLMKEKLDKVQKKCHFKNYDLIQPQNCFAFERKDWMTVRPKELP 681

Query: 761 DGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKAS-YNLIWEDN 819
            GS+LIMGLNPPFGVKASLANKFI +AL FKPKL++LIVP+ET+RLDQK + Y+LIWED+
Sbjct: 682 HGSKLIMGLNPPFGVKASLANKFIDKALTFKPKLVILIVPKETKRLDQKKTPYDLIWEDS 741

Query: 820 EVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHITV 874
           + LSGK+FYLPGS+DV+D  +E WN   PPL+LWS  DWT  H+K+A    H ++
Sbjct: 742 DCLSGKAFYLPGSVDVNDKVVEGWNASAPPLHLWSHPDWTRKHRKVAEEHNHTSI 796



 Score =  216 bits (550), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 124/316 (39%), Positives = 181/316 (57%), Gaps = 18/316 (5%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGP 60
           M + ++DD+     V +Y  LD  E  +  SVLP Q+   D V   +  VFL G    G 
Sbjct: 9   MMFDDDDDEPQLNAVDNYYLLDAREVPVCLSVLPFQFKDTDEVPECKKDVFLWGTADPG- 67

Query: 61  WNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQN 120
                +VIAWK  L   QPEI VLS + +WI L  PK S++  +RTILITV  L+ +K+ 
Sbjct: 68  IKVYRKVIAWKLGLQGKQPEISVLSAEGSWISLTKPKNSYEEKIRTILITVQMLHFLKKK 127

Query: 121 PEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPE 180
           PE   K +   + K+F  +EV PSE+DL +H  + +  AE+D  LA S+ L  F     E
Sbjct: 128 PEEPEKNLWSHLRKIFDKFEVRPSEDDLRNHRLLIKHFAEKDSTLAKSEILQGFTQ---E 184

Query: 181 TNTFHEDVRTTEKPKVIVDW-NEDED-SERVDKDEN-----------YFAVCAICDDGGD 227
           T+        ++K ++ V +  +DED  E  D D N           + ++C+ICD+GGD
Sbjct: 185 TSRKKFSEVGSDKVEIKVPFIADDEDIEEMADVDNNIESDEEEEEDLFDSICSICDNGGD 244

Query: 228 VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGS 287
           +  CDG C+RSFHA I  G+++ C +LGYT+A++ A+  FLC+NC +++HQCF CG L  
Sbjct: 245 LLCCDGPCMRSFHAKIGTGEDSYCDTLGYTEAEVQAMKTFLCKNCEHKQHQCFICGALEP 304

Query: 288 SDKSSSQEVFPCVSAT 303
           SD  +++ VF C +AT
Sbjct: 305 SDGPAAK-VFLCNNAT 319


>gi|356503198|ref|XP_003520398.1| PREDICTED: uncharacterized protein LOC100810533 [Glycine max]
          Length = 973

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 209/356 (58%), Positives = 267/356 (75%), Gaps = 27/356 (7%)

Query: 545 DGSKMKNSISDRRRMKKVNISQPSVDAEMEKELLALIKDSTSSFNEEEFMKSHIVPITHA 604
           DGS  K  I   + +KK+  S  + +AEME  +L+L+KD+ S+FNEEEF K+H    T +
Sbjct: 559 DGSIGKTGI--EKPLKKIQTSLDANNAEMENSILSLMKDTMSTFNEEEFKKNHQAFSTTS 616

Query: 605 HHSKHLLE-KSITLGLVEGSVKAVRAALEMLDGGCDIEDAKAVCPPEILCQIFQWK---- 659
             ++ +   K++T G VEGSVKA++ AL+ L+ G  IE+AKA+C P +L Q+F W+    
Sbjct: 617 GFTEPVSHHKNLTQGKVEGSVKAIQTALQRLEEGGSIEEAKAICDPGVLHQLFIWQFSDQ 676

Query: 660 -------------------RKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWYVR 700
                              ++L VYL+PFLHGMRYTSFGRHFTK++KLKE+ +RLHWYV+
Sbjct: 677 SGHSKNQLISEPVRTGLKQKQLKVYLSPFLHGMRYTSFGRHFTKIDKLKEVANRLHWYVQ 736

Query: 701 SGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELP 760
           +GDT++DFCCG+NDFS +MK KLE+MGKSCSF+NYDL QPKNDF+FEKRDWM+V  EELP
Sbjct: 737 NGDTVLDFCCGSNDFSRLMKSKLEEMGKSCSFKNYDLFQPKNDFNFEKRDWMSVNAEELP 796

Query: 761 DGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKA-SYNLIWEDN 819
           +GSQLI+GLNPPFGVK SLANKFI++AL FKPKL++LIVP+ T+RLD+K   Y+LIWED+
Sbjct: 797 NGSQLIIGLNPPFGVKGSLANKFINKALTFKPKLLILIVPKVTKRLDRKKEGYDLIWEDD 856

Query: 820 EVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHITVE 875
           E+LSGKSFYLPGS+D  D QLE WN KPPPLYLWSR DWT  H++IA    HI  E
Sbjct: 857 EMLSGKSFYLPGSVDTRDKQLEDWNLKPPPLYLWSRPDWTTKHREIAQKHCHIKEE 912



 Score =  308 bits (788), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 155/295 (52%), Positives = 202/295 (68%), Gaps = 26/295 (8%)

Query: 142 EPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPETNTFHEDVRTTEKPKVIVDWN 201
           E  END  D +P  +           SKF+  F     +      DV  T++ K IVD +
Sbjct: 270 EDEENDQSDVLPTKK-----------SKFIGDF----EDEKNGQSDVFPTKESKFIVD-S 313

Query: 202 EDEDSERVD--------KDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQS 253
           EDE++++ D        ++  Y  VCAICD+GG++  C+GRCLRSFHAT  AG +A+C+S
Sbjct: 314 EDEENDQSDGELDPDGEQNIGYDTVCAICDNGGEILPCEGRCLRSFHATKEAGIDAVCES 373

Query: 254 LGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECV 313
           LGYT AQ+ A PNF CQNC Y+ HQCFACG LGSSD SS  EVFPCV+A CG +YHPECV
Sbjct: 374 LGYTSAQVKAFPNFYCQNCKYKLHQCFACGKLGSSDVSSKAEVFPCVTANCGHYYHPECV 433

Query: 314 SKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHR 373
           ++LL P  ++  EE+R+++A G++F CP+H C +C++ E+ N  DLQ AICRRCPKAYHR
Sbjct: 434 ARLLSPSIDTEQEEMRKKVAMGKAFVCPLHICSLCKKGENKNFHDLQFAICRRCPKAYHR 493

Query: 374 KCLPTEITFS-DADENNFQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFP 427
           KCLP EI+F  D D++  QRAW  LL +++ILIYCL+H I+ EL TPARDHL FP
Sbjct: 494 KCLPREISFVFDNDKSIEQRAWDGLL-DHQILIYCLDHVIVRELGTPARDHLVFP 547



 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 94/213 (44%), Positives = 133/213 (62%), Gaps = 4/213 (1%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGD-G 59
           M  S+E+ + +   V  Y F ++ EE +S S L L W   + V   + QVFL G   D G
Sbjct: 1   MASSDEEGEIIPDLVDSYWFENEKEEFVSLSSLTLLWSISETVCGLKAQVFLRGTTDDDG 60

Query: 60  PWNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQ 119
               ++Q+I W+FEL   +PEI VLSK  +WI LQ P+K F++ ++ IL+TV+ L+ VK 
Sbjct: 61  LQTIHKQIIGWRFELPSGKPEISVLSKDKSWITLQRPRKCFESTIKAILVTVYCLHFVKW 120

Query: 120 NPEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHP 179
           NPE S+  I  +MLK F S+++ PSENDLL ++ + REA ERDKDL  SK+LL FM    
Sbjct: 121 NPEESQICIWVKMLKAFSSFDIVPSENDLLSNVTLIREAVERDKDLTKSKYLLDFMERAC 180

Query: 180 ETNTFHE--DVRTTEKPKVIVDWNEDEDSERVD 210
               FHE  +V  T+K K IVD +EDE++++ D
Sbjct: 181 SNEGFHESSNVLPTKKSKFIVD-SEDEENDQSD 212


>gi|115477142|ref|NP_001062167.1| Os08g0502000 [Oryza sativa Japonica Group]
 gi|42407765|dbj|BAD08911.1| unknown protein [Oryza sativa Japonica Group]
 gi|113624136|dbj|BAF24081.1| Os08g0502000 [Oryza sativa Japonica Group]
 gi|215740510|dbj|BAG97166.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 694

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 187/310 (60%), Positives = 236/310 (76%), Gaps = 3/310 (0%)

Query: 565 SQPSVDAEMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSV 624
           S P +D   E+ +    +  +S    E+  K  +VP TH   SK+     ITLG+V+ SV
Sbjct: 73  SFPVIDKSTERRIHEFAQKVSSDITIEDIQKKLVVPSTHTPVSKNT--DKITLGMVQRSV 130

Query: 625 KAVRAALEMLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTK 684
           +A+ AAL ML+ G  IED K+VC P  L Q+ +WK KL++YLAPFLHGMRYTS+GRHFTK
Sbjct: 131 EAINAALHMLENGASIEDVKSVCAPSDLFQLARWKNKLNIYLAPFLHGMRYTSYGRHFTK 190

Query: 685 VEKLKEIVDRLHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDF 744
           ++KL++IVDRL WY+ SGDT+VDFCCG+NDFS ++K KLE   KSC ++N+DLIQPKNDF
Sbjct: 191 LDKLEQIVDRLQWYIESGDTVVDFCCGSNDFSLLLKEKLEASEKSCFYKNFDLIQPKNDF 250

Query: 745 SFEKRDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETR 804
           +FE+RDWMTV+P+ELP G +LIMGLNPPFG KASLAN+FI++AL FKPKLI+LIVP+ET 
Sbjct: 251 NFERRDWMTVQPDELPTGCRLIMGLNPPFGFKASLANQFINKALTFKPKLIILIVPKETE 310

Query: 805 RLDQK-ASYNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHK 863
           RLD+K   Y LIWED+  L+GKSFYLPGSLD  +  +EQWN  PPPL LWSR+DW   HK
Sbjct: 311 RLDRKYPPYELIWEDSHQLAGKSFYLPGSLDADNKIMEQWNMSPPPLSLWSRSDWARKHK 370

Query: 864 KIALGRGHIT 873
           +IA   GHI+
Sbjct: 371 EIAKTMGHIS 380


>gi|357494761|ref|XP_003617669.1| hypothetical protein MTR_5g094150 [Medicago truncatula]
 gi|355519004|gb|AET00628.1| hypothetical protein MTR_5g094150 [Medicago truncatula]
          Length = 1169

 Score =  390 bits (1003), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 191/422 (45%), Positives = 268/422 (63%), Gaps = 16/422 (3%)

Query: 12  TLYVSDYEFLDQNEESISFSVLPLQWDGCDI--VGSSE--MQVFLHGNIGDGPWNFNEQV 67
           +LYV++Y   D + E + FSVLP +W    +  V  +E  +++FLHG+  +G      +V
Sbjct: 20  SLYVTNYHLEDNDGEPVCFSVLPFKWGASKVSKVDDTEKGLKIFLHGSADNGLQTIFMEV 79

Query: 68  IAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQNPEASRKY 127
            AWKF+L+  +PE+ +LSK   WI LQ P+KSF+  V+  LIT+++L C K+NP  S + 
Sbjct: 80  TAWKFDLTGLKPEVSLLSKDKRWIKLQKPRKSFQKTVKNALITLYFLRCAKKNPRLSVES 139

Query: 128 IREQMLKV--FCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPETNTFH 185
               + K     SY  +PS+ DLLDHM +  EA  RD  LANSK LLS + +        
Sbjct: 140 FWCNVCKDKDLSSYGFKPSQKDLLDHMSLIGEATTRDVGLANSKLLLSVLGDKSGDQKPS 199

Query: 186 EDVRTTEKPKVIVDWNEDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITA 245
           ++   T    +I + +ED   E    D+ + + CA+CD+GG++  CDG C+RSFHAT   
Sbjct: 200 DEEFDTNN-DIIEESDEDSKVE----DDLFDSFCALCDNGGNLIMCDGACMRSFHATKVD 254

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
           G+ +LC SLG+T+ ++D +  F C+NC Y +HQCFACG LGSSDK    EVF C S TC 
Sbjct: 255 GRESLCDSLGFTKKEVDDIETFYCKNCEYHQHQCFACGELGSSDKDKDAEVFKCASETCD 314

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICR 365
           +FYHP C++ L+    + ++E+L +  A G++F+CP+H C VC+  E+    +LQ A+C 
Sbjct: 315 RFYHPHCIAPLV----KHVSEDLEKNNADGQTFSCPIHFCCVCEGLENKMDPELQFAVCN 370

Query: 366 RCPKAYHRKCLPTEITFSDADENNFQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLK 425
           RCPK+YHR+CLPTEI F D D+    RAW  LLPNNRILIYCL H+I  EL TP RDH+K
Sbjct: 371 RCPKSYHRRCLPTEIAFDDIDD-VATRAWEGLLPNNRILIYCLNHEISDELGTPVRDHIK 429

Query: 426 FP 427
           FP
Sbjct: 430 FP 431


>gi|297791955|ref|XP_002863862.1| hypothetical protein ARALYDRAFT_494862 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309697|gb|EFH40121.1| hypothetical protein ARALYDRAFT_494862 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 625

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 192/340 (56%), Positives = 249/340 (73%), Gaps = 13/340 (3%)

Query: 534 SFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKELLALIKDSTSSFNEEEF 593
           S    P HR  +G   +  I+D  +  K +IS   V    E+ ++ +I +  SSFN +EF
Sbjct: 265 SMETDPNHR--EGLFTRLDITDPYKKPKKDISVNDV----EQRVMGIIDEVESSFNFDEF 318

Query: 594 MKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDGGCDIEDAKAVCPPEILC 653
           +KS       ++HS + + K+IT GLV+  V A RAAL+M + G D +DA+A+  P+IL 
Sbjct: 319 VKSRKPTHIKSYHSGNDISKNITTGLVQTHVNAARAALKMFEEGRD-KDARAIFDPDILL 377

Query: 654 QIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWYVRSGDTIVDFCCGAN 713
           Q+ + K KL++YL+      RYTSFGRHFT  EKLKEIV+RLHWYV +GDT+VDFCCG+N
Sbjct: 378 QLMKHKTKLEIYLS------RYTSFGRHFTNPEKLKEIVERLHWYVENGDTVVDFCCGSN 431

Query: 714 DFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPF 773
           DFSC+MK KL + GK+C F+N+DLI PKN+F+FEKRDW++V+PEELPD SQLIMGLNPPF
Sbjct: 432 DFSCLMKEKLMETGKTCFFKNFDLIPPKNNFNFEKRDWLSVKPEELPDCSQLIMGLNPPF 491

Query: 774 GVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKASYNLIWEDNEVLSGKSFYLPGSL 833
           G KASLAN FI +AL+FKPK+++LIVP ET+R+D  A Y LIWED  +L+G SFYLPGS+
Sbjct: 492 GHKASLANTFIKKALEFKPKILILIVPSETKRVDAIADYELIWEDRNLLAGMSFYLPGSV 551

Query: 834 DVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHIT 873
           DV+D  +EQWN  PPPLYLWSR DW+ SHK  AL + HIT
Sbjct: 552 DVNDKTIEQWNNIPPPLYLWSRRDWSRSHKTTALQQDHIT 591



 Score =  276 bits (705), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 131/249 (52%), Positives = 174/249 (69%), Gaps = 6/249 (2%)

Query: 192 EKPKVIVDWNEDEDSERVDKDEN----YFAVCAICDDGGDVTFCDGRCLRSFHATITAGK 247
           E+  ++ +  ++E+S   D D      +  VC+ICD+GG V  C+G CLRSFH TI  G 
Sbjct: 2   EQNFIVNNMIDEENSSGSDDDSQTNLQFDPVCSICDNGGYVLCCEGSCLRSFHPTIADGI 61

Query: 248 NALCQSLGYT-QAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQ 306
              C+SLG+T + QI A+  +LC NC+Y++HQC+ACG LGSSD++SSQ+VFPC ++ CG 
Sbjct: 62  ETECESLGFTYKTQIQALGTYLCNNCLYKQHQCYACGELGSSDENSSQQVFPCSASNCGH 121

Query: 307 FYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRR 366
           FYHP CV++LL  D++  AEEL+ +IAA + F CP+H C VC  SED N   L  A+CRR
Sbjct: 122 FYHPICVARLLCADDQIKAEELQAKIAARDYFACPLHICKVCNMSEDKNEYALHFAVCRR 181

Query: 367 CPKAYHRKCLPTEITFS-DADENNFQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLK 425
           CP AYHRKCLP EIT   ++D++  QRAW  LLP NRILIYCL H+I   + TPARDHL 
Sbjct: 182 CPTAYHRKCLPREITSELNSDDDTLQRAWERLLPYNRILIYCLTHEIDGHILTPARDHLI 241

Query: 426 FPGVEGKRK 434
           FP + G+R+
Sbjct: 242 FPDISGQRR 250


>gi|413925249|gb|AFW65181.1| hypothetical protein ZEAMMB73_193747 [Zea mays]
          Length = 584

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 176/300 (58%), Positives = 231/300 (77%), Gaps = 3/300 (1%)

Query: 577 LLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDG 636
           +    + +++    ++  K  +VP T+A + ++    +ITL  VE  V+AV+ AL ML+ 
Sbjct: 1   MYEFAQKASAGITMDDVKKKLVVPSTYAPNLQN--ADNITLENVEIYVEAVKGALHMLEN 58

Query: 637 GCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLH 696
           G  I DAK+VCPP +L Q+ +WK KL V LAPFLHGMRYTS+GRHFTK++KL+ IVD+L 
Sbjct: 59  GASIADAKSVCPPNVLFQLVKWKNKLSVVLAPFLHGMRYTSYGRHFTKLDKLQLIVDKLQ 118

Query: 697 WYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRP 756
           WY++SGDT+VDFCCG+NDFS +++  LE  GK+C ++NYDLIQPKNDF+FE+RDWMTV+P
Sbjct: 119 WYIQSGDTVVDFCCGSNDFSLLLREILESSGKNCFYKNYDLIQPKNDFNFERRDWMTVQP 178

Query: 757 EELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQK-ASYNLI 815
           +ELP G +L+MGLNPPFG KASLAN+FI++AL FKPKLI+LIVP+ET RLD+K   Y LI
Sbjct: 179 DELPTGCRLVMGLNPPFGTKASLANQFINKALTFKPKLIILIVPKETERLDKKYPPYELI 238

Query: 816 WEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHITVE 875
           W+D++ LSGKSFYLPGSLD  +  +EQWN  PPPL LWSR+DW   H +IA   GHI+ E
Sbjct: 239 WQDSDQLSGKSFYLPGSLDADNKVMEQWNVSPPPLSLWSRSDWAERHSEIAKSMGHISNE 298


>gi|52354517|gb|AAU44579.1| hypothetical protein AT5G48090 [Arabidopsis thaliana]
 gi|60547933|gb|AAX23930.1| hypothetical protein At5g48090 [Arabidopsis thaliana]
          Length = 364

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 180/302 (59%), Positives = 231/302 (76%), Gaps = 3/302 (0%)

Query: 572 EMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAAL 631
           E+E+ ++ +I +  SSFN +EF+KS       ++HS +   K+IT GLV   V A RAAL
Sbjct: 32  EVEQRVMGIIDEVESSFNFDEFVKSRRPTHIKSYHSGNYFSKNITTGLVRTHVNAARAAL 91

Query: 632 EMLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEI 691
           +M + G D  DA+A+  P+ L Q+ + K+KL++  +PFLHGMRYTSFGRHFT  EKLKEI
Sbjct: 92  KMFEEGRD-RDARALFDPDSLLQLMKHKKKLEI--SPFLHGMRYTSFGRHFTNPEKLKEI 148

Query: 692 VDRLHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDW 751
           V+RLHWYV +GDT+VDFCCG+NDFSC+MK KL + GK C ++N DLI PKN+F+FE RDW
Sbjct: 149 VERLHWYVENGDTVVDFCCGSNDFSCLMKEKLMETGKICFYKNLDLIPPKNNFNFEMRDW 208

Query: 752 MTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKAS 811
           ++V+ EELPDGSQLIMGLNPPFG KASLAN FI +AL+FKPK+++LIVP ET+R+D    
Sbjct: 209 LSVKEEELPDGSQLIMGLNPPFGYKASLANTFIKKALEFKPKILILIVPSETKRVDAIDD 268

Query: 812 YNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGH 871
           Y LIWED  +L+G SFYLPGS+DV+D  +EQWN   PPLYLWSR D + SHK  A+ +GH
Sbjct: 269 YELIWEDRNLLAGMSFYLPGSVDVNDKTIEQWNNITPPLYLWSRRDLSRSHKTTAVQQGH 328

Query: 872 IT 873
           IT
Sbjct: 329 IT 330


>gi|240256412|ref|NP_199620.4| EDM2-like protein1 [Arabidopsis thaliana]
 gi|332008234|gb|AED95617.1| EDM2-like protein1 [Arabidopsis thaliana]
          Length = 677

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 180/302 (59%), Positives = 231/302 (76%), Gaps = 3/302 (0%)

Query: 572 EMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAAL 631
           E+E+ ++ +I +  SSFN +EF+KS       ++HS +   K+IT GLV   V A RAAL
Sbjct: 345 EVEQRVMGIIDEVESSFNFDEFVKSRRPTHIKSYHSGNYFSKNITTGLVRTHVNAARAAL 404

Query: 632 EMLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEI 691
           +M + G D  DA+A+  P+ L Q+ + K+KL++  +PFLHGMRYTSFGRHFT  EKLKEI
Sbjct: 405 KMFEEGRD-RDARALFDPDSLLQLMKHKKKLEI--SPFLHGMRYTSFGRHFTNPEKLKEI 461

Query: 692 VDRLHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDW 751
           V+RLHWYV +GDT+VDFCCG+NDFSC+MK KL + GK C ++N DLI PKN+F+FE RDW
Sbjct: 462 VERLHWYVENGDTVVDFCCGSNDFSCLMKEKLMETGKICFYKNLDLIPPKNNFNFEMRDW 521

Query: 752 MTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKAS 811
           ++V+ EELPDGSQLIMGLNPPFG KASLAN FI +AL+FKPK+++LIVP ET+R+D    
Sbjct: 522 LSVKEEELPDGSQLIMGLNPPFGYKASLANTFIKKALEFKPKILILIVPSETKRVDAIDD 581

Query: 812 YNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGH 871
           Y LIWED  +L+G SFYLPGS+DV+D  +EQWN   PPLYLWSR D + SHK  A+ +GH
Sbjct: 582 YELIWEDRNLLAGMSFYLPGSVDVNDKTIEQWNNITPPLYLWSRRDLSRSHKTTAVQQGH 641

Query: 872 IT 873
           IT
Sbjct: 642 IT 643



 Score =  199 bits (506), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 113/306 (36%), Positives = 167/306 (54%), Gaps = 40/306 (13%)

Query: 160 ERDKDLANSKFLLSFMANHPETNTFHEDVRTTEKPKVIVDWNEDEDSERVD--------- 210
           +RD+DL  SKFL++F+    +T     ++ T           +D   E +          
Sbjct: 2   KRDEDLTKSKFLITFLGKTSQTTPIEVELPTDHLQDAQTPMEQDSIVENMTDEENSSSSD 61

Query: 211 ----KDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYT-QAQIDAVP 265
                +  +  VC+ICD+GG V  C+G CLRSFH TI  G    C+SLG+T + QI A+ 
Sbjct: 62  DDSEANLQFDPVCSICDNGGYVLCCEGSCLRSFHPTIADGIETECESLGFTDKTQIQALG 121

Query: 266 NFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLA 325
            +LC NC+Y++HQC+ACG LGSSD++ SQ+VFPC ++ CG FYHPECV++LL  D+++ A
Sbjct: 122 TYLCNNCLYKQHQCYACGELGSSDENFSQQVFPCSASNCGHFYHPECVARLLCADDQNKA 181

Query: 326 EELRERIAAGESFTCPVHKCFVCQQSEDMN----------------VEDLQLAICRRCPK 369
           EEL+ +IAA + F CP+H C +C  SED N                 E++ L I      
Sbjct: 182 EELQAKIAARDCFACPLHTCKLCNMSEDKNQYACILLYADVAQQLITENVYLGILLLNST 241

Query: 370 AYHRKCLP-TEITFSDADENNFQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFPG 428
              R C    ++ F          A++ ++  +++L     H+I   + TPARDHL FP 
Sbjct: 242 LTMRHCKELGKVCF-------LTTAFLYIVSFHKLLNRA--HEIDGLILTPARDHLVFPD 292

Query: 429 VEGKRK 434
           V G+R+
Sbjct: 293 VSGQRR 298


>gi|238481509|ref|NP_001154768.1| EDM2-like protein1 [Arabidopsis thaliana]
 gi|10177760|dbj|BAB11073.1| unnamed protein product [Arabidopsis thaliana]
 gi|332008235|gb|AED95618.1| EDM2-like protein1 [Arabidopsis thaliana]
          Length = 636

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 180/302 (59%), Positives = 231/302 (76%), Gaps = 3/302 (0%)

Query: 572 EMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAAL 631
           E+E+ ++ +I +  SSFN +EF+KS       ++HS +   K+IT GLV   V A RAAL
Sbjct: 304 EVEQRVMGIIDEVESSFNFDEFVKSRRPTHIKSYHSGNYFSKNITTGLVRTHVNAARAAL 363

Query: 632 EMLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEI 691
           +M + G D  DA+A+  P+ L Q+ + K+KL++  +PFLHGMRYTSFGRHFT  EKLKEI
Sbjct: 364 KMFEEGRD-RDARALFDPDSLLQLMKHKKKLEI--SPFLHGMRYTSFGRHFTNPEKLKEI 420

Query: 692 VDRLHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDW 751
           V+RLHWYV +GDT+VDFCCG+NDFSC+MK KL + GK C ++N DLI PKN+F+FE RDW
Sbjct: 421 VERLHWYVENGDTVVDFCCGSNDFSCLMKEKLMETGKICFYKNLDLIPPKNNFNFEMRDW 480

Query: 752 MTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKAS 811
           ++V+ EELPDGSQLIMGLNPPFG KASLAN FI +AL+FKPK+++LIVP ET+R+D    
Sbjct: 481 LSVKEEELPDGSQLIMGLNPPFGYKASLANTFIKKALEFKPKILILIVPSETKRVDAIDD 540

Query: 812 YNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGH 871
           Y LIWED  +L+G SFYLPGS+DV+D  +EQWN   PPLYLWSR D + SHK  A+ +GH
Sbjct: 541 YELIWEDRNLLAGMSFYLPGSVDVNDKTIEQWNNITPPLYLWSRRDLSRSHKTTAVQQGH 600

Query: 872 IT 873
           IT
Sbjct: 601 IT 602



 Score =  189 bits (481), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 100/235 (42%), Positives = 142/235 (60%), Gaps = 27/235 (11%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYT-QAQIDAVPNFLCQNCVYQE 276
           VC+ICD+GG V  C+G CLRSFH TI  G    C+SLG+T + QI A+  +LC NC+Y++
Sbjct: 32  VCSICDNGGYVLCCEGSCLRSFHPTIADGIETECESLGFTDKTQIQALGTYLCNNCLYKQ 91

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           HQC+ACG LGSSD++ SQ+VFPC ++ CG FYHPECV++LL  D+++ AEEL+ +IAA +
Sbjct: 92  HQCYACGELGSSDENFSQQVFPCSASNCGHFYHPECVARLLCADDQNKAEELQAKIAARD 151

Query: 337 SFTCPVHKCFVCQQSEDMN----------------VEDLQLAICRRCPKAYHRKCLP-TE 379
            F CP+H C +C  SED N                 E++ L I         R C    +
Sbjct: 152 CFACPLHTCKLCNMSEDKNQYACILLYADVAQQLITENVYLGILLLNSTLTMRHCKELGK 211

Query: 380 ITFSDADENNFQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFPGVEGKRK 434
           + F          A++ ++  +++L     H+I   + TPARDHL FP V G+R+
Sbjct: 212 VCF-------LTTAFLYIVSFHKLLNR--AHEIDGLILTPARDHLVFPDVSGQRR 257


>gi|357153232|ref|XP_003576383.1| PREDICTED: uncharacterized protein LOC100846531 [Brachypodium
           distachyon]
          Length = 683

 Score =  371 bits (952), Expect = e-99,   Method: Compositional matrix adjust.
 Identities = 194/441 (43%), Positives = 280/441 (63%), Gaps = 23/441 (5%)

Query: 4   SEEDDDT--LTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQ--VFLHGNIGDG 59
           S++DDD+    + V DY F+D ++ ++ FSVLP+ W   D V   E +  VFL G +  G
Sbjct: 2   SDDDDDSEPQVVVVKDYYFVDADKNALCFSVLPI-WFKEDAVAVPECKTGVFLRGTVDPG 60

Query: 60  PWNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQ 119
              + +QV+AWK  L   QP++ VLSK+  WI L  PK S++   RTI ITV  L+ +++
Sbjct: 61  IPVY-KQVVAWKLGLDARQPDLAVLSKEGGWINLSKPKNSYEESFRTIFITVQMLHFLRR 119

Query: 120 NPEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHP 179
            PE   K +   + KVF  ++V PS++D  +H  + ++ AE+D  LANS+ L  F+    
Sbjct: 120 KPEEPEKDLWIHLRKVFDKFDVRPSKDDFRNHHTLMKQFAEKDLRLANSEILKVFIGERF 179

Query: 180 ETNTFHEDVRTTEKPKVIVDWNEDEDSERVDKDENY-----------FAVCAICDDGGDV 228
                  D    E  +  +    DED E +  D+N             + CAICD+GGD+
Sbjct: 180 RKQISEVDSGNFEVKESFI--AADEDVEDIVADDNVESDEDGDDDLFDSTCAICDNGGDL 237

Query: 229 TFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSS 288
             CDG C+RSFHA I  G+++ CQ+LG+T+A+++A+  FLC+NC Y++HQCF CG+L  S
Sbjct: 238 LGCDGPCMRSFHAKIGTGEDSYCQTLGFTEAEVEAMKTFLCKNCEYKQHQCFICGVLEPS 297

Query: 289 DKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVC 348
           D  +++ VF C +ATCG FYHP+CV++ LHP+N+  A E  ++IA G SFTC +H CF C
Sbjct: 298 DGPTAK-VFLCNNATCGYFYHPKCVAQQLHPNNKIEALEKEKKIAGGSSFTCSIHWCFCC 356

Query: 349 QQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRAWVDLLPNNRILIYCL 408
           +  ED   E LQ A+CRRCPK+YHRKCLP+EI F D+DE+   RAW DL  + RILIYC+
Sbjct: 357 KGLEDRTEEHLQFAVCRRCPKSYHRKCLPSEIPFEDSDEDIVTRAW-DL--SQRILIYCM 413

Query: 409 EHKIISELKTPARDHLKFPGV 429
           EH+I  +++TP R+H+KFPG+
Sbjct: 414 EHEIDLDIETPVRNHIKFPGL 434



 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 48/82 (58%), Gaps = 2/82 (2%)

Query: 577 LLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDG 636
           ++AL++   SS    +  +  ++P TH +  +   +K I  G +E SV+AVR AL +L  
Sbjct: 599 VIALVEKEVSSLTLNDISRKCLMPSTHVYSGRQT-DKIIATGKLERSVQAVRQALHLLAV 657

Query: 637 GCDIEDAKAVCPPEILCQIFQW 658
           G D+  AKA C P++L Q+ +W
Sbjct: 658 G-DVNTAKATCEPQVLKQLARW 678


>gi|224061083|ref|XP_002300346.1| predicted protein [Populus trichocarpa]
 gi|222847604|gb|EEE85151.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 198/432 (45%), Positives = 280/432 (64%), Gaps = 30/432 (6%)

Query: 281 ACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTC 340
           AC  LG +++    EVF C +ATCG FYHP C + +LH +++  AEELR++IAAG+SF C
Sbjct: 57  ACESLGFTNREV--EVFRCANATCGHFYHPHCAATMLHREDKVAAEELRKKIAAGDSFAC 114

Query: 341 PVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITF-SDADENNFQRAWVDLLP 399
           P+HKC +C+Q ED    DLQ A+CRRCP +YH+KCLP EI F ++ADE+   RAW +LLP
Sbjct: 115 PIHKCCICKQVEDKKKSDLQFAVCRRCPTSYHQKCLPKEIAFENEADEDTIARAWQNLLP 174

Query: 400 NNRILIYCLEHKIISELKTPARDHLKFPGVEGK------RKKEDLELLLTEEKDVASKRN 453
           N RILIYCL+H II ++ TP RDH++FP V GK      +K++  EL   EE+ ++ K+ 
Sbjct: 175 N-RILIYCLKHDIIEDIGTPVRDHIRFPDVGGKNTAAKVQKRKTSELPANEEESLSKKKR 233

Query: 454 IVSESFVADKTVVKKLKLAEVYSGAD-VGMSNSEIKKRWPRQDVHSLKKPNITDTGRKSL 512
           + SE   +     K  K  E++S A  V ++N          + +SL K  + +  RKS+
Sbjct: 234 LTSEESFSGTFRTKASK--EMFSSAKIVKITNDS---EQISSESNSLGKMRMNNPSRKSI 288

Query: 513 KD-------------IADKSKPSLR-KDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRR 558
           ++              A+ +K SL  K    +  +S   K   ++I GS++  S++ +  
Sbjct: 289 RENTKSASSEVERSTAANVNKTSLGDKLYGFMTIKSGKAKLRKQDIFGSELDKSLAVKSV 348

Query: 559 MKKVNISQPSVDAEMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLG 618
            KK+    PS+DA+ ++ LLAL+K++ SS   +  +K H VP TH H SK++++K+ITLG
Sbjct: 349 GKKLTSELPSLDADTQRRLLALVKEAASSITLDNVIKKHEVPSTHVHSSKNVVDKNITLG 408

Query: 619 LVEGSVKAVRAALEMLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSF 678
            VEG+V+AVR AL+ L+  C IEDAKAVC P++L Q+F+WK KL VYLAPFL+GMRYTSF
Sbjct: 409 KVEGTVEAVRTALKKLEEKCSIEDAKAVCEPDVLNQVFKWKNKLKVYLAPFLYGMRYTSF 468

Query: 679 GRHFTKVEKLKE 690
           GRHFTKVEKL E
Sbjct: 469 GRHFTKVEKLVE 480


>gi|414588821|tpg|DAA39392.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 731

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 237/699 (33%), Positives = 373/699 (53%), Gaps = 89/699 (12%)

Query: 15  VSDYEFLD--QNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGPWNFNEQVIAWKF 72
           V  Y F+D   + E + FS LP  +   D +   + ++ L G  GD       +V+AW  
Sbjct: 18  VESYYFVDSTNDNEPVCFSTLPFLFGDTDDLPDCKKRLVLRGTGGDPGVKVYSEVVAWSL 77

Query: 73  ELSYAQPEIWVLSKQNN-WIMLQSPKKSFKNIVRTILITVHWLYCVKQNPEASRKYIREQ 131
            L   QP+  VL+ +   W+ L  P  S++ ++RT+LIT H L+ + + P    + +   
Sbjct: 78  GLEGKQPQFAVLTAEGRRWVRLVRPLNSYEEMIRTVLITAHMLHFLMRKPHEPERTLWNH 137

Query: 132 MLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANS-----------------KFLLSF 174
           + KVF  ++V PSE+D  +H+ + ++   +D  LA S                 +FLLS+
Sbjct: 138 LCKVFNKFDVRPSEDDFRNHISLMKQFVAKDPILAKSEGSLPHEWGLTKGKTKAQFLLSY 197

Query: 175 MANHP---------------------------ETNTFHEDVRTT------EKPKVIVDWN 201
           +                               E    H DV+        +  ++IV+  
Sbjct: 198 VMPRAVLCKVVNIVLKTILRVFVEGKSGKIFTEAGADHIDVKQPFITDDEDIDEMIVEDA 257

Query: 202 EDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQI 261
           ++E  E  D D  + ++CAICDDGGD+  CDG C+RSFHA   +G+++ C +LGYT+A++
Sbjct: 258 DNESDEEEDGDL-FDSICAICDDGGDILCCDGPCMRSFHAKEGSGEDSYCDTLGYTEAEV 316

Query: 262 DAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDN 321
           +A+  FLC+NC Y++HQCF CG+L  S+ ++++ VF C +ATCG FYHP+CV++ LHP+N
Sbjct: 317 EAMKLFLCKNCEYKQHQCFICGVLEPSNGAAAK-VFLCNNATCGHFYHPKCVAQQLHPNN 375

Query: 322 ESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEIT 381
              A EL ++I  G SFTCP+H CF C+  ED     LQ A+CRRCPK+YHRKCLP EI+
Sbjct: 376 IDEASELEKKITDGFSFTCPIHWCFHCKGLEDRTQGPLQFAVCRRCPKSYHRKCLPREIS 435

Query: 382 FSDADENN-FQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFPGVEGKRK--KEDL 438
           F + +E     RAW +L  +  ILIYCL H+I SEL+TP RDH+KFP +E   +  K+  
Sbjct: 436 FDEIEEKGVIIRAW-EL--SKMILIYCLNHEIDSELETPVRDHIKFPRIEKPLQFVKKGA 492

Query: 439 ELLLTEEKDVASKRNIVSESFVADKTVVKKLKLAEVYSGADVGMSNSEIKKRWPRQ-DVH 497
            LL  ++K   S            ++++++     V   + V M  SE  K   R+  V 
Sbjct: 493 NLLPKKKKRTYS------------ESILEQPPKDTVKMKSKVRMQESEQNKHDTREVSVK 540

Query: 498 SLKKPNITDTGRKSLKDIADKSK-PSLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDR 556
           S  + N+ DT  K    + +  + P + KD  +   +S  VK   + +    +   +S  
Sbjct: 541 SFTQ-NVVDTPVKKKGKLKENKQLPYVAKDHFVSSPKS--VKEQEQEL----VPLPLSGI 593

Query: 557 RRMKKVNISQPSVDAEMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSIT 616
           R+      S P VD+E+EK +++++ +  SS   ++  ++  +P TH +  +      + 
Sbjct: 594 RKS-----SFPKVDSEIEKRVISILGNEISSLTIKDVTRNCSLPSTHVYSGRP--TDKMP 646

Query: 617 LGLVEGSVKAVRAALEMLDGGCDIEDAKAVCPPEILCQI 655
           LG +E SV+AV AAL+ L  G  + DAKAVC P++L Q+
Sbjct: 647 LGKIERSVQAVEAALKKLKNGGSVNDAKAVCEPDVLRQL 685


>gi|115475968|ref|NP_001061580.1| Os08g0337300 [Oryza sativa Japonica Group]
 gi|38424060|dbj|BAD01750.1| unknown protein [Oryza sativa Japonica Group]
 gi|38637134|dbj|BAD03388.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623549|dbj|BAF23494.1| Os08g0337300 [Oryza sativa Japonica Group]
          Length = 585

 Score =  356 bits (914), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 202/513 (39%), Positives = 300/513 (58%), Gaps = 50/513 (9%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGP 60
           M  S++D +     V +Y F+D N+  +SF VLP Q+D  + V S +  V+L G    G 
Sbjct: 1   MMSSDDDLEPQLKAVENYYFVDDNDVPVSFDVLPFQFDAAEGVASFKKDVYLRGFTDGGL 60

Query: 61  WNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQN 120
               +QV+AWK  L    PEI VLS + +WI L  P+ S++  +R++LITV  L+ V++ 
Sbjct: 61  QKVYKQVVAWKLVLDGDSPEIAVLSTEGSWIALLKPRPSYEETIRSVLITVEMLHFVRRR 120

Query: 121 PEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFM----- 175
           P  S K + + +  VF  + V P E+D  +H  + +  A+RD DLANS+ L  F+     
Sbjct: 121 PTDSEKDMWDHLYGVFERFVVRPLEDDFANHQNLIKLFAQRDPDLANSQVLQVFIKDKIM 180

Query: 176 --ANHPETNTF----HEDVR----TTEKPKVIVDWNEDEDSERV-DKDEN---------- 214
              N   +N        D++      ++P    D  E+   E + D   N          
Sbjct: 181 EKTNEVGSNNLDNKREPDIKQEPDIKQEPVAAGDEMEEIVEEGIPDAPSNDDDDDEEDEE 240

Query: 215 ----YFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQ 270
               + +VCAICD+GG++  C+G C+RSFHA I  G+++ C +LGYT+A++ A+ NF+C+
Sbjct: 241 DGDLFDSVCAICDNGGELLCCEGPCMRSFHAKIRDGEDSYCATLGYTKAEVKALKNFVCK 300

Query: 271 NCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRE 330
           NC +++HQCF CG L  SD  +++ VF C +ATCG FYHP CV++LLHP++ + A E+ +
Sbjct: 301 NCDHKQHQCFVCGELEPSDGPNAK-VFLCNNATCGHFYHPRCVAQLLHPNSRNEASEMEK 359

Query: 331 RIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENN- 389
           +I AG SFTCPVH CF C+  ED   E LQ A+CRRCP++YHRKCLP EI+F D +    
Sbjct: 360 KIMAGFSFTCPVHWCFHCKGLEDRTQEPLQFAVCRRCPRSYHRKCLPREISFEDINTQGI 419

Query: 390 FQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFPGVE-----GKRKKEDLELLLTE 444
             RAW +L  + RILIYCL+H+I  ++ TP RDH+KFP VE      K+K ++L      
Sbjct: 420 ITRAW-EL--SKRILIYCLDHEIDLDIGTPPRDHIKFPHVEKSAYSAKKKVKEL------ 470

Query: 445 EKDVASKRNIVSESFVADKTVVKKLKLAEVYSG 477
                 KR I  +S+V++  + K+ KL E ++ 
Sbjct: 471 ---AEKKRRICDDSYVSE-PLQKRAKLNEKFNA 499


>gi|38424061|dbj|BAD01751.1| unknown protein [Oryza sativa Japonica Group]
 gi|38637135|dbj|BAD03389.1| unknown protein [Oryza sativa Japonica Group]
 gi|215704583|dbj|BAG94216.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 653

 Score =  340 bits (871), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 154/215 (71%), Positives = 178/215 (82%), Gaps = 1/215 (0%)

Query: 659 KRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWYVRSGDTIVDFCCGANDFSCM 718
           + KL VYLAPF+HGMRYTSFGRHFTK EKL EI ++LHWYV+ GD IVDF CG NDFS  
Sbjct: 9   QNKLRVYLAPFIHGMRYTSFGRHFTKKEKLIEIAEKLHWYVQPGDMIVDFSCGTNDFSQF 68

Query: 719 MKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVKAS 778
           MK KL+++GK C+F+NYD+IQPKN FSFEKRDWMTVR +ELP GS+LIMGLNPPFG KA 
Sbjct: 69  MKEKLDKVGKRCNFKNYDVIQPKNSFSFEKRDWMTVRQKELPHGSKLIMGLNPPFGPKAM 128

Query: 779 LANKFISQALKFKPKLIVLIVPQETRRLDQKAS-YNLIWEDNEVLSGKSFYLPGSLDVHD 837
           LANKFI +AL FKPKLI+LIVP+E  RLD+K   Y+L+WED++ LSGKSFYLPGSLDV D
Sbjct: 129 LANKFIDKALTFKPKLIILIVPKEAERLDRKQQPYDLVWEDDQRLSGKSFYLPGSLDVSD 188

Query: 838 NQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHI 872
            Q++QWN  PPPLYLWSR DWT  HK+IA   GH 
Sbjct: 189 KQIDQWNKSPPPLYLWSRPDWTQKHKRIAEQHGHT 223


>gi|168059731|ref|XP_001781854.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666661|gb|EDQ53309.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1275

 Score =  339 bits (870), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 284/937 (30%), Positives = 429/937 (45%), Gaps = 177/937 (18%)

Query: 22   DQNEESISFSVLPLQWDGCDI--VGSSEMQVFLHGNIGDGPWNFNEQVIAWKFELSYAQP 79
            D+N + +S   +PL+++G       S+  +V+LHG  G G +     + AWK        
Sbjct: 180  DENNKPVSLCDVPLEFEGRSTKDFASTSTEVYLHGKSG-GEY-IMRSIRAWKLAFP-GNG 236

Query: 80   EIWVLSKQN------NWIMLQSPKKSFKNIVRTILITVHWLYCVKQNPEAS--------- 124
               +L K N      NW+ L  P+  +K    ++     +L  +K+ P AS         
Sbjct: 237  TFNILVKINTQKGDTNWVKLGKPQFGYKETAESVFCIAKFLALLKEEPHASGDLVGKRLD 296

Query: 125  ---------------RKYIREQMLKVFCSYEVEPSENDLLDHMPIS----------REAA 159
                           R  + EQ + +    +      DL  H   S          R   
Sbjct: 297  EWDVPKATIYGHLSKRFAVVEQYINLDSGLKDSDVSVDLKVHSADSGVSVLALSTFRSVV 356

Query: 160  ERDKDLANSKFLLSFMANHPETNT--------FHEDVRTTE--KP--KVIVDWNED---- 203
             +DK   + K +LS  ++  + N         F   V   E  KP  +++ D  ++    
Sbjct: 357  FKDKAANSGKRMLSDDSSDEDGNLRVKRHKKDFGNTVDNNEVGKPEDEIMHDSGDEVNFV 416

Query: 204  EDSERVDKDENYF----AVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQA 259
            ED + V+ DE       ++C ICDDGG++  CDG C+RSFHA      N  C++L  T++
Sbjct: 417  EDEKDVEIDEASEELGDSLCIICDDGGELLCCDGPCMRSFHAIRDPQHN--CETLKLTKS 474

Query: 260  QI-DAVPNFLCQNCVYQEHQCFACGMLGSSDK--SSSQEVFPCVSATCGQFYHPECVSK- 315
             I   V  +LC+NCVY++HQC+ CG LG SD+     +EVF C  A CG+FYHP CV+  
Sbjct: 475  AIAKMVGQWLCKNCVYKKHQCYVCGELGDSDEDLGVKREVFVCDVACCGKFYHPTCVANE 534

Query: 316  --LLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHR 373
              L   D + LA  ++   A  ESF CP+H+C  C + ED   E L LA CRRCP  +H+
Sbjct: 535  IALTEDDRKLLATNIQ---AGVESFACPLHRCKKCGKGEDSTEESLLLARCRRCPATWHK 591

Query: 374  KCLPTEITFSDA--DENNFQRAWVDLLPNNRILIYCL----------------------- 408
            KCLP+ I F++   D++   RAW       R +IYC                        
Sbjct: 592  KCLPSGIRFAEETDDKHPEVRAWAFGDVYKRFIIYCTGNNGNNQLPICHYSPGPNEVVSK 651

Query: 409  ---------EHKIISELKTPARDHLKFPGVEGKRKKEDLELLLTEEKDVASKRNIVSESF 459
                     +HKI+ +L TP R+H+KFP V    +  D E+    +  VA++ N VS S 
Sbjct: 652  SYTLRSIYRKHKIMKDLGTPKRNHVKFPRV--LERPGDPEVFKQLKSKVAAEAN-VSPSL 708

Query: 460  VADKTVVKKLKLAEVYSGADVGMSNSEIKKRWPRQDVHSLKKPNITDTGRKSLKDIADKS 519
             A  +    LK   V   A    S  E+  +  R +                ++++   S
Sbjct: 709  EAKAS--SSLKPILVDKSAKKKHSGVELLAKEKRPE---------------EIRELPLAS 751

Query: 520  KPSLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKELLA 579
             P L   + +LK      K   R + G   +++     +  K+ ++ PS+  E       
Sbjct: 752  PPRLSISADMLKKSEDAAK---RVVLGVIAESAQKVTDQSVKLKLALPSIYRERR----- 803

Query: 580  LIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDG-GC 638
                    F+   FMK                      G  +  +KA R  LE L     
Sbjct: 804  -------GFS---FMKQ---------------------GHKDSILKAARIVLEKLRSKKI 832

Query: 639  DIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWY 698
             +E+AKA+C P  + +I   +  L  +LAP L+G RY S+GRHFTK+EKL+++V RLH  
Sbjct: 833  TVEEAKAMCNPSNINRIELEEDYLKTFLAPTLYGDRYVSYGRHFTKLEKLRKVVRRLHPL 892

Query: 699  VRSGDTIVDFCCGANDFSCMMKVKLEQMGKS-CSFRNYDLIQPKNDFSFEKRDWMTVRPE 757
            +   DTIVDF CG N FS ++   L   GK+   F+N+D+  PK+ F FE++DW   +  
Sbjct: 893  IWDHDTIVDFSCGDNSFSRLLHNALSDAGKTHLKFKNFDIFPPKDTFEFERKDWFDTKAT 952

Query: 758  ELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKASYNLIWE 817
            E   G  L++GLNPPF      A KF++ AL  +P+L+VLI P    +++ +  YN + +
Sbjct: 953  EFGSGENLVIGLNPPF----PFAEKFVAHALMMQPRLLVLITPPLKNKMEHRG-YNCLED 1007

Query: 818  DNEVLSGKSFYLPGSL-DVHDNQLEQWNCKPPPLYLW 853
            D+ +L   SFY+PGSL D H   L+Q N   P   ++
Sbjct: 1008 DHLLLDDNSFYVPGSLHDQHMESLQQVNAVVPHFCIY 1044


>gi|52354515|gb|AAU44578.1| hypothetical protein AT5G48090 [Arabidopsis thaliana]
          Length = 272

 Score =  335 bits (858), Expect = 9e-89,   Method: Compositional matrix adjust.
 Identities = 155/241 (64%), Positives = 193/241 (80%), Gaps = 3/241 (1%)

Query: 633 MLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIV 692
           M + G D  DA+A+  P+ L Q+ + K+KL++  +PFLHGMRYTSFGRHFT  EKLKEIV
Sbjct: 1   MFEEGRD-RDARALFDPDSLLQLMKHKKKLEI--SPFLHGMRYTSFGRHFTNPEKLKEIV 57

Query: 693 DRLHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWM 752
           +RLHWYV +GDT+VDFCCG+NDFSC+MK KL + GK C ++N DLI PKN+F+FE RDW+
Sbjct: 58  ERLHWYVENGDTVVDFCCGSNDFSCLMKEKLMETGKICFYKNLDLIPPKNNFNFEMRDWL 117

Query: 753 TVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKASY 812
           +V+ EELPDGSQLIMGLNPPFG KASLAN FI +AL+FKPK+++LIVP ET+R+D    Y
Sbjct: 118 SVKEEELPDGSQLIMGLNPPFGYKASLANTFIKKALEFKPKILILIVPSETKRVDAIDDY 177

Query: 813 NLIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHI 872
            LIWED  +L+G SFYLPGS+DV+D  +EQWN   PPLYLWSR D + SHK  A+ +GHI
Sbjct: 178 ELIWEDRNLLAGMSFYLPGSVDVNDKTIEQWNNITPPLYLWSRRDLSRSHKTTAVQQGHI 237

Query: 873 T 873
           T
Sbjct: 238 T 238


>gi|449453220|ref|XP_004144356.1| PREDICTED: uncharacterized protein LOC101205205 [Cucumis sativus]
          Length = 707

 Score =  326 bits (835), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 190/414 (45%), Positives = 255/414 (61%), Gaps = 34/414 (8%)

Query: 295 EVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDM 354
           +VF CV+ATCG FYHP+C+S+LLHP+N+  A +L ++IA+GESF+CPVHKC VC   E+ 
Sbjct: 9   QVFQCVNATCGYFYHPKCISRLLHPENKVAAGDLEKKIASGESFSCPVHKCSVCALGENK 68

Query: 355 NVEDLQLAICRRCPKAYHRKCLPTEITFSDA-DENNFQRAWVDLLPNNRILIYCLEHKII 413
            + +LQ A+CRRCPK+YHRKCLP +ITF  + D     RAW  LLP NRILIYCL+H+I 
Sbjct: 69  KIWELQFAVCRRCPKSYHRKCLPRKITFEGSEDGETPTRAWEKLLP-NRILIYCLDHEID 127

Query: 414 SELKTPARDHLKFPGVEGKR---KKEDLELLLTEEKDV----ASKRNIVSESF-----VA 461
            E++TPARDH+KFPG+E  R   +K  L +  T          S+ N+VS+       + 
Sbjct: 128 EEIETPARDHIKFPGLEESRLPIQKRKLPISDTRRGKTIVFRGSRENVVSKKGSMPDDLQ 187

Query: 462 DKTVVKKLKLAEVYSGADVGMSNSEIKKRWPRQD----VHSLKKPNITDTGRKSLKDIAD 517
            K+  K  K  E  S     +     K  W  +     + ++ + ++   G   L DI  
Sbjct: 188 GKSAAKVSKSFERSSSDGKLLGKVTAKSLWSSESKNVKLGNISRNSLNQKGESVLMDIDK 247

Query: 518 KSKPSLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKEL 577
             K  ++K S + KS      P  R  D SK+     DR  M         +DA  E+ L
Sbjct: 248 TIK--VKKSSLVGKS----AIPTKR-FDPSKIYK--EDRSGML-------LLDANSERRL 291

Query: 578 LALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDGG 637
           + ++K+  SS   E+ +K H VP THA+  KH+++K+I +G +EGSV AVRAAL  L+ G
Sbjct: 292 MDMMKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSVVAVRAALRKLEEG 351

Query: 638 CDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEI 691
           C IEDA+AVC PE+L  IF+WK KL VYLAPFL+GMRY+SFGRHFTKVEKL EI
Sbjct: 352 CCIEDAEAVCEPEVLNHIFKWKNKLRVYLAPFLYGMRYSSFGRHFTKVEKLVEI 405


>gi|449456251|ref|XP_004145863.1| PREDICTED: uncharacterized protein LOC101205827 [Cucumis sativus]
          Length = 464

 Score =  321 bits (823), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 186/411 (45%), Positives = 252/411 (61%), Gaps = 30/411 (7%)

Query: 296 VFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMN 355
           VF CV+ATCG FYHP+C+S+LLHP+N+  A +L ++IA+GESF+CPVHKC VC   E+  
Sbjct: 56  VFQCVNATCGYFYHPKCISRLLHPENKVAAGDLEKKIASGESFSCPVHKCSVCALGENKK 115

Query: 356 VEDLQLAICRRCPKAYHRKCLPTEITFSDADENNF-QRAWVDLLPNNRILIYCLEHKIIS 414
           + +LQ A+CRRCPK+YHRKCLP +ITF  +++     RAW  LLP NRILIYCL+H+I  
Sbjct: 116 IWELQFAVCRRCPKSYHRKCLPRKITFEGSEDGETPTRAWEKLLP-NRILIYCLDHEIDE 174

Query: 415 ELKTPARDHLKFPGVEGKR---KKEDLELLLTEEKDV----ASKRNIVSESF-----VAD 462
           E++TPARDH+KFPG+E  R   +K  L +  T          S+ N+VS+       +  
Sbjct: 175 EIETPARDHIKFPGLEESRLPIQKRKLPISDTRRGKTIVFRGSRENVVSKKGSMPDDLQG 234

Query: 463 KTVVKKLKLAEVYSGADVGMSNSEIKKRWPRQDVHSLKKPNITDT--GRKSLKDIADKSK 520
           K+  K  K  E  S     +     K  W  +   ++K  NI+     +K    + D  K
Sbjct: 235 KSAAKVSKSFERSSSDGKLLGKVTAKSLWSSES-ENVKLGNISRNCLNQKGESVLMDIDK 293

Query: 521 PSLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKELLAL 580
               K S+L+   +   K      D SK+     DR  M         +DA  E+ L+ +
Sbjct: 294 TIKVKKSSLVGKSAIPTK----RFDPSKIYK--EDRSGM-------LLLDANSERRLMDM 340

Query: 581 IKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDGGCDI 640
           +K+  SS   E+ +K H VP THA+  KH+++K+I +G +EGSV  VRAAL  L+ GC I
Sbjct: 341 MKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSVVVVRAALRKLEEGCCI 400

Query: 641 EDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEI 691
           EDA+AVC PE+L  IF+WK KL VYLAPFL+GMRY+SFGRHFTKVEKL E+
Sbjct: 401 EDAEAVCEPEVLNHIFKWKNKLRVYLAPFLYGMRYSSFGRHFTKVEKLVEV 451


>gi|356524309|ref|XP_003530772.1| PREDICTED: uncharacterized protein LOC100811715 [Glycine max]
          Length = 407

 Score =  305 bits (781), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 162/375 (43%), Positives = 230/375 (61%), Gaps = 17/375 (4%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLD-QNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDG 59
           M  S+++ +   L VS+Y F+D +++E + FSVLP+QW   +     +MQ+FLHG + +G
Sbjct: 1   MASSDDEGEAQALSVSNYYFVDDKDDEPVCFSVLPIQWSESESPVGKKMQLFLHGFVDNG 60

Query: 60  PWNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITV-HWLYCVK 118
              F  QV+AW+F+ SY +PEI VLSK   WI+L+ P+KS+++ +    +   H+L  VK
Sbjct: 61  LQKFFGQVVAWRFDFSYVRPEISVLSKDGRWIILEKPRKSYEDTIIRTTLITTHFLSYVK 120

Query: 119 QNPEASRKYIREQMLK--VFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMA 176
           +NP++S K + +++ K   F SYEV P + DLL+HM +  EAA+RD  L  SK LL  + 
Sbjct: 121 KNPDSSAKSVWDKLSKNKEFRSYEVMPLQKDLLNHMALMGEAAKRDTALEKSKLLLMVLE 180

Query: 177 NHPETNT---FHEDVRTTEKPKVIVD------WNEDEDSERVDKDENYFAVCAICDDGGD 227
           +           E+V+   +P  I+D       +ED D E    DE + +VCAICD+GG 
Sbjct: 181 DKDMLKIKKLSDEEVKDLARPGFIIDDIDNGMIDEDSDGE----DELFDSVCAICDNGGQ 236

Query: 228 VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGS 287
           +  CDG+C+RSFHA    G+ + C SLG+++ ++D + NF C+NC Y +HQCFACG LG 
Sbjct: 237 LLCCDGKCMRSFHANEEDGEESTCASLGFSRKEVDEIQNFYCKNCEYNQHQCFACGTLGC 296

Query: 288 SDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFV 347
           SDK S  EVF C SA CG FYHP CV+KLLH   E   +EL  +IA G  FTCP H C  
Sbjct: 297 SDKFSGAEVFKCASAICGFFYHPHCVAKLLHGVVEDDLKELERKIAEGGPFTCPTHYCCE 356

Query: 348 CQQSEDMNVEDLQLA 362
           C++ ED    D   A
Sbjct: 357 CKEMEDKKKHDFHFA 371


>gi|414884833|tpg|DAA60847.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 429

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 175/435 (40%), Positives = 261/435 (60%), Gaps = 33/435 (7%)

Query: 228 VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGS 287
           + FCDG C+RSFHA   +G+++ C +LGYT+A+++A+  FLC+NC Y++HQCF CG+L  
Sbjct: 19  IFFCDGPCMRSFHAKEGSGEDSCCVTLGYTEAEVEAMKLFLCKNCEYKQHQCFICGVLEP 78

Query: 288 SDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFV 347
           SD  +++ VF C +ATCG FYHP+CV++ LHP N   A EL ++I  G SFTCP+H CF 
Sbjct: 79  SDGEAAK-VFLCNNATCGHFYHPKCVAQQLHPINIDEASELEKKITEGFSFTCPIHWCFH 137

Query: 348 CQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENN-FQRAWVDLLPNNRILIY 406
           C+  ED   E LQ A+CRRCPK+YHRKCLP+EI+F + +E++   RAW +L  + RILIY
Sbjct: 138 CKGLEDRTQEPLQFAVCRRCPKSYHRKCLPSEISFDENEEDDIIIRAW-EL--SKRILIY 194

Query: 407 CLEHKIISELKTPARDHLKFPGVEGKRK--KEDLELLLTEEKDVASKRNIVSESFVADKT 464
           CL+H+I SEL+TP RDH+KFP +E   +  K+   LL  ++K   S            +T
Sbjct: 195 CLDHEIDSELETPVRDHIKFPKIEKPLQFVKKGSSLLPKKKKRTYS------------ET 242

Query: 465 VVKKLKLAEVYSGADVGMSNSEIKKRWPRQDVHSLKKPNITDTGRKSLKDIADKSK-PSL 523
           V+++     V   + V M   E  K+  R+        N+ DT  K    + +K + P +
Sbjct: 243 VLEQSPKDTVKRKSKVHMLEREQNKQDTREVSAKSFTQNLVDTPVKKKAKLKEKIQLPYV 302

Query: 524 RKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKELLALIKD 583
            KD  +   +S  VK   + +    +   +S  R+      S P VD+E EK ++ ++ +
Sbjct: 303 AKDHFISSPKS--VKEQEQEL----VPLPLSAIRKS-----SFPKVDSETEKRVITILGN 351

Query: 584 STSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDGGCDIEDA 643
             SS   ++  ++  +P TH +  +  L   I LG +E SV+AV AAL+ L+ G  + DA
Sbjct: 352 KVSSLTIKDVTRNCSLPSTHVYSGR--LTDKIPLGKIERSVQAVEAALKKLENGGSVNDA 409

Query: 644 KAVCPPEILCQIFQW 658
           KAVC PE+L Q+ +W
Sbjct: 410 KAVCEPEVLRQLARW 424


>gi|414592040|tpg|DAA42611.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 1010

 Score =  285 bits (729), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 162/450 (36%), Positives = 239/450 (53%), Gaps = 51/450 (11%)

Query: 15  VSDYEFLDQNEESISFSVLPLQWDGCDIVGSS-EMQVFLHGNIGDGPWNFNEQVIAWKFE 73
           V+ Y F D  E+ + FS+LP Q+   D         VFL G + D      ++V+AWK  
Sbjct: 16  VNRYYFEDGEEKPVCFSILPFQFGEDDSEAVFLRKDVFLCGFV-DKNLPVYKEVVAWKIR 74

Query: 74  LSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQNPEASRKYIREQML 133
           L    P I+VLS ++ WI L  P+K + +IVR+ LITV  L+   +  + S  ++ + + 
Sbjct: 75  LDSEHPNIYVLSIEHKWIKLLKPRKCYGDIVRSTLITVQMLHFFGRGEQRSSNHLWDHLD 134

Query: 134 KVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPETNTFHEDVRTTEK 193
           +VF  Y  +P E+DL+ H  + +   E+D+ L  SK L   + N           + T+K
Sbjct: 135 EVFGKYNPKPVEDDLMKHHTLIKLFVEKDQTLMKSKILQRLIEN---------GFKRTKK 185

Query: 194 PKVIVDWNE------DEDSERVDKDENYF----------------------------AVC 219
           P +     E      D    R + D NY                              +C
Sbjct: 186 PILKALGMEAQSIVSDGWRARKNDDNNYGNKDDSGDDCDGDGSSDDGDGSSDDDVTDQIC 245

Query: 220 AICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQC 279
           A+CDDGG +  CDG C RSFH T   G+ + C+SL YT A++  +  +LC NC  ++HQC
Sbjct: 246 ALCDDGGHLLSCDGPCKRSFHPTKKDGRESKCESLHYTSAEVKRIGTYLCANCKNKQHQC 305

Query: 280 FACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFT 339
           F CG L  S   +++ VF C  A+CG FYHP+C+++LL P+  +   EL  RI +G SF 
Sbjct: 306 FRCGELEPSHGPNAK-VFQCNQASCGYFYHPKCIAQLLDPN--ATDGELERRIMSGMSFP 362

Query: 340 CPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRAWVDLLP 399
           CP+H CF C   E+     LQLA+CRRCP+AYHR+CLP +++F   D++  QRAW     
Sbjct: 363 CPIHWCFKCGHMENKAQRALQLAVCRRCPRAYHRECLPRDLSFGTKDKDGNQRAWK---L 419

Query: 400 NNRILIYCLEHKIISELKTPARDHLKFPGV 429
           ++ I IYCL+H+I  +  T +R+H+KFP  
Sbjct: 420 SDTIFIYCLDHEIDKDTGTTSRNHIKFPAT 449


>gi|414592041|tpg|DAA42612.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 1006

 Score =  285 bits (728), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 160/445 (35%), Positives = 238/445 (53%), Gaps = 45/445 (10%)

Query: 15  VSDYEFLDQNEESISFSVLPLQWDGCDIVGSS-EMQVFLHGNIGDGPWNFNEQVIAWKFE 73
           V+ Y F D  E+ + FS+LP Q+   D         VFL G + D      ++V+AWK  
Sbjct: 16  VNRYYFEDGEEKPVCFSILPFQFGEDDSEAVFLRKDVFLCGFV-DKNLPVYKEVVAWKIR 74

Query: 74  LSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQNPEASRKYIREQML 133
           L    P I+VLS ++ WI L  P+K + +IVR+ LITV  L+   +  + S  ++ + + 
Sbjct: 75  LDSEHPNIYVLSIEHKWIKLLKPRKCYGDIVRSTLITVQMLHFFGRGEQRSSNHLWDHLD 134

Query: 134 KVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPETNTFHEDVRTTEK 193
           +VF  Y  +P E+DL+ H  + +   E+D+ L  SK L   + N  +     +     E 
Sbjct: 135 EVFGKYNPKPVEDDLMKHHTLIKLFVEKDQTLMKSKILQRLIENGFKRT---KKALGMEA 191

Query: 194 PKVIVD-WNEDEDSERVDKDENYF----------------------------AVCAICDD 224
             ++ D W       R + D NY                              +CA+CDD
Sbjct: 192 QSIVSDGWR-----ARKNDDNNYGNKDDSGDDCDGDGSSDDGDGSSDDDVTDQICALCDD 246

Query: 225 GGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGM 284
           GG +  CDG C RSFH T   G+ + C+SL YT A++  +  +LC NC  ++HQCF CG 
Sbjct: 247 GGHLLSCDGPCKRSFHPTKKDGRESKCESLHYTSAEVKRIGTYLCANCKNKQHQCFRCGE 306

Query: 285 LGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHK 344
           L  S   +++ VF C  A+CG FYHP+C+++LL P+  +   EL  RI +G SF CP+H 
Sbjct: 307 LEPSHGPNAK-VFQCNQASCGYFYHPKCIAQLLDPN--ATDGELERRIMSGMSFPCPIHW 363

Query: 345 CFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRAWVDLLPNNRIL 404
           CF C   E+     LQLA+CRRCP+AYHR+CLP +++F   D++  QRAW     ++ I 
Sbjct: 364 CFKCGHMENKAQRALQLAVCRRCPRAYHRECLPRDLSFGTKDKDGNQRAWK---LSDTIF 420

Query: 405 IYCLEHKIISELKTPARDHLKFPGV 429
           IYCL+H+I  +  T +R+H+KFP  
Sbjct: 421 IYCLDHEIDKDTGTTSRNHIKFPAT 445


>gi|449467187|ref|XP_004151306.1| PREDICTED: uncharacterized protein LOC101207373, partial [Cucumis
           sativus]
          Length = 625

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 119/169 (70%), Positives = 141/169 (83%), Gaps = 1/169 (0%)

Query: 705 IVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQ 764
           IVDFCCGANDFS +MK KL++ GK CSFRN+D I PKNDF+FEKRDWMTV+P+ELP GSQ
Sbjct: 1   IVDFCCGANDFSVLMKKKLDETGKQCSFRNFDFIPPKNDFNFEKRDWMTVQPKELPKGSQ 60

Query: 765 LIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKAS-YNLIWEDNEVLS 823
           LIMGLNPPFGVKA+LANKF+ +AL+F PKL++LIVP ET RLD+K + Y+L+WED E LS
Sbjct: 61  LIMGLNPPFGVKAALANKFVDKALEFNPKLLILIVPPETERLDEKMTPYDLVWEDTEFLS 120

Query: 824 GKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHI 872
           GKSFYLPGS+D  D Q++QWN +PP LYLWSR DWT  H  IA   GH+
Sbjct: 121 GKSFYLPGSVDAKDKQMDQWNVRPPVLYLWSRRDWTHKHTAIAQEHGHL 169


>gi|449472641|ref|XP_004153655.1| PREDICTED: uncharacterized protein LOC101221422, partial [Cucumis
           sativus]
          Length = 364

 Score =  268 bits (685), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 162/381 (42%), Positives = 226/381 (59%), Gaps = 34/381 (8%)

Query: 296 VFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMN 355
           VF CV+ATCG FYHP+C+S+LLHP+N+  A +L ++IA+GESF+CPVHKC VC   E+  
Sbjct: 1   VFQCVNATCGYFYHPKCISRLLHPENKVAAGDLEKKIASGESFSCPVHKCSVCALGENKK 60

Query: 356 VEDLQLAICRRCPKAYHRKCLPTEITFSDADENNF-QRAWVDLLPNNRILIYCLEHKIIS 414
           + +LQ A+CRRCPK+YHRKCLP +ITF  +++     RAW  LLP NRILIYCL+H+I  
Sbjct: 61  IWELQFAVCRRCPKSYHRKCLPRKITFEGSEDGETPTRAWEKLLP-NRILIYCLDHEIDE 119

Query: 415 ELKTPARDHLKFPGVEGKR---KKEDLELLLTEEKDV----ASKRNIVSESF-----VAD 462
           E++TPARDH+KFPG+E  R   +K  L +  T          S+ N+VS+       +  
Sbjct: 120 EIETPARDHIKFPGLEESRLPIQKRKLPISDTRRGKTIVFRGSRENVVSKKGSMPDDLQG 179

Query: 463 KTVVKKLKLAEVYSGADVGMSNSEIKKRWPRQD----VHSLKKPNITDTGRKSLKDIADK 518
           K+  K  K  E  S     +     K  W  +     + ++ + ++   G   L DI   
Sbjct: 180 KSAAKVSKSFERSSSDGKLLGKVTAKSLWSSESKNVKLGNISRNSLNQKGESVLMDIDKT 239

Query: 519 SKPSLRKDSTLLKSRSFVVKPGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKELL 578
            K  ++K S + KS      P  R  D SK+     DR  M         +DA  E+ L+
Sbjct: 240 IK--VKKSSLVGKS----AMPTKR-FDPSKIYK--EDRSGM-------LLLDANSERRLM 283

Query: 579 ALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLDGGC 638
            ++K+  SS   E+ +K H VP THA+  KH+++K+I +G +EGSV AVRAAL  L+ GC
Sbjct: 284 DMMKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSVVAVRAALRKLEEGC 343

Query: 639 DIEDAKAVCPPEILCQIFQWK 659
            IEDA+AVC PE+L  IF+WK
Sbjct: 344 CIEDAEAVCEPEVLNHIFKWK 364


>gi|242044182|ref|XP_002459962.1| hypothetical protein SORBIDRAFT_02g019320 [Sorghum bicolor]
 gi|241923339|gb|EER96483.1| hypothetical protein SORBIDRAFT_02g019320 [Sorghum bicolor]
          Length = 919

 Score =  262 bits (669), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 118/168 (70%), Positives = 142/168 (84%), Gaps = 1/168 (0%)

Query: 705 IVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQ 764
           IVDFCCGANDFS +MK KL+++ K C+F+NYDLIQP++ F FEKRDWMTV+P ELP GSQ
Sbjct: 302 IVDFCCGANDFSRLMKEKLDKVQKKCNFKNYDLIQPQHRFCFEKRDWMTVQPNELPRGSQ 361

Query: 765 LIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKAS-YNLIWEDNEVLS 823
           LIMGLNPPFGVKASLANKFI +AL FKPKLI+LIVP+ET+RLDQK + Y+LIWED+E+L+
Sbjct: 362 LIMGLNPPFGVKASLANKFIDKALTFKPKLIILIVPKETKRLDQKRTPYDLIWEDSEILA 421

Query: 824 GKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGH 871
           GKSFYLPGS+D++D  +E WN   PPLYLWS   WT  H ++A  RGH
Sbjct: 422 GKSFYLPGSVDLNDKTVEGWNASAPPLYLWSHPHWTKKHMQVAEERGH 469



 Score =  122 bits (306), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 62/143 (43%), Positives = 92/143 (64%), Gaps = 4/143 (2%)

Query: 567 PSVDAEMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKA 626
           P+VD+E EK ++ ++ +  SS   ++  ++  +P TH +  +  L   I  G +E SV+A
Sbjct: 85  PTVDSETEKRVITILGNKVSSLTIKDVTRNCSLPSTHVYSGR--LTDKIPQGKLERSVQA 142

Query: 627 VRAALEMLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVE 686
           V AAL+ L+ G  + DAKAVC P++L Q+ +W  KL VY++PF+HG RY+SFGRHFTKVE
Sbjct: 143 VEAALKKLENGGSVNDAKAVCEPDVLRQLARWHSKLRVYISPFIHGTRYSSFGRHFTKVE 202

Query: 687 KLKEIVDRLHWYVRSGDTIVDFC 709
           KL E+  R    +R G    +FC
Sbjct: 203 KLVEVHGR--HTMRMGYQAEEFC 223


>gi|302782778|ref|XP_002973162.1| hypothetical protein SELMODRAFT_99176 [Selaginella moellendorffii]
 gi|300158915|gb|EFJ25536.1| hypothetical protein SELMODRAFT_99176 [Selaginella moellendorffii]
          Length = 215

 Score =  258 bits (660), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 116/209 (55%), Positives = 153/209 (73%), Gaps = 1/209 (0%)

Query: 664 VYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWYVRSGDTIVDFCCGANDFSCMMKVKL 723
           VYL+P L GMRY+SFGRHFT+  KL+ IVD+L +YV  GD +VD CCG+N+FS +M  KL
Sbjct: 2   VYLSPMLFGMRYSSFGRHFTRTYKLQSIVDQLSYYVNPGDMVVDLCCGSNEFSRLMSDKL 61

Query: 724 EQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKF 783
             M K C F+N+D IQP + F FE++D+M ++PEEL  G +LIMGLNPPFG +A LA+KF
Sbjct: 62  TAMKKDCRFKNFDFIQPSDTFHFERKDYMEIQPEELCSGDKLIMGLNPPFGFRAQLASKF 121

Query: 784 ISQALKFKPKLIVLIVPQETRRLDQKASYNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQW 843
           +   +KF+PK+IVLI P+ETRR D +  Y L WED ++L G +FYLPGS+D     ++Q 
Sbjct: 122 VEHCVKFQPKIIVLIAPEETRRPDGRY-YELTWEDKDLLRGDAFYLPGSVDAEGKTIKQE 180

Query: 844 NCKPPPLYLWSRADWTASHKKIALGRGHI 872
           N  PP LYLWSR D+  +H+KIA   GH+
Sbjct: 181 NNVPPVLYLWSRVDFAEAHRKIARRHGHL 209


>gi|357157830|ref|XP_003577927.1| PREDICTED: uncharacterized protein LOC100829433 [Brachypodium
           distachyon]
          Length = 632

 Score =  249 bits (635), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 111/171 (64%), Positives = 138/171 (80%), Gaps = 1/171 (0%)

Query: 705 IVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQ 764
           IVDFCCGANDFS +MK KL+ + K C F+NYDLIQP+N F FE+RDWMTV+  ELP GS+
Sbjct: 2   IVDFCCGANDFSRLMKEKLDLVQKKCHFKNYDLIQPQNTFCFERRDWMTVQRNELPRGSR 61

Query: 765 LIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKAS-YNLIWEDNEVLS 823
           L+MGLNPPFGVKA+LANKFI +AL F PKLI+LIVP+ET+RLDQK + Y+L+WED + L+
Sbjct: 62  LVMGLNPPFGVKAALANKFIDKALSFNPKLIILIVPKETKRLDQKKTPYDLVWEDGDCLA 121

Query: 824 GKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHITV 874
           GKSFYLPGS+DV++  ++ WN   PPLYLWS  DWT  HKK+A    H ++
Sbjct: 122 GKSFYLPGSVDVNEKIVQGWNASAPPLYLWSHPDWTKKHKKVAEEHNHTSL 172


>gi|218200973|gb|EEC83400.1| hypothetical protein OsI_28843 [Oryza sativa Indica Group]
          Length = 615

 Score =  248 bits (633), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 157/484 (32%), Positives = 242/484 (50%), Gaps = 59/484 (12%)

Query: 1   MGYSEEDDDTLTLY--VSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGD 58
           M   ++DDD       V  Y   D +E+ + FS+LP  +D    +  SE   ++ G   +
Sbjct: 1   MALFDDDDDEFPQLKDVDQYACKDSHEDLVCFSILPFWFDEEHRLPGSEKGAYIRGIDCE 60

Query: 59  GPWNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSF-KNIVRTILITVHWLYCV 117
           G     +++ AW+ EL   +P I VLS + NWI L +P   + ++I R++LIT+  ++ V
Sbjct: 61  GRQPVYKRIEAWRVELDCEKPMISVLSSEGNWIRLLNPHPGYAEDIARSVLITIQMIHFV 120

Query: 118 KQNPEASRKYIREQMLKVFCSYEVEPSE-NDLLDHMPISREAAERDKDLANSKFLL---- 172
           +++P    + + + + +VF  +  +P E +DL     + +   + D+ L  SK L     
Sbjct: 121 RKHPAKDERTLSKHLCEVFRKFFTKPPEVDDLRKSYSLIKYFMKMDQTLVKSKILRRPLE 180

Query: 173 -----------------SFM-ANHPETNTFHEDVRTTEKPKVIVDWNEDEDSERVDKDEN 214
                            SF+ +N   TN    D  T +      D N D  ++    D+ 
Sbjct: 181 DTWIIIETELRKASLNESFIVSNESSTNC---DDYTEDGSSGSSDDNGDNYTDGNASDDG 237

Query: 215 YFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVY 274
             AVCAICD GG +  C G C RSFH T+  G  + C++LGYT  +++ +P F+C NC +
Sbjct: 238 TEAVCAICDQGGILLGCKGECKRSFHPTLEDGTKSFCKTLGYTSREVEEIPIFICSNCQH 297

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
           ++HQCF CG L SS +++ + VF C +A+CG+FYHP+CV+ LL PD    A  L +RIA 
Sbjct: 298 KQHQCFKCGQLDSSHETNPK-VFQCCNASCGRFYHPKCVAGLLEPDG---ACGLEKRIAD 353

Query: 335 GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRAW 394
           G +FTCPVH CF C+Q ED                         EI+F   D     R W
Sbjct: 354 GMTFTCPVHWCFECKQIEDR------------------------EISFKKKDVT--ARVW 387

Query: 395 VDLLPNNRILIYCLEHKIISELKTPARDHLKFPGVEGKRKKEDLELLLTEEKDVASKRNI 454
                + +I IYCL+H I +  +TP RDH+KFP      + +DL     +  D+ +   +
Sbjct: 388 EVPKGDPKIFIYCLDHDIDATFRTPCRDHIKFPSAPQIERVKDLARKKVKVTDIRNTDEV 447

Query: 455 VSES 458
             ES
Sbjct: 448 SPES 451


>gi|302789842|ref|XP_002976689.1| hypothetical protein SELMODRAFT_175739 [Selaginella moellendorffii]
 gi|300155727|gb|EFJ22358.1| hypothetical protein SELMODRAFT_175739 [Selaginella moellendorffii]
          Length = 209

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 111/203 (54%), Positives = 147/203 (72%), Gaps = 1/203 (0%)

Query: 670 LHGMRYTSFGRHFTKVEKLKEIVDRLHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKS 729
           L GMRY+SFGRHFT+  KL+ IVD+L +YV  GD +VD CCG+N+FS +M  KL  M K 
Sbjct: 2   LFGMRYSSFGRHFTRTYKLQSIVDQLSYYVNPGDMVVDLCCGSNEFSRLMSDKLTAMKKD 61

Query: 730 CSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALK 789
           C F+N+D IQP + F FE++D+M ++PEEL    +LIMGLNPPFG +A LA+KF+   +K
Sbjct: 62  CRFKNFDFIQPSDTFHFERKDYMEIQPEELCSSDKLIMGLNPPFGFRAQLASKFVEHCVK 121

Query: 790 FKPKLIVLIVPQETRRLDQKASYNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPP 849
           F+PK+IVLI P+ETRR D +  Y L WED ++L G +FYLPGS+D     ++Q N  PP 
Sbjct: 122 FQPKIIVLIAPEETRRPDGRY-YELTWEDKDLLRGDAFYLPGSVDAEGKTIKQENNVPPV 180

Query: 850 LYLWSRADWTASHKKIALGRGHI 872
           LYLWSR D+  +H+KIA   GH+
Sbjct: 181 LYLWSRVDFAEAHRKIARRHGHL 203


>gi|222640379|gb|EEE68511.1| hypothetical protein OsJ_26944 [Oryza sativa Japonica Group]
          Length = 566

 Score =  246 bits (629), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 157/484 (32%), Positives = 242/484 (50%), Gaps = 59/484 (12%)

Query: 1   MGYSEEDDDTLTLY--VSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGD 58
           M   ++DDD       V  Y   D +E+ + FS+LP  +D    +  SE  V++ G   +
Sbjct: 1   MALFDDDDDEFPQLKDVDQYACKDSHEDLVCFSILPFWFDEEHRLLGSEKGVYIRGIDCE 60

Query: 59  GPWNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSF-KNIVRTILITVHWLYCV 117
           G     +++ AW+ EL   +P I VLS + NWI L +P   + ++I R++LIT+  ++ V
Sbjct: 61  GRQPVYKRIEAWRVELDCEKPMISVLSSEGNWIRLLNPHPGYAEDIARSVLITIQMIHFV 120

Query: 118 KQNPEASRKYIREQMLKVFCSYEVEPSE-NDLLDHMPISREAAERDKDLANSKFLL---- 172
           +++P    + + + + +VF  +  +P E +DL     + +   + D+ L  SK L     
Sbjct: 121 RKHPAKDERTLSKHLCEVFRKFFTKPPEVDDLRKSYSLIKYFMKMDQTLVKSKILRRPLE 180

Query: 173 -----------------SFM-ANHPETNTFHEDVRTTEKPKVIVDWNEDEDSERVDKDEN 214
                            SF+ +N   TN    D  T +      D N D  ++    D+ 
Sbjct: 181 DTWIIIETELRKASPNESFIVSNESSTNC---DDYTEDGSSGSSDDNGDNYTDGNASDDG 237

Query: 215 YFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVY 274
             AVCAICD GG +  C G C RSFH  +  G  + C++LGYT  +++ +P F+C NC +
Sbjct: 238 TEAVCAICDQGGILLGCKGECKRSFHPKLEDGTKSFCKTLGYTSREVEEIPIFICSNCQH 297

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
           ++HQCF CG L SS +++ + VF C +A+CG+FYHP+CV+ LL PD    A  L +RIA 
Sbjct: 298 KQHQCFKCGQLDSSHETNPK-VFQCCNASCGRFYHPKCVAGLLEPDG---ACGLEKRIAD 353

Query: 335 GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRAW 394
           G +FTCPVH CF C+Q ED                         EI+F   D     R W
Sbjct: 354 GMTFTCPVHWCFECKQIEDR------------------------EISFKKKDVT--ARVW 387

Query: 395 VDLLPNNRILIYCLEHKIISELKTPARDHLKFPGVEGKRKKEDLELLLTEEKDVASKRNI 454
                + +I IYCL+H I +  +TP RDH+KFP      + +DL     +  D+ +   +
Sbjct: 388 EVPKGDPKIFIYCLDHDIDATFRTPCRDHIKFPSAPQIERVKDLARKKVKVTDIRNTDEV 447

Query: 455 VSES 458
             ES
Sbjct: 448 SPES 451


>gi|357160452|ref|XP_003578769.1| PREDICTED: PHD finger-containing protein DDB_G0268158-like
           [Brachypodium distachyon]
          Length = 298

 Score =  243 bits (621), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 110/224 (49%), Positives = 160/224 (71%), Gaps = 7/224 (3%)

Query: 210 DKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQA-----QIDAV 264
           D+   +   CAICDDGGD   C+G CLRSFH T   G++++C +LG T+      Q +  
Sbjct: 29  DRSNEWDNACAICDDGGDFICCEGGCLRSFHPTKKYGEDSMCTTLGLTEEWWQTLQSNEQ 88

Query: 265 PNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESL 324
             ++C+NC Y++HQCFACG+LGSS  +S  EV  C    CG FYHP+CV++L+HPD+++ 
Sbjct: 89  EKYICKNCEYKQHQCFACGLLGSSGLTSGSEVIQCKDKMCGYFYHPKCVAELVHPDSKAQ 148

Query: 325 AEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITF-S 383
           A    +R+A G +F CP+H+C +C+++E+ + +D+QLA+CRRCP AYHRKCLP++I+F  
Sbjct: 149 AIFFEKRVALGLTFHCPLHRCSLCKEAENRDDKDMQLAVCRRCPTAYHRKCLPSDISFEK 208

Query: 384 DADENNFQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFP 427
           D  E   QRAW ++LP ++ILIYC++HKI  +L+TP RDH+ FP
Sbjct: 209 DGKEGTQQRAWDNVLP-DQILIYCMKHKIDKDLRTPKRDHIVFP 251


>gi|357154941|ref|XP_003576954.1| PREDICTED: uncharacterized protein LOC100825038 [Brachypodium
           distachyon]
          Length = 332

 Score =  239 bits (609), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 113/240 (47%), Positives = 162/240 (67%), Gaps = 14/240 (5%)

Query: 194 PKVIVDWNEDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQS 253
           P+++    ED  S+  D        CAICDDGG++ +C+G CLRSFH     G +++C S
Sbjct: 54  PRLVATRREDGLSDEWD-------ACAICDDGGELIWCEGGCLRSFHPIKICGLDSMCTS 106

Query: 254 LGYTQAQIDAV-----PNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFY 308
           LG T+     +       ++C+NC  ++HQCFACG+LGSSD  S  EVF C    CG FY
Sbjct: 107 LGLTEEHWQTLHANKQEKYICKNCENKQHQCFACGLLGSSDLISGSEVFKCKEKNCGHFY 166

Query: 309 HPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCP 368
           HP+CV++LL+PD++ LA    E +A+G  F C +H+C +C+++E+ +  D+QLA+CRRCP
Sbjct: 167 HPKCVAELLYPDSKGLAIYFEEHVASGLEFHCLMHRCSLCKEAENRDDTDMQLAVCRRCP 226

Query: 369 KAYHRKCLPTEITF-SDADENNFQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFP 427
            AYHRKCLP++I+F  D DE   QRAW ++LP ++ILIYC++H+I  +L TP RDH+ FP
Sbjct: 227 TAYHRKCLPSDISFIEDNDEGTQQRAWDNILP-DQILIYCMKHEIDKDLGTPKRDHIVFP 285


>gi|222640381|gb|EEE68513.1| hypothetical protein OsJ_26947 [Oryza sativa Japonica Group]
          Length = 1287

 Score =  233 bits (595), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 151/488 (30%), Positives = 233/488 (47%), Gaps = 73/488 (14%)

Query: 18  YEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGPWNFNEQVIAWKFELSYA 77
           Y F +  ++ + FSVLP+++D  + V  S+ +        D   +  ++VIAW+ +L   
Sbjct: 18  YYFEESEDKPVCFSVLPIKFDENEEVSCSDYKELNLRGFTDNNRHVFKKVIAWRVDLDCQ 77

Query: 78  QPEIWVLSKQNNWIMLQSPKKS-FKNIVRTILITVHWLYCVKQNPEASRKYIREQMLKVF 136
           +P+I VLS    WI L  P+   ++  VR+ILITV  L+ V++ P+  ++ + + + +VF
Sbjct: 78  RPKISVLSSDGKWIELLKPRMCYYEKKVRSILITVQMLHFVRKCPKKQQRSLFDHLGEVF 137

Query: 137 CSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPETNTFHEDVRTTEKPKV 196
             +   P+ +D+  H  + +   ERD  L  SK +  F+ +           R   + + 
Sbjct: 138 NKFGPVPNGDDIKKHHHLIKLFMERDPILVKSKIIRCFIEDASRKAIEPIPRRACTEDRF 197

Query: 197 IVDWNEDEDSERVDK----------------------------DENYFAVCAICDDGGDV 228
           IV    DE  E  D                             D++   +CAICDDGG +
Sbjct: 198 IV---SDESLESSDDCSYSNSDSSDDNNDDSTDEDTDTDGNATDDDTDMICAICDDGGKL 254

Query: 229 TFCDGRCLRSFHATITAGKNALCQSLGYTQAQ--IDAVPNFLCQNCVYQEHQCFACGMLG 286
             C+G C RSFHA +  G+ + C++L +T  +  +     FLC+NC + EHQCF CG L 
Sbjct: 255 LSCEGPCKRSFHARVKHGRKSKCRTLRFTSVELKLKESGTFLCENCEHNEHQCFKCGELE 314

Query: 287 SSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCF 346
            S    + +VF C   +CG FYHP+C++ LL P++   A +L E IA G SFTCPVH CF
Sbjct: 315 PS-YGPNAKVFQCNKESCGHFYHPKCIAVLLEPEDTDGACKLEEMIADGMSFTCPVHWCF 373

Query: 347 VCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRAWVDLLPNNRILIY 406
            C + ED    + Q A+CRRCP++YH +CLP                             
Sbjct: 374 KCAKMEDRTRSEFQFAVCRRCPRSYHTECLPR---------------------------- 405

Query: 407 CLEHKIISELKTPARDHLKFPGVEGKRKKEDLELLLTEEKDVASKRNIVSESFVADKTVV 466
              H I +   TP+ +H+KFP +   +K ++L     ++  V  KR      F       
Sbjct: 406 --HHGIDATTGTPSGEHIKFPSIPKIKKTKNLS---KKDAKVTGKRKKSGNKF-----ST 455

Query: 467 KKLKLAEV 474
           K  KLA V
Sbjct: 456 KSTKLANV 463



 Score =  162 bits (411), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 83/211 (39%), Positives = 126/211 (59%), Gaps = 22/211 (10%)

Query: 686 EKLKEIVDRLHWYVRSGDT---------------------IVDFCCGANDFSCMMKVKLE 724
           E L +I D+LHWYV+ G+T                     IVD C   ++FS +MK KLE
Sbjct: 586 ECLVQIADKLHWYVQPGNTESCGKLPDGVLLERSIVVVREIVDLCFNMDNFSRLMKEKLE 645

Query: 725 QMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKFI 784
            + K C+F+NYDL Q KN+  F++ +W+T++P++LP G  L+M L+ P  ++A  ANKF+
Sbjct: 646 DVSKGCNFKNYDLFQHKNNLCFDESNWVTMQPKDLPHGLNLVMVLDLPLDIQAVSANKFV 705

Query: 785 SQALKFKPKLIVLIVPQETRRLD-QKASYNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQW 843
            + L FKP+LI+++ P +  RLD ++  Y+LIWE N+ + GK  Y PG +DV+D+     
Sbjct: 706 DKVLTFKPRLIIIVSPTDIERLDCKEEPYHLIWEYNQHIFGKPLYQPGDVDVNDDAKNGL 765

Query: 844 NCKPPPLYLWSRADWTASHKKIALGRGHITV 874
           +  P  + LWS  D T  + +IA   GH+ V
Sbjct: 766 HVIPLCVSLWSFPDKTEENMRIAGKHGHLNV 796


>gi|326503094|dbj|BAJ99172.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326526707|dbj|BAK00742.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 338

 Score =  231 bits (589), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 114/267 (42%), Positives = 163/267 (61%), Gaps = 15/267 (5%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQ------IDAVPNFLCQN 271
           VCA CDDGG++  CDG CLRSFH T   G+ ++C SLG  +A+      +     + C N
Sbjct: 53  VCAFCDDGGELICCDGGCLRSFHPTKEHGERSMCTSLGLNEAKWEKHKALGEKGLYFCNN 112

Query: 272 CVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRER 331
           C +++HQCFACG+LGSS+ ++  EVF C    CG FYHP CV+++L+PDN++ A    + 
Sbjct: 113 CKHKQHQCFACGLLGSSNLTAEPEVFKCKHRKCGHFYHPGCVAEILYPDNKAKATCFEQC 172

Query: 332 IAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFS-DADENNF 390
           +A G  F+CPVHKC  C+++E+ + +++Q A+CRRCP AYHRKCLP +I+ + D +E   
Sbjct: 173 VAGGLEFSCPVHKCKRCKEAENKHDKEMQFAVCRRCPTAYHRKCLPRDISLTEDEEEAMP 232

Query: 391 QRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFPGVEGKRKKEDLELLLTEEKDVAS 450
           QRAW  +LP ++ILIYC +H I  EL TP RDH+ FP        E         +    
Sbjct: 233 QRAWDGILP-DQILIYCRKHDIQKELGTPKRDHIVFPDARNPGAAE-------RHRGAPQ 284

Query: 451 KRNIVSESFVADKTVVKKLKLAEVYSG 477
           +  I+ E  + D+   +    AE  SG
Sbjct: 285 EHGILDEDELVDRNPSQSPGPAETGSG 311


>gi|414884838|tpg|DAA60852.1| TPA: hypothetical protein ZEAMMB73_000278 [Zea mays]
          Length = 589

 Score =  231 bits (589), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 107/155 (69%), Positives = 126/155 (81%), Gaps = 1/155 (0%)

Query: 719 MKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVKAS 778
           MK KL+++ K C F+NYDLIQP+N F FEKRDWMTV+P ELP GSQLIMGLNPPFGVKAS
Sbjct: 1   MKEKLDKVQKKCHFKNYDLIQPQNRFCFEKRDWMTVQPNELPHGSQLIMGLNPPFGVKAS 60

Query: 779 LANKFISQALKFKPKLIVLIVPQETRRLDQKAS-YNLIWEDNEVLSGKSFYLPGSLDVHD 837
           LANKFI +AL FKPKLIVLIVP+ET+RLDQ+ + Y+LIWED+E LSGKSFYLPGS+D++D
Sbjct: 61  LANKFIDKALTFKPKLIVLIVPKETKRLDQRRTPYDLIWEDSECLSGKSFYLPGSVDLND 120

Query: 838 NQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHI 872
             +E WN   PPLYLWS   WT  H ++A   GH 
Sbjct: 121 KIVEGWNASAPPLYLWSHPGWTKKHMQVAEEHGHT 155


>gi|218186609|gb|EEC69036.1| hypothetical protein OsI_37847 [Oryza sativa Indica Group]
          Length = 345

 Score =  231 bits (588), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 114/222 (51%), Positives = 147/222 (66%), Gaps = 15/222 (6%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHAT---ITAGKNALCQSLGYTQAQID--------AVPN 266
           VC+ICDDGGD   C+GRCLRSFHAT   IT      C +LG T+ Q          + P 
Sbjct: 45  VCSICDDGGDFLCCEGRCLRSFHATEKHITEINQ--CTTLGLTEEQWKIFRQNDEKSEPL 102

Query: 267 FLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAE 326
           ++C+NC Y +HQCF+CG+LGSSD SS  EVF C    CG FYHP+C+++LL+PD+     
Sbjct: 103 YICKNCKYNQHQCFSCGLLGSSDLSSGAEVFQCEDEKCGHFYHPKCLARLLYPDSSMQPL 162

Query: 327 ELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDAD 386
              E +A G  F CPVHKC VC+  E+ N  + Q A+CRRCP  YHRKCLP++I F D +
Sbjct: 163 NFEEEVARGLKFLCPVHKCHVCKGGENKNDMENQFAVCRRCPTVYHRKCLPSDIVFEDDE 222

Query: 387 ENNF-QRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFP 427
            N   QRAW D+L  ++ILIYC++H+I  EL TPAR+H+ FP
Sbjct: 223 SNGIVQRAWDDVL-RDQILIYCMKHEIDPELGTPARNHIIFP 263


>gi|414588818|tpg|DAA39389.1| TPA: hypothetical protein ZEAMMB73_973048 [Zea mays]
          Length = 591

 Score =  230 bits (587), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 106/155 (68%), Positives = 126/155 (81%), Gaps = 1/155 (0%)

Query: 719 MKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVKAS 778
           MK KL+++ K C F+NYDLIQP++ F FEKRDWMTV+P ELP GSQLIMGLNPPFGVKAS
Sbjct: 1   MKEKLDKVQKKCHFKNYDLIQPQHHFCFEKRDWMTVQPNELPRGSQLIMGLNPPFGVKAS 60

Query: 779 LANKFISQALKFKPKLIVLIVPQETRRLDQKA-SYNLIWEDNEVLSGKSFYLPGSLDVHD 837
           LANKFI +AL FKPKLIVLIVP+ET+RLD+K   Y+LIWED+E L+GKSFYLPGS+D++D
Sbjct: 61  LANKFIDKALTFKPKLIVLIVPKETKRLDKKKIPYDLIWEDSECLAGKSFYLPGSVDLND 120

Query: 838 NQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHI 872
           N +E WN   PPLYLWS   WT  H ++A   GH 
Sbjct: 121 NTVEGWNASAPPLYLWSHPHWTKKHMQVAEEHGHT 155


>gi|115487874|ref|NP_001066424.1| Os12g0224500 [Oryza sativa Japonica Group]
 gi|108862352|gb|ABA96791.2| hydroxyproline-rich glycoprotein family protein, putative,
           expressed [Oryza sativa Japonica Group]
 gi|113648931|dbj|BAF29443.1| Os12g0224500 [Oryza sativa Japonica Group]
 gi|215693390|dbj|BAG88772.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 345

 Score =  230 bits (586), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 114/222 (51%), Positives = 147/222 (66%), Gaps = 15/222 (6%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHAT---ITAGKNALCQSLGYTQAQID--------AVPN 266
           VC+ICDDGGD   C+GRCLRSFHAT   IT      C +LG T+ Q          + P 
Sbjct: 45  VCSICDDGGDFLCCEGRCLRSFHATEKHITEINQ--CTTLGLTEEQWKIFRQNDEKSEPL 102

Query: 267 FLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAE 326
           ++C+NC Y +HQCF+CG+LGSSD SS  EVF C    CG FYHP+C+++LL+PD+     
Sbjct: 103 YICKNCKYNQHQCFSCGLLGSSDLSSGAEVFQCEDEKCGHFYHPKCLARLLYPDSSMQPL 162

Query: 327 ELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDAD 386
              E +A G  F CPVHKC VC+  E+ N  + Q A+CRRCP  YHRKCLP++I F D +
Sbjct: 163 NFEEEVARGLKFLCPVHKCHVCKGGENKNDMENQFAVCRRCPTVYHRKCLPSDIVFEDDE 222

Query: 387 ENNF-QRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFP 427
            N   QRAW D+L  ++ILIYC++H+I  EL TPAR+H+ FP
Sbjct: 223 SNGIVQRAWDDVL-RDQILIYCMKHEIDPELGTPARNHIIFP 263


>gi|357154938|ref|XP_003576953.1| PREDICTED: uncharacterized protein LOC100824733 [Brachypodium
           distachyon]
          Length = 315

 Score =  224 bits (571), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 104/239 (43%), Positives = 157/239 (65%), Gaps = 6/239 (2%)

Query: 194 PKVIVDWNEDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQS 253
           PK   + +   ++++         VCAICD+GG +  CDG CLRSFH T   G+ + C +
Sbjct: 12  PKAKRERDCKRNTDKRKAQPTELDVCAICDEGGSLICCDGGCLRSFHPTRKVGEESKCTT 71

Query: 254 LGYTQAQIDAVPN----FLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYH 309
           LG T+ Q   + +    ++C+NC  ++HQCFACG+LGSSD +S  EVF C    C +FYH
Sbjct: 72  LGLTEEQWQILESNDEKYICKNCKSKQHQCFACGLLGSSDLTSGTEVFQCKDKQCARFYH 131

Query: 310 PECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPK 369
           P+CV+++L+PD++S A      +A+G  F CP+H+C +C+++ + + +D++LA+C RCP 
Sbjct: 132 PKCVAEVLYPDSKSRALRFGNDVASGLEFHCPMHRCSLCKEAGNRDDKDMRLAVCGRCPT 191

Query: 370 AYHRKCLPTEITFSDADENNFQ-RAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFP 427
           AYHR CLP EI+F +  E   Q RAW D+LP+    IYC++HK+  +L+TP RDH+ FP
Sbjct: 192 AYHRTCLPREISFRENKEKGIQKRAWDDVLPDQD-FIYCMKHKMEKDLRTPRRDHIVFP 249


>gi|224061077|ref|XP_002300345.1| predicted protein [Populus trichocarpa]
 gi|222847603|gb|EEE85150.1| predicted protein [Populus trichocarpa]
          Length = 582

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 95/144 (65%), Positives = 116/144 (80%), Gaps = 3/144 (2%)

Query: 729 SCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPD-GSQLIMGLNPPFGVKASLANKFISQA 787
           +C+   + L Q  NDF+FEKRDWMTV P+ELP  GSQLIMGLNPPFGVKA+LANKFI +A
Sbjct: 91  TCNLVKFPLTQ--NDFNFEKRDWMTVCPDELPKKGSQLIMGLNPPFGVKAALANKFIDKA 148

Query: 788 LKFKPKLIVLIVPQETRRLDQKASYNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKP 847
           L+FKPKL++LIVP ET RLD+K  YNL+WE++  LSGKSFYLPGS++ +D Q++QWN   
Sbjct: 149 LQFKPKLLILIVPPETERLDKKKPYNLVWENDHFLSGKSFYLPGSVNENDKQMDQWNVTA 208

Query: 848 PPLYLWSRADWTASHKKIALGRGH 871
           PPLYLWSR +W+A HK IA   GH
Sbjct: 209 PPLYLWSRQEWSAKHKAIAQKHGH 232



 Score = 46.6 bits (109), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 18/25 (72%), Positives = 23/25 (92%)

Query: 717 CMMKVKLEQMGKSCSFRNYDLIQPK 741
           C+MK KLE+MGK CS++NYD+IQPK
Sbjct: 12  CIMKKKLEEMGKKCSYKNYDVIQPK 36


>gi|357140293|ref|XP_003571704.1| PREDICTED: uncharacterized protein LOC100832660 [Brachypodium
           distachyon]
          Length = 1028

 Score =  196 bits (499), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 104/274 (37%), Positives = 157/274 (57%), Gaps = 18/274 (6%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           +CA+CD+GG +  C+GRC R+FH     G+ + C +LG++ AQ+  + ++LC+NC +Q+H
Sbjct: 263 LCAMCDNGGKLLSCEGRCKRAFHPREKDGRESNCDTLGFSSAQLQEIGHYLCKNCEFQQH 322

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
           QCF C  L  SD  +++ VF C  A CG FYHP C++KLL P +   A EL +RIAAG +
Sbjct: 323 QCFKCEDLEPSDGPNAK-VFQCYKACCGHFYHPSCIAKLLEPHDNDGACELEKRIAAGMA 381

Query: 338 FTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRAWVDL 397
           FTCP H C  C + ED     L LA+CRRCP         + I+F D +E + +++    
Sbjct: 382 FTCPAHWCSKCGKMEDHTEIPLWLAVCRRCP--------ISNISFEDFEEKDGRKSIRAW 433

Query: 398 LPNNRILIYCLEHKIISELKTPARDHLKFP-----GVEGKRKKEDLELLLTEE--KDVAS 450
             ++RI+IYC +H+I  ++ TP RDH+K P     G  G    + ++L  T E  +D+A 
Sbjct: 434 HLDDRIIIYCRQHEIEDKIGTPRRDHIKLPSIKDSGTIGTHCMDHMKLSSTTESTRDLAK 493

Query: 451 KRNIVS--ESFVADKTVVKKLKLAEVYSGADVGM 482
           K+  V+       D+ +    KL+   S    G 
Sbjct: 494 KKAQVTGKRKMNTDQGLTGTEKLSNKISQEKAGQ 527



 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 61/166 (36%), Positives = 91/166 (54%), Gaps = 1/166 (0%)

Query: 15  VSDYEFLDQNEESISFSVLPLQWDGCDIVGS-SEMQVFLHGNIGDGPWNFNEQVIAWKFE 73
           V  Y F +  +E +S SVLPL++D  D V      +V LHG   +G    + +V+AW+  
Sbjct: 15  VDKYHFENGMQEPVSLSVLPLKFDEIDEVACLDSTKVDLHGFADNGLHTVHAKVVAWRVG 74

Query: 74  LSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQNPEASRKYIREQML 133
             Y QP+I VLS +  WI L  P+K ++  VR++LI+V  L+ VK+ P    + +   + 
Sbjct: 75  FDYEQPKITVLSSEGYWIELLKPRKCYEKTVRSVLISVQMLHFVKKWPRKENRSLLNHLC 134

Query: 134 KVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHP 179
           +VF  ++++PSE DL     I +  AERD  L  SK L   M N P
Sbjct: 135 EVFNKFDIKPSERDLKKQHSIIKFFAERDPTLMKSKILQRIMENTP 180


>gi|242085260|ref|XP_002443055.1| hypothetical protein SORBIDRAFT_08g007180 [Sorghum bicolor]
 gi|241943748|gb|EES16893.1| hypothetical protein SORBIDRAFT_08g007180 [Sorghum bicolor]
          Length = 310

 Score =  191 bits (486), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 101/231 (43%), Positives = 130/231 (56%), Gaps = 39/231 (16%)

Query: 208 RVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQ-----ID 262
           R D D+    +CAICDDGGDV  CDG C RSFH      + + C+ +    A+     + 
Sbjct: 30  RTDGDD----LCAICDDGGDVICCDGVCQRSFHLADGNSERSRCREILRLSAEQAKMILA 85

Query: 263 AVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNE 322
           A  +F+C+NC YQEHQCFACG LGSSD SS  +VF C    CG FYHP+CV+KLL+PD+E
Sbjct: 86  ADKDFICKNCKYQEHQCFACGKLGSSDLSSEAKVFQCEVDDCGHFYHPKCVAKLLYPDSE 145

Query: 323 SLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITF 382
             A      +AA E FTCP+H+C VC+  E+ N  ++Q A+CRRCP  YHR CLP     
Sbjct: 146 EEATLFEVHVAAREKFTCPIHECIVCRGGENKNDRNMQFAVCRRCPTTYHRMCLPR---- 201

Query: 383 SDADENNFQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFPGVEGKR 433
                                     +H I+ +LKTP  DH+ FP V+  R
Sbjct: 202 --------------------------KHPIVKKLKTPKWDHIIFPDVKKIR 226


>gi|52354519|gb|AAU44580.1| hypothetical protein AT5G48090 [Arabidopsis thaliana]
          Length = 268

 Score =  191 bits (485), Expect = 1e-45,   Method: Composition-based stats.
 Identities = 91/210 (43%), Positives = 130/210 (61%), Gaps = 14/210 (6%)

Query: 160 ERDKDLANSKFLLSFMANHPETNTFHEDVRTTEKPKVIVDWNEDEDSERVD--------- 210
           +RD+DL  SKFL++F+    +T     ++ T           +D   E +          
Sbjct: 2   KRDEDLTKSKFLITFLGKTSQTTPIEVELPTDHLQDAQTPMEQDSIVENMTDEENSSSSD 61

Query: 211 ----KDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYT-QAQIDAVP 265
                +  +  VC+ICD+GG V  C+G CLRSFH TI  G    C+SLG+T + QI A+ 
Sbjct: 62  DDSEANLQFDPVCSICDNGGYVLCCEGSCLRSFHPTIADGIETECESLGFTDKTQIQALG 121

Query: 266 NFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLA 325
            +LC NC+Y++HQC+ACG LGSSD++ SQ+VFPC ++ CG FYHPECV++LL  D+++ A
Sbjct: 122 TYLCNNCLYKQHQCYACGELGSSDENFSQQVFPCSASNCGHFYHPECVARLLCADDQNKA 181

Query: 326 EELRERIAAGESFTCPVHKCFVCQQSEDMN 355
           EEL+ +IAA + F CP+H C +C  SED N
Sbjct: 182 EELQAKIAARDCFACPLHTCKLCNMSEDKN 211


>gi|449474294|ref|XP_004154131.1| PREDICTED: uncharacterized protein LOC101202815, partial [Cucumis
           sativus]
          Length = 248

 Score =  190 bits (483), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 104/247 (42%), Positives = 155/247 (62%), Gaps = 19/247 (7%)

Query: 1   MGYSEEDDDTLTLY-VSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDG 59
           M  S+++ +TL +  VS+Y F+D  EE ISFS+LP++WD  + +  S+  VFLHG + +G
Sbjct: 1   MASSDDEAETLPVPPVSNYHFVDHKEEPISFSILPIKWDEGERLEGSQEPVFLHGTVDNG 60

Query: 60  PWNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQ 119
                +QV AW+F +S ++PEI VLSK+N WI LQ P+KSF++ +RTILITVH L+ +K+
Sbjct: 61  LQKVYKQVTAWRFNISGSKPEISVLSKENVWITLQKPRKSFEDTIRTILITVHCLHALKR 120

Query: 120 NPEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHP 179
           NP+ S K + + + K F  YEV PS+NDL+DHM +  EA +RD  LA S+FLL+F+   P
Sbjct: 121 NPDTSSKSLWDLLAKTFSLYEVRPSQNDLVDHMSLIGEAIKRDDILAKSQFLLAFLEEKP 180

Query: 180 ETNT-----FHEDVRTTEKPKVIVDW------------NEDEDSERVDKDENYF-AVCAI 221
                    F +D +   +P  IVD              E +D+   + D++ F +VCA 
Sbjct: 181 VKRMSCHELFFQDAQRNMQPSFIVDDSDDEDMVDDAVEEESDDAVEEESDDDVFDSVCAF 240

Query: 222 CDDGGDV 228
           CD+GG++
Sbjct: 241 CDNGGNI 247


>gi|297739139|emb|CBI28790.3| unnamed protein product [Vitis vinifera]
          Length = 201

 Score =  190 bits (482), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 95/175 (54%), Positives = 126/175 (72%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGP 60
           M  S+E+ + L   + DY F+D  +E ISFS+LPLQW   +     +M +FL G+  DG 
Sbjct: 1   MASSDEEGEILPNCIRDYYFVDYKDEPISFSILPLQWSKDENPDGLKMMIFLRGSAYDGL 60

Query: 61  WNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQN 120
               +QVIAWKFELS  +PEI+VLSK  NW+ LQSP+KSF+NIVRTIL+TV WL+ VK+N
Sbjct: 61  QKIYKQVIAWKFELSSVEPEIFVLSKDKNWMELQSPRKSFQNIVRTILVTVSWLHFVKRN 120

Query: 121 PEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFM 175
           PEAS K +   +LK F SYE EPSENDLLDHMP+ +EA +R++DL  SK +L+++
Sbjct: 121 PEASGKSLWNHLLKSFSSYEFEPSENDLLDHMPLIQEAVKREEDLLKSKCMLTYL 175


>gi|222616830|gb|EEE52962.1| hypothetical protein OsJ_35606 [Oryza sativa Japonica Group]
          Length = 312

 Score =  181 bits (459), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 95/214 (44%), Positives = 125/214 (58%), Gaps = 32/214 (14%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHAT---ITAGKNALCQSLGYTQAQIDAVPNFLCQNCVY 274
           VC+ICDDGGD   C+GRCLRSFHAT   IT      C +LG T+ Q              
Sbjct: 45  VCSICDDGGDFLCCEGRCLRSFHATEKHITEINQ--CTTLGLTEEQWKIF---------- 92

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
                          + + ++VF C    CG FYHP+C+++LL+PD+        E +A 
Sbjct: 93  ---------------RQNDEKVFQCEDEKCGHFYHPKCLARLLYPDSSMQPLNFEEEVAR 137

Query: 335 GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNF-QRA 393
           G  F CPVHKC VC+  E+ N  + Q A+CRRCP  YHRKCLP++I F D + N   QRA
Sbjct: 138 GLKFLCPVHKCHVCKGGENKNDMENQFAVCRRCPTVYHRKCLPSDIVFEDDESNGIVQRA 197

Query: 394 WVDLLPNNRILIYCLEHKIISELKTPARDHLKFP 427
           W D+L  ++ILIYC++H+I  EL TPAR+H+ FP
Sbjct: 198 WDDVL-RDQILIYCMKHEIDPELGTPARNHIIFP 230


>gi|218200975|gb|EEC83402.1| hypothetical protein OsI_28848 [Oryza sativa Indica Group]
          Length = 1266

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 108/353 (30%), Positives = 175/353 (49%), Gaps = 35/353 (9%)

Query: 18  YEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGPWNFNEQVIAWKFELSYA 77
           Y F +  ++ + FSVLP+++D  + V  S+ +        D   +  ++VIAW+ +L   
Sbjct: 18  YYFEESEDKPVCFSVLPIKFDENEEVSCSDYKELNLRGFTDNNRHVFKKVIAWRVDLDCQ 77

Query: 78  QPEIWVLSKQNNWIMLQSPKKS-FKNIVRTILITVHWLYCVKQNPEASRKYIREQMLKVF 136
           +P+I VLS    WI L  P+   ++  VR+ILITV  L+ V++ P+  ++ + + + +VF
Sbjct: 78  RPKISVLSSDGKWIELLKPRMCYYEKKVRSILITVQMLHFVRKCPKKQQRSLFDHLGEVF 137

Query: 137 CSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPETNTFHEDVRTTEKPKV 196
             +   P+ +D+  H  + +   ERD  L  SK +  F+ +           R   + + 
Sbjct: 138 NKFGPVPNGDDIKKHHHLIKLFMERDPILVKSKIIRCFIEDASRKAIEPIPRRACTEDRF 197

Query: 197 IVDWNEDEDSERVDK----------------------------DENYFAVCAICDDGGDV 228
           IV    DE  E  D                             D++   +CAICDDGG +
Sbjct: 198 IV---SDESLESSDDCSYSDSDSSDDNNDDSTDEDTDTDGNATDDDTDMICAICDDGGKL 254

Query: 229 TFCDGRCLRSFHATITAGKNALCQSLGYTQAQ--IDAVPNFLCQNCVYQEHQCFACGMLG 286
             C+G C RSFHA +  G+ + C++L +T  +  +     FLC+NC + EHQCF CG L 
Sbjct: 255 LSCEGPCKRSFHARVKHGRKSKCRTLRFTSVELKLKESGTFLCENCEHNEHQCFKCGELE 314

Query: 287 SSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFT 339
            S    + +VF C   +CG FYHP+C++ LL P++   A +L E IA    F+
Sbjct: 315 PS-YGPNAKVFQCNKESCGHFYHPKCIAVLLEPEDTDGACKLEEMIADAIMFS 366



 Score =  163 bits (413), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 84/211 (39%), Positives = 126/211 (59%), Gaps = 22/211 (10%)

Query: 686 EKLKEIVDRLHWYVRSGDT---------------------IVDFCCGANDFSCMMKVKLE 724
           E L +I D+LHWYV+ G+T                     IVD C   ++FS +MK KLE
Sbjct: 565 ECLVQIADKLHWYVQPGNTESCGKLPDGVLLERSIVVVREIVDLCFNMDNFSRLMKEKLE 624

Query: 725 QMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKFI 784
            + K C+F+NYDL Q KN+  F++ +W+T++P++LP G  L+M L+ P  V+A  ANKF+
Sbjct: 625 DVSKGCNFKNYDLFQHKNNLCFDESNWVTMQPKDLPHGLNLVMVLDLPLDVQAVSANKFV 684

Query: 785 SQALKFKPKLIVLIVPQETRRLD-QKASYNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQW 843
            + L FKP+LI+++ P +  RLD ++  Y+LIWE N+ + GK  Y PG +DV+D+     
Sbjct: 685 DKVLTFKPRLIIIVSPTDIERLDCKEEPYHLIWEYNQHIFGKPLYQPGDVDVNDDAKNGL 744

Query: 844 NCKPPPLYLWSRADWTASHKKIALGRGHITV 874
           +  P  + LWS  D T  + +IA   GH+ V
Sbjct: 745 HVIPLCVSLWSFPDKTEENMRIAGKHGHLNV 775


>gi|449470764|ref|XP_004153086.1| PREDICTED: uncharacterized protein LOC101206575, partial [Cucumis
           sativus]
          Length = 203

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 83/176 (47%), Positives = 123/176 (69%), Gaps = 1/176 (0%)

Query: 1   MGYSEEDDDTLTLY-VSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDG 59
           M  S+++ +TL +  VS+Y F+D  EE ISFS+LP++WD  + +  S+  VFLHG + +G
Sbjct: 1   MASSDDEAETLPVPPVSNYHFVDHKEEPISFSILPIKWDEGERLEGSQEPVFLHGTVDNG 60

Query: 60  PWNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQ 119
                +QV AW+F +S ++PEI VLSK+N WI LQ P+KSF++ +RTILITVH L+ +K+
Sbjct: 61  LQKVYKQVTAWRFNISGSKPEISVLSKENVWITLQKPRKSFEDTIRTILITVHCLHALKR 120

Query: 120 NPEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFM 175
           NP+ S K + + + K F  YEV PS+NDL+DHM +  EA +RD  LA S+ + +F+
Sbjct: 121 NPDTSSKSLWDLLAKTFSLYEVRPSQNDLVDHMSLIGEAVKRDDILAKSQAIETFI 176



 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 21/37 (56%), Positives = 29/37 (78%)

Query: 259 AQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQE 295
           A+  A+  F+C+NC Y++HQC+ACG LGSSD+SS  E
Sbjct: 167 AKSQAIETFICKNCEYKQHQCYACGNLGSSDQSSGAE 203


>gi|449534314|ref|XP_004174109.1| PREDICTED: uncharacterized protein LOC101225204 [Cucumis sativus]
          Length = 189

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 83/173 (47%), Positives = 119/173 (68%), Gaps = 1/173 (0%)

Query: 1   MGYSEEDDDTLTLY-VSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDG 59
           M  S+++ +TL +  VS+Y F+D  EE ISFS+LP++WD  + +  S+  VFLHG + +G
Sbjct: 1   MASSDDEAETLPVPPVSNYHFVDHKEEPISFSILPIKWDEGERLEGSQEPVFLHGTVDNG 60

Query: 60  PWNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQ 119
                 QV AW+F +S ++PEI VLSK+N WI LQ P+KSF++ +RTILITVH L+ +K+
Sbjct: 61  LQKVYNQVTAWRFNISGSKPEISVLSKENVWITLQKPRKSFEDTIRTILITVHCLHALKR 120

Query: 120 NPEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLL 172
           NP+ S K + + + K F  YEV PS+NDL+DHM +  EA +RD  LA S+ L 
Sbjct: 121 NPDTSSKSLWDLLAKTFSLYEVRPSQNDLVDHMSLIGEAIKRDDILAKSQVLF 173


>gi|168064191|ref|XP_001784048.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664434|gb|EDQ51154.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 541

 Score =  160 bits (405), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 91/220 (41%), Positives = 126/220 (57%), Gaps = 22/220 (10%)

Query: 218 VCAICDDGGD----VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCV 273
           VCA CDDG +    +  CDG C+RSFH TI +G    C +L  T+A + +V  ++C NC 
Sbjct: 272 VCAFCDDGVEGSERLLCCDGPCMRSFHPTIDSGWQNKCPTLRLTRAAV-SVEIWICPNCE 330

Query: 274 YQEHQCFACGMLGSSDKSSS-QEVFPCVSATCGQFYHPECVSKLLHPDN--ESLAEELRE 330
             +HQCF CG LG S  S+  Q+VF C    C +FYHP CV+KLL PD    +LA  ++ 
Sbjct: 331 VGQHQCFVCGKLGKSTGSADHQQVFKCEHRQCKRFYHPACVAKLLVPDVALNNLACRIQL 390

Query: 331 RIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNF 390
           +    E+FTCP+HKC  C   ED     L L  CRRCP A+H +CLP             
Sbjct: 391 KF---ETFTCPLHKCANCNLDEDKADPTLCLIKCRRCPAAWHERCLPQAC---------- 437

Query: 391 QRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFPGVE 430
            R+ +  L + + ++YC +H++  EL TP R+H+ FP +E
Sbjct: 438 -RSQMWPLEDGKCVMYCGKHRLDPELLTPERNHITFPELE 476


>gi|224061090|ref|XP_002300347.1| predicted protein [Populus trichocarpa]
 gi|222847605|gb|EEE85152.1| predicted protein [Populus trichocarpa]
          Length = 238

 Score =  159 bits (403), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 93/237 (39%), Positives = 141/237 (59%), Gaps = 9/237 (3%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIV--GSSEMQVFLHGNIGD 58
           M  S+++ DTL   VS Y F D  +E +SFS+LP++W        G     +FL G++ +
Sbjct: 1   MASSDDEADTLPESVSTYHFADDKDEPLSFSLLPIRWRESKNFDDGKKNHMIFLKGSVDN 60

Query: 59  GPWNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVK 118
           G     +QVIAW F+LS   P+I VL+K+  W+ L  P+KS++ ++RT+LITVH L+  +
Sbjct: 61  GLRTIYKQVIAWTFDLSNTTPQISVLTKEKCWMELGKPRKSYEIMIRTVLITVHCLHFAR 120

Query: 119 QNPEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANH 178
            NPEAS K + + + K F  YE  PS+NDL+DH+ +  EA  R+  LA  KFLL+F+   
Sbjct: 121 WNPEASGKSVWDYLSKTFSLYEHRPSQNDLVDHLDLIGEAVRRENSLAKCKFLLNFLGEK 180

Query: 179 PETNTFH-EDVRTTEKPKVIVDWNEDEDSERVDKDEN------YFAVCAICDDGGDV 228
           P       ED +       IVD  ED++ E +++DE+      + +VCA CD+GG++
Sbjct: 181 PRKKMLSDEDFQAATMSAFIVDDVEDDNFEDLEEDESNDEDELFDSVCAFCDNGGNL 237


>gi|168033774|ref|XP_001769389.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679309|gb|EDQ65758.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 707

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 99/261 (37%), Positives = 142/261 (54%), Gaps = 22/261 (8%)

Query: 189 RTTEKPKVIVDWNEDEDSERVDKDENYFA---VCAICDDGGD----VTFCDGRCLRSFHA 241
           R   +P     + + + S ++  DE++     VC  CDDG +    +  CDG C+RSFH 
Sbjct: 420 RKPGRPARGNSFRKGKRSAKLGGDEDWADEPDVCVFCDDGVEGSQRLLCCDGPCMRSFHP 479

Query: 242 TITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSS-SQEVFPCV 300
           TI +G    C++L  T+A I +V  ++C NC   +HQCFACG LG S +S+  QEVF C 
Sbjct: 480 TIDSGLQNKCKTLRLTRAAI-SVETWICPNCEVGQHQCFACGKLGKSTESADEQEVFVCD 538

Query: 301 SATCGQFYHPECVSKLLHPDNESLAEELRERIAAG-ESFTCPVHKCFVCQQSEDMNVEDL 359
              C +FYHP CV+KLL   +E+    L  RI    E+FTCP+HKC  C   E+     L
Sbjct: 539 HNRCRRFYHPACVAKLL--VSEAAQNNLACRIQLKLETFTCPLHKCANCNLDENKIDPTL 596

Query: 360 QLAICRRCPKAYHRKCLPTEITFSD--ADENNFQRAWVDLLPNNRILIYCLEHKIISELK 417
            L  CRRCP A+H KCLP   +F+   +  +   R+ +  L + +      +H++  EL 
Sbjct: 597 HLVKCRRCPVAWHEKCLPQYGSFNHLISVRSIVCRSQIWPLADGK------KHRLDPELL 650

Query: 418 TPARDHLKFPGVEGKRKKEDL 438
           TP R+H+ FP   G +  E L
Sbjct: 651 TPERNHITFPA--GSKYSESL 669



 Score = 52.8 bits (125), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 31/149 (20%), Positives = 69/149 (46%), Gaps = 7/149 (4%)

Query: 28  ISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGPWNFNEQVIAWKFELS-YAQPEIWVLSK 86
           + F++  +QW+      S   +V++ G   +G    + ++  WK EL    +P+  +   
Sbjct: 180 LPFAIREIQWER-----SQAEKVYVQGKF-NGLQEVSVEIDLWKLELPIVGKPKFLIHEA 233

Query: 87  QNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQNPEASRKYIREQMLKVFCSYEVEPSEN 146
              WI L+ P+  +   ++ +++  H+L   KQ+PE+++  +   + K    ++V+P++ 
Sbjct: 234 AGAWIELRKPRDFYLETMKVVIVAAHFLIFAKQSPESTKGAVWRYIRKECSEFKVKPNQK 293

Query: 147 DLLDHMPISREAAERDKDLANSKFLLSFM 175
           DL     + +     D  L  S  +  F 
Sbjct: 294 DLSPAFLLLQAIVTSDPKLQASSVISQFF 322


>gi|384251374|gb|EIE24852.1| hypothetical protein COCSUDRAFT_41130 [Coccomyxa subellipsoidea
            C-169]
          Length = 1073

 Score =  145 bits (367), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 86/248 (34%), Positives = 134/248 (54%), Gaps = 16/248 (6%)

Query: 611  LEKSITLGLVEGSVKAVRAALEMLDGGCDIEDAKAVCPPEILCQIFQWKRKLDVYLAPFL 670
            LE +IT   ++G VKA R AL       D   +  +    ++ ++++ +  L + LAP L
Sbjct: 788  LEPTITRDRLDGHVKACRGAL-------DNPHSAQLMMGRLVNELYRAEDNLRMVLAPVL 840

Query: 671  HGMRYTSFGRHFTKVEKLKEIVDRLHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSC 730
            HG  YTS+GRHFT    L+E+   L  ++ +GDT+VDF CG N +  M+K      G   
Sbjct: 841  HGASYTSYGRHFTLPYLLEEVNKYLLPFMLTGDTVVDFSCGYNVWLPMLKRMGLAEGLEI 900

Query: 731  SFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKF 790
              R +D+I  K+   F K  W+ V   +LP G +L++GLNPPFG    LA+KF++ A +F
Sbjct: 901  HGRAFDIITAKDTEDFVKSSWLDVDFGQLPRGDKLVIGLNPPFGKNGCLASKFVTHAAQF 960

Query: 791  KPKLIVLIVPQETRRLDQKASYNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCKPPPL 850
            +P++IVLI P  T        Y ++ E+  +++G++F+ PG           WN  PP  
Sbjct: 961  QPRVIVLIAPPGT---IIPPGYIVMLEETRLMAGEAFFKPGV------DKASWNDVPPST 1011

Query: 851  YLWSRADW 858
             +  R D+
Sbjct: 1012 RILIREDY 1019



 Score =  110 bits (274), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 80/249 (32%), Positives = 108/249 (43%), Gaps = 60/249 (24%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKN-----------ALCQSLGYTQ---AQIDA 263
           +C  CDDGG++  CDG C+R+FHA I   +              C  L        + D+
Sbjct: 187 ICTKCDDGGELILCDGGCMRAFHAGIKPAEEEEEGPVYVDDPGSCNPLQIGAELFMRFDS 246

Query: 264 VPN-FLCQNCVYQEHQCFACGMLGSSDKSSSQ--EVFPCVSATCGQFYHPECVSKLLHPD 320
               F+C NC   +HQCFAC + GS   SSS   EVF C    CG+FYH  C++K    D
Sbjct: 247 TNEPFMCPNCHSGKHQCFACKVEGSVGGSSSGLPEVFKCTVGDCGRFYHARCLNK----D 302

Query: 321 NESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEI 380
            + +A            F CP+H C  C + E  N E  ++  CRRCP AYH  CLP +I
Sbjct: 303 PQEMAI----------FFVCPLHTCKRCNEGE--NAEKGEVISCRRCPVAYHAGCLPKKI 350

Query: 381 TFSDADENNFQRAWV--------DLLPN--------------NRILIYCLEHKIISELKT 418
             S        R W+        ++LP                + L+YC+ H +     T
Sbjct: 351 LNSP-----VTRVWITRKLDKEGNVLPPPLPRTKAVEPECDIGQSLLYCMRHDMALGDAT 405

Query: 419 PARDHLKFP 427
           P    L  P
Sbjct: 406 PVMGELFSP 414


>gi|242085258|ref|XP_002443054.1| hypothetical protein SORBIDRAFT_08g007170 [Sorghum bicolor]
 gi|241943747|gb|EES16892.1| hypothetical protein SORBIDRAFT_08g007170 [Sorghum bicolor]
          Length = 279

 Score =  143 bits (360), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 85/213 (39%), Positives = 110/213 (51%), Gaps = 49/213 (23%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAV---PNFLCQNCVY 274
           VCAICDDGG VT CD  CLRSFH T   G+ + C SLG    +   +    +F+C+NC Y
Sbjct: 38  VCAICDDGGYVTCCDSGCLRSFHLTEEHGEGSKCPSLGINSEEAKMIIDKKDFICKNCKY 97

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
           ++HQC +CG+LGSSD SS  EVF C    CG FYHP+C+SKLL+P      ++LR     
Sbjct: 98  KQHQCSSCGLLGSSDLSSGAEVFQCKEYNCGHFYHPKCISKLLYP-----GDKLR----- 147

Query: 335 GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRAW 394
                        C    D+  E       +  P  Y                  FQRAW
Sbjct: 148 ------------ACHFEHDIPFET------KEGPNGY-----------------IFQRAW 172

Query: 395 VDLLPNNRILIYCLEHKIISELKTPARDHLKFP 427
             +L  +RILIYC++H+I+ EL  P R  + FP
Sbjct: 173 DGIL-RDRILIYCMKHEIVKELGIPRRKLIIFP 204


>gi|414877686|tpg|DAA54817.1| TPA: hypothetical protein ZEAMMB73_710478 [Zea mays]
          Length = 277

 Score =  143 bits (360), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 84/223 (37%), Positives = 126/223 (56%), Gaps = 54/223 (24%)

Query: 210 DKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAV---PN 266
           +K EN   VCAICDDGG++T C+G C RSFH  +    N + ++LG T  +   +    +
Sbjct: 36  NKGEN---VCAICDDGGNLTCCEGCCQRSFH--LDNEHNCI-KTLGLTIQEAKNIIEKED 89

Query: 267 FLCQNCVYQEHQCFACGMLGSSDKS--SSQEVFPCVSATCGQFYHPECVSKLLHPDNESL 324
           F+C+NC Y++HQCF CG LGSSD +  SSQ+VF C    C +FYHP+CV+++L+PD++ L
Sbjct: 90  FICKNCQYKQHQCFICGSLGSSDDTDTSSQQVFQCEHDDCARFYHPKCVAQMLYPDSQEL 149

Query: 325 AEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSD 384
            E     +AAGESFTCP+H+C VC++                                  
Sbjct: 150 FE---LHVAAGESFTCPMHECIVCKEV--------------------------------- 173

Query: 385 ADENNFQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFP 427
             +N   +  +D      + ++  +H+I+ EL+TPAR+H+ FP
Sbjct: 174 --DNKKDKTLLDF-----VNLFYRKHEIVRELQTPARNHIVFP 209


>gi|108707357|gb|ABF95152.1| hydroxyproline-rich glycoprotein family protein, putative [Oryza
           sativa Japonica Group]
 gi|125585711|gb|EAZ26375.1| hypothetical protein OsJ_10258 [Oryza sativa Japonica Group]
          Length = 256

 Score =  142 bits (359), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 75/182 (41%), Positives = 105/182 (57%), Gaps = 36/182 (19%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           +CAICDDGG++  C+G C R FH TI+   +  C++L  +Q Q+++   F+C+NCVY++H
Sbjct: 25  ICAICDDGGNLIRCEGACRRYFHRTISNDADFNCETLNMSQEQVES-SKFICKNCVYKQH 83

Query: 278 QCFACGMLGSSDKSS-SQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           QCF CG LGSSD SS S EV+ C  + C +FYHP+C+++     N  +            
Sbjct: 84  QCFGCGELGSSDMSSGSAEVYQCSKSRCRRFYHPKCLAEFDSSKNPPV------------ 131

Query: 337 SFTCPVHKCFVCQ-QSEDMNVE--------------------DLQLAICRRCPKAYHRKC 375
            F CP+H+CF C+ + E  N E                     + LA+CRRCP AYHRKC
Sbjct: 132 -FECPLHECFACKNKGEKYNEETCKNKGQESIKKKQGAENNKKMHLALCRRCPIAYHRKC 190

Query: 376 LP 377
           LP
Sbjct: 191 LP 192


>gi|242085262|ref|XP_002443056.1| hypothetical protein SORBIDRAFT_08g007190 [Sorghum bicolor]
 gi|241943749|gb|EES16894.1| hypothetical protein SORBIDRAFT_08g007190 [Sorghum bicolor]
          Length = 232

 Score =  139 bits (350), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 116/222 (52%), Gaps = 49/222 (22%)

Query: 212 DENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYT---QAQIDAVPNFL 268
           D     VCAICDDGG++T C+G C R+FH      K+   ++LG T      I     F+
Sbjct: 35  DNKGLDVCAICDDGGNLTCCEGCCQRAFHLD---NKHDCIKTLGLTIEDAKNIIEKEAFI 91

Query: 269 CQNCVYQEHQCFACGMLGSSD---KSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLA 325
           C+NC Y++HQC+ CG LGSSD    SS  EVF C    C +FYHP+CV+++L+PD+    
Sbjct: 92  CKNCQYKQHQCYICGSLGSSDDTDTSSQAEVFQCEHDDCARFYHPKCVAQMLYPDSHHRQ 151

Query: 326 EELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDA 385
           E    ++AAGE F+CP+H+C VC++ E+   + L                          
Sbjct: 152 ELFELQVAAGERFSCPMHECIVCKEVENKTEKPLL------------------------- 186

Query: 386 DENNFQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFP 427
                          N + ++  +H+I+ +L+TPARDH+ FP
Sbjct: 187 ---------------NFVNLFYRKHEIVRKLQTPARDHIIFP 213


>gi|296082727|emb|CBI21732.3| unnamed protein product [Vitis vinifera]
          Length = 88

 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 65/87 (74%), Positives = 79/87 (90%), Gaps = 1/87 (1%)

Query: 752 MTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKPKLIVLIVPQETRRLDQKA- 810
           MTV+P ELP GS LIMGLNPPFGV+A+LANKFI QAL+FKPKL+VLIVPQET+RLD+K  
Sbjct: 1   MTVKPSELPTGSHLIMGLNPPFGVQAALANKFIDQALEFKPKLLVLIVPQETQRLDKKKL 60

Query: 811 SYNLIWEDNEVLSGKSFYLPGSLDVHD 837
            Y+L+W+++E LSGKSFYLPGS+DV+D
Sbjct: 61  PYDLVWKEDEELSGKSFYLPGSIDVND 87


>gi|449472830|ref|XP_004153707.1| PREDICTED: uncharacterized protein LOC101217475, partial [Cucumis
           sativus]
          Length = 200

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 64/137 (46%), Positives = 96/137 (70%), Gaps = 1/137 (0%)

Query: 1   MGYSEEDDDTLTLY-VSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDG 59
           M  S+++ +TL +  VS+Y F+D  EE ISFS+LP++WD  + +  S+  VFLHG + +G
Sbjct: 1   MASSDDEAETLPVPPVSNYHFVDHKEEPISFSILPIKWDEGERLEGSQEPVFLHGTVDNG 60

Query: 60  PWNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQ 119
                +QV AW+F +S ++PEI VLSK+N WI LQ P+KSF++ +RTILITVH L+ +K+
Sbjct: 61  LQKVYKQVTAWRFNISGSKPEISVLSKENVWITLQKPRKSFEDTIRTILITVHCLHALKR 120

Query: 120 NPEASRKYIREQMLKVF 136
           NP+ S K + + + K F
Sbjct: 121 NPDTSSKSLWDLLAKTF 137



 Score = 85.5 bits (210), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 37/65 (56%), Positives = 49/65 (75%), Gaps = 3/65 (4%)

Query: 231 CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDK 290
           CDGRC+RSFHAT   G    C SLG ++ ++DA+  F+C+NC Y++HQC+ACG LGSSD+
Sbjct: 139 CDGRCMRSFHATEEDGD---CFSLGLSKEEVDAIETFICKNCEYKQHQCYACGNLGSSDQ 195

Query: 291 SSSQE 295
           SS  E
Sbjct: 196 SSGAE 200


>gi|307105371|gb|EFN53620.1| hypothetical protein CHLNCDRAFT_58419 [Chlorella variabilis]
          Length = 819

 Score =  129 bits (325), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 72/208 (34%), Positives = 97/208 (46%), Gaps = 35/208 (16%)

Query: 219 CAICDDGGDVTFCDGRCLRSFHATIT-AGKNALCQSLGYTQAQIDAV--PNFLCQNCVYQ 275
           CA+CDDGG++  CDG C+RSFH +    G N LC      Q    A     FLC NC+  
Sbjct: 390 CALCDDGGNLLACDGPCMRSFHVSAEDDGCNLLCLPPDLAQTLTSAANSETFLCPNCLAG 449

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAG 335
            HQC+ C   G     +  EVF C++A+CG+ YHPEC                     AG
Sbjct: 450 VHQCYVCKREG----KAGTEVFKCINASCGRHYHPECAG-----------------AEAG 488

Query: 336 ESFTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRA 393
           + F C +H C  C Q  D +   +  +L  CRRCP A+HR+CLP E+        +  R 
Sbjct: 489 QPFVCKLHSCHACGQPGDASAPGQAGELVPCRRCPVAFHRRCLPIELPRHRDGGASLARV 548

Query: 394 WVD---------LLPNNRILIYCLEHKI 412
           W+          L    + L+YC  H +
Sbjct: 549 WLADYEDESGKWLDGVEQSLLYCRNHSL 576


>gi|38424059|dbj|BAD01749.1| unknown protein [Oryza sativa Japonica Group]
 gi|38637133|dbj|BAD03387.1| unknown protein [Oryza sativa Japonica Group]
          Length = 657

 Score =  120 bits (301), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 62/161 (38%), Positives = 94/161 (58%)

Query: 15  VSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGPWNFNEQVIAWKFEL 74
           V +Y F+D N+  +SF VLP Q+D  + V S +  V+L G    G     +QV+AWK  L
Sbjct: 15  VENYYFVDDNDVPVSFDVLPFQFDAAEGVASFKKDVYLRGFTDGGLQKVYKQVVAWKLVL 74

Query: 75  SYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVKQNPEASRKYIREQMLK 134
               PEI VLS + +WI L  P+ S++  +R++LITV  L+ V++ P  S K + + +  
Sbjct: 75  DGDSPEIAVLSTEGSWIALLKPRPSYEETIRSVLITVEMLHFVRRRPTDSEKDMWDHLYG 134

Query: 135 VFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFM 175
           VF  + V P E+D  +H  + +  A+RD DLANS+ L  F+
Sbjct: 135 VFERFVVRPLEDDFANHQNLIKLFAQRDPDLANSQVLQVFI 175


>gi|449532244|ref|XP_004173092.1| PREDICTED: uncharacterized protein LOC101229951 [Cucumis sativus]
          Length = 77

 Score =  120 bits (300), Expect = 4e-24,   Method: Composition-based stats.
 Identities = 56/84 (66%), Positives = 67/84 (79%), Gaps = 7/84 (8%)

Query: 726 MGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKFIS 785
           M  SC F        +NDF+FEKRDWMTV+P+E P GSQLIMGLNPPFGVKA+LANKF+ 
Sbjct: 1   MAVSCLFL-------QNDFNFEKRDWMTVQPKEFPKGSQLIMGLNPPFGVKAALANKFVD 53

Query: 786 QALKFKPKLIVLIVPQETRRLDQK 809
           +AL+F PKL++LIVP ET RLD+K
Sbjct: 54  KALEFNPKLLILIVPPETERLDEK 77


>gi|168001433|ref|XP_001753419.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695298|gb|EDQ81642.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 593

 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 93/278 (33%), Positives = 129/278 (46%), Gaps = 43/278 (15%)

Query: 160 ERDKDLANSKFLLSFMANHPETNTFHEDVRTTEKPKVIVDWNEDEDSERVDKDENYFAVC 219
           E + DL     L       P  +     VR   K     + +E  D   +D+ +    VC
Sbjct: 325 EMEPDLFPPNILTPEPKKKPGRSLLLRSVRKKRK-----ETDEVGDDNWIDEPD----VC 375

Query: 220 AICDDGGD----VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQ 275
           A CDDG +    +  C+G C+RSFH TI +G    C++LG + A ++ V +++C NC   
Sbjct: 376 AFCDDGVENGEKLLCCEGLCMRSFHPTINSGLQNKCRTLGLSPAAVN-VKSWICPNCEAG 434

Query: 276 EHQCFACGMLGSSDKSSSQ------EVFPCVSATCGQFYHPECVSKLLHPDNESLAEELR 329
           +HQCFACG LGSS   SS       EV  C +  C +FYHP+CV KLL P+ E     +R
Sbjct: 435 QHQCFACGKLGSSSSGSSSNSMGLLEVVVCGAKYCQRFYHPDCVVKLLVPEAEQKDLAMR 494

Query: 330 ERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENN 389
            R++  E+  CP+HKC + + S    V   +  +C        R      IT S    N 
Sbjct: 495 IRLSL-ETLICPLHKCRMPESS----VATRRWQVC-----YVLRGIGDGGITDSGLAFNG 544

Query: 390 FQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFP 427
               WV              H+I  EL TP RDH+ FP
Sbjct: 545 --SLWVS-----------SRHRIDPELLTPERDHILFP 569


>gi|242081339|ref|XP_002445438.1| hypothetical protein SORBIDRAFT_07g019210 [Sorghum bicolor]
 gi|241941788|gb|EES14933.1| hypothetical protein SORBIDRAFT_07g019210 [Sorghum bicolor]
          Length = 149

 Score =  103 bits (258), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 49/123 (39%), Positives = 69/123 (56%), Gaps = 22/123 (17%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VCA+C DGGD                      L  SLG+T+AQ++A+ NF+C +C  ++H
Sbjct: 36  VCALCGDGGD----------------------LLWSLGFTEAQVEAMQNFICHSCRNRQH 73

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
           +C +CG+ GSS   +  +VF C   TCG+FYH  C+S  LHP + + A   R R+AAG  
Sbjct: 74  RCASCGIRGSSGVGNHPQVFRCNHETCGRFYHASCISAELHPGDPAEAGRCRVRVAAGRP 133

Query: 338 FTC 340
           F C
Sbjct: 134 FWC 136


>gi|242044180|ref|XP_002459961.1| hypothetical protein SORBIDRAFT_02g019310 [Sorghum bicolor]
 gi|241923338|gb|EER96482.1| hypothetical protein SORBIDRAFT_02g019310 [Sorghum bicolor]
          Length = 243

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 125/245 (51%), Gaps = 21/245 (8%)

Query: 1   MGYSEEDDDTLTL-YVSDYEFLDQNEE--SISFSVLPLQWDGCDIVGSSEMQVFLHGNIG 57
           M  S++DD    L  V  Y F+D  +E  ++ FS LP  +   D +   + ++ L G  G
Sbjct: 1   MMMSDDDDAEPQLNAVESYYFVDSTDENEAVCFSNLPFLFGDTDDLPDCKKRLVLRGT-G 59

Query: 58  DGPWNFNEQVIAWKFELSYAQPEIWVLSKQNN-WIMLQSPKKSFKNIVRTILITVHWLYC 116
           D        V+AW+  L   QP+  VL+ +   W+ L  P KS++ ++RT+LIT   L+ 
Sbjct: 60  DPGVKVYNVVVAWRLGLEGKQPQFAVLTAEGRRWVRLIRPLKSYEEMIRTLLITAQMLHF 119

Query: 117 VKQNPEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFM- 175
           +++ P    K +   + KVF  ++V PSE+D  +H  + +E A +D  L  S+ L  F+ 
Sbjct: 120 LRRKPHEPEKTLWNHLCKVFNKFDVRPSEDDFRNHRSLMKEFAAKDPVLVKSEILQVFVE 179

Query: 176 ----ANHPETNTFHEDVRTTEKPKVIVDWNEDE--------DSERVDKDENYFAVCAICD 223
                N  E    H DV+   +P +  D + DE        +S+  + ++ + ++CAICD
Sbjct: 180 GRSRKNFTEVGADHIDVK---QPFIADDEDIDEMIVEDADNESDEEEDEDLFDSICAICD 236

Query: 224 DGGDV 228
           +GGD+
Sbjct: 237 NGGDI 241


>gi|414884832|tpg|DAA60846.1| TPA: hypothetical protein ZEAMMB73_559074 [Zea mays]
          Length = 243

 Score = 99.8 bits (247), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 77/245 (31%), Positives = 123/245 (50%), Gaps = 21/245 (8%)

Query: 1   MGYSEEDDDTLTL-YVSDYEFLD-QNE-ESISFSVLPLQWDGCDIVGSSEMQVFLHGNIG 57
           M  S++DD    L  V  Y F+D +NE E + FS LP  +   D +   + ++ L G  G
Sbjct: 1   MMMSDDDDAEPQLNAVEGYYFVDSKNEKEPVCFSTLPFWFGDTDDLPDGKKKLVLRGT-G 59

Query: 58  DGPWNFNEQVIAWKFELSYAQPEIWVLS-KQNNWIMLQSPKKSFKNIVRTILITVHWLYC 116
           D      ++V+A +  L   QPE  VL+ K   WI L  P  S++ ++RT+LIT   L+ 
Sbjct: 60  DPGVKVYDEVVACRLGLEGKQPEFAVLTAKGRRWIRLIRPLNSYEEMIRTVLITAQMLHF 119

Query: 117 VKQNPEASRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMA 176
           +++ P    K +   + KVF  ++V PSE+D  +H  + ++ A +D  LA S+ L  F+ 
Sbjct: 120 LRRKPHEPEKTLWNHLCKVFNKFDVPPSEDDFRNHCSLMKQFAAKDPVLAKSEILRVFIE 179

Query: 177 N-----HPETNTFHEDVRTTEKPKVIVDWNEDEDSERVDKDEN--------YFAVCAICD 223
                   E    H DV+   +P +  D   DE +     +E+        + ++CAICD
Sbjct: 180 GKSRKKFTEVGADHIDVK---QPFITDDEGIDEMNVEDVDNESDEDDDEYLFDSICAICD 236

Query: 224 DGGDV 228
           DGGD+
Sbjct: 237 DGGDI 241


>gi|449473710|ref|XP_004153960.1| PREDICTED: uncharacterized protein LOC101211308, partial [Cucumis
           sativus]
          Length = 131

 Score = 98.2 bits (243), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 47/91 (51%), Positives = 66/91 (72%)

Query: 569 VDAEMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVR 628
           +DA  E+ L+ ++K+  SS   E+ +K H VP THA+  KH+++K+I +G +EGSV AVR
Sbjct: 41  LDANSERRLMDMMKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSVVAVR 100

Query: 629 AALEMLDGGCDIEDAKAVCPPEILCQIFQWK 659
           AAL  L+ GC IEDA+AVC PE+L  IF+WK
Sbjct: 101 AALRKLEEGCCIEDAEAVCEPEVLNHIFKWK 131


>gi|10177759|dbj|BAB11072.1| unnamed protein product [Arabidopsis thaliana]
          Length = 227

 Score = 97.1 bits (240), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 55/138 (39%), Positives = 86/138 (62%), Gaps = 3/138 (2%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVG-SSEMQVFLHGNIGDG 59
           M +SEE+ + L  +V +Y F+DQ+EE + F  LPL+W+  D    SS + VFL GNI  G
Sbjct: 1   MEWSEEEGEILPDFVDNYYFIDQSEELVKFCDLPLKWNSVDYDDDSSGLSVFLRGNIDSG 60

Query: 60  PWNFNEQVIAWKFELSYA--QPEIWVLSKQNNWIMLQSPKKSFKNIVRTILITVHWLYCV 117
             +F++ V  W F LS     P+I VL    +WI LQSP+KS++ ++RT L+T+ +L+ V
Sbjct: 61  DESFSKLVKGWNFYLSSVDEHPKIEVLLHGLHWITLQSPRKSYETLIRTSLVTLQFLHFV 120

Query: 118 KQNPEASRKYIREQMLKV 135
           K++P+ +   +   + KV
Sbjct: 121 KRDPDVASGDVWSSLQKV 138


>gi|330833845|ref|XP_003291988.1| hypothetical protein DICPUDRAFT_156684 [Dictyostelium purpureum]
 gi|325077793|gb|EGC31483.1| hypothetical protein DICPUDRAFT_156684 [Dictyostelium purpureum]
          Length = 537

 Score = 96.3 bits (238), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 58/179 (32%), Positives = 85/179 (47%), Gaps = 32/179 (17%)

Query: 201 NEDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQ 260
           +E+E  +R  K+E    +C +C+  GD+  CDG CLRSFH       N       +    
Sbjct: 74  DEEEKPKRTRKNE---TICTVCEKPGDLLMCDGLCLRSFHVNCVGSANI------HLNTN 124

Query: 261 IDAVPNFLCQNCVYQEHQCFAC---GMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLL 317
            D    + C +CV Q++ CF+C   G++G        ++  C    CG+FYH +CVS+  
Sbjct: 125 PDGSVRWECNDCVNQQNYCFSCKKRGIIG-------MDLMKCKVHQCGKFYHYKCVSEY- 176

Query: 318 HPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                 LA+ +  +      F CP+H C VC+ S D      Q   C RCP AYH  C+
Sbjct: 177 -----KLAKLINTKTPR---FNCPLHYCAVCEVSGD----GKQSVHCFRCPTAYHVICM 223


>gi|242041461|ref|XP_002468125.1| hypothetical protein SORBIDRAFT_01g040025 [Sorghum bicolor]
 gi|241921979|gb|EER95123.1| hypothetical protein SORBIDRAFT_01g040025 [Sorghum bicolor]
          Length = 131

 Score = 94.4 bits (233), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 51/122 (41%), Positives = 66/122 (54%), Gaps = 23/122 (18%)

Query: 266 NFLCQNCVYQEHQCFACGMLGSSD--KSSSQEVFPCVSATCGQFYHPECVSKLLHPDNES 323
           NF+C+NC Y++HQCF CG LGSS+    S +EV  C    CG+FYHP+C+SK     N  
Sbjct: 23  NFICKNCEYKKHQCFVCGELGSSEMPPGSPEEVLKCQKKYCGRFYHPKCLSKYDPTKNR- 81

Query: 324 LAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQ--------LAICRRCPKAYHRKC 375
                       + F CP+H+C  C+   +  +   Q        L  CRRCP AYHRKC
Sbjct: 82  ------------QDFECPLHECHSCKNKGETVITSEQTEKKKETYLVQCRRCPVAYHRKC 129

Query: 376 LP 377
           LP
Sbjct: 130 LP 131


>gi|348672863|gb|EGZ12683.1| hypothetical protein PHYSODRAFT_563400 [Phytophthora sojae]
          Length = 1174

 Score = 94.0 bits (232), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 65/197 (32%), Positives = 90/197 (45%), Gaps = 43/197 (21%)

Query: 193 KPKVIVDWNEDEDSERVDKDENYFAVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALC 251
           +PK++     + DS+R  K E+   +C +C+  GD +  C G C+ +FHA        L 
Sbjct: 62  RPKLV---KSEPDSDR-SKGEH---ICVVCEREGDALVVCGGPCISAFHAAC------LS 108

Query: 252 QSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLG----SSDKSSSQEVFP-------CV 300
              G  QAQ D    +LC +C  + H CF C   G      D  +S    P       C 
Sbjct: 109 PDSGAAQAQDD---TWLCPSCRSKTHACFHCKQTGVEALKDDAPASAGADPTLRPVRKCR 165

Query: 301 SATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQ 360
           + +CG+FYH EC+++        LA     RIA    F CP+H C  C QS        +
Sbjct: 166 ALSCGKFYHQECIAQF------PLA-----RIATNTHFICPLHTCAGCNQSGAQQ----E 210

Query: 361 LAICRRCPKAYHRKCLP 377
              C RCP AYH +CLP
Sbjct: 211 AVRCMRCPVAYHARCLP 227


>gi|66828485|ref|XP_647597.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
 gi|74859221|sp|Q55FD6.1|Y8158_DICDI RecName: Full=PHD finger-containing protein DDB_G0268158
 gi|60475597|gb|EAL73532.1| PHD zinc finger-containing protein [Dictyostelium discoideum AX4]
          Length = 688

 Score = 94.0 bits (232), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 59/183 (32%), Positives = 88/183 (48%), Gaps = 28/183 (15%)

Query: 199 DWNEDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQ 258
           + N+D+  +R  K+E   AVC  C+  G++  CDG CLRSFH +    +N    S     
Sbjct: 122 ETNDDDKPKRPRKNE---AVCTFCEKPGELLMCDGLCLRSFHISCLKARNLYNPSSSSIS 178

Query: 259 --AQIDAVPNFLCQNCVYQEHQCFAC---GMLGSSDKSSSQEVFPCVSATCGQFYHPECV 313
               ID    + C +CV  ++ CF+C   G++G        ++  C    CG+FYH +CV
Sbjct: 179 PVTTIDGTVRWECNDCVSSQNSCFSCKKRGIIGI-------DLMKCKVHQCGKFYHYKCV 231

Query: 314 SKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHR 373
           +      +  LA+ +  +      F CP+H C VC+ S D      Q   C RCP AYH 
Sbjct: 232 A------DYKLAKLINTKTPR---FNCPLHYCSVCEVSGDGK----QSVHCFRCPTAYHV 278

Query: 374 KCL 376
            C+
Sbjct: 279 ICM 281


>gi|328865207|gb|EGG13593.1| PHD zinc finger-containing protein [Dictyostelium fasciculatum]
          Length = 584

 Score = 92.4 bits (228), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 59/182 (32%), Positives = 88/182 (48%), Gaps = 37/182 (20%)

Query: 202 EDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQI 261
           E +  +R  K+E   AVC IC+ GG++  CDG CLRSFH           + LG  Q+ +
Sbjct: 74  EKQLPKRQRKNE---AVCTICEKGGELLMCDGACLRSFHV----------ECLGL-QSLV 119

Query: 262 DAVPNFLCQNCVYQEHQCFAC---GMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLH 318
                + C +C+ Q++ CF+C   G++G        ++  C    CG+FYH  C+++   
Sbjct: 120 TPGQRWECDDCLNQQNSCFSCKKRGIIG-------MDLMKCKVHQCGKFYHHSCIAEF-- 170

Query: 319 PDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPT 378
                LA+ +  +      F CP+H C  C QS D      Q   C RCP AYH  C+P 
Sbjct: 171 ----PLAKMVNTK---SPRFNCPLHYCGKCGQSGD----GKQSVHCFRCPAAYHVTCIPP 219

Query: 379 EI 380
            +
Sbjct: 220 GV 221


>gi|255583040|ref|XP_002532288.1| hypothetical protein RCOM_0549780 [Ricinus communis]
 gi|223528022|gb|EEF30103.1| hypothetical protein RCOM_0549780 [Ricinus communis]
          Length = 164

 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/77 (57%), Positives = 54/77 (70%), Gaps = 6/77 (7%)

Query: 803 TRRLDQKASYNLIWEDNEVLSGKSFY------LPGSLDVHDNQLEQWNCKPPPLYLWSRA 856
            +RLD+KA Y LIWED+ +LSGK  +      LPGS+DV+D Q+EQ N KPPPLYLWSR 
Sbjct: 79  VKRLDEKAPYKLIWEDHMLLSGKGTWQFTGSHLPGSIDVNDKQIEQRNSKPPPLYLWSRL 138

Query: 857 DWTASHKKIALGRGHIT 873
           DW A H  IA   GH++
Sbjct: 139 DWIAKHMAIARQCGHMS 155


>gi|297791953|ref|XP_002863861.1| hypothetical protein ARALYDRAFT_331222 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309696|gb|EFH40120.1| hypothetical protein ARALYDRAFT_331222 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 193

 Score = 90.5 bits (223), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 54/126 (42%), Positives = 80/126 (63%), Gaps = 5/126 (3%)

Query: 1   MGYSEEDDDTLTLYVSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGDGP 60
           M +SEE+ + L   V +Y F+DQ++  + F  +PL  D  D  GS    VFL GNI  G 
Sbjct: 1   MEWSEEEGEILPDSVDEYYFVDQSDTPVKFCDVPLNVDSTDSSGSP---VFLRGNINSGD 57

Query: 61  WNFNEQVIAWKFELSYAQ-PEIWVL-SKQNNWIMLQSPKKSFKNIVRTILITVHWLYCVK 118
            +F++ V  W FELS  + P+I VL     +WI LQSP+KS++ ++RT L+T+ +L+ VK
Sbjct: 58  ESFSKLVKGWNFELSVDEHPKIEVLLHGMMHWITLQSPRKSYEALIRTTLVTLQFLHFVK 117

Query: 119 QNPEAS 124
           +NP+AS
Sbjct: 118 RNPDAS 123


>gi|301095030|ref|XP_002896617.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262108847|gb|EEY66899.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1089

 Score = 88.6 bits (218), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 52/173 (30%), Positives = 79/173 (45%), Gaps = 36/173 (20%)

Query: 218 VCAICD-DGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQE 276
           +C +C+ +G D+  C G C+ +FH +    ++ +  +L            +LC +C  + 
Sbjct: 74  ICVVCEREGDDLVVCAGPCISAFHLSCLPARSGVSDALDEA---------WLCPSCTNKT 124

Query: 277 HQCFACGMLG----SSDKSSSQEVFP-------CVSATCGQFYHPECVSKLLHPDNESLA 325
           H CF C   G      D  +S  + P       C + +CG+FYH EC+++        LA
Sbjct: 125 HACFHCKETGVETLKDDAPASTGIDPNKRPVRKCRALSCGKFYHQECITQF------PLA 178

Query: 326 EELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPT 378
                RIA    F CP+H C  C +S        +   C RCP AYH KCLP+
Sbjct: 179 -----RIAINTHFICPLHTCAGCNESGAQQ----EAVRCMRCPVAYHAKCLPS 222


>gi|414884834|tpg|DAA60848.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 143

 Score = 84.0 bits (206), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 33/67 (49%), Positives = 52/67 (77%)

Query: 228 VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGS 287
           + FCDG C+RSFHA   +G+++ C +LGYT+A+++A+  FLC+NC Y++HQCF CG+L  
Sbjct: 19  IFFCDGPCMRSFHAKEGSGEDSCCVTLGYTEAEVEAMKLFLCKNCEYKQHQCFICGVLEP 78

Query: 288 SDKSSSQ 294
           SD  +++
Sbjct: 79  SDGEAAK 85


>gi|298715287|emb|CBJ27936.1| Chromodomain-helicase-DNA-binding protein 8 [Ectocarpus siliculosus]
          Length = 3661

 Score = 82.0 bits (201), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 56/170 (32%), Positives = 78/170 (45%), Gaps = 34/170 (20%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN---FLCQNCVY 274
            VCA C+DGG    CDG C RSFH             LG     +D  P    ++C  C+ 
Sbjct: 2739 VCARCEDGGVTIMCDGPCQRSFHPAC----------LG-----MDDNPEEDPWMCNRCMN 2783

Query: 275  QEHQCFACGMLGSSDKSSSQEV-FP-----CVSATCGQFYHPECVSKLLHPDNESLAEEL 328
            +  +C  CG  GS   S ++ V  P     C  ++CG++YH EC+ K+  P+  S ++E 
Sbjct: 2784 KVQKCLECGKKGSEMDSHNRAVKIPGGVSRCQLSSCGRYYHKECLDKIT-PNRTSYSKE- 2841

Query: 329  RERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPT 378
                    +F CP H C  C ++   N+    L  C RC KA    CL T
Sbjct: 2842 -------GNFKCPQHFCIDCGKTS-TNLGPRTLVKCLRCAKARCPDCLKT 2883


>gi|301117486|ref|XP_002906471.1| chromodomain-helicase-DNA-binding protein, putative [Phytophthora
            infestans T30-4]
 gi|262107820|gb|EEY65872.1| chromodomain-helicase-DNA-binding protein, putative [Phytophthora
            infestans T30-4]
          Length = 2158

 Score = 81.3 bits (199), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 52/179 (29%), Positives = 70/179 (39%), Gaps = 36/179 (20%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN---FLCQNCVY 274
            +C +C DGG +  CDG C RSFH      K+                PN   +LC +C  
Sbjct: 1661 LCTLCGDGGLILLCDGPCHRSFHLDCVGMKDE---------------PNDEQWLCPDCAE 1705

Query: 275  QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
              H C  C  +G  +      V  C  A CG+FYH  C           LAE  R     
Sbjct: 1706 GRHMCLICKQVG--EMGVEFGVTQCSVAKCGRFYHKGC-----------LAENSRVEWVG 1752

Query: 335  GESFTCPVHKCFVCQQSEDMNV-----EDLQLAICRRCPKAYHRKCLPTEITFSDADEN 388
             + F CP H C  C +    N      ++  L  C  C +A+H  C+P+   F    +N
Sbjct: 1753 KKRFRCPSHFCHACSKQATTNSNKKGEDNTGLVSCIHCSQAFHPDCIPSADKFIRLSKN 1811


>gi|405966105|gb|EKC31425.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
           specific [Crassostrea gigas]
          Length = 1079

 Score = 80.1 bits (196), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 51/174 (29%), Positives = 76/174 (43%), Gaps = 37/174 (21%)

Query: 201 NEDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQ 260
           N++E+   + K   Y  +C +C+D GD+  C+G C   FH +        C S    +  
Sbjct: 299 NQEEEKNSLCKSPRY--LCQVCEDNGDLMECEGECHNYFHKS--------CVSENVEEP- 347

Query: 261 IDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPD 320
                 F C+ C    H CFAC       K   +    C    CG+FYH  C++KL+H  
Sbjct: 348 ------FKCEECKSGVHTCFAC-------KEKDENTKKCSVHLCGKFYHENCLNKLMHSK 394

Query: 321 NESLAEELRERIAAGESFTCPVHKCFVC--QQSEDMNVEDLQLAICRRCPKAYH 372
           ++S            + F CP+H+C  C  + S+       +L  C RCP AYH
Sbjct: 395 SDS-----------NKGFVCPLHQCATCAIESSKKTKASKGRLLRCVRCPTAYH 437


>gi|348535504|ref|XP_003455240.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Oreochromis niloticus]
          Length = 2122

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 69/162 (42%), Gaps = 42/162 (25%)

Query: 217  AVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCVY 274
            +VC +C+  GD+  C+G C  +FH           Q +G     + A P   FLC+ C  
Sbjct: 1248 SVCQVCERTGDLLVCEGHCYGAFHL----------QCIG-----LSAPPKGKFLCRECNT 1292

Query: 275  QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVS--KLLHPDNESLAEELRERI 332
              H CF C       K S   V  C+   CG+FYH +C+       P N           
Sbjct: 1293 GVHACFVC-------KKSGDGVKRCIIPLCGKFYHTDCIMAYSATQPHN----------- 1334

Query: 333  AAGESFTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYH 372
               + F CP+H C  C  +  +N+     +LA C RCP AYH
Sbjct: 1335 ---KGFRCPLHVCLSCHITNPLNICSSKGRLARCVRCPVAYH 1373


>gi|157822347|ref|NP_001100807.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Rattus norvegicus]
 gi|149039888|gb|EDL94004.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
            CRA_a [Rattus norvegicus]
          Length = 2381

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 57/196 (29%), Positives = 82/196 (41%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK I++   + D     S+R+  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1196 RTAASPKEILEEGIEHDPGMSASKRLQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1255

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1256 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1295

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F CP+H C  C  +   NV  
Sbjct: 1296 CLLPLCGKFYHEECVQK--YPPT----------VVQNKGFRCPLHICITCHAANPANVSA 1343

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1344 SKGRLMRCVRCPVAYH 1359



 Score = 39.3 bits (90), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1230 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1280

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1281 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1310


>gi|384252957|gb|EIE26432.1| hypothetical protein COCSUDRAFT_58970 [Coccomyxa subellipsoidea
           C-169]
          Length = 707

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 72/271 (26%), Positives = 107/271 (39%), Gaps = 50/271 (18%)

Query: 219 CAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQ 278
           C  C DGG++  CDG CLRSFH       N L  S   +       P + C +C      
Sbjct: 321 CGRCADGGELLECDGTCLRSFHL------NCLTPSERPSPDDPPEAP-WYCPDCRLGIGA 373

Query: 279 CFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKL-------LHPDNESLAEELR-E 330
           C  C   G       Q +  C   +CG ++H +C++ L       +H ++ ++A     E
Sbjct: 374 CAICKQTG----VVGQHILKCKMGSCGYYFHNQCLNGLRDSGLLKIHREDPAVANGGNGE 429

Query: 331 RIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL--PTEITFS----- 383
           R+     FTCP H C VC+ S D      ++  C RCP AYH + L  P+  T S     
Sbjct: 430 RV-----FTCPAHFCHVCRMSGDAQ----RMVRCWRCPTAYHSRGLSAPSLGTASVQMIA 480

Query: 384 --DADENNFQRAWVDLLPNNRILIYCLEHKIISELKTPARDHLKFPGVEGKRKKEDLELL 441
                 N  + A     P+NR +  C   +   E K P     + P  +G     +   +
Sbjct: 481 PVPTPRNQHEAA-----PSNRAVFDCGASEQPVERKRP-----RKPKKDGGNASRN---V 527

Query: 442 LTEEKDVASKRNIVSESFVADKTVVKKLKLA 472
            T+  D A   N + +       V  +   A
Sbjct: 528 ATKRPDTAGPSNAIRQPQPQQPPVAARPSTA 558


>gi|452821906|gb|EME28931.1| hypothetical protein Gasu_36670 [Galdieria sulphuraria]
          Length = 1451

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/172 (30%), Positives = 75/172 (43%), Gaps = 20/172 (11%)

Query: 208  RVDKDENYFAVCAIC--DDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQI-DAV 264
            R+D    Y   C +   D    +  CDG C ++FH     G  +    +   +    D  
Sbjct: 1274 RIDLSSAYCLACELPMEDASNPLIACDGPCQKTFHYECAPGDESERPPIESIELYASDER 1333

Query: 265  PNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESL 324
            P + C NC   EH CF+CG  G      +  +  C   +CG+FYH  C  +      E L
Sbjct: 1334 PLWQCHNCTSGEHICFSCGRPGHI-ADVNDPLRKCSLGSCGRFYHNSCAQQ------EPL 1386

Query: 325  AEELRERIAA-GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKC 375
            A     R+A+ G  F CP H C VC++S D       +  C  CP+A+H +C
Sbjct: 1387 A-----RLASDGNWFRCPQHYCVVCEESGDSR----PMIKCIYCPRAWHVQC 1429


>gi|149039889|gb|EDL94005.1| nuclear receptor binding SET domain protein 1 (predicted), isoform
            CRA_b [Rattus norvegicus]
          Length = 2586

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 57/196 (29%), Positives = 82/196 (41%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK I++   + D     S+R+  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1401 RTAASPKEILEEGIEHDPGMSASKRLQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1460

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1461 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1500

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F CP+H C  C  +   NV  
Sbjct: 1501 CLLPLCGKFYHEECVQK--YPPT----------VVQNKGFRCPLHICITCHAANPANVSA 1548

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1549 SKGRLMRCVRCPVAYH 1564



 Score = 39.3 bits (90), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1435 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1485

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1486 ----------HTCFVCKQS----GEDVKRCLLPLCGKFYHEECV 1515


>gi|432879768|ref|XP_004073538.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Oryzias latipes]
          Length = 2321

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 49/159 (30%), Positives = 67/159 (42%), Gaps = 39/159 (24%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC +C+  GD+  C+G C  +FH           Q +G + A       F CQ C   +H
Sbjct: 1469 VCQVCERPGDLLVCEGHCYGAFHL----------QCVGLSVAPKG---KFFCQECNTGDH 1515

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKL--LHPDNESLAEELRERIAAG 335
             CF C   G        +V  C+   CG+FYH +C+       P N              
Sbjct: 1516 MCFVCKKSG--------DVKRCIIPLCGKFYHMDCILAFSATQPHN-------------- 1553

Query: 336  ESFTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYH 372
            + F CP+H C  C  +  +NV     +LA C RCP AYH
Sbjct: 1554 KGFRCPLHVCLSCHITNPVNVFSSKGRLARCVRCPVAYH 1592


>gi|326928449|ref|XP_003210391.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like, partial [Meleagris gallopavo]
          Length = 2336

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/157 (29%), Positives = 67/157 (42%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC IC+  G++  C+ +C  +FH           Q LG ++        F+C  C    H
Sbjct: 1248 VCQICEKPGELLLCEAQCCGAFH----------LQCLGLSEMPTG---KFICNECSTGVH 1294

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       KS  Q+V  C+   CG++YH EC+ K  +P            +   + 
Sbjct: 1295 TCFVC-------KSCGQDVKRCLLPLCGKYYHEECIQK--YPPT----------VTQNKG 1335

Query: 338  FTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYH 372
            F C +H C  C  +   N+     +L  C RCP AYH
Sbjct: 1336 FRCSLHICMTCHAANPTNISASKGRLMRCVRCPVAYH 1372


>gi|292621054|ref|XP_683890.4| PREDICTED: hypothetical protein LOC556086 [Danio rerio]
          Length = 2055

 Score = 77.8 bits (190), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/157 (29%), Positives = 66/157 (42%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC +C+  G++  C+G+C  +FH           Q +G T+        F+CQ C    H
Sbjct: 1228 VCQVCEKTGELLLCEGQCCGAFHL----------QCIGLTETPKG---RFICQECKMGVH 1274

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       K   +EV  C+   CG+FYH +C+ K                +A    
Sbjct: 1275 TCFVC-------KKPDKEVRRCMIPVCGKFYHMDCILK------------YSPTVAQNRG 1315

Query: 338  FTCPVHKCFVCQQSEDMN--VEDLQLAICRRCPKAYH 372
            F C +H C  C  +   N  +   +L  C RCP AYH
Sbjct: 1316 FRCSIHVCLSCYITNPNNPGISKGRLTRCVRCPVAYH 1352


>gi|354472091|ref|XP_003498274.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Cricetulus griseus]
          Length = 1436

 Score = 77.8 bits (190), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/160 (33%), Positives = 71/160 (44%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCV 273
            VC IC+  GD +  C+G C R FH           + LG T     AVP   F+C+ C 
Sbjct: 702 TVCQICETAGDCLVSCEGECCRHFHW----------ECLGLT-----AVPEGRFICEECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C  + CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KESGEDVKRCSVSVCGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|354472093|ref|XP_003498275.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
           [Cricetulus griseus]
          Length = 1387

 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 53/160 (33%), Positives = 71/160 (44%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCV 273
            VC IC+  GD +  C+G C R FH           + LG T     AVP   F+C+ C 
Sbjct: 696 TVCQICETAGDCLVSCEGECCRHFHW----------ECLGLT-----AVPEGRFICEECK 740

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C  + CG+FYH  CV K      ES          
Sbjct: 741 TGQHPCFSC-------KESGEDVKRCSVSVCGKFYHEACVRKFPTAIFES---------- 783

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 784 --KGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 821


>gi|189534132|ref|XP_001923087.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20 specific [Danio rerio]
          Length = 773

 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 73/171 (42%), Gaps = 37/171 (21%)

Query: 221 ICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCF 280
           +C+  GD+  C+G+C  +FH           Q  G  +        FLCQ C    H CF
Sbjct: 1   MCEKQGDLLLCEGQCCGAFHP----------QCTGLNEPPTG---KFLCQECTSGVHSCF 47

Query: 281 ACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTC 340
           AC  LG       ++V  C+   CG+FYH EC            A      +    +F C
Sbjct: 48  ACKRLG-------EDVRRCMVPGCGKFYHGEC------------AASHAPTVPLNRAFRC 88

Query: 341 PVHKCFVC--QQSEDMNVEDLQLAICRRCPKAYHRK--CLPT-EITFSDAD 386
           P+H C  C      + +V   QL  C RCP AYH    C+P   +T +D++
Sbjct: 89  PLHACLSCFILNPANPSVAKGQLTRCIRCPVAYHSSDFCIPAGSVTLTDSN 139


>gi|281208068|gb|EFA82246.1| PHD zinc finger-containing protein [Polysphondylium pallidum PN500]
          Length = 533

 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 53/186 (28%), Positives = 82/186 (44%), Gaps = 35/186 (18%)

Query: 194 PKVIVDWNEDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQS 253
           PK+  +   D D+ +  K     A+C +C+   ++  CDG CLRSFH             
Sbjct: 47  PKINNNVTHDADA-KAKKPRKNEAICTVCEKSDNLLMCDGPCLRSFHIDCIG-------- 97

Query: 254 LGYTQAQIDAVPN--FLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPE 311
                  +D++P   + C +C   ++ CFAC   G  + + ++    C    CG+FYH +
Sbjct: 98  -------LDSMPTSRWECSDCSKNQNICFACKERGIINMNLTK----CKVHQCGKFYHNK 146

Query: 312 CVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAY 371
           C+S+        L + +  +      F CP+H C  C  S D      Q   C RCP AY
Sbjct: 147 CISEF------KLTKVVNTK---SPRFNCPLHYCESCGVSGDGK----QSVHCFRCPTAY 193

Query: 372 HRKCLP 377
           H  C+P
Sbjct: 194 HVICMP 199


>gi|410923178|ref|XP_003975059.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Takifugu
           rubripes]
          Length = 1499

 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 69/234 (29%), Positives = 98/234 (41%), Gaps = 58/234 (24%)

Query: 156 REAAERDKDLANSKFLLSFMANHPETNTFHEDVRTTEKPKVIVDWNEDE-------DSER 208
           R  A  D ++A+S++           +T   D R    P+ +   N D        D++ 
Sbjct: 684 RSRASTDVEMASSQY----------RDTSDSDSRGLSDPQGLFGKNLDSPAAADGSDTQS 733

Query: 209 VD-----KDENYF---AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQA 259
           VD     +D N      VC IC+  G+ +  C+G C R FH           + LG T A
Sbjct: 734 VDSGLSRQDSNTDKRDTVCQICEAYGESLVVCEGDCNRQFHL----------ECLGLT-A 782

Query: 260 QIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHP 319
           Q +    F+C  C   +H CF+C       K++ QEV  C  + CG +YH +CV KLL  
Sbjct: 783 QPEG--RFICLECQNGDHPCFSC-------KTAGQEVTRCSVSGCGCYYHEDCVRKLLGT 833

Query: 320 DNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
            +           + G  F CP H C  C    D+      +L  C RCP AYH
Sbjct: 834 TS-----------SPGGGFCCPQHICSTCCLERDLQRASKGRLMRCIRCPVAYH 876


>gi|363739108|ref|XP_414538.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Gallus gallus]
          Length = 2412

 Score = 77.0 bits (188), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 45/157 (28%), Positives = 67/157 (42%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC IC+  G++  C+ +C  +FH           Q LG ++        F+C  C    H
Sbjct: 1260 VCQICEKPGELLLCEAQCCGAFH----------LQCLGLSEMPTG---KFICNECSTGVH 1306

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       K+  Q+V  C+   CG++YH EC+ K  +P            +   + 
Sbjct: 1307 TCFVC-------KNCGQDVKRCLLPLCGKYYHEECIQK--YPPT----------VTQNKG 1347

Query: 338  FTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYH 372
            F C +H C  C  +   N+     +L  C RCP AYH
Sbjct: 1348 FRCSLHICMTCHAANPTNISASKGRLMRCVRCPVAYH 1384


>gi|118572948|sp|Q6P2L6.2|NSD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
           Full=Nuclear SET domain-containing protein 3; AltName:
           Full=Wolf-Hirschhorn syndrome candidate 1-like protein 1
           homolog; Short=WHSC1-like protein 1
          Length = 1439

 Score = 77.0 bits (188), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/160 (32%), Positives = 71/160 (44%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCV 273
            VC +C+  GD +  C+G C R FH           + LG T     AVP  +F C+ C 
Sbjct: 702 TVCQVCEKAGDCLVACEGECCRHFHV----------ECLGLT-----AVPEGHFTCEECE 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C  + CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVSVCGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|124486903|ref|NP_001074738.1| histone-lysine N-methyltransferase NSD3 isoform 2 [Mus musculus]
 gi|189442807|gb|AAI67226.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (human) [synthetic
           construct]
          Length = 1446

 Score = 77.0 bits (188), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/160 (32%), Positives = 71/160 (44%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCV 273
            VC +C+  GD +  C+G C R FH           + LG T     AVP  +F C+ C 
Sbjct: 712 TVCQVCEKAGDCLVACEGECCRHFHV----------ECLGLT-----AVPEGHFTCEECE 756

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C  + CG+FYH  CV K      ES          
Sbjct: 757 TGQHPCFSC-------KVSGKDVKRCSVSVCGKFYHEACVRKFPTAIFES---------- 799

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 800 --KGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 837


>gi|395847335|ref|XP_003796334.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Otolemur garnettii]
          Length = 1438

 Score = 77.0 bits (188), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/160 (32%), Positives = 71/160 (44%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +T C+G C + FH           + LG T     +VP+  F+C  C 
Sbjct: 702 TVCQICESSGDSLTPCEGECCKHFHL----------ECLGLT-----SVPDGKFICTECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMLRCLRCPVAYH 827


>gi|224080887|ref|XP_002197925.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Taeniopygia
           guttata]
          Length = 1435

 Score = 77.0 bits (188), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 70/159 (44%), Gaps = 38/159 (23%)

Query: 218 VCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCVY 274
           VC IC+  G+ +  C+G C   FH           + LG     + A+P   F+C  C  
Sbjct: 700 VCQICESSGESLLACEGECCSMFHL----------ECLG-----LKAMPEEKFICTECKN 744

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
            EH CF+C + G       ++V  C  +TCG+FYH  CV K      ES           
Sbjct: 745 GEHTCFSCKLPG-------KDVKRCSVSTCGKFYHEACVRKFATALFES----------- 786

Query: 335 GESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
              F CP H C  C   +DM+     ++A C RCP AYH
Sbjct: 787 -RGFRCPQHCCTACSMDKDMHKASKGRMARCLRCPVAYH 824


>gi|395847337|ref|XP_003796335.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
           [Otolemur garnettii]
          Length = 1389

 Score = 76.6 bits (187), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 52/160 (32%), Positives = 71/160 (44%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +T C+G C + FH           + LG T     +VP+  F+C  C 
Sbjct: 702 TVCQICESSGDSLTPCEGECCKHFHL----------ECLGLT-----SVPDGKFICTECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMLRCLRCPVAYH 827


>gi|354471955|ref|XP_003498206.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Cricetulus griseus]
          Length = 2690

 Score = 76.6 bits (187), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 67/157 (42%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC  C+  G++  C+ +C  +FH           + LG T+        F+C  C    H
Sbjct: 1548 VCQNCEKLGELLLCEAQCCGAFHL----------ECLGLTEMPRG---KFICNECRTGIH 1594

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       K S ++V  C+   CG+FYH ECV K  +P            +   + 
Sbjct: 1595 TCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK--YPPT----------VTQNKG 1635

Query: 338  FTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYH 372
            F CP+H C  C  +   NV     +L  C RCP AYH
Sbjct: 1636 FRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1672



 Score = 39.7 bits (91), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1543 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1593

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1594 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1623


>gi|38424052|dbj|BAD01742.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|38637126|dbj|BAD03380.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 645

 Score = 76.6 bits (187), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 49/127 (38%), Positives = 64/127 (50%), Gaps = 35/127 (27%)

Query: 288 SDKSSSQE------VFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCP 341
           S+K+SS++      VF C +A+CG+FYHP+CV+ LL PD    A  L +RIA G +FTCP
Sbjct: 210 SEKTSSKKKYNTSLVFQCCNASCGRFYHPKCVAGLLEPDG---ACGLEKRIADGMTFTCP 266

Query: 342 VHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRAWVDLLPNN 401
           VH CF C+Q ED                         EI+F   D     R W     + 
Sbjct: 267 VHWCFECKQIEDR------------------------EISFKKKDVT--ARVWEVPKGDP 300

Query: 402 RILIYCL 408
           +I IYCL
Sbjct: 301 KIFIYCL 307



 Score = 53.5 bits (127), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 55/111 (49%), Gaps = 3/111 (2%)

Query: 1   MGYSEEDDDTLTLY--VSDYEFLDQNEESISFSVLPLQWDGCDIVGSSEMQVFLHGNIGD 58
           M   ++DDD       V  Y   D +E+ + FS+LP  +D    +  SE  V++ G   +
Sbjct: 1   MALFDDDDDEFPQLKDVDQYACKDSHEDLVCFSILPFWFDEEHRLLGSEKGVYIRGIDCE 60

Query: 59  GPWNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSF-KNIVRTIL 108
           G     +++ AW+ EL   +P I VLS + NWI L +P   + ++I R   
Sbjct: 61  GRQPVYKRIEAWRVELDCEKPMISVLSSEGNWIRLLNPHPGYAEDIARKFF 111


>gi|395847339|ref|XP_003796336.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3
           [Otolemur garnettii]
          Length = 1427

 Score = 76.6 bits (187), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 52/160 (32%), Positives = 71/160 (44%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +T C+G C + FH           + LG T     +VP+  F+C  C 
Sbjct: 702 TVCQICESSGDSLTPCEGECCKHFHL----------ECLGLT-----SVPDGKFICTECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMLRCLRCPVAYH 827


>gi|327265653|ref|XP_003217622.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Anolis carolinensis]
          Length = 2106

 Score = 76.6 bits (187), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 57/208 (27%), Positives = 86/208 (41%), Gaps = 47/208 (22%)

Query: 180  ETNTFHEDVR---TTEKPKVIVDWNEDE-----DSERVDKDENYFA-----VCAICDDGG 226
            ET++F  D     TT  PK I +   +       S+R+  +    A     VC IC+  G
Sbjct: 925  ETHSFQRDRAANGTTGSPKEITEEGSENYPGVPSSKRMQGERGGGAALKENVCQICEKPG 984

Query: 227  DVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLG 286
            ++  C+ +C  +FH           + LG ++        F+C  C    H CF C    
Sbjct: 985  ELLLCEAQCCGAFHL----------ECLGLSEMPKG---KFICTECSTGVHTCFVC---- 1027

Query: 287  SSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCF 346
               K+S + V  C+   CG++YH EC+ K  +P            +   + F C +H C 
Sbjct: 1028 ---KNSGENVKRCLLPLCGKYYHEECIQK--YPPT----------VIQNKGFRCSLHICM 1072

Query: 347  VCQQSEDMNV--EDLQLAICRRCPKAYH 372
             C  +   NV     +L  C RCP AYH
Sbjct: 1073 TCHAANPTNVSASKGRLMRCVRCPVAYH 1100


>gi|325193611|emb|CCA27889.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1008

 Score = 76.6 bits (187), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 69/218 (31%), Positives = 91/218 (41%), Gaps = 71/218 (32%)

Query: 218 VCAICDDG-GDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN-----FLCQN 271
           +C  C+   GD+  C G C+ ++H                   QIDA  N     + C +
Sbjct: 4   ICLRCEKSDGDLWLCTGLCVSAYHLHC---------------LQIDANANLDPTTWKCPS 48

Query: 272 CVYQEHQCFACGMLGSS---DKSSSQE---------------VFPCVSATCGQFYHPECV 313
           C    H CF C   GSS   + +  QE               V  C + +CG+FYH EC+
Sbjct: 49  CQNHLHICFYCHQTGSSFYHENAVQQEPSQTHTPPLHDNISYVSKCRALSCGKFYHLECI 108

Query: 314 SKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHR 373
           +KL      +LA     RI AG  F CP+H C  C+QS     E ++   C RCP AYH 
Sbjct: 109 TKL------ALA-----RI-AGTHFICPLHTCASCEQS-GAKKESVR---CARCPVAYHT 152

Query: 374 KCLPTEITFSDADENNFQRAWVDLLPNNRILIYCLEHK 411
            CLP              R  V LL  N IL  C +H+
Sbjct: 153 SCLP--------------RKNVQLLTGNCIL--CPKHE 174


>gi|242047476|ref|XP_002461484.1| hypothetical protein SORBIDRAFT_02g003420 [Sorghum bicolor]
 gi|241924861|gb|EER98005.1| hypothetical protein SORBIDRAFT_02g003420 [Sorghum bicolor]
          Length = 631

 Score = 76.6 bits (187), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 70/135 (51%), Gaps = 8/135 (5%)

Query: 352 EDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRAWVDLLPNNRILIYCLEHK 411
           E+     LQLA+CRRCP+AYH +CLP E++    D++   RAW     +  I  YCL+H+
Sbjct: 2   ENKTQRALQLAVCRRCPRAYHWECLPRELSLGAKDKDGNPRAWK---LSKTIFFYCLDHE 58

Query: 412 IISELKTPARDHLKFPGV-EGKRKKEDLELLLTEEKDVASKRNIVSESFVADKTVVKKLK 470
           I  + +T +R+H+KFP   E  + KE    L   +  +  KR   ++       +  +L 
Sbjct: 59  IDKDTRTASRNHIKFPATPECTKTKE----LGNRKGRMTGKRRKNTDQSTEPTELSNRLY 114

Query: 471 LAEVYSGADVGMSNS 485
            AE     +VG  ++
Sbjct: 115 GAESEQADNVGAKST 129


>gi|148700883|gb|EDL32830.1| mCG14519 [Mus musculus]
          Length = 1381

 Score = 76.6 bits (187), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 71/159 (44%), Gaps = 38/159 (23%)

Query: 218 VCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCVY 274
           VC +C+  GD +  C+G C R FH           + LG T     AVP  +F C+ C  
Sbjct: 688 VCQVCEKAGDCLVACEGECCRHFHV----------ECLGLT-----AVPEGHFTCEECET 732

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
            +H CF+C       K S ++V  C  + CG+FYH  CV K      ES           
Sbjct: 733 GQHPCFSC-------KVSGKDVKRCSVSVCGKFYHEACVRKFPTAIFES----------- 774

Query: 335 GESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
            + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 775 -KGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 812


>gi|344240382|gb|EGV96485.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Cricetulus griseus]
          Length = 2318

 Score = 76.3 bits (186), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 67/157 (42%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC  C+  G++  C+ +C  +FH           + LG T+        F+C  C    H
Sbjct: 1176 VCQNCEKLGELLLCEAQCCGAFHL----------ECLGLTEMPRG---KFICNECRTGIH 1222

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       K S ++V  C+   CG+FYH ECV K  +P            +   + 
Sbjct: 1223 TCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK--YPPT----------VTQNKG 1263

Query: 338  FTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYH 372
            F CP+H C  C  +   NV     +L  C RCP AYH
Sbjct: 1264 FRCPLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1300



 Score = 39.3 bits (90), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1171 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1221

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1222 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1251


>gi|224050217|ref|XP_002195834.1| PREDICTED: histone-lysine N-methyltransferase NSD2 [Taeniopygia
           guttata]
          Length = 1339

 Score = 76.3 bits (186), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 63/157 (40%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  GD+  C+G C R+FH +        C  L    A       F+C  C    H
Sbjct: 673 VCQLCEKTGDLLLCEGLCYRAFHVS--------CLGLSGRPAG-----KFICSECTSGVH 719

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K    +V  CV + CG+FYH  CV K                +     
Sbjct: 720 TCFVC-------KERKADVKRCVVSHCGKFYHEACVKK------------FHLTVFENRG 760

Query: 338 FTCPVHKCFVCQQSEDMN--VEDLQLAICRRCPKAYH 372
           F CP+H C  C  S   +  +   ++  C RCP AYH
Sbjct: 761 FRCPLHSCLSCHVSNPSHPRISKGKMMRCVRCPVAYH 797


>gi|348554221|ref|XP_003462924.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Cavia
           porcellus]
          Length = 1438

 Score = 76.3 bits (186), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 52/160 (32%), Positives = 71/160 (44%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C R FH           + LG+T     +VP+  F C  C 
Sbjct: 702 TVCQICESSGDSLVPCEGECYRYFHL----------ECLGWT-----SVPDGKFTCMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C + G       ++V  C    CG+FYH  CV K       S A      I 
Sbjct: 747 TGQHPCFSCKVPG-------EDVKRCSVGACGKFYHEACVRKF------STA------IF 787

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 788 ESKGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|348688432|gb|EGZ28246.1| hypothetical protein PHYSODRAFT_470076 [Phytophthora sojae]
          Length = 2182

 Score = 75.9 bits (185), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 51/184 (27%), Positives = 72/184 (39%), Gaps = 41/184 (22%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN---FLCQNCVY 274
            +C +C DGG +  CDG C RSFH      K+                PN   +LC +C  
Sbjct: 1660 LCTLCGDGGLILLCDGPCHRSFHLECVGMKDE---------------PNDEQWLCPDCAE 1704

Query: 275  QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
              H C  C  +G  +      V  C  A CG+FYH  C           LA+  R     
Sbjct: 1705 GRHMCLICKQVG--EMGVEFGVTQCSVAKCGRFYHKGC-----------LAKSSRVEWVG 1751

Query: 335  GESFTCPVHKCFVCQQ--------SEDMNVEDLQLAI--CRRCPKAYHRKCLPTEITFSD 384
             + F CP H C  C Q              +D++  +  C  C +A+H +C+P+   F  
Sbjct: 1752 KKRFRCPSHFCHACSQASKKESAAKSKKKKDDVKTHVVSCIHCSQAFHPECIPSADKFIR 1811

Query: 385  ADEN 388
              +N
Sbjct: 1812 LSKN 1815


>gi|410914004|ref|XP_003970478.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20 specific-like [Takifugu rubripes]
          Length = 1169

 Score = 75.9 bits (185), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 49/158 (31%), Positives = 65/158 (41%), Gaps = 37/158 (23%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  GD+  CDG C  +FH           Q +G + A       F C+ C    H
Sbjct: 381 VCQVCERTGDLLACDGHCYGAFHP----------QCIGLSAAPKG---KFFCRECSSGVH 427

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKL--LHPDNESLAEELRERIAAG 335
            CF C       K+S   V  C+   CG+FYH +C++      P N+ L           
Sbjct: 428 SCFVC-------KTSGNGVKRCMIPLCGKFYHNDCITAFSATQPHNKGL----------- 469

Query: 336 ESFTCPVHKCFVCQ-QSEDMNVEDLQLAICRRCPKAYH 372
               CP+H C  C       N    +LA C RCP AYH
Sbjct: 470 ---RCPLHVCLSCHINYPHSNCSKGRLARCVRCPVAYH 504


>gi|296222021|ref|XP_002757008.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
           [Callithrix jacchus]
          Length = 1438

 Score = 75.9 bits (185), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 70/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C R FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIACEGECCRHFHL----------ECLGLT-----SLPDSRFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|403294386|ref|XP_003938171.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Saimiri
           boliviensis boliviensis]
          Length = 1438

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 70/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C R FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIACEGECCRHFHL----------ECLGLT-----SLPDSRFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|296222019|ref|XP_002757007.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Callithrix jacchus]
          Length = 1427

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 70/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C R FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIACEGECCRHFHL----------ECLGLT-----SLPDSRFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|296222023|ref|XP_002757009.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3
           [Callithrix jacchus]
          Length = 1389

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 70/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C R FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIACEGECCRHFHL----------ECLGLT-----SLPDSRFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|241998002|ref|XP_002433644.1| set domain protein, putative [Ixodes scapularis]
 gi|215495403|gb|EEC05044.1| set domain protein, putative [Ixodes scapularis]
          Length = 729

 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 53/175 (30%), Positives = 75/175 (42%), Gaps = 38/175 (21%)

Query: 204 EDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDA 263
           +++  V+K      VC +C+  G   FC G C  +FHA            LG     +  
Sbjct: 183 QENSAVEKPPTKEEVCTVCEKMGATLFCTGPCKLAFHA----------DCLG-----VSH 227

Query: 264 VPN-FLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNE 322
           VP  F+C  C   EH C  C  LG ++K S +        +CG FYH +C++KL  P  +
Sbjct: 228 VPRAFVCDECTTGEHLCLVCKDLGETEKCSLE--------SCGCFYHKKCLAKLPLPLKQ 279

Query: 323 SLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE-DLQLAICRRCPKAYHRKCL 376
                        + F CP H C  C Q +   +    +L  C RCP A+H  CL
Sbjct: 280 -------------DPFVCPRHFCLGCFQEKPTTLNAKGRLLRCVRCPSAFHVGCL 321


>gi|410307858|gb|JAA32529.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
          Length = 1437

 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 70/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD V  C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSVIACEGECCKHFHL----------ECLGLT-----SLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|118090799|ref|XP_420839.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Gallus
           gallus]
          Length = 1369

 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 64/157 (40%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  GD+  C+G C R+FH +        C  L    A       F+C  C    H
Sbjct: 674 VCQLCEKTGDLLLCEGLCYRAFHVS--------CLGLSGRPAG-----KFVCSECTSGVH 720

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K    ++  CV + CG+FYH  CV K      ES              
Sbjct: 721 TCFVC-------KERKADLKRCVVSHCGKFYHEACVKKFHLTVFES------------RG 761

Query: 338 FTCPVHKCFVCQQSEDMN--VEDLQLAICRRCPKAYH 372
           F CP+H C  C  S   +  +   ++  C RCP AYH
Sbjct: 762 FRCPLHSCLSCHVSNPSHPRISKGKMMRCVRCPVAYH 798


>gi|326919530|ref|XP_003206033.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Meleagris gallopavo]
          Length = 1348

 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 46/157 (29%), Positives = 63/157 (40%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  GD+  C+G C R+FH +        C  L    A       F+C  C    H
Sbjct: 653 VCQLCEKTGDLLLCEGLCYRAFHVS--------CLGLSGRPAG-----KFVCSECTSGVH 699

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K    ++  CV + CG+FYH  CV K                +     
Sbjct: 700 TCFVC-------KERKADLKRCVVSHCGKFYHEACVKK------------FHLTVFENRG 740

Query: 338 FTCPVHKCFVCQQSEDMN--VEDLQLAICRRCPKAYH 372
           F CP+H C  C  S   +  +   ++  C RCP AYH
Sbjct: 741 FRCPLHSCLSCHVSNPSHPRISKGKMMRCVRCPVAYH 777


>gi|410914796|ref|XP_003970873.1| PREDICTED: uncharacterized protein LOC101068764 [Takifugu rubripes]
          Length = 2363

 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 43/157 (27%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            +C +C+  G++  C+G+C  +FH          C SL            F+C  C    H
Sbjct: 1551 ICQVCEKTGELLLCEGQCCGAFHLA--------CISLAEAPKG-----KFICPECKSGIH 1597

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       K  S++V  C+   CG+FYH EC++                 +A    
Sbjct: 1598 TCFVC-------KKRSEDVRRCMIPVCGKFYHGECIANFA------------PTVAVNRG 1638

Query: 338  FTCPVHKCFVC--QQSEDMNVEDLQLAICRRCPKAYH 372
            F C +H C  C         +   +L  C RCP AYH
Sbjct: 1639 FRCSIHVCLTCFIANPNSSTISKGRLVRCVRCPIAYH 1675


>gi|325188110|emb|CCA22651.1| chromodomainhelicaseDNAbinding protein putative [Albugo laibachii
            Nc14]
          Length = 1883

 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 46/161 (28%), Positives = 67/161 (41%), Gaps = 27/161 (16%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            +C +C DGG +  CDG C RSFH           + +G T    D   ++LC +C   +H
Sbjct: 1595 LCTLCGDGGLILLCDGPCHRSFHL----------ECIGMTHEPQDE--HWLCPDCNAGKH 1642

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             C  C  +G  +      V  C  A CG+FYH  C+    H +               + 
Sbjct: 1643 MCLLCKKVG--EMGVEFGVLQCSMARCGRFYHRGCLEVDRHVE-----------WVGKKR 1689

Query: 338  FTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPT 378
            F CP H C  C+  E        +  C  C +A+H  C+P+
Sbjct: 1690 FRCPSHFCHSCK--ERKRPRQSTIVSCLHCARAFHADCIPS 1728


>gi|351699801|gb|EHB02720.1| Histone-lysine N-methyltransferase NSD3 [Heterocephalus glaber]
          Length = 1439

 Score = 75.1 bits (183), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 71/160 (44%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C + FH           + LG T     +VP+  F+C+ C 
Sbjct: 702 TVCQICERSGDSLVPCEGECCKHFHL----------ECLGLT-----SVPDGKFMCRECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C + G       ++V  C    CG+FYH  CV K       S A      I 
Sbjct: 747 TGQHPCFSCKVPG-------KDVKRCSVGACGKFYHEACVRKF------STA------IF 787

Query: 334 AGESFTCPVHKCFVCQQSEDM-NVEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D+      ++  C RCP AYH
Sbjct: 788 ESKGFRCPQHCCSACSMEKDIYKASKGRMMRCLRCPVAYH 827


>gi|157821603|ref|NP_001099560.1| histone-lysine N-methyltransferase NSD3 [Rattus norvegicus]
 gi|149057818|gb|EDM09061.1| Wolf-Hirschhorn syndrome candidate 1-like 1 (predicted) [Rattus
           norvegicus]
          Length = 1396

 Score = 75.1 bits (183), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 71/160 (44%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCV 273
            VC +C++ GD +  C+G C R FH           + LG     + AVP   F C+ C 
Sbjct: 703 TVCQVCENTGDSLVACEGECCRHFHP----------ECLG-----LAAVPEGRFTCEECE 747

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C  + CG+FYH  CV K      ES          
Sbjct: 748 TGQHPCFSC-------KVSGKDVKRCSVSVCGKFYHEACVRKFPTAIFES---------- 790

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 791 --KGFRCPQHCCSSCSMEKDIHKASKGRMMKCLRCPIAYH 828


>gi|344281620|ref|XP_003412576.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Loxodonta africana]
          Length = 1438

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 70/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
           AVC IC+  GD    C+G C + FH           + LG     + ++P+  F+C  C 
Sbjct: 702 AVCQICESTGDSLIPCEGECCKHFHL----------ECLG-----LASLPDGKFICMECQ 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C + G+       +V  C   TCG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSCKVSGT-------DVKRCSVGTCGKFYHETCVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDM-NVEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D+      ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIYKASKGRMMRCLRCPVAYH 827


>gi|148709230|gb|EDL41176.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_b [Mus
            musculus]
          Length = 2382

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 81/196 (41%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK I++   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1198 RTAASPKEILEEGVEHDPGMSASKKLQVERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1257

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG  +        F+C  C    H CF C       K S ++V  
Sbjct: 1258 FHL----------ECLGLPEMPRG---KFICNECHTGIHTCFVC-------KQSGEDVKR 1297

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F CP+H C  C  +   NV  
Sbjct: 1298 CLLPLCGKFYHEECVQK--YPPT----------VTQNKGFRCPLHICITCHAANPANVSA 1345

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1346 SKGRLMRCVRCPVAYH 1361


>gi|395507428|ref|XP_003758026.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Sarcophilus harrisii]
          Length = 1437

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 68/160 (42%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C + FH             LG     + ++PN  FLC  C 
Sbjct: 702 TVCQICESYGDSLVVCEGECCKHFHLDC----------LG-----LSSLPNGRFLCTECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C + G       ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHTCFSCKVPG-------KDVKRCSVTACGKFYHEACVRKFATAVFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
               F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --RGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPIAYH 827


>gi|118918400|ref|NP_032765.3| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Mus musculus]
          Length = 2691

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 81/196 (41%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK I++   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1507 RTAASPKEILEEGVEHDPGMSASKKLQVERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1566

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG  +        F+C  C    H CF C       K S ++V  
Sbjct: 1567 FH----------LECLGLPEMPRG---KFICNECHTGIHTCFVC-------KQSGEDVKR 1606

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F CP+H C  C  +   NV  
Sbjct: 1607 CLLPLCGKFYHEECVQK--YPPT----------VTQNKGFRCPLHICITCHAANPANVSA 1654

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1655 SKGRLMRCVRCPVAYH 1670


>gi|332825882|ref|XP_003311721.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Pan
           troglodytes]
          Length = 1426

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 70/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIACEGECCKHFHL----------ECLGLT-----SLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|397521375|ref|XP_003830772.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Pan
           paniscus]
          Length = 1426

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 70/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIACEGECCKHFHL----------ECLGLT-----SLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|68565655|sp|O88491.1|NSD1_MOUSE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific; AltName: Full=H3-K36-HMTase; AltName:
            Full=H4-K20-HMTase; AltName: Full=Nuclear
            receptor-binding SET domain-containing protein 1;
            Short=NR-binding SET domain-containing protein
 gi|3329465|gb|AAC40182.1| NSD1 protein [Mus musculus]
          Length = 2588

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 81/196 (41%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK I++   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1404 RTAASPKEILEEGVEHDPGMSASKKLQVERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1463

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG  +        F+C  C    H CF C       K S ++V  
Sbjct: 1464 FHL----------ECLGLPEMPRG---KFICNECHTGIHTCFVC-------KQSGEDVKR 1503

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F CP+H C  C  +   NV  
Sbjct: 1504 CLLPLCGKFYHEECVQK--YPPT----------VTQNKGFRCPLHICITCHAANPANVSA 1551

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1552 SKGRLMRCVRCPVAYH 1567


>gi|397521373|ref|XP_003830771.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Pan
           paniscus]
          Length = 1437

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 70/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIACEGECCKHFHL----------ECLGLT-----SLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|114619713|ref|XP_519713.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 6 [Pan
           troglodytes]
 gi|410226716|gb|JAA10577.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
 gi|410266866|gb|JAA21399.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
 gi|410353855|gb|JAA43531.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Pan troglodytes]
          Length = 1437

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 70/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIACEGECCKHFHL----------ECLGLT-----SLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|395507430|ref|XP_003758027.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
           [Sarcophilus harrisii]
          Length = 1389

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 68/160 (42%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C + FH             LG     + ++PN  FLC  C 
Sbjct: 702 TVCQICESYGDSLVVCEGECCKHFHLDC----------LG-----LSSLPNGRFLCTECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C + G       ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHTCFSCKVPG-------KDVKRCSVTACGKFYHEACVRKFATAVFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
               F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --RGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPIAYH 827


>gi|148709229|gb|EDL41175.1| nuclear receptor-binding SET-domain protein 1, isoform CRA_a [Mus
            musculus]
          Length = 2588

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 81/196 (41%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK I++   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1404 RTAASPKEILEEGVEHDPGMSASKKLQVERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1463

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG  +        F+C  C    H CF C       K S ++V  
Sbjct: 1464 FHL----------ECLGLPEMPRG---KFICNECHTGIHTCFVC-------KQSGEDVKR 1503

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F CP+H C  C  +   NV  
Sbjct: 1504 CLLPLCGKFYHEECVQK--YPPT----------VTQNKGFRCPLHICITCHAANPANVSA 1551

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1552 SKGRLMRCVRCPVAYH 1567


>gi|114619717|ref|XP_001170799.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 4 [Pan
           troglodytes]
          Length = 1388

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 70/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIACEGECCKHFHL----------ECLGLT-----SLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|345315868|ref|XP_001516412.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2,
           partial [Ornithorhynchus anatinus]
          Length = 951

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 71/189 (37%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E   A     VC +C+  G +  C+G C  +FH+    
Sbjct: 445 SESPYESADETQTEASVSSKKSERGAAAKKEYVCQLCEKTGSLLLCEGPCYGAFHSAC-- 502

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG  +        F+C  C    H CF C       K    EV  CV + CG
Sbjct: 503 --------LGLPRRPEG---KFICSECTSGVHSCFVC-------KEKKTEVKRCVVSQCG 544

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMN--VEDLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  S   N      +L  
Sbjct: 545 KFYHEACVRKHHLTVFES------------RGFRCPLHSCVSCHVSNPSNPRASKGKLMR 592

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 593 CVRCPVAYH 601


>gi|397521377|ref|XP_003830773.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 3 [Pan
           paniscus]
          Length = 1388

 Score = 73.9 bits (180), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 70/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIACEGECCKHFHL----------ECLGLT-----SLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|390360513|ref|XP_785219.3| PREDICTED: histone-lysine N-methyltransferase NSD3-like
           [Strongylocentrotus purpuratus]
          Length = 1736

 Score = 73.9 bits (180), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 62/158 (39%), Gaps = 34/158 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             L     Q+    +F C  C+   H
Sbjct: 833 VCQVCERTGQLLLCEGGCCGAFH-------------LDCIGLQVAPSGSFRCDECISGVH 879

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S QEV  C    CG++YH  C+ +      +S              
Sbjct: 880 SCFVC-------KLSDQEVRRCHVPVCGKYYHEGCIRRFPLTRFDS------------RG 920

Query: 338 FTCPVHKCFVC--QQSEDMNVEDLQLAICRRCPKAYHR 373
           FTCP+H C  C     +       +L  C RCP AYH+
Sbjct: 921 FTCPLHACVACFADNPKSTKASRGRLMRCVRCPTAYHQ 958


>gi|402873563|ref|XP_003900641.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Papio anubis]
          Length = 2343

 Score = 73.9 bits (180), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 56/196 (28%), Positives = 77/196 (39%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDEDS----------ERVDKDENYFAVCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + DS          ER         VC  C+  G++  C+ +C  +
Sbjct: 1150 RTAASPKETVEEGVEHDSGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1209

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1210 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1249

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1250 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1297

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1298 SKGRLMRCVRCPVAYH 1313



 Score = 40.0 bits (92), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1184 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1234

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1235 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1264


>gi|395739588|ref|XP_002819060.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Pongo
           abelii]
          Length = 1438

 Score = 73.9 bits (180), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIPCEGECCKHFHL----------ECLGLT-----SLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|119583723|gb|EAW63319.1| Wolf-Hirschhorn syndrome candidate 1-like 1, isoform CRA_a [Homo
           sapiens]
          Length = 992

 Score = 73.9 bits (180), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG     + ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIPCEGECCKHFHL----------ECLG-----LASLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|380815578|gb|AFE79663.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform a [Macaca mulatta]
 gi|383420747|gb|AFH33587.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform a [Macaca mulatta]
          Length = 2426

 Score = 73.9 bits (180), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 56/196 (28%), Positives = 77/196 (39%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDEDS----------ERVDKDENYFAVCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + DS          ER         VC  C+  G++  C+ +C  +
Sbjct: 1236 RTAASPKETVEEGVEHDSGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1295

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1296 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1335

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1336 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1383

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1384 SKGRLMRCVRCPVAYH 1399



 Score = 39.7 bits (91), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1270 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1320

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1321 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1350


>gi|170284946|gb|AAI61082.1| whsc1 protein [Xenopus (Silurana) tropicalis]
          Length = 1028

 Score = 73.6 bits (179), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 66/157 (42%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  GD+  C+G C  +FH +        C  L    A       +LC+ C     
Sbjct: 613 VCQVCEKVGDLMLCEGVCCSAFHLS--------CIGLSTRPAG-----KYLCKECTSGAR 659

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S+++V  C+   CG+FYH  C+ K        LA      +     
Sbjct: 660 SCFLC-------KESNRDVKRCIVPHCGKFYHESCLRKY------PLA------VFESRG 700

Query: 338 FTCPVHKCFVCQQSEDMN--VEDLQLAICRRCPKAYH 372
           F CP+H+C  C  S   N      ++  C RCP AYH
Sbjct: 701 FRCPLHRCATCYFSNPSNPRASKGKMVRCVRCPLAYH 737


>gi|402878017|ref|XP_003902703.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Papio anubis]
          Length = 1438

 Score = 73.6 bits (179), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIPCEGECCKHFHL----------ECLGLT-----SLPDSKFVCMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|297295821|ref|XP_001094467.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Macaca mulatta]
          Length = 2329

 Score = 73.6 bits (179), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 56/196 (28%), Positives = 77/196 (39%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDEDS----------ERVDKDENYFAVCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + DS          ER         VC  C+  G++  C+ +C  +
Sbjct: 1150 RTAASPKETVEEGVEHDSGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1209

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1210 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1249

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1250 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1297

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1298 SKGRLMRCVRCPVAYH 1313



 Score = 39.7 bits (91), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1184 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1234

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1235 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1264


>gi|380814048|gb|AFE78898.1| histone-lysine N-methyltransferase NSD3 isoform long [Macaca
           mulatta]
 gi|383419477|gb|AFH32952.1| histone-lysine N-methyltransferase NSD3 isoform long [Macaca
           mulatta]
          Length = 1438

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIPCEGECCKHFHL----------ECLGLT-----SLPDSKFVCMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|297299251|ref|XP_001091186.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
           [Macaca mulatta]
          Length = 1419

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIPCEGECCKHFHL----------ECLGLT-----SLPDSKFVCMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|355750457|gb|EHH54795.1| hypothetical protein EGM_15701 [Macaca fascicularis]
          Length = 2695

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 56/196 (28%), Positives = 77/196 (39%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDEDS----------ERVDKDENYFAVCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + DS          ER         VC  C+  G++  C+ +C  +
Sbjct: 1505 RTAASPKETVEEGVEHDSGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1564

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1565 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1604

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1605 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1652

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1653 SKGRLMRCVRCPVAYH 1668



 Score = 40.0 bits (92), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1539 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1589

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1590 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1619


>gi|355691890|gb|EHH27075.1| hypothetical protein EGK_17188 [Macaca mulatta]
          Length = 2695

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 56/196 (28%), Positives = 77/196 (39%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDEDS----------ERVDKDENYFAVCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + DS          ER         VC  C+  G++  C+ +C  +
Sbjct: 1505 RTAASPKETVEEGVEHDSGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1564

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1565 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1604

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1605 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1652

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1653 SKGRLMRCVRCPVAYH 1668



 Score = 40.0 bits (92), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1539 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1589

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1590 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1619


>gi|380815580|gb|AFE79664.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform b [Macaca mulatta]
 gi|383420749|gb|AFH33588.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform b [Macaca mulatta]
          Length = 2695

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 56/196 (28%), Positives = 77/196 (39%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDEDS----------ERVDKDENYFAVCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + DS          ER         VC  C+  G++  C+ +C  +
Sbjct: 1505 RTAASPKETVEEGVEHDSGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1564

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1565 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1604

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1605 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1652

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1653 SKGRLMRCVRCPVAYH 1668



 Score = 40.0 bits (92), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1539 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1589

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1590 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1619


>gi|300796853|ref|NP_001178481.1| probable histone-lysine N-methyltransferase NSD2 [Rattus
           norvegicus]
          Length = 1346

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 70/189 (37%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E   A     VC +C+  G +  C+G C  +FH     
Sbjct: 618 SESPYESADETQTEASVSSKKSERGMAAKKEYVCQLCEKTGSLLLCEGPCCGAFHLAC-- 675

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG +Q        F C  C    H CF C       K S  EV  C+   CG
Sbjct: 676 --------LGLSQRPEG---RFTCTECASGIHSCFVC-------KESKMEVKRCMVNQCG 717

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  S   N      ++  
Sbjct: 718 KFYHEACVKKYPLTVFES------------RGFRCPLHSCMSCHASNPSNPRPSKGKMMR 765

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 766 CVRCPVAYH 774


>gi|27477095|ref|NP_758859.1| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform a [Homo sapiens]
 gi|16755530|gb|AAL27991.1|AF380302_1 androgen receptor-associated coregulator 267-a [Homo sapiens]
 gi|119605437|gb|EAW85031.1| nuclear receptor binding SET domain protein 1, isoform CRA_a [Homo
            sapiens]
          Length = 2427

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1237 RTATSPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1296

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1297 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1336

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1337 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1384

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1385 SKGRLMRCVRCPVAYH 1400



 Score = 39.7 bits (91), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1271 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1321

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1322 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1351


>gi|348527922|ref|XP_003451468.1| PREDICTED: hypothetical protein LOC100692734 [Oreochromis niloticus]
          Length = 2421

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 42/157 (26%), Positives = 61/157 (38%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            +C +C+  G++  C+G+C  +FH          C SL            F+C  C    H
Sbjct: 1603 ICQVCEKTGELLLCEGQCCGAFHLP--------CISLAEAPKG-----KFVCPECKSGIH 1649

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C   G       ++V  C+   CG+FYH EC++                 +     
Sbjct: 1650 TCFVCKKRG-------EDVRRCMIPVCGKFYHGECIANYA------------PTVPVNRG 1690

Query: 338  FTCPVHKCFVC--QQSEDMNVEDLQLAICRRCPKAYH 372
            F C +H C  C        N+   +L  C RCP AYH
Sbjct: 1691 FRCSIHVCLTCFIANPNSSNISKGRLVRCVRCPVAYH 1727


>gi|410216828|gb|JAA05633.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410260118|gb|JAA18025.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2428

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1238 RTATSPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1297

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1298 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1337

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1338 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1385

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1386 SKGRLMRCVRCPVAYH 1401



 Score = 39.7 bits (91), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1272 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1322

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1323 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1352


>gi|410303854|gb|JAA30527.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410341931|gb|JAA39912.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2428

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1238 RTATSPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1297

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1298 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1337

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1338 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1385

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1386 SKGRLMRCVRCPVAYH 1401



 Score = 39.7 bits (91), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1272 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1322

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1323 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1352


>gi|395861196|ref|XP_003802879.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Otolemur garnettii]
          Length = 2410

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 56/196 (28%), Positives = 78/196 (39%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDEDS----------ERVDKDENYFAVCAICDDGGDVTFCDGRCLRS 238
            RTT  PK  V+   + DS          ER         VC  C+  G++  C+ +C  +
Sbjct: 1222 RTTGSPKETVEEGIEHDSGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1281

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1282 FH----------LECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1321

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   +V  
Sbjct: 1322 CLLPLCGKFYHEECVQK--YPPT----------VIQNKGFRCSLHICITCHAANPASVSA 1369

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1370 SKGRLMRCVRCPVAYH 1385



 Score = 40.0 bits (92), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1256 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1306

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1307 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1336


>gi|431902251|gb|ELK08752.1| Histone-lysine N-methyltransferase NSD3 [Pteropus alecto]
          Length = 1322

 Score = 73.6 bits (179), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 701 TVCQICESSGDSLIPCEGECYKHFHL----------ECLGLT-----SLPDGKFICMECK 745

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C + G+       +V  C    CG+FYH  C+ K      ES          
Sbjct: 746 TGQHPCFSCKVSGT-------DVKRCSVGACGKFYHEACIRKFPTAIFES---------- 788

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 789 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 826


>gi|149047443|gb|EDM00113.1| similar to Wolf-Hirschhorn syndrome candidate 1 protein isoform 3
           (predicted) [Rattus norvegicus]
          Length = 1298

 Score = 73.2 bits (178), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 70/189 (37%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E   A     VC +C+  G +  C+G C  +FH     
Sbjct: 570 SESPYESADETQTEASVSSKKSERGMAAKKEYVCQLCEKTGSLLLCEGPCCGAFHLAC-- 627

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG +Q        F C  C    H CF C       K S  EV  C+   CG
Sbjct: 628 --------LGLSQRPEG---RFTCTECASGIHSCFVC-------KESKMEVKRCMVNQCG 669

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  S   N      ++  
Sbjct: 670 KFYHEACVKKYPLTVFES------------RGFRCPLHSCMSCHASNPSNPRPSKGKMMR 717

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 718 CVRCPVAYH 726


>gi|187956219|gb|AAI50629.1| Nuclear receptor binding SET domain protein 1 [Homo sapiens]
          Length = 2427

 Score = 73.2 bits (178), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1237 RTATSPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1296

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1297 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1336

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1337 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1384

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1385 SKGRLMRCVRCPVAYH 1400



 Score = 39.7 bits (91), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1271 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1321

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1322 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1351


>gi|432099958|gb|ELK28852.1| Histone-lysine N-methyltransferase NSD3 [Myotis davidii]
          Length = 1641

 Score = 73.2 bits (178), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217  AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
             VC IC+  GD    C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 896  TVCQICESSGDSLIPCEGECYKYFHL----------ECLGVT-----SLPDGKFICMECK 940

Query: 274  YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
              +H CF+C + G+       +V  C    CG+FYH  CV K      ES          
Sbjct: 941  TGKHPCFSCKVSGT-------DVKRCSVGACGKFYHEACVRKFPTAVFES---------- 983

Query: 334  AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
              + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 984  --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 1021


>gi|295424164|ref|NP_001074571.2| histone-lysine N-methyltransferase NSD2 isoform 1 [Mus musculus]
          Length = 1366

 Score = 73.2 bits (178), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 70/189 (37%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E   A     VC +C+  G +  C+G C  +FH     
Sbjct: 638 SESPYESADETQTEASVSSKKSERGMAAKKEYVCQLCEKTGSLLLCEGPCCGAFHLAC-- 695

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG ++        F C  C    H CF C       K S  EV  CV   CG
Sbjct: 696 --------LGLSRRPEG---RFTCTECASGIHSCFVC-------KESKMEVKRCVVNQCG 737

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  S   N      ++  
Sbjct: 738 KFYHEACVKKYPLTVFES------------RGFRCPLHSCMSCHASNPSNPRPSKGKMMR 785

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 786 CVRCPVAYH 794


>gi|295424166|ref|NP_780440.2| histone-lysine N-methyltransferase NSD2 isoform 2 [Mus musculus]
 gi|118572947|sp|Q8BVE8.2|NSD2_MOUSE RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
           Full=Multiple myeloma SET domain-containing protein;
           Short=MMSET; AltName: Full=Nuclear SET domain-containing
           protein 2; Short=NSD2; AltName: Full=Wolf-Hirschhorn
           syndrome candidate 1 protein homolog; Short=WHSC1
          Length = 1365

 Score = 73.2 bits (178), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 70/189 (37%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E   A     VC +C+  G +  C+G C  +FH     
Sbjct: 637 SESPYESADETQTEASVSSKKSERGMAAKKEYVCQLCEKTGSLLLCEGPCCGAFHLAC-- 694

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG ++        F C  C    H CF C       K S  EV  CV   CG
Sbjct: 695 --------LGLSRRPEG---RFTCTECASGIHSCFVC-------KESKMEVKRCVVNQCG 736

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  S   N      ++  
Sbjct: 737 KFYHEACVKKYPLTVFES------------RGFRCPLHSCMSCHASNPSNPRPSKGKMMR 784

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 785 CVRCPVAYH 793


>gi|326432726|gb|EGD78296.1| hypothetical protein PTSG_09362 [Salpingoeca sp. ATCC 50818]
          Length = 1279

 Score = 73.2 bits (178), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 58/189 (30%), Positives = 80/189 (42%), Gaps = 37/189 (19%)

Query: 197 IVDWNEDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGY 256
             D  ED   + V K ++Y   C IC + G++  C+G CL S+H          C  +G 
Sbjct: 662 TTDAGEDSSVD-VTKHDDY---CGICGEAGNLLCCEGGCLSSYH--------LFC--VGL 707

Query: 257 TQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKL 316
           + A   A   F+C  C    H CFAC   G  +   +     C    CG+ YH  C+S  
Sbjct: 708 SCAPQGA---FVCDACTTGNHLCFACEQPGGLEGLQT-----CSVRNCGKKYHRACIS-- 757

Query: 317 LHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
              +N   A  L++      SF CP+HKC  C   +        L  C RCP AYH  C+
Sbjct: 758 ---NNPRAA--LKD-----NSFKCPLHKCANCTYPQ---ASTYPLVRCIRCPIAYHTCCV 804

Query: 377 PTEITFSDA 385
           P      +A
Sbjct: 805 PAGCLHENA 813


>gi|298706866|emb|CBJ25830.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 810

 Score = 73.2 bits (178), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 51/175 (29%), Positives = 75/175 (42%), Gaps = 27/175 (15%)

Query: 218 VCAICDD-----GGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNC 272
           +CA+C         D+  CDG CLRSFH         +C  LG    + +    +LC++C
Sbjct: 226 ICAMCSQLDTPCMSDMVTCDGPCLRSFHV--------VCLDLGEDALKEE---KWLCEDC 274

Query: 273 VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              EH+C+ CG  G  +      VF C   +CG+FYH  CV      +  S+ +   +  
Sbjct: 275 ERAEHECWQCGDYGQDNVVGG--VFRCGVPSCGRFYHRHCVEL----NKNSVVKADVDEG 328

Query: 333 AAGE---SFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSD 384
             G+    F C  H C  C            L  C +CP AYH  C+P +  + +
Sbjct: 329 EDGQPIFKFRCAYHTCDTCCSGRGGTKN--HLYKCIKCPTAYHLNCIPPDARYHE 381


>gi|242080051|ref|XP_002444794.1| hypothetical protein SORBIDRAFT_07g028156 [Sorghum bicolor]
 gi|241941144|gb|EES14289.1| hypothetical protein SORBIDRAFT_07g028156 [Sorghum bicolor]
          Length = 212

 Score = 73.2 bits (178), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 33/54 (61%), Positives = 36/54 (66%)

Query: 822 LSGKSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGHITVE 875
           LSGKSFYLPGSL V +  +EQWN  PPPL LWSR+DW   H  IA    HI  E
Sbjct: 2   LSGKSFYLPGSLGVDNKVMEQWNMSPPPLSLWSRSDWAKRHSDIAKSMRHIPNE 55


>gi|441595720|ref|XP_004087266.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific [Nomascus
            leucogenys]
          Length = 2697

 Score = 73.2 bits (178), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1507 RTAASPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1566

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1567 FHL----------ECLGLTEMPKG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1606

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1607 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1654

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1655 SKGRLMRCVRCPVAYH 1670



 Score = 40.0 bits (92), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1541 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPKGKFICNECRTGI 1591

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1592 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1621


>gi|432873648|ref|XP_004072321.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Oryzias
           latipes]
          Length = 1597

 Score = 73.2 bits (178), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 54/171 (31%), Positives = 72/171 (42%), Gaps = 37/171 (21%)

Query: 206 SERVDKDENYFAVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAV 264
           S +   +E    VC IC+  G+ +  C+G C R FH           + LG     + A 
Sbjct: 840 SRQDSSNEKRDTVCQICEAYGEGLMVCEGDCSRQFHP----------ECLG-----LSAP 884

Query: 265 PN--FLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNE 322
           P+  F C  C    H CF+C       K++ QEV  C +  C  +YH +CV K  HP   
Sbjct: 885 PDGRFTCLECKNSNHPCFSC-------KTTGQEVSRCSAPGCSCYYHEDCVWK--HPGTT 935

Query: 323 SLAEELRERIAAGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
           S         + G  F CP H C  C    D+      +L  C RCP AYH
Sbjct: 936 S---------SPGGGFCCPQHSCSTCCLERDLQRASKGRLIHCIRCPTAYH 977


>gi|162318272|gb|AAI56161.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
 gi|162318442|gb|AAI56968.1| Wolf-Hirschhorn syndrome candidate 1 (human) [synthetic construct]
          Length = 1346

 Score = 73.2 bits (178), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 70/189 (37%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E   A     VC +C+  G +  C+G C  +FH     
Sbjct: 618 SESPYESADETQTEASVSSKKSERGMAAKKEYVCQLCEKTGSLLLCEGPCCGAFHLAC-- 675

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG ++        F C  C    H CF C       K S  EV  CV   CG
Sbjct: 676 --------LGLSRRPEG---RFTCTECASGIHSCFVC-------KESKMEVKRCVVNQCG 717

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  S   N      ++  
Sbjct: 718 KFYHEACVKKYPLTVFES------------RGFRCPLHSCMSCHASNPSNPRPSKGKMMR 765

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 766 CVRCPVAYH 774


>gi|297676794|ref|XP_002816309.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 1 [Pongo abelii]
          Length = 2697

 Score = 73.2 bits (178), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1507 RTAASPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1566

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1567 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1606

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1607 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1654

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1655 SKGRLMRCVRCPVAYH 1670



 Score = 40.0 bits (92), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1541 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1591

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1592 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1621


>gi|19923586|ref|NP_071900.2| histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific isoform b [Homo sapiens]
 gi|32469769|sp|Q96L73.1|NSD1_HUMAN RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific; AltName: Full=Androgen receptor
            coactivator 267 kDa protein; AltName: Full=Androgen
            receptor-associated protein of 267 kDa; AltName:
            Full=H3-K36-HMTase; AltName: Full=H4-K20-HMTase; AltName:
            Full=Lysine N-methyltransferase 3B; AltName: Full=Nuclear
            receptor-binding SET domain-containing protein 1;
            Short=NR-binding SET domain-containing protein
 gi|17530097|gb|AAL40694.1|AF395588_1 putative nuclear protein NSD1 [Homo sapiens]
 gi|16751269|gb|AAL06645.1| androgen receptor associated coregulator 267-b [Homo sapiens]
 gi|119605438|gb|EAW85032.1| nuclear receptor binding SET domain protein 1, isoform CRA_b [Homo
            sapiens]
          Length = 2696

 Score = 73.2 bits (178), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1506 RTATSPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1565

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1566 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1605

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1606 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1653

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1654 SKGRLMRCVRCPVAYH 1669



 Score = 40.0 bits (92), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1540 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1590

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1591 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1620


>gi|426359420|ref|XP_004046973.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Gorilla gorilla
           gorilla]
          Length = 1397

 Score = 72.8 bits (177), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 662 TVCQICESPGDSLIPCEGECCKHFHL----------ECLGLT-----SLPDSKFICMECK 706

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 707 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 749

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 750 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 787


>gi|410216830|gb|JAA05634.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410260120|gb|JAA18026.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2697

 Score = 72.8 bits (177), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1507 RTATSPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1566

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1567 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1606

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1607 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1654

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1655 SKGRLMRCVRCPVAYH 1670



 Score = 39.7 bits (91), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1541 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1591

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1592 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1621


>gi|395736540|ref|XP_003776772.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 2 [Pongo abelii]
          Length = 2594

 Score = 72.8 bits (177), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1404 RTAASPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1463

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1464 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1503

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1504 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1551

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1552 SKGRLMRCVRCPVAYH 1567



 Score = 40.0 bits (92), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1438 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1488

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1489 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1518


>gi|114603589|ref|XP_527132.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 8 [Pan troglodytes]
 gi|397470588|ref|XP_003806901.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Pan paniscus]
 gi|410303856|gb|JAA30528.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
 gi|410341933|gb|JAA39913.1| nuclear receptor binding SET domain protein 1 [Pan troglodytes]
          Length = 2697

 Score = 72.8 bits (177), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1507 RTATSPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1566

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1567 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1606

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1607 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1654

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1655 SKGRLMRCVRCPVAYH 1670



 Score = 39.7 bits (91), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1541 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1591

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1592 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1621


>gi|148705490|gb|EDL37437.1| mCG16344 [Mus musculus]
          Length = 1298

 Score = 72.8 bits (177), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 70/189 (37%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E   A     VC +C+  G +  C+G C  +FH     
Sbjct: 570 SESPYESADETQTEASVSSKKSERGMAAKKEYVCQLCEKTGSLLLCEGPCCGAFHLAC-- 627

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG ++        F C  C    H CF C       K S  EV  CV   CG
Sbjct: 628 --------LGLSRRPEG---RFTCTECASGIHSCFVC-------KESKMEVKRCVVNQCG 669

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  S   N      ++  
Sbjct: 670 KFYHEACVKKYPLTVFES------------RGFRCPLHSCMSCHASNPSNPRPSKGKMMR 717

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 718 CVRCPVAYH 726


>gi|15213542|gb|AAK92049.1|AF322907_1 NSD1 [Homo sapiens]
          Length = 2596

 Score = 72.8 bits (177), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1403 RTATSPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1462

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1463 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1502

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1503 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1550

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1551 SKGRLMRCVRCPVAYH 1566



 Score = 40.0 bits (92), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1437 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1487

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1488 ----------HTCFVCKQS----GEDVKRCLLPLCGKFYHEECV 1517


>gi|417407050|gb|JAA50158.1| Putative histone-lysine n-methyltransferase h3 lysine-36 and h4
            lysine-20 specific [Desmodus rotundus]
          Length = 2699

 Score = 72.8 bits (177), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 79/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   D D     S ++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1509 RTAASPKEAVEEGTDADHGMPASRKLQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1568

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1569 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1608

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   +V  
Sbjct: 1609 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPASVSA 1656

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1657 SKGRLMRCVRCPVAYH 1672



 Score = 39.7 bits (91), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1543 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1593

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1594 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1623


>gi|119605439|gb|EAW85033.1| nuclear receptor binding SET domain protein 1, isoform CRA_c [Homo
            sapiens]
          Length = 2593

 Score = 72.8 bits (177), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1403 RTATSPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1462

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1463 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1502

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1503 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1550

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1551 SKGRLMRCVRCPVAYH 1566



 Score = 39.7 bits (91), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1437 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1487

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1488 ----------HTCFVCKQS----GEDVKRCLLPLCGKFYHEECV 1517


>gi|350580826|ref|XP_003123715.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Sus scrofa]
          Length = 2392

 Score = 72.8 bits (177), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 66/157 (42%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC  C+  G++  C+ +C  +FH           + LG T+        F+C  C    H
Sbjct: 1240 VCQNCEKLGELLLCEAQCCGAFHL----------ECLGLTEMPRG---KFICNECRTGIH 1286

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       K S ++V  C+   CG+FYH ECV K  +P            +   + 
Sbjct: 1287 TCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK--YPPT----------VMQNKG 1327

Query: 338  FTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYH 372
            F C +H C  C  +   NV     +L  C RCP AYH
Sbjct: 1328 FRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1364



 Score = 39.7 bits (91), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1235 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1285

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1286 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1315


>gi|403290056|ref|XP_003936149.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Saimiri boliviensis boliviensis]
          Length = 2697

 Score = 72.8 bits (177), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1507 RTAASPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1566

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1567 FHL----------ECLGLTEMPRG---KFICNECHTGIHTCFVC-------KQSGEDVKR 1606

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1607 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1654

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1655 SKGRLMRCVRCPVAYH 1670



 Score = 39.3 bits (90), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 45/104 (43%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+          L E  R + 
Sbjct: 1541 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLG---------LTEMPRGKF 1582

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               E  T  +H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1583 ICNECHTG-IHTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1621


>gi|92098122|gb|AAI15007.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Homo sapiens]
          Length = 1437

 Score = 72.8 bits (177), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG     + ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIPCEGECCKHFHL----------ECLG-----LASLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|219841942|gb|AAI43511.1| WHSC1L1 protein [Homo sapiens]
          Length = 1426

 Score = 72.8 bits (177), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG     + ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIPCEGECCKHFHL----------ECLG-----LASLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|13699811|ref|NP_075447.1| histone-lysine N-methyltransferase NSD3 isoform long [Homo sapiens]
 gi|74761342|sp|Q9BZ95.1|NSD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD3; AltName:
           Full=Nuclear SET domain-containing protein 3; AltName:
           Full=Protein whistle; AltName: Full=WHSC1-like 1 isoform
           9 with methyltransferase activity to lysine; AltName:
           Full=Wolf-Hirschhorn syndrome candidate 1-like protein
           1; Short=WHSC1-like protein 1
 gi|12642817|gb|AAK00355.1| putative protein WHSC1L1l [Homo sapiens]
 gi|75517229|gb|AAI01718.1| WHSC1L1 protein, isoform long [Homo sapiens]
 gi|109731704|gb|AAI13470.1| Wolf-Hirschhorn syndrome candidate 1-like 1 [Homo sapiens]
 gi|119583726|gb|EAW63322.1| Wolf-Hirschhorn syndrome candidate 1-like 1, isoform CRA_c [Homo
           sapiens]
 gi|168277388|dbj|BAG10672.1| histone-lysine N-methyltransferase NSD3 [synthetic construct]
          Length = 1437

 Score = 72.4 bits (176), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG     + ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIPCEGECCKHFHL----------ECLG-----LASLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|12697312|emb|CAC28350.1| putative chromatin modulator [Homo sapiens]
          Length = 1437

 Score = 72.4 bits (176), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG     + ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIPCEGECCKHFHL----------ECLG-----LASLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|291387890|ref|XP_002710469.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 2
            [Oryctolagus cuniculus]
          Length = 2431

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 81/196 (41%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RTT  PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1240 RTTASPKEAVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1299

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1300 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1339

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   +V  
Sbjct: 1340 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPASVSA 1387

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1388 SKGRLMRCVRCPVAYH 1403



 Score = 39.7 bits (91), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1274 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1324

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1325 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1354


>gi|12697314|emb|CAC28351.1| Putative Chromatin modulator [Homo sapiens]
          Length = 1388

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG     + ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIPCEGECCKHFHL----------ECLG-----LASLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|354483938|ref|XP_003504149.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           1 [Cricetulus griseus]
          Length = 1365

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/189 (29%), Positives = 70/189 (37%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E   A     VC +C+  G +  C+G C  +FH     
Sbjct: 637 SESPYESADETQTEASVSSKKSERGVAAKKEYVCQLCEKTGSLLLCEGPCCGAFHLAC-- 694

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG ++        F C  C    H CF C       K S  EV  CV   CG
Sbjct: 695 --------LGLSRRPEG---RFTCTECASGIHSCFVC-------KESKMEVKRCVVTQCG 736

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  +   N      ++  
Sbjct: 737 KFYHEACVKKYPLTVFES------------RGFRCPLHSCVSCHATNPSNPRPSKGKMMR 784

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 785 CVRCPVAYH 793


>gi|296193510|ref|XP_002806650.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific [Callithrix
            jacchus]
          Length = 2692

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 66/157 (42%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC  C+  G++  C+ +C  +FH           + LG T+        F+C  C    H
Sbjct: 1545 VCQNCEKLGELLLCEAQCCGAFHL----------ECLGLTEMPRG---KFICNECHTGIH 1591

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       K S ++V  C+   CG+FYH ECV K  +P            +   + 
Sbjct: 1592 TCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK--YPPT----------VMQNKG 1632

Query: 338  FTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYH 372
            F C +H C  C  +   NV     +L  C RCP AYH
Sbjct: 1633 FRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYH 1669



 Score = 39.3 bits (90), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 45/104 (43%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+          L E  R + 
Sbjct: 1540 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLG---------LTEMPRGKF 1581

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               E  T  +H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1582 ICNECHTG-IHTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1620


>gi|426256406|ref|XP_004021831.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2 [Ovis
           aries]
          Length = 1439

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG     + + P+  F+C  C 
Sbjct: 703 TVCQICESSGDSLIPCEGECCKHFHL----------ECLG-----LSSPPDGKFVCVECK 747

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S  +V  C  +TCG+FYH  CV K      ES          
Sbjct: 748 TGQHPCFSC-------KVSGMDVKRCSVSTCGKFYHEACVRKFPTAIFES---------- 790

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 791 --KGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPVAYH 828


>gi|291387888|ref|XP_002710468.1| PREDICTED: nuclear receptor binding SET domain protein 1 isoform 1
            [Oryctolagus cuniculus]
          Length = 2700

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 81/196 (41%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RTT  PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1509 RTTASPKEAVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1568

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1569 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1608

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   +V  
Sbjct: 1609 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPASVSA 1656

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1657 SKGRLMRCVRCPVAYH 1672



 Score = 39.7 bits (91), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1543 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1593

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1594 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1623


>gi|344244292|gb|EGW00396.1| putative histone-lysine N-methyltransferase NSD2 [Cricetulus
           griseus]
          Length = 1344

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/189 (29%), Positives = 70/189 (37%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E   A     VC +C+  G +  C+G C  +FH     
Sbjct: 616 SESPYESADETQTEASVSSKKSERGVAAKKEYVCQLCEKTGSLLLCEGPCCGAFHLAC-- 673

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG ++        F C  C    H CF C       K S  EV  CV   CG
Sbjct: 674 --------LGLSRRPEG---RFTCTECASGIHSCFVC-------KESKMEVKRCVVTQCG 715

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  +   N      ++  
Sbjct: 716 KFYHEACVKKYPLTVFES------------RGFRCPLHSCVSCHATNPSNPRPSKGKMMR 763

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 764 CVRCPVAYH 772


>gi|348571625|ref|XP_003471596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           1 [Cavia porcellus]
          Length = 1366

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 69/189 (36%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E   A     VC +C+  G +  C+G C  +FH     
Sbjct: 637 SESPYESADETQTEVSISSKKSERGVAAKKEYVCQLCEKTGSLLLCEGPCCGAFHLAC-- 694

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG  +        F C  C    H CF C       K S  EV  CV   CG
Sbjct: 695 --------LGLPRRPEG---KFTCMECTSGIHSCFVC-------KESKAEVKRCVVTQCG 736

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  S   N      ++  
Sbjct: 737 KFYHEACVRKFPLTVFES------------RGFRCPLHSCVSCHASNPSNPRPSKGKMMR 784

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 785 CVRCPVAYH 793


>gi|348527268|ref|XP_003451141.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like
           [Oreochromis niloticus]
          Length = 1605

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 63/227 (27%), Positives = 89/227 (39%), Gaps = 41/227 (18%)

Query: 156 REAAERDKDLANSKFLL-----SFMANHPETNTFHEDVRTTEKPKVIVDWNEDEDSERVD 210
           R  A  D ++A+S++       S   N P+   F + + +           +  DS    
Sbjct: 789 RSRASTDVEMASSQYRDTSDSDSRGLNDPQVGLFGKSLDSPAAADADASDTQSVDSGLSR 848

Query: 211 KDENYF---AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN 266
           +D +      VC IC+  GD +  C+G C R FH           + LG +         
Sbjct: 849 QDSSTSKRDTVCQICEVYGDGLMVCEGDCNRQFHM----------ECLGLSSPPEG---R 895

Query: 267 FLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAE 326
           F C  C    H CF+C  +G       +EV  C  + CG +YH +CV KL  P   S   
Sbjct: 896 FTCAECRTGNHPCFSCKTVG-------REVTRCSVSGCGCYYHEDCVRKL--PGTTS--- 943

Query: 327 ELRERIAAGESFTCPVHKCFVCQQSED-MNVEDLQLAICRRCPKAYH 372
                 + G  F+CP H C  C    D       +L  C RCP AYH
Sbjct: 944 ------SPGGGFSCPQHSCSTCCLERDPQRASKGRLIRCIRCPLAYH 984


>gi|291409090|ref|XP_002720827.1| PREDICTED: WHSC1L1 protein [Oryctolagus cuniculus]
          Length = 1435

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD +  C+G C   FH           + LG     + ++P+  F+C  C 
Sbjct: 699 TVCQICESSGDSLIACEGECCTHFHL----------ECLG-----LPSLPDGKFICVECQ 743

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C M G       ++V  C    CG+FYH  CV K      ES          
Sbjct: 744 TGQHPCFSCKMSG-------KDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 786

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 787 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 824


>gi|348571627|ref|XP_003471597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           2 [Cavia porcellus]
          Length = 1367

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 69/189 (36%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E   A     VC +C+  G +  C+G C  +FH     
Sbjct: 638 SESPYESADETQTEVSISSKKSERGVAAKKEYVCQLCEKTGSLLLCEGPCCGAFHLAC-- 695

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG  +        F C  C    H CF C       K S  EV  CV   CG
Sbjct: 696 --------LGLPRRPEG---KFTCMECTSGIHSCFVC-------KESKAEVKRCVVTQCG 737

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  S   N      ++  
Sbjct: 738 KFYHEACVRKFPLTVFES------------RGFRCPLHSCVSCHASNPSNPRPSKGKMMR 785

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 786 CVRCPVAYH 794


>gi|301614673|ref|XP_002936809.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
           [Xenopus (Silurana) tropicalis]
          Length = 1298

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 66/157 (42%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  GD+  C+G C  +FH +        C  L    A       +LC+ C     
Sbjct: 613 VCQVCEKVGDLMLCEGVCCSAFHLS--------CIGLSTRPAG-----KYLCKECTSGAR 659

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S+++V  C+   CG+FYH  C+ K        LA      +     
Sbjct: 660 SCFLC-------KESNRDVKRCIVPHCGKFYHESCLRKY------PLA------VFESRG 700

Query: 338 FTCPVHKCFVCQQSEDMN--VEDLQLAICRRCPKAYH 372
           F CP+H+C  C  S   N      ++  C RCP AYH
Sbjct: 701 FRCPLHRCATCYFSNPSNPRASKGKMVRCVRCPLAYH 737


>gi|449474840|ref|XP_002193971.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20 specific [Taeniopygia guttata]
          Length = 1651

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 66/157 (42%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC IC+  G++  C+ +C  +FH           Q LG ++        F+C  C    H
Sbjct: 274 VCQICEKPGELLLCEAQCCGAFHL----------QCLGLSEMPKG---KFICNECSTGVH 320

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       KS  ++V  C+   CG++YH  C+ K  +P            +   + 
Sbjct: 321 TCFVC-------KSCGEDVKRCLLPLCGKYYHEACIQK--YPPT----------VMQNKG 361

Query: 338 FTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYH 372
           F C +H C  C  +   N+     +L  C RCP AYH
Sbjct: 362 FRCSLHICMTCHAANPANISASKGRLMRCVRCPVAYH 398


>gi|414884835|tpg|DAA60849.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 170

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 57/92 (61%), Gaps = 2/92 (2%)

Query: 567 PSVDAEMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKA 626
           P VD+E EK ++ ++ +  SS   ++  ++  +P TH +  +  L   I LG +E SV+A
Sbjct: 76  PKVDSETEKRVITILGNKVSSLTIKDVTRNCSLPSTHVYSGR--LTDKIPLGKIERSVQA 133

Query: 627 VRAALEMLDGGCDIEDAKAVCPPEILCQIFQW 658
           V AAL+ L+ G  + DAKAVC PE+L Q+ +W
Sbjct: 134 VEAALKKLENGGSVNDAKAVCEPEVLRQLARW 165


>gi|16549858|dbj|BAB70868.1| unnamed protein product [Homo sapiens]
          Length = 1059

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189 RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
           RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 469 RTATSPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 528

Query: 239 FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
           FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 529 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 568

Query: 299 CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
           C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 569 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 616

Query: 357 EDLQLAICRRCPKAYH 372
              +L  C RCP AYH
Sbjct: 617 SKGRLMRCVRCPVAYH 632


>gi|118101386|ref|XP_001232891.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Gallus gallus]
          Length = 1436

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 49/159 (30%), Positives = 68/159 (42%), Gaps = 38/159 (23%)

Query: 218 VCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCVY 274
           VC IC+  G+ +  C+G C  +FH           + LG     + A+P   F C  C  
Sbjct: 701 VCQICESSGESLVSCEGECCSTFHM----------ECLG-----LKAMPEEKFFCTECKN 745

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
            EH CF+C + G       ++V  C  + CG+FYH  CV K      ES           
Sbjct: 746 GEHTCFSCKLPG-------KDVKRCSVSACGKFYHEACVRKFATALFES----------- 787

Query: 335 GESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
              F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 788 -RGFRCPQHCCSACSVDKDIHKASKGRMVRCFRCPIAYH 825


>gi|395543169|ref|XP_003773493.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2 [Sarcophilus harrisii]
          Length = 1464

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 63/157 (40%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G++  C+G C  +FH +           LG ++        F+C  C    H
Sbjct: 670 VCQLCEKSGNLLLCEGPCYGAFHLSC----------LGLSRRPEG---KFICSECTSGIH 716

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K    EV  CV + CG+FYH  CV K      ES              
Sbjct: 717 SCFVC-------KEKKTEVKRCVVSQCGKFYHEACVKKYHLTVFES------------RG 757

Query: 338 FTCPVHKCFVCQQSEDMNVEDL--QLAICRRCPKAYH 372
           F CP+H C  C  +   N      ++  C RCP AYH
Sbjct: 758 FRCPLHSCVSCHVTNPSNPRSSKGKMMRCVRCPVAYH 794


>gi|126332220|ref|XP_001374612.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
           [Monodelphis domestica]
          Length = 1366

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 63/157 (40%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G++  C+G C  +FH +           LG ++        F+C  C    H
Sbjct: 670 VCQLCEKSGNLLLCEGPCYGAFHLSC----------LGLSRRPEG---KFICSECTSGIH 716

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K    EV  CV + CG+FYH  CV K      ES              
Sbjct: 717 SCFVC-------KEKKTEVKRCVVSQCGKFYHEACVKKYHLTVFES------------RG 757

Query: 338 FTCPVHKCFVCQQSEDMNVEDL--QLAICRRCPKAYH 372
           F CP+H C  C  +   N      ++  C RCP AYH
Sbjct: 758 FRCPLHSCVSCHVTNPSNPRSSKGKMMRCVRCPVAYH 794


>gi|380791865|gb|AFE67808.1| putative histone-lysine N-methyltransferase NSD2 isoform 1, partial
           [Macaca mulatta]
          Length = 949

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 715

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV   CG+FYH  CV K      ES              
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 756

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|325180900|emb|CCA15310.1| histonelysine Nmethyltransferase putative [Albugo laibachii Nc14]
          Length = 1258

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 88/370 (23%), Positives = 148/370 (40%), Gaps = 90/370 (24%)

Query: 49  QVFLHGNIGDGPWNFNEQVIAWKFELSYAQPEIWVLSKQNNWIMLQSPKKSFKNIVRTIL 108
           QV      GD     N+++  W         ++W++ ++ + I  +S   S ++++R   
Sbjct: 157 QVLKMLQAGDCSDVVNDKIRNW------LAAKVWIIRREESCI--RSVMLSLEHLLRKYC 208

Query: 109 ITVHWLYCVKQNPEA------SRKYIREQMLKVFCSYEVEPSENDLLDHMPISREAAERD 162
             +H   C+ +  +       +RK   EQ++K+  + ++EP E   L    +    +E  
Sbjct: 209 TYLHIPKCLTRAAKVWKFGNITRK---EQIVKLIKNIKMEPIE---LKENNVQLLESEST 262

Query: 163 KDLANSKFLLSFMANHPETNTFHEDVRTTEKPKVI--VDWNEDE-------DSERVDKDE 213
             + N K L                 R + K K+   +  N ++       +S R +   
Sbjct: 263 DPITNGKML----------------TRASRKAKLFCAIPQNSNQCLVQQRPNSTREEALR 306

Query: 214 NYF--AVCAIC----DDGGD--VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP 265
           N F   +C IC    D+ G   V +C   C  ++H +           LG T    D+  
Sbjct: 307 NTFDEELCGICCELEDEEGSNLVVWCKKSCYTAYHLS----------CLGLTDEFDDS-- 354

Query: 266 NFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECV-----------S 314
            + C+ C  +E  CFACG  GS D+     VF C S  C +FYH  CV            
Sbjct: 355 EWKCEQCENREQICFACGRNGSIDERGG--VFKCCSQRCHKFYHYSCVEGCRRTRFYGSK 412

Query: 315 KLLHP------DNESLAEELRER-IAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRC 367
           +  +P      DN S+  +  E        F CP H C VC+ ++       +L  C +C
Sbjct: 413 RKRNPQIRKESDNLSMESDTNEEDFKYKFKFRCPRHICAVCEDAKSS-----ELMFCIKC 467

Query: 368 PKAYHRKCLP 377
           P++YH  C+P
Sbjct: 468 PESYHTSCVP 477


>gi|340500320|gb|EGR27208.1| myb domain protein [Ichthyophthirius multifiliis]
          Length = 1057

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 51/184 (27%), Positives = 78/184 (42%), Gaps = 32/184 (17%)

Query: 215 YFAVCAICDDGGDVTFCDGRCLRSFH----ATITAGKNA-------LCQSLGYTQAQIDA 263
           Y  +C++C+      FC   C RSFH     TI   +N          + L +   Q+  
Sbjct: 699 YKKLCSLCEQPKCTFFCQSFCKRSFHEQCKETIEQKQNLKEKADIYYPKELNFEDLQLQQ 758

Query: 264 VPN--FLCQNCVYQEHQCFAC---GMLGSSDKSSSQE---VFPCVSATCGQFYHPECVSK 315
           + N  ++C +C      CF C   G+  ++ K   Q+   V  C +A C ++YH  C+  
Sbjct: 759 MTNNKYICIDCQGNMVICFICKNYGIYNNNQKLKKQKNDSVNKCSTANCNKYYHLNCIKN 818

Query: 316 LLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKC 375
                    A   +   +  + F C +H C  C QS D  V    ++ C +CPKAYH  C
Sbjct: 819 ---------APLFKFIDSNNKRFRCSLHYCASCTQSGDSKV----ISQCIKCPKAYHLGC 865

Query: 376 LPTE 379
           L  E
Sbjct: 866 LNGE 869


>gi|395857586|ref|XP_003801172.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2 [Otolemur garnettii]
          Length = 1371

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 61/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH +           LG ++        F C  C    H
Sbjct: 669 VCQLCEKTGSLLLCEGPCCGAFHLSC----------LGLSRRPEG---RFTCSECTSGIH 715

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV   CG+FYH  CV K      ES              
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHETCVRKFPLTVFES------------RG 756

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPANPRPSKGKMMRCVRCPVAYH 793


>gi|326932813|ref|XP_003212507.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 1
           [Meleagris gallopavo]
          Length = 1436

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 49/159 (30%), Positives = 68/159 (42%), Gaps = 38/159 (23%)

Query: 218 VCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCVY 274
           VC IC+  G+ +  C+G C  +FH           + LG     + A+P   F C  C  
Sbjct: 701 VCQICESSGESLVSCEGECCSTFHM----------ECLG-----LKAMPEEKFFCTECKN 745

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
            EH CF+C + G       ++V  C  + CG+FYH  CV K      ES           
Sbjct: 746 GEHTCFSCKLPG-------KDVKRCSVSACGKFYHEACVRKFATALFES----------- 787

Query: 335 GESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
              F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 788 -RGFRCPQHCCSACSVDKDIHKASKGRMVRCFRCPIAYH 825


>gi|326932815|ref|XP_003212508.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like isoform 2
           [Meleagris gallopavo]
          Length = 1386

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 49/159 (30%), Positives = 68/159 (42%), Gaps = 38/159 (23%)

Query: 218 VCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCVY 274
           VC IC+  G+ +  C+G C  +FH           + LG     + A+P   F C  C  
Sbjct: 701 VCQICESSGESLVSCEGECCSTFHM----------ECLG-----LKAMPEEKFFCTECKN 745

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
            EH CF+C + G       ++V  C  + CG+FYH  CV K      ES           
Sbjct: 746 GEHTCFSCKLPG-------KDVKRCSVSACGKFYHEACVRKFATALFES----------- 787

Query: 335 GESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
              F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 788 -RGFRCPQHCCSACSVDKDIHKASKGRMVRCFRCPIAYH 825


>gi|118101388|ref|XP_424390.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
           [Gallus gallus]
          Length = 1386

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 49/159 (30%), Positives = 68/159 (42%), Gaps = 38/159 (23%)

Query: 218 VCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCVY 274
           VC IC+  G+ +  C+G C  +FH           + LG     + A+P   F C  C  
Sbjct: 701 VCQICESSGESLVSCEGECCSTFHM----------ECLG-----LKAMPEEKFFCTECKN 745

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
            EH CF+C + G       ++V  C  + CG+FYH  CV K      ES           
Sbjct: 746 GEHTCFSCKLPG-------KDVKRCSVSACGKFYHEACVRKFATALFES----------- 787

Query: 335 GESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
              F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 788 -RGFRCPQHCCSACSVDKDIHKASKGRMVRCFRCPIAYH 825


>gi|344265319|ref|XP_003404732.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Loxodonta africana]
          Length = 2702

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  ++   + D     S+R+  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1510 RTAASPKEAIEEGVEHDHGMPVSKRMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1569

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1570 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1609

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   +V  
Sbjct: 1610 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPASVTA 1657

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1658 SKGRLMRCVRCPVAYH 1673



 Score = 39.3 bits (90), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1544 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1594

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1595 ----------HTCFVCKQS----GEDVKRCLLPLCGKFYHEECV 1624


>gi|449270866|gb|EMC81514.1| Histone-lysine N-methyltransferase NSD3 [Columba livia]
          Length = 1440

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 70/159 (44%), Gaps = 38/159 (23%)

Query: 218 VCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCVY 274
           VC IC+  G+ +  C+G C   FH           + LG     + ++P+  F+C  C  
Sbjct: 693 VCQICESSGESLVSCEGECCGVFHP----------ECLG-----LKSLPDEKFICAECKN 737

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
            EH CF+C + G       ++V  C  +TCG+FYH  CV K      ES           
Sbjct: 738 GEHTCFSCKLPG-------KDVKRCSVSTCGKFYHEACVRKFATALFES----------- 779

Query: 335 GESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
              F CP H C  C   +D++     ++  C RCP A+H
Sbjct: 780 -RGFRCPQHCCTACSMDKDIHKASKGRMVRCLRCPVAFH 817


>gi|345798392|ref|XP_536224.3| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2 [Canis lupus familiaris]
          Length = 1364

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 668 VCQLCEKTGSLVLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 714

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV + CG+FYH  CV K        LA      +     
Sbjct: 715 SCFVC-------KESKTDVKRCVVSQCGKFYHEACVRKY------PLA------VFESRG 755

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 756 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 792


>gi|348574862|ref|XP_003473209.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific-like [Cavia
            porcellus]
          Length = 2509

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 46/157 (29%), Positives = 66/157 (42%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC  C+  G++  C+ +C  +FH           + LG T+        F+C  C    H
Sbjct: 1358 VCQNCEKVGELLLCEAQCCGAFHL----------ECLGLTEMPRG---KFICNECRTGIH 1404

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       K S ++V  C+   CG+FYH ECV K  +P            +   + 
Sbjct: 1405 TCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK--YPPT----------VMQNKG 1445

Query: 338  FTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYH 372
            F C +H C  C  +   +V     +L  C RCP AYH
Sbjct: 1446 FRCSLHICITCHAANPASVSASKGRLMRCVRCPVAYH 1482


>gi|76666643|ref|XP_613048.2| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           1 [Bos taurus]
 gi|297476142|ref|XP_002688498.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Bos
           taurus]
 gi|296486298|tpg|DAA28411.1| TPA: Wolf-Hirschhorn syndrome candidate 1 [Bos taurus]
          Length = 1365

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 62/157 (39%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C++ G +  C+G C  +FH             LG ++         LC  C    H
Sbjct: 669 VCQLCEEPGSLVLCEGPCCGAFHLAC----------LGLSRRPEG---RLLCGECTSGIH 715

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV + CG+FYH  CV +      ES              
Sbjct: 716 SCFVC-------KESKSDVKRCVVSQCGKFYHEACVRRFPLTVFES------------RG 756

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 757 FRCPLHSCLSCHASNPSNPRPSKGKMLRCVRCPVAYH 793


>gi|73953273|ref|XP_865778.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific isoform 5 [Canis lupus familiaris]
          Length = 2698

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 81/196 (41%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+ + + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1507 RTAASPKETVEESVENDHGMPASKKLQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1566

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1567 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1606

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   +V  
Sbjct: 1607 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPASVSA 1654

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1655 SKGRLMRCVRCPVAYH 1670



 Score = 39.7 bits (91), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1541 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1591

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1592 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1621


>gi|301785552|ref|XP_002928188.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific-like [Ailuropoda melanoleuca]
 gi|281342107|gb|EFB17691.1| hypothetical protein PANDA_018107 [Ailuropoda melanoleuca]
          Length = 2699

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1510 RTAASPKETVEEGVENDHGMPASKKLQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1569

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1570 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1609

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   +V  
Sbjct: 1610 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPASVSA 1657

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1658 SKGRLMRCVRCPVAYH 1673



 Score = 39.7 bits (91), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1544 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1594

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1595 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1624


>gi|301762334|ref|XP_002916587.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2-like [Ailuropoda melanoleuca]
          Length = 1364

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 61/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 668 VCQLCEKTGSLVLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 714

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV + CG+FYH  CV K      ES              
Sbjct: 715 SCFVC-------KESKTDVKRCVVSQCGKFYHEACVRKYPLTVFES------------RG 755

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 756 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 792


>gi|410949106|ref|XP_003981265.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Felis catus]
          Length = 2432

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1241 RTAASPKDTVEEGVENDHGMPASKKLQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1300

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1301 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1340

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   +V  
Sbjct: 1341 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPASVSA 1388

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1389 SKGRLMRCVRCPVAYH 1404



 Score = 39.7 bits (91), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1275 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1325

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1326 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1355


>gi|417406466|gb|JAA49891.1| Putative histone-lysine n-methyltransferase nsd3-like isoform 3
           [Desmodus rotundus]
          Length = 1438

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 67/160 (41%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG        +P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIPCEGECYKYFHL----------ECLGSP-----PLPDGKFICVECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C + G+       EV  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSCKVPGT-------EVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|351708443|gb|EHB11362.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Heterocephalus glaber]
          Length = 2698

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 46/157 (29%), Positives = 66/157 (42%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC  C+  G++  C+ +C  +FH           + LG T+        F+C  C    H
Sbjct: 1545 VCQNCEKVGELLLCEAQCCGAFHL----------ECLGLTEMPRG---KFICNECRTGIH 1591

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       K S ++V  C+   CG+FYH ECV K  +P            +   + 
Sbjct: 1592 TCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK--YPPT----------VMQNKG 1632

Query: 338  FTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYH 372
            F C +H C  C  +   +V     +L  C RCP AYH
Sbjct: 1633 FRCSLHICITCHAANPASVSASKGRLMRCVRCPVAYH 1669


>gi|440907576|gb|ELR57709.1| Histone-lysine N-methyltransferase NSD3 [Bos grunniens mutus]
          Length = 1446

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG     + + P+  F+C  C 
Sbjct: 703 TVCQICESSGDSLIPCEGECCKHFHL----------ECLG-----LSSPPDGKFVCVECK 747

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C + G+       +V  C  + CG+FYH  CV K      ES          
Sbjct: 748 TGQHPCFSCKVSGA-------DVKRCSVSACGKFYHEACVRKFPTAIFES---------- 790

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 791 --KGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPVAYH 828


>gi|384253423|gb|EIE26898.1| hypothetical protein COCSUDRAFT_59402 [Coccomyxa subellipsoidea
           C-169]
          Length = 534

 Score = 70.5 bits (171), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 52/177 (29%), Positives = 68/177 (38%), Gaps = 44/177 (24%)

Query: 208 RVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNF 267
           R+  D N F  C  C DGG++  CDG CLRSFH                           
Sbjct: 136 RLKTDWNAF-FCGCCGDGGELLECDGMCLRSFH--------------------------- 167

Query: 268 LCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEE 327
             QNC+    +          +   +  + PC   +CG ++H  C++ L       +  E
Sbjct: 168 --QNCLAPSER------PNPENPPETPCILPCKMGSCGWYFHNACLNGLRDSGLLKIHRE 219

Query: 328 LRERIAAGES----FTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEI 380
                A GE     FTCP H C VC  S       LQ   C RCP AYH  C+P  +
Sbjct: 220 DPGVAAGGEDGQRVFTCPAHFCHVCGNS-GAGRHTLQ---CWRCPTAYHATCVPAGV 272


>gi|358419376|ref|XP_003584219.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Bos
           taurus]
          Length = 1439

 Score = 70.5 bits (171), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG     + + P+  F+C  C 
Sbjct: 703 TVCQICESSGDSLIPCEGECCKHFHL----------ECLG-----LSSPPDGKFVCVECK 747

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C + G+       +V  C  + CG+FYH  CV K      ES          
Sbjct: 748 TGQHPCFSCKVSGA-------DVKRCSVSACGKFYHEACVRKFPTAVFES---------- 790

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 791 --KGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPVAYH 828


>gi|37360238|dbj|BAC98097.1| mKIAA1090 protein [Mus musculus]
          Length = 857

 Score = 70.5 bits (171), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 70/189 (37%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E   A     VC +C+  G +  C+G C  +FH     
Sbjct: 129 SESPYESADETQTEASVSSKKSERGMAAKKEYVCQLCEKTGSLLLCEGPCCGAFHLAC-- 186

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG ++        F C  C    H CF C       K S  EV  CV   CG
Sbjct: 187 --------LGLSRRPEG---RFTCTECASGIHSCFVC-------KESKMEVKRCVVNQCG 228

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  S   N      ++  
Sbjct: 229 KFYHEACVKKYPLTVFES------------RGFRCPLHSCMSCHASNPSNPRPSKGKMMR 276

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 277 CVRCPVAYH 285


>gi|375294191|ref|NP_001069063.2| histone-lysine N-methyltransferase NSD3 [Bos taurus]
 gi|359080579|ref|XP_003588019.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1 [Bos
           taurus]
          Length = 1439

 Score = 70.1 bits (170), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG     + + P+  F+C  C 
Sbjct: 703 TVCQICESSGDSLIPCEGECCKHFHL----------ECLG-----LSSPPDGKFVCVECK 747

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C + G+       +V  C  + CG+FYH  CV K      ES          
Sbjct: 748 TGQHPCFSCKVSGA-------DVKRCSVSACGKFYHEACVRKFPTAVFES---------- 790

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 791 --KGFRCPQHCCSACSMEKDIHKASKGRMVRCLRCPVAYH 828


>gi|119895257|ref|XP_592234.3| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific, partial [Bos taurus]
          Length = 2389

 Score = 70.1 bits (170), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1200 RTAASPKETVEEGVENDHGMPASKKLQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1259

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1260 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1299

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   +V  
Sbjct: 1300 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICTTCHAANPASVSA 1347

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1348 SKGRLMRCVRCPVAYH 1363



 Score = 39.3 bits (90), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1234 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1284

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1285 ----------HTCFVCKQS----GEDVKRCLLPLCGKFYHEECV 1314


>gi|440898362|gb|ELR49876.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Bos grunniens mutus]
          Length = 2698

 Score = 70.1 bits (170), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1509 RTAASPKETVEEGVENDHGMPASKKLQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1568

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1569 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1608

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   +V  
Sbjct: 1609 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICTTCHAANPASVSA 1656

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1657 SKGRLMRCVRCPVAYH 1672



 Score = 39.7 bits (91), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1543 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1593

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1594 ----------HTCFVCKQS----GEDVKRCLLPLCGKFYHEECV 1623


>gi|28204960|gb|AAH46473.1| Whsc1 protein, partial [Mus musculus]
          Length = 851

 Score = 70.1 bits (170), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 70/189 (37%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E   A     VC +C+  G +  C+G C  +FH     
Sbjct: 123 SESPYESADETQTEASVSSKKSERGMAAKKEYVCQLCEKTGSLLLCEGPCCGAFHLAC-- 180

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG ++        F C  C    H CF C       K S  EV  CV   CG
Sbjct: 181 --------LGLSRRPEG---RFTCTECASGIHSCFVC-------KESKMEVKRCVVNQCG 222

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  S   N      ++  
Sbjct: 223 KFYHEACVKKYPLTVFES------------RGFRCPLHSCMSCHASNPSNPRPSKGKMMR 270

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 271 CVRCPVAYH 279


>gi|427798749|gb|JAA64826.1| Putative histone-lysine n-methyltransferase nsd2, partial
           [Rhipicephalus pulchellus]
          Length = 755

 Score = 69.7 bits (169), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 64/162 (39%), Gaps = 36/162 (22%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           +C IC+  G    C G C  SFH             +G TQ   DA   FLC  C    H
Sbjct: 270 LCLICEATGATLTCTGPCRMSFHL----------DCIGITQPP-DA---FLCDECTTGSH 315

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            C  C    ++ K S ++        CG FYH  C+  L  P  E             +S
Sbjct: 316 SCLVCKGTEATQKCSMEQ--------CGTFYHISCLKTLPLPIKE-------------DS 354

Query: 338 FTCPVHKCFVCQQSEDMNVE-DLQLAICRRCPKAYHRKCLPT 378
             CP+H C  C Q +   V    +L  C +CP A+H  CLP 
Sbjct: 355 LVCPLHFCLPCLQQKPRAVSMKGRLLRCVQCPSAFHSGCLPA 396


>gi|355557406|gb|EHH14186.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca mulatta]
          Length = 1365

 Score = 69.7 bits (169), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 715

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV   CG+FYH  CV K      ES              
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 756

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|383421363|gb|AFH33895.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
           mulatta]
 gi|384949270|gb|AFI38240.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
           mulatta]
 gi|387540940|gb|AFJ71097.1| putative histone-lysine N-methyltransferase NSD2 isoform 1 [Macaca
           mulatta]
          Length = 1365

 Score = 69.7 bits (169), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 715

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV   CG+FYH  CV K      ES              
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 756

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|355744804|gb|EHH49429.1| Putative histone-lysine N-methyltransferase NSD2 [Macaca
           fascicularis]
          Length = 1365

 Score = 69.7 bits (169), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 715

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV   CG+FYH  CV K      ES              
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 756

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|297672976|ref|XP_002814554.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2 [Pongo abelii]
          Length = 1365

 Score = 69.7 bits (169), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 715

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV   CG+FYH  CV K      ES              
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 756

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|19913348|ref|NP_579877.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|19913350|ref|NP_579878.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|19913358|ref|NP_579890.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|109633019|ref|NP_001035889.1| histone-lysine N-methyltransferase NSD2 isoform 1 [Homo sapiens]
 gi|74706096|sp|O96028.1|NSD2_HUMAN RecName: Full=Histone-lysine N-methyltransferase NSD2; AltName:
           Full=Multiple myeloma SET domain-containing protein;
           Short=MMSET; AltName: Full=Nuclear SET domain-containing
           protein 2; Short=NSD2; AltName: Full=Protein
           trithorax-5; AltName: Full=Wolf-Hirschhorn syndrome
           candidate 1 protein; Short=WHSC1
 gi|3249713|gb|AAC24150.1| MMSET type II [Homo sapiens]
 gi|4378019|gb|AAD19343.1| putative WHSC1 protein [Homo sapiens]
 gi|4521954|gb|AAD21770.1| putative WHSC1 protein [Homo sapiens]
 gi|4521955|gb|AAD21771.1| putative WHSC1 protein [Homo sapiens]
 gi|5123789|emb|CAB45386.1| TRX5 protein [Homo sapiens]
 gi|6683809|gb|AAF23370.1| MMSET type II [Homo sapiens]
 gi|119602958|gb|EAW82552.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
 gi|119602959|gb|EAW82553.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
 gi|119602962|gb|EAW82556.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_e [Homo sapiens]
 gi|168273154|dbj|BAG10416.1| histone-lysine N-methyltransferase NSD2 [synthetic construct]
 gi|187252511|gb|AAI66668.1| Wolf-Hirschhorn syndrome candidate 1 [synthetic construct]
          Length = 1365

 Score = 69.7 bits (169), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 715

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV   CG+FYH  CV K      ES              
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 756

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|114592860|ref|XP_001146084.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           6 [Pan troglodytes]
 gi|114592864|ref|XP_001146248.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           7 [Pan troglodytes]
 gi|114592866|ref|XP_001146323.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           8 [Pan troglodytes]
 gi|114592870|ref|XP_001146473.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           10 [Pan troglodytes]
 gi|397483594|ref|XP_003812984.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Pan
           paniscus]
 gi|410227780|gb|JAA11109.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
 gi|410259494|gb|JAA17713.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
 gi|410299310|gb|JAA28255.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
 gi|410334709|gb|JAA36301.1| Wolf-Hirschhorn syndrome candidate 1 [Pan troglodytes]
          Length = 1365

 Score = 69.7 bits (169), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 715

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV   CG+FYH  CV K      ES              
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 756

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|426343599|ref|XP_004038381.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           1 [Gorilla gorilla gorilla]
          Length = 1365

 Score = 69.7 bits (169), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 715

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV   CG+FYH  CV K      ES              
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 756

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|426229361|ref|XP_004008759.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Ovis aries]
          Length = 2698

 Score = 69.7 bits (169), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 46/157 (29%), Positives = 66/157 (42%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC  C+  G++  C+ +C  +FH           + LG T+        F+C  C    H
Sbjct: 1548 VCQNCEKLGELLLCEAQCCGAFHL----------ECLGLTEMPRG---KFICNECRTGIH 1594

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       K S ++V  C+   CG+FYH ECV K  +P            +   + 
Sbjct: 1595 TCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK--YPPT----------VMQNKG 1635

Query: 338  FTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYH 372
            F C +H C  C  +   +V     +L  C RCP AYH
Sbjct: 1636 FRCSLHICTTCHAANPASVSASKGRLMRCVRCPVAYH 1672



 Score = 39.7 bits (91), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1543 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1593

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1594 ----------HTCFVCKQS----GEDVKRCLLPLCGKFYHEECV 1623


>gi|149756942|ref|XP_001488967.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           1 [Equus caballus]
          Length = 1365

 Score = 69.7 bits (169), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 61/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 669 VCQLCEKTGSLVLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 715

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K +  +V  CV + CG+FYH  CV K      ES              
Sbjct: 716 TCFVC-------KENKTDVKRCVVSQCGKFYHEACVRKYPLTVFES------------RG 756

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|327284319|ref|XP_003226886.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Anolis
           carolinensis]
          Length = 1438

 Score = 69.7 bits (169), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 68/158 (43%), Gaps = 34/158 (21%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQ 275
           ++C IC+  G+    C+G C R FH           + LG + +Q +    F+C  C   
Sbjct: 701 SICQICESSGESLLSCEGECYRVFHT----------ECLGLS-SQPEG--KFICIECKNG 747

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAG 335
           +H CF+C + G       ++V  C    CG+FYH  CV K      ES            
Sbjct: 748 QHTCFSCKLPG-------KDVKRCSVNACGKFYHEACVRKFATTVFES------------ 788

Query: 336 ESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 789 RGFRCPQHCCTSCSVDKDIHKASKGRMVRCLRCPIAYH 826


>gi|47222897|emb|CAF99053.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 768

 Score = 69.7 bits (169), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 47/155 (30%), Positives = 63/155 (40%), Gaps = 37/155 (23%)

Query: 221 ICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCF 280
           +C+  GD+  CDG C  +FH           Q +G + A       F C+ C    H CF
Sbjct: 49  VCERTGDLLVCDGHCYGAFHP----------QCIGLSAAPEG---RFFCRECSSGVHSCF 95

Query: 281 ACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKL--LHPDNESLAEELRERIAAGESF 338
            C       K S   V  C+   CG+FYH +C++      P N              + F
Sbjct: 96  VC-------KKSGNGVKRCMIPLCGKFYHSDCIATFSATQPHN--------------KGF 134

Query: 339 TCPVHKCFVCQQS-EDMNVEDLQLAICRRCPKAYH 372
            CP+H C  C  +    N    +LA C RCP AYH
Sbjct: 135 RCPLHVCLSCHVAYPHSNCSKGRLARCVRCPVAYH 169


>gi|291223879|ref|XP_002731935.1| PREDICTED: Wolf-Hirschhorn syndrome candidate 1 protein-like
            [Saccoglossus kowalevskii]
          Length = 1787

 Score = 69.3 bits (168), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 47/159 (29%), Positives = 64/159 (40%), Gaps = 38/159 (23%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC +C+  G++  C+G+C  +FH          C  L    + +     F C  C+   H
Sbjct: 924  VCQVCERPGELLLCEGQCCGAFHLD--------CIGLQQMPSGV-----FKCDECISGVH 970

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSK--LLHPDNESLAEELRERIAAG 335
             CF C            EV  C  A CG+FYH +C+ K  L   D              G
Sbjct: 971  SCFIC-------TKCDIEVKRCSVALCGKFYHEDCLKKWTLTRFD--------------G 1009

Query: 336  ESFTCPVHKCFVCQQSEDMNVEDLQ--LAICRRCPKAYH 372
            +  TCP+H C  C      N +  +  L  C RCP AYH
Sbjct: 1010 KGVTCPLHTCQACAADNPRNPKATKGRLTRCVRCPTAYH 1048


>gi|359067302|ref|XP_002689078.2| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20 specific [Bos taurus]
          Length = 1470

 Score = 69.3 bits (168), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189 RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
           RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 281 RTAASPKETVEEGVENDHGMPASKKLQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 340

Query: 239 FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
           FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 341 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 380

Query: 299 CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
           C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   +V  
Sbjct: 381 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICTTCHAANPASVSA 428

Query: 357 EDLQLAICRRCPKAYH 372
              +L  C RCP AYH
Sbjct: 429 SKGRLMRCVRCPVAYH 444


>gi|355708043|gb|AES03146.1| nuclear receptor binding SET domain protein 1 [Mustela putorius
           furo]
          Length = 588

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189 RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
           RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 330 RTAASPKETVEEGVENDHGMPASKKLQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 389

Query: 239 FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
           FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 390 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 429

Query: 299 CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
           C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   +V  
Sbjct: 430 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPASVSA 477

Query: 357 EDLQLAICRRCPKAYH 372
              +L  C RCP AYH
Sbjct: 478 SKGRLMRCVRCPVAYH 493


>gi|432901504|ref|XP_004076868.1| PREDICTED: uncharacterized protein LOC101161079 [Oryzias latipes]
          Length = 2214

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 41/158 (25%), Positives = 60/158 (37%), Gaps = 34/158 (21%)

Query: 217  AVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQE 276
             +C +C+  G++  C+G+C  +FH          C SL            F+C  C    
Sbjct: 1427 TICQVCEKTGELLLCEGQCCGAFHLA--------CISLADAPKG-----KFVCPECKSGV 1473

Query: 277  HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
            H CF C   G       ++V  C+   CG+FYH EC++                      
Sbjct: 1474 HTCFVCKKRG-------EDVRRCMIPVCGKFYHGECITNHA------------PTAPVNR 1514

Query: 337  SFTCPVHKCFVC--QQSEDMNVEDLQLAICRRCPKAYH 372
             F C +H C  C        ++   +L  C RCP AYH
Sbjct: 1515 GFRCSIHVCLTCFIANPNSSSISKGRLVRCVRCPVAYH 1552


>gi|431892716|gb|ELK03149.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific, partial [Pteropus alecto]
          Length = 2202

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 53/196 (27%), Positives = 79/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 955  RTAASPKETVEEGVENDHGMPVSKKLQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1014

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG  +        F+C  C    H CF C       K S ++V  
Sbjct: 1015 FHL----------ECLGLNEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1054

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   +V  
Sbjct: 1055 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPASVSA 1102

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1103 SKGRLMRCVRCPVAYH 1118


>gi|350593412|ref|XP_003483678.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           NSD3-like [Sus scrofa]
          Length = 1438

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 68/160 (42%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+   D    C+G C + FH           + LG T     ++P+  F+C  C 
Sbjct: 702 TVCQICESPSDSLIPCEGECCKHFHL----------ECLGLT-----SLPDGKFVCVECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C + G+       +V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSCKVSGT-------DVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>gi|443709486|gb|ELU04158.1| hypothetical protein CAPTEDRAFT_93578, partial [Capitella teleta]
          Length = 1129

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 63/158 (39%), Gaps = 34/158 (21%)

Query: 217 AVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQE 276
           ++C +C+   ++  C G CL  FHA            LG +   +     F C  C    
Sbjct: 491 SLCGVCEKPNELLHCVGPCLGHFHAAC----------LGLS---VVPTSTFKCDECSTGI 537

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           H CF C       K + +E   C S  CG+FYH +CVS             +R       
Sbjct: 538 HSCFTC-------KKADKETKKCSSHGCGKFYHMKCVSG------------IRSAKVDNG 578

Query: 337 SFTCPVHKCFVCQQSEDMNVEDLQ--LAICRRCPKAYH 372
            FTCP+H C  C   +    +  Q  L  C RCP AYH
Sbjct: 579 HFTCPLHACATCAVDDVYKPKPKQNRLLRCVRCPTAYH 616


>gi|395505173|ref|XP_003756919.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Sarcophilus harrisii]
          Length = 2717

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 42/157 (26%), Positives = 67/157 (42%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC  C+  G++  C+ +C  +FH           + +G T+        F+C+ C    H
Sbjct: 1548 VCQNCEKVGELLLCEAQCCGAFHL----------ECIGLTEMPKG---KFICKECRTGIH 1594

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       K+S ++V  C+   CG+FYH  C+ K  +P            +   + 
Sbjct: 1595 TCFVC-------KTSGEDVKRCLLPLCGKFYHEACIQK--YPPT----------VLQNKG 1635

Query: 338  FTCPVHKCFVCQQSE--DMNVEDLQLAICRRCPKAYH 372
            F C +H C  C  +    ++    +L  C RCP AYH
Sbjct: 1636 FRCSLHMCITCHAANPASLSASKGRLMRCVRCPVAYH 1672


>gi|149726051|ref|XP_001502479.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Equus caballus]
          Length = 2700

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/157 (28%), Positives = 65/157 (41%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC  C+  G++  C+ +C  +FH           + LG  +        F+C  C    H
Sbjct: 1548 VCQNCEKLGELLLCEAQCCGAFHL----------ECLGLNEMPRG---KFICNECRTGIH 1594

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       K S ++V  C+   CG+FYH ECV K  +P            +   + 
Sbjct: 1595 TCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK--YPPT----------VMQNKG 1635

Query: 338  FTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYH 372
            F C +H C  C  +   +V     +L  C RCP AYH
Sbjct: 1636 FRCSLHICITCHAANPASVSASKGRLMRCVRCPVAYH 1672


>gi|444706655|gb|ELW47981.1| Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific [Tupaia chinensis]
          Length = 2687

 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/157 (28%), Positives = 65/157 (41%), Gaps = 34/157 (21%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC  C+  G++  C+ +C  +FH           + LG  +        F+C  C    H
Sbjct: 1537 VCQNCEKVGELLLCEAQCCGAFH----------LECLGLAEMPRG---KFICNECRTGIH 1583

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       K S ++V  C+   CG+FYH ECV K  +P            +   + 
Sbjct: 1584 TCFVC-------KQSGEDVKRCLLPLCGKFYHEECVQK--YPPT----------VMQNKG 1624

Query: 338  FTCPVHKCFVCQQSEDMNV--EDLQLAICRRCPKAYH 372
            F C +H C  C  +   +V     +L  C RCP AYH
Sbjct: 1625 FRCSLHICITCHAANPASVSASKGRLMRCVRCPVAYH 1661


>gi|403287002|ref|XP_003934751.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2
           [Saimiri boliviensis boliviensis]
          Length = 1368

 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECTSGIH 715

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K +  +V  CV   CG+FYH  CV K      ES              
Sbjct: 716 SCFVC-------KENKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 756

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|296197020|ref|XP_002746091.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           1 [Callithrix jacchus]
          Length = 1365

 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 715

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K +  +V  CV   CG+FYH  CV K      ES              
Sbjct: 716 SCFVC-------KENKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 756

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>gi|390461100|ref|XP_003732597.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           3 [Callithrix jacchus]
          Length = 1389

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 667 VCQLCEKPGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 713

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K +  +V  CV   CG+FYH  CV K      ES              
Sbjct: 714 SCFVC-------KENKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 754

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 755 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 791


>gi|390461098|ref|XP_003732596.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 isoform
           2 [Callithrix jacchus]
          Length = 1400

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 667 VCQLCEKPGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 713

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K +  +V  CV   CG+FYH  CV K      ES              
Sbjct: 714 SCFVC-------KENKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 754

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 755 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 791


>gi|296485540|tpg|DAA27655.1| TPA: nuclear receptor binding SET domain protein 1 [Bos taurus]
          Length = 1275

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189 RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
           RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 84  RTAASPKETVEEGVENDHGMPASKKLQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 143

Query: 239 FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
           FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 144 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 183

Query: 299 CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
           C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   +V  
Sbjct: 184 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICTTCHAANPASVSA 231

Query: 357 EDLQLAICRRCPKAYH 372
              +L  C RCP AYH
Sbjct: 232 SKGRLMRCVRCPVAYH 247


>gi|444511191|gb|ELV09829.1| Histone-lysine N-methyltransferase NSD3 [Tupaia chinensis]
          Length = 1235

 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 68/159 (42%), Gaps = 38/159 (23%)

Query: 218 VCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCVY 274
           +C +C+  GD    C+G C R FH           + LG T     ++P+  F C  C  
Sbjct: 557 LCQVCESAGDSLLPCEGDCCRHFHL----------ECLGLT-----SLPDGKFTCVECQT 601

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
             H CF+C       K+  ++V  C  + CG+FYH  CV K      ES           
Sbjct: 602 GLHPCFSC-------KTPGKDVKRCSVSACGKFYHEACVRKFPTAVFES----------- 643

Query: 335 GESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
            + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 644 -KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 681


>gi|431897324|gb|ELK06586.1| Putative histone-lysine N-methyltransferase NSD2 [Pteropus alecto]
          Length = 915

 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 55/202 (27%), Positives = 72/202 (35%), Gaps = 41/202 (20%)

Query: 192 EKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATI--- 243
           E P    D  + E S    K E   A     VC +C+  G +  C G C  +FH +    
Sbjct: 565 ESPYESADETQTEVSISSKKSERGVAAKKEHVCQLCEKTGGLALCVGPCCGAFHPSCLGL 624

Query: 244 --TAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVS 301
             T      C      + ++   P           H CF C       K S+ +V  CV 
Sbjct: 625 PRTPEGRLTCHECASGKCRVPTGPWV---------HSCFVC-------KESAGDVKRCVV 668

Query: 302 ATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQ- 360
           + CG+FYH  CV +      ES              F CP+H C  C  S   N    + 
Sbjct: 669 SQCGKFYHEACVRRFPLAVFES------------RGFRCPLHSCVSCHASNPSNPRPSKG 716

Query: 361 --LAICRRCPKAYHRKCLPTEI 380
             L  C  CP A+H  CL  E 
Sbjct: 717 GSLLCCESCPAAFHPDCLGIET 738


>gi|350587283|ref|XP_003128857.3| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Sus
           scrofa]
          Length = 1338

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/188 (28%), Positives = 68/188 (36%), Gaps = 39/188 (20%)

Query: 192 EKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITAG 246
           E P    D  + E S    + E   A     VC +C+  G +  C+G C  +FH      
Sbjct: 611 ESPDESADETQTEASVSSKRSERGAAAKKEHVCQLCEKPGSLVLCEGPCCGAFHLAC--- 667

Query: 247 KNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQ 306
                  LG  +          C  C    H CF C       K S  +V  CV + CG+
Sbjct: 668 -------LGLARRPEG---RLTCLECTSGVHSCFVC-------KESKADVRRCVVSQCGK 710

Query: 307 FYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAIC 364
           FYH  CV +      ES              F CP+H C  C  S   N      ++  C
Sbjct: 711 FYHEACVRRFPLTVFES------------RGFRCPLHSCVSCHASNPSNPRPSKGKMMRC 758

Query: 365 RRCPKAYH 372
            RCP AYH
Sbjct: 759 VRCPVAYH 766


>gi|281346901|gb|EFB22485.1| hypothetical protein PANDA_005493 [Ailuropoda melanoleuca]
          Length = 926

 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 67/160 (41%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG     + ++P+  F+C  C 
Sbjct: 181 TVCQICESSGDSLIPCEGECCKHFHL----------ECLG-----LASLPDGKFICVECK 225

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S  +V  C    CG+FYH  CV K      ES          
Sbjct: 226 TGQHACFSC-------KVSGTDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 268

Query: 334 AGESFTCPVHKCFVCQQSEDM-NVEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D+      ++  C RCP AYH
Sbjct: 269 --KGFRCPQHCCSACSVEKDIYKASKGRMMRCLRCPVAYH 306


>gi|402852477|ref|XP_003890948.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Papio anubis]
          Length = 1013

 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 317 VCQLCEKPGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 363

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV   CG+FYH  CV K      ES              
Sbjct: 364 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 404

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 405 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 441


>gi|301615056|ref|XP_002936997.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
            lysine-20 specific [Xenopus (Silurana) tropicalis]
          Length = 2440

 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 50/170 (29%), Positives = 69/170 (40%), Gaps = 44/170 (25%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNC--- 272
            VC +C+  G++  C+ +C  +FH           Q LG     ++A+P   F+C  C   
Sbjct: 1360 VCQVCEKPGELLLCEAQCCGAFHL----------QCLG-----MEAMPQGKFVCTECSSG 1404

Query: 273  -VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRER 331
             +   H CF C       K S Q V  C+   CG++YH EC   L +P            
Sbjct: 1405 TLQGYHTCFVC-------KESDQGVKRCMLPLCGKYYHEECA--LKYPP----------A 1445

Query: 332  IAAGESFTCPVHKCFVCQQSEDMN--VEDLQLAICRRCPKAYHRK--CLP 377
                  F C +H C  C  +   N      +L  C RCP AYH    CLP
Sbjct: 1446 TQQNRGFRCSLHICSTCYATNPSNPSASKGRLMRCVRCPIAYHANDFCLP 1495


>gi|281339990|gb|EFB15574.1| hypothetical protein PANDA_004672 [Ailuropoda melanoleuca]
          Length = 1363

 Score = 66.2 bits (160), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 61/161 (37%), Gaps = 38/161 (23%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQE- 276
           VC +C+  G +  C+G C  +FH             LG ++        F C  C     
Sbjct: 663 VCQLCEKTGSLVLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGRF 709

Query: 277 ---HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
              H CF C       K S  +V  CV + CG+FYH  CV K      ES          
Sbjct: 710 SWIHSCFVC-------KESKTDVKRCVVSQCGKFYHEACVRKYPLTVFES---------- 752

Query: 334 AGESFTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
               F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 753 --RGFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 791


>gi|145484966|ref|XP_001428492.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395578|emb|CAK61094.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1405

 Score = 66.2 bits (160), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 77/295 (26%), Positives = 114/295 (38%), Gaps = 80/295 (27%)

Query: 147  DLLDHMPISREAAERDKDLANSKFL-LS--------FMANHPETNTFHEDVRTTEKPKVI 197
            D+L+ +  S+   +  K+LANS  L LS           N    NT     +  E PK  
Sbjct: 1079 DILNLIEFSKFFEKDFKNLANSWILELSKPSRRFKKITENDLMINTAPTSQQVIETPKPT 1138

Query: 198  ----VDWNEDEDSERVDKDENY---FAVCAICDDGGDVTFCDGRCLRSFHATITAGKNAL 250
                ++W+E E+ +++    +      +CAIC+      FC G C R FH     G    
Sbjct: 1139 KNFELNWSEYEEYKKLQTKSSQDLQIRLCAICEKQKCDLFCRGFCRRQFHKECLEG---- 1194

Query: 251  CQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGS-----SDKSSSQ----------- 294
             Q  G TQ +I+    ++CQ+C   +  CF C   G+     S K + Q           
Sbjct: 1195 GQYNGQTQGEINI--KYVCQDCEKYKGTCFVCLKQGTFYPNQSKKKTQQDKQFIDDDGYY 1252

Query: 295  ----------------------------EVFPCVSATCGQFYHPECVSKLLHPDNESLAE 326
                                        E+  C S  C + YH  CV       N  + +
Sbjct: 1253 DNQGEQIIVATGKITRARAQQQLQQQLSELVKC-SLNCHKLYHFACVQT---SKNFKILD 1308

Query: 327  ELRERIAAGESFTCPVHKCFVCQ-QSEDMNVEDLQLAICRRCPKAYHRKCLPTEI 380
              R++      F C +H C  C+ +S D N + +Q   C RCPK+YH KC P  I
Sbjct: 1309 GERQK------FKCALHFCEKCKDKSNDDNQKMIQ---CLRCPKSYHEKCAPKGI 1354


>gi|126303359|ref|XP_001372863.1| PREDICTED: histone-lysine N-methyltransferase NSD3 [Monodelphis
           domestica]
          Length = 1435

 Score = 66.2 bits (160), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 47/160 (29%), Positives = 65/160 (40%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+   D +  C+G C + FH             LG     + ++P+  FLC  C 
Sbjct: 700 TVCQICESYSDSLVVCEGECCKHFHPDC----------LG-----LSSLPDGRFLCTECK 744

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
                CF+C + G       ++V  C    CG+FYH  CV K      ES          
Sbjct: 745 TGHRTCFSCKIPG-------KDVKRCSVPACGKFYHEACVRKFATAVFES---------- 787

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
               F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 788 --RGFRCPQHCCSTCSMEKDIHKASKGRMMRCLRCPIAYH 825


>gi|260800140|ref|XP_002594994.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
 gi|229280233|gb|EEN51005.1| hypothetical protein BRAFLDRAFT_99284 [Branchiostoma floridae]
          Length = 1541

 Score = 65.9 bits (159), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 41/160 (25%), Positives = 63/160 (39%), Gaps = 37/160 (23%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           +C +C+  G++  C+G C  +FH             +G  Q        F C  C+   H
Sbjct: 682 LCQVCEQVGELLLCEGSCCGAFHLDC----------IGLQQMPTG---TFKCDECISGVH 728

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       + S      C    CG++YH +C+ K  +             +   + 
Sbjct: 729 TCFVC-------RKSEVTTKRCSIPICGKYYHEDCLRKFPN------------TVFEAKG 769

Query: 338 FTCPVHKCFVCQQSEDMNVEDLQ-----LAICRRCPKAYH 372
           F CP+H C  C      +V+ ++     LA C RCP AYH
Sbjct: 770 FRCPLHVCGTCVAVAGGDVKKVKSRGRILARCVRCPTAYH 809


>gi|149634094|ref|XP_001506476.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Ornithorhynchus anatinus]
          Length = 1437

 Score = 65.9 bits (159), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 46/160 (28%), Positives = 68/160 (42%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC +C+   D +  C+G C R FH           + LG T     ++P+  F+C  C 
Sbjct: 701 TVCQVCESYSDSLVACEGECFRHFHL----------ECLGLT-----SLPDGKFVCVECE 745

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
              H CF+C + G       ++V  C  + CG+FYH  CV   + P            I 
Sbjct: 746 TGHHTCFSCKVPG-------KDVKRCSVSACGKFYHEACVR--IFPT----------AIF 786

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
               F CP H C  C   +D++     ++  C +CP AYH
Sbjct: 787 EARGFRCPQHCCSACSMEKDIHKASKGRMMRCLKCPIAYH 826


>gi|47226564|emb|CAG08580.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1404

 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 61/217 (28%), Positives = 89/217 (41%), Gaps = 61/217 (28%)

Query: 156 REAAERDKDLANSKFLLSFMANHPETNTFHEDVRTTEKPKVIVDWNEDE-------DSER 208
           R  A  D ++A+S++           +T   D R    P+ +   N D        D++ 
Sbjct: 648 RSRASTDVEMASSQY----------RDTSDSDSRGLSDPQGLFGKNLDSPAAADGSDTQS 697

Query: 209 VD-----KDENYF---AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQA 259
           VD     +D N      VC IC+  G+ +  C+G C R FH           + LG T  
Sbjct: 698 VDSGLSRQDSNTDKRDTVCQICEAYGEGLVVCEGDCSRQFHL----------ECLGLT-- 745

Query: 260 QIDAVP--NFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLL 317
              A+P   F C  C+  +H CF+C       K++ +EV  C  + CG FYH +CV KLL
Sbjct: 746 ---ALPEGRFTCLECLNGKHPCFSC-------KTAGREVTRCSVSGCGCFYHEDCVRKLL 795

Query: 318 HPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDM 354
              +           + G  F CP H C  C    D+
Sbjct: 796 GTTS-----------SPGGGFCCPQHICSTCCLERDL 821


>gi|190349638|gb|ACE75882.1| multiple-myeloma-related WHSC1/MMSET isoform RE-IIBP [Homo sapiens]
          Length = 704

 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 49/157 (31%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 8   VCQLCEKTGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCTECASGIH 54

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  EV  CV   CG+FYH  CV K      ES              
Sbjct: 55  SCFVC-------KESKMEVKRCVVNQCGKFYHEACVKKYPLTVFES------------RG 95

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 96  FRCPLHSCMSCHASNPSNPRPSKGKMMRCVRCPVAYH 132


>gi|119602957|gb|EAW82551.1| Wolf-Hirschhorn syndrome candidate 1, isoform CRA_d [Homo sapiens]
          Length = 742

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 55/189 (29%), Positives = 69/189 (36%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E         VC +C+  G +  C+G C  +FH     
Sbjct: 14  SESPYESADETQTEVSVSSKKSERGVTAKKEYVCQLCEKPGSLLLCEGPCCGAFHLAC-- 71

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG ++        F C  C    H CF C       K S  +V  CV   CG
Sbjct: 72  --------LGLSRRPEG---RFTCSECASGIHSCFVC-------KESKTDVKRCVVTQCG 113

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  S   N      ++  
Sbjct: 114 KFYHEACVKKYPLTVFES------------RGFRCPLHSCVSCHASNPSNPRPSKGKMMR 161

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 162 CVRCPVAYH 170


>gi|444722051|gb|ELW62755.1| putative histone-lysine N-methyltransferase NSD2 [Tupaia chinensis]
          Length = 1421

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 60/157 (38%), Gaps = 37/157 (23%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C   +   H
Sbjct: 560 VCQLCEKTGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSEWI---H 603

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV   CG+FYH  CV K      ES              
Sbjct: 604 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 644

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 645 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 681


>gi|432952957|ref|XP_004085262.1| PREDICTED: histone-lysine N-methyltransferase NSD2-like, partial
           [Oryzias latipes]
          Length = 1167

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 64/162 (39%), Gaps = 43/162 (26%)

Query: 218 VCAICDDGG-DVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQE 276
           VC +C+  G D+  C+G+C   FH             LG +    D     LCQ C    
Sbjct: 563 VCQVCEQAGEDLVPCEGQCCGMFHL----------HCLGPSFKPED---KLLCQQCRAGI 609

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHP----DNESLAEELRERI 332
           H CFAC       K S  EV  C    CG+FYH  C+   L+P    DN+ L        
Sbjct: 610 HSCFAC-------KQSEGEVRRCHVVHCGKFYHEACIR--LNPLTVFDNKGL-------- 652

Query: 333 AAGESFTCPVHKCFVCQQSEDMNVEDLQ--LAICRRCPKAYH 372
                  CP+H C  C        +  +  L  C RCP AYH
Sbjct: 653 ------RCPLHACLSCHYGSRTKHKSTKGRLMRCLRCPVAYH 688


>gi|47221386|emb|CAF97304.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 722

 Score = 63.5 bits (153), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 60/158 (37%), Gaps = 34/158 (21%)

Query: 217 AVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQE 276
            V  +C+  G++  C+G+C  +FH          C SL            F+C  C    
Sbjct: 45  TVVQVCEKTGELLLCEGQCCGAFHLP--------CISLAEAPKG-----KFICPECKSGI 91

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           H CF C       K  S +V  C+   CG+FYH EC++                 +A   
Sbjct: 92  HTCFVC-------KKRSDDVRRCMIPVCGKFYHGECIANFA------------PTVAVNR 132

Query: 337 SFTCPVHKCFVC--QQSEDMNVEDLQLAICRRCPKAYH 372
            F C +H C  C         +   +L  C RCP AYH
Sbjct: 133 GFRCSIHVCLTCFIANPNSSTISKGRLVRCVRCPIAYH 170


>gi|384247005|gb|EIE20493.1| hypothetical protein COCSUDRAFT_57640 [Coccomyxa subellipsoidea
            C-169]
          Length = 2740

 Score = 63.2 bits (152), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 40/137 (29%), Positives = 59/137 (43%), Gaps = 28/137 (20%)

Query: 217  AVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQE 276
            +VC+ C     +  C G CLR++H+         C  L    A       + C  C    
Sbjct: 1864 SVCSAC-----LVVCGGPCLRAYHSG--------CLGLPVLPAPGSV---WFCPECDSGR 1907

Query: 277  HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
             +CF+CG  G+  +  +  +  C    CG+FYH  CV +L+   N  +A+        G 
Sbjct: 1908 MRCFSCGEFGAGFEDPT--IRKCSLGVCGRFYHVRCVDRLV---NTQMAKR-------GA 1955

Query: 337  SFTCPVHKCFVCQQSED 353
             F CP H C VC +S D
Sbjct: 1956 HFRCPQHYCSVCAKSGD 1972


>gi|224010613|ref|XP_002294264.1| hypothetical protein THAPSDRAFT_269843 [Thalassiosira pseudonana
           CCMP1335]
 gi|220970281|gb|EED88619.1| hypothetical protein THAPSDRAFT_269843 [Thalassiosira pseudonana
           CCMP1335]
          Length = 392

 Score = 62.8 bits (151), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 43/133 (32%), Positives = 61/133 (45%), Gaps = 32/133 (24%)

Query: 230 FCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN---FLCQNCVYQEHQCFACGMLG 286
            C+G C R FH T        C +L      +   P+   ++CQ+C    HQC AC   G
Sbjct: 230 ICEGLCNRPFHPT--------CANL------LSPPPDNETWICQDCEQGRHQCAACHNYG 275

Query: 287 SSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEE---------LRER-IAAGE 336
             D     +V+ C +  CG F+H  C+S + + D + + EE         L+ER I +  
Sbjct: 276 QDD----VDVYCCTAKNCGLFFHEACLS-MYNVDVQVVEEEQYADDDTADLQERTIVSMP 330

Query: 337 SFTCPVHKCFVCQ 349
            F CP H CF CQ
Sbjct: 331 MFKCPAHSCFTCQ 343


>gi|449665927|ref|XP_002164851.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Hydra
           magnipapillata]
          Length = 1214

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 46/157 (29%), Positives = 66/157 (42%), Gaps = 33/157 (21%)

Query: 219 CAICDDG-GDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           C IC+   G +  C G C  SFH             LG +  +     +F C  C+   H
Sbjct: 453 CCICEGSEGQLITCQGSCFNSFHF----------DCLGVSCIKT----SFTCDECLSNNH 498

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C   G+        +  C    CG+ YH +C+ K+       +   + E  A    
Sbjct: 499 CCFFCKKPGA--------ILKCSHNMCGKHYHQDCLIKI-----PVIKSNVNENNA--NK 543

Query: 338 FTCPVHKCFVCQQSEDMNVEDL-QLAICRRCPKAYHR 373
           F CP+H C +C  SE  +  +L +L  C RCP AYH+
Sbjct: 544 FICPLHNCRLC--SEKASKGNLTKLLKCIRCPTAYHQ 578


>gi|300122245|emb|CBK22818.2| unnamed protein product [Blastocystis hominis]
          Length = 171

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 48/170 (28%), Positives = 69/170 (40%), Gaps = 36/170 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCVYQ 275
           +CAIC+       C G CLRSFH           Q L     ++ +VP+  + C +CV +
Sbjct: 8   ICAICERPNCPVRCSGGCLRSFHI----------QCL-----RLTSVPDRSWKCNDCVKR 52

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAG 335
            H+CF C       +    E+  C    C +++H                     R  + 
Sbjct: 53  SHECFEC-----KQRELDSELVQCSYPDCRRYFH-----------KREACCRCNPRSLST 96

Query: 336 ESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDA 385
             F CP H+C+ C    DMN   L+   C RC K YH  CLP  +   D+
Sbjct: 97  LDFVCPSHRCYACHGPSDMNNPLLK---CFRCTKCYHYSCLPPSVQSLDS 143


>gi|428175286|gb|EKX44177.1| hypothetical protein GUITHDRAFT_109961 [Guillardia theta CCMP2712]
          Length = 707

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/140 (27%), Positives = 68/140 (48%), Gaps = 13/140 (9%)

Query: 678 FGRHFTKVEKLKE-----IVDRLHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSF 732
            GR      ++KE     +V+ L   +   D  VDF CG N F  ++    +  G   S+
Sbjct: 537 IGRETATRLRVKEHAVYRLVEALQGVINQSDHFVDFACGNNRFGSIL---CQCFG--LSW 591

Query: 733 RNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKP 792
            +YD   P+ D ++  + WM   P ++P  + +++G+ PP      +   ++  AL+F+P
Sbjct: 592 SSYDEAVPE-DRNYMGKSWMLAGPSDVP--ANVVIGIFPPLHKGPIMCEMYLQHALQFRP 648

Query: 793 KLIVLIVPQETRRLDQKASY 812
           +L+VL+VP   R    K  Y
Sbjct: 649 RLLVLLVPASCRWCPDKRKY 668


>gi|213624868|gb|AAI71696.1| Wolf-Hirschhorn syndrome candidate 1 [Danio rerio]
          Length = 1461

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 63/164 (38%), Gaps = 46/164 (28%)

Query: 217 AVCAICDDGG-DVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQ 275
           +VC +C+  G D+  C G+C  ++H             +G  ++        LC  C   
Sbjct: 760 SVCLVCEQTGEDIVTCAGQCYGTYHL----------HCIGVERS----AEKILCTACSTG 805

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA- 334
            H CF C       K S  EV  C +  CG+FYH  CV                 R++A 
Sbjct: 806 VHVCFTC-------KKSEGEVRRCCALHCGRFYHEACV-----------------RLSAL 841

Query: 335 ----GESFTCPVHKCFVCQQSEDMNVEDL--QLAICRRCPKAYH 372
                  F CP+H C  C  S     +    ++  C RCP AYH
Sbjct: 842 TVFENRGFRCPLHTCLSCHYSGRAACKATKGKMMRCLRCPVAYH 885


>gi|128485462|ref|NP_001076020.1| probable histone-lysine N-methyltransferase NSD2 [Danio rerio]
          Length = 1461

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 63/164 (38%), Gaps = 46/164 (28%)

Query: 217 AVCAICDDGG-DVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQ 275
           +VC +C+  G D+  C G+C  ++H             +G  ++        LC  C   
Sbjct: 760 SVCLVCEQTGEDIVTCAGQCYGTYHL----------HCIGVERS----AEKILCTACSTG 805

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA- 334
            H CF C       K S  EV  C +  CG+FYH  CV                 R++A 
Sbjct: 806 VHVCFTC-------KKSEGEVRRCCALHCGRFYHEACV-----------------RLSAL 841

Query: 335 ----GESFTCPVHKCFVCQQSEDMNVEDL--QLAICRRCPKAYH 372
                  F CP+H C  C  S     +    ++  C RCP AYH
Sbjct: 842 TVFENRGFRCPLHTCLSCHYSGRAACKATKGKMMRCLRCPVAYH 885


>gi|86278478|gb|ABC88477.1| Wolf-Hirschhorn syndrome candidate 1 protein [Danio rerio]
          Length = 1366

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 63/164 (38%), Gaps = 46/164 (28%)

Query: 217 AVCAICDDGG-DVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQ 275
           +VC +C+  G D+  C G+C  ++H             +G  ++        LC  C   
Sbjct: 665 SVCLVCEQTGEDIVTCAGQCYGTYHL----------HCIGVERS----AEKILCTACSTG 710

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA- 334
            H CF C       K S  EV  C +  CG+FYH  CV                 R++A 
Sbjct: 711 VHVCFTC-------KKSEGEVRRCCALHCGRFYHEACV-----------------RLSAL 746

Query: 335 ----GESFTCPVHKCFVCQQSEDMNVEDL--QLAICRRCPKAYH 372
                  F CP+H C  C  S     +    ++  C RCP AYH
Sbjct: 747 TVFENRGFRCPLHTCLSCHYSGRAACKATKGKMMRCLRCPVAYH 790


>gi|168031756|ref|XP_001768386.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680311|gb|EDQ66748.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 615

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/185 (23%), Positives = 80/185 (43%), Gaps = 37/185 (20%)

Query: 197 IVDWNEDEDSERVDKDENYFA-VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLG 255
           + D  +++  ++  +D++ FA +C +C D G++  C+ RC   FH +             
Sbjct: 11  VKDMPKEDGRKKGKRDDDGFAFICDVCSDFGELLCCE-RCRSGFHLSCLG---------- 59

Query: 256 YTQAQIDAVPN---FLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPEC 312
                +D  P+   +LC +C   + +CF C   GS +     ++  C    CG++YH +C
Sbjct: 60  -----LDKCPDVEPWLCSSCAENKVRCFKCKAFGSLE----VDLVKCAHRNCGKYYHKDC 110

Query: 313 VSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYH 372
               +          +  +     +  CP H C  C++ +    ++ +L  C  CP AYH
Sbjct: 111 SKGWV---------RIPPKKTDNGAMVCPRHHCDACRKCQ----KNAKLHRCLYCPVAYH 157

Query: 373 RKCLP 377
             C P
Sbjct: 158 ESCSP 162


>gi|118367847|ref|XP_001017133.1| Type III restriction enzyme, res subunit family protein [Tetrahymena
            thermophila]
 gi|89298900|gb|EAR96888.1| Type III restriction enzyme, res subunit family protein [Tetrahymena
            thermophila SB210]
          Length = 2184

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/243 (23%), Positives = 84/243 (34%), Gaps = 93/243 (38%)

Query: 209  VDKDENYFA-VCAICDDGGDVTFCDGRCLRSFHAT----ITAG---KNALCQSL------ 254
            V K E+ F  +C  CD      FC G C RSFH+     + AG   +N +          
Sbjct: 1559 VSKKEDLFKKLCYYCDRPKCTVFCQGHCKRSFHSECKERVEAGWINENGMTSETRIVPTE 1618

Query: 255  -GYTQAQIDAV--PNFLCQNCVYQEHQCFACGMLGS------------------------ 287
              Y + ++  +   N+ C++C      CF C   GS                        
Sbjct: 1619 DHYDEERLKKMLNINYTCKDCEANTAICFICKKKGSFFPPAVLNKQNKRIQQQKQKIIAE 1678

Query: 288  --------------------------SDKSSSQ---EVFPCVSATCGQFYHPECVS---- 314
                                       D S SQ   ++  C +A C +FYHP C+     
Sbjct: 1679 KEKQKDPSNEIVEESENELDEDYDLVEDDSVSQKTSDLTKCSTANCNKFYHPVCIKPYAL 1738

Query: 315  -KLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHR 373
             K +  +N+               F CP+H C  C  S D     + ++ C RCPKAYH 
Sbjct: 1739 FKYIDSNNKR--------------FRCPLHYCAHCFISGD----SMAISQCVRCPKAYHL 1780

Query: 374  KCL 376
            +C 
Sbjct: 1781 RCF 1783


>gi|410898830|ref|XP_003962900.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Takifugu rubripes]
          Length = 1329

 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 60/158 (37%), Gaps = 36/158 (22%)

Query: 218 VCAICDDGG-DVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQE 276
           VC  C+  G D+  C+G+C   FH           Q LG  + +       LCQ C    
Sbjct: 630 VCQSCEKAGEDLVPCEGQCCGMFHL----------QCLGMMELED----KLLCQECSTGV 675

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           H CF C       K S   V  C    CG+FYH  C+   L+P            +   +
Sbjct: 676 HSCFHC-------KKSEGSVRRCHVPHCGKFYHEACIR--LNPLT----------VFDNK 716

Query: 337 SFTCPVHKCF--VCQQSEDMNVEDLQLAICRRCPKAYH 372
            F CP+H C    C           +L  C RCP AYH
Sbjct: 717 GFRCPLHTCLGCCCSSRTKHKPSKGRLMRCLRCPVAYH 754


>gi|47216786|emb|CAG03790.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1443

 Score = 60.1 bits (144), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 62/161 (38%), Gaps = 35/161 (21%)

Query: 215 YFAVCAICDDGG-DVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCV 273
           +F     C++ G D+  C+G+C   FH           Q LG      D V   LCQ C 
Sbjct: 669 FFLSPQSCEEAGEDLVPCEGQCCGMFHL----------QCLGVLLEPDDKV---LCQECS 715

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
              H CF C       K S   V  C    CG+FYH  C+   L+P            + 
Sbjct: 716 TGVHSCFHC-------KKSEGSVRRCHVPHCGKFYHEACIR--LNPLT----------VF 756

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDL--QLAICRRCPKAYH 372
             + F CP+H C  C        +    +L  C RCP AYH
Sbjct: 757 DNKGFRCPLHTCLGCCYGNRTKPKSTKGRLMRCLRCPVAYH 797



 Score = 40.4 bits (93), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 40/85 (47%), Gaps = 15/85 (17%)

Query: 292 SSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQS 351
           + +++ PC    CG F H +C+  LL PD++ L +E           +  VH CF C++S
Sbjct: 679 AGEDLVPCEGQCCGMF-HLQCLGVLLEPDDKVLCQEC----------STGVHSCFHCKKS 727

Query: 352 EDMNVEDLQLAICRRCPKAYHRKCL 376
           E      ++      C K YH  C+
Sbjct: 728 EG----SVRRCHVPHCGKFYHEACI 748


>gi|428162029|gb|EKX31238.1| hypothetical protein GUITHDRAFT_83305 [Guillardia theta CCMP2712]
          Length = 171

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 44/161 (27%), Positives = 65/161 (40%), Gaps = 28/161 (17%)

Query: 226 GDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGML 285
           G +  CDG C RSFH             LG  + + D    +LC  C     +C  C   
Sbjct: 38  GTLVCCDGSCRRSFHLAC----------LGMDEEENDEEEEWLCNLCKVGAKRCMIC--- 84

Query: 286 GSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKC 345
             SD   S+ +  C   +C +++H +C+            +     + A   FTCP H C
Sbjct: 85  --SDSQDSENMIHCKVESCKKYFHRDCL------------KTWNCEVDAAGRFTCPRHTC 130

Query: 346 FVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDAD 386
             C Q      + +    C  CP A+H KCLP+++  S  D
Sbjct: 131 KACNQHSYTGKQGVMFK-CIDCPAAFHFKCLPSQVNMSRCD 170


>gi|348520760|ref|XP_003447895.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Oreochromis niloticus]
          Length = 1167

 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 61/160 (38%), Gaps = 39/160 (24%)

Query: 218 VCAICDDGG-DVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQE 276
           VC +C+  G D+  C+G+C   FH             LG T    D +   LCQ C    
Sbjct: 461 VCQMCEQAGEDLVPCEGQCCGMFHL----------HCLGQTLNPDDKL---LCQECSTGI 507

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECV--SKLLHPDNESLAEELRERIAA 334
           H CF C  L         EV  C    CG+FYH  CV  + L   DN             
Sbjct: 508 HSCFICKKL-------EGEVRRCHVPHCGKFYHEACVRLNTLTVFDN------------- 547

Query: 335 GESFTCPVHKCFVCQQS--EDMNVEDLQLAICRRCPKAYH 372
            + F CP+H C  C             +L  C RCP AYH
Sbjct: 548 -KGFRCPLHTCLSCHYGCRTKHKSNKGRLMRCLRCPVAYH 586


>gi|300121581|emb|CBK22099.2| unnamed protein product [Blastocystis hominis]
          Length = 213

 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 44/163 (26%), Positives = 64/163 (39%), Gaps = 36/163 (22%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           +CA C+  G    C G+C R+FH                    + +   ++C +C  + H
Sbjct: 5   ICAECNKFGVDIVCFGKCHRAFHQACLP-------------CTVQSTRRWICPDCEKEVH 51

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
           +C  C    S      +++  C +  C  F+HP CV                E+      
Sbjct: 52  RCHQCKEFASD-----KDLIQCQATDCYLFFHPSCVP---------------EQFRRNRP 91

Query: 338 FTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEI 380
           F CP H C  C+Q ED   E   L  C  C K+YH  CLP E+
Sbjct: 92  FLCPAHCCSFCRQWED---ERDPLLKCAYCCKSYHEYCLPLEV 131


>gi|196013861|ref|XP_002116791.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
 gi|190580769|gb|EDV20850.1| hypothetical protein TRIADDRAFT_31338 [Trichoplax adhaerens]
          Length = 725

 Score = 59.7 bits (143), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 52/172 (30%), Positives = 69/172 (40%), Gaps = 41/172 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQ 275
            VC +C+  G+ +  C G C   FH           + LG T+     V  F CQ C   
Sbjct: 39  GVCMLCEGIGEKLVTCQGGCFSKFHL----------ECLGITEW---PVKGFKCQECTSG 85

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAG 335
             +CF CG       SS   V PC  A CG+ YH  C+S+        ++EE        
Sbjct: 86  TRKCFVCG-------SSEPPVLPCKVAGCGKCYHDTCISQFTCGQ---VSEE-------- 127

Query: 336 ESFTCPVHKCFVC------QQSEDMNVE-DLQLAICRRCPKAYH--RKCLPT 378
               CP H C  C      +  E+ +V    +LA C  CP +YH    CLP 
Sbjct: 128 GCIRCPRHVCTTCTAFAISKNDENFDVVIQKKLARCIYCPVSYHASEACLPA 179


>gi|414884836|tpg|DAA60850.1| TPA: putative EDM2-like family protein [Zea mays]
          Length = 103

 Score = 59.3 bits (142), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 50/83 (60%), Gaps = 2/83 (2%)

Query: 576 ELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSVKAVRAALEMLD 635
            ++ ++ +  SS   ++  ++  +P TH +  +  L   I LG +E SV+AV AAL+ L+
Sbjct: 18  RVITILGNKVSSLTIKDVTRNCSLPSTHVYSGR--LTDKIPLGKIERSVQAVEAALKKLE 75

Query: 636 GGCDIEDAKAVCPPEILCQIFQW 658
            G  + DAKAVC PE+L Q+ +W
Sbjct: 76  NGGSVNDAKAVCEPEVLRQLARW 98


>gi|441664377|ref|XP_003279042.2| PREDICTED: histone-lysine N-methyltransferase NSD2-like [Nomascus
           leucogenys]
          Length = 780

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/198 (28%), Positives = 71/198 (35%), Gaps = 35/198 (17%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E         VC +C+  G +  C+G C  +FH     
Sbjct: 110 SESPYESADETQTEVSVSSKKSERGVTAKKEYVCQLCEKPGSLLLCEGPCCGAFHLACLG 169

Query: 246 ------GKNALCQSLGYTQAQIDAVPNFLCQNCVYQE---HQCFACGMLGSSDKSSSQEV 296
                 G+    +        I     FL +N    E   H CF C       K S  +V
Sbjct: 170 LSRRPEGRFTCSECASDRSKVILDGWIFLLENAWTSEQGIHSCFVC-------KESKTDV 222

Query: 297 FPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV 356
             CV   CG+FYH  CV K      ES              F CP+H C  C  S   N 
Sbjct: 223 KRCVVTQCGKFYHEACVKKYPLTVFES------------RGFRCPLHSCVSCHASNPSNP 270

Query: 357 E--DLQLAICRRCPKAYH 372
                ++  C RCP AYH
Sbjct: 271 RPSKGKMMRCVRCPVAYH 288


>gi|224129130|ref|XP_002320508.1| predicted protein [Populus trichocarpa]
 gi|222861281|gb|EEE98823.1| predicted protein [Populus trichocarpa]
          Length = 190

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 25/47 (53%), Positives = 29/47 (61%)

Query: 825 KSFYLPGSLDVHDNQLEQWNCKPPPLYLWSRADWTASHKKIALGRGH 871
           KSFYL GS   +  Q+E W    PP+Y+WSRADWTA    IA   GH
Sbjct: 29  KSFYLSGSAVANGKQIEDWTVNTPPIYIWSRADWTAWRIPIAREHGH 75


>gi|326671180|ref|XP_694414.5| PREDICTED: histone-lysine N-methyltransferase NSD3 [Danio rerio]
          Length = 1562

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 63/158 (39%), Gaps = 34/158 (21%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQ 275
            VC +C+  GD +  C+G C R FH     G N+  +S              +CQ C   
Sbjct: 812 TVCHVCETFGDSLVSCEGDCNRLFHPECM-GSNSGKES------------ETVCQECKTG 858

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAG 335
            H CF+C       K +  ++  C    CG++YH  CV K     +++            
Sbjct: 859 SHPCFSC-------KVTEGDMKRCSVNGCGRYYHETCVRKYTGSASDT------------ 899

Query: 336 ESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
           +   CP H C  C    D+      ++  C RCP AYH
Sbjct: 900 KGLRCPQHSCATCCLDRDLQKAGKGRMMRCIRCPVAYH 937


>gi|145522682|ref|XP_001447185.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414685|emb|CAK79788.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1508

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 62/271 (22%), Positives = 110/271 (40%), Gaps = 70/271 (25%)

Query: 164  DLANSKFLLSFMANHPE--TNTFHEDVRTTEKPKVIVDWNEDEDSERV---DKDENYFAV 218
            DL N     ++  N+ E   N  +E  + + + K + D +  ED+ R+    K+E    +
Sbjct: 1039 DLLNKINSSNYAKNYRELAMNWIYELSKPSRRIKKLTDMDLGEDARRLASKSKEELIKKL 1098

Query: 219  CAICDDGGDVTFCDGRCLRSFHATITAGKNALCQS--------------------LGYTQ 258
            C +C+      FC G C R+FH    A K+ L  +                    L +++
Sbjct: 1099 CYVCERPNCSVFCMGHCRRAFH---IACKDLLETTEYINIEGPDQDFLNNHSFPELNWSE 1155

Query: 259  AQIDAVPNFL-----CQNCVYQEHQCFACG-----------MLGSSDKSSSQE------- 295
             Q+    N       C+NC+     C + G           ++ SSD++   E       
Sbjct: 1156 EQLKENVNIRYSCPDCRNCLVVCLLCKSKGTYPPEKKQKEEVIASSDENDPLEDNVKKIK 1215

Query: 296  ----VFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQS 351
                +  C +A C +++H  C+    +P +++L   +       E F CP H C  C+  
Sbjct: 1216 NKSAISKCSTANCNRYFHLSCIQA--NPLSKTLDSNV-------ELFRCPSHVCVFCK-- 1264

Query: 352  EDMNVEDLQLAI--CRRCPKAYHRKCLPTEI 380
              +N  +   A+  C RC +++H KC P E+
Sbjct: 1265 --VNSSNTTTALIHCVRCCRSFHSKCAPPEV 1293


>gi|242017983|ref|XP_002429463.1| set domain protein, putative [Pediculus humanus corporis]
 gi|212514395|gb|EEB16725.1| set domain protein, putative [Pediculus humanus corporis]
          Length = 2269

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 68/274 (24%), Positives = 102/274 (37%), Gaps = 75/274 (27%)

Query: 161  RDKDLANSKFLLSFMANHPETNT---FHEDVRTTEKPKVIVDWNEDEDSERVDKDENYFA 217
            ++ D  + K  L F+ + P+ N+   F + V T  K K  +         R D +E  + 
Sbjct: 1192 KNSDKDSPKITLKFVRS-PKKNSKRKFRKSVETDGKVKKRI---------RKDGNEEEY- 1240

Query: 218  VCAICDD---GGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQI------------- 261
            VC IC +         C G C  SFH + T   NA    +  T   +             
Sbjct: 1241 VCQICFEFLENEKTIKCKGLCQGSFHKSCTEKLNANLPPIDETHMPVIKNKRRRKLFANG 1300

Query: 262  -----------DAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHP 310
                       D   +++C +C+  E+ CF C        S S     C +A CG+FYH 
Sbjct: 1301 DSDSPCEAQEKDEEQDWMCGDCLRGENPCFVCN-------SRSGNRQRCCAAYCGKFYHQ 1353

Query: 311  ECV----------------SKLLHPDNESLAEELRERIAAGESFTCPVHKCFVC--QQSE 352
            +C+                S  L  +  S +E    +I      +CP+H C  C      
Sbjct: 1354 KCLKLFPQTIYNQGAPIRKSHALQQNKTSNSESPHYQI-----LSCPLHVCHTCASDNPS 1408

Query: 353  DMNVEDL--QLAICRRCPKAYH--RKCLPTEITF 382
            D N + L  +L  C +CP AYH    C+P   T 
Sbjct: 1409 DCNSKYLSERLVKCIKCPSAYHVGNYCVPAGSTI 1442


>gi|167526088|ref|XP_001747378.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774213|gb|EDQ87845.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1048

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/189 (23%), Positives = 62/189 (32%), Gaps = 49/189 (25%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            C IC + G +  CDG C  +FH           + L   Q        F C  C    H
Sbjct: 441 TCRICGELGQLICCDGGCRGAFHL----------ECLSILQPPTG---EFRCDECSTGNH 487

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            C+ C  +G+       ++  C    C + YH  C  K    DN                
Sbjct: 488 TCYTCDKVGA-------DLIKCQFPHCNKLYHRGCAEKQFKADNF--------------- 525

Query: 338 FTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRAWVDL 397
                  C VC    D       L +C  CP AYH  C+ +   F+   +   Q    D 
Sbjct: 526 -------CLVCGTGGD-------LVVCDGCPGAYHAACIKSTFAFTGKPDEQGQWFCHDC 571

Query: 398 LPNNRILIY 406
           L   + +I+
Sbjct: 572 LTGTKSMIH 580


>gi|94732456|emb|CAK03662.1| novel protein similar to vertebrate Wolf-Hirschhorn syndrome
           candidate 1 (WHSC1) [Danio rerio]
          Length = 728

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 46/161 (28%), Positives = 63/161 (39%), Gaps = 40/161 (24%)

Query: 217 AVCAICDDGG-DVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQ 275
           +VC +C+  G D+  C G+C  ++H             +G  ++        LC  C   
Sbjct: 27  SVCLVCEQTGEDIVTCAGQCYGTYHL----------HCIGVERS----AEKILCTACSTG 72

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECV--SKLLHPDNESLAEELRERIA 333
            H CF C       K S  EV  C +  CG+FYH  CV  S L   +N            
Sbjct: 73  VHVCFTC-------KKSEGEVRRCCALHCGRFYHEACVRLSALTVFEN------------ 113

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDL--QLAICRRCPKAYH 372
               F CP+H C  C  S     +    ++  C RCP AYH
Sbjct: 114 --RGFRCPLHTCLSCHYSGRAACKATKGKMMRCLRCPVAYH 152


>gi|145512399|ref|XP_001442116.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124409388|emb|CAK74719.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1659

 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 52/228 (22%), Positives = 86/228 (37%), Gaps = 63/228 (27%)

Query: 202  EDEDSERV---DKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITA------------- 245
            + ED+ R+    K+E    +C +C+      FC G C R+FH                  
Sbjct: 1231 QGEDARRLASKSKEELIKKLCYVCERSNCSIFCMGHCRRAFHDACKELLETTDYINVEGP 1290

Query: 246  -----GKNALCQSLGYTQAQIDAVPN--FLCQNCVYQEHQCFACGMLGS--SDKSSSQEV 296
                  KNA  +   ++  Q+    N  + C +C      C  C   G+   DK   +E+
Sbjct: 1291 DQDFLNKNAFPE-FNWSGEQLKEKINIRYSCPDCRNNLVVCLLCKQKGTYPPDKKQKEEI 1349

Query: 297  F----------------------PCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
                                    C +A C +++H  C+    +P ++SL          
Sbjct: 1350 VQIDEFDPSDDNVKKSKNKSIISKCSTANCNRYFHLSCIQA--NPLSKSLD-------TN 1400

Query: 335  GESFTCPVHKCFVCQQSEDMNVEDLQLAI--CRRCPKAYHRKCLPTEI 380
             + F CP H C  C+    +N  ++  A+  C RC +++H KC P EI
Sbjct: 1401 ADLFRCPSHICVFCK----INSSNMTTALIHCIRCCRSFHSKCAPPEI 1444


>gi|347972366|ref|XP_316738.5| AGAP004656-PA [Anopheles gambiae str. PEST]
 gi|333469400|gb|EAA11974.5| AGAP004656-PA [Anopheles gambiae str. PEST]
          Length = 1259

 Score = 56.2 bits (134), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 45/168 (26%), Positives = 67/168 (39%), Gaps = 26/168 (15%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQ-- 275
           VC IC+   +VT C  +C    H    A      + +   Q  +D      C+ C     
Sbjct: 608 VCYICNRPNNVTKC-SKCTLHLHLVCLANDP---EEVVKMQELVDQ-KKLCCEKCSTTSI 662

Query: 276 -EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
            E  CF C       + S+++++ CV   C Q YH  C+   L P         + R  +
Sbjct: 663 VEKTCFICN--DEIPEKSNEQIYRCVVGKCTQAYHISCLQ--LFP---------QVRQVS 709

Query: 335 GESFTCPVHKCFVCQQSEDMNVEDL---QLAICRRCPKAYH--RKCLP 377
             +  CP H C  C  SE  +   +    LA C +CP +YH    C+P
Sbjct: 710 ASTIICPYHTCHTCVASEPRSTASMVKTTLAHCLKCPTSYHPSGNCIP 757


>gi|426232375|ref|XP_004010202.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2 [Ovis aries]
          Length = 1273

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 42/153 (27%), Positives = 57/153 (37%), Gaps = 42/153 (27%)

Query: 223 DDGGDVTFCDGRCLRSFH-ATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFA 281
           ++ G +  C+G C  +FH A +   +    +S G                     H CF 
Sbjct: 624 EEPGSLVLCEGPCCGAFHLACLGLSRRPEGRSRGT--------------------HSCFV 663

Query: 282 CGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCP 341
           C       K S  +V  CV + CG+FYH  CV +      ES              F CP
Sbjct: 664 C-------KESKSDVKRCVVSQCGKFYHEACVRRFPLTVFES------------RGFRCP 704

Query: 342 VHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           +H C  C  S   N      ++  C RCP AYH
Sbjct: 705 LHSCLSCHASNPSNPRPSKGKMLRCVRCPVAYH 737


>gi|198420162|ref|XP_002123587.1| PREDICTED: similar to Wolf-Hirschhorn syndrome candidate 1-like 1
            [Ciona intestinalis]
          Length = 1485

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 46/165 (27%), Positives = 66/165 (40%), Gaps = 24/165 (14%)

Query: 211  KDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQ 270
            K EN  + C    D  ++  C G C  SFH                 +  I ++ +F C+
Sbjct: 1329 KRENICSKCESASDEEELVHCIGGCCASFHPRC------------LEEGAITSIVSFTCK 1376

Query: 271  NCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSK-----LLH---PDNE 322
             C    H CF C    S+D S   E+  C    CG+ YH  C+++     L H      +
Sbjct: 1377 LCKEDNHPCFICKKSSSNDDSLG-EIKRCSVPKCGRSYHTTCLTERKIALLQHDDDDQKK 1435

Query: 323  SLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRC 367
               E+ +ER A   S  C +H+C  C   E    ED  +A   RC
Sbjct: 1436 QDKEKEKERQAILNSSKCTLHQCGTCTTKEK---EDSNVASSNRC 1477


>gi|72386767|ref|XP_843808.1| transcription activator [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62359823|gb|AAX80252.1| transcription activator, putative [Trypanosoma brucei]
 gi|70800340|gb|AAZ10249.1| transcription activator, putative [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 948

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 52/114 (45%), Gaps = 19/114 (16%)

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           H+CF CG          Q +F C  + C + YH EC ++ + P          E ++A  
Sbjct: 795 HECFVCG----EGMRPLQPLFHC--SWCTKAYHAECANERVLP----------EGVSAPR 838

Query: 337 SFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNF 390
           ++TCP H+C +C   E +   D  L +C  CP A+   CL  +    D D  NF
Sbjct: 839 NWTCPRHRCDLC---EKVATTDGALFMCYECPAAFCFDCLDKDYLDLDNDGVNF 889


>gi|261326898|emb|CBH09871.1| transcription activator, putative [Trypanosoma brucei gambiense
           DAL972]
          Length = 948

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 38/137 (27%), Positives = 59/137 (43%), Gaps = 20/137 (14%)

Query: 255 GYTQAQIDAVP-NFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECV 313
           G     + A P   + +  +   H+CF CG          Q +F C  + C + YH EC 
Sbjct: 772 GGASTMLSAKPIGRVARGTIGVTHECFVCG----EGMRPLQPLFHC--SWCMKAYHAECA 825

Query: 314 SKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHR 373
           ++ + P          E ++A  ++TCP H+C +C   E +   D  L +C  CP A+  
Sbjct: 826 NERVLP----------EGVSAPRNWTCPRHRCDLC---EKVATTDGALFMCYECPAAFCF 872

Query: 374 KCLPTEITFSDADENNF 390
            CL  +    D D  NF
Sbjct: 873 DCLDKDYLDLDNDGVNF 889


>gi|313221637|emb|CBY36122.1| unnamed protein product [Oikopleura dioica]
          Length = 913

 Score = 53.5 bits (127), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 35/139 (25%), Positives = 61/139 (43%), Gaps = 21/139 (15%)

Query: 249 ALCQSLGYTQAQI--DAVPNFLCQNCVYQEHQCFACGM-----LGSSDKSSSQEVFPCVS 301
           A+C+   ++Q Q    A  +F+C+ C  ++  CF C M        +D+     +  C  
Sbjct: 613 AICKHFFHSQCQPCDTASTSFVCRICCERKFTCFKCKMEIVPETEENDQKGDSSIAWCPV 672

Query: 302 ATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQL 361
             CG+ YH  C+      +N+ L+   + + +    + CP+H C  C   +D   + ++L
Sbjct: 673 EDCGKVYHKHCL------ENDELSVPYKPKDSTQTLYKCPLHNCRSCVLKKDDKKDSIKL 726

Query: 362 AI--------CRRCPKAYH 372
                     C RCP AYH
Sbjct: 727 NSSSKGKMFKCIRCPAAYH 745


>gi|313227685|emb|CBY22833.1| unnamed protein product [Oikopleura dioica]
          Length = 1179

 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 35/139 (25%), Positives = 61/139 (43%), Gaps = 21/139 (15%)

Query: 249 ALCQSLGYTQAQI--DAVPNFLCQNCVYQEHQCFACGM-----LGSSDKSSSQEVFPCVS 301
           A+C+   ++Q Q    A  +F+C+ C  ++  CF C M        +D+     +  C  
Sbjct: 613 AICKHFFHSQCQPCDTASTSFVCRICCERKFTCFKCKMEIVPETEENDQKGDSSIAWCPV 672

Query: 302 ATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQL 361
             CG+ YH  C+      +N+ L+   + + +    + CP+H C  C   +D   + ++L
Sbjct: 673 EDCGKVYHKHCL------ENDELSVPYKPKDSTQTLYKCPLHNCRSCVLKKDDKKDSIKL 726

Query: 362 AI--------CRRCPKAYH 372
                     C RCP AYH
Sbjct: 727 NSSSKGKMFKCIRCPAAYH 745


>gi|170050731|ref|XP_001861443.1| set domain protein [Culex quinquefasciatus]
 gi|167872245|gb|EDS35628.1| set domain protein [Culex quinquefasciatus]
          Length = 1181

 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 42/166 (25%), Positives = 63/166 (37%), Gaps = 29/166 (17%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCV-YQE 276
            CA+C +  DV  C   C   FH      K +    L            F C  C   Q 
Sbjct: 690 ACAVCLEPNDVVKCSA-CHNHFHPRCIGAKTSSDPKL------------FKCVECADGQT 736

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           H+CF C    ++  S ++    C  + CG++YH  C+   L P ++  +           
Sbjct: 737 HRCFVCNDQDATIASETKHR--CALSGCGKYYHIHCLR--LFPQHKITS------TPNSS 786

Query: 337 SFTCPVHKCFVCQQSE---DMNVEDLQLAICRRCPKAYH--RKCLP 377
           +  CP H C  C   +   +       L  C +CP +YH   KC+P
Sbjct: 787 TLFCPYHTCHTCVSDDPRTNATATRGSLVRCIKCPSSYHPDAKCVP 832


>gi|345781638|ref|XP_003432154.1| PREDICTED: histone-lysine N-methyltransferase NSD3-like [Canis
           lupus familiaris]
          Length = 742

 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 60/160 (37%), Gaps = 47/160 (29%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG     + +VP+  F+C  C 
Sbjct: 42  TVCQICESSGDSLIPCEGECCKHFHL----------ECLG-----LASVPDGKFICIEC- 85

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
                             S  +V  C    CG+FYH  CV K      ES          
Sbjct: 86  ---------------KTVSGTDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 120

Query: 334 AGESFTCPVHKCFVCQQSEDM-NVEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D+      ++  C RCP AYH
Sbjct: 121 --KGFRCPQHCCSACSMEKDIYKASKGRMMRCLRCPVAYH 158


>gi|145491053|ref|XP_001431526.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398631|emb|CAK64128.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1668

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 52/231 (22%), Positives = 89/231 (38%), Gaps = 68/231 (29%)

Query: 202  EDEDSERV---DKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQS----- 253
            + ED+ R+    K+E    +C +C+      FC G C R+FH    A K+ L  +     
Sbjct: 1231 QGEDARRLASKSKEELIKKLCYVCERPNCSVFCMGHCRRAFH---IACKDLLETTEYINI 1287

Query: 254  ---------------LGYTQAQIDAVPN--FLCQNCVYQEHQCFACGMLGS--SDKSSSQ 294
                           L +++ Q+    N  + C +C      C  C   G+   +K   +
Sbjct: 1288 EGPDQDFLNNHSFPELNWSEEQLKENVNIRYSCPDCRNSLVVCLLCKTKGTYPPEKKQKE 1347

Query: 295  EVFP-----------------------CVSATCGQFYHPECVSKLLHPDNESLAEELRER 331
            EV                         C +A C +++H  C+    +P +++L   +   
Sbjct: 1348 EVIASSDENDPLEDNVRKIKNKSAISKCSTANCNRYFHLNCIQA--NPLSKTLDSNV--- 1402

Query: 332  IAAGESFTCPVHKCFVCQQSEDMNVEDLQLAI--CRRCPKAYHRKCLPTEI 380
                E F CP H C  C+    +N  +   A+  C RC +++H KC P E+
Sbjct: 1403 ----ELFRCPSHVCVFCK----VNSSNTTTALIHCVRCCRSFHSKCAPPEV 1445


>gi|449534225|ref|XP_004174066.1| PREDICTED: uncharacterized protein LOC101231072, partial [Cucumis
           sativus]
          Length = 158

 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 39/56 (69%)

Query: 569 VDAEMEKELLALIKDSTSSFNEEEFMKSHIVPITHAHHSKHLLEKSITLGLVEGSV 624
           +DA  E+ L+ ++K+  SS   E+ +K H VP THA+  KH+++K+I +G +EGSV
Sbjct: 102 LDANSERRLMDMMKNVASSITLEDVIKKHKVPSTHAYSLKHVVDKTIKMGKLEGSV 157


>gi|157126101|ref|XP_001654536.1| set domain protein [Aedes aegypti]
 gi|108873380|gb|EAT37605.1| AAEL010414-PA [Aedes aegypti]
          Length = 1480

 Score = 52.8 bits (125), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 44/179 (24%), Positives = 65/179 (36%), Gaps = 30/179 (16%)

Query: 212 DENYFA-------VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAV 264
           D+NY         VCA+C    D+  C  +C   +H              G  +      
Sbjct: 800 DKNYLLKGVPRGLVCAVCTKPHDLVKCT-KCYNHYHLACLTDVPIKPDPAGENKT----- 853

Query: 265 PNFLCQNCVY-QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNES 323
             F C +CV  +   CF C      D    +E F CV   CG+ YH  C+   L P ++ 
Sbjct: 854 --FTCTDCVMLKAPTCFVCN--DQDDAVKEEEKFRCVMNGCGKQYHLNCLR--LFPQHKF 907

Query: 324 LAEELRERIAAGESFTCPVHKCFVCQQSE---DMNVEDLQLAICRRCPKAYHR--KCLP 377
                    +   +  CP H C  C   +   +       L  C +CP +YH   +C+P
Sbjct: 908 TGTS-----SKSSTLYCPTHTCHTCVSDDPRSNATTTKGHLIRCIKCPSSYHTEARCIP 961


>gi|357627347|gb|EHJ77076.1| hypothetical protein KGM_14526 [Danaus plexippus]
          Length = 1912

 Score = 52.8 bits (125), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 38/126 (30%), Positives = 53/126 (42%), Gaps = 12/126 (9%)

Query: 260  QIDAVPNFLCQNCV-YQEHQCFACGM-LGSSDKSSSQEVFPCVSATCGQFYHPECVSKLL 317
            +I     F C+NC  Y    CF C   +   +K   ++   C  A C ++YH EC+    
Sbjct: 1276 EITDYSEFKCKNCQKYDPPVCFVCKYPISPKEKQGHRQ--KCQVAHCNKYYHLECLDHW- 1332

Query: 318  HPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDL----QLAICRRCPKAYH- 372
             P  +    E+       E+ TCP H C  C   +    +      +LA C RCP  YH 
Sbjct: 1333 -PQTQFNGGEISRTNKFSEALTCPRHVCHTCVCDDPRGCKTRFSGDKLARCVRCPATYHT 1391

Query: 373  -RKCLP 377
              KCLP
Sbjct: 1392 FTKCLP 1397


>gi|410958014|ref|XP_003985618.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2 [Felis
           catus]
          Length = 1300

 Score = 52.8 bits (125), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 40/98 (40%), Gaps = 21/98 (21%)

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           H CF C       K S  +V  CV + CG+FYH  CV K      ES             
Sbjct: 650 HSCFVC-------KESKTDVRRCVVSQCGKFYHEACVRKYPLTVFES------------R 690

Query: 337 SFTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
            F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 691 GFRCPLHSCVTCHASNPSNPRPSKGKMMRCVRCPVAYH 728


>gi|91090902|ref|XP_973711.1| PREDICTED: similar to NSD1 [Tribolium castaneum]
          Length = 1795

 Score = 52.8 bits (125), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 48/110 (43%), Gaps = 23/110 (20%)

Query: 279  CFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESF 338
            C  CG +G+  +        C    CG+FYHPEC+          L  + +  + A +SF
Sbjct: 1062 CLVCGEMGAQGRQK------CSLHQCGRFYHPECLK---------LWPQTQWSLNASDSF 1106

Query: 339  TCPVHKCFVC----QQSEDMNVEDLQLAICRRCPKAYHRK--CLP--TEI 380
             CP H C  C     ++ +      ++  C +CP  YH    C+P  TEI
Sbjct: 1107 VCPRHVCHTCISDDPRAANSRCSSDKIVKCLKCPATYHSSNYCVPAGTEI 1156


>gi|270014006|gb|EFA10454.1| hypothetical protein TcasGA2_TC012700 [Tribolium castaneum]
          Length = 1740

 Score = 52.8 bits (125), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 48/110 (43%), Gaps = 23/110 (20%)

Query: 279  CFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESF 338
            C  CG +G+  +        C    CG+FYHPEC+          L  + +  + A +SF
Sbjct: 1007 CLVCGEMGAQGRQK------CSLHQCGRFYHPECLK---------LWPQTQWSLNASDSF 1051

Query: 339  TCPVHKCFVC----QQSEDMNVEDLQLAICRRCPKAYHRK--CLP--TEI 380
             CP H C  C     ++ +      ++  C +CP  YH    C+P  TEI
Sbjct: 1052 VCPRHVCHTCISDDPRAANSRCSSDKIVKCLKCPATYHSSNYCVPAGTEI 1101


>gi|322788177|gb|EFZ13959.1| hypothetical protein SINV_06678 [Solenopsis invicta]
          Length = 1093

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/195 (25%), Positives = 67/195 (34%), Gaps = 56/195 (28%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHAT----------------------------ITAGKNA 249
           VC IC+  G +T C G C   FH +                            I    N 
Sbjct: 464 VCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEYSVDENTLDERLLSDLNIIKRSING 523

Query: 250 LCQSLGYTQAQIDAVPNFLCQNCVYQ-EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFY 308
             ++ G +  Q D    F C +C+      CF C            +   C+   CG+ Y
Sbjct: 524 ESENNGKSDEQEDET--FKCIDCLSGVAPACFLCN-------EREGDRIRCIVPACGKHY 574

Query: 309 HPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMN----VEDLQLAIC 364
           H +C    L P  +S  +        G   TCP H C  C      N      + ++A C
Sbjct: 575 HSKC----LIPWPQSHWQ--------GGRLTCPYHVCHTCSSDNPQNNRSRAPNEKVAKC 622

Query: 365 RRCPKAYHRK--CLP 377
            RCP +YH    CLP
Sbjct: 623 VRCPSSYHASALCLP 637


>gi|403352806|gb|EJY75922.1| Type III restriction enzyme, res subunit family protein [Oxytricha
            trifallax]
          Length = 2584

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 12/85 (14%)

Query: 296  VFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMN 355
            V  C    C ++YH +CV       N S    ++      + F CP+H C +CQ S D  
Sbjct: 2299 VVRCTLGGCFKYYHVDCV---YQNKNVSCINSMK-----VQRFRCPLHYCEICQASGD-- 2348

Query: 356  VEDLQLAICRRCPKAYHRKCLPTEI 380
               + +  C  CP AYH KCLP ++
Sbjct: 2349 --SVLIVQCVNCPTAYHLKCLPQDL 2371


>gi|332028801|gb|EGI68830.1| Putative histone-lysine N-methyltransferase NSD2 [Acromyrmex
           echinatior]
          Length = 1304

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 52/223 (23%), Positives = 75/223 (33%), Gaps = 74/223 (33%)

Query: 208 RVDKDENYFA------------------VCAICDDGGDVTFCDGRCLRSFHAT------- 242
           ++DK+EN+ +                  VC IC+  G +T C G C   FH +       
Sbjct: 490 KIDKEENFVSETKRSRLYNLFKGMKQEKVCQICEKTGQLTRCKGPCYSYFHLSCVKPGES 549

Query: 243 ---------------------ITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQ-EHQCF 280
                                I    N   ++ G +  Q D    F C +C+      CF
Sbjct: 550 SPEYSVDENTLDDRLLSDLNIIKKSINGESENNGKSDEQEDET--FKCIDCLSGVAPACF 607

Query: 281 ACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTC 340
            C            +   C+   CG+ YH  C+  L  P +             G   TC
Sbjct: 608 LCN-------EREGDRIRCIVPACGKHYHSNCL--LSWPQSH----------WQGGRLTC 648

Query: 341 PVHKCFVCQ----QSEDMNVEDLQLAICRRCPKAYHRK--CLP 377
           P H C  C     Q +     + ++A C RCP +YH    CLP
Sbjct: 649 PYHVCHTCSSDNPQDKRSRAPNEKMARCVRCPSSYHASTLCLP 691


>gi|332023034|gb|EGI63299.1| Histone-lysine N-methyltransferase trr [Acromyrmex echinatior]
          Length = 3474

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/170 (28%), Positives = 67/170 (39%), Gaps = 38/170 (22%)

Query: 219 CAICDDGGDVTFCD-GRCLRSFHATITAGKNALCQSLG---YTQAQIDAVPNFLCQ--NC 272
           CA C   G    C    C R FH    A  +    S     +    +  VP  LC    C
Sbjct: 308 CAYCSHYGAGISCKVTSCNRYFHFPCAAASSCFQDSKNLALFCSQHLGQVPLLLCGEVTC 367

Query: 273 VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
           V    QC+  G + +            + + CGQ YH  CV   L P            +
Sbjct: 368 V----QCYGMGDVSNL----------VMCSVCGQHYHGSCVGLALLPG-----------V 402

Query: 333 AAG-ESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEIT 381
            AG +  +C V  C VC+Q ED++    ++ +C RC KAYH  CL   +T
Sbjct: 403 RAGWQCVSCRV--CQVCRQPEDVS----KVMLCERCDKAYHPGCLRPIVT 446


>gi|334311241|ref|XP_003339591.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
            H3 lysine-36 and H4 lysine-20 specific-like [Monodelphis
            domestica]
          Length = 2705

 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/157 (22%), Positives = 56/157 (35%), Gaps = 36/157 (22%)

Query: 218  VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            VC  C+  G++  C+ +C  +FH           + +G T+        F+C+ C    H
Sbjct: 1549 VCQNCEKVGELLLCEAQCCGAFHL----------ECIGLTEMPKG---KFICKECRTGIH 1595

Query: 278  QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
             CF C       K   ++V  C+   CG+F                              
Sbjct: 1596 TCFVC-------KPDGEDVKRCLLPLCGKFT--------------XXXXXXXXXXXXXXG 1634

Query: 338  FTCPVHKCFVCQQSE--DMNVEDLQLAICRRCPKAYH 372
            F C +H C  C  +    ++    +L  C RCP AYH
Sbjct: 1635 FRCSLHMCITCHAANPASLSASKGRLMRCVRCPVAYH 1671


>gi|351704076|gb|EHB06995.1| Putative histone-lysine N-methyltransferase NSD2 [Heterocephalus
           glaber]
          Length = 1372

 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 38/98 (38%), Gaps = 21/98 (21%)

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           H CF C       K    EV  CV   CG+FYH  CV +      ES             
Sbjct: 728 HACFVC-------KERKAEVKRCVVTQCGKFYHDACVRRYPLTVFES------------R 768

Query: 337 SFTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
            F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 769 GFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 806


>gi|156390320|ref|XP_001635219.1| predicted protein [Nematostella vectensis]
 gi|156222310|gb|EDO43156.1| predicted protein [Nematostella vectensis]
          Length = 690

 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 42/110 (38%), Gaps = 21/110 (19%)

Query: 267 FLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAE 326
           F C  C    H CF C   G        +V  C    C +FYH EC+            +
Sbjct: 25  FKCDECTSGVHSCFVCRQTG--------DVKACSQPLCSKFYHKECL------------Q 64

Query: 327 ELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
             +     G+   CP+H C  C  S    V   +L  C RCP AYH  CL
Sbjct: 65  SYKCSKIDGDRIYCPLHFCSTCI-SNKTPVNRGRLTKCIRCPTAYHAGCL 113


>gi|328861152|gb|EGG10256.1| hypothetical protein MELLADRAFT_94587 [Melampsora larici-populina
           98AG31]
          Length = 984

 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 38/159 (23%), Positives = 70/159 (44%), Gaps = 24/159 (15%)

Query: 204 EDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDA 263
           +D++R  +  ++   C +C DGG++  C+  C R  HA          + LG++  +++ 
Sbjct: 828 QDTKRTKRKFDHEEACIVCHDGGELYLCN-HCPRVGHA----------KCLGWSNKKLNQ 876

Query: 264 VPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKL-LHPDNE 322
              F+C      +H C +C   G S   S   ++ C   TC   Y  EC+ +   HP  E
Sbjct: 877 STTFIC-----TQHSCTSC---GRSTNDSGGMLYRC--QTCSDAYCEECLPEEDFHPIGE 926

Query: 323 SLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQL 361
           ++ E +   +  G   +    +C  C +    N E L++
Sbjct: 927 AIPEFI--LLGYGPISSAYYIRCVECIKHFKANPETLKM 963


>gi|301115892|ref|XP_002905675.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110464|gb|EEY68516.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 470

 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 52/209 (24%), Positives = 73/209 (34%), Gaps = 51/209 (24%)

Query: 179 PETNTFHE---------DVRTTEKPKVIVDWNEDEDSERVDKDENYFAVCAICDDGG-DV 228
           P   +FHE         DVR    P V +                + + C +C +GG  +
Sbjct: 64  PHEISFHEHECARRQRNDVRVEAAPSVTL----------------FPSHCFLCGNGGRGL 107

Query: 229 TFCDGRCLRSFHATITAGKNALCQSLGYTQAQI-DAVPNFLCQNCVYQEHQCFACGMLGS 287
             C G C RS H        A       + A+   A  N+ C  C+   H+C  CG +G 
Sbjct: 108 LHCGGSCARSAHQNCVNQLQAPTVGEPVSLAERKQAAENWKCAQCLRGLHRCQRCGFMGH 167

Query: 288 SDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFV 347
                 +    C    CG  +H +C+     PD+E+          A   F CP H C  
Sbjct: 168 EANGMRK----CSVLDCGYHFHEQCL-----PDDET----------AHAGFVCPRHTCAT 208

Query: 348 C-QQSEDMNVEDLQLAICRRCPKAYHRKC 375
           C  Q  DM     +   C  C    H +C
Sbjct: 209 CGTQETDMR----RCKSCTVCDHMTHLRC 233


>gi|345493934|ref|XP_001600694.2| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 1
           [Nasonia vitripennis]
          Length = 1382

 Score = 49.3 bits (116), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 52/221 (23%), Positives = 74/221 (33%), Gaps = 51/221 (23%)

Query: 194 PKVIVDWNEDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFH-ATITAGK----- 247
           P +++D  +     ++ K      VC IC+  G +  C G C   FH A +  G+     
Sbjct: 552 PSLMIDTGKRHRPYKLFKGMKNERVCQICEKTGKLIRCRGPCHSYFHLACVKPGESSPEP 611

Query: 248 ---------NALCQSLGYTQAQIDAVP----------NFLCQNCVYQ-EHQCFACGMLGS 287
                    NA  + L   +A+   +           NF C +C+      CF C     
Sbjct: 612 SEAGDIEENNAYKEDLKEIKAKSKEIQEKEEAAFDYDNFKCIDCLSGVAPPCFVC----- 666

Query: 288 SDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFV 347
                  E   C    CG+ YHP+C+        +            G   TCP H C  
Sbjct: 667 --HERDGERTKCSILACGKHYHPDCLKSWPQCQWQ------------GGRLTCPHHICHT 712

Query: 348 CQQSEDMNVEDL----QLAICRRCPKAYHR--KCLPTEITF 382
           C      N        + A C +CP  YH    CLP   T 
Sbjct: 713 CASDNPQNSHPRSAGEKFAKCVKCPSTYHASISCLPAGSTI 753


>gi|40789042|dbj|BAA83042.2| KIAA1090 protein [Homo sapiens]
          Length = 715

 Score = 49.3 bits (116), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 39/98 (39%), Gaps = 21/98 (21%)

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           H CF C       K S  +V  CV   CG+FYH  CV K      ES             
Sbjct: 65  HSCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------R 105

Query: 337 SFTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
            F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 106 GFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 143


>gi|340718068|ref|XP_003397494.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase,
           H3 lysine-36 and H4 lysine-20 specific-like [Bombus
           terrestris]
          Length = 1238

 Score = 49.3 bits (116), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 48/194 (24%), Positives = 67/194 (34%), Gaps = 52/194 (26%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFH-ATITAGKNALCQSLGYT--------------QAQID 262
           VC IC+  G +T C G C   FH + +  G+++   S+                 Q+ ID
Sbjct: 412 VCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSVDENIMDDKILDDINEIKQSNID 471

Query: 263 AVPN-----------FLCQNCVYQ-EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHP 310
              N           F C +C+      CF C            +   C    CG+ YH 
Sbjct: 472 NDENNGKSEEQEDEFFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHYHS 524

Query: 311 ECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQ----QSEDMNVEDLQLAICRR 366
            C+            +   +    G   TCP H C  C     Q       + +LA C R
Sbjct: 525 SCL------------KSWPQSHWQGGRLTCPYHVCHTCSSDNPQDSHSRAPNEKLARCVR 572

Query: 367 CPKAYH--RKCLPT 378
           CP +YH    CLP 
Sbjct: 573 CPSSYHTSTSCLPA 586


>gi|414883708|tpg|DAA59722.1| TPA: hypothetical protein ZEAMMB73_219102, partial [Zea mays]
          Length = 999

 Score = 49.3 bits (116), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 53/190 (27%), Positives = 83/190 (43%), Gaps = 30/190 (15%)

Query: 179 PETNTFHEDVRTTEKPKVIVDWNEDEDSE-----RVDK-DENYFAVCAICDDGGDVTFCD 232
           P  N F E  + +    ++  W +   SE     ++DK D+ +   CAIC DGGD+  CD
Sbjct: 711 PYANIFLEGGKVSLLQCLLDAWEKHTQSENKGFYKIDKGDDEHDDTCAICGDGGDLVCCD 770

Query: 233 GRCLRSFH-----ATITAG----KNALCQSLGYTQAQIDAVPNFL--CQNCVYQEHQCFA 281
             C  +FH       + +G    ++ LC+  G+ Q +  + P  L  C  C  + HQ  +
Sbjct: 771 -HCASTFHLDCLGIKLPSGDWYCRSCLCRFCGFPQEKPSSSPELLLSCLQCSRKYHQTCS 829

Query: 282 CGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCP 341
            G       + S    P  S  C  F  P C  K+    N+ L   ++  + AG S++  
Sbjct: 830 SGT-----GTDSGCTMPGTSIDC--FCSPGC-RKIYKRLNKLLG--IKNHMEAGFSWSL- 878

Query: 342 VHKCFVCQQS 351
           VH CF   Q+
Sbjct: 879 VH-CFANDQA 887


>gi|156230137|gb|AAI52413.1| WHSC1 protein [Homo sapiens]
          Length = 713

 Score = 49.3 bits (116), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 39/98 (39%), Gaps = 21/98 (21%)

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           H CF C       K S  +V  CV   CG+FYH  CV K      ES             
Sbjct: 63  HSCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------R 103

Query: 337 SFTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
            F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 104 GFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 141


>gi|297282129|ref|XP_002802212.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Macaca mulatta]
          Length = 713

 Score = 48.9 bits (115), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 39/98 (39%), Gaps = 21/98 (21%)

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           H CF C       K S  +V  CV   CG+FYH  CV K      ES             
Sbjct: 63  HSCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------R 103

Query: 337 SFTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
            F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 104 GFRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 141


>gi|383864320|ref|XP_003707627.1| PREDICTED: histone-lysine N-methyltransferase, H3 lysine-36 and H4
           lysine-20 specific-like [Megachile rotundata]
          Length = 1302

 Score = 48.9 bits (115), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 47/194 (24%), Positives = 67/194 (34%), Gaps = 52/194 (26%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFH-ATITAGKNALCQSLGY-------------------- 256
           VC IC+  G +T C G C   FH + +  G+++   S+                      
Sbjct: 476 VCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSIDESTMDDKILNDIKEIKGDNID 535

Query: 257 ---TQAQIDAVPN--FLCQNCVYQ-EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHP 310
              T  +ID   +  F C +C+      CF C            +   C    CG+ YH 
Sbjct: 536 EDETSGKIDEQEDELFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHYHS 588

Query: 311 ECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQ----QSEDMNVEDLQLAICRR 366
            C+            +   +    G   TCP H C  C     Q       + +LA C R
Sbjct: 589 SCL------------KSWPQSHWQGGRLTCPYHVCHTCSSDNPQDSHPRTSNEKLARCVR 636

Query: 367 CPKAYH--RKCLPT 378
           CP +YH    CLP 
Sbjct: 637 CPSSYHTSTSCLPA 650


>gi|380024451|ref|XP_003696009.1| PREDICTED: uncharacterized protein LOC100866111 [Apis florea]
          Length = 5713

 Score = 48.9 bits (115), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 49/163 (30%), Positives = 66/163 (40%), Gaps = 34/163 (20%)

Query: 219 CAICDDGGDVTFCD-GRCLRSFHATITAGKNAL--CQSLG-YTQAQIDAVPNFLCQNCVY 274
           CA C   G    C    C R FH    A  +     +SL  +    +  VP  L  +   
Sbjct: 321 CAFCSHYGAGIPCKVASCNRYFHLPCAAASSCFQDTKSLSLFCSQHLGQVPLLLNGDVTC 380

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
              QC  CGM   S+          + + CGQ YH  CV   L P            + A
Sbjct: 381 M--QC--CGMGDVSNL--------VMCSICGQHYHGSCVGLALLPG-----------VRA 417

Query: 335 G-ESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
           G +  +C V  C VC+Q ED++    ++ +C RC KAYH  CL
Sbjct: 418 GWQCASCRV--CQVCRQPEDVS----KVMLCERCEKAYHPSCL 454


>gi|358255504|dbj|GAA57198.1| probable histone-lysine N-methyltransferase NSD2 [Clonorchis
           sinensis]
          Length = 1293

 Score = 48.9 bits (115), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 44/168 (26%), Positives = 62/168 (36%), Gaps = 22/168 (13%)

Query: 218 VCAIC----DDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCV 273
           VC  C    +  G +  C G C R  H +    K              +    F C  C+
Sbjct: 291 VCPACEVYSNVPGQMVKCLGPCGRLVHPSCMRYKTP-------PPPDNNRPDKFKCSECL 343

Query: 274 YQEHQCFACGMLGSSD-KSSSQEVFPCVSATCGQFYHPECVSKLLHPDN---ESLAEELR 329
             ++ C  CG       K +   ++ C    CG+ +H ECV  L  P      SL+E   
Sbjct: 344 SGDYLCTVCGKPADPKAKGNLNCLYLCPVRDCGRHFHRECV--LEWPGTVCRPSLSEHKT 401

Query: 330 ERIAAGESFTCPVHKCFVCQQSEDMN-----VEDLQLAICRRCPKAYH 372
            R +A +   CP H C  C    D +       +  L  C RCP A+H
Sbjct: 402 SRSSAFQLIRCPAHCCQTCWLERDPSAPQPTTSETPLLQCVRCPAAFH 449


>gi|328781326|ref|XP_003249962.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Apis mellifera]
          Length = 1218

 Score = 48.9 bits (115), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 48/194 (24%), Positives = 66/194 (34%), Gaps = 52/194 (26%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFH-ATITAGKNALCQSLGYT--------------QAQID 262
           VC IC+  G +T C G C   FH + +  G+++   S+                 Q  ID
Sbjct: 407 VCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSIDENIMDDKILDDIKEIKQNNID 466

Query: 263 AVPN-----------FLCQNCVYQ-EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHP 310
              N           F C +C+      CF C            +   C    CG+ YH 
Sbjct: 467 DDENNGKSEEQEDEFFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHYHS 519

Query: 311 ECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQ----QSEDMNVEDLQLAICRR 366
            C+            +   +    G   TCP H C  C     Q       + +LA C R
Sbjct: 520 SCL------------KSWPQSHWQGGRLTCPYHICHTCSSDNPQDSHSRAPNEKLARCVR 567

Query: 367 CPKAYH--RKCLPT 378
           CP +YH    CLP 
Sbjct: 568 CPSSYHTSTSCLPA 581


>gi|380012561|ref|XP_003690348.1| PREDICTED: LOW QUALITY PROTEIN: probable histone-lysine
           N-methyltransferase NSD2-like [Apis florea]
          Length = 1208

 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 48/194 (24%), Positives = 66/194 (34%), Gaps = 52/194 (26%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFH-ATITAGKNALCQSLG--------------YTQAQID 262
           VC IC+  G +T C G C   FH + +  G+++   S+                 Q  ID
Sbjct: 406 VCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSIDENIMDDKILDDIKEIKQNNID 465

Query: 263 AVPN-----------FLCQNCVYQ-EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHP 310
              N           F C +C+      CF C            +   C    CG+ YH 
Sbjct: 466 DDENNGKSEEQEDEFFKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHYHS 518

Query: 311 ECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQ----QSEDMNVEDLQLAICRR 366
            C+            +   +    G   TCP H C  C     Q       + +LA C R
Sbjct: 519 SCL------------KSWPQSHWQGGRLTCPYHICHTCSSDNPQDSHSRAPNEKLARCVR 566

Query: 367 CPKAYH--RKCLPT 378
           CP +YH    CLP 
Sbjct: 567 CPSSYHTSTSCLPA 580


>gi|345493936|ref|XP_003427184.1| PREDICTED: histone-lysine N-methyltransferase NSD3 isoform 2
           [Nasonia vitripennis]
          Length = 1317

 Score = 48.9 bits (115), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 52/221 (23%), Positives = 74/221 (33%), Gaps = 51/221 (23%)

Query: 194 PKVIVDWNEDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFH-ATITAGK----- 247
           P +++D  +     ++ K      VC IC+  G +  C G C   FH A +  G+     
Sbjct: 487 PSLMIDTGKRHRPYKLFKGMKNERVCQICEKTGKLIRCRGPCHSYFHLACVKPGESSPEP 546

Query: 248 ---------NALCQSLGYTQAQIDAVP----------NFLCQNCVYQ-EHQCFACGMLGS 287
                    NA  + L   +A+   +           NF C +C+      CF C     
Sbjct: 547 SEAGDIEENNAYKEDLKEIKAKSKEIQEKEEAAFDYDNFKCIDCLSGVAPPCFVC----- 601

Query: 288 SDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFV 347
                  E   C    CG+ YHP+C+        +            G   TCP H C  
Sbjct: 602 --HERDGERTKCSILACGKHYHPDCLKSWPQCQWQ------------GGRLTCPHHICHT 647

Query: 348 CQQSEDMNVEDL----QLAICRRCPKAYHR--KCLPTEITF 382
           C      N        + A C +CP  YH    CLP   T 
Sbjct: 648 CASDNPQNSHPRSAGEKFAKCVKCPSTYHASISCLPAGSTI 688


>gi|374463364|gb|AEZ53127.1| Wolf-Hirschhorn syndrome candidate 1-like 1, partial [Onychostoma
           macrolepis]
          Length = 116

 Score = 48.5 bits (114), Expect = 0.015,   Method: Composition-based stats.
 Identities = 37/139 (26%), Positives = 51/139 (36%), Gaps = 33/139 (23%)

Query: 218 VCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQE 276
           VC +C+  GD +  C+G C R FH     G N+     G    Q       LCQ C    
Sbjct: 6   VCHVCETFGDSLVSCEGDCNRLFHPE-RIGSNS-----GTDGEQ-------LCQECKTGS 52

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           H CF+C       K    ++  C    CG++YH  C  K      ++             
Sbjct: 53  HPCFSC-------KVEEGDMKRCSVNACGRYYHETCARKYTGTTTDT------------R 93

Query: 337 SFTCPVHKCFVCQQSEDMN 355
              CP H C  C    D++
Sbjct: 94  GLRCPQHSCATCCLDRDLH 112


>gi|300121090|emb|CBK21472.2| unnamed protein product [Blastocystis hominis]
          Length = 1183

 Score = 48.5 bits (114), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 37/87 (42%), Gaps = 12/87 (13%)

Query: 296  VFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMN 355
            VF C+  +CG+FYH  C+  + +   E  ++           F CP H C  C ++ +  
Sbjct: 984  VFKCLVKSCGRFYHVNCIQNMEYGSKEHFSDLTH--------FRCPAHFCCECHETGNTK 1035

Query: 356  VEDLQLAICRRCPKAYHRKCLPTEITF 382
                 L  C  C KA H  CL  E  F
Sbjct: 1036 ----HLVQCVMCYKAIHVSCLTLENGF 1058


>gi|350405219|ref|XP_003487363.1| PREDICTED: hypothetical protein LOC100745609 [Bombus impatiens]
          Length = 5619

 Score = 48.5 bits (114), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 49/168 (29%), Positives = 66/168 (39%), Gaps = 44/168 (26%)

Query: 219 CAICDDGGDVTFCD-GRCLRSFHATITAGKNAL--CQSLG-YTQAQIDAVPNFL-----C 269
           CA C   G    C    C R FH    A  +     +SL  +    +  VP  L     C
Sbjct: 312 CACCSHYGAGIPCKVASCNRYFHLPCAAASSCFQDTKSLSLFCSQHLGQVPLLLNGDITC 371

Query: 270 QNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELR 329
            +C         CGM   S+          + + CGQ YH  CV   L P          
Sbjct: 372 MHC---------CGMGDVSNL--------VMCSICGQHYHGSCVGLALLPG--------- 405

Query: 330 ERIAAG-ESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
             + AG +  +C V  C VC+Q ED++    ++ +C RC KAYH  CL
Sbjct: 406 --VRAGWQCVSCRV--CQVCRQPEDVS----KVMLCERCEKAYHPSCL 445


>gi|340726153|ref|XP_003401426.1| PREDICTED: hypothetical protein LOC100646364 [Bombus terrestris]
          Length = 5622

 Score = 48.5 bits (114), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 49/168 (29%), Positives = 66/168 (39%), Gaps = 44/168 (26%)

Query: 219 CAICDDGGDVTFCD-GRCLRSFHATITAGKNAL--CQSLG-YTQAQIDAVPNFL-----C 269
           CA C   G    C    C R FH    A  +     +SL  +    +  VP  L     C
Sbjct: 312 CACCSHYGAGIPCKVASCNRYFHLPCAAASSCFQDTKSLSLFCSQHLGQVPLLLNGDITC 371

Query: 270 QNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELR 329
            +C         CGM   S+          + + CGQ YH  CV   L P          
Sbjct: 372 MHC---------CGMGDVSNL--------VMCSICGQHYHGSCVGLALLPG--------- 405

Query: 330 ERIAAG-ESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
             + AG +  +C V  C VC+Q ED++    ++ +C RC KAYH  CL
Sbjct: 406 --VRAGWQCVSCRV--CQVCRQPEDVS----KVMLCERCEKAYHPSCL 445


>gi|256084142|ref|XP_002578291.1| SET domain protein [Schistosoma mansoni]
          Length = 1746

 Score = 48.5 bits (114), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 52/144 (36%), Gaps = 32/144 (22%)

Query: 292 SSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVC--Q 349
           S  ++  C    C ++YHP C+ K         A  +RE      SFTCP H C  C  +
Sbjct: 642 SGDQMIRCSVRACRRWYHPSCLRK------PPFAVVVRE--GRSGSFTCPAHTCLACSAE 693

Query: 350 QSEDMNVEDLQLAICRRCPKAYHRK--CLPTEITFSDADENNFQRAWVDLLPNNRILIYC 407
               M         C  CP AYH    CLP                  ++ PN   LI C
Sbjct: 694 TPGTMPRPSPHFIRCVMCPAAYHPGDWCLPA--------------GSKEIAPN---LIIC 736

Query: 408 LEHKIISELK---TPARDHLKFPG 428
             H +  E K   +P    LK P 
Sbjct: 737 PRHALQDECKLYTSPPNIQLKLPS 760


>gi|350420879|ref|XP_003492658.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           isoform 1 [Bombus impatiens]
          Length = 1230

 Score = 48.1 bits (113), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 47/196 (23%), Positives = 67/196 (34%), Gaps = 56/196 (28%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFH-ATITAGKNALCQSL---------------------- 254
           VC IC+  G +T C G C   FH + +  G+++   S+                      
Sbjct: 404 VCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSVDENIMDDKILNDINEIKQSNTD 463

Query: 255 -----GYTQAQIDAVPNFLCQNCVYQ-EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFY 308
                G ++ Q D    F C +C+      CF C            +   C    CG+ Y
Sbjct: 464 NDENNGKSEEQEDEF--FKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHY 514

Query: 309 HPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQ----QSEDMNVEDLQLAIC 364
           H  C+            +   +    G   TCP H C  C     Q       + +LA C
Sbjct: 515 HSSCL------------KSWPQSHWQGGRLTCPYHVCHTCSSDNPQDSHSRAPNEKLARC 562

Query: 365 RRCPKAYH--RKCLPT 378
            RCP +YH    CLP 
Sbjct: 563 VRCPSSYHTSTSCLPA 578


>gi|350420881|ref|XP_003492659.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           isoform 2 [Bombus impatiens]
          Length = 1239

 Score = 48.1 bits (113), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 47/196 (23%), Positives = 67/196 (34%), Gaps = 56/196 (28%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFH-ATITAGKNALCQSL---------------------- 254
           VC IC+  G +T C G C   FH + +  G+++   S+                      
Sbjct: 413 VCQICEKTGKLTRCKGPCYSYFHLSCVKPGESSPEHSVDENIMDDKILNDINEIKQSNTD 472

Query: 255 -----GYTQAQIDAVPNFLCQNCVYQ-EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFY 308
                G ++ Q D    F C +C+      CF C            +   C    CG+ Y
Sbjct: 473 NDENNGKSEEQEDEF--FKCIDCLSGVAPACFICN-------EREGDRIRCSVLACGKHY 523

Query: 309 HPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQ----QSEDMNVEDLQLAIC 364
           H  C+            +   +    G   TCP H C  C     Q       + +LA C
Sbjct: 524 HSSCL------------KSWPQSHWQGGRLTCPYHVCHTCSSDNPQDSHSRAPNEKLARC 571

Query: 365 RRCPKAYH--RKCLPT 378
            RCP +YH    CLP 
Sbjct: 572 VRCPSSYHTSTSCLPA 587


>gi|353232109|emb|CCD79464.1| putative set domain protein [Schistosoma mansoni]
          Length = 1503

 Score = 48.1 bits (113), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 52/144 (36%), Gaps = 32/144 (22%)

Query: 292 SSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVC--Q 349
           S  ++  C    C ++YHP C+ K         A  +RE      SFTCP H C  C  +
Sbjct: 642 SGDQMIRCSVRACRRWYHPSCLRK------PPFAVVVRE--GRSGSFTCPAHTCLACSAE 693

Query: 350 QSEDMNVEDLQLAICRRCPKAYHRK--CLPTEITFSDADENNFQRAWVDLLPNNRILIYC 407
               M         C  CP AYH    CLP                  ++ PN   LI C
Sbjct: 694 TPGTMPRPSPHFIRCVMCPAAYHPGDWCLPA--------------GSKEIAPN---LIIC 736

Query: 408 LEHKIISELK---TPARDHLKFPG 428
             H +  E K   +P    LK P 
Sbjct: 737 PRHALQDECKLYTSPPNIQLKLPS 760


>gi|340372263|ref|XP_003384664.1| PREDICTED: probable histone-lysine N-methyltransferase NSD2-like
           [Amphimedon queenslandica]
          Length = 1171

 Score = 48.1 bits (113), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 45/161 (27%), Positives = 64/161 (39%), Gaps = 29/161 (18%)

Query: 218 VCAICDDGGDVT----FCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCV 273
           +C+ICD+  + T     C G CL SFH             LG       + P F+C  C 
Sbjct: 378 ICSICDNPSNPTQCIITCKGHCLNSFHV----------DCLGLISPP--SFP-FVCDECT 424

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
                C+AC    S D  SS  + PC   +C Q YH  C+    +   +S    +     
Sbjct: 425 LSPSVCYACKK-PSVDPVSS--LVPCSHHSCSQLYHLSCIRSCGNFKFDSSNPNV----- 476

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQ-LAICRRCPKAYHR 373
                +C +H C  C  S+   V   Q L  C +CP + H+
Sbjct: 477 ---ILSCGLHSCAKCVCSDGPPVTGNQKLMQCLKCPLSLHK 514


>gi|124513208|ref|XP_001349960.1| SET domain protein, putative [Plasmodium falciparum 3D7]
 gi|23615377|emb|CAD52368.1| SET domain protein, putative [Plasmodium falciparum 3D7]
          Length = 2548

 Score = 47.8 bits (112), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 6/69 (8%)

Query: 314  SKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQ------LAICRRC 367
            +KL +   E+L  E  E+I   + F CP+H C+VC++ +  N E  +      L  C +C
Sbjct: 1099 NKLFNGKEETLNNENDEKIIVLKKFICPLHICYVCKEFDINNTESSKKELKNNLFRCIKC 1158

Query: 368  PKAYHRKCL 376
             K+ HRKC+
Sbjct: 1159 YKSVHRKCM 1167


>gi|432926624|ref|XP_004080920.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Oryzias
           latipes]
          Length = 4455

 Score = 47.4 bits (111), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 54/172 (31%), Positives = 69/172 (40%), Gaps = 35/172 (20%)

Query: 209 VDK--DENYFAVCAICDDGGDVTFC-DGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP 265
           VDK  D     VCA C   G    C +  C RS+H    A   A CQ   + Q +I    
Sbjct: 399 VDKAIDSGSIQVCAFCRHLGASLRCQETGCTRSYHMPCAAAAGA-CQD--WNQRRI---- 451

Query: 266 NFLC-QNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESL 324
             LC Q+      QC  C   GS D   S  +  C  + CG  YH  C+   + P   S 
Sbjct: 452 --LCTQHARTGSSQCRLCA--GSGD---SGGLLMC--SCCGSCYHGSCLDPPVTPSPLS- 501

Query: 325 AEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                 R+     + CP  +C VC+        D  + +C RC KAYH  CL
Sbjct: 502 ------RVG----WQCP--QCRVCRSCSLQG--DSGVLLCARCDKAYHAHCL 539


>gi|326499283|dbj|BAK06132.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1350

 Score = 46.6 bits (109), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 47/175 (26%), Positives = 73/175 (41%), Gaps = 48/175 (27%)

Query: 196  VIVDWNEDEDSER-------VDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKN 248
            +I+ W++  DSER        + D+     C IC DGG++  CDG C  +FH +      
Sbjct: 1002 LIIAWDKQSDSERQAFFPVSTEGDDPNDDTCGICGDGGNLICCDG-CPSTFHMSCL---- 1056

Query: 249  ALCQSLGYTQAQIDAVP--NFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSAT-CG 305
                       +++ +P  ++ C NC      C  C    + D   + E+ P  S + C 
Sbjct: 1057 -----------ELEELPSDDWRCTNC-----SCKLCHEHLNHDAPDNAEIDPLHSCSQCE 1100

Query: 306  QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQ 360
            + YHP C      P+ E L+    +   AG  F         CQQS  +  E+LQ
Sbjct: 1101 KKYHPSC-----SPETEKLSSVSSQ---AGNHF---------CQQSCRLLFEELQ 1138


>gi|359477348|ref|XP_002278432.2| PREDICTED: uncharacterized protein LOC100247619 [Vitis vinifera]
          Length = 1547

 Score = 46.2 bits (108), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 53/124 (42%), Gaps = 31/124 (25%)

Query: 200 WNEDEDSERVD-------KDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQ 252
           WN  E+SER          D+     C IC DGGD+  CDG C  +FH +        C 
Sbjct: 644 WNRQEESERSGFHPIDVDGDDPNDDTCGICGDGGDLICCDG-CPSTFHQS--------CL 694

Query: 253 SLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSS---DKSSSQEVFPCVSATCGQFYH 309
           ++     Q+    ++ C NC      C  CGM   S   D ++  E+  C  + C + YH
Sbjct: 695 NI-----QMLPSGDWHCPNCT-----CKFCGMADGSNAEDDTTVSELVTC--SLCEKKYH 742

Query: 310 PECV 313
             C+
Sbjct: 743 TSCI 746


>gi|393224442|gb|EJD32834.1| hypothetical protein AURDEDRAFT_178069 [Auricularia delicata
           TFB-10046 SS5]
          Length = 242

 Score = 45.4 bits (106), Expect = 0.13,   Method: Composition-based stats.
 Identities = 51/198 (25%), Positives = 72/198 (36%), Gaps = 53/198 (26%)

Query: 192 EKPKVIVDWNEDEDSERVDKDENYFAVCAICDDGGDVTF--CDGRCLRSFHA---TITAG 246
           E  +++ DW  D  S            C +C   GDV    CD  CLR  H+       G
Sbjct: 47  ESSRILYDWVCDVCSS-----------CDVCMQPGDVLLLRCD-MCLRRVHSLCLPTIPG 94

Query: 247 KNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQ 306
            + L   +   +  +     + C+ C      C  C      D+   +EV  C   TC  
Sbjct: 95  TDILAAVVRAEEQGVPLPRIWRCEECC--PRSCSLC------DRVVGKEVVTC--RTCKW 144

Query: 307 FYHPECVSKLLHPDNESLAEELRER-----IAAGESF---TCPVHKCFVCQQSEDMNVED 358
            YH  CV               RE+     +A  + F   TCP   C  C ++     +D
Sbjct: 145 TYHTGCV---------------REKWGMVVLAGDQGFRCKTCPSESCDACGRAAP---DD 186

Query: 359 LQLAICRRCPKAYHRKCL 376
              AIC  C KA+H+ CL
Sbjct: 187 FGTAICGTCGKAWHKDCL 204


>gi|297737048|emb|CBI26249.3| unnamed protein product [Vitis vinifera]
          Length = 1264

 Score = 45.4 bits (106), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 53/124 (42%), Gaps = 31/124 (25%)

Query: 200 WNEDEDSERVD-------KDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQ 252
           WN  E+SER          D+     C IC DGGD+  CDG C  +FH +        C 
Sbjct: 476 WNRQEESERSGFHPIDVDGDDPNDDTCGICGDGGDLICCDG-CPSTFHQS--------CL 526

Query: 253 SLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSS---DKSSSQEVFPCVSATCGQFYH 309
           ++     Q+    ++ C NC      C  CGM   S   D ++  E+  C  + C + YH
Sbjct: 527 NI-----QMLPSGDWHCPNCT-----CKFCGMADGSNAEDDTTVSELVTC--SLCEKKYH 574

Query: 310 PECV 313
             C+
Sbjct: 575 TSCI 578


>gi|310831063|ref|YP_003969706.1| putative DNA N6-adenine methyltransferase [Cafeteria roenbergensis
           virus BV-PW1]
 gi|309386247|gb|ADO67107.1| putative DNA N6-adenine methyltransferase [Cafeteria roenbergensis
           virus BV-PW1]
          Length = 297

 Score = 45.4 bits (106), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 41/170 (24%), Positives = 86/170 (50%), Gaps = 21/170 (12%)

Query: 672 GMRYTSFGRHFTKVEKLKEIVDRL--HWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKS 729
           G+   +  +++TK   +K  +D +  +  +   DTI++   G   F       +E++   
Sbjct: 31  GLNRNTMDKYYTKTNIVKNCIDSIKSNVIINKLDTIIEPSAGNGAF-------IEEIKHL 83

Query: 730 CS-FRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQAL 788
           C+ ++ YD++   N+    K+D++ +      +    I+G NPPFG ++S+A KFI +  
Sbjct: 84  CNNYKFYDILPENNEII--KQDFLKLDVNNFKNQKIHIIG-NPPFGRQSSIAIKFIKKCC 140

Query: 789 KFKPKLIVLIVP----QETRRLDQKASYNLIWEDNEVLSGKSFYLPGSLD 834
            F  + I  I+P    +E+ R      Y+L++E++ +   K+ +L  ++D
Sbjct: 141 LF-AESISFILPKSFKKESMRKYFNKYYHLVFEEDLL---KNCFLVNNID 186


>gi|76157779|gb|AAX28599.2| SJCHGC07936 protein [Schistosoma japonicum]
          Length = 238

 Score = 45.1 bits (105), Expect = 0.16,   Method: Composition-based stats.
 Identities = 39/136 (28%), Positives = 49/136 (36%), Gaps = 32/136 (23%)

Query: 299 CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVC--QQSEDMNV 356
           C    C ++YHP C+ K   P    +  E R       SFTCP H C  C  +    M  
Sbjct: 1   CSVRACRRWYHPSCLRK---PPFAVVVREGRS-----GSFTCPAHTCLACSAETPGTMPR 52

Query: 357 EDLQLAICRRCPKAYH--RKCLPTEITFSDADENNFQRAWVDLLPNNRILIYCLEHKIIS 414
                  C  CP AYH    C+P                  ++ PN   LI C  H +  
Sbjct: 53  PSPHYIRCVMCPAAYHPGEWCVPA--------------GSKEIAPN---LIICPRHALQD 95

Query: 415 ELK---TPARDHLKFP 427
           E K   +P    LK P
Sbjct: 96  ECKLYTSPPNIQLKLP 111


>gi|348683653|gb|EGZ23468.1| hypothetical protein PHYSODRAFT_479200 [Phytophthora sojae]
          Length = 466

 Score = 45.1 bits (105), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 42/159 (26%), Positives = 60/159 (37%), Gaps = 25/159 (15%)

Query: 219 CAICDDGG-DVTFCDGRCLRSFHATITAGKNA--LCQSLGYTQAQIDAVPNFLCQNCVYQ 275
           C +C +GG  +  C G C R+ H        A  + Q L   + +  A   + C  C+  
Sbjct: 104 CFLCGNGGRGLLHCGGSCARAAHQHCVDKLQAPVVGQPLSAAEKK-QAAERWKCAQCLRG 162

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAG 335
            H+C  CG LG       +    C    CG  +H +C+     PD         E + AG
Sbjct: 163 LHRCQRCGFLGHESNGMKK----CSVLDCGYHFHAQCL-----PDAAQ-----DEGVHAG 208

Query: 336 ESFTCPVHKCFVCQQSEDMNVEDLQLAICR-----RCPK 369
             F CP H C  C   E         ++C      RCP+
Sbjct: 209 --FVCPRHTCATCGAQETDMRRCKSCSVCHHMTHFRCPR 245


>gi|256075776|ref|XP_002574192.1| SET domain protein [Schistosoma mansoni]
          Length = 1343

 Score = 45.1 bits (105), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 39/174 (22%), Positives = 63/174 (36%), Gaps = 32/174 (18%)

Query: 218 VCAIC----DDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP-NFLCQNC 272
           VC +C    +  G +  C G C R  H          C          ++ P  F C  C
Sbjct: 287 VCPVCEVYSNAPGQMFQCQGPCGRIVHPH--------CMRYKTPPPADNSRPEKFRCPQC 338

Query: 273 VYQEHQCFACGMLG-SSDKSSSQEVFPCVSATCGQFYHPECVSK----LLHPDNESLAEE 327
           +  E  C  CG    +  K  + +++ C    C + +H +C+S     L  P     ++ 
Sbjct: 339 LIGEFLCSICGKPSETGSKDGAGQLYACQVINCSRHFHRDCLSGWPGILTRP-----SDN 393

Query: 328 LRERIAAGESFTCPVHKCFVCQ-QSEDMNVEDLQLAI--------CRRCPKAYH 372
           +  R    +   CP H C  C  +S + N   ++           C RCP  +H
Sbjct: 394 VATRTTTFQVLRCPAHTCNTCYLESNETNCHPVKKPCSSEGPFLECVRCPAVFH 447


>gi|307207985|gb|EFN85544.1| Probable histone-lysine N-methyltransferase NSD2 [Harpegnathos
           saltator]
          Length = 599

 Score = 45.1 bits (105), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 46/193 (23%), Positives = 66/193 (34%), Gaps = 51/193 (26%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFH-ATITAGKNALCQSLGYTQAQI--------------- 261
           VC IC+  G +T C G C   FH   +  G+++   S+    ++                
Sbjct: 234 VCQICEKTGKLTRCKGPCYSYFHLPCVKPGESSPEHSVDDNTSEDKIIDDVNVIKRSIND 293

Query: 262 --DAVPN-------FLCQNCVYQ-EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPE 311
             D V N       F C +C+      CF C            +   C    CG+ YH  
Sbjct: 294 RDDNVKNEEQEDEMFKCIDCLSGVAPACFICN-------EREGDRIRCSVMACGKHYHSM 346

Query: 312 CVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQ----QSEDMNVEDLQLAICRRC 367
           C+            +   +    G   TCP H C  C     Q       + ++A C RC
Sbjct: 347 CL------------KSWPQSHWQGGRLTCPYHLCHTCSSDNPQDSHSRAPNEKMAKCVRC 394

Query: 368 PKAYHRK--CLPT 378
           P +YH    CLP 
Sbjct: 395 PSSYHTSTLCLPA 407


>gi|353231818|emb|CCD79173.1| putative set domain protein [Schistosoma mansoni]
          Length = 1306

 Score = 45.1 bits (105), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 39/174 (22%), Positives = 63/174 (36%), Gaps = 32/174 (18%)

Query: 218 VCAIC----DDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP-NFLCQNC 272
           VC +C    +  G +  C G C R  H          C          ++ P  F C  C
Sbjct: 287 VCPVCEVYSNAPGQMFQCQGPCGRIVHPH--------CMRYKTPPPADNSRPEKFRCPQC 338

Query: 273 VYQEHQCFACGMLG-SSDKSSSQEVFPCVSATCGQFYHPECVSK----LLHPDNESLAEE 327
           +  E  C  CG    +  K  + +++ C    C + +H +C+S     L  P     ++ 
Sbjct: 339 LIGEFLCSICGKPSETGSKDGAGQLYACQVINCSRHFHRDCLSGWPGILTRP-----SDN 393

Query: 328 LRERIAAGESFTCPVHKCFVCQ-QSEDMNVEDLQLAI--------CRRCPKAYH 372
           +  R    +   CP H C  C  +S + N   ++           C RCP  +H
Sbjct: 394 VATRTTTFQVLRCPAHTCNTCYLESNETNCHPVKKPCSSEGPFLECVRCPAVFH 447


>gi|195028344|ref|XP_001987036.1| GH21693 [Drosophila grimshawi]
 gi|193903036|gb|EDW01903.1| GH21693 [Drosophila grimshawi]
          Length = 1461

 Score = 45.1 bits (105), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 59/162 (36%), Gaps = 31/162 (19%)

Query: 219 CAICDDGGDVTFCDGRC--LRSFHATITAGKNALCQSLG-YTQAQIDAVPNFLCQNCVYQ 275
           C+ CD  G    C   C  +  F     AG   + +S   +    +  VP     N V  
Sbjct: 157 CSFCDQYGASINCKMNCRQMHHFPCAAAAGCFLILESFTVFCTEHLSQVPLICSDNNV-- 214

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAG 335
             +C  C  LG   K     +  C S  CG  +H  C+     PD  S            
Sbjct: 215 --ECLTCSSLGDLSK-----LIMCCS--CGDHFHSTCIGLANLPDTRS------------ 253

Query: 336 ESFTCP-VHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
             ++C    KC +C+Q E     D++   C +C K YH  CL
Sbjct: 254 -GWSCARCTKCQICRQQE---ANDIKFVKCEQCQKIYHANCL 291


>gi|125599281|gb|EAZ38857.1| hypothetical protein OsJ_23274 [Oryza sativa Japonica Group]
          Length = 1441

 Score = 44.7 bits (104), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 41/152 (26%), Positives = 58/152 (38%), Gaps = 26/152 (17%)

Query: 178 HPETNTFHEDVRTTEKPKVIVDWNEDEDSERVDKDENYFAV----------CAICDDGGD 227
            P  N F ED   T    ++  W +   SE+    + ++ V          C IC DGGD
Sbjct: 687 QPYENIFLEDGGATLSQCLVDAWKKQSQSEK----KGFYKVDPGDDPDDDTCGICGDGGD 742

Query: 228 VTFCDGRCLRSFHATITAGK---------NALCQSLGYTQAQIDAVPNFL-CQNCVYQEH 277
           +  CD  C  +FH      K         + +C+  G TQ    +    L C  C  + H
Sbjct: 743 LLCCD-NCPSTFHLACLGIKMPSGDWHCSSCICRFCGSTQEITTSSAELLSCLQCSRKYH 801

Query: 278 QCFACGMLGSSDKSSSQEVFPC-VSATCGQFY 308
           Q  A G +  S K+ S     C  S  C + Y
Sbjct: 802 QVCAPGTMKDSVKAESNSSTDCFCSPGCRKIY 833


>gi|34394455|dbj|BAC83629.1| PHD finger transcription factor-like [Oryza sativa Japonica Group]
          Length = 1442

 Score = 44.7 bits (104), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 41/152 (26%), Positives = 58/152 (38%), Gaps = 26/152 (17%)

Query: 178 HPETNTFHEDVRTTEKPKVIVDWNEDEDSERVDKDENYFAV----------CAICDDGGD 227
            P  N F ED   T    ++  W +   SE+    + ++ V          C IC DGGD
Sbjct: 688 QPYENIFLEDGGATLSQCLVDAWKKQSQSEK----KGFYKVDPGDDPDDDTCGICGDGGD 743

Query: 228 VTFCDGRCLRSFHATITAGK---------NALCQSLGYTQAQIDAVPNFL-CQNCVYQEH 277
           +  CD  C  +FH      K         + +C+  G TQ    +    L C  C  + H
Sbjct: 744 LLCCD-NCPSTFHLACLGIKMPSGDWHCSSCICRFCGSTQEITTSSAELLSCLQCSRKYH 802

Query: 278 QCFACGMLGSSDKSSSQEVFPC-VSATCGQFY 308
           Q  A G +  S K+ S     C  S  C + Y
Sbjct: 803 QVCAPGTMKDSVKAESNSSTDCFCSPGCRKIY 834


>gi|328716964|ref|XP_001950932.2| PREDICTED: hypothetical protein LOC100161826, partial
           [Acyrthosiphon pisum]
          Length = 1530

 Score = 44.7 bits (104), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 28/118 (23%), Positives = 48/118 (40%), Gaps = 17/118 (14%)

Query: 267 FLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAE 326
           ++C  C   +  CF CG+    D     +   C  ++CG+++H  C+            +
Sbjct: 773 YMCSLCKAGKTNCFVCGL----DIDDPGQKIVCKISSCGKYFHESCLKDWPQ------CQ 822

Query: 327 ELRERIAAGESFTCPVHKCFVC-----QQSEDMNVEDLQLAICRRCPKAYHRK--CLP 377
            ++          CP H C +C     + S   +    ++  C +CP AYHR   CLP
Sbjct: 823 WIQGSRNNVRCVVCPHHVCHLCISDNPKSSCTTHFPAEKITRCIKCPTAYHRSEYCLP 880


>gi|218199171|gb|EEC81598.1| hypothetical protein OsI_25074 [Oryza sativa Indica Group]
          Length = 1019

 Score = 44.7 bits (104), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 41/152 (26%), Positives = 61/152 (40%), Gaps = 26/152 (17%)

Query: 178 HPETNTFHEDVRTTEKPKVIVDWNEDEDSERVDKDENYFAV----------CAICDDGGD 227
            P  N F ED   T    ++  W +   SE+    + ++ V          C IC DGGD
Sbjct: 265 QPYENIFLEDGGATLSQCLVDAWKKQSQSEK----KGFYKVDPGDDPDDDTCGICGDGGD 320

Query: 228 VTFCDGRCLRSFHAT-----ITAG----KNALCQSLGYTQAQIDAVPNFL-CQNCVYQEH 277
           +  CD  C  +FH       + +G    ++ +C+  G TQ    +    L C  C  + H
Sbjct: 321 LLCCD-NCPSTFHLACLGIKMPSGDWHCRSCICRFCGSTQEITTSSAELLSCLQCSRKYH 379

Query: 278 QCFACGMLGSSDKSSSQEVFPC-VSATCGQFY 308
           Q  A G +  S K+ S     C  S  C + Y
Sbjct: 380 QVCAPGTMKDSVKAESNSSTDCFCSPGCRKIY 411


>gi|348683753|gb|EGZ23568.1| hypothetical protein PHYSODRAFT_487680 [Phytophthora sojae]
          Length = 868

 Score = 44.3 bits (103), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 36/78 (46%), Gaps = 23/78 (29%)

Query: 218 VCAIC-----DDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP---NFLC 269
           +C IC     DD  D+  CDG CL+S+H +           LG     I++ P    +LC
Sbjct: 351 LCGICGEIEDDDLTDMILCDGGCLKSYHFSC----------LG-----IESAPEGEEWLC 395

Query: 270 QNCVYQEHQCFACGMLGS 287
           + C   E  CF CG  G+
Sbjct: 396 EQCRTNEQLCFGCGRNGT 413


>gi|242043058|ref|XP_002459400.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
 gi|241922777|gb|EER95921.1| hypothetical protein SORBIDRAFT_02g004100 [Sorghum bicolor]
          Length = 1437

 Score = 44.3 bits (103), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 52/117 (44%), Gaps = 20/117 (17%)

Query: 208 RVDKDEN-YFAVCAICDDGGDVTFCDGRCLRSFH-----ATITAG----KNALCQSLGYT 257
           ++DK E+ +   CAIC DGGD+  CD  C  +FH       + +G    ++ LC+  G+ 
Sbjct: 725 KIDKGEDEHDDTCAICGDGGDLVCCD-HCASTFHLDCLGIKLPSGDWYCRSCLCRFCGFP 783

Query: 258 QAQIDAVPNFL--CQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPEC 312
           Q +  + P  L  C  C  + HQ  + G     D +      P  S  C  F  P C
Sbjct: 784 QEKPSSSPELLLSCLQCSRKYHQTCSSGTGTDFDCT-----IPGTSIDC--FCSPGC 833


>gi|315639351|ref|ZP_07894513.1| putative type II DNA modification enzyme (methyltransferase)
           [Campylobacter upsaliensis JV21]
 gi|315480677|gb|EFU71319.1| putative type II DNA modification enzyme (methyltransferase)
           [Campylobacter upsaliensis JV21]
          Length = 365

 Score = 43.9 bits (102), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 74/145 (51%), Gaps = 27/145 (18%)

Query: 669 FLHGMRYT-------SFGRHFTKVEK----LKEIVDRLHWY-VRSGDTI-VDFCCGANDF 715
           FL G +YT       S    FTK E      K+++D L+ + +  G+ + ++  CG   F
Sbjct: 49  FLLGNKYTLEKENYRSHNEFFTKKESAKYCYKKLMDFLNQHSIDVGEYVFIEPSCGDLSF 108

Query: 716 SCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGV 775
             +M  K  ++G   +++N +++           +++   P++  +G  LI+G NPPFG+
Sbjct: 109 YDLMP-KNSKIGIDLTYKNKEILC---------ENFLNFTPKK--EGKYLILG-NPPFGL 155

Query: 776 KASLANKFISQALKFKPKLIVLIVP 800
           + +LA +FI+ A +F    I  I+P
Sbjct: 156 RGNLALRFINHAYRF-ADFIAFILP 179


>gi|255552191|ref|XP_002517140.1| hypothetical protein RCOM_0912170 [Ricinus communis]
 gi|223543775|gb|EEF45303.1| hypothetical protein RCOM_0912170 [Ricinus communis]
          Length = 1604

 Score = 43.5 bits (101), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 36/133 (27%), Positives = 52/133 (39%), Gaps = 36/133 (27%)

Query: 200 WNEDEDSERV-------DKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQ 252
           WN  E  ER+       D D+     C IC DGGD+  CDG C  +FH +          
Sbjct: 738 WNRQESIERIGFHSVNTDGDDPNDDTCGICGDGGDLICCDG-CPSTFHQSCL-------- 788

Query: 253 SLGYTQAQIDAVP--NFLCQNCVYQEHQCFACGMLG----SSDKSSSQEVFPCVSATCGQ 306
                   I  +P  ++ C NC      C  CG+        D ++  E+  C  + C +
Sbjct: 789 -------DIMMLPPGDWHCPNCT-----CKFCGIASEDFVQEDGTNVSELLTC--SLCAK 834

Query: 307 FYHPECVSKLLHP 319
            YH  C+  +  P
Sbjct: 835 KYHKSCLQDVDAP 847


>gi|390596238|gb|EIN05640.1| hypothetical protein PUNSTDRAFT_145612 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 1008

 Score = 43.5 bits (101), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 37/134 (27%), Positives = 56/134 (41%), Gaps = 26/134 (19%)

Query: 219 CAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQ 278
           C  C DGG++  C GRC R FH           +  G T AQ+      +C     ++H 
Sbjct: 863 CHYCRDGGELVCC-GRCPRVFHK----------ECRGLTAAQLKRNFAIIC-----EQHS 906

Query: 279 CFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESF 338
           C  C     S   +   +F C   TC Q +  +C   L   D +++ E L E +  G  F
Sbjct: 907 CVVC---NRSTGDAGGMLFRC--QTCPQAFCEDC---LPEGDIDAVGETLPEFLLLGYGF 958

Query: 339 TCPVH--KCFVCQQ 350
           +   +  +C  C+Q
Sbjct: 959 SPSAYYIRCHDCRQ 972


>gi|301115083|ref|XP_002999311.1| histone-lysine N-methyltransferase, putative [Phytophthora
           infestans T30-4]
 gi|262111405|gb|EEY69457.1| histone-lysine N-methyltransferase, putative [Phytophthora
           infestans T30-4]
          Length = 905

 Score = 43.5 bits (101), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 36/78 (46%), Gaps = 23/78 (29%)

Query: 218 VCAIC-----DDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP---NFLC 269
           +C +C     D+  D+  CDG CL+S+H               ++   ID+ P    +LC
Sbjct: 343 LCGVCGEIEDDELTDMILCDGGCLKSYH---------------FSCLGIDSAPEGKEWLC 387

Query: 270 QNCVYQEHQCFACGMLGS 287
           + C   E  CF+CG  G+
Sbjct: 388 EQCRTNEQLCFSCGRNGT 405


>gi|156374107|ref|XP_001629650.1| predicted protein [Nematostella vectensis]
 gi|156216655|gb|EDO37587.1| predicted protein [Nematostella vectensis]
          Length = 265

 Score = 43.5 bits (101), Expect = 0.52,   Method: Composition-based stats.
 Identities = 45/164 (27%), Positives = 63/164 (38%), Gaps = 24/164 (14%)

Query: 219 CAICDDGGDVTFCD-GRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           CA C   G    C   RC R++H    A   A  +    T    D + +         E 
Sbjct: 116 CAHCSRFGASVVCQVPRCGRTYHYPCAASAGAFQEIHSMTMLCPDHLED--AGRLGGAEA 173

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
           QC+ CG     +     E+  C S  CG+ YH  C    L P     A E+   +  G  
Sbjct: 174 QCYLCG-----EAKEIAEMLFCTS--CGRHYHGRC----LDP-----AVEITSLVRMG-- 215

Query: 338 FTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEIT 381
           + CP   C VCQ       +D ++ +C  C + YH  CL   +T
Sbjct: 216 WQCP--DCKVCQGCRQPG-DDNKMLVCDVCDRGYHTFCLDPPMT 256


>gi|321460437|gb|EFX71479.1| Mes-4-like protein [Daphnia pulex]
          Length = 706

 Score = 43.5 bits (101), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 49/114 (42%), Gaps = 19/114 (16%)

Query: 267 FLCQNC-VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLA 325
           FLC NC       C  C       + S  E+  C    CG+ YH  C+ KL H  + S+ 
Sbjct: 28  FLCLNCNPSTSSNCCLC-------RKSDGELLICNLKLCGRRYHRHCL-KLFH--SPSIK 77

Query: 326 EELRERIAAGESFTCPVHKCFVCQQS-EDMNVEDLQLAICRRCPKAYH--RKCL 376
           +E          FTCP H C  C     ++N  + +L  C  CP AYH   KCL
Sbjct: 78  QE-----KPASQFTCPAHYCHTCVADLNELNQVEKKLLRCIHCPTAYHPSEKCL 126


>gi|413923002|gb|AFW62934.1| putative SET-domain containing protein family, partial [Zea mays]
          Length = 248

 Score = 42.7 bits (99), Expect = 0.80,   Method: Composition-based stats.
 Identities = 27/95 (28%), Positives = 37/95 (38%), Gaps = 20/95 (21%)

Query: 295 EVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDM 354
           E   C    C Q +H +CV++    D  +   E         SF CP H C VC+Q    
Sbjct: 159 EEIKCSVKKCPQMFHLKCVAE----DTSNFTVE---------SFRCPQHGCMVCKQK--- 202

Query: 355 NVEDLQLAICRRCPKAYHRKCLPTEITFSDADENN 389
               +    C RC  A H KC P  +     D+ +
Sbjct: 203 ----MFFWRCGRCTVAAHTKCAPWPVVHLKNDQGS 233


>gi|195489371|ref|XP_002092710.1| GE14338 [Drosophila yakuba]
 gi|194178811|gb|EDW92422.1| GE14338 [Drosophila yakuba]
          Length = 1481

 Score = 42.7 bits (99), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 60/183 (32%), Gaps = 38/183 (20%)

Query: 206 SERVDKDENYFAV---------CAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGY 256
           S+  D D  YFA          C  C   G    C   C +  H    A    L     +
Sbjct: 125 SQISDTDAAYFATHFAEFLEQKCNFCGRYGASINCKMNCRQVHHWPCAAAAGCLLILESF 184

Query: 257 T---QAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECV 313
           T      +  VP     N V    +C +C  LG   K     +  C  +TCG  +H  C+
Sbjct: 185 TVFCTEHLSQVPVICSDNNV----ECLSCSSLGDLSK-----LIMC--STCGDHFHSTCI 233

Query: 314 SKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHR 373
                PD  S              + C   +C  CQ     +  D +   C +C K YH 
Sbjct: 234 GLANLPDTRS-------------GWNCA--RCTKCQICRQQDSNDTKYVKCEQCQKIYHA 278

Query: 374 KCL 376
            CL
Sbjct: 279 SCL 281


>gi|57506149|ref|ZP_00372070.1| putative type II DNA modification enzyme (methyltransferase)
           [Campylobacter upsaliensis RM3195]
 gi|57015544|gb|EAL52337.1| putative type II DNA modification enzyme (methyltransferase)
           [Campylobacter upsaliensis RM3195]
          Length = 377

 Score = 42.7 bits (99), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 72/145 (49%), Gaps = 27/145 (18%)

Query: 669 FLHGMRYT-------SFGRHFTKVEK----LKEIVDRLHWY-VRSGDTI-VDFCCGANDF 715
           FL G +YT       S    FTK E      K+++D L+ + +  G  + ++  CG   F
Sbjct: 49  FLLGNKYTLEKENYRSHNEFFTKKESAKYCYKKLIDFLNQHNIDVGKYVFIEPSCGDLSF 108

Query: 716 SCMMKVKLEQMGKSCSFRNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGV 775
             +M  K  ++G   +++N +++           +++   P++  +G  LI+G NPPFG+
Sbjct: 109 YDLMP-KNSRIGIDLTYKNKEILC---------ENFLNFTPKK--EGKYLILG-NPPFGL 155

Query: 776 KASLANKFISQALKFKPKLIVLIVP 800
           + +LA +FI+ A  F    I  I+P
Sbjct: 156 RGNLALRFINHAYHF-ADFIAFILP 179


>gi|24762433|ref|NP_611847.2| lost PHDs of trr [Drosophila melanogaster]
 gi|21626677|gb|AAF47094.2| lost PHDs of trr [Drosophila melanogaster]
 gi|85861118|gb|ABC86508.1| HL01030p [Drosophila melanogaster]
          Length = 1482

 Score = 42.7 bits (99), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 60/183 (32%), Gaps = 38/183 (20%)

Query: 206 SERVDKDENYFAV---------CAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGY 256
           S+  D D  YFA          C  C   G    C   C +  H    A    L     +
Sbjct: 125 SQISDADGAYFATHFAEFLEQKCNFCGRYGASINCKMNCRQVHHWPCAAAAGCLLILESF 184

Query: 257 T---QAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECV 313
           T      +  VP     N V    +C +C  LG   K     +  C  +TCG  +H  C+
Sbjct: 185 TVFCTEHLSQVPVICSDNNV----ECLSCSSLGDLSK-----LIMC--STCGDHFHSTCI 233

Query: 314 SKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHR 373
                PD  S              + C   +C  CQ     +  D +   C +C K YH 
Sbjct: 234 GLANLPDTRS-------------GWNCA--RCTKCQICRQQDSNDTKYVKCEQCQKTYHA 278

Query: 374 KCL 376
            CL
Sbjct: 279 SCL 281


>gi|327277055|ref|XP_003223281.1| PREDICTED: hypothetical protein LOC100554175 [Anolis carolinensis]
          Length = 5261

 Score = 42.7 bits (99), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 61/253 (24%), Positives = 91/253 (35%), Gaps = 42/253 (16%)

Query: 219 CAICDDGGDVTFCDGR-CLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           C  C+  G    C    C R +H    A      QS+   +      P  L Q    ++ 
Sbjct: 177 CEHCNRWGATIPCHAEGCQRHYHFPCAAASGCF-QSMKSLKL---LCPEHLDQAVQMEDS 232

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
           +C  C   G       +++  C S  CG  YH  C+   + P   S              
Sbjct: 233 RCMVCDAPGEL-----RDLLFCTS--CGLHYHGTCLEITVTPRKRS-------------G 272

Query: 338 FTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRAWVDL 397
           + C  H+C VCQ    ++ ED ++ +C  C K YH  CL   I    AD    +    D 
Sbjct: 273 WQC--HECKVCQTCR-LSGEDSRMLVCEACEKCYHTYCLKPAIESVPADSWKCKTELAD- 328

Query: 398 LPNNRILIYCLEHKIISELKTPARDHLKFPGVEGKRKKEDLELLLTEEKDVASKRNIVSE 457
                   +   H  +    TPA D      VE    +  LE   T E++ A+  ++  +
Sbjct: 329 --------FGTHHTTLFPNYTPAADV----SVEHMECEIKLEAPATPEREAAADPDL-GK 375

Query: 458 SFVADKTVVKKLK 470
             + D   VKK K
Sbjct: 376 GLLEDAEEVKKRK 388


>gi|356495799|ref|XP_003516760.1| PREDICTED: uncharacterized protein LOC100814247 [Glycine max]
          Length = 1314

 Score = 42.7 bits (99), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 54/128 (42%), Gaps = 27/128 (21%)

Query: 197 IVDWNEDEDSERV-------DKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNA 249
           I  WN  E SE++       D D+     C IC DGGD+  CDG C  +FH +       
Sbjct: 699 IEAWNRQEHSEKICFHSVDIDGDDPNDDTCGICGDGGDLICCDG-CPSTFHQS------- 750

Query: 250 LCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGML-GSSDKSSSQEVFPCVSATCGQFY 308
            C  +     Q+  +  + C NC      C  CG+  G+S+K  +      +   C + Y
Sbjct: 751 -CLDI-----QMLPLGEWHCPNCT-----CKFCGIASGNSEKDDASVYVLQICNLCEKKY 799

Query: 309 HPECVSKL 316
           H  C  ++
Sbjct: 800 HDSCTKEM 807


>gi|357115296|ref|XP_003559426.1| PREDICTED: uncharacterized protein LOC100827015 [Brachypodium
            distachyon]
          Length = 1344

 Score = 42.4 bits (98), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 35/134 (26%), Positives = 55/134 (41%), Gaps = 31/134 (23%)

Query: 196  VIVDWNEDEDSER-------VDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKN 248
            +I  W++  DSER        + D+     C IC DGG++  CDG C  +FH +      
Sbjct: 948  LISAWDKQSDSERQSFFPVSTEGDDPNDDTCGICGDGGNLICCDG-CPSTFHMSCL---- 1002

Query: 249  ALCQSLGYTQAQIDAVP--NFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP-CVSATCG 305
                       +++ +P  ++ C NC      C  C    + D     EV   C  + C 
Sbjct: 1003 -----------ELEELPSDDWRCANCC-----CKFCQEHSNDDAPDIAEVDSLCTCSQCE 1046

Query: 306  QFYHPECVSKLLHP 319
            + YHP C  +  +P
Sbjct: 1047 ENYHPVCSPETENP 1060


>gi|194885797|ref|XP_001976493.1| GG22900 [Drosophila erecta]
 gi|190659680|gb|EDV56893.1| GG22900 [Drosophila erecta]
          Length = 1481

 Score = 42.4 bits (98), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 45/183 (24%), Positives = 60/183 (32%), Gaps = 38/183 (20%)

Query: 206 SERVDKDENYFAV---------CAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGY 256
           S+  D D  YFA          C  C   G    C   C +  H    A    L     +
Sbjct: 125 SQISDADAAYFATHFAEFLEQKCNFCGRFGASINCKMNCRQVHHWPCAAAAGCLLILESF 184

Query: 257 T---QAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECV 313
           T      +  VP     N V    +C +C  LG   K     +  C  +TCG  +H  C+
Sbjct: 185 TVFCTEHLSQVPVICSDNNV----ECLSCSSLGDLSK-----LIMC--STCGDHFHSTCI 233

Query: 314 SKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHR 373
                PD  S              + C   +C  CQ     +  D +   C +C K YH 
Sbjct: 234 GLANLPDTRS-------------GWNCA--RCTKCQICRQQDSNDTKYVKCEQCQKIYHA 278

Query: 374 KCL 376
            CL
Sbjct: 279 SCL 281


>gi|440895698|gb|ELR47828.1| Histone-lysine N-methyltransferase MLL3, partial [Bos grunniens
           mutus]
          Length = 4905

 Score = 42.4 bits (98), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 43/163 (26%), Positives = 61/163 (37%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG  A  Q L +        ID  P        
Sbjct: 231 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGAF-QDLSHFFLLCPEHIDQAPER-----S 284

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 285 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 327

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 328 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 364


>gi|356540327|ref|XP_003538641.1| PREDICTED: uncharacterized protein LOC100801863 [Glycine max]
          Length = 1301

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 51/125 (40%), Gaps = 27/125 (21%)

Query: 200 WNEDEDSERV-------DKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQ 252
           WN  E +E++       D  +     C IC DGGD+  CDG C  +FH +        C 
Sbjct: 687 WNRQEHAEKIGFHSVDIDGGDPNDDTCGICGDGGDLICCDG-CPSTFHQS--------CL 737

Query: 253 SLGYTQAQIDAVPNFLCQNCVYQEHQCFACGML-GSSDKSSSQEVFPCVSATCGQFYHPE 311
            +     Q+     + C NC      C  CG+  G+S+K  +      +   C + YH  
Sbjct: 738 DI-----QMLPPGEWRCMNCT-----CKFCGIASGTSEKDDASVCVLHICNLCEKKYHDS 787

Query: 312 CVSKL 316
           C  ++
Sbjct: 788 CTKEM 792


>gi|195382495|ref|XP_002049965.1| GJ21880 [Drosophila virilis]
 gi|194144762|gb|EDW61158.1| GJ21880 [Drosophila virilis]
          Length = 1458

 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 41/167 (24%), Positives = 59/167 (35%), Gaps = 31/167 (18%)

Query: 219 CAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYT---QAQIDAVPNFLCQNCVYQ 275
           C+ C   G    C   C +  H   ++    L     +T      +  VP     N V  
Sbjct: 150 CSFCGQYGASITCKMNCRQVHHWPCSSAAGCLLILESFTVFCTEHLSQVPIICTDNNV-- 207

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAG 335
             +C  C  LG   K     +  C S  CG  +H  C+     PD  S            
Sbjct: 208 --ECLTCSSLGDLSK-----LIMCCS--CGDHFHSTCIGLANLPDTRS------------ 246

Query: 336 ESFTCP-VHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEIT 381
             ++C    KC +C+Q E     D++   C +C K YH  CL   I+
Sbjct: 247 -GWSCARCTKCQICRQHE---TNDIKFIKCEQCQKMYHAMCLRPTIS 289


>gi|321261507|ref|XP_003195473.1| histone acetyltransferase [Cryptococcus gattii WM276]
 gi|317461946|gb|ADV23686.1| Histone acetyltransferase, putative [Cryptococcus gattii WM276]
          Length = 947

 Score = 42.0 bits (97), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 13/107 (12%)

Query: 270 QNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELR 329
           Q  V +E  C  CG   + +K   QE      A CG+  HP C++         L  +LR
Sbjct: 14  QGAVIREDFCSFCGGTDTINKQGVQETM-VSCAACGRSGHPTCLNM--------LTPKLR 64

Query: 330 ERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
           +R+   + + C   +C +C+Q E +  +D +L  C  C + +H  CL
Sbjct: 65  KRVMMYD-WHC--IECKMCEQCE-IKGDDSRLMFCDTCDRGWHSYCL 107


>gi|407407107|gb|EKF31071.1| transcription activator, putative [Trypanosoma cruzi marinkellei]
          Length = 843

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 27/115 (23%), Positives = 47/115 (40%), Gaps = 21/115 (18%)

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
           H CF CG +        + ++ C  A C + YH +C+ +            L+  +    
Sbjct: 688 HHCFVCGEI----MRPLEPLYHC--AVCPKAYHADCIGE----------RRLKAGVVGPR 731

Query: 337 SFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQ 391
            +TCP H C  C + +     D  + +C  CP ++   CL +   + + DE   Q
Sbjct: 732 RWTCPRHSCVSCGKVQGF---DGAVFMCTECPSSFCFDCLDSR--YFELDETRTQ 781


>gi|357119285|ref|XP_003561373.1| PREDICTED: uncharacterized protein LOC100845556 [Brachypodium
           distachyon]
          Length = 1589

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 42/101 (41%), Gaps = 27/101 (26%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
            C IC DGGD+  CD RC  +FH             LG      D    + C+NC+    
Sbjct: 851 TCGICGDGGDLLCCD-RCTSTFHVAC----------LGIEMPSGD----WFCRNCI---- 891

Query: 278 QCFACGMLGSSDK--SSSQEVFPCVSATCGQFYHPECVSKL 316
               C   GS+++  SS  E+  C+   C + YH  C   +
Sbjct: 892 ----CKFCGSAEERTSSPAELLSCLQ--CSRKYHQVCAQGI 926


>gi|194754301|ref|XP_001959434.1| GF12873 [Drosophila ananassae]
 gi|190620732|gb|EDV36256.1| GF12873 [Drosophila ananassae]
          Length = 1486

 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 44/180 (24%), Positives = 60/180 (33%), Gaps = 32/180 (17%)

Query: 206 SERVDKDENYFAV---------CAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGY 256
           S+  D D  YFA          C  C   G    C  +C +  H    A    L     +
Sbjct: 125 SQITDADAAYFASHFAEFLEQKCNFCGRYGASINCKMKCRQVHHWPCAAAAGCLLILESF 184

Query: 257 TQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKL 316
           T    + +   +   C     +C  C  LG   K     +  C  +TCG  +H  CV   
Sbjct: 185 TVFCTEHLSQVILI-CSDNNVECLTCSSLGDLSK-----LIMC--STCGDHFHSTCVGLA 236

Query: 317 LHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
             PD  S              + C   +C  CQ     +  DL+   C +C K YH  CL
Sbjct: 237 NLPDTRS-------------GWNC--ARCTKCQICRVQDSNDLKYVKCEQCQKIYHASCL 281


>gi|108711065|gb|ABF98860.1| acetyltransferase, GNAT family protein, expressed [Oryza sativa
           Japonica Group]
          Length = 1169

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 55/139 (39%), Gaps = 32/139 (23%)

Query: 196 VIVDWNEDEDSER-------VDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKN 248
           +I  W++  DSER        + D+     C IC DGG++  CDG C  +FH +      
Sbjct: 776 LINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDG-CPSTFHMSCL---- 830

Query: 249 ALCQSLGYTQAQIDAVP--NFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP--CVSATC 304
                      +++A+P  ++ C  C      C  C      D     EV    C  + C
Sbjct: 831 -----------ELEALPSDDWRCAKC-----SCKFCQEHSRQDAQDIAEVDSSLCTCSQC 874

Query: 305 GQFYHPECVSKLLHPDNES 323
            + YHP C  +  +  N S
Sbjct: 875 EEKYHPGCSPETTNTSNVS 893


>gi|222625793|gb|EEE59925.1| hypothetical protein OsJ_12562 [Oryza sativa Japonica Group]
          Length = 777

 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 55/139 (39%), Gaps = 32/139 (23%)

Query: 196 VIVDWNEDEDSER-------VDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKN 248
           +I  W++  DSER        + D+     C IC DGG++  CDG C  +FH +      
Sbjct: 384 LINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDG-CPSTFHMSCL---- 438

Query: 249 ALCQSLGYTQAQIDAVP--NFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP--CVSATC 304
                      +++A+P  ++ C  C      C  C      D     EV    C  + C
Sbjct: 439 -----------ELEALPSDDWRCAKC-----SCKFCQEHSRQDAQDIAEVDSSLCTCSQC 482

Query: 305 GQFYHPECVSKLLHPDNES 323
            + YHP C  +  +  N S
Sbjct: 483 EEKYHPGCSPETTNTSNVS 501


>gi|224451138|ref|NP_001020229.2| synaptonemal complex central element protein 1 [Rattus norvegicus]
 gi|118573897|sp|Q5XHZ2.2|SYCE1_RAT RecName: Full=Synaptonemal complex central element protein 1
 gi|149061503|gb|EDM11926.1| hypothetical protein LOC502372 [Rattus norvegicus]
          Length = 329

 Score = 41.6 bits (96), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 78/182 (42%), Gaps = 29/182 (15%)

Query: 73  ELSYAQPEIW-VLSKQNNWIMLQSPKKSFKNIV--RTILITVHWLYCVKQNPEASRKYIR 129
           EL  AQ  +W  L K+ +   L   K   K+I+  +   + +  L+C ++  EA RK+  
Sbjct: 75  ELGEAQT-VWDTLQKEVDS--LHEEKVRLKDILNRKEETLRIMQLHCQEKESEAERKHSM 131

Query: 130 EQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPETNTFHEDVR 189
            Q  K   S+         L++   + +A  R   L   + L + M+ H +T  FH+   
Sbjct: 132 LQECKDRISF---------LNNQIDNEKAKLRRLRLDFEEHLETLMSQHKDTMEFHKPEH 182

Query: 190 TTEKPKVIVDWNEDEDSERVDKDENYFA---------VCAICDDGGDVTFCDGRCLRSFH 240
            TE+  V+     D   E++ K+E             +CA+C   G  TF DG  LRS  
Sbjct: 183 LTEEMSVL-----DSSKEQLLKEEKLIKAKLEDVQHRLCALCGPEGSSTFSDGLFLRSHE 237

Query: 241 AT 242
           A 
Sbjct: 238 AA 239


>gi|449444240|ref|XP_004139883.1| PREDICTED: uncharacterized protein LOC101210263 [Cucumis sativus]
          Length = 1314

 Score = 41.6 bits (96), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 51/124 (41%), Gaps = 32/124 (25%)

Query: 200 WNEDEDSE-------RVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQ 252
           WN  E+S+        +D D+     C IC DGGD+  CDG C  +FH +        C 
Sbjct: 681 WNRQEESKLLSFHTVEIDGDDPNDDTCGICGDGGDLICCDG-CPSTFHQS--------CL 731

Query: 253 SLGYTQAQIDAVPNFLCQNCVYQEHQCFACGM----LGSSDKSSSQEVFPCVSATCGQFY 308
            +      I    ++ C NC      C  CG+    +   D +S  E+  C+   C + +
Sbjct: 732 DI-----LIPPPGDWHCPNCT-----CKYCGVASIDICQGDNTSVSEISTCI--LCEKKF 779

Query: 309 HPEC 312
           H  C
Sbjct: 780 HESC 783


>gi|126341226|ref|XP_001372106.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like
           [Monodelphis domestica]
          Length = 4862

 Score = 41.6 bits (96), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 58/162 (35%), Gaps = 32/162 (19%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYT---QAQIDAVPNFLCQNCVY 274
           CA C   G  +  C+ +C + +H    AG         ++      ID  P         
Sbjct: 294 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTFQDFSNFSLLCPEHIDQAPER-----SK 348

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
           +E  C  C   G        + F C   TCGQ YH  C+   + P   +           
Sbjct: 349 EEANCAVCDSPGDL-----VDQFFCT--TCGQHYHGMCLDIAVTPLKRA----------- 390

Query: 335 GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
              + CP   C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 391 --GWQCP--DCKVCQNCKHSG-EDSKMLVCDTCDKGYHTFCL 427


>gi|198437220|ref|XP_002124518.1| PREDICTED: similar to Histone-lysine N-methyltransferase HRX (Zinc
            finger protein HRX) (ALL-1) (Trithorax-like protein)
            (Lysine N-methyltransferase 2A) (CXXC-type zinc finger
            protein 7) [Ciona intestinalis]
          Length = 3406

 Score = 41.6 bits (96), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 48/113 (42%), Gaps = 22/113 (19%)

Query: 279  CFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPD--NESLAEELRERIAAGE 336
            C  CG   S+   S   +  C  A C + YHP C      PD  N  +  +++       
Sbjct: 1024 CLICGSFSSAKSISEGALVHC--ACCCEPYHPFCA----EPDFLNTDVLAQMKRNTWICR 1077

Query: 337  SFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL-PTEIT--FSDAD 386
               C    C VC   ++       L +CRRC K+YH +CL P+  T  +SD D
Sbjct: 1078 KCQC----CHVCGHPKN-------LLVCRRCKKSYHSECLGPSYPTNCYSDED 1119


>gi|449492632|ref|XP_004159054.1| PREDICTED: uncharacterized LOC101210263 [Cucumis sativus]
          Length = 1213

 Score = 41.6 bits (96), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 51/124 (41%), Gaps = 32/124 (25%)

Query: 200 WNEDEDSE-------RVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQ 252
           WN  E+S+        +D D+     C IC DGGD+  CDG C  +FH +        C 
Sbjct: 563 WNRQEESKLLSFHTVEIDGDDPNDDTCGICGDGGDLICCDG-CPSTFHQS--------CL 613

Query: 253 SLGYTQAQIDAVPNFLCQNCVYQEHQCFACGM----LGSSDKSSSQEVFPCVSATCGQFY 308
            +      I    ++ C NC      C  CG+    +   D +S  E+  C+   C + +
Sbjct: 614 DI-----LIPPPGDWHCPNCT-----CKYCGVASIDICQGDNTSVSEISTCI--LCEKKF 661

Query: 309 HPEC 312
           H  C
Sbjct: 662 HESC 665


>gi|395539758|ref|XP_003771833.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Sarcophilus
           harrisii]
          Length = 4951

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 44/165 (26%), Positives = 61/165 (36%), Gaps = 38/165 (23%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYT---QAQIDAVPNFLCQNCVY 274
           CA C   G  +  C+ +C + +H    AG         ++      ID  P         
Sbjct: 329 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTFQDISNFSLLCPEHIDQAPER-----SK 383

Query: 275 QEHQCFAC---GMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRER 331
           +E  C  C   G LG        + F C   TCGQ YH  C+          +A    +R
Sbjct: 384 EEANCSVCDSPGDLG--------DQFFCT--TCGQHYHGMCLD---------IAVTALKR 424

Query: 332 IAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                 + CP   C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 425 AG----WQCP--DCKVCQNCKHSG-EDSKMLVCDTCDKGYHTFCL 462


>gi|403165978|ref|XP_003325884.2| hypothetical protein PGTG_07086 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375165992|gb|EFP81465.2| hypothetical protein PGTG_07086 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 1043

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 27/50 (54%), Gaps = 2/50 (4%)

Query: 337 SFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDAD 386
           SFTC  H+C VCQ++         L  C+ CP AY  +CLP E  +S  D
Sbjct: 932 SFTCSQHRCVVCQRAGPE--AGGMLYRCQTCPDAYCEECLPHENFYSMGD 979


>gi|345324243|ref|XP_003430797.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           MLL3-like [Ornithorhynchus anatinus]
          Length = 4910

 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQ----AQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   ++       ID  P        
Sbjct: 280 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHSSLLCPEHIDQAPER-----S 333

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            +E  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 334 KEEANCAVCDSPGDL-----LDQFYCT--TCGQHYHGMCLDIAITPLKRA---------- 376

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP   C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 377 ---GWQCP--DCKVCQNCKHSG-EDNKMLVCDTCDKGYHTFCL 413


>gi|357460667|ref|XP_003600615.1| hypothetical protein MTR_3g064280 [Medicago truncatula]
 gi|355489663|gb|AES70866.1| hypothetical protein MTR_3g064280 [Medicago truncatula]
          Length = 814

 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 45/95 (47%), Gaps = 23/95 (24%)

Query: 219 CAICDDGGDVTFCDGR-CLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           C +C DGG++  CD R CL+++HA          + LG+  + ++   N+ C +     H
Sbjct: 18  CFVCKDGGELRVCDFRDCLKTYHA----------KCLGHDASFMENDNNWCCGS-----H 62

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQ-FYHPE 311
            C+ CG      ++S      C  A CG+ FY  E
Sbjct: 63  YCYLCG------RASKFMCLCCPIAVCGRCFYDAE 91


>gi|53733613|gb|AAH83907.1| Hypothetical protein LOC502372 [Rattus norvegicus]
          Length = 291

 Score = 41.2 bits (95), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 78/182 (42%), Gaps = 29/182 (15%)

Query: 73  ELSYAQPEIW-VLSKQNNWIMLQSPKKSFKNIV--RTILITVHWLYCVKQNPEASRKYIR 129
           EL  AQ  +W  L K+ +   L   K   K+I+  +   + +  L+C ++  EA RK+  
Sbjct: 37  ELGEAQT-VWDTLQKEVDS--LHEEKVRLKDILNRKEETLRIMQLHCQEKESEAERKHSM 93

Query: 130 EQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPETNTFHEDVR 189
            Q  K   S+         L++   + +A  R   L   + L + M+ H +T  FH+   
Sbjct: 94  LQECKDRISF---------LNNQIDNEKAKLRRLRLDFEEHLETLMSQHKDTMEFHKPEH 144

Query: 190 TTEKPKVIVDWNEDEDSERVDKDENYFA---------VCAICDDGGDVTFCDGRCLRSFH 240
            TE+  V+     D   E++ K+E             +CA+C   G  TF DG  LRS  
Sbjct: 145 LTEEMSVL-----DSSKEQLLKEEKLIKAKLEDVQHRLCALCGPEGSSTFSDGLFLRSHE 199

Query: 241 AT 242
           A 
Sbjct: 200 AA 201


>gi|14626277|gb|AAK71545.1|AC087852_5 unknown protein [Oryza sativa Japonica Group]
          Length = 1324

 Score = 41.2 bits (95), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 55/139 (39%), Gaps = 32/139 (23%)

Query: 196  VIVDWNEDEDSER-------VDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKN 248
            +I  W++  DSER        + D+     C IC DGG++  CDG C  +FH +      
Sbjct: 931  LINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDG-CPSTFHMSCL---- 985

Query: 249  ALCQSLGYTQAQIDAVP--NFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP--CVSATC 304
                       +++A+P  ++ C  C      C  C      D     EV    C  + C
Sbjct: 986  -----------ELEALPSDDWRCAKC-----SCKFCQEHSRQDAQDIAEVDSSLCTCSQC 1029

Query: 305  GQFYHPECVSKLLHPDNES 323
             + YHP C  +  +  N S
Sbjct: 1030 EEKYHPGCSPETTNTSNVS 1048


>gi|218193747|gb|EEC76174.1| hypothetical protein OsI_13499 [Oryza sativa Indica Group]
          Length = 1305

 Score = 41.2 bits (95), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 55/139 (39%), Gaps = 32/139 (23%)

Query: 196  VIVDWNEDEDSER-------VDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKN 248
            +I  W++  DSER        + D+     C IC DGG++  CDG C  +FH +      
Sbjct: 912  LINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICCDG-CPSTFHMSCL---- 966

Query: 249  ALCQSLGYTQAQIDAVP--NFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP--CVSATC 304
                       +++A+P  ++ C  C      C  C      D     EV    C  + C
Sbjct: 967  -----------ELEALPSDDWRCAKC-----SCKFCQEHSRQDAQDIAEVDSSLCTCSQC 1010

Query: 305  GQFYHPECVSKLLHPDNES 323
             + YHP C  +  +  N S
Sbjct: 1011 EEKYHPGCSPETTNTSNVS 1029


>gi|338724475|ref|XP_001495649.3| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           MLL3-like [Equus caballus]
          Length = 4910

 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 60/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C +++H    AG     Q   +        ID  P        
Sbjct: 245 CAFCKHLGATIKCCEEKCTQTYHYPCAAGAGTF-QDFSHFFLLCPEHIDQAPER-----S 298

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 299 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 341

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 342 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 378


>gi|397469943|ref|XP_003806597.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           MLL3, partial [Pan paniscus]
          Length = 4810

 Score = 41.2 bits (95), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 60/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYT----QAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG  +  Q   +        ID  P        
Sbjct: 201 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGSF-QDFSHIFLLCPEHIDQAPER-----S 254

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 255 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 297

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 298 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 334


>gi|449510359|ref|XP_004163643.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101224338
            [Cucumis sativus]
          Length = 1403

 Score = 40.8 bits (94), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 48/110 (43%), Gaps = 26/110 (23%)

Query: 209  VDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--N 266
            VD+D+     C IC DGG++  CD  C  +FH               ++   I  +P  N
Sbjct: 938  VDEDDRNDDSCGICGDGGELICCD-NCPSTFH---------------HSCLSIQELPEGN 981

Query: 267  FLCQNCVYQEHQCFACG-MLGSSDKSSSQEVFPCVSATCGQFYHPECVSK 315
            + C NC      C  CG ++   + SSS +   C    C Q YH +C+ +
Sbjct: 982  WYCLNCT-----CRICGDLVNFEEISSSSDALKCFQ--CEQKYHGQCLKQ 1024


>gi|449456717|ref|XP_004146095.1| PREDICTED: uncharacterized protein LOC101204381 [Cucumis sativus]
          Length = 1393

 Score = 40.8 bits (94), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 48/110 (43%), Gaps = 26/110 (23%)

Query: 209  VDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--N 266
            VD+D+     C IC DGG++  CD  C  +FH               ++   I  +P  N
Sbjct: 938  VDEDDRNDDSCGICGDGGELICCD-NCPSTFH---------------HSCLSIQELPEGN 981

Query: 267  FLCQNCVYQEHQCFACG-MLGSSDKSSSQEVFPCVSATCGQFYHPECVSK 315
            + C NC      C  CG ++   + SSS +   C    C Q YH +C+ +
Sbjct: 982  WYCLNCT-----CRICGDLVNFEEISSSSDALKCFQ--CEQKYHGQCLKQ 1024


>gi|392339743|ref|XP_003753895.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
           norvegicus]
          Length = 4931

 Score = 40.8 bits (94), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 45/163 (27%), Positives = 61/163 (37%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 284 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHFFLLCPEHIDQAPER-----S 337

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P        LR    
Sbjct: 338 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTP--------LRR--- 379

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
           AG  + CP  +C VCQ       ED ++ +C  C K YH  CL
Sbjct: 380 AG--WQCP--ECKVCQNCRQSG-EDSKMLVCDTCDKGYHTFCL 417


>gi|242016925|ref|XP_002428945.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
 gi|212513774|gb|EEB16207.1| hypothetical protein Phum_PHUM411800 [Pediculus humanus corporis]
          Length = 6073

 Score = 40.8 bits (94), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 44/105 (41%), Gaps = 25/105 (23%)

Query: 273 VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
           + +  QC  C  LG+        +  C S  CG  +H  CV   L P            +
Sbjct: 373 IGETSQCSTCLSLGNVSN-----ILMCTS--CGAHHHGSCVGLALLPG-----------V 414

Query: 333 AAG-ESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
            AG + F C V  C VC+Q  ++     ++ +C  C KAYH  CL
Sbjct: 415 RAGWQCFECRV--CQVCRQPSEIG----KIMLCESCDKAYHPSCL 453


>gi|403351815|gb|EJY75406.1| hypothetical protein OXYTRI_03207 [Oxytricha trifallax]
          Length = 732

 Score = 40.8 bits (94), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 41/179 (22%), Positives = 63/179 (35%), Gaps = 39/179 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATIT-------------AGKNALCQSLGYTQAQIDAV 264
           VC +C+  G    C   C + +H   T               K+   Q         +  
Sbjct: 303 VCGMCEKPGASIQCASECKKFYHKECTEKMAIKTVIPQESTAKDTDSQKAEIPGPSNNGS 362

Query: 265 PNFL--CQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNE 322
           P F   C+NC+ +  +C  C   G    S    +  C  A                 + E
Sbjct: 363 PQFQIHCENCLTRIQKCQHCQQQGQIGLS----MIKCAVAK----------------ERE 402

Query: 323 SLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEIT 381
           SL + ++E++ +   F C  H C  C  SE    +D  +  C +C  AYH KC    +T
Sbjct: 403 SLPDNVKEQLKS-PVFICGGHHCHGC--SEPFRTDD-SIIRCLKCVAAYHEKCKEAAMT 457


>gi|431895734|gb|ELK05153.1| Histone-lysine N-methyltransferase MLL3 [Pteropus alecto]
          Length = 921

 Score = 40.8 bits (94), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 44/163 (26%), Positives = 60/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  CD RC + +H    AG     Q L +        ID  P        
Sbjct: 220 CAFCKHLGATLRCCDERCTQMYHYPCAAGAGTF-QDLSHFFLLCPEHIDQAPER-----S 273

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 274 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDVAVTPLKRA---------- 316

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 317 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 353


>gi|392347077|ref|XP_003749721.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Rattus
           norvegicus]
          Length = 4930

 Score = 40.8 bits (94), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 45/163 (27%), Positives = 61/163 (37%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 284 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHFFLLCPEHIDQAPER-----S 337

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P        LR    
Sbjct: 338 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTP--------LRR--- 379

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
           AG  + CP  +C VCQ       ED ++ +C  C K YH  CL
Sbjct: 380 AG--WQCP--ECKVCQNCRQSG-EDSKMLVCDTCDKGYHTFCL 417


>gi|426228657|ref|XP_004008414.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           MLL3 [Ovis aries]
          Length = 4922

 Score = 40.8 bits (94), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 60/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q L +        ID  P        
Sbjct: 256 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDLSHFFLLCPEHIDQAPER-----S 309

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 310 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 352

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 353 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 389


>gi|270015132|gb|EFA11580.1| trithorax [Tribolium castaneum]
          Length = 2343

 Score = 40.8 bits (94), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 46/103 (44%), Gaps = 20/103 (19%)

Query: 279 CFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESF 338
           CF CG  G       + +  C  + C + YHP C+ +   P   S +   R+ +     +
Sbjct: 603 CFLCGSAGR------EALLHC--SLCCEPYHPFCLER--SPPFSSTSNNNRQNV-----W 647

Query: 339 TCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEIT 381
            CP  +C  C      N  D Q   C++C KAYH +CL T+ T
Sbjct: 648 LCP--RCTTCNA---CNRADRQKIHCQKCSKAYHSECLKTDWT 685


>gi|297601684|ref|NP_001051260.2| Os03g0747600 [Oryza sativa Japonica Group]
 gi|255674895|dbj|BAF13174.2| Os03g0747600 [Oryza sativa Japonica Group]
          Length = 640

 Score = 40.8 bits (94), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 39/156 (25%), Positives = 60/156 (38%), Gaps = 32/156 (20%)

Query: 179 PETNTFHEDVRTTEKPKVIVDWNEDEDSER-------VDKDENYFAVCAICDDGGDVTFC 231
           P  N   + + T     +I  W++  DSER        + D+     C IC DGG++  C
Sbjct: 230 PYRNVLVDGLDTDLLHCLINAWDKQSDSERQAFFPISTETDDPNDDTCGICGDGGNLICC 289

Query: 232 DGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCVYQEHQCFACGMLGSSD 289
           DG C  +FH +                 +++A+P  ++ C  C      C  C      D
Sbjct: 290 DG-CPSTFHMSCL---------------ELEALPSDDWRCAKC-----SCKFCQEHSRQD 328

Query: 290 KSSSQEVFP--CVSATCGQFYHPECVSKLLHPDNES 323
                EV    C  + C + YHP C  +  +  N S
Sbjct: 329 AQDIAEVDSSLCTCSQCEEKYHPGCSPETTNTSNVS 364


>gi|195122760|ref|XP_002005879.1| GI18846 [Drosophila mojavensis]
 gi|193910947|gb|EDW09814.1| GI18846 [Drosophila mojavensis]
          Length = 1465

 Score = 40.8 bits (94), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 57/162 (35%), Gaps = 31/162 (19%)

Query: 219 CAICDDGGDVTFCDGRCLRSFH--ATITAGKNALCQSLG-YTQAQIDAVPNFLCQNCVYQ 275
           C+ C   G    C   C +  H      AG   + +S   +    +  VP     N V  
Sbjct: 153 CSFCGQYGASITCKMNCRQVHHWPCAAAAGCFLILESFTVFCTEHLSQVPLICTDNNV-- 210

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAG 335
              C  C  LG   K     +  C S  CG  +H  C+     PD  S            
Sbjct: 211 --GCLTCSSLGDLSK-----LIMCCS--CGDHFHSTCIGLANLPDTRS------------ 249

Query: 336 ESFTCP-VHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
             ++C    KC +C+Q E     D++   C +C K YH  CL
Sbjct: 250 -GWSCARCTKCQICRQQE---ANDIKFIKCEQCQKIYHATCL 287


>gi|413933082|gb|AFW67633.1| hypothetical protein ZEAMMB73_811991, partial [Zea mays]
          Length = 1376

 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 47/174 (27%), Positives = 70/174 (40%), Gaps = 44/174 (25%)

Query: 196  VIVDWNEDEDSER-------VDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKN 248
            +I  WN   D+ER       ++ D+     C IC DGG++  CDG C  +FH +      
Sbjct: 979  LINAWNMQSDAERQDFFPVSIEGDDPNDDTCGICGDGGNLICCDG-CPSTFHMSC----- 1032

Query: 249  ALCQSLGYTQAQIDAVPN--FLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQ 306
                 LG     ++A+P   + C NC      C  C    S D   + +V   +  TC Q
Sbjct: 1033 -----LG-----LEALPTDYWCCSNC-----SCKFCHEHSSDDAEDTADVDSSLH-TCSQ 1076

Query: 307  FYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQ 360
                E  ++   PD +S+A  L  +                CQQS  +  E+LQ
Sbjct: 1077 CE--EQCTEACSPDIDSIATNLSSQTGN-----------LFCQQSCRLLFEELQ 1117


>gi|401410482|ref|XP_003884689.1| putative SNF2 family N-terminal domain containing protein [Neospora
            caninum Liverpool]
 gi|325119107|emb|CBZ54659.1| putative SNF2 family N-terminal domain containing protein [Neospora
            caninum Liverpool]
          Length = 2550

 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 35/154 (22%), Positives = 66/154 (42%), Gaps = 15/154 (9%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
            +  E +CF CG     + SS         A+ G+  +     +  H   E L  +++   
Sbjct: 1623 IKHEQKCFLCGNGKDFEHSSVDADGNTTKASLGELVYCSGCPRAYHRVCEGLPRDVK--- 1679

Query: 333  AAGESFTCPVHKCFVC--QQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSDADENNF 390
               +S+ C  H+C +C  + S+  N+    L  C +CP ++   C P +    +  E  +
Sbjct: 1680 ---KSWRCRWHECCLCFRKTSQCGNM----LIHCAKCPTSFCYDCFPPDYCRHNVSEEYY 1732

Query: 391  ---QRAWVDLLPNNRILIYCLEHKIISELKTPAR 421
               ++  +++ P N IL+ C   K + E +T  R
Sbjct: 1733 SYLRQRGMNVTPQNWILLLCSRCKAVEEQQTRRR 1766


>gi|359479699|ref|XP_003632336.1| PREDICTED: uncharacterized protein LOC100853644 [Vitis vinifera]
          Length = 1003

 Score = 40.4 bits (93), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 27/50 (54%), Gaps = 8/50 (16%)

Query: 200 WNEDEDSER-------VDKDENYFAVCAICDDGGDVTFCDGRCLRSFHAT 242
           WN+ E+SER       V  D+     C IC DGGD+  CDG C  +FH +
Sbjct: 263 WNKQEESERSGFHLVDVGADDPNDDTCGICGDGGDLICCDG-CPSTFHQS 311


>gi|198456152|ref|XP_001360232.2| GA18992 [Drosophila pseudoobscura pseudoobscura]
 gi|198135513|gb|EAL24806.2| GA18992 [Drosophila pseudoobscura pseudoobscura]
          Length = 1486

 Score = 40.4 bits (93), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 44/183 (24%), Positives = 59/183 (32%), Gaps = 38/183 (20%)

Query: 206 SERVDKDENYFA---------VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGY 256
           S+  D D  YF           C  C   G    C   C +  H    A    L     +
Sbjct: 124 SQIADSDAAYFVNHIGEFLKQKCNFCGRYGASINCKMNCRQVHHWPCAAAAGCLLILESF 183

Query: 257 T---QAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECV 313
           T      +  VP     N V    +C +C  LG   K     +  C  +TCG  +H  C+
Sbjct: 184 TVFCTEHLSQVPLICSDNNV----ECLSCSSLGDLSK-----LIMC--STCGDHFHSTCI 232

Query: 314 SKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHR 373
                PD  S              + C   +C  CQ     +  DL+   C +C K YH 
Sbjct: 233 GLANLPDTRS-------------GWNC--ARCTKCQICRQQDSNDLKYVKCEQCQKIYHA 277

Query: 374 KCL 376
            C 
Sbjct: 278 SCF 280


>gi|91718902|ref|NP_733751.2| histone-lysine N-methyltransferase MLL3 [Homo sapiens]
 gi|221222521|sp|Q8NEZ4.3|MLL3_HUMAN RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
           Full=Homologous to ALR protein; AltName: Full=Lysine
           N-methyltransferase 2C; Short=KMT2C; AltName:
           Full=Myeloid/lymphoid or mixed-lineage leukemia protein
           3
          Length = 4911

 Score = 40.4 bits (93), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYT----QAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 285 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHIFLLCPEHIDQAPER-----S 338

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 339 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 381

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 382 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 418


>gi|119574356|gb|EAW53971.1| hCG1990594, isoform CRA_a [Homo sapiens]
          Length = 4911

 Score = 40.4 bits (93), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYT----QAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 285 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHIFLLCPEHIDQAPER-----S 338

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 339 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 381

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 382 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 418


>gi|21427632|gb|AAK00583.1| MLL3 [Homo sapiens]
          Length = 4911

 Score = 40.4 bits (93), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYT----QAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 285 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHIFLLCPEHIDQAPER-----S 338

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 339 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 381

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 382 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 418


>gi|410336273|gb|JAA37083.1| myeloid/lymphoid or mixed-lineage leukemia 3 [Pan troglodytes]
          Length = 4912

 Score = 40.4 bits (93), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYT----QAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 285 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHIFLLCPEHIDQAPER-----S 338

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 339 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 381

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 382 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 418


>gi|395540930|ref|XP_003772403.1| PREDICTED: histone-lysine N-methyltransferase MLL2 [Sarcophilus
           harrisii]
          Length = 5047

 Score = 40.4 bits (93), Expect = 4.0,   Method: Composition-based stats.
 Identities = 33/106 (31%), Positives = 41/106 (38%), Gaps = 23/106 (21%)

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
           ++  C  C  LG       +++  C S  CGQ YH  C           L   L  R  A
Sbjct: 224 EDAHCVVCDGLGEL-----RDLLFCTS--CGQHYHGAC-----------LDTALTARKRA 265

Query: 335 GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEI 380
           G  + CP   C VCQ       ED  + +C  C K YH  CL   I
Sbjct: 266 G--WQCP--DCKVCQTCRQPG-EDSMMLVCEACDKGYHTFCLKPAI 306


>gi|392585649|gb|EIW74988.1| hypothetical protein CONPUDRAFT_169853 [Coniophora puteana RWD-64-598
            SS2]
          Length = 1053

 Score = 40.4 bits (93), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 55/123 (44%), Gaps = 36/123 (29%)

Query: 219  CAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQ 278
            C  C DGG++  C G C R FH     G+ A     GYT A++  +P   C      +H+
Sbjct: 911  CLHCRDGGELVVC-GLCPRVFH-----GRCA-----GYTAAEVRRLPQLHC-----SQHE 954

Query: 279  CFACGMLGSSDKSSSQE----VFPCVSATCGQFYHPECVSKLLHPDNE--SLAEELRERI 332
            C  C       K  +QE    +F C   TC Q +  +C+     P+ E  ++ + L E +
Sbjct: 955  CCQC-------KRKTQETGGMLFRC--QTCPQAFCEDCL-----PEGELDAVGDTLPEFL 1000

Query: 333  AAG 335
            A G
Sbjct: 1001 ALG 1003


>gi|355748156|gb|EHH52653.1| hypothetical protein EGM_13123, partial [Macaca fascicularis]
          Length = 4916

 Score = 40.4 bits (93), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYT----QAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 232 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHIFLLCPEHIDQAPER-----S 285

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 286 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 328

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 329 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 365


>gi|195149375|ref|XP_002015633.1| GL11176 [Drosophila persimilis]
 gi|194109480|gb|EDW31523.1| GL11176 [Drosophila persimilis]
          Length = 1486

 Score = 40.4 bits (93), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 44/183 (24%), Positives = 59/183 (32%), Gaps = 38/183 (20%)

Query: 206 SERVDKDENYFA---------VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGY 256
           S+  D D  YF           C  C   G    C   C +  H    A    L     +
Sbjct: 124 SQIADSDAAYFVNHIGEFLKQKCNFCGRYGASINCKMNCRQVHHWPCAAAAGCLLILESF 183

Query: 257 T---QAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECV 313
           T      +  VP     N V    +C +C  LG   K     +  C  +TCG  +H  C+
Sbjct: 184 TVFCTEHLSQVPLICSDNNV----ECLSCSSLGDLSK-----LIMC--STCGDHFHSTCI 232

Query: 314 SKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHR 373
                PD  S              + C   +C  CQ     +  DL+   C +C K YH 
Sbjct: 233 GLANLPDTRS-------------GWNC--ARCTKCQICRQQDSNDLKYVKCEQCQKIYHA 277

Query: 374 KCL 376
            C 
Sbjct: 278 SCF 280


>gi|405122036|gb|AFR96804.1| Myst4 protein [Cryptococcus neoformans var. grubii H99]
          Length = 943

 Score = 40.4 bits (93), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 49/107 (45%), Gaps = 13/107 (12%)

Query: 270 QNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELR 329
            + V +E  C  CG   + +K   QE      A CG+  HP C++         L  +LR
Sbjct: 14  HSTVVREDFCSFCGGTDAVNKQGVQETM-VSCAACGRSGHPTCLNM--------LTPKLR 64

Query: 330 ERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
           +R+   + + C   +C  C+Q E +  +D +L  C  C + +H  CL
Sbjct: 65  KRVMMYD-WHC--IECKTCEQCE-IKGDDSRLMFCDTCDRGWHSYCL 107


>gi|380006423|gb|AFD29602.1| NSD-1 [Schmidtea mediterranea]
          Length = 914

 Score = 40.4 bits (93), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 45/204 (22%), Positives = 79/204 (38%), Gaps = 25/204 (12%)

Query: 219 CAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFL--CQNCVYQE 276
           C  C +  ++  C+     +FH           + L + Q  I+   N L  C  C    
Sbjct: 213 CRYCLETKNIVQCNKCLFNNFHV----------KCLEFLQNSIETKINDLDRCLECSLGI 262

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGE 336
            +C+AC  +   D S+ Q    C+  TC +++H  C+             E+ +    G+
Sbjct: 263 IRCYACYKI--VDISNEQGFMRCIIKTCTKWFHTNCLE-------TGPMLEMTKFPKPGQ 313

Query: 337 SFTCPVHKCFVCQ-QSEDMNVEDLQLAICRRCPKAYHRK--CLPTEITFSDADENNFQRA 393
              CP H C  C+  +        +L +C  CP  +H    C+P      +  E    R 
Sbjct: 314 -LVCPCHICHKCELVNPSPKYRCTKLIVCIMCPATFHTGDWCIPAGSVELNTGEIICPRH 372

Query: 394 WVDLLPNNRILIYCLEHKIISELK 417
            VDL  +  I++     ++ S+L+
Sbjct: 373 LVDLNFSMNIMVPPFVARVKSDLQ 396


>gi|296210171|ref|XP_002751860.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Callithrix
           jacchus]
          Length = 4909

 Score = 40.4 bits (93), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYT----QAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 285 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHIFLLCPEHIDQAPER-----S 338

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 339 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 381

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 382 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 418


>gi|34610109|gb|AAN11291.1| mixed-lineage leukemia 3 protein [Mus musculus]
          Length = 3396

 Score = 40.4 bits (93), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 216 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHFFLLCPEHIDQAPER-----S 269

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 270 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 312

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 313 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 349


>gi|402865478|ref|XP_003896948.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Papio
           anubis]
          Length = 1431

 Score = 40.4 bits (93), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYT----QAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 21  CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHIFLLCPEHIDQAPER-----S 74

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 75  KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 117

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 118 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 154


>gi|119574357|gb|EAW53972.1| hCG1990594, isoform CRA_b [Homo sapiens]
          Length = 4884

 Score = 40.4 bits (93), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYT----QAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 285 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHIFLLCPEHIDQAPER-----S 338

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 339 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 381

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 382 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 418


>gi|332243363|ref|XP_003270849.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Nomascus
           leucogenys]
          Length = 4856

 Score = 40.4 bits (93), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYT----QAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 217 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHIFLLCPEHIDQAPER-----S 270

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 271 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 313

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 314 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 350


>gi|344276554|ref|XP_003410073.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           MLL3-like [Loxodonta africana]
          Length = 4785

 Score = 40.4 bits (93), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 40/162 (24%), Positives = 59/162 (36%), Gaps = 32/162 (19%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYT---QAQIDAVPNFLCQNCVY 274
           CA C   G  +  C+ +C + +H    AG         ++      ID  P         
Sbjct: 233 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTFQDFSHFSLLCPEHIDQAPER-----SK 287

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
           ++  C  C   G        + F C   TCGQ YH  C+   + P   +           
Sbjct: 288 EDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA----------- 329

Query: 335 GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
              + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 330 --GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 366


>gi|154338886|ref|XP_001565665.1| putative helicase-like protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134062717|emb|CAM39160.1| putative helicase-like protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 974

 Score = 40.4 bits (93), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 26/102 (25%), Positives = 47/102 (46%), Gaps = 19/102 (18%)

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
           Q H+CF CG +        + +  C +  C + YH  C+ K      E++     +R+  
Sbjct: 768 QTHRCFCCGDV----MHPMEPLLHCTA--CPKAYHAACIGKRPPRSGEAV-----KRL-- 814

Query: 335 GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
              ++CP H+CF C + +     D  + +C  CP+++   CL
Sbjct: 815 ---WSCPRHECFSCGKQQ---AADGAIFMCDTCPRSFCFDCL 850


>gi|342180222|emb|CCC89699.1| putative transcription activator [Trypanosoma congolense IL3000]
          Length = 952

 Score = 40.4 bits (93), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 36/132 (27%), Positives = 53/132 (40%), Gaps = 24/132 (18%)

Query: 264 VPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNES 323
            P FL    +    +CF CG    +     + ++ C S  C + YH ECV +        
Sbjct: 786 TPAFLT---IGVTRECFVCG----NPMKPLRPLYHCCS--CPKAYHGECVGE-------- 828

Query: 324 LAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFS 383
              E  E       + CP H C  C  S+    ED  L +C  CP A+   CL  +  + 
Sbjct: 829 --REPGEGGVTPRVWKCPRHACDGCGSSQS---EDGALFLCYECPSAFCFDCL--DPRYL 881

Query: 384 DADENNFQRAWV 395
           + DE+  Q A +
Sbjct: 882 ELDESGLQFAHI 893


>gi|298705328|emb|CBJ49018.1| GF22528 [Ectocarpus siliculosus]
          Length = 1516

 Score = 40.4 bits (93), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 17/41 (41%), Positives = 26/41 (63%), Gaps = 2/41 (4%)

Query: 200 WNEDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFH 240
           W+  +D   V +D N+F  C++C + GD+T CD +C R FH
Sbjct: 296 WSVSQDGTLVGEDGNHF-FCSVCGESGDITCCD-QCPRVFH 334


>gi|296085211|emb|CBI28706.3| unnamed protein product [Vitis vinifera]
          Length = 912

 Score = 40.4 bits (93), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 27/50 (54%), Gaps = 8/50 (16%)

Query: 200 WNEDEDSER-------VDKDENYFAVCAICDDGGDVTFCDGRCLRSFHAT 242
           WN+ E+SER       V  D+     C IC DGGD+  CDG C  +FH +
Sbjct: 226 WNKQEESERSGFHLVDVGADDPNDDTCGICGDGGDLICCDG-CPSTFHQS 274


>gi|322792929|gb|EFZ16759.1| hypothetical protein SINV_09310 [Solenopsis invicta]
          Length = 549

 Score = 40.4 bits (93), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 51/181 (28%), Positives = 71/181 (39%), Gaps = 43/181 (23%)

Query: 219 CAICDDGGDVTFCD-GRCLRSFHATITAGKNAL--CQSLG-YTQAQIDAVPNFLCQNCVY 274
           CA C   G    C    C R FH    A  +     ++L  +    +  VP  LC     
Sbjct: 152 CAYCSHYGAGIPCKVTSCNRYFHFPCAAASSCFQDTKNLALFCSQHLGQVPLLLCG---- 207

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
            E  C  C   G  D S+       + + CGQ YH  CV   L P            + A
Sbjct: 208 -EVTCVQC--YGMGDVSNLV-----MCSVCGQHYHGSCVGLALLPG-----------VRA 248

Query: 335 G-ESFTCPVHKCFVCQQSEDM-------NVEDLQ------LAICRRCPKAYHRKCLPTEI 380
           G +  +C V  C VC+Q ED+       N++++       + +C RC KAYH  CL   +
Sbjct: 249 GWQCVSCRV--CQVCRQPEDVSKINVLQNIKNIHSKSYTHVMLCERCDKAYHPGCLRPIV 306

Query: 381 T 381
           T
Sbjct: 307 T 307


>gi|395838450|ref|XP_003792128.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Otolemur
           garnettii]
          Length = 4945

 Score = 40.0 bits (92), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 323 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHFFLLCPEHIDQAPER-----S 376

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 377 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 419

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 420 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 456


>gi|410953278|ref|XP_003983299.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Felis catus]
          Length = 4884

 Score = 40.0 bits (92), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 40/162 (24%), Positives = 58/162 (35%), Gaps = 32/162 (19%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY---TQAQIDAVPNFLCQNCVY 274
           CA C   G  +  C+ +C + +H    AG         +       ID  P         
Sbjct: 285 CAFCKHLGATIKCCEDKCTQMYHYPCAAGAGTFQDFRNFFLLCPEHIDQAPER-----SK 339

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
           ++  C  C   G        + F C   TCGQ YH  C+   + P   +           
Sbjct: 340 EDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA----------- 381

Query: 335 GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
              + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 382 --GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 418


>gi|354478318|ref|XP_003501362.1| PREDICTED: histone-lysine N-methyltransferase MLL3 [Cricetulus
           griseus]
          Length = 4871

 Score = 40.0 bits (92), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 268 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHFFLLCPEHIDQAPER-----S 321

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 322 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 364

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 365 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 401


>gi|300176465|emb|CBK23776.2| unnamed protein product [Blastocystis hominis]
          Length = 209

 Score = 40.0 bits (92), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 26/49 (53%), Gaps = 10/49 (20%)

Query: 209 VDKDENYFAVCAICDDGGDVTFCDGRCLRSFH------ATITAGKNALC 251
           V+ DE    +C IC DGGD+  CDG C R +H       ++  G+  LC
Sbjct: 52  VENDE----ICYICGDGGDLLLCDGGCARGYHLSCLNLTSVPEGETWLC 96


>gi|449520998|ref|XP_004167519.1| PREDICTED: zinc finger CCCH domain-containing protein 19-like
           [Cucumis sativus]
          Length = 471

 Score = 40.0 bits (92), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 30/121 (24%), Positives = 45/121 (37%), Gaps = 25/121 (20%)

Query: 299 CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVED 358
           C    C + YHPECV +      ES AE       + + + C  H CF+C ++       
Sbjct: 29  CDFKGCLKAYHPECVGR-----EESFAE-------SEDRWICGCHSCFLCHKTSKFRCVG 76

Query: 359 LQLAICRRC-----------PKAYHRKCLPTEITFSDADENNFQRAWVDLLPNNRILIYC 407
              A+C RC            + +   CL   +   D  + +     VD   N+R    C
Sbjct: 77  CPQAVCGRCIYSTEFVCIRGSRGFCNHCLKLALLIEDGKDVDIDGTKVDF--NDRDTYEC 134

Query: 408 L 408
           L
Sbjct: 135 L 135


>gi|37999865|sp|Q8BRH4.2|MLL3_MOUSE RecName: Full=Histone-lysine N-methyltransferase MLL3; AltName:
           Full=Myeloid/lymphoid or mixed-lineage leukemia protein
           3 homolog
          Length = 4903

 Score = 40.0 bits (92), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 284 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHFFLLCPEHIDQAPER-----S 337

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 338 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 380

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 381 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 417


>gi|124487063|ref|NP_001074852.1| histone-lysine N-methyltransferase MLL3 [Mus musculus]
          Length = 4904

 Score = 40.0 bits (92), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 284 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHFFLLCPEHIDQAPER-----S 337

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 338 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 380

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 381 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 417


>gi|195431535|ref|XP_002063792.1| GK15714 [Drosophila willistoni]
 gi|194159877|gb|EDW74778.1| GK15714 [Drosophila willistoni]
          Length = 1503

 Score = 40.0 bits (92), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 38/161 (23%), Positives = 53/161 (32%), Gaps = 29/161 (18%)

Query: 219 CAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYT---QAQIDAVPNFLCQNCVYQ 275
           C  C   G    C   C    H    A    L     +T      +  VP     N V  
Sbjct: 146 CVFCSQYGASINCKMNCRTVHHYPCAAAAGCLLILESFTVFCTEHLSQVPLICSDNNV-- 203

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAG 335
             +C +C  LG   K     +  C  ++CG  +H  C+     PD  S            
Sbjct: 204 --ECLSCSSLGDLSK-----LIMC--SSCGDHFHSTCIGLANLPDTRS------------ 242

Query: 336 ESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
             + C   +C  CQ     +  D++   C +C K YH  CL
Sbjct: 243 -GWCCA--RCTKCQICRQQDSNDIKFVKCEQCQKIYHASCL 280


>gi|356540325|ref|XP_003538640.1| PREDICTED: uncharacterized protein LOC100801320 [Glycine max]
          Length = 1254

 Score = 40.0 bits (92), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 33/127 (25%), Positives = 52/127 (40%), Gaps = 31/127 (24%)

Query: 200 WNEDEDSERV-------DKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQ 252
           WN  E +E++       D ++     C IC DGGD+  CDG C  +FH +        C 
Sbjct: 686 WNRQEHAEKIGFHSVDIDGNDPNDDTCGICGDGGDLICCDG-CPSTFHQS--------CL 736

Query: 253 SLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSS---DKSSSQEVFPCVSATCGQFYH 309
            +     Q+     + C NC      C  CG+   +   D +S   +  C+   C + YH
Sbjct: 737 DI-----QMLPPGEWHCPNCT-----CKFCGIASETSDKDDASVNVLRTCI--LCEKKYH 784

Query: 310 PECVSKL 316
             C  ++
Sbjct: 785 DSCTKEM 791


>gi|357467443|ref|XP_003604006.1| Flavonol synthase/flavanone 3-hydroxylase [Medicago truncatula]
 gi|355493054|gb|AES74257.1| Flavonol synthase/flavanone 3-hydroxylase [Medicago truncatula]
          Length = 1942

 Score = 40.0 bits (92), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 30/67 (44%), Gaps = 7/67 (10%)

Query: 218 VCAICDDGGDVTFCDGR-CLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQE 276
           VC IC DGGD+  CD R C +++H +      A  Q+ G            LC NC    
Sbjct: 696 VCFICFDGGDLVLCDRRGCPKAYHPSCVNRDEAFFQTKGKWNCGW-----HLCSNCEKNA 750

Query: 277 HQ-CFAC 282
           H  C+ C
Sbjct: 751 HYLCYTC 757


>gi|449431880|ref|XP_004133728.1| PREDICTED: zinc finger CCCH domain-containing protein 19-like
           [Cucumis sativus]
          Length = 746

 Score = 40.0 bits (92), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 30/121 (24%), Positives = 45/121 (37%), Gaps = 25/121 (20%)

Query: 299 CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVED 358
           C    C + YHPECV +      ES AE       + + + C  H CF+C ++       
Sbjct: 29  CDFKGCLKAYHPECVGR-----EESFAE-------SEDRWICGCHSCFLCHKTSKFRCVG 76

Query: 359 LQLAICRRC-----------PKAYHRKCLPTEITFSDADENNFQRAWVDLLPNNRILIYC 407
              A+C RC            + +   CL   +   D  + +     VD   N+R    C
Sbjct: 77  CPQAVCGRCIYSTEFVCIRGSRGFCNHCLKLALLIEDGKDVDIDGTKVDF--NDRDTYEC 134

Query: 408 L 408
           L
Sbjct: 135 L 135


>gi|291397406|ref|XP_002715125.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 3-like
           [Oryctolagus cuniculus]
          Length = 4865

 Score = 40.0 bits (92), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 239 CAFCKHLGATIKCCEEKCTQIYHYPCAAGAGTF-QDFSHFFLLCPEHIDQAPER-----S 292

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 293 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 335

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 336 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 372


>gi|410644486|ref|ZP_11354967.1| hypothetical protein GAGA_0501 [Glaciecola agarilytica NO2]
 gi|410136044|dbj|GAC03366.1| hypothetical protein GAGA_0501 [Glaciecola agarilytica NO2]
          Length = 366

 Score = 40.0 bits (92), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 56/114 (49%), Gaps = 22/114 (19%)

Query: 705 IVDFCCGANDFSCMMKVKLEQMGKSCSFRNYDLIQPKN-------DFSFEKRDWMTVRPE 757
           +++   G  D +  +K  L Q G+ C    YD I P N        F+ ++RD++TV+PE
Sbjct: 107 VLEPSIGLCDLAMSLKDYLIQEGRDCILDGYD-IDPLNVELSKQKGFNVQERDFLTVQPE 165

Query: 758 ELPDGSQLIMGLNPPFGVKASL-----ANKFISQALKFKPKLIVLIVPQETRRL 806
            + D    ++ +NPPF   A +     A++F++        ++V ++P    RL
Sbjct: 166 PIYD----LVLMNPPFDSSAFVDHVRHASRFLTPT-----GVLVSVIPVNMPRL 210


>gi|301759361|ref|XP_002915551.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           MLL3-like [Ailuropoda melanoleuca]
          Length = 4927

 Score = 39.7 bits (91), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 40/162 (24%), Positives = 58/162 (35%), Gaps = 32/162 (19%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY---TQAQIDAVPNFLCQNCVY 274
           CA C   G  +  C+ +C + +H    AG         +       ID  P         
Sbjct: 300 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTFQDFRNFFLLCPEHIDQAPER-----SK 354

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
           ++  C  C   G        + F C   TCGQ YH  C+   + P   +           
Sbjct: 355 EDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA----------- 396

Query: 335 GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
              + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 397 --GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 433


>gi|163955097|ref|YP_001648201.1| hypothetical protein OsV5_124f [Ostreococcus virus OsV5]
 gi|163638546|gb|ABY27905.1| hypothetical protein OsV5_124f [Ostreococcus virus OsV5]
          Length = 280

 Score = 39.7 bits (91), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 82/192 (42%), Gaps = 38/192 (19%)

Query: 673 MRYTSFGRHFTKVEKLKEIVDRLHWYVRSGDTIVDFCCGANDFSCMMKVKLEQMGKSCSF 732
           +R     + +TK    K+ + ++ W  +  D +++   G   F   + +  +++G     
Sbjct: 6   VRNQGLDKFYTKPSVAKKCISKIVW--KDWDFVIEPSAGNGSFLDWIPIP-DKIGLD--- 59

Query: 733 RNYDLIQPKNDFSFEKRDWMTVRPEELPDGSQLIMGLNPPFGVKASLANKFISQALKFKP 792
                I+P +D    K D+    P +  D  ++++  NPPFG  +SLA KF + A K+  
Sbjct: 60  -----IEPDSDHVI-KADFFDYEPPK--DKKRILVIGNPPFGKVSSLAIKFFNHAAKWS- 110

Query: 793 KLIVLIVPQETRRLDQKASYNLIWEDNEVLSGKSFYLPGSLDVHDNQLEQWNCK-PPPL- 850
             I  I+P+  RR+  +   +L           SF+L     +HD  +    C   PP+ 
Sbjct: 111 NCIAFIIPKTFRRVSVQNKLDL-----------SFHL-----IHDEDIPSKPCSFEPPMS 154

Query: 851 -----YLWSRAD 857
                 +W R D
Sbjct: 155 VKCCFQIWERRD 166


>gi|297796793|ref|XP_002866281.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297312116|gb|EFH42540.1| hypothetical protein ARALYDRAFT_358079 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 1047

 Score = 39.7 bits (91), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 44/101 (43%), Gaps = 26/101 (25%)

Query: 217 AVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCVY 274
            +C++C  GG +  CDG C  +FHA                   ++ VP  ++ C++C  
Sbjct: 704 VICSVCHYGGKLILCDG-CPSAFHANCLG---------------LEEVPDGDWFCESCC- 746

Query: 275 QEHQCFACG--MLGSSDKSSSQEVFPCVSATCGQFYHPECV 313
               C ACG   L ++ K + +E F      C   YHP C+
Sbjct: 747 ----CGACGQFFLKATSKYAKEEKF-ISCKQCELKYHPSCL 782


>gi|297739485|emb|CBI29667.3| unnamed protein product [Vitis vinifera]
          Length = 819

 Score = 39.7 bits (91), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 30/101 (29%), Positives = 39/101 (38%), Gaps = 28/101 (27%)

Query: 276 EHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAG 335
           E  CF C   G         +  C    C + YHP+CV K      +S  E       + 
Sbjct: 16  EDYCFVCKDGGL--------LMVCDYKDCLKAYHPQCVGK-----EDSFLE-------SD 55

Query: 336 ESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
           ES++C  H CF CQ++            C  CPKA    CL
Sbjct: 56  ESWSCSWHSCFNCQKASKFQ--------CYVCPKAVCGSCL 88


>gi|281339843|gb|EFB15427.1| hypothetical protein PANDA_003530 [Ailuropoda melanoleuca]
          Length = 4780

 Score = 39.7 bits (91), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 40/162 (24%), Positives = 58/162 (35%), Gaps = 32/162 (19%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY---TQAQIDAVPNFLCQNCVY 274
           CA C   G  +  C+ +C + +H    AG         +       ID  P         
Sbjct: 156 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTFQDFRNFFLLCPEHIDQAPER-----SK 210

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
           ++  C  C   G        + F C   TCGQ YH  C+   + P   +           
Sbjct: 211 EDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA----------- 252

Query: 335 GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
              + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 253 --GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 289


>gi|281208576|gb|EFA82752.1| hypothetical protein PPL_04447 [Polysphondylium pallidum PN500]
          Length = 985

 Score = 39.7 bits (91), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 30/104 (28%), Positives = 39/104 (37%), Gaps = 22/104 (21%)

Query: 279 CFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESF 338
           CF C      D      +  C S  C + YH  C           LA   R       S+
Sbjct: 771 CFLCK--KEEDPKGRTSIVQCRS--CPKIYHRSCAG---------LAHTPR-------SW 810

Query: 339 TCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITF 382
            CP H C  C+++   N       IC+ CP +Y   CLP +IT 
Sbjct: 811 KCPRHSCHSCRKTP--NESGGSFFICKECPSSYCITCLPNDITI 852


>gi|255574954|ref|XP_002528383.1| set domain protein, putative [Ricinus communis]
 gi|223532171|gb|EEF33976.1| set domain protein, putative [Ricinus communis]
          Length = 500

 Score = 39.7 bits (91), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 30/79 (37%), Gaps = 21/79 (26%)

Query: 299 CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVED 358
           C    CG  +H  CV +  H  N              + F CP H CFVC+Q  D     
Sbjct: 148 CSVRNCGGVFHFICVKETFHVSNP-------------KKFKCPQHACFVCKQKFDWR--- 191

Query: 359 LQLAICRRCPKAYHRKCLP 377
                C RC  A H +C P
Sbjct: 192 -----CVRCLMASHDRCGP 205


>gi|297289715|ref|XP_001107669.2| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Macaca
           mulatta]
          Length = 4785

 Score = 39.7 bits (91), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 40/162 (24%), Positives = 57/162 (35%), Gaps = 32/162 (19%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNAL---CQSLGYTQAQIDAVPNFLCQNCVY 274
           CA C   G  +  C+ +C + +H    AG                 ID  P         
Sbjct: 201 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTFHDFSHIFLLCPEHIDQAPER-----SK 255

Query: 275 QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
           ++  C  C   G        + F C   TCGQ YH  C+   + P   +           
Sbjct: 256 EDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA----------- 297

Query: 335 GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
              + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 298 --GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 334


>gi|348568065|ref|XP_003469819.1| PREDICTED: histone-lysine N-methyltransferase MLL3-like [Cavia
           porcellus]
          Length = 4878

 Score = 39.7 bits (91), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 241 CAFCKHLGATIKCCEEKCTQMYHYPCAAGVGTF-QDFSHFFLLCPEHIDQAPER-----S 294

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 295 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 337

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 338 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 374


>gi|145479813|ref|XP_001425929.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124393001|emb|CAK58531.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1299

 Score = 39.3 bits (90), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 43/154 (27%), Positives = 61/154 (39%), Gaps = 35/154 (22%)

Query: 210 DKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLC 269
           D +E+Y   C  C        C   C RSFH      K            ++ A   +  
Sbjct: 79  DYNEDY---CWKCRQKNRPLLCCDSCYRSFHMACVGIK------------KMPAGSWYCP 123

Query: 270 QNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSK-------LLHPDN- 321
           Q C Y++  C  C      D+ S+ E   C  + C  F H EC+ K       L  P N 
Sbjct: 124 QCCQYEQSYCPYC------DEQSTNEKIIC--SKCNTFIHLECILKDIPFEAVLRSPVNR 175

Query: 322 ESLAEELRERIAAGESFT--CPV--HKCFVCQQS 351
           + ++EEL +++   E  T  CP+  H C  C QS
Sbjct: 176 KHISEELFQQLLPAEKPTKNCPLIFHCCLRCLQS 209


>gi|432097048|gb|ELK27546.1| Histone-lysine N-methyltransferase MLL3 [Myotis davidii]
          Length = 4785

 Score = 39.3 bits (90), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 40/163 (24%), Positives = 60/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C +++H     G     Q + +        ID  P        
Sbjct: 217 CAFCKHLGATIKCCEEKCTQTYHYPCAVGAGTF-QDVSHFFLLCPEHIDQAPER-----S 270

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 271 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 313

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 314 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 350


>gi|422293139|gb|EKU20439.1| phd zinc finger-containing protein, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 174

 Score = 39.3 bits (90), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 16/47 (34%), Positives = 24/47 (51%)

Query: 338 FTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEITFSD 384
           F CP+H C +C +  +      QL  C  CP+AYH  C+P    + +
Sbjct: 10  FVCPMHICDLCGKGRESGNNRNQLFPCWLCPRAYHLNCIPPACKYHE 56


>gi|432916804|ref|XP_004079392.1| PREDICTED: autoimmune regulator-like [Oryzias latipes]
          Length = 384

 Score = 39.3 bits (90), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 33/69 (47%), Gaps = 19/69 (27%)

Query: 176 ANHPE--TNTFHEDVRTTEKPKVIVDWNEDEDSERVDKDENYFAVCAICDDGGDVTFCDG 233
           A  PE  TN F     TT    +++  N+DE              CA+C DGG++  CDG
Sbjct: 131 AEMPESATNAFRHQGETT--TSMVIHCNDDE--------------CAVCKDGGELICCDG 174

Query: 234 RCLRSFHAT 242
            C R+FH T
Sbjct: 175 -CPRAFHLT 182


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.134    0.411 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,289,795,497
Number of Sequences: 23463169
Number of extensions: 616631346
Number of successful extensions: 1448142
Number of sequences better than 100.0: 636
Number of HSP's better than 100.0 without gapping: 104
Number of HSP's successfully gapped in prelim test: 532
Number of HSP's that attempted gapping in prelim test: 1444829
Number of HSP's gapped (non-prelim): 2371
length of query: 876
length of database: 8,064,228,071
effective HSP length: 152
effective length of query: 724
effective length of database: 8,792,793,679
effective search space: 6365982623596
effective search space used: 6365982623596
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 82 (36.2 bits)