BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 003658
         (805 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q55FD6|Y8158_DICDI PHD finger-containing protein DDB_G0268158 OS=Dictyostelium
           discoideum GN=DDB_G0268158 PE=4 SV=1
          Length = 688

 Score = 94.0 bits (232), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 59/183 (32%), Positives = 88/183 (48%), Gaps = 28/183 (15%)

Query: 199 DWNEDEDSERVDKDENYFAVCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQ 258
           + N+D+  +R  K+E   AVC  C+  G++  CDG CLRSFH +    +N    S     
Sbjct: 122 ETNDDDKPKRPRKNE---AVCTFCEKPGELLMCDGLCLRSFHISCLKARNLYNPSSSSIS 178

Query: 259 --AQIDAVPNFLCQNCVYQEHQCFAC---GMLGSSDKSSSQEVFPCVSATCGQFYHPECV 313
               ID    + C +CV  ++ CF+C   G++G        ++  C    CG+FYH +CV
Sbjct: 179 PVTTIDGTVRWECNDCVSSQNSCFSCKKRGIIGI-------DLMKCKVHQCGKFYHYKCV 231

Query: 314 SKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHR 373
           +      +  LA+ +  +      F CP+H C VC+ S D      Q   C RCP AYH 
Sbjct: 232 A------DYKLAKLINTKTPR---FNCPLHYCSVCEVSGDGK----QSVHCFRCPTAYHV 278

Query: 374 KCL 376
            C+
Sbjct: 279 ICM 281


>sp|Q6P2L6|NSD3_MOUSE Histone-lysine N-methyltransferase NSD3 OS=Mus musculus GN=Whsc1l1
           PE=1 SV=2
          Length = 1439

 Score = 76.6 bits (187), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 52/160 (32%), Positives = 71/160 (44%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGD-VTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVP--NFLCQNCV 273
            VC +C+  GD +  C+G C R FH           + LG T     AVP  +F C+ C 
Sbjct: 702 TVCQVCEKAGDCLVACEGECCRHFHV----------ECLGLT-----AVPEGHFTCEECE 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C  + CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVSVCGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSSCSMEKDIHKASKGRMMRCLRCPVAYH 827


>sp|O88491|NSD1_MOUSE Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific OS=Mus musculus GN=Nsd1 PE=1 SV=1
          Length = 2588

 Score = 73.9 bits (180), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 81/196 (41%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK I++   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1404 RTAASPKEILEEGVEHDPGMSASKKLQVERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1463

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG  +        F+C  C    H CF C       K S ++V  
Sbjct: 1464 FH----------LECLGLPEMPRG---KFICNECHTGIHTCFVC-------KQSGEDVKR 1503

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F CP+H C  C  +   NV  
Sbjct: 1504 CLLPLCGKFYHEECVQK--YPPT----------VTQNKGFRCPLHICITCHAANPANVSA 1551

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1552 SKGRLMRCVRCPVAYH 1567



 Score = 38.9 bits (89), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 45/104 (43%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+          L E  R + 
Sbjct: 1438 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLG---------LPEMPRGKF 1479

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               E  T  +H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1480 ICNECHTG-IHTCFVCKQS----GEDVKRCLLPLCGKFYHEECV 1518



 Score = 34.7 bits (78), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 25/105 (23%), Positives = 43/105 (40%), Gaps = 34/105 (32%)

Query: 275  QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
            +E +CF+CG  G        ++  C    C + YH +C++    P              A
Sbjct: 2015 REDECFSCGDAG--------QLVSCKKPGCPKVYHADCLNLTKRP--------------A 2052

Query: 335  GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAY---HRKCL 376
            G+ + CP H+C VC +           + C  CP ++   HR+ +
Sbjct: 2053 GK-WECPWHQCDVCGKE--------AASFCEMCPSSFCKQHREGM 2088


>sp|Q8BVE8|NSD2_MOUSE Histone-lysine N-methyltransferase NSD2 OS=Mus musculus GN=Whsc1
           PE=1 SV=2
          Length = 1365

 Score = 72.8 bits (177), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 70/189 (37%), Gaps = 39/189 (20%)

Query: 191 TEKPKVIVDWNEDEDSERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRSFHATITA 245
           +E P    D  + E S    K E   A     VC +C+  G +  C+G C  +FH     
Sbjct: 637 SESPYESADETQTEASVSSKKSERGMAAKKEYVCQLCEKTGSLLLCEGPCCGAFHLAC-- 694

Query: 246 GKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFPCVSATCG 305
                   LG ++        F C  C    H CF C       K S  EV  CV   CG
Sbjct: 695 --------LGLSRRPEG---RFTCTECASGIHSCFVC-------KESKMEVKRCVVNQCG 736

Query: 306 QFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVE--DLQLAI 363
           +FYH  CV K      ES              F CP+H C  C  S   N      ++  
Sbjct: 737 KFYHEACVKKYPLTVFES------------RGFRCPLHSCMSCHASNPSNPRPSKGKMMR 784

Query: 364 CRRCPKAYH 372
           C RCP AYH
Sbjct: 785 CVRCPVAYH 793


>sp|Q96L73|NSD1_HUMAN Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific OS=Homo sapiens GN=NSD1 PE=1 SV=1
          Length = 2696

 Score = 72.4 bits (176), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 55/196 (28%), Positives = 80/196 (40%), Gaps = 44/196 (22%)

Query: 189  RTTEKPKVIVDWNEDED-----SERVDKDENYFA-----VCAICDDGGDVTFCDGRCLRS 238
            RT   PK  V+   + D     S+++  +    A     VC  C+  G++  C+ +C  +
Sbjct: 1506 RTATSPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEAQCCGA 1565

Query: 239  FHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEHQCFACGMLGSSDKSSSQEVFP 298
            FH           + LG T+        F+C  C    H CF C       K S ++V  
Sbjct: 1566 FHL----------ECLGLTEMPRG---KFICNECRTGIHTCFVC-------KQSGEDVKR 1605

Query: 299  CVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNV-- 356
            C+   CG+FYH ECV K  +P            +   + F C +H C  C  +   NV  
Sbjct: 1606 CLLPLCGKFYHEECVQK--YPPT----------VMQNKGFRCSLHICITCHAANPANVSA 1653

Query: 357  EDLQLAICRRCPKAYH 372
               +L  C RCP AYH
Sbjct: 1654 SKGRLMRCVRCPVAYH 1669



 Score = 39.3 bits (90), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 44/104 (42%), Gaps = 23/104 (22%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              +E+ C  C  LG        E+  C +  CG F H EC+     P  + +  E R  I
Sbjct: 1540 ALKENVCQNCEKLG--------ELLLCEAQCCGAF-HLECLGLTEMPRGKFICNECRTGI 1590

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                      H CFVC+QS     ED++  +   C K YH +C+
Sbjct: 1591 ----------HTCFVCKQSG----EDVKRCLLPLCGKFYHEECV 1620



 Score = 33.9 bits (76), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 24/105 (22%), Positives = 43/105 (40%), Gaps = 34/105 (32%)

Query: 275  QEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAA 334
            +E +CF+CG  G        ++  C    C + YH +C++    P              A
Sbjct: 2117 REDECFSCGDAG--------QLVSCKKPGCPKVYHADCLNLTKRP--------------A 2154

Query: 335  GESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAY---HRKCL 376
            G+ + CP H+C +C +           + C  CP ++   HR+ +
Sbjct: 2155 GK-WECPWHQCDICGKE--------AASFCEMCPSSFCKQHREGM 2190


>sp|Q9BZ95|NSD3_HUMAN Histone-lysine N-methyltransferase NSD3 OS=Homo sapiens GN=WHSC1L1
           PE=1 SV=1
          Length = 1437

 Score = 72.0 bits (175), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 69/160 (43%), Gaps = 38/160 (23%)

Query: 217 AVCAICDDGGDVTF-CDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPN--FLCQNCV 273
            VC IC+  GD    C+G C + FH           + LG     + ++P+  F+C  C 
Sbjct: 702 TVCQICESSGDSLIPCEGECCKHFHL----------ECLG-----LASLPDSKFICMECK 746

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
             +H CF+C       K S ++V  C    CG+FYH  CV K      ES          
Sbjct: 747 TGQHPCFSC-------KVSGKDVKRCSVGACGKFYHEACVRKFPTAIFES---------- 789

Query: 334 AGESFTCPVHKCFVCQQSEDMN-VEDLQLAICRRCPKAYH 372
             + F CP H C  C   +D++     ++  C RCP AYH
Sbjct: 790 --KGFRCPQHCCSACSMEKDIHKASKGRMMRCLRCPVAYH 827


>sp|O96028|NSD2_HUMAN Histone-lysine N-methyltransferase NSD2 OS=Homo sapiens GN=WHSC1
           PE=1 SV=1
          Length = 1365

 Score = 69.3 bits (168), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 60/157 (38%), Gaps = 34/157 (21%)

Query: 218 VCAICDDGGDVTFCDGRCLRSFHATITAGKNALCQSLGYTQAQIDAVPNFLCQNCVYQEH 277
           VC +C+  G +  C+G C  +FH             LG ++        F C  C    H
Sbjct: 669 VCQLCEKPGSLLLCEGPCCGAFHLAC----------LGLSRRPEG---RFTCSECASGIH 715

Query: 278 QCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGES 337
            CF C       K S  +V  CV   CG+FYH  CV K      ES              
Sbjct: 716 SCFVC-------KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFES------------RG 756

Query: 338 FTCPVHKCFVCQQSEDMNVE--DLQLAICRRCPKAYH 372
           F CP+H C  C  S   N      ++  C RCP AYH
Sbjct: 757 FRCPLHSCVSCHASNPSNPRPSKGKMMRCVRCPVAYH 793


>sp|Q5XHZ2|SYCE1_RAT Synaptonemal complex central element protein 1 OS=Rattus norvegicus
           GN=Syce1 PE=2 SV=2
          Length = 329

 Score = 41.6 bits (96), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 78/182 (42%), Gaps = 29/182 (15%)

Query: 73  ELSYAQPEIW-VLSKQNNWIMLQSPKKSFKNIV--RTILITVHWLYCVKQNPEASRKYIR 129
           EL  AQ  +W  L K+ +   L   K   K+I+  +   + +  L+C ++  EA RK+  
Sbjct: 75  ELGEAQT-VWDTLQKEVDS--LHEEKVRLKDILNRKEETLRIMQLHCQEKESEAERKHSM 131

Query: 130 EQMLKVFCSYEVEPSENDLLDHMPISREAAERDKDLANSKFLLSFMANHPETNTFHEDVR 189
            Q  K   S+         L++   + +A  R   L   + L + M+ H +T  FH+   
Sbjct: 132 LQECKDRISF---------LNNQIDNEKAKLRRLRLDFEEHLETLMSQHKDTMEFHKPEH 182

Query: 190 TTEKPKVIVDWNEDEDSERVDKDENYFA---------VCAICDDGGDVTFCDGRCLRSFH 240
            TE+  V+     D   E++ K+E             +CA+C   G  TF DG  LRS  
Sbjct: 183 LTEEMSVL-----DSSKEQLLKEEKLIKAKLEDVQHRLCALCGPEGSSTFSDGLFLRSHE 237

Query: 241 AT 242
           A 
Sbjct: 238 AA 239


>sp|Q8NEZ4|MLL3_HUMAN Histone-lysine N-methyltransferase MLL3 OS=Homo sapiens GN=MLL3
           PE=1 SV=3
          Length = 4911

 Score = 40.4 bits (93), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGYT----QAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 285 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHIFLLCPEHIDQAPER-----S 338

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 339 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 381

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 382 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 418



 Score = 33.1 bits (74), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 25/98 (25%), Positives = 40/98 (40%), Gaps = 20/98 (20%)

Query: 279  CFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESF 338
            C  CG  G   + +   +  C  + CGQ YHP CVS  +             ++   + +
Sbjct: 960  CVVCGSFG---QGAEGRLLAC--SQCGQCYHPYCVSIKI------------TKVVLSKGW 1002

Query: 339  TCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
             C   +C VC+       +  +L +C  C  +YH  CL
Sbjct: 1003 RC--LECTVCEACGKA-TDPGRLLLCDDCDISYHTYCL 1037


>sp|Q8BRH4|MLL3_MOUSE Histone-lysine N-methyltransferase MLL3 OS=Mus musculus GN=Mll3
           PE=2 SV=2
          Length = 4903

 Score = 39.7 bits (91), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 41/163 (25%), Positives = 59/163 (36%), Gaps = 34/163 (20%)

Query: 219 CAICDD-GGDVTFCDGRCLRSFHATITAGKNALCQSLGY----TQAQIDAVPNFLCQNCV 273
           CA C   G  +  C+ +C + +H    AG     Q   +        ID  P        
Sbjct: 284 CAFCKHLGATIKCCEEKCTQMYHYPCAAGAGTF-QDFSHFFLLCPEHIDQAPER-----S 337

Query: 274 YQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIA 333
            ++  C  C   G        + F C   TCGQ YH  C+   + P   +          
Sbjct: 338 KEDANCAVCDSPGDL-----LDQFFCT--TCGQHYHGMCLDIAVTPLKRA---------- 380

Query: 334 AGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + CP  +C VCQ  +    ED ++ +C  C K YH  CL
Sbjct: 381 ---GWQCP--ECKVCQNCKQSG-EDSKMLVCDTCDKGYHTFCL 417



 Score = 34.7 bits (78), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 26/104 (25%), Positives = 42/104 (40%), Gaps = 20/104 (19%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
              Q+  C  CG  G   + +   +  C  + CGQ YHP CVS  +             ++
Sbjct: 947  TLQQDMCVVCGSFG---QGAEGRLLAC--SQCGQCYHPYCVSIKI------------TKV 989

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
               + + C   +C VC+       +  +L +C  C  +YH  CL
Sbjct: 990  VLSKGWRC--LECTVCEACGKA-TDPGRLLLCDDCDISYHTYCL 1030


>sp|Q9SIV5|C3H19_ARATH Zinc finger CCCH domain-containing protein 19 OS=Arabidopsis
           thaliana GN=NERD PE=1 SV=3
          Length = 1773

 Score = 36.6 bits (83), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 34/72 (47%), Gaps = 9/72 (12%)

Query: 218 VCAICDDGGDVTFCDGR-CLRSFHATITAGKNALCQSLG------YTQAQIDAVPNFLCQ 270
           VC +C DGGD+  CD R C +++H +      A  Q+ G      +  ++ +    +LC 
Sbjct: 601 VCFMCFDGGDLVLCDRRGCTKAYHPSCVDRDEAFFQTKGKWNCGWHLCSKCEKTATYLCY 660

Query: 271 NCVYQEHQCFAC 282
            C++    C  C
Sbjct: 661 TCMF--SLCKGC 670


>sp|O14686|MLL2_HUMAN Histone-lysine N-methyltransferase MLL2 OS=Homo sapiens GN=MLL2
           PE=1 SV=2
          Length = 5537

 Score = 36.6 bits (83), Expect = 0.86,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 34/82 (41%), Gaps = 18/82 (21%)

Query: 295 EVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDM 354
           ++F C S  CG  YH  C           L   L  R  AG  + CP  +C VCQ     
Sbjct: 240 DLFFCTS--CGHHYHGAC-----------LDTALTARKRAG--WQCP--ECKVCQACRKP 282

Query: 355 NVEDLQLAICRRCPKAYHRKCL 376
              D ++ +C  C K YH  CL
Sbjct: 283 G-NDSKMLVCETCDKGYHTFCL 303



 Score = 36.2 bits (82), Expect = 1.1,   Method: Composition-based stats.
 Identities = 30/116 (25%), Positives = 42/116 (36%), Gaps = 35/116 (30%)

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVS-----------KLLHPDNESLA 325
            +C  C  LG+S         PC S  C + YH  C +           +LL P++   A
Sbjct: 170 QRCSHCTRLGAS--------IPCRSPGCPRLYHFPCATASGSFLSMKTLQLLCPEHSEGA 221

Query: 326 EELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEIT 381
             L E             +C VC+   ++      L  C  C   YH  CL T +T
Sbjct: 222 AYLEEA------------RCAVCEGPGEL----CDLFFCTSCGHHYHGACLDTALT 261



 Score = 35.8 bits (81), Expect = 1.3,   Method: Composition-based stats.
 Identities = 28/104 (26%), Positives = 42/104 (40%), Gaps = 20/104 (19%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
            V  +  C  CG  G   + +   +  C  + C Q YHP CV       N  + + +   +
Sbjct: 1374 VLMQDMCVVCGSFG---RGAEGHLLAC--SQCSQCYHPYCV-------NSKITKVM--LL 1419

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                   C V  C VC Q+ D +    +L +C  C  +YH  CL
Sbjct: 1420 KGWRCVECIV--CEVCGQASDPS----RLLLCDDCDISYHTYCL 1457


>sp|Q6PDK2|MLL2_MOUSE Histone-lysine N-methyltransferase MLL2 OS=Mus musculus GN=Mll2
           PE=1 SV=2
          Length = 5588

 Score = 36.2 bits (82), Expect = 0.98,   Method: Composition-based stats.
 Identities = 29/116 (25%), Positives = 40/116 (34%), Gaps = 35/116 (30%)

Query: 277 HQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVS-----------KLLHPDNESLA 325
            +C  C   G+S         PC S  C + YH  C +           +LL P++   A
Sbjct: 170 QRCSHCARFGAS--------VPCRSPGCSRLYHFPCATASGSFLSMKTLQLLCPEHSDGA 221

Query: 326 EELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCLPTEIT 381
             L E             +C VC+    +      L  C  C   YH  CL T +T
Sbjct: 222 AHLEEA------------RCAVCEGPGQL----CDLLFCTSCGHHYHGACLDTALT 261



 Score = 36.2 bits (82), Expect = 1.2,   Method: Composition-based stats.
 Identities = 28/104 (26%), Positives = 42/104 (40%), Gaps = 20/104 (19%)

Query: 273  VYQEHQCFACGMLGSSDKSSSQEVFPCVSATCGQFYHPECVSKLLHPDNESLAEELRERI 332
            V  +  C  CG  G   + +   +  C  + C Q YHP CV       N  + + +   +
Sbjct: 1330 VLMQDMCVVCGSFG---RGAEGHLLAC--SQCSQCYHPYCV-------NSKITKVM--LL 1375

Query: 333  AAGESFTCPVHKCFVCQQSEDMNVEDLQLAICRRCPKAYHRKCL 376
                   C V  C VC Q+ D +    +L +C  C  +YH  CL
Sbjct: 1376 KGWRCVECIV--CEVCGQASDPS----RLLLCDDCDISYHTYCL 1413



 Score = 36.2 bits (82), Expect = 1.2,   Method: Composition-based stats.
 Identities = 23/74 (31%), Positives = 31/74 (41%), Gaps = 16/74 (21%)

Query: 303 TCGQFYHPECVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSEDMNVEDLQLA 362
           +CG  YH  C+   L           R+R     S+ CP  +C VCQ        D ++ 
Sbjct: 246 SCGHHYHGACLDTAL---------TARKR----ASWQCP--ECKVCQSCRKPG-NDSKML 289

Query: 363 ICRRCPKAYHRKCL 376
           +C  C K YH  CL
Sbjct: 290 VCETCDKGYHTFCL 303


>sp|Q8MT36|MES4_DROME Probable histone-lysine N-methyltransferase Mes-4 OS=Drosophila
           melanogaster GN=Mes-4 PE=1 SV=2
          Length = 1427

 Score = 35.4 bits (80), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 33/132 (25%), Positives = 47/132 (35%), Gaps = 32/132 (24%)

Query: 266 NFLCQNC-VYQEHQCFACGMLGS--------SDKSSSQ-----EVFPCVSATCGQFYHPE 311
           + +C  C V +   C  C  + S         + SSS      ++  C    CG+ +H  
Sbjct: 853 SLVCHECNVGEPEGCVICHQVESPAVPSTPRKEDSSSHTPIEDKLLTCSQPMCGKRFHTS 912

Query: 312 CVSKLLHPDNESLAEELRERIAAGESFTCPVHKCFVCQQSED----MNVEDLQLAICRRC 367
           C      P   S     R          CP H C  C   +       +   +LA C RC
Sbjct: 913 CCK--YWPQASSSKHSAR----------CPRHVCHTCVSDDPSGKFQQLGSSKLAKCVRC 960

Query: 368 PKAYHR--KCLP 377
           P  YH+  KC+P
Sbjct: 961 PATYHQLSKCIP 972


>sp|Q6Z3U3|Y7797_ORYSJ B3 domain-containing protein Os07g0679700 OS=Oryza sativa subsp.
           japonica GN=Os07g0679700 PE=2 SV=1
          Length = 949

 Score = 35.0 bits (79), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 53/271 (19%), Positives = 104/271 (38%), Gaps = 46/271 (16%)

Query: 364 CRRCPKAYHRKCLPTEITFSDADENNFQRAWVDLLPNNRILIYCLEHKIISELKTPARDH 423
           C  C K  H  C+ ++ +F   D    Q             + C+++  +  + +P    
Sbjct: 75  CSFCGKRLHCGCIASKNSFDLLDSGGVQ------------CVTCIKNSAVQSVPSPVVPK 122

Query: 424 LKFPGVEGKRKKEDLELLLTEEKDVASKRNIVSESFVADK-----TVVKKLKLAEVYSGA 478
           L F     +R       L  +  D+ S R + + S + D      T++ K  L  +    
Sbjct: 123 L-FSSQNNQR-------LFGKSDDLLSGRPLETSSLMVDARNDDLTIIAKNNLPFMVKNV 174

Query: 479 DVGMSNSEIKKR----WPRQDVHSLKKPNITDTGRKSLKDIADKSKPSLRKDSTLLKSRS 534
           + G S++ ++++      RQ    L   +I D GR           P L +  + L+SR 
Sbjct: 175 EAGQSSNILRQKELENGARQIKWELPTLSIGDMGRI----------PFLTRSQSALESRR 224

Query: 535 FVVK-PGHRNIDGSKMKNSISDRRRMKKVNISQPSVDAEMEKELLALIKDSTSSFNEEEF 593
              K P   +     +  +  +       N ++    + +E+ +L+     T+ F E   
Sbjct: 225 DENKDPTTESTTSESLSEACLNMSLGIASNGNKLEATSTVERPMLS----PTTGFPEGRE 280

Query: 594 MKSHIVPITHAHHSKHLLEKSITLGLVEGSV 624
           + + + P  HA  ++H L +   +G  EG+V
Sbjct: 281 LTTALSPFQHAQRARHFLTRPPRVG--EGAV 309


>sp|Q9Z0E3|AIRE_MOUSE Autoimmune regulator OS=Mus musculus GN=Aire PE=1 SV=1
          Length = 552

 Score = 34.7 bits (78), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 13/22 (59%), Positives = 17/22 (77%), Gaps = 1/22 (4%)

Query: 219 CAICDDGGDVTFCDGRCLRSFH 240
           CA+C DGG++  CDG C R+FH
Sbjct: 301 CAVCHDGGELICCDG-CPRAFH 321


>sp|O43918|AIRE_HUMAN Autoimmune regulator OS=Homo sapiens GN=AIRE PE=1 SV=1
          Length = 545

 Score = 34.7 bits (78), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 13/22 (59%), Positives = 17/22 (77%), Gaps = 1/22 (4%)

Query: 219 CAICDDGGDVTFCDGRCLRSFH 240
           CA+C DGG++  CDG C R+FH
Sbjct: 299 CAVCRDGGELICCDG-CPRAFH 319


>sp|O69300|LON_CAMJE Lon protease OS=Campylobacter jejuni subsp. jejuni serotype O:2
           (strain NCTC 11168) GN=lon PE=2 SV=2
          Length = 791

 Score = 34.7 bits (78), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 16/143 (11%)

Query: 654 QIFQWKRKLDVYLAPFLHGMRYTSFGRHFTKVEKLKEIVDRLHWYVRSGDTIVDFCCGAN 713
           ++ +++++L++    F+H   Y    +   K E       R+H        I  +   A 
Sbjct: 249 EVREYQKRLELK-KKFMHEDAYKEIKKQIEKFE-------RIHQDNSEASMIQTYIETAL 300

Query: 714 D--FSCMMKVKLE--QMGKSCSFRNYDLIQPKNDFS--FEKRDWMTVRPEELPDGSQLIM 767
           D  F  + K KL+  ++ K  +  +Y L +PK      F  R+ +  R     DG+++I+
Sbjct: 301 DIPFEKISKKKLDIKEVSKQLNHDHYALNKPKERIEEYFAVRELLEKRKIAEKDGAKVIL 360

Query: 768 GLNPPFGV-KASLANKFISQALK 789
            L  P GV K SLAN  +S+ALK
Sbjct: 361 CLYGPPGVGKTSLANS-VSKALK 382


>sp|Q8U1R6|PUS10_PYRFU tRNA pseudouridine synthase Pus10 OS=Pyrococcus furiosus (strain
           ATCC 43587 / DSM 3638 / JCM 8422 / Vc1) GN=pus10 PE=1
           SV=1
          Length = 388

 Score = 34.3 bits (77), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 37/150 (24%), Positives = 64/150 (42%), Gaps = 28/150 (18%)

Query: 357 EDLQLAICRRCPKAYHRKCLPTEITFSDADENNFQRAWVDLLPNNRILIYCLEHKIISEL 416
           E +   ICR   K +H KC+       D D        V +L N R          + E+
Sbjct: 183 ESIATIICRAFKKHFHGKCIFKGAGREDVD--------VRMLGNGR--------PFVVEI 226

Query: 417 KTPARDHLKFPGVE------GKRKKEDLELLLTEEKD--VASKRNIVSESFVADKTVVKK 468
           K P +  +    +E      GK +  +L  +  EE +  + ++   V E+ V  K  + K
Sbjct: 227 KRPRKRKVNLKDIEEEINQSGKVEVLNLRFITPEEAERILTTRHRKVYEAIVYVKDGITK 286

Query: 469 LKLAEVYSGADVGMSNSEIKKRWPRQDVHS 498
            ++ +V       + N+EIK+R PR+ ++S
Sbjct: 287 EEVEKVVK----SLKNAEIKQRTPRRVLNS 312


>sp|A4IWC4|CH60_FRATW 60 kDa chaperonin OS=Francisella tularensis subsp. tularensis
           (strain WY96-3418) GN=groL PE=3 SV=1
          Length = 544

 Score = 33.5 bits (75), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 44/163 (26%), Positives = 69/163 (42%), Gaps = 24/163 (14%)

Query: 498 SLKKPNITDTGRKSLKDIADKSKPSLRKDSTLLK------------SRSFVVKPGHRNID 545
           ++K P   D  +  L+DIA  +  +   +   +K            SR  V K     ID
Sbjct: 275 AVKAPGFGDRRKAMLEDIATLTGATFVSEDLSMKLEETNMEHLGTASRVLVTKDNTTIID 334

Query: 546 GSKMKNSISDRRRMKKVNISQPSVDAEMEK--ELLALIKDSTSSFN----EEEFMKSHIV 599
           G+  K +I+ R  + K NI++ + D + EK  E LA +    +        E  MK    
Sbjct: 335 GAGEKEAIAKRINVIKANIAEANSDYDREKLQERLAKLSGGVAVIKVGAVTEAEMKEKKD 394

Query: 600 PITHA-HHSKHLLEKSITLGLVEGSVKAVRA--ALEMLDGGCD 639
            +  A H ++  +E+ I  G   G V  +RA  AL+ L G  D
Sbjct: 395 RVDDALHATRAAVEEGIVAG---GGVALIRAQKALDGLTGEND 434


>sp|Q9XTJ4|NHR3_CAEEL Nuclear hormone receptor family member nhr-3 OS=Caenorhabditis
           elegans GN=nhr-3 PE=2 SV=1
          Length = 463

 Score = 33.5 bits (75), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 42/82 (51%), Gaps = 14/82 (17%)

Query: 209 VDKDENYFAVCAICDDGGD-----VTFCDGRCLRSFHATITAGKNALCQSLGYT-QAQID 262
           +D +E+   +C++C D        V  C G C   F  T+ AGKN +C+   Y+ + +ID
Sbjct: 45  IDGEES--TICSVCCDEASGRHYGVVACFG-CKGFFRRTVRAGKNYVCR---YSKKCRID 98

Query: 263 AVPNFLCQNCVYQEHQCFACGM 284
                +C++C +Q  +C   GM
Sbjct: 99  KAGRNVCRSCRFQ--KCLEVGM 118


>sp|Q5UQR0|DPOL_MIMIV DNA polymerase OS=Acanthamoeba polyphaga mimivirus GN=POLB PE=3
           SV=1
          Length = 1740

 Score = 33.1 bits (74), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 24/95 (25%), Positives = 48/95 (50%), Gaps = 13/95 (13%)

Query: 492 PRQDVHSLKKPNITDTG-RKSLKDIADKSKPSLRKDSTLLKSRSFVVKPG-------HRN 543
           P+++ H   +  ITD    + LKDI DK       +  +++++ +  +         H+N
Sbjct: 768 PQKEYHRFAQEIITDEQINRELKDIFDKINTVFENNVAIIQNQKYFTEKNISELIDKHKN 827

Query: 544 IDGSKMKN-----SISDRRRMKKVNISQPSVDAEM 573
           I  SK+++     S+SD+R+ K V+  + S+D  +
Sbjct: 828 ISDSKIEDIEFDESLSDKRKNKLVDAEKDSLDKNI 862


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.319    0.134    0.409 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 310,420,995
Number of Sequences: 539616
Number of extensions: 13638457
Number of successful extensions: 34516
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 55
Number of HSP's that attempted gapping in prelim test: 34372
Number of HSP's gapped (non-prelim): 152
length of query: 805
length of database: 191,569,459
effective HSP length: 126
effective length of query: 679
effective length of database: 123,577,843
effective search space: 83909355397
effective search space used: 83909355397
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (29.6 bits)