BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 000067
         (2445 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|O23372|ATXR3_ARATH Probable histone-lysine N-methyltransferase ATXR3 OS=Arabidopsis
            thaliana GN=ATXR3 PE=1 SV=2
          Length = 2335

 Score = 2681 bits (6950), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 1432/2524 (56%), Positives = 1777/2524 (70%), Gaps = 270/2524 (10%)

Query: 1    MGDGGVACMPLQQQQQHNSIMERFPISDKTTICVGNSSNNSNKTNNNSISNNNDNKTNND 60
            M DGGVACMPL       +IME+ PI +KTT+C GN S                 KT   
Sbjct: 1    MSDGGVACMPLL------NIMEKLPIVEKTTLCGGNES-----------------KTAAT 37

Query: 61   SSNNNGSSSSKNNETNKSNVKKNGVSTKTVRKKIVK-IKKVIAVKKKEVQKNSGSS---- 115
            + N + S ++K  E+  +N K +  S    +K+IVK I+KV+  + K+ QK +       
Sbjct: 38   TENGHTSIATKVPESQPAN-KPSASSQPVKKKRIVKVIRKVVKRRPKQPQKQADEQLKDQ 96

Query: 116  ---------------------KSNNNGENIDNKNVENGGAVGEVVTVDKENLKNEEVEEG 154
                                 KS   G     K VENGG  G            +EVEEG
Sbjct: 97   PPSQVVQLPAESQLQIKEQDKKSEFKGGTSGVKEVENGGDSG----------FKDEVEEG 146

Query: 155  ELGTLKW----ENGEFVQPEKSQPQSQLQSQSKQIEKGEIIV------------------ 192
            ELGTLK     ENGE + P KS        Q  +IEKGEI+                   
Sbjct: 147  ELGTLKLHEDLENGE-ISPVKSL-------QKSEIEKGEIVGESWKKDEPTKGEFSHLKY 198

Query: 193  ---------FSS-KCRRGETEKGESGLWRGNKDDIEKGEFIPDRWHK-EVVKDEYGYSKS 241
                     FS+ K  +G  E+ E   WR   D+IEKGEFIPDRW K +  KD++ Y +S
Sbjct: 199  HKGYVERRDFSADKNWKGGKEEREFRSWRDPSDEIEKGEFIPDRWQKMDTGKDDHSYIRS 258

Query: 242  RR----------YDYKLERTPPSGKYSGEDVYRRKEFDRSGSQHSKSSSRWESGQERNVR 291
            RR          Y+Y+ ERTPP G++  ED+Y ++EF               SG +R  R
Sbjct: 259  RRNGVDREKTWKYEYEYERTPPGGRFVNEDIYHQREF--------------RSGLDRTTR 304

Query: 292  ISSKIVDDEGLYKGEHNNGKNHGREYFH-GNRFKRHGTDSDSGDRKY-YGDYGDFAGLKS 349
            ISSKIV +E L+K E+NN  N  +EY   GNR KRHG + DS +RK+ Y DYGD+   K 
Sbjct: 305  ISSKIVIEENLHKNEYNNSSNFVKEYSSTGNRLKRHGAEPDSIERKHSYADYGDYGSSKC 364

Query: 350  RRLSDDYNSRSVHSEHYSRHSVEKFHRNSSSSRISSLDKYSSRHHEPSLSSRVIYDRHGR 409
            R+LSDD  SRS+HS+HYS+HS E+ +R+S  S+ SSL+KY  +H + S  ++   D+HG 
Sbjct: 365  RKLSDDC-SRSLHSDHYSQHSAERLYRDSYPSKNSSLEKYPRKHQDASFPAKAFSDKHGH 423

Query: 410  SPSHSDRSPHDRGRYYDHRDRSPSRHDRSPYTRDRSPYTFDRSPYSRERSPYNRDRSPYA 469
            SPS SD SPHDR RY+++R                     DRSPY+RERSPY  ++S +A
Sbjct: 424  SPSRSDWSPHDRSRYHENR---------------------DRSPYARERSPYIFEKSSHA 462

Query: 470  REKSPYDRSRHYDHRNRSPFSAERSPQDRARFHDRSDRTPNYLERSPLHRSRPNNHREAS 529
            R++SP DR  H     RSP  +E SP DR+R  DR D  PN++E +   R+R N HRE S
Sbjct: 463  RKRSPRDRRHH--DYRRSPSYSEWSPHDRSRPSDRRDYIPNFMEDTQSDRNRRNGHREIS 520

Query: 530  SKTGASEKRNARYDSKGHEDKLGPKDSNARCSRSSAKESQDKSNVQDLNVSDEKTANCES 589
             K+G  E+R+ +  ++  E K   K+SN + S SS+KE Q K+ + + ++  EK + C+S
Sbjct: 521  RKSGVRERRDCQTGTE-LEIKHKYKESNGKESTSSSKELQGKNILYNNSLLVEKNSVCDS 579

Query: 590  HKEEQPQSSSVDCKEPPQVDGPPLEELVSMEEDMDICDTPPHVPAVTDSSVGKWFYLDHC 649
             K   P ++    KEP QV   P EEL SME DMDICDTPPH P  +DSS+GKWFYLD+ 
Sbjct: 580  SKIPVPCATG---KEPVQVGEAPTEELPSMEVDMDICDTPPHEPMASDSSLGKWFYLDYY 636

Query: 650  GMECGPSRLCDLKTLVEEGVLVSDHFIKHLDSNRWETVENAVSPLVTVNFPSITSDSVTQ 709
            G E GP+RL DLK L+E+G+L SDH IKH D+NRW                         
Sbjct: 637  GTEHGPARLSDLKALMEQGILFSDHMIKHSDNNRW------------------------- 671

Query: 710  LVSPPEASGNLLADTGDTA------QSTGEEFPVTLQSQCCPDGSAAAAESSEDLHIDVR 763
            LV+PPEA GNLL D  DT       Q  G+  P  +  +  PDG     E+ ED  ID+R
Sbjct: 672  LVNPPEAPGNLLEDIADTTEAVCIEQGAGDSLPELVSVRTLPDGKEIFVENREDFQIDMR 731

Query: 764  VGALLDGFTVIPGKEIETLGEILQTTFERVDWQNNGGPTWHGACVGEQKPGDQKVDELYI 823
            V  LLDG T+ PG+E ETLGE L+     V+++           VG  +P  + ++E   
Sbjct: 732  VENLLDGRTITPGREFETLGEALKVN---VEFEETRRCVTSEGVVGMFRPMKRAIEEFKS 788

Query: 824  SDTKMKEAAELKSGDKDHWVVCFDSDEWFSGRWSCKGGDWKRNDEAAQDRCSRKKQVLND 883
             D    E+ E+ S              WFSGRWSCKGGDW R DEA+QDR  +KK VLND
Sbjct: 789  DDAYGSESDEIGS--------------WFSGRWSCKGGDWIRQDEASQDRYYKKKIVLND 834

Query: 884  GFPLCQMPKSGYEDPRWNQKDDLYYPSHSRRLDLPPWAYACPDERNDGSGGSRSTQSKLA 943
            GFPLC M KSG+EDPRW+ KDDLYYP  S RL+LP WA++  DERN              
Sbjct: 835  GFPLCLMQKSGHEDPRWHHKDDLYYPLSSSRLELPLWAFSVVDERNQ------------- 881

Query: 944  AVRGVKGTMLPVVRINACVVNDHGSFVSEPRSKVRAKERHSSRSARSYSSANDVRRSSAE 1003
              RGVK ++L VVR+N+ VVND    + +PR+KVR+KER  SR AR   +++D +R S E
Sbjct: 882  -TRGVKASLLSVVRLNSLVVNDQVPPIPDPRAKVRSKERCPSRPARPSPASSDSKRESVE 940

Query: 1004 SDSHSKARNNQDSQGSWKSIACINTPKDRLCTVDDLQLQLGEWYYLDGAGHERGPSSFSE 1063
            S S S A   QDSQG WK+   +NTP+DRLCTVDDLQL +G+W+Y DGAG E+GP SFSE
Sbjct: 941  SHSQSTASTGQDSQGLWKTDTSVNTPRDRLCTVDDLQLHIGDWFYTDGAGQEQGPLSFSE 1000

Query: 1064 LQVLVDQGCIQKHTSVFRKFDKVWVPLTFATETSASTVRNHGEKIMPSGDSSGLPPTQSQ 1123
            LQ LV++G I+ H+SVFRK DK+WVP+T  T++  +     G+         GL  +++Q
Sbjct: 1001 LQKLVEKGFIKSHSSVFRKSDKIWVPVTSITKSPETIAMLRGKTPALPSACQGLVVSETQ 1060

Query: 1124 DAVLGESNNNVNSNAFHTMHPQFIGYTRGKLHELVMKSYKNREFAAAINEVLDPWINAKQ 1183
            D    E + ++NS  FH +HPQF+GY RGKLH+LVMK++K+R+F+AAIN+V+D WI+A+Q
Sbjct: 1061 DFKYSEMDTSLNS--FHGVHPQFLGYFRGKLHQLVMKTFKSRDFSAAINDVVDSWIHARQ 1118

Query: 1184 PKKETE-HVYRKSEGDTRAGKRARLLVRESDGDEETEEELQTIQDESTFEDLCGDASFPG 1242
            PKKE+E ++Y+ SE ++   KRARL+  ES  D E E+     +DE TFEDLCGD +F  
Sbjct: 1119 PKKESEKYMYQSSELNSCYTKRARLMAGESGEDSEMEDTQMFQKDELTFEDLCGDLTFNI 1178

Query: 1243 EESASSAIESGGWGLLDGHTLAHVFHFLRSDMKSLAFASLTCRHWRAAVRFYKGISRQVD 1302
            E + S+      WGLLDGH LA VFH LR D+KSLAFAS+TCRHW+A +  YK ISRQVD
Sbjct: 1179 EGNRSAGTVGIYWGLLDGHALARVFHMLRYDVKSLAFASMTCRHWKATINSYKDISRQVD 1238

Query: 1303 LSSVGPNCTDSLIRKTLNAFDKEKLNSILLVGCTNITSGMLEEILQSFPHLSSIDIRGCG 1362
            LSS+GP+CTDS +R  +N ++KEK++SI+LVGCTN+T+ MLEEIL+  P +SS+DI GC 
Sbjct: 1239 LSSLGPSCTDSRLRSIMNTYNKEKIDSIILVGCTNVTASMLEEILRLHPRISSVDITGCS 1298

Query: 1363 QFGELALKFPNINWVKSQKSRGAKFNDSRSKIRSLKQITEKSSSAPKSKGLGDDMDDFGD 1422
            QFG+L + + N++W++ Q +R  + +   S+IRSLKQ T+      KSKGLG D DDFG+
Sbjct: 1299 QFGDLTVNYKNVSWLRCQNTRSGELH---SRIRSLKQTTD----VAKSKGLGGDTDDFGN 1351

Query: 1423 LKDYFESVDKRDSANQSFRRSLYQRSKVFDARKSSSILSRDARMRRWSIKKSENGYKRME 1482
            LKDYF+ V+KRDSANQ FRRSLY+RSK++DAR+SS+ILSRDAR+RRW+IKKSE+GYKR+E
Sbjct: 1352 LKDYFDRVEKRDSANQLFRRSLYKRSKLYDARRSSAILSRDARIRRWAIKKSEHGYKRVE 1411

Query: 1483 EFLASSLKEIMRVNTFEFFVPKVAEIEGRMKKGYYISHGLGSVKDDISRMCRDAIKAKNR 1542
            EFLASSL+ IM+ NTF+FF  KV++IE +MK GYY+SHGL SVK+DISRMCR+AIK    
Sbjct: 1412 EFLASSLRGIMKQNTFDFFALKVSQIEEKMKNGYYVSHGLRSVKEDISRMCREAIK---- 1467

Query: 1543 GSAGDMNRITTLFIQLATRLEQGAKSSYYEREEMMKSWKDESPAGLYSATSKYKKKLSKM 1602
                                           +E+MKSW+D S  GL SAT KY KKLSK 
Sbjct: 1468 -------------------------------DELMKSWQDGS--GLSSAT-KYNKKLSKT 1493

Query: 1603 VSERKYMNRSNGTSLANGDFDYGEYASDREIRKRLSKLNRKSLDSGSETSDDLDGSSEDG 1662
            V+E+KYM+R++ T   NG  DYGEYASDREI++RLSKLNRKS  S S+TS +    S++G
Sbjct: 1494 VAEKKYMSRTSDTFGVNGASDYGEYASDREIKRRLSKLNRKSFSSESDTSSE---LSDNG 1550

Query: 1663 KSDSESTVSDTDSDMDFRSDGRARESRGAGDFTTDEGLD-FSDDREWGARMTKASLVPPV 1721
            KSD+ S+ S ++S+ D RS+GR+++ R    FT D+  D  +++REWGARMTKASLVPPV
Sbjct: 1551 KSDNYSSASASESESDIRSEGRSQDLRIEKYFTADDSFDSVTEEREWGARMTKASLVPPV 1610

Query: 1722 TRKYEVIDQYVIVADEEDVRRKMRVSLPEDYAEKLNAQKNGSEELDMELPEVKDYKPRKQ 1781
            TRKYEVI++Y IVADEE+V+RKMRVSLPEDY EKLNAQ+NG EELDMELPEVK+YKPRK 
Sbjct: 1611 TRKYEVIEKYAIVADEEEVQRKMRVSLPEDYGEKLNAQRNGIEELDMELPEVKEYKPRKL 1670

Query: 1782 LGDQVFEQEVYGIDPYTHNLLLDSMPDELDWNLLEKHLFIEDVLLRTLNKQVRHFTGTGN 1841
            LGD+V EQEVYGIDPYTHNLLLDSMP ELDW+L +KH FIEDV+LRTLN+QVR FTG+G+
Sbjct: 1671 LGDEVLEQEVYGIDPYTHNLLLDSMPGELDWSLQDKHSFIEDVVLRTLNRQVRLFTGSGS 1730

Query: 1842 TPMMYPLQPVIEEIEKEAVDDCDVRTMKMCRGILKAMDSRPDDKYVAYRKGLGVVCNKEG 1901
            TPM++PL+PVIEE+++ A ++CD+RTMKMC+G+LK ++SR DDKYV+YRKGLGVVCNKEG
Sbjct: 1731 TPMVFPLRPVIEELKESAREECDIRTMKMCQGVLKEIESRSDDKYVSYRKGLGVVCNKEG 1790

Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLV 1961
            GFGE+DFVVEFLGEVYPVWKWFEKQDGIRSLQ+N  DPAPEFYNIYLERPKGDADGYDLV
Sbjct: 1791 GFGEEDFVVEFLGEVYPVWKWFEKQDGIRSLQENKTDPAPEFYNIYLERPKGDADGYDLV 1850

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            VVDAMH ANYASRICHSCRPNCEAKVTAVDGHYQIGIY+VR I YGEEITFDYNSVTESK
Sbjct: 1851 VVDAMHMANYASRICHSCRPNCEAKVTAVDGHYQIGIYSVRAIEYGEEITFDYNSVTESK 1910

Query: 2022 EEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKELHGLLDRHQLMLEACELNSVSEEDYL 2081
            EEYEASVCLCGSQVCRGSYLNLTGEGAF+KVLK+ HGLL+RH+LMLEAC LNSVSEEDYL
Sbjct: 1911 EEYEASVCLCGSQVCRGSYLNLTGEGAFQKVLKDWHGLLERHRLMLEACVLNSVSEEDYL 1970

Query: 2082 ELGRAGLGSCLLGGLPNWVVAYSARLVRFINLERTKLPEEILRHNLEEKRKYFSDICLEV 2141
            ELGRAGLGSCLLGGLP+W++AYSARLVRFIN ERTKLPEEIL+HNLEEKRKYFSDI L+V
Sbjct: 1971 ELGRAGLGSCLLGGLPDWMIAYSARLVRFINFERTKLPEEILKHNLEEKRKYFSDIHLDV 2030

Query: 2142 EKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRCVFGDPKKAPPPVERLSPEETVSFLWKG 2201
            EKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMR VFGDPK APPP+ERL+PEETVSF+W G
Sbjct: 2031 EKSDAEVQAEGVYNQRLQNLAVTLDKVRYVMRHVFGDPKNAPPPLERLTPEETVSFVWNG 2090

Query: 2202 EGSLVEELIQCMAPHVEEDVLNDLKSKIQAHDPSGSEDIQRELRKSLLWLRDEVRNLPCT 2261
            +GSLV+EL+Q ++PH+EE  LN+L+SKI  HDPSGS D+ +EL++SLLWLRDE+R+LPCT
Sbjct: 2091 DGSLVDELLQSLSPHLEEGPLNELRSKIHGHDPSGSADVLKELQRSLLWLRDEIRDLPCT 2150

Query: 2262 YKCRHDAAADLIHIYAYTKCFFRVQEYKAFTSPPVYISPLDLGPKYADKLGADLQVYRKT 2321
            YKCR+DAAADLIHIYAYTKCFF+V+EY++F S PV+ISPLDLG KYADKLG  ++ YRKT
Sbjct: 2151 YKCRNDAAADLIHIYAYTKCFFKVREYQSFISSPVHISPLDLGAKYADKLGESIKEYRKT 2210

Query: 2322 YGENYCLGQLIFWHIQTNADPDCTLARASRGCLSLPDIGSFYAKVQKPSRHRVYGPKTVR 2381
            YGENYCLGQLI+W+ QTN DPD TL +A+RGCLSLPD+ SFYAK QKPS+HRVYGPKTV+
Sbjct: 2211 YGENYCLGQLIYWYNQTNTDPDLTLVKATRGCLSLPDVASFYAKAQKPSKHRVYGPKTVK 2270

Query: 2382 FMLSRMEKQPQRPWPKDRIWAFKSSPRIFGSPMLDSSL-TGCPLDREMVHWLKHRPAIFQ 2440
             M+S+M KQPQRPWPKD+IW FKS+PR+FGSPM D+ L     LDRE++ WL++R  +FQ
Sbjct: 2271 TMVSQMSKQPQRPWPKDKIWTFKSTPRVFGSPMFDAVLNNSSSLDRELLQWLRNRRHVFQ 2330

Query: 2441 AMWD 2444
            A WD
Sbjct: 2331 ATWD 2334


>sp|Q9Y7R4|SET1_SCHPO Histone-lysine N-methyltransferase, H3 lysine-4 specific
            OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
            GN=set1 PE=1 SV=1
          Length = 920

 Score = 82.0 bits (201), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 51/141 (36%), Positives = 74/141 (52%), Gaps = 26/141 (18%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL---V 1961
            ++D V+E++GE+            IR    +N +        Y+    GD+  + +   V
Sbjct: 803  KNDMVIEYIGEI------------IRQRVADNREKN------YVREGIGDSYLFRIDEDV 844

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            +VDA  K N A  I HSC PNC A++  V+G  +I IY  R I +GEE+T+DY    +  
Sbjct: 845  IVDATKKGNIARFINHSCAPNCIARIIRVEGKRKIVIYADRDIMHGEELTYDY----KFP 900

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
            EE +   CLCG+  CRG YLN
Sbjct: 901  EEADKIPCLCGAPTCRG-YLN 920


>sp|Q18221|SET2_CAEEL Probable histone-lysine N-methyltransferase set-2 OS=Caenorhabditis
            elegans GN=set-2 PE=2 SV=2
          Length = 1507

 Score = 79.3 bits (194), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 54/145 (37%), Positives = 72/145 (49%), Gaps = 28/145 (19%)

Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNI---YLERPKGDADGY 1958
                D+ +VE++G+             IRSL     + A E   I   YL R        
Sbjct: 1387 SIAPDEMIVEYIGQT------------IRSLVAEEREKAYERRGIGSSYLFR-------I 1427

Query: 1959 DLV-VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSV 2017
            DL  V+DA  + N+A  I HSC+PNC AKV  ++G  +I IY+   I  GEEIT+DY   
Sbjct: 1428 DLHHVIDATKRGNFARFINHSCQPNCYAKVLTIEGEKRIVIYSRTIIKKGEEITYDYKFP 1487

Query: 2018 TESKEEYEASVCLCGSQVCRGSYLN 2042
             E     +   CLCG++ CRG YLN
Sbjct: 1488 IED----DKIDCLCGAKTCRG-YLN 1507


>sp|Q4PB36|SET1_USTMA Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Ustilago
            maydis (strain 521 / FGSC 9021) GN=SET1 PE=3 SV=1
          Length = 1468

 Score = 77.0 bits (188), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/137 (33%), Positives = 67/137 (48%), Gaps = 28/137 (20%)

Query: 1907 DFVVEFLGEVYPVW------KWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDL 1960
            D V+E++GEV          K +E+Q                 ++ YL R   D      
Sbjct: 1351 DMVIEYVGEVVRQQVADEREKQYERQGN---------------FSTYLFRVDDD------ 1389

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
            +VVDA HK N A  + H C PNC AK+  ++G  +I ++    I  GEE+T+DY   + +
Sbjct: 1390 LVVDATHKGNIARLMNHCCTPNCNAKILTLNGEKRIVLFAKTAIRAGEELTYDYKFQSSA 1449

Query: 2021 KEEYEASVCLCGSQVCR 2037
             +E +A  CLCGS  CR
Sbjct: 1450 DDE-DAIPCLCGSPGCR 1465


>sp|Q99MY8|ASH1L_MOUSE Histone-lysine N-methyltransferase ASH1L OS=Mus musculus GN=Ash1l
            PE=1 SV=3
          Length = 2958

 Score = 76.6 bits (187), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2138 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2174

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2175 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2234

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2235 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2272


>sp|Q9NR48|ASH1L_HUMAN Histone-lysine N-methyltransferase ASH1L OS=Homo sapiens GN=ASH1L
            PE=1 SV=2
          Length = 2969

 Score = 76.3 bits (186), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 30/160 (18%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            +++ A  KG G+   +    G+  F++E+LGEV              S Q        EF
Sbjct: 2148 ERFRAEEKGWGIRTKEPLKAGQ--FIIEYLGEVV-------------SEQ--------EF 2184

Query: 1944 YNIYLERPKGDADGYDL-----VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGI 1998
             N  +E+    +D Y L     +V+D+    N A  I HSC PNCE +  +V+G Y+IG+
Sbjct: 2185 RNRMIEQYHNHSDHYCLNLDSGMVIDSYRMGNEARFINHSCDPNCEMQKWSVNGVYRIGL 2244

Query: 1999 YTVRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            Y ++ +  G E+T+DYN  + + E+ +  +C CG + CRG
Sbjct: 2245 YALKDMPAGTELTYDYNFHSFNVEKQQ--LCKCGFEKCRG 2282


>sp|Q9VYD1|C1716_DROME Probable histone-lysine N-methyltransferase CG1716 OS=Drosophila
            melanogaster GN=Set2 PE=1 SV=2
          Length = 2313

 Score = 75.9 bits (185), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 77/149 (51%), Gaps = 20/149 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+        GE  F++E++GEV    + FE++  + S  +N       +Y + L 
Sbjct: 1371 KKGCGITAELLIPPGE--FIMEYVGEVIDSEE-FERRQHLYSKDRNRH-----YYFMAL- 1421

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              +G+A      V+DA  K N +  I HSC PN E +   V+G  +IG ++V+ I  GEE
Sbjct: 1422 --RGEA------VIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE 1473

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            ITFDY  +   +   +A  C C +  CRG
Sbjct: 1474 ITFDYQYLRYGR---DAQRCYCEAANCRG 1499


>sp|Q4I5R3|SET1_GIBZE Histone-lysine N-methyltransferase, H3 lysine-4 specific
            OS=Gibberella zeae (strain PH-1 / ATCC MYA-4620 / FGSC
            9075 / NRRL 31084) GN=SET1 PE=3 SV=2
          Length = 1263

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 70/143 (48%), Gaps = 23/143 (16%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
              +DD ++E++GE        + +  I  +++N           YL+   G +  +   D
Sbjct: 1141 IAKDDMIIEYVGE--------QVRQQISEIRENR----------YLKSGIGSSYLFRIDD 1182

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E
Sbjct: 1183 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIALNEELTYDYKFERE 1242

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCG+  C+G +LN
Sbjct: 1243 IG-STDRIPCLCGTAACKG-FLN 1263


>sp|Q2GWF3|SET1_CHAGB Histone-lysine N-methyltransferase, H3 lysine-4 specific
            OS=Chaetomium globosum (strain ATCC 6205 / CBS 148.51 /
            DSM 1962 / NBRC 6347 / NRRL 1970) GN=SET1 PE=3 SV=1
          Length = 1076

 Score = 73.6 bits (179), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 48/141 (34%), Positives = 70/141 (49%), Gaps = 23/141 (16%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---DLV 1961
            +DD ++E++GE        E +  I  L++N           YL+   G +  +   D  
Sbjct: 956  KDDMIIEYVGE--------EVRQQIAELRENR----------YLKSGIGSSYLFRIDDNT 997

Query: 1962 VVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESK 2021
            V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E  
Sbjct: 998  VIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERELG 1057

Query: 2022 EEYEASVCLCGSQVCRGSYLN 2042
               +   CLCG+  C+G +LN
Sbjct: 1058 ST-DRIPCLCGTAACKG-FLN 1076


>sp|O14026|SET2_SCHPO Histone-lysine N-methyltransferase, H3 lysine-36 specific
            OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
            GN=set2 PE=1 SV=1
          Length = 798

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 47/155 (30%), Positives = 75/155 (48%), Gaps = 20/155 (12%)

Query: 1884 DKYVAYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEF 1943
            D ++  +KG G+    +    +D FV E++GEV P  K+ ++      +++ + +    F
Sbjct: 183  DVFLTEKKGFGL--RADANLPKDTFVYEYIGEVIPEQKFRKR------MRQYDSEGIKHF 234

Query: 1944 YNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRG 2003
            Y + L+  KG+        +DA  + + A    HSCRPNC      V    ++GI+  R 
Sbjct: 235  YFMMLQ--KGE-------YIDATKRGSLARFCNHSCRPNCYVDKWMVGDKLRMGIFCKRD 285

Query: 2004 IHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            I  GEE+TFDYN     +   +A  C CG   C G
Sbjct: 286  IIRGEELTFDYNV---DRYGAQAQPCYCGEPCCVG 317


>sp|Q1LY77|SE1BA_DANRE Histone-lysine N-methyltransferase SETD1B-A OS=Danio rerio GN=setd1ba
            PE=1 SV=2
          Length = 1844

 Score = 72.4 bits (176), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1768 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSRQPINVNEEITYDYKFPIED 1827

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
                E   CLCG++ CRG+
Sbjct: 1828 ----EKIPCLCGAENCRGT 1842


>sp|Q5ABG1|SET1_CANAL Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Candida
            albicans (strain SC5314 / ATCC MYA-2876) GN=SET1 PE=3
            SV=1
          Length = 1040

 Score = 72.4 bits (176), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 49/84 (58%), Gaps = 2/84 (2%)

Query: 1959 DLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVT 2018
            D  V+DA  K   A  I H C P+C AK+  V+G  +I IY +R I   EE+T+DY    
Sbjct: 959  DNTVIDATKKGGIARFINHCCSPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFER 1018

Query: 2019 ESKEEYEASVCLCGSQVCRGSYLN 2042
            E+ +E E   CLCG+  C+G YLN
Sbjct: 1019 ETNDE-ERIRCLCGAPGCKG-YLN 1040


>sp|Q03164|MLL1_HUMAN Histone-lysine N-methyltransferase MLL OS=Homo sapiens GN=MLL PE=1
            SV=5
          Length = 3969

 Score = 71.6 bits (174), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3840 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3882

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3883 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3936

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3937 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3969


>sp|P55200|MLL1_MOUSE Histone-lysine N-methyltransferase MLL OS=Mus musculus GN=Mll PE=1
            SV=3
          Length = 3966

 Score = 71.6 bits (174), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 31/156 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----I 1946
            G G+ C +    GE   V+E+ G V            IRS+Q +  +   ++Y+      
Sbjct: 3837 GRGLFCKRNIDAGE--MVIEYAGNV------------IRSIQTDKRE---KYYDSKGIGC 3879

Query: 1947 YLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHY 2006
            Y+ R        D  VVDA    N A  I HSC PNC ++V  +DG   I I+ +R I+ 
Sbjct: 3880 YMFRID------DSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYR 3933

Query: 2007 GEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            GEE+T+DY    E  +      C CG++ CR  +LN
Sbjct: 3934 GEELTYDYKFPIE--DASNKLPCNCGAKKCR-KFLN 3966


>sp|Q9BYW2|SETD2_HUMAN Histone-lysine N-methyltransferase SETD2 OS=Homo sapiens GN=SETD2
            PE=1 SV=3
          Length = 2564

 Score = 71.2 bits (173), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 50/152 (32%), Positives = 72/152 (47%), Gaps = 21/152 (13%)

Query: 1890 RKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLE 1949
            +KG G+   K+     + FV+E+ GEV    K F+ +    +  KN         + Y  
Sbjct: 1559 KKGWGLRAAKD--LPSNTFVLEYCGEVLD-HKEFKARVKEYARNKN--------IHYYFM 1607

Query: 1950 RPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEE 2009
              K D       ++DA  K N +  + HSC PNCE +   V+G  ++G +T + +  G E
Sbjct: 1608 ALKNDE------IIDATQKGNCSRFMNHSCEPNCETQKWTVNGQLRVGFFTTKLVPSGSE 1661

Query: 2010 ITFDYNSVTESKEEYEASVCLCGSQVCRGSYL 2041
            +TFDY      K   EA  C CGS  CRG YL
Sbjct: 1662 LTFDYQFQRYGK---EAQKCFCGSANCRG-YL 1689


>sp|Q9UPS6|SET1B_HUMAN Histone-lysine N-methyltransferase SETD1B OS=Homo sapiens GN=SETD1B
            PE=1 SV=2
          Length = 1923

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1847 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1906

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1907 VK----IPCLCGSENCRGT 1921


>sp|Q8X0S9|SET1_NEUCR Histone-lysine N-methyltransferase, H3 lysine-4 specific
            OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A /
            CBS 708.71 / DSM 1257 / FGSC 987) GN=set-1 PE=3 SV=1
          Length = 1313

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 69/143 (48%), Gaps = 23/143 (16%)

Query: 1903 FGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGY---D 1959
              +DD ++E++GE        E +  I  L++            YL+   G +  +   D
Sbjct: 1191 INKDDMIIEYVGE--------EVRQQIAELREAR----------YLKSGIGSSYLFRIDD 1232

Query: 1960 LVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTE 2019
              V+DA  K   A  I HSC PNC AK+  V+G  +I IY +R I   EE+T+DY    E
Sbjct: 1233 NTVIDATKKGGIARFINHSCMPNCTAKIIKVEGSKRIVIYALRDIAQNEELTYDYKFERE 1292

Query: 2020 SKEEYEASVCLCGSQVCRGSYLN 2042
                 +   CLCG+  C+G +LN
Sbjct: 1293 IGST-DRIPCLCGTAACKG-FLN 1313


>sp|Q8CFT2|SET1B_MOUSE Histone-lysine N-methyltransferase SETD1B OS=Mus musculus GN=Setd1b
            PE=2 SV=2
          Length = 1985

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1909 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1968

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1969 VK----IPCLCGSENCRGT 1983


>sp|Q5F3P8|SET1B_CHICK Histone-lysine N-methyltransferase SETD1B OS=Gallus gallus GN=SETD1B
            PE=2 SV=1
          Length = 2008

 Score = 70.9 bits (172), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1932 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 1991

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCGS+ CRG+
Sbjct: 1992 VK----IPCLCGSENCRGT 2006


>sp|O96028|NSD2_HUMAN Histone-lysine N-methyltransferase NSD2 OS=Homo sapiens GN=WHSC1 PE=1
            SV=1
          Length = 1365

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKHAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>sp|Q24742|TRX_DROVI Histone-lysine N-methyltransferase trithorax OS=Drosophila virilis
            GN=trx PE=3 SV=1
          Length = 3828

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 72/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3701 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3746

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I HSC PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 3747 KID----DNLVVDATMRGNAARFINHSCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3802

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 3803 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3828


>sp|Q6BKL7|SET1_DEBHA Histone-lysine N-methyltransferase, H3 lysine-4 specific
            OS=Debaryomyces hansenii (strain ATCC 36239 / CBS 767 /
            JCM 1990 / NBRC 0083 / IGC 2968) GN=SET1 PE=3 SV=2
          Length = 1088

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             VVDA  K   A  I H C P+C AK+  V+G  +I IY +R I   EE+T+DY    E+
Sbjct: 1009 TVVDATKKGGIARFINHCCNPSCTAKIIKVEGKKRIVIYALRDIEANEELTYDYKFEKET 1068

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +  E   CLCG+  C+G YLN
Sbjct: 1069 NDA-ERIRCLCGAPGCKG-YLN 1088


>sp|Q2UMH3|SET1_ASPOR Histone-lysine N-methyltransferase, H3 lysine-4 specific
            OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40)
            GN=set1 PE=3 SV=1
          Length = 1229

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1150 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1209

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1210 DSD-DRIPCLCGSTGCKG-FLN 1229


>sp|Q8BVE8|NSD2_MOUSE Histone-lysine N-methyltransferase NSD2 OS=Mus musculus GN=Whsc1 PE=1
            SV=2
          Length = 1365

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 79/148 (53%), Gaps = 20/148 (13%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            KG G+V  ++   GE  FV E++GE+       ++++ +  ++  +E+    FY + +++
Sbjct: 1073 KGWGLVAKRDIRKGE--FVNEYVGEL------IDEEECMARIKYAHENDITHFYMLTIDK 1124

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
             +         ++DA  K NY+  + HSC+PNCE     V+G  ++G++ V  I  G E+
Sbjct: 1125 DR---------IIDAGPKGNYSRFMNHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTEL 1175

Query: 2011 TFDYNSVTESKEEYEASVCLCGSQVCRG 2038
            TF+YN      E+   +VC CG+  C G
Sbjct: 1176 TFNYNLDCLGNEK---TVCRCGASNCSG 1200


>sp|Q66J90|SET1B_XENLA Histone-lysine N-methyltransferase SETD1B OS=Xenopus laevis GN=setd1b
            PE=2 SV=1
          Length = 1938

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1862 TIIDATKCGNFARFINHSCNPNCYAKVVTVESQKKIVIYSKQYINVNEEITYDYKFPIED 1921

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCG++ CRG+
Sbjct: 1922 VK----IPCLCGAENCRGT 1936


>sp|Q5B0Y5|SET1_EMENI Histone-lysine N-methyltransferase, H3 lysine-4 specific
            OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS
            112.46 / NRRL 194 / M139) GN=set1 PE=3 SV=1
          Length = 1220

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1141 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIERDEELTYDYKFEREW 1200

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1201 DSD-DRIPCLCGSAGCKG-FLN 1220


>sp|Q945S8|ASHH3_ARATH Histone-lysine N-methyltransferase ASHH3 OS=Arabidopsis thaliana
            GN=ASHH3 PE=2 SV=2
          Length = 363

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 57/187 (30%), Positives = 84/187 (44%), Gaps = 36/187 (19%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V  +E   GE  F++E++GEV       + +     L K        FY   + R 
Sbjct: 127  GSGIVAEEEIEAGE--FIIEYVGEV------IDDKTCEERLWKMKHRGETNFYLCEITRD 178

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                     +V+DA HK N +  I HSC PN + +   +DG  +IGI+  RGI  GE +T
Sbjct: 179  ---------MVIDATHKGNKSRYINHSCNPNTQMQKWIIDGETRIGIFATRGIKKGEHLT 229

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR------GSYLNLTGEGAFEKVLKEL--------- 2056
            +DY  V    ++     C CG+  CR       S   +  + AF  V  EL         
Sbjct: 230  YDYQFVQFGADQD----CHCGAVGCRRKLGVKPSKPKIASDEAFNLVAHELAQTLPKVHQ 285

Query: 2057 HGLLDRH 2063
            +GL++RH
Sbjct: 286  NGLVNRH 292


>sp|Q1DR06|SET1_COCIM Histone-lysine N-methyltransferase, H3 lysine-4 specific
            OS=Coccidioides immitis (strain RS) GN=SET1 PE=3 SV=1
          Length = 1271

 Score = 69.7 bits (169), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1192 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIDRDEELTYDYKFEREW 1251

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1252 DSD-DRIPCLCGSAGCKG-FLN 1271


>sp|Q6CIT4|SET1_KLULA Histone-lysine N-methyltransferase, H3 lysine-4 specific
            OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 /
            DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) GN=SET1 PE=3
            SV=1
          Length = 1000

 Score = 69.7 bits (169), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 48/82 (58%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I H C P+C AK+  VDG  +I IY +R I   EE+T+DY    E+
Sbjct: 921  TVIDATKRGGIARFINHCCEPSCTAKIIKVDGRKRIVIYALRDIGTNEELTYDYKFERET 980

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 981  -DEGERLPCLCGAPSCKG-FLN 1000


>sp|Q08D57|SET1B_XENTR Histone-lysine N-methyltransferase SETD1B OS=Xenopus tropicalis
            GN=setd1b PE=2 SV=1
          Length = 1956

 Score = 69.7 bits (169), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             ++DA    N+A  I HSC PNC AKV  V+   +I IY+ + I+  EEIT+DY    E 
Sbjct: 1880 TIIDATKCGNFARFINHSCNPNCYAKVITVESQKKIVIYSKQYINVNEEITYDYKFPIED 1939

Query: 2021 KEEYEASVCLCGSQVCRGS 2039
             +      CLCG++ CRG+
Sbjct: 1940 VK----IPCLCGAENCRGT 1954


>sp|Q4WNH8|SET1_ASPFU Histone-lysine N-methyltransferase, H3 lysine-4 specific
            OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293 /
            CBS 101355 / FGSC A1100) GN=set1 PE=3 SV=1
          Length = 1241

 Score = 68.9 bits (167), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  +   A  I HSC PNC AK+  VDG  +I IY +R I   EE+T+DY    E 
Sbjct: 1162 TVIDATKRGGIARFINHSCTPNCTAKIIKVDGSKRIVIYALRDIGRDEELTYDYKFEREW 1221

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              + +   CLCGS  C+G +LN
Sbjct: 1222 DSD-DRIPCLCGSTGCKG-FLN 1241


>sp|P20659|TRX_DROME Histone-lysine N-methyltransferase trithorax OS=Drosophila
            melanogaster GN=trx PE=1 SV=4
          Length = 3726

 Score = 68.9 bits (167), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 54/151 (35%), Positives = 71/151 (47%), Gaps = 23/151 (15%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+ C K+   GE   V+E+ GE+            IRS   +  +   +   I     
Sbjct: 3599 GRGLYCTKDIEAGE--MVIEYAGEL------------IRSTLTDKRERYYDSRGIGCYMF 3644

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
            K D    D +VVDA  + N A  I H C PNC +KV  + GH  I I+ +R I  GEE+T
Sbjct: 3645 KID----DNLVVDATMRGNAARFINHCCEPNCYSKVVDILGHKHIIIFALRRIVQGEELT 3700

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCRGSYLN 2042
            +DY    E     E   C CGS+ CR  YLN
Sbjct: 3701 YDYKFPFED----EKIPCSCGSKRCR-KYLN 3726


>sp|P38827|SET1_YEAST Histone-lysine N-methyltransferase, H3 lysine-4 specific
            OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
            GN=SET1 PE=1 SV=1
          Length = 1080

 Score = 68.9 bits (167), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C PNC AK+  V G  +I IY +R I   EE+T+DY    E 
Sbjct: 1001 TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREK 1060

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 1061 DDE-ERLPCLCGAPNCKG-FLN 1080


>sp|Q9M1X9|ASHH4_ARATH Putative histone-lysine N-methyltransferase ASHH4 OS=Arabidopsis
            thaliana GN=ASHH4 PE=3 SV=1
          Length = 352

 Score = 68.2 bits (165), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 71/146 (48%), Gaps = 21/146 (14%)

Query: 1892 GLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERP 1951
            G G+V +++   GE  F++E++GEV       + +     L K N      FY   +   
Sbjct: 122  GYGIVADEDINSGE--FIIEYVGEV------IDDKICEERLWKLNHKVETNFYLCQINWN 173

Query: 1952 KGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEIT 2011
                     +V+DA HK N +  I HSC PN E +   +DG  +IGI+  R I+ GE++T
Sbjct: 174  ---------MVIDATHKGNKSRYINHSCSPNTEMQKWIIDGETRIGIFATRFINKGEQLT 224

Query: 2012 FDYNSVTESKEEYEASVCLCGSQVCR 2037
            +DY  V    ++     C CG+  CR
Sbjct: 225  YDYQFVQFGADQD----CYCGAVCCR 246


>sp|Q6FKB1|SET1_CANGA Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Candida
            glabrata (strain ATCC 2001 / CBS 138 / JCM 3761 / NBRC
            0622 / NRRL Y-65) GN=SET1 PE=3 SV=1
          Length = 1111

 Score = 68.2 bits (165), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 46/82 (56%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  V G  +I IY +R I   EE+T+DY    E+
Sbjct: 1032 TVIDATKKGGIARFINHCCEPSCTAKIIKVGGKRRIVIYALRDIAANEELTYDYKFERET 1091

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
              E E   CLCG+  C+G +LN
Sbjct: 1092 DAE-ERLPCLCGAPSCKG-FLN 1111


>sp|Q2LAE1|ASHH2_ARATH Histone-lysine N-methyltransferase ASHH2 OS=Arabidopsis thaliana
            GN=ASHH2 PE=1 SV=1
          Length = 1759

 Score = 68.2 bits (165), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 66/134 (49%), Gaps = 17/134 (12%)

Query: 1905 EDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVD 1964
            E  F++E++GEV  +  +  +Q       + +      FY + L       +G +  V+D
Sbjct: 1048 EGQFLIEYVGEVLDMQSYETRQKEYAFKGQKH------FYFMTL-------NGNE--VID 1092

Query: 1965 AMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEY 2024
            A  K N    I HSC PNC  +   V+G   +GI++++ +  G+E+TFDYN V       
Sbjct: 1093 AGAKGNLGRFINHSCEPNCRTEKWMVNGEICVGIFSMQDLKKGQELTFDYNYVRVFG--A 1150

Query: 2025 EASVCLCGSQVCRG 2038
             A  C CGS  CRG
Sbjct: 1151 AAKKCYCGSSHCRG 1164


>sp|Q75D88|SET1_ASHGO Histone-lysine N-methyltransferase, H3 lysine-4 specific OS=Ashbya
            gossypii (strain ATCC 10895 / CBS 109.51 / FGSC 9923 /
            NRRL Y-1056) GN=SET1 PE=3 SV=2
          Length = 975

 Score = 67.0 bits (162), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 1961 VVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTES 2020
             V+DA  K   A  I H C P+C AK+  V G  +I IY +R I   EE+T+DY    E+
Sbjct: 896  TVIDATKKGGIARFINHCCDPSCTAKIIKVGGMKRIVIYALRDIAANEELTYDYKFERET 955

Query: 2021 KEEYEASVCLCGSQVCRGSYLN 2042
             +E E   CLCG+  C+G +LN
Sbjct: 956  DDE-ERLPCLCGAPNCKG-FLN 975


>sp|Q8MT36|MES4_DROME Probable histone-lysine N-methyltransferase Mes-4 OS=Drosophila
            melanogaster GN=Mes-4 PE=1 SV=2
          Length = 1427

 Score = 66.2 bits (160), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 74/151 (49%), Gaps = 25/151 (16%)

Query: 1891 KGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLER 1950
            +G G+V  +    G  DFV+E++GEV          +  R +++   D    +Y + +E+
Sbjct: 1244 RGFGLVNREPIAVG--DFVIEYVGEV------INHAEFQRRMEQKQRDRDENYYFLGVEK 1295

Query: 1951 PKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEI 2010
                       ++DA  K N A  + HSC PNCE +   V+  +++GI+ ++ I    E+
Sbjct: 1296 D---------FIIDAGPKGNLARFMNHSCEPNCETQKWTVNCIHRVGIFAIKDIPVNSEL 1346

Query: 2011 TFDY---NSVTESKEEYEASVCLCGSQVCRG 2038
            TF+Y   + +  SK+      C CG++ C G
Sbjct: 1347 TFNYLWDDLMNNSKK-----ACFCGAKRCSG 1372


>sp|Q96L73|NSD1_HUMAN Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific OS=Homo sapiens GN=NSD1 PE=1 SV=1
          Length = 2696

 Score = 65.5 bits (158), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1966 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 2010

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 2011 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 2067

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 2068 TVCKCGAPNCSG 2079


>sp|O88491|NSD1_MOUSE Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20
            specific OS=Mus musculus GN=Nsd1 PE=1 SV=1
          Length = 2588

 Score = 65.1 bits (157), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 37/132 (28%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 1907 DFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYNIYLERPKGDADGYDLVVVDAM 1966
            +FV E++GE+       ++++    ++   E     FY + L++ +         ++DA 
Sbjct: 1864 EFVNEYVGEL------IDEEECRARIRYAQEHDITNFYMLTLDKDR---------IIDAG 1908

Query: 1967 HKANYASRICHSCRPNCEAKVTAVDGHYQIGIYTVRGIHYGEEITFDYNSVTESKEEYEA 2026
             K NYA  + H C+PNCE +  +V+G  ++G++ +  I  G E+TF+YN       +   
Sbjct: 1909 PKGNYARFMNHCCQPNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYNLECLGNGK--- 1965

Query: 2027 SVCLCGSQVCRG 2038
            +VC CG+  C G
Sbjct: 1966 TVCKCGAPNCSG 1977


>sp|Q6P2L6|NSD3_MOUSE Histone-lysine N-methyltransferase NSD3 OS=Mus musculus GN=Whsc1l1
            PE=1 SV=2
          Length = 1439

 Score = 63.2 bits (152), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 86/175 (49%), Gaps = 22/175 (12%)

Query: 1882 PDDKYV-AYRKGLGVVCNKEGGFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPA 1940
            PD + +   R+G G+   +    GE  FV E++GE+       ++++    +++ +E+  
Sbjct: 1148 PDAEVIKTERRGWGLRTKRSIKKGE--FVNEYVGEL------IDEEECRLRIKRAHENSV 1199

Query: 1941 PEFYNIYLERPKGDADGYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQIGIYT 2000
              FY + + + +         ++DA  K NY+  + HSC PNCE +   V+G  ++G++ 
Sbjct: 1200 TNFYMLTVTKDR---------IIDAGPKGNYSRFMNHSCNPNCETQKWTVNGDVRVGLFA 1250

Query: 2001 VRGIHYGEEITFDYNSVTESKEEYEASVCLCGSQVCRGSYLNLTGEGAFEKVLKE 2055
            +  I  G E+TF+YN           +VC CG+  C G +L +  + A    + E
Sbjct: 1251 LCDIPAGMELTFNYNLDCLGNGR---TVCHCGADNCSG-FLGVRPKSACTSAVDE 1301


>sp|C6KTD2|HKNMT_PLAF7 Putative histone-lysine N-methyltransferase PFF1440w OS=Plasmodium
            falciparum (isolate 3D7) GN=PFF1440w PE=3 SV=1
          Length = 6753

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 70/143 (48%), Gaps = 21/143 (14%)

Query: 1902 GFGEDDFVVEFLGEVYPVWKWFEKQDGIRSLQKNNEDPAPEFYN-----IYLERPKGDAD 1956
            G+G   +  EF+ E  PV ++    + IR++     D   ++Y+      Y+ R   +  
Sbjct: 6623 GYGL--YTCEFINEGEPVIEYI--GEYIRNII---SDKREKYYDKIESSCYMFRLNEN-- 6673

Query: 1957 GYDLVVVDAMHKANYASRICHSCRPNCEAKVTAVDGHYQ-IGIYTVRGIHYGEEITFDYN 2015
                +++DA    N +  I HSC PNC  K+ + D + + I I+  R I   EEIT+DY 
Sbjct: 6674 ----IIIDATKWGNVSRFINHSCEPNCFCKIVSCDQNLKHIVIFAKRDIAAHEEITYDYQ 6729

Query: 2016 SVTESKEEYEASVCLCGSQVCRG 2038
               ES  E +  +CLCGS  C G
Sbjct: 6730 FGVES--EGKKLICLCGSSTCLG 6750


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.314    0.131    0.389 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 965,474,865
Number of Sequences: 539616
Number of extensions: 44537966
Number of successful extensions: 298373
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 1297
Number of HSP's successfully gapped in prelim test: 1195
Number of HSP's that attempted gapping in prelim test: 127499
Number of HSP's gapped (non-prelim): 54081
length of query: 2445
length of database: 191,569,459
effective HSP length: 134
effective length of query: 2311
effective length of database: 119,260,915
effective search space: 275611974565
effective search space used: 275611974565
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 70 (31.6 bits)
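

Note: the effective lengths and search space in the footer above, and the bit scores and Expect values reported per hit, can be cross-checked with the standard Karlin-Altschul relations. The sketch below is not BLAST's own code. It assumes the second Lambda/K pair (0.267, 0.0410) is the gapped one, and the function names are hypothetical. Because the printed Lambda and K are rounded and compositional matrix adjustment may rescale individual hits, the bit scores and E-values are expected to match only approximately.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 2445
DB_LETTERS = 191_569_459
DB_SEQS = 539_616
EFF_HSP_LEN = 134
LAMBDA_GAPPED = 0.267   # assumed to be the gapped Lambda (second table)
K_GAPPED = 0.0410       # assumed to be the gapped K (second table)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
    )
    print("effective length of query:   ", eff_q)
    print("effective length of database:", eff_db)
    print("effective search space:      ", space)

    # Raw score 170 is the value reported for the hits with Expect = 2e-10.
    bits = bit_score(170, LAMBDA_GAPPED, K_GAPPED)
    print("bits for raw score 170: %.1f" % bits)
    print("approximate Expect:     %.0e" % expect_value(bits, space))
```

The integer quantities reproduce the footer exactly. The bit score and Expect depend on the rounded Lambda and K, so small differences in the last printed digit are possible for other hits.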