RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8112
         (687 letters)



>gnl|CDD|215056 PLN00104, PLN00104, MYST -like histone acetyltransferase;
           Provisional.
          Length = 450

 Score =  408 bits (1051), Expect = e-136
 Identities = 165/312 (52%), Positives = 207/312 (66%), Gaps = 18/312 (5%)

Query: 1   PAEIIQNRYNELE--NCFEYYVHYDGFNRRLDEWVQKHRIMNSRFDMSEQHWKNKENQAI 58
           P ++I+ R       N +EYYVHY  FNRRLDEWV+       + D+        E    
Sbjct: 72  PVKVIERRRGGSGGPNDYEYYVHYTEFNRRLDEWVKL-----EQLDLDTVETVGDEK--- 123

Query: 59  DLLDQSDRKITRNQKRRHDEINHVQKTYEEMDPTTAALEKEHEAITKVKYIDKIMIGKCE 118
                +  K+TR+QKR+ DE +  +  +EE+D    A  +EHE  TKVK I  I +G+ E
Sbjct: 124 VEDKVASLKMTRHQKRKIDETHVEEG-HEELDA---ASLREHEEFTKVKNIATIELGRYE 179

Query: 119 IDTWYFSPYPDECKKKPKLWICEFCLKYMTMERTYRYHKSECTYCHPPGKEIYRCG---- 174
           IDTWYFSP+P E     KL+ CEFCLK+M  +   + H  +C   HPPG EIYR      
Sbjct: 180 IDTWYFSPFPPEYNDCSKLYFCEFCLKFMKRKEQLQRHMKKCDLKHPPGDEIYRHPTRQE 239

Query: 175 NISIYEVDGSFHKLYCQNLCLLAKLFLDHKTLYFDVDPFLFYILCVIDKYGAHLVGYFSK 234
            +S++EVDG  +K+YCQNLC LAKLFLDHKTLY+DVD FLFY+LC  D  G H+VGYFSK
Sbjct: 240 GLSMFEVDGKKNKVYCQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSK 299

Query: 235 EKESPDGNNVACILTLPPYQRQGYGKFLISFSYELSKVEGLIGSPEKPLSDLGKLSYRSY 294
           EK S +  N+ACILTLPPYQR+GYGKFLI+FSYELSK EG +G+PE+PLSDLG +SYR Y
Sbjct: 300 EKHSEEDYNLACILTLPPYQRKGYGKFLIAFSYELSKREGKVGTPERPLSDLGLVSYRGY 359

Query: 295 WSWVLLGILRNS 306
           W+ VLL IL+  
Sbjct: 360 WTRVLLEILKKH 371



 Score =  368 bits (946), Expect = e-121
 Identities = 141/228 (61%), Positives = 176/228 (77%), Gaps = 7/228 (3%)

Query: 454 KSECTYCHPPGKEIYRCG----NISIYEVDGSFHKLYCQNLCLLAKLFLDHKTLYFDVDP 509
             +C   HPPG EIYR       +S++EVDG  +K+YCQNLC LAKLFLDHKTLY+DVD 
Sbjct: 218 MKKCDLKHPPGDEIYRHPTRQEGLSMFEVDGKKNKVYCQNLCYLAKLFLDHKTLYYDVDL 277

Query: 510 FLFYILCVIDKYGAHLVGYFSKEKESPDGNNVACILTLPPYQRQGYGKFLISFSYELSKV 569
           FLFY+LC  D  G H+VGYFSKEK S +  N+ACILTLPPYQR+GYGKFLI+FSYELSK 
Sbjct: 278 FLFYVLCECDDRGCHMVGYFSKEKHSEEDYNLACILTLPPYQRKGYGKFLIAFSYELSKR 337

Query: 570 EGLIGSPEKPLSDLGKLSYRSYWSWVLLGILRNSKGNSTTIKELSEMTSIAQTDIISTLQ 629
           EG +G+PE+PLSDLG +SYR YW+ VLL IL+  KGN  +IKELS+MT+I   DI+STLQ
Sbjct: 338 EGKVGTPERPLSDLGLVSYRGYWTRVLLEILKKHKGN-ISIKELSDMTAIKAEDIVSTLQ 396

Query: 630 AMNMVKYWKGQHVICVTPKIVEEHIQSTQYKRPRLQVDPTYLRWTPPP 677
           ++N+++Y KGQHVIC  PK++EEH+++    R  L+VDP+ L WTP  
Sbjct: 397 SLNLIQYRKGQHVICADPKVLEEHLKA--AGRGGLEVDPSKLIWTPYK 442



 Score =  165 bits (419), Expect = 2e-44
 Identities = 65/139 (46%), Positives = 80/139 (57%), Gaps = 27/139 (19%)

Query: 324 FPIECKKKPKLWICEFCLKYMTMERTYRYHKSECTYCHPPGKEIYRCG----NISIYEVD 379
           FP E     KL+ CEFCLK+M  +   + H  +C   HPPG EIYR       +S++EVD
Sbjct: 188 FPPEYNDCSKLYFCEFCLKFMKRKEQLQRHMKKCDLKHPPGDEIYRHPTRQEGLSMFEVD 247

Query: 380 GSFHKLYCQNLCLLAKLFLDHKTLYFDVDPFLFYILCVIDKYDHKTLYFDVDPFLFYILC 439
           G  +K+YCQNLC LAKLFLDHKTLY+DVD FLFY+LC  D  D                 
Sbjct: 248 GKKNKVYCQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECD--DR---------------- 289

Query: 440 VIDKYGAHLVGYFSKSECT 458
                G H+VGYFSK + +
Sbjct: 290 -----GCHMVGYFSKEKHS 303


>gnl|CDD|216741 pfam01853, MOZ_SAS, MOZ/SAS family.  This region of these proteins
           has been suggested to be homologous to
           acetyltransferases.
          Length = 189

 Score =  332 bits (854), Expect = e-111
 Identities = 135/188 (71%), Positives = 153/188 (81%), Gaps = 1/188 (0%)

Query: 460 CHPPGKEIYRCGNISIYEVDGSFHKLYCQNLCLLAKLFLDHKTLYFDVDPFLFYILCVID 519
            HPPG EIYR GNIS++EVDG   KLYCQNLCLLAKLFLDHKTLY+DVDPFLFYIL   D
Sbjct: 2   RHPPGNEIYRKGNISVFEVDGRKQKLYCQNLCLLAKLFLDHKTLYYDVDPFLFYILTETD 61

Query: 520 KYGAHLVGYFSKEKESPDGNNVACILTLPPYQRQGYGKFLISFSYELSKVEGLIGSPEKP 579
           + G H+VGYFSKEKES +  N+ACILTLPPYQR+GYGK LI FSYELS+ EG IGSPEKP
Sbjct: 62  ETGCHIVGYFSKEKESSENYNLACILTLPPYQRKGYGKLLIDFSYELSRKEGKIGSPEKP 121

Query: 580 LSDLGKLSYRSYWSWVLLGILRNSKGNST-TIKELSEMTSIAQTDIISTLQAMNMVKYWK 638
           LSDLG LSYRSYWS  LL +L   +     TIKELS+MT I   DIISTLQ++NM+KY+K
Sbjct: 122 LSDLGLLSYRSYWSQTLLELLLEHRDEHKITIKELSKMTGITVEDIISTLQSLNMIKYYK 181

Query: 639 GQHVICVT 646
           GQH+IC+ 
Sbjct: 182 GQHIICLQ 189



 Score =  277 bits (711), Expect = 9e-90
 Identities = 109/142 (76%), Positives = 119/142 (83%)

Query: 163 CHPPGKEIYRCGNISIYEVDGSFHKLYCQNLCLLAKLFLDHKTLYFDVDPFLFYILCVID 222
            HPPG EIYR GNIS++EVDG   KLYCQNLCLLAKLFLDHKTLY+DVDPFLFYIL   D
Sbjct: 2   RHPPGNEIYRKGNISVFEVDGRKQKLYCQNLCLLAKLFLDHKTLYYDVDPFLFYILTETD 61

Query: 223 KYGAHLVGYFSKEKESPDGNNVACILTLPPYQRQGYGKFLISFSYELSKVEGLIGSPEKP 282
           + G H+VGYFSKEKES +  N+ACILTLPPYQR+GYGK LI FSYELS+ EG IGSPEKP
Sbjct: 62  ETGCHIVGYFSKEKESSENYNLACILTLPPYQRKGYGKLLIDFSYELSRKEGKIGSPEKP 121

Query: 283 LSDLGKLSYRSYWSWVLLGILR 304
           LSDLG LSYRSYWS  LL +L 
Sbjct: 122 LSDLGLLSYRSYWSQTLLELLL 143



 Score =  132 bits (333), Expect = 2e-35
 Identities = 56/97 (57%), Positives = 62/97 (63%), Gaps = 23/97 (23%)

Query: 360 CHPPGKEIYRCGNISIYEVDGSFHKLYCQNLCLLAKLFLDHKTLYFDVDPFLFYILCVID 419
            HPPG EIYR GNIS++EVDG   KLYCQNLCLLAKLFL                     
Sbjct: 2   RHPPGNEIYRKGNISVFEVDGRKQKLYCQNLCLLAKLFL--------------------- 40

Query: 420 KYDHKTLYFDVDPFLFYILCVIDKYGAHLVGYFSKSE 456
             DHKTLY+DVDPFLFYIL   D+ G H+VGYFSK +
Sbjct: 41  --DHKTLYYDVDPFLFYILTETDETGCHIVGYFSKEK 75


>gnl|CDD|215642 PLN03238, PLN03238, probable histone acetyltransferase MYST;
           Provisional.
          Length = 290

 Score =  326 bits (838), Expect = e-107
 Identities = 136/219 (62%), Positives = 171/219 (78%), Gaps = 3/219 (1%)

Query: 90  DPTTAALEKEHEAITKVKYIDKIMIGKCEIDTWYFSPYPDECKKKPKLWICEFCLKYMTM 149
           DP  A LE+EHE  TKVK I+ I +GK E+DTWY+SPYP+      KL+ICE+CLKYM  
Sbjct: 1   DPVLAELEREHEETTKVKNIEMIELGKYEMDTWYYSPYPEPYASCTKLYICEYCLKYMRK 60

Query: 150 ERTYRYHKSECTYCHPPGKEIYRC---GNISIYEVDGSFHKLYCQNLCLLAKLFLDHKTL 206
           +++   H ++C    PPG  IY     G +S++EVDG   K+YCQNLCLLAKLFLDHKTL
Sbjct: 61  KKSLLRHLAKCDIRQPPGGGIYGAVTEGPLSVFEVDGKKAKVYCQNLCLLAKLFLDHKTL 120

Query: 207 YFDVDPFLFYILCVIDKYGAHLVGYFSKEKESPDGNNVACILTLPPYQRQGYGKFLISFS 266
           Y+DVDPFLFY++  +D +G+H+VGYFSKEK S +  N+ACILTLPPYQR+GYGKFLISF+
Sbjct: 121 YYDVDPFLFYVMTEVDDHGSHIVGYFSKEKVSAEDYNLACILTLPPYQRKGYGKFLISFA 180

Query: 267 YELSKVEGLIGSPEKPLSDLGKLSYRSYWSWVLLGILRN 305
           YELSK EG +G+PE+PLSDLGK+S+RSYW+ VLL  LR+
Sbjct: 181 YELSKREGKVGTPERPLSDLGKVSFRSYWTRVLLEQLRD 219



 Score =  311 bits (799), Expect = e-101
 Identities = 133/227 (58%), Positives = 174/227 (76%), Gaps = 7/227 (3%)

Query: 454 KSECTYCHPPGKEIYRC---GNISIYEVDGSFHKLYCQNLCLLAKLFLDHKTLYFDVDPF 510
            ++C    PPG  IY     G +S++EVDG   K+YCQNLCLLAKLFLDHKTLY+DVDPF
Sbjct: 68  LAKCDIRQPPGGGIYGAVTEGPLSVFEVDGKKAKVYCQNLCLLAKLFLDHKTLYYDVDPF 127

Query: 511 LFYILCVIDKYGAHLVGYFSKEKESPDGNNVACILTLPPYQRQGYGKFLISFSYELSKVE 570
           LFY++  +D +G+H+VGYFSKEK S +  N+ACILTLPPYQR+GYGKFLISF+YELSK E
Sbjct: 128 LFYVMTEVDDHGSHIVGYFSKEKVSAEDYNLACILTLPPYQRKGYGKFLISFAYELSKRE 187

Query: 571 GLIGSPEKPLSDLGKLSYRSYWSWVLLGILRNSKGNSTTIKELSEMTSIAQTDIISTLQA 630
           G +G+PE+PLSDLGK+S+RSYW+ VLL  LR+ KG+  +IK+LS  T I   DI+STLQ+
Sbjct: 188 GKVGTPERPLSDLGKVSFRSYWTRVLLEQLRDVKGD-VSIKDLSLATGIRGEDIVSTLQS 246

Query: 631 MNMVKYWKGQHVICVTPKIVEEHIQSTQYKRPRLQVDPTYLRWTPPP 677
           +N++KYWKGQHVI V  ++++EH     ++R  ++VD  +  W P P
Sbjct: 247 LNLIKYWKGQHVIHVDQRVLDEHWAKFAHQRV-IEVDCLH--WQPLP 290



 Score =  147 bits (373), Expect = 8e-40
 Identities = 61/134 (45%), Positives = 80/134 (59%), Gaps = 26/134 (19%)

Query: 324 FPIECKKKPKLWICEFCLKYMTMERTYRYHKSECTYCHPPGKEIYRC---GNISIYEVDG 380
           +P       KL+ICE+CLKYM  +++   H ++C    PPG  IY     G +S++EVDG
Sbjct: 38  YPEPYASCTKLYICEYCLKYMRKKKSLLRHLAKCDIRQPPGGGIYGAVTEGPLSVFEVDG 97

Query: 381 SFHKLYCQNLCLLAKLFLDHKTLYFDVDPFLFYILCVIDKYDHKTLYFDVDPFLFYILCV 440
              K+YCQNLCLLAKLFL                       DHKTLY+DVDPFLFY++  
Sbjct: 98  KKAKVYCQNLCLLAKLFL-----------------------DHKTLYYDVDPFLFYVMTE 134

Query: 441 IDKYGAHLVGYFSK 454
           +D +G+H+VGYFSK
Sbjct: 135 VDDHGSHIVGYFSK 148


>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
           structure and dynamics].
          Length = 395

 Score =  326 bits (837), Expect = e-105
 Identities = 147/308 (47%), Positives = 193/308 (62%), Gaps = 10/308 (3%)

Query: 2   AEIIQNRYNELENCFEYYVHYDGFNRRLDEWVQKHRIMNSRFDMS-----EQHWKNKENQ 56
           AEI++    +     ++YVHY   NRRLDEW+    I N    +S     +Q  K K+ +
Sbjct: 23  AEILEINTRKSR--IKFYVHYVELNRRLDEWITADLI-NLGAAISIPKRKKQTEKGKKEK 79

Query: 57  AIDLLDQSDRKITRNQKRRHDEINHVQKTYEEMDPTTAALEKEHEAITKVKYIDKIMIGK 116
              + D+ D      Q      I++ ++  +     +        A  +VK I++I +G 
Sbjct: 80  KPKVSDRMDLDNENVQLEMLYSISNEREIRQLRFGGSKVQNPHEGA--RVKNINEIKLGN 137

Query: 117 CEIDTWYFSPYPDECKKKPKLWICEFCLKYMTMERTYRYHKSECTYCHPPGKEIYRCGNI 176
            EI+ WYFSPYP+E      ++ICEFCLKY   + +   H+ +C+  HPPG EIYR   I
Sbjct: 138 YEIEPWYFSPYPEEFSDLDIVYICEFCLKYYGSQTSLVRHRKKCSLQHPPGNEIYRDKYI 197

Query: 177 SIYEVDGSFHKLYCQNLCLLAKLFLDHKTLYFDVDPFLFYILCVIDKYGAHLVGYFSKEK 236
           S +E+DG   +LYC+NLCLL+KLFLDHKTLY+DVDPFLFY+L      G HLVGYFSKEK
Sbjct: 198 SFFEIDGRKQRLYCRNLCLLSKLFLDHKTLYYDVDPFLFYVLTERGDTGCHLVGYFSKEK 257

Query: 237 ESPDGNNVACILTLPPYQRQGYGKFLISFSYELSKVEGLIGSPEKPLSDLGKLSYRSYWS 296
           ES    N+ACILTLPPYQR+GYGK LI FSY LS+ EG +GSPEKPLSDLG LSYR+YWS
Sbjct: 258 ESEQDYNLACILTLPPYQRRGYGKLLIDFSYLLSQKEGKVGSPEKPLSDLGLLSYRAYWS 317

Query: 297 WVLLGILR 304
            ++  +L 
Sbjct: 318 EIVAKLLL 325



 Score =  287 bits (735), Expect = 2e-90
 Identities = 123/253 (48%), Positives = 163/253 (64%), Gaps = 10/253 (3%)

Query: 421 YDHKTLYFDVDPFLFYILCVIDKYGAHLVGYFSKSECTYCHPPGKEIYRCGNISIYEVDG 480
            D   +Y  +  F             H      + +C+  HPPG EIYR   IS +E+DG
Sbjct: 153 SDLDIVY--ICEFCLKYYGSQTSLVRH------RKKCSLQHPPGNEIYRDKYISFFEIDG 204

Query: 481 SFHKLYCQNLCLLAKLFLDHKTLYFDVDPFLFYILCVIDKYGAHLVGYFSKEKESPDGNN 540
              +LYC+NLCLL+KLFLDHKTLY+DVDPFLFY+L      G HLVGYFSKEKES    N
Sbjct: 205 RKQRLYCRNLCLLSKLFLDHKTLYYDVDPFLFYVLTERGDTGCHLVGYFSKEKESEQDYN 264

Query: 541 VACILTLPPYQRQGYGKFLISFSYELSKVEGLIGSPEKPLSDLGKLSYRSYWSWVLLGIL 600
           +ACILTLPPYQR+GYGK LI FSY LS+ EG +GSPEKPLSDLG LSYR+YWS ++  +L
Sbjct: 265 LACILTLPPYQRRGYGKLLIDFSYLLSQKEGKVGSPEKPLSDLGLLSYRAYWSEIVAKLL 324

Query: 601 RNSKGNSTTIKELSEMTSIAQTDIISTLQAMNMVKYWKGQHVICVTPKIVEEHIQSTQYK 660
                  T I E+S+ T ++  D+I TL+A+N+++ +KGQ++I +    +  +++    K
Sbjct: 325 LKMDKEITDINEISKETGMSTDDVIHTLEALNILREYKGQYIISLNSDKLHNYLRLWSKK 384

Query: 661 RPRLQVDPTYLRW 673
           R R  ++P  L W
Sbjct: 385 RRR--INPDLLLW 395



 Score =  143 bits (363), Expect = 2e-37
 Identities = 61/131 (46%), Positives = 76/131 (58%), Gaps = 23/131 (17%)

Query: 324 FPIECKKKPKLWICEFCLKYMTMERTYRYHKSECTYCHPPGKEIYRCGNISIYEVDGSFH 383
           +P E      ++ICEFCLKY   + +   H+ +C+  HPPG EIYR   IS +E+DG   
Sbjct: 148 YPEEFSDLDIVYICEFCLKYYGSQTSLVRHRKKCSLQHPPGNEIYRDKYISFFEIDGRKQ 207

Query: 384 KLYCQNLCLLAKLFLDHKTLYFDVDPFLFYILCVIDKYDHKTLYFDVDPFLFYILCVIDK 443
           +LYC+NLCLL+KLFLDHKTLY+DVD                       PFLFY+L     
Sbjct: 208 RLYCRNLCLLSKLFLDHKTLYYDVD-----------------------PFLFYVLTERGD 244

Query: 444 YGAHLVGYFSK 454
            G HLVGYFSK
Sbjct: 245 TGCHLVGYFSK 255


>gnl|CDD|178777 PLN03239, PLN03239, histone acetyltransferase; Provisional.
          Length = 351

 Score =  269 bits (688), Expect = 3e-84
 Identities = 138/295 (46%), Positives = 181/295 (61%), Gaps = 25/295 (8%)

Query: 18  YYVHYDGFNRRLDEWVQKHRIMNSRFDMSEQHWKNKENQAIDLLDQSDRKITRNQKRRHD 77
           YYVHY  FNRR+DEW+ K +       +   H        +      +  +        D
Sbjct: 1   YYVHYKDFNRRMDEWISKDKSNEEILALPSDH--------LATHTVGEDVVATIAAPELD 52

Query: 78  EINHVQKTYEEMDPTTAALEKEHEAITKVKYIDKIMIGKCEIDTWYFSPYPDECKKKPK- 136
           E       +E +D   AAL KEHE +TKVK +  + +G  ++DTWYFSP P E  K    
Sbjct: 53  E-------HEGLD--DAAL-KEHEEVTKVKNVAFLELGPYQMDTWYFSPLPKELFKAGGF 102

Query: 137 ---LWICEFCLKYMTME-RTYRYHKSEC--TYCHPPGKEIYRCGNISIYEVDGSFHKLYC 190
              L++CEF   +   +    R+   E      HPPG EIYRCG+++++EVDG   ++YC
Sbjct: 103 IDVLYVCEFSFGFFARKSELLRFQAKELPKERRHPPGNEIYRCGDLAMFEVDGFEERIYC 162

Query: 191 QNLCLLAKLFLDHKTLYFDVDPFLFYILCVIDKYGAHLVGYFSKEKESPDGNNVACILTL 250
           QNLC +AKLFLDHKTLYFDVDPFLFY+LC +D+ G H VGY+SKEK S  G N+ACILT 
Sbjct: 163 QNLCYIAKLFLDHKTLYFDVDPFLFYVLCEVDERGFHPVGYYSKEKYSDVGYNLACILTF 222

Query: 251 PPYQRQGYGKFLISFSYELSKVEGLIGSPEKPLSDLGKLSYRSYWSWVLLGILRN 305
           P +QR+GYG+FLI+FSYELSK E  +GSPEKP+SDLG+ +Y  YW   ++  L N
Sbjct: 223 PAHQRKGYGRFLIAFSYELSKKEEKVGSPEKPMSDLGQQAYIPYWGSTIVDFLLN 277



 Score =  249 bits (638), Expect = 7e-77
 Identities = 116/217 (53%), Positives = 155/217 (71%), Gaps = 4/217 (1%)

Query: 461 HPPGKEIYRCGNISIYEVDGSFHKLYCQNLCLLAKLFLDHKTLYFDVDPFLFYILCVIDK 520
           HPPG EIYRCG+++++EVDG   ++YCQNLC +AKLFLDHKTLYFDVDPFLFY+LC +D+
Sbjct: 136 HPPGNEIYRCGDLAMFEVDGFEERIYCQNLCYIAKLFLDHKTLYFDVDPFLFYVLCEVDE 195

Query: 521 YGAHLVGYFSKEKESPDGNNVACILTLPPYQRQGYGKFLISFSYELSKVEGLIGSPEKPL 580
            G H VGY+SKEK S  G N+ACILT P +QR+GYG+FLI+FSYELSK E  +GSPEKP+
Sbjct: 196 RGFHPVGYYSKEKYSDVGYNLACILTFPAHQRKGYGRFLIAFSYELSKKEEKVGSPEKPM 255

Query: 581 SDLGKLSYRSYWSWVLLGILRNSKGNST--TIKELSEMTSIAQTDIISTLQAMNMVKYWK 638
           SDLG+ +Y  YW   ++  L N  GN +  +I ++++ TSI   DI+  L  + ++K+  
Sbjct: 256 SDLGQQAYIPYWGSTIVDFLLNHSGNDSSLSIMDIAKKTSIMAEDIVFALNQLGILKFIN 315

Query: 639 GQHVICVTPKIVEEHIQSTQYKRPRLQVDPTYLRWTP 675
           G + I     ++EE  +    K PR  VDP+ L WTP
Sbjct: 316 GIYFIAAEKGLLEELAEKHPVKEPR--VDPSKLHWTP 350



 Score =  115 bits (289), Expect = 5e-28
 Identities = 56/126 (44%), Positives = 74/126 (58%), Gaps = 26/126 (20%)

Query: 334 LWICEFCLKYMTME-RTYRYHKSEC--TYCHPPGKEIYRCGNISIYEVDGSFHKLYCQNL 390
           L++CEF   +   +    R+   E      HPPG EIYRCG+++++EVDG   ++YCQNL
Sbjct: 106 LYVCEFSFGFFARKSELLRFQAKELPKERRHPPGNEIYRCGDLAMFEVDGFEERIYCQNL 165

Query: 391 CLLAKLFLDHKTLYFDVDPFLFYILCVIDKYDHKTLYFDVDPFLFYILCVIDKYGAHLVG 450
           C +AKLFLDHKTLYF                       DVDPFLFY+LC +D+ G H VG
Sbjct: 166 CYIAKLFLDHKTLYF-----------------------DVDPFLFYVLCEVDERGFHPVG 202

Query: 451 YFSKSE 456
           Y+SK +
Sbjct: 203 YYSKEK 208


>gnl|CDD|173359 PTZ00064, PTZ00064, histone acetyltransferase; Provisional.
          Length = 552

 Score =  273 bits (700), Expect = 2e-83
 Identities = 133/301 (44%), Positives = 188/301 (62%), Gaps = 18/301 (5%)

Query: 10  NELENCFEYYVHYDGFNRRLDEWVQKHRIMNSRFDMSE----------QHWKNKENQAID 59
           ++++  +E+YVH+ G NRRLD WV+   I  S FD+ E          Q    K   ++ 
Sbjct: 143 SQIKEDYEFYVHFRGLNRRLDRWVKGKDIKLS-FDVEELNDPNLIERFQKQGIKFISSLS 201

Query: 60  LLDQSDRKITRNQKRRHDEIN----HVQKTYEEMDPTTAALEKEHEAITKVKYIDKIMIG 115
           + + +++   +++KR    ++         +E MD   +A+  +HE  T+++ I ++ IG
Sbjct: 202 VSNSANKSGNKSKKRNVGVLDISDGEDPDEHEGMD--HSAIL-DHEETTRLRTIGRVRIG 258

Query: 116 KCEIDTWYFSPYPDECKKKPKLWICEFCLKYMTMERTYRYHKSECTYCHPPGKEIYRCGN 175
           K  +DTWYFSP PDE +    L  CE+CL +   E     H S C   HPPG EIYR  N
Sbjct: 259 KFILDTWYFSPLPDEYQNVDTLHFCEYCLDFFCFEDELIRHLSRCQLRHPPGNEIYRKDN 318

Query: 176 ISIYEVDGSFHKLYCQNLCLLAKLFLDHKTLYFDVDPFLFYILCVIDKYGAHLVGYFSKE 235
           IS++E+DG+  + Y +NLC LAKLFLDHKTL +DV+PFLFYI+  +D+ G H+VGYFSKE
Sbjct: 319 ISVFEIDGALTRGYAENLCYLAKLFLDHKTLQYDVEPFLFYIVTEVDEEGCHIVGYFSKE 378

Query: 236 KESPDGNNVACILTLPPYQRQGYGKFLISFSYELSKVEGLIGSPEKPLSDLGKLSYRSYW 295
           K S    N+ACILTLP YQR+GYGK L+  SY+LS  EG  G PE+PLSDLG+  Y ++W
Sbjct: 379 KVSLLHYNLACILTLPCYQRKGYGKLLVDLSYKLSLKEGKWGHPERPLSDLGRAIYNNWW 438

Query: 296 S 296
           +
Sbjct: 439 A 439



 Score =  206 bits (524), Expect = 2e-58
 Identities = 107/250 (42%), Positives = 144/250 (57%), Gaps = 24/250 (9%)

Query: 455 SECTYCHPPGKEIYRCGNISIYEVDGSFHKLYCQNLCLLAKLFLDHKTLYFDVDPFLFYI 514
           S C   HPPG EIYR  NIS++E+DG+  + Y +NLC LAKLFLDHKTL +DV+PFLFYI
Sbjct: 301 SRCQLRHPPGNEIYRKDNISVFEIDGALTRGYAENLCYLAKLFLDHKTLQYDVEPFLFYI 360

Query: 515 LCVIDKYGAHLVGYFSKEKESPDGNNVACILTLPPYQRQGYGKFLISFSYELSKVEGLIG 574
           +  +D+ G H+VGYFSKEK S    N+ACILTLP YQR+GYGK L+  SY+LS  EG  G
Sbjct: 361 VTEVDEEGCHIVGYFSKEKVSLLHYNLACILTLPCYQRKGYGKLLVDLSYKLSLKEGKWG 420

Query: 575 SPEKPLSDLGKLSYRSYW----SWVLLGILRNSK--------------GNSTTIKELSEM 616
            PE+PLSDLG+  Y ++W    S  LL   + +K                   I  +   
Sbjct: 421 HPERPLSDLGRAIYNNWWAHRISEYLLEYFKQNKICERGGSKQPLQVSNYWKFIDNVVRS 480

Query: 617 TSIAQTDIISTLQAMNMVKYWKGQHVICVTPKIVEEHIQSTQYKRPRLQVDPTYLRWTP- 675
           T I + D+I  L+   +++  K QH I    + ++  ++ +   RP + +   Y  W P 
Sbjct: 481 TGIRREDVIRILEENGIMRNIKDQHYIFCNQEFLKGIVKRS--GRPGITLIDKYFNWVPF 538

Query: 676 ---PPKRVHS 682
              PP  V S
Sbjct: 539 SRAPPSEVES 548



 Score =  121 bits (304), Expect = 7e-29
 Identities = 58/138 (42%), Positives = 75/138 (54%), Gaps = 23/138 (16%)

Query: 324 FPIECKKKPKLWICEFCLKYMTMERTYRYHKSECTYCHPPGKEIYRCGNISIYEVDGSFH 383
            P E +    L  CE+CL +   E     H S C   HPPG EIYR  NIS++E+DG+  
Sbjct: 270 LPDEYQNVDTLHFCEYCLDFFCFEDELIRHLSRCQLRHPPGNEIYRKDNISVFEIDGALT 329

Query: 384 KLYCQNLCLLAKLFLDHKTLYFDVDPFLFYILCVIDKYDHKTLYFDVDPFLFYILCVIDK 443
           + Y +NLC LAKLFL                       DHKTL +DV+PFLFYI+  +D+
Sbjct: 330 RGYAENLCYLAKLFL-----------------------DHKTLQYDVEPFLFYIVTEVDE 366

Query: 444 YGAHLVGYFSKSECTYCH 461
            G H+VGYFSK + +  H
Sbjct: 367 EGCHIVGYFSKEKVSLLH 384


>gnl|CDD|152153 pfam11717, Tudor-knot, RNA binding activity-knot of a
          chromodomain.  This is a novel knotted tudor domain
          which is required for binding to RNA. The know
          influences the loop conformation of the helical turn
          Ht2 - residues 61-6 3- that is located at the side
          opposite the knot in the tudor domain-chromodomain;
          stabilisation of Ht2 is essential for RNA binding.
          Length = 55

 Score = 58.4 bits (142), Expect = 3e-11
 Identities = 21/38 (55%), Positives = 25/38 (65%), Gaps = 2/38 (5%)

Query: 1  PAEIIQNRYNELENCFEYYVHYDGFNRRLDEWVQKHRI 38
           AEI+       E  +EYYVHY GFN+RLDEWV + RI
Sbjct: 18 EAEILS--IRPKEGKYEYYVHYVGFNKRLDEWVDRDRI 53


>gnl|CDD|237991 cd00024, CHROMO, Chromatin organization modifier (chromo) domain
          is a conserved region of around 50 amino acids found in
          a variety of chromosomal proteins, which appear to play
          a role in the functional organization of the eukaryotic
          nucleus. Experimental evidence implicates the chromo
          domain in the binding activity of these proteins to
          methylated histone tails and maybe RNA. May occur as
          single instance, in a tandem arrangement or followd by
          a related "chromo shadow" domain.
          Length = 55

 Score = 33.8 bits (78), Expect = 0.017
 Identities = 8/38 (21%), Positives = 18/38 (47%)

Query: 1  PAEIIQNRYNELENCFEYYVHYDGFNRRLDEWVQKHRI 38
            +I+ +R  +    +EY V + G++   D W  +  +
Sbjct: 5  VEKILDHRKKKDGGEYEYLVKWKGYSYSEDTWEPEENL 42


>gnl|CDD|173926 cd04301, NAT_SF, N-Acyltransferase superfamily: Various enzymes
           that characteristically catalyze the transfer of an acyl
           group to a substrate.  NAT (N-Acyltransferase) is a
           large superfamily of enzymes that mostly catalyze the
           transfer of an acyl group to a substrate and are
           implicated in a variety of functions, ranging from
           bacterial antibiotic resistance to circadian rhythms in
           mammals. Members include GCN5-related
           N-Acetyltransferases (GNAT) such as Aminoglycoside
           N-acetyltransferases, Histone N-acetyltransferase (HAT)
           enzymes, and Serotonin N-acetyltransferase, which
           catalyze the transfer of an acetyl group to a substrate.
           The kinetic mechanism of most GNATs involves the ordered
           formation of a ternary complex: the reaction begins with
           Acetyl Coenzyme A (AcCoA) binding, followed by binding
           of substrate, then direct transfer of the acetyl group
           from AcCoA to the substrate, followed by product and
           subsequent CoA release. Other family members include
           Arginine/ornithine N-succinyltransferase, Myristoyl-CoA:
           protein N-myristoyltransferase, and
           Acyl-homoserinelactone synthase which have a similar
           catalytic mechanism but differ in types of acyl groups
           transferred. Leucyl/phenylalanyl-tRNA-protein
           transferase and FemXAB nonribosomal peptidyltransferases
           which catalyze similar peptidyltransferase reactions are
           also included.
          Length = 65

 Score = 32.6 bits (75), Expect = 0.054
 Identities = 10/40 (25%), Positives = 18/40 (45%), Gaps = 2/40 (5%)

Query: 228 LVGY--FSKEKESPDGNNVACILTLPPYQRQGYGKFLISF 265
           +VG+   S +    D   +  +  LP Y+ +G G  L+  
Sbjct: 10  IVGFASLSPDGSGGDTAYIGDLAVLPEYRGKGIGSALLEA 49



 Score = 32.6 bits (75), Expect = 0.054
 Identities = 10/40 (25%), Positives = 18/40 (45%), Gaps = 2/40 (5%)

Query: 525 LVGY--FSKEKESPDGNNVACILTLPPYQRQGYGKFLISF 562
           +VG+   S +    D   +  +  LP Y+ +G G  L+  
Sbjct: 10  IVGFASLSPDGSGGDTAYIGDLAVLPEYRGKGIGSALLEA 49


>gnl|CDD|214605 smart00298, CHROMO, Chromatin organization modifier domain. 
          Length = 55

 Score = 32.2 bits (74), Expect = 0.078
 Identities = 9/40 (22%), Positives = 20/40 (50%), Gaps = 1/40 (2%)

Query: 1  PAEIIQNRYNELENCFEYYVHYDGFNRRLDEWVQKHRIMN 40
            +I+ +R+ +     EY V + G++   D W  +  ++N
Sbjct: 4  VEKILDHRWKKKGE-LEYLVKWKGYSYSEDTWEPEENLLN 42


>gnl|CDD|218250 pfam04760, IF2_N, Translation initiation factor IF-2, N-terminal
           region.  This conserved feature at the N-terminus of
           bacterial translation initiation factor IF2 has recently
           had its structure solved. It shows structural similarity
           to the tRNA anticodon Stem Contact Fold domains of the
           methionyl-tRNA and glutaminyl-tRNA synthetases, and a
           similar fold is also found in the B5 domain of the
           phenylalanine-tRNA synthetase.
          Length = 52

 Score = 29.7 bits (68), Expect = 0.52
 Identities = 5/28 (17%), Positives = 15/28 (53%)

Query: 608 TTIKELSEMTSIAQTDIISTLQAMNMVK 635
             + EL++   ++  ++I  L  + ++K
Sbjct: 4   IRVYELAKELGVSSKELIKKLFKLGIMK 31


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 33.3 bits (76), Expect = 0.58
 Identities = 18/53 (33%), Positives = 28/53 (52%), Gaps = 11/53 (20%)

Query: 606 NSTTIKELSEMTSIAQTDIISTLQAMNMVKYWKGQHVICVTP-KIVEEHIQST 657
           +  TI+ELS +  I +T+II +L       + KG   I VT  +I++  I S 
Sbjct: 164 SPLTIQELSTLLCIPETEIIKSL-------FLKG---ISVTVNQIIDISIISQ 206


>gnl|CDD|235563 PRK05689, fliJ, flagellar biosynthesis chaperone; Validated.
          Length = 147

 Score = 31.1 bits (71), Expect = 1.0
 Identities = 7/39 (17%), Positives = 17/39 (43%), Gaps = 7/39 (17%)

Query: 19 YVHYDGFNRRLDEWVQKHRIMNSRFDMSEQHWKNKENQA 57
          +++Y  F ++L++ + + R            W  K + A
Sbjct: 66 WINYQQFLQQLEKAITQQR-------QQLTQWTQKVDNA 97


>gnl|CDD|128778 smart00502, BBC, B-Box C-terminal domain.  Coiled coil region
           C-terminal to (some) B-Box domains.
          Length = 127

 Score = 30.3 bits (69), Expect = 1.2
 Identities = 19/98 (19%), Positives = 37/98 (37%), Gaps = 12/98 (12%)

Query: 5   IQNRYNELENCFEYYVHYDGFNRRLDEWVQKHRIMNSRFDMSEQHWKNKENQAIDLLDQS 64
           ++    ++E   +    +D     L   + K +       + E   + KEN+   L  Q 
Sbjct: 33  VEENAADVEAQIK--AAFD----ELRNALNKRKKQ-----LLEDLEEQKENKLKVLEQQ- 80

Query: 65  DRKITRNQKRRHDEINHVQKTYEEMDPTTAALEKEHEA 102
              +T+ Q++    IN  ++     DPT   L K+   
Sbjct: 81  LESLTQKQEKLSHAINFTEEALNSGDPTELLLSKKLII 118


>gnl|CDD|237470 PRK13688, PRK13688, hypothetical protein; Provisional.
          Length = 156

 Score = 30.4 bits (69), Expect = 1.9
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query: 250 LPPYQRQGYGKFLISF 265
           LP YQ +GYG+ L+ F
Sbjct: 88  LPKYQNRGYGEMLVDF 103



 Score = 30.4 bits (69), Expect = 1.9
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query: 547 LPPYQRQGYGKFLISF 562
           LP YQ +GYG+ L+ F
Sbjct: 88  LPKYQNRGYGEMLVDF 103


>gnl|CDD|222791 PHA00431, PHA00431, internal virion protein C.
          Length = 746

 Score = 30.9 bits (70), Expect = 2.7
 Identities = 8/25 (32%), Positives = 14/25 (56%)

Query: 22  YDGFNRRLDEWVQKHRIMNSRFDMS 46
           Y   +  L +  QK  I+NSR +++
Sbjct: 181 YGAHDNFLSDQAQKGAILNSRVELN 205


>gnl|CDD|216007 pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family.  This
           family contains proteins with N-acetyltransferase
           functions such as Elp3-related proteins.
          Length = 80

 Score = 28.4 bits (64), Expect = 2.7
 Identities = 13/58 (22%), Positives = 23/58 (39%), Gaps = 2/58 (3%)

Query: 222 DKYGAHLVGY--FSKEKESPDGNNVACILTLPPYQRQGYGKFLISFSYELSKVEGLIG 277
            +    LVG+   S   E  +   +  +   P Y+ +G G  L+    E ++  GL  
Sbjct: 1   AEEDGELVGFASLSIIDEEGNVAEIEGLAVDPEYRGKGIGTALLEALEEYARELGLKR 58



 Score = 28.4 bits (64), Expect = 2.7
 Identities = 13/58 (22%), Positives = 23/58 (39%), Gaps = 2/58 (3%)

Query: 519 DKYGAHLVGY--FSKEKESPDGNNVACILTLPPYQRQGYGKFLISFSYELSKVEGLIG 574
            +    LVG+   S   E  +   +  +   P Y+ +G G  L+    E ++  GL  
Sbjct: 1   AEEDGELVGFASLSIIDEEGNVAEIEGLAVDPEYRGKGIGTALLEALEEYARELGLKR 58


>gnl|CDD|222340 pfam13718, GNAT_acetyltr_2, GNAT acetyltransferase 2.  This domain
           has N-acetyltransferase activity. It has a GCN5-related
           N-acetyltransferase (GNAT) fold.
          Length = 169

 Score = 29.0 bits (66), Expect = 5.1
 Identities = 11/29 (37%), Positives = 14/29 (48%)

Query: 247 ILTLPPYQRQGYGKFLISFSYELSKVEGL 275
           I   P YQR GYG  L+    +    EG+
Sbjct: 94  IAVHPDYQRMGYGSRLLELLEQYYIPEGV 122



 Score = 29.0 bits (66), Expect = 5.1
 Identities = 11/29 (37%), Positives = 14/29 (48%)

Query: 544 ILTLPPYQRQGYGKFLISFSYELSKVEGL 572
           I   P YQR GYG  L+    +    EG+
Sbjct: 94  IAVHPDYQRMGYGSRLLELLEQYYIPEGV 122


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 30.0 bits (67), Expect = 5.9
 Identities = 17/89 (19%), Positives = 33/89 (37%), Gaps = 9/89 (10%)

Query: 28  RLDEWVQKHRIMNSRFDMSEQHWKNKENQAIDLLDQ---------SDRKITRNQKRRHDE 78
            L +  Q          M  ++ K  + +A ++ DQ         S R+I ++ +   D 
Sbjct: 190 TLRQVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREIVKSYENELDP 249

Query: 79  INHVQKTYEEMDPTTAALEKEHEAITKVK 107
           + +  K  E        L+ E +A+   K
Sbjct: 250 LKNRLKEIEHNLSKIMKLDNEIKALKSRK 278


>gnl|CDD|222185 pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain.  This
           domain catalyzes N-acetyltransferase reactions.
          Length = 79

 Score = 27.5 bits (62), Expect = 5.9
 Identities = 7/22 (31%), Positives = 11/22 (50%)

Query: 244 VACILTLPPYQRQGYGKFLISF 265
           +  +   P Y+ QGYG  L+  
Sbjct: 29  IGGVAVDPEYRGQGYGSKLLRH 50



 Score = 27.5 bits (62), Expect = 5.9
 Identities = 7/22 (31%), Positives = 11/22 (50%)

Query: 541 VACILTLPPYQRQGYGKFLISF 562
           +  +   P Y+ QGYG  L+  
Sbjct: 29  IGGVAVDPEYRGQGYGSKLLRH 50


>gnl|CDD|188360 TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype
           I-C/DVULG.  The CRISPR-associated protein Cas1 is
           virtually universal to CRISPR systems. CRISPR, an
           acronym for Clustered Regularly Interspaced Short
           Palindromic Repeats, is prokaryotic immunity system for
           foreign DNA, mostly from phage. CRISPR systems belong to
           different subtypes, distinguished by both nature of the
           repeats, the makeup of the cohort of associated Cas
           proteins, and by molecular phylogeny within the more
           universal Cas proteins such as this one. This model is
           of type EXCEPTION and provides more specific information
           than the EQUIVALOG model TIGR00287. It describes the
           Cas1 protein particular to the DVULG subtype of
           CRISPR/Cas system.
          Length = 340

 Score = 29.5 bits (67), Expect = 6.7
 Identities = 12/42 (28%), Positives = 23/42 (54%), Gaps = 3/42 (7%)

Query: 42  RFDMSEQHWKNKENQAIDLLDQSDRK--ITRNQKRRHDEINH 81
           R  ++ + ++ +E  A+ LL    RK  +   Q+R+ +EI H
Sbjct: 264 RGQLTAKDFEVREGGAV-LLTDDARKTVLVAYQERKQEEILH 304


>gnl|CDD|187852 cd09721, Cas1_I-C, CRISPR/Cas system-associated protein Cas1.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Cas1 is the most universal CRISPR system protein thought
           to be involved in spacer integration; Cas1 is
           metal-dependent deoxyribonuclease, also binds RNA; Shown
           to possess a unique fold consisting of a N-terminal
           beta-strand domain and a C-terminal alpha-helical
           domain.
          Length = 338

 Score = 29.5 bits (67), Expect = 6.7
 Identities = 12/42 (28%), Positives = 23/42 (54%), Gaps = 3/42 (7%)

Query: 42  RFDMSEQHWKNKENQAIDLLDQSDRK--ITRNQKRRHDEINH 81
           R  ++ + ++ +E  A+ LL    RK  +   Q+R+ +EI H
Sbjct: 263 RGQLTAKDFEVREGGAV-LLTDDARKTVLVAYQERKQEEITH 303


>gnl|CDD|217951 pfam04188, Mannosyl_trans2, Mannosyltransferase (PIG-V)).  This is
           a family of eukaryotic ER membrane proteins that are
           involved in the synthesis of
           glycosylphosphatidylinositol (GPI), a glycolipid that
           anchors many proteins to the eukaryotic cell surface.
           Proteins in this family are involved in transferring the
           second mannose in the biosynthetic pathway of GPI.
          Length = 412

 Score = 29.0 bits (65), Expect = 9.0
 Identities = 12/36 (33%), Positives = 17/36 (47%), Gaps = 2/36 (5%)

Query: 278 SPEKPLSDLGKLSYRSYWS--WVLLGILRNSNFFPF 311
              KPL D   + +  YW   ++LL  L  +NF P 
Sbjct: 376 KMWKPLGDDKIVKFLIYWLAFYILLQTLLFANFLPP 411


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.324    0.141    0.457 

Gapped
Lambda     K      H
   0.267   0.0753    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 35,554,618
Number of extensions: 3486147
Number of successful extensions: 2830
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2801
Number of HSP's successfully gapped: 52
Length of query: 687
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 583
Effective length of database: 6,324,786
Effective search space: 3687350238
Effective search space used: 3687350238
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 62 (27.6 bits)