RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6980
         (760 letters)



>gnl|CDD|215056 PLN00104, PLN00104, MYST -like histone acetyltransferase;
           Provisional.
          Length = 450

 Score =  376 bits (966), Expect = e-123
 Identities = 156/301 (51%), Positives = 198/301 (65%), Gaps = 22/301 (7%)

Query: 256 QAHEEATKATPLLPLAVPEQINPAAIEFGQYEVKTWYSSPFPQEYARLPKLFLCEFCLKY 315
           + HEE TK             N A IE G+YE+ TWY SPFP EY    KL+ CEFCLK+
Sbjct: 159 REHEEFTKVK-----------NIATIELGRYEIDTWYFSPFPPEYNDCSKLYFCEFCLKF 207

Query: 316 TKSKAVLERHRDKCAWRHPPATEIYRK----ENLSVFEVDGNQNKFYCQNLCLLAKLFLD 371
            K K  L+RH  KC  +HPP  EIYR     E LS+FEVDG +NK YCQNLC LAKLFLD
Sbjct: 208 MKRKEQLQRHMKKCDLKHPPGDEIYRHPTRQEGLSMFEVDGKKNKVYCQNLCYLAKLFLD 267

Query: 372 HKTLYYDVEPFLFYVLTQNDDKGCHLVGYFSKEKHCQQKYNVSCIMTLPQYQRKGYGRFL 431
           HKTLYYDV+ FLFYVL + DD+GCH+VGYFSKEKH ++ YN++CI+TLP YQRKGYG+FL
Sbjct: 268 HKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHSEEDYNLACILTLPPYQRKGYGKFL 327

Query: 432 IDFSYLLSKKEGQRGTPEKPLSDLGRVSYHAYWKSVLLEYLDTIRNQKLICIDQMCADTG 491
           I FSY LSK+EG+ GTPE+PLSDLG VSY  YW  VLLE L   +    I I ++   T 
Sbjct: 328 IAFSYELSKREGKVGTPERPLSDLGLVSYRGYWTRVLLEILKKHKGN--ISIKELSDMTA 385

Query: 492 LYHHDVAETLELLGMLRTKHGDSSEPCIVINWAIVDAHMKRLEQSKTRIKIDPECLRWTP 551
           +   D+  TL+ L +++ + G   +  I  +  +++ H+K     +  +++DP  L WTP
Sbjct: 386 IKAEDIVSTLQSLNLIQYRKG---QHVICADPKVLEEHLK--AAGRGGLEVDPSKLIWTP 440

Query: 552 L 552
            
Sbjct: 441 Y 441



 Score = 47.1 bits (112), Expect = 3e-05
 Identities = 21/46 (45%), Positives = 24/46 (52%), Gaps = 11/46 (23%)

Query: 34  QAHEEATKATPLLPLAVPEQINPAAIEFGQYEVKTWYSSPFPQEYA 79
           + HEE TK             N A IE G+YE+ TWY SPFP EY 
Sbjct: 159 REHEEFTKVK-----------NIATIELGRYEIDTWYFSPFPPEYN 193


>gnl|CDD|216741 pfam01853, MOZ_SAS, MOZ/SAS family.  This region of these proteins
           has been suggested to be homologous to
           acetyltransferases.
          Length = 189

 Score =  333 bits (857), Expect = e-110
 Identities = 113/183 (61%), Positives = 139/183 (75%)

Query: 331 WRHPPATEIYRKENLSVFEVDGNQNKFYCQNLCLLAKLFLDHKTLYYDVEPFLFYVLTQN 390
            RHPP  EIYRK N+SVFEVDG + K YCQNLCLLAKLFLDHKTLYYDV+PFLFY+LT+ 
Sbjct: 1   LRHPPGNEIYRKGNISVFEVDGRKQKLYCQNLCLLAKLFLDHKTLYYDVDPFLFYILTET 60

Query: 391 DDKGCHLVGYFSKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDFSYLLSKKEGQRGTPEK 450
           D+ GCH+VGYFSKEK   + YN++CI+TLP YQRKGYG+ LIDFSY LS+KEG+ G+PEK
Sbjct: 61  DETGCHIVGYFSKEKESSENYNLACILTLPPYQRKGYGKLLIDFSYELSRKEGKIGSPEK 120

Query: 451 PLSDLGRVSYHAYWKSVLLEYLDTIRNQKLICIDQMCADTGLYHHDVAETLELLGMLRTK 510
           PLSDLG +SY +YW   LLE L   R++  I I ++   TG+   D+  TL+ L M++  
Sbjct: 121 PLSDLGLLSYRSYWSQTLLELLLEHRDEHKITIKELSKMTGITVEDIISTLQSLNMIKYY 180

Query: 511 HGD 513
            G 
Sbjct: 181 KGQ 183


>gnl|CDD|215642 PLN03238, PLN03238, probable histone acetyltransferase MYST;
           Provisional.
          Length = 290

 Score =  332 bits (854), Expect = e-109
 Identities = 147/298 (49%), Positives = 192/298 (64%), Gaps = 22/298 (7%)

Query: 258 HEEATKATPLLPLAVPEQINPAAIEFGQYEVKTWYSSPFPQEYARLPKLFLCEFCLKYTK 317
           HEE TK             N   IE G+YE+ TWY SP+P+ YA   KL++CE+CLKY +
Sbjct: 11  HEETTKVK-----------NIEMIELGKYEMDTWYYSPYPEPYASCTKLYICEYCLKYMR 59

Query: 318 SKAVLERHRDKCAWRHPPATEIYR---KENLSVFEVDGNQNKFYCQNLCLLAKLFLDHKT 374
            K  L RH  KC  R PP   IY    +  LSVFEVDG + K YCQNLCLLAKLFLDHKT
Sbjct: 60  KKKSLLRHLAKCDIRQPPGGGIYGAVTEGPLSVFEVDGKKAKVYCQNLCLLAKLFLDHKT 119

Query: 375 LYYDVEPFLFYVLTQNDDKGCHLVGYFSKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDF 434
           LYYDV+PFLFYV+T+ DD G H+VGYFSKEK   + YN++CI+TLP YQRKGYG+FLI F
Sbjct: 120 LYYDVDPFLFYVMTEVDDHGSHIVGYFSKEKVSAEDYNLACILTLPPYQRKGYGKFLISF 179

Query: 435 SYLLSKKEGQRGTPEKPLSDLGRVSYHAYWKSVLLEYLDTIRNQKLICIDQMCADTGLYH 494
           +Y LSK+EG+ GTPE+PLSDLG+VS+ +YW  VLLE L  ++    + I  +   TG+  
Sbjct: 180 AYELSKREGKVGTPERPLSDLGKVSFRSYWTRVLLEQLRDVKGD--VSIKDLSLATGIRG 237

Query: 495 HDVAETLELLGMLRTKHGDSSEPCIVINWAIVDAHMKRLEQSKTRIKIDPECLRWTPL 552
            D+  TL+ L +++   G   +  I ++  ++D H  +    +  I++D     W PL
Sbjct: 238 EDIVSTLQSLNLIKYWKG---QHVIHVDQRVLDEHWAKFAHQRV-IEVDCL--HWQPL 289



 Score = 43.7 bits (103), Expect = 2e-04
 Identities = 20/49 (40%), Positives = 26/49 (53%), Gaps = 11/49 (22%)

Query: 36 HEEATKATPLLPLAVPEQINPAAIEFGQYEVKTWYSSPFPQEYARNTQV 84
          HEE TK             N   IE G+YE+ TWY SP+P+ YA  T++
Sbjct: 11 HEETTKVK-----------NIEMIELGKYEMDTWYYSPYPEPYASCTKL 48


>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
           structure and dynamics].
          Length = 395

 Score =  327 bits (840), Expect = e-105
 Identities = 140/269 (52%), Positives = 192/269 (71%), Gaps = 6/269 (2%)

Query: 281 IEFGQYEVKTWYSSPFPQEYARLPKLFLCEFCLKYTKSKAVLERHRDKCAWRHPPATEIY 340
           I+ G YE++ WY SP+P+E++ L  +++CEFCLKY  S+  L RHR KC+ +HPP  EIY
Sbjct: 133 IKLGNYEIEPWYFSPYPEEFSDLDIVYICEFCLKYYGSQTSLVRHRKKCSLQHPPGNEIY 192

Query: 341 RKENLSVFEVDGNQNKFYCQNLCLLAKLFLDHKTLYYDVEPFLFYVLTQNDDKGCHLVGY 400
           R + +S FE+DG + + YC+NLCLL+KLFLDHKTLYYDV+PFLFYVLT+  D GCHLVGY
Sbjct: 193 RDKYISFFEIDGRKQRLYCRNLCLLSKLFLDHKTLYYDVDPFLFYVLTERGDTGCHLVGY 252

Query: 401 FSKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDFSYLLSKKEGQRGTPEKPLSDLGRVSY 460
           FSKEK  +Q YN++CI+TLP YQR+GYG+ LIDFSYLLS+KEG+ G+PEKPLSDLG +SY
Sbjct: 253 FSKEKESEQDYNLACILTLPPYQRRGYGKLLIDFSYLLSQKEGKVGSPEKPLSDLGLLSY 312

Query: 461 HAYWKSVLLEYLDTIRNQKLICIDQMCADTGLYHHDVAETLELLGMLRTKHGDSSEPCIV 520
            AYW  ++ + L  +  +    I+++  +TG+   DV  TLE L +LR       +  I 
Sbjct: 313 RAYWSEIVAKLLLKMDKEIT-DINEISKETGMSTDDVIHTLEALNILRE---YKGQYIIS 368

Query: 521 INWAIVDAHMKRLEQSKTRIKIDPECLRW 549
           +N   +  +++    SK R +I+P+ L W
Sbjct: 369 LNSDKLHNYLR--LWSKKRRRINPDLLLW 395



 Score = 29.4 bits (66), Expect = 7.8
 Identities = 11/26 (42%), Positives = 18/26 (69%)

Query: 59  IEFGQYEVKTWYSSPFPQEYARNTQV 84
           I+ G YE++ WY SP+P+E++    V
Sbjct: 133 IKLGNYEIEPWYFSPYPEEFSDLDIV 158


>gnl|CDD|173359 PTZ00064, PTZ00064, histone acetyltransferase; Provisional.
          Length = 552

 Score =  275 bits (705), Expect = 1e-83
 Identities = 126/271 (46%), Positives = 162/271 (59%), Gaps = 28/271 (10%)

Query: 256 QAHEEATKATPLLPLAVPEQINPAAIEFGQYEVKTWYSSPFPQEYARLPKLFLCEFCLKY 315
             HEE T+   +             +  G++ + TWY SP P EY  +  L  CE+CL +
Sbjct: 241 LDHEETTRLRTI-----------GRVRIGKFILDTWYFSPLPDEYQNVDTLHFCEYCLDF 289

Query: 316 TKSKAVLERHRDKCAWRHPPATEIYRKENLSVFEVDGNQNKFYCQNLCLLAKLFLDHKTL 375
              +  L RH  +C  RHPP  EIYRK+N+SVFE+DG   + Y +NLC LAKLFLDHKTL
Sbjct: 290 FCFEDELIRHLSRCQLRHPPGNEIYRKDNISVFEIDGALTRGYAENLCYLAKLFLDHKTL 349

Query: 376 YYDVEPFLFYVLTQNDDKGCHLVGYFSKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDFS 435
            YDVEPFLFY++T+ D++GCH+VGYFSKEK     YN++CI+TLP YQRKGYG+ L+D S
Sbjct: 350 QYDVEPFLFYIVTEVDEEGCHIVGYFSKEKVSLLHYNLACILTLPCYQRKGYGKLLVDLS 409

Query: 436 YLLSKKEGQRGTPEKPLSDLGRVSYHAYWKSVLLEYLDTIRNQKLIC------------- 482
           Y LS KEG+ G PE+PLSDLGR  Y+ +W   + EYL     Q  IC             
Sbjct: 410 YKLSLKEGKWGHPERPLSDLGRAIYNNWWAHRISEYLLEYFKQNKICERGGSKQPLQVSN 469

Query: 483 ----IDQMCADTGLYHHDVAETLELLGMLRT 509
               ID +   TG+   DV   LE  G++R 
Sbjct: 470 YWKFIDNVVRSTGIRREDVIRILEENGIMRN 500



 Score = 33.5 bits (76), Expect = 0.52
 Identities = 13/45 (28%), Positives = 19/45 (42%), Gaps = 11/45 (24%)

Query: 34  QAHEEATKATPLLPLAVPEQINPAAIEFGQYEVKTWYSSPFPQEY 78
             HEE T+   +             +  G++ + TWY SP P EY
Sbjct: 241 LDHEETTRLRTI-----------GRVRIGKFILDTWYFSPLPDEY 274


>gnl|CDD|178777 PLN03239, PLN03239, histone acetyltransferase; Provisional.
          Length = 351

 Score =  261 bits (667), Expect = 2e-80
 Identities = 139/333 (41%), Positives = 194/333 (58%), Gaps = 30/333 (9%)

Query: 229 VSEDVVSKLLATDLAPGVTRKDIDLYKQAHEEATKATPLLPLAVPEQINPAAIEFGQYEV 288
           V EDVV+ + A +L       D  L  + HEE TK             N A +E G Y++
Sbjct: 38  VGEDVVATIAAPELDEHEGLDDAAL--KEHEEVTKVK-----------NVAFLELGPYQM 84

Query: 289 KTWYSSPFPQEYAR----LPKLFLCEFCLKYTKSKAVLERHRDKC---AWRHPPATEIYR 341
            TWY SP P+E  +    +  L++CEF   +   K+ L R + K      RHPP  EIYR
Sbjct: 85  DTWYFSPLPKELFKAGGFIDVLYVCEFSFGFFARKSELLRFQAKELPKERRHPPGNEIYR 144

Query: 342 KENLSVFEVDGNQNKFYCQNLCLLAKLFLDHKTLYYDVEPFLFYVLTQNDDKGCHLVGYF 401
             +L++FEVDG + + YCQNLC +AKLFLDHKTLY+DV+PFLFYVL + D++G H VGY+
Sbjct: 145 CGDLAMFEVDGFEERIYCQNLCYIAKLFLDHKTLYFDVDPFLFYVLCEVDERGFHPVGYY 204

Query: 402 SKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDFSYLLSKKEGQRGTPEKPLSDLGRVSYH 461
           SKEK+    YN++CI+T P +QRKGYGRFLI FSY LSKKE + G+PEKP+SDLG+ +Y 
Sbjct: 205 SKEKYSDVGYNLACILTFPAHQRKGYGRFLIAFSYELSKKEEKVGSPEKPMSDLGQQAYI 264

Query: 462 AYWKSVLLEY-LDTIRNQKLICIDQMCADTGLYHHDVAETLELLGMLRTKHGDSSEPCIV 520
            YW S ++++ L+   N   + I  +   T +   D+   L  LG+L+  +G       +
Sbjct: 265 PYWGSTIVDFLLNHSGNDSSLSIMDIAKKTSIMAEDIVFALNQLGILKFING-------I 317

Query: 521 INWAIVDAHMKRLEQSK--TRIKIDPECLRWTP 551
              A     ++ L +       ++DP  L WTP
Sbjct: 318 YFIAAEKGLLEELAEKHPVKEPRVDPSKLHWTP 350



 Score = 41.6 bits (97), Expect = 0.001
 Identities = 25/76 (32%), Positives = 33/76 (43%), Gaps = 12/76 (15%)

Query: 36  HEEATKATPLLPLAVPEQINPAAIEFGQYEVKTWYSSPFPQEYARNTQVVSKLLATDLAP 95
           HEE TK             N A +E G Y++ TWY SP P+E  +    +  L   + + 
Sbjct: 65  HEEVTKVK-----------NVAFLELGPYQMDTWYFSPLPKELFKAGGFIDVLYVCEFSF 113

Query: 96  GV-TRKDIDLYKQAHE 110
           G   RK   L  QA E
Sbjct: 114 GFFARKSELLRFQAKE 129


>gnl|CDD|214584 smart00249, PHD, PHD zinc finger.  The plant homeodomain (PHD)
           finger is a C4HC3 zinc-finger-like motif found in
           nuclear proteins thought to be involved in epigenetics
           and chromatin-mediated transcriptional regulation. The
           PHD finger binds two zinc ions using the so-called
           'cross-brace' motif and is thus structurally related to
           the RING finger and the FYVE finger. It is not yet known
           if PHD fingers have a common molecular function. Several
           reports suggest that it can function as a
           protein-protein interacton domain and it was recently
           demonstrated that the PHD finger of p300 can cooperate
           with the adjacent BROMO domain in nucleosome binding in
           vitro. Other reports suggesting that the PHD finger is a
           ubiquitin ligase have been refuted as these domains were
           RING fingers misidentified as PHD fingers.
          Length = 47

 Score = 40.3 bits (94), Expect = 9e-05
 Identities = 13/34 (38%), Positives = 17/34 (50%), Gaps = 1/34 (2%)

Query: 141 LIKCCACNVYYHIICLQPPLERRLK-VPWKCTSC 173
           L++C  C+ +YH  CL PPL        W C  C
Sbjct: 14  LLQCDGCDRWYHQTCLGPPLLEEEPDGKWYCPKC 47


>gnl|CDD|173926 cd04301, NAT_SF, N-Acyltransferase superfamily: Various enzymes
           that characteristically catalyze the transfer of an acyl
           group to a substrate.  NAT (N-Acyltransferase) is a
           large superfamily of enzymes that mostly catalyze the
           transfer of an acyl group to a substrate and are
           implicated in a variety of functions, ranging from
           bacterial antibiotic resistance to circadian rhythms in
           mammals. Members include GCN5-related
           N-Acetyltransferases (GNAT) such as Aminoglycoside
           N-acetyltransferases, Histone N-acetyltransferase (HAT)
           enzymes, and Serotonin N-acetyltransferase, which
           catalyze the transfer of an acetyl group to a substrate.
           The kinetic mechanism of most GNATs involves the ordered
           formation of a ternary complex: the reaction begins with
           Acetyl Coenzyme A (AcCoA) binding, followed by binding
           of substrate, then direct transfer of the acetyl group
           from AcCoA to the substrate, followed by product and
           subsequent CoA release. Other family members include
           Arginine/ornithine N-succinyltransferase, Myristoyl-CoA:
           protein N-myristoyltransferase, and
           Acyl-homoserinelactone synthase which have a similar
           catalytic mechanism but differ in types of acyl groups
           transferred. Leucyl/phenylalanyl-tRNA-protein
           transferase and FemXAB nonribosomal peptidyltransferases
           which catalyze similar peptidyltransferase reactions are
           also included.
          Length = 65

 Score = 38.8 bits (91), Expect = 5e-04
 Identities = 12/53 (22%), Positives = 23/53 (43%), Gaps = 6/53 (11%)

Query: 384 FYVLTQNDDKGCHLVGY--FSKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDF 434
           F V   + +    +VG+   S +        +  +  LP+Y+ KG G  L++ 
Sbjct: 1   FLVAEDDGE----IVGFASLSPDGSGGDTAYIGDLAVLPEYRGKGIGSALLEA 49


>gnl|CDD|201356 pfam00628, PHD, PHD-finger.  PHD folds into an interleaved type of
           Zn-finger chelating 2 Zn ions in a similar manner to
           that of the RING and FYVE domains. Several PHD fingers
           have been identified as binding modules of methylated
           histone H3.
          Length = 51

 Score = 35.5 bits (82), Expect = 0.004
 Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 2/38 (5%)

Query: 141 LIKCCACNVYYHIICLQPPLER--RLKVPWKCTSCESG 176
           L+ C  C+ ++H+ CL PPLE     +  W C  C+  
Sbjct: 14  LLLCDGCDRWFHLACLGPPLEPEEIPEGEWYCPECKPK 51


>gnl|CDD|222185 pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain.  This
           domain catalyzes N-acetyltransferase reactions.
          Length = 79

 Score = 35.2 bits (82), Expect = 0.011
 Identities = 10/54 (18%), Positives = 19/54 (35%), Gaps = 9/54 (16%)

Query: 383 LFYVLTQNDDKGCHLVGY--FSKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDF 434
            F V   +      +VG+              +  +   P+Y+ +GYG  L+  
Sbjct: 4   HFLVAYDDG----EIVGFLRLRPIGEGAY---IGGVAVDPEYRGQGYGSKLLRH 50


>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
           2'-phosphodiesterase/3'-nucleotidase precursor protein;
           Reviewed.
          Length = 814

 Score = 38.3 bits (89), Expect = 0.019
 Identities = 20/97 (20%), Positives = 38/97 (39%), Gaps = 3/97 (3%)

Query: 611 LSLDTDAASPVVEVTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNT 670
           L+L    AS       ++    + + +T A +TT  E+  TEE D         + ++  
Sbjct: 13  LTLALLTASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAP 72

Query: 671 TASETTASETPC---TEELDVMTPSSSLNESGAGVPV 704
           ++SET  +  P    T+       + +   +    PV
Sbjct: 73  SSSETAETSDPTSEATDTTTSEARTVTPAATETSKPV 109


>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
          Length = 1465

 Score = 37.9 bits (88), Expect = 0.027
 Identities = 22/134 (16%), Positives = 50/134 (37%), Gaps = 11/134 (8%)

Query: 587  EKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESKNTTASETTAS 646
            + +D+   E     +  K+G ++  + +  AA P     P   +K   +   +  +    
Sbjct: 1315 DDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKP-----PAAAKKRGPATVQSGQKLLTE 1369

Query: 647  ETPCTEELDVMTPSS-----RKESESKNTTASETTASETPCTEELDVMTPSSSLNESGAG 701
                 E + + +P       R    +K + +    A+    TE  + ++ SSS  +    
Sbjct: 1370 MLKPAEAIGI-SPEKKVRKMRASPFNKKSGSVLGRAATNKETESSENVSGSSSSEKDEID 1428

Query: 702  VPVVKKRRKFRRKK 715
            V    + ++  RK+
Sbjct: 1429 VSAKPRPQRANRKQ 1442


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 35.1 bits (81), Expect = 0.12
 Identities = 13/101 (12%), Positives = 30/101 (29%), Gaps = 5/101 (4%)

Query: 550 TPLVSHIVNPYK-TLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRK 608
           + ++  I++ +  T  K   +P S    +    + E      E E +  T   K  +   
Sbjct: 2   SLILCMILSFFSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKED 61

Query: 609 RKLSLDTDAASPVVEVTPKKTRKESESKNTTASETTASETP 649
                + ++     +   KK  K+             +   
Sbjct: 62  ----KNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKK 98



 Score = 29.3 bits (66), Expect = 9.5
 Identities = 20/102 (19%), Positives = 35/102 (34%), Gaps = 5/102 (4%)

Query: 635 SKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPCTEELDVMTPSSS 694
           S  T  S+        +  L +        SE +N  A+ +T  +    +  +    S  
Sbjct: 13  SGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEK 72

Query: 695 LNESGAGVPVVKKRRKFRRKKNNTGWDTPKKKKKSTNITPNP 736
             +        +K+      +   G+ TPKK KK+    P P
Sbjct: 73  KKKKKK-----EKKEPKSEGETKLGFKTPKKSKKTKKKPPKP 109


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
           Provisional.
          Length = 481

 Score = 34.8 bits (79), Expect = 0.17
 Identities = 38/197 (19%), Positives = 68/197 (34%), Gaps = 29/197 (14%)

Query: 561 KTLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRK---------- 610
           K  T + + P++    + + T   TT     AET+E  VV +       K          
Sbjct: 156 KETTTQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVKSGDTIWALSV 215

Query: 611 ---------LSLDTDAASPVV---EVTPKKTRKESESKNTTASETTASETPCTEELDVMT 658
                    +S +  ++S +    ++  K+T   +  K    +E  A+E      +   T
Sbjct: 216 KYGVSVQDIMSWNNLSSSSIYVGQKLAIKQTANTATPKAEVKTEAPAAEKQAAPVVKENT 275

Query: 659 PSSRKESESKNTTASETTASETPCTEELDVMTPSSSLNESGAGVPVVKKRRKFRRKKNNT 718
            ++   +E K TT  + TA + P         PS++ N +                 NNT
Sbjct: 276 NTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNT-------NTNTNTNNT 328

Query: 719 GWDTPKKKKKSTNITPN 735
              TP K   +   +  
Sbjct: 329 NTSTPSKNTNTNTNSNT 345



 Score = 30.2 bits (67), Expect = 4.4
 Identities = 25/118 (21%), Positives = 50/118 (42%), Gaps = 15/118 (12%)

Query: 561 KTLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAASP 620
           KT    + K ++    +   T T TTEK+ E  T+++T  K             T+AA P
Sbjct: 257 KTEAPAAEKQAAPVVKENTNTNTATTEKK-ETTTQQQTAPKAP-----------TEAAKP 304

Query: 621 VVEVTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTAS 678
               +      ++ +   T +  T + TP     +  T ++   + + NT A++ +++
Sbjct: 305 APAPSTNTNANKTNTNTNTNTNNTNTSTPSK---NTNTNTNSNTNTNSNTNANQGSSN 359


>gnl|CDD|237470 PRK13688, PRK13688, hypothetical protein; Provisional.
          Length = 156

 Score = 33.4 bits (77), Expect = 0.17
 Identities = 10/16 (62%), Positives = 13/16 (81%)

Query: 419 LPQYQRKGYGRFLIDF 434
           LP+YQ +GYG  L+DF
Sbjct: 88  LPKYQNRGYGEMLVDF 103


>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
          Length = 592

 Score = 34.7 bits (79), Expect = 0.19
 Identities = 26/96 (27%), Positives = 40/96 (41%), Gaps = 4/96 (4%)

Query: 551 PLVSHIVNPYKTLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRK 610
           P +  IV+  +T   + +  S+  N   E  AT T E   E   E     K+T+   + K
Sbjct: 44  PGIEVIVSADETAPADEASKSAEANTTKEAPATATPENTTEPTVEP----KQTETKEQTK 99

Query: 611 LSLDTDAASPVVEVTPKKTRKESESKNTTASETTAS 646
              +   A+  VE  P +    S   N T+S T A+
Sbjct: 100 TPEEKQPAAKQVEKAPAEPATVSNPDNATSSSTPAT 135



 Score = 29.7 bits (66), Expect = 7.0
 Identities = 15/61 (24%), Positives = 27/61 (44%), Gaps = 2/61 (3%)

Query: 618 ASPVVEVTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTA 677
           A+P     P    K++E+K  T  +T   + P  ++++         S   N T+S T A
Sbjct: 77  ATPENTTEPTVEPKQTETKEQT--KTPEEKQPAAKQVEKAPAEPATVSNPDNATSSSTPA 134

Query: 678 S 678
           +
Sbjct: 135 T 135


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 33.9 bits (78), Expect = 0.36
 Identities = 32/155 (20%), Positives = 49/155 (31%), Gaps = 7/155 (4%)

Query: 587 EKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESKNTTASETTAS 646
           E E+E++ EE     K   GR RK   +        +   K+ + E + K  +  E    
Sbjct: 402 EGEEESDEEENEEPSKKNVGR-RKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEEL- 459

Query: 647 ETPCTEELDVMTPSSRKESESKNTTASETTASETPCTEELDVMTPSSSLNESGAG--VPV 704
           E     +++ +     K SE       E    E          T S   +          
Sbjct: 460 EDEEEAKVEKVANKLLKRSEKAQKEEEEEELDE---ENPWLKTTSSVGKSAKKQDSKKKS 516

Query: 705 VKKRRKFRRKKNNTGWDTPKKKKKSTNITPNPVLK 739
             K  K   K +       KKKKK  +I  +  L 
Sbjct: 517 SSKLDKAANKISKAAVKVKKKKKKEKSIDLDDDLI 551



 Score = 32.0 bits (73), Expect = 1.4
 Identities = 34/172 (19%), Positives = 61/172 (35%), Gaps = 19/172 (11%)

Query: 565 KESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRKLS-LDTDAASPVVE 623
           +E ++  S  NV       E  EKE E++  ++    + K  ++        D     VE
Sbjct: 409 EEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVE 468

Query: 624 VTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPCT 683
               K  K SE       E    E     +    T SS  +S  K  +  ++++      
Sbjct: 469 KVANKLLKRSEKAQKEEEEEELDEENPWLK----TTSSVGKSAKKQDSKKKSSSKLD--- 521

Query: 684 EELDVMTPSSSLNESGAGVPVVKKRRKFRRKKNNTGWDTPKKKKKSTNITPN 735
                    ++   S A V V KK++K   +K+    D    ++ S  +  +
Sbjct: 522 --------KAANKISKAAVKVKKKKKK---EKSIDLDDDLIDEEDSIKLDVD 562


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 33.6 bits (76), Expect = 0.37
 Identities = 24/142 (16%), Positives = 52/142 (36%), Gaps = 11/142 (7%)

Query: 580 TTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVT---------PKKTR 630
            T T  + K ++ ++ +   V   K  R   +++   A S   +            +K  
Sbjct: 21  ITVTTQSVKAEKIQSTKVDKVPTLKAERLAMINITAGANSATTQAANTRQERTPKLEKAP 80

Query: 631 KESESKNTTASETTASETPCTEE--LDVMTPSSRKESESKNTTASETTASETPCTEELDV 688
             +E K + +     S+    E+  L++    + K+ +S+ TT S T  ++       + 
Sbjct: 81  NTNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKVTTPPSTNT 140

Query: 689 MTPSSSLNESGAGVPVVKKRRK 710
             P  S        P +K+ + 
Sbjct: 141 PQPMQSTKSDTPQSPTIKQAQT 162


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This
           family includes several uncharacterized proteins from
           Caenorhabditis elegans.
          Length = 517

 Score = 33.1 bits (74), Expect = 0.61
 Identities = 53/188 (28%), Positives = 83/188 (44%), Gaps = 17/188 (9%)

Query: 558 NPYKTLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDA 617
           N  K+ ++ ++ P+S   V  E     T E   +A  +E    KK K+ +++  S +  A
Sbjct: 219 NKKKSESEATAAPASVEQV-VEQPKVVTEEPHQQAAPQE----KKNKKNKRKSESENVPA 273

Query: 618 AS-----PVVEVTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTA 672
           AS     PVVE TP  +  E++ KN    + + SE    E +    P S+K +   N   
Sbjct: 274 ASETPVEPVVETTPPAS--ENQKKNKKDKKKSESEKVVEEPVQAEAPKSKKPTADDNMDF 331

Query: 673 SE-TTASETPCTEELDVMTPSSSLNESGAGV--PVVKKRRKFRRKKNNTGWDTPKKKKKS 729
            +  TA E P  E  +  TP++ + E    V   VV+K       +N       KKK +S
Sbjct: 332 LDFVTAKEEPKDEPAE--TPAAPVEEVVENVVENVVEKSTTPPATENKKKNKKDKKKSES 389

Query: 730 TNITPNPV 737
             +T  PV
Sbjct: 390 EKVTEQPV 397


>gnl|CDD|223532 COG0456, RimI, Acetyltransferases [General function prediction
           only].
          Length = 177

 Score = 31.9 bits (72), Expect = 0.66
 Identities = 13/78 (16%), Positives = 26/78 (33%), Gaps = 8/78 (10%)

Query: 376 YYDVEPFLFYVLTQN---DDKGCHLVGYFSKEKH-----CQQKYNVSCIMTLPQYQRKGY 427
                P L  V             +VG+              + ++  +   P+Y+ +G 
Sbjct: 49  DLTQAPELLLVAETGGLDGLLDGKVVGFLLVRVVDGRPSADHEGHIYNLAVDPEYRGRGI 108

Query: 428 GRFLIDFSYLLSKKEGQR 445
           GR L+D +    ++ G  
Sbjct: 109 GRALLDEALERLRERGLA 126


>gnl|CDD|240069 cd04718, BAH_plant_2, BAH, or Bromo Adjacent Homology domain,
           plant-specific sub-family with unknown function. BAH
           domains are found in a variety of proteins playing roles
           in transcriptional silencing and the remodeling of
           chromatin. It is assumed that in most or all of these
           instances the BAH domain mediates protein-protein
           interactions.
          Length = 148

 Score = 31.4 bits (71), Expect = 0.82
 Identities = 14/50 (28%), Positives = 22/50 (44%), Gaps = 3/50 (6%)

Query: 151 YHIICLQPPLERRLKVPWKCTSCE---SGTTEPSPAKTSPGRARASLTTS 197
           +H+ CL+PPL+   +  W C  CE   SG +       +   A   L + 
Sbjct: 2   FHLCCLRPPLKEVPEGDWICPFCEVEKSGQSAMPQLPPTSRSACEKLLSG 51


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 32.6 bits (74), Expect = 0.87
 Identities = 31/164 (18%), Positives = 61/164 (37%), Gaps = 30/164 (18%)

Query: 579 ETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESKNT 638
               + +  K+  ++T  E    K K  RK K   + D       +  KK R E +S + 
Sbjct: 724 AEKDSLSAPKKQTSKTASEKSSSKGK--RKHKNDEEADK------IESKKQRLEEKSSSC 775

Query: 639 TASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPCTEELDVMTPSSSLNES 698
           + S +++               SRK S +K               EE+ + +PSS L+ S
Sbjct: 776 SPSSSSSHHHSS------SNKESRKSSRNKE--------------EEM-LPSPSSPLSSS 814

Query: 699 GAGVPVVKKRRKFRRKKNNTGWDTPKKKKKSTNITPNPVLKQQK 742
                   ++R  RR+++ +    P     + + + +    + +
Sbjct: 815 SPKPEHPSRKRP-RRQEDTSSSSGPFSASSTKSSSKSSSTSKHR 857


>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
           function prediction only].
          Length = 872

 Score = 32.5 bits (74), Expect = 0.99
 Identities = 22/105 (20%), Positives = 35/105 (33%), Gaps = 12/105 (11%)

Query: 578 AETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESKN 637
           A  T T  T        E      KT+         +T        V   KT   ++++ 
Sbjct: 759 ASGTGTSITTSG-TFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKT--VTQTQT 815

Query: 638 TTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPC 682
           TT+S +    T         +P+    S +  T+ S+TT     C
Sbjct: 816 TTSSPSPTQTT---------SPTQTSTSTTTTTSPSQTTTGGGIC 851


>gnl|CDD|216007 pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family.  This
           family contains proteins with N-acetyltransferase
           functions such as Elp3-related proteins.
          Length = 80

 Score = 29.5 bits (67), Expect = 1.2
 Identities = 10/41 (24%), Positives = 17/41 (41%), Gaps = 2/41 (4%)

Query: 396 HLVGY--FSKEKHCQQKYNVSCIMTLPQYQRKGYGRFLIDF 434
            LVG+   S          +  +   P+Y+ KG G  L++ 
Sbjct: 6   ELVGFASLSIIDEEGNVAEIEGLAVDPEYRGKGIGTALLEA 46


>gnl|CDD|221891 pfam13007, LZ_Tnp_IS66, Transposase C of IS166 homeodomain.  This
           is a leucine-zipper-like or homeodomain-like region of
           transposase TnpC of insertion element IS66.
          Length = 77

 Score = 29.2 bits (66), Expect = 1.3
 Identities = 10/36 (27%), Positives = 12/36 (33%), Gaps = 1/36 (2%)

Query: 575 NVDAETTATETTEKEDEAETEEETVVKKTKRGRKRK 610
             + E  A    E E EA    E      +R   RK
Sbjct: 32  LEELEADAAAL-EAEAEAAAAPEQPAAPPRRRPGRK 66


>gnl|CDD|176116 cd08425, PBP2_CynR, The C-terminal substrate-binding domain of the
           LysR-type transcriptional regulator CynR, contains the
           type 2 periplasmic binding fold.  CynR is a LysR-like
           transcriptional regulator of the cyn operon, which
           encodes genes that allow cyanate to be used as a sole
           source of nitrogen. The operon includes three genes in
           the following order: cynT (cyanate permease), cynS
           (cyanase), and cynX (a protein of unknown function).
           CynR negatively regulates its own expression
           independently of cyanate. CynR binds to DNA and induces
           bending of DNA in the presence or absence of cyanate,
           but the amount of bending is decreased by cyanate. The
           CynR of LysR-type transcriptional regulator family is
           composed of two functional domains joined by a linker
           helix involved in oligomerization: an N-terminal HTH
           (helix-turn-helix) domain, which is responsible for the
           DNA-binding specificity, and a C-terminal
           substrate-binding domain, which is structurally
           homologous to the type 2 periplasmic binding proteins
           (PBP2). The PBP2 are responsible for the uptake of a
           variety of substrates such as phosphate, sulfate,
           polysaccharides, lysine/arginine/ornithine, and
           histidine. The PBP2 bind their ligand in the cleft
           between these domains in a manner resembling a Venus
           flytrap. After binding their specific ligand with high
           affinity, they can interact with a cognate membrane
           transport complex comprised of two integral membrane
           domains and two cytoplasmically located ATPase domains.
           This interaction triggers the ligand translocation
           across the cytoplasmic membrane energized by ATP
           hydrolysis.
          Length = 197

 Score = 30.8 bits (70), Expect = 1.6
 Identities = 26/89 (29%), Positives = 33/89 (37%), Gaps = 31/89 (34%)

Query: 88  LLATDLAPGVTRKDIDLYKQAHE-------EATKATPLL------PLAVPEQINPAAIEF 134
           LL+ D A   TR+ ID Y Q          EA   + +L       LA    I P AI  
Sbjct: 104 LLSPDFA---TRQHIDRYFQKQGIKPRIAIEANSISAVLEVVRRGRLAT---ILPDAIAR 157

Query: 135 GQYEVCLIKCCACNVYYHIICLQPPLERR 163
            Q  +C +             L+PPL  R
Sbjct: 158 EQPGLCAVA------------LEPPLPGR 174



 Score = 29.6 bits (67), Expect = 5.2
 Identities = 29/110 (26%), Positives = 40/110 (36%), Gaps = 27/110 (24%)

Query: 211 AAKGTPQKRGKRA-SVDMMVSEDVVSKLLATDLAPGVTRKDIDLYKQAHE-------EAT 262
                P  + + A ++D + +E +   LL+ D A   TR+ ID Y Q          EA 
Sbjct: 79  VGATHPLAQRRTALTLDDLAAEPLA--LLSPDFA---TRQHIDRYFQKQGIKPRIAIEAN 133

Query: 263 KATPLL------PLAVPEQINPAAIEFGQ---YEVKTWYSSPFPQEYARL 303
             + +L       LA    I P AI   Q     V      P P   A L
Sbjct: 134 SISAVLEVVRRGRLAT---ILPDAIAREQPGLCAVAL--EPPLPGRTAAL 178


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 31.3 bits (71), Expect = 2.0
 Identities = 24/174 (13%), Positives = 54/174 (31%), Gaps = 37/174 (21%)

Query: 575 NVDAETTATETTEKEDEAETEEETVVKKTKR-----GRKRKLSLDTDAASPVVEVTPKKT 629
           N++++ +  E     +++  +       + +        ++   D   A       P  +
Sbjct: 128 NLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTS 187

Query: 630 RKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPCTEELDVM 689
            K+  S   T    + S+                   S +T   ++++ +    + +   
Sbjct: 188 NKQPNSPKPTQPNQSNSQ-----------------PASDDTANQKSSSKD---NQSMSDS 227

Query: 690 TPSSSLNESGAGVPVVKKRRKFRRKKNNTGWDTPKKK-KKSTNITPNPVLKQQK 742
              S L++                KK    + +  KK K  T+ T NP L  Q 
Sbjct: 228 ALDSILDQYSED-----------AKKTQKDYASQSKKDKTETSNTKNPQLPTQD 270



 Score = 30.1 bits (68), Expect = 5.8
 Identities = 9/68 (13%), Positives = 26/68 (38%)

Query: 631 KESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPCTEELDVMT 690
            +   K++TA  T+       ++      SS+   ++ N   S    ++   +      +
Sbjct: 27  ADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTS 86

Query: 691 PSSSLNES 698
            S+++ + 
Sbjct: 87  DSNNIIDF 94


>gnl|CDD|217502 pfam03343, SART-1, SART-1 family.  SART-1 is a protein involved in
           cell cycle arrest and pre-mRNA splicing. It has been
           shown to be a component of U4/U6 x U5 tri-snRNP complex
           in human, Schizosaccharomyces pombe and Saccharomyces
           cerevisiae. SART-1 is a known tumour antigen in a range
           of cancers recognised by T cells.
          Length = 603

 Score = 31.3 bits (71), Expect = 2.2
 Identities = 20/108 (18%), Positives = 32/108 (29%), Gaps = 5/108 (4%)

Query: 554 SHIVNPYKTLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKR----GRKR 609
               +  +   +E  K     NV  +   +  T   D  +  E    KK K+     +KR
Sbjct: 230 GSTDDEAEKKRQEVKKKLKINNVSLDDD-STETPASDYYDVSEMVKFKKPKKKKKKKKKR 288

Query: 610 KLSLDTDAASPVVEVTPKKTRKESESKNTTASETTASETPCTEELDVM 657
           +  LD D   P  E          +      +    S     EE +  
Sbjct: 289 RKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKEEQEDD 336


>gnl|CDD|222340 pfam13718, GNAT_acetyltr_2, GNAT acetyltransferase 2.  This domain
           has N-acetyltransferase activity. It has a GCN5-related
           N-acetyltransferase (GNAT) fold.
          Length = 169

 Score = 30.1 bits (69), Expect = 2.8
 Identities = 22/76 (28%), Positives = 29/76 (38%), Gaps = 21/76 (27%)

Query: 413 VSCIMTLPQYQRKGYGRFLIDFSYLLSKKEGQRGTPEKPLSD-LGRVSY------HAYWK 465
           +  I   P YQR GYG  L++        EG          D LG VS+        +W+
Sbjct: 91  IVRIAVHPDYQRMGYGSRLLELLEQYYIPEGV---------DYLG-VSFGATPELLKFWQ 140

Query: 466 S---VLLEYLDTIRNQ 478
               V + YL   RN 
Sbjct: 141 KNGFVPV-YLGQTRNA 155


>gnl|CDD|236539 PRK09490, metH, B12-dependent methionine synthase; Provisional.
          Length = 1229

 Score = 30.9 bits (71), Expect = 3.1
 Identities = 15/61 (24%), Positives = 23/61 (37%), Gaps = 4/61 (6%)

Query: 233 VVSKLLATDLAPGV---TRKDIDLYKQAHEEATKATPLLPLAVPEQINPAAIEFGQYEVK 289
           VVS LL+ +        TR + +  ++ H       PLL L      N   I++  Y   
Sbjct: 871 VVSSLLSDEQRDAYVAETRAEYEKVREQHARKKPRKPLLTLEAARA-NRFKIDWEAYTPP 929

Query: 290 T 290
            
Sbjct: 930 K 930



 Score = 30.9 bits (71), Expect = 3.1
 Identities = 15/58 (25%), Positives = 23/58 (39%), Gaps = 4/58 (6%)

Query: 84  VVSKLLATDLAPGV---TRKDIDLYKQAHEEATKATPLLPLAVPEQINPAAIEFGQYE 138
           VVS LL+ +        TR + +  ++ H       PLL L      N   I++  Y 
Sbjct: 871 VVSSLLSDEQRDAYVAETRAEYEKVREQHARKKPRKPLLTLEAARA-NRFKIDWEAYT 927


>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
           homolog A (Rrp7A) and similar proteins.  The family
           corresponds to the RRP7 domain of Rrp7A, also termed
           gastric cancer antigen Zg14, and similar proteins which
           are yeast ribosomal RNA-processing protein 7 (Rrp7p)
           homologs mainly found in Metazoans. The cellular
           function of Rrp7A remains unclear currently. Rrp7A
           harbors an N-terminal RNA recognition motif (RRM), also
           termed RBD (RNA binding domain) or RNP
           (ribonucleoprotein domain), and a C-terminal RRP7
           domain.
          Length = 129

 Score = 29.2 bits (66), Expect = 3.3
 Identities = 19/51 (37%), Positives = 29/51 (56%), Gaps = 1/51 (1%)

Query: 587 EKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESKN 637
           EKE EAE +E+  V  TK+GR+ K +   ++ +       KK +K+ E KN
Sbjct: 40  EKEKEAEPDEDGWVTVTKKGRRPKTA-RKESVAAKAAEKEKKKKKKKELKN 89


>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein.
           The proteins in this family are poorly characterized,
           but an investigation has indicated that the immediate
           early protein is required the down-regulation of MHC
           class I expression in dendritic cells. Human herpesvirus
           6 immediate early protein is also referred to as U90.
          Length = 993

 Score = 30.8 bits (69), Expect = 3.3
 Identities = 28/158 (17%), Positives = 53/158 (33%), Gaps = 6/158 (3%)

Query: 579 ETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESKNT 638
           E+ +  TT K  E  T E  +    +   +    +     SP V    K+T    E    
Sbjct: 377 ESHSKNTTGKNPEVCTREYKLDFFEEIDTQNLAKMYPSTPSPDVPGKSKETETFVEVSRQ 436

Query: 639 TASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPCTEELDVMTPSSSLNES 698
           T +E T+    CT  +  +      +S +  ++A E         + L      S + ++
Sbjct: 437 TGTEQTSPNCVCTASVTDLGGPDNIKSITGLSSAKEAL------MKSLLDTQADSVVTQT 490

Query: 699 GAGVPVVKKRRKFRRKKNNTGWDTPKKKKKSTNITPNP 736
            +          +  K+N   +     + K T+    P
Sbjct: 491 VSKNNEAFNMSLYNLKRNEETYQDKNSRDKKTDNQAGP 528


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 30.7 bits (69), Expect = 4.0
 Identities = 34/131 (25%), Positives = 55/131 (41%), Gaps = 16/131 (12%)

Query: 611 LSLDTDAASPVVEV---TPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESES 667
           +S D   +S  +E     P+ T    E+  + A E T S    T +   +TP++ K + S
Sbjct: 75  VSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENTPSPPRRTAK---ITPTTPKNNYS 131

Query: 668 KNTTASETTASETPCTEELDVMTPSSSLNE--SGAGVPVVKKRRKFRRKKNNTGWDTPKK 725
                +E    +TP        TPS +LN   S +G   VK      R +  +   T + 
Sbjct: 132 PTAAGTERVKEDTPA-------TPSRALNHYISTSGRQRVKSYTPKPRGEVKSSSPT-QT 183

Query: 726 KKKSTNITPNP 736
           ++K    TP+P
Sbjct: 184 REKVRKYTPSP 194


>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210).  This is
           a family of proteins conserved in yeasts. The function
           is not known. The Schizosaccharomyces pombe member is
           SPBC18E5.07 and the Saccharomyces cerevisiae member is
           AIM21.
          Length = 671

 Score = 30.7 bits (69), Expect = 4.1
 Identities = 34/174 (19%), Positives = 50/174 (28%), Gaps = 21/174 (12%)

Query: 550 TPLVSHIVNPYKTLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKR 609
                      K  +   S P    +   E    E  E     +  E  V   T+   +R
Sbjct: 345 HESDPKSRENSKPASIYGSVPDLIRHTPLEDV--EEYEPLFPEDESEIAVKPPTEESSRR 402

Query: 610 KLSLDTDAAS-PVVEVTPKKTRKESESKNTTASETTASETPCTE---------------E 653
                    S  V E +P   +  +     +     ASETP  E               E
Sbjct: 403 PEEEKHRFPSEDVWEDSPSSLQDTATVSTPSNPPPRASETPEQETSRSSSEVSLDPHQSE 462

Query: 654 LDVMTPSSRKESESKNTTASETTASETPCTEELDVMTPSSSLNESGAGVPVVKK 707
           L      +R E  SK    S     + P  E  +++T   +  E  +  P V K
Sbjct: 463 LKSEKKKARPE-VSKQRFPSRDVWEDAP--ESQELVTTEETPEEVKSSSPGVTK 513


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 30.3 bits (69), Expect = 4.4
 Identities = 18/103 (17%), Positives = 33/103 (32%), Gaps = 3/103 (2%)

Query: 597 ETVVKKTKRGRKRKLSLDT--DAASPVVEVTPKKTRKESESKNTTASETTASETPCTEEL 654
           E    K   GR  KL+L++   +AS   +  P   +       T       +  P     
Sbjct: 495 EQAFAKV-LGRSIKLNLESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPP 553

Query: 655 DVMTPSSRKESESKNTTASETTASETPCTEELDVMTPSSSLNE 697
                +++  S +     ++++       E     T  SS  E
Sbjct: 554 PPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKDSSPEE 596


>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.
           Members of this family are the S-layer-forming
           halobacterial major cell surface glycoprotein. The
           highest scores below model cutoffs are fragmentary
           paralogs to actual members of the family. Modifications
           include at N-linked and O-linked glycosylation, a
           C-terminal diphytanylglyceryl modification, and probable
           cleavage of the PGF-CTERM tail.
          Length = 782

 Score = 30.2 bits (68), Expect = 4.5
 Identities = 17/62 (27%), Positives = 24/62 (38%), Gaps = 9/62 (14%)

Query: 621 VVEVTPK-KTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASE 679
           VVE T +  T    +   TT   TT  E          T ++   + ++  T   TT S 
Sbjct: 707 VVEETERPDTTTSEDPTTTTTPTTTGPEET--------TETAEPTTTTEEPTEETTTGSS 758

Query: 680 TP 681
           TP
Sbjct: 759 TP 760


>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
          Length = 234

 Score = 29.9 bits (66), Expect = 4.9
 Identities = 20/68 (29%), Positives = 31/68 (45%), Gaps = 1/68 (1%)

Query: 638 TTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASETPCTEELDVMTPSSSLNE 697
           T++  +TAS    T    V TPS      S N + + TT S  P T    + T ++++  
Sbjct: 25  TSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTS-APITTTAILSTNTTTVTS 83

Query: 698 SGAGVPVV 705
           +G  V  V
Sbjct: 84  TGTTVTPV 91



 Score = 29.1 bits (64), Expect = 8.8
 Identities = 36/156 (23%), Positives = 53/156 (33%), Gaps = 14/156 (8%)

Query: 564 TKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAA---SP 620
           T   S  +S GNV   T  T  +       T + T +  T         L T+     S 
Sbjct: 25  TSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTST 84

Query: 621 VVEVTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTASETTASET 680
              VTP  T   + + N T   T  + T         T  +     S  TT S +T S T
Sbjct: 85  GTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVT-----SNVTTRSSSTTSAT 139

Query: 681 PCTEELDVMTP------SSSLNESGAGVPVVKKRRK 710
                   + P      +S+  ++ A +P V   R+
Sbjct: 140 TRITNATTLAPTLSSKGTSNATKTTAELPTVPDERQ 175


>gnl|CDD|234780 PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha;
           Validated.
          Length = 339

 Score = 30.1 bits (69), Expect = 5.1
 Identities = 8/44 (18%), Positives = 17/44 (38%), Gaps = 11/44 (25%)

Query: 190 ARASLTTSFRERKKQLKQQRLAAK-----------GTPQKRGKR 222
            + ++  +  ERK++L+   L A+           G   + G  
Sbjct: 64  LKQAIEAALEERKEELEAAALNARLAAETIDVTLPGRRIELGSL 107


>gnl|CDD|212096 cd01022, GH57N_like, N-terminal catalytic domain of heat stable
           retaining glycoside hydrolase family 57.  Glycoside
           hydrolase family 57(GH57) is a chiefly prokaryotic
           family with the majority of thermostable enzymes coming
           from extremophiles (many of these are archaeal
           hyperthermophiles), which exhibit the enzyme
           specificities of alpha-amylase (EC 3.2.1.1),
           4-alpha-glucanotransferase (EC 2.4.1.25),
           amylopullulanase (EC 3.2.1.1/41), and
           alpha-galactosidase (EC 3.2.1.22). This family also
           includes many hypothetical proteins with uncharacterized
           activity and specificity. GH57s cleave alpha-glycosidic
           bonds by employing a retaining mechanism, which involves
           a glycosyl-enzyme intermediate, allowing
           transglycosylation.
          Length = 313

 Score = 29.7 bits (67), Expect = 5.4
 Identities = 16/80 (20%), Positives = 24/80 (30%), Gaps = 11/80 (13%)

Query: 217 QKRGKRASVDM--------MVSEDVVSKLLATDLAPGVTRKDIDLYKQAHEEATKAT--- 265
              G    V +         +++ VV K   +     V     +L      E        
Sbjct: 43  VDEGPDPKVALTISGVLLEQLADPVVQKGFTSRYNDNVLDALKELVDTGQVELLGCGYTH 102

Query: 266 PLLPLAVPEQINPAAIEFGQ 285
             LPL  P++   A IE G 
Sbjct: 103 AYLPLLGPKEDVRAQIEAGL 122


>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
          Length = 1560

 Score = 30.1 bits (68), Expect = 5.6
 Identities = 17/67 (25%), Positives = 23/67 (34%), Gaps = 5/67 (7%)

Query: 560 YKTLTKES--SKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKR-GRKRKLSLDTD 616
           Y TL  E   +K S  G+ DAE    +       A   E  V K+          +LD  
Sbjct: 874 YATLAAELKENKDSKEGDADAEVAEVDAA--PGGAVDHEPPVAKQEGNAEGGDGAALDAA 931

Query: 617 AASPVVE 623
           A   +  
Sbjct: 932 AGRLMTR 938


>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein.  The YqfQ-like protein family
           includes the B. subtilis YqfQ protein, also known as
           VrrA, which is functionally uncharacterized. This family
           of proteins is found in bacteria. Proteins in this
           family are typically between 146 and 237 amino acids in
           length. There are two conserved sequence motifs: QYGP
           and PKLY.
          Length = 155

 Score = 29.0 bits (65), Expect = 5.8
 Identities = 13/59 (22%), Positives = 23/59 (38%)

Query: 578 AETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESK 636
           ++    ET E+  +   +E+    KT+   K+K  +           T  K  K S+ K
Sbjct: 94  SDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKPKPSKPK 152



 Score = 28.6 bits (64), Expect = 8.2
 Identities = 15/80 (18%), Positives = 27/80 (33%), Gaps = 19/80 (23%)

Query: 567 SSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAASPVVEVTP 626
               S     + E  +T+ TE+ED  ET+      ++K  +KR                P
Sbjct: 90  ELSSSDDEEEETEEESTDETEQEDPPETK-----TESKEKKKR--------------EVP 130

Query: 627 KKTRKESESKNTTASETTAS 646
           K   ++ + K        + 
Sbjct: 131 KPKTEKEKPKTEPKKPKPSK 150


>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
          Length = 803

 Score = 29.8 bits (67), Expect = 7.1
 Identities = 15/80 (18%), Positives = 31/80 (38%), Gaps = 7/80 (8%)

Query: 559 PYKTLTKESSKPSSGGNVDAETTATETTEKEDEAETEEETVVKKTKRGRKRKLSLDTDAA 618
           P  + + + SK   G +   +    E+   ++E   ++    +K  R +  K+  D+ A 
Sbjct: 411 PAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAG 470

Query: 619 SPVVEVTPKKTRKESESKNT 638
                   KK   +S+  N 
Sbjct: 471 -------GKKESVKSQEDNN 483


>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
           Provisional.
          Length = 417

 Score = 29.6 bits (66), Expect = 7.6
 Identities = 18/65 (27%), Positives = 33/65 (50%), Gaps = 3/65 (4%)

Query: 611 LSLDTDAASPVVEVTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNT 670
           + L +   SP    T + +  ES+S  +T +E   +E P +E     +P+S   S S ++
Sbjct: 11  IILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSES---PSPNSNSSSSSSSS 67

Query: 671 TASET 675
           ++S T
Sbjct: 68  SSSIT 72


>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
           component of pyruvate dehydrogenase complex.
          Length = 539

 Score = 29.4 bits (66), Expect = 8.1
 Identities = 22/82 (26%), Positives = 36/82 (43%), Gaps = 6/82 (7%)

Query: 595 EEETVVKKTKRGRKRKLSLDTDAASPVVEVTPKKTRKESESKNTTASETTASET---PCT 651
           EEE  + K K     K S     A+P  + +P   ++E   K  ++ E  AS+    P +
Sbjct: 189 EEEEDIGKFK---DYKPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSS 245

Query: 652 EELDVMTPSSRKESESKNTTAS 673
            +    +P +RK +E  N   S
Sbjct: 246 GDRIFASPLARKLAEDNNVPLS 267


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 29.3 bits (65), Expect = 8.8
 Identities = 21/124 (16%), Positives = 41/124 (33%), Gaps = 9/124 (7%)

Query: 613 LDTDAASPVVEVTPKKTRKESESKNTTASETTASETPCTEELDVMTPSSRKESESKNTTA 672
           +DT    PV  VTP     +S +   +++   +             PSS     +  T A
Sbjct: 293 IDTHGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAV---------IPSSVTTQSATTTQA 343

Query: 673 SETTASETPCTEELDVMTPSSSLNESGAGVPVVKKRRKFRRKKNNTGWDTPKKKKKSTNI 732
           S    S          +  + +L  +       +       ++++TG  T      +T++
Sbjct: 344 SAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTANGPTTSL 403

Query: 733 TPNP 736
              P
Sbjct: 404 PAAP 407


>gnl|CDD|201184 pfam00368, HMG-CoA_red, Hydroxymethylglutaryl-coenzyme A reductase.
            The HMG-CoA reductases catalyze the conversion of
           HMG-CoA to mevalonate, which is the rate-limiting step
           in the synthesis of isoprenoids like cholesterol.
           Probably because of the critical role of this enzyme in
           cholesterol homeostasis, mammalian HMG-CoA reductase is
           heavily regulated at the transcriptional, translational,
           and post-translational levels.
          Length = 370

 Score = 29.0 bits (66), Expect = 9.8
 Identities = 14/59 (23%), Positives = 25/59 (42%), Gaps = 6/59 (10%)

Query: 183 AKTSPGRARASLTTSFRERKKQLKQQRLAAKGTPQKRGKRASVDMMVSEDVVSKLLATD 241
            +  PG    +L+ +    KK       +A    + RGK    + ++  +VV K+L T 
Sbjct: 180 EEEFPGMELLALSGNLCTDKKP------SAINWIEGRGKSVVAEAVIPAEVVEKVLKTT 232


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.129    0.381 

Gapped
Lambda     K      H
   0.267   0.0598    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 37,137,507
Number of extensions: 3543102
Number of successful extensions: 2750
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2673
Number of HSP's successfully gapped: 115
Length of query: 760
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 656
Effective length of database: 6,324,786
Effective search space: 4149059616
Effective search space used: 4149059616
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 63 (28.3 bits)