RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2046
         (138 letters)



>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score =  144 bits (366), Expect = 3e-41
 Identities = 65/128 (50%), Positives = 85/128 (66%), Gaps = 1/128 (0%)

Query: 12  YKVPDDLVVESGKLKKLDEILPDLKKNGHRVLIFSQFIFVLDILGHYMDIRGWRHLRLDG 71
           Y   + LV  SGK+  LD++LP LK+   RVLIFSQ   +LDIL  Y+  RG+++ R+DG
Sbjct: 460 YTTGEHLVENSGKMVLLDKLLPKLKERDSRVLIFSQMTRLLDILEDYLMYRGYQYCRIDG 519

Query: 72  ATQVSSRQELIDEYNRD-QDLFAFLLSTKAGGLGINLTAADTVIIHDVDFNPYNDKQAED 130
            T    R   ID +N+   + F FLLST+AGGLGINL  AD VI++D D+NP  D QA+D
Sbjct: 520 NTGGEDRDASIDAFNKPGSEKFVFLLSTRAGGLGINLATADIVILYDSDWNPQVDLQAQD 579

Query: 131 RCHRVGQT 138
           R HR+GQ 
Sbjct: 580 RAHRIGQK 587


>gnl|CDD|223627 COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family
           [Transcription / DNA replication, recombination, and
           repair].
          Length = 866

 Score =  117 bits (293), Expect = 2e-31
 Identities = 66/119 (55%), Positives = 84/119 (70%), Gaps = 3/119 (2%)

Query: 23  GKLKKLDEILPD-LKKNGH--RVLIFSQFIFVLDILGHYMDIRGWRHLRLDGATQVSSRQ 79
           GKL+ LDE+L D L + GH  +VLIFSQF  VLD+L  Y+   G +++RLDG+T    RQ
Sbjct: 692 GKLQALDELLLDKLLEEGHYHKVLIFSQFTPVLDLLEDYLKALGIKYVRLDGSTPAKRRQ 751

Query: 80  ELIDEYNRDQDLFAFLLSTKAGGLGINLTAADTVIIHDVDFNPYNDKQAEDRCHRVGQT 138
           ELID +N D++   FLLS KAGGLG+NLT ADTVI+ D  +NP  + QA DR HR+GQ 
Sbjct: 752 ELIDRFNADEEEKVFLLSLKAGGLGLNLTGADTVILFDPWWNPAVELQAIDRAHRIGQK 810


>gnl|CDD|238034 cd00079, HELICc, Helicase superfamily c-terminal domain; associated
           with DEXDc-, DEAD-, and DEAH-box proteins, yeast
           initiation factor 4A, Ski2p, and Hepatitis C virus NS3
           helicases; this domain is found in a wide variety of
           helicases and helicase related proteins; may not be an
           autonomously folding unit, but an integral part of the
           helicase; 4 helicase superfamilies at present according
           to the organization of their signature motifs; all
           helicases share the ability to unwind nucleic acid
           duplexes with a distinct directional polarity; they
           utilize the free energy from nucleoside triphosphate
           hydrolysis to fuel their translocation along DNA,
           unwinding the duplex in the process.
          Length = 131

 Score = 85.0 bits (211), Expect = 3e-22
 Identities = 36/121 (29%), Positives = 57/121 (47%), Gaps = 2/121 (1%)

Query: 18  LVVESGKLKKLDEILPDLKKNGHRVLIFSQFIFVLDILGHYMDIRGWRHLRLDGATQVSS 77
           L VE  KL+ L E+L +  K G +VLIF     +LD L   +   G +   L G      
Sbjct: 7   LPVEDEKLEALLELLKEHLKKGGKVLIFCPSKKMLDELAELLRKPGIKVAALHGDGSQEE 66

Query: 78  RQELIDEYNRDQDLFAFLLSTKAGGLGINLTAADTVIIHDVDFNPYNDKQAEDRCHRVGQ 137
           R+E++ ++   +     L++T     GI+L     VI +D+ ++P +  Q   R  R GQ
Sbjct: 67  REEVLKDFREGE--IVVLVATDVIARGIDLPNVSVVINYDLPWSPSSYLQRIGRAGRAGQ 124

Query: 138 T 138
            
Sbjct: 125 K 125


>gnl|CDD|201125 pfam00271, Helicase_C, Helicase conserved C-terminal domain.  The
           Prosite family is restricted to DEAD/H helicases,
           whereas this domain family is found in a wide variety of
           helicases and helicase related proteins. It may be that
           this is not an autonomously folding unit, but an
           integral part of the helicase.
          Length = 78

 Score = 76.0 bits (188), Expect = 2e-19
 Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 2/79 (2%)

Query: 58  YMDIRGWRHLRLDGATQVSSRQELIDEYNRDQDLFAFLLSTKAGGLGINLTAADTVIIHD 117
            +   G +  RL G      R+E+++++   +     L++T   G GI+L   + VI +D
Sbjct: 2   LLRKPGIKVARLHGGLSQEEREEILEDFRNGK--SKVLVATDVAGRGIDLPDVNLVINYD 59

Query: 118 VDFNPYNDKQAEDRCHRVG 136
           + +NP +  Q   R  R G
Sbjct: 60  LPWNPASYIQRIGRAGRAG 78


>gnl|CDD|197757 smart00490, HELICc, helicase superfamily c-terminal domain. 
          Length = 82

 Score = 73.4 bits (181), Expect = 3e-18
 Identities = 23/84 (27%), Positives = 40/84 (47%), Gaps = 2/84 (2%)

Query: 53  DILGHYMDIRGWRHLRLDGATQVSSRQELIDEYNRDQDLFAFLLSTKAGGLGINLTAADT 112
           + L   +   G +  RL G      R+E++D++N        L++T     G++L   D 
Sbjct: 1   EELAELLKELGIKVARLHGGLSQEEREEILDKFNNG--KIKVLVATDVAERGLDLPGVDL 58

Query: 113 VIIHDVDFNPYNDKQAEDRCHRVG 136
           VII+D+ ++P +  Q   R  R G
Sbjct: 59  VIIYDLPWSPASYIQRIGRAGRAG 82


>gnl|CDD|240273 PTZ00110, PTZ00110, helicase; Provisional.
          Length = 545

 Score = 39.8 bits (93), Expect = 2e-04
 Identities = 31/122 (25%), Positives = 57/122 (46%), Gaps = 8/122 (6%)

Query: 17  DLVVESGKLKKLDEILPDLKKNGHRVLIFSQFIFVLDILGHYMDIRGWRHLRLDGATQVS 76
            +V E  K  KL  +L  + ++G ++LIF +     D L   + + GW  L + G  +  
Sbjct: 355 FVVEEHEKRGKLKMLLQRIMRDGDKILIFVETKKGADFLTKELRLDGWPALCIHGDKKQE 414

Query: 77  SRQELIDEYNRDQDLFAFLLSTKAGGLGINLTAADTVIIHDVDFNPYNDKQAEDRCHRVG 136
            R  +++E+   +     +++T     G+++     VI  + DF      Q ED  HR+G
Sbjct: 415 ERTWVLNEFKTGKS--PIMIATDVASRGLDVKDVKYVI--NFDF----PNQIEDYVHRIG 466

Query: 137 QT 138
           +T
Sbjct: 467 RT 468


>gnl|CDD|235307 PRK04537, PRK04537, ATP-dependent RNA helicase RhlB; Provisional.
          Length = 572

 Score = 33.0 bits (75), Expect = 0.033
 Identities = 30/140 (21%), Positives = 58/140 (41%), Gaps = 24/140 (17%)

Query: 15  PDDLVVESGKLKKL-----------DEILPDL-----KKNGHRVLIFSQFIFVLDILGHY 58
           P+ LVVE+  +              +E    L     +  G R ++F      ++ +   
Sbjct: 217 PEKLVVETETITAARVRQRIYFPADEEKQTLLLGLLSRSEGARTMVFVNTKAFVERVART 276

Query: 59  MDIRGWRHLRLDGATQVSSRQELIDEYNRDQDLFAFLLSTKAGGLGINLTAADTVIIHDV 118
           ++  G+R   L G      R+ L++ + + Q     L++T     G+++     V  +D+
Sbjct: 277 LERHGYRVGVLSGDVPQKKRESLLNRFQKGQ--LEILVATDVAARGLHIDGVKYVYNYDL 334

Query: 119 DFNPYNDKQAEDRCHRVGQT 138
            F+      AED  HR+G+T
Sbjct: 335 PFD------AEDYVHRIGRT 348


>gnl|CDD|234096 TIGR03031, cas_csx12, CRISPR-associated protein Cas9/Csx12, subtype
           II-B/NMENI.  Members of this family of CRISPR-associated
           (cas) protein are found, so far, in CRISPR/cas loci in
           Wolinella succinogenes DSM 1740, Legionella pneumophila
           str. Paris, and Francisella tularensis, where the last
           probably is an example of a degenerate CRISPR locus,
           having neither repeats nor a functional Cas1. The
           characteristic repeat length is 37 base pairs and period
           is about 72. One region of this large protein shows
           sequence similarity to pfam01844, HNH endonuclease
           [Mobile and extrachromosomal element functions, Other].
          Length = 802

 Score = 31.5 bits (71), Expect = 0.12
 Identities = 11/38 (28%), Positives = 20/38 (52%), Gaps = 2/38 (5%)

Query: 61  IRGWRHLRLDGATQVSSRQELIDEYNRDQDLFAFLLST 98
           ++ WRHL+ D  +   + + LI +  + QD  + L  T
Sbjct: 285 LKNWRHLKGDQESL--AVRNLIQQLKQKQDYISILEKT 320


>gnl|CDD|218255 pfam04769, MAT_Alpha1, Mating-type protein MAT alpha 1.  This
           family includes Saccharomyces cerevisiae mating type
           protein alpha 1. Mat alpha 1 is a transcription
           activator which activates mating-type alpha-specific
           genes. MAT alpha 1 and MCM 1 bind cooperatively to PQ
           elements upstream of alpha-specific genes. Alpha 1
           interacts in vivo with STE12, linking expression of
           alpha-specific genes to the alpha-pheromone (pfam04648)
           response pathway.
          Length = 200

 Score = 28.9 bits (65), Expect = 0.70
 Identities = 11/51 (21%), Positives = 20/51 (39%), Gaps = 10/51 (19%)

Query: 57  HYMDIRGWRHLRLDGATQVSSR----------QELIDEYNRDQDLFAFLLS 97
            Y+++ GW   + +  T    R          Q +   Y  ++DL  F+ S
Sbjct: 125 AYLELLGWSLTQNELGTLTLERWTAPKQQRLGQPMNQLYLTEKDLITFVQS 175


>gnl|CDD|223194 COG0116, COG0116, Predicted N6-adenine-specific DNA methylase [DNA
           replication, recombination, and repair].
          Length = 381

 Score = 28.4 bits (64), Expect = 1.0
 Identities = 16/69 (23%), Positives = 28/69 (40%), Gaps = 9/69 (13%)

Query: 2   ALKQTLDCAKYKVPDDLVVESGKLK-----KLDEILPDLKKNGHRVLIFSQFIFVLDILG 56
            L  + D A+  +  D +V+  + K      +D   PD++ N    L        +D  G
Sbjct: 103 HLFTSPDIAR--IVKDAIVDRFRRKYGRRPSVDLDGPDVRINVE--LDKDTATLGIDTTG 158

Query: 57  HYMDIRGWR 65
             +  RG+R
Sbjct: 159 DSLHKRGYR 167


>gnl|CDD|187835 cd09704, Csx12, CRISPR/Cas system-associated protein Cas9.  CRISPR
           (Clustered Regularly Interspaced Short Palindromic
           Repeats) and associated Cas proteins comprise a system
           for heritable host defense by prokaryotic cells against
           phage and other foreign DNA; Very large protein
           containing McrA/HNH-nuclease related domain and a
           RuvC-like nuclease domain; signature gene for type II.
          Length = 804

 Score = 28.4 bits (63), Expect = 1.2
 Identities = 14/38 (36%), Positives = 19/38 (50%), Gaps = 2/38 (5%)

Query: 61  IRGWRHLRLDGATQVSSRQELIDEYNRDQDLFAFLLST 98
           I G  H + DGA    S ++L +E  +   L  FLL T
Sbjct: 285 ILGEDHDKKDGAKY--SYKDLCNELKQKVGLIDFLLET 320


>gnl|CDD|212061 cd11492, SLC5sbd_NIS-SMVT, Na(+)/iodide (NIS) and
           Na(+)/multivitamin (SMVT) cotransporters, and related
           proteins; solute binding domain.  NIS (encoded by the
           SLC5A5 gene) transports I-, and other anions including
           ClO4-, SCN-, and Br-. SMVT (encoded by the SLC5A6 gene)
           transports biotin, pantothenic acid and lipoate. This
           subfamily also includes SMCT1 and -2. SMCT1(encoded by
           the SLC5A8 gene) is a high-affinity transporter of
           various monocarboxylates including lactate and pyruvate,
           short-chain fatty acids, ketone bodies, nicotinate and
           its structural analogs, pyroglutamate, benzoate and its
           derivatives, and iodide. SMCT2 (encoded by the SLC5A12
           gene) is a low-affinity transporter for short-chain
           fatty acids, lactate, pyruvate, and nicotinate. This
           subgroup belongs to the solute carrier 5 (SLC5)
           transporter family.
          Length = 522

 Score = 27.8 bits (63), Expect = 1.7
 Identities = 17/49 (34%), Positives = 22/49 (44%), Gaps = 17/49 (34%)

Query: 10  AKYKVPDDLVVESGKLKKLDEILPDLKKNGHRVLIFSQFIFVLDILGHY 58
           AKY   D L   +G + K D++LP                FV+DILGH 
Sbjct: 291 AKYHDCDPL--TAGLISKPDQLLP---------------YFVMDILGHL 322


>gnl|CDD|173933 cd08174, G1PDH-like, Glycerol-1-phosphate dehydrogenase-like.
          Glycerol-1-phosphate dehydrogenase-like. The proteins
          of this family have not been characterized. The protein
          sequences have high similarity with that of
          glycerol-1-phosphate dehydrogenase (G1PDH). G1PDH plays
          a role in the synthesis of phosphoglycerolipids in
          Gram-positive bacterial species. It catalyzes the
          reversibly reduction of dihydroxyacetone phosphate
          (DHAP) to glycerol-1-phosphate (G1P) in a
          NADH-dependent manner. Its activity requires Ni++ ion.
          This family is bacteria specific.
          Length = 331

 Score = 28.0 bits (63), Expect = 1.8
 Identities = 11/32 (34%), Positives = 16/32 (50%)

Query: 15 PDDLVVESGKLKKLDEILPDLKKNGHRVLIFS 46
          P  L +  G + +L E+L D   N  RV + S
Sbjct: 1  PLVLDIGEGAIARLGELLSDRNPNFGRVAVVS 32


>gnl|CDD|212048 cd11478, SLC5sbd_u2, Uncharacterized bacterial solute carrier 5
           subfamily; putative solute-binding domain.  SLC5 (also
           called the sodium/glucose cotransporter family or solute
           sodium symporter family) is a family of proteins that
           co-transports Na+ with sugars, amino acids, inorganic
           ions or vitamins. Prokaryotic members of this family
           include Vibrio parahaemolyticus glucose/galactose
           (vSGLT), and Escherichia coli proline (PutP) and
           pantothenate (PutF) cotransporters. One member of the
           SLC5 family, human SGLT3, has been characterized as a
           glucose sensor and not a transporter. This subfamily
           belongs to the solute carrier 5 (SLC5) transporter
           family.
          Length = 495

 Score = 27.2 bits (61), Expect = 2.9
 Identities = 17/49 (34%), Positives = 27/49 (55%), Gaps = 8/49 (16%)

Query: 91  LFAFLLSTKAGGLGINLTAADTVIIHDVDFNPYNDKQAEDRCH--RVGQ 137
           L A  +S  AG    N++A +TV  +D+ +  Y  K A D+ H  +VG+
Sbjct: 332 LLAAFMSGMAG----NVSAFNTVFTYDI-YQTYIRKDASDK-HYLKVGR 374


>gnl|CDD|221566 pfam12416, DUF3668, Cep120 protein.  This family includes the
           Cep120 protein which is associated with centriole
           structure and function.
          Length = 334

 Score = 26.9 bits (60), Expect = 3.3
 Identities = 11/44 (25%), Positives = 18/44 (40%), Gaps = 7/44 (15%)

Query: 18  LVVESGKLKKLDEILPDLKKNGHRVLIFSQFIFVLDILGHYMDI 61
           L V     + L++++P   K        S F F   +LG   D+
Sbjct: 190 LSVTLKSAENLEQLVPSSHKQNE-----SGFFFYYSLLG--NDV 226


>gnl|CDD|225650 COG3108, COG3108, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 185

 Score = 26.6 bits (59), Expect = 3.7
 Identities = 11/47 (23%), Positives = 20/47 (42%), Gaps = 6/47 (12%)

Query: 25  LKKLDEILPDLKKNG-HRV--LIFSQFIFVLDILGHY---MDIRGWR 65
           L  L+ +L D ++N   R+   +F     +  + GH        G+R
Sbjct: 66  LSGLNRLLRDWRQNEVVRMDPRLFDLVYQLKTLEGHRRPVQVTSGYR 112


>gnl|CDD|213304 cd05938, hsFATP2a_ACSVL_like, Fatty acid transport proteins (FATP)
           including hsFATP2, hsFATP5, and hsFATP6, and similar
           proteins.  Fatty acid transport proteins (FATP) of this
           family transport long-chain or very-long-chain fatty
           acids across the plasma membrane. At least five copies
           of FATPs are identified in mammalian cells. This family
           includes hsFATP2, hsFATP5, and hsFATP6, and similar
           proteins. Each FATP has unique patterns of tissue
           distribution. These FATPs also have fatty acid CoA
           synthetase activity, thus playing dual roles as fatty
           acid transporters and its activation enzymes. The hsFATP
           proteins exist in two splice variants; the b variant,
           lacking exon 3, has no acyl-CoA synthetase activity.
           FATPs are key players in the trafficking of exogenous
           fatty acids into the cell and in intracellular fatty
           acid homeostasis.
          Length = 535

 Score = 27.0 bits (60), Expect = 4.1
 Identities = 15/46 (32%), Positives = 22/46 (47%), Gaps = 3/46 (6%)

Query: 1   MALKQTLDCAKYKVPDDLVVESGKLKKLDEILPDLKKNGHRVLIFS 46
            +L     C   +V   LV +   L+ ++EILP L+  G RV   S
Sbjct: 66  GSLLHCFRCCGARV---LVADPELLEAVEEILPALRAMGVRVFYLS 108


>gnl|CDD|219361 pfam07277, SapC, SapC.  This family contains a number of bacterial
           SapC proteins approximately 250 residues long. In
           Campylobacter fetus, SapC forms part of a
           paracrystalline surface layer (S-layer) that confers
           serum resistance.
          Length = 220

 Score = 26.8 bits (60), Expect = 4.3
 Identities = 15/43 (34%), Positives = 23/43 (53%), Gaps = 1/43 (2%)

Query: 6   TLDCAKYKVPDDLVVESGKLKKL-DEILPDLKKNGHRVLIFSQ 47
           TLD  K  +     ++  KL  L DE L +L K+G+  LI++ 
Sbjct: 168 TLDGEKRNLNGFYTIDEEKLNALSDEALLELHKSGYLPLIYAH 210


>gnl|CDD|224626 COG1712, COG1712, Predicted dinucleotide-utilizing enzyme
          [General function prediction only].
          Length = 255

 Score = 26.6 bits (59), Expect = 5.1
 Identities = 11/50 (22%), Positives = 20/50 (40%), Gaps = 5/50 (10%)

Query: 2  ALKQTLDCAKYKVPD-----DLVVESGKLKKLDEILPDLKKNGHRVLIFS 46
                      + +     DLVVE+   + + E +P + K G  V++ S
Sbjct: 42 LEASVGRRCVSDIDELIAEVDLVVEAASPEAVREYVPKILKAGIDVIVMS 91


>gnl|CDD|237343 PRK13304, PRK13304, L-aspartate dehydrogenase; Reviewed.
          Length = 265

 Score = 26.1 bits (58), Expect = 6.7
 Identities = 14/34 (41%), Positives = 21/34 (61%), Gaps = 1/34 (2%)

Query: 14 VPD-DLVVESGKLKKLDEILPDLKKNGHRVLIFS 46
          V D DLVVE   +  ++E++P   +NG  V+I S
Sbjct: 59 VEDVDLVVECASVNAVEEVVPKSLENGKDVIIMS 92


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.141    0.418 

Gapped
Lambda     K      H
   0.267   0.0781    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,256,687
Number of extensions: 675795
Number of successful extensions: 771
Number of sequences better than 10.0: 1
Number of HSP's gapped: 765
Number of HSP's successfully gapped: 36
Length of query: 138
Length of database: 10,937,602
Length adjustment: 87
Effective length of query: 51
Effective length of database: 7,078,804
Effective search space: 361019004
Effective search space used: 361019004
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 54 (24.5 bits)