RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6619
         (230 letters)



>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 65.9 bits (161), Expect = 2e-12
 Identities = 42/181 (23%), Positives = 71/181 (39%), Gaps = 29/181 (16%)

Query: 51  IFFRSSVLPQLEAERDKKLSSRVIEFQARCRGYLARKH-LAKLKVQDVAVRCIQRNVRKF 109
           +FF++ VL  LE  RD KL +     Q   RG   R+  L  LK     ++ IQ   R  
Sbjct: 726 VFFKAGVLAALEDMRDAKLDNIATRIQRAIRGRYLRRRYLQALKRIK-KIQVIQHGFRLR 784

Query: 110 MDVRDWSWWRLYLKINPLLNVHRTD----------QELQAKSVQ---------------- 143
             V     WRL++K+ PLL++  +            +LQ    +                
Sbjct: 785 RLVDYELKWRLFIKLQPLLSLLGSRKEYRSYLACIIKLQKTIKREKKLRETEEVEFSLKA 844

Query: 144 DVAVRRIQRNVPKLMDVRDWSWWRLYLKINP-LLNVHRTDQELQAKSEELEALKSKLEKL 202
           +V +++  R++             +YL+    +    R  QEL+   + + +LK    +L
Sbjct: 845 EVLIQKFGRSLKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLEL 904

Query: 203 E 203
           E
Sbjct: 905 E 905


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 33.0 bits (75), Expect = 0.10
 Identities = 25/99 (25%), Positives = 48/99 (48%), Gaps = 10/99 (10%)

Query: 129 NVHRTDQELQAKSVQDVAVR---RIQRNVPKLMDVRDWSWWRLYLKINPLLNVHRTDQEL 185
           +V   D EL  K +++  VR   +  + VP+ MD++       Y ++   L      +E 
Sbjct: 99  DVEHIDVELVRKELKNALVRAGLKTLQRVPEYMDLK-----EDYEELKEKLE--ELQKEK 151

Query: 186 QAKSEELEALKSKLEKLEVERNLLKQDNTRLETKTKIRE 224
           +   +ELE L+++ E+++     L+ +N+RLE   K   
Sbjct: 152 EELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLP 190


>gnl|CDD|238846 cd01713, PAPS_reductase, This domain is found in phosphoadenosine
           phosphosulphate (PAPS) reductase enzymes or PAPS
           sulphotransferase. PAPS reductase is part of the adenine
           nucleotide alpha hydrolases superfamily also including N
           type ATP PPases and ATP sulphurylases. A highly modified
           version of the P loop, the fingerprint peptide of
           mononucleotide-binding proteins, is present in the
           active site of the protein, which appears to be a
           positively charged cleft containing a number of
           conserved arginine and lysine residues. Although PAPS
           reductase has no ATPase activity, it shows a striking
           similarity to the structure of the ATP pyrophosphatase
           (ATP PPase) domain of GMP synthetase, indicating that
           both enzyme families have evolved from a common
           ancestral nucleotide-binding fold.   The enzyme uses
           thioredoxin as an electron donor for the reduction of
           PAPS to phospho-adenosine-phosphate (PAP) . It is also
           found in NodP nodulation protein P from Rhizobium
           meliloti which has ATP sulphurylase activity (sulphate
           adenylate transferase) .
          Length = 173

 Score = 32.0 bits (73), Expect = 0.13
 Identities = 23/112 (20%), Positives = 34/112 (30%), Gaps = 7/112 (6%)

Query: 73  VIEFQARCRGYLARKHLAKLKVQDVAVRCIQRNVRKFMD-VRDWSWWRLYLKINPLLNVH 131
             EF  R         L  ++  D     +   ++ F     D  W    LK+ PL    
Sbjct: 43  TYEFVDRVAERY-GLPLVVVRPPDSPAEGLALGLKGFPLPSPDRRWCCRILKVEPLRRAL 101

Query: 132 RTDQELQAKSVQDVAVRRIQ-RNVPKLMDVRDWSWWRLYLKINPLLNVHRTD 182
           +              +RR +      L  V         LK+NPLL+    D
Sbjct: 102 KELG----VVAWITGIRRDESARRALLPVVWTDDGKGGILKVNPLLDWTYED 149


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 31.5 bits (72), Expect = 0.20
 Identities = 14/52 (26%), Positives = 24/52 (46%)

Query: 175 LLNVHRTDQELQAKSEELEALKSKLEKLEVERNLLKQDNTRLETKTKIREFL 226
           L  + +         EE   L+ +L +L+ E   L+ +N RL+   + R FL
Sbjct: 117 LARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQENEQRRWFL 168


>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator.  DivIC from B.
           subtilis is necessary for both vegetative and
           sporulation septum formation. These proteins are mainly
           composed of an amino terminal coiled-coil.
          Length = 80

 Score = 29.9 bits (68), Expect = 0.26
 Identities = 17/46 (36%), Positives = 26/46 (56%)

Query: 172 INPLLNVHRTDQELQAKSEELEALKSKLEKLEVERNLLKQDNTRLE 217
           +N L   ++ +QE+ A   EL  LK++ E+LE E   LK D   +E
Sbjct: 13  VNGLSAYYQLNQEIAALQAELAKLKAENEELEAEVKDLKSDPDYIE 58


>gnl|CDD|224705 COG1792, MreC, Cell shape-determining protein [Cell envelope
           biogenesis, outer membrane].
          Length = 284

 Score = 31.2 bits (71), Expect = 0.39
 Identities = 12/34 (35%), Positives = 23/34 (67%)

Query: 183 QELQAKSEELEALKSKLEKLEVERNLLKQDNTRL 216
           ++L  ++EEL+   ++LE+L  E   L+++N RL
Sbjct: 69  KDLALENEELKKELAELEQLLEEVESLEEENKRL 102


>gnl|CDD|197470 smart00015, IQ, Calmodulin-binding motif.  Short
          calmodulin-binding motif containing conserved Ile and
          Gln residues.
          Length = 23

 Score = 26.9 bits (61), Expect = 0.84
 Identities = 11/22 (50%), Positives = 13/22 (59%)

Query: 68 KLSSRVIEFQARCRGYLARKHL 89
          +L+   I  QA  RGYLARK  
Sbjct: 1  RLTRAAIIIQAAWRGYLARKRY 22


>gnl|CDD|201341 pfam00612, IQ, IQ calmodulin-binding motif.  Calmodulin-binding
          motif.
          Length = 21

 Score = 26.9 bits (61), Expect = 0.94
 Identities = 10/17 (58%), Positives = 11/17 (64%)

Query: 73 VIEFQARCRGYLARKHL 89
           I+ QA  RGYLARK  
Sbjct: 4  AIKIQAAWRGYLARKRY 20


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 29.7 bits (68), Expect = 1.1
 Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 6/54 (11%)

Query: 180 RTDQELQAKSEELEALKSKLEKLEVE-RNLLKQDNTRLE-----TKTKIREFLL 227
           + ++EL+ K +ELE  + +LEK E E   L+++    LE     T  + +E LL
Sbjct: 107 KREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEILL 160


>gnl|CDD|152704 pfam12269, zf-CpG_bind_C, CpG binding protein zinc finger C
           terminal domain.  This domain family is found in
           eukaryotes, and is approximately 240 amino acids in
           length. This domain is the zinc finger domain of a CpG
           binding DNA methyltransferase protein. It contains a
           CxxC motif which forms the zinc finger and binds to DNA.
          Length = 236

 Score = 29.0 bits (65), Expect = 1.6
 Identities = 13/46 (28%), Positives = 24/46 (52%)

Query: 177 NVHRTDQELQAKSEELEALKSKLEKLEVERNLLKQDNTRLETKTKI 222
                 QEL+ + +ELE + S+ ++L V+ N  + D  + +T   I
Sbjct: 40  VARTRLQELERRHQELETIISRAKRLTVDENNEQDDADQEDTDLSI 85


>gnl|CDD|222255 pfam13600, DUF4140, N-terminal domain of unknown function
           (DUF4140).  This family is often found at the N-terminus
           of its member proteins, with DUF4139, pfam13598, at the
           C-terminus.
          Length = 101

 Score = 28.0 bits (63), Expect = 1.6
 Identities = 14/46 (30%), Positives = 22/46 (47%)

Query: 176 LNVHRTDQELQAKSEELEALKSKLEKLEVERNLLKQDNTRLETKTK 221
           ++V R     +A S  L  L+ +L  LE E   L+ +   LE + K
Sbjct: 53  VSVRRDYLPDEATSPALAELEEELRALEDELAALEAEKEALEAQIK 98


>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1.  Mating in
           fungi is controlled by the loci that determine the
           mating type of an individual, and only individuals with
           differing mating types can mate. Basidiomycete fungi
           have evolved a unique mating system, termed tetrapolar
           or bifactorial incompatibility, in which mating type is
           determined by two unlinked loci; compatibility at both
           loci is required for mating to occur. The multi-allelic
           tetrapolar mating system is considered to be a novel
           innovation that could have only evolved once, and is
           thus unique to the mushroom fungi. This domain is
           C-terminal to the homeodomain transcription factor
           region.
          Length = 418

 Score = 29.0 bits (65), Expect = 1.9
 Identities = 10/25 (40%), Positives = 14/25 (56%)

Query: 181 TDQELQAKSEELEALKSKLEKLEVE 205
              EL+AK  ELE L+++   L  E
Sbjct: 392 PSSELEAKRRELEELEARAAALRAE 416


>gnl|CDD|222689 pfam14335, DUF4391, Domain of unknown function (DUF4391).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 220 and
           257 amino acids in length.
          Length = 221

 Score = 28.4 bits (64), Expect = 2.3
 Identities = 11/46 (23%), Positives = 25/46 (54%)

Query: 179 HRTDQELQAKSEELEALKSKLEKLEVERNLLKQDNTRLETKTKIRE 224
               +E   + E++E L+ ++ KL+ +    KQ N ++E   +I++
Sbjct: 171 KEDLKERVERLEQIEELEKEIAKLKKKLKKEKQFNRKVELNLEIKK 216


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 28.6 bits (64), Expect = 3.3
 Identities = 12/47 (25%), Positives = 28/47 (59%)

Query: 178 VHRTDQELQAKSEELEALKSKLEKLEVERNLLKQDNTRLETKTKIRE 224
           +   +  ++    E++AL SK++++  E   ++Q+N  LE + KI++
Sbjct: 279 IDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKK 325


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 28.5 bits (64), Expect = 3.4
 Identities = 15/47 (31%), Positives = 22/47 (46%)

Query: 182 DQELQAKSEELEALKSKLEKLEVERNLLKQDNTRLETKTKIREFLLP 228
           ++EL+    EL  LK ++EKL      L+    RLE +    E  L 
Sbjct: 890 EEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELE 936


>gnl|CDD|219737 pfam08172, CASP_C, CASP C terminal.  This domain is the C-terminal
           region of the CASP family of proteins. It is a Golgi
           membrane protein which is thought to have a role in
           vesicle transport.
          Length = 245

 Score = 28.1 bits (63), Expect = 3.8
 Identities = 11/32 (34%), Positives = 14/32 (43%)

Query: 190 EELEALKSKLEKLEVERNLLKQDNTRLETKTK 221
           EEL      +  L  E   LK DN +L  K +
Sbjct: 100 EELRKQNQTISSLRRELESLKADNIKLYEKIR 131


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 27.9 bits (62), Expect = 4.1
 Identities = 19/61 (31%), Positives = 33/61 (54%), Gaps = 6/61 (9%)

Query: 171 KINPLLNVHRTDQ------ELQAKSEELEALKSKLEKLEVERNLLKQDNTRLETKTKIRE 224
           +I PLL     ++      + + K E LE L+ +LE+LE E++LL++     E K ++ E
Sbjct: 132 EILPLLLYLGQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEE 191

Query: 225 F 225
            
Sbjct: 192 L 192


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 28.3 bits (64), Expect = 4.2
 Identities = 14/36 (38%), Positives = 24/36 (66%), Gaps = 3/36 (8%)

Query: 183 QELQAKSEELEALKSKLEKLEVERNLLKQDNTRLET 218
           ++L+ K  +LEA   KLE+ E+ER  LK +  +L++
Sbjct: 18  RQLELKLRDLEAENEKLER-ELER--LKSELEKLKS 50


>gnl|CDD|233398 TIGR01407, dinG_rel, DnaQ family exonuclease/DinG family helicase,
           putative.  This model represents a family of proteins in
           Gram-positive bacteria. The N-terminal region of about
           200 amino acids resembles the epsilon subunit of E. coli
           DNA polymerase III and the homologous region of the
           Gram-positive type DNA polymerase III alpha subunit. The
           epsilon subunit contains an exonuclease domain. The
           remainder of this protein family resembles a predicted
           ATP-dependent helicase, the DNA damage-inducible protein
           DinG of E. coli [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 850

 Score = 28.2 bits (63), Expect = 4.2
 Identities = 19/54 (35%), Positives = 26/54 (48%), Gaps = 3/54 (5%)

Query: 174 PLLNVHRTDQELQAKSEELEALKSKLEKLE---VERNLLKQDNTRLETKTKIRE 224
              N HR D + QA +E L  L  K+EKL    +E+ L   D    E+   I+E
Sbjct: 140 THENPHRADSDAQATAELLLLLFEKMEKLPLDTLEQLLELSDQLLYESYDIIQE 193


>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1.  This is
           the N-terminal 200 residues of a set of proteins
           conserved from yeasts to humans. Most of the proteins in
           this entry have an RhoGEF pfam00621 domain at their
           C-terminal end.
          Length = 158

 Score = 27.4 bits (61), Expect = 5.2
 Identities = 15/29 (51%), Positives = 22/29 (75%)

Query: 190 EELEALKSKLEKLEVERNLLKQDNTRLET 218
           E L++L S+ ++LEVE  LL++DN RL T
Sbjct: 36  ENLDSLASENQELEVELELLREDNERLST 64


>gnl|CDD|218155 pfam04575, DUF560, Protein of unknown function (DUF560).  Family of
           hypothetical bacterial proteins.
          Length = 301

 Score = 27.6 bits (62), Expect = 5.4
 Identities = 11/56 (19%), Positives = 17/56 (30%), Gaps = 15/56 (26%)

Query: 118 WRLYLKINPLLNVHRTDQELQAKSVQDVAVRRIQRNVPKLMDVRDWSWWRLYLKIN 173
            R Y   +   N  R D+E    S+           +      RD  W  +  K+ 
Sbjct: 233 KRQYKDKDLFGNKKRKDKEYSV-SLS----------LWH----RDLHWLGITPKLT 273


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 27.7 bits (62), Expect = 6.1
 Identities = 13/38 (34%), Positives = 19/38 (50%)

Query: 182 DQELQAKSEELEALKSKLEKLEVERNLLKQDNTRLETK 219
           +QE +   E LE L+  L  LE E   +K +   LE +
Sbjct: 729 EQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEAR 766


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 27.7 bits (62), Expect = 6.2
 Identities = 15/41 (36%), Positives = 25/41 (60%)

Query: 184 ELQAKSEELEALKSKLEKLEVERNLLKQDNTRLETKTKIRE 224
           +L+ + +ELE LK ++E+LE E   L+    +LE K +  E
Sbjct: 225 KLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELE 265



 Score = 27.7 bits (62), Expect = 6.6
 Identities = 17/35 (48%), Positives = 24/35 (68%)

Query: 183 QELQAKSEELEALKSKLEKLEVERNLLKQDNTRLE 217
           +EL+ K+EE E LK KL KL+ E   LK++  +LE
Sbjct: 518 EELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLE 552


>gnl|CDD|238682 cd01386, MYSc_type_XVIII, Myosin motor domain, type XVIII myosins.
           This catalytic (head) domain has ATPase activity and
           belongs to the larger group of P-loop NTPases. Myosins
           are actin-dependent molecular motors that play important
           roles in muscle contraction, cell motility, and
           organelle transport. The head domain is a molecular
           motor, which utilizes ATP hydrolysis to generate
           directed movement toward the plus end along actin
           filaments. A cyclical interaction between myosin and
           actin provides the driving force. Rates of ATP
           hydrolysis and consequently the speed of movement along
           actin filaments vary widely, from about 0.04 micrometer
           per second for myosin I to 4.5 micrometer per second for
           myosin II in skeletal muscle. Myosin II moves in
           discrete steps about 5-10 nm long and generates 1-5
           piconewtons of force. Upon ATP binding, the myosin head
           dissociates from an actin filament. ATP hydrolysis
           causes the head to pivot and associate with a new actin
           subunit. The release of Pi causes the head to pivot and
           move the filament (power stroke). Release of ADP
           completes the cycle.
          Length = 767

 Score = 27.9 bits (62), Expect = 6.2
 Identities = 10/16 (62%), Positives = 14/16 (87%)

Query: 50  QIFFRSSVLPQLEAER 65
           Q+FFR+ VL +LEA+R
Sbjct: 752 QVFFRAGVLSRLEAQR 767


>gnl|CDD|217433 pfam03223, V-ATPase_C, V-ATPase subunit C. 
          Length = 371

 Score = 27.2 bits (61), Expect = 7.6
 Identities = 13/53 (24%), Positives = 22/53 (41%), Gaps = 9/53 (16%)

Query: 160 VRDWSW----WRLYLKINPLL-----NVHRTDQELQAKSEELEALKSKLEKLE 203
           +  + W    +     +  L+      V + D +L++KS E    KS L  LE
Sbjct: 102 LTRFQWDSAKYPTKQSLKELVDLLSKEVAQIDNDLRSKSAEYNNAKSNLAALE 154


>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
           Provisional.
          Length = 276

 Score = 26.9 bits (60), Expect = 8.1
 Identities = 11/34 (32%), Positives = 19/34 (55%)

Query: 183 QELQAKSEELEALKSKLEKLEVERNLLKQDNTRL 216
            +L+ ++EEL+    +LE    E   L+ +N RL
Sbjct: 72  FDLREENEELKKELLELESRLQELEQLEAENARL 105


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 27.2 bits (61), Expect = 9.1
 Identities = 14/48 (29%), Positives = 25/48 (52%), Gaps = 2/48 (4%)

Query: 183 QELQAKSEELEALKSKLEKLEVERNLLKQ--DNTRLETKTKIREFLLP 228
           ++L+   E  E L  +LE+LE E   LK+  ++   E K  + + + P
Sbjct: 374 EQLEKLLETKEKLSEELEELEEELKELKEELESLYSEGKISVNKTIYP 421


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.133    0.395 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,391,507
Number of extensions: 1063461
Number of successful extensions: 1987
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1908
Number of HSP's successfully gapped: 157
Length of query: 230
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 136
Effective length of database: 6,768,326
Effective search space: 920492336
Effective search space used: 920492336
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 57 (25.6 bits)