RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8452
         (166 letters)



>gnl|CDD|236196 PRK08241, PRK08241, RNA polymerase factor sigma-70; Validated.
          Length = 339

 Score = 29.9 bits (68), Expect = 0.60
 Identities = 11/20 (55%), Positives = 11/20 (55%)

Query: 141 YDVAKLAALLHRAADMSHPP 160
           YDV  L ALL   A  S PP
Sbjct: 230 YDVDALVALLTEDATWSMPP 249


>gnl|CDD|188577 TIGR04062, dnd_assoc_4, dnd system-associated protein 4.  A DNA
           sulfur modification system, dnd (degradation during
           electrophoresis), is sparsely and sporadically
           distributed among the bacteria. Members of this protein
           fam ily are strictly limited to species with the dnd
           operon, and are found close to the dnd operon on the
           chromosomes of species such as Nostoc sp. PCC 7120,
           Geobacter uraniireducens Rf4, and Roseobacter
           denitrificans OCh 114 [DNA metabolism,
           Restriction/modification].
          Length = 151

 Score = 29.2 bits (66), Expect = 0.68
 Identities = 10/32 (31%), Positives = 18/32 (56%)

Query: 119 APDYLEGLLLVLPNEQIPQDIVYDVAKLAALL 150
           A DY E LLL++ +E+  ++   +   L+  L
Sbjct: 120 AVDYSERLLLLIISERFDEEQPDEEFDLSKFL 151


>gnl|CDD|221522 pfam12309, KBP_C, KIF-1 binding protein C terminal.  This family of
           proteins is found in bacteria and eukaryotes. Proteins
           in this family are typically between 365 and 621 amino
           acids in length. There is a conserved LLP sequence
           motif. KBP is a binding partner for KIF1Balpha that is a
           regulator of its transport function and thus represents
           a type of kinesin interacting protein.
          Length = 365

 Score = 28.5 bits (64), Expect = 1.9
 Identities = 10/27 (37%), Positives = 15/27 (55%), Gaps = 3/27 (11%)

Query: 74  DYLEGLIVHLNTIFPVFYLIFCRSVWY 100
           D LE ++  LN   P +YL  CR + +
Sbjct: 198 DLLEPILDELN---PQYYLTLCRQLLF 221


>gnl|CDD|218520 pfam05241, EBP, Emopamil binding protein.  Emopamil binding protein
           (EBP) is as a gene that encodes a non-glycosylated type
           I integral membrane protein of endoplasmic reticulum and
           shows high level expression in epithelial tissues. The
           EBP protein has emopamil binding domains, including the
           sterol acceptor site and the catalytic centre, which
           show Delta8-Delta7 sterol isomerase activity. Human
           sterol isomerase, a homologue of mouse EBP, is suggested
           not only to play a role in cholesterol biosynthesis, but
           also to affect lipoprotein internalisation. In humans,
           mutations of EBP are known to cause the genetic disorder
           of X-linked dominant chondrodysplasia punctata (CDPX2).
           This syndrome of humans is lethal in most males, and
           affected females display asymmetric hyperkeratotic skin
           and skeletal abnormalities.
          Length = 193

 Score = 27.6 bits (62), Expect = 2.6
 Identities = 18/66 (27%), Positives = 27/66 (40%), Gaps = 10/66 (15%)

Query: 42  VFYLIFCRSVWYFPLRL---DCQLYVQVLFNQIAPDYLEGLIVHLNTIFP-----VFYLI 93
           V Y I         L++     QLY  VL+   A ++L+G +       P      FY +
Sbjct: 105 VAYAIAKDKPLRHFLQIVVSVGQLYGTVLY--FATEWLDGFVHGREFSTPEPLYFWFYFV 162

Query: 94  FCRSVW 99
           F  S+W
Sbjct: 163 FMNSLW 168


>gnl|CDD|132005 TIGR02960, SigX5, RNA polymerase sigma-70 factor, TIGR02960 family.
            This group of sigma factors are members of the sigma-70
           family (TIGR02937). They and appear by homology, tree
           building, bidirectional best hits and one-to-a-genome
           distribution, to represent a conserved family.
          Length = 324

 Score = 26.7 bits (59), Expect = 6.6
 Identities = 10/21 (47%), Positives = 11/21 (52%)

Query: 141 YDVAKLAALLHRAADMSHPPA 161
           YD+  L ALLH  A    PP 
Sbjct: 220 YDLDALTALLHEDAIWEMPPY 240


>gnl|CDD|221296 pfam11894, DUF3414, Protein of unknown function (DUF3414).  This
            family of proteins are functionally uncharacterized. This
            protein is found in eukaryotes. Proteins in this family
            are typically between 764 to 2011 amino acids in length.
            This protein has a conserved LLG sequence motif.
          Length = 1612

 Score = 26.5 bits (59), Expect = 7.7
 Identities = 21/101 (20%), Positives = 34/101 (33%), Gaps = 26/101 (25%)

Query: 60   CQLYVQVLFNQIAPDYLEGLIVHLNTIFPVFYLIFCRSVWYFPLRLDCQLYVQVLFNQIA 119
             +  +  L    +    +G      TIF +               LD    +   F +  
Sbjct: 1108 KERILSALLGGSSTYTADGEPGDNPTIFSL---------------LDF---LDFEFKEEL 1149

Query: 120  P----DYLEGLLLVLPNEQIPQ----DIVYDVAKLAALLHR 152
            P    ++ +GL L L  E  P       +YD++KL  LL  
Sbjct: 1150 PPPQLEFFDGLDLDLCLENEPDNDYGVRLYDLSKLERLLRL 1190


>gnl|CDD|232865 TIGR00189, tesB, acyl-CoA thioesterase II.  Function: hydrolyzes a
           broad range of acyl-CoA thioesters. Physiological
           function is not known. Subunit: homotetramer [Fatty acid
           and phospholipid metabolism, Biosynthesis].
          Length = 271

 Score = 26.2 bits (58), Expect = 8.4
 Identities = 18/84 (21%), Positives = 22/84 (26%), Gaps = 19/84 (22%)

Query: 69  NQIAPDYLEGLIVHLNTIFPVFYLIFCRSVW---YFPLRLDCQLYVQVLFNQIAPDYLEG 125
           NQ+A  Y   L   L  + P       R V    Y   + D   YV              
Sbjct: 124 NQLATKYPATLPRFLKHVVPFERPFEIRPVNLLNYLGGKEDPPQYV-------------- 169

Query: 126 LLLVLPNEQIPQDIVYDVAKLAAL 149
                    +P D       LA L
Sbjct: 170 --WRRARGSLPDDPRLHQCALAYL 191


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.333    0.148    0.494 

Gapped
Lambda     K      H
   0.267   0.0843    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,705,236
Number of extensions: 793844
Number of successful extensions: 854
Number of sequences better than 10.0: 1
Number of HSP's gapped: 854
Number of HSP's successfully gapped: 21
Length of query: 166
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 76
Effective length of database: 6,945,742
Effective search space: 527876392
Effective search space used: 527876392
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (22.0 bits)
S2: 55 (24.8 bits)