RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14212
         (111 letters)



>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
          34 amino acids
          [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-
          [FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence;
          found in a variety of organisms including bacteria,
          cyanobacteria, yeast, fungi, plants, and humans in
          various subcellular locations; involved in a variety of
          functions including protein-protein interactions, but
          common features in the interaction partners have not
          been defined; involved in chaperone, cell-cycle,
          transciption, and protein transport complexes; the
          number of TPR motifs varies among proteins (1,3-11,13
          15,16,19); 5-6 tandem repeats generate a right-handed
          helical structure with an amphipathic channel that is
          thought to accomodate an alpha-helix of a target
          protein; it has been proposed that TPR proteins
          preferably interact with WD-40 repeat proteins, but in
          many instances several TPR-proteins seem to aggregate
          to multi-protein complexes; examples of TPR-proteins
          include, Cdc16p, Cdc23p and Cdc27p components of the
          cyclosome/APC, the Pex5p/Pas10p receptor for
          peroxisomal targeting signals, the Tom70p co-receptor
          for mitochondrial targeting signals, Ser/Thr
          phosphatase 5C and the p110 subunit of O-GlcNAc
          transferase; three copies of the repeat are present
          here.
          Length = 100

 Score = 30.8 bits (70), Expect = 0.054
 Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 3/53 (5%)

Query: 9  PHNHLLHQRYADILYTQGGLENIELAISHYLMAINLNEKNIRALYGLALSCHQ 61
          P N   +   A   Y  G     E A+  Y  A+ L+  N +A Y L L+ ++
Sbjct: 31 PDNADAYYNLAAAYYKLG---KYEEALEDYEKALELDPDNAKAYYNLGLAYYK 80



 Score = 27.0 bits (60), Expect = 1.2
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 28 LENIELAISHYLMAINLNEKNIRALYGLAL 57
          L + + A+ +Y  A+ L+  N  A Y LA 
Sbjct: 13 LGDYDEALEYYEKALELDPDNADAYYNLAA 42


>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein.  Tymoviruses
           are single stranded RNA viruses. This family includes a
           protein of unknown function that has been named based on
           its molecular weight. Tymoviruses such as the ononis
           yellow mosaic tymovirus encode only three proteins. Of
           these two are overlapping this protein overlaps a larger
           ORF that is thought to be the polymerase.
          Length = 458

 Score = 28.9 bits (65), Expect = 0.44
 Identities = 9/24 (37%), Positives = 13/24 (54%), Gaps = 2/24 (8%)

Query: 4   LFLHHP--HNHLLHQRYADILYTQ 25
             LHH   H + LH R +D+L + 
Sbjct: 130 HHLHHSSKHRNRLHARRSDVLPSL 153


>gnl|CDD|221596 pfam12481, DUF3700, Aluminium induced protein.  This domain family
           is found in eukaryotes, and is approximately 120 amino
           acids in length. There are two conserved sequence
           motifs: YGL and LRDR. This family is related to GATase
           enzyme domains.
          Length = 228

 Score = 28.3 bits (64), Expect = 0.62
 Identities = 14/48 (29%), Positives = 19/48 (39%), Gaps = 7/48 (14%)

Query: 8   HPHNHLLHQR----YADI--LYTQGGLENIELAISHYLMAINLNEKNI 49
           H     L  R      DI  L+ QG LEN+      Y ++   NE  +
Sbjct: 60  HSKQSPLLPRLFAVVDDIFCLF-QGHLENLASLRQQYGLSKTANEAML 106


>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat. 
          Length = 69

 Score = 26.9 bits (60), Expect = 0.75
 Identities = 13/45 (28%), Positives = 20/45 (44%), Gaps = 3/45 (6%)

Query: 13 LLHQRYADILYTQGGLENIELAISHYLMAINLNEKNIRALYGLAL 57
             +   + L+  G   + + AI  Y  A+ L+  N  A Y LAL
Sbjct: 4  EALKNLGNALFKLG---DYDEAIEAYEKALELDPDNAEAYYNLAL 45



 Score = 24.6 bits (54), Expect = 6.5
 Identities = 8/38 (21%), Positives = 14/38 (36%), Gaps = 2/38 (5%)

Query: 9  PHNHLLHQRYADILYTQGGLENIELAISHYLMAINLNE 46
          P N   +   A      G  ++ E A+     A+ L+ 
Sbjct: 34 PDNAEAYYNLALAYLKLG--KDYEEALEDLEKALELDP 69


>gnl|CDD|234418 TIGR03964, mycofact_creat, mycofactocin system creatininase family
           protein.  Members of this protein family are
           uncharacterized Actinobacterial proteins, with homology
           to creatinine amidohydrolase from Pseudomonas. Members
           occur only in the context of the mycofactocin system
           [Unknown function, Enzymes of unknown specificity].
          Length = 228

 Score = 27.7 bits (62), Expect = 0.97
 Identities = 10/29 (34%), Positives = 14/29 (48%)

Query: 80  MMWVSKHLARQYEEQQGNTETLTELMSAL 108
           ++ +S    R    + GNT  L ELM  L
Sbjct: 149 LLHLSPDRVRMDRAEAGNTAPLAELMPRL 177


>gnl|CDD|239717 cd03748, Ntn_PGA, Penicillin G acylase (PGA) is the key enzyme in
           the industrial production of beta-lactam antibiotics.
           PGA hydrolyzes the side chain of penicillin G and
           related beta-lactam antibiotics releasing 6-amino
           penicillanic acid (6-APA), a building block in the
           production of semisynthetic penicillins.  PGA is widely
           distributed among microorganisms, including bacteria,
           yeast and filamentous fungi but it's in vivo role
           remains unclear.
          Length = 488

 Score = 26.3 bits (58), Expect = 3.5
 Identities = 11/39 (28%), Positives = 18/39 (46%), Gaps = 2/39 (5%)

Query: 66  AKCSAAKKKEISKQMMWVSKHLARQYEE--QQGNTETLT 102
           +K  A    E+   M W  +  A+ +EE   Q + + LT
Sbjct: 138 SKARAWDGYELQSLMAWTKQTKAKNWEEWLDQASKQALT 176


>gnl|CDD|217874 pfam04074, DUF386, Domain of unknown function (DUF386).  This
          family consists of conserved hypothetical proteins,
          typically about 150 amino acids in length, with no
          known function.
          Length = 153

 Score = 25.7 bits (57), Expect = 4.1
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query: 14 LHQRYADILYTQGGLENIELAIS 36
          +H+RY DI     G E IE A +
Sbjct: 65 VHRRYIDIQILLEGQEKIEYATA 87


>gnl|CDD|200386 TIGR04135, FibroRuminTarg, Cys-rich radical SAM target,
          FibroRumin family.  Members of this protein family are
          cysteine-rich small peptides, about 52 amino acids
          long, that are proposed targets for modification by a
          radical SAM enzyme. Known occurrences are as tandem
          gene pairs Fibrobacter succinogenes subsp. succinogenes
          S85 (missed gene calls) and in Ruminococcus albus 8.
          Length = 52

 Score = 24.7 bits (53), Expect = 4.3
 Identities = 8/37 (21%), Positives = 20/37 (54%)

Query: 39 LMAINLNEKNIRALYGLALSCHQVLTSAKCSAAKKKE 75
           M  N+ ++ + A     ++C+    + KC A+K+++
Sbjct: 10 EMFGNIADEFMAAAACACVACNGCKCACKCRASKEED 46


>gnl|CDD|131755 TIGR02708, L_lactate_ox, L-lactate oxidase.  Members of this
           protein oxidize L-lactate to pyruvate, reducing
           molecular oxygen to hydrogen peroxide. The enzyme is
           known in Aerococcus viridans, Streptococcus iniae, and
           some strains of Streptococcus pyogenes where it appears
           to contribute to virulence [Energy metabolism, Other].
          Length = 367

 Score = 26.0 bits (57), Expect = 4.8
 Identities = 8/28 (28%), Positives = 15/28 (53%)

Query: 81  MWVSKHLARQYEEQQGNTETLTELMSAL 108
           +WV+ H  RQ +      ++L E+  A+
Sbjct: 253 IWVTNHGGRQLDGGPAAFDSLQEVAEAV 280


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 25.7 bits (56), Expect = 6.7
 Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 3/39 (7%)

Query: 72  KKKEISKQMMWVSKHLARQYEEQQGNTETLTELMSALQV 110
           K KE   +M+       R+Y+E +  TE    LMSALQ 
Sbjct: 550 KLKEEECRMLEKEAQELRKYQESEKETEV---LMSALQA 585


>gnl|CDD|213811 TIGR03427, ABC_peri_uca, ABC transporter periplasmic binding
           protein, urea carboxylase region.  Members of this
           family are ABC transporter periplasmic binding proteins
           associated with the urea carboxylase/allophanate
           hydrolase pathway, an alternative to urease for urea
           degradation. The protein is restricted to bacteria with
           the pathway, with its gene close to the urea carboxylase
           and allophanate hydrolase genes. The substrate for this
           transporter therefore is likely to be urea or a compound
           from which urea is easily derived [Transport and binding
           proteins, Unknown substrate].
          Length = 328

 Score = 25.5 bits (56), Expect = 6.8
 Identities = 10/25 (40%), Positives = 19/25 (76%), Gaps = 5/25 (20%)

Query: 31  IELAISHYLMA-----INLNEKNIR 50
           +EL++SHYL+A     + L+EK+++
Sbjct: 112 VELSVSHYLLARALESVGLSEKDVK 136


>gnl|CDD|182380 PRK10328, PRK10328, DNA binding protein, nucleoid-associated;
           Provisional.
          Length = 134

 Score = 24.8 bits (54), Expect = 8.2
 Identities = 19/74 (25%), Positives = 31/74 (41%), Gaps = 14/74 (18%)

Query: 39  LMAINLNEKNIRALYGLALS-----CHQVLTSAKCSAAKKKEISKQMMWVSKHLARQYEE 93
           +M  +LN  NIR L  +A         ++L   +    +++E  +Q         R+  E
Sbjct: 3   VMLQSLN--NIRTLRAMAREFSIDVLEEMLEKFRVVTKERREEEEQQ-------QRELAE 53

Query: 94  QQGNTETLTELMSA 107
           +Q    T  ELM A
Sbjct: 54  RQEKINTWLELMKA 67


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.128    0.367 

Gapped
Lambda     K      H
   0.267   0.0867    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,405,043
Number of extensions: 440219
Number of successful extensions: 430
Number of sequences better than 10.0: 1
Number of HSP's gapped: 429
Number of HSP's successfully gapped: 42
Length of query: 111
Length of database: 10,937,602
Length adjustment: 75
Effective length of query: 36
Effective length of database: 7,611,052
Effective search space: 273997872
Effective search space used: 273997872
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 53 (23.9 bits)