RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9441
         (418 letters)



>gnl|CDD|144338 pfam00698, Acyl_transf_1, Acyl transferase domain. 
          Length = 319

 Score = 83.6 bits (207), Expect = 6e-18
 Identities = 33/151 (21%), Positives = 55/151 (36%), Gaps = 14/151 (9%)

Query: 2   LSPQGMCKAFDVAGHAFHSKYIAEAGPKLRSSLDRIIPAPKPRSARWISSSIPEDHWGTP 61
           +  +G+        +A HS  +   G  L  +L  I P   PR   + ++S         
Sbjct: 176 VEARGVRARVIAVDYASHSPQVEAIGDALALALADIAP-RTPRVPFYSTTSGDPS----- 229

Query: 62  LAQQSSPAYHVNNLLSPVLFYEALQHV--PENAVVIEIAPHALLQAIIKR-----AKPGA 114
             ++    Y   NL +PV F EA+         V +E++PH +L A ++          A
Sbjct: 230 DQRELDAEYWYRNLRNPVRFAEAVLAAAEQGYLVFVEVSPHPVLLAAVEETLKSADGKDA 289

Query: 115 VSIGLTKRGAPDLPAHLLSGLGKLYNAGLQP 145
             +G   R   D     L  L   +  G+  
Sbjct: 290 TLVGTLIRDQGD-LVTFLYALAVAHLTGVAV 319


>gnl|CDD|214838 smart00827, PKS_AT, Acyl transferase domain in polyketide synthase
           (PKS) enzymes. 
          Length = 298

 Score = 57.4 bits (140), Expect = 3e-09
 Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 12/104 (11%)

Query: 16  HAFHSKYIAEAGPKLRSSLDRIIPAPKPRSARWISSSIPEDHWGTPL--AQQSSPAYHVN 73
           HAFHS ++     + R++L  + P P PR   ++S+       GT +  A+     Y V 
Sbjct: 190 HAFHSPHMEPILDEFRAALAGLTPRP-PRIP-FVSTVT-----GTLIDGAELDDADYWVR 242

Query: 74  NLLSPVLFYEALQHVPENA---VVIEIAPHALLQAIIKRAKPGA 114
           NL  PV F +A++ +       V +E+ PH +L   IK+    A
Sbjct: 243 NLREPVRFADAVRALLAEGGVTVFLEVGPHPVLTGPIKQTLAAA 286


>gnl|CDD|225858 COG3321, COG3321, Polyketide synthase modules and related proteins
           [Secondary metabolites biosynthesis, transport, and
           catabolism].
          Length = 1061

 Score = 58.2 bits (141), Expect = 5e-09
 Identities = 50/217 (23%), Positives = 75/217 (34%), Gaps = 13/217 (5%)

Query: 16  HAFHSKYIAEAGPKLRSSLDRIIPAPKPRSARWISSSIPEDHWGTPLAQQSSPAYHVNNL 75
           HAFHS  +     +  ++L  +  AP+P     IS+                  Y V +L
Sbjct: 719 HAFHSPLMDPILDEFAAALADL--APRPPQIPLISNV----TGDLAGEPGGDAQYWVQHL 772

Query: 76  LSPVLFYEALQHVPENA--VVIEIAPHALLQAIIKRAKPGAVSIGLTKRGAPDLPAHLLS 133
             PV F +A+     +     IE+ P  +L   IK+    A+    T R         L+
Sbjct: 773 RQPVRFADAIAAALADGARTFIEVGPGPVLTESIKQTLRDAILSIATLRRDAPELLSFLA 832

Query: 134 GLGKLYNAGLQPRLNQLLPPVK-FPVGRGTPMIQSLVEWDHSTEWSVASFAGKGGSR--- 189
            L +L+ AG+    + L+       V   T   Q    W       VAS    G      
Sbjct: 833 ALAQLFVAGVAVDWSPLVYGPDGRLVELPTYPFQRQRFWLDPEALGVASHPLLGAVLGRP 892

Query: 190 -SGECVIDVDLSKEEDAYLSGHAIDGRILFPATGTEE 225
            SGE V+   L      +L+ H +      P     E
Sbjct: 893 SSGEVVLQGSLDLLAVPWLADHDVRNVAALPGAAYVE 929


>gnl|CDD|214837 smart00826, PKS_DH, Dehydratase domain in polyketide synthase (PKS)
           enzymes. 
          Length = 167

 Score = 40.3 bits (95), Expect = 5e-04
 Identities = 31/177 (17%), Positives = 55/177 (31%), Gaps = 40/177 (22%)

Query: 188 SRSGECVIDVDLSKEEDAYLSGHAIDGRILFPATGTEE-----AVEHFLQKIESIPRDDE 242
           +  G  V+   LS     +L+ H + G ++ P     E     A E        +  +  
Sbjct: 11  ADGGGVVLTGRLSLRTHPWLADHRVGGTVVLPGAAYVELALAAADEVGGGAPARL-EELT 69

Query: 243 FIALLHEIHNTNIAGHPYRGYTVLSDNSATH-ELAVSEFELCESGSVAV----------- 290
             A L                 VL ++ A   ++ V   +     +  V           
Sbjct: 70  LEAPL-----------------VLPEDGAVRVQVVVGAPDEDGRRTFTVYSRPDGDGPWT 112

Query: 291 ---SGRIRTPEDVSKEQLDLPELTPAISSEMLELSSADVYKDLRLRGYDYSGAFRGV 344
              +G +R                P   +E   +   D+Y+ L  RG +Y  AF+G+
Sbjct: 113 RHATGTLRPAAAAPAAPAADLAAWPPAGAE--PVDVDDLYERLAARGLEYGPAFQGL 167


>gnl|CDD|233680 TIGR02001, gcw_chp, conserved hypothetical protein,
           proteobacterial.  This model represents a conserved
           hypothetical protein about 240 residues in length found
           so far in Proteobacteria including Shewanella
           oneidensis, Ralstonia solanacearum, and Colwellia
           psychrerythraea, usually as part of a paralogous family.
           The function is unknown.
          Length = 243

 Score = 33.2 bits (76), Expect = 0.18
 Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 8/35 (22%)

Query: 341 FRGVSQSDNKG-FTGKLDWT-------GNWISYID 367
           FRGVSQ+++     G LD++       G W S +D
Sbjct: 46  FRGVSQTNSDPAIQGGLDYSHESGFYAGVWASNVD 80


>gnl|CDD|216072 pfam00708, Acylphosphatase, Acylphosphatase. 
          Length = 90

 Score = 28.7 bits (65), Expect = 1.6
 Identities = 12/35 (34%), Positives = 19/35 (54%), Gaps = 3/35 (8%)

Query: 207 LSGHA---IDGRILFPATGTEEAVEHFLQKIESIP 238
           L G+     DG +   A G EE V+ F++ ++S P
Sbjct: 31  LKGYVRNLPDGTVEIVAQGPEEDVDKFIEWLKSGP 65


>gnl|CDD|131900 TIGR02853, spore_dpaA, dipicolinic acid synthetase, A subunit.
           This predicted Rossman fold-containing protein is the A
           subunit of dipicolinic acid synthetase as found in most,
           though not all, endospore-forming low-GC Gram-positive
           bacteria; it is absent in Clostridium. The B subunit is
           represented by TIGR02852. This protein is also known as
           SpoVFA [Cellular processes, Sporulation and
           germination].
          Length = 287

 Score = 30.4 bits (69), Expect = 1.7
 Identities = 17/63 (26%), Positives = 29/63 (46%), Gaps = 17/63 (26%)

Query: 72  VNNLLSPVLFYEALQHVPENAVVIEIAPHALLQAIIKRAKPGAV------SIGLTKRGAP 125
           +N + + VL  + L  +P++AV+I++A           +KPG          G+    AP
Sbjct: 214 INTIPALVLTADVLSKLPKHAVIIDLA-----------SKPGGTDFEYAKKRGIKALLAP 262

Query: 126 DLP 128
            LP
Sbjct: 263 GLP 265


>gnl|CDD|214836 smart00825, PKS_KS, Beta-ketoacyl synthase.  The structure of
           beta-ketoacyl synthase is similar to that of the
           thiolase family and also chalcone synthase. The active
           site of beta-ketoacyl synthase is located between the N
           and C-terminal domains.
          Length = 298

 Score = 29.6 bits (68), Expect = 2.8
 Identities = 9/12 (75%), Positives = 9/12 (75%)

Query: 1   MLSPQGMCKAFD 12
           MLSP G CK FD
Sbjct: 141 MLSPDGRCKTFD 152


>gnl|CDD|205017 pfam12682, Flavodoxin_4, Flavodoxin.  This is a family of
           flavodoxins. Flavodoxins are electron transfer proteins
           that carry a molecule of non-covalently bound FMN.
          Length = 156

 Score = 29.0 bits (65), Expect = 2.9
 Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 5/57 (8%)

Query: 178 SVASFAGKGGSRSGECVIDVDLSKEEDAYLSGHAIDGRILFPATGTEEAVEHFLQKI 234
            V  F   GGS  G    ++     + AY     ++G++L     T + V  +L+K+
Sbjct: 105 IVIPFCTSGGSGIGNTEKNL-----QQAYPDAVWLEGKLLNRGQVTRDEVTEWLKKL 156


>gnl|CDD|140213 PTZ00186, PTZ00186, heat shock 70 kDa precursor protein;
           Provisional.
          Length = 657

 Score = 29.3 bits (65), Expect = 4.1
 Identities = 20/62 (32%), Positives = 30/62 (48%), Gaps = 1/62 (1%)

Query: 263 YTVLSDNSATHELAVSEFELCESGSVAVSGRIRTPEDVSKEQLDLPELTPAISSEMLELS 322
           + V + N  TH L   +F+L  S  +    R  +  D+SKE++ L  +  A      ELS
Sbjct: 236 FEVKATNGDTH-LGGEDFDLALSDYILEEFRKTSGIDLSKERMALQRVREAAEKAKCELS 294

Query: 323 SA 324
           SA
Sbjct: 295 SA 296


>gnl|CDD|216820 pfam01974, tRNA_int_endo, tRNA intron endonuclease, catalytic
           C-terminal domain.  Members of this family cleave pre
           tRNA at the 5' and 3' splice sites to release the intron
           EC:3.1.27.9.
          Length = 85

 Score = 27.2 bits (61), Expect = 4.9
 Identities = 10/23 (43%), Positives = 12/23 (52%), Gaps = 6/23 (26%)

Query: 326 VYKDLRLRGY------DYSGAFR 342
           VY+DLR RGY       +   FR
Sbjct: 7   VYRDLRERGYVVRSGLKFGADFR 29


>gnl|CDD|205181 pfam12997, DUF3881, Domain of unknown function, E. rectale Gene
           description (DUF3881).  Based on Eubacterium rectale
           gene EUBREC_3695. As seen in gene expression experiments
           (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE14
           737), it appears to be upregulated in the presence of
           Bacteroides thetaiotaomicron vs when isolated in
           culture.
          Length = 283

 Score = 28.7 bits (65), Expect = 5.0
 Identities = 10/28 (35%), Positives = 14/28 (50%)

Query: 201 KEEDAYLSGHAIDGRILFPATGTEEAVE 228
           K +   LSG ++ G+IL P    EE   
Sbjct: 131 KAKSVTLSGLSVSGKILLPVNKDEEQER 158


>gnl|CDD|216448 pfam01347, Vitellogenin_N, Lipoprotein amino terminal region.  This
           family contains regions from: Vitellogenin, Microsomal
           triglyceride transfer protein and apolipoprotein B-100.
           These proteins are all involved in lipid transport. This
           family contains the LV1n chain from lipovitellin, that
           contains two structural domains.
          Length = 579

 Score = 28.8 bits (65), Expect = 6.6
 Identities = 10/36 (27%), Positives = 14/36 (38%), Gaps = 4/36 (11%)

Query: 205 AYLSGHAIDGRILF----PATGTEEAVEHFLQKIES 236
             L     D R +F       GT  A++   Q I+S
Sbjct: 338 RQLYAKKKDARRIFLDAVAQAGTPPALKFIKQWIKS 373


>gnl|CDD|193528 cd05651, M20_ArgE_DapE_like5, M20 Peptidases with similarity to
           acetylornithine deacetylases and
           succinyl-diaminopimelate desuccinylases.  Peptidase M20
           family, Uncharacterized protein subfamily with
           similarity to acetylornithine
           deacetylase/succinyl-diaminopimelate desuccinylase
           (ArgE/DapE) subfamily. ArgE/DapE enzymes catalyze
           analogous reactions and share a common activator, the
           metal ion (usually Co2+ or Zn2+). ArgE catalyzes a broad
           range of substrates, including N-acetylornithine,
           alpha-N-acetylmethionine and alpha-N-formylmethionine,
           while DapE catalyzes the hydrolysis of
           N-succinyl-L,L-diaminopimelate (L,L-SDAP) to
           L,L-diaminopimelate and succinate. Proteins in this
           subfamily are bacterial, and have been inferred by
           homology as being related to both ArgE and DapE.
          Length = 342

 Score = 28.4 bits (64), Expect = 7.8
 Identities = 13/23 (56%), Positives = 15/23 (65%), Gaps = 3/23 (13%)

Query: 42  KPRSARWISSSIPEDHWGTPLAQ 64
           KPRS R  SS+IP DH   P+ Q
Sbjct: 264 KPRSFRLNSSAIPLDH---PIVQ 283


>gnl|CDD|216758 pfam01877, DUF54, Protein of unknown function DUF54.  PH1010 is
           composed of five alpha-helices (1-5) and eight
           beta-strands (1-8) with the following topology: beta-1,
           alpha-1, beta-2, beta-3, alpha-2, alpha-3, beta-4,
           beta-5, alpha-4, beta-6, alpha-5, beta-7, beta-8. The
           first six beta-strands (1-6) form a slightly twisted
           antiparallel beta-sheet and face five alpha-helices on
           one side. The last two beta-strands form an antiparallel
           beta-sheet in the C-terminus. PH1010 forms a
           characteristic homodimer structure in the crystal.
           Although this archaebacterial family has no known
           function, dimerisation of the molecule is crucial for
           function. It would appear that the structure resembles
           that of some ribosomal proteins such as the 50S
           ribosomal protein L5. Although the structure resembles
           that of the RRM-type RNA-binding domain of the ribosomal
           L5 protein, the residues involved in RNA-binding in the
           L5 protein are not conserved in this family. This
           suggests that members of this family do not function as
           RNA-binding proteins.
          Length = 118

 Score = 26.8 bits (60), Expect = 9.4
 Identities = 9/55 (16%), Positives = 20/55 (36%), Gaps = 3/55 (5%)

Query: 196 DVDLSKEEDAYLSGHAIDGRILFPATGTEEAVEHFLQKIESIPRDDEFIALLHEI 250
           D ++  EE     GH  +   +       +     L+K+  + R+   +    E+
Sbjct: 28  DAEIEVEE---AEGHYGNPITILEVRLERKGALKSLEKLRELLREQRILTARSEL 79


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.134    0.400 

Gapped
Lambda     K      H
   0.267   0.0893    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,819,807
Number of extensions: 2153053
Number of successful extensions: 1821
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1813
Number of HSP's successfully gapped: 26
Length of query: 418
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 319
Effective length of database: 6,546,556
Effective search space: 2088351364
Effective search space used: 2088351364
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (26.6 bits)