RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9441
(418 letters)
>gnl|CDD|144338 pfam00698, Acyl_transf_1, Acyl transferase domain.
Length = 319
Score = 83.6 bits (207), Expect = 6e-18
Identities = 33/151 (21%), Positives = 55/151 (36%), Gaps = 14/151 (9%)
Query: 2 LSPQGMCKAFDVAGHAFHSKYIAEAGPKLRSSLDRIIPAPKPRSARWISSSIPEDHWGTP 61
+ +G+ +A HS + G L +L I P PR + ++S
Sbjct: 176 VEARGVRARVIAVDYASHSPQVEAIGDALALALADIAP-RTPRVPFYSTTSGDPS----- 229
Query: 62 LAQQSSPAYHVNNLLSPVLFYEALQHV--PENAVVIEIAPHALLQAIIKR-----AKPGA 114
++ Y NL +PV F EA+ V +E++PH +L A ++ A
Sbjct: 230 DQRELDAEYWYRNLRNPVRFAEAVLAAAEQGYLVFVEVSPHPVLLAAVEETLKSADGKDA 289
Query: 115 VSIGLTKRGAPDLPAHLLSGLGKLYNAGLQP 145
+G R D L L + G+
Sbjct: 290 TLVGTLIRDQGD-LVTFLYALAVAHLTGVAV 319
>gnl|CDD|214838 smart00827, PKS_AT, Acyl transferase domain in polyketide synthase
(PKS) enzymes.
Length = 298
Score = 57.4 bits (140), Expect = 3e-09
Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 12/104 (11%)
Query: 16 HAFHSKYIAEAGPKLRSSLDRIIPAPKPRSARWISSSIPEDHWGTPL--AQQSSPAYHVN 73
HAFHS ++ + R++L + P P PR ++S+ GT + A+ Y V
Sbjct: 190 HAFHSPHMEPILDEFRAALAGLTPRP-PRIP-FVSTVT-----GTLIDGAELDDADYWVR 242
Query: 74 NLLSPVLFYEALQHVPENA---VVIEIAPHALLQAIIKRAKPGA 114
NL PV F +A++ + V +E+ PH +L IK+ A
Sbjct: 243 NLREPVRFADAVRALLAEGGVTVFLEVGPHPVLTGPIKQTLAAA 286
>gnl|CDD|225858 COG3321, COG3321, Polyketide synthase modules and related proteins
[Secondary metabolites biosynthesis, transport, and
catabolism].
Length = 1061
Score = 58.2 bits (141), Expect = 5e-09
Identities = 50/217 (23%), Positives = 75/217 (34%), Gaps = 13/217 (5%)
Query: 16 HAFHSKYIAEAGPKLRSSLDRIIPAPKPRSARWISSSIPEDHWGTPLAQQSSPAYHVNNL 75
HAFHS + + ++L + AP+P IS+ Y V +L
Sbjct: 719 HAFHSPLMDPILDEFAAALADL--APRPPQIPLISNV----TGDLAGEPGGDAQYWVQHL 772
Query: 76 LSPVLFYEALQHVPENA--VVIEIAPHALLQAIIKRAKPGAVSIGLTKRGAPDLPAHLLS 133
PV F +A+ + IE+ P +L IK+ A+ T R L+
Sbjct: 773 RQPVRFADAIAAALADGARTFIEVGPGPVLTESIKQTLRDAILSIATLRRDAPELLSFLA 832
Query: 134 GLGKLYNAGLQPRLNQLLPPVK-FPVGRGTPMIQSLVEWDHSTEWSVASFAGKGGSR--- 189
L +L+ AG+ + L+ V T Q W VAS G
Sbjct: 833 ALAQLFVAGVAVDWSPLVYGPDGRLVELPTYPFQRQRFWLDPEALGVASHPLLGAVLGRP 892
Query: 190 -SGECVIDVDLSKEEDAYLSGHAIDGRILFPATGTEE 225
SGE V+ L +L+ H + P E
Sbjct: 893 SSGEVVLQGSLDLLAVPWLADHDVRNVAALPGAAYVE 929
>gnl|CDD|214837 smart00826, PKS_DH, Dehydratase domain in polyketide synthase (PKS)
enzymes.
Length = 167
Score = 40.3 bits (95), Expect = 5e-04
Identities = 31/177 (17%), Positives = 55/177 (31%), Gaps = 40/177 (22%)
Query: 188 SRSGECVIDVDLSKEEDAYLSGHAIDGRILFPATGTEE-----AVEHFLQKIESIPRDDE 242
+ G V+ LS +L+ H + G ++ P E A E + +
Sbjct: 11 ADGGGVVLTGRLSLRTHPWLADHRVGGTVVLPGAAYVELALAAADEVGGGAPARL-EELT 69
Query: 243 FIALLHEIHNTNIAGHPYRGYTVLSDNSATH-ELAVSEFELCESGSVAV----------- 290
A L VL ++ A ++ V + + V
Sbjct: 70 LEAPL-----------------VLPEDGAVRVQVVVGAPDEDGRRTFTVYSRPDGDGPWT 112
Query: 291 ---SGRIRTPEDVSKEQLDLPELTPAISSEMLELSSADVYKDLRLRGYDYSGAFRGV 344
+G +R P +E + D+Y+ L RG +Y AF+G+
Sbjct: 113 RHATGTLRPAAAAPAAPAADLAAWPPAGAE--PVDVDDLYERLAARGLEYGPAFQGL 167
>gnl|CDD|233680 TIGR02001, gcw_chp, conserved hypothetical protein,
proteobacterial. This model represents a conserved
hypothetical protein about 240 residues in length found
so far in Proteobacteria including Shewanella
oneidensis, Ralstonia solanacearum, and Colwellia
psychrerythraea, usually as part of a paralogous family.
The function is unknown.
Length = 243
Score = 33.2 bits (76), Expect = 0.18
Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 8/35 (22%)
Query: 341 FRGVSQSDNKG-FTGKLDWT-------GNWISYID 367
FRGVSQ+++ G LD++ G W S +D
Sbjct: 46 FRGVSQTNSDPAIQGGLDYSHESGFYAGVWASNVD 80
>gnl|CDD|216072 pfam00708, Acylphosphatase, Acylphosphatase.
Length = 90
Score = 28.7 bits (65), Expect = 1.6
Identities = 12/35 (34%), Positives = 19/35 (54%), Gaps = 3/35 (8%)
Query: 207 LSGHA---IDGRILFPATGTEEAVEHFLQKIESIP 238
L G+ DG + A G EE V+ F++ ++S P
Sbjct: 31 LKGYVRNLPDGTVEIVAQGPEEDVDKFIEWLKSGP 65
>gnl|CDD|131900 TIGR02853, spore_dpaA, dipicolinic acid synthetase, A subunit.
This predicted Rossman fold-containing protein is the A
subunit of dipicolinic acid synthetase as found in most,
though not all, endospore-forming low-GC Gram-positive
bacteria; it is absent in Clostridium. The B subunit is
represented by TIGR02852. This protein is also known as
SpoVFA [Cellular processes, Sporulation and
germination].
Length = 287
Score = 30.4 bits (69), Expect = 1.7
Identities = 17/63 (26%), Positives = 29/63 (46%), Gaps = 17/63 (26%)
Query: 72 VNNLLSPVLFYEALQHVPENAVVIEIAPHALLQAIIKRAKPGAV------SIGLTKRGAP 125
+N + + VL + L +P++AV+I++A +KPG G+ AP
Sbjct: 214 INTIPALVLTADVLSKLPKHAVIIDLA-----------SKPGGTDFEYAKKRGIKALLAP 262
Query: 126 DLP 128
LP
Sbjct: 263 GLP 265
>gnl|CDD|214836 smart00825, PKS_KS, Beta-ketoacyl synthase. The structure of
beta-ketoacyl synthase is similar to that of the
thiolase family and also chalcone synthase. The active
site of beta-ketoacyl synthase is located between the N
and C-terminal domains.
Length = 298
Score = 29.6 bits (68), Expect = 2.8
Identities = 9/12 (75%), Positives = 9/12 (75%)
Query: 1 MLSPQGMCKAFD 12
MLSP G CK FD
Sbjct: 141 MLSPDGRCKTFD 152
>gnl|CDD|205017 pfam12682, Flavodoxin_4, Flavodoxin. This is a family of
flavodoxins. Flavodoxins are electron transfer proteins
that carry a molecule of non-covalently bound FMN.
Length = 156
Score = 29.0 bits (65), Expect = 2.9
Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 5/57 (8%)
Query: 178 SVASFAGKGGSRSGECVIDVDLSKEEDAYLSGHAIDGRILFPATGTEEAVEHFLQKI 234
V F GGS G ++ + AY ++G++L T + V +L+K+
Sbjct: 105 IVIPFCTSGGSGIGNTEKNL-----QQAYPDAVWLEGKLLNRGQVTRDEVTEWLKKL 156
>gnl|CDD|140213 PTZ00186, PTZ00186, heat shock 70 kDa precursor protein;
Provisional.
Length = 657
Score = 29.3 bits (65), Expect = 4.1
Identities = 20/62 (32%), Positives = 30/62 (48%), Gaps = 1/62 (1%)
Query: 263 YTVLSDNSATHELAVSEFELCESGSVAVSGRIRTPEDVSKEQLDLPELTPAISSEMLELS 322
+ V + N TH L +F+L S + R + D+SKE++ L + A ELS
Sbjct: 236 FEVKATNGDTH-LGGEDFDLALSDYILEEFRKTSGIDLSKERMALQRVREAAEKAKCELS 294
Query: 323 SA 324
SA
Sbjct: 295 SA 296
>gnl|CDD|216820 pfam01974, tRNA_int_endo, tRNA intron endonuclease, catalytic
C-terminal domain. Members of this family cleave pre
tRNA at the 5' and 3' splice sites to release the intron
EC:3.1.27.9.
Length = 85
Score = 27.2 bits (61), Expect = 4.9
Identities = 10/23 (43%), Positives = 12/23 (52%), Gaps = 6/23 (26%)
Query: 326 VYKDLRLRGY------DYSGAFR 342
VY+DLR RGY + FR
Sbjct: 7 VYRDLRERGYVVRSGLKFGADFR 29
>gnl|CDD|205181 pfam12997, DUF3881, Domain of unknown function, E. rectale Gene
description (DUF3881). Based on Eubacterium rectale
gene EUBREC_3695. As seen in gene expression experiments
(http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE14
737), it appears to be upregulated in the presence of
Bacteroides thetaiotaomicron vs when isolated in
culture.
Length = 283
Score = 28.7 bits (65), Expect = 5.0
Identities = 10/28 (35%), Positives = 14/28 (50%)
Query: 201 KEEDAYLSGHAIDGRILFPATGTEEAVE 228
K + LSG ++ G+IL P EE
Sbjct: 131 KAKSVTLSGLSVSGKILLPVNKDEEQER 158
>gnl|CDD|216448 pfam01347, Vitellogenin_N, Lipoprotein amino terminal region. This
family contains regions from: Vitellogenin, Microsomal
triglyceride transfer protein and apolipoprotein B-100.
These proteins are all involved in lipid transport. This
family contains the LV1n chain from lipovitellin, that
contains two structural domains.
Length = 579
Score = 28.8 bits (65), Expect = 6.6
Identities = 10/36 (27%), Positives = 14/36 (38%), Gaps = 4/36 (11%)
Query: 205 AYLSGHAIDGRILF----PATGTEEAVEHFLQKIES 236
L D R +F GT A++ Q I+S
Sbjct: 338 RQLYAKKKDARRIFLDAVAQAGTPPALKFIKQWIKS 373
>gnl|CDD|193528 cd05651, M20_ArgE_DapE_like5, M20 Peptidases with similarity to
acetylornithine deacetylases and
succinyl-diaminopimelate desuccinylases. Peptidase M20
family, Uncharacterized protein subfamily with
similarity to acetylornithine
deacetylase/succinyl-diaminopimelate desuccinylase
(ArgE/DapE) subfamily. ArgE/DapE enzymes catalyze
analogous reactions and share a common activator, the
metal ion (usually Co2+ or Zn2+). ArgE catalyzes a broad
range of substrates, including N-acetylornithine,
alpha-N-acetylmethionine and alpha-N-formylmethionine,
while DapE catalyzes the hydrolysis of
N-succinyl-L,L-diaminopimelate (L,L-SDAP) to
L,L-diaminopimelate and succinate. Proteins in this
subfamily are bacterial, and have been inferred by
homology as being related to both ArgE and DapE.
Length = 342
Score = 28.4 bits (64), Expect = 7.8
Identities = 13/23 (56%), Positives = 15/23 (65%), Gaps = 3/23 (13%)
Query: 42 KPRSARWISSSIPEDHWGTPLAQ 64
KPRS R SS+IP DH P+ Q
Sbjct: 264 KPRSFRLNSSAIPLDH---PIVQ 283
>gnl|CDD|216758 pfam01877, DUF54, Protein of unknown function DUF54. PH1010 is
composed of five alpha-helices (1-5) and eight
beta-strands (1-8) with the following topology: beta-1,
alpha-1, beta-2, beta-3, alpha-2, alpha-3, beta-4,
beta-5, alpha-4, beta-6, alpha-5, beta-7, beta-8. The
first six beta-strands (1-6) form a slightly twisted
antiparallel beta-sheet and face five alpha-helices on
one side. The last two beta-strands form an antiparallel
beta-sheet in the C-terminus. PH1010 forms a
characteristic homodimer structure in the crystal.
Although this archaebacterial family has no known
function, dimerisation of the molecule is crucial for
function. It would appear that the structure resembles
that of some ribosomal proteins such as the 50S
ribosomal protein L5. Although the structure resembles
that of the RRM-type RNA-binding domain of the ribosomal
L5 protein, the residues involved in RNA-binding in the
L5 protein are not conserved in this family. This
suggests that members of this family do not function as
RNA-binding proteins.
Length = 118
Score = 26.8 bits (60), Expect = 9.4
Identities = 9/55 (16%), Positives = 20/55 (36%), Gaps = 3/55 (5%)
Query: 196 DVDLSKEEDAYLSGHAIDGRILFPATGTEEAVEHFLQKIESIPRDDEFIALLHEI 250
D ++ EE GH + + + L+K+ + R+ + E+
Sbjct: 28 DAEIEVEE---AEGHYGNPITILEVRLERKGALKSLEKLRELLREQRILTARSEL 79
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.134 0.400
Gapped
Lambda K H
0.267 0.0893 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,819,807
Number of extensions: 2153053
Number of successful extensions: 1821
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1813
Number of HSP's successfully gapped: 26
Length of query: 418
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 319
Effective length of database: 6,546,556
Effective search space: 2088351364
Effective search space used: 2088351364
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (26.6 bits)