RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy494
(207 letters)
>gnl|CDD|144338 pfam00698, Acyl_transf_1, Acyl transferase domain.
Length = 319
Score = 59.0 bits (143), Expect = 2e-10
Identities = 21/64 (32%), Positives = 31/64 (48%), Gaps = 2/64 (3%)
Query: 1 MGSQWNGMGTDLLR-IPLFRQSMERLDAVLKPH-GVDLFHILTSTDNTLFDNILNSFVGI 58
GSQW GMG DLL+ P+F ++ R D KP G + +L + L + + +
Sbjct: 7 QGSQWAGMGMDLLKTSPVFAAAIARCDEAFKPQYGFSVLDVLRNNPEGLLERVDVVQPAL 66
Query: 59 AACQ 62
A Q
Sbjct: 67 FAMQ 70
Score = 58.2 bits (141), Expect = 3e-10
Identities = 26/117 (22%), Positives = 48/117 (41%), Gaps = 11/117 (9%)
Query: 56 VGIAACQPRSSKWISSSILEDAWGSPLAQTSSAEYHTNNLLSSVFFEEASAHI--PANAI 113
IA PR + ++S + AEY NL + V F EA +
Sbjct: 209 ADIAPRTPRVPFYSTTSG-----DPSDQRELDAEYWYRNLRNPVRFAEAVLAAAEQGYLV 263
Query: 114 CIEIAPHGLLQAILK---RSLAEKEVVNIPLTLRGVKDGVKFILNSIGKLYLNGLDL 167
+E++PH +L A ++ +S K+ + +R D + L ++ +L G+ +
Sbjct: 264 FVEVSPHPVLLAAVEETLKSADGKDATLVGTLIRDQGD-LVTFLYALAVAHLTGVAV 319
>gnl|CDD|214838 smart00827, PKS_AT, Acyl transferase domain in polyketide
synthase (PKS) enzymes.
Length = 298
Score = 52.8 bits (128), Expect = 2e-08
Identities = 19/45 (42%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Query: 2 GSQWNGMGTDLL-RIPLFRQSMERLDAVLKPH-GVDLFHILTSTD 44
GSQW GMG +L P+FR++++ DA L+P G L +L D
Sbjct: 6 GSQWAGMGRELYETEPVFREALDECDAALQPLLGWSLLDVLLGED 50
Score = 38.5 bits (91), Expect = 0.001
Identities = 22/92 (23%), Positives = 41/92 (44%), Gaps = 8/92 (8%)
Query: 58 IAACQPRSSK--WISSSILEDAWGSPLAQTSSAEYHTNNLLSSVFFEEASAHIPANA--- 112
+A PR + ++S+ + A+ A+Y NL V F +A + A
Sbjct: 208 LAGLTPRPPRIPFVST--VTGTLIDG-AELDDADYWVRNLREPVRFADAVRALLAEGGVT 264
Query: 113 ICIEIAPHGLLQAILKRSLAEKEVVNIPLTLR 144
+ +E+ PH +L +K++LA + +LR
Sbjct: 265 VFLEVGPHPVLTGPIKQTLAAAGSAVVLPSLR 296
>gnl|CDD|225858 COG3321, COG3321, Polyketide synthase modules and related proteins
[Secondary metabolites biosynthesis, transport, and
catabolism].
Length = 1061
Score = 42.4 bits (100), Expect = 8e-05
Identities = 30/128 (23%), Positives = 56/128 (43%), Gaps = 9/128 (7%)
Query: 51 ILNSFVGIAAC-QPRSSKW-ISSSILEDAWGSPLAQTSSAEYHTNNLLSSVFFEEASAHI 108
IL+ F A PR + + S++ D A+Y +L V F +A A
Sbjct: 729 ILDEFAAALADLAPRPPQIPLISNVTGDL---AGEPGGDAQYWVQHLRQPVRFADAIAAA 785
Query: 109 PANA--ICIEIAPHGLLQAILKRSLAEKEVVNIPLTLRGVKDGVKFILNSIGKLYLNGLD 166
A+ IE+ P +L +K++L + + TLR + L ++ +L++ G+
Sbjct: 786 LADGARTFIEVGPGPVLTESIKQTLRDA--ILSIATLRRDAPELLSFLAALAQLFVAGVA 843
Query: 167 LNLAPLYP 174
++ +PL
Sbjct: 844 VDWSPLVY 851
Score = 29.0 bits (65), Expect = 2.3
Identities = 18/63 (28%), Positives = 32/63 (50%), Gaps = 2/63 (3%)
Query: 2 GSQWNGMGTDLLRI-PLFRQSMERLDAVLKPH-GVDLFHILTSTDNTLFDNILNSFVGIA 59
GSQW GMG +L + P+F + + L+A+L G L ++ + D ++I + +
Sbjct: 535 GSQWAGMGRELYALEPVFASAFDALEALLHRLLGFSLPEVIFAPDYPFLESIDFAQPALF 594
Query: 60 ACQ 62
A
Sbjct: 595 AVS 597
>gnl|CDD|223408 COG0331, FabD, (acyl-carrier-protein) S-malonyltransferase [Lipid
metabolism].
Length = 310
Score = 32.6 bits (75), Expect = 0.12
Identities = 15/62 (24%), Positives = 25/62 (40%), Gaps = 4/62 (6%)
Query: 2 GSQWNGMGTDLLRI-PLFRQSMERLDAVLKPHGVDLFHILTSTDNTLFDNILNSFVGIAA 60
GSQ GMG DL P +++ + D L G DL+ ++ + N+ +
Sbjct: 12 GSQSLGMGKDLYENSPEAKETFDEADEAL---GFDLWALVFEGPEEELNLTQNTQPALLL 68
Query: 61 CQ 62
Sbjct: 69 VS 70
>gnl|CDD|216226 pfam00978, RdRP_2, RNA dependent RNA polymerase. This family may
represent an RNA dependent RNA polymerase. The family
also contains the following proteins: 2A protein from
bromoviruses, putative RNA dependent RNA polymerase from
tobamoviruses, non structural polyprotein from
togaviruses.
Length = 441
Score = 29.5 bits (67), Expect = 1.2
Identities = 12/34 (35%), Positives = 16/34 (47%), Gaps = 4/34 (11%)
Query: 16 PLFRQSMERLDAVLKPHGVDLFHILTSTDNTLFD 49
P+FR+ ERL VLKP T ++L
Sbjct: 172 PIFRELFERLLYVLKPK----VVFPTGMTSSLIA 201
>gnl|CDD|203465 pfam06481, COX_ARM, COX Aromatic Rich Motif. COX2 (Cytochrome O
ubiquinol OXidase 2) is a major component of the
respiratory complex during vegetative growth. It
transfers electrons from a quinol to the binuclear
centre of the catalytic subunit 1. The function of this
region is not known.
Length = 47
Score = 26.0 bits (58), Expect = 2.3
Identities = 13/40 (32%), Positives = 21/40 (52%), Gaps = 6/40 (15%)
Query: 21 SMERLDAVLKP---HGVDLFHILTSTDNTLFDNILNSFVG 57
A+ KP + V+ F +S D LFD+I+N ++G
Sbjct: 8 DRATYLALAKPSENNPVEYF---SSVDPGLFDDIVNKYMG 44
>gnl|CDD|100007 cd06150, YjgF_YER057c_UK114_like_2, This group of proteins belong
to a large family of YjgF/YER057c/UK114-like proteins
present in bacteria, archaea, and eukaryotes with no
definitive function. The conserved domain is similar
in structure to chorismate mutase but there is no
sequence similarity and no functional connection.
Members of this family have been implicated in
isoleucine (Yeo7, Ibm1, aldR) and purine (YjgF)
biosynthesis, as well as threonine anaerobic
degradation (tdcF) and mitochondrial DNA maintenance
(Ibm1). This domain homotrimerizes forming a distinct
intersubunit cavity that may serve as a small molecule
binding site.
Length = 105
Score = 27.1 bits (61), Expect = 2.9
Identities = 10/25 (40%), Positives = 16/25 (64%)
Query: 19 RQSMERLDAVLKPHGVDLFHILTST 43
RQ + ++DA+L G D IL++T
Sbjct: 32 RQVLAKIDALLAEAGSDKSRILSAT 56
>gnl|CDD|148754 pfam07326, DUF1466, Protein of unknown function (DUF1466). This
family consists of several hypothetical mammalian
proteins of around 240 residues in length.
Length = 239
Score = 27.5 bits (60), Expect = 4.2
Identities = 13/33 (39%), Positives = 16/33 (48%)
Query: 1 MGSQWNGMGTDLLRIPLFRQSMERLDAVLKPHG 33
M SQW GMGT + R L + L+P G
Sbjct: 1 MASQWQGMGTSVRRRSLLHEEQLEDKEELQPAG 33
>gnl|CDD|234502 TIGR04212, GlyGly_RbtA, Acinetobacter rhombotarget A. Members of
this protein family are found, so far, exclusively in
the genus Acinetobacter. Members average just over 600
amino acids in length, including a 22-amino acid
C-terminal putative protein sorting recognition
sequence, GlyGly-CTERM (TIGR03501). The GlyGly-CTERM
signal always co-occurs with a subfamily of the rhomboid
family intramembrane serine proteases called
rhombosortase (TIGR03902). Members occur paired with a
second rhombosortase target, with which it also shares
an N-terminal motif CSLREA. This protein is designated
Acinetobacter rhombotarget A (rbtA).
Length = 605
Score = 27.8 bits (62), Expect = 4.8
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 32 HGVDLFHILTSTDNTLFDNILNSFVGI 58
HG D+F I T T F++ +NS++ I
Sbjct: 538 HGADIFKIRVVTTTTRFNDHINSYIEI 564
>gnl|CDD|233826 TIGR02346, chap_CCT_theta, T-complex protein 1, theta subunit.
Members of this family, all eukaryotic, are part of the
group II chaperonin complex called CCT (chaperonin
containing TCP-1) or TRiC. The archaeal equivalent group
II chaperonin is often called the thermosome. Both are
somewhat related to the group I chaperonin of bacterial,
GroEL/GroES. This family consists exclusively of the CCT
alpha chain (part of a paralogous family) from animals,
plants, fungi, and other eukaryotes.
Length = 531
Score = 27.4 bits (61), Expect = 6.6
Identities = 23/77 (29%), Positives = 36/77 (46%), Gaps = 3/77 (3%)
Query: 121 GLLQAILKRSLAEKEVVNIPLTLRGVKDGVKFILNSIGKLYL-NGLDLNLAPLYPEVQYP 179
GL +A++K A KE+ I T G K ++N + KL++ N L L EVQ+P
Sbjct: 15 GLEEAVIKNIEACKELSQITRTSLGPNGMNKMVINHLDKLFVTNDAATILREL--EVQHP 72
Query: 180 VSRGTKPLGHFVDWEHG 196
++ + E G
Sbjct: 73 AAKLLVMASEMQENEIG 89
>gnl|CDD|219629 pfam07899, Frigida, Frigida-like protein. This family is composed
of plant proteins that are similar to FRIGIDA protein
expressed by Arabidopsis thaliana. This protein is
probably nuclear and is required for the regulation of
flowering time in the late-flowering phenotype. It is
known to increase RNA levels of flowering locus C.
Allelic variation at the FRIGIDA locus is a major
determinant of natural variation in flowering time.
Length = 293
Score = 26.9 bits (60), Expect = 6.7
Identities = 14/61 (22%), Positives = 24/61 (39%), Gaps = 5/61 (8%)
Query: 111 NAICIEIAPHGLLQ---AILKRSLAEKEVVNIPLTLRGVKDGVKFILNSIGKLYLNGLDL 167
+C ++ GLL+ K + +E +P LR D K +L++I Y
Sbjct: 16 RRLCEKMDSRGLLKFVIENRKELASLRE--EVPAALRCAPDPAKLVLDAIEGFYPPDSKG 73
Query: 168 N 168
Sbjct: 74 G 74
>gnl|CDD|100004 cd00448, YjgF_YER057c_UK114_family, YjgF, YER057c, and UK114
belong to a large family of proteins present in
bacteria, archaea, and eukaryotes with no definitive
function. The conserved domain is similar in structure
to chorismate mutase but there is no sequence
similarity and no functional connection. Members of
this family have been implicated in isoleucine (Yeo7,
Ibm1, aldR) and purine (YjgF) biosynthesis, as well as
threonine anaerobic degradation (tdcF) and
mitochondrial DNA maintenance (Ibm1). This domain
homotrimerizes forming a distinct intersubunit cavity
that may serve as a small molecule binding site.
Length = 107
Score = 25.6 bits (57), Expect = 9.4
Identities = 10/25 (40%), Positives = 16/25 (64%)
Query: 19 RQSMERLDAVLKPHGVDLFHILTST 43
RQ++E L+AVL+ G L ++ T
Sbjct: 35 RQALENLEAVLEAAGGSLDDVVKVT 59
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.136 0.408
Gapped
Lambda K H
0.267 0.0893 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,474,097
Number of extensions: 954911
Number of successful extensions: 754
Number of sequences better than 10.0: 1
Number of HSP's gapped: 748
Number of HSP's successfully gapped: 19
Length of query: 207
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 115
Effective length of database: 6,857,034
Effective search space: 788558910
Effective search space used: 788558910
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 57 (25.4 bits)