RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14480
(190 letters)
>gnl|CDD|214614 smart00317, SET, SET (Su(var)3-9, Enhancer-of-zeste, Trithorax)
domain. Putative methyl transferase, based on outlier
plant homologues.
Length = 124
Score = 42.7 bits (101), Expect = 1e-05
Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 3/50 (6%)
Query: 107 ASFIEHNCYPNLYKSFT---DSGQVLLRAMKPIAPGDHLSICYTDPLWGT 153
A FI H+C PN F ++++ A++ I PG+ L+I Y
Sbjct: 75 ARFINHSCEPNCELLFVEVNGDDRIVIFALRDIKPGEELTIDYGSDYANE 124
>gnl|CDD|216155 pfam00856, SET, SET domain. SET domains are protein lysine
methyltransferase enzymes. SET domains appear to be
protein-protein interaction domains. It has been
demonstrated that SET domains mediate interactions with
a family of proteins that display similarity with
dual-specificity phosphatases (dsPTPases). A subset of
SET domains have been called PR domains. These domains
are divergent in sequence from other SET domains, but
also appear to mediate protein-protein interaction. The
SET domain consists of two regions known as SET-N and
SET-C. SET-C forms an unusual and conserved knot-like
structure of probably functional importance.
Additionally to SET-N and SET-C, an insert region
(SET-I) and flanking regions of high structural
variability form part of the overall structure.
Length = 113
Score = 37.9 bits (88), Expect = 5e-04
Identities = 16/43 (37%), Positives = 25/43 (58%), Gaps = 3/43 (6%)
Query: 107 ASFIEHNCYPNLYKSFT---DSGQVLLRAMKPIAPGDHLSICY 146
A FI H+C PN F ++++RA++ I PG+ L+I Y
Sbjct: 70 ARFINHSCEPNCEVRFVFVNGGDRIVVRALRDIKPGEELTIDY 112
>gnl|CDD|225491 COG2940, COG2940, Proteins containing SET domain [General function
prediction only].
Length = 480
Score = 36.3 bits (84), Expect = 0.006
Identities = 19/76 (25%), Positives = 30/76 (39%), Gaps = 3/76 (3%)
Query: 104 FDRASFIEHNCYPNLYKSF---TDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
D A FI H+C PN S ++ + A++ I G+ L+ Y L + L
Sbjct: 403 GDVARFINHSCTPNCEASPIEVNGIFKISIYAIRDIKAGEELTYDYGPSLEDNRELKKLL 462
Query: 161 QTSKYFICQCERCRDP 176
+ C +RC
Sbjct: 463 EKRWGCACGEDRCSHT 478
>gnl|CDD|148445 pfam06838, Alum_res, Aluminium resistance protein. This family
represents the aluminium resistance protein, which
confers resistance to aluminium in bacteria. Structural
characterisation has shown proteins in this family bind
a pyridoxal-5'-phosphate cofactor, and that they belongs
to the PLP dependent aminotransferase superfamily.
Crystal packing analysis suggests that in solution, a
tetramer may be the stable oligomeric form. This is in
contrast to most other aminotransferases which form
dimers (information from TOPSAN).
Length = 405
Score = 30.4 bits (69), Expect = 0.49
Identities = 14/58 (24%), Positives = 28/58 (48%), Gaps = 13/58 (22%)
Query: 47 YKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQ----VNAHEMPLTEPSY 100
+ PKY R +++Q + K ++ E+++ C IQ +++H P EP+
Sbjct: 299 FPVLPKYNEKRTDLIQAV----KFGDK---EKLIAFCQAIQAASPIDSHVDP--EPAD 347
>gnl|CDD|225652 COG3110, COG3110, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 216
Score = 29.0 bits (65), Expect = 1.2
Identities = 14/78 (17%), Positives = 27/78 (34%), Gaps = 11/78 (14%)
Query: 103 IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQT 162
+ IE + P+ Q++ R +K I G + + PL T N
Sbjct: 43 LLKGKDSIELDDGPH---------QIVFRVIKTIRLGSDRRLYISPPLIVTFNA--QNID 91
Query: 163 SKYFICQCERCRDPTELN 180
+F+ + R+ +
Sbjct: 92 LVFFLPRLRNEREANKFI 109
>gnl|CDD|213866 TIGR03818, MotA1, flagellar motor stator protein MotA. The MotA
protein, along with its partner MotB, comprise the
stator complex of the bacterial flagellar motor. MotAB
span the cytoplasmic membrane and undergo conformational
changes powered by the translocation of protons. These
conformational changes in turn are communicated to the
rotor assembly, producing torque. This model represents
one family of MotA proteins which are often not
identified by the "transporter, MotA/TolQ/ExbB proton
channel family" model, pfam01618.
Length = 282
Score = 29.0 bits (66), Expect = 1.5
Identities = 11/39 (28%), Positives = 23/39 (58%), Gaps = 5/39 (12%)
Query: 37 LMQLEAHVEEYKNSP---KYENDRRN--VVQFLLNFFKL 70
LM +E+H+E + S KY ++ +V+F+ ++ +L
Sbjct: 94 LMAIESHIENPEESSIFSKYPKILKDHHLVEFICDYLRL 132
>gnl|CDD|216598 pfam01602, Adaptin_N, Adaptin N terminal region. This family
consists of the N terminal region of various alpha, beta
and gamma subunits of the AP-1, AP-2 and AP-3 adaptor
protein complexes. The adaptor protein (AP) complexes
are involved in the formation of clathrin-coated pits
and vesicles. The N-terminal region of the various
adaptor proteins (APs) is constant by comparison to the
C-terminal which is variable within members of the AP-2
family; and it has been proposed that this constant
region interacts with another uniform component of the
coated vesicles.
Length = 522
Score = 28.7 bits (65), Expect = 2.1
Identities = 24/124 (19%), Positives = 42/124 (33%), Gaps = 21/124 (16%)
Query: 13 PSYECIIALRCLYK---KSHQPQIWDKLMQ--LEAHVEEYKN---------SPKYENDRR 58
+ AL LYK +S+ +I +L++ E E+K + K+ D
Sbjct: 315 DISIRLRALDLLYKLVDESNVKEIVKELLKYVSEIADPEFKIKLVKAIGRLAEKFPTDAE 374
Query: 59 NVVQFLLNFFKLNEEFTQEEIL-KICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPN 117
+ LL L + +EI+ I II+ E + +E P
Sbjct: 375 WCIDVLLELLSLAGSYVVDEIVEVIRDIIRKYP------ELREYILEHLCELLEDIESPE 428
Query: 118 LYKS 121
+
Sbjct: 429 ARAA 432
>gnl|CDD|187718 cd08649, FMT_core_NRPS_like, N-terminal formyl transferase
catalytic core domain of NRPS_like proteins, one of the
proteins involved in the synthesis of Oxazolomycin.
This family represents the N-terminal formyl transferase
catalytic core domain present in a subgroup of
non-ribosomal peptide synthetases. In Streptomyces albus
a member of this family has been shown to be involved in
the synthesis of oxazolomycin (OZM). OZM is a hybrid
peptide-polyketide antibiotic and exhibits potent
antitumor and antiviral activities. It is a multi-domain
protein consisting of a formyl transferase domain, a
Flavin-utilizing monoxygenase domain, a LuxE domain
functioning as an acyl protein synthetase and a
pp-binding domain, which may function as an acyl
carrier. It shows sequence similarity with other
peptide-polyketide biosynthesis proteins.
Length = 166
Score = 28.0 bits (63), Expect = 2.3
Identities = 17/71 (23%), Positives = 29/71 (40%), Gaps = 11/71 (15%)
Query: 77 EEILKICGIIQVNAHEMPLTEPSYI-------AIFDRASFIEHNCYPNLYKSFTDSGQVL 129
E+L + +N H+ PL P Y A+ + H + + D+G +L
Sbjct: 76 SEVLALPRKGAINFHDGPL--PRYAGLNATSWALLAGET--RHGVTWHRIEEGVDAGDIL 131
Query: 130 LRAMKPIAPGD 140
++ IAP D
Sbjct: 132 VQRPFDIAPDD 142
>gnl|CDD|237306 PRK13208, valS, valyl-tRNA synthetase; Reviewed.
Length = 800
Score = 28.2 bits (64), Expect = 3.3
Identities = 11/44 (25%), Positives = 17/44 (38%), Gaps = 7/44 (15%)
Query: 44 VEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQ 87
VE+Y K + R F +L E T E+ K + +
Sbjct: 95 VEKYYGIRKDDISRE-------EFIELCRELTDEDEKKFRELWR 131
>gnl|CDD|222408 pfam13835, DUF4194, Domain of unknown function (DUF4194).
Length = 165
Score = 27.2 bits (61), Expect = 3.7
Identities = 9/32 (28%), Positives = 16/32 (50%), Gaps = 2/32 (6%)
Query: 18 IIAL--RCLYKKSHQPQIWDKLMQLEAHVEEY 47
+IAL P++WD L++ + V +Y
Sbjct: 1 LIALLKGPYISGEKHPKLWDVLLRYQEAVRDY 32
>gnl|CDD|222791 PHA00431, PHA00431, internal virion protein C.
Length = 746
Score = 27.8 bits (62), Expect = 4.5
Identities = 11/42 (26%), Positives = 20/42 (47%), Gaps = 1/42 (2%)
Query: 25 YKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLN 66
YK+ + W+ LM ++A +++N K R + LN
Sbjct: 280 YKELMGEEQWNALM-VKAQETQFQNDAKLTEQFRLGINSALN 320
>gnl|CDD|223268 COG0190, FolD, 5,10-methylene-tetrahydrofolate
dehydrogenase/Methenyl tetrahydrofolate cyclohydrolase
[Coenzyme metabolism].
Length = 283
Score = 27.1 bits (61), Expect = 5.5
Identities = 9/30 (30%), Positives = 16/30 (53%), Gaps = 2/30 (6%)
Query: 63 FLLNFFKLNEEFTQEEILKICGIIQVNAHE 92
+ L E+ T+EE+L + I ++NA
Sbjct: 60 IASELYDLPEDITEEELLAL--IDELNADP 87
>gnl|CDD|219695 pfam08011, DUF1703, Protein of unknown function (DUF1703). This
family contains many hypothetical bacterial proteins.
It has been identified as a member of the PD-(D/E)XK
nuclease superfamily through transitive meta profile
searches. DUF1703 has the predicted secondary structure
pattern of the restriction endonuclease-like fold core
and contains an additional beta-strand at the
C-terminus.
Length = 101
Score = 26.0 bits (58), Expect = 5.8
Identities = 11/43 (25%), Positives = 22/43 (51%), Gaps = 2/43 (4%)
Query: 27 KSHQPQIWDKLMQLEA--HVEEYKNSPKYENDRRNVVQFLLNF 67
K + + DK + EA ++EY + KY+ D + + + + F
Sbjct: 49 KYIKVKDSDKEIAEEALEQIKEYGYAEKYKGDGKKIKKIGIVF 91
>gnl|CDD|181414 PRK08410, PRK08410, 2-hydroxyacid dehydrogenase; Provisional.
Length = 311
Score = 26.9 bits (60), Expect = 7.2
Identities = 13/30 (43%), Positives = 18/30 (60%), Gaps = 6/30 (20%)
Query: 71 NEEFTQ---EEILKICGIIQVNAHEMPLTE 97
NEE+ + EE+LK II ++A PL E
Sbjct: 182 NEEYERVSLEELLKTSDIISIHA---PLNE 208
>gnl|CDD|234344 TIGR03754, conj_TOL_TraD, conjugative coupling factor TraD, TOL
family. Members of this protein are assigned by
homology to the TraD family of conjugative coupling
factor. This particular clade serves as a marker for an
extended gene region that occurs occasionally on
plasmids, including the toluene catabolism TOL plasmid.
More commonly, the gene region is chromosomal, flanked
by various markers of conjugative transfer and
insertion.
Length = 643
Score = 26.9 bits (60), Expect = 8.9
Identities = 10/40 (25%), Positives = 21/40 (52%), Gaps = 5/40 (12%)
Query: 29 HQPQIWDKLMQLEAHVEEYKNSPKYENDRRN----VVQFL 64
H P+ W+ + +L A +++ +N P+ R + Q+L
Sbjct: 331 HDPKAWELIEELIARLKD-RNVPRNMQGRPPRVVAIEQYL 369
>gnl|CDD|223845 COG0774, LpxC, UDP-3-O-acyl-N-acetylglucosamine deacetylase [Cell
envelope biogenesis, outer membrane].
Length = 300
Score = 26.5 bits (59), Expect = 9.2
Identities = 24/109 (22%), Positives = 40/109 (36%), Gaps = 30/109 (27%)
Query: 85 IIQVNAHEMPLTEPS---YIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPI----- 136
II+V+ E+P+ + S ++ + D A E N K F +R KP+
Sbjct: 93 IIEVDGPEIPIMDGSAAPFVELIDEAGIKEQNA----KKKF-------IRIKKPVRVEDG 141
Query: 137 ------APGDHLSICYT----DPLWGTINRRHHL-QTSKYFICQCERCR 174
P D + YT P G + +++ F + R R
Sbjct: 142 DKWAEFTPYDGFRLSYTIDFDHPAIGRQWQSFEFDFSAESFRKEIARAR 190
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.138 0.443
Gapped
Lambda K H
0.267 0.0717 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,547,695
Number of extensions: 852796
Number of successful extensions: 774
Number of sequences better than 10.0: 1
Number of HSP's gapped: 770
Number of HSP's successfully gapped: 33
Length of query: 190
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 99
Effective length of database: 6,901,388
Effective search space: 683237412
Effective search space used: 683237412
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 56 (25.4 bits)