RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17364
(312 letters)
>gnl|CDD|236491 PRK09377, tsf, elongation factor Ts; Provisional.
Length = 290
Score = 173 bits (442), Expect = 1e-52
Identities = 97/311 (31%), Positives = 151/311 (48%), Gaps = 70/311 (22%)
Query: 25 SLLGKLRKKTGYSFTNCKKALDLNEQDLGKAEKWLQEQAKELGWAKASKLAGRKTSQGLI 84
+L+ +LR++TG +CKKAL + D+ KA +WL+++ G AKA+K AGR ++GL+
Sbjct: 7 ALVKELRERTGAGMMDCKKALTEADGDIEKAIEWLRKK----GLAKAAKKAGRVAAEGLV 62
Query: 85 SIAVEGKHATMVEFNCETDFVARNKQFQGMAELISLACLNYTKTQVQPQEPFAKVFLDTA 144
+ V+G +VE N ETDFVA+N+ FQ +A ++ A L V E + LD
Sbjct: 63 AAKVDGNKGVLVEVNSETDFVAKNEDFQALANEVAEAALAAKPADV---EALLALKLD-- 117
Query: 145 QLQSLAGPENKSLADHVAILISSVGENLVLRRAACVTVNEDHDVAGFTHPSPGLEHTGPI 204
++ + LI+ +GEN+ LRR A + + V + H
Sbjct: 118 ---------GGTVEEARTELIAKIGENISLRRFARLEKDGGV-VGSYLHG---------- 157
Query: 205 LGKFGSLMVYQDLKTGDKQQNVQNVARQLCQHVIGMNPKSIGSEEDTPNED--------- 255
G+ G L+ L+ GD++ +A+ + H+ MNP+ + S ED P E
Sbjct: 158 GGRIGVLVA---LEGGDEE-----LAKDIAMHIAAMNPEYL-SREDVPAEVVEKEREIAK 208
Query: 256 --------PEE---------------ETIMYHQEFLLDPTQYVGEVIVAAGIKPVEFLRF 292
PEE E ++ Q F+ DP + VG+++ AG K V F+RF
Sbjct: 209 EQAKEEGKPEEIVEKIVEGRLNKFLKEVVLLEQPFVKDPKKTVGQLLKEAGAKVVGFVRF 268
Query: 293 ECGEGCEESEE 303
E GEG E+ EE
Sbjct: 269 EVGEGIEKKEE 279
>gnl|CDD|223342 COG0264, Tsf, Translation elongation factor Ts [Translation,
ribosomal structure and biogenesis].
Length = 296
Score = 169 bits (430), Expect = 8e-51
Identities = 100/313 (31%), Positives = 150/313 (47%), Gaps = 69/313 (22%)
Query: 25 SLLGKLRKKTGYSFTNCKKALDLNEQDLGKAEKWLQEQAKELGWAKASKLAGRKTSQGLI 84
+L+ +LR+KTG +CKKAL+ D+ KA +WL +E G AKA+K AGR ++GLI
Sbjct: 7 ALVKELREKTGAGMMDCKKALEEANGDIEKAIEWL----REKGIAKAAKKAGRIAAEGLI 62
Query: 85 SIAVE--GKHATMVEFNCETDFVARNKQFQGMAELISLACLNYTKTQVQPQEPFAKVFLD 142
+ V+ GK A +VE NCETDFVA+N +FQ +A I+ + K D
Sbjct: 63 AAKVDGDGKKAVLVEVNCETDFVAKNAEFQELANKIA-------------KAALEKKPAD 109
Query: 143 TAQLQSLAGPENKSLADHVAILISSVGENLVLRRAACVTVNEDHDVAGFTHPSPGLEHTG 202
+L++ P K++ + +A LI+ +GEN+ LRR A V D V + H +
Sbjct: 110 VEELKAAFEPGGKTVEEEIAALIAKIGENISLRRFA-VLEAGDGVVGSYLHGN------- 161
Query: 203 PILGKFGSLMVYQDLKTGDKQQNVQNVARQLCQHVIGMNPKSIGSEEDTPNED------- 255
G+ G L+ + + +A+ + H+ MNP+ + S ED P E
Sbjct: 162 ---GRIGVLVALK------GGAADEELAKDIAMHIAAMNPQYL-SREDVPAEVVEKEREI 211
Query: 256 ----------PE---------------EETIMYHQEFLLDPTQYVGEVIVAAGIKPVEFL 290
PE E + Q F+ DP + V +++ A K EF+
Sbjct: 212 FLAQLKAEGKPENIVEKIVEGRMNKFLAEVCLLEQPFVKDPKKTVEQLLKEANAKVTEFV 271
Query: 291 RFECGEGCEESEE 303
RFE GEG E+ EE
Sbjct: 272 RFEVGEGIEKKEE 284
>gnl|CDD|232832 TIGR00116, tsf, translation elongation factor Ts. Translational
elongation factor Ts (EF-Ts) catalyzes the exchange of
GTP for the GDP of the EF-Tu.GDP complex as part of the
cycle of translation elongation. This protein is found
in Bacteria, mitochondria, and chloroplasts [Protein
synthesis, Translation factors].
Length = 291
Score = 127 bits (321), Expect = 7e-35
Identities = 86/310 (27%), Positives = 137/310 (44%), Gaps = 67/310 (21%)
Query: 25 SLLGKLRKKTGYSFTNCKKALDLNEQDLGKAEKWLQEQAKELGWAKASKLAGRKTSQGLI 84
L+ +LR++TG +CKKAL D KA KWL+E+ G AKA+K A R ++G+I
Sbjct: 7 QLVKELRERTGAGMMDCKKALVEANGDFEKAIKWLREK----GIAKAAKKADRVAAEGVI 62
Query: 85 SIAVEGKHATMVEFNCETDFVARNKQFQGMAELISLACLNYTKTQVQPQEPFAKVFLDTA 144
+ + A +VE N ETDFVA+N+ F+ A + T ++
Sbjct: 63 VLKSDNHKAVIVEVNSETDFVAKNEGFKEFANKLLDELKANPITTLE-----------EL 111
Query: 145 QLQSLAGPENKSLADHVAILISSVGENLVLRRAACVTVNEDHDVAGFTHPSPGLEHTGPI 204
Q Q L ENK +++A L + +GEN+VLRR A V + + + + H +
Sbjct: 112 QKQEL---ENKEKVEYLASLAAKIGENIVLRRVA-VLEGQSNVIFSYLHAN--------- 158
Query: 205 LGKFGSLMVYQDLKTGDKQQNVQNVARQLCQHVIGMNPKSIGSEEDTPNEDPEEETI--- 261
+ G L K + + +A+ L HV P+ I ++ + +E I
Sbjct: 159 -ARIGVL-------VELKGKADEELAKHLAMHVAANKPQFIDQDDVSQEWVKKERQIITD 210
Query: 262 ----------------------------MYHQEFLLDPTQYVGEVIVAAGIKPVEFLRFE 293
+ Q+F++DP++ VG+ + K EF+RFE
Sbjct: 211 QAELSGKPKEILEKMVEGRMKKFLAEISLLGQKFVMDPSKTVGQFLKEKNAKVTEFVRFE 270
Query: 294 CGEGCEESEE 303
GEG E+ E
Sbjct: 271 VGEGIEKKAE 280
>gnl|CDD|183447 PRK12332, tsf, elongation factor Ts; Reviewed.
Length = 198
Score = 91.9 bits (229), Expect = 2e-22
Identities = 61/184 (33%), Positives = 92/184 (50%), Gaps = 40/184 (21%)
Query: 29 KLRKKTGYSFTNCKKALDLNEQDLGKAEKWLQEQAKELGWAKASKLAGRKTSQGLIS--I 86
+LR+KTG +CKKAL+ D+ KA +WL+E+ G AKA+K AGR ++GL+ I
Sbjct: 10 ELREKTGAGMMDCKKALEEANGDMEKAIEWLREK----GLAKAAKKAGRVAAEGLVGSYI 65
Query: 87 AVEGKHATMVEFNCETDFVARNKQFQGMAELISL----ACLNYTKTQVQPQEPFAK-VFL 141
G+ +VE NCETDFVAR ++F+ +A+ I++ A Y + P E K +
Sbjct: 66 HTGGRIGVLVELNCETDFVARTEEFKELAKDIAMQIAAANPEYVSREDVPAEVIEKEKEI 125
Query: 142 DTAQLQSLAGPEN-----------------------------KSLADHVAILISSVGENL 172
AQ + PEN K++ D + I+ +GEN+
Sbjct: 126 YRAQALNEGKPENIVEKIVEGRIEKFYKEVCLLEQPFIKDPSKTVEDLIKEAIAKIGENI 185
Query: 173 VLRR 176
V+RR
Sbjct: 186 VVRR 189
Score = 35.6 bits (83), Expect = 0.014
Identities = 27/109 (24%), Positives = 41/109 (37%), Gaps = 39/109 (35%)
Query: 227 QNVARQLCQHVIGMNPKSIGSEEDTPNED-----------------PE------------ 257
+ +A+ + + NP+ + S ED P E PE
Sbjct: 91 KELAKDIAMQIAAANPEYV-SREDVPAEVIEKEKEIYRAQALNEGKPENIVEKIVEGRIE 149
Query: 258 ---EETIMYHQEFLLDPTQYVGEVIVAAGIKPVE------FLRFECGEG 297
+E + Q F+ DP++ V ++I A K E F RFE GEG
Sbjct: 150 KFYKEVCLLEQPFIKDPSKTVEDLIKEAIAKIGENIVVRRFARFELGEG 198
>gnl|CDD|214362 CHL00098, tsf, elongation factor Ts.
Length = 200
Score = 82.4 bits (204), Expect = 8e-19
Identities = 47/113 (41%), Positives = 66/113 (58%), Gaps = 11/113 (9%)
Query: 29 KLRKKTGYSFTNCKKALDLNEQDLGKAEKWLQEQAKELGWAKASKLAGRKTSQGLIS--I 86
+LR KTG +CKKAL D KA + L+++ G A A+K + R T++GLI I
Sbjct: 7 ELRDKTGAGMMDCKKALQEANGDFEKALESLRQK----GLASANKKSTRITTEGLIESYI 62
Query: 87 AVEGKHATMVEFNCETDFVARNKQFQGMAELISL---ACLN--YTKTQVQPQE 134
GK +VE NCETDFVAR ++FQ +A+ I++ AC N Y + P+E
Sbjct: 63 HTGGKLGVLVEINCETDFVARREEFQKLAKNIAMQIAACPNVKYVSLEDIPEE 115
>gnl|CDD|189759 pfam00889, EF_TS, Elongation factor TS.
Length = 134
Score = 62.1 bits (152), Expect = 4e-12
Identities = 17/37 (45%), Positives = 29/37 (78%)
Query: 79 TSQGLISIAVEGKHATMVEFNCETDFVARNKQFQGMA 115
++GL+++ ++G +VE NCETDFVA+N++FQ +A
Sbjct: 1 AAEGLVAVYIDGNKGVLVEVNCETDFVAKNEKFQELA 37
Score = 56.7 bits (138), Expect = 4e-10
Identities = 28/102 (27%), Positives = 42/102 (41%), Gaps = 33/102 (32%)
Query: 227 QNVARQLCQHVIGMNPKSIGSEEDTPNED-----------------PEE----------- 258
Q +A+ + H+ MNP+ + S +D P E PE
Sbjct: 34 QELAKDIAMHIAAMNPQYL-SRDDVPAEVLEKEREILKAQAKEEGKPENIIEKIVEGRLN 92
Query: 259 ----ETIMYHQEFLLDPTQYVGEVIVAAGIKPVEFLRFECGE 296
E + Q F+ DP + VG+++ AG K F+RFE GE
Sbjct: 93 KFLKEVCLLEQPFVKDPKKTVGQLLKEAGAKVKRFVRFEVGE 134
>gnl|CDD|143601 cd07577, Ph0642_like, Pyrococcus horikoshii Ph0642 and related
proteins, members of the nitrilase superfamily
(putative class 13 nitrilases). Uncharacterized
subgroup of the nitrilase superfamily. This superfamily
is comprised of nitrile- or amide-hydrolyzing enzymes
and amide-condensing enzymes, which depend on a
Glu-Lys-Cys catalytic triad. Pyrococcus horikoshii
Ph0642 is a hypothetical protein belonging to this
subgroup. This superfamily has been classified in the
literature based on global and structure based sequence
analysis into thirteen different enzyme classes
(referred to as 1-13). This subgroup was classified as
belonging to class 13, which represents proteins that
at the time were difficult to place in a distinct
similarity group. Members of this superfamily generally
form homomeric complexes, the basic building block of
which is a homodimer.
Length = 259
Score = 31.9 bits (73), Expect = 0.37
Identities = 14/35 (40%), Positives = 21/35 (60%), Gaps = 1/35 (2%)
Query: 34 TGYSFTNCKKALDLNEQDL-GKAEKWLQEQAKELG 67
TGY+FT+ ++ L E G ++LQE A+E G
Sbjct: 41 TGYAFTSKEEVASLAESIPDGPTTRFLQELARETG 75
>gnl|CDD|143587 cd07197, nitrilase, Nitrilase superfamily, including nitrile- or
amide-hydrolyzing enzymes and amide-condensing enzymes.
This superfamily (also known as the C-N hydrolase
superfamily) contains hydrolases that break
carbon-nitrogen bonds; it includes nitrilases, cyanide
dihydratases, aliphatic amidases, N-terminal amidases,
beta-ureidopropionases, biotinidases, pantotheinase,
N-carbamyl-D-amino acid amidohydrolases, the
glutaminase domain of glutamine-dependent NAD+
synthetase, apolipoprotein N-acyltransferases, and
N-carbamoylputrescine amidohydrolases, among others.
These enzymes depend on a Glu-Lys-Cys catalytic triad,
and work through a thiol acylenzyme intermediate.
Members of this superfamily generally form homomeric
complexes, the basic building block of which is a
homodimer. These oligomers include dimers, tetramers,
hexamers, octamers, tetradecamers, octadecamers, as
well as variable length helical arrangements and
homo-oligomeric spirals. These proteins have roles in
vitamin and co-enzyme metabolism, in detoxifying small
molecules, in the synthesis of signaling molecules, and
in the post-translational modification of proteins.
They are used industrially, as biocatalysts in the fine
chemical and pharmaceutical industry, in cyanide
remediation, and in the treatment of toxic effluent.
This superfamily has been classified previously in the
literature, based on global and structure-based
sequence analysis, into thirteen different enzyme
classes (referred to as 1-13). This hierarchy includes
those thirteen classes and a few additional
subfamilies. A putative distant relative, the
plasmid-borne TraB family, has not been included in the
hierarchy.
Length = 253
Score = 31.5 bits (72), Expect = 0.49
Identities = 18/34 (52%), Positives = 22/34 (64%)
Query: 34 TGYSFTNCKKALDLNEQDLGKAEKWLQEQAKELG 67
TGYSF + K+ LDL E+ G + L E AKELG
Sbjct: 43 TGYSFESAKEDLDLAEELDGPTLEALAELAKELG 76
>gnl|CDD|201355 pfam00627, UBA, UBA/TS-N domain. This small domain is composed
of three alpha helices. This family includes the
previously defined UBA and TS-N domains. The UBA-domain
(ubiquitin associated domain) is a novel sequence motif
found in several proteins having connections to
ubiquitin and the ubiquitination pathway. The structure
of the UBA domain consists of a compact three helix
bundle. This domain is found at the N terminus of EF-TS
hence the name TS-N. The structure of EF-TS is known
and this domain is implicated in its interaction with
EF-TU. The domain has been found in non EF-TS proteins
such as alpha-NAC and MJ0280.
Length = 37
Score = 27.8 bits (63), Expect = 0.82
Identities = 11/37 (29%), Positives = 22/37 (59%), Gaps = 1/37 (2%)
Query: 23 NKSLLGKLRKKTGYSFTNCKKALDLNEQDLGKAEKWL 59
++ L+ +LR+ G+S KKAL ++ +A ++L
Sbjct: 2 DEELIKQLREM-GFSREEAKKALRATNGNVERAVEYL 37
>gnl|CDD|221900 pfam13024, DUF3884, Protein of unknown function (DUF3884). This
family of proteins is functionally uncharacterized.
However several proteins are annotated as Tagatose
1,6-diphosphate aldolase, but evidence to support this
could not be found. This family of proteins is found in
bacteria. Proteins in this family are typically between
61 and 106 amino acids in length. There are two
completely conserved residues (Y and F) that may be
functionally important.
Length = 77
Score = 27.8 bits (62), Expect = 1.6
Identities = 11/38 (28%), Positives = 16/38 (42%), Gaps = 2/38 (5%)
Query: 37 SFTNCKKALDLNEQDLGKAEKWLQEQAKELGWAKASKL 74
+F C + + +L + KW KE W SKL
Sbjct: 9 NFEKCPQIPLFDSNELKELGKWYVSTGKE--WICHSKL 44
>gnl|CDD|239221 cd02891, A2M_like, Proteins similar to alpha2-macroglobulin (alpha
(2)-M). Alpha (2)-M is a major carrier protein in
serum. It is a broadly specific proteinase inhibitor.
The structural thioester of alpha (2)-M, is involved in
the immobilization and entrapment of proteases. This
group contains another broadly specific proteinase
inhibitor: pregnancy zone protein (PZP). PZP is a
trace protein in the plasma of non-pregnant females and
males which is elevated in pregnancy. Alpha (2)-M and PZ
bind to placental protein-14 and may modulate its
activity in T-cell growth and cytokine production
thereby protecting the allogeneic fetus from attack by
the maternal immune system. This group also contains C3,
C4 and C5 of vertebrate complement. The vertebrate
complement is an effector of both the acquired and
innate immune systems The point of convergence of the
classical, alternative and lectin pathways of the
complement system is the proteolytic activation of C3.
C4 plays a key role in propagating the classical and
lectin pathways. C5 participates in the classical and
alternative pathways. The thioester bond located within
the structure of C3 and C4 is central to the function of
complement. C5 does not contain an active thioester
bond.
Length = 282
Score = 28.9 bits (65), Expect = 3.1
Identities = 11/35 (31%), Positives = 17/35 (48%)
Query: 34 TGYSFTNCKKALDLNEQDLGKAEKWLQEQAKELGW 68
+ +K +D++E L +A WL Q KE G
Sbjct: 85 VVKFLSQARKYIDVDENVLARALGWLVPQQKEDGS 119
>gnl|CDD|181452 PRK08507, PRK08507, prephenate dehydrogenase; Validated.
Length = 275
Score = 28.7 bits (65), Expect = 4.2
Identities = 18/59 (30%), Positives = 31/59 (52%), Gaps = 6/59 (10%)
Query: 192 THPSPGLEHTGP---ILGKF-GSLMVYQDL-KTGDKQQN-VQNVARQLCQHVIGMNPKS 244
HP G E++GP I G + G ++V D+ K+G+K Q + + L ++ M+ K
Sbjct: 112 AHPMAGTENSGPKAAIKGLYEGKVVVLCDVEKSGEKHQERAKEIFSGLGMRIVYMDAKE 170
>gnl|CDD|185377 PRK15480, PRK15480, glucose-1-phosphate thymidylyltransferase RfbA;
Provisional.
Length = 292
Score = 28.5 bits (63), Expect = 4.5
Identities = 23/86 (26%), Positives = 43/86 (50%), Gaps = 26/86 (30%)
Query: 81 QGLISIAVEGK---------HATMVEFNCETDFVARNKQFQGMAELISLACLNYTKTQVQ 131
QG +S+A+ G+ H +++E ++F+A ++ QG+ ++C
Sbjct: 210 QGRLSVAMMGRGYAWLDTGTHQSLIE---ASNFIATIEERQGL----KVSC--------- 253
Query: 132 PQE-PFAKVFLDTAQLQSLAGPENKS 156
P+E F K F+D Q++ LA P K+
Sbjct: 254 PEEIAFRKGFIDAEQVKVLAEPLKKN 279
>gnl|CDD|165707 PLN00139, PLN00139, hypothetical protein; Provisional.
Length = 320
Score = 28.4 bits (63), Expect = 4.5
Identities = 15/38 (39%), Positives = 19/38 (50%), Gaps = 1/38 (2%)
Query: 251 TPNEDPEEETIMYHQEFLLDPTQYVGEVIVAAGIKPVE 288
T NE P E I YH E +L + +VI+ I P E
Sbjct: 103 TANEGPLSEFIYYHHEMVL-IKESPKKVILFCEIPPPE 139
>gnl|CDD|179646 PRK03762, PRK03762, hypothetical protein; Provisional.
Length = 103
Score = 27.3 bits (61), Expect = 4.7
Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 3/44 (6%)
Query: 51 DLGKAEKWL---QEQAKELGWAKASKLAGRKTSQGLISIAVEGK 91
D K + L Q++AK+L A+K K+ GL+S++ GK
Sbjct: 6 DFSKLGEMLEQMQKKAKQLEEENANKEFTAKSGGGLVSVSANGK 49
>gnl|CDD|221021 pfam11193, DUF2812, Protein of unknown function (DUF2812). This
is a bacterial family of uncharacterized proteins,
however some members of this family are annotated as
membrane proteins.
Length = 113
Score = 26.4 bits (59), Expect = 8.2
Identities = 12/19 (63%), Positives = 13/19 (68%), Gaps = 1/19 (5%)
Query: 51 DLGKAEKWLQEQAKELGWA 69
DL K EKWL E AK+ GW
Sbjct: 2 DLEKEEKWLNEMAKK-GWR 19
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.130 0.373
Gapped
Lambda K H
0.267 0.0667 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,140,845
Number of extensions: 1403648
Number of successful extensions: 1093
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1077
Number of HSP's successfully gapped: 28
Length of query: 312
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 215
Effective length of database: 6,635,264
Effective search space: 1426581760
Effective search space used: 1426581760
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.5 bits)
S2: 59 (26.6 bits)