RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17364
         (312 letters)



>gnl|CDD|236491 PRK09377, tsf, elongation factor Ts; Provisional.
          Length = 290

 Score =  173 bits (442), Expect = 1e-52
 Identities = 97/311 (31%), Positives = 151/311 (48%), Gaps = 70/311 (22%)

Query: 25  SLLGKLRKKTGYSFTNCKKALDLNEQDLGKAEKWLQEQAKELGWAKASKLAGRKTSQGLI 84
           +L+ +LR++TG    +CKKAL   + D+ KA +WL+++    G AKA+K AGR  ++GL+
Sbjct: 7   ALVKELRERTGAGMMDCKKALTEADGDIEKAIEWLRKK----GLAKAAKKAGRVAAEGLV 62

Query: 85  SIAVEGKHATMVEFNCETDFVARNKQFQGMAELISLACLNYTKTQVQPQEPFAKVFLDTA 144
           +  V+G    +VE N ETDFVA+N+ FQ +A  ++ A L      V   E    + LD  
Sbjct: 63  AAKVDGNKGVLVEVNSETDFVAKNEDFQALANEVAEAALAAKPADV---EALLALKLD-- 117

Query: 145 QLQSLAGPENKSLADHVAILISSVGENLVLRRAACVTVNEDHDVAGFTHPSPGLEHTGPI 204
                      ++ +    LI+ +GEN+ LRR A +  +    V  + H           
Sbjct: 118 ---------GGTVEEARTELIAKIGENISLRRFARLEKDGGV-VGSYLHG---------- 157

Query: 205 LGKFGSLMVYQDLKTGDKQQNVQNVARQLCQHVIGMNPKSIGSEEDTPNED--------- 255
            G+ G L+    L+ GD++     +A+ +  H+  MNP+ + S ED P E          
Sbjct: 158 GGRIGVLVA---LEGGDEE-----LAKDIAMHIAAMNPEYL-SREDVPAEVVEKEREIAK 208

Query: 256 --------PEE---------------ETIMYHQEFLLDPTQYVGEVIVAAGIKPVEFLRF 292
                   PEE               E ++  Q F+ DP + VG+++  AG K V F+RF
Sbjct: 209 EQAKEEGKPEEIVEKIVEGRLNKFLKEVVLLEQPFVKDPKKTVGQLLKEAGAKVVGFVRF 268

Query: 293 ECGEGCEESEE 303
           E GEG E+ EE
Sbjct: 269 EVGEGIEKKEE 279


>gnl|CDD|223342 COG0264, Tsf, Translation elongation factor Ts [Translation,
           ribosomal structure and biogenesis].
          Length = 296

 Score =  169 bits (430), Expect = 8e-51
 Identities = 100/313 (31%), Positives = 150/313 (47%), Gaps = 69/313 (22%)

Query: 25  SLLGKLRKKTGYSFTNCKKALDLNEQDLGKAEKWLQEQAKELGWAKASKLAGRKTSQGLI 84
           +L+ +LR+KTG    +CKKAL+    D+ KA +WL    +E G AKA+K AGR  ++GLI
Sbjct: 7   ALVKELREKTGAGMMDCKKALEEANGDIEKAIEWL----REKGIAKAAKKAGRIAAEGLI 62

Query: 85  SIAVE--GKHATMVEFNCETDFVARNKQFQGMAELISLACLNYTKTQVQPQEPFAKVFLD 142
           +  V+  GK A +VE NCETDFVA+N +FQ +A  I+             +    K   D
Sbjct: 63  AAKVDGDGKKAVLVEVNCETDFVAKNAEFQELANKIA-------------KAALEKKPAD 109

Query: 143 TAQLQSLAGPENKSLADHVAILISSVGENLVLRRAACVTVNEDHDVAGFTHPSPGLEHTG 202
             +L++   P  K++ + +A LI+ +GEN+ LRR A V    D  V  + H +       
Sbjct: 110 VEELKAAFEPGGKTVEEEIAALIAKIGENISLRRFA-VLEAGDGVVGSYLHGN------- 161

Query: 203 PILGKFGSLMVYQDLKTGDKQQNVQNVARQLCQHVIGMNPKSIGSEEDTPNED------- 255
              G+ G L+  +           + +A+ +  H+  MNP+ + S ED P E        
Sbjct: 162 ---GRIGVLVALK------GGAADEELAKDIAMHIAAMNPQYL-SREDVPAEVVEKEREI 211

Query: 256 ----------PE---------------EETIMYHQEFLLDPTQYVGEVIVAAGIKPVEFL 290
                     PE                E  +  Q F+ DP + V +++  A  K  EF+
Sbjct: 212 FLAQLKAEGKPENIVEKIVEGRMNKFLAEVCLLEQPFVKDPKKTVEQLLKEANAKVTEFV 271

Query: 291 RFECGEGCEESEE 303
           RFE GEG E+ EE
Sbjct: 272 RFEVGEGIEKKEE 284


>gnl|CDD|232832 TIGR00116, tsf, translation elongation factor Ts.  Translational
           elongation factor Ts (EF-Ts) catalyzes the exchange of
           GTP for the GDP of the EF-Tu.GDP complex as part of the
           cycle of translation elongation. This protein is found
           in Bacteria, mitochondria, and chloroplasts [Protein
           synthesis, Translation factors].
          Length = 291

 Score =  127 bits (321), Expect = 7e-35
 Identities = 86/310 (27%), Positives = 137/310 (44%), Gaps = 67/310 (21%)

Query: 25  SLLGKLRKKTGYSFTNCKKALDLNEQDLGKAEKWLQEQAKELGWAKASKLAGRKTSQGLI 84
            L+ +LR++TG    +CKKAL     D  KA KWL+E+    G AKA+K A R  ++G+I
Sbjct: 7   QLVKELRERTGAGMMDCKKALVEANGDFEKAIKWLREK----GIAKAAKKADRVAAEGVI 62

Query: 85  SIAVEGKHATMVEFNCETDFVARNKQFQGMAELISLACLNYTKTQVQPQEPFAKVFLDTA 144
            +  +   A +VE N ETDFVA+N+ F+  A  +         T ++             
Sbjct: 63  VLKSDNHKAVIVEVNSETDFVAKNEGFKEFANKLLDELKANPITTLE-----------EL 111

Query: 145 QLQSLAGPENKSLADHVAILISSVGENLVLRRAACVTVNEDHDVAGFTHPSPGLEHTGPI 204
           Q Q L   ENK   +++A L + +GEN+VLRR A V   + + +  + H +         
Sbjct: 112 QKQEL---ENKEKVEYLASLAAKIGENIVLRRVA-VLEGQSNVIFSYLHAN--------- 158

Query: 205 LGKFGSLMVYQDLKTGDKQQNVQNVARQLCQHVIGMNPKSIGSEEDTPNEDPEEETI--- 261
             + G L          K +  + +A+ L  HV    P+ I  ++ +     +E  I   
Sbjct: 159 -ARIGVL-------VELKGKADEELAKHLAMHVAANKPQFIDQDDVSQEWVKKERQIITD 210

Query: 262 ----------------------------MYHQEFLLDPTQYVGEVIVAAGIKPVEFLRFE 293
                                       +  Q+F++DP++ VG+ +     K  EF+RFE
Sbjct: 211 QAELSGKPKEILEKMVEGRMKKFLAEISLLGQKFVMDPSKTVGQFLKEKNAKVTEFVRFE 270

Query: 294 CGEGCEESEE 303
            GEG E+  E
Sbjct: 271 VGEGIEKKAE 280


>gnl|CDD|183447 PRK12332, tsf, elongation factor Ts; Reviewed.
          Length = 198

 Score = 91.9 bits (229), Expect = 2e-22
 Identities = 61/184 (33%), Positives = 92/184 (50%), Gaps = 40/184 (21%)

Query: 29  KLRKKTGYSFTNCKKALDLNEQDLGKAEKWLQEQAKELGWAKASKLAGRKTSQGLIS--I 86
           +LR+KTG    +CKKAL+    D+ KA +WL+E+    G AKA+K AGR  ++GL+   I
Sbjct: 10  ELREKTGAGMMDCKKALEEANGDMEKAIEWLREK----GLAKAAKKAGRVAAEGLVGSYI 65

Query: 87  AVEGKHATMVEFNCETDFVARNKQFQGMAELISL----ACLNYTKTQVQPQEPFAK-VFL 141
              G+   +VE NCETDFVAR ++F+ +A+ I++    A   Y   +  P E   K   +
Sbjct: 66  HTGGRIGVLVELNCETDFVARTEEFKELAKDIAMQIAAANPEYVSREDVPAEVIEKEKEI 125

Query: 142 DTAQLQSLAGPEN-----------------------------KSLADHVAILISSVGENL 172
             AQ  +   PEN                             K++ D +   I+ +GEN+
Sbjct: 126 YRAQALNEGKPENIVEKIVEGRIEKFYKEVCLLEQPFIKDPSKTVEDLIKEAIAKIGENI 185

Query: 173 VLRR 176
           V+RR
Sbjct: 186 VVRR 189



 Score = 35.6 bits (83), Expect = 0.014
 Identities = 27/109 (24%), Positives = 41/109 (37%), Gaps = 39/109 (35%)

Query: 227 QNVARQLCQHVIGMNPKSIGSEEDTPNED-----------------PE------------ 257
           + +A+ +   +   NP+ + S ED P E                  PE            
Sbjct: 91  KELAKDIAMQIAAANPEYV-SREDVPAEVIEKEKEIYRAQALNEGKPENIVEKIVEGRIE 149

Query: 258 ---EETIMYHQEFLLDPTQYVGEVIVAAGIKPVE------FLRFECGEG 297
              +E  +  Q F+ DP++ V ++I  A  K  E      F RFE GEG
Sbjct: 150 KFYKEVCLLEQPFIKDPSKTVEDLIKEAIAKIGENIVVRRFARFELGEG 198


>gnl|CDD|214362 CHL00098, tsf, elongation factor Ts.
          Length = 200

 Score = 82.4 bits (204), Expect = 8e-19
 Identities = 47/113 (41%), Positives = 66/113 (58%), Gaps = 11/113 (9%)

Query: 29  KLRKKTGYSFTNCKKALDLNEQDLGKAEKWLQEQAKELGWAKASKLAGRKTSQGLIS--I 86
           +LR KTG    +CKKAL     D  KA + L+++    G A A+K + R T++GLI   I
Sbjct: 7   ELRDKTGAGMMDCKKALQEANGDFEKALESLRQK----GLASANKKSTRITTEGLIESYI 62

Query: 87  AVEGKHATMVEFNCETDFVARNKQFQGMAELISL---ACLN--YTKTQVQPQE 134
              GK   +VE NCETDFVAR ++FQ +A+ I++   AC N  Y   +  P+E
Sbjct: 63  HTGGKLGVLVEINCETDFVARREEFQKLAKNIAMQIAACPNVKYVSLEDIPEE 115


>gnl|CDD|189759 pfam00889, EF_TS, Elongation factor TS. 
          Length = 134

 Score = 62.1 bits (152), Expect = 4e-12
 Identities = 17/37 (45%), Positives = 29/37 (78%)

Query: 79  TSQGLISIAVEGKHATMVEFNCETDFVARNKQFQGMA 115
            ++GL+++ ++G    +VE NCETDFVA+N++FQ +A
Sbjct: 1   AAEGLVAVYIDGNKGVLVEVNCETDFVAKNEKFQELA 37



 Score = 56.7 bits (138), Expect = 4e-10
 Identities = 28/102 (27%), Positives = 42/102 (41%), Gaps = 33/102 (32%)

Query: 227 QNVARQLCQHVIGMNPKSIGSEEDTPNED-----------------PEE----------- 258
           Q +A+ +  H+  MNP+ + S +D P E                  PE            
Sbjct: 34  QELAKDIAMHIAAMNPQYL-SRDDVPAEVLEKEREILKAQAKEEGKPENIIEKIVEGRLN 92

Query: 259 ----ETIMYHQEFLLDPTQYVGEVIVAAGIKPVEFLRFECGE 296
               E  +  Q F+ DP + VG+++  AG K   F+RFE GE
Sbjct: 93  KFLKEVCLLEQPFVKDPKKTVGQLLKEAGAKVKRFVRFEVGE 134


>gnl|CDD|143601 cd07577, Ph0642_like, Pyrococcus horikoshii Ph0642 and related
          proteins, members of the nitrilase superfamily
          (putative class 13 nitrilases).  Uncharacterized
          subgroup of the nitrilase superfamily. This superfamily
          is comprised of nitrile- or amide-hydrolyzing enzymes
          and amide-condensing enzymes, which depend on a
          Glu-Lys-Cys catalytic triad. Pyrococcus horikoshii
          Ph0642 is a hypothetical protein belonging to this
          subgroup. This superfamily has been classified in the
          literature based on global and structure based sequence
          analysis into thirteen different enzyme classes
          (referred to as 1-13). This subgroup was classified as
          belonging to class 13, which represents proteins that
          at the time were difficult to place in a distinct
          similarity group. Members of this superfamily generally
          form homomeric complexes, the basic building block of
          which is a homodimer.
          Length = 259

 Score = 31.9 bits (73), Expect = 0.37
 Identities = 14/35 (40%), Positives = 21/35 (60%), Gaps = 1/35 (2%)

Query: 34 TGYSFTNCKKALDLNEQDL-GKAEKWLQEQAKELG 67
          TGY+FT+ ++   L E    G   ++LQE A+E G
Sbjct: 41 TGYAFTSKEEVASLAESIPDGPTTRFLQELARETG 75


>gnl|CDD|143587 cd07197, nitrilase, Nitrilase superfamily, including nitrile- or
          amide-hydrolyzing enzymes and amide-condensing enzymes.
           This superfamily (also known as the C-N hydrolase
          superfamily) contains hydrolases that break
          carbon-nitrogen bonds; it includes nitrilases, cyanide
          dihydratases, aliphatic amidases, N-terminal amidases,
          beta-ureidopropionases, biotinidases, pantotheinase,
          N-carbamyl-D-amino acid amidohydrolases, the
          glutaminase domain of glutamine-dependent NAD+
          synthetase, apolipoprotein N-acyltransferases, and
          N-carbamoylputrescine amidohydrolases, among others.
          These enzymes depend on a Glu-Lys-Cys catalytic triad,
          and work through a thiol acylenzyme intermediate.
          Members of this superfamily generally form homomeric
          complexes, the basic building block of which is a
          homodimer. These oligomers include dimers, tetramers,
          hexamers, octamers, tetradecamers, octadecamers, as
          well as variable length helical arrangements and
          homo-oligomeric spirals. These proteins have roles in
          vitamin and co-enzyme metabolism, in detoxifying small
          molecules, in the synthesis of signaling molecules, and
          in the post-translational modification of proteins.
          They are used industrially, as biocatalysts in the fine
          chemical and pharmaceutical industry, in cyanide
          remediation, and in the treatment of toxic effluent.
          This superfamily has been classified previously in the
          literature, based on global and structure-based
          sequence analysis, into thirteen different enzyme
          classes (referred to as 1-13). This hierarchy includes
          those thirteen classes and a few additional
          subfamilies. A putative distant relative, the
          plasmid-borne TraB family, has not been included in the
          hierarchy.
          Length = 253

 Score = 31.5 bits (72), Expect = 0.49
 Identities = 18/34 (52%), Positives = 22/34 (64%)

Query: 34 TGYSFTNCKKALDLNEQDLGKAEKWLQEQAKELG 67
          TGYSF + K+ LDL E+  G   + L E AKELG
Sbjct: 43 TGYSFESAKEDLDLAEELDGPTLEALAELAKELG 76


>gnl|CDD|201355 pfam00627, UBA, UBA/TS-N domain.  This small domain is composed
          of three alpha helices. This family includes the
          previously defined UBA and TS-N domains. The UBA-domain
          (ubiquitin associated domain) is a novel sequence motif
          found in several proteins having connections to
          ubiquitin and the ubiquitination pathway. The structure
          of the UBA domain consists of a compact three helix
          bundle. This domain is found at the N terminus of EF-TS
          hence the name TS-N. The structure of EF-TS is known
          and this domain is implicated in its interaction with
          EF-TU. The domain has been found in non EF-TS proteins
          such as alpha-NAC and MJ0280.
          Length = 37

 Score = 27.8 bits (63), Expect = 0.82
 Identities = 11/37 (29%), Positives = 22/37 (59%), Gaps = 1/37 (2%)

Query: 23 NKSLLGKLRKKTGYSFTNCKKALDLNEQDLGKAEKWL 59
          ++ L+ +LR+  G+S    KKAL     ++ +A ++L
Sbjct: 2  DEELIKQLREM-GFSREEAKKALRATNGNVERAVEYL 37


>gnl|CDD|221900 pfam13024, DUF3884, Protein of unknown function (DUF3884).  This
          family of proteins is functionally uncharacterized.
          However several proteins are annotated as Tagatose
          1,6-diphosphate aldolase, but evidence to support this
          could not be found. This family of proteins is found in
          bacteria. Proteins in this family are typically between
          61 and 106 amino acids in length. There are two
          completely conserved residues (Y and F) that may be
          functionally important.
          Length = 77

 Score = 27.8 bits (62), Expect = 1.6
 Identities = 11/38 (28%), Positives = 16/38 (42%), Gaps = 2/38 (5%)

Query: 37 SFTNCKKALDLNEQDLGKAEKWLQEQAKELGWAKASKL 74
          +F  C +    +  +L +  KW     KE  W   SKL
Sbjct: 9  NFEKCPQIPLFDSNELKELGKWYVSTGKE--WICHSKL 44


>gnl|CDD|239221 cd02891, A2M_like, Proteins similar to alpha2-macroglobulin (alpha
           (2)-M).  Alpha (2)-M is a major carrier protein in
           serum. It is a broadly specific proteinase inhibitor.
           The structural thioester of alpha (2)-M, is involved in
           the immobilization and entrapment of proteases. This
           group contains another broadly specific proteinase
           inhibitor:  pregnancy zone protein (PZP).  PZP is a
           trace protein in the plasma of non-pregnant females and
           males which is elevated in pregnancy. Alpha (2)-M and PZ
           bind to placental protein-14 and may modulate its
           activity in T-cell growth and cytokine production
           thereby protecting the allogeneic fetus from attack by
           the maternal immune system. This group also contains C3,
           C4 and C5 of vertebrate complement.  The vertebrate
           complement is an effector of both the acquired and
           innate immune systems The point of convergence of the
           classical, alternative and lectin pathways of the
           complement system is the proteolytic activation of C3.
           C4 plays a key role in propagating the classical and
           lectin pathways. C5 participates in the classical and
           alternative pathways. The thioester bond located within
           the structure of C3 and C4 is central to the function of
           complement. C5 does not contain an active thioester
           bond.
          Length = 282

 Score = 28.9 bits (65), Expect = 3.1
 Identities = 11/35 (31%), Positives = 17/35 (48%)

Query: 34  TGYSFTNCKKALDLNEQDLGKAEKWLQEQAKELGW 68
                +  +K +D++E  L +A  WL  Q KE G 
Sbjct: 85  VVKFLSQARKYIDVDENVLARALGWLVPQQKEDGS 119


>gnl|CDD|181452 PRK08507, PRK08507, prephenate dehydrogenase; Validated.
          Length = 275

 Score = 28.7 bits (65), Expect = 4.2
 Identities = 18/59 (30%), Positives = 31/59 (52%), Gaps = 6/59 (10%)

Query: 192 THPSPGLEHTGP---ILGKF-GSLMVYQDL-KTGDKQQN-VQNVARQLCQHVIGMNPKS 244
            HP  G E++GP   I G + G ++V  D+ K+G+K Q   + +   L   ++ M+ K 
Sbjct: 112 AHPMAGTENSGPKAAIKGLYEGKVVVLCDVEKSGEKHQERAKEIFSGLGMRIVYMDAKE 170


>gnl|CDD|185377 PRK15480, PRK15480, glucose-1-phosphate thymidylyltransferase RfbA;
           Provisional.
          Length = 292

 Score = 28.5 bits (63), Expect = 4.5
 Identities = 23/86 (26%), Positives = 43/86 (50%), Gaps = 26/86 (30%)

Query: 81  QGLISIAVEGK---------HATMVEFNCETDFVARNKQFQGMAELISLACLNYTKTQVQ 131
           QG +S+A+ G+         H +++E    ++F+A  ++ QG+     ++C         
Sbjct: 210 QGRLSVAMMGRGYAWLDTGTHQSLIE---ASNFIATIEERQGL----KVSC--------- 253

Query: 132 PQE-PFAKVFLDTAQLQSLAGPENKS 156
           P+E  F K F+D  Q++ LA P  K+
Sbjct: 254 PEEIAFRKGFIDAEQVKVLAEPLKKN 279


>gnl|CDD|165707 PLN00139, PLN00139, hypothetical protein; Provisional.
          Length = 320

 Score = 28.4 bits (63), Expect = 4.5
 Identities = 15/38 (39%), Positives = 19/38 (50%), Gaps = 1/38 (2%)

Query: 251 TPNEDPEEETIMYHQEFLLDPTQYVGEVIVAAGIKPVE 288
           T NE P  E I YH E +L   +   +VI+   I P E
Sbjct: 103 TANEGPLSEFIYYHHEMVL-IKESPKKVILFCEIPPPE 139


>gnl|CDD|179646 PRK03762, PRK03762, hypothetical protein; Provisional.
          Length = 103

 Score = 27.3 bits (61), Expect = 4.7
 Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 3/44 (6%)

Query: 51 DLGKAEKWL---QEQAKELGWAKASKLAGRKTSQGLISIAVEGK 91
          D  K  + L   Q++AK+L    A+K    K+  GL+S++  GK
Sbjct: 6  DFSKLGEMLEQMQKKAKQLEEENANKEFTAKSGGGLVSVSANGK 49


>gnl|CDD|221021 pfam11193, DUF2812, Protein of unknown function (DUF2812).  This
          is a bacterial family of uncharacterized proteins,
          however some members of this family are annotated as
          membrane proteins.
          Length = 113

 Score = 26.4 bits (59), Expect = 8.2
 Identities = 12/19 (63%), Positives = 13/19 (68%), Gaps = 1/19 (5%)

Query: 51 DLGKAEKWLQEQAKELGWA 69
          DL K EKWL E AK+ GW 
Sbjct: 2  DLEKEEKWLNEMAKK-GWR 19


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.130    0.373 

Gapped
Lambda     K      H
   0.267   0.0667    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,140,845
Number of extensions: 1403648
Number of successful extensions: 1093
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1077
Number of HSP's successfully gapped: 28
Length of query: 312
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 215
Effective length of database: 6,635,264
Effective search space: 1426581760
Effective search space used: 1426581760
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.5 bits)
S2: 59 (26.6 bits)