RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14480
         (190 letters)



>gnl|CDD|214614 smart00317, SET, SET (Su(var)3-9, Enhancer-of-zeste, Trithorax)
           domain.  Putative methyl transferase, based on outlier
           plant homologues.
          Length = 124

 Score = 42.7 bits (101), Expect = 1e-05
 Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 3/50 (6%)

Query: 107 ASFIEHNCYPNLYKSFT---DSGQVLLRAMKPIAPGDHLSICYTDPLWGT 153
           A FI H+C PN    F       ++++ A++ I PG+ L+I Y       
Sbjct: 75  ARFINHSCEPNCELLFVEVNGDDRIVIFALRDIKPGEELTIDYGSDYANE 124


>gnl|CDD|216155 pfam00856, SET, SET domain.  SET domains are protein lysine
           methyltransferase enzymes. SET domains appear to be
           protein-protein interaction domains. It has been
           demonstrated that SET domains mediate interactions with
           a family of proteins that display similarity with
           dual-specificity phosphatases (dsPTPases). A subset of
           SET domains have been called PR domains. These domains
           are divergent in sequence from other SET domains, but
           also appear to mediate protein-protein interaction. The
           SET domain consists of two regions known as SET-N and
           SET-C. SET-C forms an unusual and conserved knot-like
           structure of probably functional importance.
           Additionally to SET-N and SET-C, an insert region
           (SET-I) and flanking regions of high structural
           variability form part of the overall structure.
          Length = 113

 Score = 37.9 bits (88), Expect = 5e-04
 Identities = 16/43 (37%), Positives = 25/43 (58%), Gaps = 3/43 (6%)

Query: 107 ASFIEHNCYPNLYKSFT---DSGQVLLRAMKPIAPGDHLSICY 146
           A FI H+C PN    F       ++++RA++ I PG+ L+I Y
Sbjct: 70  ARFINHSCEPNCEVRFVFVNGGDRIVVRALRDIKPGEELTIDY 112


>gnl|CDD|225491 COG2940, COG2940, Proteins containing SET domain [General function
           prediction only].
          Length = 480

 Score = 36.3 bits (84), Expect = 0.006
 Identities = 19/76 (25%), Positives = 30/76 (39%), Gaps = 3/76 (3%)

Query: 104 FDRASFIEHNCYPNLYKSF---TDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
            D A FI H+C PN   S        ++ + A++ I  G+ L+  Y   L      +  L
Sbjct: 403 GDVARFINHSCTPNCEASPIEVNGIFKISIYAIRDIKAGEELTYDYGPSLEDNRELKKLL 462

Query: 161 QTSKYFICQCERCRDP 176
           +      C  +RC   
Sbjct: 463 EKRWGCACGEDRCSHT 478


>gnl|CDD|148445 pfam06838, Alum_res, Aluminium resistance protein.  This family
           represents the aluminium resistance protein, which
           confers resistance to aluminium in bacteria. Structural
           characterisation has shown proteins in this family bind
           a pyridoxal-5'-phosphate cofactor, and that they belongs
           to the PLP dependent aminotransferase superfamily.
           Crystal packing analysis suggests that in solution, a
           tetramer may be the stable oligomeric form. This is in
           contrast to most other aminotransferases which form
           dimers (information from TOPSAN).
          Length = 405

 Score = 30.4 bits (69), Expect = 0.49
 Identities = 14/58 (24%), Positives = 28/58 (48%), Gaps = 13/58 (22%)

Query: 47  YKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQ----VNAHEMPLTEPSY 100
           +   PKY   R +++Q +    K  ++   E+++  C  IQ    +++H  P  EP+ 
Sbjct: 299 FPVLPKYNEKRTDLIQAV----KFGDK---EKLIAFCQAIQAASPIDSHVDP--EPAD 347


>gnl|CDD|225652 COG3110, COG3110, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 216

 Score = 29.0 bits (65), Expect = 1.2
 Identities = 14/78 (17%), Positives = 27/78 (34%), Gaps = 11/78 (14%)

Query: 103 IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQT 162
           +      IE +  P+         Q++ R +K I  G    +  + PL  T N       
Sbjct: 43  LLKGKDSIELDDGPH---------QIVFRVIKTIRLGSDRRLYISPPLIVTFNA--QNID 91

Query: 163 SKYFICQCERCRDPTELN 180
             +F+ +    R+  +  
Sbjct: 92  LVFFLPRLRNEREANKFI 109


>gnl|CDD|213866 TIGR03818, MotA1, flagellar motor stator protein MotA.  The MotA
           protein, along with its partner MotB, comprise the
           stator complex of the bacterial flagellar motor. MotAB
           span the cytoplasmic membrane and undergo conformational
           changes powered by the translocation of protons. These
           conformational changes in turn are communicated to the
           rotor assembly, producing torque. This model represents
           one family of MotA proteins which are often not
           identified by the "transporter, MotA/TolQ/ExbB proton
           channel family" model, pfam01618.
          Length = 282

 Score = 29.0 bits (66), Expect = 1.5
 Identities = 11/39 (28%), Positives = 23/39 (58%), Gaps = 5/39 (12%)

Query: 37  LMQLEAHVEEYKNSP---KYENDRRN--VVQFLLNFFKL 70
           LM +E+H+E  + S    KY    ++  +V+F+ ++ +L
Sbjct: 94  LMAIESHIENPEESSIFSKYPKILKDHHLVEFICDYLRL 132


>gnl|CDD|216598 pfam01602, Adaptin_N, Adaptin N terminal region.  This family
           consists of the N terminal region of various alpha, beta
           and gamma subunits of the AP-1, AP-2 and AP-3 adaptor
           protein complexes. The adaptor protein (AP) complexes
           are involved in the formation of clathrin-coated pits
           and vesicles. The N-terminal region of the various
           adaptor proteins (APs) is constant by comparison to the
           C-terminal which is variable within members of the AP-2
           family; and it has been proposed that this constant
           region interacts with another uniform component of the
           coated vesicles.
          Length = 522

 Score = 28.7 bits (65), Expect = 2.1
 Identities = 24/124 (19%), Positives = 42/124 (33%), Gaps = 21/124 (16%)

Query: 13  PSYECIIALRCLYK---KSHQPQIWDKLMQ--LEAHVEEYKN---------SPKYENDRR 58
                + AL  LYK   +S+  +I  +L++   E    E+K          + K+  D  
Sbjct: 315 DISIRLRALDLLYKLVDESNVKEIVKELLKYVSEIADPEFKIKLVKAIGRLAEKFPTDAE 374

Query: 59  NVVQFLLNFFKLNEEFTQEEIL-KICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPN 117
             +  LL    L   +  +EI+  I  II+         E     +      +E    P 
Sbjct: 375 WCIDVLLELLSLAGSYVVDEIVEVIRDIIRKYP------ELREYILEHLCELLEDIESPE 428

Query: 118 LYKS 121
              +
Sbjct: 429 ARAA 432


>gnl|CDD|187718 cd08649, FMT_core_NRPS_like, N-terminal formyl transferase
           catalytic core domain of NRPS_like proteins, one of the
           proteins involved in the synthesis of Oxazolomycin.
           This family represents the N-terminal formyl transferase
           catalytic core domain present in a subgroup of
           non-ribosomal peptide synthetases. In Streptomyces albus
           a member of this family has been shown to be involved in
           the synthesis of oxazolomycin (OZM). OZM is a hybrid
           peptide-polyketide antibiotic and exhibits potent
           antitumor and antiviral activities. It is a multi-domain
           protein consisting of a formyl transferase domain, a
           Flavin-utilizing monoxygenase domain, a LuxE domain
           functioning as an acyl protein synthetase and a
           pp-binding domain, which may function as an acyl
           carrier. It shows sequence similarity with other
           peptide-polyketide biosynthesis proteins.
          Length = 166

 Score = 28.0 bits (63), Expect = 2.3
 Identities = 17/71 (23%), Positives = 29/71 (40%), Gaps = 11/71 (15%)

Query: 77  EEILKICGIIQVNAHEMPLTEPSYI-------AIFDRASFIEHNCYPNLYKSFTDSGQVL 129
            E+L +     +N H+ PL  P Y        A+    +   H    +  +   D+G +L
Sbjct: 76  SEVLALPRKGAINFHDGPL--PRYAGLNATSWALLAGET--RHGVTWHRIEEGVDAGDIL 131

Query: 130 LRAMKPIAPGD 140
           ++    IAP D
Sbjct: 132 VQRPFDIAPDD 142


>gnl|CDD|237306 PRK13208, valS, valyl-tRNA synthetase; Reviewed.
          Length = 800

 Score = 28.2 bits (64), Expect = 3.3
 Identities = 11/44 (25%), Positives = 17/44 (38%), Gaps = 7/44 (15%)

Query: 44  VEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQ 87
           VE+Y    K +  R         F +L  E T E+  K   + +
Sbjct: 95  VEKYYGIRKDDISRE-------EFIELCRELTDEDEKKFRELWR 131


>gnl|CDD|222408 pfam13835, DUF4194, Domain of unknown function (DUF4194). 
          Length = 165

 Score = 27.2 bits (61), Expect = 3.7
 Identities = 9/32 (28%), Positives = 16/32 (50%), Gaps = 2/32 (6%)

Query: 18 IIAL--RCLYKKSHQPQIWDKLMQLEAHVEEY 47
          +IAL           P++WD L++ +  V +Y
Sbjct: 1  LIALLKGPYISGEKHPKLWDVLLRYQEAVRDY 32


>gnl|CDD|222791 PHA00431, PHA00431, internal virion protein C.
          Length = 746

 Score = 27.8 bits (62), Expect = 4.5
 Identities = 11/42 (26%), Positives = 20/42 (47%), Gaps = 1/42 (2%)

Query: 25  YKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLN 66
           YK+    + W+ LM ++A   +++N  K     R  +   LN
Sbjct: 280 YKELMGEEQWNALM-VKAQETQFQNDAKLTEQFRLGINSALN 320


>gnl|CDD|223268 COG0190, FolD, 5,10-methylene-tetrahydrofolate
          dehydrogenase/Methenyl tetrahydrofolate cyclohydrolase
          [Coenzyme metabolism].
          Length = 283

 Score = 27.1 bits (61), Expect = 5.5
 Identities = 9/30 (30%), Positives = 16/30 (53%), Gaps = 2/30 (6%)

Query: 63 FLLNFFKLNEEFTQEEILKICGIIQVNAHE 92
               + L E+ T+EE+L +  I ++NA  
Sbjct: 60 IASELYDLPEDITEEELLAL--IDELNADP 87


>gnl|CDD|219695 pfam08011, DUF1703, Protein of unknown function (DUF1703).  This
          family contains many hypothetical bacterial proteins.
          It has been identified as a member of the PD-(D/E)XK
          nuclease superfamily through transitive meta profile
          searches. DUF1703 has the predicted secondary structure
          pattern of the restriction endonuclease-like fold core
          and contains an additional beta-strand at the
          C-terminus.
          Length = 101

 Score = 26.0 bits (58), Expect = 5.8
 Identities = 11/43 (25%), Positives = 22/43 (51%), Gaps = 2/43 (4%)

Query: 27 KSHQPQIWDKLMQLEA--HVEEYKNSPKYENDRRNVVQFLLNF 67
          K  + +  DK +  EA   ++EY  + KY+ D + + +  + F
Sbjct: 49 KYIKVKDSDKEIAEEALEQIKEYGYAEKYKGDGKKIKKIGIVF 91


>gnl|CDD|181414 PRK08410, PRK08410, 2-hydroxyacid dehydrogenase; Provisional.
          Length = 311

 Score = 26.9 bits (60), Expect = 7.2
 Identities = 13/30 (43%), Positives = 18/30 (60%), Gaps = 6/30 (20%)

Query: 71  NEEFTQ---EEILKICGIIQVNAHEMPLTE 97
           NEE+ +   EE+LK   II ++A   PL E
Sbjct: 182 NEEYERVSLEELLKTSDIISIHA---PLNE 208


>gnl|CDD|234344 TIGR03754, conj_TOL_TraD, conjugative coupling factor TraD, TOL
           family.  Members of this protein are assigned by
           homology to the TraD family of conjugative coupling
           factor. This particular clade serves as a marker for an
           extended gene region that occurs occasionally on
           plasmids, including the toluene catabolism TOL plasmid.
           More commonly, the gene region is chromosomal, flanked
           by various markers of conjugative transfer and
           insertion.
          Length = 643

 Score = 26.9 bits (60), Expect = 8.9
 Identities = 10/40 (25%), Positives = 21/40 (52%), Gaps = 5/40 (12%)

Query: 29  HQPQIWDKLMQLEAHVEEYKNSPKYENDRRN----VVQFL 64
           H P+ W+ + +L A +++ +N P+    R      + Q+L
Sbjct: 331 HDPKAWELIEELIARLKD-RNVPRNMQGRPPRVVAIEQYL 369


>gnl|CDD|223845 COG0774, LpxC, UDP-3-O-acyl-N-acetylglucosamine deacetylase [Cell
           envelope biogenesis, outer membrane].
          Length = 300

 Score = 26.5 bits (59), Expect = 9.2
 Identities = 24/109 (22%), Positives = 40/109 (36%), Gaps = 30/109 (27%)

Query: 85  IIQVNAHEMPLTEPS---YIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPI----- 136
           II+V+  E+P+ + S   ++ + D A   E N      K F       +R  KP+     
Sbjct: 93  IIEVDGPEIPIMDGSAAPFVELIDEAGIKEQNA----KKKF-------IRIKKPVRVEDG 141

Query: 137 ------APGDHLSICYT----DPLWGTINRRHHL-QTSKYFICQCERCR 174
                  P D   + YT     P  G   +      +++ F  +  R R
Sbjct: 142 DKWAEFTPYDGFRLSYTIDFDHPAIGRQWQSFEFDFSAESFRKEIARAR 190


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.138    0.443 

Gapped
Lambda     K      H
   0.267   0.0717    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,547,695
Number of extensions: 852796
Number of successful extensions: 774
Number of sequences better than 10.0: 1
Number of HSP's gapped: 770
Number of HSP's successfully gapped: 33
Length of query: 190
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 99
Effective length of database: 6,901,388
Effective search space: 683237412
Effective search space used: 683237412
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 56 (25.4 bits)