RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16312
         (245 letters)



>gnl|CDD|233708 TIGR02072, BioC, biotin biosynthesis protein BioC.  This enzyme,
           which is found in biotin biosynthetic gene clusters in
           proteobacteria, firmicutes, green-sulfur bacteria,
           fusobacterium and bacteroides, is believed to carry out
           an enzymatic step prior to the formation of pimeloyl-CoA
           (although attribution of this annotation is not
           traceable). The enzyme appears related to
           methyltransferases by homology [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Biotin].
          Length = 240

 Score = 97.0 bits (242), Expect = 2e-24
 Identities = 46/202 (22%), Positives = 86/202 (42%), Gaps = 26/202 (12%)

Query: 3   DNCESILSQAEPPEDTEVIYEKKFVDEEHLPFAENSVDLLISNLALHWVNNLPGCFQQVL 62
           D    +L+QA+      V +     D E LP  ++S DL++SNLAL W ++L     ++ 
Sbjct: 65  DISAGMLAQAKTKLSENVQFICG--DAEKLPLEDSSFDLIVSNLALQWCDDLSQALSELA 122

Query: 63  KCLRQDGVFLASIFGGETLYELRSAVHLAEMERRGGVASHISPFTQIRDVGSLLTRAGFN 122
           + L+  G+   S FG  TL+ELR +              H   +  + ++ +LL +  F 
Sbjct: 123 RVLKPGGLLAFSTFGPGTLHELRQSFGQ-----------HGLRYLSLDELKALL-KNSFE 170

Query: 123 MLTIDVDEIVIHYPSMFELMWDLKGMGESNACVNRSLHFPLELQYASAAIYEKYYGKPND 182
           +LT++ + I + +    +++  LK  G +N   +          +      E        
Sbjct: 171 LLTLEEELITLSFDDPLDVLRHLKKTG-ANGLSSGRTSRKQLKAFLERYEQEF------- 222

Query: 183 DKNGGGTCVPATFQIIYLVAWK 204
                   +P T+ ++Y +A K
Sbjct: 223 ----QPDGLPLTYHVVYGIAKK 240


>gnl|CDD|219759 pfam08241, Methyltransf_11, Methyltransferase domain.  Members of
          this family are SAM dependent methyltransferases.
          Length = 92

 Score = 52.7 bits (127), Expect = 3e-09
 Identities = 13/45 (28%), Positives = 26/45 (57%)

Query: 28 DEEHLPFAENSVDLLISNLALHWVNNLPGCFQQVLKCLRQDGVFL 72
          D E LPF + S D+++S+L LH + +     +++ + L+  G  +
Sbjct: 47 DAEDLPFPDESFDVVVSSLVLHHLPDPERALREIARVLKPGGKLV 91


>gnl|CDD|222171 pfam13489, Methyltransf_23, Methyltransferase domain.  This family
           appears to be a methyltransferase domain.
          Length = 154

 Score = 44.0 bits (104), Expect = 9e-06
 Identities = 22/104 (21%), Positives = 37/104 (35%), Gaps = 6/104 (5%)

Query: 19  EVIYEKKFVDEEHLPFAENSVDLLISNLALHWVNNLPGCFQQVLKCLRQDGVFLASIFGG 78
             +      D           DL+ +   L  + + P   QQ+ + L+  GV L S    
Sbjct: 55  AAVLIFSLFDAPDPAVLAGKYDLITAFEVLEHLPDPPALLQQLRELLKPGGVLLISTPLA 114

Query: 79  ETLYELRSAVHLAEMERRGGVASHISPFTQIRDVGSLLTRAGFN 122
           +    L +  H            HIS +++   +  LL +AGF 
Sbjct: 115 DDDARLFANWHYLRPRNT-----HISFYSE-ESLKRLLEKAGFE 152


>gnl|CDD|222415 pfam13847, Methyltransf_31, Methyltransferase domain.  This family
           appears to be have methyltransferase activity.
          Length = 151

 Score = 42.8 bits (101), Expect = 2e-05
 Identities = 20/93 (21%), Positives = 40/93 (43%), Gaps = 6/93 (6%)

Query: 28  DEEHLPFAENSVDLLISNLALHWVNNLPGCFQQVLKCLRQDGVFLASIFGGETLYELRSA 87
           +   L   +NS D++ISN  L+ + +     +++++ L+  GV + S      L EL + 
Sbjct: 65  ELPQLQLEDNSFDVVISNEVLNHLPDPDKVLEEIIRVLKPGGVLIVSDPV--LLSELPAL 122

Query: 88  VHLAEMERRGGVASHISPFTQIRDVGSLLTRAG 120
           +         GV          + + ++L  AG
Sbjct: 123 L-EDLERLYAGVLEG---AIGKKKLLTILREAG 151


>gnl|CDD|182340 PRK10258, PRK10258, biotin biosynthesis protein BioC; Provisional.
          Length = 251

 Score = 43.6 bits (103), Expect = 3e-05
 Identities = 32/122 (26%), Positives = 50/122 (40%), Gaps = 9/122 (7%)

Query: 28  DEEHLPFAENSVDLLISNLALHWVNNLPGCFQQVLKCLRQDGVFLASIFGGETLYELRSA 87
           D E LP A  + DL  SNLA+ W  NL    +++ + +R  GV   +     +L EL  A
Sbjct: 93  DIESLPLATATFDLAWSNLAVQWCGNLSTALRELYRVVRPGGVVAFTTLVQGSLPELHQA 152

Query: 88  VHLAEMERRGGVASHISPFTQIRDVGSLLTRAGFNMLTIDVDEIVIHYPSMFELMWDLKG 147
                ++ R   A+   P   I    +     G+      +  I + +      M  LKG
Sbjct: 153 --WQAVDERPH-ANRFLPPDAIEQALN-----GWRY-QHHIQPITLWFDDALSAMRSLKG 203

Query: 148 MG 149
           +G
Sbjct: 204 IG 205


>gnl|CDD|237007 PRK11873, arsM, arsenite S-adenosylmethyltransferase; Reviewed.
          Length = 272

 Score = 37.6 bits (88), Expect = 0.003
 Identities = 32/118 (27%), Positives = 52/118 (44%), Gaps = 11/118 (9%)

Query: 30  EHLPFAENSVDLLISNLALHWVNNLPGCFQQVLKCLRQDGVFLAS--IFGGETLYELRSA 87
           E LP A+NSVD++ISN  ++   +    F++  + L+  G F  S  +  GE   E+R+ 
Sbjct: 138 EALPVADNSVDVIISNCVINLSPDKERVFKEAFRVLKPGGRFAISDVVLRGELPEEIRND 197

Query: 88  VHLAEMERRGGVASHISPFTQIRDVGSLLTRAGFNMLTIDVDEIVIHYPSMFELMWDL 145
             L         A  ++   Q  +  ++L  AGF  +TI         P   E + D 
Sbjct: 198 AEL--------YAGCVAGALQEEEYLAMLAEAGFVDITIQPKR-EYRIPDAREFLEDW 246


>gnl|CDD|100107 cd02440, AdoMet_MTases, S-adenosylmethionine-dependent
           methyltransferases (SAM or AdoMet-MTase), class I;
           AdoMet-MTases are enzymes that use
           S-adenosyl-L-methionine (SAM or AdoMet) as a substrate
           for methyltransfer, creating the product
           S-adenosyl-L-homocysteine (AdoHcy). There are at least
           five structurally distinct families of AdoMet-MTases,
           class I being the largest and most diverse. Within this
           class enzymes can be classified by different substrate
           specificities (small molecules, lipids, nucleic acids,
           etc.) and different target atoms for methylation
           (nitrogen, oxygen, carbon, sulfur, etc.).
          Length = 107

 Score = 33.6 bits (77), Expect = 0.021
 Identities = 14/49 (28%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 29  EEHLPFAENSVDLLISNLALHWVNNLPGCF-QQVLKCLRQDGVFLASIF 76
           EE  P A+ S D++IS+  LH +      F ++  + L+  GV + ++ 
Sbjct: 57  EELPPEADESFDVIISDPPLHHLVEDLARFLEEARRLLKPGGVLVLTLV 105


>gnl|CDD|225136 COG2226, UbiE, Methylase involved in ubiquinone/menaquinone
           biosynthesis [Coenzyme metabolism].
          Length = 238

 Score = 33.4 bits (77), Expect = 0.078
 Identities = 20/72 (27%), Positives = 35/72 (48%), Gaps = 2/72 (2%)

Query: 3   DNCESILSQAEPPEDTEVIYEKKFV--DEEHLPFAENSVDLLISNLALHWVNNLPGCFQQ 60
           D  ES+L  A      + +   +FV  D E+LPF +NS D +  +  L  V ++    ++
Sbjct: 82  DISESMLEVAREKLKKKGVQNVEFVVGDAENLPFPDNSFDAVTISFGLRNVTDIDKALKE 141

Query: 61  VLKCLRQDGVFL 72
           + + L+  G  L
Sbjct: 142 MYRVLKPGGRLL 153


>gnl|CDD|219760 pfam08242, Methyltransf_12, Methyltransferase domain.  Members of
          this family are SAM dependent methyltransferases.
          Length = 98

 Score = 30.8 bits (70), Expect = 0.15
 Identities = 7/47 (14%), Positives = 21/47 (44%)

Query: 25 KFVDEEHLPFAENSVDLLISNLALHWVNNLPGCFQQVLKCLRQDGVF 71
          +    + +     S D+++++  LH + +     + + + L+  GV 
Sbjct: 52 RLDVLDAIDLDPGSFDVVVASNVLHHLADPRAVLRNLRRLLKPGGVL 98


>gnl|CDD|235607 PRK05785, PRK05785, hypothetical protein; Provisional.
          Length = 226

 Score = 30.4 bits (69), Expect = 0.56
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 30  EHLPFAENSVDLLISNLALHWVNNL 54
           E LPF + S D+++S+ ALH  +N+
Sbjct: 102 EALPFRDKSFDVVMSSFALHASDNI 126


>gnl|CDD|226591 COG4106, Tam, Trans-aconitate methyltransferase [General function
           prediction only].
          Length = 257

 Score = 29.7 bits (67), Expect = 1.3
 Identities = 11/36 (30%), Positives = 17/36 (47%)

Query: 35  AENSVDLLISNLALHWVNNLPGCFQQVLKCLRQDGV 70
            E   DLL +N  L W+ + P    +++  L   GV
Sbjct: 89  PEQPTDLLFANAVLQWLPDHPELLPRLVSQLAPGGV 124


>gnl|CDD|223574 COG0500, SmtA, SAM-dependent methyltransferases [Secondary
           metabolites biosynthesis, transport, and catabolism /
           General function prediction only].
          Length = 257

 Score = 29.5 bits (63), Expect = 1.6
 Identities = 20/69 (28%), Positives = 31/69 (44%), Gaps = 2/69 (2%)

Query: 28  DEEHLPFAE-NSVDLLISNLALHWVNNLPGCFQQVLKCLRQDGVFLASIFGGETLYELRS 86
               LPF +  S DL+IS L LH +       +++L+ L+  G  + S    + L E R 
Sbjct: 108 LGGVLPFEDSASFDLVISLLVLHLLPPAKA-LRELLRVLKPGGRLVLSDLLRDGLLEGRL 166

Query: 87  AVHLAEMER 95
           A  L   + 
Sbjct: 167 AALLGFGDP 175


>gnl|CDD|238586 cd01181, IPT_plexin_repeat3, Third repeat of the IPT domain of
           Plexins and Cell Surface Receptors (PCSR) . Plexins are
           involved in the regulation of cell proliferation and of
           cellular adhesion and repulsion receptors. In general,
           there are three copies of the IPT domain present
           preceeded by SEMA (semaphorin) and PSI (plexin,
           semaphorin, integrin) domains.
          Length = 99

 Score = 27.8 bits (62), Expect = 2.0
 Identities = 9/38 (23%), Positives = 16/38 (42%)

Query: 103 ISPFTQIRDVGSLLTRAGFNMLTIDVDEIVIHYPSMFE 140
           I P       G+ +T  G N+ T+    I + Y  + +
Sbjct: 6   IEPEWSFLSGGTPITVTGTNLNTVQEPRIRVKYGGVEK 43


>gnl|CDD|219802 pfam08341, Fb_signal, Fibronectin-binding protein signal sequence. 
           This domain is found near the N-terminus of
           fibronectin-binding proteins in Streptococcus where it
           functions as a signal sequence.
          Length = 69

 Score = 26.1 bits (58), Expect = 3.9
 Identities = 8/26 (30%), Positives = 11/26 (42%)

Query: 154 CVNRSLHFPLELQYASAAIYEKYYGK 179
           C N   HFP + +      Y+K  G 
Sbjct: 32  CFNLKKHFPPKSESVGKNWYKKIDGS 57


>gnl|CDD|213282 cd05914, FACL_like_3, Uncharacterized subfamily of fatty acid CoA
          ligase (FACL).  Fatty acyl-CoA ligases catalyze the
          ATP-dependent activation of fatty acids in a two-step
          reaction. The carboxylate substrate first reacts with
          ATP to form an acyl-adenylate intermediate, which then
          reacts with CoA to produce an acyl-CoA ester. This is a
          required step before free fatty acids can participate
          in most catabolic and anabolic reactions.
          Length = 448

 Score = 28.0 bits (63), Expect = 4.9
 Identities = 12/42 (28%), Positives = 18/42 (42%), Gaps = 6/42 (14%)

Query: 35 AENSVDLLISNLALHWVN----NLPGCF--QQVLKCLRQDGV 70
           +NS+D +I++LA          +P  F  QQ    L   G 
Sbjct: 31 LDNSIDWVIADLACLQAGIVCIPIPHFFSAQQTQHLLNDAGA 72


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.138    0.419 

Gapped
Lambda     K      H
   0.267   0.0668    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,764,203
Number of extensions: 1215048
Number of successful extensions: 976
Number of sequences better than 10.0: 1
Number of HSP's gapped: 973
Number of HSP's successfully gapped: 17
Length of query: 245
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 151
Effective length of database: 6,768,326
Effective search space: 1022017226
Effective search space used: 1022017226
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 58 (26.2 bits)