RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10120
         (510 letters)



>gnl|CDD|216059 pfam00685, Sulfotransfer_1, Sulfotransferase domain. 
          Length = 254

 Score =  217 bits (555), Expect = 2e-67
 Identities = 95/294 (32%), Positives = 141/294 (47%), Gaps = 41/294 (13%)

Query: 173 PDDVWLVSFPRTGSTWAQEMVWLLGHDLAATQIVYVA-RNPKDVLTALVANDPGDWKNEI 231
            DDV +V++P++G+TW QE++ L+ +     +       NP +    L   D      ++
Sbjct: 1   DDDVLIVTYPKSGTTWLQEILSLIPNRGDFEKFEEPHLFNPHNRSPFLEWYDLFV-IFDV 59

Query: 232 PNSVDYVQTLASPRFIKTHLPYSLLPKQIATVKPKVIEGMIEGTNIVGRPPLDYIKQIMR 291
                 +  L SPR IKTHLP  LLPK +     K+I                       
Sbjct: 60  AEGPVRLNALPSPRIIKTHLPLHLLPKSLWDPNAKII----------------------- 96

Query: 292 DMNVVYVARNPKDVCVSYYHYCKLIHHLQDPEGKYFDDFCELFLQGNAPMGPICPHMLEF 351
                Y+ RNPKDV VSYYH+ ++   L+ P      +F E FL G    G    H+  +
Sbjct: 97  -----YLVRNPKDVAVSYYHFFRMAKILKAPG--TPFEFVEDFLNGKVNCGSYFDHVKGW 149

Query: 352 WNKRTEDNILFLKYEDMKKDQKGAILQTAQFLGKQISDDNIAALIDHLSFNKMRDNPATN 411
           W  R   NILFL+YED+KKD +G I + A+FLG  ++++ +  ++ HLSF  M+ NP  N
Sbjct: 150 WKLRPPGNILFLRYEDLKKDPRGEIKKIAEFLGIPLTEEELDKIVKHLSFENMKGNPCLN 209

Query: 412 LEPILQKMDKPAEKRNSEDTFLRKGVVGDYKNQMSPELIRKFDDFVSEGLAGSG 465
                         ++    F RKG+VGD+KN  + E   KFD+   E + G+G
Sbjct: 210 ---------YSKLPKHEVSPFFRKGLVGDWKNYFTVEQAEKFDEIYQEKMKGTG 254


>gnl|CDD|177822 PLN02164, PLN02164, sulfotransferase.
          Length = 346

 Score =  107 bits (268), Expect = 1e-25
 Identities = 90/350 (25%), Positives = 160/350 (45%), Gaps = 74/350 (21%)

Query: 136 RPDDVWLVSFPRTGSTWAQEMVWLLGHDLDYEGAKVRPDDVWLVSFPRTGSTWAQEMVWL 195
           RP +  L+ +   G  W Q ++  L H  ++   + RP+D  + S+P+TG+TW + + + 
Sbjct: 47  RPKEP-LIEY--GGHWWLQPLLEGLLHAQEF--FQARPNDFLVCSYPKTGTTWLKALTFA 101

Query: 196 LGH----DLAATQIVYVARNPKDVLTALVANDPGDWKNEIP--NSVDYVQTLASPRFIKT 249
           + +    D ++  +  + RNP + +  +        + + P   SVD ++   +  F  T
Sbjct: 102 IANRSRFDDSSNPL--LKRNPHEFVPYI--------EIDFPFFPSVDVLKDKGNTLF-ST 150

Query: 250 HLPYSLLPKQIATVKPKVIEGMIEGTNIVGRPPLDYIKQIMRDMNVVYVARNPKDVCVSY 309
           H+PY LLP  +     K                            +VY+ R+PKD  +S 
Sbjct: 151 HIPYGLLPDSVVKSGCK----------------------------MVYIWRDPKDTFISM 182

Query: 310 YHYCKLIHHLQDPEG------KYFDDFCELFLQGNAPMGPICPHMLEFWNKRTE--DNIL 361
           + +   +H  +  +G      + FD FC    +G +  GP   H+L +W    E  D IL
Sbjct: 183 WTF---LHKERSQQGPLNSLEESFDMFC----RGLSVYGPYLDHVLGYWKAYQENPDRIL 235

Query: 362 FLKYEDMKKDQKGAILQTAQFLGKQISDDN-----IAALIDHLSFNKMRDNPATNLEPIL 416
           FLKYE M+ D    + + A+F+G   + +      +  ++   SF  +++  A   E   
Sbjct: 236 FLKYETMRADPLPYVKRLAEFMGYGFTAEEEEKGVVEKVVKLCSFETLKNLEANKGEKD- 294

Query: 417 QKMDKPAEKRNSEDTFLRKGVVGDYKNQMSPELIRKFDDFVSEGLAGSGL 466
            + D+PA   NS   + RKG VGD++N ++PE+  + D  + E   G+GL
Sbjct: 295 -REDRPAVYANS--AYFRKGKVGDWQNYLTPEMAARIDGLMEEKFKGTGL 341


>gnl|CDD|131369 TIGR02316, propion_prpE, propionate--CoA ligase.  This family
           contains one of three readily separable clades of
           proteins in the group of acetate and propionate--CoA
           ligases. Characterized members of this family act on
           propionate. From propionyl-CoA, there is a cyclic
           degradation pathway: it is ligated by PrpC to the TCA
           cycle intermediate oxaloacetate, acted upon further by
           PrpD and an aconitase, then cleaved by PrpB to pyruvate
           and the TCA cycle intermediate succinate.
          Length = 628

 Score = 31.4 bits (71), Expect = 1.3
 Identities = 27/107 (25%), Positives = 40/107 (37%), Gaps = 28/107 (26%)

Query: 140 VWLVSFPRTGSTW--AQEMVWLLGHDLD-------------YEGAKVRPD-DVWL----- 178
           +W +   R G     A ++ W++GH                YEG    PD  VW      
Sbjct: 265 MWAIFGIRAGQVMFSASDVGWVVGHSYIVYAPLLAGAATVLYEGLPTNPDPGVWWSIVER 324

Query: 179 --VSFPRTGSTW-----AQEMVWLLGHDLAATQIVYVARNPKDVLTA 218
             V    +  T       Q+  WL  HDL++   +++A  P D  TA
Sbjct: 325 YGVRTMFSAPTAIRVLKKQDAAWLRKHDLSSLHWLFLAGEPLDEPTA 371


>gnl|CDD|219797 pfam08326, ACC_central, Acetyl-CoA carboxylase, central region.
           The region featured in this family is found in various
           eukaryotic acetyl-CoA carboxylases, N-terminal to the
           catalytic domain (pfam01039). This enzyme (EC:6.4.1.2)
           is involved in the synthesis of long-chain fatty acids,
           as it catalyzes the rate-limiting step in this process.
          Length = 707

 Score = 30.7 bits (70), Expect = 1.8
 Identities = 18/73 (24%), Positives = 30/73 (41%), Gaps = 11/73 (15%)

Query: 351 FWNKRTEDNILFLKYE---DMKKDQKGAILQTAQFLGKQISDDN--IAALIDHLSFNKMR 405
           F N R ED IL L+ E   D+ K     ++        ++   N  + AL+D L +  + 
Sbjct: 169 FSNGREEDVILKLRDENKDDLSK-----VVDIV-LSHSRVKAKNKLVLALLDQLVYPLLP 222

Query: 406 DNPATNLEPILQK 418
                +L   L +
Sbjct: 223 STVPASLRDALSR 235


>gnl|CDD|223117 COG0039, Mdh, Malate/lactate dehydrogenases [Energy production and
           conversion].
          Length = 313

 Score = 30.6 bits (70), Expect = 2.0
 Identities = 24/100 (24%), Positives = 35/100 (35%), Gaps = 28/100 (28%)

Query: 205 IVYVARNPKDVLTALVANDPGDWKNEIPNSVDYVQTLASPRFIKTHLPYSLLPKQIAT-- 262
           IV V  NP D+LT +     G  KN +  S      L S RF             +A   
Sbjct: 113 IVLVVTNPVDILTYIAMKFSGFPKNRVIGSGT---VLDSARFR----------TFLAEKL 159

Query: 263 -VKPKVIEGMIEG------------TNIVGRPPLDYIKQI 289
            V PK +   + G              + G+P  + +K+ 
Sbjct: 160 GVSPKDVHAYVIGEHGDTMVPLWSQATVGGKPLEELLKED 199


>gnl|CDD|236758 PRK10770, PRK10770, peptidyl-prolyl cis-trans isomerase SurA;
           Provisional.
          Length = 413

 Score = 30.5 bits (69), Expect = 2.0
 Identities = 23/68 (33%), Positives = 36/68 (52%), Gaps = 16/68 (23%)

Query: 376 ILQTAQFLGKQISDD-------NIAALIDHLSFNKMRDNPA-------TNLEPILQKMDK 421
           ILQ AQ +G +ISD+       NIAA  ++++ ++MR   A       T    I ++M  
Sbjct: 62  ILQMAQKMGVKISDEQLDQAIANIAAQ-NNMTLDQMRSRLAYDGLNYNTYRNQIRKEM-I 119

Query: 422 PAEKRNSE 429
            +E RN+E
Sbjct: 120 ISEVRNNE 127


>gnl|CDD|239298 cd03000, PDI_a_TMX3, PDIa family, TMX3 subfamily; composed of
           eukaryotic proteins similar to human TMX3, a TRX related
           transmembrane protein containing one redox active TRX
           domain at the N-terminus and a classical ER retrieval
           sequence for type I transmembrane proteins at the
           C-terminus. The TMX3 transcript is found in a variety of
           tissues with the highest levels detected in skeletal
           muscle and the heart. In vitro, TMX3 showed oxidase
           activity albeit slightly lower than that of protein
           disulfide isomerase.
          Length = 104

 Score = 29.0 bits (65), Expect = 2.0
 Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 3/42 (7%)

Query: 134 EVRPDDVWLVSF--PRTGSTWAQEMVW-LLGHDLDYEGAKVR 172
           +VR +D+WLV F  P  G     E VW  +G +L   G+ VR
Sbjct: 11  DVRKEDIWLVDFYAPWCGHCKKLEPVWNEVGAELKSSGSPVR 52


>gnl|CDD|179177 PRK00945, PRK00945, acetyl-CoA decarbonylase/synthase complex
           subunit epsilon; Provisional.
          Length = 171

 Score = 29.2 bits (66), Expect = 3.2
 Identities = 23/102 (22%), Positives = 36/102 (35%), Gaps = 17/102 (16%)

Query: 244 PRFIKTHLPYSLLPKQIATVKPKVIEGMIE---------GTNIVGRPPL-DYIKQIMRDM 293
            R           PK    V PK+   MI+         G+ ++    L D   +I +  
Sbjct: 3   DRTKPYQPTAISGPKHAKIVSPKIAAMMIKKAKRPLLVVGSLLLDDEELLDRAVKIAKKA 62

Query: 294 NVVYVAR-------NPKDVCVSYYHYCKLIHHLQDPEGKYFD 328
           N+   A          K V   Y +  +L ++L+DP  K  D
Sbjct: 63  NIPVAATGGSYKGLIDKGVDAKYINLHELTNYLKDPNWKGLD 104


>gnl|CDD|218800 pfam05895, DUF859, Siphovirus protein of unknown function (DUF859).
            This family consists of several uncharacterized
           proteins from the Siphoviruses as well as one bacterial
           sequence. Some of the members of this family are
           described as putative minor structural proteins.
          Length = 534

 Score = 29.7 bits (67), Expect = 3.9
 Identities = 27/95 (28%), Positives = 44/95 (46%), Gaps = 14/95 (14%)

Query: 213 KDVLTALVANDPGDWKNEIPNSVD-----YVQTLASPRFIKTH-LPYSL-LPKQIATVKP 265
            +V T++    P D  N+IPNS       YV T     FI T    ++L +P    +VKP
Sbjct: 169 SNVGTSVTWTPPLDLANDIPNSTSGTGTIYVDTYNGGTFIGTQSNTFTLNVPD---SVKP 225

Query: 266 KVIEGMIEGTNIVGR---PPLDYIKQIMRDMNVVY 297
            +    +  TN V +      +++ QIM ++ V +
Sbjct: 226 TLSGISLSDTNTVAQNIVTGNNFV-QIMSNIKVTF 259


>gnl|CDD|176503 cd08560, GDPD_EcGlpQ_like_1, Glycerophosphodiester
           phosphodiesterase domain similar to Escherichia coli
           periplasmic phosphodiesterase (GlpQ) include
           uncharacterized proteins.  This subfamily corresponds to
           the glycerophosphodiester phosphodiesterase domain
           (GDPD) present in a group of uncharacterized
           glycerophosphodiester phosphodiesterases (GP-GDE, EC
           3.1.4.46) and their hypothetical homologs. Members in
           this subfamily show high sequence similarity to
           Escherichia coli periplasmic phosphodiesterase GlpQ,
           which catalyzes the Ca2+-dependent degradation of
           periplasmic glycerophosphodiesters to produce
           sn-glycerol-3-phosphate (G3P) and the corresponding
           alcohols.
          Length = 356

 Score = 29.7 bits (67), Expect = 4.1
 Identities = 13/34 (38%), Positives = 18/34 (52%), Gaps = 6/34 (17%)

Query: 148 TGSTWAQEMVWLLGHDLDYEGAKVRPDDVWLVSF 181
           T   +AQ+M+     D +Y+ A V P  VW  SF
Sbjct: 185 TQEDYAQQMI-----D-EYKEAGVPPSRVWPQSF 212


>gnl|CDD|221720 pfam12697, Abhydrolase_6, Alpha/beta hydrolase family.  This family
           contains alpha/beta hydrolase enzymes of diverse
           specificity.
          Length = 187

 Score = 28.9 bits (65), Expect = 4.5
 Identities = 13/65 (20%), Positives = 22/65 (33%), Gaps = 2/65 (3%)

Query: 193 VWLLGHDLAATQIVYVARNPKDVLTALVANDPG--DWKNEIPNSVDYVQTLASPRFIKTH 250
           V L+GH L     +  A    + +  LV   P   D +  +      +  L     +   
Sbjct: 67  VVLVGHSLGGAVALAAAARRPERVAGLVLISPPLRDLEELLAADAAALLALLRAALLDAD 126

Query: 251 LPYSL 255
           L  +L
Sbjct: 127 LREAL 131


>gnl|CDD|223363 COG0286, HsdM, Type I restriction-modification system
           methyltransferase subunit [Defense mechanisms].
          Length = 489

 Score = 29.3 bits (66), Expect = 4.9
 Identities = 17/96 (17%), Positives = 31/96 (32%), Gaps = 2/96 (2%)

Query: 360 ILFLKYEDMKKDQKGAILQTAQFLGKQISDDNIAALIDHLSFNKMRDNPATNLEPILQKM 419
           +LFLKY   K + +           K         + +   F+ ++ N   NL   L   
Sbjct: 34  LLFLKYLSDKFELEFEAELEEDNEAKYAYPAKGFFIPERYRFDDLKKNAEENLGDFLDNA 93

Query: 420 DKPAEK--RNSEDTFLRKGVVGDYKNQMSPELIRKF 453
            +  E+   + +  F         K     +L+ K 
Sbjct: 94  LRKIEEKNPDLKGVFADLDFNDALKLGSLLKLLNKV 129


>gnl|CDD|216227 pfam00982, Glyco_transf_20, Glycosyltransferase family 20.  Members
           of this family belong to glycosyl transferase family 20.
           OtsA (Trehalose-6-phosphate synthase) is homologous to
           regions in the subunits of yeast trehalose-6-phosphate
           synthase/phosphate complex.
          Length = 470

 Score = 29.2 bits (66), Expect = 6.5
 Identities = 11/63 (17%), Positives = 20/63 (31%), Gaps = 7/63 (11%)

Query: 274 GTNIVGRPPLDYIKQIMRDMNVVYVARNPKDVCVSYYH-YCK-----LIH-HLQDPEGKY 326
           G  +    P D + Q++++           +    YY+ +       L H  L       
Sbjct: 53  GVPVDEDEPKDRVSQLLKEKFTCVPVFLDDEDVDEYYNGFSNSILWPLFHYRLPPNNEDE 112

Query: 327 FDD 329
           FD 
Sbjct: 113 FDR 115


>gnl|CDD|225702 COG3160, Rsd, Regulator of sigma D [Transcription].
          Length = 162

 Score = 27.9 bits (62), Expect = 8.2
 Identities = 8/37 (21%), Positives = 20/37 (54%)

Query: 76  YNMETKEPQYVTYEKLESECGDKLDSMFGLKDCLLRV 112
            N        + +++  S+ G+ L++ F L+D L+++
Sbjct: 113 LNTAIDHDNCLEFQQALSDIGEALEARFVLEDQLIQL 149


>gnl|CDD|198428 cd10030, UDG_F4_TTUDGA_like, Family 4 Uracil-DNA glycosylase (UDG),
           found exclusively in thermophilic organisms.  The
           enzymes of Family 4 Uracil-DNA glycosylase (UDG), found
           only in thermophilic organisms, are thermostable
           enzymes. Uracil-DNA glycosylases (UDGs) are DNA repair
           enzymes that catalyze the removal of mismatched uracil
           from DNA to initiate DNA base excision repair pathway.
           The Thermus thermophilus enzyme TTUDGA removes uracil
           from both, ssDNA and dsDNA, but not thymine from a G:T
           mismatch. These details suggest that the mechanism by
           which Family 4 UDGs remove uracils from DNA is similar
           to that of Family 1 enzymes. The thermostability of the
           enzyme may be linked to the presence of an iron-sulfur
           cluster, salt-bridges and ion pairs on the molecular
           surface as well as prolines on loops and turns, as
           commonly found in the Family 4 enzymes. Uracil in DNA
           can arise as a result of mis-incorporation of dUMP
           residues by DNA polymerase or deamination of cytosine.
           Uracil mispaired with guanine in DNA is one of the major
           pro-mutagenic events, causing G:C->A:T mutations.
          Length = 164

 Score = 27.8 bits (63), Expect = 9.2
 Identities = 8/22 (36%), Positives = 13/22 (59%), Gaps = 2/22 (9%)

Query: 247 IKTHLPYSLLPKQIATVKPKVI 268
           I    P+  L +QI  ++PK+I
Sbjct: 85  IAACRPF--LERQIELIRPKII 104


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.138    0.421 

Gapped
Lambda     K      H
   0.267   0.0673    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,885,414
Number of extensions: 2681482
Number of successful extensions: 2151
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2140
Number of HSP's successfully gapped: 24
Length of query: 510
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 409
Effective length of database: 6,457,848
Effective search space: 2641259832
Effective search space used: 2641259832
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.4 bits)