RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4771
         (337 letters)



>gnl|CDD|111929 pfam03088, Str_synth, Strictosidine synthase.  Strictosidine
           synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid
           biosynthesis. It catalyzes the condensation of
           tryptamine with secologanin to form strictosidine.
          Length = 89

 Score = 95.9 bits (239), Expect = 6e-25
 Identities = 37/89 (41%), Positives = 62/89 (69%), Gaps = 4/89 (4%)

Query: 195 NSVTVDSD-GMVYWSDSSTKY-KSGLFEGLTSG--SGRLIQYNPKTKKNKVLIANLHFAN 250
           N++ VD + G++Y++DSS++Y +  +   +  G  +GRL++Y+P TK  KVL+ +L+F N
Sbjct: 1   NALDVDPETGVLYFTDSSSRYDRRQVIFAMLEGDKTGRLMKYDPSTKVTKVLLKDLYFPN 60

Query: 251 GVELSADESFVIVAETMSSRVHRYYLKGP 279
           G+ LS D SFV+  ET   R+ +Y++KGP
Sbjct: 61  GIALSPDGSFVLFCETPMKRISKYWIKGP 89


>gnl|CDD|225921 COG3386, COG3386, Gluconolactonase [Carbohydrate transport and
           metabolism].
          Length = 307

 Score = 81.7 bits (202), Expect = 1e-17
 Identities = 45/147 (30%), Positives = 63/147 (42%), Gaps = 12/147 (8%)

Query: 187 DGAKPQIPNSVTVDSDGMVYWSDSSTKYKSGLFEGLTSGSGRLIQYNPKTKKNKVLIANL 246
           DG     PN   VD DG   W      +  G  E     +G L + +P     ++L  +L
Sbjct: 106 DGLPLNRPNDGVVDPDG-RIWFGDMGYFDLGKSEE--RPTGSLYRVDPDGGVVRLLDDDL 162

Query: 247 HFANGVELSADESFVIVAETMSSRVHRYYL---KGPKQGK-SEVFIDGLPGLPDNVKRDS 302
              NG+  S D   + VA+T ++R+HRY L    GP  G+   V  D  PGLPD +  D+
Sbjct: 163 TIPNGLAFSPDGKTLYVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMAVDA 222

Query: 303 KGNFLVSLVCPVEQPSEQAGNIRDPAG 329
            GN  V+ V              +P G
Sbjct: 223 DGNLWVAAV-----WGGGRVVRFNPDG 244



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 24/114 (21%), Positives = 37/114 (32%), Gaps = 24/114 (21%)

Query: 197 VTVDSDGMVYWSDSSTKYKSGLFEGLTSGSGRLIQYNPKTKKNKVLIANLHFANGVELSA 256
           V     G + W D                 GR+ + +P+T K +V  +   F++G  + A
Sbjct: 31  VWDPDRGALLWVDI--------------LGGRIHRLDPETGKKRVFPSPGGFSSGALIDA 76

Query: 257 DESFVIVAETMSSRVHRYYLKGPKQGKSEVFIDGLPGLPDN----VKRDSKGNF 306
               +I  E     V          GK  +  +   GLP N       D  G  
Sbjct: 77  GG-RLIACEH---GVRLLDPD--TGGKITLLAEPEDGLPLNRPNDGVVDPDGRI 124


>gnl|CDD|219847 pfam08450, SGL, SMP-30/Gluconolaconase/LRE-like region.  This
           family describes a region that is found in proteins
           expressed by a variety of eukaryotic and prokaryotic
           species. These proteins include various enzymes, such as
           senescence marker protein 30 (SMP-30), gluconolactonase
           and luciferin-regenerating enzyme (LRE). SMP-30 is known
           to hydrolyse diisopropyl phosphorofluoridate in the
           liver, and has been noted as having sequence similarity,
           in the region described in this family, with PON1 and
           LRE.
          Length = 245

 Score = 44.9 bits (107), Expect = 2e-05
 Identities = 33/141 (23%), Positives = 56/141 (39%), Gaps = 16/141 (11%)

Query: 170 DGQTGQTEQLISMDTEIDGAKPQI-PNSVTVDSDGMVYWSDSSTKYKSGLFEGLTSGSGR 228
           D  TG+   L     +++  +P    N   VD DG  ++         G         G 
Sbjct: 66  DLDTGELTTL----ADLEPDEPLNRFNDGKVDPDGRFWFGTM------GFDIAPGGEPGA 115

Query: 229 LIQYNPKTKKNKVLIANLHFANGVELSADESFVIVAETMSSRVHRY-Y-LKGPKQGKSEV 286
           L + +P  K  +V +  +  +NG+  S D   +  A++ + R+  + Y   G       V
Sbjct: 116 LYRLDPDGKVERV-LDGITISNGLAWSPDGKTLYFADSPTRRIWAFDYDADGGLISNRRV 174

Query: 287 FIDG--LPGLPDNVKRDSKGN 305
           F D     G PD +  D++GN
Sbjct: 175 FADFKDGDGEPDGMAVDAEGN 195


>gnl|CDD|190082 pfam01731, Arylesterase, Arylesterase.  This family consists of
           arylesterases (Also known as serum paraoxonase)
           EC:3.1.1.2. These enzymes hydrolyse organophosphorus
           esters such as paraoxon and are found in the liver and
           blood. They confer resistance to organophosphate
           toxicity. Human arylesterase (PON1) is associated with
           HDL and may protect against LDL oxidation.
          Length = 86

 Score = 35.5 bits (82), Expect = 0.004
 Identities = 15/57 (26%), Positives = 28/57 (49%), Gaps = 2/57 (3%)

Query: 218 LFEGLTSGSGRLIQYNPKTKKNKVLIANLHFANGVELSADESFVIVAETMSSRVHRY 274
           L   L      ++ Y+P   + KV+ +   FANG+  S D+ ++ VA ++   +H  
Sbjct: 27  LEMYLGLIWTGVVYYSPS--EVKVVASGFSFANGINYSPDKKYIYVASSLRHSIHVM 81


>gnl|CDD|235557 PRK05682, flgE, flagellar hook protein FlgE; Validated.
          Length = 407

 Score = 37.9 bits (89), Expect = 0.005
 Identities = 26/99 (26%), Positives = 35/99 (35%), Gaps = 17/99 (17%)

Query: 117 IKFDKNGALHVADAYFGLYKVNVTTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGQTGQT 176
           + FD NGAL              TT  T   I+          Q   ++T+D  G T   
Sbjct: 231 LTFDSNGAL------------TTTTSPTTFNITGGATNGADGGQ---TITLDLSGSTQYA 275

Query: 177 EQLISMDTEIDGAKPQIPNSVTVDSDGMVY--WSDSSTK 213
                  T  DG  P      ++D DG V   +S+  TK
Sbjct: 276 SGFTVSATTQDGYAPGTLTGYSIDDDGTVTAVYSNGQTK 314


>gnl|CDD|225507 COG2959, HemX, Uncharacterized enzyme of heme biosynthesis
           [Coenzyme metabolism].
          Length = 391

 Score = 32.9 bits (75), Expect = 0.24
 Identities = 12/74 (16%), Positives = 23/74 (31%), Gaps = 1/74 (1%)

Query: 172 QTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGM-VYWSDSSTKYKSGLFEGLTSGSGRLI 230
           + G   +L  +  ++D             S    V  S S + +   L +   S     I
Sbjct: 197 RDGIALRLDQLSNQVDNLPLLDNAFDGAPSPANNVKLSASISDWWQNLEKSSRSFLDNFI 256

Query: 231 QYNPKTKKNKVLIA 244
               +   +  L+A
Sbjct: 257 TIRRRDDNDAPLLA 270


>gnl|CDD|214531 smart00135, LY, Low-density lipoprotein-receptor YWTD domain.  Type
           "B" repeats in low-density lipoprotein (LDL) receptor
           that plays a central role in mammalian cholesterol
           metabolism. Also present in a variety of molecules
           similar to gp300/megalin.
          Length = 43

 Score = 27.6 bits (62), Expect = 1.1
 Identities = 8/22 (36%), Positives = 12/22 (54%), Gaps = 1/22 (4%)

Query: 194 PNSVTVD-SDGMVYWSDSSTKY 214
           PN + VD  +G +YW+D     
Sbjct: 11  PNGLAVDWIEGRLYWTDWGLDV 32


>gnl|CDD|148813 pfam07419, PilM, PilM.  This family contains the bacterial protein
           PilM (approximately 150 residues long). PilM is an inner
           membrane protein that has been predicted to function as
           a component of the pilin transport apparatus and
           thin-pilus basal body.
          Length = 136

 Score = 29.2 bits (66), Expect = 1.6
 Identities = 16/63 (25%), Positives = 25/63 (39%), Gaps = 12/63 (19%)

Query: 269 SRVHRYYLKGPKQGKSEVFIDGLPGLPDNVKRDSKGNFLVSLVCPVEQPSEQAGNIRDPA 328
           SR+          G+  V+I   PGL   ++  S G+ LV  V        + G + D +
Sbjct: 68  SRISHVI----SAGRLYVWIPEQPGLVSALREQSGGSALVGRV--------RNGRLLDLS 115

Query: 329 GAQ 331
           G  
Sbjct: 116 GGD 118


>gnl|CDD|215530 PLN02980, PLN02980, 2-oxoglutarate decarboxylase/ hydro-lyase/
            magnesium ion binding  / thiamin pyrophosphate binding.
          Length = 1655

 Score = 29.8 bits (67), Expect = 2.5
 Identities = 29/110 (26%), Positives = 42/110 (38%), Gaps = 24/110 (21%)

Query: 224  SGSGRLIQYN-PKTKKNKVLI----ANLHFANGVELSADESF-----------VIVAETM 267
            SGS R I  + P    +K+              VEL AD  +            +V  +M
Sbjct: 1395 SGSARCISIDLPGHGGSKIQNHAKETQTEPTLSVELVADLLYKLIEHITPGKVTLVGYSM 1454

Query: 268  SSRVHRYY-LKGPKQGKSEVFIDGLPGLPDNVKR-------DSKGNFLVS 309
             +R+  Y  L+   + +  V I G PGL D V R       DS+   L+ 
Sbjct: 1455 GARIALYMALRFSDKIEGAVIISGSPGLKDEVARKIRSAKDDSRARMLID 1504


>gnl|CDD|223020 PHA03246, PHA03246, large tegument protein UL36; Provisional.
          Length = 3095

 Score = 29.2 bits (65), Expect = 3.8
 Identities = 15/52 (28%), Positives = 28/52 (53%), Gaps = 1/52 (1%)

Query: 152  TEIDGAKPQIPNSVTVDSDGQTGQTEQLISMDTEIDGAKPQIPNSVTVDSDG 203
            TE +  + +  N V VD   +  ++E L S D++ D ++  + NS + D+D 
Sbjct: 2978 TEAESGENKFNN-VNVDPIRKRSESESLSSTDSDSDDSRSTVYNSNSTDTDM 3028


>gnl|CDD|235253 PRK04201, PRK04201, zinc transporter ZupT; Provisional.
          Length = 265

 Score = 28.7 bits (65), Expect = 4.1
 Identities = 18/44 (40%), Positives = 23/44 (52%), Gaps = 2/44 (4%)

Query: 218 LFEGLTSGSGRLIQYNPKTKKNKVLIANLHFANGVELSADESFV 261
           L  GL +G G LI +  K   N+ L  +L FA GV L    SF+
Sbjct: 12  LLAGLATGIGSLIAFFGKKPNNRFLSFSLGFAAGVMLYV--SFM 53


>gnl|CDD|215175 PLN02309, PLN02309, 5'-adenylylsulfate reductase.
          Length = 457

 Score = 29.0 bits (65), Expect = 4.4
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query: 219 FEGLTSGSGRLIQYNP 234
           FEGL  G G L+++NP
Sbjct: 239 FEGLDGGPGSLVKWNP 254


>gnl|CDD|117609 pfam09043, Lys-AminoMut_A, D-Lysine 5,6-aminomutase alpha subunit. 
           Members of his family are involved in the 1,2
           rearrangement of the terminal amino group of DL-lysine
           and of L-beta-lysine, using adenosylcobalamin (AdoCbl)
           and pyridoxal-5'-phosphate as cofactors. The structure
           is predominantly a PLP-binding TIM barrel domain, with
           several additional alpha-helices and beta-strands at the
           N and C termini. These helices and strands form an
           intertwined accessory clamp structure that wraps around
           the sides of the TIM barrel and extends up toward the
           Ado ligand of the Cbl cofactor, providing most of the
           interactions observed between the protein and the Ado
           ligand of the Cbl, suggesting that its role is mainly in
           stabilising AdoCbl in the precatalytic resting state.
          Length = 509

 Score = 28.9 bits (65), Expect = 4.4
 Identities = 13/28 (46%), Positives = 16/28 (57%), Gaps = 2/28 (7%)

Query: 126 HVADAYFGLYKVNVTTGQTEQLISMDTE 153
           HV DA F +  V + TGQ   L+ M TE
Sbjct: 382 HVQDALFNM--VTIMTGQGIHLLGMLTE 407


>gnl|CDD|233978 TIGR02701, shell_carb_anhy, carboxysome shell carbonic anhydrase.
           This model describes a carboxysome shell protein that
           proves to be a novel class, designated epsilon, of
           carbonic anhydrase. It tends to be encoded near genes
           for RuBisCo and for other carboxysome shell proteins
           [Central intermediary metabolism, One-carbon
           metabolism].
          Length = 450

 Score = 28.6 bits (64), Expect = 4.7
 Identities = 18/57 (31%), Positives = 26/57 (45%), Gaps = 10/57 (17%)

Query: 179 LISMDTEIDGAKPQIPNSVTVDSDGMV---YWSDSSTKYKSGLFEGLTSGSGRLIQY 232
           LI +DT+ D  +  IP     D+DG V    + D++  Y   L  GLT+   R    
Sbjct: 242 LIGVDTDTDAIRVHIP-----DADGFVNLYRYVDNTVLYAETL--GLTADEARQHIA 291


>gnl|CDD|178335 PLN02734, PLN02734, glycyl-tRNA synthetase.
          Length = 684

 Score = 28.9 bits (65), Expect = 5.0
 Identities = 18/55 (32%), Positives = 29/55 (52%), Gaps = 2/55 (3%)

Query: 161 IPNSVTVDSDGQTGQTEQLISMDTEIDGAKPQIPNSVTVDSDGMVYWSDSSTKYK 215
           +P +VTVDSDG     E+  S D ++     ++ + V   +DG + W D + KY 
Sbjct: 623 VPFAVTVDSDGSVTIRER-DSKD-QVRVPVEEVASVVKDLTDGRMTWEDVTAKYP 675


>gnl|CDD|191271 pfam05421, DUF751, Protein of unknown function (DUF751).  This
          family contains several plant, cyanobacterial and algal
          proteins of unknown function. The family is exclusively
          found in phototrophic organisms and may therefore play
          a role in photosynthesis (personal obs:Moxon SJ).
          Length = 61

 Score = 26.0 bits (58), Expect = 5.6
 Identities = 8/26 (30%), Positives = 13/26 (50%), Gaps = 6/26 (23%)

Query: 3  FFRN------FFISVVVGIVAITFYP 22
          F+ N      + ISV++G+    F P
Sbjct: 1  FWDNVSRYPRYLISVLLGLFLAIFEP 26


>gnl|CDD|239356 cd03058, GST_N_Tau, GST_N family, Class Tau subfamily; GSTs are
           cytosolic dimeric proteins involved in cellular
           detoxification by catalyzing the conjugation of
           glutathione (GSH) with a wide range of endogenous and
           xenobiotic alkylating agents, including carcinogens,
           therapeutic drugs, environmental toxins and products of
           oxidative stress. The GST fold contains an N-terminal
           TRX-fold domain and a C-terminal alpha helical domain,
           with an active site located in a cleft between the two
           domains. The plant-specific class Tau GST subfamily has
           undergone extensive gene duplication. The Arabidopsis
           and Oryza genomes contain 28 and 40 Tau GSTs,
           respectively. They are primarily responsible for
           herbicide detoxification together with class Phi GSTs,
           showing class specificity in substrate preference. Tau
           enzymes are highly efficient in detoxifying
           diphenylether and aryloxyphenoxypropionate herbicides.
           In addition, Tau GSTs play important roles in
           intracellular signalling, biosynthesis of anthocyanin,
           responses to soil stresses and responses to auxin and
           cytokinin hormones.
          Length = 74

 Score = 26.5 bits (59), Expect = 6.1
 Identities = 17/44 (38%), Positives = 22/44 (50%), Gaps = 7/44 (15%)

Query: 220 EGLTSGSGRLIQYNPKTKKNKVLIANLHFANGVELSADESFVIV 263
           E L + S  L+  NP  KK  VL+   H  NG  +   ES +IV
Sbjct: 31  EDLGNKSELLLASNPVHKKIPVLL---H--NGKPIC--ESLIIV 67


>gnl|CDD|185133 PRK15211, PRK15211, fimbrial chaperone protein PefD; Provisional.
          Length = 229

 Score = 27.9 bits (62), Expect = 6.2
 Identities = 17/47 (36%), Positives = 21/47 (44%), Gaps = 4/47 (8%)

Query: 46  ALNEKLNNAEKLFENEIHGPEAVVKHKGDLYMGVDG----GHILKLS 88
           AL E   NAEK  +    G E  +K+    Y  V G    G  +KLS
Sbjct: 139 ALKEGRENAEKNLQIIHRGGETYLKNPTPYYFAVTGVKLNGQPVKLS 185


>gnl|CDD|236281 PRK08517, PRK08517, DNA polymerase III subunit epsilon;
           Provisional.
          Length = 257

 Score = 28.1 bits (63), Expect = 7.0
 Identities = 18/75 (24%), Positives = 31/75 (41%), Gaps = 16/75 (21%)

Query: 197 VTVDSDGMVYWSDSSTKYKSGLF-----EGLTSGSGRLIQYNPKTKKNKVL-IANLHFAN 250
           +  + + ++      T  K  +F     E  T+GS        K KK++++ I  +   N
Sbjct: 48  LVENKENLITLKTRFTPIKDQVFCFVDIE--TNGS--------KPKKHQIIEIGAVKVKN 97

Query: 251 GVELSADESFVIVAE 265
           G  +   ESFV   E
Sbjct: 98  GEIIDRFESFVKAKE 112


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.136    0.396 

Gapped
Lambda     K      H
   0.267   0.0778    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,063,791
Number of extensions: 1626669
Number of successful extensions: 904
Number of sequences better than 10.0: 1
Number of HSP's gapped: 899
Number of HSP's successfully gapped: 31
Length of query: 337
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 240
Effective length of database: 6,635,264
Effective search space: 1592463360
Effective search space used: 1592463360
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 59 (26.4 bits)