RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15063
         (417 letters)



>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
           synthesized as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. Alignment contains also inactive enzymes
           that have substitutions of the catalytic triad residues.
          Length = 232

 Score =  110 bits (276), Expect = 6e-28
 Identities = 50/141 (35%), Positives = 71/141 (50%), Gaps = 6/141 (4%)

Query: 154 LRDTPVSELVLHLGDHDLTQLNETSHVRRGVRRVLFHSHFHPFVLSNDIALLQLDRPVPL 213
           +  +  S   + LG HDL+  NE       V++V+ H +++P    NDIALL+L RPV L
Sbjct: 44  VYSSAPSNYTVRLGSHDLSS-NEGGGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTL 102

Query: 214 TGTIQPVCLPQKGESFI-GKRGHVVGWGVTSFPMGEPSPTLQKLEVKVLSNARCSTVIEE 272
           +  ++P+CLP  G +   G    V GWG TS   G     LQ++ V ++SNA C      
Sbjct: 103 SDNVRPICLPSSGYNLPAGTTCTVSGWGRTS-EGGPLPDVLQEVNVPIVSNAECKRAYSY 161

Query: 273 SIGI--GMLCAAPDET-QGTC 290
              I   MLCA   E  +  C
Sbjct: 162 GGTITDNMLCAGGLEGGKDAC 182



 Score = 71.2 bits (175), Expect = 3e-14
 Identities = 29/62 (46%), Positives = 38/62 (61%), Gaps = 7/62 (11%)

Query: 321 ILGG-EADIGEFPWQVAI-ALDGMFFCGGALLNEHFVLTAAHCIMTGGPLTFEQDGYHVL 378
           I+GG EA IG FPWQV++    G  FCGG+L++  +VLTAAHC+ +  P       Y V 
Sbjct: 1   IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCVYSSAP-----SNYTVR 55

Query: 379 AG 380
            G
Sbjct: 56  LG 57



 Score = 60.0 bits (146), Expect = 2e-10
 Identities = 26/46 (56%), Positives = 31/46 (67%), Gaps = 2/46 (4%)

Query: 365 GGPLTFEQDGYHVLAGIVSYGVTGCAIMPSYPDLYTRVSEYIRWIH 410
           GGPL    +G  VL GIVS+G  GCA  P+YP +YTRVS Y+ WI 
Sbjct: 187 GGPLVCNDNGRGVLVGIVSWGS-GCAR-PNYPGVYTRVSSYLDWIQ 230


>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease.  Many of these
           are synthesised as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. A few, however, are active as single chain
           molecules, and others are inactive due to substitutions
           of the catalytic triad residues.
          Length = 229

 Score =  108 bits (273), Expect = 1e-27
 Identities = 47/141 (33%), Positives = 70/141 (49%), Gaps = 6/141 (4%)

Query: 154 LRDTPVSELVLHLGDHDLTQLNETSHVRRGVRRVLFHSHFHPFVLSNDIALLQLDRPVPL 213
           +R +  S + + LG HDL+  +        V +V+ H +++P    NDIALL+L  PV L
Sbjct: 45  VRGSDPSNIRVRLGSHDLS--SGEEGQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTL 102

Query: 214 TGTIQPVCLPQKGESFI-GKRGHVVGWGVTSFPMGEPSPTLQKLEVKVLSNARCSTVIEE 272
           +  ++P+CLP    +   G    V GWG TS   G    TLQ++ V ++SNA C      
Sbjct: 103 SDNVRPICLPSSNYNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAYSG 162

Query: 273 SIGI--GMLCA-APDETQGTC 290
              I   MLCA   +  +  C
Sbjct: 163 GGAITDNMLCAGGLEGGKDAC 183



 Score = 71.2 bits (175), Expect = 4e-14
 Identities = 25/46 (54%), Positives = 34/46 (73%), Gaps = 1/46 (2%)

Query: 319 RIILGGEADIGEFPWQVAIAL-DGMFFCGGALLNEHFVLTAAHCIM 363
           RI+ G EA+IG FPWQV++    G  FCGG+L++  +VLTAAHC+ 
Sbjct: 1   RIVGGSEANIGSFPWQVSLQYGGGRHFCGGSLISPRWVLTAAHCVR 46



 Score = 57.3 bits (139), Expect = 2e-09
 Identities = 26/45 (57%), Positives = 29/45 (64%), Gaps = 3/45 (6%)

Query: 365 GGPLTFEQDGYHVLAGIVSYGVTGCAIMPSYPDLYTRVSEYIRWI 409
           GGPL    DG  VL GIVS+G  GCA  P  P +YTRVS Y+ WI
Sbjct: 188 GGPLVC-NDGRWVLVGIVSWGS-GCA-RPGKPGVYTRVSSYLDWI 229


>gnl|CDD|215708 pfam00089, Trypsin, Trypsin. 
          Length = 218

 Score = 85.2 bits (211), Expect = 4e-19
 Identities = 41/127 (32%), Positives = 63/127 (49%), Gaps = 6/127 (4%)

Query: 166 LGDHDLTQLNETSHVRRGVRRVLFHSHFHPFVLSNDIALLQLDRPVPLTGTIQPVCLPQK 225
           LG H++  L E    +  V++V+ H +++P    NDIALL+L  PV L  T++P+CLP  
Sbjct: 54  LGAHNI-VLREGGEQKFDVKKVIVHPNYNPDTD-NDIALLKLKSPVTLGDTVRPICLPTA 111

Query: 226 GESF-IGKRGHVVGWGVTSFPMGEPSPTLQKLEVKVLSNARCSTVIEESIGIGMLCAAPD 284
                +G    V GWG T         TLQ++ V V+S   C +    ++   M+CA   
Sbjct: 112 SSDLPVGTTCTVSGWGNTKT--LGLPDTLQEVTVPVVSRETCRSAYGGTVTDNMICAGAG 169

Query: 285 ETQGTCF 291
             +  C 
Sbjct: 170 G-KDACQ 175



 Score = 58.6 bits (142), Expect = 5e-10
 Identities = 25/44 (56%), Positives = 35/44 (79%), Gaps = 2/44 (4%)

Query: 321 ILGG-EADIGEFPWQVAIAL-DGMFFCGGALLNEHFVLTAAHCI 362
           I+GG EA  G FPWQV++ +  G  FCGG+L++E++VLTAAHC+
Sbjct: 1   IVGGDEAQPGSFPWQVSLQVSSGKHFCGGSLISENWVLTAAHCV 44



 Score = 41.3 bits (97), Expect = 4e-04
 Identities = 24/46 (52%), Positives = 29/46 (63%), Gaps = 5/46 (10%)

Query: 364 TGGPLTFEQDGYHVLAGIVSYGVTGCAIMPSYPDLYTRVSEYIRWI 409
           +GGPL    DG   L GIVS+G  GCA   +YP +YT VS Y+ WI
Sbjct: 178 SGGPL-VCSDGE--LIGIVSWGY-GCAS-GNYPGVYTPVSSYLDWI 218


>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 413

 Score = 47.2 bits (112), Expect = 9e-06
 Identities = 23/56 (41%), Positives = 30/56 (53%), Gaps = 5/56 (8%)

Query: 319 RIILGGEADIGEFPWQVAIALDG-----MFFCGGALLNEHFVLTAAHCIMTGGPLT 369
           RII G  A+ GE+P  VA+           FCGG+ L   +VLTAAHC     P++
Sbjct: 32  RIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADASSPIS 87



 Score = 37.9 bits (88), Expect = 0.007
 Identities = 18/45 (40%), Positives = 24/45 (53%), Gaps = 1/45 (2%)

Query: 365 GGPLTFEQDGYHVLAGIVSYGVTGCAIMPSYPDLYTRVSEYIRWI 409
           GGP+  + +   V  G+VS+G  GC      P +YT VS Y  WI
Sbjct: 231 GGPIFHKGEEGRVQRGVVSWGDGGCG-GTLIPGVYTNVSNYQDWI 274



 Score = 29.5 bits (66), Expect = 3.8
 Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 171 LTQLNETSHVRRG-VRRVLFHSHFHPFVLSNDIALLQLDRPVPL 213
           +  LN++S   RG VR +  H  + P  L NDIA+L+L R   L
Sbjct: 94  VVDLNDSSQAERGHVRTIYVHEFYSPGNLGNDIAVLELARAASL 137


>gnl|CDD|128922 smart00678, WWE, Domain in Deltex and TRIP12 homologues. Possibly
           involved in regulation of ubiquitin-mediated
           proteolysis. 
          Length = 73

 Score = 30.0 bits (68), Expect = 0.32
 Identities = 8/25 (32%), Positives = 14/25 (56%)

Query: 169 HDLTQLNETSHVRRGVRRVLFHSHF 193
           + +TQ N+ +   R VRRV +  + 
Sbjct: 48  NAMTQYNQATGTTRKVRRVTYSPYS 72


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this
          family (Kri1p) is found to be required for 40S ribosome
          biogenesis in the nucleolus.
          Length = 99

 Score = 30.3 bits (69), Expect = 0.42
 Identities = 20/74 (27%), Positives = 34/74 (45%), Gaps = 9/74 (12%)

Query: 5  EDKVKQIKEILAQLSPSDYS-EDNGSDDYDYYDEELPEQLKYLTLFEEKYGKGRGNGGSI 63
          E+K+++IK++ A L  +D S ED    D+D   E+  E++    LF + Y          
Sbjct: 27 EEKLEKIKKV-AGLRGADLSEEDLADGDFD--PEKWDEEMA--KLFGDDYY---EEEDEE 78

Query: 64 SDPFYKGSKDIMGD 77
            P +   +DI   
Sbjct: 79 EKPTWDDDEDIGDL 92


>gnl|CDD|223009 PHA03211, PHA03211, serine/threonine kinase US3; Provisional.
          Length = 461

 Score = 32.2 bits (73), Expect = 0.60
 Identities = 17/65 (26%), Positives = 35/65 (53%), Gaps = 7/65 (10%)

Query: 5   EDKVKQIKEILAQLSPSDYSEDNGSDDYDYYDEELPEQLKYLTLFEEKYGKGR---GNGG 61
             ++ QI+E+LA++  S+   D+G++  D  D++ P+ + Y     ++Y +     G+G 
Sbjct: 65  AARLCQIQELLAEMRSSEEYPDSGAEAEDDDDDDAPDDVAY----PDEYAEDDFLPGDGA 120

Query: 62  SISDP 66
              DP
Sbjct: 121 PDHDP 125


>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986).  This
           domain is found in serine proteases and is predicted to
           contain disulphide bonds.
          Length = 267

 Score = 30.8 bits (69), Expect = 1.3
 Identities = 9/36 (25%), Positives = 20/36 (55%)

Query: 327 DIGEFPWQVAIALDGMFFCGGALLNEHFVLTAAHCI 362
           +   +PW   + ++G + C G L++  +VL +  C+
Sbjct: 12  EELHWPWIAKVYVEGNYRCTGVLIDLSWVLVSHSCL 47


>gnl|CDD|226119 COG3591, COG3591, V8-like Glu-specific endopeptidase [Amino acid
           transport and metabolism].
          Length = 251

 Score = 30.0 bits (68), Expect = 1.7
 Identities = 11/37 (29%), Positives = 17/37 (45%), Gaps = 3/37 (8%)

Query: 331 FPWQVAIALD---GMFFCGGALLNEHFVLTAAHCIMT 364
           FP+   +  +   G       L+  + VLTA HCI +
Sbjct: 49  FPYSAVVQFEAATGRLCTAATLIGPNTVLTAGHCIYS 85


>gnl|CDD|178170 PLN02555, PLN02555, limonoid glucosyltransferase.
          Length = 480

 Score = 30.5 bits (69), Expect = 1.8
 Identities = 14/54 (25%), Positives = 26/54 (48%)

Query: 197 VLSNDIALLQLDRPVPLTGTIQPVCLPQKGESFIGKRGHVVGWGVTSFPMGEPS 250
           VL++ ++ L + RP      ++P  LP++     G +G +V W      +  PS
Sbjct: 302 VLNSGVSFLWVMRPPHKDSGVEPHVLPEEFLEKAGDKGKIVQWCPQEKVLAHPS 355


>gnl|CDD|178296 PLN02693, PLN02693, IAA-amino acid hydrolase.
          Length = 437

 Score = 30.4 bits (68), Expect = 1.9
 Identities = 16/67 (23%), Positives = 35/67 (52%), Gaps = 5/67 (7%)

Query: 9   KQIKEILAQLSPSDYSEDNGSDDYDYYDEELPEQLKYLTLFEEKYGKGRGNGGSISDPFY 68
           K ++++L Q +  + + + GS+D+ Y+ E +P     L + +E  G       S   P Y
Sbjct: 345 KVVRDLLGQEAFVEAAPEMGSEDFSYFAETIPGHFSLLGMQDETNGY-----ASSHSPLY 399

Query: 69  KGSKDIM 75
           + ++D++
Sbjct: 400 RINEDVL 406


>gnl|CDD|227844 COG5557, COG5557, Polysulphide reductase [Energy production and
           conversion].
          Length = 401

 Score = 29.8 bits (67), Expect = 2.8
 Identities = 20/90 (22%), Positives = 25/90 (27%), Gaps = 22/90 (24%)

Query: 322 LGGEADIGEFPWQVAIALDGMFFCGGALLNEHFVLTAAHCIMTGGPLTFEQDGYHVLAGI 381
           LG       FPW + IA D + F G         L A   I++     F +  Y   A  
Sbjct: 39  LGSTGMRNGFPWGLYIAFD-LLFVG---------LAAGGLILSSAVYVFNRGQYKPFARP 88

Query: 382 VSYGVTGCAI------------MPSYPDLY 399
                    I               YP  Y
Sbjct: 89  ALLASLFGIILAGLSILPDIGRYWLYPYFY 118


>gnl|CDD|198319 cd03210, GST_C_Pi, C-terminal, alpha helical domain of Class Pi
          Glutathione S-transferases.  Glutathione S-transferase
          (GST) C-terminal domain family, Class Pi subfamily;
          GSTs are cytosolic dimeric proteins involved in
          cellular detoxification by catalyzing the conjugation
          of glutathione (GSH) with a wide range of endogenous
          and xenobiotic alkylating agents, including
          carcinogens, therapeutic drugs, environmental toxins,
          and products of oxidative stress. The GST fold contains
          an N-terminal thioredoxin-fold domain and a C-terminal
          alpha helical domain, with an active site located in a
          cleft between the two domains. GSH binds to the
          N-terminal domain while the hydrophobic substrate
          occupies a pocket in the C-terminal domain. Class Pi
          GST is a homodimeric eukaryotic protein. The human
          GSTP1 is mainly found in erythrocytes, kidney, placenta
          and fetal liver. It is involved in stress responses and
          in cellular proliferation pathways as an inhibitor of
          JNK (c-Jun N-terminal kinase). Following oxidative
          stress, monomeric GSTP1 dissociates from JNK and
          dimerizes, losing its ability to bind JNK and causing
          an increase in JNK activity, thereby promoting
          apoptosis. GSTP1 is expressed in various tumors and is
          the predominant GST in a wide range of cancer cells. It
          has been implicated in the development of
          multidrug-resistant tumors.
          Length = 126

 Score = 28.1 bits (63), Expect = 3.9
 Identities = 17/41 (41%), Positives = 21/41 (51%), Gaps = 4/41 (9%)

Query: 25 EDNGSDDYDYYDEELPEQLKYL-TLFEEKYGKGRGNGGSIS 64
           + G DDY    ++LPEQLK    L  +  GKG   G  IS
Sbjct: 29 YEAGKDDYI---KDLPEQLKPFEKLLAKNNGKGFIVGDKIS 66


>gnl|CDD|225463 COG2911, COG2911, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 1278

 Score = 29.6 bits (67), Expect = 4.4
 Identities = 12/73 (16%), Positives = 18/73 (24%), Gaps = 8/73 (10%)

Query: 320 IILGGEADIGEFPWQVAIALDGMFFCGGALLNEHFVLTAAHC-IMTGGPLTFEQDGYHVL 378
             +     IG +  Q A AL               +   AHC       L   Q      
Sbjct: 689 GTIRLSTAIGPWSLQRAAALA-------YDGKARRLNIGAHCWDNGNAELCLPQASRAGG 741

Query: 379 AGIVSYGVTGCAI 391
            G +   +    +
Sbjct: 742 PGRIPLALNRFDL 754


>gnl|CDD|180682 PRK06761, PRK06761, hypothetical protein; Provisional.
          Length = 282

 Score = 29.0 bits (65), Expect = 5.2
 Identities = 10/46 (21%), Positives = 18/46 (39%), Gaps = 1/46 (2%)

Query: 1   MLSFEDKVKQI-KEILAQLSPSDYSEDNGSDDYDYYDEELPEQLKY 45
            +   +  K +  EIL  L       +N  ++ D Y   L ++L  
Sbjct: 233 RIKVLEARKNLELEILDMLKMKKEKINNTKEEIDSYRSMLKDKLTI 278


>gnl|CDD|219287 pfam07075, DUF1343, Protein of unknown function (DUF1343).  This
           family consists of several hypothetical bacterial
           proteins of around 400 residues in length. The function
           of this family is unknown.
          Length = 362

 Score = 29.0 bits (66), Expect = 5.3
 Identities = 11/33 (33%), Positives = 14/33 (42%), Gaps = 1/33 (3%)

Query: 200 NDIALLQLDRPVPLTGTIQPVCLPQKG-ESFIG 231
                + LDRP PL G      +     ESF+G
Sbjct: 109 AGKEFIVLDRPNPLGGLYVEGPVLDPEFESFVG 141


>gnl|CDD|211667 TIGR01530, nadN, NAD pyrophosphatase/5'-nucleotidase NadN.  This
           model describes NadN of Haemophilus influenzae and a
           small number of close homologs in pathogenic,
           Gram-negative bacteria. NadN is a periplasmic enzyme
           that cleaves NAD (nicotinamide adenine dinucleotide) to
           NMN (nicotinamide mononucleotide) and AMP. The NMN must
           be converted by a 5'-nucleotidase to nicotinamide
           riboside for import. NadN belongs a large family of
           5'-nucleotidases and has NMN 5'-nucleotidase activity
           for NMN, AMP, etc [Transport and binding proteins,
           Other, Biosynthesis of cofactors, prosthetic groups, and
           carriers, Pyridine nucleotides].
          Length = 545

 Score = 28.4 bits (63), Expect = 8.4
 Identities = 18/60 (30%), Positives = 25/60 (41%), Gaps = 4/60 (6%)

Query: 145 VGG----VCDIDQLRDTPVSELVLHLGDHDLTQLNETSHVRRGVRRVLFHSHFHPFVLSN 200
           +GG       +++LR    + LVLH GD     L  T         V+   +FH F L N
Sbjct: 31  IGGFSAVNAKLNKLRKKYKNPLVLHAGDAITGTLYFTLFGGSADAAVMNAGNFHYFTLGN 90


>gnl|CDD|213859 TIGR03767, P_acnes_RR, metallophosphoesterase, PPA1498 family.
           This model describes a small collection of probable
           metallophosphoresterases, related to pfam00149 but with
           long inserts separating some of the shared motifs such
           that the homology is apparent only through multiple
           sequence alignment. Members of this protein family, in
           general, have a Sec-independent TAT (twin-arginine
           translocation) signal sequence, N-terminal to the region
           modeled by This model. Members include YP_056203.1 from
           Propionibacterium acnes KPA171202.
          Length = 496

 Score = 28.3 bits (63), Expect = 9.4
 Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 2/59 (3%)

Query: 23  YSEDNGSDDYDYYDEELPEQLKYLTLFEEKYGKGRGNGGSISDPFYKGSKDIMGDSSTT 81
           YSEDN S    YY  ++   ++ +++  +   +  G+ GS+    +K  KD +  SS T
Sbjct: 282 YSEDNLSAGTGYYTFDIAGGVRGISM--DTTNRAGGDEGSLGQTQFKWIKDTLRASSDT 338


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.139    0.427 

Gapped
Lambda     K      H
   0.267   0.0714    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,942,747
Number of extensions: 2168972
Number of successful extensions: 1714
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1703
Number of HSP's successfully gapped: 29
Length of query: 417
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 318
Effective length of database: 6,546,556
Effective search space: 2081804808
Effective search space used: 2081804808
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (26.9 bits)