RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy18108
         (102 letters)



>gnl|CDD|214853 smart00848, Inhibitor_I29, Cathepsin propeptide inhibitor domain
          (I29).  This domain is found at the N-terminus of some
          C1 peptidases such as Cathepsin L where it acts as a
          propeptide. There are also a number of proteins that
          are composed solely of multiple copies of this domain
          such as the peptidase inhibitor salarin. This family is
          classified as I29 by MEROPS. Peptide proteinase
          inhibitors can be found as single domain proteins or as
          single or multiple domains within proteins; these are
          referred to as either simple or compound inhibitors,
          respectively. In many cases they are synthesised as
          part of a larger precursor protein, either as a
          prepropeptide or as an N-terminal domain associated
          with an inactive peptidase or zymogen. This domain
          prevents access of the substrate to the active site.
          Removal of the N-terminal inhibitor domain either by
          interaction with a second peptidase or by autocatalytic
          cleavage activates the zymogen. Other inhibitors
          interact direct with proteinases using a simple
          noncovalent lock and key mechanism; while yet others
          use a conformational change-based trapping mechanism
          that depends on their structural and thermodynamic
          properties.
          Length = 57

 Score = 65.3 bits (160), Expect = 5e-16
 Identities = 27/57 (47%), Positives = 39/57 (68%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90
          FE++ +   KSY ++EE A+RFA+F++NLK IE+ NK    +   G+N  SDLT EE
Sbjct: 1  FEQWKKKHGKSYSSEEEEARRFAIFKENLKKIEEHNKKYEHSYKLGVNQFSDLTPEE 57


>gnl|CDD|219764 pfam08246, Inhibitor_I29, Cathepsin propeptide inhibitor domain
          (I29).  This domain is found at the N-terminus of some
          C1 peptidases such as Cathepsin L where it acts as a
          propeptide. There are also a number of proteins that
          are composed solely of multiple copies of this domain
          such as the peptidase inhibitor salarin. This family is
          classified as I29 by MEROPS.
          Length = 58

 Score = 63.0 bits (154), Expect = 5e-15
 Identities = 24/58 (41%), Positives = 38/58 (65%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91
          FE + + + KSY ++EE   RF +F++NL+ IE+ NK  + + T G+N  +DLT EE 
Sbjct: 1  FEDWKKKYGKSYYSEEEELYRFQIFKENLRFIEEHNKKGNVSYTLGLNQFADLTDEEF 58


>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
          Length = 489

 Score = 49.4 bits (118), Expect = 3e-08
 Identities = 25/76 (32%), Positives = 38/76 (50%), Gaps = 1/76 (1%)

Query: 27  NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDL 86
           N E++  F  FI++  K Y T +E+ +R+  F +NL  I   N  E+     G+N   DL
Sbjct: 162 NLENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDL 221

Query: 87  TREEMKSRLGLNLSKH 102
           + EE K +  L L   
Sbjct: 222 SFEEFKKKY-LTLKSF 236


>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional.
          Length = 348

 Score = 43.5 bits (102), Expect = 3e-06
 Identities = 21/61 (34%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKS 93
          FE+F R + ++Y T  E  +R A FE NL+L+ + ++  +  A +GI    DL+  E  +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMRE-HQARNPHARFGITKFFDLSEAEFAA 96

Query: 94 R 94
          R
Sbjct: 97 R 97


>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional.
          Length = 448

 Score = 42.0 bits (99), Expect = 1e-05
 Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 7/83 (8%)

Query: 13  LFGQMKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
           LF     +++ K E  E   +FE+F + +++ + T  E   RF  F +N   +    K  
Sbjct: 106 LFKDGYISDDPKLEF-EVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEV----KSH 160

Query: 73  HGTATY--GINHLSDLTREEMKS 93
            G   Y   IN  SDLT EE + 
Sbjct: 161 KGDEPYSKEINKFSDLTEEEFRK 183


>gnl|CDD|215867 pfam00342, PGI, Phosphoglucose isomerase.  Phosphoglucose isomerase
           catalyzes the interconversion of glucose-6-phosphate and
           fructose-6-phosphate.
          Length = 483

 Score = 28.6 bits (64), Expect = 0.50
 Identities = 10/38 (26%), Positives = 12/38 (31%)

Query: 29  EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIE 66
           E +           K    K  VAK F     N K +E
Sbjct: 164 ETMTNARSAREWLLKKLGEKSAVAKHFVALSTNAKKVE 201


>gnl|CDD|218049 pfam04373, DUF511, Protein of unknown function (DUF511).  Bacterial
           protein of unknown function.
          Length = 310

 Score = 26.5 bits (59), Expect = 2.3
 Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 4/50 (8%)

Query: 17  MKSNNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIE 66
             + N L  EN +    F+ FI+D ++ Y T +  +   A F+  LK  E
Sbjct: 265 WDTANRLAEENKD----FKDFIKDVNQYYQTGKRKSIWKAEFDKVLKDEE 310


>gnl|CDD|217535 pfam03401, TctC, Tripartite tricarboxylate transporter family
           receptor.  These probable extra-cytoplasmic solute
           receptors are strongly overrepresented in several
           beta-proteobacteria. This family, formerly known as Bug
           - Bordetella uptake gene (bug) product - is a family of
           bacterial tripartite tricarboxylate receptors of the
           extracytoplasmic solute binding receptor-dependent
           transporter group of families, distinct from the ABC and
           TRAP-T families. The TctABC system has been
           characterized in S. typhimurium, and TctC is the
           extracytoplasmic tricarboxylate-binding receptor which
           binds the transporters TctA and TctB, two integral
           membrane proteins. Complete three-component systems are
           found only in bacteria.
          Length = 274

 Score = 26.6 bits (59), Expect = 2.5
 Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 69  NKGEHGTATYGINHLSDLTREEMKSRLGLNLS 100
           N G+   A+ GI   + L  E + S+ G+ LS
Sbjct: 103 NPGKLTFASAGIGTSNHLAGELLASKAGVQLS 134


>gnl|CDD|178380 PLN02783, PLN02783, diacylglycerol O-acyltransferase.
          Length = 315

 Score = 26.5 bits (59), Expect = 2.7
 Identities = 12/38 (31%), Positives = 20/38 (52%), Gaps = 3/38 (7%)

Query: 46  PTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHL 83
           P++EEVA+    F + L+ + + +K   G   YG   L
Sbjct: 278 PSQEEVAEVLEQFVEALQDLFEKHKARAG---YGDLEL 312


>gnl|CDD|239950 cd04513, Glycosylasparaginase, Glycosylasparaginase catalyzes the
           hydrolysis of the glycosylamide bond of
           asparagine-linked glycoproteins. This enzyme is an
           amidase located inside lysosomes. Mutation of this gene
           in humans causes a genetic disorder known as
           aspartylglycosaminuria (AGU). The glycosylasparaginase
           precursor undergoes autoproteolysis through an N-O or
           N-S acyl rearrangement of the peptide bond, which leads
           to the cleavage of a peptide bond between an Asp and a
           Thr. This proteolysis step generates an exposed
           N-terminal catalytic threonine and activates the enzyme.
          Length = 263

 Score = 26.0 bits (58), Expect = 3.8
 Identities = 9/32 (28%), Positives = 16/32 (50%)

Query: 49  EEVAKRFAVFEDNLKLIEDLNKGEHGTATYGI 80
           + +AK F   +    ++    KGE+G A  G+
Sbjct: 224 KRIAKHFDGPDFEGAVVALNKKGEYGAACNGL 255


>gnl|CDD|235379 PRK05261, PRK05261, putative phosphoketolase; Provisional.
          Length = 785

 Score = 25.9 bits (58), Expect = 4.1
 Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 13/44 (29%)

Query: 26  ENPEHLKQFEKFIRDFSKSY-PTKEEVAKRFAVFEDNLKLIEDL 68
           +NPEHL   E    D+ +SY P  EE      +F+++ +L  +L
Sbjct: 313 DNPEHLDLLE----DWLRSYRP--EE------LFDEDGRLKPEL 344


>gnl|CDD|227216 COG4879, COG4879, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 243

 Score = 25.6 bits (56), Expect = 4.6
 Identities = 8/37 (21%), Positives = 18/37 (48%)

Query: 36  KFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGE 72
           K ++    +YP  ++    +A F    K+ E++  G+
Sbjct: 149 KSLKTLETAYPELKQRLLNYARFYSAFKIAEEIALGK 185


>gnl|CDD|233194 TIGR00930, 2a30, K-Cl cotransporter.  [Transport and binding
           proteins, Other].
          Length = 953

 Score = 25.4 bits (56), Expect = 7.4
 Identities = 11/44 (25%), Positives = 16/44 (36%)

Query: 26  ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLN 69
              E ++ FE+ IR F      K+  AK   +       I D  
Sbjct: 836 PQTESMEAFEEMIRPFRLHKTEKDREAKDPKMTWTKPWKITDAE 879


>gnl|CDD|235176 PRK03932, asnC, asparaginyl-tRNA synthetase; Validated.
          Length = 450

 Score = 25.1 bits (56), Expect = 8.8
 Identities = 11/43 (25%), Positives = 23/43 (53%), Gaps = 9/43 (20%)

Query: 50  EVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMK 92
           E+A  FA  EDN+ L E++ K       Y + ++ +   ++++
Sbjct: 237 EMA--FADLEDNMDLAEEMLK-------YVVKYVLENCPDDLE 270


>gnl|CDD|223416 COG0339, Dcp, Zn-dependent oligopeptidases [Amino acid transport
           and metabolism].
          Length = 683

 Score = 24.9 bits (55), Expect = 8.8
 Identities = 11/36 (30%), Positives = 17/36 (47%), Gaps = 3/36 (8%)

Query: 12  ALFGQMKS---NNELKTENPEHLKQFEKFIRDFSKS 44
            LF + K+   + E    +PE  +  E  +RDF  S
Sbjct: 112 GLFARYKALWQSPESAGLDPEQKRVVENALRDFVLS 147


>gnl|CDD|233730 TIGR02104, pulA_typeI, pullulanase, type I.  Pullulan is an
           unusual, industrially important polysaccharide in which
           short alpha-1,4 chains (maltotriose) are connected in
           alpha-1,6 linkages. Enzymes that cleave alpha-1,6
           linkages in pullulan and release maltotriose are called
           pullulanases although pullulan itself may not be the
           natural substrate. This family consists of pullulanases
           related to the subfamilies described in TIGR02102 and
           TIGR02103 but having a different domain architecture
           with shorter sequences. Members are called type I
           pullulanases.
          Length = 605

 Score = 25.0 bits (55), Expect = 8.9
 Identities = 19/71 (26%), Positives = 32/71 (45%), Gaps = 11/71 (15%)

Query: 18  KSNNELKTENPEHLKQFEKFIRDFSKSYPTK--EEVAKRFAVFEDNLKLIEDLNKGEHGT 75
           + ++  + ENPE    +E  IRDFS        +   K   + E          KG +G 
Sbjct: 115 EKDHGPRLENPEDAIIYELHIRDFSIH-ENSGVKNKGKYLGLTETG-------TKGPNGV 166

Query: 76  ATYGINHLSDL 86
           +T G+++L +L
Sbjct: 167 ST-GLDYLKEL 176


>gnl|CDD|193205 pfam12729, 4HB_MCP_1, Four helix bundle sensory module for signal
           transduction.  This family is a four helix bundle that
           operates as a ubiquitous sensory module in prokaryotic
           signal-transduction. The 4HB_MCP is always found between
           two predicted transmembrane helices indicating that it
           detects only extracellular signals. In many cases the
           domain is associated with a cytoplasmic HAMP domain
           suggesting that most proteins carrying the bundle might
           share the mechanism of transmembrane signalling which is
           well-characterized in E coli chemoreceptors.
          Length = 181

 Score = 24.9 bits (55), Expect = 9.3
 Identities = 13/44 (29%), Positives = 24/44 (54%), Gaps = 1/44 (2%)

Query: 20  NNELKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLK 63
            +EL  +  E   + +K ++ + K+  T+EE  K F  F++ LK
Sbjct: 77  RDELLKDIEELRAEIDKLLKKYEKTILTEEE-KKLFNEFKEQLK 119


>gnl|CDD|219951 pfam08644, SPT16, FACT complex subunit (SPT16/CDC68).  Proteins in
           this family are subunits the FACT complex. The FACT
           complex plays a role in transcription initiation and
           promotes binding of TATA-binding protein (TBP) to a TATA
           box in chromatin.
          Length = 152

 Score = 24.5 bits (54), Expect = 9.8
 Identities = 13/48 (27%), Positives = 24/48 (50%)

Query: 23  LKTENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNK 70
            ++++ +HL+   K I++  K    +E   K  A   +  KLI + NK
Sbjct: 72  FRSKDSKHLQDVFKAIKELQKRVKKRETERKEKADVVEQEKLILNRNK 119


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.128    0.347 

Gapped
Lambda     K      H
   0.267   0.0764    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,065,534
Number of extensions: 428116
Number of successful extensions: 533
Number of sequences better than 10.0: 1
Number of HSP's gapped: 532
Number of HSP's successfully gapped: 51
Length of query: 102
Length of database: 10,937,602
Length adjustment: 68
Effective length of query: 34
Effective length of database: 7,921,530
Effective search space: 269332020
Effective search space used: 269332020
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 53 (24.1 bits)