RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2626
         (355 letters)



>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
           synthesized as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. Alignment contains also inactive enzymes
           that have substitutions of the catalytic triad residues.
          Length = 232

 Score =  256 bits (655), Expect = 9e-85
 Identities = 101/247 (40%), Positives = 146/247 (59%), Gaps = 16/247 (6%)

Query: 108 IVGGTPSERGQWPWMAAIGFKIRRRRANGKPEWMCGGALITKRYVLTAAHCVSPDTTGEY 167
           IVGG+ ++ G +PW  ++ +   R          CGG+LI+ R+VLTAAHCV       Y
Sbjct: 1   IVGGSEAKIGSFPWQVSLQYTGGR--------HFCGGSLISPRWVLTAAHCVYSSAPSNY 52

Query: 168 EPYIVHVGSIDL-EDTSSGVSIEIERPIIHEQYTSARKLNDIALFRLREDAPLSDLIQPI 226
               V +GS DL  +   G  I++++ I+H  Y  +   NDIAL +L+    LSD ++PI
Sbjct: 53  T---VRLGSHDLSSNEGGGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPI 109

Query: 227 CLPYDTNLRSELFERKTPFVAGWGSTVFRGPLSPKLRHVQISVVDNPKCRQIFSNYGATI 286
           CLP   +    L    T  V+GWG T   GPL   L+ V + +V N +C++ +S YG TI
Sbjct: 110 CLP---SSGYNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAYS-YGGTI 165

Query: 287 NENILCAGVLSGGKDSCGGDSGGPLMYPLDTKYYIIGVVSYGKKCAEVGFPGVYTRVTNY 346
            +N+LCAG L GGKD+C GDSGGPL+   + +  ++G+VS+G  CA   +PGVYTRV++Y
Sbjct: 166 TDNMLCAGGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSY 225

Query: 347 IQWIADN 353
           + WI   
Sbjct: 226 LDWIQKT 232


>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease.  Many of these
           are synthesised as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. A few, however, are active as single chain
           molecules, and others are inactive due to substitutions
           of the catalytic triad residues.
          Length = 229

 Score =  253 bits (648), Expect = 9e-84
 Identities = 102/245 (41%), Positives = 141/245 (57%), Gaps = 17/245 (6%)

Query: 107 RIVGGTPSERGQWPWMAAIGFKIRRRRANGKPEWMCGGALITKRYVLTAAHCVSPDTTGE 166
           RIVGG+ +  G +PW  ++ +   R          CGG+LI+ R+VLTAAHCV       
Sbjct: 1   RIVGGSEANIGSFPWQVSLQYGGGR--------HFCGGSLISPRWVLTAAHCVRGSDPSN 52

Query: 167 YEPYIVHVGSIDLEDTSSGVSIEIERPIIHEQYTSARKLNDIALFRLREDAPLSDLIQPI 226
                V +GS DL     G  I++ + IIH  Y  +   NDIAL +L+E   LSD ++PI
Sbjct: 53  IR---VRLGSHDLSSGEEGQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPI 109

Query: 227 CLPYDTNLRSELFERKTPFVAGWGSTVFR-GPLSPKLRHVQISVVDNPKCRQIFSNYGAT 285
           CLP   +    +    T  V+GWG T    G L   L+ V + +V N  CR+ +S  G  
Sbjct: 110 CLP---SSNYNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAYSGGG-A 165

Query: 286 INENILCAGVLSGGKDSCGGDSGGPLMYPLDTKYYIIGVVSYGKKCAEVGFPGVYTRVTN 345
           I +N+LCAG L GGKD+C GDSGGPL+   D ++ ++G+VS+G  CA  G PGVYTRV++
Sbjct: 166 ITDNMLCAGGLEGGKDACQGDSGGPLVC-NDGRWVLVGIVSWGSGCARPGKPGVYTRVSS 224

Query: 346 YIQWI 350
           Y+ WI
Sbjct: 225 YLDWI 229


>gnl|CDD|215708 pfam00089, Trypsin, Trypsin. 
          Length = 218

 Score =  195 bits (498), Expect = 2e-61
 Identities = 91/244 (37%), Positives = 134/244 (54%), Gaps = 27/244 (11%)

Query: 108 IVGGTPSERGQWPWMAAIGFKIRRRRANGKPEWMCGGALITKRYVLTAAHCVSPDTTGEY 167
           IVGG  ++ G +PW  ++             +  CGG+LI++ +VLTAAHCVS       
Sbjct: 1   IVGGDEAQPGSFPWQVSL--------QVSSGKHFCGGSLISENWVLTAAHCVS-----NA 47

Query: 168 EPYIVHVGSIDLEDTSSGVS-IEIERPIIHEQYTSARKLNDIALFRLREDAPLSDLIQPI 226
           +   V +G+ ++     G    ++++ I+H  Y      NDIAL +L+    L D ++PI
Sbjct: 48  KSVRVVLGAHNIVLREGGEQKFDVKKVIVHPNYNP-DTDNDIALLKLKSPVTLGDTVRPI 106

Query: 227 CLPYDTNLRSELFERKTPFVAGWGSTVFRGPLSPKLRHVQISVVDNPKCRQIFSNYGATI 286
           CLP      S+L    T  V+GWG+T   G     L+ V + VV    CR   S YG T+
Sbjct: 107 CLPTA---SSDLPVGTTCTVSGWGNTKTLGL-PDTLQEVTVPVVSRETCR---SAYGGTV 159

Query: 287 NENILCAGVLSGGKDSCGGDSGGPLMYPLDTKYYIIGVVSYGKKCAEVGFPGVYTRVTNY 346
            +N++CAG  +GGKD+C GDSGGPL+        +IG+VS+G  CA   +PGVYT V++Y
Sbjct: 160 TDNMICAG--AGGKDACQGDSGGPLVCSDG---ELIGIVSWGYGCASGNYPGVYTPVSSY 214

Query: 347 IQWI 350
           + WI
Sbjct: 215 LDWI 218


>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 413

 Score =  121 bits (305), Expect = 4e-31
 Identities = 82/263 (31%), Positives = 112/263 (42%), Gaps = 19/263 (7%)

Query: 99  VNAFNSSKRIVGGTPSERGQWPWMAAIGFKIRRRRANGKPEWMCGGALITKRYVLTAAHC 158
             A   S RI+GG+ +  G++P + A+  +I     +G     CGG+ +  RYVLTAAHC
Sbjct: 24  QTADEVSSRIIGGSNANAGEYPSLVALVDRISDY-VSGT---FCGGSKLGGRYVLTAAHC 79

Query: 159 VSPDTTGEYEPYIVHVGSIDLEDTSSGVSIEIERPIIHEQYTSARKLNDIALFRLREDAP 218
               +    +   V V   DL D+S      +    +HE Y+     NDIA+  L   A 
Sbjct: 80  ADASSPISSDVNRVVV---DLNDSSQAERGHVRTIYVHEFYSPGNLGNDIAVLELARAAS 136

Query: 219 LSDLIQPICLPYDTNLRSELFERKTPFVAGWGSTVFRGP--LSPK---LRHVQISVVDNP 273
           L  +        DT   + +          +G T        SPK   L  V +  V   
Sbjct: 137 LPRVKITSFDASDT-FLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTILHEVAVLFVPLS 195

Query: 274 KCRQIFSNYGATINENIL---CAGVLSGGKDSCGGDSGGPLMYPLDTKYYIIGVVSYG-K 329
            C Q      A+     L   CAG     KD+C GDSGGP+ +  +      GVVS+G  
Sbjct: 196 TCAQYKGCANASDGATGLTGFCAG--RPPKDACQGDSGGPIFHKGEEGRVQRGVVSWGDG 253

Query: 330 KCAEVGFPGVYTRVTNYIQWIAD 352
            C     PGVYT V+NY  WIA 
Sbjct: 254 GCGGTLIPGVYTNVSNYQDWIAA 276


>gnl|CDD|221387 pfam12032, CLIP, Regulatory CLIP domain of proteinases.  CLIP is
          a regulatory domain which controls the proteinase
          action of various proteins of the trypsin family, e.g.
          easter and pap2. The CLIP domain remains linked to the
          protease domain after cleavage of a conserved residue
          which retains the protein in zymogen form. It is named
          CLIP because it can be drawn in the shape of a paper
          clip. It has many disulphide bonds and highly conserved
          cysteine residues, and so it folds extensively.
          Length = 54

 Score = 65.9 bits (161), Expect = 4e-14
 Identities = 24/54 (44%), Positives = 30/54 (55%), Gaps = 1/54 (1%)

Query: 10 CSTPRNEIGTCISIRQCRYLYDILNNERNNPRAVQYVKNSHCGLEGRL-PKVCC 62
          C+TP  E GTC+SIR+C  L  IL      P    +++ S CG EG   P VCC
Sbjct: 1  CTTPNGEPGTCVSIRECPSLLKILKKPNLTPEDTNFLRKSQCGFEGSNDPLVCC 54


>gnl|CDD|197829 smart00680, CLIP, Clip or disulphide knot domain.  Present in
          horseshoe crab proclotting enzyme N-terminal domain,
          Drosophila Easter and silkworm
          prophenoloxidase-activating enzyme.
          Length = 52

 Score = 57.1 bits (138), Expect = 4e-11
 Identities = 21/54 (38%), Positives = 28/54 (51%), Gaps = 2/54 (3%)

Query: 10 CSTPRNEIGTCISIRQCRYLYDILNNERNNPRAVQYVKNSHCGLEGRLPKVCCP 63
          C TP  E GTC+ I  C  L  +L    + P  + +++ S CG   R P VCCP
Sbjct: 1  CRTPDGERGTCVPISDCPSLLSLLKK--DPPEDLNFLRKSQCGFGNREPLVCCP 52


>gnl|CDD|226119 COG3591, COG3591, V8-like Glu-specific endopeptidase [Amino acid
           transport and metabolism].
          Length = 251

 Score = 33.9 bits (78), Expect = 0.094
 Identities = 45/244 (18%), Positives = 70/244 (28%), Gaps = 46/244 (18%)

Query: 118 QWPWMAAIGFKIRRRRANGKPEWMCGGALITKRYVLTAAHCVSPDTTGEYEPYIVHVGSI 177
           Q+P+ +A+      +             LI    VLTA HC+          Y    G  
Sbjct: 48  QFPY-SAVV-----QFEAATGRLCTAATLIGPNTVLTAGHCI----------YSPDYGED 91

Query: 178 DLEDTSSGVSIEIERPIIHEQYTSARKLNDIALFRLREDAPLSDLIQPICLPYDTNLRSE 237
           D+     GV+ +   P         R          +ED    D+ +             
Sbjct: 92  DIAAAPPGVNSD-GGPFYGITKIEIRVYPG---ELYKEDGASYDVGEAA----------- 136

Query: 238 LFERKTPFVAGWGSTVFRGPLSPKLRHVQISVV-----DNPKCRQIFSNYGATINENILC 292
             E                  + + +      V     D P    ++ + G     N + 
Sbjct: 137 -LESGINIGDVVNYLKR--NTASEAKANDRITVIGYPGDKPNIGTMWESTGKV---NSIK 190

Query: 293 AGVLSGGKDSCGGDSGGPLMYPLDTKYYIIGVVSYGKKCAEVGFPGVYTRVTNYI-QWIA 351
              L    D+  G SG P+   L +K  +IGV   G             R+T  I  +I 
Sbjct: 191 GNKLFYDADTLPGSSGSPV---LISKDEVIGVHYNGPGANGGSLANNAVRLTPEILNFIQ 247

Query: 352 DNIS 355
            NI 
Sbjct: 248 QNIK 251


>gnl|CDD|131967 TIGR02921, PEP_integral, PEP-CTERM family integral membrane
           protein.  Members of this protein family, found in three
           different species so far, have a PEP-CTERM sequence at
           the carboxyl-terminus (see model TIGR02595), but are
           unusual among PEP-CTERM proteins in having multiple
           predicted transmembrane segments. The function is
           unknown. It is proposed that a member of the EpsH
           family, to be designated exosortase (see TIGR02602),
           recognizes and cleaves PEP-CTERM proteins in a manner
           analogous to the cleavage of LPXTG proteins by sortase
           (see Haft, et al., 2006).
          Length = 952

 Score = 29.2 bits (65), Expect = 3.7
 Identities = 16/44 (36%), Positives = 23/44 (52%), Gaps = 1/44 (2%)

Query: 165 GEYEPYIVHVGSID-LEDTSSGVSIEIERPIIHEQYTSARKLND 207
           GE      H G ID L+D+  GV+++I   I   Q+   R L+D
Sbjct: 762 GETLAGAYHDGIIDLLKDSGGGVALDIAEAIAAHQFAQKRALDD 805


>gnl|CDD|213027 cd10970, CE4_DAC_u1_6s, Putative catalytic NodB homology domain of
           uncharacterized prokaryotic polysaccharide deacetylases
           which consist of a 6-stranded beta/alpha barrel.  This
           family contains uncharacterized prokaryotic
           polysaccharide deacetylases. Although their biological
           functions remain unknown, all members of the family
           contain a conserved domain with a 6-stranded beta/alpha
           barrel, which is similar to the catalytic NodB homology
           domain of rhizobial NodB-like proteins, belonging to the
           larger carbohydrate esterase 4 (CE4) superfamily.
          Length = 194

 Score = 28.4 bits (64), Expect = 3.8
 Identities = 10/56 (17%), Positives = 24/56 (42%)

Query: 145 ALITKRYVLTAAHCVSPDTTGEYEPYIVHVGSIDLEDTSSGVSIEIERPIIHEQYT 200
            L+ + Y L  +    P+     +PY +   + + + T+  V   ++R   + Q+ 
Sbjct: 113 ELVREYYDLGRSGGGGPNGRPPLDPYRLRRVTGEADTTTEEVKTLLDRAAAYNQWL 168


>gnl|CDD|224400 COG1483, COG1483, Predicted ATPase (AAA+ superfamily) [General
           function prediction only].
          Length = 774

 Score = 29.3 bits (66), Expect = 4.0
 Identities = 12/59 (20%), Positives = 19/59 (32%), Gaps = 5/59 (8%)

Query: 201 SARKLNDIALFRL--REDAPLSDLIQPICLP-YDTNLRSELFERKTPFVAGWGSTVFRG 256
              K+  I +  L  RE      +I P  +P  D   +  LF +   F+          
Sbjct: 347 DLIKITKIVVRELFDREAKEA--IIMPHHIPLSDDYSKESLFLKNITFIDYVSELDIDL 403


>gnl|CDD|233632 TIGR01910, DapE-ArgE, acetylornithine deacetylase or
           succinyl-diaminopimelate desuccinylase.  This group of
           sequences contains annotations for both acetylornithine
           deacetylase and succinyl-diaminopimelate desuccinylase,
           but does not contain any members with experimental
           characterization. Bacillus, Staphylococcus and
           Sulfolobus species contain multiple hits to this
           subfamily and each may have a separate activity.
           Determining which is which must await further laboratory
           research [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 375

 Score = 28.9 bits (65), Expect = 4.4
 Identities = 14/56 (25%), Positives = 20/56 (35%), Gaps = 4/56 (7%)

Query: 142 CGGA----LITKRYVLTAAHCVSPDTTGEYEPYIVHVGSIDLEDTSSGVSIEIERP 193
            G A    L+ + Y   A   + P+ +G     I H GSI  +    G       P
Sbjct: 141 SGEAGTLYLLQRGYFKDADGVLIPEPSGGDNIVIGHKGSIWFKLRVKGKQAHASFP 196


>gnl|CDD|239956 cd04583, CBS_pair_ABC_OpuCA_assoc2, This cd contains two tandem
           repeats of the cystathionine beta-synthase (CBS pair)
           domains in association with the ABC transporter OpuCA.
           OpuCA is the ATP binding component of a bacterial solute
           transporter that serves a protective role to cells
           growing in a hyperosmolar environment but the function
           of the CBS domains in OpuCA remains unknown.  In the
           related ABC transporter, OpuA, the tandem CBS domains
           have been shown to function as sensors for ionic
           strength, whereby they control the transport activity
           through an electronic switching mechanism. ABC
           transporters are a large family of proteins involved in
           the transport of a wide variety of different compounds,
           like sugars, ions, peptides, and more complex organic
           molecules. They are a subset of nucleotide hydrolases
           that contain a signature motif, Q-loop, and
           H-loop/switch region, in addition to the Walker A
           motif/P-loop and Walker B motif commonly found in a
           number of ATP- and GTP-binding and hydrolyzing proteins.
           CBS is a small domain originally identified in
           cystathionine beta-synthase and subsequently found in a
           wide range of different proteins. CBS domains usually
           come in tandem repeats, which associate to form a
           so-called Bateman domain or a CBS pair which is
           reflected in this model. The interface between the two
           CBS domains forms a cleft that is a potential ligand
           binding site. The CBS pair coexists with a variety of
           other functional domains.  It has been proposed that the
           CBS domain may play a regulatory role, although its
           exact function is unknown.
          Length = 109

 Score = 27.5 bits (62), Expect = 4.7
 Identities = 14/44 (31%), Positives = 23/44 (52%), Gaps = 7/44 (15%)

Query: 186 VSIEIERPIIHEQYTSARKLNDIAL---FRLREDAPLSDLIQPI 226
           VS+E     + + Y  A+ L DI L   F ++ DA L D++  +
Sbjct: 41  VSLES----LEQAYKEAKSLEDIMLEDVFTVQPDASLRDVLGLV 80


>gnl|CDD|233699 TIGR02050, gshA_cyan_rel, carboxylate-amine ligase, YbdK family.
           This family represents a division of a larger family,
           the other branch of which is predicted to act as
           glutamate--cysteine ligase (the first of two enzymes in
           glutathione biosynthesis) in the cyanobacteria. Species
           containing this protein, however, are generally not
           believe to make glutathione, and the function is
           unknown.
          Length = 287

 Score = 28.5 bits (64), Expect = 4.7
 Identities = 9/43 (20%), Positives = 18/43 (41%)

Query: 250 GSTVFRGPLSPKLRHVQISVVDNPKCRQIFSNYGATINENILC 292
           G  +      P  R  +  V DNP+ +++   YG    + ++ 
Sbjct: 79  GLRICGAGTHPFARWRRQEVADNPRYQRLLERYGYVARQQLVF 121


>gnl|CDD|137339 PRK09499, sifB, secreted effector protein SifB; Provisional.
          Length = 316

 Score = 28.8 bits (64), Expect = 4.7
 Identities = 14/37 (37%), Positives = 19/37 (51%), Gaps = 6/37 (16%)

Query: 4   YYDDS------LCSTPRNEIGTCISIRQCRYLYDILN 34
           +Y D+      LC     +I  CI IRQ  Y Y+I+N
Sbjct: 87  HYKDAECQYTNLCIKDGEDIPLCIMIRQDHYYYEIMN 123


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.137    0.436 

Gapped
Lambda     K      H
   0.267   0.0636    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,264,054
Number of extensions: 1731130
Number of successful extensions: 1157
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1135
Number of HSP's successfully gapped: 20
Length of query: 355
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 257
Effective length of database: 6,590,910
Effective search space: 1693863870
Effective search space used: 1693863870
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (27.1 bits)