RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4294
         (238 letters)



>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
           synthesized as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. Alignment contains also inactive enzymes
           that have substitutions of the catalytic triad residues.
          Length = 232

 Score =  111 bits (279), Expect = 5e-30
 Identities = 68/248 (27%), Positives = 93/248 (37%), Gaps = 75/248 (30%)

Query: 21  IVGGRDVNPGEVPYIVSLSLY-GNLYCGGSLISLQWFLSARHCFVTENLVWNQFNPLIIA 79
           IVGG +   G  P+ VSL    G  +CGGSLIS +W L+A HC    +   + +   +  
Sbjct: 1   IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCV--YSSAPSNYT--VRL 56

Query: 80  GSIYRNYKEQKRQPQLNEIALIYWHS-------DADLAMVKLKEPFRQTTFVKPLDYYTA 132
           GS   +  E     Q+ ++  +  H        D D+A++KLK P   +  V+P+   ++
Sbjct: 57  GSHDLSSNEGG--GQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSS 114

Query: 133 RETNYINDVLSKTDRSEMSIVSGFGVTFQRDKDG---------IVS-------WGIG--- 173
                               VSG+G T +              IVS       +  G   
Sbjct: 115 GYNLPAGT---------TCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAYSYGGTI 165

Query: 174 -----CALGYP--------------------------GIVSWGIGCAL-GYPGVYVRVDH 201
                CA G                            GIVSWG GCA   YPGVY RV  
Sbjct: 166 TDNMLCA-GGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSS 224

Query: 202 YDPWIQSV 209
           Y  WIQ  
Sbjct: 225 YLDWIQKT 232


>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease.  Many of these
           are synthesised as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. A few, however, are active as single chain
           molecules, and others are inactive due to substitutions
           of the catalytic triad residues.
          Length = 229

 Score =  109 bits (275), Expect = 2e-29
 Identities = 71/246 (28%), Positives = 95/246 (38%), Gaps = 76/246 (30%)

Query: 20  RIVGGRDVNPGEVPYIVSLSLYGN-LYCGGSLISLQWFLSARHCFVTENLVWNQFNPLII 78
           RIVGG + N G  P+ VSL   G   +CGGSLIS +W L+A HC        +     + 
Sbjct: 1   RIVGGSEANIGSFPWQVSLQYGGGRHFCGGSLISPRWVLTAAHCV--RGSDPSNIR--VR 56

Query: 79  AGSIYRNYKEQKRQPQLNEIALIYWH-------SDADLAMVKLKEPFRQTTFVKPLDYYT 131
            GS   +  E+    Q+ +++ +  H        D D+A++KLKEP   +  V+P+   +
Sbjct: 57  LGSHDLSSGEEG---QVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPS 113

Query: 132 ARETNYINDVLSKTDRSEMSIVSGFGVTFQRDKDG----------IVS--------WGIG 173
           +                    VSG+G T +               IVS         G G
Sbjct: 114 SNYNVPAGT---------TCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAYSGGG 164

Query: 174 -------CALGYP-------------------------GIVSWGIGCAL-GYPGVYVRVD 200
                  CA G                           GIVSWG GCA  G PGVY RV 
Sbjct: 165 AITDNMLCA-GGLEGGKDACQGDSGGPLVCNDGRWVLVGIVSWGSGCARPGKPGVYTRVS 223

Query: 201 HYDPWI 206
            Y  WI
Sbjct: 224 SYLDWI 229


>gnl|CDD|215708 pfam00089, Trypsin, Trypsin. 
          Length = 218

 Score = 91.7 bits (228), Expect = 1e-22
 Identities = 66/231 (28%), Positives = 92/231 (39%), Gaps = 58/231 (25%)

Query: 21  IVGGRDVNPGEVPYIVSLSLYGNLY-CGGSLISLQWFLSARHCFVTENLVWNQFNPLIIA 79
           IVGG +  PG  P+ VSL +    + CGGSLIS  W L+A HC        N  +  ++ 
Sbjct: 1   IVGGDEAQPGSFPWQVSLQVSSGKHFCGGSLISENWVLTAAHCVS------NAKSVRVVL 54

Query: 80  GSIYRNYKEQKRQPQLNEIALIYWHSD------ADLAMVKLKEPFRQTTFVKPLDY---- 129
           G+     +E     Q  ++  +  H +       D+A++KLK P      V+P+      
Sbjct: 55  GAHNIVLREGG--EQKFDVKKVIVHPNYNPDTDNDIALLKLKSPVTLGDTVRPICLPTAS 112

Query: 130 -------------YTARETNYINDVLSK-----TDRSEMSIVSGFGVTFQRDKDGIVSWG 171
                        +   +T  + D L +       R       G  VT     D ++  G
Sbjct: 113 SDLPVGTTCTVSGWGNTKTLGLPDTLQEVTVPVVSRETCRSAYGGTVT-----DNMICAG 167

Query: 172 IG--------------CALGY-PGIVSWGIGCALG-YPGVYVRVDHYDPWI 206
            G              C+ G   GIVSWG GCA G YPGVY  V  Y  WI
Sbjct: 168 AGGKDACQGDSGGPLVCSDGELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 218


>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
          [Posttranslational modification, protein turnover,
          chaperones].
          Length = 413

 Score = 47.6 bits (113), Expect = 2e-06
 Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 5/49 (10%)

Query: 20 RIVGGRDVNPGEVPYIVSLSLYGNL-----YCGGSLISLQWFLSARHCF 63
          RI+GG + N GE P +V+L    +      +CGGS +  ++ L+A HC 
Sbjct: 32 RIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCA 80



 Score = 35.6 bits (82), Expect = 0.015
 Identities = 19/55 (34%), Positives = 24/55 (43%), Gaps = 2/55 (3%)

Query: 159 TFQRDKDGIVSWGIGCALGYPGIVSWGIG-CA-LGYPGVYVRVDHYDPWIQSVKN 211
             Q D  G +           G+VSWG G C     PGVY  V +Y  WI ++ N
Sbjct: 225 ACQGDSGGPIFHKGEEGRVQRGVVSWGDGGCGGTLIPGVYTNVSNYQDWIAAMTN 279


>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986).  This
           domain is found in serine proteases and is predicted to
           contain disulphide bonds.
          Length = 267

 Score = 33.5 bits (76), Expect = 0.076
 Identities = 20/94 (21%), Positives = 39/94 (41%), Gaps = 2/94 (2%)

Query: 33  PYIVSLSLYGNLYCGGSLISLQWFLSARHCFVTENLVWNQFNPLIIAGSIYRNYKEQKRQ 92
           P+I  + + GN  C G LI L W L +  C    +L  +  + ++      ++ K     
Sbjct: 17  PWIAKVYVEGNYRCTGVLIDLSWVLVSHSCLWDTSLEHSYISVVLGGHKTLKSVKGPYE- 75

Query: 93  PQLNEIALIYWHSDADLAMVKLKEPFRQTTFVKP 126
            Q+  +        + ++++ LK P   +  V P
Sbjct: 76  -QIYRVDCRKDLPRSKISLLHLKSPATFSNHVLP 108


>gnl|CDD|172156 PRK13587, PRK13587,
           1-(5-phosphoribosyl)-5-[(5-
           phosphoribosylamino)methylideneamino]
           imidazole-4-carboxamide isomerase; Provisional.
          Length = 234

 Score = 31.7 bits (72), Expect = 0.23
 Identities = 29/90 (32%), Positives = 41/90 (45%), Gaps = 16/90 (17%)

Query: 131 TARETNYINDVLSKTDRSEMSIVSGFGVTFQRDKDGIVSW---GIG-CALGYPGI--VSW 184
            ARE +YI  +   T +    I  G G+   R K  I+ +   GI  C +G  GI    W
Sbjct: 61  HAREFDYIKSLRRLTTK---DIEVGGGI---RTKSQIMDYFAAGINYCIVGTKGIQDTDW 114

Query: 185 GIGCALGYPG-VYVRVDHYDPWIQSVKNNG 213
               A  +PG +Y+ VD Y    + +K NG
Sbjct: 115 LKEMAHTFPGRIYLSVDAY---GEDIKVNG 141


>gnl|CDD|220578 pfam10113, Fibrillarin_2, Fibrillarin-like archaeal protein.
           Members of this family of proteins include archaeal
           fibrillarin homologs.
          Length = 505

 Score = 29.4 bits (66), Expect = 2.0
 Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 4/67 (5%)

Query: 160 FQRDKDGIVSWGIGCALGYPG--IVSWGIGCALGYPGVYVRVDHYDPWIQS--VKNNGDN 215
             +  D + S  I   +   G   +SWGI   L      V +   DPW++   VK   + 
Sbjct: 400 VVKTVDDLESEVIADTVALLGGRFISWGIAKKLDGGVEEVYISDVDPWVEHATVKILNEE 459

Query: 216 AGVLISA 222
            G+ + A
Sbjct: 460 LGINVHA 466


>gnl|CDD|147832 pfam05894, Podovirus_Gp16, Podovirus DNA encapsidation protein
           (Gp16).  This family consists of several DNA
           encapsidation protein (Gp16) sequences from the
           phi-29-like viruses. Gene product 16 catalyzes the in
           vivo and in vitro genome-encapsidation reaction.
          Length = 333

 Score = 28.6 bits (64), Expect = 2.5
 Identities = 16/86 (18%), Positives = 32/86 (37%), Gaps = 5/86 (5%)

Query: 101 IYWHSDADLAMVKLKEPFRQTTFVKPLDYYTARETNYINDVLSKTDRSEMSIVSGFG--- 157
           + +H +  +      +     TFV   DYY+         ++  T   +MS+ + F    
Sbjct: 169 LLFHIEPSINEPFQVDDEAVITFVNSEDYYSEVRHTRFGSLIKGTAYYDMSLDNKFSSNS 228

Query: 158 --VTFQRDKDGIVSWGIGCALGYPGI 181
               ++R K+    + I     Y G+
Sbjct: 229 DLFVYKRSKNSKFRFSIRYNDNYYGV 254


>gnl|CDD|239434 cd03318, MLE, Muconate Lactonizing Enzyme (MLE), an homooctameric
           enzyme, catalyses the conversion of cis,cis-muconate
           (CCM) to muconolactone (ML) in the catechol branch of
           the beta-ketoadipate pathway. This pathway is used in
           soil microbes to breakdown lignin-derived aromatics,
           catechol and protocatechuate, to citric acid cycle
           intermediates. Some bacterial species are also capable
           of dehalogenating chloroaromatic compounds by the action
           of chloromuconate lactonizing enzymes (Cl-MLEs). MLEs
           are members of the enolase superfamily characterized by
           the presence of an enolate anion intermediate which is
           generated by abstraction of the alpha-proton of the
           carboxylate substrate by an active site residue and that
           is stabilized by coordination to the essential Mg2+ ion.
          Length = 365

 Score = 27.7 bits (62), Expect = 5.9
 Identities = 10/41 (24%), Positives = 14/41 (34%), Gaps = 6/41 (14%)

Query: 162 RDKDGIVSWGIGCALGYPGIVSWGIGCALGYPGVYVRVDHY 202
              DG+V  G       PG  +WG         +   +D Y
Sbjct: 36  TTSDGVVGIGEA---TTPGGPAWGGESP---ETIKAIIDRY 70


>gnl|CDD|144261 pfam00603, Flu_PA, Influenza RNA-dependent RNA polymerase subunit
           PA. 
          Length = 669

 Score = 27.9 bits (62), Expect = 6.0
 Identities = 17/52 (32%), Positives = 21/52 (40%), Gaps = 16/52 (30%)

Query: 17  IGGRIVGGRDVNPGEVPYIVSLSLYGNLYC---GGSLISLQWFLSARHCFVT 65
           IG   V GR+V             Y  LYC   G S I ++W   AR C + 
Sbjct: 503 IGHTTVEGREV-----------PFY--LYCRTTGFSKIKMKWGSEARRCLIQ 541


>gnl|CDD|146488 pfam03881, Fructosamin_kin, Fructosamine kinase.  This family
           includes eukaryotic fructosamine-3-kinase enzymes. The
           family also includes bacterial members that have not
           been characterized but probably have a similar or
           identical function.
          Length = 287

 Score = 27.6 bits (62), Expect = 6.4
 Identities = 11/33 (33%), Positives = 15/33 (45%), Gaps = 6/33 (18%)

Query: 102 YW-HSDADLAMVKLKEPFRQTTF-----VKPLD 128
           Y+   + DLAM +L   F    +     V PLD
Sbjct: 215 YYGDRECDLAMTELFGGFPPDFYDGYQSVYPLD 247


>gnl|CDD|217412 pfam03186, CobD_Cbib, CobD/Cbib protein.  This family includes CobD
           proteins from a number of bacteria, in Salmonella this
           protein is called Cbib. Salmonella CobD is a different
           protein. This protein is involved in cobalamin
           biosynthesis and is probably an enzyme responsible for
           the conversion of adenosylcobyric acid to
           adenosylcobinamide or adenosylcobinamide phosphate.
          Length = 295

 Score = 27.1 bits (61), Expect = 9.0
 Identities = 15/58 (25%), Positives = 23/58 (39%), Gaps = 8/58 (13%)

Query: 177 GYP-GIVSWGIGCALGYPGVYVRVDHYDPWIQSVKNNGDN---AGVLISALHMTYRAV 230
           G+P   ++  +G  LG P VY       P +     +G     A  +  AL +  RA 
Sbjct: 242 GWPEAAMAGALGVRLGGPNVYGGEVVEKPTL----GDGARPPTAEDIRRALRLMRRAS 295


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.140    0.439 

Gapped
Lambda     K      H
   0.267   0.0738    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,261,936
Number of extensions: 1159901
Number of successful extensions: 971
Number of sequences better than 10.0: 1
Number of HSP's gapped: 959
Number of HSP's successfully gapped: 30
Length of query: 238
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 144
Effective length of database: 6,768,326
Effective search space: 974638944
Effective search space used: 974638944
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 57 (25.7 bits)