RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2950
         (406 letters)



>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
           synthesized as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. Alignment contains also inactive enzymes
           that have substitutions of the catalytic triad residues.
          Length = 232

 Score =  191 bits (487), Expect = 6e-59
 Identities = 79/170 (46%), Positives = 103/170 (60%), Gaps = 6/170 (3%)

Query: 220 HEQRRRVERIYT-DFYDKSIYKNDIALLELTRPFKFNEFVSPICLPNPGLTVTA-DVGLI 277
             Q  +V+++     Y+ S Y NDIALL+L RP   ++ V PICLP+ G  + A     +
Sbjct: 67  GGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSGYNLPAGTTCTV 126

Query: 278 SGWGRLSEGGSLPHILQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSCQ 337
           SGWGR SEGG LP +LQ   VP+    EC+R+Y    Y   +    +C G  +GG D+CQ
Sbjct: 127 SGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAY---SYGGTITDNMLCAGGLEGGKDACQ 183

Query: 338 GDSGGPLACPLPDGRYYLCGITSWGVGCARPDFYGVYTLVSCYSDWVKSI 387
           GDSGGPL C   +GR  L GI SWG GCARP++ GVYT VS Y DW++  
Sbjct: 184 GDSGGPLVCND-NGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQKT 232



 Score =  162 bits (411), Expect = 1e-47
 Identities = 68/156 (43%), Positives = 87/156 (55%), Gaps = 18/156 (11%)

Query: 60  ELTRPFKFNEFVSPICLPNPGLTVTADVGLISGWGRLSEGADVGLISGWGRLSEGGSLPH 119
           +L RP   ++ V PICLP+ G  + A                   +SGWGR SEGG LP 
Sbjct: 95  KLKRPVTLSDNVRPICLPSSGYNLPAG--------------TTCTVSGWGRTSEGGPLPD 140

Query: 120 ILQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSCQGDSGGPLACPLPDG 179
           +LQ   VP+    EC+R+Y    Y   +    +C G  +GG D+CQGDSGGPL C   +G
Sbjct: 141 VLQEVNVPIVSNAECKRAY---SYGGTITDNMLCAGGLEGGKDACQGDSGGPLVCND-NG 196

Query: 180 RYYLCGITSWGVGCARPDFYGVYTLVSCYSDWVKSI 215
           R  L GI SWG GCARP++ GVYT VS Y DW++  
Sbjct: 197 RGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQKT 232



 Score = 36.5 bits (85), Expect = 0.014
 Identities = 16/29 (55%), Positives = 20/29 (68%)

Query: 26  RTSEGGSLPHILQAAEVPLTPKEECRRSY 54
           RTSEGG LP +LQ   VP+    EC+R+Y
Sbjct: 131 RTSEGGPLPDVLQEVNVPIVSNAECKRAY 159


>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease.  Many of these
           are synthesised as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. A few, however, are active as single chain
           molecules, and others are inactive due to substitutions
           of the catalytic triad residues.
          Length = 229

 Score =  184 bits (468), Expect = 4e-56
 Identities = 79/168 (47%), Positives = 96/168 (57%), Gaps = 8/168 (4%)

Query: 220 HEQRRRVERIYT-DFYDKSIYKNDIALLELTRPFKFNEFVSPICLPNPGLTVTA-DVGLI 277
             Q  +V ++     Y+ S Y NDIALL+L  P   ++ V PICLP+    V A     +
Sbjct: 67  EGQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPSSNYNVPAGTTCTV 126

Query: 278 SGWGRLSEG-GSLPHILQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSC 336
           SGWGR SEG GSLP  LQ   VP+     CRR+Y        +    +C G  +GG D+C
Sbjct: 127 SGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAY---SGGGAITDNMLCAGGLEGGKDAC 183

Query: 337 QGDSGGPLACPLPDGRYYLCGITSWGVGCARPDFYGVYTLVSCYSDWV 384
           QGDSGGPL C   DGR+ L GI SWG GCARP   GVYT VS Y DW+
Sbjct: 184 QGDSGGPLVCN--DGRWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229



 Score =  155 bits (394), Expect = 3e-45
 Identities = 68/154 (44%), Positives = 81/154 (52%), Gaps = 20/154 (12%)

Query: 60  ELTRPFKFNEFVSPICLPNPGLTVTA-DVGLISGWGRLSEGADVGLISGWGRLSEGGSLP 118
           +L  P   ++ V PICLP+    V A     +SGWGR SEG               GSLP
Sbjct: 95  KLKEPVTLSDNVRPICLPSSNYNVPAGTTCTVSGWGRTSEG--------------AGSLP 140

Query: 119 HILQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSCQGDSGGPLACPLPD 178
             LQ   VP+     CRR+Y        +    +C G  +GG D+CQGDSGGPL C   D
Sbjct: 141 DTLQEVNVPIVSNATCRRAY---SGGGAITDNMLCAGGLEGGKDACQGDSGGPLVCN--D 195

Query: 179 GRYYLCGITSWGVGCARPDFYGVYTLVSCYSDWV 212
           GR+ L GI SWG GCARP   GVYT VS Y DW+
Sbjct: 196 GRWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229



 Score = 33.4 bits (77), Expect = 0.15
 Identities = 13/30 (43%), Positives = 16/30 (53%)

Query: 25  RRTSEGGSLPHILQAAEVPLTPKEECRRSY 54
           R +   GSLP  LQ   VP+     CRR+Y
Sbjct: 131 RTSEGAGSLPDTLQEVNVPIVSNATCRRAY 160


>gnl|CDD|215708 pfam00089, Trypsin, Trypsin. 
          Length = 218

 Score =  136 bits (344), Expect = 5e-38
 Identities = 65/166 (39%), Positives = 82/166 (49%), Gaps = 13/166 (7%)

Query: 220 HEQRRRVERIYTDFYDKSIYKNDIALLELTRPFKFNEFVSPICLPNPGLTVTA-DVGLIS 278
            EQ+  V+++           NDIALL+L  P    + V PICLP     +       +S
Sbjct: 65  GEQKFDVKKVIVHPNYNPDTDNDIALLKLKSPVTLGDTVRPICLPTASSDLPVGTTCTVS 124

Query: 279 GWGRLSEGGSLPHILQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSCQG 338
           GWG     G LP  LQ   VP+  +E CR +Y        +    +C G   GG D+CQG
Sbjct: 125 GWGNTKTLG-LPDTLQEVTVPVVSRETCRSAY-----GGTVTDNMICAGA--GGKDACQG 176

Query: 339 DSGGPLACPLPDGRYYLCGITSWGVGCARPDFYGVYTLVSCYSDWV 384
           DSGGPL C        L GI SWG GCA  ++ GVYT VS Y DW+
Sbjct: 177 DSGGPLVCSDG----ELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 218



 Score =  115 bits (289), Expect = 4e-30
 Identities = 56/152 (36%), Positives = 68/152 (44%), Gaps = 26/152 (17%)

Query: 61  LTRPFKFNEFVSPICLPNPGLTVTADVGLISGWGRLSEGADVGLISGWGRLSEGGSLPHI 120
           L  P    + V PICLP     +                     +SGWG     G LP  
Sbjct: 93  LKSPVTLGDTVRPICLPTASSDLPVGTT--------------CTVSGWGNTKTLG-LPDT 137

Query: 121 LQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSCQGDSGGPLACPLPDGR 180
           LQ   VP+  +E CR +Y        +    +C G   GG D+CQGDSGGPL C      
Sbjct: 138 LQEVTVPVVSRETCRSAY-----GGTVTDNMICAGA--GGKDACQGDSGGPLVCSDG--- 187

Query: 181 YYLCGITSWGVGCARPDFYGVYTLVSCYSDWV 212
             L GI SWG GCA  ++ GVYT VS Y DW+
Sbjct: 188 -ELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 218


>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 413

 Score = 77.2 bits (190), Expect = 2e-15
 Identities = 55/184 (29%), Positives = 69/184 (37%), Gaps = 34/184 (18%)

Query: 222 QRRRVERIYT-DFYDKSIYKNDIALLELTRPFKFNEFVSPICLPNPGLTVTADVGLIS-- 278
           +R  V  IY  +FY      NDIA+LEL R        + +          +D  L S  
Sbjct: 104 ERGHVRTIYVHEFYSPGNLGNDIAVLELARA-------ASLPRVKITSFDASDTFLNSVT 156

Query: 279 --------GWG--------RLSEGGSLPHILQAAEVPLTPKEECRRSYAVA-GYSNYLNQ 321
                    +G        R S  G+   IL    V   P   C +    A         
Sbjct: 157 TVSPMTNGTFGVTTPSDVPRSSPKGT---ILHEVAVLFVPLSTCAQYKGCANASDGATGL 213

Query: 322 CQVCTGTKQGGLDSCQGDSGGPLACPLPDGRYYLCGITSWGVG-CARPDFYGVYTLVSCY 380
              C G      D+CQGDSGGP+     +GR    G+ SWG G C      GVYT VS Y
Sbjct: 214 TGFCAGRP--PKDACQGDSGGPIFHKGEEGRVQR-GVVSWGDGGCGGTLIPGVYTNVSNY 270

Query: 381 SDWV 384
            DW+
Sbjct: 271 QDWI 274



 Score = 71.1 bits (174), Expect = 2e-13
 Identities = 37/105 (35%), Positives = 44/105 (41%), Gaps = 8/105 (7%)

Query: 110 RLSEGGSLPHILQAAEVPLTPKEECRRSYAVA-GYSNYLNQCQVCTGTKQGGLDSCQGDS 168
           R S  G+   IL    V   P   C +    A            C G      D+CQGDS
Sbjct: 176 RSSPKGT---ILHEVAVLFVPLSTCAQYKGCANASDGATGLTGFCAGRP--PKDACQGDS 230

Query: 169 GGPLACPLPDGRYYLCGITSWGVG-CARPDFYGVYTLVSCYSDWV 212
           GGP+     +GR    G+ SWG G C      GVYT VS Y DW+
Sbjct: 231 GGPIFHKGEEGRVQR-GVVSWGDGGCGGTLIPGVYTNVSNYQDWI 274


>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986).  This
           domain is found in serine proteases and is predicted to
           contain disulphide bonds.
          Length = 267

 Score = 32.7 bits (74), Expect = 0.29
 Identities = 16/55 (29%), Positives = 27/55 (49%), Gaps = 8/55 (14%)

Query: 213 KSILYAR--HEQRRRVERIYTDFYDKSIYKNDIALLELTRPFKFNEFVSPICLPN 265
           K++   +  +EQ  RV     D   K + ++ I+LL L  P  F+  V P  +P+
Sbjct: 65  KTLKSVKGPYEQIYRV-----DCR-KDLPRSKISLLHLKSPATFSNHVLPTFVPS 113


>gnl|CDD|139961 PRK13901, ruvA, Holliday junction DNA helicase RuvA; Provisional.
          Length = 196

 Score = 29.8 bits (67), Expect = 1.7
 Identities = 12/35 (34%), Positives = 18/35 (51%)

Query: 234 YDKSIYKNDIALLELTRPFKFNEFVSPICLPNPGL 268
           Y K I K + +++ +  PF+F   VS  CL    L
Sbjct: 6   YGKIIEKKESSIVIMATPFEFELLVSSFCLAELRL 40


>gnl|CDD|217356 pfam03078, ATHILA, ATHILA ORF-1 family.  ATHILA is a group of
           Arabidopsis thaliana retrotransposons belonging to the
           Ty3/gypsy family of the long terminal repeat (LTR) class
           of eukaryotic retrotransposons. The central region of
           ATHILA retrotransposons contains two or three open
           reading frames (ORFs). This family represents the ORF1
           product. The function of ORF1 is unknown.
          Length = 456

 Score = 30.4 bits (68), Expect = 1.8
 Identities = 16/74 (21%), Positives = 23/74 (31%), Gaps = 6/74 (8%)

Query: 106 SGWGRLSEGGSLPHILQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSCQ 165
              G L  GG +  IL A  VPL       R        ++L +C+             +
Sbjct: 272 KARGSLCVGGVVTPILIACGVPLHSPGLLPR----RIDIDHLRRCEFLEFKMVDDFYRYR 327

Query: 166 GD--SGGPLACPLP 177
            +    G     LP
Sbjct: 328 FEHSEDGKSNILLP 341



 Score = 30.4 bits (68), Expect = 1.8
 Identities = 16/74 (21%), Positives = 23/74 (31%), Gaps = 6/74 (8%)

Query: 278 SGWGRLSEGGSLPHILQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSCQ 337
              G L  GG +  IL A  VPL       R        ++L +C+             +
Sbjct: 272 KARGSLCVGGVVTPILIACGVPLHSPGLLPR----RIDIDHLRRCEFLEFKMVDDFYRYR 327

Query: 338 GD--SGGPLACPLP 349
            +    G     LP
Sbjct: 328 FEHSEDGKSNILLP 341


>gnl|CDD|223343 COG0265, DegQ, Trypsin-like serine proteases, typically
           periplasmic, contain C-terminal PDZ domain
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 347

 Score = 28.3 bits (63), Expect = 8.2
 Identities = 15/61 (24%), Positives = 24/61 (39%), Gaps = 3/61 (4%)

Query: 47  KEECRRSYAVAGYELTRPFKFNEFVSPICLPNP-GLTVTADVGLISGWGRLSEGADVGLI 105
           K +      V     +   +  + V  I   NP GL  T   G++S  GR   G+  G +
Sbjct: 125 KIDGAGGLPVIALGDSDKLRVGDVVVAI--GNPFGLGQTVTSGIVSALGRTGVGSAGGYV 182

Query: 106 S 106
           +
Sbjct: 183 N 183


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.139    0.449 

Gapped
Lambda     K      H
   0.267   0.0738    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,922,056
Number of extensions: 2000215
Number of successful extensions: 1327
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1300
Number of HSP's successfully gapped: 20
Length of query: 406
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 307
Effective length of database: 6,546,556
Effective search space: 2009792692
Effective search space used: 2009792692
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (26.9 bits)