RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15989
         (363 letters)



>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
           synthesized as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. Alignment contains also inactive enzymes
           that have substitutions of the catalytic triad residues.
          Length = 232

 Score =  125 bits (315), Expect = 5e-34
 Identities = 70/242 (28%), Positives = 95/242 (39%), Gaps = 72/242 (29%)

Query: 23  GQKTARGQWPWHVALYRTEGINLSYVCGGSLVSVNYVITAAHCVTKKPYDKPVDSDTLVI 82
           G +   G +PW V+L  T G    + CGGSL+S  +V+TAAHCV               +
Sbjct: 4   GSEAKIGSFPWQVSLQYTGG---RHFCGGSLISPRWVLTAAHCVYSSA------PSNYTV 54

Query: 83  YLGKYHQHQFSDEGGVQNKQVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCL- 141
            LG +     S+EGG Q  +VK+V ++P +N S Y  DIALL+L   V  S  VRP+CL 
Sbjct: 55  RLGSHDL--SSNEGGGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLP 112

Query: 142 ----------------W----DDSTAPLQLSAVE-------------------------- 155
                           W    +    P  L  V                           
Sbjct: 113 SSGYNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAYSYGGTITDNMLCA 172

Query: 156 -----GTSVCNGDSGGGMVFKIDSAWYLRGIVS--ITVARDGLRVCDTKHYVVFTDVANV 208
                G   C GDSGG +V   +    L GIVS     AR         +  V+T V++ 
Sbjct: 173 GGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPN-------YPGVYTRVSSY 225

Query: 209 CN 210
            +
Sbjct: 226 LD 227



 Score = 82.3 bits (204), Expect = 3e-18
 Identities = 37/110 (33%), Positives = 51/110 (46%), Gaps = 8/110 (7%)

Query: 253 DVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLWDDSTAPLQLSAVEGRDGTV 312
            VK+V ++P +N S Y  DIALL+L   V  S  VRP+CL              G   TV
Sbjct: 72  KVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSGYNL-----PAGTTCTV 126

Query: 313 IGWGY-DENDRVSEELKMAIMPIVSHQQCLWSNPQFFSQFTSDETFCAGF 361
            GWG   E   + + L+   +PIVS+ +C       +    +D   CAG 
Sbjct: 127 SGWGRTSEGGPLPDVLQEVNVPIVSNAEC--KRAYSYGGTITDNMLCAGG 174


>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease.  Many of these
           are synthesised as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. A few, however, are active as single chain
           molecules, and others are inactive due to substitutions
           of the catalytic triad residues.
          Length = 229

 Score =  122 bits (308), Expect = 5e-33
 Identities = 73/248 (29%), Positives = 99/248 (39%), Gaps = 46/248 (18%)

Query: 23  GQKTARGQWPWHVALYRTEGINLSYVCGGSLVSVNYVITAAHCVTKKPYDKPVDSDTLVI 82
           G +   G +PW V+L    G    + CGGSL+S  +V+TAAHCV             + +
Sbjct: 5   GSEANIGSFPWQVSLQYGGG---RHFCGGSLISPRWVLTAAHCVRGSDPSN------IRV 55

Query: 83  YLGKYHQHQFSDEGGVQNKQVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLW 142
            LG    H  S     Q  +V +V I+P +N S Y  DIALL+L   V  S  VRP+CL 
Sbjct: 56  RLGS---HDLSSGEEGQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICL- 111

Query: 143 DDSTAPLQLSAVEGTSVCN----GDSGGGMVFKIDSAWYLRGIVSITVARDGLRVCDTKH 198
                P     V   + C     G +  G     D+   L+  V++ +  +    C   +
Sbjct: 112 -----PSSNYNVPAGTTCTVSGWGRTSEGAGSLPDT---LQE-VNVPIVSN--ATCRRAY 160

Query: 199 YVVFTDVAN------------VCNGDSGGGMVFKIDSAWYLRGIVSITVARDGLRVCDTK 246
                   N             C GDSGG +V   D  W L GIVS      G       
Sbjct: 161 SGGGAITDNMLCAGGLEGGKDACQGDSGGPLVCN-DGRWVLVGIVSW-----GSGCARPG 214

Query: 247 HYVVFTDV 254
              V+T V
Sbjct: 215 KPGVYTRV 222



 Score = 80.0 bits (198), Expect = 2e-17
 Identities = 36/111 (32%), Positives = 49/111 (44%), Gaps = 9/111 (8%)

Query: 253 DVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLWDDSTAPLQLSAVEGRDGTV 312
            V +V I+P +N S Y  DIALL+L   V  S  VRP+CL   +          G   TV
Sbjct: 72  KVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPSSNYNV-----PAGTTCTV 126

Query: 313 IGWGY--DENDRVSEELKMAIMPIVSHQQCLWSNPQFFSQFTSDETFCAGF 361
            GWG   +    + + L+   +PIVS+  C            +D   CAG 
Sbjct: 127 SGWGRTSEGAGSLPDTLQEVNVPIVSNATC--RRAYSGGGAITDNMLCAGG 175


>gnl|CDD|215708 pfam00089, Trypsin, Trypsin. 
          Length = 218

 Score =  109 bits (275), Expect = 2e-28
 Identities = 75/244 (30%), Positives = 111/244 (45%), Gaps = 42/244 (17%)

Query: 23  GQKTARGQWPWHVALYRTEGINLSYVCGGSLVSVNYVITAAHCVTKKPYDKPVDSDTLVI 82
           G +   G +PW V+L  + G    + CGGSL+S N+V+TAAHCV+        ++ ++ +
Sbjct: 4   GDEAQPGSFPWQVSLQVSSG---KHFCGGSLISENWVLTAAHCVS--------NAKSVRV 52

Query: 83  YLGKYHQHQFSDEGGVQNKQVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLW 142
            LG ++      EGG Q   VK+V ++P +N      DIALL+L S V     VRP+CL 
Sbjct: 53  VLGAHNIVL--REGGEQKFDVKKVIVHPNYNPDTD-NDIALLKLKSPVTLGDTVRPICLP 109

Query: 143 DDSTAPLQLSAVEGTSVCNGDSGGGMVFKIDSAWYLRGIVSITVARDGLRVCDTKHYVVF 202
             S+         GT+     SG G    +     L+ +    V+R+    C + +    
Sbjct: 110 TASSDL-----PVGTTCTV--SGWGNTKTLGLPDTLQEVTVPVVSRE---TCRSAYGGTV 159

Query: 203 TD--------VANVCNGDSGGGMVFKIDSAWYLRGIVSITVARDGLRVCDTKHYV-VFTD 253
           TD          + C GDSGG +V    S   L GIVS          C + +Y  V+T 
Sbjct: 160 TDNMICAGAGGKDACQGDSGGPLV---CSDGELIGIVS------WGYGCASGNYPGVYTP 210

Query: 254 VKRV 257
           V   
Sbjct: 211 VSSY 214



 Score = 75.2 bits (185), Expect = 9e-16
 Identities = 35/109 (32%), Positives = 51/109 (46%), Gaps = 10/109 (9%)

Query: 253 DVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLWDDSTAPLQLSAVEGRDGTV 312
           DVK+V ++P +N      DIALL+L S V     VRP+CL   S+         G   TV
Sbjct: 70  DVKKVIVHPNYNPDTD-NDIALLKLKSPVTLGDTVRPICLPTASSDL-----PVGTTCTV 123

Query: 313 IGWGYDENDRVSEELKMAIMPIVSHQQCLWSNPQFFSQFTSDETFCAGF 361
            GWG  +   + + L+   +P+VS + C       +    +D   CAG 
Sbjct: 124 SGWGNTKTLGLPDTLQEVTVPVVSRETC----RSAYGGTVTDNMICAGA 168


>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 413

 Score = 51.8 bits (124), Expect = 2e-07
 Identities = 62/293 (21%), Positives = 96/293 (32%), Gaps = 73/293 (24%)

Query: 29  GQWPWHVALYRTEGINLSYV-CGGSLVSVNYVITAAHCVTKKPYDKPVDSD-TLVIYLGK 86
           G++P  VAL       +S   CGGS +   YV+TAAHC        P+ SD   V+    
Sbjct: 42  GEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADAS---SPISSDVNRVVV--- 95

Query: 87  YHQHQFSDEGGVQNKQVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLWDDST 146
                 +D    +   V+ ++++  ++  N   DIA+L+L+                 ++
Sbjct: 96  ----DLNDSSQAERGHVRTIYVHEFYSPGNLGNDIAVLELA---------------RAAS 136

Query: 147 APLQLSAVEGTSVCNGDSGGGMVFKIDSAWYLRGIVS-------------ITVARDGLRV 193
            P         S    +S   +    +  + +                  + V    L  
Sbjct: 137 LPRVKITSFDASDTFLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTILHEVAVLFVPLST 196

Query: 194 CDTKH--------YVVFTDVA------NVCNGDSGGGMVFKIDSAWYLRGIVSITVARDG 239
           C                T         + C GDSGG +  K +     RG+VS      G
Sbjct: 197 CAQYKGCANASDGATGLTGFCAGRPPKDACQGDSGGPIFHKGEEGRVQRGVVSW-----G 251

Query: 240 LRVC-DTKHYVVFTDVKRVHIYPTFNSSNYLGDIA--LLQLSSDVDYSMYVRP 289
              C  T    V+T+V           SNY   IA     LS      +  RP
Sbjct: 252 DGGCGGTLIPGVYTNV-----------SNYQDWIAAMTNGLSYLQFRPLGYRP 293


>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986).  This
           domain is found in serine proteases and is predicted to
           contain disulphide bonds.
          Length = 267

 Score = 34.6 bits (79), Expect = 0.058
 Identities = 28/115 (24%), Positives = 51/115 (44%), Gaps = 15/115 (13%)

Query: 31  WPWHVALYRTEGINLSYVCGGSLVSVNYVITAAHCVTKKPYDKPVDSDTLVIYLGKYHQH 90
           WPW   +Y    +  +Y C G L+ +++V+ +  C+    +D  ++   + + LG  H+ 
Sbjct: 16  WPWIAKVY----VEGNYRCTGVLIDLSWVLVSHSCL----WDTSLEHSYISVVLGG-HKT 66

Query: 91  QFSDEGGVQNKQVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLWDDS 145
             S +G    +Q+ RV        S     I+LL L S   +S +V P  +    
Sbjct: 67  LKSVKG--PYEQIYRVDCRKDLPRSK----ISLLHLKSPATFSNHVLPTFVPSTR 115


>gnl|CDD|226119 COG3591, COG3591, V8-like Glu-specific endopeptidase [Amino acid
          transport and metabolism].
          Length = 251

 Score = 32.4 bits (74), Expect = 0.29
 Identities = 12/48 (25%), Positives = 19/48 (39%), Gaps = 5/48 (10%)

Query: 30 QWPWH--VALYRTEGINLSYVCGGSLVSVNYVITAAHCVTKKPYDKPV 75
          Q+P+   V      G         +L+  N V+TA HC+    Y +  
Sbjct: 48 QFPYSAVVQFEAATG---RLCTAATLIGPNTVLTAGHCIYSPDYGEDD 92


>gnl|CDD|220834 pfam10647, Gmad1, Lipoprotein LpqB beta-propeller domain.  The
           Gmad1 domain is found associated with the GerMN family,
           pfam10646, in bacterial spore formation. It is predicted
           to have a beta-propeller fold and to have a passive
           binding role rather than a catalytic function owing to
           the low number of conserved hydrophilic residues.
          Length = 254

 Score = 30.8 bits (70), Expect = 0.82
 Identities = 11/40 (27%), Positives = 18/40 (45%), Gaps = 1/40 (2%)

Query: 204 DVANVCNGDSGGGMVFKID-SAWYLRGIVSITVARDGLRV 242
            VA +    +G  +  ++D  A     I  + V+RDG R 
Sbjct: 86  TVARLLRSATGTTVPVEVDAPALSAGRITELRVSRDGTRA 125



 Score = 29.7 bits (67), Expect = 2.2
 Identities = 9/42 (21%), Positives = 16/42 (38%), Gaps = 1/42 (2%)

Query: 153 AVEGTSVCNGDSGGGMVFKID-SAWYLRGIVSITVARDGLRV 193
                 +    +G  +  ++D  A     I  + V+RDG R 
Sbjct: 84  GDTVARLLRSATGTTVPVEVDAPALSAGRITELRVSRDGTRA 125


>gnl|CDD|224763 COG1850, RbcL, Ribulose 1,5-bisphosphate carboxylase, large subunit
           [Carbohydrate transport and metabolism].
          Length = 429

 Score = 31.2 bits (71), Expect = 0.89
 Identities = 8/28 (28%), Positives = 11/28 (39%), Gaps = 1/28 (3%)

Query: 29  GQWPWHV-ALYRTEGINLSYVCGGSLVS 55
           G  P  +  L R  G +L    GG +  
Sbjct: 354 GIHPGMMPELVRIFGKDLVIQAGGGVHG 381


>gnl|CDD|217716 pfam03761, DUF316, Domain of unknown function (DUF316).  This
          family of proteins with unknown function are from
          Caenorhabditis elegans. The protein has GO references
          indicating the protein is a positive regulator of
          growth rate and is also involved in nematode larval
          development.
          Length = 280

 Score = 30.5 bits (69), Expect = 1.2
 Identities = 13/60 (21%), Positives = 25/60 (41%), Gaps = 5/60 (8%)

Query: 7  SCG--TVVYNKAQPLVTYGQKTARGQWPWHVALYRTEGINLSYVCGGSLVSVNYVITAAH 64
          SCG  T+          Y +K    ++PW V      G   +Y    + +S  +++T++ 
Sbjct: 30 SCGNKTLPLPSQNINGIYLEK---SEYPWLVKAAFQNGNQKNYKPPATFISTRHILTSSR 86


>gnl|CDD|215527 PLN02976, PLN02976, amine oxidase.
          Length = 1713

 Score = 30.6 bits (69), Expect = 1.6
 Identities = 32/110 (29%), Positives = 44/110 (40%), Gaps = 30/110 (27%)

Query: 33  WHVALYRTEGINLSYVCGGSL--VSVNYV--------ITAAHCVTKKPYDKPVDS--DTL 80
           WH A       +L Y C   L  VS+ Y            AHC+ K  Y   V+S  + L
Sbjct: 894 WHFA-------HLEYGCAALLKEVSLPYWNQDDVYGGFGGAHCMIKGGYSNVVESLAEGL 946

Query: 81  VIYLGKYHQHQFSD------EGGVQNKQVKRVHIYPTFNSSNYLGDIALL 124
            I+L     H  +D      + G      K+V +  T N S +LGD  L+
Sbjct: 947 DIHL----NHVVTDVSYGSKDAGASGSSRKKVKV-STSNGSEFLGDAVLI 991


>gnl|CDD|222077 pfam13365, Trypsin_2, Trypsin-like peptidase domain.  This family
           includes trypsin like peptidase domains.
          Length = 138

 Score = 29.1 bits (65), Expect = 1.7
 Identities = 29/181 (16%), Positives = 53/181 (29%), Gaps = 46/181 (25%)

Query: 51  GSLVSVN-YVITAAHCVTKKPYDKPVDSDTLVIYLGKYHQHQFSDEGGVQNKQVKRVHIY 109
           G +V  +  ++T AH V      +  D+  + +            +GG    +V      
Sbjct: 3   GFVVGSDGLILTNAHVV------EDADASEIEV---------VLPDGGRVPAEVVAA--- 44

Query: 110 PTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLWDDSTAPLQLSAVEGTSVCNGDSGGGMV 169
                 +   D+ALL++   +                    L A     +       G  
Sbjct: 45  ----DPDL--DLALLKVDGPLLP---------------AAPLLASSAAPLGGSVVVVGGP 83

Query: 170 FKIDSAWYLRGIVSITVARDGLRVCDTKHYVVFTDVANVCNGDSGGGMVFKIDSAWYLRG 229
             I       G     +    L   D ++ +   D +    G SGG +    D+   + G
Sbjct: 84  GGIGLGASGGGGGVGGLVSGSLGGVDGRYILTDADTS---PGSSGGPV---FDADGEVVG 137

Query: 230 I 230
           I
Sbjct: 138 I 138


>gnl|CDD|218649 pfam05593, RHS_repeat, RHS Repeat.  RHS proteins contain extended
           repeat regions. These repeats often appear to be
           involved in ligand binding. Note that this model may not
           find all the repeats in a protein and that it covers two
           RHS repeats.
          Length = 38

 Score = 26.9 bits (60), Expect = 1.7
 Identities = 8/26 (30%), Positives = 14/26 (53%)

Query: 301 QLSAVEGRDGTVIGWGYDENDRVSEE 326
           +L++V   DG V  + YD   R++  
Sbjct: 6   RLTSVTDADGRVTTYTYDAAGRLTAV 31


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.137    0.434 

Gapped
Lambda     K      H
   0.267   0.0738    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,932,144
Number of extensions: 1658525
Number of successful extensions: 1100
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1083
Number of HSP's successfully gapped: 27
Length of query: 363
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 265
Effective length of database: 6,590,910
Effective search space: 1746591150
Effective search space used: 1746591150
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 60 (26.9 bits)