RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8978
         (252 letters)



>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
           synthesized as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. Alignment contains also inactive enzymes
           that have substitutions of the catalytic triad residues.
          Length = 232

 Score =  256 bits (657), Expect = 1e-86
 Identities = 98/240 (40%), Positives = 138/240 (57%), Gaps = 16/240 (6%)

Query: 14  GQASEVNDWPWLVALKRQYERDNFCGGVLINERWVLTAAHCIKQKIDNALVLRRTSDLIV 73
           G  +++  +PW V+L+    R +FCGG LI+ RWVLTAAHC        +     S+  V
Sbjct: 4   GSEAKIGSFPWQVSLQYTGGR-HFCGGSLISPRWVLTAAHC--------VYSSAPSNYTV 54

Query: 74  RLGEYDFSKVNETKVTDIPAAAMKVYPRFSEQNYENDIALVQLSKKAQYNSFVRPVCLPQ 133
           RLG +D S         I    + V+P ++   Y+NDIAL++L +    +  VRP+CLP 
Sbjct: 55  RLGSHDLSSNEGGGQV-IKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPS 113

Query: 134 AG-DFYEDQIGIVTGWGTLSYGGPRSDVLMEVPIPVWRLTECRKQFSQN--IFDSNLCAG 190
           +G +        V+GWG  S GGP  DVL EV +P+    EC++ +S    I D+ LCAG
Sbjct: 114 SGYNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAYSYGGTITDNMLCAG 173

Query: 191 GYKGGTDSCQGDSGGPLLLQRPDKQWTIIGVVSWGIGCGK--TPGVYVQVNKYLRWIYNT 248
           G +GG D+CQGDSGGPL+      +  ++G+VSWG GC +   PGVY +V+ YL WI  T
Sbjct: 174 GLEGGKDACQGDSGGPLVCNDNG-RGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQKT 232


>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease.  Many of these
           are synthesised as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. A few, however, are active as single chain
           molecules, and others are inactive due to substitutions
           of the catalytic triad residues.
          Length = 229

 Score =  245 bits (628), Expect = 3e-82
 Identities = 95/238 (39%), Positives = 137/238 (57%), Gaps = 19/238 (7%)

Query: 14  GQASEVNDWPWLVALKRQYERDNFCGGVLINERWVLTAAHCIKQKIDNALVLRRTSDLIV 73
           G  + +  +PW V+L+    R +FCGG LI+ RWVLTAAHC        +     S++ V
Sbjct: 5   GSEANIGSFPWQVSLQYGGGR-HFCGGSLISPRWVLTAAHC--------VRGSDPSNIRV 55

Query: 74  RLGEYDFSKVNETKVTDIPAAAMKVYPRFSEQNYENDIALVQLSKKAQYNSFVRPVCLPQ 133
           RLG +D S  +  +   I  + + ++P ++   Y+NDIAL++L +    +  VRP+CLP 
Sbjct: 56  RLGSHDLS--SGEEGQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPS 113

Query: 134 AGDFYED-QIGIVTGWGTLSYG-GPRSDVLMEVPIPVWRLTECRKQFSQN--IFDSNLCA 189
           +           V+GWG  S G G   D L EV +P+     CR+ +S    I D+ LCA
Sbjct: 114 SNYNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAYSGGGAITDNMLCA 173

Query: 190 GGYKGGTDSCQGDSGGPLLLQRPDKQWTIIGVVSWGIGCGKT--PGVYVQVNKYLRWI 245
           GG +GG D+CQGDSGGPL+    D +W ++G+VSWG GC +   PGVY +V+ YL WI
Sbjct: 174 GGLEGGKDACQGDSGGPLVCN--DGRWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229


>gnl|CDD|215708 pfam00089, Trypsin, Trypsin. 
          Length = 218

 Score =  182 bits (464), Expect = 9e-58
 Identities = 87/235 (37%), Positives = 116/235 (49%), Gaps = 23/235 (9%)

Query: 14  GQASEVNDWPWLVALKRQYERDNFCGGVLINERWVLTAAHCIKQKIDNALVLRRTSDLIV 73
           G  ++   +PW V+L+      +FCGG LI+E WVLTAAHC+               + V
Sbjct: 4   GDEAQPGSFPWQVSLQVS-SGKHFCGGSLISENWVLTAAHCVS----------NAKSVRV 52

Query: 74  RLGEYDFSKVNETKVTDIPAAAMKVYPRFSEQNYENDIALVQLSKKAQYNSFVRPVCLPQ 133
            LG ++     E          + V+P ++     NDIAL++L         VRP+CLP 
Sbjct: 53  VLGAHNIVL-REGGEQKFDVKKVIVHPNYNPDTD-NDIALLKLKSPVTLGDTVRPICLPT 110

Query: 134 AGDFYEDQIGI-VTGWGTLSYGGPRSDVLMEVPIPVWRLTECRKQFSQNIFDSNLCAGGY 192
           A           V+GWG     G   D L EV +PV     CR  +   + D+ +CAG  
Sbjct: 111 ASSDLPVGTTCTVSGWGNTKTLGL-PDTLQEVTVPVVSRETCRSAYGGTVTDNMICAGA- 168

Query: 193 KGGTDSCQGDSGGPLLLQRPDKQWTIIGVVSWGIGC--GKTPGVYVQVNKYLRWI 245
            GG D+CQGDSGGPL+    D    +IG+VSWG GC  G  PGVY  V+ YL WI
Sbjct: 169 -GGKDACQGDSGGPLV--CSDG--ELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 218


>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 413

 Score = 98.4 bits (245), Expect = 7e-24
 Identities = 81/270 (30%), Positives = 113/270 (41%), Gaps = 57/270 (21%)

Query: 8   TAKIDKGQASEVNDWPWLVALKRQ---YERDNFCGGVLINERWVLTAAHCI-KQKIDNAL 63
           +++I  G  +   ++P LVAL  +   Y    FCGG  +  R+VLTAAHC       ++ 
Sbjct: 30  SSRIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADASSPISSD 89

Query: 64  VLRRTSDLIVRLGEYDFSKVNETKVTDIPAAAMKVYPRFSEQNYENDIALVQLSKKAQYN 123
           V R   DL       D S+     V  I          +S  N  NDIA+++L+      
Sbjct: 90  VNRVVVDL------NDSSQAERGHVRTIYVHEF-----YSPGNLGNDIAVLELA------ 132

Query: 124 SFVRPVCLPQAGDFYEDQ----IGIVTGWGTLSYGG-------------PRSDVLMEVPI 166
              R   LP+      D     +  VT    ++ G              P+  +L EV +
Sbjct: 133 ---RAASLPRVKITSFDASDTFLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTILHEVAV 189

Query: 167 PVWRLTEC-RKQFSQNIFDS-----NLCAGGYKGGTDSCQGDSGGPLLLQRPD--KQWTI 218
               L+ C + +   N  D        CAG      D+CQGDSGGP+  +  +   Q   
Sbjct: 190 LFVPLSTCAQYKGCANASDGATGLTGFCAGRP--PKDACQGDSGGPIFHKGEEGRVQ--- 244

Query: 219 IGVVSWGIG-CGKT--PGVYVQVNKYLRWI 245
            GVVSWG G CG T  PGVY  V+ Y  WI
Sbjct: 245 RGVVSWGDGGCGGTLIPGVYTNVSNYQDWI 274


>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986).  This
           domain is found in serine proteases and is predicted to
           contain disulphide bonds.
          Length = 267

 Score = 37.3 bits (86), Expect = 0.005
 Identities = 29/114 (25%), Positives = 46/114 (40%), Gaps = 14/114 (12%)

Query: 21  DWPWLVALKRQYERDNFCGGVLINERWVLTAAHCIKQKIDNALVLRRTSDLIVRLGEYDF 80
            WPW+   K   E +  C GVLI+  WVL +  C+             S + V LG +  
Sbjct: 15  HWPWIA--KVYVEGNYRCTGVLIDLSWVLVSHSCLW------DTSLEHSYISVVLGGHKT 66

Query: 81  SKVNETKVTDIPAAAMKVYPRFSEQNYENDIALVQLSKKAQYNSFVRPVCLPQA 134
            K  +     I     +V  R  +    + I+L+ L   A +++ V P  +P  
Sbjct: 67  LKSVKGPYEQI----YRVDCR--KDLPRSKISLLHLKSPATFSNHVLPTFVPST 114


>gnl|CDD|222077 pfam13365, Trypsin_2, Trypsin-like peptidase domain.  This family
           includes trypsin like peptidase domains.
          Length = 138

 Score = 32.2 bits (73), Expect = 0.12
 Identities = 25/183 (13%), Positives = 48/183 (26%), Gaps = 48/183 (26%)

Query: 40  GVLINER-WVLTAAHCIKQKIDNALVLRRTSDLIVRLGEYDFSKVNETKVTDIPAAAMKV 98
           G ++     +LT AH ++    + + +                         +PA  +  
Sbjct: 3   GFVVGSDGLILTNAHVVEDADASEIEVVLPDG------------------GRVPAEVVAA 44

Query: 99  YPRFSEQNYENDIALVQLSKKAQYNSFVRPVCLPQAGDFYEDQIGIVTGWGTLSYGGPRS 158
            P         D+AL+++               P          G V   G     G  +
Sbjct: 45  DPDL-------DLALLKVDGP-------LLPAAPLLASSAAPLGGSVVVVGGPGGIGLGA 90

Query: 159 DVLMEVPIPVWRLTECRKQFSQNIFDSNLCAGGYKGGTDSCQGDSGGPLLLQRPDKQWTI 218
                    +   +         + D            D+  G SGGP+     D    +
Sbjct: 91  SGGGGGVGGLVSGSLGGVDGRYILTD-----------ADTSPGSSGGPVF----DADGEV 135

Query: 219 IGV 221
           +G+
Sbjct: 136 VGI 138


>gnl|CDD|226119 COG3591, COG3591, V8-like Glu-specific endopeptidase [Amino acid
           transport and metabolism].
          Length = 251

 Score = 31.2 bits (71), Expect = 0.39
 Identities = 43/224 (19%), Positives = 63/224 (28%), Gaps = 49/224 (21%)

Query: 38  CGGVLINERWVLTAAHCIKQ----KIDNALVLRRTSDLIVRLGEYDFSKVNETKVTDIPA 93
               LI    VLTA HCI      + D A          V      F  + + ++   P 
Sbjct: 66  TAATLIGPNTVLTAGHCIYSPDYGEDDIAAAPP-----GVNSDGGPFYGITKIEIRVYPG 120

Query: 94  AAMKVYPRFSEQNYENDIALVQLSKKAQYNSFVRPVCLPQAGDFYEDQIGI---VTGWGT 150
                   + E     D+    L         V  +    A    E +      V G   
Sbjct: 121 ------ELYKEDGASYDVGEAALESGINIGDVVNYLKRNTAS---EAKANDRITVIG--- 168

Query: 151 LSYGGPRSDVLMEVPIPVWRLTECRKQFSQNIFDSNLCAGGYKGGTDSCQGDSGGPLLLQ 210
             Y G + ++        W  T        N          Y    D+  G SG P+L+ 
Sbjct: 169 --YPGDKPNIGTM-----WESTGKVNSIKGNKLF-------YD--ADTLPGSSGSPVLI- 211

Query: 211 RPDKQWTIIGVVSWGIGCGKTP----GVYVQVNKYLRWIYNTAK 250
               +  +IGV   G G          V +   + L +I    K
Sbjct: 212 ---SKDEVIGVHYNGPGANGGSLANNAVRL-TPEILNFIQQNIK 251


>gnl|CDD|217716 pfam03761, DUF316, Domain of unknown function (DUF316).  This
           family of proteins with unknown function are from
           Caenorhabditis elegans. The protein has GO references
           indicating the protein is a positive regulator of growth
           rate and is also involved in nematode larval
           development.
          Length = 280

 Score = 30.5 bits (69), Expect = 0.70
 Identities = 49/255 (19%), Positives = 82/255 (32%), Gaps = 46/255 (18%)

Query: 10  KIDKGQASEVNDWPWLVALKRQY--ERDNFCGGVLINERWVLTAAHCI--------KQKI 59
           +   G   E +++PWLV    Q   +++       I+ R +LT++           K   
Sbjct: 41  QNINGIYLEKSEYPWLVKAAFQNGNQKNYKPPATFISTRHILTSSRLFLNGKMLNWKNTG 100

Query: 60  DNALVLRRTSDLIV---RLGEYDFSKVNETKVT----DIPAAAMKVYPRFSEQNYENDIA 112
           DN         L V    L ++D   +++ K      D    A  +    + ++  +  A
Sbjct: 101 DNDTCSGGLGHLEVPPEVLDKFDIMDLSKKKGKNSFRDNITRAYVLNICANTKSKFDLSA 160

Query: 113 ---LVQLSKKAQYNSFVRPVCLPQAGDFYEDQIGIVTGWGTLSYGGPRSDVLMEVPIPVW 169
              LV+L       S     CL        +   +  G     YG   S  L    +   
Sbjct: 161 KPMLVELEGPEPNIS---YPCLAD------ESTSLEKGDAVDVYGIDSSGELKHRKL--- 208

Query: 170 RLTECRKQFSQNIFDSNLCAGGYKGGTDSCQGDSGGPLLLQRPDKQWTIIGVVSWG-IGC 228
                      NI +              C+GD GGPL+     K  T+IG  + G +  
Sbjct: 209 -----------NIVNCYSNDLSIGTDQYLCKGDDGGPLIKNVSGK-NTVIGFGATGNMED 256

Query: 229 GKTPGVYVQVNKYLR 243
                 +  + K L 
Sbjct: 257 NANDFKFFNI-KNLS 270


>gnl|CDD|217015 pfam02395, Peptidase_S6, Immunoglobulin A1 protease.  This family
           consists of immunoglobulin A1 protease proteins. The
           immunoglobulin A1 protease cleaves immunoglobulin IgA
           and is found in pathogenic bacteria such as Neisseria
           gonorrhoeae. Not all of the members of this family are
           IgA proteases, espP from E. coli O157:H7 cleaves human
           coagulation factor V and hbp is a hemoglobin protease
           from E. coli EB1.
          Length = 758

 Score = 30.5 bits (69), Expect = 1.0
 Identities = 13/35 (37%), Positives = 17/35 (48%), Gaps = 1/35 (2%)

Query: 200 QGDSGGPLLL-QRPDKQWTIIGVVSWGIGCGKTPG 233
            GDSG PL       K+W ++GV+  G G G    
Sbjct: 205 LGDSGSPLFAYDNQKKKWVLLGVLDGGAGYGGKGN 239


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.138    0.442 

Gapped
Lambda     K      H
   0.267   0.0738    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,747,130
Number of extensions: 1170985
Number of successful extensions: 773
Number of sequences better than 10.0: 1
Number of HSP's gapped: 745
Number of HSP's successfully gapped: 10
Length of query: 252
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 157
Effective length of database: 6,723,972
Effective search space: 1055663604
Effective search space used: 1055663604
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (26.1 bits)