RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16044
         (436 letters)



>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
           synthesized as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. Alignment contains also inactive enzymes
           that have substitutions of the catalytic triad residues.
          Length = 232

 Score =  209 bits (535), Expect = 9e-66
 Identities = 86/225 (38%), Positives = 120/225 (53%), Gaps = 16/225 (7%)

Query: 28  INGKESIRGAWPWQVSLQVLHPRLGLMPHWCGAVLIHPSWVVTAAHCIHNDIFSLPIPEL 87
           + G E+  G++PWQVSLQ    R     H+CG  LI P WV+TAAHC+++       P  
Sbjct: 2   VGGSEAKIGSFPWQVSLQYTGGR-----HFCGGSLISPRWVLTAAHCVYS-----SAPSN 51

Query: 88  WTAVLGDWDRTEEEKSEVRIPVERIRVHEEFH--NYHHDIALLKLSRPTSARDKGVRAVC 145
           +T  LG  D +  E     I V+++ VH  ++   Y +DIALLKL RP +  D  VR +C
Sbjct: 52  YTVRLGSHDLSSNEGGGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDN-VRPIC 110

Query: 146 LTDADKRPVNPKQQCVATGWGRVKPKGDLVSKLRQIRVPLHNISVCRDKYGDSVELHGGH 205
           L  +    +     C  +GWGR    G L   L+++ VP+ + + C+  Y     +    
Sbjct: 111 LPSSG-YNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAYSYGGTITDNM 169

Query: 206 LCGGQLDGFSGACIGDSGGPLQCSLKDGRWYLAGITSFGSGYCGV 250
           LC G L+G   AC GDSGGPL C+  +GR  L GI S+GSG C  
Sbjct: 170 LCAGGLEGGKDACQGDSGGPLVCN-DNGRGVLVGIVSWGSG-CAR 212



 Score =  147 bits (373), Expect = 8e-42
 Identities = 61/144 (42%), Positives = 82/144 (56%), Gaps = 3/144 (2%)

Query: 285 ALLKLSRPTSARDKGVRAVCLTDADKRPVNPKQQCVATGWGRVKPKGDLVSKLRQIRVPL 344
           ALLKL RP +  D  VR +CL  +    +     C  +GWGR    G L   L+++ VP+
Sbjct: 92  ALLKLKRPVTLSDN-VRPICLPSSG-YNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPI 149

Query: 345 HNISVCRDKYGDSVELHGGHLCGGQLDGFSGACIGDSGGPLQCSLKDGRWYLAGITSFGS 404
            + + C+  Y     +    LC G L+G   AC GDSGGPL C+  +GR  L GI S+GS
Sbjct: 150 VSNAECKRAYSYGGTITDNMLCAGGLEGGKDACQGDSGGPLVCN-DNGRGVLVGIVSWGS 208

Query: 405 GCAKSGYPDVYTKLSFYLPWIRKQ 428
           GCA+  YP VYT++S YL WI+K 
Sbjct: 209 GCARPNYPGVYTRVSSYLDWIQKT 232


>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease.  Many of these
           are synthesised as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. A few, however, are active as single chain
           molecules, and others are inactive due to substitutions
           of the catalytic triad residues.
          Length = 229

 Score =  201 bits (513), Expect = 2e-62
 Identities = 89/227 (39%), Positives = 120/227 (52%), Gaps = 19/227 (8%)

Query: 26  RLINGKESIRGAWPWQVSLQVLHPRLGLMPHWCGAVLIHPSWVVTAAHCIHNDIFSLPIP 85
           R++ G E+  G++PWQVSLQ    R     H+CG  LI P WV+TAAHC+         P
Sbjct: 1   RIVGGSEANIGSFPWQVSLQYGGGR-----HFCGGSLISPRWVLTAAHCVRG-----SDP 50

Query: 86  ELWTAVLGDWDRTEEEKSEVRIPVERIRVHEEFH--NYHHDIALLKLSRPTSARDKGVRA 143
                 LG  D +  E+ +V I V ++ +H  ++   Y +DIALLKL  P +  D  VR 
Sbjct: 51  SNIRVRLGSHDLSSGEEGQV-IKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDN-VRP 108

Query: 144 VCLTDADKRPVNPKQQCVATGWGRVKP-KGDLVSKLRQIRVPLHNISVCRDKYGDSVELH 202
           +CL  ++   V     C  +GWGR     G L   L+++ VP+ + + CR  Y     + 
Sbjct: 109 ICLPSSN-YNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAYSGGGAIT 167

Query: 203 GGHLCGGQLDGFSGACIGDSGGPLQCSLKDGRWYLAGITSFGSGYCG 249
              LC G L+G   AC GDSGGPL C   DGRW L GI S+GSG C 
Sbjct: 168 DNMLCAGGLEGGKDACQGDSGGPLVC--NDGRWVLVGIVSWGSG-CA 211



 Score =  140 bits (356), Expect = 3e-39
 Identities = 63/142 (44%), Positives = 80/142 (56%), Gaps = 5/142 (3%)

Query: 285 ALLKLSRPTSARDKGVRAVCLTDADKRPVNPKQQCVATGWGRVKP-KGDLVSKLRQIRVP 343
           ALLKL  P +  D  VR +CL  ++   V     C  +GWGR     G L   L+++ VP
Sbjct: 92  ALLKLKEPVTLSDN-VRPICLPSSN-YNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVP 149

Query: 344 LHNISVCRDKYGDSVELHGGHLCGGQLDGFSGACIGDSGGPLQCSLKDGRWYLAGITSFG 403
           + + + CR  Y     +    LC G L+G   AC GDSGGPL C   DGRW L GI S+G
Sbjct: 150 IVSNATCRRAYSGGGAITDNMLCAGGLEGGKDACQGDSGGPLVC--NDGRWVLVGIVSWG 207

Query: 404 SGCAKSGYPDVYTKLSFYLPWI 425
           SGCA+ G P VYT++S YL WI
Sbjct: 208 SGCARPGKPGVYTRVSSYLDWI 229


>gnl|CDD|215708 pfam00089, Trypsin, Trypsin. 
          Length = 218

 Score =  155 bits (393), Expect = 6e-45
 Identities = 84/233 (36%), Positives = 114/233 (48%), Gaps = 25/233 (10%)

Query: 28  INGKESIRGAWPWQVSLQVLHPRLGLMPHWCGAVLIHPSWVVTAAHCIHNDIFSLPIPEL 87
           + G E+  G++PWQVSLQV   +     H+CG  LI  +WV+TAAHC+ N        + 
Sbjct: 2   VGGDEAQPGSFPWQVSLQVSSGK-----HFCGGSLISENWVLTAAHCVSN-------AKS 49

Query: 88  WTAVLGDWDRTEEEKSEVRIPVERIRVHEEF-HNYHHDIALLKLSRPTSARDKGVRAVCL 146
              VLG  +    E  E +  V+++ VH  +  +  +DIALLKL  P +  D  VR +CL
Sbjct: 50  VRVVLGAHNIVLREGGEQKFDVKKVIVHPNYNPDTDNDIALLKLKSPVTLGD-TVRPICL 108

Query: 147 TDADKRPVNPKQQCVATGWGRVKPKGDLVSKLRQIRVPLHNISVCRDKYGDSVELHGGHL 206
             A    +     C  +GWG  K  G L   L+++ VP+ +   CR  YG +V      +
Sbjct: 109 PTAS-SDLPVGTTCTVSGWGNTKTLG-LPDTLQEVTVPVVSRETCRSAYGGTVT--DNMI 164

Query: 207 CGGQLDGFSGACIGDSGGPLQCSLKDGRWYLAGITSFGSGYCGVGIRYSHRQP 259
           C G   G   AC GDSGGPL CS       L GI S+G G C  G       P
Sbjct: 165 CAG--AGGKDACQGDSGGPLVCSDG----ELIGIVSWGYG-CASGNYPGVYTP 210



 Score =  105 bits (263), Expect = 3e-26
 Identities = 59/141 (41%), Positives = 72/141 (51%), Gaps = 11/141 (7%)

Query: 285 ALLKLSRPTSARDKGVRAVCLTDADKRPVNPKQQCVATGWGRVKPKGDLVSKLRQIRVPL 344
           ALLKL  P +  D  VR +CL  A    +     C  +GWG  K  G L   L+++ VP+
Sbjct: 89  ALLKLKSPVTLGD-TVRPICLPTAS-SDLPVGTTCTVSGWGNTKTLG-LPDTLQEVTVPV 145

Query: 345 HNISVCRDKYGDSVELHGGHLCGGQLDGFSGACIGDSGGPLQCSLKDGRWYLAGITSFGS 404
            +   CR  YG +V      +C G   G   AC GDSGGPL CS       L GI S+G 
Sbjct: 146 VSRETCRSAYGGTVT--DNMICAG--AGGKDACQGDSGGPLVCSDG----ELIGIVSWGY 197

Query: 405 GCAKSGYPDVYTKLSFYLPWI 425
           GCA   YP VYT +S YL WI
Sbjct: 198 GCASGNYPGVYTPVSSYLDWI 218


>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 413

 Score = 83.0 bits (205), Expect = 3e-17
 Identities = 63/235 (26%), Positives = 95/235 (40%), Gaps = 19/235 (8%)

Query: 25  PRLINGKESIRGAWPWQVSLQVLHPRLGLMPHWCGAVLIHPSWVVTAAHCIHNDIFSLPI 84
            R+I G  +  G +P  V+L V      +   +CG   +   +V+TAAHC      S PI
Sbjct: 31  SRIIGGSNANAGEYPSLVAL-VDRISDYVSGTFCGGSKLGGRYVLTAAHCADA---SSPI 86

Query: 85  PELWTAVLGDWDRTEEEKSEVRIPVERIRVHEEFH--NYHHDIALLKLSRPTSARDKGVR 142
                 V+   D  +  ++E R  V  I VHE +   N  +DIA+L+L+R  S     + 
Sbjct: 87  SSDVNRVV--VDLNDSSQAE-RGHVRTIYVHEFYSPGNLGNDIAVLELARAASLPRVKIT 143

Query: 143 AVCLTDADKRPVNPKQQCVATGWGRVKPKGDL-----VSKLRQIRVPLHNISVCRDKYGD 197
           +   +D     V          +G   P          + L ++ V    +S C    G 
Sbjct: 144 SFDASDTFLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTILHEVAVLFVPLSTCAQYKGC 203

Query: 198 SVELHGGHLC--GGQLDGFS-GACIGDSGGPLQCSLKDGRWYLAGITSFGSGYCG 249
           +     G     G         AC GDSGGP+    ++GR    G+ S+G G CG
Sbjct: 204 A-NASDGATGLTGFCAGRPPKDACQGDSGGPIFHKGEEGRVQR-GVVSWGDGGCG 256



 Score = 57.6 bits (139), Expect = 5e-09
 Identities = 40/157 (25%), Positives = 59/157 (37%), Gaps = 11/157 (7%)

Query: 283 SAALLKLSRPTSARDKGVRAVCLTDADKRPVNPKQQCVATGWGRVKPKGDL-----VSKL 337
             A+L+L+R  S     + +   +D     V          +G   P          + L
Sbjct: 125 DIAVLELARAASLPRVKITSFDASDTFLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTIL 184

Query: 338 RQIRVPLHNISVCRDKYGDSVELHGGHLC--GGQLDGFS-GACIGDSGGPLQCSLKDGRW 394
            ++ V    +S C    G +     G     G         AC GDSGGP+    ++GR 
Sbjct: 185 HEVAVLFVPLSTCAQYKGCA-NASDGATGLTGFCAGRPPKDACQGDSGGPIFHKGEEGRV 243

Query: 395 YLAGITSFG-SGCAKSGYPDVYTKLSFYLPWIRKQIN 430
              G+ S+G  GC  +  P VYT +S Y  WI    N
Sbjct: 244 QR-GVVSWGDGGCGGTLIPGVYTNVSNYQDWIAAMTN 279


>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986).  This
           domain is found in serine proteases and is predicted to
           contain disulphide bonds.
          Length = 267

 Score = 31.6 bits (71), Expect = 0.72
 Identities = 24/99 (24%), Positives = 40/99 (40%), Gaps = 14/99 (14%)

Query: 38  WPWQVSLQVLHPRLGLMPHWCGAVLIHPSWVVTAAHCIHNDIFSLPIPELWTAVLGDWDR 97
           WPW   + V     G     C  VLI  SWV+ +  C+ +      +   + +V+    +
Sbjct: 16  WPWIAKVYVE----GNY--RCTGVLIDLSWVLVSHSCLWD----TSLEHSYISVVLGGHK 65

Query: 98  TEEEKSEVRIPVERIRVHEEFHN-YHHDIALLKLSRPTS 135
           T      V+ P E+I   +   +     I+LL L  P +
Sbjct: 66  T---LKSVKGPYEQIYRVDCRKDLPRSKISLLHLKSPAT 101


>gnl|CDD|217015 pfam02395, Peptidase_S6, Immunoglobulin A1 protease.  This family
           consists of immunoglobulin A1 protease proteins. The
           immunoglobulin A1 protease cleaves immunoglobulin IgA
           and is found in pathogenic bacteria such as Neisseria
           gonorrhoeae. Not all of the members of this family are
           IgA proteases, espP from E. coli O157:H7 cleaves human
           coagulation factor V and hbp is a hemoglobin protease
           from E. coli EB1.
          Length = 758

 Score = 32.1 bits (73), Expect = 0.72
 Identities = 17/57 (29%), Positives = 24/57 (42%), Gaps = 1/57 (1%)

Query: 379 GDSGGPLQC-SLKDGRWYLAGITSFGSGCAKSGYPDVYTKLSFYLPWIRKQINIAVD 434
           GDSG PL     +  +W L G+   G+G    G     TK  F    ++K  +  V 
Sbjct: 206 GDSGSPLFAYDNQKKKWVLLGVLDGGAGYGGKGNNWNVTKKDFLGQVLQKDTDATVK 262



 Score = 29.0 bits (65), Expect = 5.6
 Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 1/33 (3%)

Query: 220 GDSGGPLQC-SLKDGRWYLAGITSFGSGYCGVG 251
           GDSG PL     +  +W L G+   G+GY G G
Sbjct: 206 GDSGSPLFAYDNQKKKWVLLGVLDGGAGYGGKG 238


>gnl|CDD|226119 COG3591, COG3591, V8-like Glu-specific endopeptidase [Amino acid
          transport and metabolism].
          Length = 251

 Score = 30.0 bits (68), Expect = 1.8
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 58 CGAVLIHPSWVVTAAHCIHN 77
            A LI P+ V+TA HCI++
Sbjct: 66 TAATLIGPNTVLTAGHCIYS 85


>gnl|CDD|233677 TIGR01995, PTS-II-ABC-beta, PTS system, beta-glucoside-specific
           IIABC component.  This model represents a family of PTS
           enzyme II proteins in which all three domains are found
           in the same polypeptide chain and which appear to have a
           broad specificity for beta-glucosides including salicin
           (beta-D-glucose-1-salicylate) and arbutin
           (Hydroquinone-O-beta-D-glucopyranoside). These are
           distinct from the closely related sucrose-specific and
           trehalose-specific PTS transporters.
          Length = 610

 Score = 30.0 bits (68), Expect = 2.8
 Identities = 8/27 (29%), Positives = 12/27 (44%)

Query: 59  GAVLIHPSWVVTAAHCIHNDIFSLPIP 85
           GA L+HP+ V           F +P+ 
Sbjct: 177 GAALLHPTLVAMVGSGKPVTFFGIPVT 203


>gnl|CDD|217150 pfam02622, DUF179, Uncharacterized ACR, COG1678. 
          Length = 158

 Score = 29.0 bits (66), Expect = 3.2
 Identities = 10/27 (37%), Positives = 12/27 (44%), Gaps = 2/27 (7%)

Query: 66  SWVVTAAHCIHNDIFSLPIPELWTAVL 92
           SW+V  A      IF  P  + W A L
Sbjct: 130 SWLVAPAD--PEIIFDTPPEDRWEAAL 154


>gnl|CDD|236659 PRK10158, PRK10158, 23S rRNA/tRNA pseudouridine synthase A;
           Provisional.
          Length = 219

 Score = 28.8 bits (64), Expect = 4.2
 Identities = 27/86 (31%), Positives = 36/86 (41%), Gaps = 19/86 (22%)

Query: 97  RTEEEKSEVRIPVERIRVHEEFHNYHHDIALLKLSRPTSARDKGVRAVCLTDADKRPVN- 155
           R EE K  V   ++R        +Y    ++ +L   TS    GV  V LT A +R +  
Sbjct: 38  RLEEHKDSVMTRIQR--------DYPQAESVHRLDMATS----GVIVVALTKAAERELKR 85

Query: 156 ------PKQQCVATGWGRVKPKGDLV 175
                 PK+Q VA  WG   P   LV
Sbjct: 86  QFREREPKKQYVARVWGHPSPAEGLV 111


>gnl|CDD|219188 pfam06814, Lung_7-TM_R, Lung seven transmembrane receptor.  This
           family represents a conserved region with eukaryotic
           lung seven transmembrane receptors and related proteins.
          Length = 287

 Score = 28.8 bits (65), Expect = 4.9
 Identities = 10/27 (37%), Positives = 15/27 (55%), Gaps = 1/27 (3%)

Query: 160 CVATGWGRVKPK-GDLVSKLRQIRVPL 185
            V+ G+G VKP  GD + K+  I +  
Sbjct: 129 IVSLGYGIVKPTLGDTLKKVAGIGLLY 155



 Score = 28.8 bits (65), Expect = 4.9
 Identities = 10/27 (37%), Positives = 15/27 (55%), Gaps = 1/27 (3%)

Query: 319 CVATGWGRVKPK-GDLVSKLRQIRVPL 344
            V+ G+G VKP  GD + K+  I +  
Sbjct: 129 IVSLGYGIVKPTLGDTLKKVAGIGLLY 155


>gnl|CDD|234694 PRK00228, PRK00228, hypothetical protein; Validated.
          Length = 191

 Score = 28.7 bits (65), Expect = 5.1
 Identities = 7/27 (25%), Positives = 11/27 (40%), Gaps = 2/27 (7%)

Query: 66  SWVVTAAHCIHNDIFSLPIPELWTAVL 92
           +W+   A      +F  P  + W A L
Sbjct: 150 AWLTAPAD--PEILFDTPPEDRWRAAL 174


>gnl|CDD|236772 PRK10833, PRK10833, putative assembly protein; Provisional.
          Length = 617

 Score = 28.7 bits (65), Expect = 7.0
 Identities = 13/38 (34%), Positives = 16/38 (42%), Gaps = 10/38 (26%)

Query: 156 PKQQC-------VATGWGRVKPKGDLVSKLRQIRVPLH 186
           PKQQC       V  GW        L+  L++  VPL 
Sbjct: 530 PKQQCDVNFNVRVTGGW---NGDSKLIDFLQETPVPLR 564



 Score = 28.7 bits (65), Expect = 7.0
 Identities = 13/38 (34%), Positives = 16/38 (42%), Gaps = 10/38 (26%)

Query: 315 PKQQC-------VATGWGRVKPKGDLVSKLRQIRVPLH 345
           PKQQC       V  GW        L+  L++  VPL 
Sbjct: 530 PKQQCDVNFNVRVTGGW---NGDSKLIDFLQETPVPLR 564


>gnl|CDD|221807 pfam12850, Metallophos_2, Calcineurin-like phosphoesterase
           superfamily domain.  Members of this family are part of
           the Calcineurin-like phosphoesterase superfamily.
          Length = 146

 Score = 27.6 bits (62), Expect = 9.0
 Identities = 16/74 (21%), Positives = 30/74 (40%), Gaps = 13/74 (17%)

Query: 75  IHNDIFSLPIPELWTAVLGDWDRTEEEKSEVRIPVERIR---VHEEFHNYHHDIALLKLS 131
           + + +  +P+     AV G+ D  E+   E+ + +   R   VH   +   H +A L   
Sbjct: 41  VLDLLADIPVY----AVRGNNDGDEDLPEELVLTLGGFRILVVHGHPYGVKHGLAELL-- 94

Query: 132 RPTSARDKGVRAVC 145
               A++ G   V 
Sbjct: 95  ----AKEGGADVVF 104


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.140    0.465 

Gapped
Lambda     K      H
   0.267   0.0738    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,334,688
Number of extensions: 2128616
Number of successful extensions: 1708
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1679
Number of HSP's successfully gapped: 25
Length of query: 436
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 336
Effective length of database: 6,502,202
Effective search space: 2184739872
Effective search space used: 2184739872
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 60 (26.9 bits)