RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8662
         (260 letters)



>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
           synthesized as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. Alignment contains also inactive enzymes
           that have substitutions of the catalytic triad residues.
          Length = 232

 Score =  163 bits (416), Expect = 4e-50
 Identities = 62/112 (55%), Positives = 74/112 (66%), Gaps = 1/112 (0%)

Query: 146 NKTGIVTGWGVQKQGGSTSDTLLEVEVPILSNAECKKTA-YENRITPNMLCAGYPKGEKD 204
             T  V+GWG   +GG   D L EV VPI+SNAECK+   Y   IT NMLCAG  +G KD
Sbjct: 121 GTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAYSYGGTITDNMLCAGGLEGGKD 180

Query: 205 SCQGDSGGPLHYANETVHHIVGVVSWGEGCAQENYPGVYARVNRYLTWIKNN 256
           +CQGDSGGPL   +     +VG+VSWG GCA+ NYPGVY RV+ YL WI+  
Sbjct: 181 ACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQKT 232



 Score = 70.8 bits (174), Expect = 1e-14
 Identities = 21/45 (46%), Positives = 28/45 (62%), Gaps = 1/45 (2%)

Query: 87  IVGGQVTYVHQYPWMALLMYKK-RFYCGATLINNLYVLTAAHCVH 130
           IVGG    +  +PW   L Y   R +CG +LI+  +VLTAAHCV+
Sbjct: 1   IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCVY 45


>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease.  Many of these
           are synthesised as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. A few, however, are active as single chain
           molecules, and others are inactive due to substitutions
           of the catalytic triad residues.
          Length = 229

 Score =  151 bits (383), Expect = 3e-45
 Identities = 59/110 (53%), Positives = 71/110 (64%), Gaps = 3/110 (2%)

Query: 146 NKTGIVTGWGVQKQG-GSTSDTLLEVEVPILSNAECKKTAYEN-RITPNMLCAGYPKGEK 203
             T  V+GWG   +G GS  DTL EV VPI+SNA C++       IT NMLCAG  +G K
Sbjct: 121 GTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAYSGGGAITDNMLCAGGLEGGK 180

Query: 204 DSCQGDSGGPLHYANETVHHIVGVVSWGEGCAQENYPGVYARVNRYLTWI 253
           D+CQGDSGGPL   +     +VG+VSWG GCA+   PGVY RV+ YL WI
Sbjct: 181 DACQGDSGGPLVCNDGR-WVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229


>gnl|CDD|215708 pfam00089, Trypsin, Trypsin. 
          Length = 218

 Score =  122 bits (309), Expect = 2e-34
 Identities = 55/106 (51%), Positives = 68/106 (64%), Gaps = 7/106 (6%)

Query: 148 TGIVTGWGVQKQGGSTSDTLLEVEVPILSNAECKKTAYENRITPNMLCAGYPKGEKDSCQ 207
           T  V+GWG  K  G   DTL EV VP++S   C+  AY   +T NM+CAG   G KD+CQ
Sbjct: 120 TCTVSGWGNTKTLG-LPDTLQEVTVPVVSRETCRS-AYGGTVTDNMICAGA--GGKDACQ 175

Query: 208 GDSGGPLHYANETVHHIVGVVSWGEGCAQENYPGVYARVNRYLTWI 253
           GDSGGPL  ++  +   +G+VSWG GCA  NYPGVY  V+ YL WI
Sbjct: 176 GDSGGPLVCSDGEL---IGIVSWGYGCASGNYPGVYTPVSSYLDWI 218


>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 413

 Score = 69.5 bits (170), Expect = 1e-13
 Identities = 39/115 (33%), Positives = 49/115 (42%), Gaps = 15/115 (13%)

Query: 151 VTGWGVQKQG---GST--SDTLLEVEVPILSNAECKKTAYENRITPNM------LCAGYP 199
              +GV        S+     L EV V  +  + C           +        CAG P
Sbjct: 163 NGTFGVTTPSDVPRSSPKGTILHEVAVLFVPLSTCA-QYKGCANASDGATGLTGFCAGRP 221

Query: 200 KGEKDSCQGDSGGPLHYANETVHHIVGVVSWGEG-CAQENYPGVYARVNRYLTWI 253
              KD+CQGDSGGP+ +  E      GVVSWG+G C     PGVY  V+ Y  WI
Sbjct: 222 P--KDACQGDSGGPIFHKGEEGRVQRGVVSWGDGGCGGTLIPGVYTNVSNYQDWI 274



 Score = 41.0 bits (96), Expect = 4e-04
 Identities = 18/49 (36%), Positives = 25/49 (51%), Gaps = 5/49 (10%)

Query: 86  RIVGGQVTYVHQYPWMALLM-----YKKRFYCGATLINNLYVLTAAHCV 129
           RI+GG      +YP +  L+     Y    +CG + +   YVLTAAHC 
Sbjct: 32  RIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCA 80


>gnl|CDD|226119 COG3591, COG3591, V8-like Glu-specific endopeptidase [Amino acid
           transport and metabolism].
          Length = 251

 Score = 35.1 bits (81), Expect = 0.025
 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 6/52 (11%)

Query: 82  NKKTRIVGGQVTYVHQYPWMALLMYKK---RFYCGATLINNLYVLTAAHCVH 130
            +  R    QVT   Q+P+ A++ ++    R    ATLI    VLTA HC++
Sbjct: 36  AEDDRT---QVTDTTQFPYSAVVQFEAATGRLCTAATLIGPNTVLTAGHCIY 84


>gnl|CDD|203605 pfam07262, DUF1436, Protein of unknown function (DUF1436).  This
           family consists of several hypothetical bacterial
           proteins of around 160 residues in length. The function
           of this family is unknown.
          Length = 158

 Score = 30.4 bits (69), Expect = 0.50
 Identities = 15/53 (28%), Positives = 21/53 (39%), Gaps = 13/53 (24%)

Query: 100 WMALLM-----------YKKRFYCGATLINNLYVLTAAHCVHQGLGIWVTIRG 141
           W+  LM           +K   +CG +LINN+  +   H  H  L  W  I  
Sbjct: 80  WVKKLMEKYGYKTKRALFKNMKFCGISLINNIITIRPTH--HDKLEGWSAIGD 130


>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
          Length = 1355

 Score = 30.4 bits (68), Expect = 1.1
 Identities = 16/64 (25%), Positives = 23/64 (35%), Gaps = 3/64 (4%)

Query: 7   LDDVQHDEVPEVLDDGIS---VVPGPMPVAAPHGFQDYTSWFQELISGFVTNQEESTPPE 63
           LDD +   +  +LDDG       P   PV  P         +Q+         +   P +
Sbjct: 724 LDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQ 783

Query: 64  PVKP 67
           PV P
Sbjct: 784 PVAP 787


>gnl|CDD|217716 pfam03761, DUF316, Domain of unknown function (DUF316).  This
           family of proteins with unknown function are from
           Caenorhabditis elegans. The protein has GO references
           indicating the protein is a positive regulator of growth
           rate and is also involved in nematode larval
           development.
          Length = 280

 Score = 29.7 bits (67), Expect = 1.4
 Identities = 23/117 (19%), Positives = 46/117 (39%), Gaps = 23/117 (19%)

Query: 77  TCG--AVNKKTRIVGGQVTYVHQYPWMALLMYK----KRFYCGATLINNLYVLTAAHCVH 130
           +CG   +   ++ + G      +YPW+    ++    K +   AT I+  ++LT++    
Sbjct: 30  SCGNKTLPLPSQNINGIYLEKSEYPWLVKAAFQNGNQKNYKPPATFISTRHILTSSR--- 86

Query: 131 QGLGIWVTIRGKSFSNKTGIVTGWGVQKQGGST-----SDTLLEVEVPILSNAECKK 182
                 + + GK  + K    TG      GG        + L + ++  LS  + K 
Sbjct: 87  ------LFLNGKMLNWKN---TGDNDTCSGGLGHLEVPPEVLDKFDIMDLSKKKGKN 134


>gnl|CDD|238338 cd00604, IPT_CGTD, IPT domain (domain D) of cyclodextrin
           glycosyltransferase (CGTase) and similar enzymes. These
           enzymes are involved in the enzymatic hydrolysis of
           alpha-1,4 linkages of starch polymers and belong to the
           glycosyl hydrolase family 13. Most consist of three
           domains (A,B,C) but CGTase is more complex and has two
           additional domains (D,E). The function of the IPT/D
           domain is unknown.
          Length = 81

 Score = 26.9 bits (60), Expect = 3.1
 Identities = 16/37 (43%), Positives = 20/37 (54%)

Query: 137 VTIRGKSFSNKTGIVTGWGVQKQGGSTSDTLLEVEVP 173
           VTI G+ F +  G V   G   +  S SDT + VEVP
Sbjct: 17  VTISGEGFGSTGGTVYFGGTAAEVLSWSDTSIVVEVP 53


>gnl|CDD|219131 pfam06676, DUF1178, Protein of unknown function (DUF1178).  This
           family consists of several hypothetical bacterial
           proteins of around 150 residues in length. The function
           of this family is unknown.
          Length = 145

 Score = 27.5 bits (62), Expect = 3.9
 Identities = 7/19 (36%), Positives = 13/19 (68%)

Query: 13  DEVPEVLDDGISVVPGPMP 31
           +E   ++++GI V P P+P
Sbjct: 122 EEAKALIEEGIPVAPLPVP 140


>gnl|CDD|237255 PRK12900, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 1025

 Score = 28.6 bits (64), Expect = 4.0
 Identities = 22/83 (26%), Positives = 36/83 (43%), Gaps = 17/83 (20%)

Query: 2   LSGEPLDDVQH-------DEVPEVLDDGIS---VVPGPMPVAAPHGFQDYTSWF------ 45
           ++G P + VQ        DEV  VL D      ++ GP+P A    FQ+   W       
Sbjct: 247 MAGTPEEMVQRDFYFAIVDEVDSVLIDEARTPLIISGPVPNADNSKFQEIKPWIEQLVRA 306

Query: 46  -QELISGFVTNQEESTPPEPVKP 67
            Q L++ ++T  E++   +P   
Sbjct: 307 QQNLVASYLTEAEKALKTKPNDF 329


>gnl|CDD|226241 COG3718, IolB, Uncharacterized enzyme involved in inositol
           metabolism [Carbohydrate transport and metabolism].
          Length = 270

 Score = 27.3 bits (61), Expect = 8.6
 Identities = 15/35 (42%), Positives = 19/35 (54%), Gaps = 1/35 (2%)

Query: 8   DDVQHDEVPEVLDDGISVVP-GPMPVAAPHGFQDY 41
           DD   DE   V +  + +VP G  PV APHG+  Y
Sbjct: 207 DDRSLDETMAVENGDVVLVPKGYHPVGAPHGYDSY 241


>gnl|CDD|224516 COG1600, COG1600, Uncharacterized Fe-S protein [Energy production
           and conversion].
          Length = 337

 Score = 27.0 bits (60), Expect = 9.6
 Identities = 11/38 (28%), Positives = 18/38 (47%), Gaps = 6/38 (15%)

Query: 40  DYTSWFQELISGFVTNQEESTPPEPVKPVDLEKCGPCT 77
           +Y SW    +   +T+     P  P +P + + CG CT
Sbjct: 160 EYGSWVF--LGAVLTD----APLPPDRPEEEDHCGSCT 191


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.136    0.436 

Gapped
Lambda     K      H
   0.267   0.0738    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,243,519
Number of extensions: 1198395
Number of successful extensions: 813
Number of sequences better than 10.0: 1
Number of HSP's gapped: 802
Number of HSP's successfully gapped: 19
Length of query: 260
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 165
Effective length of database: 6,723,972
Effective search space: 1109455380
Effective search space used: 1109455380
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 58 (26.1 bits)