RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8651
         (302 letters)



>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease.  Many of these
           are synthesised as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. A few, however, are active as single chain
           molecules, and others are inactive due to substitutions
           of the catalytic triad residues.
          Length = 229

 Score =  215 bits (550), Expect = 1e-69
 Identities = 78/195 (40%), Positives = 110/195 (56%), Gaps = 22/195 (11%)

Query: 100 RIVGGNVTKLHEFPWIAALT-KKGKFYCGATLIAKRHVLTAAHCIEGVNPKEIKVTLGEH 158
           RIVGG+   +  FPW  +L    G+ +CG +LI+ R VLTAAHC+ G +P  I+V LG H
Sbjct: 1   RIVGGSEANIGSFPWQVSLQYGGGRHFCGGSLISPRWVLTAAHCVRGSDPSNIRVRLGSH 60

Query: 159 DRLSKNESVPVIIRKVKRAIRHPDFSLSNFNNDIALLEMESGVDFEAPQIHPACLP---- 214
           D  S  E     + KV + I HP+++ S ++NDIALL+++  V      + P CLP    
Sbjct: 61  DLSSGEEGQ---VIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLS-DNVRPICLPSSNY 116

Query: 215 -------------GNSLDERKPTANSLRKVEVPILSEEECKSAGYSASRITNNMLCAGYA 261
                        G + +      ++L++V VPI+S   C+ A      IT+NMLCAG  
Sbjct: 117 NVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAYSGGGAITDNMLCAGGL 176

Query: 262 EGKRDSCQGDSGGPL 276
           EG +D+CQGDSGGPL
Sbjct: 177 EGGKDACQGDSGGPL 191



 Score =  103 bits (260), Expect = 1e-26
 Identities = 35/77 (45%), Positives = 47/77 (61%), Gaps = 1/77 (1%)

Query: 2   DEANYTGKIGIVAGWGRLDERKPT-ANSLRKVEVPILSEEECKSAGYSASRITNNMLCAG 60
           +     G    V+GWGR  E   +  ++L++V VPI+S   C+ A      IT+NMLCAG
Sbjct: 115 NYNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAYSGGGAITDNMLCAG 174

Query: 61  YAEGKRDSCQGDSGGPL 77
             EG +D+CQGDSGGPL
Sbjct: 175 GLEGGKDACQGDSGGPL 191


>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
           synthesized as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. Alignment contains also inactive enzymes
           that have substitutions of the catalytic triad residues.
          Length = 232

 Score =  214 bits (547), Expect = 3e-69
 Identities = 79/193 (40%), Positives = 107/193 (55%), Gaps = 20/193 (10%)

Query: 101 IVGGNVTKLHEFPWIAALT-KKGKFYCGATLIAKRHVLTAAHCIEGVNPKEIKVTLGEHD 159
           IVGG+  K+  FPW  +L    G+ +CG +LI+ R VLTAAHC+    P    V LG HD
Sbjct: 1   IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCVYSSAPSNYTVRLGSHD 60

Query: 160 RLSKNESVPVIIRKVKRAIRHPDFSLSNFNNDIALLEMESGVDFEAPQIHPACLPGNS-- 217
             S        + KVK+ I HP+++ S ++NDIALL+++  V   +  + P CLP +   
Sbjct: 61  LSSNEG--GGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTL-SDNVRPICLPSSGYN 117

Query: 218 --------------LDERKPTANSLRKVEVPILSEEECKSAGYSASRITNNMLCAGYAEG 263
                           E  P  + L++V VPI+S  ECK A      IT+NMLCAG  EG
Sbjct: 118 LPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAYSYGGTITDNMLCAGGLEG 177

Query: 264 KRDSCQGDSGGPL 276
            +D+CQGDSGGPL
Sbjct: 178 GKDACQGDSGGPL 190



 Score =  109 bits (275), Expect = 1e-28
 Identities = 38/72 (52%), Positives = 46/72 (63%)

Query: 6   YTGKIGIVAGWGRLDERKPTANSLRKVEVPILSEEECKSAGYSASRITNNMLCAGYAEGK 65
             G    V+GWGR  E  P  + L++V VPI+S  ECK A      IT+NMLCAG  EG 
Sbjct: 119 PAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAYSYGGTITDNMLCAGGLEGG 178

Query: 66  RDSCQGDSGGPL 77
           +D+CQGDSGGPL
Sbjct: 179 KDACQGDSGGPL 190


>gnl|CDD|215708 pfam00089, Trypsin, Trypsin. 
          Length = 218

 Score =  151 bits (383), Expect = 7e-45
 Identities = 71/192 (36%), Positives = 104/192 (54%), Gaps = 26/192 (13%)

Query: 101 IVGGNVTKLHEFPWIAALTKKGKFY-CGATLIAKRHVLTAAHCIEGVNPKEIKVTLGEHD 159
           IVGG+  +   FPW  +L      + CG +LI++  VLTAAHC+   N K ++V LG H+
Sbjct: 1   IVGGDEAQPGSFPWQVSLQVSSGKHFCGGSLISENWVLTAAHCVS--NAKSVRVVLGAHN 58

Query: 160 RLSKNESVPVIIRKVKRAIRHPDFSLSNFNNDIALLEMESGVDFEAPQIHPACLPGNSLD 219
            + +          VK+ I HP+++     NDIALL+++S V      + P CLP  S D
Sbjct: 59  IVLREGGEQKF--DVKKVIVHPNYNPDTD-NDIALLKLKSPVTL-GDTVRPICLPTASSD 114

Query: 220 ERKPT---------------ANSLRKVEVPILSEEECKSAGYSASRITNNMLCAGYAEGK 264
               T                ++L++V VP++S E C+SA      +T+NM+CAG   G 
Sbjct: 115 LPVGTTCTVSGWGNTKTLGLPDTLQEVTVPVVSRETCRSAYG--GTVTDNMICAGA--GG 170

Query: 265 RDSCQGDSGGPL 276
           +D+CQGDSGGPL
Sbjct: 171 KDACQGDSGGPL 182



 Score = 75.6 bits (186), Expect = 3e-16
 Identities = 31/70 (44%), Positives = 44/70 (62%), Gaps = 5/70 (7%)

Query: 8   GKIGIVAGWGRLDERKPTANSLRKVEVPILSEEECKSAGYSASRITNNMLCAGYAEGKRD 67
           G    V+GWG         ++L++V VP++S E C+SA      +T+NM+CAG   G +D
Sbjct: 118 GTTCTVSGWGNTKTLGL-PDTLQEVTVPVVSRETCRSAYGG--TVTDNMICAGA--GGKD 172

Query: 68  SCQGDSGGPL 77
           +CQGDSGGPL
Sbjct: 173 ACQGDSGGPL 182


>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 413

 Score = 80.7 bits (199), Expect = 3e-17
 Identities = 54/210 (25%), Positives = 80/210 (38%), Gaps = 40/210 (19%)

Query: 100 RIVGGNVTKLHEFPWIAALTKKGKF-----YCGATLIAKRHVLTAAHCIEGVNPKEIKVT 154
           RI+GG+     E+P + AL  +        +CG + +  R+VLTAAHC +  +P    V 
Sbjct: 32  RIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADASSPISSDVN 91

Query: 155 LGEHDRLSKNESVPVIIRKVKRAIRHPDFSLSNFNNDIALLEMESGVDFEAPQIHPA--- 211
               D L+ +         V+    H  +S  N  NDIA+LE+         +I      
Sbjct: 92  RVVVD-LNDSSQAERG--HVRTIYVHEFYSPGNLGNDIAVLELARAASLPRVKITSFDAS 148

Query: 212 -----CLPGNSLDE--------------RKPTANSLRKVEVPILSEEECKS------AGY 246
                 +   S                   P    L +V V  +    C        A  
Sbjct: 149 DTFLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTILHEVAVLFVPLSTCAQYKGCANASD 208

Query: 247 SASRITNNMLCAGYAEGKRDSCQGDSGGPL 276
            A+ +T    CAG     +D+CQGDSGGP+
Sbjct: 209 GATGLTG--FCAGRP--PKDACQGDSGGPI 234



 Score = 33.7 bits (77), Expect = 0.094
 Identities = 29/105 (27%), Positives = 44/105 (41%), Gaps = 12/105 (11%)

Query: 24  PTANSLRKVEVPILSEEECKS------AGYSASRITNNMLCAGYAEGKRDSCQGDSGGPL 77
           P    L +V V  +    C        A   A+ +T    CAG     +D+CQGDSGGP+
Sbjct: 179 PKGTILHEVAVLFVPLSTCAQYKGCANASDGATGLTG--FCAGRP--PKDACQGDSGGPI 234

Query: 78  QIAVAR-PGKMEVIACGQVERNQRIVGGNVTKLHEF-PWIAALTK 120
                    +  V++ G       ++ G  T +  +  WIAA+T 
Sbjct: 235 FHKGEEGRVQRGVVSWGDGGCGGTLIPGVYTNVSNYQDWIAAMTN 279


>gnl|CDD|226119 COG3591, COG3591, V8-like Glu-specific endopeptidase [Amino acid
           transport and metabolism].
          Length = 251

 Score = 38.1 bits (89), Expect = 0.003
 Identities = 22/75 (29%), Positives = 29/75 (38%), Gaps = 12/75 (16%)

Query: 77  LQIAVARPGKMEVIACGQVERNQRIVGGN-----VTKLHEFPW--IAAL-TKKGKFYCGA 128
           L I  +  G    +    VE  Q+          VT   +FP+  +       G+    A
Sbjct: 13  LTILTSAAG----VGATAVEEPQQTASAEDDRTQVTDTTQFPYSAVVQFEAATGRLCTAA 68

Query: 129 TLIAKRHVLTAAHCI 143
           TLI    VLTA HCI
Sbjct: 69  TLIGPNTVLTAGHCI 83


>gnl|CDD|222077 pfam13365, Trypsin_2, Trypsin-like peptidase domain.  This family
           includes trypsin like peptidase domains.
          Length = 138

 Score = 36.8 bits (85), Expect = 0.003
 Identities = 29/148 (19%), Positives = 47/148 (31%), Gaps = 30/148 (20%)

Query: 132 AKRHVLTAAHCIEGVNPKEIKVTLGEHDRLSKNESVPVIIRKVKRAIRHPDFSLSNFNND 191
           +   +LT AH +E  +  EI+V L +               +V   +   D  L     D
Sbjct: 8   SDGLILTNAHVVEDADASEIEVVLPDG-------------GRVPAEVVAADPDL-----D 49

Query: 192 IALLEMESGVDFEAPQIHPACLPGNSLDERKPTANSLRKVEVPILSEEECKSAGYSASRI 251
           +ALL+++  +   AP +              P   S+  V  P          G     +
Sbjct: 50  LALLKVDGPLLPAAPLL---------ASSAAPLGGSVVVVGGPGGIGLGASGGGGGVGGL 100

Query: 252 TNNMLCAGYAEGKRDSCQ---GDSGGPL 276
            +  L                G SGGP+
Sbjct: 101 VSGSLGGVDGRYILTDADTSPGSSGGPV 128


>gnl|CDD|217716 pfam03761, DUF316, Domain of unknown function (DUF316).  This
           family of proteins with unknown function are from
           Caenorhabditis elegans. The protein has GO references
           indicating the protein is a positive regulator of growth
           rate and is also involved in nematode larval
           development.
          Length = 280

 Score = 36.2 bits (84), Expect = 0.012
 Identities = 44/218 (20%), Positives = 67/218 (30%), Gaps = 46/218 (21%)

Query: 92  CGQ--VERNQRIVGGNVTKLHEFPWIAALTKK----GKFYCGATLIAKRHVLTAAHCIEG 145
           CG   +    + + G   +  E+PW+     +      +   AT I+ RH+LT++     
Sbjct: 31  CGNKTLPLPSQNINGIYLEKSEYPWLVKAAFQNGNQKNYKPPATFISTRHILTSSRLFLN 90

Query: 146 VNPKEIKVTL--GEHDRLSKNESVP------VIIR-------------KVKRAIRHP--D 182
                 K T           +  VP        I               + RA       
Sbjct: 91  GKMLNWKNTGDNDTCSGGLGHLEVPPEVLDKFDIMDLSKKKGKNSFRDNITRAYVLNICA 150

Query: 183 FSLSNFNNDIALLEMESGVDFEAPQIHPACLPGNSLDERKPTANSLRKVEV----PILSE 238
            + S F+     + +E  ++   P I   CL   S         SL K +      I S 
Sbjct: 151 NTKSKFDLSAKPMLVE--LEGPEPNISYPCLADES--------TSLEKGDAVDVYGIDSS 200

Query: 239 EECKSAGYSASRITNNMLCAGYAEGKRDSCQGDSGGPL 276
            E K        I N           +  C+GD GGPL
Sbjct: 201 GELKH---RKLNIVNCYSNDLSIGTDQYLCKGDDGGPL 235


>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986).  This
           domain is found in serine proteases and is predicted to
           contain disulphide bonds.
          Length = 267

 Score = 33.9 bits (77), Expect = 0.070
 Identities = 19/72 (26%), Positives = 30/72 (41%), Gaps = 2/72 (2%)

Query: 105 NVTKLHEFPWIAALTKKGKFYCGATLIAKRHVLTAAHCIEGVNPKE--IKVTLGEHDRLS 162
              +   +PWIA +  +G + C   LI    VL +  C+   + +   I V LG H  L 
Sbjct: 9   TFKEELHWPWIAKVYVEGNYRCTGVLIDLSWVLVSHSCLWDTSLEHSYISVVLGGHKTLK 68

Query: 163 KNESVPVIIRKV 174
             +     I +V
Sbjct: 69  SVKGPYEQIYRV 80


>gnl|CDD|130870 TIGR01811, sdhA_Bsu, succinate dehydrogenase or fumarate reductase,
           flavoprotein subunit, Bacillus subtilis subgroup.  This
           model represents the succinate dehydrogenase
           flavoprotein subunit as found in the low-GC
           Gram-positive bacteria and a few other lineages. This
           enzyme may act in a complete or partial TCA cycle, or
           act in the opposite direction as fumarate reductase. In
           some but not all species, succinate dehydrogenase and
           fumarate reductase may be encoded as separate isozymes
           [Energy metabolism, TCA cycle].
          Length = 603

 Score = 29.4 bits (66), Expect = 2.7
 Identities = 13/37 (35%), Positives = 17/37 (45%), Gaps = 5/37 (13%)

Query: 198 ESGVDFEAP---QIHPACLPGNSLDERKPT--ANSLR 229
           E G  F  P   QIHP  +P +   + K    + SLR
Sbjct: 218 EQGAYFANPEFIQIHPTAIPVDGTWQSKLRLMSESLR 254


>gnl|CDD|176391 cd01796, DDI1_N, DNA damage inducible protein 1 ubiquitin-like
           domain.  DDI1_N   DDI1 (DNA damage inducible protein 1)
           has an amino-terminal ubiquitin-like domain, an
           retroviral protease-like (RVP-like) domain, and a UBA
           (ubiquitin-associated) domain.  This CD represents the
           amino-terminal ubiquitin-like domain of DDI1.
          Length = 71

 Score = 26.9 bits (60), Expect = 3.2
 Identities = 12/28 (42%), Positives = 14/28 (50%), Gaps = 3/28 (10%)

Query: 181 PDFSLSNFNNDIALLEMESGVDFEAPQI 208
           PD  L NF    AL E ESG+     Q+
Sbjct: 18  PDLELENFK---ALCEAESGIPASQQQL 42


>gnl|CDD|226828 COG4393, COG4393, Predicted membrane protein [Function unknown].
          Length = 405

 Score = 28.6 bits (64), Expect = 3.8
 Identities = 20/64 (31%), Positives = 26/64 (40%), Gaps = 18/64 (28%)

Query: 76  PLQIAVARP-----GKMEVIACGQVERNQRIVGGNVTKLHEFPWIAALTKKGKFYCGATL 130
           P  ++  +P     G M VI   QV            KLH F +IA   K  +F+    L
Sbjct: 274 PPALSEPQPVQPDEGNMVVIPIEQVRDG---------KLHRFVYIADDGKAVRFF----L 320

Query: 131 IAKR 134
           I KR
Sbjct: 321 INKR 324


>gnl|CDD|176492 cd08762, Cyt_b561_CYBASC3, Vertebrate cytochrome b(561), CYBASC3
           gene product.  Cytochrome b ascorbate-dependent 3, as
           found in vertebrates, which might act as a
           ferric-chelate reductase, catalyzing the reduction of
           Fe(3+) to Fe(2+), such as associated with the transport
           of iron from the endosome to the cytoplasm. It is
           assumed that this protein uses ascorbate as the electron
           donor. Belongs to the cytochrome b(561) family, which
           are secretory vesicle-specific electron transport
           proteins. Cytochromes b(561) are integral membrane
           proteins that bind two heme groups non-covalently, and
           may have six alpha-helical trans-membrane segments.
          Length = 179

 Score = 28.3 bits (63), Expect = 4.3
 Identities = 14/36 (38%), Positives = 20/36 (55%), Gaps = 4/36 (11%)

Query: 114 WIAALTKKGKFYCGATLIAKRHVLTAAHCIEGVNPK 149
           W+ AL K    + GA ++    VL+ A CI G+N K
Sbjct: 138 WLRALVKPIHVFFGAMIL----VLSIASCISGINEK 169


>gnl|CDD|237585 PRK14015, pepN, aminopeptidase N; Provisional.
          Length = 875

 Score = 28.2 bits (64), Expect = 7.1
 Identities = 7/16 (43%), Positives = 10/16 (62%)

Query: 174 VKRAIRHPDFSLSNFN 189
           V+  ++HP F L N N
Sbjct: 769 VRALMQHPAFDLKNPN 784


>gnl|CDD|221331 pfam11940, DUF3458, Domain of unknown function (DUF3458).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is typically between 402 to 419 amino acids
           in length. This domain is found associated with
           pfam01433. This domain has a conserved FSAPV sequence
           motif.
          Length = 365

 Score = 27.9 bits (63), Expect = 7.7
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query: 174 VKRAIRHPDFSLSNFN 189
           VK  ++HP F L N N
Sbjct: 303 VKALMQHPAFDLKNPN 318


>gnl|CDD|233858 TIGR02414, pepN_proteo, aminopeptidase N, Escherichia coli type.
           The M1 family of zinc metallopeptidases contains a
           number of distinct, well-separated clades of proteins
           with aminopeptidase activity. Several are designated
           aminopeptidase N, EC 3.4.11.2, after the Escherichia
           coli enzyme, suggesting a similar activity profile (see
           SP|P04825 for a description of catalytic activity). This
           family consists of all aminopeptidases closely related
           to E. coli PepN and presumed to have similar (not
           identical) function. Nearly all are found in
           Proteobacteria, but members are found also in
           Cyanobacteria, plants, and apicomplexan parasites. This
           family differs greatly in sequence from the family of
           aminopeptidases typified by Streptomyces lividans PepN
           (TIGR02412), from the membrane bound aminopeptidase N
           family in animals, etc [Protein fate, Degradation of
           proteins, peptides, and glycopeptides].
          Length = 863

 Score = 27.7 bits (62), Expect = 9.2
 Identities = 11/28 (39%), Positives = 16/28 (57%)

Query: 168 PVIIRKVKRAIRHPDFSLSNFNNDIALL 195
           P  + +VK  ++HP F L N N   AL+
Sbjct: 753 PDTLERVKALLQHPAFDLKNPNRVRALI 780


>gnl|CDD|205932 pfam13758, Prefoldin_3, Prefoldin subunit.  This family includes
           prefoldin subunits that are not detected by pfam02996.
          Length = 97

 Score = 26.1 bits (58), Expect = 9.2
 Identities = 15/38 (39%), Positives = 22/38 (57%), Gaps = 5/38 (13%)

Query: 146 VNPKEIKVTLGEHDRL--SKNESVPVIIRK---VKRAI 178
           VN KEIK  LGE   +  ++ + V +I R+   VK+ I
Sbjct: 45  VNEKEIKEILGEKQGITRTRQQVVDLISRRIDYVKQNI 82


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.132    0.388 

Gapped
Lambda     K      H
   0.267   0.0738    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,969,074
Number of extensions: 1387356
Number of successful extensions: 916
Number of sequences better than 10.0: 1
Number of HSP's gapped: 897
Number of HSP's successfully gapped: 24
Length of query: 302
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 206
Effective length of database: 6,679,618
Effective search space: 1376001308
Effective search space used: 1376001308
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.5 bits)