RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12894
         (124 letters)



>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease.  Many of these
           are synthesised as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. A few, however, are active as single chain
           molecules, and others are inactive due to substitutions
           of the catalytic triad residues.
          Length = 229

 Score = 96.2 bits (240), Expect = 7e-26
 Identities = 36/79 (45%), Positives = 46/79 (58%), Gaps = 6/79 (7%)

Query: 46  RIVGGNVTKLHEFPWIAALT-KKGKFYCGATLIAKRHVLTAAHCIEGVNPKEIKVTLGEH 104
           RIVGG+   +  FPW  +L    G+ +CG +LI+ R VLTAAHC+ G +P  I+V LG H
Sbjct: 1   RIVGGSEANIGSFPWQVSLQYGGGRHFCGGSLISPRWVLTAAHCVRGSDPSNIRVRLGSH 60

Query: 105 DRLSKNESV-----PVIIH 118
           D  S  E        VIIH
Sbjct: 61  DLSSGEEGQVIKVSKVIIH 79


>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
           synthesized as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. Alignment contains also inactive enzymes
           that have substitutions of the catalytic triad residues.
          Length = 232

 Score = 92.0 bits (229), Expect = 4e-24
 Identities = 32/79 (40%), Positives = 41/79 (51%), Gaps = 7/79 (8%)

Query: 47  IVGGNVTKLHEFPWIAALT-KKGKFYCGATLIAKRHVLTAAHCIEGVNPKEIKVTLGEHD 105
           IVGG+  K+  FPW  +L    G+ +CG +LI+ R VLTAAHC+    P    V LG HD
Sbjct: 1   IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCVYSSAPSNYTVRLGSHD 60

Query: 106 RLSKNESV------PVIIH 118
             S            VI+H
Sbjct: 61  LSSNEGGGQVIKVKKVIVH 79


>gnl|CDD|215708 pfam00089, Trypsin, Trypsin. 
          Length = 218

 Score = 67.1 bits (164), Expect = 9e-15
 Identities = 25/71 (35%), Positives = 38/71 (53%), Gaps = 3/71 (4%)

Query: 47  IVGGNVTKLHEFPWIAALTKKGKFY-CGATLIAKRHVLTAAHCIEGVNPKEIKVTLGEHD 105
           IVGG+  +   FPW  +L      + CG +LI++  VLTAAHC+   N K ++V LG H+
Sbjct: 1   IVGGDEAQPGSFPWQVSLQVSSGKHFCGGSLISENWVLTAAHCVS--NAKSVRVVLGAHN 58

Query: 106 RLSKNESVPVI 116
            + +       
Sbjct: 59  IVLREGGEQKF 69


>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 413

 Score = 51.4 bits (123), Expect = 9e-09
 Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 5/62 (8%)

Query: 46  RIVGGNVTKLHEFPWIAALTKKGKF-----YCGATLIAKRHVLTAAHCIEGVNPKEIKVT 100
           RI+GG+     E+P + AL  +        +CG + +  R+VLTAAHC +  +P    V 
Sbjct: 32  RIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADASSPISSDVN 91

Query: 101 LG 102
             
Sbjct: 92  RV 93


>gnl|CDD|226119 COG3591, COG3591, V8-like Glu-specific endopeptidase [Amino acid
          transport and metabolism].
          Length = 251

 Score = 35.8 bits (83), Expect = 0.002
 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 3/41 (7%)

Query: 52 VTKLHEFPW--IAAL-TKKGKFYCGATLIAKRHVLTAAHCI 89
          VT   +FP+  +       G+    ATLI    VLTA HCI
Sbjct: 43 VTDTTQFPYSAVVQFEAATGRLCTAATLIGPNTVLTAGHCI 83


>gnl|CDD|222077 pfam13365, Trypsin_2, Trypsin-like peptidase domain.  This family
           includes trypsin like peptidase domains.
          Length = 138

 Score = 30.2 bits (68), Expect = 0.14
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 78  AKRHVLTAAHCIEGVNPKEIKVTLGEHDR 106
           +   +LT AH +E  +  EI+V L +  R
Sbjct: 8   SDGLILTNAHVVEDADASEIEVVLPDGGR 36


>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986).  This
           domain is found in serine proteases and is predicted to
           contain disulphide bonds.
          Length = 267

 Score = 30.4 bits (68), Expect = 0.19
 Identities = 16/56 (28%), Positives = 25/56 (44%), Gaps = 2/56 (3%)

Query: 51  NVTKLHEFPWIAALTKKGKFYCGATLIAKRHVLTAAHCIEGVNPKE--IKVTLGEH 104
              +   +PWIA +  +G + C   LI    VL +  C+   + +   I V LG H
Sbjct: 9   TFKEELHWPWIAKVYVEGNYRCTGVLIDLSWVLVSHSCLWDTSLEHSYISVVLGGH 64


>gnl|CDD|217716 pfam03761, DUF316, Domain of unknown function (DUF316).  This
          family of proteins with unknown function are from
          Caenorhabditis elegans. The protein has GO references
          indicating the protein is a positive regulator of
          growth rate and is also involved in nematode larval
          development.
          Length = 280

 Score = 29.3 bits (66), Expect = 0.43
 Identities = 13/58 (22%), Positives = 25/58 (43%), Gaps = 6/58 (10%)

Query: 38 CGQ--VERNQRIVGGNVTKLHEFPWIAALTKK----GKFYCGATLIAKRHVLTAAHCI 89
          CG   +    + + G   +  E+PW+     +      +   AT I+ RH+LT++   
Sbjct: 31 CGNKTLPLPSQNINGIYLEKSEYPWLVKAAFQNGNQKNYKPPATFISTRHILTSSRLF 88


>gnl|CDD|188678 cd08723, RGS_RGS21, Regulator of G protein signaling (RGS) domain
           found in the RGS21 protein.  The RGS (Regulator of
           G-protein Signaling) domain is an essential part RGS21
           protein, a member of RGS protein family. They are a
           diverse group of multifunctional proteins that regulate
           cellular signaling events downstream of G-protein
           coupled receptors (GPCRs). RGS proteins play critical
           regulatory roles as GTPase activating proteins (GAPs) of
           the heterotrimeric G-protein G-alpha-subunits. As a
           major G-protein regulator, RGS domain containing
           proteins are involved in many crucial cellular
           processes. RGS proteins regulate many aspects of
           embryonic development such as glial differentiation,
           embryonic axis formation, skeletal and muscle
           development, cell migration during early embryogenesis,
           apoptosis, and cell proliferation, as well as modulation
           of cardiac development.  RGS21 is a member of the R4/RGS
           subfamily and its mRNA was detected only in sensory
           taste cells that express sweet taste receptors and the
           taste G-alpha subunit, gustducin, suggesting a potential
           role in regulating taste transduction.
          Length = 111

 Score = 27.3 bits (60), Expect = 1.3
 Identities = 21/64 (32%), Positives = 31/64 (48%), Gaps = 5/64 (7%)

Query: 57  EFPWIAALT-KKGKFYCGATLIAKRHVLTAAHCIEGVNPKEIKVTLGEHDRLSKNESVPV 115
           EF W+A    KK K    +T IA +  +  +  I+   PKEI +     D +S+N S P 
Sbjct: 25  EF-WLACEDFKKTK---SSTEIALKAQMIYSEFIQADAPKEINIDFHTRDLISQNISEPT 80

Query: 116 IIHF 119
           +  F
Sbjct: 81  LKCF 84


>gnl|CDD|176492 cd08762, Cyt_b561_CYBASC3, Vertebrate cytochrome b(561), CYBASC3
           gene product.  Cytochrome b ascorbate-dependent 3, as
           found in vertebrates, which might act as a
           ferric-chelate reductase, catalyzing the reduction of
           Fe(3+) to Fe(2+), such as associated with the transport
           of iron from the endosome to the cytoplasm. It is
           assumed that this protein uses ascorbate as the electron
           donor. Belongs to the cytochrome b(561) family, which
           are secretory vesicle-specific electron transport
           proteins. Cytochromes b(561) are integral membrane
           proteins that bind two heme groups non-covalently, and
           may have six alpha-helical trans-membrane segments.
          Length = 179

 Score = 26.0 bits (57), Expect = 4.6
 Identities = 14/36 (38%), Positives = 20/36 (55%), Gaps = 4/36 (11%)

Query: 60  WIAALTKKGKFYCGATLIAKRHVLTAAHCIEGVNPK 95
           W+ AL K    + GA ++    VL+ A CI G+N K
Sbjct: 138 WLRALVKPIHVFFGAMIL----VLSIASCISGINEK 169


>gnl|CDD|227435 COG5104, PRP40, Splicing factor [RNA processing and
          modification].
          Length = 590

 Score = 26.6 bits (58), Expect = 4.9
 Identities = 9/30 (30%), Positives = 14/30 (46%)

Query: 42 ERNQRIVGGNVTKLHEFPWIAALTKKGKFY 71
          E+ + ++ G+   L   PW    T  GK Y
Sbjct: 40 EKPKELLKGSEEDLDVDPWKECRTADGKVY 69


>gnl|CDD|189036 cd09866, PIN_Fcf1-Utp23-H, PIN domain of rRNA-processing protein
           Fcf1- and Utp23-like homologs found in eukaryotes except
           fungi.  PIN domain homologs of Fcf1/Utp24
           (FAF1-copurifying factor 1/U three-associated protein
           24) and Utp23, essential proteins involved in pre-rRNA
           processing and 40S ribosomal subunit assembly, are
           included in this subfamily. Fcf1 is a component of the
           small subunit (SSU) processome and an essential
           nucleolar protein required for processing of the 18S
           pre-rRNA at sites A0-A2. These PIN (PilT N terminus)
           domains are homologs of flap endonuclease-1 (FEN1)-like
           PIN domains, but apparently lack the H3TH domain or
           extensive arch/clamp region seen in the latter. PIN
           domains typically contain three or four conserved acidic
           residues (putative metal-binding, active site residues).
           The Fcf1-Utp23 homolog PIN domain subfamily has three of
           these conserved acidic residues rather than the four
           seen in the Fcf1 PIN domain subfamily.
          Length = 147

 Score = 25.6 bits (57), Expect = 6.0
 Identities = 15/40 (37%), Positives = 20/40 (50%), Gaps = 10/40 (25%)

Query: 61  IAALTKKGKFYCGATLIAKR-------H---VLTAAHCIE 90
           IA L K G    GA LIAKR       H    ++A+ C++
Sbjct: 65  IAELEKLGPDLYGALLIAKRFQVAKCGHEKSPVSASECLK 104


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.133    0.401 

Gapped
Lambda     K      H
   0.267   0.0738    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,959,106
Number of extensions: 489869
Number of successful extensions: 343
Number of sequences better than 10.0: 1
Number of HSP's gapped: 338
Number of HSP's successfully gapped: 18
Length of query: 124
Length of database: 10,937,602
Length adjustment: 85
Effective length of query: 39
Effective length of database: 7,167,512
Effective search space: 279532968
Effective search space used: 279532968
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 53 (24.2 bits)