RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12894
(124 letters)
>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease. Many of these
are synthesised as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. A few, however, are active as single chain
molecules, and others are inactive due to substitutions
of the catalytic triad residues.
Length = 229
Score = 96.2 bits (240), Expect = 7e-26
Identities = 36/79 (45%), Positives = 46/79 (58%), Gaps = 6/79 (7%)
Query: 46 RIVGGNVTKLHEFPWIAALT-KKGKFYCGATLIAKRHVLTAAHCIEGVNPKEIKVTLGEH 104
RIVGG+ + FPW +L G+ +CG +LI+ R VLTAAHC+ G +P I+V LG H
Sbjct: 1 RIVGGSEANIGSFPWQVSLQYGGGRHFCGGSLISPRWVLTAAHCVRGSDPSNIRVRLGSH 60
Query: 105 DRLSKNESV-----PVIIH 118
D S E VIIH
Sbjct: 61 DLSSGEEGQVIKVSKVIIH 79
>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
synthesized as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. Alignment contains also inactive enzymes
that have substitutions of the catalytic triad residues.
Length = 232
Score = 92.0 bits (229), Expect = 4e-24
Identities = 32/79 (40%), Positives = 41/79 (51%), Gaps = 7/79 (8%)
Query: 47 IVGGNVTKLHEFPWIAALT-KKGKFYCGATLIAKRHVLTAAHCIEGVNPKEIKVTLGEHD 105
IVGG+ K+ FPW +L G+ +CG +LI+ R VLTAAHC+ P V LG HD
Sbjct: 1 IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCVYSSAPSNYTVRLGSHD 60
Query: 106 RLSKNESV------PVIIH 118
S VI+H
Sbjct: 61 LSSNEGGGQVIKVKKVIVH 79
>gnl|CDD|215708 pfam00089, Trypsin, Trypsin.
Length = 218
Score = 67.1 bits (164), Expect = 9e-15
Identities = 25/71 (35%), Positives = 38/71 (53%), Gaps = 3/71 (4%)
Query: 47 IVGGNVTKLHEFPWIAALTKKGKFY-CGATLIAKRHVLTAAHCIEGVNPKEIKVTLGEHD 105
IVGG+ + FPW +L + CG +LI++ VLTAAHC+ N K ++V LG H+
Sbjct: 1 IVGGDEAQPGSFPWQVSLQVSSGKHFCGGSLISENWVLTAAHCVS--NAKSVRVVLGAHN 58
Query: 106 RLSKNESVPVI 116
+ +
Sbjct: 59 IVLREGGEQKF 69
>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
[Posttranslational modification, protein turnover,
chaperones].
Length = 413
Score = 51.4 bits (123), Expect = 9e-09
Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 5/62 (8%)
Query: 46 RIVGGNVTKLHEFPWIAALTKKGKF-----YCGATLIAKRHVLTAAHCIEGVNPKEIKVT 100
RI+GG+ E+P + AL + +CG + + R+VLTAAHC + +P V
Sbjct: 32 RIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADASSPISSDVN 91
Query: 101 LG 102
Sbjct: 92 RV 93
>gnl|CDD|226119 COG3591, COG3591, V8-like Glu-specific endopeptidase [Amino acid
transport and metabolism].
Length = 251
Score = 35.8 bits (83), Expect = 0.002
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 3/41 (7%)
Query: 52 VTKLHEFPW--IAAL-TKKGKFYCGATLIAKRHVLTAAHCI 89
VT +FP+ + G+ ATLI VLTA HCI
Sbjct: 43 VTDTTQFPYSAVVQFEAATGRLCTAATLIGPNTVLTAGHCI 83
>gnl|CDD|222077 pfam13365, Trypsin_2, Trypsin-like peptidase domain. This family
includes trypsin like peptidase domains.
Length = 138
Score = 30.2 bits (68), Expect = 0.14
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 78 AKRHVLTAAHCIEGVNPKEIKVTLGEHDR 106
+ +LT AH +E + EI+V L + R
Sbjct: 8 SDGLILTNAHVVEDADASEIEVVLPDGGR 36
>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986). This
domain is found in serine proteases and is predicted to
contain disulphide bonds.
Length = 267
Score = 30.4 bits (68), Expect = 0.19
Identities = 16/56 (28%), Positives = 25/56 (44%), Gaps = 2/56 (3%)
Query: 51 NVTKLHEFPWIAALTKKGKFYCGATLIAKRHVLTAAHCIEGVNPKE--IKVTLGEH 104
+ +PWIA + +G + C LI VL + C+ + + I V LG H
Sbjct: 9 TFKEELHWPWIAKVYVEGNYRCTGVLIDLSWVLVSHSCLWDTSLEHSYISVVLGGH 64
>gnl|CDD|217716 pfam03761, DUF316, Domain of unknown function (DUF316). This
family of proteins with unknown function are from
Caenorhabditis elegans. The protein has GO references
indicating the protein is a positive regulator of
growth rate and is also involved in nematode larval
development.
Length = 280
Score = 29.3 bits (66), Expect = 0.43
Identities = 13/58 (22%), Positives = 25/58 (43%), Gaps = 6/58 (10%)
Query: 38 CGQ--VERNQRIVGGNVTKLHEFPWIAALTKK----GKFYCGATLIAKRHVLTAAHCI 89
CG + + + G + E+PW+ + + AT I+ RH+LT++
Sbjct: 31 CGNKTLPLPSQNINGIYLEKSEYPWLVKAAFQNGNQKNYKPPATFISTRHILTSSRLF 88
>gnl|CDD|188678 cd08723, RGS_RGS21, Regulator of G protein signaling (RGS) domain
found in the RGS21 protein. The RGS (Regulator of
G-protein Signaling) domain is an essential part RGS21
protein, a member of RGS protein family. They are a
diverse group of multifunctional proteins that regulate
cellular signaling events downstream of G-protein
coupled receptors (GPCRs). RGS proteins play critical
regulatory roles as GTPase activating proteins (GAPs) of
the heterotrimeric G-protein G-alpha-subunits. As a
major G-protein regulator, RGS domain containing
proteins are involved in many crucial cellular
processes. RGS proteins regulate many aspects of
embryonic development such as glial differentiation,
embryonic axis formation, skeletal and muscle
development, cell migration during early embryogenesis,
apoptosis, and cell proliferation, as well as modulation
of cardiac development. RGS21 is a member of the R4/RGS
subfamily and its mRNA was detected only in sensory
taste cells that express sweet taste receptors and the
taste G-alpha subunit, gustducin, suggesting a potential
role in regulating taste transduction.
Length = 111
Score = 27.3 bits (60), Expect = 1.3
Identities = 21/64 (32%), Positives = 31/64 (48%), Gaps = 5/64 (7%)
Query: 57 EFPWIAALT-KKGKFYCGATLIAKRHVLTAAHCIEGVNPKEIKVTLGEHDRLSKNESVPV 115
EF W+A KK K +T IA + + + I+ PKEI + D +S+N S P
Sbjct: 25 EF-WLACEDFKKTK---SSTEIALKAQMIYSEFIQADAPKEINIDFHTRDLISQNISEPT 80
Query: 116 IIHF 119
+ F
Sbjct: 81 LKCF 84
>gnl|CDD|176492 cd08762, Cyt_b561_CYBASC3, Vertebrate cytochrome b(561), CYBASC3
gene product. Cytochrome b ascorbate-dependent 3, as
found in vertebrates, which might act as a
ferric-chelate reductase, catalyzing the reduction of
Fe(3+) to Fe(2+), such as associated with the transport
of iron from the endosome to the cytoplasm. It is
assumed that this protein uses ascorbate as the electron
donor. Belongs to the cytochrome b(561) family, which
are secretory vesicle-specific electron transport
proteins. Cytochromes b(561) are integral membrane
proteins that bind two heme groups non-covalently, and
may have six alpha-helical trans-membrane segments.
Length = 179
Score = 26.0 bits (57), Expect = 4.6
Identities = 14/36 (38%), Positives = 20/36 (55%), Gaps = 4/36 (11%)
Query: 60 WIAALTKKGKFYCGATLIAKRHVLTAAHCIEGVNPK 95
W+ AL K + GA ++ VL+ A CI G+N K
Sbjct: 138 WLRALVKPIHVFFGAMIL----VLSIASCISGINEK 169
>gnl|CDD|227435 COG5104, PRP40, Splicing factor [RNA processing and
modification].
Length = 590
Score = 26.6 bits (58), Expect = 4.9
Identities = 9/30 (30%), Positives = 14/30 (46%)
Query: 42 ERNQRIVGGNVTKLHEFPWIAALTKKGKFY 71
E+ + ++ G+ L PW T GK Y
Sbjct: 40 EKPKELLKGSEEDLDVDPWKECRTADGKVY 69
>gnl|CDD|189036 cd09866, PIN_Fcf1-Utp23-H, PIN domain of rRNA-processing protein
Fcf1- and Utp23-like homologs found in eukaryotes except
fungi. PIN domain homologs of Fcf1/Utp24
(FAF1-copurifying factor 1/U three-associated protein
24) and Utp23, essential proteins involved in pre-rRNA
processing and 40S ribosomal subunit assembly, are
included in this subfamily. Fcf1 is a component of the
small subunit (SSU) processome and an essential
nucleolar protein required for processing of the 18S
pre-rRNA at sites A0-A2. These PIN (PilT N terminus)
domains are homologs of flap endonuclease-1 (FEN1)-like
PIN domains, but apparently lack the H3TH domain or
extensive arch/clamp region seen in the latter. PIN
domains typically contain three or four conserved acidic
residues (putative metal-binding, active site residues).
The Fcf1-Utp23 homolog PIN domain subfamily has three of
these conserved acidic residues rather than the four
seen in the Fcf1 PIN domain subfamily.
Length = 147
Score = 25.6 bits (57), Expect = 6.0
Identities = 15/40 (37%), Positives = 20/40 (50%), Gaps = 10/40 (25%)
Query: 61 IAALTKKGKFYCGATLIAKR-------H---VLTAAHCIE 90
IA L K G GA LIAKR H ++A+ C++
Sbjct: 65 IAELEKLGPDLYGALLIAKRFQVAKCGHEKSPVSASECLK 104
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.133 0.401
Gapped
Lambda K H
0.267 0.0738 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,959,106
Number of extensions: 489869
Number of successful extensions: 343
Number of sequences better than 10.0: 1
Number of HSP's gapped: 338
Number of HSP's successfully gapped: 18
Length of query: 124
Length of database: 10,937,602
Length adjustment: 85
Effective length of query: 39
Effective length of database: 7,167,512
Effective search space: 279532968
Effective search space used: 279532968
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 53 (24.2 bits)