RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy2626
(355 letters)
>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
synthesized as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. Alignment contains also inactive enzymes
that have substitutions of the catalytic triad residues.
Length = 232
Score = 256 bits (655), Expect = 9e-85
Identities = 101/247 (40%), Positives = 146/247 (59%), Gaps = 16/247 (6%)
Query: 108 IVGGTPSERGQWPWMAAIGFKIRRRRANGKPEWMCGGALITKRYVLTAAHCVSPDTTGEY 167
IVGG+ ++ G +PW ++ + R CGG+LI+ R+VLTAAHCV Y
Sbjct: 1 IVGGSEAKIGSFPWQVSLQYTGGR--------HFCGGSLISPRWVLTAAHCVYSSAPSNY 52
Query: 168 EPYIVHVGSIDL-EDTSSGVSIEIERPIIHEQYTSARKLNDIALFRLREDAPLSDLIQPI 226
V +GS DL + G I++++ I+H Y + NDIAL +L+ LSD ++PI
Sbjct: 53 T---VRLGSHDLSSNEGGGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPI 109
Query: 227 CLPYDTNLRSELFERKTPFVAGWGSTVFRGPLSPKLRHVQISVVDNPKCRQIFSNYGATI 286
CLP + L T V+GWG T GPL L+ V + +V N +C++ +S YG TI
Sbjct: 110 CLP---SSGYNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAYS-YGGTI 165
Query: 287 NENILCAGVLSGGKDSCGGDSGGPLMYPLDTKYYIIGVVSYGKKCAEVGFPGVYTRVTNY 346
+N+LCAG L GGKD+C GDSGGPL+ + + ++G+VS+G CA +PGVYTRV++Y
Sbjct: 166 TDNMLCAGGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSY 225
Query: 347 IQWIADN 353
+ WI
Sbjct: 226 LDWIQKT 232
>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease. Many of these
are synthesised as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. A few, however, are active as single chain
molecules, and others are inactive due to substitutions
of the catalytic triad residues.
Length = 229
Score = 253 bits (648), Expect = 9e-84
Identities = 102/245 (41%), Positives = 141/245 (57%), Gaps = 17/245 (6%)
Query: 107 RIVGGTPSERGQWPWMAAIGFKIRRRRANGKPEWMCGGALITKRYVLTAAHCVSPDTTGE 166
RIVGG+ + G +PW ++ + R CGG+LI+ R+VLTAAHCV
Sbjct: 1 RIVGGSEANIGSFPWQVSLQYGGGR--------HFCGGSLISPRWVLTAAHCVRGSDPSN 52
Query: 167 YEPYIVHVGSIDLEDTSSGVSIEIERPIIHEQYTSARKLNDIALFRLREDAPLSDLIQPI 226
V +GS DL G I++ + IIH Y + NDIAL +L+E LSD ++PI
Sbjct: 53 IR---VRLGSHDLSSGEEGQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPI 109
Query: 227 CLPYDTNLRSELFERKTPFVAGWGSTVFR-GPLSPKLRHVQISVVDNPKCRQIFSNYGAT 285
CLP + + T V+GWG T G L L+ V + +V N CR+ +S G
Sbjct: 110 CLP---SSNYNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAYSGGG-A 165
Query: 286 INENILCAGVLSGGKDSCGGDSGGPLMYPLDTKYYIIGVVSYGKKCAEVGFPGVYTRVTN 345
I +N+LCAG L GGKD+C GDSGGPL+ D ++ ++G+VS+G CA G PGVYTRV++
Sbjct: 166 ITDNMLCAGGLEGGKDACQGDSGGPLVC-NDGRWVLVGIVSWGSGCARPGKPGVYTRVSS 224
Query: 346 YIQWI 350
Y+ WI
Sbjct: 225 YLDWI 229
>gnl|CDD|215708 pfam00089, Trypsin, Trypsin.
Length = 218
Score = 195 bits (498), Expect = 2e-61
Identities = 91/244 (37%), Positives = 134/244 (54%), Gaps = 27/244 (11%)
Query: 108 IVGGTPSERGQWPWMAAIGFKIRRRRANGKPEWMCGGALITKRYVLTAAHCVSPDTTGEY 167
IVGG ++ G +PW ++ + CGG+LI++ +VLTAAHCVS
Sbjct: 1 IVGGDEAQPGSFPWQVSL--------QVSSGKHFCGGSLISENWVLTAAHCVS-----NA 47
Query: 168 EPYIVHVGSIDLEDTSSGVS-IEIERPIIHEQYTSARKLNDIALFRLREDAPLSDLIQPI 226
+ V +G+ ++ G ++++ I+H Y NDIAL +L+ L D ++PI
Sbjct: 48 KSVRVVLGAHNIVLREGGEQKFDVKKVIVHPNYNP-DTDNDIALLKLKSPVTLGDTVRPI 106
Query: 227 CLPYDTNLRSELFERKTPFVAGWGSTVFRGPLSPKLRHVQISVVDNPKCRQIFSNYGATI 286
CLP S+L T V+GWG+T G L+ V + VV CR S YG T+
Sbjct: 107 CLPTA---SSDLPVGTTCTVSGWGNTKTLGL-PDTLQEVTVPVVSRETCR---SAYGGTV 159
Query: 287 NENILCAGVLSGGKDSCGGDSGGPLMYPLDTKYYIIGVVSYGKKCAEVGFPGVYTRVTNY 346
+N++CAG +GGKD+C GDSGGPL+ +IG+VS+G CA +PGVYT V++Y
Sbjct: 160 TDNMICAG--AGGKDACQGDSGGPLVCSDG---ELIGIVSWGYGCASGNYPGVYTPVSSY 214
Query: 347 IQWI 350
+ WI
Sbjct: 215 LDWI 218
>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
[Posttranslational modification, protein turnover,
chaperones].
Length = 413
Score = 121 bits (305), Expect = 4e-31
Identities = 82/263 (31%), Positives = 112/263 (42%), Gaps = 19/263 (7%)
Query: 99 VNAFNSSKRIVGGTPSERGQWPWMAAIGFKIRRRRANGKPEWMCGGALITKRYVLTAAHC 158
A S RI+GG+ + G++P + A+ +I +G CGG+ + RYVLTAAHC
Sbjct: 24 QTADEVSSRIIGGSNANAGEYPSLVALVDRISDY-VSGT---FCGGSKLGGRYVLTAAHC 79
Query: 159 VSPDTTGEYEPYIVHVGSIDLEDTSSGVSIEIERPIIHEQYTSARKLNDIALFRLREDAP 218
+ + V V DL D+S + +HE Y+ NDIA+ L A
Sbjct: 80 ADASSPISSDVNRVVV---DLNDSSQAERGHVRTIYVHEFYSPGNLGNDIAVLELARAAS 136
Query: 219 LSDLIQPICLPYDTNLRSELFERKTPFVAGWGSTVFRGP--LSPK---LRHVQISVVDNP 273
L + DT + + +G T SPK L V + V
Sbjct: 137 LPRVKITSFDASDT-FLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTILHEVAVLFVPLS 195
Query: 274 KCRQIFSNYGATINENIL---CAGVLSGGKDSCGGDSGGPLMYPLDTKYYIIGVVSYG-K 329
C Q A+ L CAG KD+C GDSGGP+ + + GVVS+G
Sbjct: 196 TCAQYKGCANASDGATGLTGFCAG--RPPKDACQGDSGGPIFHKGEEGRVQRGVVSWGDG 253
Query: 330 KCAEVGFPGVYTRVTNYIQWIAD 352
C PGVYT V+NY WIA
Sbjct: 254 GCGGTLIPGVYTNVSNYQDWIAA 276
>gnl|CDD|221387 pfam12032, CLIP, Regulatory CLIP domain of proteinases. CLIP is
a regulatory domain which controls the proteinase
action of various proteins of the trypsin family, e.g.
easter and pap2. The CLIP domain remains linked to the
protease domain after cleavage of a conserved residue
which retains the protein in zymogen form. It is named
CLIP because it can be drawn in the shape of a paper
clip. It has many disulphide bonds and highly conserved
cysteine residues, and so it folds extensively.
Length = 54
Score = 65.9 bits (161), Expect = 4e-14
Identities = 24/54 (44%), Positives = 30/54 (55%), Gaps = 1/54 (1%)
Query: 10 CSTPRNEIGTCISIRQCRYLYDILNNERNNPRAVQYVKNSHCGLEGRL-PKVCC 62
C+TP E GTC+SIR+C L IL P +++ S CG EG P VCC
Sbjct: 1 CTTPNGEPGTCVSIRECPSLLKILKKPNLTPEDTNFLRKSQCGFEGSNDPLVCC 54
>gnl|CDD|197829 smart00680, CLIP, Clip or disulphide knot domain. Present in
horseshoe crab proclotting enzyme N-terminal domain,
Drosophila Easter and silkworm
prophenoloxidase-activating enzyme.
Length = 52
Score = 57.1 bits (138), Expect = 4e-11
Identities = 21/54 (38%), Positives = 28/54 (51%), Gaps = 2/54 (3%)
Query: 10 CSTPRNEIGTCISIRQCRYLYDILNNERNNPRAVQYVKNSHCGLEGRLPKVCCP 63
C TP E GTC+ I C L +L + P + +++ S CG R P VCCP
Sbjct: 1 CRTPDGERGTCVPISDCPSLLSLLKK--DPPEDLNFLRKSQCGFGNREPLVCCP 52
>gnl|CDD|226119 COG3591, COG3591, V8-like Glu-specific endopeptidase [Amino acid
transport and metabolism].
Length = 251
Score = 33.9 bits (78), Expect = 0.094
Identities = 45/244 (18%), Positives = 70/244 (28%), Gaps = 46/244 (18%)
Query: 118 QWPWMAAIGFKIRRRRANGKPEWMCGGALITKRYVLTAAHCVSPDTTGEYEPYIVHVGSI 177
Q+P+ +A+ + LI VLTA HC+ Y G
Sbjct: 48 QFPY-SAVV-----QFEAATGRLCTAATLIGPNTVLTAGHCI----------YSPDYGED 91
Query: 178 DLEDTSSGVSIEIERPIIHEQYTSARKLNDIALFRLREDAPLSDLIQPICLPYDTNLRSE 237
D+ GV+ + P R +ED D+ +
Sbjct: 92 DIAAAPPGVNSD-GGPFYGITKIEIRVYPG---ELYKEDGASYDVGEAA----------- 136
Query: 238 LFERKTPFVAGWGSTVFRGPLSPKLRHVQISVV-----DNPKCRQIFSNYGATINENILC 292
E + + + V D P ++ + G N +
Sbjct: 137 -LESGINIGDVVNYLKR--NTASEAKANDRITVIGYPGDKPNIGTMWESTGKV---NSIK 190
Query: 293 AGVLSGGKDSCGGDSGGPLMYPLDTKYYIIGVVSYGKKCAEVGFPGVYTRVTNYI-QWIA 351
L D+ G SG P+ L +K +IGV G R+T I +I
Sbjct: 191 GNKLFYDADTLPGSSGSPV---LISKDEVIGVHYNGPGANGGSLANNAVRLTPEILNFIQ 247
Query: 352 DNIS 355
NI
Sbjct: 248 QNIK 251
>gnl|CDD|131967 TIGR02921, PEP_integral, PEP-CTERM family integral membrane
protein. Members of this protein family, found in three
different species so far, have a PEP-CTERM sequence at
the carboxyl-terminus (see model TIGR02595), but are
unusual among PEP-CTERM proteins in having multiple
predicted transmembrane segments. The function is
unknown. It is proposed that a member of the EpsH
family, to be designated exosortase (see TIGR02602),
recognizes and cleaves PEP-CTERM proteins in a manner
analogous to the cleavage of LPXTG proteins by sortase
(see Haft, et al., 2006).
Length = 952
Score = 29.2 bits (65), Expect = 3.7
Identities = 16/44 (36%), Positives = 23/44 (52%), Gaps = 1/44 (2%)
Query: 165 GEYEPYIVHVGSID-LEDTSSGVSIEIERPIIHEQYTSARKLND 207
GE H G ID L+D+ GV+++I I Q+ R L+D
Sbjct: 762 GETLAGAYHDGIIDLLKDSGGGVALDIAEAIAAHQFAQKRALDD 805
>gnl|CDD|213027 cd10970, CE4_DAC_u1_6s, Putative catalytic NodB homology domain of
uncharacterized prokaryotic polysaccharide deacetylases
which consist of a 6-stranded beta/alpha barrel. This
family contains uncharacterized prokaryotic
polysaccharide deacetylases. Although their biological
functions remain unknown, all members of the family
contain a conserved domain with a 6-stranded beta/alpha
barrel, which is similar to the catalytic NodB homology
domain of rhizobial NodB-like proteins, belonging to the
larger carbohydrate esterase 4 (CE4) superfamily.
Length = 194
Score = 28.4 bits (64), Expect = 3.8
Identities = 10/56 (17%), Positives = 24/56 (42%)
Query: 145 ALITKRYVLTAAHCVSPDTTGEYEPYIVHVGSIDLEDTSSGVSIEIERPIIHEQYT 200
L+ + Y L + P+ +PY + + + + T+ V ++R + Q+
Sbjct: 113 ELVREYYDLGRSGGGGPNGRPPLDPYRLRRVTGEADTTTEEVKTLLDRAAAYNQWL 168
>gnl|CDD|224400 COG1483, COG1483, Predicted ATPase (AAA+ superfamily) [General
function prediction only].
Length = 774
Score = 29.3 bits (66), Expect = 4.0
Identities = 12/59 (20%), Positives = 19/59 (32%), Gaps = 5/59 (8%)
Query: 201 SARKLNDIALFRL--REDAPLSDLIQPICLP-YDTNLRSELFERKTPFVAGWGSTVFRG 256
K+ I + L RE +I P +P D + LF + F+
Sbjct: 347 DLIKITKIVVRELFDREAKEA--IIMPHHIPLSDDYSKESLFLKNITFIDYVSELDIDL 403
>gnl|CDD|233632 TIGR01910, DapE-ArgE, acetylornithine deacetylase or
succinyl-diaminopimelate desuccinylase. This group of
sequences contains annotations for both acetylornithine
deacetylase and succinyl-diaminopimelate desuccinylase,
but does not contain any members with experimental
characterization. Bacillus, Staphylococcus and
Sulfolobus species contain multiple hits to this
subfamily and each may have a separate activity.
Determining which is which must await further laboratory
research [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 375
Score = 28.9 bits (65), Expect = 4.4
Identities = 14/56 (25%), Positives = 20/56 (35%), Gaps = 4/56 (7%)
Query: 142 CGGA----LITKRYVLTAAHCVSPDTTGEYEPYIVHVGSIDLEDTSSGVSIEIERP 193
G A L+ + Y A + P+ +G I H GSI + G P
Sbjct: 141 SGEAGTLYLLQRGYFKDADGVLIPEPSGGDNIVIGHKGSIWFKLRVKGKQAHASFP 196
>gnl|CDD|239956 cd04583, CBS_pair_ABC_OpuCA_assoc2, This cd contains two tandem
repeats of the cystathionine beta-synthase (CBS pair)
domains in association with the ABC transporter OpuCA.
OpuCA is the ATP binding component of a bacterial solute
transporter that serves a protective role to cells
growing in a hyperosmolar environment but the function
of the CBS domains in OpuCA remains unknown. In the
related ABC transporter, OpuA, the tandem CBS domains
have been shown to function as sensors for ionic
strength, whereby they control the transport activity
through an electronic switching mechanism. ABC
transporters are a large family of proteins involved in
the transport of a wide variety of different compounds,
like sugars, ions, peptides, and more complex organic
molecules. They are a subset of nucleotide hydrolases
that contain a signature motif, Q-loop, and
H-loop/switch region, in addition to the Walker A
motif/P-loop and Walker B motif commonly found in a
number of ATP- and GTP-binding and hydrolyzing proteins.
CBS is a small domain originally identified in
cystathionine beta-synthase and subsequently found in a
wide range of different proteins. CBS domains usually
come in tandem repeats, which associate to form a
so-called Bateman domain or a CBS pair which is
reflected in this model. The interface between the two
CBS domains forms a cleft that is a potential ligand
binding site. The CBS pair coexists with a variety of
other functional domains. It has been proposed that the
CBS domain may play a regulatory role, although its
exact function is unknown.
Length = 109
Score = 27.5 bits (62), Expect = 4.7
Identities = 14/44 (31%), Positives = 23/44 (52%), Gaps = 7/44 (15%)
Query: 186 VSIEIERPIIHEQYTSARKLNDIAL---FRLREDAPLSDLIQPI 226
VS+E + + Y A+ L DI L F ++ DA L D++ +
Sbjct: 41 VSLES----LEQAYKEAKSLEDIMLEDVFTVQPDASLRDVLGLV 80
>gnl|CDD|233699 TIGR02050, gshA_cyan_rel, carboxylate-amine ligase, YbdK family.
This family represents a division of a larger family,
the other branch of which is predicted to act as
glutamate--cysteine ligase (the first of two enzymes in
glutathione biosynthesis) in the cyanobacteria. Species
containing this protein, however, are generally not
believe to make glutathione, and the function is
unknown.
Length = 287
Score = 28.5 bits (64), Expect = 4.7
Identities = 9/43 (20%), Positives = 18/43 (41%)
Query: 250 GSTVFRGPLSPKLRHVQISVVDNPKCRQIFSNYGATINENILC 292
G + P R + V DNP+ +++ YG + ++
Sbjct: 79 GLRICGAGTHPFARWRRQEVADNPRYQRLLERYGYVARQQLVF 121
>gnl|CDD|137339 PRK09499, sifB, secreted effector protein SifB; Provisional.
Length = 316
Score = 28.8 bits (64), Expect = 4.7
Identities = 14/37 (37%), Positives = 19/37 (51%), Gaps = 6/37 (16%)
Query: 4 YYDDS------LCSTPRNEIGTCISIRQCRYLYDILN 34
+Y D+ LC +I CI IRQ Y Y+I+N
Sbjct: 87 HYKDAECQYTNLCIKDGEDIPLCIMIRQDHYYYEIMN 123
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.137 0.436
Gapped
Lambda K H
0.267 0.0636 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,264,054
Number of extensions: 1731130
Number of successful extensions: 1157
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1135
Number of HSP's successfully gapped: 20
Length of query: 355
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 257
Effective length of database: 6,590,910
Effective search space: 1693863870
Effective search space used: 1693863870
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (27.1 bits)