RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10841
(286 letters)
>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
synthesized as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. Alignment contains also inactive enzymes
that have substitutions of the catalytic triad residues.
Length = 232
Score = 279 bits (715), Expect = 6e-95
Identities = 97/235 (41%), Positives = 133/235 (56%), Gaps = 10/235 (4%)
Query: 46 VVGGKKAELGAWPWLIAL-YRDGFFHCGGVVLDESWVMTAAHCVDGFEKHYFEVYAGMLR 104
+VGG +A++G++PW ++L Y G CGG ++ WV+TAAHCV + V G
Sbjct: 1 IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCVYSSAPSNYTVRLGSHD 60
Query: 105 RFSFSPTEQVRPVSRIVMHSMFKRAEMTNDLALLQLAAPLRYNRYVRPICLPDVTETPEP 164
S QV V ++++H + + ND+ALL+L P+ + VRPICLP
Sbjct: 61 LSSNEGGGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSGYNLPA 120
Query: 165 YSTCTAVGWGAVFEHGPDPDHMREVQVPILP--ACKHYEDRIADV----ICAGMPQGGRD 218
+TCT GWG E GP PD ++EV VPI+ CK + +CAG +GG+D
Sbjct: 121 GTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAYSYGGTITDNMLCAGGLEGGKD 180
Query: 219 TCQGDSGGPLLCPVPGSQGRWYVAGVVSHGEGCARPNEPGVYTRVSQFVPWLMSN 273
CQGDSGGPL+C GR + G+VS G GCARPN PGVYTRVS ++ W+
Sbjct: 181 ACQGDSGGPLVC---NDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQKT 232
>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease. Many of these
are synthesised as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. A few, however, are active as single chain
molecules, and others are inactive due to substitutions
of the catalytic triad residues.
Length = 229
Score = 264 bits (678), Expect = 3e-89
Identities = 97/233 (41%), Positives = 134/233 (57%), Gaps = 13/233 (5%)
Query: 45 RVVGGKKAELGAWPWLIALYRDGFFH-CGGVVLDESWVMTAAHCVDGFEKHYFEVYAGML 103
R+VGG +A +G++PW ++L G H CGG ++ WV+TAAHCV G + V G
Sbjct: 1 RIVGGSEANIGSFPWQVSLQYGGGRHFCGGSLISPRWVLTAAHCVRGSDPSNIRVRLGSH 60
Query: 104 RRFSFSPTEQVRPVSRIVMHSMFKRAEMTNDLALLQLAAPLRYNRYVRPICLPDVTETPE 163
S QV VS++++H + + ND+ALL+L P+ + VRPICLP
Sbjct: 61 DLSS-GEEGQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPSSNYNVP 119
Query: 164 PYSTCTAVGWGAV-FEHGPDPDHMREVQVPILP--ACKHYEDRIADV----ICAGMPQGG 216
+TCT GWG G PD ++EV VPI+ C+ + +CAG +GG
Sbjct: 120 AGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAYSGGGAITDNMLCAGGLEGG 179
Query: 217 RDTCQGDSGGPLLCPVPGSQGRWYVAGVVSHGEGCARPNEPGVYTRVSQFVPW 269
+D CQGDSGGPL+C + GRW + G+VS G GCARP +PGVYTRVS ++ W
Sbjct: 180 KDACQGDSGGPLVC----NDGRWVLVGIVSWGSGCARPGKPGVYTRVSSYLDW 228
>gnl|CDD|215708 pfam00089, Trypsin, Trypsin.
Length = 218
Score = 208 bits (531), Expect = 2e-67
Identities = 90/230 (39%), Positives = 127/230 (55%), Gaps = 17/230 (7%)
Query: 46 VVGGKKAELGAWPWLIALYRDGFFH-CGGVVLDESWVMTAAHCVDGFEKHYFEVYAGMLR 104
+VGG +A+ G++PW ++L H CGG ++ E+WV+TAAHCV V G
Sbjct: 1 IVGGDEAQPGSFPWQVSLQVSSGKHFCGGSLISENWVLTAAHCVSN--AKSVRVVLGAHN 58
Query: 105 RFSFSPTEQVRPVSRIVMHSMFKRAEMTNDLALLQLAAPLRYNRYVRPICLPDVTETPEP 164
EQ V ++++H + + ND+ALL+L +P+ VRPICLP +
Sbjct: 59 IVLREGGEQKFDVKKVIVHPNY-NPDTDNDIALLKLKSPVTLGDTVRPICLPTASSDLPV 117
Query: 165 YSTCTAVGWGAVFEHGPDPDHMREVQVPILP--ACK-HYEDRIAD-VICAGMPQGGRDTC 220
+TCT GWG G PD ++EV VP++ C+ Y + D +ICAG GG+D C
Sbjct: 118 GTTCTVSGWGNTKTLGL-PDTLQEVTVPVVSRETCRSAYGGTVTDNMICAGA--GGKDAC 174
Query: 221 QGDSGGPLLCPVPGSQGRWYVAGVVSHGEGCARPNEPGVYTRVSQFVPWL 270
QGDSGGPL+C + G+VS G GCA N PGVYT VS ++ W+
Sbjct: 175 QGDSGGPLVCSDG------ELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 218
>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
[Posttranslational modification, protein turnover,
chaperones].
Length = 413
Score = 102 bits (256), Expect = 7e-25
Identities = 82/265 (30%), Positives = 107/265 (40%), Gaps = 43/265 (16%)
Query: 36 AGNPILGSGRVVGGKKAELGAWPWLIAL------YRDGFFHCGGVVLDESWVMTAAHCVD 89
A S R++GG A G +P L+AL Y G F CGG L +V+TAAHC D
Sbjct: 23 AQTADEVSSRIIGGSNANAGEYPSLVALVDRISDYVSGTF-CGGSKLGGRYVLTAAHCAD 81
Query: 90 G---FEKHYFEVYAGMLRRFSFSPTEQVRPVSRIVMHSMFKRAEMTNDLALLQLAAPLRY 146
V + + S + V I +H + + ND+A+L+LA
Sbjct: 82 ASSPISSDVNRVVVDL----NDSSQAERGHVRTIYVHEFYSPGNLGNDIAVLELARAASL 137
Query: 147 NRYVRPICLPDVTETPEPYST----CTAVGWGAVFEHGPDPD------HMREVQVPI--L 194
R I D ++T T T +G P + EV V L
Sbjct: 138 PR--VKITSFDASDTFLNSVTTVSPMTNGTFGVT-TPSDVPRSSPKGTILHEVAVLFVPL 194
Query: 195 PACKHY-EDRIADV-------ICAGMPQGGRDTCQGDSGGPLLCPVPGSQGRWYVAGVVS 246
C Y A CAG P +D CQGDSGGP+ G +GR GVVS
Sbjct: 195 STCAQYKGCANASDGATGLTGFCAGRP--PKDACQGDSGGPIF--HKGEEGRVQR-GVVS 249
Query: 247 HGEG-CARPNEPGVYTRVSQFVPWL 270
G+G C PGVYT VS + W+
Sbjct: 250 WGDGGCGGTLIPGVYTNVSNYQDWI 274
>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986). This
domain is found in serine proteases and is predicted to
contain disulphide bonds.
Length = 267
Score = 74.7 bits (183), Expect = 1e-15
Identities = 32/120 (26%), Positives = 55/120 (45%), Gaps = 11/120 (9%)
Query: 57 WPWLIALYRDGFFHCGGVVLDESWVMTAAHCVDG--FEKHYFEVYAGMLR--RFSFSPTE 112
WPW+ +Y +G + C GV++D SWV+ + C+ E Y V G + + P E
Sbjct: 16 WPWIAKVYVEGNYRCTGVLIDLSWVLVSHSCLWDTSLEHSYISVVLGGHKTLKSVKGPYE 75
Query: 113 QVRPVSRIVMHSMFKRAEMTNDLALLQLAAPLRYNRYVRPICLPDVTETPEPYSTCTAVG 172
Q+ V + + + ++LL L +P ++ +V P +P E + C VG
Sbjct: 76 QIYRVD--CRKDLPR-----SKISLLHLKSPATFSNHVLPTFVPSTRNHNEKNNKCVTVG 128
>gnl|CDD|222077 pfam13365, Trypsin_2, Trypsin-like peptidase domain. This family
includes trypsin like peptidase domains.
Length = 138
Score = 32.5 bits (74), Expect = 0.088
Identities = 29/158 (18%), Positives = 42/158 (26%), Gaps = 31/158 (19%)
Query: 72 GGVVLDESWVMTAAHCVDGFEKHYFEVYAGMLRRFSFSPTEQVRPVSRIVMHSMFKRAEM 131
G VV + ++T AH V+ + EV V+
Sbjct: 3 GFVVGSDGLILTNAHVVEDADASEIEV------VLPDGGRVPAEVVAADPDL-------- 48
Query: 132 TNDLALLQLAAPLRYNRYVRPICLPDVTETPEPYSTCTAVGWGAVFEHGPDPDHMREVQV 191
DLALL++ PL L + P S G G + V
Sbjct: 49 --DLALLKVDGPLLP-----AAPLLASSAAPLGGSVVVVGGPGGIGLGASGGGGGVGGLV 101
Query: 192 PILPACKHYEDRIADVICAGMPQGGRDTCQGDSGGPLL 229
+ D + G SGGP+
Sbjct: 102 SGSLGGVDGRYILTDADTSP----------GSSGGPVF 129
>gnl|CDD|237610 PRK14099, PRK14099, glycogen synthase; Provisional.
Length = 485
Score = 30.1 bits (68), Expect = 1.6
Identities = 24/77 (31%), Positives = 30/77 (38%), Gaps = 7/77 (9%)
Query: 134 DLALLQLAAPLRYNRYVRPICLPDVTETPE-PY------STCTAVGWGAVFEHGPDPDHM 186
L L L AP Y+R P PD + P+ A+G G V PD H
Sbjct: 80 GLDLFVLDAPHLYDRPGNPYVGPDGKDWPDNAQRFAALARAAAAIGQGLVPGFVPDIVHA 139
Query: 187 REVQVPILPACKHYEDR 203
+ Q + PA HY R
Sbjct: 140 HDWQAGLAPAYLHYSGR 156
>gnl|CDD|131001 TIGR01946, rnfD, electron transport complex, RnfABCDGE type, D
subunit. The six subunit complex RnfABCDGE in
Rhodobacter capsulatus encodes an apparent NADH
oxidoreductase responsible for electron transport to
nitrogenase, necessary for nitrogen fixation. A closely
related complex in E. coli, RsxABCDGE (Reducer of SoxR),
reduces the 2Fe-2S-containing superoxide sensor SoxR,
active as a transcription factor when oxidized. This
family of putative NADH oxidoreductase complexes exists
in many of the same species as the related NQR, a
Na(+)-translocating NADH-quinone reductase, but is
distinct. This model describes the A subunit [Energy
metabolism, Electron transport].
Length = 327
Score = 29.6 bits (67), Expect = 1.9
Identities = 17/46 (36%), Positives = 22/46 (47%), Gaps = 5/46 (10%)
Query: 26 LGARNMATDMAGNPILGSGRVVGGKKAELGAWPWLIAL---YRDGF 68
LGA +ATD P GR++ G A +G +LI Y DG
Sbjct: 259 LGAFFIATDPVTAPTTNKGRIIFG--AGIGVLVYLIRTFGGYPDGV 302
>gnl|CDD|233667 TIGR01982, UbiB, 2-polyprenylphenol 6-hydroxylase. This model
represents the enzyme (UbiB) which catalyzes the first
hydroxylation step in the ubiquinone biosynthetic
pathway in bacteria. It is believed that the reaction is
2-polyprenylphenol -> 6-hydroxy-2-polyprenylphenol. This
model finds hits primarily in the proteobacteria. The
gene is also known as AarF in certain species
[Biosynthesis of cofactors, prosthetic groups, and
carriers, Menaquinone and ubiquinone].
Length = 437
Score = 28.0 bits (63), Expect = 6.4
Identities = 10/19 (52%), Positives = 12/19 (63%)
Query: 52 AELGAWPWLIALYRDGFFH 70
AE A +L + RDGFFH
Sbjct: 262 AENLARSFLNQVLRDGFFH 280
>gnl|CDD|233953 TIGR02621, cas3_GSU0051, CRISPR-associated helicase Cas3, subtype
Dpsyc. This model describes a CRISPR-associated
putative DEAH-box helicase, or Cas3, of a subtype found
in Actinomyces naeslundii MG1, Geobacter sulfurreducens
PCA, Gemmata obscuriglobus UQM 2246, and Desulfotalea
psychrophila. This protein includes both DEAH and HD
motifs [Mobile and extrachromosomal element functions,
Other].
Length = 862
Score = 28.0 bits (62), Expect = 8.0
Identities = 19/101 (18%), Positives = 29/101 (28%), Gaps = 19/101 (18%)
Query: 34 DMAGNPILGSGRVVGGKKAELGAWPWLIALYRDGFFHCGGVVLDESWVMTAAHCVDGFEK 93
DM G+ +L G K + H + D V+ AH F++
Sbjct: 153 DMIGSRLLFRGYGCSPKSRPI---------------HAALLGQDALIVLDEAHLSPAFQE 197
Query: 94 HYFEVYAGMLRRFSFSP----TEQVRPVSRIVMHSMFKRAE 130
F + + R F P P + R E
Sbjct: 198 TLFVIRSEQQRLEDFCPLRVVELTATPRTDTPDTLTTLRQE 238
>gnl|CDD|200580 cd10956, CE4_BH1302_like, Putative catalytic NodB homology domain
of uncharacterized BH1302 protein from Bacillus
halodurans and its bacterial homologs. This family is
represented by a putative polysaccharide deacetylase
BH1302 from Bacillus halodurans. Although its biological
function is unknown, BH1302 shows high sequence homology
to the catalytic NodB homology domain of Streptococcus
pneumoniae polysaccharide deacetylase PgdA (SpPgdA),
which is an extracellular metal-dependent polysaccharide
deacetylase with de-N-acetylase activity toward a
hexamer of chitooligosaccharide N-acetylglucosamine, but
not shorter chitooligosaccharides or a synthetic
peptidoglycan tetrasaccharide. Both BH1302 and SpPgdA
belong to the carbohydrate esterase 4 (CE4) superfamily.
This family also includes many uncharacterized bacterial
polysaccharide deacetylases.
Length = 194
Score = 27.3 bits (61), Expect = 8.0
Identities = 8/21 (38%), Positives = 14/21 (66%)
Query: 112 EQVRPVSRIVMHSMFKRAEMT 132
EQV+P S I++H M+ + +
Sbjct: 148 EQVKPGSIILLHVMYGSRQNS 168
>gnl|CDD|223733 COG0661, AarF, Predicted unusual protein kinase [General function
prediction only].
Length = 517
Score = 27.7 bits (62), Expect = 8.2
Identities = 11/20 (55%), Positives = 12/20 (60%)
Query: 52 AELGAWPWLIALYRDGFFHC 71
AEL +L L RDGFFH
Sbjct: 270 AELLVRAFLRQLLRDGFFHA 289
>gnl|CDD|227005 COG4658, RnfD, Predicted NADH:ubiquinone oxidoreductase, subunit
RnfD [Energy production and conversion].
Length = 338
Score = 27.4 bits (61), Expect = 10.0
Identities = 14/39 (35%), Positives = 19/39 (48%), Gaps = 2/39 (5%)
Query: 26 LGARNMATDMAGNPILGSGRVVGGKKAELGAWPWLIALY 64
LGA +ATD + GR++ G A G WLI +
Sbjct: 267 LGAFFIATDPVTSSTTNRGRLIFG--ALCGVLVWLIRSF 303
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.138 0.455
Gapped
Lambda K H
0.267 0.0738 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,086,128
Number of extensions: 1461216
Number of successful extensions: 983
Number of sequences better than 10.0: 1
Number of HSP's gapped: 965
Number of HSP's successfully gapped: 19
Length of query: 286
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 190
Effective length of database: 6,679,618
Effective search space: 1269127420
Effective search space used: 1269127420
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 58 (26.1 bits)