RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4294
(238 letters)
>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
synthesized as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. Alignment contains also inactive enzymes
that have substitutions of the catalytic triad residues.
Length = 232
Score = 111 bits (279), Expect = 5e-30
Identities = 68/248 (27%), Positives = 93/248 (37%), Gaps = 75/248 (30%)
Query: 21 IVGGRDVNPGEVPYIVSLSLY-GNLYCGGSLISLQWFLSARHCFVTENLVWNQFNPLIIA 79
IVGG + G P+ VSL G +CGGSLIS +W L+A HC + + + +
Sbjct: 1 IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCV--YSSAPSNYT--VRL 56
Query: 80 GSIYRNYKEQKRQPQLNEIALIYWHS-------DADLAMVKLKEPFRQTTFVKPLDYYTA 132
GS + E Q+ ++ + H D D+A++KLK P + V+P+ ++
Sbjct: 57 GSHDLSSNEGG--GQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSS 114
Query: 133 RETNYINDVLSKTDRSEMSIVSGFGVTFQRDKDG---------IVS-------WGIG--- 173
VSG+G T + IVS + G
Sbjct: 115 GYNLPAGT---------TCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAYSYGGTI 165
Query: 174 -----CALGYP--------------------------GIVSWGIGCAL-GYPGVYVRVDH 201
CA G GIVSWG GCA YPGVY RV
Sbjct: 166 TDNMLCA-GGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSS 224
Query: 202 YDPWIQSV 209
Y WIQ
Sbjct: 225 YLDWIQKT 232
>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease. Many of these
are synthesised as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. A few, however, are active as single chain
molecules, and others are inactive due to substitutions
of the catalytic triad residues.
Length = 229
Score = 109 bits (275), Expect = 2e-29
Identities = 71/246 (28%), Positives = 95/246 (38%), Gaps = 76/246 (30%)
Query: 20 RIVGGRDVNPGEVPYIVSLSLYGN-LYCGGSLISLQWFLSARHCFVTENLVWNQFNPLII 78
RIVGG + N G P+ VSL G +CGGSLIS +W L+A HC + +
Sbjct: 1 RIVGGSEANIGSFPWQVSLQYGGGRHFCGGSLISPRWVLTAAHCV--RGSDPSNIR--VR 56
Query: 79 AGSIYRNYKEQKRQPQLNEIALIYWH-------SDADLAMVKLKEPFRQTTFVKPLDYYT 131
GS + E+ Q+ +++ + H D D+A++KLKEP + V+P+ +
Sbjct: 57 LGSHDLSSGEEG---QVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPS 113
Query: 132 ARETNYINDVLSKTDRSEMSIVSGFGVTFQRDKDG----------IVS--------WGIG 173
+ VSG+G T + IVS G G
Sbjct: 114 SNYNVPAGT---------TCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAYSGGG 164
Query: 174 -------CALGYP-------------------------GIVSWGIGCAL-GYPGVYVRVD 200
CA G GIVSWG GCA G PGVY RV
Sbjct: 165 AITDNMLCA-GGLEGGKDACQGDSGGPLVCNDGRWVLVGIVSWGSGCARPGKPGVYTRVS 223
Query: 201 HYDPWI 206
Y WI
Sbjct: 224 SYLDWI 229
>gnl|CDD|215708 pfam00089, Trypsin, Trypsin.
Length = 218
Score = 91.7 bits (228), Expect = 1e-22
Identities = 66/231 (28%), Positives = 92/231 (39%), Gaps = 58/231 (25%)
Query: 21 IVGGRDVNPGEVPYIVSLSLYGNLY-CGGSLISLQWFLSARHCFVTENLVWNQFNPLIIA 79
IVGG + PG P+ VSL + + CGGSLIS W L+A HC N + ++
Sbjct: 1 IVGGDEAQPGSFPWQVSLQVSSGKHFCGGSLISENWVLTAAHCVS------NAKSVRVVL 54
Query: 80 GSIYRNYKEQKRQPQLNEIALIYWHSD------ADLAMVKLKEPFRQTTFVKPLDY---- 129
G+ +E Q ++ + H + D+A++KLK P V+P+
Sbjct: 55 GAHNIVLREGG--EQKFDVKKVIVHPNYNPDTDNDIALLKLKSPVTLGDTVRPICLPTAS 112
Query: 130 -------------YTARETNYINDVLSK-----TDRSEMSIVSGFGVTFQRDKDGIVSWG 171
+ +T + D L + R G VT D ++ G
Sbjct: 113 SDLPVGTTCTVSGWGNTKTLGLPDTLQEVTVPVVSRETCRSAYGGTVT-----DNMICAG 167
Query: 172 IG--------------CALGY-PGIVSWGIGCALG-YPGVYVRVDHYDPWI 206
G C+ G GIVSWG GCA G YPGVY V Y WI
Sbjct: 168 AGGKDACQGDSGGPLVCSDGELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 218
>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
[Posttranslational modification, protein turnover,
chaperones].
Length = 413
Score = 47.6 bits (113), Expect = 2e-06
Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 5/49 (10%)
Query: 20 RIVGGRDVNPGEVPYIVSLSLYGNL-----YCGGSLISLQWFLSARHCF 63
RI+GG + N GE P +V+L + +CGGS + ++ L+A HC
Sbjct: 32 RIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCA 80
Score = 35.6 bits (82), Expect = 0.015
Identities = 19/55 (34%), Positives = 24/55 (43%), Gaps = 2/55 (3%)
Query: 159 TFQRDKDGIVSWGIGCALGYPGIVSWGIG-CA-LGYPGVYVRVDHYDPWIQSVKN 211
Q D G + G+VSWG G C PGVY V +Y WI ++ N
Sbjct: 225 ACQGDSGGPIFHKGEEGRVQRGVVSWGDGGCGGTLIPGVYTNVSNYQDWIAAMTN 279
>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986). This
domain is found in serine proteases and is predicted to
contain disulphide bonds.
Length = 267
Score = 33.5 bits (76), Expect = 0.076
Identities = 20/94 (21%), Positives = 39/94 (41%), Gaps = 2/94 (2%)
Query: 33 PYIVSLSLYGNLYCGGSLISLQWFLSARHCFVTENLVWNQFNPLIIAGSIYRNYKEQKRQ 92
P+I + + GN C G LI L W L + C +L + + ++ ++ K
Sbjct: 17 PWIAKVYVEGNYRCTGVLIDLSWVLVSHSCLWDTSLEHSYISVVLGGHKTLKSVKGPYE- 75
Query: 93 PQLNEIALIYWHSDADLAMVKLKEPFRQTTFVKP 126
Q+ + + ++++ LK P + V P
Sbjct: 76 -QIYRVDCRKDLPRSKISLLHLKSPATFSNHVLP 108
>gnl|CDD|172156 PRK13587, PRK13587,
1-(5-phosphoribosyl)-5-[(5-
phosphoribosylamino)methylideneamino]
imidazole-4-carboxamide isomerase; Provisional.
Length = 234
Score = 31.7 bits (72), Expect = 0.23
Identities = 29/90 (32%), Positives = 41/90 (45%), Gaps = 16/90 (17%)
Query: 131 TARETNYINDVLSKTDRSEMSIVSGFGVTFQRDKDGIVSW---GIG-CALGYPGI--VSW 184
ARE +YI + T + I G G+ R K I+ + GI C +G GI W
Sbjct: 61 HAREFDYIKSLRRLTTK---DIEVGGGI---RTKSQIMDYFAAGINYCIVGTKGIQDTDW 114
Query: 185 GIGCALGYPG-VYVRVDHYDPWIQSVKNNG 213
A +PG +Y+ VD Y + +K NG
Sbjct: 115 LKEMAHTFPGRIYLSVDAY---GEDIKVNG 141
>gnl|CDD|220578 pfam10113, Fibrillarin_2, Fibrillarin-like archaeal protein.
Members of this family of proteins include archaeal
fibrillarin homologs.
Length = 505
Score = 29.4 bits (66), Expect = 2.0
Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 4/67 (5%)
Query: 160 FQRDKDGIVSWGIGCALGYPG--IVSWGIGCALGYPGVYVRVDHYDPWIQS--VKNNGDN 215
+ D + S I + G +SWGI L V + DPW++ VK +
Sbjct: 400 VVKTVDDLESEVIADTVALLGGRFISWGIAKKLDGGVEEVYISDVDPWVEHATVKILNEE 459
Query: 216 AGVLISA 222
G+ + A
Sbjct: 460 LGINVHA 466
>gnl|CDD|147832 pfam05894, Podovirus_Gp16, Podovirus DNA encapsidation protein
(Gp16). This family consists of several DNA
encapsidation protein (Gp16) sequences from the
phi-29-like viruses. Gene product 16 catalyzes the in
vivo and in vitro genome-encapsidation reaction.
Length = 333
Score = 28.6 bits (64), Expect = 2.5
Identities = 16/86 (18%), Positives = 32/86 (37%), Gaps = 5/86 (5%)
Query: 101 IYWHSDADLAMVKLKEPFRQTTFVKPLDYYTARETNYINDVLSKTDRSEMSIVSGFG--- 157
+ +H + + + TFV DYY+ ++ T +MS+ + F
Sbjct: 169 LLFHIEPSINEPFQVDDEAVITFVNSEDYYSEVRHTRFGSLIKGTAYYDMSLDNKFSSNS 228
Query: 158 --VTFQRDKDGIVSWGIGCALGYPGI 181
++R K+ + I Y G+
Sbjct: 229 DLFVYKRSKNSKFRFSIRYNDNYYGV 254
>gnl|CDD|239434 cd03318, MLE, Muconate Lactonizing Enzyme (MLE), an homooctameric
enzyme, catalyses the conversion of cis,cis-muconate
(CCM) to muconolactone (ML) in the catechol branch of
the beta-ketoadipate pathway. This pathway is used in
soil microbes to breakdown lignin-derived aromatics,
catechol and protocatechuate, to citric acid cycle
intermediates. Some bacterial species are also capable
of dehalogenating chloroaromatic compounds by the action
of chloromuconate lactonizing enzymes (Cl-MLEs). MLEs
are members of the enolase superfamily characterized by
the presence of an enolate anion intermediate which is
generated by abstraction of the alpha-proton of the
carboxylate substrate by an active site residue and that
is stabilized by coordination to the essential Mg2+ ion.
Length = 365
Score = 27.7 bits (62), Expect = 5.9
Identities = 10/41 (24%), Positives = 14/41 (34%), Gaps = 6/41 (14%)
Query: 162 RDKDGIVSWGIGCALGYPGIVSWGIGCALGYPGVYVRVDHY 202
DG+V G PG +WG + +D Y
Sbjct: 36 TTSDGVVGIGEA---TTPGGPAWGGESP---ETIKAIIDRY 70
>gnl|CDD|144261 pfam00603, Flu_PA, Influenza RNA-dependent RNA polymerase subunit
PA.
Length = 669
Score = 27.9 bits (62), Expect = 6.0
Identities = 17/52 (32%), Positives = 21/52 (40%), Gaps = 16/52 (30%)
Query: 17 IGGRIVGGRDVNPGEVPYIVSLSLYGNLYC---GGSLISLQWFLSARHCFVT 65
IG V GR+V Y LYC G S I ++W AR C +
Sbjct: 503 IGHTTVEGREV-----------PFY--LYCRTTGFSKIKMKWGSEARRCLIQ 541
>gnl|CDD|146488 pfam03881, Fructosamin_kin, Fructosamine kinase. This family
includes eukaryotic fructosamine-3-kinase enzymes. The
family also includes bacterial members that have not
been characterized but probably have a similar or
identical function.
Length = 287
Score = 27.6 bits (62), Expect = 6.4
Identities = 11/33 (33%), Positives = 15/33 (45%), Gaps = 6/33 (18%)
Query: 102 YW-HSDADLAMVKLKEPFRQTTF-----VKPLD 128
Y+ + DLAM +L F + V PLD
Sbjct: 215 YYGDRECDLAMTELFGGFPPDFYDGYQSVYPLD 247
>gnl|CDD|217412 pfam03186, CobD_Cbib, CobD/Cbib protein. This family includes CobD
proteins from a number of bacteria, in Salmonella this
protein is called Cbib. Salmonella CobD is a different
protein. This protein is involved in cobalamin
biosynthesis and is probably an enzyme responsible for
the conversion of adenosylcobyric acid to
adenosylcobinamide or adenosylcobinamide phosphate.
Length = 295
Score = 27.1 bits (61), Expect = 9.0
Identities = 15/58 (25%), Positives = 23/58 (39%), Gaps = 8/58 (13%)
Query: 177 GYP-GIVSWGIGCALGYPGVYVRVDHYDPWIQSVKNNGDN---AGVLISALHMTYRAV 230
G+P ++ +G LG P VY P + +G A + AL + RA
Sbjct: 242 GWPEAAMAGALGVRLGGPNVYGGEVVEKPTL----GDGARPPTAEDIRRALRLMRRAS 295
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.140 0.439
Gapped
Lambda K H
0.267 0.0738 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,261,936
Number of extensions: 1159901
Number of successful extensions: 971
Number of sequences better than 10.0: 1
Number of HSP's gapped: 959
Number of HSP's successfully gapped: 30
Length of query: 238
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 144
Effective length of database: 6,768,326
Effective search space: 974638944
Effective search space used: 974638944
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 57 (25.7 bits)