RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy2950
(406 letters)
>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
synthesized as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. Alignment contains also inactive enzymes
that have substitutions of the catalytic triad residues.
Length = 232
Score = 191 bits (487), Expect = 6e-59
Identities = 79/170 (46%), Positives = 103/170 (60%), Gaps = 6/170 (3%)
Query: 220 HEQRRRVERIYT-DFYDKSIYKNDIALLELTRPFKFNEFVSPICLPNPGLTVTA-DVGLI 277
Q +V+++ Y+ S Y NDIALL+L RP ++ V PICLP+ G + A +
Sbjct: 67 GGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSGYNLPAGTTCTV 126
Query: 278 SGWGRLSEGGSLPHILQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSCQ 337
SGWGR SEGG LP +LQ VP+ EC+R+Y Y + +C G +GG D+CQ
Sbjct: 127 SGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAY---SYGGTITDNMLCAGGLEGGKDACQ 183
Query: 338 GDSGGPLACPLPDGRYYLCGITSWGVGCARPDFYGVYTLVSCYSDWVKSI 387
GDSGGPL C +GR L GI SWG GCARP++ GVYT VS Y DW++
Sbjct: 184 GDSGGPLVCND-NGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQKT 232
Score = 162 bits (411), Expect = 1e-47
Identities = 68/156 (43%), Positives = 87/156 (55%), Gaps = 18/156 (11%)
Query: 60 ELTRPFKFNEFVSPICLPNPGLTVTADVGLISGWGRLSEGADVGLISGWGRLSEGGSLPH 119
+L RP ++ V PICLP+ G + A +SGWGR SEGG LP
Sbjct: 95 KLKRPVTLSDNVRPICLPSSGYNLPAG--------------TTCTVSGWGRTSEGGPLPD 140
Query: 120 ILQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSCQGDSGGPLACPLPDG 179
+LQ VP+ EC+R+Y Y + +C G +GG D+CQGDSGGPL C +G
Sbjct: 141 VLQEVNVPIVSNAECKRAY---SYGGTITDNMLCAGGLEGGKDACQGDSGGPLVCND-NG 196
Query: 180 RYYLCGITSWGVGCARPDFYGVYTLVSCYSDWVKSI 215
R L GI SWG GCARP++ GVYT VS Y DW++
Sbjct: 197 RGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQKT 232
Score = 36.5 bits (85), Expect = 0.014
Identities = 16/29 (55%), Positives = 20/29 (68%)
Query: 26 RTSEGGSLPHILQAAEVPLTPKEECRRSY 54
RTSEGG LP +LQ VP+ EC+R+Y
Sbjct: 131 RTSEGGPLPDVLQEVNVPIVSNAECKRAY 159
>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease. Many of these
are synthesised as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. A few, however, are active as single chain
molecules, and others are inactive due to substitutions
of the catalytic triad residues.
Length = 229
Score = 184 bits (468), Expect = 4e-56
Identities = 79/168 (47%), Positives = 96/168 (57%), Gaps = 8/168 (4%)
Query: 220 HEQRRRVERIYT-DFYDKSIYKNDIALLELTRPFKFNEFVSPICLPNPGLTVTA-DVGLI 277
Q +V ++ Y+ S Y NDIALL+L P ++ V PICLP+ V A +
Sbjct: 67 EGQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPSSNYNVPAGTTCTV 126
Query: 278 SGWGRLSEG-GSLPHILQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSC 336
SGWGR SEG GSLP LQ VP+ CRR+Y + +C G +GG D+C
Sbjct: 127 SGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAY---SGGGAITDNMLCAGGLEGGKDAC 183
Query: 337 QGDSGGPLACPLPDGRYYLCGITSWGVGCARPDFYGVYTLVSCYSDWV 384
QGDSGGPL C DGR+ L GI SWG GCARP GVYT VS Y DW+
Sbjct: 184 QGDSGGPLVCN--DGRWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229
Score = 155 bits (394), Expect = 3e-45
Identities = 68/154 (44%), Positives = 81/154 (52%), Gaps = 20/154 (12%)
Query: 60 ELTRPFKFNEFVSPICLPNPGLTVTA-DVGLISGWGRLSEGADVGLISGWGRLSEGGSLP 118
+L P ++ V PICLP+ V A +SGWGR SEG GSLP
Sbjct: 95 KLKEPVTLSDNVRPICLPSSNYNVPAGTTCTVSGWGRTSEG--------------AGSLP 140
Query: 119 HILQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSCQGDSGGPLACPLPD 178
LQ VP+ CRR+Y + +C G +GG D+CQGDSGGPL C D
Sbjct: 141 DTLQEVNVPIVSNATCRRAY---SGGGAITDNMLCAGGLEGGKDACQGDSGGPLVCN--D 195
Query: 179 GRYYLCGITSWGVGCARPDFYGVYTLVSCYSDWV 212
GR+ L GI SWG GCARP GVYT VS Y DW+
Sbjct: 196 GRWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229
Score = 33.4 bits (77), Expect = 0.15
Identities = 13/30 (43%), Positives = 16/30 (53%)
Query: 25 RRTSEGGSLPHILQAAEVPLTPKEECRRSY 54
R + GSLP LQ VP+ CRR+Y
Sbjct: 131 RTSEGAGSLPDTLQEVNVPIVSNATCRRAY 160
>gnl|CDD|215708 pfam00089, Trypsin, Trypsin.
Length = 218
Score = 136 bits (344), Expect = 5e-38
Identities = 65/166 (39%), Positives = 82/166 (49%), Gaps = 13/166 (7%)
Query: 220 HEQRRRVERIYTDFYDKSIYKNDIALLELTRPFKFNEFVSPICLPNPGLTVTA-DVGLIS 278
EQ+ V+++ NDIALL+L P + V PICLP + +S
Sbjct: 65 GEQKFDVKKVIVHPNYNPDTDNDIALLKLKSPVTLGDTVRPICLPTASSDLPVGTTCTVS 124
Query: 279 GWGRLSEGGSLPHILQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSCQG 338
GWG G LP LQ VP+ +E CR +Y + +C G GG D+CQG
Sbjct: 125 GWGNTKTLG-LPDTLQEVTVPVVSRETCRSAY-----GGTVTDNMICAGA--GGKDACQG 176
Query: 339 DSGGPLACPLPDGRYYLCGITSWGVGCARPDFYGVYTLVSCYSDWV 384
DSGGPL C L GI SWG GCA ++ GVYT VS Y DW+
Sbjct: 177 DSGGPLVCSDG----ELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 218
Score = 115 bits (289), Expect = 4e-30
Identities = 56/152 (36%), Positives = 68/152 (44%), Gaps = 26/152 (17%)
Query: 61 LTRPFKFNEFVSPICLPNPGLTVTADVGLISGWGRLSEGADVGLISGWGRLSEGGSLPHI 120
L P + V PICLP + +SGWG G LP
Sbjct: 93 LKSPVTLGDTVRPICLPTASSDLPVGTT--------------CTVSGWGNTKTLG-LPDT 137
Query: 121 LQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSCQGDSGGPLACPLPDGR 180
LQ VP+ +E CR +Y + +C G GG D+CQGDSGGPL C
Sbjct: 138 LQEVTVPVVSRETCRSAY-----GGTVTDNMICAGA--GGKDACQGDSGGPLVCSDG--- 187
Query: 181 YYLCGITSWGVGCARPDFYGVYTLVSCYSDWV 212
L GI SWG GCA ++ GVYT VS Y DW+
Sbjct: 188 -ELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 218
>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
[Posttranslational modification, protein turnover,
chaperones].
Length = 413
Score = 77.2 bits (190), Expect = 2e-15
Identities = 55/184 (29%), Positives = 69/184 (37%), Gaps = 34/184 (18%)
Query: 222 QRRRVERIYT-DFYDKSIYKNDIALLELTRPFKFNEFVSPICLPNPGLTVTADVGLIS-- 278
+R V IY +FY NDIA+LEL R + + +D L S
Sbjct: 104 ERGHVRTIYVHEFYSPGNLGNDIAVLELARA-------ASLPRVKITSFDASDTFLNSVT 156
Query: 279 --------GWG--------RLSEGGSLPHILQAAEVPLTPKEECRRSYAVA-GYSNYLNQ 321
+G R S G+ IL V P C + A
Sbjct: 157 TVSPMTNGTFGVTTPSDVPRSSPKGT---ILHEVAVLFVPLSTCAQYKGCANASDGATGL 213
Query: 322 CQVCTGTKQGGLDSCQGDSGGPLACPLPDGRYYLCGITSWGVG-CARPDFYGVYTLVSCY 380
C G D+CQGDSGGP+ +GR G+ SWG G C GVYT VS Y
Sbjct: 214 TGFCAGRP--PKDACQGDSGGPIFHKGEEGRVQR-GVVSWGDGGCGGTLIPGVYTNVSNY 270
Query: 381 SDWV 384
DW+
Sbjct: 271 QDWI 274
Score = 71.1 bits (174), Expect = 2e-13
Identities = 37/105 (35%), Positives = 44/105 (41%), Gaps = 8/105 (7%)
Query: 110 RLSEGGSLPHILQAAEVPLTPKEECRRSYAVA-GYSNYLNQCQVCTGTKQGGLDSCQGDS 168
R S G+ IL V P C + A C G D+CQGDS
Sbjct: 176 RSSPKGT---ILHEVAVLFVPLSTCAQYKGCANASDGATGLTGFCAGRP--PKDACQGDS 230
Query: 169 GGPLACPLPDGRYYLCGITSWGVG-CARPDFYGVYTLVSCYSDWV 212
GGP+ +GR G+ SWG G C GVYT VS Y DW+
Sbjct: 231 GGPIFHKGEEGRVQR-GVVSWGDGGCGGTLIPGVYTNVSNYQDWI 274
>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986). This
domain is found in serine proteases and is predicted to
contain disulphide bonds.
Length = 267
Score = 32.7 bits (74), Expect = 0.29
Identities = 16/55 (29%), Positives = 27/55 (49%), Gaps = 8/55 (14%)
Query: 213 KSILYAR--HEQRRRVERIYTDFYDKSIYKNDIALLELTRPFKFNEFVSPICLPN 265
K++ + +EQ RV D K + ++ I+LL L P F+ V P +P+
Sbjct: 65 KTLKSVKGPYEQIYRV-----DCR-KDLPRSKISLLHLKSPATFSNHVLPTFVPS 113
>gnl|CDD|139961 PRK13901, ruvA, Holliday junction DNA helicase RuvA; Provisional.
Length = 196
Score = 29.8 bits (67), Expect = 1.7
Identities = 12/35 (34%), Positives = 18/35 (51%)
Query: 234 YDKSIYKNDIALLELTRPFKFNEFVSPICLPNPGL 268
Y K I K + +++ + PF+F VS CL L
Sbjct: 6 YGKIIEKKESSIVIMATPFEFELLVSSFCLAELRL 40
>gnl|CDD|217356 pfam03078, ATHILA, ATHILA ORF-1 family. ATHILA is a group of
Arabidopsis thaliana retrotransposons belonging to the
Ty3/gypsy family of the long terminal repeat (LTR) class
of eukaryotic retrotransposons. The central region of
ATHILA retrotransposons contains two or three open
reading frames (ORFs). This family represents the ORF1
product. The function of ORF1 is unknown.
Length = 456
Score = 30.4 bits (68), Expect = 1.8
Identities = 16/74 (21%), Positives = 23/74 (31%), Gaps = 6/74 (8%)
Query: 106 SGWGRLSEGGSLPHILQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSCQ 165
G L GG + IL A VPL R ++L +C+ +
Sbjct: 272 KARGSLCVGGVVTPILIACGVPLHSPGLLPR----RIDIDHLRRCEFLEFKMVDDFYRYR 327
Query: 166 GD--SGGPLACPLP 177
+ G LP
Sbjct: 328 FEHSEDGKSNILLP 341
Score = 30.4 bits (68), Expect = 1.8
Identities = 16/74 (21%), Positives = 23/74 (31%), Gaps = 6/74 (8%)
Query: 278 SGWGRLSEGGSLPHILQAAEVPLTPKEECRRSYAVAGYSNYLNQCQVCTGTKQGGLDSCQ 337
G L GG + IL A VPL R ++L +C+ +
Sbjct: 272 KARGSLCVGGVVTPILIACGVPLHSPGLLPR----RIDIDHLRRCEFLEFKMVDDFYRYR 327
Query: 338 GD--SGGPLACPLP 349
+ G LP
Sbjct: 328 FEHSEDGKSNILLP 341
>gnl|CDD|223343 COG0265, DegQ, Trypsin-like serine proteases, typically
periplasmic, contain C-terminal PDZ domain
[Posttranslational modification, protein turnover,
chaperones].
Length = 347
Score = 28.3 bits (63), Expect = 8.2
Identities = 15/61 (24%), Positives = 24/61 (39%), Gaps = 3/61 (4%)
Query: 47 KEECRRSYAVAGYELTRPFKFNEFVSPICLPNP-GLTVTADVGLISGWGRLSEGADVGLI 105
K + V + + + V I NP GL T G++S GR G+ G +
Sbjct: 125 KIDGAGGLPVIALGDSDKLRVGDVVVAI--GNPFGLGQTVTSGIVSALGRTGVGSAGGYV 182
Query: 106 S 106
+
Sbjct: 183 N 183
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.139 0.449
Gapped
Lambda K H
0.267 0.0738 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,922,056
Number of extensions: 2000215
Number of successful extensions: 1327
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1300
Number of HSP's successfully gapped: 20
Length of query: 406
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 307
Effective length of database: 6,546,556
Effective search space: 2009792692
Effective search space used: 2009792692
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (26.9 bits)