RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15989
(363 letters)
>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
synthesized as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. Alignment contains also inactive enzymes
that have substitutions of the catalytic triad residues.
Length = 232
Score = 125 bits (315), Expect = 5e-34
Identities = 70/242 (28%), Positives = 95/242 (39%), Gaps = 72/242 (29%)
Query: 23 GQKTARGQWPWHVALYRTEGINLSYVCGGSLVSVNYVITAAHCVTKKPYDKPVDSDTLVI 82
G + G +PW V+L T G + CGGSL+S +V+TAAHCV +
Sbjct: 4 GSEAKIGSFPWQVSLQYTGG---RHFCGGSLISPRWVLTAAHCVYSSA------PSNYTV 54
Query: 83 YLGKYHQHQFSDEGGVQNKQVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCL- 141
LG + S+EGG Q +VK+V ++P +N S Y DIALL+L V S VRP+CL
Sbjct: 55 RLGSHDL--SSNEGGGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLP 112
Query: 142 ----------------W----DDSTAPLQLSAVE-------------------------- 155
W + P L V
Sbjct: 113 SSGYNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAYSYGGTITDNMLCA 172
Query: 156 -----GTSVCNGDSGGGMVFKIDSAWYLRGIVS--ITVARDGLRVCDTKHYVVFTDVANV 208
G C GDSGG +V + L GIVS AR + V+T V++
Sbjct: 173 GGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPN-------YPGVYTRVSSY 225
Query: 209 CN 210
+
Sbjct: 226 LD 227
Score = 82.3 bits (204), Expect = 3e-18
Identities = 37/110 (33%), Positives = 51/110 (46%), Gaps = 8/110 (7%)
Query: 253 DVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLWDDSTAPLQLSAVEGRDGTV 312
VK+V ++P +N S Y DIALL+L V S VRP+CL G TV
Sbjct: 72 KVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSGYNL-----PAGTTCTV 126
Query: 313 IGWGY-DENDRVSEELKMAIMPIVSHQQCLWSNPQFFSQFTSDETFCAGF 361
GWG E + + L+ +PIVS+ +C + +D CAG
Sbjct: 127 SGWGRTSEGGPLPDVLQEVNVPIVSNAEC--KRAYSYGGTITDNMLCAGG 174
>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease. Many of these
are synthesised as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. A few, however, are active as single chain
molecules, and others are inactive due to substitutions
of the catalytic triad residues.
Length = 229
Score = 122 bits (308), Expect = 5e-33
Identities = 73/248 (29%), Positives = 99/248 (39%), Gaps = 46/248 (18%)
Query: 23 GQKTARGQWPWHVALYRTEGINLSYVCGGSLVSVNYVITAAHCVTKKPYDKPVDSDTLVI 82
G + G +PW V+L G + CGGSL+S +V+TAAHCV + +
Sbjct: 5 GSEANIGSFPWQVSLQYGGG---RHFCGGSLISPRWVLTAAHCVRGSDPSN------IRV 55
Query: 83 YLGKYHQHQFSDEGGVQNKQVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLW 142
LG H S Q +V +V I+P +N S Y DIALL+L V S VRP+CL
Sbjct: 56 RLGS---HDLSSGEEGQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICL- 111
Query: 143 DDSTAPLQLSAVEGTSVCN----GDSGGGMVFKIDSAWYLRGIVSITVARDGLRVCDTKH 198
P V + C G + G D+ L+ V++ + + C +
Sbjct: 112 -----PSSNYNVPAGTTCTVSGWGRTSEGAGSLPDT---LQE-VNVPIVSN--ATCRRAY 160
Query: 199 YVVFTDVAN------------VCNGDSGGGMVFKIDSAWYLRGIVSITVARDGLRVCDTK 246
N C GDSGG +V D W L GIVS G
Sbjct: 161 SGGGAITDNMLCAGGLEGGKDACQGDSGGPLVCN-DGRWVLVGIVSW-----GSGCARPG 214
Query: 247 HYVVFTDV 254
V+T V
Sbjct: 215 KPGVYTRV 222
Score = 80.0 bits (198), Expect = 2e-17
Identities = 36/111 (32%), Positives = 49/111 (44%), Gaps = 9/111 (8%)
Query: 253 DVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLWDDSTAPLQLSAVEGRDGTV 312
V +V I+P +N S Y DIALL+L V S VRP+CL + G TV
Sbjct: 72 KVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPSSNYNV-----PAGTTCTV 126
Query: 313 IGWGY--DENDRVSEELKMAIMPIVSHQQCLWSNPQFFSQFTSDETFCAGF 361
GWG + + + L+ +PIVS+ C +D CAG
Sbjct: 127 SGWGRTSEGAGSLPDTLQEVNVPIVSNATC--RRAYSGGGAITDNMLCAGG 175
>gnl|CDD|215708 pfam00089, Trypsin, Trypsin.
Length = 218
Score = 109 bits (275), Expect = 2e-28
Identities = 75/244 (30%), Positives = 111/244 (45%), Gaps = 42/244 (17%)
Query: 23 GQKTARGQWPWHVALYRTEGINLSYVCGGSLVSVNYVITAAHCVTKKPYDKPVDSDTLVI 82
G + G +PW V+L + G + CGGSL+S N+V+TAAHCV+ ++ ++ +
Sbjct: 4 GDEAQPGSFPWQVSLQVSSG---KHFCGGSLISENWVLTAAHCVS--------NAKSVRV 52
Query: 83 YLGKYHQHQFSDEGGVQNKQVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLW 142
LG ++ EGG Q VK+V ++P +N DIALL+L S V VRP+CL
Sbjct: 53 VLGAHNIVL--REGGEQKFDVKKVIVHPNYNPDTD-NDIALLKLKSPVTLGDTVRPICLP 109
Query: 143 DDSTAPLQLSAVEGTSVCNGDSGGGMVFKIDSAWYLRGIVSITVARDGLRVCDTKHYVVF 202
S+ GT+ SG G + L+ + V+R+ C + +
Sbjct: 110 TASSDL-----PVGTTCTV--SGWGNTKTLGLPDTLQEVTVPVVSRE---TCRSAYGGTV 159
Query: 203 TD--------VANVCNGDSGGGMVFKIDSAWYLRGIVSITVARDGLRVCDTKHYV-VFTD 253
TD + C GDSGG +V S L GIVS C + +Y V+T
Sbjct: 160 TDNMICAGAGGKDACQGDSGGPLV---CSDGELIGIVS------WGYGCASGNYPGVYTP 210
Query: 254 VKRV 257
V
Sbjct: 211 VSSY 214
Score = 75.2 bits (185), Expect = 9e-16
Identities = 35/109 (32%), Positives = 51/109 (46%), Gaps = 10/109 (9%)
Query: 253 DVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLWDDSTAPLQLSAVEGRDGTV 312
DVK+V ++P +N DIALL+L S V VRP+CL S+ G TV
Sbjct: 70 DVKKVIVHPNYNPDTD-NDIALLKLKSPVTLGDTVRPICLPTASSDL-----PVGTTCTV 123
Query: 313 IGWGYDENDRVSEELKMAIMPIVSHQQCLWSNPQFFSQFTSDETFCAGF 361
GWG + + + L+ +P+VS + C + +D CAG
Sbjct: 124 SGWGNTKTLGLPDTLQEVTVPVVSRETC----RSAYGGTVTDNMICAGA 168
>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
[Posttranslational modification, protein turnover,
chaperones].
Length = 413
Score = 51.8 bits (124), Expect = 2e-07
Identities = 62/293 (21%), Positives = 96/293 (32%), Gaps = 73/293 (24%)
Query: 29 GQWPWHVALYRTEGINLSYV-CGGSLVSVNYVITAAHCVTKKPYDKPVDSD-TLVIYLGK 86
G++P VAL +S CGGS + YV+TAAHC P+ SD V+
Sbjct: 42 GEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADAS---SPISSDVNRVVV--- 95
Query: 87 YHQHQFSDEGGVQNKQVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLWDDST 146
+D + V+ ++++ ++ N DIA+L+L+ ++
Sbjct: 96 ----DLNDSSQAERGHVRTIYVHEFYSPGNLGNDIAVLELA---------------RAAS 136
Query: 147 APLQLSAVEGTSVCNGDSGGGMVFKIDSAWYLRGIVS-------------ITVARDGLRV 193
P S +S + + + + + V L
Sbjct: 137 LPRVKITSFDASDTFLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTILHEVAVLFVPLST 196
Query: 194 CDTKH--------YVVFTDVA------NVCNGDSGGGMVFKIDSAWYLRGIVSITVARDG 239
C T + C GDSGG + K + RG+VS G
Sbjct: 197 CAQYKGCANASDGATGLTGFCAGRPPKDACQGDSGGPIFHKGEEGRVQRGVVSW-----G 251
Query: 240 LRVC-DTKHYVVFTDVKRVHIYPTFNSSNYLGDIA--LLQLSSDVDYSMYVRP 289
C T V+T+V SNY IA LS + RP
Sbjct: 252 DGGCGGTLIPGVYTNV-----------SNYQDWIAAMTNGLSYLQFRPLGYRP 293
>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986). This
domain is found in serine proteases and is predicted to
contain disulphide bonds.
Length = 267
Score = 34.6 bits (79), Expect = 0.058
Identities = 28/115 (24%), Positives = 51/115 (44%), Gaps = 15/115 (13%)
Query: 31 WPWHVALYRTEGINLSYVCGGSLVSVNYVITAAHCVTKKPYDKPVDSDTLVIYLGKYHQH 90
WPW +Y + +Y C G L+ +++V+ + C+ +D ++ + + LG H+
Sbjct: 16 WPWIAKVY----VEGNYRCTGVLIDLSWVLVSHSCL----WDTSLEHSYISVVLGG-HKT 66
Query: 91 QFSDEGGVQNKQVKRVHIYPTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLWDDS 145
S +G +Q+ RV S I+LL L S +S +V P +
Sbjct: 67 LKSVKG--PYEQIYRVDCRKDLPRSK----ISLLHLKSPATFSNHVLPTFVPSTR 115
>gnl|CDD|226119 COG3591, COG3591, V8-like Glu-specific endopeptidase [Amino acid
transport and metabolism].
Length = 251
Score = 32.4 bits (74), Expect = 0.29
Identities = 12/48 (25%), Positives = 19/48 (39%), Gaps = 5/48 (10%)
Query: 30 QWPWH--VALYRTEGINLSYVCGGSLVSVNYVITAAHCVTKKPYDKPV 75
Q+P+ V G +L+ N V+TA HC+ Y +
Sbjct: 48 QFPYSAVVQFEAATG---RLCTAATLIGPNTVLTAGHCIYSPDYGEDD 92
>gnl|CDD|220834 pfam10647, Gmad1, Lipoprotein LpqB beta-propeller domain. The
Gmad1 domain is found associated with the GerMN family,
pfam10646, in bacterial spore formation. It is predicted
to have a beta-propeller fold and to have a passive
binding role rather than a catalytic function owing to
the low number of conserved hydrophilic residues.
Length = 254
Score = 30.8 bits (70), Expect = 0.82
Identities = 11/40 (27%), Positives = 18/40 (45%), Gaps = 1/40 (2%)
Query: 204 DVANVCNGDSGGGMVFKID-SAWYLRGIVSITVARDGLRV 242
VA + +G + ++D A I + V+RDG R
Sbjct: 86 TVARLLRSATGTTVPVEVDAPALSAGRITELRVSRDGTRA 125
Score = 29.7 bits (67), Expect = 2.2
Identities = 9/42 (21%), Positives = 16/42 (38%), Gaps = 1/42 (2%)
Query: 153 AVEGTSVCNGDSGGGMVFKID-SAWYLRGIVSITVARDGLRV 193
+ +G + ++D A I + V+RDG R
Sbjct: 84 GDTVARLLRSATGTTVPVEVDAPALSAGRITELRVSRDGTRA 125
>gnl|CDD|224763 COG1850, RbcL, Ribulose 1,5-bisphosphate carboxylase, large subunit
[Carbohydrate transport and metabolism].
Length = 429
Score = 31.2 bits (71), Expect = 0.89
Identities = 8/28 (28%), Positives = 11/28 (39%), Gaps = 1/28 (3%)
Query: 29 GQWPWHV-ALYRTEGINLSYVCGGSLVS 55
G P + L R G +L GG +
Sbjct: 354 GIHPGMMPELVRIFGKDLVIQAGGGVHG 381
>gnl|CDD|217716 pfam03761, DUF316, Domain of unknown function (DUF316). This
family of proteins with unknown function are from
Caenorhabditis elegans. The protein has GO references
indicating the protein is a positive regulator of
growth rate and is also involved in nematode larval
development.
Length = 280
Score = 30.5 bits (69), Expect = 1.2
Identities = 13/60 (21%), Positives = 25/60 (41%), Gaps = 5/60 (8%)
Query: 7 SCG--TVVYNKAQPLVTYGQKTARGQWPWHVALYRTEGINLSYVCGGSLVSVNYVITAAH 64
SCG T+ Y +K ++PW V G +Y + +S +++T++
Sbjct: 30 SCGNKTLPLPSQNINGIYLEK---SEYPWLVKAAFQNGNQKNYKPPATFISTRHILTSSR 86
>gnl|CDD|215527 PLN02976, PLN02976, amine oxidase.
Length = 1713
Score = 30.6 bits (69), Expect = 1.6
Identities = 32/110 (29%), Positives = 44/110 (40%), Gaps = 30/110 (27%)
Query: 33 WHVALYRTEGINLSYVCGGSL--VSVNYV--------ITAAHCVTKKPYDKPVDS--DTL 80
WH A +L Y C L VS+ Y AHC+ K Y V+S + L
Sbjct: 894 WHFA-------HLEYGCAALLKEVSLPYWNQDDVYGGFGGAHCMIKGGYSNVVESLAEGL 946
Query: 81 VIYLGKYHQHQFSD------EGGVQNKQVKRVHIYPTFNSSNYLGDIALL 124
I+L H +D + G K+V + T N S +LGD L+
Sbjct: 947 DIHL----NHVVTDVSYGSKDAGASGSSRKKVKV-STSNGSEFLGDAVLI 991
>gnl|CDD|222077 pfam13365, Trypsin_2, Trypsin-like peptidase domain. This family
includes trypsin like peptidase domains.
Length = 138
Score = 29.1 bits (65), Expect = 1.7
Identities = 29/181 (16%), Positives = 53/181 (29%), Gaps = 46/181 (25%)
Query: 51 GSLVSVN-YVITAAHCVTKKPYDKPVDSDTLVIYLGKYHQHQFSDEGGVQNKQVKRVHIY 109
G +V + ++T AH V + D+ + + +GG +V
Sbjct: 3 GFVVGSDGLILTNAHVV------EDADASEIEV---------VLPDGGRVPAEVVAA--- 44
Query: 110 PTFNSSNYLGDIALLQLSSDVDYSMYVRPVCLWDDSTAPLQLSAVEGTSVCNGDSGGGMV 169
+ D+ALL++ + L A + G
Sbjct: 45 ----DPDL--DLALLKVDGPLLP---------------AAPLLASSAAPLGGSVVVVGGP 83
Query: 170 FKIDSAWYLRGIVSITVARDGLRVCDTKHYVVFTDVANVCNGDSGGGMVFKIDSAWYLRG 229
I G + L D ++ + D + G SGG + D+ + G
Sbjct: 84 GGIGLGASGGGGGVGGLVSGSLGGVDGRYILTDADTS---PGSSGGPV---FDADGEVVG 137
Query: 230 I 230
I
Sbjct: 138 I 138
>gnl|CDD|218649 pfam05593, RHS_repeat, RHS Repeat. RHS proteins contain extended
repeat regions. These repeats often appear to be
involved in ligand binding. Note that this model may not
find all the repeats in a protein and that it covers two
RHS repeats.
Length = 38
Score = 26.9 bits (60), Expect = 1.7
Identities = 8/26 (30%), Positives = 14/26 (53%)
Query: 301 QLSAVEGRDGTVIGWGYDENDRVSEE 326
+L++V DG V + YD R++
Sbjct: 6 RLTSVTDADGRVTTYTYDAAGRLTAV 31
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.137 0.434
Gapped
Lambda K H
0.267 0.0738 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,932,144
Number of extensions: 1658525
Number of successful extensions: 1100
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1083
Number of HSP's successfully gapped: 27
Length of query: 363
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 265
Effective length of database: 6,590,910
Effective search space: 1746591150
Effective search space used: 1746591150
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 60 (26.9 bits)