RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8978
(252 letters)
>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
synthesized as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. Alignment contains also inactive enzymes
that have substitutions of the catalytic triad residues.
Length = 232
Score = 256 bits (657), Expect = 1e-86
Identities = 98/240 (40%), Positives = 138/240 (57%), Gaps = 16/240 (6%)
Query: 14 GQASEVNDWPWLVALKRQYERDNFCGGVLINERWVLTAAHCIKQKIDNALVLRRTSDLIV 73
G +++ +PW V+L+ R +FCGG LI+ RWVLTAAHC + S+ V
Sbjct: 4 GSEAKIGSFPWQVSLQYTGGR-HFCGGSLISPRWVLTAAHC--------VYSSAPSNYTV 54
Query: 74 RLGEYDFSKVNETKVTDIPAAAMKVYPRFSEQNYENDIALVQLSKKAQYNSFVRPVCLPQ 133
RLG +D S I + V+P ++ Y+NDIAL++L + + VRP+CLP
Sbjct: 55 RLGSHDLSSNEGGGQV-IKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPS 113
Query: 134 AG-DFYEDQIGIVTGWGTLSYGGPRSDVLMEVPIPVWRLTECRKQFSQN--IFDSNLCAG 190
+G + V+GWG S GGP DVL EV +P+ EC++ +S I D+ LCAG
Sbjct: 114 SGYNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAYSYGGTITDNMLCAG 173
Query: 191 GYKGGTDSCQGDSGGPLLLQRPDKQWTIIGVVSWGIGCGK--TPGVYVQVNKYLRWIYNT 248
G +GG D+CQGDSGGPL+ + ++G+VSWG GC + PGVY +V+ YL WI T
Sbjct: 174 GLEGGKDACQGDSGGPLVCNDNG-RGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQKT 232
>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease. Many of these
are synthesised as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. A few, however, are active as single chain
molecules, and others are inactive due to substitutions
of the catalytic triad residues.
Length = 229
Score = 245 bits (628), Expect = 3e-82
Identities = 95/238 (39%), Positives = 137/238 (57%), Gaps = 19/238 (7%)
Query: 14 GQASEVNDWPWLVALKRQYERDNFCGGVLINERWVLTAAHCIKQKIDNALVLRRTSDLIV 73
G + + +PW V+L+ R +FCGG LI+ RWVLTAAHC + S++ V
Sbjct: 5 GSEANIGSFPWQVSLQYGGGR-HFCGGSLISPRWVLTAAHC--------VRGSDPSNIRV 55
Query: 74 RLGEYDFSKVNETKVTDIPAAAMKVYPRFSEQNYENDIALVQLSKKAQYNSFVRPVCLPQ 133
RLG +D S + + I + + ++P ++ Y+NDIAL++L + + VRP+CLP
Sbjct: 56 RLGSHDLS--SGEEGQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPS 113
Query: 134 AGDFYED-QIGIVTGWGTLSYG-GPRSDVLMEVPIPVWRLTECRKQFSQN--IFDSNLCA 189
+ V+GWG S G G D L EV +P+ CR+ +S I D+ LCA
Sbjct: 114 SNYNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAYSGGGAITDNMLCA 173
Query: 190 GGYKGGTDSCQGDSGGPLLLQRPDKQWTIIGVVSWGIGCGKT--PGVYVQVNKYLRWI 245
GG +GG D+CQGDSGGPL+ D +W ++G+VSWG GC + PGVY +V+ YL WI
Sbjct: 174 GGLEGGKDACQGDSGGPLVCN--DGRWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229
>gnl|CDD|215708 pfam00089, Trypsin, Trypsin.
Length = 218
Score = 182 bits (464), Expect = 9e-58
Identities = 87/235 (37%), Positives = 116/235 (49%), Gaps = 23/235 (9%)
Query: 14 GQASEVNDWPWLVALKRQYERDNFCGGVLINERWVLTAAHCIKQKIDNALVLRRTSDLIV 73
G ++ +PW V+L+ +FCGG LI+E WVLTAAHC+ + V
Sbjct: 4 GDEAQPGSFPWQVSLQVS-SGKHFCGGSLISENWVLTAAHCVS----------NAKSVRV 52
Query: 74 RLGEYDFSKVNETKVTDIPAAAMKVYPRFSEQNYENDIALVQLSKKAQYNSFVRPVCLPQ 133
LG ++ E + V+P ++ NDIAL++L VRP+CLP
Sbjct: 53 VLGAHNIVL-REGGEQKFDVKKVIVHPNYNPDTD-NDIALLKLKSPVTLGDTVRPICLPT 110
Query: 134 AGDFYEDQIGI-VTGWGTLSYGGPRSDVLMEVPIPVWRLTECRKQFSQNIFDSNLCAGGY 192
A V+GWG G D L EV +PV CR + + D+ +CAG
Sbjct: 111 ASSDLPVGTTCTVSGWGNTKTLGL-PDTLQEVTVPVVSRETCRSAYGGTVTDNMICAGA- 168
Query: 193 KGGTDSCQGDSGGPLLLQRPDKQWTIIGVVSWGIGC--GKTPGVYVQVNKYLRWI 245
GG D+CQGDSGGPL+ D +IG+VSWG GC G PGVY V+ YL WI
Sbjct: 169 -GGKDACQGDSGGPLV--CSDG--ELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 218
>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
[Posttranslational modification, protein turnover,
chaperones].
Length = 413
Score = 98.4 bits (245), Expect = 7e-24
Identities = 81/270 (30%), Positives = 113/270 (41%), Gaps = 57/270 (21%)
Query: 8 TAKIDKGQASEVNDWPWLVALKRQ---YERDNFCGGVLINERWVLTAAHCI-KQKIDNAL 63
+++I G + ++P LVAL + Y FCGG + R+VLTAAHC ++
Sbjct: 30 SSRIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADASSPISSD 89
Query: 64 VLRRTSDLIVRLGEYDFSKVNETKVTDIPAAAMKVYPRFSEQNYENDIALVQLSKKAQYN 123
V R DL D S+ V I +S N NDIA+++L+
Sbjct: 90 VNRVVVDL------NDSSQAERGHVRTIYVHEF-----YSPGNLGNDIAVLELA------ 132
Query: 124 SFVRPVCLPQAGDFYEDQ----IGIVTGWGTLSYGG-------------PRSDVLMEVPI 166
R LP+ D + VT ++ G P+ +L EV +
Sbjct: 133 ---RAASLPRVKITSFDASDTFLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTILHEVAV 189
Query: 167 PVWRLTEC-RKQFSQNIFDS-----NLCAGGYKGGTDSCQGDSGGPLLLQRPD--KQWTI 218
L+ C + + N D CAG D+CQGDSGGP+ + + Q
Sbjct: 190 LFVPLSTCAQYKGCANASDGATGLTGFCAGRP--PKDACQGDSGGPIFHKGEEGRVQ--- 244
Query: 219 IGVVSWGIG-CGKT--PGVYVQVNKYLRWI 245
GVVSWG G CG T PGVY V+ Y WI
Sbjct: 245 RGVVSWGDGGCGGTLIPGVYTNVSNYQDWI 274
>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986). This
domain is found in serine proteases and is predicted to
contain disulphide bonds.
Length = 267
Score = 37.3 bits (86), Expect = 0.005
Identities = 29/114 (25%), Positives = 46/114 (40%), Gaps = 14/114 (12%)
Query: 21 DWPWLVALKRQYERDNFCGGVLINERWVLTAAHCIKQKIDNALVLRRTSDLIVRLGEYDF 80
WPW+ K E + C GVLI+ WVL + C+ S + V LG +
Sbjct: 15 HWPWIA--KVYVEGNYRCTGVLIDLSWVLVSHSCLW------DTSLEHSYISVVLGGHKT 66
Query: 81 SKVNETKVTDIPAAAMKVYPRFSEQNYENDIALVQLSKKAQYNSFVRPVCLPQA 134
K + I +V R + + I+L+ L A +++ V P +P
Sbjct: 67 LKSVKGPYEQI----YRVDCR--KDLPRSKISLLHLKSPATFSNHVLPTFVPST 114
>gnl|CDD|222077 pfam13365, Trypsin_2, Trypsin-like peptidase domain. This family
includes trypsin like peptidase domains.
Length = 138
Score = 32.2 bits (73), Expect = 0.12
Identities = 25/183 (13%), Positives = 48/183 (26%), Gaps = 48/183 (26%)
Query: 40 GVLINER-WVLTAAHCIKQKIDNALVLRRTSDLIVRLGEYDFSKVNETKVTDIPAAAMKV 98
G ++ +LT AH ++ + + + +PA +
Sbjct: 3 GFVVGSDGLILTNAHVVEDADASEIEVVLPDG------------------GRVPAEVVAA 44
Query: 99 YPRFSEQNYENDIALVQLSKKAQYNSFVRPVCLPQAGDFYEDQIGIVTGWGTLSYGGPRS 158
P D+AL+++ P G V G G +
Sbjct: 45 DPDL-------DLALLKVDGP-------LLPAAPLLASSAAPLGGSVVVVGGPGGIGLGA 90
Query: 159 DVLMEVPIPVWRLTECRKQFSQNIFDSNLCAGGYKGGTDSCQGDSGGPLLLQRPDKQWTI 218
+ + + D D+ G SGGP+ D +
Sbjct: 91 SGGGGGVGGLVSGSLGGVDGRYILTD-----------ADTSPGSSGGPVF----DADGEV 135
Query: 219 IGV 221
+G+
Sbjct: 136 VGI 138
>gnl|CDD|226119 COG3591, COG3591, V8-like Glu-specific endopeptidase [Amino acid
transport and metabolism].
Length = 251
Score = 31.2 bits (71), Expect = 0.39
Identities = 43/224 (19%), Positives = 63/224 (28%), Gaps = 49/224 (21%)
Query: 38 CGGVLINERWVLTAAHCIKQ----KIDNALVLRRTSDLIVRLGEYDFSKVNETKVTDIPA 93
LI VLTA HCI + D A V F + + ++ P
Sbjct: 66 TAATLIGPNTVLTAGHCIYSPDYGEDDIAAAPP-----GVNSDGGPFYGITKIEIRVYPG 120
Query: 94 AAMKVYPRFSEQNYENDIALVQLSKKAQYNSFVRPVCLPQAGDFYEDQIGI---VTGWGT 150
+ E D+ L V + A E + V G
Sbjct: 121 ------ELYKEDGASYDVGEAALESGINIGDVVNYLKRNTAS---EAKANDRITVIG--- 168
Query: 151 LSYGGPRSDVLMEVPIPVWRLTECRKQFSQNIFDSNLCAGGYKGGTDSCQGDSGGPLLLQ 210
Y G + ++ W T N Y D+ G SG P+L+
Sbjct: 169 --YPGDKPNIGTM-----WESTGKVNSIKGNKLF-------YD--ADTLPGSSGSPVLI- 211
Query: 211 RPDKQWTIIGVVSWGIGCGKTP----GVYVQVNKYLRWIYNTAK 250
+ +IGV G G V + + L +I K
Sbjct: 212 ---SKDEVIGVHYNGPGANGGSLANNAVRL-TPEILNFIQQNIK 251
>gnl|CDD|217716 pfam03761, DUF316, Domain of unknown function (DUF316). This
family of proteins with unknown function are from
Caenorhabditis elegans. The protein has GO references
indicating the protein is a positive regulator of growth
rate and is also involved in nematode larval
development.
Length = 280
Score = 30.5 bits (69), Expect = 0.70
Identities = 49/255 (19%), Positives = 82/255 (32%), Gaps = 46/255 (18%)
Query: 10 KIDKGQASEVNDWPWLVALKRQY--ERDNFCGGVLINERWVLTAAHCI--------KQKI 59
+ G E +++PWLV Q +++ I+ R +LT++ K
Sbjct: 41 QNINGIYLEKSEYPWLVKAAFQNGNQKNYKPPATFISTRHILTSSRLFLNGKMLNWKNTG 100
Query: 60 DNALVLRRTSDLIV---RLGEYDFSKVNETKVT----DIPAAAMKVYPRFSEQNYENDIA 112
DN L V L ++D +++ K D A + + ++ + A
Sbjct: 101 DNDTCSGGLGHLEVPPEVLDKFDIMDLSKKKGKNSFRDNITRAYVLNICANTKSKFDLSA 160
Query: 113 ---LVQLSKKAQYNSFVRPVCLPQAGDFYEDQIGIVTGWGTLSYGGPRSDVLMEVPIPVW 169
LV+L S CL + + G YG S L +
Sbjct: 161 KPMLVELEGPEPNIS---YPCLAD------ESTSLEKGDAVDVYGIDSSGELKHRKL--- 208
Query: 170 RLTECRKQFSQNIFDSNLCAGGYKGGTDSCQGDSGGPLLLQRPDKQWTIIGVVSWG-IGC 228
NI + C+GD GGPL+ K T+IG + G +
Sbjct: 209 -----------NIVNCYSNDLSIGTDQYLCKGDDGGPLIKNVSGK-NTVIGFGATGNMED 256
Query: 229 GKTPGVYVQVNKYLR 243
+ + K L
Sbjct: 257 NANDFKFFNI-KNLS 270
>gnl|CDD|217015 pfam02395, Peptidase_S6, Immunoglobulin A1 protease. This family
consists of immunoglobulin A1 protease proteins. The
immunoglobulin A1 protease cleaves immunoglobulin IgA
and is found in pathogenic bacteria such as Neisseria
gonorrhoeae. Not all of the members of this family are
IgA proteases, espP from E. coli O157:H7 cleaves human
coagulation factor V and hbp is a hemoglobin protease
from E. coli EB1.
Length = 758
Score = 30.5 bits (69), Expect = 1.0
Identities = 13/35 (37%), Positives = 17/35 (48%), Gaps = 1/35 (2%)
Query: 200 QGDSGGPLLL-QRPDKQWTIIGVVSWGIGCGKTPG 233
GDSG PL K+W ++GV+ G G G
Sbjct: 205 LGDSGSPLFAYDNQKKKWVLLGVLDGGAGYGGKGN 239
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.138 0.442
Gapped
Lambda K H
0.267 0.0738 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,747,130
Number of extensions: 1170985
Number of successful extensions: 773
Number of sequences better than 10.0: 1
Number of HSP's gapped: 745
Number of HSP's successfully gapped: 10
Length of query: 252
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 157
Effective length of database: 6,723,972
Effective search space: 1055663604
Effective search space used: 1055663604
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (26.1 bits)