RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16044
(436 letters)
>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
synthesized as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. Alignment contains also inactive enzymes
that have substitutions of the catalytic triad residues.
Length = 232
Score = 209 bits (535), Expect = 9e-66
Identities = 86/225 (38%), Positives = 120/225 (53%), Gaps = 16/225 (7%)
Query: 28 INGKESIRGAWPWQVSLQVLHPRLGLMPHWCGAVLIHPSWVVTAAHCIHNDIFSLPIPEL 87
+ G E+ G++PWQVSLQ R H+CG LI P WV+TAAHC+++ P
Sbjct: 2 VGGSEAKIGSFPWQVSLQYTGGR-----HFCGGSLISPRWVLTAAHCVYS-----SAPSN 51
Query: 88 WTAVLGDWDRTEEEKSEVRIPVERIRVHEEFH--NYHHDIALLKLSRPTSARDKGVRAVC 145
+T LG D + E I V+++ VH ++ Y +DIALLKL RP + D VR +C
Sbjct: 52 YTVRLGSHDLSSNEGGGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDN-VRPIC 110
Query: 146 LTDADKRPVNPKQQCVATGWGRVKPKGDLVSKLRQIRVPLHNISVCRDKYGDSVELHGGH 205
L + + C +GWGR G L L+++ VP+ + + C+ Y +
Sbjct: 111 LPSSG-YNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAYSYGGTITDNM 169
Query: 206 LCGGQLDGFSGACIGDSGGPLQCSLKDGRWYLAGITSFGSGYCGV 250
LC G L+G AC GDSGGPL C+ +GR L GI S+GSG C
Sbjct: 170 LCAGGLEGGKDACQGDSGGPLVCN-DNGRGVLVGIVSWGSG-CAR 212
Score = 147 bits (373), Expect = 8e-42
Identities = 61/144 (42%), Positives = 82/144 (56%), Gaps = 3/144 (2%)
Query: 285 ALLKLSRPTSARDKGVRAVCLTDADKRPVNPKQQCVATGWGRVKPKGDLVSKLRQIRVPL 344
ALLKL RP + D VR +CL + + C +GWGR G L L+++ VP+
Sbjct: 92 ALLKLKRPVTLSDN-VRPICLPSSG-YNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPI 149
Query: 345 HNISVCRDKYGDSVELHGGHLCGGQLDGFSGACIGDSGGPLQCSLKDGRWYLAGITSFGS 404
+ + C+ Y + LC G L+G AC GDSGGPL C+ +GR L GI S+GS
Sbjct: 150 VSNAECKRAYSYGGTITDNMLCAGGLEGGKDACQGDSGGPLVCN-DNGRGVLVGIVSWGS 208
Query: 405 GCAKSGYPDVYTKLSFYLPWIRKQ 428
GCA+ YP VYT++S YL WI+K
Sbjct: 209 GCARPNYPGVYTRVSSYLDWIQKT 232
>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease. Many of these
are synthesised as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. A few, however, are active as single chain
molecules, and others are inactive due to substitutions
of the catalytic triad residues.
Length = 229
Score = 201 bits (513), Expect = 2e-62
Identities = 89/227 (39%), Positives = 120/227 (52%), Gaps = 19/227 (8%)
Query: 26 RLINGKESIRGAWPWQVSLQVLHPRLGLMPHWCGAVLIHPSWVVTAAHCIHNDIFSLPIP 85
R++ G E+ G++PWQVSLQ R H+CG LI P WV+TAAHC+ P
Sbjct: 1 RIVGGSEANIGSFPWQVSLQYGGGR-----HFCGGSLISPRWVLTAAHCVRG-----SDP 50
Query: 86 ELWTAVLGDWDRTEEEKSEVRIPVERIRVHEEFH--NYHHDIALLKLSRPTSARDKGVRA 143
LG D + E+ +V I V ++ +H ++ Y +DIALLKL P + D VR
Sbjct: 51 SNIRVRLGSHDLSSGEEGQV-IKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDN-VRP 108
Query: 144 VCLTDADKRPVNPKQQCVATGWGRVKP-KGDLVSKLRQIRVPLHNISVCRDKYGDSVELH 202
+CL ++ V C +GWGR G L L+++ VP+ + + CR Y +
Sbjct: 109 ICLPSSN-YNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRAYSGGGAIT 167
Query: 203 GGHLCGGQLDGFSGACIGDSGGPLQCSLKDGRWYLAGITSFGSGYCG 249
LC G L+G AC GDSGGPL C DGRW L GI S+GSG C
Sbjct: 168 DNMLCAGGLEGGKDACQGDSGGPLVC--NDGRWVLVGIVSWGSG-CA 211
Score = 140 bits (356), Expect = 3e-39
Identities = 63/142 (44%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 285 ALLKLSRPTSARDKGVRAVCLTDADKRPVNPKQQCVATGWGRVKP-KGDLVSKLRQIRVP 343
ALLKL P + D VR +CL ++ V C +GWGR G L L+++ VP
Sbjct: 92 ALLKLKEPVTLSDN-VRPICLPSSN-YNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVP 149
Query: 344 LHNISVCRDKYGDSVELHGGHLCGGQLDGFSGACIGDSGGPLQCSLKDGRWYLAGITSFG 403
+ + + CR Y + LC G L+G AC GDSGGPL C DGRW L GI S+G
Sbjct: 150 IVSNATCRRAYSGGGAITDNMLCAGGLEGGKDACQGDSGGPLVC--NDGRWVLVGIVSWG 207
Query: 404 SGCAKSGYPDVYTKLSFYLPWI 425
SGCA+ G P VYT++S YL WI
Sbjct: 208 SGCARPGKPGVYTRVSSYLDWI 229
>gnl|CDD|215708 pfam00089, Trypsin, Trypsin.
Length = 218
Score = 155 bits (393), Expect = 6e-45
Identities = 84/233 (36%), Positives = 114/233 (48%), Gaps = 25/233 (10%)
Query: 28 INGKESIRGAWPWQVSLQVLHPRLGLMPHWCGAVLIHPSWVVTAAHCIHNDIFSLPIPEL 87
+ G E+ G++PWQVSLQV + H+CG LI +WV+TAAHC+ N +
Sbjct: 2 VGGDEAQPGSFPWQVSLQVSSGK-----HFCGGSLISENWVLTAAHCVSN-------AKS 49
Query: 88 WTAVLGDWDRTEEEKSEVRIPVERIRVHEEF-HNYHHDIALLKLSRPTSARDKGVRAVCL 146
VLG + E E + V+++ VH + + +DIALLKL P + D VR +CL
Sbjct: 50 VRVVLGAHNIVLREGGEQKFDVKKVIVHPNYNPDTDNDIALLKLKSPVTLGD-TVRPICL 108
Query: 147 TDADKRPVNPKQQCVATGWGRVKPKGDLVSKLRQIRVPLHNISVCRDKYGDSVELHGGHL 206
A + C +GWG K G L L+++ VP+ + CR YG +V +
Sbjct: 109 PTAS-SDLPVGTTCTVSGWGNTKTLG-LPDTLQEVTVPVVSRETCRSAYGGTVT--DNMI 164
Query: 207 CGGQLDGFSGACIGDSGGPLQCSLKDGRWYLAGITSFGSGYCGVGIRYSHRQP 259
C G G AC GDSGGPL CS L GI S+G G C G P
Sbjct: 165 CAG--AGGKDACQGDSGGPLVCSDG----ELIGIVSWGYG-CASGNYPGVYTP 210
Score = 105 bits (263), Expect = 3e-26
Identities = 59/141 (41%), Positives = 72/141 (51%), Gaps = 11/141 (7%)
Query: 285 ALLKLSRPTSARDKGVRAVCLTDADKRPVNPKQQCVATGWGRVKPKGDLVSKLRQIRVPL 344
ALLKL P + D VR +CL A + C +GWG K G L L+++ VP+
Sbjct: 89 ALLKLKSPVTLGD-TVRPICLPTAS-SDLPVGTTCTVSGWGNTKTLG-LPDTLQEVTVPV 145
Query: 345 HNISVCRDKYGDSVELHGGHLCGGQLDGFSGACIGDSGGPLQCSLKDGRWYLAGITSFGS 404
+ CR YG +V +C G G AC GDSGGPL CS L GI S+G
Sbjct: 146 VSRETCRSAYGGTVT--DNMICAG--AGGKDACQGDSGGPLVCSDG----ELIGIVSWGY 197
Query: 405 GCAKSGYPDVYTKLSFYLPWI 425
GCA YP VYT +S YL WI
Sbjct: 198 GCASGNYPGVYTPVSSYLDWI 218
>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
[Posttranslational modification, protein turnover,
chaperones].
Length = 413
Score = 83.0 bits (205), Expect = 3e-17
Identities = 63/235 (26%), Positives = 95/235 (40%), Gaps = 19/235 (8%)
Query: 25 PRLINGKESIRGAWPWQVSLQVLHPRLGLMPHWCGAVLIHPSWVVTAAHCIHNDIFSLPI 84
R+I G + G +P V+L V + +CG + +V+TAAHC S PI
Sbjct: 31 SRIIGGSNANAGEYPSLVAL-VDRISDYVSGTFCGGSKLGGRYVLTAAHCADA---SSPI 86
Query: 85 PELWTAVLGDWDRTEEEKSEVRIPVERIRVHEEFH--NYHHDIALLKLSRPTSARDKGVR 142
V+ D + ++E R V I VHE + N +DIA+L+L+R S +
Sbjct: 87 SSDVNRVV--VDLNDSSQAE-RGHVRTIYVHEFYSPGNLGNDIAVLELARAASLPRVKIT 143
Query: 143 AVCLTDADKRPVNPKQQCVATGWGRVKPKGDL-----VSKLRQIRVPLHNISVCRDKYGD 197
+ +D V +G P + L ++ V +S C G
Sbjct: 144 SFDASDTFLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTILHEVAVLFVPLSTCAQYKGC 203
Query: 198 SVELHGGHLC--GGQLDGFS-GACIGDSGGPLQCSLKDGRWYLAGITSFGSGYCG 249
+ G G AC GDSGGP+ ++GR G+ S+G G CG
Sbjct: 204 A-NASDGATGLTGFCAGRPPKDACQGDSGGPIFHKGEEGRVQR-GVVSWGDGGCG 256
Score = 57.6 bits (139), Expect = 5e-09
Identities = 40/157 (25%), Positives = 59/157 (37%), Gaps = 11/157 (7%)
Query: 283 SAALLKLSRPTSARDKGVRAVCLTDADKRPVNPKQQCVATGWGRVKPKGDL-----VSKL 337
A+L+L+R S + + +D V +G P + L
Sbjct: 125 DIAVLELARAASLPRVKITSFDASDTFLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTIL 184
Query: 338 RQIRVPLHNISVCRDKYGDSVELHGGHLC--GGQLDGFS-GACIGDSGGPLQCSLKDGRW 394
++ V +S C G + G G AC GDSGGP+ ++GR
Sbjct: 185 HEVAVLFVPLSTCAQYKGCA-NASDGATGLTGFCAGRPPKDACQGDSGGPIFHKGEEGRV 243
Query: 395 YLAGITSFG-SGCAKSGYPDVYTKLSFYLPWIRKQIN 430
G+ S+G GC + P VYT +S Y WI N
Sbjct: 244 QR-GVVSWGDGGCGGTLIPGVYTNVSNYQDWIAAMTN 279
>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986). This
domain is found in serine proteases and is predicted to
contain disulphide bonds.
Length = 267
Score = 31.6 bits (71), Expect = 0.72
Identities = 24/99 (24%), Positives = 40/99 (40%), Gaps = 14/99 (14%)
Query: 38 WPWQVSLQVLHPRLGLMPHWCGAVLIHPSWVVTAAHCIHNDIFSLPIPELWTAVLGDWDR 97
WPW + V G C VLI SWV+ + C+ + + + +V+ +
Sbjct: 16 WPWIAKVYVE----GNY--RCTGVLIDLSWVLVSHSCLWD----TSLEHSYISVVLGGHK 65
Query: 98 TEEEKSEVRIPVERIRVHEEFHN-YHHDIALLKLSRPTS 135
T V+ P E+I + + I+LL L P +
Sbjct: 66 T---LKSVKGPYEQIYRVDCRKDLPRSKISLLHLKSPAT 101
>gnl|CDD|217015 pfam02395, Peptidase_S6, Immunoglobulin A1 protease. This family
consists of immunoglobulin A1 protease proteins. The
immunoglobulin A1 protease cleaves immunoglobulin IgA
and is found in pathogenic bacteria such as Neisseria
gonorrhoeae. Not all of the members of this family are
IgA proteases, espP from E. coli O157:H7 cleaves human
coagulation factor V and hbp is a hemoglobin protease
from E. coli EB1.
Length = 758
Score = 32.1 bits (73), Expect = 0.72
Identities = 17/57 (29%), Positives = 24/57 (42%), Gaps = 1/57 (1%)
Query: 379 GDSGGPLQC-SLKDGRWYLAGITSFGSGCAKSGYPDVYTKLSFYLPWIRKQINIAVD 434
GDSG PL + +W L G+ G+G G TK F ++K + V
Sbjct: 206 GDSGSPLFAYDNQKKKWVLLGVLDGGAGYGGKGNNWNVTKKDFLGQVLQKDTDATVK 262
Score = 29.0 bits (65), Expect = 5.6
Identities = 14/33 (42%), Positives = 18/33 (54%), Gaps = 1/33 (3%)
Query: 220 GDSGGPLQC-SLKDGRWYLAGITSFGSGYCGVG 251
GDSG PL + +W L G+ G+GY G G
Sbjct: 206 GDSGSPLFAYDNQKKKWVLLGVLDGGAGYGGKG 238
>gnl|CDD|226119 COG3591, COG3591, V8-like Glu-specific endopeptidase [Amino acid
transport and metabolism].
Length = 251
Score = 30.0 bits (68), Expect = 1.8
Identities = 10/20 (50%), Positives = 14/20 (70%)
Query: 58 CGAVLIHPSWVVTAAHCIHN 77
A LI P+ V+TA HCI++
Sbjct: 66 TAATLIGPNTVLTAGHCIYS 85
>gnl|CDD|233677 TIGR01995, PTS-II-ABC-beta, PTS system, beta-glucoside-specific
IIABC component. This model represents a family of PTS
enzyme II proteins in which all three domains are found
in the same polypeptide chain and which appear to have a
broad specificity for beta-glucosides including salicin
(beta-D-glucose-1-salicylate) and arbutin
(Hydroquinone-O-beta-D-glucopyranoside). These are
distinct from the closely related sucrose-specific and
trehalose-specific PTS transporters.
Length = 610
Score = 30.0 bits (68), Expect = 2.8
Identities = 8/27 (29%), Positives = 12/27 (44%)
Query: 59 GAVLIHPSWVVTAAHCIHNDIFSLPIP 85
GA L+HP+ V F +P+
Sbjct: 177 GAALLHPTLVAMVGSGKPVTFFGIPVT 203
>gnl|CDD|217150 pfam02622, DUF179, Uncharacterized ACR, COG1678.
Length = 158
Score = 29.0 bits (66), Expect = 3.2
Identities = 10/27 (37%), Positives = 12/27 (44%), Gaps = 2/27 (7%)
Query: 66 SWVVTAAHCIHNDIFSLPIPELWTAVL 92
SW+V A IF P + W A L
Sbjct: 130 SWLVAPAD--PEIIFDTPPEDRWEAAL 154
>gnl|CDD|236659 PRK10158, PRK10158, 23S rRNA/tRNA pseudouridine synthase A;
Provisional.
Length = 219
Score = 28.8 bits (64), Expect = 4.2
Identities = 27/86 (31%), Positives = 36/86 (41%), Gaps = 19/86 (22%)
Query: 97 RTEEEKSEVRIPVERIRVHEEFHNYHHDIALLKLSRPTSARDKGVRAVCLTDADKRPVN- 155
R EE K V ++R +Y ++ +L TS GV V LT A +R +
Sbjct: 38 RLEEHKDSVMTRIQR--------DYPQAESVHRLDMATS----GVIVVALTKAAERELKR 85
Query: 156 ------PKQQCVATGWGRVKPKGDLV 175
PK+Q VA WG P LV
Sbjct: 86 QFREREPKKQYVARVWGHPSPAEGLV 111
>gnl|CDD|219188 pfam06814, Lung_7-TM_R, Lung seven transmembrane receptor. This
family represents a conserved region with eukaryotic
lung seven transmembrane receptors and related proteins.
Length = 287
Score = 28.8 bits (65), Expect = 4.9
Identities = 10/27 (37%), Positives = 15/27 (55%), Gaps = 1/27 (3%)
Query: 160 CVATGWGRVKPK-GDLVSKLRQIRVPL 185
V+ G+G VKP GD + K+ I +
Sbjct: 129 IVSLGYGIVKPTLGDTLKKVAGIGLLY 155
Score = 28.8 bits (65), Expect = 4.9
Identities = 10/27 (37%), Positives = 15/27 (55%), Gaps = 1/27 (3%)
Query: 319 CVATGWGRVKPK-GDLVSKLRQIRVPL 344
V+ G+G VKP GD + K+ I +
Sbjct: 129 IVSLGYGIVKPTLGDTLKKVAGIGLLY 155
>gnl|CDD|234694 PRK00228, PRK00228, hypothetical protein; Validated.
Length = 191
Score = 28.7 bits (65), Expect = 5.1
Identities = 7/27 (25%), Positives = 11/27 (40%), Gaps = 2/27 (7%)
Query: 66 SWVVTAAHCIHNDIFSLPIPELWTAVL 92
+W+ A +F P + W A L
Sbjct: 150 AWLTAPAD--PEILFDTPPEDRWRAAL 174
>gnl|CDD|236772 PRK10833, PRK10833, putative assembly protein; Provisional.
Length = 617
Score = 28.7 bits (65), Expect = 7.0
Identities = 13/38 (34%), Positives = 16/38 (42%), Gaps = 10/38 (26%)
Query: 156 PKQQC-------VATGWGRVKPKGDLVSKLRQIRVPLH 186
PKQQC V GW L+ L++ VPL
Sbjct: 530 PKQQCDVNFNVRVTGGW---NGDSKLIDFLQETPVPLR 564
Score = 28.7 bits (65), Expect = 7.0
Identities = 13/38 (34%), Positives = 16/38 (42%), Gaps = 10/38 (26%)
Query: 315 PKQQC-------VATGWGRVKPKGDLVSKLRQIRVPLH 345
PKQQC V GW L+ L++ VPL
Sbjct: 530 PKQQCDVNFNVRVTGGW---NGDSKLIDFLQETPVPLR 564
>gnl|CDD|221807 pfam12850, Metallophos_2, Calcineurin-like phosphoesterase
superfamily domain. Members of this family are part of
the Calcineurin-like phosphoesterase superfamily.
Length = 146
Score = 27.6 bits (62), Expect = 9.0
Identities = 16/74 (21%), Positives = 30/74 (40%), Gaps = 13/74 (17%)
Query: 75 IHNDIFSLPIPELWTAVLGDWDRTEEEKSEVRIPVERIR---VHEEFHNYHHDIALLKLS 131
+ + + +P+ AV G+ D E+ E+ + + R VH + H +A L
Sbjct: 41 VLDLLADIPVY----AVRGNNDGDEDLPEELVLTLGGFRILVVHGHPYGVKHGLAELL-- 94
Query: 132 RPTSARDKGVRAVC 145
A++ G V
Sbjct: 95 ----AKEGGADVVF 104
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.140 0.465
Gapped
Lambda K H
0.267 0.0738 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,334,688
Number of extensions: 2128616
Number of successful extensions: 1708
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1679
Number of HSP's successfully gapped: 25
Length of query: 436
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 336
Effective length of database: 6,502,202
Effective search space: 2184739872
Effective search space used: 2184739872
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 60 (26.9 bits)