BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 026361
(240 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q1PF50|EDA2_ARATH Probable serine protease EDA2 OS=Arabidopsis thaliana GN=EDA2 PE=2
SV=2
Length = 489
Score = 343 bits (881), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 153/241 (63%), Positives = 195/241 (80%), Gaps = 3/241 (1%)
Query: 1 MFDAAELEIEGDFLYFLADAAVTAFQYGNPDKLCTPLVEAKNAGEDLVDAYAKFVKEYYL 60
+F+A EL+++ DFLY ADAAV AFQYGNPDKLC PLVEAK G DLV Y+ +V+EY +
Sbjct: 251 LFNATELDVDADFLYLTADAAVMAFQYGNPDKLCVPLVEAKKNGSDLVVTYSTYVREYCM 310
Query: 61 GSFGASVQTYNQKRLKNTAVTDQSADRLWWFQVCTEVAFFQVAPANDSVRSSKVDTRYHL 120
+G V+TYN+K L+NT VT SA RLWWFQ CTE+ +FQVAP DSVRS +++T +HL
Sbjct: 311 RIWGLRVRTYNRKHLRNTVVTADSAYRLWWFQACTELGYFQVAPKYDSVRSHQINTTFHL 370
Query: 121 DLCKNVFGEGIYPDVDSTNIYYGGTKIAGSKIVFTNGSQDPWRHASKQTSSPDMPSYLIT 180
DLCK++FG+ +YP VD+TN+YYGG ++A +KI+FTNGS+DPWRHASKQ S+ +MPSY+I
Sbjct: 371 DLCKSLFGKDVYPKVDATNLYYGGDRLAATKIIFTNGSEDPWRHASKQNSTHEMPSYIIK 430
Query: 181 CHNCGHGTDLRGCPQSPLTPEGDAQNCSAPDAVHKVRQQVIEKIDLWLSEC-QSVGWRSS 239
C NCGHG+D+RGCPQSP+ EG + NCS PD V+KVRQQ++E IDLWLSEC QS+ RSS
Sbjct: 431 CRNCGHGSDIRGCPQSPMVIEGKSNNCSLPDYVNKVRQQMVEHIDLWLSECRQSI--RSS 488
Query: 240 M 240
+
Sbjct: 489 I 489
>sp|Q9QXE5|TSSP_MOUSE Thymus-specific serine protease OS=Mus musculus GN=Prss16 PE=2 SV=1
Length = 509
Score = 59.3 bits (142), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 64/151 (42%), Gaps = 16/151 (10%)
Query: 85 ADRLWWFQVCTEVAFFQVAPANDSVRSSKVDTRYHLDLCKNVFG---EGIYPDVDSTNIY 141
DR W +Q CTE F+ S + L+LC+ VFG + V TN Y
Sbjct: 368 GDRQWLYQTCTEFGFYVTCEGLQCPFSQLPALPFQLELCEQVFGLSPASVAQAVAQTNSY 427
Query: 142 YGGTKIAGSKIVFTNGSQDPWRHASKQTSSPDMPSYLITCHNCGHGTDLRGCPQSPLTPE 201
YGG ++++F NG DPW H T + + + H D+ +P+ P
Sbjct: 428 YGGQSPGATQVLFVNGDTDPW-HVLSVTQDLGLSEPALLIPSASHCFDM-----APMRP- 480
Query: 202 GDAQNCSAPDAVHKVRQQVIEKIDLWLSECQ 232
S ++ RQ++ +++ WL + +
Sbjct: 481 ------SDSPSLRLGRQKISQQLQDWLKDIK 505
>sp|Q9NQE7|TSSP_HUMAN Thymus-specific serine protease OS=Homo sapiens GN=PRSS16 PE=2 SV=2
Length = 514
Score = 58.9 bits (141), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 58/147 (39%), Gaps = 16/147 (10%)
Query: 85 ADRLWWFQVCTEVAFFQVAPANDSVRSSKVDTRYHLDLCKNVFGEG---IYPDVDSTNIY 141
DR W +Q CTE F+ S LDLC+ VFG + V TN Y
Sbjct: 369 GDRQWLYQTCTEFGFYVTCENPRCPFSQLPALPSQLDLCEQVFGLSALSVAQAVAQTNSY 428
Query: 142 YGGTKIAGSKIVFTNGSQDPWRHASKQTSSPDMPSYLITCHNCGHGTDLRGCPQSPLTPE 201
YGG +K++F NG DPW H T + + H D+ P+ P
Sbjct: 429 YGGQTPGANKVLFVNGDTDPW-HVLSVTQALGSSESTLLIRTGSHCLDM--APERP---- 481
Query: 202 GDAQNCSAPDAVHKVRQQVIEKIDLWL 228
S ++ RQ + +++ WL
Sbjct: 482 ------SDSPSLRLGRQNIFQQLQTWL 502
>sp|P34528|YM67_CAEEL Putative serine protease K12H4.7 OS=Caenorhabditis elegans
GN=K12H4.7 PE=3 SV=2
Length = 510
Score = 57.0 bits (136), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 81/184 (44%), Gaps = 33/184 (17%)
Query: 61 GSFGASVQTYNQ--KRLKNTAVTDQSADRLWWFQVCTEVAFFQ------VAPANDSVRSS 112
G FG + YN +K+ + +DR W +Q CTE ++Q P V S
Sbjct: 341 GYFGCNDIDYNGFISFMKDETFGEAQSDRAWVWQTCTEFGYYQSTSSATAGPWFGGV--S 398
Query: 113 KVDTRYHLDLCKNVFG-----EGIYPDVDSTNIYYGGT-KIAGSKIVFTNGSQDPWRHAS 166
+ +Y++D C ++G + + VD TN YYGG + +I+ NG DPW HA
Sbjct: 399 NLPAQYYIDECTAIYGAAYNSQEVQTSVDYTNQYYGGRDNLNTDRILLPNGDIDPW-HAL 457
Query: 167 KQTSSPDMPSYLITCHNCGHGTDLRGCPQSPLTPEGDAQNCSAPDAVH--KVRQQVIEKI 224
+ +S + + + H D+ G S+ D+++ RQ++ + +
Sbjct: 458 GKLTSSNSNIVPVVINGTAHCADMYG--------------ASSLDSMYLTNARQRISDVL 503
Query: 225 DLWL 228
D WL
Sbjct: 504 DGWL 507
>sp|P90893|YM9I_CAEEL Putative serine protease F56F10.1 OS=Caenorhabditis elegans
GN=F56F10.1 PE=1 SV=2
Length = 540
Score = 48.5 bits (114), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 50/119 (42%), Gaps = 11/119 (9%)
Query: 77 NTAVTDQSADRLWWFQVCTEVAFFQVAPANDSVRSSKVDTRYHLDLCKNVFGEG-----I 131
N D +A R W + C E+ F Q ++V + V +D+C ++FG+ I
Sbjct: 364 NVLGPDGAAARGWMWLCCNEIGFLQTTNQGNNVFGTGVPLNLFIDMCTDMFGDSMKMSQI 423
Query: 132 YPDVDSTNIYYGGTKI-AGSKIVFTNGSQDPWRHASK--QTSSPDMPSYLI--TCHNCG 185
+ YYGG + +V NGS DPW S + YLI T H CG
Sbjct: 424 MGGNKKSQNYYGGADFYNATNVVLPNGSLDPWHALGTYGTIKSQSLLPYLINGTAH-CG 481
>sp|Q9EPB1|DPP2_RAT Dipeptidyl peptidase 2 OS=Rattus norvegicus GN=Dpp7 PE=1 SV=1
Length = 500
Score = 44.7 bits (104), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 56/150 (37%), Gaps = 15/150 (10%)
Query: 84 SADRLWWFQVCTEVAF-FQVAPANDSVRSSKVDTRYHLDLCKNVFGEGIYPDVDSTNIYY 142
S R W +Q CTE+ F D C + +G PD T+ ++
Sbjct: 352 SNARAWDYQACTEINLTFDSNNVTDMFPEIPFSDELRQQYCLDTWGVWPRPDWLQTS-FW 410
Query: 143 GGTKIAGSKIVFTNGSQDPWRHASKQTSSPDMPSYLITCHNCGHGTDLRGCPQSPLTPEG 202
GG A S I+F+NG DPW Q + +T H DLR
Sbjct: 411 GGDLKAASNIIFSNGDLDPWAGGGIQRNL-STSIIAVTIQGGAHHLDLR----------- 458
Query: 203 DAQNCSAPDAVHKVRQQVIEKIDLWLSECQ 232
A N P +V +VR+ I W++ +
Sbjct: 459 -ASNSEDPPSVVEVRKLEATLIREWVAAAR 487
>sp|Q9ET22|DPP2_MOUSE Dipeptidyl peptidase 2 OS=Mus musculus GN=Dpp7 PE=2 SV=2
Length = 506
Score = 43.5 bits (101), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 17/148 (11%)
Query: 87 RLWWFQVCTEVAF-FQVAPANDSVRSSKVDTRYHLDLCKNVFGEGIYPDVDSTNI-YYGG 144
R W +Q CTE+ F D C + +G ++P D ++GG
Sbjct: 355 RAWDYQACTEINLTFDSNNVTDMFPEIPFSEELRQQYCLDTWG--VWPRQDWLQTSFWGG 412
Query: 145 TKIAGSKIVFTNGSQDPWRHASKQTSSPDMPSYLITCHNCGHGTDLRGCPQSPLTPEGDA 204
A S I+F+NG DPW Q S+ +T H DLR A
Sbjct: 413 DLKAASNIIFSNGDLDPWAGGGIQ-SNLSTSVIAVTIQGGAHHLDLR------------A 459
Query: 205 QNCSAPDAVHKVRQQVIEKIDLWLSECQ 232
N P +V +VR+ I W++ +
Sbjct: 460 SNSEDPPSVVEVRKLESTLIREWVAAAR 487
>sp|Q7TMR0|PCP_MOUSE Lysosomal Pro-X carboxypeptidase OS=Mus musculus GN=Prcp PE=2 SV=2
Length = 491
Score = 42.0 bits (97), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 53/142 (37%), Gaps = 17/142 (11%)
Query: 89 WWFQVCTEVAF-FQVAPANDSVRSSKVDTRYHLDLCKNVFGEGIYPDVDSTNIYYGGTKI 147
W FQ CTE+ F +D D + + C N +G + P YGG I
Sbjct: 357 WSFQACTEMVMPFCTNGIDDMFEPFLWDLEKYSNDCFNQWG--VKPRPHWMTTMYGGKNI 414
Query: 148 AG-SKIVFTNGSQDPWRHASKQTSSPDMPSYLITCHNCGHGTDLRGCPQSPLTPEGDAQN 206
+ S I+F+NG DPW D I H+ H DLR A N
Sbjct: 415 SSHSNIIFSNGELDPWSGGGVTRDITDT-LVAINIHDGAHHLDLR------------AHN 461
Query: 207 CSAPDAVHKVRQQVIEKIDLWL 228
P +V R ++ + W+
Sbjct: 462 AFDPSSVLLSRLLEVKHMKKWI 483
>sp|Q9UHL4|DPP2_HUMAN Dipeptidyl peptidase 2 OS=Homo sapiens GN=DPP7 PE=1 SV=3
Length = 492
Score = 40.8 bits (94), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 52/124 (41%), Gaps = 26/124 (20%)
Query: 87 RLWWFQVCTEVAF----------FQVAPANDSVRSSKVDTRYHLDLCKNVFGEGIYPDVD 136
R W +Q CTE+ F P D +R RY LD G++P D
Sbjct: 345 RAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQ-----RYCLDTW------GVWPRPD 393
Query: 137 -STNIYYGGTKIAGSKIVFTNGSQDPWRHAS-KQTSSPDMPSYLITCHNCGHGTDLRGC- 193
++GG A S I+F+NG+ DPW ++ S + + +T H DLR
Sbjct: 394 WLLTSFWGGDLRAASNIIFSNGNLDPWAGGGIRRNLSASVIA--VTIQGGAHHLDLRASH 451
Query: 194 PQSP 197
P+ P
Sbjct: 452 PEDP 455
>sp|P34676|PCP5_CAEEL Prolyl carboxy peptidase like protein 5 OS=Caenorhabditis elegans
GN=pcp-5 PE=1 SV=1
Length = 507
Score = 38.9 bits (89), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 56/159 (35%), Gaps = 23/159 (14%)
Query: 89 WWFQVCTEVAFFQVA---PANDSVRSSKVDTRYHLDLCKNVFGEGIYPDVD----STNIY 141
W FQ CTE+ P + + + + + C F Y + +
Sbjct: 360 WPFQTCTEMVMPLCGSGYPNDFFWKDCPFTSEKYAEFCMQTFSSIHYNKTLLRPLAGGLA 419
Query: 142 YGGTKI-AGSKIVFTNGSQDPWRHASKQTSSPDMPSYL-ITCHNCGHGTDLRGC-PQSPL 198
+G T + + S IVF+NG DPW S S + + H DLRG PQ
Sbjct: 420 FGATSLPSASNIVFSNGYLDPWSGGGYDHSDKVQGSVISVILKQGAHHYDLRGAHPQD-- 477
Query: 199 TPEGDAQNCSAPDAVHKVRQQVIEKIDLWLSECQSVGWR 237
+ V KVR + I W+ E WR
Sbjct: 478 -----------TEEVKKVRAMETQAIKKWIKEKARNSWR 505
>sp|Q2TA14|PCP_BOVIN Lysosomal Pro-X carboxypeptidase OS=Bos taurus GN=PRCP PE=2 SV=1
Length = 499
Score = 37.4 bits (85), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 58/146 (39%), Gaps = 21/146 (14%)
Query: 89 WWFQVCTEVAFFQVAPA-NDSVRSSKVDTRYHLDLCKNVFGEGIYPDVDSTNIYYGGTKI 147
W +Q CTE+ + +D + + + D C +G + P YGG I
Sbjct: 362 WSYQACTEMVMPTCSDGVDDMFEPHSWNMKEYSDDCFKQWG--VRPRPSWIPTMYGGKNI 419
Query: 148 AG-SKIVFTNGSQDPWRHASKQTSSPDMPSYL--ITCHNCGHGTDLRGCPQSPLTPEGDA 204
+ + I+F+NG DPW S + D+ L I N H DLR A
Sbjct: 420 SSHTNIIFSNGELDPW---SGGGVTKDITDTLLAIVIPNGAHHLDLR------------A 464
Query: 205 QNCSAPDAVHKVRQQVIEKIDLWLSE 230
N P +V R ++ + W+S+
Sbjct: 465 SNALDPVSVQLTRSLEVKYMKQWISD 490
>sp|Q5RBU7|PCP_PONAB Lysosomal Pro-X carboxypeptidase OS=Pongo abelii GN=PRCP PE=2 SV=1
Length = 496
Score = 36.6 bits (83), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 55/146 (37%), Gaps = 21/146 (14%)
Query: 89 WWFQVCTEVAF-FQVAPANDSVRSSKVDTRYHLDLCKNVFGEGIYPDVDSTNIYYGGTKI 147
W +Q CTEV F +D + + D C +G + P YGG I
Sbjct: 359 WSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWG--VRPRPSWITTMYGGKNI 416
Query: 148 AG-SKIVFTNGSQDPWRHASKQTSSPDMPSYL--ITCHNCGHGTDLRGCPQSPLTPEGDA 204
+ + IVF+NG DPW S + D+ L +T H DLR
Sbjct: 417 SSHTNIVFSNGELDPW---SGGGVTKDITDTLVAVTISEGAHHLDLR------------T 461
Query: 205 QNCSAPDAVHKVRQQVIEKIDLWLSE 230
+N P +V R + + W+ +
Sbjct: 462 KNALDPTSVLLARSLEVRHMKNWIRD 487
>sp|P42785|PCP_HUMAN Lysosomal Pro-X carboxypeptidase OS=Homo sapiens GN=PRCP PE=1 SV=1
Length = 496
Score = 35.8 bits (81), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 44/107 (41%), Gaps = 9/107 (8%)
Query: 89 WWFQVCTEVAF-FQVAPANDSVRSSKVDTRYHLDLCKNVFGEGIYPDVDSTNIYYGGTKI 147
W +Q CTEV F +D + + D C +G + P YGG I
Sbjct: 359 WSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWG--VRPRPSWITTMYGGKNI 416
Query: 148 AG-SKIVFTNGSQDPWRHASKQTSSPDMPSYL--ITCHNCGHGTDLR 191
+ + IVF+NG DPW S + D+ L +T H DLR
Sbjct: 417 SSHTNIVFSNGELDPW---SGGGVTKDITDTLVAVTISEGAHHLDLR 460
>sp|P34610|PCP1_CAEEL Putative serine protease pcp-1 OS=Caenorhabditis elegans GN=pcp-1
PE=1 SV=2
Length = 565
Score = 33.1 bits (74), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/104 (21%), Positives = 38/104 (36%), Gaps = 15/104 (14%)
Query: 134 DVDSTNIYYGGTKIAGSKIVFTNGSQDPWRHASKQTSSPDMPS--YLITCHNCGHGTDLR 191
++D+ YG S ++ T G DPW + + Y++ H DLR
Sbjct: 425 NIDAVKTLYGYDLSGSSNLILTQGHLDPWSGGGYKVDQNNAARGIYVLEIPGSAHHLDLR 484
Query: 192 GCPQSPLTPEGDAQNCSAPDAVHKVRQQVIEKIDLWLS-ECQSV 234
N P+ V R Q+I+ + W+ C ++
Sbjct: 485 ------------QPNTCDPNTVTNARFQIIQILKCWVDPNCNTI 516
>sp|B0UR84|DNAK_METS4 Chaperone protein DnaK OS=Methylobacterium sp. (strain 4-46)
GN=dnaK PE=3 SV=1
Length = 639
Score = 32.0 bits (71), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 17/28 (60%), Positives = 17/28 (60%)
Query: 31 DKLCTPLVEAKNAGEDLVDAYAKFVKEY 58
DK LVE KN GE LV A K VKEY
Sbjct: 521 DKKRRELVEVKNQGESLVHATEKSVKEY 548
>sp|A0QSH5|Y1481_MYCS2 Putative S-adenosyl-L-methionine-dependent methyltransferase
MSMEG_1481/MSMEI_1445 OS=Mycobacterium smegmatis (strain
ATCC 700084 / mc(2)155) GN=MSMEG_1481 PE=3 SV=1
Length = 304
Score = 31.6 bits (70), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 30/61 (49%), Gaps = 9/61 (14%)
Query: 108 SVRSSKVDTRYHLDLCKNVFGEGIYP---------DVDSTNIYYGGTKIAGSKIVFTNGS 158
S R S++ T YH D+ + G G DVD ++++YGG + A + + +G
Sbjct: 203 SARGSRLATEYHPDITSTLRGRGQAMSERWRDHGFDVDLSDLWYGGDRNAADEYLAGHGW 262
Query: 159 Q 159
Q
Sbjct: 263 Q 263
>sp|B8IHL3|DNAK_METNO Chaperone protein DnaK OS=Methylobacterium nodulans (strain ORS2060
/ LMG 21967) GN=dnaK PE=3 SV=1
Length = 637
Score = 31.2 bits (69), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 17/28 (60%)
Query: 31 DKLCTPLVEAKNAGEDLVDAYAKFVKEY 58
DK LVE KN GE L+ A K VKEY
Sbjct: 521 DKKRRELVEVKNQGESLIHATEKSVKEY 548
>sp|Q0W5U6|DP2L_UNCMA DNA polymerase II large subunit OS=Uncultured methanogenic archaeon
RC-I GN=polC PE=3 SV=1
Length = 1134
Score = 31.2 bits (69), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 19/63 (30%), Positives = 31/63 (49%), Gaps = 3/63 (4%)
Query: 176 SYLITCHNCGHGTDLRGCPQSPLTP-EGDAQNCSAPDAVHKVRQQVIEKIDLWLSECQSV 234
+Y++ C GH + GCP ++ EGD C P V+Q++ K L+ + + V
Sbjct: 649 TYMVACECGGHTIPVYGCPDCGISGIEGDCPKCHKP-TTPNVKQKIDVK-GLYAAALKRV 706
Query: 235 GWR 237
G R
Sbjct: 707 GER 709
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.133 0.417
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 93,906,877
Number of Sequences: 539616
Number of extensions: 3898109
Number of successful extensions: 8214
Number of sequences better than 100.0: 20
Number of HSP's better than 100.0 without gapping: 8
Number of HSP's successfully gapped in prelim test: 12
Number of HSP's that attempted gapping in prelim test: 8196
Number of HSP's gapped (non-prelim): 20
length of query: 240
length of database: 191,569,459
effective HSP length: 114
effective length of query: 126
effective length of database: 130,053,235
effective search space: 16386707610
effective search space used: 16386707610
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (27.3 bits)