BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 026361
         (240 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q1PF50|EDA2_ARATH Probable serine protease EDA2 OS=Arabidopsis thaliana GN=EDA2 PE=2
           SV=2
          Length = 489

 Score =  343 bits (881), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 153/241 (63%), Positives = 195/241 (80%), Gaps = 3/241 (1%)

Query: 1   MFDAAELEIEGDFLYFLADAAVTAFQYGNPDKLCTPLVEAKNAGEDLVDAYAKFVKEYYL 60
           +F+A EL+++ DFLY  ADAAV AFQYGNPDKLC PLVEAK  G DLV  Y+ +V+EY +
Sbjct: 251 LFNATELDVDADFLYLTADAAVMAFQYGNPDKLCVPLVEAKKNGSDLVVTYSTYVREYCM 310

Query: 61  GSFGASVQTYNQKRLKNTAVTDQSADRLWWFQVCTEVAFFQVAPANDSVRSSKVDTRYHL 120
             +G  V+TYN+K L+NT VT  SA RLWWFQ CTE+ +FQVAP  DSVRS +++T +HL
Sbjct: 311 RIWGLRVRTYNRKHLRNTVVTADSAYRLWWFQACTELGYFQVAPKYDSVRSHQINTTFHL 370

Query: 121 DLCKNVFGEGIYPDVDSTNIYYGGTKIAGSKIVFTNGSQDPWRHASKQTSSPDMPSYLIT 180
           DLCK++FG+ +YP VD+TN+YYGG ++A +KI+FTNGS+DPWRHASKQ S+ +MPSY+I 
Sbjct: 371 DLCKSLFGKDVYPKVDATNLYYGGDRLAATKIIFTNGSEDPWRHASKQNSTHEMPSYIIK 430

Query: 181 CHNCGHGTDLRGCPQSPLTPEGDAQNCSAPDAVHKVRQQVIEKIDLWLSEC-QSVGWRSS 239
           C NCGHG+D+RGCPQSP+  EG + NCS PD V+KVRQQ++E IDLWLSEC QS+  RSS
Sbjct: 431 CRNCGHGSDIRGCPQSPMVIEGKSNNCSLPDYVNKVRQQMVEHIDLWLSECRQSI--RSS 488

Query: 240 M 240
           +
Sbjct: 489 I 489


>sp|Q9QXE5|TSSP_MOUSE Thymus-specific serine protease OS=Mus musculus GN=Prss16 PE=2 SV=1
          Length = 509

 Score = 59.3 bits (142), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 64/151 (42%), Gaps = 16/151 (10%)

Query: 85  ADRLWWFQVCTEVAFFQVAPANDSVRSSKVDTRYHLDLCKNVFG---EGIYPDVDSTNIY 141
            DR W +Q CTE  F+          S      + L+LC+ VFG     +   V  TN Y
Sbjct: 368 GDRQWLYQTCTEFGFYVTCEGLQCPFSQLPALPFQLELCEQVFGLSPASVAQAVAQTNSY 427

Query: 142 YGGTKIAGSKIVFTNGSQDPWRHASKQTSSPDMPSYLITCHNCGHGTDLRGCPQSPLTPE 201
           YGG     ++++F NG  DPW H    T    +    +   +  H  D+     +P+ P 
Sbjct: 428 YGGQSPGATQVLFVNGDTDPW-HVLSVTQDLGLSEPALLIPSASHCFDM-----APMRP- 480

Query: 202 GDAQNCSAPDAVHKVRQQVIEKIDLWLSECQ 232
                 S   ++   RQ++ +++  WL + +
Sbjct: 481 ------SDSPSLRLGRQKISQQLQDWLKDIK 505


>sp|Q9NQE7|TSSP_HUMAN Thymus-specific serine protease OS=Homo sapiens GN=PRSS16 PE=2 SV=2
          Length = 514

 Score = 58.9 bits (141), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 41/147 (27%), Positives = 58/147 (39%), Gaps = 16/147 (10%)

Query: 85  ADRLWWFQVCTEVAFFQVAPANDSVRSSKVDTRYHLDLCKNVFGEG---IYPDVDSTNIY 141
            DR W +Q CTE  F+          S        LDLC+ VFG     +   V  TN Y
Sbjct: 369 GDRQWLYQTCTEFGFYVTCENPRCPFSQLPALPSQLDLCEQVFGLSALSVAQAVAQTNSY 428

Query: 142 YGGTKIAGSKIVFTNGSQDPWRHASKQTSSPDMPSYLITCHNCGHGTDLRGCPQSPLTPE 201
           YGG     +K++F NG  DPW H    T +       +      H  D+   P+ P    
Sbjct: 429 YGGQTPGANKVLFVNGDTDPW-HVLSVTQALGSSESTLLIRTGSHCLDM--APERP---- 481

Query: 202 GDAQNCSAPDAVHKVRQQVIEKIDLWL 228
                 S   ++   RQ + +++  WL
Sbjct: 482 ------SDSPSLRLGRQNIFQQLQTWL 502


>sp|P34528|YM67_CAEEL Putative serine protease K12H4.7 OS=Caenorhabditis elegans
           GN=K12H4.7 PE=3 SV=2
          Length = 510

 Score = 57.0 bits (136), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 48/184 (26%), Positives = 81/184 (44%), Gaps = 33/184 (17%)

Query: 61  GSFGASVQTYNQ--KRLKNTAVTDQSADRLWWFQVCTEVAFFQ------VAPANDSVRSS 112
           G FG +   YN     +K+    +  +DR W +Q CTE  ++Q        P    V  S
Sbjct: 341 GYFGCNDIDYNGFISFMKDETFGEAQSDRAWVWQTCTEFGYYQSTSSATAGPWFGGV--S 398

Query: 113 KVDTRYHLDLCKNVFG-----EGIYPDVDSTNIYYGGT-KIAGSKIVFTNGSQDPWRHAS 166
            +  +Y++D C  ++G     + +   VD TN YYGG   +   +I+  NG  DPW HA 
Sbjct: 399 NLPAQYYIDECTAIYGAAYNSQEVQTSVDYTNQYYGGRDNLNTDRILLPNGDIDPW-HAL 457

Query: 167 KQTSSPDMPSYLITCHNCGHGTDLRGCPQSPLTPEGDAQNCSAPDAVH--KVRQQVIEKI 224
            + +S +     +  +   H  D+ G               S+ D+++    RQ++ + +
Sbjct: 458 GKLTSSNSNIVPVVINGTAHCADMYG--------------ASSLDSMYLTNARQRISDVL 503

Query: 225 DLWL 228
           D WL
Sbjct: 504 DGWL 507


>sp|P90893|YM9I_CAEEL Putative serine protease F56F10.1 OS=Caenorhabditis elegans
           GN=F56F10.1 PE=1 SV=2
          Length = 540

 Score = 48.5 bits (114), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 50/119 (42%), Gaps = 11/119 (9%)

Query: 77  NTAVTDQSADRLWWFQVCTEVAFFQVAPANDSVRSSKVDTRYHLDLCKNVFGEG-----I 131
           N    D +A R W +  C E+ F Q     ++V  + V     +D+C ++FG+      I
Sbjct: 364 NVLGPDGAAARGWMWLCCNEIGFLQTTNQGNNVFGTGVPLNLFIDMCTDMFGDSMKMSQI 423

Query: 132 YPDVDSTNIYYGGTKI-AGSKIVFTNGSQDPWRHASK--QTSSPDMPSYLI--TCHNCG 185
                 +  YYGG      + +V  NGS DPW          S  +  YLI  T H CG
Sbjct: 424 MGGNKKSQNYYGGADFYNATNVVLPNGSLDPWHALGTYGTIKSQSLLPYLINGTAH-CG 481


>sp|Q9EPB1|DPP2_RAT Dipeptidyl peptidase 2 OS=Rattus norvegicus GN=Dpp7 PE=1 SV=1
          Length = 500

 Score = 44.7 bits (104), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 56/150 (37%), Gaps = 15/150 (10%)

Query: 84  SADRLWWFQVCTEVAF-FQVAPANDSVRSSKVDTRYHLDLCKNVFGEGIYPDVDSTNIYY 142
           S  R W +Q CTE+   F      D               C + +G    PD   T+ ++
Sbjct: 352 SNARAWDYQACTEINLTFDSNNVTDMFPEIPFSDELRQQYCLDTWGVWPRPDWLQTS-FW 410

Query: 143 GGTKIAGSKIVFTNGSQDPWRHASKQTSSPDMPSYLITCHNCGHGTDLRGCPQSPLTPEG 202
           GG   A S I+F+NG  DPW     Q +        +T     H  DLR           
Sbjct: 411 GGDLKAASNIIFSNGDLDPWAGGGIQRNL-STSIIAVTIQGGAHHLDLR----------- 458

Query: 203 DAQNCSAPDAVHKVRQQVIEKIDLWLSECQ 232
            A N   P +V +VR+     I  W++  +
Sbjct: 459 -ASNSEDPPSVVEVRKLEATLIREWVAAAR 487


>sp|Q9ET22|DPP2_MOUSE Dipeptidyl peptidase 2 OS=Mus musculus GN=Dpp7 PE=2 SV=2
          Length = 506

 Score = 43.5 bits (101), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 56/148 (37%), Gaps = 17/148 (11%)

Query: 87  RLWWFQVCTEVAF-FQVAPANDSVRSSKVDTRYHLDLCKNVFGEGIYPDVDSTNI-YYGG 144
           R W +Q CTE+   F      D               C + +G  ++P  D     ++GG
Sbjct: 355 RAWDYQACTEINLTFDSNNVTDMFPEIPFSEELRQQYCLDTWG--VWPRQDWLQTSFWGG 412

Query: 145 TKIAGSKIVFTNGSQDPWRHASKQTSSPDMPSYLITCHNCGHGTDLRGCPQSPLTPEGDA 204
              A S I+F+NG  DPW     Q S+       +T     H  DLR            A
Sbjct: 413 DLKAASNIIFSNGDLDPWAGGGIQ-SNLSTSVIAVTIQGGAHHLDLR------------A 459

Query: 205 QNCSAPDAVHKVRQQVIEKIDLWLSECQ 232
            N   P +V +VR+     I  W++  +
Sbjct: 460 SNSEDPPSVVEVRKLESTLIREWVAAAR 487


>sp|Q7TMR0|PCP_MOUSE Lysosomal Pro-X carboxypeptidase OS=Mus musculus GN=Prcp PE=2 SV=2
          Length = 491

 Score = 42.0 bits (97), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 38/142 (26%), Positives = 53/142 (37%), Gaps = 17/142 (11%)

Query: 89  WWFQVCTEVAF-FQVAPANDSVRSSKVDTRYHLDLCKNVFGEGIYPDVDSTNIYYGGTKI 147
           W FQ CTE+   F     +D       D   + + C N +G  + P        YGG  I
Sbjct: 357 WSFQACTEMVMPFCTNGIDDMFEPFLWDLEKYSNDCFNQWG--VKPRPHWMTTMYGGKNI 414

Query: 148 AG-SKIVFTNGSQDPWRHASKQTSSPDMPSYLITCHNCGHGTDLRGCPQSPLTPEGDAQN 206
           +  S I+F+NG  DPW          D     I  H+  H  DLR            A N
Sbjct: 415 SSHSNIIFSNGELDPWSGGGVTRDITDT-LVAINIHDGAHHLDLR------------AHN 461

Query: 207 CSAPDAVHKVRQQVIEKIDLWL 228
              P +V   R   ++ +  W+
Sbjct: 462 AFDPSSVLLSRLLEVKHMKKWI 483


>sp|Q9UHL4|DPP2_HUMAN Dipeptidyl peptidase 2 OS=Homo sapiens GN=DPP7 PE=1 SV=3
          Length = 492

 Score = 40.8 bits (94), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 52/124 (41%), Gaps = 26/124 (20%)

Query: 87  RLWWFQVCTEVAF----------FQVAPANDSVRSSKVDTRYHLDLCKNVFGEGIYPDVD 136
           R W +Q CTE+            F   P  D +R      RY LD        G++P  D
Sbjct: 345 RAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQ-----RYCLDTW------GVWPRPD 393

Query: 137 -STNIYYGGTKIAGSKIVFTNGSQDPWRHAS-KQTSSPDMPSYLITCHNCGHGTDLRGC- 193
                ++GG   A S I+F+NG+ DPW     ++  S  + +  +T     H  DLR   
Sbjct: 394 WLLTSFWGGDLRAASNIIFSNGNLDPWAGGGIRRNLSASVIA--VTIQGGAHHLDLRASH 451

Query: 194 PQSP 197
           P+ P
Sbjct: 452 PEDP 455


>sp|P34676|PCP5_CAEEL Prolyl carboxy peptidase like protein 5 OS=Caenorhabditis elegans
           GN=pcp-5 PE=1 SV=1
          Length = 507

 Score = 38.9 bits (89), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 39/159 (24%), Positives = 56/159 (35%), Gaps = 23/159 (14%)

Query: 89  WWFQVCTEVAFFQVA---PANDSVRSSKVDTRYHLDLCKNVFGEGIYPDVD----STNIY 141
           W FQ CTE+         P +   +     +  + + C   F    Y        +  + 
Sbjct: 360 WPFQTCTEMVMPLCGSGYPNDFFWKDCPFTSEKYAEFCMQTFSSIHYNKTLLRPLAGGLA 419

Query: 142 YGGTKI-AGSKIVFTNGSQDPWRHASKQTSSPDMPSYL-ITCHNCGHGTDLRGC-PQSPL 198
           +G T + + S IVF+NG  DPW       S     S + +      H  DLRG  PQ   
Sbjct: 420 FGATSLPSASNIVFSNGYLDPWSGGGYDHSDKVQGSVISVILKQGAHHYDLRGAHPQD-- 477

Query: 199 TPEGDAQNCSAPDAVHKVRQQVIEKIDLWLSECQSVGWR 237
                       + V KVR    + I  W+ E     WR
Sbjct: 478 -----------TEEVKKVRAMETQAIKKWIKEKARNSWR 505


>sp|Q2TA14|PCP_BOVIN Lysosomal Pro-X carboxypeptidase OS=Bos taurus GN=PRCP PE=2 SV=1
          Length = 499

 Score = 37.4 bits (85), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 37/146 (25%), Positives = 58/146 (39%), Gaps = 21/146 (14%)

Query: 89  WWFQVCTEVAFFQVAPA-NDSVRSSKVDTRYHLDLCKNVFGEGIYPDVDSTNIYYGGTKI 147
           W +Q CTE+     +   +D       + + + D C   +G  + P        YGG  I
Sbjct: 362 WSYQACTEMVMPTCSDGVDDMFEPHSWNMKEYSDDCFKQWG--VRPRPSWIPTMYGGKNI 419

Query: 148 AG-SKIVFTNGSQDPWRHASKQTSSPDMPSYL--ITCHNCGHGTDLRGCPQSPLTPEGDA 204
           +  + I+F+NG  DPW   S    + D+   L  I   N  H  DLR            A
Sbjct: 420 SSHTNIIFSNGELDPW---SGGGVTKDITDTLLAIVIPNGAHHLDLR------------A 464

Query: 205 QNCSAPDAVHKVRQQVIEKIDLWLSE 230
            N   P +V   R   ++ +  W+S+
Sbjct: 465 SNALDPVSVQLTRSLEVKYMKQWISD 490


>sp|Q5RBU7|PCP_PONAB Lysosomal Pro-X carboxypeptidase OS=Pongo abelii GN=PRCP PE=2 SV=1
          Length = 496

 Score = 36.6 bits (83), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 37/146 (25%), Positives = 55/146 (37%), Gaps = 21/146 (14%)

Query: 89  WWFQVCTEVAF-FQVAPANDSVRSSKVDTRYHLDLCKNVFGEGIYPDVDSTNIYYGGTKI 147
           W +Q CTEV   F     +D       + +   D C   +G  + P        YGG  I
Sbjct: 359 WSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWG--VRPRPSWITTMYGGKNI 416

Query: 148 AG-SKIVFTNGSQDPWRHASKQTSSPDMPSYL--ITCHNCGHGTDLRGCPQSPLTPEGDA 204
           +  + IVF+NG  DPW   S    + D+   L  +T     H  DLR             
Sbjct: 417 SSHTNIVFSNGELDPW---SGGGVTKDITDTLVAVTISEGAHHLDLR------------T 461

Query: 205 QNCSAPDAVHKVRQQVIEKIDLWLSE 230
           +N   P +V   R   +  +  W+ +
Sbjct: 462 KNALDPTSVLLARSLEVRHMKNWIRD 487


>sp|P42785|PCP_HUMAN Lysosomal Pro-X carboxypeptidase OS=Homo sapiens GN=PRCP PE=1 SV=1
          Length = 496

 Score = 35.8 bits (81), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 44/107 (41%), Gaps = 9/107 (8%)

Query: 89  WWFQVCTEVAF-FQVAPANDSVRSSKVDTRYHLDLCKNVFGEGIYPDVDSTNIYYGGTKI 147
           W +Q CTEV   F     +D       + +   D C   +G  + P        YGG  I
Sbjct: 359 WSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWG--VRPRPSWITTMYGGKNI 416

Query: 148 AG-SKIVFTNGSQDPWRHASKQTSSPDMPSYL--ITCHNCGHGTDLR 191
           +  + IVF+NG  DPW   S    + D+   L  +T     H  DLR
Sbjct: 417 SSHTNIVFSNGELDPW---SGGGVTKDITDTLVAVTISEGAHHLDLR 460


>sp|P34610|PCP1_CAEEL Putative serine protease pcp-1 OS=Caenorhabditis elegans GN=pcp-1
           PE=1 SV=2
          Length = 565

 Score = 33.1 bits (74), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 22/104 (21%), Positives = 38/104 (36%), Gaps = 15/104 (14%)

Query: 134 DVDSTNIYYGGTKIAGSKIVFTNGSQDPWRHASKQTSSPDMPS--YLITCHNCGHGTDLR 191
           ++D+    YG      S ++ T G  DPW     +    +     Y++      H  DLR
Sbjct: 425 NIDAVKTLYGYDLSGSSNLILTQGHLDPWSGGGYKVDQNNAARGIYVLEIPGSAHHLDLR 484

Query: 192 GCPQSPLTPEGDAQNCSAPDAVHKVRQQVIEKIDLWLS-ECQSV 234
                         N   P+ V   R Q+I+ +  W+   C ++
Sbjct: 485 ------------QPNTCDPNTVTNARFQIIQILKCWVDPNCNTI 516


>sp|B0UR84|DNAK_METS4 Chaperone protein DnaK OS=Methylobacterium sp. (strain 4-46)
           GN=dnaK PE=3 SV=1
          Length = 639

 Score = 32.0 bits (71), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 17/28 (60%), Positives = 17/28 (60%)

Query: 31  DKLCTPLVEAKNAGEDLVDAYAKFVKEY 58
           DK    LVE KN GE LV A  K VKEY
Sbjct: 521 DKKRRELVEVKNQGESLVHATEKSVKEY 548


>sp|A0QSH5|Y1481_MYCS2 Putative S-adenosyl-L-methionine-dependent methyltransferase
           MSMEG_1481/MSMEI_1445 OS=Mycobacterium smegmatis (strain
           ATCC 700084 / mc(2)155) GN=MSMEG_1481 PE=3 SV=1
          Length = 304

 Score = 31.6 bits (70), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 18/61 (29%), Positives = 30/61 (49%), Gaps = 9/61 (14%)

Query: 108 SVRSSKVDTRYHLDLCKNVFGEGIYP---------DVDSTNIYYGGTKIAGSKIVFTNGS 158
           S R S++ T YH D+   + G G            DVD ++++YGG + A  + +  +G 
Sbjct: 203 SARGSRLATEYHPDITSTLRGRGQAMSERWRDHGFDVDLSDLWYGGDRNAADEYLAGHGW 262

Query: 159 Q 159
           Q
Sbjct: 263 Q 263


>sp|B8IHL3|DNAK_METNO Chaperone protein DnaK OS=Methylobacterium nodulans (strain ORS2060
           / LMG 21967) GN=dnaK PE=3 SV=1
          Length = 637

 Score = 31.2 bits (69), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 16/28 (57%), Positives = 17/28 (60%)

Query: 31  DKLCTPLVEAKNAGEDLVDAYAKFVKEY 58
           DK    LVE KN GE L+ A  K VKEY
Sbjct: 521 DKKRRELVEVKNQGESLIHATEKSVKEY 548


>sp|Q0W5U6|DP2L_UNCMA DNA polymerase II large subunit OS=Uncultured methanogenic archaeon
           RC-I GN=polC PE=3 SV=1
          Length = 1134

 Score = 31.2 bits (69), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 19/63 (30%), Positives = 31/63 (49%), Gaps = 3/63 (4%)

Query: 176 SYLITCHNCGHGTDLRGCPQSPLTP-EGDAQNCSAPDAVHKVRQQVIEKIDLWLSECQSV 234
           +Y++ C   GH   + GCP   ++  EGD   C  P     V+Q++  K  L+ +  + V
Sbjct: 649 TYMVACECGGHTIPVYGCPDCGISGIEGDCPKCHKP-TTPNVKQKIDVK-GLYAAALKRV 706

Query: 235 GWR 237
           G R
Sbjct: 707 GER 709


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.318    0.133    0.417 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 93,906,877
Number of Sequences: 539616
Number of extensions: 3898109
Number of successful extensions: 8214
Number of sequences better than 100.0: 20
Number of HSP's better than 100.0 without gapping: 8
Number of HSP's successfully gapped in prelim test: 12
Number of HSP's that attempted gapping in prelim test: 8196
Number of HSP's gapped (non-prelim): 20
length of query: 240
length of database: 191,569,459
effective HSP length: 114
effective length of query: 126
effective length of database: 130,053,235
effective search space: 16386707610
effective search space used: 16386707610
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (27.3 bits)