BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 012026
         (472 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|P54802|ANAG_HUMAN Alpha-N-acetylglucosaminidase OS=Homo sapiens GN=NAGLU PE=1 SV=2
          Length = 743

 Score =  357 bits (915), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 191/470 (40%), Positives = 272/470 (57%), Gaps = 48/470 (10%)

Query: 1   MSNLHGWGGPLPQSWLDQQLVLQKKILVRIYELGMNPVLPAFSGNVPAALQNVFPSAKIT 60
           M NLH W GPLP SW  +QL LQ ++L ++   GM PVLPAF+G+VP A+  VFP   +T
Sbjct: 204 MGNLHTWDGPLPPSWHIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVT 263

Query: 61  QLGNWFSVKSDPRWCCTYLLDATDPLFIEIGRAFIEQQLKEYGRTSHIYNCDTFDENTPP 120
           ++G+W        + C++LL   DP+F  IG  F+ + +KE+G T HIY  DTF+E  PP
Sbjct: 264 KMGSWGHFNCS--YSCSFLLAPEDPIFPIIGSLFLRELIKEFG-TDHIYGADTFNEMQPP 320

Query: 121 VDSPEYISSLGAAIYSGMQSGDSDAVWLMQGWLFSYDP-FWRPPQMKALLNSVPLGKLVV 179
              P Y+++   A+Y  M + D++AVWL+QGWLF + P FW P Q++A+L +VP G+L+V
Sbjct: 321 SSEPSYLAAATTAVYEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLV 380

Query: 180 LDLFAEVKPIWSTSKQFYGVPYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVG 239
           LDLFAE +P+++ +  F G P+IWCMLHNF GN  ++G L+++  GP  AR   N+TMVG
Sbjct: 381 LDLFAESQPVYTRTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVG 440

Query: 240 VGMSMEGIEQNPVVYDLMSEMAFQHEKV-DVKAWINQYSVRRYGRSVPAIQDAWNVLYHT 298
            GM+ EGI QN VVY LM+E+ ++ + V D+ AW+  ++ RRYG S P    AW +L  +
Sbjct: 441 TGMAPEGISQNEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRS 500

Query: 299 VYNCT-DGATDKNRDVIVAFPDVDPSIISVTEGKYQNYGKPVSKEAVLKSETSSYDHPHL 357
           VYNC+ +     NR  +V  P                          L+  TS      +
Sbjct: 501 VYNCSGEACRGHNRSPLVRRPS-------------------------LQMNTS------I 529

Query: 358 WYSTSEVIRALELFIASGNELSASNTYRYDLIDLTRQALAKYANELFLNIIEAY------ 411
           WY+ S+V  A  L + S   L+ S  +RYDL+DLTRQA+ +  +  +     AY      
Sbjct: 530 WYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQAVQELVSLYYEEARSAYLSKELA 589

Query: 412 QLNDAHGVFQLSRRFLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEEQ 461
            L  A GV        EL+  +D +LA    FLLG WLE A+  A +E +
Sbjct: 590 SLLRAGGVLA-----YELLPALDEVLASDSRFLLGSWLEQARAAAVSEAE 634


>sp|Q0AF54|RPOC_NITEC DNA-directed RNA polymerase subunit beta' OS=Nitrosomonas eutropha
            (strain C91) GN=rpoC PE=3 SV=1
          Length = 1404

 Score = 35.0 bits (79), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 52/275 (18%), Positives = 109/275 (39%), Gaps = 19/275 (6%)

Query: 200  PYIWCMLHNFAGNIEMYGILDSIAFGPVEARTSENTTMVGVGMSMEGIEQNPVVYDLMSE 259
            P+   ++  +AG I    + + +         +   T+V +      I Q+  +  L+  
Sbjct: 1034 PHKRPIITEYAGKIRFENVEEGVTVVRQIDEITGMATLVVIDPKRRNIAQSKGLRPLVKF 1093

Query: 260  MAFQHEKVDVKAWINQYSVRRYGRSVPAIQDAWNVLYHTVYNCTDGATDKNRDVIVAFPD 319
            +    +++++       S+  +  S+  ++D   V    V       T K RD+    P 
Sbjct: 1094 LDENDQEINIPGTDQPVSITFHVGSIITVRDGQQVGIGEVLARIPQETSKTRDITGGLPR 1153

Query: 320  V----------DPSIISVTEGKYQNYGKPVS-KEAVLKSETSSYDHPHLWYSTSEVIRAL 368
            V          D   ++   G    +GK    K+ ++ ++     H +L      V    
Sbjct: 1154 VAELFEARVPKDAGFLAEATGTVA-FGKDTKGKQRLVITDLDGIAHEYLIPKDKHVTAHD 1212

Query: 369  ELFIASGNELSASNTYRYDLIDLTR-QALAKYANELFLNI--IEAYQLNDAHGVFQLSRR 425
               +  G  +       +D++ L   +ALAKY ++   ++  ++   +ND H +  + R+
Sbjct: 1213 GQVVNKGEVIVDGPIDPHDILRLQGVEALAKYISDEVQDVYRLQGVSINDKH-IEVIVRQ 1271

Query: 426  FLELVEDMDGLLACHDGFLLGPWLESAKQLAQNEE 460
             L  V+  +   A    F+LG  +E A+ LA+NE+
Sbjct: 1272 MLRRVQITN---AGDSAFILGEQVERAEVLAENEK 1303


>sp|Q9NVE4|CCD87_HUMAN Coiled-coil domain-containing protein 87 OS=Homo sapiens GN=CCDC87
           PE=2 SV=2
          Length = 849

 Score = 33.1 bits (74), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 25/46 (54%)

Query: 155 SYDPFWRPPQMKALLNSVPLGKLVVLDLFAEVKPIWSTSKQFYGVP 200
           S +P  RPPQ   +L S PL KL V  L ++V  + + S    GVP
Sbjct: 31  SPEPQKRPPQEGRILQSFPLAKLTVASLCSQVAKLLAGSGIAAGVP 76


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.319    0.135    0.422 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 180,460,311
Number of Sequences: 539616
Number of extensions: 7562524
Number of successful extensions: 16296
Number of sequences better than 100.0: 5
Number of HSP's better than 100.0 without gapping: 2
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 16288
Number of HSP's gapped (non-prelim): 7
length of query: 472
length of database: 191,569,459
effective HSP length: 121
effective length of query: 351
effective length of database: 126,275,923
effective search space: 44322848973
effective search space used: 44322848973
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.9 bits)