BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 004092
         (774 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|D5USX8|MSHA_TSUPD D-inositol 3-phosphate glycosyltransferase OS=Tsukamurella
           paurometabola (strain ATCC 8368 / DSM 20162 / JCM 10117
           / NBRC 16120 / NCTC 13040) GN=mshA PE=3 SV=1
          Length = 438

 Score = 40.0 bits (92), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 40/176 (22%), Positives = 78/176 (44%), Gaps = 18/176 (10%)

Query: 324 VRVKMGFKPDDLVIAIVGTQFMYRGLWLEHALILRALLPLFSEVSVENESNSPIKVMILS 383
            R ++G  PD+LV+  VG    ++       L+LRA  PL     +    +  I+++++ 
Sbjct: 223 ARRELGIAPDELVLTFVGRIQPHKA----PDLLLRAAAPL-----IHAHPDRRIRILVVG 273

Query: 384 GDSTSNYSVVIEAIAHNLHYPLGVVKHMAAE-----GDVDSVLNTADVVIYGSFLEEQTF 438
           G S +        IA  L   LG+   +  E       +  V   +D+V+  S+ E  +F
Sbjct: 274 GPSGTGLERPDALIA--LARELGIEHAVTFEPPRPPAGLAEVYRASDLVVVPSYSE--SF 329

Query: 439 PEILVKALCFRKPIIAPDLSNIRKYVDDRVNGYLFPKENIKALTHIILQVITNGKI 494
             + V+A     P++A  +  +   V D V+G L    + +  T ++ ++  N ++
Sbjct: 330 GLVAVEAQACGTPVVAAKVGGLSVAVADGVSGRLIDGHDPQEWTAVLDELTANAEL 385


>sp|D5UJ42|MSHA_CELFN D-inositol 3-phosphate glycosyltransferase OS=Cellulomonas
           flavigena (strain ATCC 482 / DSM 20109 / NCIB 8073 / NRS
           134) GN=mshA PE=3 SV=1
          Length = 443

 Score = 39.3 bits (90), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 34/132 (25%), Positives = 59/132 (44%), Gaps = 9/132 (6%)

Query: 363 LFSEVSVENESNSPIKVMILSGDSTSNYSVVIEAIAHNLHYPLGV-----VKHMAAEGDV 417
           L   V V   S  P+ ++++ G  +   + V E  A  L   LGV     V+  A   ++
Sbjct: 276 LVQAVGVLRASGRPVPLLVVLGGPSGRPTAVRELRA--LAVTLGVDDDVVVRPPAPRDEL 333

Query: 418 DSVLNTADVVIYGSFLEEQTFPEILVKALCFRKPIIAPDLSNIRKYVDDRVNGYLFPKEN 477
            S    AD+V   S    ++F  + V+A     P++A D+  +R  V+D V+G L P  +
Sbjct: 334 VSWYRAADLVAMPS--RSESFGLVAVEAQASGTPVLAADVGGLRTVVEDDVSGRLVPGHD 391

Query: 478 IKALTHIILQVI 489
            +    +I   +
Sbjct: 392 PQVWAEVIADAL 403


>sp|O07147|PIMA_MYCLE GDP-mannose-dependent alpha-(1-2)-phosphatidylinositol
           mannosyltransferase OS=Mycobacterium leprae (strain TN)
           GN=pimA PE=3 SV=1
          Length = 374

 Score = 35.0 bits (79), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 40/77 (51%), Gaps = 6/77 (7%)

Query: 406 GVVKHMAAEGDVD-----SVLNTADVVIYGSFLEEQTFPEILVKALCFRKPIIAPDLSNI 460
           G+V+H+   G VD     + + +ADV    +   E +F  +LV+A+    P++A DL   
Sbjct: 240 GLVEHIRFLGQVDDAGKAAAMRSADVYCAPNIGGE-SFGIVLVEAMAAGTPVVASDLDAF 298

Query: 461 RKYVDDRVNGYLFPKEN 477
           R+ + D   G+L P  +
Sbjct: 299 RRVLRDGEVGHLVPAGD 315


>sp|Q4A5V9|SYG_MYCS5 Glycine--tRNA ligase OS=Mycoplasma synoviae (strain 53) GN=glyQS
           PE=3 SV=1
          Length = 451

 Score = 34.3 bits (77), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 21/73 (28%), Positives = 36/73 (49%), Gaps = 2/73 (2%)

Query: 448 FRKPIIAPDLSNIRKYVDDRVNGYLFPKENIKALTHIILQVITNGKISPFARNIASIGRR 507
           F  P+I   ++N ++Y  D+V   LFP EN+  +TH  +Q      ++ +  +  +    
Sbjct: 84  FNDPLIENKINN-KRYRADKVIEELFPNENVAKMTHEQMQEFLQKNVTSYDNSKCAWSEI 142

Query: 508 SVKNLMALETIEG 520
              NLM  ET +G
Sbjct: 143 KKFNLM-FETFQG 154


>sp|Q9UPY6|WASF3_HUMAN Wiskott-Aldrich syndrome protein family member 3 OS=Homo sapiens
           GN=WASF3 PE=1 SV=2
          Length = 502

 Score = 33.9 bits (76), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 24/62 (38%), Positives = 35/62 (56%), Gaps = 11/62 (17%)

Query: 598 TDDSFLYDIWKEE--KDIEMLNVRKRREEEELKDRIDQSHGTWDEVYRSAKRADRAKNDL 655
           TD S+ +D+WKE+  +D E     KRR++E+   RID   GT  EV    K+  +A+N  
Sbjct: 152 TDPSYFFDLWKEKMLQDTEDKRKEKRRQKEQ--KRID---GTTREV----KKVRKARNRR 202

Query: 656 HE 657
            E
Sbjct: 203 QE 204


>sp|Q46638|AMSK_ERWAM Amylovoran biosynthesis glycosyltransferase AmsK OS=Erwinia
           amylovora GN=amsK PE=3 SV=2
          Length = 407

 Score = 33.5 bits (75), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 18/77 (23%), Positives = 43/77 (55%), Gaps = 4/77 (5%)

Query: 416 DVDSVLNTADVVIYGSFL----EEQTFPEILVKALCFRKPIIAPDLSNIRKYVDDRVNGY 471
           ++   L+ AD+ +  S      + +  P  L++A+    P+++ + S I + ++  V+G+
Sbjct: 292 EIKRYLDEADIFLLPSLTAADGDMEGIPVALMEAMAVGLPVVSSEHSGIPELIEHNVSGW 351

Query: 472 LFPKENIKALTHIILQV 488
           L P+ + +AL  I+L++
Sbjct: 352 LAPEGDAQALAAILLRL 368


>sp|Q65CC7|KANE_STRKN Glycosyltransferase KanE OS=Streptomyces kanamyceticus GN=kanE PE=1
           SV=1
          Length = 386

 Score = 33.1 bits (74), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 13/58 (22%), Positives = 34/58 (58%), Gaps = 2/58 (3%)

Query: 417 VDSVLNTADVVIYGSFLEEQTFPEILVKALCFRKPIIAPDLSNIRKYVDDRVNGYLFP 474
           + S+++  ++V+  S  EE     ++++ +  R+P++A D++ + K ++D   G + P
Sbjct: 277 IPSMISLGELVVLPSRYEELGI--VILECMTMRRPLVAHDVNGVNKLIEDGTTGIVVP 332


>sp|O05083|Y1698_HAEIN Uncharacterized glycosyltransferase HI_1698 OS=Haemophilus
           influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd)
           GN=HI_1698 PE=3 SV=1
          Length = 353

 Score = 33.1 bits (74), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 26/96 (27%), Positives = 50/96 (52%), Gaps = 12/96 (12%)

Query: 408 VKHMAAEGDVDSVLN----TADVVIYGS-----FLEEQT--FPEILVKALCFRKPIIAPD 456
           +K++A   D++  +N    T DV  Y        L  QT   P ++++A+ F  PI+A +
Sbjct: 227 LKNLAKALDIEDSVNFIPRTNDVSFYYESSSIYCLPSQTEGLPLVVIEAMAFGLPIVAFN 286

Query: 457 LS-NIRKYVDDRVNGYLFPKENIKALTHIILQVITN 491
            S  +++ V+ + NG+L  + NI+ +   +  +I N
Sbjct: 287 CSPGVKQLVEHKENGFLCEQNNIEEMVKGLDLLINN 322


>sp|Q8S403|PHO1_ARATH Phosphate transporter PHO1 OS=Arabidopsis thaliana GN=PHO1 PE=2
           SV=1
          Length = 782

 Score = 33.1 bits (74), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 38/146 (26%), Positives = 60/146 (41%), Gaps = 21/146 (14%)

Query: 542 KELSPKLKEEWQWHLFEAFLN------STHEDRTSRSNRFLNQIELLQSNHTERDSYLPV 595
           KEL  +L  EW+    EAF+N         + +TSR  +  +   +   +   R  + PV
Sbjct: 6   KELEAQLIPEWK----EAFVNYCLLKKQIKKIKTSRKPKPASHYPIGHHSDFGRSLFDPV 61

Query: 596 PETDDSFLYDIWKEEKDIEMLNVRKRREEEELKDRIDQSHGT--------WDEVYRSAKR 647
            +   +F   ++   +  E+L VR+RR   E  D +D+ + T         DEV     R
Sbjct: 62  RKLARTFSDKLFSNSEKPEILQVRRRRGSSETGDDVDEIYQTELVQLFSEEDEVKVFFAR 121

Query: 648 ADR---AKNDLHERDEGELERTGQPL 670
            D      N  H+  E E    G+ L
Sbjct: 122 LDEELNKVNQFHKPKETEFLERGEIL 147


>sp|Q8VHI6|WASF3_MOUSE Wiskott-Aldrich syndrome protein family member 3 OS=Mus musculus
           GN=Wasf3 PE=2 SV=1
          Length = 501

 Score = 33.1 bits (74), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 35/62 (56%), Gaps = 11/62 (17%)

Query: 598 TDDSFLYDIWKEE--KDIEMLNVRKRREEEELKDRIDQSHGTWDEVYRSAKRADRAKNDL 655
           TD S+ +D+WKE+  +D E     KRR++E+   R+D   GT  EV    K+  +A+N  
Sbjct: 152 TDPSYFFDLWKEKMLQDTEDKRKEKRRQKEQ--KRVD---GTTREV----KKVRKARNRR 202

Query: 656 HE 657
            E
Sbjct: 203 QE 204


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.322    0.138    0.413 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 290,527,162
Number of Sequences: 539616
Number of extensions: 12749956
Number of successful extensions: 32525
Number of sequences better than 100.0: 42
Number of HSP's better than 100.0 without gapping: 2
Number of HSP's successfully gapped in prelim test: 40
Number of HSP's that attempted gapping in prelim test: 32489
Number of HSP's gapped (non-prelim): 69
length of query: 774
length of database: 191,569,459
effective HSP length: 125
effective length of query: 649
effective length of database: 124,117,459
effective search space: 80552230891
effective search space used: 80552230891
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 65 (29.6 bits)