BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 014232
         (428 letters)

Database: pdbaa 
           62,578 sequences; 14,973,337 total letters

Searching..................................................done



>pdb|2PQ6|A Chain A, Crystal Structure Of Medicago Truncatula Ugt85h2- Insights
           Into The Structural Basis Of A Multifunctional (Iso)
           Flavonoid Glycosyltransferase
          Length = 482

 Score =  231 bits (590), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 138/439 (31%), Positives = 226/439 (51%), Gaps = 40/439 (9%)

Query: 1   MLQLGTILYSKGFSITVVHTDFNSPN--PSNHPK-------FSFQSIPEGL----ADDDI 47
           + +L  +L+ +GF IT V+T++N      S  PK       F+F+SIP+GL     D D+
Sbjct: 25  LFKLAKLLHLRGFHITFVNTEYNHKRLLKSRGPKAFDGFTDFNFESIPDGLTPMEGDGDV 84

Query: 48  YSGNIIAIIMHLNANCGAPFHKCLVQMMKQQMPADEIVCIIYDELMYFAESAANQLKLRS 107
            S ++  +   +  N   P+ + L ++         + C++ D  M F   AA + +L +
Sbjct: 85  -SQDVPTLCQSVRKNFLKPYCELLTRL-NHSTNVPPVTCLVSDCCMSFTIQAAEEFELPN 142

Query: 108 IILRTNSAATQISRIALLQLKEDGSIPLQDPSNLA--------DPVPRLQPLRFKDL-PF 158
           ++  ++SA + ++ +      E G IP +D S L         D +P L+  R KD+  F
Sbjct: 143 VLYFSSSACSLLNVMHFRSFVERGIIPFKDESYLTNGCLETKVDWIPGLKNFRLKDIVDF 202

Query: 159 SQFGLPENFLQ--LIPKIYNVRTSKAVIWNTMNCIEXXXXXXXXXXCNIP-IFPVGPLHK 215
            +   P + +    I     V     ++ NT N +E            IP I+P+GPL  
Sbjct: 203 IRTTNPNDIMLEFFIEVADRVNKDTTILLNTFNELESDVINALSS--TIPSIYPIGPLPS 260

Query: 216 FAPSSP---------GSLLKEDTSCISWLNNQAPNSVLYVSLGSIVSMDKKELKEMAWGL 266
               +P          +L KEDT C+ WL ++ P SV+YV+ GS   M  ++L E AWGL
Sbjct: 261 LLKQTPQIHQLDSLDSNLWKEDTECLDWLESKEPGSVVYVNFGSTTVMTPEQLLEFAWGL 320

Query: 267 YNSKQPFLWVLRPSSTSASSGIELLPEGFEEAVGENGCVVKWAPQKDVLSHIAVGGFWSH 326
            N K+ FLW++RP      S I      F   + + G +  W PQ  VL+H ++GGF +H
Sbjct: 321 ANCKKSFLWIIRPDLVIGGSVI--FSSEFTNEIADRGLIASWCPQDKVLNHPSIGGFLTH 378

Query: 327 CGWNSTLESICEGVPMICRPGFGDQRVSARYVSHVWRIGLQLENDLEKGEVEKAVKQLMV 386
           CGWNST ESIC GVPM+C P F DQ    R++ + W IG++++ ++++ E+ K + +++ 
Sbjct: 379 CGWNSTTESICAGVPMLCWPFFADQPTDCRFICNEWEIGMEIDTNVKREELAKLINEVIA 438

Query: 387 EKEGQEMRQRAKNLKEDVE 405
             +G++M+Q+A  LK+  E
Sbjct: 439 GDKGKKMKQKAMELKKKAE 457


>pdb|2C1X|A Chain A, Structure And Activity Of A Flavonoid 3-O
           Glucosyltransferase Reveals The Basis For Plant Natural
           Product Modification
 pdb|2C1Z|A Chain A, Structure And Activity Of A Flavonoid 3-O
           Glucosyltransferase Reveals The Basis For Plant Natural
           Product Modification
 pdb|2C9Z|A Chain A, Structure And Activity Of A Flavonoid 3-0
           Glucosyltransferase Reveals The Basis For Plant Natural
           Product Modification
          Length = 456

 Score =  155 bits (393), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 187/377 (49%), Gaps = 18/377 (4%)

Query: 38  IPEGLADDDIYSGNIIAIIMHLNANCGAPFHKCLVQMMKQQMPADEIVCIIYDELMYFAE 97
           I +G+ +  +++G     I          F + +V  + +      + C++ D  ++FA 
Sbjct: 69  ISDGVPEGYVFAGRPQEDIELFTRAAPESFRQGMVMAVAET--GRPVSCLVADAFIWFAA 126

Query: 98  SAANQLKLRSIILRTNSAATQISRIALLQLKED-GSIPLQDPSN-LADPVPRLQPLRFKD 155
             A ++ +  +   T    +  + + + +++E  G   +Q   + L + +P +  +RF+D
Sbjct: 127 DMAAEMGVAWLPFWTAGPNSLSTHVYIDEIREKIGVSGIQGREDELLNFIPGMSKVRFRD 186

Query: 156 LPFS-QFG-LPENFLQLIPKIYNVR-TSKAVIWNTMNCIEXXXXXXXXXXCNIPIFPVGP 212
           L     FG L   F +++ ++  V   + AV  N+   ++                 +GP
Sbjct: 187 LQEGIVFGNLNSLFSRMLHRMGQVLPKATAVFINSFEELDDSLTNDLKSKLKT-YLNIGP 245

Query: 213 LHKFAPSSPGSLLKEDTSCISWLNNQAPNSVLYVSLGSIVSMDKKELKEMAWGLYNSKQP 272
            +     +P  ++   T C+ WL  + P SV+Y+S G++ +    E+  ++  L  S+ P
Sbjct: 246 FNLI---TPPPVVPNTTGCLQWLKERKPTSVVYISFGTVTTPPPAEVVALSEALEASRVP 302

Query: 273 FLWVLRPSSTSASSGIELLPEGFEEAVGENGCVVKWAPQKDVLSHIAVGGFWSHCGWNST 332
           F+W LR  +         LPEGF E     G VV WAPQ +VL+H AVG F +HCGWNS 
Sbjct: 303 FIWSLRDKARVH------LPEGFLEKTRGYGMVVPWAPQAEVLAHEAVGAFVTHCGWNSL 356

Query: 333 LESICEGVPMICRPGFGDQRVSARYVSHVWRIGLQLENDL-EKGEVEKAVKQLMVEKEGQ 391
            ES+  GVP+ICRP FGDQR++ R V  V  IG+++E  +  K  +     Q++ +++G+
Sbjct: 357 WESVAGGVPLICRPFFGDQRLNGRMVEDVLEIGVRIEGGVFTKSGLMSCFDQILSQEKGK 416

Query: 392 EMRQRAKNLKEDVELCI 408
           ++R+  + L+E  +  +
Sbjct: 417 KLRENLRALRETADRAV 433


>pdb|2VCE|A Chain A, Characterization And Engineering Of The Bifunctional N-
           And O-glucosyltransferase Involved In Xenobiotic
           Metabolism In Plants
 pdb|2VCH|A Chain A, Characterization And Engineering Of The Bifunctional N-
           And O-Glucosyltransferase Involved In Xenobiotic
           Metabolism In Plants
 pdb|2VG8|A Chain A, Characterization And Engineering Of The Bifunctional N-
           And O-Glucosyltransferase Involved In Xenobiotic
           Metabolism In Plants
          Length = 480

 Score =  149 bits (375), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 84/212 (39%), Positives = 123/212 (58%), Gaps = 17/212 (8%)

Query: 206 PIFPVGPLHKFAPSSPGSLLKEDTSCISWLNNQAPNSVLYVSLGSIVSMDKKELKEMAWG 265
           P++PVGPL             E++ C+ WL+NQ   SVLYVS GS  ++  ++L E+A G
Sbjct: 235 PVYPVGPLVNIGKQEAKQ--TEESECLKWLDNQPLGSVLYVSFGSGGTLTCEQLNELALG 292

Query: 266 LYNSKQPFLWVLRPSSTSASSG----------IELLPEGFEEAVGENGCVVK-WAPQKDV 314
           L +S+Q FLWV+R  S  A+S           +  LP GF E   + G V+  WAPQ  V
Sbjct: 293 LADSEQRFLWVIRSPSGIANSSYFDSHSQTDPLTFLPPGFLERTKKRGFVIPFWAPQAQV 352

Query: 315 LSHIAVGGFWSHCGWNSTLESICEGVPMICRPGFGDQRVSARYVSHVWRIGLQL----EN 370
           L+H + GGF +HCGWNSTLES+  G+P+I  P + +Q+++A  +S   R  L+     + 
Sbjct: 353 LAHPSTGGFLTHCGWNSTLESVVSGIPLIAWPLYAEQKMNAVLLSEDIRAALRPRAGDDG 412

Query: 371 DLEKGEVEKAVKQLMVEKEGQEMRQRAKNLKE 402
            + + EV + VK LM  +EG+ +R + K LKE
Sbjct: 413 LVRREEVARVVKGLMEGEEGKGVRNKMKELKE 444


>pdb|3HBF|A Chain A, Structure Of Ugt78g1 Complexed With Myricetin And Udp
 pdb|3HBJ|A Chain A, Structure Of Ugt78g1 Complexed With Udp
          Length = 454

 Score =  148 bits (373), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 109/382 (28%), Positives = 177/382 (46%), Gaps = 29/382 (7%)

Query: 31  PKFSFQSIPEGLADDDIYSGN----IIAIIMHLNANCGAPFHKCLVQMMKQQMPADEIVC 86
           P   + ++ +GL    + SGN    I   I  +  N      + + +  K       I C
Sbjct: 67  PNIKYYNVHDGLPKGYVSSGNPREPIFLFIKAMQENFKHVIDEAVAETGKN------ITC 120

Query: 87  IIYDELMYFAESAANQLKLRSIILRTNSAATQISRIALLQLKED-GSIPLQDPSNLADPV 145
           ++ D   +F    A ++  + + L T    + ++ +    ++E  GS  + D  ++ D +
Sbjct: 121 LVTDAFFWFGADLAEEMHAKWVPLWTAGPHSLLTHVYTDLIREKTGSKEVHDVKSI-DVL 179

Query: 146 PRLQPLRFKDLP---FSQFGLPENFLQLIPKI-YNVRTSKAVIWNTMNCIEXXXXXXXXX 201
           P    L+  DLP        +P  F  ++ K+   +  + AV  N+   I          
Sbjct: 180 PGFPELKASDLPEGVIKDIDVP--FATMLHKMGLELPRANAVAINSFATIHPLIENELNS 237

Query: 202 XCNIPIFPVGPLHKFAPSSPGSLLKEDTSCISWLNNQAPNSVLYVSLGSIVSMDKKELKE 261
              + +  VGP   F  ++P   + ++  C+ WL+    +SV+Y+S GS+V+    EL  
Sbjct: 238 KFKL-LLNVGP---FNLTTPQRKVSDEHGCLEWLDQHENSSVVYISFGSVVTPPPHELTA 293

Query: 262 MAWGLYNSKQPFLWVLRPSSTSASSGIELLPEGFEEAVGENGCVVKWAPQKDVLSHIAVG 321
           +A  L     PF+W  R          E LP+GF E     G +V WAPQ ++L H +VG
Sbjct: 294 LAESLEECGFPFIWSFRGDPK------EKLPKGFLERTKTKGKIVAWAPQVEILKHSSVG 347

Query: 322 GFWSHCGWNSTLESICEGVPMICRPGFGDQRVSARYVSHVWRIGLQLEND-LEKGEVEKA 380
            F +H GWNS LE I  GVPMI RP FGDQ ++      V  IG+ ++N  L K  ++KA
Sbjct: 348 VFLTHSGWNSVLECIVGGVPMISRPFFGDQGLNTILTESVLEIGVGVDNGVLTKESIKKA 407

Query: 381 VKQLMVEKEGQEMRQRAKNLKE 402
           ++  M  ++G  MRQ+   LKE
Sbjct: 408 LELTMSSEKGGIMRQKIVKLKE 429


>pdb|2ACV|A Chain A, Crystal Structure Of Medicago Truncatula Ugt71g1
 pdb|2ACV|B Chain B, Crystal Structure Of Medicago Truncatula Ugt71g1
          Length = 463

 Score =  127 bits (318), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 84/240 (35%), Positives = 127/240 (52%), Gaps = 25/240 (10%)

Query: 178 RTSKAVIWNTMNCIEXXXXXXXXXXCNI--PIFPVGPLHKF-APSSPGSLLKEDTSCISW 234
           R +K +I NT + +E               PI+ VGPL       +P     +    + W
Sbjct: 210 RDTKGIIVNTFSDLEQSSIDALYDHDEKIPPIYAVGPLLDLKGQPNPKLDQAQHDLILKW 269

Query: 235 LNNQAPNSVLYVSLGSI-VSMDKKELKEMAWGLYNSKQPFLWVLRPSSTSASSGIELLPE 293
           L+ Q   SV+++  GS+ VS    +++E+A GL +S   FLW       S S+  ++ PE
Sbjct: 270 LDEQPDKSVVFLCFGSMGVSFGPSQIREIALGLKHSGVRFLW-------SNSAEKKVFPE 322

Query: 294 GFEE--AVGENGCVVKWAPQKDVLSHIAVGGFWSHCGWNSTLESICEGVPMICRPGFGDQ 351
           GF E   +   G +  WAPQ +VL+H A+GGF SHCGWNS LES+  GVP++  P + +Q
Sbjct: 323 GFLEWMELEGKGMICGWAPQVEVLAHKAIGGFVSHCGWNSILESMWFGVPILTWPIYAEQ 382

Query: 352 RVSARYVSHVWRIGLQLENDLEKG-------EVEKAVKQLM-----VEKEGQEMRQRAKN 399
           +++A  +   W +GL L  D  KG       E+EK +K LM     V K+ QEM++ ++N
Sbjct: 383 QLNAFRLVKEWGVGLGLRVDYRKGSDVVAAEEIEKGLKDLMDKDSIVHKKVQEMKEMSRN 442


>pdb|2ACW|A Chain A, Crystal Structure Of Medicago Truncatula Ugt71g1 Complexed
           With Udp-Glucose
 pdb|2ACW|B Chain B, Crystal Structure Of Medicago Truncatula Ugt71g1 Complexed
           With Udp-Glucose
          Length = 465

 Score =  127 bits (318), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 84/240 (35%), Positives = 127/240 (52%), Gaps = 25/240 (10%)

Query: 178 RTSKAVIWNTMNCIEXXXXXXXXXXCNI--PIFPVGPLHKF-APSSPGSLLKEDTSCISW 234
           R +K +I NT + +E               PI+ VGPL       +P     +    + W
Sbjct: 210 RDTKGIIVNTFSDLEQSSIDALYDHDEKIPPIYAVGPLLDLKGQPNPKLDQAQHDLILKW 269

Query: 235 LNNQAPNSVLYVSLGSI-VSMDKKELKEMAWGLYNSKQPFLWVLRPSSTSASSGIELLPE 293
           L+ Q   SV+++  GS+ VS    +++E+A GL +S   FLW       S S+  ++ PE
Sbjct: 270 LDEQPDKSVVFLCFGSMGVSFGPSQIREIALGLKHSGVRFLW-------SNSAEKKVFPE 322

Query: 294 GFEE--AVGENGCVVKWAPQKDVLSHIAVGGFWSHCGWNSTLESICEGVPMICRPGFGDQ 351
           GF E   +   G +  WAPQ +VL+H A+GGF SHCGWNS LES+  GVP++  P + +Q
Sbjct: 323 GFLEWMELEGKGMICGWAPQVEVLAHKAIGGFVSHCGWNSILESMWFGVPILTWPIYAEQ 382

Query: 352 RVSARYVSHVWRIGLQLENDLEKG-------EVEKAVKQLM-----VEKEGQEMRQRAKN 399
           +++A  +   W +GL L  D  KG       E+EK +K LM     V K+ QEM++ ++N
Sbjct: 383 QLNAFRLVKEWGVGLGLRVDYRKGSDVVAAEEIEKGLKDLMDKDSIVHKKVQEMKEMSRN 442


>pdb|2O6L|A Chain A, Crystal Structure Of The Udp-Glucuronic Acid Binding
           Domain Of The Human Drug Metabolizing
           Udp-Glucuronosyltransferase 2b7
 pdb|2O6L|B Chain B, Crystal Structure Of The Udp-Glucuronic Acid Binding
           Domain Of The Human Drug Metabolizing
           Udp-Glucuronosyltransferase 2b7
          Length = 170

 Score = 52.0 bits (123), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 50/119 (42%), Gaps = 15/119 (12%)

Query: 234 WLNNQAPNSVLYVSLGSIVSMDKKELKE-MAWGLYNSKQPFLWVLRPSSTSASSGIELLP 292
           ++ +   N V+  SLGS VS   +E    +A  L    Q  LW    +            
Sbjct: 14  FVQSSGENGVVVFSLGSXVSNXTEERANVIASALAQIPQKVLWRFDGNKP---------- 63

Query: 293 EGFEEAVGENGCVVKWAPQKDVLSHIAVGGFWSHCGWNSTLESICEGVPMICRPGFGDQ 351
               + +G N  + KW PQ D+L H     F +H G N   E+I  G+P +  P F DQ
Sbjct: 64  ----DTLGLNTRLYKWIPQNDLLGHPKTRAFITHGGANGIYEAIYHGIPXVGIPLFADQ 118


>pdb|2IYA|A Chain A, The Crystal Structure Of Macrolide Glycosyltransferases: A
           Blueprint For Antibiotic Engineering
 pdb|2IYA|B Chain B, The Crystal Structure Of Macrolide Glycosyltransferases: A
           Blueprint For Antibiotic Engineering
          Length = 424

 Score = 41.2 bits (95), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 32/57 (56%), Gaps = 2/57 (3%)

Query: 302 NGCVVKWAPQKDVLSHIAVGGFWSHCGWNSTLESICEGVPMICRPGFGDQRVSARYV 358
           N  V +W PQ D+L+  +   F +H G  ST+E++   VPM+  P   +Q ++A  +
Sbjct: 306 NVEVHQWVPQLDILTKAS--AFITHAGMGSTMEALSNAVPMVAVPQIAEQTMNAERI 360


>pdb|3OTG|A Chain A, Crystal Structure Of Calg1, Calicheamicin
           Glycostyltransferase, Tdp Bound Form
 pdb|3OTH|A Chain A, Crystal Structure Of Calg1, Calicheamicin
           Glycostyltransferase, Tdp And Calicheamicin Alpha3i
           Bound Form
 pdb|3OTH|B Chain B, Crystal Structure Of Calg1, Calicheamicin
           Glycostyltransferase, Tdp And Calicheamicin Alpha3i
           Bound Form
          Length = 412

 Score = 35.4 bits (80), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 40/165 (24%), Positives = 69/165 (41%), Gaps = 13/165 (7%)

Query: 225 LKEDTSCISWLNNQ-APNSVLYVSLGSIVSMDKKELKEMAWGLYNSKQPFLWVLRPSSTS 283
             E     +WL+++     ++Y++LG+      + L+    GL       L         
Sbjct: 225 FAEQGDLPAWLSSRDTARPLVYLTLGTSSGGTVEVLRAAIDGLAGLDADVL--------- 275

Query: 284 ASSGIELLPEGFEEAVGENGCVVKWAPQKDVLSHIAVGGFWSHCGWNSTLESICEGVPMI 343
            +SG  L   G  E V  N  +  W PQ  +L H+ +     H G  +TL ++  GVP +
Sbjct: 276 VASGPSLDVSGLGE-VPANVRLESWVPQAALLPHVDL--VVHHGGSGTTLGALGAGVPQL 332

Query: 344 CRPGFGDQRVSARYVSHVWRIGLQLENDLEKGEVEKAVKQLMVEK 388
             P  GD   +A+ V+        L +++    V  A K+L+ E+
Sbjct: 333 SFPWAGDSFANAQAVAQAGAGDHLLPDNISPDSVSGAAKRLLAEE 377


>pdb|3IA7|A Chain A, Crystal Structure Of Calg4, The Calicheamicin
           Glycosyltransferase
 pdb|3IA7|B Chain B, Crystal Structure Of Calg4, The Calicheamicin
           Glycosyltransferase
          Length = 402

 Score = 31.2 bits (69), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 22/84 (26%), Positives = 38/84 (45%), Gaps = 7/84 (8%)

Query: 307 KWAPQKDVLSHIAVGGFWSHCGWNSTLESICEGVPMICRPGFGDQRVSARYVSHVWRIGL 366
           +W P   VL+H       +H    + LE+   GVP++  P F  +  +A     V  +GL
Sbjct: 287 QWIPFHSVLAH--ARACLTHGTTGAVLEAFAAGVPLVLVPHFATE--AAPSAERVIELGL 342

Query: 367 Q---LENDLEKGEVEKAVKQLMVE 387
                 + LE   + +AV++L  +
Sbjct: 343 GSVLRPDQLEPASIREAVERLAAD 366


  Database: pdbaa
    Posted date:  Mar 3, 2013 10:34 PM
  Number of letters in database: 14,973,337
  Number of sequences in database:  62,578
  
Lambda     K      H
   0.319    0.135    0.415 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,401,388
Number of Sequences: 62578
Number of extensions: 518260
Number of successful extensions: 1058
Number of sequences better than 100.0: 14
Number of HSP's better than 100.0 without gapping: 9
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 1036
Number of HSP's gapped (non-prelim): 14
length of query: 428
length of database: 14,973,337
effective HSP length: 102
effective length of query: 326
effective length of database: 8,590,381
effective search space: 2800464206
effective search space used: 2800464206
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 53 (25.0 bits)