BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 021438
         (312 letters)

Database: pdbaa 
           62,578 sequences; 14,973,337 total letters

Searching..................................................done



>pdb|1C9B|A Chain A, Crystal Structure Of A Human Tbp Core Domain-Human Tfiib
           Core Domain Complex Bound To An Extended, Modified
           Adenoviral Major Late Promoter (Admlp)
 pdb|1C9B|E Chain E, Crystal Structure Of A Human Tbp Core Domain-Human Tfiib
           Core Domain Complex Bound To An Extended, Modified
           Adenoviral Major Late Promoter (Admlp)
 pdb|1C9B|I Chain I, Crystal Structure Of A Human Tbp Core Domain-Human Tfiib
           Core Domain Complex Bound To An Extended, Modified
           Adenoviral Major Late Promoter (Admlp)
 pdb|1C9B|M Chain M, Crystal Structure Of A Human Tbp Core Domain-Human Tfiib
           Core Domain Complex Bound To An Extended, Modified
           Adenoviral Major Late Promoter (Admlp)
 pdb|1C9B|Q Chain Q, Crystal Structure Of A Human Tbp Core Domain-Human Tfiib
           Core Domain Complex Bound To An Extended, Modified
           Adenoviral Major Late Promoter (Admlp)
          Length = 207

 Score =  154 bits (390), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 87/210 (41%), Positives = 125/210 (59%), Gaps = 9/210 (4%)

Query: 103 DRNLIQAFKSISAMSDRLGLVTTIKDRANEIYKKVEDQKPLRGRNQEAIVAACLYIACRQ 162
           DR ++ AFK I+ M+DR+ L   I DR N ++K+V +QK L+GR  +AI +ACLYIACRQ
Sbjct: 2   DRAMMNAFKEITTMADRINLPRNIVDRTNNLFKQVYEQKSLKGRANDAIASACLYIACRQ 61

Query: 163 ENKPRTVKEFCSVANGTTKKEIGRAKEFIVKHLEAEMGQSVEMGTIHASDYLRRFCSNLG 222
           E  PRT KE C+V+   +KKEIGR  + I+K LE     SV++  I   D++ RFCSNL 
Sbjct: 62  EGVPRTFKEICAVSR-ISKKEIGRCFKLILKALET----SVDL--ITTGDFMSRFCSNLC 114

Query: 223 MTNXXXXXXXXXXXXSEDLDI--RRSPISVAAAVIYIITQLSNDTKPLKEISIVTRVAEG 280
           +              + +LD+   RSPISVAAA IY+ +Q S + +  KEI  +  VA+ 
Sbjct: 115 LPKQVQMAATHIARKAVELDLVPGRSPISVAAAAIYMASQASAEKRTQKEIGDIAGVADV 174

Query: 281 TIKNVYKDLFPHLARIIPDWFANEEDIKNL 310
           TI+  Y+ ++P    + P  F  +  +  L
Sbjct: 175 TIRQSYRLIYPRAPDLFPTDFKFDTPVDKL 204


>pdb|2PHG|A Chain A, Model For Vp16 Binding To Tfiib
          Length = 206

 Score =  152 bits (384), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 86/209 (41%), Positives = 124/209 (59%), Gaps = 9/209 (4%)

Query: 104 RNLIQAFKSISAMSDRLGLVTTIKDRANEIYKKVEDQKPLRGRNQEAIVAACLYIACRQE 163
           R ++ AFK I+ M+DR+ L   I DR N ++K+V +QK L+GR  +AI +ACLYIACRQE
Sbjct: 2   RAMMNAFKEITTMADRINLPRNIVDRTNNLFKQVYEQKSLKGRANDAIASACLYIACRQE 61

Query: 164 NKPRTVKEFCSVANGTTKKEIGRAKEFIVKHLEAEMGQSVEMGTIHASDYLRRFCSNLGM 223
             PRT KE C+V+   +KKEIGR  + I+K LE     SV++  I   D++ RFCSNL +
Sbjct: 62  GVPRTFKEICAVSR-ISKKEIGRCFKLILKALET----SVDL--ITTGDFMSRFCSNLCL 114

Query: 224 TNXXXXXXXXXXXXSEDLDI--RRSPISVAAAVIYIITQLSNDTKPLKEISIVTRVAEGT 281
                         + +LD+   RSPISVAAA IY+ +Q S + +  KEI  +  VA+ T
Sbjct: 115 PKQVQMAATHIARKAVELDLVPGRSPISVAAAAIYMASQASAEKRTQKEIGDIAGVADVT 174

Query: 282 IKNVYKDLFPHLARIIPDWFANEEDIKNL 310
           I+  Y+ ++P    + P  F  +  +  L
Sbjct: 175 IRQSYRLIYPRAPDLFPTDFKFDTPVDKL 203


>pdb|1TFB|A Chain A, Nmr Studies Of Human General Transcription Factor Tfiib:
           Dynamics And Interaction With Vp16 Activation Domain, 20
           Structures
          Length = 208

 Score =  152 bits (384), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 86/209 (41%), Positives = 124/209 (59%), Gaps = 9/209 (4%)

Query: 104 RNLIQAFKSISAMSDRLGLVTTIKDRANEIYKKVEDQKPLRGRNQEAIVAACLYIACRQE 163
           R ++ AFK I+ M+DR+ L   I DR N ++K+V +QK L+GR  +AI +ACLYIACRQE
Sbjct: 4   RAMMNAFKEITTMADRINLPRNIVDRTNNLFKQVYEQKSLKGRANDAIASACLYIACRQE 63

Query: 164 NKPRTVKEFCSVANGTTKKEIGRAKEFIVKHLEAEMGQSVEMGTIHASDYLRRFCSNLGM 223
             PRT KE C+V+   +KKEIGR  + I+K LE     SV++  I   D++ RFCSNL +
Sbjct: 64  GVPRTFKEICAVSR-ISKKEIGRCFKLILKALET----SVDL--ITTGDFMSRFCSNLCL 116

Query: 224 TNXXXXXXXXXXXXSEDLDI--RRSPISVAAAVIYIITQLSNDTKPLKEISIVTRVAEGT 281
                         + +LD+   RSPISVAAA IY+ +Q S + +  KEI  +  VA+ T
Sbjct: 117 PKQVQMAATHIARKAVELDLVPGRSPISVAAAAIYMASQASAEKRTQKEIGDIAGVADVT 176

Query: 282 IKNVYKDLFPHLARIIPDWFANEEDIKNL 310
           I+  Y+ ++P    + P  F  +  +  L
Sbjct: 177 IRQSYRLIYPRAPDLFPTDFKFDTPVDKL 205


>pdb|1VOL|A Chain A, Tfiib (Human Core Domain)TBP (A.THALIANA)TATA ELEMENT
           Ternary Complex
          Length = 204

 Score =  145 bits (365), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 82/207 (39%), Positives = 121/207 (58%), Gaps = 9/207 (4%)

Query: 106 LIQAFKSISAMSDRLGLVTTIKDRANEIYKKVEDQKPLRGRNQEAIVAACLYIACRQENK 165
           ++ AFK I+ M+DR+ L     DR N ++++  +QK L+GR  +AI +ACLYIACRQE  
Sbjct: 2   MMNAFKEITTMADRINLPRNKVDRTNNLFRQAYEQKSLKGRANDAIASACLYIACRQEGV 61

Query: 166 PRTVKEFCSVANGTTKKEIGRAKEFIVKHLEAEMGQSVEMGTIHASDYLRRFCSNLGMTN 225
           PRT KE C+V+   +KKEIGR  + I+K LE     SV++  I   D++ RFCSNL +  
Sbjct: 62  PRTFKEICAVSR-ISKKEIGRCFKLILKALET----SVDL--ITTGDFMSRFCSNLCLPK 114

Query: 226 XXXXXXXXXXXXSEDLDI--RRSPISVAAAVIYIITQLSNDTKPLKEISIVTRVAEGTIK 283
                       + +LD+   RSPISVAAA IY+ +Q S + +  KEI  +  VA+ TI+
Sbjct: 115 QVQMAATHIARKAVELDLVPGRSPISVAAAAIYMASQASAEKRTQKEIGDIAGVADVTIR 174

Query: 284 NVYKDLFPHLARIIPDWFANEEDIKNL 310
             Y+ ++P    + P  F  +  +  L
Sbjct: 175 QSYRLIYPRAPDLFPTDFKFDTPVDKL 201


>pdb|3K7A|M Chain M, Crystal Structure Of An Rna Polymerase Ii-Tfiib Complex
 pdb|4BBR|M Chain M, Structure Of Rna Polymerase Ii-tfiib Complex
 pdb|4BBS|M Chain M, Structure Of An Initially Transcribing Rna Polymerase Ii-
           Tfiib Complex
          Length = 345

 Score =  115 bits (289), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 93/311 (29%), Positives = 138/311 (44%), Gaps = 16/311 (5%)

Query: 6   CADCKRLT-EVVFDHSAGDTICSECGLVLEAYSVDETSEWRIFANESSDH---DPVRVXX 61
           C +CK    ++V   S GD +C+ CGLVL    VD  SEWR F+N+  DH   DP RV  
Sbjct: 24  CPECKVYPPKIVERFSEGDVVCALCGLVLSDKLVDTRSEWRTFSND--DHNGDDPSRVGE 81

Query: 62  XXXXXXXXXXXXTVIAKPTAXXXXXXXXXXXXXXQARSSHPDRNLIQAFKSISAMSDRLG 121
                       T I K                 +      D  +  AF  I+ + D   
Sbjct: 82  ASNPLLDGNNLSTRIGKGETTDMRFTKELNKAQGKNVMDKKDNEVQAAFAKITMLCDAAE 141

Query: 122 LVTTIKDRANEIYKKVEDQKPLRGRNQEAIVAACLYIACRQENKPRTVKEFCSVANGTTK 181
           L   +KD A E YK   D+K L+G++ E+I+AA + I CR+    RT KE  S+ +  T 
Sbjct: 142 LPKIVKDCAKEAYKLCHDEKTLKGKSMESIMAASILIGCRRAEVARTFKEIQSLIHVKT- 200

Query: 182 KEIGRAKEFIVKHL--EAEMG----QSVEMGTIHASDYLRRFCSNLGMTNXXXXXXXXXX 235
           KE G+    +   L  ++E G     +  M       Y+ RFCS+LG+            
Sbjct: 201 KEFGKTLNIMKNILRGKSEDGFLKIDTDNMSGAQNLTYIPRFCSHLGLPMQVTTSAEYTA 260

Query: 236 XXSEDLD--IRRSPISVAAAVIYIITQLSNDTKPLKEISIVTRVAEGTIKNVYKDLFPHL 293
              +++     +SPI++A   IY+   L        ++    +V EGTIK+ YK L+ H 
Sbjct: 261 KKCKEIKEIAGKSPITIAVVSIYLNILLFQIPITAAKVGQTLQVTEGTIKSGYKILYEHR 320

Query: 294 ARII-PDWFAN 303
            +++ P   AN
Sbjct: 321 DKLVDPQLIAN 331


>pdb|1D3U|B Chain B, Tata-Binding ProteinTRANSCRIPTION FACTOR (II)BBRE+TATA-
           Box Complex From Pyrococcus Woesei
          Length = 201

 Score = 93.6 bits (231), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 104/202 (51%), Gaps = 7/202 (3%)

Query: 99  SSHPDRNLIQAFKSISAMSDRLGLVTTIKDRANEIYKKVEDQKPLRGRNQEAIVAACLYI 158
           S   +RNL  A   +  ++ +L L   +++ A  +Y++   +  +RGR+ E+++AAC+Y 
Sbjct: 3   SDAAERNLAFALSELDRITAQLKLPRHVEEEAARLYREAVRKGLIRGRSIESVMAACVYA 62

Query: 159 ACRQENKPRTVKEFCSVANGTTKKEIGRAKEFIVKHLEAEMGQSVEMGTIHASDYLRRFC 218
           ACR    PRT+ E   +A    KKEIGR+  FI ++L      + +   +  +DY+ +F 
Sbjct: 63  ACRLLKVPRTLDEIADIAR-VDKKEIGRSYRFIARNL----NLTPKKLFVKPTDYVNKFA 117

Query: 219 SNLGMTNXXXXXXXXXXXXS--EDLDIRRSPISVAAAVIYIITQLSNDTKPLKEISIVTR 276
             LG++             +    L   +SP  + AA +YI + L  + +  +E++ V R
Sbjct: 118 DELGLSEKVRRRAIEILDEAYKRGLTSGKSPAGLVAAALYIASLLEGEKRTQREVAEVAR 177

Query: 277 VAEGTIKNVYKDLFPHLARIIP 298
           V E T++N YK+L   L   +P
Sbjct: 178 VTEVTVRNRYKELVEKLKIKVP 199


>pdb|1AIS|B Chain B, Tata-Binding ProteinTRANSCRIPTION FACTOR (II)BTATA-Box
           Complex From Pyrococcus Woesei
          Length = 200

 Score = 93.6 bits (231), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 104/202 (51%), Gaps = 7/202 (3%)

Query: 99  SSHPDRNLIQAFKSISAMSDRLGLVTTIKDRANEIYKKVEDQKPLRGRNQEAIVAACLYI 158
           S   +RNL  A   +  ++ +L L   +++ A  +Y++   +  +RGR+ E+++AAC+Y 
Sbjct: 2   SDAAERNLAFALSELDRITAQLKLPRHVEEEAARLYREAVRKGLIRGRSIESVMAACVYA 61

Query: 159 ACRQENKPRTVKEFCSVANGTTKKEIGRAKEFIVKHLEAEMGQSVEMGTIHASDYLRRFC 218
           ACR    PRT+ E   +A    KKEIGR+  FI ++L      + +   +  +DY+ +F 
Sbjct: 62  ACRLLKVPRTLDEIADIAR-VDKKEIGRSYRFIARNL----NLTPKKLFVKPTDYVNKFA 116

Query: 219 SNLGMTNXXXXXXXXXXXXS--EDLDIRRSPISVAAAVIYIITQLSNDTKPLKEISIVTR 276
             LG++             +    L   +SP  + AA +YI + L  + +  +E++ V R
Sbjct: 117 DELGLSEKVRRRAIEILDEAYKRGLTSGKSPAGLVAAALYIASLLEGEKRTQREVAEVAR 176

Query: 277 VAEGTIKNVYKDLFPHLARIIP 298
           V E T++N YK+L   L   +P
Sbjct: 177 VTEVTVRNRYKELVEKLKIKVP 198


>pdb|1RLY|A Chain A, Rdc-Derived Models Of The Zinc Ribbon Domain Of Human
          General Transcription Tfiib (Zinc Bound Structures)
 pdb|1RO4|A Chain A, Rdc-Derived Models Of The Zinc Ribbon Domain Of Human
          General Transcription Factor Tfiib (Zinc Free
          Structures)
          Length = 60

 Score = 44.3 bits (103), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 19/38 (50%), Positives = 26/38 (68%)

Query: 15 VVFDHSAGDTICSECGLVLEAYSVDETSEWRIFANESS 52
          +V D+ AGD IC ECGLV+    +D  SEWR F+N+ +
Sbjct: 23 LVEDYRAGDMICPECGLVVGDRVIDVGSEWRTFSNDKA 60


>pdb|1DL6|A Chain A, Solution Structure Of Human Tfiib N-Terminal Domain
          Length = 58

 Score = 43.9 bits (102), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 19/36 (52%), Positives = 25/36 (69%)

Query: 15 VVFDHSAGDTICSECGLVLEAYSVDETSEWRIFANE 50
          +V D+ AGD IC ECGLV+    +D  SEWR F+N+
Sbjct: 22 LVEDYRAGDMICPECGLVVGDRVIDVGSEWRTFSND 57


>pdb|3K1F|M Chain M, Crystal Structure Of Rna Polymerase Ii In Complex With
          Tfiib
          Length = 197

 Score = 43.1 bits (100), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 1/45 (2%)

Query: 6  CADCKRLT-EVVFDHSAGDTICSECGLVLEAYSVDETSEWRIFAN 49
          C +CK    ++V   S GD +C+ CGLVL    VD  SEWR F+N
Sbjct: 24 CPECKVYPPKIVERFSEGDVVCALCGLVLSDKLVDTRSEWRTFSN 68


>pdb|1ZP2|A Chain A, Structure Of The Mediator Subunit Cyclin C
          Length = 235

 Score = 38.1 bits (87), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 11/91 (12%)

Query: 104 RNLIQAFKSISAMSDRLGL-----VTTIKDRANEIYKKVEDQKPLRGRNQEAIVAACLYI 158
           ++ I  +K +    DRL L      T I      + KK E+    +G + EA+VA C+Y+
Sbjct: 26  KDTIYQWKVVQTFGDRLRLRQRVLATAIVLLRRYMLKKNEE----KGFSLEALVATCIYL 81

Query: 159 ACRQENKPRTVKEFCSVAN--GTTKKEIGRA 187
           +C+ E  P  ++  C+ AN   + K ++ R+
Sbjct: 82  SCKVEECPVHIRTICNEANDLWSLKVKLSRS 112


>pdb|1PFT|A Chain A, N-Terminal Domain Of Tfiib, Nmr
          Length = 50

 Score = 37.7 bits (86), Expect = 0.007,   Method: Composition-based stats.
 Identities = 15/42 (35%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 6  CADCKRLTEVVFDHSAGDTICSECGLVLEAYSVDETSEWRIF 47
          C  C+   E+++D   G+ +C++CG V+E   +D   EWR F
Sbjct: 8  CPACES-AELIYDPERGEIVCAKCGYVIEENIIDMGPEWRAF 48


>pdb|4HC7|A Chain A, Crystal Structure Of The Full Dna Binding Domain Of
          Gata3-Complex 2
 pdb|4HC7|B Chain B, Crystal Structure Of The Full Dna Binding Domain Of
          Gata3-Complex 2
 pdb|4HC9|A Chain A, Dna Binding By Gata Transcription Factor-Complex 3
 pdb|4HCA|A Chain A, Dna Binding By Gata Transcription Factor-Complex 1
          Length = 115

 Score = 32.0 bits (71), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 12/40 (30%), Positives = 25/40 (62%), Gaps = 1/40 (2%)

Query: 2  ADSYCADCKRLTEVVFDHSA-GDTICSECGLVLEAYSVDE 40
          A + CA+C+  T  ++  +A GD +C+ CGL  + ++++ 
Sbjct: 58 AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNINR 97


>pdb|3DFV|D Chain D, Adjacent Gata Dna Binding
 pdb|3DFV|C Chain C, Adjacent Gata Dna Binding
 pdb|3DFX|A Chain A, Opposite Gata Dna Binding
 pdb|3DFX|B Chain B, Opposite Gata Dna Binding
          Length = 63

 Score = 29.3 bits (64), Expect = 2.6,   Method: Composition-based stats.
 Identities = 12/39 (30%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 2  ADSYCADCKRLTEVVFDHSA-GDTICSECGLVLEAYSVD 39
          A + CA+C+  T  ++  +A GD +C+ CGL  + ++++
Sbjct: 6  AGTSCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNIN 44


>pdb|1GAT|A Chain A, Solution Structure Of The Specific Dna Complex Of The
          Zinc Containing Dna Binding Domain Of The Erythroid
          Transcription Factor Gata-1 By Multidimensional Nmr
 pdb|1GAU|A Chain A, Solution Structure Of The Specific Dna Complex Of The
          Zinc Containing Dna Binding Domain Of The Erythroid
          Transcription Factor Gata-1 By Multidimensional Nmr
          Length = 60

 Score = 28.1 bits (61), Expect = 7.0,   Method: Composition-based stats.
 Identities = 12/39 (30%), Positives = 23/39 (58%), Gaps = 1/39 (2%)

Query: 2  ADSYCADCKRLTEVVFDHSA-GDTICSECGLVLEAYSVD 39
          A + C++C+  T  ++  S  GD +C+ CGL  + + V+
Sbjct: 3  AGTVCSNCQTSTTTLWRRSPMGDPVCNACGLYYKLHQVN 41


>pdb|3GAT|A Chain A, Solution Nmr Structure Of The C-Terminal Domain Of
          Chicken Gata-1 Bound To Dna, 34 Structures
 pdb|2GAT|A Chain A, Solution Structure Of The C-Terminal Domain Of Chicken
          Gata- 1 Bound To Dna, Nmr, Regularized Mean Structure
          Length = 66

 Score = 27.7 bits (60), Expect = 8.0,   Method: Composition-based stats.
 Identities = 12/39 (30%), Positives = 23/39 (58%), Gaps = 1/39 (2%)

Query: 2  ADSYCADCKRLTEVVFDHSA-GDTICSECGLVLEAYSVD 39
          A + C++C+  T  ++  S  GD +C+ CGL  + + V+
Sbjct: 3  AGTVCSNCQTSTTTLWRRSPMGDPVCNACGLYYKLHQVN 41


  Database: pdbaa
    Posted date:  Mar 3, 2013 10:34 PM
  Number of letters in database: 14,973,337
  Number of sequences in database:  62,578
  
Lambda     K      H
   0.318    0.132    0.380 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,343,235
Number of Sequences: 62578
Number of extensions: 239313
Number of successful extensions: 518
Number of sequences better than 100.0: 17
Number of HSP's better than 100.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 476
Number of HSP's gapped (non-prelim): 18
length of query: 312
length of database: 14,973,337
effective HSP length: 99
effective length of query: 213
effective length of database: 8,778,115
effective search space: 1869738495
effective search space used: 1869738495
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 51 (24.3 bits)