BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>000792
MGGYDSGSASKDGRDEDDEEEYEDVSGGNRLLGFMFGNVDYAGDLDVDYLDEDAKEHLAA
VADKLGPSLTDIDVWNSQSCFQLFCSYCLLSVNSPQPPVDAVEQDYDKKAEDAVDYEDID
EQYEGPEIQIASEEDYLLPKKEYFAAEVSLAALKPTASPFDDENYDEDEELEKEHEEVDK
ETEDTTTILSGEQVECATAVPDGEKSPEGDPQVGSLGAEEEMTAGVKDYDEELADILKGP
LDGQVSTPLPVLCVEDGKVILRFSEIFGIHEPLKKGKKRDQRYSTPKDKYNTMDVSSLVE
EDEEVYLKGSGQGFPLFKEANIFQHGIFSLNDDDSELVKFGVEQDAATISEHDEQRKDSC
ICSEPMKEESNVNPSVGWKSMPSPNFFPLDQHDWEENIVWDNSPAASDNSIESHEIAGAD
VESALMRGIELDTGQNNFHERSTSPNEKDCIVNMQNSPVLSETFGSKSSSDNTSHLFTES
RYHPQLLRLESQLDLDNHSHANGIKENVSIKLCQSDAVKRFSKHSLQNRDMMEGSWLDNI
IWEPVDAVGKPKLILDLQDEQMLFEILDNKDDEHPLLHAGAMIITRSAKPSDFKPAESVK
LFYLGKELEDHKSLADQNVRPNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAFKKKSDL
SVKDGHVFLMEYCEERPLLLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLE
PGDKSPFLGDIKAGCSQSSLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVV
AQQEPLMEVMSPGSKNLQTYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEA
IIRKKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKH
LGITQLTLPASISSAMSQLPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIER
LEITGVGDPSGRGLGFSYVRAAPKASVSSAMVKKKAAANRGGSTVTGTDADLRRLSMEAA
REVLLKFNVPEEMIAKQTRWHRIAMIRKLSSEQAASGVQLQQQTRGKCQEIWDRQVQSLS
AADDDEIGSDSEHSDLDSFAGDLENLLDAEEFEEEESNYDTKHDKVEGVKGLKMRRRPIQ
VQAEEEIEDEAAEAAELCRLLMDDDEAELKKKKKKTKAQVEGGLSLAKSISGLEIVERLK
KANKPAKHIAITVQPNGSHTANEQIKDPKEEESLIAKRNLSGKVQAMKKNNVQGEKIIKG
DFCLWSLWSAWTHEDQQKLPQI

High Scoring Gene Products

Symbol | Full name | Information | P value
HAF01 | AT1G32750 | protein from Arabidopsis thaliana | 1.2e-253
HAF2 | AT3G19040 | protein from Arabidopsis thaliana | 2.9e-251
taf1 | transcription initiation factor TFIID subunit 1 | gene from Dictyostelium discoideum | 4.3e-69
Taf1 | TAF1 RNA polymerase II, TATA box binding protein (TBP)-associated factor | protein from Mus musculus | 1.6e-68
Taf1 | TAF1 RNA polymerase II, TATA box binding protein (TBP)-associated factor | gene from Rattus norvegicus | 4.7e-68
TAF1 | Transcription initiation factor TFIID subunit 1 | protein from Mesocricetus auratus | 6.0e-68
TAF1 | Transcription initiation factor TFIID subunit 1 | protein from Homo sapiens | 1.0e-67
N-TAF1 | TAF1 RNA polymerase II, TATA box binding protein (TBP)-associated factor, neuron specific isoform | protein from Homo sapiens | 1.1e-67
TAF1 | Uncharacterized protein | protein from Bos taurus | 1.3e-67
taf1 | TAF1 RNA polymerase II, TATA box binding protein (TBP)-associated factor | gene_product from Danio rerio | 1.5e-67
TAF1L | Transcription initiation factor TFIID subunit 1-like | protein from Homo sapiens | 5.1e-67
TAF1 | Uncharacterized protein | protein from Canis lupus familiaris | 2.2e-66
TAF1 | Uncharacterized protein | protein from Canis lupus familiaris | 2.7e-66
TAF1 | Uncharacterized protein | protein from Canis lupus familiaris | 6.5e-66
I3LP77 | Uncharacterized protein | protein from Sus scrofa | 4.5e-63
Taf1 | TBP-associated factor 1 | protein from Drosophila melanogaster | 2.7e-55
TAF1 | Uncharacterized protein | protein from Gallus gallus | 6.7e-46
TAF1 | Uncharacterized protein | protein from Gallus gallus | 7.7e-46
taf-1 | | gene from Caenorhabditis elegans | 2.2e-44
taf-1 | Protein TAF-1 | protein from Caenorhabditis elegans | 2.2e-44
TAF1 | TFIID subunit (145 kDa) | gene from Saccharomyces cerevisiae | 1.4e-22

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  000792
        (1282 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2035574 - symbol:HAF01 "AT1G32750" species:370...  1795  1.2e-253  3
TAIR|locus:2085909 - symbol:HAF2 "AT3G19040" species:3702...  1802  2.9e-251  2
DICTYBASE|DDB_G0292242 - symbol:taf1 "transcription initi...   651  4.3e-69   5
MGI|MGI:1336878 - symbol:Taf1 "TAF1 RNA polymerase II, TA...   445  1.6e-68   5
RGD|1562050 - symbol:Taf1 "TAF1 RNA polymerase II, TATA b...   444  4.7e-68   5
UNIPROTKB|Q60544 - symbol:TAF1 "Transcription initiation ...   448  6.0e-68   5
UNIPROTKB|P21675 - symbol:TAF1 "Transcription initiation ...   448  1.0e-67   5
UNIPROTKB|B1Q2X3 - symbol:N-TAF1 "TAF1 RNA polymerase II,...   448  1.1e-67   5
UNIPROTKB|F1MF62 - symbol:TAF1 "Uncharacterized protein" ...   448  1.3e-67   5
ZFIN|ZDB-GENE-030131-5576 - symbol:taf1 "TAF1 RNA polymer...   443  1.5e-67   5
UNIPROTKB|Q8IZX4 - symbol:TAF1L "Transcription initiation...   440  5.1e-67   5
UNIPROTKB|E2QSZ4 - symbol:TAF1 "Uncharacterized protein" ...   448  2.2e-66   5
UNIPROTKB|J9NZ21 - symbol:TAF1 "Uncharacterized protein" ...   448  2.7e-66   5
UNIPROTKB|F6UPV2 - symbol:TAF1 "Uncharacterized protein" ...   446  6.5e-66   5
UNIPROTKB|I3LP77 - symbol:I3LP77 "Uncharacterized protein...   448  4.5e-63   5
FB|FBgn0010355 - symbol:Taf1 "TBP-associated factor 1" sp...   573  2.7e-55   6
UNIPROTKB|F1P5U7 - symbol:TAF1 "Uncharacterized protein" ...   441  6.7e-46   5
UNIPROTKB|F1P5U8 - symbol:TAF1 "Uncharacterized protein" ...   441  7.7e-46   5
WB|WBGene00006382 - symbol:taf-1 species:6239 "Caenorhabd...   351  2.2e-44   6
UNIPROTKB|G5EGM3 - symbol:taf-1 "Protein TAF-1" species:6...   351  2.2e-44   6
POMBASE|SPAC2G11.14 - symbol:taf111 "transcription factor...   342  4.9e-27   2
ASPGD|ASPL0000011251 - symbol:AN3907 species:162425 "Emer...   311  6.3e-26   5
SGD|S000003506 - symbol:TAF1 "TFIID subunit (145 kDa)" sp...   300  1.4e-22   2
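
The hit table above is fixed-layout text, so it can be read back into structured records. What follows is a minimal sketch in Python, assuming each row has the form "DB|accession - description  score  P(N)  N" as in the rows above. The names HIT_LINE, Hit, and parse_hit_line are hypothetical helpers for illustration and are not part of WU-BLAST.

```python
import re
from typing import NamedTuple, Optional

# Assumed row layout: DB|accession - description <score> <P(N)> <N>
HIT_LINE = re.compile(
    r"^(?P<db>[^|\s]+)\|(?P<acc>\S+) - (?P<desc>.*?)\s+"
    r"(?P<score>\d+)\s+(?P<p>\d+(?:\.\d+)?(?:e[+-]?\d+)?)\s+(?P<n>\d+)\s*$"
)


class Hit(NamedTuple):
    database: str          # source database prefix, e.g. "TAIR"
    accession: str         # identifier after the "|" separator
    symbol: Optional[str]  # value of "symbol:" in the description, if present
    score: int             # high score of the best segment pair
    p_value: float         # smallest sum probability P(N)
    n: int                 # number of segment pairs in the group


def parse_hit_line(line: str) -> Optional[Hit]:
    """Parse one summary row; return None for lines that are not hit rows."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    sym = re.search(r"symbol:(\S+)", m["desc"])
    return Hit(
        database=m["db"],
        accession=m["acc"],
        symbol=sym.group(1) if sym else None,
        score=int(m["score"]),
        p_value=float(m["p"]),
        n=int(m["n"]),
    )
```

Lines that do not match the assumed layout, such as the column header, return None, so the function can be applied to every line of the report and the None results discarded.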


>TAIR|locus:2035574
            symbol:HAF01 "AT1G32750" species:3702 "Arabidopsis
            thaliana" [GO:0003677 "DNA binding" evidence=ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0004402 "histone acetyltransferase
            activity" evidence=ISS] [GO:0009294 "DNA mediated transformation"
            evidence=IMP] [GO:0000394 "RNA splicing, via endonucleolytic
            cleavage and ligation" evidence=RCA] [GO:0006366 "transcription
            from RNA polymerase II promoter" evidence=RCA] InterPro:IPR000626
            InterPro:IPR001487 Pfam:PF00240 Pfam:PF00439 PRINTS:PR00503
            PROSITE:PS50014 SMART:SM00213 SMART:SM00297 Prosite:PS00299
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005634 GO:GO:0006355
            GO:GO:0006351 GO:GO:0016568 EMBL:AC017118 InterPro:IPR019955
            PROSITE:PS50053 GO:GO:0009294 Gene3D:1.20.920.10 SUPFAM:SSF47370
            PROSITE:PS00633 eggNOG:COG5076 InterPro:IPR018359 HSSP:Q91888
            KO:K03125 InterPro:IPR022591 Pfam:PF12157 EMBL:AF510669
            IPI:IPI00524275 PIR:D86452 RefSeq:NP_174552.1 UniGene:At.40077
            UniGene:At.48257 ProteinModelPortal:Q8LRK9 SMR:Q8LRK9 IntAct:Q8LRK9
            STRING:Q8LRK9 PaxDb:Q8LRK9 PRIDE:Q8LRK9 EnsemblPlants:AT1G32750.1
            GeneID:840169 KEGG:ath:AT1G32750 TAIR:At1g32750
            HOGENOM:HOG000030311 InParanoid:Q8LRK9 OMA:IANFHRP PhylomeDB:Q8LRK9
            ProtClustDB:CLSN2682826 Genevestigator:Q8LRK9 Gene3D:1.10.1100.10
            InterPro:IPR009067 Pfam:PF09247 SUPFAM:SSF47055 Uniprot:Q8LRK9
        Length = 1919

 Score = 1795 (636.9 bits), Expect = 1.2e-253, Sum P(3) = 1.2e-253
 Identities = 407/731 (55%), Positives = 493/731 (67%)

Query:   389 LDQHDWEENIVWDNSPAASDNSIESHEIAGADVESALMRGIELDTGQN-NFHERSTSPNE 447
             L    W ++I+WD+    S + +    I     E  +    + +  +N   H  +     
Sbjct:   502 LGDEAWLDSIIWDSEKELSRSKL----IFDLQDEQMVFEIFDNEESKNLQLHAGAM---- 553

Query:   448 KDCIVNMQNSPVLSETF--GSKSSSDNTSHLFTESRYHPQLLRLESQLDLDNHSHANGIK 505
                IV+ ++S    ETF  G +S+S    +L  +  Y    +  +S   L     AN  K
Sbjct:   554 ---IVS-RSSKSKDETFQEGCESNSGWQFNLSNDKFY----MNGKSSQQLQ----ANTNK 601

Query:   506 ENV-SIKLCQS-DAVK-RFSKHSLQNRDMMEGSWLDNIIWEPVD---AVGKPKLILDLQD 559
              +V S+++  S  A+K +  K  L N+D+         +W P D   A+ +   +     
Sbjct:   602 SSVHSLRVFHSVPAIKLQTMKSKLSNKDIANFH-RPKALWYPHDNELAIKQQGKLPTRGS 660

Query:   560 EQMLFEILDNKDDEHPLLHAGA--MIITRSAKPS---DFKPAESVKLFYLGKELEDHKSL 614
              +++ + L  K  +   LH G    + +  AK S   DFK  E+VK+FY GKEL+D KSL
Sbjct:   661 MKIIVKSLGGKGSK---LHVGIEESVSSLRAKASRKLDFKETEAVKMFYKGKELDDEKSL 717

Query:   615 ADQNVRPNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAFKKKSDLSVKDGHVFLMEYCE 674
             A QNV+PNSL+HLIRTK+HL P AQKLPGENKSLRPPGAFKKKSDLS KDGHVFLMEYCE
Sbjct:   718 AAQNVQPNSLVHLIRTKVHLWPWAQKLPGENKSLRPPGAFKKKSDLSTKDGHVFLMEYCE 777

Query:   675 ERPLLLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGDIKAG 734
             ERPL+LSNAGMGANLCTYYQKSSP DQ G LL + ++ LGNV+ LEPGDKSPFLG+I AG
Sbjct:   778 ERPLMLSNAGMGANLCTYYQKSSPEDQRGNLLRNQSDTLGNVMILEPGDKSPFLGEIHAG 837

Query:   735 CSQSSLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVVAQQEPLMEVMSPGS 794
             CSQSS+ETNMY+AP+F  ++ +TD+LLVRS KGK+S+RRIDK+ VV QQEP MEVMSPGS
Sbjct:   838 CSQSSVETNMYKAPIFPQRLQSTDYLLVRSPKGKLSLRRIDKIVVVGQQEPRMEVMSPGS 897

Query:   795 KNLQTYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIRKKLKECAFLRR 854
             KNLQTY +NRMLV VYREF      G  P I  DELS  F NL++AII+K +K  A  +R
Sbjct:   898 KNLQTYLVNRMLVYVYREFFKRGG-GEHP-IAADELSFLFSNLTDAIIKKNMKIIACWKR 955

Query:   855 DGNGKQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLPASISS 914
             D NG+  W+ K +   P E +L+KL  PEHVCSYESM AGLYRLKHLGIT+ TLPASIS+
Sbjct:   956 DKNGQSYWTKKDSLLEPPESELKKLVAPEHVCSYESMLAGLYRLKHLGITRFTLPASISN 1015

Query:   915 AMSQLPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGL 974
             A++QLPDEAIALAAASHIERELQITPWNLSSNFVACTNQDR NIERLEITGVGDPSGRGL
Sbjct:  1016 ALAQLPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRANIERLEITGVGDPSGRGL 1075

Query:   975 GFSYVRXXXXXXXXXXXXXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMI 1034
             GFSYVR                   RG  TVTGTDADLRRLSMEAAREVL+KFNVP+E+I
Sbjct:  1076 GFSYVRAAPKAPAAAGHMKKKAAAGRGAPTVTGTDADLRRLSMEAAREVLIKFNVPDEII 1135

Query:  1035 AKQTRWHRIAMIRKLSSEQAASGVQLQQQTRGKC---QEIWDRQVQSLSAADDDEIGSDS 1091
             AKQTRWHRIAMIRKLSSEQAASGV++   T GK    Q +   Q+Q  +     EI  D 
Sbjct:  1136 AKQTRWHRIAMIRKLSSEQAASGVKVDPTTIGKYARGQRMSFLQMQQQAREKCQEIW-DR 1194

Query:  1092 EHSDLDSFAGD 1102
             +   L +F GD
Sbjct:  1195 QLLSLSAFDGD 1205

 Score = 823 (294.8 bits), Expect = 6.6e-107, Sum P(2) = 6.6e-107
 Identities = 220/567 (38%), Positives = 314/567 (55%)

Query:    90 LSVNSPQPPVDAVEQDYDKKAEDAVDYEDIDEQYEGPEIQIASEEDYLLPKKEYFAAEVS 149
             L   S +   D  EQDYD+KAEDAVDYEDIDE+Y+GPE+Q+ SEED+LLPKKEYF+  V+
Sbjct:    72 LLAKSERTASDPAEQDYDRKAEDAVDYEDIDEEYDGPEVQVVSEEDHLLPKKEYFSTAVA 131

Query:   150 LAALKPTASPFXXXXXXXXXXXXXXXXXXXXXXXXXXXILSGEQVECATAVPDGEKSPEG 209
             L +LK  AS F                            L  E+ E      D  K+ E 
Sbjct:   132 LGSLKSRASVFDDEDYDEEEEQEEEQAPVEKS-------LETEKREPVVLKED--KALEY 182

Query:   210 DPQVGSLGAEEEM-TAGVKDYDEELADILKGPLDGQVSTPLPVLCVEDGKVILRFSEIFG 268
             + +   L  E+ M T  V++  EE+ ++L+G LD + +TPLP L VEDG VIL+FSEIF 
Sbjct:   183 EEEASILDKEDHMDTEDVQE--EEVDELLEGTLDDKGATPLPTLYVEDGMVILQFSEIFA 240

Query:   269 IHEPLKKGKKRDQRYSTPKDKYNTMXXXXXXXXXXXXYLKGSGQGFPLFKEANIFQHGI- 327
             IHEP +K  +R+ RY T +DKY +M             LK  G+     ++A++ Q  + 
Sbjct:   241 IHEPPQKRDRRENRYVTCRDKYKSMDISELVEDDEEVLLKSHGRIDTHVEQADLIQLDVP 300

Query:   328 FSLNDDDSELVKFGVEQDAATIS-EHDEQRKDSCICSEPMKEESNVNPSVGWKSMPSPNF 386
             F + +   +LVK          S E  +  +DSCI  E +K++   + S   +S  S   
Sbjct:   301 FPIREG-LQLVKASTIGGITPESREFTKLGRDSCIMGELLKQDFIDDNSSLCQSQLSMQV 359

Query:   387 FPLDQHDWEENIVWDNSPAASDNSIESHEIAGADVESALMRGIELDTGQNNFHE-RSTSP 445
             FPLDQH+WE  I+W++SP  S NS E  E  G + E  L++G   +T Q + +   S   
Sbjct:   360 FPLDQHEWERRIIWEHSPEISGNSGEIFE-PGLEPEGMLVKGTNSETEQESLNVVNSRVQ 418

Query:   446 NEKDCIVNMQNSPVLSETFGSKSSSDNTSHLFTESRYHPQLLRLESQLDLDNHSHAN--- 502
              + D  + +  S  L E+FGS+ S  +T+    +SR+HPQLLRLESQ D +NH   N   
Sbjct:   419 VQADNNMFVPFSANLLESFGSRGSQ-STNESTNKSRHHPQLLRLESQWD-ENHLSGNDEA 476

Query:   503 GIKENVSIKLCQSDAVKRFSKHSLQNRDMMEGSWLDNIIWEPVDAVGKPKLILDLQDEQM 562
             G+K+   IK  + DA+ RFS+  L+ RD+ + +WLD+IIW+    + + KLI DLQDEQM
Sbjct:   477 GVKK---IKRLEKDALGRFSRLVLRERDLGDEAWLDSIIWDSEKELSRSKLIFDLQDEQM 533

Query:   563 LFEILDNKDDEHPLLHAGAMIITRSAKPSD--FKPA-ESVK--LFYLGKELEDHKSLADQ 617
             +FEI DN++ ++  LHAGAMI++RS+K  D  F+   ES     F L  +       + Q
Sbjct:   534 VFEIFDNEESKNLQLHAGAMIVSRSSKSKDETFQEGCESNSGWQFNLSNDKFYMNGKSSQ 593

Query:   618 NVRPN---SLIHLIRTKIHLLPRAQKL 641
              ++ N   S +H +R   H +P A KL
Sbjct:   594 QLQANTNKSSVHSLRV-FHSVP-AIKL 618

 Score = 374 (136.7 bits), Expect = 1.2e-253, Sum P(3) = 1.2e-253
 Identities = 78/129 (60%), Positives = 93/129 (72%)

Query:    32 LGFMFGNVDYAGDLDVDYLDEDAKEHLAAVADKLGPSLTDIDVWNSQSCFQLFCSYCLLS 91
             LGF+FGNVD +GDLD DYLDEDAKEHL+A+ADKLG SL DI+                L 
Sbjct:    30 LGFIFGNVDNSGDLDADYLDEDAKEHLSALADKLGSSLPDIN----------------LL 73

Query:    92 VNSPQPPVDAVEQDYDKKAEDAVDYEDIDEQYEGPEIQIASEEDYLLPKKEYFAAEVSLA 151
               S +   D  EQDYD+KAEDAVDYEDIDE+Y+GPE+Q+ SEED+LLPKKEYF+  V+L 
Sbjct:    74 AKSERTASDPAEQDYDRKAEDAVDYEDIDEEYDGPEVQVVSEEDHLLPKKEYFSTAVALG 133

Query:   152 ALKPTASPF 160
             +LK  AS F
Sbjct:   134 SLKSRASVF 142

 Score = 314 (115.6 bits), Expect = 1.2e-253, Sum P(3) = 1.2e-253
 Identities = 84/217 (38%), Positives = 115/217 (52%)

Query:  1046 IRKLSSEQAASGVQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSE-HSDLDSFAGDXX 1104
             I K +  Q  S +Q+QQQ R KCQEIWDRQ+ SLSA D DE  S++E +SDLDSFAGD  
Sbjct:  1166 IGKYARGQRMSFLQMQQQAREKCQEIWDRQLLSLSAFDGDENESENEANSDLDSFAGDLE 1225

Query:  1105 XXXXXXXXXXXXSNYDTKHDKVEGVKGLKMRRRPIQVQXXXXXXXXXXXXXXLCRLLMDD 1164
                          +  +K+DK++GVKGLKMRRRP QV+              LCRLLM D
Sbjct:  1226 NLLDAEEGGEGEESNISKNDKLDGVKGLKMRRRPSQVETDEEIEDEATEYAELCRLLMQD 1285

Query:  1165 DEAELXXXXXXXXAQVEGGLSLAKSISGLEIVERLKKANKPAKHIAITVQPNGSHTANEQ 1224
             ++ +          +  G     +    L+  E ++KAN   K   I +QP+ S   NE 
Sbjct:  1286 EDQKKKKKKMKGVGEGMGSYPPPRPNIALQSGEPVRKANAMDKK-PIAIQPDASFLVNES 1344

Query:  1225 -IKDPKEEESLIAKRNLSGKVQAMKKNNVQGE-KIIK 1259
              IKD +  +S+I  +   GK Q  + +N  G+ K +K
Sbjct:  1345 TIKDNRNVDSII--KTPKGK-QVKENSNSLGQLKKVK 1378

 Score = 55 (24.4 bits), Expect = 1.6e-29, Sum P(3) = 1.6e-29
 Identities = 30/128 (23%), Positives = 54/128 (42%)

Query:   325 HGIFSLNDDDSELVKFGVEQDAATISEHDEQRKDSCICSEPMKEESNVNPSVGW-KSMPS 383
             H I  +    SE    GV+ D  TI ++   ++ S +    M++++       W + + S
Sbjct:  1142 HRIAMIRKLSSEQAASGVKVDPTTIGKYARGQRMSFL---QMQQQAREKCQEIWDRQLLS 1198

Query:   384 PNFFPLDQHDWEENIVWDNSPAASD--NSIESHEIAGADVESALMRGIELDTGQNNFHER 441
              + F  D+++ E     D    A D  N +++ E  G   ES + +  +LD  +     R
Sbjct:  1199 LSAFDGDENESENEANSDLDSFAGDLENLLDAEE-GGEGEESNISKNDKLDGVKGLKMRR 1257

Query:   442 STSPNEKD 449
               S  E D
Sbjct:  1258 RPSQVETD 1265

 Score = 49 (22.3 bits), Expect = 6.0e-79, Sum P(2) = 6.0e-79
 Identities = 11/47 (23%), Positives = 23/47 (48%)

Query:   828 DELSVQFPNLSEAIIRKKLKECAFLRRDGNGKQVWSMKRTFHIPSEG 874
             D L +    +  + IR K+++  +  R+     VW +K   H+ ++G
Sbjct:  1835 DYLDIVENPMDLSTIRDKVRKIEYRNREQFRHDVWQIKYNAHLYNDG 1881
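
Each segment pair above opens with a "Score = ..." line and an "Identities = ..." line. As a rough sketch, assuming those two lines keep the layout shown in this report, they could be parsed as follows. The function name parse_hsp_header and the dictionary keys are illustrative only. The percentages printed in this report appear to be truncated rather than rounded (13/62 is shown as 20%), and the computed values make that easy to check.

```python
import re

# Assumed layout of the two header lines that precede each alignment block.
SCORE_RE = re.compile(
    r"Score = (\d+) \(([\d.]+) bits\), Expect = ([^,\s]+), "
    r"Sum P\((\d+)\) = (\S+)"
)
IDENT_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), Positives = (\d+)/(\d+) \((\d+)%\)"
)


def parse_hsp_header(score_line, ident_line):
    """Return the numeric fields of one segment-pair header as a dict."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(ident_line)
    if s is None or i is None:
        raise ValueError("not an HSP header")
    identical, aligned = int(i.group(1)), int(i.group(2))
    positive = int(i.group(4))
    return {
        "score": int(s.group(1)),
        "bits": float(s.group(2)),
        "expect": float(s.group(3)),
        "group_size": int(s.group(4)),  # the N in "Sum P(N)"
        "sum_p": float(s.group(5)),
        "identical": identical,
        "positive": positive,
        "aligned": aligned,
        # Recomputed from the counts, not copied from the printed percentages.
        "pct_identity": 100.0 * identical / aligned,
        "pct_positive": 100.0 * positive / aligned,
    }
```

Recomputing the percentages from the raw counts avoids inheriting the truncation in the printed values when hits are compared or ranked.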


>TAIR|locus:2085909
            symbol:HAF2 "AT3G19040" species:3702 "Arabidopsis
            thaliana" [GO:0003677 "DNA binding" evidence=ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0004402 "histone acetyltransferase
            activity" evidence=ISS] [GO:0003712 "transcription cofactor
            activity" evidence=TAS] [GO:0005669 "transcription factor TFIID
            complex" evidence=TAS] [GO:0009416 "response to light stimulus"
            evidence=IEP] [GO:0016573 "histone acetylation" evidence=IMP]
            [GO:0016020 "membrane" evidence=IDA] [GO:0000394 "RNA splicing, via
            endonucleolytic cleavage and ligation" evidence=RCA] [GO:0006366
            "transcription from RNA polymerase II promoter" evidence=RCA]
            InterPro:IPR000626 InterPro:IPR001487 Pfam:PF00240 Pfam:PF00439
            PRINTS:PR00503 PROSITE:PS50014 SMART:SM00213 SMART:SM00297
            Prosite:PS00299 EMBL:CP002686 GenomeReviews:BA000014_GR
            GO:GO:0016020 GO:GO:0006355 GO:GO:0016573 GO:GO:0006351
            GO:GO:0009416 GO:GO:0003712 InterPro:IPR019955 PROSITE:PS50053
            Gene3D:1.20.920.10 SUPFAM:SSF47370 PROSITE:PS00633 eggNOG:COG5076
            InterPro:IPR018359 EMBL:AP000735 GO:GO:0005669 HSSP:Q91888
            KO:K03125 InterPro:IPR022591 Pfam:PF12157 HOGENOM:HOG000030311
            ProtClustDB:CLSN2682826 EMBL:AY579213 IPI:IPI00527667
            RefSeq:NP_188534.2 UniGene:At.50197 ProteinModelPortal:Q6PUA2
            SMR:Q6PUA2 STRING:Q6PUA2 EnsemblPlants:AT3G19040.1 GeneID:821437
            KEGG:ath:AT3G19040 TAIR:At3g19040 InParanoid:Q6PUA2 OMA:DGNESEN
            PhylomeDB:Q6PUA2 Genevestigator:Q6PUA2 Uniprot:Q6PUA2
        Length = 1786

 Score = 1802 (639.4 bits), Expect = 2.9e-251, Sum P(2) = 2.9e-251
 Identities = 415/795 (52%), Positives = 507/795 (63%)

Query:   502 NGIKENV-SIKLCQS-DAVK-RFSKHSLQNRDMMEGSWLDNIIWEPVD---AVGKPKLIL 555
             N  K  V S+++  S  A+K +  K  L N++          +W P D   A+ + K IL
Sbjct:   511 NAKKSTVHSLRVFHSAPAIKLQTMKIKLSNKERANFH-RPKALWYPHDNELAIKQQK-IL 568

Query:   556 DLQDEQMLFEILDNKDDEHPLLHAGA--MIITRSAKPS---DFKPAESVKLFYLGKELED 610
               Q    +  ++ +   +  LL  G    + +  AK S   DFK  E+VK+FY+GKELED
Sbjct:   569 PTQGSMTI--VVKSLGGKGSLLTVGREESVSSLKAKASRKLDFKETEAVKMFYMGKELED 626

Query:   611 HKSLADQNVRPNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAFKKKSDLSVKDGHVFLM 670
              KSLA+QNV+PNSL+HL+RTK+HL P AQKLPGENKSLRPPGAFKKKSDLS +DGHVFLM
Sbjct:   627 EKSLAEQNVQPNSLVHLLRTKVHLWPWAQKLPGENKSLRPPGAFKKKSDLSNQDGHVFLM 686

Query:   671 EYCEERPLLLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGD 730
             EYCEERPL+LSNAGMGANLCTYYQKSSP DQ G LL + ++ LG+V+ LE G+KSPFLG+
Sbjct:   687 EYCEERPLMLSNAGMGANLCTYYQKSSPEDQHGNLLRNQSDTLGSVIILEHGNKSPFLGE 746

Query:   731 IKAGCSQSSLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVVAQQEPLMEVM 790
             +  GCSQSS+ETNMY+APVF H++ +TD+LLVRSAKGK+S+RRI+K+  V QQEP ME+M
Sbjct:   747 VHGGCSQSSVETNMYKAPVFPHRLQSTDYLLVRSAKGKLSLRRINKIVAVGQQEPRMEIM 806

Query:   791 SPGSKNLQTYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIRKKLKECA 850
             SP SKNL  Y +NRM+  VYREF    +      I  DELS  F N+S+A +RK ++ C+
Sbjct:   807 SPASKNLHAYLVNRMMAYVYREFKHRDR------IAADELSFSFSNISDATVRKYMQVCS 860

Query:   851 FLRRDGNGKQVWSMKRTFH-IPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLP 909
              L RD NGK  WS KR F  IP  G L  L  PE VCSYESM AGL+RLKHLGIT+ TLP
Sbjct:   861 DLERDANGKACWSKKRKFDKIPL-G-LNTLVAPEDVCSYESMLAGLFRLKHLGITRFTLP 918

Query:   910 ASISSAMSQLPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDP 969
             ASIS+A++QLPDE IA  AASHI RELQITPWNLSS+FV C  Q RENIERLEITGVGDP
Sbjct:   919 ASISTALAQLPDERIA--AASHIARELQITPWNLSSSFVTCATQGRENIERLEITGVGDP 976

Query:   970 SGRGLGFSYVRXXXXXXXXXXXXXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNV 1029
             SGRGLGFSYVR                   RG  TVTGTDAD RRLSMEAAREVLLKFNV
Sbjct:   977 SGRGLGFSYVRVAPKSSAASEHKKKKAAACRGVPTVTGTDADPRRLSMEAAREVLLKFNV 1036

Query:  1030 PEEMIAKQTRWHRIAMIRKLSSEQAASG-------------------VQLQQQTRGKCQE 1070
             P+E+IAKQT+ HR AMIRK+SSEQAASG                   +QLQQQ R  C E
Sbjct:  1037 PDEIIAKQTQRHRTAMIRKISSEQAASGGKVGPTTVGMFSRSQRMSFLQLQQQAREMCHE 1096

Query:  1071 IWDRQVQSLSAADDDEIGSDSE-HSDLDSFAGDXXXXXXXXXXXXXXSNYDTKHDKVEGV 1129
             IWDRQ  SLSA DDD   S++E +SDLDSF GD               +  + ++K++GV
Sbjct:  1097 IWDRQRLSLSACDDDGNESENEANSDLDSFVGDLEDLLDAEDGGEGEESNKSMNEKLDGV 1156

Query:  1130 KGLKMRRRPIQVQXXXXXXXXXXXXXXLCRLLMDDDEAELXXXXXXXXAQVEGGLSLAKS 1189
             KGLKMRR P QV+              LCRLLM D+  +            EG  S    
Sbjct:  1157 KGLKMRRWPSQVEKDEEIEDEAAEYVELCRLLMQDENDKKKKKLKDVG---EGIGSFPPP 1213

Query:  1190 ISGLE-IVERLKKANKPAKHIAITVQPNGSHTANEQ---IKDPKEEESLIAKRNLSGKVQ 1245
              S  E  +++   A +P     I  +    HT N      K PK+++     + +   + 
Sbjct:  1214 RSNFEPFIDKKYIATEPDASFLIVNESTVKHTKNVDKATSKSPKDKQV----KEIGTPIC 1269

Query:  1246 AMKKNNVQGEKIIKG 1260
              MKK   + +K+  G
Sbjct:  1270 QMKKILKENQKVFMG 1284

 Score = 641 (230.7 bits), Expect = 2.9e-251, Sum P(2) = 2.9e-251
 Identities = 176/487 (36%), Positives = 253/487 (51%)

Query:   114 VDYEDIDEQYEGPEIQIASEEDYLLPKKEYFAAEVSLAALKPTASPFXXXXXXXXXXXXX 173
             VDY   DE+Y+GPE+Q+ +EED+LLPK+EY +A  +L+ L   AS F             
Sbjct:     5 VDYGSNDEEYDGPELQVVTEEDHLLPKREYLSAAFALSGLNSRASVFDDEDYDEQGGQEK 64

Query:   174 XXXXXXXXXXXXXXILSGEQVECATAVPDGEKSPEGDPQVGSLGAEEEMTAGVKDYDEEL 233
                               E+ E    V   EK  + + +   LG + +M  G  D  EEL
Sbjct:    65 EHVPVEKS-------FDSEERE--PVVLKEEKPVKHEKEASILGNKNQMDTG--DVQEEL 113

Query:   234 A-DILKGPLDGQVSTPLPVLCVED-GKVILRFSEIFGIHEPLKKGKKRDQRYSTPKDKYN 291
                + +  LD +  TPLP L +ED G VIL+FSEIF I EP KK +KR+ R  T +DKY 
Sbjct:   114 VVGLSEATLDEKRVTPLPTLYLEDDGMVILQFSEIFAIQEPQKKRQKREIRCITYRDKYI 173

Query:   292 TMXXXXXXXXXXXXYLKGSGQGFPLFKEANIFQHGIFSLNDDDSELVKFGVEQDAATIS- 350
             +M             LK  G+     K+ +  Q  +     + S+LVK G+ +D  + S 
Sbjct:   174 SMDISELIEDDEEVLLKSHGRIDTHGKKTDQIQLDVPLPIRERSQLVKSGIVRDTTSESR 233

Query:   351 EHDEQRKDSCICSEPMKEESNVNPSVGWKSMPSPNFFPLDQHDWEENIVWDNSPAASDNS 410
             E  +  +DSCI  E +K++   + S   +S  +   FPLDQ +WE  I+W+ SP  S N 
Sbjct:   234 EFTKLGRDSCIMGELLKQDLKDDNSSLCQSQLTMEVFPLDQQEWEHLILWEISPQFSANC 293

Query:   411 IESH----EIAGADVESALMRGIELDTGQNNFHERS-TSPNEKDCIVNMQNSPVLSETFG 465
              E      E AG  V+      +      N  +    T  +  + +     +P+  E+FG
Sbjct:   294 CEGFKSGLESAGIMVQVRASNSVTEQESLNVMNSGGQTQGDNNNMLEPFFVNPL--ESFG 351

Query:   466 SKSSSDNTSHLFTESRYHPQLLRLESQLDLDNHSHANGIKENVSIKLCQSDAVKRFSKHS 525
             S+ S  +T+    +SR+HPQLLRLESQ D D H   NG     ++K   SDA  R S  +
Sbjct:   352 SRGSQ-STNESTNKSRHHPQLLRLESQWDED-HYRENGDAGRENLKQLNSDARGRLSGLA 409

Query:   526 LQNRDMMEGSWLDNIIWEPVDAVGKPKLILDLQDEQMLFEILDNKDDEHPLLHAGAMIIT 585
             LQ+RDM + SWLD+IIWE    + + KLI DLQDEQM+FE+ +NK+ ++  LHAG+ I++
Sbjct:   410 LQDRDMWDESWLDSIIWESDKDLSRSKLIFDLQDEQMIFEVPNNKERKYLQLHAGSRIVS 469

Query:   586 RSAKPSD 592
             RS+K  D
Sbjct:   470 RSSKSKD 476

 Score = 47 (21.6 bits), Expect = 1.9e-188, Sum P(2) = 1.9e-188
 Identities = 13/62 (20%), Positives = 31/62 (50%)

Query:    83 LFCSYCLLSVNSPQPPVDAVEQDYDKKAEDAVDYEDIDEQYEGPEIQ--IASEEDYLLPK 140
             L  ++ L  +NS     D  ++DYD++     ++  +++ ++  E +  +  EE  +  +
Sbjct:    35 LSAAFALSGLNSRASVFD--DEDYDEQGGQEKEHVPVEKSFDSEEREPVVLKEEKPVKHE 92

Query:   141 KE 142
             KE
Sbjct:    93 KE 94

 Score = 47 (21.6 bits), Expect = 4.9e-58, Sum P(2) = 4.9e-58
 Identities = 12/41 (29%), Positives = 20/41 (48%)

Query:   842 IRKKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRKLAYP 882
             IR K++   +  R      VW +K   H+ ++G  R L+ P
Sbjct:  1712 IRDKVRRMEYRDRQQFRHDVWQIKFNAHLYNDG--RNLSIP 1750


>DICTYBASE|DDB_G0292242
            symbol:taf1 "transcription initiation factor TFIID
            subunit 1" species:44689 "Dictyostelium discoideum" [GO:0045944
            "positive regulation of transcription from RNA polymerase II
            promoter" evidence=ISS] [GO:0016573 "histone acetylation"
            evidence=ISS] [GO:0005669 "transcription factor TFIID complex"
            evidence=ISS] [GO:0004402 "histone acetyltransferase activity"
            evidence=ISS] [GO:0005634 "nucleus" evidence=IEA]
            InterPro:IPR001487 Pfam:PF00439 PROSITE:PS50014 SMART:SM00297
            dictyBase:DDB_G0292242 GenomeReviews:CM000155_GR GO:GO:0045944
            Gene3D:1.20.920.10 SUPFAM:SSF47370 PROSITE:PS00633 eggNOG:COG5076
            InterPro:IPR018359 GO:GO:0004402 EMBL:AAFI02000188 GO:GO:0005669
            HSSP:Q92830 KO:K03125 InterPro:IPR022591 Pfam:PF12157
            RefSeq:XP_629749.1 ProteinModelPortal:Q54DH8 STRING:Q54DH8
            PRIDE:Q54DH8 EnsemblProtists:DDB0220687 GeneID:8628578
            KEGG:ddi:DDB_G0292242 InParanoid:Q54DH8 OMA:EHRSSEH Uniprot:Q54DH8
        Length = 2310

 Score = 651 (234.2 bits), Expect = 4.3e-69, Sum P(5) = 4.3e-69
 Identities = 171/505 (33%), Positives = 255/505 (50%)

Query:   657 KSDLSVKDGHVFLMEYCEERPLLLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNV 716
             KSDLS +DG + L+EY E+ P L+SN GMG  +  YY+K +  D     L   N   G +
Sbjct:   966 KSDLSARDGRLVLIEYTEQHPPLVSNVGMGLRIRNYYKKKNTHDTGPKDL---NFEDGEL 1022

Query:   717 LTLEPGDKSPFLGDIKAGCSQSSLETNMYRAPVFTHKVATTDFLLVRSAKGKI-SIRRID 775
             + L+  ++SPFLGDI  G +  S+  N+++ P+  H  A TDFLLV+S  GK   IR + 
Sbjct:  1023 VMLDNNEESPFLGDINPGQTIQSVVNNLFKVPIHKHNSANTDFLLVKSRDGKRWYIRDVG 1082

Query:   776 KVAVVAQQEPLMEVMSPGSKNLQTYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFP 835
              +    Q  P +EV +P S+N   +  +R+   +YR+F    K      + + ++   FP
Sbjct:  1083 PIYAAGQILPEVEVPAPNSRNANMFLKSRLQAYIYRQF--LKKSNPQRRLKITDICSAFP 1140

Query:   836 NLSEAIIRKKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGL 895
             + SE  IRK+LK+CA  +R G+    W++K  F +P+E + +KL  PE V S+ESM  GL
Sbjct:  1141 SQSETSIRKRLKDCADFQRGGDDSGWWTVKDNFTLPTEEEFQKLVTPEAVVSFESMLIGL 1200

Query:   896 YRLKHLGITQLTLPASISSAMSQLPDEAIALAAASHIERELQITPWNLSSNFVACTNQDR 955
              RL+  GI   T P +I + +  L DE     +   +E EL ITPWNL+ +F++   Q +
Sbjct:  1201 QRLQDNGIIHFTAPGTIPTILGNLDDEDPIKKSIKPVEDELSITPWNLTGSFLSAM-QGK 1259

Query:   956 ENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXXXXXXXXXNRGGSTVTGTDADLRRL 1015
                 RL+I    DP+GR   +SY++                      + VTGTDADLR+L
Sbjct:  1260 G---RLQIIS-DDPTGREDEYSYLKMPQKVVNQKQKAIKLALQK---NQVTGTDADLRKL 1312

Query:  1016 SMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSSEQAA---SGVQLQQQTRG------ 1066
             S+ A++ VLL+  V EE I K  RW RI ++RK SSE A    S   + +  RG      
Sbjct:  1313 SLSASKTVLLELGVDEETINKLARWQRIDLVRKKSSEAALASNSNAAMTKFARGSRYSLD 1372

Query:  1067 --------KCQEIWDRQVQSLSAADDDEIGSDSEHS---DL-DSFAGDXXXXXXXXXXXX 1114
                     +CQ ++D Q+++++   DD    D +     DL DS  GD            
Sbjct:  1373 HQNLQYKEQCQLVFDNQIKAIAGKGDDLYDEDLDADLLKDLEDSLFGDSNQSQNQNQNQN 1432

Query:  1115 XXSNYDTKHDKVEGVKGLKMRRRPI 1139
                N    ++     K    R R +
Sbjct:  1433 QNQNQSNNNNNKSSSKSTGKRSRSL 1457

 Score = 123 (48.4 bits), Expect = 4.3e-69, Sum P(5) = 4.3e-69
 Identities = 38/162 (23%), Positives = 74/162 (45%)

Query:   408 DNSIESHEIAGADVESALMRGIELDTGQNNFHERSTSPNEKDCIVNMQNSPVLSETFGSK 467
             D+  E  EI   D ++ L + +  +  +    ++  + NE+  I  + N+   S T  + 
Sbjct:   592 DHISEKEEIMDTDSQNLLQQPLSQEKDKEKEKDKDKAQNEQH-IQTIVNNKKNSTTNNNN 650

Query:   468 SSSDNTSHLFTESRYHPQLLRLESQLDLDNHSHANGIKENVSIKLCQSDAVKRFSKHSLQ 527
              +++N ++  T +  +       +  + +N++  N  KE   I       + ++S   + 
Sbjct:   651 LTNNNGNNNNTNNNNNNSNNNSNNNNNNNNNNKQNNNKE---IDEIMKSNIDKWSLFPIS 707

Query:   528 NRDMMEGSWLDNIIWE----PVDAVGKPKLILDLQDEQMLFE 565
             N+++  G W+DNIIW+    P        LILDL D +M FE
Sbjct:   708 NQELENGDWIDNIIWDESMVPEKIQQVSMLILDLNDREMYFE 749

 Score = 81 (33.6 bits), Expect = 1.0e-64, Sum P(5) = 1.0e-64
 Identities = 43/215 (20%), Positives = 86/215 (40%)

Query:   364 EPMKEESNVNPSVGWKSMPSPNFFPLDQHDWEENIVWDNSPAA----SDNSIESHEIAGA 419
             EP++E  +      +  +P  N+  L Q +WE+NI+WD         SD+  + +  +  
Sbjct:   440 EPLRETEDEEIMTMFYMVPDSNYHSLQQVNWEDNIIWDEESLIKFKKSDHFNQFYLSSDN 499

Query:   420 DVESALMR-GIELDTGQNNFHERSTSPNEKDCIVNMQNSPVLSETFGSKSSSDNTSHLFT 478
               ++ +++  I +    N+      +PN         +S   + T  + S++  T+   T
Sbjct:   500 QQQNIIIKEAITMPVEINDL-----TPNLNKNSKQTSSSSTTTSTTTTTSTTTTTTTTTT 554

Query:   479 ESRYHPQLLRLESQLDLDNHSHANGIKENVSIKLCQSDAVKRFSKHSLQNRDMMEGSWLD 538
              +          +Q +  N +  N    N S KL   D +    K  + + D        
Sbjct:   555 ATTLSSTSTSTTTQRNNKNKNKNNNNNINNSSKLFGLDHISE--KEEIMDTDSQ------ 606

Query:   539 NIIWEPVDAVGKPKLILD---LQDEQMLFEILDNK 570
             N++ +P+      +   D    Q+EQ +  I++NK
Sbjct:   607 NLLQQPLSQEKDKEKEKDKDKAQNEQHIQTIVNNK 641

 Score = 74 (31.1 bits), Expect = 4.3e-69, Sum P(5) = 4.3e-69
 Identities = 32/97 (32%), Positives = 47/97 (48%)

Query:    28 GNRLLGFMFGNVDYAGDLDVDYLDEDAKEHLAAVADKLGPSLTDIDVWNSQSCFQLFCSY 87
             GN L GF+FGNV  +G+LDV+  D+  K+       K      D++++   S    F   
Sbjct:     9 GN-LTGFLFGNVKESGELDVEENDQVFKD-----LKK------DLELFAKSSQHISFKKT 56

Query:    88 CLLSVNSPQPPVDAVEQDYDKKAEDAVDYEDIDEQYE 124
               +  +      D+V    DK   +A+DYEDIDE  E
Sbjct:    57 IGIDEDDKNAVTDSVIVP-DK---NALDYEDIDEVAE 89

 Score = 62 (26.9 bits), Expect = 9.2e-63, Sum P(5) = 9.2e-63
 Identities = 18/54 (33%), Positives = 28/54 (51%)

Query:   350 SEHDEQRKDSCICSEPMKEESNVNPSVGWKSMPSPNFFPLDQHDWEENIVWDNS 403
             + +++Q  +  I  E MK  SN++    W   P  N   L+  DW +NI+WD S
Sbjct:   679 NNNNKQNNNKEI-DEIMK--SNIDK---WSLFPISNQ-ELENGDWIDNIIWDES 725

 Score = 53 (23.7 bits), Expect = 4.3e-69, Sum P(5) = 4.3e-69
 Identities = 13/30 (43%), Positives = 19/30 (63%)

Query:   260 ILRFSEIFGIHEPLK--KGKKRDQRYSTPK 287
             IL+FS++F    P K  + K R  R+STP+
Sbjct:   320 ILKFSQLFAPKFPDKPTRKKSRKTRFSTPQ 349

 Score = 45 (20.9 bits), Expect = 0.00018, Sum P(5) = 0.00018
 Identities = 20/78 (25%), Positives = 30/78 (38%)

Query:   676 RPLLLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGDIKAGC 735
             RP +L  + +   +  Y ++   G  +G L  S NN L N  T          G I  G 
Sbjct:   892 RPRILFPSNVPFRIIIYNKE---GSLSGDLSSSSNNLLLNSSTNSIMGGHNMRGSINGGM 948

Query:   736 SQSSLETNMYRAPVFTHK 753
               SS  ++   +    HK
Sbjct:   949 MSSSSSSSSSSSKKSLHK 966

 Score = 44 (20.5 bits), Expect = 5.5e-60, Sum P(4) = 5.5e-60
 Identities = 25/123 (20%), Positives = 48/123 (39%)

Query:   354 EQRKDSCICSEPMKEESNVNPS--VGWKSMPSPNFFPLDQHDWEENIVWDNSPAASDNSI 411
             EQ++      E  K+   V     +  K  PS + F  D+ + ++      S  + D   
Sbjct:   131 EQQQRQLSKKERRKQRKQVGSKQPLQIKKKPSVDDFDFDEEEEQQQ---QKSTRSEDE-- 185

Query:   412 ESHEIAGADVESALMRGIELDTGQNNFHERSTSPNEKDCIVNMQNSPVLSETFGSKSSSD 471
             +  +I+ A   S+        + ++    RS S  E D + +   S     +  S+SS  
Sbjct:   186 DDDDISSASSLSSSSASSSASSSRSPSMSRSASDIESDSMSDSSRSS--GSSISSRSSIS 243

Query:   472 NTS 474
             ++S
Sbjct:   244 SSS 246

 Score = 38 (18.4 bits), Expect = 4.3e-69, Sum P(5) = 4.3e-69
 Identities = 6/26 (23%), Positives = 18/26 (69%)

Query:  1226 KDPKEEESLIAKRNLSGKVQAMKKNN 1251
             ++ ++EE+   +R +  +++ +KKN+
Sbjct:  1575 QNEEDEETKKQRRRIQERLRRLKKND 1600

 Score = 38 (18.4 bits), Expect = 4.3e-69, Sum P(5) = 4.3e-69
 Identities = 7/43 (16%), Positives = 19/43 (44%)

Query:  1211 ITVQPNGSHTANEQIKDPKEEESLIAKRNLSGKVQAMKKNNVQ 1253
             ++   +GS+     +     E   ++   L G  ++ ++N V+
Sbjct:  2065 LSASSSGSNRKKRTLDQSNSESPSLSSTTLDGSDKSSRRNRVR 2107


>MGI|MGI:1336878
            symbol:Taf1 "TAF1 RNA polymerase II, TATA box binding
            protein (TBP)-associated factor" species:10090 "Mus musculus"
            [GO:0000080 "G1 phase of mitotic cell cycle" evidence=ISO]
            [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0002039 "p53
            binding" evidence=ISO] [GO:0003677 "DNA binding" evidence=IDA]
            [GO:0003713 "transcription coactivator activity" evidence=ISO]
            [GO:0004402 "histone acetyltransferase activity" evidence=ISO;IDA]
            [GO:0004674 "protein serine/threonine kinase activity"
            evidence=ISO] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005524 "ATP binding" evidence=IEA] [GO:0005634 "nucleus"
            evidence=ISO;IDA] [GO:0005669 "transcription factor TFIID complex"
            evidence=ISO] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0006352 "DNA-dependent transcription, initiation"
            evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0008134 "transcription factor binding" evidence=ISO]
            [GO:0016301 "kinase activity" evidence=IEA] [GO:0016310
            "phosphorylation" evidence=IEA] [GO:0016573 "histone acetylation"
            evidence=ISO;IDA] [GO:0016740 "transferase activity" evidence=IEA]
            [GO:0017025 "TBP-class protein binding" evidence=ISO] [GO:0018105
            "peptidyl-serine phosphorylation" evidence=ISO] [GO:0018107
            "peptidyl-threonine phosphorylation" evidence=ISO] [GO:0032436
            "positive regulation of proteasomal ubiquitin-dependent protein
            catabolic process" evidence=ISO] [GO:0045120 "pronucleus"
            evidence=IDA] [GO:0045944 "positive regulation of transcription
            from RNA polymerase II promoter" evidence=ISO] [GO:0046777 "protein
            autophosphorylation" evidence=ISO] [GO:0070577 "histone
            acetyl-lysine binding" evidence=ISO] [GO:0071339 "MLL1 complex"
            evidence=ISO] InterPro:IPR001487 InterPro:IPR011177 Pfam:PF00439
            PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50011 PROSITE:PS50014
            SMART:SM00297 MGI:MGI:1336878 GO:GO:0005524 GO:GO:0006355
            GO:GO:0003677 GO:GO:0004674 GO:GO:0006352 PROSITE:PS50118
            GO:GO:0007049 Gene3D:1.20.920.10 SUPFAM:SSF47370 PROSITE:PS00633
            eggNOG:COG5076 GO:GO:0071339 InterPro:IPR018359 GO:GO:0045120
            GO:GO:0004402 GO:GO:0005669 EMBL:AL806534 InterPro:IPR022591
            Pfam:PF12157 Gene3D:1.10.1100.10 InterPro:IPR009067 Pfam:PF09247
            SUPFAM:SSF47055 HOVERGEN:HBG050223 GeneTree:ENSGT00390000012659
            HSSP:P21675 EMBL:AL831722 EMBL:BC047418 EMBL:BC094568 EMBL:AK045586
            EMBL:AK046668 EMBL:AK049826 EMBL:AK050691 EMBL:AK132088
            EMBL:AK143571 EMBL:AF081115 EMBL:AF081116 EMBL:AF081117
            EMBL:AF022178 IPI:IPI00330385 IPI:IPI00671543 IPI:IPI00831185
            UniGene:Mm.261750 ProteinModelPortal:Q80UV9 SMR:Q80UV9
            STRING:Q80UV9 PhosphoSite:Q80UV9 PRIDE:Q80UV9
            Ensembl:ENSMUST00000118878 Ensembl:ENSMUST00000149274
            UCSC:uc009txx.1 UCSC:uc009txy.1 Bgee:Q80UV9 Genevestigator:Q80UV9
            Uniprot:Q80UV9
        Length = 1891

 Score = 445 (161.7 bits), Expect = 1.6e-68, Sum P(5) = 1.6e-68
 Identities = 121/362 (33%), Positives = 181/362 (50%)

Query:   621 PNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAF--KKKSDLSVKDGHVFLMEYCEERPL 678
             P+S+  L++   H+  +A+    E ++      F  +   DL+ KDG + L EY EE   
Sbjct:   654 PHSVQPLLK---HIKKKAKMREQERQASGGGEMFFMRTPQDLTGKDGDLILAEYSEENGP 710

Query:   679 LLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGDIKAGCSQS 738
             L+   GM   +  YY K  PG   GA  C      G  +       SPFLG +  G    
Sbjct:   711 LMMQVGMATKIKNYY-KRKPGKDPGAPDCK----YGETVYCHT---SPFLGSLHPGQLLQ 762

Query:   739 SLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVVAQQEPLMEVMSPGSKNLQ 798
             + E N++RAP++ HK+  +DFL++R+ +G   IR +  + VV QQ PL EV  P SK   
Sbjct:   763 AFENNLFRAPIYLHKMPESDFLIIRTRQGYF-IRELVDIFVVGQQCPLFEVPGPNSKRAN 821

Query:   799 TYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIRKKLKECAFLRRDGNG 858
             T+  + + V +YR F  +  R     I ++++   FP+ SE+ IRK+LK CA  +R G  
Sbjct:   822 THIRDFLQVFIYRLFWKSKDRPRR--IRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMD 879

Query:   859 KQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLPASISSAMSQ 918
                W +K  F +P+E ++R +  PE  C+Y SM A   RLK  G  + +  A        
Sbjct:   880 SNWWVLKSDFRLPTEEEIRAMVSPEQCCAYYSMIAAEQRLKDAGYGEKSFFA-------- 931

Query:   919 LPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSY 978
              P+E         I+ E++  PWN +  F+A           LE+TGV DP+G G GFSY
Sbjct:   932 -PEEENEEDFQMKIDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSY 986

Query:   979 VR 980
             V+
Sbjct:   987 VK 988

 Score = 235 (87.8 bits), Expect = 1.6e-68, Sum P(5) = 1.6e-68
 Identities = 65/183 (35%), Positives = 95/183 (51%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             I+ E++  PWN +  F+A           LE+TGV DP+G G GFSYV+           
Sbjct:   944 IDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVKIPNKPTQQKDD 999

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                         TVTGTDADLRRLS++ A+++L KF VPEE I K +RW  I ++R +S+
Sbjct:  1000 KEPQPVKK----TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMST 1055

Query:  1052 EQAASG---------------VQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDL 1096
             EQA SG                + Q++ + +CQ I+D Q + LS+ +     +DS  ++ 
Sbjct:  1056 EQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSSTEVLSTDTDSSSAED 1115

Query:  1097 DSF 1099
               F
Sbjct:  1116 SDF 1118

 Score = 103 (41.3 bits), Expect = 1.6e-68, Sum P(5) = 1.6e-68
 Identities = 39/123 (31%), Positives = 58/123 (47%)

Query:     5 DSGSASKXXXXXXXXXXXXXVSGGN-RLLGFMFGNVDYAGDLDVD-YLDEDAKEHLAAV- 61
             D G  S                GG   L GF+FGN++ AG L+ +  LD++ K+HLA + 
Sbjct:    11 DKGGGSPSVVMSDTDSDEESAGGGPFSLTGFLFGNINGAGQLEGESVLDDECKKHLAGLG 70

Query:    62 ADKLGPSLTDIDVWNSQSCFQLFCSYCLLSVNSPQPPVDAVEQDYDKKAEDAVDYEDIDE 121
             A  LG  +T++   N +           LS  S    V+  ++ + +  EDAVDY DI+E
Sbjct:    71 ALGLGSLITELTA-NEE-----------LS-GSDGALVN--DEGWIRSREDAVDYSDINE 115

Query:   122 QYE 124
               E
Sbjct:   116 VAE 118

 Score = 96 (38.9 bits), Expect = 1.6e-68, Sum P(5) = 1.6e-68
 Identities = 20/55 (36%), Positives = 31/55 (56%)

Query:   521 FSKHSLQNRDMMEGSWLDNIIWEPVDA---VGKPKLILDLQDEQMLFEILDNKDD 572
             +S   + N D++ G W DNIIW+  +    +  P L LD  DE ++ EI D K++
Sbjct:   486 YSIFPIDNEDLVYGRWEDNIIWDAQNMPRILEPPVLTLDPNDENLILEIPDEKEE 540

 Score = 66 (28.3 bits), Expect = 1.6e-68, Sum P(5) = 1.6e-68
 Identities = 17/64 (26%), Positives = 30/64 (46%)

Query:   341 GVEQDAATISEHDEQRK---DSCICSEPMKEESNVNPSVGWKSMPSPNFFPLDQHDWEEN 397
             GV +D +      + +K   +S I    MK+   +  + G   +   NF  + Q  WE++
Sbjct:   375 GVPEDGSGFDYGFKMKKTEHESTIKCNIMKKLRKLEENSGVDLLADENFLMVTQLHWEDD 434

Query:   398 IVWD 401
             I+WD
Sbjct:   435 IIWD 438

 Score = 61 (26.5 bits), Expect = 7.2e-63, Sum P(4) = 7.2e-63
 Identities = 13/30 (43%), Positives = 17/30 (56%)

Query:   382 PSPNFFPLDQHD-----WEENIVWD--NSP 404
             P  + FP+D  D     WE+NI+WD  N P
Sbjct:   484 PWYSIFPIDNEDLVYGRWEDNIIWDAQNMP 513

 Score = 50 (22.7 bits), Expect = 3.0e-07, Sum P(4) = 3.0e-07
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query:  1220 TANEQIKDPKEEESLIAKRNLSGKVQAMKKNNVQ 1253
             T+N   K+ K+E SL   R L GK   +K+   Q
Sbjct:   542 TSNSPSKENKKESSLKKSRILLGKTGVIKEEPQQ 575

 Score = 42 (19.8 bits), Expect = 6.8e-61, Sum P(4) = 6.8e-61
 Identities = 15/52 (28%), Positives = 25/52 (48%)

Query:   500 HANGIKENVSIKLCQSDAVKRFSKHSLQNRDMM-EGSWLDNIIWEPVDAVGK 550
             H + IK N+  KL + +        + +N  M+ +  W D+IIW+  D   K
Sbjct:   394 HESTIKCNIMKKLRKLEENSGVDLLADENFLMVTQLHWEDDIIWDGEDVKHK 445

 Score = 37 (18.1 bits), Expect = 1.9e-18, Sum P(3) = 1.9e-18
 Identities = 15/49 (30%), Positives = 24/49 (48%)

Query:   822 LPCIGV--DELSVQFPNLSEAIIR-KKLKECAFLRRDGNGKQVWSMKRT 867
             LP  G+   + +   P+++E     +  K   FLR  G GK V S+ R+
Sbjct:   230 LPLAGIMQHDATKLLPSVTELFPEFRPGKVLRFLRLFGPGKNVPSVWRS 278


>RGD|1562050
            symbol:Taf1 "TAF1 RNA polymerase II, TATA box binding protein
            (TBP)-associated factor" species:10116 "Rattus norvegicus"
            [GO:0000080 "G1 phase of mitotic cell cycle" evidence=ISO]
            [GO:0002039 "p53 binding" evidence=ISO] [GO:0003674
            "molecular_function" evidence=ND] [GO:0003677 "DNA binding"
            evidence=IEA;ISO] [GO:0003713 "transcription coactivator activity"
            evidence=ISO] [GO:0004402 "histone acetyltransferase activity"
            evidence=ISO] [GO:0004674 "protein serine/threonine kinase
            activity" evidence=ISO] [GO:0005634 "nucleus" evidence=ISO;IDA]
            [GO:0005669 "transcription factor TFIID complex" evidence=IEA;ISO]
            [GO:0006352 "DNA-dependent transcription, initiation" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=IEA] [GO:0008134 "transcription factor binding"
            evidence=ISO] [GO:0016573 "histone acetylation" evidence=ISO]
            [GO:0017025 "TBP-class protein binding" evidence=ISO] [GO:0018105
            "peptidyl-serine phosphorylation" evidence=ISO] [GO:0018107
            "peptidyl-threonine phosphorylation" evidence=ISO] [GO:0032436
            "positive regulation of proteasomal ubiquitin-dependent protein
            catabolic process" evidence=ISO] [GO:0042493 "response to drug"
            evidence=IEP] [GO:0045120 "pronucleus" evidence=ISO] [GO:0045944
            "positive regulation of transcription from RNA polymerase II
            promoter" evidence=ISO] [GO:0046777 "protein autophosphorylation"
            evidence=ISO] [GO:0070577 "histone acetyl-lysine binding"
            evidence=ISO] [GO:0071339 "MLL1 complex" evidence=ISO] [GO:0005730
            "nucleolus" evidence=ISO] InterPro:IPR001487 InterPro:IPR011177
            Pfam:PF00439 PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50014
            SMART:SM00297 RGD:1562050 GO:GO:0005634 GO:GO:0006355 GO:GO:0042493
            GO:GO:0003677 GO:GO:0006352 Gene3D:1.20.920.10 SUPFAM:SSF47370
            PROSITE:PS00633 InterPro:IPR018359 GO:GO:0045120 GO:GO:0005669
            InterPro:IPR022591 Pfam:PF12157 Gene3D:1.10.1100.10
            InterPro:IPR009067 Pfam:PF09247 SUPFAM:SSF47055 OrthoDB:EOG4K3KVC
            IPI:IPI00951101 Ensembl:ENSRNOT00000066825 UCSC:RGD:1562050
            ArrayExpress:D3ZM43 Uniprot:D3ZM43
        Length = 1902

 Score = 444 (161.4 bits), Expect = 4.7e-68, Sum P(5) = 4.7e-68
 Identities = 121/362 (33%), Positives = 181/362 (50%)

Query:   621 PNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAF--KKKSDLSVKDGHVFLMEYCEERPL 678
             P+S+  L++   H+  +A+    E ++      F  +   DL+ KDG + L EY EE   
Sbjct:   660 PHSVQPLLK---HIKKKAKMREQERQASGGGEMFFMRTPQDLTGKDGDLILAEYSEENGP 716

Query:   679 LLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGDIKAGCSQS 738
             L+   GM   +  YY K  PG   GA  C      G  +       SPFLG +  G    
Sbjct:   717 LMMQVGMATKIKNYY-KRKPGKDPGAPDCK----YGETVYCHT---SPFLGSLHPGQLLQ 768

Query:   739 SLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVVAQQEPLMEVMSPGSKNLQ 798
             + E N++RAP++ HK+  +DFL++R+ +G   IR +  + VV QQ PL EV  P SK   
Sbjct:   769 AFENNLFRAPIYLHKMPESDFLIIRTRQGYY-IRELVDIFVVGQQCPLFEVPGPNSKRAN 827

Query:   799 TYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIRKKLKECAFLRRDGNG 858
             T+  + + V +YR F  +  R     I ++++   FP+ SE+ IRK+LK CA  +R G  
Sbjct:   828 THIRDFLQVFIYRLFWKSKDRPRR--IRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMD 885

Query:   859 KQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLPASISSAMSQ 918
                W +K  F +P+E ++R +  PE  C+Y SM A   RLK  G  + +  A        
Sbjct:   886 SNWWVLKSDFRLPTEEEIRAMVSPEQCCAYYSMIAAEQRLKDAGYGEKSFFA-------- 937

Query:   919 LPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSY 978
              P+E         I+ E++  PWN +  F+A           LE+TGV DP+G G GFSY
Sbjct:   938 -PEEENEEDFQMKIDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSY 992

Query:   979 VR 980
             V+
Sbjct:   993 VK 994

 Score = 235 (87.8 bits), Expect = 4.7e-68, Sum P(5) = 4.7e-68
 Identities = 65/183 (35%), Positives = 95/183 (51%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             I+ E++  PWN +  F+A           LE+TGV DP+G G GFSYV+           
Sbjct:   950 IDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVKIPNKPTQQKDD 1005

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                         TVTGTDADLRRLS++ A+++L KF VPEE I K +RW  I ++R +S+
Sbjct:  1006 KEPQPVKK----TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMST 1061

Query:  1052 EQAASG---------------VQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDL 1096
             EQA SG                + Q++ + +CQ I+D Q + LS+ +     +DS  ++ 
Sbjct:  1062 EQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSSTEVLSTDTDSSSAED 1121

Query:  1097 DSF 1099
               F
Sbjct:  1122 SDF 1124

 Score = 101 (40.6 bits), Expect = 4.7e-68, Sum P(5) = 4.7e-68
 Identities = 38/123 (30%), Positives = 59/123 (47%)

Query:     5 DSGSASKXXXXXXXXXXXXXVSGGN-RLLGFMFGNVDYAGDLDVD-YLDEDAKEHLAAV- 61
             + GS S                GG   L GF+FGN++ AG L+ +  LD++ K+HLA + 
Sbjct:    11 EDGSGSPSAVMSDTDSDEDSAGGGPFSLTGFLFGNINGAGQLEGESVLDDECKKHLAGLG 70

Query:    62 ADKLGPSLTDIDVWNSQSCFQLFCSYCLLSVNSPQPPVDAVEQDYDKKAEDAVDYEDIDE 121
             A  LG  +T++   N +           LS  +    V+  ++ + +  EDAVDY DI+E
Sbjct:    71 ALGLGSLITELTA-NEE-----------LS-GTDGTLVN--DEGWIRSREDAVDYSDINE 115

Query:   122 QYE 124
               E
Sbjct:   116 VAE 118

 Score = 96 (38.9 bits), Expect = 4.7e-68, Sum P(5) = 4.7e-68
 Identities = 20/55 (36%), Positives = 31/55 (56%)

Query:   521 FSKHSLQNRDMMEGSWLDNIIWEPVDA---VGKPKLILDLQDEQMLFEILDNKDD 572
             +S   + N D++ G W DNIIW+  +    +  P L LD  DE ++ EI D K++
Sbjct:   492 YSIFPIDNEDLVYGRWEDNIIWDAQNMPRILEPPVLTLDPNDENLILEIPDEKEE 546

 Score = 80 (33.2 bits), Expect = 1.6e-64, Sum P(4) = 1.6e-64
 Identities = 21/72 (29%), Positives = 32/72 (44%)

Query:   340 FGVEQDAATISEHDEQRKDSCICSEPMKEESNVNPSVGWKSMPSPNFFPLDQHD-----W 394
             + V+Q+   I E  +Q K +      +   +   PS      P  + FP+D  D     W
Sbjct:   448 YNVQQERPYIKEPTKQEKKNTREQISLNHNTGAMPSTLDDDKPWYSIFPIDNEDLVYGRW 507

Query:   395 EENIVWD--NSP 404
             E+NI+WD  N P
Sbjct:   508 EDNIIWDAQNMP 519

 Score = 65 (27.9 bits), Expect = 4.7e-68, Sum P(5) = 4.7e-68
 Identities = 17/64 (26%), Positives = 30/64 (46%)

Query:   341 GVEQDAATISEHDEQRK---DSCICSEPMKEESNVNPSVGWKSMPSPNFFPLDQHDWEEN 397
             GV +D +      + +K   DS +    MK+   +  + G   +   NF  + Q  WE++
Sbjct:   354 GVPEDGSGFDYGFKMKKTEHDSTMKYNIMKKLRKLEENNGVDLLADENFLMVTQLHWEDD 413

Query:   398 IVWD 401
             I+WD
Sbjct:   414 IIWD 417

 Score = 50 (22.7 bits), Expect = 6.1e-07, Sum P(4) = 6.1e-07
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query:  1220 TANEQIKDPKEEESLIAKRNLSGKVQAMKKNNVQ 1253
             T+N   K+ K+E SL   R L GK   +K+   Q
Sbjct:   548 TSNSPSKENKKESSLKKSRILLGKTGVIKEEPQQ 581

 Score = 42 (19.8 bits), Expect = 9.4e-19, Sum P(3) = 9.4e-19
 Identities = 21/75 (28%), Positives = 33/75 (44%)

Query:   822 LPCIGV--DELSVQFPNLSEAIIR-KKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRK 878
             LP  G+   + +   P+++E     +  K   FLR  G GK V S+ R+     +   R+
Sbjct:   209 LPLAGIMQHDATKLLPSVTELFPEFRPGKVLRFLRLFGPGKNVPSVWRSARRKRKKKHRE 268

Query:   879 LAYPEHV----CSYE 889
             L   E +    CS E
Sbjct:   269 LIQEEQIQEEECSVE 283

 Score = 40 (19.1 bits), Expect = 2.4e-60, Sum P(4) = 2.4e-60
 Identities = 14/52 (26%), Positives = 25/52 (48%)

Query:   500 HANGIKENVSIKLCQSDAVKRFSKHSLQNRDMM-EGSWLDNIIWEPVDAVGK 550
             H + +K N+  KL + +        + +N  M+ +  W D+IIW+  D   K
Sbjct:   373 HDSTMKYNIMKKLRKLEENNGVDLLADENFLMVTQLHWEDDIIWDGEDVKHK 424


>UNIPROTKB|Q60544
            symbol:TAF1 "Transcription initiation factor TFIID subunit
            1" species:10036 "Mesocricetus auratus" [GO:0000080 "G1 phase of
            mitotic cell cycle" evidence=IMP] [GO:0004402 "histone
            acetyltransferase activity" evidence=ISS] [GO:0004674 "protein
            serine/threonine kinase activity" evidence=ISS] [GO:0016573
            "histone acetylation" evidence=ISS] [GO:0045944 "positive
            regulation of transcription from RNA polymerase II promoter"
            evidence=IMP] [GO:0071339 "MLL1 complex" evidence=ISS]
            InterPro:IPR001487 InterPro:IPR011177 Pfam:PF00439
            PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50011 PROSITE:PS50014
            SMART:SM00297 GO:GO:0005524 GO:GO:0003677 GO:GO:0004674
            GO:GO:0045944 GO:GO:0006352 PROSITE:PS50118 Gene3D:1.20.920.10
            SUPFAM:SSF47370 PROSITE:PS00633 GO:GO:0071339 InterPro:IPR018359
            GO:GO:0000080 GO:GO:0004402 GO:GO:0005669 InterPro:IPR022591
            Pfam:PF12157 Gene3D:1.10.1100.10 InterPro:IPR009067 Pfam:PF09247
            SUPFAM:SSF47055 HOVERGEN:HBG050223 EMBL:D26114 PIR:I48155
            HSSP:P21675 ProteinModelPortal:Q60544 SMR:Q60544 PRIDE:Q60544
            Uniprot:Q60544
        Length = 1865

 Score = 448 (162.8 bits), Expect = 6.0e-68, Sum P(5) = 6.0e-68
 Identities = 122/362 (33%), Positives = 181/362 (50%)

Query:   621 PNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAF--KKKSDLSVKDGHVFLMEYCEERPL 678
             P+S+  L++   H+  +A+    E ++      F  +   DL+ KDG + L EY EE   
Sbjct:   628 PHSVQPLLK---HIKKKAKMREQERQASGGGEMFFMRTPQDLTGKDGDLILAEYSEENGP 684

Query:   679 LLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGDIKAGCSQS 738
             L+   GM   +  YY K  PG   GA  C      G  +       SPFLG +  G    
Sbjct:   685 LMMQVGMATKIKNYY-KRKPGKDPGAPDCK----YGETVYCHT---SPFLGSLHPGQLLQ 736

Query:   739 SLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVVAQQEPLMEVMSPGSKNLQ 798
             + E N++RAP++ HK+  TDFL++R+ +G   IR +  + VV QQ PL EV  P SK   
Sbjct:   737 AFENNLFRAPIYLHKMPETDFLIIRTRQGYY-IRELVDIFVVGQQCPLFEVPGPNSKRAN 795

Query:   799 TYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIRKKLKECAFLRRDGNG 858
             T+  + + V +YR F  +  R     I ++++   FP+ SE+ IRK+LK CA  +R G  
Sbjct:   796 THIRDFLQVFIYRLFWKSKDRPRR--IRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMD 853

Query:   859 KQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLPASISSAMSQ 918
                W +K  F +P+E ++R +  PE  C+Y SM A   RLK  G  + +  A        
Sbjct:   854 SNWWVLKSDFRLPTEEEIRAMVSPEQCCAYYSMIAAEQRLKDAGYGEKSFFA-------- 905

Query:   919 LPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSY 978
              P+E         I+ E++  PWN +  F+A           LE+TGV DP+G G GFSY
Sbjct:   906 -PEEENEEDFQMKIDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSY 960

Query:   979 VR 980
             V+
Sbjct:   961 VK 962

 Score = 235 (87.8 bits), Expect = 6.0e-68, Sum P(5) = 6.0e-68
 Identities = 65/183 (35%), Positives = 95/183 (51%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             I+ E++  PWN +  F+A           LE+TGV DP+G G GFSYV+           
Sbjct:   918 IDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVKIPNKPTQQKDD 973

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                         TVTGTDADLRRLS++ A+++L KF VPEE I K +RW  I ++R +S+
Sbjct:   974 KEPQPVKK----TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMST 1029

Query:  1052 EQAASG---------------VQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDL 1096
             EQA SG                + Q++ + +CQ I+D Q + LS+ +     +DS  ++ 
Sbjct:  1030 EQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSSTEVLSTDTDSSSAED 1089

Query:  1097 DSF 1099
               F
Sbjct:  1090 SDF 1092

 Score = 98 (39.6 bits), Expect = 6.0e-68, Sum P(5) = 6.0e-68
 Identities = 39/120 (32%), Positives = 59/120 (49%)

Query:     9 ASKXXXXXXXXXXXXXVSGGN--RLLGFMFGNVDYAGDLDVD-YLDEDAKEHLAAV-ADK 64
             AS+              SGG    L GF+FGN++ AG L+ +  LD++ K+HLA + A  
Sbjct:     8 ASRSAAIMSDTDSDEDSSGGGPFSLTGFLFGNINGAGQLEGESVLDDECKKHLAGLGALG 67

Query:    65 LGPSLTDIDVWNSQSCFQLFCSYCLLSVNSPQPPVDAVEQDYDKKAEDAVDYEDIDEQYE 124
             LG  +T++   N +    L  +   L VN         ++ + +  EDAVDY DI+E  E
Sbjct:    68 LGSLITELTA-NEE----LAGTDGAL-VN---------DEGWIRSREDAVDYSDINEVAE 112

 Score = 95 (38.5 bits), Expect = 6.0e-68, Sum P(5) = 6.0e-68
 Identities = 21/56 (37%), Positives = 32/56 (57%)

Query:   521 FSKHSLQNRDMMEGSWLDNIIWEPVDAVGK----PKLILDLQDEQMLFEILDNKDD 572
             +S   + N D++ G W DNIIW+   A+ +    P L LD  DE ++ EI D K++
Sbjct:   460 YSIFPIDNEDLVYGRWEDNIIWD-AQAMPRILEPPVLTLDPNDENLILEIPDEKEE 514

 Score = 63 (27.2 bits), Expect = 6.0e-68, Sum P(5) = 6.0e-68
 Identities = 17/64 (26%), Positives = 29/64 (45%)

Query:   341 GVEQDAATISEHDEQRK---DSCICSEPMKEESNVNPSVGWKSMPSPNFFPLDQHDWEEN 397
             GV +D +      + RK   +  I  + M +   +  S G   +   NF  + Q  WE++
Sbjct:   349 GVPEDGSGFDYGFKMRKTEHEPAIKCKMMTKLRKLEESNGIDLLADENFLMVTQLHWEDD 408

Query:   398 IVWD 401
             I+WD
Sbjct:   409 IIWD 412

 Score = 60 (26.2 bits), Expect = 1.3e-62, Sum P(4) = 1.3e-62
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query:   382 PSPNFFPLDQHD-----WEENIVWD 401
             P  + FP+D  D     WE+NI+WD
Sbjct:   458 PWYSIFPIDNEDLVYGRWEDNIIWD 482

 Score = 50 (22.7 bits), Expect = 2.2e-06, Sum P(4) = 2.2e-06
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query:  1220 TANEQIKDPKEEESLIAKRNLSGKVQAMKKNNVQ 1253
             T+N   K+ K+E SL   R L GK   +K+   Q
Sbjct:   516 TSNSPSKENKKESSLKKSRILLGKTGVIKEEPQQ 549

 Score = 39 (18.8 bits), Expect = 2.0e-60, Sum P(4) = 2.0e-60
 Identities = 7/15 (46%), Positives = 9/15 (60%)

Query:   536 WLDNIIWEPVDAVGK 550
             W D+IIW+  D   K
Sbjct:   405 WEDDIIWDGEDVKHK 419

 Score = 38 (18.4 bits), Expect = 4.6e-18, Sum P(3) = 4.6e-18
 Identities = 18/58 (31%), Positives = 28/58 (48%)

Query:   815 AAAKRG--LLPCIGV--DELSVQFPNLSEAIIR-KKLKECAFLRRDGNGKQVWSMKRT 867
             A +K G   LP  G+   + +   P+++E     +  K   FLR  G GK V S+ R+
Sbjct:   195 AESKDGKLTLPLAGIMQHDATKLLPSVTELFPEFRPGKVLRFLRLFGPGKNVPSVWRS 252

 Score = 37 (18.1 bits), Expect = 1.3e-61, Sum P(5) = 1.3e-61
 Identities = 15/48 (31%), Positives = 24/48 (50%)

Query:   105 DYDKKAEDAVDYEDIDEQYEGPE------IQIASEEDYLLPKKEYFAA 146
             DYD+   DA D EDID +   P       ++   ++D L  +K  F++
Sbjct:   133 DYDEDDYDA-DCEDIDCKLMPPPPPPPGPVKKEKDQDGLTGEKVDFSS 179


>UNIPROTKB|P21675
            symbol:TAF1 "Transcription initiation factor TFIID subunit
            1" species:9606 "Homo sapiens" [GO:0005524 "ATP binding"
            evidence=IEA] [GO:0019048 "virus-host interaction" evidence=IEA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0046777 "protein
            autophosphorylation" evidence=IDA;TAS] [GO:0004674 "protein
            serine/threonine kinase activity" evidence=IDA] [GO:0004402
            "histone acetyltransferase activity" evidence=IDA] [GO:0071339
            "MLL1 complex" evidence=IDA] [GO:0005669 "transcription factor
            TFIID complex" evidence=IDA] [GO:0005654 "nucleoplasm"
            evidence=TAS] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=TAS] [GO:0006367 "transcription initiation from
            RNA polymerase II promoter" evidence=TAS] [GO:0006368
            "transcription elongation from RNA polymerase II promoter"
            evidence=TAS] [GO:0010467 "gene expression" evidence=TAS]
            [GO:0016032 "viral reproduction" evidence=TAS] [GO:0044212
            "transcription regulatory region DNA binding" evidence=ISS]
            [GO:0006352 "DNA-dependent transcription, initiation" evidence=ISS]
            [GO:0000117 "regulation of transcription involved in G2/M-phase of
            mitotic cell cycle" evidence=ISS] [GO:0043565 "sequence-specific
            DNA binding" evidence=ISS] [GO:0060261 "positive regulation of
            transcription initiation from RNA polymerase II promoter"
            evidence=ISS] [GO:0002039 "p53 binding" evidence=IPI] [GO:0018107
            "peptidyl-threonine phosphorylation" evidence=IDA] [GO:0032436
            "positive regulation of proteasomal ubiquitin-dependent protein
            catabolic process" evidence=IDA] [GO:0006974 "response to DNA
            damage stimulus" evidence=IC] [GO:0070577 "histone acetyl-lysine
            binding" evidence=IDA] [GO:0003713 "transcription coactivator
            activity" evidence=IDA] [GO:0045944 "positive regulation of
            transcription from RNA polymerase II promoter" evidence=IGI;IDA]
            [GO:0051123 "RNA polymerase II transcriptional preinitiation
            complex assembly" evidence=ISS] [GO:0008134 "transcription factor
            binding" evidence=IPI] [GO:0018105 "peptidyl-serine
            phosphorylation" evidence=IDA] [GO:0017025 "TBP-class protein
            binding" evidence=IPI] [GO:0000080 "G1 phase of mitotic cell cycle"
            evidence=IGI] [GO:0016573 "histone acetylation" evidence=IDA]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0005730 "nucleolus"
            evidence=IDA] Reactome:REACT_71 InterPro:IPR001487
            InterPro:IPR011177 Pfam:PF00439 PIRSF:PIRSF003047 PRINTS:PR00503
            PROSITE:PS50011 PROSITE:PS50014 SMART:SM00297 GO:GO:0005524
            Reactome:REACT_116125 GO:GO:0019048 GO:GO:0016032 GO:GO:0004674
            GO:GO:0043565 GO:GO:0045944 GO:GO:0046777 GO:GO:0018105
            GO:GO:0006974 PROSITE:PS50118 GO:GO:0003713 GO:GO:0018107
            GO:GO:0006368 GO:GO:0032436 PDB:3AAD PDBsum:3AAD Gene3D:1.20.920.10
            SUPFAM:SSF47370 PROSITE:PS00633 eggNOG:COG5076 GO:GO:0071339
            GO:GO:0070577 InterPro:IPR018359 GO:GO:0000080 GO:GO:0000117
            Reactome:REACT_1788 GO:GO:0004402 GO:GO:0005669 GO:GO:0060261
            GO:GO:0051123 KO:K03125 InterPro:IPR022591 Pfam:PF12157
            Gene3D:1.10.1100.10 InterPro:IPR009067 Pfam:PF09247 SUPFAM:SSF47055
            HOVERGEN:HBG050223 OrthoDB:EOG4K3KVC CTD:6872 EMBL:D90359
            EMBL:X07024 EMBL:AY623109 EMBL:AB209316 EMBL:AJ549247 EMBL:AJ549248
            EMBL:AJ549249 EMBL:AJ549250 EMBL:AJ555148 EMBL:AJ555149
            EMBL:AM711894 IPI:IPI00009891 IPI:IPI00645793 IPI:IPI00828032
            IPI:IPI00939191 PIR:A40262 RefSeq:NP_004597.2 RefSeq:NP_620278.1
            UniGene:Hs.158560 PDB:1EQF PDB:3UV4 PDB:3UV5 PDBsum:1EQF
            PDBsum:3UV4 PDBsum:3UV5 ProteinModelPortal:P21675 SMR:P21675
            DIP:DIP-147N DIP:DIP-24198N IntAct:P21675 MINT:MINT-1211825
            STRING:P21675 PhosphoSite:P21675 DMDM:115942 PaxDb:P21675
            PRIDE:P21675 Ensembl:ENST00000276072 Ensembl:ENST00000373790
            Ensembl:ENST00000449580 GeneID:6872 KEGG:hsa:6872 UCSC:uc004dzt.4
            UCSC:uc004dzu.4 UCSC:uc004dzv.4 GeneCards:GC0XP070586
            HGNC:HGNC:11535 HPA:CAB016283 HPA:HPA001075 MIM:313650 MIM:314250
            neXtProt:NX_P21675 Orphanet:53351 PharmGKB:PA36310 OMA:DEFYYPK
            EvolutionaryTrace:P21675 GenomeRNAi:6872 NextBio:26827
            ArrayExpress:P21675 Bgee:P21675 Genevestigator:P21675
            GermOnline:ENSG00000147133 Uniprot:P21675
        Length = 1872

 Score = 448 (162.8 bits), Expect = 1.0e-67, Sum P(5) = 1.0e-67
 Identities = 122/362 (33%), Positives = 181/362 (50%)

Query:   621 PNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAF--KKKSDLSVKDGHVFLMEYCEERPL 678
             P+S+  L++   H+  +A+    E ++      F  +   DL+ KDG + L EY EE   
Sbjct:   633 PHSVQPLLK---HIKKKAKMREQERQASGGGEMFFMRTPQDLTGKDGDLILAEYSEENGP 689

Query:   679 LLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGDIKAGCSQS 738
             L+   GM   +  YY K  PG   GA  C      G  +       SPFLG +  G    
Sbjct:   690 LMMQVGMATKIKNYY-KRKPGKDPGAPDCK----YGETVYCHT---SPFLGSLHPGQLLQ 741

Query:   739 SLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVVAQQEPLMEVMSPGSKNLQ 798
             + E N++RAP++ HK+  TDFL++R+ +G   IR +  + VV QQ PL EV  P SK   
Sbjct:   742 AFENNLFRAPIYLHKMPETDFLIIRTRQGYY-IRELVDIFVVGQQCPLFEVPGPNSKRAN 800

Query:   799 TYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIRKKLKECAFLRRDGNG 858
             T+  + + V +YR F  +  R     I ++++   FP+ SE+ IRK+LK CA  +R G  
Sbjct:   801 THIRDFLQVFIYRLFWKSKDRPRR--IRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMD 858

Query:   859 KQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLPASISSAMSQ 918
                W +K  F +P+E ++R +  PE  C+Y SM A   RLK  G  + +  A        
Sbjct:   859 SNWWVLKSDFRLPTEEEIRAMVSPEQCCAYYSMIAAEQRLKDAGYGEKSFFA-------- 910

Query:   919 LPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSY 978
              P+E         I+ E++  PWN +  F+A           LE+TGV DP+G G GFSY
Sbjct:   911 -PEEENEEDFQMKIDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSY 965

Query:   979 VR 980
             V+
Sbjct:   966 VK 967

 Score = 235 (87.8 bits), Expect = 1.0e-67, Sum P(5) = 1.0e-67
 Identities = 65/183 (35%), Positives = 95/183 (51%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             I+ E++  PWN +  F+A           LE+TGV DP+G G GFSYV+           
Sbjct:   923 IDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVKIPNKPTQQKDD 978

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                         TVTGTDADLRRLS++ A+++L KF VPEE I K +RW  I ++R +S+
Sbjct:   979 KEPQPVKK----TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMST 1034

Query:  1052 EQAASG---------------VQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDL 1096
             EQA SG                + Q++ + +CQ I+D Q + LS+ +     +DS  ++ 
Sbjct:  1035 EQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSSTEVLSTDTDSSSAED 1094

Query:  1097 DSF 1099
               F
Sbjct:  1095 SDF 1097

 Score = 95 (38.5 bits), Expect = 1.0e-67, Sum P(5) = 1.0e-67
 Identities = 21/56 (37%), Positives = 32/56 (57%)

Query:   521 FSKHSLQNRDMMEGSWLDNIIWEPVDAVGK----PKLILDLQDEQMLFEILDNKDD 572
             +S   + N D++ G W DNIIW+   A+ +    P L LD  DE ++ EI D K++
Sbjct:   465 YSIFPIDNEDLVYGRWEDNIIWD-AQAMPRLLEPPVLTLDPNDENLILEIPDEKEE 519

 Score = 95 (38.5 bits), Expect = 1.0e-67, Sum P(5) = 1.0e-67
 Identities = 36/103 (34%), Positives = 56/103 (54%)

Query:    26 SGGN--RLLGFMFGNVDYAGDLDVD-YLDEDAKEHLAAV-ADKLGPSLTDIDVWNSQSCF 81
             +GG    L GF+FGN++ AG L+ +  LD++ K+HLA + A  LG  +T++   N +   
Sbjct:    31 AGGGPFSLAGFLFGNINGAGQLEGESVLDDECKKHLAGLGALGLGSLITELTA-NEE--- 86

Query:    82 QLFCSYCLLSVNSPQPPVDAVEQDYDKKAEDAVDYEDIDEQYE 124
              L  +   L VN         ++ + +  EDAVDY DI+E  E
Sbjct:    87 -LTGTDGAL-VN---------DEGWVRSTEDAVDYSDINEVAE 118

 Score = 64 (27.6 bits), Expect = 1.0e-67, Sum P(5) = 1.0e-67
 Identities = 17/64 (26%), Positives = 30/64 (46%)

Query:   341 GVEQDAATISEHDEQRK---DSCICSEPMKEESNVNPSVGWKSMPSPNFFPLDQHDWEEN 397
             GV +D +      + RK   +  I S  ++E   +  + G   +   NF  + Q  WE++
Sbjct:   354 GVPEDGSGFDYGFKLRKTEHEPVIKSRMIEEFRKLEENNGTDLLADENFLMVTQLHWEDD 413

Query:   398 IVWD 401
             I+WD
Sbjct:   414 IIWD 417

 Score = 60 (26.2 bits), Expect = 2.7e-62, Sum P(4) = 2.7e-62
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query:   382 PSPNFFPLDQHD-----WEENIVWD 401
             P  + FP+D  D     WE+NI+WD
Sbjct:   463 PWYSIFPIDNEDLVYGRWEDNIIWD 487

 Score = 51 (23.0 bits), Expect = 2.9e-06, Sum P(4) = 2.9e-06
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query:  1220 TANEQIKDPKEEESLIAKRNLSGKVQAMKKNNVQ 1253
             T+N   K+ K+E SL   R L GK   +K+   Q
Sbjct:   521 TSNSPSKESKKESSLKKSRILLGKTGVIKEEPQQ 554

 Score = 46 (21.3 bits), Expect = 1.4e-18, Sum P(3) = 1.4e-18
 Identities = 22/76 (28%), Positives = 34/76 (44%)

Query:   822 LPCIGV--DELSVQFPNLSEAIIR-KKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRK 878
             LP  G+   + +   P+++E     +  K   FLR  G GK V S+ R+     +   R+
Sbjct:   209 LPLAGIMQHDATKLLPSVTELFPEFRPGKVLRFLRLFGPGKNVPSVWRSARRKRKKKHRE 268

Query:   879 LAYPEHV----CSYES 890
             L   E +    CS ES
Sbjct:   269 LIQEEQIQEVECSVES 284

 Score = 39 (18.8 bits), Expect = 6.3e-62, Sum P(5) = 6.3e-62
 Identities = 13/37 (35%), Positives = 16/37 (43%)

Query:    38 NVDYAGDLDVDY-LDEDAKEHLAAVADKLGPSLTDID 73
             N DYA     +  L +D    +A V  K   S  DID
Sbjct:   293 NYDYAPPPPPEQCLSDDEITMMAPVESKFSQSTGDID 329

 Score = 39 (18.8 bits), Expect = 4.2e-60, Sum P(4) = 4.2e-60
 Identities = 7/15 (46%), Positives = 9/15 (60%)

Query:   536 WLDNIIWEPVDAVGK 550
             W D+IIW+  D   K
Sbjct:   410 WEDDIIWDGEDVKHK 424


>UNIPROTKB|B1Q2X3
            symbol:N-TAF1 "TAF1 RNA polymerase II, TATA box binding
            protein (TBP)-associated factor, neuron specific isoform"
            species:9606 "Homo sapiens" [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0005669 "transcription factor TFIID complex" evidence=IEA]
            [GO:0006352 "DNA-dependent transcription, initiation" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005730
            "nucleolus" evidence=IDA] InterPro:IPR001487 InterPro:IPR011177
            Pfam:PF00439 PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50014
            SMART:SM00297 GO:GO:0005634 GO:GO:0006355 GO:GO:0003677
            GO:GO:0006352 Gene3D:1.20.920.10 SUPFAM:SSF47370 PROSITE:PS00633
            InterPro:IPR018359 GO:GO:0005669 EMBL:AL590762 InterPro:IPR022591
            Pfam:PF12157 Gene3D:1.10.1100.10 InterPro:IPR009067 Pfam:PF09247
            SUPFAM:SSF47055 HOGENOM:HOG000020066 HOVERGEN:HBG050223
            UniGene:Hs.158560 HGNC:HGNC:11535 EMBL:AL590763 EMBL:AB300418
            IPI:IPI00149098 SMR:B1Q2X3 STRING:B1Q2X3 Ensembl:ENST00000423759
            Uniprot:B1Q2X3
        Length = 1895

 Score = 448 (162.8 bits), Expect = 1.1e-67, Sum P(5) = 1.1e-67
 Identities = 122/362 (33%), Positives = 181/362 (50%)

Query:   621 PNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAF--KKKSDLSVKDGHVFLMEYCEERPL 678
             P+S+  L++   H+  +A+    E ++      F  +   DL+ KDG + L EY EE   
Sbjct:   654 PHSVQPLLK---HIKKKAKMREQERQASGGGEMFFMRTPQDLTGKDGDLILAEYSEENGP 710

Query:   679 LLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGDIKAGCSQS 738
             L+   GM   +  YY K  PG   GA  C      G  +       SPFLG +  G    
Sbjct:   711 LMMQVGMATKIKNYY-KRKPGKDPGAPDCK----YGETVYCHT---SPFLGSLHPGQLLQ 762

Query:   739 SLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVVAQQEPLMEVMSPGSKNLQ 798
             + E N++RAP++ HK+  TDFL++R+ +G   IR +  + VV QQ PL EV  P SK   
Sbjct:   763 AFENNLFRAPIYLHKMPETDFLIIRTRQGYY-IRELVDIFVVGQQCPLFEVPGPNSKRAN 821

Query:   799 TYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIRKKLKECAFLRRDGNG 858
             T+  + + V +YR F  +  R     I ++++   FP+ SE+ IRK+LK CA  +R G  
Sbjct:   822 THIRDFLQVFIYRLFWKSKDRPRR--IRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMD 879

Query:   859 KQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLPASISSAMSQ 918
                W +K  F +P+E ++R +  PE  C+Y SM A   RLK  G  + +  A        
Sbjct:   880 SNWWVLKSDFRLPTEEEIRAMVSPEQCCAYYSMIAAEQRLKDAGYGEKSFFA-------- 931

Query:   919 LPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSY 978
              P+E         I+ E++  PWN +  F+A           LE+TGV DP+G G GFSY
Sbjct:   932 -PEEENEEDFQMKIDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSY 986

Query:   979 VR 980
             V+
Sbjct:   987 VK 988

 Score = 235 (87.8 bits), Expect = 1.1e-67, Sum P(5) = 1.1e-67
 Identities = 65/183 (35%), Positives = 95/183 (51%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             I+ E++  PWN +  F+A           LE+TGV DP+G G GFSYV+           
Sbjct:   944 IDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVKIPNKPTQQKDD 999

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                         TVTGTDADLRRLS++ A+++L KF VPEE I K +RW  I ++R +S+
Sbjct:  1000 KEPQPVKK----TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMST 1055

Query:  1052 EQAASG---------------VQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDL 1096
             EQA SG                + Q++ + +CQ I+D Q + LS+ +     +DS  ++ 
Sbjct:  1056 EQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSSTEVLSTDTDSSSAED 1115

Query:  1097 DSF 1099
               F
Sbjct:  1116 SDF 1118

 Score = 95 (38.5 bits), Expect = 1.1e-67, Sum P(5) = 1.1e-67
 Identities = 21/56 (37%), Positives = 32/56 (57%)

Query:   521 FSKHSLQNRDMMEGSWLDNIIWEPVDAVGK----PKLILDLQDEQMLFEILDNKDD 572
             +S   + N D++ G W DNIIW+   A+ +    P L LD  DE ++ EI D K++
Sbjct:   486 YSIFPIDNEDLVYGRWEDNIIWD-AQAMPRLLEPPVLTLDPNDENLILEIPDEKEE 540

 Score = 95 (38.5 bits), Expect = 1.1e-67, Sum P(5) = 1.1e-67
 Identities = 36/103 (34%), Positives = 56/103 (54%)

Query:    26 SGGN--RLLGFMFGNVDYAGDLDVD-YLDEDAKEHLAAV-ADKLGPSLTDIDVWNSQSCF 81
             +GG    L GF+FGN++ AG L+ +  LD++ K+HLA + A  LG  +T++   N +   
Sbjct:    31 AGGGPFSLAGFLFGNINGAGQLEGESVLDDECKKHLAGLGALGLGSLITELTA-NEE--- 86

Query:    82 QLFCSYCLLSVNSPQPPVDAVEQDYDKKAEDAVDYEDIDEQYE 124
              L  +   L VN         ++ + +  EDAVDY DI+E  E
Sbjct:    87 -LTGTDGAL-VN---------DEGWVRSTEDAVDYSDINEVAE 118

 Score = 64 (27.6 bits), Expect = 1.1e-67, Sum P(5) = 1.1e-67
 Identities = 17/64 (26%), Positives = 30/64 (46%)

Query:   341 GVEQDAATISEHDEQRK---DSCICSEPMKEESNVNPSVGWKSMPSPNFFPLDQHDWEEN 397
             GV +D +      + RK   +  I S  ++E   +  + G   +   NF  + Q  WE++
Sbjct:   375 GVPEDGSGFDYGFKLRKTEHEPVIKSRMIEEFRKLEENNGTDLLADENFLMVTQLHWEDD 434

Query:   398 IVWD 401
             I+WD
Sbjct:   435 IIWD 438

 Score = 60 (26.2 bits), Expect = 2.9e-62, Sum P(4) = 2.9e-62
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query:   382 PSPNFFPLDQHD-----WEENIVWD 401
             P  + FP+D  D     WE+NI+WD
Sbjct:   484 PWYSIFPIDNEDLVYGRWEDNIIWD 508

 Score = 51 (23.0 bits), Expect = 3.0e-06, Sum P(4) = 3.0e-06
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query:  1220 TANEQIKDPKEEESLIAKRNLSGKVQAMKKNNVQ 1253
             T+N   K+ K+E SL   R L GK   +K+   Q
Sbjct:   542 TSNSPSKESKKESSLKKSRILLGKTGVIKEEPQQ 575

 Score = 46 (21.3 bits), Expect = 1.5e-18, Sum P(3) = 1.5e-18
 Identities = 22/76 (28%), Positives = 34/76 (44%)

Query:   822 LPCIGV--DELSVQFPNLSEAIIR-KKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRK 878
             LP  G+   + +   P+++E     +  K   FLR  G GK V S+ R+     +   R+
Sbjct:   230 LPLAGIMQHDATKLLPSVTELFPEFRPGKVLRFLRLFGPGKNVPSVWRSARRKRKKKHRE 289

Query:   879 LAYPEHV----CSYES 890
             L   E +    CS ES
Sbjct:   290 LIQEEQIQEVECSVES 305

 Score = 39 (18.8 bits), Expect = 6.9e-62, Sum P(5) = 6.9e-62
 Identities = 13/37 (35%), Positives = 16/37 (43%)

Query:    38 NVDYAGDLDVDY-LDEDAKEHLAAVADKLGPSLTDID 73
             N DYA     +  L +D    +A V  K   S  DID
Sbjct:   314 NYDYAPPPPPEQCLSDDEITMMAPVESKFSQSTGDID 350

 Score = 39 (18.8 bits), Expect = 4.5e-60, Sum P(4) = 4.5e-60
 Identities = 7/15 (46%), Positives = 9/15 (60%)

Query:   536 WLDNIIWEPVDAVGK 550
             W D+IIW+  D   K
Sbjct:   431 WEDDIIWDGEDVKHK 445


>UNIPROTKB|F1MF62
            symbol:TAF1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0045120 "pronucleus" evidence=IEA] [GO:0003677 "DNA
            binding" evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0006352 "DNA-dependent
            transcription, initiation" evidence=IEA] [GO:0005669 "transcription
            factor TFIID complex" evidence=IEA] InterPro:IPR001487
            InterPro:IPR011177 Pfam:PF00439 PIRSF:PIRSF003047 PRINTS:PR00503
            PROSITE:PS50014 SMART:SM00297 GO:GO:0006355 GO:GO:0003677
            GO:GO:0006352 Gene3D:1.20.920.10 SUPFAM:SSF47370 PROSITE:PS00633
            InterPro:IPR018359 GO:GO:0045120 GO:GO:0005669 InterPro:IPR022591
            Pfam:PF12157 Gene3D:1.10.1100.10 InterPro:IPR009067 Pfam:PF09247
            SUPFAM:SSF47055 GeneTree:ENSGT00390000012659 OMA:DEFYYPK
            EMBL:DAAA02072840 IPI:IPI00715936 Ensembl:ENSBTAT00000061423
            ArrayExpress:F1MF62 Uniprot:F1MF62
        Length = 1882

 Score = 448 (162.8 bits), Expect = 1.3e-67, Sum P(5) = 1.3e-67
 Identities = 122/362 (33%), Positives = 181/362 (50%)

Query:   621 PNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAF--KKKSDLSVKDGHVFLMEYCEERPL 678
             P+S+  L++   H+  +A+    E ++      F  +   DL+ KDG + L EY EE   
Sbjct:   642 PHSVQPLLK---HIKKKAKMREQERQASGGGEMFFMRTPQDLTGKDGDLILAEYSEENGP 698

Query:   679 LLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGDIKAGCSQS 738
             L+   GM   +  YY K  PG   GA  C      G  +       SPFLG +  G    
Sbjct:   699 LMMQVGMATKIKNYY-KRKPGKDPGAPDCK----YGETVYCHT---SPFLGSLHPGQLLQ 750

Query:   739 SLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVVAQQEPLMEVMSPGSKNLQ 798
             + E N++RAP++ HK+  TDFL++R+ +G   IR +  + VV QQ PL EV  P SK   
Sbjct:   751 AFENNLFRAPIYLHKMPETDFLIIRTRQGYY-IRELVDIFVVGQQCPLFEVPGPNSKRAN 809

Query:   799 TYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIRKKLKECAFLRRDGNG 858
             T+  + + V +YR F  +  R     I ++++   FP+ SE+ IRK+LK CA  +R G  
Sbjct:   810 THIRDFLQVFIYRLFWKSKDRPRR--IRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMD 867

Query:   859 KQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLPASISSAMSQ 918
                W +K  F +P+E ++R +  PE  C+Y SM A   RLK  G  + +  A        
Sbjct:   868 SNWWVLKSDFRLPTEEEIRAMVSPEQCCAYYSMIAAEQRLKDAGYGEKSFFA-------- 919

Query:   919 LPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSY 978
              P+E         I+ E++  PWN +  F+A           LE+TGV DP+G G GFSY
Sbjct:   920 -PEEENEEDFQMKIDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSY 974

Query:   979 VR 980
             V+
Sbjct:   975 VK 976

 Score = 235 (87.8 bits), Expect = 1.3e-67, Sum P(5) = 1.3e-67
 Identities = 65/183 (35%), Positives = 95/183 (51%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             I+ E++  PWN +  F+A           LE+TGV DP+G G GFSYV+           
Sbjct:   932 IDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVKIPNKPTQQKDD 987

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                         TVTGTDADLRRLS++ A+++L KF VPEE I K +RW  I ++R +S+
Sbjct:   988 KEPQPVKK----TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMST 1043

Query:  1052 EQAASG---------------VQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDL 1096
             EQA SG                + Q++ + +CQ I+D Q + LS+ +     +DS  ++ 
Sbjct:  1044 EQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSSTEILSTDTDSSSAED 1103

Query:  1097 DSF 1099
               F
Sbjct:  1104 SDF 1106

 Score = 104 (41.7 bits), Expect = 1.3e-67, Sum P(5) = 1.3e-67
 Identities = 39/121 (32%), Positives = 59/121 (48%)

Query:     7 GSASKXXXXXXXXXXXXXVSGGN-RLLGFMFGNVDYAGDLDVD-YLDEDAKEHLAAV-AD 63
             GSA+                GG   L GF+FGN++ AG L+ +  LD++ K+HLA + A 
Sbjct:     1 GSAAAAAIMSDTDSDEDSAGGGPFSLAGFLFGNINGAGQLEGESVLDDECKKHLAGLGAL 60

Query:    64 KLGPSLTDIDVWNSQSCFQLFCSYCLLSVNSPQPPVDAVEQDYDKKAEDAVDYEDIDEQY 123
              LG  +T++   N +    L  +   L VN         ++ + +  EDAVDY DI+E  
Sbjct:    61 GLGSLITELTA-NEE----LTGTDAAL-VN---------DEGWVRSTEDAVDYSDINEVA 105

Query:   124 E 124
             E
Sbjct:   106 E 106

 Score = 90 (36.7 bits), Expect = 1.3e-67, Sum P(5) = 1.3e-67
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query:   521 FSKHSLQNRDMMEGSWLDNIIWEPVDAVGK----PKLILDLQDEQMLFEILDNKDD 572
             +S   + N +++ G W DNIIW+   A+ +    P L LD  DE ++ EI D K++
Sbjct:   474 YSIFPIDNEELVYGRWEDNIIWD-AQAMPRFLEPPILTLDPNDENLILEIPDEKEE 528

 Score = 59 (25.8 bits), Expect = 1.3e-67, Sum P(5) = 1.3e-67
 Identities = 12/37 (32%), Positives = 19/37 (51%)

Query:   366 MKEESNV-NPSVGWKSMPSPNFFPLDQHDWEENIVWD 401
             M +E  +   S G   +   NF  + Q  WE++I+WD
Sbjct:   390 MADEYRIPEESSGTDLLADENFLMVTQLHWEDDIIWD 426

 Score = 56 (24.8 bits), Expect = 8.5e-63, Sum P(4) = 8.5e-63
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query:   382 PSPNFFPLDQHD-----WEENIVWD 401
             P  + FP+D  +     WE+NI+WD
Sbjct:   472 PWYSIFPIDNEELVYGRWEDNIIWD 496

 Score = 51 (23.0 bits), Expect = 3.4e-06, Sum P(4) = 3.4e-06
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query:  1220 TANEQIKDPKEEESLIAKRNLSGKVQAMKKNNVQ 1253
             T+N   K+ K+E SL   R L GK   +K+   Q
Sbjct:   530 TSNSPSKESKKESSLKKSRILLGKTGVIKEEPQQ 563

 Score = 46 (21.3 bits), Expect = 1.7e-19, Sum P(3) = 1.7e-19
 Identities = 22/76 (28%), Positives = 34/76 (44%)

Query:   822 LPCIGV--DELSVQFPNLSEAIIR-KKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRK 878
             LP  G+   + +   P+++E     +  K   FLR  G GK V S+ R+     +   R+
Sbjct:   218 LPLAGIMQHDATKLLPSVTELFPEFRPGKVLRFLRLFGPGKNVPSVWRSARRKRKKKHRE 277

Query:   879 LAYPEHV----CSYES 890
             L   E +    CS ES
Sbjct:   278 LIQEEQIQETECSVES 293

 Score = 43 (20.2 bits), Expect = 2.7e-61, Sum P(5) = 2.7e-61
 Identities = 22/82 (26%), Positives = 33/82 (40%)

Query:    39 VDYAGDLDVDYLDEDAKEHLAAVADKLGPSL-TDIDVWNSQS-CFQLFCSYCLLSVNSPQ 96
             VDY+   D++ + ED           L P    D D  +  + C  + C      +  P 
Sbjct:    96 VDYS---DINEVAEDESRRYQQTMGSLQPLCHADYDEDDYDADCEDIDCKL----MPPPP 148

Query:    97 PPVDAVEQDYDKKAEDAVDYED 118
             PP   V++D D+ A   V  ED
Sbjct:   149 PPPGPVKKDKDQDAIAGVS-ED 169

 Score = 39 (18.8 bits), Expect = 5.0e-61, Sum P(4) = 5.0e-61
 Identities = 7/15 (46%), Positives = 9/15 (60%)

Query:   536 WLDNIIWEPVDAVGK 550
             W D+IIW+  D   K
Sbjct:   419 WEDDIIWDGEDVKHK 433


>ZFIN|ZDB-GENE-030131-5576
            symbol:taf1 "TAF1 RNA polymerase II, TATA box
            binding protein (TBP)-associated factor" species:7955 "Danio rerio"
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=IEA] [GO:0005669
            "transcription factor TFIID complex" evidence=IEA] [GO:0006352
            "DNA-dependent transcription, initiation" evidence=IEA]
            InterPro:IPR001487 InterPro:IPR011177 Pfam:PF00439
            PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50014 SMART:SM00297
            ZFIN:ZDB-GENE-030131-5576 GO:GO:0006355 GO:GO:0003677 GO:GO:0006352
            EMBL:BX005407 Gene3D:1.20.920.10 SUPFAM:SSF47370 PROSITE:PS00633
            InterPro:IPR018359 GO:GO:0005669 KO:K03125 InterPro:IPR022591
            Pfam:PF12157 Gene3D:1.10.1100.10 InterPro:IPR009067 Pfam:PF09247
            SUPFAM:SSF47055 HOGENOM:HOG000020066 HOVERGEN:HBG050223 CTD:6872
            GeneTree:ENSGT00390000012659 OMA:DEFYYPK IPI:IPI00499221
            RefSeq:NP_001038250.1 UniGene:Dr.104785 SMR:Q1LYC2
            Ensembl:ENSDART00000051196 GeneID:555452 KEGG:dre:555452
            InParanoid:Q1LYC2 NextBio:20880984 Uniprot:Q1LYC2
        Length = 1947

 Score = 443 (161.0 bits), Expect = 1.5e-67, Sum P(5) = 1.5e-67
 Identities = 119/358 (33%), Positives = 178/358 (49%)

Query:   626 HLIRTKI-HLLPRAQKLPGENKSLRPPGAF--KKKSDLSVKDGHVFLMEYCEERPLLLSN 682
             H ++  + H+  +A+    E ++      F  +   DL+ KDG + L EY EE P L   
Sbjct:   686 HAVQPLLKHIKKKAKMREQERQASGGGDMFFMRTAQDLTGKDGDLVLAEYSEEYPPLHMQ 745

Query:   683 AGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGDIKAGCSQSSLET 742
              GM + +  YY K  PG   GA  C      G  +       SPFLG +  G    + E 
Sbjct:   746 VGMASKIKNYY-KRKPGKDPGAPDCK----YGETVYCHT---SPFLGSLHPGQLLQAFEN 797

Query:   743 NMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVVAQQEPLMEVMSPGSKNLQTYSI 802
             N++RAP++ HK+  TDFL++R+ +G   IR +  + VV Q+ PL EV  P SK   T+  
Sbjct:   798 NLFRAPIYLHKMPETDFLIIRTRQGYF-IRELVDIFVVGQECPLYEVPGPNSKRANTHIR 856

Query:   803 NRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIRKKLKECAFLRRDGNGKQVW 862
             + + V +YR F  +  R     I ++++   FP+ SE+ IRK+LK CA  +R G     W
Sbjct:   857 DFLQVFIYRLFWKSKDRPRR--IRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMDSNWW 914

Query:   863 SMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLPASISSAMSQLPDE 922
              +K  F +P+E ++R +  PE  C+Y SM     RLK  G  + +  A         P+E
Sbjct:   915 VLKPDFRLPTEEEIRAMVSPEQCCAYYSMLVAEQRLKDAGYGEKSFFA---------PEE 965

Query:   923 AIALAAASHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVR 980
                      I+ E++  PWN +  F+A           LE+TGV DP+G G GFSYV+
Sbjct:   966 ENEEEFQMKIDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVK 1019

 Score = 231 (86.4 bits), Expect = 1.5e-67, Sum P(5) = 1.5e-67
 Identities = 64/183 (34%), Positives = 94/183 (51%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             I+ E++  PWN +  F+A           LE+TGV DP+G G GFSYV+           
Sbjct:   975 IDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVKVPNKPTQQKDD 1030

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                         TVTGTDADLRRLS++ A+++L KF VPEE I K +RW  I ++R +S+
Sbjct:  1031 REPQPVKK----TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMST 1086

Query:  1052 EQAASG---------------VQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDL 1096
             EQA SG                + Q++ + +CQ I+D Q + L + +     +DS  ++ 
Sbjct:  1087 EQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLESTEVLSTDTDSSSAED 1146

Query:  1097 DSF 1099
               F
Sbjct:  1147 SDF 1149

 Score = 108 (43.1 bits), Expect = 1.5e-67, Sum P(5) = 1.5e-67
 Identities = 33/96 (34%), Positives = 48/96 (50%)

Query:    31 LLGFMFGNVDYAGDLDVD-YLDEDAKEHLAAVADK-LGPSLTDIDVWNSQSCFQLFCSYC 88
             L GF+FGN++  G L+ D  LD ++K+HLA +    LG  +T+I     ++  Q      
Sbjct:    16 LTGFLFGNINENGQLEDDSVLDTESKKHLAGLGTLGLGSLITEITASEEETAEQ------ 69

Query:    89 LLSVNSPQPPVDAVEQDYDKKAEDAVDYEDIDEQYE 124
                    Q   DA  + + +  EDAVDY DI E  E
Sbjct:    70 ----EQDQSSTDA--EGWVRSTEDAVDYSDISEVAE 99

 Score = 101 (40.6 bits), Expect = 1.5e-67, Sum P(5) = 1.5e-67
 Identities = 22/71 (30%), Positives = 38/71 (53%)

Query:   505 KENVSIKLCQSDAVKRFSKHSLQNRDMMEGSWLDNIIWEP--VDAVGKPKLI-LDLQDEQ 561
             K N   ++   D    FS   + N +++ G W DNIIW+   +D +  P ++ L+  DE 
Sbjct:   501 KHNHDHQVSHEDDTPWFSIFPIDNEELVYGRWEDNIIWDDQNMDCIPSPPVLTLNPNDEN 560

Query:   562 MLFEILDNKDD 572
             ++ EI D K++
Sbjct:   561 IILEIPDEKEE 571

 Score = 56 (24.8 bits), Expect = 3.7e-62, Sum P(4) = 3.7e-62
 Identities = 9/23 (39%), Positives = 15/23 (65%)

Query:   385 NFFPLDQHD-----WEENIVWDN 402
             + FP+D  +     WE+NI+WD+
Sbjct:   518 SIFPIDNEELVYGRWEDNIIWDD 540

 Score = 54 (24.1 bits), Expect = 1.5e-67, Sum P(5) = 1.5e-67
 Identities = 26/118 (22%), Positives = 44/118 (37%)

Query:   363 SEPMKEESNVNPSVGWKSMPSPN----FFPLDQHDWEENIVWDNSPAASDNS-IESHEIA 417
             SEP+ E+     S       +P     F  + Q  WEE+I+W+        +  +   +A
Sbjct:   394 SEPVPEQEPPQESQDKADEDAPMQDELFLMVTQLQWEEDIIWNGEDVKHKGTKTQRASLA 453

Query:   418 GADVESALMRGIELDTGQNNFHERSTSPNEKDCIVNMQNSPVLSETFGSKSSSDNTSH 475
             G  + +++ R       Q     RS S         +  +P ++   GSK    N  H
Sbjct:   454 GW-LPTSMTRNANAYNAQQGL-SRSNSQLVPPTPPPLTKTPSIT---GSKRDKHNHDH 506

 Score = 39 (18.8 bits), Expect = 5.9e-06, Sum P(4) = 5.9e-06
 Identities = 10/29 (34%), Positives = 16/29 (55%)

Query:  1220 TANEQIKDPKEEESLIAKRNLSGKVQAMK 1248
             T++   K+ K+E +L   R L GK   +K
Sbjct:   573 TSHSPSKENKKETALKKSRILLGKTGVIK 601

 Score = 37 (18.1 bits), Expect = 1.9e-59, Sum P(4) = 1.9e-59
 Identities = 18/68 (26%), Positives = 29/68 (42%)

Query:    93 NSPQPPVDAVEQDYDKKAEDAVDYEDID---EQYEGPEIQIASEEDYLLPKKEYFAAEVS 149
             + P P  +  ++  DK  EDA   +++     Q +  E  I + ED  +  K       S
Sbjct:   394 SEPVPEQEPPQESQDKADEDAPMQDELFLMVTQLQWEEDIIWNGED--VKHKGTKTQRAS 451

Query:   150 LAALKPTA 157
             LA   PT+
Sbjct:   452 LAGWLPTS 459


>UNIPROTKB|Q8IZX4
            symbol:TAF1L "Transcription initiation factor TFIID subunit
            1-like" species:9606 "Homo sapiens" [GO:0005669 "transcription
            factor TFIID complex" evidence=IEA] [GO:0006352 "DNA-dependent
            transcription, initiation" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0006357 "regulation of transcription from RNA
            polymerase II promoter" evidence=IDA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0004402 "histone
            acetyltransferase activity" evidence=ISS] [GO:0004674 "protein
            serine/threonine kinase activity" evidence=ISS] [GO:0045893
            "positive regulation of transcription, DNA-dependent" evidence=IDA]
            [GO:0007140 "male meiosis" evidence=IEP] [GO:0017025 "TBP-class
            protein binding" evidence=IPI] [GO:0070577 "histone acetyl-lysine
            binding" evidence=IDA] [GO:0016573 "histone acetylation"
            evidence=ISS] InterPro:IPR001487 InterPro:IPR011177 Pfam:PF00439
            PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50014 SMART:SM00297
            GO:GO:0045893 GO:GO:0003677 GO:GO:0004674 GO:GO:0006352
            GO:GO:0006357 GO:GO:0007140 Gene3D:1.20.920.10 SUPFAM:SSF47370
            PROSITE:PS00633 eggNOG:COG5076 GO:GO:0070577 InterPro:IPR018359
            GO:GO:0004402 GO:GO:0005669 KO:K03125 InterPro:IPR022591
            Pfam:PF12157 Gene3D:1.10.1100.10 InterPro:IPR009067 Pfam:PF09247
            SUPFAM:SSF47055 EMBL:AF390562 IPI:IPI00100867 RefSeq:NP_722516.1
            UniGene:Hs.591086 PDB:3HMH PDBsum:3HMH ProteinModelPortal:Q8IZX4
            SMR:Q8IZX4 IntAct:Q8IZX4 STRING:Q8IZX4 PhosphoSite:Q8IZX4
            DMDM:57013082 PaxDb:Q8IZX4 PRIDE:Q8IZX4 Ensembl:ENST00000242310
            GeneID:138474 KEGG:hsa:138474 UCSC:uc003zrg.1 CTD:138474
            GeneCards:GC09M032619 H-InvDB:HIX0169078 HGNC:HGNC:18056 MIM:607798
            neXtProt:NX_Q8IZX4 PharmGKB:PA134947802 HOGENOM:HOG000020066
            HOVERGEN:HBG050223 InParanoid:Q8IZX4 OMA:VIREEPQ OrthoDB:EOG4K3KVC
            PhylomeDB:Q8IZX4 EvolutionaryTrace:Q8IZX4 GenomeRNAi:138474
            NextBio:83796 CleanEx:HS_TAF1L Genevestigator:Q8IZX4
            GermOnline:ENSG00000122728 Uniprot:Q8IZX4
        Length = 1826

 Score = 440 (159.9 bits), Expect = 5.1e-67, Sum P(5) = 5.1e-67
 Identities = 122/362 (33%), Positives = 179/362 (49%)

Query:   621 PNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAF--KKKSDLSVKDGHVFLMEYCEERPL 678
             P+S+  L++   H+  +A+    E ++      F  +   DL+ KDG + L EY EE   
Sbjct:   652 PHSVQPLLK---HIKKKAKMREQERQASGGGELFFMRTPQDLTGKDGDLILAEYSEENGP 708

Query:   679 LLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGDIKAGCSQS 738
             L+   GM   +  YY K  PG   GA  C      G  +       SPFLG +  G    
Sbjct:   709 LMMQVGMATKIKNYY-KRKPGKDPGAPDCK----YGETVYCHT---SPFLGSLHPGQLLQ 760

Query:   739 SLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVVAQQEPLMEVMSPGSKNLQ 798
             +LE N++RAPV+ HK+  TDFL++R+ +G   IR +  + VV QQ PL EV  P S+   
Sbjct:   761 ALENNLFRAPVYLHKMPETDFLIIRTRQGYY-IRELVDIFVVGQQCPLFEVPGPNSRRAN 819

Query:   799 TYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIRKKLKECAFLRRDGNG 858
              +  + + V +YR F  +  R     I ++++   FP+ SE+ IRK+LK CA  +R G  
Sbjct:   820 MHIRDFLQVFIYRLFWKSKDRPRR--IRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMD 877

Query:   859 KQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLPASISSAMSQ 918
                W +K  F +P+E ++R    PE  C+Y SM A   RLK  G  + +  A        
Sbjct:   878 SNWWVLKSDFRLPTEEEIRAKVSPEQCCAYYSMIAAKQRLKDAGYGEKSFFA-------- 929

Query:   919 LPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSY 978
              P+E         I+ E+   PWN +  F+A           LE+TGV DP+G G GFSY
Sbjct:   930 -PEEENEEDFQMKIDDEVHAAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSY 984

Query:   979 VR 980
             V+
Sbjct:   985 VK 986

 Score = 232 (86.7 bits), Expect = 5.1e-67, Sum P(5) = 5.1e-67
 Identities = 65/183 (35%), Positives = 94/183 (51%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             I+ E+   PWN +  F+A           LE+TGV DP+G G GFSYV+           
Sbjct:   942 IDDEVHAAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVKIPNKPTQQKDD 997

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                         TVTGTDADLRRLS++ A+++L KF VPEE I K +RW  I ++R +S+
Sbjct:   998 KEPQAVKK----TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMST 1053

Query:  1052 EQAASG---------------VQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDL 1096
             EQA SG                + Q++ + +CQ I+D Q + LS+ +     +DS  ++ 
Sbjct:  1054 EQAHSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSSTEVLSTDTDSISAED 1113

Query:  1097 DSF 1099
               F
Sbjct:  1114 SDF 1116

 Score = 95 (38.5 bits), Expect = 5.1e-67, Sum P(5) = 5.1e-67
 Identities = 21/56 (37%), Positives = 32/56 (57%)

Query:   521 FSKHSLQNRDMMEGSWLDNIIWEPVDAVGK----PKLILDLQDEQMLFEILDNKDD 572
             +S   + N D++ G W DNIIW+   A+ +    P L LD  DE ++ EI D K++
Sbjct:   484 YSIFPIDNEDLVYGRWEDNIIWD-AQAMPRLLEPPVLALDPNDENLILEIPDEKEE 538

 Score = 91 (37.1 bits), Expect = 5.1e-67, Sum P(5) = 5.1e-67
 Identities = 36/103 (34%), Positives = 54/103 (52%)

Query:    26 SGGN--RLLGFMFGNVDYAGDLDVD-YLDEDAKEHLAAV-ADKLGPSLTDIDVWNSQSCF 81
             SGG    L G +FGN+  AG L+ +  LD++ K+HLA + A  LG  +T++   N +   
Sbjct:    30 SGGGPFTLAGILFGNISGAGQLEGESVLDDECKKHLAGLGALGLGSLITELTA-NEE--- 85

Query:    82 QLFCSYCLLSVNSPQPPVDAVEQDYDKKAEDAVDYEDIDEQYE 124
              L  +   L VN         ++ + +  EDAVDY DI+E  E
Sbjct:    86 -LTGTGGAL-VN---------DEGWIRSTEDAVDYSDINEVAE 117

 Score = 72 (30.4 bits), Expect = 5.1e-67, Sum P(5) = 5.1e-67
 Identities = 19/64 (29%), Positives = 30/64 (46%)

Query:   341 GVEQDAATISEHDEQRK---DSCICSEPMKEESNVNPSVGWKSMPSPNFFPLDQHDWEEN 397
             GV +D +      + RK   +  I S  M+E   +  S G   +   NF  + Q  WE++
Sbjct:   373 GVSEDGSGFDYGFKLRKTQHEPVIKSRMMEEFRKLEESNGTDLLADENFLMVTQLHWEDS 432

Query:   398 IVWD 401
             I+WD
Sbjct:   433 IIWD 436

 Score = 60 (26.2 bits), Expect = 9.6e-61, Sum P(4) = 9.6e-61
 Identities = 11/25 (44%), Positives = 15/25 (60%)

Query:   382 PSPNFFPLDQHD-----WEENIVWD 401
             P  + FP+D  D     WE+NI+WD
Sbjct:   482 PWYSIFPIDNEDLVYGRWEDNIIWD 506

 Score = 48 (22.0 bits), Expect = 2.1e-06, Sum P(4) = 2.1e-06
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query:  1220 TANEQIKDPKEEESLIAKRNLSGKVQAMKKNNVQ 1253
             T+N   K+ K+E SL   R L GK   +++   Q
Sbjct:   540 TSNSPSKESKKESSLKKSRILLGKTGVIREEPQQ 573

 Score = 38 (18.4 bits), Expect = 1.9e-58, Sum P(4) = 1.9e-58
 Identities = 6/11 (54%), Positives = 8/11 (72%)

Query:   536 WLDNIIWEPVD 546
             W D+IIW+  D
Sbjct:   429 WEDSIIWDGED 439


>UNIPROTKB|E2QSZ4
            symbol:TAF1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0006352 "DNA-dependent
            transcription, initiation" evidence=IEA] [GO:0005669 "transcription
            factor TFIID complex" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=IEA] InterPro:IPR001487 InterPro:IPR011177 Pfam:PF00439
            PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50014 SMART:SM00297
            GO:GO:0006355 GO:GO:0003677 GO:GO:0006352 Gene3D:1.20.920.10
            SUPFAM:SSF47370 PROSITE:PS00633 InterPro:IPR018359 GO:GO:0005669
            InterPro:IPR022591 Pfam:PF12157 Gene3D:1.10.1100.10
            InterPro:IPR009067 Pfam:PF09247 SUPFAM:SSF47055
            Ensembl:ENSCAFT00000027104 Uniprot:E2QSZ4
        Length = 1872

 Score = 448 (162.8 bits), Expect = 2.2e-66, Sum P(5) = 2.2e-66
 Identities = 122/362 (33%), Positives = 181/362 (50%)

Query:   621 PNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAF--KKKSDLSVKDGHVFLMEYCEERPL 678
             P+S+  L++   H+  +A+    E ++      F  +   DL+ KDG + L EY EE   
Sbjct:   633 PHSVQPLLK---HIKKKAKMREQERQASGGGEMFFMRTPQDLTGKDGDLILAEYSEENGP 689

Query:   679 LLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGDIKAGCSQS 738
             L+   GM   +  YY K  PG   GA  C      G  +       SPFLG +  G    
Sbjct:   690 LMMQVGMATKIKNYY-KRKPGKDPGAPDCK----YGETVYCHT---SPFLGSLHPGQLLQ 741

Query:   739 SLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVVAQQEPLMEVMSPGSKNLQ 798
             + E N++RAP++ HK+  TDFL++R+ +G   IR +  + VV QQ PL EV  P SK   
Sbjct:   742 AFENNLFRAPIYLHKMPETDFLIIRTRQGYY-IRELVDIFVVGQQCPLFEVPGPNSKRAN 800

Query:   799 TYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIRKKLKECAFLRRDGNG 858
             T+  + + V +YR F  +  R     I ++++   FP+ SE+ IRK+LK CA  +R G  
Sbjct:   801 THIRDFLQVFIYRLFWKSKDRPRR--IRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMD 858

Query:   859 KQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLPASISSAMSQ 918
                W +K  F +P+E ++R +  PE  C+Y SM A   RLK  G  + +  A        
Sbjct:   859 SNWWVLKSDFRLPTEEEIRAMVSPEQCCAYYSMIAAEQRLKDAGYGEKSFFA-------- 910

Query:   919 LPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSY 978
              P+E         I+ E++  PWN +  F+A           LE+TGV DP+G G GFSY
Sbjct:   911 -PEEENEEDFQMKIDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSY 965

Query:   979 VR 980
             V+
Sbjct:   966 VK 967

 Score = 235 (87.8 bits), Expect = 2.2e-66, Sum P(5) = 2.2e-66
 Identities = 65/183 (35%), Positives = 95/183 (51%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             I+ E++  PWN +  F+A           LE+TGV DP+G G GFSYV+           
Sbjct:   923 IDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVKIPNKPTQQKDD 978

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                         TVTGTDADLRRLS++ A+++L KF VPEE I K +RW  I ++R +S+
Sbjct:   979 KEPQPVKK----TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMST 1034

Query:  1052 EQAASG---------------VQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDL 1096
             EQA SG                + Q++ + +CQ I+D Q + LS+ +     +DS  ++ 
Sbjct:  1035 EQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSSTEILSTDTDSSSAED 1094

Query:  1097 DSF 1099
               F
Sbjct:  1095 SDF 1097

 Score = 93 (37.8 bits), Expect = 2.2e-66, Sum P(5) = 2.2e-66
 Identities = 36/103 (34%), Positives = 55/103 (53%)

Query:    26 SGGN--RLLGFMFGNVDYAGDLDVD-YLDEDAKEHLAAV-ADKLGPSLTDIDVWNSQSCF 81
             +GG    L GF+FGN++ AG L+ +  LD++ K+HLA + A  LG  +T++   N +   
Sbjct:    31 AGGGPFSLTGFLFGNINGAGQLEGESVLDDECKKHLAGLGALGLGSLITELTA-NEE--- 86

Query:    82 QLFCSYCLLSVNSPQPPVDAVEQDYDKKAEDAVDYEDIDEQYE 124
              L  +   L VN         ++ + +  EDAVDY DI E  E
Sbjct:    87 -LAGTDGAL-VN---------DEGWIRSTEDAVDYSDITEVAE 118

 Score = 91 (37.1 bits), Expect = 2.2e-66, Sum P(5) = 2.2e-66
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query:   521 FSKHSLQNRDMMEGSWLDNIIWEPVDAVGK----PKLILDLQDEQMLFEILDNKDD 572
             +S   + N +++ G W DNIIW+   A+ +    P L LD  DE ++ EI D K++
Sbjct:   465 YSIFPIDNEELVYGRWEDNIIWD-AQAMPRLLEPPVLTLDPNDENLILEIPDEKEE 519

 Score = 57 (25.1 bits), Expect = 2.2e-66, Sum P(5) = 2.2e-66
 Identities = 16/64 (25%), Positives = 28/64 (43%)

Query:   341 GVEQDAATISEHDEQRK---DSCICSEPMKEESNVNPSVGWKSMPSPNFFPLDQHDWEEN 397
             GV +D +      + RK   +  I    M++      + G   +   NF  + Q  WE++
Sbjct:   354 GVPEDGSGFDYGFKLRKMDHEPVIKCRMMEDLRKFEENNGSDLLADENFLMVTQLHWEDD 413

Query:   398 IVWD 401
             I+WD
Sbjct:   414 IIWD 417

 Score = 56 (24.8 bits), Expect = 1.2e-61, Sum P(4) = 1.2e-61
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query:   382 PSPNFFPLDQHD-----WEENIVWD 401
             P  + FP+D  +     WE+NI+WD
Sbjct:   463 PWYSIFPIDNEELVYGRWEDNIIWD 487

 Score = 50 (22.7 bits), Expect = 6.4e-05, Sum P(4) = 6.4e-05
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query:  1220 TANEQIKDPKEEESLIAKRNLSGKVQAMKKNNVQ 1253
             T+N   K+ K+E SL   R L GK   +K+   Q
Sbjct:   521 TSNSPSKENKKESSLKKSRILLGKTGVIKEEPQQ 554

 Score = 46 (21.3 bits), Expect = 2.3e-18, Sum P(3) = 2.3e-18
 Identities = 22/76 (28%), Positives = 34/76 (44%)

Query:   822 LPCIGV--DELSVQFPNLSEAIIR-KKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRK 878
             LP  G+   + +   P+++E     +  K   FLR  G GK V S+ R+     +   R+
Sbjct:   209 LPLAGIMQHDATKLLPSVTELFPEFRPGKVLRFLRLFGPGKNVPSVWRSARRKRKKKHRE 268

Query:   879 LAYPEHV----CSYES 890
             L   E +    CS ES
Sbjct:   269 LIQEEQIQEVECSVES 284

 Score = 39 (18.8 bits), Expect = 8.7e-61, Sum P(5) = 8.7e-61
 Identities = 13/37 (35%), Positives = 16/37 (43%)

Query:    38 NVDYAGDLDVDY-LDEDAKEHLAAVADKLGPSLTDID 73
             N DYA     +  L +D    +A V  K   S  DID
Sbjct:   293 NYDYAPPPPPEQCLSDDEITMMAPVESKFSQSTGDID 329

 Score = 39 (18.8 bits), Expect = 6.8e-60, Sum P(4) = 6.8e-60
 Identities = 7/15 (46%), Positives = 9/15 (60%)

Query:   536 WLDNIIWEPVDAVGK 550
             W D+IIW+  D   K
Sbjct:   410 WEDDIIWDGEDVKHK 424


>UNIPROTKB|J9NZ21
            symbol:TAF1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0006352 "DNA-dependent
            transcription, initiation" evidence=IEA] [GO:0005669 "transcription
            factor TFIID complex" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=IEA] InterPro:IPR001487 InterPro:IPR011177 Pfam:PF00439
            PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50014 SMART:SM00297
            GO:GO:0006355 GO:GO:0003677 GO:GO:0006352 Gene3D:1.20.920.10
            SUPFAM:SSF47370 PROSITE:PS00633 InterPro:IPR018359 GO:GO:0005669
            InterPro:IPR022591 Pfam:PF12157 Gene3D:1.10.1100.10
            InterPro:IPR009067 Pfam:PF09247 SUPFAM:SSF47055
            GeneTree:ENSGT00390000012659 OMA:DEFYYPK EMBL:AAEX03026487
            Ensembl:ENSCAFT00000043925 Uniprot:J9NZ21
        Length = 1926

 Score = 448 (162.8 bits), Expect = 2.7e-66, Sum P(5) = 2.7e-66
 Identities = 122/362 (33%), Positives = 181/362 (50%)

Query:   621 PNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAF--KKKSDLSVKDGHVFLMEYCEERPL 678
             P+S+  L++   H+  +A+    E ++      F  +   DL+ KDG + L EY EE   
Sbjct:   654 PHSVQPLLK---HIKKKAKMREQERQASGGGEMFFMRTPQDLTGKDGDLILAEYSEENGP 710

Query:   679 LLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGDIKAGCSQS 738
             L+   GM   +  YY K  PG   GA  C      G  +       SPFLG +  G    
Sbjct:   711 LMMQVGMATKIKNYY-KRKPGKDPGAPDCK----YGETVYCHT---SPFLGSLHPGQLLQ 762

Query:   739 SLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVVAQQEPLMEVMSPGSKNLQ 798
             + E N++RAP++ HK+  TDFL++R+ +G   IR +  + VV QQ PL EV  P SK   
Sbjct:   763 AFENNLFRAPIYLHKMPETDFLIIRTRQGYY-IRELVDIFVVGQQCPLFEVPGPNSKRAN 821

Query:   799 TYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIRKKLKECAFLRRDGNG 858
             T+  + + V +YR F  +  R     I ++++   FP+ SE+ IRK+LK CA  +R G  
Sbjct:   822 THIRDFLQVFIYRLFWKSKDRPRR--IRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMD 879

Query:   859 KQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLPASISSAMSQ 918
                W +K  F +P+E ++R +  PE  C+Y SM A   RLK  G  + +  A        
Sbjct:   880 SNWWVLKSDFRLPTEEEIRAMVSPEQCCAYYSMIAAEQRLKDAGYGEKSFFA-------- 931

Query:   919 LPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSY 978
              P+E         I+ E++  PWN +  F+A           LE+TGV DP+G G GFSY
Sbjct:   932 -PEEENEEDFQMKIDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSY 986

Query:   979 VR 980
             V+
Sbjct:   987 VK 988

 Score = 235 (87.8 bits), Expect = 2.7e-66, Sum P(5) = 2.7e-66
 Identities = 65/183 (35%), Positives = 95/183 (51%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             I+ E++  PWN +  F+A           LE+TGV DP+G G GFSYV+           
Sbjct:   944 IDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVKIPNKPTQQKDD 999

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                         TVTGTDADLRRLS++ A+++L KF VPEE I K +RW  I ++R +S+
Sbjct:  1000 KEPQPVKK----TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMST 1055

Query:  1052 EQAASG---------------VQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDL 1096
             EQA SG                + Q++ + +CQ I+D Q + LS+ +     +DS  ++ 
Sbjct:  1056 EQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSSTEILSTDTDSSSAED 1115

Query:  1097 DSF 1099
               F
Sbjct:  1116 SDF 1118

 Score = 93 (37.8 bits), Expect = 2.7e-66, Sum P(5) = 2.7e-66
 Identities = 36/103 (34%), Positives = 55/103 (53%)

Query:    26 SGGN--RLLGFMFGNVDYAGDLDVD-YLDEDAKEHLAAV-ADKLGPSLTDIDVWNSQSCF 81
             +GG    L GF+FGN++ AG L+ +  LD++ K+HLA + A  LG  +T++   N +   
Sbjct:    31 AGGGPFSLTGFLFGNINGAGQLEGESVLDDECKKHLAGLGALGLGSLITELTA-NEE--- 86

Query:    82 QLFCSYCLLSVNSPQPPVDAVEQDYDKKAEDAVDYEDIDEQYE 124
              L  +   L VN         ++ + +  EDAVDY DI E  E
Sbjct:    87 -LAGTDGAL-VN---------DEGWIRSTEDAVDYSDITEVAE 118

 Score = 91 (37.1 bits), Expect = 2.7e-66, Sum P(5) = 2.7e-66
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query:   521 FSKHSLQNRDMMEGSWLDNIIWEPVDAVGK----PKLILDLQDEQMLFEILDNKDD 572
             +S   + N +++ G W DNIIW+   A+ +    P L LD  DE ++ EI D K++
Sbjct:   486 YSIFPIDNEELVYGRWEDNIIWD-AQAMPRLLEPPVLTLDPNDENLILEIPDEKEE 540

 Score = 57 (25.1 bits), Expect = 2.7e-66, Sum P(5) = 2.7e-66
 Identities = 16/64 (25%), Positives = 28/64 (43%)

Query:   341 GVEQDAATISEHDEQRK---DSCICSEPMKEESNVNPSVGWKSMPSPNFFPLDQHDWEEN 397
             GV +D +      + RK   +  I    M++      + G   +   NF  + Q  WE++
Sbjct:   375 GVPEDGSGFDYGFKLRKMDHEPVIKCRMMEDLRKFEENNGSDLLADENFLMVTQLHWEDD 434

Query:   398 IVWD 401
             I+WD
Sbjct:   435 IIWD 438

 Score = 56 (24.8 bits), Expect = 1.4e-61, Sum P(4) = 1.4e-61
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query:   382 PSPNFFPLDQHD-----WEENIVWD 401
             P  + FP+D  +     WE+NI+WD
Sbjct:   484 PWYSIFPIDNEELVYGRWEDNIIWD 508

 Score = 50 (22.7 bits), Expect = 7.2e-05, Sum P(4) = 7.2e-05
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query:  1220 TANEQIKDPKEEESLIAKRNLSGKVQAMKKNNVQ 1253
             T+N   K+ K+E SL   R L GK   +K+   Q
Sbjct:   542 TSNSPSKENKKESSLKKSRILLGKTGVIKEEPQQ 575

 Score = 46 (21.3 bits), Expect = 2.5e-18, Sum P(3) = 2.5e-18
 Identities = 22/76 (28%), Positives = 34/76 (44%)

Query:   822 LPCIGV--DELSVQFPNLSEAIIR-KKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRK 878
             LP  G+   + +   P+++E     +  K   FLR  G GK V S+ R+     +   R+
Sbjct:   230 LPLAGIMQHDATKLLPSVTELFPEFRPGKVLRFLRLFGPGKNVPSVWRSARRKRKKKHRE 289

Query:   879 LAYPEHV----CSYES 890
             L   E +    CS ES
Sbjct:   290 LIQEEQIQEVECSVES 305

 Score = 39 (18.8 bits), Expect = 1.1e-60, Sum P(5) = 1.1e-60
 Identities = 13/37 (35%), Positives = 16/37 (43%)

Query:    38 NVDYAGDLDVDY-LDEDAKEHLAAVADKLGPSLTDID 73
             N DYA     +  L +D    +A V  K   S  DID
Sbjct:   314 NYDYAPPPPPEQCLSDDEITMMAPVESKFSQSTGDID 350

 Score = 39 (18.8 bits), Expect = 8.0e-60, Sum P(4) = 8.0e-60
 Identities = 7/15 (46%), Positives = 9/15 (60%)

Query:   536 WLDNIIWEPVDAVGK 550
             W D+IIW+  D   K
Sbjct:   431 WEDDIIWDGEDVKHK 445


>UNIPROTKB|F6UPV2
            symbol:TAF1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0006352 "DNA-dependent
            transcription, initiation" evidence=IEA] [GO:0005669 "transcription
            factor TFIID complex" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=IEA] InterPro:IPR001487 InterPro:IPR011177 Pfam:PF00439
            PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50014 SMART:SM00297
            GO:GO:0006355 GO:GO:0003677 GO:GO:0006352 Gene3D:1.20.920.10
            SUPFAM:SSF47370 PROSITE:PS00633 InterPro:IPR018359 GO:GO:0005669
            InterPro:IPR022591 Pfam:PF12157 Gene3D:1.10.1100.10
            InterPro:IPR009067 Pfam:PF09247 SUPFAM:SSF47055
            GeneTree:ENSGT00390000012659 Ensembl:ENSCAFT00000027104
            EMBL:AAEX03026487 Uniprot:F6UPV2
        Length = 1893

 Score = 446 (162.1 bits), Expect = 6.5e-66, Sum P(5) = 6.5e-66
 Identities = 122/362 (33%), Positives = 180/362 (49%)

Query:   621 PNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAF--KKKSDLSVKDGHVFLMEYCEERPL 678
             P+S+  L++   H+  +A+    E ++      F  +   DL+ KDG + L EY EE   
Sbjct:   654 PHSVQPLLK---HIKKKAKMREQERQASGGGEMFFMRTPQDLTGKDGDLILAEYSEENGP 710

Query:   679 LLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGDIKAGCSQS 738
             L+   GM   +  YY K  PG   GA  C      G  +       SPFLG +  G    
Sbjct:   711 LMMQVGMATKIKNYY-KRKPGKDPGAPDCK----YGETVYCHT---SPFLGSLHPGQLLQ 762

Query:   739 SLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVVAQQEPLMEVMSPGSKNLQ 798
             + E N++RAP++ HK+  TDFL++R+ +G   IR +  + VV QQ PL EV  P SK   
Sbjct:   763 AFENNLFRAPIYLHKMPETDFLIIRTRQGYY-IRELVDIFVVGQQCPLFEVPGPNSKRAN 821

Query:   799 TYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIRKKLKECAFLRRDGNG 858
             T+  + + V +YR F  +  R     I ++++   FP+ SE+ IRK+LK CA  +R G  
Sbjct:   822 THIRDFLQVFIYRLFWKSKDRPRR--IRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMD 879

Query:   859 KQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLPASISSAMSQ 918
                W +K  F +P+E ++R +  PE  C+Y SM A   RLK  G  + +  A        
Sbjct:   880 SNWWVLKSDFRLPTEEEIRAMVSPEQCCAYYSMIAAEQRLKDAGYGEKSFFA-------- 931

Query:   919 LPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSY 978
              P+E         I  E++  PWN +  F+A           LE+TGV DP+G G GFSY
Sbjct:   932 -PEEENEEDFQMKIVNEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSY 986

Query:   979 VR 980
             V+
Sbjct:   987 VK 988

 Score = 233 (87.1 bits), Expect = 6.5e-66, Sum P(5) = 6.5e-66
 Identities = 65/183 (35%), Positives = 94/183 (51%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             I  E++  PWN +  F+A           LE+TGV DP+G G GFSYV+           
Sbjct:   944 IVNEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVKIPNKPTQQKDD 999

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                         TVTGTDADLRRLS++ A+++L KF VPEE I K +RW  I ++R +S+
Sbjct:  1000 KEPQPVKK----TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMST 1055

Query:  1052 EQAASG---------------VQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDL 1096
             EQA SG                + Q++ + +CQ I+D Q + LS+ +     +DS  ++ 
Sbjct:  1056 EQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSSTEILSTDTDSSSAED 1115

Query:  1097 DSF 1099
               F
Sbjct:  1116 SDF 1118

 Score = 93 (37.8 bits), Expect = 6.5e-66, Sum P(5) = 6.5e-66
 Identities = 36/103 (34%), Positives = 55/103 (53%)

Query:    26 SGGN--RLLGFMFGNVDYAGDLDVD-YLDEDAKEHLAAV-ADKLGPSLTDIDVWNSQSCF 81
             +GG    L GF+FGN++ AG L+ +  LD++ K+HLA + A  LG  +T++   N +   
Sbjct:    31 AGGGPFSLTGFLFGNINGAGQLEGESVLDDECKKHLAGLGALGLGSLITELTA-NEE--- 86

Query:    82 QLFCSYCLLSVNSPQPPVDAVEQDYDKKAEDAVDYEDIDEQYE 124
              L  +   L VN         ++ + +  EDAVDY DI E  E
Sbjct:    87 -LAGTDGAL-VN---------DEGWIRSTEDAVDYSDITEVAE 118

 Score = 91 (37.1 bits), Expect = 6.5e-66, Sum P(5) = 6.5e-66
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query:   521 FSKHSLQNRDMMEGSWLDNIIWEPVDAVGK----PKLILDLQDEQMLFEILDNKDD 572
             +S   + N +++ G W DNIIW+   A+ +    P L LD  DE ++ EI D K++
Sbjct:   486 YSIFPIDNEELVYGRWEDNIIWD-AQAMPRLLEPPVLTLDPNDENLILEIPDEKEE 540

 Score = 57 (25.1 bits), Expect = 6.5e-66, Sum P(5) = 6.5e-66
 Identities = 16/64 (25%), Positives = 28/64 (43%)

Query:   341 GVEQDAATISEHDEQRK---DSCICSEPMKEESNVNPSVGWKSMPSPNFFPLDQHDWEEN 397
             GV +D +      + RK   +  I    M++      + G   +   NF  + Q  WE++
Sbjct:   375 GVPEDGSGFDYGFKLRKMDHEPVIKCRMMEDLRKFEENNGSDLLADENFLMVTQLHWEDD 434

Query:   398 IVWD 401
             I+WD
Sbjct:   435 IIWD 438

 Score = 56 (24.8 bits), Expect = 3.3e-61, Sum P(4) = 3.3e-61
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query:   382 PSPNFFPLDQHD-----WEENIVWD 401
             P  + FP+D  +     WE+NI+WD
Sbjct:   484 PWYSIFPIDNEELVYGRWEDNIIWD 508

 Score = 50 (22.7 bits), Expect = 6.7e-05, Sum P(4) = 6.7e-05
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query:  1220 TANEQIKDPKEEESLIAKRNLSGKVQAMKKNNVQ 1253
             T+N   K+ K+E SL   R L GK   +K+   Q
Sbjct:   542 TSNSPSKENKKESSLKKSRILLGKTGVIKEEPQQ 575

 Score = 46 (21.3 bits), Expect = 3.9e-18, Sum P(3) = 3.9e-18
 Identities = 22/76 (28%), Positives = 34/76 (44%)

Query:   822 LPCIGV--DELSVQFPNLSEAIIR-KKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRK 878
             LP  G+   + +   P+++E     +  K   FLR  G GK V S+ R+     +   R+
Sbjct:   230 LPLAGIMQHDATKLLPSVTELFPEFRPGKVLRFLRLFGPGKNVPSVWRSARRKRKKKHRE 289

Query:   879 LAYPEHV----CSYES 890
             L   E +    CS ES
Sbjct:   290 LIQEEQIQEVECSVES 305

 Score = 39 (18.8 bits), Expect = 2.5e-60, Sum P(5) = 2.5e-60
 Identities = 13/37 (35%), Positives = 16/37 (43%)

Query:    38 NVDYAGDLDVDY-LDEDAKEHLAAVADKLGPSLTDID 73
             N DYA     +  L +D    +A V  K   S  DID
Sbjct:   314 NYDYAPPPPPEQCLSDDEITMMAPVESKFSQSTGDID 350

 Score = 39 (18.8 bits), Expect = 1.9e-59, Sum P(4) = 1.9e-59
 Identities = 7/15 (46%), Positives = 9/15 (60%)

Query:   536 WLDNIIWEPVDAVGK 550
             W D+IIW+  D   K
Sbjct:   431 WEDDIIWDGEDVKHK 445


>UNIPROTKB|I3LP77
            symbol:I3LP77 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=IEA] [GO:0006352 "DNA-dependent transcription, initiation"
            evidence=IEA] [GO:0005669 "transcription factor TFIID complex"
            evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            InterPro:IPR001487 InterPro:IPR011177 Pfam:PF00439
            PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50014 SMART:SM00297
            GO:GO:0006355 GO:GO:0003677 GO:GO:0006352 Gene3D:1.20.920.10
            SUPFAM:SSF47370 PROSITE:PS00633 InterPro:IPR018359 GO:GO:0005669
            InterPro:IPR022591 Pfam:PF12157 Gene3D:1.10.1100.10
            InterPro:IPR009067 Pfam:PF09247 SUPFAM:SSF47055
            GeneTree:ENSGT00390000012659 OMA:DEFYYPK Ensembl:ENSSSCT00000032456
            Uniprot:I3LP77
        Length = 1549

 Score = 448 (162.8 bits), Expect = 4.5e-63, Sum P(5) = 4.5e-63
 Identities = 122/362 (33%), Positives = 181/362 (50%)

Query:   621 PNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAF--KKKSDLSVKDGHVFLMEYCEERPL 678
             P+S+  L++   H+  +A+    E ++      F  +   DL+ KDG + L EY EE   
Sbjct:   597 PHSVQPLLK---HIKKKAKMREQERQASGGGEMFFMRTPQDLTGKDGDLILAEYSEENGP 653

Query:   679 LLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPFLGDIKAGCSQS 738
             L+   GM   +  YY K  PG   GA  C      G  +       SPFLG +  G    
Sbjct:   654 LMMQVGMATKIKNYY-KRKPGKDPGAPDCK----YGETVYCHT---SPFLGSLHPGQLLQ 705

Query:   739 SLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVAVVAQQEPLMEVMSPGSKNLQ 798
             + E N++RAP++ HK+  TDFL++R+ +G   IR +  + VV QQ PL EV  P SK   
Sbjct:   706 AFENNLFRAPIYLHKMPETDFLIIRTRQGYY-IRELVDIFVVGQQCPLFEVPGPNSKRAN 764

Query:   799 TYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIRKKLKECAFLRRDGNG 858
             T+  + + V +YR F  +  R     I ++++   FP+ SE+ IRK+LK CA  +R G  
Sbjct:   765 THIRDFLQVFIYRLFWKSKDRPRR--IRMEDIKKAFPSHSESSIRKRLKLCADFKRTGMD 822

Query:   859 KQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLPASISSAMSQ 918
                W +K  F +P+E ++R +  PE  C+Y SM A   RLK  G  + +  A        
Sbjct:   823 SNWWVLKSDFRLPTEEEIRAMVSPEQCCAYYSMIAAEQRLKDAGYGEKSFFA-------- 874

Query:   919 LPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSY 978
              P+E         I+ E++  PWN +  F+A           LE+TGV DP+G G GFSY
Sbjct:   875 -PEEENEEDFQMKIDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSY 929

Query:   979 VR 980
             V+
Sbjct:   930 VK 931

 Score = 235 (87.8 bits), Expect = 4.5e-63, Sum P(5) = 4.5e-63
 Identities = 65/183 (35%), Positives = 95/183 (51%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             I+ E++  PWN +  F+A           LE+TGV DP+G G GFSYV+           
Sbjct:   887 IDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVKIPNKPTQQKDD 942

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                         TVTGTDADLRRLS++ A+++L KF VPEE I K +RW  I ++R +S+
Sbjct:   943 KEPQPVKK----TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMST 998

Query:  1052 EQAASG---------------VQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDL 1096
             EQA SG                + Q++ + +CQ I+D Q + LS+ +     +DS  ++ 
Sbjct:   999 EQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLSSTEVLSTDTDSSSAED 1058

Query:  1097 DSF 1099
               F
Sbjct:  1059 SDF 1061

 Score = 91 (37.1 bits), Expect = 4.5e-63, Sum P(5) = 4.5e-63
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query:   521 FSKHSLQNRDMMEGSWLDNIIWEPVDAVGK----PKLILDLQDEQMLFEILDNKDD 572
             +S   + N +++ G W DNIIW+   A+ +    P L LD  DE ++ EI D K++
Sbjct:   429 YSIFPIDNEELVYGRWEDNIIWD-AQAMPRLLEPPVLTLDPNDENLILEIPDEKEE 483

 Score = 59 (25.8 bits), Expect = 4.5e-63, Sum P(5) = 4.5e-63
 Identities = 15/51 (29%), Positives = 24/51 (47%)

Query:   351 EHDEQRKDSCICSEPMKEESNVNPSVGWKSMPSPNFFPLDQHDWEENIVWD 401
             EH+   K  C   E  ++   +  S G   +   NF  + Q  WE++I+WD
Sbjct:   336 EHEPVTK--CRMIEDFRK---LEESSGTDLLADENFLMVTQLHWEDDIIWD 381

 Score = 56 (24.8 bits), Expect = 5.2e-58, Sum P(4) = 5.2e-58
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query:   382 PSPNFFPLDQHD-----WEENIVWD 401
             P  + FP+D  +     WE+NI+WD
Sbjct:   427 PWYSIFPIDNEELVYGRWEDNIIWD 451

 Score = 53 (23.7 bits), Expect = 4.5e-63, Sum P(5) = 4.5e-63
 Identities = 10/22 (45%), Positives = 15/22 (68%)

Query:   103 EQDYDKKAEDAVDYEDIDEQYE 124
             ++ + +  EDAVDY DI+E  E
Sbjct:    47 DEGWIRSTEDAVDYSDINEVAE 68

 Score = 39 (18.8 bits), Expect = 1.3e-61, Sum P(5) = 1.3e-61
 Identities = 13/37 (35%), Positives = 16/37 (43%)

Query:    38 NVDYAGDLDVDY-LDEDAKEHLAAVADKLGPSLTDID 73
             N DYA     +  L +D    +A V  K   S  DID
Sbjct:   257 NYDYAPPPPPEQCLSDDEITMMAPVESKFSQSTGDID 293

 Score = 39 (18.8 bits), Expect = 3.1e-56, Sum P(4) = 3.1e-56
 Identities = 7/15 (46%), Positives = 9/15 (60%)

Query:   536 WLDNIIWEPVDAVGK 550
             W D+IIW+  D   K
Sbjct:   374 WEDDIIWDGEDVKHK 388


>FB|FBgn0010355
            symbol:Taf1 "TBP-associated factor 1" species:7227
            "Drosophila melanogaster" [GO:0008134 "transcription factor
            binding" evidence=IPI] [GO:0005669 "transcription factor TFIID
            complex" evidence=ISS;NAS;IDA;IPI] [GO:0005634 "nucleus"
            evidence=NAS;IDA] [GO:0001075 "RNA polymerase II core promoter
            sequence-specific DNA binding transcription factor activity
            involved in preinitiation complex assembly" evidence=IDA]
            [GO:0001129 "TBP-class protein binding RNA polymerase II
            transcription factor activity involved in preinitiation complex
            assembly" evidence=IPI] [GO:0006367 "transcription initiation from
            RNA polymerase II promoter" evidence=ISS] [GO:0004672 "protein
            kinase activity" evidence=NAS] [GO:0003677 "DNA binding"
            evidence=NAS] [GO:0006461 "protein complex assembly" evidence=NAS]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=NAS] [GO:0051726 "regulation of cell cycle" evidence=NAS]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0016573 "histone
            acetylation" evidence=IDA] [GO:0035174 "histone serine kinase
            activity" evidence=IDA] [GO:0000117 "regulation of transcription
            involved in G2/M-phase of mitotic cell cycle" evidence=IMP]
            [GO:0043565 "sequence-specific DNA binding" evidence=IDA]
            [GO:0045944 "positive regulation of transcription from RNA
            polymerase II promoter" evidence=IDA] [GO:0046425 "regulation of
            JAK-STAT cascade" evidence=IMP] [GO:0022008 "neurogenesis"
            evidence=IMP] InterPro:IPR001487 InterPro:IPR001878
            InterPro:IPR011177 InterPro:IPR017956 Pfam:PF00439
            PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50011 PROSITE:PS50014
            SMART:SM00297 SMART:SM00343 SMART:SM00384 EMBL:AE014297
            GO:GO:0005524 GO:GO:0022008 GO:GO:0043565 GO:GO:0008270
            GO:GO:0045944 GO:GO:0016573 GO:GO:0035174 EMBL:AE001572
            Gene3D:1.20.920.10 SUPFAM:SSF47370 PROSITE:PS00633 eggNOG:COG5076
            InterPro:IPR018359 GO:GO:0000117 GO:GO:0046425 GO:GO:0005669
            GO:GO:0060261 GO:GO:0001129 KO:K03125 InterPro:IPR022591
            Pfam:PF12157 Gene3D:1.10.1100.10 InterPro:IPR009067 Pfam:PF09247
            SUPFAM:SSF47055 EMBL:S61883 EMBL:BT004888 PIR:A47371
            RefSeq:NP_476956.3 RefSeq:NP_996159.1 RefSeq:NP_996160.1
            UniGene:Dm.7380 PDB:1TBA PDBsum:1TBA DisProt:DP00081
            ProteinModelPortal:P51123 SMR:P51123 DIP:DIP-228N IntAct:P51123
            MINT:MINT-788336 STRING:P51123 PaxDb:P51123
            EnsemblMetazoa:FBtr0081685 GeneID:40813 KEGG:dme:Dmel_CG17603
            CTD:6872 FlyBase:FBgn0010355 GeneTree:ENSGT00390000012659
            InParanoid:P51123 OrthoDB:EOG4HHMH6 PhylomeDB:P51123
            EvolutionaryTrace:P51123 GenomeRNAi:40813 NextBio:820715
            Bgee:P51123 GermOnline:CG17603 Uniprot:P51123
        Length = 2129

 Score = 573 (206.8 bits), Expect = 2.7e-55, Sum P(6) = 2.7e-55
 Identities = 156/463 (33%), Positives = 241/463 (52%)

Query:   655 KKKSDLSVKDGHVFLMEYCEERPLLLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLG 714
             +   DLS +DG + L E+CEE P L++  GM + +  YY++ +  D        G     
Sbjct:   711 RNPEDLSGRDGDIVLAEFCEEHPPLINQVGMCSKIKNYYKRKAEKDSGPQDFVYGEVAFA 770

Query:   715 NVLTLEPGDKSPFLGDIKAG-CSQSSLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRR 773
             +         SPFLG +  G C Q+ +E NMYRAP++ HK+A  DFL++R+ +    IR 
Sbjct:   771 HT--------SPFLGILHPGQCIQA-IENNMYRAPIYPHKMAHNDFLVIRT-RNNYWIRS 820

Query:   774 IDKVAVVAQQEPLMEVMSPGSKNLQTYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQ 833
             ++ +  V Q+ PL EV  P SK    ++ + + V +YR F  +  R     I +D++   
Sbjct:   821 VNSIYTVGQECPLYEVPGPNSKRANNFTRDFLQVFIYRLFWKS--RDNPRRIRMDDIKQA 878

Query:   834 FPNLSEAIIRKKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQA 893
             FP  SE+ IRK+LK+CA  +R G     W +K  F +PSE ++R +  PE  C+Y SM A
Sbjct:   879 FPAHSESSIRKRLKQCADFKRTGMDSNWWVIKPEFRLPSEEEIRAMVSPEQCCAYFSMIA 938

Query:   894 GLYRLKHLGITQLTLPASISSAMSQLPDEAIALAAASHIERELQITPWNLSSNFVACTNQ 953
                RLK  G  +  L A         P E     A   ++ E+++ PWN +  ++    Q
Sbjct:   939 AEQRLKDAGYGEKFLFA---------PQEDDDEEAQLKLDDEVKVAPWNTTRAYI----Q 985

Query:   954 DRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXXXXXXXXXNRGGSTVTGTDADLR 1013
                    L+++G  DP+G G GFSYVR                  ++   +VTGTDADLR
Sbjct:   986 AMRGKCLLQLSGPADPTGCGEGFSYVRVPNKPTQTKEEQE-----SQPKRSVTGTDADLR 1040

Query:  1014 RLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSSEQAASGVQ-LQQQTRG------ 1066
             RL ++ A+E+L +F VPEE I K +RW  I ++R LS+E+A +G + + + +RG      
Sbjct:  1041 RLPLQRAKELLRQFKVPEEEIKKLSRWEVIDVVRTLSTEKAKAGEEGMDKFSRGNRFSIA 1100

Query:  1067 --------KCQEIWDRQVQSLSAAD----DDEIGSDSEHSDLD 1097
                     +CQ I+D Q + L++++    D+   S SE SDL+
Sbjct:  1101 EHQERYKEECQRIFDLQNRVLASSEVLSTDEAESSASEESDLE 1143

 Score = 83 (34.3 bits), Expect = 2.7e-55, Sum P(6) = 2.7e-55
 Identities = 20/70 (28%), Positives = 37/70 (52%)

Query:   505 KENVSIKLCQSDAVKRFSKHSLQNRDMMEGSWLDNIIWEP--VDAVGKPKLI-LDLQDEQ 561
             ++N   K  ++     +S   ++N +++   W D +IW+   V  V KPK++ LD  DE 
Sbjct:   486 QQNAQAKPAEAPDDTWYSLFPVENEELIYYKWEDEVIWDAQQVSKVPKPKVLTLDPNDEN 545

Query:   562 MLFEILDNKD 571
             ++  I D+ D
Sbjct:   546 IILGIPDDID 555

 Score = 65 (27.9 bits), Expect = 2.7e-55, Sum P(6) = 2.7e-55
 Identities = 20/71 (28%), Positives = 31/71 (43%)

Query:   333 DDSELVKFGVEQDAATISEHDEQRKDSCICSEPMKEESNVNPSVGWKSMPSPNFFPLDQH 392
             D  E   +G +  AA+ S    Q KD      P  E+   +PS+         F  + Q 
Sbjct:   366 DSGEGFNYGFKTKAASTSSQP-QLKDERRVKSP--EDDVEDPSIA-----DDAFLMVSQL 417

Query:   393 DWEENIVWDNS 403
              WE+++VWD +
Sbjct:   418 HWEDDVVWDGN 428

 Score = 52 (23.4 bits), Expect = 9.6e-50, Sum P(5) = 9.6e-50
 Identities = 19/77 (24%), Positives = 34/77 (44%)

Query:   374 PSVGWKSM-PSPNFFPLDQHDWEENIVWDNSPAASDNSIESHEIAGADV-ESALMRGIEL 431
             P   W S+ P  N   L  + WE+ ++WD   A   + +   ++   D  +  ++ GI  
Sbjct:   497 PDDTWYSLFPVENE-ELIYYKWEDEVIWD---AQQVSKVPKPKVLTLDPNDENIILGIPD 552

Query:   432 DTGQNNFHERSTSPNEK 448
             D   +  + +ST P  K
Sbjct:   553 DIDPSKIN-KSTGPPPK 568

 Score = 47 (21.6 bits), Expect = 2.7e-55, Sum P(6) = 2.7e-55
 Identities = 9/21 (42%), Positives = 12/21 (57%)

Query:    25 VSGGNRLLGFMFGNVDYAGDL 45
             +  G  L G +FGN+D  G L
Sbjct:    14 IGNGLDLTGILFGNIDSEGRL 34

 Score = 46 (21.3 bits), Expect = 2.7e-55, Sum P(6) = 2.7e-55
 Identities = 9/17 (52%), Positives = 11/17 (64%)

Query:   108 KKAEDAVDYEDIDEQYE 124
             K  +DA+DY DI E  E
Sbjct:   118 KAQDDAIDYSDITELSE 134

 Score = 40 (19.1 bits), Expect = 1.4e-49, Sum P(2) = 1.4e-49
 Identities = 23/89 (25%), Positives = 39/89 (43%)

Query:   487 LRLESQLDLDNHSHANGIKENVSIKLCQSDAVKRFSKHSLQNRDMMEGSWLDNIIWEPVD 546
             + +ES    D  S  NG+ +   I     D+  R  +    + +   G+  D  + E + 
Sbjct:     1 MEMESDNSDDEGSIGNGL-DLTGILFGNIDSEGRLLQDD--DGEGRGGTGFDAELRENIG 57

Query:   547 AVGKPKLILDLQDEQMLFEILDNKDDEHP 575
             ++ K  L LD     ML E++D K+ E P
Sbjct:    58 SLSK--LGLD----SMLLEVIDLKEAEPP 80

 Score = 39 (18.8 bits), Expect = 2.7e-55, Sum P(6) = 2.7e-55
 Identities = 7/13 (53%), Positives = 10/13 (76%)

Query:   260 ILRFSEIFGIHEP 272
             +LRFS +FG  +P
Sbjct:   243 VLRFSRLFGPGKP 255

 Score = 37 (18.1 bits), Expect = 3.3e-48, Sum P(5) = 3.3e-48
 Identities = 5/15 (33%), Positives = 9/15 (60%)

Query:   536 WLDNIIWEPVDAVGK 550
             W D+++W+  D   K
Sbjct:   419 WEDDVVWDGNDIKAK 433


>UNIPROTKB|F1P5U7
            symbol:TAF1 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0003677 "DNA binding" evidence=IEA] [GO:0005669
            "transcription factor TFIID complex" evidence=IEA] [GO:0006352
            "DNA-dependent transcription, initiation" evidence=IEA] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=IEA]
            InterPro:IPR001487 InterPro:IPR011177 Pfam:PF00439
            PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50014 SMART:SM00297
            GO:GO:0006355 GO:GO:0003677 GO:GO:0006352 Gene3D:1.20.920.10
            SUPFAM:SSF47370 PROSITE:PS00633 InterPro:IPR018359 GO:GO:0005669
            InterPro:IPR022591 Pfam:PF12157 Gene3D:1.10.1100.10
            InterPro:IPR009067 Pfam:PF09247 SUPFAM:SSF47055
            GeneTree:ENSGT00390000012659 EMBL:AADN02013226 IPI:IPI00580748
            Ensembl:ENSGALT00000008777 ArrayExpress:F1P5U7 Uniprot:F1P5U7
        Length = 1816

 Score = 441 (160.3 bits), Expect = 6.7e-46, Sum P(5) = 6.7e-46
 Identities = 113/322 (35%), Positives = 164/322 (50%)

Query:   659 DLSVKDGHVFLMEYCEERPLLLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLT 718
             DL+ KDG + L EY EE   L+   GM   +  YY K  PG   GA  C      G  + 
Sbjct:   680 DLTGKDGDLILAEYSEENAPLMMQVGMATKIKNYY-KRKPGKDPGAPDCK----YGETVY 734

Query:   719 LEPGDKSPFLGDIKAGCSQSSLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVA 778
                   SPFLG +  G    + E N++RAP++ HK+  TDFL++R+ +G   +R +  + 
Sbjct:   735 CHT---SPFLGSLHPGQLLQAFENNLFRAPIYLHKMPETDFLIIRTRQGYY-VRELVDIF 790

Query:   779 VVAQQEPLMEVMSPGSKNLQTYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLS 838
             VV Q+ PL EV  P SK   T+  + + V +YR F  +  R     I ++++   FP+ S
Sbjct:   791 VVGQECPLYEVPGPNSKRANTHIRDFLQVFIYRLFWKSRDRPRR--IRMEDIKKAFPSHS 848

Query:   839 EAIIRKKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRL 898
             E+ IRK+LK CA  +R G     W +K  F +P+E ++R +  PE  C+Y SM A   RL
Sbjct:   849 ESSIRKRLKLCADFKRTGMDSNWWVLKPDFRLPTEEEIRAMVSPEQCCAYYSMIAAEQRL 908

Query:   899 KHLGITQLTLPASISSAMSQLPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENI 958
             K  G  + +  A         P+E         I+ E++  PWN +  F+A         
Sbjct:   909 KDAGYGEKSFFA---------PEEENEEDFQMKIDDEVRTAPWNTTRAFIAAMKGKC--- 956

Query:   959 ERLEITGVGDPSGRGLGFSYVR 980
               LE+TGV DP+G G GFSYV+
Sbjct:   957 -LLEVTGVADPTGCGEGFSYVK 977

 Score = 231 (86.4 bits), Expect = 5.1e-24, Sum P(4) = 5.1e-24
 Identities = 64/183 (34%), Positives = 94/183 (51%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             I+ E++  PWN +  F+A           LE+TGV DP+G G GFSYV+           
Sbjct:   933 IDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVKIPNKPTQQKDD 988

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                         TVTGTDADLRRLS++ A+++L KF VPEE I K +RW  I ++R +S+
Sbjct:   989 KEPQPVKK----TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMST 1044

Query:  1052 EQAASG---------------VQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDL 1096
             EQA SG                + Q++ + +CQ I+D Q + L + +     +DS  ++ 
Sbjct:  1045 EQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLESTEILSTDTDSSSAED 1104

Query:  1097 DSF 1099
               F
Sbjct:  1105 SDF 1107

 Score = 110 (43.8 bits), Expect = 6.7e-46, Sum P(5) = 6.7e-46
 Identities = 36/99 (36%), Positives = 51/99 (51%)

Query:    31 LLGFMFGNVDYAGDLDVD-YLDEDAKEHLAAV-ADKLGPSLTDIDVWNSQSCFQLFCSYC 88
             L GF+FGN++ AG L+ D  LD+++K+HLA + A  LG  +T+I     +S         
Sbjct:    29 LAGFLFGNINEAGQLEGDSVLDKESKKHLAGLGALGLGNLITEITASEDES--------- 79

Query:    89 LLSVNSPQPPVDAV---EQDYDKKAEDAVDYEDIDEQYE 124
                     P  D     E+ + K  EDAVDY DI+E  E
Sbjct:    80 --------PEADGAHLDEEGWVKSTEDAVDYSDINEVAE 110

 Score = 90 (36.7 bits), Expect = 6.7e-46, Sum P(5) = 6.7e-46
 Identities = 19/55 (34%), Positives = 32/55 (58%)

Query:   521 FSKHSLQNRDMMEGSWLDNIIWEP--VDA-VGKPKLILDLQDEQMLFEILDNKDD 572
             +S   + N +++ G W DNIIW+   ++  +  P L LD  DE ++ EI D K++
Sbjct:   475 YSIFPIDNEELVYGRWEDNIIWDDQAMETYLDPPVLTLDPNDENIILEIPDEKEE 529

 Score = 57 (25.1 bits), Expect = 2.1e-42, Sum P(4) = 2.1e-42
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query:   382 PSPNFFPLDQHD-----WEENIVWDN 402
             P  + FP+D  +     WE+NI+WD+
Sbjct:   473 PWYSIFPIDNEELVYGRWEDNIIWDD 498

 Score = 48 (22.0 bits), Expect = 2.6e-05, Sum P(4) = 2.6e-05
 Identities = 13/34 (38%), Positives = 17/34 (50%)

Query:  1220 TANEQIKDPKEEESLIAKRNLSGKVQAMKKNNVQ 1253
             T N   K+ K+E SL   R L GK   +K+   Q
Sbjct:   531 TLNSPSKENKKESSLKKSRILLGKTGVIKEEPQQ 564

 Score = 46 (21.3 bits), Expect = 6.7e-46, Sum P(5) = 6.7e-46
 Identities = 11/55 (20%), Positives = 24/55 (43%)

Query:   385 NFFPLDQHDWEENIVWDNSPAASDNS-IESHEIAGADVESALMRGIELDTGQNNF 438
             +F  + Q  WE++++W+        +  +   +AG  + S++ R       Q  F
Sbjct:   411 HFLMVTQLQWEDDVIWNGEDVKHKGTKTQRASLAGW-LPSSMTRNATAYNAQQEF 464

 Score = 38 (18.4 bits), Expect = 6.7e-46, Sum P(5) = 6.7e-46
 Identities = 7/14 (50%), Positives = 10/14 (71%)

Query:  1079 LSAADDDEIGSDSE 1092
             +S  +DD+ GSD E
Sbjct:  1758 MSDGEDDDDGSDEE 1771


>UNIPROTKB|F1P5U8
            symbol:TAF1 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005669 "transcription factor TFIID complex"
            evidence=IEA] [GO:0006352 "DNA-dependent transcription, initiation"
            evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0045120 "pronucleus" evidence=IEA]
            InterPro:IPR001487 InterPro:IPR011177 Pfam:PF00439
            PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50014 SMART:SM00297
            GO:GO:0006355 GO:GO:0003677 GO:GO:0006352 Gene3D:1.20.920.10
            SUPFAM:SSF47370 PROSITE:PS00633 InterPro:IPR018359 GO:GO:0045120
            GO:GO:0005669 InterPro:IPR022591 Pfam:PF12157 Gene3D:1.10.1100.10
            InterPro:IPR009067 Pfam:PF09247 SUPFAM:SSF47055
            GeneTree:ENSGT00390000012659 OMA:DEFYYPK EMBL:AADN02013226
            IPI:IPI00811057 Ensembl:ENSGALT00000008776 ArrayExpress:F1P5U8
            Uniprot:F1P5U8
        Length = 1851

 Score = 441 (160.3 bits), Expect = 7.7e-46, Sum P(5) = 7.7e-46
 Identities = 113/322 (35%), Positives = 164/322 (50%)

Query:   659 DLSVKDGHVFLMEYCEERPLLLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLT 718
             DL+ KDG + L EY EE   L+   GM   +  YY K  PG   GA  C      G  + 
Sbjct:   656 DLTGKDGDLILAEYSEENAPLMMQVGMATKIKNYY-KRKPGKDPGAPDCK----YGETVY 710

Query:   719 LEPGDKSPFLGDIKAGCSQSSLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVA 778
                   SPFLG +  G    + E N++RAP++ HK+  TDFL++R+ +G   +R +  + 
Sbjct:   711 CHT---SPFLGSLHPGQLLQAFENNLFRAPIYLHKMPETDFLIIRTRQGYY-VRELVDIF 766

Query:   779 VVAQQEPLMEVMSPGSKNLQTYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLS 838
             VV Q+ PL EV  P SK   T+  + + V +YR F  +  R     I ++++   FP+ S
Sbjct:   767 VVGQECPLYEVPGPNSKRANTHIRDFLQVFIYRLFWKSRDRPRR--IRMEDIKKAFPSHS 824

Query:   839 EAIIRKKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRL 898
             E+ IRK+LK CA  +R G     W +K  F +P+E ++R +  PE  C+Y SM A   RL
Sbjct:   825 ESSIRKRLKLCADFKRTGMDSNWWVLKPDFRLPTEEEIRAMVSPEQCCAYYSMIAAEQRL 884

Query:   899 KHLGITQLTLPASISSAMSQLPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENI 958
             K  G  + +  A         P+E         I+ E++  PWN +  F+A         
Sbjct:   885 KDAGYGEKSFFA---------PEEENEEDFQMKIDDEVRTAPWNTTRAFIAAMKGKC--- 932

Query:   959 ERLEITGVGDPSGRGLGFSYVR 980
               LE+TGV DP+G G GFSYV+
Sbjct:   933 -LLEVTGVADPTGCGEGFSYVK 953

 Score = 231 (86.4 bits), Expect = 5.6e-24, Sum P(4) = 5.6e-24
 Identities = 64/183 (34%), Positives = 94/183 (51%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             I+ E++  PWN +  F+A           LE+TGV DP+G G GFSYV+           
Sbjct:   909 IDDEVRTAPWNTTRAFIAAMKGKC----LLEVTGVADPTGCGEGFSYVKIPNKPTQQKDD 964

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                         TVTGTDADLRRLS++ A+++L KF VPEE I K +RW  I ++R +S+
Sbjct:   965 KEPQPVKK----TVTGTDADLRRLSLKNAKQLLRKFGVPEEEIKKLSRWEVIDVVRTMST 1020

Query:  1052 EQAASG---------------VQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDL 1096
             EQA SG                + Q++ + +CQ I+D Q + L + +     +DS  ++ 
Sbjct:  1021 EQARSGEGPMSKFARGSRFSVAEHQERYKEECQRIFDLQNKVLESTEILSTDTDSSSAED 1080

Query:  1097 DSF 1099
               F
Sbjct:  1081 SDF 1083

 Score = 110 (43.8 bits), Expect = 7.7e-46, Sum P(5) = 7.7e-46
 Identities = 36/99 (36%), Positives = 51/99 (51%)

Query:    31 LLGFMFGNVDYAGDLDVD-YLDEDAKEHLAAV-ADKLGPSLTDIDVWNSQSCFQLFCSYC 88
             L GF+FGN++ AG L+ D  LD+++K+HLA + A  LG  +T+I     +S         
Sbjct:    24 LAGFLFGNINEAGQLEGDSVLDKESKKHLAGLGALGLGNLITEITASEDES--------- 74

Query:    89 LLSVNSPQPPVDAV---EQDYDKKAEDAVDYEDIDEQYE 124
                     P  D     E+ + K  EDAVDY DI+E  E
Sbjct:    75 --------PEADGAHLDEEGWVKSTEDAVDYSDINEVAE 105

 Score = 90 (36.7 bits), Expect = 7.7e-46, Sum P(5) = 7.7e-46
 Identities = 19/55 (34%), Positives = 32/55 (58%)

Query:   521 FSKHSLQNRDMMEGSWLDNIIWEP--VDA-VGKPKLILDLQDEQMLFEILDNKDD 572
             +S   + N +++ G W DNIIW+   ++  +  P L LD  DE ++ EI D K++
Sbjct:   451 YSIFPIDNEELVYGRWEDNIIWDDQAMETYLDPPVLTLDPNDENIILEIPDEKEE 505

 Score = 57 (25.1 bits), Expect = 2.4e-42, Sum P(4) = 2.4e-42
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query:   382 PSPNFFPLDQHD-----WEENIVWDN 402
             P  + FP+D  +     WE+NI+WD+
Sbjct:   449 PWYSIFPIDNEELVYGRWEDNIIWDD 474

 Score = 48 (22.0 bits), Expect = 2.8e-05, Sum P(4) = 2.8e-05
 Identities = 13/34 (38%), Positives = 17/34 (50%)

Query:  1220 TANEQIKDPKEEESLIAKRNLSGKVQAMKKNNVQ 1253
             T N   K+ K+E SL   R L GK   +K+   Q
Sbjct:   507 TLNSPSKENKKESSLKKSRILLGKTGVIKEEPQQ 540

 Score = 46 (21.3 bits), Expect = 7.7e-46, Sum P(5) = 7.7e-46
 Identities = 11/55 (20%), Positives = 24/55 (43%)

Query:   385 NFFPLDQHDWEENIVWDNSPAASDNS-IESHEIAGADVESALMRGIELDTGQNNF 438
             +F  + Q  WE++++W+        +  +   +AG  + S++ R       Q  F
Sbjct:   387 HFLMVTQLQWEDDVIWNGEDVKHKGTKTQRASLAGW-LPSSMTRNATAYNAQQEF 440

 Score = 38 (18.4 bits), Expect = 7.7e-46, Sum P(5) = 7.7e-46
 Identities = 7/14 (50%), Positives = 10/14 (71%)

Query:  1079 LSAADDDEIGSDSE 1092
             +S  +DD+ GSD E
Sbjct:  1701 MSDGEDDDDGSDEE 1714


>WB|WBGene00006382 [details] [associations]
            symbol:taf-1 species:6239 "Caenorhabditis elegans"
            [GO:0005669 "transcription factor TFIID complex" evidence=IEA;ISS]
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0006352 "DNA-dependent
            transcription, initiation" evidence=IEA] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=IEA] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            [GO:0010171 "body morphogenesis" evidence=IMP] [GO:0009790 "embryo
            development" evidence=IMP] [GO:0001703 "gastrulation with mouth
            forming first" evidence=IMP] [GO:0040002 "collagen and
            cuticulin-based cuticle development" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0040035 "hermaphrodite genitalia
            development" evidence=IMP] [GO:0040010 "positive regulation of
            growth rate" evidence=IMP] [GO:0040007 "growth" evidence=IMP]
            [GO:0002119 "nematode larval development" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] [GO:0000003
            "reproduction" evidence=IMP] [GO:0045944 "positive regulation of
            transcription from RNA polymerase II promoter" evidence=IGI;IMP]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0001071 "nucleic acid
            binding transcription factor activity" evidence=IMP] [GO:0004402
            "histone acetyltransferase activity" evidence=ISS] [GO:0004674
            "protein serine/threonine kinase activity" evidence=ISS]
            [GO:0003690 "double-stranded DNA binding" evidence=ISS] [GO:0004842
            "ubiquitin-protein ligase activity" evidence=ISS]
            InterPro:IPR001487 InterPro:IPR001878 InterPro:IPR011177
            Pfam:PF00439 PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50014
            SMART:SM00297 SMART:SM00343 GO:GO:0009792 GO:GO:0006898
            GO:GO:0040007 GO:GO:0040010 GO:GO:0002119 GO:GO:0004674
            GO:GO:0008270 GO:GO:0045944 GO:GO:0010171 GO:GO:0040011
            GO:GO:0006352 GO:GO:0040035 GO:GO:0003690 GO:GO:0004842
            GO:GO:0040002 Gene3D:1.20.920.10 SUPFAM:SSF47370 PROSITE:PS00633
            InterPro:IPR018359 GO:GO:0004402 GO:GO:0001071 GO:GO:0001703
            GO:GO:0005669 EMBL:AL132902 KO:K03125 InterPro:IPR022591
            Pfam:PF12157 GeneTree:ENSGT00390000012659 OMA:DEFYYPK EMBL:Z82069
            RefSeq:NP_493426.2 ProteinModelPortal:G5EGM3 SMR:G5EGM3
            IntAct:G5EGM3 EnsemblMetazoa:W04A8.7 GeneID:173257
            KEGG:cel:CELE_W04A8.7 CTD:173257 WormBase:W04A8.7 NextBio:878901
            Uniprot:G5EGM3
        Length = 1744

 Score = 351 (128.6 bits), Expect = 2.2e-44, Sum P(6) = 2.2e-44
 Identities = 95/291 (32%), Positives = 146/291 (50%)

Query:   659 DLSVKDGHVFLMEYCEERPLLLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLT 718
             DLS KD  + ++EY EE P++LS  GM + +  Y+++    D             G    
Sbjct:   594 DLSGKDETLVMIEYSEEHPVILSQPGMASKMKNYFKRRQANDSEPTFT------FGE--- 644

Query:   719 LEPGDKSPFLGDIKAGCSQSSLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVA 778
             L    + PFLG ++ G S  S+E  +YRAP++ HK   TDFLL+RS   +  IR +  + 
Sbjct:   645 LAFSHQIPFLGQLQPGQSLQSIENMLYRAPIYLHKRQNTDFLLIRSMN-QWYIRPLPSIF 703

Query:   779 VVAQQEPLMEVMSPGSKNLQTYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLS 838
             V  QQ PL EV SP SK    +  + +   +YR F A+        + +D++   FP+ +
Sbjct:   704 VAGQQCPLYEVPSPNSKRATVFVRDFLFAFIYRLFWASDSSPRR--LKMDDVRNAFPHYA 761

Query:   839 EAIIRKKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRL 898
             E+ IRK+LK C+   R G+ +  WS+K  F +PS+ ++  +  PE  C+  SM A   RL
Sbjct:   762 ESNIRKRLKMCSTFVRQGS-ETYWSLKPDFRLPSKEEVLSMVTPEMCCAQYSMMAAEQRL 820

Query:   899 KHLGITQLTLPASISSAMSQLPDEAIALAAASHIERELQITPWNLSSNFVA 949
             K  G  +       +   S+  DE         IE E++  PWN +  F+A
Sbjct:   821 KDAGYGEKYFFTPENDEGSE--DEVT-------IEDEIKCAPWNTTRAFLA 862

 Score = 177 (67.4 bits), Expect = 2.2e-44, Sum P(6) = 2.2e-44
 Identities = 46/123 (37%), Positives = 65/123 (52%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             IE E++  PWN +  F+A     RE    L+ TG+ DP+G G GFSYVR           
Sbjct:   845 IEDEIKCAPWNTTRAFLA---SQREKC-LLDQTGIADPTGCGQGFSYVRVSQKPHKDENA 900

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                          VTGT+ADLR+L ++ A+++   + V EE I+  TRW  I +IR LS+
Sbjct:   901 TPVPK------KLVTGTNADLRKLPLKEAKQICRGYGVKEEEISALTRWEIIDVIRTLST 954

Query:  1052 EQA 1054
             + A
Sbjct:   955 QAA 957

 Score = 78 (32.5 bits), Expect = 2.2e-44, Sum P(6) = 2.2e-44
 Identities = 17/43 (39%), Positives = 27/43 (62%)

Query:  1056 SGVQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDLDS 1098
             S   +Q++ R  CQ I+D+Q Q+L  A+ D I +D + +D DS
Sbjct:   982 SSADMQEKYRKHCQRIFDQQNQTL--ANTDPISTDDDSTDADS 1022

 Score = 58 (25.5 bits), Expect = 2.2e-44, Sum P(6) = 2.2e-44
 Identities = 16/48 (33%), Positives = 27/48 (56%)

Query:   536 WLDNIIWEPVDAVG---KP-KLILDLQDEQMLFEIL-DNKDDEHPLLH 578
             W  NIIW+  DA+    +P   ++D QD+ +++ +  D + DE P  H
Sbjct:   395 WETNIIWD-ADAMPSTLEPIDFLVDFQDDPLIYGMPEDRRHDEGPDHH 441

 Score = 56 (24.8 bits), Expect = 3.6e-44, Sum P(6) = 3.6e-44
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query:   369 ESNVNPSVGWKSMPSPNFFPLDQHDWEENIVWD 401
             +S + P+ G    PS     LD   WE NI+WD
Sbjct:   371 DSAILPTPGHSIFPSAPC-DLDILPWETNIIWD 402

 Score = 47 (21.6 bits), Expect = 2.2e-44, Sum P(6) = 2.2e-44
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query:  1224 QIKDPKEEESLIAKRNLSGKVQAMKKN 1250
             Q+ +  +EE    KR L  +++ MKKN
Sbjct:  1163 QMDEQYKEEKRKKKRRLQDQIRRMKKN 1189

 Score = 47 (21.6 bits), Expect = 2.2e-44, Sum P(6) = 2.2e-44
 Identities = 15/48 (31%), Positives = 23/48 (47%)

Query:  1179 QVEGGLSLAKSISGLEIVERLKKANKPAKHIAITVQPNGSHTANEQIK 1226
             QVE G    K   G    E  K A++  + +A++       TAN+Q+K
Sbjct:  1074 QVEKG---EKKEEGEVTAEEKKSASQFGEDVAMSASKISGITANQQLK 1118

 Score = 45 (20.9 bits), Expect = 1.8e-40, Sum P(4) = 1.8e-40
 Identities = 18/81 (22%), Positives = 29/81 (35%)

Query:  1051 SEQAASGVQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDLDSFAGDXXXXXXXX 1110
             ++ AA+   L   +     E+ D   +     DDDEI  D    D   ++ D        
Sbjct:  1651 NDAAAAQRGLAMDSDDHMDEMEDHPTEEEEEDDDDEIMDDDMDIDATGYSYDHDDNVAVG 1710

Query:  1111 XXXXXXSNYDTKHD-KVEGVK 1130
                   +  D+  D + E VK
Sbjct:  1711 QIFNDLAMSDSDEDERAEDVK 1731

 Score = 39 (18.8 bits), Expect = 1.9e-42, Sum P(6) = 1.9e-42
 Identities = 8/37 (21%), Positives = 16/37 (43%)

Query:   117 EDIDEQYEGPEIQIASEEDYLLPKKEYFAAEVSLAAL 153
             ++ DEQ E    Q    + + L   +Y+  + +   L
Sbjct:   470 QEEDEQMESTMAQFTDNDPFNLSNDDYYVPKATSKTL 506


>UNIPROTKB|G5EGM3 [details] [associations]
            symbol:taf-1 "Protein TAF-1" species:6239 "Caenorhabditis
            elegans" [GO:0016573 "histone acetylation" evidence=ISS]
            [GO:0016567 "protein ubiquitination" evidence=ISS]
            InterPro:IPR001487 InterPro:IPR001878 InterPro:IPR011177
            Pfam:PF00439 PIRSF:PIRSF003047 PRINTS:PR00503 PROSITE:PS50014
            SMART:SM00297 SMART:SM00343 GO:GO:0009792 GO:GO:0006898
            GO:GO:0040007 GO:GO:0040010 GO:GO:0002119 GO:GO:0004674
            GO:GO:0008270 GO:GO:0045944 GO:GO:0010171 GO:GO:0040011
            GO:GO:0006352 GO:GO:0040035 GO:GO:0003690 GO:GO:0004842
            GO:GO:0040002 Gene3D:1.20.920.10 SUPFAM:SSF47370 PROSITE:PS00633
            InterPro:IPR018359 GO:GO:0004402 GO:GO:0001071 GO:GO:0001703
            GO:GO:0005669 EMBL:AL132902 KO:K03125 InterPro:IPR022591
            Pfam:PF12157 GeneTree:ENSGT00390000012659 OMA:DEFYYPK EMBL:Z82069
            RefSeq:NP_493426.2 ProteinModelPortal:G5EGM3 SMR:G5EGM3
            IntAct:G5EGM3 EnsemblMetazoa:W04A8.7 GeneID:173257
            KEGG:cel:CELE_W04A8.7 CTD:173257 WormBase:W04A8.7 NextBio:878901
            Uniprot:G5EGM3
        Length = 1744

 Score = 351 (128.6 bits), Expect = 2.2e-44, Sum P(6) = 2.2e-44
 Identities = 95/291 (32%), Positives = 146/291 (50%)

Query:   659 DLSVKDGHVFLMEYCEERPLLLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLT 718
             DLS KD  + ++EY EE P++LS  GM + +  Y+++    D             G    
Sbjct:   594 DLSGKDETLVMIEYSEEHPVILSQPGMASKMKNYFKRRQANDSEPTFT------FGE--- 644

Query:   719 LEPGDKSPFLGDIKAGCSQSSLETNMYRAPVFTHKVATTDFLLVRSAKGKISIRRIDKVA 778
             L    + PFLG ++ G S  S+E  +YRAP++ HK   TDFLL+RS   +  IR +  + 
Sbjct:   645 LAFSHQIPFLGQLQPGQSLQSIENMLYRAPIYLHKRQNTDFLLIRSMN-QWYIRPLPSIF 703

Query:   779 VVAQQEPLMEVMSPGSKNLQTYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLS 838
             V  QQ PL EV SP SK    +  + +   +YR F A+        + +D++   FP+ +
Sbjct:   704 VAGQQCPLYEVPSPNSKRATVFVRDFLFAFIYRLFWASDSSPRR--LKMDDVRNAFPHYA 761

Query:   839 EAIIRKKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRL 898
             E+ IRK+LK C+   R G+ +  WS+K  F +PS+ ++  +  PE  C+  SM A   RL
Sbjct:   762 ESNIRKRLKMCSTFVRQGS-ETYWSLKPDFRLPSKEEVLSMVTPEMCCAQYSMMAAEQRL 820

Query:   899 KHLGITQLTLPASISSAMSQLPDEAIALAAASHIERELQITPWNLSSNFVA 949
             K  G  +       +   S+  DE         IE E++  PWN +  F+A
Sbjct:   821 KDAGYGEKYFFTPENDEGSE--DEVT-------IEDEIKCAPWNTTRAFLA 862

 Score = 177 (67.4 bits), Expect = 2.2e-44, Sum P(6) = 2.2e-44
 Identities = 46/123 (37%), Positives = 65/123 (52%)

Query:   932 IERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVRXXXXXXXXXXX 991
             IE E++  PWN +  F+A     RE    L+ TG+ DP+G G GFSYVR           
Sbjct:   845 IEDEIKCAPWNTTRAFLA---SQREKC-LLDQTGIADPTGCGQGFSYVRVSQKPHKDENA 900

Query:   992 XXXXXXXNRGGSTVTGTDADLRRLSMEAAREVLLKFNVPEEMIAKQTRWHRIAMIRKLSS 1051
                          VTGT+ADLR+L ++ A+++   + V EE I+  TRW  I +IR LS+
Sbjct:   901 TPVPK------KLVTGTNADLRKLPLKEAKQICRGYGVKEEEISALTRWEIIDVIRTLST 954

Query:  1052 EQA 1054
             + A
Sbjct:   955 QAA 957

 Score = 78 (32.5 bits), Expect = 2.2e-44, Sum P(6) = 2.2e-44
 Identities = 17/43 (39%), Positives = 27/43 (62%)

Query:  1056 SGVQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDLDS 1098
             S   +Q++ R  CQ I+D+Q Q+L  A+ D I +D + +D DS
Sbjct:   982 SSADMQEKYRKHCQRIFDQQNQTL--ANTDPISTDDDSTDADS 1022

 Score = 58 (25.5 bits), Expect = 2.2e-44, Sum P(6) = 2.2e-44
 Identities = 16/48 (33%), Positives = 27/48 (56%)

Query:   536 WLDNIIWEPVDAVG---KP-KLILDLQDEQMLFEIL-DNKDDEHPLLH 578
             W  NIIW+  DA+    +P   ++D QD+ +++ +  D + DE P  H
Sbjct:   395 WETNIIWD-ADAMPSTLEPIDFLVDFQDDPLIYGMPEDRRHDEGPDHH 441

 Score = 56 (24.8 bits), Expect = 3.6e-44, Sum P(6) = 3.6e-44
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query:   369 ESNVNPSVGWKSMPSPNFFPLDQHDWEENIVWD 401
             +S + P+ G    PS     LD   WE NI+WD
Sbjct:   371 DSAILPTPGHSIFPSAPC-DLDILPWETNIIWD 402

 Score = 47 (21.6 bits), Expect = 2.2e-44, Sum P(6) = 2.2e-44
 Identities = 10/27 (37%), Positives = 16/27 (59%)

Query:  1224 QIKDPKEEESLIAKRNLSGKVQAMKKN 1250
             Q+ +  +EE    KR L  +++ MKKN
Sbjct:  1163 QMDEQYKEEKRKKKRRLQDQIRRMKKN 1189

 Score = 47 (21.6 bits), Expect = 2.2e-44, Sum P(6) = 2.2e-44
 Identities = 15/48 (31%), Positives = 23/48 (47%)

Query:  1179 QVEGGLSLAKSISGLEIVERLKKANKPAKHIAITVQPNGSHTANEQIK 1226
             QVE G    K   G    E  K A++  + +A++       TAN+Q+K
Sbjct:  1074 QVEKG---EKKEEGEVTAEEKKSASQFGEDVAMSASKISGITANQQLK 1118

 Score = 45 (20.9 bits), Expect = 1.8e-40, Sum P(4) = 1.8e-40
 Identities = 18/81 (22%), Positives = 29/81 (35%)

Query:  1051 SEQAASGVQLQQQTRGKCQEIWDRQVQSLSAADDDEIGSDSEHSDLDSFAGDXXXXXXXX 1110
             ++ AA+   L   +     E+ D   +     DDDEI  D    D   ++ D        
Sbjct:  1651 NDAAAAQRGLAMDSDDHMDEMEDHPTEEEEEDDDDEIMDDDMDIDATGYSYDHDDNVAVG 1710

Query:  1111 XXXXXXSNYDTKHD-KVEGVK 1130
                   +  D+  D + E VK
Sbjct:  1711 QIFNDLAMSDSDEDERAEDVK 1731

 Score = 39 (18.8 bits), Expect = 1.9e-42, Sum P(6) = 1.9e-42
 Identities = 8/37 (21%), Positives = 16/37 (43%)

Query:   117 EDIDEQYEGPEIQIASEEDYLLPKKEYFAAEVSLAAL 153
             ++ DEQ E    Q    + + L   +Y+  + +   L
Sbjct:   470 QEEDEQMESTMAQFTDNDPFNLSNDDYYVPKATSKTL 506


>POMBASE|SPAC2G11.14 [details] [associations]
            symbol:taf111 "transcription factor TFIID complex
            subunit Taf111" species:4896 "Schizosaccharomyces pombe"
            [GO:0000991 "core RNA polymerase II binding transcription factor
            activity" evidence=ISO] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005669 "transcription factor TFIID complex" evidence=ISO]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=IEA] [GO:0006367 "transcription initiation from RNA
            polymerase II promoter" evidence=ISO] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR001878 SMART:SM00343
            PomBase:SPAC2G11.14 EMBL:CU329670 GO:GO:0006355 GO:GO:0008270
            GO:GO:0003676 GO:GO:0006367 GO:GO:0000991 GO:GO:0005669 PIR:T38580
            RefSeq:NP_593094.1 IntAct:Q09813 STRING:Q09813
            EnsemblFungi:SPAC2G11.14.1 GeneID:2541848 KEGG:spo:SPAC2G11.14
            eggNOG:COG5179 HOGENOM:HOG000216192 KO:K03125 OMA:NKMIRAP
            OrthoDB:EOG4ZW8KG NextBio:20802936 InterPro:IPR022591 Pfam:PF12157
            Uniprot:Q09813
        Length = 979

 Score = 342 (125.4 bits), Expect = 4.9e-27, Sum P(2) = 4.9e-27
 Identities = 94/311 (30%), Positives = 156/311 (50%)

Query:   666 HVFLMEYCEERPLLLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKS 725
             H  L+E+ EE P +LSNAGM + +  YY+K +  D++   L  G +   +VL ++  D+S
Sbjct:   463 HAILVEFSEEHPAVLSNAGMASRIVNYYRKKNEQDESRPKLEVGES---HVLDVQ--DRS 517

Query:   726 PF--LGDIKAGCSQSSLETNMYRAPVFTHKVATTDFLLVRSAKG---KISIRRIDKVAVV 780
             PF   G ++ G    +L   M RAP+F H+V  TDF+L+R++     K  ++ I+ + V 
Sbjct:   518 PFWNFGSVEPGEITPTLYNKMIRAPLFKHEVPPTDFILIRNSSSYGSKYYLKNINHMFVS 577

Query:   781 AQQEPLMEVMSPGSKNLQTYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEA 840
              Q  P+ +V  P S+ + T S NR+ + V+R    +   GL     + +LS  F + +E 
Sbjct:   578 GQTFPVTDVPGPHSRKVTTASKNRLKMLVFRLIRRSPNGGLF----IRQLSKHFSDQNEM 633

Query:   841 IIRKKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGLYRLKH 900
              IR++LKE    ++ G+G   W +K    +P E   R +  PE VC  ESMQ G+ +L+ 
Sbjct:   634 QIRQRLKEFMEYKKKGDGPGYWKLKSNEVVPDEAGTRSMVSPETVCLLESMQVGVRQLED 693

Query:   901 LGITQLTLPASISSAMSQLPDEAIALAAASHIERELQITPWNLSSNFVACTNQDRENIER 960
              G             M ++ D+      A  +     + PW  + NF+  T Q +  +  
Sbjct:   694 AGY---------GKTMDEINDDEDEEQPAEQL-----LAPWITTRNFINAT-QGKAMLTL 738

Query:   961 L---EITGVGD 968
                 + TG+G+
Sbjct:   739 FGEGDPTGIGE 749

 Score = 53 (23.7 bits), Expect = 4.9e-27, Sum P(2) = 4.9e-27
 Identities = 14/45 (31%), Positives = 23/45 (51%)

Query:   392 HDWEENIVWDNSPAASDNSIESH-EIAGADVESALMRGI-ELDTG 434
             +DWE+NI++D S     N      + +  D++ A    I + DTG
Sbjct:   275 NDWEKNIIYDESDVNKTNQSSFFIDKSLVDIDFAFDENIFDGDTG 319


>ASPGD|ASPL0000011251 [details] [associations]
            symbol:AN3907 species:162425 "Emericella nidulans"
            [GO:0008150 "biological_process" evidence=ND] [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
            InterPro:IPR001878 SMART:SM00343 GO:GO:0008270 GO:GO:0003676
            EMBL:BN001302 EMBL:AACD01000064 eggNOG:COG5179 HOGENOM:HOG000216192
            KO:K03125 OrthoDB:EOG4ZW8KG InterPro:IPR022591 Pfam:PF12157
            RefSeq:XP_661511.1 STRING:Q5B6C3 EnsemblFungi:CADANIAT00004791
            GeneID:2873329 KEGG:ani:AN3907.2 OMA:RKCANCG Uniprot:Q5B6C3
        Length = 1092

 Score = 311 (114.5 bits), Expect = 6.3e-26, Sum P(5) = 6.3e-26
 Identities = 84/256 (32%), Positives = 125/256 (48%)

Query:   654 FKKKSDLSVKDG-HVFLMEYCEERPLLLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNC 712
             +     LS+ D  HV L+EY EE PL LSN GM   +  YY+K +  D            
Sbjct:   534 YDSTKSLSMADNSHVLLVEYSEESPLTLSNFGMANRIINYYRKKNAEDPTRP-----KAE 588

Query:   713 LGNVLTLEPGDKSPF--LGDIKAGCSQSSLETNMYRAPVFTHKVATTDFLLVRSAKGK-- 768
             +G    L P DKSPF   G +  G    ++   MYRAP+F+H+  +TDFL++R++ G   
Sbjct:   589 IGETAVLLPQDKSPFSIFGHVDPGEVTPAISNAMYRAPLFSHQPKSTDFLVIRNSTGSGG 648

Query:   769 --ISIRRIDKVAVVAQQEPLMEVMSPGSKNLQTYSINRMLVNVYREFSAAAKRGLLPCIG 826
                 IR ID + V  QQ P +++  P S+ + T + NRM + VYR    +    L     
Sbjct:   649 SDYYIRNIDNIFVAGQQFPSVDIPGPHSRKVTTVAKNRMKMLVYRLLKKSPDLRL----S 704

Query:   827 VDELSVQFPNLSEAIIRKKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRKLAYPEHVC 886
             + +++   P  S+   R+K+K+  FL+ D + K  W       +P +  +R    PE VC
Sbjct:   705 ISDVTAHIPGTSDMQNRQKVKD--FLQHDKDTKY-WVPLEP--VPEQDVIRSWVQPEDVC 759

Query:   887 SYESMQAGLYRLKHLG 902
               E+MQ G   L   G
Sbjct:   760 LLEAMQVGQQHLHDTG 775

 Score = 107 (42.7 bits), Expect = 0.00013, Sum P(5) = 0.00013
 Identities = 35/111 (31%), Positives = 51/111 (45%)

Query:   870 IPSEGDLRKLAYPEHVCSYESMQAGLYRLKHLGITQLTLPASISSAMSQLPDEAIALAAA 929
             +P +  +R    PE VC  E+MQ G    +HL  T     A          DE     + 
Sbjct:   743 VPEQDVIRSWVQPEDVCLLEAMQVGQ---QHLHDTGYGNDAETGGDDDN--DEE----SE 793

Query:   930 SHIERELQITPWNLSSNFVACTNQDRENIERLEITGVGDPSGRGLGFSYVR 980
             S    E Q+ PW  S NF+  +    +    L++ G GDP+GRG GFS+++
Sbjct:   794 SF---EQQMAPWKASRNFLLAS----QGKAMLKLHGEGDPTGRGEGFSFIK 837

 Score = 63 (27.2 bits), Expect = 6.3e-26, Sum P(5) = 6.3e-26
 Identities = 14/25 (56%), Positives = 18/25 (72%)

Query:   103 EQDYDKKAEDAVDYEDIDEQYEGPE 127
             E +  +KA+DAVDYED D+  E PE
Sbjct:    42 ELEPGEKADDAVDYEDFDDD-ELPE 65

 Score = 55 (24.4 bits), Expect = 6.3e-26, Sum P(5) = 6.3e-26
 Identities = 26/111 (23%), Positives = 45/111 (40%)

Query:   387 FPLDQHDWEENIVWDNSPAASDNSIESHEIAGADVESALMRGIELDTGQNNFHERSTSPN 446
             F  D+   EE I   N+ A ++N +E+ E +    + AL  G   D  +    +   +P+
Sbjct:    58 FDDDELPEEEEITQHNARAVAENGLEAPETS----DDALFGGGGDDLFEAPGKDEQAAPD 113

Query:   447 EKDCIVN-MQNSPVLSETFGSKSSSDNTSHLFTESRYHPQLLRLESQLDLD 496
             E D +     +SP           +D T  LF E    P  ++    + +D
Sbjct:   114 ELDDLFGEAPSSP-------RPDLADETRDLFFEEEERPTAVQRPQAIQMD 157

 Score = 54 (24.1 bits), Expect = 6.3e-26, Sum P(5) = 6.3e-26
 Identities = 13/38 (34%), Positives = 17/38 (44%)

Query:  1061 QQQTRGKCQEIWDRQVQSLSAA---DDDEIGSDSEHSD 1095
             Q+      + IWD Q  SLS+     DDE+    E  D
Sbjct:   874 QKSYETSIRRIWDAQKASLSSTIEHSDDEVNMSEEEED 911

 Score = 39 (18.8 bits), Expect = 6.3e-26, Sum P(5) = 6.3e-26
 Identities = 18/52 (34%), Positives = 24/52 (46%)

Query:   520 RFSKHSLQNRDMMEGSWLDNI----IWEPVDAVGK--PKLILDLQDEQMLFE 565
             R +K     RD ME   L +I    + +P  A  K   K+ LDL D  ML +
Sbjct:   360 RPAKKRKLGRDPMEIVSLSHIDLPLLDDPEQASSKVAQKVTLDLNDPYMLLD 411

 Score = 37 (18.1 bits), Expect = 6.3e-22, Sum P(4) = 6.3e-22
 Identities = 8/28 (28%), Positives = 15/28 (53%)

Query:    99 VDAVEQDYDKKAEDAVDYEDIDEQYEGP 126
             V   E   +++ E+A D  D+D + + P
Sbjct:   286 VPVAETSQEEEEEEARDEFDLDYESDEP 313


>SGD|S000003506 [details] [associations]
            symbol:TAF1 "TFIID subunit (145 kDa)" species:4932
            "Saccharomyces cerevisiae" [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0003682 "chromatin binding" evidence=IDA]
            [GO:0051123 "RNA polymerase II transcriptional preinitiation
            complex assembly" evidence=IMP] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0006366 "transcription from RNA polymerase II promoter"
            evidence=IDA;IMP] [GO:0001075 "RNA polymerase II core promoter
            sequence-specific DNA binding transcription factor activity
            involved in preinitiation complex assembly" evidence=IC]
            [GO:0004402 "histone acetyltransferase activity" evidence=IEA;IDA]
            [GO:0016746 "transferase activity, transferring acyl groups"
            evidence=IEA] [GO:0032947 "protein complex scaffold"
            evidence=IMP;IDA] [GO:0017025 "TBP-class protein binding"
            evidence=IDA] [GO:0016740 "transferase activity" evidence=IEA]
            [GO:0016573 "histone acetylation" evidence=IEA] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0016568 "chromatin modification" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0005669
            "transcription factor TFIID complex" evidence=IDA] [GO:0003676
            "nucleic acid binding" evidence=IEA] InterPro:IPR001878
            SMART:SM00343 SGD:S000003506 GO:GO:0006355 EMBL:BK006941
            GO:GO:0008270 GO:GO:0003682 GO:GO:0003676 GO:GO:0032947
            GO:GO:0004402 GO:GO:0017025 GO:GO:0005669 EMBL:X84098 GO:GO:0051123
            eggNOG:COG5179 HOGENOM:HOG000216192 KO:K03125 OrthoDB:EOG4ZW8KG
            InterPro:IPR022591 Pfam:PF12157 GeneTree:ENSGT00390000012659
            EMBL:U14954 EMBL:Z73059 PIR:S50237 RefSeq:NP_011790.1
            ProteinModelPortal:P46677 DIP:DIP-839N IntAct:P46677
            MINT:MINT-617790 STRING:P46677 PaxDb:P46677 PeptideAtlas:P46677
            PRIDE:P46677 EnsemblFungi:YGR274C GeneID:853191 KEGG:sce:YGR274C
            CYGD:YGR274c OMA:NTTRRCA NextBio:973344 Genevestigator:P46677
            GermOnline:YGR274C Uniprot:P46677
        Length = 1066

 Score = 300 (110.7 bits), Expect = 1.4e-22, Sum P(2) = 1.4e-22
 Identities = 89/292 (30%), Positives = 143/292 (48%)

Query:   611 HKSLADQNVRPNSLIHLIRTKIHLLPRAQKLPGENKSLRPPGAFKKKSDLSVKD-GHVFL 669
             H+     ++RP + I   + K     R +K   + K ++   +F    DL++ D   V+L
Sbjct:   494 HRENFGSHIRPGTKIVFSKLKA----RKRKRD-KGKDVKE--SFSTSQDLTIGDTAPVYL 546

Query:   670 MEYCEERPLLLSNAGMGANLCTYYQKSSPGDQAGALLCSGNNCLGNVLTLEPGDKSPF-- 727
             MEY E+ P+ LS  GM   L  YY+K++  D     L  G     +VL ++  DKSPF  
Sbjct:   547 MEYSEQTPVALSKFGMANKLINYYRKANEQDTLRPKLPVGET---HVLGVQ--DKSPFWN 601

Query:   728 LGDIKAGCSQSSLETNMYRAPVFTHKVATTDFLLVRSAKGKIS----IRRIDKVAVVAQQ 783
              G ++ G    +L  NM RAPVF H ++ TDFLL +S+   IS    +R I+ +  V Q 
Sbjct:   602 FGFVEPGHIVPTLYNNMIRAPVFKHDISGTDFLLTKSSGFGISNRFYLRNINHLFTVGQT 661

Query:   784 EPLMEVMSPGSKNLQTYSINRMLVNVYREFSAAAKRGLLPCIGVDELSVQFPNLSEAIIR 843
              P+ E+  P S+ + +    R+ + +YR  +    +     I +D ++  FP+      R
Sbjct:   662 FPVEEIPGPNSRKVTSMKATRLKMIIYRILNHNHSKA----ISIDPIAKHFPDQDYGQNR 717

Query:   844 KKLKECAFLRRDGNGKQVWSMKRTFHIPSEGDLRKLAYPEHVCSYESMQAGL 895
             +K+KE    +RDG  K +W +K    +     ++ L  PE +   ESM  GL
Sbjct:   718 QKVKEFMKYQRDGPEKGLWRLKDDEKLLDNEAVKSLITPEQISQVESMSQGL 769

 Score = 55 (24.4 bits), Expect = 1.4e-22, Sum P(2) = 1.4e-22
 Identities = 13/35 (37%), Positives = 19/35 (54%)

Query:   101 AVEQDYDKKAEDAVDYEDIDEQYEGPEIQIASEED 135
             A  +DY +   DAVD+ED DE  +  +  +  E D
Sbjct:    42 ANSKDYTEHLPDAVDFEDEDELADDDD-DLPEESD 75


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.314   0.132   0.383    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0     1282      1176   0.00095  123 3  11 23  0.45    34
                                                     38  0.45    37
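
The bit scores shown in parentheses after each raw score can be reproduced from the parameter table above. The sketch below is a minimal illustration, not part of the BLAST output: it applies the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2, and assumes the gapped values in the "Q=9,R=2" row (Lambda = 0.244, K = 0.0300) are the ones used. That assumption is checked only against the scores reported in this file. The function name `bit_score` is hypothetical.

```python
import math

# Gapped Karlin-Altschul parameters copied from the "Q=9,R=2" row of the
# Parameters table. Using this row (rather than the ungapped BLOSUM62 row)
# is an assumption, verified only against the scores in this report.
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score: int, lam: float = LAMBDA, k: float = K) -> float:
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from the "Score = ..." lines of the alignments above.
    for raw in (441, 351, 57, 38):
        print(f"Score = {raw} ({bit_score(raw):.1f} bits)")
```

Under these assumptions a raw score of 441 corresponds to 160.3 bits and 351 to 128.6 bits, matching the alignment headers in this report.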


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  23
  No. of states in DFA:  633 (67 KB)
  Total size of DFA:  543 KB (2247 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  106.54u 0.07s 106.61t   Elapsed:  00:00:09
  Total cpu time:  106.55u 0.07s 106.62t   Elapsed:  00:00:09
  Start:  Tue May 21 10:12:59 2013   End:  Tue May 21 10:13:08 2013
