BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>psy13810
ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGK
PWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISADAVRTLFNEGAENP
VITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGM
SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVY
AYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV
SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKR
TVVVTKQGHSDIKKFWVNHQKGSQHS

High Scoring Gene Products

Symbol, full name Information P value
GTF2H4
General transcription factor IIH subunit 4
protein from Homo sapiens 6.5e-106
GTF2H4
General transcription factor IIH subunit 4
protein from Pan troglodytes 6.5e-106
Gtf2h4
general transcription factor II H, polypeptide 4
gene from Rattus norvegicus 6.5e-106
Gtf2h4
general transcription factor II H, polypeptide 4
protein from Mus musculus 8.2e-106
GTF2H4
Uncharacterized protein
protein from Canis lupus familiaris 2.8e-105
TFIIH
General transcription factor IIH, polypeptide 4
protein from Sus scrofa 2.8e-105
GTF2H4
GTF2H4 protein
protein from Bos taurus 5.8e-105
mrn
marionette
protein from Drosophila melanogaster 3.5e-88
gtf2h4
general transcription factor IIH, polypeptide 4
gene_product from Danio rerio 2.8e-80
TFB2
Subunit of TFIIH and nucleotide excision repair factor 3 complexes
gene from Saccharomyces cerevisiae 4.0e-57
GTF2H4
General transcription factor IIH subunit 4
protein from Homo sapiens 4.8e-52
MGG_00397
RNA polymerase II transcription factor B subunit 2
protein from Magnaporthe oryzae 70-15 1.7e-51
gtf2h4
TFIIH subunit
gene from Dictyostelium discoideum 6.0e-48
orf19.5846 gene_product from Candida albicans 5.3e-47
Y73F8A.24 gene from Caenorhabditis elegans 5.4e-47
GTF2H4
General transcription factor IIH subunit 4
protein from Homo sapiens 1.5e-07

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  psy13810
        (386 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

UNIPROTKB|Q92759 - symbol:GTF2H4 "General transcription f...   941  6.5e-106  2
UNIPROTKB|P60027 - symbol:GTF2H4 "General transcription f...   941  6.5e-106  2
RGD|1303309 - symbol:Gtf2h4 "general transcription factor...   941  6.5e-106  2
MGI|MGI:1338799 - symbol:Gtf2h4 "general transcription fa...   940  8.2e-106  2
UNIPROTKB|F1PGI3 - symbol:GTF2H4 "Uncharacterized protein...   938  2.8e-105  2
UNIPROTKB|Q767M2 - symbol:TFIIH "Uncharacterized protein"...   935  2.8e-105  2
UNIPROTKB|A6H7G8 - symbol:GTF2H4 "Uncharacterized protein...   932  5.8e-105  2
FB|FBgn0261109 - symbol:mrn "marionette" species:7227 "Dr...   741  3.5e-88   2
ZFIN|ZDB-GENE-030131-6779 - symbol:gtf2h4 "general transc...   698  2.8e-80   2
POMBASE|SPBC13G1.13 - symbol:tfb2 "transcription factor T...   617  4.2e-63   2
SGD|S000006043 - symbol:TFB2 "Subunit of TFIIH and nucleo...   270  4.0e-57   4
UNIPROTKB|B4DTJ5 - symbol:GTF2H4 "cDNA FLJ50212, highly s...   456  4.8e-52   2
UNIPROTKB|G4NCE6 - symbol:MGG_00397 "RNA polymerase II tr...   407  1.7e-51   3
ASPGD|ASPL0000068300 - symbol:AN7015 species:162425 "Emer...   466  5.3e-49   2
DICTYBASE|DDB_G0293228 - symbol:gtf2h4 "TFIIH subunit" sp...   454  6.0e-48   2
CGD|CAL0002355 - symbol:orf19.5846 species:5476 "Candida ...   448  5.3e-47   2
WB|WBGene00013529 - symbol:Y73F8A.24 species:6239 "Caenor...   492  5.4e-47   1
UNIPROTKB|A2ABL5 - symbol:GTF2H4 "General transcription f...   127  1.5e-07   1


>UNIPROTKB|Q92759 [details] [associations]
            symbol:GTF2H4 "General transcription factor IIH subunit 4"
            species:9606 "Homo sapiens" [GO:0000439 "core TFIIH complex"
            evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
            evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=IDA;TAS] [GO:0004672 "protein kinase activity"
            evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005675 "holo TFIIH complex" evidence=IDA] [GO:0008353 "RNA
            polymerase II carboxy-terminal domain kinase activity"
            evidence=IDA] [GO:0008094 "DNA-dependent ATPase activity"
            evidence=IDA] [GO:0003700 "sequence-specific DNA binding
            transcription factor activity" evidence=TAS] [GO:0000718
            "nucleotide-excision repair, DNA damage removal" evidence=TAS]
            [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006281 "DNA repair"
            evidence=TAS] [GO:0006283 "transcription-coupled
            nucleotide-excision repair" evidence=TAS] [GO:0006289
            "nucleotide-excision repair" evidence=TAS] [GO:0006360
            "transcription from RNA polymerase I promoter" evidence=TAS]
            [GO:0006361 "transcription initiation from RNA polymerase I
            promoter" evidence=TAS] [GO:0006362 "transcription elongation from
            RNA polymerase I promoter" evidence=TAS] [GO:0006363 "termination
            of RNA polymerase I transcription" evidence=TAS] [GO:0006367
            "transcription initiation from RNA polymerase II promoter"
            evidence=TAS] [GO:0006368 "transcription elongation from RNA
            polymerase II promoter" evidence=TAS] [GO:0006370
            "7-methylguanosine mRNA capping" evidence=TAS] [GO:0010467 "gene
            expression" evidence=TAS] [GO:0016032 "viral reproduction"
            evidence=TAS] [GO:0050434 "positive regulation of viral
            transcription" evidence=TAS] [GO:0006468 "protein phosphorylation"
            evidence=IDA] Reactome:REACT_216 Reactome:REACT_71
            InterPro:IPR004598 Pfam:PF03849 Reactome:REACT_116125 GO:GO:0016032
            EMBL:BA000025 EMBL:CH471081 Reactome:REACT_1675 GO:GO:0006468
            GO:GO:0003700 GO:GO:0006368 GO:GO:0006367 Reactome:REACT_1788
            GO:GO:0005675 GO:GO:0006370 GO:GO:0000718 GO:GO:0050434
            GO:GO:0006363 GO:GO:0006362 GO:GO:0006361 GO:GO:0006283
            EMBL:CR759747 EMBL:AL773541 EMBL:BX927194 EMBL:CR936875
            eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152 EMBL:Y07595
            EMBL:BT007321 EMBL:AY124590 EMBL:AB088103 EMBL:AB202101
            EMBL:AB103609 EMBL:BC004935 EMBL:BC016302 IPI:IPI00016839
            RefSeq:NP_001508.1 UniGene:Hs.485070 ProteinModelPortal:Q92759
            SMR:Q92759 DIP:DIP-48376N STRING:Q92759 PhosphoSite:Q92759
            DMDM:17380328 PaxDb:Q92759 PRIDE:Q92759 DNASU:2968
            Ensembl:ENST00000259895 Ensembl:ENST00000376316
            Ensembl:ENST00000376326 Ensembl:ENST00000400450
            Ensembl:ENST00000413314 Ensembl:ENST00000416773
            Ensembl:ENST00000423881 Ensembl:ENST00000434226
            Ensembl:ENST00000435498 Ensembl:ENST00000438348
            Ensembl:ENST00000440824 Ensembl:ENST00000456968 GeneID:2968
            KEGG:hsa:2968 UCSC:uc003nsa.1 CTD:2968 GeneCards:GC06P030875
            HGNC:HGNC:4658 MIM:601760 neXtProt:NX_Q92759 PharmGKB:PA29044
            HOGENOM:HOG000170529 HOVERGEN:HBG054205 InParanoid:Q92759
            OMA:CVLHYMV OrthoDB:EOG4F1X34 PhylomeDB:Q92759 GenomeRNAi:2968
            NextBio:11764 ArrayExpress:Q92759 Bgee:Q92759 CleanEx:HS_GTF2H4
            Genevestigator:Q92759 GermOnline:ENSG00000137411 TIGRFAMs:TIGR00625
            Uniprot:Q92759
        Length = 462

 Score = 941 (336.3 bits), Expect = 6.5e-106, Sum P(2) = 6.5e-106
 Identities = 192/349 (55%), Positives = 249/349 (71%)

Query:    48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
             R+N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS     +S 
Sbjct:   115 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA-AVSQ 173

Query:   107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
             D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE 
Sbjct:   174 DLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 233

Query:   160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT- 218
             L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+LA+N+++ 
Sbjct:   234 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAINLSSG 293

Query:   219 -RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
               G    + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  +TR+SV+ 
Sbjct:   294 VSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQ 353

Query:   278 ALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFV 334
             A+ SGITA QII FLR  A                DQIRLWE ER+RL  +EGVLY+QF+
Sbjct:   354 AIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFL 413

Query:   335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
             SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   QK S
Sbjct:   414 SQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFW-KRQKHS 461

 Score = 127 (49.8 bits), Expect = 6.5e-106, Sum P(2) = 6.5e-106
 Identities = 27/59 (45%), Positives = 40/59 (67%)

Query:     1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
             EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR      LL GG
Sbjct:    47 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 104


>UNIPROTKB|P60027 [details] [associations]
            symbol:GTF2H4 "General transcription factor IIH subunit 4"
            species:9598 "Pan troglodytes" [GO:0005675 "holo TFIIH complex"
            evidence=ISS] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=ISS] [GO:0004672 "protein kinase activity"
            evidence=ISS] [GO:0008094 "DNA-dependent ATPase activity"
            evidence=ISS] [GO:0008353 "RNA polymerase II carboxy-terminal
            domain kinase activity" evidence=ISS] InterPro:IPR004598
            Pfam:PF03849 GO:GO:0006355 GO:GO:0006281 EMBL:BA000041
            GO:GO:0006366 GO:GO:0008353 GO:GO:0005675 GO:GO:0008094
            eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152 CTD:2968
            HOGENOM:HOG000170529 HOVERGEN:HBG054205 OMA:CVLHYMV
            OrthoDB:EOG4F1X34 TIGRFAMs:TIGR00625 EMBL:AB210199 EMBL:AB210200
            RefSeq:NP_001038968.1 UniGene:Ptr.6202 ProteinModelPortal:P60027
            SMR:P60027 STRING:P60027 PRIDE:P60027 Ensembl:ENSPTRT00000033136
            GeneID:462549 KEGG:ptr:462549 GeneTree:ENSGT00390000014159
            InParanoid:P60027 NextBio:20841799 Uniprot:P60027
        Length = 462

 Score = 941 (336.3 bits), Expect = 6.5e-106, Sum P(2) = 6.5e-106
 Identities = 192/349 (55%), Positives = 249/349 (71%)

Query:    48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
             R+N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS     +S 
Sbjct:   115 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA-AVSQ 173

Query:   107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
             D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE 
Sbjct:   174 DLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 233

Query:   160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT- 218
             L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+LA+N+++ 
Sbjct:   234 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAINLSSG 293

Query:   219 -RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
               G    + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  +TR+SV+ 
Sbjct:   294 VSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQ 353

Query:   278 ALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFV 334
             A+ SGITA QII FLR  A                DQIRLWE ER+RL  +EGVLY+QF+
Sbjct:   354 AIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFL 413

Query:   335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
             SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   QK S
Sbjct:   414 SQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFW-KRQKHS 461

 Score = 127 (49.8 bits), Expect = 6.5e-106, Sum P(2) = 6.5e-106
 Identities = 27/59 (45%), Positives = 40/59 (67%)

Query:     1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
             EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR      LL GG
Sbjct:    47 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 104


>RGD|1303309 [details] [associations]
            symbol:Gtf2h4 "general transcription factor II H, polypeptide 4"
            species:10116 "Rattus norvegicus" [GO:0000439 "core TFIIH complex"
            evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
            evidence=IEA] [GO:0005675 "holo TFIIH complex" evidence=IEA;ISO]
            [GO:0006289 "nucleotide-excision repair" evidence=IEA] [GO:0006366
            "transcription from RNA polymerase II promoter" evidence=IEA;ISO]
            [GO:0006468 "protein phosphorylation" evidence=ISO] [GO:0008353
            "RNA polymerase II carboxy-terminal domain kinase activity"
            evidence=IEA;ISO] [GO:0004672 "protein kinase activity"
            evidence=ISO] [GO:0008094 "DNA-dependent ATPase activity"
            evidence=ISO] InterPro:IPR004598 Pfam:PF03849 RGD:1303309
            GO:GO:0006355 GO:GO:0006281 GO:GO:0006366 GO:GO:0008353
            GO:GO:0005675 GO:GO:0008094 EMBL:BX883047 eggNOG:COG5144 KO:K03144
            PANTHER:PTHR13152 CTD:2968 HOGENOM:HOG000170529 HOVERGEN:HBG054205
            OMA:CVLHYMV OrthoDB:EOG4F1X34 TIGRFAMs:TIGR00625
            GeneTree:ENSGT00390000014159 EMBL:BC127469 IPI:IPI00215269
            RefSeq:NP_997666.1 UniGene:Rn.34122 STRING:Q6MG20
            Ensembl:ENSRNOT00000001103 GeneID:294236 KEGG:rno:294236
            UCSC:RGD:1303309 InParanoid:Q6MG20 NextBio:637807
            Genevestigator:Q6MG20 Uniprot:Q6MG20
        Length = 463

 Score = 941 (336.3 bits), Expect = 6.5e-106, Sum P(2) = 6.5e-106
 Identities = 192/349 (55%), Positives = 249/349 (71%)

Query:    48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
             R+N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS     +S 
Sbjct:   116 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA-AVSQ 174

Query:   107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
             D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE 
Sbjct:   175 DLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 234

Query:   160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT- 218
             L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+LA+N+++ 
Sbjct:   235 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAINLSSG 294

Query:   219 -RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
               G    + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  +TR+SV+ 
Sbjct:   295 VSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQ 354

Query:   278 ALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFV 334
             A+ SGITA QII FLR  A                DQIRLWE ER+RL  +EGVLY+QF+
Sbjct:   355 AIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFL 414

Query:   335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
             SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   QK S
Sbjct:   415 SQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFW-KRQKHS 462

 Score = 127 (49.8 bits), Expect = 6.5e-106, Sum P(2) = 6.5e-106
 Identities = 27/59 (45%), Positives = 40/59 (67%)

Query:     1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
             EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR      LL GG
Sbjct:    48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 105


>MGI|MGI:1338799 [details] [associations]
            symbol:Gtf2h4 "general transcription factor II H,
            polypeptide 4" species:10090 "Mus musculus" [GO:0000439 "core TFIIH
            complex" evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase
            activity" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0005675 "holo TFIIH complex" evidence=ISO] [GO:0006281 "DNA
            repair" evidence=IEA] [GO:0006289 "nucleotide-excision repair"
            evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=ISO] [GO:0006468 "protein
            phosphorylation" evidence=ISO] [GO:0006974 "response to DNA damage
            stimulus" evidence=IEA] InterPro:IPR004598 Pfam:PF03849
            MGI:MGI:1338799 GO:GO:0006355 GO:GO:0006281 GO:GO:0006366
            GO:GO:0008353 GO:GO:0005675 GO:GO:0008094 eggNOG:COG5144 KO:K03144
            PANTHER:PTHR13152 CTD:2968 HOGENOM:HOG000170529 HOVERGEN:HBG054205
            OMA:CVLHYMV OrthoDB:EOG4F1X34 TIGRFAMs:TIGR00625 EMBL:AF054823
            EMBL:BC012638 IPI:IPI00118172 RefSeq:NP_034494.1 UniGene:Mm.10182
            ProteinModelPortal:O70422 SMR:O70422 IntAct:O70422 STRING:O70422
            PhosphoSite:O70422 PaxDb:O70422 PRIDE:O70422
            Ensembl:ENSMUST00000001565 Ensembl:ENSMUST00000160734 GeneID:14885
            KEGG:mmu:14885 InParanoid:O70422 NextBio:287167 Bgee:O70422
            CleanEx:MM_GTF2H4 Genevestigator:O70422
            GermOnline:ENSMUSG00000001524 Uniprot:O70422
        Length = 463

 Score = 940 (336.0 bits), Expect = 8.2e-106, Sum P(2) = 8.2e-106
 Identities = 192/349 (55%), Positives = 249/349 (71%)

Query:    48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
             R+N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS     +S 
Sbjct:   116 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA-AVSQ 174

Query:   107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
             D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE 
Sbjct:   175 DLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 234

Query:   160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT- 218
             L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+LA+N+++ 
Sbjct:   235 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAINLSSG 294

Query:   219 -RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
               G    + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  +TR+SV+ 
Sbjct:   295 VSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQ 354

Query:   278 ALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFV 334
             A+ SGITA QII FLR  A                DQIRLWE ER+RL  +EGVLY+QF+
Sbjct:   355 AIASGITAQQIIHFLRTRAHPVMLKQNPVLPPTITDQIRLWELERDRLRFTEGVLYNQFL 414

Query:   335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
             SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   QK S
Sbjct:   415 SQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFW-KRQKHS 462

 Score = 127 (49.8 bits), Expect = 8.2e-106, Sum P(2) = 8.2e-106
 Identities = 27/59 (45%), Positives = 40/59 (67%)

Query:     1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
             EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR      LL GG
Sbjct:    48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 105


>UNIPROTKB|F1PGI3 [details] [associations]
            symbol:GTF2H4 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006289 "nucleotide-excision repair"
            evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
            evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
            InterPro:IPR004598 Pfam:PF03849 GO:GO:0005634 GO:GO:0006355
            GO:GO:0006281 KO:K03144 PANTHER:PTHR13152 OMA:CVLHYMV
            TIGRFAMs:TIGR00625 GeneTree:ENSGT00390000014159 EMBL:AAEX03008206
            RefSeq:XP_003639474.1 Ensembl:ENSCAFT00000000727 GeneID:100856712
            KEGG:cfa:100856712 Uniprot:F1PGI3
        Length = 463

 Score = 938 (335.3 bits), Expect = 2.8e-105, Sum P(2) = 2.8e-105
 Identities = 193/349 (55%), Positives = 248/349 (71%)

Query:    48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
             R+N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS     +S 
Sbjct:   116 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA-AVSQ 174

Query:   107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
             D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE 
Sbjct:   175 DLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 234

Query:   160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT- 218
             L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+LA+N+++ 
Sbjct:   235 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAINLSSG 294

Query:   219 -RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
               G      +PGF+IVETN+R+YAYT+S L++ALI LF E++YRFPN+ V  +TR+SV+ 
Sbjct:   295 VSGAGGTAHQPGFIIVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQ 354

Query:   278 ALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFV 334
             A+ SGITA QII FLR  A                DQIRLWE ER+RL  +EGVLY+QF+
Sbjct:   355 AIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFL 414

Query:   335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
             SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   QK S
Sbjct:   415 SQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFW-KRQKHS 462

 Score = 124 (48.7 bits), Expect = 2.8e-105, Sum P(2) = 2.8e-105
 Identities = 27/59 (45%), Positives = 40/59 (67%)

Query:     1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
             EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR      LL GG
Sbjct:    48 ELPFLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 105


>UNIPROTKB|Q767M2 [details] [associations]
            symbol:TFIIH "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008353 "RNA polymerase II carboxy-terminal domain
            kinase activity" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0005675 "holo TFIIH
            complex" evidence=IEA] [GO:0006289 "nucleotide-excision repair"
            evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
            evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
            InterPro:IPR004598 Pfam:PF03849 GO:GO:0006355 GO:GO:0006281
            GO:GO:0006366 GO:GO:0008353 GO:GO:0005675 GO:GO:0008094
            EMBL:AB113354 EMBL:AB113355 eggNOG:COG5144 KO:K03144
            PANTHER:PTHR13152 CTD:2968 HOGENOM:HOG000170529 HOVERGEN:HBG054205
            OMA:CVLHYMV OrthoDB:EOG4F1X34 TIGRFAMs:TIGR00625
            GeneTree:ENSGT00390000014159 EMBL:CT971577 RefSeq:NP_001116592.1
            UniGene:Ssc.7512 STRING:Q767M2 Ensembl:ENSSSCT00000001505
            GeneID:100144458 KEGG:ssc:100144458 Uniprot:Q767M2
        Length = 463

 Score = 935 (334.2 bits), Expect = 2.8e-105, Sum P(2) = 2.8e-105
 Identities = 191/349 (54%), Positives = 248/349 (71%)

Query:    48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
             R+N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS     +S 
Sbjct:   116 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA-AVSQ 174

Query:   107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
             D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE 
Sbjct:   175 DLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 234

Query:   160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT- 218
             L+FLFQL FST G DYS EGMSD L  FLQHLRE GLV+QRKRK+ R+YPT+LA+N+++ 
Sbjct:   235 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLRELGLVFQRKRKSRRYYPTRLAINLSSG 294

Query:   219 -RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
               G    + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  +TR+SV+ 
Sbjct:   295 VSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQ 354

Query:   278 ALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFV 334
             A+ SGITA QII FLR  A                DQIRLWE ER+RL  +EGVLY+QF+
Sbjct:   355 AIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFL 414

Query:   335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
             SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   QK S
Sbjct:   415 SQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFW-KRQKHS 462

 Score = 127 (49.8 bits), Expect = 2.8e-105, Sum P(2) = 2.8e-105
 Identities = 27/59 (45%), Positives = 40/59 (67%)

Query:     1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
             EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR      LL GG
Sbjct:    48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 105


>UNIPROTKB|A6H7G8 [details] [associations]
            symbol:GTF2H4 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008353 "RNA polymerase II carboxy-terminal domain
            kinase activity" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0005675 "holo TFIIH
            complex" evidence=IEA] [GO:0006289 "nucleotide-excision repair"
            evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
            evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
            InterPro:IPR004598 Pfam:PF03849 GO:GO:0006355 GO:GO:0006281
            GO:GO:0006366 GO:GO:0008353 GO:GO:0005675 GO:GO:0008094
            eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152 CTD:2968
            HOGENOM:HOG000170529 HOVERGEN:HBG054205 OMA:CVLHYMV
            TIGRFAMs:TIGR00625 GeneTree:ENSGT00390000014159 EMBL:DAAA02055395
            EMBL:BC146240 IPI:IPI00707368 RefSeq:NP_001094527.1
            UniGene:Bt.98689 Ensembl:ENSBTAT00000054213 GeneID:100137723
            KEGG:bta:100137723 NextBio:20789427 Uniprot:A6H7G8
        Length = 463

 Score = 932 (333.1 bits), Expect = 5.8e-105, Sum P(2) = 5.8e-105
 Identities = 190/349 (54%), Positives = 247/349 (70%)

Query:    48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
             R+N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS     +S 
Sbjct:   116 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVLSLDKYAEERWEVVLHFMVGSPSA-AVSQ 174

Query:   107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
             D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE 
Sbjct:   175 DLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 234

Query:   160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT- 218
             L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+LA+N+++ 
Sbjct:   235 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAINLSSG 294

Query:   219 -RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
               G      +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  +TR+SV+ 
Sbjct:   295 VSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQ 354

Query:   278 ALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFV 334
             A+ SGITA QII FLR  A                DQIRLWE ER+RL  +EGVLY+QF+
Sbjct:   355 AIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFL 414

Query:   335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
             SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct:   415 SQVDFELLLAHARELGVLMFENSAKRLMVVTPAGHSDVKRFWKRQKHNS 463

 Score = 127 (49.8 bits), Expect = 5.8e-105, Sum P(2) = 5.8e-105
 Identities = 27/59 (45%), Positives = 40/59 (67%)

Query:     1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
             EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR      LL GG
Sbjct:    48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 105


>FB|FBgn0261109 [details] [associations]
            symbol:mrn "marionette" species:7227 "Drosophila
            melanogaster" [GO:0005675 "holo TFIIH complex" evidence=ISS;IDA]
            [GO:0006367 "transcription initiation from RNA polymerase II
            promoter" evidence=ISS] [GO:0000439 "core TFIIH complex"
            evidence=ISS;IPI] [GO:0004003 "ATP-dependent DNA helicase activity"
            evidence=IEA] [GO:0001671 "ATPase activator activity" evidence=IDA]
            [GO:0001111 "promoter clearance from RNA polymerase II promoter"
            evidence=ISS] [GO:0006289 "nucleotide-excision repair"
            evidence=IDA;IMP] [GO:0010224 "response to UV-B" evidence=IMP]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0001113 "transcriptional
            open complex formation at RNA polymerase II promoter" evidence=ISS]
            [GO:0006366 "transcription from RNA polymerase II promoter"
            evidence=IDA] InterPro:IPR004598 Pfam:PF03849 EMBL:AE014296
            GO:GO:0006355 GO:GO:0001671 GO:GO:0006367 GO:GO:0010224
            GO:GO:0006289 GO:GO:0005675 GO:GO:0000439 KO:K03144
            PANTHER:PTHR13152 OMA:CVLHYMV TIGRFAMs:TIGR00625
            GeneTree:ENSGT00390000014159 FlyBase:FBgn0261109 RefSeq:NP_648780.1
            ProteinModelPortal:Q9VUR1 SMR:Q9VUR1 IntAct:Q9VUR1 MINT:MINT-849737
            STRING:Q9VUR1 PRIDE:Q9VUR1 EnsemblMetazoa:FBtr0075595 GeneID:39688
            KEGG:dme:Dmel_CG7764 UCSC:CG7764-RA CTD:39688 InParanoid:Q9VUR1
            PhylomeDB:Q9VUR1 GenomeRNAi:39688 NextBio:814886
            ArrayExpress:Q9VUR1 Bgee:Q9VUR1 Uniprot:Q9VUR1
        Length = 499

 Score = 741 (265.9 bits), Expect = 3.5e-88, Sum P(2) = 3.5e-88
 Identities = 147/269 (54%), Positives = 194/269 (72%)

Query:    48 RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS-----QQTE 102
             +++ +  LLGGGKPW M+  L+ DSK RD+ FL+ YA+ RW C+L +MVG+        E
Sbjct:   134 KKSVRQVLLGGGKPWPMTNTLDKDSKPRDIAFLDTYAMSRWRCVLHYMVGTGNRNGTDAE 193

Query:   103 GISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
              IS DAVR L +      +  +   IT+ GFQFLLL+T AQVW+F+L+YL+T E +G+ L
Sbjct:   194 AISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDTCEERGISL 253

Query:   157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM 216
              ECL+ LFQL FST G DYS+EGM+ ++  FLQHLREFGLV+QRKRK GRFYPT+LALN+
Sbjct:   254 PECLSMLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLVFQRKRKEGRFYPTRLALNV 313

Query:   217 ATRGTL---------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTV 267
              ++            +  ++ G+++VETN+RVYAYTDS L+VA++GLF EL+YRFPNL V
Sbjct:   314 TSKEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVV 373

Query:   268 GVLTRDSVRAALRSGITAAQIIGFLRLHA 296
             GVLTRDSVR ALR GITA QI+ +L  +A
Sbjct:   374 GVLTRDSVRQALRGGITAEQIVSYLEQYA 402

 Score = 458 (166.3 bits), Expect = 2.3e-58, Sum P(2) = 2.3e-58
 Identities = 90/162 (55%), Positives = 113/162 (69%)

Query:   229 GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
             G+++VETN+RVYAYTDS L+VA++GLF EL+YRFPNL VGVLTRDSVR ALR GITA QI
Sbjct:   335 GYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 394

Query:   289 IGFLRLHAXXXXXXXXXXXXX---------DQIRLWEGERERLTTSEGVLYSQFVSQADF 339
             + +L  +A                      DQI+LWE ER R T +EGVLY+QF+S  DF
Sbjct:   395 VSYLEQYAHPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHTDF 454

Query:   340 EKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
               LRDYAQ + +L WQNE+ RT+VV K GH D+K++W  + K
Sbjct:   455 VTLRDYAQSIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 496

 Score = 159 (61.0 bits), Expect = 3.5e-88, Sum P(2) = 3.5e-88
 Identities = 33/65 (50%), Positives = 43/65 (66%)

Query:     1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGK 60
             EL EIA+ ++IR+LFV+QPVPQAV+ SW ++  AKE  EAT  LT L      A+ GG  
Sbjct:    66 ELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLS 125

Query:    61 PWSMS 65
              W +S
Sbjct:   126 AWELS 130


>ZFIN|ZDB-GENE-030131-6779 [details] [associations]
            symbol:gtf2h4 "general transcription factor IIH,
            polypeptide 4" species:7955 "Danio rerio" [GO:0005634 "nucleus"
            evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
            evidence=IEA] [GO:0006289 "nucleotide-excision repair"
            evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
            InterPro:IPR004598 Pfam:PF03849 ZFIN:ZDB-GENE-030131-6779
            GO:GO:0005634 GO:GO:0006355 GO:GO:0006281 KO:K03144
            PANTHER:PTHR13152 CTD:2968 HOVERGEN:HBG054205 TIGRFAMs:TIGR00625
            EMBL:BC064301 IPI:IPI00511064 RefSeq:NP_956221.1 UniGene:Dr.75464
            STRING:Q6P2T8 GeneID:334839 KEGG:dre:334839 InParanoid:Q6P2T8
            NextBio:20810565 ArrayExpress:Q6P2T8 Uniprot:Q6P2T8
        Length = 466

 Score = 698 (250.8 bits), Expect = 2.8e-80, Sum P(2) = 2.8e-80
 Identities = 143/268 (53%), Positives = 186/268 (69%)

Query:    50 NAKVALLGGGKPWS-MSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISADA 108
             N ++ALLGGGKPW+   A L  D  ARD++ L++YA+ERWE IL FMVGS     +S D 
Sbjct:   110 NLRIALLGGGKPWADEGANLGPDRHARDVESLDRYAMERWEVILHFMVGSPSA-AVSQDL 168

Query:   109 VRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTF 162
              + L   G       E P IT  GFQFLLL+T +Q+WYF L+YL+T +S+G+DLVE L+F
Sbjct:   169 AQLLIQAGLMKSETGEAPCITSAGFQFLLLDTASQLWYFTLQYLKTAQSRGMDLVEILSF 228

Query:   163 LFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR--- 219
             LFQL FST G DYS EGMS+ L  FLQHLREFGLV+QRKRK+ R+YPT+LA+ +A     
Sbjct:   229 LFQLSFSTLGRDYSVEGMSESLLTFLQHLREFGLVFQRKRKSRRYYPTRLAITLAAGVTA 288

Query:   220 -----------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
                        G +    + GF++VETN+R+YAYT+S L++AL+ LF E++YRFPNL V 
Sbjct:   289 NPASGSASSALGAIPGTGDTGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNLVVA 348

Query:   269 VLTRDSVRAALRSGITAAQIIGFLRLHA 296
              +TR+SV+ A+ +GITA QII FLR  A
Sbjct:   349 QVTRESVQQAISNGITAQQIIHFLRTRA 376

 Score = 446 (162.1 bits), Expect = 9.9e-54, Sum P(2) = 9.9e-54
 Identities = 91/170 (53%), Positives = 117/170 (68%)

Query:   220 GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAAL 279
             G +    + GF++VETN+R+YAYT+S L++AL+ LF E++YRFPNL V  +TR+SV+ A+
Sbjct:   300 GAIPGTGDTGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNLVVAQVTRESVQQAI 359

Query:   280 RSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFVSQ 336
              +GITA QII FLR  A                DQIRLWE E++RL  +EGVLY+QF+SQ
Sbjct:   360 SNGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELEKDRLQFTEGVLYNQFLSQ 419

Query:   337 ADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQHS 386
             ADFE LRD AQ LGVL WQN   R +VVT  GHS++K+FW   QK   HS
Sbjct:   420 ADFEVLRDRAQGLGVLVWQNPAHRVMVVTPHGHSEVKRFW-KRQKS--HS 466

 Score = 127 (49.8 bits), Expect = 2.8e-80, Sum P(2) = 2.8e-80
 Identities = 25/58 (43%), Positives = 36/58 (62%)

Query:     1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGG 58
             EL  +AK+YV+R+LF++ P+PQA +A W+ K   K+H +    LT LR      L GG
Sbjct:    40 ELPPLAKNYVMRMLFLDHPLPQAAVALWVKKGSQKDHDQCVSVLTGLRLWHSQHLQGG 97


>POMBASE|SPBC13G1.13 [details] [associations]
            symbol:tfb2 "transcription factor TFIIH complex subunit
            Tfb2" species:4896 "Schizosaccharomyces pombe" [GO:0000112
            "nucleotide-excision repair factor 3 complex" evidence=ISO]
            [GO:0000439 "core TFIIH complex" evidence=ISO] [GO:0000717
            "nucleotide-excision repair, DNA duplex unwinding" evidence=ISO]
            [GO:0000991 "core RNA polymerase II binding transcription factor
            activity" evidence=ISO] [GO:0004003 "ATP-dependent DNA helicase
            activity" evidence=IEA] [GO:0005675 "holo TFIIH complex"
            evidence=ISO] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0006367 "transcription initiation from RNA polymerase II
            promoter" evidence=ISO] InterPro:IPR004598 Pfam:PF03849
            PomBase:SPBC13G1.13 GO:GO:0005737 GO:GO:0006355 EMBL:CU329671
            GenomeReviews:CU329671_GR GO:GO:0006367 GO:GO:0000991 GO:GO:0005675
            GO:GO:0000717 GO:GO:0000439 GO:GO:0000112 eggNOG:COG5144 KO:K03144
            OMA:PVNCLPI PANTHER:PTHR13152 HOGENOM:HOG000170529
            TIGRFAMs:TIGR00625 PIR:T39414 RefSeq:NP_596563.1
            ProteinModelPortal:P87303 IntAct:P87303 STRING:P87303
            EnsemblFungi:SPBC13G1.13.1 GeneID:2539790 KEGG:spo:SPBC13G1.13
            OrthoDB:EOG4CZFQD NextBio:20800939 Uniprot:P87303
        Length = 447

 Score = 617 (222.3 bits), Expect = 4.2e-63, Sum P(2) = 4.2e-63
 Identities = 144/339 (42%), Positives = 197/339 (58%)

Query:    45 TELRRNAKVALLGGGKPWSMSAKL-EVDSKARDLDFLNQYALERWECILRFMVGSQQT-- 101
             +E R+    AL GGG   S      + D     +DFL+ YA E WE IL FMVG+ +   
Sbjct:    95 SEFRKQFITALTGGGNHNSFGVPCTDEDKHLVTVDFLDAYAKETWETILHFMVGTPEAKF 154

Query:   102 --EGISADAVRTLFNEGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVE 158
               EG+ +   R     G +N + IT+ GFQFLL +  AQ+W  +L YL+  E   +D V+
Sbjct:   155 PGEGVLSLLKRGGLMSGPKNQLRITRAGFQFLLQDINAQIWTLLLDYLKLSEDTHMDPVQ 214

Query:   159 CLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT 218
              L FLF L     G  YS + ++D  Q+ L+ LRE+GLVYQRK  + RFYPT+LA  + T
Sbjct:   215 VLHFLFMLGSLDLGRAYSVDFLTDTQQIMLEDLREYGLVYQRKITSKRFYPTRLATGLTT 274

Query:   219 R-GTL--KQIR---EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
                +L  KQ     + GF+IVETN+R+YAYT S L++A+IGLF  L  RF NL VGV+TR
Sbjct:   275 DYRSLHGKQSENDDDKGFIIVETNYRLYAYTSSPLQIAIIGLFANLRARFSNLVVGVITR 334

Query:   273 DSVRAALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVL 329
             DS+R AL +GI A QII +L  HA                DQI LWE E+ RL  + G+L
Sbjct:   335 DSIRRALMNGIAAEQIITYLTTHAHPQMRSNVPLLPPTLVDQIYLWELEKNRLRATPGIL 394

Query:   330 YSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
             +  F++ + F++  +YA++LGVL W +  KR   +T  G
Sbjct:   395 FRDFLTDSGFDQAVEYAKELGVLVWDSSLKRMFFITTTG 433

 Score = 45 (20.9 bits), Expect = 4.2e-63, Sum P(2) = 4.2e-63
 Identities = 10/43 (23%), Positives = 20/43 (46%)

Query:     5 IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL 47
             +A+ YV+ +LF   PV  +    W+  +      E+ + L  +
Sbjct:    39 LARQYVMSMLFNPMPVALSDFDLWIKLSSKVYQSESFNKLVRM 81


>SGD|S000006043 [details] [associations]
            symbol:TFB2 "Subunit of TFIIH and nucleotide excision repair
            factor 3 complexes" species:4932 "Saccharomyces cerevisiae"
            [GO:0005634 "nucleus" evidence=IEA] [GO:0006289
            "nucleotide-excision repair" evidence=IEA;IMP] [GO:0006974
            "response to DNA damage stimulus" evidence=IEA] [GO:0000990 "core
            RNA polymerase binding transcription factor activity" evidence=IC]
            [GO:0000439 "core TFIIH complex" evidence=IEA;IDA] [GO:0004003
            "ATP-dependent DNA helicase activity" evidence=IEA] [GO:0006281
            "DNA repair" evidence=IEA] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=IEA] [GO:0006366
            "transcription from RNA polymerase II promoter" evidence=IDA]
            [GO:0070816 "phosphorylation of RNA polymerase II C-terminal
            domain" evidence=IDA] [GO:0005675 "holo TFIIH complex"
            evidence=IDA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0000112 "nucleotide-excision repair factor 3
            complex" evidence=IPI] [GO:0003690 "double-stranded DNA binding"
            evidence=IDA] InterPro:IPR004598 Pfam:PF03849 SGD:S000006043
            GO:GO:0006366 EMBL:BK006949 GO:GO:0003690 EMBL:U43503 GO:GO:0006289
            GO:GO:0070816 GO:GO:0005675 GO:GO:0000439 GO:GO:0000112
            eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152 HOGENOM:HOG000170529
            OMA:CVLHYMV TIGRFAMs:TIGR00625 GeneTree:ENSGT00390000014159
            OrthoDB:EOG4CZFQD EMBL:U62804 PIR:S62000 RefSeq:NP_015203.1
            PDB:3DGP PDB:3DOM PDBsum:3DGP PDBsum:3DOM ProteinModelPortal:Q02939
            SMR:Q02939 DIP:DIP-5783N IntAct:Q02939 MINT:MINT-2785444
            STRING:Q02939 PaxDb:Q02939 PeptideAtlas:Q02939 EnsemblFungi:YPL122C
            GeneID:855981 KEGG:sce:YPL122C CYGD:YPL122c
            EvolutionaryTrace:Q02939 NextBio:980816 Genevestigator:Q02939
            GermOnline:YPL122C Uniprot:Q02939
        Length = 513

 Score = 270 (100.1 bits), Expect = 4.0e-57, Sum P(4) = 4.0e-57
 Identities = 70/200 (35%), Positives = 108/200 (54%)

Query:    37 HKEATDTLTELRRNAKVAL---LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
             +K +   +  L    K++L   L GG+  + S  + V+     LD L++Y+  +WE IL 
Sbjct:    90 NKSSGTLMINLNPTFKISLRNALTGGEVQN-SFGVVVEENVVSLDLLDEYSANKWETILH 148

Query:    94 FMVGSQQTEGISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYL 146
             FMVG+   +  S   +  L +      V       IT +GFQFLL E  +Q+W  +L+YL
Sbjct:   149 FMVGTPLAKIPSEKVLNLLKHSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYL 208

Query:   147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGR 206
             + +E+  +DLV+ L F+F L     G  Y  + +S+  ++ LQ +R++GLV+Q+      
Sbjct:   209 KMIETSKMDLVDVLHFIFMLGALEVGKAYKIDALSETQRIMLQDMRDYGLVFQKHSNDSI 268

Query:   207 FYPTKLALNMATRGTLKQIR 226
             FYPTKLAL M T  T K IR
Sbjct:   269 FYPTKLAL-MLTSDT-KTIR 286

 Score = 192 (72.6 bits), Expect = 4.0e-57, Sum P(4) = 4.0e-57
 Identities = 41/83 (49%), Positives = 57/83 (68%)

Query:   214 LNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
             LN A  G   Q    G LIVETNF++Y+Y++S L++A++ LF  L  RF N+ +G +TR+
Sbjct:   324 LNKA--GLKNQDIPDGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRE 381

Query:   274 SVRAALRSGITAAQIIGFLRLHA 296
             S+R AL +GITA QII +L  HA
Sbjct:   382 SIRRALTNGITADQIIAYLETHA 404

 Score = 168 (64.2 bits), Expect = 4.0e-57, Sum P(4) = 4.0e-57
 Identities = 30/66 (45%), Positives = 46/66 (69%)

Query:   310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
             DQIRLW+ E +R+ T EG LYS F +  ++  L  YAQD+GVL W+++KK+   ++K+G+
Sbjct:   439 DQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGN 498

Query:   370 SDIKKF 375
             S +  F
Sbjct:   499 SQVLDF 504

 Score = 51 (23.0 bits), Expect = 4.0e-57, Sum P(4) = 4.0e-57
 Identities = 10/46 (21%), Positives = 23/46 (50%)

Query:     2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL 47
             L  +AK +++ ++F E  VP   +  W++     + + A  ++  L
Sbjct:    39 LPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMKSL 84


>UNIPROTKB|B4DTJ5 [details] [associations]
            symbol:GTF2H4 "cDNA FLJ50212, highly similar to TFIIH basal
            transcription factor complex p52 subunit" species:9606 "Homo
            sapiens" [GO:0000439 "core TFIIH complex" evidence=IEA] [GO:0004003
            "ATP-dependent DNA helicase activity" evidence=IEA] [GO:0006289
            "nucleotide-excision repair" evidence=IEA] InterPro:IPR004598
            Pfam:PF03849 GO:GO:0005634 GO:GO:0006355 GO:GO:0006281
            EMBL:CR759747 EMBL:AL662854 EMBL:AL662870 EMBL:AL773541
            EMBL:BX927194 EMBL:AL669830 EMBL:CR936875 PANTHER:PTHR13152
            UniGene:Hs.485070 HGNC:HGNC:4658 TIGRFAMs:TIGR00625 EMBL:AK300239
            IPI:IPI01015708 STRING:B4DTJ5 Ensembl:ENST00000539324
            Ensembl:ENST00000547731 Ensembl:ENST00000549684
            Ensembl:ENST00000551614 Ensembl:ENST00000551657
            Ensembl:ENST00000552174 UCSC:uc011dmv.1 Uniprot:B4DTJ5
        Length = 220

 Score = 456 (165.6 bits), Expect = 4.8e-52, Sum P(2) = 4.8e-52
 Identities = 94/162 (58%), Positives = 116/162 (71%)

Query:    48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
             R+N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS     +S 
Sbjct:    59 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA-AVSQ 117

Query:   107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
             D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE 
Sbjct:   118 DLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 177

Query:   160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK 201
             L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRK
Sbjct:   178 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRK 219

 Score = 101 (40.6 bits), Expect = 4.8e-52, Sum P(2) = 4.8e-52
 Identities = 22/49 (44%), Positives = 32/49 (65%)

Query:    11 IRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
             +R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR      LL GG
Sbjct:     1 MRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 48


>UNIPROTKB|G4NCE6 [details] [associations]
            symbol:MGG_00397 "RNA polymerase II transcription factor B
            subunit 2" species:242507 "Magnaporthe oryzae 70-15" [GO:0043581
            "mycelium development" evidence=IEP] InterPro:IPR004598
            Pfam:PF03849 GO:GO:0005634 GO:GO:0006355 EMBL:CM001235
            GO:GO:0006281 GO:GO:0043581 KO:K03144 PANTHER:PTHR13152
            TIGRFAMs:TIGR00625 RefSeq:XP_003718679.1 EnsemblFungi:MGG_00397T0
            GeneID:2674140 KEGG:mgr:MGG_00397 Uniprot:G4NCE6
        Length = 490

 Score = 407 (148.3 bits), Expect = 1.7e-51, Sum P(3) = 1.7e-51
 Identities = 96/274 (35%), Positives = 146/274 (53%)

Query:   117 AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD-LVECLTFLFQLKFSTFGTDY 175
             +E   ITK GF FLL E  AQVW  +L++L +V     +  V+ L+FLF L     G  Y
Sbjct:   198 SERISITKTGFTFLLQEANAQVWTLLLQWLHSVNEDNTNRAVDMLSFLFMLGTLELGQAY 257

Query:   176 STEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT-LKQIRE------- 227
              T  +S++ +  L  L +FGLVY       +++PT+LA  + +  + L+ +         
Sbjct:   258 DTGALSEERRNMLPDLNDFGLVYIPPSNPDQYFPTRLATTLTSGSSALRSVSSGVAAATA 317

Query:   228 -------PGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALR 280
                     G +I+ETNFR+YAYT + L++A++ LF  L  RF  +  G L+R S++ A+ 
Sbjct:   318 EAGENNTKGAIILETNFRIYAYTSTPLQIAILALFANLKMRFAGMVTGQLSRHSIKRAIS 377

Query:   281 SGITAAQIIGFLRLHAXXXXXXXXXX--------XXXDQIRLWEGERERLTTSEGVLYSQ 332
              GITA QII +L  HA                     DQIRLW+ E ER+    G L+  
Sbjct:   378 HGITADQIIEYLASHAHEQMHRIAAIRNKPVLPPTVVDQIRLWQLETERMQVQRGYLFKD 437

Query:   333 FVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
             F SQA+F+ + DYA ++GVL W+++ ++    +K
Sbjct:   438 FESQAEFKAIADYADEVGVLIWRSDARQLFFASK 471

 Score = 80 (33.2 bits), Expect = 1.7e-51, Sum P(3) = 1.7e-51
 Identities = 21/74 (28%), Positives = 43/74 (58%)

Query:    35 KEHKEATDTLT-ELRRNAKVALLGGGK--PWSMSAKLEVDSKARDLDFLNQYALERWECI 91
             KE      +LT   +++ ++AL GGG    + + + L +  +  D+ FL+++A  RW+ I
Sbjct:    94 KEKPVQEMSLTPNFKKSLRLALEGGGSHNSFGVPSSLPIPPQV-DVAFLDKWARSRWDAI 152

Query:    92 LRFMVGS-QQTEGI 104
             L ++V S ++T+ +
Sbjct:   153 LHYVVNSVEETDSM 166

 Score = 76 (31.8 bits), Expect = 1.7e-51, Sum P(3) = 1.7e-51
 Identities = 15/38 (39%), Positives = 26/38 (68%)

Query:     2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKE 39
             L  +AK +V+ LL++ QP+P   + SW+ K  AK++K+
Sbjct:    39 LPPLAKTFVMSLLYMPQPLPLTALDSWV-KPEAKKNKD 75


>ASPGD|ASPL0000068300 [details] [associations]
            symbol:AN7015 species:162425 "Emericella nidulans"
            [GO:0006281 "DNA repair" evidence=IEA] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0003674
            "molecular_function" evidence=ND] InterPro:IPR004598 Pfam:PF03849
            GO:GO:0005634 GO:GO:0006355 GO:GO:0006281 EMBL:BN001304
            EMBL:AACD01000117 eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152
            HOGENOM:HOG000170529 OMA:CVLHYMV TIGRFAMs:TIGR00625
            OrthoDB:EOG4CZFQD RefSeq:XP_664619.1 STRING:Q5AXG5
            EnsemblFungi:CADANIAT00000461 GeneID:2870093 KEGG:ani:AN7015.2
            Uniprot:Q5AXG5
        Length = 482

 Score = 466 (169.1 bits), Expect = 5.3e-49, Sum P(2) = 5.3e-49
 Identities = 110/287 (38%), Positives = 161/287 (56%)

Query:   122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGM- 180
             IT+DGF F+L +   QVW+ ++ Y+E+ E+ G+D VE L+F+F L     G  Y  + + 
Sbjct:   194 ITQDGFAFVLQDVGTQVWHILILYVESAEAIGMDSVEVLSFIFLLSSLELGKSYEKKHLT 253

Query:   181 SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT---------RGTLKQ-----IR 226
             S++L+  L  L +FG+VYQ   +A  FYPT+LA  + +          G L         
Sbjct:   254 SNQLRT-LTDLADFGIVYQDSPEASHFYPTRLATTLTSDSSALSNPISGALSDPDGGDSN 312

Query:   227 EPG--FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGIT 284
             +PG  F+I+ETN+R+YAYT S L+++LI LF  L YRFPNL  G +TR S+R A+  GIT
Sbjct:   313 QPGSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKVTRQSIRRAIEMGIT 372

Query:   285 AAQIIGFLRLHAXXXXXXXXXXXXX----------------DQIRLWEGERERLTTSEGV 328
             A QII +L  HA                             DQIRLW+ ER+R+  + G 
Sbjct:   373 ADQIISYLATHAHPQMRKHNVARSTSNQAGMPPSVLPPTVVDQIRLWQLERDRVKATAGF 432

Query:   329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
             L+  FVS A++E    YA+++GVL W++++KR   VT+  H  +  F
Sbjct:   433 LFKDFVSLAEYEAPCRYAEEIGVLIWKSDRKRMFFVTR--HEQVAAF 477

 Score = 227 (85.0 bits), Expect = 6.0e-19, Sum P(2) = 6.0e-19
 Identities = 60/178 (33%), Positives = 95/178 (53%)

Query:    54 ALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQQTEGISADA---- 108
             AL G  +  S      + D +   +  L++YA  +WE +L +MVG+    GI  D     
Sbjct:   115 ALTGAKETQSFGVLHHIPDDQTVPIHDLDEYARRQWEGVLGYMVGTSGL-GIQRDVNLSK 173

Query:   109 -VRTLFNEG-----AENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLT 161
              V+ L   G      +  V IT+DGF F+L +   QVW+ ++ Y+E+ E+ G+D VE L+
Sbjct:   174 GVKQLLQAGHLVEIRDRRVEITQDGFAFVLQDVGTQVWHILILYVESAEAIGMDSVEVLS 233

Query:   162 FLFQLKFSTFGTDYSTEGM-SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT 218
             F+F L     G  Y  + + S++L+  L  L +FG+VYQ   +A  FYPT+LA  + +
Sbjct:   234 FIFLLSSLELGKSYEKKHLTSNQLRT-LTDLADFGIVYQDSPEASHFYPTRLATTLTS 290

 Score = 62 (26.9 bits), Expect = 5.3e-49, Sum P(2) = 5.3e-49
 Identities = 10/28 (35%), Positives = 21/28 (75%)

Query:     2 LSEIAKHYVIRLLFVEQPVPQAVIASWL 29
             L ++AK +V+ LL+++ P+P A + +W+
Sbjct:    37 LPDLAKCFVMALLYLKDPLPAADLETWV 64


>DICTYBASE|DDB_G0293228 [details] [associations]
            symbol:gtf2h4 "TFIIH subunit" species:44689
            "Dictyostelium discoideum" [GO:0006289 "nucleotide-excision repair"
            evidence=IEA;ISS] [GO:0005634 "nucleus" evidence=IEA] [GO:0004003
            "ATP-dependent DNA helicase activity" evidence=IEA] [GO:0000439
            "core TFIIH complex" evidence=IEA] [GO:0006366 "transcription from
            RNA polymerase II promoter" evidence=ISS] [GO:0005675 "holo TFIIH
            complex" evidence=ISS] [GO:0006974 "response to DNA damage
            stimulus" evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0006351 "transcription,
            DNA-dependent" evidence=IEA] [GO:0006281 "DNA repair" evidence=IEA]
            [GO:0044351 "macropinocytosis" evidence=RCA] InterPro:IPR004598
            Pfam:PF03849 dictyBase:DDB_G0293228 GenomeReviews:CM000155_GR
            GO:GO:0006355 EMBL:AAFI02000200 GO:GO:0006366 GO:GO:0006289
            GO:GO:0005675 RefSeq:XP_629237.1 ProteinModelPortal:Q54C29
            STRING:Q54C29 PRIDE:Q54C29 EnsemblProtists:DDB0231037
            GeneID:8629119 KEGG:ddi:DDB_G0293228 eggNOG:COG5144 KO:K03144
            OMA:PVNCLPI ProtClustDB:CLSZ2728699 PANTHER:PTHR13152
            Uniprot:Q54C29
        Length = 483

 Score = 454 (164.9 bits), Expect = 6.0e-48, Sum P(2) = 6.0e-48
 Identities = 97/272 (35%), Positives = 161/272 (59%)

Query:   122 ITKDGFQFLLLETPAQVWYFILKYLETVESK-GL---DLVECLTFLFQLKFSTFGTDYST 177
             IT +GF+FLL +   Q+W  ++ YL+ +E K G       + L+FLF+L F   G  Y  
Sbjct:   206 ITSEGFKFLLKDVYTQIWTLLIVYLDDLEKKKGKGSGSRNDLLSFLFRLSFLNLGRGYLV 265

Query:   178 EGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQI-----------R 226
               +S++ + +L  L++FGL+Y R   +  FYPT+L +++ T  TL  I           +
Sbjct:   266 SELSEQQKEYLFALKQFGLIYMRTDSSILFYPTRLIISLTTGKTLSLIQSISSERTQTQK 325

Query:   227 EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAA 286
             E G++++ETN+R+YAYT S+L+++L+ LF +++YR PNL VG++TR+S+R AL  GITA 
Sbjct:   326 EQGYIVLETNYRLYAYTSSSLQISLLSLFVKMLYRLPNLAVGIITRESIRTALIHGITAD 385

Query:   287 QIIGFLRLHAXXXXXXXXX---XXXXDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLR 343
             QII F+R ++                +QI LWE ER R+T ++ VLY+ F +   +    
Sbjct:   386 QIIDFVRHNSHPNAANSGQPIPDVVAEQILLWEAERNRITYTKSVLYNSFPTNDCYIATL 445

Query:   344 DYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
              +A++     W ++  +T+VV ++G+  I+ F
Sbjct:   446 KFAKEQDYYIWSHDPLKTLVVKEEGNDPIRNF 477

 Score = 215 (80.7 bits), Expect = 5.6e-15, P = 5.6e-15
 Identities = 64/242 (26%), Positives = 127/242 (52%)

Query:     5 IAKHYVIRLLFVEQ---PV-PQAVIASWLSKTHAKEHKEATDTLTE--------LRRNAK 52
             + K + ++++F+++   P+ PQ   +S  S +  ++ ++     TE         + N K
Sbjct:    73 LKKLFDLKIIFLDKINKPIQPQQQQSSQQSSSQQQQQQQQQQQQTEQTIRLNPLFQDNIK 132

Query:    53 VALLGGGKP-WSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS--QQTEGISADAV 109
              +L+   +  +S ++ ++ + K   +D L+ Y+  +WE +L F+     Q ++ IS   +
Sbjct:   133 RSLVQVNQVIFSNNSSIKDNHKPPSIDDLDSYSKSQWEKVLYFLSDDTVQPSKLISELLL 192

Query:   110 RT-LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK---GL-DLVECLTFLF 164
              + L  +  +   IT +GF+FLL +   Q+W  ++ YL+ +E K   G     + L+FLF
Sbjct:   193 SSNLTKQEGDGLSITSEGFKFLLKDVYTQIWTLLIVYLDDLEKKKGKGSGSRNDLLSFLF 252

Query:   165 QLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQ 224
             +L F   G  Y    +S++ + +L  L++FGL+Y R   +  FYPT+L +++ T  TL  
Sbjct:   253 RLSFLNLGRGYLVSELSEQQKEYLFALKQFGLIYMRTDSSILFYPTRLIISLTTGKTLSL 312

Query:   225 IR 226
             I+
Sbjct:   313 IQ 314

 Score = 64 (27.6 bits), Expect = 6.0e-48, Sum P(2) = 6.0e-48
 Identities = 11/43 (25%), Positives = 26/43 (60%)

Query:     6 AKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR 48
             +K Y++++L V+   P ++   W ++   ++HKE+   L +L+
Sbjct:    39 SKQYILKMLLVDT-YPLSLAKDWSTQASIQQHKESLKKLFDLK 80


>CGD|CAL0002355 [details] [associations]
            symbol:orf19.5846 species:5476 "Candida albicans" [GO:0005675
            "holo TFIIH complex" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
            [GO:0000112 "nucleotide-excision repair factor 3 complex"
            evidence=IEA] [GO:0003690 "double-stranded DNA binding"
            evidence=IEA] [GO:0070816 "phosphorylation of RNA polymerase II
            C-terminal domain" evidence=IEA] [GO:0006366 "transcription from
            RNA polymerase II promoter" evidence=IEA] [GO:0006289
            "nucleotide-excision repair" evidence=IEA] InterPro:IPR004598
            Pfam:PF03849 CGD:CAL0002355 GO:GO:0005634 GO:GO:0006355
            GO:GO:0006281 EMBL:AACQ01000235 EMBL:AACQ01000236 eggNOG:COG5144
            KO:K03144 PANTHER:PTHR13152 HOGENOM:HOG000170529 TIGRFAMs:TIGR00625
            RefSeq:XP_710801.1 RefSeq:XP_710812.1 STRING:Q59M53 GeneID:3647581
            GeneID:3647592 KEGG:cal:CaO19.13268 KEGG:cal:CaO19.5846
            Uniprot:Q59M53
        Length = 494

 Score = 448 (162.8 bits), Expect = 5.3e-47, Sum P(2) = 5.3e-47
 Identities = 124/356 (34%), Positives = 179/356 (50%)

Query:    48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
             R+N +  L G   P +  S    VD    D+ FL+ +A ++WE IL FMVG++ T    +
Sbjct:   103 RKNFRDCLTGSQNPNAFGSISTTVDKHKVDIPFLDLFASQKWETILHFMVGTEST-ATPS 161

Query:   107 DAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECL 160
             D+V +L       EG  N + IT  GFQFLL +  AQ+W  +L+YL   +   +D V+ L
Sbjct:   162 DSVLSLLKLGGLMEGPNNDLRITNSGFQFLLQDVNAQIWTLLLQYLNLTQELNMDPVDVL 221

Query:   161 TFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT-R 219
              F+F L     G  YS   +S+     L  L+++GLVYQR   +GRFYPT+LA  + +  
Sbjct:   222 NFIFVLGSLELGKSYSVLSLSETQVSMLADLKDYGLVYQRSDTSGRFYPTRLATTLTSDS 281

Query:   220 GTLK-----------QIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
               LK           Q+     +I+ETNF++YAYT S L++A++ LF     RF N+  G
Sbjct:   282 AALKTPSMAMDEEEQQVATKESIIIETNFKIYAYTKSPLEIAILNLFVHFKTRFANMVCG 341

Query:   269 VLTRDSVRAALRSGITAAQIIGFLRLHAXXXXXXXXXXXXXDQIRLWEGERERLTTSEGV 328
              +TR+S+R AL +GITA QII FL  HA              +I         + T+ G 
Sbjct:   342 QITRESIRNALYNGITADQIIKFLETHAHPQMRILAKEKLDKKIEF--DTSHNINTAGGA 399

Query:   329 LYSQ---FVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
               S+    VSQ   E L     D  +  WQ E  R  + T  G+  + K + N Q+
Sbjct:   400 PQSKTDGMVSQHKLEILPPNVVDQ-IKLWQLELDR--IQTFDGY--LFKDFSNQQE 450

 Score = 145 (56.1 bits), Expect = 4.9e-09, Sum P(2) = 4.9e-09
 Identities = 26/66 (39%), Positives = 43/66 (65%)

Query:   310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
             DQI+LW+ E +R+ T +G L+  F +Q +++ L +YA +LGVL W ++ K+   VTK G 
Sbjct:   422 DQIKLWQLELDRIQTFDGYLFKDFSNQQEYDILSNYASELGVLIWADKVKKKFFVTKDGM 481

Query:   370 SDIKKF 375
             + +  F
Sbjct:   482 TQVADF 487

 Score = 61 (26.5 bits), Expect = 5.3e-47, Sum P(2) = 5.3e-47
 Identities = 14/46 (30%), Positives = 23/46 (50%)

Query:     2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL 47
             LS +AK Y++ ++F E+P+    +  W   +  K   EA   L  L
Sbjct:    39 LSPMAKFYIMSMIFNEKPIALRDLDKWCKPSARKLEFEALKRLESL 84


>WB|WBGene00013529 [details] [associations]
            symbol:Y73F8A.24 species:6239 "Caenorhabditis elegans"
            [GO:0005634 "nucleus" evidence=IEA] [GO:0006281 "DNA repair"
            evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0000003 "reproduction"
            evidence=IMP] [GO:0040035 "hermaphrodite genitalia development"
            evidence=IMP] [GO:0009792 "embryo development ending in birth or
            egg hatching" evidence=IMP] InterPro:IPR004598 Pfam:PF03849
            GO:GO:0005634 GO:GO:0009792 GO:GO:0006355 GO:GO:0006281
            GO:GO:0040035 EMBL:AL132862 eggNOG:COG5144 KO:K03144 OMA:PVNCLPI
            PANTHER:PTHR13152 GeneTree:ENSGT00390000014159 GeneID:178431
            KEGG:cel:CELE_Y73F8A.24 CTD:178431 RefSeq:NP_001255863.1
            ProteinModelPortal:Q9NA35 SMR:Q9NA35 STRING:Q9NA35 PaxDb:Q9NA35
            EnsemblMetazoa:Y73F8A.24a UCSC:Y73F8A.24 WormBase:Y73F8A.24a
            HOGENOM:HOG000020092 InParanoid:Q9NA35 NextBio:901110
            ArrayExpress:Q9NA35 Uniprot:Q9NA35
        Length = 481

 Score = 492 (178.3 bits), Expect = 5.4e-47, P = 5.4e-47
 Identities = 110/287 (38%), Positives = 167/287 (58%)

Query:   113 FNEGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFST- 170
             F  G +  + IT  GFQFLLL    Q+W ++++YL+   S+G D+VE +  L Q+     
Sbjct:   180 FTSGGDTQIEITTFGFQFLLLSPVKQMWTYVIEYLKLEMSQGQDIVEVIEPLIQIVLLAN 239

Query:   171 --FGTD---YSTEG-MSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQ 224
               F  +   Y  +   ++     L HLRE G+++ RKRK G F+ T+L  ++AT  T+  
Sbjct:   240 RGFKAERECYQIDANWTEPQNELLNHLRELGVIFIRKRKDGVFFLTQLLTHLATNETIDD 299

Query:   225 IREP----GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALR 280
             +       G +IVETNFRVYAYT S L++A+I LF E+ YRF +++VG++TR+SVR AL+
Sbjct:   300 VSAEKVSNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQ 359

Query:   281 SGITAAQIIGFLRLHAXXXXXXXXX------XXXXDQIRLWEGERERLTTSEGVLYSQFV 334
              GITAAQII FLR +A                   DQIRLWE ER R+   +  +YS F 
Sbjct:   360 HGITAAQIISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFE 419

Query:   335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
             S+ +F  + +YA+   +L W + +++ V+V + GH  +++ W   +K
Sbjct:   420 SEDEFHGVCEYARQQNILLWSDNQQKLVIVNEDGHELVRQ-WYKREK 465

 Score = 226 (84.6 bits), Expect = 2.9e-16, P = 2.9e-16
 Identities = 65/208 (31%), Positives = 108/208 (51%)

Query:    37 HKEATDTL----TELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECIL 92
             H++ TD      T+ +R+   A + G    S S  LE +   R    + + A+ERW+CIL
Sbjct:    95 HRQTTDGKLNIDTDYKRSYMYAAMLGAAQIS-SLVLEPNEGNRRGKDVEKKAVERWDCIL 153

Query:    93 RFMV--GSQQTEGISADAVRTLFNE-----GAENPV-ITKDGFQFLLLETPAQVWYFILK 144
             R++     + T+ +S +  R LF +     G +  + IT  GFQFLLL    Q+W ++++
Sbjct:   154 RYLALPSEENTQAVS-ETTRNLFKKANFTSGGDTQIEITTFGFQFLLLSPVKQMWTYVIE 212

Query:   145 YLETVESKGLDLVECLTFLFQLKFST---FGTD---YSTEG-MSDKLQVFLQHLREFGLV 197
             YL+   S+G D+VE +  L Q+       F  +   Y  +   ++     L HLRE G++
Sbjct:   213 YLKLEMSQGQDIVEVIEPLIQIVLLANRGFKAERECYQIDANWTEPQNELLNHLRELGVI 272

Query:   198 YQRKRKAGRFYPTKLALNMATRGTLKQI 225
             + RKRK G F+ T+L  ++AT  T+  +
Sbjct:   273 FIRKRKDGVFFLTQLLTHLATNETIDDV 300


>UNIPROTKB|A2ABL5 [details] [associations]
            symbol:GTF2H4 "General transcription factor IIH subunit 4"
            species:9606 "Homo sapiens" [GO:0000439 "core TFIIH complex"
            evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
            evidence=IEA] [GO:0006289 "nucleotide-excision repair"
            evidence=IEA] InterPro:IPR004598 Pfam:PF03849 GO:GO:0005634
            GO:GO:0006355 GO:GO:0006281 EMBL:CR759747 EMBL:AL662854
            EMBL:AL662870 EMBL:AL773541 EMBL:BX927194 EMBL:AL669830
            EMBL:CR936875 eggNOG:COG5144 PANTHER:PTHR13152 HGNC:HGNC:4658
            IPI:IPI00798015 Ensembl:ENST00000416161 Ensembl:ENST00000423849
            Ensembl:ENST00000430576 Ensembl:ENST00000453897
            Ensembl:ENST00000454718 Ensembl:ENST00000456195
            HOGENOM:HOG000197779 Uniprot:A2ABL5
        Length = 138

 Score = 127 (49.8 bits), Expect = 1.5e-07, P = 1.5e-07
 Identities = 27/59 (45%), Positives = 40/59 (67%)

Query:     1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
             EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR      LL GG
Sbjct:    47 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 104


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.322   0.136   0.399    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      386       373   0.00087  117 3  11 22  0.50    33
                                                     34  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  18
  No. of states in DFA:  619 (66 KB)
  Total size of DFA:  246 KB (2132 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:01
  No. of threads or processors used:  24
  Search cpu time:  29.97u 0.10s 30.07t   Elapsed:  00:00:11
  Total cpu time:  29.98u 0.10s 30.08t   Elapsed:  00:00:13
  Start:  Thu Aug 15 12:11:46 2013   End:  Thu Aug 15 12:11:59 2013

Back to top