Your job contains 1 sequence.
>psy13810
ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGK
PWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISADAVRTLFNEGAENP
VITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGM
SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVY
AYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV
SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKR
TVVVTKQGHSDIKKFWVNHQKGSQHS
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= psy13810
(386 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
UNIPROTKB|Q92759 - symbol:GTF2H4 "General transcription f... 941 6.5e-106 2
UNIPROTKB|P60027 - symbol:GTF2H4 "General transcription f... 941 6.5e-106 2
RGD|1303309 - symbol:Gtf2h4 "general transcription factor... 941 6.5e-106 2
MGI|MGI:1338799 - symbol:Gtf2h4 "general transcription fa... 940 8.2e-106 2
UNIPROTKB|F1PGI3 - symbol:GTF2H4 "Uncharacterized protein... 938 2.8e-105 2
UNIPROTKB|Q767M2 - symbol:TFIIH "Uncharacterized protein"... 935 2.8e-105 2
UNIPROTKB|A6H7G8 - symbol:GTF2H4 "Uncharacterized protein... 932 5.8e-105 2
FB|FBgn0261109 - symbol:mrn "marionette" species:7227 "Dr... 741 3.5e-88 2
ZFIN|ZDB-GENE-030131-6779 - symbol:gtf2h4 "general transc... 698 2.8e-80 2
POMBASE|SPBC13G1.13 - symbol:tfb2 "transcription factor T... 617 4.2e-63 2
SGD|S000006043 - symbol:TFB2 "Subunit of TFIIH and nucleo... 270 4.0e-57 4
UNIPROTKB|B4DTJ5 - symbol:GTF2H4 "cDNA FLJ50212, highly s... 456 4.8e-52 2
UNIPROTKB|G4NCE6 - symbol:MGG_00397 "RNA polymerase II tr... 407 1.7e-51 3
ASPGD|ASPL0000068300 - symbol:AN7015 species:162425 "Emer... 466 5.3e-49 2
DICTYBASE|DDB_G0293228 - symbol:gtf2h4 "TFIIH subunit" sp... 454 6.0e-48 2
CGD|CAL0002355 - symbol:orf19.5846 species:5476 "Candida ... 448 5.3e-47 2
WB|WBGene00013529 - symbol:Y73F8A.24 species:6239 "Caenor... 492 5.4e-47 1
UNIPROTKB|A2ABL5 - symbol:GTF2H4 "General transcription f... 127 1.5e-07 1
>UNIPROTKB|Q92759 [details] [associations]
symbol:GTF2H4 "General transcription factor IIH subunit 4"
species:9606 "Homo sapiens" [GO:0000439 "core TFIIH complex"
evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
promoter" evidence=IDA;TAS] [GO:0004672 "protein kinase activity"
evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0005675 "holo TFIIH complex" evidence=IDA] [GO:0008353 "RNA
polymerase II carboxy-terminal domain kinase activity"
evidence=IDA] [GO:0008094 "DNA-dependent ATPase activity"
evidence=IDA] [GO:0003700 "sequence-specific DNA binding
transcription factor activity" evidence=TAS] [GO:0000718
"nucleotide-excision repair, DNA damage removal" evidence=TAS]
[GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006281 "DNA repair"
evidence=TAS] [GO:0006283 "transcription-coupled
nucleotide-excision repair" evidence=TAS] [GO:0006289
"nucleotide-excision repair" evidence=TAS] [GO:0006360
"transcription from RNA polymerase I promoter" evidence=TAS]
[GO:0006361 "transcription initiation from RNA polymerase I
promoter" evidence=TAS] [GO:0006362 "transcription elongation from
RNA polymerase I promoter" evidence=TAS] [GO:0006363 "termination
of RNA polymerase I transcription" evidence=TAS] [GO:0006367
"transcription initiation from RNA polymerase II promoter"
evidence=TAS] [GO:0006368 "transcription elongation from RNA
polymerase II promoter" evidence=TAS] [GO:0006370
"7-methylguanosine mRNA capping" evidence=TAS] [GO:0010467 "gene
expression" evidence=TAS] [GO:0016032 "viral reproduction"
evidence=TAS] [GO:0050434 "positive regulation of viral
transcription" evidence=TAS] [GO:0006468 "protein phosphorylation"
evidence=IDA] Reactome:REACT_216 Reactome:REACT_71
InterPro:IPR004598 Pfam:PF03849 Reactome:REACT_116125 GO:GO:0016032
EMBL:BA000025 EMBL:CH471081 Reactome:REACT_1675 GO:GO:0006468
GO:GO:0003700 GO:GO:0006368 GO:GO:0006367 Reactome:REACT_1788
GO:GO:0005675 GO:GO:0006370 GO:GO:0000718 GO:GO:0050434
GO:GO:0006363 GO:GO:0006362 GO:GO:0006361 GO:GO:0006283
EMBL:CR759747 EMBL:AL773541 EMBL:BX927194 EMBL:CR936875
eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152 EMBL:Y07595
EMBL:BT007321 EMBL:AY124590 EMBL:AB088103 EMBL:AB202101
EMBL:AB103609 EMBL:BC004935 EMBL:BC016302 IPI:IPI00016839
RefSeq:NP_001508.1 UniGene:Hs.485070 ProteinModelPortal:Q92759
SMR:Q92759 DIP:DIP-48376N STRING:Q92759 PhosphoSite:Q92759
DMDM:17380328 PaxDb:Q92759 PRIDE:Q92759 DNASU:2968
Ensembl:ENST00000259895 Ensembl:ENST00000376316
Ensembl:ENST00000376326 Ensembl:ENST00000400450
Ensembl:ENST00000413314 Ensembl:ENST00000416773
Ensembl:ENST00000423881 Ensembl:ENST00000434226
Ensembl:ENST00000435498 Ensembl:ENST00000438348
Ensembl:ENST00000440824 Ensembl:ENST00000456968 GeneID:2968
KEGG:hsa:2968 UCSC:uc003nsa.1 CTD:2968 GeneCards:GC06P030875
HGNC:HGNC:4658 MIM:601760 neXtProt:NX_Q92759 PharmGKB:PA29044
HOGENOM:HOG000170529 HOVERGEN:HBG054205 InParanoid:Q92759
OMA:CVLHYMV OrthoDB:EOG4F1X34 PhylomeDB:Q92759 GenomeRNAi:2968
NextBio:11764 ArrayExpress:Q92759 Bgee:Q92759 CleanEx:HS_GTF2H4
Genevestigator:Q92759 GermOnline:ENSG00000137411 TIGRFAMs:TIGR00625
Uniprot:Q92759
Length = 462
Score = 941 (336.3 bits), Expect = 6.5e-106, Sum P(2) = 6.5e-106
Identities = 192/349 (55%), Positives = 249/349 (71%)
Query: 48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
R+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS +S
Sbjct: 115 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA-AVSQ 173
Query: 107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE
Sbjct: 174 DLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 233
Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT- 218
L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+LA+N+++
Sbjct: 234 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAINLSSG 293
Query: 219 -RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
G + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V +TR+SV+
Sbjct: 294 VSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQ 353
Query: 278 ALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFV 334
A+ SGITA QII FLR A DQIRLWE ER+RL +EGVLY+QF+
Sbjct: 354 AIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFL 413
Query: 335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW QK S
Sbjct: 414 SQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFW-KRQKHS 461
Score = 127 (49.8 bits), Expect = 6.5e-106, Sum P(2) = 6.5e-106
Identities = 27/59 (45%), Positives = 40/59 (67%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR LL GG
Sbjct: 47 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 104
>UNIPROTKB|P60027 [details] [associations]
symbol:GTF2H4 "General transcription factor IIH subunit 4"
species:9598 "Pan troglodytes" [GO:0005675 "holo TFIIH complex"
evidence=ISS] [GO:0006366 "transcription from RNA polymerase II
promoter" evidence=ISS] [GO:0004672 "protein kinase activity"
evidence=ISS] [GO:0008094 "DNA-dependent ATPase activity"
evidence=ISS] [GO:0008353 "RNA polymerase II carboxy-terminal
domain kinase activity" evidence=ISS] InterPro:IPR004598
Pfam:PF03849 GO:GO:0006355 GO:GO:0006281 EMBL:BA000041
GO:GO:0006366 GO:GO:0008353 GO:GO:0005675 GO:GO:0008094
eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152 CTD:2968
HOGENOM:HOG000170529 HOVERGEN:HBG054205 OMA:CVLHYMV
OrthoDB:EOG4F1X34 TIGRFAMs:TIGR00625 EMBL:AB210199 EMBL:AB210200
RefSeq:NP_001038968.1 UniGene:Ptr.6202 ProteinModelPortal:P60027
SMR:P60027 STRING:P60027 PRIDE:P60027 Ensembl:ENSPTRT00000033136
GeneID:462549 KEGG:ptr:462549 GeneTree:ENSGT00390000014159
InParanoid:P60027 NextBio:20841799 Uniprot:P60027
Length = 462
Score = 941 (336.3 bits), Expect = 6.5e-106, Sum P(2) = 6.5e-106
Identities = 192/349 (55%), Positives = 249/349 (71%)
Query: 48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
R+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS +S
Sbjct: 115 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA-AVSQ 173
Query: 107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE
Sbjct: 174 DLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 233
Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT- 218
L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+LA+N+++
Sbjct: 234 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAINLSSG 293
Query: 219 -RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
G + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V +TR+SV+
Sbjct: 294 VSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQ 353
Query: 278 ALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFV 334
A+ SGITA QII FLR A DQIRLWE ER+RL +EGVLY+QF+
Sbjct: 354 AIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFL 413
Query: 335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW QK S
Sbjct: 414 SQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFW-KRQKHS 461
Score = 127 (49.8 bits), Expect = 6.5e-106, Sum P(2) = 6.5e-106
Identities = 27/59 (45%), Positives = 40/59 (67%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR LL GG
Sbjct: 47 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 104
>RGD|1303309 [details] [associations]
symbol:Gtf2h4 "general transcription factor II H, polypeptide 4"
species:10116 "Rattus norvegicus" [GO:0000439 "core TFIIH complex"
evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0005675 "holo TFIIH complex" evidence=IEA;ISO]
[GO:0006289 "nucleotide-excision repair" evidence=IEA] [GO:0006366
"transcription from RNA polymerase II promoter" evidence=IEA;ISO]
[GO:0006468 "protein phosphorylation" evidence=ISO] [GO:0008353
"RNA polymerase II carboxy-terminal domain kinase activity"
evidence=IEA;ISO] [GO:0004672 "protein kinase activity"
evidence=ISO] [GO:0008094 "DNA-dependent ATPase activity"
evidence=ISO] InterPro:IPR004598 Pfam:PF03849 RGD:1303309
GO:GO:0006355 GO:GO:0006281 GO:GO:0006366 GO:GO:0008353
GO:GO:0005675 GO:GO:0008094 EMBL:BX883047 eggNOG:COG5144 KO:K03144
PANTHER:PTHR13152 CTD:2968 HOGENOM:HOG000170529 HOVERGEN:HBG054205
OMA:CVLHYMV OrthoDB:EOG4F1X34 TIGRFAMs:TIGR00625
GeneTree:ENSGT00390000014159 EMBL:BC127469 IPI:IPI00215269
RefSeq:NP_997666.1 UniGene:Rn.34122 STRING:Q6MG20
Ensembl:ENSRNOT00000001103 GeneID:294236 KEGG:rno:294236
UCSC:RGD:1303309 InParanoid:Q6MG20 NextBio:637807
Genevestigator:Q6MG20 Uniprot:Q6MG20
Length = 463
Score = 941 (336.3 bits), Expect = 6.5e-106, Sum P(2) = 6.5e-106
Identities = 192/349 (55%), Positives = 249/349 (71%)
Query: 48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
R+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS +S
Sbjct: 116 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA-AVSQ 174
Query: 107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE
Sbjct: 175 DLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 234
Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT- 218
L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+LA+N+++
Sbjct: 235 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAINLSSG 294
Query: 219 -RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
G + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V +TR+SV+
Sbjct: 295 VSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQ 354
Query: 278 ALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFV 334
A+ SGITA QII FLR A DQIRLWE ER+RL +EGVLY+QF+
Sbjct: 355 AIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFL 414
Query: 335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW QK S
Sbjct: 415 SQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFW-KRQKHS 462
Score = 127 (49.8 bits), Expect = 6.5e-106, Sum P(2) = 6.5e-106
Identities = 27/59 (45%), Positives = 40/59 (67%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR LL GG
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 105
>MGI|MGI:1338799 [details] [associations]
symbol:Gtf2h4 "general transcription factor II H,
polypeptide 4" species:10090 "Mus musculus" [GO:0000439 "core TFIIH
complex" evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase
activity" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
[GO:0005675 "holo TFIIH complex" evidence=ISO] [GO:0006281 "DNA
repair" evidence=IEA] [GO:0006289 "nucleotide-excision repair"
evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
evidence=IEA] [GO:0006355 "regulation of transcription,
DNA-dependent" evidence=IEA] [GO:0006366 "transcription from RNA
polymerase II promoter" evidence=ISO] [GO:0006468 "protein
phosphorylation" evidence=ISO] [GO:0006974 "response to DNA damage
stimulus" evidence=IEA] InterPro:IPR004598 Pfam:PF03849
MGI:MGI:1338799 GO:GO:0006355 GO:GO:0006281 GO:GO:0006366
GO:GO:0008353 GO:GO:0005675 GO:GO:0008094 eggNOG:COG5144 KO:K03144
PANTHER:PTHR13152 CTD:2968 HOGENOM:HOG000170529 HOVERGEN:HBG054205
OMA:CVLHYMV OrthoDB:EOG4F1X34 TIGRFAMs:TIGR00625 EMBL:AF054823
EMBL:BC012638 IPI:IPI00118172 RefSeq:NP_034494.1 UniGene:Mm.10182
ProteinModelPortal:O70422 SMR:O70422 IntAct:O70422 STRING:O70422
PhosphoSite:O70422 PaxDb:O70422 PRIDE:O70422
Ensembl:ENSMUST00000001565 Ensembl:ENSMUST00000160734 GeneID:14885
KEGG:mmu:14885 InParanoid:O70422 NextBio:287167 Bgee:O70422
CleanEx:MM_GTF2H4 Genevestigator:O70422
GermOnline:ENSMUSG00000001524 Uniprot:O70422
Length = 463
Score = 940 (336.0 bits), Expect = 8.2e-106, Sum P(2) = 8.2e-106
Identities = 192/349 (55%), Positives = 249/349 (71%)
Query: 48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
R+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS +S
Sbjct: 116 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA-AVSQ 174
Query: 107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE
Sbjct: 175 DLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 234
Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT- 218
L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+LA+N+++
Sbjct: 235 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAINLSSG 294
Query: 219 -RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
G + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V +TR+SV+
Sbjct: 295 VSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQ 354
Query: 278 ALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFV 334
A+ SGITA QII FLR A DQIRLWE ER+RL +EGVLY+QF+
Sbjct: 355 AIASGITAQQIIHFLRTRAHPVMLKQNPVLPPTITDQIRLWELERDRLRFTEGVLYNQFL 414
Query: 335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW QK S
Sbjct: 415 SQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFW-KRQKHS 462
Score = 127 (49.8 bits), Expect = 8.2e-106, Sum P(2) = 8.2e-106
Identities = 27/59 (45%), Positives = 40/59 (67%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR LL GG
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 105
>UNIPROTKB|F1PGI3 [details] [associations]
symbol:GTF2H4 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0006289 "nucleotide-excision repair"
evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
InterPro:IPR004598 Pfam:PF03849 GO:GO:0005634 GO:GO:0006355
GO:GO:0006281 KO:K03144 PANTHER:PTHR13152 OMA:CVLHYMV
TIGRFAMs:TIGR00625 GeneTree:ENSGT00390000014159 EMBL:AAEX03008206
RefSeq:XP_003639474.1 Ensembl:ENSCAFT00000000727 GeneID:100856712
KEGG:cfa:100856712 Uniprot:F1PGI3
Length = 463
Score = 938 (335.3 bits), Expect = 2.8e-105, Sum P(2) = 2.8e-105
Identities = 193/349 (55%), Positives = 248/349 (71%)
Query: 48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
R+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS +S
Sbjct: 116 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA-AVSQ 174
Query: 107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE
Sbjct: 175 DLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 234
Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT- 218
L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+LA+N+++
Sbjct: 235 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAINLSSG 294
Query: 219 -RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
G +PGF+IVETN+R+YAYT+S L++ALI LF E++YRFPN+ V +TR+SV+
Sbjct: 295 VSGAGGTAHQPGFIIVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQ 354
Query: 278 ALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFV 334
A+ SGITA QII FLR A DQIRLWE ER+RL +EGVLY+QF+
Sbjct: 355 AIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFL 414
Query: 335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW QK S
Sbjct: 415 SQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFW-KRQKHS 462
Score = 124 (48.7 bits), Expect = 2.8e-105, Sum P(2) = 2.8e-105
Identities = 27/59 (45%), Positives = 40/59 (67%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR LL GG
Sbjct: 48 ELPFLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 105
>UNIPROTKB|Q767M2 [details] [associations]
symbol:TFIIH "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008353 "RNA polymerase II carboxy-terminal domain
kinase activity" evidence=IEA] [GO:0006366 "transcription from RNA
polymerase II promoter" evidence=IEA] [GO:0005675 "holo TFIIH
complex" evidence=IEA] [GO:0006289 "nucleotide-excision repair"
evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
InterPro:IPR004598 Pfam:PF03849 GO:GO:0006355 GO:GO:0006281
GO:GO:0006366 GO:GO:0008353 GO:GO:0005675 GO:GO:0008094
EMBL:AB113354 EMBL:AB113355 eggNOG:COG5144 KO:K03144
PANTHER:PTHR13152 CTD:2968 HOGENOM:HOG000170529 HOVERGEN:HBG054205
OMA:CVLHYMV OrthoDB:EOG4F1X34 TIGRFAMs:TIGR00625
GeneTree:ENSGT00390000014159 EMBL:CT971577 RefSeq:NP_001116592.1
UniGene:Ssc.7512 STRING:Q767M2 Ensembl:ENSSSCT00000001505
GeneID:100144458 KEGG:ssc:100144458 Uniprot:Q767M2
Length = 463
Score = 935 (334.2 bits), Expect = 2.8e-105, Sum P(2) = 2.8e-105
Identities = 191/349 (54%), Positives = 248/349 (71%)
Query: 48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
R+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS +S
Sbjct: 116 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA-AVSQ 174
Query: 107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE
Sbjct: 175 DLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 234
Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT- 218
L+FLFQL FST G DYS EGMSD L FLQHLRE GLV+QRKRK+ R+YPT+LA+N+++
Sbjct: 235 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLRELGLVFQRKRKSRRYYPTRLAINLSSG 294
Query: 219 -RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
G + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V +TR+SV+
Sbjct: 295 VSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQ 354
Query: 278 ALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFV 334
A+ SGITA QII FLR A DQIRLWE ER+RL +EGVLY+QF+
Sbjct: 355 AIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFL 414
Query: 335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW QK S
Sbjct: 415 SQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFW-KRQKHS 462
Score = 127 (49.8 bits), Expect = 2.8e-105, Sum P(2) = 2.8e-105
Identities = 27/59 (45%), Positives = 40/59 (67%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR LL GG
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 105
>UNIPROTKB|A6H7G8 [details] [associations]
symbol:GTF2H4 "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0008353 "RNA polymerase II carboxy-terminal domain
kinase activity" evidence=IEA] [GO:0006366 "transcription from RNA
polymerase II promoter" evidence=IEA] [GO:0005675 "holo TFIIH
complex" evidence=IEA] [GO:0006289 "nucleotide-excision repair"
evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
InterPro:IPR004598 Pfam:PF03849 GO:GO:0006355 GO:GO:0006281
GO:GO:0006366 GO:GO:0008353 GO:GO:0005675 GO:GO:0008094
eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152 CTD:2968
HOGENOM:HOG000170529 HOVERGEN:HBG054205 OMA:CVLHYMV
TIGRFAMs:TIGR00625 GeneTree:ENSGT00390000014159 EMBL:DAAA02055395
EMBL:BC146240 IPI:IPI00707368 RefSeq:NP_001094527.1
UniGene:Bt.98689 Ensembl:ENSBTAT00000054213 GeneID:100137723
KEGG:bta:100137723 NextBio:20789427 Uniprot:A6H7G8
Length = 463
Score = 932 (333.1 bits), Expect = 5.8e-105, Sum P(2) = 5.8e-105
Identities = 190/349 (54%), Positives = 247/349 (70%)
Query: 48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
R+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS +S
Sbjct: 116 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVLSLDKYAEERWEVVLHFMVGSPSA-AVSQ 174
Query: 107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE
Sbjct: 175 DLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 234
Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT- 218
L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+LA+N+++
Sbjct: 235 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAINLSSG 294
Query: 219 -RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
G +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V +TR+SV+
Sbjct: 295 VSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQ 354
Query: 278 ALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFV 334
A+ SGITA QII FLR A DQIRLWE ER+RL +EGVLY+QF+
Sbjct: 355 AIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFL 414
Query: 335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 415 SQVDFELLLAHARELGVLMFENSAKRLMVVTPAGHSDVKRFWKRQKHNS 463
Score = 127 (49.8 bits), Expect = 5.8e-105, Sum P(2) = 5.8e-105
Identities = 27/59 (45%), Positives = 40/59 (67%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR LL GG
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 105
>FB|FBgn0261109 [details] [associations]
symbol:mrn "marionette" species:7227 "Drosophila
melanogaster" [GO:0005675 "holo TFIIH complex" evidence=ISS;IDA]
[GO:0006367 "transcription initiation from RNA polymerase II
promoter" evidence=ISS] [GO:0000439 "core TFIIH complex"
evidence=ISS;IPI] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0001671 "ATPase activator activity" evidence=IDA]
[GO:0001111 "promoter clearance from RNA polymerase II promoter"
evidence=ISS] [GO:0006289 "nucleotide-excision repair"
evidence=IDA;IMP] [GO:0010224 "response to UV-B" evidence=IMP]
[GO:0005634 "nucleus" evidence=IDA] [GO:0001113 "transcriptional
open complex formation at RNA polymerase II promoter" evidence=ISS]
[GO:0006366 "transcription from RNA polymerase II promoter"
evidence=IDA] InterPro:IPR004598 Pfam:PF03849 EMBL:AE014296
GO:GO:0006355 GO:GO:0001671 GO:GO:0006367 GO:GO:0010224
GO:GO:0006289 GO:GO:0005675 GO:GO:0000439 KO:K03144
PANTHER:PTHR13152 OMA:CVLHYMV TIGRFAMs:TIGR00625
GeneTree:ENSGT00390000014159 FlyBase:FBgn0261109 RefSeq:NP_648780.1
ProteinModelPortal:Q9VUR1 SMR:Q9VUR1 IntAct:Q9VUR1 MINT:MINT-849737
STRING:Q9VUR1 PRIDE:Q9VUR1 EnsemblMetazoa:FBtr0075595 GeneID:39688
KEGG:dme:Dmel_CG7764 UCSC:CG7764-RA CTD:39688 InParanoid:Q9VUR1
PhylomeDB:Q9VUR1 GenomeRNAi:39688 NextBio:814886
ArrayExpress:Q9VUR1 Bgee:Q9VUR1 Uniprot:Q9VUR1
Length = 499
Score = 741 (265.9 bits), Expect = 3.5e-88, Sum P(2) = 3.5e-88
Identities = 147/269 (54%), Positives = 194/269 (72%)
Query: 48 RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS-----QQTE 102
+++ + LLGGGKPW M+ L+ DSK RD+ FL+ YA+ RW C+L +MVG+ E
Sbjct: 134 KKSVRQVLLGGGKPWPMTNTLDKDSKPRDIAFLDTYAMSRWRCVLHYMVGTGNRNGTDAE 193
Query: 103 GISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
IS DAVR L + + + IT+ GFQFLLL+T AQVW+F+L+YL+T E +G+ L
Sbjct: 194 AISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDTCEERGISL 253
Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM 216
ECL+ LFQL FST G DYS+EGM+ ++ FLQHLREFGLV+QRKRK GRFYPT+LALN+
Sbjct: 254 PECLSMLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLVFQRKRKEGRFYPTRLALNV 313
Query: 217 ATRGTL---------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTV 267
++ + ++ G+++VETN+RVYAYTDS L+VA++GLF EL+YRFPNL V
Sbjct: 314 TSKEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVV 373
Query: 268 GVLTRDSVRAALRSGITAAQIIGFLRLHA 296
GVLTRDSVR ALR GITA QI+ +L +A
Sbjct: 374 GVLTRDSVRQALRGGITAEQIVSYLEQYA 402
Score = 458 (166.3 bits), Expect = 2.3e-58, Sum P(2) = 2.3e-58
Identities = 90/162 (55%), Positives = 113/162 (69%)
Query: 229 GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
G+++VETN+RVYAYTDS L+VA++GLF EL+YRFPNL VGVLTRDSVR ALR GITA QI
Sbjct: 335 GYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 394
Query: 289 IGFLRLHAXXXXXXXXXXXXX---------DQIRLWEGERERLTTSEGVLYSQFVSQADF 339
+ +L +A DQI+LWE ER R T +EGVLY+QF+S DF
Sbjct: 395 VSYLEQYAHPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHTDF 454
Query: 340 EKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
LRDYAQ + +L WQNE+ RT+VV K GH D+K++W + K
Sbjct: 455 VTLRDYAQSIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 496
Score = 159 (61.0 bits), Expect = 3.5e-88, Sum P(2) = 3.5e-88
Identities = 33/65 (50%), Positives = 43/65 (66%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGK 60
EL EIA+ ++IR+LFV+QPVPQAV+ SW ++ AKE EAT LT L A+ GG
Sbjct: 66 ELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLS 125
Query: 61 PWSMS 65
W +S
Sbjct: 126 AWELS 130
>ZFIN|ZDB-GENE-030131-6779 [details] [associations]
symbol:gtf2h4 "general transcription factor IIH,
polypeptide 4" species:7955 "Danio rerio" [GO:0005634 "nucleus"
evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0006289 "nucleotide-excision repair"
evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
InterPro:IPR004598 Pfam:PF03849 ZFIN:ZDB-GENE-030131-6779
GO:GO:0005634 GO:GO:0006355 GO:GO:0006281 KO:K03144
PANTHER:PTHR13152 CTD:2968 HOVERGEN:HBG054205 TIGRFAMs:TIGR00625
EMBL:BC064301 IPI:IPI00511064 RefSeq:NP_956221.1 UniGene:Dr.75464
STRING:Q6P2T8 GeneID:334839 KEGG:dre:334839 InParanoid:Q6P2T8
NextBio:20810565 ArrayExpress:Q6P2T8 Uniprot:Q6P2T8
Length = 466
Score = 698 (250.8 bits), Expect = 2.8e-80, Sum P(2) = 2.8e-80
Identities = 143/268 (53%), Positives = 186/268 (69%)
Query: 50 NAKVALLGGGKPWS-MSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISADA 108
N ++ALLGGGKPW+ A L D ARD++ L++YA+ERWE IL FMVGS +S D
Sbjct: 110 NLRIALLGGGKPWADEGANLGPDRHARDVESLDRYAMERWEVILHFMVGSPSA-AVSQDL 168
Query: 109 VRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTF 162
+ L G E P IT GFQFLLL+T +Q+WYF L+YL+T +S+G+DLVE L+F
Sbjct: 169 AQLLIQAGLMKSETGEAPCITSAGFQFLLLDTASQLWYFTLQYLKTAQSRGMDLVEILSF 228
Query: 163 LFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR--- 219
LFQL FST G DYS EGMS+ L FLQHLREFGLV+QRKRK+ R+YPT+LA+ +A
Sbjct: 229 LFQLSFSTLGRDYSVEGMSESLLTFLQHLREFGLVFQRKRKSRRYYPTRLAITLAAGVTA 288
Query: 220 -----------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
G + + GF++VETN+R+YAYT+S L++AL+ LF E++YRFPNL V
Sbjct: 289 NPASGSASSALGAIPGTGDTGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNLVVA 348
Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHA 296
+TR+SV+ A+ +GITA QII FLR A
Sbjct: 349 QVTRESVQQAISNGITAQQIIHFLRTRA 376
Score = 446 (162.1 bits), Expect = 9.9e-54, Sum P(2) = 9.9e-54
Identities = 91/170 (53%), Positives = 117/170 (68%)
Query: 220 GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAAL 279
G + + GF++VETN+R+YAYT+S L++AL+ LF E++YRFPNL V +TR+SV+ A+
Sbjct: 300 GAIPGTGDTGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNLVVAQVTRESVQQAI 359
Query: 280 RSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVLYSQFVSQ 336
+GITA QII FLR A DQIRLWE E++RL +EGVLY+QF+SQ
Sbjct: 360 SNGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELEKDRLQFTEGVLYNQFLSQ 419
Query: 337 ADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQHS 386
ADFE LRD AQ LGVL WQN R +VVT GHS++K+FW QK HS
Sbjct: 420 ADFEVLRDRAQGLGVLVWQNPAHRVMVVTPHGHSEVKRFW-KRQKS--HS 466
Score = 127 (49.8 bits), Expect = 2.8e-80, Sum P(2) = 2.8e-80
Identities = 25/58 (43%), Positives = 36/58 (62%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGG 58
EL +AK+YV+R+LF++ P+PQA +A W+ K K+H + LT LR L GG
Sbjct: 40 ELPPLAKNYVMRMLFLDHPLPQAAVALWVKKGSQKDHDQCVSVLTGLRLWHSQHLQGG 97
>POMBASE|SPBC13G1.13 [details] [associations]
symbol:tfb2 "transcription factor TFIIH complex subunit
Tfb2" species:4896 "Schizosaccharomyces pombe" [GO:0000112
"nucleotide-excision repair factor 3 complex" evidence=ISO]
[GO:0000439 "core TFIIH complex" evidence=ISO] [GO:0000717
"nucleotide-excision repair, DNA duplex unwinding" evidence=ISO]
[GO:0000991 "core RNA polymerase II binding transcription factor
activity" evidence=ISO] [GO:0004003 "ATP-dependent DNA helicase
activity" evidence=IEA] [GO:0005675 "holo TFIIH complex"
evidence=ISO] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006355
"regulation of transcription, DNA-dependent" evidence=IEA]
[GO:0006367 "transcription initiation from RNA polymerase II
promoter" evidence=ISO] InterPro:IPR004598 Pfam:PF03849
PomBase:SPBC13G1.13 GO:GO:0005737 GO:GO:0006355 EMBL:CU329671
GenomeReviews:CU329671_GR GO:GO:0006367 GO:GO:0000991 GO:GO:0005675
GO:GO:0000717 GO:GO:0000439 GO:GO:0000112 eggNOG:COG5144 KO:K03144
OMA:PVNCLPI PANTHER:PTHR13152 HOGENOM:HOG000170529
TIGRFAMs:TIGR00625 PIR:T39414 RefSeq:NP_596563.1
ProteinModelPortal:P87303 IntAct:P87303 STRING:P87303
EnsemblFungi:SPBC13G1.13.1 GeneID:2539790 KEGG:spo:SPBC13G1.13
OrthoDB:EOG4CZFQD NextBio:20800939 Uniprot:P87303
Length = 447
Score = 617 (222.3 bits), Expect = 4.2e-63, Sum P(2) = 4.2e-63
Identities = 144/339 (42%), Positives = 197/339 (58%)
Query: 45 TELRRNAKVALLGGGKPWSMSAKL-EVDSKARDLDFLNQYALERWECILRFMVGSQQT-- 101
+E R+ AL GGG S + D +DFL+ YA E WE IL FMVG+ +
Sbjct: 95 SEFRKQFITALTGGGNHNSFGVPCTDEDKHLVTVDFLDAYAKETWETILHFMVGTPEAKF 154
Query: 102 --EGISADAVRTLFNEGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVE 158
EG+ + R G +N + IT+ GFQFLL + AQ+W +L YL+ E +D V+
Sbjct: 155 PGEGVLSLLKRGGLMSGPKNQLRITRAGFQFLLQDINAQIWTLLLDYLKLSEDTHMDPVQ 214
Query: 159 CLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT 218
L FLF L G YS + ++D Q+ L+ LRE+GLVYQRK + RFYPT+LA + T
Sbjct: 215 VLHFLFMLGSLDLGRAYSVDFLTDTQQIMLEDLREYGLVYQRKITSKRFYPTRLATGLTT 274
Query: 219 R-GTL--KQIR---EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
+L KQ + GF+IVETN+R+YAYT S L++A+IGLF L RF NL VGV+TR
Sbjct: 275 DYRSLHGKQSENDDDKGFIIVETNYRLYAYTSSPLQIAIIGLFANLRARFSNLVVGVITR 334
Query: 273 DSVRAALRSGITAAQIIGFLRLHAXXXXXXXXXX---XXXDQIRLWEGERERLTTSEGVL 329
DS+R AL +GI A QII +L HA DQI LWE E+ RL + G+L
Sbjct: 335 DSIRRALMNGIAAEQIITYLTTHAHPQMRSNVPLLPPTLVDQIYLWELEKNRLRATPGIL 394
Query: 330 YSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
+ F++ + F++ +YA++LGVL W + KR +T G
Sbjct: 395 FRDFLTDSGFDQAVEYAKELGVLVWDSSLKRMFFITTTG 433
Score = 45 (20.9 bits), Expect = 4.2e-63, Sum P(2) = 4.2e-63
Identities = 10/43 (23%), Positives = 20/43 (46%)
Query: 5 IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL 47
+A+ YV+ +LF PV + W+ + E+ + L +
Sbjct: 39 LARQYVMSMLFNPMPVALSDFDLWIKLSSKVYQSESFNKLVRM 81
>SGD|S000006043 [details] [associations]
symbol:TFB2 "Subunit of TFIIH and nucleotide excision repair
factor 3 complexes" species:4932 "Saccharomyces cerevisiae"
[GO:0005634 "nucleus" evidence=IEA] [GO:0006289
"nucleotide-excision repair" evidence=IEA;IMP] [GO:0006974
"response to DNA damage stimulus" evidence=IEA] [GO:0000990 "core
RNA polymerase binding transcription factor activity" evidence=IC]
[GO:0000439 "core TFIIH complex" evidence=IEA;IDA] [GO:0004003
"ATP-dependent DNA helicase activity" evidence=IEA] [GO:0006281
"DNA repair" evidence=IEA] [GO:0006355 "regulation of
transcription, DNA-dependent" evidence=IEA] [GO:0006366
"transcription from RNA polymerase II promoter" evidence=IDA]
[GO:0070816 "phosphorylation of RNA polymerase II C-terminal
domain" evidence=IDA] [GO:0005675 "holo TFIIH complex"
evidence=IDA] [GO:0006351 "transcription, DNA-dependent"
evidence=IEA] [GO:0000112 "nucleotide-excision repair factor 3
complex" evidence=IPI] [GO:0003690 "double-stranded DNA binding"
evidence=IDA] InterPro:IPR004598 Pfam:PF03849 SGD:S000006043
GO:GO:0006366 EMBL:BK006949 GO:GO:0003690 EMBL:U43503 GO:GO:0006289
GO:GO:0070816 GO:GO:0005675 GO:GO:0000439 GO:GO:0000112
eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152 HOGENOM:HOG000170529
OMA:CVLHYMV TIGRFAMs:TIGR00625 GeneTree:ENSGT00390000014159
OrthoDB:EOG4CZFQD EMBL:U62804 PIR:S62000 RefSeq:NP_015203.1
PDB:3DGP PDB:3DOM PDBsum:3DGP PDBsum:3DOM ProteinModelPortal:Q02939
SMR:Q02939 DIP:DIP-5783N IntAct:Q02939 MINT:MINT-2785444
STRING:Q02939 PaxDb:Q02939 PeptideAtlas:Q02939 EnsemblFungi:YPL122C
GeneID:855981 KEGG:sce:YPL122C CYGD:YPL122c
EvolutionaryTrace:Q02939 NextBio:980816 Genevestigator:Q02939
GermOnline:YPL122C Uniprot:Q02939
Length = 513
Score = 270 (100.1 bits), Expect = 4.0e-57, Sum P(4) = 4.0e-57
Identities = 70/200 (35%), Positives = 108/200 (54%)
Query: 37 HKEATDTLTELRRNAKVAL---LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
+K + + L K++L L GG+ + S + V+ LD L++Y+ +WE IL
Sbjct: 90 NKSSGTLMINLNPTFKISLRNALTGGEVQN-SFGVVVEENVVSLDLLDEYSANKWETILH 148
Query: 94 FMVGSQQTEGISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYL 146
FMVG+ + S + L + V IT +GFQFLL E +Q+W +L+YL
Sbjct: 149 FMVGTPLAKIPSEKVLNLLKHSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYL 208
Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGR 206
+ +E+ +DLV+ L F+F L G Y + +S+ ++ LQ +R++GLV+Q+
Sbjct: 209 KMIETSKMDLVDVLHFIFMLGALEVGKAYKIDALSETQRIMLQDMRDYGLVFQKHSNDSI 268
Query: 207 FYPTKLALNMATRGTLKQIR 226
FYPTKLAL M T T K IR
Sbjct: 269 FYPTKLAL-MLTSDT-KTIR 286
Score = 192 (72.6 bits), Expect = 4.0e-57, Sum P(4) = 4.0e-57
Identities = 41/83 (49%), Positives = 57/83 (68%)
Query: 214 LNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
LN A G Q G LIVETNF++Y+Y++S L++A++ LF L RF N+ +G +TR+
Sbjct: 324 LNKA--GLKNQDIPDGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRE 381
Query: 274 SVRAALRSGITAAQIIGFLRLHA 296
S+R AL +GITA QII +L HA
Sbjct: 382 SIRRALTNGITADQIIAYLETHA 404
Score = 168 (64.2 bits), Expect = 4.0e-57, Sum P(4) = 4.0e-57
Identities = 30/66 (45%), Positives = 46/66 (69%)
Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
DQIRLW+ E +R+ T EG LYS F + ++ L YAQD+GVL W+++KK+ ++K+G+
Sbjct: 439 DQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGN 498
Query: 370 SDIKKF 375
S + F
Sbjct: 499 SQVLDF 504
Score = 51 (23.0 bits), Expect = 4.0e-57, Sum P(4) = 4.0e-57
Identities = 10/46 (21%), Positives = 23/46 (50%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL 47
L +AK +++ ++F E VP + W++ + + A ++ L
Sbjct: 39 LPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMKSL 84
>UNIPROTKB|B4DTJ5 [details] [associations]
symbol:GTF2H4 "cDNA FLJ50212, highly similar to TFIIH basal
transcription factor complex p52 subunit" species:9606 "Homo
sapiens" [GO:0000439 "core TFIIH complex" evidence=IEA] [GO:0004003
"ATP-dependent DNA helicase activity" evidence=IEA] [GO:0006289
"nucleotide-excision repair" evidence=IEA] InterPro:IPR004598
Pfam:PF03849 GO:GO:0005634 GO:GO:0006355 GO:GO:0006281
EMBL:CR759747 EMBL:AL662854 EMBL:AL662870 EMBL:AL773541
EMBL:BX927194 EMBL:AL669830 EMBL:CR936875 PANTHER:PTHR13152
UniGene:Hs.485070 HGNC:HGNC:4658 TIGRFAMs:TIGR00625 EMBL:AK300239
IPI:IPI01015708 STRING:B4DTJ5 Ensembl:ENST00000539324
Ensembl:ENST00000547731 Ensembl:ENST00000549684
Ensembl:ENST00000551614 Ensembl:ENST00000551657
Ensembl:ENST00000552174 UCSC:uc011dmv.1 Uniprot:B4DTJ5
Length = 220
Score = 456 (165.6 bits), Expect = 4.8e-52, Sum P(2) = 4.8e-52
Identities = 94/162 (58%), Positives = 116/162 (71%)
Query: 48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
R+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS +S
Sbjct: 59 RQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA-AVSQ 117
Query: 107 DAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE
Sbjct: 118 DLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 177
Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK 201
L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRK
Sbjct: 178 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRK 219
Score = 101 (40.6 bits), Expect = 4.8e-52, Sum P(2) = 4.8e-52
Identities = 22/49 (44%), Positives = 32/49 (65%)
Query: 11 IRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR LL GG
Sbjct: 1 MRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 48
>UNIPROTKB|G4NCE6 [details] [associations]
symbol:MGG_00397 "RNA polymerase II transcription factor B
subunit 2" species:242507 "Magnaporthe oryzae 70-15" [GO:0043581
"mycelium development" evidence=IEP] InterPro:IPR004598
Pfam:PF03849 GO:GO:0005634 GO:GO:0006355 EMBL:CM001235
GO:GO:0006281 GO:GO:0043581 KO:K03144 PANTHER:PTHR13152
TIGRFAMs:TIGR00625 RefSeq:XP_003718679.1 EnsemblFungi:MGG_00397T0
GeneID:2674140 KEGG:mgr:MGG_00397 Uniprot:G4NCE6
Length = 490
Score = 407 (148.3 bits), Expect = 1.7e-51, Sum P(3) = 1.7e-51
Identities = 96/274 (35%), Positives = 146/274 (53%)
Query: 117 AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD-LVECLTFLFQLKFSTFGTDY 175
+E ITK GF FLL E AQVW +L++L +V + V+ L+FLF L G Y
Sbjct: 198 SERISITKTGFTFLLQEANAQVWTLLLQWLHSVNEDNTNRAVDMLSFLFMLGTLELGQAY 257
Query: 176 STEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT-LKQIRE------- 227
T +S++ + L L +FGLVY +++PT+LA + + + L+ +
Sbjct: 258 DTGALSEERRNMLPDLNDFGLVYIPPSNPDQYFPTRLATTLTSGSSALRSVSSGVAAATA 317
Query: 228 -------PGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALR 280
G +I+ETNFR+YAYT + L++A++ LF L RF + G L+R S++ A+
Sbjct: 318 EAGENNTKGAIILETNFRIYAYTSTPLQIAILALFANLKMRFAGMVTGQLSRHSIKRAIS 377
Query: 281 SGITAAQIIGFLRLHAXXXXXXXXXX--------XXXDQIRLWEGERERLTTSEGVLYSQ 332
GITA QII +L HA DQIRLW+ E ER+ G L+
Sbjct: 378 HGITADQIIEYLASHAHEQMHRIAAIRNKPVLPPTVVDQIRLWQLETERMQVQRGYLFKD 437
Query: 333 FVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
F SQA+F+ + DYA ++GVL W+++ ++ +K
Sbjct: 438 FESQAEFKAIADYADEVGVLIWRSDARQLFFASK 471
Score = 80 (33.2 bits), Expect = 1.7e-51, Sum P(3) = 1.7e-51
Identities = 21/74 (28%), Positives = 43/74 (58%)
Query: 35 KEHKEATDTLT-ELRRNAKVALLGGGK--PWSMSAKLEVDSKARDLDFLNQYALERWECI 91
KE +LT +++ ++AL GGG + + + L + + D+ FL+++A RW+ I
Sbjct: 94 KEKPVQEMSLTPNFKKSLRLALEGGGSHNSFGVPSSLPIPPQV-DVAFLDKWARSRWDAI 152
Query: 92 LRFMVGS-QQTEGI 104
L ++V S ++T+ +
Sbjct: 153 LHYVVNSVEETDSM 166
Score = 76 (31.8 bits), Expect = 1.7e-51, Sum P(3) = 1.7e-51
Identities = 15/38 (39%), Positives = 26/38 (68%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKE 39
L +AK +V+ LL++ QP+P + SW+ K AK++K+
Sbjct: 39 LPPLAKTFVMSLLYMPQPLPLTALDSWV-KPEAKKNKD 75
>ASPGD|ASPL0000068300 [details] [associations]
symbol:AN7015 species:162425 "Emericella nidulans"
[GO:0006281 "DNA repair" evidence=IEA] [GO:0006355 "regulation of
transcription, DNA-dependent" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0003674
"molecular_function" evidence=ND] InterPro:IPR004598 Pfam:PF03849
GO:GO:0005634 GO:GO:0006355 GO:GO:0006281 EMBL:BN001304
EMBL:AACD01000117 eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152
HOGENOM:HOG000170529 OMA:CVLHYMV TIGRFAMs:TIGR00625
OrthoDB:EOG4CZFQD RefSeq:XP_664619.1 STRING:Q5AXG5
EnsemblFungi:CADANIAT00000461 GeneID:2870093 KEGG:ani:AN7015.2
Uniprot:Q5AXG5
Length = 482
Score = 466 (169.1 bits), Expect = 5.3e-49, Sum P(2) = 5.3e-49
Identities = 110/287 (38%), Positives = 161/287 (56%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGM- 180
IT+DGF F+L + QVW+ ++ Y+E+ E+ G+D VE L+F+F L G Y + +
Sbjct: 194 ITQDGFAFVLQDVGTQVWHILILYVESAEAIGMDSVEVLSFIFLLSSLELGKSYEKKHLT 253
Query: 181 SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT---------RGTLKQ-----IR 226
S++L+ L L +FG+VYQ +A FYPT+LA + + G L
Sbjct: 254 SNQLRT-LTDLADFGIVYQDSPEASHFYPTRLATTLTSDSSALSNPISGALSDPDGGDSN 312
Query: 227 EPG--FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGIT 284
+PG F+I+ETN+R+YAYT S L+++LI LF L YRFPNL G +TR S+R A+ GIT
Sbjct: 313 QPGSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKVTRQSIRRAIEMGIT 372
Query: 285 AAQIIGFLRLHAXXXXXXXXXXXXX----------------DQIRLWEGERERLTTSEGV 328
A QII +L HA DQIRLW+ ER+R+ + G
Sbjct: 373 ADQIISYLATHAHPQMRKHNVARSTSNQAGMPPSVLPPTVVDQIRLWQLERDRVKATAGF 432
Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
L+ FVS A++E YA+++GVL W++++KR VT+ H + F
Sbjct: 433 LFKDFVSLAEYEAPCRYAEEIGVLIWKSDRKRMFFVTR--HEQVAAF 477
Score = 227 (85.0 bits), Expect = 6.0e-19, Sum P(2) = 6.0e-19
Identities = 60/178 (33%), Positives = 95/178 (53%)
Query: 54 ALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQQTEGISADA---- 108
AL G + S + D + + L++YA +WE +L +MVG+ GI D
Sbjct: 115 ALTGAKETQSFGVLHHIPDDQTVPIHDLDEYARRQWEGVLGYMVGTSGL-GIQRDVNLSK 173
Query: 109 -VRTLFNEG-----AENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLT 161
V+ L G + V IT+DGF F+L + QVW+ ++ Y+E+ E+ G+D VE L+
Sbjct: 174 GVKQLLQAGHLVEIRDRRVEITQDGFAFVLQDVGTQVWHILILYVESAEAIGMDSVEVLS 233
Query: 162 FLFQLKFSTFGTDYSTEGM-SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT 218
F+F L G Y + + S++L+ L L +FG+VYQ +A FYPT+LA + +
Sbjct: 234 FIFLLSSLELGKSYEKKHLTSNQLRT-LTDLADFGIVYQDSPEASHFYPTRLATTLTS 290
Score = 62 (26.9 bits), Expect = 5.3e-49, Sum P(2) = 5.3e-49
Identities = 10/28 (35%), Positives = 21/28 (75%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWL 29
L ++AK +V+ LL+++ P+P A + +W+
Sbjct: 37 LPDLAKCFVMALLYLKDPLPAADLETWV 64
>DICTYBASE|DDB_G0293228 [details] [associations]
symbol:gtf2h4 "TFIIH subunit" species:44689
"Dictyostelium discoideum" [GO:0006289 "nucleotide-excision repair"
evidence=IEA;ISS] [GO:0005634 "nucleus" evidence=IEA] [GO:0004003
"ATP-dependent DNA helicase activity" evidence=IEA] [GO:0000439
"core TFIIH complex" evidence=IEA] [GO:0006366 "transcription from
RNA polymerase II promoter" evidence=ISS] [GO:0005675 "holo TFIIH
complex" evidence=ISS] [GO:0006974 "response to DNA damage
stimulus" evidence=IEA] [GO:0006355 "regulation of transcription,
DNA-dependent" evidence=IEA] [GO:0006351 "transcription,
DNA-dependent" evidence=IEA] [GO:0006281 "DNA repair" evidence=IEA]
[GO:0044351 "macropinocytosis" evidence=RCA] InterPro:IPR004598
Pfam:PF03849 dictyBase:DDB_G0293228 GenomeReviews:CM000155_GR
GO:GO:0006355 EMBL:AAFI02000200 GO:GO:0006366 GO:GO:0006289
GO:GO:0005675 RefSeq:XP_629237.1 ProteinModelPortal:Q54C29
STRING:Q54C29 PRIDE:Q54C29 EnsemblProtists:DDB0231037
GeneID:8629119 KEGG:ddi:DDB_G0293228 eggNOG:COG5144 KO:K03144
OMA:PVNCLPI ProtClustDB:CLSZ2728699 PANTHER:PTHR13152
Uniprot:Q54C29
Length = 483
Score = 454 (164.9 bits), Expect = 6.0e-48, Sum P(2) = 6.0e-48
Identities = 97/272 (35%), Positives = 161/272 (59%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESK-GL---DLVECLTFLFQLKFSTFGTDYST 177
IT +GF+FLL + Q+W ++ YL+ +E K G + L+FLF+L F G Y
Sbjct: 206 ITSEGFKFLLKDVYTQIWTLLIVYLDDLEKKKGKGSGSRNDLLSFLFRLSFLNLGRGYLV 265
Query: 178 EGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQI-----------R 226
+S++ + +L L++FGL+Y R + FYPT+L +++ T TL I +
Sbjct: 266 SELSEQQKEYLFALKQFGLIYMRTDSSILFYPTRLIISLTTGKTLSLIQSISSERTQTQK 325
Query: 227 EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAA 286
E G++++ETN+R+YAYT S+L+++L+ LF +++YR PNL VG++TR+S+R AL GITA
Sbjct: 326 EQGYIVLETNYRLYAYTSSSLQISLLSLFVKMLYRLPNLAVGIITRESIRTALIHGITAD 385
Query: 287 QIIGFLRLHAXXXXXXXXX---XXXXDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLR 343
QII F+R ++ +QI LWE ER R+T ++ VLY+ F + +
Sbjct: 386 QIIDFVRHNSHPNAANSGQPIPDVVAEQILLWEAERNRITYTKSVLYNSFPTNDCYIATL 445
Query: 344 DYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+A++ W ++ +T+VV ++G+ I+ F
Sbjct: 446 KFAKEQDYYIWSHDPLKTLVVKEEGNDPIRNF 477
Score = 215 (80.7 bits), Expect = 5.6e-15, P = 5.6e-15
Identities = 64/242 (26%), Positives = 127/242 (52%)
Query: 5 IAKHYVIRLLFVEQ---PV-PQAVIASWLSKTHAKEHKEATDTLTE--------LRRNAK 52
+ K + ++++F+++ P+ PQ +S S + ++ ++ TE + N K
Sbjct: 73 LKKLFDLKIIFLDKINKPIQPQQQQSSQQSSSQQQQQQQQQQQQTEQTIRLNPLFQDNIK 132
Query: 53 VALLGGGKP-WSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS--QQTEGISADAV 109
+L+ + +S ++ ++ + K +D L+ Y+ +WE +L F+ Q ++ IS +
Sbjct: 133 RSLVQVNQVIFSNNSSIKDNHKPPSIDDLDSYSKSQWEKVLYFLSDDTVQPSKLISELLL 192
Query: 110 RT-LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK---GL-DLVECLTFLF 164
+ L + + IT +GF+FLL + Q+W ++ YL+ +E K G + L+FLF
Sbjct: 193 SSNLTKQEGDGLSITSEGFKFLLKDVYTQIWTLLIVYLDDLEKKKGKGSGSRNDLLSFLF 252
Query: 165 QLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQ 224
+L F G Y +S++ + +L L++FGL+Y R + FYPT+L +++ T TL
Sbjct: 253 RLSFLNLGRGYLVSELSEQQKEYLFALKQFGLIYMRTDSSILFYPTRLIISLTTGKTLSL 312
Query: 225 IR 226
I+
Sbjct: 313 IQ 314
Score = 64 (27.6 bits), Expect = 6.0e-48, Sum P(2) = 6.0e-48
Identities = 11/43 (25%), Positives = 26/43 (60%)
Query: 6 AKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR 48
+K Y++++L V+ P ++ W ++ ++HKE+ L +L+
Sbjct: 39 SKQYILKMLLVDT-YPLSLAKDWSTQASIQQHKESLKKLFDLK 80
>CGD|CAL0002355 [details] [associations]
symbol:orf19.5846 species:5476 "Candida albicans" [GO:0005675
"holo TFIIH complex" evidence=IEA] [GO:0005737 "cytoplasm"
evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
[GO:0000112 "nucleotide-excision repair factor 3 complex"
evidence=IEA] [GO:0003690 "double-stranded DNA binding"
evidence=IEA] [GO:0070816 "phosphorylation of RNA polymerase II
C-terminal domain" evidence=IEA] [GO:0006366 "transcription from
RNA polymerase II promoter" evidence=IEA] [GO:0006289
"nucleotide-excision repair" evidence=IEA] InterPro:IPR004598
Pfam:PF03849 CGD:CAL0002355 GO:GO:0005634 GO:GO:0006355
GO:GO:0006281 EMBL:AACQ01000235 EMBL:AACQ01000236 eggNOG:COG5144
KO:K03144 PANTHER:PTHR13152 HOGENOM:HOG000170529 TIGRFAMs:TIGR00625
RefSeq:XP_710801.1 RefSeq:XP_710812.1 STRING:Q59M53 GeneID:3647581
GeneID:3647592 KEGG:cal:CaO19.13268 KEGG:cal:CaO19.5846
Uniprot:Q59M53
Length = 494
Score = 448 (162.8 bits), Expect = 5.3e-47, Sum P(2) = 5.3e-47
Identities = 124/356 (34%), Positives = 179/356 (50%)
Query: 48 RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
R+N + L G P + S VD D+ FL+ +A ++WE IL FMVG++ T +
Sbjct: 103 RKNFRDCLTGSQNPNAFGSISTTVDKHKVDIPFLDLFASQKWETILHFMVGTEST-ATPS 161
Query: 107 DAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECL 160
D+V +L EG N + IT GFQFLL + AQ+W +L+YL + +D V+ L
Sbjct: 162 DSVLSLLKLGGLMEGPNNDLRITNSGFQFLLQDVNAQIWTLLLQYLNLTQELNMDPVDVL 221
Query: 161 TFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT-R 219
F+F L G YS +S+ L L+++GLVYQR +GRFYPT+LA + +
Sbjct: 222 NFIFVLGSLELGKSYSVLSLSETQVSMLADLKDYGLVYQRSDTSGRFYPTRLATTLTSDS 281
Query: 220 GTLK-----------QIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
LK Q+ +I+ETNF++YAYT S L++A++ LF RF N+ G
Sbjct: 282 AALKTPSMAMDEEEQQVATKESIIIETNFKIYAYTKSPLEIAILNLFVHFKTRFANMVCG 341
Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHAXXXXXXXXXXXXXDQIRLWEGERERLTTSEGV 328
+TR+S+R AL +GITA QII FL HA +I + T+ G
Sbjct: 342 QITRESIRNALYNGITADQIIKFLETHAHPQMRILAKEKLDKKIEF--DTSHNINTAGGA 399
Query: 329 LYSQ---FVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
S+ VSQ E L D + WQ E R + T G+ + K + N Q+
Sbjct: 400 PQSKTDGMVSQHKLEILPPNVVDQ-IKLWQLELDR--IQTFDGY--LFKDFSNQQE 450
Score = 145 (56.1 bits), Expect = 4.9e-09, Sum P(2) = 4.9e-09
Identities = 26/66 (39%), Positives = 43/66 (65%)
Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
DQI+LW+ E +R+ T +G L+ F +Q +++ L +YA +LGVL W ++ K+ VTK G
Sbjct: 422 DQIKLWQLELDRIQTFDGYLFKDFSNQQEYDILSNYASELGVLIWADKVKKKFFVTKDGM 481
Query: 370 SDIKKF 375
+ + F
Sbjct: 482 TQVADF 487
Score = 61 (26.5 bits), Expect = 5.3e-47, Sum P(2) = 5.3e-47
Identities = 14/46 (30%), Positives = 23/46 (50%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL 47
LS +AK Y++ ++F E+P+ + W + K EA L L
Sbjct: 39 LSPMAKFYIMSMIFNEKPIALRDLDKWCKPSARKLEFEALKRLESL 84
>WB|WBGene00013529 [details] [associations]
symbol:Y73F8A.24 species:6239 "Caenorhabditis elegans"
[GO:0005634 "nucleus" evidence=IEA] [GO:0006281 "DNA repair"
evidence=IEA] [GO:0006355 "regulation of transcription,
DNA-dependent" evidence=IEA] [GO:0000003 "reproduction"
evidence=IMP] [GO:0040035 "hermaphrodite genitalia development"
evidence=IMP] [GO:0009792 "embryo development ending in birth or
egg hatching" evidence=IMP] InterPro:IPR004598 Pfam:PF03849
GO:GO:0005634 GO:GO:0009792 GO:GO:0006355 GO:GO:0006281
GO:GO:0040035 EMBL:AL132862 eggNOG:COG5144 KO:K03144 OMA:PVNCLPI
PANTHER:PTHR13152 GeneTree:ENSGT00390000014159 GeneID:178431
KEGG:cel:CELE_Y73F8A.24 CTD:178431 RefSeq:NP_001255863.1
ProteinModelPortal:Q9NA35 SMR:Q9NA35 STRING:Q9NA35 PaxDb:Q9NA35
EnsemblMetazoa:Y73F8A.24a UCSC:Y73F8A.24 WormBase:Y73F8A.24a
HOGENOM:HOG000020092 InParanoid:Q9NA35 NextBio:901110
ArrayExpress:Q9NA35 Uniprot:Q9NA35
Length = 481
Score = 492 (178.3 bits), Expect = 5.4e-47, P = 5.4e-47
Identities = 110/287 (38%), Positives = 167/287 (58%)
Query: 113 FNEGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFST- 170
F G + + IT GFQFLLL Q+W ++++YL+ S+G D+VE + L Q+
Sbjct: 180 FTSGGDTQIEITTFGFQFLLLSPVKQMWTYVIEYLKLEMSQGQDIVEVIEPLIQIVLLAN 239
Query: 171 --FGTD---YSTEG-MSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQ 224
F + Y + ++ L HLRE G+++ RKRK G F+ T+L ++AT T+
Sbjct: 240 RGFKAERECYQIDANWTEPQNELLNHLRELGVIFIRKRKDGVFFLTQLLTHLATNETIDD 299
Query: 225 IREP----GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALR 280
+ G +IVETNFRVYAYT S L++A+I LF E+ YRF +++VG++TR+SVR AL+
Sbjct: 300 VSAEKVSNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQ 359
Query: 281 SGITAAQIIGFLRLHAXXXXXXXXX------XXXXDQIRLWEGERERLTTSEGVLYSQFV 334
GITAAQII FLR +A DQIRLWE ER R+ + +YS F
Sbjct: 360 HGITAAQIISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFE 419
Query: 335 SQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
S+ +F + +YA+ +L W + +++ V+V + GH +++ W +K
Sbjct: 420 SEDEFHGVCEYARQQNILLWSDNQQKLVIVNEDGHELVRQ-WYKREK 465
Score = 226 (84.6 bits), Expect = 2.9e-16, P = 2.9e-16
Identities = 65/208 (31%), Positives = 108/208 (51%)
Query: 37 HKEATDTL----TELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECIL 92
H++ TD T+ +R+ A + G S S LE + R + + A+ERW+CIL
Sbjct: 95 HRQTTDGKLNIDTDYKRSYMYAAMLGAAQIS-SLVLEPNEGNRRGKDVEKKAVERWDCIL 153
Query: 93 RFMV--GSQQTEGISADAVRTLFNE-----GAENPV-ITKDGFQFLLLETPAQVWYFILK 144
R++ + T+ +S + R LF + G + + IT GFQFLLL Q+W ++++
Sbjct: 154 RYLALPSEENTQAVS-ETTRNLFKKANFTSGGDTQIEITTFGFQFLLLSPVKQMWTYVIE 212
Query: 145 YLETVESKGLDLVECLTFLFQLKFST---FGTD---YSTEG-MSDKLQVFLQHLREFGLV 197
YL+ S+G D+VE + L Q+ F + Y + ++ L HLRE G++
Sbjct: 213 YLKLEMSQGQDIVEVIEPLIQIVLLANRGFKAERECYQIDANWTEPQNELLNHLRELGVI 272
Query: 198 YQRKRKAGRFYPTKLALNMATRGTLKQI 225
+ RKRK G F+ T+L ++AT T+ +
Sbjct: 273 FIRKRKDGVFFLTQLLTHLATNETIDDV 300
>UNIPROTKB|A2ABL5 [details] [associations]
symbol:GTF2H4 "General transcription factor IIH subunit 4"
species:9606 "Homo sapiens" [GO:0000439 "core TFIIH complex"
evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0006289 "nucleotide-excision repair"
evidence=IEA] InterPro:IPR004598 Pfam:PF03849 GO:GO:0005634
GO:GO:0006355 GO:GO:0006281 EMBL:CR759747 EMBL:AL662854
EMBL:AL662870 EMBL:AL773541 EMBL:BX927194 EMBL:AL669830
EMBL:CR936875 eggNOG:COG5144 PANTHER:PTHR13152 HGNC:HGNC:4658
IPI:IPI00798015 Ensembl:ENST00000416161 Ensembl:ENST00000423849
Ensembl:ENST00000430576 Ensembl:ENST00000453897
Ensembl:ENST00000454718 Ensembl:ENST00000456195
HOGENOM:HOG000197779 Uniprot:A2ABL5
Length = 138
Score = 127 (49.8 bits), Expect = 1.5e-07, P = 1.5e-07
Identities = 27/59 (45%), Positives = 40/59 (67%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG 59
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR LL GG
Sbjct: 47 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRI-WHTQLLPGG 104
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.322 0.136 0.399 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 386 373 0.00087 117 3 11 22 0.50 33
34 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 18
No. of states in DFA: 619 (66 KB)
Total size of DFA: 246 KB (2132 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:01
No. of threads or processors used: 24
Search cpu time: 29.97u 0.10s 30.07t Elapsed: 00:00:11
Total cpu time: 29.98u 0.10s 30.08t Elapsed: 00:00:13
Start: Thu Aug 15 12:11:46 2013 End: Thu Aug 15 12:11:59 2013