Your job contains 1 sequence.
>023328
MPSGITARLPTLEDLEAYAIGQWECFLLQLISSAQAERPTNFSSSMMKVFQRGLLIQSIL
RSLKFLCQGILEAYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATNLSMSL
TDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE
NGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRD
VFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 023328
(284 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
WB|WBGene00013529 - symbol:Y73F8A.24 species:6239 "Caenor... 395 5.8e-40 2
POMBASE|SPBC13G1.13 - symbol:tfb2 "transcription factor T... 367 4.5e-38 2
DICTYBASE|DDB_G0293228 - symbol:gtf2h4 "TFIIH subunit" sp... 363 9.3e-38 2
ZFIN|ZDB-GENE-030131-6779 - symbol:gtf2h4 "general transc... 364 1.0e-36 2
ASPGD|ASPL0000068300 - symbol:AN7015 species:162425 "Emer... 345 2.2e-36 2
UNIPROTKB|A6H7G8 - symbol:GTF2H4 "Uncharacterized protein... 390 3.5e-36 1
UNIPROTKB|Q767M2 - symbol:TFIIH "Uncharacterized protein"... 389 4.4e-36 1
UNIPROTKB|F1PGI3 - symbol:GTF2H4 "Uncharacterized protein... 388 5.7e-36 1
UNIPROTKB|Q92759 - symbol:GTF2H4 "General transcription f... 385 1.2e-35 1
UNIPROTKB|P60027 - symbol:GTF2H4 "General transcription f... 385 1.2e-35 1
RGD|1303309 - symbol:Gtf2h4 "general transcription factor... 385 1.2e-35 1
MGI|MGI:1338799 - symbol:Gtf2h4 "general transcription fa... 383 1.9e-35 1
FB|FBgn0261109 - symbol:mrn "marionette" species:7227 "Dr... 351 5.0e-35 2
CGD|CAL0002355 - symbol:orf19.5846 species:5476 "Candida ... 247 6.3e-33 3
SGD|S000006043 - symbol:TFB2 "Subunit of TFIIH and nucleo... 187 1.9e-30 4
UNIPROTKB|G4NCE6 - symbol:MGG_00397 "RNA polymerase II tr... 307 2.0e-29 2
>WB|WBGene00013529 [details] [associations]
symbol:Y73F8A.24 species:6239 "Caenorhabditis elegans"
[GO:0005634 "nucleus" evidence=IEA] [GO:0006281 "DNA repair"
evidence=IEA] [GO:0006355 "regulation of transcription,
DNA-dependent" evidence=IEA] [GO:0000003 "reproduction"
evidence=IMP] [GO:0040035 "hermaphrodite genitalia development"
evidence=IMP] [GO:0009792 "embryo development ending in birth or
egg hatching" evidence=IMP] InterPro:IPR004598 Pfam:PF03849
GO:GO:0005634 GO:GO:0009792 GO:GO:0006355 GO:GO:0006281
GO:GO:0040035 EMBL:AL132862 eggNOG:COG5144 KO:K03144 OMA:PVNCLPI
PANTHER:PTHR13152 GeneTree:ENSGT00390000014159 GeneID:178431
KEGG:cel:CELE_Y73F8A.24 CTD:178431 RefSeq:NP_001255863.1
ProteinModelPortal:Q9NA35 SMR:Q9NA35 STRING:Q9NA35 PaxDb:Q9NA35
EnsemblMetazoa:Y73F8A.24a UCSC:Y73F8A.24 WormBase:Y73F8A.24a
HOGENOM:HOG000020092 InParanoid:Q9NA35 NextBio:901110
ArrayExpress:Q9NA35 Uniprot:Q9NA35
Length = 481
Score = 395 (144.1 bits), Expect = 5.8e-40, Sum P(2) = 5.8e-40
Identities = 86/217 (39%), Positives = 138/217 (63%)
Query: 72 EAYNLNT-LSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATNLSMSLT--DSSARK- 127
E Y ++ +E Q ++ +LG++ +++ RK+ F T+L T+L+ + T D SA K
Sbjct: 247 ECYQIDANWTEPQNELLNHLRELGVIFIRK-RKDGVFFLTQLLTHLATNETIDDVSAEKV 305
Query: 128 -EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 186
G V+VETNFR+YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GITA
Sbjct: 306 SNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAA 365
Query: 187 QIISFLQQNAHPR-VADRIP--SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFE 243
QIISFL+ NAHP+ +A P +P V DQIRLWE + R+ + A+ Y F S D F
Sbjct: 366 QIISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFH 425
Query: 244 AACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 280
C+YAR ++ LLW D+++ ++VN + H +R++ +
Sbjct: 426 GVCEYARQQNILLWSDNQQKLVIVNEDGHELVRQWYK 462
Score = 47 (21.6 bits), Expect = 5.8e-40, Sum P(2) = 5.8e-40
Identities = 9/40 (22%), Positives = 22/40 (55%)
Query: 13 EDLEAYAIGQWECFLLQLISSAQAERPTNFSSSMMKVFQR 52
+D+E A+ +W+C +L+ ++ E S + +F++
Sbjct: 139 KDVEKKAVERWDC-ILRYLALPSEENTQAVSETTRNLFKK 177
>POMBASE|SPBC13G1.13 [details] [associations]
symbol:tfb2 "transcription factor TFIIH complex subunit
Tfb2" species:4896 "Schizosaccharomyces pombe" [GO:0000112
"nucleotide-excision repair factor 3 complex" evidence=ISO]
[GO:0000439 "core TFIIH complex" evidence=ISO] [GO:0000717
"nucleotide-excision repair, DNA duplex unwinding" evidence=ISO]
[GO:0000991 "core RNA polymerase II binding transcription factor
activity" evidence=ISO] [GO:0004003 "ATP-dependent DNA helicase
activity" evidence=IEA] [GO:0005675 "holo TFIIH complex"
evidence=ISO] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006355
"regulation of transcription, DNA-dependent" evidence=IEA]
[GO:0006367 "transcription initiation from RNA polymerase II
promoter" evidence=ISO] InterPro:IPR004598 Pfam:PF03849
PomBase:SPBC13G1.13 GO:GO:0005737 GO:GO:0006355 EMBL:CU329671
GenomeReviews:CU329671_GR GO:GO:0006367 GO:GO:0000991 GO:GO:0005675
GO:GO:0000717 GO:GO:0000439 GO:GO:0000112 eggNOG:COG5144 KO:K03144
OMA:PVNCLPI PANTHER:PTHR13152 HOGENOM:HOG000170529
TIGRFAMs:TIGR00625 PIR:T39414 RefSeq:NP_596563.1
ProteinModelPortal:P87303 IntAct:P87303 STRING:P87303
EnsemblFungi:SPBC13G1.13.1 GeneID:2539790 KEGG:spo:SPBC13G1.13
OrthoDB:EOG4CZFQD NextBio:20800939 Uniprot:P87303
Length = 447
Score = 367 (134.2 bits), Expect = 4.5e-38, Sum P(2) = 4.5e-38
Identities = 77/214 (35%), Positives = 127/214 (59%)
Query: 73 AYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATNLSM---SLTDSSARKE- 128
AY+++ L++ Q+ M++D + GLV Q+ F PT+LAT L+ SL + +
Sbjct: 230 AYSVDFLTDTQQIMLEDLREYGLV-YQRKITSKRFYPTRLATGLTTDYRSLHGKQSENDD 288
Query: 129 --GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 186
GF++VETN+R+YAY++S L I+ LF+ + + NL+VG IT++S+ A NGI AE
Sbjct: 289 DKGFIIVETNYRLYAYTSSPLQIAIIGLFANLRARFSNLVVGVITRDSIRRALMNGIAAE 348
Query: 187 QIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAAC 246
QII++L +AHP++ +P +P + DQI LWE + NR+ TP + +F + F+ A
Sbjct: 349 QIITYLTTHAHPQMRSNVPLLPPTLVDQIYLWELEKNRLRATPGILFRDFLTDSGFDQAV 408
Query: 247 DYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 280
+YA++ L+W+ S K + M +L+
Sbjct: 409 EYAKELGVLVWDSSLKRMFFITTTGAQPMIAYLK 442
Score = 57 (25.1 bits), Expect = 4.5e-38, Sum P(2) = 4.5e-38
Identities = 16/61 (26%), Positives = 30/61 (49%)
Query: 9 LPTLEDLEAYAIGQWECFLLQLISSAQAERPTNFSSSMMKVFQRGLLIQSILRSLKFLCQ 68
L T++ L+AYA WE L ++ + +A+ P ++ + +RG L+ L+
Sbjct: 125 LVTVDFLDAYAKETWETILHFMVGTPEAKFP---GEGVLSLLKRGGLMSGPKNQLRITRA 181
Query: 69 G 69
G
Sbjct: 182 G 182
>DICTYBASE|DDB_G0293228 [details] [associations]
symbol:gtf2h4 "TFIIH subunit" species:44689
"Dictyostelium discoideum" [GO:0006289 "nucleotide-excision repair"
evidence=IEA;ISS] [GO:0005634 "nucleus" evidence=IEA] [GO:0004003
"ATP-dependent DNA helicase activity" evidence=IEA] [GO:0000439
"core TFIIH complex" evidence=IEA] [GO:0006366 "transcription from
RNA polymerase II promoter" evidence=ISS] [GO:0005675 "holo TFIIH
complex" evidence=ISS] [GO:0006974 "response to DNA damage
stimulus" evidence=IEA] [GO:0006355 "regulation of transcription,
DNA-dependent" evidence=IEA] [GO:0006351 "transcription,
DNA-dependent" evidence=IEA] [GO:0006281 "DNA repair" evidence=IEA]
[GO:0044351 "macropinocytosis" evidence=RCA] InterPro:IPR004598
Pfam:PF03849 dictyBase:DDB_G0293228 GenomeReviews:CM000155_GR
GO:GO:0006355 EMBL:AAFI02000200 GO:GO:0006366 GO:GO:0006289
GO:GO:0005675 RefSeq:XP_629237.1 ProteinModelPortal:Q54C29
STRING:Q54C29 PRIDE:Q54C29 EnsemblProtists:DDB0231037
GeneID:8629119 KEGG:ddi:DDB_G0293228 eggNOG:COG5144 KO:K03144
OMA:PVNCLPI ProtClustDB:CLSZ2728699 PANTHER:PTHR13152
Uniprot:Q54C29
Length = 483
Score = 363 (132.8 bits), Expect = 9.3e-38, Sum P(2) = 9.3e-38
Identities = 83/234 (35%), Positives = 134/234 (57%)
Query: 58 SILRSLKFLCQGILEAYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATNLS 117
S L L FL G Y ++ LSE Q+ + GL+ ++ F PT+L +L+
Sbjct: 249 SFLFRLSFLNLG--RGYLVSELSEQQKEYLFALKQFGLIYMRTD-SSILFYPTRLIISLT 305
Query: 118 ----MSLTDS-------SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLI 166
+SL S + +++G++V+ETN+R+YAY++S L +L LF K+ Y+LPNL
Sbjct: 306 TGKTLSLIQSISSERTQTQKEQGYIVLETNYRLYAYTSSSLQISLLSLFVKMLYRLPNLA 365
Query: 167 VGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVE 226
VG IT+ES+ A +GITA+QII F++ N+HP A+ +P+ V +QI LWE++ NR+
Sbjct: 366 VGIITRESIRTALIHGITADQIIDFVRHNSHPNAANSGQPIPDVVAEQILLWEAERNRIT 425
Query: 227 MTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 280
T + Y FP+ D + A +A+++ +W LVV E + +R F++
Sbjct: 426 YTKSVLYNSFPTNDCYIATLKFAKEQDYYIWSHDPLKTLVVKEEGNDPIRNFIK 479
Score = 58 (25.5 bits), Expect = 9.3e-38, Sum P(2) = 9.3e-38
Identities = 20/65 (30%), Positives = 33/65 (50%)
Query: 10 PTLEDLEAYAIGQWECFLLQLIS-SAQAERPTN---FSSSMMKVFQRGLLIQSILRSLKF 65
P+++DL++Y+ QWE L L + Q + + SS++ K GL I S KF
Sbjct: 156 PSIDDLDSYSKSQWEKVLYFLSDDTVQPSKLISELLLSSNLTKQEGDGLSITS--EGFKF 213
Query: 66 LCQGI 70
L + +
Sbjct: 214 LLKDV 218
>ZFIN|ZDB-GENE-030131-6779 [details] [associations]
symbol:gtf2h4 "general transcription factor IIH,
polypeptide 4" species:7955 "Danio rerio" [GO:0005634 "nucleus"
evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0006289 "nucleotide-excision repair"
evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
InterPro:IPR004598 Pfam:PF03849 ZFIN:ZDB-GENE-030131-6779
GO:GO:0005634 GO:GO:0006355 GO:GO:0006281 KO:K03144
PANTHER:PTHR13152 CTD:2968 HOVERGEN:HBG054205 TIGRFAMs:TIGR00625
EMBL:BC064301 IPI:IPI00511064 RefSeq:NP_956221.1 UniGene:Dr.75464
STRING:Q6P2T8 GeneID:334839 KEGG:dre:334839 InParanoid:Q6P2T8
NextBio:20810565 ArrayExpress:Q6P2T8 Uniprot:Q6P2T8
Length = 466
Score = 364 (133.2 bits), Expect = 1.0e-36, Sum P(2) = 1.0e-36
Identities = 90/259 (34%), Positives = 142/259 (54%)
Query: 42 FSSSMMKVFQ-RGL-LIQ--SILRSLKFLCQGILEAYNLNTLSEIQRSMIKDFADLGLVK 97
F+ +K Q RG+ L++ S L L F G Y++ +SE + ++ + GLV
Sbjct: 207 FTLQYLKTAQSRGMDLVEILSFLFQLSFSTLG--RDYSVEGMSESLLTFLQHLREFGLV- 263
Query: 98 LQQGRKESWFIPTKLATNLSMSLTDSSARKE--------------GFVVVETNFRMYAYS 143
Q+ RK + PT+LA L+ +T + A GF+VVETN+R+YAY+
Sbjct: 264 FQRKRKSRRYYPTRLAITLAAGVTANPASGSASSALGAIPGTGDTGFIVVETNYRIYAYT 323
Query: 144 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 203
S+L ++ LFS++ Y+ PNL+V +T+ES+ A NGITA+QII FL+ AHP + +
Sbjct: 324 NSELQIALVALFSEMLYRFPNLVVAQVTRESVQQAISNGITAQQIIHFLRTRAHPVMLKQ 383
Query: 204 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 263
P +P + DQIRLWE + +R++ T Y +F S+ FE D A+ L+W++
Sbjct: 384 TPVLPPTITDQIRLWELEKDRLQFTEGVLYNQFLSQADFEVLRDRAQGLGVLVWQNPAHR 443
Query: 264 RLVVNAEIHMHMREFLRGQ 282
+VV H ++ F + Q
Sbjct: 444 VMVVTPHGHSEVKRFWKRQ 462
Score = 47 (21.6 bits), Expect = 1.0e-36, Sum P(2) = 1.0e-36
Identities = 9/25 (36%), Positives = 15/25 (60%)
Query: 12 LEDLEAYAIGQWECFLLQLISSAQA 36
+E L+ YA+ +WE L ++ S A
Sbjct: 138 VESLDRYAMERWEVILHFMVGSPSA 162
>ASPGD|ASPL0000068300 [details] [associations]
symbol:AN7015 species:162425 "Emericella nidulans"
[GO:0006281 "DNA repair" evidence=IEA] [GO:0006355 "regulation of
transcription, DNA-dependent" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0003674
"molecular_function" evidence=ND] InterPro:IPR004598 Pfam:PF03849
GO:GO:0005634 GO:GO:0006355 GO:GO:0006281 EMBL:BN001304
EMBL:AACD01000117 eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152
HOGENOM:HOG000170529 OMA:CVLHYMV TIGRFAMs:TIGR00625
OrthoDB:EOG4CZFQD RefSeq:XP_664619.1 STRING:Q5AXG5
EnsemblFungi:CADANIAT00000461 GeneID:2870093 KEGG:ani:AN7015.2
Uniprot:Q5AXG5
Length = 482
Score = 345 (126.5 bits), Expect = 2.2e-36, Sum P(2) = 2.2e-36
Identities = 85/240 (35%), Positives = 132/240 (55%)
Query: 72 EAYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATNL---SMSLT------- 121
++Y L+ Q + D AD G+V Q + S F PT+LAT L S +L+
Sbjct: 245 KSYEKKHLTSNQLRTLTDLADFGIV-YQDSPEASHFYPTRLATTLTSDSSALSNPISGAL 303
Query: 122 ------DSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESL 175
DS+ GF+++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G +T++S+
Sbjct: 304 SDPDGGDSNQPGSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKVTRQSI 363
Query: 176 YNAFENGITAEQIISFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDL 222
A E GITA+QIIS+L +AHP++ A PSV P V DQIRLW+ +
Sbjct: 364 RRAIEMGITADQIISYLATHAHPQMRKHNVARSTSNQAGMPPSVLPPTVVDQIRLWQLER 423
Query: 223 NRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 282
+RV+ T +++F S +EA C YA + L+W+ +K V H + FLR +
Sbjct: 424 DRVKATAGFLFKDFVSLAEYEAPCRYAEEIGVLIWKSDRKRMFFVTR--HEQVAAFLRSR 481
Score = 63 (27.2 bits), Expect = 2.2e-36, Sum P(2) = 2.2e-36
Identities = 15/48 (31%), Positives = 25/48 (52%)
Query: 12 LEDLEAYAIGQWECFLLQLI--SSAQAERPTNFSSSMMKVFQRGLLIQ 57
+ DL+ YA QWE L ++ S +R N S + ++ Q G L++
Sbjct: 139 IHDLDEYARRQWEGVLGYMVGTSGLGIQRDVNLSKGVKQLLQAGHLVE 186
>UNIPROTKB|A6H7G8 [details] [associations]
symbol:GTF2H4 "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0008353 "RNA polymerase II carboxy-terminal domain
kinase activity" evidence=IEA] [GO:0006366 "transcription from RNA
polymerase II promoter" evidence=IEA] [GO:0005675 "holo TFIIH
complex" evidence=IEA] [GO:0006289 "nucleotide-excision repair"
evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
InterPro:IPR004598 Pfam:PF03849 GO:GO:0006355 GO:GO:0006281
GO:GO:0006366 GO:GO:0008353 GO:GO:0005675 GO:GO:0008094
eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152 CTD:2968
HOGENOM:HOG000170529 HOVERGEN:HBG054205 OMA:CVLHYMV
TIGRFAMs:TIGR00625 GeneTree:ENSGT00390000014159 EMBL:DAAA02055395
EMBL:BC146240 IPI:IPI00707368 RefSeq:NP_001094527.1
UniGene:Bt.98689 Ensembl:ENSBTAT00000054213 GeneID:100137723
KEGG:bta:100137723 NextBio:20789427 Uniprot:A6H7G8
Length = 463
Score = 390 (142.3 bits), Expect = 3.5e-36, P = 3.5e-36
Identities = 85/229 (37%), Positives = 138/229 (60%)
Query: 56 IQSILRSLKFLCQGILEAYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATN 115
I S L L F G + Y++ +S+ + ++ + GLV Q+ RK + PT+LA N
Sbjct: 234 ILSFLFQLSFSTLG--KDYSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAIN 290
Query: 116 LSMSLTDS--SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKE 173
LS ++ + +A + GF+VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+E
Sbjct: 291 LSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRE 350
Query: 174 SLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYY 233
S+ A +GITA+QII FL+ AHP + + P +P + DQIRLWE + +R+ T Y
Sbjct: 351 SVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLY 410
Query: 234 EEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 282
+F S+ FE +AR+ L++E+S K +VV H ++ F + Q
Sbjct: 411 NQFLSQVDFELLLAHARELGVLMFENSAKRLMVVTPAGHSDVKRFWKRQ 459
>UNIPROTKB|Q767M2 [details] [associations]
symbol:TFIIH "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008353 "RNA polymerase II carboxy-terminal domain
kinase activity" evidence=IEA] [GO:0006366 "transcription from RNA
polymerase II promoter" evidence=IEA] [GO:0005675 "holo TFIIH
complex" evidence=IEA] [GO:0006289 "nucleotide-excision repair"
evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
InterPro:IPR004598 Pfam:PF03849 GO:GO:0006355 GO:GO:0006281
GO:GO:0006366 GO:GO:0008353 GO:GO:0005675 GO:GO:0008094
EMBL:AB113354 EMBL:AB113355 eggNOG:COG5144 KO:K03144
PANTHER:PTHR13152 CTD:2968 HOGENOM:HOG000170529 HOVERGEN:HBG054205
OMA:CVLHYMV OrthoDB:EOG4F1X34 TIGRFAMs:TIGR00625
GeneTree:ENSGT00390000014159 EMBL:CT971577 RefSeq:NP_001116592.1
UniGene:Ssc.7512 STRING:Q767M2 Ensembl:ENSSSCT00000001505
GeneID:100144458 KEGG:ssc:100144458 Uniprot:Q767M2
Length = 463
Score = 389 (142.0 bits), Expect = 4.4e-36, P = 4.4e-36
Identities = 85/229 (37%), Positives = 138/229 (60%)
Query: 56 IQSILRSLKFLCQGILEAYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATN 115
I S L L F G + Y++ +S+ + ++ +LGLV Q+ RK + PT+LA N
Sbjct: 234 ILSFLFQLSFSTLG--KDYSVEGMSDSLLNFLQHLRELGLV-FQRKRKSRRYYPTRLAIN 290
Query: 116 LSMSLTDS--SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKE 173
LS ++ + + + GF+VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+E
Sbjct: 291 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRE 350
Query: 174 SLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYY 233
S+ A +GITA+QII FL+ AHP + + P +P + DQIRLWE + +R+ T Y
Sbjct: 351 SVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLY 410
Query: 234 EEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 282
+F S+ FE +AR+ L++E+S K +VV H ++ F + Q
Sbjct: 411 NQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQ 459
>UNIPROTKB|F1PGI3 [details] [associations]
symbol:GTF2H4 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0006289 "nucleotide-excision repair"
evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
InterPro:IPR004598 Pfam:PF03849 GO:GO:0005634 GO:GO:0006355
GO:GO:0006281 KO:K03144 PANTHER:PTHR13152 OMA:CVLHYMV
TIGRFAMs:TIGR00625 GeneTree:ENSGT00390000014159 EMBL:AAEX03008206
RefSeq:XP_003639474.1 Ensembl:ENSCAFT00000000727 GeneID:100856712
KEGG:cfa:100856712 Uniprot:F1PGI3
Length = 463
Score = 388 (141.6 bits), Expect = 5.7e-36, P = 5.7e-36
Identities = 84/229 (36%), Positives = 138/229 (60%)
Query: 56 IQSILRSLKFLCQGILEAYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATN 115
I S L L F G + Y++ +S+ + ++ + GLV Q+ RK + PT+LA N
Sbjct: 234 ILSFLFQLSFSTLG--KDYSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAIN 290
Query: 116 LSMSLTDS--SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKE 173
LS ++ + +A + GF++VETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+E
Sbjct: 291 LSSGVSGAGGTAHQPGFIIVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRE 350
Query: 174 SLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYY 233
S+ A +GITA+QII FL+ AHP + + P +P + DQIRLWE + +R+ T Y
Sbjct: 351 SVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLY 410
Query: 234 EEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 282
+F S+ FE +AR+ L++E+S K +VV H ++ F + Q
Sbjct: 411 NQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQ 459
>UNIPROTKB|Q92759 [details] [associations]
symbol:GTF2H4 "General transcription factor IIH subunit 4"
species:9606 "Homo sapiens" [GO:0000439 "core TFIIH complex"
evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
promoter" evidence=IDA;TAS] [GO:0004672 "protein kinase activity"
evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0005675 "holo TFIIH complex" evidence=IDA] [GO:0008353 "RNA
polymerase II carboxy-terminal domain kinase activity"
evidence=IDA] [GO:0008094 "DNA-dependent ATPase activity"
evidence=IDA] [GO:0003700 "sequence-specific DNA binding
transcription factor activity" evidence=TAS] [GO:0000718
"nucleotide-excision repair, DNA damage removal" evidence=TAS]
[GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006281 "DNA repair"
evidence=TAS] [GO:0006283 "transcription-coupled
nucleotide-excision repair" evidence=TAS] [GO:0006289
"nucleotide-excision repair" evidence=TAS] [GO:0006360
"transcription from RNA polymerase I promoter" evidence=TAS]
[GO:0006361 "transcription initiation from RNA polymerase I
promoter" evidence=TAS] [GO:0006362 "transcription elongation from
RNA polymerase I promoter" evidence=TAS] [GO:0006363 "termination
of RNA polymerase I transcription" evidence=TAS] [GO:0006367
"transcription initiation from RNA polymerase II promoter"
evidence=TAS] [GO:0006368 "transcription elongation from RNA
polymerase II promoter" evidence=TAS] [GO:0006370
"7-methylguanosine mRNA capping" evidence=TAS] [GO:0010467 "gene
expression" evidence=TAS] [GO:0016032 "viral reproduction"
evidence=TAS] [GO:0050434 "positive regulation of viral
transcription" evidence=TAS] [GO:0006468 "protein phosphorylation"
evidence=IDA] Reactome:REACT_216 Reactome:REACT_71
InterPro:IPR004598 Pfam:PF03849 Reactome:REACT_116125 GO:GO:0016032
EMBL:BA000025 EMBL:CH471081 Reactome:REACT_1675 GO:GO:0006468
GO:GO:0003700 GO:GO:0006368 GO:GO:0006367 Reactome:REACT_1788
GO:GO:0005675 GO:GO:0006370 GO:GO:0000718 GO:GO:0050434
GO:GO:0006363 GO:GO:0006362 GO:GO:0006361 GO:GO:0006283
EMBL:CR759747 EMBL:AL773541 EMBL:BX927194 EMBL:CR936875
eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152 EMBL:Y07595
EMBL:BT007321 EMBL:AY124590 EMBL:AB088103 EMBL:AB202101
EMBL:AB103609 EMBL:BC004935 EMBL:BC016302 IPI:IPI00016839
RefSeq:NP_001508.1 UniGene:Hs.485070 ProteinModelPortal:Q92759
SMR:Q92759 DIP:DIP-48376N STRING:Q92759 PhosphoSite:Q92759
DMDM:17380328 PaxDb:Q92759 PRIDE:Q92759 DNASU:2968
Ensembl:ENST00000259895 Ensembl:ENST00000376316
Ensembl:ENST00000376326 Ensembl:ENST00000400450
Ensembl:ENST00000413314 Ensembl:ENST00000416773
Ensembl:ENST00000423881 Ensembl:ENST00000434226
Ensembl:ENST00000435498 Ensembl:ENST00000438348
Ensembl:ENST00000440824 Ensembl:ENST00000456968 GeneID:2968
KEGG:hsa:2968 UCSC:uc003nsa.1 CTD:2968 GeneCards:GC06P030875
HGNC:HGNC:4658 MIM:601760 neXtProt:NX_Q92759 PharmGKB:PA29044
HOGENOM:HOG000170529 HOVERGEN:HBG054205 InParanoid:Q92759
OMA:CVLHYMV OrthoDB:EOG4F1X34 PhylomeDB:Q92759 GenomeRNAi:2968
NextBio:11764 ArrayExpress:Q92759 Bgee:Q92759 CleanEx:HS_GTF2H4
Genevestigator:Q92759 GermOnline:ENSG00000137411 TIGRFAMs:TIGR00625
Uniprot:Q92759
Length = 462
Score = 385 (140.6 bits), Expect = 1.2e-35, P = 1.2e-35
Identities = 84/229 (36%), Positives = 137/229 (59%)
Query: 56 IQSILRSLKFLCQGILEAYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATN 115
I S L L F G + Y++ +S+ + ++ + GLV Q+ RK + PT+LA N
Sbjct: 233 ILSFLFQLSFSTLG--KDYSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAIN 289
Query: 116 LSMSLTDS--SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKE 173
LS ++ + + + GF+VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+E
Sbjct: 290 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRE 349
Query: 174 SLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYY 233
S+ A +GITA+QII FL+ AHP + + P +P + DQIRLWE + +R+ T Y
Sbjct: 350 SVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLY 409
Query: 234 EEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 282
+F S+ FE +AR+ L++E+S K +VV H ++ F + Q
Sbjct: 410 NQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQ 458
>UNIPROTKB|P60027 [details] [associations]
symbol:GTF2H4 "General transcription factor IIH subunit 4"
species:9598 "Pan troglodytes" [GO:0005675 "holo TFIIH complex"
evidence=ISS] [GO:0006366 "transcription from RNA polymerase II
promoter" evidence=ISS] [GO:0004672 "protein kinase activity"
evidence=ISS] [GO:0008094 "DNA-dependent ATPase activity"
evidence=ISS] [GO:0008353 "RNA polymerase II carboxy-terminal
domain kinase activity" evidence=ISS] InterPro:IPR004598
Pfam:PF03849 GO:GO:0006355 GO:GO:0006281 EMBL:BA000041
GO:GO:0006366 GO:GO:0008353 GO:GO:0005675 GO:GO:0008094
eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152 CTD:2968
HOGENOM:HOG000170529 HOVERGEN:HBG054205 OMA:CVLHYMV
OrthoDB:EOG4F1X34 TIGRFAMs:TIGR00625 EMBL:AB210199 EMBL:AB210200
RefSeq:NP_001038968.1 UniGene:Ptr.6202 ProteinModelPortal:P60027
SMR:P60027 STRING:P60027 PRIDE:P60027 Ensembl:ENSPTRT00000033136
GeneID:462549 KEGG:ptr:462549 GeneTree:ENSGT00390000014159
InParanoid:P60027 NextBio:20841799 Uniprot:P60027
Length = 462
Score = 385 (140.6 bits), Expect = 1.2e-35, P = 1.2e-35
Identities = 84/229 (36%), Positives = 137/229 (59%)
Query: 56 IQSILRSLKFLCQGILEAYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATN 115
I S L L F G + Y++ +S+ + ++ + GLV Q+ RK + PT+LA N
Sbjct: 233 ILSFLFQLSFSTLG--KDYSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAIN 289
Query: 116 LSMSLTDS--SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKE 173
LS ++ + + + GF+VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+E
Sbjct: 290 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRE 349
Query: 174 SLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYY 233
S+ A +GITA+QII FL+ AHP + + P +P + DQIRLWE + +R+ T Y
Sbjct: 350 SVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLY 409
Query: 234 EEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 282
+F S+ FE +AR+ L++E+S K +VV H ++ F + Q
Sbjct: 410 NQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQ 458
>RGD|1303309 [details] [associations]
symbol:Gtf2h4 "general transcription factor II H, polypeptide 4"
species:10116 "Rattus norvegicus" [GO:0000439 "core TFIIH complex"
evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0005675 "holo TFIIH complex" evidence=IEA;ISO]
[GO:0006289 "nucleotide-excision repair" evidence=IEA] [GO:0006366
"transcription from RNA polymerase II promoter" evidence=IEA;ISO]
[GO:0006468 "protein phosphorylation" evidence=ISO] [GO:0008353
"RNA polymerase II carboxy-terminal domain kinase activity"
evidence=IEA;ISO] [GO:0004672 "protein kinase activity"
evidence=ISO] [GO:0008094 "DNA-dependent ATPase activity"
evidence=ISO] InterPro:IPR004598 Pfam:PF03849 RGD:1303309
GO:GO:0006355 GO:GO:0006281 GO:GO:0006366 GO:GO:0008353
GO:GO:0005675 GO:GO:0008094 EMBL:BX883047 eggNOG:COG5144 KO:K03144
PANTHER:PTHR13152 CTD:2968 HOGENOM:HOG000170529 HOVERGEN:HBG054205
OMA:CVLHYMV OrthoDB:EOG4F1X34 TIGRFAMs:TIGR00625
GeneTree:ENSGT00390000014159 EMBL:BC127469 IPI:IPI00215269
RefSeq:NP_997666.1 UniGene:Rn.34122 STRING:Q6MG20
Ensembl:ENSRNOT00000001103 GeneID:294236 KEGG:rno:294236
UCSC:RGD:1303309 InParanoid:Q6MG20 NextBio:637807
Genevestigator:Q6MG20 Uniprot:Q6MG20
Length = 463
Score = 385 (140.6 bits), Expect = 1.2e-35, P = 1.2e-35
Identities = 84/229 (36%), Positives = 137/229 (59%)
Query: 56 IQSILRSLKFLCQGILEAYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATN 115
I S L L F G + Y++ +S+ + ++ + GLV Q+ RK + PT+LA N
Sbjct: 234 ILSFLFQLSFSTLG--KDYSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAIN 290
Query: 116 LSMSLTDS--SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKE 173
LS ++ + + + GF+VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+E
Sbjct: 291 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRE 350
Query: 174 SLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYY 233
S+ A +GITA+QII FL+ AHP + + P +P + DQIRLWE + +R+ T Y
Sbjct: 351 SVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLY 410
Query: 234 EEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 282
+F S+ FE +AR+ L++E+S K +VV H ++ F + Q
Sbjct: 411 NQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQ 459
>MGI|MGI:1338799 [details] [associations]
symbol:Gtf2h4 "general transcription factor II H,
polypeptide 4" species:10090 "Mus musculus" [GO:0000439 "core TFIIH
complex" evidence=IEA] [GO:0004003 "ATP-dependent DNA helicase
activity" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
[GO:0005675 "holo TFIIH complex" evidence=ISO] [GO:0006281 "DNA
repair" evidence=IEA] [GO:0006289 "nucleotide-excision repair"
evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
evidence=IEA] [GO:0006355 "regulation of transcription,
DNA-dependent" evidence=IEA] [GO:0006366 "transcription from RNA
polymerase II promoter" evidence=ISO] [GO:0006468 "protein
phosphorylation" evidence=ISO] [GO:0006974 "response to DNA damage
stimulus" evidence=IEA] InterPro:IPR004598 Pfam:PF03849
MGI:MGI:1338799 GO:GO:0006355 GO:GO:0006281 GO:GO:0006366
GO:GO:0008353 GO:GO:0005675 GO:GO:0008094 eggNOG:COG5144 KO:K03144
PANTHER:PTHR13152 CTD:2968 HOGENOM:HOG000170529 HOVERGEN:HBG054205
OMA:CVLHYMV OrthoDB:EOG4F1X34 TIGRFAMs:TIGR00625 EMBL:AF054823
EMBL:BC012638 IPI:IPI00118172 RefSeq:NP_034494.1 UniGene:Mm.10182
ProteinModelPortal:O70422 SMR:O70422 IntAct:O70422 STRING:O70422
PhosphoSite:O70422 PaxDb:O70422 PRIDE:O70422
Ensembl:ENSMUST00000001565 Ensembl:ENSMUST00000160734 GeneID:14885
KEGG:mmu:14885 InParanoid:O70422 NextBio:287167 Bgee:O70422
CleanEx:MM_GTF2H4 Genevestigator:O70422
GermOnline:ENSMUSG00000001524 Uniprot:O70422
Length = 463
Score = 383 (139.9 bits), Expect = 1.9e-35, P = 1.9e-35
Identities = 84/229 (36%), Positives = 137/229 (59%)
Query: 56 IQSILRSLKFLCQGILEAYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATN 115
I S L L F G + Y++ +S+ + ++ + GLV Q+ RK + PT+LA N
Sbjct: 234 ILSFLFQLSFSTLG--KDYSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAIN 290
Query: 116 LSMSLTDS--SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKE 173
LS ++ + + + GF+VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+E
Sbjct: 291 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRE 350
Query: 174 SLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYY 233
S+ A +GITA+QII FL+ AHP + + P +P + DQIRLWE + +R+ T Y
Sbjct: 351 SVQQAIASGITAQQIIHFLRTRAHPVMLKQNPVLPPTITDQIRLWELERDRLRFTEGVLY 410
Query: 234 EEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 282
+F S+ FE +AR+ L++E+S K +VV H ++ F + Q
Sbjct: 411 NQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQ 459
>FB|FBgn0261109 [details] [associations]
symbol:mrn "marionette" species:7227 "Drosophila
melanogaster" [GO:0005675 "holo TFIIH complex" evidence=ISS;IDA]
[GO:0006367 "transcription initiation from RNA polymerase II
promoter" evidence=ISS] [GO:0000439 "core TFIIH complex"
evidence=ISS;IPI] [GO:0004003 "ATP-dependent DNA helicase activity"
evidence=IEA] [GO:0001671 "ATPase activator activity" evidence=IDA]
[GO:0001111 "promoter clearance from RNA polymerase II promoter"
evidence=ISS] [GO:0006289 "nucleotide-excision repair"
evidence=IDA;IMP] [GO:0010224 "response to UV-B" evidence=IMP]
[GO:0005634 "nucleus" evidence=IDA] [GO:0001113 "transcriptional
open complex formation at RNA polymerase II promoter" evidence=ISS]
[GO:0006366 "transcription from RNA polymerase II promoter"
evidence=IDA] InterPro:IPR004598 Pfam:PF03849 EMBL:AE014296
GO:GO:0006355 GO:GO:0001671 GO:GO:0006367 GO:GO:0010224
GO:GO:0006289 GO:GO:0005675 GO:GO:0000439 KO:K03144
PANTHER:PTHR13152 OMA:CVLHYMV TIGRFAMs:TIGR00625
GeneTree:ENSGT00390000014159 FlyBase:FBgn0261109 RefSeq:NP_648780.1
ProteinModelPortal:Q9VUR1 SMR:Q9VUR1 IntAct:Q9VUR1 MINT:MINT-849737
STRING:Q9VUR1 PRIDE:Q9VUR1 EnsemblMetazoa:FBtr0075595 GeneID:39688
KEGG:dme:Dmel_CG7764 UCSC:CG7764-RA CTD:39688 InParanoid:Q9VUR1
PhylomeDB:Q9VUR1 GenomeRNAi:39688 NextBio:814886
ArrayExpress:Q9VUR1 Bgee:Q9VUR1 Uniprot:Q9VUR1
Length = 499
Score = 351 (128.6 bits), Expect = 5.0e-35, Sum P(2) = 5.0e-35
Identities = 85/250 (34%), Positives = 137/250 (54%)
Query: 51 QRGLLIQSILRSLKFLCQGIL-EAYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIP 109
+RG+ + L L L L Y+ ++ + ++ + GLV Q+ RKE F P
Sbjct: 248 ERGISLPECLSMLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLV-FQRKRKEGRFYP 306
Query: 110 TKLATNL---------SMSLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEY 160
T+LA N+ S+++ + + + G++VVETN+R+YAY+ S L +L LF+++ Y
Sbjct: 307 TRLALNVTSKEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLY 366
Query: 161 QLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR---VADRIPS---VPENVCDQ 214
+ PNL+VG +T++S+ A GITAEQI+S+L+Q AHP V I S +P V DQ
Sbjct: 367 RFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMRMVESAIHSKSCLPPTVVDQ 426
Query: 215 IRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMH 274
I+LWE + NR T Y +F S F DYA+ L+W++ + +VV H
Sbjct: 427 IKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNGHDD 486
Query: 275 MREFLRGQNK 284
++ + + +K
Sbjct: 487 VKRYWKKYSK 496
Score = 44 (20.5 bits), Expect = 5.0e-35, Sum P(2) = 5.0e-35
Identities = 6/19 (31%), Positives = 12/19 (63%)
Query: 15 LEAYAIGQWECFLLQLISS 33
L+ YA+ +W C L ++ +
Sbjct: 166 LDTYAMSRWRCVLHYMVGT 184
>CGD|CAL0002355 [details] [associations]
symbol:orf19.5846 species:5476 "Candida albicans" [GO:0005675
"holo TFIIH complex" evidence=IEA] [GO:0005737 "cytoplasm"
evidence=IEA] [GO:0000439 "core TFIIH complex" evidence=IEA]
[GO:0000112 "nucleotide-excision repair factor 3 complex"
evidence=IEA] [GO:0003690 "double-stranded DNA binding"
evidence=IEA] [GO:0070816 "phosphorylation of RNA polymerase II
C-terminal domain" evidence=IEA] [GO:0006366 "transcription from
RNA polymerase II promoter" evidence=IEA] [GO:0006289
"nucleotide-excision repair" evidence=IEA] InterPro:IPR004598
Pfam:PF03849 CGD:CAL0002355 GO:GO:0005634 GO:GO:0006355
GO:GO:0006281 EMBL:AACQ01000235 EMBL:AACQ01000236 eggNOG:COG5144
KO:K03144 PANTHER:PTHR13152 HOGENOM:HOG000170529 TIGRFAMs:TIGR00625
RefSeq:XP_710801.1 RefSeq:XP_710812.1 STRING:Q59M53 GeneID:3647581
GeneID:3647592 KEGG:cal:CaO19.13268 KEGG:cal:CaO19.5846
Uniprot:Q59M53
Length = 494
Score = 247 (92.0 bits), Expect = 6.3e-33, Sum P(3) = 6.3e-33
Identities = 59/142 (41%), Positives = 87/142 (61%)
Query: 72 EAYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATNL----------SMSLT 121
++Y++ +LSE Q SM+ D D GLV Q+ F PT+LAT L SM++
Sbjct: 234 KSYSVLSLSETQVSMLADLKDYGLV-YQRSDTSGRFYPTRLATTLTSDSAALKTPSMAMD 292
Query: 122 DSS---ARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 178
+ A KE +++ETNF++YAY+ S L IL LF + + N++ G IT+ES+ NA
Sbjct: 293 EEEQQVATKES-IIIETNFKIYAYTKSPLEIAILNLFVHFKTRFANMVCGQITRESIRNA 351
Query: 179 FENGITAEQIISFLQQNAHPRV 200
NGITA+QII FL+ +AHP++
Sbjct: 352 LYNGITADQIIKFLETHAHPQM 373
Score = 115 (45.5 bits), Expect = 6.3e-33, Sum P(3) = 6.3e-33
Identities = 19/76 (25%), Positives = 43/76 (56%)
Query: 203 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 262
++ +P NV DQI+LW+ +L+R++ + +++F ++ ++ +YA + L+W D K
Sbjct: 412 KLEILPPNVVDQIKLWQLELDRIQTFDGYLFKDFSNQQEYDILSNYASELGVLIWADKVK 471
Query: 263 MRLVVNAEIHMHMREF 278
+ V + + +F
Sbjct: 472 KKFFVTKDGMTQVADF 487
Score = 42 (19.8 bits), Expect = 6.3e-33, Sum P(3) = 6.3e-33
Identities = 8/35 (22%), Positives = 19/35 (54%)
Query: 15 LEAYAIGQWECFLLQLISSAQAERPTNFSSSMMKV 49
L+ +A +WE L ++ + P++ S++K+
Sbjct: 136 LDLFASQKWETILHFMVGTESTATPSDSVLSLLKL 170
>SGD|S000006043 [details] [associations]
symbol:TFB2 "Subunit of TFIIH and nucleotide excision repair
factor 3 complexes" species:4932 "Saccharomyces cerevisiae"
[GO:0005634 "nucleus" evidence=IEA] [GO:0006289
"nucleotide-excision repair" evidence=IEA;IMP] [GO:0006974
"response to DNA damage stimulus" evidence=IEA] [GO:0000990 "core
RNA polymerase binding transcription factor activity" evidence=IC]
[GO:0000439 "core TFIIH complex" evidence=IEA;IDA] [GO:0004003
"ATP-dependent DNA helicase activity" evidence=IEA] [GO:0006281
"DNA repair" evidence=IEA] [GO:0006355 "regulation of
transcription, DNA-dependent" evidence=IEA] [GO:0006366
"transcription from RNA polymerase II promoter" evidence=IDA]
[GO:0070816 "phosphorylation of RNA polymerase II C-terminal
domain" evidence=IDA] [GO:0005675 "holo TFIIH complex"
evidence=IDA] [GO:0006351 "transcription, DNA-dependent"
evidence=IEA] [GO:0000112 "nucleotide-excision repair factor 3
complex" evidence=IPI] [GO:0003690 "double-stranded DNA binding"
evidence=IDA] InterPro:IPR004598 Pfam:PF03849 SGD:S000006043
GO:GO:0006366 EMBL:BK006949 GO:GO:0003690 EMBL:U43503 GO:GO:0006289
GO:GO:0070816 GO:GO:0005675 GO:GO:0000439 GO:GO:0000112
eggNOG:COG5144 KO:K03144 PANTHER:PTHR13152 HOGENOM:HOG000170529
OMA:CVLHYMV TIGRFAMs:TIGR00625 GeneTree:ENSGT00390000014159
OrthoDB:EOG4CZFQD EMBL:U62804 PIR:S62000 RefSeq:NP_015203.1
PDB:3DGP PDB:3DOM PDBsum:3DGP PDBsum:3DOM ProteinModelPortal:Q02939
SMR:Q02939 DIP:DIP-5783N IntAct:Q02939 MINT:MINT-2785444
STRING:Q02939 PaxDb:Q02939 PeptideAtlas:Q02939 EnsemblFungi:YPL122C
GeneID:855981 KEGG:sce:YPL122C CYGD:YPL122c
EvolutionaryTrace:Q02939 NextBio:980816 Genevestigator:Q02939
GermOnline:YPL122C Uniprot:Q02939
Length = 513
Score = 187 (70.9 bits), Expect = 1.9e-30, Sum P(4) = 1.9e-30
Identities = 33/73 (45%), Positives = 55/73 (75%)
Query: 128 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 187
+G ++VETNF++Y+YS S L +L LF ++ + N+++G IT+ES+ A NGITA+Q
Sbjct: 336 DGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQ 395
Query: 188 IISFLQQNAHPRV 200
II++L+ +AHP++
Sbjct: 396 IIAYLETHAHPQM 408
Score = 128 (50.1 bits), Expect = 1.9e-30, Sum P(4) = 1.9e-30
Identities = 26/87 (29%), Positives = 45/87 (51%)
Query: 198 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 257
P + + +P V DQIRLW+ +L+RV Y +F + + YA+D LLW
Sbjct: 424 PNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLLW 483
Query: 258 EDSKKMRLVVNAEIHMHMREFLRGQNK 284
+D KK + ++ E + + +F + + K
Sbjct: 484 KDDKKKKFFISKEGNSQVLDFAKRKLK 510
Score = 96 (38.9 bits), Expect = 1.9e-30, Sum P(4) = 1.9e-30
Identities = 22/46 (47%), Positives = 30/46 (65%)
Query: 72 EAYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATNLS 117
+AY ++ LSE QR M++D D GLV Q+ +S F PTKLA L+
Sbjct: 235 KAYKIDALSETQRIMLQDMRDYGLV-FQKHSNDSIFYPTKLALMLT 279
Score = 45 (20.9 bits), Expect = 1.9e-30, Sum P(4) = 1.9e-30
Identities = 9/38 (23%), Positives = 22/38 (57%)
Query: 11 TLEDLEAYAIGQWECFLLQLISSAQAERPTNFSSSMMK 48
+L+ L+ Y+ +WE L ++ + A+ P+ +++K
Sbjct: 131 SLDLLDEYSANKWETILHFMVGTPLAKIPSEKVLNLLK 168
>UNIPROTKB|G4NCE6 [details] [associations]
symbol:MGG_00397 "RNA polymerase II transcription factor B
subunit 2" species:242507 "Magnaporthe oryzae 70-15" [GO:0043581
"mycelium development" evidence=IEP] InterPro:IPR004598
Pfam:PF03849 GO:GO:0005634 GO:GO:0006355 EMBL:CM001235
GO:GO:0006281 GO:GO:0043581 KO:K03144 PANTHER:PTHR13152
TIGRFAMs:TIGR00625 RefSeq:XP_003718679.1 EnsemblFungi:MGG_00397T0
GeneID:2674140 KEGG:mgr:MGG_00397 Uniprot:G4NCE6
Length = 490
Score = 307 (113.1 bits), Expect = 2.0e-29, Sum P(2) = 2.0e-29
Identities = 72/213 (33%), Positives = 124/213 (58%)
Query: 72 EAYNLNTLSEIQRSMIKDFADLGLVKLQQGRKESWFIPTKLATNL--------SMSLTDS 123
+AY+ LSE +R+M+ D D GLV + + +F PT+LAT L S+S +
Sbjct: 255 QAYDTGALSEERRNMLPDLNDFGLVYIPPSNPDQYF-PTRLATTLTSGSSALRSVSSGVA 313
Query: 124 SARKE-------GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLY 176
+A E G +++ETNFR+YAY+++ L IL LF+ ++ + ++ G +++ S+
Sbjct: 314 AATAEAGENNTKGAIILETNFRIYAYTSTPLQIAILALFANLKMRFAGMVTGQLSRHSIK 373
Query: 177 NAFENGITAEQIISFLQQNAHP---RVAD--RIPSVPENVCDQIRLWESDLNRVEMTPAH 231
A +GITA+QII +L +AH R+A P +P V DQIRLW+ + R+++ +
Sbjct: 374 RAISHGITADQIIEYLASHAHEQMHRIAAIRNKPVLPPTVVDQIRLWQLETERMQVQRGY 433
Query: 232 YYEEFPSRDVFEAACDYARDRSGLLWE-DSKKM 263
+++F S+ F+A DYA + L+W D++++
Sbjct: 434 LFKDFESQAEFKAIADYADEVGVLIWRSDARQL 466
Score = 41 (19.5 bits), Expect = 2.0e-29, Sum P(2) = 2.0e-29
Identities = 12/49 (24%), Positives = 26/49 (53%)
Query: 15 LEAYAIGQWECFLLQLISSAQ---AERPT---NFSSSMMKVFQRGLLIQ 57
L+ +A +W+ L +++S + + P+ NF S ++ + LL+Q
Sbjct: 141 LDKWARSRWDAILHYVVNSVEETDSMEPSKKYNFGGSKLQDTVKTLLVQ 189
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.321 0.134 0.390 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 284 284 0.00085 115 3 11 22 0.47 33
33 0.42 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 16
No. of states in DFA: 611 (65 KB)
Total size of DFA: 208 KB (2116 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 23.76u 0.08s 23.84t Elapsed: 00:00:01
Total cpu time: 23.76u 0.08s 23.84t Elapsed: 00:00:01
Start: Thu May 9 21:10:15 2013 End: Thu May 9 21:10:16 2013