Citrus Sinensis ID: 036799


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160------
MPSGIIARLPTSEDLEAYAIGQWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL
cccccccccccHHHHHHHHHcHHHHHccHHHHHHHHHHHccccccEEEEEEccEEEEEcccHHHHHHHHHHHHHHHHcccEEEEEEcHHHHHHHHHccccHHHHHcccccccccccccccHHHHHHHHHHHcccccccccEEEcccccHHHHHHHHHHHHHccccc
ccccEEEEccccHHHHHHHHHHHHHHHccccHHHHHHHcccccccEEEEEEcEEEEEEcccHHHHHHHHHHHHHHHHcccEEEEEEcHHHHHHHHHccccHHHHHccHHHccccccccccHHHHHHHHHHHHccEEEcccEEEcccccHHHHHHHHHHHHHccccc
mpsgiiarlptsedleAYAIGQWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIeyqlpnlivgaTTKESLNNAFENgitaeqqnahprvadripsipenvCDQIRLWESDlnrvemtpahyydefpsrdvFEAACDYardrsgll
mpsgiiarlptsedLEAYAIGQWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPahyydefpsrdVFEAACDYARDRSGLL
MPSGIIARLPTSEDLEAYAIGQWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL
************EDLEAYAIGQWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGI************DRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY********
********LPTSEDLEAYAIGQWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL
MPSGIIARLPTSEDLEAYAIGQWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL
*PSGIIARLPTSEDLEAYAIGQWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MPSGIIARLPTSEDLEAYAIGQWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query166 2.2.26 [Sep-21-2011]
P87303447 RNA polymerase II transcr yes no 0.722 0.268 0.380 7e-23
P60027462 General transcription fac yes no 0.704 0.253 0.406 8e-23
Q92759462 General transcription fac yes no 0.704 0.253 0.406 8e-23
O70422463 General transcription fac yes no 0.704 0.252 0.406 2e-22
Q54C29483 General transcription fac yes no 0.759 0.260 0.409 1e-21
Q6BZX4467 RNA polymerase II transcr yes no 0.765 0.271 0.345 1e-20
Q6CLR2496 RNA polymerase II transcr yes no 0.734 0.245 0.356 9e-20
Q6FP41504 RNA polymerase II transcr yes no 0.734 0.242 0.328 3e-17
Q75B51514 RNA polymerase II transcr yes no 0.734 0.237 0.321 9e-17
Q02939513 RNA polymerase II transcr yes no 0.734 0.237 0.321 4e-16
>sp|P87303|TFB2_SCHPO RNA polymerase II transcription factor B subunit 2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=tfb2 PE=1 SV=2 Back     alignment and function desciption
 Score =  106 bits (265), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 48/126 (38%), Positives = 79/126 (62%), Gaps = 6/126 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           + +GF++VETN+R+YAY++S L   I+ LF+ +  +  NL+VG  T++S+  A  NGI A
Sbjct: 288 DDKGFIIVETNYRLYAYTSSPLQIAIIGLFANLRARFSNLVVGVITRDSIRRALMNGIAA 347

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
           EQ       +AHP++   +P +P  + DQI LWE + NR+  TP   + +F +   F+ A
Sbjct: 348 EQIITYLTTHAHPQMRSNVPLLPPTLVDQIYLWELEKNRLRATPGILFRDFLTDSGFDQA 407

Query: 156 CDYARD 161
            +YA++
Sbjct: 408 VEYAKE 413




Component of the core-TFIIH basal transcription factor involved in nucleotide excision repair (NER) of DNA.
Schizosaccharomyces pombe (strain 972 / ATCC 24843) (taxid: 284812)
>sp|P60027|TF2H4_PANTR General transcription factor IIH subunit 4 OS=Pan troglodytes GN=GTF2H4 PE=3 SV=1 Back     alignment and function description
>sp|Q92759|TF2H4_HUMAN General transcription factor IIH subunit 4 OS=Homo sapiens GN=GTF2H4 PE=2 SV=1 Back     alignment and function description
>sp|O70422|TF2H4_MOUSE General transcription factor IIH subunit 4 OS=Mus musculus GN=Gtf2h4 PE=2 SV=1 Back     alignment and function description
>sp|Q54C29|TF2H4_DICDI General transcription factor IIH subunit 4 OS=Dictyostelium discoideum GN=gtf2h4 PE=3 SV=1 Back     alignment and function description
>sp|Q6BZX4|TFB2_YARLI RNA polymerase II transcription factor B subunit 2 OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=TFB2 PE=3 SV=1 Back     alignment and function description
>sp|Q6CLR2|TFB2_KLULA RNA polymerase II transcription factor B subunit 2 OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) GN=TFB2 PE=3 SV=1 Back     alignment and function description
>sp|Q6FP41|TFB2_CANGA RNA polymerase II transcription factor B subunit 2 OS=Candida glabrata (strain ATCC 2001 / CBS 138 / JCM 3761 / NBRC 0622 / NRRL Y-65) GN=TFB2 PE=3 SV=1 Back     alignment and function description
>sp|Q75B51|TFB2_ASHGO RNA polymerase II transcription factor B subunit 2 OS=Ashbya gossypii (strain ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056) GN=TFB2 PE=3 SV=1 Back     alignment and function description
>sp|Q02939|TFB2_YEAST RNA polymerase II transcription factor B subunit 2 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TFB2 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query166
255556021 451 tfiih, polypeptide, putative [Ricinus co 0.759 0.279 0.810 3e-57
449433857 451 PREDICTED: general transcription factor 0.759 0.279 0.818 1e-56
224077128 449 predicted protein [Populus trichocarpa] 0.759 0.280 0.772 5e-55
147854406 451 hypothetical protein VITISV_005315 [Viti 0.759 0.279 0.780 5e-55
356548733 451 PREDICTED: general transcription factor 0.759 0.279 0.795 1e-54
358346685 451 General transcription factor IIH subunit 0.885 0.325 0.670 2e-54
357461579 452 General transcription factor IIH subunit 0.885 0.325 0.664 2e-54
115461368 444 Os04g0680000 [Oryza sativa Japonica Grou 0.879 0.328 0.541 5e-54
90399251 470 B0213E10.4 [Oryza sativa Indica Group] 0.873 0.308 0.549 6e-54
356521347 451 PREDICTED: general transcription factor 0.759 0.279 0.772 3e-53
>gi|255556021|ref|XP_002519045.1| tfiih, polypeptide, putative [Ricinus communis] gi|223541708|gb|EEF43256.1| tfiih, polypeptide, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  226 bits (575), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 107/132 (81%), Positives = 118/132 (89%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S  QGFVVVETNFRMYAYSTSKLH EI+RLFS++EYQLPNL+VGA TKESL +AFENGIT
Sbjct: 293 SRKQGFVVVETNFRMYAYSTSKLHCEIMRLFSRVEYQLPNLVVGAMTKESLYSAFENGIT 352

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AEQ      QNAHPRVA+RIPS+PENV DQIRLWESD+NRVEMTPAH YDEFPSRDVFEA
Sbjct: 353 AEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWESDMNRVEMTPAHLYDEFPSRDVFEA 412

Query: 155 ACDYARDRSGLL 166
           AC++ARD +GLL
Sbjct: 413 ACNFARDWNGLL 424




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449433857|ref|XP_004134713.1| PREDICTED: general transcription factor IIH subunit 4-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|224077128|ref|XP_002305144.1| predicted protein [Populus trichocarpa] gi|222848108|gb|EEE85655.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|147854406|emb|CAN81292.1| hypothetical protein VITISV_005315 [Vitis vinifera] Back     alignment and taxonomy information
>gi|356548733|ref|XP_003542754.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine max] Back     alignment and taxonomy information
>gi|358346685|ref|XP_003637396.1| General transcription factor IIH subunit [Medicago truncatula] gi|355503331|gb|AES84534.1| General transcription factor IIH subunit [Medicago truncatula] Back     alignment and taxonomy information
>gi|357461579|ref|XP_003601071.1| General transcription factor IIH subunit [Medicago truncatula] gi|355490119|gb|AES71322.1| General transcription factor IIH subunit [Medicago truncatula] Back     alignment and taxonomy information
>gi|115461368|ref|NP_001054284.1| Os04g0680000 [Oryza sativa Japonica Group] gi|113565855|dbj|BAF16198.1| Os04g0680000 [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|90399251|emb|CAJ86205.1| B0213E10.4 [Oryza sativa Indica Group] Back     alignment and taxonomy information
>gi|356521347|ref|XP_003529318.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query166
DICTYBASE|DDB_G0293228483 gtf2h4 "TFIIH subunit" [Dictyo 0.759 0.260 0.409 2.4e-26
POMBASE|SPBC13G1.13447 tfb2 "transcription factor TFI 0.722 0.268 0.380 5.7e-25
ZFIN|ZDB-GENE-030131-6779466 gtf2h4 "general transcription 0.698 0.248 0.426 8.1e-24
WB|WBGene00013529481 Y73F8A.24 [Caenorhabditis eleg 0.825 0.284 0.412 1e-22
ASPGD|ASPL0000068300482 AN7015 [Emericella nidulans (t 0.397 0.136 0.416 1.2e-22
UNIPROTKB|A6H7G8463 GTF2H4 "Uncharacterized protei 0.759 0.272 0.386 3.2e-21
UNIPROTKB|Q92759462 GTF2H4 "General transcription 0.704 0.253 0.406 5.2e-21
UNIPROTKB|P60027462 GTF2H4 "General transcription 0.704 0.253 0.406 5.2e-21
UNIPROTKB|F1PGI3463 GTF2H4 "Uncharacterized protei 0.728 0.261 0.385 5.3e-21
UNIPROTKB|Q767M2463 TFIIH "Uncharacterized protein 0.704 0.252 0.406 5.3e-21
DICTYBASE|DDB_G0293228 gtf2h4 "TFIIH subunit" [Dictyostelium discoideum (taxid:44689)] Back     alignment and assigned GO terms
 Score = 274 (101.5 bits), Expect = 2.4e-26, Sum P(2) = 2.4e-26
 Identities = 54/132 (40%), Positives = 83/132 (62%)

Query:    37 REYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFE 96
             R      QG++V+ETN+R+YAY++S L   +L LF K+ Y+LPNL VG  T+ES+  A  
Sbjct:   320 RTQTQKEQGYIVLETNYRLYAYTSSSLQISLLSLFVKMLYRLPNLAVGIITRESIRTALI 379

Query:    97 NGITAEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRD 150
             +GITA+Q       N+HP  A+    IP+ V +QI LWE++ NR+  T +  Y+ FP+ D
Sbjct:   380 HGITADQIIDFVRHNSHPNAANSGQPIPDVVAEQILLWEAERNRITYTKSVLYNSFPTND 439

Query:   151 VFEAACDYARDR 162
              + A   +A+++
Sbjct:   440 CYIATLKFAKEQ 451


GO:0006289 "nucleotide-excision repair" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=IEA
GO:0004003 "ATP-dependent DNA helicase activity" evidence=IEA
GO:0000439 "core TFIIH complex" evidence=IEA
GO:0006366 "transcription from RNA polymerase II promoter" evidence=ISS
GO:0005675 "holo TFIIH complex" evidence=ISS
GO:0006974 "response to DNA damage stimulus" evidence=IEA
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEA
GO:0006351 "transcription, DNA-dependent" evidence=IEA
GO:0006281 "DNA repair" evidence=IEA
GO:0044351 "macropinocytosis" evidence=RCA
POMBASE|SPBC13G1.13 tfb2 "transcription factor TFIIH complex subunit Tfb2" [Schizosaccharomyces pombe (taxid:4896)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-030131-6779 gtf2h4 "general transcription factor IIH, polypeptide 4" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
WB|WBGene00013529 Y73F8A.24 [Caenorhabditis elegans (taxid:6239)] Back     alignment and assigned GO terms
ASPGD|ASPL0000068300 AN7015 [Emericella nidulans (taxid:162425)] Back     alignment and assigned GO terms
UNIPROTKB|A6H7G8 GTF2H4 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|Q92759 GTF2H4 "General transcription factor IIH subunit 4" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|P60027 GTF2H4 "General transcription factor IIH subunit 4" [Pan troglodytes (taxid:9598)] Back     alignment and assigned GO terms
UNIPROTKB|F1PGI3 GTF2H4 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|Q767M2 TFIIH "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
estExt_fgenesh4_pg.C_LG_IV0538
hypothetical protein (449 aa)
(Populus trichocarpa)
Predicted Functional Partners:
fgenesh4_pg.C_LG_I000900
hypothetical protein (803 aa)
    0.885
eugene3.00020503
hypothetical protein (758 aa)
    0.836
gw1.V.5266.1
hypothetical protein (289 aa)
     0.795
estExt_fgenesh4_pm.C_1650003
hypothetical protein (413 aa)
    0.744
estExt_Genewise1_v1.C_1330021
hypothetical protein (334 aa)
      0.573
estExt_Genewise1_v1.C_LG_IX3069
SubName- Full=Putative uncharacterized protein; (117 aa)
      0.494
grail3.0014029901
hypothetical protein (52 aa)
       0.479
gw1.70.559.1
hypothetical protein (329 aa)
      0.472
gw1.IX.2304.1
hypothetical protein (321 aa)
       0.442
fgenesh4_pg.C_scaffold_82000017
hypothetical protein (603 aa)
      0.432

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query166
pfam03849365 pfam03849, Tfb2, Transcription factor Tfb2 1e-32
TIGR00625448 TIGR00625, tfb2, Transcription factor tfb2 1e-31
COG5144447 COG5144, TFB2, RNA polymerase II transcription ini 1e-31
pfam03849365 pfam03849, Tfb2, Transcription factor Tfb2 6e-04
>gnl|CDD|217760 pfam03849, Tfb2, Transcription factor Tfb2 Back     alignment and domain information
 Score =  118 bits (298), Expect = 1e-32
 Identities = 39/74 (52%), Positives = 55/74 (74%), Gaps = 6/74 (8%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           + S+GF++VETNFR+YAY++S L   IL LF +++Y+ PNL+VG  T+ES+ NA ENGIT
Sbjct: 292 AASKGFIIVETNFRLYAYTSSPLQIAILALFVELKYRFPNLVVGQITRESIRNALENGIT 351

Query: 101 AEQ------QNAHP 108
           A+Q       +AHP
Sbjct: 352 ADQIISYLETHAHP 365


Length = 365

>gnl|CDD|233058 TIGR00625, tfb2, Transcription factor tfb2 Back     alignment and domain information
>gnl|CDD|227473 COG5144, TFB2, RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB2 [Transcription / DNA replication, recombination, and repair] Back     alignment and domain information
>gnl|CDD|217760 pfam03849, Tfb2, Transcription factor Tfb2 Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 166
TIGR00625448 tfb2 Transcription factor tfb2. This family is bas 100.0
KOG3471465 consensus RNA polymerase II transcription initiati 100.0
COG5144447 TFB2 RNA polymerase II transcription initiation/nu 100.0
PF03849366 Tfb2: Transcription factor Tfb2; InterPro: IPR0045 100.0
PF13625129 Helicase_C_3: Helicase conserved C-terminal domain 99.18
TIGR00603 732 rad25 DNA repair helicase rad25. All proteins in t 86.73
>TIGR00625 tfb2 Transcription factor tfb2 Back     alignment and domain information
Probab=100.00  E-value=2e-62  Score=443.78  Aligned_cols=122  Identities=42%  Similarity=0.795  Sum_probs=120.1

Q ss_pred             ceEEEEecceEEEEcCCHhHHHHHHHHHhhHhhcCceEEEEeCHHHHHHHHHcCCChhh------hhcChhhh-hcCCCC
Q 036799           45 GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVA-DRIPSI  117 (166)
Q Consensus        45 gfIIvETNFRvYAYT~S~LqiaiL~lF~~l~~r~PNlvvG~iTR~Sv~~Al~~GITA~Q------~~AHp~m~-~~~p~i  117 (166)
                      |||||||||||||||+||||++||++||+++||||||+||+|||+||++|+++||||||      +||||+|+ +..|+|
T Consensus       292 g~iivEtNfrvYaYt~s~l~~~il~lF~~~~~r~pnlvvg~iTr~Sv~~A~~~GITa~qIi~fl~~~ahp~~~~~~~~~l  371 (448)
T TIGR00625       292 GFIIVETNYRLYAYTESPLQIALIALFSELLARFPNMVVGQITRESIRRALANGITAQQIIHYLRTHAHPQMRKEQTPVL  371 (448)
T ss_pred             ceEEEEecceEEEecCCHHHHHHHHHHHHHHhcCCceEEEEecHHHHHHHHHcCCCHHHHHHHHHhcCChhhhccCCCCC
Confidence            99999999999999999999999999999999999999999999999999999999999      99999998 478999


Q ss_pred             CHHHHHHHHHHHHhcCceeeccceeecCCCCHHHHHHHHHHHHHcCCCC
Q 036799          118 PENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL  166 (166)
Q Consensus       118 P~tV~DQIrLWE~ErnRl~~~~g~Ly~~F~s~~~f~~~~~yA~~~g~Ll  166 (166)
                      ||||+|||||||.||||+++++|+||++|+|+.+|+++++||+++|+|+
T Consensus       372 P~tv~dQi~lWe~e~~R~~~~~~~l~~~f~s~~~y~~~~~ya~~~~~l~  420 (448)
T TIGR00625       372 PPTIVDQIRLWELERDRLRFTEGVLYNDFLTQVDFELLLAYARELGVLV  420 (448)
T ss_pred             ChHHHHHHHHHHHHhcceEeecceeeeecCCHHHHHHHHHHHHHcCEEE
Confidence            9999999999999999999999999999999999999999999999985



This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).

>KOG3471 consensus RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB2 [Transcription; Replication, recombination and repair] Back     alignment and domain information
>COG5144 TFB2 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB2 [Transcription / DNA replication, recombination, and repair] Back     alignment and domain information
>PF03849 Tfb2: Transcription factor Tfb2; InterPro: IPR004598 Members of this family are part of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair Back     alignment and domain information
>PF13625 Helicase_C_3: Helicase conserved C-terminal domain Back     alignment and domain information
>TIGR00603 rad25 DNA repair helicase rad25 Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query166
3dom_A108 RNA polymerase II transcription factor B subunit; 1e-25
3dgp_A80 RNA polymerase II transcription factor B subunit; 2e-18
>3dom_A RNA polymerase II transcription factor B subunit; protein-protein complex, heterodimer, beta-alpha-beta split, strand addition; 2.60A {Saccharomyces cerevisiae} Length = 108 Back     alignment and structure
 Score = 93.5 bits (232), Expect = 1e-25
 Identities = 19/64 (29%), Positives = 30/64 (46%)

Query: 103 QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDR 162
           +    P   + +  +P  V DQIRLW+ +L+RV       Y +F +   +     YA+D 
Sbjct: 14  KLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDI 73

Query: 163 SGLL 166
             LL
Sbjct: 74  GVLL 77


>3dgp_A RNA polymerase II transcription factor B subunit; protein-protein complex, beta-alpha-beta spilt, heterodimer, damage, DNA excision; 1.80A {Saccharomyces cerevisiae} Length = 80 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query166
3dom_A108 RNA polymerase II transcription factor B subunit; 99.89
3dgp_A80 RNA polymerase II transcription factor B subunit; 99.84
>3dom_A RNA polymerase II transcription factor B subunit; protein-protein complex, heterodimer, beta-alpha-beta split, strand addition; 2.60A {Saccharomyces cerevisiae} Back     alignment and structure
Probab=99.89  E-value=1.1e-24  Score=165.29  Aligned_cols=61  Identities=31%  Similarity=0.604  Sum_probs=45.4

Q ss_pred             cChhhhhcCCCCCHHHHHHHHHHHHhcCceeeccceeecCCCCHHHHHHHHHHHHHcCCCC
Q 036799          106 AHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL  166 (166)
Q Consensus       106 AHp~m~~~~p~iP~tV~DQIrLWE~ErnRl~~~~g~Ly~~F~s~~~f~~~~~yA~~~g~Ll  166 (166)
                      ..|+++.+.|+|||||+|||||||.|||||++++|+||++|.|+++|+++++||+++|+||
T Consensus        17 ~~~~~~~~~p~lPpTVvDQIrLWE~ErnRl~~~~G~LY~dF~s~~efe~v~~yA~e~gvLl   77 (108)
T 3dom_A           17 LDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLL   77 (108)
T ss_dssp             ---------------CCCHHHHHHHHHTTCEEEEEEEEECCSCHHHHHHHHHHHHHHTCEE
T ss_pred             cCcccccCCCCCCCcHHHHHHHHHHhhCceeccceEEEecCCCHHHHHHHHHHHHHcCeEE
Confidence            4566667789999999999999999999999999999999999999999999999999986



>3dgp_A RNA polymerase II transcription factor B subunit; protein-protein complex, beta-alpha-beta spilt, heterodimer, damage, DNA excision; 1.80A {Saccharomyces cerevisiae} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query166
d1wbha1213 KDPG aldolase {Escherichia coli [TaxId: 562]} 81.9
d1mxsa_216 KDPG aldolase {Pseudomonas putida [TaxId: 303]} 81.41
d1vhca_212 Hypothetical protein HI0047 {Haemophilus influenza 80.17
>d1wbha1 c.1.10.1 (A:1-213) KDPG aldolase {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: TIM beta/alpha-barrel
superfamily: Aldolase
family: Class I aldolase
domain: KDPG aldolase
species: Escherichia coli [TaxId: 562]
Probab=81.90  E-value=0.96  Score=34.97  Aligned_cols=74  Identities=22%  Similarity=0.326  Sum_probs=56.2

Q ss_pred             HHHHHhhHhhcCceEEE---EeCHHHHHHHHHcCCChhh-hhcChhhhh-----cCCCCC--HHHHHHHHHHHHhcCcee
Q 036799           68 LRLFSKIEYQLPNLIVG---ATTKESLNNAFENGITAEQ-QNAHPRVAD-----RIPSIP--ENVCDQIRLWESDLNRVE  136 (166)
Q Consensus        68 L~lF~~l~~r~PNlvvG---~iTR~Sv~~Al~~GITA~Q-~~AHp~m~~-----~~p~iP--~tV~DQIrLWE~ErnRl~  136 (166)
                      +..+.++..+||++++|   ++|.+.+++|.+.|..-== -+.+|.+.+     ..|.+|  -|..+=...|+.=-+-+|
T Consensus        54 ~~~I~~l~~~~p~~~vGaGTV~~~~~~~~a~~aGa~FivSP~~~~~v~~~a~~~~i~~iPGv~TpsEi~~A~~~G~~~vK  133 (213)
T d1wbha1          54 VDAIRAIAKEVPEAIVGAGTVLNPQQLAEVTEAGAQFAISPGLTEPLLKAATEGTIPLIPGISTVSELMLGMDYGLKEFK  133 (213)
T ss_dssp             HHHHHHHHHHCTTSEEEEESCCSHHHHHHHHHHTCSCEEESSCCHHHHHHHHHSSSCEEEEESSHHHHHHHHHTTCCEEE
T ss_pred             HHHHHHHHHHCCCCeeeccccccHHHHHHHHHCCCcEEECCCCCHHHHHHHHhcCCCccCCcCCHHHHHHHHHCCCCEEE
Confidence            44555667799999999   8999999999999975433 334444432     367888  677787889999999999


Q ss_pred             eccce
Q 036799          137 MTPAH  141 (166)
Q Consensus       137 ~~~g~  141 (166)
                      +.|+-
T Consensus       134 lFPA~  138 (213)
T d1wbha1         134 FFPAE  138 (213)
T ss_dssp             ETTTT
T ss_pred             eccch
Confidence            99873



>d1mxsa_ c.1.10.1 (A:) KDPG aldolase {Pseudomonas putida [TaxId: 303]} Back     information, alignment and structure
>d1vhca_ c.1.10.1 (A:) Hypothetical protein HI0047 {Haemophilus influenzae [TaxId: 727]} Back     information, alignment and structure