BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 041357
(169 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224077128|ref|XP_002305144.1| predicted protein [Populus trichocarpa]
gi|222848108|gb|EEE85655.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 285 bits (730), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 136/183 (74%), Positives = 155/183 (84%), Gaps = 19/183 (10%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RKESWFIPT +A N G+VVVETNF++YAYS+SKLHCEILRLFSKIE
Sbjct: 266 QGRKESWFIPTKLATNLSVSLTDSSSRKQGYVVVETNFRLYAYSSSKLHCEILRLFSKIE 325
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVGAITKESLY AFENGIT++Q QNAHPRVA+++PSVP+NV DQIRLWE
Sbjct: 326 YQLPNLIVGAITKESLYTAFENGITSDQIISFLQQNAHPRVAERLPSVPENVTDQIRLWE 385
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+DLNRVE TP+H+YDEFPSRD FEAACD+AR+ +GLLWEDSKKMR+VV AEIHM+MREFL
Sbjct: 386 ADLNRVEITPSHFYDEFPSRDTFEAACDFAREWNGLLWEDSKKMRVVVKAEIHMNMREFL 445
Query: 165 RGQ 167
RGQ
Sbjct: 446 RGQ 448
>gi|255556021|ref|XP_002519045.1| tfiih, polypeptide, putative [Ricinus communis]
gi|223541708|gb|EEF43256.1| tfiih, polypeptide, putative [Ricinus communis]
Length = 451
Score = 285 bits (728), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 135/183 (73%), Positives = 155/183 (84%), Gaps = 19/183 (10%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RKESWFIPT +A N GFVVVETNF+MYAYSTSKLHCEI+RLFS++E
Sbjct: 268 QGRKESWFIPTKLATNLSMSLTDSSSRKQGFVVVETNFRMYAYSTSKLHCEIMRLFSRVE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNL+VGA+TKESLY+AFENGIT EQ QNAHPRVA++IPSVP+NV DQIRLWE
Sbjct: 328 YQLPNLVVGAMTKESLYSAFENGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
SD+NRVE TPAH YDEFPSRDVFEAAC++ARD +GLLWEDSK+MR+VV AEIH++MRE+L
Sbjct: 388 SDMNRVEMTPAHLYDEFPSRDVFEAACNFARDWNGLLWEDSKRMRMVVKAEIHLNMREYL 447
Query: 165 RGQ 167
RGQ
Sbjct: 448 RGQ 450
>gi|449433857|ref|XP_004134713.1| PREDICTED: general transcription factor IIH subunit 4-like [Cucumis
sativus]
Length = 451
Score = 283 bits (724), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 138/183 (75%), Positives = 153/183 (83%), Gaps = 19/183 (10%)
Query: 4 QSRKESWFIPT---------IADN----CGFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RKESWFIPT +AD+ GFVVVETNF+MYAYSTSKLHCEILRLFS+IE
Sbjct: 268 QGRKESWFIPTKLATNLSMSLADSSSRKLGFVVVETNFRMYAYSTSKLHCEILRLFSRIE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVGAITKESLYNAF+NGIT EQ QNAHPRVA++IPSVP+NV DQIRLWE
Sbjct: 328 YQLPNLIVGAITKESLYNAFKNGITAEQIVTFLQQNAHPRVAERIPSVPENVTDQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
SDLNRV+ TPAH+YDEFPSR+VFEAACDYAR+ +GLLWEDSK +RLVV A+IH HMRE L
Sbjct: 388 SDLNRVDITPAHFYDEFPSREVFEAACDYAREWNGLLWEDSKNLRLVVKADIHTHMREHL 447
Query: 165 RGQ 167
R Q
Sbjct: 448 RRQ 450
>gi|147854406|emb|CAN81292.1| hypothetical protein VITISV_005315 [Vitis vinifera]
Length = 451
Score = 283 bits (724), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 135/181 (74%), Positives = 151/181 (83%), Gaps = 19/181 (10%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RKESWFIPT +A N GFVVVETNF++YAYS+SKLHCEILRLFS++E
Sbjct: 268 QGRKESWFIPTKLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSSKLHCEILRLFSRVE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVGAITKESLYNAFENGIT EQ QNAHPRVA++ P+VP+NV DQIRLWE
Sbjct: 328 YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAERTPAVPENVTDQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+DLNRVET P+H YDEFPSRDVFEAACD+AR+ GLLWEDSKKMRLVV AEIH+HMRE+L
Sbjct: 388 TDLNRVETMPSHLYDEFPSRDVFEAACDFAREYGGLLWEDSKKMRLVVKAEIHLHMREYL 447
Query: 165 R 165
R
Sbjct: 448 R 448
>gi|358346685|ref|XP_003637396.1| General transcription factor IIH subunit [Medicago truncatula]
gi|355503331|gb|AES84534.1| General transcription factor IIH subunit [Medicago truncatula]
Length = 451
Score = 269 bits (687), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 132/183 (72%), Positives = 149/183 (81%), Gaps = 19/183 (10%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RKESWFIPT +A N GFVVVETNF++YAYSTSKLHCEILRLFS++E
Sbjct: 268 QGRKESWFIPTKLATNLSVSMTESSSRKEGFVVVETNFRVYAYSTSKLHCEILRLFSRVE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVGAITKESLYNAF+NGIT +Q QNAHPRVA +IP+VP+NV DQIRLWE
Sbjct: 328 YQLPNLIVGAITKESLYNAFDNGITADQIVSFFRQNAHPRVAQRIPAVPENVTDQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
SDLNRVE T A+YYDEFPSRDVFE ACD AR+ SGLLWEDSKKM LVV +E+H ++R+FL
Sbjct: 388 SDLNRVEMTEAYYYDEFPSRDVFEGACDCAREWSGLLWEDSKKMHLVVKSEVHTYVRDFL 447
Query: 165 RGQ 167
R Q
Sbjct: 448 RRQ 450
>gi|357461579|ref|XP_003601071.1| General transcription factor IIH subunit [Medicago truncatula]
gi|355490119|gb|AES71322.1| General transcription factor IIH subunit [Medicago truncatula]
Length = 452
Score = 269 bits (687), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 131/183 (71%), Positives = 149/183 (81%), Gaps = 19/183 (10%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RKESWFIPT +A N GFVVVETNF++YAYSTSKLHCEILRLFS++E
Sbjct: 269 QGRKESWFIPTKLATNLSVSMTESSSRKEGFVVVETNFRVYAYSTSKLHCEILRLFSRVE 328
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVGAITKESLYNAF+NGIT +Q QNAHPRVA ++P+VP+NV DQIRLWE
Sbjct: 329 YQLPNLIVGAITKESLYNAFDNGITADQIVSFLRQNAHPRVAQRVPAVPENVTDQIRLWE 388
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
SDLNRVE T A+YYDEFPSRDVFE ACD AR+ SGLLWEDSKKM LVV +E+H ++R+FL
Sbjct: 389 SDLNRVEMTEAYYYDEFPSRDVFEGACDCAREWSGLLWEDSKKMHLVVKSEVHTYVRDFL 448
Query: 165 RGQ 167
R Q
Sbjct: 449 RRQ 451
>gi|356548733|ref|XP_003542754.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine
max]
Length = 451
Score = 264 bits (674), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 128/183 (69%), Positives = 149/183 (81%), Gaps = 19/183 (10%)
Query: 4 QSRKESWFIPT---------IADNC----GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK SWFIPT +AD+ GFVVVETNF++YAYSTSKLHCEILRLFS++E
Sbjct: 268 QGRKGSWFIPTKLATNLSMSLADSSSRKQGFVVVETNFRVYAYSTSKLHCEILRLFSRVE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVGAITKESLY+AFENGIT EQ QNAHPRVA++IPSVP+NV +QIRLWE
Sbjct: 328 YQLPNLIVGAITKESLYSAFENGITAEQIITFLQQNAHPRVAERIPSVPENVTEQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+DLNRVE T A+YYDEFPSRDVFE ACD AR+ +GLLWEDSKKM +VV E+H ++R++L
Sbjct: 388 ADLNRVEMTDAYYYDEFPSRDVFEGACDCAREWNGLLWEDSKKMHMVVKTEVHQYVRDYL 447
Query: 165 RGQ 167
R Q
Sbjct: 448 RRQ 450
>gi|297800414|ref|XP_002868091.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp.
lyrata]
gi|297313927|gb|EFH44350.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 262 bits (669), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 124/185 (67%), Positives = 148/185 (80%), Gaps = 19/185 (10%)
Query: 4 QSRKESWFIPT---------IADNC----GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK+SWFIPT +AD+ GFVV+ETNF+MYAYSTSKL CEILRLF++IE
Sbjct: 268 QGRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLI AITKESLYNAF+NGIT++Q QN+HPR AD++PS+P+NV DQIRLWE
Sbjct: 328 YQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+DL R+E T AH+YDEFPS+DVFEAACD+AR+ GLLWEDSK+MRLVV +E+H MREFL
Sbjct: 388 TDLKRIEMTQAHFYDEFPSKDVFEAACDFAREWGGLLWEDSKRMRLVVKSEVHNQMREFL 447
Query: 165 RGQNK 169
Q K
Sbjct: 448 HNQTK 452
>gi|42566894|ref|NP_193435.2| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
gi|51969678|dbj|BAD43531.1| unnamed protein product [Arabidopsis thaliana]
gi|115646777|gb|ABJ17114.1| At4g17020 [Arabidopsis thaliana]
gi|332658439|gb|AEE83839.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
Length = 452
Score = 260 bits (665), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 123/185 (66%), Positives = 149/185 (80%), Gaps = 19/185 (10%)
Query: 4 QSRKESWFIPT---------IADNC----GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK+SWFIPT +AD+ GFVV+ETNF+MYAYSTSKL CEILRLF++IE
Sbjct: 268 QGRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLI AITKESLYNAF+NGIT++Q QN+HPR AD++PS+P+NV DQIRLWE
Sbjct: 328 YQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+DL R+E T AH+YDEFPS+DVFEAACD+AR+ GLLWEDSK+MRLVV +E+H MREFL
Sbjct: 388 TDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREFL 447
Query: 165 RGQNK 169
Q++
Sbjct: 448 HTQSR 452
>gi|42572935|ref|NP_974564.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
gi|332658438|gb|AEE83838.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
Length = 462
Score = 260 bits (664), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 123/185 (66%), Positives = 149/185 (80%), Gaps = 19/185 (10%)
Query: 4 QSRKESWFIPT---------IADNC----GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK+SWFIPT +AD+ GFVV+ETNF+MYAYSTSKL CEILRLF++IE
Sbjct: 268 QGRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLI AITKESLYNAF+NGIT++Q QN+HPR AD++PS+P+NV DQIRLWE
Sbjct: 328 YQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+DL R+E T AH+YDEFPS+DVFEAACD+AR+ GLLWEDSK+MRLVV +E+H MREFL
Sbjct: 388 TDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREFL 447
Query: 165 RGQNK 169
Q++
Sbjct: 448 HTQSR 452
>gi|334186626|ref|NP_001190745.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
gi|332658440|gb|AEE83840.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
Length = 482
Score = 259 bits (662), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 123/184 (66%), Positives = 148/184 (80%), Gaps = 19/184 (10%)
Query: 4 QSRKESWFIPT---------IADNC----GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK+SWFIPT +AD+ GFVV+ETNF+MYAYSTSKL CEILRLF++IE
Sbjct: 268 QGRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLI AITKESLYNAF+NGIT++Q QN+HPR AD++PS+P+NV DQIRLWE
Sbjct: 328 YQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+DL R+E T AH+YDEFPS+DVFEAACD+AR+ GLLWEDSK+MRLVV +E+H MREFL
Sbjct: 388 TDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREFL 447
Query: 165 RGQN 168
Q+
Sbjct: 448 HTQS 451
>gi|51969958|dbj|BAD43671.1| unnamed protein product [Arabidopsis thaliana]
Length = 452
Score = 259 bits (662), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 123/185 (66%), Positives = 148/185 (80%), Gaps = 19/185 (10%)
Query: 4 QSRKESWFIPT---------IADNC----GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK+SWFIPT +AD+ GFVV+ETNF+MYAYSTSKL CEILRLF++IE
Sbjct: 268 QGRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLI AITKESLYNAF NGIT++Q QN+HPR AD++PS+P+NV DQIRLWE
Sbjct: 328 YQLPNLIACAITKESLYNAFGNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+DL R+E T AH+YDEFPS+DVFEAACD+AR+ GLLWEDSK+MRLVV +E+H MREFL
Sbjct: 388 TDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREFL 447
Query: 165 RGQNK 169
Q++
Sbjct: 448 HTQSR 452
>gi|356521347|ref|XP_003529318.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine
max]
Length = 451
Score = 259 bits (661), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 125/183 (68%), Positives = 148/183 (80%), Gaps = 19/183 (10%)
Query: 4 QSRKESWFIPT---------IADNC----GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK SWFIPT +AD+ GFVVVETNF++YAYSTSKLHCEILRLFS++E
Sbjct: 268 QGRKGSWFIPTKLATNLSMSLADSSSRKQGFVVVETNFRVYAYSTSKLHCEILRLFSRVE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVGAITKESLY+A+ENGIT EQ QNAHPRVA+++PSVP+NV +QIRLWE
Sbjct: 328 YQLPNLIVGAITKESLYSAYENGITAEQIVTFLQQNAHPRVAERVPSVPENVTEQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+DLNRVE T +YYDEFPSRDVFE ACD AR+ +GLLWEDSKKM +VV E+H ++R++L
Sbjct: 388 ADLNRVEMTDTYYYDEFPSRDVFEGACDCAREWNGLLWEDSKKMHMVVKTEVHPYVRDYL 447
Query: 165 RGQ 167
R Q
Sbjct: 448 RRQ 450
>gi|414584751|tpg|DAA35322.1| TPA: hypothetical protein ZEAMMB73_386928 [Zea mays]
Length = 451
Score = 258 bits (659), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 124/183 (67%), Positives = 142/183 (77%), Gaps = 19/183 (10%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK+SWFIPT +A N GFVVVETNF+MYAYSTSKLHCEILRLF+++E
Sbjct: 268 QGRKDSWFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSKLHCEILRLFARVE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVGA+TKES+Y AFENGIT EQ QNAHPRVADKIP+VP+NV DQIRLWE
Sbjct: 328 YQLPNLIVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIPAVPENVTDQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+DLNRVE P+H Y++FPS++ FE CDYARD LLWED K+MRL+V E H MREFL
Sbjct: 388 TDLNRVEMIPSHLYEDFPSKEWFEQCCDYARDNGYLLWEDPKRMRLIVRGEFHPEMREFL 447
Query: 165 RGQ 167
R Q
Sbjct: 448 RRQ 450
>gi|32487401|emb|CAE05735.1| OSJNBb0017I01.15 [Oryza sativa Japonica Group]
Length = 451
Score = 256 bits (654), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 123/181 (67%), Positives = 142/181 (78%), Gaps = 19/181 (10%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK+SWFIPT +A N GFVVVETNF+MYAYSTS+LHCEILRLFS++E
Sbjct: 268 QGRKDSWFIPTKLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSRLHCEILRLFSRVE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVG+ITKESLY AFENGIT EQ QNAHPRVADKIP+VP+NV DQIRLWE
Sbjct: 328 YQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIPAVPENVTDQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+D NRV+ T +H Y++FPS+D+F+ CDYARD LLWED+KKMRL+V E H MREFL
Sbjct: 388 TDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRLIVRVEFHSEMREFL 447
Query: 165 R 165
R
Sbjct: 448 R 448
>gi|218195833|gb|EEC78260.1| hypothetical protein OsI_17944 [Oryza sativa Indica Group]
Length = 459
Score = 256 bits (653), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 123/181 (67%), Positives = 142/181 (78%), Gaps = 19/181 (10%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK+SWFIPT +A N GFVVVETNF+MYAYSTS+LHCEILRLFS++E
Sbjct: 276 QGRKDSWFIPTKLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSRLHCEILRLFSRVE 335
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVG+ITKESLY AFENGIT EQ QNAHPRVADKIP+VP+NV DQIRLWE
Sbjct: 336 YQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIPAVPENVTDQIRLWE 395
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+D NRV+ T +H Y++FPS+D+F+ CDYARD LLWED+KKMRL+V E H MREFL
Sbjct: 396 TDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRLIVRVEFHSEMREFL 455
Query: 165 R 165
R
Sbjct: 456 R 456
>gi|222629781|gb|EEE61913.1| hypothetical protein OsJ_16644 [Oryza sativa Japonica Group]
Length = 427
Score = 255 bits (652), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 123/181 (67%), Positives = 142/181 (78%), Gaps = 19/181 (10%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK+SWFIPT +A N GFVVVETNF+MYAYSTS+LHCEILRLFS++E
Sbjct: 244 QGRKDSWFIPTKLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSRLHCEILRLFSRVE 303
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVG+ITKESLY AFENGIT EQ QNAHPRVADKIP+VP+NV DQIRLWE
Sbjct: 304 YQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIPAVPENVTDQIRLWE 363
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+D NRV+ T +H Y++FPS+D+F+ CDYARD LLWED+KKMRL+V E H MREFL
Sbjct: 364 TDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRLIVRVEFHSEMREFL 423
Query: 165 R 165
R
Sbjct: 424 R 424
>gi|242074798|ref|XP_002447335.1| hypothetical protein SORBIDRAFT_06g033130 [Sorghum bicolor]
gi|241938518|gb|EES11663.1| hypothetical protein SORBIDRAFT_06g033130 [Sorghum bicolor]
Length = 451
Score = 254 bits (650), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 123/183 (67%), Positives = 140/183 (76%), Gaps = 19/183 (10%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q R + WFIPT +A N GFVVVETNF+MYAYSTSKLHCEILRLFS++E
Sbjct: 268 QGRTDRWFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSKLHCEILRLFSRVE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVGA+TKES+Y AFENGIT EQ QNAHPRVADKIP+VP+NV DQIRLWE
Sbjct: 328 YQLPNLIVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIPTVPENVTDQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+D NRVE P+H Y++FPS++ FE CDYARD LLWEDSK+MRL+V E H MREFL
Sbjct: 388 TDRNRVEMIPSHLYEDFPSKEWFEQCCDYARDNGYLLWEDSKRMRLIVRGEFHPEMREFL 447
Query: 165 RGQ 167
R Q
Sbjct: 448 RRQ 450
>gi|357166758|ref|XP_003580836.1| PREDICTED: general transcription factor IIH subunit 4-like
[Brachypodium distachyon]
Length = 451
Score = 249 bits (635), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 121/183 (66%), Positives = 138/183 (75%), Gaps = 19/183 (10%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK+SWFIPT +A N G VVVETNF++YAYS S+LHCEILRLFS++E
Sbjct: 268 QGRKDSWFIPTKLATNLSSSLSDSSASKEGIVVVETNFRLYAYSASRLHCEILRLFSRVE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVGAITKESLY AF+NGIT EQ QNAHPRVADKIP VP+NV DQIRLWE
Sbjct: 328 YQLPNLIVGAITKESLYGAFDNGITAEQIISFLKQNAHPRVADKIPVVPENVTDQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+D NRV+ +H Y++FPS+D+FE CD ARD LLWEDSKKMRL+V E H MREFL
Sbjct: 388 TDRNRVDMVLSHVYEDFPSKDMFEQCCDLARDNGFLLWEDSKKMRLIVRVEFHQEMREFL 447
Query: 165 RGQ 167
R Q
Sbjct: 448 RRQ 450
>gi|326510735|dbj|BAJ91715.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 451
Score = 248 bits (633), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 121/183 (66%), Positives = 139/183 (75%), Gaps = 19/183 (10%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK+SWFIPT +A N G VVVETNF++YAYS SKLHCEILRLFS++E
Sbjct: 268 QGRKDSWFIPTKLATNLSSSLSDSAASKEGIVVVETNFRLYAYSASKLHCEILRLFSRVE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVGAITKESLY AF+NGIT EQ QNAHPRV DKIP VP+NV DQIRLWE
Sbjct: 328 YQLPNLIVGAITKESLYGAFDNGITAEQIISFLQQNAHPRVIDKIPIVPENVTDQIRLWE 387
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+D NRVE +H Y++FPS+D+FE CD+ARD LLWED+KKMRL+V+ E H MREFL
Sbjct: 388 NDRNRVEMILSHVYEDFPSKDMFEQCCDHARDNGYLLWEDAKKMRLIVSGEFHQEMREFL 447
Query: 165 RGQ 167
R Q
Sbjct: 448 RRQ 450
>gi|357123176|ref|XP_003563288.1| PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH
subunit 4-like [Brachypodium distachyon]
Length = 452
Score = 246 bits (627), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 120/183 (65%), Positives = 137/183 (74%), Gaps = 19/183 (10%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK+SWFIPT +A N G VVVETNF++YAYS S+LHCEILRLFS++E
Sbjct: 269 QGRKDSWFIPTKLATNLSSSLSDSSASKEGIVVVETNFRLYAYSASRLHCEILRLFSRVE 328
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWE 104
YQLPNLIVGAITKESLY AF+NGIT EQ QNAHP VADKIP VP+NV DQIRLWE
Sbjct: 329 YQLPNLIVGAITKESLYGAFDNGITAEQIISFLKQNAHPXVADKIPVVPENVTDQIRLWE 388
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+D NRV+ +H Y++FPS+D+FE CD ARD LLWEDSKKMRL+V E H MREFL
Sbjct: 389 TDRNRVDMVLSHVYEDFPSKDLFEQCCDLARDNGFLLWEDSKKMRLIVRVEFHQEMREFL 448
Query: 165 RGQ 167
R Q
Sbjct: 449 RRQ 451
>gi|90399251|emb|CAJ86205.1| B0213E10.4 [Oryza sativa Indica Group]
Length = 470
Score = 244 bits (623), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 112/152 (73%), Positives = 129/152 (84%), Gaps = 6/152 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GFVVVETNF+MYAYSTS+LHCEILRLFS++EYQLPNLIVG+ITKESLY AFENGIT EQ
Sbjct: 316 GFVVVETNFRMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQI 375
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
QNAHPRVADKIP+VP+NV DQIRLWE+D NRV+ T +H Y++FPS+D+F+ CDY
Sbjct: 376 ISFLQQNAHPRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDY 435
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
ARD LLWED+KKMRL+V E H MREFLR
Sbjct: 436 ARDHGCLLWEDAKKMRLIVRVEFHSEMREFLR 467
>gi|115461368|ref|NP_001054284.1| Os04g0680000 [Oryza sativa Japonica Group]
gi|113565855|dbj|BAF16198.1| Os04g0680000 [Oryza sativa Japonica Group]
Length = 444
Score = 235 bits (600), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 107/150 (71%), Positives = 126/150 (84%), Gaps = 6/150 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GFVVVETNF+MYAYSTS+LHCEILRLFS++EYQLPNLIVG+ITKESLY AFENGIT EQ
Sbjct: 288 GFVVVETNFRMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQI 347
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
QNAHPRVADKIP+VP+NV DQIRLWE+D NRV+ T +H Y++FPS+D+F+ CDY
Sbjct: 348 ISFLQQNAHPRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDY 407
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
ARD LLWED+KKMRL+V E H M ++
Sbjct: 408 ARDHGCLLWEDAKKMRLIVRVEFHSEMHDY 437
>gi|167997877|ref|XP_001751645.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697626|gb|EDQ83962.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 462
Score = 192 bits (489), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 93/186 (50%), Positives = 129/186 (69%), Gaps = 21/186 (11%)
Query: 4 QSRKESWFIPT-IADNC--------------GFVVVETNFKMYAYSTSKLHCEILRLFSK 48
Q KESW+IPT +A N GFVVVETNFK+YAY++SKL EILR F++
Sbjct: 269 QGMKESWYIPTKLASNLSASLSESTDWQSSEGFVVVETNFKVYAYTSSKLQTEILRCFTR 328
Query: 49 IEYQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRL 102
+EYQLPNL+V +TKES+ A +GI+ EQ ++AHP VA KIP VP+ V DQ+RL
Sbjct: 329 LEYQLPNLVVATLTKESVNKALGSGISAEQIISFLRKHAHPHVAQKIPVVPETVSDQLRL 388
Query: 103 WESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMRE 162
WE+D NRV+ PA++YD+FP+ ++EA +ARD GLL+ED+ RL+V +++H MR+
Sbjct: 389 WETDRNRVQFEPAYFYDDFPTMAIYEAVVAHARDLGGLLFEDASAKRLIVRSDLHEDMRQ 448
Query: 163 FLRGQN 168
++R Q+
Sbjct: 449 YIRKQS 454
>gi|296084734|emb|CBI25875.3| unnamed protein product [Vitis vinifera]
Length = 258
Score = 186 bits (472), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 105/205 (51%), Positives = 126/205 (61%), Gaps = 43/205 (20%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RKESWFIPT +A N GFVVVETNF++YAYS+SKLHCEILRLFS+
Sbjct: 51 QGRKESWFIPTKLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSSKLHCEILRLFSRYA 110
Query: 51 YQLPNLIV--------GAITKESL--YNA------------FENGITTEQQNAHPRV--- 85
L + ++ G + SL Y F + ++T H +
Sbjct: 111 SMLFDCLLCICYSSQKGTLLDVSLIMYECLRTFSLGKIIFWFFSSVSTVATPCHGYLVIF 170
Query: 86 -----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGL 140
A+KIP + QIRLWE+DLNRVET P+H YDEFPSRDVFEAACD+AR+ GL
Sbjct: 171 TISCLAEKIPCLQFTNMSQIRLWETDLNRVETMPSHLYDEFPSRDVFEAACDFAREYGGL 230
Query: 141 LWEDSKKMRLVVNAEIHMHMREFLR 165
LWEDSKKMRLVV AEIH+HMRE+LR
Sbjct: 231 LWEDSKKMRLVVKAEIHLHMREYLR 255
>gi|302806060|ref|XP_002984780.1| hypothetical protein SELMODRAFT_121159 [Selaginella moellendorffii]
gi|302808297|ref|XP_002985843.1| hypothetical protein SELMODRAFT_123271 [Selaginella moellendorffii]
gi|300146350|gb|EFJ13020.1| hypothetical protein SELMODRAFT_123271 [Selaginella moellendorffii]
gi|300147366|gb|EFJ14030.1| hypothetical protein SELMODRAFT_121159 [Selaginella moellendorffii]
Length = 459
Score = 182 bits (462), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 91/183 (49%), Positives = 124/183 (67%), Gaps = 20/183 (10%)
Query: 7 KESWFIPT-IADNC-------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQ 52
K+ WFIPT +A GF++VETNF++YAY++SKLH E L +F + EY
Sbjct: 275 KDRWFIPTRLATGLSASLSESSAWQTEGFIMVETNFRVYAYTSSKLHIETLHVFVRTEYV 334
Query: 53 LPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESD 106
LPN++VG+ITKES+ AF +GI+ +Q Q+AHP VA K+PSVP+ VCDQIRLWESD
Sbjct: 335 LPNILVGSITKESVNGAFASGISADQIIKFLQQHAHPFVAQKVPSVPETVCDQIRLWESD 394
Query: 107 LNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRG 166
RV+ PA+ Y+ FPS V+E+ +ARD++GLLWED+ + +VV E H +R FL+
Sbjct: 395 RVRVQYLPAYCYEGFPSTSVYESVVAHARDRNGLLWEDANRKMIVVGGEHHEAIRAFLQN 454
Query: 167 QNK 169
NK
Sbjct: 455 INK 457
>gi|359497087|ref|XP_002267711.2| PREDICTED: general transcription factor IIH subunit 4-like [Vitis
vinifera]
Length = 238
Score = 176 bits (445), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 94/175 (53%), Positives = 109/175 (62%), Gaps = 50/175 (28%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RKESWFIPT +A N GFVVVETNF++YAYS+SKLHCEILRLFS+
Sbjct: 98 QGRKESWFIPTKLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSSKLHCEILRLFSR-- 155
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQQNAHPRVADKIPSVPKNVCDQIRLWESDLNRV 110
+ ++T H + IRLWE+DLNRV
Sbjct: 156 ---------------------SSVSTVATPCHGYL--------------IRLWETDLNRV 180
Query: 111 ETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
ET P+H YDEFPSRDVFEAACD+AR+ GLLWEDSKKMRLVV AEIH+HMRE+LR
Sbjct: 181 ETMPSHLYDEFPSRDVFEAACDFAREYGGLLWEDSKKMRLVVKAEIHLHMREYLR 235
>gi|414584750|tpg|DAA35321.1| TPA: hypothetical protein ZEAMMB73_386928 [Zea mays]
Length = 386
Score = 163 bits (412), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 80/116 (68%), Positives = 91/116 (78%), Gaps = 19/116 (16%)
Query: 4 QSRKESWFIPT-IADNC------------GFVVVETNFKMYAYSTSKLHCEILRLFSKIE 50
Q RK+SWFIPT +A N GFVVVETNF+MYAYSTSKLHCEILRLF+++E
Sbjct: 268 QGRKDSWFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSKLHCEILRLFARVE 327
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQI 100
YQLPNLIVGA+TKES+Y AFENGIT EQ QNAHPRVADKIP+VP+NV DQ+
Sbjct: 328 YQLPNLIVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIPAVPENVTDQV 383
>gi|384246869|gb|EIE20357.1| Tfb2-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 451
Score = 138 bits (347), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 70/154 (45%), Positives = 96/154 (62%), Gaps = 8/154 (5%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GFVVVETN+++YAY++S L +LRLF++ E LPNL VG +T+ES+ A G++ +Q
Sbjct: 295 GFVVVETNYRVYAYTSSLLQTALLRLFTRCECILPNLFVGVLTRESVTGALACGLSADQI 354
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
Q+AHP VA + P VP+ V DQ+RLW++D RV A YD+FPS VF+ +
Sbjct: 355 VLYLRQHAHPHVASRTPVVPEVVADQVRLWQADTMRVRHNRAVLYDDFPSAQVFQLSAQK 414
Query: 134 ARDQSGLLWEDSKK--MRLVVNAEIHMHMREFLR 165
AR LWED K RL V H MRE+++
Sbjct: 415 ARTLGVWLWEDPKAGMGRLAVQEAGHDAMREYIK 448
>gi|290987279|ref|XP_002676350.1| predicted protein [Naegleria gruberi]
gi|284089952|gb|EFC43606.1| predicted protein [Naegleria gruberi]
Length = 532
Score = 137 bits (345), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 94/156 (60%), Gaps = 6/156 (3%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
N G+++VETN+++YAY+ S L +L LF EY+LPN++VG IT+ ++ A +NGI+
Sbjct: 374 NNGYIIVETNYRVYAYTNSPLQIALLSLFIFPEYRLPNMVVGLITRSTIREALKNGISAH 433
Query: 78 Q------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAAC 131
Q NAHP++ K P +P V DQI LWE + NRV T + YD+F + A
Sbjct: 434 QILQFLRLNAHPQMRLKKPVIPDTVSDQILLWEKERNRVIKTTSVVYDKFTNVSELNATV 493
Query: 132 DYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
DYAR Q LLW +KM +V E H M+EF+ Q
Sbjct: 494 DYARRQGALLWHSEEKMMMVCKREFHGLMKEFISSQ 529
>gi|145351329|ref|XP_001420034.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580267|gb|ABO98327.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 495
Score = 131 bits (330), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 65/149 (43%), Positives = 88/149 (59%), Gaps = 7/149 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G ++VETN+++YAY++S + EILRLF++ +Y+LPNL VG +T+ES+ NA G+ EQ
Sbjct: 334 GHIIVETNYRVYAYTSSAVEMEILRLFTRADYRLPNLYVGMLTRESVQNALRAGVDAEQI 393
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVF-EAACD 132
+AH +V K PSVP VCDQIRLW D+ R+E Y +FP F A
Sbjct: 394 VGYIRAHAHKQVRRKKPSVPSTVCDQIRLWARDMERMEAEECVLYCDFPQAGTFYGAVVS 453
Query: 133 YARDQSGLLWEDSKKMRLVVNAEIHMHMR 161
A + LLW D RL V A H M+
Sbjct: 454 EAEKRGALLWRDDASRRLTVRASAHDEMK 482
>gi|440791255|gb|ELR12502.1| TFIIH basal transcription factor complex subunit 2, putative
[Acanthamoeba castellanii str. Neff]
Length = 304
Score = 130 bits (326), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 100/155 (64%), Gaps = 6/155 (3%)
Query: 17 DNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
D G+++V T++++YAY++S + +L LF++IEYQLPN+++G + +E++ A + GI+
Sbjct: 137 DQEGWIIVGTDYRIYAYTSSPVKLLLLSLFTQIEYQLPNMVMGILLRENIRQAVQVGISA 196
Query: 77 EQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
Q NAHP++ P +P+++ DQ+RLWE++ R+ +P ++YD+F S F+ A
Sbjct: 197 NQILQFLETNAHPQMKQNTPIIPESIADQLRLWEAEDRRLSLSPGYFYDDFASLAAFKKA 256
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
YARD LL+ D+ K L V + H +R +++
Sbjct: 257 EKYARDVGALLYSDAAKRFLFVTEQGHQLLRRYVK 291
>gi|303272589|ref|XP_003055656.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463630|gb|EEH60908.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 476
Score = 129 bits (324), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 62/152 (40%), Positives = 96/152 (63%), Gaps = 8/152 (5%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
++VETNF++YAY++S + EILRLF++ +Y+LPNL VG +T+E++ A GI+ EQ
Sbjct: 318 IIVETNFRVYAYTSSAVEVEILRLFTRPDYKLPNLYVGMMTREAVVTALRGGISAEQIVS 377
Query: 79 ---QNAHPRV-ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPS-RDVFEAACDY 133
++AHP+ P++P VCDQIRLW D NRV+ TP Y +FP+ +FE +
Sbjct: 378 YLRKHAHPQARKTPGPAIPATVCDQIRLWSKDENRVKYTPCVLYCDFPTGTGMFEKVAEI 437
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
A+++ LW D ++L V E H M++ +
Sbjct: 438 AKERGLYLWGDPVGLKLAVREEGHESMKDVFK 469
>gi|384484453|gb|EIE76633.1| hypothetical protein RO3G_01337 [Rhizopus delemar RA 99-880]
Length = 284
Score = 129 bits (323), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 97/152 (63%), Gaps = 6/152 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+++ETN+K+YAY+ S L +L LF +++ + N++ G IT++S+ NA GIT EQ
Sbjct: 127 GFIILETNYKLYAYTDSPLQIAVLNLFVQLQSRFRNMVTGVITRDSIRNALMKGITAEQI 186
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
+AHP++ + P +P V DQIRLWE + NR++ TP++ Y EF + F+AA Y
Sbjct: 187 IYYLQSHAHPQMRKETPVLPLTVVDQIRLWEMERNRLKPTPSYLYHEFNVQADFDAAEKY 246
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
ARD LLW +++K + + H +++ F++
Sbjct: 247 ARDLGVLLWSNNQKRTMAITEAGHENVKGFVK 278
>gi|403180193|ref|XP_003338499.2| hypothetical protein PGTG_19983 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375166015|gb|EFP94080.2| hypothetical protein PGTG_19983 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 679
Score = 128 bits (322), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 95/152 (62%), Gaps = 6/152 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+V+ETN+++YAY+++ L +L LF + Y+ PNL+VGA+T+ES+ +A NGIT +Q
Sbjct: 147 GFLVLETNYRIYAYTSNPLQIAVLNLFLSLRYRFPNLVVGAVTRESIKSALSNGITADQV 206
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
+AHP++ P +P V DQIRLWE + NR+ + Y++F S +++ Y
Sbjct: 207 IMYLHTHAHPQMRKLEPLLPPTVVDQIRLWELEKNRIRAQEGYLYEDFKSAAEYDSVIQY 266
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+R +LWE + +L V + H+ +REF R
Sbjct: 267 SRKLGIVLWEHAGLRKLFVGYDGHLTLREFFR 298
>gi|403163300|ref|XP_003323393.2| hypothetical protein PGTG_04930 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163998|gb|EFP78974.2| hypothetical protein PGTG_04930 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 887
Score = 128 bits (322), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 95/152 (62%), Gaps = 6/152 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+V+ETN+++YAY+++ L +L LF + Y+ PNL+VGA+T+ES+ +A NGIT +Q
Sbjct: 352 GFLVLETNYRIYAYTSNPLQIAVLNLFLSLRYRFPNLVVGAVTRESIKSALSNGITADQV 411
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
+AHP++ P +P V DQIRLWE + NR+ + Y++F S +++ Y
Sbjct: 412 IMYLHTHAHPQMRKLEPLLPPTVVDQIRLWELEKNRIRAQEGYLYEDFKSAAEYDSVIQY 471
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+R +LWE + +L V + H+ +REF R
Sbjct: 472 SRKLGIVLWEHAGLRKLFVGYDGHLTLREFFR 503
>gi|328873270|gb|EGG21637.1| general transcription factor IIH [Dictyostelium fasciculatum]
Length = 453
Score = 127 bits (318), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/160 (40%), Positives = 98/160 (61%), Gaps = 8/160 (5%)
Query: 12 IPTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 71
I T + G++++ETNF++YAY+ S L ++ LF K+ Y+LPNL VG +T+ES+ +A
Sbjct: 292 ISTSQKDQGYLILETNFRIYAYTQSSLQISLISLFVKMLYRLPNLAVGILTRESVRSALL 351
Query: 72 NGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRD 125
+GIT +Q QNAHP + I P+ V +QIRLWES+ NR+ A +D FP+ +
Sbjct: 352 HGITADQIVDFVKQNAHPNMV--ISGYPEVVFEQIRLWESERNRITYKKAVLFDSFPNAE 409
Query: 126 VFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
F +A+DQ L+W D K LVVN E +R +++
Sbjct: 410 SFSKTVTFAKDQYFLVWFDETKKMLVVNDEGSEPIRNYIK 449
>gi|390370355|ref|XP_783140.3| PREDICTED: general transcription factor IIH subunit 4-like, partial
[Strongylocentrotus purpuratus]
Length = 180
Score = 125 bits (314), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 63/150 (42%), Positives = 92/150 (61%), Gaps = 6/150 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETNF++YAY+ S L EIL LF + Y+ PNL V A+T+ES+ A NGIT EQ
Sbjct: 30 GFIVVETNFRIYAYTESDLQVEILGLFCSMMYRFPNLSVAALTRESVQLAISNGITAEQI 89
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
+AHP + K P VP + DQ+RLWE + +R+ T Y+EF S FE DY
Sbjct: 90 LSFLRTHAHPNMRLKTPIVPPTISDQVRLWELERDRLSFTQGIIYNEFLSLHDFEVLRDY 149
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
A+D L+W+ + + ++V+ H ++++
Sbjct: 150 AKDLGVLIWDSTARRIMIVSPAGHDSVKKY 179
>gi|72074363|ref|XP_796111.1| PREDICTED: general transcription factor IIH subunit 4
[Strongylocentrotus purpuratus]
Length = 469
Score = 125 bits (314), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 63/150 (42%), Positives = 92/150 (61%), Gaps = 6/150 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETNF++YAY+ S L EIL LF + Y+ PNL V A+T+ES+ A NGIT EQ
Sbjct: 310 GFIVVETNFRVYAYTESDLQVEILGLFCSMMYRFPNLSVAALTRESVQLAISNGITAEQI 369
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
+AHP + K P VP + DQ+RLWE + +R+ T Y+EF S FE DY
Sbjct: 370 LSFLRTHAHPNMRLKTPIVPPTISDQVRLWELERDRLSFTQGIIYNEFLSLHDFEVLRDY 429
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
A+D L+W+ + + ++V+ H ++++
Sbjct: 430 AKDLGVLIWDSTARRIMIVSPAGHDSVKKY 459
>gi|307106040|gb|EFN54287.1| hypothetical protein CHLNCDRAFT_25123 [Chlorella variabilis]
Length = 458
Score = 124 bits (311), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 63/152 (41%), Positives = 94/152 (61%), Gaps = 9/152 (5%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G+V+VE+NF++YAY+TS + +LR+F + + LPNL VG IT+ES NA + GI +Q
Sbjct: 302 GYVIVESNFRVYAYTTSAVQVAVLRVFVRCDALLPNLFVGTITRESATNALDTGIAADQV 361
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
Q+AHPR A K P+V V DQIRLW +L R++ A YD+F S++++ A +
Sbjct: 362 VAFLRQHAHPRAAAKTPTV---VTDQIRLWAQELKRLQEKNATLYDKFESKELYVGAVAH 418
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
AR + LL+ + +LVV + H M L+
Sbjct: 419 ARQLNALLYSCEDRRQLVVESAFHGLMVGHLK 450
>gi|330918142|ref|XP_003298106.1| hypothetical protein PTT_08708 [Pyrenophora teres f. teres 0-1]
gi|311328891|gb|EFQ93800.1| hypothetical protein PTT_08708 [Pyrenophora teres f. teres 0-1]
Length = 482
Score = 124 bits (311), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 56/138 (40%), Positives = 87/138 (63%), Gaps = 6/138 (4%)
Query: 13 PTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 72
P+ A N GF+++ETN+++YAY+ S + IL LF+K++++ PNL+ G +TKES++ A ++
Sbjct: 314 PSAAANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQS 373
Query: 73 GITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDV 126
GIT+ Q AHP++ +P +P V DQIRLWE + RVETTP + EF S
Sbjct: 374 GITSAQIISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTPGYLMREFSSDTE 433
Query: 127 FEAACDYARDQSGLLWED 144
+ YA L+W++
Sbjct: 434 YRDVMGYASALGVLVWQN 451
>gi|440800489|gb|ELR21525.1| transcription factor tfb2 subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 465
Score = 124 bits (310), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 54/155 (34%), Positives = 99/155 (63%), Gaps = 6/155 (3%)
Query: 17 DNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
D G+++V T++++YAY++S + +L LF++IEYQLPN+++G + +E++ A + GI+
Sbjct: 296 DQEGWIIVGTDYRIYAYTSSPVKLLLLSLFTQIEYQLPNMVMGILLRENIRQAVQVGISA 355
Query: 77 EQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
Q NAHP++ P +P+++ DQ+RLWE++ R+ + ++YD+F S F+ A
Sbjct: 356 NQILQFLETNAHPQMKQNTPIIPESIADQLRLWEAEDRRLSLSSGYFYDDFASLAAFKKA 415
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
YARD L++ D+ K L V+ H +R +++
Sbjct: 416 EKYARDVGALIYSDATKRFLFVSEPGHQLLRRYVK 450
>gi|308452336|ref|XP_003089005.1| hypothetical protein CRE_21072 [Caenorhabditis remanei]
gi|308243592|gb|EFO87544.1| hypothetical protein CRE_21072 [Caenorhabditis remanei]
Length = 468
Score = 123 bits (309), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 98/155 (63%), Gaps = 9/155 (5%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G V+VETNF++YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GIT Q
Sbjct: 304 GKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQI 363
Query: 79 -----QNAHPR---VADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
NAHP+ + I +P V DQIRLWE + R++ A+ Y F S D F+
Sbjct: 364 ISFLRANAHPQCIATSGAINCLPITVADQIRLWEDERRRMDLKDAYIYSHFESEDEFQGV 423
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
CDYA+++ LLW ++++ ++VN E H ++R++ +
Sbjct: 424 CDYAQERGILLWANAQQKLVIVNEEGHEYVRQWYK 458
>gi|242021693|ref|XP_002431278.1| TFIIH basal transcription factor complex p52 subunit, putative
[Pediculus humanus corporis]
gi|212516535|gb|EEB18540.1| TFIIH basal transcription factor complex p52 subunit, putative
[Pediculus humanus corporis]
Length = 467
Score = 123 bits (309), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/157 (40%), Positives = 100/157 (63%), Gaps = 7/157 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G++++ETN+++YAY+ S L +L LF ++ Y+ PNL VG IT++S+ AF++GIT EQ
Sbjct: 308 GYIIIETNYRVYAYTDSNLQVALLGLFCEMLYRFPNLSVGLITRDSVRQAFKSGITAEQI 367
Query: 79 -----QNAHPR-VADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACD 132
+AHPR +A P++P V DQI+LWE++LNR+ + Y +F S+ FEA D
Sbjct: 368 VGFLRLHAHPRMIAVGPPTLPPTVVDQIKLWENELNRLVYSDGVLYSQFLSQADFEALRD 427
Query: 133 YARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
A + L+WE+ KK +VV H +++F + +K
Sbjct: 428 RANELGVLVWENDKKRTMVVTKSGHDDVKKFWKSYSK 464
>gi|308460924|ref|XP_003092760.1| hypothetical protein CRE_20979 [Caenorhabditis remanei]
gi|308252519|gb|EFO96471.1| hypothetical protein CRE_20979 [Caenorhabditis remanei]
Length = 470
Score = 123 bits (308), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 98/155 (63%), Gaps = 9/155 (5%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G V+VETNF++YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GIT Q
Sbjct: 307 GKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQI 366
Query: 79 -----QNAHPR---VADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
NAHP+ + I +P V DQIRLWE + R++ A+ Y F S D F+
Sbjct: 367 ISFLRANAHPQCIATSGAINCLPITVADQIRLWEDERRRMDLKDAYIYSHFESEDEFQGV 426
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
CDYA+++ LLW ++++ ++VN E H ++R++ +
Sbjct: 427 CDYAQERGILLWANAQQKLVIVNEEGHEYVRQWYK 461
>gi|395332344|gb|EJF64723.1| transcription factor Tfb2 [Dichomitus squalens LYAD-421 SS1]
Length = 506
Score = 123 bits (308), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 59/151 (39%), Positives = 92/151 (60%), Gaps = 6/151 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+V+ETN+++YAY+ + L +L LF ++Y+ PNL+VG IT++S+ A NGIT +Q
Sbjct: 348 GFIVLETNYRLYAYTDNPLQIAVLNLFVTLKYRFPNLVVGQITRDSVKKALANGITADQI 407
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
+AHP++ P +P V DQIRLWE + NRV++ + Y F S +E DY
Sbjct: 408 ISYLITHAHPQMRKNKPLLPVTVQDQIRLWELERNRVKSQEGYLYTAFASHADYEWVLDY 467
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
A+ +LWE+ K + E HM++R ++
Sbjct: 468 AKKLGVVLWENPSKRCFFGSVEGHMNIRGYI 498
>gi|189205068|ref|XP_001938869.1| RNA polymerase II transcription factor B subunit 2 [Pyrenophora
tritici-repentis Pt-1C-BFP]
gi|187985968|gb|EDU51456.1| RNA polymerase II transcription factor B subunit 2 [Pyrenophora
tritici-repentis Pt-1C-BFP]
Length = 482
Score = 122 bits (306), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 55/138 (39%), Positives = 86/138 (62%), Gaps = 6/138 (4%)
Query: 13 PTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 72
P+ N GF+++ETN+++YAY+ S + IL LF+K++++ PNL+ G +TKES++ A ++
Sbjct: 314 PSATANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQS 373
Query: 73 GITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDV 126
GIT+ Q AHP++ +P +P V DQIRLWE + RVETTP + EF S
Sbjct: 374 GITSAQIISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTPGYLMREFSSDAE 433
Query: 127 FEAACDYARDQSGLLWED 144
+ YA L+W++
Sbjct: 434 YRDVMGYASALGVLVWQN 451
>gi|409077631|gb|EKM77996.1| hypothetical protein AGABI1DRAFT_114851 [Agaricus bisporus var.
burnettii JB137-S8]
gi|426198975|gb|EKV48900.1| hypothetical protein AGABI2DRAFT_191077 [Agaricus bisporus var.
bisporus H97]
Length = 467
Score = 121 bits (304), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 59/157 (37%), Positives = 98/157 (62%), Gaps = 8/157 (5%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+V+ETN+++YAY+ + L +L LF ++Y+ PNL+VG IT++S+ A NGIT +Q
Sbjct: 308 GFIVLETNYRIYAYTDNPLQTAVLNLFVSLKYRFPNLVVGMITRDSVRRALMNGITADQI 367
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
+AHP++ P +P V DQIRLWE + NR++++ + Y F S+ ++ +Y
Sbjct: 368 ISYLITHAHPQMRKNNPLLPVTVQDQIRLWELEKNRLKSSEGYLYTAFTSQADYDLVLNY 427
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFL--RGQN 168
A+ +LWE++ K + E H+ ++EF+ R QN
Sbjct: 428 AKQLDVVLWENAAKRCFFGSLEGHLRIKEFIERRTQN 464
>gi|390594544|gb|EIN03954.1| transcription factor Tfb2 [Punctularia strigosozonata HHB-11173
SS5]
Length = 467
Score = 121 bits (304), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/161 (37%), Positives = 97/161 (60%), Gaps = 8/161 (4%)
Query: 12 IPTIADNC--GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 69
+PT GF+V+ETN+++YAY+ + L +L LF+ + + PNL+VG IT+ES+ A
Sbjct: 299 LPTSGSGVSEGFIVLETNYRLYAYTDNPLQTAVLALFTTLRSRFPNLVVGQITRESVKRA 358
Query: 70 FENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPS 123
++GI+ EQ +AHP++ P +P V DQIRLWE + NR+++ Y EF S
Sbjct: 359 LQSGISAEQIISYLSTHAHPQMRKNNPLIPVTVQDQIRLWELEKNRLKSQEGFLYTEFAS 418
Query: 124 RDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+ +E +YAR+ + +LWE+ K + E H ++R F+
Sbjct: 419 QGDYEYVLNYARELNVVLWENPVKRCFFGSMEGHANIRGFI 459
>gi|412992641|emb|CCO18621.1| predicted protein [Bathycoccus prasinos]
Length = 542
Score = 121 bits (303), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 69/185 (37%), Positives = 102/185 (55%), Gaps = 30/185 (16%)
Query: 10 WFIPTI------------------ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEY 51
W++PT+ A G ++VETNF++YAY+ S+L E+LRLF++ +Y
Sbjct: 358 WYVPTVLSAGLSSVSTTSSAKSALARIDGHIIVETNFRVYAYTHSELETEVLRLFTRPDY 417
Query: 52 QLPNLIVGAITKESLYNAFENGITTE------QQNAHPRVA----DKIPSVPKNVCDQIR 101
+LPN VG IT++S+ +A GI+ + Q AHPR P+VP VCDQIR
Sbjct: 418 KLPNAYVGMITRDSILDAMRAGISPDQIVNYLQTRAHPRCTIGKRPNHPAVPPTVCDQIR 477
Query: 102 LWESDLNRVETTPAHYYDEFPSR-DVFEAACDYARDQSG-LLWEDSKKMRLVVNAEIHMH 159
LW DL RV+ Y +FP + + F+ A + AR+ +LW D K R V+A+ H
Sbjct: 478 LWARDLYRVKADDCVMYTDFPMQGNQFQDAVNNARNVGAQILWMDENKRRFAVDADSHER 537
Query: 160 MREFL 164
++ FL
Sbjct: 538 LKVFL 542
>gi|453232318|ref|NP_001263825.1| Protein Y73F8A.24, isoform d [Caenorhabditis elegans]
gi|412976516|emb|CCO25635.1| Protein Y73F8A.24, isoform d [Caenorhabditis elegans]
Length = 435
Score = 120 bits (302), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 95/155 (61%), Gaps = 9/155 (5%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G V+VETNF++YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GIT Q
Sbjct: 262 GKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQI 321
Query: 79 -----QNAHPR---VADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
NAHP+ + + +P V DQIRLWE + R+ A+ Y F S D F
Sbjct: 322 ISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGV 381
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
C+YAR Q+ LLW D+++ ++VN + H +R++ +
Sbjct: 382 CEYARQQNILLWSDNQQKLVIVNEDGHELVRQWYK 416
>gi|343428936|emb|CBQ72481.1| probable TFB2-TFIIH subunit (transcription/repair factor)
[Sporisorium reilianum SRZ2]
Length = 493
Score = 120 bits (302), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 59/166 (35%), Positives = 101/166 (60%), Gaps = 12/166 (7%)
Query: 12 IPTIADNC------GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKES 65
+P ++ N G++++ETN+++YAY++++L +L LF I+ + PNL+VG+IT++S
Sbjct: 324 VPLLSSNGTEQEERGYIILETNYRLYAYTSNRLRVAVLSLFVTIKARFPNLVVGSITRDS 383
Query: 66 LYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYD 119
+ +A NGIT EQ +AH ++ P +P V DQIRLWE + NRV+ +
Sbjct: 384 VKSALANGITAEQIITYLTHHAHVQMHRNDPLLPVTVSDQIRLWEREKNRVQQNLGSLFT 443
Query: 120 EFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+F S+ FE +YA+ L+WED K R V+ ++ +R+++R
Sbjct: 444 DFTSQFDFEEVRNYAKQLGVLVWEDEGKRRFFVDEAGNVPVRDYIR 489
>gi|432883135|ref|XP_004074222.1| PREDICTED: general transcription factor IIH subunit 4-like [Oryzias
latipes]
Length = 460
Score = 120 bits (302), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 59/161 (36%), Positives = 98/161 (60%), Gaps = 7/161 (4%)
Query: 13 PTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 72
P D GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A N
Sbjct: 297 PGTGD-AGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNVVVAHLTRESVQQAIAN 355
Query: 73 GITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDV 126
GIT +Q AHP + + P++P + DQIRLWE + +R++ T Y++F S+
Sbjct: 356 GITAQQIIHFLRTRAHPVMLKQSPALPPTITDQIRLWELERDRLQFTEGVLYNQFLSQAD 415
Query: 127 FEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
FE D A+ L+W+D+ +VV + H +++F + Q
Sbjct: 416 FEVLRDRAQGLGCLVWQDASHRAMVVTPQGHSEVKKFWKRQ 456
>gi|348683175|gb|EGZ22990.1| hypothetical protein PHYSODRAFT_479459 [Phytophthora sojae]
Length = 490
Score = 120 bits (302), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 93/153 (60%), Gaps = 6/153 (3%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
VVVETNFK+YAY+TS LH +L +F I +LPNL +G IT+ESL +A +GI+ +Q
Sbjct: 333 LVVVETNFKIYAYTTSTLHVAMLSVFVDIVARLPNLAIGFITRESLRSALIHGISAQQIY 392
Query: 79 ----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYA 134
++AHP++ P +P+N+ DQI LWE + NRV+ +D F +++ +E+ DYA
Sbjct: 393 DFLMKHAHPKMRRNSPVIPENIADQIYLWERERNRVQFMEGILFDGFNTKEDYESVRDYA 452
Query: 135 RDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
+D L W D RL + +R +++ Q
Sbjct: 453 KDLKVLTWSDPIHFRLSITTAGIDDVRHYIQSQ 485
>gi|392901992|ref|NP_001255864.1| Protein Y73F8A.24, isoform b [Caenorhabditis elegans]
gi|290457493|emb|CBK19514.1| Protein Y73F8A.24, isoform b [Caenorhabditis elegans]
Length = 431
Score = 120 bits (302), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 95/155 (61%), Gaps = 9/155 (5%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G V+VETNF++YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GIT Q
Sbjct: 258 GKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQI 317
Query: 79 -----QNAHPR---VADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
NAHP+ + + +P V DQIRLWE + R+ A+ Y F S D F
Sbjct: 318 ISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGV 377
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
C+YAR Q+ LLW D+++ ++VN + H +R++ +
Sbjct: 378 CEYARQQNILLWSDNQQKLVIVNEDGHELVRQWYK 412
>gi|392901990|ref|NP_001255863.1| Protein Y73F8A.24, isoform a [Caenorhabditis elegans]
gi|26985915|emb|CAB70230.2| Protein Y73F8A.24, isoform a [Caenorhabditis elegans]
Length = 481
Score = 120 bits (301), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 95/155 (61%), Gaps = 9/155 (5%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G V+VETNF++YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GIT Q
Sbjct: 308 GKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQI 367
Query: 79 -----QNAHPR---VADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
NAHP+ + + +P V DQIRLWE + R+ A+ Y F S D F
Sbjct: 368 ISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGV 427
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
C+YAR Q+ LLW D+++ ++VN + H +R++ +
Sbjct: 428 CEYARQQNILLWSDNQQKLVIVNEDGHELVRQWYK 462
>gi|393246879|gb|EJD54387.1| transcription factor Tfb2 [Auricularia delicata TFB-10046 SS5]
Length = 480
Score = 120 bits (301), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 92/155 (59%), Gaps = 6/155 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A + GF+++ETN+++YAY+ + L +L LF + + PNL+VG +T++S+ A NGIT
Sbjct: 314 ATSGGFIILETNYRLYAYTDNPLQIAVLNLFVSFKSRFPNLVVGMVTRDSVKKALANGIT 373
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
EQ +AHP++ P +P V DQ+RLWE + NRV+ Y +F S+ FE
Sbjct: 374 AEQIITYLSAHAHPQMRKNNPLLPVTVQDQVRLWELEKNRVKAEEGFLYMDFTSQADFEL 433
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
DYAR ++WE+ ++ A+ H ++R F+
Sbjct: 434 VLDYARKLGVVIWENGRQRMFFGKADGHNNIRTFI 468
>gi|453232316|ref|NP_001263824.1| Protein Y73F8A.24, isoform c [Caenorhabditis elegans]
gi|412976517|emb|CCO25636.1| Protein Y73F8A.24, isoform c [Caenorhabditis elegans]
Length = 485
Score = 120 bits (301), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 95/155 (61%), Gaps = 9/155 (5%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G V+VETNF++YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GIT Q
Sbjct: 312 GKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQI 371
Query: 79 -----QNAHPR---VADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
NAHP+ + + +P V DQIRLWE + R+ A+ Y F S D F
Sbjct: 372 ISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGV 431
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
C+YAR Q+ LLW D+++ ++VN + H +R++ +
Sbjct: 432 CEYARQQNILLWSDNQQKLVIVNEDGHELVRQWYK 466
>gi|358057347|dbj|GAA96696.1| hypothetical protein E5Q_03367 [Mixia osmundae IAM 14324]
Length = 467
Score = 120 bits (300), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 94/159 (59%), Gaps = 7/159 (4%)
Query: 14 TIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 73
T + GF+++ETN+++YAY+ + L +L LF ++ + P L+VG IT+ES+ NG
Sbjct: 302 TAEEEQGFLILETNYRVYAYTQNPLQIAVLNLFVALKSRFPGLVVGMITRESIKAGLANG 361
Query: 74 ITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVF 127
I ++Q +AHP++ + P +P V DQI+LWE + NRV+T P YD+F S+ +
Sbjct: 362 IKSDQIIAFLTAHAHPQMRKQEPLLPPTVVDQIKLWEREKNRVKTEPCFLYDDFRSQADY 421
Query: 128 EAACDYARDQSGLLW-EDSKKMRLVVNAEIHMHMREFLR 165
+ CDYA+ +LW + R + H+ +R F++
Sbjct: 422 DLVCDYAKQIGAVLWLGEPGSRRFATTEDGHVQVRGFIQ 460
>gi|213402795|ref|XP_002172170.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces japonicus yFS275]
gi|212000217|gb|EEB05877.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces japonicus yFS275]
Length = 444
Score = 120 bits (300), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 58/142 (40%), Positives = 89/142 (62%), Gaps = 7/142 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF++VETN+++YAY+ S L IL LF+ + + NLIVG IT++S+ A ++GIT EQ
Sbjct: 290 GFIIVETNYRLYAYTDSPLQIAILGLFTNLRARFSNLIVGVITRDSIRRALQSGITAEQI 349
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
+AHP++ +IP +P + DQI LWE + NR+ TP + +F + FE A Y
Sbjct: 350 ITYLTTHAHPQMRKEIPLLPPTLVDQIYLWELERNRLRATPGILFRDFLTDKDFEKAVQY 409
Query: 134 ARDQSGLLWEDS-KKMRLVVNA 154
A++ L+W+ S K+M + NA
Sbjct: 410 AKELGVLVWDSSLKRMFFITNA 431
>gi|341886092|gb|EGT42027.1| hypothetical protein CAEBREN_00374 [Caenorhabditis brenneri]
Length = 460
Score = 120 bits (300), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 97/158 (61%), Gaps = 9/158 (5%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G V+VETNF++YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GIT Q
Sbjct: 295 GKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQI 354
Query: 79 -----QNAHPR---VADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
NAHP+ + + +P V DQIRLWE + R++ ++ Y F S + F
Sbjct: 355 ISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMDLKDSYIYSNFESDEEFNGV 414
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
CDYA+++ LLW D ++ ++VN E H ++R++ + Q
Sbjct: 415 CDYAKERKILLWADYQRKLVIVNEEGHEYVRQWYKQQK 452
>gi|325181847|emb|CCA16302.1| general transcription factor IIH subunit putative [Albugo laibachii
Nc14]
Length = 473
Score = 120 bits (300), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 60/151 (39%), Positives = 93/151 (61%), Gaps = 6/151 (3%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
+VETNFK+YAY+TS LH +L +F I +LPNL +G IT+ESL +A +GI+ +Q
Sbjct: 317 LTIVETNFKVYAYTTSTLHIAMLSVFVDIVARLPNLAIGFITRESLRSALVHGISAQQIY 376
Query: 79 ----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYA 134
++AHPR+ P VP+N+ DQI LWE + NR++ +D F S++ FE +YA
Sbjct: 377 DFLLKHAHPRMKKNNPIVPENIADQIYLWEKERNRIKFLEGILFDGFNSKEEFEQVREYA 436
Query: 135 RDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+ Q L W D ++L +++ +R F++
Sbjct: 437 QKQRFLTWTDPIHLKLAISSNGIESVRNFIQ 467
>gi|41053973|ref|NP_956221.1| general transcription factor IIH subunit 4 [Danio rerio]
gi|39794736|gb|AAH64301.1| General transcription factor IIH, polypeptide 4 [Danio rerio]
Length = 466
Score = 120 bits (300), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 60/163 (36%), Positives = 95/163 (58%), Gaps = 7/163 (4%)
Query: 12 IPTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 71
IP D GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PNL+V +T+ES+ A
Sbjct: 302 IPGTGDT-GFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNLVVAQVTRESVQQAIS 360
Query: 72 NGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRD 125
NGIT +Q AHP + + P +P + DQIRLWE + +R++ T Y++F S+
Sbjct: 361 NGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELEKDRLQFTEGVLYNQFLSQA 420
Query: 126 VFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
FE D A+ L+W++ +VV H ++ F + Q
Sbjct: 421 DFEVLRDRAQGLGVLVWQNPAHRVMVVTPHGHSEVKRFWKRQK 463
>gi|325181222|emb|CCA15636.1| general transcription factor IIH subunit putative [Albugo laibachii
Nc14]
Length = 1033
Score = 120 bits (300), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 60/151 (39%), Positives = 93/151 (61%), Gaps = 6/151 (3%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
+VETNFK+YAY+TS LH +L +F I +LPNL +G IT+ESL +A +GI+ +Q
Sbjct: 364 LTIVETNFKVYAYTTSTLHIAMLSVFVDIVARLPNLAIGFITRESLRSALVHGISAQQIY 423
Query: 79 ----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYA 134
++AHPR+ P VP+N+ DQI LWE + NR++ +D F S++ FE +YA
Sbjct: 424 DFLLKHAHPRMKKNNPIVPENIADQIYLWEKERNRIKFLEGILFDGFNSKEEFEQVREYA 483
Query: 135 RDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+ Q L W D ++L +++ +R F++
Sbjct: 484 QKQRFLTWTDPIHLKLAISSNGIESVRNFIQ 514
>gi|393221241|gb|EJD06726.1| transcription factor Tfb2 [Fomitiporia mediterranea MF3/22]
Length = 464
Score = 119 bits (299), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 94/155 (60%), Gaps = 6/155 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+++ETN+++YAY+ + L +L LF ++Y+ PNL+VG++T++S+ A NGIT
Sbjct: 301 AQAQGFIILETNYRIYAYTDNPLQTAVLNLFVTLKYRFPNLVVGSLTRDSVRKALANGIT 360
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q +AHP++ P +P V DQIRLWE + NR+++ + Y F S+ +E
Sbjct: 361 ADQIIKYLTTHAHPQMRKNDPLIPVTVQDQIRLWELERNRLKSQEGYLYTAFASQADYEF 420
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
YA+ +LWE++ K + E H ++R F+
Sbjct: 421 VLKYAKQLGVVLWENAAKRCFFGSLEGHANIRGFI 455
>gi|260810774|ref|XP_002600123.1| hypothetical protein BRAFLDRAFT_57175 [Branchiostoma floridae]
gi|229285409|gb|EEN56135.1| hypothetical protein BRAFLDRAFT_57175 [Branchiostoma floridae]
Length = 457
Score = 119 bits (299), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 57/150 (38%), Positives = 95/150 (63%), Gaps = 6/150 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G++VVE+N+++YAY+ S+L ++ LFS+I Y+ PN++V +T++S+ A GIT +Q
Sbjct: 299 GYLVVESNYRIYAYTDSELQVALIGLFSEILYRFPNMVVANLTRDSVQEAVVRGITADQI 358
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
NAHP+ ++P VP + DQIRLWE + +R+ T Y++F S+ FE +Y
Sbjct: 359 LHFLRVNAHPKALHRVPIVPPTISDQIRLWEMERDRLTFTEGVLYNQFLSQPDFEMLRNY 418
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
A+D LLWE++ K +VV+ H ++ +
Sbjct: 419 AKDLGVLLWENNPKRLMVVSKAGHDDVKRY 448
>gi|451847848|gb|EMD61155.1| hypothetical protein COCSADRAFT_97372 [Cochliobolus sativus ND90Pr]
Length = 482
Score = 119 bits (299), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 55/138 (39%), Positives = 85/138 (61%), Gaps = 6/138 (4%)
Query: 13 PTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 72
P A N GF+++ETN+++YAY+ S + IL LF+K++++ PNL+ G +TKES++ A +
Sbjct: 314 PPTAANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQA 373
Query: 73 GITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDV 126
GIT+ Q AHP++ +P +P V DQIRLWE + RVETT + EF S
Sbjct: 374 GITSAQIISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTTGYLMREFGSDAE 433
Query: 127 FEAACDYARDQSGLLWED 144
+ +YA L+W++
Sbjct: 434 YRDVLNYASALGVLVWQN 451
>gi|451996931|gb|EMD89397.1| hypothetical protein COCHEDRAFT_1141584 [Cochliobolus
heterostrophus C5]
Length = 482
Score = 119 bits (299), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 55/138 (39%), Positives = 85/138 (61%), Gaps = 6/138 (4%)
Query: 13 PTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 72
P A N GF+++ETN+++YAY+ S + IL LF+K++++ PNL+ G +TKES++ A +
Sbjct: 314 PPTAANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQA 373
Query: 73 GITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDV 126
GIT+ Q AHP++ +P +P V DQIRLWE + RVETT + EF S
Sbjct: 374 GITSAQIISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTTGYLMREFGSDAE 433
Query: 127 FEAACDYARDQSGLLWED 144
+ +YA L+W++
Sbjct: 434 YRDVLNYASALGVLVWQN 451
>gi|342319113|gb|EGU11064.1| RNA polymerase II transcription factor B subunit 2 [Rhodotorula
glutinis ATCC 204091]
Length = 496
Score = 119 bits (298), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 92/156 (58%), Gaps = 10/156 (6%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+V+ETN+K+YAY+++ L +L LF+ ++ + N + G IT+ES+ NGIT Q
Sbjct: 322 GFIVLETNYKLYAYTSNPLQIAVLGLFAHLKTRFANFVTGHITRESIRRGLANGITANQI 381
Query: 79 -----QNAHPRVADKIPS----VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
AHP++ + S +P V DQIRLWE + R++TT + YDEF S +E
Sbjct: 382 ISYLASRAHPQMRAQAGSDDKLLPITVVDQIRLWEHERRRIQTTEGYLYDEFSSTHDYEL 441
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+YAR+ +L E K ++ V A+ H +REF++
Sbjct: 442 VVNYAREIGSVLLELPKARKVFVTADGHQQVREFIK 477
>gi|348542188|ref|XP_003458568.1| PREDICTED: general transcription factor IIH subunit 4 [Oreochromis
niloticus]
Length = 461
Score = 119 bits (297), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 57/156 (36%), Positives = 94/156 (60%), Gaps = 6/156 (3%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
+ GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A NGIT +
Sbjct: 302 DAGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNVVVAQLTRESVQQAIANGITAQ 361
Query: 78 Q------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAAC 131
Q AHP + + P +P + DQIRLWE + +R++ T Y++F S+ FE
Sbjct: 362 QIIHFLRTRAHPVMLTQTPVLPPTITDQIRLWELERDRLQFTEGVLYNQFLSQADFEVLR 421
Query: 132 DYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
D A+ L+W+D +VV + H ++ F + Q
Sbjct: 422 DRAQGLGCLVWQDVAHRVMVVTPQGHSEVKRFWKRQ 457
>gi|410911638|ref|XP_003969297.1| PREDICTED: general transcription factor IIH subunit 4-like
[Takifugu rubripes]
Length = 465
Score = 119 bits (297), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 57/159 (35%), Positives = 96/159 (60%), Gaps = 6/159 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
+ + GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A NGIT
Sbjct: 304 SGDSGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNVVVAQVTRESVQQAIANGIT 363
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q AHP + + P +P + DQIRLWE + +R++ T Y++F S+ FE
Sbjct: 364 AQQIIHFLRTRAHPVMLRQTPFLPPTITDQIRLWELERDRLQFTEGVLYNQFLSQTDFEV 423
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
D A+ L+W+D+ +VV H +++F + Q
Sbjct: 424 LRDRAKSLGCLVWQDAAHRVMVVTLWGHSEVKKFWKRQK 462
>gi|388581217|gb|EIM21527.1| transcription factor Tfb2 [Wallemia sebi CBS 633.66]
Length = 449
Score = 118 bits (296), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 6/155 (3%)
Query: 17 DNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
DN F+++ETN+++YAY+++ L IL LF ++ + PNL++G IT++S+ +A NGIT
Sbjct: 276 DNKRFIILETNYRLYAYTSNPLQISILNLFVTLKARYPNLVIGVITRDSIRSALSNGITA 335
Query: 77 EQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
EQ +AH ++ P +P V DQIRLWE + NR++ Y EF S+ +E
Sbjct: 336 EQIIGYLTSHAHTQMHRNNPLLPVTVSDQIRLWELEKNRLKADDGVLYAEFRSQPDYEIL 395
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+YA+ +LW + K V E H +REF+R
Sbjct: 396 LNYAKSYDCVLWSNDIKRMFFVTLEGHQIVREFVR 430
>gi|327266348|ref|XP_003217968.1| PREDICTED: general transcription factor IIH subunit 4-like [Anolis
carolinensis]
Length = 460
Score = 118 bits (296), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 58/157 (36%), Positives = 95/157 (60%), Gaps = 6/157 (3%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
N GF++VETN+++YAY+ S+L ++ LFS++ Y+ PNL+V +T+ES+ A NGIT +
Sbjct: 301 NQGFIIVETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQVTRESVQQAIANGITAD 360
Query: 78 Q------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAAC 131
Q AHP + + P +P + DQIRLWE + +R+ + Y++F S+ FE
Sbjct: 361 QIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLR 420
Query: 132 DYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
D+AR+ L++E+ K +VV H ++ F + Q
Sbjct: 421 DHARELGVLIFENPSKRLMVVTPAGHSDVKRFWKRQK 457
>gi|147899535|ref|NP_001085110.1| general transcription factor IIH, polypeptide 4, 52kDa [Xenopus
laevis]
gi|47939830|gb|AAH72322.1| MGC83106 protein [Xenopus laevis]
Length = 455
Score = 118 bits (296), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 96/156 (61%), Gaps = 6/156 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PNL+V +T+E++ A NGIT EQ
Sbjct: 298 GFIVVETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQVTRENVQQAIGNGITAEQI 357
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP + + P++P + DQIRLWE + +R+ + Y++F S+ FE +Y
Sbjct: 358 IHFLRTRAHPVMLQQNPALPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLRNY 417
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
ARD L++E+ K +VV H ++ F + Q +
Sbjct: 418 ARDLGVLVFENPAKRVMVVTPGGHSDVKRFWKRQKQ 453
>gi|403417620|emb|CCM04320.1| predicted protein [Fibroporia radiculosa]
Length = 472
Score = 118 bits (295), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 56/151 (37%), Positives = 92/151 (60%), Gaps = 6/151 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+V+ETN+++YAY+ + L +L LF ++ + PNL++GAIT+ES+ A NGIT +Q
Sbjct: 314 GFIVLETNYRIYAYTDNPLQIAVLNLFVTLKSRFPNLVIGAITRESVKKALANGITADQI 373
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
+AHP++ P +P V DQIRLWE + NRV++ + Y F S+ +E +Y
Sbjct: 374 ISYLTAHAHPQMHKNKPLLPVTVQDQIRLWELERNRVKSEEGYLYTSFASQADYEYVLNY 433
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
A+ +LWE+ + + + H ++R F+
Sbjct: 434 AKQLDVVLWENPSRRSFFGSLDGHANIRGFI 464
>gi|301098671|ref|XP_002898428.1| general transcription factor IIH subunit, putative [Phytophthora
infestans T30-4]
gi|262105199|gb|EEY63251.1| general transcription factor IIH subunit, putative [Phytophthora
infestans T30-4]
Length = 483
Score = 117 bits (294), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 92/153 (60%), Gaps = 6/153 (3%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
VVVETNFK+YAY++S LH +L +F I +LPNL +G IT+ESL +A +GI+ +Q
Sbjct: 326 LVVVETNFKIYAYTSSTLHVAMLSVFVDIVARLPNLAIGFITRESLRSALIHGISAQQIY 385
Query: 79 ----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYA 134
++AHP++ P +P+N+ DQI LWE + NRV+ +D F +++ +E+ DYA
Sbjct: 386 DFLMKHAHPKMRRNSPVIPENIADQIYLWERERNRVQFLEGILFDGFNTKEDYESVRDYA 445
Query: 135 RDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
+ L W D RL + +R F++ Q
Sbjct: 446 KGLMVLTWSDPIHFRLSIATAGIDEVRHFIQNQ 478
>gi|328766914|gb|EGF76966.1| hypothetical protein BATDEDRAFT_92242 [Batrachochytrium
dendrobatidis JAM81]
Length = 471
Score = 117 bits (294), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 55/158 (34%), Positives = 87/158 (55%), Gaps = 6/158 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
AD+ GF+++ETN+++YAY+ S L +L LF ++ + N+++G IT++S+ A GI+
Sbjct: 305 ADSDGFIIIETNYRVYAYTDSPLQIAVLSLFIEMRVRFANMVIGIITRDSVREALAKGIS 364
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
EQ +AHP + P +P + DQ+RLWE + NR+ + H Y F +
Sbjct: 365 AEQIIAYLTTHAHPEMKKGSPVLPTTIVDQVRLWEMERNRLRISRGHLYQMFSGEQEYRE 424
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
YA D LW K +VV+AE H H++ F Q
Sbjct: 425 ILKYATDLGYELWHSDSKRLVVVSAEGHEHIKVFFSKQ 462
>gi|71024427|ref|XP_762443.1| hypothetical protein UM06296.1 [Ustilago maydis 521]
gi|46097692|gb|EAK82925.1| hypothetical protein UM06296.1 [Ustilago maydis 521]
Length = 491
Score = 117 bits (293), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 58/166 (34%), Positives = 100/166 (60%), Gaps = 12/166 (7%)
Query: 12 IPTIADNC------GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKES 65
+P ++ N G++++ETN+++YAY+++ L +L LF I+ + PNL+VG+IT++S
Sbjct: 322 VPLLSSNGTEQEERGYIILETNYRLYAYTSNPLRVAVLSLFVTIKARFPNLVVGSITRDS 381
Query: 66 LYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYD 119
+ +A NGIT EQ +AH ++ P +P V DQIRLWE + NRV+ +
Sbjct: 382 VKSALANGITAEQIITYLTHHAHLQMHRNDPLLPVTVSDQIRLWEREKNRVQQNLGSLFT 441
Query: 120 EFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+F S+ FE +YA+ L+W+D +K R V+ + +R+++R
Sbjct: 442 DFTSQFDFEEVRNYAKQLGVLVWQDEEKRRFFVDEAGNEPVRDYIR 487
>gi|155371845|ref|NP_001094527.1| general transcription factor IIH subunit 4 [Bos taurus]
gi|148878127|gb|AAI46241.1| GTF2H4 protein [Bos taurus]
gi|296474219|tpg|DAA16334.1| TPA: general transcription factor IIH, polypeptide 4, 52kDa [Bos
taurus]
Length = 463
Score = 117 bits (293), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 58/159 (36%), Positives = 95/159 (59%), Gaps = 6/159 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT
Sbjct: 302 AHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 361
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE
Sbjct: 362 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
+AR+ L++E+S K +VV H ++ F + Q
Sbjct: 422 LLAHARELGVLMFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|388857860|emb|CCF48525.1| probable TFB2-TFIIH subunit (transcription/repair factor) [Ustilago
hordei]
Length = 492
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/166 (36%), Positives = 98/166 (59%), Gaps = 12/166 (7%)
Query: 12 IPTIADNC------GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKES 65
IP ++ N G++++ETN+++YAY+++ L +L LF I+ + PNL+VG+IT++S
Sbjct: 323 IPLLSSNGAEQEERGYIILETNYRLYAYTSNPLRVAVLSLFVTIKARFPNLVVGSITRDS 382
Query: 66 LYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYD 119
+ +A NGIT EQ +AH ++ P +P V DQIRLWE + NRV +
Sbjct: 383 VKSALANGITAEQIITYLTHHAHVQMHRNDPLLPVTVSDQIRLWEREKNRVVQNLGSLFT 442
Query: 120 EFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+F S+ FE +YA+ L+WED K R V+ + +R+++R
Sbjct: 443 DFTSQFDFEEVRNYAKQLGVLVWEDESKRRFFVDEAGNEPVRDYIR 488
>gi|338718559|ref|XP_003363845.1| PREDICTED: general transcription factor IIH subunit 4 [Equus
caballus]
Length = 463
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 58/159 (36%), Positives = 95/159 (59%), Gaps = 6/159 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT
Sbjct: 302 AHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 361
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE
Sbjct: 362 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
+AR+ L++E+S K +VV H ++ F + Q
Sbjct: 422 LLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|402224480|gb|EJU04542.1| transcription factor Tfb2 [Dacryopinax sp. DJM-731 SS1]
Length = 466
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 55/156 (35%), Positives = 97/156 (62%), Gaps = 6/156 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A + GF+++ETN+++YAY+ + L +L LF + + NL++G +T+ES+ A NGIT
Sbjct: 306 ASSEGFIILETNYRVYAYTENPLQIAVLNLFVALHSRFENLVIGRLTRESIKAALANGIT 365
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q +AHP + P +P V DQIRLW+ + NR+++ + Y++F S+ F+
Sbjct: 366 ADQIISYLTVHAHPMMHKNNPVLPVTVQDQIRLWQLEKNRLKSENGYLYEDFNSQGDFDL 425
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+YA+ +LWE+ +K ++ V + H ++REF+R
Sbjct: 426 VLNYAKQLDVVLWENREKRKMFVREDGHENVREFIR 461
>gi|302684027|ref|XP_003031694.1| hypothetical protein SCHCODRAFT_15746 [Schizophyllum commune H4-8]
gi|300105387|gb|EFI96791.1| hypothetical protein SCHCODRAFT_15746 [Schizophyllum commune H4-8]
Length = 476
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/163 (36%), Positives = 93/163 (57%), Gaps = 11/163 (6%)
Query: 13 PTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 72
PT GF+V+ETN+++YAY+ + L +L LF +Y+ PNL+VG+IT+ES+ A N
Sbjct: 307 PTSGPKEGFIVLETNYRVYAYTDNALQTAVLNLFITPKYRFPNLVVGSITRESVKRAMSN 366
Query: 73 GITTEQ------QNAHPRVADKI-----PSVPKNVCDQIRLWESDLNRVETTPAHYYDEF 121
GIT +Q +AHP++ + P +P V DQIRLWE + NR ++ Y EF
Sbjct: 367 GITADQIISYLITHAHPQMHKNVCLVANPLLPVTVQDQIRLWELERNRTKSQEGFLYTEF 426
Query: 122 PSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
S+ +E +YA+ +LWE+ K E H +++ F+
Sbjct: 427 ASQADYEYVLNYAKQLGVVLWENHAKRCFFAAQEGHANIKSFI 469
>gi|281337500|gb|EFB13084.1| hypothetical protein PANDA_018954 [Ailuropoda melanoleuca]
Length = 463
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 58/159 (36%), Positives = 95/159 (59%), Gaps = 6/159 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT
Sbjct: 302 AHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 361
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE
Sbjct: 362 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
+AR+ L++E+S K +VV H ++ F + Q
Sbjct: 422 LLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|442757633|gb|JAA70975.1| Putative rna polymer [Ixodes ricinus]
Length = 459
Score = 117 bits (292), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 90/150 (60%), Gaps = 6/150 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE-- 77
G++VVETN+++YAY+ S+L +L LF ++ Y+LPNL+VG +T+ES+ A +GIT+
Sbjct: 302 GYIVVETNYRVYAYTDSQLQVALLALFCELMYRLPNLVVGVLTRESVRQALRSGITSNQI 361
Query: 78 ----QQNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
Q +AHP + P +P + DQ+RLWE + +R Y +F S+ F+ +Y
Sbjct: 362 IKFLQMHAHPEAQKQSPVIPPTIMDQLRLWELERDRFNFREGVLYSQFISQSDFQLLRNY 421
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
A D L+W++ K +VVN H ++ F
Sbjct: 422 ASDLGVLIWDNPSKRVMVVNRNGHDEVKRF 451
>gi|336370852|gb|EGN99192.1| hypothetical protein SERLA73DRAFT_90464 [Serpula lacrymans var.
lacrymans S7.3]
gi|336383609|gb|EGO24758.1| hypothetical protein SERLADRAFT_361686 [Serpula lacrymans var.
lacrymans S7.9]
Length = 469
Score = 117 bits (292), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 92/151 (60%), Gaps = 6/151 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+V+ETN+++YAY+ L +L LF ++Y+ PNL+VG +T+ES+ A NGI+ EQ
Sbjct: 311 GFIVLETNYRIYAYTDKPLQTAVLNLFITMKYRFPNLVVGMLTRESVKKALSNGISAEQI 370
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
+AHP++ P +P V DQIRLWE + NR+++ + Y +F S +E +Y
Sbjct: 371 ISYLTTHAHPQMRKNNPLLPVTVQDQIRLWELERNRLKSEEGYLYKDFGSHADYEYVLNY 430
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
A+ +LWE++ + + + H ++R F+
Sbjct: 431 AKQLDVVLWENTSRRCFFGSLDGHTNIRGFI 461
>gi|426250586|ref|XP_004019016.1| PREDICTED: general transcription factor IIH subunit 4 [Ovis aries]
Length = 463
Score = 117 bits (292), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 58/159 (36%), Positives = 95/159 (59%), Gaps = 6/159 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT
Sbjct: 302 AHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 361
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE
Sbjct: 362 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
+AR+ L++E+S K +VV H ++ F + Q
Sbjct: 422 LLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|417401351|gb|JAA47565.1| Putative rna polymer [Desmodus rotundus]
Length = 463
Score = 117 bits (292), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 58/159 (36%), Positives = 95/159 (59%), Gaps = 6/159 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT
Sbjct: 302 AHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 361
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE
Sbjct: 362 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
+AR+ L++E+S K +VV H ++ F + Q
Sbjct: 422 LLAHARELGVLVFENSAKRLMVVTPAGHGDVKRFWKRQK 460
>gi|62751795|ref|NP_001015845.1| general transcription factor IIH, polypeptide 4, 52kDa [Xenopus
(Silurana) tropicalis]
gi|58475911|gb|AAH90134.1| general transcription factor II H, polypeptide 4 [Xenopus
(Silurana) tropicalis]
gi|89272862|emb|CAJ82116.1| transcription factor tfb2 [Xenopus (Silurana) tropicalis]
gi|114107989|gb|AAI22900.1| gtf2h4 protein [Xenopus (Silurana) tropicalis]
Length = 455
Score = 117 bits (292), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/154 (38%), Positives = 94/154 (61%), Gaps = 6/154 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PNL+V +T+E++ A NGIT EQ
Sbjct: 298 GFIVVETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQVTRENVQQAIGNGITAEQI 357
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP + + P +P + DQIRLWE + +R+ + Y++F S+ FE +Y
Sbjct: 358 IHFLRTRAHPVMLQQNPVLPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLRNY 417
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
ARD L++E+ K +VV H ++ F + Q
Sbjct: 418 ARDLGVLVFENPAKRVMVVTPGGHSDVKRFWKRQ 451
>gi|440896165|gb|ELR48176.1| General transcription factor IIH subunit 4 [Bos grunniens mutus]
Length = 463
Score = 117 bits (292), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 58/159 (36%), Positives = 95/159 (59%), Gaps = 6/159 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT
Sbjct: 302 AHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 361
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE
Sbjct: 362 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
+AR+ L++E+S K +VV H ++ F + Q
Sbjct: 422 LLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|149031822|gb|EDL86757.1| rCG41794 [Rattus norvegicus]
Length = 209
Score = 117 bits (292), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 94/155 (60%), Gaps = 6/155 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q
Sbjct: 52 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 111
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE +
Sbjct: 112 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 171
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
AR+ L++E+S K +VV H ++ F + Q
Sbjct: 172 ARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 206
>gi|299750117|ref|XP_001836551.2| transcription factor TFIIH complex subunit Tfb2 [Coprinopsis
cinerea okayama7#130]
gi|298408751|gb|EAU85259.2| transcription factor TFIIH complex subunit Tfb2 [Coprinopsis
cinerea okayama7#130]
Length = 403
Score = 117 bits (292), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 93/151 (61%), Gaps = 6/151 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+++ETN+++YAY+ + L +L LF+ ++Y+ PNL+VG+IT+ES+ A NGI+ +Q
Sbjct: 245 GFIILETNYRIYAYTDNPLQTAVLSLFASLKYRFPNLVVGSITRESVKKALLNGISADQI 304
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
+AHP + P +P V DQIRLWE + NR+++ Y F S+ +E Y
Sbjct: 305 ISYLITHAHPNMRKNNPLLPVTVQDQIRLWELEKNRLKSREGFLYTAFASQADYELVLQY 364
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
A++ +LWE++ K + E H +++ F+
Sbjct: 365 AKELDVVLWENASKRCFFGSLEGHANIKGFI 395
>gi|195999048|ref|XP_002109392.1| hypothetical protein TRIADDRAFT_20428 [Trichoplax adhaerens]
gi|190587516|gb|EDV27558.1| hypothetical protein TRIADDRAFT_20428 [Trichoplax adhaerens]
Length = 452
Score = 117 bits (292), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 60/156 (38%), Positives = 89/156 (57%), Gaps = 8/156 (5%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
+ + G+++VETN+++YAY+ S L ++ LF ++ Y+LP L VG IT+ES+ AF NGIT
Sbjct: 289 SKSSGYIIVETNYRLYAYTESPLDIALVALFCEMMYRLPGLCVGLITRESVQQAFTNGIT 348
Query: 76 TEQ------QNAHPRVADKI--PSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVF 127
+ +AHP K+ P VP + DQ+ LWE + R+ + Y++ S F
Sbjct: 349 ANKIINFIRTHAHPEAKKKVTTPIVPSTIIDQLYLWELERGRLSCSDGVLYNQILSSSDF 408
Query: 128 EAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
EA YA D LLW K LVVN + H H++ F
Sbjct: 409 EALRKYADDMGVLLWASPAKRLLVVNRDGHNHVKHF 444
>gi|409048320|gb|EKM57798.1| hypothetical protein PHACADRAFT_171015 [Phanerochaete carnosa
HHB-10118-sp]
Length = 472
Score = 116 bits (291), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 58/159 (36%), Positives = 92/159 (57%), Gaps = 10/159 (6%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+++ETN+++YAY+ + L +L LF ++ + PNL+VGAIT+ES+ A NGIT
Sbjct: 306 AHGQGFIILETNYRLYAYTDNPLQIAVLNLFVTLKSRYPNLVVGAITRESVKKALTNGIT 365
Query: 76 TEQQN----------AHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRD 125
+Q + AHP++ P +P V DQIRLWE + NR+++ + Y F S+
Sbjct: 366 ADQASRQIISYLTTHAHPQMRKNKPLLPVTVQDQIRLWELEKNRMKSQEGYLYTAFASQA 425
Query: 126 VFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+E +YA+ +LWE S + E H ++R F+
Sbjct: 426 DYEYVLNYAKQLGVVLWESSGRRCFFGTLEGHPNIRGFI 464
>gi|169784197|ref|XP_001826560.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus oryzae RIB40]
gi|238508739|ref|XP_002385554.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus flavus NRRL3357]
gi|83775305|dbj|BAE65427.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220688446|gb|EED44799.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus flavus NRRL3357]
gi|391868531|gb|EIT77745.1| RNA polymerase II transcription initiation protein [Aspergillus
oryzae 3.042]
Length = 480
Score = 116 bits (291), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 64/171 (37%), Positives = 97/171 (56%), Gaps = 23/171 (13%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF++VETN+++YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GIT +
Sbjct: 313 GSGFIIVETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAIEMGITAD 372
Query: 78 Q------QNAHPRV------------ADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYY 118
Q +AHP++ A PSV P V DQIRLW+ + +RV+ TP +
Sbjct: 373 QIISYLATHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVTDQIRLWQLERDRVKATPGFLF 432
Query: 119 DEFPSRDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 168
+F S +EA C YA + L+W+ D K+M V H + FLR +
Sbjct: 433 KDFVSLAEYEAPCRYAEEIGVLVWKSDRKRMFFVTR---HEQVAAFLRSRK 480
>gi|344307658|ref|XP_003422497.1| PREDICTED: general transcription factor IIH subunit 4 [Loxodonta
africana]
Length = 463
Score = 116 bits (291), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 57/159 (35%), Positives = 95/159 (59%), Gaps = 6/159 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF++VETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT
Sbjct: 302 AHQPGFIIVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 361
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE
Sbjct: 362 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
+AR+ L++E+S K +VV H ++ F + Q
Sbjct: 422 LLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|74143475|dbj|BAE28812.1| unnamed protein product [Mus musculus]
Length = 463
Score = 116 bits (291), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 94/155 (60%), Gaps = 6/155 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ +GIT +Q
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQVIASGITAQQI 365
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP + + P +P + DQIRLWE + +R+ T Y++F S+ VFE +
Sbjct: 366 IHFLRTRAHPVMLKQNPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVVFELLLAH 425
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
AR+ L++E+S K +VV H ++ F + Q
Sbjct: 426 ARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|359320969|ref|XP_003639474.1| PREDICTED: general transcription factor IIH subunit 4-like [Canis
lupus familiaris]
Length = 463
Score = 116 bits (291), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 57/159 (35%), Positives = 95/159 (59%), Gaps = 6/159 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF++VETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT
Sbjct: 302 AHQPGFIIVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 361
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE
Sbjct: 362 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
+AR+ L++E+S K +VV H ++ F + Q
Sbjct: 422 LLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|344252968|gb|EGW09072.1| General transcription factor IIH subunit 4 [Cricetulus griseus]
Length = 463
Score = 116 bits (291), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 94/155 (60%), Gaps = 6/155 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE +
Sbjct: 366 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
AR+ L++E+S K +VV H ++ F + Q
Sbjct: 426 ARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|178056954|ref|NP_001116592.1| general transcription factor IIH subunit 4 [Sus scrofa]
gi|41529162|dbj|BAD08424.1| general transcription factor IIH, polypeptide 4 [Sus scrofa]
gi|47496805|dbj|BAD08426.2| general transcription factor IIH, polypeptide 4 [Sus scrofa]
Length = 463
Score = 116 bits (291), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 94/155 (60%), Gaps = 6/155 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE +
Sbjct: 366 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
AR+ L++E+S K +VV H ++ F + Q
Sbjct: 426 ARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|47059175|ref|NP_997666.1| general transcription factor II H, polypeptide 4 [Rattus
norvegicus]
gi|46237651|emb|CAE84027.1| general transcription factor II H, polypeptide 4 [Rattus
norvegicus]
gi|117558335|gb|AAI27470.1| General transcription factor II H, polypeptide 4 [Rattus
norvegicus]
Length = 463
Score = 116 bits (291), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 94/155 (60%), Gaps = 6/155 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE +
Sbjct: 366 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
AR+ L++E+S K +VV H ++ F + Q
Sbjct: 426 ARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|4504201|ref|NP_001508.1| general transcription factor IIH subunit 4 [Homo sapiens]
gi|113865855|ref|NP_001038968.1| general transcription factor IIH subunit 4 [Pan troglodytes]
gi|114050799|ref|NP_001040607.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|297677636|ref|XP_002816695.1| PREDICTED: general transcription factor IIH subunit 4 isoform 2
[Pongo abelii]
gi|332245940|ref|XP_003272109.1| PREDICTED: general transcription factor IIH subunit 4 [Nomascus
leucogenys]
gi|426352309|ref|XP_004043656.1| PREDICTED: general transcription factor IIH subunit 4 [Gorilla
gorilla gorilla]
gi|17380328|sp|Q92759.1|TF2H4_HUMAN RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=Basic transcription factor 2 52 kDa subunit;
Short=BTF2 p52; AltName: Full=General transcription
factor IIH polypeptide 4; AltName: Full=TFIIH basal
transcription factor complex p52 subunit
gi|38503276|sp|P60027.1|TF2H4_PANTR RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=Basic transcription factor 2 52 kDa subunit;
Short=BTF2 p52; AltName: Full=General transcription
factor IIH polypeptide 4; AltName: Full=TFIIH basal
transcription factor complex p52 subunit
gi|1514597|emb|CAA68870.1| transcription factor TFIIH [Homo sapiens]
gi|13436278|gb|AAH04935.1| General transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|15277224|dbj|BAB63317.1| Transcription factor II H [Homo sapiens]
gi|16740884|gb|AAH16302.1| General transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|21655317|gb|AAM64222.1| general transcription factor IIH, polypeptide 4 (52kD subunit)
[Homo sapiens]
gi|27544403|dbj|BAC54936.1| transcription factor II H [Homo sapiens]
gi|30583481|gb|AAP35985.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|32127781|dbj|BAC78171.1| transcription factor II H [Pan troglodytes]
gi|55700798|dbj|BAD69753.1| general transcription factor IIH, polypeptide 4, 52kDa [Macaca
mulatta]
gi|60655839|gb|AAX32483.1| general transcription factor IIH polypeptide 4 [synthetic
construct]
gi|60655841|gb|AAX32484.1| general transcription factor IIH polypeptide 4 [synthetic
construct]
gi|86197966|dbj|BAE78622.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|90960934|dbj|BAE92819.1| general transcription factor IIH, polypeptide 4 [Pan troglodytes]
gi|90960936|dbj|BAE92820.1| general transcription factor IIH, polypeptide 4 [Pan troglodytes]
gi|114306784|dbj|BAF31271.1| TFIIH protein [Homo sapiens]
gi|119623752|gb|EAX03347.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|123992979|gb|ABM84091.1| general transcription factor IIH, polypeptide 4, 52kDa [synthetic
construct]
gi|123999907|gb|ABM87462.1| general transcription factor IIH, polypeptide 4, 52kDa [synthetic
construct]
gi|355561507|gb|EHH18139.1| General transcription factor IIH polypeptide 4, partial [Macaca
mulatta]
gi|355748409|gb|EHH52892.1| General transcription factor IIH polypeptide 4 [Macaca
fascicularis]
gi|380812138|gb|AFE77944.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|380812140|gb|AFE77945.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|383408563|gb|AFH27495.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|410214852|gb|JAA04645.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
gi|410250640|gb|JAA13287.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
gi|410290944|gb|JAA24072.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
gi|410352807|gb|JAA43007.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
Length = 462
Score = 116 bits (291), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 94/155 (60%), Gaps = 6/155 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q
Sbjct: 305 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 364
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE +
Sbjct: 365 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 424
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
AR+ L++E+S K +VV H ++ F + Q
Sbjct: 425 ARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 459
>gi|410958682|ref|XP_003985944.1| PREDICTED: general transcription factor IIH subunit 4 [Felis catus]
Length = 463
Score = 116 bits (291), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 58/159 (36%), Positives = 95/159 (59%), Gaps = 6/159 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT
Sbjct: 302 AHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 361
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE
Sbjct: 362 AQQIIHFLRTRAHPVMLKQSPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
+AR+ L++E+S K +VV H ++ F + Q
Sbjct: 422 LLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|395831929|ref|XP_003789035.1| PREDICTED: general transcription factor IIH subunit 4 [Otolemur
garnettii]
Length = 463
Score = 116 bits (291), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 94/155 (60%), Gaps = 6/155 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE +
Sbjct: 366 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
AR+ L++E+S K +VV H ++ F + Q
Sbjct: 426 ARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|19113355|ref|NP_596563.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces pombe 972h-]
gi|62901133|sp|P87303.2|TFB2_SCHPO RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|3080538|emb|CAA18666.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces pombe]
Length = 447
Score = 115 bits (289), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 54/155 (34%), Positives = 90/155 (58%), Gaps = 6/155 (3%)
Query: 17 DNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
D+ GF++VETN+++YAY++S L I+ LF+ + + NL+VG IT++S+ A NGI
Sbjct: 288 DDKGFIIVETNYRLYAYTSSPLQIAIIGLFANLRARFSNLVVGVITRDSIRRALMNGIAA 347
Query: 77 EQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
EQ +AHP++ +P +P + DQI LWE + NR+ TP + +F + F+ A
Sbjct: 348 EQIITYLTTHAHPQMRSNVPLLPPTLVDQIYLWELEKNRLRATPGILFRDFLTDSGFDQA 407
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+YA++ L+W+ S K + M +L+
Sbjct: 408 VEYAKELGVLVWDSSLKRMFFITTTGAQPMIAYLK 442
>gi|351713119|gb|EHB16038.1| General transcription factor IIH subunit 4 [Heterocephalus glaber]
Length = 463
Score = 115 bits (289), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 94/155 (60%), Gaps = 6/155 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q
Sbjct: 306 GFIVVETNYRLYAYTESELQVALIALFSEMLYRFPNVVVAQVTRESVQQAIGSGITAQQI 365
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE +
Sbjct: 366 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
AR+ L++E+S K +VV H ++ F + Q
Sbjct: 426 ARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|62896773|dbj|BAD96327.1| general transcription factor IIH, polypeptide 4, 52kDa variant
[Homo sapiens]
Length = 462
Score = 115 bits (289), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 93/155 (60%), Gaps = 6/155 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q
Sbjct: 305 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 364
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP + + P +P + DQIRLWE +R+ T Y++F S+ FE +
Sbjct: 365 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELGRDRLRFTEGVLYNQFLSQVDFELLLAH 424
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
AR+ L++E+S K +VV H ++ F + Q
Sbjct: 425 ARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 459
>gi|119179887|ref|XP_001241468.1| hypothetical protein CIMG_08631 [Coccidioides immitis RS]
gi|392866653|gb|EAS30172.2| transcription factor Tfb2 [Coccidioides immitis RS]
Length = 487
Score = 115 bits (289), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 97/173 (56%), Gaps = 21/173 (12%)
Query: 14 TIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 73
T A GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNL+ G +T++S+ A E G
Sbjct: 313 TGAPGTGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKLTRQSIRRAVEMG 372
Query: 74 ITTEQ------QNAHPRV------------ADKIPSV-PKNVCDQIRLWESDLNRVETTP 114
IT +Q +AHP++ A + SV P V DQIRLW+ + +R++ TP
Sbjct: 373 ITADQIISYLTTHAHPQMRKVNASKSTSTTAGLLASVLPPTVVDQIRLWQLERDRLKATP 432
Query: 115 AHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
+ +F S FEA C YA + L+W+ K+ V H + FLR +
Sbjct: 433 GFLFKDFASSAEFEAPCRYAEEIGVLVWKSEKRRMFFVTR--HEQVAAFLRSR 483
>gi|303321033|ref|XP_003070511.1| Transcription factor tfb2 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240110207|gb|EER28366.1| Transcription factor tfb2 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320036051|gb|EFW17991.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Coccidioides posadasii str. Silveira]
Length = 487
Score = 115 bits (288), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 97/173 (56%), Gaps = 21/173 (12%)
Query: 14 TIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 73
T A GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNL+ G +T++S+ A E G
Sbjct: 313 TGAPGTGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKLTRQSIRRAVEMG 372
Query: 74 ITTEQ------QNAHPRV----ADKIPS---------VPKNVCDQIRLWESDLNRVETTP 114
IT +Q +AHP++ A K S +P V DQIRLW+ + +R++ TP
Sbjct: 373 ITADQIISYLTTHAHPQMRKVNASKSTSTTAGLPASVLPPTVVDQIRLWQLERDRLKATP 432
Query: 115 AHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
+ +F S FEA C YA + L+W+ K+ V H + FLR +
Sbjct: 433 GFLFKDFASSAEFEAPCRYAEEIGVLVWKSEKRRMFFVTR--HEQVAAFLRSR 483
>gi|392564310|gb|EIW57488.1| transcription factor Tfb2 [Trametes versicolor FP-101664 SS1]
Length = 477
Score = 115 bits (287), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 93/151 (61%), Gaps = 6/151 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+V+ETN+++YAY+ + L +L LF ++ + PNL++GAIT++S+ A +GIT +Q
Sbjct: 318 GFIVLETNYRLYAYTDNPLQIAVLNLFVTLKSRFPNLVIGAITRDSVKKALASGITADQI 377
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
+AHP++ P +P V DQIRLWE + NR+++ + Y F S+ +E +Y
Sbjct: 378 ISYLVTHAHPQMRKNQPLLPVTVQDQIRLWELEKNRMKSQEGYLYTAFASQADYEYVLNY 437
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
A+ +LWE++ + + E H ++R F+
Sbjct: 438 AKQLDVVLWENASRRCFFGSVEGHANIRGFI 468
>gi|6754094|ref|NP_034494.1| general transcription factor IIH subunit 4 [Mus musculus]
gi|18202124|sp|O70422.1|TF2H4_MOUSE RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=Basic transcription factor 2 52 kDa subunit;
Short=BTF2 p52; AltName: Full=General transcription
factor IIH polypeptide 4; AltName: Full=TFIIH basal
transcription factor complex p52 subunit
gi|2997755|gb|AAC08594.1| TFIIH transcription/DNA repair factor p52 subunit [Mus musculus]
gi|15215027|gb|AAH12638.1| Gtf2h4 protein [Mus musculus]
gi|26348003|dbj|BAC37650.1| unnamed protein product [Mus musculus]
gi|74194835|dbj|BAE26009.1| unnamed protein product [Mus musculus]
gi|74222199|dbj|BAE26910.1| unnamed protein product [Mus musculus]
Length = 463
Score = 115 bits (287), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 94/155 (60%), Gaps = 6/155 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE +
Sbjct: 366 IHFLRTRAHPVMLKQNPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
AR+ L++E+S K +VV H ++ F + Q
Sbjct: 426 ARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|170039509|ref|XP_001847575.1| TFIIH basal transcription factor complex p52 subunit [Culex
quinquefasciatus]
gi|167863052|gb|EDS26435.1| TFIIH basal transcription factor complex p52 subunit [Culex
quinquefasciatus]
Length = 492
Score = 115 bits (287), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 60/176 (34%), Positives = 102/176 (57%), Gaps = 18/176 (10%)
Query: 12 IPTI------ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKES 65
+PTI A + G++VVETN+++YAY+ S L +L LF+++ Y+ PNL+VG +T++S
Sbjct: 314 VPTIQEDGSAAQDKGYIVVETNYRVYAYTDSNLQVALLGLFTELLYRFPNLVVGVLTRDS 373
Query: 66 LYNAFENGITTEQ------QNAHP------RVADKIPSVPKNVCDQIRLWESDLNRVETT 113
+ AF GIT +Q Q+AHP + + ++P V DQI+LWE++ NR T
Sbjct: 374 VRQAFRGGITADQIISYLEQHAHPTMHNVEQTINTKSALPPTVVDQIKLWENERNRFTYT 433
Query: 114 PAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
Y++F S+ F DYA+ ++W++ + +VV H +++F + +K
Sbjct: 434 EGVVYNQFLSQGDFNTLRDYAQSIGVMIWQNERTRTMVVTKNGHDDVKKFWKRYSK 489
>gi|261187837|ref|XP_002620336.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Ajellomyces dermatitidis SLH14081]
gi|239593453|gb|EEQ76034.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Ajellomyces dermatitidis SLH14081]
gi|239608433|gb|EEQ85420.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Ajellomyces dermatitidis ER-3]
gi|327356043|gb|EGE84900.1| RNA pol II transcription initiation subunit [Ajellomyces
dermatitidis ATCC 18188]
Length = 485
Score = 115 bits (287), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 59/168 (35%), Positives = 96/168 (57%), Gaps = 20/168 (11%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNLI G +T++S+ A E GIT +
Sbjct: 316 GTGFIIIETNYRLYAYTSSPLQISLISLFTTLKYRFPNLITGKLTRQSIRRAVEMGITAD 375
Query: 78 Q------QNAHPRV----ADK-------IPSV-PKNVCDQIRLWESDLNRVETTPAHYYD 119
Q +AHP++ A K +P+V P V DQIRLW+ + +R++ TP +
Sbjct: 376 QIISYLTTHAHPQMRKYHASKSGANPAGVPAVLPPTVVDQIRLWQLERDRIKATPGFLFK 435
Query: 120 EFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
+F S FE C YA + LLW+ K+ V H + +LR +
Sbjct: 436 DFVSLAEFEGPCRYAEEIGVLLWKSEKRRMFFVTR--HEQVAAYLRSR 481
>gi|392594388|gb|EIW83712.1| transcription factor Tfb2 [Coniophora puteana RWD-64-598 SS2]
Length = 486
Score = 115 bits (287), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 54/154 (35%), Positives = 94/154 (61%), Gaps = 6/154 (3%)
Query: 17 DNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
++ GF+V+ETN+++YAY+ + L +L LF ++Y+ PNL+VG +T+ES+ A NGI+
Sbjct: 325 NDHGFIVLETNYRVYAYTDNPLQIAVLNLFITLKYRFPNLVVGMLTRESVRRALGNGISA 384
Query: 77 EQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
EQ +AHP++ P +P V DQ+RLWE + NR+++ Y +F ++ +E
Sbjct: 385 EQVISYLTAHAHPQMRKNNPLLPVTVQDQVRLWELERNRLKSDDGFLYKDFATQADYEYV 444
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
YA++ +LWE++ K + H ++R F+
Sbjct: 445 LAYAKELDVVLWENTSKRCFFGTLDGHTNVRGFI 478
>gi|302495702|ref|XP_003009866.1| hypothetical protein ARB_03925 [Arthroderma benhamiae CBS 112371]
gi|291173385|gb|EFE29221.1| hypothetical protein ARB_03925 [Arthroderma benhamiae CBS 112371]
Length = 386
Score = 114 bits (286), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 95/165 (57%), Gaps = 18/165 (10%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF++VETN+++YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GIT +
Sbjct: 220 GTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITAD 279
Query: 78 Q------QNAHPRVADKIPS--------VPKNVCDQIRLWESDLNRVETTPAHYYDEFPS 123
Q +AHP++ S +P V DQIRLW+ + +R++ T + EF +
Sbjct: 280 QIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDT 339
Query: 124 RDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 167
FEA C YA + L+W+ DS++M V H + FLR +
Sbjct: 340 FAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR---HEQVAAFLRSR 381
>gi|302667863|ref|XP_003025510.1| hypothetical protein TRV_00272 [Trichophyton verrucosum HKI 0517]
gi|291189624|gb|EFE44899.1| hypothetical protein TRV_00272 [Trichophyton verrucosum HKI 0517]
Length = 386
Score = 114 bits (286), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 95/165 (57%), Gaps = 18/165 (10%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF++VETN+++YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GIT +
Sbjct: 220 GTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITAD 279
Query: 78 Q------QNAHPRVADKIPS--------VPKNVCDQIRLWESDLNRVETTPAHYYDEFPS 123
Q +AHP++ S +P V DQIRLW+ + +R++ T + EF +
Sbjct: 280 QIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDT 339
Query: 124 RDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 167
FEA C YA + L+W+ DS++M V H + FLR +
Sbjct: 340 FAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR---HEQVAAFLRSR 381
>gi|353240543|emb|CCA72408.1| probable TFB2-TFIIH subunit (transcription/repair factor)
[Piriformospora indica DSM 11827]
Length = 462
Score = 114 bits (286), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 54/150 (36%), Positives = 90/150 (60%), Gaps = 6/150 (4%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
FVV+ETN+ +YAY+ + L +L LF + + PN+I+G++T++S+ A NGIT +Q
Sbjct: 307 FVVLETNYHVYAYTNNPLQIAVLNLFVSFKARFPNMIMGSLTRDSVKKALVNGITADQIL 366
Query: 79 ----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYA 134
+AHP++ P +P V DQIRLWE + +RV+ + Y EF S + +E YA
Sbjct: 367 SYLVTHAHPQMRKNNPIIPVTVQDQIRLWELERHRVKGQDGYLYKEFASMNDYEVVVQYA 426
Query: 135 RDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
R+ +LWE++ + +A +H+R ++
Sbjct: 427 RELGVVLWENASRRMFFADAAGRVHIRSYI 456
>gi|159474228|ref|XP_001695231.1| predicted protein [Chlamydomonas reinhardtii]
gi|158276165|gb|EDP01939.1| predicted protein [Chlamydomonas reinhardtii]
Length = 435
Score = 114 bits (285), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/137 (43%), Positives = 83/137 (60%), Gaps = 6/137 (4%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQN 80
FV+VE+N+++Y Y+ S + +L LF K E LPNL VGAI ++S+ A GIT ++
Sbjct: 299 FVIVESNYRVYVYTRSPVTIAVLELFVKREALLPNLFVGAIRRDSILAALARGITADELV 358
Query: 81 A------HPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYA 134
A HP +A + P VP+ V DQIRLWE+ +NR++ PA Y+ SR ++E A A
Sbjct: 359 AYLSARPHPSIAARCPVVPEVVSDQIRLWEASMNRLQAHPAVLYENMESRQLYERAVAAA 418
Query: 135 RDQSGLLWEDSKKMRLV 151
R L WED +MR V
Sbjct: 419 RAAGTLQWEDGARMRFV 435
>gi|327298217|ref|XP_003233802.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
Tfb2 [Trichophyton rubrum CBS 118892]
gi|326463980|gb|EGD89433.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
Tfb2 [Trichophyton rubrum CBS 118892]
Length = 484
Score = 114 bits (285), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 95/165 (57%), Gaps = 18/165 (10%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF++VETN+++YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GIT +
Sbjct: 318 GTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITAD 377
Query: 78 Q------QNAHPRVADKIPS--------VPKNVCDQIRLWESDLNRVETTPAHYYDEFPS 123
Q +AHP++ S +P V DQIRLW+ + +R++ T + EF +
Sbjct: 378 QIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDT 437
Query: 124 RDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 167
FEA C YA + L+W+ DS++M V H + FLR +
Sbjct: 438 FAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR---HEQVAAFLRSR 479
>gi|240279262|gb|EER42767.1| TFIIH basal transcription factor complex p52 subunit [Ajellomyces
capsulatus H143]
Length = 485
Score = 114 bits (285), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 101/178 (56%), Gaps = 23/178 (12%)
Query: 11 FIPTIADN---CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLY 67
+P+ + N GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNLI G +T++S+
Sbjct: 306 LVPSASSNEPSTGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVR 365
Query: 68 NAFENGITTEQ------QNAHPRV----ADK-------IPSV-PKNVCDQIRLWESDLNR 109
A E GIT +Q +AHP++ A K +P+V P V DQIRLW+ + +R
Sbjct: 366 RAVEMGITADQIISYLTTHAHPQMRKYHATKSGANPVGVPTVLPPTVVDQIRLWQLERDR 425
Query: 110 VETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
++ TP + +F S FE C YA + L+W+ +K V H + +LR +
Sbjct: 426 IKATPGFLFKDFVSLAEFEGPCRYAEEIGVLVWKSERKRMFFVTR--HEQVAAYLRSK 481
>gi|325089533|gb|EGC42843.1| TFIIH basal transcription factor complex p52 subunit [Ajellomyces
capsulatus H88]
Length = 485
Score = 114 bits (285), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 101/178 (56%), Gaps = 23/178 (12%)
Query: 11 FIPTIADN---CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLY 67
+P+ + N GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNLI G +T++S+
Sbjct: 306 LVPSASSNEPSTGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVR 365
Query: 68 NAFENGITTEQ------QNAHPRV----ADK-------IPSV-PKNVCDQIRLWESDLNR 109
A E GIT +Q +AHP++ A K +P+V P V DQIRLW+ + +R
Sbjct: 366 RAVEMGITADQIISYLTTHAHPQMRKYHATKSGANPVGVPTVLPPTVVDQIRLWQLERDR 425
Query: 110 VETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
++ TP + +F S FE C YA + L+W+ +K V H + +LR +
Sbjct: 426 IKATPGFLFKDFVSLAEFEGPCRYAEEIGVLVWKSERKRMFFVTR--HEQVATYLRSK 481
>gi|121714341|ref|XP_001274781.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus clavatus NRRL 1]
gi|119402935|gb|EAW13355.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus clavatus NRRL 1]
Length = 478
Score = 114 bits (285), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/171 (34%), Positives = 95/171 (55%), Gaps = 23/171 (13%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GIT +
Sbjct: 311 GSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITAD 370
Query: 78 Q------QNAHPRV-------------ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYY 118
Q +AHP++ + +P V DQIRLW+ + +RV+ TP +
Sbjct: 371 QIISYLSTHAHPQMRKHNASRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVKATPGFLF 430
Query: 119 DEFPSRDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 168
+F S +EA C YA + L+W+ D K+M V H + FLR +
Sbjct: 431 KDFVSLAEYEAPCRYAEEIGVLVWKSDRKRMFFVTR---HEQVAAFLRSRK 478
>gi|449540295|gb|EMD31288.1| hypothetical protein CERSUDRAFT_119839 [Ceriporiopsis subvermispora
B]
Length = 412
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 91/151 (60%), Gaps = 6/151 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+V+ETN+++YAY+ + L +L LF ++ + PNL++GA+T+ES+ A NGIT +Q
Sbjct: 254 GFIVLETNYRVYAYTDNPLQIAVLNLFVTMKSRFPNLVIGAVTRESVKKALANGITADQI 313
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
+AHP++ P +P V DQIRLWE + NRV++ + Y F S+ +E +Y
Sbjct: 314 ISYLTAHAHPQMRRYKPLLPVTVQDQIRLWELEKNRVKSQEGYLYTAFASQADYEYVLNY 373
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
A+ ++WE + + E H ++R F+
Sbjct: 374 AKQLDVVIWESPTRRCFFGSLEGHSNIRGFI 404
>gi|443894209|dbj|GAC71558.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH, subunit TFB2 [Pseudozyma antarctica
T-34]
Length = 491
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 56/152 (36%), Positives = 92/152 (60%), Gaps = 6/152 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G++++ETN+++YAY+++ L +L LF I+ + PNL+VG+IT++S+ +A NGIT EQ
Sbjct: 336 GYIILETNYRLYAYTSNPLRVAVLSLFVTIKARFPNLLVGSITRDSVKSALANGITAEQI 395
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
+AH ++ P +P V DQIRLWE + NRV+ + +F S+ F +Y
Sbjct: 396 ITYLTHHAHLQMHRNDPLLPVTVSDQIRLWEREKNRVQQNLGSLFTDFTSQPDFAEVRNY 455
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
A L+W D K RL V+ + +R+++R
Sbjct: 456 ASQLGVLVWHDEPKRRLFVDEAGNEPVRDYIR 487
>gi|296818081|ref|XP_002849377.1| RNA polymerase II transcription factor B subunit 2 [Arthroderma
otae CBS 113480]
gi|238839830|gb|EEQ29492.1| RNA polymerase II transcription factor B subunit 2 [Arthroderma
otae CBS 113480]
Length = 484
Score = 114 bits (284), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 98/172 (56%), Gaps = 20/172 (11%)
Query: 13 PTIADN--CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAF 70
PT A + GF++VETN+++YAY++S L ++ LF+ ++++ PNLI G IT++S+ A
Sbjct: 311 PTSATDGGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAI 370
Query: 71 ENGITTEQ------QNAHPRVADKIPS--------VPKNVCDQIRLWESDLNRVETTPAH 116
E GIT +Q +AHP++ S +P V DQIRLW+ + +R++ T
Sbjct: 371 EMGITADQIVSYLTTHAHPQMRKTKKSTSNVAASILPPTVVDQIRLWQLERDRIKATSGF 430
Query: 117 YYDEFPSRDVFEAACDYARDQSGLLW-EDSKKMRLVVNAEIHMHMREFLRGQ 167
+ EF + FEA C YA + L+W DS++M V H + FLR +
Sbjct: 431 LFKEFDTFAEFEAPCKYAEEIGVLVWRSDSRRMFFVTR---HEQVAAFLRSR 479
>gi|169610043|ref|XP_001798440.1| hypothetical protein SNOG_08115 [Phaeosphaeria nodorum SN15]
gi|160701967|gb|EAT84391.2| hypothetical protein SNOG_08115 [Phaeosphaeria nodorum SN15]
Length = 426
Score = 114 bits (284), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/135 (40%), Positives = 82/135 (60%), Gaps = 6/135 (4%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A N GF++VETN+++YAY+ S + IL LF+K++++ PNL+ G +TKES++ A + GIT
Sbjct: 264 AQNKGFIIVETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHRAVQAGIT 323
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+ Q AHP++ P +P V DQIRLWE + RVE T + EF S +
Sbjct: 324 SAQIISYLTTYAHPQMQKSNPPLPPTVMDQIRLWEYEGERVEVTTGYLMREFGSESEYRD 383
Query: 130 ACDYARDQSGLLWED 144
YA+ L+W++
Sbjct: 384 VMGYAQALGVLVWKN 398
>gi|340371925|ref|XP_003384495.1| PREDICTED: general transcription factor IIH subunit 4-like
[Amphimedon queenslandica]
Length = 446
Score = 113 bits (283), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 89/158 (56%), Gaps = 6/158 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
GF++VETN+++ AY+ SKLH L LF +++Y+ PN+ VG IT+ES+ A +GI
Sbjct: 286 TSQAGFIIVETNYRVIAYTDSKLHIATLALFCQMQYRFPNVAVGTITRESIQQALVHGIK 345
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q Q+AH + K +P V DQI+LWE + NR+ Y EF S +E
Sbjct: 346 ADQIISFLTQHAHHNMLSKAHILPPTVTDQIKLWEIERNRLSFQEGILYSEFLSVTDYEK 405
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
YA D LLW + ++ +V++ + H +R F + Q
Sbjct: 406 VKKYAEDLGVLLWSNRQRRLMVIHPDRHDEIRHFWKRQ 443
>gi|195126587|ref|XP_002007752.1| GI12214 [Drosophila mojavensis]
gi|193919361|gb|EDW18228.1| GI12214 [Drosophila mojavensis]
Length = 492
Score = 113 bits (282), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 60/162 (37%), Positives = 96/162 (59%), Gaps = 12/162 (7%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G++VVETN+++YAY+ S+L +L LF+++ Y+ PNL+VG +T++S+ A GIT EQ
Sbjct: 328 GYIVVETNYRVYAYTDSQLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 387
Query: 79 -----QNAHPR---VADKIPS---VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVF 127
Q AHP V I S +P V DQI+LWE + NR T Y++F S++ F
Sbjct: 388 VSYLEQYAHPNMKLVESAIQSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSQNDF 447
Query: 128 EAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
DYA+ Q+ L+W++ + +VV H ++ + + +K
Sbjct: 448 VTLRDYAQSQNVLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 489
>gi|328863850|gb|EGG12949.1| hypothetical protein MELLADRAFT_46475 [Melampsora larici-populina
98AG31]
Length = 454
Score = 113 bits (282), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 95/156 (60%), Gaps = 6/156 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
+D F+++ETN+++YAY+++ L IL LF + + PNL++G+IT+ES+ AF+NGI
Sbjct: 291 SDPNEFLILETNYRIYAYTSNPLQIAILNLFISFKSRFPNLVIGSITRESIKMAFKNGIR 350
Query: 76 TEQQ------NAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q ++H ++ P +P V DQIRLWE + NRV + Y++F S +E
Sbjct: 351 ADQIISYLEFHSHSQMKLLKPILPNTVVDQIRLWELEKNRVRDQEGYLYEDFKSVSDYEI 410
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+Y++ +LWE+ + + V+ + H +REF++
Sbjct: 411 VINYSKQIDIILWENPELKKFFVSLDGHTALREFIK 446
>gi|295662078|ref|XP_002791593.1| RNA polymerase II transcription factor B subunit 2
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226279719|gb|EEH35285.1| RNA polymerase II transcription factor B subunit 2
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 484
Score = 113 bits (282), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/176 (35%), Positives = 100/176 (56%), Gaps = 23/176 (13%)
Query: 13 PTIADN----CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYN 68
PT A++ GF+V+ETN+++YAY++S L ++ LF+ ++Y+ PNLI G IT++S+
Sbjct: 307 PTAANSSEPSTGFIVIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRR 366
Query: 69 AFENGITTEQ------QNAHPRV----ADK------IPSV-PKNVCDQIRLWESDLNRVE 111
A E GIT +Q +AHP++ A K P+V P V DQIRLW+ + +R++
Sbjct: 367 AVEMGITADQIISYLTTHAHPQMRKYHASKSSHTVGAPAVLPPTVVDQIRLWQLERDRIK 426
Query: 112 TTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
TP + +F S F+ C YA + L+W+ K+ V H + +LR +
Sbjct: 427 ATPGFLFKDFVSLAEFDGPCRYAEEIGVLVWKSEKRRMFFVTR--HEQVAAYLRSR 480
>gi|170120252|ref|XP_001891159.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164633568|gb|EDQ98188.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 475
Score = 113 bits (282), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 95/157 (60%), Gaps = 12/157 (7%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+V+ETN+++YAY+ + L IL LF ++Y+ PNL+VGAIT++S+ A NGI+ +Q
Sbjct: 312 GFIVLETNYRIYAYTDNPLQTAILNLFVSLKYRFPNLVVGAITRDSVKKALLNGISADQI 371
Query: 79 -----QNAHPRVADKI------PSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVF 127
+AHP++ + P +P V DQIRLWE + NR+++ + Y F S+ +
Sbjct: 372 ISYLMSHAHPQMRKNVRLRSCNPLLPVTVQDQIRLWELEKNRLKSQEGYLYTSFASQADY 431
Query: 128 EAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+ +YA++ +LWE+ K + + H ++R F+
Sbjct: 432 DFVLNYAKELGVVLWENPVKRCFFGSLDGHANIRGFI 468
>gi|242794214|ref|XP_002482325.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces stipitatus ATCC 10500]
gi|218718913|gb|EED18333.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces stipitatus ATCC 10500]
Length = 481
Score = 112 bits (281), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 57/158 (36%), Positives = 90/158 (56%), Gaps = 20/158 (12%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+ A E GIT +
Sbjct: 313 GSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITAD 372
Query: 78 Q------QNAHP--RVADKIPS-----------VPKNVCDQIRLWESDLNRVETTPAHYY 118
Q +AHP R D + S +P V DQIRLW+ + +R++ TP +
Sbjct: 373 QIISYLSTHAHPQMRKEDAVKSTSNTAGLPRSVLPPTVVDQIRLWQLERDRIKATPGFLF 432
Query: 119 DEFPSRDVFEAACDYARDQSGLLWE-DSKKMRLVVNAE 155
EF S +E C YA + L+W+ D K+M V E
Sbjct: 433 KEFASLAEYEGPCRYAEEIGVLVWKSDRKRMFFVTRYE 470
>gi|444727354|gb|ELW67853.1| Valyl-tRNA synthetase, mitochondrial [Tupaia chinensis]
Length = 1382
Score = 112 bits (281), Expect = 4e-23, Method: Composition-based stats.
Identities = 55/144 (38%), Positives = 89/144 (61%), Gaps = 6/144 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q
Sbjct: 263 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 322
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP + + P +P + DQIRLWE + +R+ T Y++F S+ FE +
Sbjct: 323 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 382
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIH 157
AR+ L++E+S K +VV H
Sbjct: 383 ARELGVLVFENSAKRLMVVTPAGH 406
>gi|76154655|gb|AAX26095.2| SJCHGC05044 protein [Schistosoma japonicum]
Length = 214
Score = 112 bits (281), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 88/154 (57%), Gaps = 6/154 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
+ + G++++ETNF++YAY+ S L +L LFSKI + PNL+V IT++S+ A GIT
Sbjct: 54 SSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGIT 113
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
Q NAHP + P +P + DQIRLWE + +R Y+ F FE
Sbjct: 114 ANQILSFLTSNAHPDMLLSNPILPPTLTDQIRLWELERDRFVFQEGCLYEHFSRNTDFEL 173
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
DYA++ LLWE+ ++ +VV+ H +R+F
Sbjct: 174 VRDYAKNIGVLLWENPERRLMVVSKSGHEDVRKF 207
>gi|353231086|emb|CCD77504.1| putative tfiih, polypeptide [Schistosoma mansoni]
Length = 486
Score = 112 bits (281), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 89/154 (57%), Gaps = 6/154 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
+ + G++++ETNF++YAY+ S L +L LFSKI + PNL+V IT++S+ A GIT
Sbjct: 326 SSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGIT 385
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
Q NAHP + + P +P + DQIRLWE + +R Y++F FE
Sbjct: 386 ANQILSFLTANAHPDMLLQNPILPPTLTDQIRLWELERDRFLFQEGCLYEQFSRNTDFEM 445
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
DYA+ LLWE+ ++ +VV+ H +R+F
Sbjct: 446 VRDYAKSIGVLLWENPERRLMVVSKAGHEDVRKF 479
>gi|326483225|gb|EGE07235.1| transcription factor Tfb2 [Trichophyton equinum CBS 127.97]
Length = 484
Score = 112 bits (281), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 94/165 (56%), Gaps = 18/165 (10%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF++VETN ++YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GIT +
Sbjct: 318 GTGFIIVETNHRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITAD 377
Query: 78 Q------QNAHPRVADKIPS--------VPKNVCDQIRLWESDLNRVETTPAHYYDEFPS 123
Q +AHP++ S +P V DQIRLW+ + +R++ T + EF +
Sbjct: 378 QIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDT 437
Query: 124 RDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 167
FEA C YA + L+W+ DS++M V H + FLR +
Sbjct: 438 FAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR---HEQVAAFLRSR 479
>gi|256086358|ref|XP_002579367.1| tfiih polypeptide [Schistosoma mansoni]
Length = 486
Score = 112 bits (281), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 89/154 (57%), Gaps = 6/154 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
+ + G++++ETNF++YAY+ S L +L LFSKI + PNL+V IT++S+ A GIT
Sbjct: 326 SSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGIT 385
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
Q NAHP + + P +P + DQIRLWE + +R Y++F FE
Sbjct: 386 ANQILSFLTANAHPDMLLQNPILPPTLTDQIRLWELERDRFLFQEGCLYEQFSRNTDFEM 445
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
DYA+ LLWE+ ++ +VV+ H +R+F
Sbjct: 446 VRDYAKSIGVLLWENPERRLMVVSKAGHEDVRKF 479
>gi|225555033|gb|EEH03326.1| RNA pol II transcription initiation subunit [Ajellomyces capsulatus
G186AR]
Length = 485
Score = 112 bits (280), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 101/177 (57%), Gaps = 23/177 (12%)
Query: 12 IPTIADN---CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYN 68
+P+ + N GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNLI G +T++S+
Sbjct: 307 VPSASSNEPSTGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRR 366
Query: 69 AFENGITTEQ------QNAHPRV----ADK-------IPSV-PKNVCDQIRLWESDLNRV 110
A E GIT +Q +AHP++ A K +P+V P V DQIRLW+ + +R+
Sbjct: 367 AVEMGITADQIISYLTTHAHPQMRKYHATKAGANPVGVPTVLPPTVVDQIRLWQLERDRI 426
Query: 111 ETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
+ TP + +F + FE C YA + L+W+ +K V H + +LR +
Sbjct: 427 KATPGFLFKDFVNLAEFEGPCRYAEEIGVLVWKSERKRMFFVTR--HEQVAAYLRSR 481
>gi|67541703|ref|XP_664619.1| hypothetical protein AN7015.2 [Aspergillus nidulans FGSC A4]
gi|40742471|gb|EAA61661.1| hypothetical protein AN7015.2 [Aspergillus nidulans FGSC A4]
gi|259483674|tpe|CBF79257.1| TPA: TFIIH and nucleotide excision repair factor 3 complexes
subunit (Tfb2), putative (AFU_orthologue; AFUA_4G04360)
[Aspergillus nidulans FGSC A4]
Length = 482
Score = 112 bits (280), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 60/171 (35%), Positives = 96/171 (56%), Gaps = 23/171 (13%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNL+ G +T++S+ A E GIT +
Sbjct: 315 GSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKVTRQSIRRAIEMGITAD 374
Query: 78 Q------QNAHPRV------------ADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYY 118
Q +AHP++ A PSV P V DQIRLW+ + +RV+ T +
Sbjct: 375 QIISYLATHAHPQMRKHNVARSTSNQAGMPPSVLPPTVVDQIRLWQLERDRVKATAGFLF 434
Query: 119 DEFPSRDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 168
+F S +EA C YA + L+W+ D K+M V H + FLR +
Sbjct: 435 KDFVSLAEYEAPCRYAEEIGVLIWKSDRKRMFFVTR---HEQVAAFLRSRK 482
>gi|225682227|gb|EEH20511.1| RNA polymerase II transcription factor B subunit 2
[Paracoccidioides brasiliensis Pb03]
gi|226289599|gb|EEH45083.1| TFIIH basal transcription factor complex p52 subunit
[Paracoccidioides brasiliensis Pb18]
Length = 484
Score = 112 bits (280), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 59/167 (35%), Positives = 96/167 (57%), Gaps = 19/167 (11%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
+ GF+V+ETN+++YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GIT +
Sbjct: 316 STGFIVIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITAD 375
Query: 78 Q------QNAHPRV----ADK------IPSV-PKNVCDQIRLWESDLNRVETTPAHYYDE 120
Q +AHP++ A K P+V P V DQIRLW+ + +R++ TP + +
Sbjct: 376 QIISYLTTHAHPQMRKYHASKSSHTVGAPAVLPPTVVDQIRLWQLERDRIKATPGFLFKD 435
Query: 121 FPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
F S F+ C YA + L+W+ K+ V H + +LR +
Sbjct: 436 FVSLAEFDGPCRYAEEIGVLVWKSEKRRMFFVTR--HEQVAAYLRSR 480
>gi|212535854|ref|XP_002148083.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
gi|210070482|gb|EEA24572.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
Length = 481
Score = 112 bits (280), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 91/158 (57%), Gaps = 20/158 (12%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+ A E GIT +
Sbjct: 313 GSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITAD 372
Query: 78 Q------QNAHPRVADK-----------IPS--VPKNVCDQIRLWESDLNRVETTPAHYY 118
Q +AHP++ + IP +P V DQIRLW+ + +R++ TP +
Sbjct: 373 QIISYLSTHAHPQMRKEGAVKSTSNTAGIPRSVLPPTVVDQIRLWQLERDRIKATPGFLF 432
Query: 119 DEFPSRDVFEAACDYARDQSGLLWE-DSKKMRLVVNAE 155
EF S +E C YA + L+W+ D K+M V E
Sbjct: 433 KEFASLAEYEGPCRYAEEIGVLVWKSDRKRMFFVTRYE 470
>gi|91094191|ref|XP_971121.1| PREDICTED: similar to TFIIH basal transcription factor complex p52
subunit [Tribolium castaneum]
gi|270010854|gb|EFA07302.1| hypothetical protein TcasGA2_TC015892 [Tribolium castaneum]
Length = 472
Score = 112 bits (279), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 98/159 (61%), Gaps = 10/159 (6%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G+++VETN+++YAY+ S L ++ LF+++ Y+ PNL+VG IT++S+ A + GIT +Q
Sbjct: 313 GYIIVETNYRVYAYTDSNLQVALIGLFTELMYRFPNLVVGVITRDSIRQALKGGITADQI 372
Query: 79 -----QNAHPRVAD---KIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
Q+AHP++ + K P +P V DQI+LWE + NR+ + Y +F S+ F
Sbjct: 373 IGYLKQHAHPQMLEGEAKHP-LPPTVVDQIKLWEIERNRLTYSEGVLYSQFLSQADFNIL 431
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+YA+ L+W + +K L++N H +++F + +K
Sbjct: 432 KEYAQSNGHLIWCNKEKRTLIINKSAHDDVKKFWKRYSK 470
>gi|357618296|gb|EHJ71332.1| putative TFIIH basal transcription factor complex p52 subunit
[Danaus plexippus]
Length = 603
Score = 112 bits (279), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 56/167 (33%), Positives = 93/167 (55%), Gaps = 13/167 (7%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A + G+++ ETN+++YAY+TS L +L LF+++ Y+ PN++VG +T+ES+ A GI+
Sbjct: 435 AASSGYIIAETNYRVYAYTTSALQVALLGLFTELVYRFPNVVVGVLTRESVRAALRGGIS 494
Query: 76 TEQ------QNAHPRVAD-------KIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFP 122
+Q Q++HP++ S+P V DQIRLWES+ NR T Y++F
Sbjct: 495 AQQIITYLEQHSHPQMLKSDQGGIRSSSSLPPTVLDQIRLWESERNRFTYTEGVVYNQFL 554
Query: 123 SRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
S+ F DY R L+W + +VV H ++ + + +K
Sbjct: 555 SQAEFNVLRDYGRSSGALVWAADRTRTMVVARAAHDDVKRYWKRYSK 601
>gi|156378083|ref|XP_001630974.1| predicted protein [Nematostella vectensis]
gi|156218005|gb|EDO38911.1| predicted protein [Nematostella vectensis]
Length = 471
Score = 112 bits (279), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 55/156 (35%), Positives = 91/156 (58%), Gaps = 6/156 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+++ETN+++YAY+ S L ++ LF +I + PNL V ++T+ES A +GI+ EQ
Sbjct: 314 GFIIIETNYRVYAYTESCLQVSLIGLFCEILCRFPNLCVASLTRESCQQALASGISAEQI 373
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP + + P +P + DQ+RLWE + +R++ T Y++F S+ FE Y
Sbjct: 374 LNFLQTRAHPEMLKRTPIIPSTISDQVRLWEMERSRMKFTEGVLYNQFLSQADFEMLRKY 433
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
A D L+W +S K +VV+ H ++ F + Q +
Sbjct: 434 AEDLGVLIWANSTKRVVVVSRSGHDDVKRFWKRQRQ 469
>gi|157113167|ref|XP_001651923.1| TFIIH basal transcription factor complex p52 subunit [Aedes
aegypti]
gi|108877858|gb|EAT42083.1| AAEL006356-PA [Aedes aegypti]
Length = 487
Score = 112 bits (279), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 95/163 (58%), Gaps = 14/163 (8%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G+++VETN+++YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ AF GIT EQ
Sbjct: 323 GYIIVETNYRVYAYTDSNLQVALLGLFTELLYRFPNLVVGVLTRDSVRQAFRGGITAEQI 382
Query: 79 -----QNAHPRVAD-------KIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDV 126
Q+AHP + + K P +P V DQI+LWE++ NR T Y++F S+
Sbjct: 383 ISYLEQHAHPTMLNMEQAINSKSP-LPPTVVDQIKLWENERNRFTYTEGVVYNQFLSQGD 441
Query: 127 FEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
F DYA+ + W++ + +VV H ++ F + +K
Sbjct: 442 FNTLRDYAQSIGVMTWQNERTRTMVVTKNGHDDVKRFWKRYSK 484
>gi|195477962|ref|XP_002086438.1| GE22864 [Drosophila yakuba]
gi|194186228|gb|EDW99839.1| GE22864 [Drosophila yakuba]
Length = 241
Score = 111 bits (278), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 93/164 (56%), Gaps = 12/164 (7%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
+CG++VVETN+++YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GIT E
Sbjct: 75 DCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAE 134
Query: 78 Q------QNAHPR---VADKIPS---VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRD 125
Q Q AHP V I S +P V DQI+LWE + NR T Y++F S
Sbjct: 135 QIVSYLEQYAHPNMRLVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSHT 194
Query: 126 VFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
F DYA+ L+W++ + +VV H ++ + + +K
Sbjct: 195 DFVTLRDYAQSIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 238
>gi|50310215|ref|XP_455127.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|62901296|sp|Q6CLR2.1|TFB2_KLULA RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|49644263|emb|CAG97834.1| KLLA0F01056p [Kluyveromyces lactis]
Length = 496
Score = 111 bits (278), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 92/170 (54%), Gaps = 24/170 (14%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G +++ETNFK+Y+YS S L IL LF ++ + N++ G +T+ES+ NA NGIT EQ
Sbjct: 322 GALIIETNFKLYSYSNSPLQIAILSLFVHLKSRFANMVTGQLTRESVRNALLNGITAEQI 381
Query: 79 -----QNAHPR------------------VADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHPR V + + +P V DQIRLW+ +L+R+ +
Sbjct: 382 IAYLETHAHPRMRRLAEENLSKKLELDPTVKETLQVLPPTVVDQIRLWQLELDRIISYDG 441
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+ Y +F S ++ DYA+D LLW++ KK V+ E + + +F R
Sbjct: 442 YLYTDFESYQEYQTVADYAKDIGVLLWQNEKKKMFFVSTEGNSQVLDFHR 491
>gi|440636738|gb|ELR06657.1| hypothetical protein GMDG_00274 [Geomyces destructans 20631-21]
Length = 470
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 90/157 (57%), Gaps = 8/157 (5%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF+V+ETN+++YAY++S L +L LF+K+ + PN++ G +T++S+ A E+GIT +
Sbjct: 314 TTGFIVIETNYRLYAYTSSPLQIAVLALFAKLSTRYPNMVSGRVTRDSVRGAIEHGITAD 373
Query: 78 Q------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAAC 131
Q +AHP++ K P +P V DQIRLW+ + R++ T + +F S +E
Sbjct: 374 QIISYLTTHAHPQLRRKSPVLPPTVVDQIRLWQIENERMKATAGFLFKDFASLAEYEGCV 433
Query: 132 DYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
YA + L+W K V H +R++++ +
Sbjct: 434 RYADEIGVLVWRSDAKRNFFVTK--HEQLRDYIKSRK 468
>gi|145248676|ref|XP_001400677.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus niger CBS 513.88]
gi|134081344|emb|CAK41847.1| unnamed protein product [Aspergillus niger]
Length = 478
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 23/171 (13%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GIT +
Sbjct: 311 GTGFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITAD 370
Query: 78 Q------QNAHPRV------------ADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYY 118
Q +AHP++ A PSV P V DQIRLW+ + +RV+ T +
Sbjct: 371 QIISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATSGFLF 430
Query: 119 DEFPSRDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 168
+F S +EA C YA + L W+ D K+M V H + FLR +
Sbjct: 431 KDFVSLAEYEAPCRYAEEIGVLAWKSDRKRMFFVTR---HEQVAAFLRSRK 478
>gi|115388809|ref|XP_001211910.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114195994|gb|EAU37694.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 460
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/170 (35%), Positives = 94/170 (55%), Gaps = 21/170 (12%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF++VETN+++YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GIT +
Sbjct: 293 GSGFIIVETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAIEMGITAD 352
Query: 78 Q------QNAHPRV------------ADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYY 118
Q +AHP++ A PSV P V DQIRLW+ + +RV+ T +
Sbjct: 353 QIISYLATHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATAGFLF 412
Query: 119 DEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
+F S +E C YA + L+W+ ++K V H + FLR +
Sbjct: 413 KDFASLAEYEDPCRYAEEIGVLVWKSNRKRMFFVTR--HEQVAAFLRSRK 460
>gi|212535856|ref|XP_002148084.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
gi|210070483|gb|EEA24573.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
Length = 422
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 91/158 (57%), Gaps = 20/158 (12%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+ A E GIT +
Sbjct: 254 GSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITAD 313
Query: 78 Q------QNAHPRVADK-----------IPS--VPKNVCDQIRLWESDLNRVETTPAHYY 118
Q +AHP++ + IP +P V DQIRLW+ + +R++ TP +
Sbjct: 314 QIISYLSTHAHPQMRKEGAVKSTSNTAGIPRSVLPPTVVDQIRLWQLERDRIKATPGFLF 373
Query: 119 DEFPSRDVFEAACDYARDQSGLLWE-DSKKMRLVVNAE 155
EF S +E C YA + L+W+ D K+M V E
Sbjct: 374 KEFASLAEYEGPCRYAEEIGVLVWKSDRKRMFFVTRYE 411
>gi|350639203|gb|EHA27557.1| transcription factor Tfb2 [Aspergillus niger ATCC 1015]
Length = 478
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 23/171 (13%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GIT +
Sbjct: 311 GTGFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITAD 370
Query: 78 Q------QNAHPRV------------ADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYY 118
Q +AHP++ A PSV P V DQIRLW+ + +RV+ T +
Sbjct: 371 QIISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATSGFLF 430
Query: 119 DEFPSRDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 168
+F S +EA C YA + L W+ D K+M V H + FLR +
Sbjct: 431 KDFVSLAEYEAPCRYAEEIGVLAWKSDRKRMFFVTR---HEQVAAFLRSRK 478
>gi|358370616|dbj|GAA87227.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus kawachii IFO 4308]
Length = 478
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 23/171 (13%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GIT +
Sbjct: 311 GTGFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITAD 370
Query: 78 Q------QNAHPRV------------ADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYY 118
Q +AHP++ A PSV P V DQIRLW+ + +RV+ T +
Sbjct: 371 QIISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATSGFLF 430
Query: 119 DEFPSRDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 168
+F S +EA C YA + L W+ D K+M V H + FLR +
Sbjct: 431 KDFVSLAEYEAPCRYAEEIGVLAWKSDRKRMFFVTR---HEQVAAFLRSRK 478
>gi|268534706|ref|XP_002632484.1| Hypothetical protein CBG13719 [Caenorhabditis briggsae]
Length = 482
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 94/155 (60%), Gaps = 9/155 (5%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G V+VETNF++YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GIT Q
Sbjct: 311 GKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFTDMSVGMITRESVRGALQHGITAAQI 370
Query: 79 -----QNAHPR-VADKIP--SVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
NAHP+ VA P +P V DQIRLWE + R++ A+ Y F S D +
Sbjct: 371 ISFLRANAHPQCVATSGPVNCLPITVADQIRLWEDERRRMDLKDAYIYSHFESDDEYHGV 430
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
YA+++ LLW + ++ ++VN E H +R++ +
Sbjct: 431 VRYAQERGILLWANPQQKLVIVNEEGHEAVRQWYK 465
>gi|31201525|ref|XP_309710.1| AGAP010984-PA [Anopheles gambiae str. PEST]
gi|21293295|gb|EAA05440.1| AGAP010984-PA [Anopheles gambiae str. PEST]
Length = 501
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 96/162 (59%), Gaps = 12/162 (7%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G+++VETN+++YAY+ S L +L LF+++ Y+ PNL+VG ++++S+ AF GIT EQ
Sbjct: 337 GYIIVETNYRVYAYTDSNLQVALLGLFTELLYRFPNLVVGVLSRDSVRQAFRGGITAEQI 396
Query: 79 -----QNAHP------RVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVF 127
Q+AHP + + S+P V DQI+LWE++ NR T Y++F S+ F
Sbjct: 397 ISYLEQHAHPTMLTVEQAINSKSSLPPTVVDQIKLWENERNRFTYTEGVVYNQFLSQADF 456
Query: 128 EAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
DYA+ ++W++ + +VV H +++F + +K
Sbjct: 457 ITLRDYAQSIGVMIWQNERIRTMVVTKNGHDDVKKFWKRYSK 498
>gi|315041383|ref|XP_003170068.1| TFIIH basal transcription factor complex p52 subunit [Arthroderma
gypseum CBS 118893]
gi|311345102|gb|EFR04305.1| TFIIH basal transcription factor complex p52 subunit [Arthroderma
gypseum CBS 118893]
Length = 484
Score = 110 bits (276), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 93/162 (57%), Gaps = 18/162 (11%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
F++VETN+++YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GIT +Q
Sbjct: 321 FIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQIV 380
Query: 79 ----QNAHPRVADKIPS--------VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDV 126
+AHP++ S +P V DQIRLW+ + +R++ T + EF +
Sbjct: 381 SYLTTHAHPQMRKTKKSTTNVSSTVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFAE 440
Query: 127 FEAACDYARDQSGLLW-EDSKKMRLVVNAEIHMHMREFLRGQ 167
FEA C YA + L+W DS++M V H + FLR +
Sbjct: 441 FEAPCKYAEEIGVLVWRSDSRRMFFVTR---HEQVAAFLRSR 479
>gi|194872905|ref|XP_001973103.1| GG13537 [Drosophila erecta]
gi|190654886|gb|EDV52129.1| GG13537 [Drosophila erecta]
Length = 499
Score = 110 bits (276), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 93/164 (56%), Gaps = 12/164 (7%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
+CG++VVETN+++YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GIT E
Sbjct: 333 DCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAE 392
Query: 78 Q------QNAHPR---VADKIPS---VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRD 125
Q Q AHP V I S +P V DQI+LWE + NR T Y++F S
Sbjct: 393 QIVSYLEQYAHPNMRMVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSHT 452
Query: 126 VFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
F DYA+ L+W++ + +VV H ++ + + +K
Sbjct: 453 DFVTLRDYAQSIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 496
>gi|195477975|ref|XP_002086439.1| GE22863 [Drosophila yakuba]
gi|194186229|gb|EDW99840.1| GE22863 [Drosophila yakuba]
Length = 499
Score = 110 bits (276), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 93/164 (56%), Gaps = 12/164 (7%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
+CG++VVETN+++YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GIT E
Sbjct: 333 DCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAE 392
Query: 78 Q------QNAHPR---VADKIPS---VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRD 125
Q Q AHP V I S +P V DQI+LWE + NR T Y++F S
Sbjct: 393 QIVSYLEQYAHPNMRLVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSHT 452
Query: 126 VFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
F DYA+ L+W++ + +VV H ++ + + +K
Sbjct: 453 DFVTLRDYAQSIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 496
>gi|255079788|ref|XP_002503474.1| predicted protein [Micromonas sp. RCC299]
gi|226518741|gb|ACO64732.1| predicted protein [Micromonas sp. RCC299]
Length = 458
Score = 110 bits (276), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 66/178 (37%), Positives = 98/178 (55%), Gaps = 24/178 (13%)
Query: 7 KESWFIPT-------IADN--------CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEY 51
KE +++PT +AD G ++ ETNF++YAY+ S + CEILRLF++ +Y
Sbjct: 281 KEGYYVPTQLTAGKDVADGDASLGGDPGGHIIAETNFRVYAYTFSDVECEILRLFTRPDY 340
Query: 52 QLPNLIVGAITKESLYNAFENGITTEQ------QNAHP--RVADKIPSVPKNVCDQIRLW 103
+LPNL VG +T+E+++ A + G+ EQ +AHP R +P NV DQI LW
Sbjct: 341 RLPNLYVGMLTREAVHEALDTGVAAEQIIKYIKSHAHPNARKTTNGSGIPPNVADQIMLW 400
Query: 104 ESDLNRVETTPAHYYDEFPS-RDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHM 160
+ RV + Y +FP+ D + AA A D LLWE+ ++M+L V H M
Sbjct: 401 AMERRRVRSAECVLYCDFPTGTDEYAAAVKAASDAGVLLWENREQMKLAVAKSGHERM 458
>gi|50557260|ref|XP_506038.1| YALI0F30173p [Yarrowia lipolytica]
gi|62901293|sp|Q6BZX4.1|TFB2_YARLI RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|49651908|emb|CAG78851.1| YALI0F30173p [Yarrowia lipolytica CLIB122]
Length = 467
Score = 110 bits (276), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 94/162 (58%), Gaps = 6/162 (3%)
Query: 14 TIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 73
++A + GF+++ETNF++YAY+ S L +L LF ++ + N++ G I ++S+ A NG
Sbjct: 299 SVAADSGFIILETNFRLYAYTESPLQIAVLNLFVNLKTRFANMVTGQINRDSVRFALSNG 358
Query: 74 ITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVF 127
IT EQ +AHPR+ +P V DQI+LW+ +++R+ T + + EF + D +
Sbjct: 359 ITAEQIITYLSVHAHPRMKGMEHVLPPTVVDQIKLWQLEMDRIRATDGYLFSEFKNFDEY 418
Query: 128 EAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+ YA++ LL+E+ K + V + EF++ +N+
Sbjct: 419 KDVSTYAKELGVLLYENPGKRKFVSTLAGSQQIVEFVKRRNQ 460
>gi|16198089|gb|AAL13841.1| LD30622p [Drosophila melanogaster]
Length = 350
Score = 110 bits (276), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 93/164 (56%), Gaps = 12/164 (7%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
+CG++VVETN+++YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GIT E
Sbjct: 184 DCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAE 243
Query: 78 Q------QNAHPR---VADKIPS---VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRD 125
Q Q AHP V I S +P V DQI+LWE + NR T Y++F S
Sbjct: 244 QIVSYLEQYAHPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHT 303
Query: 126 VFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
F DYA+ L+W++ + +VV H ++ + + +K
Sbjct: 304 DFVTLRDYAQSIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 347
>gi|198428509|ref|XP_002131018.1| PREDICTED: similar to General transcription factor II H,
polypeptide 4 [Ciona intestinalis]
Length = 463
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 95/164 (57%), Gaps = 6/164 (3%)
Query: 12 IPTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 71
+ + ++ GF++ ETNF++YAY+ S+L IL LF K+ Y+ PN+ V +T++S+ +A
Sbjct: 296 VTNVTNSRGFILAETNFRVYAYTDSELQYSILSLFCKMLYRFPNVCVLQLTRKSIQSAVV 355
Query: 72 NGITTEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRD 125
NGIT +Q NAHP + P + + DQ+RLW + +R+ Y++F ++
Sbjct: 356 NGITADQILHYIKANAHPDMLKNDPIIAPTLADQVRLWAMERDRLTYRDGVLYNQFLAQK 415
Query: 126 VFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
FE +YA++ L+WE+S K +VV + H ++ + + K
Sbjct: 416 DFEVLRNYAKELGALIWENSSKRYMVVTKDGHDQVKRYWKKYKK 459
>gi|195327811|ref|XP_002030611.1| GM24480 [Drosophila sechellia]
gi|194119554|gb|EDW41597.1| GM24480 [Drosophila sechellia]
Length = 499
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 93/164 (56%), Gaps = 12/164 (7%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
+CG++VVETN+++YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GIT E
Sbjct: 333 DCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAE 392
Query: 78 Q------QNAHPR---VADKIPS---VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRD 125
Q Q AHP V I S +P V DQI+LWE + NR T Y++F S
Sbjct: 393 QIVSYLEQYAHPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHT 452
Query: 126 VFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
F DYA+ L+W++ + +VV H ++ + + +K
Sbjct: 453 DFVTLRDYAQSIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 496
>gi|21357821|ref|NP_648780.1| marionette [Drosophila melanogaster]
gi|7294263|gb|AAF49614.1| marionette [Drosophila melanogaster]
gi|220942562|gb|ACL83824.1| Tfb2-PA [synthetic construct]
Length = 499
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 93/164 (56%), Gaps = 12/164 (7%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
+CG++VVETN+++YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GIT E
Sbjct: 333 DCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAE 392
Query: 78 Q------QNAHPR---VADKIPS---VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRD 125
Q Q AHP V I S +P V DQI+LWE + NR T Y++F S
Sbjct: 393 QIVSYLEQYAHPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHT 452
Query: 126 VFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
F DYA+ L+W++ + +VV H ++ + + +K
Sbjct: 453 DFVTLRDYAQSIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 496
>gi|195590491|ref|XP_002084979.1| GD12553 [Drosophila simulans]
gi|194196988|gb|EDX10564.1| GD12553 [Drosophila simulans]
Length = 500
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 93/164 (56%), Gaps = 12/164 (7%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
+CG++VVETN+++YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GIT E
Sbjct: 334 DCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAE 393
Query: 78 Q------QNAHPR---VADKIPS---VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRD 125
Q Q AHP V I S +P V DQI+LWE + NR T Y++F S
Sbjct: 394 QIVSYLEQYAHPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHT 453
Query: 126 VFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
F DYA+ L+W++ + +VV H ++ + + +K
Sbjct: 454 DFVTLRDYAQSIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 497
>gi|330797174|ref|XP_003286637.1| hypothetical protein DICPUDRAFT_31198 [Dictyostelium purpureum]
gi|325083385|gb|EGC36839.1| hypothetical protein DICPUDRAFT_31198 [Dictyostelium purpureum]
Length = 471
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/153 (37%), Positives = 91/153 (59%), Gaps = 6/153 (3%)
Query: 19 CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ 78
G++V+ETN+++YAY+ S L +L LF K+ Y+LPNL VG IT+ES+ A +GIT +Q
Sbjct: 315 SGYIVLETNYRLYAYTASSLQISLLSLFVKMLYRLPNLAVGIITRESIRTALIHGITADQ 374
Query: 79 ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACD 132
NAHP + +P V +QI LWE + NR+ T + Y+ FP+ D + A
Sbjct: 375 IIDFIRHNAHPNAVNNGQPIPDVVAEQILLWEQERNRITYTKSVLYNSFPTTDCYHATLK 434
Query: 133 YARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+A++Q +W + + LVVN + +R F++
Sbjct: 435 FAKEQDYYIWSNDQLKTLVVNENGNDPIRNFIK 467
>gi|406862938|gb|EKD15987.1| tfiih and nucleotide excision repair factor 3 complexes subunit
[Marssonina brunnea f. sp. 'multigermtubi' MB_m1]
Length = 421
Score = 110 bits (274), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 53/160 (33%), Positives = 95/160 (59%), Gaps = 15/160 (9%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+++ETN+++YAY+ S L +L LF+K+ + PN++ G +T++S+ NA +GIT++Q
Sbjct: 258 GFIIIETNYRLYAYTNSPLQIAVLALFTKMGVRYPNMVTGRVTRKSVANAISHGITSDQI 317
Query: 79 -----QNAHPRVADKI-----PSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
+AHP++ P +P V DQIRLW+ + R++ T + EF ++ +E
Sbjct: 318 ISYLRAHAHPQLVKAAAVNGNPVLPPTVVDQIRLWQIENERMKATAGFLFKEFETQKEYE 377
Query: 129 AACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 167
YA + L+W+ D+K+M V H +R++L+ +
Sbjct: 378 GCARYADENGVLVWKNDAKRMFFVTR---HEQLRDYLKSR 414
>gi|425777809|gb|EKV15965.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Penicillium digitatum PHI26]
gi|425782577|gb|EKV20476.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Penicillium digitatum Pd1]
Length = 382
Score = 109 bits (273), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 57/165 (34%), Positives = 94/165 (56%), Gaps = 21/165 (12%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
F+++ETN+++YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+ A E GIT +Q
Sbjct: 216 FIIIETNYRLYAYTSSPLQISLISLFTNLKYRFPNLVTGKITRQSVRRAIEMGITADQII 275
Query: 79 ----QNAHPRV---------ADKIPS--VPKNVCDQIRLWESDLNRVETTPAHYYDEFPS 123
+AHP++ IP+ +P V DQIRLW+ + +R+ T + +F S
Sbjct: 276 SYLLSHAHPQLRKHSAAQSNGKGIPASVLPPTVTDQIRLWQLERDRLRATAGFLFKDFTS 335
Query: 124 RDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 167
++A C YA + L+W+ D K+M V H + FLR Q
Sbjct: 336 LAEYQAPCQYAAEIGVLVWKSDRKRMFFVTR---HEQVAAFLRSQ 377
>gi|321470926|gb|EFX81900.1| hypothetical protein DAPPUDRAFT_210868 [Daphnia pulex]
Length = 463
Score = 109 bits (273), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 54/154 (35%), Positives = 90/154 (58%), Gaps = 6/154 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
+ G++VVETN+++YAY+ S L ++ LF ++ Y+ PNL+VG IT+ES+ A + GIT
Sbjct: 300 SQTTGYIVVETNYRVYAYTDSSLQVALIALFCELIYRFPNLVVGIITRESVREALKRGIT 359
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q Q+AH + P +P + DQI+LW + +R Y++F S+ FE
Sbjct: 360 ADQIVSFLRQHAHSECYKEPPVLPSTIADQIKLWAIERDRFLYKDGVLYNQFDSQADFEI 419
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
+YA+++ L+W ++VV E H +R+F
Sbjct: 420 LRNYAQERGLLVWHTLNGRKMVVTKEGHDEVRKF 453
>gi|427785675|gb|JAA58289.1| Putative rna polymer [Rhipicephalus pulchellus]
Length = 457
Score = 109 bits (273), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 52/151 (34%), Positives = 88/151 (58%), Gaps = 6/151 (3%)
Query: 19 CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ 78
G+++VETN+++YAY+ S+L +L LF ++ Y+ PNL+V +T+ES+ A +GIT+ Q
Sbjct: 299 AGYIMVETNYRVYAYTNSQLQVALLALFCELLYRFPNLVVARLTRESVRQALRSGITSNQ 358
Query: 79 ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACD 132
AHP + P +P + DQ+RLWE + +R Y +F S+ F+ +
Sbjct: 359 IIKFLRMYAHPEALKQTPVIPATIMDQLRLWEMERDRFVFREGVLYSQFISQSDFQLLRN 418
Query: 133 YARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
YA + L+W++ K +VVN H +++F
Sbjct: 419 YASELGVLIWDNPSKRVMVVNRNGHDEVKKF 449
>gi|195162678|ref|XP_002022181.1| GL25529 [Drosophila persimilis]
gi|198464237|ref|XP_001353141.2| GA20571 [Drosophila pseudoobscura pseudoobscura]
gi|194104142|gb|EDW26185.1| GL25529 [Drosophila persimilis]
gi|198149631|gb|EAL30642.2| GA20571 [Drosophila pseudoobscura pseudoobscura]
Length = 494
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 63/182 (34%), Positives = 102/182 (56%), Gaps = 17/182 (9%)
Query: 5 SRKESWFIPTIAD-----NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVG 59
+ KE+ T++D + G++VVETN+++YAY+ S L +L LF+++ Y+ PNL+VG
Sbjct: 310 TSKEAATTMTVSDEEAMQDSGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVG 369
Query: 60 AITKESLYNAFENGITTE------QQNAHPR---VADKIPS---VPKNVCDQIRLWESDL 107
+T++S+ A GIT E QQ AHP V I S +P V DQI+LWE +
Sbjct: 370 VLTRDSVRQALRGGITAEQIVSYLQQYAHPNMKLVESAIQSKSCLPPTVVDQIKLWEMER 429
Query: 108 NRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
NR T Y++F S++ F DYA+ L+W++ + +VV H ++ + +
Sbjct: 430 NRFTYTEGVVYNQFLSQNDFVTLRDYAQSIHVLVWQNERTRTMVVQKHGHDDVKRYWKKY 489
Query: 168 NK 169
+K
Sbjct: 490 SK 491
>gi|443688956|gb|ELT91478.1| hypothetical protein CAPTEDRAFT_18660 [Capitella teleta]
Length = 463
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 56/154 (36%), Positives = 86/154 (55%), Gaps = 6/154 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+VVETN+++YAY+ S L ++ LF + Y+ PN+ VG ++++S+ A GIT
Sbjct: 303 ASEDGFIVVETNYRVYAYTNSSLKVALISLFCSMMYRFPNMAVGVVSRDSVREALSRGIT 362
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
EQ +AHP + P +P V DQIRLWE + +R Y++F S+ FE
Sbjct: 363 AEQIINFLRNHAHPHTQKRKPILPPTVSDQIRLWELERDRFHFNEGVLYNQFLSQTDFEV 422
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
DYA+D L +E+ + +VV H ++ F
Sbjct: 423 LRDYAKDLGVLSYENIPRRLMVVTRAGHDDVKRF 456
>gi|195378984|ref|XP_002048261.1| GJ11446 [Drosophila virilis]
gi|194155419|gb|EDW70603.1| GJ11446 [Drosophila virilis]
Length = 492
Score = 109 bits (272), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 95/162 (58%), Gaps = 12/162 (7%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G++VVETN+++YAY+ S+L +L LF+++ Y+ PNL+VG +T++S+ A GIT EQ
Sbjct: 328 GYIVVETNYRVYAYTDSQLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 387
Query: 79 -----QNAHPR---VADKIPS---VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVF 127
Q AHP V I S +P + DQI+LWE + NR T Y++F S++ F
Sbjct: 388 VSYLEQYAHPNMKLVESAIQSKSCLPPTIVDQIKLWEMERNRFTYTEGVVYNQFLSQNDF 447
Query: 128 EAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
DYA+ + L+W++ + +VV H ++ + + +K
Sbjct: 448 VTLRDYAQSINMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 489
>gi|431907067|gb|ELK11185.1| General transcription factor IIH subunit 4 [Pteropus alecto]
Length = 447
Score = 109 bits (272), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 90/153 (58%), Gaps = 10/153 (6%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT
Sbjct: 302 AHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 361
Query: 76 TEQQNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYAR 135
+Q P +P + DQIRLWE + +R+ T Y++F S+ FE +AR
Sbjct: 362 AQQ----------TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHAR 411
Query: 136 DQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
+ L++E+S K +VV H ++ F + Q
Sbjct: 412 ELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 444
>gi|195019521|ref|XP_001984999.1| GH16811 [Drosophila grimshawi]
gi|193898481|gb|EDV97347.1| GH16811 [Drosophila grimshawi]
Length = 493
Score = 109 bits (272), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 94/162 (58%), Gaps = 12/162 (7%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G++VVETN+++YAY+ S+L +L LF+++ Y+ PNL+VG +T++S+ A GIT EQ
Sbjct: 329 GYIVVETNYRVYAYTDSQLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 388
Query: 79 -----QNAHPR---VADKIPS---VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVF 127
Q AHP V I S +P V DQI+LWE + NR T Y++F S+ F
Sbjct: 389 VSYLEQYAHPNMKLVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSQTDF 448
Query: 128 EAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
DYA+ + L+W++ + +VV H ++ + + +K
Sbjct: 449 VTLRDYAQSINMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 490
>gi|255936123|ref|XP_002559088.1| Pc13g06540 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211583708|emb|CAP91723.1| Pc13g06540 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 483
Score = 108 bits (271), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 94/165 (56%), Gaps = 21/165 (12%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
F+++ETN+++YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+ A E GIT +Q
Sbjct: 317 FIIIETNYRLYAYTSSPLQISLISLFTNLKYRFPNLVTGKITRQSVRRAIEMGITADQII 376
Query: 79 ----QNAHPRV---------ADKIPS--VPKNVCDQIRLWESDLNRVETTPAHYYDEFPS 123
+AHP++ +P+ +P V DQIRLW+ + +R+ T + +F S
Sbjct: 377 SYLLSHAHPQLRKHSAAQPNGKGVPASVLPPTVTDQIRLWQLERDRLRATAGFLFKDFTS 436
Query: 124 RDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 167
++A C YA + L+W+ D K+M V H + FLR Q
Sbjct: 437 LAEYQAPCQYAAEIGVLVWKSDRKRMFFVTR---HEQVAAFLRSQ 478
>gi|396459783|ref|XP_003834504.1| similar to RNA polymerase II transcription factor B subunit 2
[Leptosphaeria maculans JN3]
gi|312211053|emb|CBX91139.1| similar to RNA polymerase II transcription factor B subunit 2
[Leptosphaeria maculans JN3]
Length = 483
Score = 108 bits (271), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 77/131 (58%), Gaps = 6/131 (4%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
N GF+++ETN+++YAY+ S + IL LF+K++++ PNL+ G +TKES++ A GIT+
Sbjct: 321 NKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVSAGITSA 380
Query: 78 Q------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAAC 131
Q AHP++ + +P V DQIRLWE + RVE TP +F S +
Sbjct: 381 QIISYLSTYAHPQMQKNVSFIPPTVMDQIRLWEYEGERVEVTPGFLMKDFGSDAEYRDVL 440
Query: 132 DYARDQSGLLW 142
YA L+W
Sbjct: 441 GYADALGVLVW 451
>gi|406604717|emb|CCH43852.1| RNA polymerase II transcription factor B subunit 2 [Wickerhamomyces
ciferrii]
Length = 488
Score = 108 bits (271), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 95/179 (53%), Gaps = 25/179 (13%)
Query: 2 AIQSRKESWFIPTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAI 61
I++ K+S + PT +N G +++ETNFK+YAY+ S L IL LF ++ + NL+ G I
Sbjct: 293 VIENAKDSAYTPTNIENSGTIIIETNFKVYAYTNSPLQIAILNLFVHLKARFSNLVTGQI 352
Query: 62 TKESLYNAFENGITTEQ------QNAHPRV-------------------ADKIPSVPKNV 96
T+ES+ A +GIT+EQ +AHP++ +KI + +
Sbjct: 353 TRESIRKALVSGITSEQIISYLESHAHPQLRRAAEEELNKKNGFESNNHGEKIQILQPTI 412
Query: 97 CDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAE 155
DQI+LW+ +L+R+ + + + +F S ++ Y+ + LLW DS K + V E
Sbjct: 413 ADQIKLWQLELDRIMSFDGYLFTDFSSDQEYQVLSKYSEEIGVLLWNDSSKKKFFVTKE 471
>gi|119486893|ref|XP_001262366.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Neosartorya fischeri NRRL 181]
gi|119410523|gb|EAW20469.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Neosartorya fischeri NRRL 181]
Length = 468
Score = 108 bits (271), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 95/172 (55%), Gaps = 23/172 (13%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GIT +
Sbjct: 300 GSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITAD 359
Query: 78 Q------QNAHPRV-------------ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYY 118
Q +AHP++ + +P V DQIRLW+ + +RV+ T +
Sbjct: 360 QIISYLSTHAHPQMRKHNVSRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVKATHGFLF 419
Query: 119 DEFPSRDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQNK 169
+F + +EA C YA + L+W+ D K+M V H + FL+ + K
Sbjct: 420 RDFNTLAEYEAPCRYAEEIGVLVWKSDRKRMFFVTR---HHQVAAFLKSRRK 468
>gi|383852611|ref|XP_003701820.1| PREDICTED: general transcription factor IIH subunit 4-like
[Megachile rotundata]
Length = 467
Score = 108 bits (271), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 56/160 (35%), Positives = 94/160 (58%), Gaps = 7/160 (4%)
Query: 17 DNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
D G++VVETN+++YAY+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GIT
Sbjct: 305 DKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITA 364
Query: 77 E------QQNAHPRVADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
QQ+AH ++ + P V P + DQI+LWE++ NR + Y +F S+ FE
Sbjct: 365 SQIVGYLQQHAHSKMIEAGPPVLPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEV 424
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
D+A L+W+ +K +VV H +++F + +K
Sbjct: 425 LRDHALSTGVLIWQSERKRTMVVTKAGHDDVKKFWKRYSK 464
>gi|70982185|ref|XP_746621.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2) [Aspergillus fumigatus Af293]
gi|66844244|gb|EAL84583.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus fumigatus Af293]
gi|159122143|gb|EDP47265.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus fumigatus A1163]
Length = 479
Score = 108 bits (271), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 95/172 (55%), Gaps = 23/172 (13%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GIT +
Sbjct: 311 GSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITAD 370
Query: 78 Q------QNAHPRV-------------ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYY 118
Q +AHP++ + +P V DQIRLW+ + +RV+ T +
Sbjct: 371 QIISYLSTHAHPQMRKHNVSRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVKATHGFLF 430
Query: 119 DEFPSRDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQNK 169
+F + +EA C YA + L+W+ D K+M V H + FL+ + K
Sbjct: 431 RDFNTLAEYEAPCRYAEEIGVLVWKSDRKRMFFVTR---HHQVAAFLKSRRK 479
>gi|156057729|ref|XP_001594788.1| hypothetical protein SS1G_04596 [Sclerotinia sclerotiorum 1980]
gi|154702381|gb|EDO02120.1| hypothetical protein SS1G_04596 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 420
Score = 108 bits (270), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 52/165 (31%), Positives = 96/165 (58%), Gaps = 16/165 (9%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A+ GF+++ETN+++YAY+ S L +L LF+K+ + PN++ G ++++S+ A +GIT
Sbjct: 256 ANQKGFIIIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVSRDSIRTAISHGIT 315
Query: 76 TEQ------QNAHPRV------ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPS 123
++Q +AHP++ A P +P V DQIRLW+ + R++ P + EF
Sbjct: 316 SDQIITYLSTHAHPQLVKASTAAHGGPVLPPTVVDQIRLWQLENERMKAVPGFLFKEFDG 375
Query: 124 RDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 167
+ +E YA + L+W+ D+K+M V E +R++++ +
Sbjct: 376 QKEYEGCAKYAEEVGVLVWKNDAKRMFFVTRVE---QLRDYIKAR 417
>gi|320167307|gb|EFW44206.1| transcription factor tfb2 [Capsaspora owczarzaki ATCC 30864]
Length = 406
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 54/154 (35%), Positives = 88/154 (57%), Gaps = 8/154 (5%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
AD G++VVETNF++YAY+ S L +L LF + Y+ PN++ G ++++S+ A G+T
Sbjct: 222 ADVKGYIVVETNFRIYAYTDSPLQLALLSLFVDLRYRFPNMVCGLLSRDSVRKALVKGLT 281
Query: 76 TEQQ------NAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
EQ +AHP++ + P +P+ + DQ+RLWE + NR+ PA Y+ F ++ +
Sbjct: 282 AEQMIRFLRTHAHPQMRSRTPVLPETISDQLRLWELERNRLRVLPAVLYERFSNQREHDL 341
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
Y RD L S+ +VV E H ++ F
Sbjct: 342 LHHYGRDLGVELAHSSQH--IVVTFEGHEQIKTF 373
>gi|66800623|ref|XP_629237.1| general transcription factor IIH, polypeptide 4 [Dictyostelium
discoideum AX4]
gi|74850716|sp|Q54C29.1|TF2H4_DICDI RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=TFIIH basal transcription factor complex subunit 4
gi|60462610|gb|EAL60813.1| general transcription factor IIH, polypeptide 4 [Dictyostelium
discoideum AX4]
Length = 483
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 92/152 (60%), Gaps = 6/152 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G++V+ETN+++YAY++S L +L LF K+ Y+LPNL VG IT+ES+ A +GIT +Q
Sbjct: 328 GYIVLETNYRLYAYTSSSLQISLLSLFVKMLYRLPNLAVGIITRESIRTALIHGITADQI 387
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
N+HP A+ +P V +QI LWE++ NR+ T + Y+ FP+ D + A +
Sbjct: 388 IDFVRHNSHPNAANSGQPIPDVVAEQILLWEAERNRITYTKSVLYNSFPTNDCYIATLKF 447
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
A++Q +W LVV E + +R F++
Sbjct: 448 AKEQDYYIWSHDPLKTLVVKEEGNDPIRNFIK 479
>gi|194749905|ref|XP_001957376.1| GF10392 [Drosophila ananassae]
gi|190624658|gb|EDV40182.1| GF10392 [Drosophila ananassae]
Length = 497
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 93/162 (57%), Gaps = 12/162 (7%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G++VVETN+++YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GIT EQ
Sbjct: 333 GYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 392
Query: 79 -----QNAHPR---VADKIPS---VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVF 127
Q AHP V I S +P V DQI+LWE + NR T Y++F S++ F
Sbjct: 393 VSYLEQYAHPNMRLVESAIQSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSQNDF 452
Query: 128 EAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
DYA+ L+W++ + +VV H ++ + + +K
Sbjct: 453 VTLRDYAQSIHVLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 494
>gi|195441209|ref|XP_002068409.1| GK20438 [Drosophila willistoni]
gi|194164494|gb|EDW79395.1| GK20438 [Drosophila willistoni]
Length = 512
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/162 (37%), Positives = 91/162 (56%), Gaps = 12/162 (7%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G++VVETN+++YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GIT EQ
Sbjct: 348 GYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 407
Query: 79 -----QNAHPR---VADKIPS---VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVF 127
Q AHP V I S +P V DQI+LWE + NR T Y++F S F
Sbjct: 408 ISYLEQYAHPNMKLVESAINSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSTTDF 467
Query: 128 EAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
DYA+ L+W++ K +VV H ++ + + +K
Sbjct: 468 VTLRDYAQSIQVLVWQNEKTRTMVVQKNGHDDVKRYWKKYSK 509
>gi|171681598|ref|XP_001905742.1| hypothetical protein [Podospora anserina S mat+]
gi|170940758|emb|CAP66407.1| unnamed protein product [Podospora anserina S mat+]
Length = 540
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 93/165 (56%), Gaps = 13/165 (7%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
AD G +VVETN+++YAY++S L IL+LF ++ + PN++ +T+ES+ A + GIT
Sbjct: 376 ADENGGIVVETNYRIYAYTSSPLQIAILKLFCRLHMRFPNMVTARLTRESVQEAIKEGIT 435
Query: 76 TEQ------QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSR 124
Q +AHP++ A +P V DQIRLW+ + R++ TP + +F S
Sbjct: 436 ANQIIDYLVAHAHPQMRRAAAARGTTVIPPTVMDQIRLWQLESQRMQKTPGFQFKDFESV 495
Query: 125 DVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+ + +YA + L+W+D +K V+ +REFL+ + K
Sbjct: 496 EEYRQLAEYATEIGVLVWKDDRKGTFFVSKV--EQIREFLKARKK 538
>gi|350413829|ref|XP_003490125.1| PREDICTED: general transcription factor IIH subunit 4-like [Bombus
impatiens]
Length = 467
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 55/157 (35%), Positives = 94/157 (59%), Gaps = 7/157 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE-- 77
G++VVETN+++YAY+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GIT
Sbjct: 308 GYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITAAQI 367
Query: 78 ----QQNAHPRVADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACD 132
QQ+AH ++ + P V P + DQI+LWE++ NR + Y +F S+ FE D
Sbjct: 368 VGYLQQHAHGKMIEAGPPVLPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEVLRD 427
Query: 133 YARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+A L+W++ +K +VV H +++F + +K
Sbjct: 428 HALSTGVLIWQNERKRTIVVTKAGHDDVKKFWKRYSK 464
>gi|340710070|ref|XP_003393621.1| PREDICTED: general transcription factor IIH subunit 4-like [Bombus
terrestris]
Length = 467
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 55/157 (35%), Positives = 93/157 (59%), Gaps = 7/157 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE-- 77
G++VVETN+++YAY+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GIT
Sbjct: 308 GYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITAAQI 367
Query: 78 ----QQNAHPRVADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACD 132
QQ+AH ++ + P V P + DQI+LWE++ NR Y +F S+ FE D
Sbjct: 368 VGYLQQHAHGKMIEAGPPVLPPTIVDQIKLWENERNRFIFNEGVLYSQFLSQTDFEVLRD 427
Query: 133 YARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+A L+W++ +K +VV H +++F + +K
Sbjct: 428 HALSTGVLIWQNERKRTIVVTKAGHDDVKKFWKRYSK 464
>gi|307177222|gb|EFN66420.1| General transcription factor IIH subunit 4 [Camponotus floridanus]
Length = 467
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 94/161 (58%), Gaps = 7/161 (4%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
D G++VVETN+++YAY+ S L +L LF ++ Y+ PN++V +T++S+ A ++GIT
Sbjct: 304 TDKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNVVVSILTRDSIRQALKSGIT 363
Query: 76 TE------QQNAHPRVADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
QQ+ H ++ ++ P + P + DQI+LWE++ NR + Y +F S+ FE
Sbjct: 364 ASQIVGYLQQHVHSKMIERGPPILPPTIVDQIKLWENERNRFLFSEGVLYSQFLSQTDFE 423
Query: 129 AACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
D+A L+W+ +K +VV H +++F + +K
Sbjct: 424 VLRDHALSTGVLIWQSERKRTMVVTKTGHDDVKKFWKRYSK 464
>gi|380014187|ref|XP_003691121.1| PREDICTED: general transcription factor IIH subunit 4-like [Apis
florea]
Length = 467
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 93/157 (59%), Gaps = 7/157 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE-- 77
G++VVETN+++YAY+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GIT
Sbjct: 308 GYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITATQI 367
Query: 78 ----QQNAHPRVADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACD 132
QQ+AH ++ + P + P + DQI+LWE++ NR + Y +F S+ FE D
Sbjct: 368 VGYLQQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEVLRD 427
Query: 133 YARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+A L+W+ +K +VV H +++F + +K
Sbjct: 428 HALSTGVLIWQSERKRTMVVTKAGHDDVKKFWKRYSK 464
>gi|48102983|ref|XP_395476.1| PREDICTED: general transcription factor IIH subunit 4 [Apis
mellifera]
Length = 467
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 93/157 (59%), Gaps = 7/157 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE-- 77
G++VVETN+++YAY+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GIT
Sbjct: 308 GYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITATQI 367
Query: 78 ----QQNAHPRVADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACD 132
QQ+AH ++ + P + P + DQI+LWE++ NR + Y +F S+ FE D
Sbjct: 368 VGYLQQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEVLRD 427
Query: 133 YARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+A L+W+ +K +VV H +++F + +K
Sbjct: 428 HALSTGVLIWQSERKRTMVVTKAGHDDVKKFWKRYSK 464
>gi|307212060|gb|EFN87943.1| General transcription factor IIH subunit 4 [Harpegnathos saltator]
Length = 467
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 55/161 (34%), Positives = 93/161 (57%), Gaps = 7/161 (4%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
D G++VVETN+++YAY+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GIT
Sbjct: 304 TDKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMIYRFPNLVVSILTRDSVRQALKSGIT 363
Query: 76 TE------QQNAHPRVADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
QQ+AH + + P + P + DQI LWE++ NR + Y +F S+ FE
Sbjct: 364 ATQIVGYLQQHAHSKTIEAGPPILPPTIVDQIMLWENERNRFLFSEGVLYSQFLSQTDFE 423
Query: 129 AACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
D+A L+W++ +K +VV H +++F + +K
Sbjct: 424 VLRDHALTTGVLIWQNERKRTIVVTKAGHDDIKKFWKRYSK 464
>gi|405952750|gb|EKC20525.1| General transcription factor IIH subunit 4 [Crassostrea gigas]
Length = 380
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 90/156 (57%), Gaps = 10/156 (6%)
Query: 14 TIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 73
++ N G+++VETN+++YAY+ S L ++ LF ++ Y+ P VG +T+ S+ +A G
Sbjct: 232 SVGKNTGYLMVETNYRVYAYTDSPLQVALVALFCEMLYRFPTFSVGNLTRVSVRDALIRG 291
Query: 74 ITTEQQNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
IT AD+ P +P V DQ+RLWE + +R + YD+F S++ FE DY
Sbjct: 292 IT----------ADQTPVIPSTVTDQVRLWELERDRFKFMEGVLYDQFLSQNDFELLRDY 341
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
A+D LLW+++ K +VV H ++ + + Q +
Sbjct: 342 AKDLGVLLWDNAIKRVMVVTKGGHDDVKRYWKRQKQ 377
>gi|50291141|ref|XP_448003.1| hypothetical protein [Candida glabrata CBS 138]
gi|62901305|sp|Q6FP41.1|TFB2_CANGA RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|49527314|emb|CAG60954.1| unnamed protein product [Candida glabrata]
Length = 504
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 58/174 (33%), Positives = 92/174 (52%), Gaps = 24/174 (13%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G +++ETNFK+Y+Y S L IL LF ++ + N++ G IT+ES+ A NGIT +Q
Sbjct: 327 GALIIETNFKLYSYCNSPLQIAILSLFVHLKSRFANMVAGQITRESIRRALINGITADQV 386
Query: 79 -----QNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP++ D + +P V DQI+LW+ +L+RV T
Sbjct: 387 IAYLESHAHPQMRRLAEEKLQKKLELDPNCKDPLQVLPPTVVDQIKLWQLELDRVLTYEG 446
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
Y +F + F C YA+D LLW+D +K +L V+ E + + E+ + + K
Sbjct: 447 SLYIDFDTAQDFNMLCKYAQDIGALLWKDDRKRKLFVSREGNAQVLEYAKRKIK 500
>gi|298705308|emb|CBJ48998.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 534
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 55/133 (41%), Positives = 77/133 (57%), Gaps = 7/133 (5%)
Query: 17 DNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
D ++VETNF++ AY+ SKLH +L LF ++ LPN IVGAIT+ES+ A GI
Sbjct: 345 DTSIHIIVETNFQVIAYTRSKLHFAMLSLFLELRALLPNAIVGAITRESMRKALSTGIKG 404
Query: 77 EQQ------NAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYD-EFPSRDVFEA 129
Q +AHP V + P VP+N+ DQ+ LWE + +R+E D + SRD F
Sbjct: 405 RQVLDFLKWHAHPVVRRRTPVVPENIADQVLLWERERDRMEHRDGVLVDVSYASRDAFRG 464
Query: 130 ACDYARDQSGLLW 142
++A + GLLW
Sbjct: 465 MTEFANAKQGLLW 477
>gi|339521903|gb|AEJ84116.1| general transcription factor IIH subunit 4 [Capra hircus]
Length = 463
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 91/159 (57%), Gaps = 6/159 (3%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+VVETN+++YAY+ +L + LFS++ Y+ PN++V +T+ S+ A +GIT
Sbjct: 302 AHQPGFIVVETNYRLYAYTEPELQIARIALFSEMLYRFPNMVVPQVTRGSVQQAIASGIT 361
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q A P + + P +P + DQIRLWE + +R+ T Y++F S+ FE
Sbjct: 362 AQQIVHFLRTRADPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
+AR+ L++E+S K +VV H ++ F + Q
Sbjct: 422 LLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 460
>gi|190347675|gb|EDK39997.2| hypothetical protein PGUG_04095 [Meyerozyma guilliermondii ATCC
6260]
Length = 498
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 90/181 (49%), Gaps = 37/181 (20%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G +++ETNFK+YAY+ S L IL LF + + N++ G IT+ES+ NA NGIT++Q
Sbjct: 311 GSIILETNFKLYAYTNSPLEIAILNLFVNLRTRFSNMVSGQITRESIRNALYNGITSDQI 370
Query: 79 -----QNAHPRV-------------------------------ADKIPSVPKNVCDQIRL 102
+AHP++ K+ +P V DQI+L
Sbjct: 371 INFLETHAHPQMRALAKERLDKKVEFDASNNINTAGGGQNSSAQHKLEILPPTVVDQIKL 430
Query: 103 WESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMRE 162
W+ +L+R++ + + EF S+ F+ C+YA D L+W D KMR V + + +
Sbjct: 431 WQLELDRIQAFEGYLFKEFASQQEFDVLCNYASDVGVLIWSDRTKMRFFVTKDGISQVAD 490
Query: 163 F 163
F
Sbjct: 491 F 491
>gi|146414792|ref|XP_001483366.1| hypothetical protein PGUG_04095 [Meyerozyma guilliermondii ATCC
6260]
Length = 498
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 90/181 (49%), Gaps = 37/181 (20%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G +++ETNFK+YAY+ S L IL LF + + N++ G IT+ES+ NA NGIT++Q
Sbjct: 311 GSIILETNFKLYAYTNSPLEIAILNLFVNLRTRFSNMVSGQITRESIRNALYNGITSDQI 370
Query: 79 -----QNAHPRV-------------------------------ADKIPSVPKNVCDQIRL 102
+AHP++ K+ +P V DQI+L
Sbjct: 371 INFLETHAHPQMRALAKERLDKKVEFDASNNINTAGGGQNSSAQHKLEILPPTVVDQIKL 430
Query: 103 WESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMRE 162
W+ +L+R++ + + EF S+ F+ C+YA D L+W D KMR V + + +
Sbjct: 431 WQLELDRIQAFEGYLFKEFASQQEFDVLCNYASDVGVLIWSDRTKMRFFVTKDGISQVAD 490
Query: 163 F 163
F
Sbjct: 491 F 491
>gi|326474968|gb|EGD98977.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
Tfb2 [Trichophyton tonsurans CBS 112818]
Length = 489
Score = 105 bits (263), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 82/141 (58%), Gaps = 14/141 (9%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
GF++VETN ++YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GIT +
Sbjct: 318 GTGFIIVETNHRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITAD 377
Query: 78 Q------QNAHPRVADKIPS--------VPKNVCDQIRLWESDLNRVETTPAHYYDEFPS 123
Q +AHP++ S +P V DQIRLW+ + +R++ T + EF +
Sbjct: 378 QIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDT 437
Query: 124 RDVFEAACDYARDQSGLLWED 144
FEA C YA + L+W+
Sbjct: 438 FAEFEAPCKYAEEIGVLVWKS 458
>gi|281208616|gb|EFA82792.1| general transcription factor IIH [Polysphondylium pallidum PN500]
Length = 458
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 91/152 (59%), Gaps = 8/152 (5%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G++++ETNF++YAY+ S L +L LF K+ Y+LPNL VG +T+ES+ AF +GIT +Q
Sbjct: 305 GYIILETNFRIYAYTASSLQISLLSLFVKMLYRLPNLSVGILTRESIRTAFLHGITADQI 364
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
QN HP + P+ V +QIR+WE++ NR+ A +D FP+++ F Y
Sbjct: 365 VDFIKQNGHPNMLK--VGAPEIVFEQIRIWENERNRILYKKAVLFDSFPTQESFNMTLQY 422
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
A+D S +W K LVV+ ++ ++R
Sbjct: 423 AKDLSFYMWASEAKKVLVVSDNGFDAIKNYIR 454
>gi|389749775|gb|EIM90946.1| transcription factor Tfb2 [Stereum hirsutum FP-91666 SS1]
Length = 494
Score = 105 bits (262), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 88/157 (56%), Gaps = 13/157 (8%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
F+++ETN+++YAY+ + L +L LF ++ + PNL+VG+IT++S+ A NGIT +Q
Sbjct: 318 FIILETNYRLYAYTDNPLQTAVLNLFVTMKSRFPNLVVGSITRDSVRKALSNGITADQII 377
Query: 79 ----QNAHPRVADKI-------PSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVF 127
AHP++ + P +P V DQIRLWE + NR+++ Y F S+ +
Sbjct: 378 SYLTTYAHPQMRKNVSRCRSFTPLLPVTVQDQIRLWELERNRLKSQEGFLYTAFASQADY 437
Query: 128 EAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
E YA+ +LWE+ K E H ++R F+
Sbjct: 438 EYVLTYAKQLDVVLWENPTKRCFFGTMEGHANIRGFI 474
>gi|378729002|gb|EHY55461.1| transcription initiation factor TFIIH subunit H4 [Exophiala
dermatitidis NIH/UT8656]
Length = 508
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 86/153 (56%), Gaps = 20/153 (13%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
F++VETN+++YAY++S L +L LF + + PNL+ G ++K S+ A + GIT +Q
Sbjct: 338 FIIVETNYRLYAYTSSPLQIALLSLFVNLRSRHPNLVTGKMSKSSVQRAIQAGITADQII 397
Query: 79 ----QNAHP-----------RVADK---IPSVPKNVCDQIRLWESDLNRVETTPAHYYDE 120
+AHP R AD +P +P + DQI LW+ + +R+ TTP +
Sbjct: 398 SYLTSHAHPQMRRHAQAEQARNADPNRVVPILPATILDQIHLWQLERDRMTTTPGFLLKD 457
Query: 121 FPSRDVFEAACDYARDQSGLLWEDSKKMRLVVN 153
FP++ +EA C YA + L+W++ KK VN
Sbjct: 458 FPNQADYEAPCRYADEIGVLVWKNDKKRMFFVN 490
>gi|366997767|ref|XP_003683620.1| hypothetical protein TPHA_0A01010 [Tetrapisispora phaffii CBS 4417]
gi|357521915|emb|CCE61186.1| hypothetical protein TPHA_0A01010 [Tetrapisispora phaffii CBS 4417]
Length = 504
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/171 (31%), Positives = 91/171 (53%), Gaps = 24/171 (14%)
Query: 19 CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE- 77
G +VVETNFK+Y+YS S L IL LF + + N++ G IT+ES+ NA NG+T +
Sbjct: 327 VGALVVETNFKLYSYSNSPLQIAILSLFVHLRTRFSNMVTGQITRESIRNALANGVTADQ 386
Query: 78 -----QQNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNRVETTP 114
Q +AHP++ D++ +P V DQI+LW+ +L+R+ +
Sbjct: 387 IIAYLQSHAHPQMRRLAETTLKKKLELDPNTKDQLQILPPTVVDQIKLWQLELDRILSYD 446
Query: 115 AHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
Y +F + + +YA+D LLW++ +K + V+ E + + +F +
Sbjct: 447 GSLYSDFDNNQEYNMLYNYAKDIGVLLWKNDRKRKFFVSKEGNSQVLDFAK 497
>gi|346464903|gb|AEO32296.1| hypothetical protein [Amblyomma maculatum]
Length = 436
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 79/135 (58%), Gaps = 6/135 (4%)
Query: 19 CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ 78
G+++VETN+++YAY+ S+L +L LF ++ Y+ PNL+V +T+ES+ A +GIT+ Q
Sbjct: 300 AGYIMVETNYRVYAYTNSQLQVALLALFCELLYRFPNLVVARLTRESVRQALRSGITSNQ 359
Query: 79 ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACD 132
AHP + P +P + DQ+RLWE + +R Y +F S+ F+ +
Sbjct: 360 IIKFLRMYAHPEALKQTPVIPATIMDQLRLWELERDRFVYREGVLYSQFISQSDFQLLRN 419
Query: 133 YARDQSGLLWEDSKK 147
YA D L+W++ K
Sbjct: 420 YASDLGVLIWDNPSK 434
>gi|347830304|emb|CCD46001.1| similar to RNA polymerase II transcription factor B subunit 2
[Botryotinia fuckeliana]
Length = 471
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 94/165 (56%), Gaps = 16/165 (9%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+++ETN+++YAY+ S L +L LF+K+ + PN++ G ++++S+ A +GIT
Sbjct: 307 ASQKGFIIIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVSRDSIRTAIAHGIT 366
Query: 76 TEQ------QNAHPRV------ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPS 123
++Q +AHP++ + P +P V DQIRLW+ + R++ P +F +
Sbjct: 367 SDQIITYLSTHAHPQLVKASSASHGGPVLPPTVVDQIRLWQLENERMKAVPGFLMKDFET 426
Query: 124 RDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 167
+ +E YA + L+W+ D+K+M V E +R++ + +
Sbjct: 427 QKEYEGCAKYAEEVGVLVWKSDAKRMFFVTRVE---QLRDYFKAK 468
>gi|45187599|ref|NP_983822.1| ADL274Wp [Ashbya gossypii ATCC 10895]
gi|62901343|sp|Q75B51.1|TFB2_ASHGO RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|44982337|gb|AAS51646.1| ADL274Wp [Ashbya gossypii ATCC 10895]
gi|374107034|gb|AEY95942.1| FADL274Wp [Ashbya gossypii FDAG1]
Length = 514
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 90/173 (52%), Gaps = 29/173 (16%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G ++VETNFK+Y+YS S L IL LF ++ + N++ G IT+ES+ A NGIT +Q
Sbjct: 339 GALIVETNFKLYSYSNSPLQIAILSLFIHLKTRFQNMVTGQITRESIRRALHNGITADQI 398
Query: 79 -----QNAHP---RVA---------------DKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP R+A D + +P V DQI+LW+ +L+R+ +
Sbjct: 399 IAYMETHAHPQMRRLAGDNLEKKLELDPNCRDTLQVLPPTVVDQIKLWQLELDRIISYDG 458
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAE-----IHMHMREF 163
+ + +F + ++ YARD LLW D KK V+ E I H R+F
Sbjct: 459 YLFRDFDNLQEYQVLAQYARDIGVLLWSDDKKKMFFVSKEGNAQVIDFHKRKF 511
>gi|156839389|ref|XP_001643386.1| hypothetical protein Kpol_479p16 [Vanderwaltozyma polyspora DSM
70294]
gi|156113994|gb|EDO15528.1| hypothetical protein Kpol_479p16 [Vanderwaltozyma polyspora DSM
70294]
Length = 504
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 94/170 (55%), Gaps = 24/170 (14%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G +VVETNFK+Y+YS S L IL LF ++ + N++ G IT+ES+ NA NGIT EQ
Sbjct: 329 GALVVETNFKLYSYSNSPLQIAILSLFVHLKTRFSNMVTGQITRESIRNALSNGITAEQI 388
Query: 79 -----QNAHP---RVADK---------------IPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP R+A++ + +P V DQI+LW+ +L+R+ +
Sbjct: 389 IAYLETHAHPQMRRLAEERLKKKLELDQNSKETLQILPPTVVDQIKLWQLELDRIISYEG 448
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
Y +F ++ ++ YA+D L+W+D +K + V+ E + + ++ +
Sbjct: 449 SLYSDFDNKQEYDMLYSYAQDIGVLIWKDDRKRKFFVSKESNSQVLDYAK 498
>gi|345492819|ref|XP_001602702.2| PREDICTED: general transcription factor IIH subunit 4-like [Nasonia
vitripennis]
Length = 428
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 93/156 (59%), Gaps = 7/156 (4%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
FV+VETN+++YAY+ S L ++ LF ++ Y+ PNL+V +T++S+ A ++GIT Q
Sbjct: 270 FVIVETNYRVYAYTNSNLQVALIGLFCELLYRFPNLVVAILTRDSVRAALKSGITAVQIV 329
Query: 79 ----QNAHPRVAD-KIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
Q+AH ++ D P++P + DQI+LWE++ NR + Y +F S+ FE D+
Sbjct: 330 GYLNQHAHNKMIDPGPPTLPPTIVDQIKLWENERNRFIFSEGVLYSQFHSQIDFEVLRDH 389
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
A ++W+ +K +VV H +++F + +K
Sbjct: 390 AVSLGVMIWQSDRKRTMVVTKTGHDDVKKFWKRYSK 425
>gi|365986883|ref|XP_003670273.1| hypothetical protein NDAI_0E02130 [Naumovozyma dairenensis CBS 421]
gi|343769043|emb|CCD25030.1| hypothetical protein NDAI_0E02130 [Naumovozyma dairenensis CBS 421]
Length = 522
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 89/170 (52%), Gaps = 24/170 (14%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G ++VETNFK+Y+YS S L IL LF ++++ N++ G IT+ES+ A NGIT EQ
Sbjct: 344 GSLIVETNFKLYSYSNSPLQIAILSLFVHLKFRFVNMVTGQITRESIRRALINGITAEQI 403
Query: 79 -----QNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP++ D + +P V DQIRLW+ +L+RV
Sbjct: 404 IAYLETHAHPQMRRLAEEKLEKKLELDANCKDSLQILPPTVVDQIRLWQLELDRVIAYEG 463
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
Y +F + + YA+D LLW+D KK + V+ E + + ++ +
Sbjct: 464 SLYSDFENNTEYTTLYKYAQDIGVLLWKDDKKRKFFVSKEGNSQVLDYAK 513
>gi|154308354|ref|XP_001553513.1| hypothetical protein BC1G_08237 [Botryotinia fuckeliana B05.10]
Length = 410
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 94/165 (56%), Gaps = 16/165 (9%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+++ETN+++YAY+ S L +L LF+K+ + PN++ G ++++S+ A +GIT
Sbjct: 246 ASQKGFIIIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVSRDSIRTAIAHGIT 305
Query: 76 TEQ------QNAHPRV------ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPS 123
++Q +AHP++ + P +P V DQIRLW+ + R++ P +F +
Sbjct: 306 SDQIITYLSTHAHPQLVKASSASHGGPVLPPTVVDQIRLWQLENERMKAVPGFLMKDFET 365
Query: 124 RDVFEAACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 167
+ +E YA + L+W+ D+K+M V E +R++ + +
Sbjct: 366 QKEYEGCAKYAEEVGVLVWKSDAKRMFFVTRVE---QLRDYFKAK 407
>gi|345569654|gb|EGX52519.1| hypothetical protein AOL_s00043g13 [Arthrobotrys oligospora ATCC
24927]
Length = 452
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 55/155 (35%), Positives = 83/155 (53%), Gaps = 14/155 (9%)
Query: 17 DNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
D GF+++ETN+ S L +L LF+++ + PNL+ IT+ES+ A GIT
Sbjct: 299 DENGFLIIETNYH------SPLQIAVLNLFARLISRFPNLVTARITRESIRQAIGKGITA 352
Query: 77 EQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
EQ NAHP++ P +P V DQIRLW+ + R++ T + + EF + F A
Sbjct: 353 EQIIDYMTANAHPQLRKNSPVLPPTVVDQIRLWQIEGERMKVTSGYLFKEFATNAEFNAV 412
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
C YA D L+W D K L V+ H + +++R
Sbjct: 413 CKYADDIGVLVWIDKNKGMLFVSK--HEQIADYIR 445
>gi|407927340|gb|EKG20235.1| Transcription factor Tfb2 [Macrophomina phaseolina MS6]
Length = 466
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 89/157 (56%), Gaps = 10/157 (6%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G++++ETN+++YAY++S L +L LF+ +E + PNL+ G +TKES+ A GIT +Q
Sbjct: 311 GYIIIETNYRLYAYTSSPLQIAVLALFTDLETRFPNLVSGKLTKESVQRAIGLGITADQI 370
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AHP++ K P +P V DQIRLW+ + +R+ T + +F D ++ Y
Sbjct: 371 ISYLDTYAHPQMQRKTPVLPPTVVDQIRLWQIEGDRMRATEGFLFKDFRDWDEYQDLKKY 430
Query: 134 ARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQNK 169
A L+W D +++ + E + ++++ + K
Sbjct: 431 ADSIGVLVWSNDQRRLMFLTRVE---QISQYIKNRTK 464
>gi|410082501|ref|XP_003958829.1| hypothetical protein KAFR_0H02850 [Kazachstania africana CBS 2517]
gi|372465418|emb|CCF59694.1| hypothetical protein KAFR_0H02850 [Kazachstania africana CBS 2517]
Length = 514
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 55/170 (32%), Positives = 89/170 (52%), Gaps = 24/170 (14%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G +++ETNFK+Y+YS S L IL LF ++ + N++ G IT+ES+ A NGIT +Q
Sbjct: 338 GSLIIETNFKLYSYSNSPLQIAILSLFVHLKSRFVNMVTGQITRESIRRALVNGITADQI 397
Query: 79 -----QNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHPR+ D + +P V DQI+LW+ +L+RV T
Sbjct: 398 IAYLETHAHPRMRRLAEEKLEKKLELDPNSKDTLQILPPTVVDQIKLWQLELDRVITYDG 457
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
Y +F + + YA+D L+W+D KK + V+ E + + ++ +
Sbjct: 458 SLYSDFENSTEYNMLSKYAQDIGVLIWKDDKKRKFFVSKEGNTQVLDYAK 507
>gi|154273196|ref|XP_001537450.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150415962|gb|EDN11306.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 455
Score = 102 bits (253), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 85/146 (58%), Gaps = 21/146 (14%)
Query: 12 IPTIADN---CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYN 68
+P+ N GF+++ETN+++YAY++S L ++ LF+ ++Y+ PNLI G +T++S+
Sbjct: 282 VPSATSNEPSTGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRR 341
Query: 69 AFENGITTEQ------QNAHPRV----ADKIPS--------VPKNVCDQIRLWESDLNRV 110
A E GIT +Q +AHP++ A K + +P V DQIRLW+ + +R+
Sbjct: 342 AVEMGITADQIISYLTTHAHPQMRKYHATKPGANPVGVHTVLPPTVVDQIRLWQLERDRI 401
Query: 111 ETTPAHYYDEFPSRDVFEAACDYARD 136
+ TP + +F S FE C YA +
Sbjct: 402 KATPGFLFKDFVSLAEFEGPCRYAEE 427
>gi|334323685|ref|XP_003340422.1| PREDICTED: valyl-tRNA synthetase, mitochondrial-like [Monodelphis
domestica]
Length = 1641
Score = 102 bits (253), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 59/190 (31%), Positives = 96/190 (50%), Gaps = 37/190 (19%)
Query: 16 ADNCGFVVVETNFKMYAYST-------------------------------SKLHCEILR 44
A GF+VVETN+++YAY+ S+L ++
Sbjct: 302 AHQPGFIVVETNYRLYAYTGEVGGMRMGIGMLEGRNCGFFGIILFFSLATESELQIALIA 361
Query: 45 LFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCD 98
LFS++ Y+ PN++V IT+ES+ A NGIT +Q AHP + +IP +P + D
Sbjct: 362 LFSEMLYRFPNMVVAQITRESVQQAIANGITAQQIIHFLKTRAHPVMLKQIPVLPPTITD 421
Query: 99 QIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHM 158
QIRLWE + +R+ + Y++F S+ FE +AR+ L++E++ K +VV H
Sbjct: 422 QIRLWELERDRLRFSEGVLYNQFLSQVDFELLLSHARELGVLVFENTAKRLMVVTPAGHS 481
Query: 159 HMREFLRGQN 168
++ F + Q
Sbjct: 482 DVKRFWKRQK 491
>gi|391335225|ref|XP_003741996.1| PREDICTED: general transcription factor IIH subunit 4-like
[Metaseiulus occidentalis]
Length = 457
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 55/160 (34%), Positives = 91/160 (56%), Gaps = 7/160 (4%)
Query: 17 DNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
D G+++VETN+++YAY+ S L +L LF ++ Y+ PNL+V +T+ES+ A + GIT+
Sbjct: 295 DEQGYIIVETNYRVYAYTDSPLQVALLSLFCELLYRFPNLLVAVLTRESVRQALKGGITS 354
Query: 77 EQ------QNAHPRVADKIPS-VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
Q +H V ++ +P V DQ+RLWE + +R + + Y +F + FE
Sbjct: 355 NQITHFLKSRSHRVVLEREEGIIPMTVTDQLRLWELERDRFKFGESVLYSQFQTTGDFEM 414
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+YARD L +E+ +K LVV+ +R+F + K
Sbjct: 415 LRNYARDIGVLKYENPQKRFLVVSKSGDAEVRQFWKRHKK 454
>gi|339237219|ref|XP_003380164.1| general transcription factor IIH subunit 4 [Trichinella spiralis]
gi|316977049|gb|EFV60220.1| general transcription factor IIH subunit 4 [Trichinella spiralis]
Length = 674
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 82/151 (54%), Gaps = 6/151 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF++VETNF++Y Y+ S L I+ F + Y+ PNL+ + +ES+ AF+ I+ EQ
Sbjct: 293 GFLIVETNFRVYCYTDSVLDLAIVSTFCEPLYRFPNLVACILNRESVRRAFQVNISAEQI 352
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
NAH + + P++P V DQI+LWE + +R + P Y F S + DY
Sbjct: 353 IQYLFSNAHKNMQKQTPTIPSTVTDQIKLWEMERDRFKFDPGVMYSNFFSDTDYITIRDY 412
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
A+D LL E LVV+A+ H +FL
Sbjct: 413 AKDLGVLLCEHEANRALVVSADGHEQSNQFL 443
>gi|260945437|ref|XP_002617016.1| hypothetical protein CLUG_02460 [Clavispora lusitaniae ATCC 42720]
gi|238848870|gb|EEQ38334.1| hypothetical protein CLUG_02460 [Clavispora lusitaniae ATCC 42720]
Length = 511
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/188 (29%), Positives = 93/188 (49%), Gaps = 43/188 (22%)
Query: 19 CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ 78
G +++ETNFK+YAY+ S L IL LF ++ + N+++G IT+ES+ A NGIT +Q
Sbjct: 318 SGQIILETNFKLYAYTNSPLEIAILNLFVNLKVRFANMVIGQITRESIRKALYNGITADQ 377
Query: 79 ------QNAHPRV----------------------ADKIPS---------------VPKN 95
+AH ++ A P +P N
Sbjct: 378 IIKFLETHAHSQMRTSAKEKLDKKIEFDASHNINTAGGAPQSKSSDSGVAQHRLEILPPN 437
Query: 96 VCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAE 155
V DQI+LW+ +L+R++T + + +F +++ ++ C+YA + LLW D K+R V AE
Sbjct: 438 VVDQIKLWQLELDRIQTFEGYLFKDFKNQNEYDVLCNYATEVGVLLWSDKTKLRFFVTAE 497
Query: 156 IHMHMREF 163
+ +F
Sbjct: 498 GMHQVADF 505
>gi|221120480|ref|XP_002159945.1| PREDICTED: general transcription factor IIH subunit 4-like [Hydra
magnipapillata]
Length = 458
Score = 100 bits (250), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 89/150 (59%), Gaps = 6/150 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S LH +L LF+ I+ + P+ V +++ES+ A GI+ EQ
Sbjct: 302 GFLVVETNYRVYAYTDSVLHIALLSLFTDIKARFPSFTVALLSRESVQQALACGISAEQI 361
Query: 79 -----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDY 133
AH ++ P + + DQI+LWE + +R+ + Y++F S+ FE +
Sbjct: 362 IDFLKTRAHSQMTTSSPIIASTITDQIKLWEMERDRLRYSQGVLYNQFLSQSDFEMLRKF 421
Query: 134 ARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
A +++ LLW +++K +VV+ H ++++
Sbjct: 422 ADEKNHLLWANNQKRLMVVSKSGHEDVKKY 451
>gi|403217068|emb|CCK71563.1| hypothetical protein KNAG_0H01490 [Kazachstania naganishii CBS
8797]
Length = 519
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/160 (35%), Positives = 83/160 (51%), Gaps = 24/160 (15%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G ++VETNFK+Y+YS S L IL LF ++ + N++ G ITK S+ NA +NGIT EQ
Sbjct: 343 GVLIVETNFKVYSYSNSPLQIAILSLFVHLKSRFINMVTGQITKRSIINALDNGITAEQI 402
Query: 79 -----QNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP++ D + +P V DQI+LW+ + +RV
Sbjct: 403 IAYLETHAHPQMRRLAEAQLEKKRELDQNCKDPLQILPPTVVDQIKLWQLEKDRVMAFEG 462
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAE 155
Y +F S + YA+D +LW + KK +L V+ E
Sbjct: 463 SLYSDFESNAEYNILKKYAQDIGVMLWSEDKKRKLFVSQE 502
>gi|332030486|gb|EGI70174.1| General transcription factor IIH subunit 4 [Acromyrmex echinatior]
Length = 550
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 86/146 (58%), Gaps = 7/146 (4%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
D G+++VETN+++YAY+ S L +L LF ++ Y+ PN++V +T++S+ A ++GIT
Sbjct: 304 TDKEGYIIVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNVVVSILTRDSVRQALKSGIT 363
Query: 76 TEQ------QNAHPRVADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
Q Q+AH ++ + P + P + DQI+LWE++ NR + Y +F S+ FE
Sbjct: 364 ASQIVGYLRQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFLFSEGVLYSQFLSQTDFE 423
Query: 129 AACDYARDQSGLLWEDSKKMRLVVNA 154
D+A L+W+ ++ L+ +
Sbjct: 424 VLRDHAVSTGVLIWQSERRSTLLADT 449
>gi|402866404|ref|XP_003897374.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial,
partial [Papio anubis]
Length = 1647
Score = 100 bits (249), Expect = 2e-19, Method: Composition-based stats.
Identities = 58/187 (31%), Positives = 94/187 (50%), Gaps = 39/187 (20%)
Query: 20 GFVVVETNFKMYAY---------------------------------STSKLHCEILRLF 46
GF+VVETN+++YAY S S+L ++ LF
Sbjct: 351 GFIVVETNYRLYAYTGEVGQSAPEEKKEGLEGVWVWGWPPHPLSIPGSESELQIALIALF 410
Query: 47 SKIEYQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKNVCDQI 100
S++ Y+ PN++V +T+ES+ A +GIT +Q AHP + + P +P + DQI
Sbjct: 411 SEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQI 470
Query: 101 RLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHM 160
RLWE + +R+ T Y++F S+ FE +AR+ L++E+S K +VV H +
Sbjct: 471 RLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDV 530
Query: 161 REFLRGQ 167
+ F + Q
Sbjct: 531 KRFWKRQ 537
>gi|296422658|ref|XP_002840876.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637102|emb|CAZ85067.1| unnamed protein product [Tuber melanosporum]
Length = 460
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 77/133 (57%), Gaps = 6/133 (4%)
Query: 17 DNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
+ GF+++ETN+++YAY+ S L +L LF K+ + PNL+ G I++ S+ A + GIT
Sbjct: 295 EGKGFIILETNYRVYAYTDSPLQIAVLNLFVKLSTRYPNLVSGRISRRSIQEAIKMGITA 354
Query: 77 EQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
+Q +AHP++ + ++P V DQIRLW+ + R+ TT + +F S FE
Sbjct: 355 DQVIDYLSAHAHPQMRKSLVTLPPTVVDQIRLWQIEGERMRTTTGFLFKDFHSTQEFEEV 414
Query: 131 CDYARDQSGLLWE 143
YA + L W+
Sbjct: 415 AKYAEELGVLKWK 427
>gi|396080769|gb|AFN82390.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon
romaleae SJ-2008]
Length = 414
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 83/145 (57%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQN 80
F++VETN K+YAY+ S+ ++ LF + LPNLI G+IT+ES+ AF+ GIT +Q
Sbjct: 267 FIIVETNNKIYAYTNSEYEKSVIHLFCNVSVSLPNLIKGSITEESVNVAFDKGITGKQII 326
Query: 81 AHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGL 140
+ K S+P + QI +WES NR+ P + Y F + ++ ++ ++S L
Sbjct: 327 HFLEASVKPGSLPPAITSQIMIWESKRNRIFMVPGYIYSNFLNLSDYQRVLEFCTERSYL 386
Query: 141 LWEDSKKMRLVVNAEIHMHMREFLR 165
+ D K +VV E H +REF++
Sbjct: 387 IESDVGKRMIVVKLEGHELVREFIK 411
>gi|255713706|ref|XP_002553135.1| KLTH0D09790p [Lachancea thermotolerans]
gi|238934515|emb|CAR22697.1| KLTH0D09790p [Lachancea thermotolerans CBS 6340]
Length = 506
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 86/160 (53%), Gaps = 24/160 (15%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G V++ETNFK+Y+YS S L +L LF ++ + N++ G IT+ES+ A NGIT +Q
Sbjct: 331 GAVILETNFKLYSYSNSPLQIAVLSLFVHLKSRFSNMVTGQITRESIRRALHNGITADQI 390
Query: 79 -----QNAHP---RVADK---------------IPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP R+A++ + +P V DQI+LW+ +L+R+ +
Sbjct: 391 IAYMETHAHPQMRRLAEQTLDKKMELDPNCNEGLQILPPTVVDQIKLWQLELDRIISYDG 450
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAE 155
+ + +F + + YA+D LLW D KK R V+ E
Sbjct: 451 YLFTDFENFQEYNMLSSYAKDIGVLLWSDDKKKRFFVSQE 490
>gi|254584224|ref|XP_002497680.1| ZYRO0F11088p [Zygosaccharomyces rouxii]
gi|238940573|emb|CAR28747.1| ZYRO0F11088p [Zygosaccharomyces rouxii]
Length = 509
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 52/160 (32%), Positives = 84/160 (52%), Gaps = 24/160 (15%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G ++VETNFK+Y+YS S L IL LF ++ + N++ G +T+ES+ A NGIT +Q
Sbjct: 334 GALIVETNFKLYSYSNSPLQIAILSLFVHMKSRFSNMVTGQLTRESIRQALLNGITADQI 393
Query: 79 -----QNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP++ D + +P V DQI+LW+ +L+R+ +
Sbjct: 394 IAYLETHAHPQMRRMAEEQLEKRLELDPNSKDPLQILPPTVVDQIKLWQLELDRIISFEG 453
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAE 155
Y +F + F YA+D L+W+D +K + V+ E
Sbjct: 454 SLYSDFENHQEFTLLSSYAQDIGVLIWKDDRKKKFFVSKE 493
>gi|150863860|ref|XP_001382481.2| RNA polymerase II transcription factor B subunit 2 (RNA polymerase
II transcription factor B p52 subunit) (RNA polymerase
II transcription factor B 52 kDa subunit)
[Scheffersomyces stipitis CBS 6054]
gi|149385116|gb|ABN64452.2| RNA polymerase II transcription factor B subunit 2 (RNA polymerase
II transcription factor B p52 subunit) (RNA polymerase
II transcription factor B 52 kDa subunit)
[Scheffersomyces stipitis CBS 6054]
Length = 515
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 58/196 (29%), Positives = 93/196 (47%), Gaps = 44/196 (22%)
Query: 12 IPTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 71
I T N G V++ETNFK+YAY+ S L IL LF ++ + N++ G IT+ES+ +A
Sbjct: 310 IDTPTGNQGTVIIETNFKLYAYTNSPLEIAILNLFVHLKSRFTNMVCGQITRESIRSALY 369
Query: 72 NGITTEQ------QNAHPRVA--------------------------------------D 87
NGIT +Q +AHP++
Sbjct: 370 NGITADQIIKFLETHAHPQMKLLAKEKLDKKIEFDTSHNINTAGGAPQSKVSGEGNVAQH 429
Query: 88 KIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKK 147
K+ +P V DQI+LW+ +L+R++T + + +F ++ F+ +YA + L+W D K
Sbjct: 430 KLEILPPTVVDQIKLWQLELDRIQTFDGYLFKDFSNQQEFDTLSNYASEVGVLVWSDKIK 489
Query: 148 MRLVVNAEIHMHMREF 163
+ V AE + EF
Sbjct: 490 KKFFVTAEGMTQVAEF 505
>gi|303388087|ref|XP_003072278.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon
intestinalis ATCC 50506]
gi|303301417|gb|ADM10918.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon
intestinalis ATCC 50506]
Length = 414
Score = 99.0 bits (245), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 85/145 (58%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQN 80
F++VETN K+YAY+ S+ ++ LF + + LPNL G+IT+ES+ AF+ GIT Q
Sbjct: 267 FIIVETNNKIYAYTNSEYEKSVIHLFCNVSFNLPNLTKGSITEESVNAAFDKGITGRQII 326
Query: 81 AHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGL 140
+ K S+P + +QI +WES NR+ P + Y F + ++ ++ +++ L
Sbjct: 327 HFLEASSKPGSLPPAIINQIIIWESKRNRIFMAPGYLYSNFLNLSDYQKVLEFCSERNYL 386
Query: 141 LWEDSKKMRLVVNAEIHMHMREFLR 165
+ D + +VVN + H+ ++EF++
Sbjct: 387 IESDIDRRMIVVNPKGHVFVKEFIK 411
>gi|403308524|ref|XP_003944708.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Saimiri boliviensis boliviensis]
Length = 1752
Score = 99.0 bits (245), Expect = 6e-19, Method: Composition-based stats.
Identities = 58/192 (30%), Positives = 94/192 (48%), Gaps = 44/192 (22%)
Query: 20 GFVVVETNFKMYAY--------------------------------------STSKLHCE 41
GF+VVETN+++YAY S S+L
Sbjct: 450 GFIVVETNYRLYAYTGEVGQRAPGRAGWGRGNARLEGVWVWGWPPHPLSIPGSESELQIA 509
Query: 42 ILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKN 95
++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q AHP + + P +P
Sbjct: 510 LIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPT 569
Query: 96 VCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAE 155
+ DQIRLWE + +R+ T Y++F S+ FE +AR+ L++E+S K +VV
Sbjct: 570 ITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPA 629
Query: 156 IHMHMREFLRGQ 167
H ++ F + Q
Sbjct: 630 GHSDVKRFWKRQ 641
>gi|62088538|dbj|BAD92716.1| VARS2L protein variant [Homo sapiens]
Length = 1653
Score = 99.0 bits (245), Expect = 6e-19, Method: Composition-based stats.
Identities = 58/192 (30%), Positives = 94/192 (48%), Gaps = 44/192 (22%)
Query: 20 GFVVVETNFKMYAY--------------------------------------STSKLHCE 41
GF+VVETN+++YAY S S+L
Sbjct: 352 GFIVVETNYRLYAYTGEAGQRAPGRGGWGHLKEGLEGVWVWGWPPLPLSIPGSESELQIA 411
Query: 42 ILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKN 95
++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q AHP + + P +P
Sbjct: 412 LIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPT 471
Query: 96 VCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAE 155
+ DQIRLWE + +R+ T Y++F S+ FE +AR+ L++E+S K +VV
Sbjct: 472 ITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPA 531
Query: 156 IHMHMREFLRGQ 167
H ++ F + Q
Sbjct: 532 GHSDVKRFWKRQ 543
>gi|344303892|gb|EGW34141.1| RNA polymerase II transcription factor B subunit 2 [Spathaspora
passalidarum NRRL Y-27907]
Length = 519
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 57/189 (30%), Positives = 91/189 (48%), Gaps = 45/189 (23%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G V++ETNFK+YAY+ S L IL LF + + N++ G IT+ES+ NA NGIT +Q
Sbjct: 321 GTVIIETNFKLYAYTNSPLEIAILNLFVHLRTRFANMVCGQITRESIRNALYNGITADQI 380
Query: 79 -----QNAHPRV---------------------------------AD------KIPSVPK 94
+AHP++ AD K+ +P
Sbjct: 381 IRFLETHAHPQMKVLAKEKLDKKIEFDTSHNINTAGGAPQSQALRADGSVAQHKLEILPP 440
Query: 95 NVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNA 154
NV DQI+LW+ +L+R++T + + +FP++ F+ YA + L+W D K + V
Sbjct: 441 NVVDQIKLWQLELDRIQTFDGYLFKDFPNQQEFDKLSSYASELGVLIWSDKIKRKFFVTK 500
Query: 155 EIHMHMREF 163
+ + +F
Sbjct: 501 DGMSQVADF 509
>gi|395533817|ref|XP_003768949.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Sarcophilus harrisii]
Length = 1638
Score = 98.6 bits (244), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 57/193 (29%), Positives = 95/193 (49%), Gaps = 40/193 (20%)
Query: 16 ADNCGFVVVETNFKMYAYST----------------------------------SKLHCE 41
A GF+VVETN+++YAY+ S+L
Sbjct: 335 AHQPGFIVVETNYRLYAYTGEVGGIKMRIEIRGGGIGGELNLLWSSIVFSLETESELQIA 394
Query: 42 ILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPKN 95
++ LFS++ Y+ PN++V +T+ES+ A NGIT +Q AHP + + P +P
Sbjct: 395 LIALFSEMLYRFPNMVVAQVTRESVQQAIANGITAQQIIHFMKTRAHPVMLKQTPVLPPT 454
Query: 96 VCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAE 155
+ DQIRLWE + +R+ + Y++F S+ FE +AR+ L++E++ K +VV
Sbjct: 455 ITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLLSHARELGVLVFENTAKRLMVVTPA 514
Query: 156 IHMHMREFLRGQN 168
H ++ F + Q
Sbjct: 515 GHSDVKRFWKRQK 527
>gi|401825125|ref|XP_003886658.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon hellem
ATCC 50504]
gi|395459803|gb|AFM97677.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon hellem
ATCC 50504]
Length = 414
Score = 98.6 bits (244), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 83/145 (57%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQN 80
F++VETN K+YAY+ S+ ++ LF + LPNLI G+IT+ES+ AF+ GIT +Q
Sbjct: 267 FIIVETNNKIYAYTNSEYEKSVIHLFCSVSVNLPNLIKGSITEESVNVAFDKGITGKQII 326
Query: 81 AHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGL 140
+ K S+P + QI +WES NR+ P + Y F + ++ ++ ++S L
Sbjct: 327 HFLEASVKPGSLPPAISSQIMIWESKRNRIFMAPGYIYSNFLNLSDYQRVLEFCTERSYL 386
Query: 141 LWEDSKKMRLVVNAEIHMHMREFLR 165
+ D K +VV E H ++EF++
Sbjct: 387 IESDVDKRMIVVKVEGHELVKEFIK 411
>gi|390461382|ref|XP_002746360.2| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Callithrix jacchus]
Length = 1675
Score = 98.2 bits (243), Expect = 1e-18, Method: Composition-based stats.
Identities = 58/193 (30%), Positives = 94/193 (48%), Gaps = 45/193 (23%)
Query: 20 GFVVVETNFKMYAY---------------------------------------STSKLHC 40
GF+VVETN+++YAY S S+L
Sbjct: 372 GFIVVETNYRLYAYTGEAGQRAPGRAGWGXHLKERFEGVWVWGWPPHPLSIPGSESELQI 431
Query: 41 EILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPSVPK 94
++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q AHP + + P +P
Sbjct: 432 ALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPP 491
Query: 95 NVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNA 154
+ DQIRLWE + +R+ T Y++F S+ FE +AR+ L++E+S K +VV
Sbjct: 492 TITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTP 551
Query: 155 EIHMHMREFLRGQ 167
H ++ F + Q
Sbjct: 552 AGHSDVKRFWKRQ 564
>gi|193633122|ref|XP_001945987.1| PREDICTED: general transcription factor IIH subunit 4-like
[Acyrthosiphon pisum]
Length = 471
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 89/155 (57%), Gaps = 8/155 (5%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
++VVETNF++YA + S L ++ LF+ + Y+ PN+ G +T++S+ A +GIT Q
Sbjct: 315 YIVVETNFRIYAMTDSDLKVALVALFTHMLYRFPNMSAGILTRDSVRTALRSGITAAQIV 374
Query: 79 ----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYA 134
+ HP++ + +P+ V DQI LWE++ NR+ T Y + + +E +YA
Sbjct: 375 RFLTVHTHPQMQE--CGMPQTVIDQIYLWENERNRLTYTDGVLYSNINTPNDYETIKNYA 432
Query: 135 RDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
D L+W D ++ +VV+ + H +R+F + Q K
Sbjct: 433 ADIGALVWCDERRRNIVVSTDGHDDVRKFWKKQPK 467
>gi|164425579|ref|XP_960485.2| hypothetical protein NCU05523 [Neurospora crassa OR74A]
gi|157070982|gb|EAA31249.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 475
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 91/159 (57%), Gaps = 13/159 (8%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
+++ETN+++YAY+TS L +L LF ++ + PN++ G +T++S+ A GIT +Q
Sbjct: 317 IIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSGRLTRDSIRQAIGFGITADQIIS 376
Query: 79 ---QNAHPRVADKI-----PSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
+AH ++ + P +P V DQIRLW+ + R++TT + +F S + + A
Sbjct: 377 YLHAHAHQQMVREATVTGKPVLPPTVVDQIRLWQLENERMQTTAGFLFKDFESTEEYAAL 436
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
YA + L+W++ K R + A H +R++L+ + K
Sbjct: 437 SRYAEEIGVLVWKNDK--RQLFFASKHEQLRDYLKSRKK 473
>gi|393910938|gb|EFO26137.2| transcription factor Tfb2 family protein [Loa loa]
Length = 486
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 82/159 (51%), Gaps = 13/159 (8%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GFV+VETN+++YAY+ S L IL F+++ Y+ ++ VG +++E++ AF+ GIT Q
Sbjct: 320 GFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQI 379
Query: 79 -----QNAHPRV-------ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDV 126
NAHP I SVP V DQIRLWE + R+ A Y F S
Sbjct: 380 IAFLRANAHPTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFESEKE 439
Query: 127 FEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+ Y Q LLW D + +V+ E H ++ + +
Sbjct: 440 YFGLKGYVSSQGILLWCDDVQRLMVITEEGHESVKSWWK 478
>gi|393910939|gb|EJD76097.1| transcription factor Tfb2 family protein, variant [Loa loa]
Length = 389
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 78/151 (51%), Gaps = 13/151 (8%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GFV+VETN+++YAY+ S L IL F+++ Y+ ++ VG +++E++ AF+ GIT Q
Sbjct: 223 GFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQI 282
Query: 79 -----QNAHPRV-------ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDV 126
NAHP I SVP V DQIRLWE + R+ A Y F S
Sbjct: 283 IAFLRANAHPTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFESEKE 342
Query: 127 FEAACDYARDQSGLLWEDSKKMRLVVNAEIH 157
+ Y Q LLW D + +V+ E H
Sbjct: 343 YFGLKGYVSSQGILLWCDDVQRLMVITEEGH 373
>gi|336466164|gb|EGO54329.1| hypothetical protein NEUTE1DRAFT_87562 [Neurospora tetrasperma FGSC
2508]
gi|350286986|gb|EGZ68233.1| transcription factor Tfb2 [Neurospora tetrasperma FGSC 2509]
Length = 475
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 91/159 (57%), Gaps = 13/159 (8%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
+++ETN+++YAY+TS L +L LF ++ + PN++ G +T++S+ A GIT +Q
Sbjct: 317 IIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSGRLTRDSIRQAIGFGITADQIIS 376
Query: 79 ---QNAHPRVADKI-----PSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
+AH ++ + P +P V DQIRLW+ + R++TT + +F S + + A
Sbjct: 377 YLHAHAHQQMVREATVTGKPVLPPTVVDQIRLWQLENERMQTTAGFLFKDFESTEEYAAL 436
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
YA + L+W++ K R + A H +R++L+ + K
Sbjct: 437 SRYAEEIGVLVWKNDK--RQLFFASKHEQLRDYLKSRKK 473
>gi|312069992|ref|XP_003137940.1| transcription factor Tfb2 family protein [Loa loa]
Length = 470
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 82/159 (51%), Gaps = 13/159 (8%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GFV+VETN+++YAY+ S L IL F+++ Y+ ++ VG +++E++ AF+ GIT Q
Sbjct: 304 GFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQI 363
Query: 79 -----QNAHPRV-------ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDV 126
NAHP I SVP V DQIRLWE + R+ A Y F S
Sbjct: 364 IAFLRANAHPTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFESEKE 423
Query: 127 FEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+ Y Q LLW D + +V+ E H ++ + +
Sbjct: 424 YFGLKGYVSSQGILLWCDDVQRLMVITEEGHESVKSWWK 462
>gi|429962184|gb|ELA41728.1| hypothetical protein VICG_01232 [Vittaforma corneae ATCC 50505]
Length = 449
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 82/158 (51%), Gaps = 2/158 (1%)
Query: 12 IPTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 71
IP + F+V+ETNFK+YAY+++ +L LFSK Y PNLI +ESL +AF
Sbjct: 294 IPINKASNKFIVLETNFKIYAYTSTAYDKSVLSLFSKTVYVFPNLIKACFDEESLLSAFN 353
Query: 72 NGITTEQQNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAAC 131
GIT +Q + + + VPKN+ +QI +WE +R+ + Y +F F
Sbjct: 354 KGITAKQIIKY--LQEHSEEVPKNIVNQISIWEHRQHRIRARNGYLYHDFIHLSDFHRVL 411
Query: 132 DYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
Y + GL++ D K +V IH ++ F++ K
Sbjct: 412 RYVESKGGLIYRDEVKRMIVGEERIHESVKNFIKEMQK 449
>gi|294660113|ref|XP_002777731.1| DEHA2G23254p [Debaryomyces hansenii CBS767]
gi|218511847|sp|Q6BGW8.2|TFB2_DEBHA RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|199434471|emb|CAR66042.1| DEHA2G23254p [Debaryomyces hansenii CBS767]
Length = 515
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/185 (29%), Positives = 85/185 (45%), Gaps = 47/185 (25%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
N G +++ETNFK+YAY+ S L IL LF ++ + N++ G IT+ES+ NA NGIT +
Sbjct: 316 NQGTIILETNFKIYAYTNSPLEIAILNLFVHLKTRFSNMVCGQITRESIRNALYNGITAD 375
Query: 78 Q-----------------------------------------------QNAHPRVADKIP 90
Q +N K+
Sbjct: 376 QIIKFLETHAHSQMKILAKERLDKKIEFDTSNNINTAGGAPQSQMMTNENGTTVAQHKLE 435
Query: 91 SVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRL 150
+P V DQI+LW+ +L+R++T + + +F S+ FEA +YA + LLW + KM+
Sbjct: 436 VLPPTVVDQIKLWQLELDRIQTFEGYLFKDFASQQEFEALSNYASEIGVLLWSNKSKMKF 495
Query: 151 VVNAE 155
V +
Sbjct: 496 FVTKD 500
>gi|354546081|emb|CCE42810.1| hypothetical protein CPAR2_204530 [Candida parapsilosis]
Length = 451
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 94/191 (49%), Gaps = 46/191 (24%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
+++ETNFK+YAY+ S L IL LF +++ + N++ G IT+ES+ NA NGIT++Q
Sbjct: 261 IIIETNFKIYAYTNSPLEIAILNLFVQMKTRFSNMVCGQITRESIRNALYNGITSDQIIK 320
Query: 79 ---QNAHPRVA------------------------------------DKIPSVPKNVCDQ 99
+AHP++ K+ +P NV DQ
Sbjct: 321 FLETHAHPQMKALAKEKLDKKVEFDASHNINTAGGAPQSKTDGTISQHKLEVIPPNVVDQ 380
Query: 100 IRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMH 159
I+LW+ +L+R++T + + +F ++ ++ +YA + L+W D K + V +
Sbjct: 381 IKLWQLELDRIQTVEGYLFKDFANQQEYDMLSNYATELGVLVWGDKAKRKFFVTKDGMAQ 440
Query: 160 MREF----LRG 166
+ +F LRG
Sbjct: 441 VADFANRKLRG 451
>gi|358333989|dbj|GAA34589.2| transcription initiation factor TFIIH subunit 4 [Clonorchis
sinensis]
Length = 456
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 48/124 (38%), Positives = 73/124 (58%), Gaps = 6/124 (4%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
+ G++++ETNF++YAY+ S L +L LFSKI + PNL+V IT++S+ A GIT +
Sbjct: 298 DVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGITAD 357
Query: 78 Q------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAAC 131
Q NAHP + + P +P + DQIRLWE + +R Y++F FE
Sbjct: 358 QIISFLTTNAHPDMLREPPILPPTLVDQIRLWELERDRFVFQEGCLYEQFSKSADFEMVR 417
Query: 132 DYAR 135
D+A+
Sbjct: 418 DFAK 421
>gi|193654859|ref|XP_001944632.1| PREDICTED: general transcription factor IIH subunit 4-like
[Acyrthosiphon pisum]
Length = 470
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 87/155 (56%), Gaps = 8/155 (5%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
++VVETNF++YA + S L ++ LF+ + Y+ PN+ G +T++S+ A NGIT Q
Sbjct: 314 YIVVETNFRIYAMTDSDLKVALVALFTHMLYRFPNMSAGILTRDSVQTALRNGITAAQIV 373
Query: 79 ----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYA 134
+ HP++ + +P+ V DQI LWE + NR+ Y + + + +EA +YA
Sbjct: 374 RFLTVHTHPQMQE--CGMPQTVIDQIYLWEYERNRLTYRDGVLYSDINTPNDYEAIKNYA 431
Query: 135 RDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
D LLW D ++ ++V+ + H + F + Q K
Sbjct: 432 ADIGALLWCDERQRNIIVSTDGHNDVTTFWKKQPK 466
>gi|452839401|gb|EME41340.1| hypothetical protein DOTSEDRAFT_156149 [Dothistroma septosporum
NZE10]
Length = 497
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 84/145 (57%), Gaps = 15/145 (10%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
+D GF+++ETN+++YAY++S L IL LF+ + + PNLI ITK S NA GIT
Sbjct: 318 SDEKGFIILETNYRLYAYTSSPLLISILSLFASLNTRYPNLITAKITKASTQNAIAAGIT 377
Query: 76 TEQ------QNAHP-----RVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSR 124
++Q +AHP A+ P +P V DQI+LW+ + R+E T + + S+
Sbjct: 378 SDQIIDYLTTHAHPVLRRQAAANDAPILPPTVVDQIKLWQMEGERMEATKGYLIRDIGSQ 437
Query: 125 DVFEAACDYARDQSGLLWEDSKKMR 149
+ ++ A +YA D G+L SK+ R
Sbjct: 438 EEYDKAVNYA-DAIGVL---SKEFR 458
>gi|323306981|gb|EGA60265.1| Tfb2p [Saccharomyces cerevisiae FostersO]
Length = 448
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 80/152 (52%), Gaps = 24/152 (15%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G ++VETNFK+Y+YS S L +L LF ++ + N+++G IT+ES+ A NGIT +Q
Sbjct: 288 GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQI 347
Query: 79 -----QNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP++ + + +P V DQIRLW+ +L+RV T
Sbjct: 348 IAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEG 407
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKK 147
Y +F + + YA+D LLW+D K
Sbjct: 408 SLYSDFETSQEYNLLSKYAQDIGVLLWKDDXK 439
>gi|448527133|ref|XP_003869438.1| TFIIH and nucleotide excision repair factor 3 complex subunit
[Candida orthopsilosis Co 90-125]
gi|380353791|emb|CCG23303.1| TFIIH and nucleotide excision repair factor 3 complex subunit
[Candida orthopsilosis]
Length = 500
Score = 96.3 bits (238), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 54/191 (28%), Positives = 94/191 (49%), Gaps = 46/191 (24%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
+++ETNFK+YAY+ S L IL LF +++ + N++ G IT+ES+ NA NGIT++Q
Sbjct: 310 IIIETNFKIYAYTNSPLEIAILNLFVQMKTRFSNMVCGQITRESIRNALYNGITSDQIIK 369
Query: 79 ---QNAHPRV------------------------------------ADKIPSVPKNVCDQ 99
+AHP++ K+ +P NV DQ
Sbjct: 370 FLETHAHPQMRALAKEKLDKKVEFDASHNINTAGGAPQSKTDGAISQHKLEVIPPNVVDQ 429
Query: 100 IRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMH 159
I+LW+ +L+R++T + + +F ++ ++ +YA + L+W D K + V +
Sbjct: 430 IKLWQLELDRIQTVEGYLFKDFANQQEYDTLSNYATELGVLVWGDKVKRKFFVTKDGMAQ 489
Query: 160 MREF----LRG 166
+ +F LRG
Sbjct: 490 VADFANRKLRG 500
>gi|301786953|ref|XP_002928911.1| PREDICTED: LOW QUALITY PROTEIN: valyl-tRNA synthetase,
mitochondrial-like [Ailuropoda melanoleuca]
Length = 1637
Score = 96.3 bits (238), Expect = 4e-18, Method: Composition-based stats.
Identities = 57/196 (29%), Positives = 94/196 (47%), Gaps = 48/196 (24%)
Query: 20 GFVVVETNFKMYAYS------------------------------------------TSK 37
GF+VVETN+++YAY+ S+
Sbjct: 329 GFIVVETNYRLYAYTGEARQRAPNSRXGVGTFLVTLEGLEGVWVWGXPSSSLPIPGPESE 388
Query: 38 LHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPS 91
L ++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q AHP + + P
Sbjct: 389 LQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPV 448
Query: 92 VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLV 151
+P + DQIRLWE + +R+ T Y++F S+ FE +AR+ L++E+S K +V
Sbjct: 449 LPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMV 508
Query: 152 VNAEIHMHMREFLRGQ 167
V H ++ F + Q
Sbjct: 509 VTPAGHSDVKRFWKRQ 524
>gi|397471696|ref|XP_003807420.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Pan paniscus]
Length = 1673
Score = 96.3 bits (238), Expect = 4e-18, Method: Composition-based stats.
Identities = 58/199 (29%), Positives = 94/199 (47%), Gaps = 51/199 (25%)
Query: 20 GFVVVETNFKMYAY---------------------------------------------S 34
GF+VVETN+++YAY S
Sbjct: 365 GFIVVETNYRLYAYTGEAGQRAPGRGGWGALXHFSXHLKEGLEGVWVWGCPPHPLSIPGS 424
Query: 35 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADK 88
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q AHP + +
Sbjct: 425 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 484
Query: 89 IPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKM 148
P +P + DQIRLWE + +R+ T Y++F S+ FE +AR+ L++E+S K
Sbjct: 485 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 544
Query: 149 RLVVNAEIHMHMREFLRGQ 167
+VV H ++ F + Q
Sbjct: 545 LMVVTPAGHSDVKRFWKRQ 563
>gi|444320663|ref|XP_004180988.1| hypothetical protein TBLA_0E04140 [Tetrapisispora blattae CBS 6284]
gi|387514031|emb|CCH61469.1| hypothetical protein TBLA_0E04140 [Tetrapisispora blattae CBS 6284]
Length = 518
Score = 95.5 bits (236), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 83/163 (50%), Gaps = 24/163 (14%)
Query: 14 TIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 73
T A G +++ETNFK+Y+YS S L +L LF ++ + N++ G IT++S+ A NG
Sbjct: 323 TQATADGALIIETNFKLYSYSNSPLQIAVLSLFVHLKTRFSNMVTGQITRDSIRRALRNG 382
Query: 74 ITTEQ------QNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNR 109
IT EQ ++HP++ D + +P V DQI+LW+ +L+R
Sbjct: 383 ITAEQIIAYLLTHSHPQMRRLAEGNLEKKLELDPNAKDSLQILPPTVVDQIKLWQLELDR 442
Query: 110 VETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVV 152
+ + Y +F + + YA D LLW++ KK + V
Sbjct: 443 IISHEGSLYSDFENNQEYNLLSTYAEDIGVLLWKNDKKRKFFV 485
>gi|320587960|gb|EFX00435.1| tfiih and nucleotide excision repair factor 3 complexes subunit
[Grosmannia clavigera kw1407]
Length = 518
Score = 95.5 bits (236), Expect = 7e-18, Method: Composition-based stats.
Identities = 51/161 (31%), Positives = 87/161 (54%), Gaps = 13/161 (8%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G V++ETNF++YAY+ S L +L LFSK+ + P ++ G +T++S+ A GIT++Q
Sbjct: 356 GSVILETNFRLYAYTASPLQIAVLALFSKLSQRFPGMVAGKLTRDSVRRAISYGITSDQI 415
Query: 79 -----QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
NAH ++ A+ P +P V DQIRLWE + R+++ + +F S ++
Sbjct: 416 ISYLAANAHEQMHKYAAANHRPVLPPTVVDQIRLWELENERMKSHKGFLFKDFDSAKEYD 475
Query: 129 AACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
YA + L++ + KK V H + +L+ + K
Sbjct: 476 MLAQYADEIGVLVYRNDKKRHFFVTK--HEQLVTYLKARKK 514
>gi|320581720|gb|EFW95939.1| TFIIH basal transcription factor complex subunit 2 [Ogataea
parapolymorpha DL-1]
Length = 544
Score = 95.1 bits (235), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 90/179 (50%), Gaps = 31/179 (17%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
++N G +VVETNFK+Y Y++S L IL LF ++ + N++ G IT+ES+ A NGIT
Sbjct: 357 SNNQGTIVVETNFKIYCYTSSPLQIAILNLFVHLKARFANMVTGVITRESVRRALINGIT 416
Query: 76 TEQ------QNAHP----RVADKIPS---------------------VPKNVCDQIRLWE 104
+Q +AHP + DK+ +P V DQI+LW+
Sbjct: 417 ADQMINYLGSHAHPWMVKQAEDKLARKLELESSIGNAAGQRQIDLEILPPTVVDQIKLWQ 476
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
+L+R+++ + Y +F + FE Y + ++W+D + R V E + + ++
Sbjct: 477 LELDRIQSFKGYLYKDFSTDLEFEKLLTYGEEIGVIVWKDRARKRFFVTQEGNGQLLDY 535
>gi|324512881|gb|ADY45318.1| General transcription factor IIH subunit 4 [Ascaris suum]
Length = 481
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 82/153 (53%), Gaps = 13/153 (8%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE 77
+ G+++VETN+++YAY+ S L IL F+++ Y+ ++ VG +T++S+ A + GIT
Sbjct: 315 HSGYIIVETNYRVYAYTDSSLQLAILSTFTEMLYRFNDMSVGMLTRDSVRRALQVGITAS 374
Query: 78 Q------QNAHPRV---ADKIPS----VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSR 124
Q NAHP A+ P VP V DQIRLWE + +R+ + Y F S
Sbjct: 375 QIISFLRANAHPETVAAANASPGVIHCVPVTVADQIRLWEDERHRLVFFDSALYSTFESE 434
Query: 125 DVFEAACDYARDQSGLLWEDSKKMRLVVNAEIH 157
+ +Y R Q LLW D + +VV E H
Sbjct: 435 REYVGVKEYTRSQDILLWFDDVQRLVVVTEEGH 467
>gi|367009216|ref|XP_003679109.1| hypothetical protein TDEL_0A05660 [Torulaspora delbrueckii]
gi|359746766|emb|CCE89898.1| hypothetical protein TDEL_0A05660 [Torulaspora delbrueckii]
Length = 513
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 53/160 (33%), Positives = 82/160 (51%), Gaps = 24/160 (15%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE-- 77
G ++VETNFK+Y+YS S L IL LF ++ + N++ G IT+ES+ A NGIT +
Sbjct: 334 GALIVETNFKLYSYSNSPLQIAILSLFVHLKSRFTNMVTGQITRESIRRALINGITADQI 393
Query: 78 ----QQNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
Q +AHP++ D + +P V DQI+LW+ +L+R+ +
Sbjct: 394 IAYLQTHAHPQMRRLAEEKLEKKLELDANCKDTLQILPPTVVDQIKLWQLELDRIISYDG 453
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAE 155
Y + S + YA+D LLW+D +K V+ E
Sbjct: 454 SLYSDIDSHQEYILLSTYAQDIGVLLWKDDRKRIFFVSKE 493
>gi|401623322|gb|EJS41426.1| tfb2p [Saccharomyces arboricola H-6]
Length = 513
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 90/170 (52%), Gaps = 24/170 (14%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G ++VETNFK+Y+YS S L +L LF ++ + N+++G IT+ES+ A NGIT EQ
Sbjct: 337 GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITAEQI 396
Query: 79 -----QNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP++ + + +P V DQIRLW+ +L+RV T
Sbjct: 397 IAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVMTYEG 456
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
Y +F + + YA+D LLW+D KK + ++ E + + +F +
Sbjct: 457 SLYSDFETSQEYNVLNKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDFAK 506
>gi|254573074|ref|XP_002493646.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes
[Komagataella pastoris GS115]
gi|238033445|emb|CAY71467.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes
[Komagataella pastoris GS115]
gi|328354526|emb|CCA40923.1| RNA polymerase II transcription factor B subunit 2 [Komagataella
pastoris CBS 7435]
Length = 547
Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 85/175 (48%), Gaps = 33/175 (18%)
Query: 13 PTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 72
P + D+ G +V+ETNFK+Y Y+TS L IL LF + + N++ G IT+ES+ A N
Sbjct: 357 PEVKDH-GSIVLETNFKLYCYTTSPLQIAILNLFVHLRTRFANMVTGMITRESVREALRN 415
Query: 73 GITTEQ------QNAHPRVAD--------------------------KIPSVPKNVCDQI 100
GIT +Q +AH ++ K+ +P V DQI
Sbjct: 416 GITADQIIKYLETHAHSQMKKLAEEKLLKKLEFDSNATETQQIGSQVKLEVIPPTVVDQI 475
Query: 101 RLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAE 155
+LW+ +++R++T + +F + FE +YA + +LW D K + V E
Sbjct: 476 KLWQLEMDRLQTFAGFLFKDFANAQEFEQLANYADEVGVMLWRDDDKRKFFVTEE 530
>gi|440493551|gb|ELQ76006.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH, subunit TFB2 [Trachipleistophora
hominis]
Length = 426
Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 53/150 (35%), Positives = 84/150 (56%), Gaps = 1/150 (0%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQ 79
F++VETN+K+YAY+TS I++LFS+I +LPNL+ IT+ES+ AF GIT +Q
Sbjct: 278 SFLIVETNYKLYAYTTSSHELSIIKLFSQIVRELPNLVTAHITEESVNAAFLKGITGQQI 337
Query: 80 NAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSG 139
+ K +P V +QI +WE +R++ A Y F + + +E Y R++
Sbjct: 338 VDYLTEKSK-SELPPVVLEQILIWERQRDRMKCIDAVIYSHFMTYNEYEITYRYCREKGA 396
Query: 140 LLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
L+ D + LVV E H ++ F++ K
Sbjct: 397 LIDHDEFRRLLVVRLECHNEVKNFIKNNIK 426
>gi|46127843|ref|XP_388475.1| hypothetical protein FG08299.1 [Gibberella zeae PH-1]
gi|408390907|gb|EKJ70292.1| hypothetical protein FPSE_09509 [Fusarium pseudograminearum CS3096]
Length = 494
Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 88/161 (54%), Gaps = 13/161 (8%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G V++ETN+++YAY+ S L +L LF+K++ + P+++ G I+++S+ A GIT EQ
Sbjct: 334 GSVIIETNYRIYAYTQSTLQIAVLALFTKLQMRFPDMVAGRISRQSIRQAINFGITAEQI 393
Query: 79 -----QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
+AH ++ + P +P V DQIRLW+ + R++TT + +F +
Sbjct: 394 ISYLSAHAHDQMRRTASLNNKPVLPPTVVDQIRLWQLENERMKTTGGFLFKDFEDHKEYM 453
Query: 129 AACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
A +A + L+W + K A H +R++LR + K
Sbjct: 454 AVAGFAEEVGVLVWRNDVKGMFF--ASKHEQIRDYLRIRKK 492
>gi|326435722|gb|EGD81292.1| hypothetical protein PTSG_11329 [Salpingoeca sp. ATCC 50818]
Length = 545
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 81/161 (50%), Gaps = 11/161 (6%)
Query: 1 MAIQSRKESWFIPTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGA 60
+A+ R S + A CGF+V+ETNF++YAY+ S L +L+LF+K Y+LP ++V +
Sbjct: 272 LALALRTSSALVAKDAMRCGFLVIETNFRIYAYTKSDLKISLLKLFAKPLYKLPTMLVAS 331
Query: 61 ITKESLYNAFENGITTEQQNAHPRV----------ADKIPSVPKNVCDQIRLWESDLNRV 110
IT+ S+ A GIT Q +V VP V DQI LWE + +R+
Sbjct: 332 ITRSSVRQAMAQGITARQILHFLKVHCSSRMRHQPHSVFSPVPPTVSDQILLWEQETSRL 391
Query: 111 ETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLV 151
T + P + +F + +++ +W D K ++
Sbjct: 392 SKTEGVLFRHLP-KPIFPKLLAFVKERQACIWYDETKRAII 431
>gi|6325135|ref|NP_015203.1| Tfb2p [Saccharomyces cerevisiae S288c]
gi|62901134|sp|Q02939.1|TFB2_YEAST RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=General transcription and DNA repair
factor IIH subunit TFB2; Short=TFIIH subunit TFB2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|1163092|gb|AAB68240.1| Tfb2p: TFIIH subunit [Saccharomyces cerevisiae]
gi|1778059|gb|AAB40628.1| transcription/repair factor TFIIH subunit Tfb2, partial
[Saccharomyces cerevisiae]
gi|151942676|gb|EDN61022.1| transcription initiation factor TFIIH subunit [Saccharomyces
cerevisiae YJM789]
gi|190407835|gb|EDV11100.1| RNA polymerase II transcription factor B subunit 2 [Saccharomyces
cerevisiae RM11-1a]
gi|256274256|gb|EEU09164.1| Tfb2p [Saccharomyces cerevisiae JAY291]
gi|259150035|emb|CAY86838.1| Tfb2p [Saccharomyces cerevisiae EC1118]
gi|285815419|tpg|DAA11311.1| TPA: Tfb2p [Saccharomyces cerevisiae S288c]
gi|323331200|gb|EGA72618.1| Tfb2p [Saccharomyces cerevisiae AWRI796]
gi|323335024|gb|EGA76314.1| Tfb2p [Saccharomyces cerevisiae Vin13]
gi|323346175|gb|EGA80465.1| Tfb2p [Saccharomyces cerevisiae Lalvin QA23]
gi|323352000|gb|EGA84539.1| Tfb2p [Saccharomyces cerevisiae VL3]
gi|365762780|gb|EHN04313.1| Tfb2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392295887|gb|EIW06990.1| Tfb2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 513
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/170 (32%), Positives = 90/170 (52%), Gaps = 24/170 (14%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G ++VETNFK+Y+YS S L +L LF ++ + N+++G IT+ES+ A NGIT +Q
Sbjct: 337 GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQI 396
Query: 79 -----QNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP++ + + +P V DQIRLW+ +L+RV T
Sbjct: 397 IAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEG 456
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
Y +F + + YA+D LLW+D KK + ++ E + + +F +
Sbjct: 457 SLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDFAK 506
>gi|349581695|dbj|GAA26852.1| K7_Tfb2p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 513
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/170 (32%), Positives = 90/170 (52%), Gaps = 24/170 (14%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G ++VETNFK+Y+YS S L +L LF ++ + N+++G IT+ES+ A NGIT +Q
Sbjct: 337 GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQI 396
Query: 79 -----QNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP++ + + +P V DQIRLW+ +L+RV T
Sbjct: 397 IAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEG 456
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
Y +F + + YA+D LLW+D KK + ++ E + + +F +
Sbjct: 457 SLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDFAK 506
>gi|170589001|ref|XP_001899262.1| Transcription factor Tfb2 family protein [Brugia malayi]
gi|158593475|gb|EDP32070.1| Transcription factor Tfb2 family protein [Brugia malayi]
Length = 465
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 53/155 (34%), Positives = 82/155 (52%), Gaps = 13/155 (8%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GFV+VETN+++YAY+ S L IL F+++ Y+ ++ VG +++E++ AF+ GIT Q
Sbjct: 299 GFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFSDMSVGVLSREAVRRAFQVGITAAQI 358
Query: 79 -----QNAHP-------RVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDV 126
+NAHP I SVP V DQI+LWE + R+ A Y F S
Sbjct: 359 IAFLSRNAHPITCAVTTEYGRMIQSVPITVVDQIQLWEDERRRLTFCSAAVYSSFESEKE 418
Query: 127 FEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMR 161
+ Y Q+ LLW + +V+ E H +++
Sbjct: 419 YFGLKGYVISQNILLWCHDVQRLMVITEEGHENVK 453
>gi|323302721|gb|EGA56527.1| Tfb2p [Saccharomyces cerevisiae FostersB]
Length = 505
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/168 (32%), Positives = 89/168 (52%), Gaps = 24/168 (14%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G ++VETNFK+Y+YS S L +L LF ++ + N+++G IT+ES+ A NGIT +Q
Sbjct: 337 GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQI 396
Query: 79 -----QNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP++ + + +P V DQIRLW+ +L+RV T
Sbjct: 397 IAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEG 456
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREF 163
Y +F + + YA+D LLW+D KK + ++ E + + +F
Sbjct: 457 SLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDF 504
>gi|207340606|gb|EDZ68904.1| YPL122Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 226
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/170 (32%), Positives = 90/170 (52%), Gaps = 24/170 (14%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G ++VETNFK+Y+YS S L +L LF ++ + N+++G IT+ES+ A NGIT +Q
Sbjct: 50 GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQI 109
Query: 79 -----QNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP++ + + +P V DQIRLW+ +L+RV T
Sbjct: 110 IAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEG 169
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
Y +F + + YA+D LLW+D KK + ++ E + + +F +
Sbjct: 170 SLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDFAK 219
>gi|300707677|ref|XP_002996037.1| hypothetical protein NCER_100934 [Nosema ceranae BRL01]
gi|239605297|gb|EEQ82366.1| hypothetical protein NCER_100934 [Nosema ceranae BRL01]
Length = 403
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 88/159 (55%)
Query: 9 SWFIPTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYN 68
+ F+ + F+++ETNFK+YAY++S I++LFS I ++PNLI +IT+ESL N
Sbjct: 242 NLFVKSDTKKLRFILLETNFKLYAYTSSVYEMSIIQLFSNIYLKMPNLIKASITEESLSN 301
Query: 69 AFENGITTEQQNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
AF G+T++Q + +P + QI +WE+ R++ P + Y F + ++
Sbjct: 302 AFSKGVTSQQIINFLKSYSLFEDIPVAIISQIIIWETKRKRIKIFPGYLYSNFLNLIDYQ 361
Query: 129 AACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
+ ++ +D +K +V+ + + +++F++ Q
Sbjct: 362 KVVKFCLSNDCIIEKDDEKRMIVIKPDHNEVVKKFVKQQ 400
>gi|452977455|gb|EME77221.1| hypothetical protein MYCFIDRAFT_146475 [Pseudocercospora fijiensis
CIRAD86]
Length = 476
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 80/140 (57%), Gaps = 12/140 (8%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A+ GF+++ETN+++YAY+ S L IL LF+ + + PNL+ ITK S+ A +GIT
Sbjct: 300 ANEKGFIILETNYRLYAYTNSPLLISILSLFASLHTRYPNLVTAKITKTSVQAAISSGIT 359
Query: 76 TE------QQNAHP---RVAD--KIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSR 124
+ Q +AHP R A P +P V DQIRLW+ + R+ +T + E S+
Sbjct: 360 SNQIISYLQTHAHPILRRTASMHNAPILPPTVVDQIRLWQIEGERMTSTKGYLIREVGSK 419
Query: 125 DVFEAACDYARDQSGLLWED 144
+ +E A YA + G+L +D
Sbjct: 420 EDYEKAVQYA-EALGILVKD 438
>gi|344229873|gb|EGV61758.1| transcription factor Tfb2 [Candida tenuis ATCC 10573]
Length = 498
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 85/172 (49%), Gaps = 38/172 (22%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
+++ETNFK+Y+Y+ S L IL LF + + N++ G IT+ES+ NA NGIT Q
Sbjct: 314 IIIETNFKVYSYTQSPLEIAILNLFVHLRTRFANMVTGQITRESIRNALYNGITAGQVIK 373
Query: 79 ---QNAHPRVA--------------------------------DKIPSVPKNVCDQIRLW 103
+AHP++ K+ +P NV DQI+LW
Sbjct: 374 FLETHAHPQMKMLAQEKLDKKIEFDASNNINTATGKSTDAPMQHKLEILPPNVVDQIKLW 433
Query: 104 ESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAE 155
+ +L+R++T + + EF ++ +E+ +YA + L+W D KK V E
Sbjct: 434 QLELDRIQTFDGYLFREFNNQIDYESLRNYASEIGVLVWSDDKKRTCFVTKE 485
>gi|302917488|ref|XP_003052448.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256733388|gb|EEU46735.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 496
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 49/161 (30%), Positives = 87/161 (54%), Gaps = 13/161 (8%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G V++ETN+++YAY+ S L +L LFSK+ + P+++ G ++++S+ A GIT +Q
Sbjct: 336 GSVIIETNYRIYAYTQSTLQIAVLALFSKLSMRFPDMVAGRLSRQSIRQAINFGITADQI 395
Query: 79 -----QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
+AH ++ + P +P V DQIRLW+ + R++TT + +F +
Sbjct: 396 ISYLAAHAHDQMHRTAALNNKPVLPPTVVDQIRLWQLENERMKTTSGFLFKDFEDHKEYM 455
Query: 129 AACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
A +A + L+W + K A H +R++LR + K
Sbjct: 456 AVAGFAEEVGVLVWRNDVKGMFF--ASKHEQIRDYLRIRKK 494
>gi|392574410|gb|EIW67546.1| hypothetical protein TREMEDRAFT_33390 [Tremella mesenterica DSM
1558]
Length = 504
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 52/160 (32%), Positives = 89/160 (55%), Gaps = 8/160 (5%)
Query: 17 DNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
D+ F+++ETN+K+YAY+ ++L IL LF I+ Q NL+VG + + + A E GI+
Sbjct: 341 DDKKFLILETNYKIYAYTNNELEIAILNLFMDIKVQYRNLVVGKLDRAHVKAAMEKGISA 400
Query: 77 E------QQNAHPRVADKIPSV-PKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
Q +AHP++ P + + DQ+ LW+ + NR+ T + + EF S+D++E
Sbjct: 401 YQIIAYLQSHAHPQMYSSPPPILHTTIVDQLHLWDKERNRLRTEESEMF-EFFSKDLYED 459
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
A+ GLL K L VN + +++F++GQ +
Sbjct: 460 TEAEAKRYDGLLLAVPKDKLLFVNPSVKDAIKDFVKGQQR 499
>gi|402465832|gb|EJW01474.1| hypothetical protein EDEG_03937 [Edhazardia aedis USNM 41457]
Length = 465
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 88/149 (59%), Gaps = 6/149 (4%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQ 79
F+ +ETNFK+YAY+TSK IL LFSKI +LPNLI I ++S+ AF+ I+ +Q
Sbjct: 317 SFMYIETNFKLYAYTTSKYDFSILSLFSKISCKLPNLISAIINEDSVNTAFDKKISAKQI 376
Query: 80 NAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSG 139
+ + + K +VPKNV +Q+ +WES NR++T+ + F + F+ A D +++
Sbjct: 377 SYYLKSKGK--NVPKNVVEQVYIWESKRNRIKTSECTLFKGFLNLMDFKKAVDVCKEKHW 434
Query: 140 L--LWEDSKKMRLVVNAEIHMHMREFLRG 166
L ++ED K + VN +E+++
Sbjct: 435 LVDVYED--KRFIFVNNLYAEEFKEYIKN 461
>gi|448100135|ref|XP_004199281.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
gi|359380703|emb|CCE82944.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
Length = 509
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 52/193 (26%), Positives = 91/193 (47%), Gaps = 45/193 (23%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
++ G +++ETNFK+YAY+ S L IL LF ++ + PN++ G IT+ES+ A NGIT
Sbjct: 313 SNTGGSIILETNFKIYAYTNSPLEIAILNLFVHLKTRFPNMVTGQITRESIRKALFNGIT 372
Query: 76 TEQ------QNAHPRVAD---------------------------------------KIP 90
+Q +AH ++ K+
Sbjct: 373 ADQIIKFLESHAHSQMKQMAKENLDKKIEFDANNNINTATDSQQLSVGEFGSASSYHKLE 432
Query: 91 SVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRL 150
+P V DQI+LW+ +L+R++T + + +F S+ ++A +YA + LLW D + +
Sbjct: 433 ILPPTVVDQIKLWQLELDRIQTFEGYLFKDFSSQQEYDALRNYASELGVLLWSDKNRKKF 492
Query: 151 VVNAEIHMHMREF 163
V + + +F
Sbjct: 493 FVTKDGMSQVADF 505
>gi|430812024|emb|CCJ30551.1| unnamed protein product [Pneumocystis jirovecii]
Length = 319
Score = 91.7 bits (226), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 68/126 (53%), Gaps = 6/126 (4%)
Query: 36 SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKI 89
S L +L LF + + NL+VG IT+ S+ AF NGIT EQ AHP++
Sbjct: 178 SPLQIAVLNLFVHLYLRFSNLVVGVITRNSVRQAFMNGITAEQIISYLTSYAHPQMKKNT 237
Query: 90 PSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMR 149
P +P V DQIRLWE + NR++ T + + +F S FE A YA + + ++WE K
Sbjct: 238 PIIPPTVNDQIRLWEMERNRLKATEGYLFRDFNSNSDFELALKYANELNVVVWEAPSKRV 297
Query: 150 LVVNAE 155
VN +
Sbjct: 298 FFVNIQ 303
>gi|355693906|gb|AER99491.1| proteinral transcription factor IIH, polypeptide 4, 52kDa [Mustela
putorius furo]
Length = 143
Score = 91.7 bits (226), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 47/139 (33%), Positives = 79/139 (56%), Gaps = 6/139 (4%)
Query: 36 SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKI 89
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q AHP + +
Sbjct: 2 SELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQT 61
Query: 90 PSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMR 149
P +P + DQIRLWE + +R+ T Y++F S+ FE +AR+ L++E+S K
Sbjct: 62 PVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRL 121
Query: 150 LVVNAEIHMHMREFLRGQN 168
+VV H ++ F + Q
Sbjct: 122 MVVTPAGHSDVKRFWKRQK 140
>gi|402592023|gb|EJW85952.1| transcription factor Tfb2 family protein [Wuchereria bancrofti]
Length = 470
Score = 91.7 bits (226), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 53/159 (33%), Positives = 83/159 (52%), Gaps = 13/159 (8%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GFV+VETN+++YAY+ S L IL F+++ Y+ ++ VG +++E++ AF+ GIT Q
Sbjct: 304 GFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQI 363
Query: 79 -----QNAHP-------RVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDV 126
NAHP I SVP V DQI+LWE + R+ A Y F S
Sbjct: 364 IAFLSTNAHPITYAVTTEYGRMIQSVPITVADQIQLWEDERRRLTFCNATVYSSFESEKE 423
Query: 127 FEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+ Y Q+ LLW + +V+ E H +++ + +
Sbjct: 424 YFGLKGYVISQNILLWCHDVQRLMVITEEGHENVKAWWK 462
>gi|448103836|ref|XP_004200137.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
gi|359381559|emb|CCE82018.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
Length = 512
Score = 91.7 bits (226), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 52/193 (26%), Positives = 90/193 (46%), Gaps = 45/193 (23%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
++ G +++ETNFK+YAY+ S L IL LF ++ + PN++ G IT+ES+ A NGIT
Sbjct: 313 SNTGGSIILETNFKIYAYTNSPLEIAILNLFVHLKTRFPNMVTGQITRESIRKALFNGIT 372
Query: 76 TEQ------QNAHPRVAD---------------------------------------KIP 90
+Q +AH ++ K+
Sbjct: 373 ADQIIKFLESHAHSQMKQMAKENLDKKIEFDANNNINTATDNQQLSVGEFGSANSYHKLE 432
Query: 91 SVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRL 150
+P V DQI+LW+ +L+R++T + + +F S+ ++A YA + LLW D + +
Sbjct: 433 VLPPTVVDQIKLWQLELDRIQTFEGYLFKDFSSQQEYDALRSYASELGVLLWSDKNRKKF 492
Query: 151 VVNAEIHMHMREF 163
V + + +F
Sbjct: 493 FVTKDGMSQVADF 505
>gi|85691005|ref|XP_965902.1| hypothetical protein ECU01_0670 [Encephalitozoon cuniculi GB-M1]
gi|19068469|emb|CAD24937.1| hypothetical protein [Encephalitozoon cuniculi GB-M1]
gi|449329766|gb|AGE96035.1| hypothetical protein ECU01_0670 [Encephalitozoon cuniculi]
Length = 414
Score = 91.7 bits (226), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 49/145 (33%), Positives = 81/145 (55%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQN 80
F++VETN K+YAY+ S+ ++ LF + LPNLI G IT+ES+ AF+ GIT +Q
Sbjct: 267 FIIVETNNKIYAYTNSEYEKSVIHLFCNVTVNLPNLIKGIITEESVNVAFDKGITGKQII 326
Query: 81 AHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGL 140
+ + S+P + +QI +WES NR+ P + Y F + ++ ++ + L
Sbjct: 327 HFLEASVRQGSLPPAIRNQIVIWESKRNRIFMVPGYLYSNFLNLSDYQKVLEFCIRGNHL 386
Query: 141 LWEDSKKMRLVVNAEIHMHMREFLR 165
+ D K +VV E H ++EF++
Sbjct: 387 VESDVDKRMIVVKLEGHSLVKEFVK 411
>gi|340939354|gb|EGS19976.1| RNA polymerase II transcription factor B subunit 2-like protein
[Chaetomium thermophilum var. thermophilum DSM 1495]
Length = 514
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 88/162 (54%), Gaps = 15/162 (9%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G +++ETN+++YAY++S L +L LF+ + + ++ G +T+ES+ A GIT +Q
Sbjct: 354 GSIIIETNYRLYAYTSSPLQIAVLALFTHLNMRFAGMVTGRLTRESIRRAISFGITADQI 413
Query: 79 -----QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
+AH ++ A P +P V DQIRLW+ + R+ T+P + +F + + +
Sbjct: 414 ISYLASHAHEQMVRAAAAAGRPVLPPTVVDQIRLWQLENERMRTSPGFLFKDFENVEEYM 473
Query: 129 AACDYARDQSGLLW-EDSKKMRLVVNAEIHMHMREFLRGQNK 169
A YA + L+W D K+M E +R++L+ + K
Sbjct: 474 ALAGYAEEIGVLVWRSDRKRMFFASKFE---QLRDYLKSRKK 512
>gi|322697444|gb|EFY89224.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Metarhizium acridum CQMa 102]
Length = 541
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 88/166 (53%), Gaps = 15/166 (9%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
AD G V+VETN+++YAY+ S L +L LF K+ + P+++ G +T+ S++ A GIT
Sbjct: 377 ADQKGSVIVETNYRIYAYTQSTLQIAVLALFCKLNMRFPDMVAGRVTRTSIHQAINFGIT 436
Query: 76 TEQ------QNAHPR------VADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPS 123
+Q +AH + +A+K P +P V DQIRLW+ + R++TT + +F
Sbjct: 437 ADQIISYLAAHAHEQMHRTAALANK-PILPPTVVDQIRLWQLENERMKTTGGFLFRDFED 495
Query: 124 RDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
F +A + L+W K N + +R++L+ + +
Sbjct: 496 HKEFLDTARFAEEIGVLVWRSDKSGMFFANK--YEQIRDYLKSRKR 539
>gi|346323394|gb|EGX92992.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Cordyceps militaris CM01]
Length = 511
Score = 91.3 bits (225), Expect = 1e-16, Method: Composition-based stats.
Identities = 50/164 (30%), Positives = 87/164 (53%), Gaps = 13/164 (7%)
Query: 17 DNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
D G VV+ETN+++YAY+ S L +L LF+++ + P+++ G +T+ S+ A + GIT
Sbjct: 348 DQPGAVVIETNYRLYAYTQSTLQIAVLALFARLTMRFPDMVAGRLTRTSIRQAIQFGITA 407
Query: 77 EQ------QNAHPRVADKI-----PSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRD 125
+Q +AH ++ P +P V DQIRLW+ + R++TTP + +F
Sbjct: 408 DQIIAYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTPGFLFRDFDDPK 467
Query: 126 VFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+ +A + L+W D KK A +R++L+ + K
Sbjct: 468 EYHDTRRFADEIGVLVWSDDKKGLFF--ASKFEQIRDYLKSRKK 509
>gi|255721741|ref|XP_002545805.1| TFIIH basal transcription factor complex subunit 2 [Candida
tropicalis MYA-3404]
gi|240136294|gb|EER35847.1| TFIIH basal transcription factor complex subunit 2 [Candida
tropicalis MYA-3404]
Length = 494
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/184 (27%), Positives = 89/184 (48%), Gaps = 42/184 (22%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
+++ETNFK+YAY+ S L IL LF +++ + N++ G IT+ES+ NA NGIT +Q
Sbjct: 304 IIIETNFKLYAYTKSPLEIAILNLFVQLKTRFANMVCGQITRESIRNALYNGITADQIIK 363
Query: 79 ---QNAHPRV------------------------------------ADKIPSVPKNVCDQ 99
+AH ++ K+ +P NV DQ
Sbjct: 364 FLETHAHSQMRILAKEKLDKKIEFDTSHNINTAGGAPQSKTDGSISQHKLEILPPNVVDQ 423
Query: 100 IRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMH 159
I+LW+ +L+R++T + + +F ++ ++ +YA + L+W D K + V +
Sbjct: 424 IKLWQLELDRIQTFDGYLFKDFANQQEYDILSNYASELGVLIWADKVKRKFFVTKDGMTQ 483
Query: 160 MREF 163
+ +F
Sbjct: 484 VADF 487
>gi|313224499|emb|CBY20289.1| unnamed protein product [Oikopleura dioica]
Length = 411
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 86/151 (56%), Gaps = 6/151 (3%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
F++VETN+++YAY+ S+LH ++ LF++++Y+ P +IV ++++S+ + + GI+ EQ
Sbjct: 254 FILVETNYRIYAYTDSELHYALISLFAEVQYRFPYMIVAQMSRDSIQQSADYGISAEQIL 313
Query: 79 ----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYA 134
+AHP VP+ V D I LW + R++ Y +F ++ FE YA
Sbjct: 314 NYLRSSAHPIARKNKHWVPQVVEDNIHLWCKERERLKFNDGLLYHQFLDQEAFEMLKSYA 373
Query: 135 RDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+D L+W + ++ +VV H ++ + +
Sbjct: 374 QDIRALVWANDERRFMVVAPWSHDQIKSYYK 404
>gi|429965574|gb|ELA47571.1| hypothetical protein VCUG_00894 [Vavraia culicis 'floridensis']
Length = 424
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 83/150 (55%), Gaps = 1/150 (0%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQ 79
F+VVETN+K+YAY+TS I++LF +I +LPNL+V IT+ES+ AF GIT +Q
Sbjct: 276 SFLVVETNYKLYAYTTSSHELSIIKLFCQIIRELPNLVVAHITEESVNAAFVKGITGQQI 335
Query: 80 NAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSG 139
+ + +P V +QI +WE +R++ A Y F + +E Y +++
Sbjct: 336 VDYLNEKSR-SELPVVVLEQILIWERKRDRMKCMDAVIYSHFMTYGEYEITYKYCKEKGA 394
Query: 140 LLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
L+ D + LVV E H ++ F++ K
Sbjct: 395 LVDYDEFRRLLVVKLEYHNDVKNFIKSNIK 424
>gi|313245784|emb|CBY34779.1| unnamed protein product [Oikopleura dioica]
Length = 456
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 86/151 (56%), Gaps = 6/151 (3%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
F++VETN+++YAY+ S+LH ++ LF++++Y+ P +IV ++++S+ + + GI+ EQ
Sbjct: 299 FILVETNYRIYAYTDSELHYALISLFAEVQYRFPYMIVAQMSRDSIQQSADYGISAEQIL 358
Query: 79 ----QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYA 134
+AHP VP+ V D I LW + R++ Y +F ++ FE YA
Sbjct: 359 NYLRSSAHPIARKNKHWVPQVVEDNIHLWCKERERLKFNDGLLYHQFLDQEAFEMLKSYA 418
Query: 135 RDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+D L+W + ++ +VV H ++ + +
Sbjct: 419 QDIRALVWANDERRFMVVAPWSHDQIKSYYK 449
>gi|149238950|ref|XP_001525351.1| TFIIH basal transcription factor complex subunit 2 [Lodderomyces
elongisporus NRRL YB-4239]
gi|146450844|gb|EDK45100.1| TFIIH basal transcription factor complex subunit 2 [Lodderomyces
elongisporus NRRL YB-4239]
Length = 503
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 82/165 (49%), Gaps = 42/165 (25%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
+++ETNFKMYAY+ S L IL LF ++ + N+I G IT+ES+ NA NGIT +Q
Sbjct: 314 IIIETNFKMYAYTNSPLEIAILNLFVHMKTRFSNMICGQITRESIRNALYNGITADQIIK 373
Query: 79 ---QNAHPRV------------------------------------ADKIPSVPKNVCDQ 99
+AHP++ ++ +P NV DQ
Sbjct: 374 FLETHAHPQMRALAKERLDKKVEFDASHNINTAGGAPQSKTDGAISQHRLEVLPPNVVDQ 433
Query: 100 IRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWED 144
I+LW+ +L+R++T + + +F ++ ++ +YA + L+W D
Sbjct: 434 IKLWQLELDRIQTFDGYLFKDFANQQEYDMLSNYAAEIGVLIWAD 478
>gi|367030175|ref|XP_003664371.1| hypothetical protein MYCTH_2307126 [Myceliophthora thermophila ATCC
42464]
gi|347011641|gb|AEO59126.1| hypothetical protein MYCTH_2307126 [Myceliophthora thermophila ATCC
42464]
Length = 512
Score = 90.1 bits (222), Expect = 2e-16, Method: Composition-based stats.
Identities = 51/167 (30%), Positives = 93/167 (55%), Gaps = 13/167 (7%)
Query: 14 TIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 73
T A + G V++ETN+++YAY++S L +L LF+++ + + G +T++S+ A G
Sbjct: 346 TSAASKGSVIIETNYRLYAYTSSPLQIAVLALFTQLNMRFAGMATGRLTRDSIRRAIGFG 405
Query: 74 ITTEQ------QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFP 122
IT +Q +AH ++ A P +P V DQIRLW+ + R+ T+ + +F
Sbjct: 406 ITADQIISYLASHAHEQMVRAAAATGRPVLPPTVVDQIRLWQLENERMRTSAGFLFKDFD 465
Query: 123 SRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
S D + + YA + L+W+ KK R+ ++I +R++LR + +
Sbjct: 466 SLDEYLSLSAYAEEIGVLVWKSDKK-RMFYASKIE-QLRDYLRSRKR 510
>gi|302405781|ref|XP_003000727.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
albo-atrum VaMs.102]
gi|261360684|gb|EEY23112.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
albo-atrum VaMs.102]
Length = 476
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 87/165 (52%), Gaps = 13/165 (7%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
D ++VETN+++YAY+TS+L +L LF + + P ++ G +T++S+ A + GIT
Sbjct: 313 GDQTNGIIVETNYRVYAYTTSQLQIAVLSLFCHLSVKFPGMVSGRLTRQSVRQAIDFGIT 372
Query: 76 TEQ------QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSR 124
+Q +AH ++ AD P +P V DQIRLW+ + R++TT + +F
Sbjct: 373 ADQIISYLAAHAHEQMHRQAAADGKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFDDD 432
Query: 125 DVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+ +A + L W + +K A H +R++L+ + K
Sbjct: 433 AEYRDIARFADEIGVLAWRNDRKRMFF--ASKHEQIRDYLKLRKK 475
>gi|116201905|ref|XP_001226764.1| hypothetical protein CHGG_08837 [Chaetomium globosum CBS 148.51]
gi|88177355|gb|EAQ84823.1| hypothetical protein CHGG_08837 [Chaetomium globosum CBS 148.51]
Length = 503
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 92/167 (55%), Gaps = 13/167 (7%)
Query: 14 TIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 73
T A + G V++ETN+++YAY++S L +L LF+++ + + G +T++S+ A G
Sbjct: 337 TSAASKGSVIIETNYRLYAYTSSPLQIAVLGLFTQLNMRFAGMATGRLTRDSIRRAIGFG 396
Query: 74 ITTEQ------QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFP 122
IT +Q +AH ++ A P +P V DQIRLW+ + R+ T+ + +F
Sbjct: 397 ITADQIISYLASHAHDQMVRAAAATGRPVLPPTVVDQIRLWQLENERMRTSAGFLFKDFD 456
Query: 123 SRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
S + + + YA + L+W KK R+ ++I +R++L+ + K
Sbjct: 457 SLEEYVSLSSYADEVGVLVWRSDKK-RMFYASKIE-QLRDYLKSRKK 501
>gi|346972761|gb|EGY16213.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
dahliae VdLs.17]
Length = 476
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 87/165 (52%), Gaps = 13/165 (7%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
D ++VETN+++YAY+TS+L +L LF + + P ++ G +T++S+ A + GIT
Sbjct: 313 GDQTNGIIVETNYRVYAYTTSQLQIAVLSLFCHLSVKFPGMVSGRLTRQSVRQAIDFGIT 372
Query: 76 TEQ------QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSR 124
+Q +AH ++ AD P +P V DQIRLW+ + R++TT + +F
Sbjct: 373 ADQIISYLAAHAHEQMHRQAAADGKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFDDD 432
Query: 125 DVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+ +A + L W + +K A H +R++L+ + K
Sbjct: 433 AEYRDIARFADEIGVLAWRNDRKRMFF--ASKHEQIRDYLKLRKK 475
>gi|453082965|gb|EMF11011.1| RNA polymerase II transcription factor B subunit 2 [Mycosphaerella
populorum SO2202]
Length = 507
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 74/130 (56%), Gaps = 11/130 (8%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A G++++ETN+++YAY++S L IL LF+ + + PNLI ITK S++ A +GIT
Sbjct: 319 ASEKGYIILETNYRLYAYTSSPLPISILSLFASLNTRYPNLITAKITKTSIHTAIASGIT 378
Query: 76 TE------QQNAHP-----RVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSR 124
++ Q +AHP + P +P V DQIRLW+ + R+++T + + +
Sbjct: 379 SDQIISYLQTHAHPILRRQAALNSAPILPPTVVDQIRLWQIEGERMKSTKGYLIRDVGTE 438
Query: 125 DVFEAACDYA 134
D + A YA
Sbjct: 439 DDYTKAVQYA 448
>gi|366991461|ref|XP_003675496.1| hypothetical protein NCAS_0C01390 [Naumovozyma castellii CBS 4309]
gi|342301361|emb|CCC69129.1| hypothetical protein NCAS_0C01390 [Naumovozyma castellii CBS 4309]
Length = 516
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 75/147 (51%), Gaps = 24/147 (16%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G +++ETNFK+Y+YS S L L LF ++ + N++ G IT+ES+ A NGIT +Q
Sbjct: 341 GSLIIETNFKLYSYSNSPLQIATLSLFVHLKSRFANMVTGQITRESIRRALINGITADQI 400
Query: 79 -----QNAHPRV------------------ADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP++ D + +P V DQI+LW+ +L+RV T
Sbjct: 401 IAYLETHAHPQMRRLAEERLEKKLELDPNSKDPLQVLPPTVVDQIKLWQLELDRVITYEG 460
Query: 116 HYYDEFPSRDVFEAACDYARDQSGLLW 142
Y +F + + YA+D LLW
Sbjct: 461 SLYSDFETIAEYTTLSKYAQDIGVLLW 487
>gi|342884444|gb|EGU84659.1| hypothetical protein FOXB_04847 [Fusarium oxysporum Fo5176]
Length = 496
Score = 89.4 bits (220), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 91/162 (56%), Gaps = 15/162 (9%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G V++ETN+++YAY+ S L +L LF+K+ + P+++ G I+++S+ A + GIT EQ
Sbjct: 336 GSVIIETNYRIYAYTQSTLQIAVLALFTKLAMRFPDMVAGRISRQSIRQAIQFGITAEQI 395
Query: 79 -----QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
+AH ++ + P +P V DQIRLW+ + R++TT + F ++
Sbjct: 396 ISYLSAHAHDQMHRTAALNNKPVLPPTVVDQIRLWQLENERMKTTSGFLFKSFEDDREYK 455
Query: 129 AACDYARDQSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQNK 169
+A + L+W+ D+++M A H +R++++ + K
Sbjct: 456 DIARFAEEVGVLVWKNDARQMFF---ASKHEQIRDYMKIRKK 494
>gi|400597688|gb|EJP65418.1| RNA polymerase II transcription factor B subunit 2 [Beauveria
bassiana ARSEF 2860]
Length = 507
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 90/165 (54%), Gaps = 13/165 (7%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
D G +V+ETN+++YAY+ S L +L LFS++ + P+++ G IT+ S+ A + GIT
Sbjct: 343 GDQIGSIVIETNYRLYAYTQSTLQIAVLALFSRLTMRFPDMVAGRITRTSIRQAIQFGIT 402
Query: 76 TEQ------QNAHPRVADKI-----PSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSR 124
+Q +AH ++ P +P V DQIRLW+ + R++TT + +F +
Sbjct: 403 ADQIIAYLAAHAHEQMHRSAALAARPVLPPTVVDQIRLWQLENERMKTTGGFLFRDFDDQ 462
Query: 125 DVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
++ +A D+ G+L S K + ++ +R++L+ + K
Sbjct: 463 KEYQDTRRFA-DEIGVLVWSSDKTGMFFASKFE-QIRDYLKSRKK 505
>gi|308460920|ref|XP_003092758.1| hypothetical protein CRE_20976 [Caenorhabditis remanei]
gi|308252517|gb|EFO96469.1| hypothetical protein CRE_20976 [Caenorhabditis remanei]
Length = 128
Score = 88.2 bits (217), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 70/117 (59%), Gaps = 9/117 (7%)
Query: 58 VGAITKESLYNAFENGITTEQ------QNAHPR---VADKIPSVPKNVCDQIRLWESDLN 108
VG IT+ES+ A ++GIT Q NAHP+ + I +P V DQIRLWE +
Sbjct: 3 VGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGAINCLPITVADQIRLWEDERR 62
Query: 109 RVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
R++ A+ Y F S D F+ CDYA+++ LLW ++++ ++VN E H ++R++ +
Sbjct: 63 RMDLKDAYIYSHFESEDEFQGVCDYAQERGILLWANAQQKLVIVNEEGHEYVRQWYK 119
>gi|449296697|gb|EMC92716.1| hypothetical protein BAUCODRAFT_259506 [Baudoinia compniacensis
UAMH 10762]
Length = 489
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 78/132 (59%), Gaps = 7/132 (5%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A+ G++++ETN+++YAY++S L IL LF+ ++ + PNLI +TK S+ +A +GIT
Sbjct: 321 ANEKGYIILETNYRLYAYTSSPLPITILSLFATLKTRYPNLITAKLTKSSIQSAIASGIT 380
Query: 76 TEQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
++Q +AHP + + P +P V DQIRLW+ + R+ + + + ++
Sbjct: 381 SDQIITYLTTHAHPILRRQNPVLPPTVVDQIRLWQIEGERMTAWKGFLIRDVGTAEEYDK 440
Query: 130 ACDYARDQSGLL 141
A YA D G+L
Sbjct: 441 AVQYA-DALGVL 451
>gi|241958748|ref|XP_002422093.1| TFIIH subunit, putative; tanscription initiation factor IIB,
putative [Candida dubliniensis CD36]
gi|223645438|emb|CAX40094.1| TFIIH subunit, putative [Candida dubliniensis CD36]
Length = 494
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 81/165 (49%), Gaps = 42/165 (25%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
+++ETNFK+YAY+ S L IL LF + + N++ G IT+ES+ NA NGIT +Q
Sbjct: 304 IIIETNFKIYAYTKSPLEIAILNLFVHFKTRFANMVCGQITRESIRNALYNGITADQIIK 363
Query: 79 ---QNAHPRV------------------------------------ADKIPSVPKNVCDQ 99
+AHP++ K+ +P NV DQ
Sbjct: 364 FLETHAHPQMRILAKEKLDKKIEFDTSHNINTAGGAPQSKTDGMISQHKLEILPPNVVDQ 423
Query: 100 IRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWED 144
I+LW+ +L+R++T + + +F ++ ++ +YA + L+W D
Sbjct: 424 IKLWQLELDRIQTFDGYLFKDFSNQQEYDILSNYASELGVLIWAD 468
>gi|68490740|ref|XP_710812.1| hypothetical protein CaO19.13268 [Candida albicans SC5314]
gi|68490763|ref|XP_710801.1| hypothetical protein CaO19.5846 [Candida albicans SC5314]
gi|46432046|gb|EAK91553.1| hypothetical protein CaO19.5846 [Candida albicans SC5314]
gi|46432058|gb|EAK91564.1| hypothetical protein CaO19.13268 [Candida albicans SC5314]
gi|238879997|gb|EEQ43635.1| TFIIH basal transcription factor complex subunit 2 [Candida
albicans WO-1]
Length = 494
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 81/165 (49%), Gaps = 42/165 (25%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
+++ETNFK+YAY+ S L IL LF + + N++ G IT+ES+ NA NGIT +Q
Sbjct: 304 IIIETNFKIYAYTKSPLEIAILNLFVHFKTRFANMVCGQITRESIRNALYNGITADQIIK 363
Query: 79 ---QNAHPRV------------------------------------ADKIPSVPKNVCDQ 99
+AHP++ K+ +P NV DQ
Sbjct: 364 FLETHAHPQMRILAKEKLDKKIEFDTSHNINTAGGAPQSKTDGMVSQHKLEILPPNVVDQ 423
Query: 100 IRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWED 144
I+LW+ +L+R++T + + +F ++ ++ +YA + L+W D
Sbjct: 424 IKLWQLELDRIQTFDGYLFKDFSNQQEYDILSNYASELGVLIWAD 468
>gi|367040709|ref|XP_003650735.1| hypothetical protein THITE_2110518 [Thielavia terrestris NRRL 8126]
gi|346997996|gb|AEO64399.1| hypothetical protein THITE_2110518 [Thielavia terrestris NRRL 8126]
Length = 513
Score = 87.8 bits (216), Expect = 1e-15, Method: Composition-based stats.
Identities = 48/168 (28%), Positives = 89/168 (52%), Gaps = 13/168 (7%)
Query: 13 PTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 72
P G +++ETN+++YAY++S L +L LF+++ + ++ G +T++S+ A
Sbjct: 346 PDTTPTKGSLIIETNYRLYAYTSSPLQIAVLALFTQLNMRFAGMVTGRLTRDSIRRAISF 405
Query: 73 GITTEQ------QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEF 121
GIT +Q +AH ++ A P +P V DQIRLW+ + R+ TT + +F
Sbjct: 406 GITADQIISYLASHAHEQMTRAAAAAGRPVLPPTVVDQIRLWQLENERMRTTAGFLFKDF 465
Query: 122 PSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
S + + + YA + L+W+ +K V A +R++L+ + K
Sbjct: 466 DSPEEYVSLSGYAEEIGVLVWKSDRKR--VFFASKFEQLRDYLKSRKK 511
>gi|449017614|dbj|BAM81016.1| similar to TFIIH subunit TFB2 [Cyanidioschyzon merolae strain 10D]
Length = 693
Score = 87.8 bits (216), Expect = 2e-15, Method: Composition-based stats.
Identities = 51/151 (33%), Positives = 83/151 (54%), Gaps = 5/151 (3%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQNA 81
++VETNF++YAY+ S +L LF++I Y++P + +G IT++S+ A + GIT +Q
Sbjct: 538 IIVETNFRLYAYAASSFQVALLSLFTRILYRMPGVAIGVITRDSVRRALKCGITAKQLLH 597
Query: 82 HPRVAD-KIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEF-PSRD---VFEAACDYARD 136
+ + SVP NV DQI LWE + R++ P + F P+ D F+ +Y +
Sbjct: 598 FLGIHSMEGKSVPFNVHDQILLWELERKRIQAYPGVLLEGFEPTSDGRAFFDQLQEYTVE 657
Query: 137 QSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 167
W D + LV +A +RE++R Q
Sbjct: 658 LGAQQWCDRVRQLLVADASSFERLREWIRRQ 688
>gi|389642093|ref|XP_003718679.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae 70-15]
gi|351641232|gb|EHA49095.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae 70-15]
gi|440469172|gb|ELQ38293.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae Y34]
gi|440481925|gb|ELQ62459.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae P131]
Length = 490
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 82/141 (58%), Gaps = 12/141 (8%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G +++ETNF++YAY+++ L IL LF+ ++ + ++ G +++ S+ A +GIT +Q
Sbjct: 326 GAIILETNFRIYAYTSTPLQIAILALFANLKMRFAGMVTGQLSRHSIKRAISHGITADQI 385
Query: 79 -----QNAHP---RVAD--KIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
+AH R+A P +P V DQIRLW+ + R++ + + +F S+ F+
Sbjct: 386 IEYLASHAHEQMHRIAAIRNKPVLPPTVVDQIRLWQLETERMQVQRGYLFKDFESQAEFK 445
Query: 129 AACDYARDQSGLLW-EDSKKM 148
A DYA + L+W D++++
Sbjct: 446 AIADYADEVGVLIWRSDARQL 466
>gi|361130709|gb|EHL02459.1| putative RNA polymerase II transcription factor B subunit 2 [Glarea
lozoyensis 74030]
Length = 210
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 80/154 (51%), Gaps = 18/154 (11%)
Query: 12 IPTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 71
+ T + GF+++ETN+ S L +L LF+K+ + PN++ G +T+ES+Y A
Sbjct: 48 LSTGSGTAGFIIIETNYH------SPLQIAVLALFTKLTTRYPNMVTGRVTRESVYRAVS 101
Query: 72 NGITTEQ------QNAHPRV------ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYD 119
+GI+++Q +AHP + P +P V DQIRLW+ + R++TT +
Sbjct: 102 SGISSDQIITYLSTHAHPELLKASAAKGGGPVLPPTVVDQIRLWQIENERMKTTYGFLFK 161
Query: 120 EFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVN 153
+F S D ++ YA + L W + + V+
Sbjct: 162 DFESTDEYKKLLKYADEIGVLTWASKTEEKFFVS 195
>gi|340518979|gb|EGR49219.1| trancription factor tfb2-like protein [Trichoderma reesei QM6a]
Length = 480
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 84/164 (51%), Gaps = 13/164 (7%)
Query: 17 DNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
+ G VV+ETN+++YAY+ S L +L LF+K+ + P+++ G +++ S+ A GIT
Sbjct: 317 EQAGSVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQAINFGITA 376
Query: 77 EQ------QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRD 125
+Q +AH ++ P +P V DQIRLW+ + R++TT + +F
Sbjct: 377 DQIISYLSAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFTDDK 436
Query: 126 VFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+ ++ + L+W + N H +++FL+ + K
Sbjct: 437 DYLDTARFSEEIGVLVWRNDNTRMFFANK--HEQIKDFLKSRKK 478
>gi|452824378|gb|EME31381.1| transcription initiation factor TFIIH subunit H4 [Galdieria
sulphuraria]
Length = 740
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 67/126 (53%), Gaps = 9/126 (7%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
DN ++V+TNF++Y Y+ +L LF + Y+LPN+ VG IT++S+ A NGIT
Sbjct: 322 VDNDCRMIVQTNFRVYVYTNCTFQISLLSLFIQFLYRLPNMAVGVITRDSIRTALNNGIT 381
Query: 76 TEQ-----QN-AHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+Q QN HP + K+P + DQIRLWE+ RV T A D F + FE
Sbjct: 382 AQQMISYLQNHMHPNMKGKLPI---TIIDQIRLWEAQRFRVTTKHALLLDHFDNMTCFEK 438
Query: 130 ACDYAR 135
R
Sbjct: 439 TYGTTR 444
>gi|321265552|ref|XP_003197492.1| subunit of TFIIH and nucleotide excision repair factor 3 complexes
[Cryptococcus gattii WM276]
gi|317463972|gb|ADV25705.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes,
putative [Cryptococcus gattii WM276]
Length = 481
Score = 85.1 bits (209), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 47/157 (29%), Positives = 88/157 (56%), Gaps = 8/157 (5%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
AD+ F+++ETN+K+YAY++++L IL LF I + PNL+VG + ++ + A E GI+
Sbjct: 316 ADDKRFLILETNYKIYAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGIS 375
Query: 76 TEQ------QNAHPRVADKIPS-VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
Q +AHP++ + P + + DQ+ LW+ + NR++T Y EF S+++F+
Sbjct: 376 ARQIIAYLSSHAHPQMYNSPPPLLHPTIVDQLHLWDRERNRLQTEETVMY-EFFSKELFD 434
Query: 129 AACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+ A+ + L + + L + + EF++
Sbjct: 435 DTVNEAKANAALQHAATSQKLLFIEPHTKPAITEFVK 471
>gi|310799661|gb|EFQ34554.1| transcription factor tfb2 [Glomerella graminicola M1.001]
Length = 485
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 84/160 (52%), Gaps = 13/160 (8%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ-- 78
F + ETN+++YAY + L +L LF K++ + +++ G +T+ S+ NA E GIT +Q
Sbjct: 325 FRLSETNYRVYAYGQTPLQIAVLSLFCKLKLRFADMVSGRLTRNSIRNAVERGITADQII 384
Query: 79 ----QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEA 129
+AH ++ A P +P V DQIRLW+ + R+ TT + +F S ++
Sbjct: 385 SYLAAHAHEQMHRMAAARSRPILPPTVIDQIRLWQLETERMTTTSGFLFRDFDSPREYDD 444
Query: 130 ACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
YA + L+W + K A H +R++L+ + K
Sbjct: 445 IAGYAAEIGVLVWRNDKLGMFF--ASKHEQIRDYLKLRKK 482
>gi|322711210|gb|EFZ02784.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Metarhizium anisopliae ARSEF 23]
Length = 540
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 88/170 (51%), Gaps = 19/170 (11%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
AD G V+VETN+++YAY+ S L +L LF K+ + P+++ G +T+ S++ A GI+
Sbjct: 372 ADQKGSVIVETNYRIYAYTQSTLQIAVLALFCKLNMRFPDMVAGRVTRTSIHQAINFGIS 431
Query: 76 TEQ------QNAHPR------VADKIPSVPKNVCDQIRLWESDLN----RVETTPAHYYD 119
+Q +AH + +A+K P +P V DQIRLW+ + R++TT +
Sbjct: 432 ADQIISYLAAHAHEQMHRSAALANK-PILPPTVVDQIRLWQLEWQLENERMKTTGGFLFR 490
Query: 120 EFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+F F +A + L+W K N + +R++L+ + +
Sbjct: 491 DFEDHKEFLDTARFAEEIGVLVWRSDKSGMFFANK--YEQIRDYLKSRKR 538
>gi|405123868|gb|AFR98631.1| transcription factor TFIIH complex subunit Tfb2 [Cryptococcus
neoformans var. grubii H99]
Length = 481
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/157 (29%), Positives = 88/157 (56%), Gaps = 8/157 (5%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
AD+ F+++ETN+K+YAY++++L IL LF I + PNL+VG + ++ + A E GI+
Sbjct: 316 ADDKRFLILETNYKIYAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGIS 375
Query: 76 TEQ------QNAHPRVADKIPS-VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
Q +AHP++ + P + + DQ+ LW+ + NR++T Y EF S+++F+
Sbjct: 376 AGQIIAYLSSHAHPQMYNSPPPLLHPTIVDQLHLWDRERNRLQTEETVMY-EFFSKELFD 434
Query: 129 AACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+ A+ + L + + L + + EF++
Sbjct: 435 DTVNEAKANAALQHAATSQKLLFIEPHTKPAITEFVK 471
>gi|134118870|ref|XP_771938.1| hypothetical protein CNBN1180 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50254542|gb|EAL17291.1| hypothetical protein CNBN1180 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 481
Score = 84.3 bits (207), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/157 (29%), Positives = 88/157 (56%), Gaps = 8/157 (5%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
AD+ F+++ETN+K+YAY++++L IL LF I + PNL+VG + ++ + A E GI+
Sbjct: 316 ADDKRFLILETNYKIYAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGIS 375
Query: 76 TEQ------QNAHPRVADKIPS-VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
Q +AHP++ + P + + DQ+ LW+ + NR++T Y EF S+++F+
Sbjct: 376 ARQIIAYLSSHAHPQMYNSPPPLLHPTIVDQLYLWDRERNRLQTEETVMY-EFFSKELFD 434
Query: 129 AACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+ A+ + L + + L + + EF++
Sbjct: 435 DTVNEAKANAALQHAATSQKLLFIEPHTKPAITEFVK 471
>gi|358385986|gb|EHK23582.1| hypothetical protein TRIVIDRAFT_36912 [Trichoderma virens Gv29-8]
Length = 492
Score = 84.3 bits (207), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 85/165 (51%), Gaps = 13/165 (7%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
+ G VV+ETN+++YAY+ S L +L LF+K+ + P+++ G +++ S+ A GIT
Sbjct: 328 GEQAGSVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQAINFGIT 387
Query: 76 TEQ------QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSR 124
+Q +AH ++ P +P V DQIRLW+ + R++TT + +F
Sbjct: 388 ADQIISYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFTDD 447
Query: 125 DVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+ ++ + L+W++ N H +++FL+ + +
Sbjct: 448 KDYLDTARFSEEIGVLVWKNDHARMFFANK--HEQIKDFLKTRKR 490
>gi|336270172|ref|XP_003349845.1| hypothetical protein SMAC_00733 [Sordaria macrospora k-hell]
gi|380095234|emb|CCC06707.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 496
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 89/170 (52%), Gaps = 22/170 (12%)
Query: 14 TIADNCGF---VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAF 70
T AD G +++ETN+++YAY+TS L +L LF ++ + PN++ A+ A
Sbjct: 333 TPADKTGTSKGIIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSAAL------RAI 386
Query: 71 ENGITTEQ------QNAHPRVADKIPS-----VPKNVCDQIRLWESDLNRVETTPAHYYD 119
GIT +Q +AH ++ + +P V DQIRLW+ + R++TT +
Sbjct: 387 GFGITADQIISYLHAHAHQQMVREATVTGKTVLPPTVVDQIRLWQLENERMQTTAGFLFK 446
Query: 120 EFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+F S + + A YA + L+W++ K R + A H +R++L+ + K
Sbjct: 447 DFESTEEYAALSRYAEEIGVLVWKNDK--RQLFFASKHEQLRDYLKSRKK 494
>gi|58262306|ref|XP_568563.1| hypothetical protein CNN01190 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57230737|gb|AAW47046.1| hypothetical protein CNN01190 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 481
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 47/157 (29%), Positives = 87/157 (55%), Gaps = 8/157 (5%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
AD+ F+++ETN+K+YAY++++L IL LF I + PNL+VG + ++ + A E GI+
Sbjct: 316 ADDKRFLILETNYKIYAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGIS 375
Query: 76 TEQ------QNAHPRVADKIPS-VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
Q +AHP++ + P + + DQ+ LW+ + NR++T Y EF S+++F+
Sbjct: 376 ARQIIAYLSSHAHPQMYNSPPPLLHPTIVDQLYLWDRERNRLQTEETVMY-EFFSKELFD 434
Query: 129 AACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLR 165
+ A+ + L + L + + EF++
Sbjct: 435 DTVNEAKANAALQHAAISQKLLFIEPHTKPAITEFVK 471
>gi|402083674|gb|EJT78692.1| RNA polymerase II transcription factor B subunit 2 [Gaeumannomyces
graminis var. tritici R3-111a-1]
Length = 496
Score = 81.6 bits (200), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 86/162 (53%), Gaps = 15/162 (9%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G +++ETN+++YAY+++ L IL LF+++ ++ ++ G + +ES+ A GIT +Q
Sbjct: 335 GSIIIETNYRLYAYTSTPLQIAILGLFAELRFRFAGMVTGRLDRESIKRAISYGITADQV 394
Query: 79 -----QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
+AH ++ K P +P V DQIRLW+ + R++ + +F S+ ++
Sbjct: 395 IEYLAAHAHEQMHRTATLRKKPVLPPTVVDQIRLWQLETERMKIMRGFLFRDFDSQAEYD 454
Query: 129 AACDYARDQSGLLW-EDSKKMRLVVNAEIHMHMREFLRGQNK 169
YA + LLW D++++ + + FL+ + K
Sbjct: 455 DLAKYADEIGVLLWRSDARQLFFASKVD---QLSVFLKARKK 493
>gi|66358470|ref|XP_626413.1| possible transcription factor TFIIH [Cryptosporidium parvum Iowa
II]
gi|46227864|gb|EAK88784.1| possible transcription factor TFIIH [Cryptosporidium parvum Iowa
II]
Length = 709
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 89/174 (51%), Gaps = 30/174 (17%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
++V++NF++Y Y+ S L +ILR +++ + PN+I G +T++ L +A+ G++ EQ
Sbjct: 528 IIVQSNFRIYCYTASPLQAKILRHLCQVKVRGPNIICGILTRKGLLSAYSMGVSAEQILR 587
Query: 79 ---QNAHPRVADKI-----PSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSR----DV 126
NAHP + + +P NV Q++LWE D NR++ + A + ++ + +
Sbjct: 588 FFSSNAHPIILRRFMLEGTSIIPVNVETQLKLWEKDKNRLKISHASTFSDWGASPNDIQL 647
Query: 127 FEAACDYARDQSGLLW---------------EDSKKMRLVVNAEIHMHMREFLR 165
F YAR + LL+ E KK+ LV+ E ++ F+R
Sbjct: 648 FRQTILYARSKDILLYNSPIELTEKELNLNVELQKKIILVIKQEYEDDIKTFIR 701
>gi|358394641|gb|EHK44034.1| hypothetical protein TRIATDRAFT_222970 [Trichoderma atroviride IMI
206040]
Length = 491
Score = 80.9 bits (198), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 82/159 (51%), Gaps = 13/159 (8%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
VV+ETN+++YAY+ S L +L LF+K+ + P+++ G +++ S+ A GIT +Q
Sbjct: 333 VVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQAINFGITADQIIS 392
Query: 79 ---QNAHPRV-----ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAA 130
+AH ++ P +P V DQIRLW+ + R++TT + +F +
Sbjct: 393 YLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFTDDKDYLDT 452
Query: 131 CDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
++ + L+W + N H +++FL+ + +
Sbjct: 453 ARFSEEIGVLVWRNDHARMFFANK--HEQIKDFLKTRKR 489
>gi|401887601|gb|EJT51582.1| hypothetical protein A1Q1_07170 [Trichosporon asahii var. asahii
CBS 2479]
gi|406699646|gb|EKD02845.1| hypothetical protein A1Q2_02789 [Trichosporon asahii var. asahii
CBS 8904]
Length = 451
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 98/192 (51%), Gaps = 31/192 (16%)
Query: 3 IQSRKESWFIPT-----------------IADNCGFVVVETNFKMYAYSTSKLHCEILRL 45
+ RK F PT AD+ F+++ETN+++YAY++++L IL L
Sbjct: 252 VDGRKTDQFFPTHLATSLCSGDTSVNRSATADDKRFLILETNYRVYAYTSNELEIAILNL 311
Query: 46 FSKIEYQLPNLIVGAITKESLYNAFENGITTEQ------QNAHPRVADKIPS-VPKNVCD 98
F I + PNLIVG + + + A + GI+ Q +AHP++ + P + +V D
Sbjct: 312 FVNITIRYPNLIVGHLDRRHVKAAMDKGISAYQIISYLTTHAHPQMYNHPPPLLHPSVTD 371
Query: 99 QIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMR-----LVVN 153
Q+ LW+ + NRV+ + EF S+++FE D A ++ G L ++ + + V+
Sbjct: 372 QLHLWDRERNRVQAQDTVMF-EFYSKELFETCRDEA-ERMGALQHSVQRGQGMAKLIFVD 429
Query: 154 AEIHMHMREFLR 165
I +R+F++
Sbjct: 430 PTIRESLRDFIK 441
>gi|223993063|ref|XP_002286215.1| transcription factor of TFIIH family [Thalassiosira pseudonana
CCMP1335]
gi|220977530|gb|EED95856.1| transcription factor of TFIIH family [Thalassiosira pseudonana
CCMP1335]
Length = 461
Score = 79.3 bits (194), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 53/133 (39%), Positives = 72/133 (54%), Gaps = 16/133 (12%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEY--QLPNLIVGAITKESLYNAFENGITTEQ- 78
VVV+TNF++ AY+ SKLH L LF + +LPN+I IT++S+ +AF G+T +Q
Sbjct: 333 VVVQTNFQVVAYTKSKLHISTLGLFCDVSSFRRLPNVIFFHITRDSIRSAFRLGVTADQI 392
Query: 79 -----QNAHP--RVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDV--FEA 129
+AHP R D+ P VP NV DQI LW+ + +RV + RD F A
Sbjct: 393 LRFLHVHAHPMLRSGDR-PMVPSNVVDQILLWDRERHRVVMDEVCVHQ---CRDAAEFTA 448
Query: 130 ACDYARDQSGLLW 142
YA D L W
Sbjct: 449 VSQYASDVDALAW 461
>gi|71033737|ref|XP_766510.1| hypothetical protein [Theileria parva strain Muguga]
gi|68353467|gb|EAN34227.1| hypothetical protein, conserved [Theileria parva]
Length = 359
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 81/141 (57%), Gaps = 19/141 (13%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A N ++V++NFK+Y Y+ S L +L +++ + PNL+VG +T+ES+ +AF++GIT
Sbjct: 158 AGNDSRIIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLVVGVLTRESVQSAFKSGIT 217
Query: 76 TEQ-------QNAHPRVADK---------IPSVPKNVCDQIRLWESDLNRVETTPA---H 116
+++ + + ++ S+P+NVC Q+++WES+ +R+E P+
Sbjct: 218 SKEIIRFLSPNGMNSSIGNQENTLLNSTFTYSIPENVCRQLKMWESERDRIELCPSIVFK 277
Query: 117 YYDEFPSRDVFEAACDYARDQ 137
+D+ D+F+ +A+ +
Sbjct: 278 RWDQDFMPDLFQRTVRWAQSK 298
>gi|432098035|gb|ELK27922.1| General transcription factor IIH subunit 4 [Myotis davidii]
Length = 377
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 75/153 (49%), Gaps = 28/153 (18%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V
Sbjct: 250 AHQPGFIVVETNYQLYAYTESELQIALIALFSEMLYRFPNMVVA---------------- 293
Query: 76 TEQQNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYAR 135
+ P +P + DQIRLWE + +R+ T Y++F S+ F+ A
Sbjct: 294 ------------QTPVLPPTITDQIRLWELERDRLLFTEGVLYNQFLSQVDFDLLLARAW 341
Query: 136 DQSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
+ L++E+ K +VV H ++ F + Q
Sbjct: 342 ELGALVFENMAKQLMVVTPAGHREVKRFWKQQK 374
>gi|84998132|ref|XP_953787.1| TFIIH subunit, Tf2b [Theileria annulata]
gi|65304784|emb|CAI73109.1| TFIIH subunit, Tf2b homologue, putative [Theileria annulata]
Length = 645
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 81/145 (55%), Gaps = 19/145 (13%)
Query: 12 IPTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 71
I A N ++V++NFK+Y Y+ S L +L +++ + PNL+VG +T+ES+ +AF+
Sbjct: 441 ISIKAGNDSRIIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLVVGVLTRESVQSAFK 500
Query: 72 NGITTEQ-------QNAHPRVADK---------IPSVPKNVCDQIRLWESDLNRVETTPA 115
+GIT+++ + + + S+P+NVC Q+++WES+ +R+E P+
Sbjct: 501 SGITSKEIIRFLSPNGMNSSIGSQENTLLNSSFTYSIPENVCRQLKMWESERDRIELCPS 560
Query: 116 ---HYYDEFPSRDVFEAACDYARDQ 137
+D+ D+F+ +A+ +
Sbjct: 561 IVFKRWDQDFMPDLFQRTVRWAQSK 585
>gi|428673252|gb|EKX74165.1| conserved hypothetical protein [Babesia equi]
Length = 668
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 64/98 (65%), Gaps = 9/98 (9%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQNA 81
++V++NFK+Y Y+ + L +IL + +++ + PN++VG +T+ES AF++GIT +
Sbjct: 494 IIVQSNFKVYVYTVNNLQFDILNILCEVQARTPNMVVGVLTRESAQRAFKSGIT-----S 548
Query: 82 HPRV----ADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
H + + + P+NV Q+R+WE++ NRVE +PA
Sbjct: 549 HEIIRFFSSTNTSTFPENVIRQLRMWEAERNRVELSPA 586
>gi|221502519|gb|EEE28246.1| tfiih, polypeptide, putative [Toxoplasma gondii VEG]
Length = 836
Score = 77.4 bits (189), Expect = 2e-12, Method: Composition-based stats.
Identities = 37/105 (35%), Positives = 66/105 (62%), Gaps = 11/105 (10%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
++V++NFK+Y Y+ S L +L +++ ++PNLIVG +T+ S+ A+++GIT +Q
Sbjct: 650 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 709
Query: 79 ---QNAHPRVADK-----IPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP V ++ P +P+NV Q+R+WE++ R+ PA
Sbjct: 710 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 754
>gi|221482097|gb|EEE20458.1| tfiih, polypeptide, putative [Toxoplasma gondii GT1]
Length = 861
Score = 77.4 bits (189), Expect = 2e-12, Method: Composition-based stats.
Identities = 37/105 (35%), Positives = 66/105 (62%), Gaps = 11/105 (10%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
++V++NFK+Y Y+ S L +L +++ ++PNLIVG +T+ S+ A+++GIT +Q
Sbjct: 675 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 734
Query: 79 ---QNAHPRVADK-----IPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP V ++ P +P+NV Q+R+WE++ R+ PA
Sbjct: 735 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 779
>gi|237842971|ref|XP_002370783.1| hypothetical protein TGME49_014520 [Toxoplasma gondii ME49]
gi|211968447|gb|EEB03643.1| hypothetical protein TGME49_014520 [Toxoplasma gondii ME49]
Length = 836
Score = 77.4 bits (189), Expect = 2e-12, Method: Composition-based stats.
Identities = 37/105 (35%), Positives = 66/105 (62%), Gaps = 11/105 (10%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
++V++NFK+Y Y+ S L +L +++ ++PNLIVG +T+ S+ A+++GIT +Q
Sbjct: 650 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 709
Query: 79 ---QNAHPRVADK-----IPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP V ++ P +P+NV Q+R+WE++ R+ PA
Sbjct: 710 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 754
>gi|401410644|ref|XP_003884770.1| hypothetical protein NCLIV_051680 [Neospora caninum Liverpool]
gi|325119188|emb|CBZ54742.1| hypothetical protein NCLIV_051680 [Neospora caninum Liverpool]
Length = 868
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 66/105 (62%), Gaps = 11/105 (10%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
++V++NFK+Y Y+ S L +L +++ ++PNLIVG +T+ S+ A+++GIT +Q
Sbjct: 682 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 741
Query: 79 ---QNAHPRVADK-----IPSVPKNVCDQIRLWESDLNRVETTPA 115
+AHP V ++ P +P+NV Q+R+WE++ R+ PA
Sbjct: 742 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 786
>gi|403221352|dbj|BAM39485.1| transcription factor IIH subunit [Theileria orientalis strain
Shintoku]
Length = 634
Score = 75.5 bits (184), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 67/116 (57%), Gaps = 16/116 (13%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
A N ++V++NFK+Y Y+ S L +L +++ + PNL+VG +T+ES+ +AF++GIT
Sbjct: 415 AGNESRIIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLVVGVLTRESVQSAFKSGIT 474
Query: 76 TEQ--------------QNAHPRVADKIPS--VPKNVCDQIRLWESDLNRVETTPA 115
+ + D + +P+NVC Q+++WES+ +R+E +PA
Sbjct: 475 SNEIIRFLSPMKLSLSYAGTCNSFMDSFANYKIPENVCRQLKMWESERDRIELSPA 530
>gi|302831319|ref|XP_002947225.1| hypothetical protein VOLCADRAFT_35893 [Volvox carteri f.
nagariensis]
gi|300267632|gb|EFJ51815.1| hypothetical protein VOLCADRAFT_35893 [Volvox carteri f.
nagariensis]
Length = 90
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 32/70 (45%), Positives = 44/70 (62%)
Query: 82 HPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLL 141
HP +A + P VP+ V DQIRLWE+ +NR+ Y+ SR++FE A ++R LL
Sbjct: 16 HPSIASRTPVVPEVVSDQIRLWEASMNRLRADSVVLYENLASRELFERALAFSRSSGTLL 75
Query: 142 WEDSKKMRLV 151
WEDS +MR V
Sbjct: 76 WEDSGQMRFV 85
>gi|209880858|ref|XP_002141868.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209557474|gb|EEA07519.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 794
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 41/156 (26%), Positives = 82/156 (52%), Gaps = 15/156 (9%)
Query: 4 QSRKESWFIPTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITK 63
S + + F+ I++ ++V++NF++Y Y+ S L +ILR +++ + PN+I G +T+
Sbjct: 588 HSIESNLFLSGISNLEAGIIVQSNFRVYCYTASPLQAKILRHLCQVKVRGPNIICGVLTR 647
Query: 64 ESLYNAFENGITTEQ------QNAHP-----RVADKIPSVPKNVCDQIRLWESDLNRVET 112
L +A+ G+ Q NAHP + D +P +V Q++LWE+D NR+E
Sbjct: 648 RGLLSAYSMGVKAYQILRFFISNAHPIILKKHITDGTSIIPISVETQLKLWENDHNRLEI 707
Query: 113 TPAHYYDEFPSR----DVFEAACDYARDQSGLLWED 144
+ ++ + ++F+ YA + +L+ +
Sbjct: 708 NKVSLFSDWGNNKEDIELFKQTVTYAIGKQVVLYHN 743
>gi|398406290|ref|XP_003854611.1| hypothetical protein MYCGRDRAFT_69317 [Zymoseptoria tritici IPO323]
gi|339474494|gb|EGP89587.1| hypothetical protein MYCGRDRAFT_69317 [Zymoseptoria tritici IPO323]
Length = 493
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/126 (34%), Positives = 70/126 (55%), Gaps = 11/126 (8%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G++++ETN+++YAY++S L IL LFS + + PNLI ITK S+ +A IT++Q
Sbjct: 321 GYIILETNYRLYAYTSSPLQISILSLFSTLHTRYPNLITAKITKSSIASAISQSITSDQI 380
Query: 79 -----QNAHPRVADK-----IPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFE 128
+AHP + + P +P V DQIRLW+ + R+ T Y + + +
Sbjct: 381 ITYLSTHAHPVLRRQAAIMSAPILPPTVVDQIRLWQIEGERMTTWKGFYIRDVGGWEEYV 440
Query: 129 AACDYA 134
A +YA
Sbjct: 441 KAVEYA 446
>gi|422294120|gb|EKU21420.1| transcription initiation factor TFIIH subunit 4, partial
[Nannochloropsis gaditana CCMP526]
Length = 213
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 73/130 (56%), Gaps = 10/130 (7%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE---- 77
V+VETN+++ AY++S LH E+LR+F+ + +LPN+++G IT+ S+ A +GIT
Sbjct: 84 VIVETNYQLVAYTSSSLHVEMLRIFTDVRCRLPNVVIGFITRASVRRAMASGITAATILS 143
Query: 78 --QQNAHPRV-ADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDV--FEAACD 132
+ + H V A K +P+NV QI LW + +RV+ D S V FE
Sbjct: 144 FLKTHTHVAVRAGKGRLLPENVEAQIELWHQERSRVKFEEVMMID-LSSLMVEEFEEVRT 202
Query: 133 YARDQSGLLW 142
YA + + + W
Sbjct: 203 YAENLAVVCW 212
>gi|156086630|ref|XP_001610724.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797977|gb|EDO07156.1| conserved hypothetical protein [Babesia bovis]
Length = 586
Score = 72.4 bits (176), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 63/103 (61%), Gaps = 9/103 (8%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
+VV++NFK+YAY S L +L +++ + PNL++G +T+ SL AF++GIT +Q
Sbjct: 396 MVVQSNFKIYAYIPSALQMNVLNHICELQAKTPNLVIGVLTRSSLQTAFKSGITADQLIC 455
Query: 79 ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
Q ++ + +VP+NV Q+++WE++ NR+E A
Sbjct: 456 FFESKGQYDDIQINRNVMNVPENVRRQLKMWEAERNRLELLNA 498
>gi|387595518|gb|EIJ93142.1| hypothetical protein NEPG_02098 [Nematocida parisii ERTm1]
Length = 398
Score = 72.4 bits (176), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 76/145 (52%), Gaps = 1/145 (0%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQN 80
F+V+E+NF++Y YS L I+ LFS + PN+IV I ++S+ A +GIT Q
Sbjct: 251 FLVLESNFRLYIYSNRPLDTFIISLFSIKSREFPNMIVAMINEDSIRQALMHGITAGQIR 310
Query: 81 AHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGL 140
+ + + + +NV +QIRLWE +NR+ + ++ + F + F Y + +
Sbjct: 311 VYLN-QNSMYEINENVIEQIRLWEKRMNRIHSWESYIFSNFLNYKDFLLVESYCENNNID 369
Query: 141 LWEDSKKMRLVVNAEIHMHMREFLR 165
+K LVV E + ++ F+R
Sbjct: 370 HRSYREKRMLVVGIENYESVKSFIR 394
>gi|323455511|gb|EGB11379.1| hypothetical protein AURANDRAFT_52511 [Aureococcus anophagefferens]
Length = 291
Score = 72.0 bits (175), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 60/103 (58%), Gaps = 13/103 (12%)
Query: 22 VVVETNFKMYAYS-----TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
V+V+TNF++ AY+ TS L L LF+++ +LPNL+VG I+++++ + GI
Sbjct: 126 VIVQTNFQVLAYTDAGVNTSTLVLATLNLFAELTTRLPNLVVGTISRDAIKRCVDRGIRV 185
Query: 77 EQ------QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETT 113
Q +AHP A K VP+NV DQ+ LW + NRV T
Sbjct: 186 PQIVKFLRAHAHP--AMKASGVPQNVTDQMALWAGEGNRVAFT 226
>gi|399949686|gb|AFP65344.1| tfIIB-like protein [Chroomonas mesostigmatica CCMP1168]
Length = 464
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 59/100 (59%), Gaps = 10/100 (10%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE---- 77
+++E+NF++Y Y + L ++L +FS+I YQLPNL VG IT++S+ AF +G T+
Sbjct: 329 IIIESNFRIYVYKNAHLGNDLLLVFSEILYQLPNLFVGEITEKSILKAFNSGSTSRNIIG 388
Query: 78 --QQNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPA 115
+ N HP P +P V +QI+ WE ++ T +
Sbjct: 389 FLKNNLHPIC----PWIPSAVTNQIKAWEFQKTEIKITES 424
>gi|387592902|gb|EIJ87926.1| hypothetical protein NEQG_01998 [Nematocida parisii ERTm3]
Length = 398
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 76/145 (52%), Gaps = 1/145 (0%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQN 80
F+V+E+NF++Y YS L I+ LFS + PN++V I ++S+ A +GIT Q
Sbjct: 251 FLVLESNFRLYIYSNRPLDTFIISLFSIKSREFPNMMVAMINEDSIRQALMHGITAGQIR 310
Query: 81 AHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGL 140
+ + + + +NV +QIRLWE +NR+ + ++ + F + F Y + +
Sbjct: 311 VYLN-QNSMYEINENVIEQIRLWEKRMNRIHSWESYIFSNFLNYKDFLLVESYCENNNID 369
Query: 141 LWEDSKKMRLVVNAEIHMHMREFLR 165
+K LVV E + ++ F+R
Sbjct: 370 HRSYREKRMLVVGIENYESVKSFIR 394
>gi|389586531|dbj|GAB69260.1| hypothetical protein PCYB_146880 [Plasmodium cynomolgi strain B]
Length = 913
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 62/106 (58%), Gaps = 6/106 (5%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
++V++NFK+Y Y+ S L IL +++ + PN++VG +T+ S+ NA+ + IT Q
Sbjct: 733 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 792
Query: 79 ---QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEF 121
AHP ++ S+P NV Q++LWES+ +R+ A + F
Sbjct: 793 FLESYAHPGRSNFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 838
>gi|221061889|ref|XP_002262514.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
gi|193811664|emb|CAQ42392.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
Length = 943
Score = 70.5 bits (171), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 62/106 (58%), Gaps = 6/106 (5%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
++V++NFK+Y Y+ S L IL +++ + PN++VG +T+ S+ NA+ + IT Q
Sbjct: 763 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 822
Query: 79 ---QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEF 121
AHP ++ S+P NV Q++LWES+ +R+ A + F
Sbjct: 823 FLESYAHPGRSNFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 868
>gi|412992644|emb|CCO18624.1| predicted protein [Bathycoccus prasinos]
Length = 464
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 33/87 (37%), Positives = 53/87 (60%), Gaps = 18/87 (20%)
Query: 10 WFIPTI------------------ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEY 51
W++PT+ A G ++VETNF++YAY+ S+L E+LRLF++ +Y
Sbjct: 373 WYVPTVLSAGLSSVSTTSSAKSALARIDGHIIVETNFRVYAYTHSELETEVLRLFTRPDY 432
Query: 52 QLPNLIVGAITKESLYNAFENGITTEQ 78
+LPN VG IT++S+ +A GI+ +Q
Sbjct: 433 KLPNAYVGMITRDSILDAMRAGISPDQ 459
>gi|392594381|gb|EIW83705.1| transcription factor Tfb2 [Coniophora puteana RWD-64-598 SS2]
Length = 393
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 28/62 (45%), Positives = 46/62 (74%)
Query: 17 DNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITT 76
++ GF+V+ETN+++YAY+ + L +L LF ++Y+ PNL+VG +T+ES+ A NGI+
Sbjct: 325 NDHGFIVLETNYRVYAYTDNPLQIAVLNLFITLKYRFPNLVVGMLTRESVRRALGNGISA 384
Query: 77 EQ 78
EQ
Sbjct: 385 EQ 386
>gi|156095867|ref|XP_001613968.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148802842|gb|EDL44241.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 960
Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 61/106 (57%), Gaps = 6/106 (5%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
++V++NFK+Y Y+ S L IL +++ + PN++VG +T+ S+ NA+ + IT Q
Sbjct: 780 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 839
Query: 79 ---QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEF 121
AHP + S+P NV Q++LWES+ +R+ A + F
Sbjct: 840 FLESYAHPGRSTFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 885
>gi|378754852|gb|EHY64880.1| hypothetical protein NERG_01936 [Nematocida sp. 1 ERTm2]
Length = 396
Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 73/145 (50%), Gaps = 1/145 (0%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQN 80
F+ +E+NF++Y YS L I+ LFS + PN+IV I ++S+ A GIT Q
Sbjct: 249 FLTLESNFRLYIYSNRPLDVFIISLFSIKIREFPNMIVAMINEDSIRQALTYGITAGQIR 308
Query: 81 AHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGL 140
+ + + + +NV +QIRLWE +NR+ ++ + F + F Y + +
Sbjct: 309 VYLN-QNSMHKINENVLEQIRLWEKRMNRIHAWESYIFSNFLNYKDFLLVESYCENNNVE 367
Query: 141 LWEDSKKMRLVVNAEIHMHMREFLR 165
K LVV E + +++ F+R
Sbjct: 368 HRSYRDKRVLVVGVENYDNVKSFIR 392
>gi|124806778|ref|XP_001350829.1| transcription factor Tfb2, putative [Plasmodium falciparum 3D7]
gi|23496958|gb|AAN36509.1| transcription factor Tfb2, putative [Plasmodium falciparum 3D7]
Length = 945
Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 62/106 (58%), Gaps = 6/106 (5%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
++V++NFK+Y Y++S L IL +++ + PN++VG +T+ S+ NA+ + IT Q
Sbjct: 766 LIVQSNFKVYLYTSSLLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 825
Query: 79 ---QNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEF 121
+HP + S+P NV Q++LWES+ +R+ A + F
Sbjct: 826 FLESYSHPGKNNFKSSIPMNVITQLKLWESERHRLTLEDAIVFKSF 871
>gi|194388768|dbj|BAG60352.1| unnamed protein product [Homo sapiens]
Length = 386
Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 51/76 (67%), Gaps = 6/76 (7%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
GF+VVETN+++YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT +Q
Sbjct: 311 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 370
Query: 79 -----QNAHPRVADKI 89
AHP + ++
Sbjct: 371 IHFLRTRAHPVMLKQV 386
>gi|440295536|gb|ELP88449.1| TFIIH basal transcription factor complex p52 subunit, putative
[Entamoeba invadens IP1]
Length = 531
Score = 69.3 bits (168), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 78/152 (51%), Gaps = 10/152 (6%)
Query: 23 VVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQNAH 82
+ ETN+ +Y Y+ S ++ LF + QL NL VG IT +++ AF GIT E
Sbjct: 291 ITETNYTVYIYTESFFQVKLYSLFIRQNLQLTNLCVGRITYDTVTEAFLKGITNEM---- 346
Query: 83 PRVADKIPSVPKNVCDQIRLWESDLNRVETTPA----HYYDEFP-SRDVFEAACDYARDQ 137
V P++PKN+ QI LW+ +LNR++ A Y E ++++ A
Sbjct: 347 -LVNFLQPNLPKNIQAQIDLWKRELNRLKEVRAVKFRFYEPELEVQKELYHLTKSEAEKM 405
Query: 138 SGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
G+++ +++ L V ++ ++EFLR + +
Sbjct: 406 KGVVFYKEEELTLFVRYDVAEKIKEFLRRKTR 437
>gi|60603004|gb|AAX27959.1| unknown [Schistosoma japonicum]
Length = 100
Score = 68.9 bits (167), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 32/74 (43%), Positives = 47/74 (63%), Gaps = 6/74 (8%)
Query: 16 ADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 75
+ + G++++ETNF++YAY+ S L +L LFSKI + PNL+V IT++S+ A GIT
Sbjct: 10 SSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGIT 69
Query: 76 TEQ------QNAHP 83
Q NAHP
Sbjct: 70 ANQILSFLTSNAHP 83
>gi|399219100|emb|CCF75987.1| unnamed protein product [Babesia microti strain RI]
Length = 612
Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 58/99 (58%), Gaps = 10/99 (10%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
++V++NFK+Y Y+ S L +L +++ + PNL++G +T+ S + AF GIT +Q
Sbjct: 432 IIVQSNFKVYVYTASPLQINVLAHLCELQSRTPNLVIGVLTRASAHAAFRAGITAKQICQ 491
Query: 79 ---QNAHPRVADKI----PSVPKNVCDQIRLWESDLNRV 110
++HP + + +P NV Q+ +WE++ NR+
Sbjct: 492 FLETHSHPILLQNVREGGSYLPNNVVTQLNMWEAERNRI 530
>gi|269861689|ref|XP_002650538.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
gi|220065990|gb|EED43520.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
Length = 291
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 74/142 (52%), Gaps = 10/142 (7%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQN 80
F+ +ETNFK+Y+Y+ + IL LFS I +++P +I + ++ + N + GI +Q
Sbjct: 156 FIYLETNFKLYSYTNNLCDRAILDLFSNIVFEVPGMIKSILDEKKIMNTLDRGIKIQQ-- 213
Query: 81 AHPRVADKIPSVPKNVCDQ----IRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARD 136
+ + I N CDQ I +W+ NR+ + Y +F + + + + + +
Sbjct: 214 ----IVEYIRRHTINNCDQILHMIEIWDKQRNRINKQIGYLYSDFTNYNEYRSVLEQIKT 269
Query: 137 QSGLLWEDSKKMRLVVNAEIHM 158
+ LL+++ ++ L V ++H+
Sbjct: 270 DTDLLYKNEEERILFVKNKLHI 291
>gi|269860316|ref|XP_002649880.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
gi|220066720|gb|EED44193.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
Length = 369
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 74/142 (52%), Gaps = 10/142 (7%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQN 80
F+ +ETNFK+Y+Y+ + IL LFS I +++P +I + ++ + N + GI +Q
Sbjct: 234 FIYLETNFKLYSYTNNLCDRAILDLFSNIVFEVPGMIKSILDEKKIMNTLDRGIKIQQ-- 291
Query: 81 AHPRVADKIPSVPKNVCDQ----IRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARD 136
+ + I N CDQ I +W+ NR+ + Y +F + + + + + +
Sbjct: 292 ----IVEYIRRHTINNCDQILHMIEIWDKQRNRINKQIGYLYSDFTNYNEYRSVLEQIKT 347
Query: 137 QSGLLWEDSKKMRLVVNAEIHM 158
+ LL+++ ++ L V ++H+
Sbjct: 348 DTDLLYKNEEERILFVKNKLHI 369
>gi|403348380|gb|EJY73625.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Oxytricha trifallax]
Length = 532
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 29/58 (50%), Positives = 45/58 (77%)
Query: 21 FVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ 78
F++VETNFK+YAY++S L+ +LR F ++E PNL+VG +T++SL AF+ GI++ Q
Sbjct: 323 FLIVETNFKVYAYTSSDLYRALLRQFIRVECIFPNLVVGTLTRKSLQKAFQRGISSGQ 380
>gi|365758016|gb|EHM99881.1| Tfb2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 419
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/72 (43%), Positives = 48/72 (66%), Gaps = 6/72 (8%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G ++VETNFK+Y+YS S L +L LF ++ + N+++G IT+ES+ NA NGIT +Q
Sbjct: 337 GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRNALTNGITADQI 396
Query: 79 -----QNAHPRV 85
+AHP++
Sbjct: 397 IAYLETHAHPQM 408
>gi|238570605|ref|XP_002386883.1| hypothetical protein MPER_14683 [Moniliophthora perniciosa FA553]
gi|215440101|gb|EEB87813.1| hypothetical protein MPER_14683 [Moniliophthora perniciosa FA553]
Length = 69
Score = 66.2 bits (160), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 30/64 (46%), Positives = 45/64 (70%), Gaps = 5/64 (7%)
Query: 20 GFVVVETNFKMYAYSTS-----KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGI 74
GF+V+ETN+++YAY+ S L IL LF ++Y+ PNL+VG+IT+ES+ A NGI
Sbjct: 6 GFIVLETNYRVYAYTGSFSLHNPLQTAILHLFVSLKYRFPNLVVGSITRESVKKALSNGI 65
Query: 75 TTEQ 78
+ +Q
Sbjct: 66 SADQ 69
>gi|429854513|gb|ELA29524.1| tfiih and nucleotide excision repair factor 3 complexes subunit
[Colletotrichum gloeosporioides Nara gc5]
Length = 509
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 58/101 (57%), Gaps = 13/101 (12%)
Query: 19 CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ 78
G ++VETN+++YAY + L +L LF K++ + +++ G +T+ S+ NA E GIT +Q
Sbjct: 370 SGSIIVETNYRVYAYGQTPLQIAVLSLFCKLKLRFADMVSGRLTRNSIRNAVERGITADQ 429
Query: 79 ------QNAHPR------VADKIPSVPKNVCDQIRLWESDL 107
+AH + V +K P +P V DQIRL D+
Sbjct: 430 IISYLASHAHEQMHRFAAVRNK-PVLPPTVIDQIRLEYEDI 469
>gi|82704497|ref|XP_726580.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23482045|gb|EAA18145.1| Drosophila melanogaster LD30622p-related [Plasmodium yoelii yoelii]
Length = 926
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 58/95 (61%), Gaps = 6/95 (6%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
+++++NFK+Y Y++S L IL +++ + PN++VG +T+ S+ NA+ + IT +Q
Sbjct: 746 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 805
Query: 79 ---QNAHPRVADKIPSVPKNVCDQIRLWESDLNRV 110
+HP +P NV Q++LWE++ +R+
Sbjct: 806 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRL 840
>gi|70950974|ref|XP_744765.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56524852|emb|CAH80923.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 922
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 58/95 (61%), Gaps = 6/95 (6%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
+++++NFK+Y Y++S L IL +++ + PN++VG +T+ S+ NA+ + IT +Q
Sbjct: 742 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 801
Query: 79 ---QNAHPRVADKIPSVPKNVCDQIRLWESDLNRV 110
+HP +P NV Q++LWE++ +R+
Sbjct: 802 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRL 836
>gi|68070707|ref|XP_677265.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56497310|emb|CAH94094.1| conserved hypothetical protein [Plasmodium berghei]
Length = 929
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 58/95 (61%), Gaps = 6/95 (6%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ--- 78
+++++NFK+Y Y++S L IL +++ + PN++VG +T+ S+ NA+ + IT +Q
Sbjct: 749 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 808
Query: 79 ---QNAHPRVADKIPSVPKNVCDQIRLWESDLNRV 110
+HP +P NV Q++LWE++ +R+
Sbjct: 809 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRL 843
>gi|164660182|ref|XP_001731214.1| hypothetical protein MGL_1397 [Malassezia globosa CBS 7966]
gi|159105114|gb|EDP44000.1| hypothetical protein MGL_1397 [Malassezia globosa CBS 7966]
Length = 231
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 49/72 (68%), Gaps = 6/72 (8%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ- 78
G++++ETN+++YAY+ + L IL LF ++ +LPNL++G +T+ S+ +A GIT +Q
Sbjct: 148 GYLILETNYRVYAYTANLLRIAILNLFVTLKSRLPNLVIGQLTRHSVKSALNKGITADQI 207
Query: 79 -----QNAHPRV 85
+AHP++
Sbjct: 208 ITYLTHHAHPQM 219
>gi|428181353|gb|EKX50217.1| TFB2 transcription factor B2 nucleotide excision repair [Guillardia
theta CCMP2712]
Length = 463
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/136 (31%), Positives = 66/136 (48%), Gaps = 15/136 (11%)
Query: 42 ILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQNAHPRVADKIPSVPKNVCDQIR 101
ILRLF++ Y+ P++I+ IT+ES+ NA I + +AHP+ + P VP+ V DQI
Sbjct: 324 ILRLFTRPIYKFPHMIIAVITRESIRNAL--IIEYLRMHAHPQCLENWPIVPEVVTDQIC 381
Query: 102 LWESDLNRVETTPAHYYDEFPSRDVFEAACDYAR-------------DQSGLLWEDSKKM 148
WE + R+ PA Y F S + A AR + L+ D+ +
Sbjct: 382 FWEQERCRIRAEPAVAYHNFFSGEAHRACEKEARRLKFNDKGPGEVAKREACLYSDTSEQ 441
Query: 149 RLVVNAEIHMHMREFL 164
LVV +R+F+
Sbjct: 442 LLVVPEAADPLIRKFI 457
>gi|67483069|ref|XP_656826.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56474050|gb|EAL51440.1| hypothetical protein EHI_192400 [Entamoeba histolytica HM-1:IMSS]
gi|449706224|gb|EMD46115.1| tfiih basal transcription factor complex p52 subunit, putative
[Entamoeba histolytica KU27]
Length = 484
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 37/150 (24%), Positives = 72/150 (48%), Gaps = 8/150 (5%)
Query: 23 VVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQNAH 82
+ E N+ +Y Y+ S+ ++ RLF + +QL NL VG + +S+ AF GIT+E
Sbjct: 290 ITEINYSVYIYTESQFQVDLYRLFIRKNFQLANLWVGKLNHQSVTEAFAKGITSEMLINF 349
Query: 83 PRVADKIPSVPKNVCDQIRLWESDLNRVE---TTPAHYYDEFPSRDVFEAACDYARDQSG 139
+ P++P+ + QI LW+ ++NR + +YD+ R +F + +
Sbjct: 350 LQ-----PNLPRTIQKQIDLWKKEINRFKEEHVVRYRFYDDAIGRQLFVIVKNESDRLKA 404
Query: 140 LLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+ E + V + +R F++ + +
Sbjct: 405 TILEKEDIRLIFVKYQYGETIRSFMKKKER 434
>gi|167395228|ref|XP_001741280.1| TFIIH basal transcription factor complex p52 subunit [Entamoeba
dispar SAW760]
gi|165894177|gb|EDR22239.1| TFIIH basal transcription factor complex p52 subunit, putative
[Entamoeba dispar SAW760]
Length = 489
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/150 (24%), Positives = 72/150 (48%), Gaps = 8/150 (5%)
Query: 23 VVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQNAH 82
+ E N+ +Y Y+ S+ ++ RLF + +QL NL VG + +S+ AF GIT+E
Sbjct: 290 ITEINYSVYIYTESQFQVDLYRLFIRKNFQLANLWVGKLNHQSVTEAFAKGITSEMLINF 349
Query: 83 PRVADKIPSVPKNVCDQIRLWESDLNRVE---TTPAHYYDEFPSRDVFEAACDYARDQSG 139
+ P++P+ + QI LW+ ++NR + +YD+ R +F + +
Sbjct: 350 LQ-----PNLPRTIQKQIDLWKKEINRFKEEHVVRYKFYDDAIGRQLFIIVKNESDRLKA 404
Query: 140 LLWEDSKKMRLVVNAEIHMHMREFLRGQNK 169
+ E + V + +R F++ + +
Sbjct: 405 TILEKEDIRLIFVKYQYGETIRSFMKKKER 434
>gi|380476902|emb|CCF44454.1| RNA polymerase II transcription factor B subunit 2 [Colletotrichum
higginsianum]
Length = 126
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 61/125 (48%), Gaps = 13/125 (10%)
Query: 56 LIVGAITKESLYNAFENGITTEQ------QNAHPRV-----ADKIPSVPKNVCDQIRLWE 104
++ G +T+ S+ NA E GIT +Q +AH ++ P +P V DQIRLW+
Sbjct: 1 MVSGRLTRNSIRNAVERGITADQIISYLAAHAHEQMHRMAAVRSRPVLPPTVVDQIRLWQ 60
Query: 105 SDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEIHMHMREFL 164
+ R+ TT + +F S +E YA + L+W + K A H +R++L
Sbjct: 61 LETERMTTTSGFLFRDFDSPKEYEVIAGYASEIGVLVWRNDKLGMFF--ASKHEQIRDYL 118
Query: 165 RGQNK 169
+ + K
Sbjct: 119 KLRKK 123
>gi|160331881|ref|XP_001712647.1| hypothetical protein HAN_3g523 [Hemiselmis andersenii]
gi|159766096|gb|ABW98322.1| hypothetical protein HAN_3g523 [Hemiselmis andersenii]
Length = 432
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 55/94 (58%), Gaps = 6/94 (6%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQNA 81
+++E+N+++Y Y + + ++ +FS+I Y LPN VG IT+ S+ A ++GIT QN
Sbjct: 300 IIIESNYRIYVYQKNSNNNQLFLIFSEILYILPNFFVGEITETSISRALKSGITI--QNI 357
Query: 82 HPRVADKI----PSVPKNVCDQIRLWESDLNRVE 111
+ + + S+P + +Q RLWE +++
Sbjct: 358 LGFIRENLHCVCRSIPSTILNQFRLWEFQKKKIK 391
>gi|47212202|emb|CAF90416.1| unnamed protein product [Tetraodon nigroviridis]
Length = 471
Score = 55.5 bits (132), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 42/70 (60%), Gaps = 5/70 (7%)
Query: 51 YQLPNLIVGAITKESLYNAFENGITTEQQN-----AHPRVADKIPSVPKNVCDQIRLWES 105
Y+ PN++V +T+ES+ A NGIT +Q+ + P + P +P + DQIRLWE
Sbjct: 282 YRFPNVVVAQVTRESVQQAIANGITAQQRLTFDLWSSPTPLSQTPVLPPTITDQIRLWEL 341
Query: 106 DLNRVETTPA 115
+ +R++ T A
Sbjct: 342 ERDRLQFTEA 351
>gi|167535774|ref|XP_001749560.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771952|gb|EDQ85611.1| predicted protein [Monosiga brevicollis MX1]
Length = 417
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 36/60 (60%)
Query: 19 CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ 78
GF++ ETNF +YAY+ S + +L F+ + Y+LP + IT+ S+ A GIT +Q
Sbjct: 291 TGFLLAETNFHVYAYTDSAVKVALLSKFATLTYRLPIMTTAIITRTSVRRALSQGITADQ 350
>gi|340508252|gb|EGR33999.1| hypothetical protein IMG5_027880 [Ichthyophthirius multifiliis]
Length = 454
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 39/66 (59%), Gaps = 10/66 (15%)
Query: 22 VVVETNFKMYAY----------STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 71
++VETNF +YAY S SK ++L+ FSKI Y P+LIV +T+ + AF
Sbjct: 306 IIVETNFNLYAYLDYNPQNKTFSESKYIRDLLKKFSKIHYTFPHLIVAQLTEAKMKQAFN 365
Query: 72 NGITTE 77
GIT++
Sbjct: 366 QGITSK 371
>gi|330038479|ref|XP_003239608.1| TATA binding protein of transcription factor IIB-like protein
[Cryptomonas paramecium]
gi|327206532|gb|AEA38710.1| TATA binding protein of transcription factor IIB-like protein
[Cryptomonas paramecium]
Length = 467
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 50/98 (51%), Gaps = 6/98 (6%)
Query: 19 CGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQ 78
C +++E+N+++YAY+ + +IL F + Y LP L VG I K S++ A GI
Sbjct: 316 CFQMIIESNYRVYAYNNNTFLNKILIQFCDLIYNLPGLFVGEINKTSIHKAIHKGINA-- 373
Query: 79 QNAHPRVADKIPSVPKNVC----DQIRLWESDLNRVET 112
+N + + +N C +QIR+WE + T
Sbjct: 374 KNIISFILKNSHYIHQNSCNPIINQIRIWEFQQRKTST 411
>gi|123488965|ref|XP_001325283.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121908180|gb|EAY13060.1| hypothetical protein TVAG_212420 [Trichomonas vaginalis G3]
Length = 460
Score = 48.9 bits (115), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/132 (23%), Positives = 61/132 (46%), Gaps = 4/132 (3%)
Query: 20 GFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTEQQ 79
G+++++TNFK+ A+ S LH +L+ F+ + Y+ P I+ S A G T +
Sbjct: 294 GWMLIDTNFKITAFPKSPLHTALLKKFANVTYEFPGFASAFISPNSFREALNQGTTLDDI 353
Query: 80 NAHPR--VADKIPS--VPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYAR 135
+ ++ KI S +P V Q +W R+ T ++ + + A A+
Sbjct: 354 IGFLKSNLSHKIGSGQIPSAVMKQFYVWRDQRERLTVTHECIMRQYTNPNDANLAAQCAK 413
Query: 136 DQSGLLWEDSKK 147
+G ++ ++K
Sbjct: 414 QLAGYVYGPAEK 425
>gi|294950275|ref|XP_002786548.1| tfiih, polypeptide, putative [Perkinsus marinus ATCC 50983]
gi|239900840|gb|EER18344.1| tfiih, polypeptide, putative [Perkinsus marinus ATCC 50983]
Length = 580
Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 58/132 (43%), Gaps = 40/132 (30%)
Query: 24 VETNFKMYAYSTSKLHCEILRLFSKIEYQL-------PN---LIVGAITKESLYNAFENG 73
V++NF + AY+TS L +L F +++ QL PN ++G +T+ S+ +A + G
Sbjct: 397 VDSNFAVTAYTTSSLDLRLLGTFVQLQRQLGDGREYDPNDFGCVLGTLTQSSVQSAAQRG 456
Query: 74 ITTE------QQNAHPRVADK----------------------IPSVPKNVCDQIRLWES 105
+T+E + + PR A I +P NV QI LWE
Sbjct: 457 VTSEYIISYLKSHVDPRAAHMGSQGGRSSAATANTGAARGEKFIDGIPANVVTQITLWER 516
Query: 106 DL--NRVETTPA 115
+ NR+ P
Sbjct: 517 EAIHNRLRIDPG 528
>gi|414589532|tpg|DAA40103.1| TPA: hypothetical protein ZEAMMB73_417245 [Zea mays]
Length = 877
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 36/58 (62%), Gaps = 9/58 (15%)
Query: 94 KNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLV 151
KN + + ++DLNRVE P++ Y++FPS+ ARD LL +DSK+MRL+
Sbjct: 814 KNYSNSENMRQTDLNRVEMIPSYLYEDFPSK---------ARDNGYLLCDDSKRMRLI 862
>gi|197107449|pdb|3DOM|A Chain A, Crystal Structure Of The Complex Between Tfb5 And The
C-Terminal Domain Of Tfb2
gi|197107451|pdb|3DOM|C Chain C, Crystal Structure Of The Complex Between Tfb5 And The
C-Terminal Domain Of Tfb2
Length = 108
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 44/83 (53%)
Query: 83 PRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLW 142
P + + +P V DQIRLW+ +L+RV T Y +F + + YA+D LLW
Sbjct: 19 PNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLLW 78
Query: 143 EDSKKMRLVVNAEIHMHMREFLR 165
+D KK + ++ E + + +F +
Sbjct: 79 KDDKKKKFFISKEGNSQVLDFAK 101
>gi|397564488|gb|EJK44232.1| hypothetical protein THAOC_37249 [Thalassiosira oceanica]
Length = 971
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 70/159 (44%), Gaps = 30/159 (18%)
Query: 5 SRKESWFIPTIADNCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQ-LPNLIVGAITK 63
SR+ +P++A M A ++ H + I Y+ LPN++ +T+
Sbjct: 397 SRQSDALLPSVAATRSL-----EEAMNAPDPTRSHLAV------ISYKRLPNVVFFHLTR 445
Query: 64 ESLYNAFENGITTE------QQNAHPRV-ADKIPSVPKNVCDQIRLWESDLNRVETTPAH 116
+S+ +AF G+T + Q +AHP + + P P NV DQI LW+ + RV
Sbjct: 446 DSIKSAFRLGVTADQILRFLQVHAHPMMRSGNQPMPPANVRDQILLWDRERRRV------ 499
Query: 117 YYDEF---PSRD--VFEAACDYARDQSGLLWEDSKKMRL 150
DE RD F A YA D L W ++ +L
Sbjct: 500 VMDEVWVHQCRDDAEFSAVGMYASDSEALAWGNAHTNKL 538
>gi|47194594|emb|CAF93785.1| unnamed protein product [Tetraodon nigroviridis]
Length = 123
Score = 43.1 bits (100), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 48/113 (42%), Gaps = 27/113 (23%)
Query: 81 AHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSG- 139
+ P + P +P + DQIRLWE + +R++ T Y++F S+ FE D R Q G
Sbjct: 13 SSPTPLTQTPVLPPTITDQIRLWELERDRLQFTEGVLYNQFLSQTDFEVLRD--RAQVGL 70
Query: 140 ------------------------LLWEDSKKMRLVVNAEIHMHMREFLRGQN 168
LLW+D+ +VV H +++F + Q
Sbjct: 71 FHCCYTIRQKVHQQLSPVLQSLGCLLWQDAAHRVMVVTLWGHSEVKKFWKRQK 123
>gi|197107383|pdb|3DGP|A Chain A, Crystal Structure Of The Complex Between Tfb5 And The
C-Terminal Domain Of Tfb2
Length = 80
Score = 43.1 bits (100), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 39/71 (54%)
Query: 95 NVCDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNA 154
V DQIRLW+ +L+RV T Y +F + + YA+D LLW+D KK + ++
Sbjct: 3 TVVDQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISK 62
Query: 155 EIHMHMREFLR 165
E + + +F +
Sbjct: 63 EGNSQVLDFAK 73
>gi|401419872|ref|XP_003874425.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322490661|emb|CBZ25923.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 559
Score = 42.7 bits (99), Expect = 0.054, Method: Composition-based stats.
Identities = 36/144 (25%), Positives = 56/144 (38%), Gaps = 55/144 (38%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLP-NLIVGAITKESLYNAFENGITTEQ-- 78
++ ETNF++YAY+ IL F+++E + NL +T++S A GIT Q
Sbjct: 359 IITETNFRLYAYTDDADLLNILNQFAELEEIVNGNLHCYRVTRDSFAAAMRKGITAAQVL 418
Query: 79 ----QNAHPRVADK-----------IPS-------------------------------- 91
AHP + + PS
Sbjct: 419 RFLSLRAHPSMLRRHGEREESGDTAAPSSAGGVSSSLTFSAPHRRGAVPGTCGGTTAIDA 478
Query: 92 -----VPKNVCDQIRLWESDLNRV 110
VP++ CDQ+R+WES+ +RV
Sbjct: 479 VSTIVVPQSFCDQMRMWESECHRV 502
>gi|162605818|ref|XP_001713424.1| hypothetical protein GTHECHR3089 [Guillardia theta]
gi|13794356|gb|AAK39733.1|AF083031_90 hypothetical protein [Guillardia theta]
Length = 180
Score = 42.4 bits (98), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 27/110 (24%), Positives = 51/110 (46%), Gaps = 15/110 (13%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITTE---- 77
+++E NF++Y + + SK Y LP VG I + F+ G+T++
Sbjct: 45 IIIEVNFRIYIFKYFDFSNSFFKFISKKIYSLPYFYVGEINENYSRRLFQLGVTSDNILK 104
Query: 78 --QQNAHPRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHYYDEFPSRD 125
++N H +KIPS + D++R+WE + +++ F R+
Sbjct: 105 FIKKNLH-YTCNKIPSTFE---DRLRIWEISFKK-----KYFFGGFLMRN 145
>gi|241725008|ref|XP_002412209.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
gi|215505416|gb|EEC14910.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
Length = 75
Score = 42.4 bits (98), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 35/67 (52%)
Query: 97 CDQIRLWESDLNRVETTPAHYYDEFPSRDVFEAACDYARDQSGLLWEDSKKMRLVVNAEI 156
DQ+RLWE + +R Y +F S+ F+ +YA D L+W++ K +VVN
Sbjct: 1 MDQLRLWELERDRFNFREGVLYSQFISQSDFQLLRNYASDLGVLIWDNPSKRVMVVNRNG 60
Query: 157 HMHMREF 163
H ++ F
Sbjct: 61 HDEVKRF 67
>gi|398024082|ref|XP_003865202.1| hypothetical protein, conserved [Leishmania donovani]
gi|322503439|emb|CBZ38524.1| hypothetical protein, conserved [Leishmania donovani]
Length = 557
Score = 40.8 bits (94), Expect = 0.19, Method: Composition-based stats.
Identities = 35/144 (24%), Positives = 55/144 (38%), Gaps = 55/144 (38%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLP-NLIVGAITKESLYNAFENGITTEQ-- 78
++ ETNF++YAY+ IL F+K+E + NL +T++S +A GIT Q
Sbjct: 357 IITETNFRLYAYTDDADLLNILNQFAKLEEIVNGNLHCYRVTRDSFASAMRKGITAAQVL 416
Query: 79 ----QNAHPRV----------------------------------------------ADK 88
AHP + D
Sbjct: 417 RFLSLRAHPSMLRRHGEREENGDTTSPSSAGGASSSFTFSASPRRRAVAGTCGSTTSIDA 476
Query: 89 IPS--VPKNVCDQIRLWESDLNRV 110
+ + VP++ CDQ+ +WES+ RV
Sbjct: 477 VSTLVVPQSFCDQMHMWESECRRV 500
>gi|146103275|ref|XP_001469523.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134073893|emb|CAM72632.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 557
Score = 40.8 bits (94), Expect = 0.19, Method: Composition-based stats.
Identities = 35/144 (24%), Positives = 55/144 (38%), Gaps = 55/144 (38%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLP-NLIVGAITKESLYNAFENGITTEQ-- 78
++ ETNF++YAY+ IL F+K+E + NL +T++S +A GIT Q
Sbjct: 357 IITETNFRLYAYTDDADLLNILNQFAKLEEIVNGNLHCYRVTRDSFASAMRKGITAAQVL 416
Query: 79 ----QNAHPRV----------------------------------------------ADK 88
AHP + D
Sbjct: 417 RFLSLRAHPSMLRRHGEREENGDTTSPSSAGGASSSFTFSASPRRRAVAGTCGSTTSIDA 476
Query: 89 IPS--VPKNVCDQIRLWESDLNRV 110
+ + VP++ CDQ+ +WES+ RV
Sbjct: 477 VSTLVVPQSFCDQMHMWESECRRV 500
>gi|154345556|ref|XP_001568715.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134066057|emb|CAM43844.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 561
Score = 40.8 bits (94), Expect = 0.20, Method: Composition-based stats.
Identities = 36/143 (25%), Positives = 53/143 (37%), Gaps = 54/143 (37%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLP-NLIVGAITKESLYNAFENGITTEQ-- 78
++ ETNF++YAY+ IL F+++E + NL +T++S A GIT Q
Sbjct: 362 IITETNFRLYAYTEDSDLLNILHQFAELEEVVNGNLHCYRVTRDSFAAAMRKGITATQVL 421
Query: 79 ----QNAHPRV------------------ADKIPS------------------------- 91
AHP + A PS
Sbjct: 422 RFLSLRAHPSMLRRYGEKEENGDTKSPSPAGGAPSSTFSAPPRRAAVAGTRGSTTFIDAV 481
Query: 92 ----VPKNVCDQIRLWESDLNRV 110
VP + CDQ+R+WE + RV
Sbjct: 482 TTLVVPHSFCDQLRMWERECRRV 504
>gi|145509120|ref|XP_001440504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407721|emb|CAK73107.1| unnamed protein product [Paramecium tetraurelia]
Length = 400
Score = 39.3 bits (90), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 21/71 (29%), Positives = 39/71 (54%), Gaps = 7/71 (9%)
Query: 14 TIADNCGFVVVETNFKMYAYSTS------KLHCEILRLFSKIEYQLPNLIVGAITKESLY 67
+I C ++VE NF++YAY S ++ C +L LFS+I+ + LI+ +++ S+
Sbjct: 265 SIGIQCN-IIVEANFRIYAYLNSGDQQEEEILCNLLNLFSEIKKRFKILIIADLSESSIR 323
Query: 68 NAFENGITTEQ 78
A + +Q
Sbjct: 324 KAVRENLQAKQ 334
>gi|324515605|gb|ADY46257.1| General transcription factor IIH subunit 4 [Ascaris suum]
Length = 245
Score = 38.5 bits (88), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 14/35 (40%), Positives = 26/35 (74%)
Query: 18 NCGFVVVETNFKMYAYSTSKLHCEILRLFSKIEYQ 52
+ G+++VETN+++YAY+ S L IL F+++ Y+
Sbjct: 209 HSGYIIVETNYRVYAYTDSSLQLAILSTFTEMLYR 243
>gi|146161819|ref|XP_001033510.2| hypothetical protein TTHERM_00313290 [Tetrahymena thermophila]
gi|146142827|gb|EAR85847.2| hypothetical protein TTHERM_00313290 [Tetrahymena thermophila
SB210]
Length = 484
Score = 38.1 bits (87), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 36/71 (50%), Gaps = 17/71 (23%)
Query: 22 VVVETNFKMYAY-------------STSKLHCE----ILRLFSKIEYQLPNLIVGAITKE 64
+ +ETNF +YAY T K E +L F + Y+ P+LIVG ++++
Sbjct: 322 ITIETNFNVYAYIDQEDPKEIKDSKDTLKDQTEHIEKLLASFCDLHYKFPHLIVGQLSEQ 381
Query: 65 SLYNAFENGIT 75
+ F+NG+T
Sbjct: 382 KTKDQFKNGLT 392
>gi|342184190|emb|CCC93671.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 499
Score = 37.7 bits (86), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 52/124 (41%), Gaps = 35/124 (28%)
Query: 22 VVVETNFKMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITTEQ-- 78
++ ETNF++YAY+ + IL F+ E + +IV +T+ S A GI +
Sbjct: 317 IITETNFRLYAYTRNPDLLGILDQFAVKEIDIDGIIVCYRVTRSSFALALRKGIDAKHIL 376
Query: 79 ----QNAHPR-----------VADKI----------PS-------VPKNVCDQIRLWESD 106
AHP V+D + PS +P++ CDQ+ WE +
Sbjct: 377 QFLTLKAHPSMVRKDGGASRDVSDNLALNSVAGFGKPSEVHQSTVIPQSFCDQLMTWERE 436
Query: 107 LNRV 110
R+
Sbjct: 437 CRRL 440
>gi|255019851|ref|ZP_05291927.1| 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[Acidithiobacillus caldus ATCC 51756]
gi|340783226|ref|YP_004749833.1| 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[Acidithiobacillus caldus SM-1]
gi|254970780|gb|EET28266.1| 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[Acidithiobacillus caldus ATCC 51756]
gi|340557377|gb|AEK59131.1| 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[Acidithiobacillus caldus SM-1]
Length = 160
Score = 37.4 bits (85), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 23/35 (65%)
Query: 83 PRVADKIPSVPKNVCDQIRLWESDLNRVETTPAHY 117
PR+A+ IPS+ +N+CD ++L E LN TT H
Sbjct: 106 PRLAEHIPSMRRNLCDDLQLGEDRLNIKATTTEHL 140
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.134 0.406
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,642,824,243
Number of Sequences: 23463169
Number of extensions: 103022855
Number of successful extensions: 204808
Number of sequences better than 100.0: 383
Number of HSP's better than 100.0 without gapping: 370
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 204027
Number of HSP's gapped (non-prelim): 421
length of query: 169
length of database: 8,064,228,071
effective HSP length: 129
effective length of query: 40
effective length of database: 9,332,446,566
effective search space: 373297862640
effective search space used: 373297862640
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 71 (32.0 bits)