BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 029030
(200 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255556021|ref|XP_002519045.1| tfiih, polypeptide, putative [Ricinus communis]
gi|223541708|gb|EEF43256.1| tfiih, polypeptide, putative [Ricinus communis]
Length = 451
Score = 372 bits (954), Expect = e-101, Method: Compositional matrix adjust.
Identities = 173/198 (87%), Positives = 192/198 (96%), Gaps = 1/198 (0%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
MIKD ADLGLVKLQQ GRKESWFIPTKLATNLSMSLTDSS+RK+GFVVVETNFRMYAYST
Sbjct: 254 MIKDLADLGLVKLQQ-GRKESWFIPTKLATNLSMSLTDSSSRKQGFVVVETNFRMYAYST 312
Query: 61 SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
SKLHCEI+RLFS++EYQLPNL+VGA+TKESLY+AFENGITAEQIISFLQQNAHPRVA+RI
Sbjct: 313 SKLHCEIMRLFSRVEYQLPNLVVGAMTKESLYSAFENGITAEQIISFLQQNAHPRVAERI 372
Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
PSVPENV DQIRLWESD+NRVEMTPAH Y+EFPSRDVFEAAC++ARD +GLLWEDSK+MR
Sbjct: 373 PSVPENVTDQIRLWESDMNRVEMTPAHLYDEFPSRDVFEAACNFARDWNGLLWEDSKRMR 432
Query: 181 LVVNAEIHMHMREFLRGQ 198
+VV AEIH++MRE+LRGQ
Sbjct: 433 MVVKAEIHLNMREYLRGQ 450
>gi|224077128|ref|XP_002305144.1| predicted protein [Populus trichocarpa]
gi|222848108|gb|EEE85655.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 363 bits (933), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 169/197 (85%), Positives = 190/197 (96%), Gaps = 1/197 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
IKD A+LGLVKLQQ GRKESWFIPTKLATNLS+SLTDSS+RK+G+VVVETNFR+YAYS+S
Sbjct: 253 IKDLAELGLVKLQQ-GRKESWFIPTKLATNLSVSLTDSSSRKQGYVVVETNFRLYAYSSS 311
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
KLHCEILRLFSKIEYQLPNLIVGAITKESLY AFENGIT++QIISFLQQNAHPRVA+R+P
Sbjct: 312 KLHCEILRLFSKIEYQLPNLIVGAITKESLYTAFENGITSDQIISFLQQNAHPRVAERLP 371
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
SVPENV DQIRLWE+DLNRVE+TP+H+Y+EFPSRD FEAACD+AR+ +GLLWEDSKKMR+
Sbjct: 372 SVPENVTDQIRLWEADLNRVEITPSHFYDEFPSRDTFEAACDFAREWNGLLWEDSKKMRV 431
Query: 182 VVNAEIHMHMREFLRGQ 198
VV AEIHM+MREFLRGQ
Sbjct: 432 VVKAEIHMNMREFLRGQ 448
>gi|449433857|ref|XP_004134713.1| PREDICTED: general transcription factor IIH subunit 4-like [Cucumis
sativus]
Length = 451
Score = 362 bits (928), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 171/197 (86%), Positives = 186/197 (94%), Gaps = 1/197 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
IKD ADLGLVKLQQ GRKESWFIPTKLATNLSMSL DSS+RK GFVVVETNFRMYAYSTS
Sbjct: 255 IKDLADLGLVKLQQ-GRKESWFIPTKLATNLSMSLADSSSRKLGFVVVETNFRMYAYSTS 313
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
KLHCEILRLFS+IEYQLPNLIVGAITKESLYNAF+NGITAEQI++FLQQNAHPRVA+RIP
Sbjct: 314 KLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFLQQNAHPRVAERIP 373
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
SVPENV DQIRLWESDLNRV++TPAH+Y+EFPSR+VFEAACDYAR+ +GLLWEDSK +RL
Sbjct: 374 SVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREWNGLLWEDSKNLRL 433
Query: 182 VVNAEIHMHMREFLRGQ 198
VV A+IH HMRE LR Q
Sbjct: 434 VVKADIHTHMREHLRRQ 450
>gi|147854406|emb|CAN81292.1| hypothetical protein VITISV_005315 [Vitis vinifera]
Length = 451
Score = 360 bits (923), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 167/195 (85%), Positives = 184/195 (94%), Gaps = 1/195 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
IKD DLGLVKLQQ GRKESWFIPTKLATNLSMSL+D+S+RK+GFVVVETNFR+YAYS+S
Sbjct: 255 IKDLVDLGLVKLQQ-GRKESWFIPTKLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSS 313
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
KLHCEILRLFS++EYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA+R P
Sbjct: 314 KLHCEILRLFSRVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAERTP 373
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
+VPENV DQIRLWE+DLNRVE P+H Y+EFPSRDVFEAACD+AR+ GLLWEDSKKMRL
Sbjct: 374 AVPENVTDQIRLWETDLNRVETMPSHLYDEFPSRDVFEAACDFAREYGGLLWEDSKKMRL 433
Query: 182 VVNAEIHMHMREFLR 196
VV AEIH+HMRE+LR
Sbjct: 434 VVKAEIHLHMREYLR 448
>gi|357461579|ref|XP_003601071.1| General transcription factor IIH subunit [Medicago truncatula]
gi|355490119|gb|AES71322.1| General transcription factor IIH subunit [Medicago truncatula]
Length = 452
Score = 350 bits (897), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 164/198 (82%), Positives = 186/198 (93%), Gaps = 1/198 (0%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
+IKD ADLGLVKLQQ GRKESWFIPTKLATNLS+S+T+SS+RKEGFVVVETNFR+YAYST
Sbjct: 255 IIKDLADLGLVKLQQ-GRKESWFIPTKLATNLSVSMTESSSRKEGFVVVETNFRVYAYST 313
Query: 61 SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
SKLHCEILRLFS++EYQLPNLIVGAITKESLYNAF+NGITA+QI+SFL+QNAHPRVA R+
Sbjct: 314 SKLHCEILRLFSRVEYQLPNLIVGAITKESLYNAFDNGITADQIVSFLRQNAHPRVAQRV 373
Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
P+VPENV DQIRLWESDLNRVEMT A+YY+EFPSRDVFE ACD AR+ SGLLWEDSKKM
Sbjct: 374 PAVPENVTDQIRLWESDLNRVEMTEAYYYDEFPSRDVFEGACDCAREWSGLLWEDSKKMH 433
Query: 181 LVVNAEIHMHMREFLRGQ 198
LVV +E+H ++R+FLR Q
Sbjct: 434 LVVKSEVHTYVRDFLRRQ 451
>gi|358346685|ref|XP_003637396.1| General transcription factor IIH subunit [Medicago truncatula]
gi|355503331|gb|AES84534.1| General transcription factor IIH subunit [Medicago truncatula]
Length = 451
Score = 348 bits (893), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 164/198 (82%), Positives = 185/198 (93%), Gaps = 1/198 (0%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
+IKD ADLGLVKLQQ GRKESWFIPTKLATNLS+S+T+SS+RKEGFVVVETNFR+YAYST
Sbjct: 254 IIKDLADLGLVKLQQ-GRKESWFIPTKLATNLSVSMTESSSRKEGFVVVETNFRVYAYST 312
Query: 61 SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
SKLHCEILRLFS++EYQLPNLIVGAITKESLYNAF+NGITA+QI+SF +QNAHPRVA RI
Sbjct: 313 SKLHCEILRLFSRVEYQLPNLIVGAITKESLYNAFDNGITADQIVSFFRQNAHPRVAQRI 372
Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
P+VPENV DQIRLWESDLNRVEMT A+YY+EFPSRDVFE ACD AR+ SGLLWEDSKKM
Sbjct: 373 PAVPENVTDQIRLWESDLNRVEMTEAYYYDEFPSRDVFEGACDCAREWSGLLWEDSKKMH 432
Query: 181 LVVNAEIHMHMREFLRGQ 198
LVV +E+H ++R+FLR Q
Sbjct: 433 LVVKSEVHTYVRDFLRRQ 450
>gi|356548733|ref|XP_003542754.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine
max]
Length = 451
Score = 344 bits (882), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 161/198 (81%), Positives = 183/198 (92%), Gaps = 1/198 (0%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
+I D ADLGLVK+QQ GRK SWFIPTKLATNLSMSL DSS+RK+GFVVVETNFR+YAYST
Sbjct: 254 IINDLADLGLVKIQQ-GRKGSWFIPTKLATNLSMSLADSSSRKQGFVVVETNFRVYAYST 312
Query: 61 SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
SKLHCEILRLFS++EYQLPNLIVGAITKESLY+AFENGITAEQII+FLQQNAHPRVA+RI
Sbjct: 313 SKLHCEILRLFSRVEYQLPNLIVGAITKESLYSAFENGITAEQIITFLQQNAHPRVAERI 372
Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
PSVPENV +QIRLWE+DLNRVEMT A+YY+EFPSRDVFE ACD AR+ +GLLWEDSKKM
Sbjct: 373 PSVPENVTEQIRLWEADLNRVEMTDAYYYDEFPSRDVFEGACDCAREWNGLLWEDSKKMH 432
Query: 181 LVVNAEIHMHMREFLRGQ 198
+VV E+H ++R++LR Q
Sbjct: 433 MVVKTEVHQYVRDYLRRQ 450
>gi|297800414|ref|XP_002868091.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp.
lyrata]
gi|297313927|gb|EFH44350.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 342 bits (878), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 158/199 (79%), Positives = 181/199 (90%), Gaps = 1/199 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
+KD ADLGLVKLQQ GRK+SWFIPTKLATNLS+SL DSSARKEGFVV+ETNFRMYAYSTS
Sbjct: 255 LKDLADLGLVKLQQ-GRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTS 313
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
KL CEILRLF++IEYQLPNLI AITKESLYNAF+NGIT++QII+FLQQN+HPR ADR+P
Sbjct: 314 KLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVP 373
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
S+PENV DQIRLWE+DL R+EMT AH+Y+EFPS+DVFEAACD+AR+ GLLWEDSK+MRL
Sbjct: 374 SIPENVTDQIRLWETDLKRIEMTQAHFYDEFPSKDVFEAACDFAREWGGLLWEDSKRMRL 433
Query: 182 VVNAEIHMHMREFLRGQNK 200
VV +E+H MREFL Q K
Sbjct: 434 VVKSEVHNQMREFLHNQTK 452
>gi|42566894|ref|NP_193435.2| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
gi|51969678|dbj|BAD43531.1| unnamed protein product [Arabidopsis thaliana]
gi|115646777|gb|ABJ17114.1| At4g17020 [Arabidopsis thaliana]
gi|332658439|gb|AEE83839.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
Length = 452
Score = 341 bits (874), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 157/199 (78%), Positives = 182/199 (91%), Gaps = 1/199 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
+KD ADLGLVKLQQ GRK+SWFIPTKLATNLS+SL DSSARKEGFVV+ETNFRMYAYSTS
Sbjct: 255 LKDLADLGLVKLQQ-GRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTS 313
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
KL CEILRLF++IEYQLPNLI AITKESLYNAF+NGIT++QII+FLQQN+HPR ADR+P
Sbjct: 314 KLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVP 373
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
S+PENV DQIRLWE+DL R+EMT AH+Y+EFPS+DVFEAACD+AR+ GLLWEDSK+MRL
Sbjct: 374 SIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRL 433
Query: 182 VVNAEIHMHMREFLRGQNK 200
VV +E+H MREFL Q++
Sbjct: 434 VVKSEVHNQMREFLHTQSR 452
>gi|42572935|ref|NP_974564.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
gi|332658438|gb|AEE83838.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
Length = 462
Score = 340 bits (873), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 157/199 (78%), Positives = 182/199 (91%), Gaps = 1/199 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
+KD ADLGLVKLQQ GRK+SWFIPTKLATNLS+SL DSSARKEGFVV+ETNFRMYAYSTS
Sbjct: 255 LKDLADLGLVKLQQ-GRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTS 313
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
KL CEILRLF++IEYQLPNLI AITKESLYNAF+NGIT++QII+FLQQN+HPR ADR+P
Sbjct: 314 KLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVP 373
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
S+PENV DQIRLWE+DL R+EMT AH+Y+EFPS+DVFEAACD+AR+ GLLWEDSK+MRL
Sbjct: 374 SIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRL 433
Query: 182 VVNAEIHMHMREFLRGQNK 200
VV +E+H MREFL Q++
Sbjct: 434 VVKSEVHNQMREFLHTQSR 452
>gi|334186626|ref|NP_001190745.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
gi|332658440|gb|AEE83840.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
Length = 482
Score = 340 bits (872), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 157/198 (79%), Positives = 181/198 (91%), Gaps = 1/198 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
+KD ADLGLVKLQQ GRK+SWFIPTKLATNLS+SL DSSARKEGFVV+ETNFRMYAYSTS
Sbjct: 255 LKDLADLGLVKLQQ-GRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTS 313
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
KL CEILRLF++IEYQLPNLI AITKESLYNAF+NGIT++QII+FLQQN+HPR ADR+P
Sbjct: 314 KLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVP 373
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
S+PENV DQIRLWE+DL R+EMT AH+Y+EFPS+DVFEAACD+AR+ GLLWEDSK+MRL
Sbjct: 374 SIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRL 433
Query: 182 VVNAEIHMHMREFLRGQN 199
VV +E+H MREFL Q+
Sbjct: 434 VVKSEVHNQMREFLHTQS 451
>gi|51969958|dbj|BAD43671.1| unnamed protein product [Arabidopsis thaliana]
Length = 452
Score = 339 bits (870), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 157/199 (78%), Positives = 181/199 (90%), Gaps = 1/199 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
+KD ADLGLVKLQQ GRK+SWFIPTKLATNLS+SL DSSARKEGFVV+ETNFRMYAYSTS
Sbjct: 255 LKDLADLGLVKLQQ-GRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTS 313
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
KL CEILRLF++IEYQLPNLI AITKESLYNAF NGIT++QII+FLQQN+HPR ADR+P
Sbjct: 314 KLQCEILRLFARIEYQLPNLIACAITKESLYNAFGNGITSDQIITFLQQNSHPRCADRVP 373
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
S+PENV DQIRLWE+DL R+EMT AH+Y+EFPS+DVFEAACD+AR+ GLLWEDSK+MRL
Sbjct: 374 SIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRL 433
Query: 182 VVNAEIHMHMREFLRGQNK 200
VV +E+H MREFL Q++
Sbjct: 434 VVKSEVHNQMREFLHTQSR 452
>gi|356521347|ref|XP_003529318.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine
max]
Length = 451
Score = 339 bits (869), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 157/198 (79%), Positives = 182/198 (91%), Gaps = 1/198 (0%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
+I D ADLGLVK+QQ GRK SWFIPTKLATNLSMSL DSS+RK+GFVVVETNFR+YAYST
Sbjct: 254 IINDLADLGLVKIQQ-GRKGSWFIPTKLATNLSMSLADSSSRKQGFVVVETNFRVYAYST 312
Query: 61 SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
SKLHCEILRLFS++EYQLPNLIVGAITKESLY+A+ENGITAEQI++FLQQNAHPRVA+R+
Sbjct: 313 SKLHCEILRLFSRVEYQLPNLIVGAITKESLYSAYENGITAEQIVTFLQQNAHPRVAERV 372
Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
PSVPENV +QIRLWE+DLNRVEMT +YY+EFPSRDVFE ACD AR+ +GLLWEDSKKM
Sbjct: 373 PSVPENVTEQIRLWEADLNRVEMTDTYYYDEFPSRDVFEGACDCAREWNGLLWEDSKKMH 432
Query: 181 LVVNAEIHMHMREFLRGQ 198
+VV E+H ++R++LR Q
Sbjct: 433 MVVKTEVHPYVRDYLRRQ 450
>gi|32487401|emb|CAE05735.1| OSJNBb0017I01.15 [Oryza sativa Japonica Group]
Length = 451
Score = 319 bits (818), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 156/195 (80%), Positives = 176/195 (90%), Gaps = 1/195 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
I+D A+LGLVKLQQ GRK+SWFIPTKLATNLS SL+DSS+ KEGFVVVETNFRMYAYSTS
Sbjct: 255 IRDLAELGLVKLQQ-GRKDSWFIPTKLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTS 313
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
+LHCEILRLFS++EYQLPNLIVG+ITKESLY AFENGITAEQIISFLQQNAHPRVAD+IP
Sbjct: 314 RLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIP 373
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
+VPENV DQIRLWE+D NRV+MT +H YE+FPS+D+F+ CDYARD LLWED+KKMRL
Sbjct: 374 AVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRL 433
Query: 182 VVNAEIHMHMREFLR 196
+V E H MREFLR
Sbjct: 434 IVRVEFHSEMREFLR 448
>gi|222629781|gb|EEE61913.1| hypothetical protein OsJ_16644 [Oryza sativa Japonica Group]
Length = 427
Score = 319 bits (817), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 156/195 (80%), Positives = 176/195 (90%), Gaps = 1/195 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
I+D A+LGLVKLQQ GRK+SWFIPTKLATNLS SL+DSS+ KEGFVVVETNFRMYAYSTS
Sbjct: 231 IRDLAELGLVKLQQ-GRKDSWFIPTKLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTS 289
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
+LHCEILRLFS++EYQLPNLIVG+ITKESLY AFENGITAEQIISFLQQNAHPRVAD+IP
Sbjct: 290 RLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIP 349
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
+VPENV DQIRLWE+D NRV+MT +H YE+FPS+D+F+ CDYARD LLWED+KKMRL
Sbjct: 350 AVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRL 409
Query: 182 VVNAEIHMHMREFLR 196
+V E H MREFLR
Sbjct: 410 IVRVEFHSEMREFLR 424
>gi|218195833|gb|EEC78260.1| hypothetical protein OsI_17944 [Oryza sativa Indica Group]
Length = 459
Score = 317 bits (813), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 156/202 (77%), Positives = 177/202 (87%), Gaps = 7/202 (3%)
Query: 2 IKDFADLGLVKLQQV-------GRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFR 54
I+D A+LGLVKLQQ+ GRK+SWFIPTKLATNLS SL+DSS+ KEGFVVVETNFR
Sbjct: 255 IRDLAELGLVKLQQMSRVKPWQGRKDSWFIPTKLATNLSASLSDSSSNKEGFVVVETNFR 314
Query: 55 MYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 114
MYAYSTS+LHCEILRLFS++EYQLPNLIVG+ITKESLY AFENGITAEQIISFLQQNAHP
Sbjct: 315 MYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHP 374
Query: 115 RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWE 174
RVAD+IP+VPENV DQIRLWE+D NRV+MT +H YE+FPS+D+F+ CDYARD LLWE
Sbjct: 375 RVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWE 434
Query: 175 DSKKMRLVVNAEIHMHMREFLR 196
D+KKMRL+V E H MREFLR
Sbjct: 435 DAKKMRLIVRVEFHSEMREFLR 456
>gi|414584751|tpg|DAA35322.1| TPA: hypothetical protein ZEAMMB73_386928 [Zea mays]
Length = 451
Score = 316 bits (809), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 154/197 (78%), Positives = 175/197 (88%), Gaps = 1/197 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
I+D A+LGLVK QQ GRK+SWFIPT+LATNLS SL+DSS+ KEGFVVVETNFRMYAYSTS
Sbjct: 255 IRDLAELGLVK-QQQGRKDSWFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTS 313
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
KLHCEILRLF+++EYQLPNLIVGA+TKES+Y AFENGITAEQIISFL+QNAHPRVAD+IP
Sbjct: 314 KLHCEILRLFARVEYQLPNLIVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIP 373
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
+VPENV DQIRLWE+DLNRVEM P+H YE+FPS++ FE CDYARD LLWED K+MRL
Sbjct: 374 AVPENVTDQIRLWETDLNRVEMIPSHLYEDFPSKEWFEQCCDYARDNGYLLWEDPKRMRL 433
Query: 182 VVNAEIHMHMREFLRGQ 198
+V E H MREFLR Q
Sbjct: 434 IVRGEFHPEMREFLRRQ 450
>gi|242074798|ref|XP_002447335.1| hypothetical protein SORBIDRAFT_06g033130 [Sorghum bicolor]
gi|241938518|gb|EES11663.1| hypothetical protein SORBIDRAFT_06g033130 [Sorghum bicolor]
Length = 451
Score = 312 bits (799), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 153/197 (77%), Positives = 173/197 (87%), Gaps = 1/197 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
I+D A+LGLVK QQ GR + WFIPT+LATNLS SL+DSS+ KEGFVVVETNFRMYAYSTS
Sbjct: 255 IRDLAELGLVK-QQQGRTDRWFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTS 313
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
KLHCEILRLFS++EYQLPNLIVGA+TKES+Y AFENGITAEQIISFL+QNAHPRVAD+IP
Sbjct: 314 KLHCEILRLFSRVEYQLPNLIVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIP 373
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
+VPENV DQIRLWE+D NRVEM P+H YE+FPS++ FE CDYARD LLWEDSK+MRL
Sbjct: 374 TVPENVTDQIRLWETDRNRVEMIPSHLYEDFPSKEWFEQCCDYARDNGYLLWEDSKRMRL 433
Query: 182 VVNAEIHMHMREFLRGQ 198
+V E H MREFLR Q
Sbjct: 434 IVRGEFHPEMREFLRRQ 450
>gi|357166758|ref|XP_003580836.1| PREDICTED: general transcription factor IIH subunit 4-like
[Brachypodium distachyon]
Length = 451
Score = 308 bits (789), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 153/197 (77%), Positives = 172/197 (87%), Gaps = 1/197 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
I+D A+LGLVK+QQ GRK+SWFIPTKLATNLS SL+DSSA KEG VVVETNFR+YAYS S
Sbjct: 255 IRDLAELGLVKVQQ-GRKDSWFIPTKLATNLSSSLSDSSASKEGIVVVETNFRLYAYSAS 313
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
+LHCEILRLFS++EYQLPNLIVGAITKESLY AF+NGITAEQIISFL+QNAHPRVAD+IP
Sbjct: 314 RLHCEILRLFSRVEYQLPNLIVGAITKESLYGAFDNGITAEQIISFLKQNAHPRVADKIP 373
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
VPENV DQIRLWE+D NRV+M +H YE+FPS+D+FE CD ARD LLWEDSKKMRL
Sbjct: 374 VVPENVTDQIRLWETDRNRVDMVLSHVYEDFPSKDMFEQCCDLARDNGFLLWEDSKKMRL 433
Query: 182 VVNAEIHMHMREFLRGQ 198
+V E H MREFLR Q
Sbjct: 434 IVRVEFHQEMREFLRRQ 450
>gi|326510735|dbj|BAJ91715.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 451
Score = 307 bits (787), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 152/197 (77%), Positives = 171/197 (86%), Gaps = 1/197 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
+ D +LGLVKLQQ GRK+SWFIPTKLATNLS SL+DS+A KEG VVVETNFR+YAYS S
Sbjct: 255 VMDLMELGLVKLQQ-GRKDSWFIPTKLATNLSSSLSDSAASKEGIVVVETNFRLYAYSAS 313
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
KLHCEILRLFS++EYQLPNLIVGAITKESLY AF+NGITAEQIISFLQQNAHPRV D+IP
Sbjct: 314 KLHCEILRLFSRVEYQLPNLIVGAITKESLYGAFDNGITAEQIISFLQQNAHPRVIDKIP 373
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
VPENV DQIRLWE+D NRVEM +H YE+FPS+D+FE CD+ARD LLWED+KKMRL
Sbjct: 374 IVPENVTDQIRLWENDRNRVEMILSHVYEDFPSKDMFEQCCDHARDNGYLLWEDAKKMRL 433
Query: 182 VVNAEIHMHMREFLRGQ 198
+V+ E H MREFLR Q
Sbjct: 434 IVSGEFHQEMREFLRRQ 450
>gi|357123176|ref|XP_003563288.1| PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH
subunit 4-like [Brachypodium distachyon]
Length = 452
Score = 305 bits (782), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 152/197 (77%), Positives = 171/197 (86%), Gaps = 1/197 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
I+D A+LGLVK+QQ GRK+SWFIPTKLATNLS SL+DSSA KEG VVVETNFR+YAYS S
Sbjct: 256 IRDLAELGLVKVQQ-GRKDSWFIPTKLATNLSSSLSDSSASKEGIVVVETNFRLYAYSAS 314
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
+LHCEILRLFS++EYQLPNLIVGAITKESLY AF+NGITAEQIISFL+QNAHP VAD+IP
Sbjct: 315 RLHCEILRLFSRVEYQLPNLIVGAITKESLYGAFDNGITAEQIISFLKQNAHPXVADKIP 374
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
VPENV DQIRLWE+D NRV+M +H YE+FPS+D+FE CD ARD LLWEDSKKMRL
Sbjct: 375 VVPENVTDQIRLWETDRNRVDMVLSHVYEDFPSKDLFEQCCDLARDNGFLLWEDSKKMRL 434
Query: 182 VVNAEIHMHMREFLRGQ 198
+V E H MREFLR Q
Sbjct: 435 IVRVEFHQEMREFLRRQ 451
>gi|90399251|emb|CAJ86205.1| B0213E10.4 [Oryza sativa Indica Group]
Length = 470
Score = 277 bits (708), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 133/195 (68%), Positives = 150/195 (76%), Gaps = 29/195 (14%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
I+D A+LGLVKLQQ GFVVVETNFRMYAYSTS
Sbjct: 302 IRDLAELGLVKLQQ-----------------------------GFVVVETNFRMYAYSTS 332
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
+LHCEILRLFS++EYQLPNLIVG+ITKESLY AFENGITAEQIISFLQQNAHPRVAD+IP
Sbjct: 333 RLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIP 392
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
+VPENV DQIRLWE+D NRV+MT +H YE+FPS+D+F+ CDYARD LLWED+KKMRL
Sbjct: 393 AVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRL 452
Query: 182 VVNAEIHMHMREFLR 196
+V E H MREFLR
Sbjct: 453 IVRVEFHSEMREFLR 467
>gi|115461368|ref|NP_001054284.1| Os04g0680000 [Oryza sativa Japonica Group]
gi|113565855|dbj|BAF16198.1| Os04g0680000 [Oryza sativa Japonica Group]
Length = 444
Score = 269 bits (687), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 128/193 (66%), Positives = 147/193 (76%), Gaps = 29/193 (15%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
I+D A+LGLVKLQQ GFVVVETNFRMYAYSTS
Sbjct: 274 IRDLAELGLVKLQQ-----------------------------GFVVVETNFRMYAYSTS 304
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
+LHCEILRLFS++EYQLPNLIVG+ITKESLY AFENGITAEQIISFLQQNAHPRVAD+IP
Sbjct: 305 RLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIP 364
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
+VPENV DQIRLWE+D NRV+MT +H YE+FPS+D+F+ CDYARD LLWED+KKMRL
Sbjct: 365 AVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRL 424
Query: 182 VVNAEIHMHMREF 194
+V E H M ++
Sbjct: 425 IVRVEFHSEMHDY 437
>gi|167997877|ref|XP_001751645.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697626|gb|EDQ83962.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 462
Score = 245 bits (626), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 114/201 (56%), Positives = 159/201 (79%), Gaps = 3/201 (1%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSS--ARKEGFVVVETNFRMYAY 58
++ + A LGLVKLQQ G KESW+IPTKLA+NLS SL++S+ EGFVVVETNF++YAY
Sbjct: 255 VLDELAALGLVKLQQ-GMKESWYIPTKLASNLSASLSESTDWQSSEGFVVVETNFKVYAY 313
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
++SKL EILR F+++EYQLPNL+V +TKES+ A +GI+AEQIISFL+++AHP VA
Sbjct: 314 TSSKLQTEILRCFTRLEYQLPNLVVATLTKESVNKALGSGISAEQIISFLRKHAHPHVAQ 373
Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
+IP VPE V DQ+RLWE+D NRV+ PA++Y++FP+ ++EA +ARD GLL+ED+
Sbjct: 374 KIPVVPETVSDQLRLWETDRNRVQFEPAYFYDDFPTMAIYEAVVAHARDLGGLLFEDASA 433
Query: 179 MRLVVNAEIHMHMREFLRGQN 199
RL+V +++H MR+++R Q+
Sbjct: 434 KRLIVRSDLHEDMRQYIRKQS 454
>gi|296084734|emb|CBI25875.3| unnamed protein product [Vitis vinifera]
Length = 258
Score = 241 bits (614), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 128/219 (58%), Positives = 151/219 (68%), Gaps = 25/219 (11%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
IKD DLGLVKLQQ GRKESWFIPTKLATNLSMSL+D+S+RK+GFVVVETNFR+YAYS+S
Sbjct: 38 IKDLVDLGLVKLQQ-GRKESWFIPTKLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSS 96
Query: 62 KLHCEILRLFSKIEYQLPNLIV--------GAITKESL-----YNAFENGITAEQIISFL 108
KLHCEILRLFS+ L + ++ G + SL F G S +
Sbjct: 97 KLHCEILRLFSRYASMLFDCLLCICYSSQKGTLLDVSLIMYECLRTFSLGKIIFWFFSSV 156
Query: 109 QQNAHP-----------RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDV 157
A P +A++IP + QIRLWE+DLNRVE P+H Y+EFPSRDV
Sbjct: 157 STVATPCHGYLVIFTISCLAEKIPCLQFTNMSQIRLWETDLNRVETMPSHLYDEFPSRDV 216
Query: 158 FEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
FEAACD+AR+ GLLWEDSKKMRLVV AEIH+HMRE+LR
Sbjct: 217 FEAACDFAREYGGLLWEDSKKMRLVVKAEIHLHMREYLR 255
>gi|302806060|ref|XP_002984780.1| hypothetical protein SELMODRAFT_121159 [Selaginella moellendorffii]
gi|302808297|ref|XP_002985843.1| hypothetical protein SELMODRAFT_123271 [Selaginella moellendorffii]
gi|300146350|gb|EFJ13020.1| hypothetical protein SELMODRAFT_123271 [Selaginella moellendorffii]
gi|300147366|gb|EFJ14030.1| hypothetical protein SELMODRAFT_121159 [Selaginella moellendorffii]
Length = 459
Score = 227 bits (578), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 116/199 (58%), Positives = 155/199 (77%), Gaps = 2/199 (1%)
Query: 3 KDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA-RKEGFVVVETNFRMYAYSTS 61
++ A LG+++LQ+ G K+ WFIPT+LAT LS SL++SSA + EGF++VETNFR+YAY++S
Sbjct: 260 EELAMLGVLQLQK-GMKDRWFIPTRLATGLSASLSESSAWQTEGFIMVETNFRVYAYTSS 318
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
KLH E L +F + EY LPN++VG+ITKES+ AF +GI+A+QII FLQQ+AHP VA ++P
Sbjct: 319 KLHIETLHVFVRTEYVLPNILVGSITKESVNGAFASGISADQIIKFLQQHAHPFVAQKVP 378
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
SVPE VCDQIRLWESD RV+ PA+ YE FPS V+E+ +ARDR+GLLWED+ + +
Sbjct: 379 SVPETVCDQIRLWESDRVRVQYLPAYCYEGFPSTSVYESVVAHARDRNGLLWEDANRKMI 438
Query: 182 VVNAEIHMHMREFLRGQNK 200
VV E H +R FL+ NK
Sbjct: 439 VVGGEHHEAIRAFLQNINK 457
>gi|359497087|ref|XP_002267711.2| PREDICTED: general transcription factor IIH subunit 4-like [Vitis
vinifera]
Length = 238
Score = 226 bits (575), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 117/195 (60%), Positives = 134/195 (68%), Gaps = 44/195 (22%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
IKD DLGLVKLQQ GRKESWFIPTKLATNLSMSL+D+S+RK+GFVVVETNFR+YAYS+S
Sbjct: 85 IKDLVDLGLVKLQQ-GRKESWFIPTKLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSS 143
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
KLHCEILRLFS+ +S + H +
Sbjct: 144 KLHCEILRLFSR-----------------------------SSVSTVATPCHGYL----- 169
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
IRLWE+DLNRVE P+H Y+EFPSRDVFEAACD+AR+ GLLWEDSKKMRL
Sbjct: 170 ---------IRLWETDLNRVETMPSHLYDEFPSRDVFEAACDFAREYGGLLWEDSKKMRL 220
Query: 182 VVNAEIHMHMREFLR 196
VV AEIH+HMRE+LR
Sbjct: 221 VVKAEIHLHMREYLR 235
>gi|414584750|tpg|DAA35321.1| TPA: hypothetical protein ZEAMMB73_386928 [Zea mays]
Length = 386
Score = 217 bits (552), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 108/130 (83%), Positives = 123/130 (94%), Gaps = 1/130 (0%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
I+D A+LGLVK QQ GRK+SWFIPT+LATNLS SL+DSS+ KEGFVVVETNFRMYAYSTS
Sbjct: 255 IRDLAELGLVK-QQQGRKDSWFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTS 313
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
KLHCEILRLF+++EYQLPNLIVGA+TKES+Y AFENGITAEQIISFL+QNAHPRVAD+IP
Sbjct: 314 KLHCEILRLFARVEYQLPNLIVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIP 373
Query: 122 SVPENVCDQI 131
+VPENV DQ+
Sbjct: 374 AVPENVTDQV 383
>gi|440791255|gb|ELR12502.1| TFIIH basal transcription factor complex subunit 2, putative
[Acanthamoeba castellanii str. Neff]
Length = 304
Score = 166 bits (420), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 75/197 (38%), Positives = 132/197 (67%), Gaps = 5/197 (2%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR-KEGFVVVETNFRMYAYS 59
+++D DLG++ + R W PT+LA LS T+++ R +EG+++V T++R+YAY+
Sbjct: 99 LLQDLQDLGIIYRKH--RDSKWLYPTQLAIGLSS--TEAAKRDQEGWIIVGTDYRIYAYT 154
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
+S + +L LF++IEYQLPN+++G + +E++ A + GI+A QI+ FL+ NAHP++
Sbjct: 155 SSPVKLLLLSLFTQIEYQLPNMVMGILLRENIRQAVQVGISANQILQFLETNAHPQMKQN 214
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +PE++ DQ+RLWE++ R+ ++P ++Y++F S F+ A YARD LL+ D+ K
Sbjct: 215 TPIIPESIADQLRLWEAEDRRLSLSPGYFYDDFASLAAFKKAEKYARDVGALLYSDAAKR 274
Query: 180 RLVVNAEIHMHMREFLR 196
L V + H +R +++
Sbjct: 275 FLFVTEQGHQLLRRYVK 291
>gi|384246869|gb|EIE20357.1| Tfb2-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 451
Score = 166 bits (419), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 86/197 (43%), Positives = 119/197 (60%), Gaps = 4/197 (2%)
Query: 4 DFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR--KEGFVVVETNFRMYAYSTS 61
D A LGL+ W PT+LA L+ + + +GFVVVETN+R+YAY++S
Sbjct: 252 DMAQLGLLMPFTAKDGSVWLAPTRLALALAGGSSGQAQHDVTDGFVVVETNYRVYAYTSS 311
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
L +LRLF++ E LPNL VG +T+ES+ A G++A+QI+ +L+Q+AHP VA R P
Sbjct: 312 LLQTALLRLFTRCECILPNLFVGVLTRESVTGALACGLSADQIVLYLRQHAHPHVASRTP 371
Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK--M 179
VPE V DQ+RLW++D RV A Y++FPS VF+ + AR LWED K
Sbjct: 372 VVPEVVADQVRLWQADTMRVRHNRAVLYDDFPSAQVFQLSAQKARTLGVWLWEDPKAGMG 431
Query: 180 RLVVNAEIHMHMREFLR 196
RL V H MRE+++
Sbjct: 432 RLAVQEAGHDAMREYIK 448
>gi|290987279|ref|XP_002676350.1| predicted protein [Naegleria gruberi]
gi|284089952|gb|EFC43606.1| predicted protein [Naegleria gruberi]
Length = 532
Score = 163 bits (413), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 83/211 (39%), Positives = 130/211 (61%), Gaps = 13/211 (6%)
Query: 1 MIKDFADLGLVKLQQVGRKES---WFIPTKLATNLSMSLTDS----------SARKEGFV 47
++ DF DLG++ L + +K+ +F PT LA +L++S++ S ++ G++
Sbjct: 319 LLVDFNDLGIIYLHRDKKKKIKDKYFFPTPLAKSLTVSMSTSYDLISSFGSHNSINNGYI 378
Query: 48 VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
+VETN+R+YAY+ S L +L LF EY+LPN++VG IT+ ++ A +NGI+A QI+ F
Sbjct: 379 IVETNYRVYAYTNSPLQIALLSLFIFPEYRLPNMVVGLITRSTIREALKNGISAHQILQF 438
Query: 108 LQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
L+ NAHP++ + P +P+ V DQI LWE + NRV T + Y++F + A DYAR
Sbjct: 439 LRLNAHPQMRLKKPVIPDTVSDQILLWEKERNRVIKTTSVVYDKFTNVSELNATVDYARR 498
Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
+ LLW +KM +V E H M+EF+ Q
Sbjct: 499 QGALLWHSEEKMMMVCKREFHGLMKEFISSQ 529
>gi|145351329|ref|XP_001420034.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580267|gb|ABO98327.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 495
Score = 162 bits (409), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 82/200 (41%), Positives = 118/200 (59%), Gaps = 9/200 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA-------RKEGFVVVETNF 53
++ D LGL+ +V +K+ +++PT LA+ LS T EG ++VETN+
Sbjct: 284 VVLDLTHLGLLYTFEV-KKKFYYVPTLLASGLSGGFTGDDGDTKAAAAAAEGHIIVETNY 342
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY++S + EILRLF++ +Y+LPNL VG +T+ES+ NA G+ AEQI+ +++ +AH
Sbjct: 343 RVYAYTSSAVEMEILRLFTRADYRLPNLYVGMLTRESVQNALRAGVDAEQIVGYIRAHAH 402
Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA-CDYARDRSGLL 172
+V + PSVP VCDQIRLW D+ R+E Y +FP F A A R LL
Sbjct: 403 KQVRRKKPSVPSTVCDQIRLWARDMERMEAEECVLYCDFPQAGTFYGAVVSEAEKRGALL 462
Query: 173 WEDSKKMRLVVNAEIHMHMR 192
W D RL V A H M+
Sbjct: 463 WRDDASRRLTVRASAHDEMK 482
>gi|440800489|gb|ELR21525.1| transcription factor tfb2 subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 465
Score = 159 bits (401), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 72/197 (36%), Positives = 131/197 (66%), Gaps = 5/197 (2%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR-KEGFVVVETNFRMYAYS 59
+++D DLG++ + + W PT+LA LS T+++ R +EG+++V T++R+YAY+
Sbjct: 258 LLQDLQDLGIIYRKH--KDSKWLYPTQLAIGLSS--TEAAKRDQEGWIIVGTDYRIYAYT 313
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
+S + +L LF++IEYQLPN+++G + +E++ A + GI+A QI+ FL+ NAHP++
Sbjct: 314 SSPVKLLLLSLFTQIEYQLPNMVMGILLRENIRQAVQVGISANQILQFLETNAHPQMKQN 373
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +PE++ DQ+RLWE++ R+ ++ ++Y++F S F+ A YARD L++ D+ K
Sbjct: 374 TPIIPESIADQLRLWEAEDRRLSLSSGYFYDDFASLAAFKKAEKYARDVGALIYSDATKR 433
Query: 180 RLVVNAEIHMHMREFLR 196
L V+ H +R +++
Sbjct: 434 FLFVSEPGHQLLRRYVK 450
>gi|390594544|gb|EIN03954.1| transcription factor Tfb2 [Punctularia strigosozonata HHB-11173
SS5]
Length = 467
Score = 157 bits (397), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 82/199 (41%), Positives = 123/199 (61%), Gaps = 6/199 (3%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL----TDSSARKEGFVVVETNFRMY 56
M++D D GL+ Q+ K F PT+LAT L+ S T S EGF+V+ETN+R+Y
Sbjct: 263 MLEDLRDYGLIWQQKPTSKR--FSPTRLATTLTSSSPPLPTSGSGVSEGFIVLETNYRLY 320
Query: 57 AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV 116
AY+ + L +L LF+ + + PNL+VG IT+ES+ A ++GI+AEQIIS+L +AHP++
Sbjct: 321 AYTDNPLQTAVLALFTTLRSRFPNLVVGQITRESVKRALQSGISAEQIISYLSTHAHPQM 380
Query: 117 ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
P +P V DQIRLWE + NR++ Y EF S+ +E +YAR+ + +LWE+
Sbjct: 381 RKNNPLIPVTVQDQIRLWELEKNRLKSQEGFLYTEFASQGDYEYVLNYARELNVVLWENP 440
Query: 177 KKMRLVVNAEIHMHMREFL 195
K + E H ++R F+
Sbjct: 441 VKRCFFGSMEGHANIRGFI 459
>gi|403180193|ref|XP_003338499.2| hypothetical protein PGTG_19983 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375166015|gb|EFP94080.2| hypothetical protein PGTG_19983 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 679
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 77/201 (38%), Positives = 123/201 (61%), Gaps = 9/201 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL-----TDSSARKEGFVVVETNFRM 55
+++D D GLV + R F PT+LAT L+ + + + + GF+V+ETN+R+
Sbjct: 102 VLQDLVDYGLVLVSAPDR----FYPTRLATTLTSTAPPLVSAERAQEEHGFLVLETNYRI 157
Query: 56 YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
YAY+++ L +L LF + Y+ PNL+VGA+T+ES+ +A NGITA+Q+I +L +AHP+
Sbjct: 158 YAYTSNPLQIAVLNLFLSLRYRFPNLVVGAVTRESIKSALSNGITADQVIMYLHTHAHPQ 217
Query: 116 VADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWED 175
+ P +P V DQIRLWE + NR+ + YE+F S +++ Y+R +LWE
Sbjct: 218 MRKLEPLLPPTVVDQIRLWELEKNRIRAQEGYLYEDFKSAAEYDSVIQYSRKLGIVLWEH 277
Query: 176 SKKMRLVVNAEIHMHMREFLR 196
+ +L V + H+ +REF R
Sbjct: 278 AGLRKLFVGYDGHLTLREFFR 298
>gi|403163300|ref|XP_003323393.2| hypothetical protein PGTG_04930 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163998|gb|EFP78974.2| hypothetical protein PGTG_04930 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 887
Score = 157 bits (396), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 77/201 (38%), Positives = 123/201 (61%), Gaps = 9/201 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL-----TDSSARKEGFVVVETNFRM 55
+++D D GLV + R F PT+LAT L+ + + + + GF+V+ETN+R+
Sbjct: 307 VLQDLVDYGLVLVSAPDR----FYPTRLATTLTSTAPPLVSAERAQEEHGFLVLETNYRI 362
Query: 56 YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
YAY+++ L +L LF + Y+ PNL+VGA+T+ES+ +A NGITA+Q+I +L +AHP+
Sbjct: 363 YAYTSNPLQIAVLNLFLSLRYRFPNLVVGAVTRESIKSALSNGITADQVIMYLHTHAHPQ 422
Query: 116 VADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWED 175
+ P +P V DQIRLWE + NR+ + YE+F S +++ Y+R +LWE
Sbjct: 423 MRKLEPLLPPTVVDQIRLWELEKNRIRAQEGYLYEDFKSAAEYDSVIQYSRKLGIVLWEH 482
Query: 176 SKKMRLVVNAEIHMHMREFLR 196
+ +L V + H+ +REF R
Sbjct: 483 AGLRKLFVGYDGHLTLREFFR 503
>gi|72074363|ref|XP_796111.1| PREDICTED: general transcription factor IIH subunit 4
[Strongylocentrotus purpuratus]
Length = 469
Score = 156 bits (395), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 82/195 (42%), Positives = 122/195 (62%), Gaps = 4/195 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
++ +LGLV Q RK + PT+LA NL+ ++ + K+GF+VVETNFR+YAY+
Sbjct: 267 LQHLRELGLVF--QRKRKSMRYYPTRLAINLASGVSSMAKDDHKDGFIVVETNFRVYAYT 324
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S L EIL LF + Y+ PNL V A+T+ES+ A NGITAEQI+SFL+ +AHP + +
Sbjct: 325 ESDLQVEILGLFCSMMYRFPNLSVAALTRESVQLAISNGITAEQILSFLRTHAHPNMRLK 384
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P VP + DQ+RLWE + +R+ T Y EF S FE DYA+D L+W+ + +
Sbjct: 385 TPIVPPTISDQVRLWELERDRLSFTQGIIYNEFLSLHDFEVLRDYAKDLGVLIWDSTARR 444
Query: 180 RLVVNAEIHMHMREF 194
++V+ H ++++
Sbjct: 445 IMIVSPAGHDSVKKY 459
>gi|260810774|ref|XP_002600123.1| hypothetical protein BRAFLDRAFT_57175 [Branchiostoma floridae]
gi|229285409|gb|EEN56135.1| hypothetical protein BRAFLDRAFT_57175 [Branchiostoma floridae]
Length = 457
Score = 156 bits (394), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 79/197 (40%), Positives = 125/197 (63%), Gaps = 4/197 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD--SSARKEGFVVVETNFRMYAYS 59
++ +LGLV Q RK F PT+LA +L+ ++D KEG++VVE+N+R+YAY+
Sbjct: 256 LQHLRELGLVN--QRKRKSGRFYPTRLAIHLASGISDVEKDFHKEGYLVVESNYRIYAYT 313
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS+I Y+ PN++V +T++S+ A GITA+QI+ FL+ NAHP+ R
Sbjct: 314 DSELQVALIGLFSEILYRFPNMVVANLTRDSVQEAVVRGITADQILHFLRVNAHPKALHR 373
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
+P VP + DQIRLWE + +R+ T Y +F S+ FE +YA+D LLWE++ K
Sbjct: 374 VPIVPPTISDQIRLWEMERDRLTFTEGVLYNQFLSQPDFEMLRNYAKDLGVLLWENNPKR 433
Query: 180 RLVVNAEIHMHMREFLR 196
+VV+ H ++ + +
Sbjct: 434 LMVVSKAGHDDVKRYWK 450
>gi|308452336|ref|XP_003089005.1| hypothetical protein CRE_21072 [Caenorhabditis remanei]
gi|308243592|gb|EFO87544.1| hypothetical protein CRE_21072 [Caenorhabditis remanei]
Length = 468
Score = 155 bits (391), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 80/203 (39%), Positives = 133/203 (65%), Gaps = 9/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM--SLTDSSARK--EGFVVVETNFRMY 56
++ +LG++ +++ RK+ F T L T+L+ ++ D+SA + G V+VETNFR+Y
Sbjct: 258 LLNHLRELGVIFIRK--RKDGVFFLTHLLTHLATNETIDDTSAERVSNGKVIVETNFRVY 315
Query: 57 AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
AY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GITA QIISFL+ NAHP+
Sbjct: 316 AYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 375
Query: 116 --VADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
+ I +P V DQIRLWE + R+++ A+ Y F S D F+ CDYA++R LLW
Sbjct: 376 IATSGAINCLPITVADQIRLWEDERRRMDLKDAYIYSHFESEDEFQGVCDYAQERGILLW 435
Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
++++ ++VN E H ++R++ +
Sbjct: 436 ANAQQKLVIVNEEGHEYVRQWYK 458
>gi|308460924|ref|XP_003092760.1| hypothetical protein CRE_20979 [Caenorhabditis remanei]
gi|308252519|gb|EFO96471.1| hypothetical protein CRE_20979 [Caenorhabditis remanei]
Length = 470
Score = 155 bits (391), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 80/203 (39%), Positives = 133/203 (65%), Gaps = 9/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM--SLTDSSARK--EGFVVVETNFRMY 56
++ +LG++ +++ RK+ F T L T+L+ ++ D+SA + G V+VETNFR+Y
Sbjct: 261 LLNHLRELGVIFIRK--RKDGVFFLTHLLTHLATNETIDDTSAERVSNGKVIVETNFRVY 318
Query: 57 AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
AY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GITA QIISFL+ NAHP+
Sbjct: 319 AYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 378
Query: 116 --VADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
+ I +P V DQIRLWE + R+++ A+ Y F S D F+ CDYA++R LLW
Sbjct: 379 IATSGAINCLPITVADQIRLWEDERRRMDLKDAYIYSHFESEDEFQGVCDYAQERGILLW 438
Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
++++ ++VN E H ++R++ +
Sbjct: 439 ANAQQKLVIVNEEGHEYVRQWYK 461
>gi|395332344|gb|EJF64723.1| transcription factor Tfb2 [Dichomitus squalens LYAD-421 SS1]
Length = 506
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 79/203 (38%), Positives = 122/203 (60%), Gaps = 10/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL--------TDSSARKEGFVVVETN 52
M++D D GL++ + ++ F PT+LAT L+ S T + +GF+V+ETN
Sbjct: 298 MLEDLRDYGLIRQRTATSRK--FSPTRLATTLTSSSPPLPTSAGTGDGSHVQGFIVLETN 355
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ + L +L LF ++Y+ PNL+VG IT++S+ A NGITA+QIIS+L +A
Sbjct: 356 YRLYAYTDNPLQIAVLNLFVTLKYRFPNLVVGQITRDSVKKALANGITADQIISYLITHA 415
Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
HP++ P +P V DQIRLWE + NRV+ + Y F S +E DYA+ +L
Sbjct: 416 HPQMRKNKPLLPVTVQDQIRLWELERNRVKSQEGYLYTAFASHADYEWVLDYAKKLGVVL 475
Query: 173 WEDSKKMRLVVNAEIHMHMREFL 195
WE+ K + E HM++R ++
Sbjct: 476 WENPSKRCFFGSVEGHMNIRGYI 498
>gi|343428936|emb|CBQ72481.1| probable TFB2-TFIIH subunit (transcription/repair factor)
[Sporisorium reilianum SRZ2]
Length = 493
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 79/203 (38%), Positives = 129/203 (63%), Gaps = 9/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS-----LTDSSARKE--GFVVVETNF 53
M++DF D GLV ++ + F PT+LAT L+ S L+ + +E G++++ETN+
Sbjct: 289 MLEDFRDYGLVYQRKASSRR--FYPTRLATTLTSSAAVPLLSSNGTEQEERGYIILETNY 346
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY++++L +L LF I+ + PNL+VG+IT++S+ +A NGITAEQII++L +AH
Sbjct: 347 RLYAYTSNRLRVAVLSLFVTIKARFPNLVVGSITRDSVKSALANGITAEQIITYLTHHAH 406
Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
++ P +P V DQIRLWE + NRV+ + +F S+ FE +YA+ L+W
Sbjct: 407 VQMHRNDPLLPVTVSDQIRLWEREKNRVQQNLGSLFTDFTSQFDFEEVRNYAKQLGVLVW 466
Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
ED K R V+ ++ +R+++R
Sbjct: 467 EDEGKRRFFVDEAGNVPVRDYIR 489
>gi|390370355|ref|XP_783140.3| PREDICTED: general transcription factor IIH subunit 4-like, partial
[Strongylocentrotus purpuratus]
Length = 180
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 77/179 (43%), Positives = 114/179 (63%), Gaps = 2/179 (1%)
Query: 18 RKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIE 75
RK + PT+LA NL+ ++ + K+GF+VVETNFR+YAY+ S L EIL LF +
Sbjct: 1 RKSMRYYPTRLAINLASGVSSMAKDDHKDGFIVVETNFRIYAYTESDLQVEILGLFCSMM 60
Query: 76 YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWE 135
Y+ PNL V A+T+ES+ A NGITAEQI+SFL+ +AHP + + P VP + DQ+RLWE
Sbjct: 61 YRFPNLSVAALTRESVQLAISNGITAEQILSFLRTHAHPNMRLKTPIVPPTISDQVRLWE 120
Query: 136 SDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREF 194
+ +R+ T Y EF S FE DYA+D L+W+ + + ++V+ H ++++
Sbjct: 121 LERDRLSFTQGIIYNEFLSLHDFEVLRDYAKDLGVLIWDSTARRIMIVSPAGHDSVKKY 179
>gi|341886092|gb|EGT42027.1| hypothetical protein CAEBREN_00374 [Caenorhabditis brenneri]
Length = 460
Score = 154 bits (388), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 82/205 (40%), Positives = 134/205 (65%), Gaps = 9/205 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM--SLTDSSARK--EGFVVVETNFRMY 56
++ +LG++ +++ RK+ F T+L T+L+ ++ D+SA K G V+VETNFR+Y
Sbjct: 249 LLNHLRELGVIFIRK--RKDGVFFLTQLLTHLATNETIDDTSAEKVSNGKVIVETNFRVY 306
Query: 57 AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
AY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GITA QIISFL+ NAHP+
Sbjct: 307 AYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 366
Query: 116 VADRIP--SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
+A P +P V DQIRLWE + R+++ ++ Y F S + F CDYA++R LLW
Sbjct: 367 IATSGPVNCLPITVADQIRLWEDERRRMDLKDSYIYSNFESDEEFNGVCDYAKERKILLW 426
Query: 174 EDSKKMRLVVNAEIHMHMREFLRGQ 198
D ++ ++VN E H ++R++ + Q
Sbjct: 427 ADYQRKLVIVNEEGHEYVRQWYKQQ 451
>gi|412992641|emb|CCO18621.1| predicted protein [Bathycoccus prasinos]
Length = 542
Score = 153 bits (387), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 81/185 (43%), Positives = 118/185 (63%), Gaps = 11/185 (5%)
Query: 22 WFIPTKLATNLSMSLTDSSA-----RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEY 76
W++PT L+ LS T SSA R +G ++VETNFR+YAY+ S+L E+LRLF++ +Y
Sbjct: 358 WYVPTVLSAGLSSVSTTSSAKSALARIDGHIIVETNFRVYAYTHSELETEVLRLFTRPDY 417
Query: 77 QLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR--VADRI--PSVPENVCDQIR 132
+LPN VG IT++S+ +A GI+ +QI+++LQ AHPR + R P+VP VCDQIR
Sbjct: 418 KLPNAYVGMITRDSILDAMRAGISPDQIVNYLQTRAHPRCTIGKRPNHPAVPPTVCDQIR 477
Query: 133 LWESDLNRVEMTPAHYYEEFPSR-DVFEAACDYARDRSG-LLWEDSKKMRLVVNAEIHMH 190
LW DL RV+ Y +FP + + F+ A + AR+ +LW D K R V+A+ H
Sbjct: 478 LWARDLYRVKADDCVMYTDFPMQGNQFQDAVNNARNVGAQILWMDENKRRFAVDADSHER 537
Query: 191 MREFL 195
++ FL
Sbjct: 538 LKVFL 542
>gi|384484453|gb|EIE76633.1| hypothetical protein RO3G_01337 [Rhizopus delemar RA 99-880]
Length = 284
Score = 153 bits (387), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 77/218 (35%), Positives = 126/218 (57%), Gaps = 24/218 (11%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS----------------------LTD 38
M++D D G+V Q + + PT+LAT L+ TD
Sbjct: 63 MLEDLRDYGIVY--QRKKHSKRYYPTRLATTLTSGKSALATVAGKYNHMMQETNIDDTTD 120
Query: 39 SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
+ + +GF+++ETN+++YAY+ S L +L LF +++ + N++ G IT++S+ NA G
Sbjct: 121 TESVDQGFIILETNYKLYAYTDSPLQIAVLNLFVQLQSRFRNMVTGVITRDSIRNALMKG 180
Query: 99 ITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVF 158
ITAEQII +LQ +AHP++ P +P V DQIRLWE + NR++ TP++ Y EF + F
Sbjct: 181 ITAEQIIYYLQSHAHPQMRKETPVLPLTVVDQIRLWEMERNRLKPTPSYLYHEFNVQADF 240
Query: 159 EAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
+AA YARD LLW +++K + + H +++ F++
Sbjct: 241 DAAEKYARDLGVLLWSNNQKRTMAITEAGHENVKGFVK 278
>gi|242021693|ref|XP_002431278.1| TFIIH basal transcription factor complex p52 subunit, putative
[Pediculus humanus corporis]
gi|212516535|gb|EEB18540.1| TFIIH basal transcription factor complex p52 subunit, putative
[Pediculus humanus corporis]
Length = 467
Score = 153 bits (387), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 127/202 (62%), Gaps = 5/202 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GL+ Q R+ F PT+LA N++ S KEG++++ETN+R+YAY+
Sbjct: 265 LQHLREFGLI--YQRKRRGGRFYPTRLALNIACGENKSLQQMNKEGYIIIETNYRVYAYT 322
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR-VAD 118
S L +L LF ++ Y+ PNL VG IT++S+ AF++GITAEQI+ FL+ +AHPR +A
Sbjct: 323 DSNLQVALLGLFCEMLYRFPNLSVGLITRDSVRQAFKSGITAEQIVGFLRLHAHPRMIAV 382
Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
P++P V DQI+LWE++LNR+ + Y +F S+ FEA D A + L+WE+ KK
Sbjct: 383 GPPTLPPTVVDQIKLWENELNRLVYSDGVLYSQFLSQADFEALRDRANELGVLVWENDKK 442
Query: 179 MRLVVNAEIHMHMREFLRGQNK 200
+VV H +++F + +K
Sbjct: 443 RTMVVTKSGHDDVKKFWKSYSK 464
>gi|328873270|gb|EGG21637.1| general transcription factor IIH [Dictyostelium fasciculatum]
Length = 453
Score = 152 bits (385), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 80/207 (38%), Positives = 125/207 (60%), Gaps = 15/207 (7%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT-----------DSSARKEGFVVV 49
M+ D + GLV ++ K F PT+LA +L+ T +S + +G++++
Sbjct: 247 MLFDLKEFGLVYVR--SEKSDVFYPTRLAISLATGRTVSLMNDLAQEISTSQKDQGYLIL 304
Query: 50 ETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQ 109
ETNFR+YAY+ S L ++ LF K+ Y+LPNL VG +T+ES+ +A +GITA+QI+ F++
Sbjct: 305 ETNFRIYAYTQSSLQISLISLFVKMLYRLPNLAVGILTRESVRSALLHGITADQIVDFVK 364
Query: 110 QNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRS 169
QNAHP + I PE V +QIRLWES+ NR+ A ++ FP+ + F +A+D+
Sbjct: 365 QNAHPNMV--ISGYPEVVFEQIRLWESERNRITYKKAVLFDSFPNAESFSKTVTFAKDQY 422
Query: 170 GLLWEDSKKMRLVVNAEIHMHMREFLR 196
L+W D K LVVN E +R +++
Sbjct: 423 FLVWFDETKKMLVVNDEGSEPIRNYIK 449
>gi|402224480|gb|EJU04542.1| transcription factor Tfb2 [Dacryopinax sp. DJM-731 SS1]
Length = 466
Score = 152 bits (384), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 79/204 (38%), Positives = 128/204 (62%), Gaps = 9/204 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----SMSLTDS----SARKEGFVVVETN 52
M++DF D GL+ ++ + F PT+LAT L S++ T S +A EGF+++ETN
Sbjct: 259 MLQDFLDYGLI-YRRNPDDHTRFYPTRLATTLTSTSSLAFTSSKHEKAASSEGFIILETN 317
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ + L +L LF + + NL++G +T+ES+ A NGITA+QIIS+L +A
Sbjct: 318 YRVYAYTENPLQIAVLNLFVALHSRFENLVIGRLTRESIKAALANGITADQIISYLTVHA 377
Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
HP + P +P V DQIRLW+ + NR++ + YE+F S+ F+ +YA+ +L
Sbjct: 378 HPMMHKNNPVLPVTVQDQIRLWQLEKNRLKSENGYLYEDFNSQGDFDLVLNYAKQLDVVL 437
Query: 173 WEDSKKMRLVVNAEIHMHMREFLR 196
WE+ +K ++ V + H ++REF+R
Sbjct: 438 WENREKRKMFVREDGHENVREFIR 461
>gi|393221241|gb|EJD06726.1| transcription factor Tfb2 [Fomitiporia mediterranea MF3/22]
Length = 464
Score = 152 bits (383), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 75/203 (36%), Positives = 124/203 (61%), Gaps = 10/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL--------TDSSARKEGFVVVETN 52
M++D D GL+ ++ + F PT+LAT L+ SL ++A+ +GF+++ETN
Sbjct: 255 MLEDLRDYGLIWQKKASSRR--FYPTRLATTLTSSLPPLPTSGSGGANAQAQGFIILETN 312
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ + L +L LF ++Y+ PNL+VG++T++S+ A NGITA+QII +L +A
Sbjct: 313 YRIYAYTDNPLQTAVLNLFVTLKYRFPNLVVGSLTRDSVRKALANGITADQIIKYLTTHA 372
Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
HP++ P +P V DQIRLWE + NR++ + Y F S+ +E YA+ +L
Sbjct: 373 HPQMRKNDPLIPVTVQDQIRLWELERNRLKSQEGYLYTAFASQADYEFVLKYAKQLGVVL 432
Query: 173 WEDSKKMRLVVNAEIHMHMREFL 195
WE++ K + E H ++R F+
Sbjct: 433 WENAAKRCFFGSLEGHANIRGFI 455
>gi|388857860|emb|CCF48525.1| probable TFB2-TFIIH subunit (transcription/repair factor) [Ustilago
hordei]
Length = 492
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 80/203 (39%), Positives = 127/203 (62%), Gaps = 9/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS-----LTDSSARKE--GFVVVETNF 53
M++DF D GLV ++ + F PT+LAT L+ S L+ + A +E G++++ETN+
Sbjct: 288 MLEDFRDYGLVYQRKASSRR--FYPTRLATTLTSSAAIPLLSSNGAEQEERGYIILETNY 345
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY+++ L +L LF I+ + PNL+VG+IT++S+ +A NGITAEQII++L +AH
Sbjct: 346 RLYAYTSNPLRVAVLSLFVTIKARFPNLVVGSITRDSVKSALANGITAEQIITYLTHHAH 405
Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
++ P +P V DQIRLWE + NRV + +F S+ FE +YA+ L+W
Sbjct: 406 VQMHRNDPLLPVTVSDQIRLWEREKNRVVQNLGSLFTDFTSQFDFEEVRNYAKQLGVLVW 465
Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
ED K R V+ + +R+++R
Sbjct: 466 EDESKRRFFVDEAGNEPVRDYIR 488
>gi|409077631|gb|EKM77996.1| hypothetical protein AGABI1DRAFT_114851 [Agaricus bisporus var.
burnettii JB137-S8]
gi|426198975|gb|EKV48900.1| hypothetical protein AGABI2DRAFT_191077 [Agaricus bisporus var.
bisporus H97]
Length = 467
Score = 151 bits (382), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 80/208 (38%), Positives = 127/208 (61%), Gaps = 11/208 (5%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS-------LTDSSARKEGFVVVETNF 53
+++D D GL+ Q K F PT+L T L+ S ++ SS +GF+V+ETN+
Sbjct: 259 LLEDLRDYGLI--WQSKPKSRRFSPTRLGTTLTSSSPPLPTTISASSGPLDGFIVLETNY 316
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY+ + L +L LF ++Y+ PNL+VG IT++S+ A NGITA+QIIS+L +AH
Sbjct: 317 RIYAYTDNPLQTAVLNLFVSLKYRFPNLVVGMITRDSVRRALMNGITADQIISYLITHAH 376
Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
P++ P +P V DQIRLWE + NR++ + + Y F S+ ++ +YA+ +LW
Sbjct: 377 PQMRKNNPLLPVTVQDQIRLWELEKNRLKSSEGYLYTAFTSQADYDLVLNYAKQLDVVLW 436
Query: 174 EDSKKMRLVVNAEIHMHMREFL--RGQN 199
E++ K + E H+ ++EF+ R QN
Sbjct: 437 ENAAKRCFFGSLEGHLRIKEFIERRTQN 464
>gi|147899535|ref|NP_001085110.1| general transcription factor IIH, polypeptide 4, 52kDa [Xenopus
laevis]
gi|47939830|gb|AAH72322.1| MGC83106 protein [Xenopus laevis]
Length = 455
Score = 151 bits (381), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/201 (39%), Positives = 125/201 (62%), Gaps = 4/201 (1%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NL+ ++ S + K+GF+VVETN+R+YAY+
Sbjct: 255 LQHLREFGLVF--QRKRKSRRYYPTRLAINLASGISGSVVDSHKQGFIVVETNYRIYAYT 312
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PNL+V +T+E++ A NGITAEQII FL+ AHP + +
Sbjct: 313 DSELQIALIALFSEMLYRFPNLVVAQVTRENVQQAIGNGITAEQIIHFLRTRAHPVMLQQ 372
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P++P + DQIRLWE + +R+ + Y +F S+ FE +YARD L++E+ K
Sbjct: 373 NPALPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLRNYARDLGVLVFENPAKR 432
Query: 180 RLVVNAEIHMHMREFLRGQNK 200
+VV H ++ F + Q +
Sbjct: 433 VMVVTPGGHSDVKRFWKRQKQ 453
>gi|442757633|gb|JAA70975.1| Putative rna polymer [Ixodes ricinus]
Length = 459
Score = 151 bits (381), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/197 (38%), Positives = 120/197 (60%), Gaps = 4/197 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR--KEGFVVVETNFRMYAYS 59
++ + GLV Q R+ F PT+LA NL+ L +++ R + G++VVETN+R+YAY+
Sbjct: 259 LQHLREFGLVY--QRKRRSGRFYPTRLAINLASGLKETNLRSYESGYIVVETNYRVYAYT 316
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L +L LF ++ Y+LPNL+VG +T+ES+ A +GIT+ QII FLQ +AHP +
Sbjct: 317 DSQLQVALLALFCELMYRLPNLVVGVLTRESVRQALRSGITSNQIIKFLQMHAHPEAQKQ 376
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQ+RLWE + +R Y +F S+ F+ +YA D L+W++ K
Sbjct: 377 SPVIPPTIMDQLRLWELERDRFNFREGVLYSQFISQSDFQLLRNYASDLGVLIWDNPSKR 436
Query: 180 RLVVNAEIHMHMREFLR 196
+VVN H ++ F +
Sbjct: 437 VMVVNRNGHDEVKRFWK 453
>gi|336370852|gb|EGN99192.1| hypothetical protein SERLA73DRAFT_90464 [Serpula lacrymans var.
lacrymans S7.3]
gi|336383609|gb|EGO24758.1| hypothetical protein SERLADRAFT_361686 [Serpula lacrymans var.
lacrymans S7.9]
Length = 469
Score = 150 bits (380), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 77/203 (37%), Positives = 122/203 (60%), Gaps = 10/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL--------TDSSARKEGFVVVETN 52
M++D D GLV ++ K F PT+LAT L+ S T + +GF+V+ETN
Sbjct: 261 MLEDLRDYGLVWQRKATSKR--FSPTRLATTLTSSCPPLPTSTGTSGGPQSQGFIVLETN 318
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ L +L LF ++Y+ PNL+VG +T+ES+ A NGI+AEQIIS+L +A
Sbjct: 319 YRIYAYTDKPLQTAVLNLFITMKYRFPNLVVGMLTRESVKKALSNGISAEQIISYLTTHA 378
Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
HP++ P +P V DQIRLWE + NR++ + Y++F S +E +YA+ +L
Sbjct: 379 HPQMRKNNPLLPVTVQDQIRLWELERNRLKSEEGYLYKDFGSHADYEYVLNYAKQLDVVL 438
Query: 173 WEDSKKMRLVVNAEIHMHMREFL 195
WE++ + + + H ++R F+
Sbjct: 439 WENTSRRCFFGSLDGHTNIRGFI 461
>gi|71024427|ref|XP_762443.1| hypothetical protein UM06296.1 [Ustilago maydis 521]
gi|46097692|gb|EAK82925.1| hypothetical protein UM06296.1 [Ustilago maydis 521]
Length = 491
Score = 150 bits (380), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 78/203 (38%), Positives = 128/203 (63%), Gaps = 9/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS-----LTDSSARKE--GFVVVETNF 53
M++DF D GLV ++ + F PT+LAT L+ S L+ + +E G++++ETN+
Sbjct: 287 MLEDFRDYGLVYQRKASSRR--FYPTRLATTLTSSAAVPLLSSNGTEQEERGYIILETNY 344
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY+++ L +L LF I+ + PNL+VG+IT++S+ +A NGITAEQII++L +AH
Sbjct: 345 RLYAYTSNPLRVAVLSLFVTIKARFPNLVVGSITRDSVKSALANGITAEQIITYLTHHAH 404
Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
++ P +P V DQIRLWE + NRV+ + +F S+ FE +YA+ L+W
Sbjct: 405 LQMHRNDPLLPVTVSDQIRLWEREKNRVQQNLGSLFTDFTSQFDFEEVRNYAKQLGVLVW 464
Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
+D +K R V+ + +R+++R
Sbjct: 465 QDEEKRRFFVDEAGNEPVRDYIR 487
>gi|393246879|gb|EJD54387.1| transcription factor Tfb2 [Auricularia delicata TFB-10046 SS5]
Length = 480
Score = 150 bits (380), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 75/202 (37%), Positives = 119/202 (58%), Gaps = 7/202 (3%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD-------SSARKEGFVVVETNF 53
++ + D GL+ + + F PT+LAT L SL + +SA GF+++ETN+
Sbjct: 267 VLGELLDYGLIYQRALPGMSKRFFPTRLATTLMSSLPELPRTAGVASATSGGFIILETNY 326
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY+ + L +L LF + + PNL+VG +T++S+ A NGITAEQII++L +AH
Sbjct: 327 RLYAYTDNPLQIAVLNLFVSFKSRFPNLVVGMVTRDSVKKALANGITAEQIITYLSAHAH 386
Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
P++ P +P V DQ+RLWE + NRV+ Y +F S+ FE DYAR ++W
Sbjct: 387 PQMRKNNPLLPVTVQDQVRLWELEKNRVKAEEGFLYMDFTSQADFELVLDYARKLGVVIW 446
Query: 174 EDSKKMRLVVNAEIHMHMREFL 195
E+ ++ A+ H ++R F+
Sbjct: 447 ENGRQRMFFGKADGHNNIRTFI 468
>gi|453232318|ref|NP_001263825.1| Protein Y73F8A.24, isoform d [Caenorhabditis elegans]
gi|412976516|emb|CCO25635.1| Protein Y73F8A.24, isoform d [Caenorhabditis elegans]
Length = 435
Score = 150 bits (378), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 81/203 (39%), Positives = 131/203 (64%), Gaps = 9/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM--SLTDSSARK--EGFVVVETNFRMY 56
++ +LG++ +++ RK+ F T+L T+L+ ++ D SA K G V+VETNFR+Y
Sbjct: 216 LLNHLRELGVIFIRK--RKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVY 273
Query: 57 AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
AY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GITA QIISFL+ NAHP+
Sbjct: 274 AYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 333
Query: 116 VADRIP--SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
+A P +P V DQIRLWE + R+ + A+ Y F S D F C+YAR ++ LLW
Sbjct: 334 IATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLW 393
Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
D+++ ++VN + H +R++ +
Sbjct: 394 SDNQQKLVIVNEDGHELVRQWYK 416
>gi|392901992|ref|NP_001255864.1| Protein Y73F8A.24, isoform b [Caenorhabditis elegans]
gi|290457493|emb|CBK19514.1| Protein Y73F8A.24, isoform b [Caenorhabditis elegans]
Length = 431
Score = 150 bits (378), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 82/203 (40%), Positives = 131/203 (64%), Gaps = 9/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT--DSSARK--EGFVVVETNFRMY 56
++ +LG++ +++ RK+ F T+L T+L+ + T D SA K G V+VETNFR+Y
Sbjct: 212 LLNHLRELGVIFIRK--RKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVY 269
Query: 57 AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
AY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GITA QIISFL+ NAHP+
Sbjct: 270 AYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 329
Query: 116 VADRIP--SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
+A P +P V DQIRLWE + R+ + A+ Y F S D F C+YAR ++ LLW
Sbjct: 330 IATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLW 389
Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
D+++ ++VN + H +R++ +
Sbjct: 390 SDNQQKLVIVNEDGHELVRQWYK 412
>gi|62751795|ref|NP_001015845.1| general transcription factor IIH, polypeptide 4, 52kDa [Xenopus
(Silurana) tropicalis]
gi|58475911|gb|AAH90134.1| general transcription factor II H, polypeptide 4 [Xenopus
(Silurana) tropicalis]
gi|89272862|emb|CAJ82116.1| transcription factor tfb2 [Xenopus (Silurana) tropicalis]
gi|114107989|gb|AAI22900.1| gtf2h4 protein [Xenopus (Silurana) tropicalis]
Length = 455
Score = 150 bits (378), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 79/199 (39%), Positives = 123/199 (61%), Gaps = 4/199 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NL+ ++ S + K+GF+VVETN+R+YAY+
Sbjct: 255 LQHLREFGLVF--QRKRKSRRYYPTRLAINLASGISGSVVDSHKQGFIVVETNYRIYAYT 312
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PNL+V +T+E++ A NGITAEQII FL+ AHP + +
Sbjct: 313 DSELQIALIALFSEMLYRFPNLVVAQVTRENVQQAIGNGITAEQIIHFLRTRAHPVMLQQ 372
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ + Y +F S+ FE +YARD L++E+ K
Sbjct: 373 NPVLPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLRNYARDLGVLVFENPAKR 432
Query: 180 RLVVNAEIHMHMREFLRGQ 198
+VV H ++ F + Q
Sbjct: 433 VMVVTPGGHSDVKRFWKRQ 451
>gi|443894209|dbj|GAC71558.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH, subunit TFB2 [Pseudozyma antarctica
T-34]
Length = 491
Score = 149 bits (377), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 77/203 (37%), Positives = 123/203 (60%), Gaps = 9/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS-------MSLTDSSARKEGFVVVETNF 53
M++DF D GLV ++ + F PT+LAT L+ +S S + G++++ETN+
Sbjct: 287 MLEDFRDYGLVYQRKASSRR--FYPTRLATTLTSSAAAPLLSTNGSEPEERGYIILETNY 344
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY+++ L +L LF I+ + PNL+VG+IT++S+ +A NGITAEQII++L +AH
Sbjct: 345 RLYAYTSNPLRVAVLSLFVTIKARFPNLLVGSITRDSVKSALANGITAEQIITYLTHHAH 404
Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
++ P +P V DQIRLWE + NRV+ + +F S+ F +YA L+W
Sbjct: 405 LQMHRNDPLLPVTVSDQIRLWEREKNRVQQNLGSLFTDFTSQPDFAEVRNYASQLGVLVW 464
Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
D K RL V+ + +R+++R
Sbjct: 465 HDEPKRRLFVDEAGNEPVRDYIR 487
>gi|453232316|ref|NP_001263824.1| Protein Y73F8A.24, isoform c [Caenorhabditis elegans]
gi|412976517|emb|CCO25636.1| Protein Y73F8A.24, isoform c [Caenorhabditis elegans]
Length = 485
Score = 149 bits (376), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 81/203 (39%), Positives = 131/203 (64%), Gaps = 9/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM--SLTDSSARK--EGFVVVETNFRMY 56
++ +LG++ +++ RK+ F T+L T+L+ ++ D SA K G V+VETNFR+Y
Sbjct: 266 LLNHLRELGVIFIRK--RKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVY 323
Query: 57 AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
AY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GITA QIISFL+ NAHP+
Sbjct: 324 AYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 383
Query: 116 VADRIP--SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
+A P +P V DQIRLWE + R+ + A+ Y F S D F C+YAR ++ LLW
Sbjct: 384 IATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLW 443
Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
D+++ ++VN + H +R++ +
Sbjct: 444 SDNQQKLVIVNEDGHELVRQWYK 466
>gi|392901990|ref|NP_001255863.1| Protein Y73F8A.24, isoform a [Caenorhabditis elegans]
gi|26985915|emb|CAB70230.2| Protein Y73F8A.24, isoform a [Caenorhabditis elegans]
Length = 481
Score = 149 bits (376), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 82/203 (40%), Positives = 131/203 (64%), Gaps = 9/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT--DSSARK--EGFVVVETNFRMY 56
++ +LG++ +++ RK+ F T+L T+L+ + T D SA K G V+VETNFR+Y
Sbjct: 262 LLNHLRELGVIFIRK--RKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVY 319
Query: 57 AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
AY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GITA QIISFL+ NAHP+
Sbjct: 320 AYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 379
Query: 116 VADRIP--SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
+A P +P V DQIRLWE + R+ + A+ Y F S D F C+YAR ++ LLW
Sbjct: 380 IATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLW 439
Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
D+++ ++VN + H +R++ +
Sbjct: 440 SDNQQKLVIVNEDGHELVRQWYK 462
>gi|303272589|ref|XP_003055656.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463630|gb|EEH60908.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 476
Score = 149 bits (375), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 68/152 (44%), Positives = 103/152 (67%), Gaps = 2/152 (1%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
++VETNFR+YAY++S + EILRLF++ +Y+LPNL VG +T+E++ A GI+AEQI+S
Sbjct: 318 IIVETNFRVYAYTSSAVEVEILRLFTRPDYKLPNLYVGMMTREAVVTALRGGISAEQIVS 377
Query: 107 FLQQNAHPRVADRI-PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPS-RDVFEAACDY 164
+L+++AHP+ P++P VCDQIRLW D NRV+ TP Y +FP+ +FE +
Sbjct: 378 YLRKHAHPQARKTPGPAIPATVCDQIRLWSKDENRVKYTPCVLYCDFPTGTGMFEKVAEI 437
Query: 165 ARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
A++R LW D ++L V E H M++ +
Sbjct: 438 AKERGLYLWGDPVGLKLAVREEGHESMKDVFK 469
>gi|327266348|ref|XP_003217968.1| PREDICTED: general transcription factor IIH subunit 4-like [Anolis
carolinensis]
Length = 460
Score = 149 bits (375), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 122/200 (61%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSS--ARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + R +GF++VETN+R+YAY+
Sbjct: 260 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGISGITIDTRNQGFIIVETNYRIYAYT 317
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PNL+V +T+ES+ A NGITA+QII FL+ AHP + +
Sbjct: 318 DSELQIALIALFSEMLYRFPNLVVAQVTRESVQQAIANGITADQIIHFLRTRAHPVMLKQ 377
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ + Y +F S+ FE D+AR+ L++E+ K
Sbjct: 378 TPVLPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLRDHARELGVLIFENPSKR 437
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 438 LMVVTPAGHSDVKRFWKRQK 457
>gi|342319113|gb|EGU11064.1| RNA polymerase II transcription factor B subunit 2 [Rhodotorula
glutinis ATCC 204091]
Length = 496
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 80/205 (39%), Positives = 124/205 (60%), Gaps = 11/205 (5%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS---LTDS--SARKEGFVVVETNFRM 55
+++D AD GLV L + R F PT+LAT L+ S L S S ++GF+V+ETN+++
Sbjct: 275 VLRDLADYGLVYLPE--RNAPIFYPTRLATTLTSSAPPLVSSRHSNEEKGFIVLETNYKL 332
Query: 56 YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
YAY+++ L +L LF+ ++ + N + G IT+ES+ NGITA QIIS+L AHP+
Sbjct: 333 YAYTSNPLQIAVLGLFAHLKTRFANFVTGHITRESIRRGLANGITANQIISYLASRAHPQ 392
Query: 116 VADRIPS----VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGL 171
+ + S +P V DQIRLWE + R++ T + Y+EF S +E +YAR+ +
Sbjct: 393 MRAQAGSDDKLLPITVVDQIRLWEHERRRIQTTEGYLYDEFSSTHDYELVVNYAREIGSV 452
Query: 172 LWEDSKKMRLVVNAEIHMHMREFLR 196
L E K ++ V A+ H +REF++
Sbjct: 453 LLELPKARKVFVTADGHQQVREFIK 477
>gi|155371845|ref|NP_001094527.1| general transcription factor IIH subunit 4 [Bos taurus]
gi|148878127|gb|AAI46241.1| GTF2H4 protein [Bos taurus]
gi|296474219|tpg|DAA16334.1| TPA: general transcription factor IIH, polypeptide 4, 52kDa [Bos
taurus]
Length = 463
Score = 148 bits (373), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLMFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|281337500|gb|EFB13084.1| hypothetical protein PANDA_018954 [Ailuropoda melanoleuca]
Length = 463
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|338718559|ref|XP_003363845.1| PREDICTED: general transcription factor IIH subunit 4 [Equus
caballus]
Length = 463
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|426250586|ref|XP_004019016.1| PREDICTED: general transcription factor IIH subunit 4 [Ovis aries]
Length = 463
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|440896165|gb|ELR48176.1| General transcription factor IIH subunit 4 [Bos grunniens mutus]
Length = 463
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|178056954|ref|NP_001116592.1| general transcription factor IIH subunit 4 [Sus scrofa]
gi|41529162|dbj|BAD08424.1| general transcription factor IIH, polypeptide 4 [Sus scrofa]
gi|47496805|dbj|BAD08426.2| general transcription factor IIH, polypeptide 4 [Sus scrofa]
Length = 463
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 122/200 (61%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
++ +LGLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY+
Sbjct: 263 LQHLRELGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|417401351|gb|JAA47565.1| Putative rna polymer [Desmodus rotundus]
Length = 463
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHGDVKRFWKRQK 460
>gi|344307658|ref|XP_003422497.1| PREDICTED: general transcription factor IIH subunit 4 [Loxodonta
africana]
Length = 463
Score = 147 bits (371), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + +A + GF++VETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIIVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|359320969|ref|XP_003639474.1| PREDICTED: general transcription factor IIH subunit 4-like [Canis
lupus familiaris]
Length = 463
Score = 147 bits (371), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 122/200 (61%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLS--MSLTDSSARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS +S +A + GF++VETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIIVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|149031822|gb|EDL86757.1| rCG41794 [Rattus norvegicus]
Length = 209
Score = 147 bits (371), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 122/200 (61%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + + + GF+VVETN+R+YAY+
Sbjct: 9 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 66
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 67 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 126
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 127 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 186
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 187 LMVVTPAGHSDVKRFWKRQK 206
>gi|410958682|ref|XP_003985944.1| PREDICTED: general transcription factor IIH subunit 4 [Felis catus]
Length = 463
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 SPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|307106040|gb|EFN54287.1| hypothetical protein CHLNCDRAFT_25123 [Chlorella variabilis]
Length = 458
Score = 147 bits (370), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 75/183 (40%), Positives = 114/183 (62%), Gaps = 6/183 (3%)
Query: 17 GRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAYSTSKLHCEILRLFSK 73
G E W PT+LA L+ A +EG+V+VE+NFR+YAY+TS + +LR+F +
Sbjct: 271 GANELWLHPTRLAAVLAGGGRAGEAAVAPEEGYVIVESNFRVYAYTTSAVQVAVLRVFVR 330
Query: 74 IEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRL 133
+ LPNL VG IT+ES NA + GI A+Q+++FL+Q+AHPR A + P+V V DQIRL
Sbjct: 331 CDALLPNLFVGTITRESATNALDTGIAADQVVAFLRQHAHPRAAAKTPTV---VTDQIRL 387
Query: 134 WESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMRE 193
W +L R++ A Y++F S++++ A +AR + LL+ + +LVV + H M
Sbjct: 388 WAQELKRLQEKNATLYDKFESKELYVGAVAHARQLNALLYSCEDRRQLVVESAFHGLMVG 447
Query: 194 FLR 196
L+
Sbjct: 448 HLK 450
>gi|213402795|ref|XP_002172170.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces japonicus yFS275]
gi|212000217|gb|EEB05877.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces japonicus yFS275]
Length = 444
Score = 146 bits (369), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 76/191 (39%), Positives = 117/191 (61%), Gaps = 8/191 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD-----SSARKEGFVVVETNFRM 55
M++D + GLV ++ + F PT+LAT+L+ S +GF++VETN+R+
Sbjct: 243 MLEDLREYGLVYQKKSSSRR--FYPTRLATSLTTEYHSPVKGAGSDAGKGFIIVETNYRL 300
Query: 56 YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
YAY+ S L IL LF+ + + NLIVG IT++S+ A ++GITAEQII++L +AHP+
Sbjct: 301 YAYTDSPLQIAILGLFTNLRARFSNLIVGVITRDSIRRALQSGITAEQIITYLTTHAHPQ 360
Query: 116 VADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWED 175
+ IP +P + DQI LWE + NR+ TP + +F + FE A YA++ L+W+
Sbjct: 361 MRKEIPLLPPTLVDQIYLWELERNRLRATPGILFRDFLTDKDFEKAVQYAKELGVLVWDS 420
Query: 176 S-KKMRLVVNA 185
S K+M + NA
Sbjct: 421 SLKRMFFITNA 431
>gi|74143475|dbj|BAE28812.1| unnamed protein product [Mus musculus]
Length = 463
Score = 146 bits (369), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 121/200 (60%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQVIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ VFE +AR+ L++E+S K
Sbjct: 381 NPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVVFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|328766914|gb|EGF76966.1| hypothetical protein BATDEDRAFT_92242 [Batrachochytrium
dendrobatidis JAM81]
Length = 471
Score = 146 bits (368), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 116/206 (56%), Gaps = 10/206 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSS--------ARKEGFVVVETN 52
M+ D LGL+ Q +K + F PT LAT+L+ + + A +GF+++ETN
Sbjct: 259 MLDDLKHLGLI--YQRKKKSTRFYPTHLATSLTSGASAGAVLSPRSLEADSDGFIIIETN 316
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ S L +L LF ++ + N+++G IT++S+ A GI+AEQII++L +A
Sbjct: 317 YRVYAYTDSPLQIAVLSLFIEMRVRFANMVIGIITRDSVREALAKGISAEQIIAYLTTHA 376
Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
HP + P +P + DQ+RLWE + NR+ ++ H Y+ F + YA D L
Sbjct: 377 HPEMKKGSPVLPTTIVDQVRLWEMERNRLRISRGHLYQMFSGEQEYREILKYATDLGYEL 436
Query: 173 WEDSKKMRLVVNAEIHMHMREFLRGQ 198
W K +VV+AE H H++ F Q
Sbjct: 437 WHSDSKRLVVVSAEGHEHIKVFFSKQ 462
>gi|47059175|ref|NP_997666.1| general transcription factor II H, polypeptide 4 [Rattus
norvegicus]
gi|46237651|emb|CAE84027.1| general transcription factor II H, polypeptide 4 [Rattus
norvegicus]
gi|117558335|gb|AAI27470.1| General transcription factor II H, polypeptide 4 [Rattus
norvegicus]
Length = 463
Score = 146 bits (368), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 121/200 (60%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|4504201|ref|NP_001508.1| general transcription factor IIH subunit 4 [Homo sapiens]
gi|113865855|ref|NP_001038968.1| general transcription factor IIH subunit 4 [Pan troglodytes]
gi|114050799|ref|NP_001040607.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|297677636|ref|XP_002816695.1| PREDICTED: general transcription factor IIH subunit 4 isoform 2
[Pongo abelii]
gi|332245940|ref|XP_003272109.1| PREDICTED: general transcription factor IIH subunit 4 [Nomascus
leucogenys]
gi|426352309|ref|XP_004043656.1| PREDICTED: general transcription factor IIH subunit 4 [Gorilla
gorilla gorilla]
gi|17380328|sp|Q92759.1|TF2H4_HUMAN RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=Basic transcription factor 2 52 kDa subunit;
Short=BTF2 p52; AltName: Full=General transcription
factor IIH polypeptide 4; AltName: Full=TFIIH basal
transcription factor complex p52 subunit
gi|38503276|sp|P60027.1|TF2H4_PANTR RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=Basic transcription factor 2 52 kDa subunit;
Short=BTF2 p52; AltName: Full=General transcription
factor IIH polypeptide 4; AltName: Full=TFIIH basal
transcription factor complex p52 subunit
gi|1514597|emb|CAA68870.1| transcription factor TFIIH [Homo sapiens]
gi|13436278|gb|AAH04935.1| General transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|15277224|dbj|BAB63317.1| Transcription factor II H [Homo sapiens]
gi|16740884|gb|AAH16302.1| General transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|21655317|gb|AAM64222.1| general transcription factor IIH, polypeptide 4 (52kD subunit)
[Homo sapiens]
gi|27544403|dbj|BAC54936.1| transcription factor II H [Homo sapiens]
gi|30583481|gb|AAP35985.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|32127781|dbj|BAC78171.1| transcription factor II H [Pan troglodytes]
gi|55700798|dbj|BAD69753.1| general transcription factor IIH, polypeptide 4, 52kDa [Macaca
mulatta]
gi|60655839|gb|AAX32483.1| general transcription factor IIH polypeptide 4 [synthetic
construct]
gi|60655841|gb|AAX32484.1| general transcription factor IIH polypeptide 4 [synthetic
construct]
gi|86197966|dbj|BAE78622.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|90960934|dbj|BAE92819.1| general transcription factor IIH, polypeptide 4 [Pan troglodytes]
gi|90960936|dbj|BAE92820.1| general transcription factor IIH, polypeptide 4 [Pan troglodytes]
gi|114306784|dbj|BAF31271.1| TFIIH protein [Homo sapiens]
gi|119623752|gb|EAX03347.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|123992979|gb|ABM84091.1| general transcription factor IIH, polypeptide 4, 52kDa [synthetic
construct]
gi|123999907|gb|ABM87462.1| general transcription factor IIH, polypeptide 4, 52kDa [synthetic
construct]
gi|355561507|gb|EHH18139.1| General transcription factor IIH polypeptide 4, partial [Macaca
mulatta]
gi|355748409|gb|EHH52892.1| General transcription factor IIH polypeptide 4 [Macaca
fascicularis]
gi|380812138|gb|AFE77944.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|380812140|gb|AFE77945.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|383408563|gb|AFH27495.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|410214852|gb|JAA04645.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
gi|410250640|gb|JAA13287.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
gi|410290944|gb|JAA24072.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
gi|410352807|gb|JAA43007.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
Length = 462
Score = 146 bits (368), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 121/200 (60%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY+
Sbjct: 262 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 319
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 320 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 379
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 380 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 439
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 440 LMVVTPAGHSDVKRFWKRQK 459
>gi|395831929|ref|XP_003789035.1| PREDICTED: general transcription factor IIH subunit 4 [Otolemur
garnettii]
Length = 463
Score = 146 bits (368), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 121/200 (60%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|169784197|ref|XP_001826560.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus oryzae RIB40]
gi|238508739|ref|XP_002385554.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus flavus NRRL3357]
gi|83775305|dbj|BAE65427.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220688446|gb|EED44799.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus flavus NRRL3357]
gi|391868531|gb|EIT77745.1| RNA polymerase II transcription initiation protein [Aspergillus
oryzae 3.042]
Length = 480
Score = 145 bits (367), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 87/228 (38%), Positives = 126/228 (55%), Gaps = 35/228 (15%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSM----------------SLTDSSARKEG 45
+ D AD G+V Q S F PT+LAT L+ S T S+ G
Sbjct: 258 LTDLADFGIV--YQDSPDASHFYPTRLATTLTSDSSALSNPISGSLSGPSGTGSNKAGSG 315
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
F++VETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GITA+QII
Sbjct: 316 FIIVETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAIEMGITADQII 375
Query: 106 SFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEF 152
S+L +AHP++ A PSV P V DQIRLW+ + +RV+ TP +++F
Sbjct: 376 SYLATHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVTDQIRLWQLERDRVKATPGFLFKDF 435
Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 199
S +EA C YA + L+W+ D K+M V H + FLR +
Sbjct: 436 VSLAEYEAPCRYAEEIGVLVWKSDRKRMFFVTR---HEQVAAFLRSRK 480
>gi|62896773|dbj|BAD96327.1| general transcription factor IIH, polypeptide 4, 52kDa variant
[Homo sapiens]
Length = 462
Score = 145 bits (367), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 120/200 (60%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY+
Sbjct: 262 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 319
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 320 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 379
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 380 TPVLPPTITDQIRLWELGRDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 439
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 440 LMVVTPAGHSDVKRFWKRQK 459
>gi|170039509|ref|XP_001847575.1| TFIIH basal transcription factor complex p52 subunit [Culex
quinquefasciatus]
gi|167863052|gb|EDS26435.1| TFIIH basal transcription factor complex p52 subunit [Culex
quinquefasciatus]
Length = 492
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/212 (36%), Positives = 127/212 (59%), Gaps = 15/212 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLS-------MSLTDSSARKEGFVVVETNFR 54
++ + GLV Q RKE F PT+LA N++ + S+A+ +G++VVETN+R
Sbjct: 280 LQHLREFGLV--YQRKRKEGRFYPTRLAHNITSRNAVPTIQEDGSAAQDKGYIVVETNYR 337
Query: 55 MYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 114
+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ AF GITA+QIIS+L+Q+AHP
Sbjct: 338 VYAYTDSNLQVALLGLFTELLYRFPNLVVGVLTRDSVRQAFRGGITADQIISYLEQHAHP 397
Query: 115 ------RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDR 168
+ + ++P V DQI+LWE++ NR T Y +F S+ F DYA+
Sbjct: 398 TMHNVEQTINTKSALPPTVVDQIKLWENERNRFTYTEGVVYNQFLSQGDFNTLRDYAQSI 457
Query: 169 SGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
++W++ + +VV H +++F + +K
Sbjct: 458 GVMIWQNERTRTMVVTKNGHDDVKKFWKRYSK 489
>gi|6754094|ref|NP_034494.1| general transcription factor IIH subunit 4 [Mus musculus]
gi|18202124|sp|O70422.1|TF2H4_MOUSE RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=Basic transcription factor 2 52 kDa subunit;
Short=BTF2 p52; AltName: Full=General transcription
factor IIH polypeptide 4; AltName: Full=TFIIH basal
transcription factor complex p52 subunit
gi|2997755|gb|AAC08594.1| TFIIH transcription/DNA repair factor p52 subunit [Mus musculus]
gi|15215027|gb|AAH12638.1| Gtf2h4 protein [Mus musculus]
gi|26348003|dbj|BAC37650.1| unnamed protein product [Mus musculus]
gi|74194835|dbj|BAE26009.1| unnamed protein product [Mus musculus]
gi|74222199|dbj|BAE26910.1| unnamed protein product [Mus musculus]
Length = 463
Score = 145 bits (365), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 121/200 (60%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 NPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|392564310|gb|EIW57488.1| transcription factor Tfb2 [Trametes versicolor FP-101664 SS1]
Length = 477
Score = 145 bits (365), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 74/203 (36%), Positives = 121/203 (59%), Gaps = 10/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL--------TDSSARKEGFVVVETN 52
M+ D D GL+K + + F PT+LAT L+ S + + +GF+V+ETN
Sbjct: 268 MLDDLRDYGLIKQRTPTSRR--FSPTRLATTLTSSSPPLPTSAGSGDGSHAQGFIVLETN 325
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ + L +L LF ++ + PNL++GAIT++S+ A +GITA+QIIS+L +A
Sbjct: 326 YRLYAYTDNPLQIAVLNLFVTLKSRFPNLVIGAITRDSVKKALASGITADQIISYLVTHA 385
Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
HP++ P +P V DQIRLWE + NR++ + Y F S+ +E +YA+ +L
Sbjct: 386 HPQMRKNQPLLPVTVQDQIRLWELEKNRMKSQEGYLYTAFASQADYEYVLNYAKQLDVVL 445
Query: 173 WEDSKKMRLVVNAEIHMHMREFL 195
WE++ + + E H ++R F+
Sbjct: 446 WENASRRCFFGSVEGHANIRGFI 468
>gi|432883135|ref|XP_004074222.1| PREDICTED: general transcription factor IIH subunit 4-like [Oryzias
latipes]
Length = 460
Score = 144 bits (364), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 122/206 (59%), Gaps = 11/206 (5%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK---------EGFVVVETN 52
++ + GLV Q RK + PT+LA L+ ++ +S GF+VVETN
Sbjct: 253 LQHLREFGLVF--QRKRKSRRYYPTRLAITLAAGVSSNSPSNMTNTPGTGDAGFIVVETN 310
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A NGITA+QII FL+ A
Sbjct: 311 YRIYAYTNSELQIALVALFSEMLYRFPNVVVAHLTRESVQQAIANGITAQQIIHFLRTRA 370
Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
HP + + P++P + DQIRLWE + +R++ T Y +F S+ FE D A+ L+
Sbjct: 371 HPVMLKQSPALPPTITDQIRLWELERDRLQFTEGVLYNQFLSQADFEVLRDRAQGLGCLV 430
Query: 173 WEDSKKMRLVVNAEIHMHMREFLRGQ 198
W+D+ +VV + H +++F + Q
Sbjct: 431 WQDASHRAMVVTPQGHSEVKKFWKRQ 456
>gi|119179887|ref|XP_001241468.1| hypothetical protein CIMG_08631 [Coccidioides immitis RS]
gi|392866653|gb|EAS30172.2| transcription factor Tfb2 [Coccidioides immitis RS]
Length = 487
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/229 (34%), Positives = 125/229 (54%), Gaps = 34/229 (14%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTDS----------------SAR 42
+ D D G+V + + F PT+LAT L SM+L+ S A
Sbjct: 257 LADLTDFGIVYQHSPASESTRFYPTRLATTLTSDSMALSSSISGNLAPAGPNINAATGAP 316
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
GF+++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G +T++S+ A E GITA+
Sbjct: 317 GTGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKLTRQSIRRAVEMGITAD 376
Query: 103 QIISFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYY 149
QIIS+L +AHP++ A + SV P V DQIRLW+ + +R++ TP +
Sbjct: 377 QIISYLTTHAHPQMRKVNASKSTSTTAGLLASVLPPTVVDQIRLWQLERDRLKATPGFLF 436
Query: 150 EEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
++F S FEA C YA + L+W+ K+ V H + FLR +
Sbjct: 437 KDFASSAEFEAPCRYAEEIGVLVWKSEKRRMFFVTR--HEQVAAFLRSR 483
>gi|409048320|gb|EKM57798.1| hypothetical protein PHACADRAFT_171015 [Phanerochaete carnosa
HHB-10118-sp]
Length = 472
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 78/207 (37%), Positives = 121/207 (58%), Gaps = 14/207 (6%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL--------TDSSARKEGFVVVETN 52
M++D D G++ ++ K F PT+LAT L+ S +SA +GF+++ETN
Sbjct: 260 MLEDLRDYGIIWQRRATSKR--FCPTRLATTLTSSSPPLPAAGGVSASAHGQGFIILETN 317
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE----QIISFL 108
+R+YAY+ + L +L LF ++ + PNL+VGAIT+ES+ A NGITA+ QIIS+L
Sbjct: 318 YRLYAYTDNPLQIAVLNLFVTLKSRYPNLVVGAITRESVKKALTNGITADQASRQIISYL 377
Query: 109 QQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDR 168
+AHP++ P +P V DQIRLWE + NR++ + Y F S+ +E +YA+
Sbjct: 378 TTHAHPQMRKNKPLLPVTVQDQIRLWELEKNRMKSQEGYLYTAFASQADYEYVLNYAKQL 437
Query: 169 SGLLWEDSKKMRLVVNAEIHMHMREFL 195
+LWE S + E H ++R F+
Sbjct: 438 GVVLWESSGRRCFFGTLEGHPNIRGFI 464
>gi|358057347|dbj|GAA96696.1| hypothetical protein E5Q_03367 [Mixia osmundae IAM 14324]
Length = 467
Score = 144 bits (363), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 120/202 (59%), Gaps = 10/202 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS----LTDSSARKE-GFVVVETNFRM 55
++K F LGLV + + G + PT+LA L+ L +A +E GF+++ETN+R+
Sbjct: 263 ILKTFGGLGLVYVYKSGD----YSPTRLAVTLTSGAPPLLKAGTAEEEQGFLILETNYRV 318
Query: 56 YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
YAY+ + L +L LF ++ + P L+VG IT+ES+ NGI ++QII+FL +AHP+
Sbjct: 319 YAYTQNPLQIAVLNLFVALKSRFPGLVVGMITRESIKAGLANGIKSDQIIAFLTAHAHPQ 378
Query: 116 VADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW-E 174
+ + P +P V DQI+LWE + NRV+ P Y++F S+ ++ CDYA+ +LW
Sbjct: 379 MRKQEPLLPPTVVDQIKLWEREKNRVKTEPCFLYDDFRSQADYDLVCDYAKQIGAVLWLG 438
Query: 175 DSKKMRLVVNAEIHMHMREFLR 196
+ R + H+ +R F++
Sbjct: 439 EPGSRRFATTEDGHVQVRGFIQ 460
>gi|195999048|ref|XP_002109392.1| hypothetical protein TRIADDRAFT_20428 [Trichoplax adhaerens]
gi|190587516|gb|EDV27558.1| hypothetical protein TRIADDRAFT_20428 [Trichoplax adhaerens]
Length = 452
Score = 143 bits (361), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 116/200 (58%), Gaps = 7/200 (3%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSS---ARKEGFVVVETNFRMYAY 58
++ ++GLV Q RK+ F PT+LA NL+ ++ G+++VETN+R+YAY
Sbjct: 249 LQHMREIGLVV--QRKRKDMKFYPTQLAINLASGAKQEELDHSKSSGYIIVETNYRLYAY 306
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S L ++ LF ++ Y+LP L VG IT+ES+ AF NGITA +II+F++ +AHP
Sbjct: 307 TESPLDIALVALFCEMMYRLPGLCVGLITRESVQQAFTNGITANKIINFIRTHAHPEAKK 366
Query: 119 RI--PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
++ P VP + DQ+ LWE + R+ + Y + S FEA YA D LLW
Sbjct: 367 KVTTPIVPSTIIDQLYLWELERGRLSCSDGVLYNQILSSSDFEALRKYADDMGVLLWASP 426
Query: 177 KKMRLVVNAEIHMHMREFLR 196
K LVVN + H H++ F +
Sbjct: 427 AKRLLVVNRDGHNHVKHFWK 446
>gi|157113167|ref|XP_001651923.1| TFIIH basal transcription factor complex p52 subunit [Aedes
aegypti]
gi|108877858|gb|EAT42083.1| AAEL006356-PA [Aedes aegypti]
Length = 487
Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 77/212 (36%), Positives = 124/212 (58%), Gaps = 15/212 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLS-------MSLTDSSARKEGFVVVETNFR 54
++ + GLV Q RKE F PT+LA N++ + SS + +G+++VETN+R
Sbjct: 275 LQHLREFGLV--YQRKRKEGRFYPTRLAHNITSKNAVQTIQEDGSSVQDKGYIIVETNYR 332
Query: 55 MYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 114
+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ AF GITAEQIIS+L+Q+AHP
Sbjct: 333 VYAYTDSNLQVALLGLFTELLYRFPNLVVGVLTRDSVRQAFRGGITAEQIISYLEQHAHP 392
Query: 115 RVADRIPSV------PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDR 168
+ + ++ P V DQI+LWE++ NR T Y +F S+ F DYA+
Sbjct: 393 TMLNMEQAINSKSPLPPTVVDQIKLWENERNRFTYTEGVVYNQFLSQGDFNTLRDYAQSI 452
Query: 169 SGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+ W++ + +VV H ++ F + +K
Sbjct: 453 GVMTWQNERTRTMVVTKNGHDDVKRFWKRYSK 484
>gi|328863850|gb|EGG12949.1| hypothetical protein MELLADRAFT_46475 [Melampsora larici-populina
98AG31]
Length = 454
Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 75/204 (36%), Positives = 121/204 (59%), Gaps = 12/204 (5%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT--------DSSARKEGFVVVETN 52
++ D D GLV L Q F PT+L T L+ + SS+ F+++ETN
Sbjct: 247 VLNDLVDYGLVYLTQSKT----FYPTRLVTTLTSTAPPIVSNPSDQSSSDPNEFLILETN 302
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+++ L IL LF + + PNL++G+IT+ES+ AF+NGI A+QIIS+L+ ++
Sbjct: 303 YRIYAYTSNPLQIAILNLFISFKSRFPNLVIGSITRESIKMAFKNGIRADQIISYLEFHS 362
Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
H ++ P +P V DQIRLWE + NRV + YE+F S +E +Y++ +L
Sbjct: 363 HSQMKLLKPILPNTVVDQIRLWELEKNRVRDQEGYLYEDFKSVSDYEIVINYSKQIDIIL 422
Query: 173 WEDSKKMRLVVNAEIHMHMREFLR 196
WE+ + + V+ + H +REF++
Sbjct: 423 WENPELKKFFVSLDGHTALREFIK 446
>gi|268534706|ref|XP_002632484.1| Hypothetical protein CBG13719 [Caenorhabditis briggsae]
Length = 482
Score = 143 bits (360), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 79/203 (38%), Positives = 126/203 (62%), Gaps = 9/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWF----IPTKLATNLSMSLTDSSARKEGFVVVETNFRMY 56
++ +LG++ +++ RK+ F + T LATN ++ T S G V+VETNFR+Y
Sbjct: 265 LLNHLRELGVIFIRK--RKDGVFFLTHLLTHLATNETIDDTSSEKASNGKVIVETNFRVY 322
Query: 57 AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
AY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GITA QIISFL+ NAHP+
Sbjct: 323 AYTSSLLQLAIIALFTEMTYRFTDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 382
Query: 116 VADRIP--SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
VA P +P V DQIRLWE + R+++ A+ Y F S D + YA++R LLW
Sbjct: 383 VATSGPVNCLPITVADQIRLWEDERRRMDLKDAYIYSHFESDDEYHGVVRYAQERGILLW 442
Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
+ ++ ++VN E H +R++ +
Sbjct: 443 ANPQQKLVIVNEEGHEAVRQWYK 465
>gi|302684027|ref|XP_003031694.1| hypothetical protein SCHCODRAFT_15746 [Schizophyllum commune H4-8]
gi|300105387|gb|EFI96791.1| hypothetical protein SCHCODRAFT_15746 [Schizophyllum commune H4-8]
Length = 476
Score = 143 bits (360), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 77/192 (40%), Positives = 115/192 (59%), Gaps = 14/192 (7%)
Query: 18 RKESW--FIPTKLATNLSMSLTD-------SSARKEGFVVVETNFRMYAYSTSKLHCEIL 68
RK+++ F PT+LAT L+ S +S KEGF+V+ETN+R+YAY+ + L +L
Sbjct: 278 RKKNYPRFSPTRLATTLTSSAPPLPTSSGPTSGPKEGFIVLETNYRVYAYTDNALQTAVL 337
Query: 69 RLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI-----PSV 123
LF +Y+ PNL+VG+IT+ES+ A NGITA+QIIS+L +AHP++ + P +
Sbjct: 338 NLFITPKYRFPNLVVGSITRESVKRAMSNGITADQIISYLITHAHPQMHKNVCLVANPLL 397
Query: 124 PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVV 183
P V DQIRLWE + NR + Y EF S+ +E +YA+ +LWE+ K
Sbjct: 398 PVTVQDQIRLWELERNRTKSQEGFLYTEFASQADYEYVLNYAKQLGVVLWENHAKRCFFA 457
Query: 184 NAEIHMHMREFL 195
E H +++ F+
Sbjct: 458 AQEGHANIKSFI 469
>gi|19113355|ref|NP_596563.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces pombe 972h-]
gi|62901133|sp|P87303.2|TFB2_SCHPO RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|3080538|emb|CAA18666.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces pombe]
Length = 447
Score = 143 bits (360), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 118/202 (58%), Gaps = 8/202 (3%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK------EGFVVVETNFR 54
M++D + GLV +++ K F PT+LAT L+ ++ +GF++VETN+R
Sbjct: 243 MLEDLREYGLVYQRKITSKR--FYPTRLATGLTTDYRSLHGKQSENDDDKGFIIVETNYR 300
Query: 55 MYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 114
+YAY++S L I+ LF+ + + NL+VG IT++S+ A NGI AEQII++L +AHP
Sbjct: 301 LYAYTSSPLQIAIIGLFANLRARFSNLVVGVITRDSIRRALMNGIAAEQIITYLTTHAHP 360
Query: 115 RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWE 174
++ +P +P + DQI LWE + NR+ TP + +F + F+ A +YA++ L+W+
Sbjct: 361 QMRSNVPLLPPTLVDQIYLWELEKNRLRATPGILFRDFLTDSGFDQAVEYAKELGVLVWD 420
Query: 175 DSKKMRLVVNAEIHMHMREFLR 196
S K + M +L+
Sbjct: 421 SSLKRMFFITTTGAQPMIAYLK 442
>gi|156378083|ref|XP_001630974.1| predicted protein [Nematostella vectensis]
gi|156218005|gb|EDO38911.1| predicted protein [Nematostella vectensis]
Length = 471
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 77/204 (37%), Positives = 122/204 (59%), Gaps = 8/204 (3%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLS-----MSLTDSSARKEGFVVVETNFRMY 56
++ ++GLV Q RK + PTKL+ NL+ ++ TDS + GF+++ETN+R+Y
Sbjct: 269 LQQLREIGLVF--QRKRKSRRYYPTKLSVNLTAAGKGINTTDSQI-EAGFIIIETNYRVY 325
Query: 57 AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV 116
AY+ S L ++ LF +I + PNL V ++T+ES A +GI+AEQI++FLQ AHP +
Sbjct: 326 AYTESCLQVSLIGLFCEILCRFPNLCVASLTRESCQQALASGISAEQILNFLQTRAHPEM 385
Query: 117 ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
R P +P + DQ+RLWE + +R++ T Y +F S+ FE YA D L+W +S
Sbjct: 386 LKRTPIIPSTISDQVRLWEMERSRMKFTEGVLYNQFLSQADFEMLRKYAEDLGVLIWANS 445
Query: 177 KKMRLVVNAEIHMHMREFLRGQNK 200
K +VV+ H ++ F + Q +
Sbjct: 446 TKRVVVVSRSGHDDVKRFWKRQRQ 469
>gi|303321033|ref|XP_003070511.1| Transcription factor tfb2 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240110207|gb|EER28366.1| Transcription factor tfb2 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320036051|gb|EFW17991.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Coccidioides posadasii str. Silveira]
Length = 487
Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 77/229 (33%), Positives = 125/229 (54%), Gaps = 34/229 (14%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTD----------------SSAR 42
+ D D G+V + + F PT+LAT L SM+L+ + A
Sbjct: 257 LADLTDFGIVYQHSPASESTRFYPTRLATTLTSDSMALSSPISGNLAPAGPNINAATGAP 316
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
GF+++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G +T++S+ A E GITA+
Sbjct: 317 GTGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKLTRQSIRRAVEMGITAD 376
Query: 103 QIISFLQQNAHPRV-----------ADRIPS--VPENVCDQIRLWESDLNRVEMTPAHYY 149
QIIS+L +AHP++ +P+ +P V DQIRLW+ + +R++ TP +
Sbjct: 377 QIISYLTTHAHPQMRKVNASKSTSTTAGLPASVLPPTVVDQIRLWQLERDRLKATPGFLF 436
Query: 150 EEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
++F S FEA C YA + L+W+ K+ V H + FLR +
Sbjct: 437 KDFASSAEFEAPCRYAEEIGVLVWKSEKRRMFFVTR--HEQVAAFLRSR 483
>gi|444727354|gb|ELW67853.1| Valyl-tRNA synthetase, mitochondrial [Tupaia chinensis]
Length = 1382
Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 76/189 (40%), Positives = 117/189 (61%), Gaps = 4/189 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLS--MSLTDSSARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS +S +A + GF+VVETN+R+YAY+
Sbjct: 220 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 277
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 278 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 337
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 338 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 397
Query: 180 RLVVNAEIH 188
+VV H
Sbjct: 398 LMVVTPAGH 406
>gi|121714341|ref|XP_001274781.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus clavatus NRRL 1]
gi|119402935|gb|EAW13355.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus clavatus NRRL 1]
Length = 478
Score = 142 bits (358), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 85/231 (36%), Positives = 128/231 (55%), Gaps = 41/231 (17%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA-------------------R 42
+ D AD G+V Q + + F PT+LAT L+ +DSSA
Sbjct: 256 LTDLADFGIV--YQESPEATHFYPTRLATTLT---SDSSALSNPIAGSLSGPTGTASNKA 310
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
GF+++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GITA+
Sbjct: 311 GSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITAD 370
Query: 103 QIISFLQQNAHPRV----ADRIPS---------VPENVCDQIRLWESDLNRVEMTPAHYY 149
QIIS+L +AHP++ A R S +P V DQIRLW+ + +RV+ TP +
Sbjct: 371 QIISYLSTHAHPQMRKHNASRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVKATPGFLF 430
Query: 150 EEFPSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 199
++F S +EA C YA + L+W+ D K+M V H + FLR +
Sbjct: 431 KDFVSLAEYEAPCRYAEEIGVLVWKSDRKRMFFVTR---HEQVAAFLRSRK 478
>gi|330918142|ref|XP_003298106.1| hypothetical protein PTT_08708 [Pyrenophora teres f. teres 0-1]
gi|311328891|gb|EFQ93800.1| hypothetical protein PTT_08708 [Pyrenophora teres f. teres 0-1]
Length = 482
Score = 142 bits (358), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 113/194 (58%), Gaps = 21/194 (10%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS------MSLTD-------------SSA 41
M+ D + +GL+ + + F PT+LAT L+ MS + S+A
Sbjct: 260 MLDDLSSMGLI--YRSDKNARTFYPTRLATTLTSDSGSAMSASSNDIAQAGQGNAGPSAA 317
Query: 42 RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
+GF+++ETN+R+YAY+ S + IL LF+K++++ PNL+ G +TKES++ A ++GIT+
Sbjct: 318 ANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQSGITS 377
Query: 102 EQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
QIIS+L AHP++ +P +P V DQIRLWE + RVE TP + EF S +
Sbjct: 378 AQIISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTPGYLMREFSSDTEYRDV 437
Query: 162 CDYARDRSGLLWED 175
YA L+W++
Sbjct: 438 MGYASALGVLVWQN 451
>gi|388581217|gb|EIM21527.1| transcription factor Tfb2 [Wallemia sebi CBS 633.66]
Length = 449
Score = 142 bits (358), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 78/207 (37%), Positives = 119/207 (57%), Gaps = 13/207 (6%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS-----------LTDSSARKEGFVVV 49
M+ D D GLV ++ K F PT+LAT L+ S +S+ + F+++
Sbjct: 226 MLSDLRDYGLVWQRKTSSKR--FYPTRLATTLTSSAPPLLPTTESSSFTTSSDNKRFIIL 283
Query: 50 ETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQ 109
ETN+R+YAY+++ L IL LF ++ + PNL++G IT++S+ +A NGITAEQII +L
Sbjct: 284 ETNYRLYAYTSNPLQISILNLFVTLKARYPNLVIGVITRDSIRSALSNGITAEQIIGYLT 343
Query: 110 QNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRS 169
+AH ++ P +P V DQIRLWE + NR++ Y EF S+ +E +YA+
Sbjct: 344 SHAHTQMHRNNPLLPVTVSDQIRLWELEKNRLKADDGVLYAEFRSQPDYEILLNYAKSYD 403
Query: 170 GLLWEDSKKMRLVVNAEIHMHMREFLR 196
+LW + K V E H +REF+R
Sbjct: 404 CVLWSNDIKRMFFVTLEGHQIVREFVR 430
>gi|195126587|ref|XP_002007752.1| GI12214 [Drosophila mojavensis]
gi|193919361|gb|EDW18228.1| GI12214 [Drosophila mojavensis]
Length = 492
Score = 142 bits (357), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 79/213 (37%), Positives = 127/213 (59%), Gaps = 16/213 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL--------SMSLTDSSARKEGFVVVETNF 53
++ + GLV Q RKE F PT+LA N+ S+S+ + ++ G++VVETN+
Sbjct: 279 LQHLREFGLVF--QRKRKEHRFYPTRLALNVTNKEAAEASISVDEERMQERGYIVVETNY 336
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY+ S+L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+L+Q AH
Sbjct: 337 RVYAYTDSQLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAH 396
Query: 114 PR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
P V I S +P V DQI+LWE + NR T Y +F S++ F DYA+
Sbjct: 397 PNMKLVESAIQSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQS 456
Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
++ L+W++ + +VV H ++ + + +K
Sbjct: 457 QNVLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 489
>gi|427785675|gb|JAA58289.1| Putative rna polymer [Rhipicephalus pulchellus]
Length = 457
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 71/197 (36%), Positives = 118/197 (59%), Gaps = 4/197 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR--KEGFVVVETNFRMYAYS 59
++ + GLV Q R+ F PT+LA NL+ L +++ R + G+++VETN+R+YAY+
Sbjct: 257 LQHLREFGLV--YQRKRRSGRFYPTRLAINLASGLKETTLRSFEAGYIMVETNYRVYAYT 314
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L +L LF ++ Y+ PNL+V +T+ES+ A +GIT+ QII FL+ AHP +
Sbjct: 315 NSQLQVALLALFCELLYRFPNLVVARLTRESVRQALRSGITSNQIIKFLRMYAHPEALKQ 374
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQ+RLWE + +R Y +F S+ F+ +YA + L+W++ K
Sbjct: 375 TPVIPATIMDQLRLWEMERDRFVFREGVLYSQFISQSDFQLLRNYASELGVLIWDNPSKR 434
Query: 180 RLVVNAEIHMHMREFLR 196
+VVN H +++F +
Sbjct: 435 VMVVNRNGHDEVKKFWK 451
>gi|195162678|ref|XP_002022181.1| GL25529 [Drosophila persimilis]
gi|198464237|ref|XP_001353141.2| GA20571 [Drosophila pseudoobscura pseudoobscura]
gi|194104142|gb|EDW26185.1| GL25529 [Drosophila persimilis]
gi|198149631|gb|EAL30642.2| GA20571 [Drosophila pseudoobscura pseudoobscura]
Length = 494
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 81/213 (38%), Positives = 126/213 (59%), Gaps = 16/213 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLS-------MSLTDSSARKE-GFVVVETNF 53
++ + GLV Q RKE F PT+LA N++ M+++D A ++ G++VVETN+
Sbjct: 281 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAATTMTVSDEEAMQDSGYIVVETNY 338
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+LQQ AH
Sbjct: 339 RVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLQQYAH 398
Query: 114 PR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
P V I S +P V DQI+LWE + NR T Y +F S++ F DYA+
Sbjct: 399 PNMKLVESAIQSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQS 458
Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
L+W++ + +VV H ++ + + +K
Sbjct: 459 IHVLVWQNERTRTMVVQKHGHDDVKRYWKKYSK 491
>gi|348542188|ref|XP_003458568.1| PREDICTED: general transcription factor IIH subunit 4 [Oreochromis
niloticus]
Length = 461
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/207 (36%), Positives = 118/207 (57%), Gaps = 12/207 (5%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS----------SARKEGFVVVET 51
++ + GLV Q RK + PT+LA L+ +T S GF+VVET
Sbjct: 253 LQHLREFGLVF--QRKRKSRRYYPTRLAITLAAGVTSSSSVSNLSSSPGTGDAGFIVVET 310
Query: 52 NFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQN 111
N+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A NGITA+QII FL+
Sbjct: 311 NYRIYAYTNSELQIALVALFSEMLYRFPNVVVAQLTRESVQQAIANGITAQQIIHFLRTR 370
Query: 112 AHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGL 171
AHP + + P +P + DQIRLWE + +R++ T Y +F S+ FE D A+ L
Sbjct: 371 AHPVMLTQTPVLPPTITDQIRLWELERDRLQFTEGVLYNQFLSQADFEVLRDRAQGLGCL 430
Query: 172 LWEDSKKMRLVVNAEIHMHMREFLRGQ 198
+W+D +VV + H ++ F + Q
Sbjct: 431 VWQDVAHRVMVVTPQGHSEVKRFWKRQ 457
>gi|443688956|gb|ELT91478.1| hypothetical protein CAPTEDRAFT_18660 [Capitella teleta]
Length = 463
Score = 141 bits (355), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/198 (38%), Positives = 113/198 (57%), Gaps = 5/198 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTDSSARKEGFVVVETNFRMYAY 58
++ + GLV Q R F PT+LA NL S +A ++GF+VVETN+R+YAY
Sbjct: 263 LQHLREFGLV--YQRKRSSQRFYPTRLAVNLATGSKGGDSETASEDGFIVVETNYRVYAY 320
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S L ++ LF + Y+ PN+ VG ++++S+ A GITAEQII+FL+ +AHP
Sbjct: 321 TNSSLKVALISLFCSMMYRFPNMAVGVVSRDSVREALSRGITAEQIINFLRNHAHPHTQK 380
Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
R P +P V DQIRLWE + +R Y +F S+ FE DYA+D L +E+ +
Sbjct: 381 RKPILPPTVSDQIRLWELERDRFHFNEGVLYNQFLSQTDFEVLRDYAKDLGVLSYENIPR 440
Query: 179 MRLVVNAEIHMHMREFLR 196
+VV H ++ F +
Sbjct: 441 RLMVVTRAGHDDVKRFWK 458
>gi|350413829|ref|XP_003490125.1| PREDICTED: general transcription factor IIH subunit 4-like [Bombus
impatiens]
Length = 467
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/204 (36%), Positives = 122/204 (59%), Gaps = 6/204 (2%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYA 57
++ + GLV Q RK F PT+LA N++ +R KEG++VVETN+R+YA
Sbjct: 263 FLQHLREFGLV--YQRKRKAGRFYPTRLALNIATGQNKPLSRDPEKEGYIVVETNYRVYA 320
Query: 58 YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
Y+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GITA QI+ +LQQ+AH ++
Sbjct: 321 YTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITAAQIVGYLQQHAHGKMI 380
Query: 118 DRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
+ P V P + DQI+LWE++ NR + Y +F S+ FE D+A L+W++
Sbjct: 381 EAGPPVLPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEVLRDHALSTGVLIWQNE 440
Query: 177 KKMRLVVNAEIHMHMREFLRGQNK 200
+K +VV H +++F + +K
Sbjct: 441 RKRTIVVTKAGHDDVKKFWKRYSK 464
>gi|449540295|gb|EMD31288.1| hypothetical protein CERSUDRAFT_119839 [Ceriporiopsis subvermispora
B]
Length = 412
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/203 (37%), Positives = 119/203 (58%), Gaps = 10/203 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS--------LTDSSARKEGFVVVETN 52
M++D D GL L Q F PT+LAT L+ S + ++ +GF+V+ETN
Sbjct: 204 MLEDLRDYGL--LWQRSATSQRFSPTRLATTLTASSNPLPTSSSASADSQSQGFIVLETN 261
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ + L +L LF ++ + PNL++GA+T+ES+ A NGITA+QIIS+L +A
Sbjct: 262 YRVYAYTDNPLQIAVLNLFVTMKSRFPNLVIGAVTRESVKKALANGITADQIISYLTAHA 321
Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
HP++ P +P V DQIRLWE + NRV+ + Y F S+ +E +YA+ ++
Sbjct: 322 HPQMRRYKPLLPVTVQDQIRLWELEKNRVKSQEGYLYTAFASQADYEYVLNYAKQLDVVI 381
Query: 173 WEDSKKMRLVVNAEIHMHMREFL 195
WE + + E H ++R F+
Sbjct: 382 WESPTRRCFFGSLEGHSNIRGFI 404
>gi|67541703|ref|XP_664619.1| hypothetical protein AN7015.2 [Aspergillus nidulans FGSC A4]
gi|40742471|gb|EAA61661.1| hypothetical protein AN7015.2 [Aspergillus nidulans FGSC A4]
gi|259483674|tpe|CBF79257.1| TPA: TFIIH and nucleotide excision repair factor 3 complexes
subunit (Tfb2), putative (AFU_orthologue; AFUA_4G04360)
[Aspergillus nidulans FGSC A4]
Length = 482
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 84/228 (36%), Positives = 128/228 (56%), Gaps = 35/228 (15%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLT-------------DSSARKEG 45
+ D AD G+V Q + S F PT+LAT L S +L+ DS+ G
Sbjct: 260 LTDLADFGIV--YQDSPEASHFYPTRLATTLTSDSSALSNPISGALSDPDGGDSNQPGSG 317
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
F+++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G +T++S+ A E GITA+QII
Sbjct: 318 FIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKVTRQSIRRAIEMGITADQII 377
Query: 106 SFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEF 152
S+L +AHP++ A PSV P V DQIRLW+ + +RV+ T +++F
Sbjct: 378 SYLATHAHPQMRKHNVARSTSNQAGMPPSVLPPTVVDQIRLWQLERDRVKATAGFLFKDF 437
Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 199
S +EA C YA + L+W+ D K+M V H + FLR +
Sbjct: 438 VSLAEYEAPCRYAEEIGVLIWKSDRKRMFFVTR---HEQVAAFLRSRK 482
>gi|261187837|ref|XP_002620336.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Ajellomyces dermatitidis SLH14081]
gi|239593453|gb|EEQ76034.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Ajellomyces dermatitidis SLH14081]
gi|239608433|gb|EEQ85420.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Ajellomyces dermatitidis ER-3]
gi|327356043|gb|EGE84900.1| RNA pol II transcription initiation subunit [Ajellomyces
dermatitidis ATCC 18188]
Length = 485
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 79/228 (34%), Positives = 123/228 (53%), Gaps = 33/228 (14%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL-----------SMSLTDSSARK------- 43
+ D D G+V ++ + F PT+LAT L S SLT S
Sbjct: 256 LADLTDFGIVYQHPPHQEATRFYPTRLATTLTSDSITLGSSISSSLTAPSGGPAAVSNEP 315
Query: 44 -EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
GF+++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G +T++S+ A E GITA+
Sbjct: 316 GTGFIIIETNYRLYAYTSSPLQISLISLFTTLKYRFPNLITGKLTRQSIRRAVEMGITAD 375
Query: 103 QIISFLQQNAHPRV-----------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYE 150
QIIS+L +AHP++ +P+V P V DQIRLW+ + +R++ TP ++
Sbjct: 376 QIISYLTTHAHPQMRKYHASKSGANPAGVPAVLPPTVVDQIRLWQLERDRIKATPGFLFK 435
Query: 151 EFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
+F S FE C YA + LLW+ K+ V H + +LR +
Sbjct: 436 DFVSLAEFEGPCRYAEEIGVLLWKSEKRRMFFVTR--HEQVAAYLRSR 481
>gi|383852611|ref|XP_003701820.1| PREDICTED: general transcription factor IIH subunit 4-like
[Megachile rotundata]
Length = 467
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/203 (36%), Positives = 121/203 (59%), Gaps = 6/203 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAY 58
++ + GLV Q RK F PT+LA N++ +R KEG++VVETN+R+YAY
Sbjct: 264 LQHLREFGLV--YQRKRKAGRFYPTRLALNIATGQNKPLSRDPDKEGYIVVETNYRVYAY 321
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GITA QI+ +LQQ+AH ++ +
Sbjct: 322 TNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITASQIVGYLQQHAHSKMIE 381
Query: 119 RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
P V P + DQI+LWE++ NR + Y +F S+ FE D+A L+W+ +
Sbjct: 382 AGPPVLPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEVLRDHALSTGVLIWQSER 441
Query: 178 KMRLVVNAEIHMHMREFLRGQNK 200
K +VV H +++F + +K
Sbjct: 442 KRTMVVTKAGHDDVKKFWKRYSK 464
>gi|41053973|ref|NP_956221.1| general transcription factor IIH subunit 4 [Danio rerio]
gi|39794736|gb|AAH64301.1| General transcription factor IIH, polypeptide 4 [Danio rerio]
Length = 466
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/212 (35%), Positives = 118/212 (55%), Gaps = 16/212 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE--------------GFV 47
++ + GLV Q RK + PT+LA L+ +T + A GF+
Sbjct: 254 LQHLREFGLVF--QRKRKSRRYYPTRLAITLAAGVTANPASGSASSALGAIPGTGDTGFI 311
Query: 48 VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
VVETN+R+YAY+ S+L ++ LFS++ Y+ PNL+V +T+ES+ A NGITA+QII F
Sbjct: 312 VVETNYRIYAYTNSELQIALVALFSEMLYRFPNLVVAQVTRESVQQAISNGITAQQIIHF 371
Query: 108 LQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
L+ AHP + + P +P + DQIRLWE + +R++ T Y +F S+ FE D A+
Sbjct: 372 LRTRAHPVMLKQTPVLPPTITDQIRLWELEKDRLQFTEGVLYNQFLSQADFEVLRDRAQG 431
Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
L+W++ +VV H ++ F + Q
Sbjct: 432 LGVLVWQNPAHRVMVVTPHGHSEVKRFWKRQK 463
>gi|321470926|gb|EFX81900.1| hypothetical protein DAPPUDRAFT_210868 [Daphnia pulex]
Length = 463
Score = 140 bits (354), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 123/206 (59%), Gaps = 9/206 (4%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLS-------MSLTDSSARKEGFVVVETNFR 54
++ + GLV Q RK+ F PT+LA +++ ++ ++S++ G++VVETN+R
Sbjct: 256 LQHLREFGLV--YQRKRKDGRFYPTRLALDIAAGPKKSMLNSLNASSQTTGYIVVETNYR 313
Query: 55 MYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 114
+YAY+ S L ++ LF ++ Y+ PNL+VG IT+ES+ A + GITA+QI+SFL+Q+AH
Sbjct: 314 VYAYTDSSLQVALIALFCELIYRFPNLVVGIITRESVREALKRGITADQIVSFLRQHAHS 373
Query: 115 RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWE 174
P +P + DQI+LW + +R Y +F S+ FE +YA++R L+W
Sbjct: 374 ECYKEPPVLPSTIADQIKLWAIERDRFLYKDGVLYNQFDSQADFEILRNYAQERGLLVWH 433
Query: 175 DSKKMRLVVNAEIHMHMREFLRGQNK 200
++VV E H +R+F + ++
Sbjct: 434 TLNGRKMVVTKEGHDEVRKFWKRHSR 459
>gi|344252968|gb|EGW09072.1| General transcription factor IIH subunit 4 [Cricetulus griseus]
Length = 463
Score = 140 bits (354), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 121/200 (60%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGSVHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|410911638|ref|XP_003969297.1| PREDICTED: general transcription factor IIH subunit 4-like
[Takifugu rubripes]
Length = 465
Score = 140 bits (354), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/211 (36%), Positives = 120/211 (56%), Gaps = 15/211 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSS-------------ARKEGFVV 48
++ + GLV Q RK + PT+LA L+ + SS + GF+V
Sbjct: 254 LQHLREFGLVF--QRKRKSRRYYPTRLAITLATGDSSSSLHTPTASLASTPGSGDSGFIV 311
Query: 49 VETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 108
VETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A NGITA+QII FL
Sbjct: 312 VETNYRIYAYTNSELQIALVALFSEMLYRFPNVVVAQVTRESVQQAIANGITAQQIIHFL 371
Query: 109 QQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDR 168
+ AHP + + P +P + DQIRLWE + +R++ T Y +F S+ FE D A+
Sbjct: 372 RTRAHPVMLRQTPFLPPTITDQIRLWELERDRLQFTEGVLYNQFLSQTDFEVLRDRAKSL 431
Query: 169 SGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
L+W+D+ +VV H +++F + Q
Sbjct: 432 GCLVWQDAAHRVMVVTLWGHSEVKKFWKRQK 462
>gi|340710070|ref|XP_003393621.1| PREDICTED: general transcription factor IIH subunit 4-like [Bombus
terrestris]
Length = 467
Score = 140 bits (354), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/204 (36%), Positives = 121/204 (59%), Gaps = 6/204 (2%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYA 57
++ + GLV Q RK F PT+LA N++ +R KEG++VVETN+R+YA
Sbjct: 263 FLQHLREFGLV--YQRKRKAGRFYPTRLALNIATGQNKPLSRDPEKEGYIVVETNYRVYA 320
Query: 58 YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
Y+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GITA QI+ +LQQ+AH ++
Sbjct: 321 YTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITAAQIVGYLQQHAHGKMI 380
Query: 118 DRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
+ P V P + DQI+LWE++ NR Y +F S+ FE D+A L+W++
Sbjct: 381 EAGPPVLPPTIVDQIKLWENERNRFIFNEGVLYSQFLSQTDFEVLRDHALSTGVLIWQNE 440
Query: 177 KKMRLVVNAEIHMHMREFLRGQNK 200
+K +VV H +++F + +K
Sbjct: 441 RKRTIVVTKAGHDDVKKFWKRYSK 464
>gi|307177222|gb|EFN66420.1| General transcription factor IIH subunit 4 [Camponotus floridanus]
Length = 467
Score = 140 bits (353), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 73/204 (35%), Positives = 121/204 (59%), Gaps = 6/204 (2%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYA 57
++ + GLV Q RK F PT+LA N++ A+ KEG++VVETN+R+YA
Sbjct: 263 FLQHLREFGLV--YQRKRKAGRFYPTRLALNIATGENKPLAKDTDKEGYIVVETNYRVYA 320
Query: 58 YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
Y+ S L +L LF ++ Y+ PN++V +T++S+ A ++GITA QI+ +LQQ+ H ++
Sbjct: 321 YTNSNLQVALLGLFCEMLYRFPNVVVSILTRDSIRQALKSGITASQIVGYLQQHVHSKMI 380
Query: 118 DRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
+R P + P + DQI+LWE++ NR + Y +F S+ FE D+A L+W+
Sbjct: 381 ERGPPILPPTIVDQIKLWENERNRFLFSEGVLYSQFLSQTDFEVLRDHALSTGVLIWQSE 440
Query: 177 KKMRLVVNAEIHMHMREFLRGQNK 200
+K +VV H +++F + +K
Sbjct: 441 RKRTMVVTKTGHDDVKKFWKRYSK 464
>gi|380014187|ref|XP_003691121.1| PREDICTED: general transcription factor IIH subunit 4-like [Apis
florea]
Length = 467
Score = 140 bits (353), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 74/203 (36%), Positives = 121/203 (59%), Gaps = 6/203 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAY 58
++ + GLV Q RK F PT+LA N++ +R KEG++VVETN+R+YAY
Sbjct: 264 LQHLREFGLV--YQRKRKAGRFYPTRLALNIATGQNKPLSRDPEKEGYIVVETNYRVYAY 321
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GITA QI+ +LQQ+AH ++ +
Sbjct: 322 TNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITATQIVGYLQQHAHSKMIE 381
Query: 119 RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
P + P + DQI+LWE++ NR + Y +F S+ FE D+A L+W+ +
Sbjct: 382 AGPPILPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEVLRDHALSTGVLIWQSER 441
Query: 178 KMRLVVNAEIHMHMREFLRGQNK 200
K +VV H +++F + +K
Sbjct: 442 KRTMVVTKAGHDDVKKFWKRYSK 464
>gi|48102983|ref|XP_395476.1| PREDICTED: general transcription factor IIH subunit 4 [Apis
mellifera]
Length = 467
Score = 140 bits (353), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 74/203 (36%), Positives = 121/203 (59%), Gaps = 6/203 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAY 58
++ + GLV Q RK F PT+LA N++ +R KEG++VVETN+R+YAY
Sbjct: 264 LQHLREFGLV--YQRKRKAGRFYPTRLALNIATGQNKPLSRDPEKEGYIVVETNYRVYAY 321
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GITA QI+ +LQQ+AH ++ +
Sbjct: 322 TNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITATQIVGYLQQHAHSKMIE 381
Query: 119 RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
P + P + DQI+LWE++ NR + Y +F S+ FE D+A L+W+ +
Sbjct: 382 AGPPILPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEVLRDHALSTGVLIWQSER 441
Query: 178 KMRLVVNAEIHMHMREFLRGQNK 200
K +VV H +++F + +K
Sbjct: 442 KRTMVVTKAGHDDVKKFWKRYSK 464
>gi|307212060|gb|EFN87943.1| General transcription factor IIH subunit 4 [Harpegnathos saltator]
Length = 467
Score = 140 bits (353), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 75/204 (36%), Positives = 120/204 (58%), Gaps = 6/204 (2%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYA 57
++ + GLV Q RK F PT+LA N++ T R KEG++VVETN+R+YA
Sbjct: 263 FLQHLREFGLV--YQRKRKAGRFYPTRLALNIATGETKPLTRDTDKEGYIVVETNYRVYA 320
Query: 58 YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
Y+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GITA QI+ +LQQ+AH +
Sbjct: 321 YTNSNLQVALLGLFCEMIYRFPNLVVSILTRDSVRQALKSGITATQIVGYLQQHAHSKTI 380
Query: 118 DRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
+ P + P + DQI LWE++ NR + Y +F S+ FE D+A L+W++
Sbjct: 381 EAGPPILPPTIVDQIMLWENERNRFLFSEGVLYSQFLSQTDFEVLRDHALTTGVLIWQNE 440
Query: 177 KKMRLVVNAEIHMHMREFLRGQNK 200
+K +VV H +++F + +K
Sbjct: 441 RKRTIVVTKAGHDDIKKFWKRYSK 464
>gi|170120252|ref|XP_001891159.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164633568|gb|EDQ98188.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 475
Score = 140 bits (353), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 78/208 (37%), Positives = 126/208 (60%), Gaps = 15/208 (7%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS-------LTDSSARKEGFVVVETNF 53
M++D D GL+ ++ K F PT+LAT L+ S + + S +EGF+V+ETN+
Sbjct: 263 MLEDLVDYGLIWRRKPSSKG--FHPTRLATTLTSSSPPLPSSIGNRSGPQEGFIVLETNY 320
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY+ + L IL LF ++Y+ PNL+VGAIT++S+ A NGI+A+QIIS+L +AH
Sbjct: 321 RIYAYTDNPLQTAILNLFVSLKYRFPNLVVGAITRDSVKKALLNGISADQIISYLMSHAH 380
Query: 114 PRVADRI------PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
P++ + P +P V DQIRLWE + NR++ + Y F S+ ++ +YA++
Sbjct: 381 PQMRKNVRLRSCNPLLPVTVQDQIRLWELEKNRLKSQEGYLYTSFASQADYDFVLNYAKE 440
Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFL 195
+LWE+ K + + H ++R F+
Sbjct: 441 LGVVLWENPVKRCFFGSLDGHANIRGFI 468
>gi|189205068|ref|XP_001938869.1| RNA polymerase II transcription factor B subunit 2 [Pyrenophora
tritici-repentis Pt-1C-BFP]
gi|187985968|gb|EDU51456.1| RNA polymerase II transcription factor B subunit 2 [Pyrenophora
tritici-repentis Pt-1C-BFP]
Length = 482
Score = 140 bits (352), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 71/194 (36%), Positives = 112/194 (57%), Gaps = 21/194 (10%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS------MSLTD-------------SSA 41
M+ D + +GL+ + + F PT+LAT L+ MS + S+
Sbjct: 260 MLDDLSSMGLI--YRSDKNARTFYPTRLATTLTSDSGSAMSASSNDIAQAGQGNAGPSAT 317
Query: 42 RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
+GF+++ETN+R+YAY+ S + IL LF+K++++ PNL+ G +TKES++ A ++GIT+
Sbjct: 318 ANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQSGITS 377
Query: 102 EQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
QIIS+L AHP++ +P +P V DQIRLWE + RVE TP + EF S +
Sbjct: 378 AQIISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTPGYLMREFSSDAEYRDV 437
Query: 162 CDYARDRSGLLWED 175
YA L+W++
Sbjct: 438 MGYASALGVLVWQN 451
>gi|351713119|gb|EHB16038.1| General transcription factor IIH subunit 4 [Heterocephalus glaber]
Length = 463
Score = 140 bits (352), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 121/200 (60%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGSVHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 321 ESELQVALIALFSEMLYRFPNVVVAQVTRESVQQAIGSGITAQQIIHFLRTRAHPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|195019521|ref|XP_001984999.1| GH16811 [Drosophila grimshawi]
gi|193898481|gb|EDV97347.1| GH16811 [Drosophila grimshawi]
Length = 493
Score = 140 bits (352), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 80/213 (37%), Positives = 123/213 (57%), Gaps = 16/213 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL--------SMSLTDSSARKEGFVVVETNF 53
++ + GLV Q RKE F PT+LA N+ S S D ++ G++VVETN+
Sbjct: 280 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTNKDAAQASTSADDERMQERGYIVVETNY 337
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY+ S+L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+L+Q AH
Sbjct: 338 RVYAYTDSQLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAH 397
Query: 114 PR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
P V I S +P V DQI+LWE + NR T Y +F S+ F DYA+
Sbjct: 398 PNMKLVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSQTDFVTLRDYAQS 457
Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+ L+W++ + +VV H ++ + + +K
Sbjct: 458 INMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 490
>gi|31201525|ref|XP_309710.1| AGAP010984-PA [Anopheles gambiae str. PEST]
gi|21293295|gb|EAA05440.1| AGAP010984-PA [Anopheles gambiae str. PEST]
Length = 501
Score = 140 bits (352), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 76/216 (35%), Positives = 125/216 (57%), Gaps = 19/216 (8%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLA-----------TNLSMSLTDSSARKEGFVVVE 50
++ + GLV Q RKE F PT+LA T L+ ++ + +G+++VE
Sbjct: 285 LQHLREFGLV--YQRKRKEGRFYPTRLAHNITSKNATHATTLAQDQESNATKDKGYIIVE 342
Query: 51 TNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQ 110
TN+R+YAY+ S L +L LF+++ Y+ PNL+VG ++++S+ AF GITAEQIIS+L+Q
Sbjct: 343 TNYRVYAYTDSNLQVALLGLFTELLYRFPNLVVGVLSRDSVRQAFRGGITAEQIISYLEQ 402
Query: 111 NAHP------RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDY 164
+AHP + + S+P V DQI+LWE++ NR T Y +F S+ F DY
Sbjct: 403 HAHPTMLTVEQAINSKSSLPPTVVDQIKLWENERNRFTYTEGVVYNQFLSQADFITLRDY 462
Query: 165 ARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
A+ ++W++ + +VV H +++F + +K
Sbjct: 463 AQSIGVMIWQNERIRTMVVTKNGHDDVKKFWKRYSK 498
>gi|358370616|dbj|GAA87227.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus kawachii IFO 4308]
Length = 478
Score = 140 bits (352), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 86/229 (37%), Positives = 126/229 (55%), Gaps = 35/229 (15%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTD-------------SSARKE 44
M+ D AD G+V Q F PT+LAT L S +L++ SS
Sbjct: 255 MLTDLADFGIV--YQESPDAPRFYPTRLATTLTSDSSALSNPLLGSLSGPAGEASSKAGT 312
Query: 45 GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
GF+++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GITA+QI
Sbjct: 313 GFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQI 372
Query: 105 ISFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEE 151
IS+L +AHP++ A PSV P V DQIRLW+ + +RV+ T +++
Sbjct: 373 ISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKD 432
Query: 152 FPSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 199
F S +EA C YA + L W+ D K+M V H + FLR +
Sbjct: 433 FVSLAEYEAPCRYAEEIGVLAWKSDRKRMFFVTR---HEQVAAFLRSRK 478
>gi|242794214|ref|XP_002482325.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces stipitatus ATCC 10500]
gi|218718913|gb|EED18333.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces stipitatus ATCC 10500]
Length = 481
Score = 140 bits (352), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 79/215 (36%), Positives = 119/215 (55%), Gaps = 30/215 (13%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLS-----MSLTDSSARK-----------EG 45
+ D D G+V + F PT+LAT L+ +S T SSA G
Sbjct: 256 LTDLTDFGIVYQYPSAAEADRFYPTRLATTLTSDSSALSNTMSSALSAQTNAATGEPGSG 315
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
F+++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+ A E GITA+QII
Sbjct: 316 FIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQII 375
Query: 106 SFLQQNAHP--RVADRIPS-----------VPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
S+L +AHP R D + S +P V DQIRLW+ + +R++ TP ++EF
Sbjct: 376 SYLSTHAHPQMRKEDAVKSTSNTAGLPRSVLPPTVVDQIRLWQLERDRIKATPGFLFKEF 435
Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAE 186
S +E C YA + L+W+ D K+M V E
Sbjct: 436 ASLAEYEGPCRYAEEIGVLVWKSDRKRMFFVTRYE 470
>gi|145248676|ref|XP_001400677.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus niger CBS 513.88]
gi|134081344|emb|CAK41847.1| unnamed protein product [Aspergillus niger]
Length = 478
Score = 140 bits (352), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 86/229 (37%), Positives = 126/229 (55%), Gaps = 35/229 (15%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTD-------------SSARKE 44
M+ D AD G+V Q F PT+LAT L S +L++ SS
Sbjct: 255 MLTDLADFGIV--YQESPDAPRFYPTRLATTLTSDSSALSNPLLGSLSGPAGEASSKAGT 312
Query: 45 GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
GF+++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GITA+QI
Sbjct: 313 GFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQI 372
Query: 105 ISFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEE 151
IS+L +AHP++ A PSV P V DQIRLW+ + +RV+ T +++
Sbjct: 373 ISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKD 432
Query: 152 FPSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 199
F S +EA C YA + L W+ D K+M V H + FLR +
Sbjct: 433 FVSLAEYEAPCRYAEEIGVLAWKSDRKRMFFVTR---HEQVAAFLRSRK 478
>gi|353240543|emb|CCA72408.1| probable TFB2-TFIIH subunit (transcription/repair factor)
[Piriformospora indica DSM 11827]
Length = 462
Score = 140 bits (352), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 70/205 (34%), Positives = 119/205 (58%), Gaps = 12/205 (5%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS----------MSLTDSSARKEGFVVVE 50
M+ D D GLV + F PT+ AT L+ +++ + + + FVV+E
Sbjct: 254 MVSDLRDFGLVYFP--SDTTTSFQPTRFATTLTSYTSSFSDHDLTIENGADLSQEFVVLE 311
Query: 51 TNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQ 110
TN+ +YAY+ + L +L LF + + PN+I+G++T++S+ A NGITA+QI+S+L
Sbjct: 312 TNYHVYAYTNNPLQIAVLNLFVSFKARFPNMIMGSLTRDSVKKALVNGITADQILSYLVT 371
Query: 111 NAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSG 170
+AHP++ P +P V DQIRLWE + +RV+ + Y+EF S + +E YAR+
Sbjct: 372 HAHPQMRKNNPIIPVTVQDQIRLWELERHRVKGQDGYLYKEFASMNDYEVVVQYARELGV 431
Query: 171 LLWEDSKKMRLVVNAEIHMHMREFL 195
+LWE++ + +A +H+R ++
Sbjct: 432 VLWENASRRMFFADAAGRVHIRSYI 456
>gi|169610043|ref|XP_001798440.1| hypothetical protein SNOG_08115 [Phaeosphaeria nodorum SN15]
gi|160701967|gb|EAT84391.2| hypothetical protein SNOG_08115 [Phaeosphaeria nodorum SN15]
Length = 426
Score = 140 bits (352), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/188 (37%), Positives = 110/188 (58%), Gaps = 15/188 (7%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS-------------MSLTDSSARKEGFV 47
M+ D + +G+V + + F PT+LAT L+ DS A+ +GF+
Sbjct: 213 MLDDLSAMGIV--YRSSKDSPTFYPTRLATTLTSDSGALPGSDVGTTEKPDSKAQNKGFI 270
Query: 48 VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
+VETN+R+YAY+ S + IL LF+K++++ PNL+ G +TKES++ A + GIT+ QIIS+
Sbjct: 271 IVETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHRAVQAGITSAQIISY 330
Query: 108 LQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
L AHP++ P +P V DQIRLWE + RVE+T + EF S + YA+
Sbjct: 331 LTTYAHPQMQKSNPPLPPTVMDQIRLWEYEGERVEVTTGYLMREFGSESEYRDVMGYAQA 390
Query: 168 RSGLLWED 175
L+W++
Sbjct: 391 LGVLVWKN 398
>gi|350639203|gb|EHA27557.1| transcription factor Tfb2 [Aspergillus niger ATCC 1015]
Length = 478
Score = 139 bits (351), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 86/229 (37%), Positives = 126/229 (55%), Gaps = 35/229 (15%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTD-------------SSARKE 44
M+ D AD G+V Q F PT+LAT L S +L++ SS
Sbjct: 255 MLTDLADFGIV--YQESPDAPRFYPTRLATTLTSDSSALSNPLLGSLSGPAGEASSKAGT 312
Query: 45 GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
GF+++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GITA+QI
Sbjct: 313 GFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQI 372
Query: 105 ISFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEE 151
IS+L +AHP++ A PSV P V DQIRLW+ + +RV+ T +++
Sbjct: 373 ISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKD 432
Query: 152 FPSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 199
F S +EA C YA + L W+ D K+M V H + FLR +
Sbjct: 433 FVSLAEYEAPCRYAEEIGVLAWKSDRKRMFFVTR---HEQVAAFLRSRK 478
>gi|212535854|ref|XP_002148083.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
gi|210070482|gb|EEA24572.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
Length = 481
Score = 139 bits (351), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 76/215 (35%), Positives = 119/215 (55%), Gaps = 30/215 (13%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL-------------SMSLTDSSARKE---G 45
+ D D G++ + F PT+LAT L S+S ++A E G
Sbjct: 256 LADLTDFGIIYQHAPASEADRFYPTRLATTLTSDSSALSNTMSSSLSAQTNAATGEPGSG 315
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
F+++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+ A E GITA+QII
Sbjct: 316 FIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQII 375
Query: 106 SFLQQNAHPRV-----------ADRIPS--VPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
S+L +AHP++ IP +P V DQIRLW+ + +R++ TP ++EF
Sbjct: 376 SYLSTHAHPQMRKEGAVKSTSNTAGIPRSVLPPTVVDQIRLWQLERDRIKATPGFLFKEF 435
Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAE 186
S +E C YA + L+W+ D K+M V E
Sbjct: 436 ASLAEYEGPCRYAEEIGVLVWKSDRKRMFFVTRYE 470
>gi|255079788|ref|XP_002503474.1| predicted protein [Micromonas sp. RCC299]
gi|226518741|gb|ACO64732.1| predicted protein [Micromonas sp. RCC299]
Length = 458
Score = 139 bits (351), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 77/196 (39%), Positives = 117/196 (59%), Gaps = 5/196 (2%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE--GFVVVETNFRMYAY 58
+ +D A LGL+ L KE +++PT+L ++ D+S + G ++ ETNFR+YAY
Sbjct: 263 IAEDLAHLGLLYLFAGPGKEGYYVPTQLTAGKDVADGDASLGGDPGGHIIAETNFRVYAY 322
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP--RV 116
+ S + CEILRLF++ +Y+LPNL VG +T+E+++ A + G+ AEQII +++ +AHP R
Sbjct: 323 TFSDVECEILRLFTRPDYRLPNLYVGMLTREAVHEALDTGVAAEQIIKYIKSHAHPNARK 382
Query: 117 ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPS-RDVFEAACDYARDRSGLLWED 175
+P NV DQI LW + RV Y +FP+ D + AA A D LLWE+
Sbjct: 383 TTNGSGIPPNVADQIMLWAMERRRVRSAECVLYCDFPTGTDEYAAAVKAASDAGVLLWEN 442
Query: 176 SKKMRLVVNAEIHMHM 191
++M+L V H M
Sbjct: 443 REQMKLAVAKSGHERM 458
>gi|296818081|ref|XP_002849377.1| RNA polymerase II transcription factor B subunit 2 [Arthroderma
otae CBS 113480]
gi|238839830|gb|EEQ29492.1| RNA polymerase II transcription factor B subunit 2 [Arthroderma
otae CBS 113480]
Length = 484
Score = 139 bits (350), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 77/226 (34%), Positives = 120/226 (53%), Gaps = 32/226 (14%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL--------------------SMSLTDSSA 41
+ D D G+V + F PT+LAT L S T ++
Sbjct: 257 LADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMPGPVSGEPVVPAPSTGPTSATD 316
Query: 42 RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
GF++VETN+R+YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GITA
Sbjct: 317 GGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITA 376
Query: 102 EQIISFLQQNAHPRVADRIPS--------VPENVCDQIRLWESDLNRVEMTPAHYYEEFP 153
+QI+S+L +AHP++ S +P V DQIRLW+ + +R++ T ++EF
Sbjct: 377 DQIVSYLTTHAHPQMRKTKKSTSNVAASILPPTVVDQIRLWQLERDRIKATSGFLFKEFD 436
Query: 154 SRDVFEAACDYARDRSGLLW-EDSKKMRLVVNAEIHMHMREFLRGQ 198
+ FEA C YA + L+W DS++M V H + FLR +
Sbjct: 437 TFAEFEAPCKYAEEIGVLVWRSDSRRMFFVTR---HEQVAAFLRSR 479
>gi|115388809|ref|XP_001211910.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114195994|gb|EAU37694.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 460
Score = 139 bits (350), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 84/227 (37%), Positives = 124/227 (54%), Gaps = 33/227 (14%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL-----------SMSL-----TDSSARKEG 45
+ D AD G+V Q + F PT+LAT L S SL T S+ G
Sbjct: 238 LTDLADFGIV--YQHSPDATHFYPTRLATTLTSDASSLSNPISGSLSGPDGTSSAKAGSG 295
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
F++VETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GITA+QII
Sbjct: 296 FIIVETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAIEMGITADQII 355
Query: 106 SFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEF 152
S+L +AHP++ A PSV P V DQIRLW+ + +RV+ T +++F
Sbjct: 356 SYLATHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATAGFLFKDF 415
Query: 153 PSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
S +E C YA + L+W+ ++K V H + FLR +
Sbjct: 416 ASLAEYEDPCRYAEEIGVLVWKSNRKRMFFVTR--HEQVAAFLRSRK 460
>gi|451996931|gb|EMD89397.1| hypothetical protein COCHEDRAFT_1141584 [Cochliobolus
heterostrophus C5]
Length = 482
Score = 139 bits (350), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 77/214 (35%), Positives = 123/214 (57%), Gaps = 23/214 (10%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS------MSLTDS-------------SA 41
M++D + +GLV + R F PT+LAT L+ MS + + +A
Sbjct: 260 MLEDLSAMGLV--YRSDRNARTFYPTRLATTLTSDSGSAMSTSSNDIAQASTSTTGPPTA 317
Query: 42 RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
+GF+++ETN+R+YAY+ S + IL LF+K++++ PNL+ G +TKES++ A + GIT+
Sbjct: 318 ANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQAGITS 377
Query: 102 EQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
QIIS+L AHP++ +P +P V DQIRLWE + RVE T + EF S +
Sbjct: 378 AQIISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTTGYLMREFGSDAEYRDV 437
Query: 162 CDYARDRSGLLWE-DSKKMRLVVNAE-IHMHMRE 193
+YA L+W+ D+ + V + E I ++R+
Sbjct: 438 LNYASALGVLVWQNDANRCFFVSHVEQISAYLRK 471
>gi|451847848|gb|EMD61155.1| hypothetical protein COCSADRAFT_97372 [Cochliobolus sativus ND90Pr]
Length = 482
Score = 139 bits (350), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 77/214 (35%), Positives = 122/214 (57%), Gaps = 23/214 (10%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS------MSLTDS-------------SA 41
M++D + +GLV + R F PT+LAT L+ MS + +A
Sbjct: 260 MLEDLSAMGLVYRSE--RNARTFYPTRLATTLTSDSGSAMSASSKDIAQASTSTTGPPTA 317
Query: 42 RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
+GF+++ETN+R+YAY+ S + IL LF+K++++ PNL+ G +TKES++ A + GIT+
Sbjct: 318 ANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQAGITS 377
Query: 102 EQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
QIIS+L AHP++ +P +P V DQIRLWE + RVE T + EF S +
Sbjct: 378 AQIISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTTGYLMREFGSDAEYRDV 437
Query: 162 CDYARDRSGLLWE-DSKKMRLVVNAE-IHMHMRE 193
+YA L+W+ D+ + V + E I ++R+
Sbjct: 438 LNYASALGVLVWQNDANRCFFVSHVEQISAYLRK 471
>gi|302495702|ref|XP_003009866.1| hypothetical protein ARB_03925 [Arthroderma benhamiae CBS 112371]
gi|291173385|gb|EFE29221.1| hypothetical protein ARB_03925 [Arthroderma benhamiae CBS 112371]
Length = 386
Score = 139 bits (349), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 78/224 (34%), Positives = 122/224 (54%), Gaps = 32/224 (14%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTD-----------------SSA 41
+ D D G+V + F PT+LAT L S++++ S
Sbjct: 159 LADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMSGPISGEPAVPTTTAGTTTSGD 218
Query: 42 RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
GF++VETN+R+YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GITA
Sbjct: 219 AGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITA 278
Query: 102 EQIISFLQQNAHPRVADRIPS--------VPENVCDQIRLWESDLNRVEMTPAHYYEEFP 153
+QIIS+L +AHP++ S +P V DQIRLW+ + +R++ T ++EF
Sbjct: 279 DQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFD 338
Query: 154 SRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLR 196
+ FEA C YA + L+W+ DS++M V H + FLR
Sbjct: 339 TFAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR---HEQVAAFLR 379
>gi|302667863|ref|XP_003025510.1| hypothetical protein TRV_00272 [Trichophyton verrucosum HKI 0517]
gi|291189624|gb|EFE44899.1| hypothetical protein TRV_00272 [Trichophyton verrucosum HKI 0517]
Length = 386
Score = 139 bits (349), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 78/224 (34%), Positives = 122/224 (54%), Gaps = 32/224 (14%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTD-----------------SSA 41
+ D D G+V + F PT+LAT L S++++ S
Sbjct: 159 LADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMSGPISGEPAVPTTTAGTTTSGD 218
Query: 42 RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
GF++VETN+R+YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GITA
Sbjct: 219 AGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITA 278
Query: 102 EQIISFLQQNAHPRVADRIPS--------VPENVCDQIRLWESDLNRVEMTPAHYYEEFP 153
+QIIS+L +AHP++ S +P V DQIRLW+ + +R++ T ++EF
Sbjct: 279 DQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFD 338
Query: 154 SRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLR 196
+ FEA C YA + L+W+ DS++M V H + FLR
Sbjct: 339 TFAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR---HEQVAAFLR 379
>gi|299750117|ref|XP_001836551.2| transcription factor TFIIH complex subunit Tfb2 [Coprinopsis
cinerea okayama7#130]
gi|298408751|gb|EAU85259.2| transcription factor TFIIH complex subunit Tfb2 [Coprinopsis
cinerea okayama7#130]
Length = 403
Score = 139 bits (349), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 77/203 (37%), Positives = 125/203 (61%), Gaps = 11/203 (5%)
Query: 1 MIKDFADLGLV-KLQQVGRKESWFIPTKLATNLSMSLTD-------SSARKEGFVVVETN 52
M++D D GL+ + +Q ++ F PT+L+T L+ S SS +EGF+++ETN
Sbjct: 196 MLEDLRDYGLIWQRKQTSKR---FSPTRLSTTLTSSSPPLPSTSGASSGPQEGFIILETN 252
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ + L +L LF+ ++Y+ PNL+VG+IT+ES+ A NGI+A+QIIS+L +A
Sbjct: 253 YRIYAYTDNPLQTAVLSLFASLKYRFPNLVVGSITRESVKKALLNGISADQIISYLITHA 312
Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
HP + P +P V DQIRLWE + NR++ Y F S+ +E YA++ +L
Sbjct: 313 HPNMRKNNPLLPVTVQDQIRLWELEKNRLKSREGFLYTAFASQADYELVLQYAKELDVVL 372
Query: 173 WEDSKKMRLVVNAEIHMHMREFL 195
WE++ K + E H +++ F+
Sbjct: 373 WENASKRCFFGSLEGHANIKGFI 395
>gi|212535856|ref|XP_002148084.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
gi|210070483|gb|EEA24573.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
Length = 422
Score = 139 bits (349), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 76/215 (35%), Positives = 119/215 (55%), Gaps = 30/215 (13%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL-------------SMSLTDSSARKE---G 45
+ D D G++ + F PT+LAT L S+S ++A E G
Sbjct: 197 LADLTDFGIIYQHAPASEADRFYPTRLATTLTSDSSALSNTMSSSLSAQTNAATGEPGSG 256
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
F+++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+ A E GITA+QII
Sbjct: 257 FIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQII 316
Query: 106 SFLQQNAHPRV-----------ADRIPS--VPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
S+L +AHP++ IP +P V DQIRLW+ + +R++ TP ++EF
Sbjct: 317 SYLSTHAHPQMRKEGAVKSTSNTAGIPRSVLPPTVVDQIRLWQLERDRIKATPGFLFKEF 376
Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAE 186
S +E C YA + L+W+ D K+M V E
Sbjct: 377 ASLAEYEGPCRYAEEIGVLVWKSDRKRMFFVTRYE 411
>gi|91094191|ref|XP_971121.1| PREDICTED: similar to TFIIH basal transcription factor complex p52
subunit [Tribolium castaneum]
gi|270010854|gb|EFA07302.1| hypothetical protein TcasGA2_TC015892 [Tribolium castaneum]
Length = 472
Score = 139 bits (349), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 73/207 (35%), Positives = 123/207 (59%), Gaps = 9/207 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE-----GFVVVETNFRM 55
++ + GLV Q RK F PT+LA N++ S + E G+++VETN+R+
Sbjct: 266 FLQHLREFGLV--YQRKRKAGRFYPTRLALNITCSQGPETRILEEDTPKGYIIVETNYRV 323
Query: 56 YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
YAY+ S L ++ LF+++ Y+ PNL+VG IT++S+ A + GITA+QII +L+Q+AHP+
Sbjct: 324 YAYTDSNLQVALIGLFTELMYRFPNLVVGVITRDSIRQALKGGITADQIIGYLKQHAHPQ 383
Query: 116 V--ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
+ + +P V DQI+LWE + NR+ + Y +F S+ F +YA+ L+W
Sbjct: 384 MLEGEAKHPLPPTVVDQIKLWEIERNRLTYSEGVLYSQFLSQADFNILKEYAQSNGHLIW 443
Query: 174 EDSKKMRLVVNAEIHMHMREFLRGQNK 200
+ +K L++N H +++F + +K
Sbjct: 444 CNKEKRTLIINKSAHDDVKKFWKRYSK 470
>gi|240279262|gb|EER42767.1| TFIIH basal transcription factor complex p52 subunit [Ajellomyces
capsulatus H143]
Length = 485
Score = 139 bits (349), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 76/228 (33%), Positives = 125/228 (54%), Gaps = 33/228 (14%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSM----------------SLTDSSARKE- 44
+ D D G+V ++ + F PT+LAT L+ +L S++ E
Sbjct: 256 LADLTDFGIVYQHSPHQEATRFYPTRLATTLTSDSITLSSSISSSLTAPNLVPSASSNEP 315
Query: 45 --GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
GF+++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G +T++S+ A E GITA+
Sbjct: 316 STGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITAD 375
Query: 103 QIISFLQQNAHPRVAD-----------RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYE 150
QIIS+L +AHP++ +P+V P V DQIRLW+ + +R++ TP ++
Sbjct: 376 QIISYLTTHAHPQMRKYHATKSGANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFK 435
Query: 151 EFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
+F S FE C YA + L+W+ +K V H + +LR +
Sbjct: 436 DFVSLAEFEGPCRYAEEIGVLVWKSERKRMFFVTR--HEQVAAYLRSK 481
>gi|295662078|ref|XP_002791593.1| RNA polymerase II transcription factor B subunit 2
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226279719|gb|EEH35285.1| RNA polymerase II transcription factor B subunit 2
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 484
Score = 139 bits (349), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 79/227 (34%), Positives = 123/227 (54%), Gaps = 32/227 (14%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL-----------SMSLT--------DSSAR 42
+ D D G+V ++ + F PT+L T L S SLT +SS
Sbjct: 256 LTDLTDFGIVYQHFPPQEATRFYPTRLVTTLTSDSATLGSSISTSLTGPSGPTAANSSEP 315
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
GF+V+ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GITA+
Sbjct: 316 STGFIVIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITAD 375
Query: 103 QIISFLQQNAHPRVAD----------RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEE 151
QIIS+L +AHP++ P+V P V DQIRLW+ + +R++ TP +++
Sbjct: 376 QIISYLTTHAHPQMRKYHASKSSHTVGAPAVLPPTVVDQIRLWQLERDRIKATPGFLFKD 435
Query: 152 FPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
F S F+ C YA + L+W+ K+ V H + +LR +
Sbjct: 436 FVSLAEFDGPCRYAEEIGVLVWKSEKRRMFFVTR--HEQVAAYLRSR 480
>gi|225682227|gb|EEH20511.1| RNA polymerase II transcription factor B subunit 2
[Paracoccidioides brasiliensis Pb03]
gi|226289599|gb|EEH45083.1| TFIIH basal transcription factor complex p52 subunit
[Paracoccidioides brasiliensis Pb18]
Length = 484
Score = 139 bits (349), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 79/227 (34%), Positives = 123/227 (54%), Gaps = 32/227 (14%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL-----------SMSLT--------DSSAR 42
+ D D G+V ++ + F PT+L T L S SLT +SS
Sbjct: 256 LTDLTDFGIVYQHFPPQEATRFYPTRLVTTLTSDSATLGSSISTSLTGPSGPAAANSSEP 315
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
GF+V+ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GITA+
Sbjct: 316 STGFIVIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITAD 375
Query: 103 QIISFLQQNAHPRVAD----------RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEE 151
QIIS+L +AHP++ P+V P V DQIRLW+ + +R++ TP +++
Sbjct: 376 QIISYLTTHAHPQMRKYHASKSSHTVGAPAVLPPTVVDQIRLWQLERDRIKATPGFLFKD 435
Query: 152 FPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
F S F+ C YA + L+W+ K+ V H + +LR +
Sbjct: 436 FVSLAEFDGPCRYAEEIGVLVWKSEKRRMFFVTR--HEQVAAYLRSR 480
>gi|392594388|gb|EIW83712.1| transcription factor Tfb2 [Coniophora puteana RWD-64-598 SS2]
Length = 486
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/219 (34%), Positives = 122/219 (55%), Gaps = 26/219 (11%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTDSSARK-------------- 43
M++D D GLV ++ K F PT+LAT L S SL + +
Sbjct: 262 MLEDLRDYGLVWQRKASSKR--FSPTRLATTLTSVSPSLPTAGGSRNPGSAPGASSVANS 319
Query: 44 -------EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 96
GF+V+ETN+R+YAY+ + L +L LF ++Y+ PNL+VG +T+ES+ A
Sbjct: 320 TGINTNDHGFIVLETNYRVYAYTDNPLQIAVLNLFITLKYRFPNLVVGMLTRESVRRALG 379
Query: 97 NGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRD 156
NGI+AEQ+IS+L +AHP++ P +P V DQ+RLWE + NR++ Y++F ++
Sbjct: 380 NGISAEQVISYLTAHAHPQMRKNNPLLPVTVQDQVRLWELERNRLKSDDGFLYKDFATQA 439
Query: 157 VFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFL 195
+E YA++ +LWE++ K + H ++R F+
Sbjct: 440 DYEYVLAYAKELDVVLWENTSKRCFFGTLDGHTNVRGFI 478
>gi|195378984|ref|XP_002048261.1| GJ11446 [Drosophila virilis]
gi|194155419|gb|EDW70603.1| GJ11446 [Drosophila virilis]
Length = 492
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 78/213 (36%), Positives = 124/213 (58%), Gaps = 16/213 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL--------SMSLTDSSARKEGFVVVETNF 53
++ + GLV Q RKE F PT+LA N+ S S + ++ G++VVETN+
Sbjct: 279 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTNKDAAQASTSADEERMQERGYIVVETNY 336
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY+ S+L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+L+Q AH
Sbjct: 337 RVYAYTDSQLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAH 396
Query: 114 PR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
P V I S +P + DQI+LWE + NR T Y +F S++ F DYA+
Sbjct: 397 PNMKLVESAIQSKSCLPPTIVDQIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQS 456
Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+ L+W++ + +VV H ++ + + +K
Sbjct: 457 INMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 489
>gi|221120480|ref|XP_002159945.1| PREDICTED: general transcription factor IIH subunit 4-like [Hydra
magnipapillata]
Length = 458
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 69/185 (37%), Positives = 116/185 (62%), Gaps = 3/185 (1%)
Query: 15 QVGRKESWFIPTKLATNLSMSLT---DSSARKEGFVVVETNFRMYAYSTSKLHCEILRLF 71
Q RK F PT+LA NL ++T DSS++++GF+VVETN+R+YAY+ S LH +L LF
Sbjct: 269 QRKRKSKRFYPTRLAINLGSAVTGNSDSSSQQQGFLVVETNYRVYAYTDSVLHIALLSLF 328
Query: 72 SKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQI 131
+ I+ + P+ V +++ES+ A GI+AEQII FL+ AH ++ P + + DQI
Sbjct: 329 TDIKARFPSFTVALLSRESVQQALACGISAEQIIDFLKTRAHSQMTTSSPIIASTITDQI 388
Query: 132 RLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHM 191
+LWE + +R+ + Y +F S+ FE +A +++ LLW +++K +VV+ H +
Sbjct: 389 KLWEMERDRLRYSQGVLYNQFLSQSDFEMLRKFADEKNHLLWANNQKRLMVVSKSGHEDV 448
Query: 192 REFLR 196
+++ +
Sbjct: 449 KKYWK 453
>gi|325089533|gb|EGC42843.1| TFIIH basal transcription factor complex p52 subunit [Ajellomyces
capsulatus H88]
Length = 485
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/228 (33%), Positives = 125/228 (54%), Gaps = 33/228 (14%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSM----------------SLTDSSARKE- 44
+ D D G+V ++ + F PT+LAT L+ +L S++ E
Sbjct: 256 LADLTDFGIVYQHSPHQEATRFYPTRLATTLTSDSITLSSSISSSLTAPNLVPSASSNEP 315
Query: 45 --GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
GF+++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G +T++S+ A E GITA+
Sbjct: 316 STGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITAD 375
Query: 103 QIISFLQQNAHPRVAD-----------RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYE 150
QIIS+L +AHP++ +P+V P V DQIRLW+ + +R++ TP ++
Sbjct: 376 QIISYLTTHAHPQMRKYHATKSGANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFK 435
Query: 151 EFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
+F S FE C YA + L+W+ +K V H + +LR +
Sbjct: 436 DFVSLAEFEGPCRYAEEIGVLVWKSERKRMFFVTR--HEQVATYLRSK 481
>gi|225555033|gb|EEH03326.1| RNA pol II transcription initiation subunit [Ajellomyces capsulatus
G186AR]
Length = 485
Score = 138 bits (348), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 77/228 (33%), Positives = 124/228 (54%), Gaps = 33/228 (14%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL-----------SMSLT--------DSSAR 42
+ D D G+V ++ + F PT+LAT L S SLT S+
Sbjct: 256 LADLTDFGIVYQHSPHQEATRFYPTRLATTLTSDSITLGSSISSSLTAPNGVPSASSNEP 315
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
GF+++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G +T++S+ A E GITA+
Sbjct: 316 STGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITAD 375
Query: 103 QIISFLQQNAHPRVAD-----------RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYE 150
QIIS+L +AHP++ +P+V P V DQIRLW+ + +R++ TP ++
Sbjct: 376 QIISYLTTHAHPQMRKYHATKAGANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFK 435
Query: 151 EFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
+F + FE C YA + L+W+ +K V H + +LR +
Sbjct: 436 DFVNLAEFEGPCRYAEEIGVLVWKSERKRMFFVTR--HEQVAAYLRSR 481
>gi|194749905|ref|XP_001957376.1| GF10392 [Drosophila ananassae]
gi|190624658|gb|EDV40182.1| GF10392 [Drosophila ananassae]
Length = 497
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 77/214 (35%), Positives = 123/214 (57%), Gaps = 17/214 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLS---------MSLTDSSARKEGFVVVETN 52
++ + GLV Q RKE F PT+LA N++ ++ + ++ G++VVETN
Sbjct: 283 LQHLREFGLV--YQRKRKEGRFYPTRLALNVTSKEAAVAATLTTDEEGVQESGYIVVETN 340
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+L+Q A
Sbjct: 341 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 400
Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
HP V I S +P V DQI+LWE + NR T Y +F S++ F DYA+
Sbjct: 401 HPNMRLVESAIQSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQ 460
Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
L+W++ + +VV H ++ + + +K
Sbjct: 461 SIHVLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 494
>gi|340371925|ref|XP_003384495.1| PREDICTED: general transcription factor IIH subunit 4-like
[Amphimedon queenslandica]
Length = 446
Score = 137 bits (346), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 74/195 (37%), Positives = 111/195 (56%), Gaps = 3/195 (1%)
Query: 5 FADLGLVKLQQVGRKESWFIPTKLATNLSMSLT-DSSARKEGFVVVETNFRMYAYSTSKL 63
GLV ++ K + PTKL L+ T + + GF++VETN+R+ AY+ SKL
Sbjct: 251 LCQFGLVYQRKSSSKR--YYPTKLVIQLTAGETIGMNTSQAGFIIVETNYRVIAYTDSKL 308
Query: 64 HCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSV 123
H L LF +++Y+ PN+ VG IT+ES+ A +GI A+QIISFL Q+AH + + +
Sbjct: 309 HIATLALFCQMQYRFPNVAVGTITRESIQQALVHGIKADQIISFLTQHAHHNMLSKAHIL 368
Query: 124 PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVV 183
P V DQI+LWE + NR+ Y EF S +E YA D LLW + ++ +V+
Sbjct: 369 PPTVTDQIKLWEIERNRLSFQEGILYSEFLSVTDYEKVKKYAEDLGVLLWSNRQRRLMVI 428
Query: 184 NAEIHMHMREFLRGQ 198
+ + H +R F + Q
Sbjct: 429 HPDRHDEIRHFWKRQ 443
>gi|119486893|ref|XP_001262366.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Neosartorya fischeri NRRL 181]
gi|119410523|gb|EAW20469.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Neosartorya fischeri NRRL 181]
Length = 468
Score = 137 bits (346), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 79/229 (34%), Positives = 123/229 (53%), Gaps = 35/229 (15%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSM----------------SLTDSSARKEG 45
+ D AD G+V Q + F PT+LAT L+ + T SS G
Sbjct: 245 LTDLADFGIV--YQETPDATHFYPTRLATTLTSDSSTLSNPLAGSLSGPTGTSSSKAGSG 302
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
F+++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GITA+QII
Sbjct: 303 FIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQII 362
Query: 106 SFLQQNAHPRVADR-------------IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
S+L +AHP++ + +P V DQIRLW+ + +RV+ T + +F
Sbjct: 363 SYLSTHAHPQMRKHNVSRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVKATHGFLFRDF 422
Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQNK 200
+ +EA C YA + L+W+ D K+M V H + FL+ + K
Sbjct: 423 NTLAEYEAPCRYAEEIGVLVWKSDRKRMFFVTR---HHQVAAFLKSRRK 468
>gi|327298217|ref|XP_003233802.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
Tfb2 [Trichophyton rubrum CBS 118892]
gi|326463980|gb|EGD89433.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
Tfb2 [Trichophyton rubrum CBS 118892]
Length = 484
Score = 137 bits (346), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 79/229 (34%), Positives = 122/229 (53%), Gaps = 38/229 (16%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK------------------ 43
+ D D G+V + F PT+LAT L+ +DS A
Sbjct: 257 LADLTDFGIVYQHTPASGSTRFYPTRLATTLT---SDSLAMSGPISGEPTVPTTSAGTTT 313
Query: 44 -----EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
GF++VETN+R+YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E G
Sbjct: 314 SGDAGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMG 373
Query: 99 ITAEQIISFLQQNAHPRVADRIPS--------VPENVCDQIRLWESDLNRVEMTPAHYYE 150
ITA+QIIS+L +AHP++ S +P V DQIRLW+ + +R++ T ++
Sbjct: 374 ITADQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFK 433
Query: 151 EFPSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 198
EF + FEA C YA + L+W+ DS++M V H + FLR +
Sbjct: 434 EFDTFAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR---HEQVAAFLRSR 479
>gi|70982185|ref|XP_746621.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2) [Aspergillus fumigatus Af293]
gi|66844244|gb|EAL84583.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus fumigatus Af293]
gi|159122143|gb|EDP47265.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus fumigatus A1163]
Length = 479
Score = 137 bits (346), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 79/229 (34%), Positives = 123/229 (53%), Gaps = 35/229 (15%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSM----------------SLTDSSARKEG 45
+ D AD G+V Q + F PT+LAT L+ + T SS G
Sbjct: 256 LTDLADFGIV--YQETPDATHFYPTRLATTLTSDSSTLSNPLAGSLSGPTGTSSSKAGSG 313
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
F+++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E GITA+QII
Sbjct: 314 FIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQII 373
Query: 106 SFLQQNAHPRVADR-------------IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
S+L +AHP++ + +P V DQIRLW+ + +RV+ T + +F
Sbjct: 374 SYLSTHAHPQMRKHNVSRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVKATHGFLFRDF 433
Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQNK 200
+ +EA C YA + L+W+ D K+M V H + FL+ + K
Sbjct: 434 NTLAEYEAPCRYAEEIGVLVWKSDRKRMFFVTR---HHQVAAFLKSRRK 479
>gi|195477962|ref|XP_002086438.1| GE22864 [Drosophila yakuba]
gi|194186228|gb|EDW99839.1| GE22864 [Drosophila yakuba]
Length = 241
Score = 137 bits (345), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 78/214 (36%), Positives = 122/214 (57%), Gaps = 17/214 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL---------SMSLTDSSARKEGFVVVETN 52
++ + GLV Q RKE F PT+LA N+ S+++ + + + G++VVETN
Sbjct: 27 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 84
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+L+Q A
Sbjct: 85 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 144
Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
HP V I S +P V DQI+LWE + NR T Y +F S F DYA+
Sbjct: 145 HPNMRLVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSHTDFVTLRDYAQ 204
Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
L+W++ + +VV H ++ + + +K
Sbjct: 205 SIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 238
>gi|357618296|gb|EHJ71332.1| putative TFIIH basal transcription factor complex p52 subunit
[Danaus plexippus]
Length = 603
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/212 (35%), Positives = 122/212 (57%), Gaps = 15/212 (7%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS-----SARKEGFVVVETNFRM 55
++ + GLV Q RK F PT+LA N++ + D +A G+++ ETN+R+
Sbjct: 393 FLQHLREFGLV--YQRKRKAGRFYPTRLALNIT-CVKDGVAPLQTAASSGYIIAETNYRV 449
Query: 56 YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
YAY+TS L +L LF+++ Y+ PN++VG +T+ES+ A GI+A+QII++L+Q++HP+
Sbjct: 450 YAYTTSALQVALLGLFTELVYRFPNVVVGVLTRESVRAALRGGISAQQIITYLEQHSHPQ 509
Query: 116 V--ADR-----IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDR 168
+ +D+ S+P V DQIRLWES+ NR T Y +F S+ F DY R
Sbjct: 510 MLKSDQGGIRSSSSLPPTVLDQIRLWESERNRFTYTEGVVYNQFLSQAEFNVLRDYGRSS 569
Query: 169 SGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
L+W + +VV H ++ + + +K
Sbjct: 570 GALVWAADRTRTMVVARAAHDDVKRYWKRYSK 601
>gi|156057729|ref|XP_001594788.1| hypothetical protein SS1G_04596 [Sclerotinia sclerotiorum 1980]
gi|154702381|gb|EDO02120.1| hypothetical protein SS1G_04596 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 420
Score = 136 bits (343), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 69/218 (31%), Positives = 125/218 (57%), Gaps = 25/218 (11%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------------SLTDSSARKEGFV 47
M+ + D GL+ L + F PT+LAT L+ + + S+A ++GF+
Sbjct: 205 MLGNLIDFGLIYLPPSAPTQ--FFPTRLATTLTSDASALRTVSAGFDAASKSAANQKGFI 262
Query: 48 VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
++ETN+R+YAY+ S L +L LF+K+ + PN++ G ++++S+ A +GIT++QII++
Sbjct: 263 IIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVSRDSIRTAISHGITSDQIITY 322
Query: 108 LQQNAHPRV------ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
L +AHP++ A P +P V DQIRLW+ + R++ P ++EF + +E
Sbjct: 323 LSTHAHPQLVKASTAAHGGPVLPPTVVDQIRLWQLENERMKAVPGFLFKEFDGQKEYEGC 382
Query: 162 CDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 198
YA + L+W+ D+K+M V E +R++++ +
Sbjct: 383 AKYAEEVGVLVWKNDAKRMFFVTRVE---QLRDYIKAR 417
>gi|403417620|emb|CCM04320.1| predicted protein [Fibroporia radiculosa]
Length = 472
Score = 136 bits (343), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 63/159 (39%), Positives = 100/159 (62%)
Query: 37 TDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 96
+ + EGF+V+ETN+R+YAY+ + L +L LF ++ + PNL++GAIT+ES+ A
Sbjct: 306 SGTGPHGEGFIVLETNYRIYAYTDNPLQIAVLNLFVTLKSRFPNLVIGAITRESVKKALA 365
Query: 97 NGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRD 156
NGITA+QIIS+L +AHP++ P +P V DQIRLWE + NRV+ + Y F S+
Sbjct: 366 NGITADQIISYLTAHAHPQMHKNKPLLPVTVQDQIRLWELERNRVKSEEGYLYTSFASQA 425
Query: 157 VFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFL 195
+E +YA+ +LWE+ + + + H ++R F+
Sbjct: 426 DYEYVLNYAKQLDVVLWENPSRRSFFGSLDGHANIRGFI 464
>gi|16198089|gb|AAL13841.1| LD30622p [Drosophila melanogaster]
Length = 350
Score = 136 bits (343), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 78/214 (36%), Positives = 122/214 (57%), Gaps = 17/214 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL---------SMSLTDSSARKEGFVVVETN 52
++ + GLV Q RKE F PT+LA N+ S+++ + + + G++VVETN
Sbjct: 136 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 193
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+L+Q A
Sbjct: 194 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 253
Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
HP V I S +P V DQI+LWE + NR T Y +F S F DYA+
Sbjct: 254 HPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQ 313
Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
L+W++ + +VV H ++ + + +K
Sbjct: 314 SIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 347
>gi|339521903|gb|AEJ84116.1| general transcription factor IIH subunit 4 [Capra hircus]
Length = 463
Score = 136 bits (342), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 118/200 (59%), Gaps = 4/200 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLS--MSLTDSSARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS +S +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
+L + LFS++ Y+ PN++V +T+ S+ A +GITA+QI+ FL+ A P + +
Sbjct: 321 EPELQIARIALFSEMLYRFPNMVVPQVTRGSVQQAIASGITAQQIVHFLRTRADPVMLKQ 380
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460
>gi|198428509|ref|XP_002131018.1| PREDICTED: similar to General transcription factor II H,
polypeptide 4 [Ciona intestinalis]
Length = 463
Score = 136 bits (342), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/203 (35%), Positives = 121/203 (59%), Gaps = 7/203 (3%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLS----MSLTDSSARKEGFVVVETNFRMYA 57
++ F ++GLV Q RK F PT LA NL+ S+T+ + + GF++ ETNFR+YA
Sbjct: 260 LQHFREMGLV--MQRKRKSRRFYPTYLAINLANRANASVTNVTNSR-GFILAETNFRVYA 316
Query: 58 YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
Y+ S+L IL LF K+ Y+ PN+ V +T++S+ +A NGITA+QI+ +++ NAHP +
Sbjct: 317 YTDSELQYSILSLFCKMLYRFPNVCVLQLTRKSIQSAVVNGITADQILHYIKANAHPDML 376
Query: 118 DRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
P + + DQ+RLW + +R+ Y +F ++ FE +YA++ L+WE+S
Sbjct: 377 KNDPIIAPTLADQVRLWAMERDRLTYRDGVLYNQFLAQKDFEVLRNYAKELGALIWENSS 436
Query: 178 KMRLVVNAEIHMHMREFLRGQNK 200
K +VV + H ++ + + K
Sbjct: 437 KRYMVVTKDGHDQVKRYWKKYKK 459
>gi|194872905|ref|XP_001973103.1| GG13537 [Drosophila erecta]
gi|190654886|gb|EDV52129.1| GG13537 [Drosophila erecta]
Length = 499
Score = 136 bits (342), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 78/214 (36%), Positives = 122/214 (57%), Gaps = 17/214 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL---------SMSLTDSSARKEGFVVVETN 52
++ + GLV Q RKE F PT+LA N+ S+++ + + + G++VVETN
Sbjct: 285 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 342
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+L+Q A
Sbjct: 343 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 402
Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
HP V I S +P V DQI+LWE + NR T Y +F S F DYA+
Sbjct: 403 HPNMRMVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSHTDFVTLRDYAQ 462
Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
L+W++ + +VV H ++ + + +K
Sbjct: 463 SIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 496
>gi|195477975|ref|XP_002086439.1| GE22863 [Drosophila yakuba]
gi|194186229|gb|EDW99840.1| GE22863 [Drosophila yakuba]
Length = 499
Score = 135 bits (341), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 78/214 (36%), Positives = 122/214 (57%), Gaps = 17/214 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL---------SMSLTDSSARKEGFVVVETN 52
++ + GLV Q RKE F PT+LA N+ S+++ + + + G++VVETN
Sbjct: 285 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 342
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+L+Q A
Sbjct: 343 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 402
Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
HP V I S +P V DQI+LWE + NR T Y +F S F DYA+
Sbjct: 403 HPNMRLVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSHTDFVTLRDYAQ 462
Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
L+W++ + +VV H ++ + + +K
Sbjct: 463 SIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 496
>gi|195327811|ref|XP_002030611.1| GM24480 [Drosophila sechellia]
gi|194119554|gb|EDW41597.1| GM24480 [Drosophila sechellia]
Length = 499
Score = 135 bits (341), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 78/214 (36%), Positives = 122/214 (57%), Gaps = 17/214 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL---------SMSLTDSSARKEGFVVVETN 52
++ + GLV Q RKE F PT+LA N+ S+++ + + + G++VVETN
Sbjct: 285 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 342
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+L+Q A
Sbjct: 343 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 402
Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
HP V I S +P V DQI+LWE + NR T Y +F S F DYA+
Sbjct: 403 HPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQ 462
Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
L+W++ + +VV H ++ + + +K
Sbjct: 463 SIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 496
>gi|195590491|ref|XP_002084979.1| GD12553 [Drosophila simulans]
gi|194196988|gb|EDX10564.1| GD12553 [Drosophila simulans]
Length = 500
Score = 135 bits (340), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 78/214 (36%), Positives = 122/214 (57%), Gaps = 17/214 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL---------SMSLTDSSARKEGFVVVETN 52
++ + GLV Q RKE F PT+LA N+ S+++ + + + G++VVETN
Sbjct: 286 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 343
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+L+Q A
Sbjct: 344 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 403
Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
HP V I S +P V DQI+LWE + NR T Y +F S F DYA+
Sbjct: 404 HPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQ 463
Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
L+W++ + +VV H ++ + + +K
Sbjct: 464 SIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 497
>gi|425777809|gb|EKV15965.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Penicillium digitatum PHI26]
gi|425782577|gb|EKV20476.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Penicillium digitatum Pd1]
Length = 382
Score = 135 bits (340), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 81/227 (35%), Positives = 125/227 (55%), Gaps = 38/227 (16%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA------------------RK 43
+ D AD G+V Q + F PT+LAT L+ +DSSA
Sbjct: 159 LADLADFGIVY--QEDADATHFYPTRLATTLT---SDSSALSNPVTGSLTGPVGPSGGSG 213
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
GF+++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+ A E GITA+Q
Sbjct: 214 SGFIIIETNYRLYAYTSSPLQISLISLFTNLKYRFPNLVTGKITRQSVRRAIEMGITADQ 273
Query: 104 IISFLQQNAHPRV---------ADRIPS--VPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
IIS+L +AHP++ IP+ +P V DQIRLW+ + +R+ T +++F
Sbjct: 274 IISYLLSHAHPQLRKHSAAQSNGKGIPASVLPPTVTDQIRLWQLERDRLRATAGFLFKDF 333
Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 198
S ++A C YA + L+W+ D K+M V H + FLR Q
Sbjct: 334 TSLAEYQAPCQYAAEIGVLVWKSDRKRMFFVTR---HEQVAAFLRSQ 377
>gi|21357821|ref|NP_648780.1| marionette [Drosophila melanogaster]
gi|7294263|gb|AAF49614.1| marionette [Drosophila melanogaster]
gi|220942562|gb|ACL83824.1| Tfb2-PA [synthetic construct]
Length = 499
Score = 135 bits (340), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 78/214 (36%), Positives = 122/214 (57%), Gaps = 17/214 (7%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL---------SMSLTDSSARKEGFVVVETN 52
++ + GLV Q RKE F PT+LA N+ S+++ + + + G++VVETN
Sbjct: 285 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 342
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+L+Q A
Sbjct: 343 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 402
Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
HP V I S +P V DQI+LWE + NR T Y +F S F DYA+
Sbjct: 403 HPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQ 462
Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
L+W++ + +VV H ++ + + +K
Sbjct: 463 SIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 496
>gi|326483225|gb|EGE07235.1| transcription factor Tfb2 [Trichophyton equinum CBS 127.97]
Length = 484
Score = 135 bits (340), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 79/229 (34%), Positives = 121/229 (52%), Gaps = 38/229 (16%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK------------------ 43
+ D D G+V + F PT+LAT L+ +DS A
Sbjct: 257 LADLTDFGIVYQHTPASGSARFYPTRLATTLT---SDSLAMSGPISGEPAVPTTTSGTTT 313
Query: 44 -----EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
GF++VETN R+YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E G
Sbjct: 314 SGDAGTGFIIVETNHRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMG 373
Query: 99 ITAEQIISFLQQNAHPRVADRIPS--------VPENVCDQIRLWESDLNRVEMTPAHYYE 150
ITA+QIIS+L +AHP++ S +P V DQIRLW+ + +R++ T ++
Sbjct: 374 ITADQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFK 433
Query: 151 EFPSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 198
EF + FEA C YA + L+W+ DS++M V H + FLR +
Sbjct: 434 EFDTFAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR---HEQVAAFLRSR 479
>gi|325181847|emb|CCA16302.1| general transcription factor IIH subunit putative [Albugo laibachii
Nc14]
Length = 473
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/209 (36%), Positives = 118/209 (56%), Gaps = 15/209 (7%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFV------------- 47
++ DF + GL+ ++ F T LA NL T +S E V
Sbjct: 261 LLTDFNNFGLLYRKRSNSDR--FYTTSLAINLIFGGTTTSNYSETQVARPRMSSMSDLLT 318
Query: 48 VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
+VETNF++YAY+TS LH +L +F I +LPNL +G IT+ESL +A +GI+A+QI F
Sbjct: 319 IVETNFKVYAYTTSTLHIAMLSVFVDIVARLPNLAIGFITRESLRSALVHGISAQQIYDF 378
Query: 108 LQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
L ++AHPR+ P VPEN+ DQI LWE + NR++ ++ F S++ FE +YA+
Sbjct: 379 LLKHAHPRMKKNNPIVPENIADQIYLWEKERNRIKFLEGILFDGFNSKEEFEQVREYAQK 438
Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
+ L W D ++L +++ +R F++
Sbjct: 439 QRFLTWTDPIHLKLAISSNGIESVRNFIQ 467
>gi|195441209|ref|XP_002068409.1| GK20438 [Drosophila willistoni]
gi|194164494|gb|EDW79395.1| GK20438 [Drosophila willistoni]
Length = 512
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 82/220 (37%), Positives = 122/220 (55%), Gaps = 23/220 (10%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLS-------------MSLTDSSARKE--GF 46
++ + GLV Q RKE F PT+LA N++ MS ++ R + G+
Sbjct: 292 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKESADATTAIVTMSASEEEERMQDRGY 349
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
+VVETN+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQIIS
Sbjct: 350 IVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIIS 409
Query: 107 FLQQNAHPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEA 160
+L+Q AHP V I S +P V DQI+LWE + NR T Y +F S F
Sbjct: 410 YLEQYAHPNMKLVESAINSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSTTDFVT 469
Query: 161 ACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
DYA+ L+W++ K +VV H ++ + + +K
Sbjct: 470 LRDYAQSIQVLVWQNEKTRTMVVQKNGHDDVKRYWKKYSK 509
>gi|315041383|ref|XP_003170068.1| TFIIH basal transcription factor complex p52 subunit [Arthroderma
gypseum CBS 118893]
gi|311345102|gb|EFR04305.1| TFIIH basal transcription factor complex p52 subunit [Arthroderma
gypseum CBS 118893]
Length = 484
Score = 135 bits (339), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/226 (33%), Positives = 118/226 (52%), Gaps = 32/226 (14%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEG---------------- 45
+ D D G+V + F PT+LAT L+ S G
Sbjct: 257 LADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMSGPISGEPTVPAAAAGATTAGD 316
Query: 46 ----FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
F++VETN+R+YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GITA
Sbjct: 317 AGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITA 376
Query: 102 EQIISFLQQNAHPRVADRIPS--------VPENVCDQIRLWESDLNRVEMTPAHYYEEFP 153
+QI+S+L +AHP++ S +P V DQIRLW+ + +R++ T ++EF
Sbjct: 377 DQIVSYLTTHAHPQMRKTKKSTTNVSSTVLPPTVVDQIRLWQLERDRIKATSGFLFKEFD 436
Query: 154 SRDVFEAACDYARDRSGLLW-EDSKKMRLVVNAEIHMHMREFLRGQ 198
+ FEA C YA + L+W DS++M V H + FLR +
Sbjct: 437 TFAEFEAPCKYAEEIGVLVWRSDSRRMFFVTR---HEQVAAFLRSR 479
>gi|50557260|ref|XP_506038.1| YALI0F30173p [Yarrowia lipolytica]
gi|62901293|sp|Q6BZX4.1|TFB2_YARLI RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|49651908|emb|CAG78851.1| YALI0F30173p [Yarrowia lipolytica CLIB122]
Length = 467
Score = 135 bits (339), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/218 (32%), Positives = 121/218 (55%), Gaps = 20/218 (9%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM------------------SLTDSSAR 42
M++D D G+V ++ + F PT+LAT L+ +++ S A
Sbjct: 245 MLEDLRDYGIVYQRKASSRR--FYPTRLATTLTSETAALRTASQSMEAATQDTISSSVAA 302
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
GF+++ETNFR+YAY+ S L +L LF ++ + N++ G I ++S+ A NGITAE
Sbjct: 303 DSGFIILETNFRLYAYTESPLQIAVLNLFVNLKTRFANMVTGQINRDSVRFALSNGITAE 362
Query: 103 QIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAAC 162
QII++L +AHPR+ +P V DQI+LW+ +++R+ T + + EF + D ++
Sbjct: 363 QIITYLSVHAHPRMKGMEHVLPPTVVDQIKLWQLEMDRIRATDGYLFSEFKNFDEYKDVS 422
Query: 163 DYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
YA++ LL+E+ K + V + EF++ +N+
Sbjct: 423 TYAKELGVLLYENPGKRKFVSTLAGSQQIVEFVKRRNQ 460
>gi|320167307|gb|EFW44206.1| transcription factor tfb2 [Capsaspora owczarzaki ATCC 30864]
Length = 406
Score = 135 bits (339), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 72/196 (36%), Positives = 116/196 (59%), Gaps = 6/196 (3%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAY 58
+++ ++GLV ++ R F PT LA NL+ + +A +G++VVETNFR+YAY
Sbjct: 182 LLQHLREIGLVFRRK--RTSRRFYPTPLAINLASGSAKNLDAADVKGYIVVETNFRIYAY 239
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S L +L LF + Y+ PN++ G ++++S+ A G+TAEQ+I FL+ +AHP++
Sbjct: 240 TDSPLQLALLSLFVDLRYRFPNMVCGLLSRDSVRKALVKGLTAEQMIRFLRTHAHPQMRS 299
Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
R P +PE + DQ+RLWE + NR+ + PA YE F ++ + Y RD L S+
Sbjct: 300 RTPVLPETISDQLRLWELERNRLRVLPAVLYERFSNQREHDLLHHYGRDLGVELAHSSQH 359
Query: 179 MRLVVNAEIHMHMREF 194
+VV E H ++ F
Sbjct: 360 --IVVTFEGHEQIKTF 373
>gi|325181222|emb|CCA15636.1| general transcription factor IIH subunit putative [Albugo laibachii
Nc14]
Length = 1033
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 76/209 (36%), Positives = 118/209 (56%), Gaps = 15/209 (7%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFV------------- 47
++ DF + GL+ ++ F T LA NL T +S E V
Sbjct: 308 LLTDFNNFGLLYRKRSNSDR--FYTTSLAINLIFGGTTTSNYSETQVARPRMSSMSDLLT 365
Query: 48 VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
+VETNF++YAY+TS LH +L +F I +LPNL +G IT+ESL +A +GI+A+QI F
Sbjct: 366 IVETNFKVYAYTTSTLHIAMLSVFVDIVARLPNLAIGFITRESLRSALVHGISAQQIYDF 425
Query: 108 LQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
L ++AHPR+ P VPEN+ DQI LWE + NR++ ++ F S++ FE +YA+
Sbjct: 426 LLKHAHPRMKKNNPIVPENIADQIYLWEKERNRIKFLEGILFDGFNSKEEFEQVREYAQK 485
Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
+ L W D ++L +++ +R F++
Sbjct: 486 QRFLTWTDPIHLKLAISSNGIESVRNFIQ 514
>gi|346464903|gb|AEO32296.1| hypothetical protein [Amblyomma maculatum]
Length = 436
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 67/179 (37%), Positives = 108/179 (60%), Gaps = 4/179 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR--KEGFVVVETNFRMYAYS 59
++ + GLV Q R+ F PT+LA NL+ L +++ R + G+++VETN+R+YAY+
Sbjct: 258 LQHLREFGLV--YQRKRRSGRFYPTRLAINLASGLRETNLRSYEAGYIMVETNYRVYAYT 315
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L +L LF ++ Y+ PNL+V +T+ES+ A +GIT+ QII FL+ AHP +
Sbjct: 316 NSQLQVALLALFCELLYRFPNLVVARLTRESVRQALRSGITSNQIIKFLRMYAHPEALKQ 375
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
P +P + DQ+RLWE + +R Y +F S+ F+ +YA D L+W++ K
Sbjct: 376 TPVIPATIMDQLRLWELERDRFVYREGVLYSQFISQSDFQLLRNYASDLGVLIWDNPSK 434
>gi|406862938|gb|EKD15987.1| tfiih and nucleotide excision repair factor 3 complexes subunit
[Marssonina brunnea f. sp. 'multigermtubi' MB_m1]
Length = 421
Score = 134 bits (337), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 71/215 (33%), Positives = 123/215 (57%), Gaps = 24/215 (11%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD-------------SSARKEGFV 47
M+K D GLV ++ F PT+L+T L+ + S + +GF+
Sbjct: 203 MLKYLVDFGLVYSPSSTPQQ--FFPTRLSTTLTSDASGLRSVSAGFDDALKSESGTKGFI 260
Query: 48 VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
++ETN+R+YAY+ S L +L LF+K+ + PN++ G +T++S+ NA +GIT++QIIS+
Sbjct: 261 IIETNYRLYAYTNSPLQIAVLALFTKMGVRYPNMVTGRVTRKSVANAISHGITSDQIISY 320
Query: 108 LQQNAHPRVADRI-----PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAAC 162
L+ +AHP++ P +P V DQIRLW+ + R++ T ++EF ++ +E
Sbjct: 321 LRAHAHPQLVKAAAVNGNPVLPPTVVDQIRLWQIENERMKATAGFLFKEFETQKEYEGCA 380
Query: 163 DYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLR 196
YA + L+W+ D+K+M V H +R++L+
Sbjct: 381 RYADENGVLVWKNDAKRMFFVTR---HEQLRDYLK 412
>gi|255936123|ref|XP_002559088.1| Pc13g06540 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211583708|emb|CAP91723.1| Pc13g06540 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 483
Score = 134 bits (337), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 79/227 (34%), Positives = 125/227 (55%), Gaps = 38/227 (16%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA------------------RK 43
+ D AD G+V + + F PT+LAT L+ +DSSA
Sbjct: 260 LMDLADFGIVYQEHA--DATRFYPTRLATTLT---SDSSALSNPVTGSLTGPAGPSGGSG 314
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
GF+++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+ A E GITA+Q
Sbjct: 315 SGFIIIETNYRLYAYTSSPLQISLISLFTNLKYRFPNLVTGKITRQSVRRAIEMGITADQ 374
Query: 104 IISFLQQNAHPRV---------ADRIPS--VPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
IIS+L +AHP++ +P+ +P V DQIRLW+ + +R+ T +++F
Sbjct: 375 IISYLLSHAHPQLRKHSAAQPNGKGVPASVLPPTVTDQIRLWQLERDRLRATAGFLFKDF 434
Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 198
S ++A C YA + L+W+ D K+M V H + FLR Q
Sbjct: 435 TSLAEYQAPCQYAAEIGVLVWKSDRKRMFFVTR---HEQVAAFLRSQ 478
>gi|347830304|emb|CCD46001.1| similar to RNA polymerase II transcription factor B subunit 2
[Botryotinia fuckeliana]
Length = 471
Score = 134 bits (337), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 124/218 (56%), Gaps = 25/218 (11%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------------SLTDSSARKEGFV 47
M+ + DLGL+ L + F PT+LAT L+ + + S+A ++GF+
Sbjct: 256 MLANLIDLGLIYLPPSAPTQ--FFPTRLATTLTSDASALRTVAAGFDAASKSAASQKGFI 313
Query: 48 VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
++ETN+R+YAY+ S L +L LF+K+ + PN++ G ++++S+ A +GIT++QII++
Sbjct: 314 IIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVSRDSIRTAIAHGITSDQIITY 373
Query: 108 LQQNAHPRVADRI------PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
L +AHP++ P +P V DQIRLW+ + R++ P ++F ++ +E
Sbjct: 374 LSTHAHPQLVKASSASHGGPVLPPTVVDQIRLWQLENERMKAVPGFLMKDFETQKEYEGC 433
Query: 162 CDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 198
YA + L+W+ D+K+M V E +R++ + +
Sbjct: 434 AKYAEEVGVLVWKSDAKRMFFVTRVE---QLRDYFKAK 468
>gi|334323685|ref|XP_003340422.1| PREDICTED: valyl-tRNA synthetase, mitochondrial-like [Monodelphis
domestica]
Length = 1641
Score = 134 bits (336), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 79/231 (34%), Positives = 124/231 (53%), Gaps = 35/231 (15%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS +T + +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVTGAGGTAHQPGFIVVETNYRLYAYT 320
Query: 60 T-------------------------------SKLHCEILRLFSKIEYQLPNLIVGAITK 88
S+L ++ LFS++ Y+ PN++V IT+
Sbjct: 321 GEVGGMRMGIGMLEGRNCGFFGIILFFSLATESELQIALIALFSEMLYRFPNMVVAQITR 380
Query: 89 ESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHY 148
ES+ A NGITA+QII FL+ AHP + +IP +P + DQIRLWE + +R+ +
Sbjct: 381 ESVQQAIANGITAQQIIHFLKTRAHPVMLKQIPVLPPTITDQIRLWELERDRLRFSEGVL 440
Query: 149 YEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
Y +F S+ FE +AR+ L++E++ K +VV H ++ F + Q
Sbjct: 441 YNQFLSQVDFELLLSHARELGVLVFENTAKRLMVVTPAGHSDVKRFWKRQK 491
>gi|345492819|ref|XP_001602702.2| PREDICTED: general transcription factor IIH subunit 4-like [Nasonia
vitripennis]
Length = 428
Score = 133 bits (335), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 120/203 (59%), Gaps = 6/203 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAY 58
++ + GLV Q RK F PT+LA N++ +R KE FV+VETN+R+YAY
Sbjct: 225 LQHLREFGLV--YQRKRKAGRFYPTRLALNIATGQDKPISRDLEKERFVIVETNYRVYAY 282
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S L ++ LF ++ Y+ PNL+V +T++S+ A ++GITA QI+ +L Q+AH ++ D
Sbjct: 283 TNSNLQVALIGLFCELLYRFPNLVVAILTRDSVRAALKSGITAVQIVGYLNQHAHNKMID 342
Query: 119 -RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
P++P + DQI+LWE++ NR + Y +F S+ FE D+A ++W+ +
Sbjct: 343 PGPPTLPPTIVDQIKLWENERNRFIFSEGVLYSQFHSQIDFEVLRDHAVSLGVMIWQSDR 402
Query: 178 KMRLVVNAEIHMHMREFLRGQNK 200
K +VV H +++F + +K
Sbjct: 403 KRTMVVTKTGHDDVKKFWKRYSK 425
>gi|154308354|ref|XP_001553513.1| hypothetical protein BC1G_08237 [Botryotinia fuckeliana B05.10]
Length = 410
Score = 133 bits (335), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 125/218 (57%), Gaps = 25/218 (11%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------------SLTDSSARKEGFV 47
M+ + DLGL+ L + F PT+LAT L+ + + S+A ++GF+
Sbjct: 195 MLANLIDLGLIYLPPSAPTQ--FFPTRLATTLTSDASALRTVAAGFDAASKSAASQKGFI 252
Query: 48 VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
++ETN+R+YAY+ S L +L LF+K+ + PN++ G ++++S+ A +GIT++QII++
Sbjct: 253 IIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVSRDSIRTAIAHGITSDQIITY 312
Query: 108 LQQNAHPRV------ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
L +AHP++ + P +P V DQIRLW+ + R++ P ++F ++ +E
Sbjct: 313 LSTHAHPQLVKASSASHGGPVLPPTVVDQIRLWQLENERMKAVPGFLMKDFETQKEYEGC 372
Query: 162 CDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 198
YA + L+W+ D+K+M V E +R++ + +
Sbjct: 373 AKYAEEVGVLVWKSDAKRMFFVTRVE---QLRDYFKAK 407
>gi|348683175|gb|EGZ22990.1| hypothetical protein PHYSODRAFT_479459 [Phytophthora sojae]
Length = 490
Score = 133 bits (335), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 97/153 (63%)
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
VVVETNF++YAY+TS LH +L +F I +LPNL +G IT+ESL +A +GI+A+QI
Sbjct: 333 LVVVETNFKIYAYTTSTLHVAMLSVFVDIVARLPNLAIGFITRESLRSALIHGISAQQIY 392
Query: 106 SFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYA 165
FL ++AHP++ P +PEN+ DQI LWE + NRV+ ++ F +++ +E+ DYA
Sbjct: 393 DFLMKHAHPKMRRNSPVIPENIADQIYLWERERNRVQFMEGILFDGFNTKEDYESVRDYA 452
Query: 166 RDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
+D L W D RL + +R +++ Q
Sbjct: 453 KDLKVLTWSDPIHFRLSITTAGIDDVRHYIQSQ 485
>gi|389749775|gb|EIM90946.1| transcription factor Tfb2 [Stereum hirsutum FP-91666 SS1]
Length = 494
Score = 132 bits (333), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 75/212 (35%), Positives = 119/212 (56%), Gaps = 19/212 (8%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD----------SSARKEGFVVVE 50
M++D D GL+ ++ + F PT+LAT L+ SL + + +GF+++E
Sbjct: 265 MLEDLRDYGLIWQRKATSRR--FSPTRLATTLTSSLPPLPTASGTGTSGTGQTQGFIILE 322
Query: 51 TNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQ 110
TN+R+YAY+ + L +L LF ++ + PNL+VG+IT++S+ A NGITA+QIIS+L
Sbjct: 323 TNYRLYAYTDNPLQTAVLNLFVTMKSRFPNLVVGSITRDSVRKALSNGITADQIISYLTT 382
Query: 111 NAHPRVADRI-------PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACD 163
AHP++ + P +P V DQIRLWE + NR++ Y F S+ +E
Sbjct: 383 YAHPQMRKNVSRCRSFTPLLPVTVQDQIRLWELERNRLKSQEGFLYTAFASQADYEYVLT 442
Query: 164 YARDRSGLLWEDSKKMRLVVNAEIHMHMREFL 195
YA+ +LWE+ K E H ++R F+
Sbjct: 443 YAKQLDVVLWENPTKRCFFGTMEGHANIRGFI 474
>gi|76154655|gb|AAX26095.2| SJCHGC05044 protein [Schistosoma japonicum]
Length = 214
Score = 132 bits (331), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 67/158 (42%), Positives = 96/158 (60%)
Query: 39 SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
SS+ G++++ETNFR+YAY+ S L +L LFSKI + PNL+V IT++S+ A G
Sbjct: 52 SSSSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRG 111
Query: 99 ITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVF 158
ITA QI+SFL NAHP + P +P + DQIRLWE + +R YE F F
Sbjct: 112 ITANQILSFLTSNAHPDMLLSNPILPPTLTDQIRLWELERDRFVFQEGCLYEHFSRNTDF 171
Query: 159 EAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
E DYA++ LLWE+ ++ +VV+ H +R+F +
Sbjct: 172 ELVRDYAKNIGVLLWENPERRLMVVSKSGHEDVRKFWK 209
>gi|332030486|gb|EGI70174.1| General transcription factor IIH subunit 4 [Acromyrmex echinatior]
Length = 550
Score = 132 bits (331), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 113/188 (60%), Gaps = 6/188 (3%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAY 58
++ + GLV Q RK F PT+LA N++ T R KEG+++VETN+R+YAY
Sbjct: 264 LQHLREFGLV--YQRKRKAGRFYPTRLALNIATGETKPLTRDTDKEGYIIVETNYRVYAY 321
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S L +L LF ++ Y+ PN++V +T++S+ A ++GITA QI+ +L+Q+AH ++ +
Sbjct: 322 TNSNLQVALLGLFCEMLYRFPNVVVSILTRDSVRQALKSGITASQIVGYLRQHAHSKMIE 381
Query: 119 RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
P + P + DQI+LWE++ NR + Y +F S+ FE D+A L+W+ +
Sbjct: 382 AGPPILPPTIVDQIKLWENERNRFLFSEGVLYSQFLSQTDFEVLRDHAVSTGVLIWQSER 441
Query: 178 KMRLVVNA 185
+ L+ +
Sbjct: 442 RSTLLADT 449
>gi|159474228|ref|XP_001695231.1| predicted protein [Chlamydomonas reinhardtii]
gi|158276165|gb|EDP01939.1| predicted protein [Chlamydomonas reinhardtii]
Length = 435
Score = 132 bits (331), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/139 (46%), Positives = 89/139 (64%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
+ FV+VE+N+R+Y Y+ S + +L LF K E LPNL VGAI ++S+ A GITA++
Sbjct: 297 DAFVIVESNYRVYVYTRSPVTIAVLELFVKREALLPNLFVGAIRRDSILAALARGITADE 356
Query: 104 IISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACD 163
++++L HP +A R P VPE V DQIRLWE+ +NR++ PA YE SR ++E A
Sbjct: 357 LVAYLSARPHPSIAARCPVVPEVVSDQIRLWEASMNRLQAHPAVLYENMESRQLYERAVA 416
Query: 164 YARDRSGLLWEDSKKMRLV 182
AR L WED +MR V
Sbjct: 417 AARAAGTLQWEDGARMRFV 435
>gi|396459783|ref|XP_003834504.1| similar to RNA polymerase II transcription factor B subunit 2
[Leptosphaeria maculans JN3]
gi|312211053|emb|CBX91139.1| similar to RNA polymerase II transcription factor B subunit 2
[Leptosphaeria maculans JN3]
Length = 483
Score = 131 bits (330), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 69/192 (35%), Positives = 108/192 (56%), Gaps = 21/192 (10%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS------MSLTDS-------------SA 41
M+ D + +GLV + + F PT+LAT L+ MS + + +
Sbjct: 262 MLDDLSAMGLV--YRTSKDARTFYPTRLATTLTSDSGSAMSASSNDIAQANQGNAGTLAT 319
Query: 42 RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
+GF+++ETN+R+YAY+ S + IL LF+K++++ PNL+ G +TKES++ A GIT+
Sbjct: 320 ANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVSAGITS 379
Query: 102 EQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
QIIS+L AHP++ + +P V DQIRLWE + RVE+TP ++F S +
Sbjct: 380 AQIISYLSTYAHPQMQKNVSFIPPTVMDQIRLWEYEGERVEVTPGFLMKDFGSDAEYRDV 439
Query: 162 CDYARDRSGLLW 173
YA L+W
Sbjct: 440 LGYADALGVLVW 451
>gi|146414792|ref|XP_001483366.1| hypothetical protein PGUG_04095 [Meyerozyma guilliermondii ATCC
6260]
Length = 498
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 77/243 (31%), Positives = 121/243 (49%), Gaps = 51/243 (20%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSLTDS--------SAR 42
M+ D D GLV Q + F PT+LAT L SM+L + S
Sbjct: 251 MLADLRDYGLV--YQYSDQSDRFYPTRLATTLTSESASLKTPSMALDQAVDTSNGLDSGP 308
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
+G +++ETNF++YAY+ S L IL LF + + N++ G IT+ES+ NA NGIT++
Sbjct: 309 GQGSIILETNFKLYAYTNSPLEIAILNLFVNLRTRFSNMVSGQITRESIRNALYNGITSD 368
Query: 103 QIISFLQQNAHPRV-------------------------------ADRIPSVPENVCDQI 131
QII+FL+ +AHP++ ++ +P V DQI
Sbjct: 369 QIINFLETHAHPQMRALAKERLDKKVEFDASNNINTAGGGQNSSAQHKLEILPPTVVDQI 428
Query: 132 RLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHM 191
+LW+ +L+R++ + ++EF S+ F+ C+YA D L+W D KMR V + +
Sbjct: 429 KLWQLELDRIQAFEGYLFKEFASQQEFDVLCNYASDVGVLIWSDRTKMRFFVTKDGISQV 488
Query: 192 REF 194
+F
Sbjct: 489 ADF 491
>gi|190347675|gb|EDK39997.2| hypothetical protein PGUG_04095 [Meyerozyma guilliermondii ATCC
6260]
Length = 498
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 77/243 (31%), Positives = 121/243 (49%), Gaps = 51/243 (20%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSLTDS--------SAR 42
M+ D D GLV Q + F PT+LAT L SM+L + S
Sbjct: 251 MLADLRDYGLV--YQYSDQSDRFYPTRLATTLTSESASLKTPSMALDQAVDTSNGLDSGP 308
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
+G +++ETNF++YAY+ S L IL LF + + N++ G IT+ES+ NA NGIT++
Sbjct: 309 GQGSIILETNFKLYAYTNSPLEIAILNLFVNLRTRFSNMVSGQITRESIRNALYNGITSD 368
Query: 103 QIISFLQQNAHPRV-------------------------------ADRIPSVPENVCDQI 131
QII+FL+ +AHP++ ++ +P V DQI
Sbjct: 369 QIINFLETHAHPQMRALAKERLDKKVEFDASNNINTAGGGQNSSAQHKLEILPPTVVDQI 428
Query: 132 RLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHM 191
+LW+ +L+R++ + ++EF S+ F+ C+YA D L+W D KMR V + +
Sbjct: 429 KLWQLELDRIQAFEGYLFKEFASQQEFDVLCNYASDVGVLIWSDRTKMRFFVTKDGISQV 488
Query: 192 REF 194
+F
Sbjct: 489 ADF 491
>gi|353231086|emb|CCD77504.1| putative tfiih, polypeptide [Schistosoma mansoni]
Length = 486
Score = 130 bits (328), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 97/158 (61%)
Query: 39 SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
S++ G++++ETNFR+YAY+ S L +L LFSKI + PNL+V IT++S+ A G
Sbjct: 324 SNSSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRG 383
Query: 99 ITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVF 158
ITA QI+SFL NAHP + + P +P + DQIRLWE + +R YE+F F
Sbjct: 384 ITANQILSFLTANAHPDMLLQNPILPPTLTDQIRLWELERDRFLFQEGCLYEQFSRNTDF 443
Query: 159 EAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
E DYA+ LLWE+ ++ +VV+ H +R+F +
Sbjct: 444 EMVRDYAKSIGVLLWENPERRLMVVSKAGHEDVRKFWK 481
>gi|256086358|ref|XP_002579367.1| tfiih polypeptide [Schistosoma mansoni]
Length = 486
Score = 130 bits (328), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 97/158 (61%)
Query: 39 SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
S++ G++++ETNFR+YAY+ S L +L LFSKI + PNL+V IT++S+ A G
Sbjct: 324 SNSSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRG 383
Query: 99 ITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVF 158
ITA QI+SFL NAHP + + P +P + DQIRLWE + +R YE+F F
Sbjct: 384 ITANQILSFLTANAHPDMLLQNPILPPTLTDQIRLWELERDRFLFQEGCLYEQFSRNTDF 443
Query: 159 EAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
E DYA+ LLWE+ ++ +VV+ H +R+F +
Sbjct: 444 EMVRDYAKSIGVLLWENPERRLMVVSKAGHEDVRKFWK 481
>gi|391335225|ref|XP_003741996.1| PREDICTED: general transcription factor IIH subunit 4-like
[Metaseiulus occidentalis]
Length = 457
Score = 130 bits (328), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/203 (36%), Positives = 116/203 (57%), Gaps = 5/203 (2%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK--EGFVVVETNFRMYAY 58
++ + GLV Q RK F PT+LA L+ L + A K +G+++VETN+R+YAY
Sbjct: 254 FLQHLREFGLV--YQRKRKSGRFYPTRLAIGLASGLKELQATKDEQGYIIVETNYRVYAY 311
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S L +L LF ++ Y+ PNL+V +T+ES+ A + GIT+ QI FL+ +H V +
Sbjct: 312 TDSPLQVALLSLFCELLYRFPNLLVAVLTRESVRQALKGGITSNQITHFLKSRSHRVVLE 371
Query: 119 RIPS-VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
R +P V DQ+RLWE + +R + + Y +F + FE +YARD L +E+ +
Sbjct: 372 REEGIIPMTVTDQLRLWELERDRFKFGESVLYSQFQTTGDFEMLRNYARDIGVLKYENPQ 431
Query: 178 KMRLVVNAEIHMHMREFLRGQNK 200
K LVV+ +R+F + K
Sbjct: 432 KRFLVVSKSGDAEVRQFWKRHKK 454
>gi|339237219|ref|XP_003380164.1| general transcription factor IIH subunit 4 [Trichinella spiralis]
gi|316977049|gb|EFV60220.1| general transcription factor IIH subunit 4 [Trichinella spiralis]
Length = 674
Score = 130 bits (327), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 75/195 (38%), Positives = 109/195 (55%), Gaps = 2/195 (1%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
I+ ++GLV Q R WF T L + L+ + SS+ KEGF++VETNFR+Y Y+
Sbjct: 251 FIQHLREIGLV--HQRKRSAGWFYYTPLISVLTGLKSSSSSSKEGFLIVETNFRVYCYTD 308
Query: 61 SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
S L I+ F + Y+ PNL+ + +ES+ AF+ I+AEQII +L NAH + +
Sbjct: 309 SVLDLAIVSTFCEPLYRFPNLVACILNRESVRRAFQVNISAEQIIQYLFSNAHKNMQKQT 368
Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
P++P V DQI+LWE + +R + P Y F S + DYA+D LL E
Sbjct: 369 PTIPSTVTDQIKLWEMERDRFKFDPGVMYSNFFSDTDYITIRDYAKDLGVLLCEHEANRA 428
Query: 181 LVVNAEIHMHMREFL 195
LVV+A+ H +FL
Sbjct: 429 LVVSADGHEQSNQFL 443
>gi|301098671|ref|XP_002898428.1| general transcription factor IIH subunit, putative [Phytophthora
infestans T30-4]
gi|262105199|gb|EEY63251.1| general transcription factor IIH subunit, putative [Phytophthora
infestans T30-4]
Length = 483
Score = 130 bits (327), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 119/232 (51%), Gaps = 38/232 (16%)
Query: 1 MIKDFADLGLV--KLQQVGRKESWFIPTKLATNL-----------SMSLTDSSARKEG-- 45
++ DF D GL+ K Q R F T LA NL +SLT S A
Sbjct: 251 LLGDFIDFGLLFRKRQNSDR----FYTTSLAVNLIFGGSTGQKRSHVSLTSSFAGVRAGM 306
Query: 46 -------------------FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAI 86
VVVETNF++YAY++S LH +L +F I +LPNL +G I
Sbjct: 307 KSQVADPRQAPTVDHGARLLVVVETNFKIYAYTSSTLHVAMLSVFVDIVARLPNLAIGFI 366
Query: 87 TKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPA 146
T+ESL +A +GI+A+QI FL ++AHP++ P +PEN+ DQI LWE + NRV+
Sbjct: 367 TRESLRSALIHGISAQQIYDFLMKHAHPKMRRNSPVIPENIADQIYLWERERNRVQFLEG 426
Query: 147 HYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
++ F +++ +E+ DYA+ L W D RL + +R F++ Q
Sbjct: 427 ILFDGFNTKEDYESVRDYAKGLMVLTWSDPIHFRLSIATAGIDEVRHFIQNQ 478
>gi|407927340|gb|EKG20235.1| Transcription factor Tfb2 [Macrophomina phaseolina MS6]
Length = 466
Score = 130 bits (326), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 72/215 (33%), Positives = 121/215 (56%), Gaps = 20/215 (9%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL-------------SMSLTDSSAR-KEGF 46
M+ D D G+V + K + F PT+LAT L S +L +SA +G+
Sbjct: 255 MLDDLNDFGIVY--RSPPKSNRFYPTRLATTLTSDAAALNNSDSFSSALNSTSASGSKGY 312
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
+++ETN+R+YAY++S L +L LF+ +E + PNL+ G +TKES+ A GITA+QIIS
Sbjct: 313 IIIETNYRLYAYTSSPLQIAVLALFTDLETRFPNLVSGKLTKESVQRAIGLGITADQIIS 372
Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
+L AHP++ + P +P V DQIRLW+ + +R+ T +++F D ++ YA
Sbjct: 373 YLDTYAHPQMQRKTPVLPPTVVDQIRLWQIEGDRMRATEGFLFKDFRDWDEYQDLKKYAD 432
Query: 167 DRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQNK 200
L+W D +++ + E + ++++ + K
Sbjct: 433 SIGVLVWSNDQRRLMFLTRVE---QISQYIKNRTK 464
>gi|281208616|gb|EFA82792.1| general transcription factor IIH [Polysphondylium pallidum PN500]
Length = 458
Score = 129 bits (325), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 73/207 (35%), Positives = 120/207 (57%), Gaps = 15/207 (7%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT-----------DSSARKEGFVVV 49
++ D GLV ++ K F PT+L +L+ T ++ +++G++++
Sbjct: 252 LLFDLRQFGLVYIR--SEKSEIFYPTRLIISLTTGKTVTVIKDLAKEMSNTQKEQGYIIL 309
Query: 50 ETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQ 109
ETNFR+YAY+ S L +L LF K+ Y+LPNL VG +T+ES+ AF +GITA+QI+ F++
Sbjct: 310 ETNFRIYAYTASSLQISLLSLFVKMLYRLPNLSVGILTRESIRTAFLHGITADQIVDFIK 369
Query: 110 QNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRS 169
QN HP + PE V +QIR+WE++ NR+ A ++ FP+++ F YA+D S
Sbjct: 370 QNGHPNMLK--VGAPEIVFEQIRIWENERNRILYKKAVLFDSFPTQESFNMTLQYAKDLS 427
Query: 170 GLLWEDSKKMRLVVNAEIHMHMREFLR 196
+W K LVV+ ++ ++R
Sbjct: 428 FYMWASEAKKVLVVSDNGFDAIKNYIR 454
>gi|395533817|ref|XP_003768949.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Sarcophilus harrisii]
Length = 1638
Score = 129 bits (324), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 76/234 (32%), Positives = 123/234 (52%), Gaps = 38/234 (16%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS +T + +A + GF+VVETN+R+YAY+
Sbjct: 296 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVTGAGGTAHQPGFIVVETNYRLYAYT 353
Query: 60 T----------------------------------SKLHCEILRLFSKIEYQLPNLIVGA 85
S+L ++ LFS++ Y+ PN++V
Sbjct: 354 GEVGGIKMRIEIRGGGIGGELNLLWSSIVFSLETESELQIALIALFSEMLYRFPNMVVAQ 413
Query: 86 ITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTP 145
+T+ES+ A NGITA+QII F++ AHP + + P +P + DQIRLWE + +R+ +
Sbjct: 414 VTRESVQQAIANGITAQQIIHFMKTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFSE 473
Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
Y +F S+ FE +AR+ L++E++ K +VV H ++ F + Q
Sbjct: 474 GVLYNQFLSQVDFELLLSHARELGVLVFENTAKRLMVVTPAGHSDVKRFWKRQK 527
>gi|66800623|ref|XP_629237.1| general transcription factor IIH, polypeptide 4 [Dictyostelium
discoideum AX4]
gi|74850716|sp|Q54C29.1|TF2H4_DICDI RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=TFIIH basal transcription factor complex subunit 4
gi|60462610|gb|EAL60813.1| general transcription factor IIH, polypeptide 4 [Dictyostelium
discoideum AX4]
Length = 483
Score = 129 bits (324), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 70/185 (37%), Positives = 114/185 (61%), Gaps = 11/185 (5%)
Query: 23 FIPTKLATNLS----MSLTDS-------SARKEGFVVVETNFRMYAYSTSKLHCEILRLF 71
F PT+L +L+ +SL S + +++G++V+ETN+R+YAY++S L +L LF
Sbjct: 295 FYPTRLIISLTTGKTLSLIQSISSERTQTQKEQGYIVLETNYRLYAYTSSSLQISLLSLF 354
Query: 72 SKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQI 131
K+ Y+LPNL VG IT+ES+ A +GITA+QII F++ N+HP A+ +P+ V +QI
Sbjct: 355 VKMLYRLPNLAVGIITRESIRTALIHGITADQIIDFVRHNSHPNAANSGQPIPDVVAEQI 414
Query: 132 RLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHM 191
LWE++ NR+ T + Y FP+ D + A +A+++ +W LVV E + +
Sbjct: 415 LLWEAERNRITYTKSVLYNSFPTNDCYIATLKFAKEQDYYIWSHDPLKTLVVKEEGNDPI 474
Query: 192 REFLR 196
R F++
Sbjct: 475 RNFIK 479
>gi|326474968|gb|EGD98977.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
Tfb2 [Trichophyton tonsurans CBS 112818]
Length = 489
Score = 129 bits (324), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 109/204 (53%), Gaps = 34/204 (16%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK------------------ 43
+ D D G+V + F PT+LAT L+ +DS A
Sbjct: 257 LADLTDFGIVYQHTPASGSARFYPTRLATTLT---SDSLAMSGPISGEPAVPTTTSGTTT 313
Query: 44 -----EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
GF++VETN R+YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E G
Sbjct: 314 SGDAGTGFIIVETNHRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMG 373
Query: 99 ITAEQIISFLQQNAHPRVADRIPS--------VPENVCDQIRLWESDLNRVEMTPAHYYE 150
ITA+QIIS+L +AHP++ S +P V DQIRLW+ + +R++ T ++
Sbjct: 374 ITADQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFK 433
Query: 151 EFPSRDVFEAACDYARDRSGLLWE 174
EF + FEA C YA + L+W+
Sbjct: 434 EFDTFAEFEAPCKYAEEIGVLVWK 457
>gi|154273196|ref|XP_001537450.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150415962|gb|EDN11306.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 455
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 70/197 (35%), Positives = 111/197 (56%), Gaps = 31/197 (15%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL-----------SMSLT--------DSSAR 42
+ D D G+V ++ + F PT+LAT L S SLT S+
Sbjct: 231 LADLTDFGIVYQHSPHQEATRFYPTRLATTLTSDSITLGSSISSSLTAPNGVPSATSNEP 290
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
GF+++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G +T++S+ A E GITA+
Sbjct: 291 STGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITAD 350
Query: 103 QIISFLQQNAHPRV---------ADRI---PSVPENVCDQIRLWESDLNRVEMTPAHYYE 150
QIIS+L +AHP++ A+ + +P V DQIRLW+ + +R++ TP ++
Sbjct: 351 QIISYLTTHAHPQMRKYHATKPGANPVGVHTVLPPTVVDQIRLWQLERDRIKATPGFLFK 410
Query: 151 EFPSRDVFEAACDYARD 167
+F S FE C YA +
Sbjct: 411 DFVSLAEFEGPCRYAEE 427
>gi|330797174|ref|XP_003286637.1| hypothetical protein DICPUDRAFT_31198 [Dictyostelium purpureum]
gi|325083385|gb|EGC36839.1| hypothetical protein DICPUDRAFT_31198 [Dictyostelium purpureum]
Length = 471
Score = 128 bits (321), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 62/155 (40%), Positives = 99/155 (63%)
Query: 42 RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
++ G++V+ETN+R+YAY+ S L +L LF K+ Y+LPNL VG IT+ES+ A +GITA
Sbjct: 313 KESGYIVLETNYRLYAYTASSLQISLLSLFVKMLYRLPNLAVGIITRESIRTALIHGITA 372
Query: 102 EQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
+QII F++ NAHP + +P+ V +QI LWE + NR+ T + Y FP+ D + A
Sbjct: 373 DQIIDFIRHNAHPNAVNNGQPIPDVVAEQILLWEQERNRITYTKSVLYNSFPTTDCYHAT 432
Query: 162 CDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
+A+++ +W + + LVVN + +R F++
Sbjct: 433 LKFAKEQDYYIWSNDQLKTLVVNENGNDPIRNFIK 467
>gi|50310215|ref|XP_455127.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|62901296|sp|Q6CLR2.1|TFB2_KLULA RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|49644263|emb|CAG97834.1| KLLA0F01056p [Kluyveromyces lactis]
Length = 496
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 77/244 (31%), Positives = 122/244 (50%), Gaps = 50/244 (20%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLAT-----------------------NLSMSLT 37
M++D D GL+ Q + F PT+LAT N + ++
Sbjct: 250 MLQDMRDYGLIYQNQSDYAK--FYPTRLATLLTSDTKAFRSASVALDSVLNKANETTAVE 307
Query: 38 DSSARKE-------GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKES 90
S + E G +++ETNF++Y+YS S L IL LF ++ + N++ G +T+ES
Sbjct: 308 GDSGQDETTERTQDGALIIETNFKLYSYSNSPLQIAILSLFVHLKSRFANMVTGQLTRES 367
Query: 91 LYNAFENGITAEQIISFLQQNAHPR------------------VADRIPSVPENVCDQIR 132
+ NA NGITAEQII++L+ +AHPR V + + +P V DQIR
Sbjct: 368 VRNALLNGITAEQIIAYLETHAHPRMRRLAEENLSKKLELDPTVKETLQVLPPTVVDQIR 427
Query: 133 LWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMR 192
LW+ +L+R+ + Y +F S ++ DYA+D LLW++ KK V+ E + +
Sbjct: 428 LWQLELDRIISYDGYLYTDFESYQEYQTVADYAKDIGVLLWQNEKKKMFFVSTEGNSQVL 487
Query: 193 EFLR 196
+F R
Sbjct: 488 DFHR 491
>gi|440636738|gb|ELR06657.1| hypothetical protein GMDG_00274 [Geomyces destructans 20631-21]
Length = 470
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 57/154 (37%), Positives = 97/154 (62%), Gaps = 2/154 (1%)
Query: 45 GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
GF+V+ETN+R+YAY++S L +L LF+K+ + PN++ G +T++S+ A E+GITA+QI
Sbjct: 316 GFIVIETNYRLYAYTSSPLQIAVLALFAKLSTRYPNMVSGRVTRDSVRGAIEHGITADQI 375
Query: 105 ISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDY 164
IS+L +AHP++ + P +P V DQIRLW+ + R++ T +++F S +E Y
Sbjct: 376 ISYLTTHAHPQLRRKSPVLPPTVVDQIRLWQIENERMKATAGFLFKDFASLAEYEGCVRY 435
Query: 165 ARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
A + L+W K V H +R++++ +
Sbjct: 436 ADEIGVLVWRSDAKRNFFVTK--HEQLRDYIKSR 467
>gi|193633122|ref|XP_001945987.1| PREDICTED: general transcription factor IIH subunit 4-like
[Acyrthosiphon pisum]
Length = 471
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 115/202 (56%), Gaps = 7/202 (3%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAY 58
++ + GLV Q RK F PT+L + + +S R KE ++VVETNFR+YA
Sbjct: 270 LQHLREFGLV--YQRKRKAGRFYPTRLVIEMGQGNSRTSERMKNKERYIVVETNFRIYAM 327
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S L ++ LF+ + Y+ PN+ G +T++S+ A +GITA QI+ FL + HP++ +
Sbjct: 328 TDSDLKVALVALFTHMLYRFPNMSAGILTRDSVRTALRSGITAAQIVRFLTVHTHPQMQE 387
Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
+P+ V DQI LWE++ NR+ T Y + + +E +YA D L+W D ++
Sbjct: 388 --CGMPQTVIDQIYLWENERNRLTYTDGVLYSNINTPNDYETIKNYAADIGALVWCDERR 445
Query: 179 MRLVVNAEIHMHMREFLRGQNK 200
+VV+ + H +R+F + Q K
Sbjct: 446 RNIVVSTDGHDDVRKFWKKQPK 467
>gi|402866404|ref|XP_003897374.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial,
partial [Papio anubis]
Length = 1647
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 77/233 (33%), Positives = 121/233 (51%), Gaps = 37/233 (15%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAY- 58
++ + GLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY
Sbjct: 308 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 365
Query: 59 --------------------------------STSKLHCEILRLFSKIEYQLPNLIVGAI 86
S S+L ++ LFS++ Y+ PN++V +
Sbjct: 366 GEVGQSAPEEKKEGLEGVWVWGWPPHPLSIPGSESELQIALIALFSEMLYRFPNMVVAQV 425
Query: 87 TKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPA 146
T+ES+ A +GITA+QII FL+ AHP + + P +P + DQIRLWE + +R+ T
Sbjct: 426 TRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEG 485
Query: 147 HYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
Y +F S+ FE +AR+ L++E+S K +VV H ++ F + Q
Sbjct: 486 VLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 538
>gi|345569654|gb|EGX52519.1| hypothetical protein AOL_s00043g13 [Arthrobotrys oligospora ATCC
24927]
Length = 452
Score = 125 bits (315), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 72/208 (34%), Positives = 114/208 (54%), Gaps = 22/208 (10%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM---SLTDSSAR---------KEGFVV 48
M++D D G+V ++ + F PT+LAT L+ SL +A + GF++
Sbjct: 248 MLEDLRDYGIVYQRKSSSRR--FYPTRLATGLTSEIRSLRSPAATLANATPGNDENGFLI 305
Query: 49 VETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 108
+ETN+ S L +L LF+++ + PNL+ IT+ES+ A GITAEQII ++
Sbjct: 306 IETNYH------SPLQIAVLNLFARLISRFPNLVTARITRESIRQAIGKGITAEQIIDYM 359
Query: 109 QQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDR 168
NAHP++ P +P V DQIRLW+ + R+++T + ++EF + F A C YA D
Sbjct: 360 TANAHPQLRKNSPVLPPTVVDQIRLWQIEGERMKVTSGYLFKEFATNAEFNAVCKYADDI 419
Query: 169 SGLLWEDSKKMRLVVNAEIHMHMREFLR 196
L+W D K L V+ H + +++R
Sbjct: 420 GVLVWIDKNKGMLFVSK--HEQIADYIR 445
>gi|62088538|dbj|BAD92716.1| VARS2L protein variant [Homo sapiens]
Length = 1653
Score = 125 bits (315), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 77/238 (32%), Positives = 121/238 (50%), Gaps = 42/238 (17%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAY- 58
++ + GLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY
Sbjct: 309 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 366
Query: 59 -------------------------------------STSKLHCEILRLFSKIEYQLPNL 81
S S+L ++ LFS++ Y+ PN+
Sbjct: 367 GEAGQRAPGRGGWGHLKEGLEGVWVWGWPPLPLSIPGSESELQIALIALFSEMLYRFPNM 426
Query: 82 IVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRV 141
+V +T+ES+ A +GITA+QII FL+ AHP + + P +P + DQIRLWE + +R+
Sbjct: 427 VVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRL 486
Query: 142 EMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
T Y +F S+ FE +AR+ L++E+S K +VV H ++ F + Q
Sbjct: 487 RFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 544
>gi|403308524|ref|XP_003944708.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Saimiri boliviensis boliviensis]
Length = 1752
Score = 125 bits (315), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 77/238 (32%), Positives = 121/238 (50%), Gaps = 42/238 (17%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAY- 58
++ + GLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY
Sbjct: 407 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 464
Query: 59 -------------------------------------STSKLHCEILRLFSKIEYQLPNL 81
S S+L ++ LFS++ Y+ PN+
Sbjct: 465 GEVGQRAPGRAGWGRGNARLEGVWVWGWPPHPLSIPGSESELQIALIALFSEMLYRFPNM 524
Query: 82 IVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRV 141
+V +T+ES+ A +GITA+QII FL+ AHP + + P +P + DQIRLWE + +R+
Sbjct: 525 VVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRL 584
Query: 142 EMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
T Y +F S+ FE +AR+ L++E+S K +VV H ++ F + Q
Sbjct: 585 RFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 642
>gi|390461382|ref|XP_002746360.2| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Callithrix jacchus]
Length = 1675
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 77/239 (32%), Positives = 121/239 (50%), Gaps = 43/239 (17%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAY- 58
++ + GLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY
Sbjct: 329 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 386
Query: 59 --------------------------------------STSKLHCEILRLFSKIEYQLPN 80
S S+L ++ LFS++ Y+ PN
Sbjct: 387 GEAGQRAPGRAGWGXHLKERFEGVWVWGWPPHPLSIPGSESELQIALIALFSEMLYRFPN 446
Query: 81 LIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNR 140
++V +T+ES+ A +GITA+QII FL+ AHP + + P +P + DQIRLWE + +R
Sbjct: 447 MVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDR 506
Query: 141 VEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
+ T Y +F S+ FE +AR+ L++E+S K +VV H ++ F + Q
Sbjct: 507 LRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 565
>gi|260945437|ref|XP_002617016.1| hypothetical protein CLUG_02460 [Clavispora lusitaniae ATCC 42720]
gi|238848870|gb|EEQ38334.1| hypothetical protein CLUG_02460 [Clavispora lusitaniae ATCC 42720]
Length = 511
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/257 (29%), Positives = 123/257 (47%), Gaps = 65/257 (25%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS--------------MSLTDSSARKE-- 44
M+ D D GLV Q + F PT+LAT L+ +L D A+ +
Sbjct: 251 MLPDLRDYGLV--YQRSENSTRFFPTRLATTLTSESTGLKTPSMALNQTLEDPGAKDQEH 308
Query: 45 ----------GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 94
G +++ETNF++YAY+ S L IL LF ++ + N+++G IT+ES+ A
Sbjct: 309 VTSEAGESQSGQIILETNFKLYAYTNSPLEIAILNLFVNLKVRFANMVIGQITRESIRKA 368
Query: 95 FENGITAEQIISFLQQNAHPRV-------------------------------------A 117
NGITA+QII FL+ +AH ++
Sbjct: 369 LYNGITADQIIKFLETHAHSQMRTSAKEKLDKKIEFDASHNINTAGGAPQSKSSDSGVAQ 428
Query: 118 DRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
R+ +P NV DQI+LW+ +L+R++ + +++F +++ ++ C+YA + LLW D
Sbjct: 429 HRLEILPPNVVDQIKLWQLELDRIQTFEGYLFKDFKNQNEYDVLCNYATEVGVLLWSDKT 488
Query: 178 KMRLVVNAEIHMHMREF 194
K+R V AE + +F
Sbjct: 489 KLRFFVTAEGMHQVADF 505
>gi|301786953|ref|XP_002928911.1| PREDICTED: LOW QUALITY PROTEIN: valyl-tRNA synthetase,
mitochondrial-like [Ailuropoda melanoleuca]
Length = 1637
Score = 125 bits (313), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 78/242 (32%), Positives = 122/242 (50%), Gaps = 46/242 (19%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLS--MSLTDSSARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS +S +A + GF+VVETN+R+YAY+
Sbjct: 286 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 343
Query: 60 T------------------------------------------SKLHCEILRLFSKIEYQ 77
S+L ++ LFS++ Y+
Sbjct: 344 GEARQRAPNSRXGVGTFLVTLEGLEGVWVWGXPSSSLPIPGPESELQIALIALFSEMLYR 403
Query: 78 LPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESD 137
PN++V +T+ES+ A +GITA+QII FL+ AHP + + P +P + DQIRLWE +
Sbjct: 404 FPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELE 463
Query: 138 LNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRG 197
+R+ T Y +F S+ FE +AR+ L++E+S K +VV H ++ F +
Sbjct: 464 RDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKR 523
Query: 198 QN 199
Q
Sbjct: 524 QK 525
>gi|193654859|ref|XP_001944632.1| PREDICTED: general transcription factor IIH subunit 4-like
[Acyrthosiphon pisum]
Length = 470
Score = 124 bits (310), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 114/202 (56%), Gaps = 7/202 (3%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAY 58
++ + GLV +++ K F PT+L + + +S R KE ++VVETNFR+YA
Sbjct: 269 LQHLQEFGLVYQRKL--KAGRFCPTRLVIEMGRENSHTSKRMKKKERYIVVETNFRIYAM 326
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S L ++ LF+ + Y+ PN+ G +T++S+ A NGITA QI+ FL + HP++ +
Sbjct: 327 TDSDLKVALVALFTHMLYRFPNMSAGILTRDSVQTALRNGITAAQIVRFLTVHTHPQMQE 386
Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
+P+ V DQI LWE + NR+ Y + + + +EA +YA D LLW D ++
Sbjct: 387 --CGMPQTVIDQIYLWEYERNRLTYRDGVLYSDINTPNDYEAIKNYAADIGALLWCDERQ 444
Query: 179 MRLVVNAEIHMHMREFLRGQNK 200
++V+ + H + F + Q K
Sbjct: 445 RNIIVSTDGHNDVTTFWKKQPK 466
>gi|431907067|gb|ELK11185.1| General transcription factor IIH subunit 4 [Pteropus alecto]
Length = 447
Score = 124 bits (310), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 70/200 (35%), Positives = 113/200 (56%), Gaps = 20/200 (10%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+Q
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQ---------------- 364
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 365 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 424
Query: 180 RLVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 425 LMVVTPAGHSDVKRFWKRQK 444
>gi|171681598|ref|XP_001905742.1| hypothetical protein [Podospora anserina S mat+]
gi|170940758|emb|CAP66407.1| unnamed protein product [Podospora anserina S mat+]
Length = 540
Score = 124 bits (310), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 75/233 (32%), Positives = 125/233 (53%), Gaps = 37/233 (15%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL-------SMSLTDSSARKE--------- 44
M+ AD GL+ + + R + ++ PT+LAT L S+S + +A K+
Sbjct: 310 MLPALADFGLIYIDR-DRPQQYY-PTRLATTLTSLSTMRSVSASIDAATKKTPGDAGSLG 367
Query: 45 ------------GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLY 92
G +VVETN+R+YAY++S L IL+LF ++ + PN++ +T+ES+
Sbjct: 368 ADSTPTAPADENGGIVVETNYRIYAYTSSPLQIAILKLFCRLHMRFPNMVTARLTRESVQ 427
Query: 93 NAFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAH 147
A + GITA QII +L +AHP++ A +P V DQIRLW+ + R++ TP
Sbjct: 428 EAIKEGITANQIIDYLVAHAHPQMRRAAAARGTTVIPPTVMDQIRLWQLESQRMQKTPGF 487
Query: 148 YYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+++F S + + +YA + L+W+D +K V+ +REFL+ + K
Sbjct: 488 QFKDFESVEEYRQLAEYATEIGVLVWKDDRKGTFFVSKV--EQIREFLKARKK 538
>gi|324512881|gb|ADY45318.1| General transcription factor IIH subunit 4 [Ascaris suum]
Length = 481
Score = 123 bits (309), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 73/194 (37%), Positives = 112/194 (57%), Gaps = 14/194 (7%)
Query: 7 DLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE-----GFVVVETNFRMYAYSTS 61
+LGLV +++ RK+ +F T L +L++S + +A E G+++VETN+R+YAY+ S
Sbjct: 276 ELGLVFIRK--RKDGYFFITPLFAHLTVSSSSETAVAEKRTHSGYIIVETNYRVYAYTDS 333
Query: 62 KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV---AD 118
L IL F+++ Y+ ++ VG +T++S+ A + GITA QIISFL+ NAHP A+
Sbjct: 334 SLQLAILSTFTEMLYRFNDMSVGMLTRDSVRRALQVGITASQIISFLRANAHPETVAAAN 393
Query: 119 RIPS----VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWE 174
P VP V DQIRLWE + +R+ + Y F S + +Y R + LLW
Sbjct: 394 ASPGVIHCVPVTVADQIRLWEDERHRLVFFDSALYSTFESEREYVGVKEYTRSQDILLWF 453
Query: 175 DSKKMRLVVNAEIH 188
D + +VV E H
Sbjct: 454 DDVQRLVVVTEEGH 467
>gi|298705308|emb|CBJ48998.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 534
Score = 123 bits (309), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 71/200 (35%), Positives = 106/200 (53%), Gaps = 29/200 (14%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--------------------- 39
+ +DF +LGL+ ++ K + F PT +A NL + S
Sbjct: 280 LAQDFVELGLLFKRKA--KSTRFYPTSIAVNLIFGSSPSGDAGGAGGGGTQRKPQPAGGL 337
Query: 40 -SARKEG----FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 94
++E ++VETNF++ AY+ SKLH +L LF ++ LPN IVGAIT+ES+ A
Sbjct: 338 GGGKREDDTSIHIIVETNFQVIAYTRSKLHFAMLSLFLELRALLPNAIVGAITRESMRKA 397
Query: 95 FENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYE-EFP 153
GI Q++ FL+ +AHP V R P VPEN+ DQ+ LWE + +R+E + +
Sbjct: 398 LSTGIKGRQVLDFLKWHAHPVVRRRTPVVPENIADQVLLWERERDRMEHRDGVLVDVSYA 457
Query: 154 SRDVFEAACDYARDRSGLLW 173
SRD F ++A + GLLW
Sbjct: 458 SRDAFRGMTEFANAKQGLLW 477
>gi|354546081|emb|CCE42810.1| hypothetical protein CPAR2_204530 [Candida parapsilosis]
Length = 451
Score = 123 bits (308), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 77/259 (29%), Positives = 127/259 (49%), Gaps = 67/259 (25%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK-------------EG-- 45
M+ D DLGL+ Q + S F PT+LAT L+ +DSSA K EG
Sbjct: 198 MLADLKDLGLI--YQKSKTSSRFYPTRLATTLT---SDSSALKTPSMAVEQALEESEGQM 252
Query: 46 -------FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
+++ETNF++YAY+ S L IL LF +++ + N++ G IT+ES+ NA NG
Sbjct: 253 MTSTSRESIIIETNFKIYAYTNSPLEIAILNLFVQMKTRFSNMVCGQITRESIRNALYNG 312
Query: 99 ITAEQIISFLQQNAHPRVA------------------------------------DRIPS 122
IT++QII FL+ +AHP++ ++
Sbjct: 313 ITSDQIIKFLETHAHPQMKALAKEKLDKKVEFDASHNINTAGGAPQSKTDGTISQHKLEV 372
Query: 123 VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLV 182
+P NV DQI+LW+ +L+R++ + +++F ++ ++ +YA + L+W D K +
Sbjct: 373 IPPNVVDQIKLWQLELDRIQTVEGYLFKDFANQQEYDMLSNYATELGVLVWGDKAKRKFF 432
Query: 183 VNAEIHMHMREF----LRG 197
V + + +F LRG
Sbjct: 433 VTKDGMAQVADFANRKLRG 451
>gi|397471696|ref|XP_003807420.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Pan paniscus]
Length = 1673
Score = 123 bits (308), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 77/245 (31%), Positives = 121/245 (49%), Gaps = 49/245 (20%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAY- 58
++ + GLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY
Sbjct: 322 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 379
Query: 59 --------------------------------------------STSKLHCEILRLFSKI 74
S S+L ++ LFS++
Sbjct: 380 GEAGQRAPGRGGWGALXHFSXHLKEGLEGVWVWGCPPHPLSIPGSESELQIALIALFSEM 439
Query: 75 EYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLW 134
Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + + P +P + DQIRLW
Sbjct: 440 LYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLW 499
Query: 135 ESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREF 194
E + +R+ T Y +F S+ FE +AR+ L++E+S K +VV H ++ F
Sbjct: 500 ELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRF 559
Query: 195 LRGQN 199
+ Q
Sbjct: 560 WKRQK 564
>gi|313224499|emb|CBY20289.1| unnamed protein product [Oikopleura dioica]
Length = 411
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 121/199 (60%), Gaps = 5/199 (2%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYA 57
+I+ F ++GL+ ++ KE F PT+LA ++S++ ++ +E F++VETN+R+YA
Sbjct: 208 VIQHFREMGLIFKRK--SKEQRFYPTRLAQSISIAGGKKASSEDVQEQFILVETNYRIYA 265
Query: 58 YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
Y+ S+LH ++ LF++++Y+ P +IV ++++S+ + + GI+AEQI+++L+ +AHP
Sbjct: 266 YTDSELHYALISLFAEVQYRFPYMIVAQMSRDSIQQSADYGISAEQILNYLRSSAHPIAR 325
Query: 118 DRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
VP+ V D I LW + R++ Y +F ++ FE YA+D L+W + +
Sbjct: 326 KNKHWVPQVVEDNIHLWCKERERLKFNDGLLYHQFLDQEAFEMLKSYAQDIRALVWANDE 385
Query: 178 KMRLVVNAEIHMHMREFLR 196
+ +VV H ++ + +
Sbjct: 386 RRFMVVAPWSHDQIKSYYK 404
>gi|313245784|emb|CBY34779.1| unnamed protein product [Oikopleura dioica]
Length = 456
Score = 121 bits (304), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 120/199 (60%), Gaps = 5/199 (2%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS---LTDSSARKEGFVVVETNFRMYA 57
+I+ F ++GL+ ++ KE F PT+LA ++S++ S +E F++VETN+R+YA
Sbjct: 253 VIQHFREMGLIFKRK--SKEQRFYPTRLAQSISIAGGKKASSEDVQEQFILVETNYRIYA 310
Query: 58 YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
Y+ S+LH ++ LF++++Y+ P +IV ++++S+ + + GI+AEQI+++L+ +AHP
Sbjct: 311 YTDSELHYALISLFAEVQYRFPYMIVAQMSRDSIQQSADYGISAEQILNYLRSSAHPIAR 370
Query: 118 DRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
VP+ V D I LW + R++ Y +F ++ FE YA+D L+W + +
Sbjct: 371 KNKHWVPQVVEDNIHLWCKERERLKFNDGLLYHQFLDQEAFEMLKSYAQDIRALVWANDE 430
Query: 178 KMRLVVNAEIHMHMREFLR 196
+ +VV H ++ + +
Sbjct: 431 RRFMVVAPWSHDQIKSYYK 449
>gi|393910939|gb|EJD76097.1| transcription factor Tfb2 family protein, variant [Loa loa]
Length = 389
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/210 (35%), Positives = 114/210 (54%), Gaps = 18/210 (8%)
Query: 3 KDFA----DLGLVKLQQVGRKESWFIPTKLATNL-SMSLTDSSA----RKEGFVVVETNF 53
KDF +LGLV +++ RK+ +F T L +L S+S + + GFV+VETN+
Sbjct: 174 KDFLMHLRELGLVFIRK--RKDGFFFLTPLLNHLTSISYASETGIENRNQNGFVIVETNY 231
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY+ S L IL F+++ Y+ ++ VG +++E++ AF+ GITA QII+FL+ NAH
Sbjct: 232 RVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQIIAFLRANAH 291
Query: 114 PRV-------ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
P I SVP V DQIRLWE + R+ A Y F S + Y
Sbjct: 292 PTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFESEKEYFGLKGYVS 351
Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
+ LLW D + +V+ E H ++ + +
Sbjct: 352 SQGILLWCDDVQRLMVITEEGHESVKSWWK 381
>gi|366997767|ref|XP_003683620.1| hypothetical protein TPHA_0A01010 [Tetrapisispora phaffii CBS 4417]
gi|357521915|emb|CCE61186.1| hypothetical protein TPHA_0A01010 [Tetrapisispora phaffii CBS 4417]
Length = 504
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/252 (29%), Positives = 120/252 (47%), Gaps = 61/252 (24%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE---------------- 44
M+KD D GL+ Q + F PT LAT L+ +DSS +
Sbjct: 251 MLKDMRDYGLIF--QKTSNATVFYPTTLATQLT---SDSSTVRSASGAIDSLLSQTKEAN 305
Query: 45 ----------------------GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLI 82
G +VVETNF++Y+YS S L IL LF + + N++
Sbjct: 306 NNDSQNLGTDTDDLVGNQSVEVGALVVETNFKLYSYSNSPLQIAILSLFVHLRTRFSNMV 365
Query: 83 VGAITKESLYNAFENGITAEQIISFLQQNAHPRV------------------ADRIPSVP 124
G IT+ES+ NA NG+TA+QII++LQ +AHP++ D++ +P
Sbjct: 366 TGQITRESIRNALANGVTADQIIAYLQSHAHPQMRRLAETTLKKKLELDPNTKDQLQILP 425
Query: 125 ENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVN 184
V DQI+LW+ +L+R+ Y +F + + +YA+D LLW++ +K + V+
Sbjct: 426 PTVVDQIKLWQLELDRILSYDGSLYSDFDNNQEYNMLYNYAKDIGVLLWKNDRKRKFFVS 485
Query: 185 AEIHMHMREFLR 196
E + + +F +
Sbjct: 486 KEGNSQVLDFAK 497
>gi|393910938|gb|EFO26137.2| transcription factor Tfb2 family protein [Loa loa]
Length = 486
Score = 120 bits (302), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/210 (35%), Positives = 114/210 (54%), Gaps = 18/210 (8%)
Query: 3 KDFA----DLGLVKLQQVGRKESWFIPTKLATNL-SMSLTDSSA----RKEGFVVVETNF 53
KDF +LGLV +++ RK+ +F T L +L S+S + + GFV+VETN+
Sbjct: 271 KDFLMHLRELGLVFIRK--RKDGFFFLTPLLNHLTSISYASETGIENRNQNGFVIVETNY 328
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY+ S L IL F+++ Y+ ++ VG +++E++ AF+ GITA QII+FL+ NAH
Sbjct: 329 RVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQIIAFLRANAH 388
Query: 114 PRV-------ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
P I SVP V DQIRLWE + R+ A Y F S + Y
Sbjct: 389 PTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFESEKEYFGLKGYVS 448
Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
+ LLW D + +V+ E H ++ + +
Sbjct: 449 SQGILLWCDDVQRLMVITEEGHESVKSWWK 478
>gi|312069992|ref|XP_003137940.1| transcription factor Tfb2 family protein [Loa loa]
Length = 470
Score = 120 bits (302), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/210 (35%), Positives = 114/210 (54%), Gaps = 18/210 (8%)
Query: 3 KDFA----DLGLVKLQQVGRKESWFIPTKLATNL-SMSLTDSSA----RKEGFVVVETNF 53
KDF +LGLV +++ RK+ +F T L +L S+S + + GFV+VETN+
Sbjct: 255 KDFLMHLRELGLVFIRK--RKDGFFFLTPLLNHLTSISYASETGIENRNQNGFVIVETNY 312
Query: 54 RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
R+YAY+ S L IL F+++ Y+ ++ VG +++E++ AF+ GITA QII+FL+ NAH
Sbjct: 313 RVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQIIAFLRANAH 372
Query: 114 PRV-------ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
P I SVP V DQIRLWE + R+ A Y F S + Y
Sbjct: 373 PTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFESEKEYFGLKGYVS 432
Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
+ LLW D + +V+ E H ++ + +
Sbjct: 433 SQGILLWCDDVQRLMVITEEGHESVKSWWK 462
>gi|46127843|ref|XP_388475.1| hypothetical protein FG08299.1 [Gibberella zeae PH-1]
gi|408390907|gb|EKJ70292.1| hypothetical protein FPSE_09509 [Fusarium pseudograminearum CS3096]
Length = 494
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 72/235 (30%), Positives = 117/235 (49%), Gaps = 39/235 (16%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------------------------- 34
M+ D GL+ + K S F PT+LAT L+
Sbjct: 262 MLPSLVDFGLIYIPN--HKRSMFFPTRLATTLTSSSNSLRSISDGVAAATAAALQPGQSR 319
Query: 35 ----SLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKES 90
S T SS + G V++ETN+R+YAY+ S L +L LF+K++ + P+++ G I+++S
Sbjct: 320 APGSSATGSSNEQRGSVIIETNYRIYAYTQSTLQIAVLALFTKLQMRFPDMVAGRISRQS 379
Query: 91 LYNAFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
+ A GITAEQIIS+L +AH ++ + P +P V DQIRLW+ + R++ T
Sbjct: 380 IRQAINFGITAEQIISYLSAHAHDQMRRTASLNNKPVLPPTVVDQIRLWQLENERMKTTG 439
Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+++F + A +A + L+W + K A H +R++LR + K
Sbjct: 440 GFLFKDFEDHKEYMAVAGFAEEVGVLVWRNDVKGMFF--ASKHEQIRDYLRIRKK 492
>gi|378729002|gb|EHY55461.1| transcription initiation factor TFIIH subunit H4 [Exophiala
dermatitidis NIH/UT8656]
Length = 508
Score = 120 bits (301), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 59/153 (38%), Positives = 92/153 (60%), Gaps = 14/153 (9%)
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
F++VETN+R+YAY++S L +L LF + + PNL+ G ++K S+ A + GITA+QII
Sbjct: 338 FIIVETNYRLYAYTSSPLQIALLSLFVNLRSRHPNLVTGKMSKSSVQRAIQAGITADQII 397
Query: 106 SFLQQNAHP-----------RVADR---IPSVPENVCDQIRLWESDLNRVEMTPAHYYEE 151
S+L +AHP R AD +P +P + DQI LW+ + +R+ TP ++
Sbjct: 398 SYLTSHAHPQMRRHAQAEQARNADPNRVVPILPATILDQIHLWQLERDRMTTTPGFLLKD 457
Query: 152 FPSRDVFEAACDYARDRSGLLWEDSKKMRLVVN 184
FP++ +EA C YA + L+W++ KK VN
Sbjct: 458 FPNQADYEAPCRYADEIGVLVWKNDKKRMFFVN 490
>gi|296422658|ref|XP_002840876.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637102|emb|CAZ85067.1| unnamed protein product [Tuber melanosporum]
Length = 460
Score = 120 bits (300), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 102/188 (54%), Gaps = 16/188 (8%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS--------------MSLTDSSARKEGF 46
M+ D D G+V Q F PT+LAT L+ ++ + +GF
Sbjct: 242 MLADLRDYGIV--YQRKSSSDRFYPTRLATTLTSESGGLRSASASMSSAMAKDAEEGKGF 299
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
+++ETN+R+YAY+ S L +L LF K+ + PNL+ G I++ S+ A + GITA+Q+I
Sbjct: 300 IILETNYRVYAYTDSPLQIAVLNLFVKLSTRYPNLVSGRISRRSIQEAIKMGITADQVID 359
Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
+L +AHP++ + ++P V DQIRLW+ + R+ T +++F S FE YA
Sbjct: 360 YLSAHAHPQMRKSLVTLPPTVVDQIRLWQIEGERMRTTTGFLFKDFHSTQEFEEVAKYAE 419
Query: 167 DRSGLLWE 174
+ L W+
Sbjct: 420 ELGVLKWK 427
>gi|344229873|gb|EGV61758.1| transcription factor Tfb2 [Candida tenuis ATCC 10573]
Length = 498
Score = 119 bits (299), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 75/237 (31%), Positives = 117/237 (49%), Gaps = 54/237 (22%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSL---------TDSSA 41
M+ D D GLV Q + F PT+LAT L SM++ TDS
Sbjct: 252 MLSDLRDFGLV--YQRSATSNKFYPTRLATTLTSDSNSLQTPSMAIDKANSGIDSTDSKQ 309
Query: 42 RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
R+E +++ETNF++Y+Y+ S L IL LF + + N++ G IT+ES+ NA NGITA
Sbjct: 310 RQES-IIIETNFKVYSYTQSPLEIAILNLFVHLRTRFANMVTGQITRESIRNALYNGITA 368
Query: 102 EQIISFLQQNAHPRVA--------------------------------DRIPSVPENVCD 129
Q+I FL+ +AHP++ ++ +P NV D
Sbjct: 369 GQVIKFLETHAHPQMKMLAQEKLDKKIEFDASNNINTATGKSTDAPMQHKLEILPPNVVD 428
Query: 130 QIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAE 186
QI+LW+ +L+R++ + + EF ++ +E+ +YA + L+W D KK V E
Sbjct: 429 QIKLWQLELDRIQTFDGYLFREFNNQIDYESLRNYASEIGVLVWSDDKKRTCFVTKE 485
>gi|170589001|ref|XP_001899262.1| Transcription factor Tfb2 family protein [Brugia malayi]
gi|158593475|gb|EDP32070.1| Transcription factor Tfb2 family protein [Brugia malayi]
Length = 465
Score = 119 bits (299), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 113/207 (54%), Gaps = 18/207 (8%)
Query: 2 IKDFA----DLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE-----GFVVVETN 52
IKDF +LGLV +++ RK+ +F T L +L+ S A E GFV+VETN
Sbjct: 249 IKDFLMHLRELGLVFIRK--RKDGFFFLTPLLNHLTGISNTSEAGMENRNQNGFVIVETN 306
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ S L IL F+++ Y+ ++ VG +++E++ AF+ GITA QII+FL +NA
Sbjct: 307 YRVYAYTDSNLQLAILSTFTEMLYRFSDMSVGVLSREAVRRAFQVGITAAQIIAFLSRNA 366
Query: 113 HP-------RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYA 165
HP I SVP V DQI+LWE + R+ A Y F S + Y
Sbjct: 367 HPITCAVTTEYGRMIQSVPITVVDQIQLWEDERRRLTFCSAAVYSSFESEKEYFGLKGYV 426
Query: 166 RDRSGLLWEDSKKMRLVVNAEIHMHMR 192
++ LLW + +V+ E H +++
Sbjct: 427 ISQNILLWCHDVQRLMVITEEGHENVK 453
>gi|344303892|gb|EGW34141.1| RNA polymerase II transcription factor B subunit 2 [Spathaspora
passalidarum NRRL Y-27907]
Length = 519
Score = 119 bits (299), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 78/263 (29%), Positives = 122/263 (46%), Gaps = 74/263 (28%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK----------------- 43
M+ D D GLV + S F PT+LAT L+ +DSSA K
Sbjct: 252 MLADLKDYGLVYRRT--DTSSRFYPTRLATTLT---SDSSALKTPAMAVEQALDSVEGGT 306
Query: 44 -------------EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKES 90
+G V++ETNF++YAY+ S L IL LF + + N++ G IT+ES
Sbjct: 307 DNEVSQTAPPGNIQGTVIIETNFKLYAYTNSPLEIAILNLFVHLRTRFANMVCGQITRES 366
Query: 91 LYNAFENGITAEQIISFLQQNAHPRV---------------------------------A 117
+ NA NGITA+QII FL+ +AHP++ A
Sbjct: 367 IRNALYNGITADQIIRFLETHAHPQMKVLAKEKLDKKIEFDTSHNINTAGGAPQSQALRA 426
Query: 118 D------RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGL 171
D ++ +P NV DQI+LW+ +L+R++ + +++FP++ F+ YA + L
Sbjct: 427 DGSVAQHKLEILPPNVVDQIKLWQLELDRIQTFDGYLFKDFPNQQEFDKLSSYASELGVL 486
Query: 172 LWEDSKKMRLVVNAEIHMHMREF 194
+W D K + V + + +F
Sbjct: 487 IWSDKIKRKFFVTKDGMSQVADF 509
>gi|50291141|ref|XP_448003.1| hypothetical protein [Candida glabrata CBS 138]
gi|62901305|sp|Q6FP41.1|TFB2_CANGA RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|49527314|emb|CAG60954.1| unnamed protein product [Candida glabrata]
Length = 504
Score = 119 bits (298), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 100/176 (56%), Gaps = 18/176 (10%)
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
++G +++ETNF++Y+Y S L IL LF ++ + N++ G IT+ES+ A NGITA+
Sbjct: 325 QDGALIIETNFKLYSYCNSPLQIAILSLFVHLKSRFANMVAGQITRESIRRALINGITAD 384
Query: 103 QIISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMT 144
Q+I++L+ +AHP++ D + +P V DQI+LW+ +L+RV
Sbjct: 385 QVIAYLESHAHPQMRRLAEEKLQKKLELDPNCKDPLQVLPPTVVDQIKLWQLELDRVLTY 444
Query: 145 PAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
Y +F + F C YA+D LLW+D +K +L V+ E + + E+ + + K
Sbjct: 445 EGSLYIDFDTAQDFNMLCKYAQDIGALLWKDDRKRKLFVSREGNAQVLEYAKRKIK 500
>gi|448527133|ref|XP_003869438.1| TFIIH and nucleotide excision repair factor 3 complex subunit
[Candida orthopsilosis Co 90-125]
gi|380353791|emb|CCG23303.1| TFIIH and nucleotide excision repair factor 3 complex subunit
[Candida orthopsilosis]
Length = 500
Score = 119 bits (298), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 75/259 (28%), Positives = 124/259 (47%), Gaps = 67/259 (25%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGF-------------- 46
M+ D DLGLV Q + F PT+LAT L+ +DSSA K
Sbjct: 247 MLADLKDLGLV--YQKSDTSNKFYPTRLATTLT---SDSSALKTPSMAVQQALEENEEQM 301
Query: 47 --------VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
+++ETNF++YAY+ S L IL LF +++ + N++ G IT+ES+ NA NG
Sbjct: 302 MASNSRESIIIETNFKIYAYTNSPLEIAILNLFVQMKTRFSNMVCGQITRESIRNALYNG 361
Query: 99 ITAEQIISFLQQNAHPRV------------------------------------ADRIPS 122
IT++QII FL+ +AHP++ ++
Sbjct: 362 ITSDQIIKFLETHAHPQMRALAKEKLDKKVEFDASHNINTAGGAPQSKTDGAISQHKLEV 421
Query: 123 VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLV 182
+P NV DQI+LW+ +L+R++ + +++F ++ ++ +YA + L+W D K +
Sbjct: 422 IPPNVVDQIKLWQLELDRIQTVEGYLFKDFANQQEYDTLSNYATELGVLVWGDKVKRKFF 481
Query: 183 VNAEIHMHMREF----LRG 197
V + + +F LRG
Sbjct: 482 VTKDGMAQVADFANRKLRG 500
>gi|156839389|ref|XP_001643386.1| hypothetical protein Kpol_479p16 [Vanderwaltozyma polyspora DSM
70294]
gi|156113994|gb|EDO15528.1| hypothetical protein Kpol_479p16 [Vanderwaltozyma polyspora DSM
70294]
Length = 504
Score = 119 bits (298), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 74/250 (29%), Positives = 122/250 (48%), Gaps = 56/250 (22%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM------------------------SL 36
M+KD D GL+ Q + F PT+LAT L+
Sbjct: 251 MMKDMRDYGLIF--QKNSNTNVFYPTRLATMLTSDSRTVRNASSAMDSVLTQSKDEPSGA 308
Query: 37 TDSSAR------------KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG 84
T SA ++G +VVETNF++Y+YS S L IL LF ++ + N++ G
Sbjct: 309 TSGSADVDEQVGTQGQNIQDGALVVETNFKLYSYSNSPLQIAILSLFVHLKTRFSNMVTG 368
Query: 85 AITKESLYNAFENGITAEQIISFLQQNAHPRV------------------ADRIPSVPEN 126
IT+ES+ NA NGITAEQII++L+ +AHP++ + + +P
Sbjct: 369 QITRESIRNALSNGITAEQIIAYLETHAHPQMRRLAEERLKKKLELDQNSKETLQILPPT 428
Query: 127 VCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAE 186
V DQI+LW+ +L+R+ Y +F ++ ++ YA+D L+W+D +K + V+ E
Sbjct: 429 VVDQIKLWQLELDRIISYEGSLYSDFDNKQEYDMLYSYAQDIGVLIWKDDRKRKFFVSKE 488
Query: 187 IHMHMREFLR 196
+ + ++ +
Sbjct: 489 SNSQVLDYAK 498
>gi|149238950|ref|XP_001525351.1| TFIIH basal transcription factor complex subunit 2 [Lodderomyces
elongisporus NRRL YB-4239]
gi|146450844|gb|EDK45100.1| TFIIH basal transcription factor complex subunit 2 [Lodderomyces
elongisporus NRRL YB-4239]
Length = 503
Score = 119 bits (297), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 73/232 (31%), Positives = 112/232 (48%), Gaps = 60/232 (25%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS---------------------MSLTDS 39
M+ D D GLV Q S F PT+LAT L+ SL S
Sbjct: 250 MLADLKDYGLV--YQKSDTSSRFYPTRLATTLTSDSAALKTPSMAVEQALDETETSLASS 307
Query: 40 SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGI 99
+ KE +++ETNF+MYAY+ S L IL LF ++ + N+I G IT+ES+ NA NGI
Sbjct: 308 TQTKES-IIIETNFKMYAYTNSPLEIAILNLFVHMKTRFSNMICGQITRESIRNALYNGI 366
Query: 100 TAEQIISFLQQNAHPRV------------------------------------ADRIPSV 123
TA+QII FL+ +AHP++ R+ +
Sbjct: 367 TADQIIKFLETHAHPQMRALAKERLDKKVEFDASHNINTAGGAPQSKTDGAISQHRLEVL 426
Query: 124 PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWED 175
P NV DQI+LW+ +L+R++ + +++F ++ ++ +YA + L+W D
Sbjct: 427 PPNVVDQIKLWQLELDRIQTFDGYLFKDFANQQEYDMLSNYAAEIGVLIWAD 478
>gi|358333989|dbj|GAA34589.2| transcription initiation factor TFIIH subunit 4 [Clonorchis
sinensis]
Length = 456
Score = 119 bits (297), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 82/134 (61%)
Query: 33 SMSLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLY 92
S+ T SS G++++ETNFR+YAY+ S L +L LFSKI + PNL+V IT++S+
Sbjct: 288 SVQPTASSTSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVR 347
Query: 93 NAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
A GITA+QIISFL NAHP + P +P + DQIRLWE + +R YE+F
Sbjct: 348 EALIRGITADQIISFLTTNAHPDMLREPPILPPTLVDQIRLWELERDRFVFQEGCLYEQF 407
Query: 153 PSRDVFEAACDYAR 166
FE D+A+
Sbjct: 408 SKSADFEMVRDFAK 421
>gi|452839401|gb|EME41340.1| hypothetical protein DOTSEDRAFT_156149 [Dothistroma septosporum
NZE10]
Length = 497
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 72/207 (34%), Positives = 110/207 (53%), Gaps = 33/207 (15%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------------------SMSLTD 38
M+ D AD GL+ + + PT+LAT L S L
Sbjct: 258 MLDDLADFGLIYRRTPDSHR--YYPTRLATTLTSDAPALTNNSLMKTTVMTGDPSTDLAA 315
Query: 39 SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
S++ ++GF+++ETN+R+YAY++S L IL LF+ + + PNLI ITK S NA G
Sbjct: 316 SASDEKGFIILETNYRLYAYTSSPLLISILSLFASLNTRYPNLITAKITKASTQNAIAAG 375
Query: 99 ITAEQIISFLQQNAHP-----RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFP 153
IT++QII +L +AHP A+ P +P V DQI+LW+ + R+E T + +
Sbjct: 376 ITSDQIIDYLTTHAHPVLRRQAAANDAPILPPTVVDQIKLWQMEGERMEATKGYLIRDIG 435
Query: 154 SRDVFEAACDYARDRSGLLWEDSKKMR 180
S++ ++ A +YA D G+L SK+ R
Sbjct: 436 SQEEYDKAVNYA-DAIGVL---SKEFR 458
>gi|150863860|ref|XP_001382481.2| RNA polymerase II transcription factor B subunit 2 (RNA polymerase
II transcription factor B p52 subunit) (RNA polymerase
II transcription factor B 52 kDa subunit)
[Scheffersomyces stipitis CBS 6054]
gi|149385116|gb|ABN64452.2| RNA polymerase II transcription factor B subunit 2 (RNA polymerase
II transcription factor B p52 subunit) (RNA polymerase
II transcription factor B 52 kDa subunit)
[Scheffersomyces stipitis CBS 6054]
Length = 515
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 77/258 (29%), Positives = 121/258 (46%), Gaps = 68/258 (26%)
Query: 1 MIKDFADLGLV--KLQQVGRKESWFIPTKLATNL----------SMSL------------ 36
M+ D D GLV + GR F PT+LAT L SM++
Sbjct: 252 MLADLRDYGLVYQRTDSSGR----FYPTRLATTLTSDSAALKTPSMAIEQALGASDGGEE 307
Query: 37 --TDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 94
D+ +G V++ETNF++YAY+ S L IL LF ++ + N++ G IT+ES+ +A
Sbjct: 308 QQIDTPTGNQGTVIIETNFKLYAYTNSPLEIAILNLFVHLKSRFTNMVCGQITRESIRSA 367
Query: 95 FENGITAEQIISFLQQNAHPRV----------------------ADRIPS---------- 122
NGITA+QII FL+ +AHP++ A P
Sbjct: 368 LYNGITADQIIKFLETHAHPQMKLLAKEKLDKKIEFDTSHNINTAGGAPQSKVSGEGNVA 427
Query: 123 ------VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
+P V DQI+LW+ +L+R++ + +++F ++ F+ +YA + L+W D
Sbjct: 428 QHKLEILPPTVVDQIKLWQLELDRIQTFDGYLFKDFSNQQEFDTLSNYASEVGVLVWSDK 487
Query: 177 KKMRLVVNAEIHMHMREF 194
K + V AE + EF
Sbjct: 488 IKKKFFVTAEGMTQVAEF 505
>gi|365986883|ref|XP_003670273.1| hypothetical protein NDAI_0E02130 [Naumovozyma dairenensis CBS 421]
gi|343769043|emb|CCD25030.1| hypothetical protein NDAI_0E02130 [Naumovozyma dairenensis CBS 421]
Length = 522
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 60/171 (35%), Positives = 97/171 (56%), Gaps = 18/171 (10%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
+G ++VETNF++Y+YS S L IL LF ++++ N++ G IT+ES+ A NGITAEQ
Sbjct: 343 DGSLIVETNFKLYSYSNSPLQIAILSLFVHLKFRFVNMVTGQITRESIRRALINGITAEQ 402
Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
II++L+ +AHP++ D + +P V DQIRLW+ +L+RV
Sbjct: 403 IIAYLETHAHPQMRRLAEEKLEKKLELDANCKDSLQILPPTVVDQIRLWQLELDRVIAYE 462
Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
Y +F + + YA+D LLW+D KK + V+ E + + ++ +
Sbjct: 463 GSLYSDFENNTEYTTLYKYAQDIGVLLWKDDKKRKFFVSKEGNSQVLDYAK 513
>gi|45187599|ref|NP_983822.1| ADL274Wp [Ashbya gossypii ATCC 10895]
gi|62901343|sp|Q75B51.1|TFB2_ASHGO RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|44982337|gb|AAS51646.1| ADL274Wp [Ashbya gossypii ATCC 10895]
gi|374107034|gb|AEY95942.1| FADL274Wp [Ashbya gossypii FDAG1]
Length = 514
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 81/254 (31%), Positives = 123/254 (48%), Gaps = 62/254 (24%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS---MSLTDSSA--------RKEGF--- 46
M++D D GLV +Q S F PT+LAT L+ +S+ +S R EG
Sbjct: 260 MLQDLRDYGLVFQKQ--SNLSKFYPTRLATMLTSDVVSIRSASGAVNSVLRQRAEGVDGK 317
Query: 47 -----------------------VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIV 83
++VETNF++Y+YS S L IL LF ++ + N++
Sbjct: 318 VLNGTALGDDDLQAGGEGALDGALIVETNFKLYSYSNSPLQIAILSLFIHLKTRFQNMVT 377
Query: 84 GAITKESLYNAFENGITAEQIISFLQQNAHP---RVA---------------DRIPSVPE 125
G IT+ES+ A NGITA+QII++++ +AHP R+A D + +P
Sbjct: 378 GQITRESIRRALHNGITADQIIAYMETHAHPQMRRLAGDNLEKKLELDPNCRDTLQVLPP 437
Query: 126 NVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNA 185
V DQI+LW+ +L+R+ + + +F + ++ YARD LLW D KK V+
Sbjct: 438 TVVDQIKLWQLELDRIISYDGYLFRDFDNLQEYQVLAQYARDIGVLLWSDDKKKMFFVSK 497
Query: 186 E-----IHMHMREF 194
E I H R+F
Sbjct: 498 EGNAQVIDFHKRKF 511
>gi|452977455|gb|EME77221.1| hypothetical protein MYCFIDRAFT_146475 [Pseudocercospora fijiensis
CIRAD86]
Length = 476
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/203 (35%), Positives = 110/203 (54%), Gaps = 33/203 (16%)
Query: 1 MIKDFADLGLV-KLQQVGRKESWFIPTKLATNLSMSLTD--------------------- 38
M++D +DLGL+ +Q R + PT+LAT L+
Sbjct: 241 MLEDLSDLGLIFHPEQSDR----YYPTRLATTLTSDAPALLNSSHTSTTTTVSASSNDDL 296
Query: 39 -SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 97
+SA ++GF+++ETN+R+YAY+ S L IL LF+ + + PNL+ ITK S+ A +
Sbjct: 297 AASANEKGFIILETNYRLYAYTNSPLLISILSLFASLHTRYPNLVTAKITKTSVQAAISS 356
Query: 98 GITAEQIISFLQQNAHP---RVAD--RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
GIT+ QIIS+LQ +AHP R A P +P V DQIRLW+ + R+ T + E
Sbjct: 357 GITSNQIISYLQTHAHPILRRTASMHNAPILPPTVVDQIRLWQIEGERMTSTKGYLIREV 416
Query: 153 PSRDVFEAACDYARDRSGLLWED 175
S++ +E A YA + G+L +D
Sbjct: 417 GSKEDYEKAVQYA-EALGILVKD 438
>gi|403217068|emb|CCK71563.1| hypothetical protein KNAG_0H01490 [Kazachstania naganishii CBS
8797]
Length = 519
Score = 117 bits (293), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 99/178 (55%), Gaps = 18/178 (10%)
Query: 37 TDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 96
T S ++G ++VETNF++Y+YS S L IL LF ++ + N++ G ITK S+ NA +
Sbjct: 335 TQSQEVQDGVLIVETNFKVYSYSNSPLQIAILSLFVHLKSRFINMVTGQITKRSIINALD 394
Query: 97 NGITAEQIISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDL 138
NGITAEQII++L+ +AHP++ D + +P V DQI+LW+ +
Sbjct: 395 NGITAEQIIAYLETHAHPQMRRLAEAQLEKKRELDQNCKDPLQILPPTVVDQIKLWQLEK 454
Query: 139 NRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
+RV Y +F S + YA+D +LW + KK +L V+ E + + ++ +
Sbjct: 455 DRVMAFEGSLYSDFESNAEYNILKKYAQDIGVMLWSEDKKRKLFVSQEGNSQVLDYAK 512
>gi|336466164|gb|EGO54329.1| hypothetical protein NEUTE1DRAFT_87562 [Neurospora tetrasperma FGSC
2508]
gi|350286986|gb|EGZ68233.1| transcription factor Tfb2 [Neurospora tetrasperma FGSC 2509]
Length = 475
Score = 117 bits (293), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 71/239 (29%), Positives = 120/239 (50%), Gaps = 44/239 (18%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSLTDSSARKEG----- 45
M+ D GL+ + R+ F PT+LAT L S + T ++A G
Sbjct: 240 MLPSLVDFGLIYFPRDTRQ---FFPTRLATTLTSTASALRTVSSAFTAATANPSGEPGTT 296
Query: 46 -------------------FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAI 86
+++ETN+R+YAY+TS L +L LF ++ + PN++ G +
Sbjct: 297 TSTSGGPAATPDKTQTAAKGIIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSGRL 356
Query: 87 TKESLYNAFENGITAEQIISFLQQNAHPRVADRI-----PSVPENVCDQIRLWESDLNRV 141
T++S+ A GITA+QIIS+L +AH ++ P +P V DQIRLW+ + R+
Sbjct: 357 TRDSIRQAIGFGITADQIISYLHAHAHQQMVREATVTGKPVLPPTVVDQIRLWQLENERM 416
Query: 142 EMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+ T +++F S + + A YA + L+W++ K R + A H +R++L+ + K
Sbjct: 417 QTTAGFLFKDFESTEEYAALSRYAEEIGVLVWKNDK--RQLFFASKHEQLRDYLKSRKK 473
>gi|254573074|ref|XP_002493646.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes
[Komagataella pastoris GS115]
gi|238033445|emb|CAY71467.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes
[Komagataella pastoris GS115]
gi|328354526|emb|CCA40923.1| RNA polymerase II transcription factor B subunit 2 [Komagataella
pastoris CBS 7435]
Length = 547
Score = 117 bits (293), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/227 (29%), Positives = 111/227 (48%), Gaps = 44/227 (19%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSLTD-----SSARKEG 45
M++D D GL+ + R +F PT++AT L SMS+ + G
Sbjct: 307 MLEDLVDYGLIYYPEKNR---YFYPTRMATTLTSEKTTFKTASMSMNQVLEGGPEVKDHG 363
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
+V+ETNF++Y Y+TS L IL LF + + N++ G IT+ES+ A NGITA+QII
Sbjct: 364 SIVLETNFKLYCYTTSPLQIAILNLFVHLRTRFANMVTGMITRESVREALRNGITADQII 423
Query: 106 SFLQQNAHPRVAD--------------------------RIPSVPENVCDQIRLWESDLN 139
+L+ +AH ++ ++ +P V DQI+LW+ +++
Sbjct: 424 KYLETHAHSQMKKLAEEKLLKKLEFDSNATETQQIGSQVKLEVIPPTVVDQIKLWQLEMD 483
Query: 140 RVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAE 186
R++ +++F + FE +YA + +LW D K + V E
Sbjct: 484 RLQTFAGFLFKDFANAQEFEQLANYADEVGVMLWRDDDKRKFFVTEE 530
>gi|164425579|ref|XP_960485.2| hypothetical protein NCU05523 [Neurospora crassa OR74A]
gi|157070982|gb|EAA31249.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 475
Score = 117 bits (293), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 71/239 (29%), Positives = 120/239 (50%), Gaps = 44/239 (18%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSLTDSSARKEG----- 45
M+ D GL+ + R+ F PT+LAT L S + T ++A G
Sbjct: 240 MLPSLVDFGLIYFPRDTRQ---FFPTRLATTLTSTASALRTVSSAFTAATANPSGEPGTT 296
Query: 46 -------------------FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAI 86
+++ETN+R+YAY+TS L +L LF ++ + PN++ G +
Sbjct: 297 TSTSGGPAATPDKTQTAAKGIIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSGRL 356
Query: 87 TKESLYNAFENGITAEQIISFLQQNAHPRVADRI-----PSVPENVCDQIRLWESDLNRV 141
T++S+ A GITA+QIIS+L +AH ++ P +P V DQIRLW+ + R+
Sbjct: 357 TRDSIRQAIGFGITADQIISYLHAHAHQQMVREATVTGKPVLPPTVVDQIRLWQLENERM 416
Query: 142 EMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+ T +++F S + + A YA + L+W++ K R + A H +R++L+ + K
Sbjct: 417 QTTAGFLFKDFESTEEYAALSRYAEEIGVLVWKNDK--RQLFFASKHEQLRDYLKSRKK 473
>gi|402592023|gb|EJW85952.1| transcription factor Tfb2 family protein [Wuchereria bancrofti]
Length = 470
Score = 117 bits (292), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 75/211 (35%), Positives = 113/211 (53%), Gaps = 18/211 (8%)
Query: 2 IKDFA----DLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE-----GFVVVETN 52
IKDF +LGLV +++ RK+ F T L +L+ S A E GFV+VETN
Sbjct: 254 IKDFLIHLRELGLVFIRK--RKDGVFFLTPLLNHLTGISNTSEASMENRNQHGFVIVETN 311
Query: 53 FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
+R+YAY+ S L IL F+++ Y+ ++ VG +++E++ AF+ GITA QII+FL NA
Sbjct: 312 YRVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQIIAFLSTNA 371
Query: 113 HP-------RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYA 165
HP I SVP V DQI+LWE + R+ A Y F S + Y
Sbjct: 372 HPITYAVTTEYGRMIQSVPITVADQIQLWEDERRRLTFCNATVYSSFESEKEYFGLKGYV 431
Query: 166 RDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
++ LLW + +V+ E H +++ + +
Sbjct: 432 ISQNILLWCHDVQRLMVITEEGHENVKAWWK 462
>gi|410082501|ref|XP_003958829.1| hypothetical protein KAFR_0H02850 [Kazachstania africana CBS 2517]
gi|372465418|emb|CCF59694.1| hypothetical protein KAFR_0H02850 [Kazachstania africana CBS 2517]
Length = 514
Score = 116 bits (291), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 97/172 (56%), Gaps = 18/172 (10%)
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
++G +++ETNF++Y+YS S L IL LF ++ + N++ G IT+ES+ A NGITA+
Sbjct: 336 RDGSLIIETNFKLYSYSNSPLQIAILSLFVHLKSRFVNMVTGQITRESIRRALVNGITAD 395
Query: 103 QIISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMT 144
QII++L+ +AHPR+ D + +P V DQI+LW+ +L+RV
Sbjct: 396 QIIAYLETHAHPRMRRLAEEKLEKKLELDPNSKDTLQILPPTVVDQIKLWQLELDRVITY 455
Query: 145 PAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
Y +F + + YA+D L+W+D KK + V+ E + + ++ +
Sbjct: 456 DGSLYSDFENSTEYNMLSKYAQDIGVLIWKDDKKRKFFVSKEGNTQVLDYAK 507
>gi|294660113|ref|XP_002777731.1| DEHA2G23254p [Debaryomyces hansenii CBS767]
gi|218511847|sp|Q6BGW8.2|TFB2_DEBHA RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|199434471|emb|CAR66042.1| DEHA2G23254p [Debaryomyces hansenii CBS767]
Length = 515
Score = 116 bits (290), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 73/254 (28%), Positives = 117/254 (46%), Gaps = 73/254 (28%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK----------------- 43
M+ D D GL+ Q F PT+LAT L+ +DS+A K
Sbjct: 252 MLADLRDYGLI--YQRSENSERFYPTRLATTLT---SDSAALKSPSMAMEQALESTTETE 306
Query: 44 ----------EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYN 93
+G +++ETNF++YAY+ S L IL LF ++ + N++ G IT+ES+ N
Sbjct: 307 EQQNLASNSNQGTIILETNFKIYAYTNSPLEIAILNLFVHLKTRFSNMVCGQITRESIRN 366
Query: 94 AFENGITAEQIISFLQQNAHPRVA------------------------------------ 117
A NGITA+QII FL+ +AH ++
Sbjct: 367 ALYNGITADQIIKFLETHAHSQMKILAKERLDKKIEFDTSNNINTAGGAPQSQMMTNENG 426
Query: 118 -----DRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
++ +P V DQI+LW+ +L+R++ + +++F S+ FEA +YA + LL
Sbjct: 427 TTVAQHKLEVLPPTVVDQIKLWQLELDRIQTFEGYLFKDFASQQEFEALSNYASEIGVLL 486
Query: 173 WEDSKKMRLVVNAE 186
W + KM+ V +
Sbjct: 487 WSNKSKMKFFVTKD 500
>gi|326435722|gb|EGD81292.1| hypothetical protein PTSG_11329 [Salpingoeca sp. ATCC 50818]
Length = 545
Score = 115 bits (287), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 106/188 (56%), Gaps = 9/188 (4%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL--SMSLTDSSARKEGFVVVETNFRMYAY 58
+++ F+ +GL ++ +K + PT+LA L S +L A + GF+V+ETNFR+YAY
Sbjct: 247 VLRHFSKIGLCYHRK--KKSKQYYPTRLALALRTSSALVAKDAMRCGFLVIETNFRIYAY 304
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S L +L+LF+K Y+LP ++V +IT+ S+ A GITA QI+ FL+ + R+
Sbjct: 305 TKSDLKISLLKLFAKPLYKLPTMLVASITRSSVRQAMAQGITARQILHFLKVHCSSRMRH 364
Query: 119 RIPS----VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWE 174
+ S VP V DQI LWE + +R+ T + P + +F + ++R +W
Sbjct: 365 QPHSVFSPVPPTVSDQILLWEQETSRLSKTEGVLFRHLP-KPIFPKLLAFVKERQACIWY 423
Query: 175 DSKKMRLV 182
D K ++
Sbjct: 424 DETKRAII 431
>gi|405952750|gb|EKC20525.1| General transcription factor IIH subunit 4 [Crassostrea gigas]
Length = 380
Score = 115 bits (287), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 108/198 (54%), Gaps = 25/198 (12%)
Query: 5 FADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD--SSARKEGFVVVETNFRMYAYSTSK 62
F+ LG K+Q+ + PT+LA NL+ +D S + G+++VETN+R+YAY+ S
Sbjct: 203 FSTLGKRKIQR-------YYPTRLAINLAAGQSDFTSVGKNTGYLMVETNYRVYAYTDSP 255
Query: 63 LHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPS 122
L ++ LF ++ Y+ P VG +T+ S+ +A GITA+Q P
Sbjct: 256 LQVALVALFCEMLYRFPTFSVGNLTRVSVRDALIRGITADQT----------------PV 299
Query: 123 VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLV 182
+P V DQ+RLWE + +R + Y++F S++ FE DYA+D LLW+++ K +V
Sbjct: 300 IPSTVTDQVRLWELERDRFKFMEGVLYDQFLSQNDFELLRDYAKDLGVLLWDNAIKRVMV 359
Query: 183 VNAEIHMHMREFLRGQNK 200
V H ++ + + Q +
Sbjct: 360 VTKGGHDDVKRYWKRQKQ 377
>gi|255721741|ref|XP_002545805.1| TFIIH basal transcription factor complex subunit 2 [Candida
tropicalis MYA-3404]
gi|240136294|gb|EER35847.1| TFIIH basal transcription factor complex subunit 2 [Candida
tropicalis MYA-3404]
Length = 494
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 70/245 (28%), Positives = 119/245 (48%), Gaps = 56/245 (22%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK---------------EG 45
M+ D D GLV Q F PT+LAT L+ +DS+A K +
Sbjct: 248 MLADLKDYGLV--YQRSDTSGRFYPTRLATTLT---SDSAALKTPAMAMDEEEQQVVTKE 302
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
+++ETNF++YAY+ S L IL LF +++ + N++ G IT+ES+ NA NGITA+QII
Sbjct: 303 SIIIETNFKLYAYTKSPLEIAILNLFVQLKTRFANMVCGQITRESIRNALYNGITADQII 362
Query: 106 SFLQQNAHPRV------------------------------------ADRIPSVPENVCD 129
FL+ +AH ++ ++ +P NV D
Sbjct: 363 KFLETHAHSQMRILAKEKLDKKIEFDTSHNINTAGGAPQSKTDGSISQHKLEILPPNVVD 422
Query: 130 QIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHM 189
QI+LW+ +L+R++ + +++F ++ ++ +YA + L+W D K + V +
Sbjct: 423 QIKLWQLELDRIQTFDGYLFKDFANQQEYDILSNYASELGVLIWADKVKRKFFVTKDGMT 482
Query: 190 HMREF 194
+ +F
Sbjct: 483 QVADF 487
>gi|342884444|gb|EGU84659.1| hypothetical protein FOXB_04847 [Fusarium oxysporum Fo5176]
Length = 496
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/236 (28%), Positives = 121/236 (51%), Gaps = 41/236 (17%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------------------------- 34
M+ D GL+ + K S F PT+LAT L+
Sbjct: 264 MLPSLVDFGLIYIPN--HKRSMFFPTRLATTLTSSSNSLRSISDGVAAATAAALQPGQSG 321
Query: 35 ----SLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKES 90
S+T ++ + G V++ETN+R+YAY+ S L +L LF+K+ + P+++ G I+++S
Sbjct: 322 APGGSVTAANTEQRGSVIIETNYRIYAYTQSTLQIAVLALFTKLAMRFPDMVAGRISRQS 381
Query: 91 LYNAFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
+ A + GITAEQIIS+L +AH ++ + P +P V DQIRLW+ + R++ T
Sbjct: 382 IRQAIQFGITAEQIISYLSAHAHDQMHRTAALNNKPVLPPTVVDQIRLWQLENERMKTTS 441
Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQNK 200
++ F ++ +A + L+W+ D+++M A H +R++++ + K
Sbjct: 442 GFLFKSFEDDREYKDIARFAEEVGVLVWKNDARQMFF---ASKHEQIRDYMKIRKK 494
>gi|401825125|ref|XP_003886658.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon hellem
ATCC 50504]
gi|395459803|gb|AFM97677.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon hellem
ATCC 50504]
Length = 414
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 68/197 (34%), Positives = 109/197 (55%), Gaps = 16/197 (8%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR-KEGFVVVETNFRMYAYS 59
++K LG++KL KES I + L ++S R + F++VETN ++YAY+
Sbjct: 230 LLKYLEALGILKLY----KESLAIGKSF-----VQLFEASERNRREFIIVETNNKIYAYT 280
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+ ++ LF + LPNLI G+IT+ES+ AF+ GIT +QII FL+ + P
Sbjct: 281 NSEYEKSVIHLFCSVSVNLPNLIKGSITEESVNVAFDKGITGKQIIHFLEASVKP----- 335
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
S+P + QI +WES NR+ M P + Y F + ++ ++ +RS L+ D K
Sbjct: 336 -GSLPPAISSQIMIWESKRNRIFMAPGYIYSNFLNLSDYQRVLEFCTERSYLIESDVDKR 394
Query: 180 RLVVNAEIHMHMREFLR 196
+VV E H ++EF++
Sbjct: 395 MIVVKVEGHELVKEFIK 411
>gi|302917488|ref|XP_003052448.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256733388|gb|EEU46735.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 496
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 69/235 (29%), Positives = 116/235 (49%), Gaps = 39/235 (16%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS------LTDS--------------- 39
M+ D GL+ + K S F PT+LAT L+ S ++D
Sbjct: 264 MLPSLVDFGLIYIPN--HKRSMFFPTRLATTLTSSGNSLRSISDGVAAATAAALQPGQSG 321
Query: 40 ---------SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKES 90
+ G V++ETN+R+YAY+ S L +L LFSK+ + P+++ G ++++S
Sbjct: 322 GSGGSTTGSGQEQRGSVIIETNYRIYAYTQSTLQIAVLALFSKLSMRFPDMVAGRLSRQS 381
Query: 91 LYNAFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
+ A GITA+QIIS+L +AH ++ + P +P V DQIRLW+ + R++ T
Sbjct: 382 IRQAINFGITADQIISYLAAHAHDQMHRTAALNNKPVLPPTVVDQIRLWQLENERMKTTS 441
Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+++F + A +A + L+W + K A H +R++LR + K
Sbjct: 442 GFLFKDFEDHKEYMAVAGFAEEVGVLVWRNDVKGMFF--ASKHEQIRDYLRIRKK 494
>gi|320581720|gb|EFW95939.1| TFIIH basal transcription factor complex subunit 2 [Ogataea
parapolymorpha DL-1]
Length = 544
Score = 113 bits (282), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 67/234 (28%), Positives = 116/234 (49%), Gaps = 41/234 (17%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD---------------SSARKEG 45
M+ D D GL+ K+ F PT+LAT L+ T+ ++ +G
Sbjct: 303 MLDDLIDYGLIYTPSKTNKK-IFYPTRLATTLTSEHTNFKTSAAVIDQEITNSKNSNNQG 361
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
+VVETNF++Y Y++S L IL LF ++ + N++ G IT+ES+ A NGITA+Q+I
Sbjct: 362 TIVVETNFKIYCYTSSPLQIAILNLFVHLKARFANMVTGVITRESVRRALINGITADQMI 421
Query: 106 SFLQQNAHP----RVADRIPS---------------------VPENVCDQIRLWESDLNR 140
++L +AHP + D++ +P V DQI+LW+ +L+R
Sbjct: 422 NYLGSHAHPWMVKQAEDKLARKLELESSIGNAAGQRQIDLEILPPTVVDQIKLWQLELDR 481
Query: 141 VEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREF 194
++ + Y++F + FE Y + ++W+D + R V E + + ++
Sbjct: 482 IQSFKGYLYKDFSTDLEFEKLLTYGEEIGVIVWKDRARKRFFVTQEGNGQLLDY 535
>gi|254584224|ref|XP_002497680.1| ZYRO0F11088p [Zygosaccharomyces rouxii]
gi|238940573|emb|CAR28747.1| ZYRO0F11088p [Zygosaccharomyces rouxii]
Length = 509
Score = 113 bits (282), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 92/162 (56%), Gaps = 18/162 (11%)
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
++G ++VETNF++Y+YS S L IL LF ++ + N++ G +T+ES+ A NGITA+
Sbjct: 332 QDGALIVETNFKLYSYSNSPLQIAILSLFVHMKSRFSNMVTGQLTRESIRQALLNGITAD 391
Query: 103 QIISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMT 144
QII++L+ +AHP++ D + +P V DQI+LW+ +L+R+
Sbjct: 392 QIIAYLETHAHPQMRRMAEEQLEKRLELDPNSKDPLQILPPTVVDQIKLWQLELDRIISF 451
Query: 145 PAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAE 186
Y +F + F YA+D L+W+D +K + V+ E
Sbjct: 452 EGSLYSDFENHQEFTLLSSYAQDIGVLIWKDDRKKKFFVSKE 493
>gi|396080769|gb|AFN82390.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon
romaleae SJ-2008]
Length = 414
Score = 113 bits (282), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 96/164 (58%), Gaps = 7/164 (4%)
Query: 34 MSLTDSSAR-KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLY 92
+ L ++S R + F++VETN ++YAY+ S+ ++ LF + LPNLI G+IT+ES+
Sbjct: 254 VQLFEASERNRREFIIVETNNKIYAYTNSEYEKSVIHLFCNVSVSLPNLIKGSITEESVN 313
Query: 93 NAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
AF+ GIT +QII FL+ + P S+P + QI +WES NR+ M P + Y F
Sbjct: 314 VAFDKGITGKQIIHFLEASVKP------GSLPPAITSQIMIWESKRNRIFMVPGYIYSNF 367
Query: 153 PSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
+ ++ ++ +RS L+ D K +VV E H +REF++
Sbjct: 368 LNLSDYQRVLEFCTERSYLIESDVGKRMIVVKLEGHELVREFIK 411
>gi|448100135|ref|XP_004199281.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
gi|359380703|emb|CCE82944.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
Length = 509
Score = 113 bits (282), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 75/256 (29%), Positives = 122/256 (47%), Gaps = 64/256 (25%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSL------TDSSARKE 44
M+ D D GL+ Q KE F PT+LAT L SM+L +S A +E
Sbjct: 252 MLADLKDYGLI-YQNHNDKER-FYPTRLATTLTSEAAALRTPSMALNQVVDTANSEATEE 309
Query: 45 -------GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 97
G +++ETNF++YAY+ S L IL LF ++ + PN++ G IT+ES+ A N
Sbjct: 310 QMFSNTGGSIILETNFKIYAYTNSPLEIAILNLFVHLKTRFPNMVTGQITRESIRKALFN 369
Query: 98 GITAEQIISFLQQNAHPRVAD--------------------------------------- 118
GITA+QII FL+ +AH ++
Sbjct: 370 GITADQIIKFLESHAHSQMKQMAKENLDKKIEFDANNNINTATDSQQLSVGEFGSASSYH 429
Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
++ +P V DQI+LW+ +L+R++ + +++F S+ ++A +YA + LLW D +
Sbjct: 430 KLEILPPTVVDQIKLWQLELDRIQTFEGYLFKDFSSQQEYDALRNYASELGVLLWSDKNR 489
Query: 179 MRLVVNAEIHMHMREF 194
+ V + + +F
Sbjct: 490 KKFFVTKDGMSQVADF 505
>gi|448103836|ref|XP_004200137.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
gi|359381559|emb|CCE82018.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
Length = 512
Score = 112 bits (281), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 75/256 (29%), Positives = 121/256 (47%), Gaps = 64/256 (25%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSL------TDSSARKE 44
M+ D D GL+ Q KE F PT+LAT L SM+L +S A +E
Sbjct: 252 MLADLKDYGLI-YQNHNDKER-FYPTRLATTLTSEAAALRTPSMALNQVVDTANSEATEE 309
Query: 45 -------GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 97
G +++ETNF++YAY+ S L IL LF ++ + PN++ G IT+ES+ A N
Sbjct: 310 QMFSNTGGSIILETNFKIYAYTNSPLEIAILNLFVHLKTRFPNMVTGQITRESIRKALFN 369
Query: 98 GITAEQIISFLQQNAHPRVAD--------------------------------------- 118
GITA+QII FL+ +AH ++
Sbjct: 370 GITADQIIKFLESHAHSQMKQMAKENLDKKIEFDANNNINTATDNQQLSVGEFGSANSYH 429
Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
++ +P V DQI+LW+ +L+R++ + +++F S+ ++A YA + LLW D +
Sbjct: 430 KLEVLPPTVVDQIKLWQLELDRIQTFEGYLFKDFSSQQEYDALRSYASELGVLLWSDKNR 489
Query: 179 MRLVVNAEIHMHMREF 194
+ V + + +F
Sbjct: 490 KKFFVTKDGMSQVADF 505
>gi|340939354|gb|EGS19976.1| RNA polymerase II transcription factor B subunit 2-like protein
[Chaetomium thermophilum var. thermophilum DSM 1495]
Length = 514
Score = 112 bits (281), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 69/232 (29%), Positives = 119/232 (51%), Gaps = 38/232 (16%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT----------------------- 37
M+ D GL+ + + ++ + PT+LAT L+ S +
Sbjct: 287 MLPALVDFGLIYIPREDTRQ--YFPTRLATTLTSSASALRSVSSGFTAATNNTANDASSL 344
Query: 38 ---DSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 94
D SA K G +++ETN+R+YAY++S L +L LF+ + + ++ G +T+ES+ A
Sbjct: 345 GGADPSAHK-GSIIIETNYRLYAYTSSPLQIAVLALFTHLNMRFAGMVTGRLTRESIRRA 403
Query: 95 FENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYY 149
GITA+QIIS+L +AH ++ A P +P V DQIRLW+ + R+ +P +
Sbjct: 404 ISFGITADQIISYLASHAHEQMVRAAAAAGRPVLPPTVVDQIRLWQLENERMRTSPGFLF 463
Query: 150 EEFPSRDVFEAACDYARDRSGLLW-EDSKKMRLVVNAEIHMHMREFLRGQNK 200
++F + + + A YA + L+W D K+M E +R++L+ + K
Sbjct: 464 KDFENVEEYMALAGYAEEIGVLVWRSDRKRMFFASKFE---QLRDYLKSRKK 512
>gi|255713706|ref|XP_002553135.1| KLTH0D09790p [Lachancea thermotolerans]
gi|238934515|emb|CAR22697.1| KLTH0D09790p [Lachancea thermotolerans CBS 6340]
Length = 506
Score = 112 bits (280), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 56/161 (34%), Positives = 93/161 (57%), Gaps = 18/161 (11%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
+G V++ETNF++Y+YS S L +L LF ++ + N++ G IT+ES+ A NGITA+Q
Sbjct: 330 DGAVILETNFKLYSYSNSPLQIAVLSLFVHLKSRFSNMVTGQITRESIRRALHNGITADQ 389
Query: 104 IISFLQQNAHP---RVADR---------------IPSVPENVCDQIRLWESDLNRVEMTP 145
II++++ +AHP R+A++ + +P V DQI+LW+ +L+R+
Sbjct: 390 IIAYMETHAHPQMRRLAEQTLDKKMELDPNCNEGLQILPPTVVDQIKLWQLELDRIISYD 449
Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAE 186
+ + +F + + YA+D LLW D KK R V+ E
Sbjct: 450 GYLFTDFENFQEYNMLSSYAKDIGVLLWSDDKKKRFFVSQE 490
>gi|68490740|ref|XP_710812.1| hypothetical protein CaO19.13268 [Candida albicans SC5314]
gi|68490763|ref|XP_710801.1| hypothetical protein CaO19.5846 [Candida albicans SC5314]
gi|46432046|gb|EAK91553.1| hypothetical protein CaO19.5846 [Candida albicans SC5314]
gi|46432058|gb|EAK91564.1| hypothetical protein CaO19.13268 [Candida albicans SC5314]
gi|238879997|gb|EEQ43635.1| TFIIH basal transcription factor complex subunit 2 [Candida
albicans WO-1]
Length = 494
Score = 112 bits (280), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 69/224 (30%), Positives = 111/224 (49%), Gaps = 52/224 (23%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSLTDSS---ARKEGFV 47
M+ D D GLV Q F PT+LAT L SM++ + A KE +
Sbjct: 248 MLADLKDYGLV--YQRSDTSGRFYPTRLATTLTSDSAALKTPSMAMDEEEQQVATKES-I 304
Query: 48 VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
++ETNF++YAY+ S L IL LF + + N++ G IT+ES+ NA NGITA+QII F
Sbjct: 305 IIETNFKIYAYTKSPLEIAILNLFVHFKTRFANMVCGQITRESIRNALYNGITADQIIKF 364
Query: 108 LQQNAHPRV------------------------------------ADRIPSVPENVCDQI 131
L+ +AHP++ ++ +P NV DQI
Sbjct: 365 LETHAHPQMRILAKEKLDKKIEFDTSHNINTAGGAPQSKTDGMVSQHKLEILPPNVVDQI 424
Query: 132 RLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWED 175
+LW+ +L+R++ + +++F ++ ++ +YA + L+W D
Sbjct: 425 KLWQLELDRIQTFDGYLFKDFSNQQEYDILSNYASELGVLIWAD 468
>gi|241958748|ref|XP_002422093.1| TFIIH subunit, putative; tanscription initiation factor IIB,
putative [Candida dubliniensis CD36]
gi|223645438|emb|CAX40094.1| TFIIH subunit, putative [Candida dubliniensis CD36]
Length = 494
Score = 112 bits (280), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 68/226 (30%), Positives = 111/226 (49%), Gaps = 56/226 (24%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK---------------EG 45
M+ D D GLV Q F PT+LAT L+ +DS+A K +
Sbjct: 248 MLADLKDYGLV--YQRSDTSGRFYPTRLATTLT---SDSAALKTPSMAMDEEEQQVVSKE 302
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
+++ETNF++YAY+ S L IL LF + + N++ G IT+ES+ NA NGITA+QII
Sbjct: 303 SIIIETNFKIYAYTKSPLEIAILNLFVHFKTRFANMVCGQITRESIRNALYNGITADQII 362
Query: 106 SFLQQNAHPRV------------------------------------ADRIPSVPENVCD 129
FL+ +AHP++ ++ +P NV D
Sbjct: 363 KFLETHAHPQMRILAKEKLDKKIEFDTSHNINTAGGAPQSKTDGMISQHKLEILPPNVVD 422
Query: 130 QIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWED 175
QI+LW+ +L+R++ + +++F ++ ++ +YA + L+W D
Sbjct: 423 QIKLWQLELDRIQTFDGYLFKDFSNQQEYDILSNYASELGVLIWAD 468
>gi|303388087|ref|XP_003072278.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon
intestinalis ATCC 50506]
gi|303301417|gb|ADM10918.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon
intestinalis ATCC 50506]
Length = 414
Score = 112 bits (280), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 99/164 (60%), Gaps = 7/164 (4%)
Query: 34 MSLTDSSAR-KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLY 92
+ L ++S R + F++VETN ++YAY+ S+ ++ LF + + LPNL G+IT+ES+
Sbjct: 254 VQLFEASERNRREFIIVETNNKIYAYTNSEYEKSVIHLFCNVSFNLPNLTKGSITEESVN 313
Query: 93 NAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
AF+ GIT QII FL+ ++ P S+P + +QI +WES NR+ M P + Y F
Sbjct: 314 AAFDKGITGRQIIHFLEASSKP------GSLPPAIINQIIIWESKRNRIFMAPGYLYSNF 367
Query: 153 PSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
+ ++ ++ +R+ L+ D + +VVN + H+ ++EF++
Sbjct: 368 LNLSDYQKVLEFCSERNYLIESDIDRRMIVVNPKGHVFVKEFIK 411
>gi|389642093|ref|XP_003718679.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae 70-15]
gi|351641232|gb|EHA49095.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae 70-15]
gi|440469172|gb|ELQ38293.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae Y34]
gi|440481925|gb|ELQ62459.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae P131]
Length = 490
Score = 112 bits (280), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 114/201 (56%), Gaps = 25/201 (12%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL--------SMSLTDSSARKE-------G 45
M+ D D GLV + + + PT+LAT L S+S ++A E G
Sbjct: 269 MLPDLNDFGLVYIPPSNPDQ--YFPTRLATTLTSGSSALRSVSSGVAAATAEAGENNTKG 326
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
+++ETNFR+YAY+++ L IL LF+ ++ + ++ G +++ S+ A +GITA+QII
Sbjct: 327 AIILETNFRIYAYTSTPLQIAILALFANLKMRFAGMVTGQLSRHSIKRAISHGITADQII 386
Query: 106 SFLQQNAHPRVADRI------PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFE 159
+L +AH ++ RI P +P V DQIRLW+ + R+++ + +++F S+ F+
Sbjct: 387 EYLASHAHEQM-HRIAAIRNKPVLPPTVVDQIRLWQLETERMQVQRGYLFKDFESQAEFK 445
Query: 160 AACDYARDRSGLLW-EDSKKM 179
A DYA + L+W D++++
Sbjct: 446 AIADYADEVGVLIWRSDARQL 466
>gi|392574410|gb|EIW67546.1| hypothetical protein TREMEDRAFT_33390 [Tremella mesenterica DSM
1558]
Length = 504
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 66/214 (30%), Positives = 113/214 (52%), Gaps = 16/214 (7%)
Query: 2 IKDFADLGLV--KLQQVGRKESWFIPTKLATNL------------SMSLTDSSARKEGFV 47
+ D GL+ G K +F PT L+T+L + + S + F+
Sbjct: 287 LDDLQSYGLIYRPATPSGAKADYFFPTHLSTSLCSGNSALSAALNTTEIDTSHEDDKKFL 346
Query: 48 VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
++ETN+++YAY+ ++L IL LF I+ Q NL+VG + + + A E GI+A QII++
Sbjct: 347 ILETNYKIYAYTNNELEIAILNLFMDIKVQYRNLVVGKLDRAHVKAAMEKGISAYQIIAY 406
Query: 108 LQQNAHPRVADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
LQ +AHP++ P + + DQ+ LW+ + NR+ + +E F S+D++E A+
Sbjct: 407 LQSHAHPQMYSSPPPILHTTIVDQLHLWDKERNRLRTEESEMFEFF-SKDLYEDTEAEAK 465
Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
GLL K L VN + +++F++GQ +
Sbjct: 466 RYDGLLLAVPKDKLLFVNPSVKDAIKDFVKGQQR 499
>gi|453082965|gb|EMF11011.1| RNA polymerase II transcription factor B subunit 2 [Mycosphaerella
populorum SO2202]
Length = 507
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 103/192 (53%), Gaps = 29/192 (15%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM---SLTDSS----------------- 40
M+ D + GLV + + PT+LAT L+ +L ++S
Sbjct: 259 MLDDLTNFGLVYRRSPSSDR--YYPTRLATTLTSDAPALPNNSFTTTTTTNQNADPNDPA 316
Query: 41 --ARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
A ++G++++ETN+R+YAY++S L IL LF+ + + PNLI ITK S++ A +G
Sbjct: 317 ATASEKGYIILETNYRLYAYTSSPLPISILSLFASLNTRYPNLITAKITKTSIHTAIASG 376
Query: 99 ITAEQIISFLQQNAHP-----RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFP 153
IT++QIIS+LQ +AHP + P +P V DQIRLW+ + R++ T + +
Sbjct: 377 ITSDQIISYLQTHAHPILRRQAALNSAPILPPTVVDQIRLWQIEGERMKSTKGYLIRDVG 436
Query: 154 SRDVFEAACDYA 165
+ D + A YA
Sbjct: 437 TEDDYTKAVQYA 448
>gi|340518979|gb|EGR49219.1| trancription factor tfb2-like protein [Trichoderma reesei QM6a]
Length = 480
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/220 (29%), Positives = 110/220 (50%), Gaps = 24/220 (10%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS---------------MSLTDSSARKEG 45
M+ D GL+ + Q K S F PT+LAT L+ + + + G
Sbjct: 263 MLPSLVDFGLIYIPQ--HKRSMFFPTRLATTLTSGGNSLRTISEGSLGLLGGGGTGEQAG 320
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
VV+ETN+R+YAY+ S L +L LF+K+ + P+++ G +++ S+ A GITA+QII
Sbjct: 321 SVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQAINFGITADQII 380
Query: 106 SFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEA 160
S+L +AH ++ P +P V DQIRLW+ + R++ T + +F +
Sbjct: 381 SYLSAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFTDDKDYLD 440
Query: 161 ACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
++ + L+W + N H +++FL+ + K
Sbjct: 441 TARFSEEIGVLVWRNDNTRMFFANK--HEQIKDFLKSRKK 478
>gi|449296697|gb|EMC92716.1| hypothetical protein BAUCODRAFT_259506 [Baudoinia compniacensis
UAMH 10762]
Length = 489
Score = 111 bits (278), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 63/195 (32%), Positives = 106/195 (54%), Gaps = 26/195 (13%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT----------------------- 37
M+ D +D GLV + E+ + PT+LAT L+
Sbjct: 260 MLDDLSDFGLVYRRS--NDEARYYPTRLATTLTSDAPALPNTSLTSTTSTTTLASSSTDP 317
Query: 38 DSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 97
+SA ++G++++ETN+R+YAY++S L IL LF+ ++ + PNLI +TK S+ +A +
Sbjct: 318 SASANEKGYIILETNYRLYAYTSSPLPITILSLFATLKTRYPNLITAKLTKSSIQSAIAS 377
Query: 98 GITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDV 157
GIT++QII++L +AHP + + P +P V DQIRLW+ + R+ + + +
Sbjct: 378 GITSDQIITYLTTHAHPILRRQNPVLPPTVVDQIRLWQIEGERMTAWKGFLIRDVGTAEE 437
Query: 158 FEAACDYARDRSGLL 172
++ A YA D G+L
Sbjct: 438 YDKAVQYA-DALGVL 451
>gi|367030175|ref|XP_003664371.1| hypothetical protein MYCTH_2307126 [Myceliophthora thermophila ATCC
42464]
gi|347011641|gb|AEO59126.1| hypothetical protein MYCTH_2307126 [Myceliophthora thermophila ATCC
42464]
Length = 512
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/232 (28%), Positives = 121/232 (52%), Gaps = 36/232 (15%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------------------------- 34
M+ D GL+ + + ++ + PT+LAT L+
Sbjct: 283 MLPALVDFGLIYIPREDTRQ--YFPTRLATTLTSSASSLRSVSSGFSAAAANNPGDASSL 340
Query: 35 -SLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYN 93
+ ++SA +G V++ETN+R+YAY++S L +L LF+++ + + G +T++S+
Sbjct: 341 GTTPETSAASKGSVIIETNYRLYAYTSSPLQIAVLALFTQLNMRFAGMATGRLTRDSIRR 400
Query: 94 AFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHY 148
A GITA+QIIS+L +AH ++ A P +P V DQIRLW+ + R+ +
Sbjct: 401 AIGFGITADQIISYLASHAHEQMVRAAAATGRPVLPPTVVDQIRLWQLENERMRTSAGFL 460
Query: 149 YEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+++F S D + + YA + L+W+ KK R+ ++I +R++LR + +
Sbjct: 461 FKDFDSLDEYLSLSAYAEEIGVLVWKSDKK-RMFYASKIE-QLRDYLRSRKR 510
>gi|401887601|gb|EJT51582.1| hypothetical protein A1Q1_07170 [Trichosporon asahii var. asahii
CBS 2479]
gi|406699646|gb|EKD02845.1| hypothetical protein A1Q2_02789 [Trichosporon asahii var. asahii
CBS 8904]
Length = 451
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 71/207 (34%), Positives = 119/207 (57%), Gaps = 14/207 (6%)
Query: 2 IKDFADLGLV-KLQQV-GRKESWFIPTKLATNLSMSLTD----SSARKEGFVVVETNFRM 55
+ D G + + Q+V GRK F PT LAT+L T ++A + F+++ETN+R+
Sbjct: 237 LDDLEQYGFIYRRQEVDGRKTDQFFPTHLATSLCSGDTSVNRSATADDKRFLILETNYRV 296
Query: 56 YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
YAY++++L IL LF I + PNLIVG + + + A + GI+A QIIS+L +AHP+
Sbjct: 297 YAYTSNELEIAILNLFVNITIRYPNLIVGHLDRRHVKAAMDKGISAYQIISYLTTHAHPQ 356
Query: 116 VADRIPS-VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWE 174
+ + P + +V DQ+ LW+ + NRV+ + EF S+++FE D A +R G L
Sbjct: 357 MYNHPPPLLHPSVTDQLHLWDRERNRVQAQDTVMF-EFYSKELFETCRDEA-ERMGALQH 414
Query: 175 DSKKMR-----LVVNAEIHMHMREFLR 196
++ + + V+ I +R+F++
Sbjct: 415 SVQRGQGMAKLIFVDPTIRESLRDFIK 441
>gi|367009216|ref|XP_003679109.1| hypothetical protein TDEL_0A05660 [Torulaspora delbrueckii]
gi|359746766|emb|CCE89898.1| hypothetical protein TDEL_0A05660 [Torulaspora delbrueckii]
Length = 513
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 95/172 (55%), Gaps = 18/172 (10%)
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
++G ++VETNF++Y+YS S L IL LF ++ + N++ G IT+ES+ A NGITA+
Sbjct: 332 QDGALIVETNFKLYSYSNSPLQIAILSLFVHLKSRFTNMVTGQITRESIRRALINGITAD 391
Query: 103 QIISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMT 144
QII++LQ +AHP++ D + +P V DQI+LW+ +L+R+
Sbjct: 392 QIIAYLQTHAHPQMRRLAEEKLEKKLELDANCKDTLQILPPTVVDQIKLWQLELDRIISY 451
Query: 145 PAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
Y + S + YA+D LLW+D +K V+ E + + ++ +
Sbjct: 452 DGSLYSDIDSHQEYILLSTYAQDIGVLLWKDDRKRIFFVSKEGNSQVLDYAK 503
>gi|358385986|gb|EHK23582.1| hypothetical protein TRIVIDRAFT_36912 [Trichoderma virens Gv29-8]
Length = 492
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/232 (28%), Positives = 111/232 (47%), Gaps = 36/232 (15%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS------------------------- 35
M+ D GL+ + Q K S F PT+LAT L+
Sbjct: 263 MLPSLVDFGLIYIPQ--HKRSMFFPTRLATTLTSGGNSLRTISEGVTAATQSAQTSQQAL 320
Query: 36 --LTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYN 93
L S + G VV+ETN+R+YAY+ S L +L LF+K+ + P+++ G +++ S+
Sbjct: 321 GPLGGSGGEQAGSVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQ 380
Query: 94 AFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHY 148
A GITA+QIIS+L +AH ++ P +P V DQIRLW+ + R++ T
Sbjct: 381 AINFGITADQIISYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFL 440
Query: 149 YEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+ +F + ++ + L+W++ N H +++FL+ + +
Sbjct: 441 FRDFTDDKDYLDTARFSEEIGVLVWKNDHARMFFANK--HEQIKDFLKTRKR 490
>gi|367040709|ref|XP_003650735.1| hypothetical protein THITE_2110518 [Thielavia terrestris NRRL 8126]
gi|346997996|gb|AEO64399.1| hypothetical protein THITE_2110518 [Thielavia terrestris NRRL 8126]
Length = 513
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 64/231 (27%), Positives = 118/231 (51%), Gaps = 35/231 (15%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL--------------------------SM 34
M+ D GL+ + + ++ + PT+LAT L S+
Sbjct: 285 MLPALVDFGLIYIPREDTRQ--YFPTRLATTLTSSASALRSVSSGFSAAAANNPGDAASL 342
Query: 35 SLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 94
+T + +G +++ETN+R+YAY++S L +L LF+++ + ++ G +T++S+ A
Sbjct: 343 GMTPDTTPTKGSLIIETNYRLYAYTSSPLQIAVLALFTQLNMRFAGMVTGRLTRDSIRRA 402
Query: 95 FENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYY 149
GITA+QIIS+L +AH ++ A P +P V DQIRLW+ + R+ T +
Sbjct: 403 ISFGITADQIISYLASHAHEQMTRAAAAAGRPVLPPTVVDQIRLWQLENERMRTTAGFLF 462
Query: 150 EEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
++F S + + + YA + L+W+ +K V A +R++L+ + K
Sbjct: 463 KDFDSPEEYVSLSGYAEEIGVLVWKSDRKR--VFFASKFEQLRDYLKSRKK 511
>gi|406604717|emb|CCH43852.1| RNA polymerase II transcription factor B subunit 2 [Wickerhamomyces
ciferrii]
Length = 488
Score = 110 bits (274), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 66/227 (29%), Positives = 112/227 (49%), Gaps = 43/227 (18%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM----------------------SLTD 38
M+ D D GL+ ++ + F PT+LAT L+ + T
Sbjct: 247 MLGDLRDYGLIYQRKSTSRR--FYPTRLATTLTSDTTSLRSASSAMNKVIENAKDSAYTP 304
Query: 39 SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
++ G +++ETNF++YAY+ S L IL LF ++ + NL+ G IT+ES+ A +G
Sbjct: 305 TNIENSGTIIIETNFKVYAYTNSPLQIAILNLFVHLKARFSNLVTGQITRESIRKALVSG 364
Query: 99 ITAEQIISFLQQNAHPRV-------------------ADRIPSVPENVCDQIRLWESDLN 139
IT+EQIIS+L+ +AHP++ ++I + + DQI+LW+ +L+
Sbjct: 365 ITSEQIISYLESHAHPQLRRAAEEELNKKNGFESNNHGEKIQILQPTIADQIKLWQLELD 424
Query: 140 RVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAE 186
R+ + + +F S ++ Y+ + LLW DS K + V E
Sbjct: 425 RIMSFDGYLFTDFSSDQEYQVLSKYSEEIGVLLWNDSSKKKFFVTKE 471
>gi|444320663|ref|XP_004180988.1| hypothetical protein TBLA_0E04140 [Tetrapisispora blattae CBS 6284]
gi|387514031|emb|CCH61469.1| hypothetical protein TBLA_0E04140 [Tetrapisispora blattae CBS 6284]
Length = 518
Score = 110 bits (274), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 91/165 (55%), Gaps = 18/165 (10%)
Query: 37 TDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 96
+++ A +G +++ETNF++Y+YS S L +L LF ++ + N++ G IT++S+ A
Sbjct: 321 SNTQATADGALIIETNFKLYSYSNSPLQIAVLSLFVHLKTRFSNMVTGQITRDSIRRALR 380
Query: 97 NGITAEQIISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDL 138
NGITAEQII++L ++HP++ D + +P V DQI+LW+ +L
Sbjct: 381 NGITAEQIIAYLLTHSHPQMRRLAEGNLEKKLELDPNAKDSLQILPPTVVDQIKLWQLEL 440
Query: 139 NRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVV 183
+R+ Y +F + + YA D LLW++ KK + V
Sbjct: 441 DRIISHEGSLYSDFENNQEYNLLSTYAEDIGVLLWKNDKKRKFFV 485
>gi|430812024|emb|CCJ30551.1| unnamed protein product [Pneumocystis jirovecii]
Length = 319
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 65/186 (34%), Positives = 98/186 (52%), Gaps = 21/186 (11%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
++++ D G+V ++ R+ F PT+LAT L+ + E +
Sbjct: 139 VLENLKDYGIVYRRKSSRR---FYPTRLATILT-------SNSEILI-----------RH 177
Query: 61 SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
S L +L LF + + NL+VG IT+ S+ AF NGITAEQIIS+L AHP++
Sbjct: 178 SPLQIAVLNLFVHLYLRFSNLVVGVITRNSVRQAFMNGITAEQIISYLTSYAHPQMKKNT 237
Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
P +P V DQIRLWE + NR++ T + + +F S FE A YA + + ++WE K
Sbjct: 238 PIIPPTVNDQIRLWEMERNRLKATEGYLFRDFNSNSDFELALKYANELNVVVWEAPSKRV 297
Query: 181 LVVNAE 186
VN +
Sbjct: 298 FFVNIQ 303
>gi|223993063|ref|XP_002286215.1| transcription factor of TFIIH family [Thalassiosira pseudonana
CCMP1335]
gi|220977530|gb|EED95856.1| transcription factor of TFIIH family [Thalassiosira pseudonana
CCMP1335]
Length = 461
Score = 109 bits (272), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 74/195 (37%), Positives = 104/195 (53%), Gaps = 27/195 (13%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSS----------------ARKE 44
+++DF+ GL+ + +V K S F PT+ A NL S + +R
Sbjct: 272 LMRDFSRFGLLFVCRVAGKNS-FYPTRAAVNLVASSEKAGRQASDLMGQTLAAPVPSRSH 330
Query: 45 GFVVVETNFRMYAYSTSKLHCEILRLFSKIEY--QLPNLIVGAITKESLYNAFENGITAE 102
VVV+TNF++ AY+ SKLH L LF + +LPN+I IT++S+ +AF G+TA+
Sbjct: 331 LAVVVQTNFQVVAYTKSKLHISTLGLFCDVSSFRRLPNVIFFHITRDSIRSAFRLGVTAD 390
Query: 103 QIISFLQQNAHP--RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDV--F 158
QI+ FL +AHP R DR P VP NV DQI LW+ + +RV M ++ RD F
Sbjct: 391 QILRFLHVHAHPMLRSGDR-PMVPSNVVDQILLWDRERHRVVMDEVCVHQ---CRDAAEF 446
Query: 159 EAACDYARDRSGLLW 173
A YA D L W
Sbjct: 447 TAVSQYASDVDALAW 461
>gi|323306981|gb|EGA60265.1| Tfb2p [Saccharomyces cerevisiae FostersO]
Length = 448
Score = 109 bits (272), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 87/153 (56%), Gaps = 18/153 (11%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
+G ++VETNF++Y+YS S L +L LF ++ + N+++G IT+ES+ A NGITA+Q
Sbjct: 287 DGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQ 346
Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
II++L+ +AHP++ + + +P V DQIRLW+ +L+RV
Sbjct: 347 IIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYE 406
Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
Y +F + + YA+D LLW+D K
Sbjct: 407 GSLYSDFETSQEYNLLSKYAQDIGVLLWKDDXK 439
>gi|346323394|gb|EGX92992.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Cordyceps militaris CM01]
Length = 511
Score = 108 bits (271), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 71/244 (29%), Positives = 118/244 (48%), Gaps = 48/244 (19%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------SLTDS-------------- 39
M+ AD GLV + +K S F PT+LAT L+ S++D
Sbjct: 270 MLPALADFGLVYIPP--QKRSMFFPTRLATTLTSGASSGLRSISDGVAAATSAARSSPSS 327
Query: 40 ------------------SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNL 81
S + G VV+ETN+R+YAY+ S L +L LF+++ + P++
Sbjct: 328 SAHQPASSSAGSGPLGGGSGDQPGAVVIETNYRLYAYTQSTLQIAVLALFARLTMRFPDM 387
Query: 82 IVGAITKESLYNAFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWES 136
+ G +T+ S+ A + GITA+QII++L +AH ++ P +P V DQIRLW+
Sbjct: 388 VAGRLTRTSIRQAIQFGITADQIIAYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQL 447
Query: 137 DLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
+ R++ TP + +F + +A + L+W D KK + A +R++L+
Sbjct: 448 ENERMKTTPGFLFRDFDDPKEYHDTRRFADEIGVLVWSDDKKG--LFFASKFEQIRDYLK 505
Query: 197 GQNK 200
+ K
Sbjct: 506 SRKK 509
>gi|320587960|gb|EFX00435.1| tfiih and nucleotide excision repair factor 3 complexes subunit
[Grosmannia clavigera kw1407]
Length = 518
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 56/161 (34%), Positives = 92/161 (57%), Gaps = 7/161 (4%)
Query: 45 GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
G V++ETNFR+YAY+ S L +L LFSK+ + P ++ G +T++S+ A GIT++QI
Sbjct: 356 GSVILETNFRLYAYTASPLQIAVLALFSKLSQRFPGMVAGKLTRDSVRRAISYGITSDQI 415
Query: 105 ISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFE 159
IS+L NAH ++ A+ P +P V DQIRLWE + R++ +++F S ++
Sbjct: 416 ISYLAANAHEQMHKYAAANHRPVLPPTVVDQIRLWELENERMKSHKGFLFKDFDSAKEYD 475
Query: 160 AACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
YA + L++ + KK V H + +L+ + K
Sbjct: 476 MLAQYADEIGVLVYRNDKKRHFFVTK--HEQLVTYLKARKK 514
>gi|116201905|ref|XP_001226764.1| hypothetical protein CHGG_08837 [Chaetomium globosum CBS 148.51]
gi|88177355|gb|EAQ84823.1| hypothetical protein CHGG_08837 [Chaetomium globosum CBS 148.51]
Length = 503
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 65/232 (28%), Positives = 120/232 (51%), Gaps = 36/232 (15%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------------------------- 34
M+ D GL+ + + ++ + PT+LAT L+
Sbjct: 274 MLPHLVDFGLIFIPREDTRQ--YFPTRLATTLTSSASSLRSVSSGFNAATANNPGDTSSL 331
Query: 35 -SLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYN 93
+ ++SA +G V++ETN+R+YAY++S L +L LF+++ + + G +T++S+
Sbjct: 332 GTTPETSAASKGSVIIETNYRLYAYTSSPLQIAVLGLFTQLNMRFAGMATGRLTRDSIRR 391
Query: 94 AFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHY 148
A GITA+QIIS+L +AH ++ A P +P V DQIRLW+ + R+ +
Sbjct: 392 AIGFGITADQIISYLASHAHDQMVRAAAATGRPVLPPTVVDQIRLWQLENERMRTSAGFL 451
Query: 149 YEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+++F S + + + YA + L+W KK R+ ++I +R++L+ + K
Sbjct: 452 FKDFDSLEEYVSLSSYADEVGVLVWRSDKK-RMFYASKIE-QLRDYLKSRKK 501
>gi|308460920|ref|XP_003092758.1| hypothetical protein CRE_20976 [Caenorhabditis remanei]
gi|308252517|gb|EFO96469.1| hypothetical protein CRE_20976 [Caenorhabditis remanei]
Length = 128
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 78/117 (66%), Gaps = 3/117 (2%)
Query: 83 VGAITKESLYNAFENGITAEQIISFLQQNAHPR---VADRIPSVPENVCDQIRLWESDLN 139
VG IT+ES+ A ++GITA QIISFL+ NAHP+ + I +P V DQIRLWE +
Sbjct: 3 VGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGAINCLPITVADQIRLWEDERR 62
Query: 140 RVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
R+++ A+ Y F S D F+ CDYA++R LLW ++++ ++VN E H ++R++ +
Sbjct: 63 RMDLKDAYIYSHFESEDEFQGVCDYAQERGILLWANAQQKLVIVNEEGHEYVRQWYK 119
>gi|361130709|gb|EHL02459.1| putative RNA polymerase II transcription factor B subunit 2 [Glarea
lozoyensis 74030]
Length = 210
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/206 (30%), Positives = 108/206 (52%), Gaps = 33/206 (16%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE---------------- 44
M+++ D GLV + + + F PT+LAT L+ +D+SA +
Sbjct: 1 MLENLVDFGLVYIPPSVKDQ--FFPTRLATTLT---SDASALRSITAGFESALSTGSGTA 55
Query: 45 GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
GF+++ETN+ S L +L LF+K+ + PN++ G +T+ES+Y A +GI+++QI
Sbjct: 56 GFIIIETNYH------SPLQIAVLALFTKLTTRYPNMVTGRVTRESVYRAVSSGISSDQI 109
Query: 105 ISFLQQNAHPRVADRI------PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVF 158
I++L +AHP + P +P V DQIRLW+ + R++ T +++F S D +
Sbjct: 110 ITYLSTHAHPELLKASAAKGGGPVLPPTVVDQIRLWQIENERMKTTYGFLFKDFESTDEY 169
Query: 159 EAACDYARDRSGLLWEDSKKMRLVVN 184
+ YA + L W + + V+
Sbjct: 170 KKLLKYADEIGVLTWASKTEEKFFVS 195
>gi|405123868|gb|AFR98631.1| transcription factor TFIIH complex subunit Tfb2 [Cryptococcus
neoformans var. grubii H99]
Length = 481
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 111/200 (55%), Gaps = 8/200 (4%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL----SMSLTDSSARKEGFVVVETNFRMYA 57
+ D A G + G + W PT LAT+L + ++ SA + F+++ETN+++YA
Sbjct: 275 LDDLAQYGFIYKSSPGSDQFW--PTHLATSLCSGDASAIQSQSADDKRFLILETNYKIYA 332
Query: 58 YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
Y++++L IL LF I + PNL+VG + ++ + A E GI+A QII++L +AHP++
Sbjct: 333 YTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISAGQIIAYLSSHAHPQMY 392
Query: 118 DRIPS-VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
+ P + + DQ+ LW+ + NR++ YE F S+++F+ + A+ + L +
Sbjct: 393 NSPPPLLHPTIVDQLHLWDRERNRLQTEETVMYEFF-SKELFDDTVNEAKANAALQHAAT 451
Query: 177 KKMRLVVNAEIHMHMREFLR 196
+ L + + EF++
Sbjct: 452 SQKLLFIEPHTKPAITEFVK 471
>gi|401623322|gb|EJS41426.1| tfb2p [Saccharomyces arboricola H-6]
Length = 513
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 97/171 (56%), Gaps = 18/171 (10%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
+G ++VETNF++Y+YS S L +L LF ++ + N+++G IT+ES+ A NGITAEQ
Sbjct: 336 DGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITAEQ 395
Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
II++L+ +AHP++ + + +P V DQIRLW+ +L+RV
Sbjct: 396 IIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVMTYE 455
Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
Y +F + + YA+D LLW+D KK + ++ E + + +F +
Sbjct: 456 GSLYSDFETSQEYNVLNKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDFAK 506
>gi|355693906|gb|AER99491.1| proteinral transcription factor IIH, polypeptide 4, 52kDa [Mustela
putorius furo]
Length = 143
Score = 106 bits (264), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 52/139 (37%), Positives = 84/139 (60%)
Query: 61 SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 2 SELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQT 61
Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
P +P + DQIRLWE + +R+ T Y +F S+ FE +AR+ L++E+S K
Sbjct: 62 PVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRL 121
Query: 181 LVVNAEIHMHMREFLRGQN 199
+VV H ++ F + Q
Sbjct: 122 MVVTPAGHSDVKRFWKRQK 140
>gi|429962184|gb|ELA41728.1| hypothetical protein VICG_01232 [Vittaforma corneae ATCC 50505]
Length = 449
Score = 106 bits (264), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 54/155 (34%), Positives = 85/155 (54%), Gaps = 8/155 (5%)
Query: 46 FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
F+V+ETNF++YAY+++ +L LFSK Y PNLI +ESL +AF GITA+QII
Sbjct: 303 FIVLETNFKIYAYTSTAYDKSVLSLFSKTVYVFPNLIKACFDEESLLSAFNKGITAKQII 362
Query: 106 SFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYA 165
+LQ+++ VP+N+ +QI +WE +R+ + Y +F F Y
Sbjct: 363 KYLQEHSE--------EVPKNIVNQISIWEHRQHRIRARNGYLYHDFIHLSDFHRVLRYV 414
Query: 166 RDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+ GL++ D K +V IH ++ F++ K
Sbjct: 415 ESKGGLIYRDEVKRMIVGEERIHESVKNFIKEMQK 449
>gi|6325135|ref|NP_015203.1| Tfb2p [Saccharomyces cerevisiae S288c]
gi|62901134|sp|Q02939.1|TFB2_YEAST RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=General transcription and DNA repair
factor IIH subunit TFB2; Short=TFIIH subunit TFB2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|1163092|gb|AAB68240.1| Tfb2p: TFIIH subunit [Saccharomyces cerevisiae]
gi|1778059|gb|AAB40628.1| transcription/repair factor TFIIH subunit Tfb2, partial
[Saccharomyces cerevisiae]
gi|151942676|gb|EDN61022.1| transcription initiation factor TFIIH subunit [Saccharomyces
cerevisiae YJM789]
gi|190407835|gb|EDV11100.1| RNA polymerase II transcription factor B subunit 2 [Saccharomyces
cerevisiae RM11-1a]
gi|256274256|gb|EEU09164.1| Tfb2p [Saccharomyces cerevisiae JAY291]
gi|259150035|emb|CAY86838.1| Tfb2p [Saccharomyces cerevisiae EC1118]
gi|285815419|tpg|DAA11311.1| TPA: Tfb2p [Saccharomyces cerevisiae S288c]
gi|323331200|gb|EGA72618.1| Tfb2p [Saccharomyces cerevisiae AWRI796]
gi|323335024|gb|EGA76314.1| Tfb2p [Saccharomyces cerevisiae Vin13]
gi|323346175|gb|EGA80465.1| Tfb2p [Saccharomyces cerevisiae Lalvin QA23]
gi|323352000|gb|EGA84539.1| Tfb2p [Saccharomyces cerevisiae VL3]
gi|365762780|gb|EHN04313.1| Tfb2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392295887|gb|EIW06990.1| Tfb2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 513
Score = 105 bits (263), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 97/171 (56%), Gaps = 18/171 (10%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
+G ++VETNF++Y+YS S L +L LF ++ + N+++G IT+ES+ A NGITA+Q
Sbjct: 336 DGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQ 395
Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
II++L+ +AHP++ + + +P V DQIRLW+ +L+RV
Sbjct: 396 IIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYE 455
Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
Y +F + + YA+D LLW+D KK + ++ E + + +F +
Sbjct: 456 GSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDFAK 506
>gi|349581695|dbj|GAA26852.1| K7_Tfb2p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 513
Score = 105 bits (263), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 97/171 (56%), Gaps = 18/171 (10%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
+G ++VETNF++Y+YS S L +L LF ++ + N+++G IT+ES+ A NGITA+Q
Sbjct: 336 DGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQ 395
Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
II++L+ +AHP++ + + +P V DQIRLW+ +L+RV
Sbjct: 396 IIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYE 455
Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
Y +F + + YA+D LLW+D KK + ++ E + + +F +
Sbjct: 456 GSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDFAK 506
>gi|323302721|gb|EGA56527.1| Tfb2p [Saccharomyces cerevisiae FostersB]
Length = 505
Score = 105 bits (263), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 57/169 (33%), Positives = 96/169 (56%), Gaps = 18/169 (10%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
+G ++VETNF++Y+YS S L +L LF ++ + N+++G IT+ES+ A NGITA+Q
Sbjct: 336 DGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQ 395
Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
II++L+ +AHP++ + + +P V DQIRLW+ +L+RV
Sbjct: 396 IIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYE 455
Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREF 194
Y +F + + YA+D LLW+D KK + ++ E + + +F
Sbjct: 456 GSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDF 504
>gi|440493551|gb|ELQ76006.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH, subunit TFB2 [Trachipleistophora
hominis]
Length = 426
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 94/165 (56%), Gaps = 7/165 (4%)
Query: 36 LTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAF 95
L S+A ++ F++VETN+++YAY+TS I++LFS+I +LPNL+ IT+ES+ AF
Sbjct: 269 LFSSAAIQDSFLIVETNYKLYAYTTSSHELSIIKLFSQIVRELPNLVTAHITEESVNAAF 328
Query: 96 ENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSR 155
GIT +QI+ +L + + +P V +QI +WE +R++ A Y F +
Sbjct: 329 LKGITGQQIVDYLTEKSKSE-------LPPVVLEQILIWERQRDRMKCIDAVIYSHFMTY 381
Query: 156 DVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+ +E Y R++ L+ D + LVV E H ++ F++ K
Sbjct: 382 NEYEITYRYCREKGALIDHDEFRRLLVVRLECHNEVKNFIKNNIK 426
>gi|358394641|gb|EHK44034.1| hypothetical protein TRIATDRAFT_222970 [Trichoderma atroviride IMI
206040]
Length = 491
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 108/232 (46%), Gaps = 36/232 (15%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGF-------------- 46
M+ D GL+ + Q K S F PT+LAT L+ EG
Sbjct: 262 MLPSLVDFGLIYIPQ--HKRSMFFPTRLATTLTSGGNSLRTISEGVTAATQSAQTSQQSL 319
Query: 47 -------------VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYN 93
VV+ETN+R+YAY+ S L +L LF+K+ + P+++ G +++ S+
Sbjct: 320 GPLGGGGEQQSGSVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQ 379
Query: 94 AFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHY 148
A GITA+QIIS+L +AH ++ P +P V DQIRLW+ + R++ T
Sbjct: 380 AINFGITADQIISYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFL 439
Query: 149 YEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+ +F + ++ + L+W + N H +++FL+ + +
Sbjct: 440 FRDFTDDKDYLDTARFSEEIGVLVWRNDHARMFFANK--HEQIKDFLKTRKR 489
>gi|207340606|gb|EDZ68904.1| YPL122Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 226
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 97/171 (56%), Gaps = 18/171 (10%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
+G ++VETNF++Y+YS S L +L LF ++ + N+++G IT+ES+ A NGITA+Q
Sbjct: 49 DGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQ 108
Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
II++L+ +AHP++ + + +P V DQIRLW+ +L+RV
Sbjct: 109 IIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYE 168
Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
Y +F + + YA+D LLW+D KK + ++ E + + +F +
Sbjct: 169 GSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDFAK 219
>gi|400597688|gb|EJP65418.1| RNA polymerase II transcription factor B subunit 2 [Beauveria
bassiana ARSEF 2860]
Length = 507
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 69/246 (28%), Positives = 116/246 (47%), Gaps = 51/246 (20%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--------------------- 39
M+ AD GLV + +K S F PT+LAT L+ + S
Sbjct: 265 MLPALADFGLVYIPP--QKRSMFFPTRLATTLTSGASSSLRSISEGVVAATSAALSASSS 322
Query: 40 -------------------SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPN 80
+ + G +V+ETN+R+YAY+ S L +L LFS++ + P+
Sbjct: 323 SSTHQDQNHQSRAGPLGAGAGDQIGSIVIETNYRLYAYTQSTLQIAVLALFSRLTMRFPD 382
Query: 81 LIVGAITKESLYNAFENGITAEQIISFLQQNAHPR------VADRIPSVPENVCDQIRLW 134
++ G IT+ S+ A + GITA+QII++L +AH + +A R P +P V DQIRLW
Sbjct: 383 MVAGRITRTSIRQAIQFGITADQIIAYLAAHAHEQMHRSAALAAR-PVLPPTVVDQIRLW 441
Query: 135 ESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREF 194
+ + R++ T + +F + ++ +A + L+W K A +R++
Sbjct: 442 QLENERMKTTGGFLFRDFDDQKEYQDTRRFADEIGVLVWSSDKTGMFF--ASKFEQIRDY 499
Query: 195 LRGQNK 200
L+ + K
Sbjct: 500 LKSRKK 505
>gi|452824378|gb|EME31381.1| transcription initiation factor TFIIH subunit H4 [Galdieria
sulphuraria]
Length = 740
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/181 (35%), Positives = 94/181 (51%), Gaps = 20/181 (11%)
Query: 5 FADLGLVKLQQVG--RKESWFIPTKLATNLSMS---LTD----------SSARKEGFVVV 49
ADLGL+ L K+S+F T L L+ S L D S+ + ++V
Sbjct: 271 LADLGLIYLHISSSLEKDSYFYVTPLGVTLTRSICWLGDKNNYLLLEDWSNVDNDCRMIV 330
Query: 50 ETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQ 109
+TNFR+Y Y+ +L LF + Y+LPN+ VG IT++S+ A NGITA+Q+IS+LQ
Sbjct: 331 QTNFRVYVYTNCTFQISLLSLFIQFLYRLPNMAVGVITRDSIRTALNNGITAQQMISYLQ 390
Query: 110 QNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRS 169
+ HP + ++P + DQIRLWE+ RV A + F + FE Y R
Sbjct: 391 NHMHPNMKGKLPIT---IIDQIRLWEAQRFRVTTKHALLLDHFDNMTCFEKT--YGTTRI 445
Query: 170 G 170
G
Sbjct: 446 G 446
>gi|321265552|ref|XP_003197492.1| subunit of TFIIH and nucleotide excision repair factor 3 complexes
[Cryptococcus gattii WM276]
gi|317463972|gb|ADV25705.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes,
putative [Cryptococcus gattii WM276]
Length = 481
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 110/200 (55%), Gaps = 8/200 (4%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL----SMSLTDSSARKEGFVVVETNFRMYA 57
+ D A G + + W PT LAT+L + ++ SA + F+++ETN+++YA
Sbjct: 275 LNDLAQYGFIYKPSPDSDQFW--PTHLATSLCSGDASAIQSQSADDKRFLILETNYKIYA 332
Query: 58 YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
Y++++L IL LF I + PNL+VG + ++ + A E GI+A QII++L +AHP++
Sbjct: 333 YTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISARQIIAYLSSHAHPQMY 392
Query: 118 DRIPS-VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
+ P + + DQ+ LW+ + NR++ YE F S+++F+ + A+ + L +
Sbjct: 393 NSPPPLLHPTIVDQLHLWDRERNRLQTEETVMYEFF-SKELFDDTVNEAKANAALQHAAT 451
Query: 177 KKMRLVVNAEIHMHMREFLR 196
+ L + + EF++
Sbjct: 452 SQKLLFIEPHTKPAITEFVK 471
>gi|346972761|gb|EGY16213.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
dahliae VdLs.17]
Length = 476
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/219 (30%), Positives = 111/219 (50%), Gaps = 21/219 (9%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLA-TNLSM---SLTDS----------SARKEGF 46
M+ D GLV + + + P A TN+S +LTD +
Sbjct: 259 MLPSLMDFGLVYIPSHKPFQYFVTPQARALTNVSTATRTLTDGMNALSQPQGGPGDQTNG 318
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
++VETN+R+YAY+TS+L +L LF + + P ++ G +T++S+ A + GITA+QIIS
Sbjct: 319 IIVETNYRVYAYTTSQLQIAVLSLFCHLSVKFPGMVSGRLTRQSVRQAIDFGITADQIIS 378
Query: 107 FLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
+L +AH ++ AD P +P V DQIRLW+ + R++ T + +F +
Sbjct: 379 YLAAHAHEQMHRQAAADGKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFDDDAEYRDI 438
Query: 162 CDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+A + L W + +K A H +R++L+ + K
Sbjct: 439 ARFADEIGVLAWRNDRKRMFF--ASKHEQIRDYLKLRKK 475
>gi|302405781|ref|XP_003000727.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
albo-atrum VaMs.102]
gi|261360684|gb|EEY23112.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
albo-atrum VaMs.102]
Length = 476
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/219 (30%), Positives = 111/219 (50%), Gaps = 21/219 (9%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLA-TNLSM---SLTDS----------SARKEGF 46
M+ D GLV + + + P A TN+S +LTD +
Sbjct: 259 MLPSLMDFGLVYIPSHKPFQYFVTPQARALTNVSTATRTLTDGMNALSQSQGGPGDQTNG 318
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
++VETN+R+YAY+TS+L +L LF + + P ++ G +T++S+ A + GITA+QIIS
Sbjct: 319 IIVETNYRVYAYTTSQLQIAVLSLFCHLSVKFPGMVSGRLTRQSVRQAIDFGITADQIIS 378
Query: 107 FLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
+L +AH ++ AD P +P V DQIRLW+ + R++ T + +F +
Sbjct: 379 YLAAHAHEQMHRQAAADGKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFDDDAEYRDI 438
Query: 162 CDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+A + L W + +K A H +R++L+ + K
Sbjct: 439 ARFADEIGVLAWRNDRKRMFF--ASKHEQIRDYLKLRKK 475
>gi|322697444|gb|EFY89224.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Metarhizium acridum CQMa 102]
Length = 541
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 94/166 (56%), Gaps = 9/166 (5%)
Query: 41 ARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 100
A ++G V+VETN+R+YAY+ S L +L LF K+ + P+++ G +T+ S++ A GIT
Sbjct: 377 ADQKGSVIVETNYRIYAYTQSTLQIAVLALFCKLNMRFPDMVAGRVTRTSIHQAINFGIT 436
Query: 101 AEQIISFLQQNAHPR------VADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPS 154
A+QIIS+L +AH + +A++ P +P V DQIRLW+ + R++ T + +F
Sbjct: 437 ADQIISYLAAHAHEQMHRTAALANK-PILPPTVVDQIRLWQLENERMKTTGGFLFRDFED 495
Query: 155 RDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
F +A + L+W K N + +R++L+ + +
Sbjct: 496 HKEFLDTARFAEEIGVLVWRSDKSGMFFANK--YEQIRDYLKSRKR 539
>gi|134118870|ref|XP_771938.1| hypothetical protein CNBN1180 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50254542|gb|EAL17291.1| hypothetical protein CNBN1180 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 481
Score = 103 bits (258), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 110/200 (55%), Gaps = 8/200 (4%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL----SMSLTDSSARKEGFVVVETNFRMYA 57
+ D A G + + W PT LAT+L + ++ SA + F+++ETN+++YA
Sbjct: 275 LDDLAQYGFIYKSSPDSDQFW--PTHLATSLCSGDASAIQSQSADDKRFLILETNYKIYA 332
Query: 58 YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
Y++++L IL LF I + PNL+VG + ++ + A E GI+A QII++L +AHP++
Sbjct: 333 YTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISARQIIAYLSSHAHPQMY 392
Query: 118 DRIPS-VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
+ P + + DQ+ LW+ + NR++ YE F S+++F+ + A+ + L +
Sbjct: 393 NSPPPLLHPTIVDQLYLWDRERNRLQTEETVMYEFF-SKELFDDTVNEAKANAALQHAAT 451
Query: 177 KKMRLVVNAEIHMHMREFLR 196
+ L + + EF++
Sbjct: 452 SQKLLFIEPHTKPAITEFVK 471
>gi|66358470|ref|XP_626413.1| possible transcription factor TFIIH [Cryptosporidium parvum Iowa
II]
gi|46227864|gb|EAK88784.1| possible transcription factor TFIIH [Cryptosporidium parvum Iowa
II]
Length = 709
Score = 103 bits (257), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 110/208 (52%), Gaps = 28/208 (13%)
Query: 19 KESWFIPTKLATNLSMS----LTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKI 74
+E W T +N+ ++ L+ +S R E ++V++NFR+Y Y+ S L +ILR ++
Sbjct: 496 EELWHKSTTETSNVKINEPNCLSMNSTRLEAGIIVQSNFRIYCYTASPLQAKILRHLCQV 555
Query: 75 EYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI-----PSVPENVCD 129
+ + PN+I G +T++ L +A+ G++AEQI+ F NAHP + R +P NV
Sbjct: 556 KVRGPNIICGILTRKGLLSAYSMGVSAEQILRFFSSNAHPIILRRFMLEGTSIIPVNVET 615
Query: 130 QIRLWESDLNRVEMTPAHYYEEFPSR----DVFEAACDYARDRSGLLW------------ 173
Q++LWE D NR++++ A + ++ + +F YAR + LL+
Sbjct: 616 QLKLWEKDKNRLKISHASTFSDWGASPNDIQLFRQTILYARSKDILLYNSPIELTEKELN 675
Query: 174 ---EDSKKMRLVVNAEIHMHMREFLRGQ 198
E KK+ LV+ E ++ F+R +
Sbjct: 676 LNVELQKKIILVIKQEYEDDIKTFIRTK 703
>gi|366991461|ref|XP_003675496.1| hypothetical protein NCAS_0C01390 [Naumovozyma castellii CBS 4309]
gi|342301361|emb|CCC69129.1| hypothetical protein NCAS_0C01390 [Naumovozyma castellii CBS 4309]
Length = 516
Score = 103 bits (256), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 56/171 (32%), Positives = 95/171 (55%), Gaps = 18/171 (10%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
+G +++ETNF++Y+YS S L L LF ++ + N++ G IT+ES+ A NGITA+Q
Sbjct: 340 DGSLIIETNFKLYSYSNSPLQIATLSLFVHLKSRFANMVTGQITRESIRRALINGITADQ 399
Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
II++L+ +AHP++ D + +P V DQI+LW+ +L+RV
Sbjct: 400 IIAYLETHAHPQMRRLAEERLEKKLELDPNSKDPLQVLPPTVVDQIKLWQLELDRVITYE 459
Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
Y +F + + YA+D LLW+D KK + V+ E + + ++ +
Sbjct: 460 GSLYSDFETIAEYTTLSKYAQDIGVLLWKDDKKKKFFVSKEGNSQVLDYAK 510
>gi|85691005|ref|XP_965902.1| hypothetical protein ECU01_0670 [Encephalitozoon cuniculi GB-M1]
gi|19068469|emb|CAD24937.1| hypothetical protein [Encephalitozoon cuniculi GB-M1]
gi|449329766|gb|AGE96035.1| hypothetical protein ECU01_0670 [Encephalitozoon cuniculi]
Length = 414
Score = 102 bits (254), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 107/198 (54%), Gaps = 18/198 (9%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS-MSLTDSSAR-KEGFVVVETNFRMYAY 58
+ K LG+++L + G LA S + L ++S R + F++VETN ++YAY
Sbjct: 230 LFKYLEALGILRLYEKG----------LAIGRSFVQLFEASERNRREFIIVETNNKIYAY 279
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S+ ++ LF + LPNLI G IT+ES+ AF+ GIT +QII FL+ +
Sbjct: 280 TNSEYEKSVIHLFCNVTVNLPNLIKGIITEESVNVAFDKGITGKQIIHFLEASV------ 333
Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
R S+P + +QI +WES NR+ M P + Y F + ++ ++ + L+ D K
Sbjct: 334 RQGSLPPAIRNQIVIWESKRNRIFMVPGYLYSNFLNLSDYQKVLEFCIRGNHLVESDVDK 393
Query: 179 MRLVVNAEIHMHMREFLR 196
+VV E H ++EF++
Sbjct: 394 RMIVVKLEGHSLVKEFVK 411
>gi|58262306|ref|XP_568563.1| hypothetical protein CNN01190 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57230737|gb|AAW47046.1| hypothetical protein CNN01190 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 481
Score = 102 bits (254), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 109/200 (54%), Gaps = 8/200 (4%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNL----SMSLTDSSARKEGFVVVETNFRMYA 57
+ D A G + + W PT LAT+L + ++ SA + F+++ETN+++YA
Sbjct: 275 LDDLAQYGFIYKSSPDSDQFW--PTHLATSLCSGDASAIQSQSADDKRFLILETNYKIYA 332
Query: 58 YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
Y++++L IL LF I + PNL+VG + ++ + A E GI+A QII++L +AHP++
Sbjct: 333 YTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISARQIIAYLSSHAHPQMY 392
Query: 118 DRIPS-VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
+ P + + DQ+ LW+ + NR++ YE F S+++F+ + A+ + L
Sbjct: 393 NSPPPLLHPTIVDQLYLWDRERNRLQTEETVMYEFF-SKELFDDTVNEAKANAALQHAAI 451
Query: 177 KKMRLVVNAEIHMHMREFLR 196
+ L + + EF++
Sbjct: 452 SQKLLFIEPHTKPAITEFVK 471
>gi|429965574|gb|ELA47571.1| hypothetical protein VCUG_00894 [Vavraia culicis 'floridensis']
Length = 424
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 55/165 (33%), Positives = 93/165 (56%), Gaps = 7/165 (4%)
Query: 36 LTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAF 95
L S++ ++ F+VVETN+++YAY+TS I++LF +I +LPNL+V IT+ES+ AF
Sbjct: 267 LFSSASTQDSFLVVETNYKLYAYTTSSHELSIIKLFCQIIRELPNLVVAHITEESVNAAF 326
Query: 96 ENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSR 155
GIT +QI+ +L + + +P V +QI +WE +R++ A Y F +
Sbjct: 327 VKGITGQQIVDYLNEKSRSE-------LPVVVLEQILIWERKRDRMKCMDAVIYSHFMTY 379
Query: 156 DVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+E Y +++ L+ D + LVV E H ++ F++ K
Sbjct: 380 GEYEITYKYCKEKGALVDYDEFRRLLVVKLEYHNDVKNFIKSNIK 424
>gi|336270172|ref|XP_003349845.1| hypothetical protein SMAC_00733 [Sordaria macrospora k-hell]
gi|380095234|emb|CCC06707.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 496
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 68/240 (28%), Positives = 114/240 (47%), Gaps = 51/240 (21%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSLTDSSARKEG----- 45
M+ D GL+ + R+ F PT+LAT L S + T ++A G
Sbjct: 266 MLPSLVDFGLIYFPRDTRQ---FFPTRLATTLTSTASALRTVSSAFTAATANPSGEPGTT 322
Query: 46 --------------------FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGA 85
+++ETN+R+YAY+TS L +L LF ++ + PN++ A
Sbjct: 323 TSTSGGPAGATPADKTGTSKGIIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSAA 382
Query: 86 ITKESLYNAFENGITAEQIISFLQQNAHPRVADRI-----PSVPENVCDQIRLWESDLNR 140
+ A GITA+QIIS+L +AH ++ +P V DQIRLW+ + R
Sbjct: 383 L------RAIGFGITADQIISYLHAHAHQQMVREATVTGKTVLPPTVVDQIRLWQLENER 436
Query: 141 VEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
++ T +++F S + + A YA + L+W++ K R + A H +R++L+ + K
Sbjct: 437 MQTTAGFLFKDFESTEEYAALSRYAEEIGVLVWKNDK--RQLFFASKHEQLRDYLKSRKK 494
>gi|402083674|gb|EJT78692.1| RNA polymerase II transcription factor B subunit 2 [Gaeumannomyces
graminis var. tritici R3-111a-1]
Length = 496
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/228 (28%), Positives = 113/228 (49%), Gaps = 33/228 (14%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM----------------------SLTD 38
M+ D GLV + G +F PT+LAT L+ S
Sbjct: 271 MLPSLVDFGLVYIPP-GNTSQYF-PTRLATTLTSGSSALRSASSALAAATAEGPNASGGG 328
Query: 39 SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
S +G +++ETN+R+YAY+++ L IL LF+++ ++ ++ G + +ES+ A G
Sbjct: 329 QSEAAKGSIIIETNYRLYAYTSTPLQIAILGLFAELRFRFAGMVTGRLDRESIKRAISYG 388
Query: 99 ITAEQIISFLQQNAHP---RVAD--RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFP 153
ITA+Q+I +L +AH R A + P +P V DQIRLW+ + R+++ + +F
Sbjct: 389 ITADQVIEYLAAHAHEQMHRTATLRKKPVLPPTVVDQIRLWQLETERMKIMRGFLFRDFD 448
Query: 154 SRDVFEAACDYARDRSGLLW-EDSKKMRLVVNAEIHMHMREFLRGQNK 200
S+ ++ YA + LLW D++++ + + FL+ + K
Sbjct: 449 SQAEYDDLAKYADEIGVLLWRSDARQLFFASKVD---QLSVFLKARKK 493
>gi|300707677|ref|XP_002996037.1| hypothetical protein NCER_100934 [Nosema ceranae BRL01]
gi|239605297|gb|EEQ82366.1| hypothetical protein NCER_100934 [Nosema ceranae BRL01]
Length = 403
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 96/160 (60%), Gaps = 6/160 (3%)
Query: 39 SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
S +K F+++ETNF++YAY++S I++LFS I ++PNLI +IT+ESL NAF G
Sbjct: 247 SDTKKLRFILLETNFKLYAYTSSVYEMSIIQLFSNIYLKMPNLIKASITEESLSNAFSKG 306
Query: 99 ITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVF 158
+T++QII+FL+ ++ D +P + QI +WE+ R+++ P + Y F + +
Sbjct: 307 VTSQQIINFLK--SYSLFED----IPVAIISQIIIWETKRKRIKIFPGYLYSNFLNLIDY 360
Query: 159 EAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
+ + ++ +D +K +V+ + + +++F++ Q
Sbjct: 361 QKVVKFCLSNDCIIEKDDEKRMIVIKPDHNEVVKKFVKQQ 400
>gi|322711210|gb|EFZ02784.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Metarhizium anisopliae ARSEF 23]
Length = 540
Score = 100 bits (249), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 68/250 (27%), Positives = 115/250 (46%), Gaps = 55/250 (22%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD---------------------- 38
M+ D GL+ + Q K S F PT+LAT L+ +
Sbjct: 294 MLPSLLDFGLIYIPQ--HKRSMFFPTRLATTLTSGGSSLRTISEGVAAATAAATTTAPAG 351
Query: 39 ------------------SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPN 80
SA ++G V+VETN+R+YAY+ S L +L LF K+ + P+
Sbjct: 352 GPSSQSGSGSGGTGGPLGPSADQKGSVIVETNYRIYAYTQSTLQIAVLALFCKLNMRFPD 411
Query: 81 LIVGAITKESLYNAFENGITAEQIISFLQQNAHPR------VADRIPSVPENVCDQIRLW 134
++ G +T+ S++ A GI+A+QIIS+L +AH + +A++ P +P V DQIRLW
Sbjct: 412 MVAGRVTRTSIHQAINFGISADQIISYLAAHAHEQMHRSAALANK-PILPPTVVDQIRLW 470
Query: 135 ESDLN----RVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMH 190
+ + R++ T + +F F +A + L+W K N +
Sbjct: 471 QLEWQLENERMKTTGGFLFRDFEDHKEFLDTARFAEEIGVLVWRSDKSGMFFANK--YEQ 528
Query: 191 MREFLRGQNK 200
+R++L+ + +
Sbjct: 529 IRDYLKSRKR 538
>gi|310799661|gb|EFQ34554.1| transcription factor tfb2 [Glomerella graminicola M1.001]
Length = 485
Score = 99.8 bits (247), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 90/164 (54%), Gaps = 7/164 (4%)
Query: 42 RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
R F + ETN+R+YAY + L +L LF K++ + +++ G +T+ S+ NA E GITA
Sbjct: 321 RVRPFRLSETNYRVYAYGQTPLQIAVLSLFCKLKLRFADMVSGRLTRNSIRNAVERGITA 380
Query: 102 EQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRD 156
+QIIS+L +AH ++ A P +P V DQIRLW+ + R+ T + +F S
Sbjct: 381 DQIISYLAAHAHEQMHRMAAARSRPILPPTVIDQIRLWQLETERMTTTSGFLFRDFDSPR 440
Query: 157 VFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
++ YA + L+W + K A H +R++L+ + K
Sbjct: 441 EYDDIAGYAAEIGVLVWRNDKLGMFF--ASKHEQIRDYLKLRKK 482
>gi|194388768|dbj|BAG60352.1| unnamed protein product [Homo sapiens]
Length = 386
Score = 99.4 bits (246), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 49/121 (40%), Positives = 79/121 (65%), Gaps = 4/121 (3%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
++ + GLV Q RK + PT+LA NLS ++ + + GF+VVETN+R+YAY+
Sbjct: 268 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 325
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 326 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 385
Query: 120 I 120
+
Sbjct: 386 V 386
>gi|449017614|dbj|BAM81016.1| similar to TFIIH subunit TFB2 [Cyanidioschyzon merolae strain 10D]
Length = 693
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/158 (35%), Positives = 89/158 (56%), Gaps = 9/158 (5%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
++VETNFR+YAY+ S +L LF++I Y++P + +G IT++S+ A + GITA+Q++
Sbjct: 538 IIVETNFRLYAYAASSFQVALLSLFTRILYRMPGVAIGVITRDSVRRALKCGITAKQLLH 597
Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF-PSRD---VFEAAC 162
FL ++ SVP NV DQI LWE + R++ P E F P+ D F+
Sbjct: 598 FLGIHSM-----EGKSVPFNVHDQILLWELERKRIQAYPGVLLEGFEPTSDGRAFFDQLQ 652
Query: 163 DYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
+Y + W D + LV +A +RE++R Q +
Sbjct: 653 EYTVELGAQQWCDRVRQLLVADASSFERLREWIRRQAQ 690
>gi|221482097|gb|EEE20458.1| tfiih, polypeptide, putative [Toxoplasma gondii GT1]
Length = 861
Score = 96.7 bits (239), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 74/108 (68%), Gaps = 5/108 (4%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
E ++V++NF++Y Y+ S L +L +++ ++PNLIVG +T+ S+ A+++GITA+Q
Sbjct: 672 EAGMLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQ 731
Query: 104 IISFLQQNAHPRVADR-----IPSVPENVCDQIRLWESDLNRVEMTPA 146
II FL+ +AHP V +R P +PENV Q+R+WE++ R+ + PA
Sbjct: 732 IIRFLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 779
>gi|401410644|ref|XP_003884770.1| hypothetical protein NCLIV_051680 [Neospora caninum Liverpool]
gi|325119188|emb|CBZ54742.1| hypothetical protein NCLIV_051680 [Neospora caninum Liverpool]
Length = 868
Score = 96.3 bits (238), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 74/108 (68%), Gaps = 5/108 (4%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
E ++V++NF++Y Y+ S L +L +++ ++PNLIVG +T+ S+ A+++GITA+Q
Sbjct: 679 EAGMLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQ 738
Query: 104 IISFLQQNAHPRVADR-----IPSVPENVCDQIRLWESDLNRVEMTPA 146
II FL+ +AHP V +R P +PENV Q+R+WE++ R+ + PA
Sbjct: 739 IIRFLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 786
>gi|237842971|ref|XP_002370783.1| hypothetical protein TGME49_014520 [Toxoplasma gondii ME49]
gi|211968447|gb|EEB03643.1| hypothetical protein TGME49_014520 [Toxoplasma gondii ME49]
Length = 836
Score = 96.3 bits (238), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 74/108 (68%), Gaps = 5/108 (4%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
E ++V++NF++Y Y+ S L +L +++ ++PNLIVG +T+ S+ A+++GITA+Q
Sbjct: 647 EAGMLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQ 706
Query: 104 IISFLQQNAHPRVADR-----IPSVPENVCDQIRLWESDLNRVEMTPA 146
II FL+ +AHP V +R P +PENV Q+R+WE++ R+ + PA
Sbjct: 707 IIRFLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 754
>gi|221502519|gb|EEE28246.1| tfiih, polypeptide, putative [Toxoplasma gondii VEG]
Length = 836
Score = 96.3 bits (238), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 74/108 (68%), Gaps = 5/108 (4%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
E ++V++NF++Y Y+ S L +L +++ ++PNLIVG +T+ S+ A+++GITA+Q
Sbjct: 647 EAGMLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQ 706
Query: 104 IISFLQQNAHPRVADR-----IPSVPENVCDQIRLWESDLNRVEMTPA 146
II FL+ +AHP V +R P +PENV Q+R+WE++ R+ + PA
Sbjct: 707 IIRFLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 754
>gi|398406290|ref|XP_003854611.1| hypothetical protein MYCGRDRAFT_69317 [Zymoseptoria tritici IPO323]
gi|339474494|gb|EGP89587.1| hypothetical protein MYCGRDRAFT_69317 [Zymoseptoria tritici IPO323]
Length = 493
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 98/186 (52%), Gaps = 26/186 (13%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA----------------RKE 44
M+ D D GL+ + + + PT+LAT L+ +D+ A ++
Sbjct: 266 MLDDLQDFGLI--YRSSPTATAYYPTRLATTLT---SDAPALPNNTSSTTTTAATESNEK 320
Query: 45 GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
G++++ETN+R+YAY++S L IL LFS + + PNLI ITK S+ +A IT++QI
Sbjct: 321 GYIILETNYRLYAYTSSPLQISILSLFSTLHTRYPNLITAKITKSSIASAISQSITSDQI 380
Query: 105 ISFLQQNAHPRVADR-----IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFE 159
I++L +AHP + + P +P V DQIRLW+ + R+ Y + + +
Sbjct: 381 ITYLSTHAHPVLRRQAAIMSAPILPPTVVDQIRLWQIEGERMTTWKGFYIRDVGGWEEYV 440
Query: 160 AACDYA 165
A +YA
Sbjct: 441 KAVEYA 446
>gi|402465832|gb|EJW01474.1| hypothetical protein EDEG_03937 [Edhazardia aedis USNM 41457]
Length = 465
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 90/156 (57%), Gaps = 12/156 (7%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
E F+ +ETNF++YAY+TSK IL LFSKI +LPNLI I ++S+ AF+ I+A+Q
Sbjct: 316 ESFMYIETNFKLYAYTTSKYDFSILSLFSKISCKLPNLISAIINEDSVNTAFDKKISAKQ 375
Query: 104 IISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACD 163
I +L+ +VP+NV +Q+ +WES NR++ + ++ F + F+ A D
Sbjct: 376 ISYYLKSKG--------KNVPKNVVEQVYIWESKRNRIKTSECTLFKGFLNLMDFKKAVD 427
Query: 164 YARDRSGL--LWEDSKKMRLVVNAEIHMHMREFLRG 197
+++ L ++ED K + VN +E+++
Sbjct: 428 VCKEKHWLVDVYED--KRFIFVNNLYAEEFKEYIKN 461
>gi|164660182|ref|XP_001731214.1| hypothetical protein MGL_1397 [Malassezia globosa CBS 7966]
gi|159105114|gb|EDP44000.1| hypothetical protein MGL_1397 [Malassezia globosa CBS 7966]
Length = 231
Score = 93.2 bits (230), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 81/120 (67%), Gaps = 7/120 (5%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR-----KEGFVVVETNFRMY 56
+++ +D GLV + + +F PT+LA+ L+ + + +R ++G++++ETN+R+Y
Sbjct: 102 LEELSDYGLVY--RPSKSAKYFFPTRLASTLTSTASPLLSRLNDQEEQGYLILETNYRVY 159
Query: 57 AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV 116
AY+ + L IL LF ++ +LPNL++G +T+ S+ +A GITA+QII++L +AHP++
Sbjct: 160 AYTANLLRIAILNLFVTLKSRLPNLVIGQLTRHSVKSALNKGITADQIITYLTHHAHPQM 219
>gi|71033737|ref|XP_766510.1| hypothetical protein [Theileria parva strain Muguga]
gi|68353467|gb|EAN34227.1| hypothetical protein, conserved [Theileria parva]
Length = 359
Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 94/158 (59%), Gaps = 18/158 (11%)
Query: 24 IPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIV 83
+PT +LS+ A + ++V++NF++Y Y+ S L +L +++ + PNL+V
Sbjct: 146 LPTSKDLDLSIK-----AGNDSRIIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLVV 200
Query: 84 GAITKESLYNAFENGITAEQIISFLQQNA-HPRVADR---------IPSVPENVCDQIRL 133
G +T+ES+ +AF++GIT+++II FL N + + ++ S+PENVC Q+++
Sbjct: 201 GVLTRESVQSAFKSGITSKEIIRFLSPNGMNSSIGNQENTLLNSTFTYSIPENVCRQLKM 260
Query: 134 WESDLNRVEMTPAHYYEEFPSR---DVFEAACDYARDR 168
WES+ +R+E+ P+ ++ + D+F+ +A+ +
Sbjct: 261 WESERDRIELCPSIVFKRWDQDFMPDLFQRTVRWAQSK 298
>gi|422294120|gb|EKU21420.1| transcription initiation factor TFIIH subunit 4, partial
[Nannochloropsis gaditana CCMP526]
Length = 213
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 66/205 (32%), Positives = 101/205 (49%), Gaps = 40/205 (19%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---------KEG------ 45
+++ F GL+ Q R F PT +A NL T R EG
Sbjct: 16 LLEKFISFGLI--YQDDRHSRHFYPTAVAVNLIFGGTVQEERLRRGHGHVGPEGRENTKE 73
Query: 46 ---------FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 96
V+VETN+++ AY++S LH E+LR+F+ + +LPN+++G IT+ S+ A
Sbjct: 74 LRLVDPSQLAVIVETNYQLVAYTSSSLHVEMLRIFTDVRCRLPNVVIGFITRASVRRAMA 133
Query: 97 NGITAEQIISFLQQNAHPRV-ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSR 155
+GITA I+SFL+ + H V A + +PENV QI LW + +RV+ +EE
Sbjct: 134 SGITAATILSFLKTHTHVAVRAGKGRLLPENVEAQIELWHQERSRVK------FEEVMMI 187
Query: 156 DV-------FEAACDYARDRSGLLW 173
D+ FE YA + + + W
Sbjct: 188 DLSSLMVEEFEEVRTYAENLAVVCW 212
>gi|84998132|ref|XP_953787.1| TFIIH subunit, Tf2b [Theileria annulata]
gi|65304784|emb|CAI73109.1| TFIIH subunit, Tf2b homologue, putative [Theileria annulata]
Length = 645
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 86/141 (60%), Gaps = 13/141 (9%)
Query: 41 ARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 100
A + ++V++NF++Y Y+ S L +L +++ + PNL+VG +T+ES+ +AF++GIT
Sbjct: 445 AGNDSRIIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLVVGVLTRESVQSAFKSGIT 504
Query: 101 AEQIISFLQQNA-HPRVADR---------IPSVPENVCDQIRLWESDLNRVEMTPAHYYE 150
+++II FL N + + + S+PENVC Q+++WES+ +R+E+ P+ ++
Sbjct: 505 SKEIIRFLSPNGMNSSIGSQENTLLNSSFTYSIPENVCRQLKMWESERDRIELCPSIVFK 564
Query: 151 EFPSR---DVFEAACDYARDR 168
+ D+F+ +A+ +
Sbjct: 565 RWDQDFMPDLFQRTVRWAQSK 585
>gi|302831319|ref|XP_002947225.1| hypothetical protein VOLCADRAFT_35893 [Volvox carteri f.
nagariensis]
gi|300267632|gb|EFJ51815.1| hypothetical protein VOLCADRAFT_35893 [Volvox carteri f.
nagariensis]
Length = 90
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 54/85 (63%)
Query: 98 GITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDV 157
GI A++++S+L HP +A R P VPE V DQIRLWE+ +NR+ YE SR++
Sbjct: 1 GIAADELVSYLAARPHPSIASRTPVVPEVVSDQIRLWEASMNRLRADSVVLYENLASREL 60
Query: 158 FEAACDYARDRSGLLWEDSKKMRLV 182
FE A ++R LLWEDS +MR V
Sbjct: 61 FERALAFSRSSGTLLWEDSGQMRFV 85
>gi|323455511|gb|EGB11379.1| hypothetical protein AURANDRAFT_52511 [Aureococcus anophagefferens]
Length = 291
Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/150 (36%), Positives = 83/150 (55%), Gaps = 17/150 (11%)
Query: 8 LGLVKLQQVGRKESWFIPTKLATNLSMSLTDSS--ARKEGF------VVVETNFRMYAYS 59
LGLV + S F PT L +++ S+ AR G V+V+TNF++ AY+
Sbjct: 81 LGLVYRRNA--SSSRFYPTTLGVDVAFGARRSAGGARAGGDFRRPVDVIVQTNFQVLAYT 138
Query: 60 -----TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 114
TS L L LF+++ +LPNL+VG I+++++ + GI QI+ FL+ +AHP
Sbjct: 139 DAGVNTSTLVLATLNLFAELTTRLPNLVVGTISRDAIKRCVDRGIRVPQIVKFLRAHAHP 198
Query: 115 RVADRIPSVPENVCDQIRLWESDLNRVEMT 144
A + VP+NV DQ+ LW + NRV T
Sbjct: 199 --AMKASGVPQNVTDQMALWAGEGNRVAFT 226
>gi|428673252|gb|EKX74165.1| conserved hypothetical protein [Babesia equi]
Length = 668
Score = 89.7 bits (221), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 77/124 (62%), Gaps = 7/124 (5%)
Query: 23 FIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLI 82
+P+ ++ S++ +S ++V++NF++Y Y+ + L +IL + +++ + PN++
Sbjct: 470 LVPSNSPLSVHTSISFTSEYLPSKIIVQSNFKVYVYTVNNLQFDILNILCEVQARTPNMV 529
Query: 83 VGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVE 142
VG +T+ES AF++GIT+ +II F + + PENV Q+R+WE++ NRVE
Sbjct: 530 VGVLTRESAQRAFKSGITSHEIIRFFS-------STNTSTFPENVIRQLRMWEAERNRVE 582
Query: 143 MTPA 146
++PA
Sbjct: 583 LSPA 586
>gi|412992644|emb|CCO18624.1| predicted protein [Bathycoccus prasinos]
Length = 464
Score = 89.7 bits (221), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 42/89 (47%), Positives = 63/89 (70%), Gaps = 5/89 (5%)
Query: 22 WFIPTKLATNLSMSLTDSSA-----RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEY 76
W++PT L+ LS T SSA R +G ++VETNFR+YAY+ S+L E+LRLF++ +Y
Sbjct: 373 WYVPTVLSAGLSSVSTTSSAKSALARIDGHIIVETNFRVYAYTHSELETEVLRLFTRPDY 432
Query: 77 QLPNLIVGAITKESLYNAFENGITAEQII 105
+LPN VG IT++S+ +A GI+ +QI+
Sbjct: 433 KLPNAYVGMITRDSILDAMRAGISPDQIV 461
>gi|124806778|ref|XP_001350829.1| transcription factor Tfb2, putative [Plasmodium falciparum 3D7]
gi|23496958|gb|AAN36509.1| transcription factor Tfb2, putative [Plasmodium falciparum 3D7]
Length = 945
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 108/196 (55%), Gaps = 15/196 (7%)
Query: 3 KDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTSK 62
KD L L K + + +K I L +++ M + S E ++V++NF++Y Y++S
Sbjct: 728 KDEKSLILDKRKNITKK---LINDSLYSDIKM---NGSNNLEIGLIVQSNFKVYLYTSSL 781
Query: 63 LHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPS 122
L IL +++ + PN++VG +T+ S+ NA+ + ITA QII FL+ +HP + S
Sbjct: 782 LKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIKFLESYSHPGKNNFKSS 841
Query: 123 VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSR---DVFEAACDYARDRSGLLW------ 173
+P NV Q++LWES+ +R+ + A ++ F +++ +A ++ LL+
Sbjct: 842 IPMNVITQLKLWESERHRLTLEDAIVFKSFEKDFMPHLYQQIVIWANSKNYLLYYTPWPK 901
Query: 174 EDSKKMRLVVNAEIHM 189
++K+ L + AE ++
Sbjct: 902 NNTKEFDLWIKAEKYL 917
>gi|403221352|dbj|BAM39485.1| transcription factor IIH subunit [Theileria orientalis strain
Shintoku]
Length = 634
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 86/144 (59%), Gaps = 13/144 (9%)
Query: 38 DSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 97
+ +A E ++V++NF++Y Y+ S L +L +++ + PNL+VG +T+ES+ +AF++
Sbjct: 412 NKTAGNESRIIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLVVGVLTRESVQSAFKS 471
Query: 98 GITAEQIISFLQ--------QNAHPRVADRIPS--VPENVCDQIRLWESDLNRVEMTPAH 147
GIT+ +II FL D + +PENVC Q+++WES+ +R+E++PA
Sbjct: 472 GITSNEIIRFLSPMKLSLSYAGTCNSFMDSFANYKIPENVCRQLKMWESERDRIELSPAI 531
Query: 148 YYEEFPSR---DVFEAACDYARDR 168
++ + + ++F+ +A+ +
Sbjct: 532 LFKRWDADFMPELFQRTVRWAQSK 555
>gi|209880858|ref|XP_002141868.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209557474|gb|EEA07519.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 794
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/175 (28%), Positives = 92/175 (52%), Gaps = 12/175 (6%)
Query: 10 LVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILR 69
+V+L I + +NL +S + E ++V++NFR+Y Y+ S L +ILR
Sbjct: 572 IVELNNTSNDIDLKISHSIESNLFLS---GISNLEAGIIVQSNFRVYCYTASPLQAKILR 628
Query: 70 LFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP-----RVADRIPSVP 124
+++ + PN+I G +T+ L +A+ G+ A QI+ F NAHP + D +P
Sbjct: 629 HLCQVKVRGPNIICGVLTRRGLLSAYSMGVKAYQILRFFISNAHPIILKKHITDGTSIIP 688
Query: 125 ENVCDQIRLWESDLNRVEMTPAHYYEEFPSR----DVFEAACDYARDRSGLLWED 175
+V Q++LWE+D NR+E+ + ++ + ++F+ YA + +L+ +
Sbjct: 689 ISVETQLKLWENDHNRLEINKVSLFSDWGNNKEDIELFKQTVTYAIGKQVVLYHN 743
>gi|389586531|dbj|GAB69260.1| hypothetical protein PCYB_146880 [Plasmodium cynomolgi strain B]
Length = 913
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 70/106 (66%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
++V++NF++Y Y+ S L IL +++ + PN++VG +T+ S+ NA+ + ITA QII
Sbjct: 733 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 792
Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
FL+ AHP ++ S+P NV Q++LWES+ +R+ + A ++ F
Sbjct: 793 FLESYAHPGRSNFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 838
>gi|399949686|gb|AFP65344.1| tfIIB-like protein [Chroomonas mesostigmatica CCMP1168]
Length = 464
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/100 (41%), Positives = 64/100 (64%), Gaps = 4/100 (4%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
+++E+NFR+Y Y + L ++L +FS+I YQLPNL VG IT++S+ AF +G T+ II
Sbjct: 329 IIIESNFRIYVYKNAHLGNDLLLVFSEILYQLPNLFVGEITEKSILKAFNSGSTSRNIIG 388
Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPA 146
FL+ N HP P +P V +QI+ WE +++T +
Sbjct: 389 FLKNNLHPIC----PWIPSAVTNQIKAWEFQKTEIKITES 424
>gi|221061889|ref|XP_002262514.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
gi|193811664|emb|CAQ42392.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
Length = 943
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 70/106 (66%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
++V++NF++Y Y+ S L IL +++ + PN++VG +T+ S+ NA+ + ITA QII
Sbjct: 763 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 822
Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
FL+ AHP ++ S+P NV Q++LWES+ +R+ + A ++ F
Sbjct: 823 FLESYAHPGRSNFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 868
>gi|60603004|gb|AAX27959.1| unknown [Schistosoma japonicum]
Length = 100
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 40/76 (52%), Positives = 54/76 (71%)
Query: 39 SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
SS+ G++++ETNFR+YAY+ S L +L LFSKI + PNL+V IT++S+ A G
Sbjct: 8 SSSSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRG 67
Query: 99 ITAEQIISFLQQNAHP 114
ITA QI+SFL NAHP
Sbjct: 68 ITANQILSFLTSNAHP 83
>gi|432098035|gb|ELK27922.1| General transcription factor IIH subunit 4 [Myotis davidii]
Length = 377
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 53/184 (28%), Positives = 89/184 (48%), Gaps = 36/184 (19%)
Query: 18 RKESWFIPTKLATNLS--MSLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIE 75
RK + PT+LA +LS +S +A + GF+VVETN+++YAY+ S+L ++ LFS++
Sbjct: 225 RKSRRYYPTRLAISLSSGVSGAGGTAHQPGFIVVETNYQLYAYTESELQIALIALFSEML 284
Query: 76 YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWE 135
Y+ PN++V + P +P + DQIRLWE
Sbjct: 285 YRFPNMVVA----------------------------------QTPVLPPTITDQIRLWE 310
Query: 136 SDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFL 195
+ +R+ T Y +F S+ F+ A + L++E+ K +VV H ++ F
Sbjct: 311 LERDRLLFTEGVLYNQFLSQVDFDLLLARAWELGALVFENMAKQLMVVTPAGHREVKRFW 370
Query: 196 RGQN 199
+ Q
Sbjct: 371 KQQK 374
>gi|156095867|ref|XP_001613968.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148802842|gb|EDL44241.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 960
Score = 86.7 bits (213), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 69/106 (65%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
++V++NF++Y Y+ S L IL +++ + PN++VG +T+ S+ NA+ + ITA QII
Sbjct: 780 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 839
Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
FL+ AHP + S+P NV Q++LWES+ +R+ + A ++ F
Sbjct: 840 FLESYAHPGRSTFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 885
>gi|156086630|ref|XP_001610724.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797977|gb|EDO07156.1| conserved hypothetical protein [Babesia bovis]
Length = 586
Score = 86.7 bits (213), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 90/159 (56%), Gaps = 16/159 (10%)
Query: 1 MIKDFADLGLVKLQQVGRK-----ESWFIPTKLAT-----NLSMSLTDSSARKEGFVVVE 50
+++ +LGLV G K S+ + K A +SMS+ + K +VV+
Sbjct: 343 ILRFLIELGLVYADSDGSKIYVTNLSFLVDVKHANLVEEHQISMSVCGNKGSK---MVVQ 399
Query: 51 TNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQ 110
+NF++YAY S L +L +++ + PNL++G +T+ SL AF++GITA+Q+I F +
Sbjct: 400 SNFKIYAYIPSALQMNVLNHICELQAKTPNLVIGVLTRSSLQTAFKSGITADQLICFFES 459
Query: 111 NAH---PRVADRIPSVPENVCDQIRLWESDLNRVEMTPA 146
++ + +VPENV Q+++WE++ NR+E+ A
Sbjct: 460 KGQYDDIQINRNVMNVPENVRRQLKMWEAERNRLELLNA 498
>gi|387595518|gb|EIJ93142.1| hypothetical protein NEPG_02098 [Nematocida parisii ERTm1]
Length = 398
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 63/192 (32%), Positives = 96/192 (50%), Gaps = 16/192 (8%)
Query: 5 FADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTSKLH 64
F LGL+ ++ K F PT S+ L D E F+V+E+NFR+Y YS L
Sbjct: 219 FESLGLIFFEKGLVK---FSPT-----FSL-LFDDEEGAEKFLVLESNFRLYIYSNRPLD 269
Query: 65 CEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVP 124
I+ LFS + PN+IV I ++S+ A +GITA QI +L QN+ + +
Sbjct: 270 TFIISLFSIKSREFPNMIVAMINEDSIRQALMHGITAGQIRVYLNQNS-------MYEIN 322
Query: 125 ENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVN 184
ENV +QIRLWE +NR+ ++ + F + F Y + + +K LVV
Sbjct: 323 ENVIEQIRLWEKRMNRIHSWESYIFSNFLNYKDFLLVESYCENNNIDHRSYREKRMLVVG 382
Query: 185 AEIHMHMREFLR 196
E + ++ F+R
Sbjct: 383 IENYESVKSFIR 394
>gi|378754852|gb|EHY64880.1| hypothetical protein NERG_01936 [Nematocida sp. 1 ERTm2]
Length = 396
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/196 (30%), Positives = 95/196 (48%), Gaps = 16/196 (8%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
M+ F LGL+ ++ + ++ S+ L D E F+ +E+NFR+Y YS
Sbjct: 213 MLDLFESLGLITFERG--------LVRFSSTFSL-LFDDEEGGEKFLTLESNFRLYIYSN 263
Query: 61 SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
L I+ LFS + PN+IV I ++S+ A GITA QI +L QN+ +
Sbjct: 264 RPLDVFIISLFSIKIREFPNMIVAMINEDSIRQALTYGITAGQIRVYLNQNS-------M 316
Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
+ ENV +QIRLWE +NR+ ++ + F + F Y + + K
Sbjct: 317 HKINENVLEQIRLWEKRMNRIHAWESYIFSNFLNYKDFLLVESYCENNNVEHRSYRDKRV 376
Query: 181 LVVNAEIHMHMREFLR 196
LVV E + +++ F+R
Sbjct: 377 LVVGVENYDNVKSFIR 392
>gi|387592902|gb|EIJ87926.1| hypothetical protein NEQG_01998 [Nematocida parisii ERTm3]
Length = 398
Score = 84.0 bits (206), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 96/192 (50%), Gaps = 16/192 (8%)
Query: 5 FADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTSKLH 64
F LGL+ ++ K F PT S+ L D E F+V+E+NFR+Y YS L
Sbjct: 219 FESLGLIFFEKGLVK---FSPT-----FSL-LFDDEEGAEKFLVLESNFRLYIYSNRPLD 269
Query: 65 CEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVP 124
I+ LFS + PN++V I ++S+ A +GITA QI +L QN+ + +
Sbjct: 270 TFIISLFSIKSREFPNMMVAMINEDSIRQALMHGITAGQIRVYLNQNS-------MYEIN 322
Query: 125 ENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVN 184
ENV +QIRLWE +NR+ ++ + F + F Y + + +K LVV
Sbjct: 323 ENVIEQIRLWEKRMNRIHSWESYIFSNFLNYKDFLLVESYCENNNIDHRSYREKRMLVVG 382
Query: 185 AEIHMHMREFLR 196
E + ++ F+R
Sbjct: 383 IENYESVKSFIR 394
>gi|403348380|gb|EJY73625.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Oxytricha trifallax]
Length = 532
Score = 83.6 bits (205), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 81/129 (62%), Gaps = 19/129 (14%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS--------------MSLTDS--SARKE 44
+++DF D+GLV ++ R++ + I + L + L ++ D+ S ++
Sbjct: 265 ILRDFDDMGLVGME---RRQKFVITSLLQSFLQSQTGQLQKSSDSSSSNVIDAQRSQSQD 321
Query: 45 GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
F++VETNF++YAY++S L+ +LR F ++E PNL+VG +T++SL AF+ GI++ QI
Sbjct: 322 KFLIVETNFKVYAYTSSDLYRALLRQFIRVECIFPNLVVGTLTRKSLQKAFQRGISSGQI 381
Query: 105 ISFLQQNAH 113
+SFL+ + H
Sbjct: 382 LSFLESHIH 390
>gi|399219100|emb|CCF75987.1| unnamed protein product [Babesia microti strain RI]
Length = 612
Score = 83.2 bits (204), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 65/104 (62%), Gaps = 4/104 (3%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
E ++V++NF++Y Y+ S L +L +++ + PNL++G +T+ S + AF GITA+Q
Sbjct: 429 ESGIIVQSNFKVYVYTASPLQINVLAHLCELQSRTPNLVIGVLTRASAHAAFRAGITAKQ 488
Query: 104 IISFLQQNAHPRVADRI----PSVPENVCDQIRLWESDLNRVEM 143
I FL+ ++HP + + +P NV Q+ +WE++ NR+ +
Sbjct: 489 ICQFLETHSHPILLQNVREGGSYLPNNVVTQLNMWEAERNRISL 532
>gi|429854513|gb|ELA29524.1| tfiih and nucleotide excision repair factor 3 complexes subunit
[Colletotrichum gloeosporioides Nara gc5]
Length = 509
Score = 82.8 bits (203), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 63/101 (62%), Gaps = 5/101 (4%)
Query: 43 KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
+ G ++VETN+R+YAY + L +L LF K++ + +++ G +T+ S+ NA E GITA+
Sbjct: 369 RSGSIIVETNYRVYAYGQTPLQIAVLSLFCKLKLRFADMVSGRLTRNSIRNAVERGITAD 428
Query: 103 QIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDL 138
QIIS+L +AH ++ P +P V DQIRL D+
Sbjct: 429 QIISYLASHAHEQMHRFAAVRNKPVLPPTVIDQIRLEYEDI 469
>gi|365758016|gb|EHM99881.1| Tfb2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 419
Score = 82.4 bits (202), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 82/162 (50%), Gaps = 48/162 (29%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS--------------------------- 33
M++D D GLV + + +S F PTKLA L+
Sbjct: 249 MLQDMRDYGLVFQKHLN--DSIFYPTKLALMLTSDTKTIISASNAMDSVLRQNREEPSVN 306
Query: 34 ------MSLTDSSARKE-------------GFVVVETNFRMYAYSTSKLHCEILRLFSKI 74
S TD++A + G ++VETNF++Y+YS S L +L LF +
Sbjct: 307 EDGANGKSTTDTTASDDLNKAGSKNQDIPDGSLIVETNFKIYSYSNSPLQIAVLSLFVHL 366
Query: 75 EYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV 116
+ + N+++G IT+ES+ NA NGITA+QII++L+ +AHP++
Sbjct: 367 KARFVNMVLGQITRESIRNALTNGITADQIIAYLETHAHPQM 408
>gi|392594381|gb|EIW83705.1| transcription factor Tfb2 [Coniophora puteana RWD-64-598 SS2]
Length = 393
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 70/128 (54%), Gaps = 26/128 (20%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTDSSARK-------------- 43
M++D D GLV ++ K F PT+LAT L S SL + +
Sbjct: 262 MLEDLRDYGLVWQRKASSKR--FSPTRLATTLTSVSPSLPTAGGSRNPGSAPGASSVANS 319
Query: 44 -------EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 96
GF+V+ETN+R+YAY+ + L +L LF ++Y+ PNL+VG +T+ES+ A
Sbjct: 320 TGINTNDHGFIVLETNYRVYAYTDNPLQIAVLNLFITLKYRFPNLVVGMLTRESVRRALG 379
Query: 97 NGITAEQI 104
NGI+AEQI
Sbjct: 380 NGISAEQI 387
>gi|82704497|ref|XP_726580.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23482045|gb|EAA18145.1| Drosophila melanogaster LD30622p-related [Plasmodium yoelii yoelii]
Length = 926
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 69/106 (65%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
+++++NF++Y Y++S L IL +++ + PN++VG +T+ S+ NA+ + ITA+QII
Sbjct: 746 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 805
Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
FL+ +HP +P NV Q++LWE++ +R+ + + ++ F
Sbjct: 806 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLTLEDSIVFKNF 851
>gi|70950974|ref|XP_744765.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56524852|emb|CAH80923.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 922
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 69/106 (65%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
+++++NF++Y Y++S L IL +++ + PN++VG +T+ S+ NA+ + ITA+QII
Sbjct: 742 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 801
Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
FL+ +HP +P NV Q++LWE++ +R+ + + ++ F
Sbjct: 802 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLTLEDSIVFKNF 847
>gi|68070707|ref|XP_677265.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56497310|emb|CAH94094.1| conserved hypothetical protein [Plasmodium berghei]
Length = 929
Score = 79.7 bits (195), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 69/106 (65%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
+++++NF++Y Y++S L IL +++ + PN++VG +T+ S+ NA+ + ITA+QII
Sbjct: 749 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 808
Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
FL+ +HP +P NV Q++LWE++ +R+ + + ++ F
Sbjct: 809 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLILEDSIVFKNF 854
>gi|397564488|gb|EJK44232.1| hypothetical protein THAOC_37249 [Thalassiosira oceanica]
Length = 971
Score = 79.3 bits (194), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 61/196 (31%), Positives = 99/196 (50%), Gaps = 20/196 (10%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFR------ 54
++KDFA GL+ + +V K + F PT++A NL S +R+ ++
Sbjct: 358 LMKDFARFGLLFVCRVAGKTA-FYPTRVAVNLVASNEKGGSRQSDALLPSVAATRSLEEA 416
Query: 55 MYAYSTSKLHCEILRLFSKIEYQ-LPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
M A ++ H + I Y+ LPN++ +T++S+ +AF G+TA+QI+ FLQ +AH
Sbjct: 417 MNAPDPTRSHLAV------ISYKRLPNVVFFHLTRDSIKSAFRLGVTADQILRFLQVHAH 470
Query: 114 PRV-ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRD--VFEAACDYARDRSG 170
P + + P P NV DQI LW+ + RV M ++ RD F A YA D
Sbjct: 471 PMMRSGNQPMPPANVRDQILLWDRERRRVVMDEVWVHQ---CRDDAEFSAVGMYASDSEA 527
Query: 171 LLWEDSKKMRLVVNAE 186
L W ++ +L + +
Sbjct: 528 LAWGNAHTNKLYLQCD 543
>gi|428181353|gb|EKX50217.1| TFB2 transcription factor B2 nucleotide excision repair [Guillardia
theta CCMP2712]
Length = 463
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/209 (30%), Positives = 93/209 (44%), Gaps = 32/209 (15%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-SLTDSSARKEGFVVVETNFRMYAYS 59
+I DLG V + R+ +F PT L NL S S + V E
Sbjct: 267 LITSLHDLGAVYFKPNDRQ--YFFPTPLIVNLCTESEVPSGPSTDAGVADEF-------- 316
Query: 60 TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
+++ ILRLF++ Y+ P++I+ IT+ES+ NA II +L+ +AHP+ +
Sbjct: 317 SAEPAAGILRLFTRPIYKFPHMIIAVITRESIRNAL--------IIEYLRMHAHPQCLEN 368
Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR------------- 166
P VPE V DQI WE + R+ PA Y F S + A AR
Sbjct: 369 WPIVPEVVTDQICFWEQERCRIRAEPAVAYHNFFSGEAHRACEKEARRLKFNDKGPGEVA 428
Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFL 195
R L+ D+ + LVV +R+F+
Sbjct: 429 KREACLYSDTSEQLLVVPEAADPLIRKFI 457
>gi|440295536|gb|ELP88449.1| TFIIH basal transcription factor complex p52 subunit, putative
[Entamoeba invadens IP1]
Length = 531
Score = 76.6 bits (187), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 85/161 (52%), Gaps = 20/161 (12%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
+ ETN+ +Y Y+ S ++ LF + QL NL VG IT +++ AF GIT E +++
Sbjct: 290 TITETNYTVYIYTESFFQVKLYSLFIRQNLQLTNLCVGRITYDTVTEAFLKGITNEMLVN 349
Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPA---HYYEEFP----SRDVFE 159
FLQ P++P+N+ QI LW+ +LNR++ A +YE P ++++
Sbjct: 350 FLQ-----------PNLPKNIQAQIDLWKRELNRLKEVRAVKFRFYE--PELEVQKELYH 396
Query: 160 AACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
A G+++ +++ L V ++ ++EFLR + +
Sbjct: 397 LTKSEAEKMKGVVFYKEEELTLFVRYDVAEKIKEFLRRKTR 437
>gi|449521313|ref|XP_004167674.1| PREDICTED: RNA polymerase II transcription factor B subunit 2-like,
partial [Cucumis sativus]
Length = 296
Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 38/42 (90%), Positives = 39/42 (92%), Gaps = 1/42 (2%)
Query: 2 IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK 43
IKD ADLGLVKLQQ GRKESWFIPTKLATNLSMSL DSS+RK
Sbjct: 255 IKDLADLGLVKLQQ-GRKESWFIPTKLATNLSMSLADSSSRK 295
>gi|67483069|ref|XP_656826.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56474050|gb|EAL51440.1| hypothetical protein EHI_192400 [Entamoeba histolytica HM-1:IMSS]
gi|449706224|gb|EMD46115.1| tfiih basal transcription factor complex p52 subunit, putative
[Entamoeba histolytica KU27]
Length = 484
Score = 72.8 bits (177), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 50/163 (30%), Positives = 82/163 (50%), Gaps = 28/163 (17%)
Query: 5 FADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR------KEGFVVVETNFRMYAY 58
F DLG++ V KE +A MSL S+A K+ + E N+ +Y Y
Sbjct: 249 FNDLGII----VYYKEQ----NVMAITPLMSLLRSNANIPSDLVKKPKTITEINYSVYIY 300
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S+ ++ RLF + +QL NL VG + +S+ AF GIT+E +I+FLQ
Sbjct: 301 TESQFQVDLYRLFIRKNFQLANLWVGKLNHQSVTEAFAKGITSEMLINFLQ--------- 351
Query: 119 RIPSVPENVCDQIRLWESDLNRVE---MTPAHYYEEFPSRDVF 158
P++P + QI LW+ ++NR + + +Y++ R +F
Sbjct: 352 --PNLPRTIQKQIDLWKKEINRFKEEHVVRYRFYDDAIGRQLF 392
>gi|160331881|ref|XP_001712647.1| hypothetical protein HAN_3g523 [Hemiselmis andersenii]
gi|159766096|gb|ABW98322.1| hypothetical protein HAN_3g523 [Hemiselmis andersenii]
Length = 432
Score = 72.8 bits (177), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 60/97 (61%), Gaps = 4/97 (4%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
+++E+N+R+Y Y + + ++ +FS+I Y LPN VG IT+ S+ A ++GIT + I+
Sbjct: 300 IIIESNYRIYVYQKNSNNNQLFLIFSEILYILPNFFVGEITETSISRALKSGITIQNILG 359
Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEM 143
F+++N H S+P + +Q RLWE ++++
Sbjct: 360 FIRENLHCVCR----SIPSTILNQFRLWEFQKKKIKI 392
>gi|380476902|emb|CCF44454.1| RNA polymerase II transcription factor B subunit 2 [Colletotrichum
higginsianum]
Length = 126
Score = 72.8 bits (177), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 67/126 (53%), Gaps = 9/126 (7%)
Query: 81 LIVGAITKESLYNAFENGITAEQIISFLQQNAHPR------VADRIPSVPENVCDQIRLW 134
++ G +T+ S+ NA E GITA+QIIS+L +AH + V R P +P V DQIRLW
Sbjct: 1 MVSGRLTRNSIRNAVERGITADQIISYLAAHAHEQMHRMAAVRSR-PVLPPTVVDQIRLW 59
Query: 135 ESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREF 194
+ + R+ T + +F S +E YA + L+W + K A H +R++
Sbjct: 60 QLETERMTTTSGFLFRDFDSPKEYEVIAGYASEIGVLVWRNDKLGMFF--ASKHEQIRDY 117
Query: 195 LRGQNK 200
L+ + K
Sbjct: 118 LKLRKK 123
>gi|167395228|ref|XP_001741280.1| TFIIH basal transcription factor complex p52 subunit [Entamoeba
dispar SAW760]
gi|165894177|gb|EDR22239.1| TFIIH basal transcription factor complex p52 subunit, putative
[Entamoeba dispar SAW760]
Length = 489
Score = 72.8 bits (177), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 50/163 (30%), Positives = 82/163 (50%), Gaps = 28/163 (17%)
Query: 5 FADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR------KEGFVVVETNFRMYAY 58
F DLG++ V KE +A MSL S+A K+ + E N+ +Y Y
Sbjct: 249 FNDLGII----VYYKEQ----NVMAITPLMSLLRSNANIPSDLVKKPKTITEINYSVYIY 300
Query: 59 STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
+ S+ ++ RLF + +QL NL VG + +S+ AF GIT+E +I+FLQ
Sbjct: 301 TESQFQVDLYRLFIRKNFQLANLWVGKLNHQSVTEAFAKGITSEMLINFLQ--------- 351
Query: 119 RIPSVPENVCDQIRLWESDLNRVE---MTPAHYYEEFPSRDVF 158
P++P + QI LW+ ++NR + + +Y++ R +F
Sbjct: 352 --PNLPRTIQKQIDLWKKEINRFKEEHVVRYKFYDDAIGRQLF 392
>gi|238570605|ref|XP_002386883.1| hypothetical protein MPER_14683 [Moniliophthora perniciosa FA553]
gi|215440101|gb|EEB87813.1| hypothetical protein MPER_14683 [Moniliophthora perniciosa FA553]
Length = 69
Score = 72.4 bits (176), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 33/69 (47%), Positives = 49/69 (71%), Gaps = 5/69 (7%)
Query: 40 SARKEGFVVVETNFRMYAYSTS-----KLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 94
S ++GF+V+ETN+R+YAY+ S L IL LF ++Y+ PNL+VG+IT+ES+ A
Sbjct: 1 SGSQQGFIVLETNYRVYAYTGSFSLHNPLQTAILHLFVSLKYRFPNLVVGSITRESVKKA 60
Query: 95 FENGITAEQ 103
NGI+A+Q
Sbjct: 61 LSNGISADQ 69
>gi|330038479|ref|XP_003239608.1| TATA binding protein of transcription factor IIB-like protein
[Cryptomonas paramecium]
gi|327206532|gb|AEA38710.1| TATA binding protein of transcription factor IIB-like protein
[Cryptomonas paramecium]
Length = 467
Score = 69.3 bits (168), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 60/105 (57%), Gaps = 13/105 (12%)
Query: 36 LTDSSARKEGF-VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 94
L+ KE F +++E+N+R+YAY+ + +IL F + Y LP L VG I K S++ A
Sbjct: 307 LSKKIKTKECFQMIIESNYRVYAYNNNTFLNKILIQFCDLIYNLPGLFVGEINKTSIHKA 366
Query: 95 FENGITAEQIISFLQQNAHPRVADRIPSVPENVC----DQIRLWE 135
GI A+ IISF+ +N+H + +N C +QIR+WE
Sbjct: 367 IHKGINAKNIISFILKNSH--------YIHQNSCNPIINQIRIWE 403
>gi|269861689|ref|XP_002650538.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
gi|220065990|gb|EED43520.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
Length = 291
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/158 (22%), Positives = 82/158 (51%), Gaps = 16/158 (10%)
Query: 36 LTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAF 95
L ++ + F+ +ETNF++Y+Y+ + IL LFS I +++P +I + ++ + N
Sbjct: 146 LYKNNTEIQKFIYLETNFKLYSYTNNLCDRAILDLFSNIVFEVPGMIKSILDEKKIMNTL 205
Query: 96 ENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQ----IRLWESDLNRVEMTPAHYYEE 151
+ GI +QI+ +++++ N CDQ I +W+ NR+ + Y +
Sbjct: 206 DRGIKIQQIVEYIRRHT------------INNCDQILHMIEIWDKQRNRINKQIGYLYSD 253
Query: 152 FPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHM 189
F + + + + + + + LL+++ ++ L V ++H+
Sbjct: 254 FTNYNEYRSVLEQIKTDTDLLYKNEEERILFVKNKLHI 291
>gi|269860316|ref|XP_002649880.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
gi|220066720|gb|EED44193.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
Length = 369
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/158 (22%), Positives = 82/158 (51%), Gaps = 16/158 (10%)
Query: 36 LTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAF 95
L ++ + F+ +ETNF++Y+Y+ + IL LFS I +++P +I + ++ + N
Sbjct: 224 LYKNNTEIQKFIYLETNFKLYSYTNNLCDRAILDLFSNIVFEVPGMIKSILDEKKIMNTL 283
Query: 96 ENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQ----IRLWESDLNRVEMTPAHYYEE 151
+ GI +QI+ +++++ N CDQ I +W+ NR+ + Y +
Sbjct: 284 DRGIKIQQIVEYIRRHT------------INNCDQILHMIEIWDKQRNRINKQIGYLYSD 331
Query: 152 FPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHM 189
F + + + + + + + LL+++ ++ L V ++H+
Sbjct: 332 FTNYNEYRSVLEQIKTDTDLLYKNEEERILFVKNKLHI 369
>gi|123488965|ref|XP_001325283.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121908180|gb|EAY13060.1| hypothetical protein TVAG_212420 [Trichomonas vaginalis G3]
Length = 460
Score = 67.8 bits (164), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/178 (23%), Positives = 81/178 (45%), Gaps = 9/178 (5%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
++ F +GLV + +F PTK N + + EG+++++TNF++ A+
Sbjct: 257 LLSPFKTIGLVYIDG-----DYFYPTKSILNFFGK--SNIFQTEGWMLIDTNFKITAFPK 309
Query: 61 SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
S LH +L+ F+ + Y+ P I+ S A G T + II FL+ N ++
Sbjct: 310 SPLHTALLKKFANVTYEFPGFASAFISPNSFREALNQGTTLDDIIGFLKSNLSHKIGSG- 368
Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
+P V Q +W R+ +T ++ + + A A+ +G ++ ++K
Sbjct: 369 -QIPSAVMKQFYVWRDQRERLTVTHECIMRQYTNPNDANLAAQCAKQLAGYVYGPAEK 425
>gi|167535774|ref|XP_001749560.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771952|gb|EDQ85611.1| predicted protein [Monosiga brevicollis MX1]
Length = 417
Score = 66.2 bits (160), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 54/94 (57%), Gaps = 5/94 (5%)
Query: 40 SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGI 99
S + GF++ ETNF +YAY+ S + +L F+ + Y+LP + IT+ S+ A GI
Sbjct: 287 SQTETGFLLAETNFHVYAYTDSAVKVALLSKFATLTYRLPIMTTAIITRTSVRRALSQGI 346
Query: 100 TAEQIISFLQQNA-----HPRVADRIPSVPENVC 128
TA+QI+ F+Q+++ H A P+ P++
Sbjct: 347 TADQILRFMQRSSLLLRHHDETAPTRPTSPQHAT 380
>gi|294950275|ref|XP_002786548.1| tfiih, polypeptide, putative [Perkinsus marinus ATCC 50983]
gi|239900840|gb|EER18344.1| tfiih, polypeptide, putative [Perkinsus marinus ATCC 50983]
Length = 580
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 75/153 (49%), Gaps = 36/153 (23%)
Query: 30 TNLSMSLTDSSA--RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQL-------PN 80
+ + MS+ D++A K + V++NF + AY+TS L +L F +++ QL PN
Sbjct: 376 SKMMMSVDDTTAITNKSCRLFVDSNFAVTAYTTSSLDLRLLGTFVQLQRQLGDGREYDPN 435
Query: 81 ---LIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR------------------ 119
++G +T+ S+ +A + G+T+E IIS+L+ + PR A
Sbjct: 436 DFGCVLGTLTQSSVQSAAQRGVTSEYIISYLKSHVDPRAAHMGSQGGRSSAATANTGAAR 495
Query: 120 ----IPSVPENVCDQIRLWESDL--NRVEMTPA 146
I +P NV QI LWE + NR+ + P
Sbjct: 496 GEKFIDGIPANVVTQITLWEREAIHNRLRIDPG 528
>gi|340508252|gb|EGR33999.1| hypothetical protein IMG5_027880 [Ichthyophthirius multifiliis]
Length = 454
Score = 63.5 bits (153), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 69/129 (53%), Gaps = 17/129 (13%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTDSSARKEGFVVVETNFRMYA 57
++ D A LG++K +G+++ F T L S+ + +A+ ++VETNF +YA
Sbjct: 261 ILSDLAKLGMIKEFDLGQQKKVFGITPLIWQFCYRSIDIKTINAK----IIVETNFNLYA 316
Query: 58 Y----------STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
Y S SK ++L+ FSKI Y P+LIV +T+ + AF GIT++ +I F
Sbjct: 317 YLDYNPQNKTFSESKYIRDLLKKFSKIHYTFPHLIVAQLTEAKMKQAFNQGITSKLLIEF 376
Query: 108 LQQNAHPRV 116
+ + ++
Sbjct: 377 FHKTSDAKL 385
>gi|47212202|emb|CAF90416.1| unnamed protein product [Tetraodon nigroviridis]
Length = 471
Score = 60.5 bits (145), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 45/71 (63%), Gaps = 1/71 (1%)
Query: 76 YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWE 135
Y+ PN++V +T+ES+ A NGITA+Q ++F + P + P +P + DQIRLWE
Sbjct: 282 YRFPNVVVAQVTRESVQQAIANGITAQQRLTF-DLWSSPTPLSQTPVLPPTITDQIRLWE 340
Query: 136 SDLNRVEMTPA 146
+ +R++ T A
Sbjct: 341 LERDRLQFTEA 351
>gi|162605818|ref|XP_001713424.1| hypothetical protein GTHECHR3089 [Guillardia theta]
gi|13794356|gb|AAK39733.1|AF083031_90 hypothetical protein [Guillardia theta]
Length = 180
Score = 59.3 bits (142), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 66/137 (48%), Gaps = 8/137 (5%)
Query: 5 FADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGF-VVVETNFRMYAYSTSKL 63
F ++GL+ L G+K I K+ + + E F +++E NFR+Y +
Sbjct: 5 FYNMGLINL--TGKKTIQ-ISNKIINLEKNGIIIQKKKTERFKIIIEVNFRIYIFKYFDF 61
Query: 64 HCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSV 123
+ SK Y LP VG I + F+ G+T++ I+ F+++N H ++IPS
Sbjct: 62 SNSFFKFISKKIYSLPYFYVGEINENYSRRLFQLGVTSDNILKFIKKNLH-YTCNKIPST 120
Query: 124 PENVCDQIRLWESDLNR 140
E D++R+WE +
Sbjct: 121 FE---DRLRIWEISFKK 134
>gi|401419872|ref|XP_003874425.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322490661|emb|CBZ25923.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 559
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 61/144 (42%), Gaps = 49/144 (34%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQL-PNLIVGAITKESLYNAFENGITAEQII 105
++ ETNFR+YAY+ IL F+++E + NL +T++S A GITA Q++
Sbjct: 359 IITETNFRLYAYTDDADLLNILNQFAELEEIVNGNLHCYRVTRDSFAAAMRKGITAAQVL 418
Query: 106 SFLQQNAHPRVADR---------------------------------IPS---------- 122
FL AHP + R +P
Sbjct: 419 RFLSLRAHPSMLRRHGEREESGDTAAPSSAGGVSSSLTFSAPHRRGAVPGTCGGTTAIDA 478
Query: 123 -----VPENVCDQIRLWESDLNRV 141
VP++ CDQ+R+WES+ +RV
Sbjct: 479 VSTIVVPQSFCDQMRMWESECHRV 502
>gi|146103275|ref|XP_001469523.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134073893|emb|CAM72632.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 557
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 59/144 (40%), Gaps = 49/144 (34%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQL-PNLIVGAITKESLYNAFENGITAEQII 105
++ ETNFR+YAY+ IL F+K+E + NL +T++S +A GITA Q++
Sbjct: 357 IITETNFRLYAYTDDADLLNILNQFAKLEEIVNGNLHCYRVTRDSFASAMRKGITAAQVL 416
Query: 106 SFLQQNAHPR-------------------------------------VADRIPS------ 122
FL AHP VA S
Sbjct: 417 RFLSLRAHPSMLRRHGEREENGDTTSPSSAGGASSSFTFSASPRRRAVAGTCGSTTSIDA 476
Query: 123 -----VPENVCDQIRLWESDLNRV 141
VP++ CDQ+ +WES+ RV
Sbjct: 477 VSTLVVPQSFCDQMHMWESECRRV 500
>gi|398024082|ref|XP_003865202.1| hypothetical protein, conserved [Leishmania donovani]
gi|322503439|emb|CBZ38524.1| hypothetical protein, conserved [Leishmania donovani]
Length = 557
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 59/144 (40%), Gaps = 49/144 (34%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQL-PNLIVGAITKESLYNAFENGITAEQII 105
++ ETNFR+YAY+ IL F+K+E + NL +T++S +A GITA Q++
Sbjct: 357 IITETNFRLYAYTDDADLLNILNQFAKLEEIVNGNLHCYRVTRDSFASAMRKGITAAQVL 416
Query: 106 SFLQQNAHPR-------------------------------------VADRIPS------ 122
FL AHP VA S
Sbjct: 417 RFLSLRAHPSMLRRHGEREENGDTTSPSSAGGASSSFTFSASPRRRAVAGTCGSTTSIDA 476
Query: 123 -----VPENVCDQIRLWESDLNRV 141
VP++ CDQ+ +WES+ RV
Sbjct: 477 VSTLVVPQSFCDQMHMWESECRRV 500
>gi|154345556|ref|XP_001568715.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134066057|emb|CAM43844.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 561
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 58/143 (40%), Gaps = 48/143 (33%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQL-PNLIVGAITKESLYNAFENGITAEQII 105
++ ETNFR+YAY+ IL F+++E + NL +T++S A GITA Q++
Sbjct: 362 IITETNFRLYAYTEDSDLLNILHQFAELEEVVNGNLHCYRVTRDSFAAAMRKGITATQVL 421
Query: 106 SFLQQNAHPRVADR------------------IPS------------------------- 122
FL AHP + R PS
Sbjct: 422 RFLSLRAHPSMLRRYGEKEENGDTKSPSPAGGAPSSTFSAPPRRAAVAGTRGSTTFIDAV 481
Query: 123 ----VPENVCDQIRLWESDLNRV 141
VP + CDQ+R+WE + RV
Sbjct: 482 TTLVVPHSFCDQLRMWERECRRV 504
>gi|157876614|ref|XP_001686652.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68129727|emb|CAJ09033.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 626
Score = 54.3 bits (129), Expect = 3e-05, Method: Composition-based stats.
Identities = 39/144 (27%), Positives = 58/144 (40%), Gaps = 49/144 (34%)
Query: 47 VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLP-NLIVGAITKESLYNAFENGITAEQII 105
++ ETNFR++AY+ IL F+++E + NL +T+ S A GITA Q++
Sbjct: 426 IITETNFRLFAYTNDADLLNILNQFAELEEIVNGNLHCYRVTRASFAAAMRKGITAAQVL 485
Query: 106 SFLQQNAHPRVADR---------------------------IPS---------------- 122
FL AHP + R P
Sbjct: 486 RFLSLRAHPSMLRRHGEREENGDTASPSSAGGASSSFTFSASPRRGAVAGTCGSTTSIDA 545
Query: 123 -----VPENVCDQIRLWESDLNRV 141
VP++ CDQ+R+WES+ RV
Sbjct: 546 VSTLVVPQSFCDQMRMWESECRRV 569
>gi|145509120|ref|XP_001440504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407721|emb|CAK73107.1| unnamed protein product [Paramecium tetraurelia]
Length = 400
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/170 (27%), Positives = 75/170 (44%), Gaps = 39/170 (22%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
+IKD + GL Q + ++ L S+S+ ++VE NFR+YAY
Sbjct: 231 VIKDLQEFGLTHYQDQQMRITFLFWNFLYEPPSVSIGIQCN-----IIVEANFRIYAYLN 285
Query: 61 S------KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 114
S ++ C +L LFS+I+ + LI+ +++ S+ A + A+QII FL+ N+
Sbjct: 286 SGDQQEEEILCNLLNLFSEIKKRFKILIIADLSESSIRKAVRENLQAKQIIQFLEMNSKQ 345
Query: 115 ------------------------RV-ADRIPS---VPENVCDQIRLWES 136
RV + +P +P NV QI+ WES
Sbjct: 346 LKQQAATEKQHKSNDELKKRLDFLRVFQEGVPEKAIIPHNVVQQIQYWES 395
>gi|342184190|emb|CCC93671.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 499
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 58/127 (45%), Gaps = 29/127 (22%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
E ++ ETNFR+YAY+ + IL F+ E + +IV +T+ S A GI A+
Sbjct: 314 EDTIITETNFRLYAYTRNPDLLGILDQFAVKEIDIDGIIVCYRVTRSSFALALRKGIDAK 373
Query: 103 QIISFLQQNAHPR-----------VADRI----------PS-------VPENVCDQIRLW 134
I+ FL AHP V+D + PS +P++ CDQ+ W
Sbjct: 374 HILQFLTLKAHPSMVRKDGGASRDVSDNLALNSVAGFGKPSEVHQSTVIPQSFCDQLMTW 433
Query: 135 ESDLNRV 141
E + R+
Sbjct: 434 ERECRRL 440
>gi|340057132|emb|CCC51474.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 502
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 59/130 (45%), Gaps = 34/130 (26%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
E ++ ETNFR+YAY+ +K IL F+ E ++ +IV +T+ S A GI A+
Sbjct: 315 EDTIITETNFRLYAYTRNKHLLAILEQFAIKEAEVDGMIVCFRVTRASFAAALRKGIGAQ 374
Query: 103 QIISFLQQNAH-----------PR--------------------VADRIPSVPENVCDQI 131
I+ FL+ AH PR AD+I +P + CDQ+
Sbjct: 375 HIVQFLRVKAHSSMLKHQLTCDPRDSAGLAATSSRVTPTDTPWTHADKI--IPRSFCDQL 432
Query: 132 RLWESDLNRV 141
WE + R+
Sbjct: 433 FTWERECRRL 442
>gi|71747422|ref|XP_822766.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70832434|gb|EAN77938.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 500
Score = 52.8 bits (125), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 57/127 (44%), Gaps = 29/127 (22%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
E ++ ETNFR+YAY+ + IL F+ E + ++V +T+ S +A GI A
Sbjct: 315 EDTIITETNFRLYAYTKNPDMLRILDQFAVKEVDVVGMVVCYRVTRASFASALAKGIGAN 374
Query: 103 QIISFLQQNAHPRVADR------------IPS----------------VPENVCDQIRLW 134
I+ FL AHP + + +P+ +P++ CDQ+ W
Sbjct: 375 HILQFLTVKAHPSMIKQSNSEAGDPSCPVLPAASVGFGNTSEYRQGNIIPQSFCDQLFTW 434
Query: 135 ESDLNRV 141
E + R+
Sbjct: 435 ERECRRL 441
>gi|261332554|emb|CBH15549.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 500
Score = 52.8 bits (125), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 57/127 (44%), Gaps = 29/127 (22%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
E ++ ETNFR+YAY+ + IL F+ E + ++V +T+ S +A GI A
Sbjct: 315 EDTIITETNFRLYAYTKNPDMLRILDQFAVKEVDVVGMVVCYRVTRASFASALAKGIGAN 374
Query: 103 QIISFLQQNAHPRVADR------------IPS----------------VPENVCDQIRLW 134
I+ FL AHP + + +P+ +P++ CDQ+ W
Sbjct: 375 HILQFLTVKAHPSMIKQSNSEAGDPSCPVLPAASAGFGNTSEYRQGNIIPQSFCDQLFTW 434
Query: 135 ESDLNRV 141
E + R+
Sbjct: 435 ERECRRL 441
>gi|324515605|gb|ADY46257.1| General transcription factor IIH subunit 4 [Ascaris suum]
Length = 245
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 49/76 (64%), Gaps = 7/76 (9%)
Query: 7 DLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE-----GFVVVETNFRMYAYSTS 61
+LGLV +++ RK+ +F T L +L++S + +A E G+++VETN+R+YAY+ S
Sbjct: 170 ELGLVFIRK--RKDGYFFITPLFAHLTVSSSSETAVAEKRTHSGYIIVETNYRVYAYTDS 227
Query: 62 KLHCEILRLFSKIEYQ 77
L IL F+++ Y+
Sbjct: 228 SLQLAILSTFTEMLYR 243
>gi|414589532|tpg|DAA40103.1| TPA: hypothetical protein ZEAMMB73_417245 [Zea mays]
Length = 877
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 33/48 (68%), Gaps = 9/48 (18%)
Query: 135 ESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLV 182
++DLNRVEM P++ YE+FPS+ ARD LL +DSK+MRL+
Sbjct: 824 QTDLNRVEMIPSYLYEDFPSK---------ARDNGYLLCDDSKRMRLI 862
>gi|146161819|ref|XP_001033510.2| hypothetical protein TTHERM_00313290 [Tetrahymena thermophila]
gi|146142827|gb|EAR85847.2| hypothetical protein TTHERM_00313290 [Tetrahymena thermophila
SB210]
Length = 484
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 45/87 (51%), Gaps = 17/87 (19%)
Query: 47 VVVETNFRMYAY-------------STSKLHCE----ILRLFSKIEYQLPNLIVGAITKE 89
+ +ETNF +YAY T K E +L F + Y+ P+LIVG ++++
Sbjct: 322 ITIETNFNVYAYIDQEDPKEIKDSKDTLKDQTEHIEKLLASFCDLHYKFPHLIVGQLSEQ 381
Query: 90 SLYNAFENGITAEQIISFLQQNAHPRV 116
+ F+NG+TA +I F + + P++
Sbjct: 382 KTKDQFKNGLTARLLIQFFNKTSDPQM 408
>gi|407397449|gb|EKF27748.1| DNA repair and transcription factor protein [Trypanosoma cruzi
marinkellei]
Length = 501
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 57/128 (44%), Gaps = 30/128 (23%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
E ++ E NFR+YAY+ + IL F++ + ++ +I +T+++ A + GI +
Sbjct: 315 EDTIITEANFRLYAYTKNPDLLNILNQFAERDAEIDQMIACYRVTRKTFAAALKRGIGSS 374
Query: 103 QIISFLQQNAHP------RVAD----------------------RIPS-VPENVCDQIRL 133
I+ FL AHP R D RI +P++ CDQ+
Sbjct: 375 HILQFLAVKAHPSMLRNHRDGDSKKTSGLSVLGAGTRFKNTSDIRIDGIIPQSFCDQLMT 434
Query: 134 WESDLNRV 141
WE + R+
Sbjct: 435 WEKECRRL 442
>gi|71650598|ref|XP_813994.1| DNA repair and transcription factor protein [Trypanosoma cruzi
strain CL Brener]
gi|70878928|gb|EAN92143.1| DNA repair and transcription factor protein, putative [Trypanosoma
cruzi]
Length = 501
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 30/128 (23%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
E ++ E NFR+YAY+ + IL F++ + ++ +I +T+ + A + GI +
Sbjct: 315 EDTIITEANFRLYAYTKNPDLLNILNQFAERDAEVDQMIACYRVTRRTFAAALKRGIGSS 374
Query: 103 QIISFLQQNAHP------RVAD----------------------RIPS-VPENVCDQIRL 133
I+ FL AHP R D RI +P++ CDQ+
Sbjct: 375 HILQFLAVKAHPSMLMHHREGDSKKASGLSVLGAGTRFKNASDIRIDEIIPQSFCDQLMT 434
Query: 134 WESDLNRV 141
WE + R+
Sbjct: 435 WEKECRRL 442
>gi|71425201|ref|XP_813043.1| DNA repair and transcription factor protein [Trypanosoma cruzi
strain CL Brener]
gi|70877891|gb|EAN91192.1| DNA repair and transcription factor protein, putative [Trypanosoma
cruzi]
Length = 501
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 30/128 (23%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
E ++ E NFR+YAY+ + IL F++ + ++ +I +T+ + A + GI +
Sbjct: 315 EDTIITEANFRLYAYTRNPDLLNILNQFAERDAEVDQMIACYRVTRRTFAAALKRGIGSS 374
Query: 103 QIISFLQQNAHP------RVAD----------------------RIPS-VPENVCDQIRL 133
I+ FL AHP R D RI +P++ CDQ+
Sbjct: 375 HILQFLAVKAHPSMLRHHREGDSKKASGLSVLGAGTRFKNASDIRIDEIIPQSFCDQLMT 434
Query: 134 WESDLNRV 141
WE + R+
Sbjct: 435 WEKECRRL 442
>gi|407835017|gb|EKF99108.1| DNA repair and transcription factor protein, putative [Trypanosoma
cruzi]
Length = 501
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 30/128 (23%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
E ++ E NFR+YAY+ + IL F++ + ++ +I +T+ + A + GI +
Sbjct: 315 EDTIITEANFRLYAYTRNPDLLNILNQFAERDAEVDQMIACYRVTRRTFAAALKRGIGSL 374
Query: 103 QIISFLQQNAHP------RVAD----------------------RIPS-VPENVCDQIRL 133
I+ FL AHP R D RI +P++ CDQ+
Sbjct: 375 HILQFLAVKAHPSMLRHHREGDSKKASGLSVLGAGTRFKKASDIRIDEIIPQSFCDQLMT 434
Query: 134 WESDLNRV 141
WE + R+
Sbjct: 435 WEKECRRL 442
>gi|197107449|pdb|3DOM|A Chain A, Crystal Structure Of The Complex Between Tfb5 And The
C-Terminal Domain Of Tfb2
gi|197107451|pdb|3DOM|C Chain C, Crystal Structure Of The Complex Between Tfb5 And The
C-Terminal Domain Of Tfb2
Length = 108
Score = 43.9 bits (102), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 43/83 (51%)
Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
P + + +P V DQIRLW+ +L+RV Y +F + + YA+D LLW
Sbjct: 19 PNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLLW 78
Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
+D KK + ++ E + + +F +
Sbjct: 79 KDDKKKKFFISKEGNSQVLDFAK 101
>gi|47194594|emb|CAF93785.1| unnamed protein product [Tetraodon nigroviridis]
Length = 123
Score = 42.7 bits (99), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 46/111 (41%), Gaps = 23/111 (20%)
Query: 112 AHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYA------ 165
+ P + P +P + DQIRLWE + +R++ T Y +F S+ FE D A
Sbjct: 13 SSPTPLTQTPVLPPTITDQIRLWELERDRLQFTEGVLYNQFLSQTDFEVLRDRAQVGLFH 72
Query: 166 -----------------RDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
+ LLW+D+ +VV H +++F + Q
Sbjct: 73 CCYTIRQKVHQQLSPVLQSLGCLLWQDAAHRVMVVTLWGHSEVKKFWKRQK 123
>gi|241725008|ref|XP_002412209.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
gi|215505416|gb|EEC14910.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
Length = 75
Score = 42.7 bits (99), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 36/68 (52%)
Query: 129 DQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIH 188
DQ+RLWE + +R Y +F S+ F+ +YA D L+W++ K +VVN H
Sbjct: 2 DQLRLWELERDRFNFREGVLYSQFISQSDFQLLRNYASDLGVLIWDNPSKRVMVVNRNGH 61
Query: 189 MHMREFLR 196
++ F +
Sbjct: 62 DEVKRFWK 69
>gi|197107383|pdb|3DGP|A Chain A, Crystal Structure Of The Complex Between Tfb5 And The
C-Terminal Domain Of Tfb2
Length = 80
Score = 40.8 bits (94), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 38/71 (53%)
Query: 126 NVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNA 185
V DQIRLW+ +L+RV Y +F + + YA+D LLW+D KK + ++
Sbjct: 3 TVVDQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISK 62
Query: 186 EIHMHMREFLR 196
E + + +F +
Sbjct: 63 EGNSQVLDFAK 73
>gi|337743331|gb|AEI73162.1| GTF2H4 [Kryptolebias marmoratus]
Length = 82
Score = 39.7 bits (91), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 35/70 (50%), Gaps = 13/70 (18%)
Query: 1 MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT-----------DSSARKEGFVVV 49
++ + GLV Q RK + PT+LA L+ +T A GF+VV
Sbjct: 15 FLQHLREFGLVF--QRKRKSRRYYPTRLAITLAAGVTSNSSSSSNLSSTQGAGDAGFIVV 72
Query: 50 ETNFRMYAYS 59
ETN+R+YAY+
Sbjct: 73 ETNYRVYAYT 82
>gi|440297879|gb|ELP90520.1| hypothetical protein EIN_018790 [Entamoeba invadens IP1]
Length = 1702
Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 21/74 (28%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 97 NGITAEQIISFLQQN--AHPRVADRIPSVPENVCDQI-RLWESDLNRVEMTPAHYYEEFP 153
+G EQ+ F+ +N A+ R+ ++IPS+P+ C ++ + ++ + + H E P
Sbjct: 1211 SGKVCEQMAFFISKNPKAYERLKEKIPSLPDKSCVELFDFFSENVQKNQQKYKHLNYESP 1270
Query: 154 SRDVFEAACDYARD 167
S D E + ARD
Sbjct: 1271 SEDFLEQYTELARD 1284
>gi|261405782|ref|YP_003242023.1| hypothetical protein GYMC10_1934 [Paenibacillus sp. Y412MC10]
gi|261282245|gb|ACX64216.1| hypothetical protein GYMC10_1934 [Paenibacillus sp. Y412MC10]
Length = 572
Score = 38.1 bits (87), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 45/94 (47%), Gaps = 9/94 (9%)
Query: 81 LIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNR 140
+ V +T++S+ A + G+ ++I +F ++A +PENV + W +++R
Sbjct: 364 MTVYRLTRDSVIQAAKAGMEPDRIAAFFSKHA-------AAGMPENVQSALYQWGKEMDR 416
Query: 141 VEMTPAHYYEE-FPSRDVFEAACDYARDRSGLLW 173
+ + YEE FP D A D DR + W
Sbjct: 417 AGLNESPRYEEAFPEEDTELAGTD-GNDRYPMQW 449
>gi|430746993|ref|YP_007206122.1| hypothetical protein Sinac_6345 [Singulisphaera acidiphila DSM
18658]
gi|430018713|gb|AGA30427.1| hypothetical protein Sinac_6345 [Singulisphaera acidiphila DSM
18658]
Length = 790
Score = 37.0 bits (84), Expect = 4.0, Method: Composition-based stats.
Identities = 30/120 (25%), Positives = 55/120 (45%), Gaps = 10/120 (8%)
Query: 44 EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEY--QLPNLIVGAITKESLYNAFENGITA 101
E F+ V+ NF + AY L ++ FS+ Q+ + +T ES+Y E G+T
Sbjct: 448 EHFLYVQPNFEIIAYRQG-LTPPLIGQFSRFALWSQVGAALELKLTPESVYRGLEGGLTP 506
Query: 102 EQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
+++ L +++ RV +P V + +R W R+ + EF +++ E A
Sbjct: 507 PEMLDRLARHSQ-RV------LPAGVSEAVRTWAGRRERITYYASATLVEFATKEDLEKA 559
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.134 0.395
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,954,759,714
Number of Sequences: 23463169
Number of extensions: 112206556
Number of successful extensions: 244311
Number of sequences better than 100.0: 396
Number of HSP's better than 100.0 without gapping: 378
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 243499
Number of HSP's gapped (non-prelim): 470
length of query: 200
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 65
effective length of database: 9,191,667,552
effective search space: 597458390880
effective search space used: 597458390880
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 73 (32.7 bits)