BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 029030
         (200 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255556021|ref|XP_002519045.1| tfiih, polypeptide, putative [Ricinus communis]
 gi|223541708|gb|EEF43256.1| tfiih, polypeptide, putative [Ricinus communis]
          Length = 451

 Score =  372 bits (954), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 173/198 (87%), Positives = 192/198 (96%), Gaps = 1/198 (0%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
           MIKD ADLGLVKLQQ GRKESWFIPTKLATNLSMSLTDSS+RK+GFVVVETNFRMYAYST
Sbjct: 254 MIKDLADLGLVKLQQ-GRKESWFIPTKLATNLSMSLTDSSSRKQGFVVVETNFRMYAYST 312

Query: 61  SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
           SKLHCEI+RLFS++EYQLPNL+VGA+TKESLY+AFENGITAEQIISFLQQNAHPRVA+RI
Sbjct: 313 SKLHCEIMRLFSRVEYQLPNLVVGAMTKESLYSAFENGITAEQIISFLQQNAHPRVAERI 372

Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
           PSVPENV DQIRLWESD+NRVEMTPAH Y+EFPSRDVFEAAC++ARD +GLLWEDSK+MR
Sbjct: 373 PSVPENVTDQIRLWESDMNRVEMTPAHLYDEFPSRDVFEAACNFARDWNGLLWEDSKRMR 432

Query: 181 LVVNAEIHMHMREFLRGQ 198
           +VV AEIH++MRE+LRGQ
Sbjct: 433 MVVKAEIHLNMREYLRGQ 450


>gi|224077128|ref|XP_002305144.1| predicted protein [Populus trichocarpa]
 gi|222848108|gb|EEE85655.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score =  363 bits (933), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 169/197 (85%), Positives = 190/197 (96%), Gaps = 1/197 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           IKD A+LGLVKLQQ GRKESWFIPTKLATNLS+SLTDSS+RK+G+VVVETNFR+YAYS+S
Sbjct: 253 IKDLAELGLVKLQQ-GRKESWFIPTKLATNLSVSLTDSSSRKQGYVVVETNFRLYAYSSS 311

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           KLHCEILRLFSKIEYQLPNLIVGAITKESLY AFENGIT++QIISFLQQNAHPRVA+R+P
Sbjct: 312 KLHCEILRLFSKIEYQLPNLIVGAITKESLYTAFENGITSDQIISFLQQNAHPRVAERLP 371

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           SVPENV DQIRLWE+DLNRVE+TP+H+Y+EFPSRD FEAACD+AR+ +GLLWEDSKKMR+
Sbjct: 372 SVPENVTDQIRLWEADLNRVEITPSHFYDEFPSRDTFEAACDFAREWNGLLWEDSKKMRV 431

Query: 182 VVNAEIHMHMREFLRGQ 198
           VV AEIHM+MREFLRGQ
Sbjct: 432 VVKAEIHMNMREFLRGQ 448


>gi|449433857|ref|XP_004134713.1| PREDICTED: general transcription factor IIH subunit 4-like [Cucumis
           sativus]
          Length = 451

 Score =  362 bits (928), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 171/197 (86%), Positives = 186/197 (94%), Gaps = 1/197 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           IKD ADLGLVKLQQ GRKESWFIPTKLATNLSMSL DSS+RK GFVVVETNFRMYAYSTS
Sbjct: 255 IKDLADLGLVKLQQ-GRKESWFIPTKLATNLSMSLADSSSRKLGFVVVETNFRMYAYSTS 313

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           KLHCEILRLFS+IEYQLPNLIVGAITKESLYNAF+NGITAEQI++FLQQNAHPRVA+RIP
Sbjct: 314 KLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFLQQNAHPRVAERIP 373

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           SVPENV DQIRLWESDLNRV++TPAH+Y+EFPSR+VFEAACDYAR+ +GLLWEDSK +RL
Sbjct: 374 SVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREWNGLLWEDSKNLRL 433

Query: 182 VVNAEIHMHMREFLRGQ 198
           VV A+IH HMRE LR Q
Sbjct: 434 VVKADIHTHMREHLRRQ 450


>gi|147854406|emb|CAN81292.1| hypothetical protein VITISV_005315 [Vitis vinifera]
          Length = 451

 Score =  360 bits (923), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 167/195 (85%), Positives = 184/195 (94%), Gaps = 1/195 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           IKD  DLGLVKLQQ GRKESWFIPTKLATNLSMSL+D+S+RK+GFVVVETNFR+YAYS+S
Sbjct: 255 IKDLVDLGLVKLQQ-GRKESWFIPTKLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSS 313

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           KLHCEILRLFS++EYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA+R P
Sbjct: 314 KLHCEILRLFSRVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAERTP 373

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           +VPENV DQIRLWE+DLNRVE  P+H Y+EFPSRDVFEAACD+AR+  GLLWEDSKKMRL
Sbjct: 374 AVPENVTDQIRLWETDLNRVETMPSHLYDEFPSRDVFEAACDFAREYGGLLWEDSKKMRL 433

Query: 182 VVNAEIHMHMREFLR 196
           VV AEIH+HMRE+LR
Sbjct: 434 VVKAEIHLHMREYLR 448


>gi|357461579|ref|XP_003601071.1| General transcription factor IIH subunit [Medicago truncatula]
 gi|355490119|gb|AES71322.1| General transcription factor IIH subunit [Medicago truncatula]
          Length = 452

 Score =  350 bits (897), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 164/198 (82%), Positives = 186/198 (93%), Gaps = 1/198 (0%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
           +IKD ADLGLVKLQQ GRKESWFIPTKLATNLS+S+T+SS+RKEGFVVVETNFR+YAYST
Sbjct: 255 IIKDLADLGLVKLQQ-GRKESWFIPTKLATNLSVSMTESSSRKEGFVVVETNFRVYAYST 313

Query: 61  SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
           SKLHCEILRLFS++EYQLPNLIVGAITKESLYNAF+NGITA+QI+SFL+QNAHPRVA R+
Sbjct: 314 SKLHCEILRLFSRVEYQLPNLIVGAITKESLYNAFDNGITADQIVSFLRQNAHPRVAQRV 373

Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
           P+VPENV DQIRLWESDLNRVEMT A+YY+EFPSRDVFE ACD AR+ SGLLWEDSKKM 
Sbjct: 374 PAVPENVTDQIRLWESDLNRVEMTEAYYYDEFPSRDVFEGACDCAREWSGLLWEDSKKMH 433

Query: 181 LVVNAEIHMHMREFLRGQ 198
           LVV +E+H ++R+FLR Q
Sbjct: 434 LVVKSEVHTYVRDFLRRQ 451


>gi|358346685|ref|XP_003637396.1| General transcription factor IIH subunit [Medicago truncatula]
 gi|355503331|gb|AES84534.1| General transcription factor IIH subunit [Medicago truncatula]
          Length = 451

 Score =  348 bits (893), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 164/198 (82%), Positives = 185/198 (93%), Gaps = 1/198 (0%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
           +IKD ADLGLVKLQQ GRKESWFIPTKLATNLS+S+T+SS+RKEGFVVVETNFR+YAYST
Sbjct: 254 IIKDLADLGLVKLQQ-GRKESWFIPTKLATNLSVSMTESSSRKEGFVVVETNFRVYAYST 312

Query: 61  SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
           SKLHCEILRLFS++EYQLPNLIVGAITKESLYNAF+NGITA+QI+SF +QNAHPRVA RI
Sbjct: 313 SKLHCEILRLFSRVEYQLPNLIVGAITKESLYNAFDNGITADQIVSFFRQNAHPRVAQRI 372

Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
           P+VPENV DQIRLWESDLNRVEMT A+YY+EFPSRDVFE ACD AR+ SGLLWEDSKKM 
Sbjct: 373 PAVPENVTDQIRLWESDLNRVEMTEAYYYDEFPSRDVFEGACDCAREWSGLLWEDSKKMH 432

Query: 181 LVVNAEIHMHMREFLRGQ 198
           LVV +E+H ++R+FLR Q
Sbjct: 433 LVVKSEVHTYVRDFLRRQ 450


>gi|356548733|ref|XP_003542754.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine
           max]
          Length = 451

 Score =  344 bits (882), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 161/198 (81%), Positives = 183/198 (92%), Gaps = 1/198 (0%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
           +I D ADLGLVK+QQ GRK SWFIPTKLATNLSMSL DSS+RK+GFVVVETNFR+YAYST
Sbjct: 254 IINDLADLGLVKIQQ-GRKGSWFIPTKLATNLSMSLADSSSRKQGFVVVETNFRVYAYST 312

Query: 61  SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
           SKLHCEILRLFS++EYQLPNLIVGAITKESLY+AFENGITAEQII+FLQQNAHPRVA+RI
Sbjct: 313 SKLHCEILRLFSRVEYQLPNLIVGAITKESLYSAFENGITAEQIITFLQQNAHPRVAERI 372

Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
           PSVPENV +QIRLWE+DLNRVEMT A+YY+EFPSRDVFE ACD AR+ +GLLWEDSKKM 
Sbjct: 373 PSVPENVTEQIRLWEADLNRVEMTDAYYYDEFPSRDVFEGACDCAREWNGLLWEDSKKMH 432

Query: 181 LVVNAEIHMHMREFLRGQ 198
           +VV  E+H ++R++LR Q
Sbjct: 433 MVVKTEVHQYVRDYLRRQ 450


>gi|297800414|ref|XP_002868091.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313927|gb|EFH44350.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 452

 Score =  342 bits (878), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 158/199 (79%), Positives = 181/199 (90%), Gaps = 1/199 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           +KD ADLGLVKLQQ GRK+SWFIPTKLATNLS+SL DSSARKEGFVV+ETNFRMYAYSTS
Sbjct: 255 LKDLADLGLVKLQQ-GRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTS 313

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           KL CEILRLF++IEYQLPNLI  AITKESLYNAF+NGIT++QII+FLQQN+HPR ADR+P
Sbjct: 314 KLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVP 373

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           S+PENV DQIRLWE+DL R+EMT AH+Y+EFPS+DVFEAACD+AR+  GLLWEDSK+MRL
Sbjct: 374 SIPENVTDQIRLWETDLKRIEMTQAHFYDEFPSKDVFEAACDFAREWGGLLWEDSKRMRL 433

Query: 182 VVNAEIHMHMREFLRGQNK 200
           VV +E+H  MREFL  Q K
Sbjct: 434 VVKSEVHNQMREFLHNQTK 452


>gi|42566894|ref|NP_193435.2| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
 gi|51969678|dbj|BAD43531.1| unnamed protein product [Arabidopsis thaliana]
 gi|115646777|gb|ABJ17114.1| At4g17020 [Arabidopsis thaliana]
 gi|332658439|gb|AEE83839.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
          Length = 452

 Score =  341 bits (874), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 157/199 (78%), Positives = 182/199 (91%), Gaps = 1/199 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           +KD ADLGLVKLQQ GRK+SWFIPTKLATNLS+SL DSSARKEGFVV+ETNFRMYAYSTS
Sbjct: 255 LKDLADLGLVKLQQ-GRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTS 313

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           KL CEILRLF++IEYQLPNLI  AITKESLYNAF+NGIT++QII+FLQQN+HPR ADR+P
Sbjct: 314 KLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVP 373

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           S+PENV DQIRLWE+DL R+EMT AH+Y+EFPS+DVFEAACD+AR+  GLLWEDSK+MRL
Sbjct: 374 SIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRL 433

Query: 182 VVNAEIHMHMREFLRGQNK 200
           VV +E+H  MREFL  Q++
Sbjct: 434 VVKSEVHNQMREFLHTQSR 452


>gi|42572935|ref|NP_974564.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
 gi|332658438|gb|AEE83838.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
          Length = 462

 Score =  340 bits (873), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 157/199 (78%), Positives = 182/199 (91%), Gaps = 1/199 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           +KD ADLGLVKLQQ GRK+SWFIPTKLATNLS+SL DSSARKEGFVV+ETNFRMYAYSTS
Sbjct: 255 LKDLADLGLVKLQQ-GRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTS 313

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           KL CEILRLF++IEYQLPNLI  AITKESLYNAF+NGIT++QII+FLQQN+HPR ADR+P
Sbjct: 314 KLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVP 373

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           S+PENV DQIRLWE+DL R+EMT AH+Y+EFPS+DVFEAACD+AR+  GLLWEDSK+MRL
Sbjct: 374 SIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRL 433

Query: 182 VVNAEIHMHMREFLRGQNK 200
           VV +E+H  MREFL  Q++
Sbjct: 434 VVKSEVHNQMREFLHTQSR 452


>gi|334186626|ref|NP_001190745.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
 gi|332658440|gb|AEE83840.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
          Length = 482

 Score =  340 bits (872), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 157/198 (79%), Positives = 181/198 (91%), Gaps = 1/198 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           +KD ADLGLVKLQQ GRK+SWFIPTKLATNLS+SL DSSARKEGFVV+ETNFRMYAYSTS
Sbjct: 255 LKDLADLGLVKLQQ-GRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTS 313

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           KL CEILRLF++IEYQLPNLI  AITKESLYNAF+NGIT++QII+FLQQN+HPR ADR+P
Sbjct: 314 KLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVP 373

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           S+PENV DQIRLWE+DL R+EMT AH+Y+EFPS+DVFEAACD+AR+  GLLWEDSK+MRL
Sbjct: 374 SIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRL 433

Query: 182 VVNAEIHMHMREFLRGQN 199
           VV +E+H  MREFL  Q+
Sbjct: 434 VVKSEVHNQMREFLHTQS 451


>gi|51969958|dbj|BAD43671.1| unnamed protein product [Arabidopsis thaliana]
          Length = 452

 Score =  339 bits (870), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 157/199 (78%), Positives = 181/199 (90%), Gaps = 1/199 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           +KD ADLGLVKLQQ GRK+SWFIPTKLATNLS+SL DSSARKEGFVV+ETNFRMYAYSTS
Sbjct: 255 LKDLADLGLVKLQQ-GRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTS 313

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           KL CEILRLF++IEYQLPNLI  AITKESLYNAF NGIT++QII+FLQQN+HPR ADR+P
Sbjct: 314 KLQCEILRLFARIEYQLPNLIACAITKESLYNAFGNGITSDQIITFLQQNSHPRCADRVP 373

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           S+PENV DQIRLWE+DL R+EMT AH+Y+EFPS+DVFEAACD+AR+  GLLWEDSK+MRL
Sbjct: 374 SIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRL 433

Query: 182 VVNAEIHMHMREFLRGQNK 200
           VV +E+H  MREFL  Q++
Sbjct: 434 VVKSEVHNQMREFLHTQSR 452


>gi|356521347|ref|XP_003529318.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine
           max]
          Length = 451

 Score =  339 bits (869), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 157/198 (79%), Positives = 182/198 (91%), Gaps = 1/198 (0%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
           +I D ADLGLVK+QQ GRK SWFIPTKLATNLSMSL DSS+RK+GFVVVETNFR+YAYST
Sbjct: 254 IINDLADLGLVKIQQ-GRKGSWFIPTKLATNLSMSLADSSSRKQGFVVVETNFRVYAYST 312

Query: 61  SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
           SKLHCEILRLFS++EYQLPNLIVGAITKESLY+A+ENGITAEQI++FLQQNAHPRVA+R+
Sbjct: 313 SKLHCEILRLFSRVEYQLPNLIVGAITKESLYSAYENGITAEQIVTFLQQNAHPRVAERV 372

Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
           PSVPENV +QIRLWE+DLNRVEMT  +YY+EFPSRDVFE ACD AR+ +GLLWEDSKKM 
Sbjct: 373 PSVPENVTEQIRLWEADLNRVEMTDTYYYDEFPSRDVFEGACDCAREWNGLLWEDSKKMH 432

Query: 181 LVVNAEIHMHMREFLRGQ 198
           +VV  E+H ++R++LR Q
Sbjct: 433 MVVKTEVHPYVRDYLRRQ 450


>gi|32487401|emb|CAE05735.1| OSJNBb0017I01.15 [Oryza sativa Japonica Group]
          Length = 451

 Score =  319 bits (818), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 156/195 (80%), Positives = 176/195 (90%), Gaps = 1/195 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           I+D A+LGLVKLQQ GRK+SWFIPTKLATNLS SL+DSS+ KEGFVVVETNFRMYAYSTS
Sbjct: 255 IRDLAELGLVKLQQ-GRKDSWFIPTKLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTS 313

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           +LHCEILRLFS++EYQLPNLIVG+ITKESLY AFENGITAEQIISFLQQNAHPRVAD+IP
Sbjct: 314 RLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIP 373

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           +VPENV DQIRLWE+D NRV+MT +H YE+FPS+D+F+  CDYARD   LLWED+KKMRL
Sbjct: 374 AVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRL 433

Query: 182 VVNAEIHMHMREFLR 196
           +V  E H  MREFLR
Sbjct: 434 IVRVEFHSEMREFLR 448


>gi|222629781|gb|EEE61913.1| hypothetical protein OsJ_16644 [Oryza sativa Japonica Group]
          Length = 427

 Score =  319 bits (817), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 156/195 (80%), Positives = 176/195 (90%), Gaps = 1/195 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           I+D A+LGLVKLQQ GRK+SWFIPTKLATNLS SL+DSS+ KEGFVVVETNFRMYAYSTS
Sbjct: 231 IRDLAELGLVKLQQ-GRKDSWFIPTKLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTS 289

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           +LHCEILRLFS++EYQLPNLIVG+ITKESLY AFENGITAEQIISFLQQNAHPRVAD+IP
Sbjct: 290 RLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIP 349

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           +VPENV DQIRLWE+D NRV+MT +H YE+FPS+D+F+  CDYARD   LLWED+KKMRL
Sbjct: 350 AVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRL 409

Query: 182 VVNAEIHMHMREFLR 196
           +V  E H  MREFLR
Sbjct: 410 IVRVEFHSEMREFLR 424


>gi|218195833|gb|EEC78260.1| hypothetical protein OsI_17944 [Oryza sativa Indica Group]
          Length = 459

 Score =  317 bits (813), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 156/202 (77%), Positives = 177/202 (87%), Gaps = 7/202 (3%)

Query: 2   IKDFADLGLVKLQQV-------GRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFR 54
           I+D A+LGLVKLQQ+       GRK+SWFIPTKLATNLS SL+DSS+ KEGFVVVETNFR
Sbjct: 255 IRDLAELGLVKLQQMSRVKPWQGRKDSWFIPTKLATNLSASLSDSSSNKEGFVVVETNFR 314

Query: 55  MYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 114
           MYAYSTS+LHCEILRLFS++EYQLPNLIVG+ITKESLY AFENGITAEQIISFLQQNAHP
Sbjct: 315 MYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHP 374

Query: 115 RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWE 174
           RVAD+IP+VPENV DQIRLWE+D NRV+MT +H YE+FPS+D+F+  CDYARD   LLWE
Sbjct: 375 RVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWE 434

Query: 175 DSKKMRLVVNAEIHMHMREFLR 196
           D+KKMRL+V  E H  MREFLR
Sbjct: 435 DAKKMRLIVRVEFHSEMREFLR 456


>gi|414584751|tpg|DAA35322.1| TPA: hypothetical protein ZEAMMB73_386928 [Zea mays]
          Length = 451

 Score =  316 bits (809), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 154/197 (78%), Positives = 175/197 (88%), Gaps = 1/197 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           I+D A+LGLVK QQ GRK+SWFIPT+LATNLS SL+DSS+ KEGFVVVETNFRMYAYSTS
Sbjct: 255 IRDLAELGLVK-QQQGRKDSWFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTS 313

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           KLHCEILRLF+++EYQLPNLIVGA+TKES+Y AFENGITAEQIISFL+QNAHPRVAD+IP
Sbjct: 314 KLHCEILRLFARVEYQLPNLIVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIP 373

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           +VPENV DQIRLWE+DLNRVEM P+H YE+FPS++ FE  CDYARD   LLWED K+MRL
Sbjct: 374 AVPENVTDQIRLWETDLNRVEMIPSHLYEDFPSKEWFEQCCDYARDNGYLLWEDPKRMRL 433

Query: 182 VVNAEIHMHMREFLRGQ 198
           +V  E H  MREFLR Q
Sbjct: 434 IVRGEFHPEMREFLRRQ 450


>gi|242074798|ref|XP_002447335.1| hypothetical protein SORBIDRAFT_06g033130 [Sorghum bicolor]
 gi|241938518|gb|EES11663.1| hypothetical protein SORBIDRAFT_06g033130 [Sorghum bicolor]
          Length = 451

 Score =  312 bits (799), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 153/197 (77%), Positives = 173/197 (87%), Gaps = 1/197 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           I+D A+LGLVK QQ GR + WFIPT+LATNLS SL+DSS+ KEGFVVVETNFRMYAYSTS
Sbjct: 255 IRDLAELGLVK-QQQGRTDRWFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTS 313

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           KLHCEILRLFS++EYQLPNLIVGA+TKES+Y AFENGITAEQIISFL+QNAHPRVAD+IP
Sbjct: 314 KLHCEILRLFSRVEYQLPNLIVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIP 373

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           +VPENV DQIRLWE+D NRVEM P+H YE+FPS++ FE  CDYARD   LLWEDSK+MRL
Sbjct: 374 TVPENVTDQIRLWETDRNRVEMIPSHLYEDFPSKEWFEQCCDYARDNGYLLWEDSKRMRL 433

Query: 182 VVNAEIHMHMREFLRGQ 198
           +V  E H  MREFLR Q
Sbjct: 434 IVRGEFHPEMREFLRRQ 450


>gi|357166758|ref|XP_003580836.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Brachypodium distachyon]
          Length = 451

 Score =  308 bits (789), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 153/197 (77%), Positives = 172/197 (87%), Gaps = 1/197 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           I+D A+LGLVK+QQ GRK+SWFIPTKLATNLS SL+DSSA KEG VVVETNFR+YAYS S
Sbjct: 255 IRDLAELGLVKVQQ-GRKDSWFIPTKLATNLSSSLSDSSASKEGIVVVETNFRLYAYSAS 313

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           +LHCEILRLFS++EYQLPNLIVGAITKESLY AF+NGITAEQIISFL+QNAHPRVAD+IP
Sbjct: 314 RLHCEILRLFSRVEYQLPNLIVGAITKESLYGAFDNGITAEQIISFLKQNAHPRVADKIP 373

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
            VPENV DQIRLWE+D NRV+M  +H YE+FPS+D+FE  CD ARD   LLWEDSKKMRL
Sbjct: 374 VVPENVTDQIRLWETDRNRVDMVLSHVYEDFPSKDMFEQCCDLARDNGFLLWEDSKKMRL 433

Query: 182 VVNAEIHMHMREFLRGQ 198
           +V  E H  MREFLR Q
Sbjct: 434 IVRVEFHQEMREFLRRQ 450


>gi|326510735|dbj|BAJ91715.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 451

 Score =  307 bits (787), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 152/197 (77%), Positives = 171/197 (86%), Gaps = 1/197 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           + D  +LGLVKLQQ GRK+SWFIPTKLATNLS SL+DS+A KEG VVVETNFR+YAYS S
Sbjct: 255 VMDLMELGLVKLQQ-GRKDSWFIPTKLATNLSSSLSDSAASKEGIVVVETNFRLYAYSAS 313

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           KLHCEILRLFS++EYQLPNLIVGAITKESLY AF+NGITAEQIISFLQQNAHPRV D+IP
Sbjct: 314 KLHCEILRLFSRVEYQLPNLIVGAITKESLYGAFDNGITAEQIISFLQQNAHPRVIDKIP 373

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
            VPENV DQIRLWE+D NRVEM  +H YE+FPS+D+FE  CD+ARD   LLWED+KKMRL
Sbjct: 374 IVPENVTDQIRLWENDRNRVEMILSHVYEDFPSKDMFEQCCDHARDNGYLLWEDAKKMRL 433

Query: 182 VVNAEIHMHMREFLRGQ 198
           +V+ E H  MREFLR Q
Sbjct: 434 IVSGEFHQEMREFLRRQ 450


>gi|357123176|ref|XP_003563288.1| PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH
           subunit 4-like [Brachypodium distachyon]
          Length = 452

 Score =  305 bits (782), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 152/197 (77%), Positives = 171/197 (86%), Gaps = 1/197 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           I+D A+LGLVK+QQ GRK+SWFIPTKLATNLS SL+DSSA KEG VVVETNFR+YAYS S
Sbjct: 256 IRDLAELGLVKVQQ-GRKDSWFIPTKLATNLSSSLSDSSASKEGIVVVETNFRLYAYSAS 314

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           +LHCEILRLFS++EYQLPNLIVGAITKESLY AF+NGITAEQIISFL+QNAHP VAD+IP
Sbjct: 315 RLHCEILRLFSRVEYQLPNLIVGAITKESLYGAFDNGITAEQIISFLKQNAHPXVADKIP 374

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
            VPENV DQIRLWE+D NRV+M  +H YE+FPS+D+FE  CD ARD   LLWEDSKKMRL
Sbjct: 375 VVPENVTDQIRLWETDRNRVDMVLSHVYEDFPSKDLFEQCCDLARDNGFLLWEDSKKMRL 434

Query: 182 VVNAEIHMHMREFLRGQ 198
           +V  E H  MREFLR Q
Sbjct: 435 IVRVEFHQEMREFLRRQ 451


>gi|90399251|emb|CAJ86205.1| B0213E10.4 [Oryza sativa Indica Group]
          Length = 470

 Score =  277 bits (708), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 133/195 (68%), Positives = 150/195 (76%), Gaps = 29/195 (14%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           I+D A+LGLVKLQQ                             GFVVVETNFRMYAYSTS
Sbjct: 302 IRDLAELGLVKLQQ-----------------------------GFVVVETNFRMYAYSTS 332

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           +LHCEILRLFS++EYQLPNLIVG+ITKESLY AFENGITAEQIISFLQQNAHPRVAD+IP
Sbjct: 333 RLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIP 392

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           +VPENV DQIRLWE+D NRV+MT +H YE+FPS+D+F+  CDYARD   LLWED+KKMRL
Sbjct: 393 AVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRL 452

Query: 182 VVNAEIHMHMREFLR 196
           +V  E H  MREFLR
Sbjct: 453 IVRVEFHSEMREFLR 467


>gi|115461368|ref|NP_001054284.1| Os04g0680000 [Oryza sativa Japonica Group]
 gi|113565855|dbj|BAF16198.1| Os04g0680000 [Oryza sativa Japonica Group]
          Length = 444

 Score =  269 bits (687), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 128/193 (66%), Positives = 147/193 (76%), Gaps = 29/193 (15%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           I+D A+LGLVKLQQ                             GFVVVETNFRMYAYSTS
Sbjct: 274 IRDLAELGLVKLQQ-----------------------------GFVVVETNFRMYAYSTS 304

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           +LHCEILRLFS++EYQLPNLIVG+ITKESLY AFENGITAEQIISFLQQNAHPRVAD+IP
Sbjct: 305 RLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIP 364

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           +VPENV DQIRLWE+D NRV+MT +H YE+FPS+D+F+  CDYARD   LLWED+KKMRL
Sbjct: 365 AVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRL 424

Query: 182 VVNAEIHMHMREF 194
           +V  E H  M ++
Sbjct: 425 IVRVEFHSEMHDY 437


>gi|167997877|ref|XP_001751645.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162697626|gb|EDQ83962.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 462

 Score =  245 bits (626), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 114/201 (56%), Positives = 159/201 (79%), Gaps = 3/201 (1%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSS--ARKEGFVVVETNFRMYAY 58
           ++ + A LGLVKLQQ G KESW+IPTKLA+NLS SL++S+     EGFVVVETNF++YAY
Sbjct: 255 VLDELAALGLVKLQQ-GMKESWYIPTKLASNLSASLSESTDWQSSEGFVVVETNFKVYAY 313

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           ++SKL  EILR F+++EYQLPNL+V  +TKES+  A  +GI+AEQIISFL+++AHP VA 
Sbjct: 314 TSSKLQTEILRCFTRLEYQLPNLVVATLTKESVNKALGSGISAEQIISFLRKHAHPHVAQ 373

Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
           +IP VPE V DQ+RLWE+D NRV+  PA++Y++FP+  ++EA   +ARD  GLL+ED+  
Sbjct: 374 KIPVVPETVSDQLRLWETDRNRVQFEPAYFYDDFPTMAIYEAVVAHARDLGGLLFEDASA 433

Query: 179 MRLVVNAEIHMHMREFLRGQN 199
            RL+V +++H  MR+++R Q+
Sbjct: 434 KRLIVRSDLHEDMRQYIRKQS 454


>gi|296084734|emb|CBI25875.3| unnamed protein product [Vitis vinifera]
          Length = 258

 Score =  241 bits (614), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 128/219 (58%), Positives = 151/219 (68%), Gaps = 25/219 (11%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           IKD  DLGLVKLQQ GRKESWFIPTKLATNLSMSL+D+S+RK+GFVVVETNFR+YAYS+S
Sbjct: 38  IKDLVDLGLVKLQQ-GRKESWFIPTKLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSS 96

Query: 62  KLHCEILRLFSKIEYQLPNLIV--------GAITKESL-----YNAFENGITAEQIISFL 108
           KLHCEILRLFS+    L + ++        G +   SL        F  G       S +
Sbjct: 97  KLHCEILRLFSRYASMLFDCLLCICYSSQKGTLLDVSLIMYECLRTFSLGKIIFWFFSSV 156

Query: 109 QQNAHP-----------RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDV 157
              A P            +A++IP +      QIRLWE+DLNRVE  P+H Y+EFPSRDV
Sbjct: 157 STVATPCHGYLVIFTISCLAEKIPCLQFTNMSQIRLWETDLNRVETMPSHLYDEFPSRDV 216

Query: 158 FEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
           FEAACD+AR+  GLLWEDSKKMRLVV AEIH+HMRE+LR
Sbjct: 217 FEAACDFAREYGGLLWEDSKKMRLVVKAEIHLHMREYLR 255


>gi|302806060|ref|XP_002984780.1| hypothetical protein SELMODRAFT_121159 [Selaginella moellendorffii]
 gi|302808297|ref|XP_002985843.1| hypothetical protein SELMODRAFT_123271 [Selaginella moellendorffii]
 gi|300146350|gb|EFJ13020.1| hypothetical protein SELMODRAFT_123271 [Selaginella moellendorffii]
 gi|300147366|gb|EFJ14030.1| hypothetical protein SELMODRAFT_121159 [Selaginella moellendorffii]
          Length = 459

 Score =  227 bits (578), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 116/199 (58%), Positives = 155/199 (77%), Gaps = 2/199 (1%)

Query: 3   KDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA-RKEGFVVVETNFRMYAYSTS 61
           ++ A LG+++LQ+ G K+ WFIPT+LAT LS SL++SSA + EGF++VETNFR+YAY++S
Sbjct: 260 EELAMLGVLQLQK-GMKDRWFIPTRLATGLSASLSESSAWQTEGFIMVETNFRVYAYTSS 318

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           KLH E L +F + EY LPN++VG+ITKES+  AF +GI+A+QII FLQQ+AHP VA ++P
Sbjct: 319 KLHIETLHVFVRTEYVLPNILVGSITKESVNGAFASGISADQIIKFLQQHAHPFVAQKVP 378

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
           SVPE VCDQIRLWESD  RV+  PA+ YE FPS  V+E+   +ARDR+GLLWED+ +  +
Sbjct: 379 SVPETVCDQIRLWESDRVRVQYLPAYCYEGFPSTSVYESVVAHARDRNGLLWEDANRKMI 438

Query: 182 VVNAEIHMHMREFLRGQNK 200
           VV  E H  +R FL+  NK
Sbjct: 439 VVGGEHHEAIRAFLQNINK 457


>gi|359497087|ref|XP_002267711.2| PREDICTED: general transcription factor IIH subunit 4-like [Vitis
           vinifera]
          Length = 238

 Score =  226 bits (575), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 117/195 (60%), Positives = 134/195 (68%), Gaps = 44/195 (22%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           IKD  DLGLVKLQQ GRKESWFIPTKLATNLSMSL+D+S+RK+GFVVVETNFR+YAYS+S
Sbjct: 85  IKDLVDLGLVKLQQ-GRKESWFIPTKLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSS 143

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           KLHCEILRLFS+                               +S +    H  +     
Sbjct: 144 KLHCEILRLFSR-----------------------------SSVSTVATPCHGYL----- 169

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRL 181
                    IRLWE+DLNRVE  P+H Y+EFPSRDVFEAACD+AR+  GLLWEDSKKMRL
Sbjct: 170 ---------IRLWETDLNRVETMPSHLYDEFPSRDVFEAACDFAREYGGLLWEDSKKMRL 220

Query: 182 VVNAEIHMHMREFLR 196
           VV AEIH+HMRE+LR
Sbjct: 221 VVKAEIHLHMREYLR 235


>gi|414584750|tpg|DAA35321.1| TPA: hypothetical protein ZEAMMB73_386928 [Zea mays]
          Length = 386

 Score =  217 bits (552), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 108/130 (83%), Positives = 123/130 (94%), Gaps = 1/130 (0%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTS 61
           I+D A+LGLVK QQ GRK+SWFIPT+LATNLS SL+DSS+ KEGFVVVETNFRMYAYSTS
Sbjct: 255 IRDLAELGLVK-QQQGRKDSWFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTS 313

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
           KLHCEILRLF+++EYQLPNLIVGA+TKES+Y AFENGITAEQIISFL+QNAHPRVAD+IP
Sbjct: 314 KLHCEILRLFARVEYQLPNLIVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIP 373

Query: 122 SVPENVCDQI 131
           +VPENV DQ+
Sbjct: 374 AVPENVTDQV 383


>gi|440791255|gb|ELR12502.1| TFIIH basal transcription factor complex subunit 2, putative
           [Acanthamoeba castellanii str. Neff]
          Length = 304

 Score =  166 bits (420), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 75/197 (38%), Positives = 132/197 (67%), Gaps = 5/197 (2%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR-KEGFVVVETNFRMYAYS 59
           +++D  DLG++  +   R   W  PT+LA  LS   T+++ R +EG+++V T++R+YAY+
Sbjct: 99  LLQDLQDLGIIYRKH--RDSKWLYPTQLAIGLSS--TEAAKRDQEGWIIVGTDYRIYAYT 154

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
           +S +   +L LF++IEYQLPN+++G + +E++  A + GI+A QI+ FL+ NAHP++   
Sbjct: 155 SSPVKLLLLSLFTQIEYQLPNMVMGILLRENIRQAVQVGISANQILQFLETNAHPQMKQN 214

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +PE++ DQ+RLWE++  R+ ++P ++Y++F S   F+ A  YARD   LL+ D+ K 
Sbjct: 215 TPIIPESIADQLRLWEAEDRRLSLSPGYFYDDFASLAAFKKAEKYARDVGALLYSDAAKR 274

Query: 180 RLVVNAEIHMHMREFLR 196
            L V  + H  +R +++
Sbjct: 275 FLFVTEQGHQLLRRYVK 291


>gi|384246869|gb|EIE20357.1| Tfb2-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 451

 Score =  166 bits (419), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 86/197 (43%), Positives = 119/197 (60%), Gaps = 4/197 (2%)

Query: 4   DFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR--KEGFVVVETNFRMYAYSTS 61
           D A LGL+          W  PT+LA  L+   +  +     +GFVVVETN+R+YAY++S
Sbjct: 252 DMAQLGLLMPFTAKDGSVWLAPTRLALALAGGSSGQAQHDVTDGFVVVETNYRVYAYTSS 311

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIP 121
            L   +LRLF++ E  LPNL VG +T+ES+  A   G++A+QI+ +L+Q+AHP VA R P
Sbjct: 312 LLQTALLRLFTRCECILPNLFVGVLTRESVTGALACGLSADQIVLYLRQHAHPHVASRTP 371

Query: 122 SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK--M 179
            VPE V DQ+RLW++D  RV    A  Y++FPS  VF+ +   AR     LWED K    
Sbjct: 372 VVPEVVADQVRLWQADTMRVRHNRAVLYDDFPSAQVFQLSAQKARTLGVWLWEDPKAGMG 431

Query: 180 RLVVNAEIHMHMREFLR 196
           RL V    H  MRE+++
Sbjct: 432 RLAVQEAGHDAMREYIK 448


>gi|290987279|ref|XP_002676350.1| predicted protein [Naegleria gruberi]
 gi|284089952|gb|EFC43606.1| predicted protein [Naegleria gruberi]
          Length = 532

 Score =  163 bits (413), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 83/211 (39%), Positives = 130/211 (61%), Gaps = 13/211 (6%)

Query: 1   MIKDFADLGLVKLQQVGRKES---WFIPTKLATNLSMSLTDS----------SARKEGFV 47
           ++ DF DLG++ L +  +K+    +F PT LA +L++S++ S          ++   G++
Sbjct: 319 LLVDFNDLGIIYLHRDKKKKIKDKYFFPTPLAKSLTVSMSTSYDLISSFGSHNSINNGYI 378

Query: 48  VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
           +VETN+R+YAY+ S L   +L LF   EY+LPN++VG IT+ ++  A +NGI+A QI+ F
Sbjct: 379 IVETNYRVYAYTNSPLQIALLSLFIFPEYRLPNMVVGLITRSTIREALKNGISAHQILQF 438

Query: 108 LQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
           L+ NAHP++  + P +P+ V DQI LWE + NRV  T +  Y++F +     A  DYAR 
Sbjct: 439 LRLNAHPQMRLKKPVIPDTVSDQILLWEKERNRVIKTTSVVYDKFTNVSELNATVDYARR 498

Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
           +  LLW   +KM +V   E H  M+EF+  Q
Sbjct: 499 QGALLWHSEEKMMMVCKREFHGLMKEFISSQ 529


>gi|145351329|ref|XP_001420034.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580267|gb|ABO98327.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 495

 Score =  162 bits (409), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 82/200 (41%), Positives = 118/200 (59%), Gaps = 9/200 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA-------RKEGFVVVETNF 53
           ++ D   LGL+   +V +K+ +++PT LA+ LS   T             EG ++VETN+
Sbjct: 284 VVLDLTHLGLLYTFEV-KKKFYYVPTLLASGLSGGFTGDDGDTKAAAAAAEGHIIVETNY 342

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY++S +  EILRLF++ +Y+LPNL VG +T+ES+ NA   G+ AEQI+ +++ +AH
Sbjct: 343 RVYAYTSSAVEMEILRLFTRADYRLPNLYVGMLTRESVQNALRAGVDAEQIVGYIRAHAH 402

Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA-CDYARDRSGLL 172
            +V  + PSVP  VCDQIRLW  D+ R+E      Y +FP    F  A    A  R  LL
Sbjct: 403 KQVRRKKPSVPSTVCDQIRLWARDMERMEAEECVLYCDFPQAGTFYGAVVSEAEKRGALL 462

Query: 173 WEDSKKMRLVVNAEIHMHMR 192
           W D    RL V A  H  M+
Sbjct: 463 WRDDASRRLTVRASAHDEMK 482


>gi|440800489|gb|ELR21525.1| transcription factor tfb2 subfamily protein [Acanthamoeba
           castellanii str. Neff]
          Length = 465

 Score =  159 bits (401), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 131/197 (66%), Gaps = 5/197 (2%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR-KEGFVVVETNFRMYAYS 59
           +++D  DLG++  +   +   W  PT+LA  LS   T+++ R +EG+++V T++R+YAY+
Sbjct: 258 LLQDLQDLGIIYRKH--KDSKWLYPTQLAIGLSS--TEAAKRDQEGWIIVGTDYRIYAYT 313

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
           +S +   +L LF++IEYQLPN+++G + +E++  A + GI+A QI+ FL+ NAHP++   
Sbjct: 314 SSPVKLLLLSLFTQIEYQLPNMVMGILLRENIRQAVQVGISANQILQFLETNAHPQMKQN 373

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +PE++ DQ+RLWE++  R+ ++  ++Y++F S   F+ A  YARD   L++ D+ K 
Sbjct: 374 TPIIPESIADQLRLWEAEDRRLSLSSGYFYDDFASLAAFKKAEKYARDVGALIYSDATKR 433

Query: 180 RLVVNAEIHMHMREFLR 196
            L V+   H  +R +++
Sbjct: 434 FLFVSEPGHQLLRRYVK 450


>gi|390594544|gb|EIN03954.1| transcription factor Tfb2 [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 467

 Score =  157 bits (397), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 82/199 (41%), Positives = 123/199 (61%), Gaps = 6/199 (3%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL----TDSSARKEGFVVVETNFRMY 56
           M++D  D GL+  Q+   K   F PT+LAT L+ S     T  S   EGF+V+ETN+R+Y
Sbjct: 263 MLEDLRDYGLIWQQKPTSKR--FSPTRLATTLTSSSPPLPTSGSGVSEGFIVLETNYRLY 320

Query: 57  AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV 116
           AY+ + L   +L LF+ +  + PNL+VG IT+ES+  A ++GI+AEQIIS+L  +AHP++
Sbjct: 321 AYTDNPLQTAVLALFTTLRSRFPNLVVGQITRESVKRALQSGISAEQIISYLSTHAHPQM 380

Query: 117 ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
               P +P  V DQIRLWE + NR++      Y EF S+  +E   +YAR+ + +LWE+ 
Sbjct: 381 RKNNPLIPVTVQDQIRLWELEKNRLKSQEGFLYTEFASQGDYEYVLNYARELNVVLWENP 440

Query: 177 KKMRLVVNAEIHMHMREFL 195
            K     + E H ++R F+
Sbjct: 441 VKRCFFGSMEGHANIRGFI 459


>gi|403180193|ref|XP_003338499.2| hypothetical protein PGTG_19983 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375166015|gb|EFP94080.2| hypothetical protein PGTG_19983 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 679

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 77/201 (38%), Positives = 123/201 (61%), Gaps = 9/201 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL-----TDSSARKEGFVVVETNFRM 55
           +++D  D GLV +    R    F PT+LAT L+ +       + +  + GF+V+ETN+R+
Sbjct: 102 VLQDLVDYGLVLVSAPDR----FYPTRLATTLTSTAPPLVSAERAQEEHGFLVLETNYRI 157

Query: 56  YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
           YAY+++ L   +L LF  + Y+ PNL+VGA+T+ES+ +A  NGITA+Q+I +L  +AHP+
Sbjct: 158 YAYTSNPLQIAVLNLFLSLRYRFPNLVVGAVTRESIKSALSNGITADQVIMYLHTHAHPQ 217

Query: 116 VADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWED 175
           +    P +P  V DQIRLWE + NR+     + YE+F S   +++   Y+R    +LWE 
Sbjct: 218 MRKLEPLLPPTVVDQIRLWELEKNRIRAQEGYLYEDFKSAAEYDSVIQYSRKLGIVLWEH 277

Query: 176 SKKMRLVVNAEIHMHMREFLR 196
           +   +L V  + H+ +REF R
Sbjct: 278 AGLRKLFVGYDGHLTLREFFR 298


>gi|403163300|ref|XP_003323393.2| hypothetical protein PGTG_04930 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375163998|gb|EFP78974.2| hypothetical protein PGTG_04930 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 887

 Score =  157 bits (396), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 77/201 (38%), Positives = 123/201 (61%), Gaps = 9/201 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL-----TDSSARKEGFVVVETNFRM 55
           +++D  D GLV +    R    F PT+LAT L+ +       + +  + GF+V+ETN+R+
Sbjct: 307 VLQDLVDYGLVLVSAPDR----FYPTRLATTLTSTAPPLVSAERAQEEHGFLVLETNYRI 362

Query: 56  YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
           YAY+++ L   +L LF  + Y+ PNL+VGA+T+ES+ +A  NGITA+Q+I +L  +AHP+
Sbjct: 363 YAYTSNPLQIAVLNLFLSLRYRFPNLVVGAVTRESIKSALSNGITADQVIMYLHTHAHPQ 422

Query: 116 VADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWED 175
           +    P +P  V DQIRLWE + NR+     + YE+F S   +++   Y+R    +LWE 
Sbjct: 423 MRKLEPLLPPTVVDQIRLWELEKNRIRAQEGYLYEDFKSAAEYDSVIQYSRKLGIVLWEH 482

Query: 176 SKKMRLVVNAEIHMHMREFLR 196
           +   +L V  + H+ +REF R
Sbjct: 483 AGLRKLFVGYDGHLTLREFFR 503


>gi|72074363|ref|XP_796111.1| PREDICTED: general transcription factor IIH subunit 4
           [Strongylocentrotus purpuratus]
          Length = 469

 Score =  156 bits (395), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 82/195 (42%), Positives = 122/195 (62%), Gaps = 4/195 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
           ++   +LGLV   Q  RK   + PT+LA NL+  ++  +    K+GF+VVETNFR+YAY+
Sbjct: 267 LQHLRELGLVF--QRKRKSMRYYPTRLAINLASGVSSMAKDDHKDGFIVVETNFRVYAYT 324

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S L  EIL LF  + Y+ PNL V A+T+ES+  A  NGITAEQI+SFL+ +AHP +  +
Sbjct: 325 ESDLQVEILGLFCSMMYRFPNLSVAALTRESVQLAISNGITAEQILSFLRTHAHPNMRLK 384

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P VP  + DQ+RLWE + +R+  T    Y EF S   FE   DYA+D   L+W+ + + 
Sbjct: 385 TPIVPPTISDQVRLWELERDRLSFTQGIIYNEFLSLHDFEVLRDYAKDLGVLIWDSTARR 444

Query: 180 RLVVNAEIHMHMREF 194
            ++V+   H  ++++
Sbjct: 445 IMIVSPAGHDSVKKY 459


>gi|260810774|ref|XP_002600123.1| hypothetical protein BRAFLDRAFT_57175 [Branchiostoma floridae]
 gi|229285409|gb|EEN56135.1| hypothetical protein BRAFLDRAFT_57175 [Branchiostoma floridae]
          Length = 457

 Score =  156 bits (394), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 79/197 (40%), Positives = 125/197 (63%), Gaps = 4/197 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD--SSARKEGFVVVETNFRMYAYS 59
           ++   +LGLV   Q  RK   F PT+LA +L+  ++D      KEG++VVE+N+R+YAY+
Sbjct: 256 LQHLRELGLVN--QRKRKSGRFYPTRLAIHLASGISDVEKDFHKEGYLVVESNYRIYAYT 313

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS+I Y+ PN++V  +T++S+  A   GITA+QI+ FL+ NAHP+   R
Sbjct: 314 DSELQVALIGLFSEILYRFPNMVVANLTRDSVQEAVVRGITADQILHFLRVNAHPKALHR 373

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
           +P VP  + DQIRLWE + +R+  T    Y +F S+  FE   +YA+D   LLWE++ K 
Sbjct: 374 VPIVPPTISDQIRLWEMERDRLTFTEGVLYNQFLSQPDFEMLRNYAKDLGVLLWENNPKR 433

Query: 180 RLVVNAEIHMHMREFLR 196
            +VV+   H  ++ + +
Sbjct: 434 LMVVSKAGHDDVKRYWK 450


>gi|308452336|ref|XP_003089005.1| hypothetical protein CRE_21072 [Caenorhabditis remanei]
 gi|308243592|gb|EFO87544.1| hypothetical protein CRE_21072 [Caenorhabditis remanei]
          Length = 468

 Score =  155 bits (391), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 80/203 (39%), Positives = 133/203 (65%), Gaps = 9/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM--SLTDSSARK--EGFVVVETNFRMY 56
           ++    +LG++ +++  RK+  F  T L T+L+   ++ D+SA +   G V+VETNFR+Y
Sbjct: 258 LLNHLRELGVIFIRK--RKDGVFFLTHLLTHLATNETIDDTSAERVSNGKVIVETNFRVY 315

Query: 57  AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
           AY++S L   I+ LF+++ Y+  ++ VG IT+ES+  A ++GITA QIISFL+ NAHP+ 
Sbjct: 316 AYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 375

Query: 116 --VADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
              +  I  +P  V DQIRLWE +  R+++  A+ Y  F S D F+  CDYA++R  LLW
Sbjct: 376 IATSGAINCLPITVADQIRLWEDERRRMDLKDAYIYSHFESEDEFQGVCDYAQERGILLW 435

Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
            ++++  ++VN E H ++R++ +
Sbjct: 436 ANAQQKLVIVNEEGHEYVRQWYK 458


>gi|308460924|ref|XP_003092760.1| hypothetical protein CRE_20979 [Caenorhabditis remanei]
 gi|308252519|gb|EFO96471.1| hypothetical protein CRE_20979 [Caenorhabditis remanei]
          Length = 470

 Score =  155 bits (391), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 80/203 (39%), Positives = 133/203 (65%), Gaps = 9/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM--SLTDSSARK--EGFVVVETNFRMY 56
           ++    +LG++ +++  RK+  F  T L T+L+   ++ D+SA +   G V+VETNFR+Y
Sbjct: 261 LLNHLRELGVIFIRK--RKDGVFFLTHLLTHLATNETIDDTSAERVSNGKVIVETNFRVY 318

Query: 57  AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
           AY++S L   I+ LF+++ Y+  ++ VG IT+ES+  A ++GITA QIISFL+ NAHP+ 
Sbjct: 319 AYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 378

Query: 116 --VADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
              +  I  +P  V DQIRLWE +  R+++  A+ Y  F S D F+  CDYA++R  LLW
Sbjct: 379 IATSGAINCLPITVADQIRLWEDERRRMDLKDAYIYSHFESEDEFQGVCDYAQERGILLW 438

Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
            ++++  ++VN E H ++R++ +
Sbjct: 439 ANAQQKLVIVNEEGHEYVRQWYK 461


>gi|395332344|gb|EJF64723.1| transcription factor Tfb2 [Dichomitus squalens LYAD-421 SS1]
          Length = 506

 Score =  154 bits (389), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 79/203 (38%), Positives = 122/203 (60%), Gaps = 10/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL--------TDSSARKEGFVVVETN 52
           M++D  D GL++ +    ++  F PT+LAT L+ S         T   +  +GF+V+ETN
Sbjct: 298 MLEDLRDYGLIRQRTATSRK--FSPTRLATTLTSSSPPLPTSAGTGDGSHVQGFIVLETN 355

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ + L   +L LF  ++Y+ PNL+VG IT++S+  A  NGITA+QIIS+L  +A
Sbjct: 356 YRLYAYTDNPLQIAVLNLFVTLKYRFPNLVVGQITRDSVKKALANGITADQIISYLITHA 415

Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
           HP++    P +P  V DQIRLWE + NRV+    + Y  F S   +E   DYA+    +L
Sbjct: 416 HPQMRKNKPLLPVTVQDQIRLWELERNRVKSQEGYLYTAFASHADYEWVLDYAKKLGVVL 475

Query: 173 WEDSKKMRLVVNAEIHMHMREFL 195
           WE+  K     + E HM++R ++
Sbjct: 476 WENPSKRCFFGSVEGHMNIRGYI 498


>gi|343428936|emb|CBQ72481.1| probable TFB2-TFIIH subunit (transcription/repair factor)
           [Sporisorium reilianum SRZ2]
          Length = 493

 Score =  154 bits (389), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 79/203 (38%), Positives = 129/203 (63%), Gaps = 9/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS-----LTDSSARKE--GFVVVETNF 53
           M++DF D GLV  ++   +   F PT+LAT L+ S     L+ +   +E  G++++ETN+
Sbjct: 289 MLEDFRDYGLVYQRKASSRR--FYPTRLATTLTSSAAVPLLSSNGTEQEERGYIILETNY 346

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY++++L   +L LF  I+ + PNL+VG+IT++S+ +A  NGITAEQII++L  +AH
Sbjct: 347 RLYAYTSNRLRVAVLSLFVTIKARFPNLVVGSITRDSVKSALANGITAEQIITYLTHHAH 406

Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
            ++    P +P  V DQIRLWE + NRV+      + +F S+  FE   +YA+    L+W
Sbjct: 407 VQMHRNDPLLPVTVSDQIRLWEREKNRVQQNLGSLFTDFTSQFDFEEVRNYAKQLGVLVW 466

Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
           ED  K R  V+   ++ +R+++R
Sbjct: 467 EDEGKRRFFVDEAGNVPVRDYIR 489


>gi|390370355|ref|XP_783140.3| PREDICTED: general transcription factor IIH subunit 4-like, partial
           [Strongylocentrotus purpuratus]
          Length = 180

 Score =  154 bits (389), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 77/179 (43%), Positives = 114/179 (63%), Gaps = 2/179 (1%)

Query: 18  RKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIE 75
           RK   + PT+LA NL+  ++  +    K+GF+VVETNFR+YAY+ S L  EIL LF  + 
Sbjct: 1   RKSMRYYPTRLAINLASGVSSMAKDDHKDGFIVVETNFRIYAYTESDLQVEILGLFCSMM 60

Query: 76  YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWE 135
           Y+ PNL V A+T+ES+  A  NGITAEQI+SFL+ +AHP +  + P VP  + DQ+RLWE
Sbjct: 61  YRFPNLSVAALTRESVQLAISNGITAEQILSFLRTHAHPNMRLKTPIVPPTISDQVRLWE 120

Query: 136 SDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREF 194
            + +R+  T    Y EF S   FE   DYA+D   L+W+ + +  ++V+   H  ++++
Sbjct: 121 LERDRLSFTQGIIYNEFLSLHDFEVLRDYAKDLGVLIWDSTARRIMIVSPAGHDSVKKY 179


>gi|341886092|gb|EGT42027.1| hypothetical protein CAEBREN_00374 [Caenorhabditis brenneri]
          Length = 460

 Score =  154 bits (388), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 82/205 (40%), Positives = 134/205 (65%), Gaps = 9/205 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM--SLTDSSARK--EGFVVVETNFRMY 56
           ++    +LG++ +++  RK+  F  T+L T+L+   ++ D+SA K   G V+VETNFR+Y
Sbjct: 249 LLNHLRELGVIFIRK--RKDGVFFLTQLLTHLATNETIDDTSAEKVSNGKVIVETNFRVY 306

Query: 57  AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
           AY++S L   I+ LF+++ Y+  ++ VG IT+ES+  A ++GITA QIISFL+ NAHP+ 
Sbjct: 307 AYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 366

Query: 116 VADRIP--SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
           +A   P   +P  V DQIRLWE +  R+++  ++ Y  F S + F   CDYA++R  LLW
Sbjct: 367 IATSGPVNCLPITVADQIRLWEDERRRMDLKDSYIYSNFESDEEFNGVCDYAKERKILLW 426

Query: 174 EDSKKMRLVVNAEIHMHMREFLRGQ 198
            D ++  ++VN E H ++R++ + Q
Sbjct: 427 ADYQRKLVIVNEEGHEYVRQWYKQQ 451


>gi|412992641|emb|CCO18621.1| predicted protein [Bathycoccus prasinos]
          Length = 542

 Score =  153 bits (387), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 81/185 (43%), Positives = 118/185 (63%), Gaps = 11/185 (5%)

Query: 22  WFIPTKLATNLSMSLTDSSA-----RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEY 76
           W++PT L+  LS   T SSA     R +G ++VETNFR+YAY+ S+L  E+LRLF++ +Y
Sbjct: 358 WYVPTVLSAGLSSVSTTSSAKSALARIDGHIIVETNFRVYAYTHSELETEVLRLFTRPDY 417

Query: 77  QLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR--VADRI--PSVPENVCDQIR 132
           +LPN  VG IT++S+ +A   GI+ +QI+++LQ  AHPR  +  R   P+VP  VCDQIR
Sbjct: 418 KLPNAYVGMITRDSILDAMRAGISPDQIVNYLQTRAHPRCTIGKRPNHPAVPPTVCDQIR 477

Query: 133 LWESDLNRVEMTPAHYYEEFPSR-DVFEAACDYARDRSG-LLWEDSKKMRLVVNAEIHMH 190
           LW  DL RV+      Y +FP + + F+ A + AR+    +LW D  K R  V+A+ H  
Sbjct: 478 LWARDLYRVKADDCVMYTDFPMQGNQFQDAVNNARNVGAQILWMDENKRRFAVDADSHER 537

Query: 191 MREFL 195
           ++ FL
Sbjct: 538 LKVFL 542


>gi|384484453|gb|EIE76633.1| hypothetical protein RO3G_01337 [Rhizopus delemar RA 99-880]
          Length = 284

 Score =  153 bits (387), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 77/218 (35%), Positives = 126/218 (57%), Gaps = 24/218 (11%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS----------------------LTD 38
           M++D  D G+V   Q  +    + PT+LAT L+                         TD
Sbjct: 63  MLEDLRDYGIVY--QRKKHSKRYYPTRLATTLTSGKSALATVAGKYNHMMQETNIDDTTD 120

Query: 39  SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
           + +  +GF+++ETN+++YAY+ S L   +L LF +++ +  N++ G IT++S+ NA   G
Sbjct: 121 TESVDQGFIILETNYKLYAYTDSPLQIAVLNLFVQLQSRFRNMVTGVITRDSIRNALMKG 180

Query: 99  ITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVF 158
           ITAEQII +LQ +AHP++    P +P  V DQIRLWE + NR++ TP++ Y EF  +  F
Sbjct: 181 ITAEQIIYYLQSHAHPQMRKETPVLPLTVVDQIRLWEMERNRLKPTPSYLYHEFNVQADF 240

Query: 159 EAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
           +AA  YARD   LLW +++K  + +    H +++ F++
Sbjct: 241 DAAEKYARDLGVLLWSNNQKRTMAITEAGHENVKGFVK 278


>gi|242021693|ref|XP_002431278.1| TFIIH basal transcription factor complex p52 subunit, putative
           [Pediculus humanus corporis]
 gi|212516535|gb|EEB18540.1| TFIIH basal transcription factor complex p52 subunit, putative
           [Pediculus humanus corporis]
          Length = 467

 Score =  153 bits (387), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 81/202 (40%), Positives = 127/202 (62%), Gaps = 5/202 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GL+   Q  R+   F PT+LA N++     S     KEG++++ETN+R+YAY+
Sbjct: 265 LQHLREFGLI--YQRKRRGGRFYPTRLALNIACGENKSLQQMNKEGYIIIETNYRVYAYT 322

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR-VAD 118
            S L   +L LF ++ Y+ PNL VG IT++S+  AF++GITAEQI+ FL+ +AHPR +A 
Sbjct: 323 DSNLQVALLGLFCEMLYRFPNLSVGLITRDSVRQAFKSGITAEQIVGFLRLHAHPRMIAV 382

Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
             P++P  V DQI+LWE++LNR+  +    Y +F S+  FEA  D A +   L+WE+ KK
Sbjct: 383 GPPTLPPTVVDQIKLWENELNRLVYSDGVLYSQFLSQADFEALRDRANELGVLVWENDKK 442

Query: 179 MRLVVNAEIHMHMREFLRGQNK 200
             +VV    H  +++F +  +K
Sbjct: 443 RTMVVTKSGHDDVKKFWKSYSK 464


>gi|328873270|gb|EGG21637.1| general transcription factor IIH [Dictyostelium fasciculatum]
          Length = 453

 Score =  152 bits (385), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 80/207 (38%), Positives = 125/207 (60%), Gaps = 15/207 (7%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT-----------DSSARKEGFVVV 49
           M+ D  + GLV ++    K   F PT+LA +L+   T            +S + +G++++
Sbjct: 247 MLFDLKEFGLVYVR--SEKSDVFYPTRLAISLATGRTVSLMNDLAQEISTSQKDQGYLIL 304

Query: 50  ETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQ 109
           ETNFR+YAY+ S L   ++ LF K+ Y+LPNL VG +T+ES+ +A  +GITA+QI+ F++
Sbjct: 305 ETNFRIYAYTQSSLQISLISLFVKMLYRLPNLAVGILTRESVRSALLHGITADQIVDFVK 364

Query: 110 QNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRS 169
           QNAHP +   I   PE V +QIRLWES+ NR+    A  ++ FP+ + F     +A+D+ 
Sbjct: 365 QNAHPNMV--ISGYPEVVFEQIRLWESERNRITYKKAVLFDSFPNAESFSKTVTFAKDQY 422

Query: 170 GLLWEDSKKMRLVVNAEIHMHMREFLR 196
            L+W D  K  LVVN E    +R +++
Sbjct: 423 FLVWFDETKKMLVVNDEGSEPIRNYIK 449


>gi|402224480|gb|EJU04542.1| transcription factor Tfb2 [Dacryopinax sp. DJM-731 SS1]
          Length = 466

 Score =  152 bits (384), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 79/204 (38%), Positives = 128/204 (62%), Gaps = 9/204 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----SMSLTDS----SARKEGFVVVETN 52
           M++DF D GL+  ++     + F PT+LAT L    S++ T S    +A  EGF+++ETN
Sbjct: 259 MLQDFLDYGLI-YRRNPDDHTRFYPTRLATTLTSTSSLAFTSSKHEKAASSEGFIILETN 317

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ + L   +L LF  +  +  NL++G +T+ES+  A  NGITA+QIIS+L  +A
Sbjct: 318 YRVYAYTENPLQIAVLNLFVALHSRFENLVIGRLTRESIKAALANGITADQIISYLTVHA 377

Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
           HP +    P +P  V DQIRLW+ + NR++    + YE+F S+  F+   +YA+    +L
Sbjct: 378 HPMMHKNNPVLPVTVQDQIRLWQLEKNRLKSENGYLYEDFNSQGDFDLVLNYAKQLDVVL 437

Query: 173 WEDSKKMRLVVNAEIHMHMREFLR 196
           WE+ +K ++ V  + H ++REF+R
Sbjct: 438 WENREKRKMFVREDGHENVREFIR 461


>gi|393221241|gb|EJD06726.1| transcription factor Tfb2 [Fomitiporia mediterranea MF3/22]
          Length = 464

 Score =  152 bits (383), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 75/203 (36%), Positives = 124/203 (61%), Gaps = 10/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL--------TDSSARKEGFVVVETN 52
           M++D  D GL+  ++   +   F PT+LAT L+ SL          ++A+ +GF+++ETN
Sbjct: 255 MLEDLRDYGLIWQKKASSRR--FYPTRLATTLTSSLPPLPTSGSGGANAQAQGFIILETN 312

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ + L   +L LF  ++Y+ PNL+VG++T++S+  A  NGITA+QII +L  +A
Sbjct: 313 YRIYAYTDNPLQTAVLNLFVTLKYRFPNLVVGSLTRDSVRKALANGITADQIIKYLTTHA 372

Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
           HP++    P +P  V DQIRLWE + NR++    + Y  F S+  +E    YA+    +L
Sbjct: 373 HPQMRKNDPLIPVTVQDQIRLWELERNRLKSQEGYLYTAFASQADYEFVLKYAKQLGVVL 432

Query: 173 WEDSKKMRLVVNAEIHMHMREFL 195
           WE++ K     + E H ++R F+
Sbjct: 433 WENAAKRCFFGSLEGHANIRGFI 455


>gi|388857860|emb|CCF48525.1| probable TFB2-TFIIH subunit (transcription/repair factor) [Ustilago
           hordei]
          Length = 492

 Score =  152 bits (383), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 80/203 (39%), Positives = 127/203 (62%), Gaps = 9/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS-----LTDSSARKE--GFVVVETNF 53
           M++DF D GLV  ++   +   F PT+LAT L+ S     L+ + A +E  G++++ETN+
Sbjct: 288 MLEDFRDYGLVYQRKASSRR--FYPTRLATTLTSSAAIPLLSSNGAEQEERGYIILETNY 345

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY+++ L   +L LF  I+ + PNL+VG+IT++S+ +A  NGITAEQII++L  +AH
Sbjct: 346 RLYAYTSNPLRVAVLSLFVTIKARFPNLVVGSITRDSVKSALANGITAEQIITYLTHHAH 405

Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
            ++    P +P  V DQIRLWE + NRV       + +F S+  FE   +YA+    L+W
Sbjct: 406 VQMHRNDPLLPVTVSDQIRLWEREKNRVVQNLGSLFTDFTSQFDFEEVRNYAKQLGVLVW 465

Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
           ED  K R  V+   +  +R+++R
Sbjct: 466 EDESKRRFFVDEAGNEPVRDYIR 488


>gi|409077631|gb|EKM77996.1| hypothetical protein AGABI1DRAFT_114851 [Agaricus bisporus var.
           burnettii JB137-S8]
 gi|426198975|gb|EKV48900.1| hypothetical protein AGABI2DRAFT_191077 [Agaricus bisporus var.
           bisporus H97]
          Length = 467

 Score =  151 bits (382), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 80/208 (38%), Positives = 127/208 (61%), Gaps = 11/208 (5%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS-------LTDSSARKEGFVVVETNF 53
           +++D  D GL+   Q   K   F PT+L T L+ S       ++ SS   +GF+V+ETN+
Sbjct: 259 LLEDLRDYGLI--WQSKPKSRRFSPTRLGTTLTSSSPPLPTTISASSGPLDGFIVLETNY 316

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY+ + L   +L LF  ++Y+ PNL+VG IT++S+  A  NGITA+QIIS+L  +AH
Sbjct: 317 RIYAYTDNPLQTAVLNLFVSLKYRFPNLVVGMITRDSVRRALMNGITADQIISYLITHAH 376

Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
           P++    P +P  V DQIRLWE + NR++ +  + Y  F S+  ++   +YA+    +LW
Sbjct: 377 PQMRKNNPLLPVTVQDQIRLWELEKNRLKSSEGYLYTAFTSQADYDLVLNYAKQLDVVLW 436

Query: 174 EDSKKMRLVVNAEIHMHMREFL--RGQN 199
           E++ K     + E H+ ++EF+  R QN
Sbjct: 437 ENAAKRCFFGSLEGHLRIKEFIERRTQN 464


>gi|147899535|ref|NP_001085110.1| general transcription factor IIH, polypeptide 4, 52kDa [Xenopus
           laevis]
 gi|47939830|gb|AAH72322.1| MGC83106 protein [Xenopus laevis]
          Length = 455

 Score =  151 bits (381), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 79/201 (39%), Positives = 125/201 (62%), Gaps = 4/201 (1%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NL+  ++ S   + K+GF+VVETN+R+YAY+
Sbjct: 255 LQHLREFGLVF--QRKRKSRRYYPTRLAINLASGISGSVVDSHKQGFIVVETNYRIYAYT 312

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PNL+V  +T+E++  A  NGITAEQII FL+  AHP +  +
Sbjct: 313 DSELQIALIALFSEMLYRFPNLVVAQVTRENVQQAIGNGITAEQIIHFLRTRAHPVMLQQ 372

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P++P  + DQIRLWE + +R+  +    Y +F S+  FE   +YARD   L++E+  K 
Sbjct: 373 NPALPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLRNYARDLGVLVFENPAKR 432

Query: 180 RLVVNAEIHMHMREFLRGQNK 200
            +VV    H  ++ F + Q +
Sbjct: 433 VMVVTPGGHSDVKRFWKRQKQ 453


>gi|442757633|gb|JAA70975.1| Putative rna polymer [Ixodes ricinus]
          Length = 459

 Score =  151 bits (381), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 76/197 (38%), Positives = 120/197 (60%), Gaps = 4/197 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR--KEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  R+   F PT+LA NL+  L +++ R  + G++VVETN+R+YAY+
Sbjct: 259 LQHLREFGLVY--QRKRRSGRFYPTRLAINLASGLKETNLRSYESGYIVVETNYRVYAYT 316

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   +L LF ++ Y+LPNL+VG +T+ES+  A  +GIT+ QII FLQ +AHP    +
Sbjct: 317 DSQLQVALLALFCELMYRLPNLVVGVLTRESVRQALRSGITSNQIIKFLQMHAHPEAQKQ 376

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQ+RLWE + +R        Y +F S+  F+   +YA D   L+W++  K 
Sbjct: 377 SPVIPPTIMDQLRLWELERDRFNFREGVLYSQFISQSDFQLLRNYASDLGVLIWDNPSKR 436

Query: 180 RLVVNAEIHMHMREFLR 196
            +VVN   H  ++ F +
Sbjct: 437 VMVVNRNGHDEVKRFWK 453


>gi|336370852|gb|EGN99192.1| hypothetical protein SERLA73DRAFT_90464 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336383609|gb|EGO24758.1| hypothetical protein SERLADRAFT_361686 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 469

 Score =  150 bits (380), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 77/203 (37%), Positives = 122/203 (60%), Gaps = 10/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL--------TDSSARKEGFVVVETN 52
           M++D  D GLV  ++   K   F PT+LAT L+ S         T    + +GF+V+ETN
Sbjct: 261 MLEDLRDYGLVWQRKATSKR--FSPTRLATTLTSSCPPLPTSTGTSGGPQSQGFIVLETN 318

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+   L   +L LF  ++Y+ PNL+VG +T+ES+  A  NGI+AEQIIS+L  +A
Sbjct: 319 YRIYAYTDKPLQTAVLNLFITMKYRFPNLVVGMLTRESVKKALSNGISAEQIISYLTTHA 378

Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
           HP++    P +P  V DQIRLWE + NR++    + Y++F S   +E   +YA+    +L
Sbjct: 379 HPQMRKNNPLLPVTVQDQIRLWELERNRLKSEEGYLYKDFGSHADYEYVLNYAKQLDVVL 438

Query: 173 WEDSKKMRLVVNAEIHMHMREFL 195
           WE++ +     + + H ++R F+
Sbjct: 439 WENTSRRCFFGSLDGHTNIRGFI 461


>gi|71024427|ref|XP_762443.1| hypothetical protein UM06296.1 [Ustilago maydis 521]
 gi|46097692|gb|EAK82925.1| hypothetical protein UM06296.1 [Ustilago maydis 521]
          Length = 491

 Score =  150 bits (380), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 78/203 (38%), Positives = 128/203 (63%), Gaps = 9/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS-----LTDSSARKE--GFVVVETNF 53
           M++DF D GLV  ++   +   F PT+LAT L+ S     L+ +   +E  G++++ETN+
Sbjct: 287 MLEDFRDYGLVYQRKASSRR--FYPTRLATTLTSSAAVPLLSSNGTEQEERGYIILETNY 344

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY+++ L   +L LF  I+ + PNL+VG+IT++S+ +A  NGITAEQII++L  +AH
Sbjct: 345 RLYAYTSNPLRVAVLSLFVTIKARFPNLVVGSITRDSVKSALANGITAEQIITYLTHHAH 404

Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
            ++    P +P  V DQIRLWE + NRV+      + +F S+  FE   +YA+    L+W
Sbjct: 405 LQMHRNDPLLPVTVSDQIRLWEREKNRVQQNLGSLFTDFTSQFDFEEVRNYAKQLGVLVW 464

Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
           +D +K R  V+   +  +R+++R
Sbjct: 465 QDEEKRRFFVDEAGNEPVRDYIR 487


>gi|393246879|gb|EJD54387.1| transcription factor Tfb2 [Auricularia delicata TFB-10046 SS5]
          Length = 480

 Score =  150 bits (380), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 75/202 (37%), Positives = 119/202 (58%), Gaps = 7/202 (3%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD-------SSARKEGFVVVETNF 53
           ++ +  D GL+  + +      F PT+LAT L  SL +       +SA   GF+++ETN+
Sbjct: 267 VLGELLDYGLIYQRALPGMSKRFFPTRLATTLMSSLPELPRTAGVASATSGGFIILETNY 326

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY+ + L   +L LF   + + PNL+VG +T++S+  A  NGITAEQII++L  +AH
Sbjct: 327 RLYAYTDNPLQIAVLNLFVSFKSRFPNLVVGMVTRDSVKKALANGITAEQIITYLSAHAH 386

Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
           P++    P +P  V DQ+RLWE + NRV+      Y +F S+  FE   DYAR    ++W
Sbjct: 387 PQMRKNNPLLPVTVQDQVRLWELEKNRVKAEEGFLYMDFTSQADFELVLDYARKLGVVIW 446

Query: 174 EDSKKMRLVVNAEIHMHMREFL 195
           E+ ++      A+ H ++R F+
Sbjct: 447 ENGRQRMFFGKADGHNNIRTFI 468


>gi|453232318|ref|NP_001263825.1| Protein Y73F8A.24, isoform d [Caenorhabditis elegans]
 gi|412976516|emb|CCO25635.1| Protein Y73F8A.24, isoform d [Caenorhabditis elegans]
          Length = 435

 Score =  150 bits (378), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 81/203 (39%), Positives = 131/203 (64%), Gaps = 9/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM--SLTDSSARK--EGFVVVETNFRMY 56
           ++    +LG++ +++  RK+  F  T+L T+L+   ++ D SA K   G V+VETNFR+Y
Sbjct: 216 LLNHLRELGVIFIRK--RKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVY 273

Query: 57  AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
           AY++S L   I+ LF+++ Y+  ++ VG IT+ES+  A ++GITA QIISFL+ NAHP+ 
Sbjct: 274 AYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 333

Query: 116 VADRIP--SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
           +A   P   +P  V DQIRLWE +  R+ +  A+ Y  F S D F   C+YAR ++ LLW
Sbjct: 334 IATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLW 393

Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
            D+++  ++VN + H  +R++ +
Sbjct: 394 SDNQQKLVIVNEDGHELVRQWYK 416


>gi|392901992|ref|NP_001255864.1| Protein Y73F8A.24, isoform b [Caenorhabditis elegans]
 gi|290457493|emb|CBK19514.1| Protein Y73F8A.24, isoform b [Caenorhabditis elegans]
          Length = 431

 Score =  150 bits (378), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 82/203 (40%), Positives = 131/203 (64%), Gaps = 9/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT--DSSARK--EGFVVVETNFRMY 56
           ++    +LG++ +++  RK+  F  T+L T+L+ + T  D SA K   G V+VETNFR+Y
Sbjct: 212 LLNHLRELGVIFIRK--RKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVY 269

Query: 57  AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
           AY++S L   I+ LF+++ Y+  ++ VG IT+ES+  A ++GITA QIISFL+ NAHP+ 
Sbjct: 270 AYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 329

Query: 116 VADRIP--SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
           +A   P   +P  V DQIRLWE +  R+ +  A+ Y  F S D F   C+YAR ++ LLW
Sbjct: 330 IATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLW 389

Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
            D+++  ++VN + H  +R++ +
Sbjct: 390 SDNQQKLVIVNEDGHELVRQWYK 412


>gi|62751795|ref|NP_001015845.1| general transcription factor IIH, polypeptide 4, 52kDa [Xenopus
           (Silurana) tropicalis]
 gi|58475911|gb|AAH90134.1| general transcription factor II H, polypeptide 4 [Xenopus
           (Silurana) tropicalis]
 gi|89272862|emb|CAJ82116.1| transcription factor tfb2 [Xenopus (Silurana) tropicalis]
 gi|114107989|gb|AAI22900.1| gtf2h4 protein [Xenopus (Silurana) tropicalis]
          Length = 455

 Score =  150 bits (378), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 79/199 (39%), Positives = 123/199 (61%), Gaps = 4/199 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NL+  ++ S   + K+GF+VVETN+R+YAY+
Sbjct: 255 LQHLREFGLVF--QRKRKSRRYYPTRLAINLASGISGSVVDSHKQGFIVVETNYRIYAYT 312

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PNL+V  +T+E++  A  NGITAEQII FL+  AHP +  +
Sbjct: 313 DSELQIALIALFSEMLYRFPNLVVAQVTRENVQQAIGNGITAEQIIHFLRTRAHPVMLQQ 372

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  +    Y +F S+  FE   +YARD   L++E+  K 
Sbjct: 373 NPVLPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLRNYARDLGVLVFENPAKR 432

Query: 180 RLVVNAEIHMHMREFLRGQ 198
            +VV    H  ++ F + Q
Sbjct: 433 VMVVTPGGHSDVKRFWKRQ 451


>gi|443894209|dbj|GAC71558.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit TFB2 [Pseudozyma antarctica
           T-34]
          Length = 491

 Score =  149 bits (377), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 77/203 (37%), Positives = 123/203 (60%), Gaps = 9/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS-------MSLTDSSARKEGFVVVETNF 53
           M++DF D GLV  ++   +   F PT+LAT L+       +S   S   + G++++ETN+
Sbjct: 287 MLEDFRDYGLVYQRKASSRR--FYPTRLATTLTSSAAAPLLSTNGSEPEERGYIILETNY 344

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY+++ L   +L LF  I+ + PNL+VG+IT++S+ +A  NGITAEQII++L  +AH
Sbjct: 345 RLYAYTSNPLRVAVLSLFVTIKARFPNLLVGSITRDSVKSALANGITAEQIITYLTHHAH 404

Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
            ++    P +P  V DQIRLWE + NRV+      + +F S+  F    +YA     L+W
Sbjct: 405 LQMHRNDPLLPVTVSDQIRLWEREKNRVQQNLGSLFTDFTSQPDFAEVRNYASQLGVLVW 464

Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
            D  K RL V+   +  +R+++R
Sbjct: 465 HDEPKRRLFVDEAGNEPVRDYIR 487


>gi|453232316|ref|NP_001263824.1| Protein Y73F8A.24, isoform c [Caenorhabditis elegans]
 gi|412976517|emb|CCO25636.1| Protein Y73F8A.24, isoform c [Caenorhabditis elegans]
          Length = 485

 Score =  149 bits (376), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 81/203 (39%), Positives = 131/203 (64%), Gaps = 9/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM--SLTDSSARK--EGFVVVETNFRMY 56
           ++    +LG++ +++  RK+  F  T+L T+L+   ++ D SA K   G V+VETNFR+Y
Sbjct: 266 LLNHLRELGVIFIRK--RKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVY 323

Query: 57  AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
           AY++S L   I+ LF+++ Y+  ++ VG IT+ES+  A ++GITA QIISFL+ NAHP+ 
Sbjct: 324 AYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 383

Query: 116 VADRIP--SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
           +A   P   +P  V DQIRLWE +  R+ +  A+ Y  F S D F   C+YAR ++ LLW
Sbjct: 384 IATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLW 443

Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
            D+++  ++VN + H  +R++ +
Sbjct: 444 SDNQQKLVIVNEDGHELVRQWYK 466


>gi|392901990|ref|NP_001255863.1| Protein Y73F8A.24, isoform a [Caenorhabditis elegans]
 gi|26985915|emb|CAB70230.2| Protein Y73F8A.24, isoform a [Caenorhabditis elegans]
          Length = 481

 Score =  149 bits (376), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 82/203 (40%), Positives = 131/203 (64%), Gaps = 9/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT--DSSARK--EGFVVVETNFRMY 56
           ++    +LG++ +++  RK+  F  T+L T+L+ + T  D SA K   G V+VETNFR+Y
Sbjct: 262 LLNHLRELGVIFIRK--RKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVY 319

Query: 57  AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
           AY++S L   I+ LF+++ Y+  ++ VG IT+ES+  A ++GITA QIISFL+ NAHP+ 
Sbjct: 320 AYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 379

Query: 116 VADRIP--SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
           +A   P   +P  V DQIRLWE +  R+ +  A+ Y  F S D F   C+YAR ++ LLW
Sbjct: 380 IATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLW 439

Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
            D+++  ++VN + H  +R++ +
Sbjct: 440 SDNQQKLVIVNEDGHELVRQWYK 462


>gi|303272589|ref|XP_003055656.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463630|gb|EEH60908.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 476

 Score =  149 bits (375), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 68/152 (44%), Positives = 103/152 (67%), Gaps = 2/152 (1%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           ++VETNFR+YAY++S +  EILRLF++ +Y+LPNL VG +T+E++  A   GI+AEQI+S
Sbjct: 318 IIVETNFRVYAYTSSAVEVEILRLFTRPDYKLPNLYVGMMTREAVVTALRGGISAEQIVS 377

Query: 107 FLQQNAHPRVADRI-PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPS-RDVFEAACDY 164
           +L+++AHP+      P++P  VCDQIRLW  D NRV+ TP   Y +FP+   +FE   + 
Sbjct: 378 YLRKHAHPQARKTPGPAIPATVCDQIRLWSKDENRVKYTPCVLYCDFPTGTGMFEKVAEI 437

Query: 165 ARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
           A++R   LW D   ++L V  E H  M++  +
Sbjct: 438 AKERGLYLWGDPVGLKLAVREEGHESMKDVFK 469


>gi|327266348|ref|XP_003217968.1| PREDICTED: general transcription factor IIH subunit 4-like [Anolis
           carolinensis]
          Length = 460

 Score =  149 bits (375), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 77/200 (38%), Positives = 122/200 (61%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSS--ARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++  +   R +GF++VETN+R+YAY+
Sbjct: 260 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGISGITIDTRNQGFIIVETNYRIYAYT 317

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PNL+V  +T+ES+  A  NGITA+QII FL+  AHP +  +
Sbjct: 318 DSELQIALIALFSEMLYRFPNLVVAQVTRESVQQAIANGITADQIIHFLRTRAHPVMLKQ 377

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  +    Y +F S+  FE   D+AR+   L++E+  K 
Sbjct: 378 TPVLPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLRDHARELGVLIFENPSKR 437

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 438 LMVVTPAGHSDVKRFWKRQK 457


>gi|342319113|gb|EGU11064.1| RNA polymerase II transcription factor B subunit 2 [Rhodotorula
           glutinis ATCC 204091]
          Length = 496

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 80/205 (39%), Positives = 124/205 (60%), Gaps = 11/205 (5%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS---LTDS--SARKEGFVVVETNFRM 55
           +++D AD GLV L +  R    F PT+LAT L+ S   L  S  S  ++GF+V+ETN+++
Sbjct: 275 VLRDLADYGLVYLPE--RNAPIFYPTRLATTLTSSAPPLVSSRHSNEEKGFIVLETNYKL 332

Query: 56  YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
           YAY+++ L   +L LF+ ++ +  N + G IT+ES+     NGITA QIIS+L   AHP+
Sbjct: 333 YAYTSNPLQIAVLGLFAHLKTRFANFVTGHITRESIRRGLANGITANQIISYLASRAHPQ 392

Query: 116 VADRIPS----VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGL 171
           +  +  S    +P  V DQIRLWE +  R++ T  + Y+EF S   +E   +YAR+   +
Sbjct: 393 MRAQAGSDDKLLPITVVDQIRLWEHERRRIQTTEGYLYDEFSSTHDYELVVNYAREIGSV 452

Query: 172 LWEDSKKMRLVVNAEIHMHMREFLR 196
           L E  K  ++ V A+ H  +REF++
Sbjct: 453 LLELPKARKVFVTADGHQQVREFIK 477


>gi|155371845|ref|NP_001094527.1| general transcription factor IIH subunit 4 [Bos taurus]
 gi|148878127|gb|AAI46241.1| GTF2H4 protein [Bos taurus]
 gi|296474219|tpg|DAA16334.1| TPA: general transcription factor IIH, polypeptide 4, 52kDa [Bos
           taurus]
          Length = 463

 Score =  148 bits (373), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +  +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLMFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|281337500|gb|EFB13084.1| hypothetical protein PANDA_018954 [Ailuropoda melanoleuca]
          Length = 463

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +  +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|338718559|ref|XP_003363845.1| PREDICTED: general transcription factor IIH subunit 4 [Equus
           caballus]
          Length = 463

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +  +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|426250586|ref|XP_004019016.1| PREDICTED: general transcription factor IIH subunit 4 [Ovis aries]
          Length = 463

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +  +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|440896165|gb|ELR48176.1| General transcription factor IIH subunit 4 [Bos grunniens mutus]
          Length = 463

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +  +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|178056954|ref|NP_001116592.1| general transcription factor IIH subunit 4 [Sus scrofa]
 gi|41529162|dbj|BAD08424.1| general transcription factor IIH, polypeptide 4 [Sus scrofa]
 gi|47496805|dbj|BAD08426.2| general transcription factor IIH, polypeptide 4 [Sus scrofa]
          Length = 463

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/200 (38%), Positives = 122/200 (61%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
           ++   +LGLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY+
Sbjct: 263 LQHLRELGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|417401351|gb|JAA47565.1| Putative rna polymer [Desmodus rotundus]
          Length = 463

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +  +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHGDVKRFWKRQK 460


>gi|344307658|ref|XP_003422497.1| PREDICTED: general transcription factor IIH subunit 4 [Loxodonta
           africana]
          Length = 463

 Score =  147 bits (371), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 76/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +  +A + GF++VETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIIVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|359320969|ref|XP_003639474.1| PREDICTED: general transcription factor IIH subunit 4-like [Canis
           lupus familiaris]
          Length = 463

 Score =  147 bits (371), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/200 (38%), Positives = 122/200 (61%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLS--MSLTDSSARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  +S    +A + GF++VETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIIVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|149031822|gb|EDL86757.1| rCG41794 [Rattus norvegicus]
          Length = 209

 Score =  147 bits (371), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 76/200 (38%), Positives = 122/200 (61%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +  +  + GF+VVETN+R+YAY+
Sbjct: 9   LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 66

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 67  ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 126

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 127 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 186

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 187 LMVVTPAGHSDVKRFWKRQK 206


>gi|410958682|ref|XP_003985944.1| PREDICTED: general transcription factor IIH subunit 4 [Felis catus]
          Length = 463

 Score =  147 bits (371), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 77/200 (38%), Positives = 123/200 (61%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +  +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 SPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|307106040|gb|EFN54287.1| hypothetical protein CHLNCDRAFT_25123 [Chlorella variabilis]
          Length = 458

 Score =  147 bits (370), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 75/183 (40%), Positives = 114/183 (62%), Gaps = 6/183 (3%)

Query: 17  GRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAYSTSKLHCEILRLFSK 73
           G  E W  PT+LA  L+       A    +EG+V+VE+NFR+YAY+TS +   +LR+F +
Sbjct: 271 GANELWLHPTRLAAVLAGGGRAGEAAVAPEEGYVIVESNFRVYAYTTSAVQVAVLRVFVR 330

Query: 74  IEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRL 133
            +  LPNL VG IT+ES  NA + GI A+Q+++FL+Q+AHPR A + P+V   V DQIRL
Sbjct: 331 CDALLPNLFVGTITRESATNALDTGIAADQVVAFLRQHAHPRAAAKTPTV---VTDQIRL 387

Query: 134 WESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMRE 193
           W  +L R++   A  Y++F S++++  A  +AR  + LL+    + +LVV +  H  M  
Sbjct: 388 WAQELKRLQEKNATLYDKFESKELYVGAVAHARQLNALLYSCEDRRQLVVESAFHGLMVG 447

Query: 194 FLR 196
            L+
Sbjct: 448 HLK 450


>gi|213402795|ref|XP_002172170.1| transcription factor TFIIH complex subunit Tfb2
           [Schizosaccharomyces japonicus yFS275]
 gi|212000217|gb|EEB05877.1| transcription factor TFIIH complex subunit Tfb2
           [Schizosaccharomyces japonicus yFS275]
          Length = 444

 Score =  146 bits (369), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 76/191 (39%), Positives = 117/191 (61%), Gaps = 8/191 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD-----SSARKEGFVVVETNFRM 55
           M++D  + GLV  ++   +   F PT+LAT+L+           S   +GF++VETN+R+
Sbjct: 243 MLEDLREYGLVYQKKSSSRR--FYPTRLATSLTTEYHSPVKGAGSDAGKGFIIVETNYRL 300

Query: 56  YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
           YAY+ S L   IL LF+ +  +  NLIVG IT++S+  A ++GITAEQII++L  +AHP+
Sbjct: 301 YAYTDSPLQIAILGLFTNLRARFSNLIVGVITRDSIRRALQSGITAEQIITYLTTHAHPQ 360

Query: 116 VADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWED 175
           +   IP +P  + DQI LWE + NR+  TP   + +F +   FE A  YA++   L+W+ 
Sbjct: 361 MRKEIPLLPPTLVDQIYLWELERNRLRATPGILFRDFLTDKDFEKAVQYAKELGVLVWDS 420

Query: 176 S-KKMRLVVNA 185
           S K+M  + NA
Sbjct: 421 SLKRMFFITNA 431


>gi|74143475|dbj|BAE28812.1| unnamed protein product [Mus musculus]
          Length = 463

 Score =  146 bits (369), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 76/200 (38%), Positives = 121/200 (60%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+     +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQVIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+ VFE    +AR+   L++E+S K 
Sbjct: 381 NPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVVFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|328766914|gb|EGF76966.1| hypothetical protein BATDEDRAFT_92242 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 471

 Score =  146 bits (368), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 73/206 (35%), Positives = 116/206 (56%), Gaps = 10/206 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSS--------ARKEGFVVVETN 52
           M+ D   LGL+   Q  +K + F PT LAT+L+   +  +        A  +GF+++ETN
Sbjct: 259 MLDDLKHLGLI--YQRKKKSTRFYPTHLATSLTSGASAGAVLSPRSLEADSDGFIIIETN 316

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ S L   +L LF ++  +  N+++G IT++S+  A   GI+AEQII++L  +A
Sbjct: 317 YRVYAYTDSPLQIAVLSLFIEMRVRFANMVIGIITRDSVREALAKGISAEQIIAYLTTHA 376

Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
           HP +    P +P  + DQ+RLWE + NR+ ++  H Y+ F     +     YA D    L
Sbjct: 377 HPEMKKGSPVLPTTIVDQVRLWEMERNRLRISRGHLYQMFSGEQEYREILKYATDLGYEL 436

Query: 173 WEDSKKMRLVVNAEIHMHMREFLRGQ 198
           W    K  +VV+AE H H++ F   Q
Sbjct: 437 WHSDSKRLVVVSAEGHEHIKVFFSKQ 462


>gi|47059175|ref|NP_997666.1| general transcription factor II H, polypeptide 4 [Rattus
           norvegicus]
 gi|46237651|emb|CAE84027.1| general transcription factor II H, polypeptide 4 [Rattus
           norvegicus]
 gi|117558335|gb|AAI27470.1| General transcription factor II H, polypeptide 4 [Rattus
           norvegicus]
          Length = 463

 Score =  146 bits (368), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 76/200 (38%), Positives = 121/200 (60%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|4504201|ref|NP_001508.1| general transcription factor IIH subunit 4 [Homo sapiens]
 gi|113865855|ref|NP_001038968.1| general transcription factor IIH subunit 4 [Pan troglodytes]
 gi|114050799|ref|NP_001040607.1| general transcription factor IIH subunit 4 [Macaca mulatta]
 gi|297677636|ref|XP_002816695.1| PREDICTED: general transcription factor IIH subunit 4 isoform 2
           [Pongo abelii]
 gi|332245940|ref|XP_003272109.1| PREDICTED: general transcription factor IIH subunit 4 [Nomascus
           leucogenys]
 gi|426352309|ref|XP_004043656.1| PREDICTED: general transcription factor IIH subunit 4 [Gorilla
           gorilla gorilla]
 gi|17380328|sp|Q92759.1|TF2H4_HUMAN RecName: Full=General transcription factor IIH subunit 4; AltName:
           Full=Basic transcription factor 2 52 kDa subunit;
           Short=BTF2 p52; AltName: Full=General transcription
           factor IIH polypeptide 4; AltName: Full=TFIIH basal
           transcription factor complex p52 subunit
 gi|38503276|sp|P60027.1|TF2H4_PANTR RecName: Full=General transcription factor IIH subunit 4; AltName:
           Full=Basic transcription factor 2 52 kDa subunit;
           Short=BTF2 p52; AltName: Full=General transcription
           factor IIH polypeptide 4; AltName: Full=TFIIH basal
           transcription factor complex p52 subunit
 gi|1514597|emb|CAA68870.1| transcription factor TFIIH [Homo sapiens]
 gi|13436278|gb|AAH04935.1| General transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|15277224|dbj|BAB63317.1| Transcription factor II H [Homo sapiens]
 gi|16740884|gb|AAH16302.1| General transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|21655317|gb|AAM64222.1| general transcription factor IIH, polypeptide 4 (52kD subunit)
           [Homo sapiens]
 gi|27544403|dbj|BAC54936.1| transcription factor II H [Homo sapiens]
 gi|30583481|gb|AAP35985.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|32127781|dbj|BAC78171.1| transcription factor II H [Pan troglodytes]
 gi|55700798|dbj|BAD69753.1| general transcription factor IIH, polypeptide 4, 52kDa [Macaca
           mulatta]
 gi|60655839|gb|AAX32483.1| general transcription factor IIH polypeptide 4 [synthetic
           construct]
 gi|60655841|gb|AAX32484.1| general transcription factor IIH polypeptide 4 [synthetic
           construct]
 gi|86197966|dbj|BAE78622.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|90960934|dbj|BAE92819.1| general transcription factor IIH, polypeptide 4 [Pan troglodytes]
 gi|90960936|dbj|BAE92820.1| general transcription factor IIH, polypeptide 4 [Pan troglodytes]
 gi|114306784|dbj|BAF31271.1| TFIIH protein [Homo sapiens]
 gi|119623752|gb|EAX03347.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|123992979|gb|ABM84091.1| general transcription factor IIH, polypeptide 4, 52kDa [synthetic
           construct]
 gi|123999907|gb|ABM87462.1| general transcription factor IIH, polypeptide 4, 52kDa [synthetic
           construct]
 gi|355561507|gb|EHH18139.1| General transcription factor IIH polypeptide 4, partial [Macaca
           mulatta]
 gi|355748409|gb|EHH52892.1| General transcription factor IIH polypeptide 4 [Macaca
           fascicularis]
 gi|380812138|gb|AFE77944.1| general transcription factor IIH subunit 4 [Macaca mulatta]
 gi|380812140|gb|AFE77945.1| general transcription factor IIH subunit 4 [Macaca mulatta]
 gi|383408563|gb|AFH27495.1| general transcription factor IIH subunit 4 [Macaca mulatta]
 gi|410214852|gb|JAA04645.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
           troglodytes]
 gi|410250640|gb|JAA13287.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
           troglodytes]
 gi|410290944|gb|JAA24072.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
           troglodytes]
 gi|410352807|gb|JAA43007.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
           troglodytes]
          Length = 462

 Score =  146 bits (368), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 76/200 (38%), Positives = 121/200 (60%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY+
Sbjct: 262 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 319

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 320 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 379

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 380 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 439

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 440 LMVVTPAGHSDVKRFWKRQK 459


>gi|395831929|ref|XP_003789035.1| PREDICTED: general transcription factor IIH subunit 4 [Otolemur
           garnettii]
          Length = 463

 Score =  146 bits (368), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 76/200 (38%), Positives = 121/200 (60%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|169784197|ref|XP_001826560.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus oryzae RIB40]
 gi|238508739|ref|XP_002385554.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus flavus NRRL3357]
 gi|83775305|dbj|BAE65427.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|220688446|gb|EED44799.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus flavus NRRL3357]
 gi|391868531|gb|EIT77745.1| RNA polymerase II transcription initiation protein [Aspergillus
           oryzae 3.042]
          Length = 480

 Score =  145 bits (367), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 87/228 (38%), Positives = 126/228 (55%), Gaps = 35/228 (15%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSM----------------SLTDSSARKEG 45
           + D AD G+V   Q     S F PT+LAT L+                 S T S+    G
Sbjct: 258 LTDLADFGIV--YQDSPDASHFYPTRLATTLTSDSSALSNPISGSLSGPSGTGSNKAGSG 315

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
           F++VETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G IT++S+  A E GITA+QII
Sbjct: 316 FIIVETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAIEMGITADQII 375

Query: 106 SFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           S+L  +AHP++            A   PSV P  V DQIRLW+ + +RV+ TP   +++F
Sbjct: 376 SYLATHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVTDQIRLWQLERDRVKATPGFLFKDF 435

Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 199
            S   +EA C YA +   L+W+ D K+M  V     H  +  FLR + 
Sbjct: 436 VSLAEYEAPCRYAEEIGVLVWKSDRKRMFFVTR---HEQVAAFLRSRK 480


>gi|62896773|dbj|BAD96327.1| general transcription factor IIH, polypeptide 4, 52kDa variant
           [Homo sapiens]
          Length = 462

 Score =  145 bits (367), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 76/200 (38%), Positives = 120/200 (60%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY+
Sbjct: 262 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 319

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 320 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 379

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE   +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 380 TPVLPPTITDQIRLWELGRDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 439

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 440 LMVVTPAGHSDVKRFWKRQK 459


>gi|170039509|ref|XP_001847575.1| TFIIH basal transcription factor complex p52 subunit [Culex
           quinquefasciatus]
 gi|167863052|gb|EDS26435.1| TFIIH basal transcription factor complex p52 subunit [Culex
           quinquefasciatus]
          Length = 492

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 77/212 (36%), Positives = 127/212 (59%), Gaps = 15/212 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLS-------MSLTDSSARKEGFVVVETNFR 54
           ++   + GLV   Q  RKE  F PT+LA N++       +    S+A+ +G++VVETN+R
Sbjct: 280 LQHLREFGLV--YQRKRKEGRFYPTRLAHNITSRNAVPTIQEDGSAAQDKGYIVVETNYR 337

Query: 55  MYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 114
           +YAY+ S L   +L LF+++ Y+ PNL+VG +T++S+  AF  GITA+QIIS+L+Q+AHP
Sbjct: 338 VYAYTDSNLQVALLGLFTELLYRFPNLVVGVLTRDSVRQAFRGGITADQIISYLEQHAHP 397

Query: 115 ------RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDR 168
                 +  +   ++P  V DQI+LWE++ NR   T    Y +F S+  F    DYA+  
Sbjct: 398 TMHNVEQTINTKSALPPTVVDQIKLWENERNRFTYTEGVVYNQFLSQGDFNTLRDYAQSI 457

Query: 169 SGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
             ++W++ +   +VV    H  +++F +  +K
Sbjct: 458 GVMIWQNERTRTMVVTKNGHDDVKKFWKRYSK 489


>gi|6754094|ref|NP_034494.1| general transcription factor IIH subunit 4 [Mus musculus]
 gi|18202124|sp|O70422.1|TF2H4_MOUSE RecName: Full=General transcription factor IIH subunit 4; AltName:
           Full=Basic transcription factor 2 52 kDa subunit;
           Short=BTF2 p52; AltName: Full=General transcription
           factor IIH polypeptide 4; AltName: Full=TFIIH basal
           transcription factor complex p52 subunit
 gi|2997755|gb|AAC08594.1| TFIIH transcription/DNA repair factor p52 subunit [Mus musculus]
 gi|15215027|gb|AAH12638.1| Gtf2h4 protein [Mus musculus]
 gi|26348003|dbj|BAC37650.1| unnamed protein product [Mus musculus]
 gi|74194835|dbj|BAE26009.1| unnamed protein product [Mus musculus]
 gi|74222199|dbj|BAE26910.1| unnamed protein product [Mus musculus]
          Length = 463

 Score =  145 bits (365), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 76/200 (38%), Positives = 121/200 (60%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 NPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|392564310|gb|EIW57488.1| transcription factor Tfb2 [Trametes versicolor FP-101664 SS1]
          Length = 477

 Score =  145 bits (365), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 74/203 (36%), Positives = 121/203 (59%), Gaps = 10/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL--------TDSSARKEGFVVVETN 52
           M+ D  D GL+K +    +   F PT+LAT L+ S         +   +  +GF+V+ETN
Sbjct: 268 MLDDLRDYGLIKQRTPTSRR--FSPTRLATTLTSSSPPLPTSAGSGDGSHAQGFIVLETN 325

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ + L   +L LF  ++ + PNL++GAIT++S+  A  +GITA+QIIS+L  +A
Sbjct: 326 YRLYAYTDNPLQIAVLNLFVTLKSRFPNLVIGAITRDSVKKALASGITADQIISYLVTHA 385

Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
           HP++    P +P  V DQIRLWE + NR++    + Y  F S+  +E   +YA+    +L
Sbjct: 386 HPQMRKNQPLLPVTVQDQIRLWELEKNRMKSQEGYLYTAFASQADYEYVLNYAKQLDVVL 445

Query: 173 WEDSKKMRLVVNAEIHMHMREFL 195
           WE++ +     + E H ++R F+
Sbjct: 446 WENASRRCFFGSVEGHANIRGFI 468


>gi|432883135|ref|XP_004074222.1| PREDICTED: general transcription factor IIH subunit 4-like [Oryzias
           latipes]
          Length = 460

 Score =  144 bits (364), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 122/206 (59%), Gaps = 11/206 (5%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK---------EGFVVVETN 52
           ++   + GLV   Q  RK   + PT+LA  L+  ++ +S             GF+VVETN
Sbjct: 253 LQHLREFGLVF--QRKRKSRRYYPTRLAITLAAGVSSNSPSNMTNTPGTGDAGFIVVETN 310

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  NGITA+QII FL+  A
Sbjct: 311 YRIYAYTNSELQIALVALFSEMLYRFPNVVVAHLTRESVQQAIANGITAQQIIHFLRTRA 370

Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
           HP +  + P++P  + DQIRLWE + +R++ T    Y +F S+  FE   D A+    L+
Sbjct: 371 HPVMLKQSPALPPTITDQIRLWELERDRLQFTEGVLYNQFLSQADFEVLRDRAQGLGCLV 430

Query: 173 WEDSKKMRLVVNAEIHMHMREFLRGQ 198
           W+D+    +VV  + H  +++F + Q
Sbjct: 431 WQDASHRAMVVTPQGHSEVKKFWKRQ 456


>gi|119179887|ref|XP_001241468.1| hypothetical protein CIMG_08631 [Coccidioides immitis RS]
 gi|392866653|gb|EAS30172.2| transcription factor Tfb2 [Coccidioides immitis RS]
          Length = 487

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 80/229 (34%), Positives = 125/229 (54%), Gaps = 34/229 (14%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTDS----------------SAR 42
           + D  D G+V       + + F PT+LAT L   SM+L+ S                 A 
Sbjct: 257 LADLTDFGIVYQHSPASESTRFYPTRLATTLTSDSMALSSSISGNLAPAGPNINAATGAP 316

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
             GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G +T++S+  A E GITA+
Sbjct: 317 GTGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKLTRQSIRRAVEMGITAD 376

Query: 103 QIISFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYY 149
           QIIS+L  +AHP++            A  + SV P  V DQIRLW+ + +R++ TP   +
Sbjct: 377 QIISYLTTHAHPQMRKVNASKSTSTTAGLLASVLPPTVVDQIRLWQLERDRLKATPGFLF 436

Query: 150 EEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
           ++F S   FEA C YA +   L+W+  K+    V    H  +  FLR +
Sbjct: 437 KDFASSAEFEAPCRYAEEIGVLVWKSEKRRMFFVTR--HEQVAAFLRSR 483


>gi|409048320|gb|EKM57798.1| hypothetical protein PHACADRAFT_171015 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 472

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 78/207 (37%), Positives = 121/207 (58%), Gaps = 14/207 (6%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSL--------TDSSARKEGFVVVETN 52
           M++D  D G++  ++   K   F PT+LAT L+ S           +SA  +GF+++ETN
Sbjct: 260 MLEDLRDYGIIWQRRATSKR--FCPTRLATTLTSSSPPLPAAGGVSASAHGQGFIILETN 317

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE----QIISFL 108
           +R+YAY+ + L   +L LF  ++ + PNL+VGAIT+ES+  A  NGITA+    QIIS+L
Sbjct: 318 YRLYAYTDNPLQIAVLNLFVTLKSRYPNLVVGAITRESVKKALTNGITADQASRQIISYL 377

Query: 109 QQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDR 168
             +AHP++    P +P  V DQIRLWE + NR++    + Y  F S+  +E   +YA+  
Sbjct: 378 TTHAHPQMRKNKPLLPVTVQDQIRLWELEKNRMKSQEGYLYTAFASQADYEYVLNYAKQL 437

Query: 169 SGLLWEDSKKMRLVVNAEIHMHMREFL 195
             +LWE S +       E H ++R F+
Sbjct: 438 GVVLWESSGRRCFFGTLEGHPNIRGFI 464


>gi|358057347|dbj|GAA96696.1| hypothetical protein E5Q_03367 [Mixia osmundae IAM 14324]
          Length = 467

 Score =  144 bits (363), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 73/202 (36%), Positives = 120/202 (59%), Gaps = 10/202 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS----LTDSSARKE-GFVVVETNFRM 55
           ++K F  LGLV + + G     + PT+LA  L+      L   +A +E GF+++ETN+R+
Sbjct: 263 ILKTFGGLGLVYVYKSGD----YSPTRLAVTLTSGAPPLLKAGTAEEEQGFLILETNYRV 318

Query: 56  YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
           YAY+ + L   +L LF  ++ + P L+VG IT+ES+     NGI ++QII+FL  +AHP+
Sbjct: 319 YAYTQNPLQIAVLNLFVALKSRFPGLVVGMITRESIKAGLANGIKSDQIIAFLTAHAHPQ 378

Query: 116 VADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW-E 174
           +  + P +P  V DQI+LWE + NRV+  P   Y++F S+  ++  CDYA+    +LW  
Sbjct: 379 MRKQEPLLPPTVVDQIKLWEREKNRVKTEPCFLYDDFRSQADYDLVCDYAKQIGAVLWLG 438

Query: 175 DSKKMRLVVNAEIHMHMREFLR 196
           +    R     + H+ +R F++
Sbjct: 439 EPGSRRFATTEDGHVQVRGFIQ 460


>gi|195999048|ref|XP_002109392.1| hypothetical protein TRIADDRAFT_20428 [Trichoplax adhaerens]
 gi|190587516|gb|EDV27558.1| hypothetical protein TRIADDRAFT_20428 [Trichoplax adhaerens]
          Length = 452

 Score =  143 bits (361), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 77/200 (38%), Positives = 116/200 (58%), Gaps = 7/200 (3%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSS---ARKEGFVVVETNFRMYAY 58
           ++   ++GLV   Q  RK+  F PT+LA NL+          ++  G+++VETN+R+YAY
Sbjct: 249 LQHMREIGLVV--QRKRKDMKFYPTQLAINLASGAKQEELDHSKSSGYIIVETNYRLYAY 306

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S L   ++ LF ++ Y+LP L VG IT+ES+  AF NGITA +II+F++ +AHP    
Sbjct: 307 TESPLDIALVALFCEMMYRLPGLCVGLITRESVQQAFTNGITANKIINFIRTHAHPEAKK 366

Query: 119 RI--PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
           ++  P VP  + DQ+ LWE +  R+  +    Y +  S   FEA   YA D   LLW   
Sbjct: 367 KVTTPIVPSTIIDQLYLWELERGRLSCSDGVLYNQILSSSDFEALRKYADDMGVLLWASP 426

Query: 177 KKMRLVVNAEIHMHMREFLR 196
            K  LVVN + H H++ F +
Sbjct: 427 AKRLLVVNRDGHNHVKHFWK 446


>gi|157113167|ref|XP_001651923.1| TFIIH basal transcription factor complex p52 subunit [Aedes
           aegypti]
 gi|108877858|gb|EAT42083.1| AAEL006356-PA [Aedes aegypti]
          Length = 487

 Score =  143 bits (361), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 77/212 (36%), Positives = 124/212 (58%), Gaps = 15/212 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLS-------MSLTDSSARKEGFVVVETNFR 54
           ++   + GLV   Q  RKE  F PT+LA N++       +    SS + +G+++VETN+R
Sbjct: 275 LQHLREFGLV--YQRKRKEGRFYPTRLAHNITSKNAVQTIQEDGSSVQDKGYIIVETNYR 332

Query: 55  MYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 114
           +YAY+ S L   +L LF+++ Y+ PNL+VG +T++S+  AF  GITAEQIIS+L+Q+AHP
Sbjct: 333 VYAYTDSNLQVALLGLFTELLYRFPNLVVGVLTRDSVRQAFRGGITAEQIISYLEQHAHP 392

Query: 115 RVADRIPSV------PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDR 168
            + +   ++      P  V DQI+LWE++ NR   T    Y +F S+  F    DYA+  
Sbjct: 393 TMLNMEQAINSKSPLPPTVVDQIKLWENERNRFTYTEGVVYNQFLSQGDFNTLRDYAQSI 452

Query: 169 SGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
             + W++ +   +VV    H  ++ F +  +K
Sbjct: 453 GVMTWQNERTRTMVVTKNGHDDVKRFWKRYSK 484


>gi|328863850|gb|EGG12949.1| hypothetical protein MELLADRAFT_46475 [Melampsora larici-populina
           98AG31]
          Length = 454

 Score =  143 bits (361), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 75/204 (36%), Positives = 121/204 (59%), Gaps = 12/204 (5%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT--------DSSARKEGFVVVETN 52
           ++ D  D GLV L Q       F PT+L T L+ +           SS+    F+++ETN
Sbjct: 247 VLNDLVDYGLVYLTQSKT----FYPTRLVTTLTSTAPPIVSNPSDQSSSDPNEFLILETN 302

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+++ L   IL LF   + + PNL++G+IT+ES+  AF+NGI A+QIIS+L+ ++
Sbjct: 303 YRIYAYTSNPLQIAILNLFISFKSRFPNLVIGSITRESIKMAFKNGIRADQIISYLEFHS 362

Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
           H ++    P +P  V DQIRLWE + NRV     + YE+F S   +E   +Y++    +L
Sbjct: 363 HSQMKLLKPILPNTVVDQIRLWELEKNRVRDQEGYLYEDFKSVSDYEIVINYSKQIDIIL 422

Query: 173 WEDSKKMRLVVNAEIHMHMREFLR 196
           WE+ +  +  V+ + H  +REF++
Sbjct: 423 WENPELKKFFVSLDGHTALREFIK 446


>gi|268534706|ref|XP_002632484.1| Hypothetical protein CBG13719 [Caenorhabditis briggsae]
          Length = 482

 Score =  143 bits (360), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 79/203 (38%), Positives = 126/203 (62%), Gaps = 9/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWF----IPTKLATNLSMSLTDSSARKEGFVVVETNFRMY 56
           ++    +LG++ +++  RK+  F    + T LATN ++  T S     G V+VETNFR+Y
Sbjct: 265 LLNHLRELGVIFIRK--RKDGVFFLTHLLTHLATNETIDDTSSEKASNGKVIVETNFRVY 322

Query: 57  AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR- 115
           AY++S L   I+ LF+++ Y+  ++ VG IT+ES+  A ++GITA QIISFL+ NAHP+ 
Sbjct: 323 AYTSSLLQLAIIALFTEMTYRFTDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQC 382

Query: 116 VADRIP--SVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
           VA   P   +P  V DQIRLWE +  R+++  A+ Y  F S D +     YA++R  LLW
Sbjct: 383 VATSGPVNCLPITVADQIRLWEDERRRMDLKDAYIYSHFESDDEYHGVVRYAQERGILLW 442

Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
            + ++  ++VN E H  +R++ +
Sbjct: 443 ANPQQKLVIVNEEGHEAVRQWYK 465


>gi|302684027|ref|XP_003031694.1| hypothetical protein SCHCODRAFT_15746 [Schizophyllum commune H4-8]
 gi|300105387|gb|EFI96791.1| hypothetical protein SCHCODRAFT_15746 [Schizophyllum commune H4-8]
          Length = 476

 Score =  143 bits (360), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 77/192 (40%), Positives = 115/192 (59%), Gaps = 14/192 (7%)

Query: 18  RKESW--FIPTKLATNLSMSLTD-------SSARKEGFVVVETNFRMYAYSTSKLHCEIL 68
           RK+++  F PT+LAT L+ S          +S  KEGF+V+ETN+R+YAY+ + L   +L
Sbjct: 278 RKKNYPRFSPTRLATTLTSSAPPLPTSSGPTSGPKEGFIVLETNYRVYAYTDNALQTAVL 337

Query: 69  RLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI-----PSV 123
            LF   +Y+ PNL+VG+IT+ES+  A  NGITA+QIIS+L  +AHP++   +     P +
Sbjct: 338 NLFITPKYRFPNLVVGSITRESVKRAMSNGITADQIISYLITHAHPQMHKNVCLVANPLL 397

Query: 124 PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVV 183
           P  V DQIRLWE + NR +      Y EF S+  +E   +YA+    +LWE+  K     
Sbjct: 398 PVTVQDQIRLWELERNRTKSQEGFLYTEFASQADYEYVLNYAKQLGVVLWENHAKRCFFA 457

Query: 184 NAEIHMHMREFL 195
             E H +++ F+
Sbjct: 458 AQEGHANIKSFI 469


>gi|19113355|ref|NP_596563.1| transcription factor TFIIH complex subunit Tfb2
           [Schizosaccharomyces pombe 972h-]
 gi|62901133|sp|P87303.2|TFB2_SCHPO RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|3080538|emb|CAA18666.1| transcription factor TFIIH complex subunit Tfb2
           [Schizosaccharomyces pombe]
          Length = 447

 Score =  143 bits (360), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 71/202 (35%), Positives = 118/202 (58%), Gaps = 8/202 (3%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK------EGFVVVETNFR 54
           M++D  + GLV  +++  K   F PT+LAT L+        ++      +GF++VETN+R
Sbjct: 243 MLEDLREYGLVYQRKITSKR--FYPTRLATGLTTDYRSLHGKQSENDDDKGFIIVETNYR 300

Query: 55  MYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 114
           +YAY++S L   I+ LF+ +  +  NL+VG IT++S+  A  NGI AEQII++L  +AHP
Sbjct: 301 LYAYTSSPLQIAIIGLFANLRARFSNLVVGVITRDSIRRALMNGIAAEQIITYLTTHAHP 360

Query: 115 RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWE 174
           ++   +P +P  + DQI LWE + NR+  TP   + +F +   F+ A +YA++   L+W+
Sbjct: 361 QMRSNVPLLPPTLVDQIYLWELEKNRLRATPGILFRDFLTDSGFDQAVEYAKELGVLVWD 420

Query: 175 DSKKMRLVVNAEIHMHMREFLR 196
            S K    +       M  +L+
Sbjct: 421 SSLKRMFFITTTGAQPMIAYLK 442


>gi|156378083|ref|XP_001630974.1| predicted protein [Nematostella vectensis]
 gi|156218005|gb|EDO38911.1| predicted protein [Nematostella vectensis]
          Length = 471

 Score =  143 bits (360), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 77/204 (37%), Positives = 122/204 (59%), Gaps = 8/204 (3%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLS-----MSLTDSSARKEGFVVVETNFRMY 56
           ++   ++GLV   Q  RK   + PTKL+ NL+     ++ TDS   + GF+++ETN+R+Y
Sbjct: 269 LQQLREIGLVF--QRKRKSRRYYPTKLSVNLTAAGKGINTTDSQI-EAGFIIIETNYRVY 325

Query: 57  AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV 116
           AY+ S L   ++ LF +I  + PNL V ++T+ES   A  +GI+AEQI++FLQ  AHP +
Sbjct: 326 AYTESCLQVSLIGLFCEILCRFPNLCVASLTRESCQQALASGISAEQILNFLQTRAHPEM 385

Query: 117 ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
             R P +P  + DQ+RLWE + +R++ T    Y +F S+  FE    YA D   L+W +S
Sbjct: 386 LKRTPIIPSTISDQVRLWEMERSRMKFTEGVLYNQFLSQADFEMLRKYAEDLGVLIWANS 445

Query: 177 KKMRLVVNAEIHMHMREFLRGQNK 200
            K  +VV+   H  ++ F + Q +
Sbjct: 446 TKRVVVVSRSGHDDVKRFWKRQRQ 469


>gi|303321033|ref|XP_003070511.1| Transcription factor tfb2 family protein [Coccidioides posadasii
           C735 delta SOWgp]
 gi|240110207|gb|EER28366.1| Transcription factor tfb2 family protein [Coccidioides posadasii
           C735 delta SOWgp]
 gi|320036051|gb|EFW17991.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Coccidioides posadasii str. Silveira]
          Length = 487

 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 77/229 (33%), Positives = 125/229 (54%), Gaps = 34/229 (14%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTD----------------SSAR 42
           + D  D G+V       + + F PT+LAT L   SM+L+                 + A 
Sbjct: 257 LADLTDFGIVYQHSPASESTRFYPTRLATTLTSDSMALSSPISGNLAPAGPNINAATGAP 316

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
             GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G +T++S+  A E GITA+
Sbjct: 317 GTGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKLTRQSIRRAVEMGITAD 376

Query: 103 QIISFLQQNAHPRV-----------ADRIPS--VPENVCDQIRLWESDLNRVEMTPAHYY 149
           QIIS+L  +AHP++              +P+  +P  V DQIRLW+ + +R++ TP   +
Sbjct: 377 QIISYLTTHAHPQMRKVNASKSTSTTAGLPASVLPPTVVDQIRLWQLERDRLKATPGFLF 436

Query: 150 EEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
           ++F S   FEA C YA +   L+W+  K+    V    H  +  FLR +
Sbjct: 437 KDFASSAEFEAPCRYAEEIGVLVWKSEKRRMFFVTR--HEQVAAFLRSR 483


>gi|444727354|gb|ELW67853.1| Valyl-tRNA synthetase, mitochondrial [Tupaia chinensis]
          Length = 1382

 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 76/189 (40%), Positives = 117/189 (61%), Gaps = 4/189 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLS--MSLTDSSARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  +S    +A + GF+VVETN+R+YAY+
Sbjct: 220 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 277

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 278 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 337

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 338 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 397

Query: 180 RLVVNAEIH 188
            +VV    H
Sbjct: 398 LMVVTPAGH 406


>gi|121714341|ref|XP_001274781.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Aspergillus clavatus NRRL 1]
 gi|119402935|gb|EAW13355.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Aspergillus clavatus NRRL 1]
          Length = 478

 Score =  142 bits (358), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 85/231 (36%), Positives = 128/231 (55%), Gaps = 41/231 (17%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA-------------------R 42
           + D AD G+V   Q   + + F PT+LAT L+   +DSSA                    
Sbjct: 256 LTDLADFGIV--YQESPEATHFYPTRLATTLT---SDSSALSNPIAGSLSGPTGTASNKA 310

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
             GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G IT++S+  A E GITA+
Sbjct: 311 GSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITAD 370

Query: 103 QIISFLQQNAHPRV----ADRIPS---------VPENVCDQIRLWESDLNRVEMTPAHYY 149
           QIIS+L  +AHP++    A R  S         +P  V DQIRLW+ + +RV+ TP   +
Sbjct: 371 QIISYLSTHAHPQMRKHNASRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVKATPGFLF 430

Query: 150 EEFPSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 199
           ++F S   +EA C YA +   L+W+ D K+M  V     H  +  FLR + 
Sbjct: 431 KDFVSLAEYEAPCRYAEEIGVLVWKSDRKRMFFVTR---HEQVAAFLRSRK 478


>gi|330918142|ref|XP_003298106.1| hypothetical protein PTT_08708 [Pyrenophora teres f. teres 0-1]
 gi|311328891|gb|EFQ93800.1| hypothetical protein PTT_08708 [Pyrenophora teres f. teres 0-1]
          Length = 482

 Score =  142 bits (358), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 113/194 (58%), Gaps = 21/194 (10%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS------MSLTD-------------SSA 41
           M+ D + +GL+   +  +    F PT+LAT L+      MS +              S+A
Sbjct: 260 MLDDLSSMGLI--YRSDKNARTFYPTRLATTLTSDSGSAMSASSNDIAQAGQGNAGPSAA 317

Query: 42  RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
             +GF+++ETN+R+YAY+ S +   IL LF+K++++ PNL+ G +TKES++ A ++GIT+
Sbjct: 318 ANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQSGITS 377

Query: 102 EQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
            QIIS+L   AHP++   +P +P  V DQIRLWE +  RVE TP +   EF S   +   
Sbjct: 378 AQIISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTPGYLMREFSSDTEYRDV 437

Query: 162 CDYARDRSGLLWED 175
             YA     L+W++
Sbjct: 438 MGYASALGVLVWQN 451


>gi|388581217|gb|EIM21527.1| transcription factor Tfb2 [Wallemia sebi CBS 633.66]
          Length = 449

 Score =  142 bits (358), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 78/207 (37%), Positives = 119/207 (57%), Gaps = 13/207 (6%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS-----------LTDSSARKEGFVVV 49
           M+ D  D GLV  ++   K   F PT+LAT L+ S              +S+  + F+++
Sbjct: 226 MLSDLRDYGLVWQRKTSSKR--FYPTRLATTLTSSAPPLLPTTESSSFTTSSDNKRFIIL 283

Query: 50  ETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQ 109
           ETN+R+YAY+++ L   IL LF  ++ + PNL++G IT++S+ +A  NGITAEQII +L 
Sbjct: 284 ETNYRLYAYTSNPLQISILNLFVTLKARYPNLVIGVITRDSIRSALSNGITAEQIIGYLT 343

Query: 110 QNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRS 169
            +AH ++    P +P  V DQIRLWE + NR++      Y EF S+  +E   +YA+   
Sbjct: 344 SHAHTQMHRNNPLLPVTVSDQIRLWELEKNRLKADDGVLYAEFRSQPDYEILLNYAKSYD 403

Query: 170 GLLWEDSKKMRLVVNAEIHMHMREFLR 196
            +LW +  K    V  E H  +REF+R
Sbjct: 404 CVLWSNDIKRMFFVTLEGHQIVREFVR 430


>gi|195126587|ref|XP_002007752.1| GI12214 [Drosophila mojavensis]
 gi|193919361|gb|EDW18228.1| GI12214 [Drosophila mojavensis]
          Length = 492

 Score =  142 bits (357), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 79/213 (37%), Positives = 127/213 (59%), Gaps = 16/213 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL--------SMSLTDSSARKEGFVVVETNF 53
           ++   + GLV   Q  RKE  F PT+LA N+        S+S+ +   ++ G++VVETN+
Sbjct: 279 LQHLREFGLVF--QRKRKEHRFYPTRLALNVTNKEAAEASISVDEERMQERGYIVVETNY 336

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY+ S+L   +L LF+++ Y+ PNL+VG +T++S+  A   GITAEQI+S+L+Q AH
Sbjct: 337 RVYAYTDSQLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAH 396

Query: 114 PR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
           P    V   I S   +P  V DQI+LWE + NR   T    Y +F S++ F    DYA+ 
Sbjct: 397 PNMKLVESAIQSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQS 456

Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
           ++ L+W++ +   +VV    H  ++ + +  +K
Sbjct: 457 QNVLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 489


>gi|427785675|gb|JAA58289.1| Putative rna polymer [Rhipicephalus pulchellus]
          Length = 457

 Score =  142 bits (357), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 71/197 (36%), Positives = 118/197 (59%), Gaps = 4/197 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR--KEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  R+   F PT+LA NL+  L +++ R  + G+++VETN+R+YAY+
Sbjct: 257 LQHLREFGLV--YQRKRRSGRFYPTRLAINLASGLKETTLRSFEAGYIMVETNYRVYAYT 314

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   +L LF ++ Y+ PNL+V  +T+ES+  A  +GIT+ QII FL+  AHP    +
Sbjct: 315 NSQLQVALLALFCELLYRFPNLVVARLTRESVRQALRSGITSNQIIKFLRMYAHPEALKQ 374

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQ+RLWE + +R        Y +F S+  F+   +YA +   L+W++  K 
Sbjct: 375 TPVIPATIMDQLRLWEMERDRFVFREGVLYSQFISQSDFQLLRNYASELGVLIWDNPSKR 434

Query: 180 RLVVNAEIHMHMREFLR 196
            +VVN   H  +++F +
Sbjct: 435 VMVVNRNGHDEVKKFWK 451


>gi|195162678|ref|XP_002022181.1| GL25529 [Drosophila persimilis]
 gi|198464237|ref|XP_001353141.2| GA20571 [Drosophila pseudoobscura pseudoobscura]
 gi|194104142|gb|EDW26185.1| GL25529 [Drosophila persimilis]
 gi|198149631|gb|EAL30642.2| GA20571 [Drosophila pseudoobscura pseudoobscura]
          Length = 494

 Score =  141 bits (356), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 81/213 (38%), Positives = 126/213 (59%), Gaps = 16/213 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLS-------MSLTDSSARKE-GFVVVETNF 53
           ++   + GLV   Q  RKE  F PT+LA N++       M+++D  A ++ G++VVETN+
Sbjct: 281 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAATTMTVSDEEAMQDSGYIVVETNY 338

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY+ S L   +L LF+++ Y+ PNL+VG +T++S+  A   GITAEQI+S+LQQ AH
Sbjct: 339 RVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLQQYAH 398

Query: 114 PR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
           P    V   I S   +P  V DQI+LWE + NR   T    Y +F S++ F    DYA+ 
Sbjct: 399 PNMKLVESAIQSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQS 458

Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
              L+W++ +   +VV    H  ++ + +  +K
Sbjct: 459 IHVLVWQNERTRTMVVQKHGHDDVKRYWKKYSK 491


>gi|348542188|ref|XP_003458568.1| PREDICTED: general transcription factor IIH subunit 4 [Oreochromis
           niloticus]
          Length = 461

 Score =  141 bits (356), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 76/207 (36%), Positives = 118/207 (57%), Gaps = 12/207 (5%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS----------SARKEGFVVVET 51
           ++   + GLV   Q  RK   + PT+LA  L+  +T S               GF+VVET
Sbjct: 253 LQHLREFGLVF--QRKRKSRRYYPTRLAITLAAGVTSSSSVSNLSSSPGTGDAGFIVVET 310

Query: 52  NFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQN 111
           N+R+YAY+ S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  NGITA+QII FL+  
Sbjct: 311 NYRIYAYTNSELQIALVALFSEMLYRFPNVVVAQLTRESVQQAIANGITAQQIIHFLRTR 370

Query: 112 AHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGL 171
           AHP +  + P +P  + DQIRLWE + +R++ T    Y +F S+  FE   D A+    L
Sbjct: 371 AHPVMLTQTPVLPPTITDQIRLWELERDRLQFTEGVLYNQFLSQADFEVLRDRAQGLGCL 430

Query: 172 LWEDSKKMRLVVNAEIHMHMREFLRGQ 198
           +W+D     +VV  + H  ++ F + Q
Sbjct: 431 VWQDVAHRVMVVTPQGHSEVKRFWKRQ 457


>gi|443688956|gb|ELT91478.1| hypothetical protein CAPTEDRAFT_18660 [Capitella teleta]
          Length = 463

 Score =  141 bits (355), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 76/198 (38%), Positives = 113/198 (57%), Gaps = 5/198 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTDSSARKEGFVVVETNFRMYAY 58
           ++   + GLV   Q  R    F PT+LA NL   S      +A ++GF+VVETN+R+YAY
Sbjct: 263 LQHLREFGLV--YQRKRSSQRFYPTRLAVNLATGSKGGDSETASEDGFIVVETNYRVYAY 320

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S L   ++ LF  + Y+ PN+ VG ++++S+  A   GITAEQII+FL+ +AHP    
Sbjct: 321 TNSSLKVALISLFCSMMYRFPNMAVGVVSRDSVREALSRGITAEQIINFLRNHAHPHTQK 380

Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
           R P +P  V DQIRLWE + +R        Y +F S+  FE   DYA+D   L +E+  +
Sbjct: 381 RKPILPPTVSDQIRLWELERDRFHFNEGVLYNQFLSQTDFEVLRDYAKDLGVLSYENIPR 440

Query: 179 MRLVVNAEIHMHMREFLR 196
             +VV    H  ++ F +
Sbjct: 441 RLMVVTRAGHDDVKRFWK 458


>gi|350413829|ref|XP_003490125.1| PREDICTED: general transcription factor IIH subunit 4-like [Bombus
           impatiens]
          Length = 467

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/204 (36%), Positives = 122/204 (59%), Gaps = 6/204 (2%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYA 57
            ++   + GLV   Q  RK   F PT+LA N++       +R   KEG++VVETN+R+YA
Sbjct: 263 FLQHLREFGLV--YQRKRKAGRFYPTRLALNIATGQNKPLSRDPEKEGYIVVETNYRVYA 320

Query: 58  YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
           Y+ S L   +L LF ++ Y+ PNL+V  +T++S+  A ++GITA QI+ +LQQ+AH ++ 
Sbjct: 321 YTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITAAQIVGYLQQHAHGKMI 380

Query: 118 DRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
           +  P V P  + DQI+LWE++ NR   +    Y +F S+  FE   D+A     L+W++ 
Sbjct: 381 EAGPPVLPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEVLRDHALSTGVLIWQNE 440

Query: 177 KKMRLVVNAEIHMHMREFLRGQNK 200
           +K  +VV    H  +++F +  +K
Sbjct: 441 RKRTIVVTKAGHDDVKKFWKRYSK 464


>gi|449540295|gb|EMD31288.1| hypothetical protein CERSUDRAFT_119839 [Ceriporiopsis subvermispora
           B]
          Length = 412

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 76/203 (37%), Positives = 119/203 (58%), Gaps = 10/203 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS--------LTDSSARKEGFVVVETN 52
           M++D  D GL  L Q       F PT+LAT L+ S           + ++ +GF+V+ETN
Sbjct: 204 MLEDLRDYGL--LWQRSATSQRFSPTRLATTLTASSNPLPTSSSASADSQSQGFIVLETN 261

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ + L   +L LF  ++ + PNL++GA+T+ES+  A  NGITA+QIIS+L  +A
Sbjct: 262 YRVYAYTDNPLQIAVLNLFVTMKSRFPNLVIGAVTRESVKKALANGITADQIISYLTAHA 321

Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
           HP++    P +P  V DQIRLWE + NRV+    + Y  F S+  +E   +YA+    ++
Sbjct: 322 HPQMRRYKPLLPVTVQDQIRLWELEKNRVKSQEGYLYTAFASQADYEYVLNYAKQLDVVI 381

Query: 173 WEDSKKMRLVVNAEIHMHMREFL 195
           WE   +     + E H ++R F+
Sbjct: 382 WESPTRRCFFGSLEGHSNIRGFI 404


>gi|67541703|ref|XP_664619.1| hypothetical protein AN7015.2 [Aspergillus nidulans FGSC A4]
 gi|40742471|gb|EAA61661.1| hypothetical protein AN7015.2 [Aspergillus nidulans FGSC A4]
 gi|259483674|tpe|CBF79257.1| TPA: TFIIH and nucleotide excision repair factor 3 complexes
           subunit (Tfb2), putative (AFU_orthologue; AFUA_4G04360)
           [Aspergillus nidulans FGSC A4]
          Length = 482

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 84/228 (36%), Positives = 128/228 (56%), Gaps = 35/228 (15%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLT-------------DSSARKEG 45
           + D AD G+V   Q   + S F PT+LAT L   S +L+             DS+    G
Sbjct: 260 LTDLADFGIV--YQDSPEASHFYPTRLATTLTSDSSALSNPISGALSDPDGGDSNQPGSG 317

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
           F+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G +T++S+  A E GITA+QII
Sbjct: 318 FIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKVTRQSIRRAIEMGITADQII 377

Query: 106 SFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           S+L  +AHP++            A   PSV P  V DQIRLW+ + +RV+ T    +++F
Sbjct: 378 SYLATHAHPQMRKHNVARSTSNQAGMPPSVLPPTVVDQIRLWQLERDRVKATAGFLFKDF 437

Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 199
            S   +EA C YA +   L+W+ D K+M  V     H  +  FLR + 
Sbjct: 438 VSLAEYEAPCRYAEEIGVLIWKSDRKRMFFVTR---HEQVAAFLRSRK 482


>gi|261187837|ref|XP_002620336.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Ajellomyces dermatitidis SLH14081]
 gi|239593453|gb|EEQ76034.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Ajellomyces dermatitidis SLH14081]
 gi|239608433|gb|EEQ85420.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Ajellomyces dermatitidis ER-3]
 gi|327356043|gb|EGE84900.1| RNA pol II transcription initiation subunit [Ajellomyces
           dermatitidis ATCC 18188]
          Length = 485

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 79/228 (34%), Positives = 123/228 (53%), Gaps = 33/228 (14%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL-----------SMSLTDSSARK------- 43
           + D  D G+V      ++ + F PT+LAT L           S SLT  S          
Sbjct: 256 LADLTDFGIVYQHPPHQEATRFYPTRLATTLTSDSITLGSSISSSLTAPSGGPAAVSNEP 315

Query: 44  -EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
             GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G +T++S+  A E GITA+
Sbjct: 316 GTGFIIIETNYRLYAYTSSPLQISLISLFTTLKYRFPNLITGKLTRQSIRRAVEMGITAD 375

Query: 103 QIISFLQQNAHPRV-----------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYE 150
           QIIS+L  +AHP++              +P+V P  V DQIRLW+ + +R++ TP   ++
Sbjct: 376 QIISYLTTHAHPQMRKYHASKSGANPAGVPAVLPPTVVDQIRLWQLERDRIKATPGFLFK 435

Query: 151 EFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
           +F S   FE  C YA +   LLW+  K+    V    H  +  +LR +
Sbjct: 436 DFVSLAEFEGPCRYAEEIGVLLWKSEKRRMFFVTR--HEQVAAYLRSR 481


>gi|383852611|ref|XP_003701820.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Megachile rotundata]
          Length = 467

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/203 (36%), Positives = 121/203 (59%), Gaps = 6/203 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAY 58
           ++   + GLV   Q  RK   F PT+LA N++       +R   KEG++VVETN+R+YAY
Sbjct: 264 LQHLREFGLV--YQRKRKAGRFYPTRLALNIATGQNKPLSRDPDKEGYIVVETNYRVYAY 321

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S L   +L LF ++ Y+ PNL+V  +T++S+  A ++GITA QI+ +LQQ+AH ++ +
Sbjct: 322 TNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITASQIVGYLQQHAHSKMIE 381

Query: 119 RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
             P V P  + DQI+LWE++ NR   +    Y +F S+  FE   D+A     L+W+  +
Sbjct: 382 AGPPVLPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEVLRDHALSTGVLIWQSER 441

Query: 178 KMRLVVNAEIHMHMREFLRGQNK 200
           K  +VV    H  +++F +  +K
Sbjct: 442 KRTMVVTKAGHDDVKKFWKRYSK 464


>gi|41053973|ref|NP_956221.1| general transcription factor IIH subunit 4 [Danio rerio]
 gi|39794736|gb|AAH64301.1| General transcription factor IIH, polypeptide 4 [Danio rerio]
          Length = 466

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 76/212 (35%), Positives = 118/212 (55%), Gaps = 16/212 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE--------------GFV 47
           ++   + GLV   Q  RK   + PT+LA  L+  +T + A                 GF+
Sbjct: 254 LQHLREFGLVF--QRKRKSRRYYPTRLAITLAAGVTANPASGSASSALGAIPGTGDTGFI 311

Query: 48  VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
           VVETN+R+YAY+ S+L   ++ LFS++ Y+ PNL+V  +T+ES+  A  NGITA+QII F
Sbjct: 312 VVETNYRIYAYTNSELQIALVALFSEMLYRFPNLVVAQVTRESVQQAISNGITAQQIIHF 371

Query: 108 LQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
           L+  AHP +  + P +P  + DQIRLWE + +R++ T    Y +F S+  FE   D A+ 
Sbjct: 372 LRTRAHPVMLKQTPVLPPTITDQIRLWELEKDRLQFTEGVLYNQFLSQADFEVLRDRAQG 431

Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
              L+W++     +VV    H  ++ F + Q 
Sbjct: 432 LGVLVWQNPAHRVMVVTPHGHSEVKRFWKRQK 463


>gi|321470926|gb|EFX81900.1| hypothetical protein DAPPUDRAFT_210868 [Daphnia pulex]
          Length = 463

 Score =  140 bits (354), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 73/206 (35%), Positives = 123/206 (59%), Gaps = 9/206 (4%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLS-------MSLTDSSARKEGFVVVETNFR 54
           ++   + GLV   Q  RK+  F PT+LA +++       ++  ++S++  G++VVETN+R
Sbjct: 256 LQHLREFGLV--YQRKRKDGRFYPTRLALDIAAGPKKSMLNSLNASSQTTGYIVVETNYR 313

Query: 55  MYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 114
           +YAY+ S L   ++ LF ++ Y+ PNL+VG IT+ES+  A + GITA+QI+SFL+Q+AH 
Sbjct: 314 VYAYTDSSLQVALIALFCELIYRFPNLVVGIITRESVREALKRGITADQIVSFLRQHAHS 373

Query: 115 RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWE 174
                 P +P  + DQI+LW  + +R        Y +F S+  FE   +YA++R  L+W 
Sbjct: 374 ECYKEPPVLPSTIADQIKLWAIERDRFLYKDGVLYNQFDSQADFEILRNYAQERGLLVWH 433

Query: 175 DSKKMRLVVNAEIHMHMREFLRGQNK 200
                ++VV  E H  +R+F +  ++
Sbjct: 434 TLNGRKMVVTKEGHDEVRKFWKRHSR 459


>gi|344252968|gb|EGW09072.1| General transcription factor IIH subunit 4 [Cricetulus griseus]
          Length = 463

 Score =  140 bits (354), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 76/200 (38%), Positives = 121/200 (60%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGSVHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|410911638|ref|XP_003969297.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Takifugu rubripes]
          Length = 465

 Score =  140 bits (354), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 76/211 (36%), Positives = 120/211 (56%), Gaps = 15/211 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSS-------------ARKEGFVV 48
           ++   + GLV   Q  RK   + PT+LA  L+   + SS             +   GF+V
Sbjct: 254 LQHLREFGLVF--QRKRKSRRYYPTRLAITLATGDSSSSLHTPTASLASTPGSGDSGFIV 311

Query: 49  VETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 108
           VETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  NGITA+QII FL
Sbjct: 312 VETNYRIYAYTNSELQIALVALFSEMLYRFPNVVVAQVTRESVQQAIANGITAQQIIHFL 371

Query: 109 QQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDR 168
           +  AHP +  + P +P  + DQIRLWE + +R++ T    Y +F S+  FE   D A+  
Sbjct: 372 RTRAHPVMLRQTPFLPPTITDQIRLWELERDRLQFTEGVLYNQFLSQTDFEVLRDRAKSL 431

Query: 169 SGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
             L+W+D+    +VV    H  +++F + Q 
Sbjct: 432 GCLVWQDAAHRVMVVTLWGHSEVKKFWKRQK 462


>gi|340710070|ref|XP_003393621.1| PREDICTED: general transcription factor IIH subunit 4-like [Bombus
           terrestris]
          Length = 467

 Score =  140 bits (354), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/204 (36%), Positives = 121/204 (59%), Gaps = 6/204 (2%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYA 57
            ++   + GLV   Q  RK   F PT+LA N++       +R   KEG++VVETN+R+YA
Sbjct: 263 FLQHLREFGLV--YQRKRKAGRFYPTRLALNIATGQNKPLSRDPEKEGYIVVETNYRVYA 320

Query: 58  YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
           Y+ S L   +L LF ++ Y+ PNL+V  +T++S+  A ++GITA QI+ +LQQ+AH ++ 
Sbjct: 321 YTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITAAQIVGYLQQHAHGKMI 380

Query: 118 DRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
           +  P V P  + DQI+LWE++ NR        Y +F S+  FE   D+A     L+W++ 
Sbjct: 381 EAGPPVLPPTIVDQIKLWENERNRFIFNEGVLYSQFLSQTDFEVLRDHALSTGVLIWQNE 440

Query: 177 KKMRLVVNAEIHMHMREFLRGQNK 200
           +K  +VV    H  +++F +  +K
Sbjct: 441 RKRTIVVTKAGHDDVKKFWKRYSK 464


>gi|307177222|gb|EFN66420.1| General transcription factor IIH subunit 4 [Camponotus floridanus]
          Length = 467

 Score =  140 bits (353), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 73/204 (35%), Positives = 121/204 (59%), Gaps = 6/204 (2%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYA 57
            ++   + GLV   Q  RK   F PT+LA N++       A+   KEG++VVETN+R+YA
Sbjct: 263 FLQHLREFGLV--YQRKRKAGRFYPTRLALNIATGENKPLAKDTDKEGYIVVETNYRVYA 320

Query: 58  YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
           Y+ S L   +L LF ++ Y+ PN++V  +T++S+  A ++GITA QI+ +LQQ+ H ++ 
Sbjct: 321 YTNSNLQVALLGLFCEMLYRFPNVVVSILTRDSIRQALKSGITASQIVGYLQQHVHSKMI 380

Query: 118 DRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
           +R P + P  + DQI+LWE++ NR   +    Y +F S+  FE   D+A     L+W+  
Sbjct: 381 ERGPPILPPTIVDQIKLWENERNRFLFSEGVLYSQFLSQTDFEVLRDHALSTGVLIWQSE 440

Query: 177 KKMRLVVNAEIHMHMREFLRGQNK 200
           +K  +VV    H  +++F +  +K
Sbjct: 441 RKRTMVVTKTGHDDVKKFWKRYSK 464


>gi|380014187|ref|XP_003691121.1| PREDICTED: general transcription factor IIH subunit 4-like [Apis
           florea]
          Length = 467

 Score =  140 bits (353), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 74/203 (36%), Positives = 121/203 (59%), Gaps = 6/203 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAY 58
           ++   + GLV   Q  RK   F PT+LA N++       +R   KEG++VVETN+R+YAY
Sbjct: 264 LQHLREFGLV--YQRKRKAGRFYPTRLALNIATGQNKPLSRDPEKEGYIVVETNYRVYAY 321

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S L   +L LF ++ Y+ PNL+V  +T++S+  A ++GITA QI+ +LQQ+AH ++ +
Sbjct: 322 TNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITATQIVGYLQQHAHSKMIE 381

Query: 119 RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
             P + P  + DQI+LWE++ NR   +    Y +F S+  FE   D+A     L+W+  +
Sbjct: 382 AGPPILPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEVLRDHALSTGVLIWQSER 441

Query: 178 KMRLVVNAEIHMHMREFLRGQNK 200
           K  +VV    H  +++F +  +K
Sbjct: 442 KRTMVVTKAGHDDVKKFWKRYSK 464


>gi|48102983|ref|XP_395476.1| PREDICTED: general transcription factor IIH subunit 4 [Apis
           mellifera]
          Length = 467

 Score =  140 bits (353), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 74/203 (36%), Positives = 121/203 (59%), Gaps = 6/203 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAY 58
           ++   + GLV   Q  RK   F PT+LA N++       +R   KEG++VVETN+R+YAY
Sbjct: 264 LQHLREFGLV--YQRKRKAGRFYPTRLALNIATGQNKPLSRDPEKEGYIVVETNYRVYAY 321

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S L   +L LF ++ Y+ PNL+V  +T++S+  A ++GITA QI+ +LQQ+AH ++ +
Sbjct: 322 TNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITATQIVGYLQQHAHSKMIE 381

Query: 119 RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
             P + P  + DQI+LWE++ NR   +    Y +F S+  FE   D+A     L+W+  +
Sbjct: 382 AGPPILPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEVLRDHALSTGVLIWQSER 441

Query: 178 KMRLVVNAEIHMHMREFLRGQNK 200
           K  +VV    H  +++F +  +K
Sbjct: 442 KRTMVVTKAGHDDVKKFWKRYSK 464


>gi|307212060|gb|EFN87943.1| General transcription factor IIH subunit 4 [Harpegnathos saltator]
          Length = 467

 Score =  140 bits (353), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 75/204 (36%), Positives = 120/204 (58%), Gaps = 6/204 (2%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYA 57
            ++   + GLV   Q  RK   F PT+LA N++   T    R   KEG++VVETN+R+YA
Sbjct: 263 FLQHLREFGLV--YQRKRKAGRFYPTRLALNIATGETKPLTRDTDKEGYIVVETNYRVYA 320

Query: 58  YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
           Y+ S L   +L LF ++ Y+ PNL+V  +T++S+  A ++GITA QI+ +LQQ+AH +  
Sbjct: 321 YTNSNLQVALLGLFCEMIYRFPNLVVSILTRDSVRQALKSGITATQIVGYLQQHAHSKTI 380

Query: 118 DRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
           +  P + P  + DQI LWE++ NR   +    Y +F S+  FE   D+A     L+W++ 
Sbjct: 381 EAGPPILPPTIVDQIMLWENERNRFLFSEGVLYSQFLSQTDFEVLRDHALTTGVLIWQNE 440

Query: 177 KKMRLVVNAEIHMHMREFLRGQNK 200
           +K  +VV    H  +++F +  +K
Sbjct: 441 RKRTIVVTKAGHDDIKKFWKRYSK 464


>gi|170120252|ref|XP_001891159.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164633568|gb|EDQ98188.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 475

 Score =  140 bits (353), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 78/208 (37%), Positives = 126/208 (60%), Gaps = 15/208 (7%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS-------LTDSSARKEGFVVVETNF 53
           M++D  D GL+  ++   K   F PT+LAT L+ S       + + S  +EGF+V+ETN+
Sbjct: 263 MLEDLVDYGLIWRRKPSSKG--FHPTRLATTLTSSSPPLPSSIGNRSGPQEGFIVLETNY 320

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY+ + L   IL LF  ++Y+ PNL+VGAIT++S+  A  NGI+A+QIIS+L  +AH
Sbjct: 321 RIYAYTDNPLQTAILNLFVSLKYRFPNLVVGAITRDSVKKALLNGISADQIISYLMSHAH 380

Query: 114 PRVADRI------PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
           P++   +      P +P  V DQIRLWE + NR++    + Y  F S+  ++   +YA++
Sbjct: 381 PQMRKNVRLRSCNPLLPVTVQDQIRLWELEKNRLKSQEGYLYTSFASQADYDFVLNYAKE 440

Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFL 195
              +LWE+  K     + + H ++R F+
Sbjct: 441 LGVVLWENPVKRCFFGSLDGHANIRGFI 468


>gi|189205068|ref|XP_001938869.1| RNA polymerase II transcription factor B subunit 2 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
 gi|187985968|gb|EDU51456.1| RNA polymerase II transcription factor B subunit 2 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
          Length = 482

 Score =  140 bits (352), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 71/194 (36%), Positives = 112/194 (57%), Gaps = 21/194 (10%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS------MSLTD-------------SSA 41
           M+ D + +GL+   +  +    F PT+LAT L+      MS +              S+ 
Sbjct: 260 MLDDLSSMGLI--YRSDKNARTFYPTRLATTLTSDSGSAMSASSNDIAQAGQGNAGPSAT 317

Query: 42  RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
             +GF+++ETN+R+YAY+ S +   IL LF+K++++ PNL+ G +TKES++ A ++GIT+
Sbjct: 318 ANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQSGITS 377

Query: 102 EQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
            QIIS+L   AHP++   +P +P  V DQIRLWE +  RVE TP +   EF S   +   
Sbjct: 378 AQIISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTPGYLMREFSSDAEYRDV 437

Query: 162 CDYARDRSGLLWED 175
             YA     L+W++
Sbjct: 438 MGYASALGVLVWQN 451


>gi|351713119|gb|EHB16038.1| General transcription factor IIH subunit 4 [Heterocephalus glaber]
          Length = 463

 Score =  140 bits (352), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 76/200 (38%), Positives = 121/200 (60%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGSVHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 321 ESELQVALIALFSEMLYRFPNVVVAQVTRESVQQAIGSGITAQQIIHFLRTRAHPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|195019521|ref|XP_001984999.1| GH16811 [Drosophila grimshawi]
 gi|193898481|gb|EDV97347.1| GH16811 [Drosophila grimshawi]
          Length = 493

 Score =  140 bits (352), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 80/213 (37%), Positives = 123/213 (57%), Gaps = 16/213 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL--------SMSLTDSSARKEGFVVVETNF 53
           ++   + GLV   Q  RKE  F PT+LA N+        S S  D   ++ G++VVETN+
Sbjct: 280 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTNKDAAQASTSADDERMQERGYIVVETNY 337

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY+ S+L   +L LF+++ Y+ PNL+VG +T++S+  A   GITAEQI+S+L+Q AH
Sbjct: 338 RVYAYTDSQLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAH 397

Query: 114 PR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
           P    V   I S   +P  V DQI+LWE + NR   T    Y +F S+  F    DYA+ 
Sbjct: 398 PNMKLVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSQTDFVTLRDYAQS 457

Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
            + L+W++ +   +VV    H  ++ + +  +K
Sbjct: 458 INMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 490


>gi|31201525|ref|XP_309710.1| AGAP010984-PA [Anopheles gambiae str. PEST]
 gi|21293295|gb|EAA05440.1| AGAP010984-PA [Anopheles gambiae str. PEST]
          Length = 501

 Score =  140 bits (352), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 76/216 (35%), Positives = 125/216 (57%), Gaps = 19/216 (8%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLA-----------TNLSMSLTDSSARKEGFVVVE 50
           ++   + GLV   Q  RKE  F PT+LA           T L+     ++ + +G+++VE
Sbjct: 285 LQHLREFGLV--YQRKRKEGRFYPTRLAHNITSKNATHATTLAQDQESNATKDKGYIIVE 342

Query: 51  TNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQ 110
           TN+R+YAY+ S L   +L LF+++ Y+ PNL+VG ++++S+  AF  GITAEQIIS+L+Q
Sbjct: 343 TNYRVYAYTDSNLQVALLGLFTELLYRFPNLVVGVLSRDSVRQAFRGGITAEQIISYLEQ 402

Query: 111 NAHP------RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDY 164
           +AHP      +  +   S+P  V DQI+LWE++ NR   T    Y +F S+  F    DY
Sbjct: 403 HAHPTMLTVEQAINSKSSLPPTVVDQIKLWENERNRFTYTEGVVYNQFLSQADFITLRDY 462

Query: 165 ARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
           A+    ++W++ +   +VV    H  +++F +  +K
Sbjct: 463 AQSIGVMIWQNERIRTMVVTKNGHDDVKKFWKRYSK 498


>gi|358370616|dbj|GAA87227.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus kawachii IFO 4308]
          Length = 478

 Score =  140 bits (352), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 86/229 (37%), Positives = 126/229 (55%), Gaps = 35/229 (15%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTD-------------SSARKE 44
           M+ D AD G+V   Q       F PT+LAT L   S +L++             SS    
Sbjct: 255 MLTDLADFGIV--YQESPDAPRFYPTRLATTLTSDSSALSNPLLGSLSGPAGEASSKAGT 312

Query: 45  GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G IT++S+  A E GITA+QI
Sbjct: 313 GFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQI 372

Query: 105 ISFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEE 151
           IS+L  +AHP++            A   PSV P  V DQIRLW+ + +RV+ T    +++
Sbjct: 373 ISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKD 432

Query: 152 FPSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 199
           F S   +EA C YA +   L W+ D K+M  V     H  +  FLR + 
Sbjct: 433 FVSLAEYEAPCRYAEEIGVLAWKSDRKRMFFVTR---HEQVAAFLRSRK 478


>gi|242794214|ref|XP_002482325.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces stipitatus ATCC 10500]
 gi|218718913|gb|EED18333.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces stipitatus ATCC 10500]
          Length = 481

 Score =  140 bits (352), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 79/215 (36%), Positives = 119/215 (55%), Gaps = 30/215 (13%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLS-----MSLTDSSARK-----------EG 45
           + D  D G+V       +   F PT+LAT L+     +S T SSA              G
Sbjct: 256 LTDLTDFGIVYQYPSAAEADRFYPTRLATTLTSDSSALSNTMSSALSAQTNAATGEPGSG 315

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
           F+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G IT++S+  A E GITA+QII
Sbjct: 316 FIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQII 375

Query: 106 SFLQQNAHP--RVADRIPS-----------VPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           S+L  +AHP  R  D + S           +P  V DQIRLW+ + +R++ TP   ++EF
Sbjct: 376 SYLSTHAHPQMRKEDAVKSTSNTAGLPRSVLPPTVVDQIRLWQLERDRIKATPGFLFKEF 435

Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAE 186
            S   +E  C YA +   L+W+ D K+M  V   E
Sbjct: 436 ASLAEYEGPCRYAEEIGVLVWKSDRKRMFFVTRYE 470


>gi|145248676|ref|XP_001400677.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus niger CBS 513.88]
 gi|134081344|emb|CAK41847.1| unnamed protein product [Aspergillus niger]
          Length = 478

 Score =  140 bits (352), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 86/229 (37%), Positives = 126/229 (55%), Gaps = 35/229 (15%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTD-------------SSARKE 44
           M+ D AD G+V   Q       F PT+LAT L   S +L++             SS    
Sbjct: 255 MLTDLADFGIV--YQESPDAPRFYPTRLATTLTSDSSALSNPLLGSLSGPAGEASSKAGT 312

Query: 45  GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G IT++S+  A E GITA+QI
Sbjct: 313 GFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQI 372

Query: 105 ISFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEE 151
           IS+L  +AHP++            A   PSV P  V DQIRLW+ + +RV+ T    +++
Sbjct: 373 ISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKD 432

Query: 152 FPSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 199
           F S   +EA C YA +   L W+ D K+M  V     H  +  FLR + 
Sbjct: 433 FVSLAEYEAPCRYAEEIGVLAWKSDRKRMFFVTR---HEQVAAFLRSRK 478


>gi|353240543|emb|CCA72408.1| probable TFB2-TFIIH subunit (transcription/repair factor)
           [Piriformospora indica DSM 11827]
          Length = 462

 Score =  140 bits (352), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 70/205 (34%), Positives = 119/205 (58%), Gaps = 12/205 (5%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS----------MSLTDSSARKEGFVVVE 50
           M+ D  D GLV         + F PT+ AT L+          +++ + +   + FVV+E
Sbjct: 254 MVSDLRDFGLVYFP--SDTTTSFQPTRFATTLTSYTSSFSDHDLTIENGADLSQEFVVLE 311

Query: 51  TNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQ 110
           TN+ +YAY+ + L   +L LF   + + PN+I+G++T++S+  A  NGITA+QI+S+L  
Sbjct: 312 TNYHVYAYTNNPLQIAVLNLFVSFKARFPNMIMGSLTRDSVKKALVNGITADQILSYLVT 371

Query: 111 NAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSG 170
           +AHP++    P +P  V DQIRLWE + +RV+    + Y+EF S + +E    YAR+   
Sbjct: 372 HAHPQMRKNNPIIPVTVQDQIRLWELERHRVKGQDGYLYKEFASMNDYEVVVQYARELGV 431

Query: 171 LLWEDSKKMRLVVNAEIHMHMREFL 195
           +LWE++ +     +A   +H+R ++
Sbjct: 432 VLWENASRRMFFADAAGRVHIRSYI 456


>gi|169610043|ref|XP_001798440.1| hypothetical protein SNOG_08115 [Phaeosphaeria nodorum SN15]
 gi|160701967|gb|EAT84391.2| hypothetical protein SNOG_08115 [Phaeosphaeria nodorum SN15]
          Length = 426

 Score =  140 bits (352), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 71/188 (37%), Positives = 110/188 (58%), Gaps = 15/188 (7%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS-------------MSLTDSSARKEGFV 47
           M+ D + +G+V   +  +    F PT+LAT L+                 DS A+ +GF+
Sbjct: 213 MLDDLSAMGIV--YRSSKDSPTFYPTRLATTLTSDSGALPGSDVGTTEKPDSKAQNKGFI 270

Query: 48  VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
           +VETN+R+YAY+ S +   IL LF+K++++ PNL+ G +TKES++ A + GIT+ QIIS+
Sbjct: 271 IVETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHRAVQAGITSAQIISY 330

Query: 108 LQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
           L   AHP++    P +P  V DQIRLWE +  RVE+T  +   EF S   +     YA+ 
Sbjct: 331 LTTYAHPQMQKSNPPLPPTVMDQIRLWEYEGERVEVTTGYLMREFGSESEYRDVMGYAQA 390

Query: 168 RSGLLWED 175
              L+W++
Sbjct: 391 LGVLVWKN 398


>gi|350639203|gb|EHA27557.1| transcription factor Tfb2 [Aspergillus niger ATCC 1015]
          Length = 478

 Score =  139 bits (351), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 86/229 (37%), Positives = 126/229 (55%), Gaps = 35/229 (15%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTD-------------SSARKE 44
           M+ D AD G+V   Q       F PT+LAT L   S +L++             SS    
Sbjct: 255 MLTDLADFGIV--YQESPDAPRFYPTRLATTLTSDSSALSNPLLGSLSGPAGEASSKAGT 312

Query: 45  GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G IT++S+  A E GITA+QI
Sbjct: 313 GFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQI 372

Query: 105 ISFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEE 151
           IS+L  +AHP++            A   PSV P  V DQIRLW+ + +RV+ T    +++
Sbjct: 373 ISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKD 432

Query: 152 FPSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQN 199
           F S   +EA C YA +   L W+ D K+M  V     H  +  FLR + 
Sbjct: 433 FVSLAEYEAPCRYAEEIGVLAWKSDRKRMFFVTR---HEQVAAFLRSRK 478


>gi|212535854|ref|XP_002148083.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces marneffei ATCC 18224]
 gi|210070482|gb|EEA24572.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces marneffei ATCC 18224]
          Length = 481

 Score =  139 bits (351), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 76/215 (35%), Positives = 119/215 (55%), Gaps = 30/215 (13%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL-------------SMSLTDSSARKE---G 45
           + D  D G++       +   F PT+LAT L             S+S   ++A  E   G
Sbjct: 256 LADLTDFGIIYQHAPASEADRFYPTRLATTLTSDSSALSNTMSSSLSAQTNAATGEPGSG 315

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
           F+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G IT++S+  A E GITA+QII
Sbjct: 316 FIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQII 375

Query: 106 SFLQQNAHPRV-----------ADRIPS--VPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           S+L  +AHP++              IP   +P  V DQIRLW+ + +R++ TP   ++EF
Sbjct: 376 SYLSTHAHPQMRKEGAVKSTSNTAGIPRSVLPPTVVDQIRLWQLERDRIKATPGFLFKEF 435

Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAE 186
            S   +E  C YA +   L+W+ D K+M  V   E
Sbjct: 436 ASLAEYEGPCRYAEEIGVLVWKSDRKRMFFVTRYE 470


>gi|255079788|ref|XP_002503474.1| predicted protein [Micromonas sp. RCC299]
 gi|226518741|gb|ACO64732.1| predicted protein [Micromonas sp. RCC299]
          Length = 458

 Score =  139 bits (351), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 77/196 (39%), Positives = 117/196 (59%), Gaps = 5/196 (2%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE--GFVVVETNFRMYAY 58
           + +D A LGL+ L     KE +++PT+L     ++  D+S   +  G ++ ETNFR+YAY
Sbjct: 263 IAEDLAHLGLLYLFAGPGKEGYYVPTQLTAGKDVADGDASLGGDPGGHIIAETNFRVYAY 322

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP--RV 116
           + S + CEILRLF++ +Y+LPNL VG +T+E+++ A + G+ AEQII +++ +AHP  R 
Sbjct: 323 TFSDVECEILRLFTRPDYRLPNLYVGMLTREAVHEALDTGVAAEQIIKYIKSHAHPNARK 382

Query: 117 ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPS-RDVFEAACDYARDRSGLLWED 175
                 +P NV DQI LW  +  RV       Y +FP+  D + AA   A D   LLWE+
Sbjct: 383 TTNGSGIPPNVADQIMLWAMERRRVRSAECVLYCDFPTGTDEYAAAVKAASDAGVLLWEN 442

Query: 176 SKKMRLVVNAEIHMHM 191
            ++M+L V    H  M
Sbjct: 443 REQMKLAVAKSGHERM 458


>gi|296818081|ref|XP_002849377.1| RNA polymerase II transcription factor B subunit 2 [Arthroderma
           otae CBS 113480]
 gi|238839830|gb|EEQ29492.1| RNA polymerase II transcription factor B subunit 2 [Arthroderma
           otae CBS 113480]
          Length = 484

 Score =  139 bits (350), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 77/226 (34%), Positives = 120/226 (53%), Gaps = 32/226 (14%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL--------------------SMSLTDSSA 41
           + D  D G+V         + F PT+LAT L                    S   T ++ 
Sbjct: 257 LADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMPGPVSGEPVVPAPSTGPTSATD 316

Query: 42  RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
              GF++VETN+R+YAY++S L   ++ LF+ ++++ PNLI G IT++S+  A E GITA
Sbjct: 317 GGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITA 376

Query: 102 EQIISFLQQNAHPRVADRIPS--------VPENVCDQIRLWESDLNRVEMTPAHYYEEFP 153
           +QI+S+L  +AHP++     S        +P  V DQIRLW+ + +R++ T    ++EF 
Sbjct: 377 DQIVSYLTTHAHPQMRKTKKSTSNVAASILPPTVVDQIRLWQLERDRIKATSGFLFKEFD 436

Query: 154 SRDVFEAACDYARDRSGLLW-EDSKKMRLVVNAEIHMHMREFLRGQ 198
           +   FEA C YA +   L+W  DS++M  V     H  +  FLR +
Sbjct: 437 TFAEFEAPCKYAEEIGVLVWRSDSRRMFFVTR---HEQVAAFLRSR 479


>gi|115388809|ref|XP_001211910.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114195994|gb|EAU37694.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 460

 Score =  139 bits (350), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 84/227 (37%), Positives = 124/227 (54%), Gaps = 33/227 (14%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL-----------SMSL-----TDSSARKEG 45
           + D AD G+V   Q     + F PT+LAT L           S SL     T S+    G
Sbjct: 238 LTDLADFGIV--YQHSPDATHFYPTRLATTLTSDASSLSNPISGSLSGPDGTSSAKAGSG 295

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
           F++VETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G IT++S+  A E GITA+QII
Sbjct: 296 FIIVETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAIEMGITADQII 355

Query: 106 SFLQQNAHPRV------------ADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           S+L  +AHP++            A   PSV P  V DQIRLW+ + +RV+ T    +++F
Sbjct: 356 SYLATHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATAGFLFKDF 415

Query: 153 PSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
            S   +E  C YA +   L+W+ ++K    V    H  +  FLR + 
Sbjct: 416 ASLAEYEDPCRYAEEIGVLVWKSNRKRMFFVTR--HEQVAAFLRSRK 460


>gi|451996931|gb|EMD89397.1| hypothetical protein COCHEDRAFT_1141584 [Cochliobolus
           heterostrophus C5]
          Length = 482

 Score =  139 bits (350), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 77/214 (35%), Positives = 123/214 (57%), Gaps = 23/214 (10%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS------MSLTDS-------------SA 41
           M++D + +GLV   +  R    F PT+LAT L+      MS + +             +A
Sbjct: 260 MLEDLSAMGLV--YRSDRNARTFYPTRLATTLTSDSGSAMSTSSNDIAQASTSTTGPPTA 317

Query: 42  RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
             +GF+++ETN+R+YAY+ S +   IL LF+K++++ PNL+ G +TKES++ A + GIT+
Sbjct: 318 ANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQAGITS 377

Query: 102 EQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
            QIIS+L   AHP++   +P +P  V DQIRLWE +  RVE T  +   EF S   +   
Sbjct: 378 AQIISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTTGYLMREFGSDAEYRDV 437

Query: 162 CDYARDRSGLLWE-DSKKMRLVVNAE-IHMHMRE 193
            +YA     L+W+ D+ +   V + E I  ++R+
Sbjct: 438 LNYASALGVLVWQNDANRCFFVSHVEQISAYLRK 471


>gi|451847848|gb|EMD61155.1| hypothetical protein COCSADRAFT_97372 [Cochliobolus sativus ND90Pr]
          Length = 482

 Score =  139 bits (350), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 77/214 (35%), Positives = 122/214 (57%), Gaps = 23/214 (10%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS------MSLTDS-------------SA 41
           M++D + +GLV   +  R    F PT+LAT L+      MS +               +A
Sbjct: 260 MLEDLSAMGLVYRSE--RNARTFYPTRLATTLTSDSGSAMSASSKDIAQASTSTTGPPTA 317

Query: 42  RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
             +GF+++ETN+R+YAY+ S +   IL LF+K++++ PNL+ G +TKES++ A + GIT+
Sbjct: 318 ANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQAGITS 377

Query: 102 EQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
            QIIS+L   AHP++   +P +P  V DQIRLWE +  RVE T  +   EF S   +   
Sbjct: 378 AQIISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTTGYLMREFGSDAEYRDV 437

Query: 162 CDYARDRSGLLWE-DSKKMRLVVNAE-IHMHMRE 193
            +YA     L+W+ D+ +   V + E I  ++R+
Sbjct: 438 LNYASALGVLVWQNDANRCFFVSHVEQISAYLRK 471


>gi|302495702|ref|XP_003009866.1| hypothetical protein ARB_03925 [Arthroderma benhamiae CBS 112371]
 gi|291173385|gb|EFE29221.1| hypothetical protein ARB_03925 [Arthroderma benhamiae CBS 112371]
          Length = 386

 Score =  139 bits (349), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 122/224 (54%), Gaps = 32/224 (14%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTD-----------------SSA 41
           + D  D G+V         + F PT+LAT L   S++++                  S  
Sbjct: 159 LADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMSGPISGEPAVPTTTAGTTTSGD 218

Query: 42  RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
              GF++VETN+R+YAY++S L   ++ LF+ ++++ PNLI G IT++S+  A E GITA
Sbjct: 219 AGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITA 278

Query: 102 EQIISFLQQNAHPRVADRIPS--------VPENVCDQIRLWESDLNRVEMTPAHYYEEFP 153
           +QIIS+L  +AHP++     S        +P  V DQIRLW+ + +R++ T    ++EF 
Sbjct: 279 DQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFD 338

Query: 154 SRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLR 196
           +   FEA C YA +   L+W+ DS++M  V     H  +  FLR
Sbjct: 339 TFAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR---HEQVAAFLR 379


>gi|302667863|ref|XP_003025510.1| hypothetical protein TRV_00272 [Trichophyton verrucosum HKI 0517]
 gi|291189624|gb|EFE44899.1| hypothetical protein TRV_00272 [Trichophyton verrucosum HKI 0517]
          Length = 386

 Score =  139 bits (349), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 122/224 (54%), Gaps = 32/224 (14%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTD-----------------SSA 41
           + D  D G+V         + F PT+LAT L   S++++                  S  
Sbjct: 159 LADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMSGPISGEPAVPTTTAGTTTSGD 218

Query: 42  RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
              GF++VETN+R+YAY++S L   ++ LF+ ++++ PNLI G IT++S+  A E GITA
Sbjct: 219 AGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITA 278

Query: 102 EQIISFLQQNAHPRVADRIPS--------VPENVCDQIRLWESDLNRVEMTPAHYYEEFP 153
           +QIIS+L  +AHP++     S        +P  V DQIRLW+ + +R++ T    ++EF 
Sbjct: 279 DQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFD 338

Query: 154 SRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLR 196
           +   FEA C YA +   L+W+ DS++M  V     H  +  FLR
Sbjct: 339 TFAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR---HEQVAAFLR 379


>gi|299750117|ref|XP_001836551.2| transcription factor TFIIH complex subunit Tfb2 [Coprinopsis
           cinerea okayama7#130]
 gi|298408751|gb|EAU85259.2| transcription factor TFIIH complex subunit Tfb2 [Coprinopsis
           cinerea okayama7#130]
          Length = 403

 Score =  139 bits (349), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 77/203 (37%), Positives = 125/203 (61%), Gaps = 11/203 (5%)

Query: 1   MIKDFADLGLV-KLQQVGRKESWFIPTKLATNLSMSLTD-------SSARKEGFVVVETN 52
           M++D  D GL+ + +Q  ++   F PT+L+T L+ S          SS  +EGF+++ETN
Sbjct: 196 MLEDLRDYGLIWQRKQTSKR---FSPTRLSTTLTSSSPPLPSTSGASSGPQEGFIILETN 252

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ + L   +L LF+ ++Y+ PNL+VG+IT+ES+  A  NGI+A+QIIS+L  +A
Sbjct: 253 YRIYAYTDNPLQTAVLSLFASLKYRFPNLVVGSITRESVKKALLNGISADQIISYLITHA 312

Query: 113 HPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
           HP +    P +P  V DQIRLWE + NR++      Y  F S+  +E    YA++   +L
Sbjct: 313 HPNMRKNNPLLPVTVQDQIRLWELEKNRLKSREGFLYTAFASQADYELVLQYAKELDVVL 372

Query: 173 WEDSKKMRLVVNAEIHMHMREFL 195
           WE++ K     + E H +++ F+
Sbjct: 373 WENASKRCFFGSLEGHANIKGFI 395


>gi|212535856|ref|XP_002148084.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces marneffei ATCC 18224]
 gi|210070483|gb|EEA24573.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces marneffei ATCC 18224]
          Length = 422

 Score =  139 bits (349), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 76/215 (35%), Positives = 119/215 (55%), Gaps = 30/215 (13%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL-------------SMSLTDSSARKE---G 45
           + D  D G++       +   F PT+LAT L             S+S   ++A  E   G
Sbjct: 197 LADLTDFGIIYQHAPASEADRFYPTRLATTLTSDSSALSNTMSSSLSAQTNAATGEPGSG 256

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
           F+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G IT++S+  A E GITA+QII
Sbjct: 257 FIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQII 316

Query: 106 SFLQQNAHPRV-----------ADRIPS--VPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           S+L  +AHP++              IP   +P  V DQIRLW+ + +R++ TP   ++EF
Sbjct: 317 SYLSTHAHPQMRKEGAVKSTSNTAGIPRSVLPPTVVDQIRLWQLERDRIKATPGFLFKEF 376

Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAE 186
            S   +E  C YA +   L+W+ D K+M  V   E
Sbjct: 377 ASLAEYEGPCRYAEEIGVLVWKSDRKRMFFVTRYE 411


>gi|91094191|ref|XP_971121.1| PREDICTED: similar to TFIIH basal transcription factor complex p52
           subunit [Tribolium castaneum]
 gi|270010854|gb|EFA07302.1| hypothetical protein TcasGA2_TC015892 [Tribolium castaneum]
          Length = 472

 Score =  139 bits (349), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 73/207 (35%), Positives = 123/207 (59%), Gaps = 9/207 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE-----GFVVVETNFRM 55
            ++   + GLV   Q  RK   F PT+LA N++ S    +   E     G+++VETN+R+
Sbjct: 266 FLQHLREFGLV--YQRKRKAGRFYPTRLALNITCSQGPETRILEEDTPKGYIIVETNYRV 323

Query: 56  YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
           YAY+ S L   ++ LF+++ Y+ PNL+VG IT++S+  A + GITA+QII +L+Q+AHP+
Sbjct: 324 YAYTDSNLQVALIGLFTELMYRFPNLVVGVITRDSIRQALKGGITADQIIGYLKQHAHPQ 383

Query: 116 V--ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
           +   +    +P  V DQI+LWE + NR+  +    Y +F S+  F    +YA+    L+W
Sbjct: 384 MLEGEAKHPLPPTVVDQIKLWEIERNRLTYSEGVLYSQFLSQADFNILKEYAQSNGHLIW 443

Query: 174 EDSKKMRLVVNAEIHMHMREFLRGQNK 200
            + +K  L++N   H  +++F +  +K
Sbjct: 444 CNKEKRTLIINKSAHDDVKKFWKRYSK 470


>gi|240279262|gb|EER42767.1| TFIIH basal transcription factor complex p52 subunit [Ajellomyces
           capsulatus H143]
          Length = 485

 Score =  139 bits (349), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 76/228 (33%), Positives = 125/228 (54%), Gaps = 33/228 (14%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSM----------------SLTDSSARKE- 44
           + D  D G+V      ++ + F PT+LAT L+                 +L  S++  E 
Sbjct: 256 LADLTDFGIVYQHSPHQEATRFYPTRLATTLTSDSITLSSSISSSLTAPNLVPSASSNEP 315

Query: 45  --GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
             GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G +T++S+  A E GITA+
Sbjct: 316 STGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITAD 375

Query: 103 QIISFLQQNAHPRVAD-----------RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYE 150
           QIIS+L  +AHP++              +P+V P  V DQIRLW+ + +R++ TP   ++
Sbjct: 376 QIISYLTTHAHPQMRKYHATKSGANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFK 435

Query: 151 EFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
           +F S   FE  C YA +   L+W+  +K    V    H  +  +LR +
Sbjct: 436 DFVSLAEFEGPCRYAEEIGVLVWKSERKRMFFVTR--HEQVAAYLRSK 481


>gi|295662078|ref|XP_002791593.1| RNA polymerase II transcription factor B subunit 2
           [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226279719|gb|EEH35285.1| RNA polymerase II transcription factor B subunit 2
           [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 484

 Score =  139 bits (349), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 79/227 (34%), Positives = 123/227 (54%), Gaps = 32/227 (14%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL-----------SMSLT--------DSSAR 42
           + D  D G+V      ++ + F PT+L T L           S SLT        +SS  
Sbjct: 256 LTDLTDFGIVYQHFPPQEATRFYPTRLVTTLTSDSATLGSSISTSLTGPSGPTAANSSEP 315

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
             GF+V+ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G IT++S+  A E GITA+
Sbjct: 316 STGFIVIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITAD 375

Query: 103 QIISFLQQNAHPRVAD----------RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEE 151
           QIIS+L  +AHP++              P+V P  V DQIRLW+ + +R++ TP   +++
Sbjct: 376 QIISYLTTHAHPQMRKYHASKSSHTVGAPAVLPPTVVDQIRLWQLERDRIKATPGFLFKD 435

Query: 152 FPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
           F S   F+  C YA +   L+W+  K+    V    H  +  +LR +
Sbjct: 436 FVSLAEFDGPCRYAEEIGVLVWKSEKRRMFFVTR--HEQVAAYLRSR 480


>gi|225682227|gb|EEH20511.1| RNA polymerase II transcription factor B subunit 2
           [Paracoccidioides brasiliensis Pb03]
 gi|226289599|gb|EEH45083.1| TFIIH basal transcription factor complex p52 subunit
           [Paracoccidioides brasiliensis Pb18]
          Length = 484

 Score =  139 bits (349), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 79/227 (34%), Positives = 123/227 (54%), Gaps = 32/227 (14%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL-----------SMSLT--------DSSAR 42
           + D  D G+V      ++ + F PT+L T L           S SLT        +SS  
Sbjct: 256 LTDLTDFGIVYQHFPPQEATRFYPTRLVTTLTSDSATLGSSISTSLTGPSGPAAANSSEP 315

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
             GF+V+ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G IT++S+  A E GITA+
Sbjct: 316 STGFIVIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITAD 375

Query: 103 QIISFLQQNAHPRVAD----------RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEE 151
           QIIS+L  +AHP++              P+V P  V DQIRLW+ + +R++ TP   +++
Sbjct: 376 QIISYLTTHAHPQMRKYHASKSSHTVGAPAVLPPTVVDQIRLWQLERDRIKATPGFLFKD 435

Query: 152 FPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
           F S   F+  C YA +   L+W+  K+    V    H  +  +LR +
Sbjct: 436 FVSLAEFDGPCRYAEEIGVLVWKSEKRRMFFVTR--HEQVAAYLRSR 480


>gi|392594388|gb|EIW83712.1| transcription factor Tfb2 [Coniophora puteana RWD-64-598 SS2]
          Length = 486

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 76/219 (34%), Positives = 122/219 (55%), Gaps = 26/219 (11%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTDSSARK-------------- 43
           M++D  D GLV  ++   K   F PT+LAT L   S SL  +   +              
Sbjct: 262 MLEDLRDYGLVWQRKASSKR--FSPTRLATTLTSVSPSLPTAGGSRNPGSAPGASSVANS 319

Query: 44  -------EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 96
                   GF+V+ETN+R+YAY+ + L   +L LF  ++Y+ PNL+VG +T+ES+  A  
Sbjct: 320 TGINTNDHGFIVLETNYRVYAYTDNPLQIAVLNLFITLKYRFPNLVVGMLTRESVRRALG 379

Query: 97  NGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRD 156
           NGI+AEQ+IS+L  +AHP++    P +P  V DQ+RLWE + NR++      Y++F ++ 
Sbjct: 380 NGISAEQVISYLTAHAHPQMRKNNPLLPVTVQDQVRLWELERNRLKSDDGFLYKDFATQA 439

Query: 157 VFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFL 195
            +E    YA++   +LWE++ K       + H ++R F+
Sbjct: 440 DYEYVLAYAKELDVVLWENTSKRCFFGTLDGHTNVRGFI 478


>gi|195378984|ref|XP_002048261.1| GJ11446 [Drosophila virilis]
 gi|194155419|gb|EDW70603.1| GJ11446 [Drosophila virilis]
          Length = 492

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 78/213 (36%), Positives = 124/213 (58%), Gaps = 16/213 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL--------SMSLTDSSARKEGFVVVETNF 53
           ++   + GLV   Q  RKE  F PT+LA N+        S S  +   ++ G++VVETN+
Sbjct: 279 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTNKDAAQASTSADEERMQERGYIVVETNY 336

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY+ S+L   +L LF+++ Y+ PNL+VG +T++S+  A   GITAEQI+S+L+Q AH
Sbjct: 337 RVYAYTDSQLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAH 396

Query: 114 PR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
           P    V   I S   +P  + DQI+LWE + NR   T    Y +F S++ F    DYA+ 
Sbjct: 397 PNMKLVESAIQSKSCLPPTIVDQIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQS 456

Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
            + L+W++ +   +VV    H  ++ + +  +K
Sbjct: 457 INMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 489


>gi|221120480|ref|XP_002159945.1| PREDICTED: general transcription factor IIH subunit 4-like [Hydra
           magnipapillata]
          Length = 458

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 69/185 (37%), Positives = 116/185 (62%), Gaps = 3/185 (1%)

Query: 15  QVGRKESWFIPTKLATNLSMSLT---DSSARKEGFVVVETNFRMYAYSTSKLHCEILRLF 71
           Q  RK   F PT+LA NL  ++T   DSS++++GF+VVETN+R+YAY+ S LH  +L LF
Sbjct: 269 QRKRKSKRFYPTRLAINLGSAVTGNSDSSSQQQGFLVVETNYRVYAYTDSVLHIALLSLF 328

Query: 72  SKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQI 131
           + I+ + P+  V  +++ES+  A   GI+AEQII FL+  AH ++    P +   + DQI
Sbjct: 329 TDIKARFPSFTVALLSRESVQQALACGISAEQIIDFLKTRAHSQMTTSSPIIASTITDQI 388

Query: 132 RLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHM 191
           +LWE + +R+  +    Y +F S+  FE    +A +++ LLW +++K  +VV+   H  +
Sbjct: 389 KLWEMERDRLRYSQGVLYNQFLSQSDFEMLRKFADEKNHLLWANNQKRLMVVSKSGHEDV 448

Query: 192 REFLR 196
           +++ +
Sbjct: 449 KKYWK 453


>gi|325089533|gb|EGC42843.1| TFIIH basal transcription factor complex p52 subunit [Ajellomyces
           capsulatus H88]
          Length = 485

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 76/228 (33%), Positives = 125/228 (54%), Gaps = 33/228 (14%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSM----------------SLTDSSARKE- 44
           + D  D G+V      ++ + F PT+LAT L+                 +L  S++  E 
Sbjct: 256 LADLTDFGIVYQHSPHQEATRFYPTRLATTLTSDSITLSSSISSSLTAPNLVPSASSNEP 315

Query: 45  --GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
             GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G +T++S+  A E GITA+
Sbjct: 316 STGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITAD 375

Query: 103 QIISFLQQNAHPRVAD-----------RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYE 150
           QIIS+L  +AHP++              +P+V P  V DQIRLW+ + +R++ TP   ++
Sbjct: 376 QIISYLTTHAHPQMRKYHATKSGANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFK 435

Query: 151 EFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
           +F S   FE  C YA +   L+W+  +K    V    H  +  +LR +
Sbjct: 436 DFVSLAEFEGPCRYAEEIGVLVWKSERKRMFFVTR--HEQVATYLRSK 481


>gi|225555033|gb|EEH03326.1| RNA pol II transcription initiation subunit [Ajellomyces capsulatus
           G186AR]
          Length = 485

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 77/228 (33%), Positives = 124/228 (54%), Gaps = 33/228 (14%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL-----------SMSLT--------DSSAR 42
           + D  D G+V      ++ + F PT+LAT L           S SLT         S+  
Sbjct: 256 LADLTDFGIVYQHSPHQEATRFYPTRLATTLTSDSITLGSSISSSLTAPNGVPSASSNEP 315

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
             GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G +T++S+  A E GITA+
Sbjct: 316 STGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITAD 375

Query: 103 QIISFLQQNAHPRVAD-----------RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYE 150
           QIIS+L  +AHP++              +P+V P  V DQIRLW+ + +R++ TP   ++
Sbjct: 376 QIISYLTTHAHPQMRKYHATKAGANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFK 435

Query: 151 EFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
           +F +   FE  C YA +   L+W+  +K    V    H  +  +LR +
Sbjct: 436 DFVNLAEFEGPCRYAEEIGVLVWKSERKRMFFVTR--HEQVAAYLRSR 481


>gi|194749905|ref|XP_001957376.1| GF10392 [Drosophila ananassae]
 gi|190624658|gb|EDV40182.1| GF10392 [Drosophila ananassae]
          Length = 497

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 77/214 (35%), Positives = 123/214 (57%), Gaps = 17/214 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLS---------MSLTDSSARKEGFVVVETN 52
           ++   + GLV   Q  RKE  F PT+LA N++         ++  +   ++ G++VVETN
Sbjct: 283 LQHLREFGLV--YQRKRKEGRFYPTRLALNVTSKEAAVAATLTTDEEGVQESGYIVVETN 340

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ S L   +L LF+++ Y+ PNL+VG +T++S+  A   GITAEQI+S+L+Q A
Sbjct: 341 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 400

Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
           HP    V   I S   +P  V DQI+LWE + NR   T    Y +F S++ F    DYA+
Sbjct: 401 HPNMRLVESAIQSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQ 460

Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
               L+W++ +   +VV    H  ++ + +  +K
Sbjct: 461 SIHVLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 494


>gi|340371925|ref|XP_003384495.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Amphimedon queenslandica]
          Length = 446

 Score =  137 bits (346), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 74/195 (37%), Positives = 111/195 (56%), Gaps = 3/195 (1%)

Query: 5   FADLGLVKLQQVGRKESWFIPTKLATNLSMSLT-DSSARKEGFVVVETNFRMYAYSTSKL 63
               GLV  ++   K   + PTKL   L+   T   +  + GF++VETN+R+ AY+ SKL
Sbjct: 251 LCQFGLVYQRKSSSKR--YYPTKLVIQLTAGETIGMNTSQAGFIIVETNYRVIAYTDSKL 308

Query: 64  HCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSV 123
           H   L LF +++Y+ PN+ VG IT+ES+  A  +GI A+QIISFL Q+AH  +  +   +
Sbjct: 309 HIATLALFCQMQYRFPNVAVGTITRESIQQALVHGIKADQIISFLTQHAHHNMLSKAHIL 368

Query: 124 PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVV 183
           P  V DQI+LWE + NR+       Y EF S   +E    YA D   LLW + ++  +V+
Sbjct: 369 PPTVTDQIKLWEIERNRLSFQEGILYSEFLSVTDYEKVKKYAEDLGVLLWSNRQRRLMVI 428

Query: 184 NAEIHMHMREFLRGQ 198
           + + H  +R F + Q
Sbjct: 429 HPDRHDEIRHFWKRQ 443


>gi|119486893|ref|XP_001262366.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Neosartorya fischeri NRRL 181]
 gi|119410523|gb|EAW20469.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Neosartorya fischeri NRRL 181]
          Length = 468

 Score =  137 bits (346), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 79/229 (34%), Positives = 123/229 (53%), Gaps = 35/229 (15%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSM----------------SLTDSSARKEG 45
           + D AD G+V   Q     + F PT+LAT L+                 + T SS    G
Sbjct: 245 LTDLADFGIV--YQETPDATHFYPTRLATTLTSDSSTLSNPLAGSLSGPTGTSSSKAGSG 302

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
           F+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G IT++S+  A E GITA+QII
Sbjct: 303 FIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQII 362

Query: 106 SFLQQNAHPRVADR-------------IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           S+L  +AHP++                +  +P  V DQIRLW+ + +RV+ T    + +F
Sbjct: 363 SYLSTHAHPQMRKHNVSRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVKATHGFLFRDF 422

Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQNK 200
            +   +EA C YA +   L+W+ D K+M  V     H  +  FL+ + K
Sbjct: 423 NTLAEYEAPCRYAEEIGVLVWKSDRKRMFFVTR---HHQVAAFLKSRRK 468


>gi|327298217|ref|XP_003233802.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           Tfb2 [Trichophyton rubrum CBS 118892]
 gi|326463980|gb|EGD89433.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           Tfb2 [Trichophyton rubrum CBS 118892]
          Length = 484

 Score =  137 bits (346), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 79/229 (34%), Positives = 122/229 (53%), Gaps = 38/229 (16%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK------------------ 43
           + D  D G+V         + F PT+LAT L+   +DS A                    
Sbjct: 257 LADLTDFGIVYQHTPASGSTRFYPTRLATTLT---SDSLAMSGPISGEPTVPTTSAGTTT 313

Query: 44  -----EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
                 GF++VETN+R+YAY++S L   ++ LF+ ++++ PNLI G IT++S+  A E G
Sbjct: 314 SGDAGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMG 373

Query: 99  ITAEQIISFLQQNAHPRVADRIPS--------VPENVCDQIRLWESDLNRVEMTPAHYYE 150
           ITA+QIIS+L  +AHP++     S        +P  V DQIRLW+ + +R++ T    ++
Sbjct: 374 ITADQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFK 433

Query: 151 EFPSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 198
           EF +   FEA C YA +   L+W+ DS++M  V     H  +  FLR +
Sbjct: 434 EFDTFAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR---HEQVAAFLRSR 479


>gi|70982185|ref|XP_746621.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2) [Aspergillus fumigatus Af293]
 gi|66844244|gb|EAL84583.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Aspergillus fumigatus Af293]
 gi|159122143|gb|EDP47265.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Aspergillus fumigatus A1163]
          Length = 479

 Score =  137 bits (346), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 79/229 (34%), Positives = 123/229 (53%), Gaps = 35/229 (15%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSM----------------SLTDSSARKEG 45
           + D AD G+V   Q     + F PT+LAT L+                 + T SS    G
Sbjct: 256 LTDLADFGIV--YQETPDATHFYPTRLATTLTSDSSTLSNPLAGSLSGPTGTSSSKAGSG 313

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
           F+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G IT++S+  A E GITA+QII
Sbjct: 314 FIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQII 373

Query: 106 SFLQQNAHPRVADR-------------IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           S+L  +AHP++                +  +P  V DQIRLW+ + +RV+ T    + +F
Sbjct: 374 SYLSTHAHPQMRKHNVSRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVKATHGFLFRDF 433

Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQNK 200
            +   +EA C YA +   L+W+ D K+M  V     H  +  FL+ + K
Sbjct: 434 NTLAEYEAPCRYAEEIGVLVWKSDRKRMFFVTR---HHQVAAFLKSRRK 479


>gi|195477962|ref|XP_002086438.1| GE22864 [Drosophila yakuba]
 gi|194186228|gb|EDW99839.1| GE22864 [Drosophila yakuba]
          Length = 241

 Score =  137 bits (345), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 78/214 (36%), Positives = 122/214 (57%), Gaps = 17/214 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL---------SMSLTDSSARKEGFVVVETN 52
           ++   + GLV   Q  RKE  F PT+LA N+         S+++ + + +  G++VVETN
Sbjct: 27  LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 84

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ S L   +L LF+++ Y+ PNL+VG +T++S+  A   GITAEQI+S+L+Q A
Sbjct: 85  YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 144

Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
           HP    V   I S   +P  V DQI+LWE + NR   T    Y +F S   F    DYA+
Sbjct: 145 HPNMRLVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSHTDFVTLRDYAQ 204

Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
               L+W++ +   +VV    H  ++ + +  +K
Sbjct: 205 SIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 238


>gi|357618296|gb|EHJ71332.1| putative TFIIH basal transcription factor complex p52 subunit
           [Danaus plexippus]
          Length = 603

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 75/212 (35%), Positives = 122/212 (57%), Gaps = 15/212 (7%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS-----SARKEGFVVVETNFRM 55
            ++   + GLV   Q  RK   F PT+LA N++  + D      +A   G+++ ETN+R+
Sbjct: 393 FLQHLREFGLV--YQRKRKAGRFYPTRLALNIT-CVKDGVAPLQTAASSGYIIAETNYRV 449

Query: 56  YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
           YAY+TS L   +L LF+++ Y+ PN++VG +T+ES+  A   GI+A+QII++L+Q++HP+
Sbjct: 450 YAYTTSALQVALLGLFTELVYRFPNVVVGVLTRESVRAALRGGISAQQIITYLEQHSHPQ 509

Query: 116 V--ADR-----IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDR 168
           +  +D+       S+P  V DQIRLWES+ NR   T    Y +F S+  F    DY R  
Sbjct: 510 MLKSDQGGIRSSSSLPPTVLDQIRLWESERNRFTYTEGVVYNQFLSQAEFNVLRDYGRSS 569

Query: 169 SGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
             L+W   +   +VV    H  ++ + +  +K
Sbjct: 570 GALVWAADRTRTMVVARAAHDDVKRYWKRYSK 601


>gi|156057729|ref|XP_001594788.1| hypothetical protein SS1G_04596 [Sclerotinia sclerotiorum 1980]
 gi|154702381|gb|EDO02120.1| hypothetical protein SS1G_04596 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 420

 Score =  136 bits (343), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 69/218 (31%), Positives = 125/218 (57%), Gaps = 25/218 (11%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------------SLTDSSARKEGFV 47
           M+ +  D GL+ L      +  F PT+LAT L+              + + S+A ++GF+
Sbjct: 205 MLGNLIDFGLIYLPPSAPTQ--FFPTRLATTLTSDASALRTVSAGFDAASKSAANQKGFI 262

Query: 48  VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
           ++ETN+R+YAY+ S L   +L LF+K+  + PN++ G ++++S+  A  +GIT++QII++
Sbjct: 263 IIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVSRDSIRTAISHGITSDQIITY 322

Query: 108 LQQNAHPRV------ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
           L  +AHP++      A   P +P  V DQIRLW+ +  R++  P   ++EF  +  +E  
Sbjct: 323 LSTHAHPQLVKASTAAHGGPVLPPTVVDQIRLWQLENERMKAVPGFLFKEFDGQKEYEGC 382

Query: 162 CDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 198
             YA +   L+W+ D+K+M  V   E    +R++++ +
Sbjct: 383 AKYAEEVGVLVWKNDAKRMFFVTRVE---QLRDYIKAR 417


>gi|403417620|emb|CCM04320.1| predicted protein [Fibroporia radiculosa]
          Length = 472

 Score =  136 bits (343), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 63/159 (39%), Positives = 100/159 (62%)

Query: 37  TDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 96
           + +    EGF+V+ETN+R+YAY+ + L   +L LF  ++ + PNL++GAIT+ES+  A  
Sbjct: 306 SGTGPHGEGFIVLETNYRIYAYTDNPLQIAVLNLFVTLKSRFPNLVIGAITRESVKKALA 365

Query: 97  NGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRD 156
           NGITA+QIIS+L  +AHP++    P +P  V DQIRLWE + NRV+    + Y  F S+ 
Sbjct: 366 NGITADQIISYLTAHAHPQMHKNKPLLPVTVQDQIRLWELERNRVKSEEGYLYTSFASQA 425

Query: 157 VFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFL 195
            +E   +YA+    +LWE+  +     + + H ++R F+
Sbjct: 426 DYEYVLNYAKQLDVVLWENPSRRSFFGSLDGHANIRGFI 464


>gi|16198089|gb|AAL13841.1| LD30622p [Drosophila melanogaster]
          Length = 350

 Score =  136 bits (343), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 78/214 (36%), Positives = 122/214 (57%), Gaps = 17/214 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL---------SMSLTDSSARKEGFVVVETN 52
           ++   + GLV   Q  RKE  F PT+LA N+         S+++ + + +  G++VVETN
Sbjct: 136 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 193

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ S L   +L LF+++ Y+ PNL+VG +T++S+  A   GITAEQI+S+L+Q A
Sbjct: 194 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 253

Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
           HP    V   I S   +P  V DQI+LWE + NR   T    Y +F S   F    DYA+
Sbjct: 254 HPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQ 313

Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
               L+W++ +   +VV    H  ++ + +  +K
Sbjct: 314 SIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 347


>gi|339521903|gb|AEJ84116.1| general transcription factor IIH subunit 4 [Capra hircus]
          Length = 463

 Score =  136 bits (342), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 74/200 (37%), Positives = 118/200 (59%), Gaps = 4/200 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLS--MSLTDSSARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  +S    +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
             +L    + LFS++ Y+ PN++V  +T+ S+  A  +GITA+QI+ FL+  A P +  +
Sbjct: 321 EPELQIARIALFSEMLYRFPNMVVPQVTRGSVQQAIASGITAQQIVHFLRTRADPVMLKQ 380

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 381 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 440

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 441 LMVVTPAGHSDVKRFWKRQK 460


>gi|198428509|ref|XP_002131018.1| PREDICTED: similar to General transcription factor II H,
           polypeptide 4 [Ciona intestinalis]
          Length = 463

 Score =  136 bits (342), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 73/203 (35%), Positives = 121/203 (59%), Gaps = 7/203 (3%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLS----MSLTDSSARKEGFVVVETNFRMYA 57
           ++ F ++GLV   Q  RK   F PT LA NL+     S+T+ +  + GF++ ETNFR+YA
Sbjct: 260 LQHFREMGLV--MQRKRKSRRFYPTYLAINLANRANASVTNVTNSR-GFILAETNFRVYA 316

Query: 58  YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
           Y+ S+L   IL LF K+ Y+ PN+ V  +T++S+ +A  NGITA+QI+ +++ NAHP + 
Sbjct: 317 YTDSELQYSILSLFCKMLYRFPNVCVLQLTRKSIQSAVVNGITADQILHYIKANAHPDML 376

Query: 118 DRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
              P +   + DQ+RLW  + +R+       Y +F ++  FE   +YA++   L+WE+S 
Sbjct: 377 KNDPIIAPTLADQVRLWAMERDRLTYRDGVLYNQFLAQKDFEVLRNYAKELGALIWENSS 436

Query: 178 KMRLVVNAEIHMHMREFLRGQNK 200
           K  +VV  + H  ++ + +   K
Sbjct: 437 KRYMVVTKDGHDQVKRYWKKYKK 459


>gi|194872905|ref|XP_001973103.1| GG13537 [Drosophila erecta]
 gi|190654886|gb|EDV52129.1| GG13537 [Drosophila erecta]
          Length = 499

 Score =  136 bits (342), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 78/214 (36%), Positives = 122/214 (57%), Gaps = 17/214 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL---------SMSLTDSSARKEGFVVVETN 52
           ++   + GLV   Q  RKE  F PT+LA N+         S+++ + + +  G++VVETN
Sbjct: 285 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 342

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ S L   +L LF+++ Y+ PNL+VG +T++S+  A   GITAEQI+S+L+Q A
Sbjct: 343 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 402

Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
           HP    V   I S   +P  V DQI+LWE + NR   T    Y +F S   F    DYA+
Sbjct: 403 HPNMRMVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSHTDFVTLRDYAQ 462

Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
               L+W++ +   +VV    H  ++ + +  +K
Sbjct: 463 SIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 496


>gi|195477975|ref|XP_002086439.1| GE22863 [Drosophila yakuba]
 gi|194186229|gb|EDW99840.1| GE22863 [Drosophila yakuba]
          Length = 499

 Score =  135 bits (341), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 78/214 (36%), Positives = 122/214 (57%), Gaps = 17/214 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL---------SMSLTDSSARKEGFVVVETN 52
           ++   + GLV   Q  RKE  F PT+LA N+         S+++ + + +  G++VVETN
Sbjct: 285 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 342

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ S L   +L LF+++ Y+ PNL+VG +T++S+  A   GITAEQI+S+L+Q A
Sbjct: 343 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 402

Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
           HP    V   I S   +P  V DQI+LWE + NR   T    Y +F S   F    DYA+
Sbjct: 403 HPNMRLVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSHTDFVTLRDYAQ 462

Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
               L+W++ +   +VV    H  ++ + +  +K
Sbjct: 463 SIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 496


>gi|195327811|ref|XP_002030611.1| GM24480 [Drosophila sechellia]
 gi|194119554|gb|EDW41597.1| GM24480 [Drosophila sechellia]
          Length = 499

 Score =  135 bits (341), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 78/214 (36%), Positives = 122/214 (57%), Gaps = 17/214 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL---------SMSLTDSSARKEGFVVVETN 52
           ++   + GLV   Q  RKE  F PT+LA N+         S+++ + + +  G++VVETN
Sbjct: 285 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 342

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ S L   +L LF+++ Y+ PNL+VG +T++S+  A   GITAEQI+S+L+Q A
Sbjct: 343 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 402

Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
           HP    V   I S   +P  V DQI+LWE + NR   T    Y +F S   F    DYA+
Sbjct: 403 HPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQ 462

Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
               L+W++ +   +VV    H  ++ + +  +K
Sbjct: 463 SIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 496


>gi|195590491|ref|XP_002084979.1| GD12553 [Drosophila simulans]
 gi|194196988|gb|EDX10564.1| GD12553 [Drosophila simulans]
          Length = 500

 Score =  135 bits (340), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 78/214 (36%), Positives = 122/214 (57%), Gaps = 17/214 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL---------SMSLTDSSARKEGFVVVETN 52
           ++   + GLV   Q  RKE  F PT+LA N+         S+++ + + +  G++VVETN
Sbjct: 286 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 343

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ S L   +L LF+++ Y+ PNL+VG +T++S+  A   GITAEQI+S+L+Q A
Sbjct: 344 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 403

Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
           HP    V   I S   +P  V DQI+LWE + NR   T    Y +F S   F    DYA+
Sbjct: 404 HPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQ 463

Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
               L+W++ +   +VV    H  ++ + +  +K
Sbjct: 464 SIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 497


>gi|425777809|gb|EKV15965.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Penicillium digitatum PHI26]
 gi|425782577|gb|EKV20476.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Penicillium digitatum Pd1]
          Length = 382

 Score =  135 bits (340), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 81/227 (35%), Positives = 125/227 (55%), Gaps = 38/227 (16%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA------------------RK 43
           + D AD G+V   Q     + F PT+LAT L+   +DSSA                    
Sbjct: 159 LADLADFGIVY--QEDADATHFYPTRLATTLT---SDSSALSNPVTGSLTGPVGPSGGSG 213

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
            GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G IT++S+  A E GITA+Q
Sbjct: 214 SGFIIIETNYRLYAYTSSPLQISLISLFTNLKYRFPNLVTGKITRQSVRRAIEMGITADQ 273

Query: 104 IISFLQQNAHPRV---------ADRIPS--VPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           IIS+L  +AHP++            IP+  +P  V DQIRLW+ + +R+  T    +++F
Sbjct: 274 IISYLLSHAHPQLRKHSAAQSNGKGIPASVLPPTVTDQIRLWQLERDRLRATAGFLFKDF 333

Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 198
            S   ++A C YA +   L+W+ D K+M  V     H  +  FLR Q
Sbjct: 334 TSLAEYQAPCQYAAEIGVLVWKSDRKRMFFVTR---HEQVAAFLRSQ 377


>gi|21357821|ref|NP_648780.1| marionette [Drosophila melanogaster]
 gi|7294263|gb|AAF49614.1| marionette [Drosophila melanogaster]
 gi|220942562|gb|ACL83824.1| Tfb2-PA [synthetic construct]
          Length = 499

 Score =  135 bits (340), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 78/214 (36%), Positives = 122/214 (57%), Gaps = 17/214 (7%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL---------SMSLTDSSARKEGFVVVETN 52
           ++   + GLV   Q  RKE  F PT+LA N+         S+++ + + +  G++VVETN
Sbjct: 285 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 342

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ S L   +L LF+++ Y+ PNL+VG +T++S+  A   GITAEQI+S+L+Q A
Sbjct: 343 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 402

Query: 113 HPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
           HP    V   I S   +P  V DQI+LWE + NR   T    Y +F S   F    DYA+
Sbjct: 403 HPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQ 462

Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
               L+W++ +   +VV    H  ++ + +  +K
Sbjct: 463 SIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSK 496


>gi|326483225|gb|EGE07235.1| transcription factor Tfb2 [Trichophyton equinum CBS 127.97]
          Length = 484

 Score =  135 bits (340), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 79/229 (34%), Positives = 121/229 (52%), Gaps = 38/229 (16%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK------------------ 43
           + D  D G+V         + F PT+LAT L+   +DS A                    
Sbjct: 257 LADLTDFGIVYQHTPASGSARFYPTRLATTLT---SDSLAMSGPISGEPAVPTTTSGTTT 313

Query: 44  -----EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
                 GF++VETN R+YAY++S L   ++ LF+ ++++ PNLI G IT++S+  A E G
Sbjct: 314 SGDAGTGFIIVETNHRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMG 373

Query: 99  ITAEQIISFLQQNAHPRVADRIPS--------VPENVCDQIRLWESDLNRVEMTPAHYYE 150
           ITA+QIIS+L  +AHP++     S        +P  V DQIRLW+ + +R++ T    ++
Sbjct: 374 ITADQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFK 433

Query: 151 EFPSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 198
           EF +   FEA C YA +   L+W+ DS++M  V     H  +  FLR +
Sbjct: 434 EFDTFAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR---HEQVAAFLRSR 479


>gi|325181847|emb|CCA16302.1| general transcription factor IIH subunit putative [Albugo laibachii
           Nc14]
          Length = 473

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 76/209 (36%), Positives = 118/209 (56%), Gaps = 15/209 (7%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFV------------- 47
           ++ DF + GL+  ++       F  T LA NL    T +S   E  V             
Sbjct: 261 LLTDFNNFGLLYRKRSNSDR--FYTTSLAINLIFGGTTTSNYSETQVARPRMSSMSDLLT 318

Query: 48  VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
           +VETNF++YAY+TS LH  +L +F  I  +LPNL +G IT+ESL +A  +GI+A+QI  F
Sbjct: 319 IVETNFKVYAYTTSTLHIAMLSVFVDIVARLPNLAIGFITRESLRSALVHGISAQQIYDF 378

Query: 108 LQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
           L ++AHPR+    P VPEN+ DQI LWE + NR++      ++ F S++ FE   +YA+ 
Sbjct: 379 LLKHAHPRMKKNNPIVPENIADQIYLWEKERNRIKFLEGILFDGFNSKEEFEQVREYAQK 438

Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
           +  L W D   ++L +++     +R F++
Sbjct: 439 QRFLTWTDPIHLKLAISSNGIESVRNFIQ 467


>gi|195441209|ref|XP_002068409.1| GK20438 [Drosophila willistoni]
 gi|194164494|gb|EDW79395.1| GK20438 [Drosophila willistoni]
          Length = 512

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 82/220 (37%), Positives = 122/220 (55%), Gaps = 23/220 (10%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLS-------------MSLTDSSARKE--GF 46
           ++   + GLV   Q  RKE  F PT+LA N++             MS ++   R +  G+
Sbjct: 292 LQHLREFGLVF--QRKRKEGRFYPTRLALNVTSKESADATTAIVTMSASEEEERMQDRGY 349

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           +VVETN+R+YAY+ S L   +L LF+++ Y+ PNL+VG +T++S+  A   GITAEQIIS
Sbjct: 350 IVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIIS 409

Query: 107 FLQQNAHPR---VADRIPS---VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEA 160
           +L+Q AHP    V   I S   +P  V DQI+LWE + NR   T    Y +F S   F  
Sbjct: 410 YLEQYAHPNMKLVESAINSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSTTDFVT 469

Query: 161 ACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
             DYA+    L+W++ K   +VV    H  ++ + +  +K
Sbjct: 470 LRDYAQSIQVLVWQNEKTRTMVVQKNGHDDVKRYWKKYSK 509


>gi|315041383|ref|XP_003170068.1| TFIIH basal transcription factor complex p52 subunit [Arthroderma
           gypseum CBS 118893]
 gi|311345102|gb|EFR04305.1| TFIIH basal transcription factor complex p52 subunit [Arthroderma
           gypseum CBS 118893]
          Length = 484

 Score =  135 bits (339), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 76/226 (33%), Positives = 118/226 (52%), Gaps = 32/226 (14%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEG---------------- 45
           + D  D G+V         + F PT+LAT L+      S    G                
Sbjct: 257 LADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMSGPISGEPTVPAAAAGATTAGD 316

Query: 46  ----FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
               F++VETN+R+YAY++S L   ++ LF+ ++++ PNLI G IT++S+  A E GITA
Sbjct: 317 AGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITA 376

Query: 102 EQIISFLQQNAHPRVADRIPS--------VPENVCDQIRLWESDLNRVEMTPAHYYEEFP 153
           +QI+S+L  +AHP++     S        +P  V DQIRLW+ + +R++ T    ++EF 
Sbjct: 377 DQIVSYLTTHAHPQMRKTKKSTTNVSSTVLPPTVVDQIRLWQLERDRIKATSGFLFKEFD 436

Query: 154 SRDVFEAACDYARDRSGLLW-EDSKKMRLVVNAEIHMHMREFLRGQ 198
           +   FEA C YA +   L+W  DS++M  V     H  +  FLR +
Sbjct: 437 TFAEFEAPCKYAEEIGVLVWRSDSRRMFFVTR---HEQVAAFLRSR 479


>gi|50557260|ref|XP_506038.1| YALI0F30173p [Yarrowia lipolytica]
 gi|62901293|sp|Q6BZX4.1|TFB2_YARLI RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|49651908|emb|CAG78851.1| YALI0F30173p [Yarrowia lipolytica CLIB122]
          Length = 467

 Score =  135 bits (339), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 70/218 (32%), Positives = 121/218 (55%), Gaps = 20/218 (9%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM------------------SLTDSSAR 42
           M++D  D G+V  ++   +   F PT+LAT L+                   +++ S A 
Sbjct: 245 MLEDLRDYGIVYQRKASSRR--FYPTRLATTLTSETAALRTASQSMEAATQDTISSSVAA 302

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
             GF+++ETNFR+YAY+ S L   +L LF  ++ +  N++ G I ++S+  A  NGITAE
Sbjct: 303 DSGFIILETNFRLYAYTESPLQIAVLNLFVNLKTRFANMVTGQINRDSVRFALSNGITAE 362

Query: 103 QIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAAC 162
           QII++L  +AHPR+      +P  V DQI+LW+ +++R+  T  + + EF + D ++   
Sbjct: 363 QIITYLSVHAHPRMKGMEHVLPPTVVDQIKLWQLEMDRIRATDGYLFSEFKNFDEYKDVS 422

Query: 163 DYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
            YA++   LL+E+  K + V        + EF++ +N+
Sbjct: 423 TYAKELGVLLYENPGKRKFVSTLAGSQQIVEFVKRRNQ 460


>gi|320167307|gb|EFW44206.1| transcription factor tfb2 [Capsaspora owczarzaki ATCC 30864]
          Length = 406

 Score =  135 bits (339), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/196 (36%), Positives = 116/196 (59%), Gaps = 6/196 (3%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAY 58
           +++   ++GLV  ++  R    F PT LA NL+     +  +A  +G++VVETNFR+YAY
Sbjct: 182 LLQHLREIGLVFRRK--RTSRRFYPTPLAINLASGSAKNLDAADVKGYIVVETNFRIYAY 239

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S L   +L LF  + Y+ PN++ G ++++S+  A   G+TAEQ+I FL+ +AHP++  
Sbjct: 240 TDSPLQLALLSLFVDLRYRFPNMVCGLLSRDSVRKALVKGLTAEQMIRFLRTHAHPQMRS 299

Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
           R P +PE + DQ+RLWE + NR+ + PA  YE F ++   +    Y RD    L   S+ 
Sbjct: 300 RTPVLPETISDQLRLWELERNRLRVLPAVLYERFSNQREHDLLHHYGRDLGVELAHSSQH 359

Query: 179 MRLVVNAEIHMHMREF 194
             +VV  E H  ++ F
Sbjct: 360 --IVVTFEGHEQIKTF 373


>gi|325181222|emb|CCA15636.1| general transcription factor IIH subunit putative [Albugo laibachii
           Nc14]
          Length = 1033

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 76/209 (36%), Positives = 118/209 (56%), Gaps = 15/209 (7%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFV------------- 47
           ++ DF + GL+  ++       F  T LA NL    T +S   E  V             
Sbjct: 308 LLTDFNNFGLLYRKRSNSDR--FYTTSLAINLIFGGTTTSNYSETQVARPRMSSMSDLLT 365

Query: 48  VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
           +VETNF++YAY+TS LH  +L +F  I  +LPNL +G IT+ESL +A  +GI+A+QI  F
Sbjct: 366 IVETNFKVYAYTTSTLHIAMLSVFVDIVARLPNLAIGFITRESLRSALVHGISAQQIYDF 425

Query: 108 LQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARD 167
           L ++AHPR+    P VPEN+ DQI LWE + NR++      ++ F S++ FE   +YA+ 
Sbjct: 426 LLKHAHPRMKKNNPIVPENIADQIYLWEKERNRIKFLEGILFDGFNSKEEFEQVREYAQK 485

Query: 168 RSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
           +  L W D   ++L +++     +R F++
Sbjct: 486 QRFLTWTDPIHLKLAISSNGIESVRNFIQ 514


>gi|346464903|gb|AEO32296.1| hypothetical protein [Amblyomma maculatum]
          Length = 436

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 108/179 (60%), Gaps = 4/179 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR--KEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  R+   F PT+LA NL+  L +++ R  + G+++VETN+R+YAY+
Sbjct: 258 LQHLREFGLV--YQRKRRSGRFYPTRLAINLASGLRETNLRSYEAGYIMVETNYRVYAYT 315

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   +L LF ++ Y+ PNL+V  +T+ES+  A  +GIT+ QII FL+  AHP    +
Sbjct: 316 NSQLQVALLALFCELLYRFPNLVVARLTRESVRQALRSGITSNQIIKFLRMYAHPEALKQ 375

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
            P +P  + DQ+RLWE + +R        Y +F S+  F+   +YA D   L+W++  K
Sbjct: 376 TPVIPATIMDQLRLWELERDRFVYREGVLYSQFISQSDFQLLRNYASDLGVLIWDNPSK 434


>gi|406862938|gb|EKD15987.1| tfiih and nucleotide excision repair factor 3 complexes subunit
           [Marssonina brunnea f. sp. 'multigermtubi' MB_m1]
          Length = 421

 Score =  134 bits (337), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 71/215 (33%), Positives = 123/215 (57%), Gaps = 24/215 (11%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD-------------SSARKEGFV 47
           M+K   D GLV       ++  F PT+L+T L+   +              S +  +GF+
Sbjct: 203 MLKYLVDFGLVYSPSSTPQQ--FFPTRLSTTLTSDASGLRSVSAGFDDALKSESGTKGFI 260

Query: 48  VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
           ++ETN+R+YAY+ S L   +L LF+K+  + PN++ G +T++S+ NA  +GIT++QIIS+
Sbjct: 261 IIETNYRLYAYTNSPLQIAVLALFTKMGVRYPNMVTGRVTRKSVANAISHGITSDQIISY 320

Query: 108 LQQNAHPRVADRI-----PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAAC 162
           L+ +AHP++         P +P  V DQIRLW+ +  R++ T    ++EF ++  +E   
Sbjct: 321 LRAHAHPQLVKAAAVNGNPVLPPTVVDQIRLWQIENERMKATAGFLFKEFETQKEYEGCA 380

Query: 163 DYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLR 196
            YA +   L+W+ D+K+M  V     H  +R++L+
Sbjct: 381 RYADENGVLVWKNDAKRMFFVTR---HEQLRDYLK 412


>gi|255936123|ref|XP_002559088.1| Pc13g06540 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211583708|emb|CAP91723.1| Pc13g06540 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 483

 Score =  134 bits (337), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 79/227 (34%), Positives = 125/227 (55%), Gaps = 38/227 (16%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA------------------RK 43
           + D AD G+V  +      + F PT+LAT L+   +DSSA                    
Sbjct: 260 LMDLADFGIVYQEHA--DATRFYPTRLATTLT---SDSSALSNPVTGSLTGPAGPSGGSG 314

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
            GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G IT++S+  A E GITA+Q
Sbjct: 315 SGFIIIETNYRLYAYTSSPLQISLISLFTNLKYRFPNLVTGKITRQSVRRAIEMGITADQ 374

Query: 104 IISFLQQNAHPRV---------ADRIPS--VPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           IIS+L  +AHP++            +P+  +P  V DQIRLW+ + +R+  T    +++F
Sbjct: 375 IISYLLSHAHPQLRKHSAAQPNGKGVPASVLPPTVTDQIRLWQLERDRLRATAGFLFKDF 434

Query: 153 PSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 198
            S   ++A C YA +   L+W+ D K+M  V     H  +  FLR Q
Sbjct: 435 TSLAEYQAPCQYAAEIGVLVWKSDRKRMFFVTR---HEQVAAFLRSQ 478


>gi|347830304|emb|CCD46001.1| similar to RNA polymerase II transcription factor B subunit 2
           [Botryotinia fuckeliana]
          Length = 471

 Score =  134 bits (337), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 68/218 (31%), Positives = 124/218 (56%), Gaps = 25/218 (11%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------------SLTDSSARKEGFV 47
           M+ +  DLGL+ L      +  F PT+LAT L+              + + S+A ++GF+
Sbjct: 256 MLANLIDLGLIYLPPSAPTQ--FFPTRLATTLTSDASALRTVAAGFDAASKSAASQKGFI 313

Query: 48  VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
           ++ETN+R+YAY+ S L   +L LF+K+  + PN++ G ++++S+  A  +GIT++QII++
Sbjct: 314 IIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVSRDSIRTAIAHGITSDQIITY 373

Query: 108 LQQNAHPRVADRI------PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
           L  +AHP++          P +P  V DQIRLW+ +  R++  P    ++F ++  +E  
Sbjct: 374 LSTHAHPQLVKASSASHGGPVLPPTVVDQIRLWQLENERMKAVPGFLMKDFETQKEYEGC 433

Query: 162 CDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 198
             YA +   L+W+ D+K+M  V   E    +R++ + +
Sbjct: 434 AKYAEEVGVLVWKSDAKRMFFVTRVE---QLRDYFKAK 468


>gi|334323685|ref|XP_003340422.1| PREDICTED: valyl-tRNA synthetase, mitochondrial-like [Monodelphis
           domestica]
          Length = 1641

 Score =  134 bits (336), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 79/231 (34%), Positives = 124/231 (53%), Gaps = 35/231 (15%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  +T +  +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVTGAGGTAHQPGFIVVETNYRLYAYT 320

Query: 60  T-------------------------------SKLHCEILRLFSKIEYQLPNLIVGAITK 88
                                           S+L   ++ LFS++ Y+ PN++V  IT+
Sbjct: 321 GEVGGMRMGIGMLEGRNCGFFGIILFFSLATESELQIALIALFSEMLYRFPNMVVAQITR 380

Query: 89  ESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHY 148
           ES+  A  NGITA+QII FL+  AHP +  +IP +P  + DQIRLWE + +R+  +    
Sbjct: 381 ESVQQAIANGITAQQIIHFLKTRAHPVMLKQIPVLPPTITDQIRLWELERDRLRFSEGVL 440

Query: 149 YEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
           Y +F S+  FE    +AR+   L++E++ K  +VV    H  ++ F + Q 
Sbjct: 441 YNQFLSQVDFELLLSHARELGVLVFENTAKRLMVVTPAGHSDVKRFWKRQK 491


>gi|345492819|ref|XP_001602702.2| PREDICTED: general transcription factor IIH subunit 4-like [Nasonia
           vitripennis]
          Length = 428

 Score =  133 bits (335), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 72/203 (35%), Positives = 120/203 (59%), Gaps = 6/203 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAY 58
           ++   + GLV   Q  RK   F PT+LA N++       +R   KE FV+VETN+R+YAY
Sbjct: 225 LQHLREFGLV--YQRKRKAGRFYPTRLALNIATGQDKPISRDLEKERFVIVETNYRVYAY 282

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S L   ++ LF ++ Y+ PNL+V  +T++S+  A ++GITA QI+ +L Q+AH ++ D
Sbjct: 283 TNSNLQVALIGLFCELLYRFPNLVVAILTRDSVRAALKSGITAVQIVGYLNQHAHNKMID 342

Query: 119 -RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
              P++P  + DQI+LWE++ NR   +    Y +F S+  FE   D+A     ++W+  +
Sbjct: 343 PGPPTLPPTIVDQIKLWENERNRFIFSEGVLYSQFHSQIDFEVLRDHAVSLGVMIWQSDR 402

Query: 178 KMRLVVNAEIHMHMREFLRGQNK 200
           K  +VV    H  +++F +  +K
Sbjct: 403 KRTMVVTKTGHDDVKKFWKRYSK 425


>gi|154308354|ref|XP_001553513.1| hypothetical protein BC1G_08237 [Botryotinia fuckeliana B05.10]
          Length = 410

 Score =  133 bits (335), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 68/218 (31%), Positives = 125/218 (57%), Gaps = 25/218 (11%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------------SLTDSSARKEGFV 47
           M+ +  DLGL+ L      +  F PT+LAT L+              + + S+A ++GF+
Sbjct: 195 MLANLIDLGLIYLPPSAPTQ--FFPTRLATTLTSDASALRTVAAGFDAASKSAASQKGFI 252

Query: 48  VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
           ++ETN+R+YAY+ S L   +L LF+K+  + PN++ G ++++S+  A  +GIT++QII++
Sbjct: 253 IIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVSRDSIRTAIAHGITSDQIITY 312

Query: 108 LQQNAHPRV------ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
           L  +AHP++      +   P +P  V DQIRLW+ +  R++  P    ++F ++  +E  
Sbjct: 313 LSTHAHPQLVKASSASHGGPVLPPTVVDQIRLWQLENERMKAVPGFLMKDFETQKEYEGC 372

Query: 162 CDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQ 198
             YA +   L+W+ D+K+M  V   E    +R++ + +
Sbjct: 373 AKYAEEVGVLVWKSDAKRMFFVTRVE---QLRDYFKAK 407


>gi|348683175|gb|EGZ22990.1| hypothetical protein PHYSODRAFT_479459 [Phytophthora sojae]
          Length = 490

 Score =  133 bits (335), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 64/153 (41%), Positives = 97/153 (63%)

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
            VVVETNF++YAY+TS LH  +L +F  I  +LPNL +G IT+ESL +A  +GI+A+QI 
Sbjct: 333 LVVVETNFKIYAYTTSTLHVAMLSVFVDIVARLPNLAIGFITRESLRSALIHGISAQQIY 392

Query: 106 SFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYA 165
            FL ++AHP++    P +PEN+ DQI LWE + NRV+      ++ F +++ +E+  DYA
Sbjct: 393 DFLMKHAHPKMRRNSPVIPENIADQIYLWERERNRVQFMEGILFDGFNTKEDYESVRDYA 452

Query: 166 RDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
           +D   L W D    RL +       +R +++ Q
Sbjct: 453 KDLKVLTWSDPIHFRLSITTAGIDDVRHYIQSQ 485


>gi|389749775|gb|EIM90946.1| transcription factor Tfb2 [Stereum hirsutum FP-91666 SS1]
          Length = 494

 Score =  132 bits (333), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 75/212 (35%), Positives = 119/212 (56%), Gaps = 19/212 (8%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD----------SSARKEGFVVVE 50
           M++D  D GL+  ++   +   F PT+LAT L+ SL             + + +GF+++E
Sbjct: 265 MLEDLRDYGLIWQRKATSRR--FSPTRLATTLTSSLPPLPTASGTGTSGTGQTQGFIILE 322

Query: 51  TNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQ 110
           TN+R+YAY+ + L   +L LF  ++ + PNL+VG+IT++S+  A  NGITA+QIIS+L  
Sbjct: 323 TNYRLYAYTDNPLQTAVLNLFVTMKSRFPNLVVGSITRDSVRKALSNGITADQIISYLTT 382

Query: 111 NAHPRVADRI-------PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACD 163
            AHP++   +       P +P  V DQIRLWE + NR++      Y  F S+  +E    
Sbjct: 383 YAHPQMRKNVSRCRSFTPLLPVTVQDQIRLWELERNRLKSQEGFLYTAFASQADYEYVLT 442

Query: 164 YARDRSGLLWEDSKKMRLVVNAEIHMHMREFL 195
           YA+    +LWE+  K       E H ++R F+
Sbjct: 443 YAKQLDVVLWENPTKRCFFGTMEGHANIRGFI 474


>gi|76154655|gb|AAX26095.2| SJCHGC05044 protein [Schistosoma japonicum]
          Length = 214

 Score =  132 bits (331), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 67/158 (42%), Positives = 96/158 (60%)

Query: 39  SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
           SS+   G++++ETNFR+YAY+ S L   +L LFSKI  + PNL+V  IT++S+  A   G
Sbjct: 52  SSSSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRG 111

Query: 99  ITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVF 158
           ITA QI+SFL  NAHP +    P +P  + DQIRLWE + +R        YE F     F
Sbjct: 112 ITANQILSFLTSNAHPDMLLSNPILPPTLTDQIRLWELERDRFVFQEGCLYEHFSRNTDF 171

Query: 159 EAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
           E   DYA++   LLWE+ ++  +VV+   H  +R+F +
Sbjct: 172 ELVRDYAKNIGVLLWENPERRLMVVSKSGHEDVRKFWK 209


>gi|332030486|gb|EGI70174.1| General transcription factor IIH subunit 4 [Acromyrmex echinatior]
          Length = 550

 Score =  132 bits (331), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 67/188 (35%), Positives = 113/188 (60%), Gaps = 6/188 (3%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAY 58
           ++   + GLV   Q  RK   F PT+LA N++   T    R   KEG+++VETN+R+YAY
Sbjct: 264 LQHLREFGLV--YQRKRKAGRFYPTRLALNIATGETKPLTRDTDKEGYIIVETNYRVYAY 321

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S L   +L LF ++ Y+ PN++V  +T++S+  A ++GITA QI+ +L+Q+AH ++ +
Sbjct: 322 TNSNLQVALLGLFCEMLYRFPNVVVSILTRDSVRQALKSGITASQIVGYLRQHAHSKMIE 381

Query: 119 RIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
             P + P  + DQI+LWE++ NR   +    Y +F S+  FE   D+A     L+W+  +
Sbjct: 382 AGPPILPPTIVDQIKLWENERNRFLFSEGVLYSQFLSQTDFEVLRDHAVSTGVLIWQSER 441

Query: 178 KMRLVVNA 185
           +  L+ + 
Sbjct: 442 RSTLLADT 449


>gi|159474228|ref|XP_001695231.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158276165|gb|EDP01939.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 435

 Score =  132 bits (331), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 64/139 (46%), Positives = 89/139 (64%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           + FV+VE+N+R+Y Y+ S +   +L LF K E  LPNL VGAI ++S+  A   GITA++
Sbjct: 297 DAFVIVESNYRVYVYTRSPVTIAVLELFVKREALLPNLFVGAIRRDSILAALARGITADE 356

Query: 104 IISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACD 163
           ++++L    HP +A R P VPE V DQIRLWE+ +NR++  PA  YE   SR ++E A  
Sbjct: 357 LVAYLSARPHPSIAARCPVVPEVVSDQIRLWEASMNRLQAHPAVLYENMESRQLYERAVA 416

Query: 164 YARDRSGLLWEDSKKMRLV 182
            AR    L WED  +MR V
Sbjct: 417 AARAAGTLQWEDGARMRFV 435


>gi|396459783|ref|XP_003834504.1| similar to RNA polymerase II transcription factor B subunit 2
           [Leptosphaeria maculans JN3]
 gi|312211053|emb|CBX91139.1| similar to RNA polymerase II transcription factor B subunit 2
           [Leptosphaeria maculans JN3]
          Length = 483

 Score =  131 bits (330), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 69/192 (35%), Positives = 108/192 (56%), Gaps = 21/192 (10%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS------MSLTDS-------------SA 41
           M+ D + +GLV   +  +    F PT+LAT L+      MS + +             + 
Sbjct: 262 MLDDLSAMGLV--YRTSKDARTFYPTRLATTLTSDSGSAMSASSNDIAQANQGNAGTLAT 319

Query: 42  RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
             +GF+++ETN+R+YAY+ S +   IL LF+K++++ PNL+ G +TKES++ A   GIT+
Sbjct: 320 ANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVSAGITS 379

Query: 102 EQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
            QIIS+L   AHP++   +  +P  V DQIRLWE +  RVE+TP    ++F S   +   
Sbjct: 380 AQIISYLSTYAHPQMQKNVSFIPPTVMDQIRLWEYEGERVEVTPGFLMKDFGSDAEYRDV 439

Query: 162 CDYARDRSGLLW 173
             YA     L+W
Sbjct: 440 LGYADALGVLVW 451


>gi|146414792|ref|XP_001483366.1| hypothetical protein PGUG_04095 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 498

 Score =  131 bits (329), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 77/243 (31%), Positives = 121/243 (49%), Gaps = 51/243 (20%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSLTDS--------SAR 42
           M+ D  D GLV   Q   +   F PT+LAT L          SM+L  +        S  
Sbjct: 251 MLADLRDYGLV--YQYSDQSDRFYPTRLATTLTSESASLKTPSMALDQAVDTSNGLDSGP 308

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
            +G +++ETNF++YAY+ S L   IL LF  +  +  N++ G IT+ES+ NA  NGIT++
Sbjct: 309 GQGSIILETNFKLYAYTNSPLEIAILNLFVNLRTRFSNMVSGQITRESIRNALYNGITSD 368

Query: 103 QIISFLQQNAHPRV-------------------------------ADRIPSVPENVCDQI 131
           QII+FL+ +AHP++                                 ++  +P  V DQI
Sbjct: 369 QIINFLETHAHPQMRALAKERLDKKVEFDASNNINTAGGGQNSSAQHKLEILPPTVVDQI 428

Query: 132 RLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHM 191
           +LW+ +L+R++    + ++EF S+  F+  C+YA D   L+W D  KMR  V  +    +
Sbjct: 429 KLWQLELDRIQAFEGYLFKEFASQQEFDVLCNYASDVGVLIWSDRTKMRFFVTKDGISQV 488

Query: 192 REF 194
            +F
Sbjct: 489 ADF 491


>gi|190347675|gb|EDK39997.2| hypothetical protein PGUG_04095 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 498

 Score =  131 bits (329), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 77/243 (31%), Positives = 121/243 (49%), Gaps = 51/243 (20%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSLTDS--------SAR 42
           M+ D  D GLV   Q   +   F PT+LAT L          SM+L  +        S  
Sbjct: 251 MLADLRDYGLV--YQYSDQSDRFYPTRLATTLTSESASLKTPSMALDQAVDTSNGLDSGP 308

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
            +G +++ETNF++YAY+ S L   IL LF  +  +  N++ G IT+ES+ NA  NGIT++
Sbjct: 309 GQGSIILETNFKLYAYTNSPLEIAILNLFVNLRTRFSNMVSGQITRESIRNALYNGITSD 368

Query: 103 QIISFLQQNAHPRV-------------------------------ADRIPSVPENVCDQI 131
           QII+FL+ +AHP++                                 ++  +P  V DQI
Sbjct: 369 QIINFLETHAHPQMRALAKERLDKKVEFDASNNINTAGGGQNSSAQHKLEILPPTVVDQI 428

Query: 132 RLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHM 191
           +LW+ +L+R++    + ++EF S+  F+  C+YA D   L+W D  KMR  V  +    +
Sbjct: 429 KLWQLELDRIQAFEGYLFKEFASQQEFDVLCNYASDVGVLIWSDRTKMRFFVTKDGISQV 488

Query: 192 REF 194
            +F
Sbjct: 489 ADF 491


>gi|353231086|emb|CCD77504.1| putative tfiih, polypeptide [Schistosoma mansoni]
          Length = 486

 Score =  130 bits (328), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 66/158 (41%), Positives = 97/158 (61%)

Query: 39  SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
           S++   G++++ETNFR+YAY+ S L   +L LFSKI  + PNL+V  IT++S+  A   G
Sbjct: 324 SNSSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRG 383

Query: 99  ITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVF 158
           ITA QI+SFL  NAHP +  + P +P  + DQIRLWE + +R        YE+F     F
Sbjct: 384 ITANQILSFLTANAHPDMLLQNPILPPTLTDQIRLWELERDRFLFQEGCLYEQFSRNTDF 443

Query: 159 EAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
           E   DYA+    LLWE+ ++  +VV+   H  +R+F +
Sbjct: 444 EMVRDYAKSIGVLLWENPERRLMVVSKAGHEDVRKFWK 481


>gi|256086358|ref|XP_002579367.1| tfiih polypeptide [Schistosoma mansoni]
          Length = 486

 Score =  130 bits (328), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 66/158 (41%), Positives = 97/158 (61%)

Query: 39  SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
           S++   G++++ETNFR+YAY+ S L   +L LFSKI  + PNL+V  IT++S+  A   G
Sbjct: 324 SNSSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRG 383

Query: 99  ITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVF 158
           ITA QI+SFL  NAHP +  + P +P  + DQIRLWE + +R        YE+F     F
Sbjct: 384 ITANQILSFLTANAHPDMLLQNPILPPTLTDQIRLWELERDRFLFQEGCLYEQFSRNTDF 443

Query: 159 EAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
           E   DYA+    LLWE+ ++  +VV+   H  +R+F +
Sbjct: 444 EMVRDYAKSIGVLLWENPERRLMVVSKAGHEDVRKFWK 481


>gi|391335225|ref|XP_003741996.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Metaseiulus occidentalis]
          Length = 457

 Score =  130 bits (328), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/203 (36%), Positives = 116/203 (57%), Gaps = 5/203 (2%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK--EGFVVVETNFRMYAY 58
            ++   + GLV   Q  RK   F PT+LA  L+  L +  A K  +G+++VETN+R+YAY
Sbjct: 254 FLQHLREFGLV--YQRKRKSGRFYPTRLAIGLASGLKELQATKDEQGYIIVETNYRVYAY 311

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S L   +L LF ++ Y+ PNL+V  +T+ES+  A + GIT+ QI  FL+  +H  V +
Sbjct: 312 TDSPLQVALLSLFCELLYRFPNLLVAVLTRESVRQALKGGITSNQITHFLKSRSHRVVLE 371

Query: 119 RIPS-VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
           R    +P  V DQ+RLWE + +R +   +  Y +F +   FE   +YARD   L +E+ +
Sbjct: 372 REEGIIPMTVTDQLRLWELERDRFKFGESVLYSQFQTTGDFEMLRNYARDIGVLKYENPQ 431

Query: 178 KMRLVVNAEIHMHMREFLRGQNK 200
           K  LVV+      +R+F +   K
Sbjct: 432 KRFLVVSKSGDAEVRQFWKRHKK 454


>gi|339237219|ref|XP_003380164.1| general transcription factor IIH subunit 4 [Trichinella spiralis]
 gi|316977049|gb|EFV60220.1| general transcription factor IIH subunit 4 [Trichinella spiralis]
          Length = 674

 Score =  130 bits (327), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 75/195 (38%), Positives = 109/195 (55%), Gaps = 2/195 (1%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
            I+   ++GLV   Q  R   WF  T L + L+   + SS+ KEGF++VETNFR+Y Y+ 
Sbjct: 251 FIQHLREIGLV--HQRKRSAGWFYYTPLISVLTGLKSSSSSSKEGFLIVETNFRVYCYTD 308

Query: 61  SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
           S L   I+  F +  Y+ PNL+   + +ES+  AF+  I+AEQII +L  NAH  +  + 
Sbjct: 309 SVLDLAIVSTFCEPLYRFPNLVACILNRESVRRAFQVNISAEQIIQYLFSNAHKNMQKQT 368

Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
           P++P  V DQI+LWE + +R +  P   Y  F S   +    DYA+D   LL E      
Sbjct: 369 PTIPSTVTDQIKLWEMERDRFKFDPGVMYSNFFSDTDYITIRDYAKDLGVLLCEHEANRA 428

Query: 181 LVVNAEIHMHMREFL 195
           LVV+A+ H    +FL
Sbjct: 429 LVVSADGHEQSNQFL 443


>gi|301098671|ref|XP_002898428.1| general transcription factor IIH subunit, putative [Phytophthora
           infestans T30-4]
 gi|262105199|gb|EEY63251.1| general transcription factor IIH subunit, putative [Phytophthora
           infestans T30-4]
          Length = 483

 Score =  130 bits (327), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 119/232 (51%), Gaps = 38/232 (16%)

Query: 1   MIKDFADLGLV--KLQQVGRKESWFIPTKLATNL-----------SMSLTDSSARKEG-- 45
           ++ DF D GL+  K Q   R    F  T LA NL            +SLT S A      
Sbjct: 251 LLGDFIDFGLLFRKRQNSDR----FYTTSLAVNLIFGGSTGQKRSHVSLTSSFAGVRAGM 306

Query: 46  -------------------FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAI 86
                               VVVETNF++YAY++S LH  +L +F  I  +LPNL +G I
Sbjct: 307 KSQVADPRQAPTVDHGARLLVVVETNFKIYAYTSSTLHVAMLSVFVDIVARLPNLAIGFI 366

Query: 87  TKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPA 146
           T+ESL +A  +GI+A+QI  FL ++AHP++    P +PEN+ DQI LWE + NRV+    
Sbjct: 367 TRESLRSALIHGISAQQIYDFLMKHAHPKMRRNSPVIPENIADQIYLWERERNRVQFLEG 426

Query: 147 HYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
             ++ F +++ +E+  DYA+    L W D    RL +       +R F++ Q
Sbjct: 427 ILFDGFNTKEDYESVRDYAKGLMVLTWSDPIHFRLSIATAGIDEVRHFIQNQ 478


>gi|407927340|gb|EKG20235.1| Transcription factor Tfb2 [Macrophomina phaseolina MS6]
          Length = 466

 Score =  130 bits (326), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 72/215 (33%), Positives = 121/215 (56%), Gaps = 20/215 (9%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL-------------SMSLTDSSAR-KEGF 46
           M+ D  D G+V   +   K + F PT+LAT L             S +L  +SA   +G+
Sbjct: 255 MLDDLNDFGIVY--RSPPKSNRFYPTRLATTLTSDAAALNNSDSFSSALNSTSASGSKGY 312

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           +++ETN+R+YAY++S L   +L LF+ +E + PNL+ G +TKES+  A   GITA+QIIS
Sbjct: 313 IIIETNYRLYAYTSSPLQIAVLALFTDLETRFPNLVSGKLTKESVQRAIGLGITADQIIS 372

Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
           +L   AHP++  + P +P  V DQIRLW+ + +R+  T    +++F   D ++    YA 
Sbjct: 373 YLDTYAHPQMQRKTPVLPPTVVDQIRLWQIEGDRMRATEGFLFKDFRDWDEYQDLKKYAD 432

Query: 167 DRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQNK 200
               L+W  D +++  +   E    + ++++ + K
Sbjct: 433 SIGVLVWSNDQRRLMFLTRVE---QISQYIKNRTK 464


>gi|281208616|gb|EFA82792.1| general transcription factor IIH [Polysphondylium pallidum PN500]
          Length = 458

 Score =  129 bits (325), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 73/207 (35%), Positives = 120/207 (57%), Gaps = 15/207 (7%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT-----------DSSARKEGFVVV 49
           ++ D    GLV ++    K   F PT+L  +L+   T            ++ +++G++++
Sbjct: 252 LLFDLRQFGLVYIR--SEKSEIFYPTRLIISLTTGKTVTVIKDLAKEMSNTQKEQGYIIL 309

Query: 50  ETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQ 109
           ETNFR+YAY+ S L   +L LF K+ Y+LPNL VG +T+ES+  AF +GITA+QI+ F++
Sbjct: 310 ETNFRIYAYTASSLQISLLSLFVKMLYRLPNLSVGILTRESIRTAFLHGITADQIVDFIK 369

Query: 110 QNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRS 169
           QN HP +       PE V +QIR+WE++ NR+    A  ++ FP+++ F     YA+D S
Sbjct: 370 QNGHPNMLK--VGAPEIVFEQIRIWENERNRILYKKAVLFDSFPTQESFNMTLQYAKDLS 427

Query: 170 GLLWEDSKKMRLVVNAEIHMHMREFLR 196
             +W    K  LVV+      ++ ++R
Sbjct: 428 FYMWASEAKKVLVVSDNGFDAIKNYIR 454


>gi|395533817|ref|XP_003768949.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
           [Sarcophilus harrisii]
          Length = 1638

 Score =  129 bits (324), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 76/234 (32%), Positives = 123/234 (52%), Gaps = 38/234 (16%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  +T +  +A + GF+VVETN+R+YAY+
Sbjct: 296 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVTGAGGTAHQPGFIVVETNYRLYAYT 353

Query: 60  T----------------------------------SKLHCEILRLFSKIEYQLPNLIVGA 85
                                              S+L   ++ LFS++ Y+ PN++V  
Sbjct: 354 GEVGGIKMRIEIRGGGIGGELNLLWSSIVFSLETESELQIALIALFSEMLYRFPNMVVAQ 413

Query: 86  ITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTP 145
           +T+ES+  A  NGITA+QII F++  AHP +  + P +P  + DQIRLWE + +R+  + 
Sbjct: 414 VTRESVQQAIANGITAQQIIHFMKTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFSE 473

Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
              Y +F S+  FE    +AR+   L++E++ K  +VV    H  ++ F + Q 
Sbjct: 474 GVLYNQFLSQVDFELLLSHARELGVLVFENTAKRLMVVTPAGHSDVKRFWKRQK 527


>gi|66800623|ref|XP_629237.1| general transcription factor IIH, polypeptide 4 [Dictyostelium
           discoideum AX4]
 gi|74850716|sp|Q54C29.1|TF2H4_DICDI RecName: Full=General transcription factor IIH subunit 4; AltName:
           Full=TFIIH basal transcription factor complex subunit 4
 gi|60462610|gb|EAL60813.1| general transcription factor IIH, polypeptide 4 [Dictyostelium
           discoideum AX4]
          Length = 483

 Score =  129 bits (324), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 70/185 (37%), Positives = 114/185 (61%), Gaps = 11/185 (5%)

Query: 23  FIPTKLATNLS----MSLTDS-------SARKEGFVVVETNFRMYAYSTSKLHCEILRLF 71
           F PT+L  +L+    +SL  S       + +++G++V+ETN+R+YAY++S L   +L LF
Sbjct: 295 FYPTRLIISLTTGKTLSLIQSISSERTQTQKEQGYIVLETNYRLYAYTSSSLQISLLSLF 354

Query: 72  SKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQI 131
            K+ Y+LPNL VG IT+ES+  A  +GITA+QII F++ N+HP  A+    +P+ V +QI
Sbjct: 355 VKMLYRLPNLAVGIITRESIRTALIHGITADQIIDFVRHNSHPNAANSGQPIPDVVAEQI 414

Query: 132 RLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHM 191
            LWE++ NR+  T +  Y  FP+ D + A   +A+++   +W       LVV  E +  +
Sbjct: 415 LLWEAERNRITYTKSVLYNSFPTNDCYIATLKFAKEQDYYIWSHDPLKTLVVKEEGNDPI 474

Query: 192 REFLR 196
           R F++
Sbjct: 475 RNFIK 479


>gi|326474968|gb|EGD98977.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           Tfb2 [Trichophyton tonsurans CBS 112818]
          Length = 489

 Score =  129 bits (324), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 109/204 (53%), Gaps = 34/204 (16%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK------------------ 43
           + D  D G+V         + F PT+LAT L+   +DS A                    
Sbjct: 257 LADLTDFGIVYQHTPASGSARFYPTRLATTLT---SDSLAMSGPISGEPAVPTTTSGTTT 313

Query: 44  -----EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
                 GF++VETN R+YAY++S L   ++ LF+ ++++ PNLI G IT++S+  A E G
Sbjct: 314 SGDAGTGFIIVETNHRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMG 373

Query: 99  ITAEQIISFLQQNAHPRVADRIPS--------VPENVCDQIRLWESDLNRVEMTPAHYYE 150
           ITA+QIIS+L  +AHP++     S        +P  V DQIRLW+ + +R++ T    ++
Sbjct: 374 ITADQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFK 433

Query: 151 EFPSRDVFEAACDYARDRSGLLWE 174
           EF +   FEA C YA +   L+W+
Sbjct: 434 EFDTFAEFEAPCKYAEEIGVLVWK 457


>gi|154273196|ref|XP_001537450.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150415962|gb|EDN11306.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 455

 Score =  128 bits (322), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 70/197 (35%), Positives = 111/197 (56%), Gaps = 31/197 (15%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL-----------SMSLT--------DSSAR 42
           + D  D G+V      ++ + F PT+LAT L           S SLT         S+  
Sbjct: 231 LADLTDFGIVYQHSPHQEATRFYPTRLATTLTSDSITLGSSISSSLTAPNGVPSATSNEP 290

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
             GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G +T++S+  A E GITA+
Sbjct: 291 STGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITAD 350

Query: 103 QIISFLQQNAHPRV---------ADRI---PSVPENVCDQIRLWESDLNRVEMTPAHYYE 150
           QIIS+L  +AHP++         A+ +     +P  V DQIRLW+ + +R++ TP   ++
Sbjct: 351 QIISYLTTHAHPQMRKYHATKPGANPVGVHTVLPPTVVDQIRLWQLERDRIKATPGFLFK 410

Query: 151 EFPSRDVFEAACDYARD 167
           +F S   FE  C YA +
Sbjct: 411 DFVSLAEFEGPCRYAEE 427


>gi|330797174|ref|XP_003286637.1| hypothetical protein DICPUDRAFT_31198 [Dictyostelium purpureum]
 gi|325083385|gb|EGC36839.1| hypothetical protein DICPUDRAFT_31198 [Dictyostelium purpureum]
          Length = 471

 Score =  128 bits (321), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 62/155 (40%), Positives = 99/155 (63%)

Query: 42  RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
           ++ G++V+ETN+R+YAY+ S L   +L LF K+ Y+LPNL VG IT+ES+  A  +GITA
Sbjct: 313 KESGYIVLETNYRLYAYTASSLQISLLSLFVKMLYRLPNLAVGIITRESIRTALIHGITA 372

Query: 102 EQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
           +QII F++ NAHP   +    +P+ V +QI LWE + NR+  T +  Y  FP+ D + A 
Sbjct: 373 DQIIDFIRHNAHPNAVNNGQPIPDVVAEQILLWEQERNRITYTKSVLYNSFPTTDCYHAT 432

Query: 162 CDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
             +A+++   +W + +   LVVN   +  +R F++
Sbjct: 433 LKFAKEQDYYIWSNDQLKTLVVNENGNDPIRNFIK 467


>gi|50310215|ref|XP_455127.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|62901296|sp|Q6CLR2.1|TFB2_KLULA RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|49644263|emb|CAG97834.1| KLLA0F01056p [Kluyveromyces lactis]
          Length = 496

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 77/244 (31%), Positives = 122/244 (50%), Gaps = 50/244 (20%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLAT-----------------------NLSMSLT 37
           M++D  D GL+   Q    +  F PT+LAT                       N + ++ 
Sbjct: 250 MLQDMRDYGLIYQNQSDYAK--FYPTRLATLLTSDTKAFRSASVALDSVLNKANETTAVE 307

Query: 38  DSSARKE-------GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKES 90
             S + E       G +++ETNF++Y+YS S L   IL LF  ++ +  N++ G +T+ES
Sbjct: 308 GDSGQDETTERTQDGALIIETNFKLYSYSNSPLQIAILSLFVHLKSRFANMVTGQLTRES 367

Query: 91  LYNAFENGITAEQIISFLQQNAHPR------------------VADRIPSVPENVCDQIR 132
           + NA  NGITAEQII++L+ +AHPR                  V + +  +P  V DQIR
Sbjct: 368 VRNALLNGITAEQIIAYLETHAHPRMRRLAEENLSKKLELDPTVKETLQVLPPTVVDQIR 427

Query: 133 LWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMR 192
           LW+ +L+R+     + Y +F S   ++   DYA+D   LLW++ KK    V+ E +  + 
Sbjct: 428 LWQLELDRIISYDGYLYTDFESYQEYQTVADYAKDIGVLLWQNEKKKMFFVSTEGNSQVL 487

Query: 193 EFLR 196
           +F R
Sbjct: 488 DFHR 491


>gi|440636738|gb|ELR06657.1| hypothetical protein GMDG_00274 [Geomyces destructans 20631-21]
          Length = 470

 Score =  127 bits (320), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 57/154 (37%), Positives = 97/154 (62%), Gaps = 2/154 (1%)

Query: 45  GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
           GF+V+ETN+R+YAY++S L   +L LF+K+  + PN++ G +T++S+  A E+GITA+QI
Sbjct: 316 GFIVIETNYRLYAYTSSPLQIAVLALFAKLSTRYPNMVSGRVTRDSVRGAIEHGITADQI 375

Query: 105 ISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDY 164
           IS+L  +AHP++  + P +P  V DQIRLW+ +  R++ T    +++F S   +E    Y
Sbjct: 376 ISYLTTHAHPQLRRKSPVLPPTVVDQIRLWQIENERMKATAGFLFKDFASLAEYEGCVRY 435

Query: 165 ARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
           A +   L+W    K    V    H  +R++++ +
Sbjct: 436 ADEIGVLVWRSDAKRNFFVTK--HEQLRDYIKSR 467


>gi|193633122|ref|XP_001945987.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Acyrthosiphon pisum]
          Length = 471

 Score =  127 bits (320), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 70/202 (34%), Positives = 115/202 (56%), Gaps = 7/202 (3%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAY 58
           ++   + GLV   Q  RK   F PT+L   +    + +S R   KE ++VVETNFR+YA 
Sbjct: 270 LQHLREFGLV--YQRKRKAGRFYPTRLVIEMGQGNSRTSERMKNKERYIVVETNFRIYAM 327

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S L   ++ LF+ + Y+ PN+  G +T++S+  A  +GITA QI+ FL  + HP++ +
Sbjct: 328 TDSDLKVALVALFTHMLYRFPNMSAGILTRDSVRTALRSGITAAQIVRFLTVHTHPQMQE 387

Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
               +P+ V DQI LWE++ NR+  T    Y    + + +E   +YA D   L+W D ++
Sbjct: 388 --CGMPQTVIDQIYLWENERNRLTYTDGVLYSNINTPNDYETIKNYAADIGALVWCDERR 445

Query: 179 MRLVVNAEIHMHMREFLRGQNK 200
             +VV+ + H  +R+F + Q K
Sbjct: 446 RNIVVSTDGHDDVRKFWKKQPK 467


>gi|402866404|ref|XP_003897374.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial,
           partial [Papio anubis]
          Length = 1647

 Score =  127 bits (320), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 77/233 (33%), Positives = 121/233 (51%), Gaps = 37/233 (15%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAY- 58
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY 
Sbjct: 308 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 365

Query: 59  --------------------------------STSKLHCEILRLFSKIEYQLPNLIVGAI 86
                                           S S+L   ++ LFS++ Y+ PN++V  +
Sbjct: 366 GEVGQSAPEEKKEGLEGVWVWGWPPHPLSIPGSESELQIALIALFSEMLYRFPNMVVAQV 425

Query: 87  TKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPA 146
           T+ES+  A  +GITA+QII FL+  AHP +  + P +P  + DQIRLWE + +R+  T  
Sbjct: 426 TRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEG 485

Query: 147 HYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
             Y +F S+  FE    +AR+   L++E+S K  +VV    H  ++ F + Q 
Sbjct: 486 VLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 538


>gi|345569654|gb|EGX52519.1| hypothetical protein AOL_s00043g13 [Arthrobotrys oligospora ATCC
           24927]
          Length = 452

 Score =  125 bits (315), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 72/208 (34%), Positives = 114/208 (54%), Gaps = 22/208 (10%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM---SLTDSSAR---------KEGFVV 48
           M++D  D G+V  ++   +   F PT+LAT L+    SL   +A          + GF++
Sbjct: 248 MLEDLRDYGIVYQRKSSSRR--FYPTRLATGLTSEIRSLRSPAATLANATPGNDENGFLI 305

Query: 49  VETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 108
           +ETN+       S L   +L LF+++  + PNL+   IT+ES+  A   GITAEQII ++
Sbjct: 306 IETNYH------SPLQIAVLNLFARLISRFPNLVTARITRESIRQAIGKGITAEQIIDYM 359

Query: 109 QQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDR 168
             NAHP++    P +P  V DQIRLW+ +  R+++T  + ++EF +   F A C YA D 
Sbjct: 360 TANAHPQLRKNSPVLPPTVVDQIRLWQIEGERMKVTSGYLFKEFATNAEFNAVCKYADDI 419

Query: 169 SGLLWEDSKKMRLVVNAEIHMHMREFLR 196
             L+W D  K  L V+   H  + +++R
Sbjct: 420 GVLVWIDKNKGMLFVSK--HEQIADYIR 445


>gi|62088538|dbj|BAD92716.1| VARS2L protein variant [Homo sapiens]
          Length = 1653

 Score =  125 bits (315), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 77/238 (32%), Positives = 121/238 (50%), Gaps = 42/238 (17%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAY- 58
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY 
Sbjct: 309 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 366

Query: 59  -------------------------------------STSKLHCEILRLFSKIEYQLPNL 81
                                                S S+L   ++ LFS++ Y+ PN+
Sbjct: 367 GEAGQRAPGRGGWGHLKEGLEGVWVWGWPPLPLSIPGSESELQIALIALFSEMLYRFPNM 426

Query: 82  IVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRV 141
           +V  +T+ES+  A  +GITA+QII FL+  AHP +  + P +P  + DQIRLWE + +R+
Sbjct: 427 VVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRL 486

Query: 142 EMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
             T    Y +F S+  FE    +AR+   L++E+S K  +VV    H  ++ F + Q 
Sbjct: 487 RFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 544


>gi|403308524|ref|XP_003944708.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
           [Saimiri boliviensis boliviensis]
          Length = 1752

 Score =  125 bits (315), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 77/238 (32%), Positives = 121/238 (50%), Gaps = 42/238 (17%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAY- 58
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY 
Sbjct: 407 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 464

Query: 59  -------------------------------------STSKLHCEILRLFSKIEYQLPNL 81
                                                S S+L   ++ LFS++ Y+ PN+
Sbjct: 465 GEVGQRAPGRAGWGRGNARLEGVWVWGWPPHPLSIPGSESELQIALIALFSEMLYRFPNM 524

Query: 82  IVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRV 141
           +V  +T+ES+  A  +GITA+QII FL+  AHP +  + P +P  + DQIRLWE + +R+
Sbjct: 525 VVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRL 584

Query: 142 EMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
             T    Y +F S+  FE    +AR+   L++E+S K  +VV    H  ++ F + Q 
Sbjct: 585 RFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 642


>gi|390461382|ref|XP_002746360.2| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
           [Callithrix jacchus]
          Length = 1675

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 77/239 (32%), Positives = 121/239 (50%), Gaps = 43/239 (17%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAY- 58
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY 
Sbjct: 329 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 386

Query: 59  --------------------------------------STSKLHCEILRLFSKIEYQLPN 80
                                                 S S+L   ++ LFS++ Y+ PN
Sbjct: 387 GEAGQRAPGRAGWGXHLKERFEGVWVWGWPPHPLSIPGSESELQIALIALFSEMLYRFPN 446

Query: 81  LIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNR 140
           ++V  +T+ES+  A  +GITA+QII FL+  AHP +  + P +P  + DQIRLWE + +R
Sbjct: 447 MVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDR 506

Query: 141 VEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
           +  T    Y +F S+  FE    +AR+   L++E+S K  +VV    H  ++ F + Q 
Sbjct: 507 LRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQK 565


>gi|260945437|ref|XP_002617016.1| hypothetical protein CLUG_02460 [Clavispora lusitaniae ATCC 42720]
 gi|238848870|gb|EEQ38334.1| hypothetical protein CLUG_02460 [Clavispora lusitaniae ATCC 42720]
          Length = 511

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 75/257 (29%), Positives = 123/257 (47%), Gaps = 65/257 (25%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS--------------MSLTDSSARKE-- 44
           M+ D  D GLV   Q     + F PT+LAT L+               +L D  A+ +  
Sbjct: 251 MLPDLRDYGLV--YQRSENSTRFFPTRLATTLTSESTGLKTPSMALNQTLEDPGAKDQEH 308

Query: 45  ----------GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 94
                     G +++ETNF++YAY+ S L   IL LF  ++ +  N+++G IT+ES+  A
Sbjct: 309 VTSEAGESQSGQIILETNFKLYAYTNSPLEIAILNLFVNLKVRFANMVIGQITRESIRKA 368

Query: 95  FENGITAEQIISFLQQNAHPRV-------------------------------------A 117
             NGITA+QII FL+ +AH ++                                      
Sbjct: 369 LYNGITADQIIKFLETHAHSQMRTSAKEKLDKKIEFDASHNINTAGGAPQSKSSDSGVAQ 428

Query: 118 DRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
            R+  +P NV DQI+LW+ +L+R++    + +++F +++ ++  C+YA +   LLW D  
Sbjct: 429 HRLEILPPNVVDQIKLWQLELDRIQTFEGYLFKDFKNQNEYDVLCNYATEVGVLLWSDKT 488

Query: 178 KMRLVVNAEIHMHMREF 194
           K+R  V AE    + +F
Sbjct: 489 KLRFFVTAEGMHQVADF 505


>gi|301786953|ref|XP_002928911.1| PREDICTED: LOW QUALITY PROTEIN: valyl-tRNA synthetase,
           mitochondrial-like [Ailuropoda melanoleuca]
          Length = 1637

 Score =  125 bits (313), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 78/242 (32%), Positives = 122/242 (50%), Gaps = 46/242 (19%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLS--MSLTDSSARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  +S    +A + GF+VVETN+R+YAY+
Sbjct: 286 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 343

Query: 60  T------------------------------------------SKLHCEILRLFSKIEYQ 77
                                                      S+L   ++ LFS++ Y+
Sbjct: 344 GEARQRAPNSRXGVGTFLVTLEGLEGVWVWGXPSSSLPIPGPESELQIALIALFSEMLYR 403

Query: 78  LPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESD 137
            PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  + P +P  + DQIRLWE +
Sbjct: 404 FPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELE 463

Query: 138 LNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRG 197
            +R+  T    Y +F S+  FE    +AR+   L++E+S K  +VV    H  ++ F + 
Sbjct: 464 RDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKR 523

Query: 198 QN 199
           Q 
Sbjct: 524 QK 525


>gi|193654859|ref|XP_001944632.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Acyrthosiphon pisum]
          Length = 470

 Score =  124 bits (310), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 68/202 (33%), Positives = 114/202 (56%), Gaps = 7/202 (3%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYAY 58
           ++   + GLV  +++  K   F PT+L   +    + +S R   KE ++VVETNFR+YA 
Sbjct: 269 LQHLQEFGLVYQRKL--KAGRFCPTRLVIEMGRENSHTSKRMKKKERYIVVETNFRIYAM 326

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S L   ++ LF+ + Y+ PN+  G +T++S+  A  NGITA QI+ FL  + HP++ +
Sbjct: 327 TDSDLKVALVALFTHMLYRFPNMSAGILTRDSVQTALRNGITAAQIVRFLTVHTHPQMQE 386

Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
               +P+ V DQI LWE + NR+       Y +  + + +EA  +YA D   LLW D ++
Sbjct: 387 --CGMPQTVIDQIYLWEYERNRLTYRDGVLYSDINTPNDYEAIKNYAADIGALLWCDERQ 444

Query: 179 MRLVVNAEIHMHMREFLRGQNK 200
             ++V+ + H  +  F + Q K
Sbjct: 445 RNIIVSTDGHNDVTTFWKKQPK 466


>gi|431907067|gb|ELK11185.1| General transcription factor IIH subunit 4 [Pteropus alecto]
          Length = 447

 Score =  124 bits (310), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 70/200 (35%), Positives = 113/200 (56%), Gaps = 20/200 (10%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--SARKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +  +A + GF+VVETN+R+YAY+
Sbjct: 263 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYT 320

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+Q                
Sbjct: 321 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQ---------------- 364

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
            P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K 
Sbjct: 365 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 424

Query: 180 RLVVNAEIHMHMREFLRGQN 199
            +VV    H  ++ F + Q 
Sbjct: 425 LMVVTPAGHSDVKRFWKRQK 444


>gi|171681598|ref|XP_001905742.1| hypothetical protein [Podospora anserina S mat+]
 gi|170940758|emb|CAP66407.1| unnamed protein product [Podospora anserina S mat+]
          Length = 540

 Score =  124 bits (310), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 75/233 (32%), Positives = 125/233 (53%), Gaps = 37/233 (15%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL-------SMSLTDSSARKE--------- 44
           M+   AD GL+ + +  R + ++ PT+LAT L       S+S +  +A K+         
Sbjct: 310 MLPALADFGLIYIDR-DRPQQYY-PTRLATTLTSLSTMRSVSASIDAATKKTPGDAGSLG 367

Query: 45  ------------GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLY 92
                       G +VVETN+R+YAY++S L   IL+LF ++  + PN++   +T+ES+ 
Sbjct: 368 ADSTPTAPADENGGIVVETNYRIYAYTSSPLQIAILKLFCRLHMRFPNMVTARLTRESVQ 427

Query: 93  NAFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAH 147
            A + GITA QII +L  +AHP++     A     +P  V DQIRLW+ +  R++ TP  
Sbjct: 428 EAIKEGITANQIIDYLVAHAHPQMRRAAAARGTTVIPPTVMDQIRLWQLESQRMQKTPGF 487

Query: 148 YYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
            +++F S + +    +YA +   L+W+D +K    V+      +REFL+ + K
Sbjct: 488 QFKDFESVEEYRQLAEYATEIGVLVWKDDRKGTFFVSKV--EQIREFLKARKK 538


>gi|324512881|gb|ADY45318.1| General transcription factor IIH subunit 4 [Ascaris suum]
          Length = 481

 Score =  123 bits (309), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 73/194 (37%), Positives = 112/194 (57%), Gaps = 14/194 (7%)

Query: 7   DLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE-----GFVVVETNFRMYAYSTS 61
           +LGLV +++  RK+ +F  T L  +L++S +  +A  E     G+++VETN+R+YAY+ S
Sbjct: 276 ELGLVFIRK--RKDGYFFITPLFAHLTVSSSSETAVAEKRTHSGYIIVETNYRVYAYTDS 333

Query: 62  KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV---AD 118
            L   IL  F+++ Y+  ++ VG +T++S+  A + GITA QIISFL+ NAHP     A+
Sbjct: 334 SLQLAILSTFTEMLYRFNDMSVGMLTRDSVRRALQVGITASQIISFLRANAHPETVAAAN 393

Query: 119 RIPS----VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWE 174
             P     VP  V DQIRLWE + +R+    +  Y  F S   +    +Y R +  LLW 
Sbjct: 394 ASPGVIHCVPVTVADQIRLWEDERHRLVFFDSALYSTFESEREYVGVKEYTRSQDILLWF 453

Query: 175 DSKKMRLVVNAEIH 188
           D  +  +VV  E H
Sbjct: 454 DDVQRLVVVTEEGH 467


>gi|298705308|emb|CBJ48998.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 534

 Score =  123 bits (309), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 71/200 (35%), Positives = 106/200 (53%), Gaps = 29/200 (14%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--------------------- 39
           + +DF +LGL+  ++   K + F PT +A NL    + S                     
Sbjct: 280 LAQDFVELGLLFKRKA--KSTRFYPTSIAVNLIFGSSPSGDAGGAGGGGTQRKPQPAGGL 337

Query: 40  -SARKEG----FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 94
              ++E      ++VETNF++ AY+ SKLH  +L LF ++   LPN IVGAIT+ES+  A
Sbjct: 338 GGGKREDDTSIHIIVETNFQVIAYTRSKLHFAMLSLFLELRALLPNAIVGAITRESMRKA 397

Query: 95  FENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYE-EFP 153
              GI   Q++ FL+ +AHP V  R P VPEN+ DQ+ LWE + +R+E       +  + 
Sbjct: 398 LSTGIKGRQVLDFLKWHAHPVVRRRTPVVPENIADQVLLWERERDRMEHRDGVLVDVSYA 457

Query: 154 SRDVFEAACDYARDRSGLLW 173
           SRD F    ++A  + GLLW
Sbjct: 458 SRDAFRGMTEFANAKQGLLW 477


>gi|354546081|emb|CCE42810.1| hypothetical protein CPAR2_204530 [Candida parapsilosis]
          Length = 451

 Score =  123 bits (308), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 77/259 (29%), Positives = 127/259 (49%), Gaps = 67/259 (25%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK-------------EG-- 45
           M+ D  DLGL+   Q  +  S F PT+LAT L+   +DSSA K             EG  
Sbjct: 198 MLADLKDLGLI--YQKSKTSSRFYPTRLATTLT---SDSSALKTPSMAVEQALEESEGQM 252

Query: 46  -------FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
                   +++ETNF++YAY+ S L   IL LF +++ +  N++ G IT+ES+ NA  NG
Sbjct: 253 MTSTSRESIIIETNFKIYAYTNSPLEIAILNLFVQMKTRFSNMVCGQITRESIRNALYNG 312

Query: 99  ITAEQIISFLQQNAHPRVA------------------------------------DRIPS 122
           IT++QII FL+ +AHP++                                      ++  
Sbjct: 313 ITSDQIIKFLETHAHPQMKALAKEKLDKKVEFDASHNINTAGGAPQSKTDGTISQHKLEV 372

Query: 123 VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLV 182
           +P NV DQI+LW+ +L+R++    + +++F ++  ++   +YA +   L+W D  K +  
Sbjct: 373 IPPNVVDQIKLWQLELDRIQTVEGYLFKDFANQQEYDMLSNYATELGVLVWGDKAKRKFF 432

Query: 183 VNAEIHMHMREF----LRG 197
           V  +    + +F    LRG
Sbjct: 433 VTKDGMAQVADFANRKLRG 451


>gi|397471696|ref|XP_003807420.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
           [Pan paniscus]
          Length = 1673

 Score =  123 bits (308), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 77/245 (31%), Positives = 121/245 (49%), Gaps = 49/245 (20%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAY- 58
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY 
Sbjct: 322 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 379

Query: 59  --------------------------------------------STSKLHCEILRLFSKI 74
                                                       S S+L   ++ LFS++
Sbjct: 380 GEAGQRAPGRGGWGALXHFSXHLKEGLEGVWVWGCPPHPLSIPGSESELQIALIALFSEM 439

Query: 75  EYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLW 134
            Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  + P +P  + DQIRLW
Sbjct: 440 LYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLW 499

Query: 135 ESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREF 194
           E + +R+  T    Y +F S+  FE    +AR+   L++E+S K  +VV    H  ++ F
Sbjct: 500 ELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRF 559

Query: 195 LRGQN 199
            + Q 
Sbjct: 560 WKRQK 564


>gi|313224499|emb|CBY20289.1| unnamed protein product [Oikopleura dioica]
          Length = 411

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 121/199 (60%), Gaps = 5/199 (2%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---KEGFVVVETNFRMYA 57
           +I+ F ++GL+  ++   KE  F PT+LA ++S++    ++    +E F++VETN+R+YA
Sbjct: 208 VIQHFREMGLIFKRK--SKEQRFYPTRLAQSISIAGGKKASSEDVQEQFILVETNYRIYA 265

Query: 58  YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
           Y+ S+LH  ++ LF++++Y+ P +IV  ++++S+  + + GI+AEQI+++L+ +AHP   
Sbjct: 266 YTDSELHYALISLFAEVQYRFPYMIVAQMSRDSIQQSADYGISAEQILNYLRSSAHPIAR 325

Query: 118 DRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
                VP+ V D I LW  +  R++      Y +F  ++ FE    YA+D   L+W + +
Sbjct: 326 KNKHWVPQVVEDNIHLWCKERERLKFNDGLLYHQFLDQEAFEMLKSYAQDIRALVWANDE 385

Query: 178 KMRLVVNAEIHMHMREFLR 196
           +  +VV    H  ++ + +
Sbjct: 386 RRFMVVAPWSHDQIKSYYK 404


>gi|313245784|emb|CBY34779.1| unnamed protein product [Oikopleura dioica]
          Length = 456

 Score =  121 bits (304), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 63/199 (31%), Positives = 120/199 (60%), Gaps = 5/199 (2%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS---LTDSSARKEGFVVVETNFRMYA 57
           +I+ F ++GL+  ++   KE  F PT+LA ++S++      S   +E F++VETN+R+YA
Sbjct: 253 VIQHFREMGLIFKRK--SKEQRFYPTRLAQSISIAGGKKASSEDVQEQFILVETNYRIYA 310

Query: 58  YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
           Y+ S+LH  ++ LF++++Y+ P +IV  ++++S+  + + GI+AEQI+++L+ +AHP   
Sbjct: 311 YTDSELHYALISLFAEVQYRFPYMIVAQMSRDSIQQSADYGISAEQILNYLRSSAHPIAR 370

Query: 118 DRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSK 177
                VP+ V D I LW  +  R++      Y +F  ++ FE    YA+D   L+W + +
Sbjct: 371 KNKHWVPQVVEDNIHLWCKERERLKFNDGLLYHQFLDQEAFEMLKSYAQDIRALVWANDE 430

Query: 178 KMRLVVNAEIHMHMREFLR 196
           +  +VV    H  ++ + +
Sbjct: 431 RRFMVVAPWSHDQIKSYYK 449


>gi|393910939|gb|EJD76097.1| transcription factor Tfb2 family protein, variant [Loa loa]
          Length = 389

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/210 (35%), Positives = 114/210 (54%), Gaps = 18/210 (8%)

Query: 3   KDFA----DLGLVKLQQVGRKESWFIPTKLATNL-SMSLTDSSA----RKEGFVVVETNF 53
           KDF     +LGLV +++  RK+ +F  T L  +L S+S    +      + GFV+VETN+
Sbjct: 174 KDFLMHLRELGLVFIRK--RKDGFFFLTPLLNHLTSISYASETGIENRNQNGFVIVETNY 231

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY+ S L   IL  F+++ Y+  ++ VG +++E++  AF+ GITA QII+FL+ NAH
Sbjct: 232 RVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQIIAFLRANAH 291

Query: 114 PRV-------ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
           P            I SVP  V DQIRLWE +  R+    A  Y  F S   +     Y  
Sbjct: 292 PTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFESEKEYFGLKGYVS 351

Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
            +  LLW D  +  +V+  E H  ++ + +
Sbjct: 352 SQGILLWCDDVQRLMVITEEGHESVKSWWK 381


>gi|366997767|ref|XP_003683620.1| hypothetical protein TPHA_0A01010 [Tetrapisispora phaffii CBS 4417]
 gi|357521915|emb|CCE61186.1| hypothetical protein TPHA_0A01010 [Tetrapisispora phaffii CBS 4417]
          Length = 504

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/252 (29%), Positives = 120/252 (47%), Gaps = 61/252 (24%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE---------------- 44
           M+KD  D GL+   Q     + F PT LAT L+   +DSS  +                 
Sbjct: 251 MLKDMRDYGLIF--QKTSNATVFYPTTLATQLT---SDSSTVRSASGAIDSLLSQTKEAN 305

Query: 45  ----------------------GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLI 82
                                 G +VVETNF++Y+YS S L   IL LF  +  +  N++
Sbjct: 306 NNDSQNLGTDTDDLVGNQSVEVGALVVETNFKLYSYSNSPLQIAILSLFVHLRTRFSNMV 365

Query: 83  VGAITKESLYNAFENGITAEQIISFLQQNAHPRV------------------ADRIPSVP 124
            G IT+ES+ NA  NG+TA+QII++LQ +AHP++                   D++  +P
Sbjct: 366 TGQITRESIRNALANGVTADQIIAYLQSHAHPQMRRLAETTLKKKLELDPNTKDQLQILP 425

Query: 125 ENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVN 184
             V DQI+LW+ +L+R+       Y +F +   +    +YA+D   LLW++ +K +  V+
Sbjct: 426 PTVVDQIKLWQLELDRILSYDGSLYSDFDNNQEYNMLYNYAKDIGVLLWKNDRKRKFFVS 485

Query: 185 AEIHMHMREFLR 196
            E +  + +F +
Sbjct: 486 KEGNSQVLDFAK 497


>gi|393910938|gb|EFO26137.2| transcription factor Tfb2 family protein [Loa loa]
          Length = 486

 Score =  120 bits (302), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/210 (35%), Positives = 114/210 (54%), Gaps = 18/210 (8%)

Query: 3   KDFA----DLGLVKLQQVGRKESWFIPTKLATNL-SMSLTDSSA----RKEGFVVVETNF 53
           KDF     +LGLV +++  RK+ +F  T L  +L S+S    +      + GFV+VETN+
Sbjct: 271 KDFLMHLRELGLVFIRK--RKDGFFFLTPLLNHLTSISYASETGIENRNQNGFVIVETNY 328

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY+ S L   IL  F+++ Y+  ++ VG +++E++  AF+ GITA QII+FL+ NAH
Sbjct: 329 RVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQIIAFLRANAH 388

Query: 114 PRV-------ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
           P            I SVP  V DQIRLWE +  R+    A  Y  F S   +     Y  
Sbjct: 389 PTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFESEKEYFGLKGYVS 448

Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
            +  LLW D  +  +V+  E H  ++ + +
Sbjct: 449 SQGILLWCDDVQRLMVITEEGHESVKSWWK 478


>gi|312069992|ref|XP_003137940.1| transcription factor Tfb2 family protein [Loa loa]
          Length = 470

 Score =  120 bits (302), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/210 (35%), Positives = 114/210 (54%), Gaps = 18/210 (8%)

Query: 3   KDFA----DLGLVKLQQVGRKESWFIPTKLATNL-SMSLTDSSA----RKEGFVVVETNF 53
           KDF     +LGLV +++  RK+ +F  T L  +L S+S    +      + GFV+VETN+
Sbjct: 255 KDFLMHLRELGLVFIRK--RKDGFFFLTPLLNHLTSISYASETGIENRNQNGFVIVETNY 312

Query: 54  RMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           R+YAY+ S L   IL  F+++ Y+  ++ VG +++E++  AF+ GITA QII+FL+ NAH
Sbjct: 313 RVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQIIAFLRANAH 372

Query: 114 PRV-------ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
           P            I SVP  V DQIRLWE +  R+    A  Y  F S   +     Y  
Sbjct: 373 PTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFESEKEYFGLKGYVS 432

Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
            +  LLW D  +  +V+  E H  ++ + +
Sbjct: 433 SQGILLWCDDVQRLMVITEEGHESVKSWWK 462


>gi|46127843|ref|XP_388475.1| hypothetical protein FG08299.1 [Gibberella zeae PH-1]
 gi|408390907|gb|EKJ70292.1| hypothetical protein FPSE_09509 [Fusarium pseudograminearum CS3096]
          Length = 494

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 72/235 (30%), Positives = 117/235 (49%), Gaps = 39/235 (16%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------------------------- 34
           M+    D GL+ +     K S F PT+LAT L+                           
Sbjct: 262 MLPSLVDFGLIYIPN--HKRSMFFPTRLATTLTSSSNSLRSISDGVAAATAAALQPGQSR 319

Query: 35  ----SLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKES 90
               S T SS  + G V++ETN+R+YAY+ S L   +L LF+K++ + P+++ G I+++S
Sbjct: 320 APGSSATGSSNEQRGSVIIETNYRIYAYTQSTLQIAVLALFTKLQMRFPDMVAGRISRQS 379

Query: 91  LYNAFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
           +  A   GITAEQIIS+L  +AH ++      +  P +P  V DQIRLW+ +  R++ T 
Sbjct: 380 IRQAINFGITAEQIISYLSAHAHDQMRRTASLNNKPVLPPTVVDQIRLWQLENERMKTTG 439

Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
              +++F     + A   +A +   L+W +  K      A  H  +R++LR + K
Sbjct: 440 GFLFKDFEDHKEYMAVAGFAEEVGVLVWRNDVKGMFF--ASKHEQIRDYLRIRKK 492


>gi|378729002|gb|EHY55461.1| transcription initiation factor TFIIH subunit H4 [Exophiala
           dermatitidis NIH/UT8656]
          Length = 508

 Score =  120 bits (301), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 59/153 (38%), Positives = 92/153 (60%), Gaps = 14/153 (9%)

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
           F++VETN+R+YAY++S L   +L LF  +  + PNL+ G ++K S+  A + GITA+QII
Sbjct: 338 FIIVETNYRLYAYTSSPLQIALLSLFVNLRSRHPNLVTGKMSKSSVQRAIQAGITADQII 397

Query: 106 SFLQQNAHP-----------RVADR---IPSVPENVCDQIRLWESDLNRVEMTPAHYYEE 151
           S+L  +AHP           R AD    +P +P  + DQI LW+ + +R+  TP    ++
Sbjct: 398 SYLTSHAHPQMRRHAQAEQARNADPNRVVPILPATILDQIHLWQLERDRMTTTPGFLLKD 457

Query: 152 FPSRDVFEAACDYARDRSGLLWEDSKKMRLVVN 184
           FP++  +EA C YA +   L+W++ KK    VN
Sbjct: 458 FPNQADYEAPCRYADEIGVLVWKNDKKRMFFVN 490


>gi|296422658|ref|XP_002840876.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295637102|emb|CAZ85067.1| unnamed protein product [Tuber melanosporum]
          Length = 460

 Score =  120 bits (300), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 102/188 (54%), Gaps = 16/188 (8%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS--------------MSLTDSSARKEGF 46
           M+ D  D G+V   Q       F PT+LAT L+               ++   +   +GF
Sbjct: 242 MLADLRDYGIV--YQRKSSSDRFYPTRLATTLTSESGGLRSASASMSSAMAKDAEEGKGF 299

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           +++ETN+R+YAY+ S L   +L LF K+  + PNL+ G I++ S+  A + GITA+Q+I 
Sbjct: 300 IILETNYRVYAYTDSPLQIAVLNLFVKLSTRYPNLVSGRISRRSIQEAIKMGITADQVID 359

Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
           +L  +AHP++   + ++P  V DQIRLW+ +  R+  T    +++F S   FE    YA 
Sbjct: 360 YLSAHAHPQMRKSLVTLPPTVVDQIRLWQIEGERMRTTTGFLFKDFHSTQEFEEVAKYAE 419

Query: 167 DRSGLLWE 174
           +   L W+
Sbjct: 420 ELGVLKWK 427


>gi|344229873|gb|EGV61758.1| transcription factor Tfb2 [Candida tenuis ATCC 10573]
          Length = 498

 Score =  119 bits (299), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 75/237 (31%), Positives = 117/237 (49%), Gaps = 54/237 (22%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSL---------TDSSA 41
           M+ D  D GLV   Q     + F PT+LAT L          SM++         TDS  
Sbjct: 252 MLSDLRDFGLV--YQRSATSNKFYPTRLATTLTSDSNSLQTPSMAIDKANSGIDSTDSKQ 309

Query: 42  RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
           R+E  +++ETNF++Y+Y+ S L   IL LF  +  +  N++ G IT+ES+ NA  NGITA
Sbjct: 310 RQES-IIIETNFKVYSYTQSPLEIAILNLFVHLRTRFANMVTGQITRESIRNALYNGITA 368

Query: 102 EQIISFLQQNAHPRVA--------------------------------DRIPSVPENVCD 129
            Q+I FL+ +AHP++                                  ++  +P NV D
Sbjct: 369 GQVIKFLETHAHPQMKMLAQEKLDKKIEFDASNNINTATGKSTDAPMQHKLEILPPNVVD 428

Query: 130 QIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAE 186
           QI+LW+ +L+R++    + + EF ++  +E+  +YA +   L+W D KK    V  E
Sbjct: 429 QIKLWQLELDRIQTFDGYLFREFNNQIDYESLRNYASEIGVLVWSDDKKRTCFVTKE 485


>gi|170589001|ref|XP_001899262.1| Transcription factor Tfb2 family protein [Brugia malayi]
 gi|158593475|gb|EDP32070.1| Transcription factor Tfb2 family protein [Brugia malayi]
          Length = 465

 Score =  119 bits (299), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 75/207 (36%), Positives = 113/207 (54%), Gaps = 18/207 (8%)

Query: 2   IKDFA----DLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE-----GFVVVETN 52
           IKDF     +LGLV +++  RK+ +F  T L  +L+     S A  E     GFV+VETN
Sbjct: 249 IKDFLMHLRELGLVFIRK--RKDGFFFLTPLLNHLTGISNTSEAGMENRNQNGFVIVETN 306

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ S L   IL  F+++ Y+  ++ VG +++E++  AF+ GITA QII+FL +NA
Sbjct: 307 YRVYAYTDSNLQLAILSTFTEMLYRFSDMSVGVLSREAVRRAFQVGITAAQIIAFLSRNA 366

Query: 113 HP-------RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYA 165
           HP            I SVP  V DQI+LWE +  R+    A  Y  F S   +     Y 
Sbjct: 367 HPITCAVTTEYGRMIQSVPITVVDQIQLWEDERRRLTFCSAAVYSSFESEKEYFGLKGYV 426

Query: 166 RDRSGLLWEDSKKMRLVVNAEIHMHMR 192
             ++ LLW    +  +V+  E H +++
Sbjct: 427 ISQNILLWCHDVQRLMVITEEGHENVK 453


>gi|344303892|gb|EGW34141.1| RNA polymerase II transcription factor B subunit 2 [Spathaspora
           passalidarum NRRL Y-27907]
          Length = 519

 Score =  119 bits (299), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 78/263 (29%), Positives = 122/263 (46%), Gaps = 74/263 (28%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK----------------- 43
           M+ D  D GLV  +      S F PT+LAT L+   +DSSA K                 
Sbjct: 252 MLADLKDYGLVYRRT--DTSSRFYPTRLATTLT---SDSSALKTPAMAVEQALDSVEGGT 306

Query: 44  -------------EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKES 90
                        +G V++ETNF++YAY+ S L   IL LF  +  +  N++ G IT+ES
Sbjct: 307 DNEVSQTAPPGNIQGTVIIETNFKLYAYTNSPLEIAILNLFVHLRTRFANMVCGQITRES 366

Query: 91  LYNAFENGITAEQIISFLQQNAHPRV---------------------------------A 117
           + NA  NGITA+QII FL+ +AHP++                                 A
Sbjct: 367 IRNALYNGITADQIIRFLETHAHPQMKVLAKEKLDKKIEFDTSHNINTAGGAPQSQALRA 426

Query: 118 D------RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGL 171
           D      ++  +P NV DQI+LW+ +L+R++    + +++FP++  F+    YA +   L
Sbjct: 427 DGSVAQHKLEILPPNVVDQIKLWQLELDRIQTFDGYLFKDFPNQQEFDKLSSYASELGVL 486

Query: 172 LWEDSKKMRLVVNAEIHMHMREF 194
           +W D  K +  V  +    + +F
Sbjct: 487 IWSDKIKRKFFVTKDGMSQVADF 509


>gi|50291141|ref|XP_448003.1| hypothetical protein [Candida glabrata CBS 138]
 gi|62901305|sp|Q6FP41.1|TFB2_CANGA RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|49527314|emb|CAG60954.1| unnamed protein product [Candida glabrata]
          Length = 504

 Score =  119 bits (298), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 59/176 (33%), Positives = 100/176 (56%), Gaps = 18/176 (10%)

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
           ++G +++ETNF++Y+Y  S L   IL LF  ++ +  N++ G IT+ES+  A  NGITA+
Sbjct: 325 QDGALIIETNFKLYSYCNSPLQIAILSLFVHLKSRFANMVAGQITRESIRRALINGITAD 384

Query: 103 QIISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMT 144
           Q+I++L+ +AHP++                   D +  +P  V DQI+LW+ +L+RV   
Sbjct: 385 QVIAYLESHAHPQMRRLAEEKLQKKLELDPNCKDPLQVLPPTVVDQIKLWQLELDRVLTY 444

Query: 145 PAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
               Y +F +   F   C YA+D   LLW+D +K +L V+ E +  + E+ + + K
Sbjct: 445 EGSLYIDFDTAQDFNMLCKYAQDIGALLWKDDRKRKLFVSREGNAQVLEYAKRKIK 500


>gi|448527133|ref|XP_003869438.1| TFIIH and nucleotide excision repair factor 3 complex subunit
           [Candida orthopsilosis Co 90-125]
 gi|380353791|emb|CCG23303.1| TFIIH and nucleotide excision repair factor 3 complex subunit
           [Candida orthopsilosis]
          Length = 500

 Score =  119 bits (298), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 75/259 (28%), Positives = 124/259 (47%), Gaps = 67/259 (25%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGF-------------- 46
           M+ D  DLGLV   Q     + F PT+LAT L+   +DSSA K                 
Sbjct: 247 MLADLKDLGLV--YQKSDTSNKFYPTRLATTLT---SDSSALKTPSMAVQQALEENEEQM 301

Query: 47  --------VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
                   +++ETNF++YAY+ S L   IL LF +++ +  N++ G IT+ES+ NA  NG
Sbjct: 302 MASNSRESIIIETNFKIYAYTNSPLEIAILNLFVQMKTRFSNMVCGQITRESIRNALYNG 361

Query: 99  ITAEQIISFLQQNAHPRV------------------------------------ADRIPS 122
           IT++QII FL+ +AHP++                                      ++  
Sbjct: 362 ITSDQIIKFLETHAHPQMRALAKEKLDKKVEFDASHNINTAGGAPQSKTDGAISQHKLEV 421

Query: 123 VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLV 182
           +P NV DQI+LW+ +L+R++    + +++F ++  ++   +YA +   L+W D  K +  
Sbjct: 422 IPPNVVDQIKLWQLELDRIQTVEGYLFKDFANQQEYDTLSNYATELGVLVWGDKVKRKFF 481

Query: 183 VNAEIHMHMREF----LRG 197
           V  +    + +F    LRG
Sbjct: 482 VTKDGMAQVADFANRKLRG 500


>gi|156839389|ref|XP_001643386.1| hypothetical protein Kpol_479p16 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156113994|gb|EDO15528.1| hypothetical protein Kpol_479p16 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 504

 Score =  119 bits (298), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 74/250 (29%), Positives = 122/250 (48%), Gaps = 56/250 (22%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM------------------------SL 36
           M+KD  D GL+   Q     + F PT+LAT L+                           
Sbjct: 251 MMKDMRDYGLIF--QKNSNTNVFYPTRLATMLTSDSRTVRNASSAMDSVLTQSKDEPSGA 308

Query: 37  TDSSAR------------KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG 84
           T  SA             ++G +VVETNF++Y+YS S L   IL LF  ++ +  N++ G
Sbjct: 309 TSGSADVDEQVGTQGQNIQDGALVVETNFKLYSYSNSPLQIAILSLFVHLKTRFSNMVTG 368

Query: 85  AITKESLYNAFENGITAEQIISFLQQNAHPRV------------------ADRIPSVPEN 126
            IT+ES+ NA  NGITAEQII++L+ +AHP++                   + +  +P  
Sbjct: 369 QITRESIRNALSNGITAEQIIAYLETHAHPQMRRLAEERLKKKLELDQNSKETLQILPPT 428

Query: 127 VCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAE 186
           V DQI+LW+ +L+R+       Y +F ++  ++    YA+D   L+W+D +K +  V+ E
Sbjct: 429 VVDQIKLWQLELDRIISYEGSLYSDFDNKQEYDMLYSYAQDIGVLIWKDDRKRKFFVSKE 488

Query: 187 IHMHMREFLR 196
            +  + ++ +
Sbjct: 489 SNSQVLDYAK 498


>gi|149238950|ref|XP_001525351.1| TFIIH basal transcription factor complex subunit 2 [Lodderomyces
           elongisporus NRRL YB-4239]
 gi|146450844|gb|EDK45100.1| TFIIH basal transcription factor complex subunit 2 [Lodderomyces
           elongisporus NRRL YB-4239]
          Length = 503

 Score =  119 bits (297), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 73/232 (31%), Positives = 112/232 (48%), Gaps = 60/232 (25%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS---------------------MSLTDS 39
           M+ D  D GLV   Q     S F PT+LAT L+                      SL  S
Sbjct: 250 MLADLKDYGLV--YQKSDTSSRFYPTRLATTLTSDSAALKTPSMAVEQALDETETSLASS 307

Query: 40  SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGI 99
           +  KE  +++ETNF+MYAY+ S L   IL LF  ++ +  N+I G IT+ES+ NA  NGI
Sbjct: 308 TQTKES-IIIETNFKMYAYTNSPLEIAILNLFVHMKTRFSNMICGQITRESIRNALYNGI 366

Query: 100 TAEQIISFLQQNAHPRV------------------------------------ADRIPSV 123
           TA+QII FL+ +AHP++                                      R+  +
Sbjct: 367 TADQIIKFLETHAHPQMRALAKERLDKKVEFDASHNINTAGGAPQSKTDGAISQHRLEVL 426

Query: 124 PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWED 175
           P NV DQI+LW+ +L+R++    + +++F ++  ++   +YA +   L+W D
Sbjct: 427 PPNVVDQIKLWQLELDRIQTFDGYLFKDFANQQEYDMLSNYAAEIGVLIWAD 478


>gi|358333989|dbj|GAA34589.2| transcription initiation factor TFIIH subunit 4 [Clonorchis
           sinensis]
          Length = 456

 Score =  119 bits (297), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 60/134 (44%), Positives = 82/134 (61%)

Query: 33  SMSLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLY 92
           S+  T SS    G++++ETNFR+YAY+ S L   +L LFSKI  + PNL+V  IT++S+ 
Sbjct: 288 SVQPTASSTSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVR 347

Query: 93  NAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
            A   GITA+QIISFL  NAHP +    P +P  + DQIRLWE + +R        YE+F
Sbjct: 348 EALIRGITADQIISFLTTNAHPDMLREPPILPPTLVDQIRLWELERDRFVFQEGCLYEQF 407

Query: 153 PSRDVFEAACDYAR 166
                FE   D+A+
Sbjct: 408 SKSADFEMVRDFAK 421


>gi|452839401|gb|EME41340.1| hypothetical protein DOTSEDRAFT_156149 [Dothistroma septosporum
           NZE10]
          Length = 497

 Score =  118 bits (296), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 72/207 (34%), Positives = 110/207 (53%), Gaps = 33/207 (15%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------------------SMSLTD 38
           M+ D AD GL+  +        + PT+LAT L                      S  L  
Sbjct: 258 MLDDLADFGLIYRRTPDSHR--YYPTRLATTLTSDAPALTNNSLMKTTVMTGDPSTDLAA 315

Query: 39  SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
           S++ ++GF+++ETN+R+YAY++S L   IL LF+ +  + PNLI   ITK S  NA   G
Sbjct: 316 SASDEKGFIILETNYRLYAYTSSPLLISILSLFASLNTRYPNLITAKITKASTQNAIAAG 375

Query: 99  ITAEQIISFLQQNAHP-----RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFP 153
           IT++QII +L  +AHP       A+  P +P  V DQI+LW+ +  R+E T  +   +  
Sbjct: 376 ITSDQIIDYLTTHAHPVLRRQAAANDAPILPPTVVDQIKLWQMEGERMEATKGYLIRDIG 435

Query: 154 SRDVFEAACDYARDRSGLLWEDSKKMR 180
           S++ ++ A +YA D  G+L   SK+ R
Sbjct: 436 SQEEYDKAVNYA-DAIGVL---SKEFR 458


>gi|150863860|ref|XP_001382481.2| RNA polymerase II transcription factor B subunit 2 (RNA polymerase
           II transcription factor B p52 subunit) (RNA polymerase
           II transcription factor B 52 kDa subunit)
           [Scheffersomyces stipitis CBS 6054]
 gi|149385116|gb|ABN64452.2| RNA polymerase II transcription factor B subunit 2 (RNA polymerase
           II transcription factor B p52 subunit) (RNA polymerase
           II transcription factor B 52 kDa subunit)
           [Scheffersomyces stipitis CBS 6054]
          Length = 515

 Score =  118 bits (296), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 77/258 (29%), Positives = 121/258 (46%), Gaps = 68/258 (26%)

Query: 1   MIKDFADLGLV--KLQQVGRKESWFIPTKLATNL----------SMSL------------ 36
           M+ D  D GLV  +    GR    F PT+LAT L          SM++            
Sbjct: 252 MLADLRDYGLVYQRTDSSGR----FYPTRLATTLTSDSAALKTPSMAIEQALGASDGGEE 307

Query: 37  --TDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 94
              D+    +G V++ETNF++YAY+ S L   IL LF  ++ +  N++ G IT+ES+ +A
Sbjct: 308 QQIDTPTGNQGTVIIETNFKLYAYTNSPLEIAILNLFVHLKSRFTNMVCGQITRESIRSA 367

Query: 95  FENGITAEQIISFLQQNAHPRV----------------------ADRIPS---------- 122
             NGITA+QII FL+ +AHP++                      A   P           
Sbjct: 368 LYNGITADQIIKFLETHAHPQMKLLAKEKLDKKIEFDTSHNINTAGGAPQSKVSGEGNVA 427

Query: 123 ------VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
                 +P  V DQI+LW+ +L+R++    + +++F ++  F+   +YA +   L+W D 
Sbjct: 428 QHKLEILPPTVVDQIKLWQLELDRIQTFDGYLFKDFSNQQEFDTLSNYASEVGVLVWSDK 487

Query: 177 KKMRLVVNAEIHMHMREF 194
            K +  V AE    + EF
Sbjct: 488 IKKKFFVTAEGMTQVAEF 505


>gi|365986883|ref|XP_003670273.1| hypothetical protein NDAI_0E02130 [Naumovozyma dairenensis CBS 421]
 gi|343769043|emb|CCD25030.1| hypothetical protein NDAI_0E02130 [Naumovozyma dairenensis CBS 421]
          Length = 522

 Score =  118 bits (296), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 97/171 (56%), Gaps = 18/171 (10%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           +G ++VETNF++Y+YS S L   IL LF  ++++  N++ G IT+ES+  A  NGITAEQ
Sbjct: 343 DGSLIVETNFKLYSYSNSPLQIAILSLFVHLKFRFVNMVTGQITRESIRRALINGITAEQ 402

Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
           II++L+ +AHP++                   D +  +P  V DQIRLW+ +L+RV    
Sbjct: 403 IIAYLETHAHPQMRRLAEEKLEKKLELDANCKDSLQILPPTVVDQIRLWQLELDRVIAYE 462

Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
              Y +F +   +     YA+D   LLW+D KK +  V+ E +  + ++ +
Sbjct: 463 GSLYSDFENNTEYTTLYKYAQDIGVLLWKDDKKRKFFVSKEGNSQVLDYAK 513


>gi|45187599|ref|NP_983822.1| ADL274Wp [Ashbya gossypii ATCC 10895]
 gi|62901343|sp|Q75B51.1|TFB2_ASHGO RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|44982337|gb|AAS51646.1| ADL274Wp [Ashbya gossypii ATCC 10895]
 gi|374107034|gb|AEY95942.1| FADL274Wp [Ashbya gossypii FDAG1]
          Length = 514

 Score =  118 bits (296), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 81/254 (31%), Positives = 123/254 (48%), Gaps = 62/254 (24%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS---MSLTDSSA--------RKEGF--- 46
           M++D  D GLV  +Q     S F PT+LAT L+   +S+  +S         R EG    
Sbjct: 260 MLQDLRDYGLVFQKQ--SNLSKFYPTRLATMLTSDVVSIRSASGAVNSVLRQRAEGVDGK 317

Query: 47  -----------------------VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIV 83
                                  ++VETNF++Y+YS S L   IL LF  ++ +  N++ 
Sbjct: 318 VLNGTALGDDDLQAGGEGALDGALIVETNFKLYSYSNSPLQIAILSLFIHLKTRFQNMVT 377

Query: 84  GAITKESLYNAFENGITAEQIISFLQQNAHP---RVA---------------DRIPSVPE 125
           G IT+ES+  A  NGITA+QII++++ +AHP   R+A               D +  +P 
Sbjct: 378 GQITRESIRRALHNGITADQIIAYMETHAHPQMRRLAGDNLEKKLELDPNCRDTLQVLPP 437

Query: 126 NVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNA 185
            V DQI+LW+ +L+R+     + + +F +   ++    YARD   LLW D KK    V+ 
Sbjct: 438 TVVDQIKLWQLELDRIISYDGYLFRDFDNLQEYQVLAQYARDIGVLLWSDDKKKMFFVSK 497

Query: 186 E-----IHMHMREF 194
           E     I  H R+F
Sbjct: 498 EGNAQVIDFHKRKF 511


>gi|452977455|gb|EME77221.1| hypothetical protein MYCFIDRAFT_146475 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 476

 Score =  117 bits (294), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 73/203 (35%), Positives = 110/203 (54%), Gaps = 33/203 (16%)

Query: 1   MIKDFADLGLV-KLQQVGRKESWFIPTKLATNLSMSLTD--------------------- 38
           M++D +DLGL+   +Q  R    + PT+LAT L+                          
Sbjct: 241 MLEDLSDLGLIFHPEQSDR----YYPTRLATTLTSDAPALLNSSHTSTTTTVSASSNDDL 296

Query: 39  -SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 97
            +SA ++GF+++ETN+R+YAY+ S L   IL LF+ +  + PNL+   ITK S+  A  +
Sbjct: 297 AASANEKGFIILETNYRLYAYTNSPLLISILSLFASLHTRYPNLVTAKITKTSVQAAISS 356

Query: 98  GITAEQIISFLQQNAHP---RVAD--RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           GIT+ QIIS+LQ +AHP   R A     P +P  V DQIRLW+ +  R+  T  +   E 
Sbjct: 357 GITSNQIISYLQTHAHPILRRTASMHNAPILPPTVVDQIRLWQIEGERMTSTKGYLIREV 416

Query: 153 PSRDVFEAACDYARDRSGLLWED 175
            S++ +E A  YA +  G+L +D
Sbjct: 417 GSKEDYEKAVQYA-EALGILVKD 438


>gi|403217068|emb|CCK71563.1| hypothetical protein KNAG_0H01490 [Kazachstania naganishii CBS
           8797]
          Length = 519

 Score =  117 bits (293), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 61/178 (34%), Positives = 99/178 (55%), Gaps = 18/178 (10%)

Query: 37  TDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 96
           T S   ++G ++VETNF++Y+YS S L   IL LF  ++ +  N++ G ITK S+ NA +
Sbjct: 335 TQSQEVQDGVLIVETNFKVYSYSNSPLQIAILSLFVHLKSRFINMVTGQITKRSIINALD 394

Query: 97  NGITAEQIISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDL 138
           NGITAEQII++L+ +AHP++                   D +  +P  V DQI+LW+ + 
Sbjct: 395 NGITAEQIIAYLETHAHPQMRRLAEAQLEKKRELDQNCKDPLQILPPTVVDQIKLWQLEK 454

Query: 139 NRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
           +RV       Y +F S   +     YA+D   +LW + KK +L V+ E +  + ++ +
Sbjct: 455 DRVMAFEGSLYSDFESNAEYNILKKYAQDIGVMLWSEDKKRKLFVSQEGNSQVLDYAK 512


>gi|336466164|gb|EGO54329.1| hypothetical protein NEUTE1DRAFT_87562 [Neurospora tetrasperma FGSC
           2508]
 gi|350286986|gb|EGZ68233.1| transcription factor Tfb2 [Neurospora tetrasperma FGSC 2509]
          Length = 475

 Score =  117 bits (293), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 71/239 (29%), Positives = 120/239 (50%), Gaps = 44/239 (18%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSLTDSSARKEG----- 45
           M+    D GL+   +  R+   F PT+LAT L          S + T ++A   G     
Sbjct: 240 MLPSLVDFGLIYFPRDTRQ---FFPTRLATTLTSTASALRTVSSAFTAATANPSGEPGTT 296

Query: 46  -------------------FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAI 86
                               +++ETN+R+YAY+TS L   +L LF ++  + PN++ G +
Sbjct: 297 TSTSGGPAATPDKTQTAAKGIIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSGRL 356

Query: 87  TKESLYNAFENGITAEQIISFLQQNAHPRVADRI-----PSVPENVCDQIRLWESDLNRV 141
           T++S+  A   GITA+QIIS+L  +AH ++         P +P  V DQIRLW+ +  R+
Sbjct: 357 TRDSIRQAIGFGITADQIISYLHAHAHQQMVREATVTGKPVLPPTVVDQIRLWQLENERM 416

Query: 142 EMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
           + T    +++F S + + A   YA +   L+W++ K  R +  A  H  +R++L+ + K
Sbjct: 417 QTTAGFLFKDFESTEEYAALSRYAEEIGVLVWKNDK--RQLFFASKHEQLRDYLKSRKK 473


>gi|254573074|ref|XP_002493646.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes
           [Komagataella pastoris GS115]
 gi|238033445|emb|CAY71467.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes
           [Komagataella pastoris GS115]
 gi|328354526|emb|CCA40923.1| RNA polymerase II transcription factor B subunit 2 [Komagataella
           pastoris CBS 7435]
          Length = 547

 Score =  117 bits (293), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 67/227 (29%), Positives = 111/227 (48%), Gaps = 44/227 (19%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSLTD-----SSARKEG 45
           M++D  D GL+   +  R   +F PT++AT L          SMS+          +  G
Sbjct: 307 MLEDLVDYGLIYYPEKNR---YFYPTRMATTLTSEKTTFKTASMSMNQVLEGGPEVKDHG 363

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
            +V+ETNF++Y Y+TS L   IL LF  +  +  N++ G IT+ES+  A  NGITA+QII
Sbjct: 364 SIVLETNFKLYCYTTSPLQIAILNLFVHLRTRFANMVTGMITRESVREALRNGITADQII 423

Query: 106 SFLQQNAHPRVAD--------------------------RIPSVPENVCDQIRLWESDLN 139
            +L+ +AH ++                            ++  +P  V DQI+LW+ +++
Sbjct: 424 KYLETHAHSQMKKLAEEKLLKKLEFDSNATETQQIGSQVKLEVIPPTVVDQIKLWQLEMD 483

Query: 140 RVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAE 186
           R++      +++F +   FE   +YA +   +LW D  K +  V  E
Sbjct: 484 RLQTFAGFLFKDFANAQEFEQLANYADEVGVMLWRDDDKRKFFVTEE 530


>gi|164425579|ref|XP_960485.2| hypothetical protein NCU05523 [Neurospora crassa OR74A]
 gi|157070982|gb|EAA31249.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 475

 Score =  117 bits (293), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 71/239 (29%), Positives = 120/239 (50%), Gaps = 44/239 (18%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSLTDSSARKEG----- 45
           M+    D GL+   +  R+   F PT+LAT L          S + T ++A   G     
Sbjct: 240 MLPSLVDFGLIYFPRDTRQ---FFPTRLATTLTSTASALRTVSSAFTAATANPSGEPGTT 296

Query: 46  -------------------FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAI 86
                               +++ETN+R+YAY+TS L   +L LF ++  + PN++ G +
Sbjct: 297 TSTSGGPAATPDKTQTAAKGIIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSGRL 356

Query: 87  TKESLYNAFENGITAEQIISFLQQNAHPRVADRI-----PSVPENVCDQIRLWESDLNRV 141
           T++S+  A   GITA+QIIS+L  +AH ++         P +P  V DQIRLW+ +  R+
Sbjct: 357 TRDSIRQAIGFGITADQIISYLHAHAHQQMVREATVTGKPVLPPTVVDQIRLWQLENERM 416

Query: 142 EMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
           + T    +++F S + + A   YA +   L+W++ K  R +  A  H  +R++L+ + K
Sbjct: 417 QTTAGFLFKDFESTEEYAALSRYAEEIGVLVWKNDK--RQLFFASKHEQLRDYLKSRKK 473


>gi|402592023|gb|EJW85952.1| transcription factor Tfb2 family protein [Wuchereria bancrofti]
          Length = 470

 Score =  117 bits (292), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 75/211 (35%), Positives = 113/211 (53%), Gaps = 18/211 (8%)

Query: 2   IKDFA----DLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE-----GFVVVETN 52
           IKDF     +LGLV +++  RK+  F  T L  +L+     S A  E     GFV+VETN
Sbjct: 254 IKDFLIHLRELGLVFIRK--RKDGVFFLTPLLNHLTGISNTSEASMENRNQHGFVIVETN 311

Query: 53  FRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 112
           +R+YAY+ S L   IL  F+++ Y+  ++ VG +++E++  AF+ GITA QII+FL  NA
Sbjct: 312 YRVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQIIAFLSTNA 371

Query: 113 HP-------RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYA 165
           HP            I SVP  V DQI+LWE +  R+    A  Y  F S   +     Y 
Sbjct: 372 HPITYAVTTEYGRMIQSVPITVADQIQLWEDERRRLTFCNATVYSSFESEKEYFGLKGYV 431

Query: 166 RDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
             ++ LLW    +  +V+  E H +++ + +
Sbjct: 432 ISQNILLWCHDVQRLMVITEEGHENVKAWWK 462


>gi|410082501|ref|XP_003958829.1| hypothetical protein KAFR_0H02850 [Kazachstania africana CBS 2517]
 gi|372465418|emb|CCF59694.1| hypothetical protein KAFR_0H02850 [Kazachstania africana CBS 2517]
          Length = 514

 Score =  116 bits (291), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 57/172 (33%), Positives = 97/172 (56%), Gaps = 18/172 (10%)

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
           ++G +++ETNF++Y+YS S L   IL LF  ++ +  N++ G IT+ES+  A  NGITA+
Sbjct: 336 RDGSLIIETNFKLYSYSNSPLQIAILSLFVHLKSRFVNMVTGQITRESIRRALVNGITAD 395

Query: 103 QIISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMT 144
           QII++L+ +AHPR+                   D +  +P  V DQI+LW+ +L+RV   
Sbjct: 396 QIIAYLETHAHPRMRRLAEEKLEKKLELDPNSKDTLQILPPTVVDQIKLWQLELDRVITY 455

Query: 145 PAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
               Y +F +   +     YA+D   L+W+D KK +  V+ E +  + ++ +
Sbjct: 456 DGSLYSDFENSTEYNMLSKYAQDIGVLIWKDDKKRKFFVSKEGNTQVLDYAK 507


>gi|294660113|ref|XP_002777731.1| DEHA2G23254p [Debaryomyces hansenii CBS767]
 gi|218511847|sp|Q6BGW8.2|TFB2_DEBHA RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|199434471|emb|CAR66042.1| DEHA2G23254p [Debaryomyces hansenii CBS767]
          Length = 515

 Score =  116 bits (290), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 73/254 (28%), Positives = 117/254 (46%), Gaps = 73/254 (28%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK----------------- 43
           M+ D  D GL+   Q       F PT+LAT L+   +DS+A K                 
Sbjct: 252 MLADLRDYGLI--YQRSENSERFYPTRLATTLT---SDSAALKSPSMAMEQALESTTETE 306

Query: 44  ----------EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYN 93
                     +G +++ETNF++YAY+ S L   IL LF  ++ +  N++ G IT+ES+ N
Sbjct: 307 EQQNLASNSNQGTIILETNFKIYAYTNSPLEIAILNLFVHLKTRFSNMVCGQITRESIRN 366

Query: 94  AFENGITAEQIISFLQQNAHPRVA------------------------------------ 117
           A  NGITA+QII FL+ +AH ++                                     
Sbjct: 367 ALYNGITADQIIKFLETHAHSQMKILAKERLDKKIEFDTSNNINTAGGAPQSQMMTNENG 426

Query: 118 -----DRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLL 172
                 ++  +P  V DQI+LW+ +L+R++    + +++F S+  FEA  +YA +   LL
Sbjct: 427 TTVAQHKLEVLPPTVVDQIKLWQLELDRIQTFEGYLFKDFASQQEFEALSNYASEIGVLL 486

Query: 173 WEDSKKMRLVVNAE 186
           W +  KM+  V  +
Sbjct: 487 WSNKSKMKFFVTKD 500


>gi|326435722|gb|EGD81292.1| hypothetical protein PTSG_11329 [Salpingoeca sp. ATCC 50818]
          Length = 545

 Score =  115 bits (287), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 64/188 (34%), Positives = 106/188 (56%), Gaps = 9/188 (4%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL--SMSLTDSSARKEGFVVVETNFRMYAY 58
           +++ F+ +GL   ++  +K   + PT+LA  L  S +L    A + GF+V+ETNFR+YAY
Sbjct: 247 VLRHFSKIGLCYHRK--KKSKQYYPTRLALALRTSSALVAKDAMRCGFLVIETNFRIYAY 304

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S L   +L+LF+K  Y+LP ++V +IT+ S+  A   GITA QI+ FL+ +   R+  
Sbjct: 305 TKSDLKISLLKLFAKPLYKLPTMLVASITRSSVRQAMAQGITARQILHFLKVHCSSRMRH 364

Query: 119 RIPS----VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWE 174
           +  S    VP  V DQI LWE + +R+  T    +   P + +F     + ++R   +W 
Sbjct: 365 QPHSVFSPVPPTVSDQILLWEQETSRLSKTEGVLFRHLP-KPIFPKLLAFVKERQACIWY 423

Query: 175 DSKKMRLV 182
           D  K  ++
Sbjct: 424 DETKRAII 431


>gi|405952750|gb|EKC20525.1| General transcription factor IIH subunit 4 [Crassostrea gigas]
          Length = 380

 Score =  115 bits (287), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 65/198 (32%), Positives = 108/198 (54%), Gaps = 25/198 (12%)

Query: 5   FADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD--SSARKEGFVVVETNFRMYAYSTSK 62
           F+ LG  K+Q+       + PT+LA NL+   +D  S  +  G+++VETN+R+YAY+ S 
Sbjct: 203 FSTLGKRKIQR-------YYPTRLAINLAAGQSDFTSVGKNTGYLMVETNYRVYAYTDSP 255

Query: 63  LHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPS 122
           L   ++ LF ++ Y+ P   VG +T+ S+ +A   GITA+Q                 P 
Sbjct: 256 LQVALVALFCEMLYRFPTFSVGNLTRVSVRDALIRGITADQT----------------PV 299

Query: 123 VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLV 182
           +P  V DQ+RLWE + +R +      Y++F S++ FE   DYA+D   LLW+++ K  +V
Sbjct: 300 IPSTVTDQVRLWELERDRFKFMEGVLYDQFLSQNDFELLRDYAKDLGVLLWDNAIKRVMV 359

Query: 183 VNAEIHMHMREFLRGQNK 200
           V    H  ++ + + Q +
Sbjct: 360 VTKGGHDDVKRYWKRQKQ 377


>gi|255721741|ref|XP_002545805.1| TFIIH basal transcription factor complex subunit 2 [Candida
           tropicalis MYA-3404]
 gi|240136294|gb|EER35847.1| TFIIH basal transcription factor complex subunit 2 [Candida
           tropicalis MYA-3404]
          Length = 494

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 70/245 (28%), Positives = 119/245 (48%), Gaps = 56/245 (22%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK---------------EG 45
           M+ D  D GLV   Q       F PT+LAT L+   +DS+A K               + 
Sbjct: 248 MLADLKDYGLV--YQRSDTSGRFYPTRLATTLT---SDSAALKTPAMAMDEEEQQVVTKE 302

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
            +++ETNF++YAY+ S L   IL LF +++ +  N++ G IT+ES+ NA  NGITA+QII
Sbjct: 303 SIIIETNFKLYAYTKSPLEIAILNLFVQLKTRFANMVCGQITRESIRNALYNGITADQII 362

Query: 106 SFLQQNAHPRV------------------------------------ADRIPSVPENVCD 129
            FL+ +AH ++                                      ++  +P NV D
Sbjct: 363 KFLETHAHSQMRILAKEKLDKKIEFDTSHNINTAGGAPQSKTDGSISQHKLEILPPNVVD 422

Query: 130 QIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHM 189
           QI+LW+ +L+R++    + +++F ++  ++   +YA +   L+W D  K +  V  +   
Sbjct: 423 QIKLWQLELDRIQTFDGYLFKDFANQQEYDILSNYASELGVLIWADKVKRKFFVTKDGMT 482

Query: 190 HMREF 194
            + +F
Sbjct: 483 QVADF 487


>gi|342884444|gb|EGU84659.1| hypothetical protein FOXB_04847 [Fusarium oxysporum Fo5176]
          Length = 496

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 68/236 (28%), Positives = 121/236 (51%), Gaps = 41/236 (17%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------------------------- 34
           M+    D GL+ +     K S F PT+LAT L+                           
Sbjct: 264 MLPSLVDFGLIYIPN--HKRSMFFPTRLATTLTSSSNSLRSISDGVAAATAAALQPGQSG 321

Query: 35  ----SLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKES 90
               S+T ++  + G V++ETN+R+YAY+ S L   +L LF+K+  + P+++ G I+++S
Sbjct: 322 APGGSVTAANTEQRGSVIIETNYRIYAYTQSTLQIAVLALFTKLAMRFPDMVAGRISRQS 381

Query: 91  LYNAFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
           +  A + GITAEQIIS+L  +AH ++      +  P +P  V DQIRLW+ +  R++ T 
Sbjct: 382 IRQAIQFGITAEQIISYLSAHAHDQMHRTAALNNKPVLPPTVVDQIRLWQLENERMKTTS 441

Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWE-DSKKMRLVVNAEIHMHMREFLRGQNK 200
              ++ F     ++    +A +   L+W+ D+++M     A  H  +R++++ + K
Sbjct: 442 GFLFKSFEDDREYKDIARFAEEVGVLVWKNDARQMFF---ASKHEQIRDYMKIRKK 494


>gi|401825125|ref|XP_003886658.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon hellem
           ATCC 50504]
 gi|395459803|gb|AFM97677.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon hellem
           ATCC 50504]
          Length = 414

 Score =  114 bits (284), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 109/197 (55%), Gaps = 16/197 (8%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR-KEGFVVVETNFRMYAYS 59
           ++K    LG++KL     KES  I         + L ++S R +  F++VETN ++YAY+
Sbjct: 230 LLKYLEALGILKLY----KESLAIGKSF-----VQLFEASERNRREFIIVETNNKIYAYT 280

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+    ++ LF  +   LPNLI G+IT+ES+  AF+ GIT +QII FL+ +  P     
Sbjct: 281 NSEYEKSVIHLFCSVSVNLPNLIKGSITEESVNVAFDKGITGKQIIHFLEASVKP----- 335

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKM 179
             S+P  +  QI +WES  NR+ M P + Y  F +   ++   ++  +RS L+  D  K 
Sbjct: 336 -GSLPPAISSQIMIWESKRNRIFMAPGYIYSNFLNLSDYQRVLEFCTERSYLIESDVDKR 394

Query: 180 RLVVNAEIHMHMREFLR 196
            +VV  E H  ++EF++
Sbjct: 395 MIVVKVEGHELVKEFIK 411


>gi|302917488|ref|XP_003052448.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256733388|gb|EEU46735.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 496

 Score =  114 bits (284), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 69/235 (29%), Positives = 116/235 (49%), Gaps = 39/235 (16%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS------LTDS--------------- 39
           M+    D GL+ +     K S F PT+LAT L+ S      ++D                
Sbjct: 264 MLPSLVDFGLIYIPN--HKRSMFFPTRLATTLTSSGNSLRSISDGVAAATAAALQPGQSG 321

Query: 40  ---------SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKES 90
                       + G V++ETN+R+YAY+ S L   +L LFSK+  + P+++ G ++++S
Sbjct: 322 GSGGSTTGSGQEQRGSVIIETNYRIYAYTQSTLQIAVLALFSKLSMRFPDMVAGRLSRQS 381

Query: 91  LYNAFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
           +  A   GITA+QIIS+L  +AH ++      +  P +P  V DQIRLW+ +  R++ T 
Sbjct: 382 IRQAINFGITADQIISYLAAHAHDQMHRTAALNNKPVLPPTVVDQIRLWQLENERMKTTS 441

Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
              +++F     + A   +A +   L+W +  K      A  H  +R++LR + K
Sbjct: 442 GFLFKDFEDHKEYMAVAGFAEEVGVLVWRNDVKGMFF--ASKHEQIRDYLRIRKK 494


>gi|320581720|gb|EFW95939.1| TFIIH basal transcription factor complex subunit 2 [Ogataea
           parapolymorpha DL-1]
          Length = 544

 Score =  113 bits (282), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 67/234 (28%), Positives = 116/234 (49%), Gaps = 41/234 (17%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD---------------SSARKEG 45
           M+ D  D GL+       K+  F PT+LAT L+   T+                ++  +G
Sbjct: 303 MLDDLIDYGLIYTPSKTNKK-IFYPTRLATTLTSEHTNFKTSAAVIDQEITNSKNSNNQG 361

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
            +VVETNF++Y Y++S L   IL LF  ++ +  N++ G IT+ES+  A  NGITA+Q+I
Sbjct: 362 TIVVETNFKIYCYTSSPLQIAILNLFVHLKARFANMVTGVITRESVRRALINGITADQMI 421

Query: 106 SFLQQNAHP----RVADRIPS---------------------VPENVCDQIRLWESDLNR 140
           ++L  +AHP    +  D++                       +P  V DQI+LW+ +L+R
Sbjct: 422 NYLGSHAHPWMVKQAEDKLARKLELESSIGNAAGQRQIDLEILPPTVVDQIKLWQLELDR 481

Query: 141 VEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREF 194
           ++    + Y++F +   FE    Y  +   ++W+D  + R  V  E +  + ++
Sbjct: 482 IQSFKGYLYKDFSTDLEFEKLLTYGEEIGVIVWKDRARKRFFVTQEGNGQLLDY 535


>gi|254584224|ref|XP_002497680.1| ZYRO0F11088p [Zygosaccharomyces rouxii]
 gi|238940573|emb|CAR28747.1| ZYRO0F11088p [Zygosaccharomyces rouxii]
          Length = 509

 Score =  113 bits (282), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 92/162 (56%), Gaps = 18/162 (11%)

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
           ++G ++VETNF++Y+YS S L   IL LF  ++ +  N++ G +T+ES+  A  NGITA+
Sbjct: 332 QDGALIVETNFKLYSYSNSPLQIAILSLFVHMKSRFSNMVTGQLTRESIRQALLNGITAD 391

Query: 103 QIISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMT 144
           QII++L+ +AHP++                   D +  +P  V DQI+LW+ +L+R+   
Sbjct: 392 QIIAYLETHAHPQMRRMAEEQLEKRLELDPNSKDPLQILPPTVVDQIKLWQLELDRIISF 451

Query: 145 PAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAE 186
               Y +F +   F     YA+D   L+W+D +K +  V+ E
Sbjct: 452 EGSLYSDFENHQEFTLLSSYAQDIGVLIWKDDRKKKFFVSKE 493


>gi|396080769|gb|AFN82390.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon
           romaleae SJ-2008]
          Length = 414

 Score =  113 bits (282), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 96/164 (58%), Gaps = 7/164 (4%)

Query: 34  MSLTDSSAR-KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLY 92
           + L ++S R +  F++VETN ++YAY+ S+    ++ LF  +   LPNLI G+IT+ES+ 
Sbjct: 254 VQLFEASERNRREFIIVETNNKIYAYTNSEYEKSVIHLFCNVSVSLPNLIKGSITEESVN 313

Query: 93  NAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
            AF+ GIT +QII FL+ +  P       S+P  +  QI +WES  NR+ M P + Y  F
Sbjct: 314 VAFDKGITGKQIIHFLEASVKP------GSLPPAITSQIMIWESKRNRIFMVPGYIYSNF 367

Query: 153 PSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
            +   ++   ++  +RS L+  D  K  +VV  E H  +REF++
Sbjct: 368 LNLSDYQRVLEFCTERSYLIESDVGKRMIVVKLEGHELVREFIK 411


>gi|448100135|ref|XP_004199281.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
 gi|359380703|emb|CCE82944.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
          Length = 509

 Score =  113 bits (282), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 75/256 (29%), Positives = 122/256 (47%), Gaps = 64/256 (25%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSL------TDSSARKE 44
           M+ D  D GL+  Q    KE  F PT+LAT L          SM+L       +S A +E
Sbjct: 252 MLADLKDYGLI-YQNHNDKER-FYPTRLATTLTSEAAALRTPSMALNQVVDTANSEATEE 309

Query: 45  -------GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 97
                  G +++ETNF++YAY+ S L   IL LF  ++ + PN++ G IT+ES+  A  N
Sbjct: 310 QMFSNTGGSIILETNFKIYAYTNSPLEIAILNLFVHLKTRFPNMVTGQITRESIRKALFN 369

Query: 98  GITAEQIISFLQQNAHPRVAD--------------------------------------- 118
           GITA+QII FL+ +AH ++                                         
Sbjct: 370 GITADQIIKFLESHAHSQMKQMAKENLDKKIEFDANNNINTATDSQQLSVGEFGSASSYH 429

Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
           ++  +P  V DQI+LW+ +L+R++    + +++F S+  ++A  +YA +   LLW D  +
Sbjct: 430 KLEILPPTVVDQIKLWQLELDRIQTFEGYLFKDFSSQQEYDALRNYASELGVLLWSDKNR 489

Query: 179 MRLVVNAEIHMHMREF 194
            +  V  +    + +F
Sbjct: 490 KKFFVTKDGMSQVADF 505


>gi|448103836|ref|XP_004200137.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
 gi|359381559|emb|CCE82018.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
          Length = 512

 Score =  112 bits (281), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 75/256 (29%), Positives = 121/256 (47%), Gaps = 64/256 (25%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSL------TDSSARKE 44
           M+ D  D GL+  Q    KE  F PT+LAT L          SM+L       +S A +E
Sbjct: 252 MLADLKDYGLI-YQNHNDKER-FYPTRLATTLTSEAAALRTPSMALNQVVDTANSEATEE 309

Query: 45  -------GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 97
                  G +++ETNF++YAY+ S L   IL LF  ++ + PN++ G IT+ES+  A  N
Sbjct: 310 QMFSNTGGSIILETNFKIYAYTNSPLEIAILNLFVHLKTRFPNMVTGQITRESIRKALFN 369

Query: 98  GITAEQIISFLQQNAHPRVAD--------------------------------------- 118
           GITA+QII FL+ +AH ++                                         
Sbjct: 370 GITADQIIKFLESHAHSQMKQMAKENLDKKIEFDANNNINTATDNQQLSVGEFGSANSYH 429

Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
           ++  +P  V DQI+LW+ +L+R++    + +++F S+  ++A   YA +   LLW D  +
Sbjct: 430 KLEVLPPTVVDQIKLWQLELDRIQTFEGYLFKDFSSQQEYDALRSYASELGVLLWSDKNR 489

Query: 179 MRLVVNAEIHMHMREF 194
            +  V  +    + +F
Sbjct: 490 KKFFVTKDGMSQVADF 505


>gi|340939354|gb|EGS19976.1| RNA polymerase II transcription factor B subunit 2-like protein
           [Chaetomium thermophilum var. thermophilum DSM 1495]
          Length = 514

 Score =  112 bits (281), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 69/232 (29%), Positives = 119/232 (51%), Gaps = 38/232 (16%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT----------------------- 37
           M+    D GL+ + +   ++  + PT+LAT L+ S +                       
Sbjct: 287 MLPALVDFGLIYIPREDTRQ--YFPTRLATTLTSSASALRSVSSGFTAATNNTANDASSL 344

Query: 38  ---DSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 94
              D SA K G +++ETN+R+YAY++S L   +L LF+ +  +   ++ G +T+ES+  A
Sbjct: 345 GGADPSAHK-GSIIIETNYRLYAYTSSPLQIAVLALFTHLNMRFAGMVTGRLTRESIRRA 403

Query: 95  FENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYY 149
              GITA+QIIS+L  +AH ++     A   P +P  V DQIRLW+ +  R+  +P   +
Sbjct: 404 ISFGITADQIISYLASHAHEQMVRAAAAAGRPVLPPTVVDQIRLWQLENERMRTSPGFLF 463

Query: 150 EEFPSRDVFEAACDYARDRSGLLW-EDSKKMRLVVNAEIHMHMREFLRGQNK 200
           ++F + + + A   YA +   L+W  D K+M      E    +R++L+ + K
Sbjct: 464 KDFENVEEYMALAGYAEEIGVLVWRSDRKRMFFASKFE---QLRDYLKSRKK 512


>gi|255713706|ref|XP_002553135.1| KLTH0D09790p [Lachancea thermotolerans]
 gi|238934515|emb|CAR22697.1| KLTH0D09790p [Lachancea thermotolerans CBS 6340]
          Length = 506

 Score =  112 bits (280), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 56/161 (34%), Positives = 93/161 (57%), Gaps = 18/161 (11%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           +G V++ETNF++Y+YS S L   +L LF  ++ +  N++ G IT+ES+  A  NGITA+Q
Sbjct: 330 DGAVILETNFKLYSYSNSPLQIAVLSLFVHLKSRFSNMVTGQITRESIRRALHNGITADQ 389

Query: 104 IISFLQQNAHP---RVADR---------------IPSVPENVCDQIRLWESDLNRVEMTP 145
           II++++ +AHP   R+A++               +  +P  V DQI+LW+ +L+R+    
Sbjct: 390 IIAYMETHAHPQMRRLAEQTLDKKMELDPNCNEGLQILPPTVVDQIKLWQLELDRIISYD 449

Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAE 186
            + + +F +   +     YA+D   LLW D KK R  V+ E
Sbjct: 450 GYLFTDFENFQEYNMLSSYAKDIGVLLWSDDKKKRFFVSQE 490


>gi|68490740|ref|XP_710812.1| hypothetical protein CaO19.13268 [Candida albicans SC5314]
 gi|68490763|ref|XP_710801.1| hypothetical protein CaO19.5846 [Candida albicans SC5314]
 gi|46432046|gb|EAK91553.1| hypothetical protein CaO19.5846 [Candida albicans SC5314]
 gi|46432058|gb|EAK91564.1| hypothetical protein CaO19.13268 [Candida albicans SC5314]
 gi|238879997|gb|EEQ43635.1| TFIIH basal transcription factor complex subunit 2 [Candida
           albicans WO-1]
          Length = 494

 Score =  112 bits (280), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 69/224 (30%), Positives = 111/224 (49%), Gaps = 52/224 (23%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSLTDSS---ARKEGFV 47
           M+ D  D GLV   Q       F PT+LAT L          SM++ +     A KE  +
Sbjct: 248 MLADLKDYGLV--YQRSDTSGRFYPTRLATTLTSDSAALKTPSMAMDEEEQQVATKES-I 304

Query: 48  VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
           ++ETNF++YAY+ S L   IL LF   + +  N++ G IT+ES+ NA  NGITA+QII F
Sbjct: 305 IIETNFKIYAYTKSPLEIAILNLFVHFKTRFANMVCGQITRESIRNALYNGITADQIIKF 364

Query: 108 LQQNAHPRV------------------------------------ADRIPSVPENVCDQI 131
           L+ +AHP++                                      ++  +P NV DQI
Sbjct: 365 LETHAHPQMRILAKEKLDKKIEFDTSHNINTAGGAPQSKTDGMVSQHKLEILPPNVVDQI 424

Query: 132 RLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWED 175
           +LW+ +L+R++    + +++F ++  ++   +YA +   L+W D
Sbjct: 425 KLWQLELDRIQTFDGYLFKDFSNQQEYDILSNYASELGVLIWAD 468


>gi|241958748|ref|XP_002422093.1| TFIIH subunit, putative; tanscription initiation factor IIB,
           putative [Candida dubliniensis CD36]
 gi|223645438|emb|CAX40094.1| TFIIH subunit, putative [Candida dubliniensis CD36]
          Length = 494

 Score =  112 bits (280), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 68/226 (30%), Positives = 111/226 (49%), Gaps = 56/226 (24%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK---------------EG 45
           M+ D  D GLV   Q       F PT+LAT L+   +DS+A K               + 
Sbjct: 248 MLADLKDYGLV--YQRSDTSGRFYPTRLATTLT---SDSAALKTPSMAMDEEEQQVVSKE 302

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
            +++ETNF++YAY+ S L   IL LF   + +  N++ G IT+ES+ NA  NGITA+QII
Sbjct: 303 SIIIETNFKIYAYTKSPLEIAILNLFVHFKTRFANMVCGQITRESIRNALYNGITADQII 362

Query: 106 SFLQQNAHPRV------------------------------------ADRIPSVPENVCD 129
            FL+ +AHP++                                      ++  +P NV D
Sbjct: 363 KFLETHAHPQMRILAKEKLDKKIEFDTSHNINTAGGAPQSKTDGMISQHKLEILPPNVVD 422

Query: 130 QIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWED 175
           QI+LW+ +L+R++    + +++F ++  ++   +YA +   L+W D
Sbjct: 423 QIKLWQLELDRIQTFDGYLFKDFSNQQEYDILSNYASELGVLIWAD 468


>gi|303388087|ref|XP_003072278.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon
           intestinalis ATCC 50506]
 gi|303301417|gb|ADM10918.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon
           intestinalis ATCC 50506]
          Length = 414

 Score =  112 bits (280), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 56/164 (34%), Positives = 99/164 (60%), Gaps = 7/164 (4%)

Query: 34  MSLTDSSAR-KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLY 92
           + L ++S R +  F++VETN ++YAY+ S+    ++ LF  + + LPNL  G+IT+ES+ 
Sbjct: 254 VQLFEASERNRREFIIVETNNKIYAYTNSEYEKSVIHLFCNVSFNLPNLTKGSITEESVN 313

Query: 93  NAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
            AF+ GIT  QII FL+ ++ P       S+P  + +QI +WES  NR+ M P + Y  F
Sbjct: 314 AAFDKGITGRQIIHFLEASSKP------GSLPPAIINQIIIWESKRNRIFMAPGYLYSNF 367

Query: 153 PSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
            +   ++   ++  +R+ L+  D  +  +VVN + H+ ++EF++
Sbjct: 368 LNLSDYQKVLEFCSERNYLIESDIDRRMIVVNPKGHVFVKEFIK 411


>gi|389642093|ref|XP_003718679.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
           oryzae 70-15]
 gi|351641232|gb|EHA49095.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
           oryzae 70-15]
 gi|440469172|gb|ELQ38293.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
           oryzae Y34]
 gi|440481925|gb|ELQ62459.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
           oryzae P131]
          Length = 490

 Score =  112 bits (280), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 65/201 (32%), Positives = 114/201 (56%), Gaps = 25/201 (12%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL--------SMSLTDSSARKE-------G 45
           M+ D  D GLV +      +  + PT+LAT L        S+S   ++A  E       G
Sbjct: 269 MLPDLNDFGLVYIPPSNPDQ--YFPTRLATTLTSGSSALRSVSSGVAAATAEAGENNTKG 326

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
            +++ETNFR+YAY+++ L   IL LF+ ++ +   ++ G +++ S+  A  +GITA+QII
Sbjct: 327 AIILETNFRIYAYTSTPLQIAILALFANLKMRFAGMVTGQLSRHSIKRAISHGITADQII 386

Query: 106 SFLQQNAHPRVADRI------PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFE 159
            +L  +AH ++  RI      P +P  V DQIRLW+ +  R+++   + +++F S+  F+
Sbjct: 387 EYLASHAHEQM-HRIAAIRNKPVLPPTVVDQIRLWQLETERMQVQRGYLFKDFESQAEFK 445

Query: 160 AACDYARDRSGLLW-EDSKKM 179
           A  DYA +   L+W  D++++
Sbjct: 446 AIADYADEVGVLIWRSDARQL 466


>gi|392574410|gb|EIW67546.1| hypothetical protein TREMEDRAFT_33390 [Tremella mesenterica DSM
           1558]
          Length = 504

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 66/214 (30%), Positives = 113/214 (52%), Gaps = 16/214 (7%)

Query: 2   IKDFADLGLV--KLQQVGRKESWFIPTKLATNL------------SMSLTDSSARKEGFV 47
           + D    GL+       G K  +F PT L+T+L            +  +  S    + F+
Sbjct: 287 LDDLQSYGLIYRPATPSGAKADYFFPTHLSTSLCSGNSALSAALNTTEIDTSHEDDKKFL 346

Query: 48  VVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
           ++ETN+++YAY+ ++L   IL LF  I+ Q  NL+VG + +  +  A E GI+A QII++
Sbjct: 347 ILETNYKIYAYTNNELEIAILNLFMDIKVQYRNLVVGKLDRAHVKAAMEKGISAYQIIAY 406

Query: 108 LQQNAHPRVADRIPSV-PENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR 166
           LQ +AHP++    P +    + DQ+ LW+ + NR+    +  +E F S+D++E     A+
Sbjct: 407 LQSHAHPQMYSSPPPILHTTIVDQLHLWDKERNRLRTEESEMFEFF-SKDLYEDTEAEAK 465

Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
              GLL    K   L VN  +   +++F++GQ +
Sbjct: 466 RYDGLLLAVPKDKLLFVNPSVKDAIKDFVKGQQR 499


>gi|453082965|gb|EMF11011.1| RNA polymerase II transcription factor B subunit 2 [Mycosphaerella
           populorum SO2202]
          Length = 507

 Score =  112 bits (279), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 64/192 (33%), Positives = 103/192 (53%), Gaps = 29/192 (15%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM---SLTDSS----------------- 40
           M+ D  + GLV  +        + PT+LAT L+    +L ++S                 
Sbjct: 259 MLDDLTNFGLVYRRSPSSDR--YYPTRLATTLTSDAPALPNNSFTTTTTTNQNADPNDPA 316

Query: 41  --ARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
             A ++G++++ETN+R+YAY++S L   IL LF+ +  + PNLI   ITK S++ A  +G
Sbjct: 317 ATASEKGYIILETNYRLYAYTSSPLPISILSLFASLNTRYPNLITAKITKTSIHTAIASG 376

Query: 99  ITAEQIISFLQQNAHP-----RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFP 153
           IT++QIIS+LQ +AHP        +  P +P  V DQIRLW+ +  R++ T  +   +  
Sbjct: 377 ITSDQIISYLQTHAHPILRRQAALNSAPILPPTVVDQIRLWQIEGERMKSTKGYLIRDVG 436

Query: 154 SRDVFEAACDYA 165
           + D +  A  YA
Sbjct: 437 TEDDYTKAVQYA 448


>gi|340518979|gb|EGR49219.1| trancription factor tfb2-like protein [Trichoderma reesei QM6a]
          Length = 480

 Score =  112 bits (279), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 64/220 (29%), Positives = 110/220 (50%), Gaps = 24/220 (10%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS---------------MSLTDSSARKEG 45
           M+    D GL+ + Q   K S F PT+LAT L+               +     +  + G
Sbjct: 263 MLPSLVDFGLIYIPQ--HKRSMFFPTRLATTLTSGGNSLRTISEGSLGLLGGGGTGEQAG 320

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
            VV+ETN+R+YAY+ S L   +L LF+K+  + P+++ G +++ S+  A   GITA+QII
Sbjct: 321 SVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQAINFGITADQII 380

Query: 106 SFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEA 160
           S+L  +AH ++         P +P  V DQIRLW+ +  R++ T    + +F     +  
Sbjct: 381 SYLSAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFTDDKDYLD 440

Query: 161 ACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
              ++ +   L+W +        N   H  +++FL+ + K
Sbjct: 441 TARFSEEIGVLVWRNDNTRMFFANK--HEQIKDFLKSRKK 478


>gi|449296697|gb|EMC92716.1| hypothetical protein BAUCODRAFT_259506 [Baudoinia compniacensis
           UAMH 10762]
          Length = 489

 Score =  111 bits (278), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 63/195 (32%), Positives = 106/195 (54%), Gaps = 26/195 (13%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT----------------------- 37
           M+ D +D GLV  +     E+ + PT+LAT L+                           
Sbjct: 260 MLDDLSDFGLVYRRS--NDEARYYPTRLATTLTSDAPALPNTSLTSTTSTTTLASSSTDP 317

Query: 38  DSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 97
            +SA ++G++++ETN+R+YAY++S L   IL LF+ ++ + PNLI   +TK S+ +A  +
Sbjct: 318 SASANEKGYIILETNYRLYAYTSSPLPITILSLFATLKTRYPNLITAKLTKSSIQSAIAS 377

Query: 98  GITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDV 157
           GIT++QII++L  +AHP +  + P +P  V DQIRLW+ +  R+         +  + + 
Sbjct: 378 GITSDQIITYLTTHAHPILRRQNPVLPPTVVDQIRLWQIEGERMTAWKGFLIRDVGTAEE 437

Query: 158 FEAACDYARDRSGLL 172
           ++ A  YA D  G+L
Sbjct: 438 YDKAVQYA-DALGVL 451


>gi|367030175|ref|XP_003664371.1| hypothetical protein MYCTH_2307126 [Myceliophthora thermophila ATCC
           42464]
 gi|347011641|gb|AEO59126.1| hypothetical protein MYCTH_2307126 [Myceliophthora thermophila ATCC
           42464]
          Length = 512

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 66/232 (28%), Positives = 121/232 (52%), Gaps = 36/232 (15%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------------------------- 34
           M+    D GL+ + +   ++  + PT+LAT L+                           
Sbjct: 283 MLPALVDFGLIYIPREDTRQ--YFPTRLATTLTSSASSLRSVSSGFSAAAANNPGDASSL 340

Query: 35  -SLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYN 93
            +  ++SA  +G V++ETN+R+YAY++S L   +L LF+++  +   +  G +T++S+  
Sbjct: 341 GTTPETSAASKGSVIIETNYRLYAYTSSPLQIAVLALFTQLNMRFAGMATGRLTRDSIRR 400

Query: 94  AFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHY 148
           A   GITA+QIIS+L  +AH ++     A   P +P  V DQIRLW+ +  R+  +    
Sbjct: 401 AIGFGITADQIISYLASHAHEQMVRAAAATGRPVLPPTVVDQIRLWQLENERMRTSAGFL 460

Query: 149 YEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
           +++F S D + +   YA +   L+W+  KK R+   ++I   +R++LR + +
Sbjct: 461 FKDFDSLDEYLSLSAYAEEIGVLVWKSDKK-RMFYASKIE-QLRDYLRSRKR 510


>gi|401887601|gb|EJT51582.1| hypothetical protein A1Q1_07170 [Trichosporon asahii var. asahii
           CBS 2479]
 gi|406699646|gb|EKD02845.1| hypothetical protein A1Q2_02789 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 451

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 119/207 (57%), Gaps = 14/207 (6%)

Query: 2   IKDFADLGLV-KLQQV-GRKESWFIPTKLATNLSMSLTD----SSARKEGFVVVETNFRM 55
           + D    G + + Q+V GRK   F PT LAT+L    T     ++A  + F+++ETN+R+
Sbjct: 237 LDDLEQYGFIYRRQEVDGRKTDQFFPTHLATSLCSGDTSVNRSATADDKRFLILETNYRV 296

Query: 56  YAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 115
           YAY++++L   IL LF  I  + PNLIVG + +  +  A + GI+A QIIS+L  +AHP+
Sbjct: 297 YAYTSNELEIAILNLFVNITIRYPNLIVGHLDRRHVKAAMDKGISAYQIISYLTTHAHPQ 356

Query: 116 VADRIPS-VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWE 174
           + +  P  +  +V DQ+ LW+ + NRV+      + EF S+++FE   D A +R G L  
Sbjct: 357 MYNHPPPLLHPSVTDQLHLWDRERNRVQAQDTVMF-EFYSKELFETCRDEA-ERMGALQH 414

Query: 175 DSKKMR-----LVVNAEIHMHMREFLR 196
             ++ +     + V+  I   +R+F++
Sbjct: 415 SVQRGQGMAKLIFVDPTIRESLRDFIK 441


>gi|367009216|ref|XP_003679109.1| hypothetical protein TDEL_0A05660 [Torulaspora delbrueckii]
 gi|359746766|emb|CCE89898.1| hypothetical protein TDEL_0A05660 [Torulaspora delbrueckii]
          Length = 513

 Score =  110 bits (276), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 57/172 (33%), Positives = 95/172 (55%), Gaps = 18/172 (10%)

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
           ++G ++VETNF++Y+YS S L   IL LF  ++ +  N++ G IT+ES+  A  NGITA+
Sbjct: 332 QDGALIVETNFKLYSYSNSPLQIAILSLFVHLKSRFTNMVTGQITRESIRRALINGITAD 391

Query: 103 QIISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMT 144
           QII++LQ +AHP++                   D +  +P  V DQI+LW+ +L+R+   
Sbjct: 392 QIIAYLQTHAHPQMRRLAEEKLEKKLELDANCKDTLQILPPTVVDQIKLWQLELDRIISY 451

Query: 145 PAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
               Y +  S   +     YA+D   LLW+D +K    V+ E +  + ++ +
Sbjct: 452 DGSLYSDIDSHQEYILLSTYAQDIGVLLWKDDRKRIFFVSKEGNSQVLDYAK 503


>gi|358385986|gb|EHK23582.1| hypothetical protein TRIVIDRAFT_36912 [Trichoderma virens Gv29-8]
          Length = 492

 Score =  110 bits (276), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 65/232 (28%), Positives = 111/232 (47%), Gaps = 36/232 (15%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMS------------------------- 35
           M+    D GL+ + Q   K S F PT+LAT L+                           
Sbjct: 263 MLPSLVDFGLIYIPQ--HKRSMFFPTRLATTLTSGGNSLRTISEGVTAATQSAQTSQQAL 320

Query: 36  --LTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYN 93
             L  S   + G VV+ETN+R+YAY+ S L   +L LF+K+  + P+++ G +++ S+  
Sbjct: 321 GPLGGSGGEQAGSVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQ 380

Query: 94  AFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHY 148
           A   GITA+QIIS+L  +AH ++         P +P  V DQIRLW+ +  R++ T    
Sbjct: 381 AINFGITADQIISYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFL 440

Query: 149 YEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
           + +F     +     ++ +   L+W++        N   H  +++FL+ + +
Sbjct: 441 FRDFTDDKDYLDTARFSEEIGVLVWKNDHARMFFANK--HEQIKDFLKTRKR 490


>gi|367040709|ref|XP_003650735.1| hypothetical protein THITE_2110518 [Thielavia terrestris NRRL 8126]
 gi|346997996|gb|AEO64399.1| hypothetical protein THITE_2110518 [Thielavia terrestris NRRL 8126]
          Length = 513

 Score =  110 bits (275), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 64/231 (27%), Positives = 118/231 (51%), Gaps = 35/231 (15%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL--------------------------SM 34
           M+    D GL+ + +   ++  + PT+LAT L                          S+
Sbjct: 285 MLPALVDFGLIYIPREDTRQ--YFPTRLATTLTSSASALRSVSSGFSAAAANNPGDAASL 342

Query: 35  SLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 94
            +T  +   +G +++ETN+R+YAY++S L   +L LF+++  +   ++ G +T++S+  A
Sbjct: 343 GMTPDTTPTKGSLIIETNYRLYAYTSSPLQIAVLALFTQLNMRFAGMVTGRLTRDSIRRA 402

Query: 95  FENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYY 149
              GITA+QIIS+L  +AH ++     A   P +P  V DQIRLW+ +  R+  T    +
Sbjct: 403 ISFGITADQIISYLASHAHEQMTRAAAAAGRPVLPPTVVDQIRLWQLENERMRTTAGFLF 462

Query: 150 EEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
           ++F S + + +   YA +   L+W+  +K   V  A     +R++L+ + K
Sbjct: 463 KDFDSPEEYVSLSGYAEEIGVLVWKSDRKR--VFFASKFEQLRDYLKSRKK 511


>gi|406604717|emb|CCH43852.1| RNA polymerase II transcription factor B subunit 2 [Wickerhamomyces
           ciferrii]
          Length = 488

 Score =  110 bits (274), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 66/227 (29%), Positives = 112/227 (49%), Gaps = 43/227 (18%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM----------------------SLTD 38
           M+ D  D GL+  ++   +   F PT+LAT L+                       + T 
Sbjct: 247 MLGDLRDYGLIYQRKSTSRR--FYPTRLATTLTSDTTSLRSASSAMNKVIENAKDSAYTP 304

Query: 39  SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
           ++    G +++ETNF++YAY+ S L   IL LF  ++ +  NL+ G IT+ES+  A  +G
Sbjct: 305 TNIENSGTIIIETNFKVYAYTNSPLQIAILNLFVHLKARFSNLVTGQITRESIRKALVSG 364

Query: 99  ITAEQIISFLQQNAHPRV-------------------ADRIPSVPENVCDQIRLWESDLN 139
           IT+EQIIS+L+ +AHP++                    ++I  +   + DQI+LW+ +L+
Sbjct: 365 ITSEQIISYLESHAHPQLRRAAEEELNKKNGFESNNHGEKIQILQPTIADQIKLWQLELD 424

Query: 140 RVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAE 186
           R+     + + +F S   ++    Y+ +   LLW DS K +  V  E
Sbjct: 425 RIMSFDGYLFTDFSSDQEYQVLSKYSEEIGVLLWNDSSKKKFFVTKE 471


>gi|444320663|ref|XP_004180988.1| hypothetical protein TBLA_0E04140 [Tetrapisispora blattae CBS 6284]
 gi|387514031|emb|CCH61469.1| hypothetical protein TBLA_0E04140 [Tetrapisispora blattae CBS 6284]
          Length = 518

 Score =  110 bits (274), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 53/165 (32%), Positives = 91/165 (55%), Gaps = 18/165 (10%)

Query: 37  TDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 96
           +++ A  +G +++ETNF++Y+YS S L   +L LF  ++ +  N++ G IT++S+  A  
Sbjct: 321 SNTQATADGALIIETNFKLYSYSNSPLQIAVLSLFVHLKTRFSNMVTGQITRDSIRRALR 380

Query: 97  NGITAEQIISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDL 138
           NGITAEQII++L  ++HP++                   D +  +P  V DQI+LW+ +L
Sbjct: 381 NGITAEQIIAYLLTHSHPQMRRLAEGNLEKKLELDPNAKDSLQILPPTVVDQIKLWQLEL 440

Query: 139 NRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVV 183
           +R+       Y +F +   +     YA D   LLW++ KK +  V
Sbjct: 441 DRIISHEGSLYSDFENNQEYNLLSTYAEDIGVLLWKNDKKRKFFV 485


>gi|430812024|emb|CCJ30551.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 319

 Score =  110 bits (274), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 65/186 (34%), Positives = 98/186 (52%), Gaps = 21/186 (11%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
           ++++  D G+V  ++  R+   F PT+LAT L+       +  E  +             
Sbjct: 139 VLENLKDYGIVYRRKSSRR---FYPTRLATILT-------SNSEILI-----------RH 177

Query: 61  SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
           S L   +L LF  +  +  NL+VG IT+ S+  AF NGITAEQIIS+L   AHP++    
Sbjct: 178 SPLQIAVLNLFVHLYLRFSNLVVGVITRNSVRQAFMNGITAEQIISYLTSYAHPQMKKNT 237

Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
           P +P  V DQIRLWE + NR++ T  + + +F S   FE A  YA + + ++WE   K  
Sbjct: 238 PIIPPTVNDQIRLWEMERNRLKATEGYLFRDFNSNSDFELALKYANELNVVVWEAPSKRV 297

Query: 181 LVVNAE 186
             VN +
Sbjct: 298 FFVNIQ 303


>gi|223993063|ref|XP_002286215.1| transcription factor of TFIIH family [Thalassiosira pseudonana
           CCMP1335]
 gi|220977530|gb|EED95856.1| transcription factor of TFIIH family [Thalassiosira pseudonana
           CCMP1335]
          Length = 461

 Score =  109 bits (272), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 74/195 (37%), Positives = 104/195 (53%), Gaps = 27/195 (13%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSS----------------ARKE 44
           +++DF+  GL+ + +V  K S F PT+ A NL  S   +                 +R  
Sbjct: 272 LMRDFSRFGLLFVCRVAGKNS-FYPTRAAVNLVASSEKAGRQASDLMGQTLAAPVPSRSH 330

Query: 45  GFVVVETNFRMYAYSTSKLHCEILRLFSKIEY--QLPNLIVGAITKESLYNAFENGITAE 102
             VVV+TNF++ AY+ SKLH   L LF  +    +LPN+I   IT++S+ +AF  G+TA+
Sbjct: 331 LAVVVQTNFQVVAYTKSKLHISTLGLFCDVSSFRRLPNVIFFHITRDSIRSAFRLGVTAD 390

Query: 103 QIISFLQQNAHP--RVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDV--F 158
           QI+ FL  +AHP  R  DR P VP NV DQI LW+ + +RV M     ++    RD   F
Sbjct: 391 QILRFLHVHAHPMLRSGDR-PMVPSNVVDQILLWDRERHRVVMDEVCVHQ---CRDAAEF 446

Query: 159 EAACDYARDRSGLLW 173
            A   YA D   L W
Sbjct: 447 TAVSQYASDVDALAW 461


>gi|323306981|gb|EGA60265.1| Tfb2p [Saccharomyces cerevisiae FostersO]
          Length = 448

 Score =  109 bits (272), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 87/153 (56%), Gaps = 18/153 (11%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           +G ++VETNF++Y+YS S L   +L LF  ++ +  N+++G IT+ES+  A  NGITA+Q
Sbjct: 287 DGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQ 346

Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
           II++L+ +AHP++                   + +  +P  V DQIRLW+ +L+RV    
Sbjct: 347 IIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYE 406

Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
              Y +F +   +     YA+D   LLW+D  K
Sbjct: 407 GSLYSDFETSQEYNLLSKYAQDIGVLLWKDDXK 439


>gi|346323394|gb|EGX92992.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Cordyceps militaris CM01]
          Length = 511

 Score =  108 bits (271), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 71/244 (29%), Positives = 118/244 (48%), Gaps = 48/244 (19%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------SLTDS-------------- 39
           M+   AD GLV +    +K S F PT+LAT L+        S++D               
Sbjct: 270 MLPALADFGLVYIPP--QKRSMFFPTRLATTLTSGASSGLRSISDGVAAATSAARSSPSS 327

Query: 40  ------------------SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNL 81
                             S  + G VV+ETN+R+YAY+ S L   +L LF+++  + P++
Sbjct: 328 SAHQPASSSAGSGPLGGGSGDQPGAVVIETNYRLYAYTQSTLQIAVLALFARLTMRFPDM 387

Query: 82  IVGAITKESLYNAFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWES 136
           + G +T+ S+  A + GITA+QII++L  +AH ++         P +P  V DQIRLW+ 
Sbjct: 388 VAGRLTRTSIRQAIQFGITADQIIAYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQL 447

Query: 137 DLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
           +  R++ TP   + +F     +     +A +   L+W D KK   +  A     +R++L+
Sbjct: 448 ENERMKTTPGFLFRDFDDPKEYHDTRRFADEIGVLVWSDDKKG--LFFASKFEQIRDYLK 505

Query: 197 GQNK 200
            + K
Sbjct: 506 SRKK 509


>gi|320587960|gb|EFX00435.1| tfiih and nucleotide excision repair factor 3 complexes subunit
           [Grosmannia clavigera kw1407]
          Length = 518

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 56/161 (34%), Positives = 92/161 (57%), Gaps = 7/161 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
           G V++ETNFR+YAY+ S L   +L LFSK+  + P ++ G +T++S+  A   GIT++QI
Sbjct: 356 GSVILETNFRLYAYTASPLQIAVLALFSKLSQRFPGMVAGKLTRDSVRRAISYGITSDQI 415

Query: 105 ISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFE 159
           IS+L  NAH ++     A+  P +P  V DQIRLWE +  R++      +++F S   ++
Sbjct: 416 ISYLAANAHEQMHKYAAANHRPVLPPTVVDQIRLWELENERMKSHKGFLFKDFDSAKEYD 475

Query: 160 AACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
               YA +   L++ + KK    V    H  +  +L+ + K
Sbjct: 476 MLAQYADEIGVLVYRNDKKRHFFVTK--HEQLVTYLKARKK 514


>gi|116201905|ref|XP_001226764.1| hypothetical protein CHGG_08837 [Chaetomium globosum CBS 148.51]
 gi|88177355|gb|EAQ84823.1| hypothetical protein CHGG_08837 [Chaetomium globosum CBS 148.51]
          Length = 503

 Score =  108 bits (270), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 65/232 (28%), Positives = 120/232 (51%), Gaps = 36/232 (15%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-------------------------- 34
           M+    D GL+ + +   ++  + PT+LAT L+                           
Sbjct: 274 MLPHLVDFGLIFIPREDTRQ--YFPTRLATTLTSSASSLRSVSSGFNAATANNPGDTSSL 331

Query: 35  -SLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYN 93
            +  ++SA  +G V++ETN+R+YAY++S L   +L LF+++  +   +  G +T++S+  
Sbjct: 332 GTTPETSAASKGSVIIETNYRLYAYTSSPLQIAVLGLFTQLNMRFAGMATGRLTRDSIRR 391

Query: 94  AFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHY 148
           A   GITA+QIIS+L  +AH ++     A   P +P  V DQIRLW+ +  R+  +    
Sbjct: 392 AIGFGITADQIISYLASHAHDQMVRAAAATGRPVLPPTVVDQIRLWQLENERMRTSAGFL 451

Query: 149 YEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
           +++F S + + +   YA +   L+W   KK R+   ++I   +R++L+ + K
Sbjct: 452 FKDFDSLEEYVSLSSYADEVGVLVWRSDKK-RMFYASKIE-QLRDYLKSRKK 501


>gi|308460920|ref|XP_003092758.1| hypothetical protein CRE_20976 [Caenorhabditis remanei]
 gi|308252517|gb|EFO96469.1| hypothetical protein CRE_20976 [Caenorhabditis remanei]
          Length = 128

 Score =  108 bits (270), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 51/117 (43%), Positives = 78/117 (66%), Gaps = 3/117 (2%)

Query: 83  VGAITKESLYNAFENGITAEQIISFLQQNAHPR---VADRIPSVPENVCDQIRLWESDLN 139
           VG IT+ES+  A ++GITA QIISFL+ NAHP+    +  I  +P  V DQIRLWE +  
Sbjct: 3   VGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGAINCLPITVADQIRLWEDERR 62

Query: 140 RVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
           R+++  A+ Y  F S D F+  CDYA++R  LLW ++++  ++VN E H ++R++ +
Sbjct: 63  RMDLKDAYIYSHFESEDEFQGVCDYAQERGILLWANAQQKLVIVNEEGHEYVRQWYK 119


>gi|361130709|gb|EHL02459.1| putative RNA polymerase II transcription factor B subunit 2 [Glarea
           lozoyensis 74030]
          Length = 210

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/206 (30%), Positives = 108/206 (52%), Gaps = 33/206 (16%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE---------------- 44
           M+++  D GLV +    + +  F PT+LAT L+   +D+SA +                 
Sbjct: 1   MLENLVDFGLVYIPPSVKDQ--FFPTRLATTLT---SDASALRSITAGFESALSTGSGTA 55

Query: 45  GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
           GF+++ETN+       S L   +L LF+K+  + PN++ G +T+ES+Y A  +GI+++QI
Sbjct: 56  GFIIIETNYH------SPLQIAVLALFTKLTTRYPNMVTGRVTRESVYRAVSSGISSDQI 109

Query: 105 ISFLQQNAHPRVADRI------PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVF 158
           I++L  +AHP +          P +P  V DQIRLW+ +  R++ T    +++F S D +
Sbjct: 110 ITYLSTHAHPELLKASAAKGGGPVLPPTVVDQIRLWQIENERMKTTYGFLFKDFESTDEY 169

Query: 159 EAACDYARDRSGLLWEDSKKMRLVVN 184
           +    YA +   L W    + +  V+
Sbjct: 170 KKLLKYADEIGVLTWASKTEEKFFVS 195


>gi|405123868|gb|AFR98631.1| transcription factor TFIIH complex subunit Tfb2 [Cryptococcus
           neoformans var. grubii H99]
          Length = 481

 Score =  107 bits (267), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 60/200 (30%), Positives = 111/200 (55%), Gaps = 8/200 (4%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL----SMSLTDSSARKEGFVVVETNFRMYA 57
           + D A  G +     G  + W  PT LAT+L    + ++   SA  + F+++ETN+++YA
Sbjct: 275 LDDLAQYGFIYKSSPGSDQFW--PTHLATSLCSGDASAIQSQSADDKRFLILETNYKIYA 332

Query: 58  YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
           Y++++L   IL LF  I  + PNL+VG + ++ +  A E GI+A QII++L  +AHP++ 
Sbjct: 333 YTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISAGQIIAYLSSHAHPQMY 392

Query: 118 DRIPS-VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
           +  P  +   + DQ+ LW+ + NR++      YE F S+++F+   + A+  + L    +
Sbjct: 393 NSPPPLLHPTIVDQLHLWDRERNRLQTEETVMYEFF-SKELFDDTVNEAKANAALQHAAT 451

Query: 177 KKMRLVVNAEIHMHMREFLR 196
            +  L +       + EF++
Sbjct: 452 SQKLLFIEPHTKPAITEFVK 471


>gi|401623322|gb|EJS41426.1| tfb2p [Saccharomyces arboricola H-6]
          Length = 513

 Score =  107 bits (267), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 58/171 (33%), Positives = 97/171 (56%), Gaps = 18/171 (10%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           +G ++VETNF++Y+YS S L   +L LF  ++ +  N+++G IT+ES+  A  NGITAEQ
Sbjct: 336 DGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITAEQ 395

Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
           II++L+ +AHP++                   + +  +P  V DQIRLW+ +L+RV    
Sbjct: 396 IIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVMTYE 455

Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
              Y +F +   +     YA+D   LLW+D KK +  ++ E +  + +F +
Sbjct: 456 GSLYSDFETSQEYNVLNKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDFAK 506


>gi|355693906|gb|AER99491.1| proteinral transcription factor IIH, polypeptide 4, 52kDa [Mustela
           putorius furo]
          Length = 143

 Score =  106 bits (264), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 52/139 (37%), Positives = 84/139 (60%)

Query: 61  SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
           S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  + 
Sbjct: 2   SELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQT 61

Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
           P +P  + DQIRLWE + +R+  T    Y +F S+  FE    +AR+   L++E+S K  
Sbjct: 62  PVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRL 121

Query: 181 LVVNAEIHMHMREFLRGQN 199
           +VV    H  ++ F + Q 
Sbjct: 122 MVVTPAGHSDVKRFWKRQK 140


>gi|429962184|gb|ELA41728.1| hypothetical protein VICG_01232 [Vittaforma corneae ATCC 50505]
          Length = 449

 Score =  106 bits (264), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 54/155 (34%), Positives = 85/155 (54%), Gaps = 8/155 (5%)

Query: 46  FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQII 105
           F+V+ETNF++YAY+++     +L LFSK  Y  PNLI     +ESL +AF  GITA+QII
Sbjct: 303 FIVLETNFKIYAYTSTAYDKSVLSLFSKTVYVFPNLIKACFDEESLLSAFNKGITAKQII 362

Query: 106 SFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYA 165
            +LQ+++          VP+N+ +QI +WE   +R+     + Y +F     F     Y 
Sbjct: 363 KYLQEHSE--------EVPKNIVNQISIWEHRQHRIRARNGYLYHDFIHLSDFHRVLRYV 414

Query: 166 RDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
             + GL++ D  K  +V    IH  ++ F++   K
Sbjct: 415 ESKGGLIYRDEVKRMIVGEERIHESVKNFIKEMQK 449


>gi|6325135|ref|NP_015203.1| Tfb2p [Saccharomyces cerevisiae S288c]
 gi|62901134|sp|Q02939.1|TFB2_YEAST RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=General transcription and DNA repair
           factor IIH subunit TFB2; Short=TFIIH subunit TFB2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|1163092|gb|AAB68240.1| Tfb2p: TFIIH subunit [Saccharomyces cerevisiae]
 gi|1778059|gb|AAB40628.1| transcription/repair factor TFIIH subunit Tfb2, partial
           [Saccharomyces cerevisiae]
 gi|151942676|gb|EDN61022.1| transcription initiation factor TFIIH subunit [Saccharomyces
           cerevisiae YJM789]
 gi|190407835|gb|EDV11100.1| RNA polymerase II transcription factor B subunit 2 [Saccharomyces
           cerevisiae RM11-1a]
 gi|256274256|gb|EEU09164.1| Tfb2p [Saccharomyces cerevisiae JAY291]
 gi|259150035|emb|CAY86838.1| Tfb2p [Saccharomyces cerevisiae EC1118]
 gi|285815419|tpg|DAA11311.1| TPA: Tfb2p [Saccharomyces cerevisiae S288c]
 gi|323331200|gb|EGA72618.1| Tfb2p [Saccharomyces cerevisiae AWRI796]
 gi|323335024|gb|EGA76314.1| Tfb2p [Saccharomyces cerevisiae Vin13]
 gi|323346175|gb|EGA80465.1| Tfb2p [Saccharomyces cerevisiae Lalvin QA23]
 gi|323352000|gb|EGA84539.1| Tfb2p [Saccharomyces cerevisiae VL3]
 gi|365762780|gb|EHN04313.1| Tfb2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
 gi|392295887|gb|EIW06990.1| Tfb2p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 513

 Score =  105 bits (263), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 57/171 (33%), Positives = 97/171 (56%), Gaps = 18/171 (10%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           +G ++VETNF++Y+YS S L   +L LF  ++ +  N+++G IT+ES+  A  NGITA+Q
Sbjct: 336 DGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQ 395

Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
           II++L+ +AHP++                   + +  +P  V DQIRLW+ +L+RV    
Sbjct: 396 IIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYE 455

Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
              Y +F +   +     YA+D   LLW+D KK +  ++ E +  + +F +
Sbjct: 456 GSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDFAK 506


>gi|349581695|dbj|GAA26852.1| K7_Tfb2p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 513

 Score =  105 bits (263), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 57/171 (33%), Positives = 97/171 (56%), Gaps = 18/171 (10%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           +G ++VETNF++Y+YS S L   +L LF  ++ +  N+++G IT+ES+  A  NGITA+Q
Sbjct: 336 DGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQ 395

Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
           II++L+ +AHP++                   + +  +P  V DQIRLW+ +L+RV    
Sbjct: 396 IIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYE 455

Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
              Y +F +   +     YA+D   LLW+D KK +  ++ E +  + +F +
Sbjct: 456 GSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDFAK 506


>gi|323302721|gb|EGA56527.1| Tfb2p [Saccharomyces cerevisiae FostersB]
          Length = 505

 Score =  105 bits (263), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 57/169 (33%), Positives = 96/169 (56%), Gaps = 18/169 (10%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           +G ++VETNF++Y+YS S L   +L LF  ++ +  N+++G IT+ES+  A  NGITA+Q
Sbjct: 336 DGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQ 395

Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
           II++L+ +AHP++                   + +  +P  V DQIRLW+ +L+RV    
Sbjct: 396 IIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYE 455

Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREF 194
              Y +F +   +     YA+D   LLW+D KK +  ++ E +  + +F
Sbjct: 456 GSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDF 504


>gi|440493551|gb|ELQ76006.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit TFB2 [Trachipleistophora
           hominis]
          Length = 426

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 56/165 (33%), Positives = 94/165 (56%), Gaps = 7/165 (4%)

Query: 36  LTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAF 95
           L  S+A ++ F++VETN+++YAY+TS     I++LFS+I  +LPNL+   IT+ES+  AF
Sbjct: 269 LFSSAAIQDSFLIVETNYKLYAYTTSSHELSIIKLFSQIVRELPNLVTAHITEESVNAAF 328

Query: 96  ENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSR 155
             GIT +QI+ +L + +          +P  V +QI +WE   +R++   A  Y  F + 
Sbjct: 329 LKGITGQQIVDYLTEKSKSE-------LPPVVLEQILIWERQRDRMKCIDAVIYSHFMTY 381

Query: 156 DVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
           + +E    Y R++  L+  D  +  LVV  E H  ++ F++   K
Sbjct: 382 NEYEITYRYCREKGALIDHDEFRRLLVVRLECHNEVKNFIKNNIK 426


>gi|358394641|gb|EHK44034.1| hypothetical protein TRIATDRAFT_222970 [Trichoderma atroviride IMI
           206040]
          Length = 491

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 64/232 (27%), Positives = 108/232 (46%), Gaps = 36/232 (15%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGF-------------- 46
           M+    D GL+ + Q   K S F PT+LAT L+          EG               
Sbjct: 262 MLPSLVDFGLIYIPQ--HKRSMFFPTRLATTLTSGGNSLRTISEGVTAATQSAQTSQQSL 319

Query: 47  -------------VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYN 93
                        VV+ETN+R+YAY+ S L   +L LF+K+  + P+++ G +++ S+  
Sbjct: 320 GPLGGGGEQQSGSVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQ 379

Query: 94  AFENGITAEQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHY 148
           A   GITA+QIIS+L  +AH ++         P +P  V DQIRLW+ +  R++ T    
Sbjct: 380 AINFGITADQIISYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFL 439

Query: 149 YEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
           + +F     +     ++ +   L+W +        N   H  +++FL+ + +
Sbjct: 440 FRDFTDDKDYLDTARFSEEIGVLVWRNDHARMFFANK--HEQIKDFLKTRKR 489


>gi|207340606|gb|EDZ68904.1| YPL122Cp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 226

 Score =  105 bits (261), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 57/171 (33%), Positives = 97/171 (56%), Gaps = 18/171 (10%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           +G ++VETNF++Y+YS S L   +L LF  ++ +  N+++G IT+ES+  A  NGITA+Q
Sbjct: 49  DGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQ 108

Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
           II++L+ +AHP++                   + +  +P  V DQIRLW+ +L+RV    
Sbjct: 109 IIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYE 168

Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
              Y +F +   +     YA+D   LLW+D KK +  ++ E +  + +F +
Sbjct: 169 GSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDFAK 219


>gi|400597688|gb|EJP65418.1| RNA polymerase II transcription factor B subunit 2 [Beauveria
           bassiana ARSEF 2860]
          Length = 507

 Score =  105 bits (261), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 69/246 (28%), Positives = 116/246 (47%), Gaps = 51/246 (20%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDS--------------------- 39
           M+   AD GLV +    +K S F PT+LAT L+   + S                     
Sbjct: 265 MLPALADFGLVYIPP--QKRSMFFPTRLATTLTSGASSSLRSISEGVVAATSAALSASSS 322

Query: 40  -------------------SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPN 80
                              +  + G +V+ETN+R+YAY+ S L   +L LFS++  + P+
Sbjct: 323 SSTHQDQNHQSRAGPLGAGAGDQIGSIVIETNYRLYAYTQSTLQIAVLALFSRLTMRFPD 382

Query: 81  LIVGAITKESLYNAFENGITAEQIISFLQQNAHPR------VADRIPSVPENVCDQIRLW 134
           ++ G IT+ S+  A + GITA+QII++L  +AH +      +A R P +P  V DQIRLW
Sbjct: 383 MVAGRITRTSIRQAIQFGITADQIIAYLAAHAHEQMHRSAALAAR-PVLPPTVVDQIRLW 441

Query: 135 ESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREF 194
           + +  R++ T    + +F  +  ++    +A +   L+W   K       A     +R++
Sbjct: 442 QLENERMKTTGGFLFRDFDDQKEYQDTRRFADEIGVLVWSSDKTGMFF--ASKFEQIRDY 499

Query: 195 LRGQNK 200
           L+ + K
Sbjct: 500 LKSRKK 505


>gi|452824378|gb|EME31381.1| transcription initiation factor TFIIH subunit H4 [Galdieria
           sulphuraria]
          Length = 740

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 65/181 (35%), Positives = 94/181 (51%), Gaps = 20/181 (11%)

Query: 5   FADLGLVKLQQVG--RKESWFIPTKLATNLSMS---LTD----------SSARKEGFVVV 49
            ADLGL+ L       K+S+F  T L   L+ S   L D          S+   +  ++V
Sbjct: 271 LADLGLIYLHISSSLEKDSYFYVTPLGVTLTRSICWLGDKNNYLLLEDWSNVDNDCRMIV 330

Query: 50  ETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQ 109
           +TNFR+Y Y+       +L LF +  Y+LPN+ VG IT++S+  A  NGITA+Q+IS+LQ
Sbjct: 331 QTNFRVYVYTNCTFQISLLSLFIQFLYRLPNMAVGVITRDSIRTALNNGITAQQMISYLQ 390

Query: 110 QNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRS 169
            + HP +  ++P     + DQIRLWE+   RV    A   + F +   FE    Y   R 
Sbjct: 391 NHMHPNMKGKLPIT---IIDQIRLWEAQRFRVTTKHALLLDHFDNMTCFEKT--YGTTRI 445

Query: 170 G 170
           G
Sbjct: 446 G 446


>gi|321265552|ref|XP_003197492.1| subunit of TFIIH and nucleotide excision repair factor 3 complexes
           [Cryptococcus gattii WM276]
 gi|317463972|gb|ADV25705.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes,
           putative [Cryptococcus gattii WM276]
          Length = 481

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/200 (29%), Positives = 110/200 (55%), Gaps = 8/200 (4%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL----SMSLTDSSARKEGFVVVETNFRMYA 57
           + D A  G +        + W  PT LAT+L    + ++   SA  + F+++ETN+++YA
Sbjct: 275 LNDLAQYGFIYKPSPDSDQFW--PTHLATSLCSGDASAIQSQSADDKRFLILETNYKIYA 332

Query: 58  YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
           Y++++L   IL LF  I  + PNL+VG + ++ +  A E GI+A QII++L  +AHP++ 
Sbjct: 333 YTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISARQIIAYLSSHAHPQMY 392

Query: 118 DRIPS-VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
           +  P  +   + DQ+ LW+ + NR++      YE F S+++F+   + A+  + L    +
Sbjct: 393 NSPPPLLHPTIVDQLHLWDRERNRLQTEETVMYEFF-SKELFDDTVNEAKANAALQHAAT 451

Query: 177 KKMRLVVNAEIHMHMREFLR 196
            +  L +       + EF++
Sbjct: 452 SQKLLFIEPHTKPAITEFVK 471


>gi|346972761|gb|EGY16213.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
           dahliae VdLs.17]
          Length = 476

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/219 (30%), Positives = 111/219 (50%), Gaps = 21/219 (9%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLA-TNLSM---SLTDS----------SARKEGF 46
           M+    D GLV +      + +  P   A TN+S    +LTD              +   
Sbjct: 259 MLPSLMDFGLVYIPSHKPFQYFVTPQARALTNVSTATRTLTDGMNALSQPQGGPGDQTNG 318

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           ++VETN+R+YAY+TS+L   +L LF  +  + P ++ G +T++S+  A + GITA+QIIS
Sbjct: 319 IIVETNYRVYAYTTSQLQIAVLSLFCHLSVKFPGMVSGRLTRQSVRQAIDFGITADQIIS 378

Query: 107 FLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
           +L  +AH ++     AD  P +P  V DQIRLW+ +  R++ T    + +F     +   
Sbjct: 379 YLAAHAHEQMHRQAAADGKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFDDDAEYRDI 438

Query: 162 CDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
             +A +   L W + +K      A  H  +R++L+ + K
Sbjct: 439 ARFADEIGVLAWRNDRKRMFF--ASKHEQIRDYLKLRKK 475


>gi|302405781|ref|XP_003000727.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
           albo-atrum VaMs.102]
 gi|261360684|gb|EEY23112.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
           albo-atrum VaMs.102]
          Length = 476

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/219 (30%), Positives = 111/219 (50%), Gaps = 21/219 (9%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLA-TNLSM---SLTDS----------SARKEGF 46
           M+    D GLV +      + +  P   A TN+S    +LTD              +   
Sbjct: 259 MLPSLMDFGLVYIPSHKPFQYFVTPQARALTNVSTATRTLTDGMNALSQSQGGPGDQTNG 318

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           ++VETN+R+YAY+TS+L   +L LF  +  + P ++ G +T++S+  A + GITA+QIIS
Sbjct: 319 IIVETNYRVYAYTTSQLQIAVLSLFCHLSVKFPGMVSGRLTRQSVRQAIDFGITADQIIS 378

Query: 107 FLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
           +L  +AH ++     AD  P +P  V DQIRLW+ +  R++ T    + +F     +   
Sbjct: 379 YLAAHAHEQMHRQAAADGKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFDDDAEYRDI 438

Query: 162 CDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
             +A +   L W + +K      A  H  +R++L+ + K
Sbjct: 439 ARFADEIGVLAWRNDRKRMFF--ASKHEQIRDYLKLRKK 475


>gi|322697444|gb|EFY89224.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Metarhizium acridum CQMa 102]
          Length = 541

 Score =  103 bits (258), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 54/166 (32%), Positives = 94/166 (56%), Gaps = 9/166 (5%)

Query: 41  ARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 100
           A ++G V+VETN+R+YAY+ S L   +L LF K+  + P+++ G +T+ S++ A   GIT
Sbjct: 377 ADQKGSVIVETNYRIYAYTQSTLQIAVLALFCKLNMRFPDMVAGRVTRTSIHQAINFGIT 436

Query: 101 AEQIISFLQQNAHPR------VADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPS 154
           A+QIIS+L  +AH +      +A++ P +P  V DQIRLW+ +  R++ T    + +F  
Sbjct: 437 ADQIISYLAAHAHEQMHRTAALANK-PILPPTVVDQIRLWQLENERMKTTGGFLFRDFED 495

Query: 155 RDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
              F     +A +   L+W   K      N   +  +R++L+ + +
Sbjct: 496 HKEFLDTARFAEEIGVLVWRSDKSGMFFANK--YEQIRDYLKSRKR 539


>gi|134118870|ref|XP_771938.1| hypothetical protein CNBN1180 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50254542|gb|EAL17291.1| hypothetical protein CNBN1180 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 481

 Score =  103 bits (258), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/200 (29%), Positives = 110/200 (55%), Gaps = 8/200 (4%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL----SMSLTDSSARKEGFVVVETNFRMYA 57
           + D A  G +        + W  PT LAT+L    + ++   SA  + F+++ETN+++YA
Sbjct: 275 LDDLAQYGFIYKSSPDSDQFW--PTHLATSLCSGDASAIQSQSADDKRFLILETNYKIYA 332

Query: 58  YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
           Y++++L   IL LF  I  + PNL+VG + ++ +  A E GI+A QII++L  +AHP++ 
Sbjct: 333 YTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISARQIIAYLSSHAHPQMY 392

Query: 118 DRIPS-VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
           +  P  +   + DQ+ LW+ + NR++      YE F S+++F+   + A+  + L    +
Sbjct: 393 NSPPPLLHPTIVDQLYLWDRERNRLQTEETVMYEFF-SKELFDDTVNEAKANAALQHAAT 451

Query: 177 KKMRLVVNAEIHMHMREFLR 196
            +  L +       + EF++
Sbjct: 452 SQKLLFIEPHTKPAITEFVK 471


>gi|66358470|ref|XP_626413.1| possible transcription factor TFIIH [Cryptosporidium parvum Iowa
           II]
 gi|46227864|gb|EAK88784.1| possible transcription factor TFIIH [Cryptosporidium parvum Iowa
           II]
          Length = 709

 Score =  103 bits (257), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 110/208 (52%), Gaps = 28/208 (13%)

Query: 19  KESWFIPTKLATNLSMS----LTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKI 74
           +E W   T   +N+ ++    L+ +S R E  ++V++NFR+Y Y+ S L  +ILR   ++
Sbjct: 496 EELWHKSTTETSNVKINEPNCLSMNSTRLEAGIIVQSNFRIYCYTASPLQAKILRHLCQV 555

Query: 75  EYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI-----PSVPENVCD 129
           + + PN+I G +T++ L +A+  G++AEQI+ F   NAHP +  R        +P NV  
Sbjct: 556 KVRGPNIICGILTRKGLLSAYSMGVSAEQILRFFSSNAHPIILRRFMLEGTSIIPVNVET 615

Query: 130 QIRLWESDLNRVEMTPAHYYEEFPSR----DVFEAACDYARDRSGLLW------------ 173
           Q++LWE D NR++++ A  + ++ +      +F     YAR +  LL+            
Sbjct: 616 QLKLWEKDKNRLKISHASTFSDWGASPNDIQLFRQTILYARSKDILLYNSPIELTEKELN 675

Query: 174 ---EDSKKMRLVVNAEIHMHMREFLRGQ 198
              E  KK+ LV+  E    ++ F+R +
Sbjct: 676 LNVELQKKIILVIKQEYEDDIKTFIRTK 703


>gi|366991461|ref|XP_003675496.1| hypothetical protein NCAS_0C01390 [Naumovozyma castellii CBS 4309]
 gi|342301361|emb|CCC69129.1| hypothetical protein NCAS_0C01390 [Naumovozyma castellii CBS 4309]
          Length = 516

 Score =  103 bits (256), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 56/171 (32%), Positives = 95/171 (55%), Gaps = 18/171 (10%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           +G +++ETNF++Y+YS S L    L LF  ++ +  N++ G IT+ES+  A  NGITA+Q
Sbjct: 340 DGSLIIETNFKLYSYSNSPLQIATLSLFVHLKSRFANMVTGQITRESIRRALINGITADQ 399

Query: 104 IISFLQQNAHPRV------------------ADRIPSVPENVCDQIRLWESDLNRVEMTP 145
           II++L+ +AHP++                   D +  +P  V DQI+LW+ +L+RV    
Sbjct: 400 IIAYLETHAHPQMRRLAEERLEKKLELDPNSKDPLQVLPPTVVDQIKLWQLELDRVITYE 459

Query: 146 AHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLR 196
              Y +F +   +     YA+D   LLW+D KK +  V+ E +  + ++ +
Sbjct: 460 GSLYSDFETIAEYTTLSKYAQDIGVLLWKDDKKKKFFVSKEGNSQVLDYAK 510


>gi|85691005|ref|XP_965902.1| hypothetical protein ECU01_0670 [Encephalitozoon cuniculi GB-M1]
 gi|19068469|emb|CAD24937.1| hypothetical protein [Encephalitozoon cuniculi GB-M1]
 gi|449329766|gb|AGE96035.1| hypothetical protein ECU01_0670 [Encephalitozoon cuniculi]
          Length = 414

 Score =  102 bits (254), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 65/198 (32%), Positives = 107/198 (54%), Gaps = 18/198 (9%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS-MSLTDSSAR-KEGFVVVETNFRMYAY 58
           + K    LG+++L + G          LA   S + L ++S R +  F++VETN ++YAY
Sbjct: 230 LFKYLEALGILRLYEKG----------LAIGRSFVQLFEASERNRREFIIVETNNKIYAY 279

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S+    ++ LF  +   LPNLI G IT+ES+  AF+ GIT +QII FL+ +       
Sbjct: 280 TNSEYEKSVIHLFCNVTVNLPNLIKGIITEESVNVAFDKGITGKQIIHFLEASV------ 333

Query: 119 RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
           R  S+P  + +QI +WES  NR+ M P + Y  F +   ++   ++    + L+  D  K
Sbjct: 334 RQGSLPPAIRNQIVIWESKRNRIFMVPGYLYSNFLNLSDYQKVLEFCIRGNHLVESDVDK 393

Query: 179 MRLVVNAEIHMHMREFLR 196
             +VV  E H  ++EF++
Sbjct: 394 RMIVVKLEGHSLVKEFVK 411


>gi|58262306|ref|XP_568563.1| hypothetical protein CNN01190 [Cryptococcus neoformans var.
           neoformans JEC21]
 gi|57230737|gb|AAW47046.1| hypothetical protein CNN01190 [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 481

 Score =  102 bits (254), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 59/200 (29%), Positives = 109/200 (54%), Gaps = 8/200 (4%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNL----SMSLTDSSARKEGFVVVETNFRMYA 57
           + D A  G +        + W  PT LAT+L    + ++   SA  + F+++ETN+++YA
Sbjct: 275 LDDLAQYGFIYKSSPDSDQFW--PTHLATSLCSGDASAIQSQSADDKRFLILETNYKIYA 332

Query: 58  YSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 117
           Y++++L   IL LF  I  + PNL+VG + ++ +  A E GI+A QII++L  +AHP++ 
Sbjct: 333 YTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISARQIIAYLSSHAHPQMY 392

Query: 118 DRIPS-VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDS 176
           +  P  +   + DQ+ LW+ + NR++      YE F S+++F+   + A+  + L     
Sbjct: 393 NSPPPLLHPTIVDQLYLWDRERNRLQTEETVMYEFF-SKELFDDTVNEAKANAALQHAAI 451

Query: 177 KKMRLVVNAEIHMHMREFLR 196
            +  L +       + EF++
Sbjct: 452 SQKLLFIEPHTKPAITEFVK 471


>gi|429965574|gb|ELA47571.1| hypothetical protein VCUG_00894 [Vavraia culicis 'floridensis']
          Length = 424

 Score =  102 bits (253), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 55/165 (33%), Positives = 93/165 (56%), Gaps = 7/165 (4%)

Query: 36  LTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAF 95
           L  S++ ++ F+VVETN+++YAY+TS     I++LF +I  +LPNL+V  IT+ES+  AF
Sbjct: 267 LFSSASTQDSFLVVETNYKLYAYTTSSHELSIIKLFCQIIRELPNLVVAHITEESVNAAF 326

Query: 96  ENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSR 155
             GIT +QI+ +L + +          +P  V +QI +WE   +R++   A  Y  F + 
Sbjct: 327 VKGITGQQIVDYLNEKSRSE-------LPVVVLEQILIWERKRDRMKCMDAVIYSHFMTY 379

Query: 156 DVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
             +E    Y +++  L+  D  +  LVV  E H  ++ F++   K
Sbjct: 380 GEYEITYKYCKEKGALVDYDEFRRLLVVKLEYHNDVKNFIKSNIK 424


>gi|336270172|ref|XP_003349845.1| hypothetical protein SMAC_00733 [Sordaria macrospora k-hell]
 gi|380095234|emb|CCC06707.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 496

 Score =  102 bits (253), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 68/240 (28%), Positives = 114/240 (47%), Gaps = 51/240 (21%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL----------SMSLTDSSARKEG----- 45
           M+    D GL+   +  R+   F PT+LAT L          S + T ++A   G     
Sbjct: 266 MLPSLVDFGLIYFPRDTRQ---FFPTRLATTLTSTASALRTVSSAFTAATANPSGEPGTT 322

Query: 46  --------------------FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGA 85
                                +++ETN+R+YAY+TS L   +L LF ++  + PN++  A
Sbjct: 323 TSTSGGPAGATPADKTGTSKGIIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSAA 382

Query: 86  ITKESLYNAFENGITAEQIISFLQQNAHPRVADRI-----PSVPENVCDQIRLWESDLNR 140
           +       A   GITA+QIIS+L  +AH ++           +P  V DQIRLW+ +  R
Sbjct: 383 L------RAIGFGITADQIISYLHAHAHQQMVREATVTGKTVLPPTVVDQIRLWQLENER 436

Query: 141 VEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
           ++ T    +++F S + + A   YA +   L+W++ K  R +  A  H  +R++L+ + K
Sbjct: 437 MQTTAGFLFKDFESTEEYAALSRYAEEIGVLVWKNDK--RQLFFASKHEQLRDYLKSRKK 494


>gi|402083674|gb|EJT78692.1| RNA polymerase II transcription factor B subunit 2 [Gaeumannomyces
           graminis var. tritici R3-111a-1]
          Length = 496

 Score =  101 bits (251), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 64/228 (28%), Positives = 113/228 (49%), Gaps = 33/228 (14%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM----------------------SLTD 38
           M+    D GLV +   G    +F PT+LAT L+                       S   
Sbjct: 271 MLPSLVDFGLVYIPP-GNTSQYF-PTRLATTLTSGSSALRSASSALAAATAEGPNASGGG 328

Query: 39  SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
            S   +G +++ETN+R+YAY+++ L   IL LF+++ ++   ++ G + +ES+  A   G
Sbjct: 329 QSEAAKGSIIIETNYRLYAYTSTPLQIAILGLFAELRFRFAGMVTGRLDRESIKRAISYG 388

Query: 99  ITAEQIISFLQQNAHP---RVAD--RIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFP 153
           ITA+Q+I +L  +AH    R A   + P +P  V DQIRLW+ +  R+++     + +F 
Sbjct: 389 ITADQVIEYLAAHAHEQMHRTATLRKKPVLPPTVVDQIRLWQLETERMKIMRGFLFRDFD 448

Query: 154 SRDVFEAACDYARDRSGLLW-EDSKKMRLVVNAEIHMHMREFLRGQNK 200
           S+  ++    YA +   LLW  D++++      +    +  FL+ + K
Sbjct: 449 SQAEYDDLAKYADEIGVLLWRSDARQLFFASKVD---QLSVFLKARKK 493


>gi|300707677|ref|XP_002996037.1| hypothetical protein NCER_100934 [Nosema ceranae BRL01]
 gi|239605297|gb|EEQ82366.1| hypothetical protein NCER_100934 [Nosema ceranae BRL01]
          Length = 403

 Score =  101 bits (251), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 50/160 (31%), Positives = 96/160 (60%), Gaps = 6/160 (3%)

Query: 39  SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
           S  +K  F+++ETNF++YAY++S     I++LFS I  ++PNLI  +IT+ESL NAF  G
Sbjct: 247 SDTKKLRFILLETNFKLYAYTSSVYEMSIIQLFSNIYLKMPNLIKASITEESLSNAFSKG 306

Query: 99  ITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVF 158
           +T++QII+FL+  ++    D    +P  +  QI +WE+   R+++ P + Y  F +   +
Sbjct: 307 VTSQQIINFLK--SYSLFED----IPVAIISQIIIWETKRKRIKIFPGYLYSNFLNLIDY 360

Query: 159 EAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQ 198
           +    +      ++ +D +K  +V+  + +  +++F++ Q
Sbjct: 361 QKVVKFCLSNDCIIEKDDEKRMIVIKPDHNEVVKKFVKQQ 400


>gi|322711210|gb|EFZ02784.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Metarhizium anisopliae ARSEF 23]
          Length = 540

 Score =  100 bits (249), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 68/250 (27%), Positives = 115/250 (46%), Gaps = 55/250 (22%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTD---------------------- 38
           M+    D GL+ + Q   K S F PT+LAT L+   +                       
Sbjct: 294 MLPSLLDFGLIYIPQ--HKRSMFFPTRLATTLTSGGSSLRTISEGVAAATAAATTTAPAG 351

Query: 39  ------------------SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPN 80
                              SA ++G V+VETN+R+YAY+ S L   +L LF K+  + P+
Sbjct: 352 GPSSQSGSGSGGTGGPLGPSADQKGSVIVETNYRIYAYTQSTLQIAVLALFCKLNMRFPD 411

Query: 81  LIVGAITKESLYNAFENGITAEQIISFLQQNAHPR------VADRIPSVPENVCDQIRLW 134
           ++ G +T+ S++ A   GI+A+QIIS+L  +AH +      +A++ P +P  V DQIRLW
Sbjct: 412 MVAGRVTRTSIHQAINFGISADQIISYLAAHAHEQMHRSAALANK-PILPPTVVDQIRLW 470

Query: 135 ESDLN----RVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMH 190
           + +      R++ T    + +F     F     +A +   L+W   K      N   +  
Sbjct: 471 QLEWQLENERMKTTGGFLFRDFEDHKEFLDTARFAEEIGVLVWRSDKSGMFFANK--YEQ 528

Query: 191 MREFLRGQNK 200
           +R++L+ + +
Sbjct: 529 IRDYLKSRKR 538


>gi|310799661|gb|EFQ34554.1| transcription factor tfb2 [Glomerella graminicola M1.001]
          Length = 485

 Score = 99.8 bits (247), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 55/164 (33%), Positives = 90/164 (54%), Gaps = 7/164 (4%)

Query: 42  RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITA 101
           R   F + ETN+R+YAY  + L   +L LF K++ +  +++ G +T+ S+ NA E GITA
Sbjct: 321 RVRPFRLSETNYRVYAYGQTPLQIAVLSLFCKLKLRFADMVSGRLTRNSIRNAVERGITA 380

Query: 102 EQIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRD 156
           +QIIS+L  +AH ++     A   P +P  V DQIRLW+ +  R+  T    + +F S  
Sbjct: 381 DQIISYLAAHAHEQMHRMAAARSRPILPPTVIDQIRLWQLETERMTTTSGFLFRDFDSPR 440

Query: 157 VFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
            ++    YA +   L+W + K       A  H  +R++L+ + K
Sbjct: 441 EYDDIAGYAAEIGVLVWRNDKLGMFF--ASKHEQIRDYLKLRKK 482


>gi|194388768|dbj|BAG60352.1| unnamed protein product [Homo sapiens]
          Length = 386

 Score = 99.4 bits (246), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 49/121 (40%), Positives = 79/121 (65%), Gaps = 4/121 (3%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA--RKEGFVVVETNFRMYAYS 59
           ++   + GLV   Q  RK   + PT+LA NLS  ++ +     + GF+VVETN+R+YAY+
Sbjct: 268 LQHLREFGLVF--QRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYT 325

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
            S+L   ++ LFS++ Y+ PN++V  +T+ES+  A  +GITA+QII FL+  AHP +  +
Sbjct: 326 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 385

Query: 120 I 120
           +
Sbjct: 386 V 386


>gi|449017614|dbj|BAM81016.1| similar to TFIIH subunit TFB2 [Cyanidioschyzon merolae strain 10D]
          Length = 693

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 56/158 (35%), Positives = 89/158 (56%), Gaps = 9/158 (5%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           ++VETNFR+YAY+ S     +L LF++I Y++P + +G IT++S+  A + GITA+Q++ 
Sbjct: 538 IIVETNFRLYAYAASSFQVALLSLFTRILYRMPGVAIGVITRDSVRRALKCGITAKQLLH 597

Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF-PSRD---VFEAAC 162
           FL  ++         SVP NV DQI LWE +  R++  P    E F P+ D    F+   
Sbjct: 598 FLGIHSM-----EGKSVPFNVHDQILLWELERKRIQAYPGVLLEGFEPTSDGRAFFDQLQ 652

Query: 163 DYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
           +Y  +     W D  +  LV +A     +RE++R Q +
Sbjct: 653 EYTVELGAQQWCDRVRQLLVADASSFERLREWIRRQAQ 690


>gi|221482097|gb|EEE20458.1| tfiih, polypeptide, putative [Toxoplasma gondii GT1]
          Length = 861

 Score = 96.7 bits (239), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 74/108 (68%), Gaps = 5/108 (4%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           E  ++V++NF++Y Y+ S L   +L    +++ ++PNLIVG +T+ S+  A+++GITA+Q
Sbjct: 672 EAGMLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQ 731

Query: 104 IISFLQQNAHPRVADR-----IPSVPENVCDQIRLWESDLNRVEMTPA 146
           II FL+ +AHP V +R      P +PENV  Q+R+WE++  R+ + PA
Sbjct: 732 IIRFLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 779


>gi|401410644|ref|XP_003884770.1| hypothetical protein NCLIV_051680 [Neospora caninum Liverpool]
 gi|325119188|emb|CBZ54742.1| hypothetical protein NCLIV_051680 [Neospora caninum Liverpool]
          Length = 868

 Score = 96.3 bits (238), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 74/108 (68%), Gaps = 5/108 (4%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           E  ++V++NF++Y Y+ S L   +L    +++ ++PNLIVG +T+ S+  A+++GITA+Q
Sbjct: 679 EAGMLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQ 738

Query: 104 IISFLQQNAHPRVADR-----IPSVPENVCDQIRLWESDLNRVEMTPA 146
           II FL+ +AHP V +R      P +PENV  Q+R+WE++  R+ + PA
Sbjct: 739 IIRFLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 786


>gi|237842971|ref|XP_002370783.1| hypothetical protein TGME49_014520 [Toxoplasma gondii ME49]
 gi|211968447|gb|EEB03643.1| hypothetical protein TGME49_014520 [Toxoplasma gondii ME49]
          Length = 836

 Score = 96.3 bits (238), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 74/108 (68%), Gaps = 5/108 (4%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           E  ++V++NF++Y Y+ S L   +L    +++ ++PNLIVG +T+ S+  A+++GITA+Q
Sbjct: 647 EAGMLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQ 706

Query: 104 IISFLQQNAHPRVADR-----IPSVPENVCDQIRLWESDLNRVEMTPA 146
           II FL+ +AHP V +R      P +PENV  Q+R+WE++  R+ + PA
Sbjct: 707 IIRFLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 754


>gi|221502519|gb|EEE28246.1| tfiih, polypeptide, putative [Toxoplasma gondii VEG]
          Length = 836

 Score = 96.3 bits (238), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 44/108 (40%), Positives = 74/108 (68%), Gaps = 5/108 (4%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           E  ++V++NF++Y Y+ S L   +L    +++ ++PNLIVG +T+ S+  A+++GITA+Q
Sbjct: 647 EAGMLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQ 706

Query: 104 IISFLQQNAHPRVADR-----IPSVPENVCDQIRLWESDLNRVEMTPA 146
           II FL+ +AHP V +R      P +PENV  Q+R+WE++  R+ + PA
Sbjct: 707 IIRFLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 754


>gi|398406290|ref|XP_003854611.1| hypothetical protein MYCGRDRAFT_69317 [Zymoseptoria tritici IPO323]
 gi|339474494|gb|EGP89587.1| hypothetical protein MYCGRDRAFT_69317 [Zymoseptoria tritici IPO323]
          Length = 493

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 60/186 (32%), Positives = 98/186 (52%), Gaps = 26/186 (13%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSA----------------RKE 44
           M+ D  D GL+   +     + + PT+LAT L+   +D+ A                 ++
Sbjct: 266 MLDDLQDFGLI--YRSSPTATAYYPTRLATTLT---SDAPALPNNTSSTTTTAATESNEK 320

Query: 45  GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
           G++++ETN+R+YAY++S L   IL LFS +  + PNLI   ITK S+ +A    IT++QI
Sbjct: 321 GYIILETNYRLYAYTSSPLQISILSLFSTLHTRYPNLITAKITKSSIASAISQSITSDQI 380

Query: 105 ISFLQQNAHPRVADR-----IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFE 159
           I++L  +AHP +  +      P +P  V DQIRLW+ +  R+      Y  +    + + 
Sbjct: 381 ITYLSTHAHPVLRRQAAIMSAPILPPTVVDQIRLWQIEGERMTTWKGFYIRDVGGWEEYV 440

Query: 160 AACDYA 165
            A +YA
Sbjct: 441 KAVEYA 446


>gi|402465832|gb|EJW01474.1| hypothetical protein EDEG_03937 [Edhazardia aedis USNM 41457]
          Length = 465

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 54/156 (34%), Positives = 90/156 (57%), Gaps = 12/156 (7%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           E F+ +ETNF++YAY+TSK    IL LFSKI  +LPNLI   I ++S+  AF+  I+A+Q
Sbjct: 316 ESFMYIETNFKLYAYTTSKYDFSILSLFSKISCKLPNLISAIINEDSVNTAFDKKISAKQ 375

Query: 104 IISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACD 163
           I  +L+            +VP+NV +Q+ +WES  NR++ +    ++ F +   F+ A D
Sbjct: 376 ISYYLKSKG--------KNVPKNVVEQVYIWESKRNRIKTSECTLFKGFLNLMDFKKAVD 427

Query: 164 YARDRSGL--LWEDSKKMRLVVNAEIHMHMREFLRG 197
             +++  L  ++ED  K  + VN       +E+++ 
Sbjct: 428 VCKEKHWLVDVYED--KRFIFVNNLYAEEFKEYIKN 461


>gi|164660182|ref|XP_001731214.1| hypothetical protein MGL_1397 [Malassezia globosa CBS 7966]
 gi|159105114|gb|EDP44000.1| hypothetical protein MGL_1397 [Malassezia globosa CBS 7966]
          Length = 231

 Score = 93.2 bits (230), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 43/120 (35%), Positives = 81/120 (67%), Gaps = 7/120 (5%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR-----KEGFVVVETNFRMY 56
           +++ +D GLV   +  +   +F PT+LA+ L+ + +   +R     ++G++++ETN+R+Y
Sbjct: 102 LEELSDYGLVY--RPSKSAKYFFPTRLASTLTSTASPLLSRLNDQEEQGYLILETNYRVY 159

Query: 57  AYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV 116
           AY+ + L   IL LF  ++ +LPNL++G +T+ S+ +A   GITA+QII++L  +AHP++
Sbjct: 160 AYTANLLRIAILNLFVTLKSRLPNLVIGQLTRHSVKSALNKGITADQIITYLTHHAHPQM 219


>gi|71033737|ref|XP_766510.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68353467|gb|EAN34227.1| hypothetical protein, conserved [Theileria parva]
          Length = 359

 Score = 92.8 bits (229), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 94/158 (59%), Gaps = 18/158 (11%)

Query: 24  IPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIV 83
           +PT    +LS+      A  +  ++V++NF++Y Y+ S L   +L    +++ + PNL+V
Sbjct: 146 LPTSKDLDLSIK-----AGNDSRIIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLVV 200

Query: 84  GAITKESLYNAFENGITAEQIISFLQQNA-HPRVADR---------IPSVPENVCDQIRL 133
           G +T+ES+ +AF++GIT+++II FL  N  +  + ++           S+PENVC Q+++
Sbjct: 201 GVLTRESVQSAFKSGITSKEIIRFLSPNGMNSSIGNQENTLLNSTFTYSIPENVCRQLKM 260

Query: 134 WESDLNRVEMTPAHYYEEFPSR---DVFEAACDYARDR 168
           WES+ +R+E+ P+  ++ +      D+F+    +A+ +
Sbjct: 261 WESERDRIELCPSIVFKRWDQDFMPDLFQRTVRWAQSK 298


>gi|422294120|gb|EKU21420.1| transcription initiation factor TFIIH subunit 4, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 213

 Score = 92.4 bits (228), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 66/205 (32%), Positives = 101/205 (49%), Gaps = 40/205 (19%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR---------KEG------ 45
           +++ F   GL+   Q  R    F PT +A NL    T    R          EG      
Sbjct: 16  LLEKFISFGLI--YQDDRHSRHFYPTAVAVNLIFGGTVQEERLRRGHGHVGPEGRENTKE 73

Query: 46  ---------FVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 96
                     V+VETN+++ AY++S LH E+LR+F+ +  +LPN+++G IT+ S+  A  
Sbjct: 74  LRLVDPSQLAVIVETNYQLVAYTSSSLHVEMLRIFTDVRCRLPNVVIGFITRASVRRAMA 133

Query: 97  NGITAEQIISFLQQNAHPRV-ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSR 155
           +GITA  I+SFL+ + H  V A +   +PENV  QI LW  + +RV+      +EE    
Sbjct: 134 SGITAATILSFLKTHTHVAVRAGKGRLLPENVEAQIELWHQERSRVK------FEEVMMI 187

Query: 156 DV-------FEAACDYARDRSGLLW 173
           D+       FE    YA + + + W
Sbjct: 188 DLSSLMVEEFEEVRTYAENLAVVCW 212


>gi|84998132|ref|XP_953787.1| TFIIH subunit, Tf2b  [Theileria annulata]
 gi|65304784|emb|CAI73109.1| TFIIH subunit, Tf2b homologue, putative [Theileria annulata]
          Length = 645

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 86/141 (60%), Gaps = 13/141 (9%)

Query: 41  ARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGIT 100
           A  +  ++V++NF++Y Y+ S L   +L    +++ + PNL+VG +T+ES+ +AF++GIT
Sbjct: 445 AGNDSRIIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLVVGVLTRESVQSAFKSGIT 504

Query: 101 AEQIISFLQQNA-HPRVADR---------IPSVPENVCDQIRLWESDLNRVEMTPAHYYE 150
           +++II FL  N  +  +  +           S+PENVC Q+++WES+ +R+E+ P+  ++
Sbjct: 505 SKEIIRFLSPNGMNSSIGSQENTLLNSSFTYSIPENVCRQLKMWESERDRIELCPSIVFK 564

Query: 151 EFPSR---DVFEAACDYARDR 168
            +      D+F+    +A+ +
Sbjct: 565 RWDQDFMPDLFQRTVRWAQSK 585


>gi|302831319|ref|XP_002947225.1| hypothetical protein VOLCADRAFT_35893 [Volvox carteri f.
           nagariensis]
 gi|300267632|gb|EFJ51815.1| hypothetical protein VOLCADRAFT_35893 [Volvox carteri f.
           nagariensis]
          Length = 90

 Score = 90.5 bits (223), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 54/85 (63%)

Query: 98  GITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDV 157
           GI A++++S+L    HP +A R P VPE V DQIRLWE+ +NR+       YE   SR++
Sbjct: 1   GIAADELVSYLAARPHPSIASRTPVVPEVVSDQIRLWEASMNRLRADSVVLYENLASREL 60

Query: 158 FEAACDYARDRSGLLWEDSKKMRLV 182
           FE A  ++R    LLWEDS +MR V
Sbjct: 61  FERALAFSRSSGTLLWEDSGQMRFV 85


>gi|323455511|gb|EGB11379.1| hypothetical protein AURANDRAFT_52511 [Aureococcus anophagefferens]
          Length = 291

 Score = 90.1 bits (222), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 55/150 (36%), Positives = 83/150 (55%), Gaps = 17/150 (11%)

Query: 8   LGLVKLQQVGRKESWFIPTKLATNLSMSLTDSS--ARKEGF------VVVETNFRMYAYS 59
           LGLV  +      S F PT L  +++     S+  AR  G       V+V+TNF++ AY+
Sbjct: 81  LGLVYRRNA--SSSRFYPTTLGVDVAFGARRSAGGARAGGDFRRPVDVIVQTNFQVLAYT 138

Query: 60  -----TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 114
                TS L    L LF+++  +LPNL+VG I+++++    + GI   QI+ FL+ +AHP
Sbjct: 139 DAGVNTSTLVLATLNLFAELTTRLPNLVVGTISRDAIKRCVDRGIRVPQIVKFLRAHAHP 198

Query: 115 RVADRIPSVPENVCDQIRLWESDLNRVEMT 144
             A +   VP+NV DQ+ LW  + NRV  T
Sbjct: 199 --AMKASGVPQNVTDQMALWAGEGNRVAFT 226


>gi|428673252|gb|EKX74165.1| conserved hypothetical protein [Babesia equi]
          Length = 668

 Score = 89.7 bits (221), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 77/124 (62%), Gaps = 7/124 (5%)

Query: 23  FIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLI 82
            +P+    ++  S++ +S      ++V++NF++Y Y+ + L  +IL +  +++ + PN++
Sbjct: 470 LVPSNSPLSVHTSISFTSEYLPSKIIVQSNFKVYVYTVNNLQFDILNILCEVQARTPNMV 529

Query: 83  VGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVE 142
           VG +T+ES   AF++GIT+ +II F         +    + PENV  Q+R+WE++ NRVE
Sbjct: 530 VGVLTRESAQRAFKSGITSHEIIRFFS-------STNTSTFPENVIRQLRMWEAERNRVE 582

Query: 143 MTPA 146
           ++PA
Sbjct: 583 LSPA 586


>gi|412992644|emb|CCO18624.1| predicted protein [Bathycoccus prasinos]
          Length = 464

 Score = 89.7 bits (221), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 42/89 (47%), Positives = 63/89 (70%), Gaps = 5/89 (5%)

Query: 22  WFIPTKLATNLSMSLTDSSA-----RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEY 76
           W++PT L+  LS   T SSA     R +G ++VETNFR+YAY+ S+L  E+LRLF++ +Y
Sbjct: 373 WYVPTVLSAGLSSVSTTSSAKSALARIDGHIIVETNFRVYAYTHSELETEVLRLFTRPDY 432

Query: 77  QLPNLIVGAITKESLYNAFENGITAEQII 105
           +LPN  VG IT++S+ +A   GI+ +QI+
Sbjct: 433 KLPNAYVGMITRDSILDAMRAGISPDQIV 461


>gi|124806778|ref|XP_001350829.1| transcription factor Tfb2, putative [Plasmodium falciparum 3D7]
 gi|23496958|gb|AAN36509.1| transcription factor Tfb2, putative [Plasmodium falciparum 3D7]
          Length = 945

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 57/196 (29%), Positives = 108/196 (55%), Gaps = 15/196 (7%)

Query: 3   KDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTSK 62
           KD   L L K + + +K    I   L +++ M   + S   E  ++V++NF++Y Y++S 
Sbjct: 728 KDEKSLILDKRKNITKK---LINDSLYSDIKM---NGSNNLEIGLIVQSNFKVYLYTSSL 781

Query: 63  LHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPS 122
           L   IL    +++ + PN++VG +T+ S+ NA+ + ITA QII FL+  +HP   +   S
Sbjct: 782 LKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIKFLESYSHPGKNNFKSS 841

Query: 123 VPENVCDQIRLWESDLNRVEMTPAHYYEEFPSR---DVFEAACDYARDRSGLLW------ 173
           +P NV  Q++LWES+ +R+ +  A  ++ F       +++    +A  ++ LL+      
Sbjct: 842 IPMNVITQLKLWESERHRLTLEDAIVFKSFEKDFMPHLYQQIVIWANSKNYLLYYTPWPK 901

Query: 174 EDSKKMRLVVNAEIHM 189
            ++K+  L + AE ++
Sbjct: 902 NNTKEFDLWIKAEKYL 917


>gi|403221352|dbj|BAM39485.1| transcription factor IIH subunit [Theileria orientalis strain
           Shintoku]
          Length = 634

 Score = 88.6 bits (218), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 86/144 (59%), Gaps = 13/144 (9%)

Query: 38  DSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFEN 97
           + +A  E  ++V++NF++Y Y+ S L   +L    +++ + PNL+VG +T+ES+ +AF++
Sbjct: 412 NKTAGNESRIIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLVVGVLTRESVQSAFKS 471

Query: 98  GITAEQIISFLQ--------QNAHPRVADRIPS--VPENVCDQIRLWESDLNRVEMTPAH 147
           GIT+ +II FL                 D   +  +PENVC Q+++WES+ +R+E++PA 
Sbjct: 472 GITSNEIIRFLSPMKLSLSYAGTCNSFMDSFANYKIPENVCRQLKMWESERDRIELSPAI 531

Query: 148 YYEEFPSR---DVFEAACDYARDR 168
            ++ + +    ++F+    +A+ +
Sbjct: 532 LFKRWDADFMPELFQRTVRWAQSK 555


>gi|209880858|ref|XP_002141868.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209557474|gb|EEA07519.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 794

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 49/175 (28%), Positives = 92/175 (52%), Gaps = 12/175 (6%)

Query: 10  LVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILR 69
           +V+L          I   + +NL +S     +  E  ++V++NFR+Y Y+ S L  +ILR
Sbjct: 572 IVELNNTSNDIDLKISHSIESNLFLS---GISNLEAGIIVQSNFRVYCYTASPLQAKILR 628

Query: 70  LFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP-----RVADRIPSVP 124
              +++ + PN+I G +T+  L +A+  G+ A QI+ F   NAHP      + D    +P
Sbjct: 629 HLCQVKVRGPNIICGVLTRRGLLSAYSMGVKAYQILRFFISNAHPIILKKHITDGTSIIP 688

Query: 125 ENVCDQIRLWESDLNRVEMTPAHYYEEFPSR----DVFEAACDYARDRSGLLWED 175
            +V  Q++LWE+D NR+E+     + ++ +     ++F+    YA  +  +L+ +
Sbjct: 689 ISVETQLKLWENDHNRLEINKVSLFSDWGNNKEDIELFKQTVTYAIGKQVVLYHN 743


>gi|389586531|dbj|GAB69260.1| hypothetical protein PCYB_146880 [Plasmodium cynomolgi strain B]
          Length = 913

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 40/106 (37%), Positives = 70/106 (66%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           ++V++NF++Y Y+ S L   IL    +++ + PN++VG +T+ S+ NA+ + ITA QII 
Sbjct: 733 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 792

Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           FL+  AHP  ++   S+P NV  Q++LWES+ +R+ +  A  ++ F
Sbjct: 793 FLESYAHPGRSNFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 838


>gi|399949686|gb|AFP65344.1| tfIIB-like protein [Chroomonas mesostigmatica CCMP1168]
          Length = 464

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 41/100 (41%), Positives = 64/100 (64%), Gaps = 4/100 (4%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           +++E+NFR+Y Y  + L  ++L +FS+I YQLPNL VG IT++S+  AF +G T+  II 
Sbjct: 329 IIIESNFRIYVYKNAHLGNDLLLVFSEILYQLPNLFVGEITEKSILKAFNSGSTSRNIIG 388

Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPA 146
           FL+ N HP      P +P  V +QI+ WE     +++T +
Sbjct: 389 FLKNNLHPIC----PWIPSAVTNQIKAWEFQKTEIKITES 424


>gi|221061889|ref|XP_002262514.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
 gi|193811664|emb|CAQ42392.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
          Length = 943

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 40/106 (37%), Positives = 70/106 (66%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           ++V++NF++Y Y+ S L   IL    +++ + PN++VG +T+ S+ NA+ + ITA QII 
Sbjct: 763 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 822

Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           FL+  AHP  ++   S+P NV  Q++LWES+ +R+ +  A  ++ F
Sbjct: 823 FLESYAHPGRSNFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 868


>gi|60603004|gb|AAX27959.1| unknown [Schistosoma japonicum]
          Length = 100

 Score = 87.4 bits (215), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 40/76 (52%), Positives = 54/76 (71%)

Query: 39  SSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENG 98
           SS+   G++++ETNFR+YAY+ S L   +L LFSKI  + PNL+V  IT++S+  A   G
Sbjct: 8   SSSSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRG 67

Query: 99  ITAEQIISFLQQNAHP 114
           ITA QI+SFL  NAHP
Sbjct: 68  ITANQILSFLTSNAHP 83


>gi|432098035|gb|ELK27922.1| General transcription factor IIH subunit 4 [Myotis davidii]
          Length = 377

 Score = 86.7 bits (213), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 53/184 (28%), Positives = 89/184 (48%), Gaps = 36/184 (19%)

Query: 18  RKESWFIPTKLATNLS--MSLTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIE 75
           RK   + PT+LA +LS  +S    +A + GF+VVETN+++YAY+ S+L   ++ LFS++ 
Sbjct: 225 RKSRRYYPTRLAISLSSGVSGAGGTAHQPGFIVVETNYQLYAYTESELQIALIALFSEML 284

Query: 76  YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWE 135
           Y+ PN++V                                   + P +P  + DQIRLWE
Sbjct: 285 YRFPNMVVA----------------------------------QTPVLPPTITDQIRLWE 310

Query: 136 SDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFL 195
            + +R+  T    Y +F S+  F+     A +   L++E+  K  +VV    H  ++ F 
Sbjct: 311 LERDRLLFTEGVLYNQFLSQVDFDLLLARAWELGALVFENMAKQLMVVTPAGHREVKRFW 370

Query: 196 RGQN 199
           + Q 
Sbjct: 371 KQQK 374


>gi|156095867|ref|XP_001613968.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148802842|gb|EDL44241.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 960

 Score = 86.7 bits (213), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 40/106 (37%), Positives = 69/106 (65%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           ++V++NF++Y Y+ S L   IL    +++ + PN++VG +T+ S+ NA+ + ITA QII 
Sbjct: 780 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 839

Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           FL+  AHP  +    S+P NV  Q++LWES+ +R+ +  A  ++ F
Sbjct: 840 FLESYAHPGRSTFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 885


>gi|156086630|ref|XP_001610724.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154797977|gb|EDO07156.1| conserved hypothetical protein [Babesia bovis]
          Length = 586

 Score = 86.7 bits (213), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 90/159 (56%), Gaps = 16/159 (10%)

Query: 1   MIKDFADLGLVKLQQVGRK-----ESWFIPTKLAT-----NLSMSLTDSSARKEGFVVVE 50
           +++   +LGLV     G K      S+ +  K A       +SMS+  +   K   +VV+
Sbjct: 343 ILRFLIELGLVYADSDGSKIYVTNLSFLVDVKHANLVEEHQISMSVCGNKGSK---MVVQ 399

Query: 51  TNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQ 110
           +NF++YAY  S L   +L    +++ + PNL++G +T+ SL  AF++GITA+Q+I F + 
Sbjct: 400 SNFKIYAYIPSALQMNVLNHICELQAKTPNLVIGVLTRSSLQTAFKSGITADQLICFFES 459

Query: 111 NAH---PRVADRIPSVPENVCDQIRLWESDLNRVEMTPA 146
                  ++   + +VPENV  Q+++WE++ NR+E+  A
Sbjct: 460 KGQYDDIQINRNVMNVPENVRRQLKMWEAERNRLELLNA 498


>gi|387595518|gb|EIJ93142.1| hypothetical protein NEPG_02098 [Nematocida parisii ERTm1]
          Length = 398

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 63/192 (32%), Positives = 96/192 (50%), Gaps = 16/192 (8%)

Query: 5   FADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTSKLH 64
           F  LGL+  ++   K   F PT      S+ L D     E F+V+E+NFR+Y YS   L 
Sbjct: 219 FESLGLIFFEKGLVK---FSPT-----FSL-LFDDEEGAEKFLVLESNFRLYIYSNRPLD 269

Query: 65  CEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVP 124
             I+ LFS    + PN+IV  I ++S+  A  +GITA QI  +L QN+       +  + 
Sbjct: 270 TFIISLFSIKSREFPNMIVAMINEDSIRQALMHGITAGQIRVYLNQNS-------MYEIN 322

Query: 125 ENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVN 184
           ENV +QIRLWE  +NR+    ++ +  F +   F     Y  + +       +K  LVV 
Sbjct: 323 ENVIEQIRLWEKRMNRIHSWESYIFSNFLNYKDFLLVESYCENNNIDHRSYREKRMLVVG 382

Query: 185 AEIHMHMREFLR 196
            E +  ++ F+R
Sbjct: 383 IENYESVKSFIR 394


>gi|378754852|gb|EHY64880.1| hypothetical protein NERG_01936 [Nematocida sp. 1 ERTm2]
          Length = 396

 Score = 84.0 bits (206), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 59/196 (30%), Positives = 95/196 (48%), Gaps = 16/196 (8%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
           M+  F  LGL+  ++           + ++  S+ L D     E F+ +E+NFR+Y YS 
Sbjct: 213 MLDLFESLGLITFERG--------LVRFSSTFSL-LFDDEEGGEKFLTLESNFRLYIYSN 263

Query: 61  SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
             L   I+ LFS    + PN+IV  I ++S+  A   GITA QI  +L QN+       +
Sbjct: 264 RPLDVFIISLFSIKIREFPNMIVAMINEDSIRQALTYGITAGQIRVYLNQNS-------M 316

Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMR 180
             + ENV +QIRLWE  +NR+    ++ +  F +   F     Y  + +        K  
Sbjct: 317 HKINENVLEQIRLWEKRMNRIHAWESYIFSNFLNYKDFLLVESYCENNNVEHRSYRDKRV 376

Query: 181 LVVNAEIHMHMREFLR 196
           LVV  E + +++ F+R
Sbjct: 377 LVVGVENYDNVKSFIR 392


>gi|387592902|gb|EIJ87926.1| hypothetical protein NEQG_01998 [Nematocida parisii ERTm3]
          Length = 398

 Score = 84.0 bits (206), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 96/192 (50%), Gaps = 16/192 (8%)

Query: 5   FADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYSTSKLH 64
           F  LGL+  ++   K   F PT      S+ L D     E F+V+E+NFR+Y YS   L 
Sbjct: 219 FESLGLIFFEKGLVK---FSPT-----FSL-LFDDEEGAEKFLVLESNFRLYIYSNRPLD 269

Query: 65  CEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVP 124
             I+ LFS    + PN++V  I ++S+  A  +GITA QI  +L QN+       +  + 
Sbjct: 270 TFIISLFSIKSREFPNMMVAMINEDSIRQALMHGITAGQIRVYLNQNS-------MYEIN 322

Query: 125 ENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVN 184
           ENV +QIRLWE  +NR+    ++ +  F +   F     Y  + +       +K  LVV 
Sbjct: 323 ENVIEQIRLWEKRMNRIHSWESYIFSNFLNYKDFLLVESYCENNNIDHRSYREKRMLVVG 382

Query: 185 AEIHMHMREFLR 196
            E +  ++ F+R
Sbjct: 383 IENYESVKSFIR 394


>gi|403348380|gb|EJY73625.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Oxytricha trifallax]
          Length = 532

 Score = 83.6 bits (205), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 45/129 (34%), Positives = 81/129 (62%), Gaps = 19/129 (14%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS--------------MSLTDS--SARKE 44
           +++DF D+GLV ++   R++ + I + L + L                ++ D+  S  ++
Sbjct: 265 ILRDFDDMGLVGME---RRQKFVITSLLQSFLQSQTGQLQKSSDSSSSNVIDAQRSQSQD 321

Query: 45  GFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQI 104
            F++VETNF++YAY++S L+  +LR F ++E   PNL+VG +T++SL  AF+ GI++ QI
Sbjct: 322 KFLIVETNFKVYAYTSSDLYRALLRQFIRVECIFPNLVVGTLTRKSLQKAFQRGISSGQI 381

Query: 105 ISFLQQNAH 113
           +SFL+ + H
Sbjct: 382 LSFLESHIH 390


>gi|399219100|emb|CCF75987.1| unnamed protein product [Babesia microti strain RI]
          Length = 612

 Score = 83.2 bits (204), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 65/104 (62%), Gaps = 4/104 (3%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQ 103
           E  ++V++NF++Y Y+ S L   +L    +++ + PNL++G +T+ S + AF  GITA+Q
Sbjct: 429 ESGIIVQSNFKVYVYTASPLQINVLAHLCELQSRTPNLVIGVLTRASAHAAFRAGITAKQ 488

Query: 104 IISFLQQNAHPRVADRI----PSVPENVCDQIRLWESDLNRVEM 143
           I  FL+ ++HP +   +      +P NV  Q+ +WE++ NR+ +
Sbjct: 489 ICQFLETHSHPILLQNVREGGSYLPNNVVTQLNMWEAERNRISL 532


>gi|429854513|gb|ELA29524.1| tfiih and nucleotide excision repair factor 3 complexes subunit
           [Colletotrichum gloeosporioides Nara gc5]
          Length = 509

 Score = 82.8 bits (203), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 63/101 (62%), Gaps = 5/101 (4%)

Query: 43  KEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAE 102
           + G ++VETN+R+YAY  + L   +L LF K++ +  +++ G +T+ S+ NA E GITA+
Sbjct: 369 RSGSIIVETNYRVYAYGQTPLQIAVLSLFCKLKLRFADMVSGRLTRNSIRNAVERGITAD 428

Query: 103 QIISFLQQNAHPRV-----ADRIPSVPENVCDQIRLWESDL 138
           QIIS+L  +AH ++         P +P  V DQIRL   D+
Sbjct: 429 QIISYLASHAHEQMHRFAAVRNKPVLPPTVIDQIRLEYEDI 469


>gi|365758016|gb|EHM99881.1| Tfb2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 419

 Score = 82.4 bits (202), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 52/162 (32%), Positives = 82/162 (50%), Gaps = 48/162 (29%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLS--------------------------- 33
           M++D  D GLV  + +   +S F PTKLA  L+                           
Sbjct: 249 MLQDMRDYGLVFQKHLN--DSIFYPTKLALMLTSDTKTIISASNAMDSVLRQNREEPSVN 306

Query: 34  ------MSLTDSSARKE-------------GFVVVETNFRMYAYSTSKLHCEILRLFSKI 74
                  S TD++A  +             G ++VETNF++Y+YS S L   +L LF  +
Sbjct: 307 EDGANGKSTTDTTASDDLNKAGSKNQDIPDGSLIVETNFKIYSYSNSPLQIAVLSLFVHL 366

Query: 75  EYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV 116
           + +  N+++G IT+ES+ NA  NGITA+QII++L+ +AHP++
Sbjct: 367 KARFVNMVLGQITRESIRNALTNGITADQIIAYLETHAHPQM 408


>gi|392594381|gb|EIW83705.1| transcription factor Tfb2 [Coniophora puteana RWD-64-598 SS2]
          Length = 393

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 48/128 (37%), Positives = 70/128 (54%), Gaps = 26/128 (20%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTDSSARK-------------- 43
           M++D  D GLV  ++   K   F PT+LAT L   S SL  +   +              
Sbjct: 262 MLEDLRDYGLVWQRKASSKR--FSPTRLATTLTSVSPSLPTAGGSRNPGSAPGASSVANS 319

Query: 44  -------EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFE 96
                   GF+V+ETN+R+YAY+ + L   +L LF  ++Y+ PNL+VG +T+ES+  A  
Sbjct: 320 TGINTNDHGFIVLETNYRVYAYTDNPLQIAVLNLFITLKYRFPNLVVGMLTRESVRRALG 379

Query: 97  NGITAEQI 104
           NGI+AEQI
Sbjct: 380 NGISAEQI 387


>gi|82704497|ref|XP_726580.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23482045|gb|EAA18145.1| Drosophila melanogaster LD30622p-related [Plasmodium yoelii yoelii]
          Length = 926

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 69/106 (65%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           +++++NF++Y Y++S L   IL    +++ + PN++VG +T+ S+ NA+ + ITA+QII 
Sbjct: 746 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 805

Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           FL+  +HP        +P NV  Q++LWE++ +R+ +  +  ++ F
Sbjct: 806 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLTLEDSIVFKNF 851


>gi|70950974|ref|XP_744765.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56524852|emb|CAH80923.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 922

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 69/106 (65%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           +++++NF++Y Y++S L   IL    +++ + PN++VG +T+ S+ NA+ + ITA+QII 
Sbjct: 742 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 801

Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           FL+  +HP        +P NV  Q++LWE++ +R+ +  +  ++ F
Sbjct: 802 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLTLEDSIVFKNF 847


>gi|68070707|ref|XP_677265.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56497310|emb|CAH94094.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 929

 Score = 79.7 bits (195), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 69/106 (65%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           +++++NF++Y Y++S L   IL    +++ + PN++VG +T+ S+ NA+ + ITA+QII 
Sbjct: 749 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 808

Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEF 152
           FL+  +HP        +P NV  Q++LWE++ +R+ +  +  ++ F
Sbjct: 809 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLILEDSIVFKNF 854


>gi|397564488|gb|EJK44232.1| hypothetical protein THAOC_37249 [Thalassiosira oceanica]
          Length = 971

 Score = 79.3 bits (194), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 61/196 (31%), Positives = 99/196 (50%), Gaps = 20/196 (10%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFR------ 54
           ++KDFA  GL+ + +V  K + F PT++A NL  S     +R+   ++            
Sbjct: 358 LMKDFARFGLLFVCRVAGKTA-FYPTRVAVNLVASNEKGGSRQSDALLPSVAATRSLEEA 416

Query: 55  MYAYSTSKLHCEILRLFSKIEYQ-LPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 113
           M A   ++ H  +      I Y+ LPN++   +T++S+ +AF  G+TA+QI+ FLQ +AH
Sbjct: 417 MNAPDPTRSHLAV------ISYKRLPNVVFFHLTRDSIKSAFRLGVTADQILRFLQVHAH 470

Query: 114 PRV-ADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRD--VFEAACDYARDRSG 170
           P + +   P  P NV DQI LW+ +  RV M     ++    RD   F A   YA D   
Sbjct: 471 PMMRSGNQPMPPANVRDQILLWDRERRRVVMDEVWVHQ---CRDDAEFSAVGMYASDSEA 527

Query: 171 LLWEDSKKMRLVVNAE 186
           L W ++   +L +  +
Sbjct: 528 LAWGNAHTNKLYLQCD 543


>gi|428181353|gb|EKX50217.1| TFB2 transcription factor B2 nucleotide excision repair [Guillardia
           theta CCMP2712]
          Length = 463

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 63/209 (30%), Positives = 93/209 (44%), Gaps = 32/209 (15%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSM-SLTDSSARKEGFVVVETNFRMYAYS 59
           +I    DLG V  +   R+  +F PT L  NL   S   S    +  V  E         
Sbjct: 267 LITSLHDLGAVYFKPNDRQ--YFFPTPLIVNLCTESEVPSGPSTDAGVADEF-------- 316

Query: 60  TSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 119
           +++    ILRLF++  Y+ P++I+  IT+ES+ NA         II +L+ +AHP+  + 
Sbjct: 317 SAEPAAGILRLFTRPIYKFPHMIIAVITRESIRNAL--------IIEYLRMHAHPQCLEN 368

Query: 120 IPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYAR------------- 166
            P VPE V DQI  WE +  R+   PA  Y  F S +   A    AR             
Sbjct: 369 WPIVPEVVTDQICFWEQERCRIRAEPAVAYHNFFSGEAHRACEKEARRLKFNDKGPGEVA 428

Query: 167 DRSGLLWEDSKKMRLVVNAEIHMHMREFL 195
            R   L+ D+ +  LVV       +R+F+
Sbjct: 429 KREACLYSDTSEQLLVVPEAADPLIRKFI 457


>gi|440295536|gb|ELP88449.1| TFIIH basal transcription factor complex p52 subunit, putative
           [Entamoeba invadens IP1]
          Length = 531

 Score = 76.6 bits (187), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 47/161 (29%), Positives = 85/161 (52%), Gaps = 20/161 (12%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
            + ETN+ +Y Y+ S    ++  LF +   QL NL VG IT +++  AF  GIT E +++
Sbjct: 290 TITETNYTVYIYTESFFQVKLYSLFIRQNLQLTNLCVGRITYDTVTEAFLKGITNEMLVN 349

Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPA---HYYEEFP----SRDVFE 159
           FLQ           P++P+N+  QI LW+ +LNR++   A    +YE  P     ++++ 
Sbjct: 350 FLQ-----------PNLPKNIQAQIDLWKRELNRLKEVRAVKFRFYE--PELEVQKELYH 396

Query: 160 AACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQNK 200
                A    G+++   +++ L V  ++   ++EFLR + +
Sbjct: 397 LTKSEAEKMKGVVFYKEEELTLFVRYDVAEKIKEFLRRKTR 437


>gi|449521313|ref|XP_004167674.1| PREDICTED: RNA polymerase II transcription factor B subunit 2-like,
           partial [Cucumis sativus]
          Length = 296

 Score = 76.6 bits (187), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 38/42 (90%), Positives = 39/42 (92%), Gaps = 1/42 (2%)

Query: 2   IKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARK 43
           IKD ADLGLVKLQQ GRKESWFIPTKLATNLSMSL DSS+RK
Sbjct: 255 IKDLADLGLVKLQQ-GRKESWFIPTKLATNLSMSLADSSSRK 295


>gi|67483069|ref|XP_656826.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56474050|gb|EAL51440.1| hypothetical protein EHI_192400 [Entamoeba histolytica HM-1:IMSS]
 gi|449706224|gb|EMD46115.1| tfiih basal transcription factor complex p52 subunit, putative
           [Entamoeba histolytica KU27]
          Length = 484

 Score = 72.8 bits (177), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 50/163 (30%), Positives = 82/163 (50%), Gaps = 28/163 (17%)

Query: 5   FADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR------KEGFVVVETNFRMYAY 58
           F DLG++    V  KE       +A    MSL  S+A       K+   + E N+ +Y Y
Sbjct: 249 FNDLGII----VYYKEQ----NVMAITPLMSLLRSNANIPSDLVKKPKTITEINYSVYIY 300

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S+   ++ RLF +  +QL NL VG +  +S+  AF  GIT+E +I+FLQ         
Sbjct: 301 TESQFQVDLYRLFIRKNFQLANLWVGKLNHQSVTEAFAKGITSEMLINFLQ--------- 351

Query: 119 RIPSVPENVCDQIRLWESDLNRVE---MTPAHYYEEFPSRDVF 158
             P++P  +  QI LW+ ++NR +   +    +Y++   R +F
Sbjct: 352 --PNLPRTIQKQIDLWKKEINRFKEEHVVRYRFYDDAIGRQLF 392


>gi|160331881|ref|XP_001712647.1| hypothetical protein HAN_3g523 [Hemiselmis andersenii]
 gi|159766096|gb|ABW98322.1| hypothetical protein HAN_3g523 [Hemiselmis andersenii]
          Length = 432

 Score = 72.8 bits (177), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 32/97 (32%), Positives = 60/97 (61%), Gaps = 4/97 (4%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIIS 106
           +++E+N+R+Y Y  +  + ++  +FS+I Y LPN  VG IT+ S+  A ++GIT + I+ 
Sbjct: 300 IIIESNYRIYVYQKNSNNNQLFLIFSEILYILPNFFVGEITETSISRALKSGITIQNILG 359

Query: 107 FLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEM 143
           F+++N H        S+P  + +Q RLWE    ++++
Sbjct: 360 FIRENLHCVCR----SIPSTILNQFRLWEFQKKKIKI 392


>gi|380476902|emb|CCF44454.1| RNA polymerase II transcription factor B subunit 2 [Colletotrichum
           higginsianum]
          Length = 126

 Score = 72.8 bits (177), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 43/126 (34%), Positives = 67/126 (53%), Gaps = 9/126 (7%)

Query: 81  LIVGAITKESLYNAFENGITAEQIISFLQQNAHPR------VADRIPSVPENVCDQIRLW 134
           ++ G +T+ S+ NA E GITA+QIIS+L  +AH +      V  R P +P  V DQIRLW
Sbjct: 1   MVSGRLTRNSIRNAVERGITADQIISYLAAHAHEQMHRMAAVRSR-PVLPPTVVDQIRLW 59

Query: 135 ESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHMHMREF 194
           + +  R+  T    + +F S   +E    YA +   L+W + K       A  H  +R++
Sbjct: 60  QLETERMTTTSGFLFRDFDSPKEYEVIAGYASEIGVLVWRNDKLGMFF--ASKHEQIRDY 117

Query: 195 LRGQNK 200
           L+ + K
Sbjct: 118 LKLRKK 123


>gi|167395228|ref|XP_001741280.1| TFIIH basal transcription factor complex p52 subunit [Entamoeba
           dispar SAW760]
 gi|165894177|gb|EDR22239.1| TFIIH basal transcription factor complex p52 subunit, putative
           [Entamoeba dispar SAW760]
          Length = 489

 Score = 72.8 bits (177), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 50/163 (30%), Positives = 82/163 (50%), Gaps = 28/163 (17%)

Query: 5   FADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSAR------KEGFVVVETNFRMYAY 58
           F DLG++    V  KE       +A    MSL  S+A       K+   + E N+ +Y Y
Sbjct: 249 FNDLGII----VYYKEQ----NVMAITPLMSLLRSNANIPSDLVKKPKTITEINYSVYIY 300

Query: 59  STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 118
           + S+   ++ RLF +  +QL NL VG +  +S+  AF  GIT+E +I+FLQ         
Sbjct: 301 TESQFQVDLYRLFIRKNFQLANLWVGKLNHQSVTEAFAKGITSEMLINFLQ--------- 351

Query: 119 RIPSVPENVCDQIRLWESDLNRVE---MTPAHYYEEFPSRDVF 158
             P++P  +  QI LW+ ++NR +   +    +Y++   R +F
Sbjct: 352 --PNLPRTIQKQIDLWKKEINRFKEEHVVRYKFYDDAIGRQLF 392


>gi|238570605|ref|XP_002386883.1| hypothetical protein MPER_14683 [Moniliophthora perniciosa FA553]
 gi|215440101|gb|EEB87813.1| hypothetical protein MPER_14683 [Moniliophthora perniciosa FA553]
          Length = 69

 Score = 72.4 bits (176), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 33/69 (47%), Positives = 49/69 (71%), Gaps = 5/69 (7%)

Query: 40  SARKEGFVVVETNFRMYAYSTS-----KLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 94
           S  ++GF+V+ETN+R+YAY+ S      L   IL LF  ++Y+ PNL+VG+IT+ES+  A
Sbjct: 1   SGSQQGFIVLETNYRVYAYTGSFSLHNPLQTAILHLFVSLKYRFPNLVVGSITRESVKKA 60

Query: 95  FENGITAEQ 103
             NGI+A+Q
Sbjct: 61  LSNGISADQ 69


>gi|330038479|ref|XP_003239608.1| TATA binding protein of transcription factor IIB-like protein
           [Cryptomonas paramecium]
 gi|327206532|gb|AEA38710.1| TATA binding protein of transcription factor IIB-like protein
           [Cryptomonas paramecium]
          Length = 467

 Score = 69.3 bits (168), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 60/105 (57%), Gaps = 13/105 (12%)

Query: 36  LTDSSARKEGF-VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNA 94
           L+     KE F +++E+N+R+YAY+ +    +IL  F  + Y LP L VG I K S++ A
Sbjct: 307 LSKKIKTKECFQMIIESNYRVYAYNNNTFLNKILIQFCDLIYNLPGLFVGEINKTSIHKA 366

Query: 95  FENGITAEQIISFLQQNAHPRVADRIPSVPENVC----DQIRLWE 135
              GI A+ IISF+ +N+H         + +N C    +QIR+WE
Sbjct: 367 IHKGINAKNIISFILKNSH--------YIHQNSCNPIINQIRIWE 403


>gi|269861689|ref|XP_002650538.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
 gi|220065990|gb|EED43520.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
          Length = 291

 Score = 68.9 bits (167), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 36/158 (22%), Positives = 82/158 (51%), Gaps = 16/158 (10%)

Query: 36  LTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAF 95
           L  ++   + F+ +ETNF++Y+Y+ +     IL LFS I +++P +I   + ++ + N  
Sbjct: 146 LYKNNTEIQKFIYLETNFKLYSYTNNLCDRAILDLFSNIVFEVPGMIKSILDEKKIMNTL 205

Query: 96  ENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQ----IRLWESDLNRVEMTPAHYYEE 151
           + GI  +QI+ +++++              N CDQ    I +W+   NR+     + Y +
Sbjct: 206 DRGIKIQQIVEYIRRHT------------INNCDQILHMIEIWDKQRNRINKQIGYLYSD 253

Query: 152 FPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHM 189
           F + + + +  +  +  + LL+++ ++  L V  ++H+
Sbjct: 254 FTNYNEYRSVLEQIKTDTDLLYKNEEERILFVKNKLHI 291


>gi|269860316|ref|XP_002649880.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
 gi|220066720|gb|EED44193.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
          Length = 369

 Score = 68.9 bits (167), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 36/158 (22%), Positives = 82/158 (51%), Gaps = 16/158 (10%)

Query: 36  LTDSSARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAF 95
           L  ++   + F+ +ETNF++Y+Y+ +     IL LFS I +++P +I   + ++ + N  
Sbjct: 224 LYKNNTEIQKFIYLETNFKLYSYTNNLCDRAILDLFSNIVFEVPGMIKSILDEKKIMNTL 283

Query: 96  ENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQ----IRLWESDLNRVEMTPAHYYEE 151
           + GI  +QI+ +++++              N CDQ    I +W+   NR+     + Y +
Sbjct: 284 DRGIKIQQIVEYIRRHT------------INNCDQILHMIEIWDKQRNRINKQIGYLYSD 331

Query: 152 FPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIHM 189
           F + + + +  +  +  + LL+++ ++  L V  ++H+
Sbjct: 332 FTNYNEYRSVLEQIKTDTDLLYKNEEERILFVKNKLHI 369


>gi|123488965|ref|XP_001325283.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121908180|gb|EAY13060.1| hypothetical protein TVAG_212420 [Trichomonas vaginalis G3]
          Length = 460

 Score = 67.8 bits (164), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 42/178 (23%), Positives = 81/178 (45%), Gaps = 9/178 (5%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
           ++  F  +GLV +        +F PTK   N       +  + EG+++++TNF++ A+  
Sbjct: 257 LLSPFKTIGLVYIDG-----DYFYPTKSILNFFGK--SNIFQTEGWMLIDTNFKITAFPK 309

Query: 61  SKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRI 120
           S LH  +L+ F+ + Y+ P      I+  S   A   G T + II FL+ N   ++    
Sbjct: 310 SPLHTALLKKFANVTYEFPGFASAFISPNSFREALNQGTTLDDIIGFLKSNLSHKIGSG- 368

Query: 121 PSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKK 178
             +P  V  Q  +W     R+ +T      ++ + +    A   A+  +G ++  ++K
Sbjct: 369 -QIPSAVMKQFYVWRDQRERLTVTHECIMRQYTNPNDANLAAQCAKQLAGYVYGPAEK 425


>gi|167535774|ref|XP_001749560.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771952|gb|EDQ85611.1| predicted protein [Monosiga brevicollis MX1]
          Length = 417

 Score = 66.2 bits (160), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 54/94 (57%), Gaps = 5/94 (5%)

Query: 40  SARKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGI 99
           S  + GF++ ETNF +YAY+ S +   +L  F+ + Y+LP +    IT+ S+  A   GI
Sbjct: 287 SQTETGFLLAETNFHVYAYTDSAVKVALLSKFATLTYRLPIMTTAIITRTSVRRALSQGI 346

Query: 100 TAEQIISFLQQNA-----HPRVADRIPSVPENVC 128
           TA+QI+ F+Q+++     H   A   P+ P++  
Sbjct: 347 TADQILRFMQRSSLLLRHHDETAPTRPTSPQHAT 380


>gi|294950275|ref|XP_002786548.1| tfiih, polypeptide, putative [Perkinsus marinus ATCC 50983]
 gi|239900840|gb|EER18344.1| tfiih, polypeptide, putative [Perkinsus marinus ATCC 50983]
          Length = 580

 Score = 64.7 bits (156), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/153 (29%), Positives = 75/153 (49%), Gaps = 36/153 (23%)

Query: 30  TNLSMSLTDSSA--RKEGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQL-------PN 80
           + + MS+ D++A   K   + V++NF + AY+TS L   +L  F +++ QL       PN
Sbjct: 376 SKMMMSVDDTTAITNKSCRLFVDSNFAVTAYTTSSLDLRLLGTFVQLQRQLGDGREYDPN 435

Query: 81  ---LIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR------------------ 119
               ++G +T+ S+ +A + G+T+E IIS+L+ +  PR A                    
Sbjct: 436 DFGCVLGTLTQSSVQSAAQRGVTSEYIISYLKSHVDPRAAHMGSQGGRSSAATANTGAAR 495

Query: 120 ----IPSVPENVCDQIRLWESDL--NRVEMTPA 146
               I  +P NV  QI LWE +   NR+ + P 
Sbjct: 496 GEKFIDGIPANVVTQITLWEREAIHNRLRIDPG 528


>gi|340508252|gb|EGR33999.1| hypothetical protein IMG5_027880 [Ichthyophthirius multifiliis]
          Length = 454

 Score = 63.5 bits (153), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 69/129 (53%), Gaps = 17/129 (13%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNL---SMSLTDSSARKEGFVVVETNFRMYA 57
           ++ D A LG++K   +G+++  F  T L       S+ +   +A+    ++VETNF +YA
Sbjct: 261 ILSDLAKLGMIKEFDLGQQKKVFGITPLIWQFCYRSIDIKTINAK----IIVETNFNLYA 316

Query: 58  Y----------STSKLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISF 107
           Y          S SK   ++L+ FSKI Y  P+LIV  +T+  +  AF  GIT++ +I F
Sbjct: 317 YLDYNPQNKTFSESKYIRDLLKKFSKIHYTFPHLIVAQLTEAKMKQAFNQGITSKLLIEF 376

Query: 108 LQQNAHPRV 116
             + +  ++
Sbjct: 377 FHKTSDAKL 385


>gi|47212202|emb|CAF90416.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 471

 Score = 60.5 bits (145), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 45/71 (63%), Gaps = 1/71 (1%)

Query: 76  YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWE 135
           Y+ PN++V  +T+ES+  A  NGITA+Q ++F    + P    + P +P  + DQIRLWE
Sbjct: 282 YRFPNVVVAQVTRESVQQAIANGITAQQRLTF-DLWSSPTPLSQTPVLPPTITDQIRLWE 340

Query: 136 SDLNRVEMTPA 146
            + +R++ T A
Sbjct: 341 LERDRLQFTEA 351


>gi|162605818|ref|XP_001713424.1| hypothetical protein GTHECHR3089 [Guillardia theta]
 gi|13794356|gb|AAK39733.1|AF083031_90 hypothetical protein [Guillardia theta]
          Length = 180

 Score = 59.3 bits (142), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 38/137 (27%), Positives = 66/137 (48%), Gaps = 8/137 (5%)

Query: 5   FADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGF-VVVETNFRMYAYSTSKL 63
           F ++GL+ L   G+K    I  K+       +     + E F +++E NFR+Y +     
Sbjct: 5   FYNMGLINL--TGKKTIQ-ISNKIINLEKNGIIIQKKKTERFKIIIEVNFRIYIFKYFDF 61

Query: 64  HCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSV 123
                +  SK  Y LP   VG I +      F+ G+T++ I+ F+++N H    ++IPS 
Sbjct: 62  SNSFFKFISKKIYSLPYFYVGEINENYSRRLFQLGVTSDNILKFIKKNLH-YTCNKIPST 120

Query: 124 PENVCDQIRLWESDLNR 140
            E   D++R+WE    +
Sbjct: 121 FE---DRLRIWEISFKK 134


>gi|401419872|ref|XP_003874425.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322490661|emb|CBZ25923.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 559

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/144 (27%), Positives = 61/144 (42%), Gaps = 49/144 (34%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQL-PNLIVGAITKESLYNAFENGITAEQII 105
           ++ ETNFR+YAY+       IL  F+++E  +  NL    +T++S   A   GITA Q++
Sbjct: 359 IITETNFRLYAYTDDADLLNILNQFAELEEIVNGNLHCYRVTRDSFAAAMRKGITAAQVL 418

Query: 106 SFLQQNAHPRVADR---------------------------------IPS---------- 122
            FL   AHP +  R                                 +P           
Sbjct: 419 RFLSLRAHPSMLRRHGEREESGDTAAPSSAGGVSSSLTFSAPHRRGAVPGTCGGTTAIDA 478

Query: 123 -----VPENVCDQIRLWESDLNRV 141
                VP++ CDQ+R+WES+ +RV
Sbjct: 479 VSTIVVPQSFCDQMRMWESECHRV 502


>gi|146103275|ref|XP_001469523.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134073893|emb|CAM72632.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 557

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 59/144 (40%), Gaps = 49/144 (34%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQL-PNLIVGAITKESLYNAFENGITAEQII 105
           ++ ETNFR+YAY+       IL  F+K+E  +  NL    +T++S  +A   GITA Q++
Sbjct: 357 IITETNFRLYAYTDDADLLNILNQFAKLEEIVNGNLHCYRVTRDSFASAMRKGITAAQVL 416

Query: 106 SFLQQNAHPR-------------------------------------VADRIPS------ 122
            FL   AHP                                      VA    S      
Sbjct: 417 RFLSLRAHPSMLRRHGEREENGDTTSPSSAGGASSSFTFSASPRRRAVAGTCGSTTSIDA 476

Query: 123 -----VPENVCDQIRLWESDLNRV 141
                VP++ CDQ+ +WES+  RV
Sbjct: 477 VSTLVVPQSFCDQMHMWESECRRV 500


>gi|398024082|ref|XP_003865202.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322503439|emb|CBZ38524.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 557

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 59/144 (40%), Gaps = 49/144 (34%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQL-PNLIVGAITKESLYNAFENGITAEQII 105
           ++ ETNFR+YAY+       IL  F+K+E  +  NL    +T++S  +A   GITA Q++
Sbjct: 357 IITETNFRLYAYTDDADLLNILNQFAKLEEIVNGNLHCYRVTRDSFASAMRKGITAAQVL 416

Query: 106 SFLQQNAHPR-------------------------------------VADRIPS------ 122
            FL   AHP                                      VA    S      
Sbjct: 417 RFLSLRAHPSMLRRHGEREENGDTTSPSSAGGASSSFTFSASPRRRAVAGTCGSTTSIDA 476

Query: 123 -----VPENVCDQIRLWESDLNRV 141
                VP++ CDQ+ +WES+  RV
Sbjct: 477 VSTLVVPQSFCDQMHMWESECRRV 500


>gi|154345556|ref|XP_001568715.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134066057|emb|CAM43844.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 561

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 58/143 (40%), Gaps = 48/143 (33%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQL-PNLIVGAITKESLYNAFENGITAEQII 105
           ++ ETNFR+YAY+       IL  F+++E  +  NL    +T++S   A   GITA Q++
Sbjct: 362 IITETNFRLYAYTEDSDLLNILHQFAELEEVVNGNLHCYRVTRDSFAAAMRKGITATQVL 421

Query: 106 SFLQQNAHPRVADR------------------IPS------------------------- 122
            FL   AHP +  R                   PS                         
Sbjct: 422 RFLSLRAHPSMLRRYGEKEENGDTKSPSPAGGAPSSTFSAPPRRAAVAGTRGSTTFIDAV 481

Query: 123 ----VPENVCDQIRLWESDLNRV 141
               VP + CDQ+R+WE +  RV
Sbjct: 482 TTLVVPHSFCDQLRMWERECRRV 504


>gi|157876614|ref|XP_001686652.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68129727|emb|CAJ09033.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 626

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 39/144 (27%), Positives = 58/144 (40%), Gaps = 49/144 (34%)

Query: 47  VVVETNFRMYAYSTSKLHCEILRLFSKIEYQLP-NLIVGAITKESLYNAFENGITAEQII 105
           ++ ETNFR++AY+       IL  F+++E  +  NL    +T+ S   A   GITA Q++
Sbjct: 426 IITETNFRLFAYTNDADLLNILNQFAELEEIVNGNLHCYRVTRASFAAAMRKGITAAQVL 485

Query: 106 SFLQQNAHPRVADR---------------------------IPS---------------- 122
            FL   AHP +  R                            P                 
Sbjct: 486 RFLSLRAHPSMLRRHGEREENGDTASPSSAGGASSSFTFSASPRRGAVAGTCGSTTSIDA 545

Query: 123 -----VPENVCDQIRLWESDLNRV 141
                VP++ CDQ+R+WES+  RV
Sbjct: 546 VSTLVVPQSFCDQMRMWESECRRV 569


>gi|145509120|ref|XP_001440504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407721|emb|CAK73107.1| unnamed protein product [Paramecium tetraurelia]
          Length = 400

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/170 (27%), Positives = 75/170 (44%), Gaps = 39/170 (22%)

Query: 1   MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKEGFVVVETNFRMYAYST 60
           +IKD  + GL   Q    + ++     L    S+S+          ++VE NFR+YAY  
Sbjct: 231 VIKDLQEFGLTHYQDQQMRITFLFWNFLYEPPSVSIGIQCN-----IIVEANFRIYAYLN 285

Query: 61  S------KLHCEILRLFSKIEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 114
           S      ++ C +L LFS+I+ +   LI+  +++ S+  A    + A+QII FL+ N+  
Sbjct: 286 SGDQQEEEILCNLLNLFSEIKKRFKILIIADLSESSIRKAVRENLQAKQIIQFLEMNSKQ 345

Query: 115 ------------------------RV-ADRIPS---VPENVCDQIRLWES 136
                                   RV  + +P    +P NV  QI+ WES
Sbjct: 346 LKQQAATEKQHKSNDELKKRLDFLRVFQEGVPEKAIIPHNVVQQIQYWES 395


>gi|342184190|emb|CCC93671.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 499

 Score = 53.9 bits (128), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 38/127 (29%), Positives = 58/127 (45%), Gaps = 29/127 (22%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
           E  ++ ETNFR+YAY+ +     IL  F+  E  +  +IV   +T+ S   A   GI A+
Sbjct: 314 EDTIITETNFRLYAYTRNPDLLGILDQFAVKEIDIDGIIVCYRVTRSSFALALRKGIDAK 373

Query: 103 QIISFLQQNAHPR-----------VADRI----------PS-------VPENVCDQIRLW 134
            I+ FL   AHP            V+D +          PS       +P++ CDQ+  W
Sbjct: 374 HILQFLTLKAHPSMVRKDGGASRDVSDNLALNSVAGFGKPSEVHQSTVIPQSFCDQLMTW 433

Query: 135 ESDLNRV 141
           E +  R+
Sbjct: 434 ERECRRL 440


>gi|340057132|emb|CCC51474.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 502

 Score = 53.9 bits (128), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 59/130 (45%), Gaps = 34/130 (26%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
           E  ++ ETNFR+YAY+ +K    IL  F+  E ++  +IV   +T+ S   A   GI A+
Sbjct: 315 EDTIITETNFRLYAYTRNKHLLAILEQFAIKEAEVDGMIVCFRVTRASFAAALRKGIGAQ 374

Query: 103 QIISFLQQNAH-----------PR--------------------VADRIPSVPENVCDQI 131
            I+ FL+  AH           PR                     AD+I  +P + CDQ+
Sbjct: 375 HIVQFLRVKAHSSMLKHQLTCDPRDSAGLAATSSRVTPTDTPWTHADKI--IPRSFCDQL 432

Query: 132 RLWESDLNRV 141
             WE +  R+
Sbjct: 433 FTWERECRRL 442


>gi|71747422|ref|XP_822766.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70832434|gb|EAN77938.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 500

 Score = 52.8 bits (125), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 57/127 (44%), Gaps = 29/127 (22%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
           E  ++ ETNFR+YAY+ +     IL  F+  E  +  ++V   +T+ S  +A   GI A 
Sbjct: 315 EDTIITETNFRLYAYTKNPDMLRILDQFAVKEVDVVGMVVCYRVTRASFASALAKGIGAN 374

Query: 103 QIISFLQQNAHPRVADR------------IPS----------------VPENVCDQIRLW 134
            I+ FL   AHP +  +            +P+                +P++ CDQ+  W
Sbjct: 375 HILQFLTVKAHPSMIKQSNSEAGDPSCPVLPAASVGFGNTSEYRQGNIIPQSFCDQLFTW 434

Query: 135 ESDLNRV 141
           E +  R+
Sbjct: 435 ERECRRL 441


>gi|261332554|emb|CBH15549.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 500

 Score = 52.8 bits (125), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 57/127 (44%), Gaps = 29/127 (22%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
           E  ++ ETNFR+YAY+ +     IL  F+  E  +  ++V   +T+ S  +A   GI A 
Sbjct: 315 EDTIITETNFRLYAYTKNPDMLRILDQFAVKEVDVVGMVVCYRVTRASFASALAKGIGAN 374

Query: 103 QIISFLQQNAHPRVADR------------IPS----------------VPENVCDQIRLW 134
            I+ FL   AHP +  +            +P+                +P++ CDQ+  W
Sbjct: 375 HILQFLTVKAHPSMIKQSNSEAGDPSCPVLPAASAGFGNTSEYRQGNIIPQSFCDQLFTW 434

Query: 135 ESDLNRV 141
           E +  R+
Sbjct: 435 ERECRRL 441


>gi|324515605|gb|ADY46257.1| General transcription factor IIH subunit 4 [Ascaris suum]
          Length = 245

 Score = 50.4 bits (119), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 49/76 (64%), Gaps = 7/76 (9%)

Query: 7   DLGLVKLQQVGRKESWFIPTKLATNLSMSLTDSSARKE-----GFVVVETNFRMYAYSTS 61
           +LGLV +++  RK+ +F  T L  +L++S +  +A  E     G+++VETN+R+YAY+ S
Sbjct: 170 ELGLVFIRK--RKDGYFFITPLFAHLTVSSSSETAVAEKRTHSGYIIVETNYRVYAYTDS 227

Query: 62  KLHCEILRLFSKIEYQ 77
            L   IL  F+++ Y+
Sbjct: 228 SLQLAILSTFTEMLYR 243


>gi|414589532|tpg|DAA40103.1| TPA: hypothetical protein ZEAMMB73_417245 [Zea mays]
          Length = 877

 Score = 50.4 bits (119), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/48 (50%), Positives = 33/48 (68%), Gaps = 9/48 (18%)

Query: 135 ESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLV 182
           ++DLNRVEM P++ YE+FPS+         ARD   LL +DSK+MRL+
Sbjct: 824 QTDLNRVEMIPSYLYEDFPSK---------ARDNGYLLCDDSKRMRLI 862


>gi|146161819|ref|XP_001033510.2| hypothetical protein TTHERM_00313290 [Tetrahymena thermophila]
 gi|146142827|gb|EAR85847.2| hypothetical protein TTHERM_00313290 [Tetrahymena thermophila
           SB210]
          Length = 484

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 45/87 (51%), Gaps = 17/87 (19%)

Query: 47  VVVETNFRMYAY-------------STSKLHCE----ILRLFSKIEYQLPNLIVGAITKE 89
           + +ETNF +YAY              T K   E    +L  F  + Y+ P+LIVG ++++
Sbjct: 322 ITIETNFNVYAYIDQEDPKEIKDSKDTLKDQTEHIEKLLASFCDLHYKFPHLIVGQLSEQ 381

Query: 90  SLYNAFENGITAEQIISFLQQNAHPRV 116
              + F+NG+TA  +I F  + + P++
Sbjct: 382 KTKDQFKNGLTARLLIQFFNKTSDPQM 408


>gi|407397449|gb|EKF27748.1| DNA repair and transcription factor protein [Trypanosoma cruzi
           marinkellei]
          Length = 501

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 57/128 (44%), Gaps = 30/128 (23%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
           E  ++ E NFR+YAY+ +     IL  F++ + ++  +I    +T+++   A + GI + 
Sbjct: 315 EDTIITEANFRLYAYTKNPDLLNILNQFAERDAEIDQMIACYRVTRKTFAAALKRGIGSS 374

Query: 103 QIISFLQQNAHP------RVAD----------------------RIPS-VPENVCDQIRL 133
            I+ FL   AHP      R  D                      RI   +P++ CDQ+  
Sbjct: 375 HILQFLAVKAHPSMLRNHRDGDSKKTSGLSVLGAGTRFKNTSDIRIDGIIPQSFCDQLMT 434

Query: 134 WESDLNRV 141
           WE +  R+
Sbjct: 435 WEKECRRL 442


>gi|71650598|ref|XP_813994.1| DNA repair and transcription factor protein [Trypanosoma cruzi
           strain CL Brener]
 gi|70878928|gb|EAN92143.1| DNA repair and transcription factor protein, putative [Trypanosoma
           cruzi]
          Length = 501

 Score = 47.0 bits (110), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 30/128 (23%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
           E  ++ E NFR+YAY+ +     IL  F++ + ++  +I    +T+ +   A + GI + 
Sbjct: 315 EDTIITEANFRLYAYTKNPDLLNILNQFAERDAEVDQMIACYRVTRRTFAAALKRGIGSS 374

Query: 103 QIISFLQQNAHP------RVAD----------------------RIPS-VPENVCDQIRL 133
            I+ FL   AHP      R  D                      RI   +P++ CDQ+  
Sbjct: 375 HILQFLAVKAHPSMLMHHREGDSKKASGLSVLGAGTRFKNASDIRIDEIIPQSFCDQLMT 434

Query: 134 WESDLNRV 141
           WE +  R+
Sbjct: 435 WEKECRRL 442


>gi|71425201|ref|XP_813043.1| DNA repair and transcription factor protein [Trypanosoma cruzi
           strain CL Brener]
 gi|70877891|gb|EAN91192.1| DNA repair and transcription factor protein, putative [Trypanosoma
           cruzi]
          Length = 501

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 30/128 (23%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
           E  ++ E NFR+YAY+ +     IL  F++ + ++  +I    +T+ +   A + GI + 
Sbjct: 315 EDTIITEANFRLYAYTRNPDLLNILNQFAERDAEVDQMIACYRVTRRTFAAALKRGIGSS 374

Query: 103 QIISFLQQNAHP------RVAD----------------------RIPS-VPENVCDQIRL 133
            I+ FL   AHP      R  D                      RI   +P++ CDQ+  
Sbjct: 375 HILQFLAVKAHPSMLRHHREGDSKKASGLSVLGAGTRFKNASDIRIDEIIPQSFCDQLMT 434

Query: 134 WESDLNRV 141
           WE +  R+
Sbjct: 435 WEKECRRL 442


>gi|407835017|gb|EKF99108.1| DNA repair and transcription factor protein, putative [Trypanosoma
           cruzi]
          Length = 501

 Score = 45.8 bits (107), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 30/128 (23%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEYQLPNLIVG-AITKESLYNAFENGITAE 102
           E  ++ E NFR+YAY+ +     IL  F++ + ++  +I    +T+ +   A + GI + 
Sbjct: 315 EDTIITEANFRLYAYTRNPDLLNILNQFAERDAEVDQMIACYRVTRRTFAAALKRGIGSL 374

Query: 103 QIISFLQQNAHP------RVAD----------------------RIPS-VPENVCDQIRL 133
            I+ FL   AHP      R  D                      RI   +P++ CDQ+  
Sbjct: 375 HILQFLAVKAHPSMLRHHREGDSKKASGLSVLGAGTRFKKASDIRIDEIIPQSFCDQLMT 434

Query: 134 WESDLNRV 141
           WE +  R+
Sbjct: 435 WEKECRRL 442


>gi|197107449|pdb|3DOM|A Chain A, Crystal Structure Of The Complex Between Tfb5 And The
           C-Terminal Domain Of Tfb2
 gi|197107451|pdb|3DOM|C Chain C, Crystal Structure Of The Complex Between Tfb5 And The
           C-Terminal Domain Of Tfb2
          Length = 108

 Score = 43.9 bits (102), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 43/83 (51%)

Query: 114 PRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLW 173
           P   + +  +P  V DQIRLW+ +L+RV       Y +F +   +     YA+D   LLW
Sbjct: 19  PNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLLW 78

Query: 174 EDSKKMRLVVNAEIHMHMREFLR 196
           +D KK +  ++ E +  + +F +
Sbjct: 79  KDDKKKKFFISKEGNSQVLDFAK 101


>gi|47194594|emb|CAF93785.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 123

 Score = 42.7 bits (99), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 28/111 (25%), Positives = 46/111 (41%), Gaps = 23/111 (20%)

Query: 112 AHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYA------ 165
           + P    + P +P  + DQIRLWE + +R++ T    Y +F S+  FE   D A      
Sbjct: 13  SSPTPLTQTPVLPPTITDQIRLWELERDRLQFTEGVLYNQFLSQTDFEVLRDRAQVGLFH 72

Query: 166 -----------------RDRSGLLWEDSKKMRLVVNAEIHMHMREFLRGQN 199
                            +    LLW+D+    +VV    H  +++F + Q 
Sbjct: 73  CCYTIRQKVHQQLSPVLQSLGCLLWQDAAHRVMVVTLWGHSEVKKFWKRQK 123


>gi|241725008|ref|XP_002412209.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
 gi|215505416|gb|EEC14910.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
          Length = 75

 Score = 42.7 bits (99), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 22/68 (32%), Positives = 36/68 (52%)

Query: 129 DQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNAEIH 188
           DQ+RLWE + +R        Y +F S+  F+   +YA D   L+W++  K  +VVN   H
Sbjct: 2   DQLRLWELERDRFNFREGVLYSQFISQSDFQLLRNYASDLGVLIWDNPSKRVMVVNRNGH 61

Query: 189 MHMREFLR 196
             ++ F +
Sbjct: 62  DEVKRFWK 69


>gi|197107383|pdb|3DGP|A Chain A, Crystal Structure Of The Complex Between Tfb5 And The
           C-Terminal Domain Of Tfb2
          Length = 80

 Score = 40.8 bits (94), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 38/71 (53%)

Query: 126 NVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAACDYARDRSGLLWEDSKKMRLVVNA 185
            V DQIRLW+ +L+RV       Y +F +   +     YA+D   LLW+D KK +  ++ 
Sbjct: 3   TVVDQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISK 62

Query: 186 EIHMHMREFLR 196
           E +  + +F +
Sbjct: 63  EGNSQVLDFAK 73


>gi|337743331|gb|AEI73162.1| GTF2H4 [Kryptolebias marmoratus]
          Length = 82

 Score = 39.7 bits (91), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 35/70 (50%), Gaps = 13/70 (18%)

Query: 1  MIKDFADLGLVKLQQVGRKESWFIPTKLATNLSMSLT-----------DSSARKEGFVVV 49
           ++   + GLV   Q  RK   + PT+LA  L+  +T              A   GF+VV
Sbjct: 15 FLQHLREFGLVF--QRKRKSRRYYPTRLAITLAAGVTSNSSSSSNLSSTQGAGDAGFIVV 72

Query: 50 ETNFRMYAYS 59
          ETN+R+YAY+
Sbjct: 73 ETNYRVYAYT 82


>gi|440297879|gb|ELP90520.1| hypothetical protein EIN_018790 [Entamoeba invadens IP1]
          Length = 1702

 Score = 38.5 bits (88), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 21/74 (28%), Positives = 39/74 (52%), Gaps = 3/74 (4%)

Query: 97   NGITAEQIISFLQQN--AHPRVADRIPSVPENVCDQI-RLWESDLNRVEMTPAHYYEEFP 153
            +G   EQ+  F+ +N  A+ R+ ++IPS+P+  C ++   +  ++ + +    H   E P
Sbjct: 1211 SGKVCEQMAFFISKNPKAYERLKEKIPSLPDKSCVELFDFFSENVQKNQQKYKHLNYESP 1270

Query: 154  SRDVFEAACDYARD 167
            S D  E   + ARD
Sbjct: 1271 SEDFLEQYTELARD 1284


>gi|261405782|ref|YP_003242023.1| hypothetical protein GYMC10_1934 [Paenibacillus sp. Y412MC10]
 gi|261282245|gb|ACX64216.1| hypothetical protein GYMC10_1934 [Paenibacillus sp. Y412MC10]
          Length = 572

 Score = 38.1 bits (87), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 25/94 (26%), Positives = 45/94 (47%), Gaps = 9/94 (9%)

Query: 81  LIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNR 140
           + V  +T++S+  A + G+  ++I +F  ++A          +PENV   +  W  +++R
Sbjct: 364 MTVYRLTRDSVIQAAKAGMEPDRIAAFFSKHA-------AAGMPENVQSALYQWGKEMDR 416

Query: 141 VEMTPAHYYEE-FPSRDVFEAACDYARDRSGLLW 173
             +  +  YEE FP  D   A  D   DR  + W
Sbjct: 417 AGLNESPRYEEAFPEEDTELAGTD-GNDRYPMQW 449


>gi|430746993|ref|YP_007206122.1| hypothetical protein Sinac_6345 [Singulisphaera acidiphila DSM
           18658]
 gi|430018713|gb|AGA30427.1| hypothetical protein Sinac_6345 [Singulisphaera acidiphila DSM
           18658]
          Length = 790

 Score = 37.0 bits (84), Expect = 4.0,   Method: Composition-based stats.
 Identities = 30/120 (25%), Positives = 55/120 (45%), Gaps = 10/120 (8%)

Query: 44  EGFVVVETNFRMYAYSTSKLHCEILRLFSKIEY--QLPNLIVGAITKESLYNAFENGITA 101
           E F+ V+ NF + AY    L   ++  FS+     Q+   +   +T ES+Y   E G+T 
Sbjct: 448 EHFLYVQPNFEIIAYRQG-LTPPLIGQFSRFALWSQVGAALELKLTPESVYRGLEGGLTP 506

Query: 102 EQIISFLQQNAHPRVADRIPSVPENVCDQIRLWESDLNRVEMTPAHYYEEFPSRDVFEAA 161
            +++  L +++  RV      +P  V + +R W     R+    +    EF +++  E A
Sbjct: 507 PEMLDRLARHSQ-RV------LPAGVSEAVRTWAGRRERITYYASATLVEFATKEDLEKA 559


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.134    0.395 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,954,759,714
Number of Sequences: 23463169
Number of extensions: 112206556
Number of successful extensions: 244311
Number of sequences better than 100.0: 396
Number of HSP's better than 100.0 without gapping: 378
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 243499
Number of HSP's gapped (non-prelim): 470
length of query: 200
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 65
effective length of database: 9,191,667,552
effective search space: 597458390880
effective search space used: 597458390880
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 73 (32.7 bits)