BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 020564
         (324 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|297746227|emb|CBI16283.3| unnamed protein product [Vitis vinifera]
          Length = 494

 Score =  573 bits (1477), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 272/323 (84%), Positives = 294/323 (91%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MDF+PSRMAVVAK +EAF+REFFDQNPLSQIGLVT+KDG+A CLTDLGGSP+SH+KALMG
Sbjct: 171 MDFKPSRMAVVAKHIEAFIREFFDQNPLSQIGLVTIKDGLAQCLTDLGGSPDSHVKALMG 230

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL CSGDSSLQNALDLV G L+QIPSYGHREVLILYSALSTCDPGDIMETIQ+CK+SKIR
Sbjct: 231 KLECSGDSSLQNALDLVHGYLNQIPSYGHREVLILYSALSTCDPGDIMETIQECKKSKIR 290

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSVIGLSAE+FIC+HLCQ+TGGSYSVALDESHFKEL++EHAPPPPAIAEFAIANLIKMGF
Sbjct: 291 CSVIGLSAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIAEFAIANLIKMGF 350

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           PQRA EG ISICSCHKE KVG GYTCPRCKARVCELPT+CRICGL LVSSPHLARSYHHL
Sbjct: 351 PQRAAEGVISICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLVSSPHLARSYHHL 410

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
           FPI PFDEV+   LN+P  RS   CFGCQ+SLL  GNKP L VACPKCK+HFCL+CDIYI
Sbjct: 411 FPIPPFDEVSLSLLNNPHQRSSRACFGCQESLLIPGNKPTLCVACPKCKQHFCLDCDIYI 470

Query: 301 HESLHNCPGCESLRHSNPIVANE 323
           HESLHNCPGCES RHS  +   E
Sbjct: 471 HESLHNCPGCESFRHSKIVSVTE 493


>gi|225435253|ref|XP_002284994.1| PREDICTED: general transcription factor IIH subunit 2 [Vitis
           vinifera]
          Length = 433

 Score =  573 bits (1476), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 272/323 (84%), Positives = 294/323 (91%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MDF+PSRMAVVAK +EAF+REFFDQNPLSQIGLVT+KDG+A CLTDLGGSP+SH+KALMG
Sbjct: 110 MDFKPSRMAVVAKHIEAFIREFFDQNPLSQIGLVTIKDGLAQCLTDLGGSPDSHVKALMG 169

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL CSGDSSLQNALDLV G L+QIPSYGHREVLILYSALSTCDPGDIMETIQ+CK+SKIR
Sbjct: 170 KLECSGDSSLQNALDLVHGYLNQIPSYGHREVLILYSALSTCDPGDIMETIQECKKSKIR 229

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSVIGLSAE+FIC+HLCQ+TGGSYSVALDESHFKEL++EHAPPPPAIAEFAIANLIKMGF
Sbjct: 230 CSVIGLSAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIAEFAIANLIKMGF 289

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           PQRA EG ISICSCHKE KVG GYTCPRCKARVCELPT+CRICGL LVSSPHLARSYHHL
Sbjct: 290 PQRAAEGVISICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLVSSPHLARSYHHL 349

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
           FPI PFDEV+   LN+P  RS   CFGCQ+SLL  GNKP L VACPKCK+HFCL+CDIYI
Sbjct: 350 FPIPPFDEVSLSLLNNPHQRSSRACFGCQESLLIPGNKPTLCVACPKCKQHFCLDCDIYI 409

Query: 301 HESLHNCPGCESLRHSNPIVANE 323
           HESLHNCPGCES RHS  +   E
Sbjct: 410 HESLHNCPGCESFRHSKIVSVTE 432


>gi|224119296|ref|XP_002331276.1| predicted protein [Populus trichocarpa]
 gi|222873701|gb|EEF10832.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  563 bits (1450), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 269/316 (85%), Positives = 289/316 (91%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MDFRPSRMAVVA+ VEAF+REFFDQNPLSQI LVT+KDGVA  LT+LGGSPESHIKALM 
Sbjct: 97  MDFRPSRMAVVAQNVEAFIREFFDQNPLSQIALVTIKDGVAYSLTELGGSPESHIKALMA 156

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL CSGDSSLQNAL+LV   L +IPSYG+REVLILYSAL+TCDPGDIMETIQKCK+SK+R
Sbjct: 157 KLECSGDSSLQNALELVHEYLDKIPSYGNREVLILYSALTTCDPGDIMETIQKCKKSKMR 216

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSVIGLSAEMFICKHLCQ+TGG YSVALDESHFKELI+EHAPPPPAIAEFAIANLIKMGF
Sbjct: 217 CSVIGLSAEMFICKHLCQETGGLYSVALDESHFKELILEHAPPPPAIAEFAIANLIKMGF 276

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           PQRA EGSISICSCHKE KVG GY CPRCKARVCELPT+CRICGL LVSSPHLARSYHHL
Sbjct: 277 PQRAAEGSISICSCHKESKVGEGYICPRCKARVCELPTECRICGLTLVSSPHLARSYHHL 336

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
           FPIAPFDEV P   N+P  RS+ TCFGCQQSL++ GNKP L VACPKCK++FCL+CDIYI
Sbjct: 337 FPIAPFDEVKPSRQNEPHRRSQKTCFGCQQSLVNPGNKPSLQVACPKCKQYFCLDCDIYI 396

Query: 301 HESLHNCPGCESLRHS 316
           HESLHNCPGCESLR S
Sbjct: 397 HESLHNCPGCESLRAS 412


>gi|356571915|ref|XP_003554116.1| PREDICTED: general transcription factor IIH subunit 2-like [Glycine
           max]
          Length = 420

 Score =  557 bits (1435), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 257/322 (79%), Positives = 292/322 (90%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DFRPSRMAV+ KQVEAF+REFFDQNPLS +GLVT+KDG+A+C+T+LGGSPESHIKALMGK
Sbjct: 99  DFRPSRMAVMGKQVEAFIREFFDQNPLSHVGLVTIKDGIAHCITELGGSPESHIKALMGK 158

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           L CSGD+SLQNAL+LV G L+QIPSYGHREVLILYSALSTCDPGD+METIQKCK+SKIRC
Sbjct: 159 LECSGDASLQNALELVLGYLNQIPSYGHREVLILYSALSTCDPGDLMETIQKCKKSKIRC 218

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           SVIGL+AEMF+CKHLCQ+TGG+YSVALDESHFKELI+EHAPPPPAIAE+A ANLIKMGFP
Sbjct: 219 SVIGLAAEMFVCKHLCQETGGTYSVALDESHFKELILEHAPPPPAIAEYATANLIKMGFP 278

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
           QR+ EGS++IC+CH+E K G GYTCPRCK RVCELPT+CR+CGL L+SSPHLARSYHHLF
Sbjct: 279 QRSAEGSVAICTCHEEAKTGGGYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLF 338

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
           PI  FDEVTP   ND  +   +TCFGCQQSLLS GNKPGL V CPKCK+ FCL+CDIY+H
Sbjct: 339 PIVMFDEVTPSSQNDSNHSFPNTCFGCQQSLLSQGNKPGLSVICPKCKQQFCLDCDIYVH 398

Query: 302 ESLHNCPGCESLRHSNPIVANE 323
           ESLHNCPGCES RHS  + A++
Sbjct: 399 ESLHNCPGCESSRHSKSVTASQ 420


>gi|357509341|ref|XP_003624959.1| General transcription factor IIH subunit [Medicago truncatula]
 gi|355499974|gb|AES81177.1| General transcription factor IIH subunit [Medicago truncatula]
          Length = 426

 Score =  550 bits (1417), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 257/323 (79%), Positives = 288/323 (89%), Gaps = 4/323 (1%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DFRPSRMAV+AKQVE F+REFFDQNPLS +GLVT KDGVANCLTDLGGSPESHIKALMGK
Sbjct: 101 DFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTTKDGVANCLTDLGGSPESHIKALMGK 160

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           L CSGD+SLQNAL+LV   L+QIPSYGHREVLILYSALSTCDPGD+METIQKCK+SKIRC
Sbjct: 161 LECSGDASLQNALELVHSNLNQIPSYGHREVLILYSALSTCDPGDLMETIQKCKKSKIRC 220

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           SVIGL+AEMFICKHLCQ+TGG+YSVALDESHFKELI+EH+PPPPAIAE+A ANLIKMGFP
Sbjct: 221 SVIGLAAEMFICKHLCQETGGTYSVALDESHFKELILEHSPPPPAIAEYATANLIKMGFP 280

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
           QRA EGS++IC+CH+E K G GYTCPRCK RVCELPT+CR+CGL L+SSPHLARSYHHLF
Sbjct: 281 QRAAEGSVAICTCHEEAKTGGGYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLF 340

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLS----SGNKPGLYVACPKCKKHFCLECD 297
           PI PF E++P   NDP +   +TCFGCQQSLLS    +GNK  L V+CPKCK+ FCL+CD
Sbjct: 341 PIVPFAEISPSSQNDPNHSFPNTCFGCQQSLLSQGFGAGNKAELSVSCPKCKQQFCLDCD 400

Query: 298 IYIHESLHNCPGCESLRHSNPIV 320
           +YIHESLHNCPGCES RHS  + 
Sbjct: 401 MYIHESLHNCPGCESFRHSKSVT 423


>gi|356538258|ref|XP_003537621.1| PREDICTED: general transcription factor IIH subunit 2-like [Glycine
           max]
          Length = 419

 Score =  546 bits (1408), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 255/325 (78%), Positives = 292/325 (89%), Gaps = 7/325 (2%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DFRPSRM V+ KQVEAF+REFFDQNPLS +GLVT+KDG+A+C+T+LGGSPESHIKALMGK
Sbjct: 99  DFRPSRMVVMGKQVEAFIREFFDQNPLSHVGLVTIKDGIAHCITELGGSPESHIKALMGK 158

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           L CSGD+SLQNAL+LV G L+QIPSYGHREVLILYSALSTCDPGD+METIQKCK+SKIRC
Sbjct: 159 LECSGDASLQNALELVLGYLNQIPSYGHREVLILYSALSTCDPGDLMETIQKCKKSKIRC 218

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           SVIGL+AEMF+CKHLC++TGG+YSVALDESHFKELI+EHAPPPPAIAE++ ANLIKMGFP
Sbjct: 219 SVIGLAAEMFVCKHLCEETGGTYSVALDESHFKELILEHAPPPPAIAEYSTANLIKMGFP 278

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
           QR+ EGS++IC+CH+E K G GYTCPRCK RVCELPT+CR+CGL L+SSPHLARSYHHLF
Sbjct: 279 QRSAEGSVAICTCHEEAKTGGGYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLF 338

Query: 242 PIAPFDEVTPLCLNDPRNRSRS---TCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
           PI  FDEVTP      ++ SRS   TCFGCQQSLLS GNKPGL V CPKCK+ FCL+CDI
Sbjct: 339 PIVMFDEVTP----SQKDSSRSFPNTCFGCQQSLLSQGNKPGLSVICPKCKQQFCLDCDI 394

Query: 299 YIHESLHNCPGCESLRHSNPIVANE 323
           Y+HESLHNCPGCES RHS  + A++
Sbjct: 395 YVHESLHNCPGCESSRHSKSVTASQ 419


>gi|449451946|ref|XP_004143721.1| PREDICTED: general transcription factor IIH subunit 2-like [Cucumis
           sativus]
          Length = 423

 Score =  544 bits (1401), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 255/325 (78%), Positives = 289/325 (88%), Gaps = 2/325 (0%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MDFRPSRMAVVAK V+AFVREFFDQNPLSQIGLVT+KDG ANCLTDLGGSPESH+KALMG
Sbjct: 99  MDFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMG 158

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL CSGD+SLQN L+LV   L+QIPSYGHREVL+LYSAL++CDPGDIMET+QKCK SKIR
Sbjct: 159 KLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSKIR 218

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSVIGL+AE+FIC+HLCQ+TGGSYSVALDESHFKEL++EHAPPPPAIA+ A+ NLIKMGF
Sbjct: 219 CSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGF 278

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           PQRA E SI+ICSCHKE KVG GYTCPRCKARVCELPT+CRICGL L+SSPHLARSYHHL
Sbjct: 279 PQRAAESSIAICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHL 338

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLL--SSGNKPGLYVACPKCKKHFCLECDI 298
           FPI PFDEV+    +DPR++    CFGCQ+SL+  S+GN P + V+CPKCK+HFCL+CDI
Sbjct: 339 FPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDI 398

Query: 299 YIHESLHNCPGCESLRHSNPIVANE 323
           YIHESLHNCPGCES R      ++E
Sbjct: 399 YIHESLHNCPGCESFRRPKLATSDE 423


>gi|449525401|ref|XP_004169706.1| PREDICTED: general transcription factor IIH subunit 2-like, partial
           [Cucumis sativus]
          Length = 329

 Score =  540 bits (1390), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 255/325 (78%), Positives = 289/325 (88%), Gaps = 2/325 (0%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MDFRPSRMAVVAK V+AFVREFFDQNPLSQIGLVT+KDG ANCLTDLGGSPESH+KALMG
Sbjct: 5   MDFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMG 64

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL CSGD+SLQN L+LV   L+QIPSYGHREVL+LYSAL++CDPGDIMET+QKCK SKIR
Sbjct: 65  KLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSKIR 124

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSVIGL+AE+FIC+HLCQ+TGGSYSVALDESHFKEL++EHAPPPPAIA+ A+ NLIKMGF
Sbjct: 125 CSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGF 184

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           PQRA E SI+ICSCHKE KVG GYTCPRCKARVCELPT+CRICGL L+SSPHLARSYHHL
Sbjct: 185 PQRAAESSIAICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHL 244

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLL--SSGNKPGLYVACPKCKKHFCLECDI 298
           FPI PFDEV+    +DPR++    CFGCQ+SL+  S+GN P + V+CPKCK+HFCL+CDI
Sbjct: 245 FPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDI 304

Query: 299 YIHESLHNCPGCESLRHSNPIVANE 323
           YIHESLHNCPGCES R      ++E
Sbjct: 305 YIHESLHNCPGCESFRRPKLATSDE 329


>gi|297848764|ref|XP_002892263.1| ATGTF2H2/GTF2H2 [Arabidopsis lyrata subsp. lyrata]
 gi|297338105|gb|EFH68522.1| ATGTF2H2/GTF2H2 [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score =  529 bits (1362), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 252/314 (80%), Positives = 282/314 (89%), Gaps = 1/314 (0%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MDFRPSRMA++AK VEAF+REFFDQNPLSQIGLV++K+GVA+ LTDLGGSPE+HIKALMG
Sbjct: 97  MDFRPSRMAIMAKHVEAFIREFFDQNPLSQIGLVSIKNGVAHTLTDLGGSPETHIKALMG 156

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL   GDSSLQNAL+LV   L+Q+PSYGHREVLILYSAL TCDPGDIMETIQKCK+SK+R
Sbjct: 157 KLEALGDSSLQNALELVHEHLNQVPSYGHREVLILYSALCTCDPGDIMETIQKCKKSKLR 216

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSVIGLSAEMFICKHLCQ+TGG YSVA+DE H K+L++EHAPPPPAIAEFAIANLIKMGF
Sbjct: 217 CSVIGLSAEMFICKHLCQETGGLYSVAVDEVHLKDLLLEHAPPPPAIAEFAIANLIKMGF 276

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           PQRA EGS++ICSCHKEVK+G GY CPRCKARVC+LPT+C ICGL LVSSPHLARSYHHL
Sbjct: 277 PQRAAEGSMAICSCHKEVKIGAGYMCPRCKARVCDLPTECTICGLTLVSSPHLARSYHHL 336

Query: 241 FPIAPFDEVTPL-CLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
           FPIAPFDEV  L  LND R +   +CFGCQQSL+ +GNKPG  V C KCK +FCL+CDIY
Sbjct: 337 FPIAPFDEVPALSSLNDNRRKLGKSCFGCQQSLIGAGNKPGPCVTCRKCKHYFCLDCDIY 396

Query: 300 IHESLHNCPGCESL 313
           IHESLHNCPGCES+
Sbjct: 397 IHESLHNCPGCESI 410


>gi|30679101|ref|NP_683275.2| transcription initiation factor TFIIH subunit H2 [Arabidopsis
           thaliana]
 gi|22087278|gb|AAM90909.1|AF499443_1 p44/SSL1-like protein [Arabidopsis thaliana]
 gi|4056421|gb|AAC97995.1| Similar to gb|Z30094 basic transcripion factor 2, 44 kD subunit
           from Homo sapiens. EST gb|W43325 comes from this gene
           [Arabidopsis thaliana]
 gi|110738828|dbj|BAF01337.1| putative transcription factor [Arabidopsis thaliana]
 gi|332189663|gb|AEE27784.1| transcription initiation factor TFIIH subunit H2 [Arabidopsis
           thaliana]
          Length = 421

 Score =  526 bits (1355), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 251/314 (79%), Positives = 281/314 (89%), Gaps = 1/314 (0%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MDFRPSRMA++AK VEAF+REFFDQNPLSQIGLV++K+GVA+ LTDLGGSPE+HIKALMG
Sbjct: 97  MDFRPSRMAIMAKHVEAFIREFFDQNPLSQIGLVSIKNGVAHTLTDLGGSPETHIKALMG 156

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL   GDSSLQNAL+LV   L+Q+PSYGHREVLILYSAL TCDPGDIMETIQKCK+SK+R
Sbjct: 157 KLEALGDSSLQNALELVHEHLNQVPSYGHREVLILYSALCTCDPGDIMETIQKCKKSKLR 216

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSVIGLSAEMFICKHLCQ+TGG YSVA+DE H K+L++EHAPPPPAIAEFAIANLIKMGF
Sbjct: 217 CSVIGLSAEMFICKHLCQETGGLYSVAVDEVHLKDLLLEHAPPPPAIAEFAIANLIKMGF 276

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           PQRA EGS++ICSCHKEVK+G GY CPRCKARVC+LPT+C ICGL LVSSPHLARSYHHL
Sbjct: 277 PQRAAEGSMAICSCHKEVKIGAGYMCPRCKARVCDLPTECTICGLTLVSSPHLARSYHHL 336

Query: 241 FPIAPFDEVTPL-CLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
           FPIAPFDEV  L  LND R +   +CFGCQQSL+ +GNKP   V C KCK +FCL+CDIY
Sbjct: 337 FPIAPFDEVPALSSLNDNRRKLGKSCFGCQQSLIGAGNKPVPCVTCRKCKHYFCLDCDIY 396

Query: 300 IHESLHNCPGCESL 313
           IHESLHNCPGCES+
Sbjct: 397 IHESLHNCPGCESI 410


>gi|357164610|ref|XP_003580110.1| PREDICTED: general transcription factor IIH subunit 2-like
           [Brachypodium distachyon]
          Length = 428

 Score =  503 bits (1296), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 236/315 (74%), Positives = 274/315 (86%), Gaps = 5/315 (1%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MD+RPSRMAVVAK  EAF+REFFDQNPLS +GLVT+KDG+++ LTD+GGSPES IKALMG
Sbjct: 114 MDYRPSRMAVVAKCAEAFIREFFDQNPLSHVGLVTIKDGISHRLTDIGGSPESQIKALMG 173

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL CSGDSSLQNAL+L+ G L+QIPSYGH+EVLILYSAL+TCDPGD+MET++KCK+SKIR
Sbjct: 174 KLECSGDSSLQNALELIHGYLNQIPSYGHKEVLILYSALNTCDPGDVMETVEKCKKSKIR 233

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSVIGL+AE+FICKHLC++TGGSY+VALDESHFKEL++EHAPPPPAIAE+A ANLIKMGF
Sbjct: 234 CSVIGLAAEIFICKHLCEETGGSYTVALDESHFKELLLEHAPPPPAIAEYAAANLIKMGF 293

Query: 181 PQRAGEGSISICSCHKEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           PQR  E  ISICSCHK++K G  GY CPRCK  VCELPT+CR CGL LVSSPHLARSYHH
Sbjct: 294 PQRGAEDLISICSCHKKIKSGAEGYICPRCKVNVCELPTECRTCGLTLVSSPHLARSYHH 353

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
           LFP+APFDEVT         +    C+GCQQSL++S ++  L+V CPKC +HFC +CDIY
Sbjct: 354 LFPVAPFDEVT----FKLGQKGGQNCYGCQQSLINSDSQSNLHVRCPKCNQHFCFDCDIY 409

Query: 300 IHESLHNCPGCESLR 314
           IHESLHNCPGCES R
Sbjct: 410 IHESLHNCPGCESQR 424


>gi|242076424|ref|XP_002448148.1| hypothetical protein SORBIDRAFT_06g022140 [Sorghum bicolor]
 gi|241939331|gb|EES12476.1| hypothetical protein SORBIDRAFT_06g022140 [Sorghum bicolor]
          Length = 431

 Score =  503 bits (1296), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 239/316 (75%), Positives = 272/316 (86%), Gaps = 3/316 (0%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MD+RPSRMAVVAK  EAF+REFFDQNPLS +GLVT+KDG+A+ LT++GGSPES IKALMG
Sbjct: 115 MDYRPSRMAVVAKHAEAFIREFFDQNPLSHVGLVTIKDGIAHRLTEIGGSPESQIKALMG 174

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL CSGDSSLQNAL+LV G L Q+PSYGH+EVLILYSAL+T DPGDIMETI KCK+SKIR
Sbjct: 175 KLECSGDSSLQNALELVHGYLDQVPSYGHKEVLILYSALNTFDPGDIMETITKCKKSKIR 234

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSVIGL+AE+FICKHLC++TGGSY+VALDESHFKEL++EHAPPPPAIAE+A ANLIKMGF
Sbjct: 235 CSVIGLAAEIFICKHLCEETGGSYTVALDESHFKELLLEHAPPPPAIAEYAAANLIKMGF 294

Query: 181 PQRAGEGSISICSCHKEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           PQR  E  ISICSCHK++K G  GY CPRCK  VCELPT+CR CGL LVSSPHLARSYHH
Sbjct: 295 PQRGAEDLISICSCHKKIKSGAEGYICPRCKVNVCELPTECRTCGLTLVSSPHLARSYHH 354

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
           LFP+APFDEVTP  + +   R    CF CQQSL +   +  L+V CPKC +HFCL+CDIY
Sbjct: 355 LFPVAPFDEVTP--VPNRIQRGGQICFSCQQSLFNPDGQSNLHVRCPKCNQHFCLDCDIY 412

Query: 300 IHESLHNCPGCESLRH 315
           IHESLHNCPGCES R+
Sbjct: 413 IHESLHNCPGCESQRN 428


>gi|226531944|ref|NP_001142175.1| uncharacterized protein LOC100274342 [Zea mays]
 gi|194707472|gb|ACF87820.1| unknown [Zea mays]
 gi|414586431|tpg|DAA37002.1| TPA: hypothetical protein ZEAMMB73_625811 [Zea mays]
          Length = 428

 Score =  499 bits (1284), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 238/315 (75%), Positives = 269/315 (85%), Gaps = 3/315 (0%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MD+RPSRMAVVAK  EAF+REFFDQNPLS +GLVT+KDG+A+ LT++GGSPES IKALMG
Sbjct: 112 MDYRPSRMAVVAKHAEAFIREFFDQNPLSHVGLVTIKDGIAHRLTEIGGSPESQIKALMG 171

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL CSGDSSLQNALDLV G L Q+PSYGH+EVLILYSAL+T DPGDI ETI KCK+SKIR
Sbjct: 172 KLECSGDSSLQNALDLVHGYLDQVPSYGHKEVLILYSALNTFDPGDITETIAKCKKSKIR 231

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSVIGL+AE+FICKHLC++TGGSY+VALDESHFKEL++EHAPPPPAIAE+A ANLIKMGF
Sbjct: 232 CSVIGLAAEIFICKHLCEETGGSYTVALDESHFKELLLEHAPPPPAIAEYAAANLIKMGF 291

Query: 181 PQRAGEGSISICSCHKEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           PQR  E  ISICSCHK++K G  GY CPRCK  VCELPT+CR CGL LVSSPHLARSYHH
Sbjct: 292 PQRGAEELISICSCHKKIKSGAEGYICPRCKVNVCELPTECRTCGLTLVSSPHLARSYHH 351

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
           LFP+ PFDEVTP  + +   R    CF CQQSL +   +  L+V CPKC +HFCL+CDIY
Sbjct: 352 LFPVPPFDEVTP--VPNRIQRGGQICFSCQQSLYNPDGQSNLHVRCPKCNQHFCLDCDIY 409

Query: 300 IHESLHNCPGCESLR 314
           IHESLHNCPGCES R
Sbjct: 410 IHESLHNCPGCESQR 424


>gi|326529607|dbj|BAK04750.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 430

 Score =  498 bits (1283), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 235/318 (73%), Positives = 273/318 (85%), Gaps = 5/318 (1%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MD+RPSRMAVVAK  EAF+REFFDQNPLS +GLVT+KDG+++ LT++GGSPES I ALMG
Sbjct: 116 MDYRPSRMAVVAKCAEAFIREFFDQNPLSHVGLVTIKDGISHRLTEIGGSPESQINALMG 175

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL CSGDSSLQNAL+LV G L+QIPSYGH+EVL LYSAL+TCDPGDIMETI+KCK+SK+R
Sbjct: 176 KLECSGDSSLQNALELVHGYLNQIPSYGHKEVLFLYSALNTCDPGDIMETIEKCKKSKVR 235

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSVIGL+AE+FICKHLC++TGGSY+VALDESHFKEL++EHAPPPPAIAE+A ANLIKMGF
Sbjct: 236 CSVIGLAAEIFICKHLCEETGGSYTVALDESHFKELLLEHAPPPPAIAEYAAANLIKMGF 295

Query: 181 PQRAGEGSISICSCHKEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           PQR  E  ISICSCHK++K G  GY CPRCK  VCELPT+CR CGL LVSSPHLARSYHH
Sbjct: 296 PQRGAEDLISICSCHKKIKSGAEGYICPRCKVNVCELPTECRTCGLTLVSSPHLARSYHH 355

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
           LFP++PFDEVT         +    CFGCQQSL+++G +  ++V CPKC  HFC +CDIY
Sbjct: 356 LFPVSPFDEVT----FKLGQKGGQNCFGCQQSLINTGGQSNIHVRCPKCNHHFCFDCDIY 411

Query: 300 IHESLHNCPGCESLRHSN 317
           IHESLHNCPGCES R S+
Sbjct: 412 IHESLHNCPGCESQRSSS 429


>gi|32489532|emb|CAE04735.1| OSJNBa0043L24.23 [Oryza sativa Japonica Group]
 gi|116310777|emb|CAH67570.1| OSIGBa0101P20.13 [Oryza sativa Indica Group]
          Length = 432

 Score =  492 bits (1267), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 230/318 (72%), Positives = 271/318 (85%), Gaps = 1/318 (0%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MD+RPSRMAVVAK  E F+REFFDQNPLS +G+VT+KDG+++ LT++GGSPES IKALMG
Sbjct: 115 MDYRPSRMAVVAKYAEVFIREFFDQNPLSHVGIVTMKDGISHRLTEIGGSPESQIKALMG 174

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL CSG+ SLQNAL+LV G L Q+PSYGH+EVL LYSAL+TCDPGDIMETI KCK+SKIR
Sbjct: 175 KLECSGEPSLQNALELVHGYLDQVPSYGHKEVLFLYSALNTCDPGDIMETIAKCKKSKIR 234

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSVIGL+AE+FICK+LC++TGGSY+VALDESHFKEL++EHAPPPPAIAE+A ANLIKMGF
Sbjct: 235 CSVIGLAAEIFICKYLCEETGGSYTVALDESHFKELLLEHAPPPPAIAEYAAANLIKMGF 294

Query: 181 PQRAGEGSISICSCHKEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           PQR  E  ISICSCHK++K G  GY CPRCK  VCELPT+CR CGL LVSSPHLARSYHH
Sbjct: 295 PQRGAEDLISICSCHKKIKSGAEGYICPRCKVNVCELPTECRTCGLTLVSSPHLARSYHH 354

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
           LFP+ PFDEV+ +  N    +    C+GCQQS ++  ++  L+V CPKC +HFCL+CDIY
Sbjct: 355 LFPVQPFDEVSSVHPNRLGQKGGQKCYGCQQSFINPDSQSSLHVRCPKCNQHFCLDCDIY 414

Query: 300 IHESLHNCPGCESLRHSN 317
           IHESLHNCPGCES R S+
Sbjct: 415 IHESLHNCPGCESQRSSS 432


>gi|255570827|ref|XP_002526366.1| btf, putative [Ricinus communis]
 gi|223534325|gb|EEF36037.1| btf, putative [Ricinus communis]
          Length = 465

 Score =  474 bits (1219), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 225/274 (82%), Positives = 251/274 (91%), Gaps = 1/274 (0%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MD +PSRM V+A+ VEAF+REFFDQNPLSQIGLVT+KDGVA+ LT+LGGSPESHIKAL G
Sbjct: 99  MDIKPSRMVVIARHVEAFIREFFDQNPLSQIGLVTIKDGVAHPLTELGGSPESHIKALTG 158

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL C+GDSSLQNALDLV+G L QIP+YGHREVLILYSALSTCDPGDIMETIQ CK+SKIR
Sbjct: 159 KLECAGDSSLQNALDLVRGYLDQIPAYGHREVLILYSALSTCDPGDIMETIQNCKKSKIR 218

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSVIGLSAEMFICKHLCQ+TGG YSVA+DE+H KEL++EHAPPPPAIAEFAIANLIKMGF
Sbjct: 219 CSVIGLSAEMFICKHLCQETGGLYSVAMDETHLKELLLEHAPPPPAIAEFAIANLIKMGF 278

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           PQR+ EGSISICSCHKE KVG GY CPRCKARVC+LPT+CRICGL LVSSPHLARSYHHL
Sbjct: 279 PQRSAEGSISICSCHKEAKVGEGYICPRCKARVCDLPTECRICGLTLVSSPHLARSYHHL 338

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLS 274
           FPIAPFDEV+    ++ ++R + TCFGCQQSLL+
Sbjct: 339 FPIAPFDEVSQR-QSELQHRPQKTCFGCQQSLLN 371


>gi|168025049|ref|XP_001765047.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683634|gb|EDQ70042.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 394

 Score =  451 bits (1160), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 204/313 (65%), Positives = 256/313 (81%), Gaps = 1/313 (0%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MD RPSRM +V   VEAFVREFFDQNPLS +G++ +KDG+A+ LTDL GSPE+HI+AL  
Sbjct: 72  MDLRPSRMGIVVDCVEAFVREFFDQNPLSHLGVILLKDGIAHQLTDLSGSPETHIRALRS 131

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
            +  +GD+S+QN LDL +G L+QIPSYGHREVL++YSALST DPGD+MET+Q+CK++ IR
Sbjct: 132 NMESTGDASIQNGLDLARGYLTQIPSYGHREVLLVYSALSTIDPGDVMETVQECKKANIR 191

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSV+GLSAE++ICK LC+ TGG YSVA +E H KELIMEHAPPP A AE ++A+L++MGF
Sbjct: 192 CSVVGLSAEIYICKLLCEQTGGMYSVATNEGHLKELIMEHAPPPAAQAEMSVASLVRMGF 251

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           PQR  E ++++C+CH+EVK+G GYTCPRCKARVCELPT C ICGL LVSSPHLARSYHHL
Sbjct: 252 PQRGAEDAVALCACHREVKMGGGYTCPRCKARVCELPTQCHICGLTLVSSPHLARSYHHL 311

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNK-PGLYVACPKCKKHFCLECDIY 299
           FP++PF+EV  L   + R     +C+GC Q L + G K  G+ + CP+C++HFC +CD Y
Sbjct: 312 FPVSPFEEVENLASANGRQSLSKSCYGCLQELPTPGGKVKGVRLECPRCRQHFCFDCDNY 371

Query: 300 IHESLHNCPGCES 312
           IHESLHNCPGCES
Sbjct: 372 IHESLHNCPGCES 384


>gi|302791231|ref|XP_002977382.1| hypothetical protein SELMODRAFT_232956 [Selaginella moellendorffii]
 gi|300154752|gb|EFJ21386.1| hypothetical protein SELMODRAFT_232956 [Selaginella moellendorffii]
          Length = 394

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 191/318 (60%), Positives = 258/318 (81%), Gaps = 11/318 (3%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           +DFRPSRMAVVAK VE FVRE+FDQNPLSQ+G++ +K+G+A+ +T+L GSPE+HI+AL  
Sbjct: 79  IDFRPSRMAVVAKAVEDFVREYFDQNPLSQLGIIVMKNGIASVVTELSGSPEAHIRALKS 138

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
            L   G++SLQN L+L    +  IPSYGHREV+I++SALSTCDPG+I+ET++KCK ++++
Sbjct: 139 NLESFGEASLQNGLELAHTYVQHIPSYGHREVVIVFSALSTCDPGNILETVKKCKAARMK 198

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSV+GL+AE++ICKH+ Q+TGGSY+VA++E H KE+++EH PPP A+ + + A+L++MGF
Sbjct: 199 CSVVGLTAEIYICKHISQETGGSYAVAMNEGHLKEILLEHVPPPAAMPDASSASLVRMGF 258

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           PQR  EG+++IC+CHKE+K+G GY CPRC+ARVCELPT+C +CGL LVSS HLARSYHHL
Sbjct: 259 PQRGSEGAVAICACHKELKIGGGYICPRCRARVCELPTECSLCGLALVSSAHLARSYHHL 318

Query: 241 FPIAPFDEVTPLCLNDPRNRSRS----TCFGCQQSLLSSGNKPGLYVACPKCKKHFCLEC 296
           FPI  FDE+    L D   R +S    +CFGC+  L  SG+   L + CP+CK+ FC +C
Sbjct: 319 FPIPTFDEL----LVDASLRGKSSFAGSCFGCRIQLSGSGS---LRLRCPRCKRDFCFDC 371

Query: 297 DIYIHESLHNCPGCESLR 314
           D+YIHESLHNCPGCE+L+
Sbjct: 372 DVYIHESLHNCPGCETLQ 389


>gi|302786380|ref|XP_002974961.1| hypothetical protein SELMODRAFT_232450 [Selaginella moellendorffii]
 gi|300157120|gb|EFJ23746.1| hypothetical protein SELMODRAFT_232450 [Selaginella moellendorffii]
          Length = 394

 Score =  419 bits (1077), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 189/318 (59%), Positives = 257/318 (80%), Gaps = 11/318 (3%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           +DFRPSRMAVVAK VE FVRE+FDQNPLSQ+G++ +K+G+A+ +T+L GSPE+HI+AL  
Sbjct: 79  IDFRPSRMAVVAKAVEDFVREYFDQNPLSQLGIIVMKNGIASVVTELSGSPEAHIRALKS 138

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
            L   G++SLQN L+L    +  IPSYGHREV+I++SALSTCDPG+I+ET++KCK ++++
Sbjct: 139 NLESFGEASLQNGLELAHTYVQHIPSYGHREVVIVFSALSTCDPGNILETVKKCKAARMK 198

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSV+GL+AE++ICKH+ Q+TGGSY+VA++E H KE+++EH PPP A+ + + A+L++MGF
Sbjct: 199 CSVVGLTAEIYICKHISQETGGSYAVAMNEGHLKEILLEHVPPPAAMPDASSASLVRMGF 258

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           PQR  EG+++IC+CHKE+++G GY CPRC+ARVCELPT+C +CGL LVSS HLARSYHHL
Sbjct: 259 PQRGSEGAVAICACHKELRIGGGYICPRCRARVCELPTECSLCGLALVSSAHLARSYHHL 318

Query: 241 FPIAPFDEVTPLCLNDPRNRSRS----TCFGCQQSLLSSGNKPGLYVACPKCKKHFCLEC 296
           FPI  FDE+    L D   R +S    +CFGC+  L   G+   L + CP+CK+ FC +C
Sbjct: 319 FPIPTFDEL----LVDASLRGKSSFAGSCFGCRIQLSGGGS---LRLRCPRCKRDFCFDC 371

Query: 297 DIYIHESLHNCPGCESLR 314
           D+YIHESLHNCPGCE+L+
Sbjct: 372 DVYIHESLHNCPGCETLQ 389


>gi|218195182|gb|EEC77609.1| hypothetical protein OsI_16588 [Oryza sativa Indica Group]
 gi|222629181|gb|EEE61313.1| hypothetical protein OsJ_15417 [Oryza sativa Japonica Group]
          Length = 509

 Score =  353 bits (905), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 163/219 (74%), Positives = 192/219 (87%), Gaps = 1/219 (0%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MD+RPSRMAVVAK  E F+REFFDQNPLS +G+VT+KDG+++ LT++GGSPES IKALMG
Sbjct: 115 MDYRPSRMAVVAKYAEVFIREFFDQNPLSHVGIVTMKDGISHRLTEIGGSPESQIKALMG 174

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL CSG+ SLQNAL+LV G L Q+PSYGH+EVL LYSAL+TCDPGDIMETI KCK+SKIR
Sbjct: 175 KLECSGEPSLQNALELVHGYLDQVPSYGHKEVLFLYSALNTCDPGDIMETIAKCKKSKIR 234

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSVIGL+AE+FICK+LC++TGGSY+VALDESHFKEL++EHAPPPPAIAE+A ANLIKMGF
Sbjct: 235 CSVIGLAAEIFICKYLCEETGGSYTVALDESHFKELLLEHAPPPPAIAEYAAANLIKMGF 294

Query: 181 PQRAGEGSISICSCHKEVKVGV-GYTCPRCKARVCELPT 218
           PQR  E  ISICSCHK++K G  GY CPRCK  +  L +
Sbjct: 295 PQRGAEDLISICSCHKKIKSGAEGYICPRCKVNIVNLAS 333


>gi|302830270|ref|XP_002946701.1| hypothetical protein VOLCADRAFT_72869 [Volvox carteri f.
           nagariensis]
 gi|300267745|gb|EFJ51927.1| hypothetical protein VOLCADRAFT_72869 [Volvox carteri f.
           nagariensis]
          Length = 430

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 161/329 (48%), Positives = 222/329 (67%), Gaps = 19/329 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL-MG 60
           D RP+R+A +     +F+REFFDQNPLSQ+G+  ++ G+   LTDL GSPE+ +  L   
Sbjct: 98  DLRPNRLACILGLCRSFIREFFDQNPLSQLGIAVMRGGLVEKLTDLSGSPEAQVARLDAA 157

Query: 61  KLGC-SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
           KLG  +GD SLQNALDL   +L  +P YGHRE+L++++AL+TCDPG+++++++ CKE+ I
Sbjct: 158 KLGAPAGDVSLQNALDLSVSVLRSLPPYGHRELLVVFAALATCDPGNVLDSVRACKENNI 217

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SV+G++AE+++C+ + +DT GSY VAL+E+H ++L+M H  PPPA A  A A L++MG
Sbjct: 218 RVSVVGVAAEVYVCRRITEDTRGSYGVALNEAHLEQLLMAHTTPPPATAAQAKAELVRMG 277

Query: 180 FPQRAGEGSISICSCHKEVKVGVG-YTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
           FPQR+ E + S      E ++  G YTCPRCK+RV ELP +C ICGL L+SSPHLARSYH
Sbjct: 278 FPQRSTEEASSAVFVGLEARLLSGSYTCPRCKSRVPELPCECHICGLTLISSPHLARSYH 337

Query: 239 HLFPIAPF--------DEVTPLCLNDPRNRSR---ST---CFGCQQSLLSSGNKPGLYVA 284
           HLFP+  F         E   +    P  R     ST   CFGC + +   G    L++ 
Sbjct: 338 HLFPVPSFHEDSKERASEAAAVLRGQPAMRDEVPTSTGLYCFGCLREVSQPGAAGQLHLT 397

Query: 285 --CPKCKKHFCLECDIYIHESLHNCPGCE 311
             C +CK+ FC ECD Y+HE LHNCPGCE
Sbjct: 398 LRCGQCKQVFCFECDAYVHEQLHNCPGCE 426


>gi|281212115|gb|EFA86276.1| TFIIH subunit [Polysphondylium pallidum PN500]
          Length = 438

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 150/311 (48%), Positives = 206/311 (66%), Gaps = 11/311 (3%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +PSR   +    E+F++EFFDQNP+SQI L+  K+  A  LT+L G+P  HI AL   
Sbjct: 109 DMKPSRHQCLLNSSESFIKEFFDQNPISQISLIVTKNSKAEKLTELNGNPNRHITALKNT 168

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
               GD S+QN+L++    LS +P YG RE++++YS+L+TCDPGD+ +TI+  K  +IR 
Sbjct: 169 SAMEGDPSIQNSLEVAISTLSHVPKYGSREIIVIYSSLTTCDPGDLTKTIEILKNEQIRV 228

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           S I ++AE+FICKH+ + T G+  V L+E HF E +M H  PPP   +   A L++MGFP
Sbjct: 229 SFIHMAAELFICKHISEQTHGTMKVVLNEEHFNECLMLHCQPPPTFGKTEAA-LVEMGFP 287

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
           Q+       +C CH++++  VGYTCPRC ++ CELPTDC+IC L LVSSPHLARSYHHLF
Sbjct: 288 QQHTSSIPMMCMCHEQLRY-VGYTCPRCSSKFCELPTDCQICNLSLVSSPHLARSYHHLF 346

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
            +  F E+    L     ++  +CFGC    L+      L+  CP+CK+ FC ECD +IH
Sbjct: 347 QVPLFTEINWREL-----KTEISCFGC----LTVPKSSSLFFGCPRCKQSFCFECDQFIH 397

Query: 302 ESLHNCPGCES 312
           ESLHNCPGCE+
Sbjct: 398 ESLHNCPGCEN 408


>gi|330804328|ref|XP_003290148.1| TFIIH subunit [Dictyostelium purpureum]
 gi|325079746|gb|EGC33332.1| TFIIH subunit [Dictyostelium purpureum]
          Length = 423

 Score =  314 bits (804), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 149/310 (48%), Positives = 206/310 (66%), Gaps = 9/310 (2%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +PSR  V+ +  E F++EFFDQNP+SQ+ ++  K+  A  +++L G+P+ HI+A+   
Sbjct: 108 DLKPSRYQVLIQNTEQFIKEFFDQNPISQLSIIITKNSKAEKISELSGNPQRHIQAMKDA 167

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           +   G+ S+QN+LD+    L  +P YG REVL L+S+L+TCDP D+ +TIQ  K+  IR 
Sbjct: 168 IAMEGEPSIQNSLDVAISSLCYVPKYGSREVLFLFSSLTTCDPTDLAKTIQTLKQENIRV 227

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           S I L+AE++IC+H+ + T G+  V L+E HF E +     PPP I +   A L++MGFP
Sbjct: 228 SFIHLAAELYICRHIAEQTNGTSKVILNEEHFLENLYSKCQPPPTIGKVEAA-LVEMGFP 286

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
           Q+      S C CH+++K   GY CPRC  + CELPTDC+IC L LVSSPHLARSYHHLF
Sbjct: 287 QQITSSVPSTCICHEKMKYS-GYICPRCGVKSCELPTDCQICNLSLVSSPHLARSYHHLF 345

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
           PI  F+EV    LN     +  TC+ C  S  S  +   L+ +CP+C + FCLECD+YIH
Sbjct: 346 PIPLFEEVNWKELN-----TNVTCYSCLTS--SETSILSLFFSCPRCDQVFCLECDMYIH 398

Query: 302 ESLHNCPGCE 311
           E+LHNCPGCE
Sbjct: 399 EALHNCPGCE 408


>gi|384251056|gb|EIE24534.1| transcription factor TFIIH, 44 kDa subunit, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 420

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 216/329 (65%), Gaps = 29/329 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R+AV++  ++ F+R+FFD+NPLS +G++ ++DG+A  LTD+ GSPE+HI  L G 
Sbjct: 71  DMRPNRLAVMSGVLQGFIRKFFDENPLSHLGIIVMRDGIAQRLTDMAGSPEAHISKLQGN 130

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREV---LILYSA----------LSTCDPGDIM 108
           L  SGD+SLQNA+D+    L  IP YGHRE    ++L  A          LSTCDPG+++
Sbjct: 131 LDASGDASLQNAMDMAAESLKSIPPYGHREACPQILLKPAHLLVLVLFAALSTCDPGNVL 190

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIA 168
           + ++  K++ IR S++G++AE+ ICK   ++TGG+Y VAL+E HF++L+ EHAPPPP++A
Sbjct: 191 DAVKAAKQNSIRVSIVGVAAEVHICKVFTKETGGTYGVALNEKHFEDLVFEHAPPPPSLA 250

Query: 169 EFAIANLIKMGFPQRAGEGSISICSCHKEVKVGV-GYTCPRCKARVCELPTDCRICGLQL 227
             A A+L++MGF  R  +G        K+ K+   G+TCPRCKARV +LP  C ICGL L
Sbjct: 251 TDAAASLVRMGFAHRNQDGVEGTAFIGKDCKLSSGGFTCPRCKARVVDLPCSCHICGLTL 310

Query: 228 VSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSG------NKPGL 281
           +SSPHLARSYHHLFP+  F E          +    +C+ C   L  S        + G+
Sbjct: 311 ISSPHLARSYHHLFPVKAFTE---------GDSGAVSCYACLADLEPSSLDDDAPGEVGV 361

Query: 282 YVACPKCKKHFCLECDIYIHESLHNCPGC 310
            V CP C + FC +CD +IHE+LHNCPGC
Sbjct: 362 VVRCPDCCQLFCFDCDAFIHETLHNCPGC 390


>gi|159490896|ref|XP_001703409.1| transcription factor TFIIH, 44 kDa subunit [Chlamydomonas
           reinhardtii]
 gi|158280333|gb|EDP06091.1| transcription factor TFIIH, 44 kDa subunit [Chlamydomonas
           reinhardtii]
          Length = 413

 Score =  304 bits (779), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 160/317 (50%), Positives = 229/317 (72%), Gaps = 10/317 (3%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL-MG 60
           D RP+R+  +     +FVREFFDQNPLSQ+G+  ++ G+   LTDL GSPE+ +  L  G
Sbjct: 99  DLRPNRLGCMLSLCRSFVREFFDQNPLSQLGIAVMRGGLVEKLTDLSGSPEAQVARLDAG 158

Query: 61  KLGC-SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
           KLG  +GD+SLQNAL+L   LL Q+P YGHRE+L+L++AL++CDPG+++++++ CK++ I
Sbjct: 159 KLGAPAGDASLQNALELGVSLLKQLPPYGHRELLLLFAALASCDPGNVLDSVKACKDNNI 218

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SV+G++AE+++C+ + +DTGG+Y VAL+E H ++L+M H+PPPPA A  A A L++MG
Sbjct: 219 RVSVVGVAAEVYVCRRITEDTGGTYGVALNEGHLEQLLMAHSPPPPATAAQAKAELVRMG 278

Query: 180 FPQRAGEGSISICSCHKEVKVGVG-YTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
           FPQR+ E + S     +E ++  G YTCPRC++RV ELP +C  CGL L+SSPHLARSYH
Sbjct: 279 FPQRSTEEASSAVFVGQEARLLPGSYTCPRCRSRVPELPVECHTCGLTLISSPHLARSYH 338

Query: 239 HLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA--CPKCKKHFCLEC 296
           HLFP+ P+ E + +        + + CFGC + +   G K  L++   C +CK+ FC EC
Sbjct: 339 HLFPVQPYHEDSEVMTT-----TGAYCFGCLREVSEPGVKGQLHLTLRCGQCKQVFCFEC 393

Query: 297 DIYIHESLHNCPGCESL 313
           D Y+HESLHNCPGCE L
Sbjct: 394 DAYVHESLHNCPGCEFL 410


>gi|255073175|ref|XP_002500262.1| predicted protein [Micromonas sp. RCC299]
 gi|226515524|gb|ACO61520.1| predicted protein [Micromonas sp. RCC299]
          Length = 427

 Score =  303 bits (777), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 153/337 (45%), Positives = 207/337 (61%), Gaps = 32/337 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RPSR+AVV+  +  F RE+F+QNPLSQ+GLV  ++G+A  LT+L GSPESHI AL   
Sbjct: 85  DMRPSRLAVVSGILYKFFREYFNQNPLSQLGLVVTRNGIAERLTELSGSPESHITALKEN 144

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           L  +GD S+QN+L+ VQ  L+Q+P+YG REVL + SALS+CDPG++   I   K + IR 
Sbjct: 145 LEAAGDMSIQNSLEQVQSSLAQLPTYGTREVLFVVSALSSCDPGNVHTAIAAAKSANIRV 204

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           SV+ ++AE+ IC+ + ++TGG + V+  + H ++L+M HAPPPP   +   A+L++MGFP
Sbjct: 205 SVVSVAAELHICRRMTEETGGMFGVSQSQHHLEDLLMAHAPPPPLNEQATKASLVEMGFP 264

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARV----------------------CELPTD 219
           Q+      +  S       G  Y CPRC +RV                       ELP  
Sbjct: 265 QKRPLDKGAFFSGR-----GGEYVCPRCASRVEVNPTHPKFVENIKINNPVTFLQELPAQ 319

Query: 220 CRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLC---LNDPRNRSRSTCFGCQQSLLSSG 276
           C +C L LVSSPHLARSYHHLFP+APF+   P+    LND      + CF C      SG
Sbjct: 320 CSVCSLTLVSSPHLARSYHHLFPVAPFEVHRPVIMQELNDAAKLDATNCFACNLKFDRSG 379

Query: 277 NKPGLYV--ACPKCKKHFCLECDIYIHESLHNCPGCE 311
            +        CP+CKK +C +CD++IHE LHNCPGCE
Sbjct: 380 ARGNDDAPSVCPRCKKIYCFQCDVFIHEKLHNCPGCE 416


>gi|303285284|ref|XP_003061932.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456343|gb|EEH53644.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 404

 Score =  303 bits (777), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 158/329 (48%), Positives = 210/329 (63%), Gaps = 21/329 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RPSR+AV++  +  F REFF+QNPLSQ+GLV  ++G+A  +T+L G+PE+HI AL   
Sbjct: 90  DMRPSRLAVISSLMYKFFREFFNQNPLSQLGLVVTRNGIAERVTELSGNPETHIAALKEN 149

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           L  +GD S+QN+L+ V   L Q+P YG REVL + SALSTCDPG++   I   K +KIR 
Sbjct: 150 LDAAGDMSIQNSLEQVHASLVQLPMYGSREVLFVVSALSTCDPGNVHTAIAAAKNAKIRV 209

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           SV+ ++AEM +C+ + ++TGG + VA  + H +EL+M HAPPPP   E    +L++MGFP
Sbjct: 210 SVVSVAAEMHVCRRMTEETGGIFGVAQSQHHLEELLMAHAPPPPLNEESTKPSLVEMGFP 269

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
           Q+      +  S       G  Y CPRCK+RV ELP+ C +C L LVSS HLARSYHHLF
Sbjct: 270 QKRPLDKDAFFSGR-----GGDYVCPRCKSRVEELPSQCTVCKLTLVSSAHLARSYHHLF 324

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL------LSSGNKPGLYVACPKCKKHFCLE 295
           P+ PF E   L   D   +  ++CF C           S+ N P +   CPKCKK FC  
Sbjct: 325 PVPPFTEGFGL---DDEGKRHTSCFACYVDFEEGNESTSNENAPSV---CPKCKKTFCFV 378

Query: 296 CDIYIHESLHNCPGCESLRHSNPIVANEG 324
           CD+YIHE LHNCPGCE ++     +A EG
Sbjct: 379 CDVYIHEKLHNCPGCELVK----FLAEEG 403


>gi|110739908|dbj|BAF01859.1| putative transcription factor [Arabidopsis thaliana]
          Length = 196

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 145/195 (74%), Positives = 163/195 (83%), Gaps = 1/195 (0%)

Query: 130 MFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSI 189
           MFICKHLCQ+TGG YSVA+DE H K+L++EHAPPPPAIAEFAIANLIKMGFPQRA EGS+
Sbjct: 1   MFICKHLCQETGGLYSVAVDEVHLKDLLLEHAPPPPAIAEFAIANLIKMGFPQRAAEGSM 60

Query: 190 SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEV 249
           +ICSCHKEVK+G GY CPRCKARVC+LPT+C ICGL LVSSPHLARSYHHLFPIAPFDEV
Sbjct: 61  AICSCHKEVKIGAGYMCPRCKARVCDLPTECTICGLTLVSSPHLARSYHHLFPIAPFDEV 120

Query: 250 TPL-CLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCP 308
             L  LND R +   +CFGCQQSL+ +GNKP   V C KCK +FCL+CDIYIHESLHNCP
Sbjct: 121 PALSSLNDNRRKLGKSCFGCQQSLIGAGNKPVPCVTCRKCKHYFCLDCDIYIHESLHNCP 180

Query: 309 GCESLRHSNPIVANE 323
           GCES+     +   E
Sbjct: 181 GCESIHRPKSVSLME 195


>gi|443732857|gb|ELU17421.1| hypothetical protein CAPTEDRAFT_178402 [Capitella teleta]
          Length = 390

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 150/321 (46%), Positives = 215/321 (66%), Gaps = 17/321 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D RP+R+    K +E FV EFFDQNP+SQ+G++T  +  A+ LT+LGG+P+ HI+AL G 
Sbjct: 73  DLRPNRLFASLKLLEKFVEEFFDQNPISQLGILTTSNKRADKLTELGGNPKRHIQALRGL 132

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            +  C G+ SLQNALDL    L  +P +  REVLI++ +L+TCDPG I++TIQ  K + +
Sbjct: 133 KERACIGEPSLQNALDLAAQTLKHMPGHSSREVLIVFGSLTTCDPGSIIDTIQSMKANNV 192

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           + S+IGL+A++ IC+ +C DT G Y V +DE HFKEL+ +   PP A +    ++LI+MG
Sbjct: 193 QVSIIGLAADVRICRKICHDTQGQYHVIVDEPHFKELLNQQVTPPNATSSTE-SSLIRMG 251

Query: 180 FPQRAGEGSISICSCHKEVKVGV-----GYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           FP+       S+C+CH E          GY CP+C+++ CELP +CR CGL LVS+PHLA
Sbjct: 252 FPRHEQGQQPSMCNCHLESSSSEGFSTGGYFCPQCRSKYCELPVECRACGLTLVSAPHLA 311

Query: 235 RSYHHLFPIAPFDEV-TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFC 293
           RSYHHLF + PF+EV T + +    ++ +  C+ CQ  L    N P  Y  C KC++ FC
Sbjct: 312 RSYHHLFLLEPFNEVRTEVAI---FHKEKRICYSCQSQL----NDPMSY-QCSKCRQVFC 363

Query: 294 LECDIYIHESLHNCPGCESLR 314
           ++CD+++HE+LH+CPGC S R
Sbjct: 364 VDCDLFVHETLHSCPGCSSSR 384


>gi|384487455|gb|EIE79635.1| hypothetical protein RO3G_04340 [Rhizopus delemar RA 99-880]
          Length = 395

 Score =  301 bits (771), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 148/304 (48%), Positives = 203/304 (66%), Gaps = 13/304 (4%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RPSR+ +     + FV EFFDQNP+SQ+G++  +DG+A  LT+L G+P  HIKAL  K
Sbjct: 96  DLRPSRIELTLTYAQQFVVEFFDQNPISQLGIIITRDGIAEKLTELSGNPTDHIKALKSK 155

Query: 62  LGC--SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                SG+ SLQNAL L +  +  +PS+G +EVL+++ +L+TCDP DI +TI   ++  +
Sbjct: 156 KNTETSGEPSLQNALQLARASMLGVPSHGSKEVLLIFGSLTTCDPSDIHDTIDLLRKELV 215

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R +V+GL+AE+ IC+ L      +Y V L+E+HFK+L+ E  PPP  +     +NLIKMG
Sbjct: 216 RVNVVGLAAEVQICRAL-----RTYGVVLNEAHFKDLLFEVVPPPAVMQNKNTSNLIKMG 270

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP+R  E + + C CH ++ +G GY CPRCK++VCELP+DC ICGL LVSSPHLARSYHH
Sbjct: 271 FPKRLVEDNATFCVCHSKLTMG-GYICPRCKSKVCELPSDCDICGLTLVSSPHLARSYHH 329

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
           LFP+  FDEV      +  +   + CF C   L  S    G   +CPKCK+ FC ECDI+
Sbjct: 330 LFPVDNFDEV-----KNNISGYATHCFSCLSQLDKSEVVGGNRFSCPKCKQEFCAECDIF 384

Query: 300 IHES 303
           +HES
Sbjct: 385 VHES 388


>gi|66823583|ref|XP_645146.1| TFIIH subunit [Dictyostelium discoideum AX4]
 gi|74861629|sp|Q86KZ2.2|TF2H2_DICDI RecName: Full=General transcription factor IIH subunit 2; AltName:
           Full=TFIIH basal transcription factor complex subunit 2
 gi|60473387|gb|EAL71333.1| TFIIH subunit [Dictyostelium discoideum AX4]
          Length = 461

 Score =  300 bits (767), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 147/313 (46%), Positives = 204/313 (65%), Gaps = 13/313 (4%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +PSR  V+ + VE F++EFFDQNP+SQ+ ++  K+  A  +++L G+   HI+A+   
Sbjct: 113 DLKPSRYQVLLQNVELFIKEFFDQNPISQLSIIITKNSKAEKISELSGNRLRHIQAMKDA 172

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           +   G+ S+QN+L++    L  +P YG REVL ++S+L+TCDP  + +TIQ  K   IR 
Sbjct: 173 IAMEGEPSIQNSLEVALSSLCYVPKYGSREVLFIFSSLTTCDPSSLQKTIQSLKNESIRV 232

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           S I ++AE++ICK + + T G+  V L+E HF E +M    PPP I +   A L++MGFP
Sbjct: 233 SFIHMAAELYICKAIAEQTNGTSKVILNEEHFNESLMLKCQPPPTIGKTEAA-LVEMGFP 291

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
           Q+      S C CH+++K   GY CPRC  + CELPTDC+IC L LVSSPHLARSYHHLF
Sbjct: 292 QQITSTVPSPCICHEKMKYS-GYICPRCGVKSCELPTDCQICNLSLVSSPHLARSYHHLF 350

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP--GLYVACPKCKKHFCLECDIY 299
            I  F+EV    LN        TC GC    LSS  K    L+ +CP+C++ FCL+CD++
Sbjct: 351 QIPLFNEVNWKELN-----KNVTCIGC----LSSSEKSILSLFFSCPRCQEIFCLDCDLF 401

Query: 300 IHESLHNCPGCES 312
           IHESLHNCPGCE+
Sbjct: 402 IHESLHNCPGCEN 414


>gi|147862851|emb|CAN82988.1| hypothetical protein VITISV_011714 [Vitis vinifera]
          Length = 199

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 145/193 (75%), Positives = 156/193 (80%)

Query: 131 FICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSIS 190
           F+    C  T  S    L +SHFKEL++EHAPPPPAIAEFAIANLIKMGFPQRA EG IS
Sbjct: 6   FLLIQYCFITNISVLYLLLQSHFKELLLEHAPPPPAIAEFAIANLIKMGFPQRAAEGVIS 65

Query: 191 ICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVT 250
           ICSCHKE KVG GYTCPRCKARVCELPT+CRICGL LVSSPHLARSYHHLFPI PFDEV+
Sbjct: 66  ICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLVSSPHLARSYHHLFPIPPFDEVS 125

Query: 251 PLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
              LN+P  RS   CFGCQ+SLL  GNKP L VACPKCK+HFCL+CDIYIHESLHNCPGC
Sbjct: 126 LSLLNNPHQRSSRACFGCQESLLIPGNKPTLCVACPKCKQHFCLDCDIYIHESLHNCPGC 185

Query: 311 ESLRHSNPIVANE 323
           ES RHS  +   E
Sbjct: 186 ESFRHSKIVSVTE 198


>gi|328874281|gb|EGG22647.1| TFIIH subunit [Dictyostelium fasciculatum]
          Length = 457

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 145/313 (46%), Positives = 199/313 (63%), Gaps = 14/313 (4%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL-MG 60
           D +P+R   V    E F++EFFDQNP+SQ+ L+  K+  A  ++DL  +P  HI+ L M 
Sbjct: 108 DLKPNRHQAVLIAAENFIKEFFDQNPISQLSLIITKNSKAEKISDLSSNPNRHIQLLKMV 167

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
                GD S+QN+LD+    LS +P YG REV+ +YS+L+TCDPGD+ +TI   K   IR
Sbjct: 168 SSTIEGDPSIQNSLDVSIATLSYVPKYGSREVIFIYSSLTTCDPGDLSKTITTLKNENIR 227

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
            S + L+AE+++C+H+   T G+  V   E H  E ++ H  PPP I +   A L++MGF
Sbjct: 228 VSFVHLAAELYVCRHISDATNGTMKVIETEHHLNEALILHCQPPPTIGKVEAA-LVEMGF 286

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           PQ+    +  +C CH++ K  VGYTCPRC+++ CELPTDC+ICGL LVSSPHLARSYHHL
Sbjct: 287 PQQHTSQTPMMCVCHEQFKY-VGYTCPRCQSKFCELPTDCQICGLSLVSSPHLARSYHHL 345

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPG-LYVACPKCKKHFCLECDIY 299
           F +  F EV P  +     +    C+GC   +     KP  L   CP+CKK FC +CD  
Sbjct: 346 FQVPIFREVNPKSI-----QPNLKCYGCLLPI-----KPNYLNYGCPRCKKIFCFDCDQV 395

Query: 300 IHESLHNCPGCES 312
           +HES+HNCPGCE+
Sbjct: 396 VHESIHNCPGCEN 408


>gi|332017887|gb|EGI58547.1| General transcription factor IIH subunit 2 [Acromyrmex echinatior]
          Length = 598

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 149/322 (46%), Positives = 206/322 (63%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R     K +E F+ EFF QNP+SQ+G++  ++  A  ++DL G+ + HIK L  +
Sbjct: 272 DLKPTRFLCSLKLLEDFIEEFFYQNPISQLGVIITRNKRAEKISDLAGNSKKHIKELQIL 331

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            +    G+ SLQN+L+L   LL  +PS+  +E+L++  AL+TCDPGDI ETI+  K   I
Sbjct: 332 QQTVVGGEPSLQNSLELATKLLKLLPSHASKEILVIIGALTTCDPGDINETIRNMKLDSI 391

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVIGL+AE++ICK +   TGG +SVALD+ H+KE +  H  PPPA      A L+KMG
Sbjct: 392 RCSVIGLAAELYICKRMANITGGEHSVALDDKHYKEQLNAHIDPPPAATRLDAA-LVKMG 450

Query: 180 FPQ-----RAGEGSISICSCH-----KEVK-VGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP       A + SI++C CH     + VK +  GY CP+C ++ CELP +CR CGL LV
Sbjct: 451 FPHHALHSNAPDTSITVCMCHAHNSDETVKFMTTGYLCPQCISKHCELPVECRACGLTLV 510

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYH+LFP+ PF E+ P       +   + CFGCQ++      K     AC KC
Sbjct: 511 SAPHLARSYHYLFPVDPFKEIAP-------DSDYTICFGCQKAFAQKDKK---VYACGKC 560

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
            + FCL+C+I+IHE LH CPGC
Sbjct: 561 NQMFCLDCEIFIHEILHTCPGC 582


>gi|260815126|ref|XP_002602325.1| hypothetical protein BRAFLDRAFT_282199 [Branchiostoma floridae]
 gi|229287633|gb|EEN58337.1| hypothetical protein BRAFLDRAFT_282199 [Branchiostoma floridae]
          Length = 396

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 149/324 (45%), Positives = 214/324 (66%), Gaps = 20/324 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+ V  K +E F+ E+FDQNP+SQ+G++T K+  A  LT+LGG+P+ H+  L  +
Sbjct: 76  DLKPTRILVTLKLLENFIEEYFDQNPISQLGVITTKNKRAEKLTELGGNPKRHVTQLRTL 135

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
               C G+ S+ N+LDL    L  +P++  REVLI+  +L+TCDPGDI  T++  K+  I
Sbjct: 136 SSASCVGEPSIMNSLDLAAQTLKHMPTHTSREVLIIMGSLTTCDPGDINITMKMVKDLNI 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVIGL+AE+ +CK LC  T G+Y + L+++HFK+L++EH  PPPA      ++LIKMG
Sbjct: 196 RCSVIGLAAEVQVCKKLCNMTNGTYGIILEDTHFKDLLLEHCTPPPATVN-TDSSLIKMG 254

Query: 180 FPQRA---GEGSI--SICSCHKEVKVG----VGYTCPRCKARVCELPTDCRICGLQLVSS 230
           FPQ        S+  S+C CH++ +       GY CP+C A+ CELP +C+ICGL LVS+
Sbjct: 255 FPQHTISQDHDSVKPSMCMCHQDTEAQSFTPSGYFCPQCHAKYCELPVECKICGLTLVSA 314

Query: 231 PHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKK 290
           PHLARSYHH FP+  F E+    L+   +R    C GCQ  L    N P +Y  C +C +
Sbjct: 315 PHLARSYHHFFPLENFREIPLEELDAELSR---FCTGCQVQL----NGPVVY-CCTRCSR 366

Query: 291 HFCLECDIYIHESLHNCPGCESLR 314
            FC++CD++IHE+LH+CPGC + R
Sbjct: 367 PFCIDCDLFIHETLHSCPGCINKR 390


>gi|328772181|gb|EGF82220.1| hypothetical protein BATDEDRAFT_19068 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 409

 Score =  294 bits (753), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 142/321 (44%), Positives = 205/321 (63%), Gaps = 12/321 (3%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           +D RP+R+      +E FV EFFDQNPLS +G++  KDG+     +L G+P   +KAL  
Sbjct: 89  LDLRPNRLDCTLNFLEMFVVEFFDQNPLSHLGIIGTKDGMVEKWAELSGNPSEILKALTK 148

Query: 61  KLG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
           +     +G+ SLQNAL+L +  L+ +P +  RE+++LY +L++CDPGDI ETI    +  
Sbjct: 149 RSNRETAGEPSLQNALELARRSLAHVPMHVSREIIVLYGSLTSCDPGDIQETINDLCKEN 208

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAI-AEFAIANLIK 177
           IR SVIGL+AE+ +CK +C  T G Y+V +++ H+KE++  H PPPP +  + + +N+I+
Sbjct: 209 IRVSVIGLAAEVQVCKKMCASTNGVYNVIMNDVHYKEILFAHIPPPPLLQTQLSASNIIQ 268

Query: 178 MGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSY 237
           MGFP       + +CSCH+   +  G+ CPRC A +C +PTDC +C L L+SSP LARSY
Sbjct: 269 MGFPMMKIFDDMVLCSCHQN-PIQKGHICPRCSATICNIPTDCCLCSLTLISSPSLARSY 327

Query: 238 HHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYV------ACPKCKKH 291
           HHLFP+  F E+ P  +      S+  CF CQ     +    G Y+      ACP+CK H
Sbjct: 328 HHLFPVPNFTEIPP--VQSTSYPSQVECFSCQTPSSMTSFHSGTYMIPTARYACPRCKVH 385

Query: 292 FCLECDIYIHESLHNCPGCES 312
            CL+CD+YIH++LHNCPGC S
Sbjct: 386 VCLDCDVYIHDTLHNCPGCMS 406


>gi|307198378|gb|EFN79320.1| General transcription factor IIH subunit 2 [Harpegnathos saltator]
          Length = 400

 Score =  294 bits (753), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 148/324 (45%), Positives = 206/324 (63%), Gaps = 24/324 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R     K +E F+ EFF QNP+SQ+G++T ++  A  ++DL G+ + HIK L  +
Sbjct: 76  DLKPTRFLCSLKLLENFIEEFFYQNPISQLGVITTRNKRAEKISDLAGNSKKHIKELQSL 135

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            +   +G+ SLQN+L+L    L  +PS+  +E+LI+  AL+TCDPGDI ETIQ  K   I
Sbjct: 136 QQTMVTGEPSLQNSLELALKSLRLLPSHASKEILIIIGALTTCDPGDINETIQNMKSDGI 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVIGL+AE++ICK +   TGG + VALD+ H+KE +  H  PPPA      A L+KMG
Sbjct: 196 RCSVIGLAAELYICKRMANATGGEHGVALDDKHYKEQLNSHIDPPPAATRLDAA-LVKMG 254

Query: 180 FPQRAGEGSI-----SICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP  A   ++     ++C CH E        +  GY CP+C ++ CELP +CR CGL LV
Sbjct: 255 FPHHALHSTVTDTPMTVCMCHAESSDELFKLMSTGYLCPQCLSKHCELPVECRSCGLTLV 314

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYH+LFP+A F E+           + S+CFGCQ+       K  +Y+ C KC
Sbjct: 315 SAPHLARSYHYLFPVATFTEI-------EFENNHSSCFGCQKVFTQKDKK--IYI-CEKC 364

Query: 289 KKHFCLECDIYIHESLHNCPGCES 312
            + FCL+C+I+IHESLH CPGC +
Sbjct: 365 DQPFCLDCEIFIHESLHTCPGCAT 388


>gi|307169945|gb|EFN62454.1| General transcription factor IIH subunit 2 [Camponotus floridanus]
          Length = 401

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 151/335 (45%), Positives = 209/335 (62%), Gaps = 28/335 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R     K +E F+ EFF QNP+SQ+G++  ++  A  ++DL G+ + HIK L  +
Sbjct: 77  DLKPTRFLCSLKLLEDFIEEFFYQNPISQLGVIITRNKRAEKISDLAGNSKKHIKELRAL 136

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            +   +G+ SLQN+L+L    L  +PS+  +E+LI+  AL+TCDPGDI ETI+  K   I
Sbjct: 137 QQTIVTGEPSLQNSLELATKSLKLLPSHASKEILIIIGALTTCDPGDINETIRNMKLDSI 196

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVIGL+AE++ICK +   TGG + VALD+ H+KE +  H  PPPA      A L+KMG
Sbjct: 197 RCSVIGLAAELYICKRMANITGGEHGVALDDKHYKEQLNAHIDPPPAATRLDAA-LVKMG 255

Query: 180 FPQR-----AGEGSISICSCHKE-----VKV-GVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP       A + S+++C CH E     VK+   GY CP+C ++ CELP +CR CGL LV
Sbjct: 256 FPHHALHSSASDTSMTVCMCHAENSDESVKLMSTGYLCPQCFSKHCELPVECRACGLTLV 315

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYH+LFP+ PF EVT          + S CFGCQ++      K      C KC
Sbjct: 316 SAPHLARSYHYLFPVEPFREVT-------FEGNHSFCFGCQKTFAQKDKK---VYTCEKC 365

Query: 289 KKHFCLECDIYIHESLHNCPGCESLRHSNPIVANE 323
            + FCL+C+I+IH+ LH CPGC     +NP   N+
Sbjct: 366 NQTFCLDCEIFIHDILHTCPGCA----TNPATYNK 396


>gi|298708881|emb|CBJ30838.1| General transcription factor II H, polypeptide 2 [Ectocarpus
           siliculosus]
          Length = 412

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 149/339 (43%), Positives = 208/339 (61%), Gaps = 37/339 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RPSR+AV  + V+ FV  +FDQNPLSQ+G++  ++G A  +T+L G+P++H++AL   
Sbjct: 80  DMRPSRLAVTLRIVQDFVTNYFDQNPLSQLGILVTREGRAEKITELSGNPKAHMEALTKD 139

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
               G++SLQN L++    L  +P YG+REV+++YS+LSTCDPGDI ETI K K  K+R 
Sbjct: 140 SDTKGEASLQNLLEMACTSLRAVPEYGNREVVVIYSSLSTCDPGDIHETIAKLKTHKVRA 199

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEH---APPPPAIAEFAIANLIKM 178
           SV+GL AEMF+ + L ++T G YS+A DE+H++E +M      P PP       A+L++M
Sbjct: 200 SVVGLGAEMFVLRRLTEETSGDYSIAGDENHYRESLMAQCTPPPTPPGREGAMFADLVRM 259

Query: 179 GFPQRAGE--GSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
           GFP    +   S+      +E+ V  GY+CPRCK +  ELP++C IC L LVSSPHLARS
Sbjct: 260 GFPAETQDVFPSLGYSGNRQELSVS-GYSCPRCKTKTSELPSECVICALPLVSSPHLARS 318

Query: 237 YHHLFPIAPFDEV------------------------TPLCLNDPRNRSRSTCFGCQQSL 272
           YHHLFP+  FDE+                              +  + S S C GC + L
Sbjct: 319 YHHLFPVPQFDEIPAAAPAAAAAAASAGSAGGMGSGAGAGRATEEESASSSYCAGCLRDL 378

Query: 273 LSSGNKPGL-YVACPKCKKHFCLECDIYIHESLHNCPGC 310
                  GL    CP+C+  FCL+CD+Y+H+SLHNCPGC
Sbjct: 379 ------SGLPRYRCPECRSAFCLDCDMYVHDSLHNCPGC 411


>gi|291000862|ref|XP_002682998.1| RNA polymerase II transcription factor TFIIH subunit [Naegleria
           gruberi]
 gi|284096626|gb|EFC50254.1| RNA polymerase II transcription factor TFIIH subunit [Naegleria
           gruberi]
          Length = 349

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 144/313 (46%), Positives = 200/313 (63%), Gaps = 16/313 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D + SR   V    + F++E+FDQNPLSQ+G++T K+GVA  L +L G+ +  +  L   
Sbjct: 48  DEKTSRAQSVISTAKEFIKEYFDQNPLSQLGIITTKNGVAKILKELTGNVKQLVSELKKS 107

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
            G  GD SLQN L+L    L QIP+YG +EV+++Y +LS+CDP +I  TI++ K++ IR 
Sbjct: 108 FG--GDPSLQNVLNLSYDSLHQIPNYGSKEVVVIYCSLSSCDPDNIFSTIKQLKDNNIRV 165

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAP----PPPAIAEFAIANLIK 177
           S+I L AE+F+CK + + T G+YSV  D+ HFKE++M HA        +  +      ++
Sbjct: 166 SLISLDAEVFVCKQIAKQTQGTYSVPTDDEHFKEVLMAHAAPPPTTSSSSTKIVAPPSMR 225

Query: 178 MGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSY 237
           MGFPQR  +   S+C CHK++  G GY CPRCK++ C+LP +C  CGL LVSSPHLARSY
Sbjct: 226 MGFPQRRAQTLYSMCLCHKKITPG-GYICPRCKSKYCDLPVECSTCGLMLVSSPHLARSY 284

Query: 238 HHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECD 297
           HHLFP+  F +        PR      C+GCQ+SL     +  + + CP C   FC+ECD
Sbjct: 285 HHLFPVQQFIDY------QPREGEEPYCYGCQKSL---PKESFISLQCPSCSNIFCVECD 335

Query: 298 IYIHESLHNCPGC 310
            +IHESLHNCPGC
Sbjct: 336 AFIHESLHNCPGC 348


>gi|405976292|gb|EKC40804.1| General transcription factor IIH subunit 2 [Crassostrea gigas]
          Length = 399

 Score =  291 bits (744), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 145/327 (44%), Positives = 210/327 (64%), Gaps = 22/327 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    K +E FV E+FDQNP+SQ+G++  K+  A  +++LGG+P  HI  L  +
Sbjct: 75  DLKPTRLLSTLKLLEYFVEEYFDQNPISQLGIIITKNKRAEKVSELGGNPRRHITTLQSL 134

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
               C G+ SLQN+L+L    L  +P +  REVL++  +L+TCDP +I+ TI   ++  +
Sbjct: 135 ADKVCHGEPSLQNSLELAHQTLRHMPGHASREVLVVMGSLTTCDPDNIISTISMLRDGNV 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVIGL+AE+ +CK LC++T GSYSV LDE HFK+L+ +H  PPPA A    ++LI+MG
Sbjct: 195 RCSVIGLAAEVCVCKKLCKETHGSYSVILDECHFKDLLTQHVTPPPATASTE-SSLIRMG 253

Query: 180 FPQRA----GEGSISICSCHKEVK-----VGVGYTCPRCKARVCELPTDCRICGLQLVSS 230
           FP        E   S+C CH + K        GY CP+CK++ CELP +C+ CGL LVS+
Sbjct: 254 FPHHQLSADKEERPSMCMCHLDSKNSQGFSSTGYFCPQCKSKYCELPIECKACGLTLVSA 313

Query: 231 PHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKK 290
           PHLARSYHHLFP+  F E+    +      S   CF CQ ++     +  +Y+ C  C++
Sbjct: 314 PHLARSYHHLFPLDQFREIPTAQVG-----SNVLCFSCQTAV----QELNVYI-CDNCER 363

Query: 291 HFCLECDIYIHESLHNCPGCESLRHSN 317
            +CL+CD++IHE+LH+CPGC S R + 
Sbjct: 364 IYCLDCDLFIHETLHSCPGCASSRQTQ 390


>gi|350418129|ref|XP_003491749.1| PREDICTED: general transcription factor IIH subunit 2-like [Bombus
           impatiens]
          Length = 400

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 147/324 (45%), Positives = 212/324 (65%), Gaps = 24/324 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R     K +E F+ EFF QNP+SQ+G++  ++  A  +++L G+ + H+K +  M
Sbjct: 76  DLKPTRFLCSLKLLEDFIEEFFYQNPISQLGVIITRNKRAEKVSELAGNSKKHVKEVQNM 135

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            ++   G+ SLQN+++L    L  +PS+  +E+LI+  AL+TCDPGDI ETIQ  K   +
Sbjct: 136 QQIAPVGEPSLQNSIELALKSLRLLPSHASKEILIIVGALTTCDPGDINETIQNMKTDSV 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVIGL+AE++ICK +   TGG + VALD+ H+KE +  H  PPPA      A L+KMG
Sbjct: 196 RCSVIGLAAELYICKRMATATGGEHGVALDDKHYKEQLNMHIDPPPAATRLDAA-LVKMG 254

Query: 180 FPQR-----AGEGSISICSCHKE-----VK-VGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP       A + ++++C CH E     VK +  GY CP+C ++ CELP +CR CGL LV
Sbjct: 255 FPHHALHSSANDSAMAVCMCHAESSDESVKLLSTGYLCPQCLSKHCELPVECRACGLTLV 314

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYH+LFP+  F EV         + + STC+GCQ+ +LS  +K  +Y+ C KC
Sbjct: 315 SAPHLARSYHYLFPVESFKEVV-------FSGTPSTCYGCQK-ILSQKDKK-VYI-CNKC 364

Query: 289 KKHFCLECDIYIHESLHNCPGCES 312
            + FCL+C+I+IHESLH CPGC +
Sbjct: 365 NETFCLDCEIFIHESLHTCPGCAT 388


>gi|383853156|ref|XP_003702089.1| PREDICTED: general transcription factor IIH subunit 2-like
           [Megachile rotundata]
          Length = 400

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 149/324 (45%), Positives = 210/324 (64%), Gaps = 24/324 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           DF+P+R     K +E FV EFF QNP+SQ+G++  ++  A  +++L G+ + HIK +  M
Sbjct: 76  DFKPTRFLCSLKLLEDFVEEFFYQNPISQLGVIITRNKRAEKVSELVGNSKKHIKEIQNM 135

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            ++  +G+ SLQN+L+L    L  +PS+  +E+LI+  AL+TCDPGDI ETIQ  K   I
Sbjct: 136 QQMVPAGEPSLQNSLELALKSLRLLPSHASKEILIIIGALTTCDPGDINETIQGMKSDCI 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVIGL+AE++ICK +   TGG + VALD+ H+KE +  H  PPPA      A L+KMG
Sbjct: 196 RCSVIGLAAELYICKRMATATGGEHGVALDDKHYKEQLNMHIDPPPAATRLDAA-LVKMG 254

Query: 180 FPQRA-----GEGSISICSCHKE-----VK-VGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP  A      + ++++C CH E     VK +  GY CP+C ++ CELP +CR CGL LV
Sbjct: 255 FPHHALHSSVNDSAMAVCMCHAENSEESVKLISTGYLCPQCLSKHCELPVECRACGLTLV 314

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYH+LFP+ PF E          + + S C+GCQ++      K  +Y+ C KC
Sbjct: 315 SAPHLARSYHYLFPVEPFTETV-------FDGTPSLCYGCQKNFSQMDKK--IYI-CNKC 364

Query: 289 KKHFCLECDIYIHESLHNCPGCES 312
            + FCL+C+I+IHESLH CPGC +
Sbjct: 365 SQAFCLDCEIFIHESLHTCPGCAT 388


>gi|332373040|gb|AEE61661.1| unknown [Dendroctonus ponderosae]
          Length = 395

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 147/322 (45%), Positives = 213/322 (66%), Gaps = 22/322 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+RM    + +E F+ EFFDQNP+SQ+G++ +++  A  + +LGG+   HIK L  M
Sbjct: 75  DLKPTRMLCSLRILEGFIEEFFDQNPISQMGIILMQNKRAEKVCELGGNFRKHIKYLRSM 134

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQN L+     L  +P++  REVL++  +L+TCDP DI  TIQK K+  I
Sbjct: 135 SKTSLLGEPSLQNGLEDAFSSLKLVPTHASREVLVIMGSLTTCDPTDINTTIQKFKQEGI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCS+IGL+AE+ IC+ L  +T G+Y++ LD+SH+K+L+ +   PPPA      A+LIKMG
Sbjct: 195 RCSIIGLAAEVHICRTLANETKGTYNIILDDSHYKDLLYQQVEPPPAAVALE-ASLIKMG 253

Query: 180 FP-QRAGEGS---ISICSCH----KEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSS 230
           FP Q   EG+   ++IC CH    +E K+   GY CP+C ++ CELP +CR CGL LVS+
Sbjct: 254 FPHQMNTEGTDEPLTICMCHVDSTEEGKLTTGGYYCPQCLSKYCELPVECRACGLTLVSA 313

Query: 231 PHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKK 290
           PHLARSYHHLFP A F+E+         ++  +TC+ CQ++  + G+K      CP CK+
Sbjct: 314 PHLARSYHHLFPPANFEEMQ-------FDKQVTTCYACQKT-FNDGDKQ--VYQCPTCKR 363

Query: 291 HFCLECDIYIHESLHNCPGCES 312
            FC++CDI+IH++LH CPGC +
Sbjct: 364 MFCIDCDIFIHDTLHTCPGCAT 385


>gi|320166414|gb|EFW43313.1| general transcription factor IIH [Capsaspora owczarzaki ATCC 30864]
          Length = 427

 Score =  288 bits (736), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 148/336 (44%), Positives = 212/336 (63%), Gaps = 20/336 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +P+R+    + +E FV EFFDQNP+SQ+  +  +D  A  +++LGG+P  H+++L  K
Sbjct: 89  DLKPTRLEHSIRLLEQFVPEFFDQNPISQLNFIISRDAKAERISELGGNPAKHLESLRKK 148

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
              +G+ SLQN+L+L +  L  +PS+  +EVLI+   L++CDPGDI +TI   +   I C
Sbjct: 149 ASTAGEISLQNSLELARESLRLMPSHTSKEVLIIMGGLASCDPGDIFQTISLLELDNIHC 208

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           SVIGLSAE+ ICK+L + T G ++V +DESHF++++ +H  PPPA +    A LI+MGFP
Sbjct: 209 SVIGLSAEVRICKYLAEKTKGVHNVIIDESHFRDMLFQHITPPPASSRTE-ALLIRMGFP 267

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
           ++    + ++C+CHK  + G GY CP+CKA+ CELPT C +CGL LVSSPHLARSYHHLF
Sbjct: 268 RQNLSKAATLCACHKLFRAG-GYICPQCKAKCCELPTTCAVCGLTLVSSPHLARSYHHLF 326

Query: 242 PIAPFDEV-------------TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           P+APF EV               L  N        +C GC +   +  +       CP C
Sbjct: 327 PLAPFREVPLAHPPADANGLLAGLVANSTTLALHPSCRGCSKPTATMRSA----FQCPTC 382

Query: 289 KKHFCLECDIYIHESLHNCPGCESLRHSNPIVANEG 324
              +CL+CDI+IH++LH CP C +L   +  VAN G
Sbjct: 383 CHVYCLDCDIFIHDTLHTCPSCVALLDGSA-VANGG 417


>gi|340715742|ref|XP_003396368.1| PREDICTED: general transcription factor IIH subunit 2-like [Bombus
           terrestris]
          Length = 400

 Score =  287 bits (735), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 146/324 (45%), Positives = 212/324 (65%), Gaps = 24/324 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R     K +E F+ EFF QNP+SQ+G++  ++  A  +++L G+ + H+K +  M
Sbjct: 76  DLKPTRFLCSLKLLEDFIEEFFYQNPISQLGVIITRNKRAEKVSELAGNSKKHVKEVQNM 135

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            ++   G+ SLQN+++L    L  +P++  +E+LI+  AL+TCDPGDI ETIQ  K   +
Sbjct: 136 QQIAPVGEPSLQNSIELALKSLRLLPTHASKEILIIVGALTTCDPGDINETIQNMKTDSV 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVIGL+AE++ICK +   TGG + VALD+ H+KE +  H  PPPA      A L+KMG
Sbjct: 196 RCSVIGLAAELYICKRMATATGGEHGVALDDKHYKEQLNMHIDPPPAATRLDAA-LVKMG 254

Query: 180 FPQR-----AGEGSISICSCHKE-----VK-VGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP       A + ++++C CH E     VK +  GY CP+C ++ CELP +CR CGL LV
Sbjct: 255 FPHHALHSSANDSAMAVCMCHAESSDESVKLLSTGYLCPQCLSKHCELPVECRACGLTLV 314

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYH+LFP+  F EV         + + STC+GCQ+ +LS  +K  +Y+ C KC
Sbjct: 315 SAPHLARSYHYLFPVESFKEVV-------FSGTPSTCYGCQK-ILSQKDKK-VYI-CNKC 364

Query: 289 KKHFCLECDIYIHESLHNCPGCES 312
            + FCL+C+I+IHESLH CPGC +
Sbjct: 365 NETFCLDCEIFIHESLHTCPGCAT 388


>gi|380019349|ref|XP_003693572.1| PREDICTED: general transcription factor IIH subunit 2-like [Apis
           florea]
          Length = 405

 Score =  286 bits (732), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 147/327 (44%), Positives = 212/327 (64%), Gaps = 25/327 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R     K +E F+ EFF QNP+SQ+G++  ++  A  +++L G+ + HIK +  M
Sbjct: 76  DLKPTRFLCSLKLLEDFIEEFFYQNPISQLGVIITRNKRAEKVSELTGNSKKHIKEVQSM 135

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            ++  +G+ SLQN+++L    L  +PS+  +E+LI+  AL+TCDPGDI ETI+  K   +
Sbjct: 136 QQITPAGEPSLQNSIELALKSLRLLPSHASKEILIIVGALTTCDPGDINETIKNMKSDSV 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVIGL+AE++ICK +   TGG + VALD+ H+KE +  H  PPPA      A L+KMG
Sbjct: 196 RCSVIGLAAELYICKRMATATGGEHGVALDDKHYKEQLNMHIDPPPAATRLDAA-LVKMG 254

Query: 180 FPQRA-----GEGSISICSCHKE-----VK-VGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP  A      + ++++C CH E     VK +  GY CP+C ++ CELP +CR CGL LV
Sbjct: 255 FPHHALHSSTNDSAMAVCMCHAESSDESVKLLSTGYLCPQCLSKHCELPVECRACGLTLV 314

Query: 229 SSPHLARSYHHLFPIAPFDEV---TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVAC 285
           S+PHLARSYH+LFP+ PF EV     +    P     S C+GCQ+ + S  +K  +Y+ C
Sbjct: 315 SAPHLARSYHYLFPVEPFKEVEYRKEVTFEHP-----SICYGCQK-IFSQKDK-KVYI-C 366

Query: 286 PKCKKHFCLECDIYIHESLHNCPGCES 312
            KC + FCL+C+I+IHESLH CPGC +
Sbjct: 367 NKCNQTFCLDCEIFIHESLHTCPGCAT 393


>gi|348680619|gb|EGZ20435.1| hypothetical protein PHYSODRAFT_345458 [Phytophthora sojae]
          Length = 387

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 139/325 (42%), Positives = 199/325 (61%), Gaps = 27/325 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D++P R+A  +  ++ FV+++FDQNP+SQ+G++ +K   A  L+DL G+P +H++ +   
Sbjct: 73  DWKPHRLACASDVLQQFVKDYFDQNPISQLGVIGIKGMTAEKLSDLSGNPNTHMERIAAA 132

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           L    + SLQNAL++ +  L  +P+YG RE++++Y  L T DPGDI +T+   K   IR 
Sbjct: 133 LAVDKEPSLQNALEIAKSSLKTVPAYGSREIVVVYGNLVTADPGDIFQTLASLKRENIRV 192

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFA--IANLIKMG 179
           S IG+ AEM + + +   T G+Y VA+D  H K L+     P P +A  A   A L++MG
Sbjct: 193 SFIGIGAEMHLLRRIADGTDGTYHVAVDAEHMKRLMTAFTFPSPTVATAASRFATLVEMG 252

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FPQR   G++S+C+CH+     VGY CPRCK++ C+LPT C++C L LVSSPHLARSYHH
Sbjct: 253 FPQRR-SGALSLCTCHQAFTT-VGYLCPRCKSKSCDLPTTCQVCNLPLVSSPHLARSYHH 310

Query: 240 LFPIAPFDE--------------VTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVAC 285
           LFP+A F +              VTP  +       +  CFGC   L   G   G    C
Sbjct: 311 LFPVAKFTQHLLRSGVTGEKGAKVTPSLV-------QKKCFGCLLPLGLDGE--GTAYEC 361

Query: 286 PKCKKHFCLECDIYIHESLHNCPGC 310
             C+  FC ECD+Y+H+SLHNCPGC
Sbjct: 362 TTCQNVFCSECDMYVHDSLHNCPGC 386


>gi|66549743|ref|XP_394997.2| PREDICTED: general transcription factor IIH subunit 2 isoform 1
           [Apis mellifera]
          Length = 405

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 146/327 (44%), Positives = 210/327 (64%), Gaps = 25/327 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R     K +E F+ EFF QNP+SQ+G++  ++  A  +++L G+ + HIK +  M
Sbjct: 76  DLKPTRFLCSLKLLEDFIEEFFYQNPISQLGVIITRNKRAEKVSELTGNSKKHIKEVQSM 135

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            ++  +G+ SLQN+++L    L  +PS+  +E+LI+  AL+TCDPGDI ETI+  K   +
Sbjct: 136 QQITPAGEPSLQNSIELALKSLRLLPSHASKEILIIVGALTTCDPGDINETIKNMKLDSV 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVIGL+AE++ICK +   TGG + VALD+ H+KE +  H  PPPA      A L+KMG
Sbjct: 196 RCSVIGLAAELYICKRMATATGGEHGVALDDKHYKEQLNMHIDPPPAATRLDAA-LVKMG 254

Query: 180 FPQRA-----GEGSISICSCHKE-----VK-VGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP  A      + ++++C CH E     VK +  GY CP+C ++ CELP +CR CGL LV
Sbjct: 255 FPHHALHSSTNDSAMAVCMCHAESSDESVKLLSTGYLCPQCLSKHCELPVECRACGLTLV 314

Query: 229 SSPHLARSYHHLFPIAPFDEV---TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVAC 285
           S+PHLARSYH+LFP+ PF EV     +    P     S C+GCQ+  + S     +Y+ C
Sbjct: 315 SAPHLARSYHYLFPVEPFKEVEYRKEVTFEHP-----SICYGCQK--IFSQKDKKVYI-C 366

Query: 286 PKCKKHFCLECDIYIHESLHNCPGCES 312
            KC + FCL+C+I+IHESLH CPGC +
Sbjct: 367 NKCNQTFCLDCEIFIHESLHTCPGCAT 393


>gi|406604786|emb|CCH43771.1| General transcription factor IIH subunit 2 [Wickerhamomyces
           ciferrii]
          Length = 420

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 146/320 (45%), Positives = 200/320 (62%), Gaps = 16/320 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R ++       FV EFFDQNP+SQ+G++ +++G+   ++ +GG+P+ HI AL  +
Sbjct: 107 DLRPNRHSITITNAIQFVTEFFDQNPISQLGIIIMRNGIGQLVSQVGGNPQEHIDALKSI 166

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            KL   GD SLQNAL++ +GLL  + S+  REVLI+Y +L + DPG+I +T+    E KI
Sbjct: 167 KKLEPKGDPSLQNALEMSRGLLMHVASHCTREVLIIYGSLLSLDPGNIHKTVNSLVEEKI 226

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  +IGLSA++ ICK +C+ T      SY+V L+E HFKEL M+   P P         L
Sbjct: 227 RVKIIGLSAQVSICKEICKKTNFGDENSYNVILNEQHFKELFMDAVVPLPVNKINKSFTL 286

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E S S CSCH ++  G GY CP CK+++C LPT C  C   L+ S HLAR
Sbjct: 287 VKMGFPYRISEDSPSFCSCHSKLTYG-GYICPNCKSKICSLPTICPCCNTMLILSTHLAR 345

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL---LSSGNKPGLYVACPKCKKHF 292
           SYHHL P+  F EV P+     +N   + CFGCQ  L   L +      Y  C  C   F
Sbjct: 346 SYHHLLPLKNFIEV-PV----DKNYDVNLCFGCQMELPDGLITQTSSSRY-KCVDCNNQF 399

Query: 293 CLECDIYIHESLHNCPGCES 312
           C++CD++IHE+LHNCPGCE+
Sbjct: 400 CIDCDVFIHETLHNCPGCEN 419


>gi|449670653|ref|XP_002167169.2| PREDICTED: general transcription factor IIH subunit 2-like [Hydra
           magnipapillata]
          Length = 384

 Score =  284 bits (727), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 145/317 (45%), Positives = 201/317 (63%), Gaps = 21/317 (6%)

Query: 3   FRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG-- 60
            RPSR+A + K +E F+ E+FDQNP+SQ      ++  A  +TDL G+P  HI AL    
Sbjct: 78  LRPSRLACLTKLLENFIVEYFDQNPISQ-----TRNKRAEKITDLSGNPNIHIAALQNFV 132

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           K G  G+ SLQN+LD+    L  +P +  RE+LI++ +L+TCDPGDI  TI   K   IR
Sbjct: 133 KSGPEGEPSLQNSLDMALQFLGHLPKHASREILIVFGSLTTCDPGDIFTTINNLKNENIR 192

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSV+GLSAE+ +CK +  +T G Y+V LDE H  +L++EH  PP A      A+L++MGF
Sbjct: 193 CSVLGLSAEIKLCKTISSETNGIYNVILDEKHCNDLLLEHIRPPAAKLNVE-ASLVRMGF 251

Query: 181 PQRAGEGSISICSCHKEVKVG-----VGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           PQ       ++C CH E+K        GY CP+CK + CELP +C++CGL LVS+PHLAR
Sbjct: 252 PQHISNTYPALCLCHIELKNMQGFNCTGYFCPQCKNKYCELPVECKVCGLTLVSAPHLAR 311

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLE 295
           SY HLFP+ PF+EV  +     +N   S C GCQ++ + +     +   C +CK+ FC +
Sbjct: 312 SYQHLFPLPPFEEVRRIETAANKN---SICQGCQRNCIDA-----IVYICKECKEMFCND 363

Query: 296 CDIYIHESLHNCPGCES 312
           CD++IHE+LH CPGC S
Sbjct: 364 CDMFIHETLHTCPGCTS 380


>gi|42415511|ref|NP_963875.1| general transcription factor IIH subunit 2 [Danio rerio]
 gi|28278887|gb|AAH45397.1| General transcription factor IIH, polypeptide 2 [Danio rerio]
          Length = 392

 Score =  283 bits (724), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 152/323 (47%), Positives = 207/323 (64%), Gaps = 26/323 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +P+R+    K +E FV E+FDQNP+SQIG++T K+  A  LTDL G+P+ HI AL   
Sbjct: 75  DLKPNRLTSTLKLMEHFVEEYFDQNPISQIGIITTKNKRAEKLTDLAGNPKKHITALRKA 134

Query: 62  LG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
           +   C G+ SL N+L++    L  +P++  REVL+++S+L+TCDPG+I E I+     KI
Sbjct: 135 VDSTCVGEPSLYNSLNMALQTLKHMPAHTSREVLVIFSSLTTCDPGNIYELIKTLNGLKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGGSY+V LDESHFKEL++ H  PPPA +  +  +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTILTRETGGSYNVILDESHFKELLLLHVKPPPASSS-SECSLIRMG 253

Query: 180 FPQ--------RAGEGSISIC----SCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQL 227
           FPQ        +  + S S+     S   E+ +G GY CP+C+A+  ELP +C++CGL L
Sbjct: 254 FPQHVIASLSDQDAKPSFSMAHLDSSSEPELSLG-GYYCPQCRAKYTELPVECKVCGLTL 312

Query: 228 VSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPK 287
           VS+PHLARS+HHLFP+  F E TPL            C  CQ  L     K      C  
Sbjct: 313 VSAPHLARSFHHLFPLEAFQE-TPL----EEYEGERFCEACQGEL-----KDKSVFTCLA 362

Query: 288 CKKHFCLECDIYIHESLHNCPGC 310
           CKK FC+ECDI+IH++LH CPGC
Sbjct: 363 CKKVFCVECDIFIHDTLHCCPGC 385


>gi|390343479|ref|XP_794347.2| PREDICTED: general transcription factor IIH subunit 2-like
           [Strongylocentrotus purpuratus]
          Length = 409

 Score =  281 bits (718), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 147/329 (44%), Positives = 206/329 (62%), Gaps = 22/329 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D +P+R+A   K +E F+ E+FDQNP+SQ+G++T  +  A  LT+LGG+P+ HI AL   
Sbjct: 76  DLKPNRLACCTKLLEHFIEEYFDQNPISQVGIITSSNMRAEKLTELGGNPQRHITALEKC 135

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
               C  + SLQNAL+L    L  +P +  RE+L++  +L+TCDPG+I +TIQ  K+  I
Sbjct: 136 NDKPCVKEPSLQNALELAAATLRHMPGHASREILVIMGSLTTCDPGNIHDTIQAMKDHSI 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RC VIGL+A++ +C+ L   T G+Y V LDE+HFKEL+MEH  PPPA       + I+MG
Sbjct: 196 RCCVIGLAADVRVCRKLATVTHGTYGVILDETHFKELLMEHTIPPPARVNTE-PSPIRMG 254

Query: 180 FPQ------RAGEGSISICSCHKEVKVGV-----GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ      ++   + S+C CH + K        GY CP+C+++ CELP +CR+CGL LV
Sbjct: 255 FPQHVIHTDKSKVSAPSMCMCHLDGKNSEGFGTGGYFCPQCQSKYCELPVECRVCGLTLV 314

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARS+HHLFP+  F+E       D  +     C GCQ  +     +      CPKC
Sbjct: 315 SAPHLARSFHHLFPLDRFEEFK---REDHDHPDSLFCQGCQSHI-----RDQTAYRCPKC 366

Query: 289 KKHFCLECDIYIHESLHNCPGCESLRHSN 317
              FCL+C+++I ESLH+CPGC S R S 
Sbjct: 367 SNVFCLDCELFIQESLHSCPGCASTRPSQ 395


>gi|409044666|gb|EKM54147.1| hypothetical protein PHACADRAFT_185111 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 451

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 145/332 (43%), Positives = 210/332 (63%), Gaps = 28/332 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D RP+R  ++ +    F+ E+FDQNPL QIG+V ++ G+   + ++ G+P+  ++++   
Sbjct: 123 DMRPTRFDLMLQYAREFIAEWFDQNPLGQIGIVGMRGGIGERIGEMSGNPQDVLRSIAER 182

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            KL   G+ SLQNA+++ +  +S +P+   RE+ I++ +L+TCDPG+I +T+ +C + KI
Sbjct: 183 HKLEPVGEPSLQNAIEMARSSMSHLPTNSSREITIIFGSLTTCDPGNIHDTLDECVKDKI 242

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPP----------PAIAE 169
           R S++ L+AEM IC+ LC  TGG + VAL+E HFK+L+ E  PPP          P  A 
Sbjct: 243 RISLVALAAEMKICRELCVKTGGQFGVALNEGHFKDLLFELIPPPAQRALTRANGPGAAN 302

Query: 170 FAIANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
            A A+LI MGFP R    S  S+C CH E+K   G+ CPRC A+VC++PTDC ICGL +V
Sbjct: 303 PA-ADLIMMGFPTRLPVTSQPSLCVCHGELK-SEGFLCPRCLAKVCDVPTDCDICGLMIV 360

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC----QQSLLS------SGNK 278
           SSPHLARSYHHLFP+ P+  +T L   D      +TC+GC    Q+ +L        G  
Sbjct: 361 SSPHLARSYHHLFPVKPYTPITNL---DDATSLSATCYGCSVPFQEKVLQPNVPLVDGIS 417

Query: 279 PGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
           P     CP+C+  FC ECD++IH+ LH CPGC
Sbjct: 418 PLGRYRCPECEHDFCTECDVFIHDVLHCCPGC 449


>gi|449271199|gb|EMC81725.1| General transcription factor IIH subunit 2 [Columba livia]
          Length = 395

 Score =  280 bits (716), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 148/322 (45%), Positives = 204/322 (63%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    K +E FV E+FDQNP+SQ+GL+  K   A  +T+L G+ + HI AL   
Sbjct: 75  DLKPNRLTCTLKLLEYFVDEYFDQNPISQVGLIVTKSKRAEKMTELSGNSKKHITALKKA 134

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + CSG+ SL NAL+L    L  +P +  REVLI++S+L+TCDP +I + I+  K  KI
Sbjct: 135 ADMTCSGEPSLYNALNLAMQTLKHMPGHTSREVLIVFSSLTTCDPANIYDLIKCLKAVKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+M H  PPPA +  +  +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLMHHVSPPPASSN-SECSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAQLESSSEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F EV PL       +    C GCQ  +        +Y+ C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEV-PL----EEYQGERYCQGCQGEM----KDQNVYI-CKVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC+ECD+++H+SLH CPGC
Sbjct: 364 QNAFCVECDVFVHDSLHCCPGC 385


>gi|213511927|ref|NP_001135173.1| general transcription factor IIE, polypeptide 2-2 [Salmo salar]
 gi|197632439|gb|ACH70943.1| general transcription factor IIE, polypeptide 2-2 [Salmo salar]
          Length = 393

 Score =  280 bits (715), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 149/323 (46%), Positives = 201/323 (62%), Gaps = 25/323 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +P+R+    K VE FV E+FDQNP+SQ+G++T K+  A  LTDL G+P+ HI AL   
Sbjct: 75  DLKPNRLTSTLKLVEYFVEEYFDQNPISQVGIITTKNKRAEKLTDLAGNPKKHIDALKKA 134

Query: 62  LG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
               C G+ SL N++ L    L  +P +  REVLI++S+L+TCDP +I E ++  K  KI
Sbjct: 135 KDSTCGGEPSLYNSISLAMQTLKHMPGHSSREVLIIFSSLTTCDPANIYELVKTLKALKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGGSY V LDESHF+EL+M H  PPPA +  +  +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLTRETGGSYHVILDESHFRELLMFHVKPPPATSS-SECSLIRMG 253

Query: 180 FPQRA------GEGSISICSCHKEVKVG------VGYTCPRCKARVCELPTDCRICGLQL 227
           FPQ         +   S    H +   G       GY CP+C+A+  ELP +C++CGL L
Sbjct: 254 FPQHTIASLSDQDAKPSFSQAHLDSTSGGPGLSMGGYFCPQCQAKYTELPVECKVCGLTL 313

Query: 228 VSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPK 287
           VS+PHLARS+HHLFP+  F E +PL L+   +R R  C  CQ  L     K      CP 
Sbjct: 314 VSAPHLARSFHHLFPLEAFQE-SPLLLH---HRER-FCEACQGEL-----KDRSVFTCPS 363

Query: 288 CKKHFCLECDIYIHESLHNCPGC 310
           C   FC+ECD++IH++LH CP C
Sbjct: 364 CSSVFCVECDLFIHDTLHCCPSC 386


>gi|367002171|ref|XP_003685820.1| hypothetical protein TPHA_0E02960 [Tetrapisispora phaffii CBS 4417]
 gi|357524119|emb|CCE63386.1| hypothetical protein TPHA_0E02960 [Tetrapisispora phaffii CBS 4417]
          Length = 490

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 149/325 (45%), Positives = 200/325 (61%), Gaps = 22/325 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A+  +    F++EFFD+NP+SQ+G+V +++G+AN L+ +GG+P++HI AL  +
Sbjct: 170 DLRPNRNAMSIQYAIDFIQEFFDENPISQLGIVIMRNGLANLLSPIGGNPQAHIDALKSI 229

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            +    G+ SLQNAL++ +GLL  +PS+  REVLI++  LS+ DPGDI +TIQ     KI
Sbjct: 230 RREEPKGNPSLQNALEMARGLLLPVPSHCTREVLIVFGGLSSTDPGDIHQTIQSLVNEKI 289

Query: 120 RCSVIGLSAEMFICKHLCQ----DTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  VIGLSA++ ICK LC+    D    Y V LDE HFKEL  E   P P         L
Sbjct: 290 RVKVIGLSAQVAICKELCKATNYDDESFYRVILDEVHFKELYDEAVTPLPVNKINKAFTL 349

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + + C+CH ++  G GY CP C ++VC LPT C  C L L+ S HLAR
Sbjct: 350 VKMGFPTRVFEENPTFCACHSKLVYG-GYFCPNCNSKVCSLPTVCPCCDLMLILSTHLAR 408

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRST--CFGCQQSLLSSGNK------PGLYVACPK 287
           SYHHL P+  F EV       P + +  T  CF CQ +  S  NK            C  
Sbjct: 409 SYHHLMPLKTFSEV-------PTSETFETENCFSCQMTFPSLRNKKTGELLTSSRYRCQD 461

Query: 288 CKKHFCLECDIYIHESLHNCPGCES 312
           C+  FC++CD++IHE LHNCPGCES
Sbjct: 462 CENDFCIDCDLFIHEILHNCPGCES 486


>gi|50308759|ref|XP_454384.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49643519|emb|CAG99471.1| KLLA0E09615p [Kluyveromyces lactis]
          Length = 488

 Score =  279 bits (713), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 200/323 (61%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A+  +    F+ E+FDQNP+SQIG+V ++DG+A  ++ + G+P+ HI AL  +
Sbjct: 164 DLRPNRYAMTVQYAVDFIHEYFDQNPISQIGVVVMRDGLAQLISQVSGNPQEHIDALKML 223

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            +    G++SLQNAL++ +GLL  +P++  REVLI++ ALST DPGDI +TI       I
Sbjct: 224 RRQEPKGNASLQNALEMARGLLLHVPAHCTREVLIVFGALSTTDPGDIHQTIGSLVHENI 283

Query: 120 RCSVIGLSAEMFICKHLCQDT----GGSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  VIGLSA++ +CK LC+ T       Y + L++ HFKEL      P P         L
Sbjct: 284 RAKVIGLSAQVSVCKELCKQTNYGDNSYYGIILNDHHFKELFTSAVTPLPVSKVNKGFTL 343

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + S C+CH  +  G GY CP CK +VC LPT C  C L L+ S HLAR
Sbjct: 344 VKMGFPTRVFEDTPSFCACHSTLVHG-GYVCPNCKTKVCSLPTVCPSCDLMLILSTHLAR 402

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLS-SGNKPGLYVA-----CPKCK 289
           SYHHL P+  F EV P+  + P +     CFGCQ+   +   NK G  +      C +C 
Sbjct: 403 SYHHLMPLKTFQEV-PVSDSFPTD----YCFGCQKRFPTLRNNKTGELLTSSRYRCEECD 457

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           + FC++CD++IHE+LHNCPGCES
Sbjct: 458 QDFCIDCDVFIHETLHNCPGCES 480


>gi|149727084|ref|XP_001504093.1| PREDICTED: general transcription factor IIH subunit 2 [Equus
           caballus]
          Length = 395

 Score =  279 bits (713), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 148/322 (45%), Positives = 204/322 (63%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIITKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 62  LG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
           +   C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP DI   IQ  K +KI
Sbjct: 135 VAMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSDIYGLIQTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +  +  +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSN-SECSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSSSEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL   +  N  R  C+GCQ  L        +Y+ C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL---EEHNGER-FCYGCQGEL----KDQHVYI-CTVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 RHVFCVDCDVFVHDSLHCCPGC 385


>gi|145348017|ref|XP_001418454.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578683|gb|ABO96747.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 407

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 143/309 (46%), Positives = 197/309 (63%), Gaps = 12/309 (3%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DFRP+R++VV +   +F+RE+F+QNPLSQ+G++  ++GVA  LT+L GSPE+H  AL   
Sbjct: 105 DFRPNRLSVVGQCATSFIREYFNQNPLSQLGIIVARNGVAERLTELSGSPEAHAAALRNA 164

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           L  SGD SLQN L++ +  L  IPSYG REVL + S+LSTCDPG++   I   K +K+R 
Sbjct: 165 LDASGDFSLQNTLNVARTSLKSIPSYGSREVLYIVSSLSTCDPGNVWTEIAATKAAKVRV 224

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           SV+ ++AE+ + + L ++TGG+Y V+L+  H  +LIM HAPPPP   +   ++L++MGFP
Sbjct: 225 SVVAVAAELHVSRRLTEETGGTYGVSLNADHLDDLIMAHAPPPPLPEDATKSSLVQMGFP 284

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
           Q+      ++      V  G  Y CPRC  R+ ELP+ C +C L LVSSPHLARSYHHLF
Sbjct: 285 QKKHLSKDALV-----VGTGGEYVCPRCSGRIDELPSQCAVCRLTLVSSPHLARSYHHLF 339

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
           P+APF E T    ++   +    C  C    L+   +      C +C   FC  CD YIH
Sbjct: 340 PVAPFKEYT---RDEASAKEVLECKAC----LALIKQTDPASKCEQCSNVFCFACDCYIH 392

Query: 302 ESLHNCPGC 310
           E LHNCP C
Sbjct: 393 EKLHNCPHC 401


>gi|156546970|ref|XP_001599241.1| PREDICTED: general transcription factor IIH subunit 2-like [Nasonia
           vitripennis]
          Length = 404

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 145/324 (44%), Positives = 205/324 (63%), Gaps = 24/324 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R     K +E F+ EFF QNP+SQ+G++  ++  A  +TDL G+P+  ++ L  +
Sbjct: 76  DLKPTRFLCALKLLEDFIDEFFYQNPISQLGIIITRNKRAEKITDLTGNPKKPLQDLKNL 135

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            +   +G+ SLQN+L+L    L  +PS+  +E+L++  +L+TCDPGDI ETIQ  K   +
Sbjct: 136 QQTSFTGEPSLQNSLELAAKTLKMLPSHASKEILLIMGSLTTCDPGDIGETIQSLKSDGV 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVIGL+AE+ ICK +  +TGG + V LD+ HFKE +  H  PPPA      A L+KMG
Sbjct: 196 RCSVIGLAAELNICKIMAVNTGGEHGVVLDDKHFKEKLTAHVDPPPAATRLDAA-LVKMG 254

Query: 180 FPQRA-----GEGSISICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP  A      E S+++C CH +           GY CP+C ++ CELP +CR CGL LV
Sbjct: 255 FPHHALQASTTELSMAVCMCHADKPDESSRFTNPGYLCPQCLSKHCELPVECRGCGLTLV 314

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYH+LFPI  F E       DP     ++C+ CQ++ +    K  +YV C KC
Sbjct: 315 SAPHLARSYHYLFPIKHFKEEQ--YEGDP-----ASCYACQKTFVELDKK--VYV-CEKC 364

Query: 289 KKHFCLECDIYIHESLHNCPGCES 312
           K+ FCL+C+I+IHESLH CPGC +
Sbjct: 365 KQTFCLDCEIFIHESLHTCPGCAT 388


>gi|336371442|gb|EGN99781.1| hypothetical protein SERLA73DRAFT_179993 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336384207|gb|EGO25355.1| hypothetical protein SERLADRAFT_465378 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 450

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 138/329 (41%), Positives = 208/329 (63%), Gaps = 24/329 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  +  +    F+ E+FDQNPL QIG+V ++ G+   + ++ G+P+  +K++  +
Sbjct: 124 DMRPTRFDLTLEYAREFITEWFDQNPLGQIGVVGMRAGIGERIGEMSGNPQDVLKSIADR 183

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             L  +G+ SLQNA+D+ +  +S +P++  RE+L+++ +L+TCDPG+I +T+  C ++KI
Sbjct: 184 HRLEPTGEPSLQNAVDMARSSMSHLPTHSSREILVIFGSLTTCDPGNIHDTLDACVKNKI 243

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP---------AIAEF 170
           R SV+ L+AEM IC+ LC  TGG + VAL+E HFK+L+ E  PPP          + A  
Sbjct: 244 RISVVALAAEMKICRELCDKTGGQFGVALNEGHFKDLLFELVPPPAQRAMTRAAGSGATN 303

Query: 171 AIANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
             A+L+ MGFP R  + S  S+C CH E+K   G+ CPRC A+VC++PTDC ICGL +VS
Sbjct: 304 PAADLMMMGFPTRLPDTSPASLCVCHSEMK-SEGFLCPRCMAKVCDVPTDCDICGLMIVS 362

Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQ--------SLLSSGNKPGL 281
           SPHLARSYHHLFP+ P+D +  L  + P   +   C  C +        S  + G  P  
Sbjct: 363 SPHLARSYHHLFPVKPYDAIGSLA-DTP--DAGLACHACARIFPTTALTSTSTEGMSPLN 419

Query: 282 YVACPKCKKHFCLECDIYIHESLHNCPGC 310
              CP+C   FC +CD+++H+ +H CPGC
Sbjct: 420 RYRCPECHNDFCGDCDVFVHDVVHCCPGC 448


>gi|393246283|gb|EJD53792.1| TFIIH basal transcription factor complex, subunit SSL1 [Auricularia
           delicata TFB-10046 SS5]
          Length = 464

 Score =  278 bits (711), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 143/330 (43%), Positives = 209/330 (63%), Gaps = 26/330 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  ++ + V  FV E+FDQNPL QIG+V ++ G+A  + D+ GSP+  +K++   
Sbjct: 137 DLRPTRFDLMLEYVREFVTEWFDQNPLGQIGIVGMRAGLAERVCDMSGSPQDVLKSISER 196

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            KL  +GD SLQNAL+L +  +S +PS+  REVL+++ +++T DPG+I +T+  C E KI
Sbjct: 197 HKLEPTGDPSLQNALELSRASMSHLPSHSSREVLMIFGSITTVDPGNIHDTLAACLEDKI 256

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP--------AIAEFA 171
           R S++ L+AEM IC+ LC+ +GG + VAL+E HFK+++ E  PPP         A     
Sbjct: 257 RISIVALAAEMRICRELCEKSGGVFGVALNEGHFKDVMFELIPPPAQRAAAKAAAGGGGT 316

Query: 172 IANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSS 230
            A+L+ M FP R  E S  ++C CH E+K   G+ CPRC ARVC++PTDC +CG+ +VSS
Sbjct: 317 GADLLLMAFPVRLPESSAPTLCVCHLELK-SEGFLCPRCFARVCDVPTDCDVCGIMIVSS 375

Query: 231 PHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-----------LSSGNKP 279
           PHLARSYHHLFP+  +   TP+   +   ++ +TC GC  +             + G  P
Sbjct: 376 PHLARSYHHLFPVKAY---TPVMTLEGEQQTSATCHGCSMAFQPHKAAASAGATAEGVSP 432

Query: 280 GLYVACPKCKKHFCLECDIYIHESLHNCPG 309
                CP+C+  FC ECDI++H+ LH CPG
Sbjct: 433 LGRYRCPECQNDFCNECDIFVHDVLHVCPG 462


>gi|344272627|ref|XP_003408133.1| PREDICTED: general transcription factor IIH subunit 2-like
           [Loxodonta africana]
          Length = 395

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 143/322 (44%), Positives = 203/322 (63%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L++    L  +P +  REVLI++S+L+TCDP +I + ++  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLNMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLVKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +  +  +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSN-SECSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C++CGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKVCGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+     N  R      C+GCQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEIPLEEYNGER-----FCYGCQGEL----KDQHVYV-CTVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385


>gi|170098262|ref|XP_001880350.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164644788|gb|EDR09037.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 396

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 135/330 (40%), Positives = 208/330 (63%), Gaps = 26/330 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIK--ALM 59
           D RP+R  ++ +    F+ E+FDQNPL QIG+V ++ G+   + ++ G+P+  +K  A  
Sbjct: 70  DMRPTRFDLMLQYAREFIAEWFDQNPLGQIGVVGMRAGIGERIGEMSGNPQDVLKYIAER 129

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            KL  +G+ SLQNA+++ +  ++ +P++  RE+LI++ +L+TCDPG+I +TI  C ++KI
Sbjct: 130 HKLEPTGEPSLQNAIEMARSSMNHLPTHSSREILIIFGSLTTCDPGNIHDTIDTCVKNKI 189

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAI------- 172
           R SV+ L+AEM +C+ LC  TGG + VA++E HFK+L+ E  PPP   A   +       
Sbjct: 190 RISVVALAAEMKVCRELCNKTGGQFGVAMNEGHFKDLLFELIPPPAQHATSRVAGAGTGS 249

Query: 173 ----ANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQL 227
               A+L+ MGFP R  + +  ++C CH E+K   G+ CPRC ++VC++PTDC ICGL +
Sbjct: 250 SNPAADLMMMGFPTRLPDATAPTLCVCHSELK-SEGFLCPRCLSKVCDVPTDCDICGLMI 308

Query: 228 VSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-------LSSGNKPG 280
           VSSPHLARSYHHLFP+ P+D +    + DP     S+C  C +          + G  P 
Sbjct: 309 VSSPHLARSYHHLFPVKPYDALPYGDVTDP----SSSCHACSRPFPTAVSASATEGVSPV 364

Query: 281 LYVACPKCKKHFCLECDIYIHESLHNCPGC 310
               CP+C   FC +CD+++H+ +H CPGC
Sbjct: 365 GRYKCPECHHDFCADCDVFVHDVIHCCPGC 394


>gi|225719554|gb|ACO15623.1| General transcription factor IIH subunit 2 [Caligus clemensi]
          Length = 397

 Score =  277 bits (709), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 140/326 (42%), Positives = 205/326 (62%), Gaps = 24/326 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D +P+R+  V K +E F+ E+F  NP+SQ+G++  ++  +  L++LGG+P  H + L   
Sbjct: 74  DMKPTRLLSVMKLLEVFIEEYFYLNPISQLGIILTRNKRSEVLSELGGNPRKHTERLRKA 133

Query: 60  -GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
              + C+G+ SLQNALD     L  +PS+G +E+L ++ +L+TCDP DI  TI  CK + 
Sbjct: 134 RDDIVCAGEPSLQNALDTALNYLKPMPSHGSKEILSVFGSLTTCDPSDINATIINCKAAN 193

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
           +R S I L+AE+ I K L + TGG ++V LD+ HFKE++     PPP+  +   A+LIKM
Sbjct: 194 VRASFISLTAEVRIYKELTKVTGGDFNVILDDVHFKEILSSQLEPPPSATQMD-ASLIKM 252

Query: 179 GFPQRAGEGS------ISICSCHKEV---KVG-VGYTCPRCKARVCELPTDCRICGLQLV 228
           GFP   G+ +      + +C CH E    K+   G+ CP+C A+ CELP +C  CGL LV
Sbjct: 253 GFPCHTGDDTSDPRSGLGLCMCHLENNPPKISHTGFLCPQCSAKYCELPVECVSCGLTLV 312

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+ PFD+          ++S   CF C + L    N+     ACP+C
Sbjct: 313 SAPHLARSYHHLFPLPPFDQQI------SSSQSIRECFACMRGL----NRESKVFACPQC 362

Query: 289 KKHFCLECDIYIHESLHNCPGCESLR 314
            K++C++CDI+IHE+LH+CPGC S R
Sbjct: 363 HKNYCIDCDIFIHETLHSCPGCASTR 388


>gi|426246343|ref|XP_004016954.1| PREDICTED: general transcription factor IIH subunit 2 [Ovis aries]
          Length = 395

 Score =  276 bits (707), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 145/322 (45%), Positives = 204/322 (63%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R S+IGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +  +  +LI+MG
Sbjct: 195 RVSIIGLSAEVRVCTALARETGGTYHVILDESHYKELLTHHVSPPPASSN-SECSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL   +  N  R  C+ CQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL---EEHNGER-FCYACQGEL----KDQHVYV-CSVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385


>gi|84370085|ref|NP_001033619.1| general transcription factor IIH subunit 2 [Bos taurus]
 gi|122142864|sp|Q2TBV5.1|TF2H2_BOVIN RecName: Full=General transcription factor IIH subunit 2; AltName:
           Full=General transcription factor IIH polypeptide 2
 gi|83638634|gb|AAI09594.1| General transcription factor IIH, polypeptide 2, 44kDa [Bos taurus]
 gi|296475883|tpg|DAA17998.1| TPA: general transcription factor IIH subunit 2 [Bos taurus]
 gi|440911341|gb|ELR61023.1| General transcription factor IIH subunit 2 [Bos grunniens mutus]
          Length = 395

 Score =  276 bits (706), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 145/322 (45%), Positives = 204/322 (63%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKSLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R S+IGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +  +  +LI+MG
Sbjct: 195 RVSIIGLSAEVRVCTALARETGGTYHVILDESHYKELLTHHVSPPPASSN-SECSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL   +  N  R  C+ CQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL---EEHNGER-FCYACQGEL----KDQHVYV-CSVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385


>gi|449514090|ref|XP_004174671.1| PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH
           subunit 2 [Taeniopygia guttata]
          Length = 410

 Score =  276 bits (706), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 145/322 (45%), Positives = 202/322 (62%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIGL+  K   A  +T+L G+ + H+ AL   
Sbjct: 90  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGLIVTKSKRAEKMTELSGNSKKHVTALKKA 149

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + CSG+ SL N+L+L    L  +P +  REVL++ S+L+TCDP +I + I+  K  K+
Sbjct: 150 VDMNCSGEPSLYNSLNLAMQTLKHMPGHTSREVLVVLSSLTTCDPANIYDLIKCLKAVKV 209

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+M H  PPPA +  +  +LI+MG
Sbjct: 210 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLMHHVSPPPA-SSTSECSLIRMG 268

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+         G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 269 FPQHTIASLSDQDAKPSFSMVQLENNSDPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 328

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F EV PL       +    C GCQ  +        +Y+ C  C
Sbjct: 329 SAPHLARSYHHLFPLDAFQEV-PL----EEYKGERYCHGCQGEM----KDQNVYI-CKVC 378

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC+ECD+++H+SLH CPGC
Sbjct: 379 QNAFCVECDLFVHDSLHCCPGC 400


>gi|391326656|ref|XP_003737828.1| PREDICTED: general transcription factor IIH subunit 2-like
           [Metaseiulus occidentalis]
          Length = 390

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 141/320 (44%), Positives = 204/320 (63%), Gaps = 23/320 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RPSR+  V   ++ FV  F+D+NP+SQ+ ++   +  A  ++DL G+P  H + L  +
Sbjct: 76  DLRPSRIQCVLHLLDDFVDRFYDENPISQLAIIATCNKRAEKISDLSGNPLKHKEGLAKI 135

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                +G+ SLQN++++   +L  +P++  REV+I+  +L+TCDP DI ETI++ K+  +
Sbjct: 136 KDRLPTGEPSLQNSINMAAAVLQHMPAHSSREVVIILGSLTTCDPTDINETIEEAKKLGL 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVIGL+AE+++C+ L + TGGSY + +DE H +EL+  H  PPPA+     + ++KMG
Sbjct: 196 RCSVIGLAAEVYVCRRLTEITGGSYHIVVDEDHLRELLANHLLPPPALNNIECS-MVKMG 254

Query: 180 FP--QRAGEGSISICSCHKEVKVGV-------GYTCPRCKARVCELPTDCRICGLQLVSS 230
           FP  +   E + ++C CH E            G+ CP+CK++ CELP +CRICGL LVS+
Sbjct: 255 FPFHKNEQEDTPALCQCHLEAAATESVHITTGGFLCPQCKSKYCELPAECRICGLTLVSA 314

Query: 231 PHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKK 290
           PHLARSYHHLFP+  F E T L   D R     TCF CQ    SS     +Y  C KCK+
Sbjct: 315 PHLARSYHHLFPLDNFKE-THLLKGDTR-----TCFSCQ----SSVEGQNMYT-CEKCKE 363

Query: 291 HFCLECDIYIHESLHNCPGC 310
            FCL+CDI+IH +LH CPGC
Sbjct: 364 EFCLDCDIFIHATLHLCPGC 383


>gi|348543161|ref|XP_003459052.1| PREDICTED: general transcription factor IIH subunit 2-like
           [Oreochromis niloticus]
          Length = 392

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 150/322 (46%), Positives = 197/322 (61%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +P+R+    K +EAFV E+FDQNP+SQ+G++T K+  A  LTDL G+P+ HI ALM  
Sbjct: 75  DLKPNRLTSTLKLMEAFVDEYFDQNPISQVGIITTKNKRAEKLTDLAGNPKKHITALMKA 134

Query: 62  LG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
           +   C G+ SL N+L+L    L  +P +  RE+LI+ S+L+TCDP +I E I+  K  K+
Sbjct: 135 VDTVCVGEPSLYNSLNLAIQTLKHMPGHTSREILIILSSLTTCDPANIYELIKTLKSLKV 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGGSY V LDESHFKEL+M H  PPPA +  +  +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLTRETGGSYHVILDESHFKELLMLHIKPPPA-SSSSECSLIRMG 253

Query: 180 FPQRA------GEGSISICSCHKEVKVGV-----GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ         +   S    H +   G      GY CP+C A   ELP +C++CGL LV
Sbjct: 254 FPQHTIASVTDQDAKPSFSMSHLDSSSGPGLSLGGYFCPQCHATYTELPVECKVCGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARS+HHLFP+  F E     L   R      C  CQ  L     K     +CP C
Sbjct: 314 SAPHLARSFHHLFPLQAFIESPVENLQGDR-----FCQACQAEL-----KDKSVFSCPSC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
              FC+ECD++IHESLH CP C
Sbjct: 364 HSVFCVECDLFIHESLHCCPCC 385


>gi|170035936|ref|XP_001845822.1| TFIIH basal transcription factor complex p44 subunit [Culex
           quinquefasciatus]
 gi|167878421|gb|EDS41804.1| TFIIH basal transcription factor complex p44 subunit [Culex
           quinquefasciatus]
          Length = 411

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 138/324 (42%), Positives = 203/324 (62%), Gaps = 22/324 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    K +E F+ EFFDQNP+SQ+G++ +K   A  +T+LGGS   HIKA+  +
Sbjct: 76  DLKPTRLLCTLKLLEIFIEEFFDQNPISQLGVIAMKAKRAEKITELGGSCRKHIKAVGNL 135

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            KL  +G+ SLQN L+L    L  +PS+  RE+L++  +L+TCDP DI  TI   K   I
Sbjct: 136 NKLNLTGEPSLQNGLELALKTLKMVPSHASREILVVMGSLTTCDPTDIHITIDALKSEGI 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSV+ LSAE+ +CK LC +TGG Y   LD+SHFK+ +++H  PP A  +   + +IKMG
Sbjct: 196 RCSVVSLSAEIRVCKFLCTETGGVYGAVLDDSHFKDQLLQHIDPPQAGNQQEFS-MIKMG 254

Query: 180 FPQ-RAGEGS---ISICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQLVS 229
           FP  +  EG    +++C CH +           GY CP+C ++ CELP +C  CGL L S
Sbjct: 255 FPHGKTEEGKDPPLTMCMCHIDSTDEPSKLTSGGYHCPQCYSKYCELPVECTACGLTLAS 314

Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCK 289
           +PHLARSYHHLFP+  + E+       P  +  + C+GCQ+ L        +Y  C  C+
Sbjct: 315 APHLARSYHHLFPVPHYQEL-------PFQQQATNCYGCQK-LFGETTDKTIY-QCATCR 365

Query: 290 KHFCLECDIYIHESLHNCPGCESL 313
           + FC++CDI++HE++H+C GC ++
Sbjct: 366 QFFCIDCDIFVHETMHSCVGCTTI 389


>gi|363744781|ref|XP_003643123.1| PREDICTED: general transcription factor IIH subunit 2-like [Gallus
           gallus]
          Length = 395

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 144/322 (44%), Positives = 202/322 (62%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQ+GL+  K   A  +T+L G+P+ HI AL   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQMGLIVTKSKRAEKMTELSGNPKKHIAALKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L+L    L  +P +  REVLI++S+L+TCDP +I + I+  K  KI
Sbjct: 135 VDMNCQGEPSLYNSLNLAMQTLKHMPGHTSREVLIVFSSLTTCDPANIYDLIKCLKAVKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDE+H+KEL+M H  PPPA +  +  +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLTRETGGTYHVILDETHYKELLMHHVSPPPA-SSTSECSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       +  +   GY CP+C+A+  ELP +C+ICGL LV
Sbjct: 254 FPQHTTASLSDQDAKPSFSMAQLENNSEPCLTLDGYFCPQCRAKYSELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL       +    C GCQ  +        +Y+ C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL----EEYQGERYCQGCQAEI----KDQNVYI-CKTC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           K  FC+ECD+++H+SLH CPGC
Sbjct: 364 KNAFCVECDVFVHDSLHCCPGC 385


>gi|366992932|ref|XP_003676231.1| hypothetical protein NCAS_0D02890 [Naumovozyma castellii CBS 4309]
 gi|342302097|emb|CCC69870.1| hypothetical protein NCAS_0D02890 [Naumovozyma castellii CBS 4309]
          Length = 485

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 148/323 (45%), Positives = 200/323 (61%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    F+ EFFDQNP+SQIG+V +++G+A+ ++ + G+P+ HI AL  +
Sbjct: 164 DLRPTRHAMMIQYAIDFIHEFFDQNPISQIGIVIMRNGLAHLVSQVSGNPQDHIDALKAI 223

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K   +G+ SLQNAL++ +GLL  +P++  REVLI++ +LS+ DPGDI +TI    + KI
Sbjct: 224 RKQEPAGNPSLQNALEMARGLLLPVPAHCTREVLIIFGSLSSTDPGDIHQTINSLVQEKI 283

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  VIGLSA++ ICK LC+ T       Y + LDE+HFKEL  E   P P         L
Sbjct: 284 RVKVIGLSAQVAICKELCKATNYGDDSFYKILLDETHFKELFDEAVTPLPVNKINKGFTL 343

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + S C+CH ++  G GY CP C ++VC LPT C  C L L+ S HLAR
Sbjct: 344 VKMGFPTRIFEDTPSFCACHSKLVYG-GYFCPSCHSKVCSLPTVCPCCDLMLILSTHLAR 402

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
           SYHHL P+  F EV P     P       CF CQ    +    K G  +      C  CK
Sbjct: 403 SYHHLMPLKTFVEV-PATETFP----TENCFSCQLVFPILKNQKTGKLLTSSRYRCDSCK 457

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           K FC++CD++IHE LHNCPGCES
Sbjct: 458 KDFCIDCDVFIHEILHNCPGCES 480


>gi|116812559|ref|NP_001070896.1| general transcription factor IIH subunit 2 [Rattus norvegicus]
 gi|182701382|sp|A0JN27.1|TF2H2_RAT RecName: Full=General transcription factor IIH subunit 2; AltName:
           Full=Basic transcription factor 2 44 kDa subunit;
           Short=BTF2 p44; AltName: Full=General transcription
           factor IIH polypeptide 2; AltName: Full=TFIIH basal
           transcription factor complex p44 subunit
 gi|116487735|gb|AAI26098.1| General transcription factor IIH, polypeptide 2 [Rattus norvegicus]
          Length = 396

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 145/324 (44%), Positives = 204/324 (62%), Gaps = 27/324 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKTAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDE+H+KEL+  H  PPPA +  +  +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDETHYKELLARHVSPPPA-SSGSECSLIRMG 253

Query: 180 FPQRA--------GEGSISIC-----SCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
           FPQ           + S S+      S    + +G GY CP+C+A+ CELP +C+ICGL 
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG-GYFCPQCRAKYCELPVECKICGLT 312

Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
           LVS+PHLARSYHHLFP+  F E+ PL       +    C+GCQ  L        +YV C 
Sbjct: 313 LVSAPHLARSYHHLFPLDAFQEI-PL----EEYKGERFCYGCQGEL----KDQHVYV-CT 362

Query: 287 KCKKHFCLECDIYIHESLHNCPGC 310
            C+  FC++CD+++H+SLH CPGC
Sbjct: 363 VCRNVFCVDCDVFVHDSLHCCPGC 386


>gi|308805072|ref|XP_003079848.1| basic transcription factor 2, 44kD subunit-related (ISS)
           [Ostreococcus tauri]
 gi|116058305|emb|CAL53494.1| basic transcription factor 2, 44kD subunit-related (ISS)
           [Ostreococcus tauri]
          Length = 414

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 145/314 (46%), Positives = 192/314 (61%), Gaps = 8/314 (2%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DFRP+R++VV     AFVREFF+QNPLSQ+G++  ++GVA  LT+L GSPE+H  AL   
Sbjct: 101 DFRPNRLSVVGHCAAAFVREFFNQNPLSQLGIIVARNGVAERLTELSGSPEAHCAALRNA 160

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           L  SGD SLQN L+  +  L  IPSYG REVL + S+L+TCDPG++   IQ  K +K+R 
Sbjct: 161 LDASGDFSLQNTLNAARTSLKSIPSYGSREVLYIMSSLATCDPGNVWTEIQATKAAKVRV 220

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           SV+ ++AE+ + + L ++TGG+Y V+++  H  +LIM HAPPPP   +   + L++MGFP
Sbjct: 221 SVVAVAAELHVARRLSEETGGTYGVSMNADHLDDLIMAHAPPPPLAEDATKSCLVQMGFP 280

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
           Q+      ++      V     Y CPRC  R+ ELP+ C +C L LVSSPHLARSYHHLF
Sbjct: 281 QKKHVSKDALI-----VGTRGDYVCPRCSGRIDELPSQCTVCRLTLVSSPHLARSYHHLF 335

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
           P+  F E      N     S      C  + L+S     L   C  C   FC  CDI+IH
Sbjct: 336 PVPAFKEYAS--RNVSAKESGIVALEC-SACLASITDASLASECEHCSNVFCFACDIFIH 392

Query: 302 ESLHNCPGCESLRH 315
           E LHNCP C    H
Sbjct: 393 ERLHNCPHCSCATH 406


>gi|149059192|gb|EDM10199.1| general transcription factor II H, polypeptide 2 (predicted),
           isoform CRA_b [Rattus norvegicus]
 gi|149059193|gb|EDM10200.1| general transcription factor II H, polypeptide 2 (predicted),
           isoform CRA_b [Rattus norvegicus]
          Length = 398

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 145/324 (44%), Positives = 204/324 (62%), Gaps = 27/324 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKTAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDE+H+KEL+  H  PPPA +  +  +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDETHYKELLARHVSPPPA-SSGSECSLIRMG 253

Query: 180 FPQRA--------GEGSISIC-----SCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
           FPQ           + S S+      S    + +G GY CP+C+A+ CELP +C+ICGL 
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG-GYFCPQCRAKYCELPVECKICGLT 312

Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
           LVS+PHLARSYHHLFP+  F E+ PL       +    C+GCQ  L        +YV C 
Sbjct: 313 LVSAPHLARSYHHLFPLDAFQEI-PL----EEYKGERFCYGCQGEL----KDQHVYV-CT 362

Query: 287 KCKKHFCLECDIYIHESLHNCPGC 310
            C+  FC++CD+++H+SLH CPGC
Sbjct: 363 VCRNVFCVDCDVFVHDSLHCCPGC 386


>gi|395825440|ref|XP_003785941.1| PREDICTED: general transcription factor IIH subunit 2-like
           [Otolemur garnettii]
          Length = 395

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 142/322 (44%), Positives = 203/322 (63%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHIASLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSIAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R S+IGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  +  PPPA +  +  +LI+MG
Sbjct: 195 RVSIIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHYVSPPPASSN-SECSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C++CGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNSEPGLTLGGYFCPQCRAKYCELPVECKVCGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL      +     C+GCQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL----EEHHGERFCYGCQGDL----KDQHVYV-CNAC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385


>gi|157115005|ref|XP_001652513.1| btf [Aedes aegypti]
 gi|108877047|gb|EAT41272.1| AAEL007056-PA [Aedes aegypti]
          Length = 408

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 141/326 (43%), Positives = 203/326 (62%), Gaps = 26/326 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R     K +E F+ EFFDQNP+SQ+G++ +K   A  +++LGGS   H+KA+ G 
Sbjct: 76  DLKPTRFLCTLKLLEIFIEEFFDQNPISQLGVIAMKAKRAEKISELGGSCRKHVKAVAGL 135

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            KL  +G+ SLQN LDL    L  +P++  REVL++  +L+TCDP DI  TI+  K   I
Sbjct: 136 SKLNLTGEPSLQNGLDLALKTLKMVPAHASREVLVVMGSLTTCDPTDIHITIEALKAEGI 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA--IAEFAIANLIK 177
           RCSV+ LSAE+ +CK LC +TGG Y   LD++HFK+ +++H  PP A    EF   +LIK
Sbjct: 196 RCSVVSLSAEIRVCKFLCTETGGVYGAVLDDAHFKDQLLQHIDPPQAGNQQEF---SLIK 252

Query: 178 MGFPQ-RAGEGS---ISICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQL 227
           MGFP  +  EG    +++C CH +           GY CP+C ++ C+LP +C  CGL L
Sbjct: 253 MGFPHGKTEEGKDPPLTMCMCHIDSTDEPSKLTSGGYHCPQCYSKYCDLPVECAACGLTL 312

Query: 228 VSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPK 287
            S+PHLARSYHHLFP+  F E++            S C+GCQ+    S +K      C  
Sbjct: 313 ASAPHLARSYHHLFPVPHFQELS-------FEHQASICYGCQKPFTESTDK--TVYQCGT 363

Query: 288 CKKHFCLECDIYIHESLHNCPGCESL 313
           C++ FC++CDI+IHE++H+C GC ++
Sbjct: 364 CQQVFCIDCDIFIHETMHSCVGCTTI 389


>gi|426197578|gb|EKV47505.1| hypothetical protein AGABI2DRAFT_204710 [Agaricus bisporus var.
           bisporus H97]
          Length = 452

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 141/332 (42%), Positives = 212/332 (63%), Gaps = 28/332 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGG---SPE---SHI 55
           D RP+R  ++ +    F+ E+FDQNPL QIG+V ++ G+   + DL G   +P+   S+I
Sbjct: 124 DMRPTRFDLMLQYAREFILEWFDQNPLGQIGVVGMRAGLGERIGDLSGESGNPQEVLSYI 183

Query: 56  KALMGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCK 115
           K    KL  +G+ SLQNA+++ +  +S +P++  RE++I++ +L+TCDPG+I ET++ C 
Sbjct: 184 KE-RHKLEPTGEPSLQNAIEMAKASMSHLPTHSSREIIIIFGSLTTCDPGNIHETLETCV 242

Query: 116 ESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP---------A 166
           ++K+R SV+ L+AEM IC+ L   TGG + VA++E HFK+L+ E  PPP          A
Sbjct: 243 KNKVRISVVALAAEMKICRELSDKTGGQFGVAMNEGHFKDLLFELVPPPAQRALTRTMGA 302

Query: 167 IAEFAIANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGL 225
            +    A+L+ MGFP R  E S  ++C CH E+K   G+ CPRC A+VC++PTDC +CGL
Sbjct: 303 PSASLSADLMMMGFPTRLPEASAPTLCVCHAELK-SQGFLCPRCLAKVCDVPTDCHVCGL 361

Query: 226 QLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP---GLY 282
            +VSSPHLARSYHHLFP+ P+D V  L   +   +    C GC +   ++ + P   G+ 
Sbjct: 362 MIVSSPHLARSYHHLFPVKPYDAVPNL---EDIPQPSDACHGCAKLFPTTTSIPIGEGVS 418

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGC 310
           V     CP C+  FC +CD+++HE +H CPGC
Sbjct: 419 VVGRYRCPDCRHDFCSDCDVFVHEVIHCCPGC 450


>gi|156387918|ref|XP_001634449.1| predicted protein [Nematostella vectensis]
 gi|156221532|gb|EDO42386.1| predicted protein [Nematostella vectensis]
          Length = 400

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 138/318 (43%), Positives = 194/318 (61%), Gaps = 15/318 (4%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +P+R++  AK +E F+ E+FDQNP+SQ+GL+  K+  A  L++L G+P+ HI  +   
Sbjct: 79  DLKPNRLSCSAKLLENFITEYFDQNPISQLGLIITKNKRAEKLSELSGNPKLHISTIQSA 138

Query: 62  LG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
               C G+ SLQNAL L    L  +P +  REVL+L  +L++CDPGDI ET++  K   +
Sbjct: 139 CSKPCVGEPSLQNALVLAMQSLKHMPGHVSREVLVLMGSLTSCDPGDITETVKSLKNMNV 198

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCS++GL+AEM +CK +C  T GSY V LDE HFK+L+MEH  PP A     +  L  + 
Sbjct: 199 RCSIVGLAAEMRVCKQICSSTNGSYRVVLDERHFKDLLMEHVIPPTATVRICLV-LWYVI 257

Query: 180 FPQRAGEGSISICSCHKEVK-----VGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           F         S+   H + K        GY CP+C+++ C+LP +C++CGL LVS+PHLA
Sbjct: 258 FESYCYIHVFSVIRSHMDNKNIKEFTTKGYFCPQCRSKYCDLPVECKVCGLTLVSAPHLA 317

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCL 294
           RSY HLFP+  + EV     N P        + CQ   ++   K     +CP+C+  FCL
Sbjct: 318 RSYQHLFPLQQYTEV-----NVPAQIPGQPPWTCQACQITLTEKTA--CSCPECRHVFCL 370

Query: 295 ECDIYIHESLHNCPGCES 312
           +CDIYIHESLH CPGC +
Sbjct: 371 DCDIYIHESLHTCPGCTA 388


>gi|365990367|ref|XP_003672013.1| hypothetical protein NDAI_0I02010 [Naumovozyma dairenensis CBS 421]
 gi|343770787|emb|CCD26770.1| hypothetical protein NDAI_0I02010 [Naumovozyma dairenensis CBS 421]
          Length = 520

 Score =  273 bits (699), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 147/325 (45%), Positives = 198/325 (60%), Gaps = 22/325 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    FV EFFDQNP+SQ+G++ +K+G+A+ ++ +GG+P+ HI  L  +
Sbjct: 199 DLRPNRHAMMIQYAIDFVHEFFDQNPISQLGILVMKNGLAHLVSQVGGNPQDHIDVLKSI 258

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K   +G+ SLQNAL++ +GLL  +P++  REVLI++ +LS+ DPGDI +TI    + KI
Sbjct: 259 RKQEPAGNPSLQNALEMARGLLLPVPAHCTREVLIIFGSLSSTDPGDIHQTINSLVQEKI 318

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  VIGLSA++ ICK LC  T       Y + LDE+HFKEL  E   P P         L
Sbjct: 319 RVKVIGLSAQVAICKELCMATNYGDDSFYKILLDETHFKELFDEAVTPLPVNKINKGFTL 378

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E S + CSCH ++  G GY CP C ++VC LPT C  C L L+ S HLAR
Sbjct: 379 VKMGFPTRIFEDSPTFCSCHSKLVYG-GYFCPSCHSKVCSLPTVCPCCDLMLILSTHLAR 437

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRST--CFGCQQSLLSSGNK------PGLYVACPK 287
           SYHHL P+  F EV       P N +  T  CF CQ    +  N+            C  
Sbjct: 438 SYHHLMPLKTFLEV-------PSNETFPTENCFSCQLKFPTLKNQRNGSLLTSSRYRCSS 490

Query: 288 CKKHFCLECDIYIHESLHNCPGCES 312
           C   FC++CD++IHE LHNCPGCES
Sbjct: 491 CHSDFCIDCDVFIHEILHNCPGCES 515


>gi|449541165|gb|EMD32151.1| hypothetical protein CERSUDRAFT_119123 [Ceriporiopsis subvermispora
           B]
          Length = 445

 Score =  273 bits (698), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 139/329 (42%), Positives = 205/329 (62%), Gaps = 24/329 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  ++ +    F+ E+FDQNPL QIG+V ++ G+   + ++ G+P+  +KA+   
Sbjct: 119 DMRPTRFDLMLQYAREFIIEWFDQNPLGQIGIVGMRSGLGERIGEMTGNPQEVLKAISER 178

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            KL   G+ SLQNA+D+ +  +S +P +  RE+ I++ +L+TCDPG+I +T++ C + +I
Sbjct: 179 HKLEPGGEPSLQNAIDMARSSMSHLPVHSSREITIIFGSLTTCDPGNIHDTLEDCIKDRI 238

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIA--------EFA 171
           R S++ L+AEM IC+ LC  TGG + VAL+E HFK+L+ E  PPP A A           
Sbjct: 239 RISIVALAAEMKICRDLCDKTGGQFGVALNEGHFKDLLFELIPPPVARALARTGGAPANP 298

Query: 172 IANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSS 230
            A+L+ MGFP R  + S   +C CH ++K   G+ CPRC+A+VC++PTDC ICGL +VSS
Sbjct: 299 AADLMIMGFPTRLPDTSPPGLCVCHSKLK-SEGFLCPRCRAKVCDVPTDCDICGLMIVSS 357

Query: 231 PHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL---------LSSGNKPGL 281
           PHLARSYHHLFP+  +  V  + L D  N + S C GC  +           S G  P  
Sbjct: 358 PHLARSYHHLFPVKAYQAV--ISLEDTPNPAPS-CHGCSTAFREAAPAMANTSDGMSPLG 414

Query: 282 YVACPKCKKHFCLECDIYIHESLHNCPGC 310
              CP C   FC +CD+++H+ +H CPGC
Sbjct: 415 RYRCPDCHNDFCTDCDVFVHDVVHCCPGC 443


>gi|156844855|ref|XP_001645488.1| hypothetical protein Kpol_1004p2 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156116152|gb|EDO17630.1| hypothetical protein Kpol_1004p2 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 497

 Score =  273 bits (697), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 148/323 (45%), Positives = 197/323 (60%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +    F  EFFDQNP+SQ+G+V +K+G+A  LT + G+P+ HI AL  +
Sbjct: 177 DLRPNRNVMSIQYAIDFTHEFFDQNPISQLGIVIMKNGLAQLLTPVSGNPQEHIDALKSI 236

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            +    G+ SLQNAL++ +GLL  +P++  REVLI++  LS+ DPGDI +TIQ   E KI
Sbjct: 237 RRQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGGLSSTDPGDIHQTIQSLVEEKI 296

Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  VIGLSA++ IC+ LC+ T  +    Y + LDE+HFKEL  E   P P         L
Sbjct: 297 RVKVIGLSAQVAICQELCKATNYNDDSFYRIILDETHFKELFDEAVTPLPVNKINKGFTL 356

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E S S C+CH ++  G GY CP C+ +VC LP  C  C L L+ S HLAR
Sbjct: 357 VKMGFPTRVFEDSPSFCACHSKLVYG-GYFCPNCQNKVCSLPIVCPCCDLMLILSTHLAR 415

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
           SYHHL P+  F EV P   N P +     CF CQ    +    K G  +      C  CK
Sbjct: 416 SYHHLMPLKTFSEV-PHTENFPTD----NCFSCQIRFPVLRNQKTGELLTSSRYRCTDCK 470

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           + FC++CD++IHE LHNCPGCE+
Sbjct: 471 QDFCIDCDVFIHEVLHNCPGCEA 493


>gi|327263012|ref|XP_003216315.1| PREDICTED: general transcription factor IIH subunit 2-like [Anolis
           carolinensis]
          Length = 406

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 143/322 (44%), Positives = 202/322 (62%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQ+G++  K   A  LT+L G+P  HI AL   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQVGIIITKGKRAEKLTELAGNPRKHITALKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C+G+ SL NAL+L    L  +P +  REVL+++S+L+TCDP +I + I+  K  KI
Sbjct: 135 VDMTCAGEPSLYNALNLATQTLKHMPGHTSREVLVIFSSLTTCDPSNIYDLIKSLKMLKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH++EL+M H  PPPA +  + ++LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYRELLMHHVSPPPATSG-SESSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+  ELP +C++CGL LV
Sbjct: 254 FPQHTIASLTDQDAKPSFSMAHLENSSEPGLTLGGYFCPQCRAKYSELPVECKVCGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F EV PL            C GC   +        +Y+ C +C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEV-PL----EEYTGERYCQGCHGGI----GDQHVYI-CKEC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CDI+ HE+LH CPGC
Sbjct: 364 QSGFCVDCDIFAHETLHCCPGC 385


>gi|321470853|gb|EFX81828.1| hypothetical protein DAPPUDRAFT_317251 [Daphnia pulex]
          Length = 405

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 147/339 (43%), Positives = 205/339 (60%), Gaps = 34/339 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R   V K +E FV EFFD NP+SQ+G++T K   +  ++ L G+ + HI+AL  M
Sbjct: 77  DLKPTRFLCVLKLLELFVHEFFDLNPISQLGILTTKVKRSEQVSALAGNQKKHIEALQQM 136

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
               C G+ SLQN+L+     L  +P++  REVL+L+ +L+TCDPGDI +TI+  KE+ I
Sbjct: 137 KDTSCEGEPSLQNSLERAMNGLKNMPAHSSREVLVLFGSLTTCDPGDIQKTIKSLKENNI 196

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R S+IGL+AE+ IC+ + + TGG+Y+V LD+ H KELI+     PPA+A    A+L+KMG
Sbjct: 197 RVSIIGLAAEVRICREIAKRTGGTYNVLLDDHHLKELILNQV-QPPAVAGSMEASLVKMG 255

Query: 180 FP-----------QRAGEGSISICSCHKEVKV------GVGYTCPRCKARVCELPTDCRI 222
           FP             A + + + C CH E +         GY CP+C +R CELP +C+ 
Sbjct: 256 FPGGKPGSSDGASDSAADHAPAYCLCHIENETSCLMGNNSGYNCPQCGSRYCELPVECKQ 315

Query: 223 CGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLY 282
           CGL LVS+PHLARSYHHLFPI PF E T +          + CF C +        P +Y
Sbjct: 316 CGLTLVSAPHLARSYHHLFPIKPFIERTEV-------PEMTHCFACAKPF--GELDPNVY 366

Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGCESLRHSNPIVA 321
             C  C +  CL+CD++I E+LH CPGC     S+PI A
Sbjct: 367 -ECENCNQIVCLDCDLFIRETLHTCPGCA----SDPITA 400


>gi|409080664|gb|EKM81024.1| hypothetical protein AGABI1DRAFT_71744 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 452

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 141/332 (42%), Positives = 211/332 (63%), Gaps = 28/332 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGG---SPE---SHI 55
           D RP+R  ++ +    F+ E+FDQNPL QIG+V ++ G+   + DL G   +P+   S+I
Sbjct: 124 DMRPTRFDLMLQYAREFILEWFDQNPLGQIGVVGMRAGLGERIGDLSGESGNPQEVLSYI 183

Query: 56  KALMGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCK 115
           K    KL  +G+ SLQNA+++ +  +S +P++  RE++I++ +L+TCDPG+I ET++ C 
Sbjct: 184 KE-RHKLEPTGEPSLQNAIEMAKASMSHLPTHSSREIIIIFGSLTTCDPGNIHETLETCV 242

Query: 116 ESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP---------A 166
           ++K+R SV+ L+AEM IC+ L   TGG + VA++E HFK+L+ E  PPP          A
Sbjct: 243 KNKVRISVVALAAEMKICRELSDKTGGQFGVAMNEGHFKDLLFELVPPPAQRALTRTMGA 302

Query: 167 IAEFAIANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGL 225
            +    A+L+ MGFP R  E S  ++C CH E+K   G+ CPRC A+VC++PTDC +CGL
Sbjct: 303 PSASLSADLMMMGFPTRLPEASAPTLCVCHAELK-SQGFLCPRCLAKVCDVPTDCHVCGL 361

Query: 226 QLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP---GLY 282
            +VSSPHLARSYHHLFP+ P+D V  L   +   +    C GC +    + + P   G+ 
Sbjct: 362 MIVSSPHLARSYHHLFPVKPYDAVPNL---EDIPQPSDACHGCAKLFPMTTSIPIGEGVS 418

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGC 310
           V     CP C+  FC +CD+++HE +H CPGC
Sbjct: 419 VVGRYRCPDCRHDFCSDCDVFVHEVIHCCPGC 450


>gi|403214446|emb|CCK68947.1| hypothetical protein KNAG_0B05150 [Kazachstania naganishii CBS
           8797]
          Length = 469

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 144/323 (44%), Positives = 199/323 (61%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    FV EFFD+NP+SQIG+V +++G+A+ ++ + G+P+ H+ AL  M
Sbjct: 149 DLRPNRHAMMIQYAIDFVHEFFDENPISQIGIVIMRNGLAHLVSSVSGNPQDHLDALKHM 208

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL+L +GLL  +PS+  REVLI++ +LS+ DPGDI +TI    + KI
Sbjct: 209 RKQEPKGNPSLQNALELARGLLLPVPSHSTREVLIIFGSLSSTDPGDIHQTIASLVDEKI 268

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  VIGLSA++ +CK LC+ T       Y + LDE+HFK+L  E   P P         L
Sbjct: 269 RVKVIGLSAQVAVCKELCKLTNFGDDSFYKILLDETHFKDLFDEAVTPLPVNKINKGFTL 328

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E   + CSCH ++  G GY CP C  +VC LPT C  C L L+ S HLAR
Sbjct: 329 VKMGFPTRVFESVPTFCSCHSKLIYG-GYFCPNCHNKVCSLPTVCPCCDLTLILSTHLAR 387

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQS--LLSSGNKPGLYVA----CPKCK 289
           SYHHL P+  F+EV     +         CF CQ+   +L +     L  +    C  C+
Sbjct: 388 SYHHLMPLKTFNEV-----DAKETFPTENCFACQRRFPILKNHKSDQLLTSSRYRCDDCR 442

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           + FC++CD++IHE LHNCPGCES
Sbjct: 443 QDFCIDCDVFIHEVLHNCPGCES 465


>gi|410083393|ref|XP_003959274.1| hypothetical protein KAFR_0J00710 [Kazachstania africana CBS 2517]
 gi|372465865|emb|CCF60139.1| hypothetical protein KAFR_0J00710 [Kazachstania africana CBS 2517]
          Length = 469

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 144/325 (44%), Positives = 202/325 (62%), Gaps = 22/325 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RPSR A++ +    FV EFFDQNP+SQIG+V +++G+A  ++ + G+P+ HI AL  +
Sbjct: 148 DLRPSRHAMMIQYAIDFVHEFFDQNPISQIGIVIMRNGLAQLVSQVSGNPQDHIDALKSI 207

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +PS+  REVLI++ +LS+ DPGD+ +TI      +I
Sbjct: 208 RKQEPKGNPSLQNALEMARGLLLPVPSHCTREVLIIFGSLSSTDPGDVHQTINSLVRERI 267

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  VIGLSA++ ICK LC+ T       Y++ LDE+HFKEL  E   P P         L
Sbjct: 268 RTRVIGLSAQVAICKELCKQTNYGDESYYNILLDETHFKELFDEAVTPLPVNKINKGFTL 327

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + + CSCH ++  G GY CP C +++C LPT C  C L L+ S HLAR
Sbjct: 328 VKMGFPTRIFEDTPTFCSCHSKLIYG-GYFCPNCHSKICSLPTVCPCCDLMLILSTHLAR 386

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRST--CFGCQQSL-LSSGNKPGLYVA-----CPK 287
           SYHHL P+  F EV       P + +  T  C+ CQ+   +   +K G  +      C  
Sbjct: 387 SYHHLMPLKTFKEV-------PSDETFKTENCYSCQKKFPILKNHKTGNLLTSSRYRCSD 439

Query: 288 CKKHFCLECDIYIHESLHNCPGCES 312
           C++ FC++CD++IHE LHNCPGCE+
Sbjct: 440 CQEDFCIDCDLFIHEVLHNCPGCEA 464


>gi|302916479|ref|XP_003052050.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256732989|gb|EEU46337.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 469

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 150/341 (43%), Positives = 202/341 (59%), Gaps = 36/341 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D  P+R  +  +   AFVREFF+QNP+SQ+G++ ++DGVA  ++D+GG+P  H++ L G 
Sbjct: 111 DLLPTRYRLTLRYAAAFVREFFEQNPISQLGIIGMRDGVAVRISDVGGNPAEHLERLKGL 170

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G+ SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI ET+      +I
Sbjct: 171 EDQDPQGNPSLQNALEMCRGALFHAPSHGTREVLIIYGALLSSDPGDIHETVGNLITDRI 230

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
           R S++GLSA++ IC  LC  T       Y+VA+DE HF+EL +    PP    AE + A+
Sbjct: 231 RVSIVGLSAQVAICADLCSRTNAGDESQYNVAMDEVHFRELFLAATTPPVTRTAEQSTAS 290

Query: 175 LIKMGFPQRA--GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
           L+ MGFP R     G  S C+CH       GY C RC ARVC LP +C  CGL L+ S H
Sbjct: 291 LLMMGFPSRTLVAGGDTSYCACHNR-PFREGYLCTRCSARVCRLPAECPACGLTLILSTH 349

Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRS-TCFGCQQSL-------------LSSGNK 278
           LARSYHHLFP+  + EV P       N +RS  CF CQ                  +G K
Sbjct: 350 LARSYHHLFPLRNWVEV-PWA-----NAARSAACFSCQAPFPEPPKTKAPDKPKEDAGPK 403

Query: 279 PGLYV------ACPKCKKHFCLECDIYIHESLHNCPGCESL 313
           P   V      AC  C++HFC++CD++ HE +HNCPGC+S+
Sbjct: 404 PAKGVSESGRYACEVCRQHFCIDCDVFAHEVIHNCPGCQSI 444


>gi|392569356|gb|EIW62529.1| TFIIH basal transcription factor complex subunit SSL1 [Trametes
           versicolor FP-101664 SS1]
          Length = 452

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 134/331 (40%), Positives = 206/331 (62%), Gaps = 26/331 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  ++ +    FV E+FDQNPL QIG+  ++ G+   + ++ G+P+  +KA+   
Sbjct: 124 DMRPTRFDLMLQYAREFVTEWFDQNPLGQIGIAGMRGGLGERIGEMSGNPQDVLKAISER 183

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            +L  +G+ SLQNA+++ +  +S +P++  RE+L+++ +L+TCDPG+I +T+ +C + +I
Sbjct: 184 HRLEPNGEPSLQNAIEMARSSMSHLPTHSSREILVIFGSLTTCDPGNIHDTMDECVKDRI 243

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP---------AIAEF 170
           R S++ L+AEM IC+ LC  TGG +SV ++E HFK+L+ E  PPP           +   
Sbjct: 244 RISIVALAAEMKICRDLCDKTGGQFSVVMNEGHFKDLLFELIPPPAQRAVARTGSGVTAN 303

Query: 171 AIANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
             A+L+ MGFP R  + S  S+C CH ++K   G+ CPRC A+VC++PTDC ICGL +VS
Sbjct: 304 PAADLMIMGFPMRLPDTSPPSLCVCHSQMK-SEGFLCPRCLAKVCDVPTDCDICGLMIVS 362

Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSS----------GNKP 279
           SPHLARSYHHLFP+  +  V    L D  N S+  C GC  +   +          G  P
Sbjct: 363 SPHLARSYHHLFPVKAYQAVN--TLEDTLNPSKQ-CHGCSVAFRGTPSVVAGASGEGMSP 419

Query: 280 GLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
                CP+C   FC +CD+++H+ +H CPGC
Sbjct: 420 FGRYRCPECHNDFCTDCDVFVHDVVHCCPGC 450


>gi|58331891|ref|NP_001011081.1| general transcription factor IIH, polypeptide 2, 44kDa [Xenopus
           (Silurana) tropicalis]
 gi|54038231|gb|AAH84471.1| general transcription factor IIH, polypeptide 2, 44kDa [Xenopus
           (Silurana) tropicalis]
 gi|115530769|emb|CAL49300.1| general transcription factor IIH polypeptide 4, 52kDa [Xenopus
           (Silurana) tropicalis]
          Length = 393

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 144/322 (44%), Positives = 199/322 (61%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIGL+  ++  A  LT+L G+P  H+ AL   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGLIVTRNKRAEKLTELAGNPRQHLNALKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C+G+ SL N+L+L    L  +P +  RE+L+++S+L+TCDP +I + I+  K SKI
Sbjct: 135 VDMNCNGEPSLYNSLNLALQTLKHMPGHTSREILVIFSSLTTCDPTNIYDIIKCLKASKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG Y V LDESH+KEL+M H  PPPA +  +  +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLTRETGGVYHVILDESHYKELLMHHVIPPPA-SSTSECSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+  ELP +C++C L LV
Sbjct: 254 FPQHTMGCLSDQDAKPSFSMAHLDNTSEPGLTLGGYFCPQCRAKYSELPVECKVCRLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F EV+              C GC   L     K      CP C
Sbjct: 314 SAPHLARSYHHLFPLDAFKEVSL-----EEYEGERYCRGCDGEL-----KDQQVYICPVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC+ECD++IHESLH CPGC
Sbjct: 364 QCVFCIECDLFIHESLHCCPGC 385


>gi|255711256|ref|XP_002551911.1| KLTH0B02794p [Lachancea thermotolerans]
 gi|238933289|emb|CAR21473.1| KLTH0B02794p [Lachancea thermotolerans CBS 6340]
          Length = 470

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 144/323 (44%), Positives = 195/323 (60%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A+  +    FV EFFDQNP+SQIG+V +++ +A  ++ + G+P+ HI AL  +
Sbjct: 149 DLRPNRYAMTIQYAINFVHEFFDQNPISQIGIVCMRNDLAQLVSQVSGNPQDHIDALKAI 208

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+SSLQNAL++ +GLL  +P++  REVLI++ ALS+ DPGDI +TI    + +I
Sbjct: 209 RKQEPKGNSSLQNALEMARGLLLHVPAHCTREVLIIFGALSSTDPGDIHQTIASLVQERI 268

Query: 120 RCSVIGLSAEMFICKHLCQDT----GGSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  VIGLSA++ ICK LC+ T       Y V L+E HFK+L  E   P P         L
Sbjct: 269 RVRVIGLSAQVAICKELCKQTNYGDNSFYGVILNEVHFKDLFAEAVTPLPVNKINKGFTL 328

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E   S C+CH ++  G GY CP CK++VC LPT C  C L L+ S HLAR
Sbjct: 329 VKMGFPTRVYEDMPSFCTCHSKLMYG-GYFCPNCKSKVCSLPTVCPCCDLMLILSTHLAR 387

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYV------ACPKCK 289
           SYHHL P+  F EV     +        TC+ CQ+      N+    +       C  C 
Sbjct: 388 SYHHLMPLKTFQEV-----DVAETFPTDTCYSCQKRFPRLKNQKTQELLTSSRYRCADCH 442

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           + FC++CD++IHE LHNCPGCES
Sbjct: 443 QDFCIDCDVFIHEILHNCPGCES 465


>gi|367014707|ref|XP_003681853.1| hypothetical protein TDEL_0E03990 [Torulaspora delbrueckii]
 gi|359749514|emb|CCE92642.1| hypothetical protein TDEL_0E03990 [Torulaspora delbrueckii]
          Length = 464

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 200/323 (61%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    FV EFFDQNP+SQ+G+VT+++G+A  ++ + G+P+ HI AL  +
Sbjct: 143 DLRPNRHAMIVQYAIDFVHEFFDQNPISQLGIVTMRNGLAQLISQISGNPQDHIDALKLI 202

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P++  +EVLI++ +LS+ DPGDI +TI    +  I
Sbjct: 203 RKQEPKGNPSLQNALEMSRGLLLPVPAHCTKEVLIVFGSLSSTDPGDIHQTINSLVQESI 262

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  VIGLSA++ ICK LC+ T       Y V LDE+H KEL  +   P P         L
Sbjct: 263 RVKVIGLSAQVAICKELCKATNYGDESYYQVPLDETHCKELFDQAVVPLPVNKINKGFTL 322

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           ++MGFP R  E S + C+CH ++  G GY CP C+++VC LPT C  C L L+ S HLAR
Sbjct: 323 VRMGFPTRIFEDSPTFCACHSQLMYG-GYFCPNCQSKVCSLPTVCPCCDLMLILSTHLAR 381

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGN-KPGLYVA-----CPKCK 289
           SYHHL P+  F EV P     P +     CF CQ    +  N K G  +      C  CK
Sbjct: 382 SYHHLMPLKTFTEV-PSVEEFPTD----NCFSCQIRFPTIKNHKTGELLTSSRYRCESCK 436

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           K FC++CD+YIHE LHNCPGCES
Sbjct: 437 KDFCIDCDVYIHEILHNCPGCES 459


>gi|452846260|gb|EME48193.1| hypothetical protein DOTSEDRAFT_69968 [Dothistroma septosporum
           NZE10]
          Length = 449

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 145/347 (41%), Positives = 203/347 (58%), Gaps = 37/347 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +     +AFVREF +QNP+SQ+G+V +KDG+A  ++D+ G+PE HIK L   
Sbjct: 95  DLRPTRYLLTLTYAKAFVREFIEQNPISQLGIVGMKDGLAIRVSDMSGNPEDHIKILNEY 154

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K   +G+ SLQN LD+ +  L   PS+G REVL++  AL T DPGDI +TI+ C + +I
Sbjct: 155 AKKEPTGNPSLQNGLDMARAALYHTPSHGTREVLVVMGALLTSDPGDIHDTIKACIKDRI 214

Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIAEFA---- 171
           R S+IGL+A+M IC  +C+ T       Y+VA+D+  F+EL+M+   PP   +  A    
Sbjct: 215 RVSIIGLAAQMHICAEICRKTNQGDESCYNVAIDDVDFRELLMKSTTPPVMRSTDADALR 274

Query: 172 --IANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
              ANL+ MGFP R  E   ++C+CH ++  G GY C RC A+VC LP  C  C L L+ 
Sbjct: 275 LNQANLLMMGFPSRIVEDKATLCACHGQLTRG-GYLCSRCGAKVCSLPATCPTCDLTLIL 333

Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-----------QQSLLSSGNK 278
           S HLARSYHHLFP+  + EV+    +  R +    C+GC           + + L   +K
Sbjct: 334 STHLARSYHHLFPLQNWVEVS---WDRARRKGSVQCYGCLAGFPRVPVEYETTQLEVRDK 390

Query: 279 PGLYVA----------CPKCKKHFCLECDIYIHESLHNCPGCESLRH 315
           P    A          C  C+ HFC++CD+  HE +HNCPGC S  H
Sbjct: 391 PRRRRAEGASESSRYECETCQNHFCIDCDVAAHEMIHNCPGCMSNAH 437


>gi|158293200|ref|XP_314533.4| AGAP010559-PA [Anopheles gambiae str. PEST]
 gi|157016844|gb|EAA09916.5| AGAP010559-PA [Anopheles gambiae str. PEST]
          Length = 421

 Score =  270 bits (691), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 137/329 (41%), Positives = 206/329 (62%), Gaps = 22/329 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKA---L 58
           D +P+R     K +E F+ EFFDQNP+SQ+G++ +K   A  +++LGG+   HIKA   L
Sbjct: 76  DLKPTRFICSLKLLELFIEEFFDQNPISQLGVIAMKAKRAEKISELGGTSRKHIKAVHAL 135

Query: 59  MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
              +   G+ SLQN L+L    L  IP +  RE+L++  +L++CDP D+  TI+  K   
Sbjct: 136 TNGVPLVGEPSLQNGLELALKTLRMIPQHASREILVIMGSLTSCDPNDVHLTIENLKTEG 195

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA--IAEFAIANLI 176
           +RCSV+ LSAE+ ICK LC +TGG +   LD++H+K+ +++H  PP A    EF+   LI
Sbjct: 196 VRCSVLSLSAEIRICKFLCTETGGLFGAVLDDAHYKDQLLQHIDPPQAGNQQEFS---LI 252

Query: 177 KMGFP----QRAGEGSISICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQ 226
           KMGFP    +   E  +++C CH +           GY CP+C ++ CELP +C  CGL 
Sbjct: 253 KMGFPHGKTESGKEPPLTMCMCHIDSVDEPAKLTSGGYHCPQCYSKYCELPVECSACGLT 312

Query: 227 LVSSPHLARSYHHLFPIAPFDE--VTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA 284
           L S+PHLARSYHHLFP+  F+E  +  + + +PR+   + C+ CQ++L S  +K  +   
Sbjct: 313 LASAPHLARSYHHLFPVPHFNELPLEQVQVQEPRDPPVTNCYACQKTLASGTDK--MVYE 370

Query: 285 CPKCKKHFCLECDIYIHESLHNCPGCESL 313
           C  CK+ FC++CDI+IHE+LH+C GC ++
Sbjct: 371 CDACKQVFCIDCDIFIHETLHSCVGCTTI 399


>gi|6323033|ref|NP_013105.1| Ssl1p [Saccharomyces cerevisiae S288c]
 gi|417813|sp|Q04673.1|SSL1_YEAST RecName: Full=Suppressor of stem-loop protein 1; AltName:
           Full=General transcription and DNA repair factor IIH
           subunit SSL1; Short=TFIIH subunit SSL1
 gi|2696|emb|CAA78992.1| suppressor of stem-loop [Saccharomyces cerevisiae]
 gi|1360294|emb|CAA97527.1| SSL1 [Saccharomyces cerevisiae]
 gi|190406046|gb|EDV09313.1| suppressor of stem-loop protein 1 [Saccharomyces cerevisiae
           RM11-1a]
 gi|256272337|gb|EEU07321.1| Ssl1p [Saccharomyces cerevisiae JAY291]
 gi|259147994|emb|CAY81243.1| Ssl1p [Saccharomyces cerevisiae EC1118]
 gi|285813427|tpg|DAA09323.1| TPA: Ssl1p [Saccharomyces cerevisiae S288c]
 gi|323308054|gb|EGA61307.1| Ssl1p [Saccharomyces cerevisiae FostersO]
 gi|323332472|gb|EGA73880.1| Ssl1p [Saccharomyces cerevisiae AWRI796]
 gi|365764293|gb|EHN05817.1| Ssl1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
 gi|384214|prf||1905312A SSL1 gene
          Length = 461

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    FV EFFDQNP+SQ+G++ +++G+A  ++ + G+P+ HI AL  +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P++  REVLI++ +LST DPGDI +TI      KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  V+GLSA++ ICK LC+ T       Y + LDE+H KEL  E   P P         L
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 319

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + + CSCH ++  G GY CP C ++VC LPT C  C L L+ S HLAR
Sbjct: 320 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 378

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
           SYHHL P+  F EV        + RS   CF CQ    +   +K G  +      C  CK
Sbjct: 379 SYHHLMPLKTFAEVP----TTEKFRSED-CFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 433

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           + FC++CD++IHE LHNCPGCES
Sbjct: 434 QEFCVDCDVFIHEILHNCPGCES 456


>gi|207343219|gb|EDZ70748.1| YLR005Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 371

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    FV EFFDQNP+SQ+G++ +++G+A  ++ + G+P+ HI AL  +
Sbjct: 50  DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 109

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P++  REVLI++ +LST DPGDI +TI      KI
Sbjct: 110 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 169

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  V+GLSA++ ICK LC+ T       Y + LDE+H KEL  E   P P         L
Sbjct: 170 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 229

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + + CSCH ++  G GY CP C ++VC LPT C  C L L+ S HLAR
Sbjct: 230 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 288

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
           SYHHL P+  F EV        + RS   CF CQ    +   +K G  +      C  CK
Sbjct: 289 SYHHLMPLKTFAEVP----TTEKFRSED-CFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 343

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           + FC++CD++IHE LHNCPGCES
Sbjct: 344 QEFCVDCDVFIHEILHNCPGCES 366


>gi|323336560|gb|EGA77826.1| Ssl1p [Saccharomyces cerevisiae Vin13]
          Length = 461

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    FV EFFDQNP+SQ+G++ +++G+A  ++ + G+P+ HI AL  +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P++  REVLI++ +LST DPGDI +TI      KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  V+GLSA++ ICK LC+ T       Y + LDE+H KEL  E   P P         L
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 319

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + + CSCH ++  G GY CP C ++VC LPT C  C L L+ S HLAR
Sbjct: 320 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 378

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
           SYHHL P+  F EV        + RS   CF CQ    +   +K G  +      C  CK
Sbjct: 379 SYHHLMPLKTFAEVP----TTEKFRSED-CFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 433

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           + FC++CD++IHE LHNCPGCES
Sbjct: 434 QEFCVDCDVFIHEILHNCPGCES 456


>gi|323303957|gb|EGA57737.1| Ssl1p [Saccharomyces cerevisiae FostersB]
          Length = 461

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    FV EFFDQNP+SQ+G++ +++G+A  ++ + G+P+ HI AL  +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P++  REVLI++ +LST DPGDI +TI      KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  V+GLSA++ ICK LC+ T       Y + LDE+H KEL  E   P P         L
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 319

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + + CSCH ++  G GY CP C ++VC LPT C  C L L+ S HLAR
Sbjct: 320 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 378

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
           SYHHL P+  F EV        + RS   CF CQ    +   +K G  +      C  CK
Sbjct: 379 SYHHLMPLKTFAEVP----TTEKFRSED-CFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 433

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           + FC++CD++IHE LHNCPGCES
Sbjct: 434 QEFCVDCDVFIHEILHNCPGCES 456


>gi|401837686|gb|EJT41582.1| SSL1-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 462

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 195/323 (60%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RPSR A++ +    FV EFFDQNP+SQ+G++ +++G+A  ++ + G+P+ HI AL  +
Sbjct: 141 DLRPSRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 200

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P++  REVLI++ +LST DPGDI +TI      KI
Sbjct: 201 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 260

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  V+GLSA++ ICK LC+ T       Y + LDE+H KEL  E   P P         L
Sbjct: 261 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 320

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + + CSCH ++  G GY CP C ++VC LPT C  C L L+ S HLAR
Sbjct: 321 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 379

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
           SYHHL P+  F EV               CF CQ    +   +K G  +      C  CK
Sbjct: 380 SYHHLMPLKTFAEVPT-----TETFRSEDCFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 434

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           + FC++CDI+IHE LHNCPGCES
Sbjct: 435 QEFCVDCDIFIHEILHNCPGCES 457


>gi|395814688|ref|XP_003780876.1| PREDICTED: general transcription factor IIH subunit 2-like
           [Otolemur garnettii]
          Length = 395

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 141/322 (43%), Positives = 202/322 (62%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHIASLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSIAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R S+IGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  +  PPPA    +  +LI+MG
Sbjct: 195 RVSIIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHYVSPPPASLN-SECSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C++CGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNSEPGLTLGGYFCPQCRAKYCELPVECKVCGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL      +     C+GCQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL----EEHHGERFCYGCQGDL----KDQHVYV-CNAC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++++SLH CPGC
Sbjct: 364 QNVFCVDCDVFVYDSLHCCPGC 385


>gi|312384212|gb|EFR28989.1| hypothetical protein AND_02404 [Anopheles darlingi]
          Length = 1093

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 137/326 (42%), Positives = 202/326 (61%), Gaps = 19/326 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R     K +E F+ EFFDQNP+SQ+G++ +K   A  +T+L G+   H+KA+   
Sbjct: 76  DLKPTRFICSLKLLELFIEEFFDQNPISQLGVIAMKAKRAEKITELSGTARKHVKAVHAL 135

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
               +G+ SLQN L+L    L  +PS+  REVL++  +L+TCDP DI  TI+  K   IR
Sbjct: 136 NTQLTGEPSLQNGLELALKTLRLVPSHASREVLVIMGSLTTCDPTDIHVTIETLKTEGIR 195

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA--IAEFAIANLIKM 178
           CSV+ LSAE+ ICK+LC +TGG+Y+  LD++H+K+ +++H  PP A    EF   +LIKM
Sbjct: 196 CSVVSLSAEIRICKYLCTETGGTYAAVLDDAHYKDQLLQHIDPPQAGNQQEF---SLIKM 252

Query: 179 GFP----QRAGEGSISICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQLV 228
           GFP    +   E  +++C CH +           GY CP+C ++ CELP +C  CGL L 
Sbjct: 253 GFPHGKTETGKEPPLTMCMCHIDSTDEPAKLTSGGYHCPQCYSKYCELPVECAACGLTLA 312

Query: 229 SSPHLARSYHHLFPIAPFDEVT-PLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPK 287
           S+PHLARS+HHLFP+  F E+T      + R    + C+ CQ+ L  + +K      C  
Sbjct: 313 SAPHLARSFHHLFPVPHFKELTLEQVQQEAREVPVNYCYACQKPLTVTTDKT--VYECDA 370

Query: 288 CKKHFCLECDIYIHESLHNCPGCESL 313
           C+  FC++CDI+IHE+LH+C GC ++
Sbjct: 371 CRMVFCIDCDIFIHETLHSCVGCTTI 396


>gi|392297981|gb|EIW09080.1| Ssl1p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 461

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    FV EFFDQNP+SQ+G++ +++G+A  ++ + G+P+ HI AL  +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P++  REVLI++ +LST DPGDI +TI      KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  V+GLSA++ ICK LC+ T       Y + LDE+H KEL  E   P P         L
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 319

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + + CSCH ++  G GY CP C ++VC LPT C  C L L+ S HLAR
Sbjct: 320 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 378

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
           SYHHL P+  F EV        + RS   CF CQ    +   +K G  +      C  CK
Sbjct: 379 SYHHLMPLKTFAEVP----TTEKFRSED-CFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 433

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           + FC++CD++IHE LHNCPGCES
Sbjct: 434 QEFCVDCDVFIHEILHNCPGCES 456


>gi|51013237|gb|AAT92912.1| YLR005W [Saccharomyces cerevisiae]
          Length = 461

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    FV EFFDQNP+SQ+G++ +++G+A  ++ + G+P+ HI AL  +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P++  REVLI++ +LST DPGDI +TI      KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  V+GLSA++ ICK LC+ T       Y + LDE+H KEL  E   P P         L
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 319

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + + CSCH ++  G GY CP C ++VC LPT C  C L L+ S HLAR
Sbjct: 320 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 378

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
           SYHHL P+  F EV        + RS   CF CQ    +   +K G  +      C  CK
Sbjct: 379 SYHHLMPLKTFAEVP----TTEKFRSED-CFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 433

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           + FC++CD++IHE LHNCPGCES
Sbjct: 434 QEFCVDCDVFIHEILHNCPGCES 456


>gi|151941173|gb|EDN59551.1| transcription initiation factor TFIIH subunit [Saccharomyces
           cerevisiae YJM789]
          Length = 461

 Score =  270 bits (690), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 197/323 (60%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    FV EFFDQNP+SQ+G++ +++G+A  ++ + G+P+ HI AL  +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P++  REVLI++ +LST DPGDI +TI      KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  V+GLSA++ ICK LC+ T       Y + LDE+H KEL  E   P P         L
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 319

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + + CSCH ++  G GY CP C ++VC LPT C  C L L+ S HLAR
Sbjct: 320 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 378

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
           SYHHL P+  F EV P     P       CF CQ    +   +K G  +      C  CK
Sbjct: 379 SYHHLMPLKTFAEV-PTTEKFP----SEDCFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 433

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           + FC++CD++IHE LHNCPGCES
Sbjct: 434 QEFCVDCDVFIHEILHNCPGCES 456


>gi|349579731|dbj|GAA24892.1| K7_Ssl1p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 461

 Score =  270 bits (690), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    FV EFFDQNP+SQ+G++ +++G+A  ++ + G+P+ HI AL  +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQGHIDALKSI 199

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P++  REVLI++ +LST DPGDI +TI      KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  V+GLSA++ ICK LC+ T       Y + LDE+H KEL  E   P P         L
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 319

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + + CSCH ++  G GY CP C ++VC LPT C  C L L+ S HLAR
Sbjct: 320 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 378

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
           SYHHL P+  F EV        + RS   CF CQ    +   +K G  +      C  CK
Sbjct: 379 SYHHLMPLKTFAEVP----TTEKFRSED-CFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 433

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           + FC++CD++IHE LHNCPGCES
Sbjct: 434 QEFCVDCDVFIHEILHNCPGCES 456


>gi|452986441|gb|EME86197.1| hypothetical protein MYCFIDRAFT_39800 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 465

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 143/344 (41%), Positives = 198/344 (57%), Gaps = 37/344 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  +      AF+RE+F+QNP+SQ+G++ ++DGVA  ++D+ G+P  HI A+   
Sbjct: 107 DLRPTRHLLTISNTVAFIREYFEQNPISQLGILGMRDGVALRVSDMSGNPNVHIAAVRAL 166

Query: 62  LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            G    G +SLQNALD+ +  L   PS+G REV+I+  ALS+ DPGDI +TI+ C + KI
Sbjct: 167 RGTDPKGSASLQNALDMARAALYHTPSHGTREVVIILGALSSSDPGDIHDTIKACIKDKI 226

Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIAEFAIAN- 174
           R ++IGL+A+M IC  + + T       Y+VA+DE H++EL M    PP   A    A  
Sbjct: 227 RVNIIGLAAQMQICAEIARKTNQGATNCYNVAVDEVHYRELFMGITTPPVVRATDTEAQK 286

Query: 175 -----LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
                L+ MGFP R  EG  ++C+CH  +  G GY C RCKA+VC LP  C  C L L+ 
Sbjct: 287 RNQAALLMMGFPSRIVEGKATLCACHGNLTRG-GYLCSRCKAKVCNLPATCPTCDLTLIL 345

Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQ---SLLSS----------- 275
           S HLARSYHHLFP+  +DEV+       + +    C+GC     SL SS           
Sbjct: 346 STHLARSYHHLFPLQNWDEVS---WERAQEKGSIACYGCHSAFPSLWSSTDETNGASHTR 402

Query: 276 -------GNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
                  G        C  C+ HFC++CD++ HE  HNCPGC+S
Sbjct: 403 TRPKRAEGASESSRYECRTCQNHFCIDCDVFCHEVAHNCPGCQS 446


>gi|19075300|ref|NP_587800.1| transcription factor TFIIH complex subunit Ssl1
           [Schizosaccharomyces pombe 972h-]
 gi|26400388|sp|O74995.1|TFH47_SCHPO RecName: Full=TFIIH basal transcription factor complex p47 subunit;
           AltName: Full=Suppressor of stem-loop protein 1 homolog;
           Short=SSL1 homolog
 gi|3406059|gb|AAC29144.1| TFIIH subunit p47 [Schizosaccharomyces pombe]
 gi|3560161|emb|CAA20673.1| transcription factor TFIIH complex subunit Ssl1
           [Schizosaccharomyces pombe]
          Length = 421

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 142/321 (44%), Positives = 195/321 (60%), Gaps = 17/321 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF   R  +  K    FV EFF+QNP+SQ+ ++ V DG+A+ +TDL G+P+SHI+ L   
Sbjct: 103 DFHHKRFDLQIKYASEFVLEFFEQNPISQLSIIGVMDGIAHRITDLHGNPQSHIQKLKSL 162

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
             CSG+ SLQNAL++ +  LS I S+G REVLI++ ++ + DPGDI +TI       IR 
Sbjct: 163 RDCSGNFSLQNALEMARASLSHIASHGTREVLIIFGSILSSDPGDIFKTIDALVHDSIRV 222

Query: 122 SVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHA-PPPPAIAEFAIANLI 176
            ++GL+AE+ ICK +C  T  S    Y V + E HF+EL++E   PP    A+   A+L+
Sbjct: 223 RIVGLAAEVAICKEICNKTNSSTKNAYGVVISEQHFRELLLESTIPPATDSAKTTDASLV 282

Query: 177 KMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
            MGFP +  E   S+C+CH     G G+ CPRCKA+VC LP +C  C L L+ S HLARS
Sbjct: 283 MMGFPSKVVEQLPSLCACHSIPSRG-GFHCPRCKAKVCTLPIECPSCSLVLILSTHLARS 341

Query: 237 YHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP------GLYVACPKCKK 290
           YHHLFP+  + E+ P   N P++   + CF CQ         P       +  ACP CK 
Sbjct: 342 YHHLFPLKNWSEI-PWSAN-PKS---THCFACQLPFPKPPVSPFDESTSSMRYACPSCKN 396

Query: 291 HFCLECDIYIHESLHNCPGCE 311
           HFCL+CD++ HE LH C GC+
Sbjct: 397 HFCLDCDVFAHEQLHECYGCQ 417


>gi|392578803|gb|EIW71930.1| hypothetical protein TREMEDRAFT_72606 [Tremella mesenterica DSM
           1558]
          Length = 462

 Score =  270 bits (689), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 139/350 (39%), Positives = 204/350 (58%), Gaps = 42/350 (12%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DFRPSR  +  + + ++V E+FDQNPL QIG++ ++D ++  L  +GG+P+  +  L  K
Sbjct: 112 DFRPSRFELTLQYMRSYVVEWFDQNPLGQIGMIIMRDRLSEVLIPMGGNPQEILSVLTDK 171

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPG---DIMETIQKCKE 116
             L  SG+ SLQN L + +G +S +PS    E+L+++S++ST DP    +I + +     
Sbjct: 172 RKLEPSGEPSLQNGLVMAKGGMSHLPSTSSLEILVVFSSISTADPDGPINIHQVLSDLSS 231

Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP----------- 165
           S++R S+I LS E+ IC+ +CQ TGG + VA+DE H +EL+ E  PPP            
Sbjct: 232 SRVRTSIISLSGEIKICRQICQRTGGKFGVAMDEDHLRELLWETIPPPAQTQSSNPTTLG 291

Query: 166 ---AIAEFA------------IANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCK 210
              A+A               + +L+ MGFP R   G  ++C CH  +K G GY CPRC 
Sbjct: 292 VRNALAAGGNRVDGDGSRPPPVGDLMVMGFPVRLPPGGETLCVCHGLLKKG-GYLCPRCG 350

Query: 211 ARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQ 270
           +++C++PTDC +CGL +VSSPHLARS+  LFP+A +D +T       R RS  TC+GC +
Sbjct: 351 SKLCDVPTDCEVCGLMVVSSPHLARSFWFLFPVAHYDALTVETTMGDRERSSPTCYGCDR 410

Query: 271 SL----------LSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
           S           +  G  P     C +C K FC ECD+YIH++LH CPGC
Sbjct: 411 SFKTMDETGSAHVVDGISPTGRYRCARCFKDFCSECDLYIHDTLHTCPGC 460


>gi|31418653|gb|AAH53382.1| General transcription factor II H, polypeptide 2 [Mus musculus]
          Length = 396

 Score =  270 bits (689), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 142/324 (43%), Positives = 202/324 (62%), Gaps = 27/324 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKTAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDE+H+KEL+  H  P PA +  +  +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDETHYKELLAHHVSPLPA-SSSSECSLIRMG 253

Query: 180 FPQRA--------GEGSISIC-----SCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
           FPQ           + S S+      S    + +G GY CP+C+A+ CELP +C+ICGL 
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG-GYFCPQCRAKYCELPVECKICGLT 312

Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
           LVS+PHLARSYHHLFP+  F E++         +    C+GCQ  L        +YV C 
Sbjct: 313 LVSAPHLARSYHHLFPLDAFQEISL-----EEYKGERFCYGCQGEL----KDQHVYV-CT 362

Query: 287 KCKKHFCLECDIYIHESLHNCPGC 310
            C+  FC++CD+++H+SLH CPGC
Sbjct: 363 VCQNVFCVDCDVFVHDSLHCCPGC 386


>gi|444318053|ref|XP_004179684.1| hypothetical protein TBLA_0C03620 [Tetrapisispora blattae CBS 6284]
 gi|387512725|emb|CCH60165.1| hypothetical protein TBLA_0C03620 [Tetrapisispora blattae CBS 6284]
          Length = 447

 Score =  270 bits (689), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 147/326 (45%), Positives = 199/326 (61%), Gaps = 22/326 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R ++  +    FV EFFDQNP+SQ+G++ +++G+A+ ++ + G+P+ HI +L  M
Sbjct: 124 DLRPNRYSMSLQYAIDFVHEFFDQNPISQMGVIIMRNGLAHLVSQISGNPQDHIDSLKAM 183

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P +  REVLI++  LS+ DPGDI +TI    + KI
Sbjct: 184 RKQEPKGNPSLQNALEMARGLLLPVPPHCTREVLIVFGGLSSTDPGDIHQTITSLVQEKI 243

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R +VIGLSA++ IC+ LC  T      SY V LDE HFKEL  +   P P         L
Sbjct: 244 RATVIGLSAQVAICQELCTSTNYGDSDSYRVILDEVHFKELFSQAVTPLPVNKINKGFTL 303

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + + C+CH ++  G GY CP CK++VC LP  C  C L L+ S HLAR
Sbjct: 304 VKMGFPTRVFEDTPTFCTCHHKLIYG-GYFCPNCKSKVCSLPIVCPCCDLMLILSTHLAR 362

Query: 236 SYHHLFPIAPFDEVTPLCLND---PRNRSRSTCFGCQQSLLS-SGNKPGLYVA-----CP 286
           SYHHL P+  F EV    + D   P N     CF CQ+       NK G  +      C 
Sbjct: 363 SYHHLMPLNTFTEVE--VVKDAAFPTN----NCFSCQKGFPHLVNNKTGELLTSSRYRCG 416

Query: 287 KCKKHFCLECDIYIHESLHNCPGCES 312
           +C K FC++CD+YIHE LHNCPGCES
Sbjct: 417 ECHKDFCIDCDVYIHEILHNCPGCES 442


>gi|3327882|dbj|BAA31745.1| SSL1 [Schizosaccharomyces pombe]
          Length = 392

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 142/321 (44%), Positives = 195/321 (60%), Gaps = 17/321 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF   R  +  K    FV EFF+QNP+SQ+ ++ V DG+A+ +TDL G+P+SHI+ L   
Sbjct: 74  DFHHKRFDLQIKYASEFVLEFFEQNPISQLSIIGVMDGIAHRITDLHGNPQSHIQKLKSL 133

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
             CSG+ SLQNAL++ +  LS I S+G REVLI++ ++ + DPGDI +TI       IR 
Sbjct: 134 RDCSGNFSLQNALEMARASLSHIASHGTREVLIIFGSILSSDPGDIFKTIDALVHDSIRV 193

Query: 122 SVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHA-PPPPAIAEFAIANLI 176
            ++GL+AE+ ICK +C  T  S    Y V + E HF+EL++E   PP    A+   A+L+
Sbjct: 194 RIVGLAAEVAICKEICNKTNSSTKNAYGVVISEQHFRELLLESTIPPATDSAKTTDASLV 253

Query: 177 KMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
            MGFP +  E   S+C+CH     G G+ CPRCKA+VC LP +C  C L L+ S HLARS
Sbjct: 254 MMGFPSKVVEQLPSLCACHSIPSRG-GFHCPRCKAKVCTLPIECPSCSLVLILSTHLARS 312

Query: 237 YHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP------GLYVACPKCKK 290
           YHHLFP+  + E+ P   N P++   + CF CQ         P       +  ACP CK 
Sbjct: 313 YHHLFPLKNWSEI-PWSAN-PKS---THCFACQLPFPKPPVSPFDESTSSMRYACPSCKN 367

Query: 291 HFCLECDIYIHESLHNCPGCE 311
           HFCL+CD++ HE LH C GC+
Sbjct: 368 HFCLDCDVFAHEQLHECYGCQ 388


>gi|395325355|gb|EJF57779.1| TFIIH basal transcription factor complex subunit SSL1 [Dichomitus
           squalens LYAD-421 SS1]
          Length = 448

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 134/327 (40%), Positives = 206/327 (62%), Gaps = 22/327 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D RP+R  ++ +    FV E+FDQNPL QIG+V ++ G+   + ++ G+P+  +KA++  
Sbjct: 124 DMRPTRFDLMLQYAREFVTEWFDQNPLGQIGIVGMRGGLGERIGEMSGNPQDVLKAILER 183

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            KL  +G+ SLQNA+++ +  +S +P+   RE+++++ +L+TCDPG+I +T+ +C + +I
Sbjct: 184 HKLLPNGEPSLQNAVEMARSSMSHLPTQSSREIVVIFGSLTTCDPGNIHDTLDECVKDRI 243

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP--AIAEFA---IAN 174
           R S++ L+AEM +C+ LC  TGG + VA++E HFK+L+ E  PPP   A+A       A+
Sbjct: 244 RISIVALAAEMKVCRELCDKTGGQFGVAMNEGHFKDLLFELIPPPAQRAVARTGGGTAAD 303

Query: 175 LIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
           L+ MGFP R  + S  S+C CH ++K   G+ CPRC A+VC++PTDC ICGL +VSSPHL
Sbjct: 304 LMIMGFPMRLPDTSPPSLCVCHSQMK-SEGFLCPRCLAKVCDVPTDCDICGLMIVSSPHL 362

Query: 234 ARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSS----------GNKPGLYV 283
           ARSYHHLFP+  +  VT L   D        C GC  +   +          G  P    
Sbjct: 363 ARSYHHLFPVKSYSAVTTL---DDTPEPSKRCHGCAVAFKETPSTMLGASGEGMSPYGRY 419

Query: 284 ACPKCKKHFCLECDIYIHESLHNCPGC 310
            C +C   FC +CD+++H+ +H CPGC
Sbjct: 420 RCLECHSDFCADCDVFVHDVVHCCPGC 446


>gi|73949709|ref|XP_535266.2| PREDICTED: general transcription factor IIH subunit 2 isoform 1
           [Canis lupus familiaris]
          Length = 395

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 148/322 (45%), Positives = 205/322 (63%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+ PF E+ PL   +  N  R  C+GCQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDPFQEI-PL---EEHNGER-FCYGCQGEL----KDQHVYV-CAVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385


>gi|351712123|gb|EHB15042.1| General transcription factor IIH subunit 2, partial [Heterocephalus
           glaber]
          Length = 392

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 145/322 (45%), Positives = 203/322 (63%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+A   K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLACTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTALARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDNSSEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+T         +    C+GCQ  L        +YV C +C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEMTL-----EEYKGERFCYGCQGQL----KDQHVYV-CAEC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385


>gi|358059749|dbj|GAA94518.1| hypothetical protein E5Q_01170 [Mixia osmundae IAM 14324]
          Length = 458

 Score =  269 bits (687), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 139/341 (40%), Positives = 206/341 (60%), Gaps = 37/341 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R+ +  K    FV E+FDQNP+SQ+ ++  KDG+A  ++ L G+P  H++AL  K
Sbjct: 122 DLRPTRLELTIKYAIEFVNEYFDQNPISQMAILVTKDGIAERISPLSGNPVDHVRALESK 181

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             L  SG+ SLQN L++ +  L+ +PS+G REV+I++ +L+TCDPG+I  T++   + ++
Sbjct: 182 RKLDPSGEPSLQNVLEMARSGLAHLPSHGSREVVIIFGSLTTCDPGNIHTTMEALVKDRV 241

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP----------AIAE 169
           R +++GLSAEM ICK +C+ T G Y V  +E+H+K+L+ E  PPP           + + 
Sbjct: 242 RVNIVGLSAEMSICKEVCKRTKGVYGVVTNEAHYKDLLFELVPPPATHRPEAGPDGSRSN 301

Query: 170 FAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
              A+L++MGFP  +     ++CSCH  +K   G+ CPRC +R+CE+PT+C IC L +VS
Sbjct: 302 NTGADLMQMGFPSLSSSTFGALCSCHSRLKT-TGFNCPRCASRLCEVPTECGICSLTVVS 360

Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSR-STCFGCQ-------QSLLSSGNKP-- 279
           SPHLARSY HLFP+  + EV       P   S  + CF C         S  ++G  P  
Sbjct: 361 SPHLARSYRHLFPVKNYLEVNN---GQPGEDSWPAECFACSLKFSTVAVSAATTGAPPTA 417

Query: 280 ----------GLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
                     G Y  CP+C  HFC++CD++ HE L  CPGC
Sbjct: 418 ASTASRISSTGRY-ECPRCHNHFCIDCDLHAHEVLGVCPGC 457


>gi|147906745|ref|NP_001085329.1| general transcription factor IIH, polypeptide 2, 44kDa [Xenopus
           laevis]
 gi|49256020|gb|AAH71091.1| MGC81060 protein [Xenopus laevis]
          Length = 395

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 144/322 (44%), Positives = 199/322 (61%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIGL+  ++  A  LT+L G+P  HI A+   
Sbjct: 75  DLKPNRLTCTLKLLEFFVEEYFDQNPISQIGLIVTRNKRAEKLTELAGNPRQHINAMKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + CSG+ SL N+L+L    L  +P +  RE+L+++S+L+TCDP +I + I+  K SK+
Sbjct: 135 VDMTCSGEPSLYNSLNLALQTLKHMPGHTSREILVIFSSLTTCDPTNIYDMIKCLKASKV 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG Y V LDESH+KEL+M H  PPPA +  +  +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLTRETGGVYHVILDESHYKELLMHHVIPPPA-SSSSECSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+CKA+  ELP +C++C L LV
Sbjct: 254 FPQHTMGCLSDQDAKPSFSMAHLDNTSEPGLTLGGYFCPQCKAKYSELPVECKVCRLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F EV     +  R      C GC   L     K      C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFKEVRLEEYDGER-----YCRGCDGEL-----KDQQVYVCTVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC+ECD+++HESLH CPGC
Sbjct: 364 QCVFCIECDLFVHESLHCCPGC 385


>gi|410948792|ref|XP_003981114.1| PREDICTED: general transcription factor IIH subunit 2 [Felis catus]
          Length = 366

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 143/309 (46%), Positives = 198/309 (64%), Gaps = 24/309 (7%)

Query: 15  VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG--KLGCSGDSSLQN 72
           +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L     + C G+ SL N
Sbjct: 59  LEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYN 118

Query: 73  ALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFI 132
           +L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KIR SVIGLSAE+ +
Sbjct: 119 SLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKATKIRVSVIGLSAEVRV 178

Query: 133 CKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRA-------- 184
           C  L ++TGG+Y V LDESH+KEL+  H  PPPA +  +  +LI+MGFPQ          
Sbjct: 179 CTVLARETGGTYHVILDESHYKELLTHHVSPPPA-SSGSECSLIRMGFPQHTIASLSDQD 237

Query: 185 GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
            + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LVS+PHLARSYHHLF
Sbjct: 238 AKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLF 297

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
           P+  F E+ PL   +  N  R  C+GCQ  L        +YV C  C+  FC++CD+++H
Sbjct: 298 PLDAFQEI-PL---EEHNGER-FCYGCQGEL----KDQHVYV-CAVCQNVFCVDCDVFVH 347

Query: 302 ESLHNCPGC 310
           +SLH CPGC
Sbjct: 348 DSLHCCPGC 356


>gi|392865317|gb|EAS31097.2| transcription factor ssl1 [Coccidioides immitis RS]
          Length = 471

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 141/334 (42%), Positives = 202/334 (60%), Gaps = 31/334 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV E+F+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 118 DVRPTRYLLTLRYAQEFVLEYFEQNPISQLGIIGMRDGLAVRISDMSGNPSEHILAIQAL 177

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                +G  SLQNAL++ +G L   PS+G REVLI++ AL + DPGDI +TI      KI
Sbjct: 178 RTRDPTGLPSLQNALEMARGALFHTPSHGTREVLIIFGALLSSDPGDIHKTITSLVTDKI 237

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
           R  V+GL+AE+ IC+ +C  T       Y VAL+E HF+EL+ME   PP   ++  A+ +
Sbjct: 238 RVGVVGLAAEVAICREICSKTNAGDDSGYGVALNEQHFRELMMEITTPPVTRSQKQAVNS 297

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E S S+C+CH +   G GY C RC +++C LP +C  CGL L+ S HLA
Sbjct: 298 LLMMGFPSRTVESSQSLCACHSKPSRG-GYLCSRCGSKICTLPAECPACGLTLILSTHLA 356

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRS--RSTCFGC----------QQSLLSSGNKPGLY 282
           RSYHHLFP+  + EV       P N++   S CF C           Q     G   G+ 
Sbjct: 357 RSYHHLFPLINWVEV-------PWNKASISSNCFACGNPFPPVPVRAQWETRGGTVKGMS 409

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           V+    C  C  HFC++CD++ HE +HNCPGC+S
Sbjct: 410 VSSRYECTVCHNHFCIDCDVFAHEIVHNCPGCQS 443


>gi|346716173|ref|NP_001231240.1| general transcription factor IIH subunit 2 [Sus scrofa]
          Length = 395

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 147/322 (45%), Positives = 205/322 (63%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C+G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCNGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+AR CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRARYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL   +  N  R  C+GCQ  L        +Y+ C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL---EEHNGER-FCYGCQGEL----KDQHVYI-CSVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QHVFCVDCDVFVHDSLHCCPGC 385


>gi|355693895|gb|AER99488.1| proteinral transcription factor IIH, polypeptide 2, 44kDa [Mustela
           putorius furo]
          Length = 393

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 145/322 (45%), Positives = 202/322 (62%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 74  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 133

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 134 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKSAKI 193

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 194 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 252

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 253 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 312

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL      +     C+GCQ  L        +YV C  C
Sbjct: 313 SAPHLARSYHHLFPLDAFQEI-PL----EEHNGEKFCYGCQGEL----KDQHVYV-CAVC 362

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 363 QNVFCVDCDVFVHDSLHCCPGC 384


>gi|332233766|ref|XP_003266076.1| PREDICTED: general transcription factor IIH subunit 2 isoform 1
           [Nomascus leucogenys]
          Length = 395

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 146/322 (45%), Positives = 202/322 (62%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRAGEG--------SISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ             S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTNASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+    L +  N  R  C+GCQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI----LLEEHNGER-FCYGCQGEL----KDQHVYV-CAVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385


>gi|402224004|gb|EJU04067.1| TFIIH basal transcription factor complex subunit SSL1 [Dacryopinax
           sp. DJM-731 SS1]
          Length = 440

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 133/331 (40%), Positives = 199/331 (60%), Gaps = 29/331 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RPSR  +       FV E+FDQNPL Q+G+V +++GV   + D+ G+P+  +K  + K
Sbjct: 116 DMRPSRFDLTLSYAREFVSEWFDQNPLGQMGVVGMRNGVGERIGDMSGNPQDVLKCFLDK 175

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             L  +G+ SLQN++++ +  ++ +P++  RE+LI++ +L+T DPG++++T+  C   +I
Sbjct: 176 TLLEPAGEPSLQNSIEMARASMNHLPTHSSREILIIFGSLTTVDPGNLLDTVDACVRDQI 235

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPP---------PAIAEF 170
           R S++ LSAEM +C+ +C+ TGG + VAL+E H+K+L+ E  PPP          A A  
Sbjct: 236 RISIVALSAEMKVCRDMCERTGGVFGVALNEGHYKDLLFELVPPPADKSPVSRLAAQAGE 295

Query: 171 AIANLIKMGFPQR-AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
             + L++MGFP R       S+C CH  ++   GY CPRC ++VC++PTDC ICGL +VS
Sbjct: 296 GQSELMQMGFPTRLPATAPPSLCVCHSRLR-SEGYICPRCNSKVCDVPTDCDICGLMIVS 354

Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP---------- 279
           SPHLARSYHHLFP   +  V  L       +  S C  C     ++   P          
Sbjct: 355 SPHLARSYHHLFPAPAYKAVLEL-----DGKEASACHACSLPFPTTARAPLQTEEGMSPH 409

Query: 280 GLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
           G Y  CPKCK  F  ECD+++H+ LH CPGC
Sbjct: 410 GRY-RCPKCKHDFDAECDLFVHDVLHVCPGC 439


>gi|328858838|gb|EGG07949.1| hypothetical protein MELLADRAFT_35246 [Melampsora larici-populina
           98AG31]
          Length = 455

 Score =  266 bits (681), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 138/339 (40%), Positives = 207/339 (61%), Gaps = 39/339 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R+ +     + FV EFFDQNP+SQ+ +   KD +A  L+ L G+P  H KAL  K
Sbjct: 124 DLRPNRLEMSLNYAQEFVTEFFDQNPISQMCIFITKDAIAERLSPLSGNPVDHHKALSNK 183

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             L  SG+ SLQNAL++ +  L+ +P +G +E L++  +L+TCDP DI +TI+K ++ ++
Sbjct: 184 RKLEPSGEPSLQNALEMARATLAHLPPHGSKETLVIMGSLTTCDPEDIHKTIEKLEKDRM 243

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAI------- 172
           R S++GLSAE+ ICK +C+ T G Y V ++E+HFKEL+ E  PPPP++   +        
Sbjct: 244 RVSIVGLSAEINICKEICRKTKGKYGVIMNETHFKELLFESIPPPPSLVNSSTSQSNRST 303

Query: 173 ----ANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
               ++L++MGFP +      S+C+CH ++K   G+ CPRC +++C++PTDC ICGL +V
Sbjct: 304 NTTSSDLMQMGFPTKLSNPIPSLCACHSKLK-SSGFICPRCGSKLCDVPTDCLICGLTVV 362

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRS---RSTCFGCQQSL------------- 272
           SSPHLARSY HLFP+A + E+      D  N      ++C+GC Q               
Sbjct: 363 SSPHLARSYRHLFPVANWKEI------DSENHEIECPTSCYGCSQVFSTSSTTTTTTTTT 416

Query: 273 -LSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
             +  N+   ++ C +CK  FC  CD++ HE L  CPGC
Sbjct: 417 TTTITNQSSKFM-CNRCKHIFCSTCDLF-HEQLGLCPGC 453


>gi|50288221|ref|XP_446539.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49525847|emb|CAG59466.1| unnamed protein product [Candida glabrata]
          Length = 439

 Score =  266 bits (681), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 142/323 (43%), Positives = 200/323 (61%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +Q   FV EFFDQNP+SQ+G+VT+++G+A  ++ + G+P+ HI AL  +
Sbjct: 118 DLRPNRHAMIIQQAIDFVHEFFDQNPISQMGIVTMRNGLAQLVSQVSGNPQDHIDALKSI 177

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P++  REVLI++ +LS+ DPGDI +TI    + +I
Sbjct: 178 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSSNDPGDIHQTIASLVQEQI 237

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  V+GLSAE+ ICK LC+ T       Y V LDE+H K+L  E   P P         L
Sbjct: 238 RVKVLGLSAEVAICKELCKQTNMGDTSFYQVLLDETHLKKLFDEAVTPLPVNKINRGFTL 297

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E   + C+CH ++  G GY CP C ++VC LPT C  C L L+ S HLAR
Sbjct: 298 VKMGFPTRIFEEQPTFCACHAKLIYG-GYFCPNCNSKVCSLPTVCPCCDLMLILSTHLAR 356

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQS--LLSSGNKPGLYVA----CPKCK 289
           S+HHL P+  F EV P   + P       C+ CQ    ++ S     +  +    C +CK
Sbjct: 357 SFHHLMPLKTFVEV-PAAESFP----TENCYSCQMKFPIIKSQKSNQMLTSSRYRCEECK 411

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
             +C++CD++IHESLH CPGCES
Sbjct: 412 NDYCIDCDVFIHESLHTCPGCES 434


>gi|296422371|ref|XP_002840734.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295636957|emb|CAZ84925.1| unnamed protein product [Tuber melanosporum]
          Length = 469

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 144/335 (42%), Positives = 208/335 (62%), Gaps = 32/335 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DFRP+R  +  +    FVRE+F+QNP+SQ+G++ ++DG+A  ++DL G+P  HI+A+ GK
Sbjct: 122 DFRPTRYLLSLRYACDFVREYFEQNPISQLGVLGMRDGLAVRISDLSGNPVDHIEAI-GK 180

Query: 62  LGCS---GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
           L      G  SLQNAL++ +  L   PS+G REV+I++ AL + DPGDI +TI    E K
Sbjct: 181 LRLDEGKGAPSLQNALEMSRAALFHAPSHGTREVVIVFGALLSSDPGDIHQTISHLVEDK 240

Query: 119 IRCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHA-PPPPAIAEFAIA 173
           IR SV+GLSA++ +C+ LC+ T     G Y VALD+ HF+EL+++   PP    A+ + +
Sbjct: 241 IRVSVVGLSAQVAVCRELCKRTNAGDEGVYGVALDDKHFRELLLDTTIPPVTRSAKMSAS 300

Query: 174 NLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
           +L+ MGFP R  E + S+C+CH  +  G GY C RC  +VC LP +C  CGL L+ S HL
Sbjct: 301 SLLMMGFPSRMVEKTASLCACHSTLTKG-GYLCSRCSTKVCALPMECPACGLTLILSTHL 359

Query: 234 ARSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGCQ---------------QSLLSSGN 277
           ARSYHHLFP+  + EV P      +  S+ST C+ CQ                +  ++G 
Sbjct: 360 ARSYHHLFPLKNWVEV-PW-----KAASKSTHCYACQIPFPHPPAKPEELSAPAKTATGT 413

Query: 278 KPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
                 AC  C  HFC++CD++ HE+L+NCP C+S
Sbjct: 414 SVSSRYACTICHNHFCIDCDVFCHETLYNCPACQS 448


>gi|242008465|ref|XP_002425024.1| TFIIH basal transcription factor complex p44 subunit, putative
           [Pediculus humanus corporis]
 gi|212508673|gb|EEB12286.1| TFIIH basal transcription factor complex p44 subunit, putative
           [Pediculus humanus corporis]
          Length = 401

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 139/328 (42%), Positives = 204/328 (62%), Gaps = 24/328 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGG--SPESHIKALM 59
           D  P+R     K +E F+ EFF+QNP++Q+G++  ++  A  L+DL G    +  I   +
Sbjct: 76  DLSPNRQLCTVKLLEGFIDEFFEQNPIAQMGVIITRNKRAEKLSDLAGVSKRQKEIIKSI 135

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
           G L C+G  SLQNAL+L    L   P++  REV+ + + L+TCDPG++ +TI+  KE  I
Sbjct: 136 GDLACTGQPSLQNALELAGKSLKLRPTHASREVIAIIANLTTCDPGNVADTIRFMKEENI 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVIGL+AE+++ K L ++T G+YSV LD+ HFK  + +   PP   +  + A+LIKMG
Sbjct: 196 RCSVIGLAAEVYVYKTLTKETKGTYSVILDDVHFKNQLFQQIDPPAMTSNLS-ASLIKMG 254

Query: 180 FPQRA---GEGSISICSCH---KEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPH 232
           FP  +    + S+ +C CH    EVK+   GY CP+C A+ CELP +C++C L LVS+PH
Sbjct: 255 FPHHSLHDDKSSLGLCMCHLDDSEVKLKSDGYKCPQCAAKYCELPVECKVCSLTLVSAPH 314

Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHF 292
           LAR++HHLFP+  F+        D +N +   CFGCQ+    S     LY  C KC++ F
Sbjct: 315 LARTFHHLFPVQSFERF------DGKN-AEGYCFGCQKKF--SEGDTYLY-KCNKCEQRF 364

Query: 293 CLECDIYIHESLHNCPGCESLRHSNPIV 320
           C  CDI++H+SL  CPGC     +NP++
Sbjct: 365 CGVCDIFVHDSLRTCPGCA----TNPLL 388


>gi|195377775|ref|XP_002047663.1| GJ11798 [Drosophila virilis]
 gi|194154821|gb|EDW70005.1| GJ11798 [Drosophila virilis]
          Length = 442

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 137/330 (41%), Positives = 202/330 (61%), Gaps = 24/330 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E F+ EFFDQNP+SQ+G++ +K   A  +T+L G+   H+KAL G 
Sbjct: 78  DLKPTRLLCTLKLLEIFIEEFFDQNPISQLGIIALKAKRAEKVTELTGTSRVHLKALAGL 137

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             +  + + SLQN LDL    L  +PS+  RE++I+  +L+TCDP DI +TI + K  +I
Sbjct: 138 ANVSLTSEPSLQNGLDLALKTLKVVPSHASREIVIIMGSLTTCDPVDINQTIDELKRERI 197

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVI LSAE+ IC++L Q T G++   +D++H+++ ++    PPPA A+    +LI+MG
Sbjct: 198 RCSVISLSAEIHICRYLTQQTMGTFGAVMDDAHYRDQLLAQVDPPPA-AKTQDNSLIRMG 256

Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP    E       +S+C CH E           G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHSKNEVEGKDAPLSMCMCHIENLDEPSELSTTGHHCPQCNSKYCELPVECQCCGLTLV 316

Query: 229 SSPHLARSYHHLFPI-----APFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYV 283
           S+PHLARSYHHLFP+      PF+ V P     P   S   C+GC + L +  +K     
Sbjct: 317 SAPHLARSYHHLFPVPNFEELPFESVPPAA---PDCGSLRKCYGCIKVLNAQTDKS--VF 371

Query: 284 ACPKCKKHFCLECDIYIHESLHNCPGCESL 313
            C  C + FC++CDI+IHE+LH C GC ++
Sbjct: 372 KCGYCNQFFCIDCDIFIHETLHACVGCNTI 401


>gi|301760269|ref|XP_002915938.1| PREDICTED: general transcription factor IIH subunit 2-like
           [Ailuropoda melanoleuca]
          Length = 395

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 144/322 (44%), Positives = 202/322 (62%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQ+G++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQVGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL      +     C+GCQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL----EEHNGEKFCYGCQGEL----KDQHVYV-CAVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QTVFCVDCDVFVHDSLHCCPGC 385


>gi|393217828|gb|EJD03317.1| TFIIH basal transcription factor complex, subunit SSL1 [Fomitiporia
           mediterranea MF3/22]
          Length = 481

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 135/363 (37%), Positives = 203/363 (55%), Gaps = 57/363 (15%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  +      AF+ E+FDQNPL QIG++ ++ G+A  + ++  +P   +K++  +
Sbjct: 120 DMRPTRFDLALSTARAFIVEWFDQNPLGQIGVIGMRSGIAERIGEMSSNPHDVLKSIQDR 179

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             L  +G+ SLQNA+++ +  +S +P++  RE++IL+ +L+T DPG+I ET+  C   +I
Sbjct: 180 HRLEPAGEPSLQNAVEMARSTMSHLPTHSSREIVILFGSLTTVDPGNIHETLDACIRDRI 239

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIA----------- 168
           R +++ L+AEM IC+ +C  TGG + V+L+E HF++L+ E  PPP   A           
Sbjct: 240 RINLVALAAEMKICRDMCGKTGGQFGVSLNEGHFRDLLFELIPPPAQRALSRAAGAGASA 299

Query: 169 -----EFAIANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRI 222
                    A+L+ MGFP R  E S  ++C CH E+K   G+ CPRC A+VC++PTDC +
Sbjct: 300 SAGSGGNPAADLMMMGFPTRLPEASPPALCVCHSELK-SAGFVCPRCAAKVCDVPTDCDV 358

Query: 223 CGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-------------- 268
           CGL +VSSPHLARSYHHLFP+ P+  VT   LN     +   C GC              
Sbjct: 359 CGLMIVSSPHLARSYHHLFPVKPYQAVTDTELNS--TTASRACHGCASPFPPVSSGTSTS 416

Query: 269 ---------------------QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNC 307
                                  ++ S G  P     CP CK  FC ECD+++H+ +H C
Sbjct: 417 TNPVTGSAAGTGAAEGASAAGTSTVTSDGMSPLGRYRCPDCKNDFCTECDVFVHDVVHCC 476

Query: 308 PGC 310
           PGC
Sbjct: 477 PGC 479


>gi|348569038|ref|XP_003470305.1| PREDICTED: general transcription factor IIH subunit 2-like [Cavia
           porcellus]
          Length = 395

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 145/322 (45%), Positives = 202/322 (62%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 IDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTALARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIAALSDQDAKPSFSMAHLDSSSEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL       +    C+GCQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL----EEYKGERFCYGCQGQL----KDQHVYV-CTVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385


>gi|440631747|gb|ELR01666.1| hypothetical protein GMDG_00042 [Geomyces destructans 20631-21]
          Length = 491

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 137/336 (40%), Positives = 202/336 (60%), Gaps = 31/336 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  +  +   +F+ EFF+QNP+SQ+G++ ++DG+A  ++ L G+P  H+ AL GK
Sbjct: 124 DLRPTRYLLTLRHATSFISEFFEQNPISQLGILGMRDGLAKPISPLSGTPTVHLGAL-GK 182

Query: 62  LGCS---GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
           L      G  SLQNAL++ +  L   PS+G REVLI+  AL + DPGDI  TI      +
Sbjct: 183 LRTQDPQGSPSLQNALEMARASLFHAPSHGTREVLIISGALLSSDPGDIHTTISSLTSDR 242

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIA-EFAIANLIK 177
           IR S IGL+A++ I   +C  T G YSVAL E HF+ L+M    PPP  A E   ++L+ 
Sbjct: 243 IRVSAIGLAAQVAILSEICTKTKGDYSVALHEEHFRALLMGATTPPPTRAKEQNQSSLLM 302

Query: 178 MGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSY 237
           MGFP R    ++++C+CH ++  G GY C RC ++VC LP +C  CGL L+ S HLARSY
Sbjct: 303 MGFPSRTVATAVTLCACHSKLTRG-GYLCSRCGSKVCSLPAECPACGLTLILSTHLARSY 361

Query: 238 HHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC---------------------QQSLLSSG 276
           HHLFP+  + EV+    ++       +CF C                     +++++ + 
Sbjct: 362 HHLFPLRNWREVS---WSEAAGSGVGSCFACGVGFNKPPPRQVVEGKKEGGRREAMMKAV 418

Query: 277 NKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
           ++ G Y AC  C +HFC++CD++ HE +HNCPGC+S
Sbjct: 419 SESGRY-ACEVCGEHFCIDCDVFAHEVVHNCPGCQS 453


>gi|281337668|gb|EFB13252.1| hypothetical protein PANDA_003972 [Ailuropoda melanoleuca]
          Length = 364

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 144/322 (44%), Positives = 202/322 (62%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQ+G++  K   A  LT+L G+P  HI +L   
Sbjct: 44  DLKPNRLTCTLKLLEYFVEEYFDQNPISQVGIIVTKSKRAEKLTELSGNPRKHITSLKKA 103

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 104 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 163

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 164 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 222

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 223 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 282

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL      +     C+GCQ  L        +YV C  C
Sbjct: 283 SAPHLARSYHHLFPLDAFQEI-PL----EEHNGEKFCYGCQGEL----KDQHVYV-CAVC 332

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 333 QTVFCVDCDVFVHDSLHCCPGC 354


>gi|386780666|ref|NP_001247509.1| general transcription factor IIH subunit 2 [Macaca mulatta]
 gi|297675411|ref|XP_002815673.1| PREDICTED: general transcription factor IIH subunit 2 isoform 2
           [Pongo abelii]
 gi|297675413|ref|XP_002815674.1| PREDICTED: general transcription factor IIH subunit 2 isoform 3
           [Pongo abelii]
 gi|402871770|ref|XP_003899824.1| PREDICTED: general transcription factor IIH subunit 2 isoform 1
           [Papio anubis]
 gi|402871772|ref|XP_003899825.1| PREDICTED: general transcription factor IIH subunit 2 isoform 2
           [Papio anubis]
 gi|355691370|gb|EHH26555.1| General transcription factor IIH polypeptide 2 [Macaca mulatta]
 gi|355749973|gb|EHH54311.1| General transcription factor IIH polypeptide 2 [Macaca
           fascicularis]
 gi|380785669|gb|AFE64710.1| general transcription factor IIH subunit 2 [Macaca mulatta]
 gi|383411171|gb|AFH28799.1| general transcription factor IIH subunit 2 [Macaca mulatta]
 gi|384940950|gb|AFI34080.1| general transcription factor IIH subunit 2 [Macaca mulatta]
          Length = 395

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 145/322 (45%), Positives = 201/322 (62%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+     N  R      C+GCQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEIPLEEYNGER-----FCYGCQGEL----KDQHVYV-CAVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385


>gi|195592457|ref|XP_002085951.1| GD15058 [Drosophila simulans]
 gi|194197960|gb|EDX11536.1| GD15058 [Drosophila simulans]
          Length = 438

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 138/331 (41%), Positives = 204/331 (61%), Gaps = 21/331 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    K +E F+ EFFDQNP+SQ+GL+ +K   A  +T+L G+   H+KAL  +
Sbjct: 78  DLKPTRLRCTVKLLELFIEEFFDQNPISQLGLIALKAKRAEKVTELTGTSRVHLKALESL 137

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             +  + + SLQN LDL    L  +PS+  RE++I+  +L+TCDP DI  TI + K+  I
Sbjct: 138 ANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIMGSLTTCDPVDINLTIDELKKEGI 197

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVI LSAE+ + ++L Q T G++   LD++HF++ +M    PPPA A+    +LI+MG
Sbjct: 198 RCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256

Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP    E       +S+C CH E           G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHHCPQCNSKYCELPVECQSCGLTLV 316

Query: 229 SSPHLARSYHHLFPIAPFDEV----TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA 284
           S+PHLARSYHHLFP+  F+E+     P   +D  +  R  C+GC ++L    +K    V 
Sbjct: 317 SAPHLARSYHHLFPVPNFEELPFEAMPASSSDFTSDVRE-CYGCAKALGQGVDKVADKVV 375

Query: 285 --CPKCKKHFCLECDIYIHESLHNCPGCESL 313
             C  CK+ FC++CDI+IH++LH C GC ++
Sbjct: 376 FRCGFCKQFFCVDCDIFIHDTLHACVGCNTI 406


>gi|342888788|gb|EGU88007.1| hypothetical protein FOXB_01490 [Fusarium oxysporum Fo5176]
          Length = 468

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 151/355 (42%), Positives = 206/355 (58%), Gaps = 41/355 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D  P+R  ++     AFVREFF+QNP+SQ+G++ ++DGVA  ++D+GG+P  H++ L G 
Sbjct: 107 DLLPTRYRLMLSYAAAFVREFFEQNPISQLGIIGMRDGVAVRISDVGGNPTEHLEKLKGL 166

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G+ SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI ETI      +I
Sbjct: 167 ENEDPQGNPSLQNALEMCRGALFHAPSHGTREVLIIYGALLSSDPGDIHETIGNLITDRI 226

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
           R S++GLSA++ IC  LC  T       Y++A+DE HF+EL +    PP    AE + A+
Sbjct: 227 RVSIVGLSAQVAICADLCSRTNAGDESQYNIAMDEVHFRELFLAATTPPVTRTAEQSTAS 286

Query: 175 LIKMGFPQRA--GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
           L+ MGFP R     G+ S C+CH       GY C RC ARVC +P +C  C L L+ S H
Sbjct: 287 LLMMGFPSRTLVPNGTTSYCACHNR-PFREGYLCTRCGARVCRIPAECPACDLTLILSTH 345

Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNR-SRS-TCFGC-------------QQSLLSSG- 276
           LARSYHHLFP+  + EV       P  + SRS  CF C              +S   SG 
Sbjct: 346 LARSYHHLFPLRNWVEV-------PWTKASRSAACFSCLAPFPEPPKGKAPDKSREDSGA 398

Query: 277 -------NKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRHSNPIVANEG 324
                  ++ G Y AC  C +HFC++CD++ HE +HNCPGC+SL       A+ G
Sbjct: 399 PKTAKGVSESGRY-ACEVCGQHFCIDCDVFAHEVVHNCPGCQSLLSKTDAAASSG 452


>gi|412993157|emb|CCO16690.1| predicted protein [Bathycoccus prasinos]
          Length = 452

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 147/336 (43%), Positives = 195/336 (58%), Gaps = 30/336 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R AV+   +  FVREFF+QNPLSQ+GL+  KDG A  +T+L GSPE+H+KA+   
Sbjct: 112 DMRPNRSAVLLPLMIKFVREFFNQNPLSQLGLIACKDGKAERITELSGSPETHVKAIKKA 171

Query: 62  L---GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
               G  G  SLQN L+     L  +P +G RE++ + S+LSTCDPG+I ++++K K+ K
Sbjct: 172 FASDGVGGSFSLQNGLEQAMEGLRDVPPFGAREIIAIISSLSTCDPGNINDSVRKVKKMK 231

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
            R +VI ++AE  + K L ++T G +SV+LD+SH   LIME APPP  + E A   L++M
Sbjct: 232 ARTNVICVAAETRVFKKLSEETKGKFSVSLDQSHLTRLIMECAPPPALLLETAKPALVEM 291

Query: 179 GFPQRA------GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
           GFP+R       G G                Y CPRC+AR  ELP+ C  C L LVSSPH
Sbjct: 292 GFPRREPRKFGLGAGDEDDRDVLTIGPRNGEYRCPRCEARAEELPSQCGTCQLSLVSSPH 351

Query: 233 LARSYHHLFPIAPFDEVT------------------PLCLNDPRNRSRSTCFGCQQSLLS 274
           LARSYHHLFP+ PF EV                       +D   +    CFGC   + +
Sbjct: 352 LARSYHHLFPVMPFVEVKVDGDSKNEEEEEKKRKKNSGKDDDDDGQFLRECFGCCVVVDA 411

Query: 275 SGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
           S    G+   C KC+K FC  CD+YIH+ LHNC GC
Sbjct: 412 S---TGMLSKCLKCEKEFCFACDVYIHDRLHNCVGC 444


>gi|6681762|ref|NP_001506.1| general transcription factor IIH subunit 2 [Homo sapiens]
 gi|17380326|sp|Q13888.1|TF2H2_HUMAN RecName: Full=General transcription factor IIH subunit 2; AltName:
           Full=Basic transcription factor 2 44 kDa subunit;
           Short=BTF2 p44; AltName: Full=General transcription
           factor IIH polypeptide 2; AltName: Full=TFIIH basal
           transcription factor complex p44 subunit
 gi|496609|emb|CAA82910.1| basic transcripion factor 2, 44 kD subunit [Homo sapiens]
 gi|5531809|gb|AAD44479.1| basic transcription factor 2 [Homo sapiens]
 gi|146327202|gb|AAI41604.1| General transcription factor IIH, polypeptide 2, 44kDa [synthetic
           construct]
 gi|147898053|gb|AAI40304.1| General transcription factor IIH, polypeptide 2, 44kDa [synthetic
           construct]
 gi|208968221|dbj|BAG73949.1| General transcription factor IIH subunit 2 [synthetic construct]
          Length = 395

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 147/322 (45%), Positives = 204/322 (63%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSIAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL   +  N  R  C+GCQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL---EEYNGER-FCYGCQGEL----KDQHVYV-CAVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385


>gi|114599829|ref|XP_001158018.1| PREDICTED: uncharacterized protein LOC461830 isoform 9 [Pan
           troglodytes]
 gi|114599831|ref|XP_001158068.1| PREDICTED: uncharacterized protein LOC461830 isoform 10 [Pan
           troglodytes]
 gi|114599833|ref|XP_001158128.1| PREDICTED: uncharacterized protein LOC461830 isoform 11 [Pan
           troglodytes]
 gi|397470434|ref|XP_003806827.1| PREDICTED: general transcription factor IIH subunit 2 [Pan
           paniscus]
 gi|397470436|ref|XP_003806828.1| PREDICTED: general transcription factor IIH subunit 2 [Pan
           paniscus]
 gi|397470438|ref|XP_003806829.1| PREDICTED: general transcription factor IIH subunit 2 [Pan
           paniscus]
 gi|410207616|gb|JAA01027.1| general transcription factor IIH, polypeptide 2C [Pan troglodytes]
 gi|410258220|gb|JAA17077.1| general transcription factor IIH, polypeptide 2C [Pan troglodytes]
 gi|410299712|gb|JAA28456.1| general transcription factor IIH, polypeptide 2, 44kDa [Pan
           troglodytes]
 gi|410350531|gb|JAA41869.1| general transcription factor IIH, polypeptide 2C [Pan troglodytes]
 gi|410350533|gb|JAA41870.1| general transcription factor IIH, polypeptide 2D [Pan troglodytes]
          Length = 395

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 144/322 (44%), Positives = 201/322 (62%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHVTSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+     N  R      C+GCQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEIPLEEYNGER-----FCYGCQGEL----KDQHVYV-CAVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385


>gi|320040921|gb|EFW22854.1| RNA polymerase TFIIH complex subunit Ssl1 [Coccidioides posadasii
           str. Silveira]
          Length = 471

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 140/334 (41%), Positives = 201/334 (60%), Gaps = 31/334 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV E+F+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 118 DVRPTRYLLTLRYAQEFVLEYFEQNPISQLGIIGMRDGLAVRISDMSGNPSEHILAIQAL 177

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                +G  SLQNAL++ +G L   PS+G REVLI++  L + DPGDI +TI      KI
Sbjct: 178 RTRDPTGLPSLQNALEMARGALFHTPSHGTREVLIIFGTLLSSDPGDIHKTITSLVADKI 237

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
           R  V+GL+AE+ IC+ +C  T       Y VAL+E HF+EL+ME   PP   ++  A+ +
Sbjct: 238 RVGVVGLAAEVAICREICSKTNAGDDSGYGVALNEQHFRELMMEITTPPVTRSQKQAVNS 297

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E S S+C+CH +   G GY C RC +++C LP +C  CGL L+ S HLA
Sbjct: 298 LLMMGFPSRTVEPSQSLCACHSKPSRG-GYLCSRCGSKICTLPAECPACGLTLILSTHLA 356

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRS--RSTCFGC----------QQSLLSSGNKPGLY 282
           RSYHHLFP+  + EV       P N++   S CF C           Q     G   G+ 
Sbjct: 357 RSYHHLFPLINWVEV-------PWNKASISSNCFACGNPFPPVPVRAQWETRGGTVKGMS 409

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           V+    C  C  HFC++CD++ HE +HNCPGC+S
Sbjct: 410 VSSRYECTVCHNHFCIDCDVFAHEIVHNCPGCQS 443


>gi|21356299|ref|NP_649427.1| Ssl1, isoform A [Drosophila melanogaster]
 gi|442634242|ref|NP_001262227.1| Ssl1, isoform B [Drosophila melanogaster]
 gi|10726963|gb|AAF51879.2| Ssl1, isoform A [Drosophila melanogaster]
 gi|15010516|gb|AAK77306.1| GH08526p [Drosophila melanogaster]
 gi|220945098|gb|ACL85092.1| Ssl1-PA [synthetic construct]
 gi|220952514|gb|ACL88800.1| Ssl1-PA [synthetic construct]
 gi|440216208|gb|AGB94920.1| Ssl1, isoform B [Drosophila melanogaster]
          Length = 438

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 138/331 (41%), Positives = 204/331 (61%), Gaps = 21/331 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    K +E F+ EFFDQNP+SQ+GL+ +K   A  +T+L G+   H+KAL  +
Sbjct: 78  DLKPTRLRCTVKLLELFIEEFFDQNPISQLGLIALKAKRAEKVTELTGTSRVHLKALESL 137

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             +  + + SLQN LDL    L  +PS+  RE++I+  +L+TCDP DI  TI + K+  I
Sbjct: 138 ANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIMGSLTTCDPVDINLTIDELKKEGI 197

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVI LSAE+ + ++L Q T G++   LD++HF++ +M    PPPA A+    +LI+MG
Sbjct: 198 RCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256

Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP    E       +S+C CH E           G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHHCPQCNSKYCELPVECQSCGLTLV 316

Query: 229 SSPHLARSYHHLFPIAPFDEV----TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA 284
           S+PHLARSYHHLFP+  F+E+     P   +D  +  R  C+GC ++L    +K    V 
Sbjct: 317 SAPHLARSYHHLFPVPNFEELPFEAMPASSSDLTSDVRE-CYGCAKALGQGVDKVADKVV 375

Query: 285 --CPKCKKHFCLECDIYIHESLHNCPGCESL 313
             C  CK+ FC++CDI+IH++LH C GC ++
Sbjct: 376 FRCGFCKQFFCVDCDIFIHDTLHACVGCNTI 406


>gi|303319293|ref|XP_003069646.1| TFIIH basal transcription factor complex p47 subunit , putative
           [Coccidioides posadasii C735 delta SOWgp]
 gi|240109332|gb|EER27501.1| TFIIH basal transcription factor complex p47 subunit , putative
           [Coccidioides posadasii C735 delta SOWgp]
          Length = 471

 Score =  264 bits (675), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 140/334 (41%), Positives = 201/334 (60%), Gaps = 31/334 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV E+F+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 118 DVRPTRYLLTLRYAQEFVLEYFEQNPISQLGIIGMRDGLAVRISDMSGNPSEHILAIQAL 177

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                +G  SLQNAL++ +G L   PS+G REVLI++  L + DPGDI +TI      KI
Sbjct: 178 RTRDPTGLPSLQNALEMARGALFHTPSHGTREVLIIFGTLLSSDPGDIHKTITSLVADKI 237

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
           R  V+GL+AE+ IC+ +C  T       Y VAL+E HF+EL+ME   PP   ++  A+ +
Sbjct: 238 RVGVVGLAAEVAICREICSKTNAGDDSGYGVALNEQHFRELMMEITTPPVTRSQKQAVNS 297

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E S S+C+CH +   G GY C RC +++C LP +C  CGL L+ S HLA
Sbjct: 298 LLMMGFPSRTVEPSQSLCACHSKPSRG-GYLCSRCGSKICTLPAECPACGLTLILSTHLA 356

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRS--RSTCFGC----------QQSLLSSGNKPGLY 282
           RSYHHLFP+  + EV       P N++   S CF C           Q     G   G+ 
Sbjct: 357 RSYHHLFPLINWVEV-------PWNKASISSNCFACGNPFPPVPVRAQWETRGGTVKGMS 409

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           V+    C  C  HFC++CD++ HE +HNCPGC+S
Sbjct: 410 VSSRYECTVCHNHFCIDCDVFAHEIVHNCPGCQS 443


>gi|158258154|dbj|BAF85050.1| unnamed protein product [Homo sapiens]
          Length = 395

 Score =  264 bits (675), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 147/322 (45%), Positives = 204/322 (63%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKEA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHLSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL   +  N  R  C+GCQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL---EEYNGER-FCYGCQGEL----KDQHVYV-CAVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385


>gi|254566501|ref|XP_002490361.1| Component of the core form of RNA polymerase transcription factor
           TFIIH [Komagataella pastoris GS115]
 gi|238030157|emb|CAY68080.1| Component of the core form of RNA polymerase transcription factor
           TFIIH [Komagataella pastoris GS115]
 gi|328350755|emb|CCA37155.1| Suppressor of stem-loop protein 1 [Komagataella pastoris CBS 7435]
          Length = 447

 Score =  264 bits (675), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 140/334 (41%), Positives = 201/334 (60%), Gaps = 31/334 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A+       FV  FFDQNP+SQ+G+V +++G+A  ++++ G+P +H++AL  +
Sbjct: 118 DLRPNRAALTISNAIEFVNNFFDQNPISQLGIVLMRNGIATLVSEVSGNPNNHVEALRSI 177

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            +L  +GD SLQNA ++ +GLL   PS+  RE+L+++ AL T DPGDI  TI    + KI
Sbjct: 178 RRLEPTGDPSLQNAFEMARGLLLHTPSHSTREILMIFGALFTSDPGDIHTTIDSLVKEKI 237

Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPP----PPAIAEFA 171
           R  +IGL+A++ ICK +C+ T       Y+V L+E HFK+L+++   P      A  E  
Sbjct: 238 RVKIIGLTAQVAICKEICRRTNYGDDTLYAVILNEGHFKDLLLDAVTPLAISKSAQVEMR 297

Query: 172 IANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
              LIK GFP R  E   S C+CH ++  G GY CP CK++VC LPT C  C L L+ S 
Sbjct: 298 GFTLIKAGFPTRISEAIPSFCACHSKLIHG-GYICPGCKSKVCMLPTVCPCCNLMLILST 356

Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-----------QQSLLSSGNKPG 280
           HLARSYHHLFP+  F EV PL      +   + CF C           ++S   +  K  
Sbjct: 357 HLARSYHHLFPLKLFQEV-PLA----ESYISTNCFSCLTEFPQGMSESERSNSDNRAKKE 411

Query: 281 LYVA----CPKCKKHFCLECDIYIHESLHNCPGC 310
            + +    CP C KHFC++CD+++H+ LHNCPGC
Sbjct: 412 FHTSSRYQCPDCNKHFCVDCDVFVHDVLHNCPGC 445


>gi|109809737|ref|NP_001035955.1| general transcription factor IIH subunit 2-like protein [Homo
           sapiens]
 gi|148922852|ref|NP_001092198.1| general transcription factor IIH subunit 2-like protein [Homo
           sapiens]
 gi|74737138|sp|Q6P1K8.1|T2H2L_HUMAN RecName: Full=General transcription factor IIH subunit 2-like
           protein; AltName: Full=General transcription factor IIH
           polypeptide 2-like protein
 gi|40353046|gb|AAH64557.1| GTF2H2D protein [Homo sapiens]
 gi|40674450|gb|AAH65021.1| General transcription factor IIH, polypeptide 2D [Homo sapiens]
 gi|76779844|gb|AAI05981.1| GTF2H2C protein [Homo sapiens]
 gi|189054850|dbj|BAG37691.1| unnamed protein product [Homo sapiens]
 gi|219521708|gb|AAI71860.1| General transcription factor IIH, polypeptide 2D [Homo sapiens]
 gi|312152202|gb|ADQ32613.1| general transcription factor IIH, polypeptide 2, 44kDa [synthetic
           construct]
          Length = 395

 Score =  264 bits (675), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 147/322 (45%), Positives = 204/322 (63%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKEA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHLSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL   +  N  R  C+GCQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL---EEYNGER-FCYGCQGEL----KDQHVYV-CAVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385


>gi|195496832|ref|XP_002095860.1| GE22646 [Drosophila yakuba]
 gi|194181961|gb|EDW95572.1| GE22646 [Drosophila yakuba]
          Length = 438

 Score =  264 bits (675), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 137/331 (41%), Positives = 205/331 (61%), Gaps = 21/331 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    K +E F+ EFFDQNP+SQ+G++ +K   A  +T+L G+   H+KAL  +
Sbjct: 78  DLKPTRLRCTVKLLELFIEEFFDQNPISQLGIIALKAKRAEKVTELTGTSRVHLKALESL 137

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             +  + + SLQN LDL    L  +PS+  RE++I+  +L+TCDP DI  TI + K+  I
Sbjct: 138 ANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIMGSLTTCDPVDINITIDELKKEGI 197

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVI LSAE+ + ++L Q T G++   LD++HF++ +M    PPPA A+    +LI+MG
Sbjct: 198 RCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256

Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP    E       +S+C CH E           G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHHCPQCNSKYCELPVECQSCGLTLV 316

Query: 229 SSPHLARSYHHLFPIAPFDEV----TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA 284
           S+PHLARSYHHLFP+  F+E+     P   +D  +  R  C+ C ++L  S +K    V 
Sbjct: 317 SAPHLARSYHHLFPVPNFEELPFEAMPASSSDLTSGVRE-CYACAKALGQSVDKVADKVV 375

Query: 285 --CPKCKKHFCLECDIYIHESLHNCPGCESL 313
             C  CK++FC++CDI+IH++LH C GC ++
Sbjct: 376 YRCGFCKQYFCVDCDIFIHDTLHACVGCNTI 406


>gi|194876451|ref|XP_001973778.1| GG16285 [Drosophila erecta]
 gi|190655561|gb|EDV52804.1| GG16285 [Drosophila erecta]
          Length = 438

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 137/331 (41%), Positives = 202/331 (61%), Gaps = 21/331 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    K +E F+ EFFDQNP+SQ+G++ +K   A  +T+L G+   H+KAL  +
Sbjct: 78  DLKPTRLRCTVKLLELFIEEFFDQNPISQLGIIALKAKRAEKVTELTGTSRVHLKALESL 137

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             +  + + SLQN LDL    L  +PS+  RE++I+  +L+TCDP DI  TI + K   I
Sbjct: 138 ANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIMGSLTTCDPVDINITIDELKREGI 197

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVI LSAE+ + ++L Q T G++   LD++HF++ +M    PPPA A+    +LI+MG
Sbjct: 198 RCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256

Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP    E       +S+C CH E           G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHHCPQCNSKYCELPVECQSCGLTLV 316

Query: 229 SSPHLARSYHHLFPIAPFDEV----TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA 284
           S+PHLARSYHHLFP+  F+E+     P   +D  N  R  C+ C ++L    +K    V 
Sbjct: 317 SAPHLARSYHHLFPVPNFEELPFEAMPASSSDLNNGVRE-CYACAKALGQGVDKAADKVV 375

Query: 285 --CPKCKKHFCLECDIYIHESLHNCPGCESL 313
             C  CK+ FC++CDI+IH++LH C GC ++
Sbjct: 376 FRCGFCKQFFCVDCDIFIHDTLHACVGCNTI 406


>gi|398406773|ref|XP_003854852.1| hypothetical protein MYCGRDRAFT_37615 [Zymoseptoria tritici IPO323]
 gi|339474736|gb|EGP89828.1| hypothetical protein MYCGRDRAFT_37615 [Zymoseptoria tritici IPO323]
          Length = 445

 Score =  264 bits (674), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 140/344 (40%), Positives = 195/344 (56%), Gaps = 34/344 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  +      AF+REFF+QNP+SQ+G++ +++G+A  ++D+ G+P  HI A+   
Sbjct: 100 DLRPTRHLLTITYTIAFIREFFEQNPISQLGILGMREGLAIRVSDMSGNPNDHIAAVRAL 159

Query: 62  LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            G    G+ SLQN LD+ +  L   PS+G REV+I+  AL T DPGDI +TI+ C + +I
Sbjct: 160 RGTDPKGNPSLQNGLDMARAALYHTPSHGTREVVIILGALLTSDPGDIHDTIKACIKDRI 219

Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIAEFAIAN- 174
           R ++IGL+A+M IC  +C+ T       Y+VA+DE HF+EL+M    PP   +  A A  
Sbjct: 220 RVTIIGLAAQMHICAEICRKTNAGDDNCYNVAVDEVHFRELLMGITTPPVVRSTDAEAQK 279

Query: 175 -----LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
                L+ MGFP R  E   ++C+CH  +  G GY C RCKA+VC LP  C  C L L+ 
Sbjct: 280 LNQAALLMMGFPSRIVEEHATLCACHGNLTRG-GYLCSRCKAKVCSLPATCPTCDLTLIL 338

Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC------------------QQS 271
           S HLARSYHHLFP+  + EV+        +R    CFGC                    S
Sbjct: 339 STHLARSYHHLFPLQNWVEVS---WQRAADRGSEQCFGCLSPFPRVHGATNGDDADKTYS 395

Query: 272 LLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRH 315
             + G        C  C+ HFC++CD + HE+ HNCPGC S  H
Sbjct: 396 KRAEGASESSRYECESCQNHFCIDCDCFNHETSHNCPGCTSSTH 439


>gi|336272171|ref|XP_003350843.1| hypothetical protein SMAC_02512 [Sordaria macrospora k-hell]
 gi|380095007|emb|CCC07509.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 466

 Score =  264 bits (674), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 153/355 (43%), Positives = 200/355 (56%), Gaps = 51/355 (14%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D  P+R  VV      FVRE+F+QNP+SQ+G+V ++DG+A  ++DL G+P  HI+ LM  
Sbjct: 94  DLLPNRYRVVLNNAIGFVREYFEQNPISQLGIVGMRDGIAVRISDLSGNPAEHIEKLMQW 153

Query: 61  --KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
             +    G+SSLQNAL + +G L Q PS+  REVLI+Y AL + DPGDI +TI      +
Sbjct: 154 SEQQDPQGNSSLQNALQMCRGALYQTPSHATREVLIIYGALVSIDPGDIHDTINDLVADR 213

Query: 119 IRCSVIGLSAEMFICKHLCQDTG---GSYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
           IR SV+GL+ ++ IC  LC+ T    G+YSVA+DE H KEL      PP     E   A+
Sbjct: 214 IRVSVVGLAGQVAICTELCKRTNNHDGNYSVAIDEVHLKELFFAATTPPVTRTPEQNTAS 273

Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
           L+ MGFP R  A +  +S C+CH +     GYTCPRC  +VC LP DC IC L L+ S H
Sbjct: 274 LLMMGFPSRTLAPKDHVSYCACHAK-PTREGYTCPRCGIKVCRLPIDCPICKLTLIQSTH 332

Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGC----------------------- 268
           LARSYHHLFP+  F EV P         SRST C+ C                       
Sbjct: 333 LARSYHHLFPLKVFVEV-PWS-----QASRSTACYSCLSPFPTGPRDTVAAAPAVLGGRA 386

Query: 269 ----QQSLLSSGNKPGL-------YVACPKCKKHFCLECDIYIHESLHNCPGCES 312
               +Q       KP L         AC  C  HFC++CD++ HE++HNCPGC+S
Sbjct: 387 GKGGKQDSDPKNPKPELKGVSESGRYACQVCGNHFCIDCDVFAHETIHNCPGCQS 441


>gi|255948596|ref|XP_002565065.1| Pc22g11160 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211592082|emb|CAP98404.1| Pc22g11160 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 450

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 146/343 (42%), Positives = 205/343 (59%), Gaps = 34/343 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FVREFF+QNP+SQ+G++ ++DG+A  ++ + G+P  H+ A+  +
Sbjct: 101 DLRPTRYLLTLRYAQEFVREFFEQNPISQLGVLGLRDGLALRVSAMSGNPTEHLTAIQTL 160

Query: 60  GKLGCSGDSSLQNALDLVQGLL---SQIPSYGHREVLILYSALSTCDPGDIMETIQKCKE 116
                 G  SLQN L++ +G L   S  PS+G REVL +Y +L + DPGDI  TI     
Sbjct: 161 KTQDPKGLPSLQNGLEMARGALFEFSHTPSHGTREVLFIYGSLLSSDPGDIHRTIDTLIS 220

Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEFAI 172
            KIR  ++GL A++ IC+ LC  T G    +Y VAL+E HF+EL+M+   PP A ++   
Sbjct: 221 DKIRVGIVGLGAQVAICRELCTKTNGGDETAYGVALNEQHFRELMMDVTTPPAAYSQKES 280

Query: 173 AN-LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
           A+ L+ MGFP R  E S S+C+CH +   G GY C RC ++VC LP +C  CGL L+ S 
Sbjct: 281 ASSLLMMGFPSRTVESSPSLCACHSKPSRG-GYLCSRCNSKVCSLPAECPSCGLTLILST 339

Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGCQQSLLS----------SGNKPG 280
           HLARSYHHLFP+  + EV P      R  SRS+ CF C  S  S               G
Sbjct: 340 HLARSYHHLFPLINWVEV-PW-----RRASRSSACFACGLSFPSVPPEDHWQTAENQTKG 393

Query: 281 LYVA----CPKCKKHFCLECDIYIHESLHNCPGCESLRHSNPI 319
           + V+    C  C  HFC++CD++ HE +HNCPGC+S   SNP+
Sbjct: 394 MSVSSRYECTACHNHFCIDCDLFAHEVVHNCPGCQS--RSNPL 434


>gi|195348862|ref|XP_002040966.1| GM22477 [Drosophila sechellia]
 gi|194122476|gb|EDW44519.1| GM22477 [Drosophila sechellia]
          Length = 438

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 138/332 (41%), Positives = 204/332 (61%), Gaps = 23/332 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    K +E F+ EFFDQNP+SQ+GL+ +K   A  +T+L G+   H+KAL  +
Sbjct: 78  DLKPTRLRCTVKLLELFIEEFFDQNPISQLGLIALKAKRAEKVTELTGTSRVHLKALESL 137

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             +  + + SLQN LDL    L  +PS+  RE++I+  +L+TCDP DI  TI + K+  I
Sbjct: 138 ANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIMGSLTTCDPVDINLTIDELKKEGI 197

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVI LSAE+ + ++L Q T G++   LD++HF++ +M    PPPA A+    +LI+MG
Sbjct: 198 RCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256

Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP    E       +S+C CH E           G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHHCPQCNSKYCELPVECQSCGLTLV 316

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRST-----CFGCQQSLLSSGNKPGLYV 283
           S+PHLARSYHHLFP+  F+E+    +  P + S  T     C+GC ++L    +K    V
Sbjct: 317 SAPHLARSYHHLFPVPNFEELPFEAM--PASSSEFTSDVRECYGCAKALGQGVDKVADKV 374

Query: 284 A--CPKCKKHFCLECDIYIHESLHNCPGCESL 313
              C  CK+ FC++CDI+IH++LH C GC ++
Sbjct: 375 VFRCGFCKQFFCVDCDIFIHDTLHACVGCNTI 406


>gi|121717593|ref|XP_001276096.1| transcription factor and DNA repair complex, core TFIIH, putative
           [Aspergillus clavatus NRRL 1]
 gi|119404294|gb|EAW14670.1| transcription factor and DNA repair complex, core TFIIH, putative
           [Aspergillus clavatus NRRL 1]
          Length = 456

 Score =  263 bits (673), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 139/333 (41%), Positives = 202/333 (60%), Gaps = 29/333 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FVREFF+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 106 DLRPTRYLLTLRYAQEFVREFFEQNPISQLGVLGLRDGLAIRISDMSGNPTEHISAIQAL 165

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQN L + +G L   PS+G REV I++ +L + DPGDI  TI      KI
Sbjct: 166 RDHDAKGLPSLQNGLGMARGALFHTPSHGTREVFIVFGSLLSSDPGDIYRTIATLVNDKI 225

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
           R  ++GL+A++ IC+ LC  T G    +Y VAL+E HF+EL+M+   PP   ++  + ++
Sbjct: 226 RVRIVGLAAQVAICRELCSRTNGGDDTTYGVALNEQHFRELMMDVTTPPVTYSQKQSTSS 285

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E   S+C+CH +   G GY+C RC ++VC LP +C  CGL L+ S HLA
Sbjct: 286 LLMMGFPSRTVESCPSLCACHSKPSCG-GYSCSRCNSKVCGLPAECPSCGLTLILSTHLA 344

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGC----------QQSLLSSGNKPGLYV 283
           RSYHHLFP+  + EV P      +  SRS+ CF C           Q   +     G+ V
Sbjct: 345 RSYHHLFPLMNWVEV-PW-----QQASRSSACFACGVYFPAVPPNDQWQTTESQAKGMSV 398

Query: 284 A----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           +    C  CK HFC++CD++ HE +HNCPGC+S
Sbjct: 399 SSRYECTVCKNHFCIDCDLFAHEVVHNCPGCQS 431


>gi|289743109|gb|ADD20302.1| RNA polymerase II transcription initiation [Glossina morsitans
           morsitans]
          Length = 419

 Score =  263 bits (673), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 140/324 (43%), Positives = 200/324 (61%), Gaps = 22/324 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    + +E F+ EFFDQNP+SQ+GL+ +K   A  +T L G+ + H+ AL  +
Sbjct: 78  DLKPTRLLCTLRLLEIFIEEFFDQNPVSQMGLIVLKSKRAEKITALTGTAKLHLNALESL 137

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K+  SG+ SLQN LDL    L  +PS+  RE+L++ ++L+TCDP DI  TI   K   I
Sbjct: 138 SKISLSGEPSLQNGLDLALKSLKVVPSHASREILVIMASLTTCDPVDINVTIDILKNEGI 197

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVI LSAE+++C+HL Q T G Y   LD++HF++ ++ +  PP A ++    +LIKMG
Sbjct: 198 RCSVISLSAEIYVCRHLTQQTVGLYGAVLDDAHFRDQLLAYIDPPAA-SQTQENSLIKMG 256

Query: 180 FPQ-RAGEGS---ISICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQLVS 229
           FP  RA E     +S+C CH E           GY CP+C ++ CELP +C+ CGL LVS
Sbjct: 257 FPHARADESKDPPLSMCMCHIENPEEPSKLTSGGYHCPQCFSKYCELPVECQTCGLTLVS 316

Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCK 289
           +PHLARSYHHLFP+  F E+         N   + C+ CQ+ L  S NK      C  C 
Sbjct: 317 APHLARSYHHLFPVQHFAELA-------YNGQAAVCYACQRPLSESTNK--FVYRCEICT 367

Query: 290 KHFCLECDIYIHESLHNCPGCESL 313
           + +C +CDI+IH++LH C GC ++
Sbjct: 368 QIYCCDCDIFIHDTLHTCVGCNTI 391


>gi|126314992|ref|XP_001364912.1| PREDICTED: general transcription factor IIH subunit 2-like
           [Monodelphis domestica]
          Length = 395

 Score =  263 bits (673), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 149/330 (45%), Positives = 202/330 (61%), Gaps = 27/330 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHIASLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L L    L  +P +  REVL+L+S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSLAMQTLKHMPGHTSREVLVLFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGAYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+  ELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSSTEPGLTLGGYFCPQCRAKYSELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F EV+    N  R      C GC    L       +Y+ C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEVSLEEYNGER-----FCHGC----LGELKDQHVYI-CTVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGCESLRHSNP 318
           +  FC++CD++IHESLH CPGC    H NP
Sbjct: 364 QNVFCVDCDLFIHESLHCCPGC---IHKNP 390


>gi|440793605|gb|ELR14784.1| Ssl1like protein [Acanthamoeba castellanii str. Neff]
          Length = 360

 Score =  263 bits (673), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 134/310 (43%), Positives = 188/310 (60%), Gaps = 49/310 (15%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +PSR+AV  K VE+F+RE+F+QNP++Q+G+V ++DG A  L++LGG+P  HI  L  K
Sbjct: 97  DMKPSRLAVTLKIVESFIREYFEQNPIAQLGIVIMRDGAAEKLSELGGNPARHIGVLKDK 156

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
                               + +P YG RE+++L++ L+TCDP DI  TI K K+  IRC
Sbjct: 157 --------------------AHVPKYGSREIVVLFAGLATCDPTDIKSTISKLKKDSIRC 196

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           S++GLSAE+ IC+ L + T G+Y VAL E H+++L+  H  PPPA  +   ++LI+MGFP
Sbjct: 197 SIVGLSAEIHICRVLAESTDGTYGVALHEGHYRQLVFAHVSPPPAQPQQVASSLIRMGFP 256

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
            R  +   S CSCHK ++   GY    C A+ CELP +C +CGL LVSS HLARSYHHLF
Sbjct: 257 TRKRDHFPSFCSCHKSLRRE-GY----CGAKCCELPMECPVCGLTLVSSSHLARSYHHLF 311

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
           P+  F                       Q+  S+ +   +Y ACP+C + FC ECD++ H
Sbjct: 312 PLPLF-----------------------QAGSSTTSGAEVY-ACPRCARSFCYECDVFAH 347

Query: 302 ESLHNCPGCE 311
           + LH C GCE
Sbjct: 348 DLLHTCAGCE 357


>gi|347831479|emb|CCD47176.1| similar to transcription factor Zn, C2H2 [Botryotinia fuckeliana]
          Length = 478

 Score =  263 bits (673), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 141/335 (42%), Positives = 205/335 (61%), Gaps = 32/335 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHI---KAL 58
           D RP+R  +  +    FV E+F+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI   KAL
Sbjct: 122 DMRPTRYLLTIRYASEFVTEYFEQNPISQLGIIGMRDGIAVRISDMSGNPTEHIERLKAL 181

Query: 59  MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
               G  G+ SLQNAL++ +  L   PS+G RE+LI+Y AL + DPGDI ETI      +
Sbjct: 182 RVDQG-QGNPSLQNALEMSRAALFHAPSHGTREILIIYGALLSSDPGDIHETISSLITDR 240

Query: 119 IRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIA 173
           IR S++GL+A++ IC  LC  T      +YSVAL+E HF++L+M    PP     + + +
Sbjct: 241 IRVSIVGLAAQVAICAELCSRTNAGDDTAYSVALNEEHFRQLMMATTTPPVTRTKKQSQS 300

Query: 174 NLIKMGFPQRAGE--GSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
           +L+ MGFP R  +   S+S C+CH ++  G GY C RC+++VC LP +C  CGL L+ S 
Sbjct: 301 SLLMMGFPSRTLDPGKSMSFCACHGKLSRG-GYLCSRCESKVCSLPAECPACGLTLILST 359

Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL--------------LSSGN 277
           HLARSYHHLFP+  + EV    L     +S+  C+GCQ +               L   +
Sbjct: 360 HLARSYHHLFPLRNWAEV----LWKDAGKSKG-CYGCQVTFPQRDEHEKSASEQPLKGMS 414

Query: 278 KPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
           + G Y AC  C  HFC++CD++ HE +HNCPGC+S
Sbjct: 415 ESGRY-ACEVCGNHFCIDCDVFAHEVVHNCPGCQS 448


>gi|45188211|ref|NP_984434.1| ADR338Cp [Ashbya gossypii ATCC 10895]
 gi|44983055|gb|AAS52258.1| ADR338Cp [Ashbya gossypii ATCC 10895]
 gi|374107649|gb|AEY96557.1| FADR338Cp [Ashbya gossypii FDAG1]
          Length = 469

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 145/328 (44%), Positives = 194/328 (59%), Gaps = 28/328 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D RP+R A+  +    FV  FFDQNP+SQ+ +V +++G+A  ++ + G+P+ HI+AL   
Sbjct: 149 DLRPNRHAMTVQYAIDFVHNFFDQNPISQLCIVAMRNGMAQLVSQVSGNPQEHIEALKAV 208

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +PS+  REVLI++ ALS+ DPGDI +TI       I
Sbjct: 209 RKQEPKGNPSLQNALEMARGLLLHVPSHCTREVLIVFGALSSTDPGDIHQTIASLANEHI 268

Query: 120 RCSVIGLSAEMFICKHLCQDT----GGSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  VIGLSA++ ICK LC+ T       Y V L+E+HFK+L  E   P P         L
Sbjct: 269 RTRVIGLSAQVAICKELCKQTNYGDNSYYGVILNETHFKDLFAEAVVPLPVNKMNKGFTL 328

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + S C+CH  +  G GY CP CK++VC LP  C  C L L+ S HLAR
Sbjct: 329 VKMGFPTRIFEDTPSFCTCHSRLVHG-GYFCPNCKSKVCSLPIVCPCCDLMLILSTHLAR 387

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQ-----------QSLLSSGNKPGLYVA 284
           SYHHL P+  F E+ P+ +  P       CF CQ           Q LL+S         
Sbjct: 388 SYHHLMPLKVFQEL-PVDIQFP----TENCFSCQKKFPRLRNYKTQDLLTSSR-----YR 437

Query: 285 CPKCKKHFCLECDIYIHESLHNCPGCES 312
           C  CK  FC++CD+++HE LHNCPGCES
Sbjct: 438 CENCKCDFCIDCDVFVHEILHNCPGCES 465


>gi|145242846|ref|XP_001393996.1| TFIIH basal transcription factor complex p47 subunit [Aspergillus
           niger CBS 513.88]
 gi|134078553|emb|CAK40474.1| unnamed protein product [Aspergillus niger]
 gi|350640268|gb|EHA28621.1| hypothetical protein ASPNIDRAFT_212323 [Aspergillus niger ATCC
           1015]
          Length = 457

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 134/333 (40%), Positives = 204/333 (61%), Gaps = 29/333 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FVREFF+QNP+SQ+G++ +KDG+A  ++D+ G+P  HI A+  +
Sbjct: 108 DLRPTRYLLSLRYAQEFVREFFEQNPISQLGVLGLKDGLAVRVSDMSGNPTEHISAIQSL 167

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQN  ++ +G L   PS+G RE+ +++ +L + DPGDI +TI      KI
Sbjct: 168 RDQDPKGLPSLQNGFEMARGALFHTPSHGTREIFVIFGSLLSSDPGDIHQTITTLINDKI 227

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
           R  ++GL+A++ IC+ +C  T G    +Y VAL+E HF++L+M    PP   ++  + ++
Sbjct: 228 RVGIVGLAAQVAICREICGKTNGGDDTTYGVALNEQHFRDLVMNVTTPPATYSQKQSTSS 287

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E   S+C+CH +   G GY C RC ++VC LP +C  CGL L+ S HLA
Sbjct: 288 LLMMGFPSRTVEAYPSLCACHSKPSRG-GYLCSRCNSKVCGLPAECPSCGLTLILSTHLA 346

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGC----------QQSLLSSGNKPGLYV 283
           RSYHHLFP+  + EV+       ++ SRS+ CF C          +Q   + G   G+ V
Sbjct: 347 RSYHHLFPLMNWVEVS------WQHASRSSVCFACGISFPSVPPKEQWHTTEGQTKGISV 400

Query: 284 A----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           +    C  C+ HFC++CD++ HE +HNCPGC+S
Sbjct: 401 SSRYECTVCRNHFCIDCDLFAHEVVHNCPGCQS 433


>gi|331241057|ref|XP_003333178.1| hypothetical protein PGTG_14725 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|309312168|gb|EFP88759.1| hypothetical protein PGTG_14725 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 462

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 207/345 (60%), Gaps = 44/345 (12%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R+ +     + FV EFFDQNP+SQ+ ++  KD +A  L+ L G+P  H KAL   
Sbjct: 125 DLRPNRLDMSLTYAKEFVTEFFDQNPISQLCILITKDAIAERLSGLSGNPLDHHKALSNK 184

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            KL  SG+ SLQN+L++ +  LS +PS+G REVL+++ +L+TCDP DI +T++  ++ ++
Sbjct: 185 AKLTPSGEPSLQNSLEMARASLSHLPSHGSREVLVIFGSLTTCDPDDINKTLENLEKDRM 244

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAI------- 172
           R S++GL+AE+ ICK +C+ T G Y V L+E HFKEL+ E   PPP IA+          
Sbjct: 245 RVSMVGLAAEVRICKEICKRTQGQYGVILNEHHFKELLFEAISPPP-IAKSTTNGPKTQS 303

Query: 173 ANLIKMGFPQR-------AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGL 225
           ++LI+MGFP +           + S C+CH +++   G+ CPRC ++ CE+PT+C +CGL
Sbjct: 304 SSLIQMGFPNKITHDAHQVDASTHSFCACHSKLQ-STGFICPRCGSKNCEIPTNCSVCGL 362

Query: 226 QLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL----LSSGNKP-- 279
            +VSSPHLARSY HLFP++ + E     L+    +    CFGC + L     S  N    
Sbjct: 363 TIVSSPHLARSYRHLFPVSNWIEKNANSLSHQTFK----CFGCDKELQVNKASQTNDESN 418

Query: 280 --------------GLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
                          LY  CP+CK  FC ECDIY HE L  CPGC
Sbjct: 419 GNSNNATNANNPSINLY-HCPRCKNMFCYECDIY-HEQLGLCPGC 461


>gi|219521045|gb|AAI71919.1| GTF2H2C protein [Homo sapiens]
          Length = 338

 Score =  263 bits (671), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 146/322 (45%), Positives = 203/322 (63%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 18  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 77

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 78  VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 137

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SV GLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 138 RVSVTGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 196

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 197 FPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 256

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E+ PL   +  N  R  C+GCQ  L        +YV C  C
Sbjct: 257 SAPHLARSYHHLFPLDAFQEI-PL---EEYNGER-FCYGCQGEL----KDQHVYV-CAVC 306

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD+++H+SLH CPGC
Sbjct: 307 QNVFCVDCDVFVHDSLHCCPGC 328


>gi|195019313|ref|XP_001984954.1| GH16776 [Drosophila grimshawi]
 gi|193898436|gb|EDV97302.1| GH16776 [Drosophila grimshawi]
          Length = 443

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 138/330 (41%), Positives = 200/330 (60%), Gaps = 24/330 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    K +E F+ EFFDQNP+SQ+G++ +K   A  +T+L G+   H+KAL  M
Sbjct: 78  DLKPTRLLCTLKLLEIFIEEFFDQNPISQLGIIALKAKRAEKITELTGTSRVHLKALAGM 137

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             +  + + SLQN LDL    L  +PS+  RE++I+  +L+TCDP DI  TI + K+ +I
Sbjct: 138 ANVSLTSEPSLQNGLDLALKTLKVVPSHASREIVIIMGSLTTCDPVDINLTIDELKKERI 197

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVI LSAE+ IC++L Q T G++   LD+SH+++ ++    PPPA A+    +LI+MG
Sbjct: 198 RCSVISLSAEIHICRYLTQQTMGTFGAVLDDSHYRDQLLSQVDPPPA-AKTQDNSLIRMG 256

Query: 180 FPQRAGEGS-----ISICSCHKEV------KVGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP    E       +S+C CH E           G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHSKNEVEGKDEPLSMCMCHIENLEEPSGLSTTGHHCPQCNSKYCELPVECQCCGLTLV 316

Query: 229 SSPHLARSYHHLFPIA-----PFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYV 283
           S+PHLARSYHHLFP+      PF+ V P     P   S   C+ C + L    +K     
Sbjct: 317 SAPHLARSYHHLFPVPNFEELPFESVPPAA---PDCGSLRKCYACIKVLNGLTDKS--VF 371

Query: 284 ACPKCKKHFCLECDIYIHESLHNCPGCESL 313
            C  C + FC++CDI+IHE+LH C GC ++
Sbjct: 372 KCGYCNQFFCIDCDIFIHETLHACVGCNTI 401


>gi|156050059|ref|XP_001590991.1| hypothetical protein SS1G_07615 [Sclerotinia sclerotiorum 1980]
 gi|154692017|gb|EDN91755.1| hypothetical protein SS1G_07615 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 479

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 142/335 (42%), Positives = 206/335 (61%), Gaps = 32/335 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHI---KAL 58
           D RP+R  +  +    FV E+F+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI   K+L
Sbjct: 122 DMRPTRYLLTLRYASEFVTEYFEQNPISQLGIIGMRDGIAVRISDMSGNPTEHIERLKSL 181

Query: 59  MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
               G  G+ SLQNAL++ +  L   PS+G RE+LI+Y AL T DPGDI ETI      +
Sbjct: 182 RVDQG-QGNPSLQNALEMSRAALFHAPSHGTREILIIYGALLTSDPGDIHETISSLIADR 240

Query: 119 IRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIA 173
           IR S++GL+A++ IC  +C  T      +YSVAL+E HF++L+M    PP     + + +
Sbjct: 241 IRVSIVGLAAQVAICAEICSRTNAGDDTAYSVALNEEHFRQLMMATTTPPVTRTKKQSHS 300

Query: 174 NLIKMGFPQRA-GEG-SISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
           +L+ MGFP R  G G S+S C+CH ++  G GY C RC ++VC LP +C  CGL L+ S 
Sbjct: 301 SLLMMGFPSRTLGPGKSMSFCACHGKLSRG-GYLCSRCGSKVCSLPAECPACGLTLILST 359

Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL--------------LSSGN 277
           HLARSYHHLFP+  + EV  L  +  + R+   C+GCQ                 L S +
Sbjct: 360 HLARSYHHLFPLRNWVEV--LWKDAGKTRA---CYGCQAPFPRRDEHEKGAPEPGLKSMS 414

Query: 278 KPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
           + G Y AC  C  HFC++CD++ H+ +HNCPGC+S
Sbjct: 415 ESGRY-ACEVCGNHFCIDCDVFAHDVVHNCPGCQS 448


>gi|259481102|tpe|CBF74326.1| TPA: component of the core form of RNA polymerase transcription
           factor TFIIH (Eurofung) [Aspergillus nidulans FGSC A4]
          Length = 455

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 137/337 (40%), Positives = 202/337 (59%), Gaps = 29/337 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FVREFF+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 106 DLRPTRYLLTLRYAQEFVREFFEQNPISQVGVIGLRDGLALRISDMSGNPTEHISAIQDL 165

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQN L++ +G L   PS+G RE+ I++ +L + DPGDI +TI    + KI
Sbjct: 166 RSQDPKGLPSLQNGLEMARGALFHTPSHGTREIFIVFGSLLSSDPGDIHQTITTLIDDKI 225

Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIAEFAIA-N 174
           R  ++GL+A++ IC+ LC  T G     Y VAL+E HF+EL++    PP   ++   A +
Sbjct: 226 RVGIVGLAAQVAICRELCSRTNGGDDTYYGVALNEQHFRELMLAVTTPPATYSQKQSASS 285

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E   S+C+CH     G GY C RC ++VC LP +C  CGL L+ S HLA
Sbjct: 286 LLMMGFPSRTIESFPSLCACHSNPTPG-GYLCSRCNSKVCGLPAECPSCGLTLILSTHLA 344

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC----------QQSLLSSGNKPGLYVA 284
           RSYHHLFP+  + EV+       R    +TCF C          +Q   +     G+ V+
Sbjct: 345 RSYHHLFPLMNWVEVSW-----ERAHRSTTCFACGIAFPTIPPKEQWQATQNLAKGMSVS 399

Query: 285 ----CPKCKKHFCLECDIYIHESLHNCPGCES--LRH 315
               C  C+ HFC++CD++ HE +HNCPGC+S  +RH
Sbjct: 400 SRYECTVCENHFCIDCDLFAHEVVHNCPGCQSRVVRH 436


>gi|400594628|gb|EJP62466.1| TFIIH basal transcription factor complex p47 subunit , putative
           [Beauveria bassiana ARSEF 2860]
          Length = 503

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 149/348 (42%), Positives = 202/348 (58%), Gaps = 45/348 (12%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHI---KAL 58
           D  P+R  +  +   AFVREFF+QNP+SQ+G+V ++DGVA  ++DLGG+P  H+   KAL
Sbjct: 134 DMLPTRHRLTLRYAAAFVREFFEQNPISQLGIVGMRDGVAVRISDLGGNPAEHLERLKAL 193

Query: 59  MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
            G+    G+ SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI +T+      +
Sbjct: 194 EGQ-DPQGNPSLQNALEMCRGALFHAPSHGTREVLIIYGALLSSDPGDIHDTMTNLISDR 252

Query: 119 IRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIA 173
           IR S++GLSA + IC  LC  T       Y+VA+DE HF+EL +    PP     E + A
Sbjct: 253 IRVSIVGLSAHLSICAELCSRTNAGDTSQYNVAMDEVHFRELFLAATTPPITRTQEQSTA 312

Query: 174 NLIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
           +L+ MGFP R  A  G++S C+CH +     G+ C RC +RVC LP +C  CGL L+ S 
Sbjct: 313 SLLMMGFPSRVLAPGGAVSYCACHNK-PCREGFLCTRCGSRVCRLPAECPACGLTLILST 371

Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRS-TCFGCQ--------------------- 269
           HLARSYHHLFP+  + EV P       N ++S  CF CQ                     
Sbjct: 372 HLARSYHHLFPLRNWVEV-PWA-----NAAQSAACFACQCPFSAPPRTVEGGVNGEKHAD 425

Query: 270 -----QSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
                + L + G       AC  C  HFC++CD+Y HE +HNCPGC+S
Sbjct: 426 TQETGRRLAAKGVSESGRYACEVCGNHFCIDCDVYAHEVIHNCPGCQS 473


>gi|169776637|ref|XP_001822785.1| TFIIH basal transcription factor complex p47 subunit [Aspergillus
           oryzae RIB40]
 gi|83771520|dbj|BAE61652.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|391874438|gb|EIT83320.1| RNA polymerase II transcription initiation protein [Aspergillus
           oryzae 3.042]
          Length = 458

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 136/334 (40%), Positives = 199/334 (59%), Gaps = 31/334 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  +  VREFF+QNP+SQ+G++ ++DG+A  ++DL G+P  HI A+  +
Sbjct: 107 DLRPTRYLLTLRYAQELVREFFEQNPISQLGVLGLRDGLAIRISDLSGNPTEHISAIQTL 166

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQN +++ +G L   PS+G RE+ I++ +L + DPGDI +TI      K+
Sbjct: 167 RDQDPKGLPSLQNGIEMARGALFHTPSHGTREIFIIFGSLLSSDPGDIHQTIANLINDKV 226

Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIAEFAIA-N 174
           R  ++GL+A++ IC+ LC  T G     Y VAL+E HF+EL+M+   PP   ++   A +
Sbjct: 227 RVGIVGLAAQVAICRELCAKTNGGDDTRYGVALNEQHFRELLMDVTTPPATYSQKQSASS 286

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E S S+C+CH +   G GY C RC ++VC LP +C  CGL L+ S HLA
Sbjct: 287 LLMMGFPSRTVETSPSLCACHSKSSCG-GYLCSRCNSKVCGLPAECPSCGLTLILSTHLA 345

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRS--RSTCFGC----------QQSLLSSGNKPGLY 282
           RSYHHLFP+  + EV       P  R+   S CF C           Q   +     G+ 
Sbjct: 346 RSYHHLFPLVNWVEV-------PWQRASRSSVCFACGIAFPPVPPKDQWQTTENQAKGMS 398

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           V+    C  C+ HFC++CD++ HE +HNCPGC+S
Sbjct: 399 VSSRYECTVCENHFCIDCDLFAHEVVHNCPGCQS 432


>gi|118344316|ref|NP_001071980.1| zinc finger protein [Ciona intestinalis]
 gi|92081522|dbj|BAE93308.1| zinc finger protein [Ciona intestinalis]
          Length = 395

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 141/323 (43%), Positives = 197/323 (60%), Gaps = 32/323 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +P+R   V K +E F  +FFDQNP+SQ+G V+     A  L+D+ GS + HIK L   
Sbjct: 76  DLKPNRAVAVLKALEIFSEKFFDQNPISQLGFVSTCAKRAEVLSDMSGSLQKHIKVLQD- 134

Query: 62  LG----------CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETI 111
           LG          C+G+ SLQN+L+L    L  +PS+  +E+L++  +L+TCDPG I +TI
Sbjct: 135 LGQKIIKTNAKHCTGEPSLQNSLELCAKSLKHMPSHTSKEILVVMGSLTTCDPGSIHDTI 194

Query: 112 QKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFA 171
            +  +  IRCSVIGLSAE+ ICK+L + T G++ + LDESHF +L+ +H PPP  I+   
Sbjct: 195 TQIAKQSIRCSVIGLSAEVHICKYLAKSTKGTFGIILDESHFADLLTDHVPPPAEIS--P 252

Query: 172 IANLIKMGFPQRAG---EGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
            ++LIKMGFP        G IS+C       VG GY CP+C+A+  ELP  C IC L LV
Sbjct: 253 ESSLIKMGFPHHVSPHESGKISLCMSRNTFTVG-GYFCPQCQAKYSELPVQCVICALTLV 311

Query: 229 SSPHLARSYHHLFPIAPFDEV-TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPK 287
           S+PHLARSYHH FP+  F E   P+           TC+ C + + ++      YV C +
Sbjct: 312 SAPHLARSYHHFFPLPMFKEQEVPV--------GSLTCYSCCKPITTAT-----YV-CEQ 357

Query: 288 CKKHFCLECDIYIHESLHNCPGC 310
           C   FC++C+ +IHE+LH+CPGC
Sbjct: 358 CAISFCMDCNAFIHETLHSCPGC 380


>gi|16740713|gb|AAH16231.1| Gtf2h2 protein [Mus musculus]
 gi|148668487|gb|EDL00806.1| general transcription factor II H, polypeptide 2, isoform CRA_a
           [Mus musculus]
          Length = 398

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 143/324 (44%), Positives = 202/324 (62%), Gaps = 27/324 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKTAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDE+H+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDETHYKELLAHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISIC-----SCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
           FPQ           + S S+      S    + +G GY CP+C+A+ CELP +C+ICGL 
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG-GYFCPQCRAKYCELPVECKICGLT 312

Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
           LVS+PHLARSYHHLFP+  F E++         +    C+GCQ  L        +YV C 
Sbjct: 313 LVSAPHLARSYHHLFPLDAFQEISL-----EEYKGERFCYGCQGEL----KDQHVYV-CT 362

Query: 287 KCKKHFCLECDIYIHESLHNCPGC 310
            C+  FC++CD+++H+SLH CPGC
Sbjct: 363 VCQNVFCVDCDVFVHDSLHCCPGC 386


>gi|225543398|ref|NP_071294.3| general transcription factor IIH subunit 2 [Mus musculus]
 gi|18203119|sp|Q9JIB4.1|TF2H2_MOUSE RecName: Full=General transcription factor IIH subunit 2; AltName:
           Full=Basic transcription factor 2 44 kDa subunit;
           Short=BTF2 p44; AltName: Full=General transcription
           factor IIH polypeptide 2; AltName: Full=TFIIH basal
           transcription factor complex p44 subunit
 gi|9082152|gb|AAF82753.1| general transcription factor IIH polypeptide 2 [Mus musculus]
          Length = 396

 Score =  261 bits (668), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 143/324 (44%), Positives = 202/324 (62%), Gaps = 27/324 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKTAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDE+H+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDETHYKELLAHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISIC-----SCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
           FPQ           + S S+      S    + +G GY CP+C+A+ CELP +C+ICGL 
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG-GYFCPQCRAKYCELPVECKICGLT 312

Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
           LVS+PHLARSYHHLFP+  F E++         +    C+GCQ  L        +YV C 
Sbjct: 313 LVSAPHLARSYHHLFPLDAFQEISL-----EEYKGERFCYGCQGEL----KDQHVYV-CT 362

Query: 287 KCKKHFCLECDIYIHESLHNCPGC 310
            C+  FC++CD+++H+SLH CPGC
Sbjct: 363 VCQNVFCVDCDVFVHDSLHCCPGC 386


>gi|126320570|ref|XP_001362699.1| PREDICTED: general transcription factor IIH subunit 2-like
           [Monodelphis domestica]
          Length = 395

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 148/330 (44%), Positives = 202/330 (61%), Gaps = 27/330 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P+ H+ +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPKKHVASLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L L    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSLAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKSLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGVYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+  ELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSSTEPGLTLGGYFCPQCRAKYSELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F EV+    N  R      C GC    L       +Y+ C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEVSLEEYNGER-----FCQGC----LGELKDQHVYI-CTVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGCESLRHSNP 318
           +  FC++CD++IHESLH CPGC    H NP
Sbjct: 364 QNVFCVDCDLFIHESLHCCPGC---IHKNP 390


>gi|390459877|ref|XP_002744933.2| PREDICTED: general transcription factor IIH subunit 2 isoform 1
           [Callithrix jacchus]
          Length = 404

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 146/331 (44%), Positives = 204/331 (61%), Gaps = 33/331 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  H+ +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHVTSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTALARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRIC----- 223
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C+IC     
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGKKPS 313

Query: 224 ----GLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP 279
               GL LVS+PHLARSYHHLFP+ PF E+    L +  N  R  C+GCQ  L       
Sbjct: 314 IHYSGLTLVSAPHLARSYHHLFPLDPFQEI----LLEEYNGER-FCYGCQGEL----KDQ 364

Query: 280 GLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
            +YV C  C+  FC++CD+++H+SLH CPGC
Sbjct: 365 HVYV-CAVCQNVFCVDCDVFVHDSLHCCPGC 394


>gi|388580345|gb|EIM20660.1| TFIIH basal transcription factor complex, subunit SSL1 [Wallemia
           sebi CBS 633.66]
          Length = 444

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 130/342 (38%), Positives = 209/342 (61%), Gaps = 37/342 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  +  +    +V+E+FDQNP+ Q+G + ++ GVA  +  + GS    +K+L  K
Sbjct: 108 DMRPNRFLLSLEYAREYVKEYFDQNPIGQMGAIGMRSGVAEWICKMSGSQHDLVKSLQNK 167

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             L  +G+ SLQNAL++ +  ++ +P++  RE+++++ +L+TCDPG+I +T++     KI
Sbjct: 168 NKLEPNGEPSLQNALEMARASMAHLPTHASREIVVVFGSLTTCDPGNIHDTLRALIRDKI 227

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAI------- 172
           R ++I L+AE+ I + + + TGG+ SVALDE H+K+++ E  PPP      A+       
Sbjct: 228 RVNIISLAAEVRILREVAEKTGGTLSVALDEGHYKDVLFETVPPPAVHTAKALNTVEQGG 287

Query: 173 -----ANLIKMGFPQR-AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
                ++L++MGF  R      +++C+CH E++   GY CPRC +++C++PTDC +C L 
Sbjct: 288 GSVDESDLMQMGFAVRLPYTAPLTLCACHSELRRQ-GYICPRCGSKLCDIPTDCAVCDLV 346

Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQ---------------- 270
           +VSSPHLARSYHHLFP+  +  V P  + +    S + CFGC+Q                
Sbjct: 347 VVSSPHLARSYHHLFPVPDWAVVNPQAITET---SDTRCFGCRQVFPPVAQSMTLPTPTA 403

Query: 271 -SLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCE 311
            ++ ++ +  G Y  CPKC   FC ECDIY HE+LH CPGC+
Sbjct: 404 ATVSANISATGRY-RCPKCMHDFCSECDIYCHETLHVCPGCQ 444


>gi|453087906|gb|EMF15947.1| TFIIH basal transcription factor complex, subunit SSL1
           [Mycosphaerella populorum SO2202]
          Length = 460

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 136/334 (40%), Positives = 194/334 (58%), Gaps = 37/334 (11%)

Query: 15  VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKLGCSGDSSLQN 72
           V AF+RE+F+QNP+SQ+G++ +++G+A  ++D+ G+P  HI A+  +      G+ SLQN
Sbjct: 115 VIAFIREYFEQNPISQLGILGMREGLAIRVSDMSGNPNDHINAIKALRSTDPKGNPSLQN 174

Query: 73  ALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFI 132
           AL++ +  L   PS+G REV I+  AL T DPGDI +TI+ C + KIR ++IGL+A+MFI
Sbjct: 175 ALEMARAALYHTPSHGTREVAIILGALHTSDPGDIHDTIKACIKDKIRVNIIGLAAQMFI 234

Query: 133 CKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIAEFAIAN------LIKMGFPQ 182
           C  +C+ T       Y+VA+DE H++EL+M    PP   A    A       L+ MGFP 
Sbjct: 235 CAEICRKTNQGDTNCYNVAVDEVHYRELLMSITTPPVVRASDVEAQKRNQAALLMMGFPS 294

Query: 183 RAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFP 242
           R  E   ++C+CH  +  G GY C RCKA+VC LP  C  C L L+ S HLARSYHHLFP
Sbjct: 295 RIVEEKATLCACHGNLTRG-GYLCSRCKAKVCNLPATCPTCELTLILSTHLARSYHHLFP 353

Query: 243 IAPFDEVTPLCLNDPRNRSRSTCFGCQQSL---------------------LSSGNKPGL 281
           +  +DEV+       + +    CFGCQ                         + G     
Sbjct: 354 LQNWDEVS---WKRAQQKRSVQCFGCQSPFPALSGGIDGVDGSANGHSRPKRAEGASESS 410

Query: 282 YVACPKCKKHFCLECDIYIHESLHNCPGCESLRH 315
              C  C++HFC++CD++ HE +HNCPGC+S  H
Sbjct: 411 RYECSTCQQHFCIDCDVFCHEVVHNCPGCQSNAH 444


>gi|345304797|ref|XP_001512459.2| PREDICTED: general transcription factor IIH subunit 2-like
           [Ornithorhynchus anatinus]
          Length = 395

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 145/330 (43%), Positives = 203/330 (61%), Gaps = 27/330 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  H  AL   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHTTALKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L L    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCLGEPSLYNSLSLAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKSLKTAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R S+IGLSAE+ +C  L ++TGG+Y V LDESH+KEL++ H  PPPA +     +LI+MG
Sbjct: 195 RVSIIGLSAEVRVCTVLARETGGTYHVILDESHYKELLIHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+ CELP +C++CGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSSSEPGLTLGGYFCPQCRAKYCELPVECKVCGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E++    +  R      C GCQ  L        +Y+ C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEISLEEYHGER-----VCQGCQGEL----KDQQVYI-CTVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGCESLRHSNP 318
           +  FC+ECD+++H+ LH CPGC    H NP
Sbjct: 364 QSVFCVECDLFVHDFLHCCPGC---IHKNP 390


>gi|354488323|ref|XP_003506320.1| PREDICTED: general transcription factor IIH subunit 2-like
           [Cricetulus griseus]
 gi|344248232|gb|EGW04336.1| General transcription factor IIH subunit 2 [Cricetulus griseus]
          Length = 395

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 141/322 (43%), Positives = 200/322 (62%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C+G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCNGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDE+H+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDETHYKELLTHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+  ELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDNSTEPGLTLGGYFCPQCRAKYSELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F E++         +    C+GCQ  L        +YV C  C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEISL-----EEYKGERFCYGCQGEL----KDQHVYV-CTVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
           +  FC++CD++IH+ LH CPGC
Sbjct: 364 QNVFCVDCDVFIHDCLHCCPGC 385


>gi|378727991|gb|EHY54450.1| transcription initiation factor TFIIH subunit H2, variant
           [Exophiala dermatitidis NIH/UT8656]
 gi|378727992|gb|EHY54451.1| transcription initiation factor TFIIH subunit H2 [Exophiala
           dermatitidis NIH/UT8656]
          Length = 464

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 137/330 (41%), Positives = 199/330 (60%), Gaps = 24/330 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +     E+F+REFF+QNP+SQ+G+V ++DG+A  ++D+ G+P +H+ A+  +
Sbjct: 100 DLRPTRYLLTLMYTESFIREFFEQNPISQLGIVGMRDGIAVRVSDMSGNPTAHLAAIQKI 159

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G  SL+NAL++ +  L   PS+G REVL+++ AL T DPGDI  TI      KI
Sbjct: 160 RKQEPKGQPSLENALEMARAALFHAPSHGTREVLLIFGALHTSDPGDIHRTIDNLVTDKI 219

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIME-HAPPPPAIAEFAIAN 174
           R +VIGL+A++ +C  L   T      +Y V L E H++EL+M    PP  + +  + ++
Sbjct: 220 RATVIGLAAQVAVCAELVSKTNNGDLSNYGVVLHEQHYRELLMAVTTPPVTSESTKSASS 279

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E   S+C+CH       GY C RC  +VC LP++C +CGL L+ S HLA
Sbjct: 280 LLMMGFPSRTVEDYPSLCACHTNPSRD-GYLCSRCSTKVCGLPSECPVCGLTLILSTHLA 338

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC----------QQSLLSSGNKPGLYVA 284
           RSYHHLFP+  + EV P        RS++ CF C          +Q   +     G Y  
Sbjct: 339 RSYHHLFPLVNWLEV-PW---SQAYRSKA-CFACLKQFPVVPPREQWTAAKTTTSGRY-E 392

Query: 285 CPKCKKHFCLECDIYIHESLHNCPGCESLR 314
           CP C  HFC++CD++ HE +HNCPGC+S+R
Sbjct: 393 CPVCHHHFCIDCDLFAHEVVHNCPGCQSVR 422


>gi|194752572|ref|XP_001958595.1| GF11005 [Drosophila ananassae]
 gi|190625877|gb|EDV41401.1| GF11005 [Drosophila ananassae]
          Length = 422

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 136/329 (41%), Positives = 199/329 (60%), Gaps = 21/329 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +P+R+    K +E F+ EFFDQNP+SQ+G++ +K   A  +T+L G+   H+KAL   
Sbjct: 78  DLKPTRLRCTIKLLEQFIEEFFDQNPISQLGIIALKAKRAEKVTELTGTSRVHLKALESL 137

Query: 62  LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
           +  S   + SLQN LDL    L  +PS+  RE++++  +L+TCDP DI  TI + K+  I
Sbjct: 138 INVSLTSEPSLQNGLDLALKTLKVVPSHASREIVVIMGSLTTCDPVDINLTIDELKKEGI 197

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVI LSAE+ + ++L Q T G++   LD++HF++ +M    PPPA A+    +LI+MG
Sbjct: 198 RCSVISLSAEIHVARYLTQQTLGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256

Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP    E       +S+C CH E      V    G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSVLCTTGHHCPQCNSKYCELPVECQSCGLTLV 316

Query: 229 SSPHLARSYHHLFPIAPFDEVT----PLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA 284
           S+PHLARSYHHL P+  F+EVT    P   +D  N     C+ C + L    +K      
Sbjct: 317 SAPHLARSYHHLIPVPNFEEVTLDSMPTVSSD-YNGGVKECYACAKVLTFVADKS--IFK 373

Query: 285 CPKCKKHFCLECDIYIHESLHNCPGCESL 313
           C  C++ FC++CD +IHE+LH C GC ++
Sbjct: 374 CGFCQQFFCIDCDAFIHETLHACVGCNTI 402


>gi|195127781|ref|XP_002008346.1| GI13436 [Drosophila mojavensis]
 gi|193919955|gb|EDW18822.1| GI13436 [Drosophila mojavensis]
          Length = 461

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 134/327 (40%), Positives = 200/327 (61%), Gaps = 18/327 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    K +E F+ EFFDQNP+SQ+G++ +K   A  +T+L G+   H+KAL  +
Sbjct: 78  DLKPTRLLCTLKLLEIFIEEFFDQNPISQLGIIALKAKRAEKITELTGTSRVHLKALASL 137

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             +  + + SLQN LDL    L  +PS+  RE++I+  +L+TCDP DI  TI + K  +I
Sbjct: 138 ANVPLTSEPSLQNGLDLALKTLKVVPSHASREIVIIMGSLTTCDPVDINVTIDELKRERI 197

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVI LSAE+ IC++L Q T G++   LD++H+++ ++    PPPA A+    +LI+MG
Sbjct: 198 RCSVISLSAEIHICRYLTQQTMGTFGAVLDDAHYRDQLLSQVDPPPA-AKTQDNSLIRMG 256

Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP    E       +S+C CH E           G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHSKNEVEGKDAPLSMCMCHIENLDEPSELSTTGHHCPQCNSKYCELPVECQCCGLTLV 316

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCL--NDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
           S+PHLARSYHHLFP+  F+E++   +    P   S   C+ C + L    +K      C 
Sbjct: 317 SAPHLARSYHHLFPVPNFEELSYDMVPPASPDCGSLRKCYACIKVLNPVTDKS--VFKCG 374

Query: 287 KCKKHFCLECDIYIHESLHNCPGCESL 313
            C + FC++CDI+IHE+LH C GC ++
Sbjct: 375 YCNQFFCIDCDIFIHETLHACVGCNTI 401


>gi|213402509|ref|XP_002172027.1| transcription factor TFIIH complex subunit Ssl1
           [Schizosaccharomyces japonicus yFS275]
 gi|212000074|gb|EEB05734.1| transcription factor TFIIH complex subunit Ssl1
           [Schizosaccharomyces japonicus yFS275]
          Length = 422

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 136/322 (42%), Positives = 193/322 (59%), Gaps = 17/322 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DFR  R+ +  K    F+  +F+QNP+SQ+G++ +KDG+A  ++D+ G+P+ HI  L   
Sbjct: 104 DFRTKRVDLQIKYGTEFIINYFEQNPISQLGIIAMKDGIAVKISDIHGNPQDHINKLKKL 163

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
             C G +SLQNAL++ +  L+ I S+G REVLI++ +L + DPGDI +TI    +  I  
Sbjct: 164 RECKGMASLQNALEMARASLAHIASHGTREVLIIFGSLLSSDPGDIFQTIDSLVKENISV 223

Query: 122 SVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAI-ANLI 176
            +IGLSAE+ +CK +C+ T      +Y V L+E+H +EL+ME   PP   ++  I A+L+
Sbjct: 224 HIIGLSAEVSVCKEICRRTNNGAQNAYGVILNENHLEELLMEKTIPPATHSDEGIPASLV 283

Query: 177 KMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
           KMGFP +  E   S+CSCH     G G+ CPRCK +VC LP +C  C L L+ S HLARS
Sbjct: 284 KMGFPSKVIEPFPSLCSCHSVASRG-GFHCPRCKCKVCTLPIECPGCSLILILSTHLARS 342

Query: 237 YHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL------LSSGNKPGLYVACPKCKK 290
           YHHLFP+  + E+ P         + + CF CQ             N      ACP C  
Sbjct: 343 YHHLFPLKNWREI-PW----SEKPTSTYCFACQAPFPMTPQAKEDQNASSSRYACPSCGH 397

Query: 291 HFCLECDIYIHESLHNCPGCES 312
           HFC+ECD++ HE LH C GC+S
Sbjct: 398 HFCIECDVFAHEQLHECFGCQS 419


>gi|410925308|ref|XP_003976123.1| PREDICTED: general transcription factor IIH subunit 2-like
           [Takifugu rubripes]
          Length = 392

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 144/322 (44%), Positives = 192/322 (59%), Gaps = 24/322 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +P+R+    K +EAFV E+FDQNP+SQ+GL+T K+  A  LTDL G+P+ H  AL   
Sbjct: 75  DLKPNRLTSTLKLLEAFVDEYFDQNPISQMGLITTKNKRAEKLTDLSGNPKKHCAALKKA 134

Query: 62  LG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
           +   C G+ SL N L L    L  +P +  RE+LI+ S+L+TCDP +I E I+  K  K+
Sbjct: 135 VDTVCVGEPSLYNCLSLAVQTLRHMPGHTSRELLIILSSLTTCDPANIYELIETLKTLKV 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGGSY V LDESHFKEL+M H  PPPA +  +  +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLTRETGGSYHVILDESHFKELLMLHVKPPPA-SSSSECSLIRMG 253

Query: 180 FPQRA------GEGSISICSCHKEVKVGV-----GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ         +   S    H +   G      GY CP+C A+  ELP +C++CGL LV
Sbjct: 254 FPQHTVASLTDQDAKPSFSMAHLDSSSGPDLSLGGYFCPQCHAKYTELPVECKVCGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
            +PHLARS+HHLFP+  F E     L   R     +C  C   L     K     +CP C
Sbjct: 314 LAPHLARSFHHLFPLQVFTESPADDLQTER-----SCQACHGEL-----KDKSVFSCPSC 363

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
              FC++C ++IH+SLH CP C
Sbjct: 364 LSVFCVDCHLFIHDSLHCCPCC 385


>gi|195160565|ref|XP_002021146.1| GL24973 [Drosophila persimilis]
 gi|194118259|gb|EDW40302.1| GL24973 [Drosophila persimilis]
          Length = 445

 Score =  259 bits (661), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 136/331 (41%), Positives = 200/331 (60%), Gaps = 22/331 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E F+ EFFDQNP+SQ+G++ +K   A  +T+L G+   H+KAL G 
Sbjct: 78  DLKPTRLRCTIKLLEQFIEEFFDQNPISQMGIIALKAKRAEKITELTGTSRVHLKALEGL 137

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             +  + + SLQN LDL    L  +PS+  RE++I+  +L+TCDP DI  TI + K+  I
Sbjct: 138 ANVPLTSEPSLQNGLDLALKTLKVVPSHASREIVIVMGSLTTCDPVDINLTIDELKKEGI 197

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVI LSAE+ I ++L Q T G++   LD++HF++ +M    PPPA A+    +LI+MG
Sbjct: 198 RCSVISLSAEIHIARYLTQQTRGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256

Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP    E       +S+C CH E           G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSELCTTGHHCPQCNSKYCELPVECQSCGLTLV 316

Query: 229 SSPHLARSYHHLFPIAPFDEV----TPLCLND--PRNRSRSTCFGCQQSLLSSGNKPGLY 282
           S+PHLARSYHHLFP+  F+E+     P   +D        + C+ C +  +   +K    
Sbjct: 317 SAPHLARSYHHLFPVPNFEEIPFDSLPKASSDRGGTGTGTTECYACMKLFVPVADK--TV 374

Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGCESL 313
             C  CK+ FCL+CDI+IH++LH C GC ++
Sbjct: 375 FKCGFCKQFFCLDCDIFIHDTLHACVGCNTI 405


>gi|146324050|ref|XP_001481497.1| RNA polymerase TFIIH complex subunit Ssl1 [Aspergillus fumigatus
           Af293]
 gi|129556348|gb|EBA27248.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Aspergillus
           fumigatus Af293]
 gi|159126166|gb|EDP51282.1| transcription factor and DNA repair complex, core TFIIH, putative
           [Aspergillus fumigatus A1163]
          Length = 456

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 139/334 (41%), Positives = 200/334 (59%), Gaps = 31/334 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FVREFF+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 107 DLRPTRYLLSLRYAQEFVREFFEQNPISQLGVLGLRDGLAIRVSDMSGNPTEHISAIQAL 166

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQN L++ +G L   PS+G REV +++ ++ + DPGDI +TI      KI
Sbjct: 167 RDHDPKGLPSLQNGLEMARGALFHTPSHGTREVFVIFGSILSSDPGDIHQTITTLINDKI 226

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
              + GL+A++ IC+ LC  T      +Y VAL+E HF+EL+M+   PP   ++  A ++
Sbjct: 227 SVRIAGLAAQVAICRELCSRTNAGDDTTYGVALNEQHFRELMMDVTTPPVTYSQKKATSS 286

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E   S+C+CH E   G GY C RC ++VC LP +C  CGL L+ S HLA
Sbjct: 287 LLMMGFPSRTVETCPSLCACHSEPSCG-GYLCSRCNSKVCGLPAECPSCGLTLILSTHLA 345

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNR-SRS-TCFGC----------QQSLLSSGNKPGLY 282
           RSYHHLFP+  + EV       P  R SRS TCF C           Q   +     G+ 
Sbjct: 346 RSYHHLFPLMNWIEV-------PWQRASRSLTCFACGIYFPTVPPKDQWQATESQAKGMS 398

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           V+    C  CK HFC++CD++ HE +HNCPGC+S
Sbjct: 399 VSSRYECTVCKNHFCIDCDLFAHEIVHNCPGCQS 432


>gi|408397644|gb|EKJ76784.1| hypothetical protein FPSE_02970 [Fusarium pseudograminearum CS3096]
          Length = 469

 Score =  258 bits (659), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 145/342 (42%), Positives = 200/342 (58%), Gaps = 37/342 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D  P+R  ++     AFVREFF+QNP+SQ+G++ ++DGVA  ++D+GG+P  H+  L G 
Sbjct: 108 DLLPTRYRLMLSYAVAFVREFFEQNPISQLGIIGMRDGVAVRVSDVGGNPAEHLDKLKGL 167

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G+ SLQNAL++ +G L   PS+G REV I+Y AL + DPGDI ETI      +I
Sbjct: 168 ESQDPQGNPSLQNALEMCRGALFHAPSHGTREVFIIYGALLSSDPGDIHETIGNLITDRI 227

Query: 120 RCSVIGLSAEMFICKHLCQDT----GGSYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
           R S++GLSA + IC  LC  T    G  Y++A+DE HF+EL +    PP     E + A+
Sbjct: 228 RVSIVGLSAHLAICADLCSRTNAGDGSQYNIAMDEVHFRELFLAATTPPVTRTVEQSTAS 287

Query: 175 LIKMGFPQRA--GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
           L+ MGFP R     G+ S C+CH       GY C RC ARVC +P++C  C L L+ S H
Sbjct: 288 LLMMGFPSRTLVPNGTTSYCACHNR-PCREGYLCTRCGARVCRIPSECPSCDLTLILSTH 346

Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-------------QQSLLSSG--- 276
           LARSYHHLFP+  + EV P      +  + + CF C              +S   SG   
Sbjct: 347 LARSYHHLFPLRNWIEV-PWA----KATTSAGCFSCLAPFPEPPKNKGHDKSKEDSGAPK 401

Query: 277 -----NKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESL 313
                ++ G Y AC  C +HFC++CD++ HE +HNCPGC+SL
Sbjct: 402 TAKGVSESGRY-ACEVCGQHFCIDCDVFAHEIVHNCPGCQSL 442


>gi|345568422|gb|EGX51316.1| hypothetical protein AOL_s00054g386 [Arthrobotrys oligospora ATCC
           24927]
          Length = 542

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 141/362 (38%), Positives = 201/362 (55%), Gaps = 51/362 (14%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  +  +    FV E+F+QNP+SQ+ ++ + DG+A  ++D+GG+P  H+ AL  K
Sbjct: 124 DLRPTRFLLTLRYARDFVTEYFEQNPISQLSIIGMHDGLALRISDMGGNPADHLTALQEK 183

Query: 62  ---LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
              +   G SSLQNAL++ +  L   P++G RE+L+++ ALS+ DPGDI +TI    E +
Sbjct: 184 AFTVEPKGASSLQNALEMGRAELFHTPTHGTREILLIFGALSSSDPGDIHDTINSLIEDR 243

Query: 119 IRCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPP-PAIAEFAIA 173
           I+  ++GL+A++ +C+ L   T      +Y V LDE H++EL+M    PP     E ++ 
Sbjct: 244 IQVRIVGLAAQVAVCRELVMKTNDGDDSAYGVILDEGHYRELMMAATTPPVTKTEEKSVP 303

Query: 174 NLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
           +L+ MGFP R  E   S+CSCH  +  G GY CPRC A+VC LPT+C  CGL L+ S HL
Sbjct: 304 SLLMMGFPSRTVEKHESLCSCHSNLTKG-GYICPRCTAKVCFLPTECPCCGLTLILSTHL 362

Query: 234 ARSYHHLFPIAPFDEV---------------TPLCLNDPRNRS----------------R 262
           ARSYHHLFP+  + EV               +P   N  +  S                 
Sbjct: 363 ARSYHHLFPLKNWVEVPWSEAHTSTHCWSCQSPFPTNQSQTSSIKIKIRNPSTTAATATA 422

Query: 263 STCFGCQQSLLSSGNK-----------PGLYVACPKCKKHFCLECDIYIHESLHNCPGCE 311
           S   G   +  +  N            P    AC  CK HFC++CDI+ HESLHNCPGC+
Sbjct: 423 SRSNGATTTSTAPKNDHQQQQQQPPQPPTQRYACTDCKTHFCIDCDIFTHESLHNCPGCQ 482

Query: 312 SL 313
           SL
Sbjct: 483 SL 484


>gi|198464821|ref|XP_001353376.2| GA10767 [Drosophila pseudoobscura pseudoobscura]
 gi|198149896|gb|EAL30883.2| GA10767 [Drosophila pseudoobscura pseudoobscura]
          Length = 445

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 135/331 (40%), Positives = 200/331 (60%), Gaps = 22/331 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K ++ F+ EFFDQNP+SQ+G++ +K   A  +T+L G+   H+KAL G 
Sbjct: 78  DLKPTRLRCTIKLLDQFIEEFFDQNPISQMGIIALKAKRAEKITELTGTSRVHLKALEGL 137

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             +  + + SLQN LDL    L  +PS+  RE++I+  +L+TCDP DI  TI + K+  I
Sbjct: 138 ANVPLTSEPSLQNGLDLALKTLKVVPSHASREIVIVMGSLTTCDPVDINLTIDELKKEGI 197

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVI LSAE+ I ++L Q T G++   LD++HF++ +M    PPPA A+    +LI+MG
Sbjct: 198 RCSVISLSAEIHIARYLTQQTRGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256

Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
           FP    E       +S+C CH E           G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSELCTTGHHCPQCNSKYCELPVECQSCGLTLV 316

Query: 229 SSPHLARSYHHLFPIAPFDEV----TPLCLND--PRNRSRSTCFGCQQSLLSSGNKPGLY 282
           S+PHLARSYHHLFP+  F+E+     P   +D        + C+ C +  +   +K    
Sbjct: 317 SAPHLARSYHHLFPVPNFEEIPFDSLPKASSDRGGTGTGTTECYACMKLFVPVADK--TV 374

Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGCESL 313
             C  CK+ FCL+CDI+IH++LH C GC ++
Sbjct: 375 FKCGFCKQFFCLDCDIFIHDTLHACVGCNTI 405


>gi|395517618|ref|XP_003762972.1| PREDICTED: general transcription factor IIH subunit 2 [Sarcophilus
           harrisii]
          Length = 395

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 146/330 (44%), Positives = 200/330 (60%), Gaps = 27/330 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+   HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNARKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L L    L  +P +  REVLI++ +L+TCDP +I + I+  K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSLAMQTLKHMPGHTSREVLIIFGSLTTCDPSNIYDLIKSLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +     +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253

Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ           + S S+       + G+   GY CP+C+A+  ELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSSTEPGLTLGGYFCPQCRAKYSELPVECKICGLTLV 313

Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           S+PHLARSYHHLFP+  F EV+    N  R      C GC    L       +Y+ C  C
Sbjct: 314 SAPHLARSYHHLFPLEAFQEVSLEEYNGER-----FCQGC----LGELKDQHVYI-CTVC 363

Query: 289 KKHFCLECDIYIHESLHNCPGCESLRHSNP 318
           +  FC++CD++IH+SLH CPGC    H NP
Sbjct: 364 QNVFCVDCDLFIHDSLHCCPGC---IHKNP 390


>gi|452824299|gb|EME31303.1| transcription initiation factor TFIIH subunit H2 isoform 1
           [Galdieria sulphuraria]
          Length = 398

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 137/325 (42%), Positives = 199/325 (61%), Gaps = 14/325 (4%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +PSR  V     + F+  +F++NP+SQ+ +V ++DGVA  L+ +G +P  H + +   
Sbjct: 78  DVKPSRGEVCLSCAQKFLYSYFNENPISQLAVVVLRDGVAEKLSSMGSNPRQHSEVVKNA 137

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            + G  G+ SLQN LD+   LL  IPSYG REVLILY+++S+CDPGDI +TI+K ++ +I
Sbjct: 138 NQKGFYGNCSLQNGLDVALSLLHSIPSYGSREVLILYNSISSCDPGDIRQTIEKLEKERI 197

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RC+VIG++AE++I K+L   T GSY V ++ESH  EL+ +   P   I       L++MG
Sbjct: 198 RCNVIGMAAELYILKYLAARTHGSYFVCMNESHLLELLEDFVVPSALIENNTKTALVRMG 257

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP         +C   K ++  V + CPRC+    E+P +C +CGL LVSS  LARSYHH
Sbjct: 258 FPTLKAYKEPKVCLNDKVLRNQV-FVCPRCECCYGEIPIECVLCGLILVSSSQLARSYHH 316

Query: 240 LFPIAPFDEVTPLCLNDPRNRSR---STCFGCQQSLLSSGNKPGLYVACPKCKKHFCLEC 296
           LFP+A F E+      D   +S    + CF C+ +    G++   Y  CP C+  FC  C
Sbjct: 317 LFPVANFHELE----EDKYKQSHFQPNYCFACKGTFPFEGSE---YFVCPNCQNIFCGIC 369

Query: 297 DIYIHESLHNCPGCESLRHSNPIVA 321
           DI+IH+ L+NCPGCE +R  N I +
Sbjct: 370 DIFIHDGLNNCPGCE-IRPENRIAS 393


>gi|58271290|ref|XP_572801.1| RNA polymerase II transcription factor [Cryptococcus neoformans
           var. neoformans JEC21]
 gi|57229060|gb|AAW45494.1| RNA polymerase II transcription factor, putative [Cryptococcus
           neoformans var. neoformans JEC21]
          Length = 460

 Score =  257 bits (656), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 137/349 (39%), Positives = 195/349 (55%), Gaps = 45/349 (12%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D+RP+R  V+   +  +V E+FDQNPL QIG++ ++D ++  L  +GGSPE  ++AL  K
Sbjct: 116 DYRPTRFEVILGYLRTYVVEWFDQNPLGQIGVIAMRDRLSEVLIPMGGSPEEIVRALSDK 175

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPG---DIMETIQKCKE 116
             L  SG+ SLQN L + +G ++ +PS    E+L+++SA+ST DP     I   +     
Sbjct: 176 RKLEPSGEPSLQNGLVMAKGGMAHLPSTSSLEILVIFSAISTADPDGPITIHNVLDTLAT 235

Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIAN-- 174
             IR S++ LS E+ ICK + + TGG + VALD+ H K+L+ E  PPP        AN  
Sbjct: 236 GHIRTSILSLSGEIKICKQIAERTGGKFGVALDQEHLKDLLWETIPPPATTIAPVTANVR 295

Query: 175 ---------------------LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARV 213
                                L+ MGFP R   G  ++C+CH  ++ G GY CPRC +++
Sbjct: 296 SALAAGGRGPNQTGERAPAGDLMVMGFPIRLPLGGETMCACHGLLRKG-GYLCPRCGSKL 354

Query: 214 CELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL- 272
           C++PTDC +CGL +VSSPHLARS+  LFP+A +    PL + D    S  TCFGC     
Sbjct: 355 CDVPTDCEVCGLMVVSSPHLARSFWFLFPVANYG---PLSIEDVVESSE-TCFGCDSEFS 410

Query: 273 -----------LSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
                      +  G  P     C KCK  FC +CD+YIH++LH CPGC
Sbjct: 411 DATAINAGVAQVEDGVSPTGRYRCAKCKHDFCADCDLYIHDTLHTCPGC 459


>gi|195440790|ref|XP_002068223.1| GK12882 [Drosophila willistoni]
 gi|194164308|gb|EDW79209.1| GK12882 [Drosophila willistoni]
          Length = 454

 Score =  256 bits (655), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 133/325 (40%), Positives = 198/325 (60%), Gaps = 22/325 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    K +E F+ EFFDQNP+SQ+G++ +K   A  +T+L G+   H+KAL  +
Sbjct: 75  DLKPTRLRCTLKLLEIFIDEFFDQNPISQLGIIALKAKRAEKITELTGTSRVHLKALESL 134

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             +  + + SLQN LD+    L  +PS+  RE++I+  +L+TCDP DI  TI + K+  I
Sbjct: 135 ANVVLTSEPSLQNGLDMALKTLKVVPSHASREIVIIMGSLTTCDPVDINLTINELKKEGI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVI LSAE+ + ++L ++T GSY   LD++HF++ ++    PPPA A+    +LI+MG
Sbjct: 195 RCSVISLSAEIHVARYLAKETMGSYGAVLDDAHFRDQLLSQVDPPPA-AKTQHNSLIRMG 253

Query: 180 FPQRAG------EGSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQL 227
           FP          +  +S+C CH E           G+ CP+C ++ CELP +C+ C L L
Sbjct: 254 FPHSTSNEADGKDAPLSMCMCHIENVDEPSELSTTGHYCPQCHSKYCELPVECQSCDLTL 313

Query: 228 VSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPK 287
           VS+PHLARSYHH+FP+  F+E+    L  P     + CF C + L +  +K      C  
Sbjct: 314 VSAPHLARSYHHIFPVPNFEEMPHDQLPGP-----TECFSCAKLLATVADK--TVYKCIH 366

Query: 288 CKKHFCLECDIYIHESLHNCPGCES 312
           C + FCL+CDI+IHE+LH C GC S
Sbjct: 367 CGEVFCLDCDIFIHETLHACVGCNS 391


>gi|58271288|ref|XP_572800.1| RNA polymerase II transcription factor [Cryptococcus neoformans
           var. neoformans JEC21]
 gi|134114810|ref|XP_773703.1| hypothetical protein CNBH1580 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50256331|gb|EAL19056.1| hypothetical protein CNBH1580 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57229059|gb|AAW45493.1| RNA polymerase II transcription factor, putative [Cryptococcus
           neoformans var. neoformans JEC21]
          Length = 460

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 136/349 (38%), Positives = 195/349 (55%), Gaps = 45/349 (12%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D+RP+R  V+   +  +V E+FDQNPL QIG++ ++D ++  L  +GG+PE  ++AL  K
Sbjct: 116 DYRPTRFEVILGYLRTYVVEWFDQNPLGQIGVIAMRDRLSEVLIPMGGNPEEIVRALSDK 175

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPG---DIMETIQKCKE 116
             L  SG+ SLQN L + +G ++ +PS    E+L+++SA+ST DP     I   +     
Sbjct: 176 RKLEPSGEPSLQNGLVMAKGGMAHLPSTSSLEILVIFSAISTADPDGPITIHNVLDTLAT 235

Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIAN-- 174
             IR S++ LS E+ ICK + + TGG + VALD+ H K+L+ E  PPP        AN  
Sbjct: 236 GHIRTSILSLSGEIKICKQIAERTGGKFGVALDQEHLKDLLWETIPPPATTIAPVTANVR 295

Query: 175 ---------------------LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARV 213
                                L+ MGFP R   G  ++C+CH  ++ G GY CPRC +++
Sbjct: 296 SALAAGGRGPNQTGERAPAGDLMVMGFPIRLPLGGETMCACHGLLRKG-GYLCPRCGSKL 354

Query: 214 CELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL- 272
           C++PTDC +CGL +VSSPHLARS+  LFP+A +    PL + D    S  TCFGC     
Sbjct: 355 CDVPTDCEVCGLMVVSSPHLARSFWFLFPVANYG---PLSIEDVVESSE-TCFGCDSEFS 410

Query: 273 -----------LSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
                      +  G  P     C KCK  FC +CD+YIH++LH CPGC
Sbjct: 411 DATAINAGVAQVEDGVSPTGRYRCAKCKHDFCADCDLYIHDTLHTCPGC 459


>gi|321261922|ref|XP_003195680.1| component of RNA polymerase transcription factor TFIIH; Ssl1p
           [Cryptococcus gattii WM276]
 gi|317462154|gb|ADV23893.1| Component of RNA polymerase transcription factor TFIIH, putative;
           Ssl1p [Cryptococcus gattii WM276]
          Length = 461

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 137/349 (39%), Positives = 194/349 (55%), Gaps = 45/349 (12%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D+RPSR  V+   +  +V E+FDQNPL QIG++ ++D ++  +  +GGSPE  ++AL  K
Sbjct: 116 DYRPSRFEVILGYLRTYVVEWFDQNPLGQIGVIVMRDRLSEVVIPMGGSPEEIVRALSDK 175

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPG---DIMETIQKCKE 116
             L  SG+ SLQN L + +G ++ +PS    E L+++SA+ST DP     I   +     
Sbjct: 176 RKLEPSGEPSLQNGLVMAKGGMAHLPSTSSLETLVIFSAISTADPDGPITIHNVLDTLVT 235

Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIAN-- 174
             IR S++ LS E+ ICK + + TGG + VALD+ H K+L+ E  PPP        AN  
Sbjct: 236 GHIRTSILSLSGEIKICKQIAERTGGKFGVALDQDHLKDLLWETIPPPATTITPVTANVR 295

Query: 175 ---------------------LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARV 213
                                L+ MGFP R   G  ++C+CH  ++ G GY CPRC +++
Sbjct: 296 SALAAGGRGPNQTGGRAPAGDLMVMGFPIRLPLGGETMCACHGLLRKG-GYLCPRCGSKL 354

Query: 214 CELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL- 272
           C++PTDC +CGL +VSSPHLARS+  LFP+A +    PL + D    S  TCFGC     
Sbjct: 355 CDVPTDCEVCGLMVVSSPHLARSFWFLFPVANYG---PLAIEDVVG-SSGTCFGCDSEFS 410

Query: 273 -----------LSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
                      +  G  P     C KCK  FC +CD+YIH++LH CPGC
Sbjct: 411 DTSAINAGVAQVEDGVSPAGRYRCAKCKHDFCADCDLYIHDTLHTCPGC 459


>gi|212537907|ref|XP_002149109.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Talaromyces
           marneffei ATCC 18224]
 gi|210068851|gb|EEA22942.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Talaromyces
           marneffei ATCC 18224]
          Length = 453

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 140/340 (41%), Positives = 201/340 (59%), Gaps = 31/340 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV EFF+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 103 DIRPTRYLLALRYAQEFVIEFFEQNPISQLGVIGMRDGLAVRISDMSGNPTDHITAIHAL 162

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQNAL++ +G L   PS+G REVLI++ +L + DPGDI +T++     KI
Sbjct: 163 RPDDPKGLPSLQNALEMARGALFHTPSHGTREVLIVFGSLLSSDPGDIHQTLKSLIADKI 222

Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIA-EFAIAN 174
           R  ++GL+A++ IC+ LC  T G     Y VAL+E HF+EL+M+   PP A +   + + 
Sbjct: 223 RVGIVGLAAQVAICRELCAKTNGGDDSVYGVALNEQHFRELMMDVTIPPAAYSYRKSSSA 282

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E   S+C+CH +   G GY C RC  +VC LP +C  CGL L+ S HLA
Sbjct: 283 LLMMGFPSRTVESFPSLCACHSKPSCG-GYLCSRCGNKVCGLPAECPSCGLTLILSTHLA 341

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRS-RST-CFGC----------QQSLLSSGNKPGLY 282
           RSYHHLFP+  + EV       P  R+ RST C+ C           Q         G+ 
Sbjct: 342 RSYHHLFPLINWVEV-------PWKRALRSTNCYACGIQFPTVPPADQWGAFENQAKGMS 394

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCESLRHSNP 318
           V+    C  C  HFC++CD++ HE +HNCPGC+S     P
Sbjct: 395 VSSRYECSACHNHFCIDCDLFAHEIVHNCPGCQSTNFMKP 434


>gi|254577733|ref|XP_002494853.1| ZYRO0A11176p [Zygosaccharomyces rouxii]
 gi|238937742|emb|CAR25920.1| ZYRO0A11176p [Zygosaccharomyces rouxii]
          Length = 477

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 141/323 (43%), Positives = 195/323 (60%), Gaps = 18/323 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    FV EFFDQNP+SQIG+V++++G+A+ ++ + G+P+ H+ AL  +
Sbjct: 156 DMRPNRHAMLVQYSIDFVHEFFDQNPISQIGIVSMRNGLAHMISQVSGNPQDHVDALKSI 215

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P +  +EVLI++ +LS+ DPGDI  TI    +  I
Sbjct: 216 RKQEPKGNPSLQNALEMTRGLLLPVPPHCTKEVLIVFGSLSSTDPGDIHLTIGSLVQEGI 275

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
           R  +IGLSA++ IC+ LC+ T       Y V LDE H KEL  E   P P         L
Sbjct: 276 RVKIIGLSAQVAICRELCKATNYGDESFYKVPLDEIHCKELFAEAVTPLPVNKINKGFTL 335

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +KMGFP R  E + + C+CH ++  G GY CP C+++VC LP  C  C L L+ S HLAR
Sbjct: 336 VKMGFPTRIFEEAPTFCACHSKLLYG-GYFCPNCQSKVCSLPMVCPCCDLMLILSTHLAR 394

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGN-KPGLYVA-----CPKCK 289
           SYHHL P+  F EV    +    N     CF CQ    +  N K G  +      C  C 
Sbjct: 395 SYHHLMPLKTFSEVPTGQVFPTEN-----CFSCQVRFPTVKNHKTGELLTSSRYRCEDCL 449

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           K FC++CD++IHE LHNCPGCES
Sbjct: 450 KDFCIDCDVFIHEILHNCPGCES 472


>gi|46123249|ref|XP_386178.1| hypothetical protein FG06002.1 [Gibberella zeae PH-1]
          Length = 469

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 144/343 (41%), Positives = 201/343 (58%), Gaps = 39/343 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D  P+R  ++     AFVREFF+QNP+SQ+G++ ++DGVA  ++D+GG+P  H+  L G 
Sbjct: 108 DLLPTRYRLMLSYAVAFVREFFEQNPISQLGIIGMRDGVAVRVSDVGGNPAEHLDKLKGL 167

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G+ SLQNAL++ +G L   PS+G REV I+Y AL + DPGDI ETI      +I
Sbjct: 168 ESQDPQGNPSLQNALEMCRGALFHAPSHGTREVFIIYGALLSSDPGDIHETIGNLITDRI 227

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP--PAIAEFAIA 173
           R S++GLSA + IC  LC  T       Y++A+DE HF+EL +    PP    +A+ + A
Sbjct: 228 RVSIVGLSAHLAICADLCSRTNAGDESQYNIAMDEVHFRELFLAATTPPVTRTVAQ-STA 286

Query: 174 NLIKMGFPQRA--GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
           +L+ MGFP R     G+ S C+CH       GY C RC ARVC +P++C  C L L+ S 
Sbjct: 287 SLLMMGFPSRTLVPNGTTSYCACHNR-PCREGYLCTRCGARVCRIPSECPSCDLTLILST 345

Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-------------QQSLLSSG-- 276
           HLARSYHHLFP+  + EV P      +  + + CF C              +S   SG  
Sbjct: 346 HLARSYHHLFPLRNWIEV-PWA----KATTSAGCFSCLAPFPEPPKNKGHDKSKEDSGAP 400

Query: 277 ------NKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESL 313
                 ++ G Y AC  C +HFC++CD++ HE +HNCPGC+SL
Sbjct: 401 KTAKGVSESGRY-ACEVCGQHFCIDCDVFAHEVVHNCPGCQSL 442


>gi|358391363|gb|EHK40767.1| hypothetical protein TRIATDRAFT_78413 [Trichoderma atroviride IMI
           206040]
          Length = 476

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 142/342 (41%), Positives = 200/342 (58%), Gaps = 40/342 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D  P+R  +      AFVRE+F+QNP+SQ+G+V ++DGVA  ++D+GG+P  H++ L   
Sbjct: 111 DLLPTRHRLTLSYAAAFVREYFEQNPISQLGIVGMRDGVAVRISDMGGNPAEHLERLRDL 170

Query: 62  LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            G    G+ SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI +TI      +I
Sbjct: 171 EGQDPQGNPSLQNALEMCRGALFHAPSHGTREVLIIYGALLSSDPGDIHDTIGNLIADRI 230

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
           R SV+GLSA++ IC  LC  T       Y+VA+DE+H++EL +    PP     E + ++
Sbjct: 231 RVSVVGLSAQVAICAELCSRTNAGDESQYNVAMDETHYRELFLAATTPPVNRTKEQSTSS 290

Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
           L+ MGFP R  A  G+ S C+CH +     GY C RC A+VC LP++C  CGL L+ S H
Sbjct: 291 LLMMGFPSRTLAPGGTTSYCACHSK-PCREGYLCTRCSAKVCRLPSECPACGLTLILSTH 349

Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRST--CFGCQ--------------------Q 270
           LARSYHHLFP+  + EV       P   +  +  CF CQ                    +
Sbjct: 350 LARSYHHLFPLRNWVEV-------PWAEATQSIACFSCQTFFPEPPKSIKDKGKDEAPAK 402

Query: 271 SLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
           +L    ++ G Y  C  C  HFC++CD++ H  +HNCPGC+S
Sbjct: 403 ALAKGVSESGRY-KCQVCGNHFCIDCDVFAHMVIHNCPGCQS 443


>gi|196007624|ref|XP_002113678.1| hypothetical protein TRIADDRAFT_57351 [Trichoplax adhaerens]
 gi|190584082|gb|EDV24152.1| hypothetical protein TRIADDRAFT_57351 [Trichoplax adhaerens]
          Length = 353

 Score =  253 bits (647), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 123/268 (45%), Positives = 172/268 (64%), Gaps = 20/268 (7%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           +D +P+R+    K ++ F+ E+FDQNP+SQ+G++  ++  A  LTD+GG+P  HI AL  
Sbjct: 76  IDLKPNRLVATVKLLDDFITEYFDQNPISQLGVIITRNKRAEKLTDIGGNPRRHIAALRQ 135

Query: 61  KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
                C G+ SLQNAL+L    L  +PS+  RE+L++Y++L++CDP DI  T++   E  
Sbjct: 136 AACKPCIGEPSLQNALELASQTLRHVPSHASREILVIYASLTSCDPSDIHATVKNLVEDN 195

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
           IRCS IGLSAEM+I K L  +TGGSY V LDE H+KEL+  +  PP A  +   ++LI+M
Sbjct: 196 IRCSAIGLSAEMYILKKLASETGGSYRVILDEHHYKELLTHYVIPPVATGKLG-SSLIRM 254

Query: 179 GFPQRAGEGSISICSCHKEVKV-----GVGYTCPRCKARVCELPTDCRICG--------- 224
           GFP+       S+C CH + +V       GY CP+CKA+ CELP +C+ICG         
Sbjct: 255 GFPEYRTNSHPSMCQCHLDSRVVKSFSTTGYFCPQCKAKYCELPIECQICGKMNITTKTR 314

Query: 225 ---LQLVSSPHLARSYHHLFPIAPFDEV 249
              L LVS+PHLARSY HLFP+  F+EV
Sbjct: 315 NMRLSLVSAPHLARSYQHLFPLEDFEEV 342


>gi|47210864|emb|CAF92597.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 395

 Score =  253 bits (647), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 148/324 (45%), Positives = 195/324 (60%), Gaps = 26/324 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +P+R+    K +E FV E+FDQNP+SQ+G++T K+  A  LTDL G+P+ H  AL   
Sbjct: 76  DLKPNRLTSTLKLMEGFVEEYFDQNPISQMGIITTKNKRAEKLTDLAGNPKKHAAALKKA 135

Query: 62  L--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
           +   C G+ SL N L L    L  +P +  REVLI+ S+L+TCDPG+I E IQ  K  K+
Sbjct: 136 VDSACVGEPSLYNCLSLALQTLRHMPGHTSREVLIILSSLTTCDPGNIYELIQTLKSLKV 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM- 178
           R SV+GLSAE+ +C  L ++TGGSY V LDESHFKEL+M H  PPPA    +  +LI+M 
Sbjct: 196 RVSVVGLSAEVRVCTVLTRETGGSYHVILDESHFKELLMLHVKPPPASCS-SECSLIRMA 254

Query: 179 GFPQRA------GEGSISICSCHKEVKVG------VGYTCPRCKARVCELPTDCRICGLQ 226
           GFPQ         +   S    H E   G       GY CP+C A+  ELP +C++CGL 
Sbjct: 255 GFPQHTMASLTDQDAKPSFSMAHLEGGGGGPDLSLGGYFCPQCHAKYTELPVECKVCGLT 314

Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
           LV +PHLARS+HHLFP+  F E +    + P++R    C  CQ  L     K      CP
Sbjct: 315 LVLAPHLARSFHHLFPLQVFPESS--AEDPPKDR---FCQACQGRL-----KDKSVFTCP 364

Query: 287 KCKKHFCLECDIYIHESLHNCPGC 310
            C   FC+ECD++IH+SLH CP C
Sbjct: 365 SCHSVFCVECDLFIHDSLHCCPCC 388


>gi|406863243|gb|EKD16291.1| RNA polymerase TFIIH complex subunit Ssl1 [Marssonina brunnea f.
           sp. 'multigermtubi' MB_m1]
          Length = 524

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 140/351 (39%), Positives = 205/351 (58%), Gaps = 36/351 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +    FV EFF+QNP+SQ+G++ ++DG+A  ++++GG+P  HI+ +  +
Sbjct: 177 DLRPTRYLLTLRFASEFVTEFFEQNPISQLGVLGMRDGIAIRVSEMGGNPTEHIENIKKL 236

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             L   G+ SLQNAL++ + +L   P++G REVLI+  AL T DPGDI ET++     +I
Sbjct: 237 RDLQGQGNPSLQNALEMARAILHYAPAHGTREVLIISGALLTSDPGDIHETVRSLVTDRI 296

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
           R SV+GL+A++ I   +C  T     G+Y VAL+E HF+EL+M    PP    +  +   
Sbjct: 297 RVSVVGLAAQVKILSVICSKTNAGNEGNYHVALNEQHFRELVMALTTPPVTRTKTQSQPT 356

Query: 175 LIKMGFPQR-AGEG-SISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
           L+ MGFP R +G G ++S C+CH ++  G GY C RC ++VC LP DC +CGL LV S H
Sbjct: 357 LLMMGFPSRISGVGKAMSYCACHSKLVRG-GYLCSRCDSKVCSLPADCPVCGLTLVLSTH 415

Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRST--CFGC------------QQSLLSSGNK 278
           LARSYHHLFP+  + EV       P +++  +  CF C            +      G  
Sbjct: 416 LARSYHHLFPLKNWVEV-------PWSQAGKSKACFACLTAFPEVPSHLEKNGAEREGTS 468

Query: 279 PGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRHSNPI-----VANEG 324
                 C  C  HFC++CD++ HE +HNCPGC+S      I     V N+G
Sbjct: 469 ESGRYTCEDCGYHFCIDCDVHAHEVVHNCPGCQSDSQQQQIGIAHEVKNDG 519


>gi|50551723|ref|XP_503336.1| YALI0D26862p [Yarrowia lipolytica]
 gi|49649204|emb|CAG81542.1| YALI0D26862p [Yarrowia lipolytica CLIB122]
          Length = 496

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 143/334 (42%), Positives = 195/334 (58%), Gaps = 26/334 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D RP+R   V      FV  FFDQNP+SQ+G++ +++G A  ++ LG +P  HI AL   
Sbjct: 112 DLRPNRYQFVINHAIEFVTNFFDQNPISQLGILGMRNGQAISISTLGSNPNDHINALTAA 171

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            KL   GD SLQNAL++ +GLL  +PS+  +EVL++  AL + DPGDI  TI K    K+
Sbjct: 172 KKLEPQGDPSLQNALEMARGLLFHVPSHCTKEVLVVLGALLSADPGDIHVTIDKLVIDKV 231

Query: 120 RCSVIGLSAEMFICKHLCQDT----GGSYSVALDESHFKELIMEHAPPPPAIAEFAIAN- 174
           R  VIGL+A++ ICK LC  T       Y V L+E HF+EL+ E   P     +   AN 
Sbjct: 232 RARVIGLAAQVAICKELCVKTNFGDASYYGVVLNEQHFQELMDEATTPLAESQQSQSANP 291

Query: 175 --LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
             L+ MGFP +  E + S+ +    +  G GY CP+CK +V  LPT C  CGL L+ S H
Sbjct: 292 ASLVLMGFPSKVSEAAPSLSASDAALTQG-GYVCPQCKVKVSSLPTVCPCCGLTLILSTH 350

Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQ----------QSLLSSGNKPGLY 282
           LARSYHHLFP+APF EV   C N  ++     C GCQ          +++ + G+    Y
Sbjct: 351 LARSYHHLFPLAPFIEVP--CKNAHKSE---FCAGCQSKFPVVARDAKNIQADGSMTSRY 405

Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGCESLRHS 316
             CP C  HFC++CD++ HE LHNC GC++  +S
Sbjct: 406 -ECPTCHSHFCIDCDVFCHEILHNCIGCQARSYS 438


>gi|340518378|gb|EGR48619.1| predicted protein [Trichoderma reesei QM6a]
          Length = 480

 Score =  251 bits (642), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 147/340 (43%), Positives = 196/340 (57%), Gaps = 39/340 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D  P+R  +      AFVRE+F+QNP+SQ+ +V ++DGVA  ++D+GG+P  H++ L   
Sbjct: 113 DLLPTRYRLALSYAAAFVREYFEQNPISQLAIVGMRDGVALRVSDMGGNPAEHLERLREY 172

Query: 62  LGC--SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            G   SG+ SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI +TI      +I
Sbjct: 173 EGQDPSGNPSLQNALEMCRGALFHAPSHGTREVLIIYGALLSSDPGDIHDTINNLVAERI 232

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
           R SV+GLSA++ IC  LC  T       Y+VA+DE+HFKEL +    PP     E + A+
Sbjct: 233 RVSVVGLSAQVAICAELCSRTNVGDDSQYNVAMDETHFKELFLALTTPPVNRTKEQSTAS 292

Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
           L+ MGFP R  A  G+ S C+CH +     GY C RC  +VC LP++C  CGL L+ S H
Sbjct: 293 LLMMGFPSRTLAPGGTSSYCACHSQ-PCREGYLCTRCGVKVCRLPSECPACGLTLILSTH 351

Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGCQQSLLS--SGNK----------- 278
           LARSYHHLFP+  + EV P         +RST CF CQ         NK           
Sbjct: 352 LARSYHHLFPLRNWVEV-PWA-----EAARSTACFSCQSPFPEPPKANKEKGKDEAAPAK 405

Query: 279 --------PGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
                    G Y  C  C  HFC++CD++ H  +HNCPGC
Sbjct: 406 APAKGVSESGRY-KCQVCDHHFCIDCDVFAHMVIHNCPGC 444


>gi|296817267|ref|XP_002848970.1| suppressor of stem-loop protein 1 [Arthroderma otae CBS 113480]
 gi|238839423|gb|EEQ29085.1| suppressor of stem-loop protein 1 [Arthroderma otae CBS 113480]
          Length = 484

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 134/334 (40%), Positives = 199/334 (59%), Gaps = 31/334 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV EFF+QNP+SQ+G++ +KDGVA  ++++ G+P  HI AL  +
Sbjct: 125 DLRPTRYLLTLRYAQKFVLEFFEQNPISQLGIMGMKDGVAVQVSEMSGNPTEHILALHAL 184

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQNAL++ +G L   P++G REV+I+  AL + DPGDI +TI      K+
Sbjct: 185 RSEDPKGLPSLQNALEMARGSLYHTPTHGTREVIIILGALLSSDPGDIRQTITSLVADKV 244

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAE-FAIAN 174
           R  +IGL+A++ IC+ LC  T      SY VA+DE H  +L+++   PP   ++ ++ ++
Sbjct: 245 RVGIIGLAADVAICRELCAKTNDGNDSSYGVAIDEKHLWDLLLDVTTPPVTYSQRYSSSS 304

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E   S+C+CH +   G GY C RC ++VC LP +C  C L L+ S HLA
Sbjct: 305 LLMMGFPSRTVEPFPSLCACHSKPVRG-GYMCSRCGSKVCSLPAECPTCNLTLILSTHLA 363

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRS--TCFGC----------QQSLLSSGNKPGLY 282
           RSYHHLFP+  + EV       P  R+    +CF C          ++     G+  G+ 
Sbjct: 364 RSYHHLFPLINWVEV-------PWKRASKSVSCFACGNVFPPVPPREKWDRQEGSVKGMS 416

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           V+    C  C  HFC++CD++ HE +HNCPGC+S
Sbjct: 417 VSSRYECTVCHHHFCIDCDVFAHEIVHNCPGCQS 450


>gi|407917809|gb|EKG11111.1| von Willebrand factor type A [Macrophomina phaseolina MS6]
          Length = 481

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 138/354 (38%), Positives = 198/354 (55%), Gaps = 44/354 (12%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF+P+R A V K  + +VREFF+QNP+SQ+ ++T+ DG+ N +++L G+P  HI  L   
Sbjct: 106 DFKPTRYAAVLKYAQEYVREFFEQNPISQLSVMTMYDGLCNRVSELSGNPNDHISPLQRM 165

Query: 62  LGCS--------GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQK 113
                       G  SLQNAL+  +  L   PS+G REV+I+  AL + DPGDI +TI  
Sbjct: 166 RDTRNPNYQDPRGSPSLQNALEQARAALYHTPSHGTREVIIILGALLSLDPGDIFKTIAN 225

Query: 114 CKESKIRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP----- 164
           C ++ +R ++IG+   + IC+ +C  T G    +Y VA+D+ HF++L++    PP     
Sbjct: 226 CVKNNVRVNIIGMGGRLKICQEICSRTNGGDESAYVVAVDQLHFRDLLLATTTPPVIRQT 285

Query: 165 PAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICG 224
            A A    A+L+KMGFP R  E + +IC+CH  +  G GY C +C+A+VC LPT C  C 
Sbjct: 286 TAQAAANPASLLKMGFPSRVEESAPTICACHGTLTRG-GYLCSQCQAKVCSLPTTCPSCN 344

Query: 225 LQLVSSPHLARSYHHLFPIAPFDEVT--------------------PLCLNDPRNRSRST 264
           L L+ S HLARSYHHLFP+  + EVT                    PL       +  +T
Sbjct: 345 LTLILSTHLARSYHHLFPLKNWREVTWERARQVGSTECKSCLTPFPPLPARSAAAKKAAT 404

Query: 265 CFGCQQSLLSSGNKPGLYVA------CPKCKKHFCLECDIYIHESLHNCPGCES 312
             G Q     + N  G+  +      C  C+ HFC++CD+Y HE +HNCPGC S
Sbjct: 405 ANGEQHGAPGTKNTEGVGASESSRYECEDCRNHFCVDCDVYCHEIVHNCPGCLS 458


>gi|389623489|ref|XP_003709398.1| RNA polymerase TFIIH complex subunit Ssl1 [Magnaporthe oryzae
           70-15]
 gi|351648927|gb|EHA56786.1| RNA polymerase TFIIH complex subunit Ssl1 [Magnaporthe oryzae
           70-15]
 gi|440469566|gb|ELQ38674.1| suppressor of stem-loop protein 1 [Magnaporthe oryzae Y34]
 gi|440482863|gb|ELQ63315.1| suppressor of stem-loop protein 1 [Magnaporthe oryzae P131]
          Length = 516

 Score =  250 bits (638), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 146/378 (38%), Positives = 203/378 (53%), Gaps = 76/378 (20%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D  P+R  +       FVRE+FDQNP+SQ+ +V ++D +A  ++DL G+P  HI+ L  +
Sbjct: 114 DLMPTRHRLTISYAIEFVREYFDQNPISQLAIVGMRDSIAVRISDLSGNPAEHIEKLRAL 173

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             L  SG  SLQNAL++ +G L   PS+G REV+I++ AL + DPGDI ETI    + +I
Sbjct: 174 PDLDPSGSPSLQNALEMCRGALFNTPSHGTREVVIIFGALLSLDPGDIHETIDGLMQDRI 233

Query: 120 RCSVIGLSAEMFICKHLCQDT----------------GGSYSVALDESHFKELIMEHAPP 163
           R SVIGL+A+++IC+ +C+ T                G +Y+VAL E HF+EL+M    P
Sbjct: 234 RVSVIGLAAQVYICEEICRRTNSGKAAADLGIVTDTDGSAYNVALHEVHFRELLMSATRP 293

Query: 164 PPAI------AEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP 217
           P         A    A+L+ MGFP R  +   ++CSCH    V  G+ C RC +RVC LP
Sbjct: 294 PVTRQSQLQQATGETASLLMMGFPSRKTDKDPALCSCHNR-PVRNGFLCTRCGSRVCRLP 352

Query: 218 TDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRS-RS-TCFGCQQSLLSS 275
           ++C  CGL L+ S HLARSYHHLFP+  + EV       P N + RS +CF C      +
Sbjct: 353 SECPACGLTLILSTHLARSYHHLFPLKIWSEV-------PWNDAVRSVSCFACHAPFPDA 405

Query: 276 GNK-----------------------------------------PGLYVACPKCKKHFCL 294
            NK                                          G Y+ C  C+ HFC+
Sbjct: 406 KNKDADKGKDAEKDPGDKHLKVPSKDTNGKKKKPAAPVPKGVSESGRYM-CQVCRHHFCI 464

Query: 295 ECDIYIHESLHNCPGCES 312
           +CD++ HE LHNCPGC+S
Sbjct: 465 DCDVFSHEVLHNCPGCQS 482


>gi|430811329|emb|CCJ31252.1| unnamed protein product, partial [Pneumocystis jirovecii]
          Length = 319

 Score =  250 bits (638), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 133/323 (41%), Positives = 194/323 (60%), Gaps = 26/323 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D  PSR  V   Q  +FV E+F+QNPL+Q+G++ ++DG A  ++  GG+P  H+  L  +
Sbjct: 5   DLIPSRYVVSIDQASSFVSEYFEQNPLAQLGIIGMRDGRAISISPYGGNPHEHLMTLSRL 64

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            ++  SGD+SLQNAL++ + +L  +P +G REVL+++ +L++ DPGDI ETI    E KI
Sbjct: 65  KQMDPSGDASLQNALEMARTILYPVPKHGTREVLLIFGSLTSFDPGDIYETINVLIEEKI 124

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEFA-IAN 174
           R  VIGLSA M +C+ +C++T      SY + L+E HF +L+M+   PP        +  
Sbjct: 125 RVYVIGLSASMSVCQQICKETNAGDESSYGIVLNEQHFHDLLMQFVIPPALKKSHKFLET 184

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           LI MGFP +  E   ++C+CH +V +  GY CPRC ++VC LP +C  C L L+ S HLA
Sbjct: 185 LIMMGFPSKLIENDETLCACHSDV-IKEGYLCPRCFSKVCSLPINCPCCDLTLILSTHLA 243

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGCQ---------QSLLSSGNKPGLYVA 284
           RSYHHLFP+  + EV       P   + ST C+ C           S+  S +    Y A
Sbjct: 244 RSYHHLFPLKNWIEV-------PWEEAYSTHCYACLIPFPEKHEVTSITHSVSISSRY-A 295

Query: 285 CPKCKKHFCLECDIYIHESLHNC 307
           C +C KHFC++CD++ HE L  C
Sbjct: 296 CSECHKHFCIDCDVFAHEILFEC 318


>gi|346326021|gb|EGX95617.1| TFIIH basal transcription factor complex, subunit SSL1 [Cordyceps
           militaris CM01]
          Length = 508

 Score =  249 bits (637), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 145/352 (41%), Positives = 195/352 (55%), Gaps = 47/352 (13%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D  P+R  +  +   AFVREFF+QNP+SQ+G+V ++DGVA  ++DLGG+P  H++ L  +
Sbjct: 135 DMLPTRHRLTLRYAAAFVREFFEQNPISQLGIVGMRDGVAVRVSDLGGNPAEHLERLKAL 194

Query: 60  GKLGCSGDSSLQNALDLVQG---------LLSQIPSYGHREVLILYSALSTCDPGDIMET 110
                 G+ SLQNAL++ +G           S  PS+G REVLI+Y AL + DPGDI +T
Sbjct: 195 ESQDPQGNPSLQNALEMCRGCNANEFNVHTNSHAPSHGTREVLIIYGALLSSDPGDIHDT 254

Query: 111 IQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPP-P 165
           +      +IR S++GLSA + IC  LC  T       Y VA DE HF+EL +    PP  
Sbjct: 255 LASLINDRIRVSIVGLSAHLSICAELCARTNAGDTSQYGVATDEVHFRELFLAATTPPVT 314

Query: 166 AIAEFAIANLIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRIC 223
              E + A+L+ MGFP R  A  G+ S C+CH       G+ C RC  RVC LP +C  C
Sbjct: 315 RTKEQSTASLLMMGFPSRVLAPGGATSFCACHNR-PCREGFLCTRCGTRVCRLPAECPAC 373

Query: 224 GLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQ-------------- 269
            L L+ S HLARSYHHLFP+  + EV+     D   RS   CF CQ              
Sbjct: 374 ALTLILSTHLARSYHHLFPLRNWVEVS---WADAVGRS-PACFACQCPFPAAAATGGDAS 429

Query: 270 ---------QSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
                    +  +   ++ G Y AC  C  HFC++CD+Y HE +HNCPGC+S
Sbjct: 430 AREEGQGKKRPTVKGVSESGRY-ACEVCGNHFCIDCDVYAHEVIHNCPGCQS 480


>gi|297603026|ref|NP_001053274.2| Os04g0508900 [Oryza sativa Japonica Group]
 gi|255675611|dbj|BAF15188.2| Os04g0508900, partial [Oryza sativa Japonica Group]
          Length = 173

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 115/150 (76%), Positives = 137/150 (91%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MD+RPSRMAVVAK  E F+REFFDQNPLS +G+VT+KDG+++ LT++GGSPES IKALMG
Sbjct: 5   MDYRPSRMAVVAKYAEVFIREFFDQNPLSHVGIVTMKDGISHRLTEIGGSPESQIKALMG 64

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           KL CSG+ SLQNAL+LV G L Q+PSYGH+EVL LYSAL+TCDPGDIMETI KCK+SKIR
Sbjct: 65  KLECSGEPSLQNALELVHGYLDQVPSYGHKEVLFLYSALNTCDPGDIMETIAKCKKSKIR 124

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDE 150
           CSVIGL+AE+FICK+LC++TGGSY+VALDE
Sbjct: 125 CSVIGLAAEIFICKYLCEETGGSYTVALDE 154


>gi|326468925|gb|EGD92934.1| RNA polymerase TFIIH complex subunit Ssl1 [Trichophyton tonsurans
           CBS 112818]
 gi|326480095|gb|EGE04105.1| RNA polymerase TFIIH complex subunit Ssl1 [Trichophyton equinum CBS
           127.97]
          Length = 489

 Score =  249 bits (635), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 134/334 (40%), Positives = 199/334 (59%), Gaps = 31/334 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV+EFF+QNP+SQ+G++ +KDG+A  ++++ G+P  H+ AL  +
Sbjct: 126 DLRPTRYLLTLRYAQKFVQEFFEQNPISQLGIMGMKDGIAVQISEMSGNPTEHMLALHAL 185

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQNAL++ +G L   PS+G REV+I+  AL + DPGDI +TI      K+
Sbjct: 186 RAEDPKGLPSLQNALEMARGSLYHTPSHGTREVIIILGALLSSDPGDIHQTITSLVADKV 245

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAE-FAIAN 174
           R  +IGL+A++ IC+ +C  T      SY VA+DE H  +L+++   PP   ++ ++  +
Sbjct: 246 RVGIIGLAADVAICREICAKTNDGNDSSYGVAMDEKHLWDLLLDVTTPPVTYSQRYSSNS 305

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E   S+C+CH +   G GY C RC ++VC LP +C  C L L+ S HLA
Sbjct: 306 LLMMGFPSRTVEPFPSLCACHSKPVRG-GYMCSRCGSKVCSLPAECPTCNLTLILSTHLA 364

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNR-SRS-TCFGCQQSL----------LSSGNKPGLY 282
           RSYHHLFP+  + EV       P  R SRS +CF C              +   +  G+ 
Sbjct: 365 RSYHHLFPLINWVEV-------PWKRASRSVSCFACGNPFPAVPPRDTWDMRDRSIKGMS 417

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           V+    C  C  HFC++CD++ HE +HNCPGC+S
Sbjct: 418 VSRRYECTVCHHHFCIDCDVFAHEVVHNCPGCQS 451


>gi|327301499|ref|XP_003235442.1| RNA polymerase TFIIH complex subunit Ssl1 [Trichophyton rubrum CBS
           118892]
 gi|326462794|gb|EGD88247.1| RNA polymerase TFIIH complex subunit Ssl1 [Trichophyton rubrum CBS
           118892]
          Length = 489

 Score =  249 bits (635), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 134/334 (40%), Positives = 199/334 (59%), Gaps = 31/334 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV+EFF+QNP+SQ+G++ +KDG+A  ++++ G+P  H+ AL  +
Sbjct: 126 DLRPTRYLLTLRYAQKFVQEFFEQNPISQLGIMGMKDGIAVQISEMSGNPTEHMLALHAL 185

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQNAL++ +G L   PS+G REV+I+  AL + DPGDI +TI      K+
Sbjct: 186 RAEDPKGLPSLQNALEMARGSLYHTPSHGTREVIIILGALLSSDPGDIHQTITSLVADKV 245

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAE-FAIAN 174
           R  +IGL+A++ IC+ +C  T      SY VA+DE H  +L+++   PP   ++ ++  +
Sbjct: 246 RVGIIGLAADVAICREICAKTNDGNDSSYGVAIDEKHLWDLLLDVTTPPVTYSQRYSSNS 305

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E   S+C+CH +   G GY C RC ++VC LP +C  C L L+ S HLA
Sbjct: 306 LLMMGFPSRTVEPFPSLCACHSKPVRG-GYMCSRCGSKVCSLPAECPTCNLTLILSTHLA 364

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNR-SRS-TCFGCQQSL----------LSSGNKPGLY 282
           RSYHHLFP+  + EV       P  R SRS +CF C              +   +  G+ 
Sbjct: 365 RSYHHLFPLINWVEV-------PWKRASRSVSCFACGNPFPAVPPRDKWDMRDRSIKGMS 417

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           V+    C  C  HFC++CD++ HE +HNCPGC+S
Sbjct: 418 VSSRYECTVCHHHFCIDCDVFAHEVVHNCPGCQS 451


>gi|302656432|ref|XP_003019969.1| hypothetical protein TRV_05937 [Trichophyton verrucosum HKI 0517]
 gi|291183747|gb|EFE39345.1| hypothetical protein TRV_05937 [Trichophyton verrucosum HKI 0517]
          Length = 515

 Score =  249 bits (635), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 134/334 (40%), Positives = 199/334 (59%), Gaps = 31/334 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV+EFF+QNP+SQ+G++ +KDG+A  ++++ G+P  H+ AL  +
Sbjct: 151 DLRPTRYLLTLRYAQKFVQEFFEQNPISQLGIMGMKDGIAVQISEMSGNPTEHMLALHAL 210

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQNAL++ +G L   PS+G REV+I+  AL + DPGDI +TI      K+
Sbjct: 211 RAEDPKGLPSLQNALEMARGSLYHTPSHGTREVIIILGALLSSDPGDIHQTITSLVADKV 270

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAE-FAIAN 174
           R  +IGL+A++ IC+ +C  T      SY VA+DE H  +L+++   PP   ++ ++  +
Sbjct: 271 RVGIIGLAADVAICREICAKTNDGNDSSYGVAIDEKHLWDLLLDVTTPPVTYSQRYSSNS 330

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E   S+C+CH +   G GY C RC ++VC LP +C  C L L+ S HLA
Sbjct: 331 LLMMGFPSRTVEPFPSLCACHSKPVRG-GYMCSRCGSKVCSLPAECPTCNLTLILSTHLA 389

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNR-SRS-TCFGCQQSL----------LSSGNKPGLY 282
           RSYHHLFP+  + EV       P  R SRS +CF C              +   +  G+ 
Sbjct: 390 RSYHHLFPLINWVEV-------PWKRASRSVSCFACGNPFPAVPPRDKWDMRDRSIKGMS 442

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           V+    C  C  HFC++CD++ HE +HNCPGC+S
Sbjct: 443 VSSRYECTVCHHHFCIDCDVFAHEVVHNCPGCQS 476


>gi|341038582|gb|EGS23574.1| putative transcription factor [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 534

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 144/390 (36%), Positives = 203/390 (52%), Gaps = 83/390 (21%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--- 58
           D  P+R  +       FVRE+F+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI+ L   
Sbjct: 118 DLLPNRYLLTLNYAVDFVREYFEQNPISQMGIIAMRDGIAVRVSDMSGNPADHIERLRFW 177

Query: 59  MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
                  G+ SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI ETI    + +
Sbjct: 178 AEHQEPQGNPSLQNALEMCRGALYHTPSHGTREVLIVYGALLSSDPGDIHETISNLVKDR 237

Query: 119 IRCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAI-- 172
           IR +V+GL+A++ +C  LC  T      +Y+VAL E HF+EL +    PPPA A  A   
Sbjct: 238 IRVTVVGLAAQVAVCAELCTRTNHGDDSTYAVALHEQHFRELFLAATIPPPATASSATDK 297

Query: 173 -------------ANLIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELP 217
                        A+L+ MGFP R  A    +S+C+CH       GY C RC+A+VC LP
Sbjct: 298 NGANGNANAASTDASLLMMGFPSRTLASASHVSLCACHSRPSRE-GYLCTRCRAKVCRLP 356

Query: 218 TDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC--------- 268
            +C  CGL L+ S HLARSYHHLFP+  + EV+     + R   +  CF C         
Sbjct: 357 AECPACGLTLILSTHLARSYHHLFPLKGWVEVS---WAEARKSKQVGCFACLAPFPLPPA 413

Query: 269 ----------------------QQSLLSSGNK------PGLYVA---------------- 284
                                 Q+   SS N        G+ +A                
Sbjct: 414 PGSEKTGKEPTQKTQGQAQQPPQERQGSSSNSNNAKKTTGISLATALPEARAVGVSESGR 473

Query: 285 --CPKCKKHFCLECDIYIHESLHNCPGCES 312
             CP C KHFC++CD++ HE +HNCPGC++
Sbjct: 474 YKCPTCGKHFCIDCDVFAHEVIHNCPGCQA 503


>gi|302506370|ref|XP_003015142.1| hypothetical protein ARB_06903 [Arthroderma benhamiae CBS 112371]
 gi|291178713|gb|EFE34502.1| hypothetical protein ARB_06903 [Arthroderma benhamiae CBS 112371]
          Length = 490

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 134/334 (40%), Positives = 198/334 (59%), Gaps = 31/334 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV+EFF+QNP+SQ+G+  +KDG+A  ++++ G+P  H+ AL  +
Sbjct: 126 DLRPTRYLLTLRYAQKFVQEFFEQNPISQLGITGMKDGIAVQISEMSGNPTEHMLALHAL 185

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQNAL++ +G L   PS+G REV+I+  AL + DPGDI +TI      K+
Sbjct: 186 RAEDPKGLPSLQNALEMARGSLYHTPSHGTREVIIILGALLSSDPGDIHQTITSLVADKV 245

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAE-FAIAN 174
           R  +IGL+A++ IC+ +C  T      SY VA+DE H  +L+++   PP   ++ ++  +
Sbjct: 246 RVGIIGLAADVAICREICAKTNDGNDSSYGVAIDEKHLWDLLLDVTTPPVTYSQRYSSNS 305

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E   S+C+CH +   G GY C RC ++VC LP +C  C L L+ S HLA
Sbjct: 306 LLMMGFPSRTVEPFPSLCACHSKPVRG-GYMCSRCGSKVCSLPAECPTCNLTLILSTHLA 364

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNR-SRS-TCFGCQQSL----------LSSGNKPGLY 282
           RSYHHLFP+  + EV       P  R SRS +CF C              +   +  G+ 
Sbjct: 365 RSYHHLFPLINWVEV-------PWKRASRSVSCFACGNPFPAVPPRDKWDMRDRSIKGMS 417

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           V+    C  C  HFC++CD++ HE +HNCPGC+S
Sbjct: 418 VSSRYECTVCHHHFCIDCDVFAHEVVHNCPGCQS 451


>gi|291224884|ref|XP_002732433.1| PREDICTED: general transcription factor IIH, polypeptide 2, 44kD
           subunit-like [Saccoglossus kowalevskii]
          Length = 593

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 127/267 (47%), Positives = 175/267 (65%), Gaps = 17/267 (6%)

Query: 15  VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG--KLGCSGDSSLQN 72
           +E F+ E+FD NP+SQ+G++T  +  AN LT+LGG+P+ HI AL       C  + SLQN
Sbjct: 59  LEHFLEEYFDHNPISQVGILTTSNKRANRLTELGGNPKRHIAALQKCVDKQCLKEPSLQN 118

Query: 73  ALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFI 132
           AL+L    L  +P +  REVL++  +L+TCDPG+I +TI+  K+  IRCSVIGLSA++ +
Sbjct: 119 ALELAATTLRHVPGHASREVLVILGSLTTCDPGNIHDTIKSMKDLNIRCSVIGLSADVRV 178

Query: 133 CKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ---RAGEGSI 189
           CK LC +T GSYSV LDE HFKE+++EH  PP A A    ++LI+MGFPQ    A +  +
Sbjct: 179 CKRLCSETQGSYSVILDEPHFKEVLLEHIKPPLATASTE-SSLIRMGFPQHQLHAEKDKL 237

Query: 190 ---SICSCHKEVKVGV-----GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
              S+C CH + K        GY CP+CK++ CELP +C+ CGL LVS+PHLARS+HHLF
Sbjct: 238 EKPSMCMCHLDSKSTQGFSTGGYFCPQCKSKYCELPIECKACGLTLVSAPHLARSFHHLF 297

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGC 268
           P+  F E+    L+D +      C  C
Sbjct: 298 PLESFQEIQ---LSDLQESDSRVCLSC 321


>gi|341877728|gb|EGT33663.1| hypothetical protein CAEBREN_16370 [Caenorhabditis brenneri]
          Length = 376

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 136/311 (43%), Positives = 187/311 (60%), Gaps = 20/311 (6%)

Query: 5   PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
           PSR  VV K +++F+  FF+QNP++QIGL+T KD  A  LT + G+     +AL  M + 
Sbjct: 82  PSRFVVVMKALQSFLERFFEQNPIAQIGLITCKDRKAERLTMMTGNIRVLKEALNSMSEA 141

Query: 63  GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
            C GD SLQNAL L    L  +P +  REVLI+ +ALS+ DPG+I  TI+  K   IRCS
Sbjct: 142 FCGGDFSLQNALQLACANLKGMPGHVSREVLIVMAALSSIDPGNIFSTIETMKRMNIRCS 201

Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
            IGLSAEMFICK + + T G YSVALD  H + L  +H  PP + A+ +  N I +GFP 
Sbjct: 202 AIGLSAEMFICKEMAKATKGEYSVALDPDHLQLLFSKHTLPPSS-AKSSECNAIHVGFPH 260

Query: 183 RAGEGSISICSCHKEVK--VGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
                + S C CH + K     G+ C +C AR C +P +C +C L LV++P LAR++ HL
Sbjct: 261 HELIKTRSFCVCHPDSKPISSRGFICTQCGARHCSIPAECPVCKLTLVAAPQLARAFRHL 320

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
            P+A F ++         + +R +C+ C+  L S G +      C KC+  FCL+CD  +
Sbjct: 321 QPLAAFQQI---------DVTRGSCYACETRLTSEGFR------CEKCRLVFCLDCDTLL 365

Query: 301 HESLHNCPGCE 311
           HESLH CPGC+
Sbjct: 366 HESLHVCPGCK 376


>gi|443898092|dbj|GAC75430.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit SSL1 [Pseudozyma antarctica
           T-34]
          Length = 498

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 129/342 (37%), Positives = 190/342 (55%), Gaps = 37/342 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  V  +    FV E+FDQNP+ Q+ ++  ++G+A  L  +GG+   H  +L  K
Sbjct: 160 DMRPNRFDVTLQYAREFVNEYFDQNPIGQLSVIGTREGIAERLAMMGGNTVDHTASLSNK 219

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             L   G+ SLQNAL++ +  L  +P+   RE+L ++ +L+TCDPG+I +TI    +  I
Sbjct: 220 RRLEPRGEPSLQNALEMARSSLVHLPASNSREILAIFGSLTTCDPGNIHDTINTLVKDNI 279

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIA----------- 168
           R SV+ L+AE+ + K +C  TGG +SVAL+E HF + + +  PPP               
Sbjct: 280 RVSVVHLAAEVKVFKDVCTRTGGVFSVALNEGHFHDALFDLVPPPAVDGPRKGKRRAGVQ 339

Query: 169 -EFAIAN---LIKMGFPQR-AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRIC 223
            E  + N   L++M FP R     + ++C+CH   + G GY CPRC  +VC++PTDC +C
Sbjct: 340 EEEEVQNGVDLLQMAFPLRLPAHAAPTLCACHSRSR-GSGYLCPRCGVKVCDVPTDCPVC 398

Query: 224 GLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC--------------- 268
           G+ +V S HLARSYHHLFP+  +  V    + D   R+   CF C               
Sbjct: 399 GITIVMSTHLARSYHHLFPVPNWKAVAWEAVGD---RAEDACFSCNLQFPKKDDNRTAAA 455

Query: 269 QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
            ++L  +G  P     C +C   FCLECD ++HE LH CPGC
Sbjct: 456 NKALEDAGLSPSSRYRCTRCNHDFCLECDAFVHEQLHVCPGC 497


>gi|339258242|ref|XP_003369307.1| general transcription factor IIH subunit 2 [Trichinella spiralis]
 gi|316966476|gb|EFV51048.1| general transcription factor IIH subunit 2 [Trichinella spiralis]
          Length = 403

 Score =  247 bits (631), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 136/324 (41%), Positives = 190/324 (58%), Gaps = 23/324 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESH---IKAL 58
           D  PSR   V K +  FV +FFDQNP+SQI ++T KD   + L D  GS   H   +KA+
Sbjct: 88  DLLPSRFISVIKALSLFVDDFFDQNPISQISIITAKDKKTDKLVDFTGSARKHKEFLKAI 147

Query: 59  MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
             ++  SG+ SLQN+L+    +L  +PS+  REVL++  +LSTCDPGDI E+++  K + 
Sbjct: 148 AEEIP-SGEFSLQNSLETANEMLRHMPSHSSREVLVIMGSLSTCDPGDIEESLEILKRNN 206

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
           IRC+ I LSAEM+ICK + + T G Y+V +DE   K L+   A PP A  +   ANLI+M
Sbjct: 207 IRCNFIALSAEMYICKRIAKATNGKYAVIIDEDDLKTLLSVFALPPVATTQLG-ANLIRM 265

Query: 179 GF--------PQRAGEGSISICSCHKEVKVG----VGYTCPRCKARVCELPTDCRICGLQ 226
           G          ++ G     I   H E  +G    VG+ CP+C  + C +P +C+ICGL 
Sbjct: 266 GNSDFRRGLKTRKVGHFVFGIHMKHGE-GIGKLTCVGFICPQCNFKYCFVPMECQICGLV 324

Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
           L S+PHLARSY HL+PI PF+E       D   +    C GC  ++         YV CP
Sbjct: 325 LASAPHLARSYQHLYPILPFEEKAI----DKHMKKELYCAGCFSAIEVKVMFDVSYV-CP 379

Query: 287 KCKKHFCLECDIYIHESLHNCPGC 310
            C++ FC+ CD+ IHE++H+CPGC
Sbjct: 380 GCRQDFCINCDLLIHETMHSCPGC 403


>gi|358377786|gb|EHK15469.1| hypothetical protein TRIVIDRAFT_56425 [Trichoderma virens Gv29-8]
          Length = 402

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 142/341 (41%), Positives = 198/341 (58%), Gaps = 38/341 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D  P+R  +      AFVRE+F+QNP+SQ+ +V ++DGVA  ++D+GG+P  H++ L   
Sbjct: 44  DLLPTRYRLTLSYAAAFVREYFEQNPISQLAIVGMRDGVAVRVSDMGGNPAEHLERLRDL 103

Query: 62  LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            G    G+ SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI +TI      +I
Sbjct: 104 DGQDPQGNPSLQNALEMCRGALFHAPSHGTREVLIIYGALLSSDPGDIHDTISNLIAERI 163

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIME-HAPPPPAIAEFAIAN 174
           R SV+GLSA++ IC  LC  T       Y+VA+DE+HFK+L +    PP     E + ++
Sbjct: 164 RVSVVGLSAQVAICAELCSRTNVGDDSQYNVAMDETHFKDLFLAITTPPVNRTKEQSTSS 223

Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
           L+ MGFP R  A  G+ S C+CH +     GY C RC  +VC LP++C  CGL L+ S H
Sbjct: 224 LLMMGFPSRTLAPGGTTSYCACHSK-PCREGYLCTRCGVKVCRLPSECPACGLTLILSTH 282

Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRS-TCFGCQ--------------------QS 271
           LARSYHHLFP+  + EV P         +RS  CF CQ                    ++
Sbjct: 283 LARSYHHLFPLRNWVEV-PWA-----EATRSIACFSCQCPFPEPPKPNKEKGKDEAPTKA 336

Query: 272 LLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
                ++ G Y  C  C  HFC++CD++ H  +HNCPGC+S
Sbjct: 337 PAKGVSESGRY-KCQVCGNHFCIDCDVFAHMVIHNCPGCQS 376


>gi|315049235|ref|XP_003173992.1| stem-loop protein 1 suppressor [Arthroderma gypseum CBS 118893]
 gi|311341959|gb|EFR01162.1| stem-loop protein 1 suppressor [Arthroderma gypseum CBS 118893]
          Length = 487

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 133/334 (39%), Positives = 200/334 (59%), Gaps = 31/334 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV+EFF+QNP+SQ+G++ +KDG+A  ++++ G+P  H+ AL  +
Sbjct: 126 DLRPTRYLLTLRYAQKFVQEFFEQNPISQLGIMGMKDGIAVQISEMSGNPTEHMLALHAL 185

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQNAL++ +G L   PS+G REV+I+  AL + DPGDI +TI      K+
Sbjct: 186 RSEDPKGLPSLQNALEMARGSLYHTPSHGTREVIIILGALLSSDPGDIHQTITSLVADKV 245

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAE-FAIAN 174
           R  +IGL+A++ +C+ +C  T      SY VA+DE H  +L+++   PP   ++ ++ ++
Sbjct: 246 RVGIIGLAADVAVCREICAKTNDGNDSSYGVAIDEKHLWDLLLDVTTPPITYSQRYSSSS 305

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E   S+C+CH +   G GY C RC ++VC LP +C  C L L+ S HLA
Sbjct: 306 LLMMGFPSRTVEPFPSLCACHSKPVRG-GYMCSRCGSKVCSLPAECPTCNLTLILSTHLA 364

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNR-SRS-TCFGCQQSLLS----------SGNKPGLY 282
           RSYHHLFP+  + EV       P  R SRS +CF C     +            +  G+ 
Sbjct: 365 RSYHHLFPLINWVEV-------PWKRASRSISCFACGNLFPAVPPRDKWDKRDKSIKGMS 417

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           V+    C  C  HFC++CD++ HE +HNCPGC+S
Sbjct: 418 VSSRYECTVCHHHFCIDCDVFAHEVVHNCPGCQS 451


>gi|389741483|gb|EIM82671.1| Ssl1-domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 382

 Score =  246 bits (629), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 117/261 (44%), Positives = 177/261 (67%), Gaps = 14/261 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +    FV E+FDQNPL Q+G+V ++ G+   + ++ G+P+  +KAL   
Sbjct: 112 DLRPTRFDLSLEYAREFVTEWFDQNPLGQVGVVGMRAGLGERVGEMSGNPQDVLKALSER 171

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            KL  +G+ SLQNA+++ +  +S +P++  RE+L+++ +L+T DPG+I +T+Q C + KI
Sbjct: 172 HKLEPTGEPSLQNAIEMARSSMSHLPTHSSREILVIFGSLTTVDPGNIYDTLQSCIKDKI 231

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP--AIAEFA------ 171
           R S++ L+AEM IC+ LC  TGG + VA++E HFK+L+ E  PPP   AIA         
Sbjct: 232 RISIVSLAAEMKICRELCDKTGGQFGVAMNEGHFKDLLFELIPPPAQRAIARTTGGPGAA 291

Query: 172 --IANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
              A+L+ MGFP R  + S+ S+C CH E++   G+ CPRC ++VC++PTDC +CGL +V
Sbjct: 292 NPAADLMMMGFPTRLPDTSLPSLCVCHAELR-SAGFLCPRCMSKVCDVPTDCDVCGLMIV 350

Query: 229 SSPHLARSYHHLFPIAPFDEV 249
           SSPHLARSYHHLFP+ P+  V
Sbjct: 351 SSPHLARSYHHLFPVKPYAAV 371


>gi|164662116|ref|XP_001732180.1| hypothetical protein MGL_0773 [Malassezia globosa CBS 7966]
 gi|159106082|gb|EDP44966.1| hypothetical protein MGL_0773 [Malassezia globosa CBS 7966]
          Length = 349

 Score =  246 bits (628), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 131/348 (37%), Positives = 192/348 (55%), Gaps = 43/348 (12%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D RP+R  +  +    FV ++FDQNP+ Q+ ++  +DG+A  L+ +GG+   H   L   
Sbjct: 5   DLRPTRFDLTLQLARQFVADYFDQNPIGQLAILCARDGLAERLSLMGGNTFDHGAVLSSR 64

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            KL   G+ S+QNAL++    L+ +PS   RE+L + ++L++ DPG+I  TI K  +  I
Sbjct: 65  RKLEPRGEPSIQNALEMALSSLTHLPSSNTREILFISASLTSVDPGNIYHTIDKLVDEHI 124

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIA------ 173
           + SVI L+AEM + K  C  TGG ++V ++E H+KEL+M+H  PP  I E + +      
Sbjct: 125 QLSVISLAAEMHVLKDACARTGGDFNVVMNEDHYKELLMQHV-PPRVITEASSSSHLAHS 183

Query: 174 ---NLIKMGFPQRAG-EGSISICSCHKEVKVG-------------VGYTCPRCKARVCEL 216
              +L+ MGFP+R       S C+CH  +  G              GYTCPRC ++VC++
Sbjct: 184 NECDLLVMGFPRRIPFHTPASFCACHGRLVTGSRSLSYADRGAAPAGYTCPRCASKVCQV 243

Query: 217 PTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-------- 268
           PTDC  CG+ ++ S HLARSYHHLFP+  ++   P+        S S CF C        
Sbjct: 244 PTDCPTCGITIIMSTHLARSYHHLFPVQNYE---PVAWEAVSTTSPSNCFACAFPFPPRP 300

Query: 269 ------QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
                    ++ S   P     CPKC +HFCLECD ++ E LH CPGC
Sbjct: 301 SNLDETSHVVMDSTLAPSSRYMCPKCHQHFCLECDAFVQEQLHTCPGC 348


>gi|346979033|gb|EGY22485.1| suppressor of stem-loop protein [Verticillium dahliae VdLs.17]
          Length = 461

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 135/346 (39%), Positives = 201/346 (58%), Gaps = 41/346 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D  P+R+ +  +    FV+E+F+QNP+SQ+G+V ++DGVA  ++++GG+P  H++A+   
Sbjct: 101 DLLPTRLRLTWRLAREFVKEYFEQNPISQMGIVGMRDGVAVRVSEMGGNPVEHVEAINKW 160

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            ++   G  SLQNAL++ +G L   PS+G REVL++Y AL + DPGDI +TI      +I
Sbjct: 161 AEIDPQGSPSLQNALEMCRGALFHAPSHGTREVLVVYGALLSSDPGDIHDTIANLLADRI 220

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
           R S++GL+A++ IC+ LC+ T       Y++ALDE H++EL++    PP    +  + A+
Sbjct: 221 RVSIVGLAAQVAICEELCRKTNAGDATQYNIALDEMHYRELLLRTTTPPVTRTQAQSTAS 280

Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
           L+ MGFP R  A   + + C+CH    V  GY C RC  RVC LP +C  CGL L+ S H
Sbjct: 281 LLMMGFPSRTLAPADTTAYCACHN-TPVREGYLCTRCGTRVCRLPAECPACGLTLILSTH 339

Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC--------------QQSLLSSGNK 278
           LARSYHHLFP+  + EV P      R  +   C+ C              + +    G+K
Sbjct: 340 LARSYHHLFPLRNWVEV-PWA----RAAASRACYACLAPFPDPPPQRRGHRTNGTDGGDK 394

Query: 279 PGLYV------------ACPKCKKHFCLECDIYIHESLHNCPGCES 312
               V            AC  C  HFC++CD++ HE +HNCPGC+S
Sbjct: 395 EAAPVPKQKGVSESSRYACQVCGNHFCIDCDVFAHEVIHNCPGCQS 440


>gi|353237926|emb|CCA69887.1| probable SSL1-TFIIH subunit (transcription initiation factor),
           factor B [Piriformospora indica DSM 11827]
          Length = 386

 Score =  245 bits (625), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 119/268 (44%), Positives = 177/268 (66%), Gaps = 21/268 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  +  +    FV E+FDQNPL QIG+V +++GVA  +  L G+P   +K +  +
Sbjct: 120 DLRPNRFDLSLEYSREFVTEWFDQNPLGQIGIVGMREGVAERICGLSGNPPDVLKTIADR 179

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             L  +G+ SLQNA+++ +G +S +P++  REVL+++ +L+TCDPG+I ET++ C E KI
Sbjct: 180 RNLEPAGEPSLQNAIEMARGSMSHLPTHSSREVLVIFGSLTTCDPGNIHETLKSCVEDKI 239

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP-----AIAEFA--- 171
           R SV+ L+AEM IC+  C +TGG + VA++E H+++++ EH PPP      A+A+ +   
Sbjct: 240 RVSVVALAAEMKICRDFCTETGGLFGVAMNEGHYRDMLFEHVPPPAVRKIEAVAKISNGN 299

Query: 172 ---------IANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCR 221
                       L+ MGFP R  E S  S+C CH  +K   GY CPRC ++VC++PTDC 
Sbjct: 300 GAGPAPAAATGELMMMGFPTRLPETSAPSLCVCHGIMK-SEGYICPRCLSKVCDIPTDCN 358

Query: 222 ICGLQLVSSPHLARSYHHLFPIAPFDEV 249
           +CGL +VS+PHLARSYHHLFP+ P+  V
Sbjct: 359 VCGLVIVSAPHLARSYHHLFPVKPYQPV 386


>gi|390351315|ref|XP_782325.3| PREDICTED: general transcription factor IIH subunit 2-like
           [Strongylocentrotus purpuratus]
          Length = 500

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 122/260 (46%), Positives = 173/260 (66%), Gaps = 14/260 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D +P+R+A   K +E F+ E+FDQNP+SQ+G++T  +  A  LT+LGG+P+ HI AL   
Sbjct: 76  DLKPNRLACCTKLLEHFIEEYFDQNPISQVGIITSSNMRAEKLTELGGNPQRHITALEKC 135

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
               C  + SLQNAL+L    L  +P +  RE+L++  +L+TCDPG+I +TIQ  K+  I
Sbjct: 136 NDKPCVKEPSLQNALELAAATLRHMPGHASREILVIMGSLTTCDPGNIHDTIQAMKDHSI 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RC VIGL+A++ +C+ L   T G+Y V LDE+HFKEL+MEH  PPPA       + I+MG
Sbjct: 196 RCCVIGLAADVRVCRKLATVTHGTYGVILDETHFKELLMEHTIPPPARVNTE-PSPIRMG 254

Query: 180 FPQ------RAGEGSISICSCHKEVKVGV-----GYTCPRCKARVCELPTDCRICGLQLV 228
           FPQ      ++   + S+C CH + K        GY CP+C+++ CELP +CR+CGL LV
Sbjct: 255 FPQHVIHTDKSKVSAPSMCMCHLDGKNSEGFGTGGYFCPQCQSKYCELPVECRVCGLTLV 314

Query: 229 SSPHLARSYHHLFPIAPFDE 248
           S+PHLARS+HHLFP+  F+E
Sbjct: 315 SAPHLARSFHHLFPLDRFEE 334



 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 44/98 (44%), Positives = 56/98 (57%), Gaps = 8/98 (8%)

Query: 220 CRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP 279
           C   GL LVS+PHLARS+HHLFP+  F+E       D  +     C GCQ  +     + 
Sbjct: 397 CLSSGLTLVSAPHLARSFHHLFPLDRFEEFK---REDHDHPDSLFCQGCQSHI-----RD 448

Query: 280 GLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRHSN 317
                CPKC   FCL+C+++I ESLH+CPGC S R S 
Sbjct: 449 QTAYRCPKCSNVFCLDCELFIQESLHSCPGCASTRPSQ 486


>gi|429848008|gb|ELA23542.1| tfiih basal transcription factor complex p47 subunit
           [Colletotrichum gloeosporioides Nara gc5]
          Length = 488

 Score =  243 bits (620), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 139/357 (38%), Positives = 197/357 (55%), Gaps = 54/357 (15%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D  P+R  V       FV+E+F+QNP+SQ+G++ ++DGVA  ++D+ G+P  H++ L   
Sbjct: 110 DMLPNRYRVTWAYAADFVKEYFEQNPISQLGIIGMRDGVALRISDMSGNPTDHLEKLKSF 169

Query: 62  LGC--SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            G   SG+ SLQNAL++ +G L   PS+G REVL+++ AL + DP DI ETI      +I
Sbjct: 170 EGQDPSGNPSLQNALEMCRGALFHAPSHGTREVLVVFGALLSSDPADIHETISSLITDRI 229

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
           R SV+GL+A++ IC  LC  T       YSV ++++HF++L++    PP    A  + A+
Sbjct: 230 RVSVVGLAAQVAICAELCSRTNSGDDTQYSVCMNDAHFRDLLLAATTPPVTRTAAQSTAS 289

Query: 175 LIKMGFPQRA---GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
           L+ MGFP R    GE + +IC+CH +     GY C RC  RVC LP +C  CGL L+ S 
Sbjct: 290 LLMMGFPSRTLVRGEAT-AICACHNK-PAREGYLCTRCSTRVCRLPIECPACGLTLILST 347

Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC----------------------- 268
           HLARSYHHLFP+  + EV P      +  S   C+ C                       
Sbjct: 348 HLARSYHHLFPLRNWVEV-PWA----KAGSSKACYSCLTAFPEPPRASKKRDKEKANDND 402

Query: 269 -------------QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
                          S L   ++ G Y AC  C  HFC++CD+Y HE +HNCPGC+S
Sbjct: 403 GVTTTTPVAPTPVAPSELKGVSESGRY-ACTVCNSHFCIDCDVYAHEVIHNCPGCQS 458


>gi|346464883|gb|AEO32286.1| hypothetical protein [Amblyomma maculatum]
          Length = 448

 Score =  243 bits (620), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 125/281 (44%), Positives = 183/281 (65%), Gaps = 14/281 (4%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV EFFDQNP+S +G+++  +  A  L +L G+    ++AL   
Sbjct: 75  DLKPTRLICTLKMLELFVEEFFDQNPISNVGVISTCNKRAQKLCELAGNTGKVVEALKSC 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                SG+ SLQN+L+L   +L  +P +  REVLI+  +L+TCDPG+I  TI+  ++  I
Sbjct: 135 KTQVPSGEPSLQNSLELAAEVLRHLPVHTSREVLIIMGSLTTCDPGNIQTTIETMRKCNI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCS++GL+AE+++C  L + TGG+Y+V LDE+HFK+++ +HA PPP     A ++LI+MG
Sbjct: 195 RCSIVGLAAEVYVCCRLTKTTGGTYNVILDENHFKDILFQHAIPPPVTGN-AESSLIRMG 253

Query: 180 FP--QRAGEGSISICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
           FP  +   EG  S+C CH + +         GY CP+C  + C LP +C++CGL LVS+P
Sbjct: 254 FPYHRTETEGKPSLCFCHLDSQNPSDGLSKGGYFCPQCNGKYCTLPVECKVCGLTLVSAP 313

Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL 272
           HLARSYHHLF + PFDEV P  L  P + +  TCF CQ S+
Sbjct: 314 HLARSYHHLFALEPFDEVPPDTL--PAD-TPVTCFACQVSI 351


>gi|308501637|ref|XP_003113003.1| hypothetical protein CRE_25217 [Caenorhabditis remanei]
 gi|308265304|gb|EFP09257.1| hypothetical protein CRE_25217 [Caenorhabditis remanei]
          Length = 376

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 132/310 (42%), Positives = 185/310 (59%), Gaps = 20/310 (6%)

Query: 5   PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
           PSR  VV K +++F+  FF+QNP++QIGL+T KD  A  +T + G+     ++L  + + 
Sbjct: 82  PSRFVVVMKALQSFLERFFEQNPIAQIGLITCKDRKAERMTMMTGNIRVLKESLNSLTEA 141

Query: 63  GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
            C GD SLQNAL L    L  +P +  REV+I+ +ALS+ DPG+I  TI+  K   IRCS
Sbjct: 142 FCGGDFSLQNALQLACANLKGMPGHVSREVVIVMAALSSIDPGNIFSTIESMKRMNIRCS 201

Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
            IGLSAEMFICK + + T G YSVALD  H + L  +H  PP + A+ +  N I +GFP 
Sbjct: 202 AIGLSAEMFICKEMAKATKGEYSVALDPDHLQLLFSKHTLPPSS-AKSSECNAIHVGFPH 260

Query: 183 RAGEGSISICSCHKEVK--VGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
                + S C CH + K     G+ C +C AR C +P +C +C L LV++P LAR++ HL
Sbjct: 261 HELIKTRSFCVCHPDSKPVSSRGFICTQCGARHCSIPAECPVCKLTLVAAPQLARAFRHL 320

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
            P+A F ++         + +R  C+ C+  L S G +      C KC+  FCL+CD  +
Sbjct: 321 QPLAAFQQI---------DVARGQCYACETRLASEGFR------CEKCRSVFCLDCDTLL 365

Query: 301 HESLHNCPGC 310
           HESLH CPGC
Sbjct: 366 HESLHVCPGC 375


>gi|449300691|gb|EMC96703.1| hypothetical protein BAUCODRAFT_486555 [Baudoinia compniacensis
           UAMH 10762]
          Length = 459

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 134/349 (38%), Positives = 193/349 (55%), Gaps = 39/349 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +       FVRE+F+QNP+SQ+G+  +++G+A  ++D+ G+P  HI AL  +
Sbjct: 104 DLRPTRHLLTINYTIMFVREYFEQNPISQLGIFGMREGLAIRISDMSGNPNDHIAALKQL 163

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G+ SLQNALD+ +  L   PS+G REV+I+  AL + DPGDI +TI+ C   +I
Sbjct: 164 RSTEPKGNPSLQNALDMARAALYHTPSHGTREVVIVLGALLSSDPGDIHDTIKACIRDRI 223

Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPP------PAIAE 169
           R  +IGL+A+M IC  +C+ T G     Y+VA+DE  +++ +M+   PP          +
Sbjct: 224 RVRIIGLAAQMHICAEICRKTNGGDENYYNVAVDEVDYRQQLMDITTPPVLRSTDTEAQK 283

Query: 170 FAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
              A L+ MGFP R  E   ++C+CH  +  G GY C RCKA+VC LP  C  C L L+ 
Sbjct: 284 HNQAALLMMGFPSRIVEEKPTLCACHGNLTRG-GYLCSRCKAKVCNLPATCPTCDLTLIL 342

Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC--------------------- 268
           S HLARSYHHLFP+  + EV+       R      CFGC                     
Sbjct: 343 STHLARSYHHLFPLRNWVEVS---WTRARQEGSIQCFGCLTPFPRIPNPQEAVNGTEHDV 399

Query: 269 --QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRH 315
              ++  + G        C  C +HFC++CD++ HE +HNCPGC+S  H
Sbjct: 400 ASARTKRAEGASESSRYECETCGQHFCIDCDVFCHEVVHNCPGCQSNAH 448


>gi|190348397|gb|EDK40844.2| hypothetical protein PGUG_04942 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 488

 Score =  241 bits (616), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 136/374 (36%), Positives = 207/374 (55%), Gaps = 68/374 (18%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D RP+R A+  + ++ F+ EFFDQNP+SQ+G+V +++GV+N ++++ GSP+ HI  L   
Sbjct: 116 DLRPNRFAMTLQLLQEFIVEFFDQNPISQLGIVMMRNGVSNLVSEVNGSPQYHIDKLRSL 175

Query: 60  -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
                 +    GD SLQN+L++ + +L      +   +   +EVLIL+ AL T DPGDI 
Sbjct: 176 RSRQHNRFEPKGDPSLQNSLEMARAMLKYNFGTNMDDNKNSKEVLILFGALFTSDPGDIH 235

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDT--------------GGSYSVALDESHFK 154
           +TI    + +I+  +IGLSA++ IC+ +   T              G +YSV ++E+H++
Sbjct: 236 KTIDNLIKDEIKVKIIGLSAQVSICQQIANKTNNLGSRTTGANSSSGSNYSVIMNETHYR 295

Query: 155 ELIMEHAPPPPAIAEFAIAN----LIKMGFPQR------AGEGSISI-----CSCH---- 195
           EL+M+   P P  A    +     LIKMGFP +      A   ++++     C+CH    
Sbjct: 296 ELLMDCVVPLPVAATEKRSTVGVPLIKMGFPSKVSPPISASNNTVNLDLPVLCACHPTNG 355

Query: 196 -----KEVKVG-------VGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPI 243
                  V +G       +GY CP+C++RVC LPT C  CGL L+ S HLARSYHHL P+
Sbjct: 356 SLESKDAVTIGAESSTSIIGYQCPQCRSRVCHLPTICPTCGLMLILSTHLARSYHHLVPL 415

Query: 244 APFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA-----CPKCKKHFCLECDI 298
             + EV         +   + C+GC      +G   G   A     CP C + FC++CD+
Sbjct: 416 REYKEVPA-----ADHYEGTHCYGCLLKFPEAGGPDGPLRASSRYRCPTCARDFCIDCDV 470

Query: 299 YIHESLHNCPGCES 312
           ++HE+LHNCPGCES
Sbjct: 471 FVHETLHNCPGCES 484


>gi|256075586|ref|XP_002574099.1| btf [Schistosoma mansoni]
 gi|360045424|emb|CCD82972.1| putative btf [Schistosoma mansoni]
          Length = 423

 Score =  241 bits (616), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 139/348 (39%), Positives = 196/348 (56%), Gaps = 46/348 (13%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    K    FVRE+FDQNP+SQ+G++   D  A  LT+L G+P  H+ AL  +
Sbjct: 85  DLKPNRLICTVKAACTFVREYFDQNPISQLGIIVTSDRRAERLTELSGNPRPHLAALQSL 144

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
               C G+ SLQNAL L +  L    +  H E+++L ++L+TCDP DI +TI+    ++I
Sbjct: 145 YTRTCIGEPSLQNALLLAESRLKY--TIHHNEIVVLMASLTTCDPSDIHQTIKSLSSNRI 202

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSVI L+ E+F+ + L Q T G++ V +DE H K ++ +  PPP A  E   ANLI+MG
Sbjct: 203 RCSVISLAVEVFVYRALAQLTQGTFHVIMDELHLKNVLKDLVPPPVAAIETE-ANLIRMG 261

Query: 180 FPQRA----GEGSISICSCHKEVKVGVG---------YTCPRCKARVCELPTDCRICGLQ 226
           FP          ++  C CH   K             Y CPRC +  CELP +C +CGL 
Sbjct: 262 FPHSETFDLDRFAVKRCMCHLSNKPTNDKTDTTTNPHYACPRCHSAYCELPVECNVCGLT 321

Query: 227 LVSSPHLARSYHHLFPIAPFDEV--------TPLCLNDPRNRSRST-------------C 265
           LV++PHLAR+YHHLFP+  F+ V        T L   D + +S+ T             C
Sbjct: 322 LVAAPHLARAYHHLFPLDLFEPVNGSGSFSETSL---DNQEKSKETIDSADSVDHQQRLC 378

Query: 266 FGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESL 313
           FGC  +++     PG    CPKCK  FC  CD  +H+S+H+CPGC +L
Sbjct: 379 FGC--NVIIPSKIPG--YQCPKCKFIFCHSCDAVLHDSIHSCPGCLTL 422


>gi|392593840|gb|EIW83165.1| Ssl1-domain-containing protein [Coniophora puteana RWD-64-598 SS2]
          Length = 375

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 116/259 (44%), Positives = 175/259 (67%), Gaps = 18/259 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D RP+R  +  +    F+ E+FDQNPL QIG+V ++ GV   + ++ G+P+  + ++   
Sbjct: 118 DMRPTRFDLTLEYTREFITEWFDQNPLGQIGVVGMRAGVGERIGEMSGNPQEVLGSIADR 177

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K+  +G+ SLQNA+++ +  +S +P++  RE+LI+  +L+TCDPG+I +T+  C ++KI
Sbjct: 178 HKIEPTGEPSLQNAIEMARNGMSHLPTHSSREILIVIGSLTTCDPGNIHDTLDSCVKNKI 237

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAI------- 172
           R SV+ L+AEM IC+ LC  TGG + VAL+E HFK+L+ E  PPP   A+ A+       
Sbjct: 238 RVSVVALAAEMKICRELCDKTGGQFGVALNEGHFKDLLFELVPPP---AQRAVNRAGGGA 294

Query: 173 ----ANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQL 227
               A+L+ MGFP R  + +  S+C CH ++K   G+ CPRC A+VC++PTDC ICGL +
Sbjct: 295 GSTSADLMMMGFPTRLPDTTPPSLCVCHSDLK-SEGFLCPRCMAKVCDVPTDCDICGLMI 353

Query: 228 VSSPHLARSYHHLFPIAPF 246
           VSSPHLARSYHHLFP+ P+
Sbjct: 354 VSSPHLARSYHHLFPVRPY 372


>gi|149248450|ref|XP_001528612.1| suppressor of stem-loop protein 1 [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146448566|gb|EDK42954.1| suppressor of stem-loop protein 1 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 509

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 144/393 (36%), Positives = 205/393 (52%), Gaps = 93/393 (23%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D RP+R+++    ++ FV EFFDQNP+SQIG+V +++GVAN ++++ G P+ HI  L   
Sbjct: 126 DLRPTRLSMTLNYLQEFVVEFFDQNPISQIGIVLMRNGVANLVSEVSGLPQYHIDKLRQL 185

Query: 60  -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
                 K    GD SLQN L++ + LL      S   +   +E+L++  +L T DPG+I 
Sbjct: 186 KARQHNKFEPKGDPSLQNTLEMARSLLKFNFGTSSNNTKNSKEILVVLGSLFTSDPGNIH 245

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTG--------GSYSVALDESHFKELIMEH 160
           +TI+   + +I+  VIGLSA++ IC+ L   T          +Y V ++E HFKEL+M+ 
Sbjct: 246 KTIEGLVKDEIKVRVIGLSAQVAICQELVNRTNHEPKNTMSKNYGVIMNEYHFKELLMDC 305

Query: 161 APPPPAIAEF------------AIANLIKMGFPQRA------------------------ 184
             P P   EF            A   +IKMGFP +A                        
Sbjct: 306 VTPLPLTEEFRKPTESTNQGGTAGVPVIKMGFPSKAQPNVTSAIGNTEFTVEFPNLNASY 365

Query: 185 -GEGS--------ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
             +GS        +S  S  +   + +GY CP+CK++VC LPT C +CGL L+ S HLAR
Sbjct: 366 PTQGSAESKDVIEVSNNSLRQSASLIIGYRCPQCKSKVCNLPTICPVCGLMLILSTHLAR 425

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQ----------------QSLLSSGNKP 279
           SYHHL P+AP+ EV      D        C+GCQ                 S++SS  K 
Sbjct: 426 SYHHLVPLAPYVEVEVKSAYDSEE-----CYGCQLRFPKGVDLKVEKKSLDSMISSRYK- 479

Query: 280 GLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
                CPKCKK FC+ CD+++HE LHNCPGCE+
Sbjct: 480 -----CPKCKKDFCINCDVFVHEVLHNCPGCEN 507


>gi|390602659|gb|EIN12052.1| Ssl1-domain-containing protein [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 377

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 114/258 (44%), Positives = 174/258 (67%), Gaps = 11/258 (4%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +    F+ E+FDQNPL QIG+V ++ GV   +  + G+P+  +KA+   
Sbjct: 121 DMRPTRFGLTLEYAREFITEWFDQNPLGQIGIVGMRAGVGERIGQMSGNPQDVLKAISER 180

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K+  +G+ SLQNA+++ +  +S +P++  +E+LI++ +L+T DPG+I +T+  C + +I
Sbjct: 181 HKMEPNGEPSLQNAIEMARSSMSHLPTHSSKEILIVFGSLTTVDPGNIHDTLDACIKDRI 240

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP-------AIAEFAI 172
           R S++ L+AEM IC+ +C  TGG + VA++E H+K+L+ E  PPP        A +  A 
Sbjct: 241 RISIVALAAEMKICREICDKTGGQFGVAMNEGHYKDLLFELIPPPAQRALTSKAGSGGAT 300

Query: 173 ANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
           A+L+ MGFP R  + S  S+C CH E+K   G+ CPRC A+VC++PTDC ICGL +VSSP
Sbjct: 301 ADLMIMGFPTRLPDTSPPSLCVCHSELK-SEGFLCPRCLAKVCDVPTDCDICGLMIVSSP 359

Query: 232 HLARSYHHLFPIAPFDEV 249
           HLARSYHHLFP+ P+  V
Sbjct: 360 HLARSYHHLFPVKPYTAV 377


>gi|302680178|ref|XP_003029771.1| hypothetical protein SCHCODRAFT_82959 [Schizophyllum commune H4-8]
 gi|300103461|gb|EFI94868.1| hypothetical protein SCHCODRAFT_82959 [Schizophyllum commune H4-8]
          Length = 400

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 113/263 (42%), Positives = 175/263 (66%), Gaps = 19/263 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  ++ +    FV E+FDQNPL QIG+V ++ G+   + ++ G+P+  +K++   
Sbjct: 116 DMRPTRFDLMLQYAREFVVEWFDQNPLGQIGVVGMRSGIGERICEMTGNPQDVLKSISER 175

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            +L   G+ SLQNA+++ +  +S +P++  RE+L+++ +L+TCDPGDI +T+  C   KI
Sbjct: 176 HRLEPQGEPSLQNAIEMARSSMSHLPTHSSREILLIFGSLTTCDPGDIHDTLDTCVRQKI 235

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPP------------PAI 167
           R SV+ L+AEM IC+  C  TGG + VAL+E HF++++ E  PPP            P  
Sbjct: 236 RVSVVALAAEMKICREFCDKTGGQFGVALNEGHFRDILFELVPPPAQRALGGAGMGAPGK 295

Query: 168 AEFA---IANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRIC 223
           A  A    A+L+ MGFP R  + +  S+C+CH E++   G+ CPRC+++VC++PTDC +C
Sbjct: 296 ATAASGPSADLMMMGFPTRLPDTTPPSLCACHGELR-SAGFLCPRCQSKVCDVPTDCDVC 354

Query: 224 GLQLVSSPHLARSYHHLFPIAPF 246
           GL +VSSPHLARSYHHLFP+ P+
Sbjct: 355 GLMIVSSPHLARSYHHLFPVRPY 377


>gi|223999581|ref|XP_002289463.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974671|gb|EED93000.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 422

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 135/344 (39%), Positives = 189/344 (54%), Gaps = 49/344 (14%)

Query: 7   RMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL-------M 59
           R+  +      FV E++DQNPL  +G+V  +DG A  LT LGGSP+ H  AL       M
Sbjct: 86  RLETIINLASEFVNEYYDQNPLGHLGVVICQDGEAEMLTTLGGSPKKHKLALGAALVTEM 145

Query: 60  GKLGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIM-ETIQKCKE 116
            + G S  G+ SLQN +++    L   P YG RE++++ SAL+TCDPGDI+ ET+ +   
Sbjct: 146 KRRGPSIGGEFSLQNGIEVAGRSLGYAPRYGSREIIVITSALATCDPGDILGETLPRLLH 205

Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIA-------- 168
           + IR S + L AE+ ICK L + TGG   V +D  H ++L+M HA PPP +         
Sbjct: 206 AGIRVSAVSLQAELHICKKLAEVTGGLAGVCMDSRHLRDLVMGHAIPPPVVNTSKADDGE 265

Query: 169 -----------EFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP 217
                         I   + MGFP R  +      S         GY CPRC++R  +LP
Sbjct: 266 TTTAHNGRNHNHTMICEFVTMGFPSRDDDRR----SNRLIAFARTGYVCPRCRSRCSDLP 321

Query: 218 TDCRICGLQLVSSPHLARSYHHLFPIAPFDEVT-PLCLNDPRNRSRSTCFGCQQSL---- 272
           +DC +CGL+L+ +PHLAR++HHLFP+ PF+E+   L +N  R      CFGC + +    
Sbjct: 322 SDCAVCGLRLILAPHLARTFHHLFPVRPFEELDEALVINCDR-----CCFGCLKVIGCKP 376

Query: 273 LSSGNKPG------LYVACPKCKKHFCLECDIYIHESLHNCPGC 310
           + S +  G      L   CP+C   FC +CD Y+HE+LHNCPGC
Sbjct: 377 MDSTDGAGDEGESILRFQCPECHNAFCPDCDAYLHETLHNCPGC 420


>gi|380482556|emb|CCF41168.1| hypothetical protein CH063_11526 [Colletotrichum higginsianum]
          Length = 480

 Score =  239 bits (609), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 135/362 (37%), Positives = 199/362 (54%), Gaps = 52/362 (14%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D  P+R  V       FV+E+F+QNP+SQ+ ++ ++DGVA  ++D+ G+P  H++ L   
Sbjct: 104 DMLPNRYRVAWAYAADFVKEYFEQNPISQLAIIGMRDGVAIRISDMSGNPTDHLEKLKTF 163

Query: 62  LGC--SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            G   SG+ SLQNAL++ +G L   PS+G REVLI++ AL + DPGDI +TI      +I
Sbjct: 164 EGQDPSGNPSLQNALEMCRGALFHAPSHGTREVLIVFGALLSSDPGDIHDTITSLIADRI 223

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
           R S++GL+A++ IC  LC  T       YSV ++++HF++L++    PP    A  + A+
Sbjct: 224 RVSIVGLAAQVAICAELCSRTNAGDDTQYSVCMNDAHFRDLLLAATTPPVTRTAAQSTAS 283

Query: 175 LIKMGFPQRA---GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
           L+ MGFP R    GE + ++C+CH +     GY C RC  RVC LP +C  C L L+ S 
Sbjct: 284 LLMMGFPSRTLAQGEAT-AVCACHNK-PAREGYLCTRCGTRVCRLPIECPACSLTLILST 341

Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL------------------- 272
           HLARSYHHLFP+  + EV    L     +S++ C+ C  +                    
Sbjct: 342 HLARSYHHLFPLRNWVEV----LWAEATKSKA-CYSCLTTFPEPPKASRKKDKDKDSDGV 396

Query: 273 ---------------LSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRHSN 317
                          L   ++ G Y  C  C  HFC++CD+Y HE +HNCPGC+S     
Sbjct: 397 PTTTPVAPTPVIASELKGVSESGRY-GCTVCGNHFCIDCDVYAHEVIHNCPGCQSDTRGA 455

Query: 318 PI 319
           P+
Sbjct: 456 PM 457


>gi|240277005|gb|EER40515.1| transcription factor TFIIH complex subunit Ssl1 [Ajellomyces
           capsulatus H143]
 gi|325094945|gb|EGC48255.1| transcription factor TFIIH [Ajellomyces capsulatus H88]
          Length = 505

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 142/356 (39%), Positives = 202/356 (56%), Gaps = 49/356 (13%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV EFF+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 134 DLRPTRYLLTLRYAQEFVLEFFEQNPISQLGVLGMRDGLAVRISDMSGNPTDHIMAIQAL 193

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQNAL++ +G L   P++G REVLI++ AL + DPGDI +TI      KI
Sbjct: 194 RPKDPKGMPSLQNALEMARGTLFHTPTHGTREVLIIFGALLSSDPGDIHQTITALVADKI 253

Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIME-HAPPPPAIAEFAI-- 172
           R S+IGL+A++ IC+ LC  T       Y +AL+E HF+EL M   APPP  +A  +   
Sbjct: 254 RISIIGLAAQVAICRDLCGRTNNGDDTVYGIALNEQHFRELFMNVTAPPPTTVAPTSTPT 313

Query: 173 ----------------ANLIKMGF--PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
                           ++L+ MGF         + S+C+CH +     GY C RC A+VC
Sbjct: 314 ASSANGPKTTTTNTTTSSLLMMGFPSLTLTTTPTPSLCACHSKPSRA-GYLCCRCNAKVC 372

Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-----Q 269
            LP+ C  CGL L+ S HLARSYHHLFP+  + EV+       R RS ++CF C     +
Sbjct: 373 TLPSSCPCCGLTLILSTHLARSYHHLFPLMNWVEVS--WRRAARKRS-ASCFACGVGFPR 429

Query: 270 QSLLSSGN-----KPGLYVA--------CPKCKKHFCLECDIYIHESLHNCPGCES 312
              L SG      K  L V         CP C+ HFC++CD++ HE +HNCPGC+S
Sbjct: 430 MPKLVSGEPEETAKAALGVGVSVSGRYECPVCECHFCIDCDVFAHEVVHNCPGCQS 485


>gi|193627420|ref|XP_001947532.1| PREDICTED: general transcription factor IIH subunit 2-like
           [Acyrthosiphon pisum]
          Length = 388

 Score =  238 bits (608), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 124/321 (38%), Positives = 198/321 (61%), Gaps = 18/321 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +P+R     K ++ F++++FDQNP+SQ+GL+T ++  A  +++L G+ + H++ L   
Sbjct: 74  DLKPTRQLCTYKLLQDFIQDYFDQNPISQLGLITTRNSSAERISELSGNRKYHLEELKKT 133

Query: 62  LG----CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKES 117
            G    C+G  S+QN+L++   ++  +PS+  +E+LI+ S+LS+CDPG+I  +I+  K  
Sbjct: 134 FGDFNYCNGLMSVQNSLEIALSVMKMLPSHTSKEILIIGSSLSSCDPGEINTSIEMLKTH 193

Query: 118 KIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIK 177
            IR S+I L+AE+ + +HLC +T G ++V +D+ HFK ++     P P +     A+ +K
Sbjct: 194 NIRVSMIHLAAEVRMFRHLCNETKGKHNVIVDDVHFKHILWSLVEPVP-LPNSVDASCVK 252

Query: 178 MGFPQRAGEG-SISICSCH--KEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
           MGFPQ   +    + CSCH  +  K+   G+ CP+C ++ CELP +C+ CGL LVSS HL
Sbjct: 253 MGFPQELEQKPPFTTCSCHLAEGGKLNAKGFFCPQCNSKYCELPVECKCCGLILVSSLHL 312

Query: 234 ARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFC 293
           ARS HHL PI PF ++      +    S + C+GC++ +        +Y  C  CKKH+C
Sbjct: 313 ARSLHHLVPIKPFIKI------ELEEGSSAYCYGCRKRIKVPAE--NVYF-CESCKKHYC 363

Query: 294 LECDIYIHESLHNCPGCESLR 314
             CDIY+H +LH CPGC   R
Sbjct: 364 DGCDIYVHNTLHVCPGCAVKR 384


>gi|388856633|emb|CCF49750.1| probable SSL1-TFIIH subunit (transcription initiation factor),
           factor B [Ustilago hordei]
          Length = 506

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 129/353 (36%), Positives = 191/353 (54%), Gaps = 46/353 (13%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  V  +  + FV E+FDQNP+ Q+ ++  ++G+A  L  +GG+   H  +L  K
Sbjct: 154 DMRPNRFDVTLQYAKEFVMEYFDQNPIGQLSVIGTREGIAERLAMMGGNTVDHTASLSNK 213

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             L   G+ SLQNAL++ +  L  +P+   RE+L ++ +L+TCDPG+I +TI    +  I
Sbjct: 214 RRLEPRGEPSLQNALEMARSSLVHLPASNSREILAIFGSLTTCDPGNIHDTINTLVKDNI 273

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA------------- 166
           R S++ L+AE+ + K +C  TGG++SVAL+E HF + + EH PPP               
Sbjct: 274 RVSIVHLAAEVKVFKDVCTRTGGTFSVALNEGHFHDSLFEHVPPPAVEGPRRPKRRIIAA 333

Query: 167 --------IAEFAIAN---LIKMGFPQR-AGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
                     E  +AN   L++M FP R     + ++C+CH   + G GY CPRC  +VC
Sbjct: 334 SGNGAATTEEEEEVANGVDLLQMAFPLRLPAHAAPTLCACHSRSR-GSGYLCPRCGVKVC 392

Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQ----- 269
           ++PTDC +CG+ +V S HLARSYHHLFP+ P  +V P        +    C  C      
Sbjct: 393 DVPTDCPVCGITIVMSTHLARSYHHLFPV-PNWKVVPWSALLSAAKVEPACLSCNLLFPT 451

Query: 270 ------------QSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
                       ++L  +G        C +C   FCLECD ++HE LH CPGC
Sbjct: 452 QEQGAARSAAANKALEEAGLSESSRYRCGRCSHDFCLECDAFVHEQLHVCPGC 504


>gi|322700310|gb|EFY92066.1| TFIIH basal transcription factor complex p47 subunit , putative
           [Metarhizium acridum CQMa 102]
          Length = 457

 Score =  237 bits (605), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 138/347 (39%), Positives = 192/347 (55%), Gaps = 54/347 (15%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D  P+R  +      AFVRE+F+QNP+SQ+G++ ++DGVA  ++DL G+P  HI+ L   
Sbjct: 112 DLLPTRYRLTISYALAFVREYFEQNPISQLGIIGMRDGVAVRISDLSGNPAEHIERLKAV 171

Query: 62  LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            G    G+ SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI ETI      KI
Sbjct: 172 EGQDPQGNPSLQNALEMCRGALFHTPSHGTREVLIIYGALLSSDPGDIHETIANLVTDKI 231

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
           R S++GLSA++ IC  LC  T       Y++A+DE HF++L +    PP    AE + A+
Sbjct: 232 RVSIVGLSAQVAICADLCSRTNAGDDSQYNIAMDEVHFRDLFLASTTPPVTRTAEQSTAS 291

Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
           L+ MGFP R  A  G+IS C+CH +  +  GY                  CGL L+ S H
Sbjct: 292 LLMMGFPSRTLAPGGAISFCACHSK-PLREGYNS----------------CGLTLILSTH 334

Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGC---------------------QQ 270
           LARSYHHLFP+  + EV+          +RST CF C                      +
Sbjct: 335 LARSYHHLFPLRNWVEVSWT------EATRSTACFSCLCPFPEPPKDKVDGVEKSKDDTR 388

Query: 271 SLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRHSN 317
            L + G       AC  C  HFC++CD++ H+ +HNCPGC+S+ H +
Sbjct: 389 HLKAKGVSESGRYACEVCGNHFCIDCDVFAHQVIHNCPGCQSIVHHD 435


>gi|17555086|ref|NP_499239.1| Protein T16H12.4 [Caenorhabditis elegans]
 gi|27923998|sp|P34567.3|TF2H2_CAEEL RecName: Full=General transcription factor IIH subunit 2; AltName:
           Full=TFIIH basal transcription factor complex subunit 2
 gi|15718246|emb|CAA83139.3| Protein T16H12.4 [Caenorhabditis elegans]
          Length = 376

 Score =  237 bits (605), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 130/310 (41%), Positives = 183/310 (59%), Gaps = 20/310 (6%)

Query: 5   PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
           PSR  VV K ++ F+  FF+QNP++QIGL+T KD  A+ LT + G+     ++L  + + 
Sbjct: 82  PSRFVVVMKALQTFLDRFFEQNPIAQIGLITCKDRKADRLTMMTGNIRVLKESLNTLTEA 141

Query: 63  GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
            C GD SLQNAL L    L  +P +  REV+++ SALST DPG+I  TI+  K   IRCS
Sbjct: 142 FCGGDFSLQNALQLACANLKGMPGHVSREVVLVISALSTIDPGNIYSTIETMKRMNIRCS 201

Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
            IGLSAEMF+CK + + T G YSVALD  H + L  +H  PP + A+ +  N I +GFP 
Sbjct: 202 AIGLSAEMFVCKEMAKATKGEYSVALDPDHLQLLFSKHTLPPSS-AKSSECNAIHVGFPH 260

Query: 183 RAGEGSISICSCHKEVK--VGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
                + S C CH + K     G+ C +C AR C +P +C +C L LV++P LAR++ HL
Sbjct: 261 HELITTRSFCVCHPDTKPISSRGFICTQCGARHCSIPAECPVCKLTLVAAPQLARAFRHL 320

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
            P++ F+++           +R  C+ C+  L   G +      C  C+  FCL+CD  +
Sbjct: 321 QPLSAFEQI---------EVTRGFCYACETRLSGEGFR------CGSCQLVFCLDCDTLL 365

Query: 301 HESLHNCPGC 310
           HESLH CPGC
Sbjct: 366 HESLHVCPGC 375


>gi|343426526|emb|CBQ70055.1| probable SSL1-TFIIH subunit (transcription initiation factor),
           factor B [Sporisorium reilianum SRZ2]
          Length = 511

 Score =  236 bits (603), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 127/353 (35%), Positives = 190/353 (53%), Gaps = 48/353 (13%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  V  +    FV E+FDQNP+ Q+ ++  + G+A  L  +GG+   H  +L  K
Sbjct: 162 DMRPNRFDVTLQYAREFVGEYFDQNPIGQLSIIGTRQGIAERLAMMGGNTVDHTASLSNK 221

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             L   G+ SLQNAL++ +  L  +P+   RE+L ++ +L+TCDPG+I +TI    +  I
Sbjct: 222 RRLEPRGEPSLQNALEMARSSLVHLPASNSREILAIFGSLTTCDPGNIHDTIGTLVKDNI 281

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP------------AI 167
           R S++ L+AE+ + K +C  TGG++SVAL+E HF + + E  PPP             A 
Sbjct: 282 RVSIVHLAAEVKVFKDVCTRTGGTFSVALNEGHFHDSLFELVPPPAVEGPRRTKRRSGAA 341

Query: 168 AEFAIAN------------LIKMGFPQR-AGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
              A A+            L++M FP R     + ++C+CH   + G GY CPRC  +VC
Sbjct: 342 TNGAAADGEDDEEVHNGVDLLQMAFPLRLPAHAAPTLCACHSRSR-GSGYVCPRCGVKVC 400

Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC------ 268
           ++PTDC +CG+ +V S HLARSYHHLFP+  +  V    ++     S + C  C      
Sbjct: 401 DVPTDCPVCGITIVMSTHLARSYHHLFPVPNWKAVPWTAVS---ATSDAACLSCNLPFPT 457

Query: 269 -----------QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
                       ++L  +G        C +C + FCLECD ++HE LH CPGC
Sbjct: 458 LQDGEASSAAANKALEDAGLSASSRYRCTRCSRDFCLECDAFVHEQLHVCPGC 510


>gi|154276968|ref|XP_001539329.1| suppressor of stem-loop protein 1 [Ajellomyces capsulatus NAm1]
 gi|150414402|gb|EDN09767.1| suppressor of stem-loop protein 1 [Ajellomyces capsulatus NAm1]
          Length = 506

 Score =  236 bits (603), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 143/356 (40%), Positives = 203/356 (57%), Gaps = 49/356 (13%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV EFF+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 135 DLRPTRYLLTLRYAQEFVLEFFEQNPISQLGVLGMRDGLAVRISDMSGNPTDHIMAIQAL 194

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQNAL++ +G L   P++G REVLI++ AL + DPGDI +TI      KI
Sbjct: 195 RPKDPKGTPSLQNALEMARGALFHTPTHGTREVLIIFGALLSSDPGDIHQTITALVADKI 254

Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIME-HAPPPPAIAEFAI-- 172
           R S+IGL+A++ IC+ LC  T       Y VAL+E HF+EL M+  APP   +A  +   
Sbjct: 255 RISIIGLAAQVAICRDLCARTNNGDDTVYGVALNEQHFRELFMDVTAPPATTVAPTSTPT 314

Query: 173 ---AN-------------LIKMGF--PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
              AN             L+ MGF         + S+C+CH +     GY C RC A+VC
Sbjct: 315 AFSANGPKTTTTSTTTSSLLMMGFPSLTLTTTPTPSLCACHSKPSRA-GYLCCRCNAKVC 373

Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-----Q 269
            LP+ C  CGL L+ S HLARSYHHLFP+  + EV+       R RS ++CF C     +
Sbjct: 374 TLPSSCPCCGLTLILSTHLARSYHHLFPLMNWVEVS--WRRAARKRS-ASCFACGVGFPR 430

Query: 270 QSLLSSGNKP---------GLYVA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
              L SG            G+ V+    CP C+ HFC++CD++ HE +HNCPGC+S
Sbjct: 431 MPKLVSGEPEETAKAAVGVGVSVSGRYECPVCECHFCIDCDVFAHEVVHNCPGCQS 486


>gi|226295046|gb|EEH50466.1| suppressor of stem-loop protein [Paracoccidioides brasiliensis
           Pb18]
          Length = 518

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 135/363 (37%), Positives = 186/363 (51%), Gaps = 56/363 (15%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  +  +  + FVREFFDQNP+SQ+G+V ++DG+A  ++D+ G+P  HI A+ G 
Sbjct: 121 DLRPTRYLLTLRYAQDFVREFFDQNPISQLGVVGMRDGLAVRISDMSGNPTEHILAIQGL 180

Query: 62  LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI +TI      KI
Sbjct: 181 RAKDPKGMPSLQNALEMARGALFHTPSHGTREVLIIYGALLSSDPGDIHKTITSLITDKI 240

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEFAI--- 172
              V+GL+A++ IC+ L   T       Y+VA++E HF+EL++    PP           
Sbjct: 241 HVYVLGLAAQVSICQELVTRTNNGDDSGYNVAMNEQHFRELVLNVTTPPATTLASHTAAA 300

Query: 173 ------------ANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDC 220
                         L+ MGFP R      ++C+CH       GY CPRC  +VC LP  C
Sbjct: 301 AANGTGTNPSTDGTLLPMGFPNRHLTPHPTLCACHSTPSRS-GYLCPRCCTKVCTLPASC 359

Query: 221 RICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPG 280
             C L L+ S HLARSYHHLFP+  + EV+           R  CF C    +     PG
Sbjct: 360 PSCNLTLILSTHLARSYHHLFPLMNWVEVSWRKAARAEAEGRVGCFACG---VGFAGVPG 416

Query: 281 LYVA-------------------------------CPKCKKHFCLECDIYIHESLHNCPG 309
            +V                                C  C+ HFC++CD++ HE +HNCPG
Sbjct: 417 EFVGAEGDEEREEEEDGKGEGKGASRGISVSGRYECLVCRCHFCIDCDVFAHEVVHNCPG 476

Query: 310 CES 312
           C+S
Sbjct: 477 CQS 479


>gi|403414452|emb|CCM01152.1| predicted protein [Fibroporia radiculosa]
          Length = 377

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 117/258 (45%), Positives = 169/258 (65%), Gaps = 17/258 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  ++ +    F+ E+FDQNPL QIG+V ++ G+ N    L    E H      K
Sbjct: 123 DMRPTRFDLMLQYAREFITEWFDQNPLGQIGIVGMRGGIGNPQDVLKAISERH------K 176

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           L  +G+ SLQNA+D+ +  +  +P++  RE+LI++ +L+TCDPG+I +++  C + +IR 
Sbjct: 177 LEPNGEPSLQNAIDMARASMGHLPTHSSREILIIFGSLTTCDPGNIHDSLDDCVKDRIRI 236

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP--AIAEFA-------I 172
           SV+ L+AEM IC+ LC  TGG + VAL+E HFK+L+ E  PPP   A+A           
Sbjct: 237 SVVALAAEMKICRELCDKTGGQFGVALNEGHFKDLLFELIPPPAQRAVARTKGGPAGNPS 296

Query: 173 ANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
           A+L+ MGFP R  + S  S+C+CH ++K   G+ CPRC A+VC++PTDC ICGL +VSSP
Sbjct: 297 ADLMMMGFPTRLPDTSAPSLCACHSQLK-SEGFLCPRCCAKVCDVPTDCDICGLMIVSSP 355

Query: 232 HLARSYHHLFPIAPFDEV 249
           HLARSYHHLFP+  +  V
Sbjct: 356 HLARSYHHLFPVKTYKAV 373


>gi|268573216|ref|XP_002641585.1| Hypothetical protein CBG09887 [Caenorhabditis briggsae]
          Length = 376

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 131/310 (42%), Positives = 184/310 (59%), Gaps = 20/310 (6%)

Query: 5   PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
           PSR  VV K ++ F+  FF+QNP++QIGL+T KD  A+ LT + G+     ++L  + + 
Sbjct: 82  PSRFVVVMKALQQFLERFFEQNPIAQIGLITCKDRKADRLTMMTGNIRILKESLSSLTEA 141

Query: 63  GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
            C GD SLQNAL L    L  +P +  REV+I+ +ALS+ DPG+I  TI+  K   IRCS
Sbjct: 142 FCGGDFSLQNALQLACVNLKGMPGHVSREVVIVMAALSSIDPGNIFSTIETMKRMNIRCS 201

Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
            IGLSAEMFICK + + T G YSVALD  H   L  +H  PP + A+ +  N I +GFP 
Sbjct: 202 AIGLSAEMFICKEMAKATKGEYSVALDPDHLHLLFSKHTLPPSS-AKSSECNAIHVGFPH 260

Query: 183 RAGEGSISICSCHKEVK-VGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
                + S C CH + K + + G+ C +C AR C +P +C +C L LV++P LAR++ HL
Sbjct: 261 HELIKTRSFCVCHPDSKPISLRGFICTQCGARHCSIPAECPVCKLTLVAAPQLARAFRHL 320

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
            P+A F ++           +R +C+ C+      G +      C KC+  FCL+CD  +
Sbjct: 321 QPLAAFKQI---------EVTRGSCYACETRFSDEGFR------CEKCRLVFCLDCDTLL 365

Query: 301 HESLHNCPGC 310
           HESLH CPGC
Sbjct: 366 HESLHVCPGC 375


>gi|146414033|ref|XP_001482987.1| hypothetical protein PGUG_04942 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 488

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 134/374 (35%), Positives = 205/374 (54%), Gaps = 68/374 (18%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D RP+R A+  + ++ F+ EFFDQNP+SQ+G+V +++GV+N ++++ G P+ HI  L   
Sbjct: 116 DLRPNRFAMTLQLLQEFIVEFFDQNPISQLGIVMMRNGVSNLVSEVNGLPQYHIDKLRLL 175

Query: 60  -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
                 +    GD  LQN+L++ + +L      +   +   +EVLIL+ AL T DPGDI 
Sbjct: 176 RSRQHNRFEPKGDPLLQNSLEMARAMLKYNFGTNMDDNKNSKEVLILFGALFTSDPGDIH 235

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDT--------------GGSYSVALDESHFK 154
           +TI    + +I+  +IGLSA++ IC+ +   T              G +YSV ++E+H++
Sbjct: 236 KTIDNLIKDEIKVKIIGLSAQVSICQQIANKTNNLGSRTTGANSSSGSNYSVIMNETHYR 295

Query: 155 ELIMEHAPPPPAIAEFAIAN----LIKMGFPQR------AGEGSISI-----CSCH---- 195
           EL+M+   P P  A    +     LIKMGFP +      A   ++++     C+CH    
Sbjct: 296 ELLMDCVVPLPVAATEKRSTVGVPLIKMGFPLKVLPPISASNNTVNLDLPVLCACHPTNG 355

Query: 196 -----KEVKVG-------VGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPI 243
                  V +G       +GY CP+C++RVC LPT C  CGL L+ S HLARSYHHL P+
Sbjct: 356 SLESKDAVTIGAESSTSIIGYQCPQCRSRVCHLPTICPTCGLMLILSTHLARSYHHLVPL 415

Query: 244 APFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA-----CPKCKKHFCLECDI 298
             + EV         +   + C+GC      +G   G   A     CP C + FC++CD+
Sbjct: 416 REYKEVPA-----ADHYEGTHCYGCLLKFPEAGGPDGPLRASSRYRCPTCARDFCIDCDV 470

Query: 299 YIHESLHNCPGCES 312
           ++HE+LHNCPGCES
Sbjct: 471 FVHETLHNCPGCES 484


>gi|322708259|gb|EFY99836.1| TFIIH basal transcription factor complex p47 subunit , putative
           [Metarhizium anisopliae ARSEF 23]
          Length = 456

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 140/348 (40%), Positives = 192/348 (55%), Gaps = 56/348 (16%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D  P+R  +      AFVREFF+QNP+SQ+G++ ++DGVA  ++DLGG+P  HI+ L   
Sbjct: 111 DLLPTRYRLTISYALAFVREFFEQNPISQLGIIGMRDGVAVRISDLGGNPAEHIERLKAV 170

Query: 62  LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            G    G+ SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI ETI      KI
Sbjct: 171 EGQDPQGNPSLQNALEMCRGALFHTPSHGTREVLIIYGALLSSDPGDIHETISNLIADKI 230

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
           R S++GLSA++ IC  LC  T       Y++A+DE HF++L +    PP    AE   A+
Sbjct: 231 RVSIVGLSAQVAICADLCSRTNAGDDSQYNIAMDEVHFRDLFLAGTTPPVTRTAEQNTAS 290

Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
           L+ MGFP R  A  G+IS C+CH +  +  GY                  CGL L+ S H
Sbjct: 291 LLMMGFPSRTLAPGGAISFCACHSK-PLREGYNS----------------CGLTLILSTH 333

Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGC----------------------Q 269
           LARSYHHLFP+  + EV+          +RST CF C                      +
Sbjct: 334 LARSYHHLFPLRNWVEVSWA------EATRSTVCFSCLCPFPEPPKDKIDGVEKSKDDTR 387

Query: 270 QSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRHSN 317
                  ++ G Y AC  C  HFC++CD++ H+ +HNCPGC+S  H +
Sbjct: 388 HPKAKGVSESGRY-ACEVCGNHFCIDCDVFAHQVIHNCPGCQSTVHHD 434


>gi|219116518|ref|XP_002179054.1| suppressor of stem-loop protein 1 [Phaeodactylum tricornutum CCAP
           1055/1]
 gi|217409821|gb|EEC49752.1| suppressor of stem-loop protein 1 [Phaeodactylum tricornutum CCAP
           1055/1]
          Length = 413

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 125/317 (39%), Positives = 187/317 (58%), Gaps = 24/317 (7%)

Query: 6   SRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKLG 63
           +R+ V    ++ FV+E++DQNPLSQ+G V +K+G A  LT L  S ++H  AL  + ++ 
Sbjct: 106 TRIDVTVAHLQHFVQEYYDQNPLSQLGFVLLKNGEAEILTQLSSSSKTHKLALDSVAQMA 165

Query: 64  CS------GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIM-ETIQKCKE 116
            +      G+ SLQN L+L    L   PS+G RE++++ +ALSTCDPG ++ ET+ + ++
Sbjct: 166 AAEGPSGGGEFSLQNGLELAGRSLGHQPSHGSREIVLVTAALSTCDPGYLLTETLPRLRQ 225

Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIA-NL 175
           ++IR S + L+AE+ +C+ L  +T G   V LD +H ++ +     PPP +        L
Sbjct: 226 AQIRVSCLALNAELHVCRKLADETHGVLGVCLDRAHLRDWLRSQTVPPPTLPSRPHGCAL 285

Query: 176 IKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
           ++MGFP R  A        +  + V     YTCP C+A+   LP DC +CGLQLV +PHL
Sbjct: 286 VRMGFPTRIIADGPEFVHATRQQTVLARTAYTCPVCQAKNAALPADCAVCGLQLVLAPHL 345

Query: 234 ARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFC 293
           ARS+HHLFP+ PF EV P           +     Q  +++S     L  ACP C + FC
Sbjct: 346 ARSFHHLFPVPPFREVQP---------DTAVVSNSQVPVVASRQ---LRFACPDCHQVFC 393

Query: 294 LECDIYIHESLHNCPGC 310
           ++CD ++HESLHNCPGC
Sbjct: 394 VDCDAFLHESLHNCPGC 410


>gi|393909764|gb|EJD75581.1| nucleolar protein 14 [Loa loa]
          Length = 1208

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 128/313 (40%), Positives = 186/313 (59%), Gaps = 21/313 (6%)

Query: 5   PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
           PSR+AV  K +  F+ +F +QNP+SQ+G+V  KD  A CL  L G+     ++L  + ++
Sbjct: 83  PSRLAVTIKALNQFLDKFSEQNPISQVGIVVCKDKRAECLIPLTGNVRLVKESLSTITEV 142

Query: 63  GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
            C G+ SL N+L      L   P Y  REV+++ ++LSTCDP +I  T +  K   IRCS
Sbjct: 143 LCHGEFSLHNSLMAAIKSLHSYPGYASREVILIVASLSTCDPSNIFGTFELLKRYHIRCS 202

Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
           VI LSAE+FI + LC  T G ++V LD +HF+ ++ EHA PP + +  A +++++MGFP 
Sbjct: 203 VISLSAEVFIFRKLCSATSGCHNVILDSTHFEVILNEHANPPIS-SRNAESSVVRMGFPA 261

Query: 183 RAGEGSISICSCHK-EVKV--GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
                S S C CH+ E++   G G+ CP+C AR C LP +CRIC L L+S+P LARS H+
Sbjct: 262 HESIDSPSFCLCHQSEIRPSGGRGFFCPQCGARYCSLPVECRICKLTLISAPQLARSLHN 321

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
           L P+  F+E+         + +   CF C + L            C  CK  FC++CD+ 
Sbjct: 322 LLPLPAFEEI---------DTTERVCFACIRQLDDKS------FVCKNCKSTFCIDCDVL 366

Query: 300 IHESLHNCPGCES 312
           +HESL  CPGC+S
Sbjct: 367 LHESLQICPGCKS 379


>gi|354547034|emb|CCE43767.1| hypothetical protein CPAR2_214110 [Candida parapsilosis]
          Length = 498

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 142/378 (37%), Positives = 201/378 (53%), Gaps = 72/378 (19%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D RP+R+++    +  FV EFFDQNP+SQ+G++ +++GVAN ++++ GSP+ HI  L   
Sbjct: 124 DLRPTRLSLTLNYLSEFVVEFFDQNPISQLGIILMRNGVANLISEVSGSPQYHIDRLRQL 183

Query: 60  -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
                 K    GD SLQN L++ + LL      +   S   +E+L+++ +L T DPGDI 
Sbjct: 184 KARQHNKYEPKGDPSLQNCLEMARSLLKFNFGSASNNSKNSKEILLVFGSLFTSDPGDIH 243

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS--------YSVALDESHFKELIME- 159
           +TI    +  I+ SVIGLSA++ IC+ L   T           Y V ++E+HFKEL+M+ 
Sbjct: 244 KTIDSLVKDNIKVSVIGLSAQVAICQELVNKTNHESRNSSSKHYGVIMNETHFKELLMDC 303

Query: 160 -HAPPPPAIAEFAIAN----LIKMGFPQRAGEGSISICSCHK------------------ 196
               P P   E  I      LIKMGFP +    + S     +                  
Sbjct: 304 VTPLPLPESEEIKIETKGVPLIKMGFPSKVQPNATSTIGNSEYTVEFPQLNASYPTQGSN 363

Query: 197 ------EVKVGV------GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIA 244
                 EV  G+      GY CP+CK++VC LPT C +CGL L+ S HLARSYHHL P+A
Sbjct: 364 DSKDVVEVNSGLAMSQTFGYQCPQCKSKVCNLPTICPVCGLMLILSTHLARSYHHLVPLA 423

Query: 245 PFDEVTPLCLNDPRNRSRSTCFGCQ-------QSLLSSGNKPGLYVA---CPKCKKHFCL 294
           P+ EV PL      +  R  C+GCQ       +S  ++G    L  +   C  C   FC+
Sbjct: 424 PYKEV-PLST----SYDREYCYGCQLIFPSGVKSTKNTGTIDSLTSSRYRCANCCNDFCI 478

Query: 295 ECDIYIHESLHNCPGCES 312
            CD++IHE LHNCPGCE+
Sbjct: 479 NCDVFIHEVLHNCPGCEN 496


>gi|344233967|gb|EGV65837.1| hypothetical protein CANTEDRAFT_118624 [Candida tenuis ATCC 10573]
          Length = 488

 Score =  234 bits (598), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 141/378 (37%), Positives = 207/378 (54%), Gaps = 74/378 (19%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESH---IKAL 58
           D RP+R +++   ++ FV EFFDQNP+SQ+G+V +++G+++ ++D+ GSP+ H   I+ L
Sbjct: 117 DLRPNRFSMMLSLLQEFVVEFFDQNPISQLGIVLMRNGISDVVSDVNGSPQFHLDKIRQL 176

Query: 59  MGK----LGCSGDSSLQNALDLVQGLLSQIPSYGH--------REVLILYSALSTCDPGD 106
            GK        GD SLQNAL+L + LL    ++G+        +E+LI++ AL T DPGD
Sbjct: 177 RGKQHNRFEPKGDPSLQNALELSRSLLRY--NFGNNINDTKNSKEILIIFGALFTSDPGD 234

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS--------YSVALDESHFKELIM 158
           I +TI    +  I+  VIGLSA++ IC+ +   T  +        Y V ++ESHFKEL+M
Sbjct: 235 IHKTISSLVKDNIKVKVIGLSAKVAICQEIVNKTNHTSNNFNNPNYGVIMNESHFKELLM 294

Query: 159 EHAPPPPAIAE-------FAIANLIKMGFPQRAGEGS-----------ISICSCH----- 195
           +   P P   E            L+KMGFP +                I IC+CH     
Sbjct: 295 DCVVPLPITVEQQKEIEASKSVPLLKMGFPTKVQPTLSSSSGGHSLDFIQICACHSGSRT 354

Query: 196 ---------KEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPF 246
                    ++    +GY CP+C ++VC LPT C ICGL L+ S HLARSYHHL P+  +
Sbjct: 355 ENGSDAAAIQQSSSVIGYRCPQCSSKVCSLPTVCPICGLMLILSTHLARSYHHLVPLVDY 414

Query: 247 DEVTPLCLNDPRNRSRSTCFGCQQSL------LSSGNKPGLYVA------CPKCKKHFCL 294
           +EV P+  + P     S C+GC          +   +K  L         C KCK  FC+
Sbjct: 415 EEV-PVKGSYP----SSFCYGCLLQFPEGIENIHKDSKEKLETMSSSRYRCKKCKNDFCI 469

Query: 295 ECDIYIHESLHNCPGCES 312
           +CD+++HE+LHNCPGCE+
Sbjct: 470 DCDVFVHETLHNCPGCEN 487


>gi|150864125|ref|XP_001382831.2| hypothetical protein PICST_29472 [Scheffersomyces stipitis CBS
           6054]
 gi|149385381|gb|ABN64802.2| component of RNA polymerase transcription factor [Scheffersomyces
           stipitis CBS 6054]
          Length = 489

 Score =  234 bits (597), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 139/383 (36%), Positives = 204/383 (53%), Gaps = 77/383 (20%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D RPSR ++  + ++ FV EFFDQNP+SQ+G++ +++GV+N ++++ GSP+ HI  +   
Sbjct: 111 DLRPSRFSMTLQLLQDFVVEFFDQNPISQLGIILMRNGVSNLVSEVSGSPQYHIDKIRQL 170

Query: 60  -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
                 +    GD SLQN+L++ + LL      +   +   +EVL+++ AL T DPGDI 
Sbjct: 171 KSRQHNRYEPKGDPSLQNSLEMARSLLKFNFGNNANDTKNSKEVLLIFGALFTSDPGDIH 230

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTG--------GSYSVALDESHFKELIMEH 160
           +TI    + +I+  +IGLSA++ IC+ L   T          +Y V ++ESHFKEL M+ 
Sbjct: 231 KTIDNLVKDEIKVKIIGLSAQVAICQDLVNRTNHQQKNISSKNYGVIMNESHFKELFMDC 290

Query: 161 APPPP----AIAEFAIAN----LIKMGFPQRAGEGSISICSCHK---------------- 196
             P P     + E A AN    +IKMGFP +      S  + ++                
Sbjct: 291 VTPLPITTSQVNENAEANKGVPIIKMGFPTKIQPVLTSTLASNEYTIEFPQLSASDPTHG 350

Query: 197 -------EVKVG----------VGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
                  E+  G          +GY CP+CK++VC LPT C +CGL L+ S HLARSYHH
Sbjct: 351 LDDNQVVEINNGTDSSIQATNVIGYQCPQCKSKVCNLPTVCPVCGLMLILSTHLARSYHH 410

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGC----------QQSLLSSGNKPGLYVACPKCK 289
           L P+ PF EV P+        +   CFGC           +S  S  N       C KC 
Sbjct: 411 LVPLVPFKEV-PVS----SKYNSKHCFGCLLKFPKGVSNTKSKSSLENMTSSRYRCMKCS 465

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
             FC++CD++IHE+LHNCPGCE+
Sbjct: 466 HDFCIDCDVFIHETLHNCPGCEN 488


>gi|320585884|gb|EFW98563.1| suppressor of stem-loop protein 1 [Grosmannia clavigera kw1407]
          Length = 524

 Score =  234 bits (597), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 136/352 (38%), Positives = 188/352 (53%), Gaps = 42/352 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D+ PS   V       FVR FF+QNP+SQ+ +V ++DGVA  ++D+GGSP  H+  L   
Sbjct: 144 DYLPSCQRVALDCAADFVRAFFEQNPISQLAVVAMRDGVALRVSDMGGSPAEHLDRLADW 203

Query: 62  LGC--SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            G    G  SLQNAL++ +G L   P++G REVL+++ AL + DP DI +TI      +I
Sbjct: 204 DGVEPQGQPSLQNALEMCRGALFHTPAHGTREVLVVFGALLSSDPADIHDTIAALVADRI 263

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPA------IAE 169
           R S++GL+A++ +C  +C  T      +Y+VA  + HF EL++    PP         A 
Sbjct: 264 RVSIVGLAAKVAVCAEICARTNPGDEAAYAVARHDVHFHELVLAATTPPATRPDDGDAAS 323

Query: 170 FAIANLIKMGFPQR------AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRIC 223
              A+L+ MGFP R      AG G +S+C+CH    V  GY C RC A+VC LP +C  C
Sbjct: 324 TGPASLLMMGFPSRAIADTVAGGGGLSVCACHSR-PVREGYGCTRCGAKVCRLPAECPAC 382

Query: 224 GLQLVSSPHLARSYHHLFPIAPFDEVT-PLCLNDPRNRSRSTCFGC-------------- 268
           GL L+ S HLARSYHHLFP+  + EV+     +  R R    C  C              
Sbjct: 383 GLTLILSTHLARSYHHLFPLRSWVEVSWTAAADAVRRRRAVACHACLTPFPAVDTAAAAA 442

Query: 269 --------QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
                         SG       AC  C  HFC++CD++ HE +HNCPGC+S
Sbjct: 443 AAVTAAAAATQTTGSGVSESGRYACEVCGHHFCIDCDVFAHEVIHNCPGCQS 494


>gi|448514642|ref|XP_003867164.1| RNA polymerase transcription factor TFIIH core component [Candida
           orthopsilosis Co 90-125]
 gi|380351502|emb|CCG21726.1| RNA polymerase transcription factor TFIIH core component [Candida
           orthopsilosis Co 90-125]
          Length = 511

 Score =  234 bits (596), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 140/381 (36%), Positives = 197/381 (51%), Gaps = 78/381 (20%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D RP+R+++    +  FV EFFDQNP+SQ+G++ +++GVAN ++++ GSP+ HI  L   
Sbjct: 137 DLRPTRLSLTLNYLSEFVVEFFDQNPISQLGIILMRNGVANLISEVSGSPQYHIDRLKQL 196

Query: 60  -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
                 K    GD SLQN L++ + LL      +   S   +EVL+++ +L T DPGDI 
Sbjct: 197 KARQHNKYEPKGDPSLQNCLEMARSLLKFNFGSASNNSKNSKEVLLVFGSLFTSDPGDIH 256

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS--------YSVALDESHFKELIMEH 160
           +TI+   +  I+ SVIGLSA++ IC+ L   T           Y V ++E+HFKEL+M+ 
Sbjct: 257 KTIESLVKDNIKVSVIGLSAQVAICQELVNRTNNEPRNSSSKHYGVIMNETHFKELLMDC 316

Query: 161 APPPP------AIAEFAIANLIKMGFPQRAGEGSISICSCHK------------------ 196
             P P         E     LIKMGFP +    + S     +                  
Sbjct: 317 VTPLPLPESEETKVETKGVPLIKMGFPSKVQPNATSTIGNSEYTVEFPRLNASYPTQGSN 376

Query: 197 ------EVKVGV------GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIA 244
                 EV  G+      GY CP+CK++VC LPT C +CGL L+ S HLARSYHHL P+A
Sbjct: 377 DSRDIVEVNAGLVTSPSFGYQCPQCKSKVCNLPTICPVCGLMLILSTHLARSYHHLVPLA 436

Query: 245 PFDEVTPLCLNDPRNRSRST--CFGCQQSLLSSGNKPGLYVA-----------CPKCKKH 291
           P+ E+       P + S  T  C+GCQ     SG K                 C  C   
Sbjct: 437 PYKEI-------PVSPSYDTDFCYGCQLK-FPSGTKSTKETKTIDSITSSRYRCTNCNND 488

Query: 292 FCLECDIYIHESLHNCPGCES 312
           FC+ CD+++HE LHNCPGCE+
Sbjct: 489 FCINCDVFVHEVLHNCPGCEN 509


>gi|448096877|ref|XP_004198537.1| Piso0_001913 [Millerozyma farinosa CBS 7064]
 gi|359379959|emb|CCE82200.1| Piso0_001913 [Millerozyma farinosa CBS 7064]
          Length = 514

 Score =  232 bits (592), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 138/381 (36%), Positives = 204/381 (53%), Gaps = 75/381 (19%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D RP+R +++   ++ F+ EFFDQNP+SQ+G+V +++G+A  ++D+ GSP+ HI  +   
Sbjct: 138 DLRPTRFSMMLSLLQEFILEFFDQNPISQMGIVMMRNGIAIRISDVSGSPQYHIDKIRQL 197

Query: 60  -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
                 +    GD SLQN+L+L + LL          +   +E+LI++ AL T DPG+I 
Sbjct: 198 KTKQHNRFEPKGDPSLQNSLELARSLLMYNFNNKVNDTQNSKEILIIFGALFTSDPGNIH 257

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS----------YSVALDESHFKELIM 158
           +TI    +  I+  +IGL+A++ IC+ + + T  S          Y V ++ESHFKEL+M
Sbjct: 258 KTIDNLVKDDIKIKIIGLAAQVAICEEIVRRTNSSKTSSSYINTNYGVIMNESHFKELLM 317

Query: 159 EHAPPPPAIAEFAIAN-----LIKMGFPQR-----AGEGSI-----SICSCH-------- 195
           +   P P   +F         LIKMGFP +         SI     SICSCH        
Sbjct: 318 DCVIPLPVSEDFEKKEKKGVPLIKMGFPSKIQPSVTSNSSINLNLPSICSCHPDEENDAS 377

Query: 196 --KEV----------KVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPI 243
             KE           K  +GY CP+C  +VC+LPT C ICGL L+ S HLARSYHHL P+
Sbjct: 378 HSKEAGEAGNGTYSSKKVIGYKCPQCLNKVCQLPTICPICGLMLILSTHLARSYHHLVPL 437

Query: 244 APFDEVTPLCLNDPRNRSRSTCFGC--------QQSLLSSGNKPGLYVA----CPKCKKH 291
             + EV  +          + C+GC        + +     NK     +    C KC++ 
Sbjct: 438 EAYKEVPTM-----EEYESTYCYGCLLKFPSGRKNAQSDDANKIESLTSSRYRCGKCQQD 492

Query: 292 FCLECDIYIHESLHNCPGCES 312
           FC++CD++IHE+LHNCPGCE+
Sbjct: 493 FCIDCDVFIHETLHNCPGCEN 513


>gi|255724012|ref|XP_002546935.1| suppressor of stem-loop protein 1 [Candida tropicalis MYA-3404]
 gi|240134826|gb|EER34380.1| suppressor of stem-loop protein 1 [Candida tropicalis MYA-3404]
          Length = 505

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 134/387 (34%), Positives = 200/387 (51%), Gaps = 80/387 (20%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D RP+R+++    ++ FV EFFDQNP+S++G++ +++G+AN ++++ GSP+ HI  +   
Sbjct: 124 DLRPNRLSMTLSYLQDFVTEFFDQNPISRLGIILMRNGIANLVSEVSGSPQYHIDKIRQL 183

Query: 60  -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
                 +    GD SLQN+L++ + LL      +   +   +EVLI++ AL T DPGDI 
Sbjct: 184 KARQHNRFEPKGDPSLQNSLEMARSLLKFNFGSTSNNTKNSKEVLIIFGALFTSDPGDIH 243

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS--------YSVALDESHFKELIMEH 160
            TI    + +I+ SVIGLSA++ IC+ L   T           Y V ++ESHFKEL+M+ 
Sbjct: 244 RTIDNLIKDEIKVSVIGLSAQVAICQELVNRTNKEPRNSQSKHYGVIMNESHFKELLMDS 303

Query: 161 APPPP----------AIAEFAIANLIKMGFPQR--------------------------- 183
             P P          A +E     +++MGFP +                           
Sbjct: 304 VTPLPLTESEKRLQDAQSEDGGVPVLRMGFPSKVQPTLTSVVTGTDMIIEFPHLNASFPT 363

Query: 184 -AGEGSISICSCHKEVKVG------VGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
              E S           V       +GY CP+CK++VC LPT C +CGL L+ S HLARS
Sbjct: 364 QGSEDSKDAVEIQNNRAVAAASSSVIGYQCPQCKSKVCNLPTLCPVCGLMLILSTHLARS 423

Query: 237 YHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL---LSSGNKPGLYVA-------CP 286
           YHHL P+A F EV    + D        CFGCQ      + +G + G   +       C 
Sbjct: 424 YHHLVPLAQFKEVPVAPVYDSE-----FCFGCQLKFPDGVKAGTQKGSLESMTSSRYRCK 478

Query: 287 KCKKHFCLECDIYIHESLHNCPGCESL 313
           +C+K FC+ CD+++HE LH CPGCE++
Sbjct: 479 RCEKSFCINCDVFVHEVLHTCPGCENM 505


>gi|241950787|ref|XP_002418116.1| TFIIH subunit, putative; general transcription and DNA repair
           factor IIH subunit, putative; suppressor of stem-loop
           protein 1, putative [Candida dubliniensis CD36]
 gi|223641455|emb|CAX43416.1| TFIIH subunit, putative [Candida dubliniensis CD36]
          Length = 501

 Score =  231 bits (590), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 135/383 (35%), Positives = 197/383 (51%), Gaps = 78/383 (20%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D RPSR+++    ++ FV EFFDQNP+S++G++ +++GVAN ++++ GSP+ H+  +   
Sbjct: 124 DLRPSRLSMTLNYLQEFVVEFFDQNPISRLGIILMRNGVANLVSEVSGSPQYHVDKIRNL 183

Query: 60  -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
                 +    GD SLQN+L++ + LL      +   S   +E+L+++ AL T DPGDI 
Sbjct: 184 KARQHNRFEPKGDPSLQNSLEMARSLLKFNFGSTSNNSKNSKEILVIFGALFTSDPGDIH 243

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS--------YSVALDESHFKELIMEH 160
           +TI    + +I+ SVIGLSA++ IC+ L   T           Y V ++ESHFKEL+ME 
Sbjct: 244 KTIDSLVKDEIKTSVIGLSAQVAICQELVNRTNKEPRNSQSKHYGVIMNESHFKELLMES 303

Query: 161 APPPPAI--------AEFAIANLIKMGFPQR----------------------------A 184
             P P          +E     +++MGFP +                             
Sbjct: 304 VTPLPLTESEKQIQESEQNGVPVLRMGFPTKVQPTLTSAIGGSDYIIEFPHLNASFPTQG 363

Query: 185 GEGSISICSCHKEVKVG----VGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
            E S  +              +GY CP+CK +VC LPT C +CGL L+ S HLARSYHHL
Sbjct: 364 SEDSKDVVEIQTNKPAAASSLIGYQCPQCKCKVCNLPTICPVCGLMLILSTHLARSYHHL 423

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLY-----------VACPKCK 289
            P+AP+ EV    + D      + CFGCQ      G K G               C KC 
Sbjct: 424 VPLAPYKEVKVSPIYD-----STYCFGCQLK-FPEGVKLGTLKKSLESMTSSRYRCLKCS 477

Query: 290 KHFCLECDIYIHESLHNCPGCES 312
           + FC+ CD+++HE LHNCPGCE+
Sbjct: 478 QDFCINCDVFVHEVLHNCPGCEN 500


>gi|310796716|gb|EFQ32177.1| hypothetical protein GLRG_07321 [Glomerella graminicola M1.001]
          Length = 482

 Score =  231 bits (589), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 136/359 (37%), Positives = 198/359 (55%), Gaps = 44/359 (12%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D  P+R  V       FV+E+F+QNP+SQ+G++ ++DGVA  ++D+ G+P  H++ L   
Sbjct: 104 DMLPNRYRVAWAYAADFVKEYFEQNPISQLGIIGMRDGVAIRISDMSGNPTDHLEKLKTF 163

Query: 62  LGC--SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            G   SG+ SLQNAL++ +G L   PS+G REVLI++ AL + DPGDI +TI      +I
Sbjct: 164 EGQDPSGNPSLQNALEMCRGALFHAPSHGTREVLIVFGALLSSDPGDIHDTISSLITDRI 223

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
           R S++GL+A++ IC  LC  T       YSV ++++HF++L++    PP       + A+
Sbjct: 224 RVSIVGLAAQVAICAELCSRTNAGDETQYSVCMNDAHFRDLLLATTTPPVTRTVAQSTAS 283

Query: 175 LIKMGFPQRA---GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
           L+ MGFP R    GE + ++C+CH +     GY C RC  RVC LP +C  C L L+ S 
Sbjct: 284 LLMMGFPSRTLAQGEAT-AVCACHNK-PAREGYLCTRCGTRVCRLPIECPACSLTLILST 341

Query: 232 HLARSYHHLFPIAPFDEVTPL----------CLND----PRNRSRSTCFG---------- 267
           HLARSYHHLFP+  + EV+            CL      P+   R               
Sbjct: 342 HLARSYHHLFPLRNWIEVSWAEAAKSKACYSCLTTFPEPPKTGRRKDKDKDKEKDGLHTT 401

Query: 268 -------CQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRHSNPI 319
                     S L   ++ G Y +C  C  HFC++CD+Y HE +HNCPGC+S     P+
Sbjct: 402 APVAPTPVTASDLKGVSESGRY-SCTVCGNHFCIDCDVYAHEVIHNCPGCQSDTRGAPM 459


>gi|238881955|gb|EEQ45593.1| suppressor of stem-loop protein 1 [Candida albicans WO-1]
          Length = 472

 Score =  231 bits (589), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 136/388 (35%), Positives = 202/388 (52%), Gaps = 83/388 (21%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D RP+R+++    ++ FV EFFDQNP+S++G++ +++GVAN ++++ GSP+ H+  +   
Sbjct: 90  DLRPTRLSMTLNYLQEFVVEFFDQNPISRLGIILMRNGVANLVSEVSGSPQYHVDKIRNL 149

Query: 60  -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
                 +    GD SLQN+L++ + LL      +   S   +E+L+++ AL T DPGDI 
Sbjct: 150 KARQHNRFEPKGDPSLQNSLEMARSLLKFNFGSTSNNSKNSKEILVIFGALFTSDPGDIH 209

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS--------YSVALDESHFKELIMEH 160
           +TI    + +I+ SVIGLSA++ IC+ L   T           Y V ++ESHFKEL+ME 
Sbjct: 210 KTIDNLVKDEIKASVIGLSAQVAICQELVNRTNNEPRNSQSKHYGVIMNESHFKELLMES 269

Query: 161 APPPPAI--------AEFAIANLIKMGFPQRA-------------------------GEG 187
             P P          +E     +++MGFP +                           +G
Sbjct: 270 VTPLPLTESEKQMQESEQNGVPVLRMGFPTKVQPTLTSAIGGSDYIVEFPHLNASFPTQG 329

Query: 188 S------ISICSCHKEVKVG------VGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           S      + I +      V       +GY CP+CK +VC LPT C +CGL L+ S HLAR
Sbjct: 330 SEDSKDVVEIQTNKPAAAVATTSSSVIGYQCPQCKCKVCNLPTICPVCGLMLILSTHLAR 389

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA----------- 284
           SYHHL P+AP+ EV    + D      + CFGCQ      G K G +             
Sbjct: 390 SYHHLVPLAPYKEVKVSPIYD-----STYCFGCQLK-FPEGVKLGTHKGTLESMTSSRYR 443

Query: 285 CPKCKKHFCLECDIYIHESLHNCPGCES 312
           C KC + FC+ CD+++HE LHNCPGCE+
Sbjct: 444 CLKCSQDFCINCDVFVHEVLHNCPGCEN 471


>gi|258571509|ref|XP_002544558.1| suppressor of stem-loop protein 1 [Uncinocarpus reesii 1704]
 gi|237904828|gb|EEP79229.1| suppressor of stem-loop protein 1 [Uncinocarpus reesii 1704]
          Length = 413

 Score =  230 bits (587), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 121/274 (44%), Positives = 170/274 (62%), Gaps = 13/274 (4%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  +  +  + FV E+F+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI AL   
Sbjct: 85  DLRPTRYLLTLRFAQEFVLEYFEQNPISQLGIIGMRDGLAVKISDMSGNPTEHILALQAL 144

Query: 62  LGC--SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                +G  SLQNAL++ +G L   PS+G REVLIL+ AL + DPGDI +T+      KI
Sbjct: 145 RAKDPNGLPSLQNALEMARGTLFHTPSHGTREVLILFGALLSSDPGDIHQTVSSLISDKI 204

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEFAIAN- 174
           R  V+GL+AE+ IC+ +C  T       Y VAL+E HF+EL+ME   PP   ++   AN 
Sbjct: 205 RVGVVGLAAEVAICREICAKTNAGDDSGYGVALNEQHFRELMMETTTPPVTYSKKQAANS 264

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           L+ MGFP R  E + S+C+CH     G GY C RC ++VC LP +C  CGL L+ S HLA
Sbjct: 265 LLMMGFPSRTVEPAPSMCACHSMPSRG-GYLCSRCGSKVCTLPAECPACGLTLILSTHLA 323

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC 268
           RSYHHLFP+  + EV P      +  + + CF C
Sbjct: 324 RSYHHLFPLINWIEV-PW----KKASTSANCFAC 352


>gi|68466623|ref|XP_722507.1| hypothetical protein CaO19.1457 [Candida albicans SC5314]
 gi|68466906|ref|XP_722368.1| hypothetical protein CaO19.9032 [Candida albicans SC5314]
 gi|46444338|gb|EAL03613.1| hypothetical protein CaO19.9032 [Candida albicans SC5314]
 gi|46444486|gb|EAL03760.1| hypothetical protein CaO19.1457 [Candida albicans SC5314]
          Length = 507

 Score =  230 bits (587), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 136/388 (35%), Positives = 202/388 (52%), Gaps = 83/388 (21%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D RP+R+++    ++ FV EFFDQNP+S++G++ +++GVAN ++++ GSP+ H+  +   
Sbjct: 125 DLRPTRLSMTLNYLQEFVVEFFDQNPISRLGIILMRNGVANLVSEVSGSPQYHVDKIRNL 184

Query: 60  -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
                 +    GD SLQN+L++ + LL      +   S   +E+L+++ AL T DPGDI 
Sbjct: 185 KARQHNRFEPKGDPSLQNSLEMARSLLKFNFGSTSNNSKNSKEILVIFGALFTSDPGDIH 244

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS--------YSVALDESHFKELIMEH 160
           +TI    + +I+ SVIGLSA++ IC+ L   T           Y V ++ESHFKEL+ME 
Sbjct: 245 KTIDSLVKDEIKASVIGLSAQVAICQELVNRTNNEPRNSQSKHYGVIMNESHFKELLMES 304

Query: 161 APPPPAI--------AEFAIANLIKMGFPQRA-------------------------GEG 187
             P P          +E     +++MGFP +                           +G
Sbjct: 305 VTPLPLTESEKQIQESEQNGVPVLRMGFPTKVQPTLTSAIGGSDYIVEFPHLNASFPTQG 364

Query: 188 S------ISICSCHKEVKVG------VGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           S      + I +      V       +GY CP+CK +VC LPT C +CGL L+ S HLAR
Sbjct: 365 SEDSKDVVEIQTNKPAAAVATTSSSVIGYQCPQCKCKVCNLPTICPVCGLMLILSTHLAR 424

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA----------- 284
           SYHHL P+AP+ EV    + D      + CFGCQ      G K G +             
Sbjct: 425 SYHHLVPLAPYKEVKVSPIYD-----STYCFGCQLK-FPEGVKLGTHKGTLESMTSSRYR 478

Query: 285 CPKCKKHFCLECDIYIHESLHNCPGCES 312
           C KC + FC+ CD+++HE LHNCPGCE+
Sbjct: 479 CLKCSQDFCINCDVFVHEVLHNCPGCEN 506


>gi|448110898|ref|XP_004201714.1| Piso0_001913 [Millerozyma farinosa CBS 7064]
 gi|359464703|emb|CCE88408.1| Piso0_001913 [Millerozyma farinosa CBS 7064]
          Length = 514

 Score =  230 bits (587), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 141/382 (36%), Positives = 205/382 (53%), Gaps = 77/382 (20%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D RP+R +++   ++ F+ EFFDQNP+SQ+G+V +++G+A  ++D+ GSP+ HI  +   
Sbjct: 138 DLRPTRFSMMLSLLQEFILEFFDQNPISQMGIVMMRNGIAVRISDVSGSPQYHIDKIRQL 197

Query: 60  -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
                 +    GD SLQN+L+L + LL          +   +E+LI++ AL T DPG+I 
Sbjct: 198 KTKQHNRFEPKGDPSLQNSLELARSLLMYNFNNKVNDTQNSKEILIIFGALFTSDPGNIH 257

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS----------YSVALDESHFKELIM 158
           +TI    +  I+  +IGL+A++ IC+ + + T  S          Y V ++ESHFKEL+M
Sbjct: 258 KTIDNLVKDDIKVKIIGLAAQVAICEEIVRRTNSSKTSSSYINTNYGVIMNESHFKELLM 317

Query: 159 EHAPPPPAIAEFAIAN-----LIKMGFPQR-----AGEGSI-----SICSCH-------- 195
           +   P P   +F         LIKMGFP +         SI     SICSCH        
Sbjct: 318 DCVIPLPVSEDFEKKEKKGVPLIKMGFPSKIQPSITSNSSINLNLPSICSCHPDEENESS 377

Query: 196 --KEV----------KVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPI 243
             KE           K  +GY CP+C  +VC+LPT C ICGL L+ S HLARSYHHL P+
Sbjct: 378 HSKEAGEAGNGTYSSKKVIGYKCPQCLNKVCQLPTICPICGLMLILSTHLARSYHHLVPL 437

Query: 244 APFDEVTPL-----------CLNDP--RNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKK 290
             + EV  +            L  P  R  + S      +SL SS  +      C KC++
Sbjct: 438 EAYREVPTMEEYESIYCYGCLLKFPPGRKNAESDETNKIESLTSSRYR------CGKCQQ 491

Query: 291 HFCLECDIYIHESLHNCPGCES 312
            FC++CD++IHE+LHNCPGCE+
Sbjct: 492 DFCIDCDVFIHETLHNCPGCEN 513


>gi|294654358|ref|XP_456408.2| DEHA2A01606p [Debaryomyces hansenii CBS767]
 gi|199428818|emb|CAG84360.2| DEHA2A01606p [Debaryomyces hansenii CBS767]
          Length = 514

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 132/385 (34%), Positives = 206/385 (53%), Gaps = 79/385 (20%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D RP+R +++   ++ FV E+FDQNP+SQ+G++ +++GV+N ++++ GSP+ H+  +   
Sbjct: 134 DLRPTRFSMMLSLLQEFVVEYFDQNPISQLGIIMMRNGVSNLISEVNGSPQYHLDKIRQL 193

Query: 60  -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
                 K    GD SLQNAL++ + LL      +   +   +E+LI++ AL T DPGDI 
Sbjct: 194 KSRQHNKYEPKGDPSLQNALEMARSLLMYNFGTNLNDTKNSKEILIIFGALFTSDPGDIH 253

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDT-------------GGSYSVALDESHFKE 155
           +TI    + +I+  VIGLSA++ IC+ L   T             G +Y V ++E HF+E
Sbjct: 254 KTINSLVKDEIKVKVIGLSAQVAICQELVNKTNNPGKSNTFWSAGGNNYGVIMNEIHFRE 313

Query: 156 LIMEHAPPPPAIAEFAIAN-------LIKMGFPQR------AGEGSISI-----CSCHKE 197
           L+M+   P P      I         +IKMGFP +      +   ++S+     C+CH  
Sbjct: 314 LLMDCVIPLPVTTNAVIDENNSKEVPIIKMGFPSKIQPVITSSTSNLSVDFPQLCACHPT 373

Query: 198 ----------------VKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
                            +  +GY CP+C  +VC LPT C ICGL L+ S HLARSYHHL 
Sbjct: 374 QDSLESKNNDPTATLGTRSVIGYECPQCNNKVCHLPTICPICGLMLILSTHLARSYHHLV 433

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGC--------QQSLLSSGNKPGL------YVACPK 287
           P+  + EV P+      + S + C+GC        + + ++   K  +         C K
Sbjct: 434 PLGDYKEV-PVS----ESYSSTYCYGCLLKFPSGIKNTDVNEDGKTKMDSLTSSRYRCGK 488

Query: 288 CKKHFCLECDIYIHESLHNCPGCES 312
           C+  FC++CD+++HESLHNCPGCE+
Sbjct: 489 CQNDFCIDCDVFVHESLHNCPGCEN 513


>gi|71022397|ref|XP_761428.1| hypothetical protein UM05281.1 [Ustilago maydis 521]
 gi|46101297|gb|EAK86530.1| hypothetical protein UM05281.1 [Ustilago maydis 521]
          Length = 539

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 127/357 (35%), Positives = 187/357 (52%), Gaps = 62/357 (17%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  V  +    FV E+FDQNP+ Q+ ++  + G+A  L  +GG+   H  +L  K
Sbjct: 163 DMRPNRFDVTLQYAREFVGEYFDQNPIGQLSIIGTRSGIAERLAMMGGNTVDHTASLSNK 222

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             L   G+ SLQNAL++ +  L  +P+   RE+L ++ +L+TCDPG+I +TI    +  I
Sbjct: 223 RRLEPRGEPSLQNALEMARSSLVHLPASNSREILAIFGSLTTCDPGNIHDTIATLVKDNI 282

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP-------------A 166
           R S++ L+AE+ + K +C  TGG++SVAL+E HF + + E  PPP               
Sbjct: 283 RVSIVHLAAEVKVFKDVCTRTGGTFSVALNEGHFHDSLFELVPPPAVEGKPRRTRQHMVG 342

Query: 167 IAEFAI---------------ANLIKMGFPQR-AGEGSISICSCHKEVKVGVGYTCPRCK 210
           IA+ +                 +L++M FP R     + ++C+CH   + G GY CPRC 
Sbjct: 343 IADGSTAMDAEDDDEDGVQAGVDLLQMAFPLRLPAHAAPTLCACHSRSR-GSGYLCPRCG 401

Query: 211 ARVCELPTDCRICGLQLVSSPHLARSYHHLFPI-----APFDEVTPLCLNDPRNRSRSTC 265
            +VC +PTDC +CG+ +V S HLARSYHHLFP+      P+  VTP         S   C
Sbjct: 402 VKVCNVPTDCPVCGITIVMSTHLARSYHHLFPVPNWKAVPWSSVTP--------DSDGAC 453

Query: 266 FGC-----------------QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLH 305
           F C                  ++L  +G  P     C +C   FCLECD ++HE LH
Sbjct: 454 FSCNVPFPSLQERKEKSAAANKALEEAGLSPSSRYRCGRCAIDFCLECDAFVHEQLH 510


>gi|239614970|gb|EEQ91957.1| RNA polymerase TFIIH complex subunit Ssl1 [Ajellomyces dermatitidis
           ER-3]
 gi|327356553|gb|EGE85410.1| RNA polymerase TFIIH complex subunit Ssl1 [Ajellomyces dermatitidis
           ATCC 18188]
          Length = 525

 Score =  227 bits (579), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 133/361 (36%), Positives = 197/361 (54%), Gaps = 54/361 (14%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV EFF+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 140 DLRPTRYLLTLRYAQEFVLEFFEQNPISQLGVLGMRDGLAVRISDMSGNPTDHILAIQSL 199

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQN L++ +G L + P++G REV I++ AL + DPGDI +TI      KI
Sbjct: 200 RPKDPKGMPSLQNTLEMARGALFRTPTHGTREVFIIFGALLSSDPGDIHKTINTLVADKI 259

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIME--------------HA 161
           R S+IGL+A++ IC+ +C  T       Y VAL+E HF+EL M                 
Sbjct: 260 RVSIIGLAAQVAICRDICARTNNGDDSGYGVALNEQHFRELFMNVTTPPATTVAPTPTTT 319

Query: 162 PPPPAIAEFAIANLIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTD 219
                 ++   ++L+ MGFP R  +   + ++C+CH +     GY C RC A+VC LPT 
Sbjct: 320 KEETKTSQTTTSSLLMMGFPSRTLSPTTTPTLCACHSKPSR-TGYLCSRCGAKVCTLPTS 378

Query: 220 CRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL------- 272
           C  CGL L+ S HLARSYHHLFP+  + EV+       R RS ++CF C           
Sbjct: 379 CPCCGLTLILSTHLARSYHHLFPLMNWVEVS--WRRAARKRS-ASCFACAVGFPRVPKQF 435

Query: 273 -----------------LSSGNKPGLYVA----CPKCKKHFCLECDIYIHESLHNCPGCE 311
                             +     G+ V+    CP C+ HFC++CD++ HE +HNCPGC+
Sbjct: 436 SGGEQEGAGEEQGEGEGETVKGPSGISVSGRYECPVCECHFCIDCDVFAHEVVHNCPGCQ 495

Query: 312 S 312
           S
Sbjct: 496 S 496


>gi|324518504|gb|ADY47122.1| General transcription factor IIH subunit 2, partial [Ascaris suum]
          Length = 386

 Score =  227 bits (578), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 120/313 (38%), Positives = 174/313 (55%), Gaps = 21/313 (6%)

Query: 5   PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
           P+R+ V  K +  F+ +F +QNP+SQ+G++  +D  A  L  L G+  +  +AL  + + 
Sbjct: 87  PTRLNVTLKVLNQFLEKFSEQNPISQVGIIICRDKRAERLIQLTGTIRAVKEALSTINEA 146

Query: 63  GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
            C G+ SLQN L +    L   P +  RE++ + ++LSTCDP +I  T +  K + +RCS
Sbjct: 147 SCHGEFSLQNGLLVALRSLQHFPGHASREIIAIVASLSTCDPSNIFGTFEVLKRNNVRCS 206

Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
           VIGLSAE+F+ K LC  T G Y V LD  HF ELI+     PP       + +++MGFP 
Sbjct: 207 VIGLSAEVFVYKKLCSTTSGRYDVILDGDHF-ELILNEYTNPPVTKRNVESCVVRMGFPS 265

Query: 183 RAGEGSISICSCHKEVKV---GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
                  + C CH+       G G+ CP+C AR C LP +CR+C L L+S+P LARS+HH
Sbjct: 266 HQIIQKPAFCLCHQSENRPPGGRGFLCPQCGARYCSLPVECRVCKLMLISAPQLARSFHH 325

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
           L P+  F EV         + +   CFGC + L           AC  C  ++C++CD+ 
Sbjct: 326 LLPLPAFKEV---------DTTSGICFGCAKPLEQKS------FACKSCDANYCIDCDLL 370

Query: 300 IHESLHNCPGCES 312
           +HESL  CP C S
Sbjct: 371 LHESLQLCPACPS 383


>gi|443923297|gb|ELU42560.1| RNA polymerase II transcription factor [Rhizoctonia solani AG-1 IA]
          Length = 877

 Score =  226 bits (577), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 117/288 (40%), Positives = 172/288 (59%), Gaps = 25/288 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  +      AFV E+ +QNPL QIG+V ++ G+   + ++ G+P   ++A+  K
Sbjct: 176 DLRPTRFELALDCARAFVVEWCEQNPLGQIGVVGMRAGIGERIVEMTGNPHDVLRAIADK 235

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             L  +G+ SLQNA+++ +  +  +      E++I++ +L+TCDPGDI ET+  C + KI
Sbjct: 236 RKLEPAGEPSLQNAIEVARAGMRHV-RLTREEIVIIFGSLTTCDPGDIYETLDSCVKDKI 294

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFA-------- 171
           R S++ L+AEM ICK LC+ T GS+ VAL+E HFK+L+ EH PPP   A           
Sbjct: 295 RISLVALAAEMKICKELCEKTEGSFGVALNEGHFKDLLFEHIPPPAHRAARTGNDKIPSS 354

Query: 172 ----------IANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDC 220
                      A+L+ MGFP R    S  ++C CH       G+ CPRC A++CE+PTDC
Sbjct: 355 QTGKANPLNPNADLMLMGFPTRLPPTSAPALCVCHPSRMRAEGFLCPRCSAKLCEVPTDC 414

Query: 221 RICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC 268
            +CGL +VSSPHLARSYHHLFP+  +  +  +    P +   +TC GC
Sbjct: 415 DVCGLMVVSSPHLARSYHHLFPVGAYTAINEI---GPDDTPSATCQGC 459


>gi|171689514|ref|XP_001909697.1| hypothetical protein [Podospora anserina S mat+]
 gi|170944719|emb|CAP70830.1| unnamed protein product [Podospora anserina S mat+]
          Length = 501

 Score =  223 bits (568), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 139/376 (36%), Positives = 192/376 (51%), Gaps = 75/376 (19%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D  P+R A+       FV  FF QNP+SQ+G++ ++DG+A  ++D+ G+P  HI+ L   
Sbjct: 108 DLLPNRFALTFSYAMEFVNTFFQQNPISQLGIIGMRDGIAVRISDMSGNPVEHIEKLRQW 167

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G+ SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI +TI      +I
Sbjct: 168 ALKDPIGNPSLQNALEMCRGHLYHTPSHGTREVLIIYGALLSSDPGDISDTITSLIADRI 227

Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPA-IAEFAIAN 174
           R S+IGL+A++ IC  LC  T  +    Y +AL E HF+EL +    PP    AE + A+
Sbjct: 228 RVSIIGLAAQVAICAELCARTNDNDDSQYRIALHEQHFRELFLAATTPPVTHEAEQSNAS 287

Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
           L+ MGFP R  A +  +S+C+CH +     GYTC RC+ +VC LP  C +CGL L+ S H
Sbjct: 288 LLMMGFPSRSLASKDFVSLCACHNK-PTREGYTCTRCRIKVCRLPASCPVCGLTLILSIH 346

Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRST--CFGCQQSLL----------------S 274
           LARSYHHLFP+  +  V       P   ++ +  CF CQ                    S
Sbjct: 347 LARSYHHLFPLKSWVAV-------PWTEAKKSVACFSCQTPFPPVPKAAPPKIKLKVKES 399

Query: 275 SG---------------------------NKPGLY-----VACPKCKKHFCLEC------ 296
           SG                             PGL          +  ++ C EC      
Sbjct: 400 SGVGGQTAANIAKAKGRGEVPAKTNTVTAPTPGLLPEAIKAGVSESGRYKCPECEEHFCI 459

Query: 297 --DIYIHESLHNCPGC 310
             DIY HE++HNCPGC
Sbjct: 460 DCDIYAHETIHNCPGC 475


>gi|451846168|gb|EMD59479.1| hypothetical protein COCSADRAFT_41319 [Cochliobolus sativus ND90Pr]
          Length = 496

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 122/364 (33%), Positives = 187/364 (51%), Gaps = 60/364 (16%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R   +    + ++REFF+QNP+SQ+ ++ + DGV   +++L G+P  H+ A+ G 
Sbjct: 118 DMRPNRFITMINYTQDYIREFFEQNPISQMSVLGMHDGVCIRVSELSGNPAEHVAAIQGL 177

Query: 62  LGCS------GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCK 115
                     G  SLQNAL+L +  L   P++G REV++++ +L + DPGDI ++++ C 
Sbjct: 178 RSKDDGKEPKGSPSLQNALELARATLYHTPNHGTREVIVVFGSLLSLDPGDIHQSVKACV 237

Query: 116 ESKIRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP----PAI 167
             +IR S+IG+ A + IC  + + T       Y++A D+   +EL+     PP    P +
Sbjct: 238 RDRIRVSIIGMGARLKICTEIVKRTNAGDESEYTIATDQEMLRELLQATTTPPVIRQPLL 297

Query: 168 AEF-----------AIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCEL 216
            +            + A L+ MGFP R  E  +++C+CH  + +G GYTC RC A+VC L
Sbjct: 298 TDKTAPVTTTQSSESAAALMMMGFPSRVVEDQLTMCACHGNLTMG-GYTCSRCSAKVCSL 356

Query: 217 PTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-------- 268
           P  C  C L L+ S HLARSYHHLFP+  +  V+    +  R      C GC        
Sbjct: 357 PITCPSCQLTLLLSTHLARSYHHLFPLRNWATVS---WSRAREMGSKQCVGCLAAFSNPP 413

Query: 269 ----------------------QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHN 306
                                   S     ++ G Y  C  C+ HFC++CD++ H  LHN
Sbjct: 414 SNAKENGEAKRDSEAKVAKKDDDDSEEQKASESGRY-ECRACESHFCIDCDMFAHMVLHN 472

Query: 307 CPGC 310
           CPGC
Sbjct: 473 CPGC 476


>gi|367035208|ref|XP_003666886.1| hypothetical protein MYCTH_2311995 [Myceliophthora thermophila ATCC
           42464]
 gi|347014159|gb|AEO61641.1| hypothetical protein MYCTH_2311995 [Myceliophthora thermophila ATCC
           42464]
          Length = 589

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 125/298 (41%), Positives = 174/298 (58%), Gaps = 35/298 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D  P+R  +       FVRE+F+QNP+SQ+G+V ++DG+A  ++D+GG+P  HI+ L   
Sbjct: 117 DLLPNRYLLTLNYAVDFVREYFEQNPISQLGIVGMRDGIAVRISDMGGNPAEHIEKLRTW 176

Query: 61  --KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
             +    G+ SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI +TI K    +
Sbjct: 177 AEQQEPQGNPSLQNALEMCRGALFHTPSHGTREVLIIYGALLSSDPGDIHDTIAKLLNDR 236

Query: 119 IRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKEL--------IMEHAPPPPA 166
           IR SV+GL+A++ IC+ LC  T G    SY+VAL E HF+EL        + +  PPPP 
Sbjct: 237 IRVSVVGLAAQVAICEQLCTRTNGGDPSSYAVALHEQHFRELFLAATTPPVTQSTPPPPG 296

Query: 167 IAE--------------FAIANLIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCK 210
             +                 A+L+ MGFP R  A +  +S+C+CH       GY C RC+
Sbjct: 297 AGDGTNNNNNSQQQSQQSQQASLLIMGFPSRALASKDHVSLCACHNR-PTREGYACTRCR 355

Query: 211 ARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC 268
            +VC LP +C +CGL LV S HLARSYHHLFP+  +  V+       R +S   CF C
Sbjct: 356 TKVCRLPAECPVCGLTLVLSTHLARSYHHLFPLRGWVAVS--WAEAARAKS-GVCFAC 410



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 18/29 (62%), Positives = 24/29 (82%)

Query: 284 ACPKCKKHFCLECDIYIHESLHNCPGCES 312
           ACP C  HFC++CD++ HE +HNCPGC+S
Sbjct: 486 ACPVCGNHFCIDCDVFAHEMIHNCPGCQS 514


>gi|326433104|gb|EGD78674.1| hypothetical protein PTSG_01653 [Salpingoeca sp. ATCC 50818]
          Length = 394

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 122/317 (38%), Positives = 193/317 (60%), Gaps = 24/317 (7%)

Query: 3   FRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPE---SHIKALM 59
            +P+R+      +  FV  FF +NP+S +G++ +++GVA  + +L G+ +    H+ AL+
Sbjct: 73  LKPNRLTCTVHNLIDFVHSFFHENPISDMGVIVMREGVARVICELNGNKDVLVEHLSALL 132

Query: 60  ---GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKE 116
                L  +G++SLQNAL + +  L  IPS+  REV+I+ + L++CDPG+I++TI+  + 
Sbjct: 133 HPKTPLFPAGEASLQNALAVARRSLRSIPSHATREVVIVQATLASCDPGNILDTIKSLRS 192

Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLI 176
            K+  +V+GL+A + IC+ +C++TGG ++VALDE H  EL++    PP A A+     + 
Sbjct: 193 DKVTVNVVGLAAAVRICETVCKETGGIHAVALDEDHLGELLLALTTPPAAPADADATQM- 251

Query: 177 KMGFPQRAGEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
           ++GF   A EG        K+ ++G+   GY CP+C+A+V ++P  C  CGL LVSSPHL
Sbjct: 252 RLGFA--AYEGRRPTVRRQKDGQLGITTGGYVCPQCRAKVSDIPQRCTTCGLMLVSSPHL 309

Query: 234 ARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFC 293
           A+++HH+FP+  F EVT          +   C GC Q L ++   P     CP C    C
Sbjct: 310 AKTFHHMFPLPVFKEVTLA--------AAEWCAGCTQPLPTT--LPAY--ECPSCGARVC 357

Query: 294 LECDIYIHESLHNCPGC 310
           L+CD+Y+H  LHNCPGC
Sbjct: 358 LDCDVYVHRELHNCPGC 374


>gi|358340016|dbj|GAA47968.1| transcription initiation factor TFIIH subunit 2 [Clonorchis
           sinensis]
          Length = 929

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 121/290 (41%), Positives = 167/290 (57%), Gaps = 22/290 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    + V  FVR++FDQNP+SQ+G++   D  A  LT+L G+P  H+ AL  +
Sbjct: 606 DLKPTRLVCTLRAVAEFVRDYFDQNPISQLGIIVTADRQAERLTELSGNPRCHLAALETL 665

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
               CSG+ SLQNAL L +  L   P   H E+L++ ++L+TCDPGDI +TIQ    + I
Sbjct: 666 FTRPCSGEPSLQNALTLAESRLKYTPH--HSEILVIMASLTTCDPGDIHKTIQSLAANHI 723

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSV+ L+ E+F+ + L Q T G + V LDE H K ++    PPP A  + A A LI+M 
Sbjct: 724 RCSVVSLAVEVFVYRALAQITQGQFHVILDEPHLKTVLKNFVPPPAATVD-APATLIRMA 782

Query: 180 FPQRAG----EGSISICSCHKEVKVGVG------YTCPRCKARVCELPTDCRICGLQLVS 229
           FP          S  +C CH   +          Y CPRC A  CELP +C +CGL LV+
Sbjct: 783 FPHSETFDLRNFSTVLCMCHLNQQGSSTDSSHPQYACPRCCAAYCELPVECTVCGLTLVA 842

Query: 230 SPHLARSYHHLFPIAPFDEVTPLCL--NDPRNRSRS-----TCFGCQQSL 272
           +PHLAR+YHHLFP+  F  ++P  L   +P    R+     TC GC   L
Sbjct: 843 APHLARAYHHLFPLDAFTPISPAELRQTNPDQLPRTETGIITCAGCDVVL 892


>gi|396458460|ref|XP_003833843.1| hypothetical protein LEMA_P066040.1 [Leptosphaeria maculans JN3]
 gi|312210391|emb|CBX90478.1| hypothetical protein LEMA_P066040.1 [Leptosphaeria maculans JN3]
          Length = 618

 Score =  221 bits (562), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 129/372 (34%), Positives = 187/372 (50%), Gaps = 67/372 (18%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
           D RP+R   +    + +VREFF+QNP+SQ+ ++ + DGV   +++L G+P  H+ A+   
Sbjct: 219 DMRPNRFFAMINYTQDYVREFFEQNPISQMSVLGMHDGVCIRVSELSGNPAEHVAAIASL 278

Query: 60  -----GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKC 114
                GK    G  SLQNAL+L +  L   P +G REV+I++ +L + DPGDI  T++ C
Sbjct: 279 RSKEDGK-EPKGAPSLQNALELARATLYHTPPHGTREVIIVFGSLLSLDPGDIHTTVKAC 337

Query: 115 KESKIRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF 170
              +IR S+IG+ A + IC  +   T       Y+VA+D+   KEL++     PP + E 
Sbjct: 338 VRDRIRVSIIGMGARLKICTEIVTRTNAGDTSEYTVAIDQEALKELLLA-TTTPPVVRET 396

Query: 171 ----------AIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDC 220
                     + A L++MGFP R  E  +++C+CH  + +G GYTC RC A+VC LP  C
Sbjct: 397 KQSGTEPPPESAAALMRMGFPSRVIEDQLTMCACHGNLTLG-GYTCSRCSAKVCSLPITC 455

Query: 221 RICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC------------ 268
             C L L+ S HLARSYHHLFP+  +  V+       R    + C GC            
Sbjct: 456 PSCQLTLLLSTHLARSYHHLFPLRNWATVS---WARAREVGSTQCTGCLAAFSKPPPKDN 512

Query: 269 -------------------------QQSLLSSGNK---PGLYVACPKCKKHFCLECDIYI 300
                                    Q   L   ++   P     C  C  HFC++CD++ 
Sbjct: 513 NRHVVEDEDEDDDATQPRKHAASTTQDDALDGEDQRASPSGRYECQACNSHFCIDCDVFA 572

Query: 301 HESLHNCPGCES 312
           H  LHNCPGC S
Sbjct: 573 HMVLHNCPGCLS 584


>gi|428671746|gb|EKX72661.1| conserved hypothetical protein [Babesia equi]
          Length = 409

 Score =  221 bits (562), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 118/312 (37%), Positives = 177/312 (56%), Gaps = 21/312 (6%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MDF+P R+      V+ FV+E F Q P++QIGL+ +++ V+  ++ LG +P+  I+ L  
Sbjct: 107 MDFKPDRLYCAFGAVKEFVKELFHQGPITQIGLIGLRNKVSTMISQLGTNPDEQIELLGN 166

Query: 61  KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
            L  G SG +SLQN L++   +LS +PSY  REVLI++ +  T DPG+I+ T+ K KE+ 
Sbjct: 167 ALKEGPSGTASLQNGLEMALTILSTLPSYTTREVLIIFGSNRTLDPGNILATLYKLKENH 226

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
           I  + I L+ E++I K++C +TGG  SVA+D +H + L+ +   PPP    +    L K+
Sbjct: 227 ICVNAISLAPELYILKNICTETGGMCSVAMDAAHLRTLLNDFTIPPPW-HNWMEPVLTKV 285

Query: 179 GFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
            FP      + S+C+CH  +     Y CP+C ++ C +PT CR CG+ LVS P ++R+++
Sbjct: 286 AFPPLKKTTTASLCACHSNL-THTAYICPQCHSKSCSIPTRCRCCGIYLVSPPDISRAFY 344

Query: 239 HLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
           HL P   F       + D R     TC        S+G+       CP C   FC  CD 
Sbjct: 345 HLIPPKTF-------IKDSRTGRCDTC----NYETSNGS------TCPDCASFFCEYCDA 387

Query: 299 YIHESLHNCPGC 310
           YIH  LH CP C
Sbjct: 388 YIHNELHQCPHC 399


>gi|452824298|gb|EME31302.1| transcription initiation factor TFIIH subunit H2 isoform 2
           [Galdieria sulphuraria]
          Length = 334

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 110/250 (44%), Positives = 160/250 (64%), Gaps = 3/250 (1%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +PSR  V     + F+  +F++NP+SQ+ +V ++DGVA  L+ +G +P  H + +   
Sbjct: 78  DVKPSRGEVCLSCAQKFLYSYFNENPISQLAVVVLRDGVAEKLSSMGSNPRQHSEVVKNA 137

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            + G  G+ SLQN LD+   LL  IPSYG REVLILY+++S+CDPGDI +TI+K ++ +I
Sbjct: 138 NQKGFYGNCSLQNGLDVALSLLHSIPSYGSREVLILYNSISSCDPGDIRQTIEKLEKERI 197

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RC+VIG++AE++I K+L   T GSY V ++ESH  EL+ +   P   I       L++MG
Sbjct: 198 RCNVIGMAAELYILKYLAARTHGSYFVCMNESHLLELLEDFVVPSALIENNTKTALVRMG 257

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP         +C   K ++  V + CPRC+    E+P +C +CGL LVSS  LARSYHH
Sbjct: 258 FPTLKAYKEPKVCLNDKVLRNQV-FVCPRCECCYGEIPIECVLCGLILVSSSQLARSYHH 316

Query: 240 LFPIAPFDEV 249
           LFP+A F E+
Sbjct: 317 LFPVANFHEL 326


>gi|260943474|ref|XP_002616035.1| hypothetical protein CLUG_03276 [Clavispora lusitaniae ATCC 42720]
 gi|238849684|gb|EEQ39148.1| hypothetical protein CLUG_03276 [Clavispora lusitaniae ATCC 42720]
          Length = 459

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 130/354 (36%), Positives = 189/354 (53%), Gaps = 47/354 (13%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D RP+R       ++ FV EFFDQNP++Q+ +V +++GVA+ +++L GS + HI+ L   
Sbjct: 107 DLRPTRFGATVAHLQDFVAEFFDQNPIAQMAVVMMRNGVASVVSELSGSAQLHIERLRAL 166

Query: 61  -----KLGCSGDSSLQNALDLVQGLLSQ---IPSYGHREVLILYSALSTCDPGDIMETIQ 112
                 L   GD SLQNAL+L + LL+      S   RE+++++ AL T DPGDI +TI 
Sbjct: 167 RARQHTLEPKGDPSLQNALELARALLTAGGGPTSRSSREIVVVFGALFTSDPGDIHQTID 226

Query: 113 KCKESKIRCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIA 168
                +IR  VIGL+A++ IC+ L   T       Y V + ++HF+EL+M    P  A+A
Sbjct: 227 SLVRDEIRVRVIGLAAQVAICRELVARTNAGECAGYGVIVGDAHFRELLMACVEPL-AVA 285

Query: 169 EFAIAN-----LIKMGFPQRA--GEGSISICSCHKEVK-VGVGYTCPRCKARVCELPTDC 220
             A        +++MGFP RA    G   +C+CH +      GY CP+C++RVC LP  C
Sbjct: 286 RRAAETATGVPVLRMGFPSRAPAAPGMPVLCACHPQGGGADTGYWCPQCRSRVCSLPCVC 345

Query: 221 RICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRST---------------- 264
            +CGL L+ S HLAR++HHL P   ++           N S +                 
Sbjct: 346 PVCGLMLILSTHLARTHHHLVPPPDYETKEETKETKETNDSNNDNSNSNNDSSNSNNDSS 405

Query: 265 ------CFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
                 CFGC   L       G   AC  C + FC++CD+++HE LHNCPGCE+
Sbjct: 406 GNSSERCFGC---LAQGDGALGARYACAACGQRFCIDCDVFVHEVLHNCPGCEA 456


>gi|238610326|ref|XP_002397695.1| hypothetical protein MPER_01834 [Moniliophthora perniciosa FA553]
 gi|215472703|gb|EEB98625.1| hypothetical protein MPER_01834 [Moniliophthora perniciosa FA553]
          Length = 251

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 111/248 (44%), Positives = 166/248 (66%), Gaps = 13/248 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLG-GSPESHIKALMG 60
           D RP+R +++ +    FV E+FDQNPL Q+G+V   +  +   ++   G+P+  ++++  
Sbjct: 5   DMRPTRFSLMLQYAREFVLEWFDQNPLGQMGVVVPIEVGSLIYSNFEPGNPQDVLRSISD 64

Query: 61  --KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
             KL   GD SLQNA+D+ +  ++ +P++  RE++IL+ +L+T DPG+I ET+ +C ++K
Sbjct: 65  RHKLEPMGDPSLQNAIDMARNSMNHLPTHSSREIVILFGSLTTVDPGNIHETLDECVKNK 124

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP--------AIAEF 170
           IR S++ L+AEM IC+ LC  TGG + VAL+E HFK+L+ E  PPP         A +  
Sbjct: 125 IRISLVALAAEMRICRDLCIKTGGEFGVALNEGHFKDLLFELIPPPAQRAISKAGAGSTN 184

Query: 171 AIANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
           A ANL+ MGFP R  + S  S+C CH ++K   G+ CPRC A+VC++PTDC ICGL +VS
Sbjct: 185 ASANLMMMGFPTRLPDASPPSLCVCHSDLKSD-GFLCPRCLAKVCDVPTDCDICGLMIVS 243

Query: 230 SPHLARSY 237
           SPHLARSY
Sbjct: 244 SPHLARSY 251


>gi|406697522|gb|EKD00781.1| component of RNA polymerase transcription factor TFIIH, Ssl1p
           [Trichosporon asahii var. asahii CBS 8904]
          Length = 436

 Score =  213 bits (542), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 115/291 (39%), Positives = 172/291 (59%), Gaps = 33/291 (11%)

Query: 15  VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK--LGCSGDSSLQN 72
           + +FV E+FDQNPL QIG++ +++ +A  L  + G+P+  I+AL  K  L  +G+ SLQN
Sbjct: 112 LRSFVIEWFDQNPLGQIGIILLRNRLAQTLVPMCGNPQEIIEALADKRKLEPAGEPSLQN 171

Query: 73  ALDLVQGLLSQIPSYGHREVLILYSALSTCDPG---DIMETIQKCKESKIRCSVIGLSAE 129
            L + +G +S +PS    EVL+L+SA+ST DP    +I + + +   +++R ++I LSAE
Sbjct: 172 GLVMARGGMSHLPSTSSLEVLVLFSAISTADPDGSMNIHQVLDELVSARVRVTIISLSAE 231

Query: 130 MFICKHLCQDTGGSYSVALDESHFKELIMEHAPPP------PAIAEF------------- 170
           + ICK + + TGG + VA+DE H+KEL+ E  PPP      P  A               
Sbjct: 232 IKICKQIAERTGGRFGVAIDEDHYKELLWETIPPPAQTITAPVTANVREALARGGRQAPG 291

Query: 171 ------AIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICG 224
                  + +L+ MGFP R      S C+CH  +K G GY CPRC A++C++PTDC +CG
Sbjct: 292 APKRPPPLGDLLVMGFPTRLPGAGESFCACHGLLKRG-GYMCPRCGAKLCDVPTDCEVCG 350

Query: 225 LQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSS 275
           L +VSSPHLARS+  LFP++ +D +    L   +   R+ CFGC     +S
Sbjct: 351 LMVVSSPHLARSFWLLFPVSKYDTLGRGVLT--KEAVRAECFGCDAPFPTS 399


>gi|167376758|ref|XP_001734134.1| TFIIH basal transcription factor complex p44 subunit [Entamoeba
           dispar SAW760]
 gi|165904499|gb|EDR29715.1| TFIIH basal transcription factor complex p44 subunit, putative
           [Entamoeba dispar SAW760]
          Length = 362

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 116/319 (36%), Positives = 175/319 (54%), Gaps = 18/319 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D++P+R+  +  +++ F  EF+  NPLS+IGL+  ++  A        S    I+  M  
Sbjct: 54  DYQPNRITFIQTKLKEFFEEFYIDNPLSEIGLICTRNKTAE--QQFPLSRNLIIQKEMKW 111

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
             C G  SLQ +LDL Q +LS   +   RE++I+ S+++TCDPG+I +TI+  K++ IRC
Sbjct: 112 GNCEGQMSLQISLDLAQQILSNTSNSSSREIIIVTSSITTCDPGNIYDTIETLKKTTIRC 171

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM--G 179
           SVI  + EM I K L + TGG Y   ++E H + ++     PP           I++  G
Sbjct: 172 SVISFAPEMHITKLLTKITGGDYHTIMNEKHAENVLNTFIIPPIYKENTYEPKTIQLYIG 231

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP++      +IC CH ++ +   + CP C    CELP  C+IC   L+ S H ARSYH 
Sbjct: 232 FPKQLNIP--TICECHSKIDIN-HFECPICNFCYCELPIQCKICNAILLLSHHFARSYHF 288

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNK----PGLYVACPKCKKHFCLE 295
           +FPI PF  +T +       + +  CFGC + +     K    P     C KC K++C E
Sbjct: 289 MFPIEPFKIITMI-------KPQEKCFGCGKEIDDRIEKKEEEPVNIYQCKKCLKYYCNE 341

Query: 296 CDIYIHESLHNCPGCESLR 314
           CD +IH+ L+NCPGCES +
Sbjct: 342 CDSFIHDVLYNCPGCESKK 360


>gi|399216155|emb|CCF72843.1| unnamed protein product [Babesia microti strain RI]
          Length = 408

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 119/317 (37%), Positives = 172/317 (54%), Gaps = 11/317 (3%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           +DF+P+R+     Q   F+RE F Q P+SQ+G+V +K+ +A  ++  G + +  I  L  
Sbjct: 94  IDFKPNRLHTAVAQTSKFIRELFYQCPISQVGIVAMKNKLAKLISPFGSNTDEQISRLND 153

Query: 61  KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
            L  G  GD SLQNAL++   LL  +P Y  RE+LI++ ++ TCDPG+I  TI++ K+  
Sbjct: 154 LLSDGPEGDISLQNALEMSISLLDDVPQYTTREILIVFGSIKTCDPGNIFATIKRLKKHH 213

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
           I    I LS+E++I KH+C +T GS+SVA+D      L   +   PP   +  +  L+K+
Sbjct: 214 ITVDAISLSSELYILKHICTETNGSHSVAIDTHDLTSLFNIYT-NPPRWQDGMLPILVKV 272

Query: 179 GFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
           GFP      + S+C CH ++ V   YTCP+C A  C +P+ C+ C + LVS   + R +H
Sbjct: 273 GFPSLTKSNTASLCVCHGKL-VFTSYTCPQCSAITCNIPSRCKCCQIYLVSPADVCRGFH 331

Query: 239 HLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA--CPKCKKHFCLEC 296
           HL     F  V    L         TC  C   L +S    G   A  CP C K FC  C
Sbjct: 332 HLCAPPEFTNVPADILG-----GCDTCASCSVELKNSVMYSGEDRAKQCPGCMKIFCRIC 386

Query: 297 DIYIHESLHNCPGCESL 313
           D +IH +LH CP C  L
Sbjct: 387 DEFIHGTLHQCPFCIKL 403


>gi|225678604|gb|EEH16888.1| TFIIH basal transcription factor complex p44 subunit
           [Paracoccidioides brasiliensis Pb03]
          Length = 493

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 116/288 (40%), Positives = 158/288 (54%), Gaps = 22/288 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  +  +  + FVREFFDQNP+SQ+G+V ++DG+A  ++D+ G+P  HI A+ G 
Sbjct: 121 DLRPTRYLLTLRYAQDFVREFFDQNPISQLGVVGMRDGLAVRISDMSGNPTEHILAIQGL 180

Query: 62  LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI +TI      KI
Sbjct: 181 RAKDPKGMPSLQNALEMARGALFHTPSHGTREVLIIYGALLSSDPGDIHKTITSLITDKI 240

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEFAI--- 172
              V+GL+A++ IC+ L   T       Y+VA++E HF+EL++    PP           
Sbjct: 241 HVYVLGLAAQVSICQELVTRTNNGDDSGYNVAMNEQHFRELVLNVTTPPATTLASHTAAA 300

Query: 173 ------------ANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDC 220
                         L+ MGFP R      ++C+CH       GY CPRC  +VC LP  C
Sbjct: 301 AANGTGTNPSTDGTLLPMGFPNRHLTPHPTLCACHSTPSRS-GYLCPRCCTKVCTLPASC 359

Query: 221 RICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC 268
             C L L+ S HLARSYHHLFP+  + EV+           R  CF C
Sbjct: 360 PSCNLTLILSTHLARSYHHLFPLMNWVEVSWRKAARAEAEGRVGCFAC 407


>gi|432876344|ref|XP_004073002.1| PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH
           subunit 2-like [Oryzias latipes]
          Length = 340

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 130/315 (41%), Positives = 171/315 (54%), Gaps = 62/315 (19%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVT--VKDGVANCLT--DLGGSPESHIKA 57
           D +P+R+A   K +E FV E+FDQNP+SQ+G  +  + D     L   +  G+P+ HI A
Sbjct: 75  DLKPNRLAATLKLLEGFVDEYFDQNPISQVGKXSSYLIDDHRRALNTGEASGNPKKHITA 134

Query: 58  LMGKL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCK 115
           L   +   C G+ SL NAL+L    L  +P +  REVLI+ S+L+TCDP +I + IQ  K
Sbjct: 135 LKKAVDSACVGEPSLYNALNLAVQTLKHMPGHTSREVLIILSSLTTCDPANIYDLIQTLK 194

Query: 116 ESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
             K+R SVIGLSAE+ +C  L ++TGGSY V LDESHFKEL+M H  PPPA +  +  +L
Sbjct: 195 GLKVRVSVIGLSAEVRVCTVLTRETGGSYHVILDESHFKELLMLHVKPPPA-SSSSECSL 253

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           I+MG                                             L LVS+PHLAR
Sbjct: 254 IRMG---------------------------------------------LTLVSAPHLAR 268

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLE 295
           S+HHLFP+ PF E +   L   R      C GCQ  L         YV CP C+  FC+E
Sbjct: 269 SFHHLFPMVPFLESSVEELQGSR-----FCEGCQGEL----RDKSAYV-CPSCRGVFCVE 318

Query: 296 CDIYIHESLHNCPGC 310
           CD++IH+SLH CP C
Sbjct: 319 CDLFIHDSLHCCPCC 333


>gi|116199505|ref|XP_001225564.1| hypothetical protein CHGG_07908 [Chaetomium globosum CBS 148.51]
 gi|88179187|gb|EAQ86655.1| hypothetical protein CHGG_07908 [Chaetomium globosum CBS 148.51]
          Length = 463

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 124/371 (33%), Positives = 177/371 (47%), Gaps = 99/371 (26%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--- 58
           D  P+R  +       FVRE+F+QNP+SQ+G+V ++DG+A  ++D+GG+P  HI+ L   
Sbjct: 118 DLLPNRYLLTLNYAVDFVREYFEQNPISQLGIVGMRDGIAVRISDMGGNPAEHIEKLRVW 177

Query: 59  MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
             +    G+ SLQNAL++ +G L   PS+G RE+LI+Y AL + DPGDI +TI      +
Sbjct: 178 AEQQEPQGNPSLQNALEMCRGALFHTPSHGTREILIIYGALLSSDPGDIHDTITNLLADR 237

Query: 119 IRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEFAIAN 174
           IR S++GL+A++ IC+ LC  T G    +Y+VAL E HF+EL +    PP          
Sbjct: 238 IRVSIVGLAAQVAICEQLCSRTNGGDPSNYAVALHEQHFRELFLAATTPP---------- 287

Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
                                                 VC LP +C +C L LV S HLA
Sbjct: 288 --------------------------------------VCRLPAECPVCALTLVLSTHLA 309

Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL---------------------- 272
           RSYHHLFP+  +  V+     D R R  S CF C  +                       
Sbjct: 310 RSYHHLFPLRGWAAVS---WADARARGSSACFACLAAFPAVPPSGQIQTQTQVQANGNGN 366

Query: 273 ------------------LSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES-L 313
                              ++G       AC  C  HFC++CD++ HE +HNCPGC+S  
Sbjct: 367 GNGRDTDRDKGRDAAEVKAAAGVSESGRYACTVCNNHFCIDCDVFAHEVIHNCPGCQSDT 426

Query: 314 RHSNPIVANEG 324
           R +  + A +G
Sbjct: 427 RGTEGLQAADG 437


>gi|189195190|ref|XP_001933933.1| TFIIH basal transcription factor complex p44 subunit [Pyrenophora
           tritici-repentis Pt-1C-BFP]
 gi|187979812|gb|EDU46438.1| TFIIH basal transcription factor complex p44 subunit [Pyrenophora
           tritici-repentis Pt-1C-BFP]
          Length = 460

 Score =  209 bits (532), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 114/300 (38%), Positives = 167/300 (55%), Gaps = 30/300 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R   +    + +VREFF+QNP+SQ+ ++ + DGV   +++L G+P  H+ A+ G 
Sbjct: 118 DMRPNRFITMINFTQEYVREFFEQNPISQMSVLGMHDGVCIRVSELSGNPAEHVAAIQGI 177

Query: 62  LGCS------GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCK 115
                     G  SLQNAL+L +  L   PS+G REV++++ +L + DPGDI +T++ C 
Sbjct: 178 RSKDDGKEPKGAPSLQNALELARATLYHTPSHGTREVIVVFGSLLSLDPGDIHQTVKACV 237

Query: 116 ESKIRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEFA 171
             +IR SVIG+SA + IC  +   T       Y++A D+   KEL++    PP       
Sbjct: 238 RDRIRVSVIGMSARLKICTEIVTRTNAGDESEYTIATDQEMLKELLLATTTPPVIRQSAP 297

Query: 172 IAN------------LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTD 219
           +A             L+ MGFP R  E  +++C+CH  + +G GYTC RC A+VC LP  
Sbjct: 298 VATTTAAPSPESAAALMMMGFPSRVVEDQLTMCACHGNLTMG-GYTCSRCSAKVCSLPIT 356

Query: 220 CRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP 279
           C  C L L+ S HLARSYHHLFP+  +  V+    +  R +    C GC    LSS +KP
Sbjct: 357 CPSCQLTLLLSTHLARSYHHLFPLRNWATVS---WSRAREKGSKECVGC----LSSFSKP 409


>gi|449019861|dbj|BAM83263.1| similar to TFIIH subunit SSL1 [Cyanidioschyzon merolae strain 10D]
          Length = 549

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 121/361 (33%), Positives = 188/361 (52%), Gaps = 54/361 (14%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +PSR + +    + FV++FF +N +SQ+ +V ++DG+A  ++++ GS   HI AL   
Sbjct: 180 DLKPSRWSAILSGAQYFVQDFFSENAVSQLAIVILQDGIAAQVSEMSGSVRLHIDALRSL 239

Query: 61  --------KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQ 112
                     G  G +SLQN L+L   + + +P +G RE+LI+Y  LSTCDPGDI +TIQ
Sbjct: 240 QRGTNKFLHDGDGGQTSLQNCLELGLRIHASVPRFGTREMLIVYGGLSTCDPGDIHKTIQ 299

Query: 113 KCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPP--------- 163
            C ++ IR SVIGL AE++I +   + T G+Y V +D  HF +L+ +  PP         
Sbjct: 300 NCVDASIRVSVIGLGAEVYILRATAERTQGTYRVPMDNDHFHQLLRQQVPPPPPPPQASR 359

Query: 164 -----PPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPT 218
                 P+ A +    ++ MGFP +     + + +        +G+ CPRC+  + E+P 
Sbjct: 360 STKHLEPSEAPWTAMQMV-MGFPPKLARDRV-VVAIDTRKACRLGFECPRCRQWLSEVPG 417

Query: 219 DCRICGLQLVSSPHLARSYHHLFPIAPFDE-------VTPLCLNDPRNRS---------- 261
           +C +CGL L S  HLARSYHH+FP   F+E            ++D    S          
Sbjct: 418 NCILCGLTLTSVLHLARSYHHIFPAPRFEEREEPSSSRASALMSDEEGFSVSPSSVPERM 477

Query: 262 --------RSTCFGCQQSLLSSGNKPGLYVACPK-CKKHFCLECDIYIHESLHNCPGCES 312
                   +  C GC ++L     K  L + C   C++ FC++CD  +HE LH CP C +
Sbjct: 478 PDAAAPPVKWQCSGCSRTL---STKDDLQLQCTGICRQLFCIDCDQLVHERLHTCPACLA 534

Query: 313 L 313
           +
Sbjct: 535 M 535


>gi|440296353|gb|ELP89180.1| TFIIH basal transcription factor complex p44 subunit, putative
           [Entamoeba invadens IP1]
          Length = 358

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 115/314 (36%), Positives = 172/314 (54%), Gaps = 19/314 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF P+R++ +  ++  F   FF  NPLSQIGLV  ++  A+ +  L  +     +   GK
Sbjct: 54  DFHPTRISFINTKMNNFFDNFFVDNPLSQIGLVATRNSTADIICPLTRNISKLKETAWGK 113

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
             C+G  S+Q +L+L    L        RE++ + SA+STCDPG+I ETI K +++ IR 
Sbjct: 114 --CNGSISVQISLELCLQFLKDTKDMTSREIVFVTSAISTCDPGNIFETIGKVRDNLIRV 171

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPP--PAIAEFAIANLIKMG 179
           SVI +SAE+ + + + + TGG Y   ++E H +E+I     PP   A  +     L+ +G
Sbjct: 172 SVISISAEVHVMRQMTKVTGGDYFTIMNEKHAEEVIKTFIEPPRYKATGKEPKVELLFVG 231

Query: 180 FPQRAGEGSISICSCHKEVKVGVG-YTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
           FP+       +IC CH   K+G+G Y CP C    CELP +C++C   L+ S H ARS+H
Sbjct: 232 FPRELERS--TICECHS--KIGLGHYECPVCGFCYCELPVECKVCKANLIFSHHFARSFH 287

Query: 239 HLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
            LFP+ PF     +   +P       CF C +      ++   +  C KC K FC  C+I
Sbjct: 288 FLFPVKPF-----VLEKEPE-----VCFACGKKYEEKKDEEMKFWKCEKCSKSFCETCNI 337

Query: 299 YIHESLHNCPGCES 312
           +IH+ L NCPGCE+
Sbjct: 338 FIHDVLFNCPGCEN 351


>gi|336468464|gb|EGO56627.1| hypothetical protein NEUTE1DRAFT_84011 [Neurospora tetrasperma FGSC
           2508]
 gi|350289275|gb|EGZ70500.1| TFIIH basal transcription factor complex, subunit SSL1 [Neurospora
           tetrasperma FGSC 2509]
          Length = 416

 Score =  208 bits (530), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 113/254 (44%), Positives = 148/254 (58%), Gaps = 32/254 (12%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D  P+R  VV      FVRE+F+QNP+SQ+G+V ++DG+A  ++DL G+P  HI+     
Sbjct: 94  DLLPNRYRVVLNNAIGFVREYFEQNPISQLGIVGMRDGIAVRISDLSGNPAEHIE----- 148

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
                               +Q PS+  REVLI+Y AL + DPGDI +TI      +IR 
Sbjct: 149 --------------------NQTPSHATREVLIIYGALVSIDPGDIHDTINDLVADRIRV 188

Query: 122 SVIGLSAEMFICKHLCQDTG---GSYSVALDESHFKELIMEHAPPP-PAIAEFAIANLIK 177
           SV+GL+ ++ IC  LC+ T    G+YSVA+DE H KEL      PP     E   A+L+ 
Sbjct: 189 SVVGLAGQVAICSELCKRTNNHDGNYSVAVDEVHLKELFFAATTPPVTRTPEQNTASLLM 248

Query: 178 MGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           MGFP R  A +  +S C+CH +     GYTCPRC  +VC LP DC IC L L+ S HLAR
Sbjct: 249 MGFPSRTLAPKDHVSFCACHAK-PTREGYTCPRCGIKVCRLPIDCPICKLTLIQSTHLAR 307

Query: 236 SYHHLFPIAPFDEV 249
           SYHHLFP+  F EV
Sbjct: 308 SYHHLFPLKVFVEV 321


>gi|330925787|ref|XP_003301194.1| hypothetical protein PTT_12637 [Pyrenophora teres f. teres 0-1]
 gi|311324299|gb|EFQ90721.1| hypothetical protein PTT_12637 [Pyrenophora teres f. teres 0-1]
          Length = 462

 Score =  208 bits (530), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 115/300 (38%), Positives = 168/300 (56%), Gaps = 30/300 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R   +    + +VREFF+QNP+SQ+ ++ + DGV   +++L G+P  H+ A+ G 
Sbjct: 118 DMRPNRFITMINFTQEYVREFFEQNPISQMSVLGMHDGVCIRVSELSGNPAEHVAAIQGL 177

Query: 62  LGCS------GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCK 115
                     G  SLQNAL+L +  L   PS+G REV++++ +L + DPGDI +T++ C 
Sbjct: 178 RSKDDGKEPKGAPSLQNALELARATLYHTPSHGTREVIVVFGSLLSLDPGDIHQTVKACV 237

Query: 116 ESKIRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP------P 165
             +IR SVIG+SA + IC  +   T       Y++A D+   KEL++    PP      P
Sbjct: 238 RDRIRVSVIGMSARLKICTEIVTRTNAGDESEYTIATDQEMLKELLLATTTPPVIRQSAP 297

Query: 166 AIAEFAI------ANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTD 219
           A    A+      A L+ MGFP R  E  +++C+CH  + +G GYTC RC  +VC LP  
Sbjct: 298 AATTPAVPPPESAAALMMMGFPSRVVEDQLTMCACHGNLTMG-GYTCSRCSTKVCSLPIT 356

Query: 220 CRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP 279
           C  C L L+ S HLARSYHHLFP+  +  V+    +  R +    C GC    L S +KP
Sbjct: 357 CPSCQLTLLLSTHLARSYHHLFPLRNWATVS---WSRAREKGSKECVGC----LVSFSKP 409


>gi|295657028|ref|XP_002789089.1| suppressor of stem-loop protein [Paracoccidioides sp. 'lutzii'
           Pb01]
 gi|226284987|gb|EEH40553.1| suppressor of stem-loop protein [Paracoccidioides sp. 'lutzii'
           Pb01]
          Length = 513

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 115/290 (39%), Positives = 159/290 (54%), Gaps = 24/290 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R  +  +  + FVREFFDQNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+ G 
Sbjct: 121 DLRPTRYLLTLRYAQDFVREFFDQNPISQLGVLGMRDGLAVRISDMSGNPTEHILAIQGL 180

Query: 62  LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI +TI      KI
Sbjct: 181 RAKDPKGMPSLQNALEMARGALFHTPSHGTREVLIIYGALLSSDPGDIHKTITSLITDKI 240

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPP---------- 165
              V+GL+A++ IC+ L   T       Y+VA++E HF+EL++    PP           
Sbjct: 241 HVYVLGLAAQVSICQELVTRTNNGDDSGYNVAMNEQHFRELVLNVTTPPATTLASHTTTT 300

Query: 166 -------AIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPT 218
                  +        L+ MGFP R      ++C+CH       GY CPRC  +VC LP 
Sbjct: 301 TGAANGTSTNTSTDGTLLPMGFPNRHLTPHPTLCACHS-TPSRSGYLCPRCCTKVCTLPA 359

Query: 219 DCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC 268
            C  C L L+ S HLARSYHHLFP+  + EV+           R  CF C
Sbjct: 360 SCPSCKLTLILSTHLARSYHHLFPLMNWVEVSWRKAARAEAEGRVGCFAC 409


>gi|85080851|ref|XP_956615.1| suppressor of stem-loop protein 1 [Neurospora crassa OR74A]
 gi|28881240|emb|CAD70478.1| related to transcription factor TFIIH chain p47 [Neurospora crassa]
 gi|28917686|gb|EAA27379.1| suppressor of stem-loop protein 1 [Neurospora crassa OR74A]
          Length = 416

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 112/254 (44%), Positives = 148/254 (58%), Gaps = 32/254 (12%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D  P+R  VV      FVRE+F+QNP+SQ+G+V ++DG+A  ++DL G+P  HI+     
Sbjct: 94  DLLPNRYRVVLNNAIGFVREYFEQNPISQLGIVGMRDGIAVRISDLSGNPAEHIE----- 148

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
                               +Q PS+  RE+LI+Y AL + DPGDI +TI      +IR 
Sbjct: 149 --------------------NQTPSHATREILIIYGALVSIDPGDIHDTINDLVADRIRV 188

Query: 122 SVIGLSAEMFICKHLCQDTG---GSYSVALDESHFKELIMEHAPPP-PAIAEFAIANLIK 177
           SV+GL+ ++ IC  LC+ T    G+YSVA+DE H KEL      PP     E   A+L+ 
Sbjct: 189 SVVGLAGQVAICSELCKRTNNHDGNYSVAVDEVHLKELFFAATTPPVTRTPEQNTASLLM 248

Query: 178 MGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           MGFP R  A +  +S C+CH +     GYTCPRC  +VC LP DC IC L L+ S HLAR
Sbjct: 249 MGFPSRTLAPKDHVSFCACHAK-PTREGYTCPRCGIKVCRLPIDCPICKLTLIQSTHLAR 307

Query: 236 SYHHLFPIAPFDEV 249
           SYHHLFP+  F EV
Sbjct: 308 SYHHLFPLKVFVEV 321


>gi|302418082|ref|XP_003006872.1| suppressor of stem-loop protein [Verticillium albo-atrum VaMs.102]
 gi|261354474|gb|EEY16902.1| suppressor of stem-loop protein [Verticillium albo-atrum VaMs.102]
          Length = 468

 Score =  207 bits (527), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 123/348 (35%), Positives = 185/348 (53%), Gaps = 38/348 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D  P+R+ +  +    FV+E+F+QNP+SQ+G+V ++DGVA  ++++GG+P  H++A+   
Sbjct: 101 DLLPTRLRLTWRLAREFVKEYFEQNPISQMGIVGMRDGVAVRVSEMGGNPVEHVEAINKW 160

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            ++   G  SLQNAL++ +G L   PS+G REVL++Y AL + DPGDI +TI      +I
Sbjct: 161 AEIDPQGSPSLQNALEMCRGALFHAPSHGTREVLVVYGALLSSDPGDIHDTIANLLTDRI 220

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
           R S++GL+A++ IC+ LC+ T       Y++ALDE H++EL++    PP    +  + A+
Sbjct: 221 RVSIVGLAAQVAICEELCRKTNAGDATQYNIALDEMHYRELLLRTTTPPVTRTQAQSTAS 280

Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS--- 229
           L+ MGFP R  A   + + C+CH    V  GY C RC  RVC LP +C  CGL       
Sbjct: 281 LLMMGFPSRTLAPADTTAYCACHN-TPVREGYLCTRCGTRVCRLPAECPACGLTPHPLHP 339

Query: 230 -------------------SPHLARSYHHLFPIAPFDEVTPLCL------NDPRNRSRST 264
                               P  A S      +APF +             D R R  + 
Sbjct: 340 PRPLLPPPLSRCATGSRSPGPRAAASRACYSCLAPFPDPPAQRRGHRANGTDGRRRGDAG 399

Query: 265 CFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
                 +    G       AC  C  HFC++CD++ HE +HNCPGC+S
Sbjct: 400 DEEPAPAPKQKGVSESSRYACQVCGNHFCIDCDVFAHEVIHNCPGCQS 447


>gi|407044792|gb|EKE42827.1| TFIIH basal transcription factor complex subunit, putative
           [Entamoeba nuttalli P19]
          Length = 363

 Score =  207 bits (526), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 114/318 (35%), Positives = 175/318 (55%), Gaps = 20/318 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D++P+R+  +  +++ F  EF+  NPLS+IGL+  ++  A        S    I+  M  
Sbjct: 54  DYQPNRITFIQTKLKEFFEEFYIDNPLSEIGLICTRNKTAE--QQFPLSRNLIIQKEMKW 111

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
             C G  SLQ +LDL Q +LS   +   RE++I+ S+++TCDPG+I +TI+  K++ IRC
Sbjct: 112 GNCEGQMSLQISLDLAQQILSTTSNSSSREIIIVTSSITTCDPGNIYDTIETLKKTTIRC 171

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM--G 179
           SVI  + EM I + L + TGG Y   ++E H ++++     PP           I++  G
Sbjct: 172 SVISFAPEMHITRLLTKVTGGDYHTIMNEKHAEDVLHTFIIPPIYKENAYEPKTIQLYIG 231

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP++      +IC CH ++ +   + CP C    CELP  C+IC   L+ S H ARSYH 
Sbjct: 232 FPKQLNVP--TICECHSKIDIN-HFECPICGFCYCELPIQCKICNAILLLSHHFARSYHF 288

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP-----GLYVACPKCKKHFCL 294
           +FPI PF  +  +       + +  CFGC + +     K       +Y  C KC K++C 
Sbjct: 289 MFPIEPFKVIAMI-------KPQEKCFGCGKEIDDRMEKKEEETVNVY-QCKKCLKYYCN 340

Query: 295 ECDIYIHESLHNCPGCES 312
           ECD +IH+ L+NCPGCES
Sbjct: 341 ECDSFIHDVLYNCPGCES 358


>gi|402076515|gb|EJT71938.1| hypothetical protein GGTG_11190 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 540

 Score =  206 bits (525), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 117/279 (41%), Positives = 160/279 (57%), Gaps = 32/279 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D  PSR  V       F REFFDQNP+SQ+ +V ++DGVA  ++D  G+P  HI+ L  +
Sbjct: 121 DMLPSRHRVAIAYAVDFAREFFDQNPISQLAIVGMRDGVAVRISDTSGNPADHIEKLRQL 180

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            +L   G+ SLQNAL++ +G L   PS+G REV++++ ALS+ DPGDI ETI      +I
Sbjct: 181 VELPPMGNPSLQNALEMCRGALFNTPSHGTREVVMIFGALSSSDPGDIHETIDGLVTDRI 240

Query: 120 RCSVIGLSAEMFICKHLC-----------------QDTGGSYSVALDESHFKELIMEHAP 162
           R S+IGLSA++FIC+ LC                 +D    Y VA+D+ HF+EL++    
Sbjct: 241 RVSIIGLSAQVFICEDLCRRTNAGHSASEPVLVGGEDESSGYMVAMDDVHFRELLLSVTR 300

Query: 163 PPPA------------IAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCK 210
           PP                  A A+L+ MGFP R      ++C+CH       G+ C RC 
Sbjct: 301 PPATRQSQLQQQGAAAGGGEAGASLLMMGFPSRTTGTDPALCACHNR-PTRDGFLCTRCG 359

Query: 211 ARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEV 249
           +RVC LP +C  CGL L+ S HLARSYHHLFP+  + EV
Sbjct: 360 SRVCRLPAECPACGLTLILSTHLARSYHHLFPLRIWVEV 398



 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 28/35 (80%), Gaps = 1/35 (2%)

Query: 277 NKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCE 311
           ++ G Y+ C  C+KHFC++CD++ HE +HNCPGC+
Sbjct: 483 SESGRYM-CQVCRKHFCIDCDVFAHEVVHNCPGCQ 516


>gi|340378764|ref|XP_003387897.1| PREDICTED: general transcription factor IIH subunit 2-like
           [Amphimedon queenslandica]
          Length = 354

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 168/314 (53%), Gaps = 43/314 (13%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +P+R       ++ F+ EFFD NP+SQ             + ++      H +AL   
Sbjct: 81  DLKPNRFTCTKTVLKKFLDEFFDFNPISQRDRRRYHTNTIINICNVS----VHKRALDTD 136

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           + CSG+ SLQNA++L   +L  IP++  REV+I++ +L++CDPG+I  TI    +  IRC
Sbjct: 137 VECSGECSLQNAMELSLRMLRHIPTHASREVIIIHGSLTSCDPGNIFSTINDFNQQNIRC 196

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           SVIGL+A + +C  +C+ T G Y V LDE HF++ +++H  PP   A    A LIKM   
Sbjct: 197 SVIGLAASVKLCHTICERTSGVYQVILDEVHFRDCLLQHCRPPGVKASTETA-LIKM--- 252

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
              GEG           +   G     C+++ CELP +C  CG+ LV +P LARSYHHLF
Sbjct: 253 ---GEGVSLNIKLWPHYQFVCGKEQAWCQSKYCELPVECVTCGITLVLAPQLARSYHHLF 309

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
           P+  F E T                              +YV CP C+  FC++CD+Y H
Sbjct: 310 PLKMFIEST------------------------------VYV-CPSCRHSFCIDCDLYCH 338

Query: 302 ESLHNCPGCE-SLR 314
           E+LHNCPGCE SLR
Sbjct: 339 ETLHNCPGCENSLR 352


>gi|67479545|ref|XP_655154.1| TFIIH basal transcription factor complex subunit [Entamoeba
           histolytica HM-1:IMSS]
 gi|56472271|gb|EAL49768.1| TFIIH basal transcription factor complex subunit, putative
           [Entamoeba histolytica HM-1:IMSS]
 gi|449704914|gb|EMD45073.1| TFIIH basal transcription factor complex p44 subunit, putative
           [Entamoeba histolytica KU27]
          Length = 363

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 114/318 (35%), Positives = 175/318 (55%), Gaps = 20/318 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D++P+R+  +  +++ F  EF+  NPLS+IGL+  ++  A        S    I+  M  
Sbjct: 54  DYQPNRITFIQTKLKEFFEEFYIDNPLSEIGLICTRNKTAE--QQFPLSRNLIIQKEMKW 111

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
             C G  SLQ +LDL Q +LS   +   RE++I+ S+++TCDPG+I +TI+  K++ IRC
Sbjct: 112 GNCEGQMSLQISLDLAQQILSTTSNSSSREIIIVTSSITTCDPGNIYDTIETLKKTTIRC 171

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM--G 179
           SVI  + EM I + L + TGG Y   ++E H ++++     PP           I++  G
Sbjct: 172 SVISFAPEMHITRLLTKVTGGDYHTIMNEKHAEDVLHTFIIPPIYKENAYEPKTIQLYIG 231

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP++      +IC CH ++ +   + CP C    CELP  C+IC   L+ S H ARSYH 
Sbjct: 232 FPKQLNVP--TICECHSKIDIN-HFECPICGFCYCELPIQCKICNAILLLSHHFARSYHF 288

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP-----GLYVACPKCKKHFCL 294
           +FPI PF  +  +       + +  CFGC + +     K       +Y  C KC K++C 
Sbjct: 289 MFPIEPFKVIAMI-------KPQEKCFGCGKEIDDRIEKKEEETVNVY-QCKKCLKYYCD 340

Query: 295 ECDIYIHESLHNCPGCES 312
           ECD +IH+ L+NCPGCES
Sbjct: 341 ECDSFIHDVLYNCPGCES 358


>gi|405122345|gb|AFR97112.1| RNA polymerase II transcription factor [Cryptococcus neoformans
           var. grubii H99]
          Length = 425

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 115/303 (37%), Positives = 165/303 (54%), Gaps = 45/303 (14%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D+RP+R  V+   +  +V E+FDQNPL QIG++ ++            SPE  ++AL  K
Sbjct: 120 DYRPTRFEVILGYLRTYVVEWFDQNPLGQIGVIAMR------------SPEEIVRALSDK 167

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPG---DIMETIQKCKE 116
             L  SG+ SLQN L + +G ++ +PS    E+L+++SA+ST DP     I   +     
Sbjct: 168 RKLEPSGEPSLQNGLVMAKGGMAHLPSTSSLEILVIFSAISTADPDGPVTIHNVLDTLAT 227

Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIAN-- 174
             IR S++ LS E+ ICK + + TGG + VALD+ H K+L+ E  PPP        AN  
Sbjct: 228 GHIRTSILSLSGEIKICKQIAERTGGKFGVALDQEHLKDLLWETIPPPATTIAPVTANVR 287

Query: 175 ---------------------LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARV 213
                                L+ MGFP R   G  ++C+CH  ++ G GY CPRC +++
Sbjct: 288 SALAAGGRGPNQTGERAPAGDLMVMGFPIRLPLGGETMCACHGLLRKG-GYLCPRCGSKL 346

Query: 214 CELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLL 273
           C++PTDC +CGL +VSSPHLARS+  LFP+A +    PL + D    S  TCFGC     
Sbjct: 347 CDVPTDCEVCGLMVVSSPHLARSFWFLFPVANYG---PLAIEDVVESSE-TCFGCDSEFS 402

Query: 274 SSG 276
             G
Sbjct: 403 DVG 405


>gi|255634763|gb|ACU17743.1| unknown [Glycine max]
          Length = 214

 Score =  203 bits (516), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 93/115 (80%), Positives = 108/115 (93%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DFRPSRMAV+ KQVEAF+REFFDQNPLS +GLVT+KDG+A+C+T+LGGSPESHIKALMGK
Sbjct: 99  DFRPSRMAVMGKQVEAFIREFFDQNPLSHVGLVTIKDGIAHCITELGGSPESHIKALMGK 158

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKE 116
           L CSGD+SLQNAL+LV G L+QIPSYGHREVLILYSALSTCDPGD+METI + ++
Sbjct: 159 LECSGDASLQNALELVLGYLNQIPSYGHREVLILYSALSTCDPGDLMETIPEMQK 213


>gi|300123288|emb|CBK24561.2| unnamed protein product [Blastocystis hominis]
          Length = 626

 Score =  202 bits (513), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 121/320 (37%), Positives = 174/320 (54%), Gaps = 26/320 (8%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +PSR+ VV  Q++ F+R+FF  N LS + ++   D  A  LT L    + H+ AL  K
Sbjct: 5   DLKPSRLEVVVSQLKQFIRDFFYHNSLSSLCIIVSHDRRAFQLTPLSRHMKEHLNAL-DK 63

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           L CSG  SLQ +LD+    +   PS   +E++ L S++STCDP DI +TI   +E  I+C
Sbjct: 64  LTCSGKFSLQLSLDIAIKTMELAPSQFAKEIICLISSISTCDPRDIYKTISIIQEKHIQC 123

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPP----PAI-------AEF 170
           SV+ LS E+++ + L Q T G ++V  D         +H  PP    P +       A  
Sbjct: 124 SVMSLSGEVYLFQKLAQITSGIFAVPEDPQQVHHFFSQHISPPINSDPIVLVKREDGATV 183

Query: 171 AIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSS 230
               + +MGFP         IC+CH        Y CPRC++ VCELP  C +C L LV  
Sbjct: 184 THREICEMGFPNNVQLS--HICACHVRF-CSKFYECPRCRSYVCELPVRCPVCMLMLVDY 240

Query: 231 PHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKK 290
            HL +SYHHLFP   F+E      ND  ++   TC+GC  S++  G    + + CPKC K
Sbjct: 241 THLTQSYHHLFPPKNFEEC-----ND--HKPGITCYGC-CSVIKEGE---ICLQCPKCMK 289

Query: 291 HFCLECDIYIHESLHNCPGC 310
           +FC  CD ++H+++ +CPGC
Sbjct: 290 YFCQTCDAFVHDNIFHCPGC 309


>gi|401395984|ref|XP_003879726.1| putative general transcription factor IIH polypeptide 2 [Neospora
           caninum Liverpool]
 gi|325114133|emb|CBZ49691.1| putative general transcription factor IIH polypeptide 2 [Neospora
           caninum Liverpool]
          Length = 487

 Score =  201 bits (510), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 122/340 (35%), Positives = 173/340 (50%), Gaps = 45/340 (13%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVAN------CLTDLGGSPESHI 55
           DFRP R++ V +  E F+  F  QNPL+Q+  V +    A+      C    G SP    
Sbjct: 113 DFRPDRLSCVCQLAEEFIGTFLLQNPLAQLAQVALIGPSADPIVTSRCGATRGDSPSVPA 172

Query: 56  KALMGKLGC-------------------------SGDSSLQNALDLVQGLLSQIPSYGHR 90
               G +G                          +G  SL+N L L + LL+ +P Y  R
Sbjct: 173 ARSPGSVGTHLFSSNAEECIASLREKRKTTPGVGTGVPSLKNGLRLAKDLLACVPPYCTR 232

Query: 91  EVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDE 150
           EVLILY +L TCD G I ETI   KE  I C+VI L+AEM + K LCQ+TGG ++V L  
Sbjct: 233 EVLILYGSLRTCDVGCIEETIADVKEGNICCNVICLAAEMHVLKKLCQETGGRHAVPLHR 292

Query: 151 SHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCK 210
            H + L+++H   PPA +     +L++MGFP      + ++CSCH+++     Y CP+C 
Sbjct: 293 EHLRSLLVQHT-LPPAWSSSMQPSLVRMGFPSLKSTSTAALCSCHQQLTFS-AYVCPQCG 350

Query: 211 ARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQ 270
           A++C++P  CR C L LVS   ++RS+H L P  PFD V  +        ++  C  C  
Sbjct: 351 AKLCDIPNRCRCCFLHLVSPADISRSFHSLCPPLPFDPVADI------PEAKRVCACCTT 404

Query: 271 SLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
            L   G++      CP C + FC +CD+Y HE L  C  C
Sbjct: 405 RLERGGSQ------CPDCGEIFCHDCDVYTHEQLRQCAFC 438


>gi|324520669|gb|ADY47691.1| General transcription factor IIH subunit 2, partial [Ascaris suum]
          Length = 345

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 102/256 (39%), Positives = 148/256 (57%), Gaps = 6/256 (2%)

Query: 5   PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
           P+R+ V  K +  F+ +F +QNP+SQ+G++  +D  A  L  L G+  +  +AL  + + 
Sbjct: 87  PTRLNVTLKVLNQFLEKFSEQNPISQVGIIICRDKRAERLIQLTGTIRAVKEALSTINEA 146

Query: 63  GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
            C G+ SLQN L +    L   P +  RE++ + ++LSTCDP +I  T +  K + +RCS
Sbjct: 147 SCHGEFSLQNGLLVALRSLQHFPGHASREIIAIVASLSTCDPSNIFGTFEVLKRNNVRCS 206

Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
           VIGLSAE+F+ K LC  T G Y V LD  HF ELI+     PP       + +++MGFP 
Sbjct: 207 VIGLSAEVFVYKKLCSTTSGRYDVILDGDHF-ELILNEYTNPPVTKRNVESCVVRMGFPS 265

Query: 183 RAGEGSISICSCHKEVKV---GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
                  + C CH+       G G+ CP+C AR C LP +CR+C L L+S+P LARS+HH
Sbjct: 266 HQIIQKPAFCLCHQSENRPPGGRGFLCPQCGARYCSLPVECRVCKLMLISAPQLARSFHH 325

Query: 240 LFPIAPFDEVTPLCLN 255
           L P+  F EV   C+ 
Sbjct: 326 LLPLPAFKEVIDYCVK 341


>gi|312076202|ref|XP_003140756.1| general transcription factor IIH subunit 2 [Loa loa]
          Length = 1131

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 105/252 (41%), Positives = 149/252 (59%), Gaps = 19/252 (7%)

Query: 64  CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSV 123
           C G+ SL N+L      L   P Y  REV+++ ++LSTCDP +I  T +  K   IRCSV
Sbjct: 95  CHGEFSLHNSLMAAIKSLHSYPGYASREVILIVASLSTCDPSNIFGTFELLKRYHIRCSV 154

Query: 124 IGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQR 183
           I LSAE+FI + LC  T G ++V LD +HF+ ++ EHA PP + +  A +++++MGFP  
Sbjct: 155 ISLSAEVFIFRKLCSATSGCHNVILDSTHFEVILNEHANPPIS-SRNAESSVVRMGFPAH 213

Query: 184 AGEGSISICSCHK-EVKV--GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
               S S C CH+ E++   G G+ CP+C AR C LP +CRIC L L+S+P LARS H+L
Sbjct: 214 ESIDSPSFCLCHQSEIRPSGGRGFFCPQCGARYCSLPVECRICKLTLISAPQLARSLHNL 273

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
            P+  F+E+         + +   CF C + L            C  CK  FC++CD+ +
Sbjct: 274 LPLPAFEEI---------DTTERVCFACIRQLDDKS------FVCKNCKSTFCIDCDVLL 318

Query: 301 HESLHNCPGCES 312
           HESL  CPGC+S
Sbjct: 319 HESLQICPGCKS 330


>gi|324520938|gb|ADY47747.1| General transcription factor IIH subunit 2, partial [Ascaris suum]
          Length = 340

 Score =  197 bits (502), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 101/250 (40%), Positives = 146/250 (58%), Gaps = 6/250 (2%)

Query: 5   PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
           P+R+ V  K +  F+ +F +QNP+SQ+G++  +D  A  L  L G+  +  +AL  + + 
Sbjct: 87  PTRLNVTLKVLNQFLEKFSEQNPISQVGIIICRDKRAERLIQLTGTIRAVKEALSTINEA 146

Query: 63  GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
            C G+ SLQN L +    L   P +  RE++ + ++LSTCDP +I  T +  K + +RCS
Sbjct: 147 SCHGEFSLQNGLLVALRSLQHFPGHASREIIAIVASLSTCDPSNIFGTFEVLKRNNVRCS 206

Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
           VIGLSAE+F+ K LC  T G Y V LD  HF ELI+     PP       + +++MGFP 
Sbjct: 207 VIGLSAEVFVYKKLCSTTSGRYDVILDGDHF-ELILNEYTNPPVTKRNVESCVVRMGFPS 265

Query: 183 RAGEGSISICSCHKEVKV---GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
                  + C CH+       G G+ CP+C AR C LP +CR+C L L+S+P LARS+HH
Sbjct: 266 HQIIQKPAFCLCHQSENRPPGGRGFLCPQCGARYCSLPVECRVCKLMLISAPQLARSFHH 325

Query: 240 LFPIAPFDEV 249
           L P+  F EV
Sbjct: 326 LLPLPAFKEV 335


>gi|90082659|dbj|BAE90511.1| unnamed protein product [Macaca fascicularis]
          Length = 237

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 113/238 (47%), Positives = 155/238 (65%), Gaps = 22/238 (9%)

Query: 84  IPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS 143
           +P +  REVLI++S+L+TCDP +I + I+  K +KIR SVIGLSAE+ +C  L ++TGG+
Sbjct: 1   MPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGT 60

Query: 144 YSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRA--------GEGSISICSCH 195
           Y V LDESH+KEL+  H  PPPA +     +LI+MGFPQ           + S S+    
Sbjct: 61  YHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMGFPQHTIASLSDQDAKPSFSMAHLD 119

Query: 196 KEVKVGV---GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPL 252
              + G+   GY CP+C+A+ CELP +C+ICGL LVS+PHLARSYHHLFP+  F E+ PL
Sbjct: 120 GNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEI-PL 178

Query: 253 CLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
              +  N  R  C+GCQ  L        +YV C  C+  FC++CD+++H+SLH CPGC
Sbjct: 179 ---EEYNGER-FCYGCQGEL----KDQHVYV-CAVCQNVFCVDCDVFVHDSLHCCPGC 227


>gi|344303241|gb|EGW33515.1| hypothetical protein SPAPADRAFT_60859 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 346

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 126/351 (35%), Positives = 176/351 (50%), Gaps = 77/351 (21%)

Query: 36  VKDGVANCLTDLGGSPESHIKALM-------GKLGCSGDSSLQNALDLVQGLL------S 82
           +++GVAN ++++ GSP+ H+  +         K    GD SLQN+L++ + LL      +
Sbjct: 1   MRNGVANLVSEVSGSPQYHLDKIRQLKARQHNKFEPKGDPSLQNSLEMARSLLKYNFGSN 60

Query: 83  QIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTG- 141
              +   +E+L+++ AL T DPGDI +TI    +  I+  VIGLSA++ IC+ L   T  
Sbjct: 61  ANNTKNSKEILVIFGALFTSDPGDIHKTIGNLVKDDIKVKVIGLSAQVAICQELVNRTNR 120

Query: 142 -------GSYSVALDESHFKELIMEHAPPPPAI---AEFAIAN----LIKMGFP---QRA 184
                   +Y V ++ESHFKEL+M+   P P      EF  +N    +IKMGFP   Q  
Sbjct: 121 EEKNTSSKNYGVIMNESHFKELLMDCVTPLPLTEQEKEFEESNHGVPVIKMGFPTKIQPT 180

Query: 185 GEGSISICSCHKE------------------------------VKVGVGYTCPRCKARVC 214
           G  SI       E                                V VGY CP+CK++VC
Sbjct: 181 GNSSIGNTDFTIEFPQLNASFPTAGSDDSSDVVQINHNMGATTTGVAVGYQCPQCKSKVC 240

Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-- 272
            LPT C +CGL L+ S HLARSYHHL P+ P+ EV P+            CFGCQ     
Sbjct: 241 NLPTICPVCGLMLILSTHLARSYHHLVPLRPYKEV-PIS----STYESQFCFGCQLQFPK 295

Query: 273 ---LSSGNKPGL------YVACPKCKKHFCLECDIYIHESLHNCPGCESLR 314
               +S +K  L         C KC K FC+ CD+++HE LHNCPGCE+ +
Sbjct: 296 GVRTTSKSKGKLESMTSSRYRCMKCSKDFCINCDVFVHEVLHNCPGCENSK 346


>gi|170585402|ref|XP_001897473.1| TFIIH basal transcription factor complex p44 subunit [Brugia
           malayi]
 gi|158595152|gb|EDP33725.1| TFIIH basal transcription factor complex p44 subunit, putative
           [Brugia malayi]
          Length = 361

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 105/252 (41%), Positives = 154/252 (61%), Gaps = 6/252 (2%)

Query: 3   FRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MG 60
            +PSR+AV  K +  F+ +F +QNP+SQ+G+V  +D  A  L  L G+     ++L  + 
Sbjct: 81  IQPSRLAVTIKALNQFLDKFSEQNPISQVGVVVCRDKRAERLIPLTGNVRLVKESLSTIT 140

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           +L C G+ SL N+L      L   P +  REV+++ ++LSTCDP  I  T +  K   IR
Sbjct: 141 ELLCHGEFSLHNSLMATIRSLHSYPGHASREVILIVASLSTCDPSSIFGTFELLKRYHIR 200

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
           CSV+ LSAE+F+ K LC  T G + V LD SHF+ ++ EHA  PP  +  A +++++MGF
Sbjct: 201 CSVVSLSAEIFVFKKLCSITSGRHDVILDSSHFEVILSEHA-IPPVCSRNAESSVVRMGF 259

Query: 181 PQRAGEGSISICSCHK-EVKV--GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSY 237
           P        S C CH+ EV+   G G+ CP+C AR C LP +CRIC L L+S+P LARS 
Sbjct: 260 PAHESIDCPSFCLCHQSEVRPSGGRGFFCPQCGARYCSLPVECRICKLTLISAPQLARSL 319

Query: 238 HHLFPIAPFDEV 249
           ++L P+  F+E+
Sbjct: 320 YNLLPLPAFEEI 331


>gi|299751185|ref|XP_001830111.2| RNA polymerase II transcription factor [Coprinopsis cinerea
           okayama7#130]
 gi|298409259|gb|EAU91776.2| RNA polymerase II transcription factor [Coprinopsis cinerea
           okayama7#130]
          Length = 430

 Score =  190 bits (482), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 105/287 (36%), Positives = 163/287 (56%), Gaps = 49/287 (17%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLG-----GSPESHIK 56
           D RP+R  +  +    F+ E+FDQNPL QIG+V ++ G+A+           G+P+  ++
Sbjct: 123 DMRPTRFDLTLQYTREFIAEWFDQNPLGQIGIVGMRAGLASTFVKFTECRRIGNPQEVLR 182

Query: 57  ALMG--KLGCSGDSSLQNALDLVQGLL---------SQIPSYGHREVLILYSALSTCDPG 105
           A+    KL  +G+ SLQNA+++ +  +         S +P++  RE+LIL+ +L+TCDPG
Sbjct: 183 AIADRHKLEPTGEPSLQNAIEMARSNMNNIANESVSSHLPTHSSREILILFGSLTTCDPG 242

Query: 106 DIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTG------GSYSVALDESHFKELIME 159
           +I +T++ C ++KIR SV+ L+AEM IC+ LC  TG      G + VA++E H+K+L+ E
Sbjct: 243 NIHDTLETCIKNKIRISVVALAAEMKICRELCDKTGDYIPKEGQFGVAMNEGHYKDLLFE 302

Query: 160 HAPPPPAIA---------------EFAIANLIKMGFPQRAGEGSI-SICSCHKEVKVGVG 203
             PPP   A                   A+L+ MGFP R  E S  ++C CH E+K   G
Sbjct: 303 LVPPPAQKAAPRPAGPGSAPQPPSTNPAADLMMMGFPTRLPESSQPTLCVCHSELK-SQG 361

Query: 204 YTCPRCKARVCELPTDCRICGLQLV----------SSPHLARSYHHL 240
           + CPRC+++VC++PTDC +CGL +           S P L   + HL
Sbjct: 362 FLCPRCQSKVCDVPTDCDVCGLMITIIAGDNEHCSSMPRLLTPFPHL 408


>gi|156083725|ref|XP_001609346.1| transcriptional factor 2 subunit [Babesia bovis T2Bo]
 gi|154796597|gb|EDO05778.1| transcriptional factor 2 subunit [Babesia bovis]
          Length = 364

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 91/252 (36%), Positives = 146/252 (57%), Gaps = 4/252 (1%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           +DF+P R+   A  V+  ++  F Q P++Q+ ++T+++  +  ++ LG  P  HI  L  
Sbjct: 107 VDFKPDRLHCAAGAVKGLIQGLFHQGPMTQLAIITMRNKRSTMVSKLGTPPSEHIALLDE 166

Query: 61  KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
           K+  G  G  SLQN L++   +L+ +P Y  RE+L+++ A  T DPG+I+ T+QK K+  
Sbjct: 167 KIKEGTDGVPSLQNGLEMAISILTDMPPYTTREILVIFGATKTFDPGNILTTLQKLKDEH 226

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
           +  S + ++ EM+I KH+C +TGGSY+V  D  H K L+ +H   PP    +    L K+
Sbjct: 227 VSVSAVSIAPEMYILKHICTETGGSYAVCKDSVHLKSLLNDHT-MPPKWRPWMEPVLTKV 285

Query: 179 GFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
           GFP      + S+C CH  +     Y CP+C ++ C +PT C+ C L LVS P ++R +H
Sbjct: 286 GFPPLEKANTASLCICHSTLTYK-AYICPQCHSKSCAIPTKCKCCRLYLVSPPDISRMFH 344

Query: 239 HLFPIAPFDEVT 250
            L P  PF  V+
Sbjct: 345 QLIPPKPFINVS 356


>gi|403367585|gb|EJY83616.1| P44/SSL1-like protein [Oxytricha trifallax]
          Length = 499

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 95/254 (37%), Positives = 149/254 (58%), Gaps = 6/254 (2%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           + RP+R  V+   +  FV ++FDQNPLS +  V      A  L+D   SP+ HI+ +   
Sbjct: 73  EMRPNRAIVIKSYLSNFVIDYFDQNPLSCLSFVATFRERAIILSDFNDSPQDHIEKICRF 132

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
               G++S+QN+L+L     + +PSY  +E+LI++ +++ CDPG+I ET  K K   I C
Sbjct: 133 SDLEGNASIQNSLELCIENFTAVPSYAKKEILIIFCSITNCDPGNIFETYDKLKSKMITC 192

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFA-----IANLI 176
           SVI LSA MFI + + Q TGG + ++ D++HF++L+     P  +I +          LI
Sbjct: 193 SVISLSASMFILQKVTQLTGGEFFLSKDQAHFQDLLKRLLVPKESIEDLTQRKQQQNTLI 252

Query: 177 KMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
           KMGFP R      ++CSCH +++  V Y CP C++  CEL + C++C + LVS+ HL+R+
Sbjct: 253 KMGFPMRKILSYPAVCSCHNDLRFLV-YICPNCQSPSCELSSFCKVCQIMLVSAAHLSRT 311

Query: 237 YHHLFPIAPFDEVT 250
             H   +A F  +T
Sbjct: 312 AQHNQNLANFQRLT 325



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 43/77 (55%), Gaps = 4/77 (5%)

Query: 238 HHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECD 297
           + +FP+     V  +   D  +     C GC + + +S +   + V C  CKK +CL+CD
Sbjct: 427 NEIFPLNSMKTVGSILQKDGHSY---YCNGCSKRVEASFDNVDI-VICTICKKFYCLDCD 482

Query: 298 IYIHESLHNCPGCESLR 314
           I+IHE+L +CP C+ ++
Sbjct: 483 IFIHETLLSCPTCQLIQ 499


>gi|71027729|ref|XP_763508.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68350461|gb|EAN31225.1| hypothetical protein, conserved [Theileria parva]
          Length = 447

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 110/357 (30%), Positives = 170/357 (47%), Gaps = 66/357 (18%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MDF+P R+      ++ F+ E +   P++Q+G+V +++ + N +T  G +P   ++ L  
Sbjct: 106 MDFKPDRLYCAFNSLKEFLGELYSSGPITQVGIVVMRNKICNVITQFGTNPNEQMELLSS 165

Query: 61  KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
            L  G  G SSLQN L++   ++  +P Y  RE+L+++ +  T DPG+I+ T+ + K++ 
Sbjct: 166 VLKDGPEGSSSLQNGLEMCMKIMCDLPYYSTREILVIFGSNRTLDPGNILLTLDQLKQNF 225

Query: 119 IRCSVIGLSAEMFICK-------------------------------------------- 134
           I  + I LS E++I K                                            
Sbjct: 226 ITVNSISLSPELYILKVALLRYGVRSNGTSYLLVILPQSYPPQWDSLWPLWLLVPHIYTS 285

Query: 135 -HLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICS 193
            ++C++TGG+YSVA D +H K L      PPP    +    LIK+ FP      S+S+C 
Sbjct: 286 YNICKETGGNYSVARDVNHLKLLFNNLTIPPPW-KSWMEPILIKVSFPPMKRGTSVSLCC 344

Query: 194 CHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLC 253
           CH +V V   Y CP+C +  C +PT C+ CG+ +VS P ++R++HHL P  PF  V    
Sbjct: 345 CHNKV-VNTVYICPQCHSNSCYIPTKCQGCGIFMVSPPDISRAFHHLIPPKPFHHV---- 399

Query: 254 LNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
                      C GC   + S          C  C   FC  CD YIH+ LH CP C
Sbjct: 400 ------EGGMDCVGCNLRVESG-------YECQDCGGLFCQHCDKYIHQDLHQCPKC 443


>gi|237841927|ref|XP_002370261.1| general transcription factor IIH polypeptide 2, putative
           [Toxoplasma gondii ME49]
 gi|211967925|gb|EEB03121.1| general transcription factor IIH polypeptide 2, putative
           [Toxoplasma gondii ME49]
 gi|221503074|gb|EEE28780.1| btf, putative [Toxoplasma gondii VEG]
          Length = 627

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 120/336 (35%), Positives = 163/336 (48%), Gaps = 40/336 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DFRP R+  V +  E F+  F  QNPL+Q+  V +    A  L   G +           
Sbjct: 261 DFRPDRLRCVCQLAEEFIGTFLLQNPLAQLAQVALIGPSAEPLAARGEAARDSAAVSARP 320

Query: 62  LGC---------------------------SGDSSLQNALDLVQGLLSQIPSYGHREVLI 94
            G                            +G  SL N L L + LL+ +P Y  REVL+
Sbjct: 321 SGSASMQLFSSSASESIAALREKRKHPGVGTGVPSLNNGLHLAKDLLAGVPPYCTREVLV 380

Query: 95  LYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFK 154
           L+ +L TCD G I ETI   K+S I C+VI L+AE+ + K+LCQ TGG + V L   H +
Sbjct: 381 LFGSLRTCDVGCIEETIAAVKKSNICCNVICLAAELHVLKNLCQATGGRHDVPLHREHLR 440

Query: 155 ELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
            L+M+H  PP   A      LI+MGFP      + ++CSCH+ +     Y CP+C A++C
Sbjct: 441 ALLMQHTLPPAWSASMQPC-LIRMGFPSLKSTSTAALCSCHQHLTFS-SYVCPQCGAKLC 498

Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLS 274
             P  CR C L LVS   ++RS+H L P  PFD   P+  + P   ++  C  C   L  
Sbjct: 499 TTPNRCRCCFLHLVSPADISRSFHSLCPPLPFD---PVPADAP--EAKRLCACCTTQLDR 553

Query: 275 SGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
            G +      CP C + FC +CDIY HE L  C  C
Sbjct: 554 GGAQ------CPDCGEIFCHDCDIYSHEQLRQCAFC 583


>gi|221482733|gb|EEE21071.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 627

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 120/336 (35%), Positives = 163/336 (48%), Gaps = 40/336 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DFRP R+  V +  E F+  F  QNPL+Q+  V +    A  L   G +           
Sbjct: 261 DFRPDRLRCVCQLAEEFIGTFLLQNPLAQLAQVALIGPSAEPLAARGEAARDSAAVSARP 320

Query: 62  LGC---------------------------SGDSSLQNALDLVQGLLSQIPSYGHREVLI 94
            G                            +G  SL N L L + LL+ +P Y  REVL+
Sbjct: 321 SGSASMQLFSSSASESIAALREKRKHPGVGTGVPSLNNGLHLAKDLLAGVPPYCTREVLV 380

Query: 95  LYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFK 154
           L+ +L TCD G I ETI   K+S I C+VI L+AE+ + K+LCQ TGG + V L   H +
Sbjct: 381 LFGSLRTCDVGCIEETIAAVKKSNICCNVICLAAELHVLKNLCQATGGRHDVPLHREHLR 440

Query: 155 ELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
            L+M+H  PP   A      LI+MGFP      + ++CSCH+ +     Y CP+C A++C
Sbjct: 441 ALLMQHTLPPAWSASMQPC-LIRMGFPSLKSTSTAALCSCHQHLTFS-SYVCPQCGAKLC 498

Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLS 274
             P  CR C L LVS   ++RS+H L P  PFD   P+  + P   ++  C  C   L  
Sbjct: 499 TTPNRCRCCFLHLVSPADISRSFHSLCPPLPFD---PVPADAP--EAKRLCACCTTQLDR 553

Query: 275 SGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
            G +      CP C + FC +CDIY HE L  C  C
Sbjct: 554 GGAQ------CPDCGEIFCHDCDIYSHEQLRQCAFC 583


>gi|348569006|ref|XP_003470289.1| PREDICTED: hypothetical protein LOC100720431 [Cavia porcellus]
          Length = 637

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 119/312 (38%), Positives = 170/312 (54%), Gaps = 49/312 (15%)

Query: 12  AKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG--KLGCSGDSS 69
           A+ +E FV E+FDQNP+SQ+ ++  K   A  LT+L G P+ HI +L     + C G+ S
Sbjct: 352 AQLLEYFVEEYFDQNPISQVCIIVTKSKRAEKLTELSGMPQKHITSLKKAIDMTCHGEPS 411

Query: 70  LQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAE 129
           L N+L +    L+                       D+M+T+   K +KIR SVIGL AE
Sbjct: 412 LYNSLSMAMQTLNL----------------------DLMKTL---KTAKIRVSVIGLWAE 446

Query: 130 MFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRA----- 184
           + +C  L ++TGG+Y   LDESH+KEL+  H  PP   +  +  +LI MGFPQ       
Sbjct: 447 VRVCAALARETGGTYHEILDESHYKELLTHHVSPP-PASSSSECSLIHMGFPQHTIAALA 505

Query: 185 ---GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
               + S S+       + G+   GY  P+C A+ CELP +C+ICGL LVS+PHL  SYH
Sbjct: 506 DQDAKPSFSMAHLDSSSEPGLALGGYFYPQCWAKYCELPVECKICGLTLVSAPHLVWSYH 565

Query: 239 HLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
           HLFP+  F E+ PL       +    C+GCQ  L        +YV C   +  FC++CD+
Sbjct: 566 HLFPLDAFQEI-PL----EEYKGERFCYGCQGQL----KDQHVYV-CTVFQNVFCVDCDV 615

Query: 299 YIHESLHNCPGC 310
           ++H+S+H C GC
Sbjct: 616 FVHDSVHCCLGC 627


>gi|239793430|dbj|BAH72833.1| ACYPI003683 [Acyrthosiphon pisum]
          Length = 245

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 101/250 (40%), Positives = 154/250 (61%), Gaps = 14/250 (5%)

Query: 69  SLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSA 128
           S+QN+L++   ++  +PS+  +E+LI+ S+LS+CDPG+I  +I+  K   IR S+I L+A
Sbjct: 2   SVQNSLEIALSVMKMLPSHTSKEILIIGSSLSSCDPGEINTSIEMLKTHNIRVSMIHLAA 61

Query: 129 EMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEG- 187
           E+ + +HLC +T G ++V +D+ HFK ++     P P +     A+ +KMGFPQ   +  
Sbjct: 62  EVRMFRHLCNETKGKHNVIVDDVHFKHILWSLVEPVP-LPNSVDASCVKMGFPQELEQKP 120

Query: 188 SISICSCH--KEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIA 244
             + CSCH  +  K+   G+ CP+C ++ CELP +C+ CGL LVSS HLARS HHL PI 
Sbjct: 121 PFTTCSCHLAEGGKLNAKGFFCPQCNSKYCELPVECKCCGLILVSSLHLARSLHHLVPIK 180

Query: 245 PFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESL 304
           PF ++      +    S + C+GC++ +        +Y  C  CKKH+C  CDIY+H +L
Sbjct: 181 PFIKI------ELEEGSSAYCYGCRKRIKVPAE--NVYF-CESCKKHYCDGCDIYVHNTL 231

Query: 305 HNCPGCESLR 314
           H CPGC   R
Sbjct: 232 HVCPGCAVKR 241


>gi|118377110|ref|XP_001021737.1| transcription factor ssl1 family protein [Tetrahymena thermophila]
 gi|89303503|gb|EAS01491.1| transcription factor ssl1 family protein [Tetrahymena thermophila
           SB210]
          Length = 372

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 105/317 (33%), Positives = 167/317 (52%), Gaps = 14/317 (4%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           +DF+PSR+ +    ++ F++ FFD NPLS + +   ++G    L +   S  S +   + 
Sbjct: 63  VDFKPSRIGLTQIYLKNFIKNFFDSNPLSVLSIGKTQNGQCITLQEFTSSI-SLLDKTIE 121

Query: 61  KLGCSG--DSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
           KL      D SL   L+    + ++ P Y  REV+++ S  ST D GDI E I K K+ K
Sbjct: 122 KLVVDEKEDFSLFECLNETFNIFAEAPPYAFREVVLIQSTPSTKDKGDINEQINKAKQCK 181

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
           I  ++I L+A+ FI + L ++T G Y  AL E  ++ L+  +  PP  I       LI++
Sbjct: 182 ITFNIISLNAKTFIFQKLTKETQGKYDTALHEKDYENLLNSYCEPPQQILNKTEKTLIQV 241

Query: 179 GFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
           GFP +       +C+CH E+K    Y CP+CK++ CE+P+ C  C L L+   H+ +SYH
Sbjct: 242 GFPNKKYSKISVLCACHMELKTNF-YICPQCKSKNCEVPSKCSGCQLNLILPLHITKSYH 300

Query: 239 HLFPIAPFD----EVTPLCLNDPRNRSRST-CFGCQQSLLSSGNKPGLYVACPKCKKHFC 293
               +  F+    E +   +   ++++++  C GC + +     K   +  C  CK  FC
Sbjct: 301 FSQNLKEFELLGQEKSNNSMETEQDKTQNILCNGCSEDI-----KEDYFSKCQNCKSIFC 355

Query: 294 LECDIYIHESLHNCPGC 310
           LECDI IH+   NCP C
Sbjct: 356 LECDILIHKQKLNCPKC 372


>gi|170035934|ref|XP_001845821.1| btf [Culex quinquefasciatus]
 gi|167878420|gb|EDS41803.1| btf [Culex quinquefasciatus]
          Length = 367

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 95/220 (43%), Positives = 137/220 (62%), Gaps = 13/220 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    K +E F+ EFFDQNP+SQ+G++ +K   A  +T+LGGS   HIKA+  +
Sbjct: 76  DLKPTRLLCTLKLLEIFIEEFFDQNPISQLGVIAMKAKRAEKITELGGSCRKHIKAVGNL 135

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            KL  +G+ SLQN L+L    L  +PS+  RE+L++  +L+TCDP DI  TI   K   I
Sbjct: 136 NKLNLTGEPSLQNGLELALKTLKMVPSHASREILVVMGSLTTCDPTDIHITIDALKSEGI 195

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RCSV+ LSAE+ +CK LC +TGG Y   LD+SHFK+ +++H  PP A  +   + +IKMG
Sbjct: 196 RCSVVSLSAEIRVCKFLCTETGGVYGAVLDDSHFKDQLLQHIDPPQAGNQQEFS-MIKMG 254

Query: 180 FPQ-RAGEGS---ISICSCHKEVK------VGVGYTCPRC 209
           FP  +  EG    +++C CH +           GY CP+C
Sbjct: 255 FPHGKTEEGKDPPLTMCMCHIDSTDEPSKLTSGGYHCPQC 294


>gi|349604001|gb|AEP99672.1| General transcription factor IIH subunit 2-like protein, partial
           [Equus caballus]
          Length = 274

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 89/183 (48%), Positives = 122/183 (66%), Gaps = 3/183 (1%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIITKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 62  LG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
           +   C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP DI   IQ  K +KI
Sbjct: 135 VAMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSDIYGLIQTLKAAKI 194

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           R SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+  H  PPPA +  +  +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSN-SECSLIRMG 253

Query: 180 FPQ 182
           FPQ
Sbjct: 254 FPQ 256


>gi|322801575|gb|EFZ22231.1| hypothetical protein SINV_15041 [Solenopsis invicta]
          Length = 202

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 95/199 (47%), Positives = 124/199 (62%), Gaps = 22/199 (11%)

Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
           VIGL+AE++ICK +   TGG +SVALD+ H+KE +  H  PPPA      A L+KMGFP 
Sbjct: 1   VIGLAAELYICKRMANLTGGEHSVALDDKHYKEQLNAHIDPPPAATRLDAA-LVKMGFPH 59

Query: 183 RA-----GEGSISICSCHKE-----VKV-GVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
            A      + S+++C CH +     VK+   GY CP+C ++ CELP +CR CGL LVS+P
Sbjct: 60  HALHSSAPDTSMTVCMCHAQNSDEPVKLMTTGYLCPQCLSKHCELPVECRACGLTLVSAP 119

Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKH 291
           HLARSYH+LFP+ PF E+             S CFGCQ++L     K      C KC + 
Sbjct: 120 HLARSYHYLFPVDPFKEIA-------SESDHSFCFGCQKALAQKDKK---VYTCGKCNQT 169

Query: 292 FCLECDIYIHESLHNCPGC 310
           FCL+C+I+IHE LH CPGC
Sbjct: 170 FCLDCEIFIHEILHTCPGC 188


>gi|323454955|gb|EGB10824.1| hypothetical protein AURANDRAFT_58771 [Aureococcus anophagefferens]
          Length = 361

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 115/317 (36%), Positives = 151/317 (47%), Gaps = 33/317 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RPSR+A        F+ + F  NP S   +V  +DG A   + L G+  +H  AL  +
Sbjct: 69  DLRPSRLAACCAAGAEFLADLFAGNPFSSACVVDARDGAAKLASPLSGAARAHADALAAV 128

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
           G+   +G  SLQ AL+     L   P YG REV++L S+L+T D GD+     +   +KI
Sbjct: 129 GRAAPAGAFSLQRALEAAAAALEAAPDYGFREVVVLSSSLATRDAGDLNAVAARLAAAKI 188

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           RC V+ L+AE ++ + L   TGG ++VALD  H   L+     PPP  A+ A        
Sbjct: 189 RCHVVNLAAETYVVRKLADATGGEFAVALDAGHLAALLGNRVEPPPLAADDA-------- 240

Query: 180 FPQRAGEGSISICSCHKEVKVG------VGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
             + AGE              G        + CPRC  R  ELP  C +C L LVS+PHL
Sbjct: 241 --RLAGEPPPLPAPALASSAGGAPAWRATTHACPRCATRAAELPAACAVCDLPLVSAPHL 298

Query: 234 ARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFC 293
           ARSYHHLFP+A F           R      CF C   L            CP C   FC
Sbjct: 299 ARSYHHLFPVAAFPG---------RVAEDGACFACGGGLAPRAFD------CPACGAAFC 343

Query: 294 LECDIYIHESLHNCPGC 310
            +CD  +H +LH CPGC
Sbjct: 344 GDCDEAVHAALHVCPGC 360


>gi|348557313|ref|XP_003464464.1| PREDICTED: general transcription factor IIH subunit 2-like [Cavia
           porcellus]
          Length = 302

 Score =  177 bits (448), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 117/309 (37%), Positives = 168/309 (54%), Gaps = 52/309 (16%)

Query: 15  VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKLGCSGDSSLQN 72
           +E FV E+FDQN +SQIG++  K   A  LT+L G+P  HI +L     + C G+ SL N
Sbjct: 23  LEYFVEEYFDQNSISQIGIIVTKR--AEKLTELSGNPRKHITSLKKATDMTCHGEPSLYN 80

Query: 73  ALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFI 132
           +L +                     A+ T +  D+M+T++  K       VIG+SAE+ +
Sbjct: 81  SLSM---------------------AMQTLNFLDLMKTLKTAK-----MCVIGMSAEVRV 114

Query: 133 CKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRA-------- 184
           C  L ++TG +Y   LDESH+KEL+  H  PPPA +     +LI MGFPQ          
Sbjct: 115 CTGLARETGSTYHEILDESHYKELLTHHVSPPPASSSSE-CSLIHMGFPQHTIAALADQD 173

Query: 185 GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
            + S S+       + G+   GY  P+C A+ CELP +C++CGL LVS+PHLA  YHHLF
Sbjct: 174 AKPSFSMAHLDSSSEPGLALGGYFYPQCWAKYCELPVECKMCGLTLVSAPHLAWLYHHLF 233

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
           P+  F E+ PL       +    C+GCQ  L        +YV C   +  FC++CD+++H
Sbjct: 234 PLDAFQEI-PL----EEYKGERFCYGCQGQL----KDQHVYV-CIVFQNVFCVDCDVFVH 283

Query: 302 ESLHNCPGC 310
           +S+H CPGC
Sbjct: 284 DSVHCCPGC 292


>gi|389586086|dbj|GAB68815.1| transcription factor [Plasmodium cynomolgi strain B]
          Length = 401

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 96/312 (30%), Positives = 165/312 (52%), Gaps = 18/312 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF+P R+ VV + VE+F+  FF +NP+  +G+V +K+  A  +  L  + +  +KAL+ +
Sbjct: 100 DFKPDRINVVLESVESFLTNFFFKNPVGHVGVVALKNSSAKLIQPLTSNMDDIMKALLKE 159

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             +G  G  SLQ  L++   LL  IP YG +E+LI+Y ++ TCD  +I+  +    ++ +
Sbjct: 160 RSMGLQGSPSLQQGLEIAHDLLIDIPLYGTKEILIIYGSIRTCDKKNILNILNLIVKNNM 219

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
             + + ++ EM I KH+C++T G+Y + + ++      M +    P         LI + 
Sbjct: 220 HVNCVSIAPEMHILKHICEETNGAYKICMTKNALLNE-MNNVTETPLWMSGMEPQLIHIC 278

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP +    +  +CSCH ++     Y C  C +  C++P+ C++CG+ L+S   L+   ++
Sbjct: 279 FPVKKKISTQIMCSCHNQLNTDT-YICNFCNSYTCKIPSKCKVCGMHLISMHDLSHITNN 337

Query: 240 LFPIAPFDEVTPLCLNDPRNRSR-STCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
           L       + +PL L     +   S C  C + L    ++      C KCK  FCL CD+
Sbjct: 338 L-------QGSPLFLEIKNEKDGPSVCVSCNKRLYDKVSQ------CSKCKNIFCLGCDL 384

Query: 299 YIHESLHNCPGC 310
           YIHE L+ CP C
Sbjct: 385 YIHEDLNQCPFC 396


>gi|294911611|ref|XP_002778020.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239886141|gb|EER09815.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 421

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 106/328 (32%), Positives = 178/328 (54%), Gaps = 32/328 (9%)

Query: 1   MDFRPSRMAVVAKQV-EAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM 59
           MD++ +R+  V   +   F+ ++FD NP+S + ++ ++D  A+ +T + G P S ++ ++
Sbjct: 107 MDYKQNRLDFVVSHLCRNFIPQYFDLNPISNLSVLALRDQRAHFVTRMSGQPASQMERVL 166

Query: 60  --GKLGCSGDSSLQNALDLV-QGLLSQIPSYGHREVLILYSALSTCDP---GDIM----- 108
                G SG +SL NAL+ V Q     +P YG REVLI++ +L + DP   G ++     
Sbjct: 167 QFSTGGASGSASLVNALEAVGQAAKGSLPRYGTREVLIVWGSLHSADPPTSGKLIPDCVD 226

Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIA 168
             I++ K + +R SVI LS E++  + + + TGG++SVA    HFK L+ +H  PP  ++
Sbjct: 227 RLIEEIKPNSVRVSVISLSPELYALRKVVEMTGGTFSVATSPIHFKRLMQKHLTPPNWVS 286

Query: 169 EFAIANLIKMGFP-QRAGEGSISI-----CSCHKEVKVGVGYTCPRCKARVCELPTDCRI 222
                + IKMGFP +RA +G+ +      C CH  ++    Y CP+C + VCE+P +C +
Sbjct: 287 S---PSYIKMGFPVRRACDGNHTADPPIKCMCHNRLQKTFVYICPQCHSPVCEIPVNCPV 343

Query: 223 CGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLY 282
           C L LV    L + + H++ + P   + P  ++ P++    TC  C       G +    
Sbjct: 344 CRLPLVDDDALKKHHRHIYSM-PTYTLLP-TVDYPKS---YTCQFCGTDFTEGGAR---- 394

Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGC 310
             C +C    C ECD++ H  L +CPGC
Sbjct: 395 --CDQCLSDVCYECDMFAHNKLRHCPGC 420


>gi|82753444|ref|XP_727680.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23483641|gb|EAA19245.1| W43325 comes from this gene., putative [Plasmodium yoelii yoelii]
          Length = 401

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 96/312 (30%), Positives = 166/312 (53%), Gaps = 18/312 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           DF+P+R+ V+ + VE F++ FF +NP+  +G+V +K+  A  +  L  + +  + +L+  
Sbjct: 100 DFKPNRITVILECVEIFLKNFFFKNPVGHVGVVALKNSSAKLIQQLTSNIDDVLNSLVKE 159

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K G +G  SLQ  L++   LL  +P YG +E+LI+Y ++ TCD  +I++ +    ++ +
Sbjct: 160 QKEGLNGSPSLQEGLEIAHNLLMDMPLYGTKEILIMYGSIRTCDKKNILKYLDLLIKNNM 219

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
             + I ++ EM I KH+C+ T G Y +   ++     I + A  P  +       LI + 
Sbjct: 220 YVNCISIAPEMHILKHICEKTNGIYKICTSKNILINEINQVAETPLWMHGME-PQLIHIC 278

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP +    +  +CSCH  +     Y C  C +  C++P+ C+ICG+ L+S   L+   ++
Sbjct: 279 FPVKKKINTQIVCSCHNILNTDT-YICNFCNSYTCKIPSKCKICGIHLISMHDLSHITNN 337

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRST-CFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
           L       + +PL +     ++  T C  C Q L    ++      C KCK  FCLECD+
Sbjct: 338 L-------QASPLFVEIKNEQNYYTHCSSCNQQLYDKVSQ------CTKCKNIFCLECDV 384

Query: 299 YIHESLHNCPGC 310
           +IHE L+ CP C
Sbjct: 385 FIHEDLNQCPFC 396


>gi|68068799|ref|XP_676310.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56495950|emb|CAH97731.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 401

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 97/311 (31%), Positives = 164/311 (52%), Gaps = 16/311 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           DF+P+R+ V+ + VE F++ FF +NP+  +G+V +K+  A  +  L  + +  + +L+  
Sbjct: 100 DFKPNRITVILECVEIFLKNFFFKNPVGHVGVVALKNSSAKLIQQLTSNIDDVLNSLVKE 159

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K G  G  SLQ  L++   LL  +P YG +E+LI+Y ++ TCD  +I++ +    ++ +
Sbjct: 160 QKEGLQGSPSLQEGLEIAHNLLMDMPLYGTKEILIMYGSIRTCDKKNILKYLDLLIKNNM 219

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
             + I ++ EM I KH+C+ T G Y +  +++     I + A  P  +       LI + 
Sbjct: 220 YVNCISIAPEMHILKHICEKTNGIYKICTNKNILINEINQVAETPLWMHGME-PQLIHIC 278

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP +    +  +CSCH  +     Y C  C +  C++P+ C+ICG+ L+S   L+   + 
Sbjct: 279 FPVKKKINTQIVCSCHNILNTDT-YICNFCNSYTCKIPSKCKICGIHLISMHDLSHITNS 337

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
           L     F E+     N+  N +   C  C Q L    ++      C KCK  FCLECD++
Sbjct: 338 LQASPLFVEIR----NEQNNHTH--CASCNQLLYDKVSQ------CTKCKNIFCLECDVF 385

Query: 300 IHESLHNCPGC 310
           IHE L+ CP C
Sbjct: 386 IHEDLNQCPFC 396


>gi|225554656|gb|EEH02952.1| transcription factor TFIIH complex subunit Ssl1 [Ajellomyces
           capsulatus G186AR]
          Length = 467

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 110/274 (40%), Positives = 149/274 (54%), Gaps = 47/274 (17%)

Query: 82  SQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTG 141
           S  P++G REVLI++ AL + DPGDI +TI      KIR S+IGL+A++ IC+ LC  T 
Sbjct: 178 SHTPTHGTREVLIIFGALLSSDPGDIHQTITALVADKIRISIIGLAAQVAICRDLCARTN 237

Query: 142 GS----YSVALDESHFKELIME-HAPPPPAIAEFAI------------------ANLIKM 178
                 Y VAL+E HF+EL M+  APP   +A  +                   ++L+ M
Sbjct: 238 NGDDTVYGVALNEQHFRELFMDVTAPPVTTVAPTSTPTASSANGPKTTTTNTTASSLLMM 297

Query: 179 GFPQRAGEGSISI--CSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
           GFP      + +   C+CH +     GY C RC A+VC LP+ C  CGL L+ S HLARS
Sbjct: 298 GFPSLTLTTTPTPSLCACHSKPSRA-GYLCCRCNAKVCTLPSSCPCCGLTLILSTHLARS 356

Query: 237 YHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-----QQSLLSSGNKP---------GLY 282
           YHHLFP+  + EV+       R RS S CF C     +   L SG            G+ 
Sbjct: 357 YHHLFPLMNWVEVS--WRRAARKRSAS-CFACGVGFPRMPKLVSGEPEETAKAAVGVGVS 413

Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           V+    CP C+ HFC++CD++ HE +HNCPGC+S
Sbjct: 414 VSGRYECPVCECHFCIDCDVFAHEVVHNCPGCQS 447


>gi|156102767|ref|XP_001617076.1| transcription factor [Plasmodium vivax Sal-1]
 gi|148805950|gb|EDL47349.1| transcription factor, putative [Plasmodium vivax]
          Length = 401

 Score =  171 bits (432), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 162/313 (51%), Gaps = 20/313 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF+P R+ VV + VE F+  FF +NP+  +G+V +K+  A  +  L  + E    AL+ +
Sbjct: 100 DFKPDRINVVLECVENFLTHFFFKNPVGHVGVVALKNSSAKLIQPLTSNMEDITNALLKE 159

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             +G  G  SLQ  L++   LL  IP YG +E+LI+Y ++ TCD  +I+  +    ++ +
Sbjct: 160 RSMGLQGSPSLQQGLEIAHDLLIDIPLYGTKEILIMYGSIRTCDKKNILNILNLIVKNNM 219

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
             + + ++ EM I KH+C+ T GSY + + ++      M +    P         LI + 
Sbjct: 220 HVNCVSIAPEMHILKHICEQTNGSYKICMTKNSLMNE-MNNITETPLWMMGMEPQLIHIC 278

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP +    +  +CSCH  +     Y C  C +  C++P+ C++CG+ L+S   L+   ++
Sbjct: 279 FPIKKKISTQIMCSCHNNLNTDT-YICNFCNSYTCKIPSKCKVCGMHLISMHDLSHITNN 337

Query: 240 LFPIAPFDEVTPLCLNDPRNRSR--STCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECD 297
           L       + +PL L + +N  +  S C  C + L    ++      C KC   FCL CD
Sbjct: 338 L-------QGSPLFL-EIKNEEKGPSVCVSCNKRLYDKVSQ------CSKCGNLFCLACD 383

Query: 298 IYIHESLHNCPGC 310
           +YIHE L+ CP C
Sbjct: 384 LYIHEDLNQCPFC 396


>gi|189240734|ref|XP_967768.2| PREDICTED: similar to btf [Tribolium castaneum]
          Length = 236

 Score =  170 bits (430), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 81/177 (45%), Positives = 122/177 (68%), Gaps = 4/177 (2%)

Query: 15  VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKLGCSGDSSLQN 72
           +E F+ EFFDQNP+SQ+G++ + +  A  ++DL G+   HIKA+  + +    G+ SLQN
Sbjct: 52  LENFIEEFFDQNPISQLGIIIMHNKRAEKISDLAGNARKHIKAVQNLKQTSLVGEPSLQN 111

Query: 73  ALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFI 132
           ++++    L  +PS+  RE+L++  +L+TCDPGDI  TIQ  K   +RCSVIGL+AE+++
Sbjct: 112 SIEMALRALKLLPSHASREILVIMGSLTTCDPGDITTTIQMLKTEGVRCSVIGLAAEVYV 171

Query: 133 CKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP-QRAGEGS 188
           C+ L   TGG Y+V LD+ HFK+L+ +   PPPA A+   +NLIKMGFP Q + EG+
Sbjct: 172 CRQLANQTGGLYNVILDDCHFKDLLYQQVDPPPA-AQTLESNLIKMGFPHQMSVEGT 227


>gi|270012914|gb|EFA09362.1| hypothetical protein TcasGA2_TC001923 [Tribolium castaneum]
          Length = 191

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 81/177 (45%), Positives = 122/177 (68%), Gaps = 4/177 (2%)

Query: 15  VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKLGCSGDSSLQN 72
           +E F+ EFFDQNP+SQ+G++ + +  A  ++DL G+   HIKA+  + +    G+ SLQN
Sbjct: 7   LENFIEEFFDQNPISQLGIIIMHNKRAEKISDLAGNARKHIKAVQNLKQTSLVGEPSLQN 66

Query: 73  ALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFI 132
           ++++    L  +PS+  RE+L++  +L+TCDPGDI  TIQ  K   +RCSVIGL+AE+++
Sbjct: 67  SIEMALRALKLLPSHASREILVIMGSLTTCDPGDITTTIQMLKTEGVRCSVIGLAAEVYV 126

Query: 133 CKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP-QRAGEGS 188
           C+ L   TGG Y+V LD+ HFK+L+ +   PPPA A+   +NLIKMGFP Q + EG+
Sbjct: 127 CRQLANQTGGLYNVILDDCHFKDLLYQQVDPPPA-AQTLESNLIKMGFPHQMSVEGT 182


>gi|221060951|ref|XP_002262045.1| ssl1-like protein [Plasmodium knowlesi strain H]
 gi|193811195|emb|CAQ41923.1| ssl1-like protein, putative [Plasmodium knowlesi strain H]
          Length = 401

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 95/311 (30%), Positives = 158/311 (50%), Gaps = 16/311 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF+P R+ VV + VE F+  FF +NP+  +G+V +K+  A  +  L  + E  + AL+ +
Sbjct: 100 DFKPDRINVVLECVENFLTNFFFKNPVGHVGVVALKNSSAKLIQPLTSNMEDIMNALVKE 159

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             +G  G  SL+  L++   LL  IP YG +E+LI+Y ++ TCD  +I+  +    ++ +
Sbjct: 160 RSMGLQGSPSLEQGLEIAHDLLVDIPLYGTKEILIMYGSIRTCDKKNILNILNLIVKNNM 219

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
             + + +S EM I KH+C++T G Y + + ++      M +    P         LI + 
Sbjct: 220 HVNCVSISPEMHILKHICEETNGVYKICMTKNALMNE-MNNVTETPLWMIGMEPQLIHIC 278

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP +    +  +CSCH  +     Y C  C +  C++P+ C++CG+ L+S   L+   ++
Sbjct: 279 FPIKKKISTQIMCSCHNHLNTDT-YICNFCNSYTCKIPSKCKVCGMHLISMHDLSHITNN 337

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
           L     F E+             S C  C + L    ++      C KCK  FCL CD+Y
Sbjct: 338 LQGSPLFIEIK------NEKNGPSACVSCNKPLYDKVSQ------CSKCKNIFCLGCDLY 385

Query: 300 IHESLHNCPGC 310
           IHE L+ CP C
Sbjct: 386 IHEDLNQCPFC 396


>gi|313240907|emb|CBY33192.1| unnamed protein product [Oikopleura dioica]
          Length = 384

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 107/322 (33%), Positives = 156/322 (48%), Gaps = 29/322 (9%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL-- 58
           +D  P+R     + ++ FV +FF   PL Q+ LV +++  AN +  LGGS    +KAL  
Sbjct: 79  IDRYPNRAGCAVRSLKEFVPKFFASCPLGQLALVILQNKRANIVVPLGGSEARLLKALND 138

Query: 59  --MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKE 116
             +      G  S+ N +   + +L+ +  + +RE++ +  +L+T D      TI+    
Sbjct: 139 IEVKGFKTGGQCSMVNGIAASKSMLNAVGEHSNREIIFVVGSLNTIDVTSPFATIETVAN 198

Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLI 176
             IRCSV+ LSAE+ I K L + T G Y V LD     +  +E    PP  +      L+
Sbjct: 199 EGIRCSVVSLSAEVNIWKKLAERTDGKYFVPLDPIDVSDK-LEELSRPPVESSSRQGVLV 257

Query: 177 KMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
           KMGFPQ+  +                 Y CP+C+ RV  +PT C +C L LVS+ HLAR 
Sbjct: 258 KMGFPQKEADAR---------------YICPQCRTRVKAIPTLCDVCKLSLVSAAHLARC 302

Query: 237 YHHLFPIAPFDEVTP--------LCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           YHHLFP +      P        L L   +      C GC  + +S   +   +V C KC
Sbjct: 303 YHHLFPPSSVTPSIPEQSGEEIDLELERSQFEKIRPCVGCNYAPISV-EEGRQFVVCDKC 361

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
               C ECD +I E LH+CPGC
Sbjct: 362 LNSLCGECDGFIDEHLHSCPGC 383


>gi|428176074|gb|EKX44960.1| SSL1 transcription factor & nucleotide excision repair [Guillardia
           theta CCMP2712]
          Length = 289

 Score =  167 bits (423), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 105/315 (33%), Positives = 161/315 (51%), Gaps = 42/315 (13%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R+ VV + V +F+  FFDQNP+SQ+ LV ++ G A  LT+L G+   H   L  +
Sbjct: 5   DMRPTRLGVVQQVVGSFLDNFFDQNPISQVSLVELRGGKAEKLTELSGNSRHHKNKLEER 64

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           L     +           +  +  SY  R   ++        PG  +    + K + ++ 
Sbjct: 65  LAAHRSAGAGMPSMRAGVVWDKRSSYDVRIFELMR-------PGGHLSDY-RLKSAGVKV 116

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAI----AEFAIANLIK 177
           +++G++AE+++ + + + TGGSYSVA      ++++++H  P P       +   A  + 
Sbjct: 117 NIVGMAAELYVARAVSKRTGGSYSVATHAVQVRDMVLKHTSPSPVEEGEGGKEKPAAGMW 176

Query: 178 MGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSY 237
           +GFP+++  G++             G+ CPRC   V E+P DC +CGL+L+SS HLARSY
Sbjct: 177 VGFPRKS-YGALG------------GWQCPRCMDVVKEIPADCGLCGLKLLSSSHLARSY 223

Query: 238 HHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKH-FCLEC 296
           HHLFP+  F E+      +    S STC GC              V   K  KH FC  C
Sbjct: 224 HHLFPVQSFHELED---EEGPATSTSTCGGCG-------------VLLGKESKHVFCGLC 267

Query: 297 DIYIHESLHNCPGCE 311
           D +IHE LH CPGCE
Sbjct: 268 DGFIHEMLHVCPGCE 282


>gi|66359374|ref|XP_626865.1| transcription factor TFIIH with a vWA domain [Cryptosporidium
           parvum Iowa II]
 gi|46228124|gb|EAK89023.1| transcription factor TFIIH with a vWA domain [Cryptosporidium
           parvum Iowa II]
          Length = 423

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 101/334 (30%), Positives = 174/334 (52%), Gaps = 22/334 (6%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           +D++P R+  + K  E F+++  + NPL+QI ++++ DG+   +     +    + +++ 
Sbjct: 94  LDYKPDRLQCMVKCNEIFIKQLLEDNPLTQISVISIYDGIGEVIISYNSNFLEIMTSILN 153

Query: 61  KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIM-ETIQKCKES 117
            L  GC G  S+QN L+  + LL  IP YG +E++    ++ + D   +  E ++    +
Sbjct: 154 YLKKGCKGSMSIQNGLEKAKYLLVSIPPYGTKEIIFFLGSMRSVDNSFLFNEWVEGFSSN 213

Query: 118 KIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEH--APPPPAIAEFAI-AN 174
            I  + I    E++I K + + TGG    A++  H  +L +E+   PPP  IA   +  N
Sbjct: 214 NIIINAILFIPELYIIKTITKMTGGVCLCAMNNDHLLKLTLENFIKPPPNNIASTPLNIN 273

Query: 175 LIKMGFPQRAGEGSISI-CSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
           L+ MGFP+     +    CSCH+ +    GY+CP+CK+ VC LPT C +C + L+S  HL
Sbjct: 274 LVTMGFPEYVSNQTHPFGCSCHRSL-THRGYSCPKCKSIVCYLPTKCPVCLIYLISPNHL 332

Query: 234 ARSYHHLFPIAPFDEVTPLCLND----PRNRSRSTCFGCQQSL-----LSSGNKPGLY-- 282
           A+S+ +LF      ++TP+   +      +  ++ C  C+        L+SG++  +   
Sbjct: 333 AKSFAYLF---QHPDITPIKNGEYSKIESSSIKTKCELCESQFSNNVYLNSGSRSNIINK 389

Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGCESLRHS 316
             C  C   FC EC  +I  +LH CP C SLR S
Sbjct: 390 YLCTNCSTQFCNECCKFIFNTLHQCPICCSLRIS 423


>gi|70952334|ref|XP_745342.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56525635|emb|CAH76815.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 400

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 94/312 (30%), Positives = 164/312 (52%), Gaps = 19/312 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           DF+P+R+ V+ + VE F++ FF +NP+  +G+V +K+  A  +  L  + +  + +L+  
Sbjct: 100 DFKPNRITVILECVEIFLKNFFFKNPVGHVGVVALKNSSAKLIQQLTSNIDDVLNSLVKE 159

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K G  G  SLQ  L++   LL  +  YG +E+LI+Y ++ TCD  +I++ +    ++ +
Sbjct: 160 QKEGLQGSPSLQEGLEIAHNLLMDML-YGTKEILIMYGSIRTCDKKNILKYLDLLIKNNM 218

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
             + I ++ EM I KH+C+ T G Y +   ++     I + A  P  +       LI + 
Sbjct: 219 YVNCISIAPEMHILKHICEKTHGIYKICTSKNILINEINQVAETPLWMHGME-PQLIHIC 277

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP +    +  +CSCH  +     Y C  C +  C++P+ C++CG+ L+S   L+   ++
Sbjct: 278 FPVKKKINTQIVCSCHNTLNTDT-YICNFCNSYTCKIPSKCKVCGIHLISMHDLSHITNN 336

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRST-CFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
           L       + +PL +     ++  T C  C Q L    ++      C KCK  FCLECD+
Sbjct: 337 L-------QASPLFIEIKNEQNYYTYCSSCNQQLYDKVSQ------CTKCKNIFCLECDV 383

Query: 299 YIHESLHNCPGC 310
           +IHE L+ CP C
Sbjct: 384 FIHEDLNQCPFC 395


>gi|124513058|ref|XP_001349885.1| TFIIH basal transcription factor subunit [Plasmodium falciparum
           3D7]
 gi|23615302|emb|CAD52293.1| TFIIH basal transcription factor subunit [Plasmodium falciparum
           3D7]
          Length = 401

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 94/311 (30%), Positives = 161/311 (51%), Gaps = 16/311 (5%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D +P R+ V  + VE+F++ FF +NP+  +G+V +K+  A  +     + +  + +++ +
Sbjct: 100 DLKPDRINVALECVESFLKNFFFKNPVGHVGVVALKNSSAKLIQPFTSNVDDILNSILKE 159

Query: 62  --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
              G  G  SL+  L +   LL  +P YG +EVLI+Y ++ TCD  +I+  ++   +S I
Sbjct: 160 RTAGLQGSPSLEEGLQIAHDLLIDMPLYGTKEVLIMYGSIRTCDKKNILNVLELLVKSNI 219

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
             + I ++ EM I KH+C+ + G Y +   ++     I  +A  P  +       LI + 
Sbjct: 220 YVNCISIAPEMHILKHICEKSNGFYKICSSKNSLMNEINNNAETPLWMQGME-PQLIHIC 278

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP +    +  +CSCH ++     Y C  C +  C++P+ C++CG+ L+S   L+   ++
Sbjct: 279 FPTKKKISTQIMCSCHGKLNTDT-YVCNFCNSYTCKIPSKCKVCGIHLISMHDLSHITNN 337

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
           L     F E+     N+  N     C  C Q L +      +Y  C KC+  FCLECDI+
Sbjct: 338 LQGSPLFIEIK----NEQGN--YKVCSSCNQQLYNK-----IY-QCTKCQHIFCLECDIF 385

Query: 300 IHESLHNCPGC 310
           IHE L+ CP C
Sbjct: 386 IHEELNQCPFC 396


>gi|67595351|ref|XP_665994.1| hypothetical protein [Cryptosporidium hominis TU502]
 gi|54656889|gb|EAL35767.1| hypothetical protein Chro.30353 [Cryptosporidium hominis]
          Length = 423

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 102/335 (30%), Positives = 172/335 (51%), Gaps = 24/335 (7%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           +D++P R+  + K  E F+++  + NPL+QI ++++ DG+   +     +    + +++ 
Sbjct: 94  LDYKPDRLQCMVKCNEIFIKQLLEDNPLTQISVISIYDGIGEVIISYNSNFLEIMTSILN 153

Query: 61  KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIM-ETIQKCKES 117
            L  GC G  S+QN L+  + LL  IP YG +E++    ++ + D   +  E ++    +
Sbjct: 154 YLKKGCKGSMSIQNGLEKAKYLLVSIPPYGTKEIIFFLGSMRSVDNSFLFNEWVEGFSSN 213

Query: 118 KIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEH--APPPPAIAEFAI-AN 174
            I  + I    E++I K + + TGG    A++  H  +L +E+   PPP  IA   +  N
Sbjct: 214 NIIINAILFIPELYIIKTITKMTGGVCLCAMNNDHLLKLTLENFIKPPPNNIASTPLNIN 273

Query: 175 LIKMGFPQRAGEGSISI-CSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
           L+ MGFP+     +    CSCH  +    GY+CP+CK+ VC LPT C +C + L+S  HL
Sbjct: 274 LVTMGFPEYLNNQTHPFGCSCHGSL-THRGYSCPKCKSIVCYLPTKCPVCLIYLISPNHL 332

Query: 234 ARSYHHLF---PIAPFD--EVTPLCLNDPRNRSRSTCFGCQQSL-----LSSGNKPGLY- 282
           A+S+ +LF    I P    E + +      +  ++ C  C+        L+SG++  +  
Sbjct: 333 AKSFAYLFQHPDIIPIKNGEYSKI----ESSSIKTKCELCESQFSNNVYLNSGSRSNIIN 388

Query: 283 -VACPKCKKHFCLECDIYIHESLHNCPGCESLRHS 316
              C  C   FC EC  +I  +LH CP C SLR S
Sbjct: 389 KYLCTNCNTQFCNECCKFIFTTLHQCPICCSLRIS 423


>gi|300709301|ref|XP_002996816.1| hypothetical protein NCER_100001 [Nosema ceranae BRL01]
 gi|239606142|gb|EEQ83145.1| hypothetical protein NCER_100001 [Nosema ceranae BRL01]
          Length = 335

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 102/312 (32%), Positives = 161/312 (51%), Gaps = 36/312 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D+ PS    V + +E F+  FF +NP+S +   TV +           +    I  L+ +
Sbjct: 59  DYLPSIRKNVIRSLEKFIPTFFLENPISGLSFSTVNEKTVK------STNSVEIADLLNQ 112

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIR 120
            G  G+ SL N L      + QI SY   RE+L++ S+L   DP    + I   ++  I+
Sbjct: 113 KG-EGNFSLLNGL---YDAIDQIKSYTFCREILVIVSSLVLKDPDSYTDVIDLLRKHNIK 168

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
            ++I L  E+ I K++ + TGG + V L+  HF  ++       P     +  NLIK+GF
Sbjct: 169 VNIISLCGELMIYKNIVESTGGKFFVPLNIDHFNYIL--RCMTVPGELNSSTINLIKLGF 226

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           P+   E  +  C+CH +++  VGY CP CK  +C LP  C IC LQLVSS ++A+++ H+
Sbjct: 227 PKVIYEEGV--CACHLQLQ-SVGYECPLCKTFICSLPMGCPICELQLVSSLNIAKAFQHM 283

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
           +P+APF +    C N         CF C        N PG + +C KC   +C  CD+++
Sbjct: 284 YPLAPFTK----CCN-------GVCFVC--------NSPGNF-SCEKCNSVYCDSCDLFV 323

Query: 301 HESLHNCPGCES 312
           H +L+ C GC++
Sbjct: 324 HNNLNFCLGCKN 335


>gi|261190444|ref|XP_002621631.1| RNA polymerase TFIIH complex subunit Ssl1 [Ajellomyces dermatitidis
           SLH14081]
 gi|239591054|gb|EEQ73635.1| RNA polymerase TFIIH complex subunit Ssl1 [Ajellomyces dermatitidis
           SLH14081]
          Length = 502

 Score =  160 bits (405), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 92/246 (37%), Positives = 140/246 (56%), Gaps = 23/246 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV EFF+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 140 DLRPTRYLLTLRYAQEFVLEFFEQNPISQLGVLGMRDGLAVRISDMSGNPTDHILAIQSL 199

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQN L++ +G L + P++G REV I++ AL + DPGDI +TI      KI
Sbjct: 200 RPKDPKGMPSLQNTLEMARGALFRTPTHGTREVFIIFGALLSSDPGDIHKTINTLVADKI 259

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIME--------------HA 161
           R S+IGL+A++ IC+ +C  T       Y VAL+E HF+EL M                 
Sbjct: 260 RVSIIGLAAQVAICRDICARTNNGDDSGYGVALNEQHFRELFMNVTTPPATTVAPTPTTT 319

Query: 162 PPPPAIAEFAIANLIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTD 219
                 ++   ++L+ MGFP R  +   + ++C+CH +     GY C RC A+VC LPT 
Sbjct: 320 KEETKTSQTTTSSLLMMGFPSRTLSPTTTPTLCACHSKPSR-TGYLCSRCGAKVCTLPTS 378

Query: 220 CRICGL 225
           C   GL
Sbjct: 379 CPCGGL 384


>gi|209880135|ref|XP_002141507.1| suppressor of stem-loop protein 1 [Cryptosporidium muris RN66]
 gi|209557113|gb|EEA07158.1| suppressor of stem-loop protein 1, putative [Cryptosporidium muris
           RN66]
          Length = 424

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 95/321 (29%), Positives = 159/321 (49%), Gaps = 13/321 (4%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM- 59
           +D++P R+  + + ++ F+     +NPL+QI ++++++G+ + +T    +    +  ++ 
Sbjct: 96  LDYKPDRLQCMVRCIDTFICNLLQENPLTQISVISIRNGLTSIVTTYNSNYREIVSGILS 155

Query: 60  -GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIM-ETIQKCKES 117
             K GCSG  S++N L+  + LL+ IP YG +E++    ++ + D   ++ E +      
Sbjct: 156 EAKNGCSGVMSIRNGLEKAKQLLASIPPYGTKEIVFFLGSMRSIDNDSMLNEWLDDFIAH 215

Query: 118 KIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAI---AN 174
            I  + +    E+FI K +   TGG    AL+  H  +L++E+   PP+ ++       N
Sbjct: 216 NIVINALLFIPELFIIKTITTKTGGICLCALNSEHMLQLLLENFVKPPSYSQLNTPLHIN 275

Query: 175 LIKMGFPQRAGEGSISI-CSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
           L+ MGFP         + CSCHK +    GY CPRCK+ VC LP  C IC L L S+ HL
Sbjct: 276 LVPMGFPMYFNNSGYPLQCSCHKSI-TSDGYCCPRCKSLVCYLPIKCPICNLYLASANHL 334

Query: 234 ARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNK----PGLYVACPKCK 289
            +S+  LF   P  EV  L   + ++     C  CQ    S        P   + C  C 
Sbjct: 335 TKSFAFLFK-PPSMEVLQLNPGEYKSDIPRHCRFCQNLFTSKATYRNGFPFSLIKCTNCC 393

Query: 290 KHFCLECDIYIHESLHNCPGC 310
              C++C   I  +LH CP C
Sbjct: 394 SFLCIDCCKLILLALHQCPEC 414


>gi|358371546|dbj|GAA88153.1| RNA polymerase TFIIH complex subunit Ssl1 [Aspergillus kawachii IFO
           4308]
          Length = 320

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 75/192 (39%), Positives = 121/192 (63%), Gaps = 7/192 (3%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FVREFF+QNP+SQ+G++ +KDG+A  ++D+ G+P  HI A+  +
Sbjct: 108 DLRPTRYLLSLRYAQEFVREFFEQNPISQLGVLGLKDGLAVRVSDMSGNPTEHISAIQSL 167

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQN  ++ +G L   PS+G RE+ +++ +L + DPGDI +TI      KI
Sbjct: 168 RDQDPKGLPSLQNGFEMARGALFHTPSHGTREIFVIFGSLLSSDPGDIHQTITTLINDKI 227

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
           R  ++GL+A++ IC+ +C  T G    +Y VAL+E HF++L+M    PP   ++  + ++
Sbjct: 228 RVGIVGLAAQVAICREICGKTNGGDDTTYGVALNEQHFRDLVMSVTTPPATYSQKQSTSS 287

Query: 175 LIKMGFPQRAGE 186
           L+ MGFP R  E
Sbjct: 288 LLMMGFPSRTVE 299


>gi|242807960|ref|XP_002485064.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218715689|gb|EED15111.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 313

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 73/169 (43%), Positives = 111/169 (65%), Gaps = 6/169 (3%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV EFF+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 103 DIRPTRYLLALRYAQEFVIEFFEQNPISQLGVIGMRDGLAVRISDMSGNPTDHITAIHAL 162

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQNAL++ +G L   PS+G REVLI++ +L + DPGDI +T++     KI
Sbjct: 163 RSDDPKGLPSLQNALEMARGALFHTPSHGTREVLIVFGSLLSSDPGDIHQTLKSLVADKI 222

Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPP 164
           R  ++GL+A++ IC+ LC  T G     Y VAL+E HF+EL+ME   PP
Sbjct: 223 RVGIVGLAAQVAICRELCAKTNGGDDTVYGVALNEQHFRELMMEVTIPP 271


>gi|119182598|ref|XP_001242425.1| hypothetical protein CIMG_06321 [Coccidioides immitis RS]
          Length = 364

 Score =  151 bits (381), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 74/169 (43%), Positives = 110/169 (65%), Gaps = 6/169 (3%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FV E+F+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 123 DVRPTRYLLTLRYAQEFVLEYFEQNPISQLGIIGMRDGLAVRISDMSGNPSEHILAIQAL 182

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                +G  SLQNAL++ +G L   PS+G REVLI++ AL + DPGDI +TI      KI
Sbjct: 183 RTRDPTGLPSLQNALEMARGALFHTPSHGTREVLIIFGALLSSDPGDIHKTITSLVTDKI 242

Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP 164
           R  V+GL+AE+ IC+ +C  T       Y VAL+E HF+EL+ME   PP
Sbjct: 243 RVGVVGLAAEVAICREICSKTNAGDDSGYGVALNEQHFRELMMEITTPP 291



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 16/28 (57%), Positives = 22/28 (78%)

Query: 285 CPKCKKHFCLECDIYIHESLHNCPGCES 312
           C  C  HFC++CD++ HE +HNCPGC+S
Sbjct: 309 CTVCHNHFCIDCDVFAHEIVHNCPGCQS 336


>gi|365759512|gb|EHN01295.1| Ssl1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 303

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 75/161 (46%), Positives = 109/161 (67%), Gaps = 6/161 (3%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RPSR A++ +    FV EFFDQNP+SQ+G++ +++G+A  ++ + G+P+ HI AL  +
Sbjct: 141 DLRPSRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 200

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P++  REVLI++ +LST DPGDI +TI      KI
Sbjct: 201 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 260

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKEL 156
           R  V+GLSA++ ICK LC+ T       Y + LDE+H KEL
Sbjct: 261 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKEL 301


>gi|359474618|ref|XP_003631495.1| PREDICTED: uncharacterized protein LOC100853435 [Vitis vinifera]
          Length = 235

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 71/94 (75%), Positives = 83/94 (88%)

Query: 15  VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGKLGCSGDSSLQNAL 74
           ++   +EFFDQ+P SQIGLVT+KDG+A CLTDLGGS +SH+KALMGKL CSGDSSLQNAL
Sbjct: 142 LQTMKQEFFDQDPFSQIGLVTIKDGLAPCLTDLGGSIDSHVKALMGKLECSGDSSLQNAL 201

Query: 75  DLVQGLLSQIPSYGHREVLILYSALSTCDPGDIM 108
           +LV+G L+ IPSYG+REVLILYSALS CD GDIM
Sbjct: 202 NLVRGYLNLIPSYGYREVLILYSALSACDTGDIM 235


>gi|323347610|gb|EGA81877.1| Ssl1p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 302

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 74/161 (45%), Positives = 109/161 (67%), Gaps = 6/161 (3%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    FV EFFDQNP+SQ+G++ +++G+A  ++ + G+P+ HI AL  +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P++  REVLI++ +LST DPGDI +TI      KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259

Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKEL 156
           R  V+GLSA++ ICK LC+ T       Y + LDE+H KEL
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKEL 300


>gi|313239111|emb|CBY14088.1| unnamed protein product [Oikopleura dioica]
          Length = 369

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 102/322 (31%), Positives = 147/322 (45%), Gaps = 44/322 (13%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL-- 58
           +D  P+R     + ++ FV +FF   PL Q+ LV +++  AN +  LGGS    +KAL  
Sbjct: 79  IDRYPNRAGCAVRSLKEFVPKFFASCPLGQLALVILQNKRANIVVPLGGSEARLLKALND 138

Query: 59  --MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKE 116
             +      G  S+ N +   + +L+ +  + +RE++ +  +L+T D      TI+    
Sbjct: 139 IEVKGFKTGGQCSMVNGIAASKSMLNAVGEHSNREIIFVVGSLNTIDVTSPFATIETVAN 198

Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLI 176
             IRCSV+ LSAE+ I K L + T G Y V LD     +  +E    PP  +      L+
Sbjct: 199 EGIRCSVVSLSAEVNIWKKLAERTDGKYFVPLDPIDVSDK-LEELSRPPVESSSRQGVLV 257

Query: 177 KMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
           KMGFPQ                              V  +PT C +C L LVS+ HLAR 
Sbjct: 258 KMGFPQ------------------------------VKAIPTLCDVCKLSLVSAAHLARC 287

Query: 237 YHHLFPIAPFDEVTP--------LCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
           YHHLFP +      P        L L   +      C GC  + +S   +   +V C KC
Sbjct: 288 YHHLFPPSSVTPSIPEQSGEEIDLELERSQFEKIGPCVGCNYAPISV-EEGRQFVVCDKC 346

Query: 289 KKHFCLECDIYIHESLHNCPGC 310
               C ECD +I E LH+CPGC
Sbjct: 347 LNSLCGECDGFIDEHLHSCPGC 368


>gi|403222976|dbj|BAM41107.1| uncharacterized protein TOT_030000370 [Theileria orientalis strain
           Shintoku]
          Length = 468

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 99/336 (29%), Positives = 165/336 (49%), Gaps = 32/336 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF+P R+      ++ FV++ F + P++Q+G+V +++ VAN +  LG +PE  ++ L   
Sbjct: 136 DFKPDRLYCSFGALKEFVKDLFSKGPITQLGMVVMRNKVANLICQLGTNPEEQLELLGNS 195

Query: 62  L--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
           L  G  G SSLQN L++   +L ++PSY  REVL+++ +  T D G+I+ T++K K++ +
Sbjct: 196 LKEGADGPSSLQNGLEMCLNMLYELPSYSTREVLVIFGSNRTLDAGNILVTLEKLKQNNL 255

Query: 120 RCSVIGLSAE----MFICK---------HLCQDTGGSYSVALDESHFKELIMEHAPPPPA 166
             + I LS E    +F  K              T    S   D+ +   + +        
Sbjct: 256 TINSISLSPENRRQLFGGKGCKSPETPTQQLHITSTMVSTMSDQIYMLPVSIVWNKLISL 315

Query: 167 IAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
             ++    L+K+ FP      ++S+C CH  + V   Y CP+C ++ C +PT C++    
Sbjct: 316 RKKWMEPVLLKVAFPPLKKTATVSLCVCHSSL-VNKAYICPQCHSKSCYIPTKCKVRTFI 374

Query: 227 LV-----SSPHLARS--YHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGN-K 278
           L+     ++ H   S  YHH      F  +T      P+N S+   +   Q   SS N +
Sbjct: 375 LIGAIDHNNRHAVESTWYHHQIFQGLF--ITSYL---PKNSSK---YENTQVTCSSCNLR 426

Query: 279 PGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLR 314
             +   CP C   FC  C++YI E LH CP C  L+
Sbjct: 427 NNVGFRCPSCDSWFCEYCNVYIQEELHQCPMCLFLK 462


>gi|401825920|ref|XP_003887054.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit SSL1 [Encephalitozoon hellem
           ATCC 50504]
 gi|392998212|gb|AFM98073.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit SSL1 [Encephalitozoon hellem
           ATCC 50504]
          Length = 340

 Score =  147 bits (370), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 98/311 (31%), Positives = 155/311 (49%), Gaps = 34/311 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF P+    VA+ +E F+  F+++NPLS +  ++ +D     ++ +    +  I A +G+
Sbjct: 59  DFLPTFRTNVARILEDFIPSFYNENPLSTLSFLSTRDVCVKYISSM----DMDIHAFLGQ 114

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIR 120
            G S   SL N L   +G +  + +  H +E+L++ ++ ST DP    E + + K   I+
Sbjct: 115 TG-SKWFSLLNGL---EGSIEIMENTMHVKEILVIVASTSTRDPHGYTEVLNRLKTHNIK 170

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
              I L  E+ + K + + TGG + V +D  H   ++ E + P          +L+K+GF
Sbjct: 171 VHFISLCGEVTLYKSISKATGGRFYVPIDVGHLSMIMKELSHPTDFNG--TTLSLVKIGF 228

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           P    E S+  C+CH EVK  VGY CP CK  VC LP  C IC  QLVS+ +L++S   +
Sbjct: 229 PLPTMESSV--CACHLEVK-SVGYECPVCKTMVCSLPISCPICNTQLVSTLNLSKSLRFI 285

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
           +P+ PF E           R+   C  CQ          G Y  C  CK  +C  C+ ++
Sbjct: 286 YPLKPFIE-----------RAEDACRICQNK--------GAY-QCDLCKSTYCSYCNGFV 325

Query: 301 HESLHNCPGCE 311
           H +L  C  CE
Sbjct: 326 HSTLSFCIYCE 336


>gi|396081175|gb|AFN82793.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit SSL1 [Encephalitozoon
           romaleae SJ-2008]
          Length = 340

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 99/311 (31%), Positives = 155/311 (49%), Gaps = 34/311 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF P+    VA+ +E F+  F+++NPLS +  ++ +D     ++ +    +  + A +G+
Sbjct: 59  DFLPTFRTNVARILEEFIPSFYNENPLSTLSFLSTRDVCVKYISSM----DMDVHAFLGQ 114

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIR 120
            G S   SL N L   +G +  + +  H +E+L++ ++ ST DP    E + K K   I+
Sbjct: 115 TG-SKWFSLLNGL---EGSIEIMRNTMHVKEILVIVASTSTRDPHGYTEVLNKLKTYNIK 170

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
              I L  E+ + K + + TGG + V +D  H   ++ E + P          +L+K+GF
Sbjct: 171 VHFISLCGELTLYKSISKATGGRFYVPVDVGHLSMIMKELSHPTDFNG--TTLSLVKIGF 228

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           P    E S+  CSCH E+K  +GY CP CK  VC LPT C ICG QLVS+ +L++S   L
Sbjct: 229 PLPTIESSV--CSCHLEMK-SIGYECPVCKTMVCSLPTSCPICGTQLVSTLNLSKSLRFL 285

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
           +P+ PF E           ++   C  C+          G Y  C  CK  FC  C+   
Sbjct: 286 YPLKPFIE-----------KAEGICRICRDK--------GAY-QCELCKSTFCSYCNGLA 325

Query: 301 HESLHNCPGCE 311
           H +L  C  CE
Sbjct: 326 HNTLSFCIYCE 336


>gi|19074105|ref|NP_584711.1| similarity to SUPPRESSOR OF STEM LOOP PROTEIN 1 [Encephalitozoon
           cuniculi GB-M1]
 gi|19068747|emb|CAD25215.1| similarity to SUPPRESSOR OF STEM LOOP PROTEIN 1 [Encephalitozoon
           cuniculi GB-M1]
 gi|449328973|gb|AGE95248.1| suppressor of stem loop protein 1 [Encephalitozoon cuniculi]
          Length = 340

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 103/313 (32%), Positives = 154/313 (49%), Gaps = 38/313 (12%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF P+  A V K +E F+  F+ +NPLS +  ++++D           S +  I A +G+
Sbjct: 59  DFLPTFRANVTKVLEGFIPSFYSENPLSTLSFLSIRD----VCVKYTSSMDIDIHAFLGQ 114

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIR 120
            G S   SL N L   +G +  + +  H +E+L++ ++ ST DP    E + K +   I+
Sbjct: 115 TG-SKWFSLLNGL---EGSIEVMKNTMHVKEILVIVASTSTRDPHGYAEVLAKLRAYNIK 170

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEF--AIANLIKM 178
              I L  E+ + K + + T G + V +D  H   ++ E + P     +F     +L+K+
Sbjct: 171 VHFISLCGEITLYKSISKATEGRFYVPVDAGHLSAIMRELSHP----TDFNGTKLSLVKI 226

Query: 179 GFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
           GFP    E S+  C+CH EVK G GY CP CK  VC LP  C IC  QLVS+ +L++S  
Sbjct: 227 GFPSPMMEPSV--CACHLEVK-GAGYECPVCKTMVCSLPISCPICSTQLVSTLNLSKSLR 283

Query: 239 HLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
            L+P+ PF E           ++  TC  CQ          G Y  C  CK  FC  C+ 
Sbjct: 284 FLYPLRPFVE-----------KAEGTCRVCQGK--------GAY-QCELCKSTFCSYCNR 323

Query: 299 YIHESLHNCPGCE 311
            IH +L  C  CE
Sbjct: 324 LIHNTLSFCIYCE 336


>gi|303388880|ref|XP_003072673.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit SSL1 [Encephalitozoon
           intestinalis ATCC 50506]
 gi|303301815|gb|ADM11313.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit SSL1 [Encephalitozoon
           intestinalis ATCC 50506]
          Length = 340

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 101/311 (32%), Positives = 153/311 (49%), Gaps = 34/311 (10%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF P+    VAK +E F+  F+ +NPLS +  ++V+D     ++ +    +  I A +G+
Sbjct: 59  DFLPTFRTNVAKILEGFIPSFYSENPLSTLSFLSVRDVCVKYISSM----DMDIHAFLGQ 114

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIR 120
            G S   SL N L   +G +  I +  H +E+L++ ++ ST DP    E + K K   I+
Sbjct: 115 TG-SKWFSLLNGL---EGSVEIIKNTTHVKEILVIVASTSTRDPHGYTEVLTKLKVHNIK 170

Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
              I L  E+ + K + + T G + V +D  H   +IM+    P       ++ L+K+GF
Sbjct: 171 VHFISLCGEVTLYKSISKATEGRFYVPVDVGHL-SIIMKELSHPTDFNGTTLS-LVKIGF 228

Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
           P    E S+  C+CH E+K   GY CP CK  VC LP  C IC  QLVS+ +L++S   L
Sbjct: 229 PLPVIEPSV--CACHLEMK-SAGYECPVCKTMVCSLPVSCPICNTQLVSTLNLSKSLRFL 285

Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
           +P+ PF E           +    C  CQ          G Y  C  CK  FC  C+ ++
Sbjct: 286 YPLKPFIE-----------KEGKMCRICQSK--------GGY-QCELCKSIFCNSCNGFV 325

Query: 301 HESLHNCPGCE 311
           H +L  C  CE
Sbjct: 326 HNTLSFCIYCE 336


>gi|326935424|ref|XP_003213771.1| PREDICTED: general transcription factor IIH subunit 2-like, partial
           [Meleagris gallopavo]
          Length = 208

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 75/164 (45%), Positives = 104/164 (63%), Gaps = 7/164 (4%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQI--GLVTVKDGVANCLTDLGGSPESHIKALM 59
           D +P+R+    K +E FV E+FDQNP+SQ   G V   +        + G+P+ HI AL 
Sbjct: 30  DLKPNRLTCTLKLLEYFVEEYFDQNPISQDPPGHVAATE---ERRARMPGNPKKHIAALK 86

Query: 60  G--KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKES 117
               + C G+ SL N+L+L    L  +P +  REVLI++S+L+TCDP +I E I+  K  
Sbjct: 87  KAVDMNCQGEPSLYNSLNLAMQTLKHMPGHTSREVLIVFSSLTTCDPANIYELIKCLKAV 146

Query: 118 KIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHA 161
           KIR SVIGLSAE+ +C  L ++TGG+Y V LDESH+KEL+M H 
Sbjct: 147 KIRVSVIGLSAEVRVCTVLTRETGGTYHVILDESHYKELLMHHV 190


>gi|451994856|gb|EMD87325.1| hypothetical protein COCHEDRAFT_1197444 [Cochliobolus
           heterostrophus C5]
          Length = 718

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 104/358 (29%), Positives = 151/358 (42%), Gaps = 109/358 (30%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D RP+R   +    + ++REFF+QNP+SQ+ ++ + D  +                    
Sbjct: 118 DMRPNRFITMINYTQDYIREFFEQNPISQMSVLGMHDECSRV------------------ 159

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
               G+S   N L   Q   ++    GHR +                   Q  +    R 
Sbjct: 160 ----GES---NTLPHAQSWHTR----GHRRLR------------------QPAESGPWRH 190

Query: 122 SVIGLSAEMFICKHLCQ--DTG--GSYSVALDESHFKELIMEHAPPP------------- 164
           S I        CK +C+  DTG    Y++A D+   +EL++    PP             
Sbjct: 191 SPI--------CKGVCEGPDTGDESEYTIATDQEMLRELLLATTTPPVIRQPLVTDKTAP 242

Query: 165 --PAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRI 222
              A +  + A L+ MGFP R  E  +++C+CH  + +G GYTC RC A+VC LP  C  
Sbjct: 243 ATTAQSSESAAALMMMGFPSRVVEDQLTMCACHGNLTMG-GYTCSRCSAKVCSLPITCPS 301

Query: 223 CGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-------------- 268
           C L L+ S HLARSYHHLFP+  +  V+    +  R      C GC              
Sbjct: 302 CQLTLLLSTHLARSYHHLFPLRNWATVS---WSRAREMGSKQCVGCLAAFSNPPSNAKEN 358

Query: 269 ---QQ-------------SLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
              QQ             S     ++ G Y  C  C+ HFC++CD++ H  LHNCPGC
Sbjct: 359 GEAQQDTEAKVTKKHDDDSEEQKASESGRY-ECRACESHFCIDCDMFAHMVLHNCPGC 415


>gi|145473807|ref|XP_001462567.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124430407|emb|CAK95194.1| unnamed protein product [Paramecium tetraurelia]
          Length = 361

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 94/317 (29%), Positives = 149/317 (47%), Gaps = 31/317 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF+PSR+AV  + + AF+ +F + NPLSQIG+   ++     + D   S   +IK  +  
Sbjct: 67  DFKPSRLAVALQFLPAFIEQFLECNPLSQIGIAVAEEYRCKTILDFTSS-WVNIKQYLST 125

Query: 62  L------GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCK 115
           +      G S  + LQ AL +     S    +    +L++ ++  + D  D+ E  ++C+
Sbjct: 126 IHSINEAGFSMAACLQTALHV----FSTTQLHAQSSILVVTTSTYSDDKLDLNEWSERCQ 181

Query: 116 ESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
            + I+ +VI  +  +     + Q T G Y   L+E  F + I +   P  +     I  L
Sbjct: 182 SAAIQINVISFTGSINQLIKITQATDGQYLCPLNEFQFSQEIQKFVSPQESKNHKMITQL 241

Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
           +K+GFPQ+      ++C CH E+ V   Y CP C ++VC  P  C IC   +V    + R
Sbjct: 242 VKIGFPQKLIVSQPTLCQCHLEI-VYNFYKCPVCYSKVCSPPLLCPICSTWIVLPHQILR 300

Query: 236 S--YHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFC 293
           S  Y+ L       E+T    ND  +     C GC +          ++  C +CK  FC
Sbjct: 301 SDGYNTLGVF----EIT----NDGLDH---VCSGCLEP------TTVIHSTCERCKNFFC 343

Query: 294 LECDIYIHESLHNCPGC 310
           L+CDI IH     CPGC
Sbjct: 344 LDCDILIHSKFKVCPGC 360


>gi|321447765|gb|EFX61168.1| hypothetical protein DAPPUDRAFT_340521 [Daphnia pulex]
          Length = 167

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 71/182 (39%), Positives = 104/182 (57%), Gaps = 23/182 (12%)

Query: 59  MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
           M    C+G++SLQN+ +     L  +P++  REVL L   L         + I+  KE+ 
Sbjct: 7   MKDTSCNGETSLQNSFERAMCGLKNMPAHSSREVLTLLVILVIYK-----KKIKSFKENN 61

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
           IR S+IGL+AE+ IC+ + + TGG+Y+V LD+ + KE ++ +   P A+A    A+L+KM
Sbjct: 62  IRVSIIGLAAEVRICQEIAKRTGGTYNVLLDDHYLKETLILNQVQPLAVAGTMEASLVKM 121

Query: 179 GFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
           GF   +                  GY CP+  +R CELP +C+ CGL LVS+PHLARSYH
Sbjct: 122 GFSNNS------------------GYNCPQSGSRYCELPVECKQCGLTLVSAPHLARSYH 163

Query: 239 HL 240
            L
Sbjct: 164 LL 165


>gi|301097433|ref|XP_002897811.1| general transcription factor IIH subunit, putative [Phytophthora
           infestans T30-4]
 gi|262106559|gb|EEY64611.1| general transcription factor IIH subunit, putative [Phytophthora
           infestans T30-4]
          Length = 164

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/174 (43%), Positives = 98/174 (56%), Gaps = 27/174 (15%)

Query: 154 KELIMEHAPPPPAIAEFA--IANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKA 211
           K L+     P P +A  A   A L++MGFPQR   G++S+C+CH+     VGY CPRCK+
Sbjct: 2   KRLMTAFTFPSPTVATAASRFATLVEMGFPQRRS-GALSLCTCHQAFTT-VGYLCPRCKS 59

Query: 212 RVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDE--------------VTPLCLNDP 257
           + C+LPT C++C L LVSSPHLARSYHHLFP+A F +              ++P      
Sbjct: 60  KSCDLPTTCQVCNLPLVSSPHLARSYHHLFPVAKFAQHLLRSGATGEKGAKISP------ 113

Query: 258 RNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCE 311
               +  CFGC   L   G        C  C+  FC ECD Y+H+SLHNCPGC 
Sbjct: 114 -ELVQKKCFGCLLLLGLDGEGAAY--ECATCQNVFCSECDTYVHDSLHNCPGCS 164


>gi|367054704|ref|XP_003657730.1| hypothetical protein THITE_2123698 [Thielavia terrestris NRRL 8126]
 gi|347004996|gb|AEO71394.1| hypothetical protein THITE_2123698 [Thielavia terrestris NRRL 8126]
          Length = 282

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 62/138 (44%), Positives = 90/138 (65%), Gaps = 3/138 (2%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D  P+R  +       FVRE+F+QNP+SQ+G++ ++DGVA  ++D+GG+P  HI+ L   
Sbjct: 121 DLLPTRYLLTLNYAVDFVREYFEQNPISQMGIIAMRDGVAVRISDMGGNPAEHIEKLRTW 180

Query: 62  LGC---SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
                  G+ SLQNAL++ +G L   PS+G REVLI+Y AL + DPGDI +TI      +
Sbjct: 181 AAQGEPQGNPSLQNALEMCRGALFHTPSHGTREVLIIYGALLSSDPGDIHDTIANLITDR 240

Query: 119 IRCSVIGLSAEMFICKHL 136
           IR S++GLSA++ IC  L
Sbjct: 241 IRVSIVGLSAQVAICAEL 258


>gi|123469529|ref|XP_001317976.1| transcription factor ssl1 family protein [Trichomonas vaginalis G3]
 gi|121900723|gb|EAY05753.1| transcription factor ssl1 family protein [Trichomonas vaginalis G3]
          Length = 394

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 104/336 (30%), Positives = 158/336 (47%), Gaps = 51/336 (15%)

Query: 3   FRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MG 60
           F  +RM ++  Q   F+ +FF QNPLSQ+ ++   +     LT L  + + H+K +  + 
Sbjct: 78  FNQNRMKLIQDQTIDFINDFFVQNPLSQLSILATYESTCRILTPLSCNVQDHVKKIKSLS 137

Query: 61  KLGCSGDSSLQNALDLVQGLL---------SQIPSYGHREVLILYSALSTCDPGDIMETI 111
                G+ SL+++L +   +L         +QI +   +EVLI+Y +L+TCD   I +T+
Sbjct: 138 LEDHGGEPSLESSLSIATAILGNGEKNPGLAQIST--TKEVLIVYGSLTTCDNSPIYKTL 195

Query: 112 QKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFA 171
              ++SKI+ S+IGL A++F+ + +  +T G Y V +   H  ++      PP    EF 
Sbjct: 196 NLVRDSKIKVSIIGLGAKVFVLEKIAAETKGEYFVPVSIEHLHDIFHSFVIPP----EFM 251

Query: 172 IAN---LIKMGFPQRAGEGSISI-----------CSCHKEVKVGVGYTCPRCKARVCELP 217
           +      I  GF   A     S             +   E     GY CPRC  RV  +P
Sbjct: 252 LNQRRYFIPFGFASLASPTVPSFDVFKLMSNPPSKTNKPETPQYGGYQCPRCGTRVFSIP 311

Query: 218 TDCRICGLQLVSSPHLARSYHHLFPIAPF-DEVTPLCLNDPRNRSRSTCFGCQQSLLSSG 276
             C +C   L+S  +L R+  HL P+  F  E +P             CFGC        
Sbjct: 312 CFCPVCRDFLISPAYLKRTVCHLTPVPLFVSEESP---------DTDHCFGC-------- 354

Query: 277 NKPGLYVAC--PKCKKHFCLECDIYIHESLHNCPGC 310
           N P   + C  PKC K FC +CD +IHE L +CPGC
Sbjct: 355 NMPFEEIPCRCPKCNKCFCKKCDPFIHEVLQSCPGC 390


>gi|169596905|ref|XP_001791876.1| hypothetical protein SNOG_01225 [Phaeosphaeria nodorum SN15]
 gi|160707395|gb|EAT90874.2| hypothetical protein SNOG_01225 [Phaeosphaeria nodorum SN15]
          Length = 716

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 76/211 (36%), Positives = 105/211 (49%), Gaps = 46/211 (21%)

Query: 142 GSYSVALDESHFKELIMEHAPPPPAIAEFAI----------ANLIKMGFPQRAGEGSISI 191
           G Y++A D+   KEL++     PP I + A           A L+ MGFP R  E   ++
Sbjct: 206 GEYTIATDQEMLKELLIA-TTTPPVIRQTATTTDAPPPESAAALMMMGFPSRVVEDKPTM 264

Query: 192 CSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTP 251
           C+CH  + +G GYTC RC A+VC LP  C  C L L+ S HLARSYHHLFP+  +  V+ 
Sbjct: 265 CACHGNLTMG-GYTCSRCSAKVCSLPITCPSCQLTLLLSTHLARSYHHLFPLRNWATVS- 322

Query: 252 LCLNDPRNRSRSTCFGC-----------------QQSLLSSGNK-------------PGL 281
              +  R +    C  C                 +Q+L ++G+K              G 
Sbjct: 323 --WSRAREKGSKECVSCLSSFSKPPASDTERTQTEQNLETNGHKRDADDSEEQKASESGR 380

Query: 282 YVACPKCKKHFCLECDIYIHESLHNCPGCES 312
           Y  C  C+ HFC++CD++ H  LHNCPGC S
Sbjct: 381 Y-ECRACESHFCIDCDMFAHMVLHNCPGCLS 410



 Score = 40.8 bits (94), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 15/37 (40%), Positives = 25/37 (67%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKD 38
           D RP+R   +    + +VREFF+QNP+SQ+ ++ + D
Sbjct: 119 DMRPNRYITMINYAQDYVREFFEQNPISQMSVLGMHD 155


>gi|76157401|gb|AAX28337.2| SJCHGC09266 protein [Schistosoma japonicum]
          Length = 234

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 62/149 (41%), Positives = 95/149 (63%), Gaps = 4/149 (2%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D +P+R+    K    FVRE+FDQNP+SQ+G++   D  A  LT+L G+P  H+ AL  +
Sbjct: 88  DLKPNRLICTIKAACTFVREYFDQNPISQLGIIVTSDRRAERLTELSGNPRPHLAALQSL 147

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
               C+G+ SLQNAL L +  L    +  H E+++L ++L+TCDPGDI +TI+    ++I
Sbjct: 148 YSRTCAGEPSLQNALLLAESRLKY--TIHHNEIVVLIASLTTCDPGDIHQTIKSLSSNRI 205

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVAL 148
            CSVI L+ E+F+ + L Q T G++ V +
Sbjct: 206 HCSVISLAVEVFVYRALAQFTQGTFHVIM 234


>gi|402580964|gb|EJW74913.1| hypothetical protein WUBG_14179, partial [Wuchereria bancrofti]
          Length = 144

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 65/138 (47%), Positives = 89/138 (64%), Gaps = 4/138 (2%)

Query: 115 KESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIAN 174
           K   IRCSVI LSAE+F+ K LC  T G + V LD +HF+ ++ EHA PP   +  A ++
Sbjct: 3   KRYHIRCSVISLSAEIFVFKKLCSITSGRHDVVLDSTHFEVILSEHAIPP-VCSRNAESS 61

Query: 175 LIKMGFPQRAGEGSISICSCHK-EVKV--GVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
           +++MGFP        S C CH+ EV+   G G+ CP+C AR C LP +CRIC L L+S+P
Sbjct: 62  VVRMGFPAHESIDCPSFCLCHQSEVRPPGGRGFFCPQCGARYCSLPVECRICKLTLISAP 121

Query: 232 HLARSYHHLFPIAPFDEV 249
            LARS H+L P+  F+E+
Sbjct: 122 QLARSLHNLLPLPAFEEI 139


>gi|403267800|ref|XP_003925995.1| PREDICTED: general transcription factor IIH subunit 2 [Saimiri
           boliviensis boliviensis]
          Length = 233

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 61/134 (45%), Positives = 88/134 (65%), Gaps = 2/134 (1%)

Query: 30  QIGLVTVKDGVANCLTDLGGSPESHIKALMG--KLGCSGDSSLQNALDLVQGLLSQIPSY 87
           +IG++  K   A  LT+L G+P  HI +L     + C G+ SL N+L +    L  +P +
Sbjct: 30  KIGIIVTKSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGH 89

Query: 88  GHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVA 147
             REVLI++S+L+TCDP +I + I+  K +KIR SVIGLSAE+ +C  L ++TGG+Y V 
Sbjct: 90  TSREVLIIFSSLTTCDPSNIYDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGTYHVI 149

Query: 148 LDESHFKELIMEHA 161
           LDESH+KEL+  H 
Sbjct: 150 LDESHYKELLTHHV 163



 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 5/51 (9%)

Query: 260 RSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
           R    C+GCQ  L        +YV C  C+  FC++CD+++H+SLH CPGC
Sbjct: 178 RMAGFCYGCQGEL----KDQHVYV-CAVCQNVFCVDCDVFVHDSLHCCPGC 223


>gi|425774738|gb|EKV13039.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Penicillium
           digitatum PHI26]
 gi|425780731|gb|EKV18732.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Penicillium
           digitatum Pd1]
          Length = 296

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 57/138 (41%), Positives = 90/138 (65%), Gaps = 2/138 (1%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FVREFF+QNP+SQ+G++ ++DG+A  ++D+ G+P  H+ A+  +
Sbjct: 101 DLRPTRYLLALRYAQEFVREFFEQNPISQLGVLGLRDGLALRVSDMSGNPTGHLTAIQAL 160

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQN L++ +G L   PS+G REVL +Y +L + DPGDI +TI      KI
Sbjct: 161 KMQDPKGLPSLQNGLEMARGALFHTPSHGTREVLFIYGSLLSSDPGDIHQTIDALISDKI 220

Query: 120 RCSVIGLSAEMFICKHLC 137
           R  ++GL+A++ IC+ L 
Sbjct: 221 RVGIVGLAAQVAICRELS 238


>gi|56786210|gb|AAW29208.1| Ssl1 [Drosophila yakuba]
 gi|56786212|gb|AAW29209.1| Ssl1 [Drosophila yakuba]
 gi|56786214|gb|AAW29210.1| Ssl1 [Drosophila yakuba]
 gi|56786216|gb|AAW29211.1| Ssl1 [Drosophila yakuba]
 gi|56786226|gb|AAW29216.1| Ssl1 [Drosophila yakuba]
 gi|56786228|gb|AAW29217.1| Ssl1 [Drosophila yakuba]
 gi|56786230|gb|AAW29218.1| Ssl1 [Drosophila yakuba]
 gi|56786232|gb|AAW29219.1| Ssl1 [Drosophila yakuba]
          Length = 166

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 63/165 (38%), Positives = 96/165 (58%), Gaps = 12/165 (7%)

Query: 59  MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
           +  +  + + SLQN LDL    L  +PS+  RE++I+  +L+TCDP DI  TI + K+  
Sbjct: 2   LANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIMGSLTTCDPVDINITIDELKKEG 61

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
           IRCSVI LSAE+ + ++L Q T G++   LD++HF++ +M    PPPA A+    +LI+M
Sbjct: 62  IRCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRM 120

Query: 179 GFPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKAR 212
           GFP    E       +S+C CH E           G+ CP+C ++
Sbjct: 121 GFPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHHCPQCNSK 165


>gi|429962932|gb|ELA42476.1| hypothetical protein VICG_00575 [Vittaforma corneae ATCC 50505]
          Length = 335

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 85/309 (27%), Positives = 150/309 (48%), Gaps = 34/309 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D+ P+   +++  +  FV +F   NPLS +  +T +            S E + K ++  
Sbjct: 59  DYVPTVRNILSNTIPRFVSQFKISNPLSILSFLTCRSVFE------KYSREFNPKIMLNT 112

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
           +G SG+ S  N L     ++    S  ++EVL++ S++ T D G   +     K+  I+ 
Sbjct: 113 VG-SGNFSFLNCLKSAIEMIRT--STYNKEVLVITSSIGTKDTGAYEQVFSDIKKYNIKV 169

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           +VI +  E+ + K + + + G + V LD  HF+ L+ +   P   +   A + L+K+GFP
Sbjct: 170 NVISICGEVTLFKKISELSNGVFKVPLDSFHFEILLHQFTEPLECLE--ATSCLVKLGFP 227

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
           Q      +  C CH + +  + Y CP CKA +C LP +C IC  QLVS  ++++SY+ ++
Sbjct: 228 QITSNSGL--CVCHLKFEHSL-YECPVCKAYICCLPCECPICSTQLVSPLNISKSYYFIY 284

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
           P+ PF       +++   + R    G     L           C +C+ H C +C+ ++H
Sbjct: 285 PLKPF-------ISNENGKCRKC--GRDAKFL-----------CEECQNHLCEKCNKFMH 324

Query: 302 ESLHNCPGC 310
           E L  C  C
Sbjct: 325 EDLGFCIFC 333


>gi|56786218|gb|AAW29212.1| Ssl1 [Drosophila santomea]
 gi|56786220|gb|AAW29213.1| Ssl1 [Drosophila santomea]
 gi|56786222|gb|AAW29214.1| Ssl1 [Drosophila santomea]
 gi|56786224|gb|AAW29215.1| Ssl1 [Drosophila santomea]
 gi|56786234|gb|AAW29220.1| Ssl1 [Drosophila santomea]
 gi|56786236|gb|AAW29221.1| Ssl1 [Drosophila santomea]
 gi|56786238|gb|AAW29222.1| Ssl1 [Drosophila santomea]
 gi|56786240|gb|AAW29223.1| Ssl1 [Drosophila santomea]
          Length = 166

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 63/165 (38%), Positives = 95/165 (57%), Gaps = 12/165 (7%)

Query: 59  MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
           +  +  + + SLQN LDL    L  +PS+  RE++I+  +L+TCDP DI  TI   K+  
Sbjct: 2   LANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIMGSLTTCDPVDINITIDDLKKEG 61

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
           IRCSVI LSAE+ + ++L Q T G++   LD++HF++ +M    PPPA A+    +LI+M
Sbjct: 62  IRCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRM 120

Query: 179 GFPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKAR 212
           GFP    E       +S+C CH E           G+ CP+C ++
Sbjct: 121 GFPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHHCPQCNSK 165


>gi|145532284|ref|XP_001451903.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124419569|emb|CAK84506.1| unnamed protein product [Paramecium tetraurelia]
          Length = 373

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 88/328 (26%), Positives = 149/328 (45%), Gaps = 43/328 (13%)

Query: 2   DFRPSRMAVVAKQV----------EAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGS- 50
           DF+PSR+AV  + +          ++F+ +F + NPLSQIG+   ++     + D   S 
Sbjct: 69  DFKPSRLAVALQFLPVKKHTIIHNQSFIEQFLEGNPLSQIGIAVAEEYKCKTILDFTSSC 128

Query: 51  --PESHIKAL--MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGD 106
              + ++  +  + + G S  + LQ AL +     S    +    +L + ++  + D  D
Sbjct: 129 VNIKQYLSTIHSINEAGFSMAACLQTALHV----FSSTQLHAQSSILFVTTSTYSDDKLD 184

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA 166
           + E  +KC+ + I+ ++I  +  +     + Q T G Y   ++E  F + I +   P  +
Sbjct: 185 LNEWSEKCQSAAIQINMISFAGAIHQLIKITQATDGQYLCPINEFQFSQEIQKFISPQES 244

Query: 167 IAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
                I  L+K+GFPQ+      ++C CH E+     Y CP C ++VC  P  C IC   
Sbjct: 245 KNHKMITQLVKIGFPQKFIVSQPTLCQCHLEIIYNF-YKCPVCYSKVCSPPLLCPICSTW 303

Query: 227 LVSSPHLARS--YHHL--FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLY 282
           +V    + RS  Y+ L  F I          +ND  +     C GC +          ++
Sbjct: 304 IVLPHQILRSDGYNTLGVFEI----------INDGLDH---ICQGCLEP------TSVIH 344

Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGC 310
             C +CK  FCL+CD+ IH     CPGC
Sbjct: 345 STCERCKNFFCLDCDVLIHSKFKVCPGC 372


>gi|440493575|gb|ELQ76030.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit SSL1 [Trachipleistophora
           hominis]
          Length = 334

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 88/313 (28%), Positives = 148/313 (47%), Gaps = 42/313 (13%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF PS    V  ++  F+++F+D+NP+S + L+  ++    C + +     S  +   G 
Sbjct: 61  DFLPSFRYHVTTKLNHFIKKFYDENPISVLSLLIYRNN--KCESYIVLDRLSETEGFFGV 118

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
            G +G+ SL N+L++    L++      +E+L++  +L T D    +E ++  K+ KI  
Sbjct: 119 AG-TGEFSLINSLNVSLEFLNRDDYI--KEILVITGSLYTID----IEELRSFKDIKIH- 170

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
             I   AE++  K + + T G Y V ++ S    L +E    P +I   +  NL+K+GFP
Sbjct: 171 -FIAFRAEVYFFKDVAEKTHGKYYVPVEMSDI-SLFLESFCMPNSINSGSAHNLLKLGFP 228

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
           Q    G +    C +  K G  Y CP C  +VC LP  C +C  QLV+S  LA+S ++ +
Sbjct: 229 Q-VLTGDLVCACCFRYSKKG--YECPVCHTKVCNLPIKCPVCKTQLVTSNILAQSLYYCY 285

Query: 242 PIAPFDEVT---PLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
           P+  F++      +C  D R+R                        C KC   +C  C  
Sbjct: 286 PLEDFEKAGGACRVCGGDGRDR------------------------CKKCGSIYCERCSY 321

Query: 299 YIHESLHNCPGCE 311
           +IH  L+ C  C+
Sbjct: 322 FIHNDLYFCIYCD 334


>gi|67528154|ref|XP_661887.1| hypothetical protein AN4283.2 [Aspergillus nidulans FGSC A4]
 gi|40739631|gb|EAA58821.1| hypothetical protein AN4283.2 [Aspergillus nidulans FGSC A4]
          Length = 277

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 53/130 (40%), Positives = 86/130 (66%), Gaps = 2/130 (1%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FVREFF+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 106 DLRPTRYLLTLRYAQEFVREFFEQNPISQVGVIGLRDGLALRISDMSGNPTEHISAIQDL 165

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
                 G  SLQN L++ +G L   PS+G RE+ I++ +L + DPGDI +TI    + KI
Sbjct: 166 RSQDPKGLPSLQNGLEMARGALFHTPSHGTREIFIVFGSLLSSDPGDIHQTITTLIDDKI 225

Query: 120 RCSVIGLSAE 129
           R  ++GL+A+
Sbjct: 226 RVGIVGLAAQ 235


>gi|323353892|gb|EGA85745.1| Ssl1p [Saccharomyces cerevisiae VL3]
          Length = 273

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 56/127 (44%), Positives = 86/127 (67%), Gaps = 2/127 (1%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R A++ +    FV EFFDQNP+SQ+G++ +++G+A  ++ + G+P+ HI AL  +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
            K    G+ SLQNAL++ +GLL  +P++  REVLI++ +LST DPGDI +TI      KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259

Query: 120 RCSVIGL 126
           R    G+
Sbjct: 260 RVKGFGV 266


>gi|429965596|gb|ELA47593.1| transcription factor ssl1, partial [Vavraia culicis 'floridensis']
          Length = 381

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 90/310 (29%), Positives = 149/310 (48%), Gaps = 36/310 (11%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF PS    V  ++  F+++F+D+NP+S + L+  ++    C   +     S  +   G 
Sbjct: 108 DFLPSFRYHVTTKLNHFIKKFYDENPISILSLLIYRNN--RCENYIVLDRMSETENFFGV 165

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
            G +G+ SL N+L++    LS+      +E+L++  +L T D     E ++  ++ K+  
Sbjct: 166 AG-TGEFSLMNSLNVSLDFLSRDDHI--KEILVVTGSLYTID----FEEMRAFRDIKVH- 217

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
             I L AE++  K++ Q T G Y V ++ S    L +E    P +I   +  NL+K+GFP
Sbjct: 218 -FIALRAEVYFFKNIAQKTHGKYYVPVEVSDI-SLFLESFCMPNSINSSSALNLLKLGFP 275

Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
           Q A  G +    C +  K G  Y CP C  +VC LP  C IC  QLV+S  L++S ++ +
Sbjct: 276 Q-ALTGDLVCACCFRCSKKG--YECPVCHTKVCNLPIKCPICKTQLVTSNILSQSLYYCY 332

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
           P+  F++   +C           C G        G+       C KC   +C  C  +IH
Sbjct: 333 PLEDFEKSNGVC---------RVCGG-------DGSD-----QCKKCMSVYCEGCCYFIH 371

Query: 302 ESLHNCPGCE 311
             L+ C  C+
Sbjct: 372 NDLYFCIYCD 381


>gi|383141961|gb|AFG52334.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141967|gb|AFG52337.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141975|gb|AFG52341.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141977|gb|AFG52342.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141987|gb|AFG52347.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
          Length = 74

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 56/75 (74%), Positives = 59/75 (78%), Gaps = 3/75 (4%)

Query: 198 VKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDP 257
           VKVG GYTCPRCKARVCELPT+C ICGL LVSSPHLARSYHHLFP+ PF+EV     ND 
Sbjct: 1   VKVGGGYTCPRCKARVCELPTECHICGLTLVSSPHLARSYHHLFPVTPFEEVLRTSSND- 59

Query: 258 RNRSRSTCFGCQQSL 272
             R   TCFGCQQ L
Sbjct: 60  --RLPRTCFGCQQFL 72


>gi|402469018|gb|EJW04086.1| hypothetical protein EDEG_01608 [Edhazardia aedis USNM 41457]
          Length = 341

 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 84/316 (26%), Positives = 147/316 (46%), Gaps = 46/316 (14%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF PS       ++E FV +F + NP+S + +    +  +  + +L    +         
Sbjct: 63  DFLPSFRISTKNKLEKFVEKFSEDNPISTLTVCVNSNKTSESMQNLNSPGD--------- 113

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
               G  SL+ AL+    +L    S   +E+L++  +LS  D   +  TI+   ++ I+ 
Sbjct: 114 ----GFFSLKKALETGYEVLKD--STNIKEMLLIIFSLSINDRNGLETTIENVVKANIKV 167

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           ++I L AE+ I K +C  TGG + V LD++H++  +M     P ++    I+ LI++GFP
Sbjct: 168 NIINLCAELKILKTVCDRTGGKFCVPLDDNHYETSLMSFL-TPTSVPNSTIS-LIQIGFP 225

Query: 182 --QRAGEGSISICSCHKEVKV--GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSY 237
             + + + +   C CH ++       Y+C +C A +C LP +C IC     SS  L ++ 
Sbjct: 226 KVEVSDKNTPFCCVCHLKIIRPPFETYSCVKCSAILCSLPVECPICDTLNASSIGLCKNI 285

Query: 238 HHLFPIAPFDEVTPLCL--NDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLE 295
           HH + +  F      C+  ND          GC Q              C KC   FC +
Sbjct: 286 HHFYVLEDFVPSDGFCVVCNDK---------GCMQ--------------CQKCLNVFCRD 322

Query: 296 CDIYIHESLHNCPGCE 311
           C+ ++H++++ C  CE
Sbjct: 323 CNSFLHDNINFCFYCE 338


>gi|1737212|gb|AAC52046.1| basic transcription factor 2 p44, partial [Homo sapiens]
          Length = 143

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 57/108 (52%), Positives = 75/108 (69%), Gaps = 10/108 (9%)

Query: 203 GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSR 262
           GY CP+C+A+ CELP +C+ICGL LVS+PHLARSYHHLFP+  F E+ PL   +  N  R
Sbjct: 36  GYFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEI-PL---EEYNGER 91

Query: 263 STCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
             C+GCQ  L        +YV C  C+  FC++CD+++H+SLH CPGC
Sbjct: 92  -FCYGCQGEL----KDQHVYV-CAVCQNVFCVDCDVFVHDSLHCCPGC 133


>gi|383141959|gb|AFG52333.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141963|gb|AFG52335.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141965|gb|AFG52336.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141969|gb|AFG52338.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141971|gb|AFG52339.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141973|gb|AFG52340.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141979|gb|AFG52343.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141981|gb|AFG52344.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141983|gb|AFG52345.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141985|gb|AFG52346.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141989|gb|AFG52348.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
 gi|383141991|gb|AFG52349.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
          Length = 74

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 55/75 (73%), Positives = 58/75 (77%), Gaps = 3/75 (4%)

Query: 198 VKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDP 257
           VKVG GYTCPRCKAR CELPT+C ICGL LVSSPHLARSYHHLFP+ PF+EV     ND 
Sbjct: 1   VKVGGGYTCPRCKARACELPTECHICGLTLVSSPHLARSYHHLFPVTPFEEVLRTSSND- 59

Query: 258 RNRSRSTCFGCQQSL 272
             R   TCFGCQQ L
Sbjct: 60  --RLPRTCFGCQQFL 72


>gi|426384438|ref|XP_004058775.1| PREDICTED: general transcription factor IIH subunit 2-like [Gorilla
           gorilla gorilla]
          Length = 260

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 53/113 (46%), Positives = 75/113 (66%), Gaps = 2/113 (1%)

Query: 50  SPESHIKALMG--KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDI 107
           +P  HI +L     + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I
Sbjct: 96  NPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNI 155

Query: 108 METIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEH 160
            + I+  K +KIR SVIGLSAE+ +C  L ++TGG Y V LDESH+KEL+  H
Sbjct: 156 YDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGIYHVILDESHYKELLTHH 208


>gi|361068769|gb|AEW08696.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
          Length = 74

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 54/75 (72%), Positives = 58/75 (77%), Gaps = 3/75 (4%)

Query: 198 VKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDP 257
           VKVG GYTCPRCKA VCELPT+C ICGL LVSSPHLARSYHHLFP+ PF++V     ND 
Sbjct: 1   VKVGGGYTCPRCKAHVCELPTECHICGLTLVSSPHLARSYHHLFPVTPFEKVLRTSSND- 59

Query: 258 RNRSRSTCFGCQQSL 272
             R   TCFGCQQ L
Sbjct: 60  --RLPRTCFGCQQFL 72


>gi|147803583|emb|CAN77666.1| hypothetical protein VITISV_035971 [Vitis vinifera]
          Length = 328

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 51/68 (75%), Positives = 60/68 (88%)

Query: 44  LTDLGGSPESHIKALMGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCD 103
           + DLGGS +SH+KALMGKL CSGDSSLQNAL+LV+G L+ IPSYG+REVLILYSALS CD
Sbjct: 114 MQDLGGSIDSHVKALMGKLECSGDSSLQNALNLVRGYLNLIPSYGYREVLILYSALSACD 173

Query: 104 PGDIMETI 111
            GDI+ T+
Sbjct: 174 TGDIIPTL 181


>gi|294942110|ref|XP_002783381.1| TFIIH basal transcription factor complex p44 subunit, putative
           [Perkinsus marinus ATCC 50983]
 gi|239895836|gb|EER15177.1| TFIIH basal transcription factor complex p44 subunit, putative
           [Perkinsus marinus ATCC 50983]
          Length = 188

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 65/195 (33%), Positives = 103/195 (52%), Gaps = 20/195 (10%)

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
           SVI LS E++  + + + TGG++SVA    HFK L+ +H  PP  ++     + IKMGFP
Sbjct: 8   SVISLSPELYALRKVVEMTGGTFSVATSPIHFKRLMQKHLTPPNWVSS---PSYIKMGFP 64

Query: 182 -QRAGEGSISI-----CSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
            +RA +G+ +      C CH  ++    Y CP+C + VCE+P +C +C L LV    L +
Sbjct: 65  VRRACDGNHTADPPIKCMCHNRLQKTFVYICPQCHSPVCEIPVNCPVCRLPLVDDDALKK 124

Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLE 295
            + H++ + P   + P  ++ P++    TC  C       G +      C +C    C E
Sbjct: 125 HHRHIYSM-PTYTLLPT-VDYPKSY---TCQFCGTDFTEGGAR------CDQCLSDVCYE 173

Query: 296 CDIYIHESLHNCPGC 310
           CD++ H  L +CPGC
Sbjct: 174 CDMFAHNKLRHCPGC 188


>gi|194383680|dbj|BAG59198.1| unnamed protein product [Homo sapiens]
          Length = 203

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 52/127 (40%), Positives = 79/127 (62%), Gaps = 4/127 (3%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
             + C G+ SL N+L +    L  +P +  REVLI++S+L+TCDP +I + I+   + KI
Sbjct: 135 VDMTCHGEPSLYNSLSIAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIK--VDQKI 192

Query: 120 RCSVIGL 126
           +  VI +
Sbjct: 193 KTKVITV 199


>gi|387592763|gb|EIJ87787.1| hypothetical protein NEQG_01859 [Nematocida parisii ERTm3]
 gi|387595388|gb|EIJ93012.1| hypothetical protein NEPG_01967 [Nematocida parisii ERTm1]
          Length = 366

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 89/319 (27%), Positives = 145/319 (45%), Gaps = 30/319 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF PSR   V K V  F REF + NPLS +GL  V  G++  ++ +    E  I+    +
Sbjct: 59  DFLPSRKYYVKKGVMKFFREFIESNPLSTVGLAVVCGGMSYLVSSI-LCEEEEIEMCFSQ 117

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTC--DPGDIMETIQKCKESKI 119
              +G+ SL +A + +            +EV+ + S  +     P  +  T+       I
Sbjct: 118 NEGAGNFSLGSAFETINEFFHGCSFL--KEVVFIISGFTFVGSSPFILKSTLIN---KGI 172

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
               I ++ EM I + L +++GG + +         L+     PPP  A  A  +++K+G
Sbjct: 173 IMHTIHMAGEMEILRKLSEESGGVFGIVNCPEDLSTLLGLICIPPPHSAS-ARLSMLKIG 231

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP    E   +IC+CH ++K   GY CP C  +VC++P  C +C   L +  HL ++ H 
Sbjct: 232 FPSPIHEN--TICACHLDLK-EWGYECPFCTTKVCKVPGVCPVCENILSAPVHLLKALH- 287

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
                 + +  P+ +       + TC GC +      N   L+ +CP CK   CL+C  +
Sbjct: 288 ------WSDSAPIYV----AAIKGTCRGCFE------NNIQLH-SCPGCKNSLCLDCTRF 330

Query: 300 IHESLHNCPGCESLRHSNP 318
           I + L+ C  C     S P
Sbjct: 331 IRQELNFCIFCPGAVSSVP 349


>gi|378754973|gb|EHY65001.1| hypothetical protein NERG_02057 [Nematocida sp. 1 ERTm2]
          Length = 357

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 90/321 (28%), Positives = 141/321 (43%), Gaps = 50/321 (15%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DF PSR   V K V  F REF + NPLS +GL  V  G++  ++ +    E  I+    +
Sbjct: 59  DFLPSRKYYVKKGVIKFFREFTESNPLSTVGLAVVCGGMSYLVSSI-LCEEDEIEMCFAQ 117

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSAL------------STCDPGDIME 109
               G  SL +A + V+    Q  S+  +EV+ + S              S  + G IM 
Sbjct: 118 NEGQGGFSLGSAFETVKEFF-QGCSF-LKEVIFIVSGFTFVGRSPFILKNSLINKGIIMH 175

Query: 110 TIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAE 169
           TI              ++ EM I + + ++ GG + +         L+     PPP  A 
Sbjct: 176 TIH-------------MAGEMEILRKISEECGGIFGIVNCPEDLSTLLGLICIPPPHSAS 222

Query: 170 FAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
            A  +++K+GFP    E   +IC+CH E+K   GY CP C  +VC++P  C IC   L +
Sbjct: 223 -ARLSMLKIGFPSSIQEN--TICACHLELK-EWGYECPFCTTKVCKVPGVCPICENILSA 278

Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCK 289
             HL ++ H       + +  P+   + + R    C GC +  +           CP CK
Sbjct: 279 PVHLLKALH-------WSDSAPIYAPEGKGR----CRGCSEENIQLN-------GCPGCK 320

Query: 290 KHFCLECDIYIHESLHNCPGC 310
              C++C  +I + L+ C  C
Sbjct: 321 SRLCMDCTGFIRQELNFCIFC 341


>gi|402592150|gb|EJW86079.1| hypothetical protein WUBG_03012, partial [Wuchereria bancrofti]
          Length = 220

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 58/141 (41%), Positives = 83/141 (58%), Gaps = 2/141 (1%)

Query: 3   FRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MG 60
            +PSR+AV  K +  F+ +F +QNP+SQ+G+V  KD  A  L  L G+     ++L  + 
Sbjct: 80  MQPSRLAVTIKALNQFLDKFSEQNPISQVGVVVCKDKRAERLIPLTGNVRLVKESLSTIT 139

Query: 61  KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
           +L C G+ SL N+L      L   P +  REV+++ ++LSTCDP  I  T +  K   IR
Sbjct: 140 ELLCHGEFSLHNSLMATIRSLHSYPGHASREVILIVASLSTCDPSSIFGTFELLKRYHIR 199

Query: 121 CSVIGLSAEMFICKHLCQDTG 141
           CSVI LSAE+F+ K LC  T 
Sbjct: 200 CSVISLSAEIFVFKKLCSITS 220


>gi|70917739|ref|XP_732957.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56504309|emb|CAH83594.1| hypothetical protein PC300587.00.0 [Plasmodium chabaudi chabaudi]
          Length = 161

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 46/145 (31%), Positives = 86/145 (59%), Gaps = 2/145 (1%)

Query: 4   RPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG--K 61
           +P+R+ V+ + VE F++ FF +NP+  +G+V +K+  A  +  L  + +  + +L+   K
Sbjct: 1   KPNRITVIFECVEIFLKNFFFKNPVGHVGVVALKNSSAKLIQQLTSNIDDVLNSLVKEQK 60

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
            G  G  SLQ  L++   LL  +P YG +E+LI+Y ++ TCD  +I++ +    ++ +  
Sbjct: 61  EGLQGSPSLQEGLEIAHNLLMDMPLYGTKEILIMYGSIRTCDKKNILKYLDLLIKNNMYV 120

Query: 122 SVIGLSAEMFICKHLCQDTGGSYSV 146
           + I ++ EM I KH+C+ T G Y +
Sbjct: 121 NCISIAPEMHILKHICEKTHGIYKI 145


>gi|85000913|ref|XP_955175.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65303321|emb|CAI75699.1| hypothetical protein, conserved [Theileria annulata]
          Length = 319

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 62/220 (28%), Positives = 113/220 (51%), Gaps = 33/220 (15%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           MDF+P R+      ++ F+   +   P++Q+G++ +++ + N +T  G +P+  ++ L  
Sbjct: 108 MDFKPDRLYCAFNSLKEFMNVLYGSGPITQVGIIVMRNKICNVITQFGTNPDEQMELLSN 167

Query: 61  KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
            L  G  G SSLQN L++   ++ ++P Y  RE+LI++ +  T DPG+I+ T+ K K++ 
Sbjct: 168 ILKDGPEGSSSLQNGLEMCLKIMCELPYYMTREILIIFGSNKTLDPGNILITLDKLKQNF 227

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
           I  + I LS E++I K       G + V+                PP I+  A  +LI  
Sbjct: 228 ITVNCISLSPELYILK-----GCGIFMVS----------------PPDISR-AFHHLI-- 263

Query: 179 GFPQRAG---EGSISICSCHKEVKVGVGYTCPRCKARVCE 215
             P ++    E +I+   C+  + + +GY C  C+   CE
Sbjct: 264 --PPKSFHKIERNINCSGCN--LNIEIGYECQNCQGIFCE 299



 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 17/88 (19%)

Query: 223 CGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLY 282
           CG+ +VS P ++R++HHL P   F ++        RN +   C GC  ++        + 
Sbjct: 245 CGIFMVSPPDISRAFHHLIPPKSFHKIE-------RNIN---CSGCNLNI-------EIG 287

Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGC 310
             C  C+  FC  CD YIH+ LH CP C
Sbjct: 288 YECQNCQGIFCEYCDKYIHQDLHQCPIC 315


>gi|269859684|ref|XP_002649566.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
 gi|220066929|gb|EED44398.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
          Length = 346

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/311 (25%), Positives = 148/311 (47%), Gaps = 28/311 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           ++ P+  + +  ++  F   F   NPLS+I   T    V N             K L+  
Sbjct: 60  EYLPTIRSFILPELSDFSDNFVKLNPLSKITFTTFNYKVLNTAFKFD-------KNLLSS 112

Query: 62  LGCSGDSSLQNALDLVQGLL-SQIPSYGHREVLILYSALSTCDPG-DIMETIQKCKESKI 119
            G +G  S  + L  + G+  ++I S   +E L++ S++   +   +  + +++  + KI
Sbjct: 113 YG-TGHFSFTSCLSYINGITDTKITS---KECLLIISSVGVINEEINFDKVMKEIIQKKI 168

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           + ++I +  E+ + K +C  TGG Y V ++   FK ++ +   PP ++      +LI++G
Sbjct: 169 KINIISICGEVTLLKKVCISTGGKYVVPINSFQFKSILSDFLYPPESLD--LKCSLIEVG 226

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP+      +  C CH  +   V + CPRCK  VC +P+ C IC L+L+SS  +     +
Sbjct: 227 FPKNIYTQHL--CVCHLNL-CSVLFECPRCKGLVCTIPSTCPICKLELISSIDICDLICY 283

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
            + + PF ++      +  +  ++ C+GC++  + S         C KC   FC  CD  
Sbjct: 284 NYYLEPFIKIAT--SKELSSLEKNKCYGCEKLEVIS--------VCNKCLSPFCYNCDAK 333

Query: 300 IHESLHNCPGC 310
           +H  ++ CP C
Sbjct: 334 LHNVINFCPFC 344


>gi|407425981|gb|EKF39551.1| DNA repair and transcription factor protein [Trypanosoma cruzi
           marinkellei]
          Length = 356

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 83/346 (23%), Positives = 138/346 (39%), Gaps = 50/346 (14%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D+ P+R+  +  Q+  FV  + D NPL+ +G+VT++DGVA+ L     +    I+ L  K
Sbjct: 19  DYLPTRLLALRPQLGRFVHAYLDANPLASLGVVTMRDGVAHRLNSCTTNASDIIQTLEVK 78

Query: 62  ---LGCSGDSSLQNALDLVQGLLSQI-----------PSYGHRE----------VLILYS 97
               G SG  SL+N L      L  +           P  G +E          +L++ S
Sbjct: 79  YFLFGGSGAMSLENGLRFALSELVDMKRIAKRVRRTEPRAGGKEAREEPTARLRILLVSS 138

Query: 98  ALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELI 157
           +++  DP D+   I+   + +IR  V+     + + +     TGG+    ++  H  E++
Sbjct: 139 SVTIIDPHDVFGVIKMLAKLRIRVDVVSFCGAVHVFEEAAVSTGGTLYTPMNYDHLTEVL 198

Query: 158 MEHAPPPPA-IAEFAIANLIKMGFP-------QRAGEGSISICSCHKEVKVGVGYT-CPR 208
              A P    I       +I++GFP          GEG                Y  CP+
Sbjct: 199 GRLASPDKRHICPEERPTMIRIGFPVYIECEDSEGGEGG------------STSYVACPQ 246

Query: 209 CKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRS----T 264
           C      +P+ C +C L L S+P +  ++     + P  E            +       
Sbjct: 247 CALMQTSIPSTCPLCKLLLCSAPLIHNTFIANNELCPVSENVLDAFEGAEKPTEDYLNER 306

Query: 265 CFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
           C  CQ  ++    K  L   C  C    C+ C+ Y+ E +  CP C
Sbjct: 307 CSLCQYGMMGEDGK-ALVWRCNSCHCLRCVSCEKYVRERIGLCPTC 351


>gi|426384446|ref|XP_004058778.1| PREDICTED: general transcription factor IIH subunit 2-like, partial
           [Gorilla gorilla gorilla]
          Length = 119

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 62/129 (48%), Positives = 82/129 (63%), Gaps = 15/129 (11%)

Query: 144 YSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVG 203
           Y V LDESH+KEL+  H  PPPA +     +LI+MG   +  +G+         + +G G
Sbjct: 2   YHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG---KHLDGNTE-----PGLTLG-G 51

Query: 204 YTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRS 263
           Y CP+C+A+ CELP +C+ICGL LVS+PHLARSYHHLFP+  F E+ PL   +  N  R 
Sbjct: 52  YFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEI-PL---EEYNGER- 106

Query: 264 TCFGCQQSL 272
            C+GCQ  L
Sbjct: 107 FCYGCQGEL 115


>gi|269860338|ref|XP_002649891.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
 gi|220066731|gb|EED44204.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
          Length = 346

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 77/311 (24%), Positives = 148/311 (47%), Gaps = 28/311 (9%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           ++ P+  + +  ++  F   F   NPLS+I        V N             K L+  
Sbjct: 60  EYLPTIRSFILPELSDFSDNFVKLNPLSKITFTIFNYKVLNTAFKFD-------KNLLSS 112

Query: 62  LGCSGDSSLQNALDLVQGLL-SQIPSYGHREVLILYSALSTCDPG-DIMETIQKCKESKI 119
            G +G  S  + L  + G+  ++I S   +E L++ S++   +   +  + +++  + KI
Sbjct: 113 YG-TGHFSFTSCLSYINGITDTKITS---KECLLIISSVGVINKEINFDKVMKEIIQKKI 168

Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
           + ++I +  E+ + K +C  TGG Y V ++   FK ++ +   PP ++      +LI++G
Sbjct: 169 KINIISICGEVTLLKKVCISTGGKYVVPINSFQFKSILSDFLYPPESLD--LKCSLIEVG 226

Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
           FP+      +  C+CH  +   V + CP+CK  VC +P+ C IC L+L+SS  +     +
Sbjct: 227 FPKNIYTQHL--CACHLNL-CSVLFECPKCKGLVCTIPSACPICKLELISSIDICDLICY 283

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
            + + PF ++      +  +  ++ C+GC++  + S         C KC   FC  CD  
Sbjct: 284 NYHLEPFIKIAT--SKELSSLEKNKCYGCEKLEVIS--------VCNKCLSPFCYNCDAK 333

Query: 300 IHESLHNCPGC 310
           +H  ++ CP C
Sbjct: 334 LHNVINFCPFC 344


>gi|71664710|ref|XP_819333.1| DNA repair and transcription factor protein [Trypanosoma cruzi
           strain CL Brener]
 gi|70884629|gb|EAN97482.1| DNA repair and transcription factor protein, putative [Trypanosoma
           cruzi]
          Length = 356

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 85/350 (24%), Positives = 149/350 (42%), Gaps = 58/350 (16%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D+ P+R+  +  Q+  FV  + D NPL+ +G+VT++DGVA+ L     +    I+ L  K
Sbjct: 19  DYLPTRLLALRPQLGRFVHAYLDANPLASLGVVTMRDGVAHRLNSCTTNASDIIQTLEVK 78

Query: 62  ---LGCSGDSSLQNALDLVQGLLSQI-----------PSYGHRE----------VLILYS 97
               G SG  SL+N L      L  +           P  G +E          +L++ S
Sbjct: 79  YFLFGGSGAMSLENGLRFALSELVDMKRIAKRVRRTEPHAGGKEAREEPTARLRILLVSS 138

Query: 98  ALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELI 157
           +++  DP D+   I+   + ++R  VI     + +       TGG+    ++  H  E++
Sbjct: 139 SVTVIDPHDVFGVIKMMAKLRVRVDVISFCGAVHVFGEAAVSTGGTLYTPMNYDHLTEVL 198

Query: 158 -MEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVG-----YTCPRCKA 211
            +  +P    + +     +I++GFP           +C KE + G G       CP+C  
Sbjct: 199 GLLASPDKHHVCQEERPAMIRIGFP--------VYINC-KEPEGGEGGSTNYVACPQCAL 249

Query: 212 RVCELPTDCRICGLQLVSSPHLARSY---HHLFPIAP--------FDEVTPLCLNDPRNR 260
               +P+ C +C L L S+P +  ++   + L P++          D+ T   LN+    
Sbjct: 250 VQTSIPSTCPLCKLLLCSAPLIHNTFIAHNELCPVSENVVDAFEGTDKPTEDYLNE---- 305

Query: 261 SRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
               C  CQ  ++    +  L   C  C    C+ C+ Y+ E +  CP C
Sbjct: 306 ---RCSLCQYGMMGEDGR-ALVWRCNSCHCLRCVSCEKYVRERIGLCPTC 351


>gi|222871523|gb|EEF08654.1| predicted protein [Populus trichocarpa]
          Length = 104

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 42/77 (54%), Positives = 59/77 (76%)

Query: 84  IPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS 143
           +P +  REVLI++S+L+TCDP +I + I+  K +KIR SVIGLSAE+ +C  L ++TGG+
Sbjct: 1   MPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGT 60

Query: 144 YSVALDESHFKELIMEH 160
           Y V LDESH+KEL+  H
Sbjct: 61  YHVILDESHYKELLTHH 77


>gi|407843613|gb|EKG01508.1| DNA repair and transcription factor protein [Trypanosoma cruzi]
          Length = 397

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 86/350 (24%), Positives = 146/350 (41%), Gaps = 58/350 (16%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D+ P+R+  +  Q+  FV  + D NPL+ +G+VT++DGVA+ L     +    I+ L  K
Sbjct: 60  DYLPTRLLALRPQLGRFVHAYLDANPLASLGVVTMRDGVAHRLNSCTTNASDIIQTLEVK 119

Query: 62  ---LGCSGDSSLQNALDLVQGLLSQI-----------PSYGHRE----------VLILYS 97
               G SG  SL+N L      L  +           P  G +E          +L++ S
Sbjct: 120 YFLFGGSGAMSLENGLRFALSELVDMKRIAKRVRRTEPRAGGKEAREEPTARLRILLVSS 179

Query: 98  ALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELI 157
           +++  DP D+   I+   +  +R  VI     + +       TGG+    ++  H  E++
Sbjct: 180 SVTVIDPHDVFGVIKMMAKLHVRVDVISFCGAVHVFGEAAVSTGGTLYTPMNYDHLTEVL 239

Query: 158 MEHAPPPPA-IAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVG-----YTCPRCKA 211
              A P    + +     +I++GFP            C KE + G G       CP+C  
Sbjct: 240 GRLASPDKRHVCQEERPAMIRIGFP--------VYIDC-KEPEGGEGGSTNYVACPQCAL 290

Query: 212 RVCELPTDCRICGLQLVSSPHLARSY---HHLFPIAP--------FDEVTPLCLNDPRNR 260
               +P+ C +C L L S+P +  ++   + L P++          D+ T   LN+    
Sbjct: 291 VQTSIPSTCPLCKLLLCSAPLIHNTFIAHNELCPVSENVVDAFEGADKPTEDYLNE---- 346

Query: 261 SRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
               C  CQ  ++    +  L   C  C    C+ C+ Y+ E +  CP C
Sbjct: 347 ---RCSLCQYGMMGEDGR-ALVWRCNSCHCLRCVSCEKYVRERIGLCPTC 392


>gi|342182931|emb|CCC92411.1| putative DNA repair and transcription factor protein [Trypanosoma
           congolense IL3000]
          Length = 354

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 80/343 (23%), Positives = 135/343 (39%), Gaps = 45/343 (13%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D+ P+R+  +  Q+ AFV  F D NPL+ +G+V ++DGVA+ L     +      AL  K
Sbjct: 19  DYLPTRLLALRPQLNAFVHAFLDDNPLASLGVVVMRDGVAHRLISCTANATDIAHALELK 78

Query: 62  ---LGCSGDSSLQNALDLVQGLLSQIPSYGHR-----------------------EVLIL 95
               G SG  SL+N L +    L  +     R                        V+++
Sbjct: 79  YFLFGGSGAMSLENGLRMALSELVDLKRIAKRVRREAEMPGPNEEGLRPDHATQLRVILV 138

Query: 96  YSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKE 155
            S+++  DP D+    +   + ++R  V+     +   +     TGG     ++  H   
Sbjct: 139 ASSVTLVDPHDVFRVQKVVAQLRVRVDVVSFCGAVHALQEAALTTGGQLYTPMNYEHLTG 198

Query: 156 LIMEHAPPPPAIAEFAIAN--LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARV 213
           ++ + A P             +I++GFP+              E K      CP+C    
Sbjct: 199 ILQKLAAPERGSTHPLTERPAMIQIGFPRYV----------EAEGKEKRYMGCPQCGLIQ 248

Query: 214 CELPTDCRICGLQLVSSPHLARSY-HHLFPIAPFDEVTPLCLNDPRNRSRST-----CFG 267
             +P+ C +C L L S+P +  S+     PIAP  +V       P   S           
Sbjct: 249 TSVPSTCPLCKLLLCSAPLIHTSFITKNEPIAPSQKVESK-QKAPSGESDEAPSVHCSLC 307

Query: 268 CQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
           C++  L  G+   ++  C  C +  C  C+ Y+ E +  CP C
Sbjct: 308 CRRKPLGVGDGGMVFWRCSCCLRERCDVCEAYVKEGIGLCPTC 350


>gi|56753383|gb|AAW24895.1| SJCHGC07056 protein [Schistosoma japonicum]
          Length = 150

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 52/134 (38%), Positives = 72/134 (53%), Gaps = 28/134 (20%)

Query: 204 YTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDP----RN 259
           Y CPRC A  CELP +C +CGL LV++PHLAR+YHHLFP+  F+ V     ++      +
Sbjct: 19  YACPRCHAAYCELPVECNVCGLTLVAAPHLARAYHHLFPLDLFEPVVDNNSSEKSLVNED 78

Query: 260 RSRST--------------------CFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
           +S+ T                    C GC  +++     PG    CPKC   FC  CD  
Sbjct: 79  KSKETINSSSDDNNNNNNNNNQLRVCTGC--NVIIPSRIPGY--QCPKCTFIFCHSCDAI 134

Query: 300 IHESLHNCPGCESL 313
           +H+S+H+CPGC +L
Sbjct: 135 LHDSIHSCPGCLTL 148


>gi|72393367|ref|XP_847484.1| DNA repair and transcription factor protein [Trypanosoma brucei
           TREU927]
 gi|62359582|gb|AAX80015.1| DNA repair and transcription factor protein, putative [Trypanosoma
           brucei]
 gi|70803514|gb|AAZ13418.1| DNA repair and transcription factor protein, putative [Trypanosoma
           brucei brucei strain 927/4 GUTat10.1]
          Length = 351

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 83/342 (24%), Positives = 138/342 (40%), Gaps = 44/342 (12%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           +D+ P+R+  +  Q+  FV  + D NPL+ +G+V ++DGVA  L     +    ++ L  
Sbjct: 18  LDYLPTRLIALRPQLNKFVHTYLDANPLASLGVVVMRDGVAQRLIPCTTNATDIVQTLEL 77

Query: 61  K---LGCSGDSSLQNALDLVQGLLSQIPSYGHR---------------------EVLILY 96
           K    G SG  SL+N L +    L  +     R                      +++  
Sbjct: 78  KYFRFGGSGAMSLENGLRMALSELVDLRRIAKRLRENNSGSTGSDVQEDVTSRLRIVLAS 137

Query: 97  SALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKEL 156
           S+++  DP D+    +   + ++R  VI     + + +     TGGS    ++  H  E+
Sbjct: 138 SSVTIVDPLDVFAVQKIVAQLRVRVDVISFCGAVHVLQEAAVLTGGSLHTPMNYEHLTEI 197

Query: 157 IMEHAPPPPAIAEFAIAN--LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
           +   A P    A        LI++GFP   G+          E +     TCP+C     
Sbjct: 198 LRHLASPEGGSARHVDVQPALIQIGFPMYVGD----------EAEGKRYLTCPQCGLIQT 247

Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPI-APFDEVTPLCLNDPRNRSRST-----CFGC 268
            +P+ C +C L L S P +  ++     + AP  +V  +        +        C  C
Sbjct: 248 SIPSTCSLCKLLLFSVPLIHVTFITRNELCAPSQKVEHVTKKKTTAAAGGAADTARCSLC 307

Query: 269 QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
           Q+ L  +G   G Y  C  C    C  C+ YI E+L  CP C
Sbjct: 308 QRKL--AGEDGGGYWCCGCCLCERCEACETYIRENLGLCPTC 347


>gi|340055688|emb|CCC50009.1| putative DNA repair and transcription factor protein, fragment
           [Trypanosoma vivax Y486]
          Length = 355

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 64/262 (24%), Positives = 110/262 (41%), Gaps = 39/262 (14%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D+ P+R+  +  Q+ AFV  +FD+NPL+ +G+V ++DGVA+ +     +    ++AL  K
Sbjct: 19  DYLPTRLLALRPQLHAFVHTYFDENPLAALGVVVMRDGVAHRICSCTTNATDILQALEVK 78

Query: 62  ---LGCSGDSSLQNALDLVQGLLSQIPSYGHR---------------------------E 91
               G SG  SL+N L L    L ++     R                            
Sbjct: 79  YFLFGGSGAMSLENGLRLALSELVELKRIAKRLRQDGVDTGNGSNGSRGPTLEQPSARLR 138

Query: 92  VLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDES 151
           ++++ S+++  DP D+    +     ++R  VI  S  +   +     TGG     L   
Sbjct: 139 IVVISSSVTLVDPHDVFAVQRLVARLRVRVDVISFSGAVHALEEAAAVTGGMLHTPLGYD 198

Query: 152 HFKELIMEHAPPPPAIA--EFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRC 209
           H   ++   A P    A  E   + +I++GFP       + + S   EV+      CP+C
Sbjct: 199 HLTTILRRLAAPDKMAARREGEKSPMIRIGFP-------LHMESAEAEVERRRFIACPQC 251

Query: 210 KARVCELPTDCRICGLQLVSSP 231
                  P+ C +C L + S P
Sbjct: 252 GLVQTATPSTCHLCKLLICSVP 273


>gi|261330741|emb|CBH13726.1| DNA repair and transcription factor protein,putative [Trypanosoma
           brucei gambiense DAL972]
          Length = 351

 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 62/257 (24%), Positives = 106/257 (41%), Gaps = 36/257 (14%)

Query: 1   MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           +D+ P+R+  +  Q+  FV  + D NPL+ +G+V ++DGVA  L     +    ++ L  
Sbjct: 18  LDYLPTRLIALRPQLNKFVHTYLDANPLASLGVVVMRDGVAQRLIPCTTNATDIVQTLEL 77

Query: 61  K---LGCSGDSSLQNALDLVQGLLSQIPSYGHR---------------------EVLILY 96
           K    G SG  SL+N L +    L  +     R                      +++  
Sbjct: 78  KYFRFGGSGAMSLENGLRMALSELVDLRRIAKRLRENNSGSTGSDVQEDVTSRLRIVLAS 137

Query: 97  SALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKEL 156
           S+++  DP D+    +   + ++R  VI     + + +     TGGS    ++  H  E+
Sbjct: 138 SSVTIVDPLDVFAVQKIVAQLRVRVDVISFCGAVHVLQEAAVLTGGSLHTPMNYEHLTEI 197

Query: 157 IMEHAPPPPAIAEFAIAN--LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
           +   A P    A        LI++GFP   G+          E +     TCP+C     
Sbjct: 198 LRHLASPEGGSARHVGVQPALIQIGFPMYVGD----------EAEGKRYLTCPQCGLIQT 247

Query: 215 ELPTDCRICGLQLVSSP 231
            +P+ C +C L L S P
Sbjct: 248 SIPSTCPLCKLLLFSVP 264


>gi|95007103|emb|CAJ20323.1| ssl1-like protein [Toxoplasma gondii RH]
          Length = 647

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 91/339 (26%), Positives = 127/339 (37%), Gaps = 56/339 (16%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           DFRP R+  V +  E F+  F  QNPL+Q+  V +    A  L   G +           
Sbjct: 291 DFRPDRLRCVCQLAEEFIGTFLLQNPLAQLAQVALIGPSAEPLAARGEAARDSAAVSARP 350

Query: 62  LGC---------------------------SGDSSLQNALDLVQGLLSQIPSYGHREVLI 94
            G                            +G  SL N L L + LL+ +P Y  REVL+
Sbjct: 351 SGSASMQLFSSSASESIAALREKRKHPGVGTGVPSLNNGLHLAKDLLAGVPPYCTREVLV 410

Query: 95  LYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFK 154
           L+ +L TCD G I ETI   K+S I C+VI L+AE+ + K      G   S     S   
Sbjct: 411 LFGSLRTCDVGCIEETIAAVKKSNICCNVICLAAELHVLKFSESLPGNGRSTRCASSPRA 470

Query: 155 ELIMEHAPPPPAI---AEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKA 211
                HA   PA       A+++   + F +    G     S      +      P  + 
Sbjct: 471 SARSSHATHSPASLVCQHAALSHPHGISFAEVDEHGGPLQLSPTPHFLL---LRLPPVRR 527

Query: 212 RVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQS 271
                P    +    L+S+P LA  +     + P D             ++  C  C   
Sbjct: 528 EALHHPQPLSM----LLSAPRLAGRH---LAVVPADA----------PEAKRLCACCTTQ 570

Query: 272 LLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
           L   G +      CP C + FC +CDIY HE L  C  C
Sbjct: 571 LDRGGAQ------CPDCGEIFCHDCDIYSHEQLRQCAFC 603


>gi|115432926|ref|XP_001216600.1| hypothetical protein ATEG_07979 [Aspergillus terreus NIH2624]
 gi|114189452|gb|EAU31152.1| hypothetical protein ATEG_07979 [Aspergillus terreus NIH2624]
          Length = 202

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 59/92 (64%), Gaps = 2/92 (2%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
           D RP+R  +  +  + FVREFF+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +
Sbjct: 108 DLRPTRYLLTLRYAQEFVREFFEQNPISQLGVLGLRDGLAVRISDMSGNPTEHISAIQAL 167

Query: 60  GKLGCSGDSSLQNALDLVQGLLSQIPSYGHRE 91
                 G  SLQN L++ +G L  + +  HRE
Sbjct: 168 RDQDPKGLPSLQNGLEMARGALLYVLTLNHRE 199


>gi|147832400|emb|CAN64423.1| hypothetical protein VITISV_032275 [Vitis vinifera]
          Length = 578

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 32/41 (78%), Positives = 38/41 (92%)

Query: 20  REFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
           +EFFDQ+P SQIGLVT+KDG+A CLTDLGGSP+SH+KALM 
Sbjct: 199 KEFFDQDPFSQIGLVTIKDGLAQCLTDLGGSPDSHVKALMA 239


>gi|324562745|gb|ADY49854.1| General transcription factor IIH subunit 2, partial [Ascaris suum]
          Length = 85

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 47/81 (58%), Gaps = 3/81 (3%)

Query: 178 MGFPQRAGEGSISICSCHKEVKV---GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
           MGFP        + C CH+       G G+ CP+C AR C LP +CR+C L L+S+P LA
Sbjct: 1   MGFPSHQIIQKPAFCLCHQSENRPPGGRGFLCPQCGARYCSLPVECRVCKLMLISAPQLA 60

Query: 235 RSYHHLFPIAPFDEVTPLCLN 255
           RS+HHL P+  F EV   C+ 
Sbjct: 61  RSFHHLLPLPAFKEVIDYCVK 81


>gi|123446514|ref|XP_001312007.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121893838|gb|EAX99077.1| hypothetical protein TVAG_290240 [Trichomonas vaginalis G3]
          Length = 166

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 43/137 (31%), Positives = 77/137 (56%), Gaps = 13/137 (9%)

Query: 8   MAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKLGCS 65
           M ++  Q   F+ +FF QNPLSQ+ ++   +     LT L  + + H+K +  +      
Sbjct: 1   MKLIQDQTIDFINDFFVQNPLSQLSILATYESTCRILTPLSCNVQDHVKKIKSLSLEDHG 60

Query: 66  GDSSLQNALDLVQGLL---------SQIPSYGHREVLILYSALSTCDPGDIMETIQKCKE 116
           G+ SL+++L +   +L         +QI +   +EVLI+Y +L+TCD   I +T+   ++
Sbjct: 61  GEPSLESSLSIATAILGNGEKNPGLAQIST--TKEVLIVYGSLTTCDNSPIYKTLNLVRD 118

Query: 117 SKIRCSVIGLSAEMFIC 133
           SKI+ S+IGL A++F+ 
Sbjct: 119 SKIKVSIIGLGAKVFVL 135


>gi|154318002|ref|XP_001558320.1| hypothetical protein BC1G_02984 [Botryotinia fuckeliana B05.10]
          Length = 211

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 36/89 (40%), Positives = 57/89 (64%), Gaps = 4/89 (4%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHI---KAL 58
           D RP+R  +  +    FV E+F+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI   KAL
Sbjct: 122 DMRPTRYLLTIRYASEFVTEYFEQNPISQLGIIGMRDGIAVRISDMSGNPTEHIERLKAL 181

Query: 59  MGKLGCSGDSSLQNALDLVQGLLSQIPSY 87
               G  G+ SLQNAL++ +  L   PS+
Sbjct: 182 RVDQG-QGNPSLQNALEMSRAALFHAPSH 209


>gi|26347121|dbj|BAC37209.1| unnamed protein product [Mus musculus]
          Length = 201

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 49/82 (59%), Gaps = 2/82 (2%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLL 81
             + C G+ SL N+L +    L
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTL 156


>gi|30584663|gb|AAP36584.1| Homo sapiens general transcription factor IIH, polypeptide 2, 44kDa
           [synthetic construct]
          Length = 166

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 34/85 (40%), Positives = 50/85 (58%), Gaps = 2/85 (2%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQI 84
             + C G+ SL N+L +    L  +
Sbjct: 135 VDMTCHGEPSLYNSLSIAMQTLKLV 159


>gi|13529149|gb|AAH05345.1| GTF2H2 protein [Homo sapiens]
 gi|30582385|gb|AAP35419.1| general transcription factor IIH, polypeptide 2, 44kDa [Homo
           sapiens]
          Length = 165

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 34/85 (40%), Positives = 50/85 (58%), Gaps = 2/85 (2%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P  HI +L   
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134

Query: 61  -KLGCSGDSSLQNALDLVQGLLSQI 84
             + C G+ SL N+L +    L  +
Sbjct: 135 VDMTCHGEPSLYNSLSIAMQTLKLV 159


>gi|238503359|ref|XP_002382913.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Aspergillus
          flavus NRRL3357]
 gi|220691723|gb|EED48071.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Aspergillus
          flavus NRRL3357]
          Length = 87

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 32/82 (39%), Positives = 53/82 (64%), Gaps = 2/82 (2%)

Query: 2  DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
          D RP+R  +  +  +  VREFF+QNP+SQ+G++ ++DG+A  ++DL G+P  HI A+  +
Sbjct: 5  DLRPTRYLLTLRYAQELVREFFEQNPISQLGVLGLRDGLAIRISDLSGNPTEHISAIQTL 64

Query: 60 GKLGCSGDSSLQNALDLVQGLL 81
                G  SLQN +++ +G L
Sbjct: 65 RDQDPKGLPSLQNGIEMARGAL 86


>gi|241165648|ref|XP_002409694.1| Bcl-2-associated transcription factor (BTF), putative [Ixodes
           scapularis]
 gi|215494619|gb|EEC04260.1| Bcl-2-associated transcription factor (BTF), putative [Ixodes
           scapularis]
          Length = 123

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/83 (44%), Positives = 56/83 (67%), Gaps = 4/83 (4%)

Query: 111 IQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEF 170
           +QKC    IRCSV+GL+AE+ +C  L + TGG+Y+V +DE+HFKE++ +HA PPP     
Sbjct: 1   MQKCS---IRCSVVGLAAEVRVCCALTKATGGTYNVIMDENHFKEILFQHAIPPPVTGN- 56

Query: 171 AIANLIKMGFPQRAGEGSISICS 193
           + ++LI+M     A   S++I S
Sbjct: 57  SESSLIRMVHVSPACPPSLTIMS 79


>gi|321452740|gb|EFX64060.1| hypothetical protein DAPPUDRAFT_267127 [Daphnia pulex]
          Length = 135

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/100 (36%), Positives = 59/100 (59%), Gaps = 11/100 (11%)

Query: 59  MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
           M    C+G++SLQN+ +     L  +P++  RE+L L   L           I+  KE+ 
Sbjct: 39  MKDTSCNGETSLQNSFERAMCGLKNMPAHSSREMLTLLVIL-----------IKSFKENN 87

Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIM 158
           IR S+IGL+AE+ IC+ + + TGG+Y+V LD+ + KE ++
Sbjct: 88  IRVSIIGLAAEVRICQEIAKRTGGTYNVLLDDHYLKETLI 127


>gi|169806142|ref|XP_001827816.1| transcription-repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
 gi|161779264|gb|EDQ31287.1| transcription-repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
          Length = 187

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 47/183 (25%), Positives = 86/183 (46%), Gaps = 15/183 (8%)

Query: 128 AEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEG 187
            E+ + K +C  TGG Y V ++   FK ++ +      ++      +LI++ FP+     
Sbjct: 18  GEVTLLKKVCISTGGKYVVLINSFQFKSILSDFLYLLESLD--LKCSLIEVEFPKNIYTQ 75

Query: 188 SISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFD 247
            +  C+CH  +   V + CP+CK  VC +P  C IC L+L+SS  +     + + +  F 
Sbjct: 76  HL--CACHLNL-CSVLFECPKCKGFVCTIPFTCPICKLELISSTDIRDLICYNYYLESFV 132

Query: 248 EVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNC 307
           ++      +  +  ++ C+GC++  + S         C KC   FC  CD  +H  ++ C
Sbjct: 133 KIVT--SKELSSLEKNKCYGCEKLEVIS--------VCNKCLSPFCYNCDAKLHNVINFC 182

Query: 308 PGC 310
             C
Sbjct: 183 SFC 185


>gi|119498681|ref|XP_001266098.1| transcription factor and DNA repair complex, core TFIIH, putative
           [Neosartorya fischeri NRRL 181]
 gi|119414262|gb|EAW24201.1| transcription factor and DNA repair complex, core TFIIH, putative
           [Neosartorya fischeri NRRL 181]
          Length = 157

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 53/80 (66%), Gaps = 2/80 (2%)

Query: 4   RPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGK 61
           RP+R  +  +  + FVREFF+QNP+SQ+G++ ++DG+A  ++D+ G+P  HI A+  +  
Sbjct: 77  RPTRYLLSLRYAQEFVREFFEQNPISQLGVLGLRDGLAIRVSDMSGNPTEHISAIQALRD 136

Query: 62  LGCSGDSSLQNALDLVQGLL 81
               G  SLQN L++ +G L
Sbjct: 137 HDPKGLPSLQNGLEMARGAL 156


>gi|431907809|gb|ELK11416.1| Baculoviral IAP repeat-containing protein 1 [Pteropus alecto]
          Length = 1274

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 40/97 (41%), Positives = 56/97 (57%), Gaps = 12/97 (12%)

Query: 139 DTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRA--------GEGSIS 190
           +  G+Y V LDESH+KEL+  H  PPPA +     +LI+MGFPQ           + S S
Sbjct: 77  ELSGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMGFPQHTIASLSDQDAKPSFS 135

Query: 191 ICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICG 224
           +       + G+   GY CP+C+A+ CELP +C+ICG
Sbjct: 136 MAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICG 172



 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 39/63 (61%), Gaps = 5/63 (7%)

Query: 2  DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGS-----PESHIK 56
          D +P+R+    K +E F+ E+FDQNP+SQIG++  K   A  LT+L G+      ESH K
Sbjct: 33 DLKPNRLTCTLKLLEYFIEEYFDQNPISQIGIIVTKSKRAEKLTELSGTYHVILDESHYK 92

Query: 57 ALM 59
           L+
Sbjct: 93 ELL 95


>gi|253742551|gb|EES99376.1| TFIIH P44 [Giardia intestinalis ATCC 50581]
          Length = 393

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 70/319 (21%), Positives = 121/319 (37%), Gaps = 32/319 (10%)

Query: 18  FVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGKLGCSGDSSLQNALDLV 77
           +V+      PL ++  +T+++ +         +  +  + L      SG  +L   LD  
Sbjct: 78  YVQFILSHRPLIRVAFMTLQNSLCYLELRFSSNLSAFEEVLRDGFRPSGHIALATGLDQA 137

Query: 78  QGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHL 136
             LL    S    R +L +  + +T D  DI   ++K   ++++   + L+    +  H+
Sbjct: 138 YELLMNSKSAASSRRILYINFSSATIDSLDIFNVLEKLLSAQVQVDTVSLTNSFGVLDHI 197

Query: 137 CQDTGGSYSVALDESH------------FKELIMEHAPPPPAIAEFAIANLIKMGFPQRA 184
            + TGG + +   E+               E +          +       I +GFP   
Sbjct: 198 SRKTGGRHLIIDRETKGGLLNLLKRLELRSEDVRITQATEEGDSNVDEGKFIMVGFPVHI 257

Query: 185 GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS---YHHLF 241
            + +I  C CH     G  Y CP C A VC++ ++C  CGLQLV  P+L R    +    
Sbjct: 258 CDDAI--CFCHNRHGTGF-YECPFCYAVVCDI-SECVCCGLQLVMYPNLYRGALLHKRAL 313

Query: 242 PIAPFDE--VTPLCLNDPRN--------RSRSTCFGCQQSLLSSGNKPGLYVACPKCKKH 291
                D   V P       N        +++S C  CQ    +  ++      C  C   
Sbjct: 314 SFTKSDSSLVQPEAATSSANQAEDSKHIKAKSVCSLCQMQYGTDTDRSQYL--CSFCSGP 371

Query: 292 FCLECDIYIHESLHNCPGC 310
           FC  C   + E+  +CP C
Sbjct: 372 FCKSCSELLAEAKVSCPVC 390


>gi|736404|gb|AAA64502.1| BTF2p44 [Homo sapiens]
          Length = 130

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 35/54 (64%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHI 55
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+P   I
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKXI 128


>gi|432104566|gb|ELK31178.1| Baculoviral IAP repeat-containing protein 1 [Myotis davidii]
          Length = 1414

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/63 (44%), Positives = 39/63 (61%), Gaps = 5/63 (7%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGS-----PESHIK 56
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G+      ESH K
Sbjct: 94  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGTYHVILDESHYK 153

Query: 57  ALM 59
            L+
Sbjct: 154 ELL 156


>gi|159115826|ref|XP_001708135.1| TFIIH P44 [Giardia lamblia ATCC 50803]
 gi|157436245|gb|EDO80461.1| TFIIH P44 [Giardia lamblia ATCC 50803]
          Length = 406

 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 58/247 (23%), Positives = 98/247 (39%), Gaps = 36/247 (14%)

Query: 18  FVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGKLGCSGDSSLQNALDLV 77
           +V+      PL ++  +T+++ +         +  +  + L      SG  +L   LD  
Sbjct: 78  YVQFILSHRPLIRVAFMTLQNSLCYLELRFSSNLSAFEEVLRDGFRPSGHIALATGLDQA 137

Query: 78  QGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHL 136
             LL    S    R +L +  + +T D  DI   ++K   ++I+  ++ L+    +  H+
Sbjct: 138 YELLMNNKSAASSRRILYVNFSNATLDSLDIFSVLEKLLAAQIQVDIVSLTNRFGVLDHI 197

Query: 137 CQDTGGSYSVA-----------------------LDESHFKELIMEH-----APPPPAIA 168
            + TGG + +                        L  S  +E   EH     A       
Sbjct: 198 SRVTGGRHLIVDRDVKGGLLNLLKRLELRSEDTRLMHSKGRE---EHSLNLQASHNQTAT 254

Query: 169 EFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
           +      I +GFP    + +I  C CH     G  Y CP C A VC++ ++C  CGLQLV
Sbjct: 255 DTEGGKFIMVGFPVHICDDAI--CFCHNRHGTGF-YECPFCYAVVCDI-SECVCCGLQLV 310

Query: 229 SSPHLAR 235
             P+L R
Sbjct: 311 MYPNLYR 317


>gi|308162603|gb|EFO64989.1| TFIIH P44 [Giardia lamblia P15]
          Length = 416

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 57/254 (22%), Positives = 98/254 (38%), Gaps = 40/254 (15%)

Query: 18  FVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGKLGCSGDSSLQNALDLV 77
           +V+      PL ++  +T+++ +         +  +  + L      SG  +L   LD  
Sbjct: 78  YVQFILSHRPLIRVAFMTLQNSLCYLELRFSSNLSAFEEVLRDGFRPSGHIALATGLDQA 137

Query: 78  QGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHL 136
             LL    S    R +L +  + +T D  DI   ++K   ++I+  V+ L+    +  H+
Sbjct: 138 YDLLMNSGSAASSRRILYINFSNATLDSLDIFSVLEKLFAAQIQVDVVSLTNSFGVLDHI 197

Query: 137 CQDTGGSY----------------------------------SVALDESHFKELIMEHAP 162
            + TGG +                                  ++ L  SH + +      
Sbjct: 198 SRTTGGRHLIVDREIKGGLLNLLKRLELRSEDAKIMQSKGEGTLNLQMSHNQIITDAKCT 257

Query: 163 PPPAIA-EFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCR 221
               +A        I +GFP    + +I  C CH     G  Y CP C A VC++ ++C 
Sbjct: 258 KDTVLASNTEEGKFIMVGFPVHICDDAI--CFCHNRHGTGF-YECPFCYAIVCDI-SECV 313

Query: 222 ICGLQLVSSPHLAR 235
            CGLQLV  P+L R
Sbjct: 314 CCGLQLVMYPNLYR 327


>gi|222838932|gb|EEE77283.1| predicted protein [Populus trichocarpa]
          Length = 72

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 23/48 (47%), Positives = 32/48 (66%)

Query: 2  DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGG 49
          D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L G
Sbjct: 18 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSG 65


>gi|321454255|gb|EFX65433.1| hypothetical protein DAPPUDRAFT_264704 [Daphnia pulex]
          Length = 72

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 29/85 (34%), Positives = 48/85 (56%), Gaps = 16/85 (18%)

Query: 84  IPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS 143
           +P++  RE+L L   L                E+ IR S+IGL+AE+ IC+ + + TGG+
Sbjct: 1   MPAHSSREMLTLLVIL----------------ENNIRVSIIGLAAEVRICQEIAKRTGGT 44

Query: 144 YSVALDESHFKELIMEHAPPPPAIA 168
           Y+V LD+ + KE ++ +   P A+A
Sbjct: 45  YNVLLDDHYLKETLILNQVQPLAVA 69


>gi|149059194|gb|EDM10201.1| general transcription factor II H, polypeptide 2 (predicted),
           isoform CRA_c [Rattus norvegicus]
          Length = 159

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 32/49 (65%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGS 50
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L  +
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSDT 123


>gi|148668488|gb|EDL00807.1| general transcription factor II H, polypeptide 2, isoform CRA_b
           [Mus musculus]
          Length = 159

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 32/49 (65%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGS 50
           D +P+R+    K +E FV E+FDQNP+SQIG++  K   A  LT+L  +
Sbjct: 75  DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSDT 123


>gi|294930956|ref|XP_002779725.1| btf, putative [Perkinsus marinus ATCC 50983]
 gi|239889260|gb|EER11520.1| btf, putative [Perkinsus marinus ATCC 50983]
          Length = 200

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 53/92 (57%), Gaps = 4/92 (4%)

Query: 1   MDFRPSRMAVVAKQV-EAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM 59
           MD++ +R+  V   +   F+ ++FD NP+S + ++ ++D  A+ +T + G P S ++ ++
Sbjct: 107 MDYKQNRLDFVVSHLCRNFIPQYFDLNPISNLSVLALRDQRAHFVTRMSGQPASQMERVL 166

Query: 60  --GKLGCSGDSSLQNALDLV-QGLLSQIPSYG 88
                G SG +SL NAL+ V Q     +P YG
Sbjct: 167 QFSTGGASGSASLVNALEAVGQAAKGSLPRYG 198


>gi|399949974|gb|AFP65630.1| hypothetical protein CMESO_485 [Chroomonas mesostigmatica CCMP1168]
          Length = 334

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/175 (26%), Positives = 75/175 (42%), Gaps = 17/175 (9%)

Query: 11  VAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKLGCSGDS 68
           + K V   +  F   NP  Q+ ++ VK+     ++ L G+ + H + L  + K    G++
Sbjct: 28  IFKLVSKLIHYFLYSNPSYQLAIILVKNNSIEQISHLSGNKKHHNQGLYRIKKKQDLGEN 87

Query: 69  SLQNALDLVQGLLSQIPSYGHREVLIL----YSALSTCDPGDIMETIQKCKESKIRCSVI 124
           SL+ A  L + LLS       +E++I     Y +      G          ++KI+ SVI
Sbjct: 88  SLKTAFFLAKKLLSFNKKKSEQEIIIFMGGSYVSSVFHSFGSFFV------KNKIKFSVI 141

Query: 125 GLSAEMFICKHLCQDTGGSYSVALDESHFK-----ELIMEHAPPPPAIAEFAIAN 174
               + F+ + L + TGG Y V   +S+F       L M      P    FAI N
Sbjct: 142 MFYEKTFLFETLVKITGGFYIVLKKDSNFDLISCMRLNMFRKTQIPKKVYFAIGN 196


>gi|496307|gb|AAA35101.1| stem loop mutation suppressor, partial [Saccharomyces cerevisiae]
          Length = 62

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 6/54 (11%)

Query: 265 CFGCQQSL-LSSGNKPGLYVA-----CPKCKKHFCLECDIYIHESLHNCPGCES 312
           CF CQ    +   +K G  +      C  CK+ FC++CD++IHE LHNCPGCES
Sbjct: 4   CFSCQSRFPILKNHKNGKLLTSSRYRCEDCKQEFCVDCDVFIHEILHNCPGCES 57


>gi|157870269|ref|XP_001683685.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68126751|emb|CAJ05184.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 446

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/188 (20%), Positives = 78/188 (41%), Gaps = 30/188 (15%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D+ P+ +  +   +  FV  + D  PL+ +G+V ++DG+++ L     +    +  L   
Sbjct: 19  DYLPNYLLAMRPPLLRFVERYLDSTPLASLGVVVMRDGISHRLLPCTTNRNEIVDVLERD 78

Query: 62  L---GCSGDSSLQNALDLVQGLL-----------------SQIPSYGHR----------E 91
           +   G SG++S++N L +    L                 S + +   R           
Sbjct: 79  VFLHGGSGNTSMENGLRMAMSELVDMREVAALAAATAKKSSSVAARNPRAAWKGSATQLN 138

Query: 92  VLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDES 151
           V++L ++++  DP D+   +       IR SV+ L   + +      +TGG+    L+  
Sbjct: 139 VIVLTASVTLIDPTDVFAVVNIMAVLSIRISVVSLVGAVHVFDVCTVETGGTLYCPLNYD 198

Query: 152 HFKELIME 159
           H   ++ E
Sbjct: 199 HLLHIMDE 206


>gi|312076198|ref|XP_003140754.1| hypothetical protein LOAG_05169 [Loa loa]
          Length = 127

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 30/45 (66%)

Query: 5   PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGG 49
           PSR+AV  K +  F+ +F +QNP+SQ+G+V  KD  A CL  L G
Sbjct: 83  PSRLAVTIKALNQFLDKFSEQNPISQVGIVVCKDKRAECLIPLTG 127


>gi|66361323|pdb|1Z60|A Chain A, Solution Structure Of The Carboxy-Terminal Domain Of Human
           Tfiih P44 Subunit
          Length = 59

 Score = 45.8 bits (107), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 21/46 (45%), Positives = 31/46 (67%), Gaps = 5/46 (10%)

Query: 265 CFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
           C+GCQ  L        +YV C  C+  FC++CD+++H+SLH+CPGC
Sbjct: 18  CYGCQGEL----KDQHVYV-CAVCQNVFCVDCDVFVHDSLHSCPGC 58


>gi|736405|gb|AAA64503.1| BTF2p44 [Homo sapiens]
          Length = 69

 Score = 44.7 bits (104), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 40/69 (57%), Gaps = 10/69 (14%)

Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
           P+  F E+ PL   +  N  R  C+GCQ  L        +YV C  C+  FC++CD+++H
Sbjct: 1   PLDAFQEI-PL---EEYNGER-FCYGCQGELKDQH----VYV-CAXCQNVFCVDCDVFVH 50

Query: 302 ESLHNCPGC 310
           +SLH CPGC
Sbjct: 51  DSLHCCPGC 59


>gi|392579844|gb|EIW72971.1| hypothetical protein TREMEDRAFT_70875 [Tremella mesenterica DSM
           1558]
          Length = 343

 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 32/132 (24%), Positives = 57/132 (43%), Gaps = 8/132 (6%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--- 58
           D+ P+R    A+ V        D NP S +GL+T+     + L     +P + I  L   
Sbjct: 20  DYPPTRFQAQAQAVSTVFTAKTDSNPESAVGLMTMAGKSPSLLV----TPTNDIGKLLSA 75

Query: 59  MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETI-QKCKES 117
           MGK    GDS    A+ + Q  L    +   R+ +I++      DP + +  + +K +++
Sbjct: 76  MGKASIGGDSDFSTAVQIAQLALKHRENKNQRQRVIVFVGSPVSDPQEALVKLGKKLRKN 135

Query: 118 KIRCSVIGLSAE 129
            +   V+    E
Sbjct: 136 NVLVDVVTFGEE 147


>gi|156341095|ref|XP_001620650.1| hypothetical protein NEMVEDRAFT_v1g147480 [Nematostella
          vectensis]
 gi|156205835|gb|EDO28550.1| predicted protein [Nematostella vectensis]
          Length = 93

 Score = 43.1 bits (100), Expect = 0.17,   Method: Composition-based stats.
 Identities = 15/30 (50%), Positives = 24/30 (80%)

Query: 2  DFRPSRMAVVAKQVEAFVREFFDQNPLSQI 31
          D +P+R++  AK +E F+ E+FDQNP+SQ+
Sbjct: 56 DLKPNRLSCSAKLLENFITEYFDQNPISQV 85


>gi|407926572|gb|EKG19539.1| Transcription factor Tfb4 [Macrophomina phaseolina MS6]
          Length = 380

 Score = 42.0 bits (97), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 36/132 (27%), Positives = 54/132 (40%), Gaps = 12/132 (9%)

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVAL-DESHFKELIMEHAPPPP 165
           +M  I   +  +I   ++ L+ +    +  C  TGG Y   L  E   + L+M   P   
Sbjct: 218 VMNAIFAAQRQRIPIDILKLAGDTVFLQQACDATGGIYLKPLAPEGLLQYLMMAFLPDET 277

Query: 166 AIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTD--CRIC 223
           A     + +  ++ F  RA       C CH+ V V +GY C  C +  CE P D  C  C
Sbjct: 278 ARKHLVLPSAGEVDF--RAA------CFCHRRV-VDIGYVCSVCLSIFCEPPADQTCPTC 328

Query: 224 GLQLVSSPHLAR 235
              L  S  + R
Sbjct: 329 TNPLSVSSSITR 340


>gi|426197233|gb|EKV47160.1| hypothetical protein AGABI2DRAFT_204058 [Agaricus bisporus var.
           bisporus H97]
          Length = 361

 Score = 41.6 bits (96), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 52/121 (42%), Gaps = 15/121 (12%)

Query: 107 IMETI---QKCKESKIRCSVIGLSAEMFICKHLCQDTGGSY-SVALDESHFKELIMEHAP 162
           IM ++   QK K +   C V G   +    +     TGGSY  +   ++  + LIM   P
Sbjct: 162 IMNSVFSAQKLKVTIDVCQVFG--PDTVFLQQAAHLTGGSYIHLERRDALLQYLIMAFLP 219

Query: 163 PPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRI 222
           PPP     A+    K+ F  RA       C CHK + V +G+ C  C +  C+    C  
Sbjct: 220 PPPIRKVLAVPTQDKVDF--RAA------CFCHKNI-VDIGFVCSVCLSIFCQPVPVCST 270

Query: 223 C 223
           C
Sbjct: 271 C 271


>gi|89096888|ref|ZP_01169779.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp.
           NRRL B-14911]
 gi|89088268|gb|EAR67378.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp.
           NRRL B-14911]
          Length = 459

 Score = 41.6 bits (96), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 57/123 (46%), Gaps = 7/123 (5%)

Query: 47  LGGSPESHIKALMGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVL-ILYSALSTCDPG 105
           LG   ++     M     SG + L  A+D  + LLS   S  ++  L I+   + TCD G
Sbjct: 224 LGAYEKTAFNKSMDSFEASGWTPLAGAIDKARELLSAYNSTDYKNTLYIVSDGVETCD-G 282

Query: 106 DIMETIQKCKESKI--RCSVIGLSAE---MFICKHLCQDTGGSYSVALDESHFKELIMEH 160
           D +E  Q+ + S I  + ++IG   +       K + +  GG+Y+   D+   ++ +++ 
Sbjct: 283 DPVEAAQQLQGSNIEAKVNIIGFDVDDEGQKQLKEVAEAGGGTYATVRDKDELEDQVLKK 342

Query: 161 APP 163
             P
Sbjct: 343 WKP 345


>gi|409080333|gb|EKM80693.1| hypothetical protein AGABI1DRAFT_119282 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 361

 Score = 41.6 bits (96), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 52/121 (42%), Gaps = 15/121 (12%)

Query: 107 IMETI---QKCKESKIRCSVIGLSAEMFICKHLCQDTGGSY-SVALDESHFKELIMEHAP 162
           IM ++   QK K +   C V G   +    +     TGGSY  +   ++  + LIM   P
Sbjct: 162 IMNSVFSAQKLKVTIDVCQVFG--PDTVFLQQAAHLTGGSYIHLERRDALLQYLIMAFLP 219

Query: 163 PPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRI 222
           PPP     A+    K+ F  RA       C CHK + V +G+ C  C +  C+    C  
Sbjct: 220 PPPIRKVLAVPTQDKVDF--RAA------CFCHKNI-VDIGFVCSVCLSIFCQPVPVCST 270

Query: 223 C 223
           C
Sbjct: 271 C 271


>gi|307354884|ref|YP_003895935.1| hypothetical protein Mpet_2754 [Methanoplanus petrolearius DSM
           11571]
 gi|307158117|gb|ADN37497.1| conserved hypothetical protein [Methanoplanus petrolearius DSM
           11571]
          Length = 316

 Score = 41.6 bits (96), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 41/176 (23%), Positives = 75/176 (42%), Gaps = 28/176 (15%)

Query: 2   DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
           D+ P+R+       E  +    D +P   +G+V  + G A+  + L    +S I+ L   
Sbjct: 104 DYSPNRLEATKSAAEELIN---DLDPKDYVGIVVFESG-ASTASYLSPDKDSVIENLENI 159

Query: 62  LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIM--ETIQKCKESKI 119
           +   G +++ + L L   +   IP+   ++V+IL S     + G I   E IQ  K+S I
Sbjct: 160 MEKDGATAIGDGLSLGINMADSIPN--RKKVVILLSD-GVNNAGVISPDEAIQFAKDSDI 216

Query: 120 RCSVIGLSAEMFIC-------------------KHLCQDTGGSYSVALDESHFKEL 156
           +   IG+ +E  +                    K +  +TGG Y  ++D+    E+
Sbjct: 217 QVFTIGMGSEQPVVMGYDWFGNPQYAELDEATLKEIADETGGKYFKSVDDQTLNEI 272


>gi|449683803|ref|XP_002159513.2| PREDICTED: general transcription factor IIH subunit 3-like [Hydra
           magnipapillata]
          Length = 291

 Score = 40.8 bits (94), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 49/215 (22%), Positives = 80/215 (37%), Gaps = 27/215 (12%)

Query: 37  KDGVANCLTDLGGSPESHIKALMGKLGCSGDSSLQNALDLVQGLLSQIPSYGH------- 89
           KDG     +++  + +   K L  +      S++ N   L+ G L++   Y H       
Sbjct: 86  KDGKYELFSEMNETLQCEFKRLFAE----SVSNMTNRPSLLAGALTKALCYIHSHDRTAN 141

Query: 90  ------REVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS 143
                 R ++I  S+ S+     +M  I    +  I      L       +     TGG 
Sbjct: 142 GRRTNARILIIKGSSDSSSQYMTVMNCIFAASKKNIVIDCCALQNNSGFMQQASDITGGV 201

Query: 144 YSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVG 203
           Y    D S   E ++    P P + E       K+  P  +     + C CHK++ V VG
Sbjct: 202 YFFIDDFSGMLEYLLWIFLPDPGLRE-------KLNLPTSSQIDYRAACFCHKQL-VDVG 253

Query: 204 YTCPRCKARVCELPTDCRICG--LQLVSSPHLARS 236
           + C  C +  C+    C  C    +L S P  A+S
Sbjct: 254 FVCSVCLSIYCQFMPKCATCQTRFKLPSLPLNAKS 288


>gi|302496747|ref|XP_003010374.1| hypothetical protein ARB_03075 [Arthroderma benhamiae CBS 112371]
 gi|291173917|gb|EFE29734.1| hypothetical protein ARB_03075 [Arthroderma benhamiae CBS 112371]
          Length = 414

 Score = 40.8 bits (94), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 12/124 (9%)

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESH-FKELIMEHAPPPP 165
           IM +I  C+   I   +  L+ +    +  C  T G Y ++LD    F + +M    P  
Sbjct: 251 IMNSIFACQRLHIPIDICKLAGDAVFLQQACDATRGIY-MSLDSPRGFLQYLMLAFLPD- 308

Query: 166 AIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRIC 223
              + A  NLI    P R      + C CH++V V VG+ C  C +  CE P   +C  C
Sbjct: 309 ---QRARRNLI---LPTRVDVDFRAACFCHRKV-VNVGFVCSICLSIFCEPPEGANCLTC 361

Query: 224 GLQL 227
           G  L
Sbjct: 362 GTHL 365


>gi|242764012|ref|XP_002340688.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218723884|gb|EED23301.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 380

 Score = 40.4 bits (93), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 64/247 (25%), Positives = 96/247 (38%), Gaps = 32/247 (12%)

Query: 3   FRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHI-KALMGK 61
           +RP R  VV +QV   VRE       S +   +    +A  LT       SHI +  M  
Sbjct: 121 YRPFR--VVEEQVLENVRELLASTDASDVSATSTM--LAGALT----LALSHINRRTMTW 172

Query: 62  LGCSGDSSLQNALDLVQGLLSQIP-----SYGHREVLILYSALSTCDPGD----IMETIQ 112
               G+SS+  A        +        S G +  +++ S  S+ D       IM +I 
Sbjct: 173 TETHGNSSVDTANAATGSSSAAAVSGGNVSLGLQSRILIVSVSSSTDSAHQYIPIMNSIF 232

Query: 113 KCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESH--FKELIMEHAPPPPAIAEF 170
            C+   I   V  LS +    +     T G Y ++L E     + L+M   P   +    
Sbjct: 233 ACQRLHIPIDVCKLSGDAVFLQQASDATRGVY-MSLTEPRGLLQYLMMAFLPDQRSRKHL 291

Query: 171 AIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPT--DCRICGLQLV 228
            + + + + F  RA       C CH+ V V +G+ C  C +  CE P   DC  CG  L 
Sbjct: 292 VLPSRVDVDF--RAA------CFCHRRV-VNIGFVCSICLSIFCEPPENGDCLTCGTHLE 342

Query: 229 SSPHLAR 235
              + AR
Sbjct: 343 IGDYGAR 349


>gi|327299656|ref|XP_003234521.1| transcription factor TFIIH subunit Tfb4 [Trichophyton rubrum CBS
           118892]
 gi|326463415|gb|EGD88868.1| transcription factor TFIIH subunit Tfb4 [Trichophyton rubrum CBS
           118892]
          Length = 413

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 12/124 (9%)

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESH-FKELIMEHAPPPP 165
           IM +I  C+   I   +  L+ +    +  C  T G Y ++LD    F + +M    P  
Sbjct: 250 IMNSIFACQRLHIPIDICKLAGDAVFLQQACDATRGIY-MSLDSPRGFLQYLMLAFLPD- 307

Query: 166 AIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRIC 223
              + A  NLI    P R      + C CH++V V VG+ C  C +  CE P   +C  C
Sbjct: 308 ---QRARRNLI---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGANCLTC 360

Query: 224 GLQL 227
           G  L
Sbjct: 361 GTHL 364


>gi|212529242|ref|XP_002144778.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
           marneffei ATCC 18224]
 gi|210074176|gb|EEA28263.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
           marneffei ATCC 18224]
          Length = 380

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 61/246 (24%), Positives = 96/246 (39%), Gaps = 30/246 (12%)

Query: 3   FRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHI-KALMGK 61
           +RP R+  V +QV   VRE       S +   +    +A  LT       SHI +  M  
Sbjct: 121 YRPFRL--VEEQVMENVRELLASTNASDVTATSTM--MAGALT----LALSHINRRTMTW 172

Query: 62  LGCSGDSSLQNALDLVQGLLSQIP-----SYGHREVLILYSALSTCDPGD----IMETIQ 112
               G+S+++ A        +        S G +  +++ S  S+ D       IM +I 
Sbjct: 173 TETHGNSNIETAGASASSSATAAVSGGNLSLGLQSRILIVSVSSSTDSAHQYIPIMNSIF 232

Query: 113 KCKESKIRCSVIGLSAEMFICKHLCQDTGGSY-SVALDESHFKELIMEHAPPPPAIAEFA 171
            C+   I   V  LS +    +     T G Y S++      + L+M   P   +     
Sbjct: 233 ACQRLHIPIDVCKLSGDAVFLQQASDATRGVYMSLSEPRGLLQYLMMAFLPDQRSRKHLI 292

Query: 172 IANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPT--DCRICGLQLVS 229
           + + + + F  RA       C CH+ V V +G+ C  C +  CE P   DC  CG  L  
Sbjct: 293 VPSRVDVDF--RAA------CFCHRRV-VNIGFVCSICLSIFCEPPENGDCLTCGTHLEI 343

Query: 230 SPHLAR 235
             + AR
Sbjct: 344 GDYSAR 349


>gi|326473589|gb|EGD97598.1| transcription factor TFIIH subunit Tfb4 [Trichophyton tonsurans CBS
           112818]
          Length = 415

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 12/124 (9%)

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESH-FKELIMEHAPPPP 165
           IM +I  C+   I   +  L+ +    +  C  T G Y ++LD    F + +M    P  
Sbjct: 252 IMNSIFACQRLHIPIDICKLAGDAVFLQQACDATRGIY-MSLDSPRGFLQYLMLAFLPD- 309

Query: 166 AIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRIC 223
              + A  NLI    P R      + C CH++V V VG+ C  C +  CE P   +C  C
Sbjct: 310 ---QRARRNLI---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGANCLTC 362

Query: 224 GLQL 227
           G  L
Sbjct: 363 GTHL 366


>gi|326480715|gb|EGE04725.1| TFIIH basal transcription factor complex p34 subunit [Trichophyton
           equinum CBS 127.97]
          Length = 415

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 12/124 (9%)

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESH-FKELIMEHAPPPP 165
           IM +I  C+   I   +  L+ +    +  C  T G Y ++LD    F + +M    P  
Sbjct: 252 IMNSIFACQRLHIPIDICKLAGDAVFLQQACDATRGIY-MSLDSPRGFLQYLMLAFLPD- 309

Query: 166 AIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRIC 223
              + A  NLI    P R      + C CH++V V VG+ C  C +  CE P   +C  C
Sbjct: 310 ---QRARRNLI---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGANCLTC 362

Query: 224 GLQL 227
           G  L
Sbjct: 363 GTHL 366


>gi|315052780|ref|XP_003175764.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
           gypseum CBS 118893]
 gi|311341079|gb|EFR00282.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
           gypseum CBS 118893]
          Length = 413

 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 54/125 (43%), Gaps = 14/125 (11%)

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALD--ESHFKELIMEHAPPP 164
           IM +I  C+   I   +  L+ +    +  C  T G Y ++LD      + L++   P  
Sbjct: 250 IMNSIFACQRLHIPIDICKLAGDAVFLQQACDATRGIY-MSLDSPRGFLQYLMLAFLPD- 307

Query: 165 PAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRI 222
               + A  NLI    P R      + C CH++V V VG+ C  C +  CE P   +C  
Sbjct: 308 ----QRARRNLI---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGANCLT 359

Query: 223 CGLQL 227
           CG  L
Sbjct: 360 CGTHL 364


>gi|302656245|ref|XP_003019878.1| hypothetical protein TRV_06076 [Trichophyton verrucosum HKI 0517]
 gi|291183651|gb|EFE39254.1| hypothetical protein TRV_06076 [Trichophyton verrucosum HKI 0517]
          Length = 517

 Score = 39.7 bits (91), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 54/126 (42%), Gaps = 14/126 (11%)

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESH--FKELIMEHAPPP 164
           IM +I  C+   I   +  L+ +    +  C  T G Y ++LD      + L++   P  
Sbjct: 354 IMNSIFACQRLHIPIDICKLAGDAVFLQQACDATRGIY-MSLDSPRGFLQYLMLAFLP-- 410

Query: 165 PAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRI 222
               + A  NLI    P R      + C CH++V V VG+ C  C +  CE P   +C  
Sbjct: 411 ---DQRARRNLI---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGANCLT 463

Query: 223 CGLQLV 228
           CG  L 
Sbjct: 464 CGTHLA 469


>gi|296815720|ref|XP_002848197.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
           otae CBS 113480]
 gi|238841222|gb|EEQ30884.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
           otae CBS 113480]
          Length = 413

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 49/123 (39%), Gaps = 10/123 (8%)

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA 166
           IM +I  C+   I   +  L+ +    +  C  T G Y        F + +M    P   
Sbjct: 250 IMNSIFACQRLHIPIDICKLAGDAVFLQQACDATRGIYMSVDSPRGFLQYLMLAFLPDQR 309

Query: 167 IAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRICG 224
           +      NL+    P R      + C CH++V V VG+ C  C +  CE P   +C  CG
Sbjct: 310 VRR----NLV---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGANCLTCG 361

Query: 225 LQL 227
             L
Sbjct: 362 THL 364


>gi|162448865|ref|YP_001611232.1| hypothetical protein sce0595 [Sorangium cellulosum So ce56]
 gi|161159447|emb|CAN90752.1| hypothetical protein sce0595 [Sorangium cellulosum So ce56]
          Length = 656

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 29/142 (20%), Positives = 57/142 (40%), Gaps = 7/142 (4%)

Query: 26  NPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGKLGCSGDSSLQNALDLVQGLLSQIP 85
            P   + L T    V   L   G   +  I A +  L   G +++ + +DL   L  +  
Sbjct: 332 KPGDTVALCTYAGSVREVLAPTGIESKGKILAALADLTAGGSTAMSSGIDLAYSLAERTL 391

Query: 86  SYGH-REVLILYSALSTCDP---GDIMETIQKCKESKIRCSVIGLSAEMF---ICKHLCQ 138
             GH   V++L    +   P    +I++TI++ ++  I  S +G     +   + + L  
Sbjct: 392 VKGHVNRVIVLSDGDANVGPTSHDEILKTIKRARDKGITLSTVGFGQGNYKDLMMEQLAN 451

Query: 139 DTGGSYSVALDESHFKELIMEH 160
              G+Y+    E+  + +  E 
Sbjct: 452 QGDGNYAYIDSEAQARRVFSEQ 473


>gi|301122235|ref|XP_002908844.1| general transcription factor IIH subunit, putative [Phytophthora
           infestans T30-4]
 gi|262099606|gb|EEY57658.1| general transcription factor IIH subunit, putative [Phytophthora
           infestans T30-4]
          Length = 293

 Score = 39.3 bits (90), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 23/87 (26%), Positives = 38/87 (43%), Gaps = 8/87 (9%)

Query: 140 TGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVK 199
           TGG Y    D S   + ++    P P++ +        +  P +      ++C CH+EV 
Sbjct: 197 TGGIYYKPNDHSGLLQYLISIYLPDPSMRKL-------LKLPSQDSVDFRAMCFCHREV- 248

Query: 200 VGVGYTCPRCKARVCELPTDCRICGLQ 226
           +   Y CP C +  CE    C  CG++
Sbjct: 249 ISTAYVCPVCLSLFCEFRPICSTCGIR 275


>gi|258567552|ref|XP_002584520.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237905966|gb|EEP80367.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 398

 Score = 39.3 bits (90), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 35/125 (28%), Positives = 53/125 (42%), Gaps = 14/125 (11%)

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDE--SHFKELIMEHAPPP 164
           IM +I  C+   I   +  LS +    +  C  T G Y V +D      + L++   P  
Sbjct: 235 IMNSIFACQRLHIPIDICKLSGDAVFLQQACDATRGIY-VPVDHPLGFLQYLMVAFLPDQ 293

Query: 165 PAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRI 222
            + +   +   + + F  RA       C CH++V V VG+ C  C +  CE P   DC  
Sbjct: 294 RSRSHLILPTRVDVDF--RAA------CFCHRKV-VDVGFVCSICLSIFCEPPEGADCLT 344

Query: 223 CGLQL 227
           CG  L
Sbjct: 345 CGTHL 349


>gi|348676264|gb|EGZ16082.1| hypothetical protein PHYSODRAFT_508027 [Phytophthora sojae]
          Length = 298

 Score = 38.9 bits (89), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 23/87 (26%), Positives = 38/87 (43%), Gaps = 8/87 (9%)

Query: 140 TGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVK 199
           TGG Y    D S   + ++    P P++ +        +  P +      ++C CH+EV 
Sbjct: 202 TGGIYYKPNDHSGLLQYLISIYLPDPSMRKL-------LKLPSQDSVDFRAMCFCHREV- 253

Query: 200 VGVGYTCPRCKARVCELPTDCRICGLQ 226
           +   Y CP C +  CE    C  CG++
Sbjct: 254 ISTAYVCPVCLSLFCEFRPICSTCGIR 280


>gi|170596989|ref|XP_001902970.1| Zinc finger, C2H2 type family protein [Brugia malayi]
 gi|158589013|gb|EDP28181.1| Zinc finger, C2H2 type family protein [Brugia malayi]
          Length = 179

 Score = 38.9 bits (89), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 46/100 (46%), Gaps = 18/100 (18%)

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLL--------SSGNKPGLYVACPKCKKH 291
           L PI    EV  L   + R R +  C  C++ ++         +G KP    +C  CKK+
Sbjct: 52  LVPIEKLMEVVKLSKQERRTRKKQRCEICRKEVIYMKSHMTTHTGEKP---YSCSICKKN 108

Query: 292 FCLECDIYIH------ESLHNCPGC-ESLRHSNPIVANEG 324
           F    D+  H      E LH+CP C +S  H + ++++ G
Sbjct: 109 FTQFGDMKKHMMIHTGEKLHSCPICRKSFTHKHHLLSHMG 148


>gi|429729565|ref|ZP_19264224.1| von Willebrand factor type A domain protein [Corynebacterium durum
           F0235]
 gi|429149589|gb|EKX92567.1| von Willebrand factor type A domain protein [Corynebacterium durum
           F0235]
          Length = 747

 Score = 38.9 bits (89), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 37/148 (25%), Positives = 64/148 (43%), Gaps = 20/148 (13%)

Query: 30  QIGLVTVKDGVAN-------------CLTDLGGSPESHIKALMGKLGCSGDSSLQNALDL 76
           Q+ L+T   G  N              L  LGGS +  I A +  L  SG + +  AL  
Sbjct: 58  QLALLTYGTGTGNSDEEKEAGCKDVQTLVPLGGSRD-EIAAQIDSLVASGYTPIGPALLA 116

Query: 77  VQGLL-SQIPSYGHREVLILYSALSTCDPGDIMETIQKC--KESKIRCSVIGLSAEMFI- 132
            +  L ++      R ++++   + TC P ++ E  +    + S +   V+GL+A+  + 
Sbjct: 117 AEKTLDTKKTEKNKRHIVLVSDGIDTCAPPEMAEVARDIHKRNSNVTIDVVGLNADETVR 176

Query: 133 --CKHLCQDTGGSYSVALDESHFKELIM 158
              K L    GG+Y+ A DE+    L+ 
Sbjct: 177 EQLKELAAAGGGTYADATDEASVSTLVT 204


>gi|303314233|ref|XP_003067125.1| Transcription factor Tfb4 family protein [Coccidioides posadasii
           C735 delta SOWgp]
 gi|240106793|gb|EER24980.1| Transcription factor Tfb4 family protein [Coccidioides posadasii
           C735 delta SOWgp]
 gi|320037386|gb|EFW19323.1| transcription factor TFIIH subunit Tfb4 [Coccidioides posadasii
           str. Silveira]
          Length = 395

 Score = 38.5 bits (88), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 50/123 (40%), Gaps = 10/123 (8%)

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA 166
           IM +I  C+   I   +  LS +    +  C  T G Y        F + +M    P   
Sbjct: 232 IMNSIFACQRLHIPIDICKLSGDAVFLQQACDATRGIYVPVDHPRGFLQYLMVAFLP--- 288

Query: 167 IAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRICG 224
             + +  +LI    P R      + C CH++V V VG+ C  C +  CE P   DC  CG
Sbjct: 289 -DQRSRRHLI---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGADCLTCG 343

Query: 225 LQL 227
             L
Sbjct: 344 THL 346


>gi|119174356|ref|XP_001239539.1| hypothetical protein CIMG_09160 [Coccidioides immitis RS]
 gi|392869735|gb|EAS28255.2| transcription factor tfb4 [Coccidioides immitis RS]
          Length = 396

 Score = 38.5 bits (88), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 50/123 (40%), Gaps = 10/123 (8%)

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA 166
           IM +I  C+   I   +  LS +    +  C  T G Y        F + +M    P   
Sbjct: 233 IMNSIFACQRLHIPIDICKLSGDAVFLQQACDATRGIYVPVDHPRGFLQYLMVAFLP--- 289

Query: 167 IAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRICG 224
             + +  +LI    P R      + C CH++V V VG+ C  C +  CE P   DC  CG
Sbjct: 290 -DQRSRRHLI---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGADCLTCG 344

Query: 225 LQL 227
             L
Sbjct: 345 THL 347


>gi|429730769|ref|ZP_19265415.1| von Willebrand factor type A domain protein [Corynebacterium durum
           F0235]
 gi|429147207|gb|EKX90237.1| von Willebrand factor type A domain protein [Corynebacterium durum
           F0235]
          Length = 661

 Score = 38.5 bits (88), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 21/81 (25%), Positives = 44/81 (54%), Gaps = 5/81 (6%)

Query: 83  QIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC--SVIGLSAEMFICKHL---C 137
           ++PS G R+++++   + TC P  + E  ++ ++  +    +VIGL+ +      L    
Sbjct: 148 ELPSQGARQIVLVSDGIDTCAPPPVCEVAKQIRQRGVDVVINVIGLNVDDQARSELQCVA 207

Query: 138 QDTGGSYSVALDESHFKELIM 158
           ++ GGSY+ A D +  KE ++
Sbjct: 208 KEGGGSYADAKDAASLKEQLV 228


>gi|170585154|ref|XP_001897351.1| Zinc finger, C2H2 type family protein [Brugia malayi]
 gi|158595226|gb|EDP33795.1| Zinc finger, C2H2 type family protein [Brugia malayi]
          Length = 764

 Score = 38.5 bits (88), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 46/100 (46%), Gaps = 18/100 (18%)

Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLL--------SSGNKPGLYVACPKCKKH 291
           L PI    EV  L   + R R +  C  C++ ++         +G KP    +C  CKK+
Sbjct: 310 LVPIEKLMEVVKLSKQERRTRKKQRCEICRKEVIYMKSHMTTHTGEKP---YSCSICKKN 366

Query: 292 FCLECDIYIH------ESLHNCPGC-ESLRHSNPIVANEG 324
           F    D+  H      E LH+CP C +S  H + ++++ G
Sbjct: 367 FTQFGDMKKHMMIHTGEKLHSCPICRKSFTHKHHLLSHMG 406


>gi|255953875|ref|XP_002567690.1| Pc21g06450 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211589401|emb|CAP95542.1| Pc21g06450 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 374

 Score = 37.7 bits (86), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 51/125 (40%), Gaps = 14/125 (11%)

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESH--FKELIMEHAPPP 164
           +M +I  C+   I   V  LS +    +     T G Y +AL E     + L+M   P  
Sbjct: 222 VMNSIFACQRLSIPIDVCKLSGDAVFLQQASDATKGVY-MALAEPRGLLQYLMMAFLPDQ 280

Query: 165 PAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRI 222
            +     +   + + F  RA       C CH+ V V VG+ C  C +  CE P   DC  
Sbjct: 281 RSRRHLVLPTRVDVDF--RAA------CFCHRRV-VDVGFVCSICLSIFCEPPPGGDCMT 331

Query: 223 CGLQL 227
           CG  L
Sbjct: 332 CGSHL 336


>gi|121713686|ref|XP_001274454.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
           clavatus NRRL 1]
 gi|119402607|gb|EAW13028.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
           clavatus NRRL 1]
          Length = 386

 Score = 37.4 bits (85), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 35/132 (26%), Positives = 53/132 (40%), Gaps = 12/132 (9%)

Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSY-SVALDESHFKELIMEHAPPPP 165
           IM  I  C+   +   V  LS +    +     T G Y S+A      + L+M   P   
Sbjct: 232 IMNAIFACQRLHVPIDVCKLSGDAVFLQQASDATKGVYMSLAEPRGLLQYLMMAFLPDQR 291

Query: 166 AIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTD--CRIC 223
           +     +   + + F  RA       C CH+ V V +G+ C  C +  CE P +  C  C
Sbjct: 292 SRRHLVLPTRVDVDF--RAA------CFCHRRV-VDIGFVCSICLSIFCEPPENGGCLTC 342

Query: 224 GLQLVSSPHLAR 235
           G  L +  + AR
Sbjct: 343 GTHLETGDYGAR 354


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.138    0.436 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,177,347,693
Number of Sequences: 23463169
Number of extensions: 209273081
Number of successful extensions: 522464
Number of sequences better than 100.0: 540
Number of HSP's better than 100.0 without gapping: 420
Number of HSP's successfully gapped in prelim test: 120
Number of HSP's that attempted gapping in prelim test: 520237
Number of HSP's gapped (non-prelim): 899
length of query: 324
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 182
effective length of database: 9,027,425,369
effective search space: 1642991417158
effective search space used: 1642991417158
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 77 (34.3 bits)