BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 020564
(324 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297746227|emb|CBI16283.3| unnamed protein product [Vitis vinifera]
Length = 494
Score = 573 bits (1477), Expect = e-161, Method: Compositional matrix adjust.
Identities = 272/323 (84%), Positives = 294/323 (91%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MDF+PSRMAVVAK +EAF+REFFDQNPLSQIGLVT+KDG+A CLTDLGGSP+SH+KALMG
Sbjct: 171 MDFKPSRMAVVAKHIEAFIREFFDQNPLSQIGLVTIKDGLAQCLTDLGGSPDSHVKALMG 230
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL CSGDSSLQNALDLV G L+QIPSYGHREVLILYSALSTCDPGDIMETIQ+CK+SKIR
Sbjct: 231 KLECSGDSSLQNALDLVHGYLNQIPSYGHREVLILYSALSTCDPGDIMETIQECKKSKIR 290
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSVIGLSAE+FIC+HLCQ+TGGSYSVALDESHFKEL++EHAPPPPAIAEFAIANLIKMGF
Sbjct: 291 CSVIGLSAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIAEFAIANLIKMGF 350
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
PQRA EG ISICSCHKE KVG GYTCPRCKARVCELPT+CRICGL LVSSPHLARSYHHL
Sbjct: 351 PQRAAEGVISICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLVSSPHLARSYHHL 410
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
FPI PFDEV+ LN+P RS CFGCQ+SLL GNKP L VACPKCK+HFCL+CDIYI
Sbjct: 411 FPIPPFDEVSLSLLNNPHQRSSRACFGCQESLLIPGNKPTLCVACPKCKQHFCLDCDIYI 470
Query: 301 HESLHNCPGCESLRHSNPIVANE 323
HESLHNCPGCES RHS + E
Sbjct: 471 HESLHNCPGCESFRHSKIVSVTE 493
>gi|225435253|ref|XP_002284994.1| PREDICTED: general transcription factor IIH subunit 2 [Vitis
vinifera]
Length = 433
Score = 573 bits (1476), Expect = e-161, Method: Compositional matrix adjust.
Identities = 272/323 (84%), Positives = 294/323 (91%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MDF+PSRMAVVAK +EAF+REFFDQNPLSQIGLVT+KDG+A CLTDLGGSP+SH+KALMG
Sbjct: 110 MDFKPSRMAVVAKHIEAFIREFFDQNPLSQIGLVTIKDGLAQCLTDLGGSPDSHVKALMG 169
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL CSGDSSLQNALDLV G L+QIPSYGHREVLILYSALSTCDPGDIMETIQ+CK+SKIR
Sbjct: 170 KLECSGDSSLQNALDLVHGYLNQIPSYGHREVLILYSALSTCDPGDIMETIQECKKSKIR 229
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSVIGLSAE+FIC+HLCQ+TGGSYSVALDESHFKEL++EHAPPPPAIAEFAIANLIKMGF
Sbjct: 230 CSVIGLSAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIAEFAIANLIKMGF 289
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
PQRA EG ISICSCHKE KVG GYTCPRCKARVCELPT+CRICGL LVSSPHLARSYHHL
Sbjct: 290 PQRAAEGVISICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLVSSPHLARSYHHL 349
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
FPI PFDEV+ LN+P RS CFGCQ+SLL GNKP L VACPKCK+HFCL+CDIYI
Sbjct: 350 FPIPPFDEVSLSLLNNPHQRSSRACFGCQESLLIPGNKPTLCVACPKCKQHFCLDCDIYI 409
Query: 301 HESLHNCPGCESLRHSNPIVANE 323
HESLHNCPGCES RHS + E
Sbjct: 410 HESLHNCPGCESFRHSKIVSVTE 432
>gi|224119296|ref|XP_002331276.1| predicted protein [Populus trichocarpa]
gi|222873701|gb|EEF10832.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 563 bits (1450), Expect = e-158, Method: Compositional matrix adjust.
Identities = 269/316 (85%), Positives = 289/316 (91%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MDFRPSRMAVVA+ VEAF+REFFDQNPLSQI LVT+KDGVA LT+LGGSPESHIKALM
Sbjct: 97 MDFRPSRMAVVAQNVEAFIREFFDQNPLSQIALVTIKDGVAYSLTELGGSPESHIKALMA 156
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL CSGDSSLQNAL+LV L +IPSYG+REVLILYSAL+TCDPGDIMETIQKCK+SK+R
Sbjct: 157 KLECSGDSSLQNALELVHEYLDKIPSYGNREVLILYSALTTCDPGDIMETIQKCKKSKMR 216
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSVIGLSAEMFICKHLCQ+TGG YSVALDESHFKELI+EHAPPPPAIAEFAIANLIKMGF
Sbjct: 217 CSVIGLSAEMFICKHLCQETGGLYSVALDESHFKELILEHAPPPPAIAEFAIANLIKMGF 276
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
PQRA EGSISICSCHKE KVG GY CPRCKARVCELPT+CRICGL LVSSPHLARSYHHL
Sbjct: 277 PQRAAEGSISICSCHKESKVGEGYICPRCKARVCELPTECRICGLTLVSSPHLARSYHHL 336
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
FPIAPFDEV P N+P RS+ TCFGCQQSL++ GNKP L VACPKCK++FCL+CDIYI
Sbjct: 337 FPIAPFDEVKPSRQNEPHRRSQKTCFGCQQSLVNPGNKPSLQVACPKCKQYFCLDCDIYI 396
Query: 301 HESLHNCPGCESLRHS 316
HESLHNCPGCESLR S
Sbjct: 397 HESLHNCPGCESLRAS 412
>gi|356571915|ref|XP_003554116.1| PREDICTED: general transcription factor IIH subunit 2-like [Glycine
max]
Length = 420
Score = 557 bits (1435), Expect = e-156, Method: Compositional matrix adjust.
Identities = 257/322 (79%), Positives = 292/322 (90%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DFRPSRMAV+ KQVEAF+REFFDQNPLS +GLVT+KDG+A+C+T+LGGSPESHIKALMGK
Sbjct: 99 DFRPSRMAVMGKQVEAFIREFFDQNPLSHVGLVTIKDGIAHCITELGGSPESHIKALMGK 158
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
L CSGD+SLQNAL+LV G L+QIPSYGHREVLILYSALSTCDPGD+METIQKCK+SKIRC
Sbjct: 159 LECSGDASLQNALELVLGYLNQIPSYGHREVLILYSALSTCDPGDLMETIQKCKKSKIRC 218
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
SVIGL+AEMF+CKHLCQ+TGG+YSVALDESHFKELI+EHAPPPPAIAE+A ANLIKMGFP
Sbjct: 219 SVIGLAAEMFVCKHLCQETGGTYSVALDESHFKELILEHAPPPPAIAEYATANLIKMGFP 278
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
QR+ EGS++IC+CH+E K G GYTCPRCK RVCELPT+CR+CGL L+SSPHLARSYHHLF
Sbjct: 279 QRSAEGSVAICTCHEEAKTGGGYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLF 338
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
PI FDEVTP ND + +TCFGCQQSLLS GNKPGL V CPKCK+ FCL+CDIY+H
Sbjct: 339 PIVMFDEVTPSSQNDSNHSFPNTCFGCQQSLLSQGNKPGLSVICPKCKQQFCLDCDIYVH 398
Query: 302 ESLHNCPGCESLRHSNPIVANE 323
ESLHNCPGCES RHS + A++
Sbjct: 399 ESLHNCPGCESSRHSKSVTASQ 420
>gi|357509341|ref|XP_003624959.1| General transcription factor IIH subunit [Medicago truncatula]
gi|355499974|gb|AES81177.1| General transcription factor IIH subunit [Medicago truncatula]
Length = 426
Score = 550 bits (1417), Expect = e-154, Method: Compositional matrix adjust.
Identities = 257/323 (79%), Positives = 288/323 (89%), Gaps = 4/323 (1%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DFRPSRMAV+AKQVE F+REFFDQNPLS +GLVT KDGVANCLTDLGGSPESHIKALMGK
Sbjct: 101 DFRPSRMAVIAKQVELFIREFFDQNPLSHVGLVTTKDGVANCLTDLGGSPESHIKALMGK 160
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
L CSGD+SLQNAL+LV L+QIPSYGHREVLILYSALSTCDPGD+METIQKCK+SKIRC
Sbjct: 161 LECSGDASLQNALELVHSNLNQIPSYGHREVLILYSALSTCDPGDLMETIQKCKKSKIRC 220
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
SVIGL+AEMFICKHLCQ+TGG+YSVALDESHFKELI+EH+PPPPAIAE+A ANLIKMGFP
Sbjct: 221 SVIGLAAEMFICKHLCQETGGTYSVALDESHFKELILEHSPPPPAIAEYATANLIKMGFP 280
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
QRA EGS++IC+CH+E K G GYTCPRCK RVCELPT+CR+CGL L+SSPHLARSYHHLF
Sbjct: 281 QRAAEGSVAICTCHEEAKTGGGYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLF 340
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLS----SGNKPGLYVACPKCKKHFCLECD 297
PI PF E++P NDP + +TCFGCQQSLLS +GNK L V+CPKCK+ FCL+CD
Sbjct: 341 PIVPFAEISPSSQNDPNHSFPNTCFGCQQSLLSQGFGAGNKAELSVSCPKCKQQFCLDCD 400
Query: 298 IYIHESLHNCPGCESLRHSNPIV 320
+YIHESLHNCPGCES RHS +
Sbjct: 401 MYIHESLHNCPGCESFRHSKSVT 423
>gi|356538258|ref|XP_003537621.1| PREDICTED: general transcription factor IIH subunit 2-like [Glycine
max]
Length = 419
Score = 546 bits (1408), Expect = e-153, Method: Compositional matrix adjust.
Identities = 255/325 (78%), Positives = 292/325 (89%), Gaps = 7/325 (2%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DFRPSRM V+ KQVEAF+REFFDQNPLS +GLVT+KDG+A+C+T+LGGSPESHIKALMGK
Sbjct: 99 DFRPSRMVVMGKQVEAFIREFFDQNPLSHVGLVTIKDGIAHCITELGGSPESHIKALMGK 158
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
L CSGD+SLQNAL+LV G L+QIPSYGHREVLILYSALSTCDPGD+METIQKCK+SKIRC
Sbjct: 159 LECSGDASLQNALELVLGYLNQIPSYGHREVLILYSALSTCDPGDLMETIQKCKKSKIRC 218
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
SVIGL+AEMF+CKHLC++TGG+YSVALDESHFKELI+EHAPPPPAIAE++ ANLIKMGFP
Sbjct: 219 SVIGLAAEMFVCKHLCEETGGTYSVALDESHFKELILEHAPPPPAIAEYSTANLIKMGFP 278
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
QR+ EGS++IC+CH+E K G GYTCPRCK RVCELPT+CR+CGL L+SSPHLARSYHHLF
Sbjct: 279 QRSAEGSVAICTCHEEAKTGGGYTCPRCKVRVCELPTECRVCGLTLISSPHLARSYHHLF 338
Query: 242 PIAPFDEVTPLCLNDPRNRSRS---TCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
PI FDEVTP ++ SRS TCFGCQQSLLS GNKPGL V CPKCK+ FCL+CDI
Sbjct: 339 PIVMFDEVTP----SQKDSSRSFPNTCFGCQQSLLSQGNKPGLSVICPKCKQQFCLDCDI 394
Query: 299 YIHESLHNCPGCESLRHSNPIVANE 323
Y+HESLHNCPGCES RHS + A++
Sbjct: 395 YVHESLHNCPGCESSRHSKSVTASQ 419
>gi|449451946|ref|XP_004143721.1| PREDICTED: general transcription factor IIH subunit 2-like [Cucumis
sativus]
Length = 423
Score = 544 bits (1401), Expect = e-152, Method: Compositional matrix adjust.
Identities = 255/325 (78%), Positives = 289/325 (88%), Gaps = 2/325 (0%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MDFRPSRMAVVAK V+AFVREFFDQNPLSQIGLVT+KDG ANCLTDLGGSPESH+KALMG
Sbjct: 99 MDFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMG 158
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL CSGD+SLQN L+LV L+QIPSYGHREVL+LYSAL++CDPGDIMET+QKCK SKIR
Sbjct: 159 KLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSKIR 218
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSVIGL+AE+FIC+HLCQ+TGGSYSVALDESHFKEL++EHAPPPPAIA+ A+ NLIKMGF
Sbjct: 219 CSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGF 278
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
PQRA E SI+ICSCHKE KVG GYTCPRCKARVCELPT+CRICGL L+SSPHLARSYHHL
Sbjct: 279 PQRAAESSIAICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHL 338
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLL--SSGNKPGLYVACPKCKKHFCLECDI 298
FPI PFDEV+ +DPR++ CFGCQ+SL+ S+GN P + V+CPKCK+HFCL+CDI
Sbjct: 339 FPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDI 398
Query: 299 YIHESLHNCPGCESLRHSNPIVANE 323
YIHESLHNCPGCES R ++E
Sbjct: 399 YIHESLHNCPGCESFRRPKLATSDE 423
>gi|449525401|ref|XP_004169706.1| PREDICTED: general transcription factor IIH subunit 2-like, partial
[Cucumis sativus]
Length = 329
Score = 540 bits (1390), Expect = e-151, Method: Compositional matrix adjust.
Identities = 255/325 (78%), Positives = 289/325 (88%), Gaps = 2/325 (0%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MDFRPSRMAVVAK V+AFVREFFDQNPLSQIGLVT+KDG ANCLTDLGGSPESH+KALMG
Sbjct: 5 MDFRPSRMAVVAKHVDAFVREFFDQNPLSQIGLVTIKDGFANCLTDLGGSPESHVKALMG 64
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL CSGD+SLQN L+LV L+QIPSYGHREVL+LYSAL++CDPGDIMET+QKCK SKIR
Sbjct: 65 KLECSGDASLQNGLELVHSYLNQIPSYGHREVLVLYSALNSCDPGDIMETVQKCKTSKIR 124
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSVIGL+AE+FIC+HLCQ+TGGSYSVALDESHFKEL++EHAPPPPAIA+ A+ NLIKMGF
Sbjct: 125 CSVIGLTAEIFICRHLCQETGGSYSVALDESHFKELLLEHAPPPPAIADSAMPNLIKMGF 184
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
PQRA E SI+ICSCHKE KVG GYTCPRCKARVCELPT+CRICGL L+SSPHLARSYHHL
Sbjct: 185 PQRAAESSIAICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLISSPHLARSYHHL 244
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLL--SSGNKPGLYVACPKCKKHFCLECDI 298
FPI PFDEV+ +DPR++ CFGCQ+SL+ S+GN P + V+CPKCK+HFCL+CDI
Sbjct: 245 FPIIPFDEVSDKVFHDPRHQLPKVCFGCQESLMNPSTGNSPSIRVSCPKCKQHFCLDCDI 304
Query: 299 YIHESLHNCPGCESLRHSNPIVANE 323
YIHESLHNCPGCES R ++E
Sbjct: 305 YIHESLHNCPGCESFRRPKLATSDE 329
>gi|297848764|ref|XP_002892263.1| ATGTF2H2/GTF2H2 [Arabidopsis lyrata subsp. lyrata]
gi|297338105|gb|EFH68522.1| ATGTF2H2/GTF2H2 [Arabidopsis lyrata subsp. lyrata]
Length = 421
Score = 529 bits (1362), Expect = e-148, Method: Compositional matrix adjust.
Identities = 252/314 (80%), Positives = 282/314 (89%), Gaps = 1/314 (0%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MDFRPSRMA++AK VEAF+REFFDQNPLSQIGLV++K+GVA+ LTDLGGSPE+HIKALMG
Sbjct: 97 MDFRPSRMAIMAKHVEAFIREFFDQNPLSQIGLVSIKNGVAHTLTDLGGSPETHIKALMG 156
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL GDSSLQNAL+LV L+Q+PSYGHREVLILYSAL TCDPGDIMETIQKCK+SK+R
Sbjct: 157 KLEALGDSSLQNALELVHEHLNQVPSYGHREVLILYSALCTCDPGDIMETIQKCKKSKLR 216
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSVIGLSAEMFICKHLCQ+TGG YSVA+DE H K+L++EHAPPPPAIAEFAIANLIKMGF
Sbjct: 217 CSVIGLSAEMFICKHLCQETGGLYSVAVDEVHLKDLLLEHAPPPPAIAEFAIANLIKMGF 276
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
PQRA EGS++ICSCHKEVK+G GY CPRCKARVC+LPT+C ICGL LVSSPHLARSYHHL
Sbjct: 277 PQRAAEGSMAICSCHKEVKIGAGYMCPRCKARVCDLPTECTICGLTLVSSPHLARSYHHL 336
Query: 241 FPIAPFDEVTPL-CLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
FPIAPFDEV L LND R + +CFGCQQSL+ +GNKPG V C KCK +FCL+CDIY
Sbjct: 337 FPIAPFDEVPALSSLNDNRRKLGKSCFGCQQSLIGAGNKPGPCVTCRKCKHYFCLDCDIY 396
Query: 300 IHESLHNCPGCESL 313
IHESLHNCPGCES+
Sbjct: 397 IHESLHNCPGCESI 410
>gi|30679101|ref|NP_683275.2| transcription initiation factor TFIIH subunit H2 [Arabidopsis
thaliana]
gi|22087278|gb|AAM90909.1|AF499443_1 p44/SSL1-like protein [Arabidopsis thaliana]
gi|4056421|gb|AAC97995.1| Similar to gb|Z30094 basic transcripion factor 2, 44 kD subunit
from Homo sapiens. EST gb|W43325 comes from this gene
[Arabidopsis thaliana]
gi|110738828|dbj|BAF01337.1| putative transcription factor [Arabidopsis thaliana]
gi|332189663|gb|AEE27784.1| transcription initiation factor TFIIH subunit H2 [Arabidopsis
thaliana]
Length = 421
Score = 526 bits (1355), Expect = e-147, Method: Compositional matrix adjust.
Identities = 251/314 (79%), Positives = 281/314 (89%), Gaps = 1/314 (0%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MDFRPSRMA++AK VEAF+REFFDQNPLSQIGLV++K+GVA+ LTDLGGSPE+HIKALMG
Sbjct: 97 MDFRPSRMAIMAKHVEAFIREFFDQNPLSQIGLVSIKNGVAHTLTDLGGSPETHIKALMG 156
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL GDSSLQNAL+LV L+Q+PSYGHREVLILYSAL TCDPGDIMETIQKCK+SK+R
Sbjct: 157 KLEALGDSSLQNALELVHEHLNQVPSYGHREVLILYSALCTCDPGDIMETIQKCKKSKLR 216
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSVIGLSAEMFICKHLCQ+TGG YSVA+DE H K+L++EHAPPPPAIAEFAIANLIKMGF
Sbjct: 217 CSVIGLSAEMFICKHLCQETGGLYSVAVDEVHLKDLLLEHAPPPPAIAEFAIANLIKMGF 276
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
PQRA EGS++ICSCHKEVK+G GY CPRCKARVC+LPT+C ICGL LVSSPHLARSYHHL
Sbjct: 277 PQRAAEGSMAICSCHKEVKIGAGYMCPRCKARVCDLPTECTICGLTLVSSPHLARSYHHL 336
Query: 241 FPIAPFDEVTPL-CLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
FPIAPFDEV L LND R + +CFGCQQSL+ +GNKP V C KCK +FCL+CDIY
Sbjct: 337 FPIAPFDEVPALSSLNDNRRKLGKSCFGCQQSLIGAGNKPVPCVTCRKCKHYFCLDCDIY 396
Query: 300 IHESLHNCPGCESL 313
IHESLHNCPGCES+
Sbjct: 397 IHESLHNCPGCESI 410
>gi|357164610|ref|XP_003580110.1| PREDICTED: general transcription factor IIH subunit 2-like
[Brachypodium distachyon]
Length = 428
Score = 503 bits (1296), Expect = e-140, Method: Compositional matrix adjust.
Identities = 236/315 (74%), Positives = 274/315 (86%), Gaps = 5/315 (1%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MD+RPSRMAVVAK EAF+REFFDQNPLS +GLVT+KDG+++ LTD+GGSPES IKALMG
Sbjct: 114 MDYRPSRMAVVAKCAEAFIREFFDQNPLSHVGLVTIKDGISHRLTDIGGSPESQIKALMG 173
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL CSGDSSLQNAL+L+ G L+QIPSYGH+EVLILYSAL+TCDPGD+MET++KCK+SKIR
Sbjct: 174 KLECSGDSSLQNALELIHGYLNQIPSYGHKEVLILYSALNTCDPGDVMETVEKCKKSKIR 233
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSVIGL+AE+FICKHLC++TGGSY+VALDESHFKEL++EHAPPPPAIAE+A ANLIKMGF
Sbjct: 234 CSVIGLAAEIFICKHLCEETGGSYTVALDESHFKELLLEHAPPPPAIAEYAAANLIKMGF 293
Query: 181 PQRAGEGSISICSCHKEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
PQR E ISICSCHK++K G GY CPRCK VCELPT+CR CGL LVSSPHLARSYHH
Sbjct: 294 PQRGAEDLISICSCHKKIKSGAEGYICPRCKVNVCELPTECRTCGLTLVSSPHLARSYHH 353
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
LFP+APFDEVT + C+GCQQSL++S ++ L+V CPKC +HFC +CDIY
Sbjct: 354 LFPVAPFDEVT----FKLGQKGGQNCYGCQQSLINSDSQSNLHVRCPKCNQHFCFDCDIY 409
Query: 300 IHESLHNCPGCESLR 314
IHESLHNCPGCES R
Sbjct: 410 IHESLHNCPGCESQR 424
>gi|242076424|ref|XP_002448148.1| hypothetical protein SORBIDRAFT_06g022140 [Sorghum bicolor]
gi|241939331|gb|EES12476.1| hypothetical protein SORBIDRAFT_06g022140 [Sorghum bicolor]
Length = 431
Score = 503 bits (1296), Expect = e-140, Method: Compositional matrix adjust.
Identities = 239/316 (75%), Positives = 272/316 (86%), Gaps = 3/316 (0%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MD+RPSRMAVVAK EAF+REFFDQNPLS +GLVT+KDG+A+ LT++GGSPES IKALMG
Sbjct: 115 MDYRPSRMAVVAKHAEAFIREFFDQNPLSHVGLVTIKDGIAHRLTEIGGSPESQIKALMG 174
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL CSGDSSLQNAL+LV G L Q+PSYGH+EVLILYSAL+T DPGDIMETI KCK+SKIR
Sbjct: 175 KLECSGDSSLQNALELVHGYLDQVPSYGHKEVLILYSALNTFDPGDIMETITKCKKSKIR 234
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSVIGL+AE+FICKHLC++TGGSY+VALDESHFKEL++EHAPPPPAIAE+A ANLIKMGF
Sbjct: 235 CSVIGLAAEIFICKHLCEETGGSYTVALDESHFKELLLEHAPPPPAIAEYAAANLIKMGF 294
Query: 181 PQRAGEGSISICSCHKEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
PQR E ISICSCHK++K G GY CPRCK VCELPT+CR CGL LVSSPHLARSYHH
Sbjct: 295 PQRGAEDLISICSCHKKIKSGAEGYICPRCKVNVCELPTECRTCGLTLVSSPHLARSYHH 354
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
LFP+APFDEVTP + + R CF CQQSL + + L+V CPKC +HFCL+CDIY
Sbjct: 355 LFPVAPFDEVTP--VPNRIQRGGQICFSCQQSLFNPDGQSNLHVRCPKCNQHFCLDCDIY 412
Query: 300 IHESLHNCPGCESLRH 315
IHESLHNCPGCES R+
Sbjct: 413 IHESLHNCPGCESQRN 428
>gi|226531944|ref|NP_001142175.1| uncharacterized protein LOC100274342 [Zea mays]
gi|194707472|gb|ACF87820.1| unknown [Zea mays]
gi|414586431|tpg|DAA37002.1| TPA: hypothetical protein ZEAMMB73_625811 [Zea mays]
Length = 428
Score = 499 bits (1284), Expect = e-139, Method: Compositional matrix adjust.
Identities = 238/315 (75%), Positives = 269/315 (85%), Gaps = 3/315 (0%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MD+RPSRMAVVAK EAF+REFFDQNPLS +GLVT+KDG+A+ LT++GGSPES IKALMG
Sbjct: 112 MDYRPSRMAVVAKHAEAFIREFFDQNPLSHVGLVTIKDGIAHRLTEIGGSPESQIKALMG 171
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL CSGDSSLQNALDLV G L Q+PSYGH+EVLILYSAL+T DPGDI ETI KCK+SKIR
Sbjct: 172 KLECSGDSSLQNALDLVHGYLDQVPSYGHKEVLILYSALNTFDPGDITETIAKCKKSKIR 231
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSVIGL+AE+FICKHLC++TGGSY+VALDESHFKEL++EHAPPPPAIAE+A ANLIKMGF
Sbjct: 232 CSVIGLAAEIFICKHLCEETGGSYTVALDESHFKELLLEHAPPPPAIAEYAAANLIKMGF 291
Query: 181 PQRAGEGSISICSCHKEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
PQR E ISICSCHK++K G GY CPRCK VCELPT+CR CGL LVSSPHLARSYHH
Sbjct: 292 PQRGAEELISICSCHKKIKSGAEGYICPRCKVNVCELPTECRTCGLTLVSSPHLARSYHH 351
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
LFP+ PFDEVTP + + R CF CQQSL + + L+V CPKC +HFCL+CDIY
Sbjct: 352 LFPVPPFDEVTP--VPNRIQRGGQICFSCQQSLYNPDGQSNLHVRCPKCNQHFCLDCDIY 409
Query: 300 IHESLHNCPGCESLR 314
IHESLHNCPGCES R
Sbjct: 410 IHESLHNCPGCESQR 424
>gi|326529607|dbj|BAK04750.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 498 bits (1283), Expect = e-138, Method: Compositional matrix adjust.
Identities = 235/318 (73%), Positives = 273/318 (85%), Gaps = 5/318 (1%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MD+RPSRMAVVAK EAF+REFFDQNPLS +GLVT+KDG+++ LT++GGSPES I ALMG
Sbjct: 116 MDYRPSRMAVVAKCAEAFIREFFDQNPLSHVGLVTIKDGISHRLTEIGGSPESQINALMG 175
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL CSGDSSLQNAL+LV G L+QIPSYGH+EVL LYSAL+TCDPGDIMETI+KCK+SK+R
Sbjct: 176 KLECSGDSSLQNALELVHGYLNQIPSYGHKEVLFLYSALNTCDPGDIMETIEKCKKSKVR 235
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSVIGL+AE+FICKHLC++TGGSY+VALDESHFKEL++EHAPPPPAIAE+A ANLIKMGF
Sbjct: 236 CSVIGLAAEIFICKHLCEETGGSYTVALDESHFKELLLEHAPPPPAIAEYAAANLIKMGF 295
Query: 181 PQRAGEGSISICSCHKEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
PQR E ISICSCHK++K G GY CPRCK VCELPT+CR CGL LVSSPHLARSYHH
Sbjct: 296 PQRGAEDLISICSCHKKIKSGAEGYICPRCKVNVCELPTECRTCGLTLVSSPHLARSYHH 355
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
LFP++PFDEVT + CFGCQQSL+++G + ++V CPKC HFC +CDIY
Sbjct: 356 LFPVSPFDEVT----FKLGQKGGQNCFGCQQSLINTGGQSNIHVRCPKCNHHFCFDCDIY 411
Query: 300 IHESLHNCPGCESLRHSN 317
IHESLHNCPGCES R S+
Sbjct: 412 IHESLHNCPGCESQRSSS 429
>gi|32489532|emb|CAE04735.1| OSJNBa0043L24.23 [Oryza sativa Japonica Group]
gi|116310777|emb|CAH67570.1| OSIGBa0101P20.13 [Oryza sativa Indica Group]
Length = 432
Score = 492 bits (1267), Expect = e-137, Method: Compositional matrix adjust.
Identities = 230/318 (72%), Positives = 271/318 (85%), Gaps = 1/318 (0%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MD+RPSRMAVVAK E F+REFFDQNPLS +G+VT+KDG+++ LT++GGSPES IKALMG
Sbjct: 115 MDYRPSRMAVVAKYAEVFIREFFDQNPLSHVGIVTMKDGISHRLTEIGGSPESQIKALMG 174
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL CSG+ SLQNAL+LV G L Q+PSYGH+EVL LYSAL+TCDPGDIMETI KCK+SKIR
Sbjct: 175 KLECSGEPSLQNALELVHGYLDQVPSYGHKEVLFLYSALNTCDPGDIMETIAKCKKSKIR 234
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSVIGL+AE+FICK+LC++TGGSY+VALDESHFKEL++EHAPPPPAIAE+A ANLIKMGF
Sbjct: 235 CSVIGLAAEIFICKYLCEETGGSYTVALDESHFKELLLEHAPPPPAIAEYAAANLIKMGF 294
Query: 181 PQRAGEGSISICSCHKEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
PQR E ISICSCHK++K G GY CPRCK VCELPT+CR CGL LVSSPHLARSYHH
Sbjct: 295 PQRGAEDLISICSCHKKIKSGAEGYICPRCKVNVCELPTECRTCGLTLVSSPHLARSYHH 354
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
LFP+ PFDEV+ + N + C+GCQQS ++ ++ L+V CPKC +HFCL+CDIY
Sbjct: 355 LFPVQPFDEVSSVHPNRLGQKGGQKCYGCQQSFINPDSQSSLHVRCPKCNQHFCLDCDIY 414
Query: 300 IHESLHNCPGCESLRHSN 317
IHESLHNCPGCES R S+
Sbjct: 415 IHESLHNCPGCESQRSSS 432
>gi|255570827|ref|XP_002526366.1| btf, putative [Ricinus communis]
gi|223534325|gb|EEF36037.1| btf, putative [Ricinus communis]
Length = 465
Score = 474 bits (1219), Expect = e-131, Method: Compositional matrix adjust.
Identities = 225/274 (82%), Positives = 251/274 (91%), Gaps = 1/274 (0%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MD +PSRM V+A+ VEAF+REFFDQNPLSQIGLVT+KDGVA+ LT+LGGSPESHIKAL G
Sbjct: 99 MDIKPSRMVVIARHVEAFIREFFDQNPLSQIGLVTIKDGVAHPLTELGGSPESHIKALTG 158
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL C+GDSSLQNALDLV+G L QIP+YGHREVLILYSALSTCDPGDIMETIQ CK+SKIR
Sbjct: 159 KLECAGDSSLQNALDLVRGYLDQIPAYGHREVLILYSALSTCDPGDIMETIQNCKKSKIR 218
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSVIGLSAEMFICKHLCQ+TGG YSVA+DE+H KEL++EHAPPPPAIAEFAIANLIKMGF
Sbjct: 219 CSVIGLSAEMFICKHLCQETGGLYSVAMDETHLKELLLEHAPPPPAIAEFAIANLIKMGF 278
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
PQR+ EGSISICSCHKE KVG GY CPRCKARVC+LPT+CRICGL LVSSPHLARSYHHL
Sbjct: 279 PQRSAEGSISICSCHKEAKVGEGYICPRCKARVCDLPTECRICGLTLVSSPHLARSYHHL 338
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLS 274
FPIAPFDEV+ ++ ++R + TCFGCQQSLL+
Sbjct: 339 FPIAPFDEVSQR-QSELQHRPQKTCFGCQQSLLN 371
>gi|168025049|ref|XP_001765047.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683634|gb|EDQ70042.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 394
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 204/313 (65%), Positives = 256/313 (81%), Gaps = 1/313 (0%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MD RPSRM +V VEAFVREFFDQNPLS +G++ +KDG+A+ LTDL GSPE+HI+AL
Sbjct: 72 MDLRPSRMGIVVDCVEAFVREFFDQNPLSHLGVILLKDGIAHQLTDLSGSPETHIRALRS 131
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
+ +GD+S+QN LDL +G L+QIPSYGHREVL++YSALST DPGD+MET+Q+CK++ IR
Sbjct: 132 NMESTGDASIQNGLDLARGYLTQIPSYGHREVLLVYSALSTIDPGDVMETVQECKKANIR 191
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSV+GLSAE++ICK LC+ TGG YSVA +E H KELIMEHAPPP A AE ++A+L++MGF
Sbjct: 192 CSVVGLSAEIYICKLLCEQTGGMYSVATNEGHLKELIMEHAPPPAAQAEMSVASLVRMGF 251
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
PQR E ++++C+CH+EVK+G GYTCPRCKARVCELPT C ICGL LVSSPHLARSYHHL
Sbjct: 252 PQRGAEDAVALCACHREVKMGGGYTCPRCKARVCELPTQCHICGLTLVSSPHLARSYHHL 311
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNK-PGLYVACPKCKKHFCLECDIY 299
FP++PF+EV L + R +C+GC Q L + G K G+ + CP+C++HFC +CD Y
Sbjct: 312 FPVSPFEEVENLASANGRQSLSKSCYGCLQELPTPGGKVKGVRLECPRCRQHFCFDCDNY 371
Query: 300 IHESLHNCPGCES 312
IHESLHNCPGCES
Sbjct: 372 IHESLHNCPGCES 384
>gi|302791231|ref|XP_002977382.1| hypothetical protein SELMODRAFT_232956 [Selaginella moellendorffii]
gi|300154752|gb|EFJ21386.1| hypothetical protein SELMODRAFT_232956 [Selaginella moellendorffii]
Length = 394
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 191/318 (60%), Positives = 258/318 (81%), Gaps = 11/318 (3%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
+DFRPSRMAVVAK VE FVRE+FDQNPLSQ+G++ +K+G+A+ +T+L GSPE+HI+AL
Sbjct: 79 IDFRPSRMAVVAKAVEDFVREYFDQNPLSQLGIIVMKNGIASVVTELSGSPEAHIRALKS 138
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
L G++SLQN L+L + IPSYGHREV+I++SALSTCDPG+I+ET++KCK ++++
Sbjct: 139 NLESFGEASLQNGLELAHTYVQHIPSYGHREVVIVFSALSTCDPGNILETVKKCKAARMK 198
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSV+GL+AE++ICKH+ Q+TGGSY+VA++E H KE+++EH PPP A+ + + A+L++MGF
Sbjct: 199 CSVVGLTAEIYICKHISQETGGSYAVAMNEGHLKEILLEHVPPPAAMPDASSASLVRMGF 258
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
PQR EG+++IC+CHKE+K+G GY CPRC+ARVCELPT+C +CGL LVSS HLARSYHHL
Sbjct: 259 PQRGSEGAVAICACHKELKIGGGYICPRCRARVCELPTECSLCGLALVSSAHLARSYHHL 318
Query: 241 FPIAPFDEVTPLCLNDPRNRSRS----TCFGCQQSLLSSGNKPGLYVACPKCKKHFCLEC 296
FPI FDE+ L D R +S +CFGC+ L SG+ L + CP+CK+ FC +C
Sbjct: 319 FPIPTFDEL----LVDASLRGKSSFAGSCFGCRIQLSGSGS---LRLRCPRCKRDFCFDC 371
Query: 297 DIYIHESLHNCPGCESLR 314
D+YIHESLHNCPGCE+L+
Sbjct: 372 DVYIHESLHNCPGCETLQ 389
>gi|302786380|ref|XP_002974961.1| hypothetical protein SELMODRAFT_232450 [Selaginella moellendorffii]
gi|300157120|gb|EFJ23746.1| hypothetical protein SELMODRAFT_232450 [Selaginella moellendorffii]
Length = 394
Score = 419 bits (1077), Expect = e-115, Method: Compositional matrix adjust.
Identities = 189/318 (59%), Positives = 257/318 (80%), Gaps = 11/318 (3%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
+DFRPSRMAVVAK VE FVRE+FDQNPLSQ+G++ +K+G+A+ +T+L GSPE+HI+AL
Sbjct: 79 IDFRPSRMAVVAKAVEDFVREYFDQNPLSQLGIIVMKNGIASVVTELSGSPEAHIRALKS 138
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
L G++SLQN L+L + IPSYGHREV+I++SALSTCDPG+I+ET++KCK ++++
Sbjct: 139 NLESFGEASLQNGLELAHTYVQHIPSYGHREVVIVFSALSTCDPGNILETVKKCKAARMK 198
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSV+GL+AE++ICKH+ Q+TGGSY+VA++E H KE+++EH PPP A+ + + A+L++MGF
Sbjct: 199 CSVVGLTAEIYICKHISQETGGSYAVAMNEGHLKEILLEHVPPPAAMPDASSASLVRMGF 258
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
PQR EG+++IC+CHKE+++G GY CPRC+ARVCELPT+C +CGL LVSS HLARSYHHL
Sbjct: 259 PQRGSEGAVAICACHKELRIGGGYICPRCRARVCELPTECSLCGLALVSSAHLARSYHHL 318
Query: 241 FPIAPFDEVTPLCLNDPRNRSRS----TCFGCQQSLLSSGNKPGLYVACPKCKKHFCLEC 296
FPI FDE+ L D R +S +CFGC+ L G+ L + CP+CK+ FC +C
Sbjct: 319 FPIPTFDEL----LVDASLRGKSSFAGSCFGCRIQLSGGGS---LRLRCPRCKRDFCFDC 371
Query: 297 DIYIHESLHNCPGCESLR 314
D+YIHESLHNCPGCE+L+
Sbjct: 372 DVYIHESLHNCPGCETLQ 389
>gi|218195182|gb|EEC77609.1| hypothetical protein OsI_16588 [Oryza sativa Indica Group]
gi|222629181|gb|EEE61313.1| hypothetical protein OsJ_15417 [Oryza sativa Japonica Group]
Length = 509
Score = 353 bits (905), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 163/219 (74%), Positives = 192/219 (87%), Gaps = 1/219 (0%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MD+RPSRMAVVAK E F+REFFDQNPLS +G+VT+KDG+++ LT++GGSPES IKALMG
Sbjct: 115 MDYRPSRMAVVAKYAEVFIREFFDQNPLSHVGIVTMKDGISHRLTEIGGSPESQIKALMG 174
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL CSG+ SLQNAL+LV G L Q+PSYGH+EVL LYSAL+TCDPGDIMETI KCK+SKIR
Sbjct: 175 KLECSGEPSLQNALELVHGYLDQVPSYGHKEVLFLYSALNTCDPGDIMETIAKCKKSKIR 234
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSVIGL+AE+FICK+LC++TGGSY+VALDESHFKEL++EHAPPPPAIAE+A ANLIKMGF
Sbjct: 235 CSVIGLAAEIFICKYLCEETGGSYTVALDESHFKELLLEHAPPPPAIAEYAAANLIKMGF 294
Query: 181 PQRAGEGSISICSCHKEVKVGV-GYTCPRCKARVCELPT 218
PQR E ISICSCHK++K G GY CPRCK + L +
Sbjct: 295 PQRGAEDLISICSCHKKIKSGAEGYICPRCKVNIVNLAS 333
>gi|302830270|ref|XP_002946701.1| hypothetical protein VOLCADRAFT_72869 [Volvox carteri f.
nagariensis]
gi|300267745|gb|EFJ51927.1| hypothetical protein VOLCADRAFT_72869 [Volvox carteri f.
nagariensis]
Length = 430
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 161/329 (48%), Positives = 222/329 (67%), Gaps = 19/329 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL-MG 60
D RP+R+A + +F+REFFDQNPLSQ+G+ ++ G+ LTDL GSPE+ + L
Sbjct: 98 DLRPNRLACILGLCRSFIREFFDQNPLSQLGIAVMRGGLVEKLTDLSGSPEAQVARLDAA 157
Query: 61 KLGC-SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KLG +GD SLQNALDL +L +P YGHRE+L++++AL+TCDPG+++++++ CKE+ I
Sbjct: 158 KLGAPAGDVSLQNALDLSVSVLRSLPPYGHRELLVVFAALATCDPGNVLDSVRACKENNI 217
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SV+G++AE+++C+ + +DT GSY VAL+E+H ++L+M H PPPA A A A L++MG
Sbjct: 218 RVSVVGVAAEVYVCRRITEDTRGSYGVALNEAHLEQLLMAHTTPPPATAAQAKAELVRMG 277
Query: 180 FPQRAGEGSISICSCHKEVKVGVG-YTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
FPQR+ E + S E ++ G YTCPRCK+RV ELP +C ICGL L+SSPHLARSYH
Sbjct: 278 FPQRSTEEASSAVFVGLEARLLSGSYTCPRCKSRVPELPCECHICGLTLISSPHLARSYH 337
Query: 239 HLFPIAPF--------DEVTPLCLNDPRNRSR---ST---CFGCQQSLLSSGNKPGLYVA 284
HLFP+ F E + P R ST CFGC + + G L++
Sbjct: 338 HLFPVPSFHEDSKERASEAAAVLRGQPAMRDEVPTSTGLYCFGCLREVSQPGAAGQLHLT 397
Query: 285 --CPKCKKHFCLECDIYIHESLHNCPGCE 311
C +CK+ FC ECD Y+HE LHNCPGCE
Sbjct: 398 LRCGQCKQVFCFECDAYVHEQLHNCPGCE 426
>gi|281212115|gb|EFA86276.1| TFIIH subunit [Polysphondylium pallidum PN500]
Length = 438
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 150/311 (48%), Positives = 206/311 (66%), Gaps = 11/311 (3%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +PSR + E+F++EFFDQNP+SQI L+ K+ A LT+L G+P HI AL
Sbjct: 109 DMKPSRHQCLLNSSESFIKEFFDQNPISQISLIVTKNSKAEKLTELNGNPNRHITALKNT 168
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
GD S+QN+L++ LS +P YG RE++++YS+L+TCDPGD+ +TI+ K +IR
Sbjct: 169 SAMEGDPSIQNSLEVAISTLSHVPKYGSREIIVIYSSLTTCDPGDLTKTIEILKNEQIRV 228
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
S I ++AE+FICKH+ + T G+ V L+E HF E +M H PPP + A L++MGFP
Sbjct: 229 SFIHMAAELFICKHISEQTHGTMKVVLNEEHFNECLMLHCQPPPTFGKTEAA-LVEMGFP 287
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
Q+ +C CH++++ VGYTCPRC ++ CELPTDC+IC L LVSSPHLARSYHHLF
Sbjct: 288 QQHTSSIPMMCMCHEQLRY-VGYTCPRCSSKFCELPTDCQICNLSLVSSPHLARSYHHLF 346
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
+ F E+ L ++ +CFGC L+ L+ CP+CK+ FC ECD +IH
Sbjct: 347 QVPLFTEINWREL-----KTEISCFGC----LTVPKSSSLFFGCPRCKQSFCFECDQFIH 397
Query: 302 ESLHNCPGCES 312
ESLHNCPGCE+
Sbjct: 398 ESLHNCPGCEN 408
>gi|330804328|ref|XP_003290148.1| TFIIH subunit [Dictyostelium purpureum]
gi|325079746|gb|EGC33332.1| TFIIH subunit [Dictyostelium purpureum]
Length = 423
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 149/310 (48%), Positives = 206/310 (66%), Gaps = 9/310 (2%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +PSR V+ + E F++EFFDQNP+SQ+ ++ K+ A +++L G+P+ HI+A+
Sbjct: 108 DLKPSRYQVLIQNTEQFIKEFFDQNPISQLSIIITKNSKAEKISELSGNPQRHIQAMKDA 167
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
+ G+ S+QN+LD+ L +P YG REVL L+S+L+TCDP D+ +TIQ K+ IR
Sbjct: 168 IAMEGEPSIQNSLDVAISSLCYVPKYGSREVLFLFSSLTTCDPTDLAKTIQTLKQENIRV 227
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
S I L+AE++IC+H+ + T G+ V L+E HF E + PPP I + A L++MGFP
Sbjct: 228 SFIHLAAELYICRHIAEQTNGTSKVILNEEHFLENLYSKCQPPPTIGKVEAA-LVEMGFP 286
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
Q+ S C CH+++K GY CPRC + CELPTDC+IC L LVSSPHLARSYHHLF
Sbjct: 287 QQITSSVPSTCICHEKMKYS-GYICPRCGVKSCELPTDCQICNLSLVSSPHLARSYHHLF 345
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
PI F+EV LN + TC+ C S S + L+ +CP+C + FCLECD+YIH
Sbjct: 346 PIPLFEEVNWKELN-----TNVTCYSCLTS--SETSILSLFFSCPRCDQVFCLECDMYIH 398
Query: 302 ESLHNCPGCE 311
E+LHNCPGCE
Sbjct: 399 EALHNCPGCE 408
>gi|384251056|gb|EIE24534.1| transcription factor TFIIH, 44 kDa subunit, partial [Coccomyxa
subellipsoidea C-169]
Length = 420
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 216/329 (65%), Gaps = 29/329 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R+AV++ ++ F+R+FFD+NPLS +G++ ++DG+A LTD+ GSPE+HI L G
Sbjct: 71 DMRPNRLAVMSGVLQGFIRKFFDENPLSHLGIIVMRDGIAQRLTDMAGSPEAHISKLQGN 130
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREV---LILYSA----------LSTCDPGDIM 108
L SGD+SLQNA+D+ L IP YGHRE ++L A LSTCDPG+++
Sbjct: 131 LDASGDASLQNAMDMAAESLKSIPPYGHREACPQILLKPAHLLVLVLFAALSTCDPGNVL 190
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIA 168
+ ++ K++ IR S++G++AE+ ICK ++TGG+Y VAL+E HF++L+ EHAPPPP++A
Sbjct: 191 DAVKAAKQNSIRVSIVGVAAEVHICKVFTKETGGTYGVALNEKHFEDLVFEHAPPPPSLA 250
Query: 169 EFAIANLIKMGFPQRAGEGSISICSCHKEVKVGV-GYTCPRCKARVCELPTDCRICGLQL 227
A A+L++MGF R +G K+ K+ G+TCPRCKARV +LP C ICGL L
Sbjct: 251 TDAAASLVRMGFAHRNQDGVEGTAFIGKDCKLSSGGFTCPRCKARVVDLPCSCHICGLTL 310
Query: 228 VSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSG------NKPGL 281
+SSPHLARSYHHLFP+ F E + +C+ C L S + G+
Sbjct: 311 ISSPHLARSYHHLFPVKAFTE---------GDSGAVSCYACLADLEPSSLDDDAPGEVGV 361
Query: 282 YVACPKCKKHFCLECDIYIHESLHNCPGC 310
V CP C + FC +CD +IHE+LHNCPGC
Sbjct: 362 VVRCPDCCQLFCFDCDAFIHETLHNCPGC 390
>gi|159490896|ref|XP_001703409.1| transcription factor TFIIH, 44 kDa subunit [Chlamydomonas
reinhardtii]
gi|158280333|gb|EDP06091.1| transcription factor TFIIH, 44 kDa subunit [Chlamydomonas
reinhardtii]
Length = 413
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 160/317 (50%), Positives = 229/317 (72%), Gaps = 10/317 (3%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL-MG 60
D RP+R+ + +FVREFFDQNPLSQ+G+ ++ G+ LTDL GSPE+ + L G
Sbjct: 99 DLRPNRLGCMLSLCRSFVREFFDQNPLSQLGIAVMRGGLVEKLTDLSGSPEAQVARLDAG 158
Query: 61 KLGC-SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KLG +GD+SLQNAL+L LL Q+P YGHRE+L+L++AL++CDPG+++++++ CK++ I
Sbjct: 159 KLGAPAGDASLQNALELGVSLLKQLPPYGHRELLLLFAALASCDPGNVLDSVKACKDNNI 218
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SV+G++AE+++C+ + +DTGG+Y VAL+E H ++L+M H+PPPPA A A A L++MG
Sbjct: 219 RVSVVGVAAEVYVCRRITEDTGGTYGVALNEGHLEQLLMAHSPPPPATAAQAKAELVRMG 278
Query: 180 FPQRAGEGSISICSCHKEVKVGVG-YTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
FPQR+ E + S +E ++ G YTCPRC++RV ELP +C CGL L+SSPHLARSYH
Sbjct: 279 FPQRSTEEASSAVFVGQEARLLPGSYTCPRCRSRVPELPVECHTCGLTLISSPHLARSYH 338
Query: 239 HLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA--CPKCKKHFCLEC 296
HLFP+ P+ E + + + + CFGC + + G K L++ C +CK+ FC EC
Sbjct: 339 HLFPVQPYHEDSEVMTT-----TGAYCFGCLREVSEPGVKGQLHLTLRCGQCKQVFCFEC 393
Query: 297 DIYIHESLHNCPGCESL 313
D Y+HESLHNCPGCE L
Sbjct: 394 DAYVHESLHNCPGCEFL 410
>gi|255073175|ref|XP_002500262.1| predicted protein [Micromonas sp. RCC299]
gi|226515524|gb|ACO61520.1| predicted protein [Micromonas sp. RCC299]
Length = 427
Score = 303 bits (777), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 153/337 (45%), Positives = 207/337 (61%), Gaps = 32/337 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RPSR+AVV+ + F RE+F+QNPLSQ+GLV ++G+A LT+L GSPESHI AL
Sbjct: 85 DMRPSRLAVVSGILYKFFREYFNQNPLSQLGLVVTRNGIAERLTELSGSPESHITALKEN 144
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
L +GD S+QN+L+ VQ L+Q+P+YG REVL + SALS+CDPG++ I K + IR
Sbjct: 145 LEAAGDMSIQNSLEQVQSSLAQLPTYGTREVLFVVSALSSCDPGNVHTAIAAAKSANIRV 204
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
SV+ ++AE+ IC+ + ++TGG + V+ + H ++L+M HAPPPP + A+L++MGFP
Sbjct: 205 SVVSVAAELHICRRMTEETGGMFGVSQSQHHLEDLLMAHAPPPPLNEQATKASLVEMGFP 264
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARV----------------------CELPTD 219
Q+ + S G Y CPRC +RV ELP
Sbjct: 265 QKRPLDKGAFFSGR-----GGEYVCPRCASRVEVNPTHPKFVENIKINNPVTFLQELPAQ 319
Query: 220 CRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLC---LNDPRNRSRSTCFGCQQSLLSSG 276
C +C L LVSSPHLARSYHHLFP+APF+ P+ LND + CF C SG
Sbjct: 320 CSVCSLTLVSSPHLARSYHHLFPVAPFEVHRPVIMQELNDAAKLDATNCFACNLKFDRSG 379
Query: 277 NKPGLYV--ACPKCKKHFCLECDIYIHESLHNCPGCE 311
+ CP+CKK +C +CD++IHE LHNCPGCE
Sbjct: 380 ARGNDDAPSVCPRCKKIYCFQCDVFIHEKLHNCPGCE 416
>gi|303285284|ref|XP_003061932.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456343|gb|EEH53644.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 404
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 158/329 (48%), Positives = 210/329 (63%), Gaps = 21/329 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RPSR+AV++ + F REFF+QNPLSQ+GLV ++G+A +T+L G+PE+HI AL
Sbjct: 90 DMRPSRLAVISSLMYKFFREFFNQNPLSQLGLVVTRNGIAERVTELSGNPETHIAALKEN 149
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
L +GD S+QN+L+ V L Q+P YG REVL + SALSTCDPG++ I K +KIR
Sbjct: 150 LDAAGDMSIQNSLEQVHASLVQLPMYGSREVLFVVSALSTCDPGNVHTAIAAAKNAKIRV 209
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
SV+ ++AEM +C+ + ++TGG + VA + H +EL+M HAPPPP E +L++MGFP
Sbjct: 210 SVVSVAAEMHVCRRMTEETGGIFGVAQSQHHLEELLMAHAPPPPLNEESTKPSLVEMGFP 269
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
Q+ + S G Y CPRCK+RV ELP+ C +C L LVSS HLARSYHHLF
Sbjct: 270 QKRPLDKDAFFSGR-----GGDYVCPRCKSRVEELPSQCTVCKLTLVSSAHLARSYHHLF 324
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL------LSSGNKPGLYVACPKCKKHFCLE 295
P+ PF E L D + ++CF C S+ N P + CPKCKK FC
Sbjct: 325 PVPPFTEGFGL---DDEGKRHTSCFACYVDFEEGNESTSNENAPSV---CPKCKKTFCFV 378
Query: 296 CDIYIHESLHNCPGCESLRHSNPIVANEG 324
CD+YIHE LHNCPGCE ++ +A EG
Sbjct: 379 CDVYIHEKLHNCPGCELVK----FLAEEG 403
>gi|110739908|dbj|BAF01859.1| putative transcription factor [Arabidopsis thaliana]
Length = 196
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 145/195 (74%), Positives = 163/195 (83%), Gaps = 1/195 (0%)
Query: 130 MFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSI 189
MFICKHLCQ+TGG YSVA+DE H K+L++EHAPPPPAIAEFAIANLIKMGFPQRA EGS+
Sbjct: 1 MFICKHLCQETGGLYSVAVDEVHLKDLLLEHAPPPPAIAEFAIANLIKMGFPQRAAEGSM 60
Query: 190 SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEV 249
+ICSCHKEVK+G GY CPRCKARVC+LPT+C ICGL LVSSPHLARSYHHLFPIAPFDEV
Sbjct: 61 AICSCHKEVKIGAGYMCPRCKARVCDLPTECTICGLTLVSSPHLARSYHHLFPIAPFDEV 120
Query: 250 TPL-CLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCP 308
L LND R + +CFGCQQSL+ +GNKP V C KCK +FCL+CDIYIHESLHNCP
Sbjct: 121 PALSSLNDNRRKLGKSCFGCQQSLIGAGNKPVPCVTCRKCKHYFCLDCDIYIHESLHNCP 180
Query: 309 GCESLRHSNPIVANE 323
GCES+ + E
Sbjct: 181 GCESIHRPKSVSLME 195
>gi|443732857|gb|ELU17421.1| hypothetical protein CAPTEDRAFT_178402 [Capitella teleta]
Length = 390
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 150/321 (46%), Positives = 215/321 (66%), Gaps = 17/321 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D RP+R+ K +E FV EFFDQNP+SQ+G++T + A+ LT+LGG+P+ HI+AL G
Sbjct: 73 DLRPNRLFASLKLLEKFVEEFFDQNPISQLGILTTSNKRADKLTELGGNPKRHIQALRGL 132
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SLQNALDL L +P + REVLI++ +L+TCDPG I++TIQ K + +
Sbjct: 133 KERACIGEPSLQNALDLAAQTLKHMPGHSSREVLIVFGSLTTCDPGSIIDTIQSMKANNV 192
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
+ S+IGL+A++ IC+ +C DT G Y V +DE HFKEL+ + PP A + ++LI+MG
Sbjct: 193 QVSIIGLAADVRICRKICHDTQGQYHVIVDEPHFKELLNQQVTPPNATSSTE-SSLIRMG 251
Query: 180 FPQRAGEGSISICSCHKEVKVGV-----GYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
FP+ S+C+CH E GY CP+C+++ CELP +CR CGL LVS+PHLA
Sbjct: 252 FPRHEQGQQPSMCNCHLESSSSEGFSTGGYFCPQCRSKYCELPVECRACGLTLVSAPHLA 311
Query: 235 RSYHHLFPIAPFDEV-TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFC 293
RSYHHLF + PF+EV T + + ++ + C+ CQ L N P Y C KC++ FC
Sbjct: 312 RSYHHLFLLEPFNEVRTEVAI---FHKEKRICYSCQSQL----NDPMSY-QCSKCRQVFC 363
Query: 294 LECDIYIHESLHNCPGCESLR 314
++CD+++HE+LH+CPGC S R
Sbjct: 364 VDCDLFVHETLHSCPGCSSSR 384
>gi|384487455|gb|EIE79635.1| hypothetical protein RO3G_04340 [Rhizopus delemar RA 99-880]
Length = 395
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 148/304 (48%), Positives = 203/304 (66%), Gaps = 13/304 (4%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RPSR+ + + FV EFFDQNP+SQ+G++ +DG+A LT+L G+P HIKAL K
Sbjct: 96 DLRPSRIELTLTYAQQFVVEFFDQNPISQLGIIITRDGIAEKLTELSGNPTDHIKALKSK 155
Query: 62 LGC--SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
SG+ SLQNAL L + + +PS+G +EVL+++ +L+TCDP DI +TI ++ +
Sbjct: 156 KNTETSGEPSLQNALQLARASMLGVPSHGSKEVLLIFGSLTTCDPSDIHDTIDLLRKELV 215
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R +V+GL+AE+ IC+ L +Y V L+E+HFK+L+ E PPP + +NLIKMG
Sbjct: 216 RVNVVGLAAEVQICRAL-----RTYGVVLNEAHFKDLLFEVVPPPAVMQNKNTSNLIKMG 270
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP+R E + + C CH ++ +G GY CPRCK++VCELP+DC ICGL LVSSPHLARSYHH
Sbjct: 271 FPKRLVEDNATFCVCHSKLTMG-GYICPRCKSKVCELPSDCDICGLTLVSSPHLARSYHH 329
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
LFP+ FDEV + + + CF C L S G +CPKCK+ FC ECDI+
Sbjct: 330 LFPVDNFDEV-----KNNISGYATHCFSCLSQLDKSEVVGGNRFSCPKCKQEFCAECDIF 384
Query: 300 IHES 303
+HES
Sbjct: 385 VHES 388
>gi|66823583|ref|XP_645146.1| TFIIH subunit [Dictyostelium discoideum AX4]
gi|74861629|sp|Q86KZ2.2|TF2H2_DICDI RecName: Full=General transcription factor IIH subunit 2; AltName:
Full=TFIIH basal transcription factor complex subunit 2
gi|60473387|gb|EAL71333.1| TFIIH subunit [Dictyostelium discoideum AX4]
Length = 461
Score = 300 bits (767), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 204/313 (65%), Gaps = 13/313 (4%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +PSR V+ + VE F++EFFDQNP+SQ+ ++ K+ A +++L G+ HI+A+
Sbjct: 113 DLKPSRYQVLLQNVELFIKEFFDQNPISQLSIIITKNSKAEKISELSGNRLRHIQAMKDA 172
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
+ G+ S+QN+L++ L +P YG REVL ++S+L+TCDP + +TIQ K IR
Sbjct: 173 IAMEGEPSIQNSLEVALSSLCYVPKYGSREVLFIFSSLTTCDPSSLQKTIQSLKNESIRV 232
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
S I ++AE++ICK + + T G+ V L+E HF E +M PPP I + A L++MGFP
Sbjct: 233 SFIHMAAELYICKAIAEQTNGTSKVILNEEHFNESLMLKCQPPPTIGKTEAA-LVEMGFP 291
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
Q+ S C CH+++K GY CPRC + CELPTDC+IC L LVSSPHLARSYHHLF
Sbjct: 292 QQITSTVPSPCICHEKMKYS-GYICPRCGVKSCELPTDCQICNLSLVSSPHLARSYHHLF 350
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP--GLYVACPKCKKHFCLECDIY 299
I F+EV LN TC GC LSS K L+ +CP+C++ FCL+CD++
Sbjct: 351 QIPLFNEVNWKELN-----KNVTCIGC----LSSSEKSILSLFFSCPRCQEIFCLDCDLF 401
Query: 300 IHESLHNCPGCES 312
IHESLHNCPGCE+
Sbjct: 402 IHESLHNCPGCEN 414
>gi|147862851|emb|CAN82988.1| hypothetical protein VITISV_011714 [Vitis vinifera]
Length = 199
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 145/193 (75%), Positives = 156/193 (80%)
Query: 131 FICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSIS 190
F+ C T S L +SHFKEL++EHAPPPPAIAEFAIANLIKMGFPQRA EG IS
Sbjct: 6 FLLIQYCFITNISVLYLLLQSHFKELLLEHAPPPPAIAEFAIANLIKMGFPQRAAEGVIS 65
Query: 191 ICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVT 250
ICSCHKE KVG GYTCPRCKARVCELPT+CRICGL LVSSPHLARSYHHLFPI PFDEV+
Sbjct: 66 ICSCHKEAKVGGGYTCPRCKARVCELPTECRICGLTLVSSPHLARSYHHLFPIPPFDEVS 125
Query: 251 PLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
LN+P RS CFGCQ+SLL GNKP L VACPKCK+HFCL+CDIYIHESLHNCPGC
Sbjct: 126 LSLLNNPHQRSSRACFGCQESLLIPGNKPTLCVACPKCKQHFCLDCDIYIHESLHNCPGC 185
Query: 311 ESLRHSNPIVANE 323
ES RHS + E
Sbjct: 186 ESFRHSKIVSVTE 198
>gi|328874281|gb|EGG22647.1| TFIIH subunit [Dictyostelium fasciculatum]
Length = 457
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 145/313 (46%), Positives = 199/313 (63%), Gaps = 14/313 (4%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL-MG 60
D +P+R V E F++EFFDQNP+SQ+ L+ K+ A ++DL +P HI+ L M
Sbjct: 108 DLKPNRHQAVLIAAENFIKEFFDQNPISQLSLIITKNSKAEKISDLSSNPNRHIQLLKMV 167
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
GD S+QN+LD+ LS +P YG REV+ +YS+L+TCDPGD+ +TI K IR
Sbjct: 168 SSTIEGDPSIQNSLDVSIATLSYVPKYGSREVIFIYSSLTTCDPGDLSKTITTLKNENIR 227
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
S + L+AE+++C+H+ T G+ V E H E ++ H PPP I + A L++MGF
Sbjct: 228 VSFVHLAAELYVCRHISDATNGTMKVIETEHHLNEALILHCQPPPTIGKVEAA-LVEMGF 286
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
PQ+ + +C CH++ K VGYTCPRC+++ CELPTDC+ICGL LVSSPHLARSYHHL
Sbjct: 287 PQQHTSQTPMMCVCHEQFKY-VGYTCPRCQSKFCELPTDCQICGLSLVSSPHLARSYHHL 345
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPG-LYVACPKCKKHFCLECDIY 299
F + F EV P + + C+GC + KP L CP+CKK FC +CD
Sbjct: 346 FQVPIFREVNPKSI-----QPNLKCYGCLLPI-----KPNYLNYGCPRCKKIFCFDCDQV 395
Query: 300 IHESLHNCPGCES 312
+HES+HNCPGCE+
Sbjct: 396 VHESIHNCPGCEN 408
>gi|332017887|gb|EGI58547.1| General transcription factor IIH subunit 2 [Acromyrmex echinatior]
Length = 598
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 149/322 (46%), Positives = 206/322 (63%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R K +E F+ EFF QNP+SQ+G++ ++ A ++DL G+ + HIK L +
Sbjct: 272 DLKPTRFLCSLKLLEDFIEEFFYQNPISQLGVIITRNKRAEKISDLAGNSKKHIKELQIL 331
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ G+ SLQN+L+L LL +PS+ +E+L++ AL+TCDPGDI ETI+ K I
Sbjct: 332 QQTVVGGEPSLQNSLELATKLLKLLPSHASKEILVIIGALTTCDPGDINETIRNMKLDSI 391
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVIGL+AE++ICK + TGG +SVALD+ H+KE + H PPPA A L+KMG
Sbjct: 392 RCSVIGLAAELYICKRMANITGGEHSVALDDKHYKEQLNAHIDPPPAATRLDAA-LVKMG 450
Query: 180 FPQ-----RAGEGSISICSCH-----KEVK-VGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP A + SI++C CH + VK + GY CP+C ++ CELP +CR CGL LV
Sbjct: 451 FPHHALHSNAPDTSITVCMCHAHNSDETVKFMTTGYLCPQCISKHCELPVECRACGLTLV 510
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYH+LFP+ PF E+ P + + CFGCQ++ K AC KC
Sbjct: 511 SAPHLARSYHYLFPVDPFKEIAP-------DSDYTICFGCQKAFAQKDKK---VYACGKC 560
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FCL+C+I+IHE LH CPGC
Sbjct: 561 NQMFCLDCEIFIHEILHTCPGC 582
>gi|260815126|ref|XP_002602325.1| hypothetical protein BRAFLDRAFT_282199 [Branchiostoma floridae]
gi|229287633|gb|EEN58337.1| hypothetical protein BRAFLDRAFT_282199 [Branchiostoma floridae]
Length = 396
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 149/324 (45%), Positives = 214/324 (66%), Gaps = 20/324 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ V K +E F+ E+FDQNP+SQ+G++T K+ A LT+LGG+P+ H+ L +
Sbjct: 76 DLKPTRILVTLKLLENFIEEYFDQNPISQLGVITTKNKRAEKLTELGGNPKRHVTQLRTL 135
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
C G+ S+ N+LDL L +P++ REVLI+ +L+TCDPGDI T++ K+ I
Sbjct: 136 SSASCVGEPSIMNSLDLAAQTLKHMPTHTSREVLIIMGSLTTCDPGDINITMKMVKDLNI 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVIGL+AE+ +CK LC T G+Y + L+++HFK+L++EH PPPA ++LIKMG
Sbjct: 196 RCSVIGLAAEVQVCKKLCNMTNGTYGIILEDTHFKDLLLEHCTPPPATVN-TDSSLIKMG 254
Query: 180 FPQRA---GEGSI--SICSCHKEVKVG----VGYTCPRCKARVCELPTDCRICGLQLVSS 230
FPQ S+ S+C CH++ + GY CP+C A+ CELP +C+ICGL LVS+
Sbjct: 255 FPQHTISQDHDSVKPSMCMCHQDTEAQSFTPSGYFCPQCHAKYCELPVECKICGLTLVSA 314
Query: 231 PHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKK 290
PHLARSYHH FP+ F E+ L+ +R C GCQ L N P +Y C +C +
Sbjct: 315 PHLARSYHHFFPLENFREIPLEELDAELSR---FCTGCQVQL----NGPVVY-CCTRCSR 366
Query: 291 HFCLECDIYIHESLHNCPGCESLR 314
FC++CD++IHE+LH+CPGC + R
Sbjct: 367 PFCIDCDLFIHETLHSCPGCINKR 390
>gi|328772181|gb|EGF82220.1| hypothetical protein BATDEDRAFT_19068 [Batrachochytrium
dendrobatidis JAM81]
Length = 409
Score = 294 bits (753), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 142/321 (44%), Positives = 205/321 (63%), Gaps = 12/321 (3%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
+D RP+R+ +E FV EFFDQNPLS +G++ KDG+ +L G+P +KAL
Sbjct: 89 LDLRPNRLDCTLNFLEMFVVEFFDQNPLSHLGIIGTKDGMVEKWAELSGNPSEILKALTK 148
Query: 61 KLG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
+ +G+ SLQNAL+L + L+ +P + RE+++LY +L++CDPGDI ETI +
Sbjct: 149 RSNRETAGEPSLQNALELARRSLAHVPMHVSREIIVLYGSLTSCDPGDIQETINDLCKEN 208
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAI-AEFAIANLIK 177
IR SVIGL+AE+ +CK +C T G Y+V +++ H+KE++ H PPPP + + + +N+I+
Sbjct: 209 IRVSVIGLAAEVQVCKKMCASTNGVYNVIMNDVHYKEILFAHIPPPPLLQTQLSASNIIQ 268
Query: 178 MGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSY 237
MGFP + +CSCH+ + G+ CPRC A +C +PTDC +C L L+SSP LARSY
Sbjct: 269 MGFPMMKIFDDMVLCSCHQN-PIQKGHICPRCSATICNIPTDCCLCSLTLISSPSLARSY 327
Query: 238 HHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYV------ACPKCKKH 291
HHLFP+ F E+ P + S+ CF CQ + G Y+ ACP+CK H
Sbjct: 328 HHLFPVPNFTEIPP--VQSTSYPSQVECFSCQTPSSMTSFHSGTYMIPTARYACPRCKVH 385
Query: 292 FCLECDIYIHESLHNCPGCES 312
CL+CD+YIH++LHNCPGC S
Sbjct: 386 VCLDCDVYIHDTLHNCPGCMS 406
>gi|307198378|gb|EFN79320.1| General transcription factor IIH subunit 2 [Harpegnathos saltator]
Length = 400
Score = 294 bits (753), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 148/324 (45%), Positives = 206/324 (63%), Gaps = 24/324 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R K +E F+ EFF QNP+SQ+G++T ++ A ++DL G+ + HIK L +
Sbjct: 76 DLKPTRFLCSLKLLENFIEEFFYQNPISQLGVITTRNKRAEKISDLAGNSKKHIKELQSL 135
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ +G+ SLQN+L+L L +PS+ +E+LI+ AL+TCDPGDI ETIQ K I
Sbjct: 136 QQTMVTGEPSLQNSLELALKSLRLLPSHASKEILIIIGALTTCDPGDINETIQNMKSDGI 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVIGL+AE++ICK + TGG + VALD+ H+KE + H PPPA A L+KMG
Sbjct: 196 RCSVIGLAAELYICKRMANATGGEHGVALDDKHYKEQLNSHIDPPPAATRLDAA-LVKMG 254
Query: 180 FPQRAGEGSI-----SICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP A ++ ++C CH E + GY CP+C ++ CELP +CR CGL LV
Sbjct: 255 FPHHALHSTVTDTPMTVCMCHAESSDELFKLMSTGYLCPQCLSKHCELPVECRSCGLTLV 314
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYH+LFP+A F E+ + S+CFGCQ+ K +Y+ C KC
Sbjct: 315 SAPHLARSYHYLFPVATFTEI-------EFENNHSSCFGCQKVFTQKDKK--IYI-CEKC 364
Query: 289 KKHFCLECDIYIHESLHNCPGCES 312
+ FCL+C+I+IHESLH CPGC +
Sbjct: 365 DQPFCLDCEIFIHESLHTCPGCAT 388
>gi|307169945|gb|EFN62454.1| General transcription factor IIH subunit 2 [Camponotus floridanus]
Length = 401
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 151/335 (45%), Positives = 209/335 (62%), Gaps = 28/335 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R K +E F+ EFF QNP+SQ+G++ ++ A ++DL G+ + HIK L +
Sbjct: 77 DLKPTRFLCSLKLLEDFIEEFFYQNPISQLGVIITRNKRAEKISDLAGNSKKHIKELRAL 136
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ +G+ SLQN+L+L L +PS+ +E+LI+ AL+TCDPGDI ETI+ K I
Sbjct: 137 QQTIVTGEPSLQNSLELATKSLKLLPSHASKEILIIIGALTTCDPGDINETIRNMKLDSI 196
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVIGL+AE++ICK + TGG + VALD+ H+KE + H PPPA A L+KMG
Sbjct: 197 RCSVIGLAAELYICKRMANITGGEHGVALDDKHYKEQLNAHIDPPPAATRLDAA-LVKMG 255
Query: 180 FPQR-----AGEGSISICSCHKE-----VKV-GVGYTCPRCKARVCELPTDCRICGLQLV 228
FP A + S+++C CH E VK+ GY CP+C ++ CELP +CR CGL LV
Sbjct: 256 FPHHALHSSASDTSMTVCMCHAENSDESVKLMSTGYLCPQCFSKHCELPVECRACGLTLV 315
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYH+LFP+ PF EVT + S CFGCQ++ K C KC
Sbjct: 316 SAPHLARSYHYLFPVEPFREVT-------FEGNHSFCFGCQKTFAQKDKK---VYTCEKC 365
Query: 289 KKHFCLECDIYIHESLHNCPGCESLRHSNPIVANE 323
+ FCL+C+I+IH+ LH CPGC +NP N+
Sbjct: 366 NQTFCLDCEIFIHDILHTCPGCA----TNPATYNK 396
>gi|298708881|emb|CBJ30838.1| General transcription factor II H, polypeptide 2 [Ectocarpus
siliculosus]
Length = 412
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 149/339 (43%), Positives = 208/339 (61%), Gaps = 37/339 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RPSR+AV + V+ FV +FDQNPLSQ+G++ ++G A +T+L G+P++H++AL
Sbjct: 80 DMRPSRLAVTLRIVQDFVTNYFDQNPLSQLGILVTREGRAEKITELSGNPKAHMEALTKD 139
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
G++SLQN L++ L +P YG+REV+++YS+LSTCDPGDI ETI K K K+R
Sbjct: 140 SDTKGEASLQNLLEMACTSLRAVPEYGNREVVVIYSSLSTCDPGDIHETIAKLKTHKVRA 199
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEH---APPPPAIAEFAIANLIKM 178
SV+GL AEMF+ + L ++T G YS+A DE+H++E +M P PP A+L++M
Sbjct: 200 SVVGLGAEMFVLRRLTEETSGDYSIAGDENHYRESLMAQCTPPPTPPGREGAMFADLVRM 259
Query: 179 GFPQRAGE--GSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
GFP + S+ +E+ V GY+CPRCK + ELP++C IC L LVSSPHLARS
Sbjct: 260 GFPAETQDVFPSLGYSGNRQELSVS-GYSCPRCKTKTSELPSECVICALPLVSSPHLARS 318
Query: 237 YHHLFPIAPFDEV------------------------TPLCLNDPRNRSRSTCFGCQQSL 272
YHHLFP+ FDE+ + + S S C GC + L
Sbjct: 319 YHHLFPVPQFDEIPAAAPAAAAAAASAGSAGGMGSGAGAGRATEEESASSSYCAGCLRDL 378
Query: 273 LSSGNKPGL-YVACPKCKKHFCLECDIYIHESLHNCPGC 310
GL CP+C+ FCL+CD+Y+H+SLHNCPGC
Sbjct: 379 ------SGLPRYRCPECRSAFCLDCDMYVHDSLHNCPGC 411
>gi|291000862|ref|XP_002682998.1| RNA polymerase II transcription factor TFIIH subunit [Naegleria
gruberi]
gi|284096626|gb|EFC50254.1| RNA polymerase II transcription factor TFIIH subunit [Naegleria
gruberi]
Length = 349
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 200/313 (63%), Gaps = 16/313 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D + SR V + F++E+FDQNPLSQ+G++T K+GVA L +L G+ + + L
Sbjct: 48 DEKTSRAQSVISTAKEFIKEYFDQNPLSQLGIITTKNGVAKILKELTGNVKQLVSELKKS 107
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
G GD SLQN L+L L QIP+YG +EV+++Y +LS+CDP +I TI++ K++ IR
Sbjct: 108 FG--GDPSLQNVLNLSYDSLHQIPNYGSKEVVVIYCSLSSCDPDNIFSTIKQLKDNNIRV 165
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAP----PPPAIAEFAIANLIK 177
S+I L AE+F+CK + + T G+YSV D+ HFKE++M HA + + ++
Sbjct: 166 SLISLDAEVFVCKQIAKQTQGTYSVPTDDEHFKEVLMAHAAPPPTTSSSSTKIVAPPSMR 225
Query: 178 MGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSY 237
MGFPQR + S+C CHK++ G GY CPRCK++ C+LP +C CGL LVSSPHLARSY
Sbjct: 226 MGFPQRRAQTLYSMCLCHKKITPG-GYICPRCKSKYCDLPVECSTCGLMLVSSPHLARSY 284
Query: 238 HHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECD 297
HHLFP+ F + PR C+GCQ+SL + + + CP C FC+ECD
Sbjct: 285 HHLFPVQQFIDY------QPREGEEPYCYGCQKSL---PKESFISLQCPSCSNIFCVECD 335
Query: 298 IYIHESLHNCPGC 310
+IHESLHNCPGC
Sbjct: 336 AFIHESLHNCPGC 348
>gi|405976292|gb|EKC40804.1| General transcription factor IIH subunit 2 [Crassostrea gigas]
Length = 399
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 145/327 (44%), Positives = 210/327 (64%), Gaps = 22/327 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ K +E FV E+FDQNP+SQ+G++ K+ A +++LGG+P HI L +
Sbjct: 75 DLKPTRLLSTLKLLEYFVEEYFDQNPISQLGIIITKNKRAEKVSELGGNPRRHITTLQSL 134
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
C G+ SLQN+L+L L +P + REVL++ +L+TCDP +I+ TI ++ +
Sbjct: 135 ADKVCHGEPSLQNSLELAHQTLRHMPGHASREVLVVMGSLTTCDPDNIISTISMLRDGNV 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVIGL+AE+ +CK LC++T GSYSV LDE HFK+L+ +H PPPA A ++LI+MG
Sbjct: 195 RCSVIGLAAEVCVCKKLCKETHGSYSVILDECHFKDLLTQHVTPPPATASTE-SSLIRMG 253
Query: 180 FPQRA----GEGSISICSCHKEVK-----VGVGYTCPRCKARVCELPTDCRICGLQLVSS 230
FP E S+C CH + K GY CP+CK++ CELP +C+ CGL LVS+
Sbjct: 254 FPHHQLSADKEERPSMCMCHLDSKNSQGFSSTGYFCPQCKSKYCELPIECKACGLTLVSA 313
Query: 231 PHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKK 290
PHLARSYHHLFP+ F E+ + S CF CQ ++ + +Y+ C C++
Sbjct: 314 PHLARSYHHLFPLDQFREIPTAQVG-----SNVLCFSCQTAV----QELNVYI-CDNCER 363
Query: 291 HFCLECDIYIHESLHNCPGCESLRHSN 317
+CL+CD++IHE+LH+CPGC S R +
Sbjct: 364 IYCLDCDLFIHETLHSCPGCASSRQTQ 390
>gi|350418129|ref|XP_003491749.1| PREDICTED: general transcription factor IIH subunit 2-like [Bombus
impatiens]
Length = 400
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 147/324 (45%), Positives = 212/324 (65%), Gaps = 24/324 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R K +E F+ EFF QNP+SQ+G++ ++ A +++L G+ + H+K + M
Sbjct: 76 DLKPTRFLCSLKLLEDFIEEFFYQNPISQLGVIITRNKRAEKVSELAGNSKKHVKEVQNM 135
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
++ G+ SLQN+++L L +PS+ +E+LI+ AL+TCDPGDI ETIQ K +
Sbjct: 136 QQIAPVGEPSLQNSIELALKSLRLLPSHASKEILIIVGALTTCDPGDINETIQNMKTDSV 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVIGL+AE++ICK + TGG + VALD+ H+KE + H PPPA A L+KMG
Sbjct: 196 RCSVIGLAAELYICKRMATATGGEHGVALDDKHYKEQLNMHIDPPPAATRLDAA-LVKMG 254
Query: 180 FPQR-----AGEGSISICSCHKE-----VK-VGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP A + ++++C CH E VK + GY CP+C ++ CELP +CR CGL LV
Sbjct: 255 FPHHALHSSANDSAMAVCMCHAESSDESVKLLSTGYLCPQCLSKHCELPVECRACGLTLV 314
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYH+LFP+ F EV + + STC+GCQ+ +LS +K +Y+ C KC
Sbjct: 315 SAPHLARSYHYLFPVESFKEVV-------FSGTPSTCYGCQK-ILSQKDKK-VYI-CNKC 364
Query: 289 KKHFCLECDIYIHESLHNCPGCES 312
+ FCL+C+I+IHESLH CPGC +
Sbjct: 365 NETFCLDCEIFIHESLHTCPGCAT 388
>gi|383853156|ref|XP_003702089.1| PREDICTED: general transcription factor IIH subunit 2-like
[Megachile rotundata]
Length = 400
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 149/324 (45%), Positives = 210/324 (64%), Gaps = 24/324 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
DF+P+R K +E FV EFF QNP+SQ+G++ ++ A +++L G+ + HIK + M
Sbjct: 76 DFKPTRFLCSLKLLEDFVEEFFYQNPISQLGVIITRNKRAEKVSELVGNSKKHIKEIQNM 135
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
++ +G+ SLQN+L+L L +PS+ +E+LI+ AL+TCDPGDI ETIQ K I
Sbjct: 136 QQMVPAGEPSLQNSLELALKSLRLLPSHASKEILIIIGALTTCDPGDINETIQGMKSDCI 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVIGL+AE++ICK + TGG + VALD+ H+KE + H PPPA A L+KMG
Sbjct: 196 RCSVIGLAAELYICKRMATATGGEHGVALDDKHYKEQLNMHIDPPPAATRLDAA-LVKMG 254
Query: 180 FPQRA-----GEGSISICSCHKE-----VK-VGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP A + ++++C CH E VK + GY CP+C ++ CELP +CR CGL LV
Sbjct: 255 FPHHALHSSVNDSAMAVCMCHAENSEESVKLISTGYLCPQCLSKHCELPVECRACGLTLV 314
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYH+LFP+ PF E + + S C+GCQ++ K +Y+ C KC
Sbjct: 315 SAPHLARSYHYLFPVEPFTETV-------FDGTPSLCYGCQKNFSQMDKK--IYI-CNKC 364
Query: 289 KKHFCLECDIYIHESLHNCPGCES 312
+ FCL+C+I+IHESLH CPGC +
Sbjct: 365 SQAFCLDCEIFIHESLHTCPGCAT 388
>gi|332373040|gb|AEE61661.1| unknown [Dendroctonus ponderosae]
Length = 395
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 213/322 (66%), Gaps = 22/322 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+RM + +E F+ EFFDQNP+SQ+G++ +++ A + +LGG+ HIK L M
Sbjct: 75 DLKPTRMLCSLRILEGFIEEFFDQNPISQMGIILMQNKRAEKVCELGGNFRKHIKYLRSM 134
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQN L+ L +P++ REVL++ +L+TCDP DI TIQK K+ I
Sbjct: 135 SKTSLLGEPSLQNGLEDAFSSLKLVPTHASREVLVIMGSLTTCDPTDINTTIQKFKQEGI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCS+IGL+AE+ IC+ L +T G+Y++ LD+SH+K+L+ + PPPA A+LIKMG
Sbjct: 195 RCSIIGLAAEVHICRTLANETKGTYNIILDDSHYKDLLYQQVEPPPAAVALE-ASLIKMG 253
Query: 180 FP-QRAGEGS---ISICSCH----KEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSS 230
FP Q EG+ ++IC CH +E K+ GY CP+C ++ CELP +CR CGL LVS+
Sbjct: 254 FPHQMNTEGTDEPLTICMCHVDSTEEGKLTTGGYYCPQCLSKYCELPVECRACGLTLVSA 313
Query: 231 PHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKK 290
PHLARSYHHLFP A F+E+ ++ +TC+ CQ++ + G+K CP CK+
Sbjct: 314 PHLARSYHHLFPPANFEEMQ-------FDKQVTTCYACQKT-FNDGDKQ--VYQCPTCKR 363
Query: 291 HFCLECDIYIHESLHNCPGCES 312
FC++CDI+IH++LH CPGC +
Sbjct: 364 MFCIDCDIFIHDTLHTCPGCAT 385
>gi|320166414|gb|EFW43313.1| general transcription factor IIH [Capsaspora owczarzaki ATCC 30864]
Length = 427
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 148/336 (44%), Positives = 212/336 (63%), Gaps = 20/336 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +P+R+ + +E FV EFFDQNP+SQ+ + +D A +++LGG+P H+++L K
Sbjct: 89 DLKPTRLEHSIRLLEQFVPEFFDQNPISQLNFIISRDAKAERISELGGNPAKHLESLRKK 148
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
+G+ SLQN+L+L + L +PS+ +EVLI+ L++CDPGDI +TI + I C
Sbjct: 149 ASTAGEISLQNSLELARESLRLMPSHTSKEVLIIMGGLASCDPGDIFQTISLLELDNIHC 208
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
SVIGLSAE+ ICK+L + T G ++V +DESHF++++ +H PPPA + A LI+MGFP
Sbjct: 209 SVIGLSAEVRICKYLAEKTKGVHNVIIDESHFRDMLFQHITPPPASSRTE-ALLIRMGFP 267
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
++ + ++C+CHK + G GY CP+CKA+ CELPT C +CGL LVSSPHLARSYHHLF
Sbjct: 268 RQNLSKAATLCACHKLFRAG-GYICPQCKAKCCELPTTCAVCGLTLVSSPHLARSYHHLF 326
Query: 242 PIAPFDEV-------------TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
P+APF EV L N +C GC + + + CP C
Sbjct: 327 PLAPFREVPLAHPPADANGLLAGLVANSTTLALHPSCRGCSKPTATMRSA----FQCPTC 382
Query: 289 KKHFCLECDIYIHESLHNCPGCESLRHSNPIVANEG 324
+CL+CDI+IH++LH CP C +L + VAN G
Sbjct: 383 CHVYCLDCDIFIHDTLHTCPSCVALLDGSA-VANGG 417
>gi|340715742|ref|XP_003396368.1| PREDICTED: general transcription factor IIH subunit 2-like [Bombus
terrestris]
Length = 400
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 146/324 (45%), Positives = 212/324 (65%), Gaps = 24/324 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R K +E F+ EFF QNP+SQ+G++ ++ A +++L G+ + H+K + M
Sbjct: 76 DLKPTRFLCSLKLLEDFIEEFFYQNPISQLGVIITRNKRAEKVSELAGNSKKHVKEVQNM 135
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
++ G+ SLQN+++L L +P++ +E+LI+ AL+TCDPGDI ETIQ K +
Sbjct: 136 QQIAPVGEPSLQNSIELALKSLRLLPTHASKEILIIVGALTTCDPGDINETIQNMKTDSV 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVIGL+AE++ICK + TGG + VALD+ H+KE + H PPPA A L+KMG
Sbjct: 196 RCSVIGLAAELYICKRMATATGGEHGVALDDKHYKEQLNMHIDPPPAATRLDAA-LVKMG 254
Query: 180 FPQR-----AGEGSISICSCHKE-----VK-VGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP A + ++++C CH E VK + GY CP+C ++ CELP +CR CGL LV
Sbjct: 255 FPHHALHSSANDSAMAVCMCHAESSDESVKLLSTGYLCPQCLSKHCELPVECRACGLTLV 314
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYH+LFP+ F EV + + STC+GCQ+ +LS +K +Y+ C KC
Sbjct: 315 SAPHLARSYHYLFPVESFKEVV-------FSGTPSTCYGCQK-ILSQKDKK-VYI-CNKC 364
Query: 289 KKHFCLECDIYIHESLHNCPGCES 312
+ FCL+C+I+IHESLH CPGC +
Sbjct: 365 NETFCLDCEIFIHESLHTCPGCAT 388
>gi|380019349|ref|XP_003693572.1| PREDICTED: general transcription factor IIH subunit 2-like [Apis
florea]
Length = 405
Score = 286 bits (732), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 147/327 (44%), Positives = 212/327 (64%), Gaps = 25/327 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R K +E F+ EFF QNP+SQ+G++ ++ A +++L G+ + HIK + M
Sbjct: 76 DLKPTRFLCSLKLLEDFIEEFFYQNPISQLGVIITRNKRAEKVSELTGNSKKHIKEVQSM 135
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
++ +G+ SLQN+++L L +PS+ +E+LI+ AL+TCDPGDI ETI+ K +
Sbjct: 136 QQITPAGEPSLQNSIELALKSLRLLPSHASKEILIIVGALTTCDPGDINETIKNMKSDSV 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVIGL+AE++ICK + TGG + VALD+ H+KE + H PPPA A L+KMG
Sbjct: 196 RCSVIGLAAELYICKRMATATGGEHGVALDDKHYKEQLNMHIDPPPAATRLDAA-LVKMG 254
Query: 180 FPQRA-----GEGSISICSCHKE-----VK-VGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP A + ++++C CH E VK + GY CP+C ++ CELP +CR CGL LV
Sbjct: 255 FPHHALHSSTNDSAMAVCMCHAESSDESVKLLSTGYLCPQCLSKHCELPVECRACGLTLV 314
Query: 229 SSPHLARSYHHLFPIAPFDEV---TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVAC 285
S+PHLARSYH+LFP+ PF EV + P S C+GCQ+ + S +K +Y+ C
Sbjct: 315 SAPHLARSYHYLFPVEPFKEVEYRKEVTFEHP-----SICYGCQK-IFSQKDK-KVYI-C 366
Query: 286 PKCKKHFCLECDIYIHESLHNCPGCES 312
KC + FCL+C+I+IHESLH CPGC +
Sbjct: 367 NKCNQTFCLDCEIFIHESLHTCPGCAT 393
>gi|348680619|gb|EGZ20435.1| hypothetical protein PHYSODRAFT_345458 [Phytophthora sojae]
Length = 387
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 139/325 (42%), Positives = 199/325 (61%), Gaps = 27/325 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D++P R+A + ++ FV+++FDQNP+SQ+G++ +K A L+DL G+P +H++ +
Sbjct: 73 DWKPHRLACASDVLQQFVKDYFDQNPISQLGVIGIKGMTAEKLSDLSGNPNTHMERIAAA 132
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
L + SLQNAL++ + L +P+YG RE++++Y L T DPGDI +T+ K IR
Sbjct: 133 LAVDKEPSLQNALEIAKSSLKTVPAYGSREIVVVYGNLVTADPGDIFQTLASLKRENIRV 192
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFA--IANLIKMG 179
S IG+ AEM + + + T G+Y VA+D H K L+ P P +A A A L++MG
Sbjct: 193 SFIGIGAEMHLLRRIADGTDGTYHVAVDAEHMKRLMTAFTFPSPTVATAASRFATLVEMG 252
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FPQR G++S+C+CH+ VGY CPRCK++ C+LPT C++C L LVSSPHLARSYHH
Sbjct: 253 FPQRR-SGALSLCTCHQAFTT-VGYLCPRCKSKSCDLPTTCQVCNLPLVSSPHLARSYHH 310
Query: 240 LFPIAPFDE--------------VTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVAC 285
LFP+A F + VTP + + CFGC L G G C
Sbjct: 311 LFPVAKFTQHLLRSGVTGEKGAKVTPSLV-------QKKCFGCLLPLGLDGE--GTAYEC 361
Query: 286 PKCKKHFCLECDIYIHESLHNCPGC 310
C+ FC ECD+Y+H+SLHNCPGC
Sbjct: 362 TTCQNVFCSECDMYVHDSLHNCPGC 386
>gi|66549743|ref|XP_394997.2| PREDICTED: general transcription factor IIH subunit 2 isoform 1
[Apis mellifera]
Length = 405
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 146/327 (44%), Positives = 210/327 (64%), Gaps = 25/327 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R K +E F+ EFF QNP+SQ+G++ ++ A +++L G+ + HIK + M
Sbjct: 76 DLKPTRFLCSLKLLEDFIEEFFYQNPISQLGVIITRNKRAEKVSELTGNSKKHIKEVQSM 135
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
++ +G+ SLQN+++L L +PS+ +E+LI+ AL+TCDPGDI ETI+ K +
Sbjct: 136 QQITPAGEPSLQNSIELALKSLRLLPSHASKEILIIVGALTTCDPGDINETIKNMKLDSV 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVIGL+AE++ICK + TGG + VALD+ H+KE + H PPPA A L+KMG
Sbjct: 196 RCSVIGLAAELYICKRMATATGGEHGVALDDKHYKEQLNMHIDPPPAATRLDAA-LVKMG 254
Query: 180 FPQRA-----GEGSISICSCHKE-----VK-VGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP A + ++++C CH E VK + GY CP+C ++ CELP +CR CGL LV
Sbjct: 255 FPHHALHSSTNDSAMAVCMCHAESSDESVKLLSTGYLCPQCLSKHCELPVECRACGLTLV 314
Query: 229 SSPHLARSYHHLFPIAPFDEV---TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVAC 285
S+PHLARSYH+LFP+ PF EV + P S C+GCQ+ + S +Y+ C
Sbjct: 315 SAPHLARSYHYLFPVEPFKEVEYRKEVTFEHP-----SICYGCQK--IFSQKDKKVYI-C 366
Query: 286 PKCKKHFCLECDIYIHESLHNCPGCES 312
KC + FCL+C+I+IHESLH CPGC +
Sbjct: 367 NKCNQTFCLDCEIFIHESLHTCPGCAT 393
>gi|406604786|emb|CCH43771.1| General transcription factor IIH subunit 2 [Wickerhamomyces
ciferrii]
Length = 420
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 146/320 (45%), Positives = 200/320 (62%), Gaps = 16/320 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R ++ FV EFFDQNP+SQ+G++ +++G+ ++ +GG+P+ HI AL +
Sbjct: 107 DLRPNRHSITITNAIQFVTEFFDQNPISQLGIIIMRNGIGQLVSQVGGNPQEHIDALKSI 166
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KL GD SLQNAL++ +GLL + S+ REVLI+Y +L + DPG+I +T+ E KI
Sbjct: 167 KKLEPKGDPSLQNALEMSRGLLMHVASHCTREVLIIYGSLLSLDPGNIHKTVNSLVEEKI 226
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R +IGLSA++ ICK +C+ T SY+V L+E HFKEL M+ P P L
Sbjct: 227 RVKIIGLSAQVSICKEICKKTNFGDENSYNVILNEQHFKELFMDAVVPLPVNKINKSFTL 286
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E S S CSCH ++ G GY CP CK+++C LPT C C L+ S HLAR
Sbjct: 287 VKMGFPYRISEDSPSFCSCHSKLTYG-GYICPNCKSKICSLPTICPCCNTMLILSTHLAR 345
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL---LSSGNKPGLYVACPKCKKHF 292
SYHHL P+ F EV P+ +N + CFGCQ L L + Y C C F
Sbjct: 346 SYHHLLPLKNFIEV-PV----DKNYDVNLCFGCQMELPDGLITQTSSSRY-KCVDCNNQF 399
Query: 293 CLECDIYIHESLHNCPGCES 312
C++CD++IHE+LHNCPGCE+
Sbjct: 400 CIDCDVFIHETLHNCPGCEN 419
>gi|449670653|ref|XP_002167169.2| PREDICTED: general transcription factor IIH subunit 2-like [Hydra
magnipapillata]
Length = 384
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 145/317 (45%), Positives = 201/317 (63%), Gaps = 21/317 (6%)
Query: 3 FRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG-- 60
RPSR+A + K +E F+ E+FDQNP+SQ ++ A +TDL G+P HI AL
Sbjct: 78 LRPSRLACLTKLLENFIVEYFDQNPISQ-----TRNKRAEKITDLSGNPNIHIAALQNFV 132
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
K G G+ SLQN+LD+ L +P + RE+LI++ +L+TCDPGDI TI K IR
Sbjct: 133 KSGPEGEPSLQNSLDMALQFLGHLPKHASREILIVFGSLTTCDPGDIFTTINNLKNENIR 192
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSV+GLSAE+ +CK + +T G Y+V LDE H +L++EH PP A A+L++MGF
Sbjct: 193 CSVLGLSAEIKLCKTISSETNGIYNVILDEKHCNDLLLEHIRPPAAKLNVE-ASLVRMGF 251
Query: 181 PQRAGEGSISICSCHKEVKVG-----VGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
PQ ++C CH E+K GY CP+CK + CELP +C++CGL LVS+PHLAR
Sbjct: 252 PQHISNTYPALCLCHIELKNMQGFNCTGYFCPQCKNKYCELPVECKVCGLTLVSAPHLAR 311
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLE 295
SY HLFP+ PF+EV + +N S C GCQ++ + + + C +CK+ FC +
Sbjct: 312 SYQHLFPLPPFEEVRRIETAANKN---SICQGCQRNCIDA-----IVYICKECKEMFCND 363
Query: 296 CDIYIHESLHNCPGCES 312
CD++IHE+LH CPGC S
Sbjct: 364 CDMFIHETLHTCPGCTS 380
>gi|42415511|ref|NP_963875.1| general transcription factor IIH subunit 2 [Danio rerio]
gi|28278887|gb|AAH45397.1| General transcription factor IIH, polypeptide 2 [Danio rerio]
Length = 392
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 152/323 (47%), Positives = 207/323 (64%), Gaps = 26/323 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +P+R+ K +E FV E+FDQNP+SQIG++T K+ A LTDL G+P+ HI AL
Sbjct: 75 DLKPNRLTSTLKLMEHFVEEYFDQNPISQIGIITTKNKRAEKLTDLAGNPKKHITALRKA 134
Query: 62 LG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L++ L +P++ REVL+++S+L+TCDPG+I E I+ KI
Sbjct: 135 VDSTCVGEPSLYNSLNMALQTLKHMPAHTSREVLVIFSSLTTCDPGNIYELIKTLNGLKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGGSY+V LDESHFKEL++ H PPPA + + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTILTRETGGSYNVILDESHFKELLLLHVKPPPASSS-SECSLIRMG 253
Query: 180 FPQ--------RAGEGSISIC----SCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQL 227
FPQ + + S S+ S E+ +G GY CP+C+A+ ELP +C++CGL L
Sbjct: 254 FPQHVIASLSDQDAKPSFSMAHLDSSSEPELSLG-GYYCPQCRAKYTELPVECKVCGLTL 312
Query: 228 VSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPK 287
VS+PHLARS+HHLFP+ F E TPL C CQ L K C
Sbjct: 313 VSAPHLARSFHHLFPLEAFQE-TPL----EEYEGERFCEACQGEL-----KDKSVFTCLA 362
Query: 288 CKKHFCLECDIYIHESLHNCPGC 310
CKK FC+ECDI+IH++LH CPGC
Sbjct: 363 CKKVFCVECDIFIHDTLHCCPGC 385
>gi|390343479|ref|XP_794347.2| PREDICTED: general transcription factor IIH subunit 2-like
[Strongylocentrotus purpuratus]
Length = 409
Score = 281 bits (718), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 147/329 (44%), Positives = 206/329 (62%), Gaps = 22/329 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D +P+R+A K +E F+ E+FDQNP+SQ+G++T + A LT+LGG+P+ HI AL
Sbjct: 76 DLKPNRLACCTKLLEHFIEEYFDQNPISQVGIITSSNMRAEKLTELGGNPQRHITALEKC 135
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
C + SLQNAL+L L +P + RE+L++ +L+TCDPG+I +TIQ K+ I
Sbjct: 136 NDKPCVKEPSLQNALELAAATLRHMPGHASREILVIMGSLTTCDPGNIHDTIQAMKDHSI 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RC VIGL+A++ +C+ L T G+Y V LDE+HFKEL+MEH PPPA + I+MG
Sbjct: 196 RCCVIGLAADVRVCRKLATVTHGTYGVILDETHFKELLMEHTIPPPARVNTE-PSPIRMG 254
Query: 180 FPQ------RAGEGSISICSCHKEVKVGV-----GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ ++ + S+C CH + K GY CP+C+++ CELP +CR+CGL LV
Sbjct: 255 FPQHVIHTDKSKVSAPSMCMCHLDGKNSEGFGTGGYFCPQCQSKYCELPVECRVCGLTLV 314
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARS+HHLFP+ F+E D + C GCQ + + CPKC
Sbjct: 315 SAPHLARSFHHLFPLDRFEEFK---REDHDHPDSLFCQGCQSHI-----RDQTAYRCPKC 366
Query: 289 KKHFCLECDIYIHESLHNCPGCESLRHSN 317
FCL+C+++I ESLH+CPGC S R S
Sbjct: 367 SNVFCLDCELFIQESLHSCPGCASTRPSQ 395
>gi|409044666|gb|EKM54147.1| hypothetical protein PHACADRAFT_185111 [Phanerochaete carnosa
HHB-10118-sp]
Length = 451
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 145/332 (43%), Positives = 210/332 (63%), Gaps = 28/332 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D RP+R ++ + F+ E+FDQNPL QIG+V ++ G+ + ++ G+P+ ++++
Sbjct: 123 DMRPTRFDLMLQYAREFIAEWFDQNPLGQIGIVGMRGGIGERIGEMSGNPQDVLRSIAER 182
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KL G+ SLQNA+++ + +S +P+ RE+ I++ +L+TCDPG+I +T+ +C + KI
Sbjct: 183 HKLEPVGEPSLQNAIEMARSSMSHLPTNSSREITIIFGSLTTCDPGNIHDTLDECVKDKI 242
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPP----------PAIAE 169
R S++ L+AEM IC+ LC TGG + VAL+E HFK+L+ E PPP P A
Sbjct: 243 RISLVALAAEMKICRELCVKTGGQFGVALNEGHFKDLLFELIPPPAQRALTRANGPGAAN 302
Query: 170 FAIANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
A A+LI MGFP R S S+C CH E+K G+ CPRC A+VC++PTDC ICGL +V
Sbjct: 303 PA-ADLIMMGFPTRLPVTSQPSLCVCHGELK-SEGFLCPRCLAKVCDVPTDCDICGLMIV 360
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC----QQSLLS------SGNK 278
SSPHLARSYHHLFP+ P+ +T L D +TC+GC Q+ +L G
Sbjct: 361 SSPHLARSYHHLFPVKPYTPITNL---DDATSLSATCYGCSVPFQEKVLQPNVPLVDGIS 417
Query: 279 PGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
P CP+C+ FC ECD++IH+ LH CPGC
Sbjct: 418 PLGRYRCPECEHDFCTECDVFIHDVLHCCPGC 449
>gi|449271199|gb|EMC81725.1| General transcription factor IIH subunit 2 [Columba livia]
Length = 395
Score = 280 bits (716), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 204/322 (63%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ K +E FV E+FDQNP+SQ+GL+ K A +T+L G+ + HI AL
Sbjct: 75 DLKPNRLTCTLKLLEYFVDEYFDQNPISQVGLIVTKSKRAEKMTELSGNSKKHITALKKA 134
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ CSG+ SL NAL+L L +P + REVLI++S+L+TCDP +I + I+ K KI
Sbjct: 135 ADMTCSGEPSLYNALNLAMQTLKHMPGHTSREVLIVFSSLTTCDPANIYDLIKCLKAVKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+M H PPPA + + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLMHHVSPPPASSN-SECSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAQLESSSEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F EV PL + C GCQ + +Y+ C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEV-PL----EEYQGERYCQGCQGEM----KDQNVYI-CKVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC+ECD+++H+SLH CPGC
Sbjct: 364 QNAFCVECDVFVHDSLHCCPGC 385
>gi|213511927|ref|NP_001135173.1| general transcription factor IIE, polypeptide 2-2 [Salmo salar]
gi|197632439|gb|ACH70943.1| general transcription factor IIE, polypeptide 2-2 [Salmo salar]
Length = 393
Score = 280 bits (715), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 149/323 (46%), Positives = 201/323 (62%), Gaps = 25/323 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +P+R+ K VE FV E+FDQNP+SQ+G++T K+ A LTDL G+P+ HI AL
Sbjct: 75 DLKPNRLTSTLKLVEYFVEEYFDQNPISQVGIITTKNKRAEKLTDLAGNPKKHIDALKKA 134
Query: 62 LG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
C G+ SL N++ L L +P + REVLI++S+L+TCDP +I E ++ K KI
Sbjct: 135 KDSTCGGEPSLYNSISLAMQTLKHMPGHSSREVLIIFSSLTTCDPANIYELVKTLKALKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGGSY V LDESHF+EL+M H PPPA + + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLTRETGGSYHVILDESHFRELLMFHVKPPPATSS-SECSLIRMG 253
Query: 180 FPQRA------GEGSISICSCHKEVKVG------VGYTCPRCKARVCELPTDCRICGLQL 227
FPQ + S H + G GY CP+C+A+ ELP +C++CGL L
Sbjct: 254 FPQHTIASLSDQDAKPSFSQAHLDSTSGGPGLSMGGYFCPQCQAKYTELPVECKVCGLTL 313
Query: 228 VSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPK 287
VS+PHLARS+HHLFP+ F E +PL L+ +R R C CQ L K CP
Sbjct: 314 VSAPHLARSFHHLFPLEAFQE-SPLLLH---HRER-FCEACQGEL-----KDRSVFTCPS 363
Query: 288 CKKHFCLECDIYIHESLHNCPGC 310
C FC+ECD++IH++LH CP C
Sbjct: 364 CSSVFCVECDLFIHDTLHCCPSC 386
>gi|367002171|ref|XP_003685820.1| hypothetical protein TPHA_0E02960 [Tetrapisispora phaffii CBS 4417]
gi|357524119|emb|CCE63386.1| hypothetical protein TPHA_0E02960 [Tetrapisispora phaffii CBS 4417]
Length = 490
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 149/325 (45%), Positives = 200/325 (61%), Gaps = 22/325 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A+ + F++EFFD+NP+SQ+G+V +++G+AN L+ +GG+P++HI AL +
Sbjct: 170 DLRPNRNAMSIQYAIDFIQEFFDENPISQLGIVIMRNGLANLLSPIGGNPQAHIDALKSI 229
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ G+ SLQNAL++ +GLL +PS+ REVLI++ LS+ DPGDI +TIQ KI
Sbjct: 230 RREEPKGNPSLQNALEMARGLLLPVPSHCTREVLIVFGGLSSTDPGDIHQTIQSLVNEKI 289
Query: 120 RCSVIGLSAEMFICKHLCQ----DTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R VIGLSA++ ICK LC+ D Y V LDE HFKEL E P P L
Sbjct: 290 RVKVIGLSAQVAICKELCKATNYDDESFYRVILDEVHFKELYDEAVTPLPVNKINKAFTL 349
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + + C+CH ++ G GY CP C ++VC LPT C C L L+ S HLAR
Sbjct: 350 VKMGFPTRVFEENPTFCACHSKLVYG-GYFCPNCNSKVCSLPTVCPCCDLMLILSTHLAR 408
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRST--CFGCQQSLLSSGNK------PGLYVACPK 287
SYHHL P+ F EV P + + T CF CQ + S NK C
Sbjct: 409 SYHHLMPLKTFSEV-------PTSETFETENCFSCQMTFPSLRNKKTGELLTSSRYRCQD 461
Query: 288 CKKHFCLECDIYIHESLHNCPGCES 312
C+ FC++CD++IHE LHNCPGCES
Sbjct: 462 CENDFCIDCDLFIHEILHNCPGCES 486
>gi|50308759|ref|XP_454384.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49643519|emb|CAG99471.1| KLLA0E09615p [Kluyveromyces lactis]
Length = 488
Score = 279 bits (713), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 200/323 (61%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A+ + F+ E+FDQNP+SQIG+V ++DG+A ++ + G+P+ HI AL +
Sbjct: 164 DLRPNRYAMTVQYAVDFIHEYFDQNPISQIGVVVMRDGLAQLISQVSGNPQEHIDALKML 223
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ G++SLQNAL++ +GLL +P++ REVLI++ ALST DPGDI +TI I
Sbjct: 224 RRQEPKGNASLQNALEMARGLLLHVPAHCTREVLIVFGALSTTDPGDIHQTIGSLVHENI 283
Query: 120 RCSVIGLSAEMFICKHLCQDT----GGSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R VIGLSA++ +CK LC+ T Y + L++ HFKEL P P L
Sbjct: 284 RAKVIGLSAQVSVCKELCKQTNYGDNSYYGIILNDHHFKELFTSAVTPLPVSKVNKGFTL 343
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + S C+CH + G GY CP CK +VC LPT C C L L+ S HLAR
Sbjct: 344 VKMGFPTRVFEDTPSFCACHSTLVHG-GYVCPNCKTKVCSLPTVCPSCDLMLILSTHLAR 402
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLS-SGNKPGLYVA-----CPKCK 289
SYHHL P+ F EV P+ + P + CFGCQ+ + NK G + C +C
Sbjct: 403 SYHHLMPLKTFQEV-PVSDSFPTD----YCFGCQKRFPTLRNNKTGELLTSSRYRCEECD 457
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+ FC++CD++IHE+LHNCPGCES
Sbjct: 458 QDFCIDCDVFIHETLHNCPGCES 480
>gi|149727084|ref|XP_001504093.1| PREDICTED: general transcription factor IIH subunit 2 [Equus
caballus]
Length = 395
Score = 279 bits (713), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 204/322 (63%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIITKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 62 LG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP DI IQ K +KI
Sbjct: 135 VAMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSDIYGLIQTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSN-SECSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSSSEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + N R C+GCQ L +Y+ C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL---EEHNGER-FCYGCQGEL----KDQHVYI-CTVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 RHVFCVDCDVFVHDSLHCCPGC 385
>gi|145348017|ref|XP_001418454.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578683|gb|ABO96747.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 407
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 143/309 (46%), Positives = 197/309 (63%), Gaps = 12/309 (3%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DFRP+R++VV + +F+RE+F+QNPLSQ+G++ ++GVA LT+L GSPE+H AL
Sbjct: 105 DFRPNRLSVVGQCATSFIREYFNQNPLSQLGIIVARNGVAERLTELSGSPEAHAAALRNA 164
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
L SGD SLQN L++ + L IPSYG REVL + S+LSTCDPG++ I K +K+R
Sbjct: 165 LDASGDFSLQNTLNVARTSLKSIPSYGSREVLYIVSSLSTCDPGNVWTEIAATKAAKVRV 224
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
SV+ ++AE+ + + L ++TGG+Y V+L+ H +LIM HAPPPP + ++L++MGFP
Sbjct: 225 SVVAVAAELHVSRRLTEETGGTYGVSLNADHLDDLIMAHAPPPPLPEDATKSSLVQMGFP 284
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
Q+ ++ V G Y CPRC R+ ELP+ C +C L LVSSPHLARSYHHLF
Sbjct: 285 QKKHLSKDALV-----VGTGGEYVCPRCSGRIDELPSQCAVCRLTLVSSPHLARSYHHLF 339
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
P+APF E T ++ + C C L+ + C +C FC CD YIH
Sbjct: 340 PVAPFKEYT---RDEASAKEVLECKAC----LALIKQTDPASKCEQCSNVFCFACDCYIH 392
Query: 302 ESLHNCPGC 310
E LHNCP C
Sbjct: 393 EKLHNCPHC 401
>gi|156546970|ref|XP_001599241.1| PREDICTED: general transcription factor IIH subunit 2-like [Nasonia
vitripennis]
Length = 404
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 145/324 (44%), Positives = 205/324 (63%), Gaps = 24/324 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R K +E F+ EFF QNP+SQ+G++ ++ A +TDL G+P+ ++ L +
Sbjct: 76 DLKPTRFLCALKLLEDFIDEFFYQNPISQLGIIITRNKRAEKITDLTGNPKKPLQDLKNL 135
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ +G+ SLQN+L+L L +PS+ +E+L++ +L+TCDPGDI ETIQ K +
Sbjct: 136 QQTSFTGEPSLQNSLELAAKTLKMLPSHASKEILLIMGSLTTCDPGDIGETIQSLKSDGV 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVIGL+AE+ ICK + +TGG + V LD+ HFKE + H PPPA A L+KMG
Sbjct: 196 RCSVIGLAAELNICKIMAVNTGGEHGVVLDDKHFKEKLTAHVDPPPAATRLDAA-LVKMG 254
Query: 180 FPQRA-----GEGSISICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP A E S+++C CH + GY CP+C ++ CELP +CR CGL LV
Sbjct: 255 FPHHALQASTTELSMAVCMCHADKPDESSRFTNPGYLCPQCLSKHCELPVECRGCGLTLV 314
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYH+LFPI F E DP ++C+ CQ++ + K +YV C KC
Sbjct: 315 SAPHLARSYHYLFPIKHFKEEQ--YEGDP-----ASCYACQKTFVELDKK--VYV-CEKC 364
Query: 289 KKHFCLECDIYIHESLHNCPGCES 312
K+ FCL+C+I+IHESLH CPGC +
Sbjct: 365 KQTFCLDCEIFIHESLHTCPGCAT 388
>gi|336371442|gb|EGN99781.1| hypothetical protein SERLA73DRAFT_179993 [Serpula lacrymans var.
lacrymans S7.3]
gi|336384207|gb|EGO25355.1| hypothetical protein SERLADRAFT_465378 [Serpula lacrymans var.
lacrymans S7.9]
Length = 450
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 138/329 (41%), Positives = 208/329 (63%), Gaps = 24/329 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + + F+ E+FDQNPL QIG+V ++ G+ + ++ G+P+ +K++ +
Sbjct: 124 DMRPTRFDLTLEYAREFITEWFDQNPLGQIGVVGMRAGIGERIGEMSGNPQDVLKSIADR 183
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L +G+ SLQNA+D+ + +S +P++ RE+L+++ +L+TCDPG+I +T+ C ++KI
Sbjct: 184 HRLEPTGEPSLQNAVDMARSSMSHLPTHSSREILVIFGSLTTCDPGNIHDTLDACVKNKI 243
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP---------AIAEF 170
R SV+ L+AEM IC+ LC TGG + VAL+E HFK+L+ E PPP + A
Sbjct: 244 RISVVALAAEMKICRELCDKTGGQFGVALNEGHFKDLLFELVPPPAQRAMTRAAGSGATN 303
Query: 171 AIANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
A+L+ MGFP R + S S+C CH E+K G+ CPRC A+VC++PTDC ICGL +VS
Sbjct: 304 PAADLMMMGFPTRLPDTSPASLCVCHSEMK-SEGFLCPRCMAKVCDVPTDCDICGLMIVS 362
Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQ--------SLLSSGNKPGL 281
SPHLARSYHHLFP+ P+D + L + P + C C + S + G P
Sbjct: 363 SPHLARSYHHLFPVKPYDAIGSLA-DTP--DAGLACHACARIFPTTALTSTSTEGMSPLN 419
Query: 282 YVACPKCKKHFCLECDIYIHESLHNCPGC 310
CP+C FC +CD+++H+ +H CPGC
Sbjct: 420 RYRCPECHNDFCGDCDVFVHDVVHCCPGC 448
>gi|393246283|gb|EJD53792.1| TFIIH basal transcription factor complex, subunit SSL1 [Auricularia
delicata TFB-10046 SS5]
Length = 464
Score = 278 bits (711), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 143/330 (43%), Positives = 209/330 (63%), Gaps = 26/330 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R ++ + V FV E+FDQNPL QIG+V ++ G+A + D+ GSP+ +K++
Sbjct: 137 DLRPTRFDLMLEYVREFVTEWFDQNPLGQIGIVGMRAGLAERVCDMSGSPQDVLKSISER 196
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KL +GD SLQNAL+L + +S +PS+ REVL+++ +++T DPG+I +T+ C E KI
Sbjct: 197 HKLEPTGDPSLQNALELSRASMSHLPSHSSREVLMIFGSITTVDPGNIHDTLAACLEDKI 256
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP--------AIAEFA 171
R S++ L+AEM IC+ LC+ +GG + VAL+E HFK+++ E PPP A
Sbjct: 257 RISIVALAAEMRICRELCEKSGGVFGVALNEGHFKDVMFELIPPPAQRAAAKAAAGGGGT 316
Query: 172 IANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSS 230
A+L+ M FP R E S ++C CH E+K G+ CPRC ARVC++PTDC +CG+ +VSS
Sbjct: 317 GADLLLMAFPVRLPESSAPTLCVCHLELK-SEGFLCPRCFARVCDVPTDCDVCGIMIVSS 375
Query: 231 PHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-----------LSSGNKP 279
PHLARSYHHLFP+ + TP+ + ++ +TC GC + + G P
Sbjct: 376 PHLARSYHHLFPVKAY---TPVMTLEGEQQTSATCHGCSMAFQPHKAAASAGATAEGVSP 432
Query: 280 GLYVACPKCKKHFCLECDIYIHESLHNCPG 309
CP+C+ FC ECDI++H+ LH CPG
Sbjct: 433 LGRYRCPECQNDFCNECDIFVHDVLHVCPG 462
>gi|344272627|ref|XP_003408133.1| PREDICTED: general transcription factor IIH subunit 2-like
[Loxodonta africana]
Length = 395
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 143/322 (44%), Positives = 203/322 (63%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L++ L +P + REVLI++S+L+TCDP +I + ++ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLNMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLVKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSN-SECSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C++CGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKVCGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ N R C+GCQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEIPLEEYNGER-----FCYGCQGEL----KDQHVYV-CTVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385
>gi|170098262|ref|XP_001880350.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164644788|gb|EDR09037.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 396
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 135/330 (40%), Positives = 208/330 (63%), Gaps = 26/330 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIK--ALM 59
D RP+R ++ + F+ E+FDQNPL QIG+V ++ G+ + ++ G+P+ +K A
Sbjct: 70 DMRPTRFDLMLQYAREFIAEWFDQNPLGQIGVVGMRAGIGERIGEMSGNPQDVLKYIAER 129
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KL +G+ SLQNA+++ + ++ +P++ RE+LI++ +L+TCDPG+I +TI C ++KI
Sbjct: 130 HKLEPTGEPSLQNAIEMARSSMNHLPTHSSREILIIFGSLTTCDPGNIHDTIDTCVKNKI 189
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAI------- 172
R SV+ L+AEM +C+ LC TGG + VA++E HFK+L+ E PPP A +
Sbjct: 190 RISVVALAAEMKVCRELCNKTGGQFGVAMNEGHFKDLLFELIPPPAQHATSRVAGAGTGS 249
Query: 173 ----ANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQL 227
A+L+ MGFP R + + ++C CH E+K G+ CPRC ++VC++PTDC ICGL +
Sbjct: 250 SNPAADLMMMGFPTRLPDATAPTLCVCHSELK-SEGFLCPRCLSKVCDVPTDCDICGLMI 308
Query: 228 VSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-------LSSGNKPG 280
VSSPHLARSYHHLFP+ P+D + + DP S+C C + + G P
Sbjct: 309 VSSPHLARSYHHLFPVKPYDALPYGDVTDP----SSSCHACSRPFPTAVSASATEGVSPV 364
Query: 281 LYVACPKCKKHFCLECDIYIHESLHNCPGC 310
CP+C FC +CD+++H+ +H CPGC
Sbjct: 365 GRYKCPECHHDFCADCDVFVHDVIHCCPGC 394
>gi|225719554|gb|ACO15623.1| General transcription factor IIH subunit 2 [Caligus clemensi]
Length = 397
Score = 277 bits (709), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 140/326 (42%), Positives = 205/326 (62%), Gaps = 24/326 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D +P+R+ V K +E F+ E+F NP+SQ+G++ ++ + L++LGG+P H + L
Sbjct: 74 DMKPTRLLSVMKLLEVFIEEYFYLNPISQLGIILTRNKRSEVLSELGGNPRKHTERLRKA 133
Query: 60 -GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
+ C+G+ SLQNALD L +PS+G +E+L ++ +L+TCDP DI TI CK +
Sbjct: 134 RDDIVCAGEPSLQNALDTALNYLKPMPSHGSKEILSVFGSLTTCDPSDINATIINCKAAN 193
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
+R S I L+AE+ I K L + TGG ++V LD+ HFKE++ PPP+ + A+LIKM
Sbjct: 194 VRASFISLTAEVRIYKELTKVTGGDFNVILDDVHFKEILSSQLEPPPSATQMD-ASLIKM 252
Query: 179 GFPQRAGEGS------ISICSCHKEV---KVG-VGYTCPRCKARVCELPTDCRICGLQLV 228
GFP G+ + + +C CH E K+ G+ CP+C A+ CELP +C CGL LV
Sbjct: 253 GFPCHTGDDTSDPRSGLGLCMCHLENNPPKISHTGFLCPQCSAKYCELPVECVSCGLTLV 312
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ PFD+ ++S CF C + L N+ ACP+C
Sbjct: 313 SAPHLARSYHHLFPLPPFDQQI------SSSQSIRECFACMRGL----NRESKVFACPQC 362
Query: 289 KKHFCLECDIYIHESLHNCPGCESLR 314
K++C++CDI+IHE+LH+CPGC S R
Sbjct: 363 HKNYCIDCDIFIHETLHSCPGCASTR 388
>gi|426246343|ref|XP_004016954.1| PREDICTED: general transcription factor IIH subunit 2 [Ovis aries]
Length = 395
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 204/322 (63%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R S+IGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + + +LI+MG
Sbjct: 195 RVSIIGLSAEVRVCTALARETGGTYHVILDESHYKELLTHHVSPPPASSN-SECSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + N R C+ CQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL---EEHNGER-FCYACQGEL----KDQHVYV-CSVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385
>gi|84370085|ref|NP_001033619.1| general transcription factor IIH subunit 2 [Bos taurus]
gi|122142864|sp|Q2TBV5.1|TF2H2_BOVIN RecName: Full=General transcription factor IIH subunit 2; AltName:
Full=General transcription factor IIH polypeptide 2
gi|83638634|gb|AAI09594.1| General transcription factor IIH, polypeptide 2, 44kDa [Bos taurus]
gi|296475883|tpg|DAA17998.1| TPA: general transcription factor IIH subunit 2 [Bos taurus]
gi|440911341|gb|ELR61023.1| General transcription factor IIH subunit 2 [Bos grunniens mutus]
Length = 395
Score = 276 bits (706), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 204/322 (63%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKSLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R S+IGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + + +LI+MG
Sbjct: 195 RVSIIGLSAEVRVCTALARETGGTYHVILDESHYKELLTHHVSPPPASSN-SECSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + N R C+ CQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL---EEHNGER-FCYACQGEL----KDQHVYV-CSVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385
>gi|449514090|ref|XP_004174671.1| PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH
subunit 2 [Taeniopygia guttata]
Length = 410
Score = 276 bits (706), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 202/322 (62%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIGL+ K A +T+L G+ + H+ AL
Sbjct: 90 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGLIVTKSKRAEKMTELSGNSKKHVTALKKA 149
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ CSG+ SL N+L+L L +P + REVL++ S+L+TCDP +I + I+ K K+
Sbjct: 150 VDMNCSGEPSLYNSLNLAMQTLKHMPGHTSREVLVVLSSLTTCDPANIYDLIKCLKAVKV 209
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+M H PPPA + + +LI+MG
Sbjct: 210 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLMHHVSPPPA-SSTSECSLIRMG 268
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 269 FPQHTIASLSDQDAKPSFSMVQLENNSDPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 328
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F EV PL + C GCQ + +Y+ C C
Sbjct: 329 SAPHLARSYHHLFPLDAFQEV-PL----EEYKGERYCHGCQGEM----KDQNVYI-CKVC 378
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC+ECD+++H+SLH CPGC
Sbjct: 379 QNAFCVECDLFVHDSLHCCPGC 400
>gi|391326656|ref|XP_003737828.1| PREDICTED: general transcription factor IIH subunit 2-like
[Metaseiulus occidentalis]
Length = 390
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 141/320 (44%), Positives = 204/320 (63%), Gaps = 23/320 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RPSR+ V ++ FV F+D+NP+SQ+ ++ + A ++DL G+P H + L +
Sbjct: 76 DLRPSRIQCVLHLLDDFVDRFYDENPISQLAIIATCNKRAEKISDLSGNPLKHKEGLAKI 135
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+G+ SLQN++++ +L +P++ REV+I+ +L+TCDP DI ETI++ K+ +
Sbjct: 136 KDRLPTGEPSLQNSINMAAAVLQHMPAHSSREVVIILGSLTTCDPTDINETIEEAKKLGL 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVIGL+AE+++C+ L + TGGSY + +DE H +EL+ H PPPA+ + ++KMG
Sbjct: 196 RCSVIGLAAEVYVCRRLTEITGGSYHIVVDEDHLRELLANHLLPPPALNNIECS-MVKMG 254
Query: 180 FP--QRAGEGSISICSCHKEVKVGV-------GYTCPRCKARVCELPTDCRICGLQLVSS 230
FP + E + ++C CH E G+ CP+CK++ CELP +CRICGL LVS+
Sbjct: 255 FPFHKNEQEDTPALCQCHLEAAATESVHITTGGFLCPQCKSKYCELPAECRICGLTLVSA 314
Query: 231 PHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKK 290
PHLARSYHHLFP+ F E T L D R TCF CQ SS +Y C KCK+
Sbjct: 315 PHLARSYHHLFPLDNFKE-THLLKGDTR-----TCFSCQ----SSVEGQNMYT-CEKCKE 363
Query: 291 HFCLECDIYIHESLHNCPGC 310
FCL+CDI+IH +LH CPGC
Sbjct: 364 EFCLDCDIFIHATLHLCPGC 383
>gi|348543161|ref|XP_003459052.1| PREDICTED: general transcription factor IIH subunit 2-like
[Oreochromis niloticus]
Length = 392
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 150/322 (46%), Positives = 197/322 (61%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +P+R+ K +EAFV E+FDQNP+SQ+G++T K+ A LTDL G+P+ HI ALM
Sbjct: 75 DLKPNRLTSTLKLMEAFVDEYFDQNPISQVGIITTKNKRAEKLTDLAGNPKKHITALMKA 134
Query: 62 LG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L+L L +P + RE+LI+ S+L+TCDP +I E I+ K K+
Sbjct: 135 VDTVCVGEPSLYNSLNLAIQTLKHMPGHTSREILIILSSLTTCDPANIYELIKTLKSLKV 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGGSY V LDESHFKEL+M H PPPA + + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLTRETGGSYHVILDESHFKELLMLHIKPPPA-SSSSECSLIRMG 253
Query: 180 FPQRA------GEGSISICSCHKEVKVGV-----GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S H + G GY CP+C A ELP +C++CGL LV
Sbjct: 254 FPQHTIASVTDQDAKPSFSMSHLDSSSGPGLSLGGYFCPQCHATYTELPVECKVCGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARS+HHLFP+ F E L R C CQ L K +CP C
Sbjct: 314 SAPHLARSFHHLFPLQAFIESPVENLQGDR-----FCQACQAEL-----KDKSVFSCPSC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
FC+ECD++IHESLH CP C
Sbjct: 364 HSVFCVECDLFIHESLHCCPCC 385
>gi|170035936|ref|XP_001845822.1| TFIIH basal transcription factor complex p44 subunit [Culex
quinquefasciatus]
gi|167878421|gb|EDS41804.1| TFIIH basal transcription factor complex p44 subunit [Culex
quinquefasciatus]
Length = 411
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 138/324 (42%), Positives = 203/324 (62%), Gaps = 22/324 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ K +E F+ EFFDQNP+SQ+G++ +K A +T+LGGS HIKA+ +
Sbjct: 76 DLKPTRLLCTLKLLEIFIEEFFDQNPISQLGVIAMKAKRAEKITELGGSCRKHIKAVGNL 135
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KL +G+ SLQN L+L L +PS+ RE+L++ +L+TCDP DI TI K I
Sbjct: 136 NKLNLTGEPSLQNGLELALKTLKMVPSHASREILVVMGSLTTCDPTDIHITIDALKSEGI 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSV+ LSAE+ +CK LC +TGG Y LD+SHFK+ +++H PP A + + +IKMG
Sbjct: 196 RCSVVSLSAEIRVCKFLCTETGGVYGAVLDDSHFKDQLLQHIDPPQAGNQQEFS-MIKMG 254
Query: 180 FPQ-RAGEGS---ISICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQLVS 229
FP + EG +++C CH + GY CP+C ++ CELP +C CGL L S
Sbjct: 255 FPHGKTEEGKDPPLTMCMCHIDSTDEPSKLTSGGYHCPQCYSKYCELPVECTACGLTLAS 314
Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCK 289
+PHLARSYHHLFP+ + E+ P + + C+GCQ+ L +Y C C+
Sbjct: 315 APHLARSYHHLFPVPHYQEL-------PFQQQATNCYGCQK-LFGETTDKTIY-QCATCR 365
Query: 290 KHFCLECDIYIHESLHNCPGCESL 313
+ FC++CDI++HE++H+C GC ++
Sbjct: 366 QFFCIDCDIFVHETMHSCVGCTTI 389
>gi|363744781|ref|XP_003643123.1| PREDICTED: general transcription factor IIH subunit 2-like [Gallus
gallus]
Length = 395
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 202/322 (62%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQ+GL+ K A +T+L G+P+ HI AL
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQMGLIVTKSKRAEKMTELSGNPKKHIAALKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L+L L +P + REVLI++S+L+TCDP +I + I+ K KI
Sbjct: 135 VDMNCQGEPSLYNSLNLAMQTLKHMPGHTSREVLIVFSSLTTCDPANIYDLIKCLKAVKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDE+H+KEL+M H PPPA + + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLTRETGGTYHVILDETHYKELLMHHVSPPPA-SSTSECSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + + GY CP+C+A+ ELP +C+ICGL LV
Sbjct: 254 FPQHTTASLSDQDAKPSFSMAQLENNSEPCLTLDGYFCPQCRAKYSELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + C GCQ + +Y+ C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL----EEYQGERYCQGCQAEI----KDQNVYI-CKTC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
K FC+ECD+++H+SLH CPGC
Sbjct: 364 KNAFCVECDVFVHDSLHCCPGC 385
>gi|366992932|ref|XP_003676231.1| hypothetical protein NCAS_0D02890 [Naumovozyma castellii CBS 4309]
gi|342302097|emb|CCC69870.1| hypothetical protein NCAS_0D02890 [Naumovozyma castellii CBS 4309]
Length = 485
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 148/323 (45%), Positives = 200/323 (61%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + F+ EFFDQNP+SQIG+V +++G+A+ ++ + G+P+ HI AL +
Sbjct: 164 DLRPTRHAMMIQYAIDFIHEFFDQNPISQIGIVIMRNGLAHLVSQVSGNPQDHIDALKAI 223
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K +G+ SLQNAL++ +GLL +P++ REVLI++ +LS+ DPGDI +TI + KI
Sbjct: 224 RKQEPAGNPSLQNALEMARGLLLPVPAHCTREVLIIFGSLSSTDPGDIHQTINSLVQEKI 283
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R VIGLSA++ ICK LC+ T Y + LDE+HFKEL E P P L
Sbjct: 284 RVKVIGLSAQVAICKELCKATNYGDDSFYKILLDETHFKELFDEAVTPLPVNKINKGFTL 343
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + S C+CH ++ G GY CP C ++VC LPT C C L L+ S HLAR
Sbjct: 344 VKMGFPTRIFEDTPSFCACHSKLVYG-GYFCPSCHSKVCSLPTVCPCCDLMLILSTHLAR 402
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
SYHHL P+ F EV P P CF CQ + K G + C CK
Sbjct: 403 SYHHLMPLKTFVEV-PATETFP----TENCFSCQLVFPILKNQKTGKLLTSSRYRCDSCK 457
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
K FC++CD++IHE LHNCPGCES
Sbjct: 458 KDFCIDCDVFIHEILHNCPGCES 480
>gi|116812559|ref|NP_001070896.1| general transcription factor IIH subunit 2 [Rattus norvegicus]
gi|182701382|sp|A0JN27.1|TF2H2_RAT RecName: Full=General transcription factor IIH subunit 2; AltName:
Full=Basic transcription factor 2 44 kDa subunit;
Short=BTF2 p44; AltName: Full=General transcription
factor IIH polypeptide 2; AltName: Full=TFIIH basal
transcription factor complex p44 subunit
gi|116487735|gb|AAI26098.1| General transcription factor IIH, polypeptide 2 [Rattus norvegicus]
Length = 396
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 145/324 (44%), Positives = 204/324 (62%), Gaps = 27/324 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKTAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDE+H+KEL+ H PPPA + + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDETHYKELLARHVSPPPA-SSGSECSLIRMG 253
Query: 180 FPQRA--------GEGSISIC-----SCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
FPQ + S S+ S + +G GY CP+C+A+ CELP +C+ICGL
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG-GYFCPQCRAKYCELPVECKICGLT 312
Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
LVS+PHLARSYHHLFP+ F E+ PL + C+GCQ L +YV C
Sbjct: 313 LVSAPHLARSYHHLFPLDAFQEI-PL----EEYKGERFCYGCQGEL----KDQHVYV-CT 362
Query: 287 KCKKHFCLECDIYIHESLHNCPGC 310
C+ FC++CD+++H+SLH CPGC
Sbjct: 363 VCRNVFCVDCDVFVHDSLHCCPGC 386
>gi|308805072|ref|XP_003079848.1| basic transcription factor 2, 44kD subunit-related (ISS)
[Ostreococcus tauri]
gi|116058305|emb|CAL53494.1| basic transcription factor 2, 44kD subunit-related (ISS)
[Ostreococcus tauri]
Length = 414
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 145/314 (46%), Positives = 192/314 (61%), Gaps = 8/314 (2%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DFRP+R++VV AFVREFF+QNPLSQ+G++ ++GVA LT+L GSPE+H AL
Sbjct: 101 DFRPNRLSVVGHCAAAFVREFFNQNPLSQLGIIVARNGVAERLTELSGSPEAHCAALRNA 160
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
L SGD SLQN L+ + L IPSYG REVL + S+L+TCDPG++ IQ K +K+R
Sbjct: 161 LDASGDFSLQNTLNAARTSLKSIPSYGSREVLYIMSSLATCDPGNVWTEIQATKAAKVRV 220
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
SV+ ++AE+ + + L ++TGG+Y V+++ H +LIM HAPPPP + + L++MGFP
Sbjct: 221 SVVAVAAELHVARRLSEETGGTYGVSMNADHLDDLIMAHAPPPPLAEDATKSCLVQMGFP 280
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
Q+ ++ V Y CPRC R+ ELP+ C +C L LVSSPHLARSYHHLF
Sbjct: 281 QKKHVSKDALI-----VGTRGDYVCPRCSGRIDELPSQCTVCRLTLVSSPHLARSYHHLF 335
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
P+ F E N S C + L+S L C C FC CDI+IH
Sbjct: 336 PVPAFKEYAS--RNVSAKESGIVALEC-SACLASITDASLASECEHCSNVFCFACDIFIH 392
Query: 302 ESLHNCPGCESLRH 315
E LHNCP C H
Sbjct: 393 ERLHNCPHCSCATH 406
>gi|149059192|gb|EDM10199.1| general transcription factor II H, polypeptide 2 (predicted),
isoform CRA_b [Rattus norvegicus]
gi|149059193|gb|EDM10200.1| general transcription factor II H, polypeptide 2 (predicted),
isoform CRA_b [Rattus norvegicus]
Length = 398
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 145/324 (44%), Positives = 204/324 (62%), Gaps = 27/324 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKTAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDE+H+KEL+ H PPPA + + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDETHYKELLARHVSPPPA-SSGSECSLIRMG 253
Query: 180 FPQRA--------GEGSISIC-----SCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
FPQ + S S+ S + +G GY CP+C+A+ CELP +C+ICGL
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG-GYFCPQCRAKYCELPVECKICGLT 312
Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
LVS+PHLARSYHHLFP+ F E+ PL + C+GCQ L +YV C
Sbjct: 313 LVSAPHLARSYHHLFPLDAFQEI-PL----EEYKGERFCYGCQGEL----KDQHVYV-CT 362
Query: 287 KCKKHFCLECDIYIHESLHNCPGC 310
C+ FC++CD+++H+SLH CPGC
Sbjct: 363 VCRNVFCVDCDVFVHDSLHCCPGC 386
>gi|395825440|ref|XP_003785941.1| PREDICTED: general transcription factor IIH subunit 2-like
[Otolemur garnettii]
Length = 395
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 142/322 (44%), Positives = 203/322 (63%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHIASLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSIAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R S+IGLSAE+ +C L ++TGG+Y V LDESH+KEL+ + PPPA + + +LI+MG
Sbjct: 195 RVSIIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHYVSPPPASSN-SECSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C++CGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNSEPGLTLGGYFCPQCRAKYCELPVECKVCGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + C+GCQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL----EEHHGERFCYGCQGDL----KDQHVYV-CNAC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385
>gi|157115005|ref|XP_001652513.1| btf [Aedes aegypti]
gi|108877047|gb|EAT41272.1| AAEL007056-PA [Aedes aegypti]
Length = 408
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 141/326 (43%), Positives = 203/326 (62%), Gaps = 26/326 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R K +E F+ EFFDQNP+SQ+G++ +K A +++LGGS H+KA+ G
Sbjct: 76 DLKPTRFLCTLKLLEIFIEEFFDQNPISQLGVIAMKAKRAEKISELGGSCRKHVKAVAGL 135
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KL +G+ SLQN LDL L +P++ REVL++ +L+TCDP DI TI+ K I
Sbjct: 136 SKLNLTGEPSLQNGLDLALKTLKMVPAHASREVLVVMGSLTTCDPTDIHITIEALKAEGI 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA--IAEFAIANLIK 177
RCSV+ LSAE+ +CK LC +TGG Y LD++HFK+ +++H PP A EF +LIK
Sbjct: 196 RCSVVSLSAEIRVCKFLCTETGGVYGAVLDDAHFKDQLLQHIDPPQAGNQQEF---SLIK 252
Query: 178 MGFPQ-RAGEGS---ISICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQL 227
MGFP + EG +++C CH + GY CP+C ++ C+LP +C CGL L
Sbjct: 253 MGFPHGKTEEGKDPPLTMCMCHIDSTDEPSKLTSGGYHCPQCYSKYCDLPVECAACGLTL 312
Query: 228 VSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPK 287
S+PHLARSYHHLFP+ F E++ S C+GCQ+ S +K C
Sbjct: 313 ASAPHLARSYHHLFPVPHFQELS-------FEHQASICYGCQKPFTESTDK--TVYQCGT 363
Query: 288 CKKHFCLECDIYIHESLHNCPGCESL 313
C++ FC++CDI+IHE++H+C GC ++
Sbjct: 364 CQQVFCIDCDIFIHETMHSCVGCTTI 389
>gi|426197578|gb|EKV47505.1| hypothetical protein AGABI2DRAFT_204710 [Agaricus bisporus var.
bisporus H97]
Length = 452
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 141/332 (42%), Positives = 212/332 (63%), Gaps = 28/332 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGG---SPE---SHI 55
D RP+R ++ + F+ E+FDQNPL QIG+V ++ G+ + DL G +P+ S+I
Sbjct: 124 DMRPTRFDLMLQYAREFILEWFDQNPLGQIGVVGMRAGLGERIGDLSGESGNPQEVLSYI 183
Query: 56 KALMGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCK 115
K KL +G+ SLQNA+++ + +S +P++ RE++I++ +L+TCDPG+I ET++ C
Sbjct: 184 KE-RHKLEPTGEPSLQNAIEMAKASMSHLPTHSSREIIIIFGSLTTCDPGNIHETLETCV 242
Query: 116 ESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP---------A 166
++K+R SV+ L+AEM IC+ L TGG + VA++E HFK+L+ E PPP A
Sbjct: 243 KNKVRISVVALAAEMKICRELSDKTGGQFGVAMNEGHFKDLLFELVPPPAQRALTRTMGA 302
Query: 167 IAEFAIANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGL 225
+ A+L+ MGFP R E S ++C CH E+K G+ CPRC A+VC++PTDC +CGL
Sbjct: 303 PSASLSADLMMMGFPTRLPEASAPTLCVCHAELK-SQGFLCPRCLAKVCDVPTDCHVCGL 361
Query: 226 QLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP---GLY 282
+VSSPHLARSYHHLFP+ P+D V L + + C GC + ++ + P G+
Sbjct: 362 MIVSSPHLARSYHHLFPVKPYDAVPNL---EDIPQPSDACHGCAKLFPTTTSIPIGEGVS 418
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGC 310
V CP C+ FC +CD+++HE +H CPGC
Sbjct: 419 VVGRYRCPDCRHDFCSDCDVFVHEVIHCCPGC 450
>gi|156387918|ref|XP_001634449.1| predicted protein [Nematostella vectensis]
gi|156221532|gb|EDO42386.1| predicted protein [Nematostella vectensis]
Length = 400
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 138/318 (43%), Positives = 194/318 (61%), Gaps = 15/318 (4%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +P+R++ AK +E F+ E+FDQNP+SQ+GL+ K+ A L++L G+P+ HI +
Sbjct: 79 DLKPNRLSCSAKLLENFITEYFDQNPISQLGLIITKNKRAEKLSELSGNPKLHISTIQSA 138
Query: 62 LG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
C G+ SLQNAL L L +P + REVL+L +L++CDPGDI ET++ K +
Sbjct: 139 CSKPCVGEPSLQNALVLAMQSLKHMPGHVSREVLVLMGSLTSCDPGDITETVKSLKNMNV 198
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCS++GL+AEM +CK +C T GSY V LDE HFK+L+MEH PP A + L +
Sbjct: 199 RCSIVGLAAEMRVCKQICSSTNGSYRVVLDERHFKDLLMEHVIPPTATVRICLV-LWYVI 257
Query: 180 FPQRAGEGSISICSCHKEVK-----VGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
F S+ H + K GY CP+C+++ C+LP +C++CGL LVS+PHLA
Sbjct: 258 FESYCYIHVFSVIRSHMDNKNIKEFTTKGYFCPQCRSKYCDLPVECKVCGLTLVSAPHLA 317
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCL 294
RSY HLFP+ + EV N P + CQ ++ K +CP+C+ FCL
Sbjct: 318 RSYQHLFPLQQYTEV-----NVPAQIPGQPPWTCQACQITLTEKTA--CSCPECRHVFCL 370
Query: 295 ECDIYIHESLHNCPGCES 312
+CDIYIHESLH CPGC +
Sbjct: 371 DCDIYIHESLHTCPGCTA 388
>gi|365990367|ref|XP_003672013.1| hypothetical protein NDAI_0I02010 [Naumovozyma dairenensis CBS 421]
gi|343770787|emb|CCD26770.1| hypothetical protein NDAI_0I02010 [Naumovozyma dairenensis CBS 421]
Length = 520
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 147/325 (45%), Positives = 198/325 (60%), Gaps = 22/325 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + FV EFFDQNP+SQ+G++ +K+G+A+ ++ +GG+P+ HI L +
Sbjct: 199 DLRPNRHAMMIQYAIDFVHEFFDQNPISQLGILVMKNGLAHLVSQVGGNPQDHIDVLKSI 258
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K +G+ SLQNAL++ +GLL +P++ REVLI++ +LS+ DPGDI +TI + KI
Sbjct: 259 RKQEPAGNPSLQNALEMARGLLLPVPAHCTREVLIIFGSLSSTDPGDIHQTINSLVQEKI 318
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R VIGLSA++ ICK LC T Y + LDE+HFKEL E P P L
Sbjct: 319 RVKVIGLSAQVAICKELCMATNYGDDSFYKILLDETHFKELFDEAVTPLPVNKINKGFTL 378
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E S + CSCH ++ G GY CP C ++VC LPT C C L L+ S HLAR
Sbjct: 379 VKMGFPTRIFEDSPTFCSCHSKLVYG-GYFCPSCHSKVCSLPTVCPCCDLMLILSTHLAR 437
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRST--CFGCQQSLLSSGNK------PGLYVACPK 287
SYHHL P+ F EV P N + T CF CQ + N+ C
Sbjct: 438 SYHHLMPLKTFLEV-------PSNETFPTENCFSCQLKFPTLKNQRNGSLLTSSRYRCSS 490
Query: 288 CKKHFCLECDIYIHESLHNCPGCES 312
C FC++CD++IHE LHNCPGCES
Sbjct: 491 CHSDFCIDCDVFIHEILHNCPGCES 515
>gi|449541165|gb|EMD32151.1| hypothetical protein CERSUDRAFT_119123 [Ceriporiopsis subvermispora
B]
Length = 445
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 139/329 (42%), Positives = 205/329 (62%), Gaps = 24/329 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R ++ + F+ E+FDQNPL QIG+V ++ G+ + ++ G+P+ +KA+
Sbjct: 119 DMRPTRFDLMLQYAREFIIEWFDQNPLGQIGIVGMRSGLGERIGEMTGNPQEVLKAISER 178
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KL G+ SLQNA+D+ + +S +P + RE+ I++ +L+TCDPG+I +T++ C + +I
Sbjct: 179 HKLEPGGEPSLQNAIDMARSSMSHLPVHSSREITIIFGSLTTCDPGNIHDTLEDCIKDRI 238
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIA--------EFA 171
R S++ L+AEM IC+ LC TGG + VAL+E HFK+L+ E PPP A A
Sbjct: 239 RISIVALAAEMKICRDLCDKTGGQFGVALNEGHFKDLLFELIPPPVARALARTGGAPANP 298
Query: 172 IANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSS 230
A+L+ MGFP R + S +C CH ++K G+ CPRC+A+VC++PTDC ICGL +VSS
Sbjct: 299 AADLMIMGFPTRLPDTSPPGLCVCHSKLK-SEGFLCPRCRAKVCDVPTDCDICGLMIVSS 357
Query: 231 PHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL---------LSSGNKPGL 281
PHLARSYHHLFP+ + V + L D N + S C GC + S G P
Sbjct: 358 PHLARSYHHLFPVKAYQAV--ISLEDTPNPAPS-CHGCSTAFREAAPAMANTSDGMSPLG 414
Query: 282 YVACPKCKKHFCLECDIYIHESLHNCPGC 310
CP C FC +CD+++H+ +H CPGC
Sbjct: 415 RYRCPDCHNDFCTDCDVFVHDVVHCCPGC 443
>gi|156844855|ref|XP_001645488.1| hypothetical protein Kpol_1004p2 [Vanderwaltozyma polyspora DSM
70294]
gi|156116152|gb|EDO17630.1| hypothetical protein Kpol_1004p2 [Vanderwaltozyma polyspora DSM
70294]
Length = 497
Score = 273 bits (697), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 148/323 (45%), Positives = 197/323 (60%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + F EFFDQNP+SQ+G+V +K+G+A LT + G+P+ HI AL +
Sbjct: 177 DLRPNRNVMSIQYAIDFTHEFFDQNPISQLGIVIMKNGLAQLLTPVSGNPQEHIDALKSI 236
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ G+ SLQNAL++ +GLL +P++ REVLI++ LS+ DPGDI +TIQ E KI
Sbjct: 237 RRQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGGLSSTDPGDIHQTIQSLVEEKI 296
Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R VIGLSA++ IC+ LC+ T + Y + LDE+HFKEL E P P L
Sbjct: 297 RVKVIGLSAQVAICQELCKATNYNDDSFYRIILDETHFKELFDEAVTPLPVNKINKGFTL 356
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E S S C+CH ++ G GY CP C+ +VC LP C C L L+ S HLAR
Sbjct: 357 VKMGFPTRVFEDSPSFCACHSKLVYG-GYFCPNCQNKVCSLPIVCPCCDLMLILSTHLAR 415
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
SYHHL P+ F EV P N P + CF CQ + K G + C CK
Sbjct: 416 SYHHLMPLKTFSEV-PHTENFPTD----NCFSCQIRFPVLRNQKTGELLTSSRYRCTDCK 470
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+ FC++CD++IHE LHNCPGCE+
Sbjct: 471 QDFCIDCDVFIHEVLHNCPGCEA 493
>gi|327263012|ref|XP_003216315.1| PREDICTED: general transcription factor IIH subunit 2-like [Anolis
carolinensis]
Length = 406
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 143/322 (44%), Positives = 202/322 (62%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQ+G++ K A LT+L G+P HI AL
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQVGIIITKGKRAEKLTELAGNPRKHITALKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C+G+ SL NAL+L L +P + REVL+++S+L+TCDP +I + I+ K KI
Sbjct: 135 VDMTCAGEPSLYNALNLATQTLKHMPGHTSREVLVIFSSLTTCDPSNIYDLIKSLKMLKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH++EL+M H PPPA + + ++LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYRELLMHHVSPPPATSG-SESSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ ELP +C++CGL LV
Sbjct: 254 FPQHTIASLTDQDAKPSFSMAHLENSSEPGLTLGGYFCPQCRAKYSELPVECKVCGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F EV PL C GC + +Y+ C +C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEV-PL----EEYTGERYCQGCHGGI----GDQHVYI-CKEC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CDI+ HE+LH CPGC
Sbjct: 364 QSGFCVDCDIFAHETLHCCPGC 385
>gi|321470853|gb|EFX81828.1| hypothetical protein DAPPUDRAFT_317251 [Daphnia pulex]
Length = 405
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 147/339 (43%), Positives = 205/339 (60%), Gaps = 34/339 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R V K +E FV EFFD NP+SQ+G++T K + ++ L G+ + HI+AL M
Sbjct: 77 DLKPTRFLCVLKLLELFVHEFFDLNPISQLGILTTKVKRSEQVSALAGNQKKHIEALQQM 136
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
C G+ SLQN+L+ L +P++ REVL+L+ +L+TCDPGDI +TI+ KE+ I
Sbjct: 137 KDTSCEGEPSLQNSLERAMNGLKNMPAHSSREVLVLFGSLTTCDPGDIQKTIKSLKENNI 196
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R S+IGL+AE+ IC+ + + TGG+Y+V LD+ H KELI+ PPA+A A+L+KMG
Sbjct: 197 RVSIIGLAAEVRICREIAKRTGGTYNVLLDDHHLKELILNQV-QPPAVAGSMEASLVKMG 255
Query: 180 FP-----------QRAGEGSISICSCHKEVKV------GVGYTCPRCKARVCELPTDCRI 222
FP A + + + C CH E + GY CP+C +R CELP +C+
Sbjct: 256 FPGGKPGSSDGASDSAADHAPAYCLCHIENETSCLMGNNSGYNCPQCGSRYCELPVECKQ 315
Query: 223 CGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLY 282
CGL LVS+PHLARSYHHLFPI PF E T + + CF C + P +Y
Sbjct: 316 CGLTLVSAPHLARSYHHLFPIKPFIERTEV-------PEMTHCFACAKPF--GELDPNVY 366
Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGCESLRHSNPIVA 321
C C + CL+CD++I E+LH CPGC S+PI A
Sbjct: 367 -ECENCNQIVCLDCDLFIRETLHTCPGCA----SDPITA 400
>gi|409080664|gb|EKM81024.1| hypothetical protein AGABI1DRAFT_71744 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 452
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 141/332 (42%), Positives = 211/332 (63%), Gaps = 28/332 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGG---SPE---SHI 55
D RP+R ++ + F+ E+FDQNPL QIG+V ++ G+ + DL G +P+ S+I
Sbjct: 124 DMRPTRFDLMLQYAREFILEWFDQNPLGQIGVVGMRAGLGERIGDLSGESGNPQEVLSYI 183
Query: 56 KALMGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCK 115
K KL +G+ SLQNA+++ + +S +P++ RE++I++ +L+TCDPG+I ET++ C
Sbjct: 184 KE-RHKLEPTGEPSLQNAIEMAKASMSHLPTHSSREIIIIFGSLTTCDPGNIHETLETCV 242
Query: 116 ESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP---------A 166
++K+R SV+ L+AEM IC+ L TGG + VA++E HFK+L+ E PPP A
Sbjct: 243 KNKVRISVVALAAEMKICRELSDKTGGQFGVAMNEGHFKDLLFELVPPPAQRALTRTMGA 302
Query: 167 IAEFAIANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGL 225
+ A+L+ MGFP R E S ++C CH E+K G+ CPRC A+VC++PTDC +CGL
Sbjct: 303 PSASLSADLMMMGFPTRLPEASAPTLCVCHAELK-SQGFLCPRCLAKVCDVPTDCHVCGL 361
Query: 226 QLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP---GLY 282
+VSSPHLARSYHHLFP+ P+D V L + + C GC + + + P G+
Sbjct: 362 MIVSSPHLARSYHHLFPVKPYDAVPNL---EDIPQPSDACHGCAKLFPMTTSIPIGEGVS 418
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGC 310
V CP C+ FC +CD+++HE +H CPGC
Sbjct: 419 VVGRYRCPDCRHDFCSDCDVFVHEVIHCCPGC 450
>gi|403214446|emb|CCK68947.1| hypothetical protein KNAG_0B05150 [Kazachstania naganishii CBS
8797]
Length = 469
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 199/323 (61%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + FV EFFD+NP+SQIG+V +++G+A+ ++ + G+P+ H+ AL M
Sbjct: 149 DLRPNRHAMMIQYAIDFVHEFFDENPISQIGIVIMRNGLAHLVSSVSGNPQDHLDALKHM 208
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL+L +GLL +PS+ REVLI++ +LS+ DPGDI +TI + KI
Sbjct: 209 RKQEPKGNPSLQNALELARGLLLPVPSHSTREVLIIFGSLSSTDPGDIHQTIASLVDEKI 268
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R VIGLSA++ +CK LC+ T Y + LDE+HFK+L E P P L
Sbjct: 269 RVKVIGLSAQVAVCKELCKLTNFGDDSFYKILLDETHFKDLFDEAVTPLPVNKINKGFTL 328
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + CSCH ++ G GY CP C +VC LPT C C L L+ S HLAR
Sbjct: 329 VKMGFPTRVFESVPTFCSCHSKLIYG-GYFCPNCHNKVCSLPTVCPCCDLTLILSTHLAR 387
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQS--LLSSGNKPGLYVA----CPKCK 289
SYHHL P+ F+EV + CF CQ+ +L + L + C C+
Sbjct: 388 SYHHLMPLKTFNEV-----DAKETFPTENCFACQRRFPILKNHKSDQLLTSSRYRCDDCR 442
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+ FC++CD++IHE LHNCPGCES
Sbjct: 443 QDFCIDCDVFIHEVLHNCPGCES 465
>gi|410083393|ref|XP_003959274.1| hypothetical protein KAFR_0J00710 [Kazachstania africana CBS 2517]
gi|372465865|emb|CCF60139.1| hypothetical protein KAFR_0J00710 [Kazachstania africana CBS 2517]
Length = 469
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 144/325 (44%), Positives = 202/325 (62%), Gaps = 22/325 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RPSR A++ + FV EFFDQNP+SQIG+V +++G+A ++ + G+P+ HI AL +
Sbjct: 148 DLRPSRHAMMIQYAIDFVHEFFDQNPISQIGIVIMRNGLAQLVSQVSGNPQDHIDALKSI 207
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +PS+ REVLI++ +LS+ DPGD+ +TI +I
Sbjct: 208 RKQEPKGNPSLQNALEMARGLLLPVPSHCTREVLIIFGSLSSTDPGDVHQTINSLVRERI 267
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R VIGLSA++ ICK LC+ T Y++ LDE+HFKEL E P P L
Sbjct: 268 RTRVIGLSAQVAICKELCKQTNYGDESYYNILLDETHFKELFDEAVTPLPVNKINKGFTL 327
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + + CSCH ++ G GY CP C +++C LPT C C L L+ S HLAR
Sbjct: 328 VKMGFPTRIFEDTPTFCSCHSKLIYG-GYFCPNCHSKICSLPTVCPCCDLMLILSTHLAR 386
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRST--CFGCQQSL-LSSGNKPGLYVA-----CPK 287
SYHHL P+ F EV P + + T C+ CQ+ + +K G + C
Sbjct: 387 SYHHLMPLKTFKEV-------PSDETFKTENCYSCQKKFPILKNHKTGNLLTSSRYRCSD 439
Query: 288 CKKHFCLECDIYIHESLHNCPGCES 312
C++ FC++CD++IHE LHNCPGCE+
Sbjct: 440 CQEDFCIDCDLFIHEVLHNCPGCEA 464
>gi|302916479|ref|XP_003052050.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256732989|gb|EEU46337.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 469
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 150/341 (43%), Positives = 202/341 (59%), Gaps = 36/341 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D P+R + + AFVREFF+QNP+SQ+G++ ++DGVA ++D+GG+P H++ L G
Sbjct: 111 DLLPTRYRLTLRYAAAFVREFFEQNPISQLGIIGMRDGVAVRISDVGGNPAEHLERLKGL 170
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G+ SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI ET+ +I
Sbjct: 171 EDQDPQGNPSLQNALEMCRGALFHAPSHGTREVLIIYGALLSSDPGDIHETVGNLITDRI 230
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
R S++GLSA++ IC LC T Y+VA+DE HF+EL + PP AE + A+
Sbjct: 231 RVSIVGLSAQVAICADLCSRTNAGDESQYNVAMDEVHFRELFLAATTPPVTRTAEQSTAS 290
Query: 175 LIKMGFPQRA--GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
L+ MGFP R G S C+CH GY C RC ARVC LP +C CGL L+ S H
Sbjct: 291 LLMMGFPSRTLVAGGDTSYCACHNR-PFREGYLCTRCSARVCRLPAECPACGLTLILSTH 349
Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRS-TCFGCQQSL-------------LSSGNK 278
LARSYHHLFP+ + EV P N +RS CF CQ +G K
Sbjct: 350 LARSYHHLFPLRNWVEV-PWA-----NAARSAACFSCQAPFPEPPKTKAPDKPKEDAGPK 403
Query: 279 PGLYV------ACPKCKKHFCLECDIYIHESLHNCPGCESL 313
P V AC C++HFC++CD++ HE +HNCPGC+S+
Sbjct: 404 PAKGVSESGRYACEVCRQHFCIDCDVFAHEVIHNCPGCQSI 444
>gi|392569356|gb|EIW62529.1| TFIIH basal transcription factor complex subunit SSL1 [Trametes
versicolor FP-101664 SS1]
Length = 452
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 134/331 (40%), Positives = 206/331 (62%), Gaps = 26/331 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R ++ + FV E+FDQNPL QIG+ ++ G+ + ++ G+P+ +KA+
Sbjct: 124 DMRPTRFDLMLQYAREFVTEWFDQNPLGQIGIAGMRGGLGERIGEMSGNPQDVLKAISER 183
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+L +G+ SLQNA+++ + +S +P++ RE+L+++ +L+TCDPG+I +T+ +C + +I
Sbjct: 184 HRLEPNGEPSLQNAIEMARSSMSHLPTHSSREILVIFGSLTTCDPGNIHDTMDECVKDRI 243
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP---------AIAEF 170
R S++ L+AEM IC+ LC TGG +SV ++E HFK+L+ E PPP +
Sbjct: 244 RISIVALAAEMKICRDLCDKTGGQFSVVMNEGHFKDLLFELIPPPAQRAVARTGSGVTAN 303
Query: 171 AIANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
A+L+ MGFP R + S S+C CH ++K G+ CPRC A+VC++PTDC ICGL +VS
Sbjct: 304 PAADLMIMGFPMRLPDTSPPSLCVCHSQMK-SEGFLCPRCLAKVCDVPTDCDICGLMIVS 362
Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSS----------GNKP 279
SPHLARSYHHLFP+ + V L D N S+ C GC + + G P
Sbjct: 363 SPHLARSYHHLFPVKAYQAVN--TLEDTLNPSKQ-CHGCSVAFRGTPSVVAGASGEGMSP 419
Query: 280 GLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
CP+C FC +CD+++H+ +H CPGC
Sbjct: 420 FGRYRCPECHNDFCTDCDVFVHDVVHCCPGC 450
>gi|58331891|ref|NP_001011081.1| general transcription factor IIH, polypeptide 2, 44kDa [Xenopus
(Silurana) tropicalis]
gi|54038231|gb|AAH84471.1| general transcription factor IIH, polypeptide 2, 44kDa [Xenopus
(Silurana) tropicalis]
gi|115530769|emb|CAL49300.1| general transcription factor IIH polypeptide 4, 52kDa [Xenopus
(Silurana) tropicalis]
Length = 393
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 199/322 (61%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIGL+ ++ A LT+L G+P H+ AL
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGLIVTRNKRAEKLTELAGNPRQHLNALKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C+G+ SL N+L+L L +P + RE+L+++S+L+TCDP +I + I+ K SKI
Sbjct: 135 VDMNCNGEPSLYNSLNLALQTLKHMPGHTSREILVIFSSLTTCDPTNIYDIIKCLKASKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG Y V LDESH+KEL+M H PPPA + + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLTRETGGVYHVILDESHYKELLMHHVIPPPA-SSTSECSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ ELP +C++C L LV
Sbjct: 254 FPQHTMGCLSDQDAKPSFSMAHLDNTSEPGLTLGGYFCPQCRAKYSELPVECKVCRLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F EV+ C GC L K CP C
Sbjct: 314 SAPHLARSYHHLFPLDAFKEVSL-----EEYEGERYCRGCDGEL-----KDQQVYICPVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC+ECD++IHESLH CPGC
Sbjct: 364 QCVFCIECDLFIHESLHCCPGC 385
>gi|255711256|ref|XP_002551911.1| KLTH0B02794p [Lachancea thermotolerans]
gi|238933289|emb|CAR21473.1| KLTH0B02794p [Lachancea thermotolerans CBS 6340]
Length = 470
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 195/323 (60%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A+ + FV EFFDQNP+SQIG+V +++ +A ++ + G+P+ HI AL +
Sbjct: 149 DLRPNRYAMTIQYAINFVHEFFDQNPISQIGIVCMRNDLAQLVSQVSGNPQDHIDALKAI 208
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+SSLQNAL++ +GLL +P++ REVLI++ ALS+ DPGDI +TI + +I
Sbjct: 209 RKQEPKGNSSLQNALEMARGLLLHVPAHCTREVLIIFGALSSTDPGDIHQTIASLVQERI 268
Query: 120 RCSVIGLSAEMFICKHLCQDT----GGSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R VIGLSA++ ICK LC+ T Y V L+E HFK+L E P P L
Sbjct: 269 RVRVIGLSAQVAICKELCKQTNYGDNSFYGVILNEVHFKDLFAEAVTPLPVNKINKGFTL 328
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E S C+CH ++ G GY CP CK++VC LPT C C L L+ S HLAR
Sbjct: 329 VKMGFPTRVYEDMPSFCTCHSKLMYG-GYFCPNCKSKVCSLPTVCPCCDLMLILSTHLAR 387
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYV------ACPKCK 289
SYHHL P+ F EV + TC+ CQ+ N+ + C C
Sbjct: 388 SYHHLMPLKTFQEV-----DVAETFPTDTCYSCQKRFPRLKNQKTQELLTSSRYRCADCH 442
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+ FC++CD++IHE LHNCPGCES
Sbjct: 443 QDFCIDCDVFIHEILHNCPGCES 465
>gi|367014707|ref|XP_003681853.1| hypothetical protein TDEL_0E03990 [Torulaspora delbrueckii]
gi|359749514|emb|CCE92642.1| hypothetical protein TDEL_0E03990 [Torulaspora delbrueckii]
Length = 464
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 147/323 (45%), Positives = 200/323 (61%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + FV EFFDQNP+SQ+G+VT+++G+A ++ + G+P+ HI AL +
Sbjct: 143 DLRPNRHAMIVQYAIDFVHEFFDQNPISQLGIVTMRNGLAQLISQISGNPQDHIDALKLI 202
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P++ +EVLI++ +LS+ DPGDI +TI + I
Sbjct: 203 RKQEPKGNPSLQNALEMSRGLLLPVPAHCTKEVLIVFGSLSSTDPGDIHQTINSLVQESI 262
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R VIGLSA++ ICK LC+ T Y V LDE+H KEL + P P L
Sbjct: 263 RVKVIGLSAQVAICKELCKATNYGDESYYQVPLDETHCKELFDQAVVPLPVNKINKGFTL 322
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
++MGFP R E S + C+CH ++ G GY CP C+++VC LPT C C L L+ S HLAR
Sbjct: 323 VRMGFPTRIFEDSPTFCACHSQLMYG-GYFCPNCQSKVCSLPTVCPCCDLMLILSTHLAR 381
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGN-KPGLYVA-----CPKCK 289
SYHHL P+ F EV P P + CF CQ + N K G + C CK
Sbjct: 382 SYHHLMPLKTFTEV-PSVEEFPTD----NCFSCQIRFPTIKNHKTGELLTSSRYRCESCK 436
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
K FC++CD+YIHE LHNCPGCES
Sbjct: 437 KDFCIDCDVYIHEILHNCPGCES 459
>gi|452846260|gb|EME48193.1| hypothetical protein DOTSEDRAFT_69968 [Dothistroma septosporum
NZE10]
Length = 449
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 145/347 (41%), Positives = 203/347 (58%), Gaps = 37/347 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + +AFVREF +QNP+SQ+G+V +KDG+A ++D+ G+PE HIK L
Sbjct: 95 DLRPTRYLLTLTYAKAFVREFIEQNPISQLGIVGMKDGLAIRVSDMSGNPEDHIKILNEY 154
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K +G+ SLQN LD+ + L PS+G REVL++ AL T DPGDI +TI+ C + +I
Sbjct: 155 AKKEPTGNPSLQNGLDMARAALYHTPSHGTREVLVVMGALLTSDPGDIHDTIKACIKDRI 214
Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIAEFA---- 171
R S+IGL+A+M IC +C+ T Y+VA+D+ F+EL+M+ PP + A
Sbjct: 215 RVSIIGLAAQMHICAEICRKTNQGDESCYNVAIDDVDFRELLMKSTTPPVMRSTDADALR 274
Query: 172 --IANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
ANL+ MGFP R E ++C+CH ++ G GY C RC A+VC LP C C L L+
Sbjct: 275 LNQANLLMMGFPSRIVEDKATLCACHGQLTRG-GYLCSRCGAKVCSLPATCPTCDLTLIL 333
Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-----------QQSLLSSGNK 278
S HLARSYHHLFP+ + EV+ + R + C+GC + + L +K
Sbjct: 334 STHLARSYHHLFPLQNWVEVS---WDRARRKGSVQCYGCLAGFPRVPVEYETTQLEVRDK 390
Query: 279 PGLYVA----------CPKCKKHFCLECDIYIHESLHNCPGCESLRH 315
P A C C+ HFC++CD+ HE +HNCPGC S H
Sbjct: 391 PRRRRAEGASESSRYECETCQNHFCIDCDVAAHEMIHNCPGCMSNAH 437
>gi|158293200|ref|XP_314533.4| AGAP010559-PA [Anopheles gambiae str. PEST]
gi|157016844|gb|EAA09916.5| AGAP010559-PA [Anopheles gambiae str. PEST]
Length = 421
Score = 270 bits (691), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 137/329 (41%), Positives = 206/329 (62%), Gaps = 22/329 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKA---L 58
D +P+R K +E F+ EFFDQNP+SQ+G++ +K A +++LGG+ HIKA L
Sbjct: 76 DLKPTRFICSLKLLELFIEEFFDQNPISQLGVIAMKAKRAEKISELGGTSRKHIKAVHAL 135
Query: 59 MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
+ G+ SLQN L+L L IP + RE+L++ +L++CDP D+ TI+ K
Sbjct: 136 TNGVPLVGEPSLQNGLELALKTLRMIPQHASREILVIMGSLTSCDPNDVHLTIENLKTEG 195
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA--IAEFAIANLI 176
+RCSV+ LSAE+ ICK LC +TGG + LD++H+K+ +++H PP A EF+ LI
Sbjct: 196 VRCSVLSLSAEIRICKFLCTETGGLFGAVLDDAHYKDQLLQHIDPPQAGNQQEFS---LI 252
Query: 177 KMGFP----QRAGEGSISICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQ 226
KMGFP + E +++C CH + GY CP+C ++ CELP +C CGL
Sbjct: 253 KMGFPHGKTESGKEPPLTMCMCHIDSVDEPAKLTSGGYHCPQCYSKYCELPVECSACGLT 312
Query: 227 LVSSPHLARSYHHLFPIAPFDE--VTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA 284
L S+PHLARSYHHLFP+ F+E + + + +PR+ + C+ CQ++L S +K +
Sbjct: 313 LASAPHLARSYHHLFPVPHFNELPLEQVQVQEPRDPPVTNCYACQKTLASGTDK--MVYE 370
Query: 285 CPKCKKHFCLECDIYIHESLHNCPGCESL 313
C CK+ FC++CDI+IHE+LH+C GC ++
Sbjct: 371 CDACKQVFCIDCDIFIHETLHSCVGCTTI 399
>gi|6323033|ref|NP_013105.1| Ssl1p [Saccharomyces cerevisiae S288c]
gi|417813|sp|Q04673.1|SSL1_YEAST RecName: Full=Suppressor of stem-loop protein 1; AltName:
Full=General transcription and DNA repair factor IIH
subunit SSL1; Short=TFIIH subunit SSL1
gi|2696|emb|CAA78992.1| suppressor of stem-loop [Saccharomyces cerevisiae]
gi|1360294|emb|CAA97527.1| SSL1 [Saccharomyces cerevisiae]
gi|190406046|gb|EDV09313.1| suppressor of stem-loop protein 1 [Saccharomyces cerevisiae
RM11-1a]
gi|256272337|gb|EEU07321.1| Ssl1p [Saccharomyces cerevisiae JAY291]
gi|259147994|emb|CAY81243.1| Ssl1p [Saccharomyces cerevisiae EC1118]
gi|285813427|tpg|DAA09323.1| TPA: Ssl1p [Saccharomyces cerevisiae S288c]
gi|323308054|gb|EGA61307.1| Ssl1p [Saccharomyces cerevisiae FostersO]
gi|323332472|gb|EGA73880.1| Ssl1p [Saccharomyces cerevisiae AWRI796]
gi|365764293|gb|EHN05817.1| Ssl1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|384214|prf||1905312A SSL1 gene
Length = 461
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + FV EFFDQNP+SQ+G++ +++G+A ++ + G+P+ HI AL +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P++ REVLI++ +LST DPGDI +TI KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R V+GLSA++ ICK LC+ T Y + LDE+H KEL E P P L
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 319
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + + CSCH ++ G GY CP C ++VC LPT C C L L+ S HLAR
Sbjct: 320 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 378
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
SYHHL P+ F EV + RS CF CQ + +K G + C CK
Sbjct: 379 SYHHLMPLKTFAEVP----TTEKFRSED-CFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 433
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+ FC++CD++IHE LHNCPGCES
Sbjct: 434 QEFCVDCDVFIHEILHNCPGCES 456
>gi|207343219|gb|EDZ70748.1| YLR005Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 371
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + FV EFFDQNP+SQ+G++ +++G+A ++ + G+P+ HI AL +
Sbjct: 50 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 109
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P++ REVLI++ +LST DPGDI +TI KI
Sbjct: 110 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 169
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R V+GLSA++ ICK LC+ T Y + LDE+H KEL E P P L
Sbjct: 170 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 229
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + + CSCH ++ G GY CP C ++VC LPT C C L L+ S HLAR
Sbjct: 230 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 288
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
SYHHL P+ F EV + RS CF CQ + +K G + C CK
Sbjct: 289 SYHHLMPLKTFAEVP----TTEKFRSED-CFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 343
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+ FC++CD++IHE LHNCPGCES
Sbjct: 344 QEFCVDCDVFIHEILHNCPGCES 366
>gi|323336560|gb|EGA77826.1| Ssl1p [Saccharomyces cerevisiae Vin13]
Length = 461
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + FV EFFDQNP+SQ+G++ +++G+A ++ + G+P+ HI AL +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P++ REVLI++ +LST DPGDI +TI KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R V+GLSA++ ICK LC+ T Y + LDE+H KEL E P P L
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 319
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + + CSCH ++ G GY CP C ++VC LPT C C L L+ S HLAR
Sbjct: 320 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 378
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
SYHHL P+ F EV + RS CF CQ + +K G + C CK
Sbjct: 379 SYHHLMPLKTFAEVP----TTEKFRSED-CFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 433
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+ FC++CD++IHE LHNCPGCES
Sbjct: 434 QEFCVDCDVFIHEILHNCPGCES 456
>gi|323303957|gb|EGA57737.1| Ssl1p [Saccharomyces cerevisiae FostersB]
Length = 461
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + FV EFFDQNP+SQ+G++ +++G+A ++ + G+P+ HI AL +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P++ REVLI++ +LST DPGDI +TI KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R V+GLSA++ ICK LC+ T Y + LDE+H KEL E P P L
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 319
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + + CSCH ++ G GY CP C ++VC LPT C C L L+ S HLAR
Sbjct: 320 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 378
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
SYHHL P+ F EV + RS CF CQ + +K G + C CK
Sbjct: 379 SYHHLMPLKTFAEVP----TTEKFRSED-CFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 433
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+ FC++CD++IHE LHNCPGCES
Sbjct: 434 QEFCVDCDVFIHEILHNCPGCES 456
>gi|401837686|gb|EJT41582.1| SSL1-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 462
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 195/323 (60%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RPSR A++ + FV EFFDQNP+SQ+G++ +++G+A ++ + G+P+ HI AL +
Sbjct: 141 DLRPSRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 200
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P++ REVLI++ +LST DPGDI +TI KI
Sbjct: 201 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 260
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R V+GLSA++ ICK LC+ T Y + LDE+H KEL E P P L
Sbjct: 261 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 320
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + + CSCH ++ G GY CP C ++VC LPT C C L L+ S HLAR
Sbjct: 321 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 379
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
SYHHL P+ F EV CF CQ + +K G + C CK
Sbjct: 380 SYHHLMPLKTFAEVPT-----TETFRSEDCFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 434
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+ FC++CDI+IHE LHNCPGCES
Sbjct: 435 QEFCVDCDIFIHEILHNCPGCES 457
>gi|395814688|ref|XP_003780876.1| PREDICTED: general transcription factor IIH subunit 2-like
[Otolemur garnettii]
Length = 395
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 141/322 (43%), Positives = 202/322 (62%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHIASLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSIAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R S+IGLSAE+ +C L ++TGG+Y V LDESH+KEL+ + PPPA + +LI+MG
Sbjct: 195 RVSIIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHYVSPPPASLN-SECSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C++CGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNSEPGLTLGGYFCPQCRAKYCELPVECKVCGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + C+GCQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL----EEHHGERFCYGCQGDL----KDQHVYV-CNAC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++++SLH CPGC
Sbjct: 364 QNVFCVDCDVFVYDSLHCCPGC 385
>gi|312384212|gb|EFR28989.1| hypothetical protein AND_02404 [Anopheles darlingi]
Length = 1093
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 137/326 (42%), Positives = 202/326 (61%), Gaps = 19/326 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R K +E F+ EFFDQNP+SQ+G++ +K A +T+L G+ H+KA+
Sbjct: 76 DLKPTRFICSLKLLELFIEEFFDQNPISQLGVIAMKAKRAEKITELSGTARKHVKAVHAL 135
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
+G+ SLQN L+L L +PS+ REVL++ +L+TCDP DI TI+ K IR
Sbjct: 136 NTQLTGEPSLQNGLELALKTLRLVPSHASREVLVIMGSLTTCDPTDIHVTIETLKTEGIR 195
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA--IAEFAIANLIKM 178
CSV+ LSAE+ ICK+LC +TGG+Y+ LD++H+K+ +++H PP A EF +LIKM
Sbjct: 196 CSVVSLSAEIRICKYLCTETGGTYAAVLDDAHYKDQLLQHIDPPQAGNQQEF---SLIKM 252
Query: 179 GFP----QRAGEGSISICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQLV 228
GFP + E +++C CH + GY CP+C ++ CELP +C CGL L
Sbjct: 253 GFPHGKTETGKEPPLTMCMCHIDSTDEPAKLTSGGYHCPQCYSKYCELPVECAACGLTLA 312
Query: 229 SSPHLARSYHHLFPIAPFDEVT-PLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPK 287
S+PHLARS+HHLFP+ F E+T + R + C+ CQ+ L + +K C
Sbjct: 313 SAPHLARSFHHLFPVPHFKELTLEQVQQEAREVPVNYCYACQKPLTVTTDKT--VYECDA 370
Query: 288 CKKHFCLECDIYIHESLHNCPGCESL 313
C+ FC++CDI+IHE+LH+C GC ++
Sbjct: 371 CRMVFCIDCDIFIHETLHSCVGCTTI 396
>gi|392297981|gb|EIW09080.1| Ssl1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 461
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + FV EFFDQNP+SQ+G++ +++G+A ++ + G+P+ HI AL +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P++ REVLI++ +LST DPGDI +TI KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R V+GLSA++ ICK LC+ T Y + LDE+H KEL E P P L
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 319
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + + CSCH ++ G GY CP C ++VC LPT C C L L+ S HLAR
Sbjct: 320 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 378
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
SYHHL P+ F EV + RS CF CQ + +K G + C CK
Sbjct: 379 SYHHLMPLKTFAEVP----TTEKFRSED-CFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 433
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+ FC++CD++IHE LHNCPGCES
Sbjct: 434 QEFCVDCDVFIHEILHNCPGCES 456
>gi|51013237|gb|AAT92912.1| YLR005W [Saccharomyces cerevisiae]
Length = 461
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + FV EFFDQNP+SQ+G++ +++G+A ++ + G+P+ HI AL +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P++ REVLI++ +LST DPGDI +TI KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R V+GLSA++ ICK LC+ T Y + LDE+H KEL E P P L
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 319
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + + CSCH ++ G GY CP C ++VC LPT C C L L+ S HLAR
Sbjct: 320 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 378
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
SYHHL P+ F EV + RS CF CQ + +K G + C CK
Sbjct: 379 SYHHLMPLKTFAEVP----TTEKFRSED-CFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 433
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+ FC++CD++IHE LHNCPGCES
Sbjct: 434 QEFCVDCDVFIHEILHNCPGCES 456
>gi|151941173|gb|EDN59551.1| transcription initiation factor TFIIH subunit [Saccharomyces
cerevisiae YJM789]
Length = 461
Score = 270 bits (690), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 197/323 (60%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + FV EFFDQNP+SQ+G++ +++G+A ++ + G+P+ HI AL +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P++ REVLI++ +LST DPGDI +TI KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R V+GLSA++ ICK LC+ T Y + LDE+H KEL E P P L
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 319
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + + CSCH ++ G GY CP C ++VC LPT C C L L+ S HLAR
Sbjct: 320 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 378
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
SYHHL P+ F EV P P CF CQ + +K G + C CK
Sbjct: 379 SYHHLMPLKTFAEV-PTTEKFP----SEDCFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 433
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+ FC++CD++IHE LHNCPGCES
Sbjct: 434 QEFCVDCDVFIHEILHNCPGCES 456
>gi|349579731|dbj|GAA24892.1| K7_Ssl1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 461
Score = 270 bits (690), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 198/323 (61%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + FV EFFDQNP+SQ+G++ +++G+A ++ + G+P+ HI AL +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQGHIDALKSI 199
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P++ REVLI++ +LST DPGDI +TI KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R V+GLSA++ ICK LC+ T Y + LDE+H KEL E P P L
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKELFNEAVTPLPVNKINKGFTL 319
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + + CSCH ++ G GY CP C ++VC LPT C C L L+ S HLAR
Sbjct: 320 VKMGFPTRIFEDTPTFCSCHSKLVYG-GYFCPNCHSKVCSLPTVCPCCDLMLILSTHLAR 378
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-LSSGNKPGLYVA-----CPKCK 289
SYHHL P+ F EV + RS CF CQ + +K G + C CK
Sbjct: 379 SYHHLMPLKTFAEVP----TTEKFRSED-CFSCQSRFPILKNHKNGKLLTSSRYRCEDCK 433
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+ FC++CD++IHE LHNCPGCES
Sbjct: 434 QEFCVDCDVFIHEILHNCPGCES 456
>gi|452986441|gb|EME86197.1| hypothetical protein MYCFIDRAFT_39800 [Pseudocercospora fijiensis
CIRAD86]
Length = 465
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 143/344 (41%), Positives = 198/344 (57%), Gaps = 37/344 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + AF+RE+F+QNP+SQ+G++ ++DGVA ++D+ G+P HI A+
Sbjct: 107 DLRPTRHLLTISNTVAFIREYFEQNPISQLGILGMRDGVALRVSDMSGNPNVHIAAVRAL 166
Query: 62 LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G G +SLQNALD+ + L PS+G REV+I+ ALS+ DPGDI +TI+ C + KI
Sbjct: 167 RGTDPKGSASLQNALDMARAALYHTPSHGTREVVIILGALSSSDPGDIHDTIKACIKDKI 226
Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIAEFAIAN- 174
R ++IGL+A+M IC + + T Y+VA+DE H++EL M PP A A
Sbjct: 227 RVNIIGLAAQMQICAEIARKTNQGATNCYNVAVDEVHYRELFMGITTPPVVRATDTEAQK 286
Query: 175 -----LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
L+ MGFP R EG ++C+CH + G GY C RCKA+VC LP C C L L+
Sbjct: 287 RNQAALLMMGFPSRIVEGKATLCACHGNLTRG-GYLCSRCKAKVCNLPATCPTCDLTLIL 345
Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQ---SLLSS----------- 275
S HLARSYHHLFP+ +DEV+ + + C+GC SL SS
Sbjct: 346 STHLARSYHHLFPLQNWDEVS---WERAQEKGSIACYGCHSAFPSLWSSTDETNGASHTR 402
Query: 276 -------GNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
G C C+ HFC++CD++ HE HNCPGC+S
Sbjct: 403 TRPKRAEGASESSRYECRTCQNHFCIDCDVFCHEVAHNCPGCQS 446
>gi|19075300|ref|NP_587800.1| transcription factor TFIIH complex subunit Ssl1
[Schizosaccharomyces pombe 972h-]
gi|26400388|sp|O74995.1|TFH47_SCHPO RecName: Full=TFIIH basal transcription factor complex p47 subunit;
AltName: Full=Suppressor of stem-loop protein 1 homolog;
Short=SSL1 homolog
gi|3406059|gb|AAC29144.1| TFIIH subunit p47 [Schizosaccharomyces pombe]
gi|3560161|emb|CAA20673.1| transcription factor TFIIH complex subunit Ssl1
[Schizosaccharomyces pombe]
Length = 421
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 142/321 (44%), Positives = 195/321 (60%), Gaps = 17/321 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF R + K FV EFF+QNP+SQ+ ++ V DG+A+ +TDL G+P+SHI+ L
Sbjct: 103 DFHHKRFDLQIKYASEFVLEFFEQNPISQLSIIGVMDGIAHRITDLHGNPQSHIQKLKSL 162
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
CSG+ SLQNAL++ + LS I S+G REVLI++ ++ + DPGDI +TI IR
Sbjct: 163 RDCSGNFSLQNALEMARASLSHIASHGTREVLIIFGSILSSDPGDIFKTIDALVHDSIRV 222
Query: 122 SVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHA-PPPPAIAEFAIANLI 176
++GL+AE+ ICK +C T S Y V + E HF+EL++E PP A+ A+L+
Sbjct: 223 RIVGLAAEVAICKEICNKTNSSTKNAYGVVISEQHFRELLLESTIPPATDSAKTTDASLV 282
Query: 177 KMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
MGFP + E S+C+CH G G+ CPRCKA+VC LP +C C L L+ S HLARS
Sbjct: 283 MMGFPSKVVEQLPSLCACHSIPSRG-GFHCPRCKAKVCTLPIECPSCSLVLILSTHLARS 341
Query: 237 YHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP------GLYVACPKCKK 290
YHHLFP+ + E+ P N P++ + CF CQ P + ACP CK
Sbjct: 342 YHHLFPLKNWSEI-PWSAN-PKS---THCFACQLPFPKPPVSPFDESTSSMRYACPSCKN 396
Query: 291 HFCLECDIYIHESLHNCPGCE 311
HFCL+CD++ HE LH C GC+
Sbjct: 397 HFCLDCDVFAHEQLHECYGCQ 417
>gi|392578803|gb|EIW71930.1| hypothetical protein TREMEDRAFT_72606 [Tremella mesenterica DSM
1558]
Length = 462
Score = 270 bits (689), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 139/350 (39%), Positives = 204/350 (58%), Gaps = 42/350 (12%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DFRPSR + + + ++V E+FDQNPL QIG++ ++D ++ L +GG+P+ + L K
Sbjct: 112 DFRPSRFELTLQYMRSYVVEWFDQNPLGQIGMIIMRDRLSEVLIPMGGNPQEILSVLTDK 171
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPG---DIMETIQKCKE 116
L SG+ SLQN L + +G +S +PS E+L+++S++ST DP +I + +
Sbjct: 172 RKLEPSGEPSLQNGLVMAKGGMSHLPSTSSLEILVVFSSISTADPDGPINIHQVLSDLSS 231
Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP----------- 165
S++R S+I LS E+ IC+ +CQ TGG + VA+DE H +EL+ E PPP
Sbjct: 232 SRVRTSIISLSGEIKICRQICQRTGGKFGVAMDEDHLRELLWETIPPPAQTQSSNPTTLG 291
Query: 166 ---AIAEFA------------IANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCK 210
A+A + +L+ MGFP R G ++C CH +K G GY CPRC
Sbjct: 292 VRNALAAGGNRVDGDGSRPPPVGDLMVMGFPVRLPPGGETLCVCHGLLKKG-GYLCPRCG 350
Query: 211 ARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQ 270
+++C++PTDC +CGL +VSSPHLARS+ LFP+A +D +T R RS TC+GC +
Sbjct: 351 SKLCDVPTDCEVCGLMVVSSPHLARSFWFLFPVAHYDALTVETTMGDRERSSPTCYGCDR 410
Query: 271 SL----------LSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
S + G P C +C K FC ECD+YIH++LH CPGC
Sbjct: 411 SFKTMDETGSAHVVDGISPTGRYRCARCFKDFCSECDLYIHDTLHTCPGC 460
>gi|31418653|gb|AAH53382.1| General transcription factor II H, polypeptide 2 [Mus musculus]
Length = 396
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 142/324 (43%), Positives = 202/324 (62%), Gaps = 27/324 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKTAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDE+H+KEL+ H P PA + + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDETHYKELLAHHVSPLPA-SSSSECSLIRMG 253
Query: 180 FPQRA--------GEGSISIC-----SCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
FPQ + S S+ S + +G GY CP+C+A+ CELP +C+ICGL
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG-GYFCPQCRAKYCELPVECKICGLT 312
Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
LVS+PHLARSYHHLFP+ F E++ + C+GCQ L +YV C
Sbjct: 313 LVSAPHLARSYHHLFPLDAFQEISL-----EEYKGERFCYGCQGEL----KDQHVYV-CT 362
Query: 287 KCKKHFCLECDIYIHESLHNCPGC 310
C+ FC++CD+++H+SLH CPGC
Sbjct: 363 VCQNVFCVDCDVFVHDSLHCCPGC 386
>gi|444318053|ref|XP_004179684.1| hypothetical protein TBLA_0C03620 [Tetrapisispora blattae CBS 6284]
gi|387512725|emb|CCH60165.1| hypothetical protein TBLA_0C03620 [Tetrapisispora blattae CBS 6284]
Length = 447
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 147/326 (45%), Positives = 199/326 (61%), Gaps = 22/326 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R ++ + FV EFFDQNP+SQ+G++ +++G+A+ ++ + G+P+ HI +L M
Sbjct: 124 DLRPNRYSMSLQYAIDFVHEFFDQNPISQMGVIIMRNGLAHLVSQISGNPQDHIDSLKAM 183
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P + REVLI++ LS+ DPGDI +TI + KI
Sbjct: 184 RKQEPKGNPSLQNALEMARGLLLPVPPHCTREVLIVFGGLSSTDPGDIHQTITSLVQEKI 243
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R +VIGLSA++ IC+ LC T SY V LDE HFKEL + P P L
Sbjct: 244 RATVIGLSAQVAICQELCTSTNYGDSDSYRVILDEVHFKELFSQAVTPLPVNKINKGFTL 303
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + + C+CH ++ G GY CP CK++VC LP C C L L+ S HLAR
Sbjct: 304 VKMGFPTRVFEDTPTFCTCHHKLIYG-GYFCPNCKSKVCSLPIVCPCCDLMLILSTHLAR 362
Query: 236 SYHHLFPIAPFDEVTPLCLND---PRNRSRSTCFGCQQSLLS-SGNKPGLYVA-----CP 286
SYHHL P+ F EV + D P N CF CQ+ NK G + C
Sbjct: 363 SYHHLMPLNTFTEVE--VVKDAAFPTN----NCFSCQKGFPHLVNNKTGELLTSSRYRCG 416
Query: 287 KCKKHFCLECDIYIHESLHNCPGCES 312
+C K FC++CD+YIHE LHNCPGCES
Sbjct: 417 ECHKDFCIDCDVYIHEILHNCPGCES 442
>gi|3327882|dbj|BAA31745.1| SSL1 [Schizosaccharomyces pombe]
Length = 392
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 142/321 (44%), Positives = 195/321 (60%), Gaps = 17/321 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF R + K FV EFF+QNP+SQ+ ++ V DG+A+ +TDL G+P+SHI+ L
Sbjct: 74 DFHHKRFDLQIKYASEFVLEFFEQNPISQLSIIGVMDGIAHRITDLHGNPQSHIQKLKSL 133
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
CSG+ SLQNAL++ + LS I S+G REVLI++ ++ + DPGDI +TI IR
Sbjct: 134 RDCSGNFSLQNALEMARASLSHIASHGTREVLIIFGSILSSDPGDIFKTIDALVHDSIRV 193
Query: 122 SVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHA-PPPPAIAEFAIANLI 176
++GL+AE+ ICK +C T S Y V + E HF+EL++E PP A+ A+L+
Sbjct: 194 RIVGLAAEVAICKEICNKTNSSTKNAYGVVISEQHFRELLLESTIPPATDSAKTTDASLV 253
Query: 177 KMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
MGFP + E S+C+CH G G+ CPRCKA+VC LP +C C L L+ S HLARS
Sbjct: 254 MMGFPSKVVEQLPSLCACHSIPSRG-GFHCPRCKAKVCTLPIECPSCSLVLILSTHLARS 312
Query: 237 YHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP------GLYVACPKCKK 290
YHHLFP+ + E+ P N P++ + CF CQ P + ACP CK
Sbjct: 313 YHHLFPLKNWSEI-PWSAN-PKS---THCFACQLPFPKPPVSPFDESTSSMRYACPSCKN 367
Query: 291 HFCLECDIYIHESLHNCPGCE 311
HFCL+CD++ HE LH C GC+
Sbjct: 368 HFCLDCDVFAHEQLHECYGCQ 388
>gi|395325355|gb|EJF57779.1| TFIIH basal transcription factor complex subunit SSL1 [Dichomitus
squalens LYAD-421 SS1]
Length = 448
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 134/327 (40%), Positives = 206/327 (62%), Gaps = 22/327 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D RP+R ++ + FV E+FDQNPL QIG+V ++ G+ + ++ G+P+ +KA++
Sbjct: 124 DMRPTRFDLMLQYAREFVTEWFDQNPLGQIGIVGMRGGLGERIGEMSGNPQDVLKAILER 183
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KL +G+ SLQNA+++ + +S +P+ RE+++++ +L+TCDPG+I +T+ +C + +I
Sbjct: 184 HKLLPNGEPSLQNAVEMARSSMSHLPTQSSREIVVIFGSLTTCDPGNIHDTLDECVKDRI 243
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP--AIAEFA---IAN 174
R S++ L+AEM +C+ LC TGG + VA++E HFK+L+ E PPP A+A A+
Sbjct: 244 RISIVALAAEMKVCRELCDKTGGQFGVAMNEGHFKDLLFELIPPPAQRAVARTGGGTAAD 303
Query: 175 LIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
L+ MGFP R + S S+C CH ++K G+ CPRC A+VC++PTDC ICGL +VSSPHL
Sbjct: 304 LMIMGFPMRLPDTSPPSLCVCHSQMK-SEGFLCPRCLAKVCDVPTDCDICGLMIVSSPHL 362
Query: 234 ARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSS----------GNKPGLYV 283
ARSYHHLFP+ + VT L D C GC + + G P
Sbjct: 363 ARSYHHLFPVKSYSAVTTL---DDTPEPSKRCHGCAVAFKETPSTMLGASGEGMSPYGRY 419
Query: 284 ACPKCKKHFCLECDIYIHESLHNCPGC 310
C +C FC +CD+++H+ +H CPGC
Sbjct: 420 RCLECHSDFCADCDVFVHDVVHCCPGC 446
>gi|73949709|ref|XP_535266.2| PREDICTED: general transcription factor IIH subunit 2 isoform 1
[Canis lupus familiaris]
Length = 395
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 148/322 (45%), Positives = 205/322 (63%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ PF E+ PL + N R C+GCQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDPFQEI-PL---EEHNGER-FCYGCQGEL----KDQHVYV-CAVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385
>gi|351712123|gb|EHB15042.1| General transcription factor IIH subunit 2, partial [Heterocephalus
glaber]
Length = 392
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 203/322 (63%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+A K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLACTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTALARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDNSSEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+T + C+GCQ L +YV C +C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEMTL-----EEYKGERFCYGCQGQL----KDQHVYV-CAEC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385
>gi|358059749|dbj|GAA94518.1| hypothetical protein E5Q_01170 [Mixia osmundae IAM 14324]
Length = 458
Score = 269 bits (687), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/341 (40%), Positives = 206/341 (60%), Gaps = 37/341 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R+ + K FV E+FDQNP+SQ+ ++ KDG+A ++ L G+P H++AL K
Sbjct: 122 DLRPTRLELTIKYAIEFVNEYFDQNPISQMAILVTKDGIAERISPLSGNPVDHVRALESK 181
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L SG+ SLQN L++ + L+ +PS+G REV+I++ +L+TCDPG+I T++ + ++
Sbjct: 182 RKLDPSGEPSLQNVLEMARSGLAHLPSHGSREVVIIFGSLTTCDPGNIHTTMEALVKDRV 241
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP----------AIAE 169
R +++GLSAEM ICK +C+ T G Y V +E+H+K+L+ E PPP + +
Sbjct: 242 RVNIVGLSAEMSICKEVCKRTKGVYGVVTNEAHYKDLLFELVPPPATHRPEAGPDGSRSN 301
Query: 170 FAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
A+L++MGFP + ++CSCH +K G+ CPRC +R+CE+PT+C IC L +VS
Sbjct: 302 NTGADLMQMGFPSLSSSTFGALCSCHSRLKT-TGFNCPRCASRLCEVPTECGICSLTVVS 360
Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSR-STCFGCQ-------QSLLSSGNKP-- 279
SPHLARSY HLFP+ + EV P S + CF C S ++G P
Sbjct: 361 SPHLARSYRHLFPVKNYLEVNN---GQPGEDSWPAECFACSLKFSTVAVSAATTGAPPTA 417
Query: 280 ----------GLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
G Y CP+C HFC++CD++ HE L CPGC
Sbjct: 418 ASTASRISSTGRY-ECPRCHNHFCIDCDLHAHEVLGVCPGC 457
>gi|147906745|ref|NP_001085329.1| general transcription factor IIH, polypeptide 2, 44kDa [Xenopus
laevis]
gi|49256020|gb|AAH71091.1| MGC81060 protein [Xenopus laevis]
Length = 395
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 199/322 (61%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIGL+ ++ A LT+L G+P HI A+
Sbjct: 75 DLKPNRLTCTLKLLEFFVEEYFDQNPISQIGLIVTRNKRAEKLTELAGNPRQHINAMKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ CSG+ SL N+L+L L +P + RE+L+++S+L+TCDP +I + I+ K SK+
Sbjct: 135 VDMTCSGEPSLYNSLNLALQTLKHMPGHTSREILVIFSSLTTCDPTNIYDMIKCLKASKV 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG Y V LDESH+KEL+M H PPPA + + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLTRETGGVYHVILDESHYKELLMHHVIPPPA-SSSSECSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+CKA+ ELP +C++C L LV
Sbjct: 254 FPQHTMGCLSDQDAKPSFSMAHLDNTSEPGLTLGGYFCPQCKAKYSELPVECKVCRLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F EV + R C GC L K C C
Sbjct: 314 SAPHLARSYHHLFPLDAFKEVRLEEYDGER-----YCRGCDGEL-----KDQQVYVCTVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC+ECD+++HESLH CPGC
Sbjct: 364 QCVFCIECDLFVHESLHCCPGC 385
>gi|410948792|ref|XP_003981114.1| PREDICTED: general transcription factor IIH subunit 2 [Felis catus]
Length = 366
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 143/309 (46%), Positives = 198/309 (64%), Gaps = 24/309 (7%)
Query: 15 VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG--KLGCSGDSSLQN 72
+E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L + C G+ SL N
Sbjct: 59 LEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYN 118
Query: 73 ALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFI 132
+L + L +P + REVLI++S+L+TCDP +I + I+ K +KIR SVIGLSAE+ +
Sbjct: 119 SLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKATKIRVSVIGLSAEVRV 178
Query: 133 CKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRA-------- 184
C L ++TGG+Y V LDESH+KEL+ H PPPA + + +LI+MGFPQ
Sbjct: 179 CTVLARETGGTYHVILDESHYKELLTHHVSPPPA-SSGSECSLIRMGFPQHTIASLSDQD 237
Query: 185 GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
+ S S+ + G+ GY CP+C+A+ CELP +C+ICGL LVS+PHLARSYHHLF
Sbjct: 238 AKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLF 297
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
P+ F E+ PL + N R C+GCQ L +YV C C+ FC++CD+++H
Sbjct: 298 PLDAFQEI-PL---EEHNGER-FCYGCQGEL----KDQHVYV-CAVCQNVFCVDCDVFVH 347
Query: 302 ESLHNCPGC 310
+SLH CPGC
Sbjct: 348 DSLHCCPGC 356
>gi|392865317|gb|EAS31097.2| transcription factor ssl1 [Coccidioides immitis RS]
Length = 471
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 141/334 (42%), Positives = 202/334 (60%), Gaps = 31/334 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV E+F+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 118 DVRPTRYLLTLRYAQEFVLEYFEQNPISQLGIIGMRDGLAVRISDMSGNPSEHILAIQAL 177
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+G SLQNAL++ +G L PS+G REVLI++ AL + DPGDI +TI KI
Sbjct: 178 RTRDPTGLPSLQNALEMARGALFHTPSHGTREVLIIFGALLSSDPGDIHKTITSLVTDKI 237
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
R V+GL+AE+ IC+ +C T Y VAL+E HF+EL+ME PP ++ A+ +
Sbjct: 238 RVGVVGLAAEVAICREICSKTNAGDDSGYGVALNEQHFRELMMEITTPPVTRSQKQAVNS 297
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S S+C+CH + G GY C RC +++C LP +C CGL L+ S HLA
Sbjct: 298 LLMMGFPSRTVESSQSLCACHSKPSRG-GYLCSRCGSKICTLPAECPACGLTLILSTHLA 356
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRS--RSTCFGC----------QQSLLSSGNKPGLY 282
RSYHHLFP+ + EV P N++ S CF C Q G G+
Sbjct: 357 RSYHHLFPLINWVEV-------PWNKASISSNCFACGNPFPPVPVRAQWETRGGTVKGMS 409
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
V+ C C HFC++CD++ HE +HNCPGC+S
Sbjct: 410 VSSRYECTVCHNHFCIDCDVFAHEIVHNCPGCQS 443
>gi|346716173|ref|NP_001231240.1| general transcription factor IIH subunit 2 [Sus scrofa]
Length = 395
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 205/322 (63%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C+G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCNGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+AR CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRARYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + N R C+GCQ L +Y+ C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL---EEHNGER-FCYGCQGEL----KDQHVYI-CSVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QHVFCVDCDVFVHDSLHCCPGC 385
>gi|355693895|gb|AER99488.1| proteinral transcription factor IIH, polypeptide 2, 44kDa [Mustela
putorius furo]
Length = 393
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 202/322 (62%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 74 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 133
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 134 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKSAKI 193
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 194 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 252
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 253 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 312
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + C+GCQ L +YV C C
Sbjct: 313 SAPHLARSYHHLFPLDAFQEI-PL----EEHNGEKFCYGCQGEL----KDQHVYV-CAVC 362
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 363 QNVFCVDCDVFVHDSLHCCPGC 384
>gi|332233766|ref|XP_003266076.1| PREDICTED: general transcription factor IIH subunit 2 isoform 1
[Nomascus leucogenys]
Length = 395
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 202/322 (62%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRAGEG--------SISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTNASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ L + N R C+GCQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI----LLEEHNGER-FCYGCQGEL----KDQHVYV-CAVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385
>gi|402224004|gb|EJU04067.1| TFIIH basal transcription factor complex subunit SSL1 [Dacryopinax
sp. DJM-731 SS1]
Length = 440
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 133/331 (40%), Positives = 199/331 (60%), Gaps = 29/331 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RPSR + FV E+FDQNPL Q+G+V +++GV + D+ G+P+ +K + K
Sbjct: 116 DMRPSRFDLTLSYAREFVSEWFDQNPLGQMGVVGMRNGVGERIGDMSGNPQDVLKCFLDK 175
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L +G+ SLQN++++ + ++ +P++ RE+LI++ +L+T DPG++++T+ C +I
Sbjct: 176 TLLEPAGEPSLQNSIEMARASMNHLPTHSSREILIIFGSLTTVDPGNLLDTVDACVRDQI 235
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPP---------PAIAEF 170
R S++ LSAEM +C+ +C+ TGG + VAL+E H+K+L+ E PPP A A
Sbjct: 236 RISIVALSAEMKVCRDMCERTGGVFGVALNEGHYKDLLFELVPPPADKSPVSRLAAQAGE 295
Query: 171 AIANLIKMGFPQR-AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
+ L++MGFP R S+C CH ++ GY CPRC ++VC++PTDC ICGL +VS
Sbjct: 296 GQSELMQMGFPTRLPATAPPSLCVCHSRLR-SEGYICPRCNSKVCDVPTDCDICGLMIVS 354
Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP---------- 279
SPHLARSYHHLFP + V L + S C C ++ P
Sbjct: 355 SPHLARSYHHLFPAPAYKAVLEL-----DGKEASACHACSLPFPTTARAPLQTEEGMSPH 409
Query: 280 GLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
G Y CPKCK F ECD+++H+ LH CPGC
Sbjct: 410 GRY-RCPKCKHDFDAECDLFVHDVLHVCPGC 439
>gi|328858838|gb|EGG07949.1| hypothetical protein MELLADRAFT_35246 [Melampsora larici-populina
98AG31]
Length = 455
Score = 266 bits (681), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 138/339 (40%), Positives = 207/339 (61%), Gaps = 39/339 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R+ + + FV EFFDQNP+SQ+ + KD +A L+ L G+P H KAL K
Sbjct: 124 DLRPNRLEMSLNYAQEFVTEFFDQNPISQMCIFITKDAIAERLSPLSGNPVDHHKALSNK 183
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L SG+ SLQNAL++ + L+ +P +G +E L++ +L+TCDP DI +TI+K ++ ++
Sbjct: 184 RKLEPSGEPSLQNALEMARATLAHLPPHGSKETLVIMGSLTTCDPEDIHKTIEKLEKDRM 243
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAI------- 172
R S++GLSAE+ ICK +C+ T G Y V ++E+HFKEL+ E PPPP++ +
Sbjct: 244 RVSIVGLSAEINICKEICRKTKGKYGVIMNETHFKELLFESIPPPPSLVNSSTSQSNRST 303
Query: 173 ----ANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
++L++MGFP + S+C+CH ++K G+ CPRC +++C++PTDC ICGL +V
Sbjct: 304 NTTSSDLMQMGFPTKLSNPIPSLCACHSKLK-SSGFICPRCGSKLCDVPTDCLICGLTVV 362
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRS---RSTCFGCQQSL------------- 272
SSPHLARSY HLFP+A + E+ D N ++C+GC Q
Sbjct: 363 SSPHLARSYRHLFPVANWKEI------DSENHEIECPTSCYGCSQVFSTSSTTTTTTTTT 416
Query: 273 -LSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
+ N+ ++ C +CK FC CD++ HE L CPGC
Sbjct: 417 TTTITNQSSKFM-CNRCKHIFCSTCDLF-HEQLGLCPGC 453
>gi|50288221|ref|XP_446539.1| hypothetical protein [Candida glabrata CBS 138]
gi|49525847|emb|CAG59466.1| unnamed protein product [Candida glabrata]
Length = 439
Score = 266 bits (681), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 142/323 (43%), Positives = 200/323 (61%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ +Q FV EFFDQNP+SQ+G+VT+++G+A ++ + G+P+ HI AL +
Sbjct: 118 DLRPNRHAMIIQQAIDFVHEFFDQNPISQMGIVTMRNGLAQLVSQVSGNPQDHIDALKSI 177
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P++ REVLI++ +LS+ DPGDI +TI + +I
Sbjct: 178 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSSNDPGDIHQTIASLVQEQI 237
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R V+GLSAE+ ICK LC+ T Y V LDE+H K+L E P P L
Sbjct: 238 RVKVLGLSAEVAICKELCKQTNMGDTSFYQVLLDETHLKKLFDEAVTPLPVNKINRGFTL 297
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + C+CH ++ G GY CP C ++VC LPT C C L L+ S HLAR
Sbjct: 298 VKMGFPTRIFEEQPTFCACHAKLIYG-GYFCPNCNSKVCSLPTVCPCCDLMLILSTHLAR 356
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQS--LLSSGNKPGLYVA----CPKCK 289
S+HHL P+ F EV P + P C+ CQ ++ S + + C +CK
Sbjct: 357 SFHHLMPLKTFVEV-PAAESFP----TENCYSCQMKFPIIKSQKSNQMLTSSRYRCEECK 411
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+C++CD++IHESLH CPGCES
Sbjct: 412 NDYCIDCDVFIHESLHTCPGCES 434
>gi|296422371|ref|XP_002840734.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295636957|emb|CAZ84925.1| unnamed protein product [Tuber melanosporum]
Length = 469
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 144/335 (42%), Positives = 208/335 (62%), Gaps = 32/335 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DFRP+R + + FVRE+F+QNP+SQ+G++ ++DG+A ++DL G+P HI+A+ GK
Sbjct: 122 DFRPTRYLLSLRYACDFVREYFEQNPISQLGVLGMRDGLAVRISDLSGNPVDHIEAI-GK 180
Query: 62 LGCS---GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
L G SLQNAL++ + L PS+G REV+I++ AL + DPGDI +TI E K
Sbjct: 181 LRLDEGKGAPSLQNALEMSRAALFHAPSHGTREVVIVFGALLSSDPGDIHQTISHLVEDK 240
Query: 119 IRCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHA-PPPPAIAEFAIA 173
IR SV+GLSA++ +C+ LC+ T G Y VALD+ HF+EL+++ PP A+ + +
Sbjct: 241 IRVSVVGLSAQVAVCRELCKRTNAGDEGVYGVALDDKHFRELLLDTTIPPVTRSAKMSAS 300
Query: 174 NLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
+L+ MGFP R E + S+C+CH + G GY C RC +VC LP +C CGL L+ S HL
Sbjct: 301 SLLMMGFPSRMVEKTASLCACHSTLTKG-GYLCSRCSTKVCALPMECPACGLTLILSTHL 359
Query: 234 ARSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGCQ---------------QSLLSSGN 277
ARSYHHLFP+ + EV P + S+ST C+ CQ + ++G
Sbjct: 360 ARSYHHLFPLKNWVEV-PW-----KAASKSTHCYACQIPFPHPPAKPEELSAPAKTATGT 413
Query: 278 KPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
AC C HFC++CD++ HE+L+NCP C+S
Sbjct: 414 SVSSRYACTICHNHFCIDCDVFCHETLYNCPACQS 448
>gi|242008465|ref|XP_002425024.1| TFIIH basal transcription factor complex p44 subunit, putative
[Pediculus humanus corporis]
gi|212508673|gb|EEB12286.1| TFIIH basal transcription factor complex p44 subunit, putative
[Pediculus humanus corporis]
Length = 401
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 139/328 (42%), Positives = 204/328 (62%), Gaps = 24/328 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGG--SPESHIKALM 59
D P+R K +E F+ EFF+QNP++Q+G++ ++ A L+DL G + I +
Sbjct: 76 DLSPNRQLCTVKLLEGFIDEFFEQNPIAQMGVIITRNKRAEKLSDLAGVSKRQKEIIKSI 135
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G L C+G SLQNAL+L L P++ REV+ + + L+TCDPG++ +TI+ KE I
Sbjct: 136 GDLACTGQPSLQNALELAGKSLKLRPTHASREVIAIIANLTTCDPGNVADTIRFMKEENI 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVIGL+AE+++ K L ++T G+YSV LD+ HFK + + PP + + A+LIKMG
Sbjct: 196 RCSVIGLAAEVYVYKTLTKETKGTYSVILDDVHFKNQLFQQIDPPAMTSNLS-ASLIKMG 254
Query: 180 FPQRA---GEGSISICSCH---KEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPH 232
FP + + S+ +C CH EVK+ GY CP+C A+ CELP +C++C L LVS+PH
Sbjct: 255 FPHHSLHDDKSSLGLCMCHLDDSEVKLKSDGYKCPQCAAKYCELPVECKVCSLTLVSAPH 314
Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHF 292
LAR++HHLFP+ F+ D +N + CFGCQ+ S LY C KC++ F
Sbjct: 315 LARTFHHLFPVQSFERF------DGKN-AEGYCFGCQKKF--SEGDTYLY-KCNKCEQRF 364
Query: 293 CLECDIYIHESLHNCPGCESLRHSNPIV 320
C CDI++H+SL CPGC +NP++
Sbjct: 365 CGVCDIFVHDSLRTCPGCA----TNPLL 388
>gi|195377775|ref|XP_002047663.1| GJ11798 [Drosophila virilis]
gi|194154821|gb|EDW70005.1| GJ11798 [Drosophila virilis]
Length = 442
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 137/330 (41%), Positives = 202/330 (61%), Gaps = 24/330 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E F+ EFFDQNP+SQ+G++ +K A +T+L G+ H+KAL G
Sbjct: 78 DLKPTRLLCTLKLLEIFIEEFFDQNPISQLGIIALKAKRAEKVTELTGTSRVHLKALAGL 137
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ + + SLQN LDL L +PS+ RE++I+ +L+TCDP DI +TI + K +I
Sbjct: 138 ANVSLTSEPSLQNGLDLALKTLKVVPSHASREIVIIMGSLTTCDPVDINQTIDELKRERI 197
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVI LSAE+ IC++L Q T G++ +D++H+++ ++ PPPA A+ +LI+MG
Sbjct: 198 RCSVISLSAEIHICRYLTQQTMGTFGAVMDDAHYRDQLLAQVDPPPA-AKTQDNSLIRMG 256
Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP E +S+C CH E G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHSKNEVEGKDAPLSMCMCHIENLDEPSELSTTGHHCPQCNSKYCELPVECQCCGLTLV 316
Query: 229 SSPHLARSYHHLFPI-----APFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYV 283
S+PHLARSYHHLFP+ PF+ V P P S C+GC + L + +K
Sbjct: 317 SAPHLARSYHHLFPVPNFEELPFESVPPAA---PDCGSLRKCYGCIKVLNAQTDKS--VF 371
Query: 284 ACPKCKKHFCLECDIYIHESLHNCPGCESL 313
C C + FC++CDI+IHE+LH C GC ++
Sbjct: 372 KCGYCNQFFCIDCDIFIHETLHACVGCNTI 401
>gi|301760269|ref|XP_002915938.1| PREDICTED: general transcription factor IIH subunit 2-like
[Ailuropoda melanoleuca]
Length = 395
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 202/322 (62%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQ+G++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQVGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + C+GCQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL----EEHNGEKFCYGCQGEL----KDQHVYV-CAVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QTVFCVDCDVFVHDSLHCCPGC 385
>gi|393217828|gb|EJD03317.1| TFIIH basal transcription factor complex, subunit SSL1 [Fomitiporia
mediterranea MF3/22]
Length = 481
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 135/363 (37%), Positives = 203/363 (55%), Gaps = 57/363 (15%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + AF+ E+FDQNPL QIG++ ++ G+A + ++ +P +K++ +
Sbjct: 120 DMRPTRFDLALSTARAFIVEWFDQNPLGQIGVIGMRSGIAERIGEMSSNPHDVLKSIQDR 179
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L +G+ SLQNA+++ + +S +P++ RE++IL+ +L+T DPG+I ET+ C +I
Sbjct: 180 HRLEPAGEPSLQNAVEMARSTMSHLPTHSSREIVILFGSLTTVDPGNIHETLDACIRDRI 239
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIA----------- 168
R +++ L+AEM IC+ +C TGG + V+L+E HF++L+ E PPP A
Sbjct: 240 RINLVALAAEMKICRDMCGKTGGQFGVSLNEGHFRDLLFELIPPPAQRALSRAAGAGASA 299
Query: 169 -----EFAIANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRI 222
A+L+ MGFP R E S ++C CH E+K G+ CPRC A+VC++PTDC +
Sbjct: 300 SAGSGGNPAADLMMMGFPTRLPEASPPALCVCHSELK-SAGFVCPRCAAKVCDVPTDCDV 358
Query: 223 CGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-------------- 268
CGL +VSSPHLARSYHHLFP+ P+ VT LN + C GC
Sbjct: 359 CGLMIVSSPHLARSYHHLFPVKPYQAVTDTELNS--TTASRACHGCASPFPPVSSGTSTS 416
Query: 269 ---------------------QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNC 307
++ S G P CP CK FC ECD+++H+ +H C
Sbjct: 417 TNPVTGSAAGTGAAEGASAAGTSTVTSDGMSPLGRYRCPDCKNDFCTECDVFVHDVVHCC 476
Query: 308 PGC 310
PGC
Sbjct: 477 PGC 479
>gi|348569038|ref|XP_003470305.1| PREDICTED: general transcription factor IIH subunit 2-like [Cavia
porcellus]
Length = 395
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 202/322 (62%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 IDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTALARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIAALSDQDAKPSFSMAHLDSSSEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + C+GCQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL----EEYKGERFCYGCQGQL----KDQHVYV-CTVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385
>gi|440631747|gb|ELR01666.1| hypothetical protein GMDG_00042 [Geomyces destructans 20631-21]
Length = 491
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 137/336 (40%), Positives = 202/336 (60%), Gaps = 31/336 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + + +F+ EFF+QNP+SQ+G++ ++DG+A ++ L G+P H+ AL GK
Sbjct: 124 DLRPTRYLLTLRHATSFISEFFEQNPISQLGILGMRDGLAKPISPLSGTPTVHLGAL-GK 182
Query: 62 LGCS---GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
L G SLQNAL++ + L PS+G REVLI+ AL + DPGDI TI +
Sbjct: 183 LRTQDPQGSPSLQNALEMARASLFHAPSHGTREVLIISGALLSSDPGDIHTTISSLTSDR 242
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIA-EFAIANLIK 177
IR S IGL+A++ I +C T G YSVAL E HF+ L+M PPP A E ++L+
Sbjct: 243 IRVSAIGLAAQVAILSEICTKTKGDYSVALHEEHFRALLMGATTPPPTRAKEQNQSSLLM 302
Query: 178 MGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSY 237
MGFP R ++++C+CH ++ G GY C RC ++VC LP +C CGL L+ S HLARSY
Sbjct: 303 MGFPSRTVATAVTLCACHSKLTRG-GYLCSRCGSKVCSLPAECPACGLTLILSTHLARSY 361
Query: 238 HHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC---------------------QQSLLSSG 276
HHLFP+ + EV+ ++ +CF C +++++ +
Sbjct: 362 HHLFPLRNWREVS---WSEAAGSGVGSCFACGVGFNKPPPRQVVEGKKEGGRREAMMKAV 418
Query: 277 NKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
++ G Y AC C +HFC++CD++ HE +HNCPGC+S
Sbjct: 419 SESGRY-ACEVCGEHFCIDCDVFAHEVVHNCPGCQS 453
>gi|281337668|gb|EFB13252.1| hypothetical protein PANDA_003972 [Ailuropoda melanoleuca]
Length = 364
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 202/322 (62%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQ+G++ K A LT+L G+P HI +L
Sbjct: 44 DLKPNRLTCTLKLLEYFVEEYFDQNPISQVGIIVTKSKRAEKLTELSGNPRKHITSLKKA 103
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 104 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 163
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 164 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 222
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 223 FPQHTIASLSDQDAKPSFSMAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 282
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + C+GCQ L +YV C C
Sbjct: 283 SAPHLARSYHHLFPLDAFQEI-PL----EEHNGEKFCYGCQGEL----KDQHVYV-CAVC 332
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 333 QTVFCVDCDVFVHDSLHCCPGC 354
>gi|386780666|ref|NP_001247509.1| general transcription factor IIH subunit 2 [Macaca mulatta]
gi|297675411|ref|XP_002815673.1| PREDICTED: general transcription factor IIH subunit 2 isoform 2
[Pongo abelii]
gi|297675413|ref|XP_002815674.1| PREDICTED: general transcription factor IIH subunit 2 isoform 3
[Pongo abelii]
gi|402871770|ref|XP_003899824.1| PREDICTED: general transcription factor IIH subunit 2 isoform 1
[Papio anubis]
gi|402871772|ref|XP_003899825.1| PREDICTED: general transcription factor IIH subunit 2 isoform 2
[Papio anubis]
gi|355691370|gb|EHH26555.1| General transcription factor IIH polypeptide 2 [Macaca mulatta]
gi|355749973|gb|EHH54311.1| General transcription factor IIH polypeptide 2 [Macaca
fascicularis]
gi|380785669|gb|AFE64710.1| general transcription factor IIH subunit 2 [Macaca mulatta]
gi|383411171|gb|AFH28799.1| general transcription factor IIH subunit 2 [Macaca mulatta]
gi|384940950|gb|AFI34080.1| general transcription factor IIH subunit 2 [Macaca mulatta]
Length = 395
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 201/322 (62%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ N R C+GCQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEIPLEEYNGER-----FCYGCQGEL----KDQHVYV-CAVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385
>gi|195592457|ref|XP_002085951.1| GD15058 [Drosophila simulans]
gi|194197960|gb|EDX11536.1| GD15058 [Drosophila simulans]
Length = 438
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 138/331 (41%), Positives = 204/331 (61%), Gaps = 21/331 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ K +E F+ EFFDQNP+SQ+GL+ +K A +T+L G+ H+KAL +
Sbjct: 78 DLKPTRLRCTVKLLELFIEEFFDQNPISQLGLIALKAKRAEKVTELTGTSRVHLKALESL 137
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ + + SLQN LDL L +PS+ RE++I+ +L+TCDP DI TI + K+ I
Sbjct: 138 ANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIMGSLTTCDPVDINLTIDELKKEGI 197
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVI LSAE+ + ++L Q T G++ LD++HF++ +M PPPA A+ +LI+MG
Sbjct: 198 RCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256
Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP E +S+C CH E G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHHCPQCNSKYCELPVECQSCGLTLV 316
Query: 229 SSPHLARSYHHLFPIAPFDEV----TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA 284
S+PHLARSYHHLFP+ F+E+ P +D + R C+GC ++L +K V
Sbjct: 317 SAPHLARSYHHLFPVPNFEELPFEAMPASSSDFTSDVRE-CYGCAKALGQGVDKVADKVV 375
Query: 285 --CPKCKKHFCLECDIYIHESLHNCPGCESL 313
C CK+ FC++CDI+IH++LH C GC ++
Sbjct: 376 FRCGFCKQFFCVDCDIFIHDTLHACVGCNTI 406
>gi|342888788|gb|EGU88007.1| hypothetical protein FOXB_01490 [Fusarium oxysporum Fo5176]
Length = 468
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 151/355 (42%), Positives = 206/355 (58%), Gaps = 41/355 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D P+R ++ AFVREFF+QNP+SQ+G++ ++DGVA ++D+GG+P H++ L G
Sbjct: 107 DLLPTRYRLMLSYAAAFVREFFEQNPISQLGIIGMRDGVAVRISDVGGNPTEHLEKLKGL 166
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G+ SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI ETI +I
Sbjct: 167 ENEDPQGNPSLQNALEMCRGALFHAPSHGTREVLIIYGALLSSDPGDIHETIGNLITDRI 226
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
R S++GLSA++ IC LC T Y++A+DE HF+EL + PP AE + A+
Sbjct: 227 RVSIVGLSAQVAICADLCSRTNAGDESQYNIAMDEVHFRELFLAATTPPVTRTAEQSTAS 286
Query: 175 LIKMGFPQRA--GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
L+ MGFP R G+ S C+CH GY C RC ARVC +P +C C L L+ S H
Sbjct: 287 LLMMGFPSRTLVPNGTTSYCACHNR-PFREGYLCTRCGARVCRIPAECPACDLTLILSTH 345
Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNR-SRS-TCFGC-------------QQSLLSSG- 276
LARSYHHLFP+ + EV P + SRS CF C +S SG
Sbjct: 346 LARSYHHLFPLRNWVEV-------PWTKASRSAACFSCLAPFPEPPKGKAPDKSREDSGA 398
Query: 277 -------NKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRHSNPIVANEG 324
++ G Y AC C +HFC++CD++ HE +HNCPGC+SL A+ G
Sbjct: 399 PKTAKGVSESGRY-ACEVCGQHFCIDCDVFAHEVVHNCPGCQSLLSKTDAAASSG 452
>gi|412993157|emb|CCO16690.1| predicted protein [Bathycoccus prasinos]
Length = 452
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 147/336 (43%), Positives = 195/336 (58%), Gaps = 30/336 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R AV+ + FVREFF+QNPLSQ+GL+ KDG A +T+L GSPE+H+KA+
Sbjct: 112 DMRPNRSAVLLPLMIKFVREFFNQNPLSQLGLIACKDGKAERITELSGSPETHVKAIKKA 171
Query: 62 L---GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
G G SLQN L+ L +P +G RE++ + S+LSTCDPG+I ++++K K+ K
Sbjct: 172 FASDGVGGSFSLQNGLEQAMEGLRDVPPFGAREIIAIISSLSTCDPGNINDSVRKVKKMK 231
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
R +VI ++AE + K L ++T G +SV+LD+SH LIME APPP + E A L++M
Sbjct: 232 ARTNVICVAAETRVFKKLSEETKGKFSVSLDQSHLTRLIMECAPPPALLLETAKPALVEM 291
Query: 179 GFPQRA------GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
GFP+R G G Y CPRC+AR ELP+ C C L LVSSPH
Sbjct: 292 GFPRREPRKFGLGAGDEDDRDVLTIGPRNGEYRCPRCEARAEELPSQCGTCQLSLVSSPH 351
Query: 233 LARSYHHLFPIAPFDEVT------------------PLCLNDPRNRSRSTCFGCQQSLLS 274
LARSYHHLFP+ PF EV +D + CFGC + +
Sbjct: 352 LARSYHHLFPVMPFVEVKVDGDSKNEEEEEKKRKKNSGKDDDDDGQFLRECFGCCVVVDA 411
Query: 275 SGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
S G+ C KC+K FC CD+YIH+ LHNC GC
Sbjct: 412 S---TGMLSKCLKCEKEFCFACDVYIHDRLHNCVGC 444
>gi|6681762|ref|NP_001506.1| general transcription factor IIH subunit 2 [Homo sapiens]
gi|17380326|sp|Q13888.1|TF2H2_HUMAN RecName: Full=General transcription factor IIH subunit 2; AltName:
Full=Basic transcription factor 2 44 kDa subunit;
Short=BTF2 p44; AltName: Full=General transcription
factor IIH polypeptide 2; AltName: Full=TFIIH basal
transcription factor complex p44 subunit
gi|496609|emb|CAA82910.1| basic transcripion factor 2, 44 kD subunit [Homo sapiens]
gi|5531809|gb|AAD44479.1| basic transcription factor 2 [Homo sapiens]
gi|146327202|gb|AAI41604.1| General transcription factor IIH, polypeptide 2, 44kDa [synthetic
construct]
gi|147898053|gb|AAI40304.1| General transcription factor IIH, polypeptide 2, 44kDa [synthetic
construct]
gi|208968221|dbj|BAG73949.1| General transcription factor IIH subunit 2 [synthetic construct]
Length = 395
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 204/322 (63%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSIAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + N R C+GCQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL---EEYNGER-FCYGCQGEL----KDQHVYV-CAVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385
>gi|114599829|ref|XP_001158018.1| PREDICTED: uncharacterized protein LOC461830 isoform 9 [Pan
troglodytes]
gi|114599831|ref|XP_001158068.1| PREDICTED: uncharacterized protein LOC461830 isoform 10 [Pan
troglodytes]
gi|114599833|ref|XP_001158128.1| PREDICTED: uncharacterized protein LOC461830 isoform 11 [Pan
troglodytes]
gi|397470434|ref|XP_003806827.1| PREDICTED: general transcription factor IIH subunit 2 [Pan
paniscus]
gi|397470436|ref|XP_003806828.1| PREDICTED: general transcription factor IIH subunit 2 [Pan
paniscus]
gi|397470438|ref|XP_003806829.1| PREDICTED: general transcription factor IIH subunit 2 [Pan
paniscus]
gi|410207616|gb|JAA01027.1| general transcription factor IIH, polypeptide 2C [Pan troglodytes]
gi|410258220|gb|JAA17077.1| general transcription factor IIH, polypeptide 2C [Pan troglodytes]
gi|410299712|gb|JAA28456.1| general transcription factor IIH, polypeptide 2, 44kDa [Pan
troglodytes]
gi|410350531|gb|JAA41869.1| general transcription factor IIH, polypeptide 2C [Pan troglodytes]
gi|410350533|gb|JAA41870.1| general transcription factor IIH, polypeptide 2D [Pan troglodytes]
Length = 395
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 201/322 (62%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P H+ +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHVTSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ N R C+GCQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEIPLEEYNGER-----FCYGCQGEL----KDQHVYV-CAVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385
>gi|320040921|gb|EFW22854.1| RNA polymerase TFIIH complex subunit Ssl1 [Coccidioides posadasii
str. Silveira]
Length = 471
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 140/334 (41%), Positives = 201/334 (60%), Gaps = 31/334 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV E+F+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 118 DVRPTRYLLTLRYAQEFVLEYFEQNPISQLGIIGMRDGLAVRISDMSGNPSEHILAIQAL 177
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+G SLQNAL++ +G L PS+G REVLI++ L + DPGDI +TI KI
Sbjct: 178 RTRDPTGLPSLQNALEMARGALFHTPSHGTREVLIIFGTLLSSDPGDIHKTITSLVADKI 237
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
R V+GL+AE+ IC+ +C T Y VAL+E HF+EL+ME PP ++ A+ +
Sbjct: 238 RVGVVGLAAEVAICREICSKTNAGDDSGYGVALNEQHFRELMMEITTPPVTRSQKQAVNS 297
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S S+C+CH + G GY C RC +++C LP +C CGL L+ S HLA
Sbjct: 298 LLMMGFPSRTVEPSQSLCACHSKPSRG-GYLCSRCGSKICTLPAECPACGLTLILSTHLA 356
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRS--RSTCFGC----------QQSLLSSGNKPGLY 282
RSYHHLFP+ + EV P N++ S CF C Q G G+
Sbjct: 357 RSYHHLFPLINWVEV-------PWNKASISSNCFACGNPFPPVPVRAQWETRGGTVKGMS 409
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
V+ C C HFC++CD++ HE +HNCPGC+S
Sbjct: 410 VSSRYECTVCHNHFCIDCDVFAHEIVHNCPGCQS 443
>gi|21356299|ref|NP_649427.1| Ssl1, isoform A [Drosophila melanogaster]
gi|442634242|ref|NP_001262227.1| Ssl1, isoform B [Drosophila melanogaster]
gi|10726963|gb|AAF51879.2| Ssl1, isoform A [Drosophila melanogaster]
gi|15010516|gb|AAK77306.1| GH08526p [Drosophila melanogaster]
gi|220945098|gb|ACL85092.1| Ssl1-PA [synthetic construct]
gi|220952514|gb|ACL88800.1| Ssl1-PA [synthetic construct]
gi|440216208|gb|AGB94920.1| Ssl1, isoform B [Drosophila melanogaster]
Length = 438
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 138/331 (41%), Positives = 204/331 (61%), Gaps = 21/331 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ K +E F+ EFFDQNP+SQ+GL+ +K A +T+L G+ H+KAL +
Sbjct: 78 DLKPTRLRCTVKLLELFIEEFFDQNPISQLGLIALKAKRAEKVTELTGTSRVHLKALESL 137
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ + + SLQN LDL L +PS+ RE++I+ +L+TCDP DI TI + K+ I
Sbjct: 138 ANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIMGSLTTCDPVDINLTIDELKKEGI 197
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVI LSAE+ + ++L Q T G++ LD++HF++ +M PPPA A+ +LI+MG
Sbjct: 198 RCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256
Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP E +S+C CH E G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHHCPQCNSKYCELPVECQSCGLTLV 316
Query: 229 SSPHLARSYHHLFPIAPFDEV----TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA 284
S+PHLARSYHHLFP+ F+E+ P +D + R C+GC ++L +K V
Sbjct: 317 SAPHLARSYHHLFPVPNFEELPFEAMPASSSDLTSDVRE-CYGCAKALGQGVDKVADKVV 375
Query: 285 --CPKCKKHFCLECDIYIHESLHNCPGCESL 313
C CK+ FC++CDI+IH++LH C GC ++
Sbjct: 376 FRCGFCKQFFCVDCDIFIHDTLHACVGCNTI 406
>gi|303319293|ref|XP_003069646.1| TFIIH basal transcription factor complex p47 subunit , putative
[Coccidioides posadasii C735 delta SOWgp]
gi|240109332|gb|EER27501.1| TFIIH basal transcription factor complex p47 subunit , putative
[Coccidioides posadasii C735 delta SOWgp]
Length = 471
Score = 264 bits (675), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 140/334 (41%), Positives = 201/334 (60%), Gaps = 31/334 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV E+F+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 118 DVRPTRYLLTLRYAQEFVLEYFEQNPISQLGIIGMRDGLAVRISDMSGNPSEHILAIQAL 177
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+G SLQNAL++ +G L PS+G REVLI++ L + DPGDI +TI KI
Sbjct: 178 RTRDPTGLPSLQNALEMARGALFHTPSHGTREVLIIFGTLLSSDPGDIHKTITSLVADKI 237
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
R V+GL+AE+ IC+ +C T Y VAL+E HF+EL+ME PP ++ A+ +
Sbjct: 238 RVGVVGLAAEVAICREICSKTNAGDDSGYGVALNEQHFRELMMEITTPPVTRSQKQAVNS 297
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S S+C+CH + G GY C RC +++C LP +C CGL L+ S HLA
Sbjct: 298 LLMMGFPSRTVEPSQSLCACHSKPSRG-GYLCSRCGSKICTLPAECPACGLTLILSTHLA 356
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRS--RSTCFGC----------QQSLLSSGNKPGLY 282
RSYHHLFP+ + EV P N++ S CF C Q G G+
Sbjct: 357 RSYHHLFPLINWVEV-------PWNKASISSNCFACGNPFPPVPVRAQWETRGGTVKGMS 409
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
V+ C C HFC++CD++ HE +HNCPGC+S
Sbjct: 410 VSSRYECTVCHNHFCIDCDVFAHEIVHNCPGCQS 443
>gi|158258154|dbj|BAF85050.1| unnamed protein product [Homo sapiens]
Length = 395
Score = 264 bits (675), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 204/322 (63%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKEA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHLSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + N R C+GCQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL---EEYNGER-FCYGCQGEL----KDQHVYV-CAVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385
>gi|254566501|ref|XP_002490361.1| Component of the core form of RNA polymerase transcription factor
TFIIH [Komagataella pastoris GS115]
gi|238030157|emb|CAY68080.1| Component of the core form of RNA polymerase transcription factor
TFIIH [Komagataella pastoris GS115]
gi|328350755|emb|CCA37155.1| Suppressor of stem-loop protein 1 [Komagataella pastoris CBS 7435]
Length = 447
Score = 264 bits (675), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 140/334 (41%), Positives = 201/334 (60%), Gaps = 31/334 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A+ FV FFDQNP+SQ+G+V +++G+A ++++ G+P +H++AL +
Sbjct: 118 DLRPNRAALTISNAIEFVNNFFDQNPISQLGIVLMRNGIATLVSEVSGNPNNHVEALRSI 177
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+L +GD SLQNA ++ +GLL PS+ RE+L+++ AL T DPGDI TI + KI
Sbjct: 178 RRLEPTGDPSLQNAFEMARGLLLHTPSHSTREILMIFGALFTSDPGDIHTTIDSLVKEKI 237
Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPP----PPAIAEFA 171
R +IGL+A++ ICK +C+ T Y+V L+E HFK+L+++ P A E
Sbjct: 238 RVKIIGLTAQVAICKEICRRTNYGDDTLYAVILNEGHFKDLLLDAVTPLAISKSAQVEMR 297
Query: 172 IANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
LIK GFP R E S C+CH ++ G GY CP CK++VC LPT C C L L+ S
Sbjct: 298 GFTLIKAGFPTRISEAIPSFCACHSKLIHG-GYICPGCKSKVCMLPTVCPCCNLMLILST 356
Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-----------QQSLLSSGNKPG 280
HLARSYHHLFP+ F EV PL + + CF C ++S + K
Sbjct: 357 HLARSYHHLFPLKLFQEV-PLA----ESYISTNCFSCLTEFPQGMSESERSNSDNRAKKE 411
Query: 281 LYVA----CPKCKKHFCLECDIYIHESLHNCPGC 310
+ + CP C KHFC++CD+++H+ LHNCPGC
Sbjct: 412 FHTSSRYQCPDCNKHFCVDCDVFVHDVLHNCPGC 445
>gi|109809737|ref|NP_001035955.1| general transcription factor IIH subunit 2-like protein [Homo
sapiens]
gi|148922852|ref|NP_001092198.1| general transcription factor IIH subunit 2-like protein [Homo
sapiens]
gi|74737138|sp|Q6P1K8.1|T2H2L_HUMAN RecName: Full=General transcription factor IIH subunit 2-like
protein; AltName: Full=General transcription factor IIH
polypeptide 2-like protein
gi|40353046|gb|AAH64557.1| GTF2H2D protein [Homo sapiens]
gi|40674450|gb|AAH65021.1| General transcription factor IIH, polypeptide 2D [Homo sapiens]
gi|76779844|gb|AAI05981.1| GTF2H2C protein [Homo sapiens]
gi|189054850|dbj|BAG37691.1| unnamed protein product [Homo sapiens]
gi|219521708|gb|AAI71860.1| General transcription factor IIH, polypeptide 2D [Homo sapiens]
gi|312152202|gb|ADQ32613.1| general transcription factor IIH, polypeptide 2, 44kDa [synthetic
construct]
Length = 395
Score = 264 bits (675), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 204/322 (63%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKEA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHLSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + N R C+GCQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEI-PL---EEYNGER-FCYGCQGEL----KDQHVYV-CAVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 364 QNVFCVDCDVFVHDSLHCCPGC 385
>gi|195496832|ref|XP_002095860.1| GE22646 [Drosophila yakuba]
gi|194181961|gb|EDW95572.1| GE22646 [Drosophila yakuba]
Length = 438
Score = 264 bits (675), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 137/331 (41%), Positives = 205/331 (61%), Gaps = 21/331 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ K +E F+ EFFDQNP+SQ+G++ +K A +T+L G+ H+KAL +
Sbjct: 78 DLKPTRLRCTVKLLELFIEEFFDQNPISQLGIIALKAKRAEKVTELTGTSRVHLKALESL 137
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ + + SLQN LDL L +PS+ RE++I+ +L+TCDP DI TI + K+ I
Sbjct: 138 ANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIMGSLTTCDPVDINITIDELKKEGI 197
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVI LSAE+ + ++L Q T G++ LD++HF++ +M PPPA A+ +LI+MG
Sbjct: 198 RCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256
Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP E +S+C CH E G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHHCPQCNSKYCELPVECQSCGLTLV 316
Query: 229 SSPHLARSYHHLFPIAPFDEV----TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA 284
S+PHLARSYHHLFP+ F+E+ P +D + R C+ C ++L S +K V
Sbjct: 317 SAPHLARSYHHLFPVPNFEELPFEAMPASSSDLTSGVRE-CYACAKALGQSVDKVADKVV 375
Query: 285 --CPKCKKHFCLECDIYIHESLHNCPGCESL 313
C CK++FC++CDI+IH++LH C GC ++
Sbjct: 376 YRCGFCKQYFCVDCDIFIHDTLHACVGCNTI 406
>gi|194876451|ref|XP_001973778.1| GG16285 [Drosophila erecta]
gi|190655561|gb|EDV52804.1| GG16285 [Drosophila erecta]
Length = 438
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 137/331 (41%), Positives = 202/331 (61%), Gaps = 21/331 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ K +E F+ EFFDQNP+SQ+G++ +K A +T+L G+ H+KAL +
Sbjct: 78 DLKPTRLRCTVKLLELFIEEFFDQNPISQLGIIALKAKRAEKVTELTGTSRVHLKALESL 137
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ + + SLQN LDL L +PS+ RE++I+ +L+TCDP DI TI + K I
Sbjct: 138 ANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIMGSLTTCDPVDINITIDELKREGI 197
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVI LSAE+ + ++L Q T G++ LD++HF++ +M PPPA A+ +LI+MG
Sbjct: 198 RCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256
Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP E +S+C CH E G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHHCPQCNSKYCELPVECQSCGLTLV 316
Query: 229 SSPHLARSYHHLFPIAPFDEV----TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA 284
S+PHLARSYHHLFP+ F+E+ P +D N R C+ C ++L +K V
Sbjct: 317 SAPHLARSYHHLFPVPNFEELPFEAMPASSSDLNNGVRE-CYACAKALGQGVDKAADKVV 375
Query: 285 --CPKCKKHFCLECDIYIHESLHNCPGCESL 313
C CK+ FC++CDI+IH++LH C GC ++
Sbjct: 376 FRCGFCKQFFCVDCDIFIHDTLHACVGCNTI 406
>gi|398406773|ref|XP_003854852.1| hypothetical protein MYCGRDRAFT_37615 [Zymoseptoria tritici IPO323]
gi|339474736|gb|EGP89828.1| hypothetical protein MYCGRDRAFT_37615 [Zymoseptoria tritici IPO323]
Length = 445
Score = 264 bits (674), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 140/344 (40%), Positives = 195/344 (56%), Gaps = 34/344 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + AF+REFF+QNP+SQ+G++ +++G+A ++D+ G+P HI A+
Sbjct: 100 DLRPTRHLLTITYTIAFIREFFEQNPISQLGILGMREGLAIRVSDMSGNPNDHIAAVRAL 159
Query: 62 LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G G+ SLQN LD+ + L PS+G REV+I+ AL T DPGDI +TI+ C + +I
Sbjct: 160 RGTDPKGNPSLQNGLDMARAALYHTPSHGTREVVIILGALLTSDPGDIHDTIKACIKDRI 219
Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIAEFAIAN- 174
R ++IGL+A+M IC +C+ T Y+VA+DE HF+EL+M PP + A A
Sbjct: 220 RVTIIGLAAQMHICAEICRKTNAGDDNCYNVAVDEVHFRELLMGITTPPVVRSTDAEAQK 279
Query: 175 -----LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
L+ MGFP R E ++C+CH + G GY C RCKA+VC LP C C L L+
Sbjct: 280 LNQAALLMMGFPSRIVEEHATLCACHGNLTRG-GYLCSRCKAKVCSLPATCPTCDLTLIL 338
Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC------------------QQS 271
S HLARSYHHLFP+ + EV+ +R CFGC S
Sbjct: 339 STHLARSYHHLFPLQNWVEVS---WQRAADRGSEQCFGCLSPFPRVHGATNGDDADKTYS 395
Query: 272 LLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRH 315
+ G C C+ HFC++CD + HE+ HNCPGC S H
Sbjct: 396 KRAEGASESSRYECESCQNHFCIDCDCFNHETSHNCPGCTSSTH 439
>gi|336272171|ref|XP_003350843.1| hypothetical protein SMAC_02512 [Sordaria macrospora k-hell]
gi|380095007|emb|CCC07509.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 466
Score = 264 bits (674), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 153/355 (43%), Positives = 200/355 (56%), Gaps = 51/355 (14%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D P+R VV FVRE+F+QNP+SQ+G+V ++DG+A ++DL G+P HI+ LM
Sbjct: 94 DLLPNRYRVVLNNAIGFVREYFEQNPISQLGIVGMRDGIAVRISDLSGNPAEHIEKLMQW 153
Query: 61 --KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
+ G+SSLQNAL + +G L Q PS+ REVLI+Y AL + DPGDI +TI +
Sbjct: 154 SEQQDPQGNSSLQNALQMCRGALYQTPSHATREVLIIYGALVSIDPGDIHDTINDLVADR 213
Query: 119 IRCSVIGLSAEMFICKHLCQDTG---GSYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
IR SV+GL+ ++ IC LC+ T G+YSVA+DE H KEL PP E A+
Sbjct: 214 IRVSVVGLAGQVAICTELCKRTNNHDGNYSVAIDEVHLKELFFAATTPPVTRTPEQNTAS 273
Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
L+ MGFP R A + +S C+CH + GYTCPRC +VC LP DC IC L L+ S H
Sbjct: 274 LLMMGFPSRTLAPKDHVSYCACHAK-PTREGYTCPRCGIKVCRLPIDCPICKLTLIQSTH 332
Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGC----------------------- 268
LARSYHHLFP+ F EV P SRST C+ C
Sbjct: 333 LARSYHHLFPLKVFVEV-PWS-----QASRSTACYSCLSPFPTGPRDTVAAAPAVLGGRA 386
Query: 269 ----QQSLLSSGNKPGL-------YVACPKCKKHFCLECDIYIHESLHNCPGCES 312
+Q KP L AC C HFC++CD++ HE++HNCPGC+S
Sbjct: 387 GKGGKQDSDPKNPKPELKGVSESGRYACQVCGNHFCIDCDVFAHETIHNCPGCQS 441
>gi|255948596|ref|XP_002565065.1| Pc22g11160 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592082|emb|CAP98404.1| Pc22g11160 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 450
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 205/343 (59%), Gaps = 34/343 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FVREFF+QNP+SQ+G++ ++DG+A ++ + G+P H+ A+ +
Sbjct: 101 DLRPTRYLLTLRYAQEFVREFFEQNPISQLGVLGLRDGLALRVSAMSGNPTEHLTAIQTL 160
Query: 60 GKLGCSGDSSLQNALDLVQGLL---SQIPSYGHREVLILYSALSTCDPGDIMETIQKCKE 116
G SLQN L++ +G L S PS+G REVL +Y +L + DPGDI TI
Sbjct: 161 KTQDPKGLPSLQNGLEMARGALFEFSHTPSHGTREVLFIYGSLLSSDPGDIHRTIDTLIS 220
Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEFAI 172
KIR ++GL A++ IC+ LC T G +Y VAL+E HF+EL+M+ PP A ++
Sbjct: 221 DKIRVGIVGLGAQVAICRELCTKTNGGDETAYGVALNEQHFRELMMDVTTPPAAYSQKES 280
Query: 173 AN-LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
A+ L+ MGFP R E S S+C+CH + G GY C RC ++VC LP +C CGL L+ S
Sbjct: 281 ASSLLMMGFPSRTVESSPSLCACHSKPSRG-GYLCSRCNSKVCSLPAECPSCGLTLILST 339
Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGCQQSLLS----------SGNKPG 280
HLARSYHHLFP+ + EV P R SRS+ CF C S S G
Sbjct: 340 HLARSYHHLFPLINWVEV-PW-----RRASRSSACFACGLSFPSVPPEDHWQTAENQTKG 393
Query: 281 LYVA----CPKCKKHFCLECDIYIHESLHNCPGCESLRHSNPI 319
+ V+ C C HFC++CD++ HE +HNCPGC+S SNP+
Sbjct: 394 MSVSSRYECTACHNHFCIDCDLFAHEVVHNCPGCQS--RSNPL 434
>gi|195348862|ref|XP_002040966.1| GM22477 [Drosophila sechellia]
gi|194122476|gb|EDW44519.1| GM22477 [Drosophila sechellia]
Length = 438
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 138/332 (41%), Positives = 204/332 (61%), Gaps = 23/332 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ K +E F+ EFFDQNP+SQ+GL+ +K A +T+L G+ H+KAL +
Sbjct: 78 DLKPTRLRCTVKLLELFIEEFFDQNPISQLGLIALKAKRAEKVTELTGTSRVHLKALESL 137
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ + + SLQN LDL L +PS+ RE++I+ +L+TCDP DI TI + K+ I
Sbjct: 138 ANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIMGSLTTCDPVDINLTIDELKKEGI 197
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVI LSAE+ + ++L Q T G++ LD++HF++ +M PPPA A+ +LI+MG
Sbjct: 198 RCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256
Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP E +S+C CH E G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHHCPQCNSKYCELPVECQSCGLTLV 316
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRST-----CFGCQQSLLSSGNKPGLYV 283
S+PHLARSYHHLFP+ F+E+ + P + S T C+GC ++L +K V
Sbjct: 317 SAPHLARSYHHLFPVPNFEELPFEAM--PASSSEFTSDVRECYGCAKALGQGVDKVADKV 374
Query: 284 A--CPKCKKHFCLECDIYIHESLHNCPGCESL 313
C CK+ FC++CDI+IH++LH C GC ++
Sbjct: 375 VFRCGFCKQFFCVDCDIFIHDTLHACVGCNTI 406
>gi|121717593|ref|XP_001276096.1| transcription factor and DNA repair complex, core TFIIH, putative
[Aspergillus clavatus NRRL 1]
gi|119404294|gb|EAW14670.1| transcription factor and DNA repair complex, core TFIIH, putative
[Aspergillus clavatus NRRL 1]
Length = 456
Score = 263 bits (673), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 139/333 (41%), Positives = 202/333 (60%), Gaps = 29/333 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FVREFF+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 106 DLRPTRYLLTLRYAQEFVREFFEQNPISQLGVLGLRDGLAIRISDMSGNPTEHISAIQAL 165
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQN L + +G L PS+G REV I++ +L + DPGDI TI KI
Sbjct: 166 RDHDAKGLPSLQNGLGMARGALFHTPSHGTREVFIVFGSLLSSDPGDIYRTIATLVNDKI 225
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
R ++GL+A++ IC+ LC T G +Y VAL+E HF+EL+M+ PP ++ + ++
Sbjct: 226 RVRIVGLAAQVAICRELCSRTNGGDDTTYGVALNEQHFRELMMDVTTPPVTYSQKQSTSS 285
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S+C+CH + G GY+C RC ++VC LP +C CGL L+ S HLA
Sbjct: 286 LLMMGFPSRTVESCPSLCACHSKPSCG-GYSCSRCNSKVCGLPAECPSCGLTLILSTHLA 344
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGC----------QQSLLSSGNKPGLYV 283
RSYHHLFP+ + EV P + SRS+ CF C Q + G+ V
Sbjct: 345 RSYHHLFPLMNWVEV-PW-----QQASRSSACFACGVYFPAVPPNDQWQTTESQAKGMSV 398
Query: 284 A----CPKCKKHFCLECDIYIHESLHNCPGCES 312
+ C CK HFC++CD++ HE +HNCPGC+S
Sbjct: 399 SSRYECTVCKNHFCIDCDLFAHEVVHNCPGCQS 431
>gi|289743109|gb|ADD20302.1| RNA polymerase II transcription initiation [Glossina morsitans
morsitans]
Length = 419
Score = 263 bits (673), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 140/324 (43%), Positives = 200/324 (61%), Gaps = 22/324 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ + +E F+ EFFDQNP+SQ+GL+ +K A +T L G+ + H+ AL +
Sbjct: 78 DLKPTRLLCTLRLLEIFIEEFFDQNPVSQMGLIVLKSKRAEKITALTGTAKLHLNALESL 137
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K+ SG+ SLQN LDL L +PS+ RE+L++ ++L+TCDP DI TI K I
Sbjct: 138 SKISLSGEPSLQNGLDLALKSLKVVPSHASREILVIMASLTTCDPVDINVTIDILKNEGI 197
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVI LSAE+++C+HL Q T G Y LD++HF++ ++ + PP A ++ +LIKMG
Sbjct: 198 RCSVISLSAEIYVCRHLTQQTVGLYGAVLDDAHFRDQLLAYIDPPAA-SQTQENSLIKMG 256
Query: 180 FPQ-RAGEGS---ISICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQLVS 229
FP RA E +S+C CH E GY CP+C ++ CELP +C+ CGL LVS
Sbjct: 257 FPHARADESKDPPLSMCMCHIENPEEPSKLTSGGYHCPQCFSKYCELPVECQTCGLTLVS 316
Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCK 289
+PHLARSYHHLFP+ F E+ N + C+ CQ+ L S NK C C
Sbjct: 317 APHLARSYHHLFPVQHFAELA-------YNGQAAVCYACQRPLSESTNK--FVYRCEICT 367
Query: 290 KHFCLECDIYIHESLHNCPGCESL 313
+ +C +CDI+IH++LH C GC ++
Sbjct: 368 QIYCCDCDIFIHDTLHTCVGCNTI 391
>gi|126314992|ref|XP_001364912.1| PREDICTED: general transcription factor IIH subunit 2-like
[Monodelphis domestica]
Length = 395
Score = 263 bits (673), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 149/330 (45%), Positives = 202/330 (61%), Gaps = 27/330 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHIASLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L L L +P + REVL+L+S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSLAMQTLKHMPGHTSREVLVLFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGAYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ ELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSSTEPGLTLGGYFCPQCRAKYSELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F EV+ N R C GC L +Y+ C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEVSLEEYNGER-----FCHGC----LGELKDQHVYI-CTVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGCESLRHSNP 318
+ FC++CD++IHESLH CPGC H NP
Sbjct: 364 QNVFCVDCDLFIHESLHCCPGC---IHKNP 390
>gi|440793605|gb|ELR14784.1| Ssl1like protein [Acanthamoeba castellanii str. Neff]
Length = 360
Score = 263 bits (673), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 134/310 (43%), Positives = 188/310 (60%), Gaps = 49/310 (15%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +PSR+AV K VE+F+RE+F+QNP++Q+G+V ++DG A L++LGG+P HI L K
Sbjct: 97 DMKPSRLAVTLKIVESFIREYFEQNPIAQLGIVIMRDGAAEKLSELGGNPARHIGVLKDK 156
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
+ +P YG RE+++L++ L+TCDP DI TI K K+ IRC
Sbjct: 157 --------------------AHVPKYGSREIVVLFAGLATCDPTDIKSTISKLKKDSIRC 196
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
S++GLSAE+ IC+ L + T G+Y VAL E H+++L+ H PPPA + ++LI+MGFP
Sbjct: 197 SIVGLSAEIHICRVLAESTDGTYGVALHEGHYRQLVFAHVSPPPAQPQQVASSLIRMGFP 256
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
R + S CSCHK ++ GY C A+ CELP +C +CGL LVSS HLARSYHHLF
Sbjct: 257 TRKRDHFPSFCSCHKSLRRE-GY----CGAKCCELPMECPVCGLTLVSSSHLARSYHHLF 311
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
P+ F Q+ S+ + +Y ACP+C + FC ECD++ H
Sbjct: 312 PLPLF-----------------------QAGSSTTSGAEVY-ACPRCARSFCYECDVFAH 347
Query: 302 ESLHNCPGCE 311
+ LH C GCE
Sbjct: 348 DLLHTCAGCE 357
>gi|347831479|emb|CCD47176.1| similar to transcription factor Zn, C2H2 [Botryotinia fuckeliana]
Length = 478
Score = 263 bits (673), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 141/335 (42%), Positives = 205/335 (61%), Gaps = 32/335 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHI---KAL 58
D RP+R + + FV E+F+QNP+SQ+G++ ++DG+A ++D+ G+P HI KAL
Sbjct: 122 DMRPTRYLLTIRYASEFVTEYFEQNPISQLGIIGMRDGIAVRISDMSGNPTEHIERLKAL 181
Query: 59 MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
G G+ SLQNAL++ + L PS+G RE+LI+Y AL + DPGDI ETI +
Sbjct: 182 RVDQG-QGNPSLQNALEMSRAALFHAPSHGTREILIIYGALLSSDPGDIHETISSLITDR 240
Query: 119 IRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIA 173
IR S++GL+A++ IC LC T +YSVAL+E HF++L+M PP + + +
Sbjct: 241 IRVSIVGLAAQVAICAELCSRTNAGDDTAYSVALNEEHFRQLMMATTTPPVTRTKKQSQS 300
Query: 174 NLIKMGFPQRAGE--GSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
+L+ MGFP R + S+S C+CH ++ G GY C RC+++VC LP +C CGL L+ S
Sbjct: 301 SLLMMGFPSRTLDPGKSMSFCACHGKLSRG-GYLCSRCESKVCSLPAECPACGLTLILST 359
Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL--------------LSSGN 277
HLARSYHHLFP+ + EV L +S+ C+GCQ + L +
Sbjct: 360 HLARSYHHLFPLRNWAEV----LWKDAGKSKG-CYGCQVTFPQRDEHEKSASEQPLKGMS 414
Query: 278 KPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
+ G Y AC C HFC++CD++ HE +HNCPGC+S
Sbjct: 415 ESGRY-ACEVCGNHFCIDCDVFAHEVVHNCPGCQS 448
>gi|45188211|ref|NP_984434.1| ADR338Cp [Ashbya gossypii ATCC 10895]
gi|44983055|gb|AAS52258.1| ADR338Cp [Ashbya gossypii ATCC 10895]
gi|374107649|gb|AEY96557.1| FADR338Cp [Ashbya gossypii FDAG1]
Length = 469
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 145/328 (44%), Positives = 194/328 (59%), Gaps = 28/328 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D RP+R A+ + FV FFDQNP+SQ+ +V +++G+A ++ + G+P+ HI+AL
Sbjct: 149 DLRPNRHAMTVQYAIDFVHNFFDQNPISQLCIVAMRNGMAQLVSQVSGNPQEHIEALKAV 208
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +PS+ REVLI++ ALS+ DPGDI +TI I
Sbjct: 209 RKQEPKGNPSLQNALEMARGLLLHVPSHCTREVLIVFGALSSTDPGDIHQTIASLANEHI 268
Query: 120 RCSVIGLSAEMFICKHLCQDT----GGSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R VIGLSA++ ICK LC+ T Y V L+E+HFK+L E P P L
Sbjct: 269 RTRVIGLSAQVAICKELCKQTNYGDNSYYGVILNETHFKDLFAEAVVPLPVNKMNKGFTL 328
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + S C+CH + G GY CP CK++VC LP C C L L+ S HLAR
Sbjct: 329 VKMGFPTRIFEDTPSFCTCHSRLVHG-GYFCPNCKSKVCSLPIVCPCCDLMLILSTHLAR 387
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQ-----------QSLLSSGNKPGLYVA 284
SYHHL P+ F E+ P+ + P CF CQ Q LL+S
Sbjct: 388 SYHHLMPLKVFQEL-PVDIQFP----TENCFSCQKKFPRLRNYKTQDLLTSSR-----YR 437
Query: 285 CPKCKKHFCLECDIYIHESLHNCPGCES 312
C CK FC++CD+++HE LHNCPGCES
Sbjct: 438 CENCKCDFCIDCDVFVHEILHNCPGCES 465
>gi|145242846|ref|XP_001393996.1| TFIIH basal transcription factor complex p47 subunit [Aspergillus
niger CBS 513.88]
gi|134078553|emb|CAK40474.1| unnamed protein product [Aspergillus niger]
gi|350640268|gb|EHA28621.1| hypothetical protein ASPNIDRAFT_212323 [Aspergillus niger ATCC
1015]
Length = 457
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 134/333 (40%), Positives = 204/333 (61%), Gaps = 29/333 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FVREFF+QNP+SQ+G++ +KDG+A ++D+ G+P HI A+ +
Sbjct: 108 DLRPTRYLLSLRYAQEFVREFFEQNPISQLGVLGLKDGLAVRVSDMSGNPTEHISAIQSL 167
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQN ++ +G L PS+G RE+ +++ +L + DPGDI +TI KI
Sbjct: 168 RDQDPKGLPSLQNGFEMARGALFHTPSHGTREIFVIFGSLLSSDPGDIHQTITTLINDKI 227
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
R ++GL+A++ IC+ +C T G +Y VAL+E HF++L+M PP ++ + ++
Sbjct: 228 RVGIVGLAAQVAICREICGKTNGGDDTTYGVALNEQHFRDLVMNVTTPPATYSQKQSTSS 287
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S+C+CH + G GY C RC ++VC LP +C CGL L+ S HLA
Sbjct: 288 LLMMGFPSRTVEAYPSLCACHSKPSRG-GYLCSRCNSKVCGLPAECPSCGLTLILSTHLA 346
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGC----------QQSLLSSGNKPGLYV 283
RSYHHLFP+ + EV+ ++ SRS+ CF C +Q + G G+ V
Sbjct: 347 RSYHHLFPLMNWVEVS------WQHASRSSVCFACGISFPSVPPKEQWHTTEGQTKGISV 400
Query: 284 A----CPKCKKHFCLECDIYIHESLHNCPGCES 312
+ C C+ HFC++CD++ HE +HNCPGC+S
Sbjct: 401 SSRYECTVCRNHFCIDCDLFAHEVVHNCPGCQS 433
>gi|331241057|ref|XP_003333178.1| hypothetical protein PGTG_14725 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309312168|gb|EFP88759.1| hypothetical protein PGTG_14725 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 462
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 207/345 (60%), Gaps = 44/345 (12%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R+ + + FV EFFDQNP+SQ+ ++ KD +A L+ L G+P H KAL
Sbjct: 125 DLRPNRLDMSLTYAKEFVTEFFDQNPISQLCILITKDAIAERLSGLSGNPLDHHKALSNK 184
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KL SG+ SLQN+L++ + LS +PS+G REVL+++ +L+TCDP DI +T++ ++ ++
Sbjct: 185 AKLTPSGEPSLQNSLEMARASLSHLPSHGSREVLVIFGSLTTCDPDDINKTLENLEKDRM 244
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAI------- 172
R S++GL+AE+ ICK +C+ T G Y V L+E HFKEL+ E PPP IA+
Sbjct: 245 RVSMVGLAAEVRICKEICKRTQGQYGVILNEHHFKELLFEAISPPP-IAKSTTNGPKTQS 303
Query: 173 ANLIKMGFPQR-------AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGL 225
++LI+MGFP + + S C+CH +++ G+ CPRC ++ CE+PT+C +CGL
Sbjct: 304 SSLIQMGFPNKITHDAHQVDASTHSFCACHSKLQ-STGFICPRCGSKNCEIPTNCSVCGL 362
Query: 226 QLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL----LSSGNKP-- 279
+VSSPHLARSY HLFP++ + E L+ + CFGC + L S N
Sbjct: 363 TIVSSPHLARSYRHLFPVSNWIEKNANSLSHQTFK----CFGCDKELQVNKASQTNDESN 418
Query: 280 --------------GLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
LY CP+CK FC ECDIY HE L CPGC
Sbjct: 419 GNSNNATNANNPSINLY-HCPRCKNMFCYECDIY-HEQLGLCPGC 461
>gi|219521045|gb|AAI71919.1| GTF2H2C protein [Homo sapiens]
Length = 338
Score = 263 bits (671), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 203/322 (63%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 18 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 77
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 78 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 137
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SV GLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 138 RVSVTGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 196
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+ICGL LV
Sbjct: 197 FPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLV 256
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E+ PL + N R C+GCQ L +YV C C
Sbjct: 257 SAPHLARSYHHLFPLDAFQEI-PL---EEYNGER-FCYGCQGEL----KDQHVYV-CAVC 306
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD+++H+SLH CPGC
Sbjct: 307 QNVFCVDCDVFVHDSLHCCPGC 328
>gi|195019313|ref|XP_001984954.1| GH16776 [Drosophila grimshawi]
gi|193898436|gb|EDV97302.1| GH16776 [Drosophila grimshawi]
Length = 443
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 138/330 (41%), Positives = 200/330 (60%), Gaps = 24/330 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ K +E F+ EFFDQNP+SQ+G++ +K A +T+L G+ H+KAL M
Sbjct: 78 DLKPTRLLCTLKLLEIFIEEFFDQNPISQLGIIALKAKRAEKITELTGTSRVHLKALAGM 137
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ + + SLQN LDL L +PS+ RE++I+ +L+TCDP DI TI + K+ +I
Sbjct: 138 ANVSLTSEPSLQNGLDLALKTLKVVPSHASREIVIIMGSLTTCDPVDINLTIDELKKERI 197
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVI LSAE+ IC++L Q T G++ LD+SH+++ ++ PPPA A+ +LI+MG
Sbjct: 198 RCSVISLSAEIHICRYLTQQTMGTFGAVLDDSHYRDQLLSQVDPPPA-AKTQDNSLIRMG 256
Query: 180 FPQRAGEGS-----ISICSCHKEV------KVGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP E +S+C CH E G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHSKNEVEGKDEPLSMCMCHIENLEEPSGLSTTGHHCPQCNSKYCELPVECQCCGLTLV 316
Query: 229 SSPHLARSYHHLFPIA-----PFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYV 283
S+PHLARSYHHLFP+ PF+ V P P S C+ C + L +K
Sbjct: 317 SAPHLARSYHHLFPVPNFEELPFESVPPAA---PDCGSLRKCYACIKVLNGLTDKS--VF 371
Query: 284 ACPKCKKHFCLECDIYIHESLHNCPGCESL 313
C C + FC++CDI+IHE+LH C GC ++
Sbjct: 372 KCGYCNQFFCIDCDIFIHETLHACVGCNTI 401
>gi|156050059|ref|XP_001590991.1| hypothetical protein SS1G_07615 [Sclerotinia sclerotiorum 1980]
gi|154692017|gb|EDN91755.1| hypothetical protein SS1G_07615 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 479
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 142/335 (42%), Positives = 206/335 (61%), Gaps = 32/335 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHI---KAL 58
D RP+R + + FV E+F+QNP+SQ+G++ ++DG+A ++D+ G+P HI K+L
Sbjct: 122 DMRPTRYLLTLRYASEFVTEYFEQNPISQLGIIGMRDGIAVRISDMSGNPTEHIERLKSL 181
Query: 59 MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
G G+ SLQNAL++ + L PS+G RE+LI+Y AL T DPGDI ETI +
Sbjct: 182 RVDQG-QGNPSLQNALEMSRAALFHAPSHGTREILIIYGALLTSDPGDIHETISSLIADR 240
Query: 119 IRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIA 173
IR S++GL+A++ IC +C T +YSVAL+E HF++L+M PP + + +
Sbjct: 241 IRVSIVGLAAQVAICAEICSRTNAGDDTAYSVALNEEHFRQLMMATTTPPVTRTKKQSHS 300
Query: 174 NLIKMGFPQRA-GEG-SISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
+L+ MGFP R G G S+S C+CH ++ G GY C RC ++VC LP +C CGL L+ S
Sbjct: 301 SLLMMGFPSRTLGPGKSMSFCACHGKLSRG-GYLCSRCGSKVCSLPAECPACGLTLILST 359
Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL--------------LSSGN 277
HLARSYHHLFP+ + EV L + + R+ C+GCQ L S +
Sbjct: 360 HLARSYHHLFPLRNWVEV--LWKDAGKTRA---CYGCQAPFPRRDEHEKGAPEPGLKSMS 414
Query: 278 KPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
+ G Y AC C HFC++CD++ H+ +HNCPGC+S
Sbjct: 415 ESGRY-ACEVCGNHFCIDCDVFAHDVVHNCPGCQS 448
>gi|259481102|tpe|CBF74326.1| TPA: component of the core form of RNA polymerase transcription
factor TFIIH (Eurofung) [Aspergillus nidulans FGSC A4]
Length = 455
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 137/337 (40%), Positives = 202/337 (59%), Gaps = 29/337 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FVREFF+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 106 DLRPTRYLLTLRYAQEFVREFFEQNPISQVGVIGLRDGLALRISDMSGNPTEHISAIQDL 165
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQN L++ +G L PS+G RE+ I++ +L + DPGDI +TI + KI
Sbjct: 166 RSQDPKGLPSLQNGLEMARGALFHTPSHGTREIFIVFGSLLSSDPGDIHQTITTLIDDKI 225
Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIAEFAIA-N 174
R ++GL+A++ IC+ LC T G Y VAL+E HF+EL++ PP ++ A +
Sbjct: 226 RVGIVGLAAQVAICRELCSRTNGGDDTYYGVALNEQHFRELMLAVTTPPATYSQKQSASS 285
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S+C+CH G GY C RC ++VC LP +C CGL L+ S HLA
Sbjct: 286 LLMMGFPSRTIESFPSLCACHSNPTPG-GYLCSRCNSKVCGLPAECPSCGLTLILSTHLA 344
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC----------QQSLLSSGNKPGLYVA 284
RSYHHLFP+ + EV+ R +TCF C +Q + G+ V+
Sbjct: 345 RSYHHLFPLMNWVEVSW-----ERAHRSTTCFACGIAFPTIPPKEQWQATQNLAKGMSVS 399
Query: 285 ----CPKCKKHFCLECDIYIHESLHNCPGCES--LRH 315
C C+ HFC++CD++ HE +HNCPGC+S +RH
Sbjct: 400 SRYECTVCENHFCIDCDLFAHEVVHNCPGCQSRVVRH 436
>gi|400594628|gb|EJP62466.1| TFIIH basal transcription factor complex p47 subunit , putative
[Beauveria bassiana ARSEF 2860]
Length = 503
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 149/348 (42%), Positives = 202/348 (58%), Gaps = 45/348 (12%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHI---KAL 58
D P+R + + AFVREFF+QNP+SQ+G+V ++DGVA ++DLGG+P H+ KAL
Sbjct: 134 DMLPTRHRLTLRYAAAFVREFFEQNPISQLGIVGMRDGVAVRISDLGGNPAEHLERLKAL 193
Query: 59 MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
G+ G+ SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI +T+ +
Sbjct: 194 EGQ-DPQGNPSLQNALEMCRGALFHAPSHGTREVLIIYGALLSSDPGDIHDTMTNLISDR 252
Query: 119 IRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIA 173
IR S++GLSA + IC LC T Y+VA+DE HF+EL + PP E + A
Sbjct: 253 IRVSIVGLSAHLSICAELCSRTNAGDTSQYNVAMDEVHFRELFLAATTPPITRTQEQSTA 312
Query: 174 NLIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
+L+ MGFP R A G++S C+CH + G+ C RC +RVC LP +C CGL L+ S
Sbjct: 313 SLLMMGFPSRVLAPGGAVSYCACHNK-PCREGFLCTRCGSRVCRLPAECPACGLTLILST 371
Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRS-TCFGCQ--------------------- 269
HLARSYHHLFP+ + EV P N ++S CF CQ
Sbjct: 372 HLARSYHHLFPLRNWVEV-PWA-----NAAQSAACFACQCPFSAPPRTVEGGVNGEKHAD 425
Query: 270 -----QSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
+ L + G AC C HFC++CD+Y HE +HNCPGC+S
Sbjct: 426 TQETGRRLAAKGVSESGRYACEVCGNHFCIDCDVYAHEVIHNCPGCQS 473
>gi|169776637|ref|XP_001822785.1| TFIIH basal transcription factor complex p47 subunit [Aspergillus
oryzae RIB40]
gi|83771520|dbj|BAE61652.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|391874438|gb|EIT83320.1| RNA polymerase II transcription initiation protein [Aspergillus
oryzae 3.042]
Length = 458
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 136/334 (40%), Positives = 199/334 (59%), Gaps = 31/334 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + VREFF+QNP+SQ+G++ ++DG+A ++DL G+P HI A+ +
Sbjct: 107 DLRPTRYLLTLRYAQELVREFFEQNPISQLGVLGLRDGLAIRISDLSGNPTEHISAIQTL 166
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQN +++ +G L PS+G RE+ I++ +L + DPGDI +TI K+
Sbjct: 167 RDQDPKGLPSLQNGIEMARGALFHTPSHGTREIFIIFGSLLSSDPGDIHQTIANLINDKV 226
Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIAEFAIA-N 174
R ++GL+A++ IC+ LC T G Y VAL+E HF+EL+M+ PP ++ A +
Sbjct: 227 RVGIVGLAAQVAICRELCAKTNGGDDTRYGVALNEQHFRELLMDVTTPPATYSQKQSASS 286
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S S+C+CH + G GY C RC ++VC LP +C CGL L+ S HLA
Sbjct: 287 LLMMGFPSRTVETSPSLCACHSKSSCG-GYLCSRCNSKVCGLPAECPSCGLTLILSTHLA 345
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRS--RSTCFGC----------QQSLLSSGNKPGLY 282
RSYHHLFP+ + EV P R+ S CF C Q + G+
Sbjct: 346 RSYHHLFPLVNWVEV-------PWQRASRSSVCFACGIAFPPVPPKDQWQTTENQAKGMS 398
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
V+ C C+ HFC++CD++ HE +HNCPGC+S
Sbjct: 399 VSSRYECTVCENHFCIDCDLFAHEVVHNCPGCQS 432
>gi|118344316|ref|NP_001071980.1| zinc finger protein [Ciona intestinalis]
gi|92081522|dbj|BAE93308.1| zinc finger protein [Ciona intestinalis]
Length = 395
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 141/323 (43%), Positives = 197/323 (60%), Gaps = 32/323 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +P+R V K +E F +FFDQNP+SQ+G V+ A L+D+ GS + HIK L
Sbjct: 76 DLKPNRAVAVLKALEIFSEKFFDQNPISQLGFVSTCAKRAEVLSDMSGSLQKHIKVLQD- 134
Query: 62 LG----------CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETI 111
LG C+G+ SLQN+L+L L +PS+ +E+L++ +L+TCDPG I +TI
Sbjct: 135 LGQKIIKTNAKHCTGEPSLQNSLELCAKSLKHMPSHTSKEILVVMGSLTTCDPGSIHDTI 194
Query: 112 QKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFA 171
+ + IRCSVIGLSAE+ ICK+L + T G++ + LDESHF +L+ +H PPP I+
Sbjct: 195 TQIAKQSIRCSVIGLSAEVHICKYLAKSTKGTFGIILDESHFADLLTDHVPPPAEIS--P 252
Query: 172 IANLIKMGFPQRAG---EGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
++LIKMGFP G IS+C VG GY CP+C+A+ ELP C IC L LV
Sbjct: 253 ESSLIKMGFPHHVSPHESGKISLCMSRNTFTVG-GYFCPQCQAKYSELPVQCVICALTLV 311
Query: 229 SSPHLARSYHHLFPIAPFDEV-TPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPK 287
S+PHLARSYHH FP+ F E P+ TC+ C + + ++ YV C +
Sbjct: 312 SAPHLARSYHHFFPLPMFKEQEVPV--------GSLTCYSCCKPITTAT-----YV-CEQ 357
Query: 288 CKKHFCLECDIYIHESLHNCPGC 310
C FC++C+ +IHE+LH+CPGC
Sbjct: 358 CAISFCMDCNAFIHETLHSCPGC 380
>gi|16740713|gb|AAH16231.1| Gtf2h2 protein [Mus musculus]
gi|148668487|gb|EDL00806.1| general transcription factor II H, polypeptide 2, isoform CRA_a
[Mus musculus]
Length = 398
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 143/324 (44%), Positives = 202/324 (62%), Gaps = 27/324 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKTAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDE+H+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDETHYKELLAHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISIC-----SCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
FPQ + S S+ S + +G GY CP+C+A+ CELP +C+ICGL
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG-GYFCPQCRAKYCELPVECKICGLT 312
Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
LVS+PHLARSYHHLFP+ F E++ + C+GCQ L +YV C
Sbjct: 313 LVSAPHLARSYHHLFPLDAFQEISL-----EEYKGERFCYGCQGEL----KDQHVYV-CT 362
Query: 287 KCKKHFCLECDIYIHESLHNCPGC 310
C+ FC++CD+++H+SLH CPGC
Sbjct: 363 VCQNVFCVDCDVFVHDSLHCCPGC 386
>gi|225543398|ref|NP_071294.3| general transcription factor IIH subunit 2 [Mus musculus]
gi|18203119|sp|Q9JIB4.1|TF2H2_MOUSE RecName: Full=General transcription factor IIH subunit 2; AltName:
Full=Basic transcription factor 2 44 kDa subunit;
Short=BTF2 p44; AltName: Full=General transcription
factor IIH polypeptide 2; AltName: Full=TFIIH basal
transcription factor complex p44 subunit
gi|9082152|gb|AAF82753.1| general transcription factor IIH polypeptide 2 [Mus musculus]
Length = 396
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 143/324 (44%), Positives = 202/324 (62%), Gaps = 27/324 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKTAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDE+H+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDETHYKELLAHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISIC-----SCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
FPQ + S S+ S + +G GY CP+C+A+ CELP +C+ICGL
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDNNSTEPGLTLG-GYFCPQCRAKYCELPVECKICGLT 312
Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
LVS+PHLARSYHHLFP+ F E++ + C+GCQ L +YV C
Sbjct: 313 LVSAPHLARSYHHLFPLDAFQEISL-----EEYKGERFCYGCQGEL----KDQHVYV-CT 362
Query: 287 KCKKHFCLECDIYIHESLHNCPGC 310
C+ FC++CD+++H+SLH CPGC
Sbjct: 363 VCQNVFCVDCDVFVHDSLHCCPGC 386
>gi|126320570|ref|XP_001362699.1| PREDICTED: general transcription factor IIH subunit 2-like
[Monodelphis domestica]
Length = 395
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 148/330 (44%), Positives = 202/330 (61%), Gaps = 27/330 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P+ H+ +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPKKHVASLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L L L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSLAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKSLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGVYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ ELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSSTEPGLTLGGYFCPQCRAKYSELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F EV+ N R C GC L +Y+ C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEVSLEEYNGER-----FCQGC----LGELKDQHVYI-CTVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGCESLRHSNP 318
+ FC++CD++IHESLH CPGC H NP
Sbjct: 364 QNVFCVDCDLFIHESLHCCPGC---IHKNP 390
>gi|390459877|ref|XP_002744933.2| PREDICTED: general transcription factor IIH subunit 2 isoform 1
[Callithrix jacchus]
Length = 404
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 146/331 (44%), Positives = 204/331 (61%), Gaps = 33/331 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P H+ +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHVTSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTALARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRIC----- 223
FPQ + S S+ + G+ GY CP+C+A+ CELP +C+IC
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDGNTEPGLTLGGYFCPQCRAKYCELPVECKICGKKPS 313
Query: 224 ----GLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP 279
GL LVS+PHLARSYHHLFP+ PF E+ L + N R C+GCQ L
Sbjct: 314 IHYSGLTLVSAPHLARSYHHLFPLDPFQEI----LLEEYNGER-FCYGCQGEL----KDQ 364
Query: 280 GLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
+YV C C+ FC++CD+++H+SLH CPGC
Sbjct: 365 HVYV-CAVCQNVFCVDCDVFVHDSLHCCPGC 394
>gi|388580345|gb|EIM20660.1| TFIIH basal transcription factor complex, subunit SSL1 [Wallemia
sebi CBS 633.66]
Length = 444
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 130/342 (38%), Positives = 209/342 (61%), Gaps = 37/342 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + + +V+E+FDQNP+ Q+G + ++ GVA + + GS +K+L K
Sbjct: 108 DMRPNRFLLSLEYAREYVKEYFDQNPIGQMGAIGMRSGVAEWICKMSGSQHDLVKSLQNK 167
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L +G+ SLQNAL++ + ++ +P++ RE+++++ +L+TCDPG+I +T++ KI
Sbjct: 168 NKLEPNGEPSLQNALEMARASMAHLPTHASREIVVVFGSLTTCDPGNIHDTLRALIRDKI 227
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAI------- 172
R ++I L+AE+ I + + + TGG+ SVALDE H+K+++ E PPP A+
Sbjct: 228 RVNIISLAAEVRILREVAEKTGGTLSVALDEGHYKDVLFETVPPPAVHTAKALNTVEQGG 287
Query: 173 -----ANLIKMGFPQR-AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
++L++MGF R +++C+CH E++ GY CPRC +++C++PTDC +C L
Sbjct: 288 GSVDESDLMQMGFAVRLPYTAPLTLCACHSELRRQ-GYICPRCGSKLCDIPTDCAVCDLV 346
Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQ---------------- 270
+VSSPHLARSYHHLFP+ + V P + + S + CFGC+Q
Sbjct: 347 VVSSPHLARSYHHLFPVPDWAVVNPQAITET---SDTRCFGCRQVFPPVAQSMTLPTPTA 403
Query: 271 -SLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCE 311
++ ++ + G Y CPKC FC ECDIY HE+LH CPGC+
Sbjct: 404 ATVSANISATGRY-RCPKCMHDFCSECDIYCHETLHVCPGCQ 444
>gi|453087906|gb|EMF15947.1| TFIIH basal transcription factor complex, subunit SSL1
[Mycosphaerella populorum SO2202]
Length = 460
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 136/334 (40%), Positives = 194/334 (58%), Gaps = 37/334 (11%)
Query: 15 VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKLGCSGDSSLQN 72
V AF+RE+F+QNP+SQ+G++ +++G+A ++D+ G+P HI A+ + G+ SLQN
Sbjct: 115 VIAFIREYFEQNPISQLGILGMREGLAIRVSDMSGNPNDHINAIKALRSTDPKGNPSLQN 174
Query: 73 ALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFI 132
AL++ + L PS+G REV I+ AL T DPGDI +TI+ C + KIR ++IGL+A+MFI
Sbjct: 175 ALEMARAALYHTPSHGTREVAIILGALHTSDPGDIHDTIKACIKDKIRVNIIGLAAQMFI 234
Query: 133 CKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIAEFAIAN------LIKMGFPQ 182
C +C+ T Y+VA+DE H++EL+M PP A A L+ MGFP
Sbjct: 235 CAEICRKTNQGDTNCYNVAVDEVHYRELLMSITTPPVVRASDVEAQKRNQAALLMMGFPS 294
Query: 183 RAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFP 242
R E ++C+CH + G GY C RCKA+VC LP C C L L+ S HLARSYHHLFP
Sbjct: 295 RIVEEKATLCACHGNLTRG-GYLCSRCKAKVCNLPATCPTCELTLILSTHLARSYHHLFP 353
Query: 243 IAPFDEVTPLCLNDPRNRSRSTCFGCQQSL---------------------LSSGNKPGL 281
+ +DEV+ + + CFGCQ + G
Sbjct: 354 LQNWDEVS---WKRAQQKRSVQCFGCQSPFPALSGGIDGVDGSANGHSRPKRAEGASESS 410
Query: 282 YVACPKCKKHFCLECDIYIHESLHNCPGCESLRH 315
C C++HFC++CD++ HE +HNCPGC+S H
Sbjct: 411 RYECSTCQQHFCIDCDVFCHEVVHNCPGCQSNAH 444
>gi|345304797|ref|XP_001512459.2| PREDICTED: general transcription factor IIH subunit 2-like
[Ornithorhynchus anatinus]
Length = 395
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 145/330 (43%), Positives = 203/330 (61%), Gaps = 27/330 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P H AL
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHTTALKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L L L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCLGEPSLYNSLSLAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKSLKTAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R S+IGLSAE+ +C L ++TGG+Y V LDESH+KEL++ H PPPA + +LI+MG
Sbjct: 195 RVSIIGLSAEVRVCTVLARETGGTYHVILDESHYKELLIHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ CELP +C++CGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSSSEPGLTLGGYFCPQCRAKYCELPVECKVCGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E++ + R C GCQ L +Y+ C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEISLEEYHGER-----VCQGCQGEL----KDQQVYI-CTVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGCESLRHSNP 318
+ FC+ECD+++H+ LH CPGC H NP
Sbjct: 364 QSVFCVECDLFVHDFLHCCPGC---IHKNP 390
>gi|354488323|ref|XP_003506320.1| PREDICTED: general transcription factor IIH subunit 2-like
[Cricetulus griseus]
gi|344248232|gb|EGW04336.1| General transcription factor IIH subunit 2 [Cricetulus griseus]
Length = 395
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 141/322 (43%), Positives = 200/322 (62%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C+G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCNGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDE+H+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDETHYKELLTHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ ELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDNSTEPGLTLGGYFCPQCRAKYSELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F E++ + C+GCQ L +YV C C
Sbjct: 314 SAPHLARSYHHLFPLDAFQEISL-----EEYKGERFCYGCQGEL----KDQHVYV-CTVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
+ FC++CD++IH+ LH CPGC
Sbjct: 364 QNVFCVDCDVFIHDCLHCCPGC 385
>gi|378727991|gb|EHY54450.1| transcription initiation factor TFIIH subunit H2, variant
[Exophiala dermatitidis NIH/UT8656]
gi|378727992|gb|EHY54451.1| transcription initiation factor TFIIH subunit H2 [Exophiala
dermatitidis NIH/UT8656]
Length = 464
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 137/330 (41%), Positives = 199/330 (60%), Gaps = 24/330 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + E+F+REFF+QNP+SQ+G+V ++DG+A ++D+ G+P +H+ A+ +
Sbjct: 100 DLRPTRYLLTLMYTESFIREFFEQNPISQLGIVGMRDGIAVRVSDMSGNPTAHLAAIQKI 159
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G SL+NAL++ + L PS+G REVL+++ AL T DPGDI TI KI
Sbjct: 160 RKQEPKGQPSLENALEMARAALFHAPSHGTREVLLIFGALHTSDPGDIHRTIDNLVTDKI 219
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIME-HAPPPPAIAEFAIAN 174
R +VIGL+A++ +C L T +Y V L E H++EL+M PP + + + ++
Sbjct: 220 RATVIGLAAQVAVCAELVSKTNNGDLSNYGVVLHEQHYRELLMAVTTPPVTSESTKSASS 279
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S+C+CH GY C RC +VC LP++C +CGL L+ S HLA
Sbjct: 280 LLMMGFPSRTVEDYPSLCACHTNPSRD-GYLCSRCSTKVCGLPSECPVCGLTLILSTHLA 338
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC----------QQSLLSSGNKPGLYVA 284
RSYHHLFP+ + EV P RS++ CF C +Q + G Y
Sbjct: 339 RSYHHLFPLVNWLEV-PW---SQAYRSKA-CFACLKQFPVVPPREQWTAAKTTTSGRY-E 392
Query: 285 CPKCKKHFCLECDIYIHESLHNCPGCESLR 314
CP C HFC++CD++ HE +HNCPGC+S+R
Sbjct: 393 CPVCHHHFCIDCDLFAHEVVHNCPGCQSVR 422
>gi|194752572|ref|XP_001958595.1| GF11005 [Drosophila ananassae]
gi|190625877|gb|EDV41401.1| GF11005 [Drosophila ananassae]
Length = 422
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 199/329 (60%), Gaps = 21/329 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +P+R+ K +E F+ EFFDQNP+SQ+G++ +K A +T+L G+ H+KAL
Sbjct: 78 DLKPTRLRCTIKLLEQFIEEFFDQNPISQLGIIALKAKRAEKVTELTGTSRVHLKALESL 137
Query: 62 LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ S + SLQN LDL L +PS+ RE++++ +L+TCDP DI TI + K+ I
Sbjct: 138 INVSLTSEPSLQNGLDLALKTLKVVPSHASREIVVIMGSLTTCDPVDINLTIDELKKEGI 197
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVI LSAE+ + ++L Q T G++ LD++HF++ +M PPPA A+ +LI+MG
Sbjct: 198 RCSVISLSAEIHVARYLTQQTLGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256
Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP E +S+C CH E V G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSVLCTTGHHCPQCNSKYCELPVECQSCGLTLV 316
Query: 229 SSPHLARSYHHLFPIAPFDEVT----PLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA 284
S+PHLARSYHHL P+ F+EVT P +D N C+ C + L +K
Sbjct: 317 SAPHLARSYHHLIPVPNFEEVTLDSMPTVSSD-YNGGVKECYACAKVLTFVADKS--IFK 373
Query: 285 CPKCKKHFCLECDIYIHESLHNCPGCESL 313
C C++ FC++CD +IHE+LH C GC ++
Sbjct: 374 CGFCQQFFCIDCDAFIHETLHACVGCNTI 402
>gi|195127781|ref|XP_002008346.1| GI13436 [Drosophila mojavensis]
gi|193919955|gb|EDW18822.1| GI13436 [Drosophila mojavensis]
Length = 461
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 134/327 (40%), Positives = 200/327 (61%), Gaps = 18/327 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ K +E F+ EFFDQNP+SQ+G++ +K A +T+L G+ H+KAL +
Sbjct: 78 DLKPTRLLCTLKLLEIFIEEFFDQNPISQLGIIALKAKRAEKITELTGTSRVHLKALASL 137
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ + + SLQN LDL L +PS+ RE++I+ +L+TCDP DI TI + K +I
Sbjct: 138 ANVPLTSEPSLQNGLDLALKTLKVVPSHASREIVIIMGSLTTCDPVDINVTIDELKRERI 197
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVI LSAE+ IC++L Q T G++ LD++H+++ ++ PPPA A+ +LI+MG
Sbjct: 198 RCSVISLSAEIHICRYLTQQTMGTFGAVLDDAHYRDQLLSQVDPPPA-AKTQDNSLIRMG 256
Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP E +S+C CH E G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHSKNEVEGKDAPLSMCMCHIENLDEPSELSTTGHHCPQCNSKYCELPVECQCCGLTLV 316
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCL--NDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
S+PHLARSYHHLFP+ F+E++ + P S C+ C + L +K C
Sbjct: 317 SAPHLARSYHHLFPVPNFEELSYDMVPPASPDCGSLRKCYACIKVLNPVTDKS--VFKCG 374
Query: 287 KCKKHFCLECDIYIHESLHNCPGCESL 313
C + FC++CDI+IHE+LH C GC ++
Sbjct: 375 YCNQFFCIDCDIFIHETLHACVGCNTI 401
>gi|213402509|ref|XP_002172027.1| transcription factor TFIIH complex subunit Ssl1
[Schizosaccharomyces japonicus yFS275]
gi|212000074|gb|EEB05734.1| transcription factor TFIIH complex subunit Ssl1
[Schizosaccharomyces japonicus yFS275]
Length = 422
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 136/322 (42%), Positives = 193/322 (59%), Gaps = 17/322 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DFR R+ + K F+ +F+QNP+SQ+G++ +KDG+A ++D+ G+P+ HI L
Sbjct: 104 DFRTKRVDLQIKYGTEFIINYFEQNPISQLGIIAMKDGIAVKISDIHGNPQDHINKLKKL 163
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
C G +SLQNAL++ + L+ I S+G REVLI++ +L + DPGDI +TI + I
Sbjct: 164 RECKGMASLQNALEMARASLAHIASHGTREVLIIFGSLLSSDPGDIFQTIDSLVKENISV 223
Query: 122 SVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAI-ANLI 176
+IGLSAE+ +CK +C+ T +Y V L+E+H +EL+ME PP ++ I A+L+
Sbjct: 224 HIIGLSAEVSVCKEICRRTNNGAQNAYGVILNENHLEELLMEKTIPPATHSDEGIPASLV 283
Query: 177 KMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
KMGFP + E S+CSCH G G+ CPRCK +VC LP +C C L L+ S HLARS
Sbjct: 284 KMGFPSKVIEPFPSLCSCHSVASRG-GFHCPRCKCKVCTLPIECPGCSLILILSTHLARS 342
Query: 237 YHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL------LSSGNKPGLYVACPKCKK 290
YHHLFP+ + E+ P + + CF CQ N ACP C
Sbjct: 343 YHHLFPLKNWREI-PW----SEKPTSTYCFACQAPFPMTPQAKEDQNASSSRYACPSCGH 397
Query: 291 HFCLECDIYIHESLHNCPGCES 312
HFC+ECD++ HE LH C GC+S
Sbjct: 398 HFCIECDVFAHEQLHECFGCQS 419
>gi|410925308|ref|XP_003976123.1| PREDICTED: general transcription factor IIH subunit 2-like
[Takifugu rubripes]
Length = 392
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 192/322 (59%), Gaps = 24/322 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +P+R+ K +EAFV E+FDQNP+SQ+GL+T K+ A LTDL G+P+ H AL
Sbjct: 75 DLKPNRLTSTLKLLEAFVDEYFDQNPISQMGLITTKNKRAEKLTDLSGNPKKHCAALKKA 134
Query: 62 LG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N L L L +P + RE+LI+ S+L+TCDP +I E I+ K K+
Sbjct: 135 VDTVCVGEPSLYNCLSLAVQTLRHMPGHTSRELLIILSSLTTCDPANIYELIETLKTLKV 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGGSY V LDESHFKEL+M H PPPA + + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLTRETGGSYHVILDESHFKELLMLHVKPPPA-SSSSECSLIRMG 253
Query: 180 FPQRA------GEGSISICSCHKEVKVGV-----GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S H + G GY CP+C A+ ELP +C++CGL LV
Sbjct: 254 FPQHTVASLTDQDAKPSFSMAHLDSSSGPDLSLGGYFCPQCHAKYTELPVECKVCGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
+PHLARS+HHLFP+ F E L R +C C L K +CP C
Sbjct: 314 LAPHLARSFHHLFPLQVFTESPADDLQTER-----SCQACHGEL-----KDKSVFSCPSC 363
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
FC++C ++IH+SLH CP C
Sbjct: 364 LSVFCVDCHLFIHDSLHCCPCC 385
>gi|195160565|ref|XP_002021146.1| GL24973 [Drosophila persimilis]
gi|194118259|gb|EDW40302.1| GL24973 [Drosophila persimilis]
Length = 445
Score = 259 bits (661), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 136/331 (41%), Positives = 200/331 (60%), Gaps = 22/331 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E F+ EFFDQNP+SQ+G++ +K A +T+L G+ H+KAL G
Sbjct: 78 DLKPTRLRCTIKLLEQFIEEFFDQNPISQMGIIALKAKRAEKITELTGTSRVHLKALEGL 137
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ + + SLQN LDL L +PS+ RE++I+ +L+TCDP DI TI + K+ I
Sbjct: 138 ANVPLTSEPSLQNGLDLALKTLKVVPSHASREIVIVMGSLTTCDPVDINLTIDELKKEGI 197
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVI LSAE+ I ++L Q T G++ LD++HF++ +M PPPA A+ +LI+MG
Sbjct: 198 RCSVISLSAEIHIARYLTQQTRGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256
Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP E +S+C CH E G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSELCTTGHHCPQCNSKYCELPVECQSCGLTLV 316
Query: 229 SSPHLARSYHHLFPIAPFDEV----TPLCLND--PRNRSRSTCFGCQQSLLSSGNKPGLY 282
S+PHLARSYHHLFP+ F+E+ P +D + C+ C + + +K
Sbjct: 317 SAPHLARSYHHLFPVPNFEEIPFDSLPKASSDRGGTGTGTTECYACMKLFVPVADK--TV 374
Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGCESL 313
C CK+ FCL+CDI+IH++LH C GC ++
Sbjct: 375 FKCGFCKQFFCLDCDIFIHDTLHACVGCNTI 405
>gi|146324050|ref|XP_001481497.1| RNA polymerase TFIIH complex subunit Ssl1 [Aspergillus fumigatus
Af293]
gi|129556348|gb|EBA27248.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Aspergillus
fumigatus Af293]
gi|159126166|gb|EDP51282.1| transcription factor and DNA repair complex, core TFIIH, putative
[Aspergillus fumigatus A1163]
Length = 456
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 139/334 (41%), Positives = 200/334 (59%), Gaps = 31/334 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FVREFF+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 107 DLRPTRYLLSLRYAQEFVREFFEQNPISQLGVLGLRDGLAIRVSDMSGNPTEHISAIQAL 166
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQN L++ +G L PS+G REV +++ ++ + DPGDI +TI KI
Sbjct: 167 RDHDPKGLPSLQNGLEMARGALFHTPSHGTREVFVIFGSILSSDPGDIHQTITTLINDKI 226
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
+ GL+A++ IC+ LC T +Y VAL+E HF+EL+M+ PP ++ A ++
Sbjct: 227 SVRIAGLAAQVAICRELCSRTNAGDDTTYGVALNEQHFRELMMDVTTPPVTYSQKKATSS 286
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S+C+CH E G GY C RC ++VC LP +C CGL L+ S HLA
Sbjct: 287 LLMMGFPSRTVETCPSLCACHSEPSCG-GYLCSRCNSKVCGLPAECPSCGLTLILSTHLA 345
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNR-SRS-TCFGC----------QQSLLSSGNKPGLY 282
RSYHHLFP+ + EV P R SRS TCF C Q + G+
Sbjct: 346 RSYHHLFPLMNWIEV-------PWQRASRSLTCFACGIYFPTVPPKDQWQATESQAKGMS 398
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
V+ C CK HFC++CD++ HE +HNCPGC+S
Sbjct: 399 VSSRYECTVCKNHFCIDCDLFAHEIVHNCPGCQS 432
>gi|408397644|gb|EKJ76784.1| hypothetical protein FPSE_02970 [Fusarium pseudograminearum CS3096]
Length = 469
Score = 258 bits (659), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 145/342 (42%), Positives = 200/342 (58%), Gaps = 37/342 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D P+R ++ AFVREFF+QNP+SQ+G++ ++DGVA ++D+GG+P H+ L G
Sbjct: 108 DLLPTRYRLMLSYAVAFVREFFEQNPISQLGIIGMRDGVAVRVSDVGGNPAEHLDKLKGL 167
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G+ SLQNAL++ +G L PS+G REV I+Y AL + DPGDI ETI +I
Sbjct: 168 ESQDPQGNPSLQNALEMCRGALFHAPSHGTREVFIIYGALLSSDPGDIHETIGNLITDRI 227
Query: 120 RCSVIGLSAEMFICKHLCQDT----GGSYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
R S++GLSA + IC LC T G Y++A+DE HF+EL + PP E + A+
Sbjct: 228 RVSIVGLSAHLAICADLCSRTNAGDGSQYNIAMDEVHFRELFLAATTPPVTRTVEQSTAS 287
Query: 175 LIKMGFPQRA--GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
L+ MGFP R G+ S C+CH GY C RC ARVC +P++C C L L+ S H
Sbjct: 288 LLMMGFPSRTLVPNGTTSYCACHNR-PCREGYLCTRCGARVCRIPSECPSCDLTLILSTH 346
Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-------------QQSLLSSG--- 276
LARSYHHLFP+ + EV P + + + CF C +S SG
Sbjct: 347 LARSYHHLFPLRNWIEV-PWA----KATTSAGCFSCLAPFPEPPKNKGHDKSKEDSGAPK 401
Query: 277 -----NKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESL 313
++ G Y AC C +HFC++CD++ HE +HNCPGC+SL
Sbjct: 402 TAKGVSESGRY-ACEVCGQHFCIDCDVFAHEIVHNCPGCQSL 442
>gi|345568422|gb|EGX51316.1| hypothetical protein AOL_s00054g386 [Arthrobotrys oligospora ATCC
24927]
Length = 542
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 141/362 (38%), Positives = 201/362 (55%), Gaps = 51/362 (14%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + + FV E+F+QNP+SQ+ ++ + DG+A ++D+GG+P H+ AL K
Sbjct: 124 DLRPTRFLLTLRYARDFVTEYFEQNPISQLSIIGMHDGLALRISDMGGNPADHLTALQEK 183
Query: 62 ---LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
+ G SSLQNAL++ + L P++G RE+L+++ ALS+ DPGDI +TI E +
Sbjct: 184 AFTVEPKGASSLQNALEMGRAELFHTPTHGTREILLIFGALSSSDPGDIHDTINSLIEDR 243
Query: 119 IRCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPP-PAIAEFAIA 173
I+ ++GL+A++ +C+ L T +Y V LDE H++EL+M PP E ++
Sbjct: 244 IQVRIVGLAAQVAVCRELVMKTNDGDDSAYGVILDEGHYRELMMAATTPPVTKTEEKSVP 303
Query: 174 NLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
+L+ MGFP R E S+CSCH + G GY CPRC A+VC LPT+C CGL L+ S HL
Sbjct: 304 SLLMMGFPSRTVEKHESLCSCHSNLTKG-GYICPRCTAKVCFLPTECPCCGLTLILSTHL 362
Query: 234 ARSYHHLFPIAPFDEV---------------TPLCLNDPRNRS----------------R 262
ARSYHHLFP+ + EV +P N + S
Sbjct: 363 ARSYHHLFPLKNWVEVPWSEAHTSTHCWSCQSPFPTNQSQTSSIKIKIRNPSTTAATATA 422
Query: 263 STCFGCQQSLLSSGNK-----------PGLYVACPKCKKHFCLECDIYIHESLHNCPGCE 311
S G + + N P AC CK HFC++CDI+ HESLHNCPGC+
Sbjct: 423 SRSNGATTTSTAPKNDHQQQQQQPPQPPTQRYACTDCKTHFCIDCDIFTHESLHNCPGCQ 482
Query: 312 SL 313
SL
Sbjct: 483 SL 484
>gi|198464821|ref|XP_001353376.2| GA10767 [Drosophila pseudoobscura pseudoobscura]
gi|198149896|gb|EAL30883.2| GA10767 [Drosophila pseudoobscura pseudoobscura]
Length = 445
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 135/331 (40%), Positives = 200/331 (60%), Gaps = 22/331 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K ++ F+ EFFDQNP+SQ+G++ +K A +T+L G+ H+KAL G
Sbjct: 78 DLKPTRLRCTIKLLDQFIEEFFDQNPISQMGIIALKAKRAEKITELTGTSRVHLKALEGL 137
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ + + SLQN LDL L +PS+ RE++I+ +L+TCDP DI TI + K+ I
Sbjct: 138 ANVPLTSEPSLQNGLDLALKTLKVVPSHASREIVIVMGSLTTCDPVDINLTIDELKKEGI 197
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVI LSAE+ I ++L Q T G++ LD++HF++ +M PPPA A+ +LI+MG
Sbjct: 198 RCSVISLSAEIHIARYLTQQTRGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRMG 256
Query: 180 FPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
FP E +S+C CH E G+ CP+C ++ CELP +C+ CGL LV
Sbjct: 257 FPHTKNEVEGKDAPLSMCMCHIENLEEPSELCTTGHHCPQCNSKYCELPVECQSCGLTLV 316
Query: 229 SSPHLARSYHHLFPIAPFDEV----TPLCLND--PRNRSRSTCFGCQQSLLSSGNKPGLY 282
S+PHLARSYHHLFP+ F+E+ P +D + C+ C + + +K
Sbjct: 317 SAPHLARSYHHLFPVPNFEEIPFDSLPKASSDRGGTGTGTTECYACMKLFVPVADK--TV 374
Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGCESL 313
C CK+ FCL+CDI+IH++LH C GC ++
Sbjct: 375 FKCGFCKQFFCLDCDIFIHDTLHACVGCNTI 405
>gi|395517618|ref|XP_003762972.1| PREDICTED: general transcription factor IIH subunit 2 [Sarcophilus
harrisii]
Length = 395
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 146/330 (44%), Positives = 200/330 (60%), Gaps = 27/330 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+ HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNARKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L L L +P + REVLI++ +L+TCDP +I + I+ K +KI
Sbjct: 135 VDMTCHGEPSLYNSLSLAMQTLKHMPGHTSREVLIIFGSLTTCDPSNIYDLIKSLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG 253
Query: 180 FPQRA--------GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ + S S+ + G+ GY CP+C+A+ ELP +C+ICGL LV
Sbjct: 254 FPQHTIASLSDQDAKPSFSMAHLDSSTEPGLTLGGYFCPQCRAKYSELPVECKICGLTLV 313
Query: 229 SSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
S+PHLARSYHHLFP+ F EV+ N R C GC L +Y+ C C
Sbjct: 314 SAPHLARSYHHLFPLEAFQEVSLEEYNGER-----FCQGC----LGELKDQHVYI-CTVC 363
Query: 289 KKHFCLECDIYIHESLHNCPGCESLRHSNP 318
+ FC++CD++IH+SLH CPGC H NP
Sbjct: 364 QNVFCVDCDLFIHDSLHCCPGC---IHKNP 390
>gi|452824299|gb|EME31303.1| transcription initiation factor TFIIH subunit H2 isoform 1
[Galdieria sulphuraria]
Length = 398
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 137/325 (42%), Positives = 199/325 (61%), Gaps = 14/325 (4%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +PSR V + F+ +F++NP+SQ+ +V ++DGVA L+ +G +P H + +
Sbjct: 78 DVKPSRGEVCLSCAQKFLYSYFNENPISQLAVVVLRDGVAEKLSSMGSNPRQHSEVVKNA 137
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ G G+ SLQN LD+ LL IPSYG REVLILY+++S+CDPGDI +TI+K ++ +I
Sbjct: 138 NQKGFYGNCSLQNGLDVALSLLHSIPSYGSREVLILYNSISSCDPGDIRQTIEKLEKERI 197
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RC+VIG++AE++I K+L T GSY V ++ESH EL+ + P I L++MG
Sbjct: 198 RCNVIGMAAELYILKYLAARTHGSYFVCMNESHLLELLEDFVVPSALIENNTKTALVRMG 257
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP +C K ++ V + CPRC+ E+P +C +CGL LVSS LARSYHH
Sbjct: 258 FPTLKAYKEPKVCLNDKVLRNQV-FVCPRCECCYGEIPIECVLCGLILVSSSQLARSYHH 316
Query: 240 LFPIAPFDEVTPLCLNDPRNRSR---STCFGCQQSLLSSGNKPGLYVACPKCKKHFCLEC 296
LFP+A F E+ D +S + CF C+ + G++ Y CP C+ FC C
Sbjct: 317 LFPVANFHELE----EDKYKQSHFQPNYCFACKGTFPFEGSE---YFVCPNCQNIFCGIC 369
Query: 297 DIYIHESLHNCPGCESLRHSNPIVA 321
DI+IH+ L+NCPGCE +R N I +
Sbjct: 370 DIFIHDGLNNCPGCE-IRPENRIAS 393
>gi|58271290|ref|XP_572801.1| RNA polymerase II transcription factor [Cryptococcus neoformans
var. neoformans JEC21]
gi|57229060|gb|AAW45494.1| RNA polymerase II transcription factor, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 460
Score = 257 bits (656), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 137/349 (39%), Positives = 195/349 (55%), Gaps = 45/349 (12%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D+RP+R V+ + +V E+FDQNPL QIG++ ++D ++ L +GGSPE ++AL K
Sbjct: 116 DYRPTRFEVILGYLRTYVVEWFDQNPLGQIGVIAMRDRLSEVLIPMGGSPEEIVRALSDK 175
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPG---DIMETIQKCKE 116
L SG+ SLQN L + +G ++ +PS E+L+++SA+ST DP I +
Sbjct: 176 RKLEPSGEPSLQNGLVMAKGGMAHLPSTSSLEILVIFSAISTADPDGPITIHNVLDTLAT 235
Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIAN-- 174
IR S++ LS E+ ICK + + TGG + VALD+ H K+L+ E PPP AN
Sbjct: 236 GHIRTSILSLSGEIKICKQIAERTGGKFGVALDQEHLKDLLWETIPPPATTIAPVTANVR 295
Query: 175 ---------------------LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARV 213
L+ MGFP R G ++C+CH ++ G GY CPRC +++
Sbjct: 296 SALAAGGRGPNQTGERAPAGDLMVMGFPIRLPLGGETMCACHGLLRKG-GYLCPRCGSKL 354
Query: 214 CELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL- 272
C++PTDC +CGL +VSSPHLARS+ LFP+A + PL + D S TCFGC
Sbjct: 355 CDVPTDCEVCGLMVVSSPHLARSFWFLFPVANYG---PLSIEDVVESSE-TCFGCDSEFS 410
Query: 273 -----------LSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
+ G P C KCK FC +CD+YIH++LH CPGC
Sbjct: 411 DATAINAGVAQVEDGVSPTGRYRCAKCKHDFCADCDLYIHDTLHTCPGC 459
>gi|195440790|ref|XP_002068223.1| GK12882 [Drosophila willistoni]
gi|194164308|gb|EDW79209.1| GK12882 [Drosophila willistoni]
Length = 454
Score = 256 bits (655), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 133/325 (40%), Positives = 198/325 (60%), Gaps = 22/325 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ K +E F+ EFFDQNP+SQ+G++ +K A +T+L G+ H+KAL +
Sbjct: 75 DLKPTRLRCTLKLLEIFIDEFFDQNPISQLGIIALKAKRAEKITELTGTSRVHLKALESL 134
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ + + SLQN LD+ L +PS+ RE++I+ +L+TCDP DI TI + K+ I
Sbjct: 135 ANVVLTSEPSLQNGLDMALKTLKVVPSHASREIVIIMGSLTTCDPVDINLTINELKKEGI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVI LSAE+ + ++L ++T GSY LD++HF++ ++ PPPA A+ +LI+MG
Sbjct: 195 RCSVISLSAEIHVARYLAKETMGSYGAVLDDAHFRDQLLSQVDPPPA-AKTQHNSLIRMG 253
Query: 180 FPQRAG------EGSISICSCHKE------VKVGVGYTCPRCKARVCELPTDCRICGLQL 227
FP + +S+C CH E G+ CP+C ++ CELP +C+ C L L
Sbjct: 254 FPHSTSNEADGKDAPLSMCMCHIENVDEPSELSTTGHYCPQCHSKYCELPVECQSCDLTL 313
Query: 228 VSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPK 287
VS+PHLARSYHH+FP+ F+E+ L P + CF C + L + +K C
Sbjct: 314 VSAPHLARSYHHIFPVPNFEEMPHDQLPGP-----TECFSCAKLLATVADK--TVYKCIH 366
Query: 288 CKKHFCLECDIYIHESLHNCPGCES 312
C + FCL+CDI+IHE+LH C GC S
Sbjct: 367 CGEVFCLDCDIFIHETLHACVGCNS 391
>gi|58271288|ref|XP_572800.1| RNA polymerase II transcription factor [Cryptococcus neoformans
var. neoformans JEC21]
gi|134114810|ref|XP_773703.1| hypothetical protein CNBH1580 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256331|gb|EAL19056.1| hypothetical protein CNBH1580 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57229059|gb|AAW45493.1| RNA polymerase II transcription factor, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 460
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 136/349 (38%), Positives = 195/349 (55%), Gaps = 45/349 (12%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D+RP+R V+ + +V E+FDQNPL QIG++ ++D ++ L +GG+PE ++AL K
Sbjct: 116 DYRPTRFEVILGYLRTYVVEWFDQNPLGQIGVIAMRDRLSEVLIPMGGNPEEIVRALSDK 175
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPG---DIMETIQKCKE 116
L SG+ SLQN L + +G ++ +PS E+L+++SA+ST DP I +
Sbjct: 176 RKLEPSGEPSLQNGLVMAKGGMAHLPSTSSLEILVIFSAISTADPDGPITIHNVLDTLAT 235
Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIAN-- 174
IR S++ LS E+ ICK + + TGG + VALD+ H K+L+ E PPP AN
Sbjct: 236 GHIRTSILSLSGEIKICKQIAERTGGKFGVALDQEHLKDLLWETIPPPATTIAPVTANVR 295
Query: 175 ---------------------LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARV 213
L+ MGFP R G ++C+CH ++ G GY CPRC +++
Sbjct: 296 SALAAGGRGPNQTGERAPAGDLMVMGFPIRLPLGGETMCACHGLLRKG-GYLCPRCGSKL 354
Query: 214 CELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL- 272
C++PTDC +CGL +VSSPHLARS+ LFP+A + PL + D S TCFGC
Sbjct: 355 CDVPTDCEVCGLMVVSSPHLARSFWFLFPVANYG---PLSIEDVVESSE-TCFGCDSEFS 410
Query: 273 -----------LSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
+ G P C KCK FC +CD+YIH++LH CPGC
Sbjct: 411 DATAINAGVAQVEDGVSPTGRYRCAKCKHDFCADCDLYIHDTLHTCPGC 459
>gi|321261922|ref|XP_003195680.1| component of RNA polymerase transcription factor TFIIH; Ssl1p
[Cryptococcus gattii WM276]
gi|317462154|gb|ADV23893.1| Component of RNA polymerase transcription factor TFIIH, putative;
Ssl1p [Cryptococcus gattii WM276]
Length = 461
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 137/349 (39%), Positives = 194/349 (55%), Gaps = 45/349 (12%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D+RPSR V+ + +V E+FDQNPL QIG++ ++D ++ + +GGSPE ++AL K
Sbjct: 116 DYRPSRFEVILGYLRTYVVEWFDQNPLGQIGVIVMRDRLSEVVIPMGGSPEEIVRALSDK 175
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPG---DIMETIQKCKE 116
L SG+ SLQN L + +G ++ +PS E L+++SA+ST DP I +
Sbjct: 176 RKLEPSGEPSLQNGLVMAKGGMAHLPSTSSLETLVIFSAISTADPDGPITIHNVLDTLVT 235
Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIAN-- 174
IR S++ LS E+ ICK + + TGG + VALD+ H K+L+ E PPP AN
Sbjct: 236 GHIRTSILSLSGEIKICKQIAERTGGKFGVALDQDHLKDLLWETIPPPATTITPVTANVR 295
Query: 175 ---------------------LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARV 213
L+ MGFP R G ++C+CH ++ G GY CPRC +++
Sbjct: 296 SALAAGGRGPNQTGGRAPAGDLMVMGFPIRLPLGGETMCACHGLLRKG-GYLCPRCGSKL 354
Query: 214 CELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL- 272
C++PTDC +CGL +VSSPHLARS+ LFP+A + PL + D S TCFGC
Sbjct: 355 CDVPTDCEVCGLMVVSSPHLARSFWFLFPVANYG---PLAIEDVVG-SSGTCFGCDSEFS 410
Query: 273 -----------LSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
+ G P C KCK FC +CD+YIH++LH CPGC
Sbjct: 411 DTSAINAGVAQVEDGVSPAGRYRCAKCKHDFCADCDLYIHDTLHTCPGC 459
>gi|212537907|ref|XP_002149109.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Talaromyces
marneffei ATCC 18224]
gi|210068851|gb|EEA22942.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Talaromyces
marneffei ATCC 18224]
Length = 453
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 140/340 (41%), Positives = 201/340 (59%), Gaps = 31/340 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV EFF+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 103 DIRPTRYLLALRYAQEFVIEFFEQNPISQLGVIGMRDGLAVRISDMSGNPTDHITAIHAL 162
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQNAL++ +G L PS+G REVLI++ +L + DPGDI +T++ KI
Sbjct: 163 RPDDPKGLPSLQNALEMARGALFHTPSHGTREVLIVFGSLLSSDPGDIHQTLKSLIADKI 222
Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIA-EFAIAN 174
R ++GL+A++ IC+ LC T G Y VAL+E HF+EL+M+ PP A + + +
Sbjct: 223 RVGIVGLAAQVAICRELCAKTNGGDDSVYGVALNEQHFRELMMDVTIPPAAYSYRKSSSA 282
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S+C+CH + G GY C RC +VC LP +C CGL L+ S HLA
Sbjct: 283 LLMMGFPSRTVESFPSLCACHSKPSCG-GYLCSRCGNKVCGLPAECPSCGLTLILSTHLA 341
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRS-RST-CFGC----------QQSLLSSGNKPGLY 282
RSYHHLFP+ + EV P R+ RST C+ C Q G+
Sbjct: 342 RSYHHLFPLINWVEV-------PWKRALRSTNCYACGIQFPTVPPADQWGAFENQAKGMS 394
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCESLRHSNP 318
V+ C C HFC++CD++ HE +HNCPGC+S P
Sbjct: 395 VSSRYECSACHNHFCIDCDLFAHEIVHNCPGCQSTNFMKP 434
>gi|254577733|ref|XP_002494853.1| ZYRO0A11176p [Zygosaccharomyces rouxii]
gi|238937742|emb|CAR25920.1| ZYRO0A11176p [Zygosaccharomyces rouxii]
Length = 477
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 141/323 (43%), Positives = 195/323 (60%), Gaps = 18/323 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + FV EFFDQNP+SQIG+V++++G+A+ ++ + G+P+ H+ AL +
Sbjct: 156 DMRPNRHAMLVQYSIDFVHEFFDQNPISQIGIVSMRNGLAHMISQVSGNPQDHVDALKSI 215
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P + +EVLI++ +LS+ DPGDI TI + I
Sbjct: 216 RKQEPKGNPSLQNALEMTRGLLLPVPPHCTKEVLIVFGSLSSTDPGDIHLTIGSLVQEGI 275
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
R +IGLSA++ IC+ LC+ T Y V LDE H KEL E P P L
Sbjct: 276 RVKIIGLSAQVAICRELCKATNYGDESFYKVPLDEIHCKELFAEAVTPLPVNKINKGFTL 335
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+KMGFP R E + + C+CH ++ G GY CP C+++VC LP C C L L+ S HLAR
Sbjct: 336 VKMGFPTRIFEEAPTFCACHSKLLYG-GYFCPNCQSKVCSLPMVCPCCDLMLILSTHLAR 394
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGN-KPGLYVA-----CPKCK 289
SYHHL P+ F EV + N CF CQ + N K G + C C
Sbjct: 395 SYHHLMPLKTFSEVPTGQVFPTEN-----CFSCQVRFPTVKNHKTGELLTSSRYRCEDCL 449
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
K FC++CD++IHE LHNCPGCES
Sbjct: 450 KDFCIDCDVFIHEILHNCPGCES 472
>gi|46123249|ref|XP_386178.1| hypothetical protein FG06002.1 [Gibberella zeae PH-1]
Length = 469
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 201/343 (58%), Gaps = 39/343 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D P+R ++ AFVREFF+QNP+SQ+G++ ++DGVA ++D+GG+P H+ L G
Sbjct: 108 DLLPTRYRLMLSYAVAFVREFFEQNPISQLGIIGMRDGVAVRVSDVGGNPAEHLDKLKGL 167
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G+ SLQNAL++ +G L PS+G REV I+Y AL + DPGDI ETI +I
Sbjct: 168 ESQDPQGNPSLQNALEMCRGALFHAPSHGTREVFIIYGALLSSDPGDIHETIGNLITDRI 227
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP--PAIAEFAIA 173
R S++GLSA + IC LC T Y++A+DE HF+EL + PP +A+ + A
Sbjct: 228 RVSIVGLSAHLAICADLCSRTNAGDESQYNIAMDEVHFRELFLAATTPPVTRTVAQ-STA 286
Query: 174 NLIKMGFPQRA--GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
+L+ MGFP R G+ S C+CH GY C RC ARVC +P++C C L L+ S
Sbjct: 287 SLLMMGFPSRTLVPNGTTSYCACHNR-PCREGYLCTRCGARVCRIPSECPSCDLTLILST 345
Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-------------QQSLLSSG-- 276
HLARSYHHLFP+ + EV P + + + CF C +S SG
Sbjct: 346 HLARSYHHLFPLRNWIEV-PWA----KATTSAGCFSCLAPFPEPPKNKGHDKSKEDSGAP 400
Query: 277 ------NKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESL 313
++ G Y AC C +HFC++CD++ HE +HNCPGC+SL
Sbjct: 401 KTAKGVSESGRY-ACEVCGQHFCIDCDVFAHEVVHNCPGCQSL 442
>gi|358391363|gb|EHK40767.1| hypothetical protein TRIATDRAFT_78413 [Trichoderma atroviride IMI
206040]
Length = 476
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 200/342 (58%), Gaps = 40/342 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D P+R + AFVRE+F+QNP+SQ+G+V ++DGVA ++D+GG+P H++ L
Sbjct: 111 DLLPTRHRLTLSYAAAFVREYFEQNPISQLGIVGMRDGVAVRISDMGGNPAEHLERLRDL 170
Query: 62 LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G G+ SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI +TI +I
Sbjct: 171 EGQDPQGNPSLQNALEMCRGALFHAPSHGTREVLIIYGALLSSDPGDIHDTIGNLIADRI 230
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
R SV+GLSA++ IC LC T Y+VA+DE+H++EL + PP E + ++
Sbjct: 231 RVSVVGLSAQVAICAELCSRTNAGDESQYNVAMDETHYRELFLAATTPPVNRTKEQSTSS 290
Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
L+ MGFP R A G+ S C+CH + GY C RC A+VC LP++C CGL L+ S H
Sbjct: 291 LLMMGFPSRTLAPGGTTSYCACHSK-PCREGYLCTRCSAKVCRLPSECPACGLTLILSTH 349
Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRST--CFGCQ--------------------Q 270
LARSYHHLFP+ + EV P + + CF CQ +
Sbjct: 350 LARSYHHLFPLRNWVEV-------PWAEATQSIACFSCQTFFPEPPKSIKDKGKDEAPAK 402
Query: 271 SLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
+L ++ G Y C C HFC++CD++ H +HNCPGC+S
Sbjct: 403 ALAKGVSESGRY-KCQVCGNHFCIDCDVFAHMVIHNCPGCQS 443
>gi|196007624|ref|XP_002113678.1| hypothetical protein TRIADDRAFT_57351 [Trichoplax adhaerens]
gi|190584082|gb|EDV24152.1| hypothetical protein TRIADDRAFT_57351 [Trichoplax adhaerens]
Length = 353
Score = 253 bits (647), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 123/268 (45%), Positives = 172/268 (64%), Gaps = 20/268 (7%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
+D +P+R+ K ++ F+ E+FDQNP+SQ+G++ ++ A LTD+GG+P HI AL
Sbjct: 76 IDLKPNRLVATVKLLDDFITEYFDQNPISQLGVIITRNKRAEKLTDIGGNPRRHIAALRQ 135
Query: 61 KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
C G+ SLQNAL+L L +PS+ RE+L++Y++L++CDP DI T++ E
Sbjct: 136 AACKPCIGEPSLQNALELASQTLRHVPSHASREILVIYASLTSCDPSDIHATVKNLVEDN 195
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
IRCS IGLSAEM+I K L +TGGSY V LDE H+KEL+ + PP A + ++LI+M
Sbjct: 196 IRCSAIGLSAEMYILKKLASETGGSYRVILDEHHYKELLTHYVIPPVATGKLG-SSLIRM 254
Query: 179 GFPQRAGEGSISICSCHKEVKV-----GVGYTCPRCKARVCELPTDCRICG--------- 224
GFP+ S+C CH + +V GY CP+CKA+ CELP +C+ICG
Sbjct: 255 GFPEYRTNSHPSMCQCHLDSRVVKSFSTTGYFCPQCKAKYCELPIECQICGKMNITTKTR 314
Query: 225 ---LQLVSSPHLARSYHHLFPIAPFDEV 249
L LVS+PHLARSY HLFP+ F+EV
Sbjct: 315 NMRLSLVSAPHLARSYQHLFPLEDFEEV 342
>gi|47210864|emb|CAF92597.1| unnamed protein product [Tetraodon nigroviridis]
Length = 395
Score = 253 bits (647), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 148/324 (45%), Positives = 195/324 (60%), Gaps = 26/324 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +P+R+ K +E FV E+FDQNP+SQ+G++T K+ A LTDL G+P+ H AL
Sbjct: 76 DLKPNRLTSTLKLMEGFVEEYFDQNPISQMGIITTKNKRAEKLTDLAGNPKKHAAALKKA 135
Query: 62 L--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N L L L +P + REVLI+ S+L+TCDPG+I E IQ K K+
Sbjct: 136 VDSACVGEPSLYNCLSLALQTLRHMPGHTSREVLIILSSLTTCDPGNIYELIQTLKSLKV 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM- 178
R SV+GLSAE+ +C L ++TGGSY V LDESHFKEL+M H PPPA + +LI+M
Sbjct: 196 RVSVVGLSAEVRVCTVLTRETGGSYHVILDESHFKELLMLHVKPPPASCS-SECSLIRMA 254
Query: 179 GFPQRA------GEGSISICSCHKEVKVG------VGYTCPRCKARVCELPTDCRICGLQ 226
GFPQ + S H E G GY CP+C A+ ELP +C++CGL
Sbjct: 255 GFPQHTMASLTDQDAKPSFSMAHLEGGGGGPDLSLGGYFCPQCHAKYTELPVECKVCGLT 314
Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
LV +PHLARS+HHLFP+ F E + + P++R C CQ L K CP
Sbjct: 315 LVLAPHLARSFHHLFPLQVFPESS--AEDPPKDR---FCQACQGRL-----KDKSVFTCP 364
Query: 287 KCKKHFCLECDIYIHESLHNCPGC 310
C FC+ECD++IH+SLH CP C
Sbjct: 365 SCHSVFCVECDLFIHDSLHCCPCC 388
>gi|406863243|gb|EKD16291.1| RNA polymerase TFIIH complex subunit Ssl1 [Marssonina brunnea f.
sp. 'multigermtubi' MB_m1]
Length = 524
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 140/351 (39%), Positives = 205/351 (58%), Gaps = 36/351 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + FV EFF+QNP+SQ+G++ ++DG+A ++++GG+P HI+ + +
Sbjct: 177 DLRPTRYLLTLRFASEFVTEFFEQNPISQLGVLGMRDGIAIRVSEMGGNPTEHIENIKKL 236
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L G+ SLQNAL++ + +L P++G REVLI+ AL T DPGDI ET++ +I
Sbjct: 237 RDLQGQGNPSLQNALEMARAILHYAPAHGTREVLIISGALLTSDPGDIHETVRSLVTDRI 296
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
R SV+GL+A++ I +C T G+Y VAL+E HF+EL+M PP + +
Sbjct: 297 RVSVVGLAAQVKILSVICSKTNAGNEGNYHVALNEQHFRELVMALTTPPVTRTKTQSQPT 356
Query: 175 LIKMGFPQR-AGEG-SISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
L+ MGFP R +G G ++S C+CH ++ G GY C RC ++VC LP DC +CGL LV S H
Sbjct: 357 LLMMGFPSRISGVGKAMSYCACHSKLVRG-GYLCSRCDSKVCSLPADCPVCGLTLVLSTH 415
Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRST--CFGC------------QQSLLSSGNK 278
LARSYHHLFP+ + EV P +++ + CF C + G
Sbjct: 416 LARSYHHLFPLKNWVEV-------PWSQAGKSKACFACLTAFPEVPSHLEKNGAEREGTS 468
Query: 279 PGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRHSNPI-----VANEG 324
C C HFC++CD++ HE +HNCPGC+S I V N+G
Sbjct: 469 ESGRYTCEDCGYHFCIDCDVHAHEVVHNCPGCQSDSQQQQIGIAHEVKNDG 519
>gi|50551723|ref|XP_503336.1| YALI0D26862p [Yarrowia lipolytica]
gi|49649204|emb|CAG81542.1| YALI0D26862p [Yarrowia lipolytica CLIB122]
Length = 496
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 143/334 (42%), Positives = 195/334 (58%), Gaps = 26/334 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D RP+R V FV FFDQNP+SQ+G++ +++G A ++ LG +P HI AL
Sbjct: 112 DLRPNRYQFVINHAIEFVTNFFDQNPISQLGILGMRNGQAISISTLGSNPNDHINALTAA 171
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KL GD SLQNAL++ +GLL +PS+ +EVL++ AL + DPGDI TI K K+
Sbjct: 172 KKLEPQGDPSLQNALEMARGLLFHVPSHCTKEVLVVLGALLSADPGDIHVTIDKLVIDKV 231
Query: 120 RCSVIGLSAEMFICKHLCQDT----GGSYSVALDESHFKELIMEHAPPPPAIAEFAIAN- 174
R VIGL+A++ ICK LC T Y V L+E HF+EL+ E P + AN
Sbjct: 232 RARVIGLAAQVAICKELCVKTNFGDASYYGVVLNEQHFQELMDEATTPLAESQQSQSANP 291
Query: 175 --LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
L+ MGFP + E + S+ + + G GY CP+CK +V LPT C CGL L+ S H
Sbjct: 292 ASLVLMGFPSKVSEAAPSLSASDAALTQG-GYVCPQCKVKVSSLPTVCPCCGLTLILSTH 350
Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQ----------QSLLSSGNKPGLY 282
LARSYHHLFP+APF EV C N ++ C GCQ +++ + G+ Y
Sbjct: 351 LARSYHHLFPLAPFIEVP--CKNAHKSE---FCAGCQSKFPVVARDAKNIQADGSMTSRY 405
Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGCESLRHS 316
CP C HFC++CD++ HE LHNC GC++ +S
Sbjct: 406 -ECPTCHSHFCIDCDVFCHEILHNCIGCQARSYS 438
>gi|340518378|gb|EGR48619.1| predicted protein [Trichoderma reesei QM6a]
Length = 480
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 147/340 (43%), Positives = 196/340 (57%), Gaps = 39/340 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D P+R + AFVRE+F+QNP+SQ+ +V ++DGVA ++D+GG+P H++ L
Sbjct: 113 DLLPTRYRLALSYAAAFVREYFEQNPISQLAIVGMRDGVALRVSDMGGNPAEHLERLREY 172
Query: 62 LGC--SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SG+ SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI +TI +I
Sbjct: 173 EGQDPSGNPSLQNALEMCRGALFHAPSHGTREVLIIYGALLSSDPGDIHDTINNLVAERI 232
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
R SV+GLSA++ IC LC T Y+VA+DE+HFKEL + PP E + A+
Sbjct: 233 RVSVVGLSAQVAICAELCSRTNVGDDSQYNVAMDETHFKELFLALTTPPVNRTKEQSTAS 292
Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
L+ MGFP R A G+ S C+CH + GY C RC +VC LP++C CGL L+ S H
Sbjct: 293 LLMMGFPSRTLAPGGTSSYCACHSQ-PCREGYLCTRCGVKVCRLPSECPACGLTLILSTH 351
Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGCQQSLLS--SGNK----------- 278
LARSYHHLFP+ + EV P +RST CF CQ NK
Sbjct: 352 LARSYHHLFPLRNWVEV-PWA-----EAARSTACFSCQSPFPEPPKANKEKGKDEAAPAK 405
Query: 279 --------PGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
G Y C C HFC++CD++ H +HNCPGC
Sbjct: 406 APAKGVSESGRY-KCQVCDHHFCIDCDVFAHMVIHNCPGC 444
>gi|296817267|ref|XP_002848970.1| suppressor of stem-loop protein 1 [Arthroderma otae CBS 113480]
gi|238839423|gb|EEQ29085.1| suppressor of stem-loop protein 1 [Arthroderma otae CBS 113480]
Length = 484
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 134/334 (40%), Positives = 199/334 (59%), Gaps = 31/334 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV EFF+QNP+SQ+G++ +KDGVA ++++ G+P HI AL +
Sbjct: 125 DLRPTRYLLTLRYAQKFVLEFFEQNPISQLGIMGMKDGVAVQVSEMSGNPTEHILALHAL 184
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQNAL++ +G L P++G REV+I+ AL + DPGDI +TI K+
Sbjct: 185 RSEDPKGLPSLQNALEMARGSLYHTPTHGTREVIIILGALLSSDPGDIRQTITSLVADKV 244
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAE-FAIAN 174
R +IGL+A++ IC+ LC T SY VA+DE H +L+++ PP ++ ++ ++
Sbjct: 245 RVGIIGLAADVAICRELCAKTNDGNDSSYGVAIDEKHLWDLLLDVTTPPVTYSQRYSSSS 304
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S+C+CH + G GY C RC ++VC LP +C C L L+ S HLA
Sbjct: 305 LLMMGFPSRTVEPFPSLCACHSKPVRG-GYMCSRCGSKVCSLPAECPTCNLTLILSTHLA 363
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRS--TCFGC----------QQSLLSSGNKPGLY 282
RSYHHLFP+ + EV P R+ +CF C ++ G+ G+
Sbjct: 364 RSYHHLFPLINWVEV-------PWKRASKSVSCFACGNVFPPVPPREKWDRQEGSVKGMS 416
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
V+ C C HFC++CD++ HE +HNCPGC+S
Sbjct: 417 VSSRYECTVCHHHFCIDCDVFAHEIVHNCPGCQS 450
>gi|407917809|gb|EKG11111.1| von Willebrand factor type A [Macrophomina phaseolina MS6]
Length = 481
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 138/354 (38%), Positives = 198/354 (55%), Gaps = 44/354 (12%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF+P+R A V K + +VREFF+QNP+SQ+ ++T+ DG+ N +++L G+P HI L
Sbjct: 106 DFKPTRYAAVLKYAQEYVREFFEQNPISQLSVMTMYDGLCNRVSELSGNPNDHISPLQRM 165
Query: 62 LGCS--------GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQK 113
G SLQNAL+ + L PS+G REV+I+ AL + DPGDI +TI
Sbjct: 166 RDTRNPNYQDPRGSPSLQNALEQARAALYHTPSHGTREVIIILGALLSLDPGDIFKTIAN 225
Query: 114 CKESKIRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP----- 164
C ++ +R ++IG+ + IC+ +C T G +Y VA+D+ HF++L++ PP
Sbjct: 226 CVKNNVRVNIIGMGGRLKICQEICSRTNGGDESAYVVAVDQLHFRDLLLATTTPPVIRQT 285
Query: 165 PAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICG 224
A A A+L+KMGFP R E + +IC+CH + G GY C +C+A+VC LPT C C
Sbjct: 286 TAQAAANPASLLKMGFPSRVEESAPTICACHGTLTRG-GYLCSQCQAKVCSLPTTCPSCN 344
Query: 225 LQLVSSPHLARSYHHLFPIAPFDEVT--------------------PLCLNDPRNRSRST 264
L L+ S HLARSYHHLFP+ + EVT PL + +T
Sbjct: 345 LTLILSTHLARSYHHLFPLKNWREVTWERARQVGSTECKSCLTPFPPLPARSAAAKKAAT 404
Query: 265 CFGCQQSLLSSGNKPGLYVA------CPKCKKHFCLECDIYIHESLHNCPGCES 312
G Q + N G+ + C C+ HFC++CD+Y HE +HNCPGC S
Sbjct: 405 ANGEQHGAPGTKNTEGVGASESSRYECEDCRNHFCVDCDVYCHEIVHNCPGCLS 458
>gi|389623489|ref|XP_003709398.1| RNA polymerase TFIIH complex subunit Ssl1 [Magnaporthe oryzae
70-15]
gi|351648927|gb|EHA56786.1| RNA polymerase TFIIH complex subunit Ssl1 [Magnaporthe oryzae
70-15]
gi|440469566|gb|ELQ38674.1| suppressor of stem-loop protein 1 [Magnaporthe oryzae Y34]
gi|440482863|gb|ELQ63315.1| suppressor of stem-loop protein 1 [Magnaporthe oryzae P131]
Length = 516
Score = 250 bits (638), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 146/378 (38%), Positives = 203/378 (53%), Gaps = 76/378 (20%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D P+R + FVRE+FDQNP+SQ+ +V ++D +A ++DL G+P HI+ L +
Sbjct: 114 DLMPTRHRLTISYAIEFVREYFDQNPISQLAIVGMRDSIAVRISDLSGNPAEHIEKLRAL 173
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L SG SLQNAL++ +G L PS+G REV+I++ AL + DPGDI ETI + +I
Sbjct: 174 PDLDPSGSPSLQNALEMCRGALFNTPSHGTREVVIIFGALLSLDPGDIHETIDGLMQDRI 233
Query: 120 RCSVIGLSAEMFICKHLCQDT----------------GGSYSVALDESHFKELIMEHAPP 163
R SVIGL+A+++IC+ +C+ T G +Y+VAL E HF+EL+M P
Sbjct: 234 RVSVIGLAAQVYICEEICRRTNSGKAAADLGIVTDTDGSAYNVALHEVHFRELLMSATRP 293
Query: 164 PPAI------AEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP 217
P A A+L+ MGFP R + ++CSCH V G+ C RC +RVC LP
Sbjct: 294 PVTRQSQLQQATGETASLLMMGFPSRKTDKDPALCSCHNR-PVRNGFLCTRCGSRVCRLP 352
Query: 218 TDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRS-RS-TCFGCQQSLLSS 275
++C CGL L+ S HLARSYHHLFP+ + EV P N + RS +CF C +
Sbjct: 353 SECPACGLTLILSTHLARSYHHLFPLKIWSEV-------PWNDAVRSVSCFACHAPFPDA 405
Query: 276 GNK-----------------------------------------PGLYVACPKCKKHFCL 294
NK G Y+ C C+ HFC+
Sbjct: 406 KNKDADKGKDAEKDPGDKHLKVPSKDTNGKKKKPAAPVPKGVSESGRYM-CQVCRHHFCI 464
Query: 295 ECDIYIHESLHNCPGCES 312
+CD++ HE LHNCPGC+S
Sbjct: 465 DCDVFSHEVLHNCPGCQS 482
>gi|430811329|emb|CCJ31252.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 319
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 133/323 (41%), Positives = 194/323 (60%), Gaps = 26/323 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D PSR V Q +FV E+F+QNPL+Q+G++ ++DG A ++ GG+P H+ L +
Sbjct: 5 DLIPSRYVVSIDQASSFVSEYFEQNPLAQLGIIGMRDGRAISISPYGGNPHEHLMTLSRL 64
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
++ SGD+SLQNAL++ + +L +P +G REVL+++ +L++ DPGDI ETI E KI
Sbjct: 65 KQMDPSGDASLQNALEMARTILYPVPKHGTREVLLIFGSLTSFDPGDIYETINVLIEEKI 124
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEFA-IAN 174
R VIGLSA M +C+ +C++T SY + L+E HF +L+M+ PP +
Sbjct: 125 RVYVIGLSASMSVCQQICKETNAGDESSYGIVLNEQHFHDLLMQFVIPPALKKSHKFLET 184
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
LI MGFP + E ++C+CH +V + GY CPRC ++VC LP +C C L L+ S HLA
Sbjct: 185 LIMMGFPSKLIENDETLCACHSDV-IKEGYLCPRCFSKVCSLPINCPCCDLTLILSTHLA 243
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGCQ---------QSLLSSGNKPGLYVA 284
RSYHHLFP+ + EV P + ST C+ C S+ S + Y A
Sbjct: 244 RSYHHLFPLKNWIEV-------PWEEAYSTHCYACLIPFPEKHEVTSITHSVSISSRY-A 295
Query: 285 CPKCKKHFCLECDIYIHESLHNC 307
C +C KHFC++CD++ HE L C
Sbjct: 296 CSECHKHFCIDCDVFAHEILFEC 318
>gi|346326021|gb|EGX95617.1| TFIIH basal transcription factor complex, subunit SSL1 [Cordyceps
militaris CM01]
Length = 508
Score = 249 bits (637), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 145/352 (41%), Positives = 195/352 (55%), Gaps = 47/352 (13%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D P+R + + AFVREFF+QNP+SQ+G+V ++DGVA ++DLGG+P H++ L +
Sbjct: 135 DMLPTRHRLTLRYAAAFVREFFEQNPISQLGIVGMRDGVAVRVSDLGGNPAEHLERLKAL 194
Query: 60 GKLGCSGDSSLQNALDLVQG---------LLSQIPSYGHREVLILYSALSTCDPGDIMET 110
G+ SLQNAL++ +G S PS+G REVLI+Y AL + DPGDI +T
Sbjct: 195 ESQDPQGNPSLQNALEMCRGCNANEFNVHTNSHAPSHGTREVLIIYGALLSSDPGDIHDT 254
Query: 111 IQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPP-P 165
+ +IR S++GLSA + IC LC T Y VA DE HF+EL + PP
Sbjct: 255 LASLINDRIRVSIVGLSAHLSICAELCARTNAGDTSQYGVATDEVHFRELFLAATTPPVT 314
Query: 166 AIAEFAIANLIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRIC 223
E + A+L+ MGFP R A G+ S C+CH G+ C RC RVC LP +C C
Sbjct: 315 RTKEQSTASLLMMGFPSRVLAPGGATSFCACHNR-PCREGFLCTRCGTRVCRLPAECPAC 373
Query: 224 GLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQ-------------- 269
L L+ S HLARSYHHLFP+ + EV+ D RS CF CQ
Sbjct: 374 ALTLILSTHLARSYHHLFPLRNWVEVS---WADAVGRS-PACFACQCPFPAAAATGGDAS 429
Query: 270 ---------QSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
+ + ++ G Y AC C HFC++CD+Y HE +HNCPGC+S
Sbjct: 430 AREEGQGKKRPTVKGVSESGRY-ACEVCGNHFCIDCDVYAHEVIHNCPGCQS 480
>gi|297603026|ref|NP_001053274.2| Os04g0508900 [Oryza sativa Japonica Group]
gi|255675611|dbj|BAF15188.2| Os04g0508900, partial [Oryza sativa Japonica Group]
Length = 173
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 115/150 (76%), Positives = 137/150 (91%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MD+RPSRMAVVAK E F+REFFDQNPLS +G+VT+KDG+++ LT++GGSPES IKALMG
Sbjct: 5 MDYRPSRMAVVAKYAEVFIREFFDQNPLSHVGIVTMKDGISHRLTEIGGSPESQIKALMG 64
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
KL CSG+ SLQNAL+LV G L Q+PSYGH+EVL LYSAL+TCDPGDIMETI KCK+SKIR
Sbjct: 65 KLECSGEPSLQNALELVHGYLDQVPSYGHKEVLFLYSALNTCDPGDIMETIAKCKKSKIR 124
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDE 150
CSVIGL+AE+FICK+LC++TGGSY+VALDE
Sbjct: 125 CSVIGLAAEIFICKYLCEETGGSYTVALDE 154
>gi|326468925|gb|EGD92934.1| RNA polymerase TFIIH complex subunit Ssl1 [Trichophyton tonsurans
CBS 112818]
gi|326480095|gb|EGE04105.1| RNA polymerase TFIIH complex subunit Ssl1 [Trichophyton equinum CBS
127.97]
Length = 489
Score = 249 bits (635), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 134/334 (40%), Positives = 199/334 (59%), Gaps = 31/334 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV+EFF+QNP+SQ+G++ +KDG+A ++++ G+P H+ AL +
Sbjct: 126 DLRPTRYLLTLRYAQKFVQEFFEQNPISQLGIMGMKDGIAVQISEMSGNPTEHMLALHAL 185
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQNAL++ +G L PS+G REV+I+ AL + DPGDI +TI K+
Sbjct: 186 RAEDPKGLPSLQNALEMARGSLYHTPSHGTREVIIILGALLSSDPGDIHQTITSLVADKV 245
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAE-FAIAN 174
R +IGL+A++ IC+ +C T SY VA+DE H +L+++ PP ++ ++ +
Sbjct: 246 RVGIIGLAADVAICREICAKTNDGNDSSYGVAMDEKHLWDLLLDVTTPPVTYSQRYSSNS 305
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S+C+CH + G GY C RC ++VC LP +C C L L+ S HLA
Sbjct: 306 LLMMGFPSRTVEPFPSLCACHSKPVRG-GYMCSRCGSKVCSLPAECPTCNLTLILSTHLA 364
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNR-SRS-TCFGCQQSL----------LSSGNKPGLY 282
RSYHHLFP+ + EV P R SRS +CF C + + G+
Sbjct: 365 RSYHHLFPLINWVEV-------PWKRASRSVSCFACGNPFPAVPPRDTWDMRDRSIKGMS 417
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
V+ C C HFC++CD++ HE +HNCPGC+S
Sbjct: 418 VSRRYECTVCHHHFCIDCDVFAHEVVHNCPGCQS 451
>gi|327301499|ref|XP_003235442.1| RNA polymerase TFIIH complex subunit Ssl1 [Trichophyton rubrum CBS
118892]
gi|326462794|gb|EGD88247.1| RNA polymerase TFIIH complex subunit Ssl1 [Trichophyton rubrum CBS
118892]
Length = 489
Score = 249 bits (635), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 134/334 (40%), Positives = 199/334 (59%), Gaps = 31/334 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV+EFF+QNP+SQ+G++ +KDG+A ++++ G+P H+ AL +
Sbjct: 126 DLRPTRYLLTLRYAQKFVQEFFEQNPISQLGIMGMKDGIAVQISEMSGNPTEHMLALHAL 185
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQNAL++ +G L PS+G REV+I+ AL + DPGDI +TI K+
Sbjct: 186 RAEDPKGLPSLQNALEMARGSLYHTPSHGTREVIIILGALLSSDPGDIHQTITSLVADKV 245
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAE-FAIAN 174
R +IGL+A++ IC+ +C T SY VA+DE H +L+++ PP ++ ++ +
Sbjct: 246 RVGIIGLAADVAICREICAKTNDGNDSSYGVAIDEKHLWDLLLDVTTPPVTYSQRYSSNS 305
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S+C+CH + G GY C RC ++VC LP +C C L L+ S HLA
Sbjct: 306 LLMMGFPSRTVEPFPSLCACHSKPVRG-GYMCSRCGSKVCSLPAECPTCNLTLILSTHLA 364
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNR-SRS-TCFGCQQSL----------LSSGNKPGLY 282
RSYHHLFP+ + EV P R SRS +CF C + + G+
Sbjct: 365 RSYHHLFPLINWVEV-------PWKRASRSVSCFACGNPFPAVPPRDKWDMRDRSIKGMS 417
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
V+ C C HFC++CD++ HE +HNCPGC+S
Sbjct: 418 VSSRYECTVCHHHFCIDCDVFAHEVVHNCPGCQS 451
>gi|302656432|ref|XP_003019969.1| hypothetical protein TRV_05937 [Trichophyton verrucosum HKI 0517]
gi|291183747|gb|EFE39345.1| hypothetical protein TRV_05937 [Trichophyton verrucosum HKI 0517]
Length = 515
Score = 249 bits (635), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 134/334 (40%), Positives = 199/334 (59%), Gaps = 31/334 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV+EFF+QNP+SQ+G++ +KDG+A ++++ G+P H+ AL +
Sbjct: 151 DLRPTRYLLTLRYAQKFVQEFFEQNPISQLGIMGMKDGIAVQISEMSGNPTEHMLALHAL 210
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQNAL++ +G L PS+G REV+I+ AL + DPGDI +TI K+
Sbjct: 211 RAEDPKGLPSLQNALEMARGSLYHTPSHGTREVIIILGALLSSDPGDIHQTITSLVADKV 270
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAE-FAIAN 174
R +IGL+A++ IC+ +C T SY VA+DE H +L+++ PP ++ ++ +
Sbjct: 271 RVGIIGLAADVAICREICAKTNDGNDSSYGVAIDEKHLWDLLLDVTTPPVTYSQRYSSNS 330
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S+C+CH + G GY C RC ++VC LP +C C L L+ S HLA
Sbjct: 331 LLMMGFPSRTVEPFPSLCACHSKPVRG-GYMCSRCGSKVCSLPAECPTCNLTLILSTHLA 389
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNR-SRS-TCFGCQQSL----------LSSGNKPGLY 282
RSYHHLFP+ + EV P R SRS +CF C + + G+
Sbjct: 390 RSYHHLFPLINWVEV-------PWKRASRSVSCFACGNPFPAVPPRDKWDMRDRSIKGMS 442
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
V+ C C HFC++CD++ HE +HNCPGC+S
Sbjct: 443 VSSRYECTVCHHHFCIDCDVFAHEVVHNCPGCQS 476
>gi|341038582|gb|EGS23574.1| putative transcription factor [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 534
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 144/390 (36%), Positives = 203/390 (52%), Gaps = 83/390 (21%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--- 58
D P+R + FVRE+F+QNP+SQ+G++ ++DG+A ++D+ G+P HI+ L
Sbjct: 118 DLLPNRYLLTLNYAVDFVREYFEQNPISQMGIIAMRDGIAVRVSDMSGNPADHIERLRFW 177
Query: 59 MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
G+ SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI ETI + +
Sbjct: 178 AEHQEPQGNPSLQNALEMCRGALYHTPSHGTREVLIVYGALLSSDPGDIHETISNLVKDR 237
Query: 119 IRCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAEFAI-- 172
IR +V+GL+A++ +C LC T +Y+VAL E HF+EL + PPPA A A
Sbjct: 238 IRVTVVGLAAQVAVCAELCTRTNHGDDSTYAVALHEQHFRELFLAATIPPPATASSATDK 297
Query: 173 -------------ANLIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELP 217
A+L+ MGFP R A +S+C+CH GY C RC+A+VC LP
Sbjct: 298 NGANGNANAASTDASLLMMGFPSRTLASASHVSLCACHSRPSRE-GYLCTRCRAKVCRLP 356
Query: 218 TDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC--------- 268
+C CGL L+ S HLARSYHHLFP+ + EV+ + R + CF C
Sbjct: 357 AECPACGLTLILSTHLARSYHHLFPLKGWVEVS---WAEARKSKQVGCFACLAPFPLPPA 413
Query: 269 ----------------------QQSLLSSGNK------PGLYVA---------------- 284
Q+ SS N G+ +A
Sbjct: 414 PGSEKTGKEPTQKTQGQAQQPPQERQGSSSNSNNAKKTTGISLATALPEARAVGVSESGR 473
Query: 285 --CPKCKKHFCLECDIYIHESLHNCPGCES 312
CP C KHFC++CD++ HE +HNCPGC++
Sbjct: 474 YKCPTCGKHFCIDCDVFAHEVIHNCPGCQA 503
>gi|302506370|ref|XP_003015142.1| hypothetical protein ARB_06903 [Arthroderma benhamiae CBS 112371]
gi|291178713|gb|EFE34502.1| hypothetical protein ARB_06903 [Arthroderma benhamiae CBS 112371]
Length = 490
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 134/334 (40%), Positives = 198/334 (59%), Gaps = 31/334 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV+EFF+QNP+SQ+G+ +KDG+A ++++ G+P H+ AL +
Sbjct: 126 DLRPTRYLLTLRYAQKFVQEFFEQNPISQLGITGMKDGIAVQISEMSGNPTEHMLALHAL 185
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQNAL++ +G L PS+G REV+I+ AL + DPGDI +TI K+
Sbjct: 186 RAEDPKGLPSLQNALEMARGSLYHTPSHGTREVIIILGALLSSDPGDIHQTITSLVADKV 245
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAE-FAIAN 174
R +IGL+A++ IC+ +C T SY VA+DE H +L+++ PP ++ ++ +
Sbjct: 246 RVGIIGLAADVAICREICAKTNDGNDSSYGVAIDEKHLWDLLLDVTTPPVTYSQRYSSNS 305
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S+C+CH + G GY C RC ++VC LP +C C L L+ S HLA
Sbjct: 306 LLMMGFPSRTVEPFPSLCACHSKPVRG-GYMCSRCGSKVCSLPAECPTCNLTLILSTHLA 364
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNR-SRS-TCFGCQQSL----------LSSGNKPGLY 282
RSYHHLFP+ + EV P R SRS +CF C + + G+
Sbjct: 365 RSYHHLFPLINWVEV-------PWKRASRSVSCFACGNPFPAVPPRDKWDMRDRSIKGMS 417
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
V+ C C HFC++CD++ HE +HNCPGC+S
Sbjct: 418 VSSRYECTVCHHHFCIDCDVFAHEVVHNCPGCQS 451
>gi|291224884|ref|XP_002732433.1| PREDICTED: general transcription factor IIH, polypeptide 2, 44kD
subunit-like [Saccoglossus kowalevskii]
Length = 593
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 127/267 (47%), Positives = 175/267 (65%), Gaps = 17/267 (6%)
Query: 15 VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG--KLGCSGDSSLQN 72
+E F+ E+FD NP+SQ+G++T + AN LT+LGG+P+ HI AL C + SLQN
Sbjct: 59 LEHFLEEYFDHNPISQVGILTTSNKRANRLTELGGNPKRHIAALQKCVDKQCLKEPSLQN 118
Query: 73 ALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFI 132
AL+L L +P + REVL++ +L+TCDPG+I +TI+ K+ IRCSVIGLSA++ +
Sbjct: 119 ALELAATTLRHVPGHASREVLVILGSLTTCDPGNIHDTIKSMKDLNIRCSVIGLSADVRV 178
Query: 133 CKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ---RAGEGSI 189
CK LC +T GSYSV LDE HFKE+++EH PP A A ++LI+MGFPQ A + +
Sbjct: 179 CKRLCSETQGSYSVILDEPHFKEVLLEHIKPPLATASTE-SSLIRMGFPQHQLHAEKDKL 237
Query: 190 ---SICSCHKEVKVGV-----GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
S+C CH + K GY CP+CK++ CELP +C+ CGL LVS+PHLARS+HHLF
Sbjct: 238 EKPSMCMCHLDSKSTQGFSTGGYFCPQCKSKYCELPIECKACGLTLVSAPHLARSFHHLF 297
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGC 268
P+ F E+ L+D + C C
Sbjct: 298 PLESFQEIQ---LSDLQESDSRVCLSC 321
>gi|341877728|gb|EGT33663.1| hypothetical protein CAEBREN_16370 [Caenorhabditis brenneri]
Length = 376
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 136/311 (43%), Positives = 187/311 (60%), Gaps = 20/311 (6%)
Query: 5 PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
PSR VV K +++F+ FF+QNP++QIGL+T KD A LT + G+ +AL M +
Sbjct: 82 PSRFVVVMKALQSFLERFFEQNPIAQIGLITCKDRKAERLTMMTGNIRVLKEALNSMSEA 141
Query: 63 GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
C GD SLQNAL L L +P + REVLI+ +ALS+ DPG+I TI+ K IRCS
Sbjct: 142 FCGGDFSLQNALQLACANLKGMPGHVSREVLIVMAALSSIDPGNIFSTIETMKRMNIRCS 201
Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
IGLSAEMFICK + + T G YSVALD H + L +H PP + A+ + N I +GFP
Sbjct: 202 AIGLSAEMFICKEMAKATKGEYSVALDPDHLQLLFSKHTLPPSS-AKSSECNAIHVGFPH 260
Query: 183 RAGEGSISICSCHKEVK--VGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
+ S C CH + K G+ C +C AR C +P +C +C L LV++P LAR++ HL
Sbjct: 261 HELIKTRSFCVCHPDSKPISSRGFICTQCGARHCSIPAECPVCKLTLVAAPQLARAFRHL 320
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
P+A F ++ + +R +C+ C+ L S G + C KC+ FCL+CD +
Sbjct: 321 QPLAAFQQI---------DVTRGSCYACETRLTSEGFR------CEKCRLVFCLDCDTLL 365
Query: 301 HESLHNCPGCE 311
HESLH CPGC+
Sbjct: 366 HESLHVCPGCK 376
>gi|443898092|dbj|GAC75430.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH, subunit SSL1 [Pseudozyma antarctica
T-34]
Length = 498
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 129/342 (37%), Positives = 190/342 (55%), Gaps = 37/342 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R V + FV E+FDQNP+ Q+ ++ ++G+A L +GG+ H +L K
Sbjct: 160 DMRPNRFDVTLQYAREFVNEYFDQNPIGQLSVIGTREGIAERLAMMGGNTVDHTASLSNK 219
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L G+ SLQNAL++ + L +P+ RE+L ++ +L+TCDPG+I +TI + I
Sbjct: 220 RRLEPRGEPSLQNALEMARSSLVHLPASNSREILAIFGSLTTCDPGNIHDTINTLVKDNI 279
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIA----------- 168
R SV+ L+AE+ + K +C TGG +SVAL+E HF + + + PPP
Sbjct: 280 RVSVVHLAAEVKVFKDVCTRTGGVFSVALNEGHFHDALFDLVPPPAVDGPRKGKRRAGVQ 339
Query: 169 -EFAIAN---LIKMGFPQR-AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRIC 223
E + N L++M FP R + ++C+CH + G GY CPRC +VC++PTDC +C
Sbjct: 340 EEEEVQNGVDLLQMAFPLRLPAHAAPTLCACHSRSR-GSGYLCPRCGVKVCDVPTDCPVC 398
Query: 224 GLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC--------------- 268
G+ +V S HLARSYHHLFP+ + V + D R+ CF C
Sbjct: 399 GITIVMSTHLARSYHHLFPVPNWKAVAWEAVGD---RAEDACFSCNLQFPKKDDNRTAAA 455
Query: 269 QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
++L +G P C +C FCLECD ++HE LH CPGC
Sbjct: 456 NKALEDAGLSPSSRYRCTRCNHDFCLECDAFVHEQLHVCPGC 497
>gi|339258242|ref|XP_003369307.1| general transcription factor IIH subunit 2 [Trichinella spiralis]
gi|316966476|gb|EFV51048.1| general transcription factor IIH subunit 2 [Trichinella spiralis]
Length = 403
Score = 247 bits (631), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 136/324 (41%), Positives = 190/324 (58%), Gaps = 23/324 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESH---IKAL 58
D PSR V K + FV +FFDQNP+SQI ++T KD + L D GS H +KA+
Sbjct: 88 DLLPSRFISVIKALSLFVDDFFDQNPISQISIITAKDKKTDKLVDFTGSARKHKEFLKAI 147
Query: 59 MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
++ SG+ SLQN+L+ +L +PS+ REVL++ +LSTCDPGDI E+++ K +
Sbjct: 148 AEEIP-SGEFSLQNSLETANEMLRHMPSHSSREVLVIMGSLSTCDPGDIEESLEILKRNN 206
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
IRC+ I LSAEM+ICK + + T G Y+V +DE K L+ A PP A + ANLI+M
Sbjct: 207 IRCNFIALSAEMYICKRIAKATNGKYAVIIDEDDLKTLLSVFALPPVATTQLG-ANLIRM 265
Query: 179 GF--------PQRAGEGSISICSCHKEVKVG----VGYTCPRCKARVCELPTDCRICGLQ 226
G ++ G I H E +G VG+ CP+C + C +P +C+ICGL
Sbjct: 266 GNSDFRRGLKTRKVGHFVFGIHMKHGE-GIGKLTCVGFICPQCNFKYCFVPMECQICGLV 324
Query: 227 LVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACP 286
L S+PHLARSY HL+PI PF+E D + C GC ++ YV CP
Sbjct: 325 LASAPHLARSYQHLYPILPFEEKAI----DKHMKKELYCAGCFSAIEVKVMFDVSYV-CP 379
Query: 287 KCKKHFCLECDIYIHESLHNCPGC 310
C++ FC+ CD+ IHE++H+CPGC
Sbjct: 380 GCRQDFCINCDLLIHETMHSCPGC 403
>gi|358377786|gb|EHK15469.1| hypothetical protein TRIVIDRAFT_56425 [Trichoderma virens Gv29-8]
Length = 402
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 142/341 (41%), Positives = 198/341 (58%), Gaps = 38/341 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D P+R + AFVRE+F+QNP+SQ+ +V ++DGVA ++D+GG+P H++ L
Sbjct: 44 DLLPTRYRLTLSYAAAFVREYFEQNPISQLAIVGMRDGVAVRVSDMGGNPAEHLERLRDL 103
Query: 62 LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G G+ SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI +TI +I
Sbjct: 104 DGQDPQGNPSLQNALEMCRGALFHAPSHGTREVLIIYGALLSSDPGDIHDTISNLIAERI 163
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIME-HAPPPPAIAEFAIAN 174
R SV+GLSA++ IC LC T Y+VA+DE+HFK+L + PP E + ++
Sbjct: 164 RVSVVGLSAQVAICAELCSRTNVGDDSQYNVAMDETHFKDLFLAITTPPVNRTKEQSTSS 223
Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
L+ MGFP R A G+ S C+CH + GY C RC +VC LP++C CGL L+ S H
Sbjct: 224 LLMMGFPSRTLAPGGTTSYCACHSK-PCREGYLCTRCGVKVCRLPSECPACGLTLILSTH 282
Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRS-TCFGCQ--------------------QS 271
LARSYHHLFP+ + EV P +RS CF CQ ++
Sbjct: 283 LARSYHHLFPLRNWVEV-PWA-----EATRSIACFSCQCPFPEPPKPNKEKGKDEAPTKA 336
Query: 272 LLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
++ G Y C C HFC++CD++ H +HNCPGC+S
Sbjct: 337 PAKGVSESGRY-KCQVCGNHFCIDCDVFAHMVIHNCPGCQS 376
>gi|315049235|ref|XP_003173992.1| stem-loop protein 1 suppressor [Arthroderma gypseum CBS 118893]
gi|311341959|gb|EFR01162.1| stem-loop protein 1 suppressor [Arthroderma gypseum CBS 118893]
Length = 487
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 133/334 (39%), Positives = 200/334 (59%), Gaps = 31/334 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV+EFF+QNP+SQ+G++ +KDG+A ++++ G+P H+ AL +
Sbjct: 126 DLRPTRYLLTLRYAQKFVQEFFEQNPISQLGIMGMKDGIAVQISEMSGNPTEHMLALHAL 185
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQNAL++ +G L PS+G REV+I+ AL + DPGDI +TI K+
Sbjct: 186 RSEDPKGLPSLQNALEMARGSLYHTPSHGTREVIIILGALLSSDPGDIHQTITSLVADKV 245
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPAIAE-FAIAN 174
R +IGL+A++ +C+ +C T SY VA+DE H +L+++ PP ++ ++ ++
Sbjct: 246 RVGIIGLAADVAVCREICAKTNDGNDSSYGVAIDEKHLWDLLLDVTTPPITYSQRYSSSS 305
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E S+C+CH + G GY C RC ++VC LP +C C L L+ S HLA
Sbjct: 306 LLMMGFPSRTVEPFPSLCACHSKPVRG-GYMCSRCGSKVCSLPAECPTCNLTLILSTHLA 364
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNR-SRS-TCFGCQQSLLS----------SGNKPGLY 282
RSYHHLFP+ + EV P R SRS +CF C + + G+
Sbjct: 365 RSYHHLFPLINWVEV-------PWKRASRSISCFACGNLFPAVPPRDKWDKRDKSIKGMS 417
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
V+ C C HFC++CD++ HE +HNCPGC+S
Sbjct: 418 VSSRYECTVCHHHFCIDCDVFAHEVVHNCPGCQS 451
>gi|389741483|gb|EIM82671.1| Ssl1-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 382
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 117/261 (44%), Positives = 177/261 (67%), Gaps = 14/261 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + FV E+FDQNPL Q+G+V ++ G+ + ++ G+P+ +KAL
Sbjct: 112 DLRPTRFDLSLEYAREFVTEWFDQNPLGQVGVVGMRAGLGERVGEMSGNPQDVLKALSER 171
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KL +G+ SLQNA+++ + +S +P++ RE+L+++ +L+T DPG+I +T+Q C + KI
Sbjct: 172 HKLEPTGEPSLQNAIEMARSSMSHLPTHSSREILVIFGSLTTVDPGNIYDTLQSCIKDKI 231
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP--AIAEFA------ 171
R S++ L+AEM IC+ LC TGG + VA++E HFK+L+ E PPP AIA
Sbjct: 232 RISIVSLAAEMKICRELCDKTGGQFGVAMNEGHFKDLLFELIPPPAQRAIARTTGGPGAA 291
Query: 172 --IANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
A+L+ MGFP R + S+ S+C CH E++ G+ CPRC ++VC++PTDC +CGL +V
Sbjct: 292 NPAADLMMMGFPTRLPDTSLPSLCVCHAELR-SAGFLCPRCMSKVCDVPTDCDVCGLMIV 350
Query: 229 SSPHLARSYHHLFPIAPFDEV 249
SSPHLARSYHHLFP+ P+ V
Sbjct: 351 SSPHLARSYHHLFPVKPYAAV 371
>gi|164662116|ref|XP_001732180.1| hypothetical protein MGL_0773 [Malassezia globosa CBS 7966]
gi|159106082|gb|EDP44966.1| hypothetical protein MGL_0773 [Malassezia globosa CBS 7966]
Length = 349
Score = 246 bits (628), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 131/348 (37%), Positives = 192/348 (55%), Gaps = 43/348 (12%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D RP+R + + FV ++FDQNP+ Q+ ++ +DG+A L+ +GG+ H L
Sbjct: 5 DLRPTRFDLTLQLARQFVADYFDQNPIGQLAILCARDGLAERLSLMGGNTFDHGAVLSSR 64
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KL G+ S+QNAL++ L+ +PS RE+L + ++L++ DPG+I TI K + I
Sbjct: 65 RKLEPRGEPSIQNALEMALSSLTHLPSSNTREILFISASLTSVDPGNIYHTIDKLVDEHI 124
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIA------ 173
+ SVI L+AEM + K C TGG ++V ++E H+KEL+M+H PP I E + +
Sbjct: 125 QLSVISLAAEMHVLKDACARTGGDFNVVMNEDHYKELLMQHV-PPRVITEASSSSHLAHS 183
Query: 174 ---NLIKMGFPQRAG-EGSISICSCHKEVKVG-------------VGYTCPRCKARVCEL 216
+L+ MGFP+R S C+CH + G GYTCPRC ++VC++
Sbjct: 184 NECDLLVMGFPRRIPFHTPASFCACHGRLVTGSRSLSYADRGAAPAGYTCPRCASKVCQV 243
Query: 217 PTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-------- 268
PTDC CG+ ++ S HLARSYHHLFP+ ++ P+ S S CF C
Sbjct: 244 PTDCPTCGITIIMSTHLARSYHHLFPVQNYE---PVAWEAVSTTSPSNCFACAFPFPPRP 300
Query: 269 ------QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
++ S P CPKC +HFCLECD ++ E LH CPGC
Sbjct: 301 SNLDETSHVVMDSTLAPSSRYMCPKCHQHFCLECDAFVQEQLHTCPGC 348
>gi|346979033|gb|EGY22485.1| suppressor of stem-loop protein [Verticillium dahliae VdLs.17]
Length = 461
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 135/346 (39%), Positives = 201/346 (58%), Gaps = 41/346 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D P+R+ + + FV+E+F+QNP+SQ+G+V ++DGVA ++++GG+P H++A+
Sbjct: 101 DLLPTRLRLTWRLAREFVKEYFEQNPISQMGIVGMRDGVAVRVSEMGGNPVEHVEAINKW 160
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
++ G SLQNAL++ +G L PS+G REVL++Y AL + DPGDI +TI +I
Sbjct: 161 AEIDPQGSPSLQNALEMCRGALFHAPSHGTREVLVVYGALLSSDPGDIHDTIANLLADRI 220
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
R S++GL+A++ IC+ LC+ T Y++ALDE H++EL++ PP + + A+
Sbjct: 221 RVSIVGLAAQVAICEELCRKTNAGDATQYNIALDEMHYRELLLRTTTPPVTRTQAQSTAS 280
Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
L+ MGFP R A + + C+CH V GY C RC RVC LP +C CGL L+ S H
Sbjct: 281 LLMMGFPSRTLAPADTTAYCACHN-TPVREGYLCTRCGTRVCRLPAECPACGLTLILSTH 339
Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC--------------QQSLLSSGNK 278
LARSYHHLFP+ + EV P R + C+ C + + G+K
Sbjct: 340 LARSYHHLFPLRNWVEV-PWA----RAAASRACYACLAPFPDPPPQRRGHRTNGTDGGDK 394
Query: 279 PGLYV------------ACPKCKKHFCLECDIYIHESLHNCPGCES 312
V AC C HFC++CD++ HE +HNCPGC+S
Sbjct: 395 EAAPVPKQKGVSESSRYACQVCGNHFCIDCDVFAHEVIHNCPGCQS 440
>gi|353237926|emb|CCA69887.1| probable SSL1-TFIIH subunit (transcription initiation factor),
factor B [Piriformospora indica DSM 11827]
Length = 386
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 119/268 (44%), Positives = 177/268 (66%), Gaps = 21/268 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + + FV E+FDQNPL QIG+V +++GVA + L G+P +K + +
Sbjct: 120 DLRPNRFDLSLEYSREFVTEWFDQNPLGQIGIVGMREGVAERICGLSGNPPDVLKTIADR 179
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L +G+ SLQNA+++ +G +S +P++ REVL+++ +L+TCDPG+I ET++ C E KI
Sbjct: 180 RNLEPAGEPSLQNAIEMARGSMSHLPTHSSREVLVIFGSLTTCDPGNIHETLKSCVEDKI 239
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP-----AIAEFA--- 171
R SV+ L+AEM IC+ C +TGG + VA++E H+++++ EH PPP A+A+ +
Sbjct: 240 RVSVVALAAEMKICRDFCTETGGLFGVAMNEGHYRDMLFEHVPPPAVRKIEAVAKISNGN 299
Query: 172 ---------IANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCR 221
L+ MGFP R E S S+C CH +K GY CPRC ++VC++PTDC
Sbjct: 300 GAGPAPAAATGELMMMGFPTRLPETSAPSLCVCHGIMK-SEGYICPRCLSKVCDIPTDCN 358
Query: 222 ICGLQLVSSPHLARSYHHLFPIAPFDEV 249
+CGL +VS+PHLARSYHHLFP+ P+ V
Sbjct: 359 VCGLVIVSAPHLARSYHHLFPVKPYQPV 386
>gi|390351315|ref|XP_782325.3| PREDICTED: general transcription factor IIH subunit 2-like
[Strongylocentrotus purpuratus]
Length = 500
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 122/260 (46%), Positives = 173/260 (66%), Gaps = 14/260 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D +P+R+A K +E F+ E+FDQNP+SQ+G++T + A LT+LGG+P+ HI AL
Sbjct: 76 DLKPNRLACCTKLLEHFIEEYFDQNPISQVGIITSSNMRAEKLTELGGNPQRHITALEKC 135
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
C + SLQNAL+L L +P + RE+L++ +L+TCDPG+I +TIQ K+ I
Sbjct: 136 NDKPCVKEPSLQNALELAAATLRHMPGHASREILVIMGSLTTCDPGNIHDTIQAMKDHSI 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RC VIGL+A++ +C+ L T G+Y V LDE+HFKEL+MEH PPPA + I+MG
Sbjct: 196 RCCVIGLAADVRVCRKLATVTHGTYGVILDETHFKELLMEHTIPPPARVNTE-PSPIRMG 254
Query: 180 FPQ------RAGEGSISICSCHKEVKVGV-----GYTCPRCKARVCELPTDCRICGLQLV 228
FPQ ++ + S+C CH + K GY CP+C+++ CELP +CR+CGL LV
Sbjct: 255 FPQHVIHTDKSKVSAPSMCMCHLDGKNSEGFGTGGYFCPQCQSKYCELPVECRVCGLTLV 314
Query: 229 SSPHLARSYHHLFPIAPFDE 248
S+PHLARS+HHLFP+ F+E
Sbjct: 315 SAPHLARSFHHLFPLDRFEE 334
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 44/98 (44%), Positives = 56/98 (57%), Gaps = 8/98 (8%)
Query: 220 CRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP 279
C GL LVS+PHLARS+HHLFP+ F+E D + C GCQ + +
Sbjct: 397 CLSSGLTLVSAPHLARSFHHLFPLDRFEEFK---REDHDHPDSLFCQGCQSHI-----RD 448
Query: 280 GLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRHSN 317
CPKC FCL+C+++I ESLH+CPGC S R S
Sbjct: 449 QTAYRCPKCSNVFCLDCELFIQESLHSCPGCASTRPSQ 486
>gi|429848008|gb|ELA23542.1| tfiih basal transcription factor complex p47 subunit
[Colletotrichum gloeosporioides Nara gc5]
Length = 488
Score = 243 bits (620), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 139/357 (38%), Positives = 197/357 (55%), Gaps = 54/357 (15%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D P+R V FV+E+F+QNP+SQ+G++ ++DGVA ++D+ G+P H++ L
Sbjct: 110 DMLPNRYRVTWAYAADFVKEYFEQNPISQLGIIGMRDGVALRISDMSGNPTDHLEKLKSF 169
Query: 62 LGC--SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SG+ SLQNAL++ +G L PS+G REVL+++ AL + DP DI ETI +I
Sbjct: 170 EGQDPSGNPSLQNALEMCRGALFHAPSHGTREVLVVFGALLSSDPADIHETISSLITDRI 229
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
R SV+GL+A++ IC LC T YSV ++++HF++L++ PP A + A+
Sbjct: 230 RVSVVGLAAQVAICAELCSRTNSGDDTQYSVCMNDAHFRDLLLAATTPPVTRTAAQSTAS 289
Query: 175 LIKMGFPQRA---GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
L+ MGFP R GE + +IC+CH + GY C RC RVC LP +C CGL L+ S
Sbjct: 290 LLMMGFPSRTLVRGEAT-AICACHNK-PAREGYLCTRCSTRVCRLPIECPACGLTLILST 347
Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC----------------------- 268
HLARSYHHLFP+ + EV P + S C+ C
Sbjct: 348 HLARSYHHLFPLRNWVEV-PWA----KAGSSKACYSCLTAFPEPPRASKKRDKEKANDND 402
Query: 269 -------------QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
S L ++ G Y AC C HFC++CD+Y HE +HNCPGC+S
Sbjct: 403 GVTTTTPVAPTPVAPSELKGVSESGRY-ACTVCNSHFCIDCDVYAHEVIHNCPGCQS 458
>gi|346464883|gb|AEO32286.1| hypothetical protein [Amblyomma maculatum]
Length = 448
Score = 243 bits (620), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 125/281 (44%), Positives = 183/281 (65%), Gaps = 14/281 (4%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV EFFDQNP+S +G+++ + A L +L G+ ++AL
Sbjct: 75 DLKPTRLICTLKMLELFVEEFFDQNPISNVGVISTCNKRAQKLCELAGNTGKVVEALKSC 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
SG+ SLQN+L+L +L +P + REVLI+ +L+TCDPG+I TI+ ++ I
Sbjct: 135 KTQVPSGEPSLQNSLELAAEVLRHLPVHTSREVLIIMGSLTTCDPGNIQTTIETMRKCNI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCS++GL+AE+++C L + TGG+Y+V LDE+HFK+++ +HA PPP A ++LI+MG
Sbjct: 195 RCSIVGLAAEVYVCCRLTKTTGGTYNVILDENHFKDILFQHAIPPPVTGN-AESSLIRMG 253
Query: 180 FP--QRAGEGSISICSCHKEVK------VGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
FP + EG S+C CH + + GY CP+C + C LP +C++CGL LVS+P
Sbjct: 254 FPYHRTETEGKPSLCFCHLDSQNPSDGLSKGGYFCPQCNGKYCTLPVECKVCGLTLVSAP 313
Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL 272
HLARSYHHLF + PFDEV P L P + + TCF CQ S+
Sbjct: 314 HLARSYHHLFALEPFDEVPPDTL--PAD-TPVTCFACQVSI 351
>gi|308501637|ref|XP_003113003.1| hypothetical protein CRE_25217 [Caenorhabditis remanei]
gi|308265304|gb|EFP09257.1| hypothetical protein CRE_25217 [Caenorhabditis remanei]
Length = 376
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 132/310 (42%), Positives = 185/310 (59%), Gaps = 20/310 (6%)
Query: 5 PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
PSR VV K +++F+ FF+QNP++QIGL+T KD A +T + G+ ++L + +
Sbjct: 82 PSRFVVVMKALQSFLERFFEQNPIAQIGLITCKDRKAERMTMMTGNIRVLKESLNSLTEA 141
Query: 63 GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
C GD SLQNAL L L +P + REV+I+ +ALS+ DPG+I TI+ K IRCS
Sbjct: 142 FCGGDFSLQNALQLACANLKGMPGHVSREVVIVMAALSSIDPGNIFSTIESMKRMNIRCS 201
Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
IGLSAEMFICK + + T G YSVALD H + L +H PP + A+ + N I +GFP
Sbjct: 202 AIGLSAEMFICKEMAKATKGEYSVALDPDHLQLLFSKHTLPPSS-AKSSECNAIHVGFPH 260
Query: 183 RAGEGSISICSCHKEVK--VGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
+ S C CH + K G+ C +C AR C +P +C +C L LV++P LAR++ HL
Sbjct: 261 HELIKTRSFCVCHPDSKPVSSRGFICTQCGARHCSIPAECPVCKLTLVAAPQLARAFRHL 320
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
P+A F ++ + +R C+ C+ L S G + C KC+ FCL+CD +
Sbjct: 321 QPLAAFQQI---------DVARGQCYACETRLASEGFR------CEKCRSVFCLDCDTLL 365
Query: 301 HESLHNCPGC 310
HESLH CPGC
Sbjct: 366 HESLHVCPGC 375
>gi|449300691|gb|EMC96703.1| hypothetical protein BAUCODRAFT_486555 [Baudoinia compniacensis
UAMH 10762]
Length = 459
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 134/349 (38%), Positives = 193/349 (55%), Gaps = 39/349 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + FVRE+F+QNP+SQ+G+ +++G+A ++D+ G+P HI AL +
Sbjct: 104 DLRPTRHLLTINYTIMFVREYFEQNPISQLGIFGMREGLAIRISDMSGNPNDHIAALKQL 163
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G+ SLQNALD+ + L PS+G REV+I+ AL + DPGDI +TI+ C +I
Sbjct: 164 RSTEPKGNPSLQNALDMARAALYHTPSHGTREVVIVLGALLSSDPGDIHDTIKACIRDRI 223
Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPP------PAIAE 169
R +IGL+A+M IC +C+ T G Y+VA+DE +++ +M+ PP +
Sbjct: 224 RVRIIGLAAQMHICAEICRKTNGGDENYYNVAVDEVDYRQQLMDITTPPVLRSTDTEAQK 283
Query: 170 FAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
A L+ MGFP R E ++C+CH + G GY C RCKA+VC LP C C L L+
Sbjct: 284 HNQAALLMMGFPSRIVEEKPTLCACHGNLTRG-GYLCSRCKAKVCNLPATCPTCDLTLIL 342
Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC--------------------- 268
S HLARSYHHLFP+ + EV+ R CFGC
Sbjct: 343 STHLARSYHHLFPLRNWVEVS---WTRARQEGSIQCFGCLTPFPRIPNPQEAVNGTEHDV 399
Query: 269 --QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRH 315
++ + G C C +HFC++CD++ HE +HNCPGC+S H
Sbjct: 400 ASARTKRAEGASESSRYECETCGQHFCIDCDVFCHEVVHNCPGCQSNAH 448
>gi|190348397|gb|EDK40844.2| hypothetical protein PGUG_04942 [Meyerozyma guilliermondii ATCC
6260]
Length = 488
Score = 241 bits (616), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 136/374 (36%), Positives = 207/374 (55%), Gaps = 68/374 (18%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D RP+R A+ + ++ F+ EFFDQNP+SQ+G+V +++GV+N ++++ GSP+ HI L
Sbjct: 116 DLRPNRFAMTLQLLQEFIVEFFDQNPISQLGIVMMRNGVSNLVSEVNGSPQYHIDKLRSL 175
Query: 60 -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
+ GD SLQN+L++ + +L + + +EVLIL+ AL T DPGDI
Sbjct: 176 RSRQHNRFEPKGDPSLQNSLEMARAMLKYNFGTNMDDNKNSKEVLILFGALFTSDPGDIH 235
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDT--------------GGSYSVALDESHFK 154
+TI + +I+ +IGLSA++ IC+ + T G +YSV ++E+H++
Sbjct: 236 KTIDNLIKDEIKVKIIGLSAQVSICQQIANKTNNLGSRTTGANSSSGSNYSVIMNETHYR 295
Query: 155 ELIMEHAPPPPAIAEFAIAN----LIKMGFPQR------AGEGSISI-----CSCH---- 195
EL+M+ P P A + LIKMGFP + A ++++ C+CH
Sbjct: 296 ELLMDCVVPLPVAATEKRSTVGVPLIKMGFPSKVSPPISASNNTVNLDLPVLCACHPTNG 355
Query: 196 -----KEVKVG-------VGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPI 243
V +G +GY CP+C++RVC LPT C CGL L+ S HLARSYHHL P+
Sbjct: 356 SLESKDAVTIGAESSTSIIGYQCPQCRSRVCHLPTICPTCGLMLILSTHLARSYHHLVPL 415
Query: 244 APFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA-----CPKCKKHFCLECDI 298
+ EV + + C+GC +G G A CP C + FC++CD+
Sbjct: 416 REYKEVPA-----ADHYEGTHCYGCLLKFPEAGGPDGPLRASSRYRCPTCARDFCIDCDV 470
Query: 299 YIHESLHNCPGCES 312
++HE+LHNCPGCES
Sbjct: 471 FVHETLHNCPGCES 484
>gi|256075586|ref|XP_002574099.1| btf [Schistosoma mansoni]
gi|360045424|emb|CCD82972.1| putative btf [Schistosoma mansoni]
Length = 423
Score = 241 bits (616), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 139/348 (39%), Positives = 196/348 (56%), Gaps = 46/348 (13%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ K FVRE+FDQNP+SQ+G++ D A LT+L G+P H+ AL +
Sbjct: 85 DLKPNRLICTVKAACTFVREYFDQNPISQLGIIVTSDRRAERLTELSGNPRPHLAALQSL 144
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
C G+ SLQNAL L + L + H E+++L ++L+TCDP DI +TI+ ++I
Sbjct: 145 YTRTCIGEPSLQNALLLAESRLKY--TIHHNEIVVLMASLTTCDPSDIHQTIKSLSSNRI 202
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSVI L+ E+F+ + L Q T G++ V +DE H K ++ + PPP A E ANLI+MG
Sbjct: 203 RCSVISLAVEVFVYRALAQLTQGTFHVIMDELHLKNVLKDLVPPPVAAIETE-ANLIRMG 261
Query: 180 FPQRA----GEGSISICSCHKEVKVGVG---------YTCPRCKARVCELPTDCRICGLQ 226
FP ++ C CH K Y CPRC + CELP +C +CGL
Sbjct: 262 FPHSETFDLDRFAVKRCMCHLSNKPTNDKTDTTTNPHYACPRCHSAYCELPVECNVCGLT 321
Query: 227 LVSSPHLARSYHHLFPIAPFDEV--------TPLCLNDPRNRSRST-------------C 265
LV++PHLAR+YHHLFP+ F+ V T L D + +S+ T C
Sbjct: 322 LVAAPHLARAYHHLFPLDLFEPVNGSGSFSETSL---DNQEKSKETIDSADSVDHQQRLC 378
Query: 266 FGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESL 313
FGC +++ PG CPKCK FC CD +H+S+H+CPGC +L
Sbjct: 379 FGC--NVIIPSKIPG--YQCPKCKFIFCHSCDAVLHDSIHSCPGCLTL 422
>gi|392593840|gb|EIW83165.1| Ssl1-domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 375
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 116/259 (44%), Positives = 175/259 (67%), Gaps = 18/259 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D RP+R + + F+ E+FDQNPL QIG+V ++ GV + ++ G+P+ + ++
Sbjct: 118 DMRPTRFDLTLEYTREFITEWFDQNPLGQIGVVGMRAGVGERIGEMSGNPQEVLGSIADR 177
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K+ +G+ SLQNA+++ + +S +P++ RE+LI+ +L+TCDPG+I +T+ C ++KI
Sbjct: 178 HKIEPTGEPSLQNAIEMARNGMSHLPTHSSREILIVIGSLTTCDPGNIHDTLDSCVKNKI 237
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAI------- 172
R SV+ L+AEM IC+ LC TGG + VAL+E HFK+L+ E PPP A+ A+
Sbjct: 238 RVSVVALAAEMKICRELCDKTGGQFGVALNEGHFKDLLFELVPPP---AQRAVNRAGGGA 294
Query: 173 ----ANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQL 227
A+L+ MGFP R + + S+C CH ++K G+ CPRC A+VC++PTDC ICGL +
Sbjct: 295 GSTSADLMMMGFPTRLPDTTPPSLCVCHSDLK-SEGFLCPRCMAKVCDVPTDCDICGLMI 353
Query: 228 VSSPHLARSYHHLFPIAPF 246
VSSPHLARSYHHLFP+ P+
Sbjct: 354 VSSPHLARSYHHLFPVRPY 372
>gi|149248450|ref|XP_001528612.1| suppressor of stem-loop protein 1 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146448566|gb|EDK42954.1| suppressor of stem-loop protein 1 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 509
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 144/393 (36%), Positives = 205/393 (52%), Gaps = 93/393 (23%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D RP+R+++ ++ FV EFFDQNP+SQIG+V +++GVAN ++++ G P+ HI L
Sbjct: 126 DLRPTRLSMTLNYLQEFVVEFFDQNPISQIGIVLMRNGVANLVSEVSGLPQYHIDKLRQL 185
Query: 60 -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
K GD SLQN L++ + LL S + +E+L++ +L T DPG+I
Sbjct: 186 KARQHNKFEPKGDPSLQNTLEMARSLLKFNFGTSSNNTKNSKEILVVLGSLFTSDPGNIH 245
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTG--------GSYSVALDESHFKELIMEH 160
+TI+ + +I+ VIGLSA++ IC+ L T +Y V ++E HFKEL+M+
Sbjct: 246 KTIEGLVKDEIKVRVIGLSAQVAICQELVNRTNHEPKNTMSKNYGVIMNEYHFKELLMDC 305
Query: 161 APPPPAIAEF------------AIANLIKMGFPQRA------------------------ 184
P P EF A +IKMGFP +A
Sbjct: 306 VTPLPLTEEFRKPTESTNQGGTAGVPVIKMGFPSKAQPNVTSAIGNTEFTVEFPNLNASY 365
Query: 185 -GEGS--------ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+GS +S S + + +GY CP+CK++VC LPT C +CGL L+ S HLAR
Sbjct: 366 PTQGSAESKDVIEVSNNSLRQSASLIIGYRCPQCKSKVCNLPTICPVCGLMLILSTHLAR 425
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQ----------------QSLLSSGNKP 279
SYHHL P+AP+ EV D C+GCQ S++SS K
Sbjct: 426 SYHHLVPLAPYVEVEVKSAYDSEE-----CYGCQLRFPKGVDLKVEKKSLDSMISSRYK- 479
Query: 280 GLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
CPKCKK FC+ CD+++HE LHNCPGCE+
Sbjct: 480 -----CPKCKKDFCINCDVFVHEVLHNCPGCEN 507
>gi|390602659|gb|EIN12052.1| Ssl1-domain-containing protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 377
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 114/258 (44%), Positives = 174/258 (67%), Gaps = 11/258 (4%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + F+ E+FDQNPL QIG+V ++ GV + + G+P+ +KA+
Sbjct: 121 DMRPTRFGLTLEYAREFITEWFDQNPLGQIGIVGMRAGVGERIGQMSGNPQDVLKAISER 180
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K+ +G+ SLQNA+++ + +S +P++ +E+LI++ +L+T DPG+I +T+ C + +I
Sbjct: 181 HKMEPNGEPSLQNAIEMARSSMSHLPTHSSKEILIVFGSLTTVDPGNIHDTLDACIKDRI 240
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP-------AIAEFAI 172
R S++ L+AEM IC+ +C TGG + VA++E H+K+L+ E PPP A + A
Sbjct: 241 RISIVALAAEMKICREICDKTGGQFGVAMNEGHYKDLLFELIPPPAQRALTSKAGSGGAT 300
Query: 173 ANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
A+L+ MGFP R + S S+C CH E+K G+ CPRC A+VC++PTDC ICGL +VSSP
Sbjct: 301 ADLMIMGFPTRLPDTSPPSLCVCHSELK-SEGFLCPRCLAKVCDVPTDCDICGLMIVSSP 359
Query: 232 HLARSYHHLFPIAPFDEV 249
HLARSYHHLFP+ P+ V
Sbjct: 360 HLARSYHHLFPVKPYTAV 377
>gi|302680178|ref|XP_003029771.1| hypothetical protein SCHCODRAFT_82959 [Schizophyllum commune H4-8]
gi|300103461|gb|EFI94868.1| hypothetical protein SCHCODRAFT_82959 [Schizophyllum commune H4-8]
Length = 400
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 113/263 (42%), Positives = 175/263 (66%), Gaps = 19/263 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R ++ + FV E+FDQNPL QIG+V ++ G+ + ++ G+P+ +K++
Sbjct: 116 DMRPTRFDLMLQYAREFVVEWFDQNPLGQIGVVGMRSGIGERICEMTGNPQDVLKSISER 175
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+L G+ SLQNA+++ + +S +P++ RE+L+++ +L+TCDPGDI +T+ C KI
Sbjct: 176 HRLEPQGEPSLQNAIEMARSSMSHLPTHSSREILLIFGSLTTCDPGDIHDTLDTCVRQKI 235
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPP------------PAI 167
R SV+ L+AEM IC+ C TGG + VAL+E HF++++ E PPP P
Sbjct: 236 RVSVVALAAEMKICREFCDKTGGQFGVALNEGHFRDILFELVPPPAQRALGGAGMGAPGK 295
Query: 168 AEFA---IANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRIC 223
A A A+L+ MGFP R + + S+C+CH E++ G+ CPRC+++VC++PTDC +C
Sbjct: 296 ATAASGPSADLMMMGFPTRLPDTTPPSLCACHGELR-SAGFLCPRCQSKVCDVPTDCDVC 354
Query: 224 GLQLVSSPHLARSYHHLFPIAPF 246
GL +VSSPHLARSYHHLFP+ P+
Sbjct: 355 GLMIVSSPHLARSYHHLFPVRPY 377
>gi|223999581|ref|XP_002289463.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974671|gb|EED93000.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 422
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 135/344 (39%), Positives = 189/344 (54%), Gaps = 49/344 (14%)
Query: 7 RMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL-------M 59
R+ + FV E++DQNPL +G+V +DG A LT LGGSP+ H AL M
Sbjct: 86 RLETIINLASEFVNEYYDQNPLGHLGVVICQDGEAEMLTTLGGSPKKHKLALGAALVTEM 145
Query: 60 GKLGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIM-ETIQKCKE 116
+ G S G+ SLQN +++ L P YG RE++++ SAL+TCDPGDI+ ET+ +
Sbjct: 146 KRRGPSIGGEFSLQNGIEVAGRSLGYAPRYGSREIIVITSALATCDPGDILGETLPRLLH 205
Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIA-------- 168
+ IR S + L AE+ ICK L + TGG V +D H ++L+M HA PPP +
Sbjct: 206 AGIRVSAVSLQAELHICKKLAEVTGGLAGVCMDSRHLRDLVMGHAIPPPVVNTSKADDGE 265
Query: 169 -----------EFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP 217
I + MGFP R + S GY CPRC++R +LP
Sbjct: 266 TTTAHNGRNHNHTMICEFVTMGFPSRDDDRR----SNRLIAFARTGYVCPRCRSRCSDLP 321
Query: 218 TDCRICGLQLVSSPHLARSYHHLFPIAPFDEVT-PLCLNDPRNRSRSTCFGCQQSL---- 272
+DC +CGL+L+ +PHLAR++HHLFP+ PF+E+ L +N R CFGC + +
Sbjct: 322 SDCAVCGLRLILAPHLARTFHHLFPVRPFEELDEALVINCDR-----CCFGCLKVIGCKP 376
Query: 273 LSSGNKPG------LYVACPKCKKHFCLECDIYIHESLHNCPGC 310
+ S + G L CP+C FC +CD Y+HE+LHNCPGC
Sbjct: 377 MDSTDGAGDEGESILRFQCPECHNAFCPDCDAYLHETLHNCPGC 420
>gi|380482556|emb|CCF41168.1| hypothetical protein CH063_11526 [Colletotrichum higginsianum]
Length = 480
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 135/362 (37%), Positives = 199/362 (54%), Gaps = 52/362 (14%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D P+R V FV+E+F+QNP+SQ+ ++ ++DGVA ++D+ G+P H++ L
Sbjct: 104 DMLPNRYRVAWAYAADFVKEYFEQNPISQLAIIGMRDGVAIRISDMSGNPTDHLEKLKTF 163
Query: 62 LGC--SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SG+ SLQNAL++ +G L PS+G REVLI++ AL + DPGDI +TI +I
Sbjct: 164 EGQDPSGNPSLQNALEMCRGALFHAPSHGTREVLIVFGALLSSDPGDIHDTITSLIADRI 223
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
R S++GL+A++ IC LC T YSV ++++HF++L++ PP A + A+
Sbjct: 224 RVSIVGLAAQVAICAELCSRTNAGDDTQYSVCMNDAHFRDLLLAATTPPVTRTAAQSTAS 283
Query: 175 LIKMGFPQRA---GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
L+ MGFP R GE + ++C+CH + GY C RC RVC LP +C C L L+ S
Sbjct: 284 LLMMGFPSRTLAQGEAT-AVCACHNK-PAREGYLCTRCGTRVCRLPIECPACSLTLILST 341
Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL------------------- 272
HLARSYHHLFP+ + EV L +S++ C+ C +
Sbjct: 342 HLARSYHHLFPLRNWVEV----LWAEATKSKA-CYSCLTTFPEPPKASRKKDKDKDSDGV 396
Query: 273 ---------------LSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRHSN 317
L ++ G Y C C HFC++CD+Y HE +HNCPGC+S
Sbjct: 397 PTTTPVAPTPVIASELKGVSESGRY-GCTVCGNHFCIDCDVYAHEVIHNCPGCQSDTRGA 455
Query: 318 PI 319
P+
Sbjct: 456 PM 457
>gi|240277005|gb|EER40515.1| transcription factor TFIIH complex subunit Ssl1 [Ajellomyces
capsulatus H143]
gi|325094945|gb|EGC48255.1| transcription factor TFIIH [Ajellomyces capsulatus H88]
Length = 505
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 142/356 (39%), Positives = 202/356 (56%), Gaps = 49/356 (13%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV EFF+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 134 DLRPTRYLLTLRYAQEFVLEFFEQNPISQLGVLGMRDGLAVRISDMSGNPTDHIMAIQAL 193
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQNAL++ +G L P++G REVLI++ AL + DPGDI +TI KI
Sbjct: 194 RPKDPKGMPSLQNALEMARGTLFHTPTHGTREVLIIFGALLSSDPGDIHQTITALVADKI 253
Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIME-HAPPPPAIAEFAI-- 172
R S+IGL+A++ IC+ LC T Y +AL+E HF+EL M APPP +A +
Sbjct: 254 RISIIGLAAQVAICRDLCGRTNNGDDTVYGIALNEQHFRELFMNVTAPPPTTVAPTSTPT 313
Query: 173 ----------------ANLIKMGF--PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
++L+ MGF + S+C+CH + GY C RC A+VC
Sbjct: 314 ASSANGPKTTTTNTTTSSLLMMGFPSLTLTTTPTPSLCACHSKPSRA-GYLCCRCNAKVC 372
Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-----Q 269
LP+ C CGL L+ S HLARSYHHLFP+ + EV+ R RS ++CF C +
Sbjct: 373 TLPSSCPCCGLTLILSTHLARSYHHLFPLMNWVEVS--WRRAARKRS-ASCFACGVGFPR 429
Query: 270 QSLLSSGN-----KPGLYVA--------CPKCKKHFCLECDIYIHESLHNCPGCES 312
L SG K L V CP C+ HFC++CD++ HE +HNCPGC+S
Sbjct: 430 MPKLVSGEPEETAKAALGVGVSVSGRYECPVCECHFCIDCDVFAHEVVHNCPGCQS 485
>gi|193627420|ref|XP_001947532.1| PREDICTED: general transcription factor IIH subunit 2-like
[Acyrthosiphon pisum]
Length = 388
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 198/321 (61%), Gaps = 18/321 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +P+R K ++ F++++FDQNP+SQ+GL+T ++ A +++L G+ + H++ L
Sbjct: 74 DLKPTRQLCTYKLLQDFIQDYFDQNPISQLGLITTRNSSAERISELSGNRKYHLEELKKT 133
Query: 62 LG----CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKES 117
G C+G S+QN+L++ ++ +PS+ +E+LI+ S+LS+CDPG+I +I+ K
Sbjct: 134 FGDFNYCNGLMSVQNSLEIALSVMKMLPSHTSKEILIIGSSLSSCDPGEINTSIEMLKTH 193
Query: 118 KIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIK 177
IR S+I L+AE+ + +HLC +T G ++V +D+ HFK ++ P P + A+ +K
Sbjct: 194 NIRVSMIHLAAEVRMFRHLCNETKGKHNVIVDDVHFKHILWSLVEPVP-LPNSVDASCVK 252
Query: 178 MGFPQRAGEG-SISICSCH--KEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
MGFPQ + + CSCH + K+ G+ CP+C ++ CELP +C+ CGL LVSS HL
Sbjct: 253 MGFPQELEQKPPFTTCSCHLAEGGKLNAKGFFCPQCNSKYCELPVECKCCGLILVSSLHL 312
Query: 234 ARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFC 293
ARS HHL PI PF ++ + S + C+GC++ + +Y C CKKH+C
Sbjct: 313 ARSLHHLVPIKPFIKI------ELEEGSSAYCYGCRKRIKVPAE--NVYF-CESCKKHYC 363
Query: 294 LECDIYIHESLHNCPGCESLR 314
CDIY+H +LH CPGC R
Sbjct: 364 DGCDIYVHNTLHVCPGCAVKR 384
>gi|388856633|emb|CCF49750.1| probable SSL1-TFIIH subunit (transcription initiation factor),
factor B [Ustilago hordei]
Length = 506
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 129/353 (36%), Positives = 191/353 (54%), Gaps = 46/353 (13%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R V + + FV E+FDQNP+ Q+ ++ ++G+A L +GG+ H +L K
Sbjct: 154 DMRPNRFDVTLQYAKEFVMEYFDQNPIGQLSVIGTREGIAERLAMMGGNTVDHTASLSNK 213
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L G+ SLQNAL++ + L +P+ RE+L ++ +L+TCDPG+I +TI + I
Sbjct: 214 RRLEPRGEPSLQNALEMARSSLVHLPASNSREILAIFGSLTTCDPGNIHDTINTLVKDNI 273
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA------------- 166
R S++ L+AE+ + K +C TGG++SVAL+E HF + + EH PPP
Sbjct: 274 RVSIVHLAAEVKVFKDVCTRTGGTFSVALNEGHFHDSLFEHVPPPAVEGPRRPKRRIIAA 333
Query: 167 --------IAEFAIAN---LIKMGFPQR-AGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
E +AN L++M FP R + ++C+CH + G GY CPRC +VC
Sbjct: 334 SGNGAATTEEEEEVANGVDLLQMAFPLRLPAHAAPTLCACHSRSR-GSGYLCPRCGVKVC 392
Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQ----- 269
++PTDC +CG+ +V S HLARSYHHLFP+ P +V P + C C
Sbjct: 393 DVPTDCPVCGITIVMSTHLARSYHHLFPV-PNWKVVPWSALLSAAKVEPACLSCNLLFPT 451
Query: 270 ------------QSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
++L +G C +C FCLECD ++HE LH CPGC
Sbjct: 452 QEQGAARSAAANKALEEAGLSESSRYRCGRCSHDFCLECDAFVHEQLHVCPGC 504
>gi|322700310|gb|EFY92066.1| TFIIH basal transcription factor complex p47 subunit , putative
[Metarhizium acridum CQMa 102]
Length = 457
Score = 237 bits (605), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 138/347 (39%), Positives = 192/347 (55%), Gaps = 54/347 (15%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D P+R + AFVRE+F+QNP+SQ+G++ ++DGVA ++DL G+P HI+ L
Sbjct: 112 DLLPTRYRLTISYALAFVREYFEQNPISQLGIIGMRDGVAVRISDLSGNPAEHIERLKAV 171
Query: 62 LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G G+ SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI ETI KI
Sbjct: 172 EGQDPQGNPSLQNALEMCRGALFHTPSHGTREVLIIYGALLSSDPGDIHETIANLVTDKI 231
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
R S++GLSA++ IC LC T Y++A+DE HF++L + PP AE + A+
Sbjct: 232 RVSIVGLSAQVAICADLCSRTNAGDDSQYNIAMDEVHFRDLFLASTTPPVTRTAEQSTAS 291
Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
L+ MGFP R A G+IS C+CH + + GY CGL L+ S H
Sbjct: 292 LLMMGFPSRTLAPGGAISFCACHSK-PLREGYNS----------------CGLTLILSTH 334
Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGC---------------------QQ 270
LARSYHHLFP+ + EV+ +RST CF C +
Sbjct: 335 LARSYHHLFPLRNWVEVSWT------EATRSTACFSCLCPFPEPPKDKVDGVEKSKDDTR 388
Query: 271 SLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRHSN 317
L + G AC C HFC++CD++ H+ +HNCPGC+S+ H +
Sbjct: 389 HLKAKGVSESGRYACEVCGNHFCIDCDVFAHQVIHNCPGCQSIVHHD 435
>gi|17555086|ref|NP_499239.1| Protein T16H12.4 [Caenorhabditis elegans]
gi|27923998|sp|P34567.3|TF2H2_CAEEL RecName: Full=General transcription factor IIH subunit 2; AltName:
Full=TFIIH basal transcription factor complex subunit 2
gi|15718246|emb|CAA83139.3| Protein T16H12.4 [Caenorhabditis elegans]
Length = 376
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 130/310 (41%), Positives = 183/310 (59%), Gaps = 20/310 (6%)
Query: 5 PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
PSR VV K ++ F+ FF+QNP++QIGL+T KD A+ LT + G+ ++L + +
Sbjct: 82 PSRFVVVMKALQTFLDRFFEQNPIAQIGLITCKDRKADRLTMMTGNIRVLKESLNTLTEA 141
Query: 63 GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
C GD SLQNAL L L +P + REV+++ SALST DPG+I TI+ K IRCS
Sbjct: 142 FCGGDFSLQNALQLACANLKGMPGHVSREVVLVISALSTIDPGNIYSTIETMKRMNIRCS 201
Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
IGLSAEMF+CK + + T G YSVALD H + L +H PP + A+ + N I +GFP
Sbjct: 202 AIGLSAEMFVCKEMAKATKGEYSVALDPDHLQLLFSKHTLPPSS-AKSSECNAIHVGFPH 260
Query: 183 RAGEGSISICSCHKEVK--VGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
+ S C CH + K G+ C +C AR C +P +C +C L LV++P LAR++ HL
Sbjct: 261 HELITTRSFCVCHPDTKPISSRGFICTQCGARHCSIPAECPVCKLTLVAAPQLARAFRHL 320
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
P++ F+++ +R C+ C+ L G + C C+ FCL+CD +
Sbjct: 321 QPLSAFEQI---------EVTRGFCYACETRLSGEGFR------CGSCQLVFCLDCDTLL 365
Query: 301 HESLHNCPGC 310
HESLH CPGC
Sbjct: 366 HESLHVCPGC 375
>gi|343426526|emb|CBQ70055.1| probable SSL1-TFIIH subunit (transcription initiation factor),
factor B [Sporisorium reilianum SRZ2]
Length = 511
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 127/353 (35%), Positives = 190/353 (53%), Gaps = 48/353 (13%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R V + FV E+FDQNP+ Q+ ++ + G+A L +GG+ H +L K
Sbjct: 162 DMRPNRFDVTLQYAREFVGEYFDQNPIGQLSIIGTRQGIAERLAMMGGNTVDHTASLSNK 221
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L G+ SLQNAL++ + L +P+ RE+L ++ +L+TCDPG+I +TI + I
Sbjct: 222 RRLEPRGEPSLQNALEMARSSLVHLPASNSREILAIFGSLTTCDPGNIHDTIGTLVKDNI 281
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP------------AI 167
R S++ L+AE+ + K +C TGG++SVAL+E HF + + E PPP A
Sbjct: 282 RVSIVHLAAEVKVFKDVCTRTGGTFSVALNEGHFHDSLFELVPPPAVEGPRRTKRRSGAA 341
Query: 168 AEFAIAN------------LIKMGFPQR-AGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
A A+ L++M FP R + ++C+CH + G GY CPRC +VC
Sbjct: 342 TNGAAADGEDDEEVHNGVDLLQMAFPLRLPAHAAPTLCACHSRSR-GSGYVCPRCGVKVC 400
Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC------ 268
++PTDC +CG+ +V S HLARSYHHLFP+ + V ++ S + C C
Sbjct: 401 DVPTDCPVCGITIVMSTHLARSYHHLFPVPNWKAVPWTAVS---ATSDAACLSCNLPFPT 457
Query: 269 -----------QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
++L +G C +C + FCLECD ++HE LH CPGC
Sbjct: 458 LQDGEASSAAANKALEDAGLSASSRYRCTRCSRDFCLECDAFVHEQLHVCPGC 510
>gi|154276968|ref|XP_001539329.1| suppressor of stem-loop protein 1 [Ajellomyces capsulatus NAm1]
gi|150414402|gb|EDN09767.1| suppressor of stem-loop protein 1 [Ajellomyces capsulatus NAm1]
Length = 506
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 143/356 (40%), Positives = 203/356 (57%), Gaps = 49/356 (13%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV EFF+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 135 DLRPTRYLLTLRYAQEFVLEFFEQNPISQLGVLGMRDGLAVRISDMSGNPTDHIMAIQAL 194
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQNAL++ +G L P++G REVLI++ AL + DPGDI +TI KI
Sbjct: 195 RPKDPKGTPSLQNALEMARGALFHTPTHGTREVLIIFGALLSSDPGDIHQTITALVADKI 254
Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIME-HAPPPPAIAEFAI-- 172
R S+IGL+A++ IC+ LC T Y VAL+E HF+EL M+ APP +A +
Sbjct: 255 RISIIGLAAQVAICRDLCARTNNGDDTVYGVALNEQHFRELFMDVTAPPATTVAPTSTPT 314
Query: 173 ---AN-------------LIKMGF--PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
AN L+ MGF + S+C+CH + GY C RC A+VC
Sbjct: 315 AFSANGPKTTTTSTTTSSLLMMGFPSLTLTTTPTPSLCACHSKPSRA-GYLCCRCNAKVC 373
Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-----Q 269
LP+ C CGL L+ S HLARSYHHLFP+ + EV+ R RS ++CF C +
Sbjct: 374 TLPSSCPCCGLTLILSTHLARSYHHLFPLMNWVEVS--WRRAARKRS-ASCFACGVGFPR 430
Query: 270 QSLLSSGNKP---------GLYVA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
L SG G+ V+ CP C+ HFC++CD++ HE +HNCPGC+S
Sbjct: 431 MPKLVSGEPEETAKAAVGVGVSVSGRYECPVCECHFCIDCDVFAHEVVHNCPGCQS 486
>gi|226295046|gb|EEH50466.1| suppressor of stem-loop protein [Paracoccidioides brasiliensis
Pb18]
Length = 518
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 135/363 (37%), Positives = 186/363 (51%), Gaps = 56/363 (15%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + + + FVREFFDQNP+SQ+G+V ++DG+A ++D+ G+P HI A+ G
Sbjct: 121 DLRPTRYLLTLRYAQDFVREFFDQNPISQLGVVGMRDGLAVRISDMSGNPTEHILAIQGL 180
Query: 62 LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI +TI KI
Sbjct: 181 RAKDPKGMPSLQNALEMARGALFHTPSHGTREVLIIYGALLSSDPGDIHKTITSLITDKI 240
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEFAI--- 172
V+GL+A++ IC+ L T Y+VA++E HF+EL++ PP
Sbjct: 241 HVYVLGLAAQVSICQELVTRTNNGDDSGYNVAMNEQHFRELVLNVTTPPATTLASHTAAA 300
Query: 173 ------------ANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDC 220
L+ MGFP R ++C+CH GY CPRC +VC LP C
Sbjct: 301 AANGTGTNPSTDGTLLPMGFPNRHLTPHPTLCACHSTPSRS-GYLCPRCCTKVCTLPASC 359
Query: 221 RICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPG 280
C L L+ S HLARSYHHLFP+ + EV+ R CF C + PG
Sbjct: 360 PSCNLTLILSTHLARSYHHLFPLMNWVEVSWRKAARAEAEGRVGCFACG---VGFAGVPG 416
Query: 281 LYVA-------------------------------CPKCKKHFCLECDIYIHESLHNCPG 309
+V C C+ HFC++CD++ HE +HNCPG
Sbjct: 417 EFVGAEGDEEREEEEDGKGEGKGASRGISVSGRYECLVCRCHFCIDCDVFAHEVVHNCPG 476
Query: 310 CES 312
C+S
Sbjct: 477 CQS 479
>gi|403414452|emb|CCM01152.1| predicted protein [Fibroporia radiculosa]
Length = 377
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 117/258 (45%), Positives = 169/258 (65%), Gaps = 17/258 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R ++ + F+ E+FDQNPL QIG+V ++ G+ N L E H K
Sbjct: 123 DMRPTRFDLMLQYAREFITEWFDQNPLGQIGIVGMRGGIGNPQDVLKAISERH------K 176
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
L +G+ SLQNA+D+ + + +P++ RE+LI++ +L+TCDPG+I +++ C + +IR
Sbjct: 177 LEPNGEPSLQNAIDMARASMGHLPTHSSREILIIFGSLTTCDPGNIHDSLDDCVKDRIRI 236
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP--AIAEFA-------I 172
SV+ L+AEM IC+ LC TGG + VAL+E HFK+L+ E PPP A+A
Sbjct: 237 SVVALAAEMKICRELCDKTGGQFGVALNEGHFKDLLFELIPPPAQRAVARTKGGPAGNPS 296
Query: 173 ANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
A+L+ MGFP R + S S+C+CH ++K G+ CPRC A+VC++PTDC ICGL +VSSP
Sbjct: 297 ADLMMMGFPTRLPDTSAPSLCACHSQLK-SEGFLCPRCCAKVCDVPTDCDICGLMIVSSP 355
Query: 232 HLARSYHHLFPIAPFDEV 249
HLARSYHHLFP+ + V
Sbjct: 356 HLARSYHHLFPVKTYKAV 373
>gi|268573216|ref|XP_002641585.1| Hypothetical protein CBG09887 [Caenorhabditis briggsae]
Length = 376
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 184/310 (59%), Gaps = 20/310 (6%)
Query: 5 PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
PSR VV K ++ F+ FF+QNP++QIGL+T KD A+ LT + G+ ++L + +
Sbjct: 82 PSRFVVVMKALQQFLERFFEQNPIAQIGLITCKDRKADRLTMMTGNIRILKESLSSLTEA 141
Query: 63 GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
C GD SLQNAL L L +P + REV+I+ +ALS+ DPG+I TI+ K IRCS
Sbjct: 142 FCGGDFSLQNALQLACVNLKGMPGHVSREVVIVMAALSSIDPGNIFSTIETMKRMNIRCS 201
Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
IGLSAEMFICK + + T G YSVALD H L +H PP + A+ + N I +GFP
Sbjct: 202 AIGLSAEMFICKEMAKATKGEYSVALDPDHLHLLFSKHTLPPSS-AKSSECNAIHVGFPH 260
Query: 183 RAGEGSISICSCHKEVK-VGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
+ S C CH + K + + G+ C +C AR C +P +C +C L LV++P LAR++ HL
Sbjct: 261 HELIKTRSFCVCHPDSKPISLRGFICTQCGARHCSIPAECPVCKLTLVAAPQLARAFRHL 320
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
P+A F ++ +R +C+ C+ G + C KC+ FCL+CD +
Sbjct: 321 QPLAAFKQI---------EVTRGSCYACETRFSDEGFR------CEKCRLVFCLDCDTLL 365
Query: 301 HESLHNCPGC 310
HESLH CPGC
Sbjct: 366 HESLHVCPGC 375
>gi|146414033|ref|XP_001482987.1| hypothetical protein PGUG_04942 [Meyerozyma guilliermondii ATCC
6260]
Length = 488
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 134/374 (35%), Positives = 205/374 (54%), Gaps = 68/374 (18%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D RP+R A+ + ++ F+ EFFDQNP+SQ+G+V +++GV+N ++++ G P+ HI L
Sbjct: 116 DLRPNRFAMTLQLLQEFIVEFFDQNPISQLGIVMMRNGVSNLVSEVNGLPQYHIDKLRLL 175
Query: 60 -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
+ GD LQN+L++ + +L + + +EVLIL+ AL T DPGDI
Sbjct: 176 RSRQHNRFEPKGDPLLQNSLEMARAMLKYNFGTNMDDNKNSKEVLILFGALFTSDPGDIH 235
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDT--------------GGSYSVALDESHFK 154
+TI + +I+ +IGLSA++ IC+ + T G +YSV ++E+H++
Sbjct: 236 KTIDNLIKDEIKVKIIGLSAQVSICQQIANKTNNLGSRTTGANSSSGSNYSVIMNETHYR 295
Query: 155 ELIMEHAPPPPAIAEFAIAN----LIKMGFPQR------AGEGSISI-----CSCH---- 195
EL+M+ P P A + LIKMGFP + A ++++ C+CH
Sbjct: 296 ELLMDCVVPLPVAATEKRSTVGVPLIKMGFPLKVLPPISASNNTVNLDLPVLCACHPTNG 355
Query: 196 -----KEVKVG-------VGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPI 243
V +G +GY CP+C++RVC LPT C CGL L+ S HLARSYHHL P+
Sbjct: 356 SLESKDAVTIGAESSTSIIGYQCPQCRSRVCHLPTICPTCGLMLILSTHLARSYHHLVPL 415
Query: 244 APFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA-----CPKCKKHFCLECDI 298
+ EV + + C+GC +G G A CP C + FC++CD+
Sbjct: 416 REYKEVPA-----ADHYEGTHCYGCLLKFPEAGGPDGPLRASSRYRCPTCARDFCIDCDV 470
Query: 299 YIHESLHNCPGCES 312
++HE+LHNCPGCES
Sbjct: 471 FVHETLHNCPGCES 484
>gi|322708259|gb|EFY99836.1| TFIIH basal transcription factor complex p47 subunit , putative
[Metarhizium anisopliae ARSEF 23]
Length = 456
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 140/348 (40%), Positives = 192/348 (55%), Gaps = 56/348 (16%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D P+R + AFVREFF+QNP+SQ+G++ ++DGVA ++DLGG+P HI+ L
Sbjct: 111 DLLPTRYRLTISYALAFVREFFEQNPISQLGIIGMRDGVAVRISDLGGNPAEHIERLKAV 170
Query: 62 LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G G+ SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI ETI KI
Sbjct: 171 EGQDPQGNPSLQNALEMCRGALFHTPSHGTREVLIIYGALLSSDPGDIHETISNLIADKI 230
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP-PAIAEFAIAN 174
R S++GLSA++ IC LC T Y++A+DE HF++L + PP AE A+
Sbjct: 231 RVSIVGLSAQVAICADLCSRTNAGDDSQYNIAMDEVHFRDLFLAGTTPPVTRTAEQNTAS 290
Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
L+ MGFP R A G+IS C+CH + + GY CGL L+ S H
Sbjct: 291 LLMMGFPSRTLAPGGAISFCACHSK-PLREGYNS----------------CGLTLILSTH 333
Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRST-CFGC----------------------Q 269
LARSYHHLFP+ + EV+ +RST CF C +
Sbjct: 334 LARSYHHLFPLRNWVEVSWA------EATRSTVCFSCLCPFPEPPKDKIDGVEKSKDDTR 387
Query: 270 QSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRHSN 317
++ G Y AC C HFC++CD++ H+ +HNCPGC+S H +
Sbjct: 388 HPKAKGVSESGRY-ACEVCGNHFCIDCDVFAHQVIHNCPGCQSTVHHD 434
>gi|219116518|ref|XP_002179054.1| suppressor of stem-loop protein 1 [Phaeodactylum tricornutum CCAP
1055/1]
gi|217409821|gb|EEC49752.1| suppressor of stem-loop protein 1 [Phaeodactylum tricornutum CCAP
1055/1]
Length = 413
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 125/317 (39%), Positives = 187/317 (58%), Gaps = 24/317 (7%)
Query: 6 SRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKLG 63
+R+ V ++ FV+E++DQNPLSQ+G V +K+G A LT L S ++H AL + ++
Sbjct: 106 TRIDVTVAHLQHFVQEYYDQNPLSQLGFVLLKNGEAEILTQLSSSSKTHKLALDSVAQMA 165
Query: 64 CS------GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIM-ETIQKCKE 116
+ G+ SLQN L+L L PS+G RE++++ +ALSTCDPG ++ ET+ + ++
Sbjct: 166 AAEGPSGGGEFSLQNGLELAGRSLGHQPSHGSREIVLVTAALSTCDPGYLLTETLPRLRQ 225
Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIA-NL 175
++IR S + L+AE+ +C+ L +T G V LD +H ++ + PPP + L
Sbjct: 226 AQIRVSCLALNAELHVCRKLADETHGVLGVCLDRAHLRDWLRSQTVPPPTLPSRPHGCAL 285
Query: 176 IKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
++MGFP R A + + V YTCP C+A+ LP DC +CGLQLV +PHL
Sbjct: 286 VRMGFPTRIIADGPEFVHATRQQTVLARTAYTCPVCQAKNAALPADCAVCGLQLVLAPHL 345
Query: 234 ARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFC 293
ARS+HHLFP+ PF EV P + Q +++S L ACP C + FC
Sbjct: 346 ARSFHHLFPVPPFREVQP---------DTAVVSNSQVPVVASRQ---LRFACPDCHQVFC 393
Query: 294 LECDIYIHESLHNCPGC 310
++CD ++HESLHNCPGC
Sbjct: 394 VDCDAFLHESLHNCPGC 410
>gi|393909764|gb|EJD75581.1| nucleolar protein 14 [Loa loa]
Length = 1208
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 128/313 (40%), Positives = 186/313 (59%), Gaps = 21/313 (6%)
Query: 5 PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
PSR+AV K + F+ +F +QNP+SQ+G+V KD A CL L G+ ++L + ++
Sbjct: 83 PSRLAVTIKALNQFLDKFSEQNPISQVGIVVCKDKRAECLIPLTGNVRLVKESLSTITEV 142
Query: 63 GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
C G+ SL N+L L P Y REV+++ ++LSTCDP +I T + K IRCS
Sbjct: 143 LCHGEFSLHNSLMAAIKSLHSYPGYASREVILIVASLSTCDPSNIFGTFELLKRYHIRCS 202
Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
VI LSAE+FI + LC T G ++V LD +HF+ ++ EHA PP + + A +++++MGFP
Sbjct: 203 VISLSAEVFIFRKLCSATSGCHNVILDSTHFEVILNEHANPPIS-SRNAESSVVRMGFPA 261
Query: 183 RAGEGSISICSCHK-EVKV--GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
S S C CH+ E++ G G+ CP+C AR C LP +CRIC L L+S+P LARS H+
Sbjct: 262 HESIDSPSFCLCHQSEIRPSGGRGFFCPQCGARYCSLPVECRICKLTLISAPQLARSLHN 321
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
L P+ F+E+ + + CF C + L C CK FC++CD+
Sbjct: 322 LLPLPAFEEI---------DTTERVCFACIRQLDDKS------FVCKNCKSTFCIDCDVL 366
Query: 300 IHESLHNCPGCES 312
+HESL CPGC+S
Sbjct: 367 LHESLQICPGCKS 379
>gi|354547034|emb|CCE43767.1| hypothetical protein CPAR2_214110 [Candida parapsilosis]
Length = 498
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 142/378 (37%), Positives = 201/378 (53%), Gaps = 72/378 (19%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D RP+R+++ + FV EFFDQNP+SQ+G++ +++GVAN ++++ GSP+ HI L
Sbjct: 124 DLRPTRLSLTLNYLSEFVVEFFDQNPISQLGIILMRNGVANLISEVSGSPQYHIDRLRQL 183
Query: 60 -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
K GD SLQN L++ + LL + S +E+L+++ +L T DPGDI
Sbjct: 184 KARQHNKYEPKGDPSLQNCLEMARSLLKFNFGSASNNSKNSKEILLVFGSLFTSDPGDIH 243
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS--------YSVALDESHFKELIME- 159
+TI + I+ SVIGLSA++ IC+ L T Y V ++E+HFKEL+M+
Sbjct: 244 KTIDSLVKDNIKVSVIGLSAQVAICQELVNKTNHESRNSSSKHYGVIMNETHFKELLMDC 303
Query: 160 -HAPPPPAIAEFAIAN----LIKMGFPQRAGEGSISICSCHK------------------ 196
P P E I LIKMGFP + + S +
Sbjct: 304 VTPLPLPESEEIKIETKGVPLIKMGFPSKVQPNATSTIGNSEYTVEFPQLNASYPTQGSN 363
Query: 197 ------EVKVGV------GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIA 244
EV G+ GY CP+CK++VC LPT C +CGL L+ S HLARSYHHL P+A
Sbjct: 364 DSKDVVEVNSGLAMSQTFGYQCPQCKSKVCNLPTICPVCGLMLILSTHLARSYHHLVPLA 423
Query: 245 PFDEVTPLCLNDPRNRSRSTCFGCQ-------QSLLSSGNKPGLYVA---CPKCKKHFCL 294
P+ EV PL + R C+GCQ +S ++G L + C C FC+
Sbjct: 424 PYKEV-PLST----SYDREYCYGCQLIFPSGVKSTKNTGTIDSLTSSRYRCANCCNDFCI 478
Query: 295 ECDIYIHESLHNCPGCES 312
CD++IHE LHNCPGCE+
Sbjct: 479 NCDVFIHEVLHNCPGCEN 496
>gi|344233967|gb|EGV65837.1| hypothetical protein CANTEDRAFT_118624 [Candida tenuis ATCC 10573]
Length = 488
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 141/378 (37%), Positives = 207/378 (54%), Gaps = 74/378 (19%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESH---IKAL 58
D RP+R +++ ++ FV EFFDQNP+SQ+G+V +++G+++ ++D+ GSP+ H I+ L
Sbjct: 117 DLRPNRFSMMLSLLQEFVVEFFDQNPISQLGIVLMRNGISDVVSDVNGSPQFHLDKIRQL 176
Query: 59 MGK----LGCSGDSSLQNALDLVQGLLSQIPSYGH--------REVLILYSALSTCDPGD 106
GK GD SLQNAL+L + LL ++G+ +E+LI++ AL T DPGD
Sbjct: 177 RGKQHNRFEPKGDPSLQNALELSRSLLRY--NFGNNINDTKNSKEILIIFGALFTSDPGD 234
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS--------YSVALDESHFKELIM 158
I +TI + I+ VIGLSA++ IC+ + T + Y V ++ESHFKEL+M
Sbjct: 235 IHKTISSLVKDNIKVKVIGLSAKVAICQEIVNKTNHTSNNFNNPNYGVIMNESHFKELLM 294
Query: 159 EHAPPPPAIAE-------FAIANLIKMGFPQRAGEGS-----------ISICSCH----- 195
+ P P E L+KMGFP + I IC+CH
Sbjct: 295 DCVVPLPITVEQQKEIEASKSVPLLKMGFPTKVQPTLSSSSGGHSLDFIQICACHSGSRT 354
Query: 196 ---------KEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPF 246
++ +GY CP+C ++VC LPT C ICGL L+ S HLARSYHHL P+ +
Sbjct: 355 ENGSDAAAIQQSSSVIGYRCPQCSSKVCSLPTVCPICGLMLILSTHLARSYHHLVPLVDY 414
Query: 247 DEVTPLCLNDPRNRSRSTCFGCQQSL------LSSGNKPGLYVA------CPKCKKHFCL 294
+EV P+ + P S C+GC + +K L C KCK FC+
Sbjct: 415 EEV-PVKGSYP----SSFCYGCLLQFPEGIENIHKDSKEKLETMSSSRYRCKKCKNDFCI 469
Query: 295 ECDIYIHESLHNCPGCES 312
+CD+++HE+LHNCPGCE+
Sbjct: 470 DCDVFVHETLHNCPGCEN 487
>gi|150864125|ref|XP_001382831.2| hypothetical protein PICST_29472 [Scheffersomyces stipitis CBS
6054]
gi|149385381|gb|ABN64802.2| component of RNA polymerase transcription factor [Scheffersomyces
stipitis CBS 6054]
Length = 489
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 139/383 (36%), Positives = 204/383 (53%), Gaps = 77/383 (20%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D RPSR ++ + ++ FV EFFDQNP+SQ+G++ +++GV+N ++++ GSP+ HI +
Sbjct: 111 DLRPSRFSMTLQLLQDFVVEFFDQNPISQLGIILMRNGVSNLVSEVSGSPQYHIDKIRQL 170
Query: 60 -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
+ GD SLQN+L++ + LL + + +EVL+++ AL T DPGDI
Sbjct: 171 KSRQHNRYEPKGDPSLQNSLEMARSLLKFNFGNNANDTKNSKEVLLIFGALFTSDPGDIH 230
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTG--------GSYSVALDESHFKELIMEH 160
+TI + +I+ +IGLSA++ IC+ L T +Y V ++ESHFKEL M+
Sbjct: 231 KTIDNLVKDEIKVKIIGLSAQVAICQDLVNRTNHQQKNISSKNYGVIMNESHFKELFMDC 290
Query: 161 APPPP----AIAEFAIAN----LIKMGFPQRAGEGSISICSCHK---------------- 196
P P + E A AN +IKMGFP + S + ++
Sbjct: 291 VTPLPITTSQVNENAEANKGVPIIKMGFPTKIQPVLTSTLASNEYTIEFPQLSASDPTHG 350
Query: 197 -------EVKVG----------VGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
E+ G +GY CP+CK++VC LPT C +CGL L+ S HLARSYHH
Sbjct: 351 LDDNQVVEINNGTDSSIQATNVIGYQCPQCKSKVCNLPTVCPVCGLMLILSTHLARSYHH 410
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGC----------QQSLLSSGNKPGLYVACPKCK 289
L P+ PF EV P+ + CFGC +S S N C KC
Sbjct: 411 LVPLVPFKEV-PVS----SKYNSKHCFGCLLKFPKGVSNTKSKSSLENMTSSRYRCMKCS 465
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
FC++CD++IHE+LHNCPGCE+
Sbjct: 466 HDFCIDCDVFIHETLHNCPGCEN 488
>gi|320585884|gb|EFW98563.1| suppressor of stem-loop protein 1 [Grosmannia clavigera kw1407]
Length = 524
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 136/352 (38%), Positives = 188/352 (53%), Gaps = 42/352 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D+ PS V FVR FF+QNP+SQ+ +V ++DGVA ++D+GGSP H+ L
Sbjct: 144 DYLPSCQRVALDCAADFVRAFFEQNPISQLAVVAMRDGVALRVSDMGGSPAEHLDRLADW 203
Query: 62 LGC--SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G G SLQNAL++ +G L P++G REVL+++ AL + DP DI +TI +I
Sbjct: 204 DGVEPQGQPSLQNALEMCRGALFHTPAHGTREVLVVFGALLSSDPADIHDTIAALVADRI 263
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKELIMEHAPPPPA------IAE 169
R S++GL+A++ +C +C T +Y+VA + HF EL++ PP A
Sbjct: 264 RVSIVGLAAKVAVCAEICARTNPGDEAAYAVARHDVHFHELVLAATTPPATRPDDGDAAS 323
Query: 170 FAIANLIKMGFPQR------AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRIC 223
A+L+ MGFP R AG G +S+C+CH V GY C RC A+VC LP +C C
Sbjct: 324 TGPASLLMMGFPSRAIADTVAGGGGLSVCACHSR-PVREGYGCTRCGAKVCRLPAECPAC 382
Query: 224 GLQLVSSPHLARSYHHLFPIAPFDEVT-PLCLNDPRNRSRSTCFGC-------------- 268
GL L+ S HLARSYHHLFP+ + EV+ + R R C C
Sbjct: 383 GLTLILSTHLARSYHHLFPLRSWVEVSWTAAADAVRRRRAVACHACLTPFPAVDTAAAAA 442
Query: 269 --------QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
SG AC C HFC++CD++ HE +HNCPGC+S
Sbjct: 443 AAVTAAAAATQTTGSGVSESGRYACEVCGHHFCIDCDVFAHEVIHNCPGCQS 494
>gi|448514642|ref|XP_003867164.1| RNA polymerase transcription factor TFIIH core component [Candida
orthopsilosis Co 90-125]
gi|380351502|emb|CCG21726.1| RNA polymerase transcription factor TFIIH core component [Candida
orthopsilosis Co 90-125]
Length = 511
Score = 234 bits (596), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 140/381 (36%), Positives = 197/381 (51%), Gaps = 78/381 (20%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D RP+R+++ + FV EFFDQNP+SQ+G++ +++GVAN ++++ GSP+ HI L
Sbjct: 137 DLRPTRLSLTLNYLSEFVVEFFDQNPISQLGIILMRNGVANLISEVSGSPQYHIDRLKQL 196
Query: 60 -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
K GD SLQN L++ + LL + S +EVL+++ +L T DPGDI
Sbjct: 197 KARQHNKYEPKGDPSLQNCLEMARSLLKFNFGSASNNSKNSKEVLLVFGSLFTSDPGDIH 256
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS--------YSVALDESHFKELIMEH 160
+TI+ + I+ SVIGLSA++ IC+ L T Y V ++E+HFKEL+M+
Sbjct: 257 KTIESLVKDNIKVSVIGLSAQVAICQELVNRTNNEPRNSSSKHYGVIMNETHFKELLMDC 316
Query: 161 APPPP------AIAEFAIANLIKMGFPQRAGEGSISICSCHK------------------ 196
P P E LIKMGFP + + S +
Sbjct: 317 VTPLPLPESEETKVETKGVPLIKMGFPSKVQPNATSTIGNSEYTVEFPRLNASYPTQGSN 376
Query: 197 ------EVKVGV------GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIA 244
EV G+ GY CP+CK++VC LPT C +CGL L+ S HLARSYHHL P+A
Sbjct: 377 DSRDIVEVNAGLVTSPSFGYQCPQCKSKVCNLPTICPVCGLMLILSTHLARSYHHLVPLA 436
Query: 245 PFDEVTPLCLNDPRNRSRST--CFGCQQSLLSSGNKPGLYVA-----------CPKCKKH 291
P+ E+ P + S T C+GCQ SG K C C
Sbjct: 437 PYKEI-------PVSPSYDTDFCYGCQLK-FPSGTKSTKETKTIDSITSSRYRCTNCNND 488
Query: 292 FCLECDIYIHESLHNCPGCES 312
FC+ CD+++HE LHNCPGCE+
Sbjct: 489 FCINCDVFVHEVLHNCPGCEN 509
>gi|448096877|ref|XP_004198537.1| Piso0_001913 [Millerozyma farinosa CBS 7064]
gi|359379959|emb|CCE82200.1| Piso0_001913 [Millerozyma farinosa CBS 7064]
Length = 514
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 138/381 (36%), Positives = 204/381 (53%), Gaps = 75/381 (19%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D RP+R +++ ++ F+ EFFDQNP+SQ+G+V +++G+A ++D+ GSP+ HI +
Sbjct: 138 DLRPTRFSMMLSLLQEFILEFFDQNPISQMGIVMMRNGIAIRISDVSGSPQYHIDKIRQL 197
Query: 60 -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
+ GD SLQN+L+L + LL + +E+LI++ AL T DPG+I
Sbjct: 198 KTKQHNRFEPKGDPSLQNSLELARSLLMYNFNNKVNDTQNSKEILIIFGALFTSDPGNIH 257
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS----------YSVALDESHFKELIM 158
+TI + I+ +IGL+A++ IC+ + + T S Y V ++ESHFKEL+M
Sbjct: 258 KTIDNLVKDDIKIKIIGLAAQVAICEEIVRRTNSSKTSSSYINTNYGVIMNESHFKELLM 317
Query: 159 EHAPPPPAIAEFAIAN-----LIKMGFPQR-----AGEGSI-----SICSCH-------- 195
+ P P +F LIKMGFP + SI SICSCH
Sbjct: 318 DCVIPLPVSEDFEKKEKKGVPLIKMGFPSKIQPSVTSNSSINLNLPSICSCHPDEENDAS 377
Query: 196 --KEV----------KVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPI 243
KE K +GY CP+C +VC+LPT C ICGL L+ S HLARSYHHL P+
Sbjct: 378 HSKEAGEAGNGTYSSKKVIGYKCPQCLNKVCQLPTICPICGLMLILSTHLARSYHHLVPL 437
Query: 244 APFDEVTPLCLNDPRNRSRSTCFGC--------QQSLLSSGNKPGLYVA----CPKCKKH 291
+ EV + + C+GC + + NK + C KC++
Sbjct: 438 EAYKEVPTM-----EEYESTYCYGCLLKFPSGRKNAQSDDANKIESLTSSRYRCGKCQQD 492
Query: 292 FCLECDIYIHESLHNCPGCES 312
FC++CD++IHE+LHNCPGCE+
Sbjct: 493 FCIDCDVFIHETLHNCPGCEN 513
>gi|255724012|ref|XP_002546935.1| suppressor of stem-loop protein 1 [Candida tropicalis MYA-3404]
gi|240134826|gb|EER34380.1| suppressor of stem-loop protein 1 [Candida tropicalis MYA-3404]
Length = 505
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 134/387 (34%), Positives = 200/387 (51%), Gaps = 80/387 (20%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D RP+R+++ ++ FV EFFDQNP+S++G++ +++G+AN ++++ GSP+ HI +
Sbjct: 124 DLRPNRLSMTLSYLQDFVTEFFDQNPISRLGIILMRNGIANLVSEVSGSPQYHIDKIRQL 183
Query: 60 -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
+ GD SLQN+L++ + LL + + +EVLI++ AL T DPGDI
Sbjct: 184 KARQHNRFEPKGDPSLQNSLEMARSLLKFNFGSTSNNTKNSKEVLIIFGALFTSDPGDIH 243
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS--------YSVALDESHFKELIMEH 160
TI + +I+ SVIGLSA++ IC+ L T Y V ++ESHFKEL+M+
Sbjct: 244 RTIDNLIKDEIKVSVIGLSAQVAICQELVNRTNKEPRNSQSKHYGVIMNESHFKELLMDS 303
Query: 161 APPPP----------AIAEFAIANLIKMGFPQR--------------------------- 183
P P A +E +++MGFP +
Sbjct: 304 VTPLPLTESEKRLQDAQSEDGGVPVLRMGFPSKVQPTLTSVVTGTDMIIEFPHLNASFPT 363
Query: 184 -AGEGSISICSCHKEVKVG------VGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
E S V +GY CP+CK++VC LPT C +CGL L+ S HLARS
Sbjct: 364 QGSEDSKDAVEIQNNRAVAAASSSVIGYQCPQCKSKVCNLPTLCPVCGLMLILSTHLARS 423
Query: 237 YHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL---LSSGNKPGLYVA-------CP 286
YHHL P+A F EV + D CFGCQ + +G + G + C
Sbjct: 424 YHHLVPLAQFKEVPVAPVYDSE-----FCFGCQLKFPDGVKAGTQKGSLESMTSSRYRCK 478
Query: 287 KCKKHFCLECDIYIHESLHNCPGCESL 313
+C+K FC+ CD+++HE LH CPGCE++
Sbjct: 479 RCEKSFCINCDVFVHEVLHTCPGCENM 505
>gi|241950787|ref|XP_002418116.1| TFIIH subunit, putative; general transcription and DNA repair
factor IIH subunit, putative; suppressor of stem-loop
protein 1, putative [Candida dubliniensis CD36]
gi|223641455|emb|CAX43416.1| TFIIH subunit, putative [Candida dubliniensis CD36]
Length = 501
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 135/383 (35%), Positives = 197/383 (51%), Gaps = 78/383 (20%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D RPSR+++ ++ FV EFFDQNP+S++G++ +++GVAN ++++ GSP+ H+ +
Sbjct: 124 DLRPSRLSMTLNYLQEFVVEFFDQNPISRLGIILMRNGVANLVSEVSGSPQYHVDKIRNL 183
Query: 60 -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
+ GD SLQN+L++ + LL + S +E+L+++ AL T DPGDI
Sbjct: 184 KARQHNRFEPKGDPSLQNSLEMARSLLKFNFGSTSNNSKNSKEILVIFGALFTSDPGDIH 243
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS--------YSVALDESHFKELIMEH 160
+TI + +I+ SVIGLSA++ IC+ L T Y V ++ESHFKEL+ME
Sbjct: 244 KTIDSLVKDEIKTSVIGLSAQVAICQELVNRTNKEPRNSQSKHYGVIMNESHFKELLMES 303
Query: 161 APPPPAI--------AEFAIANLIKMGFPQR----------------------------A 184
P P +E +++MGFP +
Sbjct: 304 VTPLPLTESEKQIQESEQNGVPVLRMGFPTKVQPTLTSAIGGSDYIIEFPHLNASFPTQG 363
Query: 185 GEGSISICSCHKEVKVG----VGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
E S + +GY CP+CK +VC LPT C +CGL L+ S HLARSYHHL
Sbjct: 364 SEDSKDVVEIQTNKPAAASSLIGYQCPQCKCKVCNLPTICPVCGLMLILSTHLARSYHHL 423
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLY-----------VACPKCK 289
P+AP+ EV + D + CFGCQ G K G C KC
Sbjct: 424 VPLAPYKEVKVSPIYD-----STYCFGCQLK-FPEGVKLGTLKKSLESMTSSRYRCLKCS 477
Query: 290 KHFCLECDIYIHESLHNCPGCES 312
+ FC+ CD+++HE LHNCPGCE+
Sbjct: 478 QDFCINCDVFVHEVLHNCPGCEN 500
>gi|310796716|gb|EFQ32177.1| hypothetical protein GLRG_07321 [Glomerella graminicola M1.001]
Length = 482
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 136/359 (37%), Positives = 198/359 (55%), Gaps = 44/359 (12%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D P+R V FV+E+F+QNP+SQ+G++ ++DGVA ++D+ G+P H++ L
Sbjct: 104 DMLPNRYRVAWAYAADFVKEYFEQNPISQLGIIGMRDGVAIRISDMSGNPTDHLEKLKTF 163
Query: 62 LGC--SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SG+ SLQNAL++ +G L PS+G REVLI++ AL + DPGDI +TI +I
Sbjct: 164 EGQDPSGNPSLQNALEMCRGALFHAPSHGTREVLIVFGALLSSDPGDIHDTISSLITDRI 223
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
R S++GL+A++ IC LC T YSV ++++HF++L++ PP + A+
Sbjct: 224 RVSIVGLAAQVAICAELCSRTNAGDETQYSVCMNDAHFRDLLLATTTPPVTRTVAQSTAS 283
Query: 175 LIKMGFPQRA---GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
L+ MGFP R GE + ++C+CH + GY C RC RVC LP +C C L L+ S
Sbjct: 284 LLMMGFPSRTLAQGEAT-AVCACHNK-PAREGYLCTRCGTRVCRLPIECPACSLTLILST 341
Query: 232 HLARSYHHLFPIAPFDEVTPL----------CLND----PRNRSRSTCFG---------- 267
HLARSYHHLFP+ + EV+ CL P+ R
Sbjct: 342 HLARSYHHLFPLRNWIEVSWAEAAKSKACYSCLTTFPEPPKTGRRKDKDKDKEKDGLHTT 401
Query: 268 -------CQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLRHSNPI 319
S L ++ G Y +C C HFC++CD+Y HE +HNCPGC+S P+
Sbjct: 402 APVAPTPVTASDLKGVSESGRY-SCTVCGNHFCIDCDVYAHEVIHNCPGCQSDTRGAPM 459
>gi|238881955|gb|EEQ45593.1| suppressor of stem-loop protein 1 [Candida albicans WO-1]
Length = 472
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 136/388 (35%), Positives = 202/388 (52%), Gaps = 83/388 (21%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D RP+R+++ ++ FV EFFDQNP+S++G++ +++GVAN ++++ GSP+ H+ +
Sbjct: 90 DLRPTRLSMTLNYLQEFVVEFFDQNPISRLGIILMRNGVANLVSEVSGSPQYHVDKIRNL 149
Query: 60 -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
+ GD SLQN+L++ + LL + S +E+L+++ AL T DPGDI
Sbjct: 150 KARQHNRFEPKGDPSLQNSLEMARSLLKFNFGSTSNNSKNSKEILVIFGALFTSDPGDIH 209
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS--------YSVALDESHFKELIMEH 160
+TI + +I+ SVIGLSA++ IC+ L T Y V ++ESHFKEL+ME
Sbjct: 210 KTIDNLVKDEIKASVIGLSAQVAICQELVNRTNNEPRNSQSKHYGVIMNESHFKELLMES 269
Query: 161 APPPPAI--------AEFAIANLIKMGFPQRA-------------------------GEG 187
P P +E +++MGFP + +G
Sbjct: 270 VTPLPLTESEKQMQESEQNGVPVLRMGFPTKVQPTLTSAIGGSDYIVEFPHLNASFPTQG 329
Query: 188 S------ISICSCHKEVKVG------VGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
S + I + V +GY CP+CK +VC LPT C +CGL L+ S HLAR
Sbjct: 330 SEDSKDVVEIQTNKPAAAVATTSSSVIGYQCPQCKCKVCNLPTICPVCGLMLILSTHLAR 389
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA----------- 284
SYHHL P+AP+ EV + D + CFGCQ G K G +
Sbjct: 390 SYHHLVPLAPYKEVKVSPIYD-----STYCFGCQLK-FPEGVKLGTHKGTLESMTSSRYR 443
Query: 285 CPKCKKHFCLECDIYIHESLHNCPGCES 312
C KC + FC+ CD+++HE LHNCPGCE+
Sbjct: 444 CLKCSQDFCINCDVFVHEVLHNCPGCEN 471
>gi|258571509|ref|XP_002544558.1| suppressor of stem-loop protein 1 [Uncinocarpus reesii 1704]
gi|237904828|gb|EEP79229.1| suppressor of stem-loop protein 1 [Uncinocarpus reesii 1704]
Length = 413
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 121/274 (44%), Positives = 170/274 (62%), Gaps = 13/274 (4%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + + + FV E+F+QNP+SQ+G++ ++DG+A ++D+ G+P HI AL
Sbjct: 85 DLRPTRYLLTLRFAQEFVLEYFEQNPISQLGIIGMRDGLAVKISDMSGNPTEHILALQAL 144
Query: 62 LGC--SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+G SLQNAL++ +G L PS+G REVLIL+ AL + DPGDI +T+ KI
Sbjct: 145 RAKDPNGLPSLQNALEMARGTLFHTPSHGTREVLILFGALLSSDPGDIHQTVSSLISDKI 204
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEFAIAN- 174
R V+GL+AE+ IC+ +C T Y VAL+E HF+EL+ME PP ++ AN
Sbjct: 205 RVGVVGLAAEVAICREICAKTNAGDDSGYGVALNEQHFRELMMETTTPPVTYSKKQAANS 264
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
L+ MGFP R E + S+C+CH G GY C RC ++VC LP +C CGL L+ S HLA
Sbjct: 265 LLMMGFPSRTVEPAPSMCACHSMPSRG-GYLCSRCGSKVCTLPAECPACGLTLILSTHLA 323
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC 268
RSYHHLFP+ + EV P + + + CF C
Sbjct: 324 RSYHHLFPLINWIEV-PW----KKASTSANCFAC 352
>gi|68466623|ref|XP_722507.1| hypothetical protein CaO19.1457 [Candida albicans SC5314]
gi|68466906|ref|XP_722368.1| hypothetical protein CaO19.9032 [Candida albicans SC5314]
gi|46444338|gb|EAL03613.1| hypothetical protein CaO19.9032 [Candida albicans SC5314]
gi|46444486|gb|EAL03760.1| hypothetical protein CaO19.1457 [Candida albicans SC5314]
Length = 507
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 136/388 (35%), Positives = 202/388 (52%), Gaps = 83/388 (21%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D RP+R+++ ++ FV EFFDQNP+S++G++ +++GVAN ++++ GSP+ H+ +
Sbjct: 125 DLRPTRLSMTLNYLQEFVVEFFDQNPISRLGIILMRNGVANLVSEVSGSPQYHVDKIRNL 184
Query: 60 -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
+ GD SLQN+L++ + LL + S +E+L+++ AL T DPGDI
Sbjct: 185 KARQHNRFEPKGDPSLQNSLEMARSLLKFNFGSTSNNSKNSKEILVIFGALFTSDPGDIH 244
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS--------YSVALDESHFKELIMEH 160
+TI + +I+ SVIGLSA++ IC+ L T Y V ++ESHFKEL+ME
Sbjct: 245 KTIDSLVKDEIKASVIGLSAQVAICQELVNRTNNEPRNSQSKHYGVIMNESHFKELLMES 304
Query: 161 APPPPAI--------AEFAIANLIKMGFPQRA-------------------------GEG 187
P P +E +++MGFP + +G
Sbjct: 305 VTPLPLTESEKQIQESEQNGVPVLRMGFPTKVQPTLTSAIGGSDYIVEFPHLNASFPTQG 364
Query: 188 S------ISICSCHKEVKVG------VGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
S + I + V +GY CP+CK +VC LPT C +CGL L+ S HLAR
Sbjct: 365 SEDSKDVVEIQTNKPAAAVATTSSSVIGYQCPQCKCKVCNLPTICPVCGLMLILSTHLAR 424
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA----------- 284
SYHHL P+AP+ EV + D + CFGCQ G K G +
Sbjct: 425 SYHHLVPLAPYKEVKVSPIYD-----STYCFGCQLK-FPEGVKLGTHKGTLESMTSSRYR 478
Query: 285 CPKCKKHFCLECDIYIHESLHNCPGCES 312
C KC + FC+ CD+++HE LHNCPGCE+
Sbjct: 479 CLKCSQDFCINCDVFVHEVLHNCPGCEN 506
>gi|448110898|ref|XP_004201714.1| Piso0_001913 [Millerozyma farinosa CBS 7064]
gi|359464703|emb|CCE88408.1| Piso0_001913 [Millerozyma farinosa CBS 7064]
Length = 514
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 141/382 (36%), Positives = 205/382 (53%), Gaps = 77/382 (20%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D RP+R +++ ++ F+ EFFDQNP+SQ+G+V +++G+A ++D+ GSP+ HI +
Sbjct: 138 DLRPTRFSMMLSLLQEFILEFFDQNPISQMGIVMMRNGIAVRISDVSGSPQYHIDKIRQL 197
Query: 60 -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
+ GD SLQN+L+L + LL + +E+LI++ AL T DPG+I
Sbjct: 198 KTKQHNRFEPKGDPSLQNSLELARSLLMYNFNNKVNDTQNSKEILIIFGALFTSDPGNIH 257
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS----------YSVALDESHFKELIM 158
+TI + I+ +IGL+A++ IC+ + + T S Y V ++ESHFKEL+M
Sbjct: 258 KTIDNLVKDDIKVKIIGLAAQVAICEEIVRRTNSSKTSSSYINTNYGVIMNESHFKELLM 317
Query: 159 EHAPPPPAIAEFAIAN-----LIKMGFPQR-----AGEGSI-----SICSCH-------- 195
+ P P +F LIKMGFP + SI SICSCH
Sbjct: 318 DCVIPLPVSEDFEKKEKKGVPLIKMGFPSKIQPSITSNSSINLNLPSICSCHPDEENESS 377
Query: 196 --KEV----------KVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPI 243
KE K +GY CP+C +VC+LPT C ICGL L+ S HLARSYHHL P+
Sbjct: 378 HSKEAGEAGNGTYSSKKVIGYKCPQCLNKVCQLPTICPICGLMLILSTHLARSYHHLVPL 437
Query: 244 APFDEVTPL-----------CLNDP--RNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKK 290
+ EV + L P R + S +SL SS + C KC++
Sbjct: 438 EAYREVPTMEEYESIYCYGCLLKFPPGRKNAESDETNKIESLTSSRYR------CGKCQQ 491
Query: 291 HFCLECDIYIHESLHNCPGCES 312
FC++CD++IHE+LHNCPGCE+
Sbjct: 492 DFCIDCDVFIHETLHNCPGCEN 513
>gi|294654358|ref|XP_456408.2| DEHA2A01606p [Debaryomyces hansenii CBS767]
gi|199428818|emb|CAG84360.2| DEHA2A01606p [Debaryomyces hansenii CBS767]
Length = 514
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 132/385 (34%), Positives = 206/385 (53%), Gaps = 79/385 (20%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D RP+R +++ ++ FV E+FDQNP+SQ+G++ +++GV+N ++++ GSP+ H+ +
Sbjct: 134 DLRPTRFSMMLSLLQEFVVEYFDQNPISQLGIIMMRNGVSNLISEVNGSPQYHLDKIRQL 193
Query: 60 -----GKLGCSGDSSLQNALDLVQGLL------SQIPSYGHREVLILYSALSTCDPGDIM 108
K GD SLQNAL++ + LL + + +E+LI++ AL T DPGDI
Sbjct: 194 KSRQHNKYEPKGDPSLQNALEMARSLLMYNFGTNLNDTKNSKEILIIFGALFTSDPGDIH 253
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDT-------------GGSYSVALDESHFKE 155
+TI + +I+ VIGLSA++ IC+ L T G +Y V ++E HF+E
Sbjct: 254 KTINSLVKDEIKVKVIGLSAQVAICQELVNKTNNPGKSNTFWSAGGNNYGVIMNEIHFRE 313
Query: 156 LIMEHAPPPPAIAEFAIAN-------LIKMGFPQR------AGEGSISI-----CSCHKE 197
L+M+ P P I +IKMGFP + + ++S+ C+CH
Sbjct: 314 LLMDCVIPLPVTTNAVIDENNSKEVPIIKMGFPSKIQPVITSSTSNLSVDFPQLCACHPT 373
Query: 198 ----------------VKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
+ +GY CP+C +VC LPT C ICGL L+ S HLARSYHHL
Sbjct: 374 QDSLESKNNDPTATLGTRSVIGYECPQCNNKVCHLPTICPICGLMLILSTHLARSYHHLV 433
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGC--------QQSLLSSGNKPGL------YVACPK 287
P+ + EV P+ + S + C+GC + + ++ K + C K
Sbjct: 434 PLGDYKEV-PVS----ESYSSTYCYGCLLKFPSGIKNTDVNEDGKTKMDSLTSSRYRCGK 488
Query: 288 CKKHFCLECDIYIHESLHNCPGCES 312
C+ FC++CD+++HESLHNCPGCE+
Sbjct: 489 CQNDFCIDCDVFVHESLHNCPGCEN 513
>gi|71022397|ref|XP_761428.1| hypothetical protein UM05281.1 [Ustilago maydis 521]
gi|46101297|gb|EAK86530.1| hypothetical protein UM05281.1 [Ustilago maydis 521]
Length = 539
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 127/357 (35%), Positives = 187/357 (52%), Gaps = 62/357 (17%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R V + FV E+FDQNP+ Q+ ++ + G+A L +GG+ H +L K
Sbjct: 163 DMRPNRFDVTLQYAREFVGEYFDQNPIGQLSIIGTRSGIAERLAMMGGNTVDHTASLSNK 222
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L G+ SLQNAL++ + L +P+ RE+L ++ +L+TCDPG+I +TI + I
Sbjct: 223 RRLEPRGEPSLQNALEMARSSLVHLPASNSREILAIFGSLTTCDPGNIHDTIATLVKDNI 282
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP-------------A 166
R S++ L+AE+ + K +C TGG++SVAL+E HF + + E PPP
Sbjct: 283 RVSIVHLAAEVKVFKDVCTRTGGTFSVALNEGHFHDSLFELVPPPAVEGKPRRTRQHMVG 342
Query: 167 IAEFAI---------------ANLIKMGFPQR-AGEGSISICSCHKEVKVGVGYTCPRCK 210
IA+ + +L++M FP R + ++C+CH + G GY CPRC
Sbjct: 343 IADGSTAMDAEDDDEDGVQAGVDLLQMAFPLRLPAHAAPTLCACHSRSR-GSGYLCPRCG 401
Query: 211 ARVCELPTDCRICGLQLVSSPHLARSYHHLFPI-----APFDEVTPLCLNDPRNRSRSTC 265
+VC +PTDC +CG+ +V S HLARSYHHLFP+ P+ VTP S C
Sbjct: 402 VKVCNVPTDCPVCGITIVMSTHLARSYHHLFPVPNWKAVPWSSVTP--------DSDGAC 453
Query: 266 FGC-----------------QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLH 305
F C ++L +G P C +C FCLECD ++HE LH
Sbjct: 454 FSCNVPFPSLQERKEKSAAANKALEEAGLSPSSRYRCGRCAIDFCLECDAFVHEQLH 510
>gi|239614970|gb|EEQ91957.1| RNA polymerase TFIIH complex subunit Ssl1 [Ajellomyces dermatitidis
ER-3]
gi|327356553|gb|EGE85410.1| RNA polymerase TFIIH complex subunit Ssl1 [Ajellomyces dermatitidis
ATCC 18188]
Length = 525
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 133/361 (36%), Positives = 197/361 (54%), Gaps = 54/361 (14%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV EFF+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 140 DLRPTRYLLTLRYAQEFVLEFFEQNPISQLGVLGMRDGLAVRISDMSGNPTDHILAIQSL 199
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQN L++ +G L + P++G REV I++ AL + DPGDI +TI KI
Sbjct: 200 RPKDPKGMPSLQNTLEMARGALFRTPTHGTREVFIIFGALLSSDPGDIHKTINTLVADKI 259
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIME--------------HA 161
R S+IGL+A++ IC+ +C T Y VAL+E HF+EL M
Sbjct: 260 RVSIIGLAAQVAICRDICARTNNGDDSGYGVALNEQHFRELFMNVTTPPATTVAPTPTTT 319
Query: 162 PPPPAIAEFAIANLIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTD 219
++ ++L+ MGFP R + + ++C+CH + GY C RC A+VC LPT
Sbjct: 320 KEETKTSQTTTSSLLMMGFPSRTLSPTTTPTLCACHSKPSR-TGYLCSRCGAKVCTLPTS 378
Query: 220 CRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL------- 272
C CGL L+ S HLARSYHHLFP+ + EV+ R RS ++CF C
Sbjct: 379 CPCCGLTLILSTHLARSYHHLFPLMNWVEVS--WRRAARKRS-ASCFACAVGFPRVPKQF 435
Query: 273 -----------------LSSGNKPGLYVA----CPKCKKHFCLECDIYIHESLHNCPGCE 311
+ G+ V+ CP C+ HFC++CD++ HE +HNCPGC+
Sbjct: 436 SGGEQEGAGEEQGEGEGETVKGPSGISVSGRYECPVCECHFCIDCDVFAHEVVHNCPGCQ 495
Query: 312 S 312
S
Sbjct: 496 S 496
>gi|324518504|gb|ADY47122.1| General transcription factor IIH subunit 2, partial [Ascaris suum]
Length = 386
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 120/313 (38%), Positives = 174/313 (55%), Gaps = 21/313 (6%)
Query: 5 PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
P+R+ V K + F+ +F +QNP+SQ+G++ +D A L L G+ + +AL + +
Sbjct: 87 PTRLNVTLKVLNQFLEKFSEQNPISQVGIIICRDKRAERLIQLTGTIRAVKEALSTINEA 146
Query: 63 GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
C G+ SLQN L + L P + RE++ + ++LSTCDP +I T + K + +RCS
Sbjct: 147 SCHGEFSLQNGLLVALRSLQHFPGHASREIIAIVASLSTCDPSNIFGTFEVLKRNNVRCS 206
Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
VIGLSAE+F+ K LC T G Y V LD HF ELI+ PP + +++MGFP
Sbjct: 207 VIGLSAEVFVYKKLCSTTSGRYDVILDGDHF-ELILNEYTNPPVTKRNVESCVVRMGFPS 265
Query: 183 RAGEGSISICSCHKEVKV---GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
+ C CH+ G G+ CP+C AR C LP +CR+C L L+S+P LARS+HH
Sbjct: 266 HQIIQKPAFCLCHQSENRPPGGRGFLCPQCGARYCSLPVECRVCKLMLISAPQLARSFHH 325
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
L P+ F EV + + CFGC + L AC C ++C++CD+
Sbjct: 326 LLPLPAFKEV---------DTTSGICFGCAKPLEQKS------FACKSCDANYCIDCDLL 370
Query: 300 IHESLHNCPGCES 312
+HESL CP C S
Sbjct: 371 LHESLQLCPACPS 383
>gi|443923297|gb|ELU42560.1| RNA polymerase II transcription factor [Rhizoctonia solani AG-1 IA]
Length = 877
Score = 226 bits (577), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 117/288 (40%), Positives = 172/288 (59%), Gaps = 25/288 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + AFV E+ +QNPL QIG+V ++ G+ + ++ G+P ++A+ K
Sbjct: 176 DLRPTRFELALDCARAFVVEWCEQNPLGQIGVVGMRAGIGERIVEMTGNPHDVLRAIADK 235
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L +G+ SLQNA+++ + + + E++I++ +L+TCDPGDI ET+ C + KI
Sbjct: 236 RKLEPAGEPSLQNAIEVARAGMRHV-RLTREEIVIIFGSLTTCDPGDIYETLDSCVKDKI 294
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFA-------- 171
R S++ L+AEM ICK LC+ T GS+ VAL+E HFK+L+ EH PPP A
Sbjct: 295 RISLVALAAEMKICKELCEKTEGSFGVALNEGHFKDLLFEHIPPPAHRAARTGNDKIPSS 354
Query: 172 ----------IANLIKMGFPQRAGEGSI-SICSCHKEVKVGVGYTCPRCKARVCELPTDC 220
A+L+ MGFP R S ++C CH G+ CPRC A++CE+PTDC
Sbjct: 355 QTGKANPLNPNADLMLMGFPTRLPPTSAPALCVCHPSRMRAEGFLCPRCSAKLCEVPTDC 414
Query: 221 RICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC 268
+CGL +VSSPHLARSYHHLFP+ + + + P + +TC GC
Sbjct: 415 DVCGLMVVSSPHLARSYHHLFPVGAYTAINEI---GPDDTPSATCQGC 459
>gi|171689514|ref|XP_001909697.1| hypothetical protein [Podospora anserina S mat+]
gi|170944719|emb|CAP70830.1| unnamed protein product [Podospora anserina S mat+]
Length = 501
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 139/376 (36%), Positives = 192/376 (51%), Gaps = 75/376 (19%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D P+R A+ FV FF QNP+SQ+G++ ++DG+A ++D+ G+P HI+ L
Sbjct: 108 DLLPNRFALTFSYAMEFVNTFFQQNPISQLGIIGMRDGIAVRISDMSGNPVEHIEKLRQW 167
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G+ SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI +TI +I
Sbjct: 168 ALKDPIGNPSLQNALEMCRGHLYHTPSHGTREVLIIYGALLSSDPGDISDTITSLIADRI 227
Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPA-IAEFAIAN 174
R S+IGL+A++ IC LC T + Y +AL E HF+EL + PP AE + A+
Sbjct: 228 RVSIIGLAAQVAICAELCARTNDNDDSQYRIALHEQHFRELFLAATTPPVTHEAEQSNAS 287
Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPH 232
L+ MGFP R A + +S+C+CH + GYTC RC+ +VC LP C +CGL L+ S H
Sbjct: 288 LLMMGFPSRSLASKDFVSLCACHNK-PTREGYTCTRCRIKVCRLPASCPVCGLTLILSIH 346
Query: 233 LARSYHHLFPIAPFDEVTPLCLNDPRNRSRST--CFGCQQSLL----------------S 274
LARSYHHLFP+ + V P ++ + CF CQ S
Sbjct: 347 LARSYHHLFPLKSWVAV-------PWTEAKKSVACFSCQTPFPPVPKAAPPKIKLKVKES 399
Query: 275 SG---------------------------NKPGLY-----VACPKCKKHFCLEC------ 296
SG PGL + ++ C EC
Sbjct: 400 SGVGGQTAANIAKAKGRGEVPAKTNTVTAPTPGLLPEAIKAGVSESGRYKCPECEEHFCI 459
Query: 297 --DIYIHESLHNCPGC 310
DIY HE++HNCPGC
Sbjct: 460 DCDIYAHETIHNCPGC 475
>gi|451846168|gb|EMD59479.1| hypothetical protein COCSADRAFT_41319 [Cochliobolus sativus ND90Pr]
Length = 496
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 122/364 (33%), Positives = 187/364 (51%), Gaps = 60/364 (16%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + + ++REFF+QNP+SQ+ ++ + DGV +++L G+P H+ A+ G
Sbjct: 118 DMRPNRFITMINYTQDYIREFFEQNPISQMSVLGMHDGVCIRVSELSGNPAEHVAAIQGL 177
Query: 62 LGCS------GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCK 115
G SLQNAL+L + L P++G REV++++ +L + DPGDI ++++ C
Sbjct: 178 RSKDDGKEPKGSPSLQNALELARATLYHTPNHGTREVIVVFGSLLSLDPGDIHQSVKACV 237
Query: 116 ESKIRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP----PAI 167
+IR S+IG+ A + IC + + T Y++A D+ +EL+ PP P +
Sbjct: 238 RDRIRVSIIGMGARLKICTEIVKRTNAGDESEYTIATDQEMLRELLQATTTPPVIRQPLL 297
Query: 168 AEF-----------AIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCEL 216
+ + A L+ MGFP R E +++C+CH + +G GYTC RC A+VC L
Sbjct: 298 TDKTAPVTTTQSSESAAALMMMGFPSRVVEDQLTMCACHGNLTMG-GYTCSRCSAKVCSL 356
Query: 217 PTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-------- 268
P C C L L+ S HLARSYHHLFP+ + V+ + R C GC
Sbjct: 357 PITCPSCQLTLLLSTHLARSYHHLFPLRNWATVS---WSRAREMGSKQCVGCLAAFSNPP 413
Query: 269 ----------------------QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHN 306
S ++ G Y C C+ HFC++CD++ H LHN
Sbjct: 414 SNAKENGEAKRDSEAKVAKKDDDDSEEQKASESGRY-ECRACESHFCIDCDMFAHMVLHN 472
Query: 307 CPGC 310
CPGC
Sbjct: 473 CPGC 476
>gi|367035208|ref|XP_003666886.1| hypothetical protein MYCTH_2311995 [Myceliophthora thermophila ATCC
42464]
gi|347014159|gb|AEO61641.1| hypothetical protein MYCTH_2311995 [Myceliophthora thermophila ATCC
42464]
Length = 589
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 125/298 (41%), Positives = 174/298 (58%), Gaps = 35/298 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D P+R + FVRE+F+QNP+SQ+G+V ++DG+A ++D+GG+P HI+ L
Sbjct: 117 DLLPNRYLLTLNYAVDFVREYFEQNPISQLGIVGMRDGIAVRISDMGGNPAEHIEKLRTW 176
Query: 61 --KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
+ G+ SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI +TI K +
Sbjct: 177 AEQQEPQGNPSLQNALEMCRGALFHTPSHGTREVLIIYGALLSSDPGDIHDTIAKLLNDR 236
Query: 119 IRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKEL--------IMEHAPPPPA 166
IR SV+GL+A++ IC+ LC T G SY+VAL E HF+EL + + PPPP
Sbjct: 237 IRVSVVGLAAQVAICEQLCTRTNGGDPSSYAVALHEQHFRELFLAATTPPVTQSTPPPPG 296
Query: 167 IAE--------------FAIANLIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCK 210
+ A+L+ MGFP R A + +S+C+CH GY C RC+
Sbjct: 297 AGDGTNNNNNSQQQSQQSQQASLLIMGFPSRALASKDHVSLCACHNR-PTREGYACTRCR 355
Query: 211 ARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC 268
+VC LP +C +CGL LV S HLARSYHHLFP+ + V+ R +S CF C
Sbjct: 356 TKVCRLPAECPVCGLTLVLSTHLARSYHHLFPLRGWVAVS--WAEAARAKS-GVCFAC 410
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 18/29 (62%), Positives = 24/29 (82%)
Query: 284 ACPKCKKHFCLECDIYIHESLHNCPGCES 312
ACP C HFC++CD++ HE +HNCPGC+S
Sbjct: 486 ACPVCGNHFCIDCDVFAHEMIHNCPGCQS 514
>gi|326433104|gb|EGD78674.1| hypothetical protein PTSG_01653 [Salpingoeca sp. ATCC 50818]
Length = 394
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 122/317 (38%), Positives = 193/317 (60%), Gaps = 24/317 (7%)
Query: 3 FRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPE---SHIKALM 59
+P+R+ + FV FF +NP+S +G++ +++GVA + +L G+ + H+ AL+
Sbjct: 73 LKPNRLTCTVHNLIDFVHSFFHENPISDMGVIVMREGVARVICELNGNKDVLVEHLSALL 132
Query: 60 ---GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKE 116
L +G++SLQNAL + + L IPS+ REV+I+ + L++CDPG+I++TI+ +
Sbjct: 133 HPKTPLFPAGEASLQNALAVARRSLRSIPSHATREVVIVQATLASCDPGNILDTIKSLRS 192
Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLI 176
K+ +V+GL+A + IC+ +C++TGG ++VALDE H EL++ PP A A+ +
Sbjct: 193 DKVTVNVVGLAAAVRICETVCKETGGIHAVALDEDHLGELLLALTTPPAAPADADATQM- 251
Query: 177 KMGFPQRAGEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
++GF A EG K+ ++G+ GY CP+C+A+V ++P C CGL LVSSPHL
Sbjct: 252 RLGFA--AYEGRRPTVRRQKDGQLGITTGGYVCPQCRAKVSDIPQRCTTCGLMLVSSPHL 309
Query: 234 ARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFC 293
A+++HH+FP+ F EVT + C GC Q L ++ P CP C C
Sbjct: 310 AKTFHHMFPLPVFKEVTLA--------AAEWCAGCTQPLPTT--LPAY--ECPSCGARVC 357
Query: 294 LECDIYIHESLHNCPGC 310
L+CD+Y+H LHNCPGC
Sbjct: 358 LDCDVYVHRELHNCPGC 374
>gi|358340016|dbj|GAA47968.1| transcription initiation factor TFIIH subunit 2 [Clonorchis
sinensis]
Length = 929
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 121/290 (41%), Positives = 167/290 (57%), Gaps = 22/290 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ + V FVR++FDQNP+SQ+G++ D A LT+L G+P H+ AL +
Sbjct: 606 DLKPTRLVCTLRAVAEFVRDYFDQNPISQLGIIVTADRQAERLTELSGNPRCHLAALETL 665
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
CSG+ SLQNAL L + L P H E+L++ ++L+TCDPGDI +TIQ + I
Sbjct: 666 FTRPCSGEPSLQNALTLAESRLKYTPH--HSEILVIMASLTTCDPGDIHKTIQSLAANHI 723
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSV+ L+ E+F+ + L Q T G + V LDE H K ++ PPP A + A A LI+M
Sbjct: 724 RCSVVSLAVEVFVYRALAQITQGQFHVILDEPHLKTVLKNFVPPPAATVD-APATLIRMA 782
Query: 180 FPQRAG----EGSISICSCHKEVKVGVG------YTCPRCKARVCELPTDCRICGLQLVS 229
FP S +C CH + Y CPRC A CELP +C +CGL LV+
Sbjct: 783 FPHSETFDLRNFSTVLCMCHLNQQGSSTDSSHPQYACPRCCAAYCELPVECTVCGLTLVA 842
Query: 230 SPHLARSYHHLFPIAPFDEVTPLCL--NDPRNRSRS-----TCFGCQQSL 272
+PHLAR+YHHLFP+ F ++P L +P R+ TC GC L
Sbjct: 843 APHLARAYHHLFPLDAFTPISPAELRQTNPDQLPRTETGIITCAGCDVVL 892
>gi|396458460|ref|XP_003833843.1| hypothetical protein LEMA_P066040.1 [Leptosphaeria maculans JN3]
gi|312210391|emb|CBX90478.1| hypothetical protein LEMA_P066040.1 [Leptosphaeria maculans JN3]
Length = 618
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 129/372 (34%), Positives = 187/372 (50%), Gaps = 67/372 (18%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM-- 59
D RP+R + + +VREFF+QNP+SQ+ ++ + DGV +++L G+P H+ A+
Sbjct: 219 DMRPNRFFAMINYTQDYVREFFEQNPISQMSVLGMHDGVCIRVSELSGNPAEHVAAIASL 278
Query: 60 -----GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKC 114
GK G SLQNAL+L + L P +G REV+I++ +L + DPGDI T++ C
Sbjct: 279 RSKEDGK-EPKGAPSLQNALELARATLYHTPPHGTREVIIVFGSLLSLDPGDIHTTVKAC 337
Query: 115 KESKIRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF 170
+IR S+IG+ A + IC + T Y+VA+D+ KEL++ PP + E
Sbjct: 338 VRDRIRVSIIGMGARLKICTEIVTRTNAGDTSEYTVAIDQEALKELLLA-TTTPPVVRET 396
Query: 171 ----------AIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDC 220
+ A L++MGFP R E +++C+CH + +G GYTC RC A+VC LP C
Sbjct: 397 KQSGTEPPPESAAALMRMGFPSRVIEDQLTMCACHGNLTLG-GYTCSRCSAKVCSLPITC 455
Query: 221 RICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC------------ 268
C L L+ S HLARSYHHLFP+ + V+ R + C GC
Sbjct: 456 PSCQLTLLLSTHLARSYHHLFPLRNWATVS---WARAREVGSTQCTGCLAAFSKPPPKDN 512
Query: 269 -------------------------QQSLLSSGNK---PGLYVACPKCKKHFCLECDIYI 300
Q L ++ P C C HFC++CD++
Sbjct: 513 NRHVVEDEDEDDDATQPRKHAASTTQDDALDGEDQRASPSGRYECQACNSHFCIDCDVFA 572
Query: 301 HESLHNCPGCES 312
H LHNCPGC S
Sbjct: 573 HMVLHNCPGCLS 584
>gi|428671746|gb|EKX72661.1| conserved hypothetical protein [Babesia equi]
Length = 409
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 118/312 (37%), Positives = 177/312 (56%), Gaps = 21/312 (6%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MDF+P R+ V+ FV+E F Q P++QIGL+ +++ V+ ++ LG +P+ I+ L
Sbjct: 107 MDFKPDRLYCAFGAVKEFVKELFHQGPITQIGLIGLRNKVSTMISQLGTNPDEQIELLGN 166
Query: 61 KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
L G SG +SLQN L++ +LS +PSY REVLI++ + T DPG+I+ T+ K KE+
Sbjct: 167 ALKEGPSGTASLQNGLEMALTILSTLPSYTTREVLIIFGSNRTLDPGNILATLYKLKENH 226
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
I + I L+ E++I K++C +TGG SVA+D +H + L+ + PPP + L K+
Sbjct: 227 ICVNAISLAPELYILKNICTETGGMCSVAMDAAHLRTLLNDFTIPPPW-HNWMEPVLTKV 285
Query: 179 GFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
FP + S+C+CH + Y CP+C ++ C +PT CR CG+ LVS P ++R+++
Sbjct: 286 AFPPLKKTTTASLCACHSNL-THTAYICPQCHSKSCSIPTRCRCCGIYLVSPPDISRAFY 344
Query: 239 HLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
HL P F + D R TC S+G+ CP C FC CD
Sbjct: 345 HLIPPKTF-------IKDSRTGRCDTC----NYETSNGS------TCPDCASFFCEYCDA 387
Query: 299 YIHESLHNCPGC 310
YIH LH CP C
Sbjct: 388 YIHNELHQCPHC 399
>gi|452824298|gb|EME31302.1| transcription initiation factor TFIIH subunit H2 isoform 2
[Galdieria sulphuraria]
Length = 334
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 110/250 (44%), Positives = 160/250 (64%), Gaps = 3/250 (1%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +PSR V + F+ +F++NP+SQ+ +V ++DGVA L+ +G +P H + +
Sbjct: 78 DVKPSRGEVCLSCAQKFLYSYFNENPISQLAVVVLRDGVAEKLSSMGSNPRQHSEVVKNA 137
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ G G+ SLQN LD+ LL IPSYG REVLILY+++S+CDPGDI +TI+K ++ +I
Sbjct: 138 NQKGFYGNCSLQNGLDVALSLLHSIPSYGSREVLILYNSISSCDPGDIRQTIEKLEKERI 197
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RC+VIG++AE++I K+L T GSY V ++ESH EL+ + P I L++MG
Sbjct: 198 RCNVIGMAAELYILKYLAARTHGSYFVCMNESHLLELLEDFVVPSALIENNTKTALVRMG 257
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP +C K ++ V + CPRC+ E+P +C +CGL LVSS LARSYHH
Sbjct: 258 FPTLKAYKEPKVCLNDKVLRNQV-FVCPRCECCYGEIPIECVLCGLILVSSSQLARSYHH 316
Query: 240 LFPIAPFDEV 249
LFP+A F E+
Sbjct: 317 LFPVANFHEL 326
>gi|260943474|ref|XP_002616035.1| hypothetical protein CLUG_03276 [Clavispora lusitaniae ATCC 42720]
gi|238849684|gb|EEQ39148.1| hypothetical protein CLUG_03276 [Clavispora lusitaniae ATCC 42720]
Length = 459
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 130/354 (36%), Positives = 189/354 (53%), Gaps = 47/354 (13%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D RP+R ++ FV EFFDQNP++Q+ +V +++GVA+ +++L GS + HI+ L
Sbjct: 107 DLRPTRFGATVAHLQDFVAEFFDQNPIAQMAVVMMRNGVASVVSELSGSAQLHIERLRAL 166
Query: 61 -----KLGCSGDSSLQNALDLVQGLLSQ---IPSYGHREVLILYSALSTCDPGDIMETIQ 112
L GD SLQNAL+L + LL+ S RE+++++ AL T DPGDI +TI
Sbjct: 167 RARQHTLEPKGDPSLQNALELARALLTAGGGPTSRSSREIVVVFGALFTSDPGDIHQTID 226
Query: 113 KCKESKIRCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPPPAIA 168
+IR VIGL+A++ IC+ L T Y V + ++HF+EL+M P A+A
Sbjct: 227 SLVRDEIRVRVIGLAAQVAICRELVARTNAGECAGYGVIVGDAHFRELLMACVEPL-AVA 285
Query: 169 EFAIAN-----LIKMGFPQRA--GEGSISICSCHKEVK-VGVGYTCPRCKARVCELPTDC 220
A +++MGFP RA G +C+CH + GY CP+C++RVC LP C
Sbjct: 286 RRAAETATGVPVLRMGFPSRAPAAPGMPVLCACHPQGGGADTGYWCPQCRSRVCSLPCVC 345
Query: 221 RICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRST---------------- 264
+CGL L+ S HLAR++HHL P ++ N S +
Sbjct: 346 PVCGLMLILSTHLARTHHHLVPPPDYETKEETKETKETNDSNNDNSNSNNDSSNSNNDSS 405
Query: 265 ------CFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
CFGC L G AC C + FC++CD+++HE LHNCPGCE+
Sbjct: 406 GNSSERCFGC---LAQGDGALGARYACAACGQRFCIDCDVFVHEVLHNCPGCEA 456
>gi|238610326|ref|XP_002397695.1| hypothetical protein MPER_01834 [Moniliophthora perniciosa FA553]
gi|215472703|gb|EEB98625.1| hypothetical protein MPER_01834 [Moniliophthora perniciosa FA553]
Length = 251
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 111/248 (44%), Positives = 166/248 (66%), Gaps = 13/248 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLG-GSPESHIKALMG 60
D RP+R +++ + FV E+FDQNPL Q+G+V + + ++ G+P+ ++++
Sbjct: 5 DMRPTRFSLMLQYAREFVLEWFDQNPLGQMGVVVPIEVGSLIYSNFEPGNPQDVLRSISD 64
Query: 61 --KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
KL GD SLQNA+D+ + ++ +P++ RE++IL+ +L+T DPG+I ET+ +C ++K
Sbjct: 65 RHKLEPMGDPSLQNAIDMARNSMNHLPTHSSREIVILFGSLTTVDPGNIHETLDECVKNK 124
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPP--------AIAEF 170
IR S++ L+AEM IC+ LC TGG + VAL+E HFK+L+ E PPP A +
Sbjct: 125 IRISLVALAAEMRICRDLCIKTGGEFGVALNEGHFKDLLFELIPPPAQRAISKAGAGSTN 184
Query: 171 AIANLIKMGFPQRAGEGS-ISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
A ANL+ MGFP R + S S+C CH ++K G+ CPRC A+VC++PTDC ICGL +VS
Sbjct: 185 ASANLMMMGFPTRLPDASPPSLCVCHSDLKSD-GFLCPRCLAKVCDVPTDCDICGLMIVS 243
Query: 230 SPHLARSY 237
SPHLARSY
Sbjct: 244 SPHLARSY 251
>gi|406697522|gb|EKD00781.1| component of RNA polymerase transcription factor TFIIH, Ssl1p
[Trichosporon asahii var. asahii CBS 8904]
Length = 436
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 115/291 (39%), Positives = 172/291 (59%), Gaps = 33/291 (11%)
Query: 15 VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK--LGCSGDSSLQN 72
+ +FV E+FDQNPL QIG++ +++ +A L + G+P+ I+AL K L +G+ SLQN
Sbjct: 112 LRSFVIEWFDQNPLGQIGIILLRNRLAQTLVPMCGNPQEIIEALADKRKLEPAGEPSLQN 171
Query: 73 ALDLVQGLLSQIPSYGHREVLILYSALSTCDPG---DIMETIQKCKESKIRCSVIGLSAE 129
L + +G +S +PS EVL+L+SA+ST DP +I + + + +++R ++I LSAE
Sbjct: 172 GLVMARGGMSHLPSTSSLEVLVLFSAISTADPDGSMNIHQVLDELVSARVRVTIISLSAE 231
Query: 130 MFICKHLCQDTGGSYSVALDESHFKELIMEHAPPP------PAIAEF------------- 170
+ ICK + + TGG + VA+DE H+KEL+ E PPP P A
Sbjct: 232 IKICKQIAERTGGRFGVAIDEDHYKELLWETIPPPAQTITAPVTANVREALARGGRQAPG 291
Query: 171 ------AIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICG 224
+ +L+ MGFP R S C+CH +K G GY CPRC A++C++PTDC +CG
Sbjct: 292 APKRPPPLGDLLVMGFPTRLPGAGESFCACHGLLKRG-GYMCPRCGAKLCDVPTDCEVCG 350
Query: 225 LQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSS 275
L +VSSPHLARS+ LFP++ +D + L + R+ CFGC +S
Sbjct: 351 LMVVSSPHLARSFWLLFPVSKYDTLGRGVLT--KEAVRAECFGCDAPFPTS 399
>gi|167376758|ref|XP_001734134.1| TFIIH basal transcription factor complex p44 subunit [Entamoeba
dispar SAW760]
gi|165904499|gb|EDR29715.1| TFIIH basal transcription factor complex p44 subunit, putative
[Entamoeba dispar SAW760]
Length = 362
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 116/319 (36%), Positives = 175/319 (54%), Gaps = 18/319 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D++P+R+ + +++ F EF+ NPLS+IGL+ ++ A S I+ M
Sbjct: 54 DYQPNRITFIQTKLKEFFEEFYIDNPLSEIGLICTRNKTAE--QQFPLSRNLIIQKEMKW 111
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
C G SLQ +LDL Q +LS + RE++I+ S+++TCDPG+I +TI+ K++ IRC
Sbjct: 112 GNCEGQMSLQISLDLAQQILSNTSNSSSREIIIVTSSITTCDPGNIYDTIETLKKTTIRC 171
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM--G 179
SVI + EM I K L + TGG Y ++E H + ++ PP I++ G
Sbjct: 172 SVISFAPEMHITKLLTKITGGDYHTIMNEKHAENVLNTFIIPPIYKENTYEPKTIQLYIG 231
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP++ +IC CH ++ + + CP C CELP C+IC L+ S H ARSYH
Sbjct: 232 FPKQLNIP--TICECHSKIDIN-HFECPICNFCYCELPIQCKICNAILLLSHHFARSYHF 288
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNK----PGLYVACPKCKKHFCLE 295
+FPI PF +T + + + CFGC + + K P C KC K++C E
Sbjct: 289 MFPIEPFKIITMI-------KPQEKCFGCGKEIDDRIEKKEEEPVNIYQCKKCLKYYCNE 341
Query: 296 CDIYIHESLHNCPGCESLR 314
CD +IH+ L+NCPGCES +
Sbjct: 342 CDSFIHDVLYNCPGCESKK 360
>gi|399216155|emb|CCF72843.1| unnamed protein product [Babesia microti strain RI]
Length = 408
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 119/317 (37%), Positives = 172/317 (54%), Gaps = 11/317 (3%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
+DF+P+R+ Q F+RE F Q P+SQ+G+V +K+ +A ++ G + + I L
Sbjct: 94 IDFKPNRLHTAVAQTSKFIRELFYQCPISQVGIVAMKNKLAKLISPFGSNTDEQISRLND 153
Query: 61 KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
L G GD SLQNAL++ LL +P Y RE+LI++ ++ TCDPG+I TI++ K+
Sbjct: 154 LLSDGPEGDISLQNALEMSISLLDDVPQYTTREILIVFGSIKTCDPGNIFATIKRLKKHH 213
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
I I LS+E++I KH+C +T GS+SVA+D L + PP + + L+K+
Sbjct: 214 ITVDAISLSSELYILKHICTETNGSHSVAIDTHDLTSLFNIYT-NPPRWQDGMLPILVKV 272
Query: 179 GFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
GFP + S+C CH ++ V YTCP+C A C +P+ C+ C + LVS + R +H
Sbjct: 273 GFPSLTKSNTASLCVCHGKL-VFTSYTCPQCSAITCNIPSRCKCCQIYLVSPADVCRGFH 331
Query: 239 HLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVA--CPKCKKHFCLEC 296
HL F V L TC C L +S G A CP C K FC C
Sbjct: 332 HLCAPPEFTNVPADILG-----GCDTCASCSVELKNSVMYSGEDRAKQCPGCMKIFCRIC 386
Query: 297 DIYIHESLHNCPGCESL 313
D +IH +LH CP C L
Sbjct: 387 DEFIHGTLHQCPFCIKL 403
>gi|225678604|gb|EEH16888.1| TFIIH basal transcription factor complex p44 subunit
[Paracoccidioides brasiliensis Pb03]
Length = 493
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 116/288 (40%), Positives = 158/288 (54%), Gaps = 22/288 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + + + FVREFFDQNP+SQ+G+V ++DG+A ++D+ G+P HI A+ G
Sbjct: 121 DLRPTRYLLTLRYAQDFVREFFDQNPISQLGVVGMRDGLAVRISDMSGNPTEHILAIQGL 180
Query: 62 LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI +TI KI
Sbjct: 181 RAKDPKGMPSLQNALEMARGALFHTPSHGTREVLIIYGALLSSDPGDIHKTITSLITDKI 240
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEFAI--- 172
V+GL+A++ IC+ L T Y+VA++E HF+EL++ PP
Sbjct: 241 HVYVLGLAAQVSICQELVTRTNNGDDSGYNVAMNEQHFRELVLNVTTPPATTLASHTAAA 300
Query: 173 ------------ANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDC 220
L+ MGFP R ++C+CH GY CPRC +VC LP C
Sbjct: 301 AANGTGTNPSTDGTLLPMGFPNRHLTPHPTLCACHSTPSRS-GYLCPRCCTKVCTLPASC 359
Query: 221 RICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC 268
C L L+ S HLARSYHHLFP+ + EV+ R CF C
Sbjct: 360 PSCNLTLILSTHLARSYHHLFPLMNWVEVSWRKAARAEAEGRVGCFAC 407
>gi|432876344|ref|XP_004073002.1| PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH
subunit 2-like [Oryzias latipes]
Length = 340
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 130/315 (41%), Positives = 171/315 (54%), Gaps = 62/315 (19%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVT--VKDGVANCLT--DLGGSPESHIKA 57
D +P+R+A K +E FV E+FDQNP+SQ+G + + D L + G+P+ HI A
Sbjct: 75 DLKPNRLAATLKLLEGFVDEYFDQNPISQVGKXSSYLIDDHRRALNTGEASGNPKKHITA 134
Query: 58 LMGKL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCK 115
L + C G+ SL NAL+L L +P + REVLI+ S+L+TCDP +I + IQ K
Sbjct: 135 LKKAVDSACVGEPSLYNALNLAVQTLKHMPGHTSREVLIILSSLTTCDPANIYDLIQTLK 194
Query: 116 ESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
K+R SVIGLSAE+ +C L ++TGGSY V LDESHFKEL+M H PPPA + + +L
Sbjct: 195 GLKVRVSVIGLSAEVRVCTVLTRETGGSYHVILDESHFKELLMLHVKPPPA-SSSSECSL 253
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
I+MG L LVS+PHLAR
Sbjct: 254 IRMG---------------------------------------------LTLVSAPHLAR 268
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLE 295
S+HHLFP+ PF E + L R C GCQ L YV CP C+ FC+E
Sbjct: 269 SFHHLFPMVPFLESSVEELQGSR-----FCEGCQGEL----RDKSAYV-CPSCRGVFCVE 318
Query: 296 CDIYIHESLHNCPGC 310
CD++IH+SLH CP C
Sbjct: 319 CDLFIHDSLHCCPCC 333
>gi|116199505|ref|XP_001225564.1| hypothetical protein CHGG_07908 [Chaetomium globosum CBS 148.51]
gi|88179187|gb|EAQ86655.1| hypothetical protein CHGG_07908 [Chaetomium globosum CBS 148.51]
Length = 463
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 124/371 (33%), Positives = 177/371 (47%), Gaps = 99/371 (26%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--- 58
D P+R + FVRE+F+QNP+SQ+G+V ++DG+A ++D+GG+P HI+ L
Sbjct: 118 DLLPNRYLLTLNYAVDFVREYFEQNPISQLGIVGMRDGIAVRISDMGGNPAEHIEKLRVW 177
Query: 59 MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
+ G+ SLQNAL++ +G L PS+G RE+LI+Y AL + DPGDI +TI +
Sbjct: 178 AEQQEPQGNPSLQNALEMCRGALFHTPSHGTREILIIYGALLSSDPGDIHDTITNLLADR 237
Query: 119 IRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEFAIAN 174
IR S++GL+A++ IC+ LC T G +Y+VAL E HF+EL + PP
Sbjct: 238 IRVSIVGLAAQVAICEQLCSRTNGGDPSNYAVALHEQHFRELFLAATTPP---------- 287
Query: 175 LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
VC LP +C +C L LV S HLA
Sbjct: 288 --------------------------------------VCRLPAECPVCALTLVLSTHLA 309
Query: 235 RSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL---------------------- 272
RSYHHLFP+ + V+ D R R S CF C +
Sbjct: 310 RSYHHLFPLRGWAAVS---WADARARGSSACFACLAAFPAVPPSGQIQTQTQVQANGNGN 366
Query: 273 ------------------LSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES-L 313
++G AC C HFC++CD++ HE +HNCPGC+S
Sbjct: 367 GNGRDTDRDKGRDAAEVKAAAGVSESGRYACTVCNNHFCIDCDVFAHEVIHNCPGCQSDT 426
Query: 314 RHSNPIVANEG 324
R + + A +G
Sbjct: 427 RGTEGLQAADG 437
>gi|189195190|ref|XP_001933933.1| TFIIH basal transcription factor complex p44 subunit [Pyrenophora
tritici-repentis Pt-1C-BFP]
gi|187979812|gb|EDU46438.1| TFIIH basal transcription factor complex p44 subunit [Pyrenophora
tritici-repentis Pt-1C-BFP]
Length = 460
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 114/300 (38%), Positives = 167/300 (55%), Gaps = 30/300 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + + +VREFF+QNP+SQ+ ++ + DGV +++L G+P H+ A+ G
Sbjct: 118 DMRPNRFITMINFTQEYVREFFEQNPISQMSVLGMHDGVCIRVSELSGNPAEHVAAIQGI 177
Query: 62 LGCS------GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCK 115
G SLQNAL+L + L PS+G REV++++ +L + DPGDI +T++ C
Sbjct: 178 RSKDDGKEPKGAPSLQNALELARATLYHTPSHGTREVIVVFGSLLSLDPGDIHQTVKACV 237
Query: 116 ESKIRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEFA 171
+IR SVIG+SA + IC + T Y++A D+ KEL++ PP
Sbjct: 238 RDRIRVSVIGMSARLKICTEIVTRTNAGDESEYTIATDQEMLKELLLATTTPPVIRQSAP 297
Query: 172 IAN------------LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTD 219
+A L+ MGFP R E +++C+CH + +G GYTC RC A+VC LP
Sbjct: 298 VATTTAAPSPESAAALMMMGFPSRVVEDQLTMCACHGNLTMG-GYTCSRCSAKVCSLPIT 356
Query: 220 CRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP 279
C C L L+ S HLARSYHHLFP+ + V+ + R + C GC LSS +KP
Sbjct: 357 CPSCQLTLLLSTHLARSYHHLFPLRNWATVS---WSRAREKGSKECVGC----LSSFSKP 409
>gi|449019861|dbj|BAM83263.1| similar to TFIIH subunit SSL1 [Cyanidioschyzon merolae strain 10D]
Length = 549
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 121/361 (33%), Positives = 188/361 (52%), Gaps = 54/361 (14%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +PSR + + + FV++FF +N +SQ+ +V ++DG+A ++++ GS HI AL
Sbjct: 180 DLKPSRWSAILSGAQYFVQDFFSENAVSQLAIVILQDGIAAQVSEMSGSVRLHIDALRSL 239
Query: 61 --------KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQ 112
G G +SLQN L+L + + +P +G RE+LI+Y LSTCDPGDI +TIQ
Sbjct: 240 QRGTNKFLHDGDGGQTSLQNCLELGLRIHASVPRFGTREMLIVYGGLSTCDPGDIHKTIQ 299
Query: 113 KCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPP--------- 163
C ++ IR SVIGL AE++I + + T G+Y V +D HF +L+ + PP
Sbjct: 300 NCVDASIRVSVIGLGAEVYILRATAERTQGTYRVPMDNDHFHQLLRQQVPPPPPPPQASR 359
Query: 164 -----PPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPT 218
P+ A + ++ MGFP + + + + +G+ CPRC+ + E+P
Sbjct: 360 STKHLEPSEAPWTAMQMV-MGFPPKLARDRV-VVAIDTRKACRLGFECPRCRQWLSEVPG 417
Query: 219 DCRICGLQLVSSPHLARSYHHLFPIAPFDE-------VTPLCLNDPRNRS---------- 261
+C +CGL L S HLARSYHH+FP F+E ++D S
Sbjct: 418 NCILCGLTLTSVLHLARSYHHIFPAPRFEEREEPSSSRASALMSDEEGFSVSPSSVPERM 477
Query: 262 --------RSTCFGCQQSLLSSGNKPGLYVACPK-CKKHFCLECDIYIHESLHNCPGCES 312
+ C GC ++L K L + C C++ FC++CD +HE LH CP C +
Sbjct: 478 PDAAAPPVKWQCSGCSRTL---STKDDLQLQCTGICRQLFCIDCDQLVHERLHTCPACLA 534
Query: 313 L 313
+
Sbjct: 535 M 535
>gi|440296353|gb|ELP89180.1| TFIIH basal transcription factor complex p44 subunit, putative
[Entamoeba invadens IP1]
Length = 358
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 115/314 (36%), Positives = 172/314 (54%), Gaps = 19/314 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF P+R++ + ++ F FF NPLSQIGLV ++ A+ + L + + GK
Sbjct: 54 DFHPTRISFINTKMNNFFDNFFVDNPLSQIGLVATRNSTADIICPLTRNISKLKETAWGK 113
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
C+G S+Q +L+L L RE++ + SA+STCDPG+I ETI K +++ IR
Sbjct: 114 --CNGSISVQISLELCLQFLKDTKDMTSREIVFVTSAISTCDPGNIFETIGKVRDNLIRV 171
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPP--PAIAEFAIANLIKMG 179
SVI +SAE+ + + + + TGG Y ++E H +E+I PP A + L+ +G
Sbjct: 172 SVISISAEVHVMRQMTKVTGGDYFTIMNEKHAEEVIKTFIEPPRYKATGKEPKVELLFVG 231
Query: 180 FPQRAGEGSISICSCHKEVKVGVG-YTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
FP+ +IC CH K+G+G Y CP C CELP +C++C L+ S H ARS+H
Sbjct: 232 FPRELERS--TICECHS--KIGLGHYECPVCGFCYCELPVECKVCKANLIFSHHFARSFH 287
Query: 239 HLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
LFP+ PF + +P CF C + ++ + C KC K FC C+I
Sbjct: 288 FLFPVKPF-----VLEKEPE-----VCFACGKKYEEKKDEEMKFWKCEKCSKSFCETCNI 337
Query: 299 YIHESLHNCPGCES 312
+IH+ L NCPGCE+
Sbjct: 338 FIHDVLFNCPGCEN 351
>gi|336468464|gb|EGO56627.1| hypothetical protein NEUTE1DRAFT_84011 [Neurospora tetrasperma FGSC
2508]
gi|350289275|gb|EGZ70500.1| TFIIH basal transcription factor complex, subunit SSL1 [Neurospora
tetrasperma FGSC 2509]
Length = 416
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 113/254 (44%), Positives = 148/254 (58%), Gaps = 32/254 (12%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D P+R VV FVRE+F+QNP+SQ+G+V ++DG+A ++DL G+P HI+
Sbjct: 94 DLLPNRYRVVLNNAIGFVREYFEQNPISQLGIVGMRDGIAVRISDLSGNPAEHIE----- 148
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
+Q PS+ REVLI+Y AL + DPGDI +TI +IR
Sbjct: 149 --------------------NQTPSHATREVLIIYGALVSIDPGDIHDTINDLVADRIRV 188
Query: 122 SVIGLSAEMFICKHLCQDTG---GSYSVALDESHFKELIMEHAPPP-PAIAEFAIANLIK 177
SV+GL+ ++ IC LC+ T G+YSVA+DE H KEL PP E A+L+
Sbjct: 189 SVVGLAGQVAICSELCKRTNNHDGNYSVAVDEVHLKELFFAATTPPVTRTPEQNTASLLM 248
Query: 178 MGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
MGFP R A + +S C+CH + GYTCPRC +VC LP DC IC L L+ S HLAR
Sbjct: 249 MGFPSRTLAPKDHVSFCACHAK-PTREGYTCPRCGIKVCRLPIDCPICKLTLIQSTHLAR 307
Query: 236 SYHHLFPIAPFDEV 249
SYHHLFP+ F EV
Sbjct: 308 SYHHLFPLKVFVEV 321
>gi|330925787|ref|XP_003301194.1| hypothetical protein PTT_12637 [Pyrenophora teres f. teres 0-1]
gi|311324299|gb|EFQ90721.1| hypothetical protein PTT_12637 [Pyrenophora teres f. teres 0-1]
Length = 462
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 115/300 (38%), Positives = 168/300 (56%), Gaps = 30/300 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + + +VREFF+QNP+SQ+ ++ + DGV +++L G+P H+ A+ G
Sbjct: 118 DMRPNRFITMINFTQEYVREFFEQNPISQMSVLGMHDGVCIRVSELSGNPAEHVAAIQGL 177
Query: 62 LGCS------GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCK 115
G SLQNAL+L + L PS+G REV++++ +L + DPGDI +T++ C
Sbjct: 178 RSKDDGKEPKGAPSLQNALELARATLYHTPSHGTREVIVVFGSLLSLDPGDIHQTVKACV 237
Query: 116 ESKIRCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP------P 165
+IR SVIG+SA + IC + T Y++A D+ KEL++ PP P
Sbjct: 238 RDRIRVSVIGMSARLKICTEIVTRTNAGDESEYTIATDQEMLKELLLATTTPPVIRQSAP 297
Query: 166 AIAEFAI------ANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTD 219
A A+ A L+ MGFP R E +++C+CH + +G GYTC RC +VC LP
Sbjct: 298 AATTPAVPPPESAAALMMMGFPSRVVEDQLTMCACHGNLTMG-GYTCSRCSTKVCSLPIT 356
Query: 220 CRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP 279
C C L L+ S HLARSYHHLFP+ + V+ + R + C GC L S +KP
Sbjct: 357 CPSCQLTLLLSTHLARSYHHLFPLRNWATVS---WSRAREKGSKECVGC----LVSFSKP 409
>gi|295657028|ref|XP_002789089.1| suppressor of stem-loop protein [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226284987|gb|EEH40553.1| suppressor of stem-loop protein [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 513
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 115/290 (39%), Positives = 159/290 (54%), Gaps = 24/290 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + + + FVREFFDQNP+SQ+G++ ++DG+A ++D+ G+P HI A+ G
Sbjct: 121 DLRPTRYLLTLRYAQDFVREFFDQNPISQLGVLGMRDGLAVRISDMSGNPTEHILAIQGL 180
Query: 62 LGCS--GDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI +TI KI
Sbjct: 181 RAKDPKGMPSLQNALEMARGALFHTPSHGTREVLIIYGALLSSDPGDIHKTITSLITDKI 240
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPP---------- 165
V+GL+A++ IC+ L T Y+VA++E HF+EL++ PP
Sbjct: 241 HVYVLGLAAQVSICQELVTRTNNGDDSGYNVAMNEQHFRELVLNVTTPPATTLASHTTTT 300
Query: 166 -------AIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPT 218
+ L+ MGFP R ++C+CH GY CPRC +VC LP
Sbjct: 301 TGAANGTSTNTSTDGTLLPMGFPNRHLTPHPTLCACHS-TPSRSGYLCPRCCTKVCTLPA 359
Query: 219 DCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC 268
C C L L+ S HLARSYHHLFP+ + EV+ R CF C
Sbjct: 360 SCPSCKLTLILSTHLARSYHHLFPLMNWVEVSWRKAARAEAEGRVGCFAC 409
>gi|85080851|ref|XP_956615.1| suppressor of stem-loop protein 1 [Neurospora crassa OR74A]
gi|28881240|emb|CAD70478.1| related to transcription factor TFIIH chain p47 [Neurospora crassa]
gi|28917686|gb|EAA27379.1| suppressor of stem-loop protein 1 [Neurospora crassa OR74A]
Length = 416
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 112/254 (44%), Positives = 148/254 (58%), Gaps = 32/254 (12%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D P+R VV FVRE+F+QNP+SQ+G+V ++DG+A ++DL G+P HI+
Sbjct: 94 DLLPNRYRVVLNNAIGFVREYFEQNPISQLGIVGMRDGIAVRISDLSGNPAEHIE----- 148
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
+Q PS+ RE+LI+Y AL + DPGDI +TI +IR
Sbjct: 149 --------------------NQTPSHATREILIIYGALVSIDPGDIHDTINDLVADRIRV 188
Query: 122 SVIGLSAEMFICKHLCQDTG---GSYSVALDESHFKELIMEHAPPP-PAIAEFAIANLIK 177
SV+GL+ ++ IC LC+ T G+YSVA+DE H KEL PP E A+L+
Sbjct: 189 SVVGLAGQVAICSELCKRTNNHDGNYSVAVDEVHLKELFFAATTPPVTRTPEQNTASLLM 248
Query: 178 MGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
MGFP R A + +S C+CH + GYTCPRC +VC LP DC IC L L+ S HLAR
Sbjct: 249 MGFPSRTLAPKDHVSFCACHAK-PTREGYTCPRCGIKVCRLPIDCPICKLTLIQSTHLAR 307
Query: 236 SYHHLFPIAPFDEV 249
SYHHLFP+ F EV
Sbjct: 308 SYHHLFPLKVFVEV 321
>gi|302418082|ref|XP_003006872.1| suppressor of stem-loop protein [Verticillium albo-atrum VaMs.102]
gi|261354474|gb|EEY16902.1| suppressor of stem-loop protein [Verticillium albo-atrum VaMs.102]
Length = 468
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 123/348 (35%), Positives = 185/348 (53%), Gaps = 38/348 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D P+R+ + + FV+E+F+QNP+SQ+G+V ++DGVA ++++GG+P H++A+
Sbjct: 101 DLLPTRLRLTWRLAREFVKEYFEQNPISQMGIVGMRDGVAVRVSEMGGNPVEHVEAINKW 160
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
++ G SLQNAL++ +G L PS+G REVL++Y AL + DPGDI +TI +I
Sbjct: 161 AEIDPQGSPSLQNALEMCRGALFHAPSHGTREVLVVYGALLSSDPGDIHDTIANLLTDRI 220
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
R S++GL+A++ IC+ LC+ T Y++ALDE H++EL++ PP + + A+
Sbjct: 221 RVSIVGLAAQVAICEELCRKTNAGDATQYNIALDEMHYRELLLRTTTPPVTRTQAQSTAS 280
Query: 175 LIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS--- 229
L+ MGFP R A + + C+CH V GY C RC RVC LP +C CGL
Sbjct: 281 LLMMGFPSRTLAPADTTAYCACHN-TPVREGYLCTRCGTRVCRLPAECPACGLTPHPLHP 339
Query: 230 -------------------SPHLARSYHHLFPIAPFDEVTPLCL------NDPRNRSRST 264
P A S +APF + D R R +
Sbjct: 340 PRPLLPPPLSRCATGSRSPGPRAAASRACYSCLAPFPDPPAQRRGHRANGTDGRRRGDAG 399
Query: 265 CFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCES 312
+ G AC C HFC++CD++ HE +HNCPGC+S
Sbjct: 400 DEEPAPAPKQKGVSESSRYACQVCGNHFCIDCDVFAHEVIHNCPGCQS 447
>gi|407044792|gb|EKE42827.1| TFIIH basal transcription factor complex subunit, putative
[Entamoeba nuttalli P19]
Length = 363
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 114/318 (35%), Positives = 175/318 (55%), Gaps = 20/318 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D++P+R+ + +++ F EF+ NPLS+IGL+ ++ A S I+ M
Sbjct: 54 DYQPNRITFIQTKLKEFFEEFYIDNPLSEIGLICTRNKTAE--QQFPLSRNLIIQKEMKW 111
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
C G SLQ +LDL Q +LS + RE++I+ S+++TCDPG+I +TI+ K++ IRC
Sbjct: 112 GNCEGQMSLQISLDLAQQILSTTSNSSSREIIIVTSSITTCDPGNIYDTIETLKKTTIRC 171
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM--G 179
SVI + EM I + L + TGG Y ++E H ++++ PP I++ G
Sbjct: 172 SVISFAPEMHITRLLTKVTGGDYHTIMNEKHAEDVLHTFIIPPIYKENAYEPKTIQLYIG 231
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP++ +IC CH ++ + + CP C CELP C+IC L+ S H ARSYH
Sbjct: 232 FPKQLNVP--TICECHSKIDIN-HFECPICGFCYCELPIQCKICNAILLLSHHFARSYHF 288
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP-----GLYVACPKCKKHFCL 294
+FPI PF + + + + CFGC + + K +Y C KC K++C
Sbjct: 289 MFPIEPFKVIAMI-------KPQEKCFGCGKEIDDRMEKKEEETVNVY-QCKKCLKYYCN 340
Query: 295 ECDIYIHESLHNCPGCES 312
ECD +IH+ L+NCPGCES
Sbjct: 341 ECDSFIHDVLYNCPGCES 358
>gi|402076515|gb|EJT71938.1| hypothetical protein GGTG_11190 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 540
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 117/279 (41%), Positives = 160/279 (57%), Gaps = 32/279 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D PSR V F REFFDQNP+SQ+ +V ++DGVA ++D G+P HI+ L +
Sbjct: 121 DMLPSRHRVAIAYAVDFAREFFDQNPISQLAIVGMRDGVAVRISDTSGNPADHIEKLRQL 180
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+L G+ SLQNAL++ +G L PS+G REV++++ ALS+ DPGDI ETI +I
Sbjct: 181 VELPPMGNPSLQNALEMCRGALFNTPSHGTREVVMIFGALSSSDPGDIHETIDGLVTDRI 240
Query: 120 RCSVIGLSAEMFICKHLC-----------------QDTGGSYSVALDESHFKELIMEHAP 162
R S+IGLSA++FIC+ LC +D Y VA+D+ HF+EL++
Sbjct: 241 RVSIIGLSAQVFICEDLCRRTNAGHSASEPVLVGGEDESSGYMVAMDDVHFRELLLSVTR 300
Query: 163 PPPA------------IAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCK 210
PP A A+L+ MGFP R ++C+CH G+ C RC
Sbjct: 301 PPATRQSQLQQQGAAAGGGEAGASLLMMGFPSRTTGTDPALCACHNR-PTRDGFLCTRCG 359
Query: 211 ARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEV 249
+RVC LP +C CGL L+ S HLARSYHHLFP+ + EV
Sbjct: 360 SRVCRLPAECPACGLTLILSTHLARSYHHLFPLRIWVEV 398
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 28/35 (80%), Gaps = 1/35 (2%)
Query: 277 NKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCE 311
++ G Y+ C C+KHFC++CD++ HE +HNCPGC+
Sbjct: 483 SESGRYM-CQVCRKHFCIDCDVFAHEVVHNCPGCQ 516
>gi|340378764|ref|XP_003387897.1| PREDICTED: general transcription factor IIH subunit 2-like
[Amphimedon queenslandica]
Length = 354
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 168/314 (53%), Gaps = 43/314 (13%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +P+R ++ F+ EFFD NP+SQ + ++ H +AL
Sbjct: 81 DLKPNRFTCTKTVLKKFLDEFFDFNPISQRDRRRYHTNTIINICNVS----VHKRALDTD 136
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
+ CSG+ SLQNA++L +L IP++ REV+I++ +L++CDPG+I TI + IRC
Sbjct: 137 VECSGECSLQNAMELSLRMLRHIPTHASREVIIIHGSLTSCDPGNIFSTINDFNQQNIRC 196
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
SVIGL+A + +C +C+ T G Y V LDE HF++ +++H PP A A LIKM
Sbjct: 197 SVIGLAASVKLCHTICERTSGVYQVILDEVHFRDCLLQHCRPPGVKASTETA-LIKM--- 252
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
GEG + G C+++ CELP +C CG+ LV +P LARSYHHLF
Sbjct: 253 ---GEGVSLNIKLWPHYQFVCGKEQAWCQSKYCELPVECVTCGITLVLAPQLARSYHHLF 309
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
P+ F E T +YV CP C+ FC++CD+Y H
Sbjct: 310 PLKMFIEST------------------------------VYV-CPSCRHSFCIDCDLYCH 338
Query: 302 ESLHNCPGCE-SLR 314
E+LHNCPGCE SLR
Sbjct: 339 ETLHNCPGCENSLR 352
>gi|67479545|ref|XP_655154.1| TFIIH basal transcription factor complex subunit [Entamoeba
histolytica HM-1:IMSS]
gi|56472271|gb|EAL49768.1| TFIIH basal transcription factor complex subunit, putative
[Entamoeba histolytica HM-1:IMSS]
gi|449704914|gb|EMD45073.1| TFIIH basal transcription factor complex p44 subunit, putative
[Entamoeba histolytica KU27]
Length = 363
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 114/318 (35%), Positives = 175/318 (55%), Gaps = 20/318 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D++P+R+ + +++ F EF+ NPLS+IGL+ ++ A S I+ M
Sbjct: 54 DYQPNRITFIQTKLKEFFEEFYIDNPLSEIGLICTRNKTAE--QQFPLSRNLIIQKEMKW 111
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
C G SLQ +LDL Q +LS + RE++I+ S+++TCDPG+I +TI+ K++ IRC
Sbjct: 112 GNCEGQMSLQISLDLAQQILSTTSNSSSREIIIVTSSITTCDPGNIYDTIETLKKTTIRC 171
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM--G 179
SVI + EM I + L + TGG Y ++E H ++++ PP I++ G
Sbjct: 172 SVISFAPEMHITRLLTKVTGGDYHTIMNEKHAEDVLHTFIIPPIYKENAYEPKTIQLYIG 231
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP++ +IC CH ++ + + CP C CELP C+IC L+ S H ARSYH
Sbjct: 232 FPKQLNVP--TICECHSKIDIN-HFECPICGFCYCELPIQCKICNAILLLSHHFARSYHF 288
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKP-----GLYVACPKCKKHFCL 294
+FPI PF + + + + CFGC + + K +Y C KC K++C
Sbjct: 289 MFPIEPFKVIAMI-------KPQEKCFGCGKEIDDRIEKKEEETVNVY-QCKKCLKYYCD 340
Query: 295 ECDIYIHESLHNCPGCES 312
ECD +IH+ L+NCPGCES
Sbjct: 341 ECDSFIHDVLYNCPGCES 358
>gi|405122345|gb|AFR97112.1| RNA polymerase II transcription factor [Cryptococcus neoformans
var. grubii H99]
Length = 425
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 115/303 (37%), Positives = 165/303 (54%), Gaps = 45/303 (14%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D+RP+R V+ + +V E+FDQNPL QIG++ ++ SPE ++AL K
Sbjct: 120 DYRPTRFEVILGYLRTYVVEWFDQNPLGQIGVIAMR------------SPEEIVRALSDK 167
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPG---DIMETIQKCKE 116
L SG+ SLQN L + +G ++ +PS E+L+++SA+ST DP I +
Sbjct: 168 RKLEPSGEPSLQNGLVMAKGGMAHLPSTSSLEILVIFSAISTADPDGPVTIHNVLDTLAT 227
Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIAN-- 174
IR S++ LS E+ ICK + + TGG + VALD+ H K+L+ E PPP AN
Sbjct: 228 GHIRTSILSLSGEIKICKQIAERTGGKFGVALDQEHLKDLLWETIPPPATTIAPVTANVR 287
Query: 175 ---------------------LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARV 213
L+ MGFP R G ++C+CH ++ G GY CPRC +++
Sbjct: 288 SALAAGGRGPNQTGERAPAGDLMVMGFPIRLPLGGETMCACHGLLRKG-GYLCPRCGSKL 346
Query: 214 CELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLL 273
C++PTDC +CGL +VSSPHLARS+ LFP+A + PL + D S TCFGC
Sbjct: 347 CDVPTDCEVCGLMVVSSPHLARSFWFLFPVANYG---PLAIEDVVESSE-TCFGCDSEFS 402
Query: 274 SSG 276
G
Sbjct: 403 DVG 405
>gi|255634763|gb|ACU17743.1| unknown [Glycine max]
Length = 214
Score = 203 bits (516), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 93/115 (80%), Positives = 108/115 (93%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DFRPSRMAV+ KQVEAF+REFFDQNPLS +GLVT+KDG+A+C+T+LGGSPESHIKALMGK
Sbjct: 99 DFRPSRMAVMGKQVEAFIREFFDQNPLSHVGLVTIKDGIAHCITELGGSPESHIKALMGK 158
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKE 116
L CSGD+SLQNAL+LV G L+QIPSYGHREVLILYSALSTCDPGD+METI + ++
Sbjct: 159 LECSGDASLQNALELVLGYLNQIPSYGHREVLILYSALSTCDPGDLMETIPEMQK 213
>gi|300123288|emb|CBK24561.2| unnamed protein product [Blastocystis hominis]
Length = 626
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 121/320 (37%), Positives = 174/320 (54%), Gaps = 26/320 (8%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +PSR+ VV Q++ F+R+FF N LS + ++ D A LT L + H+ AL K
Sbjct: 5 DLKPSRLEVVVSQLKQFIRDFFYHNSLSSLCIIVSHDRRAFQLTPLSRHMKEHLNAL-DK 63
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
L CSG SLQ +LD+ + PS +E++ L S++STCDP DI +TI +E I+C
Sbjct: 64 LTCSGKFSLQLSLDIAIKTMELAPSQFAKEIICLISSISTCDPRDIYKTISIIQEKHIQC 123
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPP----PAI-------AEF 170
SV+ LS E+++ + L Q T G ++V D +H PP P + A
Sbjct: 124 SVMSLSGEVYLFQKLAQITSGIFAVPEDPQQVHHFFSQHISPPINSDPIVLVKREDGATV 183
Query: 171 AIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSS 230
+ +MGFP IC+CH Y CPRC++ VCELP C +C L LV
Sbjct: 184 THREICEMGFPNNVQLS--HICACHVRF-CSKFYECPRCRSYVCELPVRCPVCMLMLVDY 240
Query: 231 PHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKK 290
HL +SYHHLFP F+E ND ++ TC+GC S++ G + + CPKC K
Sbjct: 241 THLTQSYHHLFPPKNFEEC-----ND--HKPGITCYGC-CSVIKEGE---ICLQCPKCMK 289
Query: 291 HFCLECDIYIHESLHNCPGC 310
+FC CD ++H+++ +CPGC
Sbjct: 290 YFCQTCDAFVHDNIFHCPGC 309
>gi|401395984|ref|XP_003879726.1| putative general transcription factor IIH polypeptide 2 [Neospora
caninum Liverpool]
gi|325114133|emb|CBZ49691.1| putative general transcription factor IIH polypeptide 2 [Neospora
caninum Liverpool]
Length = 487
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 122/340 (35%), Positives = 173/340 (50%), Gaps = 45/340 (13%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVAN------CLTDLGGSPESHI 55
DFRP R++ V + E F+ F QNPL+Q+ V + A+ C G SP
Sbjct: 113 DFRPDRLSCVCQLAEEFIGTFLLQNPLAQLAQVALIGPSADPIVTSRCGATRGDSPSVPA 172
Query: 56 KALMGKLGC-------------------------SGDSSLQNALDLVQGLLSQIPSYGHR 90
G +G +G SL+N L L + LL+ +P Y R
Sbjct: 173 ARSPGSVGTHLFSSNAEECIASLREKRKTTPGVGTGVPSLKNGLRLAKDLLACVPPYCTR 232
Query: 91 EVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDE 150
EVLILY +L TCD G I ETI KE I C+VI L+AEM + K LCQ+TGG ++V L
Sbjct: 233 EVLILYGSLRTCDVGCIEETIADVKEGNICCNVICLAAEMHVLKKLCQETGGRHAVPLHR 292
Query: 151 SHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCK 210
H + L+++H PPA + +L++MGFP + ++CSCH+++ Y CP+C
Sbjct: 293 EHLRSLLVQHT-LPPAWSSSMQPSLVRMGFPSLKSTSTAALCSCHQQLTFS-AYVCPQCG 350
Query: 211 ARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQ 270
A++C++P CR C L LVS ++RS+H L P PFD V + ++ C C
Sbjct: 351 AKLCDIPNRCRCCFLHLVSPADISRSFHSLCPPLPFDPVADI------PEAKRVCACCTT 404
Query: 271 SLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
L G++ CP C + FC +CD+Y HE L C C
Sbjct: 405 RLERGGSQ------CPDCGEIFCHDCDVYTHEQLRQCAFC 438
>gi|324520669|gb|ADY47691.1| General transcription factor IIH subunit 2, partial [Ascaris suum]
Length = 345
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 102/256 (39%), Positives = 148/256 (57%), Gaps = 6/256 (2%)
Query: 5 PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
P+R+ V K + F+ +F +QNP+SQ+G++ +D A L L G+ + +AL + +
Sbjct: 87 PTRLNVTLKVLNQFLEKFSEQNPISQVGIIICRDKRAERLIQLTGTIRAVKEALSTINEA 146
Query: 63 GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
C G+ SLQN L + L P + RE++ + ++LSTCDP +I T + K + +RCS
Sbjct: 147 SCHGEFSLQNGLLVALRSLQHFPGHASREIIAIVASLSTCDPSNIFGTFEVLKRNNVRCS 206
Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
VIGLSAE+F+ K LC T G Y V LD HF ELI+ PP + +++MGFP
Sbjct: 207 VIGLSAEVFVYKKLCSTTSGRYDVILDGDHF-ELILNEYTNPPVTKRNVESCVVRMGFPS 265
Query: 183 RAGEGSISICSCHKEVKV---GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
+ C CH+ G G+ CP+C AR C LP +CR+C L L+S+P LARS+HH
Sbjct: 266 HQIIQKPAFCLCHQSENRPPGGRGFLCPQCGARYCSLPVECRVCKLMLISAPQLARSFHH 325
Query: 240 LFPIAPFDEVTPLCLN 255
L P+ F EV C+
Sbjct: 326 LLPLPAFKEVIDYCVK 341
>gi|312076202|ref|XP_003140756.1| general transcription factor IIH subunit 2 [Loa loa]
Length = 1131
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 105/252 (41%), Positives = 149/252 (59%), Gaps = 19/252 (7%)
Query: 64 CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSV 123
C G+ SL N+L L P Y REV+++ ++LSTCDP +I T + K IRCSV
Sbjct: 95 CHGEFSLHNSLMAAIKSLHSYPGYASREVILIVASLSTCDPSNIFGTFELLKRYHIRCSV 154
Query: 124 IGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQR 183
I LSAE+FI + LC T G ++V LD +HF+ ++ EHA PP + + A +++++MGFP
Sbjct: 155 ISLSAEVFIFRKLCSATSGCHNVILDSTHFEVILNEHANPPIS-SRNAESSVVRMGFPAH 213
Query: 184 AGEGSISICSCHK-EVKV--GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
S S C CH+ E++ G G+ CP+C AR C LP +CRIC L L+S+P LARS H+L
Sbjct: 214 ESIDSPSFCLCHQSEIRPSGGRGFFCPQCGARYCSLPVECRICKLTLISAPQLARSLHNL 273
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
P+ F+E+ + + CF C + L C CK FC++CD+ +
Sbjct: 274 LPLPAFEEI---------DTTERVCFACIRQLDDKS------FVCKNCKSTFCIDCDVLL 318
Query: 301 HESLHNCPGCES 312
HESL CPGC+S
Sbjct: 319 HESLQICPGCKS 330
>gi|324520938|gb|ADY47747.1| General transcription factor IIH subunit 2, partial [Ascaris suum]
Length = 340
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 101/250 (40%), Positives = 146/250 (58%), Gaps = 6/250 (2%)
Query: 5 PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKL 62
P+R+ V K + F+ +F +QNP+SQ+G++ +D A L L G+ + +AL + +
Sbjct: 87 PTRLNVTLKVLNQFLEKFSEQNPISQVGIIICRDKRAERLIQLTGTIRAVKEALSTINEA 146
Query: 63 GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCS 122
C G+ SLQN L + L P + RE++ + ++LSTCDP +I T + K + +RCS
Sbjct: 147 SCHGEFSLQNGLLVALRSLQHFPGHASREIIAIVASLSTCDPSNIFGTFEVLKRNNVRCS 206
Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
VIGLSAE+F+ K LC T G Y V LD HF ELI+ PP + +++MGFP
Sbjct: 207 VIGLSAEVFVYKKLCSTTSGRYDVILDGDHF-ELILNEYTNPPVTKRNVESCVVRMGFPS 265
Query: 183 RAGEGSISICSCHKEVKV---GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
+ C CH+ G G+ CP+C AR C LP +CR+C L L+S+P LARS+HH
Sbjct: 266 HQIIQKPAFCLCHQSENRPPGGRGFLCPQCGARYCSLPVECRVCKLMLISAPQLARSFHH 325
Query: 240 LFPIAPFDEV 249
L P+ F EV
Sbjct: 326 LLPLPAFKEV 335
>gi|90082659|dbj|BAE90511.1| unnamed protein product [Macaca fascicularis]
Length = 237
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 113/238 (47%), Positives = 155/238 (65%), Gaps = 22/238 (9%)
Query: 84 IPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS 143
+P + REVLI++S+L+TCDP +I + I+ K +KIR SVIGLSAE+ +C L ++TGG+
Sbjct: 1 MPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGT 60
Query: 144 YSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRA--------GEGSISICSCH 195
Y V LDESH+KEL+ H PPPA + +LI+MGFPQ + S S+
Sbjct: 61 YHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMGFPQHTIASLSDQDAKPSFSMAHLD 119
Query: 196 KEVKVGV---GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPL 252
+ G+ GY CP+C+A+ CELP +C+ICGL LVS+PHLARSYHHLFP+ F E+ PL
Sbjct: 120 GNTEPGLTLGGYFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEI-PL 178
Query: 253 CLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
+ N R C+GCQ L +YV C C+ FC++CD+++H+SLH CPGC
Sbjct: 179 ---EEYNGER-FCYGCQGEL----KDQHVYV-CAVCQNVFCVDCDVFVHDSLHCCPGC 227
>gi|344303241|gb|EGW33515.1| hypothetical protein SPAPADRAFT_60859 [Spathaspora passalidarum
NRRL Y-27907]
Length = 346
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 126/351 (35%), Positives = 176/351 (50%), Gaps = 77/351 (21%)
Query: 36 VKDGVANCLTDLGGSPESHIKALM-------GKLGCSGDSSLQNALDLVQGLL------S 82
+++GVAN ++++ GSP+ H+ + K GD SLQN+L++ + LL +
Sbjct: 1 MRNGVANLVSEVSGSPQYHLDKIRQLKARQHNKFEPKGDPSLQNSLEMARSLLKYNFGSN 60
Query: 83 QIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTG- 141
+ +E+L+++ AL T DPGDI +TI + I+ VIGLSA++ IC+ L T
Sbjct: 61 ANNTKNSKEILVIFGALFTSDPGDIHKTIGNLVKDDIKVKVIGLSAQVAICQELVNRTNR 120
Query: 142 -------GSYSVALDESHFKELIMEHAPPPPAI---AEFAIAN----LIKMGFP---QRA 184
+Y V ++ESHFKEL+M+ P P EF +N +IKMGFP Q
Sbjct: 121 EEKNTSSKNYGVIMNESHFKELLMDCVTPLPLTEQEKEFEESNHGVPVIKMGFPTKIQPT 180
Query: 185 GEGSISICSCHKE------------------------------VKVGVGYTCPRCKARVC 214
G SI E V VGY CP+CK++VC
Sbjct: 181 GNSSIGNTDFTIEFPQLNASFPTAGSDDSSDVVQINHNMGATTTGVAVGYQCPQCKSKVC 240
Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSL-- 272
LPT C +CGL L+ S HLARSYHHL P+ P+ EV P+ CFGCQ
Sbjct: 241 NLPTICPVCGLMLILSTHLARSYHHLVPLRPYKEV-PIS----STYESQFCFGCQLQFPK 295
Query: 273 ---LSSGNKPGL------YVACPKCKKHFCLECDIYIHESLHNCPGCESLR 314
+S +K L C KC K FC+ CD+++HE LHNCPGCE+ +
Sbjct: 296 GVRTTSKSKGKLESMTSSRYRCMKCSKDFCINCDVFVHEVLHNCPGCENSK 346
>gi|170585402|ref|XP_001897473.1| TFIIH basal transcription factor complex p44 subunit [Brugia
malayi]
gi|158595152|gb|EDP33725.1| TFIIH basal transcription factor complex p44 subunit, putative
[Brugia malayi]
Length = 361
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 105/252 (41%), Positives = 154/252 (61%), Gaps = 6/252 (2%)
Query: 3 FRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MG 60
+PSR+AV K + F+ +F +QNP+SQ+G+V +D A L L G+ ++L +
Sbjct: 81 IQPSRLAVTIKALNQFLDKFSEQNPISQVGVVVCRDKRAERLIPLTGNVRLVKESLSTIT 140
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
+L C G+ SL N+L L P + REV+++ ++LSTCDP I T + K IR
Sbjct: 141 ELLCHGEFSLHNSLMATIRSLHSYPGHASREVILIVASLSTCDPSSIFGTFELLKRYHIR 200
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
CSV+ LSAE+F+ K LC T G + V LD SHF+ ++ EHA PP + A +++++MGF
Sbjct: 201 CSVVSLSAEIFVFKKLCSITSGRHDVILDSSHFEVILSEHA-IPPVCSRNAESSVVRMGF 259
Query: 181 PQRAGEGSISICSCHK-EVKV--GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSY 237
P S C CH+ EV+ G G+ CP+C AR C LP +CRIC L L+S+P LARS
Sbjct: 260 PAHESIDCPSFCLCHQSEVRPSGGRGFFCPQCGARYCSLPVECRICKLTLISAPQLARSL 319
Query: 238 HHLFPIAPFDEV 249
++L P+ F+E+
Sbjct: 320 YNLLPLPAFEEI 331
>gi|299751185|ref|XP_001830111.2| RNA polymerase II transcription factor [Coprinopsis cinerea
okayama7#130]
gi|298409259|gb|EAU91776.2| RNA polymerase II transcription factor [Coprinopsis cinerea
okayama7#130]
Length = 430
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 105/287 (36%), Positives = 163/287 (56%), Gaps = 49/287 (17%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLG-----GSPESHIK 56
D RP+R + + F+ E+FDQNPL QIG+V ++ G+A+ G+P+ ++
Sbjct: 123 DMRPTRFDLTLQYTREFIAEWFDQNPLGQIGIVGMRAGLASTFVKFTECRRIGNPQEVLR 182
Query: 57 ALMG--KLGCSGDSSLQNALDLVQGLL---------SQIPSYGHREVLILYSALSTCDPG 105
A+ KL +G+ SLQNA+++ + + S +P++ RE+LIL+ +L+TCDPG
Sbjct: 183 AIADRHKLEPTGEPSLQNAIEMARSNMNNIANESVSSHLPTHSSREILILFGSLTTCDPG 242
Query: 106 DIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTG------GSYSVALDESHFKELIME 159
+I +T++ C ++KIR SV+ L+AEM IC+ LC TG G + VA++E H+K+L+ E
Sbjct: 243 NIHDTLETCIKNKIRISVVALAAEMKICRELCDKTGDYIPKEGQFGVAMNEGHYKDLLFE 302
Query: 160 HAPPPPAIA---------------EFAIANLIKMGFPQRAGEGSI-SICSCHKEVKVGVG 203
PPP A A+L+ MGFP R E S ++C CH E+K G
Sbjct: 303 LVPPPAQKAAPRPAGPGSAPQPPSTNPAADLMMMGFPTRLPESSQPTLCVCHSELK-SQG 361
Query: 204 YTCPRCKARVCELPTDCRICGLQLV----------SSPHLARSYHHL 240
+ CPRC+++VC++PTDC +CGL + S P L + HL
Sbjct: 362 FLCPRCQSKVCDVPTDCDVCGLMITIIAGDNEHCSSMPRLLTPFPHL 408
>gi|156083725|ref|XP_001609346.1| transcriptional factor 2 subunit [Babesia bovis T2Bo]
gi|154796597|gb|EDO05778.1| transcriptional factor 2 subunit [Babesia bovis]
Length = 364
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 91/252 (36%), Positives = 146/252 (57%), Gaps = 4/252 (1%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
+DF+P R+ A V+ ++ F Q P++Q+ ++T+++ + ++ LG P HI L
Sbjct: 107 VDFKPDRLHCAAGAVKGLIQGLFHQGPMTQLAIITMRNKRSTMVSKLGTPPSEHIALLDE 166
Query: 61 KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
K+ G G SLQN L++ +L+ +P Y RE+L+++ A T DPG+I+ T+QK K+
Sbjct: 167 KIKEGTDGVPSLQNGLEMAISILTDMPPYTTREILVIFGATKTFDPGNILTTLQKLKDEH 226
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
+ S + ++ EM+I KH+C +TGGSY+V D H K L+ +H PP + L K+
Sbjct: 227 VSVSAVSIAPEMYILKHICTETGGSYAVCKDSVHLKSLLNDHT-MPPKWRPWMEPVLTKV 285
Query: 179 GFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
GFP + S+C CH + Y CP+C ++ C +PT C+ C L LVS P ++R +H
Sbjct: 286 GFPPLEKANTASLCICHSTLTYK-AYICPQCHSKSCAIPTKCKCCRLYLVSPPDISRMFH 344
Query: 239 HLFPIAPFDEVT 250
L P PF V+
Sbjct: 345 QLIPPKPFINVS 356
>gi|403367585|gb|EJY83616.1| P44/SSL1-like protein [Oxytricha trifallax]
Length = 499
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 95/254 (37%), Positives = 149/254 (58%), Gaps = 6/254 (2%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
+ RP+R V+ + FV ++FDQNPLS + V A L+D SP+ HI+ +
Sbjct: 73 EMRPNRAIVIKSYLSNFVIDYFDQNPLSCLSFVATFRERAIILSDFNDSPQDHIEKICRF 132
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
G++S+QN+L+L + +PSY +E+LI++ +++ CDPG+I ET K K I C
Sbjct: 133 SDLEGNASIQNSLELCIENFTAVPSYAKKEILIIFCSITNCDPGNIFETYDKLKSKMITC 192
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFA-----IANLI 176
SVI LSA MFI + + Q TGG + ++ D++HF++L+ P +I + LI
Sbjct: 193 SVISLSASMFILQKVTQLTGGEFFLSKDQAHFQDLLKRLLVPKESIEDLTQRKQQQNTLI 252
Query: 177 KMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
KMGFP R ++CSCH +++ V Y CP C++ CEL + C++C + LVS+ HL+R+
Sbjct: 253 KMGFPMRKILSYPAVCSCHNDLRFLV-YICPNCQSPSCELSSFCKVCQIMLVSAAHLSRT 311
Query: 237 YHHLFPIAPFDEVT 250
H +A F +T
Sbjct: 312 AQHNQNLANFQRLT 325
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 238 HHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECD 297
+ +FP+ V + D + C GC + + +S + + V C CKK +CL+CD
Sbjct: 427 NEIFPLNSMKTVGSILQKDGHSY---YCNGCSKRVEASFDNVDI-VICTICKKFYCLDCD 482
Query: 298 IYIHESLHNCPGCESLR 314
I+IHE+L +CP C+ ++
Sbjct: 483 IFIHETLLSCPTCQLIQ 499
>gi|71027729|ref|XP_763508.1| hypothetical protein [Theileria parva strain Muguga]
gi|68350461|gb|EAN31225.1| hypothetical protein, conserved [Theileria parva]
Length = 447
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 110/357 (30%), Positives = 170/357 (47%), Gaps = 66/357 (18%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MDF+P R+ ++ F+ E + P++Q+G+V +++ + N +T G +P ++ L
Sbjct: 106 MDFKPDRLYCAFNSLKEFLGELYSSGPITQVGIVVMRNKICNVITQFGTNPNEQMELLSS 165
Query: 61 KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
L G G SSLQN L++ ++ +P Y RE+L+++ + T DPG+I+ T+ + K++
Sbjct: 166 VLKDGPEGSSSLQNGLEMCMKIMCDLPYYSTREILVIFGSNRTLDPGNILLTLDQLKQNF 225
Query: 119 IRCSVIGLSAEMFICK-------------------------------------------- 134
I + I LS E++I K
Sbjct: 226 ITVNSISLSPELYILKVALLRYGVRSNGTSYLLVILPQSYPPQWDSLWPLWLLVPHIYTS 285
Query: 135 -HLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICS 193
++C++TGG+YSVA D +H K L PPP + LIK+ FP S+S+C
Sbjct: 286 YNICKETGGNYSVARDVNHLKLLFNNLTIPPPW-KSWMEPILIKVSFPPMKRGTSVSLCC 344
Query: 194 CHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLC 253
CH +V V Y CP+C + C +PT C+ CG+ +VS P ++R++HHL P PF V
Sbjct: 345 CHNKV-VNTVYICPQCHSNSCYIPTKCQGCGIFMVSPPDISRAFHHLIPPKPFHHV---- 399
Query: 254 LNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
C GC + S C C FC CD YIH+ LH CP C
Sbjct: 400 ------EGGMDCVGCNLRVESG-------YECQDCGGLFCQHCDKYIHQDLHQCPKC 443
>gi|237841927|ref|XP_002370261.1| general transcription factor IIH polypeptide 2, putative
[Toxoplasma gondii ME49]
gi|211967925|gb|EEB03121.1| general transcription factor IIH polypeptide 2, putative
[Toxoplasma gondii ME49]
gi|221503074|gb|EEE28780.1| btf, putative [Toxoplasma gondii VEG]
Length = 627
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 120/336 (35%), Positives = 163/336 (48%), Gaps = 40/336 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DFRP R+ V + E F+ F QNPL+Q+ V + A L G +
Sbjct: 261 DFRPDRLRCVCQLAEEFIGTFLLQNPLAQLAQVALIGPSAEPLAARGEAARDSAAVSARP 320
Query: 62 LGC---------------------------SGDSSLQNALDLVQGLLSQIPSYGHREVLI 94
G +G SL N L L + LL+ +P Y REVL+
Sbjct: 321 SGSASMQLFSSSASESIAALREKRKHPGVGTGVPSLNNGLHLAKDLLAGVPPYCTREVLV 380
Query: 95 LYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFK 154
L+ +L TCD G I ETI K+S I C+VI L+AE+ + K+LCQ TGG + V L H +
Sbjct: 381 LFGSLRTCDVGCIEETIAAVKKSNICCNVICLAAELHVLKNLCQATGGRHDVPLHREHLR 440
Query: 155 ELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
L+M+H PP A LI+MGFP + ++CSCH+ + Y CP+C A++C
Sbjct: 441 ALLMQHTLPPAWSASMQPC-LIRMGFPSLKSTSTAALCSCHQHLTFS-SYVCPQCGAKLC 498
Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLS 274
P CR C L LVS ++RS+H L P PFD P+ + P ++ C C L
Sbjct: 499 TTPNRCRCCFLHLVSPADISRSFHSLCPPLPFD---PVPADAP--EAKRLCACCTTQLDR 553
Query: 275 SGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
G + CP C + FC +CDIY HE L C C
Sbjct: 554 GGAQ------CPDCGEIFCHDCDIYSHEQLRQCAFC 583
>gi|221482733|gb|EEE21071.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 627
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 120/336 (35%), Positives = 163/336 (48%), Gaps = 40/336 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DFRP R+ V + E F+ F QNPL+Q+ V + A L G +
Sbjct: 261 DFRPDRLRCVCQLAEEFIGTFLLQNPLAQLAQVALIGPSAEPLAARGEAARDSAAVSARP 320
Query: 62 LGC---------------------------SGDSSLQNALDLVQGLLSQIPSYGHREVLI 94
G +G SL N L L + LL+ +P Y REVL+
Sbjct: 321 SGSASMQLFSSSASESIAALREKRKHPGVGTGVPSLNNGLHLAKDLLAGVPPYCTREVLV 380
Query: 95 LYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFK 154
L+ +L TCD G I ETI K+S I C+VI L+AE+ + K+LCQ TGG + V L H +
Sbjct: 381 LFGSLRTCDVGCIEETIAAVKKSNICCNVICLAAELHVLKNLCQATGGRHDVPLHREHLR 440
Query: 155 ELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
L+M+H PP A LI+MGFP + ++CSCH+ + Y CP+C A++C
Sbjct: 441 ALLMQHTLPPAWSASMQPC-LIRMGFPSLKSTSTAALCSCHQHLTFS-SYVCPQCGAKLC 498
Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLS 274
P CR C L LVS ++RS+H L P PFD P+ + P ++ C C L
Sbjct: 499 TTPNRCRCCFLHLVSPADISRSFHSLCPPLPFD---PVPADAP--EAKRLCACCTTQLDR 553
Query: 275 SGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
G + CP C + FC +CDIY HE L C C
Sbjct: 554 GGAQ------CPDCGEIFCHDCDIYSHEQLRQCAFC 583
>gi|348569006|ref|XP_003470289.1| PREDICTED: hypothetical protein LOC100720431 [Cavia porcellus]
Length = 637
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 119/312 (38%), Positives = 170/312 (54%), Gaps = 49/312 (15%)
Query: 12 AKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG--KLGCSGDSS 69
A+ +E FV E+FDQNP+SQ+ ++ K A LT+L G P+ HI +L + C G+ S
Sbjct: 352 AQLLEYFVEEYFDQNPISQVCIIVTKSKRAEKLTELSGMPQKHITSLKKAIDMTCHGEPS 411
Query: 70 LQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAE 129
L N+L + L+ D+M+T+ K +KIR SVIGL AE
Sbjct: 412 LYNSLSMAMQTLNL----------------------DLMKTL---KTAKIRVSVIGLWAE 446
Query: 130 MFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRA----- 184
+ +C L ++TGG+Y LDESH+KEL+ H PP + + +LI MGFPQ
Sbjct: 447 VRVCAALARETGGTYHEILDESHYKELLTHHVSPP-PASSSSECSLIHMGFPQHTIAALA 505
Query: 185 ---GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
+ S S+ + G+ GY P+C A+ CELP +C+ICGL LVS+PHL SYH
Sbjct: 506 DQDAKPSFSMAHLDSSSEPGLALGGYFYPQCWAKYCELPVECKICGLTLVSAPHLVWSYH 565
Query: 239 HLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
HLFP+ F E+ PL + C+GCQ L +YV C + FC++CD+
Sbjct: 566 HLFPLDAFQEI-PL----EEYKGERFCYGCQGQL----KDQHVYV-CTVFQNVFCVDCDV 615
Query: 299 YIHESLHNCPGC 310
++H+S+H C GC
Sbjct: 616 FVHDSVHCCLGC 627
>gi|239793430|dbj|BAH72833.1| ACYPI003683 [Acyrthosiphon pisum]
Length = 245
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 101/250 (40%), Positives = 154/250 (61%), Gaps = 14/250 (5%)
Query: 69 SLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSA 128
S+QN+L++ ++ +PS+ +E+LI+ S+LS+CDPG+I +I+ K IR S+I L+A
Sbjct: 2 SVQNSLEIALSVMKMLPSHTSKEILIIGSSLSSCDPGEINTSIEMLKTHNIRVSMIHLAA 61
Query: 129 EMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEG- 187
E+ + +HLC +T G ++V +D+ HFK ++ P P + A+ +KMGFPQ +
Sbjct: 62 EVRMFRHLCNETKGKHNVIVDDVHFKHILWSLVEPVP-LPNSVDASCVKMGFPQELEQKP 120
Query: 188 SISICSCH--KEVKVGV-GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIA 244
+ CSCH + K+ G+ CP+C ++ CELP +C+ CGL LVSS HLARS HHL PI
Sbjct: 121 PFTTCSCHLAEGGKLNAKGFFCPQCNSKYCELPVECKCCGLILVSSLHLARSLHHLVPIK 180
Query: 245 PFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESL 304
PF ++ + S + C+GC++ + +Y C CKKH+C CDIY+H +L
Sbjct: 181 PFIKI------ELEEGSSAYCYGCRKRIKVPAE--NVYF-CESCKKHYCDGCDIYVHNTL 231
Query: 305 HNCPGCESLR 314
H CPGC R
Sbjct: 232 HVCPGCAVKR 241
>gi|118377110|ref|XP_001021737.1| transcription factor ssl1 family protein [Tetrahymena thermophila]
gi|89303503|gb|EAS01491.1| transcription factor ssl1 family protein [Tetrahymena thermophila
SB210]
Length = 372
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 105/317 (33%), Positives = 167/317 (52%), Gaps = 14/317 (4%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
+DF+PSR+ + ++ F++ FFD NPLS + + ++G L + S S + +
Sbjct: 63 VDFKPSRIGLTQIYLKNFIKNFFDSNPLSVLSIGKTQNGQCITLQEFTSSI-SLLDKTIE 121
Query: 61 KLGCSG--DSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
KL D SL L+ + ++ P Y REV+++ S ST D GDI E I K K+ K
Sbjct: 122 KLVVDEKEDFSLFECLNETFNIFAEAPPYAFREVVLIQSTPSTKDKGDINEQINKAKQCK 181
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
I ++I L+A+ FI + L ++T G Y AL E ++ L+ + PP I LI++
Sbjct: 182 ITFNIISLNAKTFIFQKLTKETQGKYDTALHEKDYENLLNSYCEPPQQILNKTEKTLIQV 241
Query: 179 GFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
GFP + +C+CH E+K Y CP+CK++ CE+P+ C C L L+ H+ +SYH
Sbjct: 242 GFPNKKYSKISVLCACHMELKTNF-YICPQCKSKNCEVPSKCSGCQLNLILPLHITKSYH 300
Query: 239 HLFPIAPFD----EVTPLCLNDPRNRSRST-CFGCQQSLLSSGNKPGLYVACPKCKKHFC 293
+ F+ E + + ++++++ C GC + + K + C CK FC
Sbjct: 301 FSQNLKEFELLGQEKSNNSMETEQDKTQNILCNGCSEDI-----KEDYFSKCQNCKSIFC 355
Query: 294 LECDIYIHESLHNCPGC 310
LECDI IH+ NCP C
Sbjct: 356 LECDILIHKQKLNCPKC 372
>gi|170035934|ref|XP_001845821.1| btf [Culex quinquefasciatus]
gi|167878420|gb|EDS41803.1| btf [Culex quinquefasciatus]
Length = 367
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 137/220 (62%), Gaps = 13/220 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ K +E F+ EFFDQNP+SQ+G++ +K A +T+LGGS HIKA+ +
Sbjct: 76 DLKPTRLLCTLKLLEIFIEEFFDQNPISQLGVIAMKAKRAEKITELGGSCRKHIKAVGNL 135
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
KL +G+ SLQN L+L L +PS+ RE+L++ +L+TCDP DI TI K I
Sbjct: 136 NKLNLTGEPSLQNGLELALKTLKMVPSHASREILVVMGSLTTCDPTDIHITIDALKSEGI 195
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RCSV+ LSAE+ +CK LC +TGG Y LD+SHFK+ +++H PP A + + +IKMG
Sbjct: 196 RCSVVSLSAEIRVCKFLCTETGGVYGAVLDDSHFKDQLLQHIDPPQAGNQQEFS-MIKMG 254
Query: 180 FPQ-RAGEGS---ISICSCHKEVK------VGVGYTCPRC 209
FP + EG +++C CH + GY CP+C
Sbjct: 255 FPHGKTEEGKDPPLTMCMCHIDSTDEPSKLTSGGYHCPQC 294
>gi|349604001|gb|AEP99672.1| General transcription factor IIH subunit 2-like protein, partial
[Equus caballus]
Length = 274
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 89/183 (48%), Positives = 122/183 (66%), Gaps = 3/183 (1%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIITKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 62 LG--CSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP DI IQ K +KI
Sbjct: 135 VAMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSDIYGLIQTLKAAKI 194
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
R SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+ H PPPA + + +LI+MG
Sbjct: 195 RVSVIGLSAEVRVCTVLARETGGTYHVILDESHYKELLTHHVSPPPASSN-SECSLIRMG 253
Query: 180 FPQ 182
FPQ
Sbjct: 254 FPQ 256
>gi|322801575|gb|EFZ22231.1| hypothetical protein SINV_15041 [Solenopsis invicta]
Length = 202
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 95/199 (47%), Positives = 124/199 (62%), Gaps = 22/199 (11%)
Query: 123 VIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQ 182
VIGL+AE++ICK + TGG +SVALD+ H+KE + H PPPA A L+KMGFP
Sbjct: 1 VIGLAAELYICKRMANLTGGEHSVALDDKHYKEQLNAHIDPPPAATRLDAA-LVKMGFPH 59
Query: 183 RA-----GEGSISICSCHKE-----VKV-GVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
A + S+++C CH + VK+ GY CP+C ++ CELP +CR CGL LVS+P
Sbjct: 60 HALHSSAPDTSMTVCMCHAQNSDEPVKLMTTGYLCPQCLSKHCELPVECRACGLTLVSAP 119
Query: 232 HLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKH 291
HLARSYH+LFP+ PF E+ S CFGCQ++L K C KC +
Sbjct: 120 HLARSYHYLFPVDPFKEIA-------SESDHSFCFGCQKALAQKDKK---VYTCGKCNQT 169
Query: 292 FCLECDIYIHESLHNCPGC 310
FCL+C+I+IHE LH CPGC
Sbjct: 170 FCLDCEIFIHEILHTCPGC 188
>gi|323454955|gb|EGB10824.1| hypothetical protein AURANDRAFT_58771 [Aureococcus anophagefferens]
Length = 361
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 115/317 (36%), Positives = 151/317 (47%), Gaps = 33/317 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RPSR+A F+ + F NP S +V +DG A + L G+ +H AL +
Sbjct: 69 DLRPSRLAACCAAGAEFLADLFAGNPFSSACVVDARDGAAKLASPLSGAARAHADALAAV 128
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G+ +G SLQ AL+ L P YG REV++L S+L+T D GD+ + +KI
Sbjct: 129 GRAAPAGAFSLQRALEAAAAALEAAPDYGFREVVVLSSSLATRDAGDLNAVAARLAAAKI 188
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
RC V+ L+AE ++ + L TGG ++VALD H L+ PPP A+ A
Sbjct: 189 RCHVVNLAAETYVVRKLADATGGEFAVALDAGHLAALLGNRVEPPPLAADDA-------- 240
Query: 180 FPQRAGEGSISICSCHKEVKVG------VGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
+ AGE G + CPRC R ELP C +C L LVS+PHL
Sbjct: 241 --RLAGEPPPLPAPALASSAGGAPAWRATTHACPRCATRAAELPAACAVCDLPLVSAPHL 298
Query: 234 ARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFC 293
ARSYHHLFP+A F R CF C L CP C FC
Sbjct: 299 ARSYHHLFPVAAFPG---------RVAEDGACFACGGGLAPRAFD------CPACGAAFC 343
Query: 294 LECDIYIHESLHNCPGC 310
+CD +H +LH CPGC
Sbjct: 344 GDCDEAVHAALHVCPGC 360
>gi|348557313|ref|XP_003464464.1| PREDICTED: general transcription factor IIH subunit 2-like [Cavia
porcellus]
Length = 302
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 117/309 (37%), Positives = 168/309 (54%), Gaps = 52/309 (16%)
Query: 15 VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKLGCSGDSSLQN 72
+E FV E+FDQN +SQIG++ K A LT+L G+P HI +L + C G+ SL N
Sbjct: 23 LEYFVEEYFDQNSISQIGIIVTKR--AEKLTELSGNPRKHITSLKKATDMTCHGEPSLYN 80
Query: 73 ALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFI 132
+L + A+ T + D+M+T++ K VIG+SAE+ +
Sbjct: 81 SLSM---------------------AMQTLNFLDLMKTLKTAK-----MCVIGMSAEVRV 114
Query: 133 CKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRA-------- 184
C L ++TG +Y LDESH+KEL+ H PPPA + +LI MGFPQ
Sbjct: 115 CTGLARETGSTYHEILDESHYKELLTHHVSPPPASSSSE-CSLIHMGFPQHTIAALADQD 173
Query: 185 GEGSISICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
+ S S+ + G+ GY P+C A+ CELP +C++CGL LVS+PHLA YHHLF
Sbjct: 174 AKPSFSMAHLDSSSEPGLALGGYFYPQCWAKYCELPVECKMCGLTLVSAPHLAWLYHHLF 233
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
P+ F E+ PL + C+GCQ L +YV C + FC++CD+++H
Sbjct: 234 PLDAFQEI-PL----EEYKGERFCYGCQGQL----KDQHVYV-CIVFQNVFCVDCDVFVH 283
Query: 302 ESLHNCPGC 310
+S+H CPGC
Sbjct: 284 DSVHCCPGC 292
>gi|389586086|dbj|GAB68815.1| transcription factor [Plasmodium cynomolgi strain B]
Length = 401
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 96/312 (30%), Positives = 165/312 (52%), Gaps = 18/312 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF+P R+ VV + VE+F+ FF +NP+ +G+V +K+ A + L + + +KAL+ +
Sbjct: 100 DFKPDRINVVLESVESFLTNFFFKNPVGHVGVVALKNSSAKLIQPLTSNMDDIMKALLKE 159
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+G G SLQ L++ LL IP YG +E+LI+Y ++ TCD +I+ + ++ +
Sbjct: 160 RSMGLQGSPSLQQGLEIAHDLLIDIPLYGTKEILIIYGSIRTCDKKNILNILNLIVKNNM 219
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
+ + ++ EM I KH+C++T G+Y + + ++ M + P LI +
Sbjct: 220 HVNCVSIAPEMHILKHICEETNGAYKICMTKNALLNE-MNNVTETPLWMSGMEPQLIHIC 278
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP + + +CSCH ++ Y C C + C++P+ C++CG+ L+S L+ ++
Sbjct: 279 FPVKKKISTQIMCSCHNQLNTDT-YICNFCNSYTCKIPSKCKVCGMHLISMHDLSHITNN 337
Query: 240 LFPIAPFDEVTPLCLNDPRNRSR-STCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
L + +PL L + S C C + L ++ C KCK FCL CD+
Sbjct: 338 L-------QGSPLFLEIKNEKDGPSVCVSCNKRLYDKVSQ------CSKCKNIFCLGCDL 384
Query: 299 YIHESLHNCPGC 310
YIHE L+ CP C
Sbjct: 385 YIHEDLNQCPFC 396
>gi|294911611|ref|XP_002778020.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239886141|gb|EER09815.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 421
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 106/328 (32%), Positives = 178/328 (54%), Gaps = 32/328 (9%)
Query: 1 MDFRPSRMAVVAKQV-EAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM 59
MD++ +R+ V + F+ ++FD NP+S + ++ ++D A+ +T + G P S ++ ++
Sbjct: 107 MDYKQNRLDFVVSHLCRNFIPQYFDLNPISNLSVLALRDQRAHFVTRMSGQPASQMERVL 166
Query: 60 --GKLGCSGDSSLQNALDLV-QGLLSQIPSYGHREVLILYSALSTCDP---GDIM----- 108
G SG +SL NAL+ V Q +P YG REVLI++ +L + DP G ++
Sbjct: 167 QFSTGGASGSASLVNALEAVGQAAKGSLPRYGTREVLIVWGSLHSADPPTSGKLIPDCVD 226
Query: 109 ETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIA 168
I++ K + +R SVI LS E++ + + + TGG++SVA HFK L+ +H PP ++
Sbjct: 227 RLIEEIKPNSVRVSVISLSPELYALRKVVEMTGGTFSVATSPIHFKRLMQKHLTPPNWVS 286
Query: 169 EFAIANLIKMGFP-QRAGEGSISI-----CSCHKEVKVGVGYTCPRCKARVCELPTDCRI 222
+ IKMGFP +RA +G+ + C CH ++ Y CP+C + VCE+P +C +
Sbjct: 287 S---PSYIKMGFPVRRACDGNHTADPPIKCMCHNRLQKTFVYICPQCHSPVCEIPVNCPV 343
Query: 223 CGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLY 282
C L LV L + + H++ + P + P ++ P++ TC C G +
Sbjct: 344 CRLPLVDDDALKKHHRHIYSM-PTYTLLP-TVDYPKS---YTCQFCGTDFTEGGAR---- 394
Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGC 310
C +C C ECD++ H L +CPGC
Sbjct: 395 --CDQCLSDVCYECDMFAHNKLRHCPGC 420
>gi|82753444|ref|XP_727680.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23483641|gb|EAA19245.1| W43325 comes from this gene., putative [Plasmodium yoelii yoelii]
Length = 401
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 96/312 (30%), Positives = 166/312 (53%), Gaps = 18/312 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
DF+P+R+ V+ + VE F++ FF +NP+ +G+V +K+ A + L + + + +L+
Sbjct: 100 DFKPNRITVILECVEIFLKNFFFKNPVGHVGVVALKNSSAKLIQQLTSNIDDVLNSLVKE 159
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G +G SLQ L++ LL +P YG +E+LI+Y ++ TCD +I++ + ++ +
Sbjct: 160 QKEGLNGSPSLQEGLEIAHNLLMDMPLYGTKEILIMYGSIRTCDKKNILKYLDLLIKNNM 219
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
+ I ++ EM I KH+C+ T G Y + ++ I + A P + LI +
Sbjct: 220 YVNCISIAPEMHILKHICEKTNGIYKICTSKNILINEINQVAETPLWMHGME-PQLIHIC 278
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP + + +CSCH + Y C C + C++P+ C+ICG+ L+S L+ ++
Sbjct: 279 FPVKKKINTQIVCSCHNILNTDT-YICNFCNSYTCKIPSKCKICGIHLISMHDLSHITNN 337
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRST-CFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
L + +PL + ++ T C C Q L ++ C KCK FCLECD+
Sbjct: 338 L-------QASPLFVEIKNEQNYYTHCSSCNQQLYDKVSQ------CTKCKNIFCLECDV 384
Query: 299 YIHESLHNCPGC 310
+IHE L+ CP C
Sbjct: 385 FIHEDLNQCPFC 396
>gi|68068799|ref|XP_676310.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56495950|emb|CAH97731.1| conserved hypothetical protein [Plasmodium berghei]
Length = 401
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 97/311 (31%), Positives = 164/311 (52%), Gaps = 16/311 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
DF+P+R+ V+ + VE F++ FF +NP+ +G+V +K+ A + L + + + +L+
Sbjct: 100 DFKPNRITVILECVEIFLKNFFFKNPVGHVGVVALKNSSAKLIQQLTSNIDDVLNSLVKE 159
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G G SLQ L++ LL +P YG +E+LI+Y ++ TCD +I++ + ++ +
Sbjct: 160 QKEGLQGSPSLQEGLEIAHNLLMDMPLYGTKEILIMYGSIRTCDKKNILKYLDLLIKNNM 219
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
+ I ++ EM I KH+C+ T G Y + +++ I + A P + LI +
Sbjct: 220 YVNCISIAPEMHILKHICEKTNGIYKICTNKNILINEINQVAETPLWMHGME-PQLIHIC 278
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP + + +CSCH + Y C C + C++P+ C+ICG+ L+S L+ +
Sbjct: 279 FPVKKKINTQIVCSCHNILNTDT-YICNFCNSYTCKIPSKCKICGIHLISMHDLSHITNS 337
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
L F E+ N+ N + C C Q L ++ C KCK FCLECD++
Sbjct: 338 LQASPLFVEIR----NEQNNHTH--CASCNQLLYDKVSQ------CTKCKNIFCLECDVF 385
Query: 300 IHESLHNCPGC 310
IHE L+ CP C
Sbjct: 386 IHEDLNQCPFC 396
>gi|225554656|gb|EEH02952.1| transcription factor TFIIH complex subunit Ssl1 [Ajellomyces
capsulatus G186AR]
Length = 467
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 110/274 (40%), Positives = 149/274 (54%), Gaps = 47/274 (17%)
Query: 82 SQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTG 141
S P++G REVLI++ AL + DPGDI +TI KIR S+IGL+A++ IC+ LC T
Sbjct: 178 SHTPTHGTREVLIIFGALLSSDPGDIHQTITALVADKIRISIIGLAAQVAICRDLCARTN 237
Query: 142 GS----YSVALDESHFKELIME-HAPPPPAIAEFAI------------------ANLIKM 178
Y VAL+E HF+EL M+ APP +A + ++L+ M
Sbjct: 238 NGDDTVYGVALNEQHFRELFMDVTAPPVTTVAPTSTPTASSANGPKTTTTNTTASSLLMM 297
Query: 179 GFPQRAGEGSISI--CSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
GFP + + C+CH + GY C RC A+VC LP+ C CGL L+ S HLARS
Sbjct: 298 GFPSLTLTTTPTPSLCACHSKPSRA-GYLCCRCNAKVCTLPSSCPCCGLTLILSTHLARS 356
Query: 237 YHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-----QQSLLSSGNKP---------GLY 282
YHHLFP+ + EV+ R RS S CF C + L SG G+
Sbjct: 357 YHHLFPLMNWVEVS--WRRAARKRSAS-CFACGVGFPRMPKLVSGEPEETAKAAVGVGVS 413
Query: 283 VA----CPKCKKHFCLECDIYIHESLHNCPGCES 312
V+ CP C+ HFC++CD++ HE +HNCPGC+S
Sbjct: 414 VSGRYECPVCECHFCIDCDVFAHEVVHNCPGCQS 447
>gi|156102767|ref|XP_001617076.1| transcription factor [Plasmodium vivax Sal-1]
gi|148805950|gb|EDL47349.1| transcription factor, putative [Plasmodium vivax]
Length = 401
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 162/313 (51%), Gaps = 20/313 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF+P R+ VV + VE F+ FF +NP+ +G+V +K+ A + L + E AL+ +
Sbjct: 100 DFKPDRINVVLECVENFLTHFFFKNPVGHVGVVALKNSSAKLIQPLTSNMEDITNALLKE 159
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+G G SLQ L++ LL IP YG +E+LI+Y ++ TCD +I+ + ++ +
Sbjct: 160 RSMGLQGSPSLQQGLEIAHDLLIDIPLYGTKEILIMYGSIRTCDKKNILNILNLIVKNNM 219
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
+ + ++ EM I KH+C+ T GSY + + ++ M + P LI +
Sbjct: 220 HVNCVSIAPEMHILKHICEQTNGSYKICMTKNSLMNE-MNNITETPLWMMGMEPQLIHIC 278
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP + + +CSCH + Y C C + C++P+ C++CG+ L+S L+ ++
Sbjct: 279 FPIKKKISTQIMCSCHNNLNTDT-YICNFCNSYTCKIPSKCKVCGMHLISMHDLSHITNN 337
Query: 240 LFPIAPFDEVTPLCLNDPRNRSR--STCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECD 297
L + +PL L + +N + S C C + L ++ C KC FCL CD
Sbjct: 338 L-------QGSPLFL-EIKNEEKGPSVCVSCNKRLYDKVSQ------CSKCGNLFCLACD 383
Query: 298 IYIHESLHNCPGC 310
+YIHE L+ CP C
Sbjct: 384 LYIHEDLNQCPFC 396
>gi|189240734|ref|XP_967768.2| PREDICTED: similar to btf [Tribolium castaneum]
Length = 236
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 81/177 (45%), Positives = 122/177 (68%), Gaps = 4/177 (2%)
Query: 15 VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKLGCSGDSSLQN 72
+E F+ EFFDQNP+SQ+G++ + + A ++DL G+ HIKA+ + + G+ SLQN
Sbjct: 52 LENFIEEFFDQNPISQLGIIIMHNKRAEKISDLAGNARKHIKAVQNLKQTSLVGEPSLQN 111
Query: 73 ALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFI 132
++++ L +PS+ RE+L++ +L+TCDPGDI TIQ K +RCSVIGL+AE+++
Sbjct: 112 SIEMALRALKLLPSHASREILVIMGSLTTCDPGDITTTIQMLKTEGVRCSVIGLAAEVYV 171
Query: 133 CKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP-QRAGEGS 188
C+ L TGG Y+V LD+ HFK+L+ + PPPA A+ +NLIKMGFP Q + EG+
Sbjct: 172 CRQLANQTGGLYNVILDDCHFKDLLYQQVDPPPA-AQTLESNLIKMGFPHQMSVEGT 227
>gi|270012914|gb|EFA09362.1| hypothetical protein TcasGA2_TC001923 [Tribolium castaneum]
Length = 191
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 81/177 (45%), Positives = 122/177 (68%), Gaps = 4/177 (2%)
Query: 15 VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKLGCSGDSSLQN 72
+E F+ EFFDQNP+SQ+G++ + + A ++DL G+ HIKA+ + + G+ SLQN
Sbjct: 7 LENFIEEFFDQNPISQLGIIIMHNKRAEKISDLAGNARKHIKAVQNLKQTSLVGEPSLQN 66
Query: 73 ALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFI 132
++++ L +PS+ RE+L++ +L+TCDPGDI TIQ K +RCSVIGL+AE+++
Sbjct: 67 SIEMALRALKLLPSHASREILVIMGSLTTCDPGDITTTIQMLKTEGVRCSVIGLAAEVYV 126
Query: 133 CKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP-QRAGEGS 188
C+ L TGG Y+V LD+ HFK+L+ + PPPA A+ +NLIKMGFP Q + EG+
Sbjct: 127 CRQLANQTGGLYNVILDDCHFKDLLYQQVDPPPA-AQTLESNLIKMGFPHQMSVEGT 182
>gi|221060951|ref|XP_002262045.1| ssl1-like protein [Plasmodium knowlesi strain H]
gi|193811195|emb|CAQ41923.1| ssl1-like protein, putative [Plasmodium knowlesi strain H]
Length = 401
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 95/311 (30%), Positives = 158/311 (50%), Gaps = 16/311 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF+P R+ VV + VE F+ FF +NP+ +G+V +K+ A + L + E + AL+ +
Sbjct: 100 DFKPDRINVVLECVENFLTNFFFKNPVGHVGVVALKNSSAKLIQPLTSNMEDIMNALVKE 159
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+G G SL+ L++ LL IP YG +E+LI+Y ++ TCD +I+ + ++ +
Sbjct: 160 RSMGLQGSPSLEQGLEIAHDLLVDIPLYGTKEILIMYGSIRTCDKKNILNILNLIVKNNM 219
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
+ + +S EM I KH+C++T G Y + + ++ M + P LI +
Sbjct: 220 HVNCVSISPEMHILKHICEETNGVYKICMTKNALMNE-MNNVTETPLWMIGMEPQLIHIC 278
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP + + +CSCH + Y C C + C++P+ C++CG+ L+S L+ ++
Sbjct: 279 FPIKKKISTQIMCSCHNHLNTDT-YICNFCNSYTCKIPSKCKVCGMHLISMHDLSHITNN 337
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
L F E+ S C C + L ++ C KCK FCL CD+Y
Sbjct: 338 LQGSPLFIEIK------NEKNGPSACVSCNKPLYDKVSQ------CSKCKNIFCLGCDLY 385
Query: 300 IHESLHNCPGC 310
IHE L+ CP C
Sbjct: 386 IHEDLNQCPFC 396
>gi|313240907|emb|CBY33192.1| unnamed protein product [Oikopleura dioica]
Length = 384
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 107/322 (33%), Positives = 156/322 (48%), Gaps = 29/322 (9%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL-- 58
+D P+R + ++ FV +FF PL Q+ LV +++ AN + LGGS +KAL
Sbjct: 79 IDRYPNRAGCAVRSLKEFVPKFFASCPLGQLALVILQNKRANIVVPLGGSEARLLKALND 138
Query: 59 --MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKE 116
+ G S+ N + + +L+ + + +RE++ + +L+T D TI+
Sbjct: 139 IEVKGFKTGGQCSMVNGIAASKSMLNAVGEHSNREIIFVVGSLNTIDVTSPFATIETVAN 198
Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLI 176
IRCSV+ LSAE+ I K L + T G Y V LD + +E PP + L+
Sbjct: 199 EGIRCSVVSLSAEVNIWKKLAERTDGKYFVPLDPIDVSDK-LEELSRPPVESSSRQGVLV 257
Query: 177 KMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
KMGFPQ+ + Y CP+C+ RV +PT C +C L LVS+ HLAR
Sbjct: 258 KMGFPQKEADAR---------------YICPQCRTRVKAIPTLCDVCKLSLVSAAHLARC 302
Query: 237 YHHLFPIAPFDEVTP--------LCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
YHHLFP + P L L + C GC + +S + +V C KC
Sbjct: 303 YHHLFPPSSVTPSIPEQSGEEIDLELERSQFEKIRPCVGCNYAPISV-EEGRQFVVCDKC 361
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
C ECD +I E LH+CPGC
Sbjct: 362 LNSLCGECDGFIDEHLHSCPGC 383
>gi|428176074|gb|EKX44960.1| SSL1 transcription factor & nucleotide excision repair [Guillardia
theta CCMP2712]
Length = 289
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 105/315 (33%), Positives = 161/315 (51%), Gaps = 42/315 (13%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R+ VV + V +F+ FFDQNP+SQ+ LV ++ G A LT+L G+ H L +
Sbjct: 5 DMRPTRLGVVQQVVGSFLDNFFDQNPISQVSLVELRGGKAEKLTELSGNSRHHKNKLEER 64
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
L + + + SY R ++ PG + + K + ++
Sbjct: 65 LAAHRSAGAGMPSMRAGVVWDKRSSYDVRIFELMR-------PGGHLSDY-RLKSAGVKV 116
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAI----AEFAIANLIK 177
+++G++AE+++ + + + TGGSYSVA ++++++H P P + A +
Sbjct: 117 NIVGMAAELYVARAVSKRTGGSYSVATHAVQVRDMVLKHTSPSPVEEGEGGKEKPAAGMW 176
Query: 178 MGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSY 237
+GFP+++ G++ G+ CPRC V E+P DC +CGL+L+SS HLARSY
Sbjct: 177 VGFPRKS-YGALG------------GWQCPRCMDVVKEIPADCGLCGLKLLSSSHLARSY 223
Query: 238 HHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKH-FCLEC 296
HHLFP+ F E+ + S STC GC V K KH FC C
Sbjct: 224 HHLFPVQSFHELED---EEGPATSTSTCGGCG-------------VLLGKESKHVFCGLC 267
Query: 297 DIYIHESLHNCPGCE 311
D +IHE LH CPGCE
Sbjct: 268 DGFIHEMLHVCPGCE 282
>gi|66359374|ref|XP_626865.1| transcription factor TFIIH with a vWA domain [Cryptosporidium
parvum Iowa II]
gi|46228124|gb|EAK89023.1| transcription factor TFIIH with a vWA domain [Cryptosporidium
parvum Iowa II]
Length = 423
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 101/334 (30%), Positives = 174/334 (52%), Gaps = 22/334 (6%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
+D++P R+ + K E F+++ + NPL+QI ++++ DG+ + + + +++
Sbjct: 94 LDYKPDRLQCMVKCNEIFIKQLLEDNPLTQISVISIYDGIGEVIISYNSNFLEIMTSILN 153
Query: 61 KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIM-ETIQKCKES 117
L GC G S+QN L+ + LL IP YG +E++ ++ + D + E ++ +
Sbjct: 154 YLKKGCKGSMSIQNGLEKAKYLLVSIPPYGTKEIIFFLGSMRSVDNSFLFNEWVEGFSSN 213
Query: 118 KIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEH--APPPPAIAEFAI-AN 174
I + I E++I K + + TGG A++ H +L +E+ PPP IA + N
Sbjct: 214 NIIINAILFIPELYIIKTITKMTGGVCLCAMNNDHLLKLTLENFIKPPPNNIASTPLNIN 273
Query: 175 LIKMGFPQRAGEGSISI-CSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
L+ MGFP+ + CSCH+ + GY+CP+CK+ VC LPT C +C + L+S HL
Sbjct: 274 LVTMGFPEYVSNQTHPFGCSCHRSL-THRGYSCPKCKSIVCYLPTKCPVCLIYLISPNHL 332
Query: 234 ARSYHHLFPIAPFDEVTPLCLND----PRNRSRSTCFGCQQSL-----LSSGNKPGLY-- 282
A+S+ +LF ++TP+ + + ++ C C+ L+SG++ +
Sbjct: 333 AKSFAYLF---QHPDITPIKNGEYSKIESSSIKTKCELCESQFSNNVYLNSGSRSNIINK 389
Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGCESLRHS 316
C C FC EC +I +LH CP C SLR S
Sbjct: 390 YLCTNCSTQFCNECCKFIFNTLHQCPICCSLRIS 423
>gi|70952334|ref|XP_745342.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56525635|emb|CAH76815.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 400
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 164/312 (52%), Gaps = 19/312 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
DF+P+R+ V+ + VE F++ FF +NP+ +G+V +K+ A + L + + + +L+
Sbjct: 100 DFKPNRITVILECVEIFLKNFFFKNPVGHVGVVALKNSSAKLIQQLTSNIDDVLNSLVKE 159
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G G SLQ L++ LL + YG +E+LI+Y ++ TCD +I++ + ++ +
Sbjct: 160 QKEGLQGSPSLQEGLEIAHNLLMDML-YGTKEILIMYGSIRTCDKKNILKYLDLLIKNNM 218
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
+ I ++ EM I KH+C+ T G Y + ++ I + A P + LI +
Sbjct: 219 YVNCISIAPEMHILKHICEKTHGIYKICTSKNILINEINQVAETPLWMHGME-PQLIHIC 277
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP + + +CSCH + Y C C + C++P+ C++CG+ L+S L+ ++
Sbjct: 278 FPVKKKINTQIVCSCHNTLNTDT-YICNFCNSYTCKIPSKCKVCGIHLISMHDLSHITNN 336
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRST-CFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
L + +PL + ++ T C C Q L ++ C KCK FCLECD+
Sbjct: 337 L-------QASPLFIEIKNEQNYYTYCSSCNQQLYDKVSQ------CTKCKNIFCLECDV 383
Query: 299 YIHESLHNCPGC 310
+IHE L+ CP C
Sbjct: 384 FIHEDLNQCPFC 395
>gi|124513058|ref|XP_001349885.1| TFIIH basal transcription factor subunit [Plasmodium falciparum
3D7]
gi|23615302|emb|CAD52293.1| TFIIH basal transcription factor subunit [Plasmodium falciparum
3D7]
Length = 401
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 94/311 (30%), Positives = 161/311 (51%), Gaps = 16/311 (5%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D +P R+ V + VE+F++ FF +NP+ +G+V +K+ A + + + + +++ +
Sbjct: 100 DLKPDRINVALECVESFLKNFFFKNPVGHVGVVALKNSSAKLIQPFTSNVDDILNSILKE 159
Query: 62 --LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G G SL+ L + LL +P YG +EVLI+Y ++ TCD +I+ ++ +S I
Sbjct: 160 RTAGLQGSPSLEEGLQIAHDLLIDMPLYGTKEVLIMYGSIRTCDKKNILNVLELLVKSNI 219
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
+ I ++ EM I KH+C+ + G Y + ++ I +A P + LI +
Sbjct: 220 YVNCISIAPEMHILKHICEKSNGFYKICSSKNSLMNEINNNAETPLWMQGME-PQLIHIC 278
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP + + +CSCH ++ Y C C + C++P+ C++CG+ L+S L+ ++
Sbjct: 279 FPTKKKISTQIMCSCHGKLNTDT-YVCNFCNSYTCKIPSKCKVCGIHLISMHDLSHITNN 337
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
L F E+ N+ N C C Q L + +Y C KC+ FCLECDI+
Sbjct: 338 LQGSPLFIEIK----NEQGN--YKVCSSCNQQLYNK-----IY-QCTKCQHIFCLECDIF 385
Query: 300 IHESLHNCPGC 310
IHE L+ CP C
Sbjct: 386 IHEELNQCPFC 396
>gi|67595351|ref|XP_665994.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54656889|gb|EAL35767.1| hypothetical protein Chro.30353 [Cryptosporidium hominis]
Length = 423
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 102/335 (30%), Positives = 172/335 (51%), Gaps = 24/335 (7%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
+D++P R+ + K E F+++ + NPL+QI ++++ DG+ + + + +++
Sbjct: 94 LDYKPDRLQCMVKCNEIFIKQLLEDNPLTQISVISIYDGIGEVIISYNSNFLEIMTSILN 153
Query: 61 KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIM-ETIQKCKES 117
L GC G S+QN L+ + LL IP YG +E++ ++ + D + E ++ +
Sbjct: 154 YLKKGCKGSMSIQNGLEKAKYLLVSIPPYGTKEIIFFLGSMRSVDNSFLFNEWVEGFSSN 213
Query: 118 KIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEH--APPPPAIAEFAI-AN 174
I + I E++I K + + TGG A++ H +L +E+ PPP IA + N
Sbjct: 214 NIIINAILFIPELYIIKTITKMTGGVCLCAMNNDHLLKLTLENFIKPPPNNIASTPLNIN 273
Query: 175 LIKMGFPQRAGEGSISI-CSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
L+ MGFP+ + CSCH + GY+CP+CK+ VC LPT C +C + L+S HL
Sbjct: 274 LVTMGFPEYLNNQTHPFGCSCHGSL-THRGYSCPKCKSIVCYLPTKCPVCLIYLISPNHL 332
Query: 234 ARSYHHLF---PIAPFD--EVTPLCLNDPRNRSRSTCFGCQQSL-----LSSGNKPGLY- 282
A+S+ +LF I P E + + + ++ C C+ L+SG++ +
Sbjct: 333 AKSFAYLFQHPDIIPIKNGEYSKI----ESSSIKTKCELCESQFSNNVYLNSGSRSNIIN 388
Query: 283 -VACPKCKKHFCLECDIYIHESLHNCPGCESLRHS 316
C C FC EC +I +LH CP C SLR S
Sbjct: 389 KYLCTNCNTQFCNECCKFIFTTLHQCPICCSLRIS 423
>gi|300709301|ref|XP_002996816.1| hypothetical protein NCER_100001 [Nosema ceranae BRL01]
gi|239606142|gb|EEQ83145.1| hypothetical protein NCER_100001 [Nosema ceranae BRL01]
Length = 335
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 102/312 (32%), Positives = 161/312 (51%), Gaps = 36/312 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D+ PS V + +E F+ FF +NP+S + TV + + I L+ +
Sbjct: 59 DYLPSIRKNVIRSLEKFIPTFFLENPISGLSFSTVNEKTVK------STNSVEIADLLNQ 112
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIR 120
G G+ SL N L + QI SY RE+L++ S+L DP + I ++ I+
Sbjct: 113 KG-EGNFSLLNGL---YDAIDQIKSYTFCREILVIVSSLVLKDPDSYTDVIDLLRKHNIK 168
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
++I L E+ I K++ + TGG + V L+ HF ++ P + NLIK+GF
Sbjct: 169 VNIISLCGELMIYKNIVESTGGKFFVPLNIDHFNYIL--RCMTVPGELNSSTINLIKLGF 226
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
P+ E + C+CH +++ VGY CP CK +C LP C IC LQLVSS ++A+++ H+
Sbjct: 227 PKVIYEEGV--CACHLQLQ-SVGYECPLCKTFICSLPMGCPICELQLVSSLNIAKAFQHM 283
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
+P+APF + C N CF C N PG + +C KC +C CD+++
Sbjct: 284 YPLAPFTK----CCN-------GVCFVC--------NSPGNF-SCEKCNSVYCDSCDLFV 323
Query: 301 HESLHNCPGCES 312
H +L+ C GC++
Sbjct: 324 HNNLNFCLGCKN 335
>gi|261190444|ref|XP_002621631.1| RNA polymerase TFIIH complex subunit Ssl1 [Ajellomyces dermatitidis
SLH14081]
gi|239591054|gb|EEQ73635.1| RNA polymerase TFIIH complex subunit Ssl1 [Ajellomyces dermatitidis
SLH14081]
Length = 502
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 92/246 (37%), Positives = 140/246 (56%), Gaps = 23/246 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV EFF+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 140 DLRPTRYLLTLRYAQEFVLEFFEQNPISQLGVLGMRDGLAVRISDMSGNPTDHILAIQSL 199
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQN L++ +G L + P++G REV I++ AL + DPGDI +TI KI
Sbjct: 200 RPKDPKGMPSLQNTLEMARGALFRTPTHGTREVFIIFGALLSSDPGDIHKTINTLVADKI 259
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIME--------------HA 161
R S+IGL+A++ IC+ +C T Y VAL+E HF+EL M
Sbjct: 260 RVSIIGLAAQVAICRDICARTNNGDDSGYGVALNEQHFRELFMNVTTPPATTVAPTPTTT 319
Query: 162 PPPPAIAEFAIANLIKMGFPQR--AGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTD 219
++ ++L+ MGFP R + + ++C+CH + GY C RC A+VC LPT
Sbjct: 320 KEETKTSQTTTSSLLMMGFPSRTLSPTTTPTLCACHSKPSR-TGYLCSRCGAKVCTLPTS 378
Query: 220 CRICGL 225
C GL
Sbjct: 379 CPCGGL 384
>gi|209880135|ref|XP_002141507.1| suppressor of stem-loop protein 1 [Cryptosporidium muris RN66]
gi|209557113|gb|EEA07158.1| suppressor of stem-loop protein 1, putative [Cryptosporidium muris
RN66]
Length = 424
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 95/321 (29%), Positives = 159/321 (49%), Gaps = 13/321 (4%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM- 59
+D++P R+ + + ++ F+ +NPL+QI ++++++G+ + +T + + ++
Sbjct: 96 LDYKPDRLQCMVRCIDTFICNLLQENPLTQISVISIRNGLTSIVTTYNSNYREIVSGILS 155
Query: 60 -GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIM-ETIQKCKES 117
K GCSG S++N L+ + LL+ IP YG +E++ ++ + D ++ E +
Sbjct: 156 EAKNGCSGVMSIRNGLEKAKQLLASIPPYGTKEIVFFLGSMRSIDNDSMLNEWLDDFIAH 215
Query: 118 KIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAI---AN 174
I + + E+FI K + TGG AL+ H +L++E+ PP+ ++ N
Sbjct: 216 NIVINALLFIPELFIIKTITTKTGGICLCALNSEHMLQLLLENFVKPPSYSQLNTPLHIN 275
Query: 175 LIKMGFPQRAGEGSISI-CSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHL 233
L+ MGFP + CSCHK + GY CPRCK+ VC LP C IC L L S+ HL
Sbjct: 276 LVPMGFPMYFNNSGYPLQCSCHKSI-TSDGYCCPRCKSLVCYLPIKCPICNLYLASANHL 334
Query: 234 ARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNK----PGLYVACPKCK 289
+S+ LF P EV L + ++ C CQ S P + C C
Sbjct: 335 TKSFAFLFK-PPSMEVLQLNPGEYKSDIPRHCRFCQNLFTSKATYRNGFPFSLIKCTNCC 393
Query: 290 KHFCLECDIYIHESLHNCPGC 310
C++C I +LH CP C
Sbjct: 394 SFLCIDCCKLILLALHQCPEC 414
>gi|358371546|dbj|GAA88153.1| RNA polymerase TFIIH complex subunit Ssl1 [Aspergillus kawachii IFO
4308]
Length = 320
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 75/192 (39%), Positives = 121/192 (63%), Gaps = 7/192 (3%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FVREFF+QNP+SQ+G++ +KDG+A ++D+ G+P HI A+ +
Sbjct: 108 DLRPTRYLLSLRYAQEFVREFFEQNPISQLGVLGLKDGLAVRVSDMSGNPTEHISAIQSL 167
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQN ++ +G L PS+G RE+ +++ +L + DPGDI +TI KI
Sbjct: 168 RDQDPKGLPSLQNGFEMARGALFHTPSHGTREIFVIFGSLLSSDPGDIHQTITTLINDKI 227
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPPPAIAEF-AIAN 174
R ++GL+A++ IC+ +C T G +Y VAL+E HF++L+M PP ++ + ++
Sbjct: 228 RVGIVGLAAQVAICREICGKTNGGDDTTYGVALNEQHFRDLVMSVTTPPATYSQKQSTSS 287
Query: 175 LIKMGFPQRAGE 186
L+ MGFP R E
Sbjct: 288 LLMMGFPSRTVE 299
>gi|242807960|ref|XP_002485064.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Talaromyces
stipitatus ATCC 10500]
gi|218715689|gb|EED15111.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Talaromyces
stipitatus ATCC 10500]
Length = 313
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 73/169 (43%), Positives = 111/169 (65%), Gaps = 6/169 (3%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV EFF+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 103 DIRPTRYLLALRYAQEFVIEFFEQNPISQLGVIGMRDGLAVRISDMSGNPTDHITAIHAL 162
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQNAL++ +G L PS+G REVLI++ +L + DPGDI +T++ KI
Sbjct: 163 RSDDPKGLPSLQNALEMARGALFHTPSHGTREVLIVFGSLLSSDPGDIHQTLKSLVADKI 222
Query: 120 RCSVIGLSAEMFICKHLCQDTGGS----YSVALDESHFKELIMEHAPPP 164
R ++GL+A++ IC+ LC T G Y VAL+E HF+EL+ME PP
Sbjct: 223 RVGIVGLAAQVAICRELCAKTNGGDDTVYGVALNEQHFRELMMEVTIPP 271
>gi|119182598|ref|XP_001242425.1| hypothetical protein CIMG_06321 [Coccidioides immitis RS]
Length = 364
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 74/169 (43%), Positives = 110/169 (65%), Gaps = 6/169 (3%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FV E+F+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 123 DVRPTRYLLTLRYAQEFVLEYFEQNPISQLGIIGMRDGLAVRISDMSGNPSEHILAIQAL 182
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+G SLQNAL++ +G L PS+G REVLI++ AL + DPGDI +TI KI
Sbjct: 183 RTRDPTGLPSLQNALEMARGALFHTPSHGTREVLIIFGALLSSDPGDIHKTITSLVTDKI 242
Query: 120 RCSVIGLSAEMFICKHLCQDTGG----SYSVALDESHFKELIMEHAPPP 164
R V+GL+AE+ IC+ +C T Y VAL+E HF+EL+ME PP
Sbjct: 243 RVGVVGLAAEVAICREICSKTNAGDDSGYGVALNEQHFRELMMEITTPP 291
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 16/28 (57%), Positives = 22/28 (78%)
Query: 285 CPKCKKHFCLECDIYIHESLHNCPGCES 312
C C HFC++CD++ HE +HNCPGC+S
Sbjct: 309 CTVCHNHFCIDCDVFAHEIVHNCPGCQS 336
>gi|365759512|gb|EHN01295.1| Ssl1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 303
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 75/161 (46%), Positives = 109/161 (67%), Gaps = 6/161 (3%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RPSR A++ + FV EFFDQNP+SQ+G++ +++G+A ++ + G+P+ HI AL +
Sbjct: 141 DLRPSRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 200
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P++ REVLI++ +LST DPGDI +TI KI
Sbjct: 201 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 260
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKEL 156
R V+GLSA++ ICK LC+ T Y + LDE+H KEL
Sbjct: 261 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKEL 301
>gi|359474618|ref|XP_003631495.1| PREDICTED: uncharacterized protein LOC100853435 [Vitis vinifera]
Length = 235
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 71/94 (75%), Positives = 83/94 (88%)
Query: 15 VEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGKLGCSGDSSLQNAL 74
++ +EFFDQ+P SQIGLVT+KDG+A CLTDLGGS +SH+KALMGKL CSGDSSLQNAL
Sbjct: 142 LQTMKQEFFDQDPFSQIGLVTIKDGLAPCLTDLGGSIDSHVKALMGKLECSGDSSLQNAL 201
Query: 75 DLVQGLLSQIPSYGHREVLILYSALSTCDPGDIM 108
+LV+G L+ IPSYG+REVLILYSALS CD GDIM
Sbjct: 202 NLVRGYLNLIPSYGYREVLILYSALSACDTGDIM 235
>gi|323347610|gb|EGA81877.1| Ssl1p [Saccharomyces cerevisiae Lalvin QA23]
Length = 302
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 74/161 (45%), Positives = 109/161 (67%), Gaps = 6/161 (3%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + FV EFFDQNP+SQ+G++ +++G+A ++ + G+P+ HI AL +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P++ REVLI++ +LST DPGDI +TI KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259
Query: 120 RCSVIGLSAEMFICKHLCQDTG----GSYSVALDESHFKEL 156
R V+GLSA++ ICK LC+ T Y + LDE+H KEL
Sbjct: 260 RVKVLGLSAQVAICKELCKATNYGDESFYKILLDETHLKEL 300
>gi|313239111|emb|CBY14088.1| unnamed protein product [Oikopleura dioica]
Length = 369
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 102/322 (31%), Positives = 147/322 (45%), Gaps = 44/322 (13%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL-- 58
+D P+R + ++ FV +FF PL Q+ LV +++ AN + LGGS +KAL
Sbjct: 79 IDRYPNRAGCAVRSLKEFVPKFFASCPLGQLALVILQNKRANIVVPLGGSEARLLKALND 138
Query: 59 --MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKE 116
+ G S+ N + + +L+ + + +RE++ + +L+T D TI+
Sbjct: 139 IEVKGFKTGGQCSMVNGIAASKSMLNAVGEHSNREIIFVVGSLNTIDVTSPFATIETVAN 198
Query: 117 SKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLI 176
IRCSV+ LSAE+ I K L + T G Y V LD + +E PP + L+
Sbjct: 199 EGIRCSVVSLSAEVNIWKKLAERTDGKYFVPLDPIDVSDK-LEELSRPPVESSSRQGVLV 257
Query: 177 KMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS 236
KMGFPQ V +PT C +C L LVS+ HLAR
Sbjct: 258 KMGFPQ------------------------------VKAIPTLCDVCKLSLVSAAHLARC 287
Query: 237 YHHLFPIAPFDEVTP--------LCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKC 288
YHHLFP + P L L + C GC + +S + +V C KC
Sbjct: 288 YHHLFPPSSVTPSIPEQSGEEIDLELERSQFEKIGPCVGCNYAPISV-EEGRQFVVCDKC 346
Query: 289 KKHFCLECDIYIHESLHNCPGC 310
C ECD +I E LH+CPGC
Sbjct: 347 LNSLCGECDGFIDEHLHSCPGC 368
>gi|403222976|dbj|BAM41107.1| uncharacterized protein TOT_030000370 [Theileria orientalis strain
Shintoku]
Length = 468
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 99/336 (29%), Positives = 165/336 (49%), Gaps = 32/336 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF+P R+ ++ FV++ F + P++Q+G+V +++ VAN + LG +PE ++ L
Sbjct: 136 DFKPDRLYCSFGALKEFVKDLFSKGPITQLGMVVMRNKVANLICQLGTNPEEQLELLGNS 195
Query: 62 L--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
L G G SSLQN L++ +L ++PSY REVL+++ + T D G+I+ T++K K++ +
Sbjct: 196 LKEGADGPSSLQNGLEMCLNMLYELPSYSTREVLVIFGSNRTLDAGNILVTLEKLKQNNL 255
Query: 120 RCSVIGLSAE----MFICK---------HLCQDTGGSYSVALDESHFKELIMEHAPPPPA 166
+ I LS E +F K T S D+ + + +
Sbjct: 256 TINSISLSPENRRQLFGGKGCKSPETPTQQLHITSTMVSTMSDQIYMLPVSIVWNKLISL 315
Query: 167 IAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
++ L+K+ FP ++S+C CH + V Y CP+C ++ C +PT C++
Sbjct: 316 RKKWMEPVLLKVAFPPLKKTATVSLCVCHSSL-VNKAYICPQCHSKSCYIPTKCKVRTFI 374
Query: 227 LV-----SSPHLARS--YHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGN-K 278
L+ ++ H S YHH F +T P+N S+ + Q SS N +
Sbjct: 375 LIGAIDHNNRHAVESTWYHHQIFQGLF--ITSYL---PKNSSK---YENTQVTCSSCNLR 426
Query: 279 PGLYVACPKCKKHFCLECDIYIHESLHNCPGCESLR 314
+ CP C FC C++YI E LH CP C L+
Sbjct: 427 NNVGFRCPSCDSWFCEYCNVYIQEELHQCPMCLFLK 462
>gi|401825920|ref|XP_003887054.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit SSL1 [Encephalitozoon hellem
ATCC 50504]
gi|392998212|gb|AFM98073.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit SSL1 [Encephalitozoon hellem
ATCC 50504]
Length = 340
Score = 147 bits (370), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 98/311 (31%), Positives = 155/311 (49%), Gaps = 34/311 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF P+ VA+ +E F+ F+++NPLS + ++ +D ++ + + I A +G+
Sbjct: 59 DFLPTFRTNVARILEDFIPSFYNENPLSTLSFLSTRDVCVKYISSM----DMDIHAFLGQ 114
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIR 120
G S SL N L +G + + + H +E+L++ ++ ST DP E + + K I+
Sbjct: 115 TG-SKWFSLLNGL---EGSIEIMENTMHVKEILVIVASTSTRDPHGYTEVLNRLKTHNIK 170
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
I L E+ + K + + TGG + V +D H ++ E + P +L+K+GF
Sbjct: 171 VHFISLCGEVTLYKSISKATGGRFYVPIDVGHLSMIMKELSHPTDFNG--TTLSLVKIGF 228
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
P E S+ C+CH EVK VGY CP CK VC LP C IC QLVS+ +L++S +
Sbjct: 229 PLPTMESSV--CACHLEVK-SVGYECPVCKTMVCSLPISCPICNTQLVSTLNLSKSLRFI 285
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
+P+ PF E R+ C CQ G Y C CK +C C+ ++
Sbjct: 286 YPLKPFIE-----------RAEDACRICQNK--------GAY-QCDLCKSTYCSYCNGFV 325
Query: 301 HESLHNCPGCE 311
H +L C CE
Sbjct: 326 HSTLSFCIYCE 336
>gi|396081175|gb|AFN82793.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit SSL1 [Encephalitozoon
romaleae SJ-2008]
Length = 340
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 99/311 (31%), Positives = 155/311 (49%), Gaps = 34/311 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF P+ VA+ +E F+ F+++NPLS + ++ +D ++ + + + A +G+
Sbjct: 59 DFLPTFRTNVARILEEFIPSFYNENPLSTLSFLSTRDVCVKYISSM----DMDVHAFLGQ 114
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIR 120
G S SL N L +G + + + H +E+L++ ++ ST DP E + K K I+
Sbjct: 115 TG-SKWFSLLNGL---EGSIEIMRNTMHVKEILVIVASTSTRDPHGYTEVLNKLKTYNIK 170
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
I L E+ + K + + TGG + V +D H ++ E + P +L+K+GF
Sbjct: 171 VHFISLCGELTLYKSISKATGGRFYVPVDVGHLSMIMKELSHPTDFNG--TTLSLVKIGF 228
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
P E S+ CSCH E+K +GY CP CK VC LPT C ICG QLVS+ +L++S L
Sbjct: 229 PLPTIESSV--CSCHLEMK-SIGYECPVCKTMVCSLPTSCPICGTQLVSTLNLSKSLRFL 285
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
+P+ PF E ++ C C+ G Y C CK FC C+
Sbjct: 286 YPLKPFIE-----------KAEGICRICRDK--------GAY-QCELCKSTFCSYCNGLA 325
Query: 301 HESLHNCPGCE 311
H +L C CE
Sbjct: 326 HNTLSFCIYCE 336
>gi|19074105|ref|NP_584711.1| similarity to SUPPRESSOR OF STEM LOOP PROTEIN 1 [Encephalitozoon
cuniculi GB-M1]
gi|19068747|emb|CAD25215.1| similarity to SUPPRESSOR OF STEM LOOP PROTEIN 1 [Encephalitozoon
cuniculi GB-M1]
gi|449328973|gb|AGE95248.1| suppressor of stem loop protein 1 [Encephalitozoon cuniculi]
Length = 340
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 103/313 (32%), Positives = 154/313 (49%), Gaps = 38/313 (12%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF P+ A V K +E F+ F+ +NPLS + ++++D S + I A +G+
Sbjct: 59 DFLPTFRANVTKVLEGFIPSFYSENPLSTLSFLSIRD----VCVKYTSSMDIDIHAFLGQ 114
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIR 120
G S SL N L +G + + + H +E+L++ ++ ST DP E + K + I+
Sbjct: 115 TG-SKWFSLLNGL---EGSIEVMKNTMHVKEILVIVASTSTRDPHGYAEVLAKLRAYNIK 170
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEF--AIANLIKM 178
I L E+ + K + + T G + V +D H ++ E + P +F +L+K+
Sbjct: 171 VHFISLCGEITLYKSISKATEGRFYVPVDAGHLSAIMRELSHP----TDFNGTKLSLVKI 226
Query: 179 GFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
GFP E S+ C+CH EVK G GY CP CK VC LP C IC QLVS+ +L++S
Sbjct: 227 GFPSPMMEPSV--CACHLEVK-GAGYECPVCKTMVCSLPISCPICSTQLVSTLNLSKSLR 283
Query: 239 HLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
L+P+ PF E ++ TC CQ G Y C CK FC C+
Sbjct: 284 FLYPLRPFVE-----------KAEGTCRVCQGK--------GAY-QCELCKSTFCSYCNR 323
Query: 299 YIHESLHNCPGCE 311
IH +L C CE
Sbjct: 324 LIHNTLSFCIYCE 336
>gi|303388880|ref|XP_003072673.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit SSL1 [Encephalitozoon
intestinalis ATCC 50506]
gi|303301815|gb|ADM11313.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit SSL1 [Encephalitozoon
intestinalis ATCC 50506]
Length = 340
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 101/311 (32%), Positives = 153/311 (49%), Gaps = 34/311 (10%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF P+ VAK +E F+ F+ +NPLS + ++V+D ++ + + I A +G+
Sbjct: 59 DFLPTFRTNVAKILEGFIPSFYSENPLSTLSFLSVRDVCVKYISSM----DMDIHAFLGQ 114
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIR 120
G S SL N L +G + I + H +E+L++ ++ ST DP E + K K I+
Sbjct: 115 TG-SKWFSLLNGL---EGSVEIIKNTTHVKEILVIVASTSTRDPHGYTEVLTKLKVHNIK 170
Query: 121 CSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGF 180
I L E+ + K + + T G + V +D H +IM+ P ++ L+K+GF
Sbjct: 171 VHFISLCGEVTLYKSISKATEGRFYVPVDVGHL-SIIMKELSHPTDFNGTTLS-LVKIGF 228
Query: 181 PQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHL 240
P E S+ C+CH E+K GY CP CK VC LP C IC QLVS+ +L++S L
Sbjct: 229 PLPVIEPSV--CACHLEMK-SAGYECPVCKTMVCSLPVSCPICNTQLVSTLNLSKSLRFL 285
Query: 241 FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYI 300
+P+ PF E + C CQ G Y C CK FC C+ ++
Sbjct: 286 YPLKPFIE-----------KEGKMCRICQSK--------GGY-QCELCKSIFCNSCNGFV 325
Query: 301 HESLHNCPGCE 311
H +L C CE
Sbjct: 326 HNTLSFCIYCE 336
>gi|326935424|ref|XP_003213771.1| PREDICTED: general transcription factor IIH subunit 2-like, partial
[Meleagris gallopavo]
Length = 208
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 75/164 (45%), Positives = 104/164 (63%), Gaps = 7/164 (4%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQI--GLVTVKDGVANCLTDLGGSPESHIKALM 59
D +P+R+ K +E FV E+FDQNP+SQ G V + + G+P+ HI AL
Sbjct: 30 DLKPNRLTCTLKLLEYFVEEYFDQNPISQDPPGHVAATE---ERRARMPGNPKKHIAALK 86
Query: 60 G--KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKES 117
+ C G+ SL N+L+L L +P + REVLI++S+L+TCDP +I E I+ K
Sbjct: 87 KAVDMNCQGEPSLYNSLNLAMQTLKHMPGHTSREVLIVFSSLTTCDPANIYELIKCLKAV 146
Query: 118 KIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHA 161
KIR SVIGLSAE+ +C L ++TGG+Y V LDESH+KEL+M H
Sbjct: 147 KIRVSVIGLSAEVRVCTVLTRETGGTYHVILDESHYKELLMHHV 190
>gi|451994856|gb|EMD87325.1| hypothetical protein COCHEDRAFT_1197444 [Cochliobolus
heterostrophus C5]
Length = 718
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 151/358 (42%), Gaps = 109/358 (30%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D RP+R + + ++REFF+QNP+SQ+ ++ + D +
Sbjct: 118 DMRPNRFITMINYTQDYIREFFEQNPISQMSVLGMHDECSRV------------------ 159
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
G+S N L Q ++ GHR + Q + R
Sbjct: 160 ----GES---NTLPHAQSWHTR----GHRRLR------------------QPAESGPWRH 190
Query: 122 SVIGLSAEMFICKHLCQ--DTG--GSYSVALDESHFKELIMEHAPPP------------- 164
S I CK +C+ DTG Y++A D+ +EL++ PP
Sbjct: 191 SPI--------CKGVCEGPDTGDESEYTIATDQEMLRELLLATTTPPVIRQPLVTDKTAP 242
Query: 165 --PAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRI 222
A + + A L+ MGFP R E +++C+CH + +G GYTC RC A+VC LP C
Sbjct: 243 ATTAQSSESAAALMMMGFPSRVVEDQLTMCACHGNLTMG-GYTCSRCSAKVCSLPITCPS 301
Query: 223 CGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGC-------------- 268
C L L+ S HLARSYHHLFP+ + V+ + R C GC
Sbjct: 302 CQLTLLLSTHLARSYHHLFPLRNWATVS---WSRAREMGSKQCVGCLAAFSNPPSNAKEN 358
Query: 269 ---QQ-------------SLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
QQ S ++ G Y C C+ HFC++CD++ H LHNCPGC
Sbjct: 359 GEAQQDTEAKVTKKHDDDSEEQKASESGRY-ECRACESHFCIDCDMFAHMVLHNCPGC 415
>gi|145473807|ref|XP_001462567.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430407|emb|CAK95194.1| unnamed protein product [Paramecium tetraurelia]
Length = 361
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 94/317 (29%), Positives = 149/317 (47%), Gaps = 31/317 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF+PSR+AV + + AF+ +F + NPLSQIG+ ++ + D S +IK +
Sbjct: 67 DFKPSRLAVALQFLPAFIEQFLECNPLSQIGIAVAEEYRCKTILDFTSS-WVNIKQYLST 125
Query: 62 L------GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCK 115
+ G S + LQ AL + S + +L++ ++ + D D+ E ++C+
Sbjct: 126 IHSINEAGFSMAACLQTALHV----FSTTQLHAQSSILVVTTSTYSDDKLDLNEWSERCQ 181
Query: 116 ESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANL 175
+ I+ +VI + + + Q T G Y L+E F + I + P + I L
Sbjct: 182 SAAIQINVISFTGSINQLIKITQATDGQYLCPLNEFQFSQEIQKFVSPQESKNHKMITQL 241
Query: 176 IKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+K+GFPQ+ ++C CH E+ V Y CP C ++VC P C IC +V + R
Sbjct: 242 VKIGFPQKLIVSQPTLCQCHLEI-VYNFYKCPVCYSKVCSPPLLCPICSTWIVLPHQILR 300
Query: 236 S--YHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFC 293
S Y+ L E+T ND + C GC + ++ C +CK FC
Sbjct: 301 SDGYNTLGVF----EIT----NDGLDH---VCSGCLEP------TTVIHSTCERCKNFFC 343
Query: 294 LECDIYIHESLHNCPGC 310
L+CDI IH CPGC
Sbjct: 344 LDCDILIHSKFKVCPGC 360
>gi|321447765|gb|EFX61168.1| hypothetical protein DAPPUDRAFT_340521 [Daphnia pulex]
Length = 167
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 71/182 (39%), Positives = 104/182 (57%), Gaps = 23/182 (12%)
Query: 59 MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
M C+G++SLQN+ + L +P++ REVL L L + I+ KE+
Sbjct: 7 MKDTSCNGETSLQNSFERAMCGLKNMPAHSSREVLTLLVILVIYK-----KKIKSFKENN 61
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
IR S+IGL+AE+ IC+ + + TGG+Y+V LD+ + KE ++ + P A+A A+L+KM
Sbjct: 62 IRVSIIGLAAEVRICQEIAKRTGGTYNVLLDDHYLKETLILNQVQPLAVAGTMEASLVKM 121
Query: 179 GFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYH 238
GF + GY CP+ +R CELP +C+ CGL LVS+PHLARSYH
Sbjct: 122 GFSNNS------------------GYNCPQSGSRYCELPVECKQCGLTLVSAPHLARSYH 163
Query: 239 HL 240
L
Sbjct: 164 LL 165
>gi|301097433|ref|XP_002897811.1| general transcription factor IIH subunit, putative [Phytophthora
infestans T30-4]
gi|262106559|gb|EEY64611.1| general transcription factor IIH subunit, putative [Phytophthora
infestans T30-4]
Length = 164
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/174 (43%), Positives = 98/174 (56%), Gaps = 27/174 (15%)
Query: 154 KELIMEHAPPPPAIAEFA--IANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKA 211
K L+ P P +A A A L++MGFPQR G++S+C+CH+ VGY CPRCK+
Sbjct: 2 KRLMTAFTFPSPTVATAASRFATLVEMGFPQRRS-GALSLCTCHQAFTT-VGYLCPRCKS 59
Query: 212 RVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDE--------------VTPLCLNDP 257
+ C+LPT C++C L LVSSPHLARSYHHLFP+A F + ++P
Sbjct: 60 KSCDLPTTCQVCNLPLVSSPHLARSYHHLFPVAKFAQHLLRSGATGEKGAKISP------ 113
Query: 258 RNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGCE 311
+ CFGC L G C C+ FC ECD Y+H+SLHNCPGC
Sbjct: 114 -ELVQKKCFGCLLLLGLDGEGAAY--ECATCQNVFCSECDTYVHDSLHNCPGCS 164
>gi|367054704|ref|XP_003657730.1| hypothetical protein THITE_2123698 [Thielavia terrestris NRRL 8126]
gi|347004996|gb|AEO71394.1| hypothetical protein THITE_2123698 [Thielavia terrestris NRRL 8126]
Length = 282
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 62/138 (44%), Positives = 90/138 (65%), Gaps = 3/138 (2%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D P+R + FVRE+F+QNP+SQ+G++ ++DGVA ++D+GG+P HI+ L
Sbjct: 121 DLLPTRYLLTLNYAVDFVREYFEQNPISQMGIIAMRDGVAVRISDMGGNPAEHIEKLRTW 180
Query: 62 LGC---SGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
G+ SLQNAL++ +G L PS+G REVLI+Y AL + DPGDI +TI +
Sbjct: 181 AAQGEPQGNPSLQNALEMCRGALFHTPSHGTREVLIIYGALLSSDPGDIHDTIANLITDR 240
Query: 119 IRCSVIGLSAEMFICKHL 136
IR S++GLSA++ IC L
Sbjct: 241 IRVSIVGLSAQVAICAEL 258
>gi|123469529|ref|XP_001317976.1| transcription factor ssl1 family protein [Trichomonas vaginalis G3]
gi|121900723|gb|EAY05753.1| transcription factor ssl1 family protein [Trichomonas vaginalis G3]
Length = 394
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 104/336 (30%), Positives = 158/336 (47%), Gaps = 51/336 (15%)
Query: 3 FRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MG 60
F +RM ++ Q F+ +FF QNPLSQ+ ++ + LT L + + H+K + +
Sbjct: 78 FNQNRMKLIQDQTIDFINDFFVQNPLSQLSILATYESTCRILTPLSCNVQDHVKKIKSLS 137
Query: 61 KLGCSGDSSLQNALDLVQGLL---------SQIPSYGHREVLILYSALSTCDPGDIMETI 111
G+ SL+++L + +L +QI + +EVLI+Y +L+TCD I +T+
Sbjct: 138 LEDHGGEPSLESSLSIATAILGNGEKNPGLAQIST--TKEVLIVYGSLTTCDNSPIYKTL 195
Query: 112 QKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFA 171
++SKI+ S+IGL A++F+ + + +T G Y V + H ++ PP EF
Sbjct: 196 NLVRDSKIKVSIIGLGAKVFVLEKIAAETKGEYFVPVSIEHLHDIFHSFVIPP----EFM 251
Query: 172 IAN---LIKMGFPQRAGEGSISI-----------CSCHKEVKVGVGYTCPRCKARVCELP 217
+ I GF A S + E GY CPRC RV +P
Sbjct: 252 LNQRRYFIPFGFASLASPTVPSFDVFKLMSNPPSKTNKPETPQYGGYQCPRCGTRVFSIP 311
Query: 218 TDCRICGLQLVSSPHLARSYHHLFPIAPF-DEVTPLCLNDPRNRSRSTCFGCQQSLLSSG 276
C +C L+S +L R+ HL P+ F E +P CFGC
Sbjct: 312 CFCPVCRDFLISPAYLKRTVCHLTPVPLFVSEESP---------DTDHCFGC-------- 354
Query: 277 NKPGLYVAC--PKCKKHFCLECDIYIHESLHNCPGC 310
N P + C PKC K FC +CD +IHE L +CPGC
Sbjct: 355 NMPFEEIPCRCPKCNKCFCKKCDPFIHEVLQSCPGC 390
>gi|169596905|ref|XP_001791876.1| hypothetical protein SNOG_01225 [Phaeosphaeria nodorum SN15]
gi|160707395|gb|EAT90874.2| hypothetical protein SNOG_01225 [Phaeosphaeria nodorum SN15]
Length = 716
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 76/211 (36%), Positives = 105/211 (49%), Gaps = 46/211 (21%)
Query: 142 GSYSVALDESHFKELIMEHAPPPPAIAEFAI----------ANLIKMGFPQRAGEGSISI 191
G Y++A D+ KEL++ PP I + A A L+ MGFP R E ++
Sbjct: 206 GEYTIATDQEMLKELLIA-TTTPPVIRQTATTTDAPPPESAAALMMMGFPSRVVEDKPTM 264
Query: 192 CSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTP 251
C+CH + +G GYTC RC A+VC LP C C L L+ S HLARSYHHLFP+ + V+
Sbjct: 265 CACHGNLTMG-GYTCSRCSAKVCSLPITCPSCQLTLLLSTHLARSYHHLFPLRNWATVS- 322
Query: 252 LCLNDPRNRSRSTCFGC-----------------QQSLLSSGNK-------------PGL 281
+ R + C C +Q+L ++G+K G
Sbjct: 323 --WSRAREKGSKECVSCLSSFSKPPASDTERTQTEQNLETNGHKRDADDSEEQKASESGR 380
Query: 282 YVACPKCKKHFCLECDIYIHESLHNCPGCES 312
Y C C+ HFC++CD++ H LHNCPGC S
Sbjct: 381 Y-ECRACESHFCIDCDMFAHMVLHNCPGCLS 410
Score = 40.8 bits (94), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 15/37 (40%), Positives = 25/37 (67%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKD 38
D RP+R + + +VREFF+QNP+SQ+ ++ + D
Sbjct: 119 DMRPNRYITMINYAQDYVREFFEQNPISQMSVLGMHD 155
>gi|76157401|gb|AAX28337.2| SJCHGC09266 protein [Schistosoma japonicum]
Length = 234
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 62/149 (41%), Positives = 95/149 (63%), Gaps = 4/149 (2%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D +P+R+ K FVRE+FDQNP+SQ+G++ D A LT+L G+P H+ AL +
Sbjct: 88 DLKPNRLICTIKAACTFVREYFDQNPISQLGIIVTSDRRAERLTELSGNPRPHLAALQSL 147
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
C+G+ SLQNAL L + L + H E+++L ++L+TCDPGDI +TI+ ++I
Sbjct: 148 YSRTCAGEPSLQNALLLAESRLKY--TIHHNEIVVLIASLTTCDPGDIHQTIKSLSSNRI 205
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVAL 148
CSVI L+ E+F+ + L Q T G++ V +
Sbjct: 206 HCSVISLAVEVFVYRALAQFTQGTFHVIM 234
>gi|402580964|gb|EJW74913.1| hypothetical protein WUBG_14179, partial [Wuchereria bancrofti]
Length = 144
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 65/138 (47%), Positives = 89/138 (64%), Gaps = 4/138 (2%)
Query: 115 KESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIAN 174
K IRCSVI LSAE+F+ K LC T G + V LD +HF+ ++ EHA PP + A ++
Sbjct: 3 KRYHIRCSVISLSAEIFVFKKLCSITSGRHDVVLDSTHFEVILSEHAIPP-VCSRNAESS 61
Query: 175 LIKMGFPQRAGEGSISICSCHK-EVKV--GVGYTCPRCKARVCELPTDCRICGLQLVSSP 231
+++MGFP S C CH+ EV+ G G+ CP+C AR C LP +CRIC L L+S+P
Sbjct: 62 VVRMGFPAHESIDCPSFCLCHQSEVRPPGGRGFFCPQCGARYCSLPVECRICKLTLISAP 121
Query: 232 HLARSYHHLFPIAPFDEV 249
LARS H+L P+ F+E+
Sbjct: 122 QLARSLHNLLPLPAFEEI 139
>gi|403267800|ref|XP_003925995.1| PREDICTED: general transcription factor IIH subunit 2 [Saimiri
boliviensis boliviensis]
Length = 233
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 61/134 (45%), Positives = 88/134 (65%), Gaps = 2/134 (1%)
Query: 30 QIGLVTVKDGVANCLTDLGGSPESHIKALMG--KLGCSGDSSLQNALDLVQGLLSQIPSY 87
+IG++ K A LT+L G+P HI +L + C G+ SL N+L + L +P +
Sbjct: 30 KIGIIVTKSKRAEKLTELSGNPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGH 89
Query: 88 GHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVA 147
REVLI++S+L+TCDP +I + I+ K +KIR SVIGLSAE+ +C L ++TGG+Y V
Sbjct: 90 TSREVLIIFSSLTTCDPSNIYDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGTYHVI 149
Query: 148 LDESHFKELIMEHA 161
LDESH+KEL+ H
Sbjct: 150 LDESHYKELLTHHV 163
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 31/51 (60%), Gaps = 5/51 (9%)
Query: 260 RSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
R C+GCQ L +YV C C+ FC++CD+++H+SLH CPGC
Sbjct: 178 RMAGFCYGCQGEL----KDQHVYV-CAVCQNVFCVDCDVFVHDSLHCCPGC 223
>gi|425774738|gb|EKV13039.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Penicillium
digitatum PHI26]
gi|425780731|gb|EKV18732.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Penicillium
digitatum Pd1]
Length = 296
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 57/138 (41%), Positives = 90/138 (65%), Gaps = 2/138 (1%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FVREFF+QNP+SQ+G++ ++DG+A ++D+ G+P H+ A+ +
Sbjct: 101 DLRPTRYLLALRYAQEFVREFFEQNPISQLGVLGLRDGLALRVSDMSGNPTGHLTAIQAL 160
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQN L++ +G L PS+G REVL +Y +L + DPGDI +TI KI
Sbjct: 161 KMQDPKGLPSLQNGLEMARGALFHTPSHGTREVLFIYGSLLSSDPGDIHQTIDALISDKI 220
Query: 120 RCSVIGLSAEMFICKHLC 137
R ++GL+A++ IC+ L
Sbjct: 221 RVGIVGLAAQVAICRELS 238
>gi|56786210|gb|AAW29208.1| Ssl1 [Drosophila yakuba]
gi|56786212|gb|AAW29209.1| Ssl1 [Drosophila yakuba]
gi|56786214|gb|AAW29210.1| Ssl1 [Drosophila yakuba]
gi|56786216|gb|AAW29211.1| Ssl1 [Drosophila yakuba]
gi|56786226|gb|AAW29216.1| Ssl1 [Drosophila yakuba]
gi|56786228|gb|AAW29217.1| Ssl1 [Drosophila yakuba]
gi|56786230|gb|AAW29218.1| Ssl1 [Drosophila yakuba]
gi|56786232|gb|AAW29219.1| Ssl1 [Drosophila yakuba]
Length = 166
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 63/165 (38%), Positives = 96/165 (58%), Gaps = 12/165 (7%)
Query: 59 MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
+ + + + SLQN LDL L +PS+ RE++I+ +L+TCDP DI TI + K+
Sbjct: 2 LANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIMGSLTTCDPVDINITIDELKKEG 61
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
IRCSVI LSAE+ + ++L Q T G++ LD++HF++ +M PPPA A+ +LI+M
Sbjct: 62 IRCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRM 120
Query: 179 GFPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKAR 212
GFP E +S+C CH E G+ CP+C ++
Sbjct: 121 GFPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHHCPQCNSK 165
>gi|429962932|gb|ELA42476.1| hypothetical protein VICG_00575 [Vittaforma corneae ATCC 50505]
Length = 335
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 85/309 (27%), Positives = 150/309 (48%), Gaps = 34/309 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D+ P+ +++ + FV +F NPLS + +T + S E + K ++
Sbjct: 59 DYVPTVRNILSNTIPRFVSQFKISNPLSILSFLTCRSVFE------KYSREFNPKIMLNT 112
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
+G SG+ S N L ++ S ++EVL++ S++ T D G + K+ I+
Sbjct: 113 VG-SGNFSFLNCLKSAIEMIRT--STYNKEVLVITSSIGTKDTGAYEQVFSDIKKYNIKV 169
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
+VI + E+ + K + + + G + V LD HF+ L+ + P + A + L+K+GFP
Sbjct: 170 NVISICGEVTLFKKISELSNGVFKVPLDSFHFEILLHQFTEPLECLE--ATSCLVKLGFP 227
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
Q + C CH + + + Y CP CKA +C LP +C IC QLVS ++++SY+ ++
Sbjct: 228 QITSNSGL--CVCHLKFEHSL-YECPVCKAYICCLPCECPICSTQLVSPLNISKSYYFIY 284
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
P+ PF +++ + R G L C +C+ H C +C+ ++H
Sbjct: 285 PLKPF-------ISNENGKCRKC--GRDAKFL-----------CEECQNHLCEKCNKFMH 324
Query: 302 ESLHNCPGC 310
E L C C
Sbjct: 325 EDLGFCIFC 333
>gi|56786218|gb|AAW29212.1| Ssl1 [Drosophila santomea]
gi|56786220|gb|AAW29213.1| Ssl1 [Drosophila santomea]
gi|56786222|gb|AAW29214.1| Ssl1 [Drosophila santomea]
gi|56786224|gb|AAW29215.1| Ssl1 [Drosophila santomea]
gi|56786234|gb|AAW29220.1| Ssl1 [Drosophila santomea]
gi|56786236|gb|AAW29221.1| Ssl1 [Drosophila santomea]
gi|56786238|gb|AAW29222.1| Ssl1 [Drosophila santomea]
gi|56786240|gb|AAW29223.1| Ssl1 [Drosophila santomea]
Length = 166
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 63/165 (38%), Positives = 95/165 (57%), Gaps = 12/165 (7%)
Query: 59 MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
+ + + + SLQN LDL L +PS+ RE++I+ +L+TCDP DI TI K+
Sbjct: 2 LANVSLTSEPSLQNGLDLALKSLKVVPSHASREIVIIMGSLTTCDPVDINITIDDLKKEG 61
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
IRCSVI LSAE+ + ++L Q T G++ LD++HF++ +M PPPA A+ +LI+M
Sbjct: 62 IRCSVISLSAEIHVARYLTQQTMGTFGAVLDDAHFRDQLMSQVDPPPA-AKTQHNSLIRM 120
Query: 179 GFPQRAGE-----GSISICSCHKE------VKVGVGYTCPRCKAR 212
GFP E +S+C CH E G+ CP+C ++
Sbjct: 121 GFPHTKNEVEGKDAPLSMCMCHIENLEEPSELTTTGHHCPQCNSK 165
>gi|145532284|ref|XP_001451903.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124419569|emb|CAK84506.1| unnamed protein product [Paramecium tetraurelia]
Length = 373
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 88/328 (26%), Positives = 149/328 (45%), Gaps = 43/328 (13%)
Query: 2 DFRPSRMAVVAKQV----------EAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGS- 50
DF+PSR+AV + + ++F+ +F + NPLSQIG+ ++ + D S
Sbjct: 69 DFKPSRLAVALQFLPVKKHTIIHNQSFIEQFLEGNPLSQIGIAVAEEYKCKTILDFTSSC 128
Query: 51 --PESHIKAL--MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGD 106
+ ++ + + + G S + LQ AL + S + +L + ++ + D D
Sbjct: 129 VNIKQYLSTIHSINEAGFSMAACLQTALHV----FSSTQLHAQSSILFVTTSTYSDDKLD 184
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA 166
+ E +KC+ + I+ ++I + + + Q T G Y ++E F + I + P +
Sbjct: 185 LNEWSEKCQSAAIQINMISFAGAIHQLIKITQATDGQYLCPINEFQFSQEIQKFISPQES 244
Query: 167 IAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQ 226
I L+K+GFPQ+ ++C CH E+ Y CP C ++VC P C IC
Sbjct: 245 KNHKMITQLVKIGFPQKFIVSQPTLCQCHLEIIYNF-YKCPVCYSKVCSPPLLCPICSTW 303
Query: 227 LVSSPHLARS--YHHL--FPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLY 282
+V + RS Y+ L F I +ND + C GC + ++
Sbjct: 304 IVLPHQILRSDGYNTLGVFEI----------INDGLDH---ICQGCLEP------TSVIH 344
Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGC 310
C +CK FCL+CD+ IH CPGC
Sbjct: 345 STCERCKNFFCLDCDVLIHSKFKVCPGC 372
>gi|440493575|gb|ELQ76030.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH, subunit SSL1 [Trachipleistophora
hominis]
Length = 334
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 88/313 (28%), Positives = 148/313 (47%), Gaps = 42/313 (13%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF PS V ++ F+++F+D+NP+S + L+ ++ C + + S + G
Sbjct: 61 DFLPSFRYHVTTKLNHFIKKFYDENPISVLSLLIYRNN--KCESYIVLDRLSETEGFFGV 118
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
G +G+ SL N+L++ L++ +E+L++ +L T D +E ++ K+ KI
Sbjct: 119 AG-TGEFSLINSLNVSLEFLNRDDYI--KEILVITGSLYTID----IEELRSFKDIKIH- 170
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
I AE++ K + + T G Y V ++ S L +E P +I + NL+K+GFP
Sbjct: 171 -FIAFRAEVYFFKDVAEKTHGKYYVPVEMSDI-SLFLESFCMPNSINSGSAHNLLKLGFP 228
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
Q G + C + K G Y CP C +VC LP C +C QLV+S LA+S ++ +
Sbjct: 229 Q-VLTGDLVCACCFRYSKKG--YECPVCHTKVCNLPIKCPVCKTQLVTSNILAQSLYYCY 285
Query: 242 PIAPFDEVT---PLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDI 298
P+ F++ +C D R+R C KC +C C
Sbjct: 286 PLEDFEKAGGACRVCGGDGRDR------------------------CKKCGSIYCERCSY 321
Query: 299 YIHESLHNCPGCE 311
+IH L+ C C+
Sbjct: 322 FIHNDLYFCIYCD 334
>gi|67528154|ref|XP_661887.1| hypothetical protein AN4283.2 [Aspergillus nidulans FGSC A4]
gi|40739631|gb|EAA58821.1| hypothetical protein AN4283.2 [Aspergillus nidulans FGSC A4]
Length = 277
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 53/130 (40%), Positives = 86/130 (66%), Gaps = 2/130 (1%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FVREFF+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 106 DLRPTRYLLTLRYAQEFVREFFEQNPISQVGVIGLRDGLALRISDMSGNPTEHISAIQDL 165
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
G SLQN L++ +G L PS+G RE+ I++ +L + DPGDI +TI + KI
Sbjct: 166 RSQDPKGLPSLQNGLEMARGALFHTPSHGTREIFIVFGSLLSSDPGDIHQTITTLIDDKI 225
Query: 120 RCSVIGLSAE 129
R ++GL+A+
Sbjct: 226 RVGIVGLAAQ 235
>gi|323353892|gb|EGA85745.1| Ssl1p [Saccharomyces cerevisiae VL3]
Length = 273
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 56/127 (44%), Positives = 86/127 (67%), Gaps = 2/127 (1%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R A++ + FV EFFDQNP+SQ+G++ +++G+A ++ + G+P+ HI AL +
Sbjct: 140 DLRPNRHAMIIQYAIDFVHEFFDQNPISQMGIIIMRNGLAQLVSQVSGNPQDHIDALKSI 199
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
K G+ SLQNAL++ +GLL +P++ REVLI++ +LST DPGDI +TI KI
Sbjct: 200 RKQEPKGNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTTDPGDIHQTIDSLVSEKI 259
Query: 120 RCSVIGL 126
R G+
Sbjct: 260 RVKGFGV 266
>gi|429965596|gb|ELA47593.1| transcription factor ssl1, partial [Vavraia culicis 'floridensis']
Length = 381
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 90/310 (29%), Positives = 149/310 (48%), Gaps = 36/310 (11%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF PS V ++ F+++F+D+NP+S + L+ ++ C + S + G
Sbjct: 108 DFLPSFRYHVTTKLNHFIKKFYDENPISILSLLIYRNN--RCENYIVLDRMSETENFFGV 165
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
G +G+ SL N+L++ LS+ +E+L++ +L T D E ++ ++ K+
Sbjct: 166 AG-TGEFSLMNSLNVSLDFLSRDDHI--KEILVVTGSLYTID----FEEMRAFRDIKVH- 217
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
I L AE++ K++ Q T G Y V ++ S L +E P +I + NL+K+GFP
Sbjct: 218 -FIALRAEVYFFKNIAQKTHGKYYVPVEVSDI-SLFLESFCMPNSINSSSALNLLKLGFP 275
Query: 182 QRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLF 241
Q A G + C + K G Y CP C +VC LP C IC QLV+S L++S ++ +
Sbjct: 276 Q-ALTGDLVCACCFRCSKKG--YECPVCHTKVCNLPIKCPICKTQLVTSNILSQSLYYCY 332
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
P+ F++ +C C G G+ C KC +C C +IH
Sbjct: 333 PLEDFEKSNGVC---------RVCGG-------DGSD-----QCKKCMSVYCEGCCYFIH 371
Query: 302 ESLHNCPGCE 311
L+ C C+
Sbjct: 372 NDLYFCIYCD 381
>gi|383141961|gb|AFG52334.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141967|gb|AFG52337.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141975|gb|AFG52341.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141977|gb|AFG52342.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141987|gb|AFG52347.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
Length = 74
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 56/75 (74%), Positives = 59/75 (78%), Gaps = 3/75 (4%)
Query: 198 VKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDP 257
VKVG GYTCPRCKARVCELPT+C ICGL LVSSPHLARSYHHLFP+ PF+EV ND
Sbjct: 1 VKVGGGYTCPRCKARVCELPTECHICGLTLVSSPHLARSYHHLFPVTPFEEVLRTSSND- 59
Query: 258 RNRSRSTCFGCQQSL 272
R TCFGCQQ L
Sbjct: 60 --RLPRTCFGCQQFL 72
>gi|402469018|gb|EJW04086.1| hypothetical protein EDEG_01608 [Edhazardia aedis USNM 41457]
Length = 341
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 84/316 (26%), Positives = 147/316 (46%), Gaps = 46/316 (14%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF PS ++E FV +F + NP+S + + + + + +L +
Sbjct: 63 DFLPSFRISTKNKLEKFVEKFSEDNPISTLTVCVNSNKTSESMQNLNSPGD--------- 113
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
G SL+ AL+ +L S +E+L++ +LS D + TI+ ++ I+
Sbjct: 114 ----GFFSLKKALETGYEVLKD--STNIKEMLLIIFSLSINDRNGLETTIENVVKANIKV 167
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
++I L AE+ I K +C TGG + V LD++H++ +M P ++ I+ LI++GFP
Sbjct: 168 NIINLCAELKILKTVCDRTGGKFCVPLDDNHYETSLMSFL-TPTSVPNSTIS-LIQIGFP 225
Query: 182 --QRAGEGSISICSCHKEVKV--GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSY 237
+ + + + C CH ++ Y+C +C A +C LP +C IC SS L ++
Sbjct: 226 KVEVSDKNTPFCCVCHLKIIRPPFETYSCVKCSAILCSLPVECPICDTLNASSIGLCKNI 285
Query: 238 HHLFPIAPFDEVTPLCL--NDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLE 295
HH + + F C+ ND GC Q C KC FC +
Sbjct: 286 HHFYVLEDFVPSDGFCVVCNDK---------GCMQ--------------CQKCLNVFCRD 322
Query: 296 CDIYIHESLHNCPGCE 311
C+ ++H++++ C CE
Sbjct: 323 CNSFLHDNINFCFYCE 338
>gi|1737212|gb|AAC52046.1| basic transcription factor 2 p44, partial [Homo sapiens]
Length = 143
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 57/108 (52%), Positives = 75/108 (69%), Gaps = 10/108 (9%)
Query: 203 GYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSR 262
GY CP+C+A+ CELP +C+ICGL LVS+PHLARSYHHLFP+ F E+ PL + N R
Sbjct: 36 GYFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEI-PL---EEYNGER 91
Query: 263 STCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
C+GCQ L +YV C C+ FC++CD+++H+SLH CPGC
Sbjct: 92 -FCYGCQGEL----KDQHVYV-CAVCQNVFCVDCDVFVHDSLHCCPGC 133
>gi|383141959|gb|AFG52333.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141963|gb|AFG52335.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141965|gb|AFG52336.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141969|gb|AFG52338.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141971|gb|AFG52339.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141973|gb|AFG52340.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141979|gb|AFG52343.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141981|gb|AFG52344.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141983|gb|AFG52345.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141985|gb|AFG52346.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141989|gb|AFG52348.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
gi|383141991|gb|AFG52349.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
Length = 74
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 55/75 (73%), Positives = 58/75 (77%), Gaps = 3/75 (4%)
Query: 198 VKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDP 257
VKVG GYTCPRCKAR CELPT+C ICGL LVSSPHLARSYHHLFP+ PF+EV ND
Sbjct: 1 VKVGGGYTCPRCKARACELPTECHICGLTLVSSPHLARSYHHLFPVTPFEEVLRTSSND- 59
Query: 258 RNRSRSTCFGCQQSL 272
R TCFGCQQ L
Sbjct: 60 --RLPRTCFGCQQFL 72
>gi|426384438|ref|XP_004058775.1| PREDICTED: general transcription factor IIH subunit 2-like [Gorilla
gorilla gorilla]
Length = 260
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 53/113 (46%), Positives = 75/113 (66%), Gaps = 2/113 (1%)
Query: 50 SPESHIKALMG--KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDI 107
+P HI +L + C G+ SL N+L + L +P + REVLI++S+L+TCDP +I
Sbjct: 96 NPRKHITSLKKAVDMTCHGEPSLYNSLSMAMQTLKHMPGHTSREVLIIFSSLTTCDPSNI 155
Query: 108 METIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEH 160
+ I+ K +KIR SVIGLSAE+ +C L ++TGG Y V LDESH+KEL+ H
Sbjct: 156 YDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGIYHVILDESHYKELLTHH 208
>gi|361068769|gb|AEW08696.1| Pinus taeda anonymous locus CL1234Contig1_04 genomic sequence
Length = 74
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 54/75 (72%), Positives = 58/75 (77%), Gaps = 3/75 (4%)
Query: 198 VKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDP 257
VKVG GYTCPRCKA VCELPT+C ICGL LVSSPHLARSYHHLFP+ PF++V ND
Sbjct: 1 VKVGGGYTCPRCKAHVCELPTECHICGLTLVSSPHLARSYHHLFPVTPFEKVLRTSSND- 59
Query: 258 RNRSRSTCFGCQQSL 272
R TCFGCQQ L
Sbjct: 60 --RLPRTCFGCQQFL 72
>gi|147803583|emb|CAN77666.1| hypothetical protein VITISV_035971 [Vitis vinifera]
Length = 328
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/68 (75%), Positives = 60/68 (88%)
Query: 44 LTDLGGSPESHIKALMGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCD 103
+ DLGGS +SH+KALMGKL CSGDSSLQNAL+LV+G L+ IPSYG+REVLILYSALS CD
Sbjct: 114 MQDLGGSIDSHVKALMGKLECSGDSSLQNALNLVRGYLNLIPSYGYREVLILYSALSACD 173
Query: 104 PGDIMETI 111
GDI+ T+
Sbjct: 174 TGDIIPTL 181
>gi|294942110|ref|XP_002783381.1| TFIIH basal transcription factor complex p44 subunit, putative
[Perkinsus marinus ATCC 50983]
gi|239895836|gb|EER15177.1| TFIIH basal transcription factor complex p44 subunit, putative
[Perkinsus marinus ATCC 50983]
Length = 188
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 103/195 (52%), Gaps = 20/195 (10%)
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFP 181
SVI LS E++ + + + TGG++SVA HFK L+ +H PP ++ + IKMGFP
Sbjct: 8 SVISLSPELYALRKVVEMTGGTFSVATSPIHFKRLMQKHLTPPNWVSS---PSYIKMGFP 64
Query: 182 -QRAGEGSISI-----CSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLAR 235
+RA +G+ + C CH ++ Y CP+C + VCE+P +C +C L LV L +
Sbjct: 65 VRRACDGNHTADPPIKCMCHNRLQKTFVYICPQCHSPVCEIPVNCPVCRLPLVDDDALKK 124
Query: 236 SYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLE 295
+ H++ + P + P ++ P++ TC C G + C +C C E
Sbjct: 125 HHRHIYSM-PTYTLLPT-VDYPKSY---TCQFCGTDFTEGGAR------CDQCLSDVCYE 173
Query: 296 CDIYIHESLHNCPGC 310
CD++ H L +CPGC
Sbjct: 174 CDMFAHNKLRHCPGC 188
>gi|194383680|dbj|BAG59198.1| unnamed protein product [Homo sapiens]
Length = 203
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 52/127 (40%), Positives = 79/127 (62%), Gaps = 4/127 (3%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKI 119
+ C G+ SL N+L + L +P + REVLI++S+L+TCDP +I + I+ + KI
Sbjct: 135 VDMTCHGEPSLYNSLSIAMQTLKHMPGHTSREVLIIFSSLTTCDPSNIYDLIK--VDQKI 192
Query: 120 RCSVIGL 126
+ VI +
Sbjct: 193 KTKVITV 199
>gi|387592763|gb|EIJ87787.1| hypothetical protein NEQG_01859 [Nematocida parisii ERTm3]
gi|387595388|gb|EIJ93012.1| hypothetical protein NEPG_01967 [Nematocida parisii ERTm1]
Length = 366
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 89/319 (27%), Positives = 145/319 (45%), Gaps = 30/319 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF PSR V K V F REF + NPLS +GL V G++ ++ + E I+ +
Sbjct: 59 DFLPSRKYYVKKGVMKFFREFIESNPLSTVGLAVVCGGMSYLVSSI-LCEEEEIEMCFSQ 117
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTC--DPGDIMETIQKCKESKI 119
+G+ SL +A + + +EV+ + S + P + T+ I
Sbjct: 118 NEGAGNFSLGSAFETINEFFHGCSFL--KEVVFIISGFTFVGSSPFILKSTLIN---KGI 172
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
I ++ EM I + L +++GG + + L+ PPP A A +++K+G
Sbjct: 173 IMHTIHMAGEMEILRKLSEESGGVFGIVNCPEDLSTLLGLICIPPPHSAS-ARLSMLKIG 231
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP E +IC+CH ++K GY CP C +VC++P C +C L + HL ++ H
Sbjct: 232 FPSPIHEN--TICACHLDLK-EWGYECPFCTTKVCKVPGVCPVCENILSAPVHLLKALH- 287
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
+ + P+ + + TC GC + N L+ +CP CK CL+C +
Sbjct: 288 ------WSDSAPIYV----AAIKGTCRGCFE------NNIQLH-SCPGCKNSLCLDCTRF 330
Query: 300 IHESLHNCPGCESLRHSNP 318
I + L+ C C S P
Sbjct: 331 IRQELNFCIFCPGAVSSVP 349
>gi|378754973|gb|EHY65001.1| hypothetical protein NERG_02057 [Nematocida sp. 1 ERTm2]
Length = 357
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 90/321 (28%), Positives = 141/321 (43%), Gaps = 50/321 (15%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DF PSR V K V F REF + NPLS +GL V G++ ++ + E I+ +
Sbjct: 59 DFLPSRKYYVKKGVIKFFREFTESNPLSTVGLAVVCGGMSYLVSSI-LCEEDEIEMCFAQ 117
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSAL------------STCDPGDIME 109
G SL +A + V+ Q S+ +EV+ + S S + G IM
Sbjct: 118 NEGQGGFSLGSAFETVKEFF-QGCSF-LKEVIFIVSGFTFVGRSPFILKNSLINKGIIMH 175
Query: 110 TIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAE 169
TI ++ EM I + + ++ GG + + L+ PPP A
Sbjct: 176 TIH-------------MAGEMEILRKISEECGGIFGIVNCPEDLSTLLGLICIPPPHSAS 222
Query: 170 FAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVS 229
A +++K+GFP E +IC+CH E+K GY CP C +VC++P C IC L +
Sbjct: 223 -ARLSMLKIGFPSSIQEN--TICACHLELK-EWGYECPFCTTKVCKVPGVCPICENILSA 278
Query: 230 SPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCK 289
HL ++ H + + P+ + + R C GC + + CP CK
Sbjct: 279 PVHLLKALH-------WSDSAPIYAPEGKGR----CRGCSEENIQLN-------GCPGCK 320
Query: 290 KHFCLECDIYIHESLHNCPGC 310
C++C +I + L+ C C
Sbjct: 321 SRLCMDCTGFIRQELNFCIFC 341
>gi|402592150|gb|EJW86079.1| hypothetical protein WUBG_03012, partial [Wuchereria bancrofti]
Length = 220
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 83/141 (58%), Gaps = 2/141 (1%)
Query: 3 FRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MG 60
+PSR+AV K + F+ +F +QNP+SQ+G+V KD A L L G+ ++L +
Sbjct: 80 MQPSRLAVTIKALNQFLDKFSEQNPISQVGVVVCKDKRAERLIPLTGNVRLVKESLSTIT 139
Query: 61 KLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIR 120
+L C G+ SL N+L L P + REV+++ ++LSTCDP I T + K IR
Sbjct: 140 ELLCHGEFSLHNSLMATIRSLHSYPGHASREVILIVASLSTCDPSSIFGTFELLKRYHIR 199
Query: 121 CSVIGLSAEMFICKHLCQDTG 141
CSVI LSAE+F+ K LC T
Sbjct: 200 CSVISLSAEIFVFKKLCSITS 220
>gi|70917739|ref|XP_732957.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56504309|emb|CAH83594.1| hypothetical protein PC300587.00.0 [Plasmodium chabaudi chabaudi]
Length = 161
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 86/145 (59%), Gaps = 2/145 (1%)
Query: 4 RPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG--K 61
+P+R+ V+ + VE F++ FF +NP+ +G+V +K+ A + L + + + +L+ K
Sbjct: 1 KPNRITVIFECVEIFLKNFFFKNPVGHVGVVALKNSSAKLIQQLTSNIDDVLNSLVKEQK 60
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC 121
G G SLQ L++ LL +P YG +E+LI+Y ++ TCD +I++ + ++ +
Sbjct: 61 EGLQGSPSLQEGLEIAHNLLMDMPLYGTKEILIMYGSIRTCDKKNILKYLDLLIKNNMYV 120
Query: 122 SVIGLSAEMFICKHLCQDTGGSYSV 146
+ I ++ EM I KH+C+ T G Y +
Sbjct: 121 NCISIAPEMHILKHICEKTHGIYKI 145
>gi|85000913|ref|XP_955175.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65303321|emb|CAI75699.1| hypothetical protein, conserved [Theileria annulata]
Length = 319
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 113/220 (51%), Gaps = 33/220 (15%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
MDF+P R+ ++ F+ + P++Q+G++ +++ + N +T G +P+ ++ L
Sbjct: 108 MDFKPDRLYCAFNSLKEFMNVLYGSGPITQVGIIVMRNKICNVITQFGTNPDEQMELLSN 167
Query: 61 KL--GCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
L G G SSLQN L++ ++ ++P Y RE+LI++ + T DPG+I+ T+ K K++
Sbjct: 168 ILKDGPEGSSSLQNGLEMCLKIMCELPYYMTREILIIFGSNKTLDPGNILITLDKLKQNF 227
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKM 178
I + I LS E++I K G + V+ PP I+ A +LI
Sbjct: 228 ITVNCISLSPELYILK-----GCGIFMVS----------------PPDISR-AFHHLI-- 263
Query: 179 GFPQRAG---EGSISICSCHKEVKVGVGYTCPRCKARVCE 215
P ++ E +I+ C+ + + +GY C C+ CE
Sbjct: 264 --PPKSFHKIERNINCSGCN--LNIEIGYECQNCQGIFCE 299
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 17/88 (19%)
Query: 223 CGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLY 282
CG+ +VS P ++R++HHL P F ++ RN + C GC ++ +
Sbjct: 245 CGIFMVSPPDISRAFHHLIPPKSFHKIE-------RNIN---CSGCNLNI-------EIG 287
Query: 283 VACPKCKKHFCLECDIYIHESLHNCPGC 310
C C+ FC CD YIH+ LH CP C
Sbjct: 288 YECQNCQGIFCEYCDKYIHQDLHQCPIC 315
>gi|269859684|ref|XP_002649566.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
gi|220066929|gb|EED44398.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
Length = 346
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/311 (25%), Positives = 148/311 (47%), Gaps = 28/311 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
++ P+ + + ++ F F NPLS+I T V N K L+
Sbjct: 60 EYLPTIRSFILPELSDFSDNFVKLNPLSKITFTTFNYKVLNTAFKFD-------KNLLSS 112
Query: 62 LGCSGDSSLQNALDLVQGLL-SQIPSYGHREVLILYSALSTCDPG-DIMETIQKCKESKI 119
G +G S + L + G+ ++I S +E L++ S++ + + + +++ + KI
Sbjct: 113 YG-TGHFSFTSCLSYINGITDTKITS---KECLLIISSVGVINEEINFDKVMKEIIQKKI 168
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
+ ++I + E+ + K +C TGG Y V ++ FK ++ + PP ++ +LI++G
Sbjct: 169 KINIISICGEVTLLKKVCISTGGKYVVPINSFQFKSILSDFLYPPESLD--LKCSLIEVG 226
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP+ + C CH + V + CPRCK VC +P+ C IC L+L+SS + +
Sbjct: 227 FPKNIYTQHL--CVCHLNL-CSVLFECPRCKGLVCTIPSTCPICKLELISSIDICDLICY 283
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
+ + PF ++ + + ++ C+GC++ + S C KC FC CD
Sbjct: 284 NYYLEPFIKIAT--SKELSSLEKNKCYGCEKLEVIS--------VCNKCLSPFCYNCDAK 333
Query: 300 IHESLHNCPGC 310
+H ++ CP C
Sbjct: 334 LHNVINFCPFC 344
>gi|407425981|gb|EKF39551.1| DNA repair and transcription factor protein [Trypanosoma cruzi
marinkellei]
Length = 356
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 83/346 (23%), Positives = 138/346 (39%), Gaps = 50/346 (14%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D+ P+R+ + Q+ FV + D NPL+ +G+VT++DGVA+ L + I+ L K
Sbjct: 19 DYLPTRLLALRPQLGRFVHAYLDANPLASLGVVTMRDGVAHRLNSCTTNASDIIQTLEVK 78
Query: 62 ---LGCSGDSSLQNALDLVQGLLSQI-----------PSYGHRE----------VLILYS 97
G SG SL+N L L + P G +E +L++ S
Sbjct: 79 YFLFGGSGAMSLENGLRFALSELVDMKRIAKRVRRTEPRAGGKEAREEPTARLRILLVSS 138
Query: 98 ALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELI 157
+++ DP D+ I+ + +IR V+ + + + TGG+ ++ H E++
Sbjct: 139 SVTIIDPHDVFGVIKMLAKLRIRVDVVSFCGAVHVFEEAAVSTGGTLYTPMNYDHLTEVL 198
Query: 158 MEHAPPPPA-IAEFAIANLIKMGFP-------QRAGEGSISICSCHKEVKVGVGYT-CPR 208
A P I +I++GFP GEG Y CP+
Sbjct: 199 GRLASPDKRHICPEERPTMIRIGFPVYIECEDSEGGEGG------------STSYVACPQ 246
Query: 209 CKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRS----T 264
C +P+ C +C L L S+P + ++ + P E +
Sbjct: 247 CALMQTSIPSTCPLCKLLLCSAPLIHNTFIANNELCPVSENVLDAFEGAEKPTEDYLNER 306
Query: 265 CFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
C CQ ++ K L C C C+ C+ Y+ E + CP C
Sbjct: 307 CSLCQYGMMGEDGK-ALVWRCNSCHCLRCVSCEKYVRERIGLCPTC 351
>gi|426384446|ref|XP_004058778.1| PREDICTED: general transcription factor IIH subunit 2-like, partial
[Gorilla gorilla gorilla]
Length = 119
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 62/129 (48%), Positives = 82/129 (63%), Gaps = 15/129 (11%)
Query: 144 YSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVG 203
Y V LDESH+KEL+ H PPPA + +LI+MG + +G+ + +G G
Sbjct: 2 YHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMG---KHLDGNTE-----PGLTLG-G 51
Query: 204 YTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRS 263
Y CP+C+A+ CELP +C+ICGL LVS+PHLARSYHHLFP+ F E+ PL + N R
Sbjct: 52 YFCPQCRAKYCELPVECKICGLTLVSAPHLARSYHHLFPLDAFQEI-PL---EEYNGER- 106
Query: 264 TCFGCQQSL 272
C+GCQ L
Sbjct: 107 FCYGCQGEL 115
>gi|269860338|ref|XP_002649891.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
gi|220066731|gb|EED44204.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
Length = 346
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 77/311 (24%), Positives = 148/311 (47%), Gaps = 28/311 (9%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
++ P+ + + ++ F F NPLS+I V N K L+
Sbjct: 60 EYLPTIRSFILPELSDFSDNFVKLNPLSKITFTIFNYKVLNTAFKFD-------KNLLSS 112
Query: 62 LGCSGDSSLQNALDLVQGLL-SQIPSYGHREVLILYSALSTCDPG-DIMETIQKCKESKI 119
G +G S + L + G+ ++I S +E L++ S++ + + + +++ + KI
Sbjct: 113 YG-TGHFSFTSCLSYINGITDTKITS---KECLLIISSVGVINKEINFDKVMKEIIQKKI 168
Query: 120 RCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMG 179
+ ++I + E+ + K +C TGG Y V ++ FK ++ + PP ++ +LI++G
Sbjct: 169 KINIISICGEVTLLKKVCISTGGKYVVPINSFQFKSILSDFLYPPESLD--LKCSLIEVG 226
Query: 180 FPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHH 239
FP+ + C+CH + V + CP+CK VC +P+ C IC L+L+SS + +
Sbjct: 227 FPKNIYTQHL--CACHLNL-CSVLFECPKCKGLVCTIPSACPICKLELISSIDICDLICY 283
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
+ + PF ++ + + ++ C+GC++ + S C KC FC CD
Sbjct: 284 NYHLEPFIKIAT--SKELSSLEKNKCYGCEKLEVIS--------VCNKCLSPFCYNCDAK 333
Query: 300 IHESLHNCPGC 310
+H ++ CP C
Sbjct: 334 LHNVINFCPFC 344
>gi|71664710|ref|XP_819333.1| DNA repair and transcription factor protein [Trypanosoma cruzi
strain CL Brener]
gi|70884629|gb|EAN97482.1| DNA repair and transcription factor protein, putative [Trypanosoma
cruzi]
Length = 356
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/350 (24%), Positives = 149/350 (42%), Gaps = 58/350 (16%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D+ P+R+ + Q+ FV + D NPL+ +G+VT++DGVA+ L + I+ L K
Sbjct: 19 DYLPTRLLALRPQLGRFVHAYLDANPLASLGVVTMRDGVAHRLNSCTTNASDIIQTLEVK 78
Query: 62 ---LGCSGDSSLQNALDLVQGLLSQI-----------PSYGHRE----------VLILYS 97
G SG SL+N L L + P G +E +L++ S
Sbjct: 79 YFLFGGSGAMSLENGLRFALSELVDMKRIAKRVRRTEPHAGGKEAREEPTARLRILLVSS 138
Query: 98 ALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELI 157
+++ DP D+ I+ + ++R VI + + TGG+ ++ H E++
Sbjct: 139 SVTVIDPHDVFGVIKMMAKLRVRVDVISFCGAVHVFGEAAVSTGGTLYTPMNYDHLTEVL 198
Query: 158 -MEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVG-----YTCPRCKA 211
+ +P + + +I++GFP +C KE + G G CP+C
Sbjct: 199 GLLASPDKHHVCQEERPAMIRIGFP--------VYINC-KEPEGGEGGSTNYVACPQCAL 249
Query: 212 RVCELPTDCRICGLQLVSSPHLARSY---HHLFPIAP--------FDEVTPLCLNDPRNR 260
+P+ C +C L L S+P + ++ + L P++ D+ T LN+
Sbjct: 250 VQTSIPSTCPLCKLLLCSAPLIHNTFIAHNELCPVSENVVDAFEGTDKPTEDYLNE---- 305
Query: 261 SRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
C CQ ++ + L C C C+ C+ Y+ E + CP C
Sbjct: 306 ---RCSLCQYGMMGEDGR-ALVWRCNSCHCLRCVSCEKYVRERIGLCPTC 351
>gi|222871523|gb|EEF08654.1| predicted protein [Populus trichocarpa]
Length = 104
Score = 93.2 bits (230), Expect = 2e-16, Method: Composition-based stats.
Identities = 42/77 (54%), Positives = 59/77 (76%)
Query: 84 IPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS 143
+P + REVLI++S+L+TCDP +I + I+ K +KIR SVIGLSAE+ +C L ++TGG+
Sbjct: 1 MPGHTSREVLIIFSSLTTCDPSNIYDLIKTLKAAKIRVSVIGLSAEVRVCTVLARETGGT 60
Query: 144 YSVALDESHFKELIMEH 160
Y V LDESH+KEL+ H
Sbjct: 61 YHVILDESHYKELLTHH 77
>gi|407843613|gb|EKG01508.1| DNA repair and transcription factor protein [Trypanosoma cruzi]
Length = 397
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 86/350 (24%), Positives = 146/350 (41%), Gaps = 58/350 (16%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D+ P+R+ + Q+ FV + D NPL+ +G+VT++DGVA+ L + I+ L K
Sbjct: 60 DYLPTRLLALRPQLGRFVHAYLDANPLASLGVVTMRDGVAHRLNSCTTNASDIIQTLEVK 119
Query: 62 ---LGCSGDSSLQNALDLVQGLLSQI-----------PSYGHRE----------VLILYS 97
G SG SL+N L L + P G +E +L++ S
Sbjct: 120 YFLFGGSGAMSLENGLRFALSELVDMKRIAKRVRRTEPRAGGKEAREEPTARLRILLVSS 179
Query: 98 ALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELI 157
+++ DP D+ I+ + +R VI + + TGG+ ++ H E++
Sbjct: 180 SVTVIDPHDVFGVIKMMAKLHVRVDVISFCGAVHVFGEAAVSTGGTLYTPMNYDHLTEVL 239
Query: 158 MEHAPPPPA-IAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVG-----YTCPRCKA 211
A P + + +I++GFP C KE + G G CP+C
Sbjct: 240 GRLASPDKRHVCQEERPAMIRIGFP--------VYIDC-KEPEGGEGGSTNYVACPQCAL 290
Query: 212 RVCELPTDCRICGLQLVSSPHLARSY---HHLFPIAP--------FDEVTPLCLNDPRNR 260
+P+ C +C L L S+P + ++ + L P++ D+ T LN+
Sbjct: 291 VQTSIPSTCPLCKLLLCSAPLIHNTFIAHNELCPVSENVVDAFEGADKPTEDYLNE---- 346
Query: 261 SRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
C CQ ++ + L C C C+ C+ Y+ E + CP C
Sbjct: 347 ---RCSLCQYGMMGEDGR-ALVWRCNSCHCLRCVSCEKYVRERIGLCPTC 392
>gi|342182931|emb|CCC92411.1| putative DNA repair and transcription factor protein [Trypanosoma
congolense IL3000]
Length = 354
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 80/343 (23%), Positives = 135/343 (39%), Gaps = 45/343 (13%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D+ P+R+ + Q+ AFV F D NPL+ +G+V ++DGVA+ L + AL K
Sbjct: 19 DYLPTRLLALRPQLNAFVHAFLDDNPLASLGVVVMRDGVAHRLISCTANATDIAHALELK 78
Query: 62 ---LGCSGDSSLQNALDLVQGLLSQIPSYGHR-----------------------EVLIL 95
G SG SL+N L + L + R V+++
Sbjct: 79 YFLFGGSGAMSLENGLRMALSELVDLKRIAKRVRREAEMPGPNEEGLRPDHATQLRVILV 138
Query: 96 YSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKE 155
S+++ DP D+ + + ++R V+ + + TGG ++ H
Sbjct: 139 ASSVTLVDPHDVFRVQKVVAQLRVRVDVVSFCGAVHALQEAALTTGGQLYTPMNYEHLTG 198
Query: 156 LIMEHAPPPPAIAEFAIAN--LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARV 213
++ + A P +I++GFP+ E K CP+C
Sbjct: 199 ILQKLAAPERGSTHPLTERPAMIQIGFPRYV----------EAEGKEKRYMGCPQCGLIQ 248
Query: 214 CELPTDCRICGLQLVSSPHLARSY-HHLFPIAPFDEVTPLCLNDPRNRSRST-----CFG 267
+P+ C +C L L S+P + S+ PIAP +V P S
Sbjct: 249 TSVPSTCPLCKLLLCSAPLIHTSFITKNEPIAPSQKVESK-QKAPSGESDEAPSVHCSLC 307
Query: 268 CQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
C++ L G+ ++ C C + C C+ Y+ E + CP C
Sbjct: 308 CRRKPLGVGDGGMVFWRCSCCLRERCDVCEAYVKEGIGLCPTC 350
>gi|56753383|gb|AAW24895.1| SJCHGC07056 protein [Schistosoma japonicum]
Length = 150
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 72/134 (53%), Gaps = 28/134 (20%)
Query: 204 YTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDP----RN 259
Y CPRC A CELP +C +CGL LV++PHLAR+YHHLFP+ F+ V ++ +
Sbjct: 19 YACPRCHAAYCELPVECNVCGLTLVAAPHLARAYHHLFPLDLFEPVVDNNSSEKSLVNED 78
Query: 260 RSRST--------------------CFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIY 299
+S+ T C GC +++ PG CPKC FC CD
Sbjct: 79 KSKETINSSSDDNNNNNNNNNQLRVCTGC--NVIIPSRIPGY--QCPKCTFIFCHSCDAI 134
Query: 300 IHESLHNCPGCESL 313
+H+S+H+CPGC +L
Sbjct: 135 LHDSIHSCPGCLTL 148
>gi|72393367|ref|XP_847484.1| DNA repair and transcription factor protein [Trypanosoma brucei
TREU927]
gi|62359582|gb|AAX80015.1| DNA repair and transcription factor protein, putative [Trypanosoma
brucei]
gi|70803514|gb|AAZ13418.1| DNA repair and transcription factor protein, putative [Trypanosoma
brucei brucei strain 927/4 GUTat10.1]
Length = 351
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 83/342 (24%), Positives = 138/342 (40%), Gaps = 44/342 (12%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
+D+ P+R+ + Q+ FV + D NPL+ +G+V ++DGVA L + ++ L
Sbjct: 18 LDYLPTRLIALRPQLNKFVHTYLDANPLASLGVVVMRDGVAQRLIPCTTNATDIVQTLEL 77
Query: 61 K---LGCSGDSSLQNALDLVQGLLSQIPSYGHR---------------------EVLILY 96
K G SG SL+N L + L + R +++
Sbjct: 78 KYFRFGGSGAMSLENGLRMALSELVDLRRIAKRLRENNSGSTGSDVQEDVTSRLRIVLAS 137
Query: 97 SALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKEL 156
S+++ DP D+ + + ++R VI + + + TGGS ++ H E+
Sbjct: 138 SSVTIVDPLDVFAVQKIVAQLRVRVDVISFCGAVHVLQEAAVLTGGSLHTPMNYEHLTEI 197
Query: 157 IMEHAPPPPAIAEFAIAN--LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
+ A P A LI++GFP G+ E + TCP+C
Sbjct: 198 LRHLASPEGGSARHVDVQPALIQIGFPMYVGD----------EAEGKRYLTCPQCGLIQT 247
Query: 215 ELPTDCRICGLQLVSSPHLARSYHHLFPI-APFDEVTPLCLNDPRNRSRST-----CFGC 268
+P+ C +C L L S P + ++ + AP +V + + C C
Sbjct: 248 SIPSTCSLCKLLLFSVPLIHVTFITRNELCAPSQKVEHVTKKKTTAAAGGAADTARCSLC 307
Query: 269 QQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
Q+ L +G G Y C C C C+ YI E+L CP C
Sbjct: 308 QRKL--AGEDGGGYWCCGCCLCERCEACETYIRENLGLCPTC 347
>gi|340055688|emb|CCC50009.1| putative DNA repair and transcription factor protein, fragment
[Trypanosoma vivax Y486]
Length = 355
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 64/262 (24%), Positives = 110/262 (41%), Gaps = 39/262 (14%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D+ P+R+ + Q+ AFV +FD+NPL+ +G+V ++DGVA+ + + ++AL K
Sbjct: 19 DYLPTRLLALRPQLHAFVHTYFDENPLAALGVVVMRDGVAHRICSCTTNATDILQALEVK 78
Query: 62 ---LGCSGDSSLQNALDLVQGLLSQIPSYGHR---------------------------E 91
G SG SL+N L L L ++ R
Sbjct: 79 YFLFGGSGAMSLENGLRLALSELVELKRIAKRLRQDGVDTGNGSNGSRGPTLEQPSARLR 138
Query: 92 VLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDES 151
++++ S+++ DP D+ + ++R VI S + + TGG L
Sbjct: 139 IVVISSSVTLVDPHDVFAVQRLVARLRVRVDVISFSGAVHALEEAAAVTGGMLHTPLGYD 198
Query: 152 HFKELIMEHAPPPPAIA--EFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRC 209
H ++ A P A E + +I++GFP + + S EV+ CP+C
Sbjct: 199 HLTTILRRLAAPDKMAARREGEKSPMIRIGFP-------LHMESAEAEVERRRFIACPQC 251
Query: 210 KARVCELPTDCRICGLQLVSSP 231
P+ C +C L + S P
Sbjct: 252 GLVQTATPSTCHLCKLLICSVP 273
>gi|261330741|emb|CBH13726.1| DNA repair and transcription factor protein,putative [Trypanosoma
brucei gambiense DAL972]
Length = 351
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 62/257 (24%), Positives = 106/257 (41%), Gaps = 36/257 (14%)
Query: 1 MDFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
+D+ P+R+ + Q+ FV + D NPL+ +G+V ++DGVA L + ++ L
Sbjct: 18 LDYLPTRLIALRPQLNKFVHTYLDANPLASLGVVVMRDGVAQRLIPCTTNATDIVQTLEL 77
Query: 61 K---LGCSGDSSLQNALDLVQGLLSQIPSYGHR---------------------EVLILY 96
K G SG SL+N L + L + R +++
Sbjct: 78 KYFRFGGSGAMSLENGLRMALSELVDLRRIAKRLRENNSGSTGSDVQEDVTSRLRIVLAS 137
Query: 97 SALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKEL 156
S+++ DP D+ + + ++R VI + + + TGGS ++ H E+
Sbjct: 138 SSVTIVDPLDVFAVQKIVAQLRVRVDVISFCGAVHVLQEAAVLTGGSLHTPMNYEHLTEI 197
Query: 157 IMEHAPPPPAIAEFAIAN--LIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVC 214
+ A P A LI++GFP G+ E + TCP+C
Sbjct: 198 LRHLASPEGGSARHVGVQPALIQIGFPMYVGD----------EAEGKRYLTCPQCGLIQT 247
Query: 215 ELPTDCRICGLQLVSSP 231
+P+ C +C L L S P
Sbjct: 248 SIPSTCPLCKLLLFSVP 264
>gi|95007103|emb|CAJ20323.1| ssl1-like protein [Toxoplasma gondii RH]
Length = 647
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/339 (26%), Positives = 127/339 (37%), Gaps = 56/339 (16%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
DFRP R+ V + E F+ F QNPL+Q+ V + A L G +
Sbjct: 291 DFRPDRLRCVCQLAEEFIGTFLLQNPLAQLAQVALIGPSAEPLAARGEAARDSAAVSARP 350
Query: 62 LGC---------------------------SGDSSLQNALDLVQGLLSQIPSYGHREVLI 94
G +G SL N L L + LL+ +P Y REVL+
Sbjct: 351 SGSASMQLFSSSASESIAALREKRKHPGVGTGVPSLNNGLHLAKDLLAGVPPYCTREVLV 410
Query: 95 LYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFK 154
L+ +L TCD G I ETI K+S I C+VI L+AE+ + K G S S
Sbjct: 411 LFGSLRTCDVGCIEETIAAVKKSNICCNVICLAAELHVLKFSESLPGNGRSTRCASSPRA 470
Query: 155 ELIMEHAPPPPAI---AEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKA 211
HA PA A+++ + F + G S + P +
Sbjct: 471 SARSSHATHSPASLVCQHAALSHPHGISFAEVDEHGGPLQLSPTPHFLL---LRLPPVRR 527
Query: 212 RVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQS 271
P + L+S+P LA + + P D ++ C C
Sbjct: 528 EALHHPQPLSM----LLSAPRLAGRH---LAVVPADA----------PEAKRLCACCTTQ 570
Query: 272 LLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
L G + CP C + FC +CDIY HE L C C
Sbjct: 571 LDRGGAQ------CPDCGEIFCHDCDIYSHEQLRQCAFC 603
>gi|115432926|ref|XP_001216600.1| hypothetical protein ATEG_07979 [Aspergillus terreus NIH2624]
gi|114189452|gb|EAU31152.1| hypothetical protein ATEG_07979 [Aspergillus terreus NIH2624]
Length = 202
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 59/92 (64%), Gaps = 2/92 (2%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + FVREFF+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 108 DLRPTRYLLTLRYAQEFVREFFEQNPISQLGVLGLRDGLAVRISDMSGNPTEHISAIQAL 167
Query: 60 GKLGCSGDSSLQNALDLVQGLLSQIPSYGHRE 91
G SLQN L++ +G L + + HRE
Sbjct: 168 RDQDPKGLPSLQNGLEMARGALLYVLTLNHRE 199
>gi|147832400|emb|CAN64423.1| hypothetical protein VITISV_032275 [Vitis vinifera]
Length = 578
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 32/41 (78%), Positives = 38/41 (92%)
Query: 20 REFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG 60
+EFFDQ+P SQIGLVT+KDG+A CLTDLGGSP+SH+KALM
Sbjct: 199 KEFFDQDPFSQIGLVTIKDGLAQCLTDLGGSPDSHVKALMA 239
>gi|324562745|gb|ADY49854.1| General transcription factor IIH subunit 2, partial [Ascaris suum]
Length = 85
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 47/81 (58%), Gaps = 3/81 (3%)
Query: 178 MGFPQRAGEGSISICSCHKEVKV---GVGYTCPRCKARVCELPTDCRICGLQLVSSPHLA 234
MGFP + C CH+ G G+ CP+C AR C LP +CR+C L L+S+P LA
Sbjct: 1 MGFPSHQIIQKPAFCLCHQSENRPPGGRGFLCPQCGARYCSLPVECRVCKLMLISAPQLA 60
Query: 235 RSYHHLFPIAPFDEVTPLCLN 255
RS+HHL P+ F EV C+
Sbjct: 61 RSFHHLLPLPAFKEVIDYCVK 81
>gi|123446514|ref|XP_001312007.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121893838|gb|EAX99077.1| hypothetical protein TVAG_290240 [Trichomonas vaginalis G3]
Length = 166
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 77/137 (56%), Gaps = 13/137 (9%)
Query: 8 MAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKLGCS 65
M ++ Q F+ +FF QNPLSQ+ ++ + LT L + + H+K + +
Sbjct: 1 MKLIQDQTIDFINDFFVQNPLSQLSILATYESTCRILTPLSCNVQDHVKKIKSLSLEDHG 60
Query: 66 GDSSLQNALDLVQGLL---------SQIPSYGHREVLILYSALSTCDPGDIMETIQKCKE 116
G+ SL+++L + +L +QI + +EVLI+Y +L+TCD I +T+ ++
Sbjct: 61 GEPSLESSLSIATAILGNGEKNPGLAQIST--TKEVLIVYGSLTTCDNSPIYKTLNLVRD 118
Query: 117 SKIRCSVIGLSAEMFIC 133
SKI+ S+IGL A++F+
Sbjct: 119 SKIKVSIIGLGAKVFVL 135
>gi|154318002|ref|XP_001558320.1| hypothetical protein BC1G_02984 [Botryotinia fuckeliana B05.10]
Length = 211
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 57/89 (64%), Gaps = 4/89 (4%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHI---KAL 58
D RP+R + + FV E+F+QNP+SQ+G++ ++DG+A ++D+ G+P HI KAL
Sbjct: 122 DMRPTRYLLTIRYASEFVTEYFEQNPISQLGIIGMRDGIAVRISDMSGNPTEHIERLKAL 181
Query: 59 MGKLGCSGDSSLQNALDLVQGLLSQIPSY 87
G G+ SLQNAL++ + L PS+
Sbjct: 182 RVDQG-QGNPSLQNALEMSRAALFHAPSH 209
>gi|26347121|dbj|BAC37209.1| unnamed protein product [Mus musculus]
Length = 201
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 49/82 (59%), Gaps = 2/82 (2%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLL 81
+ C G+ SL N+L + L
Sbjct: 135 VDMTCHGEPSLYNSLSMAMQTL 156
>gi|30584663|gb|AAP36584.1| Homo sapiens general transcription factor IIH, polypeptide 2, 44kDa
[synthetic construct]
Length = 166
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 50/85 (58%), Gaps = 2/85 (2%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQI 84
+ C G+ SL N+L + L +
Sbjct: 135 VDMTCHGEPSLYNSLSIAMQTLKLV 159
>gi|13529149|gb|AAH05345.1| GTF2H2 protein [Homo sapiens]
gi|30582385|gb|AAP35419.1| general transcription factor IIH, polypeptide 2, 44kDa [Homo
sapiens]
Length = 165
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 50/85 (58%), Gaps = 2/85 (2%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMG- 60
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P HI +L
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKHITSLKKA 134
Query: 61 -KLGCSGDSSLQNALDLVQGLLSQI 84
+ C G+ SL N+L + L +
Sbjct: 135 VDMTCHGEPSLYNSLSIAMQTLKLV 159
>gi|238503359|ref|XP_002382913.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Aspergillus
flavus NRRL3357]
gi|220691723|gb|EED48071.1| RNA polymerase TFIIH complex subunit Ssl1, putative [Aspergillus
flavus NRRL3357]
Length = 87
Score = 70.5 bits (171), Expect = 1e-09, Method: Composition-based stats.
Identities = 32/82 (39%), Positives = 53/82 (64%), Gaps = 2/82 (2%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--M 59
D RP+R + + + VREFF+QNP+SQ+G++ ++DG+A ++DL G+P HI A+ +
Sbjct: 5 DLRPTRYLLTLRYAQELVREFFEQNPISQLGVLGLRDGLAIRISDLSGNPTEHISAIQTL 64
Query: 60 GKLGCSGDSSLQNALDLVQGLL 81
G SLQN +++ +G L
Sbjct: 65 RDQDPKGLPSLQNGIEMARGAL 86
>gi|241165648|ref|XP_002409694.1| Bcl-2-associated transcription factor (BTF), putative [Ixodes
scapularis]
gi|215494619|gb|EEC04260.1| Bcl-2-associated transcription factor (BTF), putative [Ixodes
scapularis]
Length = 123
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/83 (44%), Positives = 56/83 (67%), Gaps = 4/83 (4%)
Query: 111 IQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEF 170
+QKC IRCSV+GL+AE+ +C L + TGG+Y+V +DE+HFKE++ +HA PPP
Sbjct: 1 MQKCS---IRCSVVGLAAEVRVCCALTKATGGTYNVIMDENHFKEILFQHAIPPPVTGN- 56
Query: 171 AIANLIKMGFPQRAGEGSISICS 193
+ ++LI+M A S++I S
Sbjct: 57 SESSLIRMVHVSPACPPSLTIMS 79
>gi|321452740|gb|EFX64060.1| hypothetical protein DAPPUDRAFT_267127 [Daphnia pulex]
Length = 135
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 59/100 (59%), Gaps = 11/100 (11%)
Query: 59 MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETIQKCKESK 118
M C+G++SLQN+ + L +P++ RE+L L L I+ KE+
Sbjct: 39 MKDTSCNGETSLQNSFERAMCGLKNMPAHSSREMLTLLVIL-----------IKSFKENN 87
Query: 119 IRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIM 158
IR S+IGL+AE+ IC+ + + TGG+Y+V LD+ + KE ++
Sbjct: 88 IRVSIIGLAAEVRICQEIAKRTGGTYNVLLDDHYLKETLI 127
>gi|169806142|ref|XP_001827816.1| transcription-repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
gi|161779264|gb|EDQ31287.1| transcription-repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
Length = 187
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 86/183 (46%), Gaps = 15/183 (8%)
Query: 128 AEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEG 187
E+ + K +C TGG Y V ++ FK ++ + ++ +LI++ FP+
Sbjct: 18 GEVTLLKKVCISTGGKYVVLINSFQFKSILSDFLYLLESLD--LKCSLIEVEFPKNIYTQ 75
Query: 188 SISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARSYHHLFPIAPFD 247
+ C+CH + V + CP+CK VC +P C IC L+L+SS + + + + F
Sbjct: 76 HL--CACHLNL-CSVLFECPKCKGFVCTIPFTCPICKLELISSTDIRDLICYNYYLESFV 132
Query: 248 EVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNC 307
++ + + ++ C+GC++ + S C KC FC CD +H ++ C
Sbjct: 133 KIVT--SKELSSLEKNKCYGCEKLEVIS--------VCNKCLSPFCYNCDAKLHNVINFC 182
Query: 308 PGC 310
C
Sbjct: 183 SFC 185
>gi|119498681|ref|XP_001266098.1| transcription factor and DNA repair complex, core TFIIH, putative
[Neosartorya fischeri NRRL 181]
gi|119414262|gb|EAW24201.1| transcription factor and DNA repair complex, core TFIIH, putative
[Neosartorya fischeri NRRL 181]
Length = 157
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 53/80 (66%), Gaps = 2/80 (2%)
Query: 4 RPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGK 61
RP+R + + + FVREFF+QNP+SQ+G++ ++DG+A ++D+ G+P HI A+ +
Sbjct: 77 RPTRYLLSLRYAQEFVREFFEQNPISQLGVLGLRDGLAIRVSDMSGNPTEHISAIQALRD 136
Query: 62 LGCSGDSSLQNALDLVQGLL 81
G SLQN L++ +G L
Sbjct: 137 HDPKGLPSLQNGLEMARGAL 156
>gi|431907809|gb|ELK11416.1| Baculoviral IAP repeat-containing protein 1 [Pteropus alecto]
Length = 1274
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/97 (41%), Positives = 56/97 (57%), Gaps = 12/97 (12%)
Query: 139 DTGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRA--------GEGSIS 190
+ G+Y V LDESH+KEL+ H PPPA + +LI+MGFPQ + S S
Sbjct: 77 ELSGTYHVILDESHYKELLTHHVSPPPASSSSE-CSLIRMGFPQHTIASLSDQDAKPSFS 135
Query: 191 ICSCHKEVKVGV---GYTCPRCKARVCELPTDCRICG 224
+ + G+ GY CP+C+A+ CELP +C+ICG
Sbjct: 136 MAHLDSNTEPGLTLGGYFCPQCRAKYCELPVECKICG 172
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 39/63 (61%), Gaps = 5/63 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGS-----PESHIK 56
D +P+R+ K +E F+ E+FDQNP+SQIG++ K A LT+L G+ ESH K
Sbjct: 33 DLKPNRLTCTLKLLEYFIEEYFDQNPISQIGIIVTKSKRAEKLTELSGTYHVILDESHYK 92
Query: 57 ALM 59
L+
Sbjct: 93 ELL 95
>gi|253742551|gb|EES99376.1| TFIIH P44 [Giardia intestinalis ATCC 50581]
Length = 393
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 70/319 (21%), Positives = 121/319 (37%), Gaps = 32/319 (10%)
Query: 18 FVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGKLGCSGDSSLQNALDLV 77
+V+ PL ++ +T+++ + + + + L SG +L LD
Sbjct: 78 YVQFILSHRPLIRVAFMTLQNSLCYLELRFSSNLSAFEEVLRDGFRPSGHIALATGLDQA 137
Query: 78 QGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHL 136
LL S R +L + + +T D DI ++K ++++ + L+ + H+
Sbjct: 138 YELLMNSKSAASSRRILYINFSSATIDSLDIFNVLEKLLSAQVQVDTVSLTNSFGVLDHI 197
Query: 137 CQDTGGSYSVALDESH------------FKELIMEHAPPPPAIAEFAIANLIKMGFPQRA 184
+ TGG + + E+ E + + I +GFP
Sbjct: 198 SRKTGGRHLIIDRETKGGLLNLLKRLELRSEDVRITQATEEGDSNVDEGKFIMVGFPVHI 257
Query: 185 GEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLVSSPHLARS---YHHLF 241
+ +I C CH G Y CP C A VC++ ++C CGLQLV P+L R +
Sbjct: 258 CDDAI--CFCHNRHGTGF-YECPFCYAVVCDI-SECVCCGLQLVMYPNLYRGALLHKRAL 313
Query: 242 PIAPFDE--VTPLCLNDPRN--------RSRSTCFGCQQSLLSSGNKPGLYVACPKCKKH 291
D V P N +++S C CQ + ++ C C
Sbjct: 314 SFTKSDSSLVQPEAATSSANQAEDSKHIKAKSVCSLCQMQYGTDTDRSQYL--CSFCSGP 371
Query: 292 FCLECDIYIHESLHNCPGC 310
FC C + E+ +CP C
Sbjct: 372 FCKSCSELLAEAKVSCPVC 390
>gi|736404|gb|AAA64502.1| BTF2p44 [Homo sapiens]
Length = 130
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 35/54 (64%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHI 55
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+P I
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGNPRKXI 128
>gi|432104566|gb|ELK31178.1| Baculoviral IAP repeat-containing protein 1 [Myotis davidii]
Length = 1414
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 39/63 (61%), Gaps = 5/63 (7%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGS-----PESHIK 56
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G+ ESH K
Sbjct: 94 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSGTYHVILDESHYK 153
Query: 57 ALM 59
L+
Sbjct: 154 ELL 156
>gi|159115826|ref|XP_001708135.1| TFIIH P44 [Giardia lamblia ATCC 50803]
gi|157436245|gb|EDO80461.1| TFIIH P44 [Giardia lamblia ATCC 50803]
Length = 406
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 58/247 (23%), Positives = 98/247 (39%), Gaps = 36/247 (14%)
Query: 18 FVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGKLGCSGDSSLQNALDLV 77
+V+ PL ++ +T+++ + + + + L SG +L LD
Sbjct: 78 YVQFILSHRPLIRVAFMTLQNSLCYLELRFSSNLSAFEEVLRDGFRPSGHIALATGLDQA 137
Query: 78 QGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHL 136
LL S R +L + + +T D DI ++K ++I+ ++ L+ + H+
Sbjct: 138 YELLMNNKSAASSRRILYVNFSNATLDSLDIFSVLEKLLAAQIQVDIVSLTNRFGVLDHI 197
Query: 137 CQDTGGSYSVA-----------------------LDESHFKELIMEH-----APPPPAIA 168
+ TGG + + L S +E EH A
Sbjct: 198 SRVTGGRHLIVDRDVKGGLLNLLKRLELRSEDTRLMHSKGRE---EHSLNLQASHNQTAT 254
Query: 169 EFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRICGLQLV 228
+ I +GFP + +I C CH G Y CP C A VC++ ++C CGLQLV
Sbjct: 255 DTEGGKFIMVGFPVHICDDAI--CFCHNRHGTGF-YECPFCYAVVCDI-SECVCCGLQLV 310
Query: 229 SSPHLAR 235
P+L R
Sbjct: 311 MYPNLYR 317
>gi|308162603|gb|EFO64989.1| TFIIH P44 [Giardia lamblia P15]
Length = 416
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 57/254 (22%), Positives = 98/254 (38%), Gaps = 40/254 (15%)
Query: 18 FVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGKLGCSGDSSLQNALDLV 77
+V+ PL ++ +T+++ + + + + L SG +L LD
Sbjct: 78 YVQFILSHRPLIRVAFMTLQNSLCYLELRFSSNLSAFEEVLRDGFRPSGHIALATGLDQA 137
Query: 78 QGLLSQIPSYGH-REVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHL 136
LL S R +L + + +T D DI ++K ++I+ V+ L+ + H+
Sbjct: 138 YDLLMNSGSAASSRRILYINFSNATLDSLDIFSVLEKLFAAQIQVDVVSLTNSFGVLDHI 197
Query: 137 CQDTGGSY----------------------------------SVALDESHFKELIMEHAP 162
+ TGG + ++ L SH + +
Sbjct: 198 SRTTGGRHLIVDREIKGGLLNLLKRLELRSEDAKIMQSKGEGTLNLQMSHNQIITDAKCT 257
Query: 163 PPPAIA-EFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCR 221
+A I +GFP + +I C CH G Y CP C A VC++ ++C
Sbjct: 258 KDTVLASNTEEGKFIMVGFPVHICDDAI--CFCHNRHGTGF-YECPFCYAIVCDI-SECV 313
Query: 222 ICGLQLVSSPHLAR 235
CGLQLV P+L R
Sbjct: 314 CCGLQLVMYPNLYR 327
>gi|222838932|gb|EEE77283.1| predicted protein [Populus trichocarpa]
Length = 72
Score = 54.7 bits (130), Expect = 5e-05, Method: Composition-based stats.
Identities = 23/48 (47%), Positives = 32/48 (66%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGG 49
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L G
Sbjct: 18 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSG 65
>gi|321454255|gb|EFX65433.1| hypothetical protein DAPPUDRAFT_264704 [Daphnia pulex]
Length = 72
Score = 53.9 bits (128), Expect = 1e-04, Method: Composition-based stats.
Identities = 29/85 (34%), Positives = 48/85 (56%), Gaps = 16/85 (18%)
Query: 84 IPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS 143
+P++ RE+L L L E+ IR S+IGL+AE+ IC+ + + TGG+
Sbjct: 1 MPAHSSREMLTLLVIL----------------ENNIRVSIIGLAAEVRICQEIAKRTGGT 44
Query: 144 YSVALDESHFKELIMEHAPPPPAIA 168
Y+V LD+ + KE ++ + P A+A
Sbjct: 45 YNVLLDDHYLKETLILNQVQPLAVA 69
>gi|149059194|gb|EDM10201.1| general transcription factor II H, polypeptide 2 (predicted),
isoform CRA_c [Rattus norvegicus]
Length = 159
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 32/49 (65%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGS 50
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L +
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSDT 123
>gi|148668488|gb|EDL00807.1| general transcription factor II H, polypeptide 2, isoform CRA_b
[Mus musculus]
Length = 159
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 32/49 (65%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGS 50
D +P+R+ K +E FV E+FDQNP+SQIG++ K A LT+L +
Sbjct: 75 DLKPNRLTCTLKLLEYFVEEYFDQNPISQIGIIVTKSKRAEKLTELSDT 123
>gi|294930956|ref|XP_002779725.1| btf, putative [Perkinsus marinus ATCC 50983]
gi|239889260|gb|EER11520.1| btf, putative [Perkinsus marinus ATCC 50983]
Length = 200
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 53/92 (57%), Gaps = 4/92 (4%)
Query: 1 MDFRPSRMAVVAKQV-EAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALM 59
MD++ +R+ V + F+ ++FD NP+S + ++ ++D A+ +T + G P S ++ ++
Sbjct: 107 MDYKQNRLDFVVSHLCRNFIPQYFDLNPISNLSVLALRDQRAHFVTRMSGQPASQMERVL 166
Query: 60 --GKLGCSGDSSLQNALDLV-QGLLSQIPSYG 88
G SG +SL NAL+ V Q +P YG
Sbjct: 167 QFSTGGASGSASLVNALEAVGQAAKGSLPRYG 198
>gi|399949974|gb|AFP65630.1| hypothetical protein CMESO_485 [Chroomonas mesostigmatica CCMP1168]
Length = 334
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 75/175 (42%), Gaps = 17/175 (9%)
Query: 11 VAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--MGKLGCSGDS 68
+ K V + F NP Q+ ++ VK+ ++ L G+ + H + L + K G++
Sbjct: 28 IFKLVSKLIHYFLYSNPSYQLAIILVKNNSIEQISHLSGNKKHHNQGLYRIKKKQDLGEN 87
Query: 69 SLQNALDLVQGLLSQIPSYGHREVLIL----YSALSTCDPGDIMETIQKCKESKIRCSVI 124
SL+ A L + LLS +E++I Y + G ++KI+ SVI
Sbjct: 88 SLKTAFFLAKKLLSFNKKKSEQEIIIFMGGSYVSSVFHSFGSFFV------KNKIKFSVI 141
Query: 125 GLSAEMFICKHLCQDTGGSYSVALDESHFK-----ELIMEHAPPPPAIAEFAIAN 174
+ F+ + L + TGG Y V +S+F L M P FAI N
Sbjct: 142 MFYEKTFLFETLVKITGGFYIVLKKDSNFDLISCMRLNMFRKTQIPKKVYFAIGN 196
>gi|496307|gb|AAA35101.1| stem loop mutation suppressor, partial [Saccharomyces cerevisiae]
Length = 62
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 6/54 (11%)
Query: 265 CFGCQQSL-LSSGNKPGLYVA-----CPKCKKHFCLECDIYIHESLHNCPGCES 312
CF CQ + +K G + C CK+ FC++CD++IHE LHNCPGCES
Sbjct: 4 CFSCQSRFPILKNHKNGKLLTSSRYRCEDCKQEFCVDCDVFIHEILHNCPGCES 57
>gi|157870269|ref|XP_001683685.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68126751|emb|CAJ05184.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 446
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/188 (20%), Positives = 78/188 (41%), Gaps = 30/188 (15%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D+ P+ + + + FV + D PL+ +G+V ++DG+++ L + + L
Sbjct: 19 DYLPNYLLAMRPPLLRFVERYLDSTPLASLGVVVMRDGISHRLLPCTTNRNEIVDVLERD 78
Query: 62 L---GCSGDSSLQNALDLVQGLL-----------------SQIPSYGHR----------E 91
+ G SG++S++N L + L S + + R
Sbjct: 79 VFLHGGSGNTSMENGLRMAMSELVDMREVAALAAATAKKSSSVAARNPRAAWKGSATQLN 138
Query: 92 VLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDES 151
V++L ++++ DP D+ + IR SV+ L + + +TGG+ L+
Sbjct: 139 VIVLTASVTLIDPTDVFAVVNIMAVLSIRISVVSLVGAVHVFDVCTVETGGTLYCPLNYD 198
Query: 152 HFKELIME 159
H ++ E
Sbjct: 199 HLLHIMDE 206
>gi|312076198|ref|XP_003140754.1| hypothetical protein LOAG_05169 [Loa loa]
Length = 127
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 30/45 (66%)
Query: 5 PSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGG 49
PSR+AV K + F+ +F +QNP+SQ+G+V KD A CL L G
Sbjct: 83 PSRLAVTIKALNQFLDKFSEQNPISQVGIVVCKDKRAECLIPLTG 127
>gi|66361323|pdb|1Z60|A Chain A, Solution Structure Of The Carboxy-Terminal Domain Of Human
Tfiih P44 Subunit
Length = 59
Score = 45.8 bits (107), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 31/46 (67%), Gaps = 5/46 (10%)
Query: 265 CFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIHESLHNCPGC 310
C+GCQ L +YV C C+ FC++CD+++H+SLH+CPGC
Sbjct: 18 CYGCQGEL----KDQHVYV-CAVCQNVFCVDCDVFVHDSLHSCPGC 58
>gi|736405|gb|AAA64503.1| BTF2p44 [Homo sapiens]
Length = 69
Score = 44.7 bits (104), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 40/69 (57%), Gaps = 10/69 (14%)
Query: 242 PIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLLSSGNKPGLYVACPKCKKHFCLECDIYIH 301
P+ F E+ PL + N R C+GCQ L +YV C C+ FC++CD+++H
Sbjct: 1 PLDAFQEI-PL---EEYNGER-FCYGCQGELKDQH----VYV-CAXCQNVFCVDCDVFVH 50
Query: 302 ESLHNCPGC 310
+SLH CPGC
Sbjct: 51 DSLHCCPGC 59
>gi|392579844|gb|EIW72971.1| hypothetical protein TREMEDRAFT_70875 [Tremella mesenterica DSM
1558]
Length = 343
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/132 (24%), Positives = 57/132 (43%), Gaps = 8/132 (6%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKAL--- 58
D+ P+R A+ V D NP S +GL+T+ + L +P + I L
Sbjct: 20 DYPPTRFQAQAQAVSTVFTAKTDSNPESAVGLMTMAGKSPSLLV----TPTNDIGKLLSA 75
Query: 59 MGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIMETI-QKCKES 117
MGK GDS A+ + Q L + R+ +I++ DP + + + +K +++
Sbjct: 76 MGKASIGGDSDFSTAVQIAQLALKHRENKNQRQRVIVFVGSPVSDPQEALVKLGKKLRKN 135
Query: 118 KIRCSVIGLSAE 129
+ V+ E
Sbjct: 136 NVLVDVVTFGEE 147
>gi|156341095|ref|XP_001620650.1| hypothetical protein NEMVEDRAFT_v1g147480 [Nematostella
vectensis]
gi|156205835|gb|EDO28550.1| predicted protein [Nematostella vectensis]
Length = 93
Score = 43.1 bits (100), Expect = 0.17, Method: Composition-based stats.
Identities = 15/30 (50%), Positives = 24/30 (80%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQI 31
D +P+R++ AK +E F+ E+FDQNP+SQ+
Sbjct: 56 DLKPNRLSCSAKLLENFITEYFDQNPISQV 85
>gi|407926572|gb|EKG19539.1| Transcription factor Tfb4 [Macrophomina phaseolina MS6]
Length = 380
Score = 42.0 bits (97), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 54/132 (40%), Gaps = 12/132 (9%)
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVAL-DESHFKELIMEHAPPPP 165
+M I + +I ++ L+ + + C TGG Y L E + L+M P
Sbjct: 218 VMNAIFAAQRQRIPIDILKLAGDTVFLQQACDATGGIYLKPLAPEGLLQYLMMAFLPDET 277
Query: 166 AIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTD--CRIC 223
A + + ++ F RA C CH+ V V +GY C C + CE P D C C
Sbjct: 278 ARKHLVLPSAGEVDF--RAA------CFCHRRV-VDIGYVCSVCLSIFCEPPADQTCPTC 328
Query: 224 GLQLVSSPHLAR 235
L S + R
Sbjct: 329 TNPLSVSSSITR 340
>gi|426197233|gb|EKV47160.1| hypothetical protein AGABI2DRAFT_204058 [Agaricus bisporus var.
bisporus H97]
Length = 361
Score = 41.6 bits (96), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 52/121 (42%), Gaps = 15/121 (12%)
Query: 107 IMETI---QKCKESKIRCSVIGLSAEMFICKHLCQDTGGSY-SVALDESHFKELIMEHAP 162
IM ++ QK K + C V G + + TGGSY + ++ + LIM P
Sbjct: 162 IMNSVFSAQKLKVTIDVCQVFG--PDTVFLQQAAHLTGGSYIHLERRDALLQYLIMAFLP 219
Query: 163 PPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRI 222
PPP A+ K+ F RA C CHK + V +G+ C C + C+ C
Sbjct: 220 PPPIRKVLAVPTQDKVDF--RAA------CFCHKNI-VDIGFVCSVCLSIFCQPVPVCST 270
Query: 223 C 223
C
Sbjct: 271 C 271
>gi|89096888|ref|ZP_01169779.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp.
NRRL B-14911]
gi|89088268|gb|EAR67378.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp.
NRRL B-14911]
Length = 459
Score = 41.6 bits (96), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 57/123 (46%), Gaps = 7/123 (5%)
Query: 47 LGGSPESHIKALMGKLGCSGDSSLQNALDLVQGLLSQIPSYGHREVL-ILYSALSTCDPG 105
LG ++ M SG + L A+D + LLS S ++ L I+ + TCD G
Sbjct: 224 LGAYEKTAFNKSMDSFEASGWTPLAGAIDKARELLSAYNSTDYKNTLYIVSDGVETCD-G 282
Query: 106 DIMETIQKCKESKI--RCSVIGLSAE---MFICKHLCQDTGGSYSVALDESHFKELIMEH 160
D +E Q+ + S I + ++IG + K + + GG+Y+ D+ ++ +++
Sbjct: 283 DPVEAAQQLQGSNIEAKVNIIGFDVDDEGQKQLKEVAEAGGGTYATVRDKDELEDQVLKK 342
Query: 161 APP 163
P
Sbjct: 343 WKP 345
>gi|409080333|gb|EKM80693.1| hypothetical protein AGABI1DRAFT_119282 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 361
Score = 41.6 bits (96), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 52/121 (42%), Gaps = 15/121 (12%)
Query: 107 IMETI---QKCKESKIRCSVIGLSAEMFICKHLCQDTGGSY-SVALDESHFKELIMEHAP 162
IM ++ QK K + C V G + + TGGSY + ++ + LIM P
Sbjct: 162 IMNSVFSAQKLKVTIDVCQVFG--PDTVFLQQAAHLTGGSYIHLERRDALLQYLIMAFLP 219
Query: 163 PPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTDCRI 222
PPP A+ K+ F RA C CHK + V +G+ C C + C+ C
Sbjct: 220 PPPIRKVLAVPTQDKVDF--RAA------CFCHKNI-VDIGFVCSVCLSIFCQPVPVCST 270
Query: 223 C 223
C
Sbjct: 271 C 271
>gi|307354884|ref|YP_003895935.1| hypothetical protein Mpet_2754 [Methanoplanus petrolearius DSM
11571]
gi|307158117|gb|ADN37497.1| conserved hypothetical protein [Methanoplanus petrolearius DSM
11571]
Length = 316
Score = 41.6 bits (96), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 41/176 (23%), Positives = 75/176 (42%), Gaps = 28/176 (15%)
Query: 2 DFRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGK 61
D+ P+R+ E + D +P +G+V + G A+ + L +S I+ L
Sbjct: 104 DYSPNRLEATKSAAEELIN---DLDPKDYVGIVVFESG-ASTASYLSPDKDSVIENLENI 159
Query: 62 LGCSGDSSLQNALDLVQGLLSQIPSYGHREVLILYSALSTCDPGDIM--ETIQKCKESKI 119
+ G +++ + L L + IP+ ++V+IL S + G I E IQ K+S I
Sbjct: 160 MEKDGATAIGDGLSLGINMADSIPN--RKKVVILLSD-GVNNAGVISPDEAIQFAKDSDI 216
Query: 120 RCSVIGLSAEMFIC-------------------KHLCQDTGGSYSVALDESHFKEL 156
+ IG+ +E + K + +TGG Y ++D+ E+
Sbjct: 217 QVFTIGMGSEQPVVMGYDWFGNPQYAELDEATLKEIADETGGKYFKSVDDQTLNEI 272
>gi|449683803|ref|XP_002159513.2| PREDICTED: general transcription factor IIH subunit 3-like [Hydra
magnipapillata]
Length = 291
Score = 40.8 bits (94), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 49/215 (22%), Positives = 80/215 (37%), Gaps = 27/215 (12%)
Query: 37 KDGVANCLTDLGGSPESHIKALMGKLGCSGDSSLQNALDLVQGLLSQIPSYGH------- 89
KDG +++ + + K L + S++ N L+ G L++ Y H
Sbjct: 86 KDGKYELFSEMNETLQCEFKRLFAE----SVSNMTNRPSLLAGALTKALCYIHSHDRTAN 141
Query: 90 ------REVLILYSALSTCDPGDIMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGS 143
R ++I S+ S+ +M I + I L + TGG
Sbjct: 142 GRRTNARILIIKGSSDSSSQYMTVMNCIFAASKKNIVIDCCALQNNSGFMQQASDITGGV 201
Query: 144 YSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVG 203
Y D S E ++ P P + E K+ P + + C CHK++ V VG
Sbjct: 202 YFFIDDFSGMLEYLLWIFLPDPGLRE-------KLNLPTSSQIDYRAACFCHKQL-VDVG 253
Query: 204 YTCPRCKARVCELPTDCRICG--LQLVSSPHLARS 236
+ C C + C+ C C +L S P A+S
Sbjct: 254 FVCSVCLSIYCQFMPKCATCQTRFKLPSLPLNAKS 288
>gi|302496747|ref|XP_003010374.1| hypothetical protein ARB_03075 [Arthroderma benhamiae CBS 112371]
gi|291173917|gb|EFE29734.1| hypothetical protein ARB_03075 [Arthroderma benhamiae CBS 112371]
Length = 414
Score = 40.8 bits (94), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 12/124 (9%)
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESH-FKELIMEHAPPPP 165
IM +I C+ I + L+ + + C T G Y ++LD F + +M P
Sbjct: 251 IMNSIFACQRLHIPIDICKLAGDAVFLQQACDATRGIY-MSLDSPRGFLQYLMLAFLPD- 308
Query: 166 AIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRIC 223
+ A NLI P R + C CH++V V VG+ C C + CE P +C C
Sbjct: 309 ---QRARRNLI---LPTRVDVDFRAACFCHRKV-VNVGFVCSICLSIFCEPPEGANCLTC 361
Query: 224 GLQL 227
G L
Sbjct: 362 GTHL 365
>gi|242764012|ref|XP_002340688.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
stipitatus ATCC 10500]
gi|218723884|gb|EED23301.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
stipitatus ATCC 10500]
Length = 380
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 96/247 (38%), Gaps = 32/247 (12%)
Query: 3 FRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHI-KALMGK 61
+RP R VV +QV VRE S + + +A LT SHI + M
Sbjct: 121 YRPFR--VVEEQVLENVRELLASTDASDVSATSTM--LAGALT----LALSHINRRTMTW 172
Query: 62 LGCSGDSSLQNALDLVQGLLSQIP-----SYGHREVLILYSALSTCDPGD----IMETIQ 112
G+SS+ A + S G + +++ S S+ D IM +I
Sbjct: 173 TETHGNSSVDTANAATGSSSAAAVSGGNVSLGLQSRILIVSVSSSTDSAHQYIPIMNSIF 232
Query: 113 KCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESH--FKELIMEHAPPPPAIAEF 170
C+ I V LS + + T G Y ++L E + L+M P +
Sbjct: 233 ACQRLHIPIDVCKLSGDAVFLQQASDATRGVY-MSLTEPRGLLQYLMMAFLPDQRSRKHL 291
Query: 171 AIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPT--DCRICGLQLV 228
+ + + + F RA C CH+ V V +G+ C C + CE P DC CG L
Sbjct: 292 VLPSRVDVDF--RAA------CFCHRRV-VNIGFVCSICLSIFCEPPENGDCLTCGTHLE 342
Query: 229 SSPHLAR 235
+ AR
Sbjct: 343 IGDYGAR 349
>gi|327299656|ref|XP_003234521.1| transcription factor TFIIH subunit Tfb4 [Trichophyton rubrum CBS
118892]
gi|326463415|gb|EGD88868.1| transcription factor TFIIH subunit Tfb4 [Trichophyton rubrum CBS
118892]
Length = 413
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 12/124 (9%)
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESH-FKELIMEHAPPPP 165
IM +I C+ I + L+ + + C T G Y ++LD F + +M P
Sbjct: 250 IMNSIFACQRLHIPIDICKLAGDAVFLQQACDATRGIY-MSLDSPRGFLQYLMLAFLPD- 307
Query: 166 AIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRIC 223
+ A NLI P R + C CH++V V VG+ C C + CE P +C C
Sbjct: 308 ---QRARRNLI---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGANCLTC 360
Query: 224 GLQL 227
G L
Sbjct: 361 GTHL 364
>gi|212529242|ref|XP_002144778.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
marneffei ATCC 18224]
gi|210074176|gb|EEA28263.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
marneffei ATCC 18224]
Length = 380
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 61/246 (24%), Positives = 96/246 (39%), Gaps = 30/246 (12%)
Query: 3 FRPSRMAVVAKQVEAFVREFFDQNPLSQIGLVTVKDGVANCLTDLGGSPESHI-KALMGK 61
+RP R+ V +QV VRE S + + +A LT SHI + M
Sbjct: 121 YRPFRL--VEEQVMENVRELLASTNASDVTATSTM--MAGALT----LALSHINRRTMTW 172
Query: 62 LGCSGDSSLQNALDLVQGLLSQIP-----SYGHREVLILYSALSTCDPGD----IMETIQ 112
G+S+++ A + S G + +++ S S+ D IM +I
Sbjct: 173 TETHGNSNIETAGASASSSATAAVSGGNLSLGLQSRILIVSVSSSTDSAHQYIPIMNSIF 232
Query: 113 KCKESKIRCSVIGLSAEMFICKHLCQDTGGSY-SVALDESHFKELIMEHAPPPPAIAEFA 171
C+ I V LS + + T G Y S++ + L+M P +
Sbjct: 233 ACQRLHIPIDVCKLSGDAVFLQQASDATRGVYMSLSEPRGLLQYLMMAFLPDQRSRKHLI 292
Query: 172 IANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPT--DCRICGLQLVS 229
+ + + + F RA C CH+ V V +G+ C C + CE P DC CG L
Sbjct: 293 VPSRVDVDF--RAA------CFCHRRV-VNIGFVCSICLSIFCEPPENGDCLTCGTHLEI 343
Query: 230 SPHLAR 235
+ AR
Sbjct: 344 GDYSAR 349
>gi|326473589|gb|EGD97598.1| transcription factor TFIIH subunit Tfb4 [Trichophyton tonsurans CBS
112818]
Length = 415
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 12/124 (9%)
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESH-FKELIMEHAPPPP 165
IM +I C+ I + L+ + + C T G Y ++LD F + +M P
Sbjct: 252 IMNSIFACQRLHIPIDICKLAGDAVFLQQACDATRGIY-MSLDSPRGFLQYLMLAFLPD- 309
Query: 166 AIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRIC 223
+ A NLI P R + C CH++V V VG+ C C + CE P +C C
Sbjct: 310 ---QRARRNLI---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGANCLTC 362
Query: 224 GLQL 227
G L
Sbjct: 363 GTHL 366
>gi|326480715|gb|EGE04725.1| TFIIH basal transcription factor complex p34 subunit [Trichophyton
equinum CBS 127.97]
Length = 415
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 12/124 (9%)
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESH-FKELIMEHAPPPP 165
IM +I C+ I + L+ + + C T G Y ++LD F + +M P
Sbjct: 252 IMNSIFACQRLHIPIDICKLAGDAVFLQQACDATRGIY-MSLDSPRGFLQYLMLAFLPD- 309
Query: 166 AIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRIC 223
+ A NLI P R + C CH++V V VG+ C C + CE P +C C
Sbjct: 310 ---QRARRNLI---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGANCLTC 362
Query: 224 GLQL 227
G L
Sbjct: 363 GTHL 366
>gi|315052780|ref|XP_003175764.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
gypseum CBS 118893]
gi|311341079|gb|EFR00282.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
gypseum CBS 118893]
Length = 413
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 54/125 (43%), Gaps = 14/125 (11%)
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALD--ESHFKELIMEHAPPP 164
IM +I C+ I + L+ + + C T G Y ++LD + L++ P
Sbjct: 250 IMNSIFACQRLHIPIDICKLAGDAVFLQQACDATRGIY-MSLDSPRGFLQYLMLAFLPD- 307
Query: 165 PAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRI 222
+ A NLI P R + C CH++V V VG+ C C + CE P +C
Sbjct: 308 ----QRARRNLI---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGANCLT 359
Query: 223 CGLQL 227
CG L
Sbjct: 360 CGTHL 364
>gi|302656245|ref|XP_003019878.1| hypothetical protein TRV_06076 [Trichophyton verrucosum HKI 0517]
gi|291183651|gb|EFE39254.1| hypothetical protein TRV_06076 [Trichophyton verrucosum HKI 0517]
Length = 517
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 54/126 (42%), Gaps = 14/126 (11%)
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESH--FKELIMEHAPPP 164
IM +I C+ I + L+ + + C T G Y ++LD + L++ P
Sbjct: 354 IMNSIFACQRLHIPIDICKLAGDAVFLQQACDATRGIY-MSLDSPRGFLQYLMLAFLP-- 410
Query: 165 PAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRI 222
+ A NLI P R + C CH++V V VG+ C C + CE P +C
Sbjct: 411 ---DQRARRNLI---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGANCLT 463
Query: 223 CGLQLV 228
CG L
Sbjct: 464 CGTHLA 469
>gi|296815720|ref|XP_002848197.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
otae CBS 113480]
gi|238841222|gb|EEQ30884.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
otae CBS 113480]
Length = 413
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 49/123 (39%), Gaps = 10/123 (8%)
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA 166
IM +I C+ I + L+ + + C T G Y F + +M P
Sbjct: 250 IMNSIFACQRLHIPIDICKLAGDAVFLQQACDATRGIYMSVDSPRGFLQYLMLAFLPDQR 309
Query: 167 IAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRICG 224
+ NL+ P R + C CH++V V VG+ C C + CE P +C CG
Sbjct: 310 VRR----NLV---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGANCLTCG 361
Query: 225 LQL 227
L
Sbjct: 362 THL 364
>gi|162448865|ref|YP_001611232.1| hypothetical protein sce0595 [Sorangium cellulosum So ce56]
gi|161159447|emb|CAN90752.1| hypothetical protein sce0595 [Sorangium cellulosum So ce56]
Length = 656
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 29/142 (20%), Positives = 57/142 (40%), Gaps = 7/142 (4%)
Query: 26 NPLSQIGLVTVKDGVANCLTDLGGSPESHIKALMGKLGCSGDSSLQNALDLVQGLLSQIP 85
P + L T V L G + I A + L G +++ + +DL L +
Sbjct: 332 KPGDTVALCTYAGSVREVLAPTGIESKGKILAALADLTAGGSTAMSSGIDLAYSLAERTL 391
Query: 86 SYGH-REVLILYSALSTCDP---GDIMETIQKCKESKIRCSVIGLSAEMF---ICKHLCQ 138
GH V++L + P +I++TI++ ++ I S +G + + + L
Sbjct: 392 VKGHVNRVIVLSDGDANVGPTSHDEILKTIKRARDKGITLSTVGFGQGNYKDLMMEQLAN 451
Query: 139 DTGGSYSVALDESHFKELIMEH 160
G+Y+ E+ + + E
Sbjct: 452 QGDGNYAYIDSEAQARRVFSEQ 473
>gi|301122235|ref|XP_002908844.1| general transcription factor IIH subunit, putative [Phytophthora
infestans T30-4]
gi|262099606|gb|EEY57658.1| general transcription factor IIH subunit, putative [Phytophthora
infestans T30-4]
Length = 293
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 23/87 (26%), Positives = 38/87 (43%), Gaps = 8/87 (9%)
Query: 140 TGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVK 199
TGG Y D S + ++ P P++ + + P + ++C CH+EV
Sbjct: 197 TGGIYYKPNDHSGLLQYLISIYLPDPSMRKL-------LKLPSQDSVDFRAMCFCHREV- 248
Query: 200 VGVGYTCPRCKARVCELPTDCRICGLQ 226
+ Y CP C + CE C CG++
Sbjct: 249 ISTAYVCPVCLSLFCEFRPICSTCGIR 275
>gi|258567552|ref|XP_002584520.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237905966|gb|EEP80367.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 398
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 35/125 (28%), Positives = 53/125 (42%), Gaps = 14/125 (11%)
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDE--SHFKELIMEHAPPP 164
IM +I C+ I + LS + + C T G Y V +D + L++ P
Sbjct: 235 IMNSIFACQRLHIPIDICKLSGDAVFLQQACDATRGIY-VPVDHPLGFLQYLMVAFLPDQ 293
Query: 165 PAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRI 222
+ + + + + F RA C CH++V V VG+ C C + CE P DC
Sbjct: 294 RSRSHLILPTRVDVDF--RAA------CFCHRKV-VDVGFVCSICLSIFCEPPEGADCLT 344
Query: 223 CGLQL 227
CG L
Sbjct: 345 CGTHL 349
>gi|348676264|gb|EGZ16082.1| hypothetical protein PHYSODRAFT_508027 [Phytophthora sojae]
Length = 298
Score = 38.9 bits (89), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 23/87 (26%), Positives = 38/87 (43%), Gaps = 8/87 (9%)
Query: 140 TGGSYSVALDESHFKELIMEHAPPPPAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVK 199
TGG Y D S + ++ P P++ + + P + ++C CH+EV
Sbjct: 202 TGGIYYKPNDHSGLLQYLISIYLPDPSMRKL-------LKLPSQDSVDFRAMCFCHREV- 253
Query: 200 VGVGYTCPRCKARVCELPTDCRICGLQ 226
+ Y CP C + CE C CG++
Sbjct: 254 ISTAYVCPVCLSLFCEFRPICSTCGIR 280
>gi|170596989|ref|XP_001902970.1| Zinc finger, C2H2 type family protein [Brugia malayi]
gi|158589013|gb|EDP28181.1| Zinc finger, C2H2 type family protein [Brugia malayi]
Length = 179
Score = 38.9 bits (89), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 46/100 (46%), Gaps = 18/100 (18%)
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLL--------SSGNKPGLYVACPKCKKH 291
L PI EV L + R R + C C++ ++ +G KP +C CKK+
Sbjct: 52 LVPIEKLMEVVKLSKQERRTRKKQRCEICRKEVIYMKSHMTTHTGEKP---YSCSICKKN 108
Query: 292 FCLECDIYIH------ESLHNCPGC-ESLRHSNPIVANEG 324
F D+ H E LH+CP C +S H + ++++ G
Sbjct: 109 FTQFGDMKKHMMIHTGEKLHSCPICRKSFTHKHHLLSHMG 148
>gi|429729565|ref|ZP_19264224.1| von Willebrand factor type A domain protein [Corynebacterium durum
F0235]
gi|429149589|gb|EKX92567.1| von Willebrand factor type A domain protein [Corynebacterium durum
F0235]
Length = 747
Score = 38.9 bits (89), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 64/148 (43%), Gaps = 20/148 (13%)
Query: 30 QIGLVTVKDGVAN-------------CLTDLGGSPESHIKALMGKLGCSGDSSLQNALDL 76
Q+ L+T G N L LGGS + I A + L SG + + AL
Sbjct: 58 QLALLTYGTGTGNSDEEKEAGCKDVQTLVPLGGSRD-EIAAQIDSLVASGYTPIGPALLA 116
Query: 77 VQGLL-SQIPSYGHREVLILYSALSTCDPGDIMETIQKC--KESKIRCSVIGLSAEMFI- 132
+ L ++ R ++++ + TC P ++ E + + S + V+GL+A+ +
Sbjct: 117 AEKTLDTKKTEKNKRHIVLVSDGIDTCAPPEMAEVARDIHKRNSNVTIDVVGLNADETVR 176
Query: 133 --CKHLCQDTGGSYSVALDESHFKELIM 158
K L GG+Y+ A DE+ L+
Sbjct: 177 EQLKELAAAGGGTYADATDEASVSTLVT 204
>gi|303314233|ref|XP_003067125.1| Transcription factor Tfb4 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240106793|gb|EER24980.1| Transcription factor Tfb4 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320037386|gb|EFW19323.1| transcription factor TFIIH subunit Tfb4 [Coccidioides posadasii
str. Silveira]
Length = 395
Score = 38.5 bits (88), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 50/123 (40%), Gaps = 10/123 (8%)
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA 166
IM +I C+ I + LS + + C T G Y F + +M P
Sbjct: 232 IMNSIFACQRLHIPIDICKLSGDAVFLQQACDATRGIYVPVDHPRGFLQYLMVAFLP--- 288
Query: 167 IAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRICG 224
+ + +LI P R + C CH++V V VG+ C C + CE P DC CG
Sbjct: 289 -DQRSRRHLI---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGADCLTCG 343
Query: 225 LQL 227
L
Sbjct: 344 THL 346
>gi|119174356|ref|XP_001239539.1| hypothetical protein CIMG_09160 [Coccidioides immitis RS]
gi|392869735|gb|EAS28255.2| transcription factor tfb4 [Coccidioides immitis RS]
Length = 396
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 50/123 (40%), Gaps = 10/123 (8%)
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESHFKELIMEHAPPPPA 166
IM +I C+ I + LS + + C T G Y F + +M P
Sbjct: 233 IMNSIFACQRLHIPIDICKLSGDAVFLQQACDATRGIYVPVDHPRGFLQYLMVAFLP--- 289
Query: 167 IAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRICG 224
+ + +LI P R + C CH++V V VG+ C C + CE P DC CG
Sbjct: 290 -DQRSRRHLI---LPTRVDVDFRAACFCHRKV-VDVGFVCSICLSIFCEPPEGADCLTCG 344
Query: 225 LQL 227
L
Sbjct: 345 THL 347
>gi|429730769|ref|ZP_19265415.1| von Willebrand factor type A domain protein [Corynebacterium durum
F0235]
gi|429147207|gb|EKX90237.1| von Willebrand factor type A domain protein [Corynebacterium durum
F0235]
Length = 661
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 21/81 (25%), Positives = 44/81 (54%), Gaps = 5/81 (6%)
Query: 83 QIPSYGHREVLILYSALSTCDPGDIMETIQKCKESKIRC--SVIGLSAEMFICKHL---C 137
++PS G R+++++ + TC P + E ++ ++ + +VIGL+ + L
Sbjct: 148 ELPSQGARQIVLVSDGIDTCAPPPVCEVAKQIRQRGVDVVINVIGLNVDDQARSELQCVA 207
Query: 138 QDTGGSYSVALDESHFKELIM 158
++ GGSY+ A D + KE ++
Sbjct: 208 KEGGGSYADAKDAASLKEQLV 228
>gi|170585154|ref|XP_001897351.1| Zinc finger, C2H2 type family protein [Brugia malayi]
gi|158595226|gb|EDP33795.1| Zinc finger, C2H2 type family protein [Brugia malayi]
Length = 764
Score = 38.5 bits (88), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 46/100 (46%), Gaps = 18/100 (18%)
Query: 240 LFPIAPFDEVTPLCLNDPRNRSRSTCFGCQQSLL--------SSGNKPGLYVACPKCKKH 291
L PI EV L + R R + C C++ ++ +G KP +C CKK+
Sbjct: 310 LVPIEKLMEVVKLSKQERRTRKKQRCEICRKEVIYMKSHMTTHTGEKP---YSCSICKKN 366
Query: 292 FCLECDIYIH------ESLHNCPGC-ESLRHSNPIVANEG 324
F D+ H E LH+CP C +S H + ++++ G
Sbjct: 367 FTQFGDMKKHMMIHTGEKLHSCPICRKSFTHKHHLLSHMG 406
>gi|255953875|ref|XP_002567690.1| Pc21g06450 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211589401|emb|CAP95542.1| Pc21g06450 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 374
Score = 37.7 bits (86), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 51/125 (40%), Gaps = 14/125 (11%)
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSYSVALDESH--FKELIMEHAPPP 164
+M +I C+ I V LS + + T G Y +AL E + L+M P
Sbjct: 222 VMNSIFACQRLSIPIDVCKLSGDAVFLQQASDATKGVY-MALAEPRGLLQYLMMAFLPDQ 280
Query: 165 PAIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELP--TDCRI 222
+ + + + F RA C CH+ V V VG+ C C + CE P DC
Sbjct: 281 RSRRHLVLPTRVDVDF--RAA------CFCHRRV-VDVGFVCSICLSIFCEPPPGGDCMT 331
Query: 223 CGLQL 227
CG L
Sbjct: 332 CGSHL 336
>gi|121713686|ref|XP_001274454.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
clavatus NRRL 1]
gi|119402607|gb|EAW13028.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
clavatus NRRL 1]
Length = 386
Score = 37.4 bits (85), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 53/132 (40%), Gaps = 12/132 (9%)
Query: 107 IMETIQKCKESKIRCSVIGLSAEMFICKHLCQDTGGSY-SVALDESHFKELIMEHAPPPP 165
IM I C+ + V LS + + T G Y S+A + L+M P
Sbjct: 232 IMNAIFACQRLHVPIDVCKLSGDAVFLQQASDATKGVYMSLAEPRGLLQYLMMAFLPDQR 291
Query: 166 AIAEFAIANLIKMGFPQRAGEGSISICSCHKEVKVGVGYTCPRCKARVCELPTD--CRIC 223
+ + + + F RA C CH+ V V +G+ C C + CE P + C C
Sbjct: 292 SRRHLVLPTRVDVDF--RAA------CFCHRRV-VDIGFVCSICLSIFCEPPENGGCLTC 342
Query: 224 GLQLVSSPHLAR 235
G L + + AR
Sbjct: 343 GTHLETGDYGAR 354
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.138 0.436
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,177,347,693
Number of Sequences: 23463169
Number of extensions: 209273081
Number of successful extensions: 522464
Number of sequences better than 100.0: 540
Number of HSP's better than 100.0 without gapping: 420
Number of HSP's successfully gapped in prelim test: 120
Number of HSP's that attempted gapping in prelim test: 520237
Number of HSP's gapped (non-prelim): 899
length of query: 324
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 182
effective length of database: 9,027,425,369
effective search space: 1642991417158
effective search space used: 1642991417158
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 77 (34.3 bits)