BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 022374
         (298 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225459534|ref|XP_002284465.1| PREDICTED: general transcription factor IIH subunit 3 [Vitis
           vinifera]
 gi|302141830|emb|CBI19033.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  466 bits (1200), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 236/298 (79%), Positives = 263/298 (88%), Gaps = 3/298 (1%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           MA  PSKLYSDDVSL+VVLLDTNPFFWS++SL FS+FL+HVLAFLN+IL +NQLNQVVVI
Sbjct: 1   MAPVPSKLYSDDVSLLVVLLDTNPFFWSTASLPFSKFLSHVLAFLNSILLINQLNQVVVI 60

Query: 61  ATGYNSCDYVYDSSST-GNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACS 119
           ATG NSC++++DSSS   N ++ NGRMP+LC+ LLQ LEEF+  DE+L K+     I  S
Sbjct: 61  ATGCNSCNFIFDSSSVPANPNLENGRMPALCSNLLQKLEEFVTGDEKLSKEVLAAGIGSS 120

Query: 120 LLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 179
           LLSGSLSMALCYIQRVFR+G LHPQPRILCLQGSPDGPEQYVA+MNAIFSAQRSMVPIDS
Sbjct: 121 LLSGSLSMALCYIQRVFRTGPLHPQPRILCLQGSPDGPEQYVAVMNAIFSAQRSMVPIDS 180

Query: 180 CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFR 239
           C +GAQ+SAFLQQASYITGGV+ KPQQLDGLFQYL T+F TDLHSR FLQLPKP GVDFR
Sbjct: 181 CVIGAQHSAFLQQASYITGGVYLKPQQLDGLFQYLSTVFATDLHSRRFLQLPKPAGVDFR 240

Query: 240 ASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRKTTD 297
           ASCFCHKNTIDMGYICSVCLSI+CKH KKCSTCGSVFGQA  QSD  SAT+RKRKT +
Sbjct: 241 ASCFCHKNTIDMGYICSVCLSIFCKHHKKCSTCGSVFGQA--QSDGNSATDRKRKTPE 296


>gi|449470273|ref|XP_004152842.1| PREDICTED: general transcription factor IIH subunit 3-like [Cucumis
           sativus]
          Length = 295

 Score =  462 bits (1189), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 229/297 (77%), Positives = 259/297 (87%), Gaps = 2/297 (0%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           MASAPSKLY+DDVSL+VVLLDTNPFFWS+S+L FS+FL+HVLAFLN+IL LNQLN+VVVI
Sbjct: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60

Query: 61  ATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSL 120
            TGY SC Y+Y+SSS  N  + +GRMP+LC  LL+NLEEF+  DEQ  K++P+G    SL
Sbjct: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120

Query: 121 LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180
           LSGSLSMALCYIQ+VFRSG LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC
Sbjct: 121 LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180

Query: 181 YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 240
           Y+G+ NSAFLQQASYITGGV+ KPQQ+DGLFQYL T+FGTDLHSR FLQLPK VGVDFRA
Sbjct: 181 YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRA 240

Query: 241 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRKTTD 297
           SCFCHK TIDMGY+CSVCLSI+CKH KKCSTCGSVFG+   + D  S +  KRKT +
Sbjct: 241 SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVELD--SVSKLKRKTPE 295


>gi|18394597|ref|NP_564050.1| transcription initiation factor TFIIH subunit H3 [Arabidopsis
           thaliana]
 gi|21537277|gb|AAM61618.1| unknown [Arabidopsis thaliana]
 gi|92856638|gb|ABE77412.1| At1g18340 [Arabidopsis thaliana]
 gi|332191584|gb|AEE29705.1| transcription initiation factor TFIIH subunit H3 [Arabidopsis
           thaliana]
          Length = 301

 Score =  443 bits (1140), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 227/299 (75%), Positives = 259/299 (86%), Gaps = 7/299 (2%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           M +  SK YSDDVSL+V+LLDTNP FWS++S++FSQFL+HVLAFLNA+L LNQLNQVVVI
Sbjct: 1   MPAIASKQYSDDVSLLVLLLDTNPLFWSTTSITFSQFLSHVLAFLNAVLGLNQLNQVVVI 60

Query: 61  ATGYNSCDYVYDSSSTGN----QSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQE-PEGR 115
           ATGY+SCDY+YDSS T N    +S G G MP++  +LL+ LEEF+ KDE+L K+E  E R
Sbjct: 61  ATGYSSCDYIYDSSLTSNHGNFESNGTG-MPAIFGSLLKKLEEFVTKDEELSKEEVSEDR 119

Query: 116 IACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175
           I   LLSGSLSMALCYIQRVFRSG LHPQPRILCLQGSPDGPEQYVA+MN+IFSAQR MV
Sbjct: 120 IPSCLLSGSLSMALCYIQRVFRSGHLHPQPRILCLQGSPDGPEQYVAVMNSIFSAQRLMV 179

Query: 176 PIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVG 235
           PIDSCY+G QNSAFLQQASYITGGVHH P+QLDGLFQYL TIF TDLHSR F+QLPKP+G
Sbjct: 180 PIDSCYIGVQNSAFLQQASYITGGVHHTPKQLDGLFQYLTTIFATDLHSRGFVQLPKPIG 239

Query: 236 VDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 294
           VDFRASCFCHK TIDMGYICSVCLSI+C+H KKCSTCGSVFGQ++   D  SA+++KRK
Sbjct: 240 VDFRASCFCHKKTIDMGYICSVCLSIFCEHHKKCSTCGSVFGQSKLD-DASSASDKKRK 297


>gi|297850232|ref|XP_002892997.1| hypothetical protein ARALYDRAFT_472054 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338839|gb|EFH69256.1| hypothetical protein ARALYDRAFT_472054 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 301

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 226/299 (75%), Positives = 259/299 (86%), Gaps = 7/299 (2%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           M S  SK YSDDVSL+V+LLDTNP FWS++S++FSQFL+HVLAFLNA+L LNQLNQVVVI
Sbjct: 1   MPSVVSKQYSDDVSLLVLLLDTNPLFWSTTSITFSQFLSHVLAFLNAVLGLNQLNQVVVI 60

Query: 61  ATGYNSCDYVYDSSSTGN----QSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQE-PEGR 115
           ATGY+SCDY+YDSS T N    +S G G MP+L  +LL+ LE+F+ KDE+L ++E  E R
Sbjct: 61  ATGYSSCDYIYDSSLTSNHGNLESNGTG-MPALFGSLLKKLEDFVTKDEELSREEVSEDR 119

Query: 116 IACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175
           I   LLSGSLSMALCYIQRVFRSG LHPQPRILCLQGSPDGPEQYVA+MN+IFSAQR MV
Sbjct: 120 IPSCLLSGSLSMALCYIQRVFRSGHLHPQPRILCLQGSPDGPEQYVAVMNSIFSAQRLMV 179

Query: 176 PIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVG 235
           PIDSCY+G QNSAFLQQASYITGGVHH P+QLDGLFQYL TIF TDLHSR+F+QLPKP+G
Sbjct: 180 PIDSCYIGVQNSAFLQQASYITGGVHHTPKQLDGLFQYLTTIFATDLHSRSFVQLPKPIG 239

Query: 236 VDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 294
           VDFRASCFCHK TIDMGYICSVCLSI+C+H KKCSTCGSVFGQ++   D  S +++KRK
Sbjct: 240 VDFRASCFCHKKTIDMGYICSVCLSIFCEHHKKCSTCGSVFGQSKLD-DASSVSDKKRK 297


>gi|224084866|ref|XP_002307429.1| predicted protein [Populus trichocarpa]
 gi|222856878|gb|EEE94425.1| predicted protein [Populus trichocarpa]
          Length = 289

 Score =  430 bits (1106), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 228/290 (78%), Positives = 254/290 (87%), Gaps = 4/290 (1%)

Query: 10  SDDVSLVVVLLDTNPFFWSS-SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           +DDVSLVVVLLDTNPFFW+  SSLSFSQFL+HVLAF+N+IL LNQLNQVVVIA+GYN+CD
Sbjct: 1   ADDVSLVVVLLDTNPFFWTPPSSLSFSQFLSHVLAFVNSILLLNQLNQVVVIASGYNTCD 60

Query: 69  YVYDSSSTGNQ-SVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSM 127
           Y+YDSSS  +Q    +GRMPSL + LLQ LEEFM KDE+LGK++ +  I  SLLSGSLSM
Sbjct: 61  YIYDSSSDASQLGSEDGRMPSLYSNLLQKLEEFMIKDEKLGKEQSQRAIKSSLLSGSLSM 120

Query: 128 ALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNS 187
           ALCYIQRVFRSG LHPQPRILCLQGSPDGPEQYVA+MNAIFSAQRSMVPIDSCY+GA NS
Sbjct: 121 ALCYIQRVFRSGPLHPQPRILCLQGSPDGPEQYVAVMNAIFSAQRSMVPIDSCYVGAHNS 180

Query: 188 AFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKN 247
           AFLQQASYITGGV+ KPQ LDGLFQYL T+F TDLHSR+F+QLP+P GVDFRASCFCHK 
Sbjct: 181 AFLQQASYITGGVYVKPQHLDGLFQYLTTVFATDLHSRSFIQLPRPAGVDFRASCFCHKT 240

Query: 248 TIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRKTTD 297
           TIDMGYICSVCLSI+C H KKCSTCGSVFGQA  QSD  S ++ KRK  +
Sbjct: 241 TIDMGYICSVCLSIFCNHHKKCSTCGSVFGQA--QSDTSSTSDLKRKAPE 288


>gi|255584434|ref|XP_002532948.1| RNA polymerase II transcription factor B subunit, putative [Ricinus
           communis]
 gi|223527277|gb|EEF29432.1| RNA polymerase II transcription factor B subunit, putative [Ricinus
           communis]
          Length = 266

 Score =  416 bits (1068), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 205/257 (79%), Positives = 230/257 (89%), Gaps = 1/257 (0%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           MAS PSKLYSDDVSLV+VLLDTNPFFW++SSL FSQFL+HVLAFLN+IL LNQLNQVVVI
Sbjct: 1   MASVPSKLYSDDVSLVMVLLDTNPFFWTTSSLPFSQFLSHVLAFLNSILLLNQLNQVVVI 60

Query: 61  ATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGK-QEPEGRIACS 119
           A GYNSCDY+YDSS   N S   GR+P+L + LLQ LE+FM +DE+LGK +E + +IA S
Sbjct: 61  ANGYNSCDYIYDSSLVMNHSSDEGRIPALYSDLLQKLEDFMIRDEKLGKGEESKRKIASS 120

Query: 120 LLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 179
           LLSGSLSMALCYIQR+FRSG LHPQPRILCLQGSPDGPEQYVA+MN+IFSAQRSMVPIDS
Sbjct: 121 LLSGSLSMALCYIQRIFRSGPLHPQPRILCLQGSPDGPEQYVAVMNSIFSAQRSMVPIDS 180

Query: 180 CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFR 239
           CY+G  NSAFLQQASYITGGV+ KPQ LDGLFQYL+T+F TDLHSR+FLQLP+P GVDFR
Sbjct: 181 CYVGGHNSAFLQQASYITGGVYVKPQHLDGLFQYLVTVFATDLHSRSFLQLPRPAGVDFR 240

Query: 240 ASCFCHKNTIDMGYICS 256
           ASCFCHK TIDMG+ICS
Sbjct: 241 ASCFCHKTTIDMGFICS 257


>gi|357461529|ref|XP_003601046.1| General transcription factor IIH subunit [Medicago truncatula]
 gi|355490094|gb|AES71297.1| General transcription factor IIH subunit [Medicago truncatula]
          Length = 340

 Score =  412 bits (1060), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 210/337 (62%), Positives = 243/337 (72%), Gaps = 47/337 (13%)

Query: 3   SAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIAT 62
           SAPSK Y+DDVSL+VV LDTNPFFWSS    FS+FL+ VLAFLN+IL L QLNQVVVIAT
Sbjct: 4   SAPSKNYADDVSLLVVTLDTNPFFWSSFPFHFSEFLSQVLAFLNSILLLGQLNQVVVIAT 63

Query: 63  GYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGK-QEPEGRIACSLL 121
           G NSC YVYDS+S  N +  NG MP++ + LL NL+EF+ KD+QL       G I  SLL
Sbjct: 64  GCNSCSYVYDSASDKNHASTNGTMPAIYSNLLHNLDEFVAKDQQLTTPHHKPGTIPSSLL 123

Query: 122 SGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCY 181
           SG+LSMALCYIQR FRSG +HPQPRILCLQGS DGPEQYVAIMNAIFSAQRS+VP+DSC+
Sbjct: 124 SGALSMALCYIQRAFRSGPMHPQPRILCLQGSTDGPEQYVAIMNAIFSAQRSVVPVDSCF 183

Query: 182 LGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRAS 241
           +G+ NSAFLQQASYITGG++++P Q++GLFQYL T+F TDLHSR FL+LPK +GVDFRAS
Sbjct: 184 IGSNNSAFLQQASYITGGIYYRPPQMEGLFQYLSTVFATDLHSREFLRLPKSLGVDFRAS 243

Query: 242 CFCHKNTIDMGYICSVCLSIYCKHLKKCSTC----------------------------- 272
           CFCHK TIDMGY+CSVCLSI+C+H  KCSTC                             
Sbjct: 244 CFCHKQTIDMGYVCSVCLSIFCEHHDKCSTCSFTIYDASNECKCSLFADCSNQFNLAEIF 303

Query: 273 ---------------GSVFGQAQTQSDEPSATNRKRK 294
                           SVFG  QTQS+  S  NRKRK
Sbjct: 304 PLSAVFFNFFYSFIISSVFG--QTQSEAASEENRKRK 338


>gi|356509424|ref|XP_003523449.1| PREDICTED: general transcription factor IIH subunit 3-like [Glycine
           max]
          Length = 294

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 209/296 (70%), Positives = 239/296 (80%), Gaps = 5/296 (1%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           M SAPSKLY+DDVSL+VV LDTNPFFWS+    F++FL+ VLAFLN IL L QLNQV+VI
Sbjct: 1   MPSAPSKLYADDVSLLVVTLDTNPFFWSTFPFPFAEFLSQVLAFLNTILLLGQLNQVIVI 60

Query: 61  ATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQ--EPEGRIAC 118
           ATG NSC YVYDS+S  N     G MP+L + LL NL+EF+ +D QL        G +  
Sbjct: 61  ATGCNSCGYVYDSTSDKNHGSTTGTMPALYSNLLHNLDEFLARDRQLDADASHAPGIVPS 120

Query: 119 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 178
           SLLSGSLSMALCYIQR FRSG + PQPRILCLQG+ DGPEQYVAIMNAIFSAQ S VPID
Sbjct: 121 SLLSGSLSMALCYIQRAFRSGPMPPQPRILCLQGAADGPEQYVAIMNAIFSAQHSTVPID 180

Query: 179 SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 238
           SCY+G+ NSAFLQQASYITGG+++KP QLDGL+QYL T+F TDLHSR FL+LPK VGVDF
Sbjct: 181 SCYIGSNNSAFLQQASYITGGIYYKPPQLDGLYQYLSTVFATDLHSRAFLRLPKSVGVDF 240

Query: 239 RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 294
           RASCFCHK TIDMGY+CSVCLSI+C+H  KCSTCGSVFGQAQ    + SA +R+RK
Sbjct: 241 RASCFCHKQTIDMGYVCSVCLSIFCEHHDKCSTCGSVFGQAQL---DASAADRQRK 293


>gi|363806998|ref|NP_001242062.1| uncharacterized protein LOC100776751 [Glycine max]
 gi|255647869|gb|ACU24393.1| unknown [Glycine max]
          Length = 294

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 205/296 (69%), Positives = 238/296 (80%), Gaps = 5/296 (1%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           M SA SKLY+DDVSL+VV LDTNPFFWS+    F++FL+ VLAFLN+IL L QLNQV+VI
Sbjct: 1   MPSASSKLYADDVSLLVVTLDTNPFFWSTFPFPFAEFLSQVLAFLNSILLLGQLNQVIVI 60

Query: 61  ATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPE--GRIAC 118
           ATG NSC Y+YDS+S  N     G MP+L + LL NL+EF+ +D QL        G +  
Sbjct: 61  ATGCNSCGYIYDSTSDKNHGSTTGTMPALYSNLLHNLDEFLARDRQLDADAAHAPGTVPS 120

Query: 119 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 178
           SLLSGSLSMALCYIQR FRSG + PQPRILCL+G  DGPEQYVAIMNAIFSAQ S VPID
Sbjct: 121 SLLSGSLSMALCYIQRAFRSGPMPPQPRILCLRGVADGPEQYVAIMNAIFSAQHSTVPID 180

Query: 179 SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 238
           SCY+G+ NSAFLQQASYITGG+++KP QLDGL+QYL T+F TDL+SR FL+LPK VGVDF
Sbjct: 181 SCYIGSNNSAFLQQASYITGGIYYKPPQLDGLYQYLSTVFATDLYSRAFLRLPKSVGVDF 240

Query: 239 RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 294
           RASCFCHK TIDMGY+CSVCLSI+C+H  KCSTCGSVFGQAQ    + SA +R+RK
Sbjct: 241 RASCFCHKQTIDMGYVCSVCLSIFCEHHDKCSTCGSVFGQAQL---DASAADRQRK 293


>gi|294464556|gb|ADE77788.1| unknown [Picea sitchensis]
          Length = 306

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 193/305 (63%), Positives = 232/305 (76%), Gaps = 18/305 (5%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSS-------SSLSFSQFLTHVLAFLNAILTLNQ 53
           MASA S    DDVSLVVVLLDTNPFFW+S       SSL+FSQFL HVLAF N++L L+Q
Sbjct: 6   MASAGSNPPQDDVSLVVVLLDTNPFFWASATSSGQASSLTFSQFLEHVLAFFNSLLLLHQ 65

Query: 54  LNQVVVIATGYNSCDYVYDSSSTGNQ------SVGNGRMPSLCATLLQNLEEFMNKDEQL 107
           +NQ+VVI TG NSC+++YDSS+T ++      +  +    +    ++Q LE+F+ +D+ L
Sbjct: 66  MNQLVVIGTGTNSCEFIYDSSTTNSEYKKSKDTKCSQEFLTPSTIIVQQLEDFVRRDQAL 125

Query: 108 GKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAI 167
                  +   SLLSGSLSMALCYIQR+ R G  HPQPRILCLQGSPDGP+QYVAIMNAI
Sbjct: 126 NTNNNATKNIHSLLSGSLSMALCYIQRIIRGGPSHPQPRILCLQGSPDGPQQYVAIMNAI 185

Query: 168 FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 227
           FSAQRSMVP+DSC +G+Q+SAFLQQAS+ITGG++ KPQQL+GLFQ+L  +F TDLHSR F
Sbjct: 186 FSAQRSMVPVDSCIVGSQHSAFLQQASHITGGIYLKPQQLEGLFQFLAMVFATDLHSRRF 245

Query: 228 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPS 287
           LQLP+P GVDFRASCFCHK TIDMGY+CSVCLSI+C+H KKCSTC S F Q       PS
Sbjct: 246 LQLPRPTGVDFRASCFCHKKTIDMGYVCSVCLSIFCEHSKKCSTCESTFNQ-----KPPS 300

Query: 288 ATNRK 292
              RK
Sbjct: 301 GQKRK 305


>gi|357146071|ref|XP_003573866.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Brachypodium distachyon]
          Length = 290

 Score =  368 bits (944), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 187/283 (66%), Positives = 223/283 (78%), Gaps = 10/283 (3%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           M SA SKLYSDDVSLVVV++DTNPFFW+ ++L F+ F  H++ F+N++L LN LNQVVVI
Sbjct: 1   MTSAHSKLYSDDVSLVVVIVDTNPFFWAGAALPFADFFGHLIHFVNSLLLLNHLNQVVVI 60

Query: 61  ATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQ---NLEEFMNKD--EQLGKQEPEGR 115
           A G +SC YV+DS   GN +      P + AT  +    +EEF+ +D  +  G     G 
Sbjct: 61  AAGVSSCAYVFDS---GNAAASGS--PDVAATFAKASRKVEEFVKQDARDTAGNGTVAGG 115

Query: 116 IACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175
            A SL SG+LS+ALCYIQRVFRSG  HPQPRILCLQGSPDGPEQYVA+MN+IFSAQRSMV
Sbjct: 116 GAASLFSGALSLALCYIQRVFRSGTRHPQPRILCLQGSPDGPEQYVAVMNSIFSAQRSMV 175

Query: 176 PIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVG 235
           PIDSC +G Q+SAFLQQASYITGGV+ KPQ+L GLFQYL  +F +DLHSR FL+LPK +G
Sbjct: 176 PIDSCIVGTQDSAFLQQASYITGGVYLKPQELSGLFQYLAAVFASDLHSRAFLRLPKTLG 235

Query: 236 VDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQ 278
           VDFRASCFCHK TIDMGY+CSVCLSI+CK+ KKCSTCGS F +
Sbjct: 236 VDFRASCFCHKKTIDMGYVCSVCLSIFCKYHKKCSTCGSEFSR 278


>gi|326513394|dbj|BAK06937.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 292

 Score =  342 bits (878), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 180/280 (64%), Positives = 221/280 (78%), Gaps = 4/280 (1%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           M SA SKLYSDDVSLVVV++DTNPFFW+ ++L F+ FL+H++ F+N++L L+ LN VV+I
Sbjct: 1   MTSAHSKLYSDDVSLVVVVVDTNPFFWAGAALPFADFLSHLIHFVNSLLLLSNLNHVVII 60

Query: 61  ATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKD--EQLGKQEPEGRIAC 118
           A G +SC YV+DS + G  + G   +        + LEE + +D  E +G        A 
Sbjct: 61  AAGVSSCAYVFDSGNAG--ASGTADVAETLGKASRKLEELIKQDARETVGNGTVADGGAA 118

Query: 119 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 178
           SL SG+LS+ALCYIQR+FRSG  HPQPRILC+QGSPDGPEQYVA+MN+IFSAQRSMVPID
Sbjct: 119 SLFSGALSLALCYIQRIFRSGSRHPQPRILCVQGSPDGPEQYVAVMNSIFSAQRSMVPID 178

Query: 179 SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 238
           +C +G Q+SAFLQQASYITGGV+ KPQ+L GLFQYL  +F TDLHSR FL+LPK +GVDF
Sbjct: 179 TCIVGTQDSAFLQQASYITGGVYMKPQELSGLFQYLAAVFATDLHSRTFLRLPKTLGVDF 238

Query: 239 RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQ 278
           RASCFCHK TIDMGY+CSVCLSI+CK+ KKCSTCGS F +
Sbjct: 239 RASCFCHKKTIDMGYVCSVCLSIFCKYHKKCSTCGSEFSR 278


>gi|212722318|ref|NP_001131186.1| uncharacterized protein LOC100192494 [Zea mays]
 gi|194688300|gb|ACF78234.1| unknown [Zea mays]
 gi|194690816|gb|ACF79492.1| unknown [Zea mays]
 gi|224028537|gb|ACN33344.1| unknown [Zea mays]
 gi|413926714|gb|AFW66646.1| hypothetical protein ZEAMMB73_940692 [Zea mays]
          Length = 291

 Score =  336 bits (862), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 181/281 (64%), Positives = 220/281 (78%), Gaps = 4/281 (1%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           M SA SKLYSDDVSLVVV++DTNPFFW++++L F+ F  H++ F+N++L LN LN+VVVI
Sbjct: 1   MTSAHSKLYSDDVSLVVVVVDTNPFFWAAAALPFADFFAHLIHFVNSLLLLNHLNRVVVI 60

Query: 61  ATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQL--GKQEPEGRIAC 118
           A G +SC Y+ DSS       G   +        +  EEF+ +D +   G        A 
Sbjct: 61  AAGVSSCAYILDSSDASPS--GGVAVSDTFDKASRKAEEFIAQDARATAGNGSVASANAA 118

Query: 119 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 178
           SLLSG+LS++LCYIQR+FRSG  HPQPRILCLQGSPDGPEQYVA+MN+IFSAQRSMVPID
Sbjct: 119 SLLSGALSISLCYIQRIFRSGSRHPQPRILCLQGSPDGPEQYVAVMNSIFSAQRSMVPID 178

Query: 179 SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 238
           SC +G Q+SAFLQQASYITGGV+ KPQ+L+GLFQYL  +F TDLHSR FL+LPK +GVDF
Sbjct: 179 SCIVGTQDSAFLQQASYITGGVYLKPQELNGLFQYLAAVFATDLHSRAFLRLPKTLGVDF 238

Query: 239 RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQA 279
           RASCFCHK TIDMGY+CS+CLSI+CK+ KKCSTCGS F +A
Sbjct: 239 RASCFCHKKTIDMGYVCSICLSIFCKNQKKCSTCGSEFSRA 279


>gi|223946567|gb|ACN27367.1| unknown [Zea mays]
 gi|413926715|gb|AFW66647.1| hypothetical protein ZEAMMB73_940692 [Zea mays]
          Length = 294

 Score =  327 bits (837), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 180/284 (63%), Positives = 219/284 (77%), Gaps = 7/284 (2%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           M SA SKLYSDDVSLVVV++DTNPFFW++++L F+ F  H++ F+N++L LN LN+VVVI
Sbjct: 1   MTSAHSKLYSDDVSLVVVVVDTNPFFWAAAALPFADFFAHLIHFVNSLLLLNHLNRVVVI 60

Query: 61  ATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQL--GKQEPEGRIAC 118
           A G +SC Y+ DSS       G   +        +  EEF+ +D +   G        A 
Sbjct: 61  AAGVSSCAYILDSSDASPS--GGVAVSDTFDKASRKAEEFIAQDARATAGNGSVASANAA 118

Query: 119 SLLSGSLSMALCY---IQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175
           SLLSG+LS++LC    IQR+FRSG  HPQPRILCLQGSPDGPEQYVA+MN+IFSAQRSMV
Sbjct: 119 SLLSGALSISLCCYLDIQRIFRSGSRHPQPRILCLQGSPDGPEQYVAVMNSIFSAQRSMV 178

Query: 176 PIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVG 235
           PIDSC +G Q+SAFLQQASYITGGV+ KPQ+L+GLFQYL  +F TDLHSR FL+LPK +G
Sbjct: 179 PIDSCIVGTQDSAFLQQASYITGGVYLKPQELNGLFQYLAAVFATDLHSRAFLRLPKTLG 238

Query: 236 VDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQA 279
           VDFRASCFCHK TIDMGY+CS+CLSI+CK+ KKCSTCGS F +A
Sbjct: 239 VDFRASCFCHKKTIDMGYVCSICLSIFCKNQKKCSTCGSEFSRA 282


>gi|115443937|ref|NP_001045748.1| Os02g0125800 [Oryza sativa Japonica Group]
 gi|41053068|dbj|BAD08012.1| putative TFIIH basal transcription factor complex p34 subunit
           [Oryza sativa Japonica Group]
 gi|113535279|dbj|BAF07662.1| Os02g0125800 [Oryza sativa Japonica Group]
 gi|215768495|dbj|BAH00724.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 289

 Score =  320 bits (819), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 177/279 (63%), Positives = 219/279 (78%), Gaps = 8/279 (2%)

Query: 4   APSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           A SKLYSDDVSL VV++DTNP FW++++L F+ F  H++ ++N++L LN LN VVVIA G
Sbjct: 2   ASSKLYSDDVSLAVVVVDTNPLFWATAALPFADFFAHLIHYVNSLLLLNHLNHVVVIAAG 61

Query: 64  YNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIA----CS 119
            +SC Y++DSS        +  + +  A   + ++EF+ +D +       G +A     S
Sbjct: 62  LSSCTYIFDSSDASASGAAD--VAATFAKASRKMDEFIAEDAR--ATSSNGSVADGNAAS 117

Query: 120 LLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 179
           LLSG+LS+ALCYIQRVFRSG  HPQPRILCLQGSPDGPEQYVA+MN+IFSAQRSMVPIDS
Sbjct: 118 LLSGALSLALCYIQRVFRSGTRHPQPRILCLQGSPDGPEQYVAVMNSIFSAQRSMVPIDS 177

Query: 180 CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFR 239
           C +G Q+SAFLQQASYITGGV+ KPQ+L+GLFQYL  +F TDLHSR FL+LPK +GVDFR
Sbjct: 178 CIVGTQDSAFLQQASYITGGVYLKPQELNGLFQYLAAVFATDLHSRTFLRLPKTLGVDFR 237

Query: 240 ASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQ 278
           ASCFCHK TIDMGY+CSVCLSI+CK+ KKCSTCGS F +
Sbjct: 238 ASCFCHKKTIDMGYVCSVCLSIFCKYHKKCSTCGSEFNR 276


>gi|168000809|ref|XP_001753108.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695807|gb|EDQ82149.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 257

 Score =  296 bits (758), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 144/258 (55%), Positives = 193/258 (74%), Gaps = 13/258 (5%)

Query: 44  FLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGN------QSVGNGRMP-SLCATLLQN 96
           F+N IL LN  N++V+IATG ++C ++YDS++T         S  N +   S    + ++
Sbjct: 2   FVNGILLLNHTNRIVIIATGSSTCAFLYDSTATDGINGSSVTSKANKKTQVSRTELITKS 61

Query: 97  LEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG 156
           ++EF++K+E+  K+  +  I  S +SGSLSMALCYIQRV R  L HPQ RILCLQGSPD 
Sbjct: 62  MQEFVSKEEEASKESSQ--IQYSNISGSLSMALCYIQRVLRGPLPHPQARILCLQGSPDA 119

Query: 157 PEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLT 216
           P QY+++MN+IFSAQR+ V ID+C +G+Q+SAFLQQA++ITGG++ KP Q DGLF+YL+ 
Sbjct: 120 PYQYISVMNSIFSAQRNGVVIDACVVGSQHSAFLQQATHITGGIYMKPPQPDGLFEYLMM 179

Query: 217 IFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +F TDL SR FLQLP+PVGVDFRASCFCHK TIDMG++CSVCLSI+CKH + CSTCG+ F
Sbjct: 180 VFTTDLFSRQFLQLPRPVGVDFRASCFCHKKTIDMGFVCSVCLSIFCKHHQTCSTCGATF 239

Query: 277 GQAQTQSDEPSATNRKRK 294
               ++S+  + T  KRK
Sbjct: 240 ----SRSNSTTVTGMKRK 253


>gi|449527735|ref|XP_004170865.1| PREDICTED: general transcription factor IIH subunit 3-like [Cucumis
           sativus]
          Length = 187

 Score =  264 bits (674), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 136/177 (76%), Positives = 155/177 (87%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           MASAPSKLY+DDVSL+VVLLDTNPFFWS+S+L FS+FL+HVLAFLN+IL LNQLN+VVVI
Sbjct: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60

Query: 61  ATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSL 120
            TGY SC Y+Y+SSS  N  + +GRMP+LC  LL+NLEEF+  DEQ  K++P+G    SL
Sbjct: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120

Query: 121 LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPI 177
           LSGSLSMALCYIQ+VFRSG LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV I
Sbjct: 121 LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVCI 177


>gi|6714290|gb|AAF25986.1|AC013354_5 F15H18.15 [Arabidopsis thaliana]
          Length = 590

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 167/312 (53%), Positives = 207/312 (66%), Gaps = 53/312 (16%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHV--------LAFLNAILTLN 52
           M +  SK YSDDVSL+V+LLDTNP FWS++S++FSQFL+HV        LAFLNA+L LN
Sbjct: 1   MPAIASKQYSDDVSLLVLLLDTNPLFWSTTSITFSQFLSHVSVSNESIVLAFLNAVLGLN 60

Query: 53  QLNQVVVIATGYNSCDYVYDSSSTGN----QSVGNGRMPSLCATLLQNLEEFMNKDEQLG 108
           QLNQVVVIATGY+SCDY+YDSS T N    +S G G MP++  +LL+ LEEF+ KDE+L 
Sbjct: 61  QLNQVVVIATGYSSCDYIYDSSLTSNHGNFESNGTG-MPAIFGSLLKKLEEFVTKDEELS 119

Query: 109 KQE-PEGRI-ACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG---PEQYVAI 163
           K+E  E RI +C L  G++    C+    F +  +     + CL G         +YVA+
Sbjct: 120 KEEVSEDRIPSCLLWPGTV----CFFLHFFSTCKI-----LSCLSGLIKRFCHSFRYVAV 170

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYL-LTIFGTDL 222
           MN+IFSAQR M    + Y  A NS                     GLF +   TIF TDL
Sbjct: 171 MNSIFSAQRLM---RTQYSMAANS---------------------GLFCFAQKTIFATDL 206

Query: 223 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQ 282
           HSR F+QLPKP+GVDFRASCFCHK TIDMGYICSVCLSI+C+H KKCSTCGSVFGQ++  
Sbjct: 207 HSRGFVQLPKPIGVDFRASCFCHKKTIDMGYICSVCLSIFCEHHKKCSTCGSVFGQSKLD 266

Query: 283 SDEPSATNRKRK 294
            D  SA+++KRK
Sbjct: 267 -DASSASDKKRK 277


>gi|218189967|gb|EEC72394.1| hypothetical protein OsI_05675 [Oryza sativa Indica Group]
 gi|222622090|gb|EEE56222.1| hypothetical protein OsJ_05207 [Oryza sativa Japonica Group]
          Length = 261

 Score =  250 bits (638), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 151/279 (54%), Positives = 191/279 (68%), Gaps = 36/279 (12%)

Query: 4   APSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           A SKLYSDDVSL VV++DTNP FW++++L F+ F  H++ ++N++L LN LN VVVIA G
Sbjct: 2   ASSKLYSDDVSLAVVVVDTNPLFWATAALPFADFFAHLIHYVNSLLLLNHLNHVVVIAAG 61

Query: 64  YNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIA----CS 119
            +SC Y++DSS        +  + +  A   + ++EF+ +D +       G +A     S
Sbjct: 62  LSSCTYIFDSSDASASGAAD--VAATFAKASRKMDEFIAEDAR--ATSSNGSVADGNAAS 117

Query: 120 LLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 179
           LLSG+LS+ALCYIQRVFRSG  HPQPR                            VPIDS
Sbjct: 118 LLSGALSLALCYIQRVFRSGTRHPQPR----------------------------VPIDS 149

Query: 180 CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFR 239
           C +G Q+SAFLQQASYITGGV+ KPQ+L+GLFQYL  +F TDLHSR FL+LPK +GVDFR
Sbjct: 150 CIVGTQDSAFLQQASYITGGVYLKPQELNGLFQYLAAVFATDLHSRTFLRLPKTLGVDFR 209

Query: 240 ASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQ 278
           ASCFCHK TIDMGY+CSVCLSI+CK+ KKCSTCGS F +
Sbjct: 210 ASCFCHKKTIDMGYVCSVCLSIFCKYHKKCSTCGSEFNR 248


>gi|384252816|gb|EIE26291.1| transcription factor Tfb4 [Coccomyxa subellipsoidea C-169]
          Length = 305

 Score =  247 bits (631), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 138/307 (44%), Positives = 191/307 (62%), Gaps = 27/307 (8%)

Query: 4   APSKLYSDDVSLVVVLLDTNPFFWSS----------SSLSFSQFLTHVLAFLNAILTLNQ 53
           APS    DD  L+VV+L+T+   WS+           +LS +  L  VL FLN  L LN+
Sbjct: 2   APSD-EDDDSCLLVVVLETHEDLWSALADESESPVIGNLSLTTLLQQVLVFLNTFLALNE 60

Query: 54  LNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQE-- 111
            N++VVIA   +  + +Y S S   +  G    P   A      EE + K   L +++  
Sbjct: 61  ANELVVIAACGDQSEVLYTSPSVHRERQG----PKHGAVTGNASEEVLKKLHALLQEQAA 116

Query: 112 -PEGRIA-CSLLSGSLSMALCYIQRVFRSGL----LHPQPRILCLQGSPDGPEQYVAIMN 165
            P G  A  ++L+G+LS ALC I +   +         QPR+LCL+GSPD  EQY+++MN
Sbjct: 117 LPVGDHARPAVLAGALSRALCLINKAQLASAGGSRTRKQPRLLCLKGSPDATEQYISVMN 176

Query: 166 AIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSR 225
           AIF+AQR  V ID+C LGA +SAFLQQA+++T G++ +P+  D L QYLLTIF  D  SR
Sbjct: 177 AIFAAQRCEVVIDACMLGASDSAFLQQAAHLTNGIYLRPKHKDALLQYLLTIFCGDAFSR 236

Query: 226 NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDE 285
           +FLQLP+P+GVDFRA+CFCH+  ID+GY+CSVCLSI+C+ + +CSTCG+ F   +  +  
Sbjct: 237 SFLQLPRPIGVDFRAACFCHRRAIDLGYVCSVCLSIFCESITECSTCGTTF---KVVART 293

Query: 286 PSATNRK 292
           P AT RK
Sbjct: 294 P-ATKRK 299


>gi|330789994|ref|XP_003283083.1| hypothetical protein DICPUDRAFT_9126 [Dictyostelium purpureum]
 gi|325086950|gb|EGC40332.1| hypothetical protein DICPUDRAFT_9126 [Dictyostelium purpureum]
          Length = 282

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 119/286 (41%), Positives = 170/286 (59%), Gaps = 28/286 (9%)

Query: 11  DDVSLVVVLLDTNPFFWSSSS-----------LSFSQFLTHVLAFLNAILTLNQLNQVVV 59
           DD SL+V++LD N + W +SS           + FS+FL HV+ F+NA L LNQ NQ+ +
Sbjct: 3   DDTSLLVIVLDCNIYSWGTSSPPLINTSKNNYIGFSKFLEHVMVFINAYLMLNQENQIAL 62

Query: 60  IATGYNSCDYVYDSS-----STGNQSVGNGRMPSLCATLLQNLEEF---MNKDEQLGKQE 111
           I++      +++  +         Q+  N +  +L + +L+ L++    +N D+Q     
Sbjct: 63  ISSKIGQSSFIFPETVEQLEQEETQTENNIQNITLQSQILKRLQKINIEVNNDQQ----- 117

Query: 112 PEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ 171
               I  S  S +LSMALCYI R+ R    + +PRIL    SPD   QY+++MN IFSAQ
Sbjct: 118 ---DIINSSFSAALSMALCYINRI-RKESPNLKPRILVFNISPDVSSQYISVMNCIFSAQ 173

Query: 172 RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 231
           +  +P+DSC L   +S FLQQAS++T G++ KP + + L QYLLT F  D  SR  L  P
Sbjct: 174 KQSIPVDSCILSQTDSTFLQQASHLTNGIYLKPHKQELLSQYLLTTFLLDTQSRKSLAYP 233

Query: 232 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFG 277
               VD+RASCFCHK  +D+G++CSVCLSI+C H   CSTCG+ F 
Sbjct: 234 NLKSVDYRASCFCHKKIVDIGFVCSVCLSIFCNHFSSCSTCGTKFS 279


>gi|145340718|ref|XP_001415467.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575690|gb|ABO93759.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 285

 Score =  214 bits (545), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 122/296 (41%), Positives = 165/296 (55%), Gaps = 36/296 (12%)

Query: 10  SDDVSLVVVLLDTNPFFWSSS---------SLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           +DD SL+VVL++TNP +W++S         +   S  L     FLN+   LNQ N+  VI
Sbjct: 9   ADDKSLLVVLVETNPRYWAASEGKGDGTAAANGLSSVLEATTVFLNSFFALNQQNRAAVI 68

Query: 61  ATGYNSCDYVYDSS---STGNQSV-GNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRI 116
           A   + C Y+Y S    + G+ S    G+  +  AT                   P  R 
Sbjct: 69  AVHDDGCHYLYTSPLELNAGDASASATGKKRTKTAT-------------------PSSRA 109

Query: 117 A-CSLLSGSLSMALCYIQR---VFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 172
           A  S  +G+LS+ALCY  R   +  +  L  +PRILCLQ S D P  Y+++MNAIFSAQR
Sbjct: 110 AMSSPFAGALSLALCYCNRAQTLETAAGLRVRPRILCLQASQDNPTDYISMMNAIFSAQR 169

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
             +PID+  LG  +  FLQQA++IT G + KP    GL QYLL+    D+ SR+ L+LP 
Sbjct: 170 QSIPIDAFALGEHDLPFLQQAAHITRGAYVKPTHGAGLLQYLLSTAALDMRSRSHLKLPA 229

Query: 233 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSA 288
             GVDFRASCFCHK  + +G++CSVCLSI+C+    C TCG+ F      +  PSA
Sbjct: 230 ARGVDFRASCFCHKRPVSVGFVCSVCLSIFCERRSSCDTCGADFAADAQVTSVPSA 285


>gi|449524543|ref|XP_004169281.1| PREDICTED: general transcription factor IIH subunit 3-like, partial
           [Cucumis sativus]
          Length = 121

 Score =  202 bits (513), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 95/123 (77%), Positives = 106/123 (86%), Gaps = 2/123 (1%)

Query: 175 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 234
           VPIDSCY+G+ NSAFLQQASYITGGV+ KPQQ+DGLFQYL T+FGTDLHSR FLQLPK V
Sbjct: 1   VPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSV 60

Query: 235 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 294
           GVDFRASCFCHK TIDMGY+CSVCLSI+CKH KKCSTCGSVFG+   + D  S +  KRK
Sbjct: 61  GVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVELD--SVSKLKRK 118

Query: 295 TTD 297
           T +
Sbjct: 119 TPE 121


>gi|328874423|gb|EGG22788.1| general transcription factor IIH [Dictyostelium fasciculatum]
          Length = 304

 Score =  197 bits (500), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 110/283 (38%), Positives = 166/283 (58%), Gaps = 26/283 (9%)

Query: 11  DDVSLVVVLLDTNPFFWS---------------SSSLSFSQFLTHVLAFLNAILTLNQLN 55
           DD SL+V+++D N + W                S  ++ S F  H++ F+NA L LNQ N
Sbjct: 8   DDTSLLVIIVDLNAYSWGKQVELAEQNIDSTSGSRFITLSTFFEHLVIFINAYLLLNQEN 67

Query: 56  QVVVIATGYNSCDYVYDSS--STGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPE 113
           Q+ VI++      ++Y S+   T ++S+      SL      N+ + + ++  L  +E  
Sbjct: 68  QIAVISSNVGESSFIYPSNHVDTKDKSISFKISQSL-----NNISKHLIEEIHLTSEENS 122

Query: 114 GRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRS 173
              +    S ++S+ALCYI R+ +      + RIL    SPD   QY+ +MN IFSAQ+ 
Sbjct: 123 SSSS---FSAAMSLALCYINRIKKDNST-IRSRILVFNISPDVSTQYIPVMNCIFSAQKQ 178

Query: 174 MVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKP 233
            +P+DSC L + +S FLQQAS++T G++ +P + D L QYLL+ F  D +SR FL +P  
Sbjct: 179 SIPVDSCILTSTDSTFLQQASHLTNGIYLRPHRQDHLGQYLLSSFLIDSYSRKFLNVPTL 238

Query: 234 VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
             VD+RASCFCHK  +D+G++CSVCLSIYCK+   C+TCG+ F
Sbjct: 239 KTVDYRASCFCHKKIVDIGFVCSVCLSIYCKYSSSCTTCGTKF 281


>gi|348676264|gb|EGZ16082.1| hypothetical protein PHYSODRAFT_508027 [Phytophthora sojae]
          Length = 298

 Score =  194 bits (494), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 102/267 (38%), Positives = 154/267 (57%), Gaps = 17/267 (6%)

Query: 15  LVVVLLDTNPFFW------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           L+V+++DTNP  W      ++      Q ++  L F+N+ L L++ N++V+IA       
Sbjct: 21  LLVLVIDTNPVHWFAQGGSAADRAGLQQLISSTLVFVNSYLLLHRSNRIVIIAAHAGKSA 80

Query: 69  YVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMA 128
            +Y      + +    +   + A ++Q L++    D  L   +P  R A   ++ SLS A
Sbjct: 81  MLYPDPEQDDTTGSAEQAARVNAGVMQRLQQL--SDAPLDPAKP-NRTA---IAASLSRA 134

Query: 129 LCYIQRVFRSGLLHP--QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQN 186
           LC+I R        P  +PRIL +Q SPD  E Y+AIMN IFSAQ+  V +D+C L  ++
Sbjct: 135 LCFINRAINE---EPDLRPRILVVQKSPDVSEHYIAIMNGIFSAQKKSVAVDACILANEH 191

Query: 187 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHK 246
           S+F+QQA+Y+TGG+++KP    GL QYL++I+  D   R  L+LP    VDFRA CFCH+
Sbjct: 192 SSFMQQAAYLTGGIYYKPNDHSGLLQYLISIYLPDPSMRKLLKLPSQDSVDFRAMCFCHR 251

Query: 247 NTIDMGYICSVCLSIYCKHLKKCSTCG 273
             I   Y+C VCLS++C+    CSTCG
Sbjct: 252 EVISTAYVCPVCLSLFCEFRPICSTCG 278


>gi|301122235|ref|XP_002908844.1| general transcription factor IIH subunit, putative [Phytophthora
           infestans T30-4]
 gi|262099606|gb|EEY57658.1| general transcription factor IIH subunit, putative [Phytophthora
           infestans T30-4]
          Length = 293

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 99/267 (37%), Positives = 152/267 (56%), Gaps = 17/267 (6%)

Query: 15  LVVVLLDTNPFFW------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           L+V+++DTNP  W      ++      Q ++  L F+N+ L L++ N++V+IA       
Sbjct: 16  LLVLVIDTNPVHWFTNGGSAADRAGLQQLISSTLVFVNSYLLLHRSNRIVIIAAHAGKSA 75

Query: 69  YVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMA 128
            +Y      + +    +   + A +LQ +++    D  L    P      + ++ SLS +
Sbjct: 76  MLYPDPEQDDTTGSAEQAAKVNAGVLQRMQQL--SDAPLDPATPNQ----TAIAASLSRS 129

Query: 129 LCYIQRVFRSGLLHP--QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQN 186
           LC+I R        P  +PRIL +Q SPD  E Y+AIMN IFSAQ+  V +D+C L  ++
Sbjct: 130 LCFINRAINE---EPDLRPRILVIQKSPDVSEHYIAIMNGIFSAQKKSVAVDACILATEH 186

Query: 187 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHK 246
           S+F+QQA+Y+TGG+++KP    GL QYL++I+  D   R  L+LP    VDFRA CFCH+
Sbjct: 187 SSFMQQAAYLTGGIYYKPNDHSGLLQYLISIYLPDPSMRKLLKLPSQDSVDFRAMCFCHR 246

Query: 247 NTIDMGYICSVCLSIYCKHLKKCSTCG 273
             I   Y+C VCLS++C+    CSTCG
Sbjct: 247 EVISTAYVCPVCLSLFCEFRPICSTCG 273


>gi|290987006|ref|XP_002676214.1| predicted protein [Naegleria gruberi]
 gi|284089815|gb|EFC43470.1| predicted protein [Naegleria gruberi]
          Length = 346

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 116/322 (36%), Positives = 168/322 (52%), Gaps = 57/322 (17%)

Query: 10  SDDVSLVVVLLDTNPFFW--------------SSSSLSFSQFLTHVLAFLNAILTLNQLN 55
           +D++ L+V+LLD NP+ W              + S +SFSQF+  +L F+N  LT+ +  
Sbjct: 11  TDELELLVILLDINPYTWGKRKLDNEQQQSSSAKSLISFSQFINQLLVFINTFLTIQRNK 70

Query: 56  QVVVIATGYNSCDYV-----YDSSSTGNQSVGNGRMPSLCATLLQNL---------EEFM 101
           ++ VIA+  N+  +      + S   G  +    ++ S  +T + N+         E+F 
Sbjct: 71  KIAVIASSENNKSHFLFPNRFRSKKDGKHNHNQIQVKSTDSTSMLNMMGVKSGDDIEDFF 130

Query: 102 NKD----------------------EQLGKQEPEGRIACSL-----LSGSLSMALCYIQR 134
             D                      E     E +G    S      LSG+LSMALC+I R
Sbjct: 131 LNDIGFESVQNAVLSGLNSEEVTPSEDTEYMEDDGNHTTSTAQGFSLSGALSMALCFINR 190

Query: 135 VFRSGLL--HPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQ 192
           + +   L      RIL  Q SPD   QY+++MNAIFSA++  + +D+C L   +S FLQQ
Sbjct: 191 LEKEKPLGMSLNSRILTFQVSPDISSQYISVMNAIFSAEKMSIMLDACVLSNDDSTFLQQ 250

Query: 193 ASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMG 252
           A ++TGG + KPQ+ +GL QYLLTIF  +   R+++QLP    VDFRASCF  +  ID G
Sbjct: 251 ACFLTGGNYLKPQRQEGLIQYLLTIFMLEKSLRSYIQLPVQNTVDFRASCFETRKPIDDG 310

Query: 253 YICSVCLSIYCKHLKKCSTCGS 274
           Y+C VCLSI+  H   CSTC S
Sbjct: 311 YVCPVCLSIFSSHKPVCSTCAS 332


>gi|308798981|ref|XP_003074271.1| basal transcription factor complex subunit-related (ISS)
           [Ostreococcus tauri]
 gi|116000442|emb|CAL50122.1| basal transcription factor complex subunit-related (ISS)
           [Ostreococcus tauri]
          Length = 319

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 116/305 (38%), Positives = 162/305 (53%), Gaps = 40/305 (13%)

Query: 10  SDDVSLVVVLLDTNPFFWSSS---------SLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           +DD SL++VL++TNP  W +S         S   S  L   + FLNA   LNQ N+ VV+
Sbjct: 10  TDDRSLLIVLVETNPRTWETSEGKGEGTPASRGLSSVLGATITFLNAFFALNQQNRAVVV 69

Query: 61  ATGYNSCDYVY------DSSSTGNQS--------VGNGRMP---SLCATLLQNLEEFMNK 103
           A   + C Y+Y      ++   G+++        VG G  P       T+L+ L E +N 
Sbjct: 70  AVHGDGCHYLYTSPLGVETDEDGDEAEDARWANKVG-GLDPLQSEAGPTILRRLGE-LNA 127

Query: 104 DE--------QLGKQEPEGRIACSLLSGSLSMALCYIQR---VFRSGLLHPQPRILCLQG 152
            E        +  K + +   A S  +G+LS+ALCY  R   +  +  L  +PRILCLQ 
Sbjct: 128 AETGSSATGAKRDKADSDDGPATSPFAGALSLALCYCNRAQALENAAGLRSKPRILCLQA 187

Query: 153 SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 212
           S D P  Y+ +MNAIFSAQR  +P+D+  LG  +S F+QQA++IT G + KP   DGL Q
Sbjct: 188 SQDNPTDYIPMMNAIFSAQRQSIPVDAFALGEHDSPFMQQAAHITRGAYVKPTLGDGLLQ 247

Query: 213 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDM-GYICSVCLSIYCKHLKKCST 271
           YLL+    DL SR FL+LP+     FRAS            ++CSVCLS++C     C T
Sbjct: 248 YLLSTAVMDLRSRAFLKLPQAKASTFRASVLLPTTARQASAFVCSVCLSVFCDGRPACDT 307

Query: 272 CGSVF 276
           CG+ F
Sbjct: 308 CGTTF 312


>gi|115609530|ref|XP_791329.2| PREDICTED: general transcription factor IIH subunit 3-like
           [Strongylocentrotus purpuratus]
          Length = 313

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 104/287 (36%), Positives = 163/287 (56%), Gaps = 30/287 (10%)

Query: 14  SLVVVLLDTNPFFWSSSS--------LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
           +LVV+++D NP +W   S        L+ SQ +  V+ F+N+ L +++ N+V VIA+  N
Sbjct: 11  NLVVIVIDVNPVWWGKQSHHKQQKDGLALSQCMDAVVVFINSHLMMDRRNKVAVIASHTN 70

Query: 66  SCDYVY-----------DSSSTGNQSVGNGR---MPSLCATLLQNLEEFMNKDEQLGKQE 111
              ++Y           D     ++   +G+     S+  T+++ L+  +++ +  G   
Sbjct: 71  ESRFLYPKKTSDAFDDGDDQREQDEMSSDGKYEVFASINDTVMEELKNLVDESKSWGLSH 130

Query: 112 PEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFS 169
            +     +LL+GSL+MALCYI RV +   +  +   RIL L+ + D   QY+  MN IF+
Sbjct: 131 TD-----TLLAGSLAMALCYIHRVXKECAVGEKIKSRILVLKAADDSASQYMNFMNVIFT 185

Query: 170 AQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQ 229
           AQ+  +PID+C L  ++S+ LQQA  ITGG + K     GL QYLL ++      R  L 
Sbjct: 186 AQKQNIPIDACILD-KDSSLLQQACDITGGKYLKLLSTTGLLQYLLWVYLPSPSQRESLV 244

Query: 230 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           LP  + VD+RA+CFCH+  ID+GY+CSVCLSI+C +   CSTC + F
Sbjct: 245 LPPAIHVDYRAACFCHRILIDVGYVCSVCLSIFCTYSPICSTCHTAF 291


>gi|115655410|ref|XP_001194121.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Strongylocentrotus purpuratus]
          Length = 391

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 105/284 (36%), Positives = 157/284 (55%), Gaps = 26/284 (9%)

Query: 14  SLVVVLLDTNPFFWSSSS--------LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
           +LVV+++D NP +W   S        L+ SQ +  V+ F+N+ L +++ N+V VIA+  N
Sbjct: 91  NLVVIVIDVNPVWWGKQSHHKQQKDGLALSQCMDAVVVFINSHLMMDRRNKVAVIASHTN 150

Query: 66  SCDYVY-----------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEG 114
              ++Y           D     ++   +G+     +     +EE  N    L  + P  
Sbjct: 151 ESRFLYPKKTSDAFDDGDDQREQDEMSSDGKYEVFASINDTVMEELKN----LVDENPTV 206

Query: 115 RIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQR 172
               +LL+GSL+MALCYI RV +   +  +   RIL L+ + D   QY+  MN IF+AQ+
Sbjct: 207 SHTDTLLAGSLAMALCYIHRVEKECAVGEKIKSRILVLKAADDSASQYMNFMNVIFTAQK 266

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
             +PID+C L  ++S+ LQQA  ITGG + K     GL QYLL ++      R  L LP 
Sbjct: 267 QNIPIDACILD-KDSSLLQQACDITGGKYLKLLSTTGLLQYLLWVYLPSPSQRESLVLPP 325

Query: 233 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            + VD+RA+CFCH+  ID+GY+CSVCLSI+C +   CSTC + F
Sbjct: 326 AIHVDYRAACFCHRILIDVGYVCSVCLSIFCTYSPICSTCHTAF 369


>gi|417398730|gb|JAA46398.1| Putative rna polymer [Desmodus rotundus]
          Length = 308

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 106/286 (37%), Positives = 166/286 (58%), Gaps = 20/286 (6%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQALKGSQFTLSKCMDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVY--DSSSTGNQSVGNGRMPSLCAT--------LLQNLEEFMNKD--EQLGKQEP 112
               ++Y   +   G+     G+ P LC +        LL    E +  +  + + K + 
Sbjct: 65  QESRFLYPGRNGRMGDFFGDPGKTPELCPSGSKDGKYELLTAANEVIVDEIKDLMTKSDI 124

Query: 113 EGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSA 170
           +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  MN IF+A
Sbjct: 125 KGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVVKAAEDSALQYMNFMNVIFAA 184

Query: 171 QRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQL 230
           Q+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R  L L
Sbjct: 185 QKQSILIDACVLDS-DSGLLQQACDITGGMYLKVPQVPSLLQYLLWVFLPDQDQRPQLTL 243

Query: 231 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           P PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 244 PPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|334327155|ref|XP_001375408.2| PREDICTED: general transcription factor IIH subunit 3-like, partial
           [Monodelphis domestica]
          Length = 308

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 106/289 (36%), Positives = 164/289 (56%), Gaps = 27/289 (9%)

Query: 11  DDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGYN 65
           D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+   
Sbjct: 5   DELNLLVIIVDTNPIWWGKQALKESQFTLSKCIDAVMVMGNSHLFMNRSNKLAVIASHIQ 64

Query: 66  SCDYVYDSSST---------GNQSV-------GNGRMPSLCATLLQNLEEFMNKDEQLGK 109
              ++Y   +          GN +         +G+   L A      EE     + + K
Sbjct: 65  ESRFLYPGKNWKFGDIFGDPGNSATEYNPSGSKDGKYELLTAANEAITEEI---KDLMTK 121

Query: 110 QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAI 167
            + EG+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  MN I
Sbjct: 122 SDMEGQHTETLLAGSLAKALCYIHRMSKEVKDNQEMKSRILVIKAAEDSALQYMNFMNVI 181

Query: 168 FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 227
           F+AQ+  + ID+C L A +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ 
Sbjct: 182 FAAQKQNIVIDACVLDA-DSGLLQQACDITGGIYLKVPQMPSLLQYLLWVFLPDQDQRSQ 240

Query: 228 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 241 LNLPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|328852556|gb|EGG01701.1| hypothetical protein MELLADRAFT_110826 [Melampsora larici-populina
           98AG31]
          Length = 316

 Score =  181 bits (459), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 107/290 (36%), Positives = 162/290 (55%), Gaps = 30/290 (10%)

Query: 10  SDDVSLVVVLLDTNPFFWS------SSSLSF---SQFLTHVLAFLNAILTLNQLNQVVVI 60
           +D   L++++LD NP  WS      +SS  F   + FL  VL F N  L++   N + + 
Sbjct: 5   TDPADLLIIILDLNPSSWSNFKHQSTSSEEFIELTSFLETVLVFSNTHLSIRHENSLAIY 64

Query: 61  ATGYNSCDYVYDSSSTG--NQSVGNGRMPSLCAT--LLQN-----LEEFMN-KDEQLGKQ 110
           A+   + + +Y +   G  N SV   R  +   T  +L +     +++ M   D+ L   
Sbjct: 65  ASMIGTSELLYSTLENGVKNDSVSGHRDSNTYQTFRVLDDCVSDCVKKLMKVNDQDLKSG 124

Query: 111 EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ----PRILCLQGSPDGPEQYVAIMNA 166
           +P G +       +L+ ALC+I RV +     P+    PRIL +  SPD P QY+ +MN 
Sbjct: 125 QPTGTV------NALAKALCHINRVTKEET-RPKDSLKPRILIISISPDSPGQYIPMMNC 177

Query: 167 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 226
           IFSAQ+S +PID C +  +++ FLQQASY+T G++++ ++   L QYL  IF   +  R 
Sbjct: 178 IFSAQKSCIPIDVCKITGEDAVFLQQASYLTNGIYYRLEKPKALIQYLTMIFLPGVSIRK 237

Query: 227 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            L LP+   VD RA+CFCH+  ID+GY+CSVCLSI+C  +  CSTC + F
Sbjct: 238 SLNLPQQEEVDLRAACFCHRKIIDLGYVCSVCLSIFCTPIPVCSTCRTKF 287


>gi|395513842|ref|XP_003761131.1| PREDICTED: general transcription factor IIH subunit 3 [Sarcophilus
           harrisii]
          Length = 316

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 102/295 (34%), Positives = 165/295 (55%), Gaps = 33/295 (11%)

Query: 8   LYSDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIAT 62
           L  D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+
Sbjct: 9   LADDELNLLVIIVDTNPIWWGKQALKESQFTLSKCIDAVMVMGNSHLFMNRSNKLAVIAS 68

Query: 63  GYNSCDYVY-------------------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNK 103
                 ++Y                   D + +G++      + +    + + +++ M K
Sbjct: 69  HIQESRFLYPGKNWKFGDFFGDPGCSSADYNPSGSKDGKYELLTAANEVITEEIKDLMTK 128

Query: 104 DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYV 161
            +       EG+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+
Sbjct: 129 SDM------EGQRTETLLAGSLAKALCYIHRMGKEVKDNQEMKSRILVIKAAEDSALQYM 182

Query: 162 AIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTD 221
             MN IF+AQ+  + ID+C L   +S  LQQA  ITGG++ K  Q+  L QYLL +F  D
Sbjct: 183 NFMNVIFAAQKQNIVIDACVLDT-DSGLLQQACDITGGIYLKVPQMLSLLQYLLWVFLPD 241

Query: 222 LHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
              R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 242 QDQRSQLNLPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 296


>gi|118601766|ref|NP_001073057.1| general transcription factor IIH subunit 3 [Bos taurus]
 gi|426247186|ref|XP_004017367.1| PREDICTED: general transcription factor IIH subunit 3 [Ovis aries]
 gi|122131719|sp|Q05B56.1|TF2H3_BOVIN RecName: Full=General transcription factor IIH subunit 3; AltName:
           Full=General transcription factor IIH polypeptide 3
 gi|115545470|gb|AAI22793.1| General transcription factor IIH, polypeptide 3, 34kDa [Bos taurus]
 gi|296478493|tpg|DAA20608.1| TPA: general transcription factor IIH subunit 3 [Bos taurus]
          Length = 309

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 107/290 (36%), Positives = 164/290 (56%), Gaps = 27/290 (9%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVYDSSS---------TGNQSV-------GNGRMPSLCATLLQNLEEFMNKDEQLG 108
               ++Y   +          GN S         +G+   L A      EE     + + 
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPSSEFTPSGSKDGKYELLTAANEVIAEEI---KDLMT 121

Query: 109 KQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNA 166
           K + EG+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  MN 
Sbjct: 122 KSDIEGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNV 181

Query: 167 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 226
           IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+
Sbjct: 182 IFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRS 240

Query: 227 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 241 QLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|66730451|ref|NP_001019407.1| general transcription factor IIH subunit 3 [Rattus norvegicus]
 gi|81888006|sp|Q561R7.1|TF2H3_RAT RecName: Full=General transcription factor IIH subunit 3; AltName:
           Full=General transcription factor IIH polypeptide 3
 gi|62531219|gb|AAH93380.1| General transcription factor IIH, polypeptide 3 [Rattus norvegicus]
 gi|149063248|gb|EDM13571.1| general transcription factor IIH, polypeptide 3 [Rattus norvegicus]
          Length = 309

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 101/293 (34%), Positives = 164/293 (55%), Gaps = 33/293 (11%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   NA L +N+ NQ+ VIA+  
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQALKESQFTLSKCMDAVMVLANAHLFMNRSNQLAVIASHI 64

Query: 65  NSCDYVY-------------------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDE 105
               ++Y                   D + +G++      + +    + + +++ M K +
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNALPDCNPSGSKDGKYELLTAANEVIAEEIKDLMTKSD 124

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAI 163
             G+         +LL+GSL+ ALCYI R  ++   + +   RIL ++ + D   QY+  
Sbjct: 125 IKGQHTE------TLLAGSLAKALCYIHRASKAVKDNQEMKSRILVIKAAEDSALQYMNF 178

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D  
Sbjct: 179 MNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQD 237

Query: 224 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            R+ L LP P+ VD+RA+CFCH++ I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 QRSQLILPPPIHVDYRAACFCHRSLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|440898315|gb|ELR49840.1| General transcription factor IIH subunit 3, partial [Bos grunniens
           mutus]
          Length = 305

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 107/290 (36%), Positives = 164/290 (56%), Gaps = 27/290 (9%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 1   EDELNLLVIIVDTNPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 60

Query: 65  NSCDYVYDSSS---------TGNQSV-------GNGRMPSLCATLLQNLEEFMNKDEQLG 108
               ++Y   +          GN S         +G+   L A      EE     + + 
Sbjct: 61  QESRFLYPGKNGRLGDFFGDPGNPSSEFTPSGSKDGKYELLTAANEVIAEEI---KDLMT 117

Query: 109 KQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNA 166
           K + EG+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  MN 
Sbjct: 118 KSDIEGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNV 177

Query: 167 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 226
           IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+
Sbjct: 178 IFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRS 236

Query: 227 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 237 QLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 286


>gi|325190435|emb|CCA24938.1| general transcription factor IIH subunit putative [Albugo laibachii
           Nc14]
          Length = 288

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 99/270 (36%), Positives = 160/270 (59%), Gaps = 18/270 (6%)

Query: 14  SLVVVLLDTNPFFW----------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           + +V++LDTNP  W           S SL F + +  VL F+++ L LN+ N+++++A  
Sbjct: 7   AFLVLILDTNPSHWWRNVDGMPTDQSDSL-FQRTVKSVLIFVSSYLLLNRSNRLILLAAH 65

Query: 64  YNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSG 123
             S   +Y  + + ++S    ++  +   +L+NL     +   +  +      + + +SG
Sbjct: 66  AGSSVTLYPDALSESESESGDQIGLIRNIVLKNLLALSEQTHDVTSK------STTSISG 119

Query: 124 SLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG 183
           S+S  LC + RV +       PRI  +Q SPD PE Y++IMN+IFSAQ+  V ID+C L 
Sbjct: 120 SISRGLCCMNRVLKENA-QLCPRIFVIQRSPDVPEHYISIMNSIFSAQKKTVAIDACILS 178

Query: 184 AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCF 243
            ++S F+QQA+Y+TGG+++KP+ +DGL QYL+T++  D + R  L+ P    VDFRA CF
Sbjct: 179 EEHSPFMQQATYLTGGIYYKPRVIDGLLQYLITLYLPDPNMRKMLRFPSQESVDFRAMCF 238

Query: 244 CHKNTIDMGYICSVCLSIYCKHLKKCSTCG 273
           CHK  +   ++C VCLS++C+    CSTCG
Sbjct: 239 CHKKAVSTAFVCPVCLSLFCQFQPICSTCG 268


>gi|348554273|ref|XP_003462950.1| PREDICTED: general transcription factor IIH subunit 3-like [Cavia
           porcellus]
          Length = 309

 Score =  177 bits (450), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 105/293 (35%), Positives = 164/293 (55%), Gaps = 33/293 (11%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  VL   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQALKESQFTLSKCMDAVLVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVYDSSS---------TGN-------QSVGNGRMPSLCA---TLLQNLEEFMNKDE 105
           +   ++Y   +          GN           +G+   L A    +++ +++ M K +
Sbjct: 65  HESRFLYPGKNGRLGDFFGDAGNPPPEYNPSGSKDGKYELLTAANEVIVEEIKDLMTKSD 124

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAI 163
             G+         +LL+GSL+ ALCYI R+ +    + +   RIL L+ + D   QY+  
Sbjct: 125 IKGQHTE------TLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVLKAAEDSALQYMNF 178

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MN IF+AQ+  + ID+C L   +S  LQQA  ITGG++ K  Q+  L QYLL +F  D  
Sbjct: 179 MNVIFAAQKQNILIDACVLDT-DSGLLQQACDITGGLYLKVPQMTSLLQYLLWVFLPDQD 237

Query: 224 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            R  L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 QRAQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|194042856|ref|XP_001927574.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
           [Sus scrofa]
          Length = 309

 Score =  177 bits (450), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 107/290 (36%), Positives = 165/290 (56%), Gaps = 27/290 (9%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVY---------------DSSSTGNQS-VGNGRMPSLCATLLQNLEEFMNKDEQLG 108
               ++Y               + SS  N S   +G+   L A      EE     + + 
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPSSEFNPSGSKDGKYELLTAANDVIAEEI---KDLMT 121

Query: 109 KQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNA 166
           K + EG+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  MN 
Sbjct: 122 KSDIEGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNV 181

Query: 167 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 226
           IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+
Sbjct: 182 IFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRS 240

Query: 227 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 241 QLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|149633387|ref|XP_001507590.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Ornithorhynchus anatinus]
          Length = 311

 Score =  177 bits (449), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 104/290 (35%), Positives = 166/290 (57%), Gaps = 23/290 (7%)

Query: 8   LYSDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIAT 62
           L  ++++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+
Sbjct: 5   LSDEELNLLVIVVDTNPIWWGKQALRESQFTLSKCIDAVMVMGNSHLLMNRTNKLAVIAS 64

Query: 63  GYNSCDYVY------------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKD--EQLG 108
                 ++Y            D+ S+  +   +G        LL    E + ++  + + 
Sbjct: 65  HIQESRFLYPGKNWKLGEFSGDACSSPTEYNPSGSKDGK-YELLTTANEVITEEIKDLMT 123

Query: 109 KQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNA 166
           K + +G+   +LL+GSL+ ALCYI R+ +        + RIL ++ S D   QY+  MN 
Sbjct: 124 KSDMKGQHTETLLAGSLAKALCYIHRMNKEVKDSQEMRSRILVIKASEDSALQYMNFMNV 183

Query: 167 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 226
           IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+
Sbjct: 184 IFAAQKQNILIDACVLDS-DSGLLQQACDITGGIYLKVPQMPSLLQYLLWVFLPDQDQRS 242

Query: 227 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 243 QLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 292


>gi|358060458|dbj|GAA93863.1| hypothetical protein E5Q_00509 [Mixia osmundae IAM 14324]
          Length = 331

 Score =  177 bits (449), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 101/278 (36%), Positives = 153/278 (55%), Gaps = 17/278 (6%)

Query: 15  LVVVLLDTNPFFWSSSS--------LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS 66
           L+ +++DTN   W  S+        LS S+ L  +L FLNA L L   N+++V   G   
Sbjct: 10  LLCIIIDTNVLAWHESAQSSESGDRLSLSEALETLLVFLNAHLALRDQNELLVYGAGPGH 69

Query: 67  CDYVYDS------SSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQ-EPEGRIACS 119
            + +Y S      SS  + S  N R  S      Q   +      Q+ ++ EP+     +
Sbjct: 70  SEMIYSSFDDQGESSADHASETNERYASTFPRFRQVDTQVSKAVGQMMRELEPDTDAPPA 129

Query: 120 LLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 178
           ++S +L+  LC+I R+ R    H  +PR+L L  S D   QY+ +MN IFSAQ++ VPID
Sbjct: 130 IVS-ALARTLCHINRISREETKHTIKPRVLLLSVSHDSSSQYIPLMNCIFSAQKANVPID 188

Query: 179 SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 238
            C +   ++ FLQQA+Y++ G+++K ++  GL QYL+  F   + SR  L LP    VD 
Sbjct: 189 VCKIYGDDAVFLQQAAYLSSGIYYKLEKRAGLLQYLMMTFLPGVTSRKLLNLPSQDAVDL 248

Query: 239 RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           RA+CFCH+  +D+G++CSVCLSI+C     C TC + F
Sbjct: 249 RAACFCHQRIVDIGFVCSVCLSIFCTPRPSCLTCRTKF 286


>gi|395846791|ref|XP_003796077.1| PREDICTED: general transcription factor IIH subunit 3 [Otolemur
           garnettii]
          Length = 309

 Score =  177 bits (449), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 101/293 (34%), Positives = 166/293 (56%), Gaps = 33/293 (11%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQALKESQFTLSKCIDAVMVMGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVY-------------------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDE 105
           +   ++Y                   + + TG++      + +    +++ +++ M K +
Sbjct: 65  HESRFLYPGKNGRLGDFFGDPGNPPPEFNPTGSKDGKYELLTAANEVIVEEIKDLMTKSD 124

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAI 163
             G+         +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  
Sbjct: 125 IKGQHTE------TLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNF 178

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D  
Sbjct: 179 MNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQD 237

Query: 224 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 QRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|190358598|ref|NP_001121894.1| uncharacterized protein LOC100151595 [Danio rerio]
          Length = 296

 Score =  177 bits (448), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 103/277 (37%), Positives = 157/277 (56%), Gaps = 15/277 (5%)

Query: 11  DDVSLVVVLLDTNPFFWSSSS-----LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
           DD++++V++LD NP +W   +      + S  L  ++   NA L +++ N++ VI + Y+
Sbjct: 5   DDINILVIVLDVNPVWWGQQAQREPKFTLSACLDSLMVLANAHLVMSRTNKLAVITSLYH 64

Query: 66  SCDYVYDSSS--TGNQSVGN--GRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLL 121
              ++Y S    +G +   N  G+   L  T     EE  N  E   + E  G    +LL
Sbjct: 65  ESHFLYPSKQWRSGEEISANPDGKYELLSVTNDLFAEEIRNLME---RAEVSGSQTETLL 121

Query: 122 SGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 179
           + SL+ ALCYI R+ +        + RIL ++ + D   QY+  MN IF+AQ+  + ID+
Sbjct: 122 AVSLARALCYINRISKDVQAGQEVKSRILVIKAAEDSTSQYMNFMNVIFAAQKKNILIDA 181

Query: 180 CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFR 239
           C L + +S  LQQA  ITGG++ +  Q   L QYLL +F  D   R+ L LP PV VD+R
Sbjct: 182 CVLDS-DSGLLQQACDITGGLYLRVPQKVALTQYLLWVFLPDAEQRSQLLLPPPVHVDYR 240

Query: 240 ASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           A+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 241 AACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 277


>gi|301754663|ref|XP_002913183.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Ailuropoda melanoleuca]
          Length = 309

 Score =  177 bits (448), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 104/293 (35%), Positives = 165/293 (56%), Gaps = 33/293 (11%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIVVDTNPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVYDSSS---------TGNQSV-------GNGRMPSLCAT---LLQNLEEFMNKDE 105
               ++Y   +          GN S         +G+     A    + + +++ M K E
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPSSEFSPSGSKDGKYELFTAANEIIAEEIKDLMTKSE 124

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAI 163
             G+         +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+ +
Sbjct: 125 IKGQHTE------TLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNL 178

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D  
Sbjct: 179 MNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQD 237

Query: 224 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 QRSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|114647703|ref|XP_001170519.1| PREDICTED: general transcription factor IIH subunit 3 isoform 3
           [Pan troglodytes]
 gi|397481834|ref|XP_003812142.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
           [Pan paniscus]
 gi|410221584|gb|JAA08011.1| general transcription factor IIH, polypeptide 3, 34kDa [Pan
           troglodytes]
 gi|410252228|gb|JAA14081.1| general transcription factor IIH, polypeptide 3, 34kDa [Pan
           troglodytes]
 gi|410294034|gb|JAA25617.1| general transcription factor IIH, polypeptide 3, 34kDa [Pan
           troglodytes]
 gi|410334639|gb|JAA36266.1| general transcription factor IIH, polypeptide 3, 34kDa [Pan
           troglodytes]
          Length = 308

 Score =  176 bits (447), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 103/286 (36%), Positives = 165/286 (57%), Gaps = 20/286 (6%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++D NP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVY--DSSSTGNQSVGNGRMPSLCAT--------LLQNLEEFMNKD--EQLGKQEP 112
               ++Y   +   G+     G +P    +        LL +  E + ++  + + K + 
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNLPEFNPSGSKDGKYELLTSANEVIVEEIKDLMTKSDI 124

Query: 113 EGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSA 170
           +G+   +LL+GSL+ ALCYI R+ +        + RIL ++ + D   QY+  MN IF+A
Sbjct: 125 KGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFMNVIFAA 184

Query: 171 QRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQL 230
           Q+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ L L
Sbjct: 185 QKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLIL 243

Query: 231 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           P PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 244 PPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|291414274|ref|XP_002723381.1| PREDICTED: general transcription factor IIH, polypeptide 3, 34kDa
           [Oryctolagus cuniculus]
          Length = 309

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 105/293 (35%), Positives = 165/293 (56%), Gaps = 33/293 (11%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ V+A+  
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVVASHI 64

Query: 65  NSCDYVY---------------DSSSTGNQS-VGNGRMPSLCA---TLLQNLEEFMNKDE 105
               ++Y               + SS  N S   +G+   L A    +++ + + M K +
Sbjct: 65  QESRFLYPGKNGRVGDFFGDPGNPSSEFNPSGSKDGKYELLTAANEVIVEEIRDLMTKSD 124

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAI 163
             G+         +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  
Sbjct: 125 IKGQHTE------TLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNF 178

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MN IF+AQ+  + ID+C L   +S  LQQA  ITGG++ K  Q+  L QYLL +F  D  
Sbjct: 179 MNVIFAAQKQNILIDACVLDT-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQD 237

Query: 224 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 QRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCSFSPICTTCETAF 290


>gi|149720731|ref|XP_001498402.1| PREDICTED: general transcription factor IIH subunit 3-like [Equus
           caballus]
          Length = 309

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 104/293 (35%), Positives = 167/293 (56%), Gaps = 33/293 (11%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRCNKLAVIASHI 64

Query: 65  NSCDYVY---------------DSSSTGNQS-VGNGRMPSLCA---TLLQNLEEFMNKDE 105
               ++Y               ++ S  N S   +G+   L A    +++ +++ M K +
Sbjct: 65  QESRFLYPGKHGRLGDFFGDPGNAPSEFNPSGSKDGKYELLTAANEVIVEEIKDLMTKSD 124

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAI 163
             G+         +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  
Sbjct: 125 IKGQHTE------TLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDTALQYMNF 178

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D  
Sbjct: 179 MNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQD 237

Query: 224 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 QRSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|363739821|ref|XP_001235063.2| PREDICTED: general transcription factor IIH subunit 3 [Gallus
           gallus]
          Length = 307

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 101/291 (34%), Positives = 164/291 (56%), Gaps = 32/291 (10%)

Query: 11  DDVSLVVVLLDTNPFFWS-----SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
           +++SL+V+++DTNP +W       +  + S+ +   +   N+ L +++ N++ +IA+   
Sbjct: 5   EELSLLVIVIDTNPIWWGKRAQGEAEFTLSKCMDAAMVLGNSHLFMSRTNRLALIASHTQ 64

Query: 66  SCDYVY--------DSSSTGNQSV-------GNGRMPSLCA---TLLQNLEEFMNKDEQL 107
              ++Y        D    G  SV        +G+   L A    + + +++ M K + +
Sbjct: 65  ESRFLYPGKRWASADLLGDGGNSVESNCSGSKDGKYELLTAINDAIAEEIKDLMTKTDMM 124

Query: 108 GKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMN 165
           G+Q        +LL+GSL+ ALCYI ++ +    + +   RIL ++ + D   QY+  MN
Sbjct: 125 GQQTE------TLLAGSLAKALCYINKISKEVKANQEMKSRILVIKAAEDSALQYMNFMN 178

Query: 166 AIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSR 225
            IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K   L  L QYLL +F  D   R
Sbjct: 179 VIFAAQKQSILIDACVLDS-DSGLLQQACDITGGIYLKVPHLPSLLQYLLWVFLPDQEQR 237

Query: 226 NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           + L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     CSTC + F
Sbjct: 238 SQLVLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICSTCETAF 288


>gi|31088894|ref|NP_852075.1| general transcription factor IIH subunit 3 [Mus musculus]
 gi|22654149|sp|Q8VD76.1|TF2H3_MOUSE RecName: Full=General transcription factor IIH subunit 3; AltName:
           Full=Basic transcription factor 2 34 kDa subunit;
           Short=BTF2 p34; AltName: Full=General transcription
           factor IIH polypeptide 3; AltName: Full=TFIIH basal
           transcription factor complex p34 subunit
 gi|17028402|gb|AAH17515.1| General transcription factor IIH, polypeptide 3 [Mus musculus]
 gi|74228938|dbj|BAE21939.1| unnamed protein product [Mus musculus]
 gi|148687628|gb|EDL19575.1| general transcription factor IIH, polypeptide 3 [Mus musculus]
          Length = 309

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 101/293 (34%), Positives = 163/293 (55%), Gaps = 33/293 (11%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ NQ+ VIA+  
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQALKESQFTLSKCMDAVMVLANSHLFMNRSNQLAVIASHI 64

Query: 65  NSCDYVY-------------------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDE 105
                +Y                   D + +G++      +      + + +++ M K +
Sbjct: 65  QESRLLYPGKNGGLGDFFGDPGNALPDCNPSGSKDGKYELLTVANEVIAEEIKDLMTKSD 124

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAI 163
             G+         +LL+GSL+ ALCYI RV ++   + +   RIL ++ + D   QY+  
Sbjct: 125 IKGQHTE------TLLAGSLAKALCYIHRVNKAVKDNQEMKSRILVIKAAEDSALQYMNF 178

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D  
Sbjct: 179 MNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQD 237

Query: 224 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            R+ L LP P+ VD+RA+CFCH++ I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 QRSQLILPPPIHVDYRAACFCHRSLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|432873950|ref|XP_004072398.1| PREDICTED: general transcription factor IIH subunit 3-like [Oryzias
           latipes]
          Length = 300

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 106/282 (37%), Positives = 156/282 (55%), Gaps = 19/282 (6%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSS-----LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+V+L+V+++D NP +W   +      + S+ L  V+   N+ + L + N++ VIA+  
Sbjct: 4   EDEVNLLVIVVDVNPIWWGQQAQREPEFTLSKCLDGVMVMGNSHMALTRTNKLAVIASHC 63

Query: 65  NSCDYVYDSSSTG--------NQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRI 116
               ++Y S S G          S  +G+   L        EE  N    L K E  G  
Sbjct: 64  QDSHFLYPSKSLGGGGGAAEDTSSSRDGKYELLAVANSAIAEEIRNV---LSKTEVRGNS 120

Query: 117 ACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 174
             +LL+GSL+ ALCYI RV +   +    + RIL ++ + D   QY+  MN IF+AQ+  
Sbjct: 121 TDTLLAGSLAKALCYIYRVSKELDVGQEIKSRILVIKAAEDCALQYMNFMNVIFAAQKQN 180

Query: 175 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 234
           + ID+C L + +S  LQQA  ITGG++ K  Q   L QYLL +F  D   R+ L LP P 
Sbjct: 181 ILIDACVLDS-DSGLLQQACDITGGLYLKIPQKLALTQYLLWVFLPDSEQRSQLVLPPPA 239

Query: 235 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 240 HVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 281


>gi|50540194|ref|NP_001002564.1| general transcription factor IIH, polypeptide 3 [Danio rerio]
 gi|49900709|gb|AAH76236.1| General transcription factor IIH, polypeptide 3 [Danio rerio]
          Length = 296

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 103/277 (37%), Positives = 156/277 (56%), Gaps = 15/277 (5%)

Query: 11  DDVSLVVVLLDTNPFFWSSSS-----LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
           DD++++V++LD NP +W   +      + S  L  ++   NA L +++ N++ VI + Y+
Sbjct: 5   DDINILVIVLDVNPVWWGQQAQREPKFTLSACLDSLMVLANAHLVMSRTNKLAVITSLYH 64

Query: 66  SCDYVYDSSS--TGNQSVGN--GRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLL 121
              ++Y S    +G +   N  G+   L  T     EE  N  E   + E  G    +LL
Sbjct: 65  ESHFLYPSKQWRSGEEISANPDGKYELLSVTNDLFAEEIRNLME---RAEVSGSQTETLL 121

Query: 122 SGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 179
           + SL+ ALCYI R+ +        + RIL ++ + D   QY+  MN IF+AQ+  + ID+
Sbjct: 122 AVSLARALCYINRISKDVQAGQEVKSRILVIKAAEDSTSQYMNFMNVIFAAQKKNILIDA 181

Query: 180 CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFR 239
           C L + +S  LQQA  ITGG++ +  Q   L QYLL  F  D   R+ L LP PV VD+R
Sbjct: 182 CVLDS-DSGLLQQACDITGGLYLRVPQKVALTQYLLWAFLPDAEQRSQLLLPPPVHVDYR 240

Query: 240 ASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           A+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 241 AACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 277


>gi|1039318|emb|CAA82909.1| basic transcription factor 2, 35 kD subunit [Homo sapiens]
          Length = 303

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 103/286 (36%), Positives = 165/286 (57%), Gaps = 20/286 (6%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++D NP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVY--DSSSTGNQSVGNGRMPSLCAT--------LLQNLEEFMNKD--EQLGKQEP 112
               ++Y   +   G+     G  P    +        LL +  E + ++  + + K + 
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSANEVIVEEIKDLMTKSDI 124

Query: 113 EGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSA 170
           +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  MN IF+A
Sbjct: 125 KGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNVIFAA 184

Query: 171 QRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQL 230
           Q+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ L L
Sbjct: 185 QKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLIL 243

Query: 231 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           P PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 244 PPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|344297280|ref|XP_003420327.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Loxodonta africana]
          Length = 309

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 104/293 (35%), Positives = 166/293 (56%), Gaps = 33/293 (11%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVY---------------DSSSTGNQS-VGNGRMPSLCA---TLLQNLEEFMNKDE 105
           +   ++Y               +  S  N S   +G+   L A    + + +++ M K +
Sbjct: 65  HESRFLYPGKNGRLGDFFGEASNPPSEFNPSGSKDGKYELLTAANEVIAEEIKDLMTKSD 124

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAI 163
             G+         +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  
Sbjct: 125 IKGQHME------TLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNF 178

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D  
Sbjct: 179 MNVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQD 237

Query: 224 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 QRSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|335775838|gb|AEH58705.1| general transcription factor IIH subunit-like protein [Equus
           caballus]
          Length = 302

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 104/288 (36%), Positives = 165/288 (57%), Gaps = 30/288 (10%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
            D+++L+V+++DTNP +W   +L  SQ    VL   N+ L +N+ N++ VIA+      +
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQALKESQDAVMVLG--NSHLFMNRCNKLAVIASHIQESRF 62

Query: 70  VY---------------DSSSTGNQS-VGNGRMPSLCA---TLLQNLEEFMNKDEQLGKQ 110
           +Y               ++ S  N S   +G+   L A    +++ +++ M K +  G+ 
Sbjct: 63  LYPGKHGRLGDFFGDPGNAPSEFNPSGSKDGKYELLTAANEVIVEEIKDLMTKSDIKGQH 122

Query: 111 EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIF 168
                   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  MN IF
Sbjct: 123 TE------TLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDTALQYMNFMNVIF 176

Query: 169 SAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFL 228
           +AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ L
Sbjct: 177 AAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQL 235

Query: 229 QLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 236 ILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 283


>gi|19923732|ref|NP_001507.2| general transcription factor IIH subunit 3 isoform a [Homo sapiens]
 gi|50403772|sp|Q13889.2|TF2H3_HUMAN RecName: Full=General transcription factor IIH subunit 3; AltName:
           Full=Basic transcription factor 2 34 kDa subunit;
           Short=BTF2 p34; AltName: Full=General transcription
           factor IIH polypeptide 3; AltName: Full=TFIIH basal
           transcription factor complex p34 subunit
 gi|23955948|gb|AAN40702.1| general transcription factor IIH, polypeptide 3, 34kDa [Homo
           sapiens]
 gi|40850955|gb|AAH65250.1| General transcription factor IIH, polypeptide 3, 34kDa [Homo
           sapiens]
 gi|119618832|gb|EAW98426.1| general transcription factor IIH, polypeptide 3, 34kDa, isoform
           CRA_a [Homo sapiens]
 gi|189053764|dbj|BAG36016.1| unnamed protein product [Homo sapiens]
 gi|208966364|dbj|BAG73196.1| general transcription factor IIH, polypeptide 3, 34kDa [synthetic
           construct]
          Length = 308

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 103/286 (36%), Positives = 165/286 (57%), Gaps = 20/286 (6%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++D NP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVY--DSSSTGNQSVGNGRMPSLCAT--------LLQNLEEFMNKD--EQLGKQEP 112
               ++Y   +   G+     G  P    +        LL +  E + ++  + + K + 
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSANEVIVEEIKDLMTKSDI 124

Query: 113 EGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSA 170
           +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  MN IF+A
Sbjct: 125 KGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNVIFAA 184

Query: 171 QRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQL 230
           Q+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ L L
Sbjct: 185 QKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLIL 243

Query: 231 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           P PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 244 PPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|57105536|ref|XP_534644.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
           [Canis lupus familiaris]
          Length = 308

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 106/289 (36%), Positives = 165/289 (57%), Gaps = 26/289 (8%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIVVDTNPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVYDSSS---------TGNQS------VGNGRMPSLCATLLQNLEEFMNKDEQLGK 109
               ++Y   +          GN S        +G+     A     +EE   KD  + K
Sbjct: 65  QESRFLYPGKNGKLGDFFGDPGNPSEFSPSGSKDGKYELFTAANEVIVEEI--KDLMI-K 121

Query: 110 QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAI 167
            + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  MN I
Sbjct: 122 SDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNVI 181

Query: 168 FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 227
           F+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ 
Sbjct: 182 FAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQ 240

Query: 228 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 241 LILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|167773979|gb|ABZ92424.1| general transcription factor IIH, polypeptide 3, 34kDa [synthetic
           construct]
          Length = 308

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 103/286 (36%), Positives = 165/286 (57%), Gaps = 20/286 (6%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++D NP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIVVDANPIWWGKQALQESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVY--DSSSTGNQSVGNGRMPSLCAT--------LLQNLEEFMNKD--EQLGKQEP 112
               ++Y   +   G+     G  P    +        LL +  E + ++  + + K + 
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSANEVIVEEIKDLMTKSDI 124

Query: 113 EGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSA 170
           +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  MN IF+A
Sbjct: 125 KGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNVIFAA 184

Query: 171 QRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQL 230
           Q+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ L L
Sbjct: 185 QKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLIL 243

Query: 231 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           P PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 244 PPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|355564808|gb|EHH21308.1| hypothetical protein EGK_04329, partial [Macaca mulatta]
 gi|355786640|gb|EHH66823.1| hypothetical protein EGM_03880, partial [Macaca fascicularis]
          Length = 308

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 103/286 (36%), Positives = 164/286 (57%), Gaps = 20/286 (6%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++D NP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVY--DSSSTGNQSVGNGRMPSLCAT--------LLQNLEEFMNKD--EQLGKQEP 112
               ++Y   +   G+     G  P    +        LL +  E + ++  + + K + 
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSANEVIVEEIKDLMTKSDI 124

Query: 113 EGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSA 170
           +G+   +LL+GSL+ ALCYI R+ +        + RIL ++ + D   QY+  MN IF+A
Sbjct: 125 KGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFMNVIFAA 184

Query: 171 QRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQL 230
           Q+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ L L
Sbjct: 185 QKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLIL 243

Query: 231 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           P PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 244 PPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|402888055|ref|XP_003907392.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
           [Papio anubis]
 gi|426374588|ref|XP_004054152.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
           [Gorilla gorilla gorilla]
 gi|380784389|gb|AFE64070.1| general transcription factor IIH subunit 3 [Macaca mulatta]
 gi|383417813|gb|AFH32120.1| general transcription factor IIH subunit 3 [Macaca mulatta]
 gi|384946662|gb|AFI36936.1| general transcription factor IIH subunit 3 [Macaca mulatta]
          Length = 308

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 103/286 (36%), Positives = 164/286 (57%), Gaps = 20/286 (6%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++D NP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVY--DSSSTGNQSVGNGRMPSLCAT--------LLQNLEEFMNKD--EQLGKQEP 112
               ++Y   +   G+     G  P    +        LL +  E + ++  + + K + 
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSANEVIVEEIKDLMTKSDI 124

Query: 113 EGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSA 170
           +G+   +LL+GSL+ ALCYI R+ +        + RIL ++ + D   QY+  MN IF+A
Sbjct: 125 KGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFMNVIFAA 184

Query: 171 QRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQL 230
           Q+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ L L
Sbjct: 185 QKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLIL 243

Query: 231 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           P PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 244 PPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|66819921|ref|XP_643618.1| general transcription factor IIH, polypeptide 3 [Dictyostelium
           discoideum AX4]
 gi|74861043|sp|Q86IB5.1|TF2H3_DICDI RecName: Full=General transcription factor IIH subunit 3; AltName:
           Full=TFIIH basal transcription factor complex subunit 3
 gi|60471553|gb|EAL69509.1| general transcription factor IIH, polypeptide 3 [Dictyostelium
           discoideum AX4]
          Length = 372

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 107/259 (41%), Positives = 150/259 (57%), Gaps = 16/259 (6%)

Query: 32  LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVG--------- 82
           + F++FL H + F+NA L LNQ NQ+ +I +      +V+  S+                
Sbjct: 90  IGFNKFLEHFMVFINAYLMLNQENQLAIICSKIGESSFVFPQSNIDQYQQEQQELEQRQL 149

Query: 83  --NGRM-PSLCATLL-QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS 138
             NG + P+   T+  Q L +    D ++   + +  I  S  S S+S+ALCYI R+ R 
Sbjct: 150 NENGELLPTPNKTIQGQILAKLQKLDLEIKHDQTD--ILSSSFSASMSIALCYINRIKRE 207

Query: 139 GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 198
                +PRIL    SPD   QY+++MN IFS+Q+  +P+DSC L   +S FLQQAS++T 
Sbjct: 208 TPT-IKPRILVFNISPDVSSQYISVMNCIFSSQKQSIPVDSCILSQSDSTFLQQASHLTS 266

Query: 199 GVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC 258
           G++ KPQ+ + L QYLLT F  D  SR  L  P    VD+RASCFCHK  +D+GY+CSVC
Sbjct: 267 GIYLKPQKQELLSQYLLTTFLLDTLSRKSLAYPTLKSVDYRASCFCHKRIVDIGYVCSVC 326

Query: 259 LSIYCKHLKKCSTCGSVFG 277
           LSI+C H   CSTCG+ F 
Sbjct: 327 LSIFCGHSSSCSTCGTKFS 345


>gi|34783458|gb|AAH39726.1| GTF2H3 protein, partial [Homo sapiens]
 gi|34783739|gb|AAH47868.2| GTF2H3 protein, partial [Homo sapiens]
          Length = 303

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 103/285 (36%), Positives = 165/285 (57%), Gaps = 20/285 (7%)

Query: 11  DDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGYN 65
           D+++L+V+++D NP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+   
Sbjct: 1   DELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHIQ 60

Query: 66  SCDYVY--DSSSTGNQSVGNGRMPSLCAT--------LLQNLEEFMNKD--EQLGKQEPE 113
              ++Y   +   G+     G  P    +        LL +  E + ++  + + K + +
Sbjct: 61  ESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSANEVIVEEIKDLMTKSDIK 120

Query: 114 GRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQ 171
           G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  MN IF+AQ
Sbjct: 121 GQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNVIFAAQ 180

Query: 172 RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 231
           +  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ L LP
Sbjct: 181 KQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLILP 239

Query: 232 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 240 PPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 284


>gi|449279312|gb|EMC86947.1| General transcription factor IIH subunit 3, partial [Columba livia]
          Length = 306

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 102/295 (34%), Positives = 163/295 (55%), Gaps = 38/295 (12%)

Query: 11  DDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGYN 65
           D++SL+VV++DTNP +W   +L  ++F     +  V+   N+ L +N+ N++ VIA+   
Sbjct: 2   DELSLLVVVIDTNPIWWGKKALGEAEFTLSKCIDAVMVLGNSHLLMNRNNKLAVIASHTQ 61

Query: 66  SCDYVY------------------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQL 107
              ++Y                  +S+ +G++      + ++   + + +++ M K E  
Sbjct: 62  ESRFLYPGKRWAVADLFGDGGSSMESNCSGSRDGKYELLTAINDAIAEEIKDLMTKTEMK 121

Query: 108 GKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP------QPRILCLQGSPDGPEQYV 161
           G+Q        + L+GSL+ ALC I  +  S    P      + RIL ++ + D   QY+
Sbjct: 122 GQQTE------TQLAGSLAKALCCILLIVESS--KPLSNQEIKSRILVIKAAEDSALQYM 173

Query: 162 AIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTD 221
             MN IF+AQ+  V ID+C L + +S  LQQA  ITGG++ K   +  L QYLL +F  D
Sbjct: 174 NFMNVIFAAQKQSVLIDACVLES-DSGLLQQACDITGGIYLKVLHMPSLLQYLLWVFLPD 232

Query: 222 LHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
              R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     CSTC + F
Sbjct: 233 QEQRSQLVLPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICSTCETAF 287


>gi|332254335|ref|XP_003276282.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
           [Nomascus leucogenys]
          Length = 308

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 100/292 (34%), Positives = 163/292 (55%), Gaps = 32/292 (10%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++D NP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVY------------------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQ 106
               ++Y                  + + +G++      + S    +++ +++ M K + 
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSANEVIVEEIKDLMTKGDI 124

Query: 107 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 164
            G+         +LL+GSL+ ALCYI R+ +        + RIL ++ + D   QY+  M
Sbjct: 125 KGQHTE------TLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 178

Query: 165 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 224
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 179 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 237

Query: 225 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|354491444|ref|XP_003507865.1| PREDICTED: general transcription factor IIH subunit 3-like isoform
           2 [Cricetulus griseus]
 gi|344248744|gb|EGW04848.1| General transcription factor IIH subunit 3 [Cricetulus griseus]
          Length = 313

 Score =  174 bits (440), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 100/295 (33%), Positives = 166/295 (56%), Gaps = 33/295 (11%)

Query: 8   LYSDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIAT 62
           L  D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +++ NQ+ VIA+
Sbjct: 7   LAEDELNLLVIIVDTNPIWWGKQALKGSQFTLSKCMDAVMVLANSHLFMHRSNQLAVIAS 66

Query: 63  GYNSCDYVY-------------------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNK 103
                 ++Y                   D + +G++      + +    + + +++ M K
Sbjct: 67  HIQESRFLYPGKNGRLGDFFGDPGNPFPDCNPSGSKDGKYELLTAANDVIAEEIKDLMTK 126

Query: 104 DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYV 161
            +  G+         +LL+GSL+ ALCYI R+ ++   + +   RIL ++ + D   QY+
Sbjct: 127 SDIKGQHTE------TLLAGSLAKALCYIHRMNKAVKDNQEMKSRILVIKAAEDSALQYM 180

Query: 162 AIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTD 221
             MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D
Sbjct: 181 NFMNVIFAAQKQNILIDACILDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPD 239

Query: 222 LHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
              R+ L LP P+ VD+RA+CFCH++ I++GY+CSVCLSI+C     C+TC + F
Sbjct: 240 QDQRSQLILPPPIHVDYRAACFCHRSLIEIGYVCSVCLSIFCNFSPICTTCETAF 294


>gi|354491442|ref|XP_003507864.1| PREDICTED: general transcription factor IIH subunit 3-like isoform
           1 [Cricetulus griseus]
          Length = 309

 Score =  174 bits (440), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 99/293 (33%), Positives = 165/293 (56%), Gaps = 33/293 (11%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +++ NQ+ VIA+  
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQALKGSQFTLSKCMDAVMVLANSHLFMHRSNQLAVIASHI 64

Query: 65  NSCDYVY-------------------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDE 105
               ++Y                   D + +G++      + +    + + +++ M K +
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPFPDCNPSGSKDGKYELLTAANDVIAEEIKDLMTKSD 124

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAI 163
             G+         +LL+GSL+ ALCYI R+ ++   + +   RIL ++ + D   QY+  
Sbjct: 125 IKGQHTE------TLLAGSLAKALCYIHRMNKAVKDNQEMKSRILVIKAAEDSALQYMNF 178

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D  
Sbjct: 179 MNVIFAAQKQNILIDACILDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQD 237

Query: 224 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            R+ L LP P+ VD+RA+CFCH++ I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 QRSQLILPPPIHVDYRAACFCHRSLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|348528468|ref|XP_003451739.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Oreochromis niloticus]
          Length = 300

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 102/282 (36%), Positives = 156/282 (55%), Gaps = 19/282 (6%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSS-----LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++D NP +W   +      + S+ +  V+   NA + + + N++ VIA+  
Sbjct: 4   EDEINLLVIVVDVNPIWWGQQAQRDPQFTLSKCMDAVMVMGNAHMAMARTNKLAVIASHC 63

Query: 65  NSCDYVY--------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRI 116
               ++Y        D       S G+G+   L A      EE  N    + K E  G  
Sbjct: 64  QGSHFLYPNKSWSGGDGGGNDASSSGDGKYELLSAANNLIAEEIRNI---MSKIEVTGNS 120

Query: 117 ACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 174
             +LL+GSL+ ALCYI R+ +   +    + RIL ++ + D   QY+  MN IF+AQ+  
Sbjct: 121 TDTLLAGSLAKALCYIHRLTKELEVGQEIKSRILVVKAAEDCALQYMNFMNVIFAAQKQN 180

Query: 175 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 234
           + ID+C L + +S  LQQA  ITGG++ K  Q   L QYLL +F  D   R+ L LP P 
Sbjct: 181 ILIDACVLDS-DSGLLQQACDITGGLYLKIPQKVALAQYLLWVFLPDSEQRSQLVLPPPA 239

Query: 235 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 240 HVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 281


>gi|296213239|ref|XP_002753188.1| PREDICTED: general transcription factor IIH subunit 3 [Callithrix
           jacchus]
          Length = 309

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 104/287 (36%), Positives = 163/287 (56%), Gaps = 21/287 (7%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+ +L+V+++D NP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDEWNLLVIVVDANPIWWGKQALKESQFTLSKCMDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVY--DSSSTGNQSVGNGRMPSLCA---------TLLQNLEEFMNKD--EQLGKQE 111
               ++Y   +   G+     G  PS             LL +  E + ++  + + K +
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPPSEFNPSGSKDGKYELLTSANEVIAEEIKDLMTKSD 124

Query: 112 PEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFS 169
            +G+   +LL+GSL+ ALCYI R+ +        + RIL ++ + D   QY+  MN IF+
Sbjct: 125 IKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFMNVIFA 184

Query: 170 AQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQ 229
           AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ L 
Sbjct: 185 AQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWMFLPDQDQRSQLI 243

Query: 230 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 244 LPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|355693898|gb|AER99489.1| proteinral transcription factor IIH, polypeptide 3, 34kDa [Mustela
           putorius furo]
          Length = 286

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 103/289 (35%), Positives = 162/289 (56%), Gaps = 33/289 (11%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIVVDTNPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVYDSSS---------TGNQSV-------GNGRMPSLCA---TLLQNLEEFMNKDE 105
               ++Y   +          GN S         +G+     A    + + +++ M K +
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPSSEFSPSGSKDGKYELFTAANEVIAEEIKDLMTKSD 124

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAI 163
             G+         +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  
Sbjct: 125 IKGQHTE------TLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNF 178

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D  
Sbjct: 179 MNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQD 237

Query: 224 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 272
            R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC
Sbjct: 238 QRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTC 286


>gi|410922922|ref|XP_003974931.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Takifugu rubripes]
          Length = 298

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 103/281 (36%), Positives = 156/281 (55%), Gaps = 19/281 (6%)

Query: 11  DDVSLVVVLLDTNPFFWS-----SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
           D++SL+V+++D NP +W       S  + S+ L  V+   N+ + + + N++ VIA+   
Sbjct: 5   DEISLLVIVVDVNPIWWGQQAQRESEFTLSKCLDSVMVLGNSHMAMARTNKLAVIASHCQ 64

Query: 66  SCDYVYDSS-----STGNQSV---GNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIA 117
              ++Y S+       G ++    G+G+   L        EE  N    + K E  G   
Sbjct: 65  DSHFLYPSNKLQGGENGEENTSCSGDGKYELLSIANNSIAEEMRNL---MLKTEVRGNSN 121

Query: 118 CSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175
            +LL+GSL+ ALCYI RV +   +    + RIL ++ + D   QY+  MN IF+AQ+  +
Sbjct: 122 DTLLAGSLAKALCYINRVSKEMEVGQETKSRILVIKAAEDCALQYMNFMNVIFAAQKQNI 181

Query: 176 PIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVG 235
            ID+C L   +S  LQQA  ITGG++ K  Q   L QYLL +F  D   R+ L LP    
Sbjct: 182 LIDACVLD-MDSGLLQQACDITGGLYLKIPQKAALAQYLLWVFLPDTEQRSQLLLPPRAH 240

Query: 236 VDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 241 VDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 281


>gi|403163144|ref|XP_003323260.2| hypothetical protein PGTG_04797 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375163932|gb|EFP78841.2| hypothetical protein PGTG_04797 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 322

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 100/293 (34%), Positives = 148/293 (50%), Gaps = 34/293 (11%)

Query: 10  SDDVSLVVVLLDTNPFFWS--------SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
           +D   L+ V+LD NP  W+          + S    +   L F N+ L L   N + V  
Sbjct: 6   NDPADLLTVILDLNPLAWADLESGESQEETTSLVSVIESFLIFANSHLALRHENAIAVYG 65

Query: 62  TGYNSCDYVYDS------SSTGNQSVGNGRMPSL---------CATLLQNLEEFMNKDEQ 106
               S + +Y S      ++  +Q  G+    +           A+ ++ L         
Sbjct: 66  ASLGSSELLYSSLQAKPITTKSSQPSGSRDANTYQSFRILNDSIASGVKQLATLTQSSSM 125

Query: 107 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ---PRILCLQGSPDGPEQYVAI 163
            GKQ          +  +L+ ALC+I R+ R      +    RIL L  S D P QY+ +
Sbjct: 126 TGKQPG--------IVKALAKALCHINRLARENENKKESLKARILILSVSSDAPGQYIPM 177

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MN IFSAQ+S +PID C +  +N+ FLQQA+++T G++++ ++   + QYL  IF   L 
Sbjct: 178 MNCIFSAQKSTIPIDVCKISKENAVFLQQAAHLTEGIYYQIEKPKAILQYLTMIFLPGLA 237

Query: 224 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +R FL LPK   VDFRA+CFCH+  ID+GY+CSVCLSI+C     CSTC + F
Sbjct: 238 ARKFLNLPKHQEVDFRAACFCHREIIDVGYVCSVCLSIFCTPTPVCSTCRTKF 290


>gi|307105142|gb|EFN53393.1| hypothetical protein CHLNCDRAFT_13819, partial [Chlorella
           variabilis]
          Length = 135

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 78/134 (58%), Positives = 101/134 (75%), Gaps = 1/134 (0%)

Query: 144 QP-RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHH 202
           QP R+LCL  +PD P QY+++MNAIFSAQRS V ID+C LG ++S FLQQA+Y+TGGV+ 
Sbjct: 2   QPARVLCLTAAPDVPSQYISVMNAIFSAQRSGVLIDACQLGRRHSTFLQQAAYLTGGVYL 61

Query: 203 KPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 262
           KP +   L QYL ++F  D  +R FL++P    VDFRASCFCHK  ID+GY+CS CLSI+
Sbjct: 62  KPSKPVALVQYLNSVFAVDAATRQFLRMPGTAHVDFRASCFCHKRQIDLGYVCSACLSIF 121

Query: 263 CKHLKKCSTCGSVF 276
           C+ L  C+TCG+ F
Sbjct: 122 CEQLPACTTCGTEF 135


>gi|170093117|ref|XP_001877780.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164647639|gb|EDR11883.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 360

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 107/290 (36%), Positives = 160/290 (55%), Gaps = 31/290 (10%)

Query: 12  DVSLVVVLLDTNPFFWSSS-------SLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGY 64
           D S + V+LD +P  W  S        LS + FL+HVLAFLNA L     N + V     
Sbjct: 11  DASHLSVVLDLSPTQWHLSGLLSNKYPLSLNSFLSHVLAFLNAHLASQHENTLAVFGAFP 70

Query: 65  NSCDYVYDSS--STGNQSV-GNGRMP--SLCATLLQNLEEFMNKDEQLGKQEPEGRIACS 119
                +Y S+  +   QS+  N   P   + +T++  + + ++  +Q  ++ P     C+
Sbjct: 71  GKSLMLYSSTLHNPELQSIDSNSYAPFKLVDSTIINRISDELDAIDQSEEEAP-----CA 125

Query: 120 LLSGSLSMALCYIQRV-------------FRSGLLHPQPRILCLQGSPDGPEQYVAIMNA 166
           L+ G+L+ ALCY+ R+              R   + P PRIL L  SPD    Y+ IMN+
Sbjct: 126 LV-GALTKALCYVNRISLPPPSNSSTSPNTRDSTVLPDPRILILSVSPDLSTSYIPIMNS 184

Query: 167 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 226
           IFSAQ+  V ID+C +   ++ FLQQA+++TGG +   ++ D L QYL+  F +    R 
Sbjct: 185 IFSAQKLKVTIDACQVYGPDAVFLQQAAHLTGGSYLFLERRDALLQYLIMSFLSAPSIRQ 244

Query: 227 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            L +P    +DFRA+CFCHKN ID+G++CSVCLSI+C+ +  CSTC + F
Sbjct: 245 VLAVPTQDRIDFRAACFCHKNIIDIGFVCSVCLSIFCQPVPVCSTCRTKF 294


>gi|345564048|gb|EGX47029.1| hypothetical protein AOL_s00097g75 [Arthrobotrys oligospora ATCC
           24927]
          Length = 374

 Score =  171 bits (432), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 113/322 (35%), Positives = 161/322 (50%), Gaps = 64/322 (19%)

Query: 7   KLYSDDVSLVVVLLDTNPFFWS----SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIAT 62
           K  +D  +L V++LDTNP  W+    SSSLSF   L+ +L F NA L  +Q N+  VIA 
Sbjct: 13  KAPADLATLQVIILDTNPLAWATFEKSSSLSFPAALSQLLIFANAHLAFSQSNRCAVIAN 72

Query: 63  GYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQ---------------- 106
             +   ++  +    +      R P       ++L+E  N++                  
Sbjct: 73  HIDGVHFLSPADPGTDH-----RDPGEADLTPEDLKEIANQNASKYRVFRLLEDEVRASY 127

Query: 107 ---LGKQEPEGRIAC---SLLSGSLSMALCYIQRVFRSGL-------------------- 140
              +    PE  +AC   S L+G+LS AL YI R+   G                     
Sbjct: 128 RRLIKNTTPEAVMACPNSSYLAGALSAALTYINRICFGGTGGRSSGVPGDEKSKVNSAGG 187

Query: 141 --------LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQ 192
                   +H   +IL +  +PD   QY+ +MN+IF+AQR  VPID C L   ++ FLQQ
Sbjct: 188 KVDETGRTMHA--KILVVSVTPDPANQYIPVMNSIFAAQRLKVPIDVCKL-RDSTVFLQQ 244

Query: 193 ASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMG 252
           A+  TGGV+ +P+   GL QYL+  F  D  +R  L LP  V VDFRA+CFCHK  +D+G
Sbjct: 245 AADATGGVYMEPEHPQGLIQYLMMGFLPDHLARQSLILPTKVDVDFRAACFCHKKVLDIG 304

Query: 253 YICSVCLSIYCKHLKK--CSTC 272
           ++CSVCLSI+C+  +   CSTC
Sbjct: 305 FVCSVCLSIFCEPPQGAVCSTC 326


>gi|432095027|gb|ELK26416.1| General transcription factor IIH subunit 3, partial [Myotis
           davidii]
          Length = 277

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 103/288 (35%), Positives = 162/288 (56%), Gaps = 41/288 (14%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNRLAVIASHI 64

Query: 65  NSCDYVY---------------DSSSTGNQS-VGNGRMPSLCA---TLLQNLEEFMNKDE 105
               ++Y               + +S  N S   +G+   L A    +++ +++ M K +
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPASEYNPSGSKDGKYELLTAANEVIVEEIKDLMTKSD 124

Query: 106 QLGKQ-EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIM 164
             G+  EP       LL+GSL+ ALCY Q +        + RIL ++ + D   QY+  M
Sbjct: 125 IKGQHTEP-------LLAGSLAKALCYNQEM--------KSRILVIKAAEDSALQYMNFM 169

Query: 165 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 224
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 170 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 228

Query: 225 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 272
           R+ L LP P+ VD+RA+C CH+N I++GY+CSVCLSI+C     C+TC
Sbjct: 229 RSQLTLPPPIHVDYRAACLCHRNLIEIGYVCSVCLSIFCNFSPICTTC 276


>gi|47226419|emb|CAG08435.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 299

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 101/286 (35%), Positives = 158/286 (55%), Gaps = 26/286 (9%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSS-----LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGY 64
            D++SL+V+++D NP +W   +      + S+ L  ++   N+ + + + N++ VIA+  
Sbjct: 4   EDEISLLVIVVDVNPIWWGQQAQRQPEFTLSKCLDAMMVLGNSHMAMARTNKLAVIASHC 63

Query: 65  NSCDYVYDSSS----TGN-----QSVGNGRMPSLCAT---LLQNLEEFMNKDEQLGKQEP 112
               ++Y S+      GN      S G+G+   L      +++ +   M K E  G    
Sbjct: 64  QDSHFLYPSNKLQVGDGNGDENTSSSGDGKYELLSVANNLIVEEIRSVMLKTEVRGNSND 123

Query: 113 EGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIMNAIFSA 170
                 +LL+GSL+ ALCYI RV +   +    + RIL ++ + D   QY+  MN IF+A
Sbjct: 124 ------TLLAGSLAKALCYINRVSKELEVGQETKSRILVIKAAEDSALQYMNFMNVIFAA 177

Query: 171 QRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQL 230
           Q+  + ID+C L + +S  LQQA  ITGG++ K  Q   L QYLL +F  D   R+ L L
Sbjct: 178 QKQNILIDACVLDS-DSGLLQQACDITGGLYLKIPQKLALAQYLLWVFLPDTEQRSQLLL 236

Query: 231 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           P    VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 237 PPRAHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 282


>gi|307214269|gb|EFN89365.1| General transcription factor IIH subunit 3 [Harpegnathos saltator]
          Length = 292

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 108/277 (38%), Positives = 153/277 (55%), Gaps = 23/277 (8%)

Query: 12  DVSLVVVLLDTNPF--FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
           + SL+V++LD NP        +   SQ L   + F NA L  +  N++ V+A   +S  +
Sbjct: 6   NTSLLVIVLDVNPVQRIVKQETKILSQCLDSAIVFANAHLMQSSNNELAVMACHGHSAKF 65

Query: 70  VYDSSSTGNQSVGNG---RMPSLCATLLQNLEEFMNK---DEQLGKQEPEGRIACSLLSG 123
           +Y   +       +G   R   +  T+   L++ +N+   D  L  +        SL+SG
Sbjct: 66  LYPCENAAEIRQIDGQYERFTMVERTVRLQLQKVINELSTDTPLNTE--------SLISG 117

Query: 124 SLSMALCYIQRVFRSGL----LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 179
           +LSMALCYI R+ R  +    LHP  RIL +  S D   QY+  MN  F+AQ+  V +D 
Sbjct: 118 ALSMALCYIARLERDKVAGQKLHP--RILVITASNDSATQYMNYMNIFFTAQKMNVILDV 175

Query: 180 CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFR 239
           C L  Q    LQQ   ITGG + K  QL GL QYLL +F  D + R+ L LP PV VD+R
Sbjct: 176 CSLD-QELTLLQQGCDITGGNYLKVPQLAGLLQYLLWVFLPDPNVRSKLVLPPPVKVDYR 234

Query: 240 ASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           A+CFCH+  ID+GY+CS+CLS++CK    C+TC +VF
Sbjct: 235 AACFCHQELIDIGYVCSICLSVFCKFSPICTTCHTVF 271


>gi|442749163|gb|JAA66741.1| Putative rna polymer [Ixodes ricinus]
          Length = 293

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 105/276 (38%), Positives = 152/276 (55%), Gaps = 18/276 (6%)

Query: 11  DDVSLVVVLLDTNPF--FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           D+ SL+V ++DTNP      S     S+ L  +  F N+ L LN  N++ +IA+  +   
Sbjct: 5   DEGSLLVAIIDTNPCASLLESDQGIVSKLLDALTVFCNSHLMLNPCNKLAIIASHSHRST 64

Query: 69  YVY---DSSSTGNQSVGNGR---MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLS 122
           ++Y     SS+   SV +G+      +   +   ++E +  D+       E     SLL+
Sbjct: 65  FIYPKPQESSSDTYSV-DGQYELFTEVTGAIKDGVKELVLSDDS------ESAAGESLLT 117

Query: 123 GSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180
           G LS+ALCYI R+ +      +   RIL L  S +   QY+  MN  F+AQ+  V ID+C
Sbjct: 118 GGLSLALCYINRIEKESSSQNKIASRILVLSASGESASQYLNFMNVFFTAQKKNVIIDAC 177

Query: 181 YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 240
            L  ++S  LQQ   ITGG + K     GL QYLL +F  D  SRN++  P PV VD+RA
Sbjct: 178 VL-EKDSGLLQQGCDITGGKYMKVPNHAGLLQYLLWVFLPDKTSRNYMVFPPPVHVDYRA 236

Query: 241 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +CFCH+N I++GY+CSVCLSI+C     CSTC + F
Sbjct: 237 ACFCHRNLIEIGYVCSVCLSIFCAFSPICSTCQTAF 272


>gi|158296528|ref|XP_316918.4| AGAP008522-PA [Anopheles gambiae str. PEST]
 gi|157014753|gb|EAA12181.5| AGAP008522-PA [Anopheles gambiae str. PEST]
          Length = 288

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 106/275 (38%), Positives = 152/275 (55%), Gaps = 15/275 (5%)

Query: 10  SDDVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           + D SL+V++LDTNP          + +Q L  ++AF NA L     N++ V+A  +++ 
Sbjct: 4   NKDASLLVIVLDTNPSQRIIREKPHNLTQCLDSIVAFANAHLMQKAQNKLAVLACHHHAT 63

Query: 68  DYVYDSSSTG---NQSVGNGRMPSLC-ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSG 123
            ++Y +        Q  G   + +L   T+ Q L   +N    L           SLL+G
Sbjct: 64  QFLYPTPGKPLDIRQVDGQYEVFTLVEKTIKQKLAHMINTAPPLTTPTE------SLLAG 117

Query: 124 SLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCY 181
           S+SMALCYI R+ R+     +   R+L + GS +   QY+  MN  F+AQ+  V +D C 
Sbjct: 118 SMSMALCYIARINRNKAPGSKINSRVLVVTGSNECASQYMTYMNVFFTAQKQGVVVDVCA 177

Query: 182 LGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRAS 241
           L    S  LQQ   ITGG + + +QL+G  QYLL +F  D   R  L LP PV VD+RA+
Sbjct: 178 LDKALS-LLQQGCDITGGQYLRLEQLEGFLQYLLWVFLPDPQMRCKLVLPPPVKVDYRAA 236

Query: 242 CFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           CFCH+  ID+GY+CSVCLSI+CK    C+TC +VF
Sbjct: 237 CFCHRELIDIGYVCSVCLSIFCKFSPICTTCHTVF 271


>gi|157132766|ref|XP_001662634.1| transcription factor TFIIH-subunit, putative [Aedes aegypti]
 gi|108871079|gb|EAT35304.1| AAEL012523-PA [Aedes aegypti]
          Length = 288

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 106/275 (38%), Positives = 155/275 (56%), Gaps = 15/275 (5%)

Query: 10  SDDVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           + D SL+V+++DTNP       +    +Q L  ++AF NA L     N++ V+A  +++ 
Sbjct: 4   NKDASLLVIVMDTNPSQRIIRENPHHLTQCLDSIVAFANAHLMQKAQNKLAVLACHHHAT 63

Query: 68  DYVYDSSSTG---NQSVGNGRMPSLC-ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSG 123
           +++Y +        Q  G     +L   T+ Q L + +++  +L           SLL+G
Sbjct: 64  EFLYPTPGKPLDIRQVDGQYEAFTLVEKTIKQKLAKVISEAPRLTVPTE------SLLAG 117

Query: 124 SLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCY 181
           S++MALCYI RV R+    +    RIL + GS +   QY+  MN  F+AQ+  V +D C 
Sbjct: 118 SMAMALCYIARVHRNKPAGVKINSRILVVTGSNECASQYMTYMNVFFTAQKQNVSLDVCA 177

Query: 182 LGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRAS 241
           L    S  LQQ   ITGG + K  QLDG  QYLL +F  +  +R  L LP PV VD+RA+
Sbjct: 178 LDKPLS-LLQQGCDITGGQYLKLPQLDGFLQYLLWVFLPEPLTRCKLVLPPPVKVDYRAA 236

Query: 242 CFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           CFCH+  ID+GY+CSVCLSI+CK    C+TC +VF
Sbjct: 237 CFCHRELIDIGYVCSVCLSIFCKFSPICTTCHTVF 271


>gi|189189624|ref|XP_001931151.1| RNA polymerase II transcription factor B subunit 4 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
 gi|187972757|gb|EDU40256.1| RNA polymerase II transcription factor B subunit 4 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
          Length = 370

 Score =  167 bits (424), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 107/330 (32%), Positives = 166/330 (50%), Gaps = 60/330 (18%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+  SS+LS S  L ++L F+NA L     N+V VIA+  +   ++Y
Sbjct: 20  SLLAIVLDTNPHAWAHLSSTLSLSAALANILVFINAHLASGNANEVAVIASHSHKTAFLY 79

Query: 72  DSSST-------GNQSVGNGRMPSLC-------------ATLLQNLEEFMNKDEQLGKQE 111
            + S+       G     N ++  +              + +L+N  + +N+ ++   + 
Sbjct: 80  PTPSSPQPQSRHGTNGATNDKVHDMAESANKYRPFAVVESAILRNFVKLLNETKESHLEA 139

Query: 112 PEGRIACSLLSGSLSMALCYIQRVFRSGLLHP---------------------------Q 144
                  +L+ G+LS+AL +I +     +LH                             
Sbjct: 140 ----TPTTLIGGALSLALTHINK---QTILHAPTAASAESTALSALADSENAHVDRVPLT 192

Query: 145 PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 204
            RIL +  S D   QY+ +MN+IF+AQR  +PID   L A ++  LQQAS  TGGV+ KP
Sbjct: 193 SRILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL-AGDTVLLQQASDATGGVYMKP 251

Query: 205 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 264
            + +GL QYL+  F  D+ +R  L +P   GVDFRA+CFCH+  +D+G++CSVCLSI+C 
Sbjct: 252 DRPEGLLQYLMMAFLPDVTARTSLVVPSAGGVDFRAACFCHRKVVDIGFVCSVCLSIFCN 311

Query: 265 ---HLKKCSTCGSVFGQAQTQSDEPSATNR 291
                  C TCGS        ++ P+   R
Sbjct: 312 PDLSDNLCMTCGSYLSLRSAVTNPPALIPR 341


>gi|194853925|ref|XP_001968252.1| GG24611 [Drosophila erecta]
 gi|190660119|gb|EDV57311.1| GG24611 [Drosophila erecta]
          Length = 299

 Score =  167 bits (422), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 109/284 (38%), Positives = 155/284 (54%), Gaps = 21/284 (7%)

Query: 4   APSKLYSDDVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
           A SK     + L+V++LDTNP       +  + +Q L  V+AF NA L     N++ V++
Sbjct: 6   AASKEAESCIDLLVIVLDTNPSQHIVRQNPQNLTQILEAVIAFGNAHLMQKAQNKLAVVS 65

Query: 62  TGYNSCDYVY---DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGK---QEPEGR 115
             +++ +++Y          Q  G     SL    ++         +QLG      P   
Sbjct: 66  CSHHATNFLYPLPRRQVELRQVDGQYEAFSLVEKTVK---------QQLGSILMNAPRLS 116

Query: 116 IAC-SLLSGSLSMALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 172
             C SLL+GS+SMALCYI R+ R+    +    RIL L GS +   QY+  MN  F+AQ+
Sbjct: 117 APCESLLAGSMSMALCYISRLQRNVAPGVKMHSRILVLTGSNECASQYMTFMNVFFTAQK 176

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
             + ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F    H R+ L LP 
Sbjct: 177 LGITIDTCALN-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPHIRHKLVLPP 235

Query: 233 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           P  VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 236 PPKVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|389741855|gb|EIM83043.1| Tfb4-domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 400

 Score =  166 bits (421), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 102/296 (34%), Positives = 150/296 (50%), Gaps = 40/296 (13%)

Query: 11  DDVSLVVVLLDTNPFFW-------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           D  S + +++D +P  W       ++  L    FL+HVL F+N+ +     N + V    
Sbjct: 2   DKASHLSLIIDLSPSQWLLSADPSNAHPLPLHVFLSHVLTFINSHIASKHENSLAVFGAL 61

Query: 64  -------YNSCDYVYDSSSTGNQSVGNGRMP------SLCATLLQNLE--EFMNKDEQLG 108
                  Y+S D + D   T ++S  N   P      ++   + + LE  E  NK E+  
Sbjct: 62  PGKSVMLYSSSDSIPDQDQTADESSANSYRPFRLVDSAIVRRITEELELLEAQNKPEE-- 119

Query: 109 KQEPEGRIACSLLSGSLSMALCYIQRVF--------RSGLLHPQPRILCLQGSPDGPEQY 160
             EP        L G+L+ ALCYI R+              H  PRIL L  SPD    Y
Sbjct: 120 --EPVA------LVGALTKALCYINRITHVPTANAPEDPAAHVDPRILILSVSPDLSTAY 171

Query: 161 VAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGT 220
           + +MN+IFSAQ+  V ID C +   ++ FLQQ +++T G +   ++ D L QYL+  F  
Sbjct: 172 IPVMNSIFSAQKLKVTIDVCKVYGPDTVFLQQTAHLTAGSYIYLERRDALLQYLIMSFLP 231

Query: 221 DLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
               R  L +PK   +DFRA+CFCHKN +D+G++CSVCLSI+C  +  CSTC + F
Sbjct: 232 PPALRQILAVPKQDKIDFRAACFCHKNIVDIGFVCSVCLSIFCAPVPVCSTCRTKF 287


>gi|195575703|ref|XP_002077716.1| GD22925 [Drosophila simulans]
 gi|194189725|gb|EDX03301.1| GD22925 [Drosophila simulans]
          Length = 299

 Score =  166 bits (421), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 112/281 (39%), Positives = 157/281 (55%), Gaps = 19/281 (6%)

Query: 6   SKLYSDDVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           SK     + L+V++LDTNP   F   +  + +Q L  V+AF NA L     N++ V++  
Sbjct: 8   SKEAESCIDLLVIVLDTNPSQHFVRQNPQNLTQILEAVIAFGNAHLMQKAQNKLAVVSCS 67

Query: 64  YNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGK---QEPEGRIAC-S 119
           +++ +++Y       + V   +M         NL E   K +QLG      P     C S
Sbjct: 68  HHATNFLY---PLPRRQVELRQMDGQYEAF--NLVEKTVK-QQLGSILMNAPRLSAPCES 121

Query: 120 LLSGSLSMALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175
           LL+GS+SMALCYI R+ R+  L P      RIL L GS +   QY+  MN  F+AQ+  +
Sbjct: 122 LLAGSMSMALCYISRLQRN--LAPGVKMHSRILVLTGSNECASQYMTFMNVFFTAQKLGI 179

Query: 176 PIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVG 235
            ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP P  
Sbjct: 180 TIDTCALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPPPPK 238

Query: 236 VDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 239 VDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|24580793|ref|NP_608574.2| Tfb4 [Drosophila melanogaster]
 gi|22945518|gb|AAF51411.2| Tfb4 [Drosophila melanogaster]
 gi|94400646|gb|ABF17925.1| FI01003p [Drosophila melanogaster]
          Length = 299

 Score =  166 bits (421), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 111/284 (39%), Positives = 159/284 (55%), Gaps = 21/284 (7%)

Query: 4   APSKLYSDDVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
           A SK     + L+V++LDTNP   F   +  + +Q L  V+AF NA L     N++ V++
Sbjct: 6   AASKEAESCIDLLVIVLDTNPSQHFVRQNPQNLTQILEAVIAFGNAHLMQKAQNKLAVVS 65

Query: 62  TGYNSCDYVYDSSSTGNQ-SVGNGRMPSLCATLLQNLEEFMNKDEQLGK---QEPEGRIA 117
             +++ +++Y       +    +G+  +       NL E   K +QLG      P     
Sbjct: 66  CSHHATNFLYPLPRRQVELRQVDGQYEAF------NLVEKTVK-QQLGSILMNAPRLSAP 118

Query: 118 C-SLLSGSLSMALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMNAIFSAQR 172
           C SLL+GS+SMALCYI R+ R+  L P      RIL + GS +   QY+  MN  F+AQ+
Sbjct: 119 CESLLAGSMSMALCYISRLQRN--LAPGVKMHSRILVVTGSNECASQYMTFMNVFFTAQK 176

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
             + ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP 
Sbjct: 177 LGITIDTCALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPP 235

Query: 233 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           P  VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 236 PPKVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|451848115|gb|EMD61421.1| hypothetical protein COCSADRAFT_173760 [Cochliobolus sativus
           ND90Pr]
          Length = 377

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 110/337 (32%), Positives = 167/337 (49%), Gaps = 67/337 (19%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+  SS+LS S  L ++L F+NA L     N+V VIA+  +   ++Y
Sbjct: 20  SLLAIVLDTNPHAWAHLSSTLSLSTALANILVFINAHLASGNTNEVAVIASHSHKTTFLY 79

Query: 72  DSS----STGNQSVGNGRM-----------------------PSLCATLLQNLEEFMNKD 104
            S     +     + NG +                         + +T+LQN  + +N+ 
Sbjct: 80  PSPVAPQAQSRNGIANGDVEMNGIGDDKGQHTAESANKYRPFAVVESTILQNFVKLLNET 139

Query: 105 EQLGKQEPEGRIACSLLSGSLSMALCYIQR---------------------------VFR 137
           ++   +        +L+ G+LS+AL +I +                           V R
Sbjct: 140 KESHLEA----TPTTLIGGALSLALTHINKQTILHAPTAASAESASLAALADSENTHVDR 195

Query: 138 SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 197
             L     RIL +  S D   QY+ +MN+IF+AQR  +PID   L A ++  LQQAS  T
Sbjct: 196 VSLTS---RILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL-AGDTVLLQQASDAT 251

Query: 198 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 257
           GGV+ KP++ +GL QYL+  F  D+ +R  L +P   GVDFRA+CFCH+  +D+G++CSV
Sbjct: 252 GGVYMKPERPEGLLQYLMMAFLPDVTARASLVVPSAGGVDFRAACFCHRKVVDIGFVCSV 311

Query: 258 CLSIYCK---HLKKCSTCGSVFGQAQTQSDEPSATNR 291
           CLSI+C        C TCGS        ++ P+   R
Sbjct: 312 CLSIFCSPDLPDNLCLTCGSYLSLRSAMTNPPALIPR 348


>gi|350407575|ref|XP_003488130.1| PREDICTED: general transcription factor IIH subunit 3-like [Bombus
           impatiens]
          Length = 292

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 104/275 (37%), Positives = 151/275 (54%), Gaps = 19/275 (6%)

Query: 12  DVSLVVVLLDTNPF--FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
           + SL++V+LD NP        +   +Q L   + F NA L  +  NQ+ VIA   +   +
Sbjct: 6   ETSLLIVVLDVNPVQRIIKQETRILTQCLDSTVVFANAHLMQSSNNQLAVIACHSHGAKF 65

Query: 70  VYDSSSTGNQSVGNGRMPSLCA---TLLQNLEEFMNK---DEQLGKQEPEGRIACSLLSG 123
           +Y           +G+         T+ Q L++ +N+   D+ L  +        SL+SG
Sbjct: 66  LYPCEKPLEIRQIDGQYEKFTMIERTIRQQLQQVINEISMDKPLNGE--------SLISG 117

Query: 124 SLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCY 181
           +L+MALCY+ R+ R  +   +   RIL +  S D   QY+  MN  F+AQ+  + +D C 
Sbjct: 118 ALTMALCYVARLEREKVASEKIYSRILVITASNDSATQYMNYMNIFFTAQKMGIILDVCS 177

Query: 182 LGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRAS 241
           L  Q    LQQ   ITGG + K  QL+GL QYLL IF  D + R+ L LP PV VD+RA+
Sbjct: 178 LD-QELTLLQQGCDITGGNYLKVPQLNGLLQYLLWIFLPDPNVRSKLVLPPPVKVDYRAA 236

Query: 242 CFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           CFCH+  ID+GY+CS+CLSI+CK    C+TC +VF
Sbjct: 237 CFCHQELIDIGYVCSICLSIFCKFSPICTTCHTVF 271


>gi|451999247|gb|EMD91710.1| hypothetical protein COCHEDRAFT_1136566 [Cochliobolus
           heterostrophus C5]
          Length = 377

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 113/337 (33%), Positives = 168/337 (49%), Gaps = 67/337 (19%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+  SS+LS S  L ++L F+NA L     N+V VIA+  +   ++Y
Sbjct: 20  SLLAIVLDTNPHAWAHLSSTLSLSTALANILVFINAHLASGNTNEVAVIASHSHKTTFLY 79

Query: 72  DS---------SSTGNQSV----------------GNGRMP--SLCATLLQNLEEFMNKD 104
            S         + T N  V                 N   P   + +T+LQN  + +N+ 
Sbjct: 80  PSPVAPQTQARNGTANGDVEMNGTGGDKGQHTAESANKYRPFAVVESTILQNFVKLLNET 139

Query: 105 EQLGKQEPEGRIACSLLSGSLSMALCYIQR---------------------------VFR 137
           ++   +        +L+ G+LS+AL +I +                           V R
Sbjct: 140 KESHLEA----TPTTLIGGALSLALTHINKQTILHAPTAASAESASLAALADSENTHVDR 195

Query: 138 SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 197
             L     RIL +  S D   QY+ +MN+IF+AQR  +PID   L A ++  LQQAS  T
Sbjct: 196 VSLTS---RILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL-AGDTVLLQQASDAT 251

Query: 198 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 257
           GGV+ KP++ +GL QYL+  F  D+ +R  L +P   GVDFRA+CFCH+  +D+G++CSV
Sbjct: 252 GGVYMKPERPEGLLQYLMMAFLPDVTARASLVVPSAGGVDFRAACFCHRKVVDIGFVCSV 311

Query: 258 CLSIYCK---HLKKCSTCGSVFGQAQTQSDEPSATNR 291
           CLSI+C        C TCGS        ++ P+   R
Sbjct: 312 CLSIFCSPDLPGNLCLTCGSYLSLRSAMTNPPALIPR 348


>gi|307175391|gb|EFN65408.1| General transcription factor IIH subunit 3 [Camponotus floridanus]
          Length = 292

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 103/272 (37%), Positives = 150/272 (55%), Gaps = 13/272 (4%)

Query: 12  DVSLVVVLLDTNPF--FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
           D SL+V++LD NP        +   SQ L   + F NA L  +  N++ ++A   +S  +
Sbjct: 6   DTSLLVIVLDINPVQRIVKQETKILSQCLDSTVVFANAHLMQSSNNELAIMACHGHSAKF 65

Query: 70  VYDSSSTGNQSVGNGRMPSLCA---TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLS 126
           +Y   +       +G+         T+ Q L++ +N+         E     SL+SG+LS
Sbjct: 66  LYPCENMAEIRQIDGQYEKFTMVERTVRQQLQQVINEISMNTPLNTE-----SLISGALS 120

Query: 127 MALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA 184
           MALCYI R+ R    + +   R+L +  S D   QY+  MN  F+AQR  V +D C L  
Sbjct: 121 MALCYIARLEREKFANQKLHSRMLVITASNDSAMQYMNYMNIFFTAQRMNVILDVCSLD- 179

Query: 185 QNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFC 244
           Q    LQQ   ITGG + +  QL GL QYLL +F  D   R+ L LP PV V++RA+CFC
Sbjct: 180 QELTLLQQGCEITGGNYLQVPQLAGLLQYLLWVFLPDPSVRSKLVLPPPVKVNYRAACFC 239

Query: 245 HKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           H++ ID+GY+CS+CLSI+CK+   C+TC +VF
Sbjct: 240 HQDLIDIGYVCSICLSIFCKYSPICTTCHTVF 271


>gi|170028289|ref|XP_001842028.1| TFIIH basal transcription factor complex p34 subunit [Culex
           quinquefasciatus]
 gi|167874183|gb|EDS37566.1| TFIIH basal transcription factor complex p34 subunit [Culex
           quinquefasciatus]
          Length = 292

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 108/279 (38%), Positives = 155/279 (55%), Gaps = 19/279 (6%)

Query: 10  SDDVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           ++D SL+V++LDTNP       +    +Q L  ++ F NA L     N++ V+A  +++ 
Sbjct: 4   NNDTSLLVIVLDTNPSQRIIRENPHHLTQCLDSIVGFANAHLMQKAHNKLAVLACHHHAT 63

Query: 68  DYVYDSSSTG---NQSVGNGRMPSLC-ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSG 123
           +++Y +        Q  G     +L   T+ Q L   +N+  +L           SLL+G
Sbjct: 64  EFLYPNPGKPLDIRQVDGQYEQFTLVEKTIKQKLARMINEAPRLAAPTE------SLLAG 117

Query: 124 SLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCY 181
           S++MALCYI R+ R+    +    RIL + GS +   QY+  MN  F+AQ+  V +D C 
Sbjct: 118 SMAMALCYIARINRNKPAGVKINSRILVVTGSNECASQYMTYMNVFFTAQKQNVTLDVCA 177

Query: 182 LGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTD--LHS--RNFLQLPKPVGVD 237
           L    S  LQQ   ITGG + K  QLDG  QYLL +F  D  +H   R  L LP PV VD
Sbjct: 178 LDKPLS-LLQQGCDITGGQYLKLPQLDGFLQYLLWVFLPDPQMHPQMRCKLVLPPPVKVD 236

Query: 238 FRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +RA+CFCH+  ID+GY+CSVCLSI+CK    C+TC +VF
Sbjct: 237 YRAACFCHRELIDIGYVCSVCLSIFCKFSPICTTCHTVF 275


>gi|195350283|ref|XP_002041670.1| GM16625 [Drosophila sechellia]
 gi|194123443|gb|EDW45486.1| GM16625 [Drosophila sechellia]
          Length = 299

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 110/274 (40%), Positives = 155/274 (56%), Gaps = 19/274 (6%)

Query: 13  VSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYV 70
           + L+V++LDTNP   F   +  + +Q L  V+AF NA L     N++ V++  +++ +++
Sbjct: 15  IDLLVIVLDTNPSQHFVRQNPQNLTQILEAVIAFGNAHLMQKAQNKLAVVSCSHHATNFL 74

Query: 71  YDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGK---QEPEGRIAC-SLLSGSLS 126
           Y       + V   +M         NL E   K +QLG      P     C SLL+GS+S
Sbjct: 75  Y---PLPRRQVELRQMDGQYEAF--NLVEKTVK-QQLGSILMNAPRLSAPCESLLAGSMS 128

Query: 127 MALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL 182
           MALCYI R+ R+  L P      RIL L GS +   QY+  MN  F+AQ+  + ID+C L
Sbjct: 129 MALCYISRLQRN--LAPGVKMHSRILVLTGSNECASQYMTFMNVFFTAQKLGITIDTCAL 186

Query: 183 GAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASC 242
             +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP P  VD+RASC
Sbjct: 187 D-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPPPPKVDYRASC 245

Query: 243 FCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           FCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 246 FCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|322795304|gb|EFZ18109.1| hypothetical protein SINV_08150 [Solenopsis invicta]
          Length = 322

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 108/284 (38%), Positives = 152/284 (53%), Gaps = 18/284 (6%)

Query: 12  DVSLVVVLLDTNPF--FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
           + SL+V++ D NP        +   SQ L   + F NA L  +  N++ ++A   +S  +
Sbjct: 4   ETSLLVIVFDVNPMQRIVKQETKILSQCLDSTIVFANAHLMQSSNNELAIMACHSHSARF 63

Query: 70  VYDSSSTGNQSVGNGRMPSLCA---TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLS 126
           +Y   +       +G+         T+ Q L++F+N+         E     SL+SG+LS
Sbjct: 64  LYPCENAVEIRQMDGQYEKFTMVERTVRQQLQQFINEIPMDAPLNTE-----SLISGALS 118

Query: 127 MALCYIQRVFRSGL----LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL 182
           MALCYI R+ R  +    LHP  RIL +  S D   QY+  MN  F+AQR  V +D C L
Sbjct: 119 MALCYIARLEREKVAGQKLHP--RILVITASNDSATQYMNYMNIFFTAQRMNVILDVCSL 176

Query: 183 GAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASC 242
             Q    LQQ   I+GG + K  QL GL QYLL +F  D   R  L LP PV VD+RA+C
Sbjct: 177 D-QELTLLQQGCDISGGNYLKVPQLAGLLQYLLWVFLPDPSVRLKLVLPPPVKVDYRAAC 235

Query: 243 FCHKNTIDMGYICSVCLSIYCKHLKKCSTCG-SVFGQAQTQSDE 285
           FCH+  ID+GY+CS+CLSI+CK    C+TC       A+T +D+
Sbjct: 236 FCHQELIDIGYVCSICLSIFCKFSPICTTCQYHCIQNARTYADK 279


>gi|330933521|ref|XP_003304199.1| hypothetical protein PTT_16692 [Pyrenophora teres f. teres 0-1]
 gi|311319344|gb|EFQ87711.1| hypothetical protein PTT_16692 [Pyrenophora teres f. teres 0-1]
          Length = 376

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 108/336 (32%), Positives = 165/336 (49%), Gaps = 66/336 (19%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+  SS+LS S  L ++L F+NA L     N+V VIA+  +   ++Y
Sbjct: 20  SLLAIVLDTNPHAWAHLSSTLSLSAALANILVFINAHLASGNANEVAVIASHSHKTTFLY 79

Query: 72  DS-------------SSTGNQSVGNGRMPSLC-------------ATLLQNLEEFMNKDE 105
            +              STG     N +  ++              + +L+N  + +N+ +
Sbjct: 80  PTPLSPQPQSRHGTNDSTGTNGATNDKEHAMAESANKYRPFAVVESAILRNFVKLLNETK 139

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP---------------------- 143
           +   +        +L+ G+LS+AL +I +     +LH                       
Sbjct: 140 ESHLEA----TPTTLIGGALSLALTHINK---QTILHAPTAASAESTALSALADSENTHV 192

Query: 144 -----QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 198
                  RIL +  S D   QY+ +MN+IF+AQR  +PID   L A ++  LQQAS  TG
Sbjct: 193 DRVPLTSRILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL-AGDTVLLQQASDATG 251

Query: 199 GVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC 258
           GV+ KP + +GL QYL+  F  D  +R  L +P   GVDFRA+CFCH+  +D+G++CSVC
Sbjct: 252 GVYMKPDRPEGLLQYLMMAFLPDATARTSLVVPSAGGVDFRAACFCHRKVVDIGFVCSVC 311

Query: 259 LSIYCK---HLKKCSTCGSVFGQAQTQSDEPSATNR 291
           LSI+C        C TCGS        ++ P+   R
Sbjct: 312 LSIFCNPDLSDNLCMTCGSYLSLRSAVTNPPALIPR 347


>gi|340717520|ref|XP_003397229.1| PREDICTED: general transcription factor IIH subunit 3-like [Bombus
           terrestris]
          Length = 292

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 103/275 (37%), Positives = 151/275 (54%), Gaps = 19/275 (6%)

Query: 12  DVSLVVVLLDTNPF--FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
           + SL++++LD NP        +   +Q L   + F NA L  +  NQ+ VIA   +   +
Sbjct: 6   ETSLLIIVLDVNPVQRIIKQETRILTQCLDSTVVFANAHLMQSSNNQLAVIACHSHGAKF 65

Query: 70  VYDSSSTGNQSVGNGRMPSLCA---TLLQNLEEFMNK---DEQLGKQEPEGRIACSLLSG 123
           +Y           +G+         T+ Q L++ +N+   D+ L  +        SL+SG
Sbjct: 66  LYPCEKPLEIRQIDGQYEKFTMIERTIRQQLQQVINEISMDKPLNGE--------SLISG 117

Query: 124 SLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCY 181
           +L+MALCY+ R+ R  +   +   RIL +  S D   QY+  MN  F+AQ+  + +D C 
Sbjct: 118 ALTMALCYVARLEREKVASEKIYSRILVITASNDSATQYMNYMNIFFTAQKMGIILDVCS 177

Query: 182 LGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRAS 241
           L  Q    LQQ   ITGG + K  QL+GL QYLL IF  D + R+ L LP PV VD+RA+
Sbjct: 178 LD-QELTLLQQGCDITGGNYLKVPQLNGLLQYLLWIFLPDPNVRSKLVLPPPVKVDYRAA 236

Query: 242 CFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           CFCH+  ID+GY+CS+CLSI+CK    C+TC +VF
Sbjct: 237 CFCHQELIDIGYVCSICLSIFCKFSPICTTCHTVF 271


>gi|213409415|ref|XP_002175478.1| RNA polymerase II transcription factor B subunit 4
           [Schizosaccharomyces japonicus yFS275]
 gi|212003525|gb|EEB09185.1| RNA polymerase II transcription factor B subunit 4
           [Schizosaccharomyces japonicus yFS275]
          Length = 295

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 98/271 (36%), Positives = 149/271 (54%), Gaps = 10/271 (3%)

Query: 11  DDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           D  SL+VV+LD NP  W S       S+ L  ++ F+NA L  +  N+V VIA+  +S +
Sbjct: 21  DTPSLLVVILDANPVSWYSLAQKAPISEILADIIVFINAHLAFHHENRVAVIASHTDSVE 80

Query: 69  YVYDSSSTGNQSVGNGRMPSLCATLLQNLEEF--MNKDEQLGKQEPEGRIAC-SLLSGSL 125
           ++Y ++    ++      P   A   +   E   M  +      E    I+  +++SG+L
Sbjct: 81  FLYPTTKDNRKTKQVD--PEKDANTYRKFREVDDMVLEGMARLVESTNSISTKTMMSGAL 138

Query: 126 SMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ 185
           S AL YI R+     L  + RIL    S D   QY++ MN IF AQ+  +PI+ C L ++
Sbjct: 139 SRALAYINRIQTESPL--RSRILIFSLSGDVALQYISTMNCIFCAQKKNIPINVCSL-SK 195

Query: 186 NSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCH 245
            + FL+QA   TGG++ K ++  GL Q+L+     D + R  + +P    VDFRA+CFCH
Sbjct: 196 ETLFLEQAVDATGGIYIKVEEPKGLLQHLMMSLFPDQNLRKHINIPNQANVDFRATCFCH 255

Query: 246 KNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           K  +D+GY+CSVCLSI+C    +C+TC S F
Sbjct: 256 KKILDIGYVCSVCLSIFCSPRDQCTTCHSTF 286


>gi|332019667|gb|EGI60141.1| General transcription factor IIH subunit 3 [Acromyrmex echinatior]
          Length = 292

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 106/277 (38%), Positives = 151/277 (54%), Gaps = 23/277 (8%)

Query: 12  DVSLVVVLLDTNPF--FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
           + SL+V++LD NP        +   SQ L   + F NA L  +  N++ +++   +   +
Sbjct: 6   ETSLLVIVLDVNPMQRIVKQETKILSQCLDSTIVFANAHLMQSSNNELAIMSCHGHGARF 65

Query: 70  VYDSSSTGNQSVGNGRMPSLCA---TLLQNLEEFMNK---DEQLGKQEPEGRIACSLLSG 123
           +Y   +       +G+         T+ Q L++ +N+   D  L  +        SL+SG
Sbjct: 66  LYPCENAIEIRQMDGQYEKFTMVERTVRQQLQQVINEISMDVPLNME--------SLISG 117

Query: 124 SLSMALCYIQRVFRSGL----LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 179
           +LSMALCYI R+ R  +    LHP  R+L +  S D   QY+  MN  F+AQR  V +D 
Sbjct: 118 ALSMALCYIARLEREKVAGQKLHP--RMLVITASNDSATQYMNYMNIFFTAQRMNVILDV 175

Query: 180 CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFR 239
           C L  Q    LQQ   ITGG + K  QL GL QYLL +F  D   R+ L LP PV VD+R
Sbjct: 176 CSLD-QELTLLQQGCDITGGNYLKVPQLAGLLQYLLWVFLPDPSVRSKLVLPPPVKVDYR 234

Query: 240 ASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           A+CFCH+  ID+GY+CS+CLSI+CK    C+TC +VF
Sbjct: 235 AACFCHQELIDIGYVCSICLSIFCKFSPICTTCHTVF 271


>gi|198475301|ref|XP_001357006.2| GA18615 [Drosophila pseudoobscura pseudoobscura]
 gi|198138760|gb|EAL34072.2| GA18615 [Drosophila pseudoobscura pseudoobscura]
          Length = 298

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 106/273 (38%), Positives = 152/273 (55%), Gaps = 17/273 (6%)

Query: 13  VSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYV 70
           + L+V++LDTNP       +  + +Q L  V+AF NA L     N++ VI+  +++ +++
Sbjct: 15  IDLLVIVLDTNPSQHIVRQNPQNLTQILEAVIAFGNAHLMQKAQNKLAVISCAHHATNFL 74

Query: 71  YDSSSTG---NQSVGNGRMPSLC-ATLLQNLEEFMNKDEQLGKQEPEGRIAC-SLLSGSL 125
           Y          Q  G     SL   T+ Q L   +    +LG         C SLL+GS+
Sbjct: 75  YPQPRRQVELRQIDGQYEAFSLVEKTVKQQLGSILMNAPRLGS-------PCESLLAGSM 127

Query: 126 SMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG 183
           +MALCYI R+ R  +  +    RIL L GS +   QY+  MN  F+AQ+  + ID+C L 
Sbjct: 128 AMALCYISRLQRNVAAGVKMHSRILVLTGSNECSSQYMTFMNVFFTAQKLGITIDTCALD 187

Query: 184 AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCF 243
            +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP P  VD+RASCF
Sbjct: 188 -KTLSLLQQGCDITTGQFLKVTQLDGLLQYLLWVFLPAPQMRHKLVLPPPPKVDYRASCF 246

Query: 244 CHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           CH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 247 CHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|17945214|gb|AAL48665.1| RE13149p [Drosophila melanogaster]
          Length = 299

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 110/284 (38%), Positives = 159/284 (55%), Gaps = 21/284 (7%)

Query: 4   APSKLYSDDVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
           A SK     + L+V++LDTNP   F   +  + +Q L  V+AF NA L     N++ V++
Sbjct: 6   AASKEAESCIDLLVIVLDTNPSQHFVRQNPQNLTQILEAVIAFGNAHLMQKAQNKLAVVS 65

Query: 62  TGYNSCDYVYDSSSTGNQ-SVGNGRMPSLCATLLQNLEEFMNKDEQLGK---QEPEGRIA 117
             +++ +++Y       +    +G+  +       NL E   K +QLG      P     
Sbjct: 66  CSHHATNFLYPLPRRQVELRQVDGQYEAF------NLVEKTVK-QQLGSILMNAPRLSAP 118

Query: 118 C-SLLSGSLSMALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMNAIFSAQR 172
           C SLL+GS+SMALCYI R+ R+  L P      RIL + GS +   QY+  +N  F+AQ+
Sbjct: 119 CESLLAGSMSMALCYISRLQRN--LAPGVKMHSRILVVTGSNECASQYMTFLNVFFTAQK 176

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
             + ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP 
Sbjct: 177 LGITIDTCALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPP 235

Query: 233 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           P  VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 236 PPKVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|390602568|gb|EIN11961.1| transcription factor Tfb4 [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 338

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 106/287 (36%), Positives = 153/287 (53%), Gaps = 28/287 (9%)

Query: 11  DDVSLVVVLLDTNPFFWSSSS---------LSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
           D  + + V++D +P  W  S+         LS   FL+ +LAFLN+ +     N + V+ 
Sbjct: 2   DRAAHLSVVVDLSPPQWLLSAQPGPSNPHPLSLRSFLSQLLAFLNSHIASKPENTLCVLG 61

Query: 62  TG-------YNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEG 114
                    Y+S D+V  SS  G+    +  +P         ++   N+ + LG+ E + 
Sbjct: 62  AFPGKSGILYSSTDHVSTSSEKGDP---DAYLP-FKVVDAAIIDHITNELDVLGEWEAQE 117

Query: 115 RIACSLLSGSLSMALCYIQRVFR-SGLLHP----QPRILCLQGSPDGPEQYVAIMNAIFS 169
             A   L G+L+ ALCYI R+   S    P    +PRIL L  SPD    Y+ IMN+IFS
Sbjct: 118 STA---LVGALTKALCYINRIMNVSADNEPRSPSEPRILILSVSPDQSTSYIPIMNSIFS 174

Query: 170 AQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQ 229
           AQ+  V +D C L    S FLQQAS++TGG +   +  D L QYL+  F      R  L 
Sbjct: 175 AQKLKVTMDVCQLYGPESVFLQQASHLTGGSYIHLEHRDALLQYLIMSFLPPPSIRRTLS 234

Query: 230 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +P    +DFRA+CFCHKN +D+G++CSVCLSI+C+ +  CSTC + F
Sbjct: 235 VPTQDKIDFRAACFCHKNIVDVGFVCSVCLSIFCQPVPVCSTCRTKF 281


>gi|194759192|ref|XP_001961833.1| GF14734 [Drosophila ananassae]
 gi|190615530|gb|EDV31054.1| GF14734 [Drosophila ananassae]
          Length = 297

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 112/286 (39%), Positives = 157/286 (54%), Gaps = 21/286 (7%)

Query: 2   ASAPSKLYSDDVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVV 59
           A   SK     + L+V++LDTNP       +  + +Q L  V+AF NA L     N++ V
Sbjct: 3   AETSSKEAESSIDLLVIVLDTNPSQHIVRQNPQNLTQILEAVIAFGNAHLMQKAQNKLAV 62

Query: 60  IATGYNSCDYVY---DSSSTGNQSVGNGRMPSLC-ATLLQNLEE-FMNKDEQLGKQEPEG 114
           ++  +++ +++Y          Q  G     SL   T+ Q L    MN     G  E   
Sbjct: 63  LSCSHHATNFLYPLPRRQVELRQVDGQYEAFSLVEKTVKQQLGSILMNAPRLNGPSE--- 119

Query: 115 RIACSLLSGSLSMALCYIQRVFR---SGL-LHPQPRILCLQGSPDGPEQYVAIMNAIFSA 170
               SLL+GS+SMALCYI R+ R   SG+ +H   RIL L GS +   QY+  MN  F+A
Sbjct: 120 ----SLLAGSMSMALCYISRLQRNVTSGVKMHS--RILVLTGSNECASQYMTFMNVFFTA 173

Query: 171 QRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQL 230
           Q+  + ID+C L  +  + LQQ   IT G   K  QL+GL QYLL +F      R+ L L
Sbjct: 174 QKLGIVIDTCALD-KTLSLLQQGCDITSGQFLKVTQLEGLLQYLLWVFLPAPQMRHKLVL 232

Query: 231 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           P P  VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 233 PPPPKVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 278


>gi|110757998|ref|XP_394705.3| PREDICTED: general transcription factor IIH subunit 3 [Apis
           mellifera]
          Length = 292

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 104/272 (38%), Positives = 148/272 (54%), Gaps = 13/272 (4%)

Query: 12  DVSLVVVLLDTNPF--FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
           + SL++++LD NP        +   +Q L   + F NA L  +  NQ+ VIA   +   +
Sbjct: 6   ETSLLIIILDVNPVQRIIKQETRILTQCLDSTIVFANAHLMQSSNNQLAVIACHSHGAKF 65

Query: 70  VYDSSSTGNQSVGNGRMPSLCA---TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLS 126
           +Y           +G+         T+ Q L++ +N  E    +   G    SL+SG+L+
Sbjct: 66  LYPCEDALEIRQIDGQYEKFTMIELTVRQQLQKVIN--EITSNKILNGE---SLISGALT 120

Query: 127 MALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA 184
           MALCYI R+ R  +   +   RIL +  S D   QY+  MN  F+AQ+  V +D C L  
Sbjct: 121 MALCYIARLEREKIAGEKLYSRILVITASNDSATQYMNYMNIFFTAQKMGVILDVCSLD- 179

Query: 185 QNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFC 244
           Q    LQQ   ITGG + K  QL+GL QYLL IF  D + R+ L  P PV VD+RA+CFC
Sbjct: 180 QELTLLQQGCDITGGNYLKVPQLNGLLQYLLWIFLPDPNVRSKLVFPPPVKVDYRAACFC 239

Query: 245 HKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           H+  ID+GY+CS+CLSI+CK    C+TC +VF
Sbjct: 240 HQELIDIGYVCSICLSIFCKFSPICTTCHTVF 271


>gi|380030634|ref|XP_003698948.1| PREDICTED: general transcription factor IIH subunit 3-like [Apis
           florea]
          Length = 292

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 104/272 (38%), Positives = 148/272 (54%), Gaps = 13/272 (4%)

Query: 12  DVSLVVVLLDTNPF--FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
           + SL++++LD NP        +   +Q L   + F NA L  +  NQ+ VIA   +   +
Sbjct: 6   ETSLLIIILDVNPVQRIIKQETRILTQCLDSTIVFANAHLMQSSNNQLAVIACHSHGAKF 65

Query: 70  VYDSSSTGNQSVGNGRMPSLCA---TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLS 126
           +Y           +G+         T+ Q L++ +N  E    +   G    SL+SG+L+
Sbjct: 66  LYPCEDALEIRQIDGQYEKFTMIELTVRQQLQKVIN--EITSNKLLSGE---SLISGALT 120

Query: 127 MALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA 184
           MALCYI R+ R  +   +   RIL +  S D   QY+  MN  F+AQ+  V +D C L  
Sbjct: 121 MALCYIARLEREKIAGEKLYSRILVITASNDSATQYMNYMNIFFTAQKMGVILDVCSLD- 179

Query: 185 QNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFC 244
           Q    LQQ   ITGG + K  QL+GL QYLL IF  D + R+ L  P PV VD+RA+CFC
Sbjct: 180 QELTLLQQGCDITGGNYLKVPQLNGLLQYLLWIFLPDPNVRSKLVFPPPVKVDYRAACFC 239

Query: 245 HKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           H+  ID+GY+CS+CLSI+CK    C+TC +VF
Sbjct: 240 HQELIDIGYVCSICLSIFCKFSPICTTCHTVF 271


>gi|195032196|ref|XP_001988455.1| GH10566 [Drosophila grimshawi]
 gi|193904455|gb|EDW03322.1| GH10566 [Drosophila grimshawi]
          Length = 300

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 109/285 (38%), Positives = 153/285 (53%), Gaps = 23/285 (8%)

Query: 4   APSKLYSDDVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
           A +K     + L+ ++LDTNP       +  + +Q L  V+AF NA L     N++ VI+
Sbjct: 7   AANKEAEASIDLLAIVLDTNPTQHIVRQNPQNLTQILEAVIAFGNAHLMQKAQNKLAVIS 66

Query: 62  TGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEF----MNKDEQLGK---QEPEG 114
             +++ D++Y            GR   L   +    E F        +QLG      P  
Sbjct: 67  CSHHATDFLYPLP---------GRQVEL-RPIDGQYEAFSLVEKTVKQQLGSILMNAPRL 116

Query: 115 RIAC-SLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ 171
              C SLL+GS+SMALCYI R+ R  +  +    RIL L GS +   QY+  MN  F+AQ
Sbjct: 117 SSPCESLLAGSMSMALCYISRLQRNVAAGVKMHSRILVLTGSNECASQYMTYMNVFFTAQ 176

Query: 172 RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 231
           +  + ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP
Sbjct: 177 KLSIVIDACALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFMPSPQMRHKLVLP 235

Query: 232 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            P  VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 236 PPPKVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 280


>gi|332375937|gb|AEE63109.1| unknown [Dendroctonus ponderosae]
          Length = 300

 Score =  164 bits (415), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 106/298 (35%), Positives = 160/298 (53%), Gaps = 20/298 (6%)

Query: 5   PSKLYSDDVSLVVVLLDTNPF--FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIAT 62
           P    +D++SL+VV+LDTNP            +  +  V+AF N+ L     N++ V+A 
Sbjct: 12  PEITSADEISLLVVILDTNPTQKMLRDKQNQLTNIVDSVVAFSNSHLMQKAQNKLAVMAC 71

Query: 63  GYNSCDYVYDSSSTG-NQSVGNGRMPSLC---ATLLQNLEEFMNKDEQLGKQEPEGRIAC 118
              +  ++Y  +    +    +G+  +      T+ QNL E +  +      E       
Sbjct: 72  HSKTSQFIYPGAKKPLDVRQVDGQYEAFLLVEKTVKQNLAELLASESSTAVTE------- 124

Query: 119 SLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 176
           SLL+G+++MALCYI R+ R+     +   RIL + GS D   QY+  MN  F+AQ+  V 
Sbjct: 125 SLLAGAIAMALCYIARLQRTKPPGSKLNSRILVVSGSGDSASQYMNYMNVFFTAQKLNVV 184

Query: 177 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 236
           +D C L  Q+ + LQQ   ITGG++ K  Q+  L Q+LL +F  +   R  L LP PV V
Sbjct: 185 LDVCALD-QHLSLLQQCCDITGGMYLKVPQITALLQFLLWVFLPEPPIREKLVLPPPVKV 243

Query: 237 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 294
           D+RA+CFCH+  ID+G++CSVCLSI+CK    C+TC  VF      + +P    +KRK
Sbjct: 244 DYRAACFCHRELIDIGFVCSVCLSIFCKFSPICTTCHMVFKIPGPLAVKP----KKRK 297


>gi|449683803|ref|XP_002159513.2| PREDICTED: general transcription factor IIH subunit 3-like [Hydra
           magnipapillata]
          Length = 291

 Score =  164 bits (415), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 98/277 (35%), Positives = 148/277 (53%), Gaps = 21/277 (7%)

Query: 14  SLVVVLLDTNPFFWSS----SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
           +L+V++LD N  +W+S    ++  F + +  VL F N+ L LN  N++ ++        +
Sbjct: 8   NLLVIILDINAAWWASQFQNNNDVFQKCIDSVLVFCNSYLMLNHTNKLALVVCNSEQACF 67

Query: 70  VYDSSSTGNQSVG------NGR---MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSL 120
           +Y +SS     +       +G+      +  TL    +    +        P      SL
Sbjct: 68  LYPNSSKSTSGISIETSQKDGKYELFSEMNETLQCEFKRLFAESVSNMTNRP------SL 121

Query: 121 LSGSLSMALCYIQRVFRSGL-LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 179
           L+G+L+ ALCYI    R+        RIL ++GS D   QY+ +MN IF+A +  + ID 
Sbjct: 122 LAGALTKALCYIHSHDRTANGRRTNARILIIKGSSDSSSQYMTVMNCIFAASKKNIVIDC 181

Query: 180 CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFR 239
           C L   NS F+QQAS ITGGV+       G+ +YLL IF  D   R  L LP    +D+R
Sbjct: 182 CAL-QNNSGFMQQASDITGGVYFFIDDFSGMLEYLLWIFLPDPGLREKLNLPTSSQIDYR 240

Query: 240 ASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           A+CFCHK  +D+G++CSVCLSIYC+ + KC+TC + F
Sbjct: 241 AACFCHKQLVDVGFVCSVCLSIYCQFMPKCATCQTRF 277


>gi|303274264|ref|XP_003056454.1| RNA polymerase II transcription factor B subunit 4 [Micromonas
           pusilla CCMP1545]
 gi|226462538|gb|EEH59830.1| RNA polymerase II transcription factor B subunit 4 [Micromonas
           pusilla CCMP1545]
          Length = 296

 Score =  164 bits (414), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 104/277 (37%), Positives = 150/277 (54%), Gaps = 19/277 (6%)

Query: 8   LYSDDVSLVVVLLDTNPFFWSS------SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
           L SD  S++V+L +TNP  WS          SF + L  +L ++   L+L++ N+VV+IA
Sbjct: 22  LPSDGSSVLVILFETNPLLWSGLGGRLDGKESFDETLRQLLVYMIGFLSLHEQNRVVIIA 81

Query: 62  TGYN--SCDYVYDSSSTGNQSVGN-GRMPSLCATLLQNLEEF---MNKDEQLGKQEPEGR 115
              N   C Y   + ++G   + N G    LCA ++ NL      M  D  L  +     
Sbjct: 82  LHSNVEHCMYAIPTLASGENDLTNTGSKSDLCAQVISNLATLAMRMRSDFDLSTKSNMP- 140

Query: 116 IACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175
                L+  +SMALC + R  R    H  PRILC+ GS D   QY + MNAIFSA+ S V
Sbjct: 141 -----LTTVISMALCCVNRA-RVYKPHCSPRILCILGSQDSSGQYFSGMNAIFSAKNSGV 194

Query: 176 PIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVG 235
            ID+C LG  +S FLQQA+ ITGG +   QQ +    YLL+        R  LQ P+   
Sbjct: 195 RIDACLLGYTDSFFLQQAAQITGGSYCMLQQGEKALYYLLSDLPGSESIRAKLQSPRKFC 254

Query: 236 VDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 272
            +++A+CFCH+  + +G++CSVCL+I+C+  +KC  C
Sbjct: 255 SEYQATCFCHEVNLQVGFVCSVCLTIFCERCRKCLAC 291


>gi|396462438|ref|XP_003835830.1| similar to RNA polymerase II transcription factor B subunit 4
           [Leptosphaeria maculans JN3]
 gi|312212382|emb|CBX92465.1| similar to RNA polymerase II transcription factor B subunit 4
           [Leptosphaeria maculans JN3]
          Length = 375

 Score =  164 bits (414), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 111/336 (33%), Positives = 168/336 (50%), Gaps = 67/336 (19%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+  SS+LS S  L ++L F+NA L     N+V VIA+  +   ++Y
Sbjct: 20  SLLAIVLDTNPHAWAHLSSTLSLSAALANILVFINAHLASGNANEVAVIASHSHKTTFLY 79

Query: 72  D--SSSTGNQSVGNGRM----------PSLCA-------------TLLQNLEEFMNK-DE 105
              SSS   +   NG +          P +                +L+N  + +++ +E
Sbjct: 80  PTPSSSPRPRKAANGDVEMNGAGGNSEPHMAENPNKYRPFAIVENAILRNFAKLLDETNE 139

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP---------------------- 143
           +     P      +L+ G+LS+AL +I +     +LH                       
Sbjct: 140 RHLAATP-----TTLIGGALSLALTHINK---QTMLHAPTAASAESASLAALADAENTHV 191

Query: 144 -----QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 198
                  RIL +  S D   QY+ +MN+IF+AQR  +PID   L A ++  LQQAS  TG
Sbjct: 192 DRIPLTSRILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL-AGDTVLLQQASDATG 250

Query: 199 GVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC 258
           GV+ KP++ +GL QYL+  F  D+ +R  L +P   GVDFRA+CFCH+  +D+G++CSVC
Sbjct: 251 GVYMKPERPEGLLQYLMMAFLPDVTARASLVVPNAGGVDFRAACFCHRRVVDIGFVCSVC 310

Query: 259 LSIYCK---HLKKCSTCGSVFGQAQTQSDEPSATNR 291
           LSI+C        C TCGS        ++ P+   R
Sbjct: 311 LSIFCSPDLPDNLCLTCGSYLSLRSAMTNTPALIPR 346


>gi|426197233|gb|EKV47160.1| hypothetical protein AGABI2DRAFT_204058 [Agaricus bisporus var.
           bisporus H97]
          Length = 361

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 99/274 (36%), Positives = 143/274 (52%), Gaps = 19/274 (6%)

Query: 18  VLLDTNPFFWSSSS-----LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYD 72
           V+LD  P  W  S+     L+   FL+HVL FLNA +  N  N + V      +   +Y 
Sbjct: 6   VVLDLPPAQWHRSARHDPPLALPDFLSHVLVFLNAHIANNAENSLAVFGAFPGTSPLLYS 65

Query: 73  SSST--GNQSVGNGRMPSLCATLLQNLEEFMNKDEQLG-KQEPEGRIACSLLSGSLSMAL 129
           S+        + +   PS                + LG  + PE     S L G+++ AL
Sbjct: 66  SADPVPARLPIDSNSYPSFVHLDHAVARRVTAVLDSLGTDRAPEP----SALVGAITKAL 121

Query: 130 CYIQRVFRSG-------LLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL 182
           CY+ R+  S         + P PR+L L  SPD    Y+ IMN++FSAQ+  V ID C +
Sbjct: 122 CYVNRLAISPSSAAPAEHVLPDPRLLILSVSPDLATSYIPIMNSVFSAQKLKVTIDVCQV 181

Query: 183 GAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASC 242
              ++ FLQQA+++TGG +   ++ D L QYL+  F      R  L +P    VDFRA+C
Sbjct: 182 FGPDTVFLQQAAHLTGGSYIHLERRDALLQYLIMAFLPPPPIRKVLAVPTQDKVDFRAAC 241

Query: 243 FCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           FCHKN +D+G++CSVCLSI+C+ +  CSTC + F
Sbjct: 242 FCHKNIVDIGFVCSVCLSIFCQPVPVCSTCRTKF 275


>gi|409080333|gb|EKM80693.1| hypothetical protein AGABI1DRAFT_119282 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 361

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 99/274 (36%), Positives = 143/274 (52%), Gaps = 19/274 (6%)

Query: 18  VLLDTNPFFWSSSS-----LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYD 72
           V+LD  P  W  S+     L+   FL+HVL FLNA +  N  N + V      +   +Y 
Sbjct: 6   VVLDLPPAQWHRSARHDPPLALPDFLSHVLVFLNAHIANNAENSLAVFGAFPGTSPLLYS 65

Query: 73  SSST--GNQSVGNGRMPSLCATLLQNLEEFMNKDEQLG-KQEPEGRIACSLLSGSLSMAL 129
           S+        + +   PS                + LG  + PE     S L G+++ AL
Sbjct: 66  SADPVPARLPIDSNSYPSFVHLDHAVARRVTAVLDSLGTDRAPEP----SALVGAITKAL 121

Query: 130 CYIQRVFRSG-------LLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL 182
           CY+ R+  S         + P PR+L L  SPD    Y+ IMN++FSAQ+  V ID C +
Sbjct: 122 CYVNRLAISPSSAAPAEHVLPDPRLLILSVSPDLATSYIPIMNSVFSAQKLKVTIDVCQV 181

Query: 183 GAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASC 242
              ++ FLQQA+++TGG +   ++ D L QYL+  F      R  L +P    VDFRA+C
Sbjct: 182 FGPDTVFLQQAAHLTGGSYIHLERRDALLQYLIMAFLPPPPIRKVLAVPTQDKVDFRAAC 241

Query: 243 FCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           FCHKN +D+G++CSVCLSI+C+ +  CSTC + F
Sbjct: 242 FCHKNIVDIGFVCSVCLSIFCQPVPVCSTCRTKF 275


>gi|383855904|ref|XP_003703450.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Megachile rotundata]
          Length = 292

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 105/275 (38%), Positives = 149/275 (54%), Gaps = 19/275 (6%)

Query: 12  DVSLVVVLLDTNPF--FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
           + SL++++LD NP        +   +Q L   + F NA L  +  NQ+ +IA   +   +
Sbjct: 6   ETSLLIIVLDVNPVQRIVKQETRILTQCLDSTIVFANAHLMQSSNNQLAMIACHSHGAKF 65

Query: 70  VYDSSSTGNQSVGNGRMPSLCA---TLLQNLEEFMNK---DEQLGKQEPEGRIACSLLSG 123
           +Y           +G+         T+ Q L++ +N+   D+ L  +        SL+SG
Sbjct: 66  LYPCEKPVEIRQIDGQYEKFTMVERTVRQQLQQVINEISMDKPLNGE--------SLISG 117

Query: 124 SLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCY 181
           +L+MALCYI R+ R  +   +   RIL +  S D   QY+  MN  F+AQ+  V +D C 
Sbjct: 118 ALTMALCYIARLEREKVAGQKLYSRILTITASNDSATQYMNYMNIFFTAQKMGVILDVCS 177

Query: 182 LGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRAS 241
           L  Q    LQQ   ITGG + K  QL GL QYLL IF  D   R+ L LP PV VD+RA+
Sbjct: 178 LD-QELTLLQQGCDITGGNYLKVPQLSGLLQYLLWIFLPDPSIRSKLVLPPPVKVDYRAA 236

Query: 242 CFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           CFCH+  ID+GY+CSVCLSI+CK    C+TC +VF
Sbjct: 237 CFCHQELIDIGYVCSVCLSIFCKFSPICTTCHTVF 271


>gi|430811432|emb|CCJ31073.1| unnamed protein product [Pneumocystis jirovecii]
 gi|430812043|emb|CCJ30521.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 284

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 103/272 (37%), Positives = 146/272 (53%), Gaps = 27/272 (9%)

Query: 10  SDDVS-LVVVLLDTNPFFWSSSS--LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS 66
            DDVS L+VV+LDTNPF W + S  + F   +   L FLNA L+ NQ N + V+A+  N 
Sbjct: 18  EDDVSNLLVVILDTNPFGWDALSEYIPFDVAIREFLVFLNAHLSSNQNNDLAVLASHTNF 77

Query: 67  CDYVYDS----SSTGNQSVGNGRMPS-------------LCATLLQNLEEFMNKDEQLGK 109
             Y+Y S    S    ++V   +                +   +  NL++ ++K  +  +
Sbjct: 78  VQYLYPSVQVSSKDNEKAVEENKEDKDFQKSNIYRPFYLMNKQIQDNLKKLIDKANEQDE 137

Query: 110 QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQGSPDGPEQYVAIMNAIF 168
             P      +++ GSL MAL Y+ R  +S   +    RIL +  S D   QY+ IMN IF
Sbjct: 138 DTPS-----TMIGGSLCMALAYVNRAMQSASKNRLNARILIISVSSDITFQYIPIMNCIF 192

Query: 169 SAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFL 228
            AQ+  +PID C +G  ++ FLQQAS  T G++    +   L QYLL IF  D   R  L
Sbjct: 193 GAQKKKIPIDVCKIGG-DTVFLQQASDATKGIYLHLDKPKSLLQYLLMIFLPDQKVRQHL 251

Query: 229 QLPKPVGVDFRASCFCHKNTIDMGYICSVCLS 260
            LP  + VDFRA+CFCHK  +D+G++CSVCLS
Sbjct: 252 VLPFQLNVDFRAACFCHKKILDIGFVCSVCLS 283


>gi|195159598|ref|XP_002020665.1| GL15043 [Drosophila persimilis]
 gi|194117615|gb|EDW39658.1| GL15043 [Drosophila persimilis]
          Length = 298

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 107/282 (37%), Positives = 153/282 (54%), Gaps = 17/282 (6%)

Query: 4   APSKLYSDDVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
           A S      + L+V++LDTNP       +  + +Q L  V+AF NA L     N++ VI+
Sbjct: 6   AASTETESGIDLLVIVLDTNPSQHIVRQNPQNLTQILEAVIAFGNAHLMQKAQNKLAVIS 65

Query: 62  TGYNSCDYVYDSSSTG---NQSVGNGRMPSLC-ATLLQNLEEFMNKDEQLGKQEPEGRIA 117
             +++ +++Y          Q  G     SL   T+ Q L   +    +LG         
Sbjct: 66  CAHHATNFLYPQPRRQVELRQIDGQYEAFSLVEKTVKQQLGSILMNAPRLGS-------P 118

Query: 118 C-SLLSGSLSMALCYIQRV--FRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 174
           C SLL+GS++MALCYI R+    +  +    RIL L GS +   QY+  MN  F+AQ+  
Sbjct: 119 CESLLAGSMAMALCYISRLHGMLAAGVKMHSRILVLTGSNECSSQYMTFMNVFFTAQKLG 178

Query: 175 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 234
           + ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP P 
Sbjct: 179 ITIDTCALD-KTLSLLQQGCDITTGQFLKVTQLDGLLQYLLWVFLPAPQMRHKLVLPPPP 237

Query: 235 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 238 KVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|198432365|ref|XP_002120803.1| PREDICTED: similar to general transcription factor IIH, polypeptide
           3, 34kDa [Ciona intestinalis]
          Length = 309

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 100/297 (33%), Positives = 155/297 (52%), Gaps = 42/297 (14%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLS----FSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
            ++  L+VV+ D NP +W   SL      ++ L  +L F+N+ L L   N++ +IA+   
Sbjct: 4   EEERQLLVVIFDVNPVWWGIKSLQGQAQITKCLDCLLVFVNSYLMLRHDNKLAIIASHST 63

Query: 66  SCDYVY-----------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPE- 113
              ++Y             +S  +Q+ G+ R            E F   D+ +  +  E 
Sbjct: 64  KSVFLYPAQESSHSPDPSKASISDQAGGDCR-----------YEHFAKVDDSVTDKFKEL 112

Query: 114 GRIAC--------SLLSGSLSMALCYIQR--VFRSGLLHPQPRILCLQGSPDGPEQYVAI 163
            R  C        SL++GS+++ALCYI R  +   G      RIL ++ + D   QY++ 
Sbjct: 113 MRDGCEKNDIRRDSLIAGSMAIALCYIHRMQIESQGNTKLNARILVIKAADDAASQYMSF 172

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MN IF+AQ+  V +D+C LG Q+S  LQQA  +T G++ K    D   QYLLT F T+  
Sbjct: 173 MNVIFAAQKERVVVDACILG-QDSGLLQQACDMTSGMYLKVPHQDAFLQYLLTAFLTEAE 231

Query: 224 SRN----FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            R       Q  +P  VD+RA+CFCH++ +D+G++CSVCLSI+C+    CSTC + F
Sbjct: 232 LRKESLVLAQSSRPREVDYRAACFCHRSLVDVGFVCSVCLSIFCQFSPICSTCETAF 288


>gi|195470427|ref|XP_002087508.1| GE15755 [Drosophila yakuba]
 gi|194173609|gb|EDW87220.1| GE15755 [Drosophila yakuba]
          Length = 299

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 107/284 (37%), Positives = 154/284 (54%), Gaps = 21/284 (7%)

Query: 4   APSKLYSDDVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
           A SK     + L+V++LDTNP       +  + +Q L  V+AF NA L     N++ V++
Sbjct: 6   AASKEAESCIDLLVIVLDTNPSQHIVRQNPQNLTQILEAVIAFGNAHLMQKAQNKLAVVS 65

Query: 62  TGYNSCDYVY---DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGK---QEPEGR 115
             +++ +++Y          Q  G     SL    ++         +QLG      P   
Sbjct: 66  CSHHATNFLYPLPRRQVELRQVDGQYEAFSLVEKTVK---------QQLGSILMNAPRLS 116

Query: 116 IAC-SLLSGSLSMALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 172
             C SLL+GS+SMALCYI R+ R+    +    RIL + GS +   QY+  MN  F+AQ+
Sbjct: 117 APCESLLAGSMSMALCYISRLQRNVAPGVKMHSRILVITGSNECASQYMTFMNVFFTAQK 176

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
             + ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP 
Sbjct: 177 LGITIDTCALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPP 235

Query: 233 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           P  VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 236 PPKVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|395332939|gb|EJF65317.1| transcription factor Tfb4 [Dichomitus squalens LYAD-421 SS1]
          Length = 324

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 97/290 (33%), Positives = 148/290 (51%), Gaps = 30/290 (10%)

Query: 11  DDVSLVVVLLDTNPFFWS-------SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           D  S + V++D +P  W+          L    FL+ +LAFLNA +     N + V    
Sbjct: 2   DKPSHLSVIVDLSPTQWALCAQSTAPPGLHLQTFLSQLLAFLNAHIACKDENTLAVFGAF 61

Query: 64  YNSCDYVYDSS-----STGNQSVGNGRMP--SLCATLLQNLEEFMNKDEQLGKQEPEGRI 116
                 +Y S+     +T   +  N   P  +L +T++ ++ E  N  +   ++EP G +
Sbjct: 62  PGQSVMLYSSTGEIMTATTEATDPNTYRPFKTLDSTVMSSIAERFNSYDVTEEEEPIGLV 121

Query: 117 ACSLLSGSLSMALCYIQRVF----------RSGLLHPQPRILCLQGSPDGPEQYVAIMNA 166
            C      L+ ALCYI R                +   PRIL L  SPD    Y+ IMN+
Sbjct: 122 GC------LTKALCYINRTISVVSASAQGTEGATISIDPRILVLSVSPDQSSTYIPIMNS 175

Query: 167 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 226
           IFSAQ+  V ID C +    + FLQQA+++TGG + + ++ D L QY++  F      R 
Sbjct: 176 IFSAQKLKVTIDVCKVFGAENVFLQQAAHLTGGAYVQIERTDALLQYMMMSFLPPPAIRQ 235

Query: 227 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            + +P    VDFRA+CFCHK  +++G++CSVCLSI+C+ +  CSTC + F
Sbjct: 236 LISVPTQDRVDFRAACFCHKRIVEIGFVCSVCLSIFCQPVPVCSTCRTKF 285


>gi|407926572|gb|EKG19539.1| Transcription factor Tfb4 [Macrophomina phaseolina MS6]
          Length = 380

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 106/317 (33%), Positives = 156/317 (49%), Gaps = 66/317 (20%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ +++DTNP  W+  + +L  S+ + ++L F+NA L +N  NQV V+A+  +   ++Y
Sbjct: 20  SLLTIIIDTNPHAWAFLADTLPLSKAIANLLVFINAHLAINSANQVAVLASHSHQATWLY 79

Query: 72  ----------------------------DSSSTGNQSVGNGRMP--SLCATLLQNLEEFM 101
                                       D S T +    N   P  ++   LL NL   M
Sbjct: 80  PTPHPPKAKRRNAPNGNKEENGDADGDVDMSDTPSMDDANKYRPFRNVETELLANLRVLM 139

Query: 102 NKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRV------------------------FR 137
            +  ++  +        +L++G+L+ AL YI +                          R
Sbjct: 140 ARTSEMDLKAS----TTTLVAGALTTALAYIAKANIKSAPQVGDTQMADLATAADPPAAR 195

Query: 138 SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 197
           S L     RIL L  S D   QY+ +MNAIF+AQR  +PID   L A ++ FLQQA   T
Sbjct: 196 SSLTS---RILVLSVSGDLASQYIPVMNAIFAAQRQRIPIDILKL-AGDTVFLQQACDAT 251

Query: 198 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 257
           GG++ KP   +GL QYL+  F  D  +R  L LP    VDFRA+CFCH+  +D+GY+CSV
Sbjct: 252 GGIYLKPLAPEGLLQYLMMAFLPDETARKHLVLPSAGEVDFRAACFCHRRVVDIGYVCSV 311

Query: 258 CLSIYCKHL--KKCSTC 272
           CLSI+C+    + C TC
Sbjct: 312 CLSIFCEPPADQTCPTC 328


>gi|392570135|gb|EIW63308.1| transcription factor Tfb4 [Trametes versicolor FP-101664 SS1]
          Length = 329

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 99/290 (34%), Positives = 151/290 (52%), Gaps = 30/290 (10%)

Query: 11  DDVSLVVVLLDTNPFFWSSSS-------LSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           D  S + V+LD +P  W+ S+       +S   FL+ +LAFLNA +     N + V    
Sbjct: 2   DTPSHLSVILDLSPTQWALSAQSTNPPGISLETFLSQLLAFLNAHIACKDENTLAVFGAL 61

Query: 64  YNSCDYVYDSS-----STGNQSVGNGRMP--SLCATLLQNLEEFMNKDEQLGKQEPEGRI 116
                 +Y S+     S    +  N   P  +L AT++++L E +       ++ P G +
Sbjct: 62  PGESVLLYSSTGEVTASDDGPTDSNTYRPFKTLDATVMKSLAERLESFGDAAEEAPIGIV 121

Query: 117 ACSLLSGSLSMALCYIQRVF-RSGLLHPQ---------PRILCLQGSPDGPEQYVAIMNA 166
            C      ++ ALCYI R+   +    P          PRIL L  SPD    Y+ IMN+
Sbjct: 122 GC------MTKALCYINRITSNTTAAGPNEDGPTASIDPRILVLSVSPDQSSTYIPIMNS 175

Query: 167 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 226
           IFSAQ+  V ID C +   ++ FLQQA+++TGG + +  + D L QY++  F      R 
Sbjct: 176 IFSAQKLKVTIDVCKIFGADNVFLQQAAHLTGGAYVQVDRTDALLQYMMMSFLPPPAIRQ 235

Query: 227 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            + +P    VDFRA+CFCHK  +++G++CSVCLSI+C+ +  CSTC + F
Sbjct: 236 LIAVPTQDRVDFRAACFCHKRIVEIGFVCSVCLSIFCQPVPVCSTCRTKF 285


>gi|393217355|gb|EJD02844.1| Tfb4-domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 378

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 98/290 (33%), Positives = 150/290 (51%), Gaps = 30/290 (10%)

Query: 11  DDVSLVVVLLDTNPFFWSSSSL-------SFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           D  S + ++LD +P  W  SSL       SF  FL   LAFLN+ L L   N + V    
Sbjct: 2   DRPSHLAIILDLSPTQWHLSSLPKNAYPLSFKSFLAQTLAFLNSHLALKHENTLAVYGAF 61

Query: 64  YNSCDYVYDSSSTGNQSVG------NGRMP--SLCATLLQNLEEFMNKDEQLGKQEPEGR 115
                 +Y S     + V       N  +P   +   + + + E ++      ++ P   
Sbjct: 62  PGKSVLLYSSMDHKPEGVDDSIADPNTYLPFKVVDTAVTKRISEELDAMSDFDEEAP--- 118

Query: 116 IACSLLSGSLSMALCYIQRVFRS---------GLLHPQPRILCLQGSPDGPEQYVAIMNA 166
             C L+ G+L+ ALC+I R+             ++ P+PRI+ L  SPD    Y+ +MN+
Sbjct: 119 --CQLV-GALTKALCHINRLVNPTATSIDSDVPVVPPEPRIVILSVSPDLSTSYIPLMNS 175

Query: 167 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 226
           IFSAQ+  V ID C +   ++ FLQQA+++TGG +   ++ D L Q+LL  F      R 
Sbjct: 176 IFSAQKLKVAIDVCKIYGPDAVFLQQAAHLTGGSYIYLERRDALLQHLLMTFLPTPSIRQ 235

Query: 227 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            + +P    +DFRA+CFCHK  +D+G++CSVCLSI+C+ +  CSTC + F
Sbjct: 236 LIAVPTQDKIDFRAACFCHKTIVDIGFVCSVCLSIFCQPVPVCSTCRTKF 285


>gi|296421687|ref|XP_002840396.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295636611|emb|CAZ84587.1| unnamed protein product [Tuber melanosporum]
          Length = 357

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 101/297 (34%), Positives = 159/297 (53%), Gaps = 44/297 (14%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+V++LDTNP+ WS  S++L   + L  +L FLN  +  +  N+V VIA+   S  ++Y
Sbjct: 19  SLLVLILDTNPYAWSTLSATLPLERALASILIFLNVHMAFSHANRVAVIASHTRSARFLY 78

Query: 72  DSSSTGN-------------QSVGNGRMPSLCATLLQ-NLEEFMNK---DEQLGKQEPEG 114
            ++++               +     R   L    +Q +  E M +    E +  +E   
Sbjct: 79  PTTTSTTTAAGGSGGSEDPFRDANKYRQFRLIEDQVQASFAELMRQTTASELVDAKE--- 135

Query: 115 RIACSLLSGSLSMALCYIQRVFR---SG------------LLHPQPRILCLQGSPDGPEQ 159
               ++++G+L +AL Y+ R+ R   SG            +     RIL +  S D  EQ
Sbjct: 136 ----TMMAGALGLALGYVNRLVRVDDSGGVVGGKDGAVGEVTSMNARILVVSVSGDLAEQ 191

Query: 160 YVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFG 219
           YV +MN+IF+AQR  +PID C +  + + FLQQAS  T G++ + +  + L QYL+  F 
Sbjct: 192 YVPVMNSIFAAQRKKIPIDICKVAGE-TVFLQQASDTTRGIYMQLEHPESLLQYLMMCFI 250

Query: 220 TDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGS 274
            D  +R  L  P  V VDFRA+CFCHKN +D+G++CS+CLSI+C   +   C+TC +
Sbjct: 251 PDTATRRHLVPPTQVNVDFRAACFCHKNVVDIGFVCSICLSIFCTPPEGAICTTCNT 307


>gi|195388531|ref|XP_002052933.1| GJ17829 [Drosophila virilis]
 gi|194149390|gb|EDW65088.1| GJ17829 [Drosophila virilis]
          Length = 300

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 106/284 (37%), Positives = 153/284 (53%), Gaps = 21/284 (7%)

Query: 4   APSKLYSDDVSLVVVLLDTNPF--FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
           A +K     + L+ ++LDTNP       +  + +Q L  V+AF NA L     +++ VI+
Sbjct: 7   AANKEAEASIDLLAIVLDTNPSQQIVRQNPQNLTQILEAVIAFGNAHLMQKAQSKLAVIS 66

Query: 62  TGYNSCDYVY---DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGK---QEPEGR 115
             +++ D++Y          Q  G     SL    ++         +QLG      P   
Sbjct: 67  CSHHATDFLYPLPGRQVELRQIDGQYEAFSLVEKTVK---------QQLGSILMNAPRLS 117

Query: 116 IAC-SLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 172
            +C SLL+GS+SMALCYI R+ R  +  +    RIL L GS +   QY+  MN  F+AQ+
Sbjct: 118 ASCESLLAGSMSMALCYISRLQRNVAAGVKMHSRILVLTGSNECASQYMTYMNVFFTAQK 177

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
             + ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP 
Sbjct: 178 LGIVIDTCALD-KTLSLLQQGCDITNGQFLKVTQLDGLLQYLLWVFLPSPQMRHKLVLPP 236

Query: 233 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
              VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 237 TPKVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 280


>gi|19112324|ref|NP_595532.1| transcription factor TFIIH complex subunit Tfb4
           [Schizosaccharomyces pombe 972h-]
 gi|62901129|sp|O74366.1|TFB4_SCHPO RecName: Full=RNA polymerase II transcription factor B subunit 4;
           AltName: Full=RNA polymerase II transcription factor B
           34 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p34 subunit
 gi|3417433|emb|CAA20320.1| transcription factor TFIIH complex subunit Tfb4
           [Schizosaccharomyces pombe]
          Length = 297

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 97/279 (34%), Positives = 149/279 (53%), Gaps = 24/279 (8%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSS--LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           +D  SL+VV+LD NP  W S S  +  S+ L  +  FLNA L  +  N+V V+A+  +  
Sbjct: 20  NDTPSLLVVILDANPASWYSLSKKVPVSKVLADITVFLNAHLAFHHDNRVAVLASHSDKV 79

Query: 68  DYVYDSSSTGNQSVGN----------GRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIA 117
           +Y+Y S +   Q V             +   +   +L  ++  M+  +++ ++       
Sbjct: 80  EYLYPSIAP-EQKVAEVDPTKEANTYRKFREVDDLVLSGMKRLMSSTDKVSRK------- 131

Query: 118 CSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPI 177
            +++SG+LS AL YI +V     L  + RIL    + D   QY+  MN IF AQ+  +PI
Sbjct: 132 -TMISGALSRALAYINQVQNKNTL--RSRILIFSLTGDVALQYIPTMNCIFCAQKKNIPI 188

Query: 178 DSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVD 237
           + C +    + FL+QA+  TGG++ K     GL QYL+     D + R  L  P    VD
Sbjct: 189 NVCNIEG-GTLFLEQAADATGGIYLKVDNPKGLLQYLMMSLFPDQNLRKHLNTPNQANVD 247

Query: 238 FRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           FRA+CFCHK  +D+G++CSVCLSI+C+    CSTC + F
Sbjct: 248 FRATCFCHKKVLDIGFVCSVCLSIFCEPRVHCSTCHTKF 286


>gi|327276109|ref|XP_003222813.1| PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH
           subunit 3-like [Anolis carolinensis]
          Length = 360

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 98/288 (34%), Positives = 158/288 (54%), Gaps = 29/288 (10%)

Query: 11  DDVSLVVVLLDTNPFFWSS-----SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
           D+++L+V+++DTNP +W       S L+ S+ L   +   N+ L +N+ N++ VIA+   
Sbjct: 61  DELNLLVIVVDTNPIWWGKQALIGSKLTLSKCLDASMVLGNSHLFMNRNNKLAVIASHIQ 120

Query: 66  SCDYVY---------------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQ 110
              ++Y               D++  G++      + ++  T+ + +++ M K   +  Q
Sbjct: 121 ESRFLYPGKRWEIIDIFGEAVDANMYGSKDGKYELLTTVNETIAEEIQDLMTK-SGIKWQ 179

Query: 111 EPEGRIACSLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVAIMNAIF 168
           + E     + L   L+ AL YI RV +  S     + RIL ++ + D   QY+  MN IF
Sbjct: 180 QTE-----TCLYXYLTKALWYIHRVSKTESASQEIKSRILVIKAAEDSALQYMNFMNVIF 234

Query: 169 SAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFL 228
           +AQ+  + ID+C L + +S  LQQA  ITGG++ K      L QYLL +F  D   R+ L
Sbjct: 235 AAQKQNILIDACVLES-DSGLLQQACDITGGIYLKVPHKPSLLQYLLWVFLPDHDQRSQL 293

Query: 229 QLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            LP  V VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 294 ILPPRVHVDYRAACFCHRNLIEVGYVCSVCLSIFCNFSPICTTCETAF 341


>gi|393244851|gb|EJD52362.1| transcription factor Tfb4 [Auricularia delicata TFB-10046 SS5]
          Length = 338

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 99/289 (34%), Positives = 150/289 (51%), Gaps = 37/289 (12%)

Query: 14  SLVVVLLDTNPFFW------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           S + ++LD +P  W      +   L+   FLT  LAF+NA L     N +VV+       
Sbjct: 5   SHLALILDLSPTQWHLCGTAAEHPLALRDFLTQTLAFVNAHLAARDENSLVVLGALPGKS 64

Query: 68  DYVY----DSSSTGNQSVGNGRMP--SLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLL 121
             +Y    D S   N +  N   P  ++  T+++ + + ++    + ++EP        L
Sbjct: 65  VLLYASNEDFSGMANVADANSYQPFKAVDTTVMKRVIDELSSLGDITEEEPTS------L 118

Query: 122 SGSLSMALCYIQRVFRSGLLHPQP--------------RILCLQGSPDGPEQYVAIMNAI 167
            G+L+ ALC+I R+      HP P              RIL L  SPD    Y+ +MN+I
Sbjct: 119 VGALTKALCHINRI-----THPPPPANPSESPPTPPEARILILSVSPDLSTSYIPLMNSI 173

Query: 168 FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 227
           FSAQ+    ID C +   ++ FLQQA+++TGG +    + + L QYL+  F +    R+ 
Sbjct: 174 FSAQKLKAIIDVCKVFGPDAVFLQQAAHLTGGSYVYLLEREALLQYLIMSFLSPPMVRSI 233

Query: 228 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           L +P    +DFRA+CFCHKN ID+GY+CSVCLSI+C  +  CSTC + F
Sbjct: 234 LSVPTQEKIDFRAACFCHKNIIDVGYVCSVCLSIFCTPVAVCSTCRTKF 282


>gi|2257568|dbj|BAA21460.1| basic transcription factor 2.35KD subunit [Schizosaccharomyces
           pombe]
          Length = 298

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 97/279 (34%), Positives = 149/279 (53%), Gaps = 24/279 (8%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSS--LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           +D  SL+VV+LD NP  W S S  +  S+ L  +  FLNA L  +  N+V V+A+  +  
Sbjct: 20  NDTPSLLVVILDANPASWYSLSKKVPVSKVLADITVFLNAHLAFHHDNRVAVLASHSDKV 79

Query: 68  DYVYDSSSTGNQSVGN----------GRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIA 117
           +Y+Y S +   Q V             +   +   +L  ++  M+  +++ ++       
Sbjct: 80  EYLYPSIAP-EQKVAEVDPTKEANTYRKFREVDDLVLSGMKRLMSSTDKVSRK------- 131

Query: 118 CSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPI 177
            +++SG+LS AL YI +V     L  + RIL    + D   QY+  MN IF AQ+  +PI
Sbjct: 132 -TMISGALSRALAYINQVQNKNTL--RSRILIFSLTGDVALQYIPTMNCIFCAQKKNIPI 188

Query: 178 DSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVD 237
           + C +    + FL+QA+  TGG++ K     GL QYL+     D + R  L  P    VD
Sbjct: 189 NVCNIEG-GTLFLEQAADATGGIYLKVDNPKGLLQYLMMSLFPDQNLRKHLNTPNQANVD 247

Query: 238 FRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           FRA+CFCHK  +D+G++CSVCLSI+C+    CSTC + F
Sbjct: 248 FRATCFCHKKVLDIGFVCSVCLSIFCEPRVHCSTCHTKF 286


>gi|427787947|gb|JAA59425.1| Putative transcription factor tfb4 [Rhipicephalus pulchellus]
          Length = 293

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 103/275 (37%), Positives = 157/275 (57%), Gaps = 16/275 (5%)

Query: 11  DDVSLVVVLLDTNPF--FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           D+ SL+V ++DTNP      +     S+FL     F N+ L LN  N++ VIA   +   
Sbjct: 5   DEGSLLVTIIDTNPCASLLQNEEGIVSRFLDAATVFCNSHLMLNPCNRLAVIACHSHKSS 64

Query: 69  YVYDSS--STGNQSVGNGR---MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSG 123
           +VY  +  ST +    +G+      + + + + ++E ++ D+       E +++ +LL+G
Sbjct: 65  FVYPKALPSTQDTVSVDGQYELFSEVTSAIKEGVKELVSSDKG------ESQVSETLLTG 118

Query: 124 SLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCY 181
            LS+ALCYI R+ +  +G      RIL L  S +   QY+  MN  F+AQ+  V ID+C 
Sbjct: 119 GLSLALCYINRIEKETAGQKKIASRILVLSASGESASQYLNFMNVFFTAQKKNVIIDACV 178

Query: 182 LGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRAS 241
           L  ++S  LQQ   ITGG + K     GL +YLL +F  D  +R++L  P PV VD+RA+
Sbjct: 179 L-EKDSGLLQQGCDITGGKYMKVPNHTGLLEYLLWLFLPDKATRDYLVFPPPVHVDYRAA 237

Query: 242 CFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           CFCH+N I++GY+CSVCLSI+C     CSTC + F
Sbjct: 238 CFCHRNLIEIGYVCSVCLSIFCAFSPICSTCQTAF 272


>gi|326929605|ref|XP_003210949.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Meleagris gallopavo]
          Length = 290

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 95/268 (35%), Positives = 148/268 (55%), Gaps = 27/268 (10%)

Query: 29  SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY--------DSSSTGNQS 80
           S   + S+ +   +   N+ L +++ N++ VIA+      ++Y        D    G  S
Sbjct: 11  SLDFTLSKCIDAAMVLGNSHLFMSRTNKLAVIASHTQESRFLYPGKRWASADLLGDGGNS 70

Query: 81  V-------GNGRMPSLCA---TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALC 130
           V        +G+   L A    + + +++ M K + +G+Q        +LL+GSL+ ALC
Sbjct: 71  VESNCSGSKDGKYELLTAINDAIAEEIKDLMTKTDMMGQQTE------TLLAGSLAKALC 124

Query: 131 YIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSA 188
           YI ++ +    + +   RIL ++ + D   QY+  MN IF+AQ+  + ID+C L + +S 
Sbjct: 125 YINKMSKEVKANQEMKSRILVIKAAEDSALQYMNFMNVIFAAQKQSILIDACVLDS-DSG 183

Query: 189 FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 248
            LQQA  ITGG++ K   L  L QYLL +F  D   R+ L LP PV VD+RA+CFCH+N 
Sbjct: 184 LLQQACDITGGIYLKVPHLPSLLQYLLWVFLPDQEQRSQLVLPPPVHVDYRAACFCHRNL 243

Query: 249 IDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           I++GY+CSVCLSI+C     CSTC + F
Sbjct: 244 IEIGYVCSVCLSIFCNFSPICSTCETAF 271


>gi|345497965|ref|XP_001603853.2| PREDICTED: general transcription factor IIH subunit 3-like [Nasonia
           vitripennis]
          Length = 295

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 105/276 (38%), Positives = 146/276 (52%), Gaps = 18/276 (6%)

Query: 12  DVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           D SL+V++LD NP       +  +Q L   + F NA L     N + ++A       ++Y
Sbjct: 6   DRSLLVIILDINPMQKLVKQV-LTQCLDSTIVFSNAHLMQTAGNSLAIMACHSQGAQFLY 64

Query: 72  DSSSTGNQSVGNG---RMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMA 128
                 +    +G   +   L  T+ Q + E + +     K   E     SL+SG+LSMA
Sbjct: 65  PQEKMPDVRQFDGQYEKFTQLERTVRQQIHEMVGEMSMDRKGNEE-----SLISGALSMA 119

Query: 129 LCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQN 186
           LCYI R+ R      H  PRIL +  S D   QY+  MN  F+AQ+  V +D C L  + 
Sbjct: 120 LCYIARMEREKYAGEHLHPRILVITASHDSATQYMNYMNIFFTAQKMGVMLDVCSLDHEL 179

Query: 187 SAFLQQASYITGGVHHK-PQQ-----LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 240
           +  LQQ   +TGG + K PQQ     L GL QYLL +F  D   R+ L LP PV VD+RA
Sbjct: 180 T-LLQQGCDLTGGNYLKIPQQSGQPPLAGLLQYLLWVFLPDPAVRSKLVLPPPVKVDYRA 238

Query: 241 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +CFCH+  +D G++CSVCLSI+CK    C+TC +VF
Sbjct: 239 ACFCHQELVDTGFVCSVCLSIFCKFSPICTTCHTVF 274


>gi|409045010|gb|EKM54491.1| hypothetical protein PHACADRAFT_258371 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 360

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 99/290 (34%), Positives = 154/290 (53%), Gaps = 30/290 (10%)

Query: 11  DDVSLVVVLLDTNPFFW-------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           D+ S + V+LD +P  W       +   ++    L+H+LAFLNA +     N + +    
Sbjct: 2   DNASHLGVVLDLSPKQWHLAAQPSNPHPITLVASLSHLLAFLNAHIACKDENSLALFGAL 61

Query: 64  YNSCDYVYDSSSTGNQ-----SVGNGRMP--SLCATLLQNLEEFMNKDEQLGKQEPEGRI 116
                 +Y S+   N+     +  N   P   + +T+++++   +++ E LG+ E E  I
Sbjct: 62  PGKSVMLYSSTMPVNEEEIQPADANTYRPFKVMDSTIMKHI---VSEVEALGEPEVEEPI 118

Query: 117 ACSLLSGSLSMALCYIQRVF--------RSGLLH--PQPRILCLQGSPDGPEQYVAIMNA 166
                 G+L+ A+CYI RV         R+  L   P PRIL +  SPD    Y+ +MN 
Sbjct: 119 G---FVGALTKAMCYINRVTNGSTSSTSRNDDLATLPDPRILVISVSPDQSSSYIPVMNL 175

Query: 167 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 226
           IFSAQ+  V ID C +    S FLQQA+++TGG +    + D + QYL+  F      R 
Sbjct: 176 IFSAQKLKVTIDVCKIFGGESVFLQQAAHLTGGSYIDIDRPDAILQYLMMSFLPPPGLRQ 235

Query: 227 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            + +P    +DFRA+CFCHKN +D+G++CSVCLSI+C+ +  CSTC + F
Sbjct: 236 LISVPTQDKIDFRAACFCHKNIVDIGFVCSVCLSIFCQPVPVCSTCRTKF 285


>gi|195437728|ref|XP_002066792.1| GK24367 [Drosophila willistoni]
 gi|194162877|gb|EDW77778.1| GK24367 [Drosophila willistoni]
          Length = 303

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 105/279 (37%), Positives = 152/279 (54%), Gaps = 15/279 (5%)

Query: 6   SKLYSDDVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           +K     + L+V++LDTNP       +  + +Q L  ++AF NA L     N++ VI+  
Sbjct: 14  AKESESSIDLLVIVLDTNPSQRIVRQNPQNLTQILEAMIAFGNAHLMQKAQNKLAVISCS 73

Query: 64  YNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGK---QEPEGRIAC-S 119
           +++ +++Y     G Q     R       +   +E+ +   +QLG      P     C S
Sbjct: 74  HHASNFLYPLP--GRQV--EVRQIDGQYEVFSQVEKTIK--QQLGNILMNAPRLSAPCES 127

Query: 120 LLSGSLSMALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPI 177
           LL+GS+  ALCYI R+ R+    +    R+L L GS +   QY+  MN  F+AQ+  + I
Sbjct: 128 LLAGSIGRALCYISRLQRNTPPGIKMHSRVLVLTGSNECSSQYMTFMNVFFTAQKLGIVI 187

Query: 178 DSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVD 237
           D+C L    S  LQQ   IT G   K  QLDGL QYLL +F      R+ L LP P  VD
Sbjct: 188 DTCSLENPVS-LLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPSPQMRHKLVLPPPPKVD 246

Query: 238 FRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +RASCFCH+  ID+GY+CSVCLS++CK+   C+TC +VF
Sbjct: 247 YRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTVF 285


>gi|357624804|gb|EHJ75440.1| putative transcription factor TFIIH-subunit [Danaus plexippus]
          Length = 291

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 98/272 (36%), Positives = 147/272 (54%), Gaps = 13/272 (4%)

Query: 12  DVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
           D SL+V+++DTNP   + +      +  L  V+AF N+ L     NQ+ VI   ++  +Y
Sbjct: 9   DSSLLVIIVDTNPNQRYITEDPKVLTGCLDAVIAFANSHLMQKSRNQLAVIGCHFHKSEY 68

Query: 70  VYDSSSTG---NQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLS 126
           +Y S        Q  G   + +L    ++     M     +  Q  E R   SLL+G+++
Sbjct: 69  LYPSPGKPLDVRQIDGQYELFTLVEKTIK-----MRLVNLIKSQPQEERPGESLLAGAMA 123

Query: 127 MALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA 184
           M LC+I R+ R     L    R+L + GS D   QY+  MN  F+AQ+  V +D C L  
Sbjct: 124 MGLCFIARMRREQPPGLRMVSRMLVVTGSSDTAAQYINYMNVFFTAQKQQVLLDVCSLD- 182

Query: 185 QNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFC 244
           ++ + LQQ   ITGG++ K   L+GL QYLL +F  +   R  L LP    VD+RA+CFC
Sbjct: 183 KHLSLLQQGCDITGGLYLKVPSLEGLLQYLLWVFLPEASERRELVLPGRGRVDYRAACFC 242

Query: 245 HKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           H+  + +GY+CSVCLS++CK    C+TC +VF
Sbjct: 243 HRELLTIGYVCSVCLSVFCKFSPICTTCHTVF 274


>gi|195995769|ref|XP_002107753.1| hypothetical protein TRIADDRAFT_51576 [Trichoplax adhaerens]
 gi|190588529|gb|EDV28551.1| hypothetical protein TRIADDRAFT_51576 [Trichoplax adhaerens]
          Length = 276

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 96/276 (34%), Positives = 153/276 (55%), Gaps = 32/276 (11%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           +L+++++DTNP  W   ++ + F++++ +++ ++N+ L ++  N + VIA+  N   Y+Y
Sbjct: 9   NLLILVIDTNPLAWLPFAAEMDFTRYIENIVVYINSHLMMDHRNSIAVIASHVNKSCYLY 68

Query: 72  -----------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSL 120
                      +S+ST N +  +GR   L           +N   +   +E    +A + 
Sbjct: 69  SPHMDNWIKNTNSNSTVNIA-KDGRHEEL---------SHINYIIETRVKELVQEMATAD 118

Query: 121 LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180
           +   +       Q +   G    + RIL ++ +PD   QY+ IMN IF+AQ+  VPID+C
Sbjct: 119 IHNHVD------QDLRNKG--QAKSRILVIKCTPDTSFQYLPIMNCIFTAQKYNVPIDAC 170

Query: 181 YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 240
            +  ++S FLQQA  ITGG++ K Q   GL QYLLT F  +   R  L LP    VD+RA
Sbjct: 171 VI-EKDSGFLQQACDITGGIYLKLQSPSGLLQYLLTAFLPNQSMRETLILPPKAKVDYRA 229

Query: 241 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +CFCH+  I++GYICSVCLSI CK +  C TC + F
Sbjct: 230 ACFCHRELIEIGYICSVCLSISCKFVPVCPTCQTHF 265


>gi|351698565|gb|EHB01484.1| General transcription factor IIH subunit 3, partial [Heterocephalus
           glaber]
          Length = 315

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 101/299 (33%), Positives = 153/299 (51%), Gaps = 39/299 (13%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLN---------------------AI 48
            D+++L+V+++DTNP +W   SL  SQ   H     +                     A+
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQSLKESQCWDHKCVLPHPAVIGGSHRFCCGHSPTRGGGAL 64

Query: 49  LTLNQLNQVVVIATGYNS--CDYVYDSSSTGNQ----SVGNGRMPSLCA---TLLQNLEE 99
             L  L Q   +  G N    D+  D  S           +G+   L A    + + +++
Sbjct: 65  FFLVHLQQCRFLYPGKNGRLGDFFGDPGSPSPDYNPSGSKDGKYELLTAANDVIAEEIKD 124

Query: 100 FMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGP 157
            M   +  G+         +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D  
Sbjct: 125 LMTTSDIKGQHTE------TLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSA 178

Query: 158 EQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTI 217
            QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +
Sbjct: 179 LQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMASLLQYLLWV 237

Query: 218 FGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           F  D   R  L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 FLPDQDQRAQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 296


>gi|449476600|ref|XP_002189370.2| PREDICTED: general transcription factor IIH subunit 3 isoform 1
           [Taeniopygia guttata]
 gi|449476604|ref|XP_004176463.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
           [Taeniopygia guttata]
          Length = 267

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 88/251 (35%), Positives = 143/251 (56%), Gaps = 27/251 (10%)

Query: 46  NAILTLNQLNQVVVIATGYNSCDYVY------------------DSSSTGNQSVGNGRMP 87
           N+ L +N+ N++ VIA+      ++Y                  +S+ +G++      + 
Sbjct: 5   NSHLLMNRTNKLAVIASHTQESRFLYPGKRWAAADPFGEGGPSMESNCSGSKDGKYELLT 64

Query: 88  SLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--P 145
           ++   + + +++ M K +  G+Q        +LL+GSL+ ALCYI ++ +   ++ +   
Sbjct: 65  AINDAIAEEIKDLMTKTDMKGQQTE------TLLAGSLAKALCYINKMGKDLKVNQEIKS 118

Query: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 205
           RIL ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  
Sbjct: 119 RILVIKAAEDSALQYMNFMNVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGIYLKVP 177

Query: 206 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 265
            +  L QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C  
Sbjct: 178 HMPSLLQYLLWVFLPDQEQRSQLVLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNF 237

Query: 266 LKKCSTCGSVF 276
              CSTC + F
Sbjct: 238 SPICSTCETAF 248


>gi|353240140|emb|CCA72023.1| related to TFIIH basal transcription factor complex p34 subunit
           [Piriformospora indica DSM 11827]
          Length = 355

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 95/284 (33%), Positives = 147/284 (51%), Gaps = 21/284 (7%)

Query: 11  DDVSLVVVLLDTNPFFWSSSS--------LSFSQFLTHVLAFLNAILTLNQLNQVVVIAT 62
           ++ S + V+LD +P  W  SS        LS + FL  +  FLNA L     N + V   
Sbjct: 2   ENASHLSVILDLSPSQWYQSSQTSGDKHPLSLAHFLPQLFVFLNAHLASQHENTLTVFGA 61

Query: 63  GYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLS 122
                  +Y S+ T  Q+  +        T+   + + +N++ +   ++ E  IA   L 
Sbjct: 62  FPGKSSMLYSSNETYIQADADSNTFQPFKTMNSVIAQRLNEELESLPEDAEAPIA---LV 118

Query: 123 GSLSMALCYI-QRVFRSGLLH---------PQPRILCLQGSPDGPEQYVAIMNAIFSAQR 172
           G+L+ +LC I +R   S  L+         P PR+L +  SPD    Y+ IMN+IF AQ+
Sbjct: 119 GALTKSLCTINRRSHPSDTLNATSPNPSKLPTPRLLVISVSPDLSASYIPIMNSIFCAQK 178

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
             + ID C +   ++ FLQQA+++TGG +    + D + QYL+T F +    R  + +P 
Sbjct: 179 LKLSIDVCKIFGPDNVFLQQAAHLTGGSYISIDRRDSILQYLMTTFLSPPAVRRLMAVPT 238

Query: 233 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
              +D RA+CFCHK  +D+G+ICS+CLSI+CK    CSTC   F
Sbjct: 239 EDRIDLRAACFCHKQIVDIGFICSICLSIFCKPTPVCSTCRVKF 282


>gi|195118052|ref|XP_002003554.1| GI21952 [Drosophila mojavensis]
 gi|193914129|gb|EDW12996.1| GI21952 [Drosophila mojavensis]
          Length = 299

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 101/276 (36%), Positives = 148/276 (53%), Gaps = 23/276 (8%)

Query: 13  VSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYV 70
           + L+ ++LDTNP       +  + +Q L  V+AF NA L     N++ V++  + + +++
Sbjct: 16  IDLLAIVLDTNPSQLIVRQNPQNLTQILEAVIAFGNAHLMQKAQNKLAVLSCSHYTTEFL 75

Query: 71  YDSSSTGNQSVGNGRMPSLCATLLQNLEEF----MNKDEQLGK---QEPEGRIAC-SLLS 122
           +            GR   L   +    E F        +QLG      P     C SLL+
Sbjct: 76  FPLP---------GRQVEL-RQIDAQYEAFSLVEKTVKQQLGSILMNAPRLSAPCESLLA 125

Query: 123 GSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180
           GS+SMALCYI R+ R+    +    RIL + GS +   QY+  MN  F+AQ+  + +D+C
Sbjct: 126 GSMSMALCYISRLQRNVPAGVKMHSRILVITGSNECASQYMTFMNVFFTAQKLGIVVDTC 185

Query: 181 YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 240
            L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP    VD+RA
Sbjct: 186 ALD-KTLSLLQQGCDITNGQFLKVNQLDGLLQYLLWVFLPSPQMRHKLVLPPTPKVDYRA 244

Query: 241 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           SCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 245 SCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 280


>gi|452838611|gb|EME40551.1| hypothetical protein DOTSEDRAFT_136966 [Dothistroma septosporum
           NZE10]
          Length = 379

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 112/333 (33%), Positives = 162/333 (48%), Gaps = 77/333 (23%)

Query: 10  SDDVSLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           S+  SL+ ++LDTNP  W+    SL  S+ +  +L F+NA L ++  N+V VIA+     
Sbjct: 16  SEPPSLLTIILDTNPHAWALHEGSLPLSKAVASLLVFINAHLAIHYGNKVAVIASHAERA 75

Query: 68  DYVY--------DSSSTGNQSVGNG---------RMPS----------LCATLLQNLEEF 100
           +++Y         SS+T +  +GNG         R P           +   L  NL + 
Sbjct: 76  EFLYPAPATAKPHSSATDDVDMGNGDALVNGGPVRPPDDANKYRPFAHIEHALTHNLRDM 135

Query: 101 MNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQ--------------------------- 133
           M++        P      ++L+G+L+MAL YI                            
Sbjct: 136 MDRTSPAALSTPS-----TMLAGALTMALTYISKQSAALPTANSSAQFNYSDPSAVAGGS 190

Query: 134 -------RVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQN 186
                  R    GL   + RIL L  S D  +QY++IMNAIF+ QR  VPID   L A +
Sbjct: 191 DLTADGSRKKHHGL---RSRILILSVSGDLADQYISIMNAIFACQRMSVPIDILKL-AGD 246

Query: 187 SAFLQQASYITGGVH--HKPQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVGVDFRASCF 243
           + FLQQA+  TGGV+     +   G  QYL+  +  D  +R  L +P +  GVDFRA+CF
Sbjct: 247 TVFLQQAADATGGVYVALDEKTRAGFLQYLMFAYLPDQTARAHLIMPGEGEGVDFRAACF 306

Query: 244 CHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 274
           CH+  +D GY+CS+CLSI+C+  H   C TCGS
Sbjct: 307 CHRKVVDTGYVCSICLSIFCEPLHDGTCLTCGS 339


>gi|297693367|ref|XP_002823992.1| PREDICTED: general transcription factor IIH subunit 3 [Pongo
           abelii]
          Length = 318

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 94/286 (32%), Positives = 153/286 (53%), Gaps = 26/286 (9%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQ--------FLTH---VLAFLNAILTLNQLNQVV 58
            D+++L+V+++D NP +W   +L  SQ          TH     +    +   +  ++ +
Sbjct: 21  EDELNLLVIVVDANPIWWGKQALKESQECMCTPLQHFTHNNPPPSVFPPLFRKHWNHRFL 80

Query: 59  VIATGYNSCDYVYDSSS------TGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEP 112
                    D+  D  +      +G++      + S    +++ +++ M K +  G+   
Sbjct: 81  YPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSANEVIVEEIKDLMTKSDIKGQHTE 140

Query: 113 EGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSA 170
                 +LL+GSL+ ALCYI R+ +        + RIL ++ + D   QY+  MN IF+A
Sbjct: 141 ------TLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFMNVIFAA 194

Query: 171 QRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQL 230
           Q+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ L L
Sbjct: 195 QKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLTL 253

Query: 231 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           P PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 254 PPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 299


>gi|392593946|gb|EIW83271.1| transcription factor Tfb4 [Coniophora puteana RWD-64-598 SS2]
          Length = 327

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 100/287 (34%), Positives = 146/287 (50%), Gaps = 27/287 (9%)

Query: 11  DDVSLVVVLLDTNPFFWSSSS-------LSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           +  S + V++D +P  W  S+       LS   FL+ +LAFLN  +     N + V    
Sbjct: 2   EKASHLSVIVDLSPTQWHLSAQPSNPYPLSIQSFLSQLLAFLNTHVAAKHENTLAVYGAL 61

Query: 64  YNSCDYVYDSSSTGNQ----SVGNGRMP--SLCATLLQNLEEFMNKDEQLGKQEPEGRIA 117
                 +Y SS    +    S  N   P   + AT++  + +      +  ++ P     
Sbjct: 62  PGKSVLLYSSSDPAPENAPASDANAFTPFKLVDATVVDRIMQEFEAVSEAEEEAP----- 116

Query: 118 CSLLSGSLSMALCYIQRVFRSGLLH--------PQPRILCLQGSPDGPEQYVAIMNAIFS 169
           CSL+S  L+ ALCYI R+               P PRIL L  SPD    Y+ IMN IFS
Sbjct: 117 CSLVSA-LAKALCYINRITHPSTSAGPEDPNELPDPRILILSVSPDLSSSYIPIMNCIFS 175

Query: 170 AQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQ 229
           AQ+  V ID C +    + FLQQA+++T G +   ++ D L QYL+  F      R  L 
Sbjct: 176 AQKLKVTIDVCKVYGPETVFLQQAAHLTEGSYIHLERRDALLQYLVMSFLAPPSVRGVLA 235

Query: 230 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +P+   +DFRA+CFCHKN +D+G++CSVCLSI+C+ +  CSTC + F
Sbjct: 236 VPRQERIDFRAACFCHKNIVDIGFVCSVCLSIFCQPVPVCSTCRTKF 282


>gi|169775241|ref|XP_001822088.1| transcription factor TFIIH complex subunit [Aspergillus oryzae
           RIB40]
 gi|83769951|dbj|BAE60086.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|391872974|gb|EIT82049.1| RNA polymerase II transcription initiation protein [Aspergillus
           oryzae 3.042]
          Length = 376

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 104/322 (32%), Positives = 156/322 (48%), Gaps = 67/322 (20%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           +L+ V+LDTNP  W+    SL  S+ + ++L FLNA L  N  N+V V+A+      ++Y
Sbjct: 20  ALLTVVLDTNPHAWALLEDSLPLSKAIANILVFLNAHLACNYANEVAVVASHSQKAAWLY 79

Query: 72  --------------------DSSSTGNQSVGNGRMPS------LCATLLQNLEEFMNKDE 105
                                SS+  N + G G++        +   +  NL E M    
Sbjct: 80  PHENPATKISHDADGDVSMNGSSTNNNTTEGPGQVNKYRPFRIVEEQVTHNLRELM---- 135

Query: 106 QLGKQEPEGRIACS-LLSGSLSMALCYIQR-------------------------VFRSG 139
                 P+ +   S +++G+L++AL +I R                            SG
Sbjct: 136 -ASTSGPDVKANNSTMMAGALTLALSHINRRSIAWAEAHGADTAAETSGGGPPSGGHASG 194

Query: 140 L-----LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQAS 194
                 L  +  I+ + G+ D   QY+ IMN+IF+ QR  +PID C L   ++ FLQQAS
Sbjct: 195 TDTTEGLQSRILIVSVSGASDSAHQYIPIMNSIFACQRLHIPIDVCKLSG-DAVFLQQAS 253

Query: 195 YITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYI 254
             T G++    +  GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++
Sbjct: 254 DATKGIYMSLTEPRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDIGFV 313

Query: 255 CSVCLSIYCKHLKK--CSTCGS 274
           CS+CLSI+C+ L    C TCG+
Sbjct: 314 CSICLSIFCEPLDNGTCLTCGT 335


>gi|346472167|gb|AEO35928.1| hypothetical protein [Amblyomma maculatum]
          Length = 293

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 108/275 (39%), Positives = 154/275 (56%), Gaps = 16/275 (5%)

Query: 11  DDVSLVVVLLDTNPF--FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           D+ SL+V ++DTNP      +     S+FL     F N+ L LN  N++ VIA   +   
Sbjct: 5   DEGSLLVTIIDTNPCASLLQNEEGIVSRFLDAATVFCNSHLMLNPCNKLAVIACHSHKSM 64

Query: 69  YVY---DSSSTGNQSV-GNGRMPSLCATLL-QNLEEFMNKDEQLGKQEPEGRIACSLLSG 123
           ++Y     S+  N SV G   + S     + + ++E +  DE       E  ++ +LL+G
Sbjct: 65  FLYPRAQPSAQENYSVDGQYELFSEVKNAVKEGVKELVLSDEN------ESLVSETLLTG 118

Query: 124 SLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCY 181
            LS+ALC+I RV +  +G      RIL L  S +   QY+  MN  F+AQ+  V ID+C 
Sbjct: 119 GLSLALCHINRVEKETAGQKKIASRILVLSASGESASQYLNFMNVFFTAQKKNVIIDACV 178

Query: 182 LGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRAS 241
           L  ++S  LQQ   ITGG + K     GL QYLL +F  D  SR++L  P PV VD+RA+
Sbjct: 179 L-EKDSGLLQQGCDITGGKYMKVPNHAGLLQYLLWLFLPDKASRDYLVFPPPVHVDYRAA 237

Query: 242 CFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           CFCH+N I++GY+CSVCLSI+C     CSTC + F
Sbjct: 238 CFCHRNLIEIGYVCSVCLSIFCAFSPICSTCQTAF 272


>gi|281343811|gb|EFB19395.1| hypothetical protein PANDA_000941 [Ailuropoda melanoleuca]
          Length = 278

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 93/266 (34%), Positives = 147/266 (55%), Gaps = 28/266 (10%)

Query: 32  LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSS---------TGNQSV- 81
            + S+ +  V+   N+ L +N+ N++ VIA+      ++Y   +          GN S  
Sbjct: 2   FTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHIQESRFLYPGKNGRLGDFFGDPGNPSSE 61

Query: 82  ------GNGRMPSLCAT---LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYI 132
                  +G+     A    + + +++ M K E  G+         +LL+GSL+ ALCYI
Sbjct: 62  FSPSGSKDGKYELFTAANEIIAEEIKDLMTKSEIKGQHTE------TLLAGSLAKALCYI 115

Query: 133 QRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFL 190
            R+ +    + +   RIL ++ + D   QY+ +MN IF+AQ+  + ID+C L + +S  L
Sbjct: 116 HRMNKEVKDNQEMKSRILVIKAAEDSALQYMNLMNVIFAAQKQNILIDACVLDS-DSGLL 174

Query: 191 QQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTID 250
           QQA  ITGG++ K  Q+  L QYLL +F  D   R+ L LP P+ VD+RA+CFCH+N I+
Sbjct: 175 QQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLILPPPIHVDYRAACFCHRNLIE 234

Query: 251 MGYICSVCLSIYCKHLKKCSTCGSVF 276
           +GY+CSVCLSI+C     C+TC + F
Sbjct: 235 IGYVCSVCLSIFCNFSPICTTCETAF 260


>gi|238496129|ref|XP_002379300.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
           flavus NRRL3357]
 gi|220694180|gb|EED50524.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
           flavus NRRL3357]
          Length = 367

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 104/322 (32%), Positives = 156/322 (48%), Gaps = 67/322 (20%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           +L+ V+LDTNP  W+    SL  S+ + ++L FLNA L  N  N+V V+A+      ++Y
Sbjct: 11  ALLTVVLDTNPHAWALLEDSLPLSKAIANILVFLNAHLACNYANEVAVVASHSQKAAWLY 70

Query: 72  --------------------DSSSTGNQSVGNGRMPS------LCATLLQNLEEFMNKDE 105
                                SS+  N + G G++        +   +  NL E M    
Sbjct: 71  PHENPATKISHDADGDVSMNGSSTNNNTTEGPGQVNKYRPFRIVEEQVTHNLRELM---- 126

Query: 106 QLGKQEPEGRIACS-LLSGSLSMALCYIQR-------------------------VFRSG 139
                 P+ +   S +++G+L++AL +I R                            SG
Sbjct: 127 -ASTSGPDVKANNSTMMAGALTLALSHINRRSIAWAEAHGADTAAETSGGGPPSGGHASG 185

Query: 140 L-----LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQAS 194
                 L  +  I+ + G+ D   QY+ IMN+IF+ QR  +PID C L   ++ FLQQAS
Sbjct: 186 TDTTEGLQSRILIVSVSGASDSAHQYIPIMNSIFACQRLHIPIDVCKLSG-DAVFLQQAS 244

Query: 195 YITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYI 254
             T G++    +  GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++
Sbjct: 245 DATKGIYMSLTEPRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDIGFV 304

Query: 255 CSVCLSIYCKHLKK--CSTCGS 274
           CS+CLSI+C+ L    C TCG+
Sbjct: 305 CSICLSIFCEPLDNGTCLTCGT 326


>gi|429851889|gb|ELA27048.1| transcription factor tfiih subunit [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 362

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 96/300 (32%), Positives = 146/300 (48%), Gaps = 40/300 (13%)

Query: 14  SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ +++DTNP  W++    L  S+ +  +L F+NA L  +  NQV +IA+  N   ++Y
Sbjct: 20  SLLSIVIDTNPRAWAALNDVLPLSKAIASILVFVNAHLAFSNANQVAIIASHSNRAVWLY 79

Query: 72  DSSSTGNQSVGNGR----------MPSLCATLLQNLEEFMNKDE-------QLGKQEPEG 114
            +      +  NG            P+   +      +F   ++       ++  Q  E 
Sbjct: 80  PTKPEPKAASTNGASEDVDMADAFAPTPNKSNANKYPQFAQIEKAVLGAIREVINQTTEA 139

Query: 115 RIACSL--LSGSLSMALCYIQ----------------RVFRSGLLHPQPRILCLQGSPDG 156
            ++C+   LSG+L++AL YI                 R   S L     RI  L  S   
Sbjct: 140 DLSCTTTQLSGALTLALSYINKASMALNETNKTPEATRATSSALTRLHARIFILSVSDSE 199

Query: 157 PEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLT 216
           P QY+  MNA+F+A  S +PID+  L   ++ FLQQASYIT G   +     GL  YL+ 
Sbjct: 200 PVQYIPTMNAVFAAAHSQIPIDTLALSG-DATFLQQASYITDGTFMQAASPRGLLSYLMF 258

Query: 217 IFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 274
            +  D  +R+ L  P    VDFRA+CFCH   +D G++CS+CLSI+C      +C TCG+
Sbjct: 259 AYSADAEARSSLIPPTHHTVDFRAACFCHGRVVDTGFVCSICLSIFCDVPENSECLTCGT 318


>gi|302759787|ref|XP_002963316.1| hypothetical protein SELMODRAFT_438480 [Selaginella moellendorffii]
 gi|300168584|gb|EFJ35187.1| hypothetical protein SELMODRAFT_438480 [Selaginella moellendorffii]
          Length = 916

 Score =  152 bits (385), Expect = 1e-34,   Method: Composition-based stats.
 Identities = 85/175 (48%), Positives = 115/175 (65%), Gaps = 11/175 (6%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           MA AP     DD SLVVVLLDT+  FW++   S S  L  VL+FLN++  +N +N +V+I
Sbjct: 706 MAGAPP---PDDSSLVVVLLDTSLHFWNAVVRS-SSGLQQVLSFLNSLALMNHMNHLVII 761

Query: 61  ATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSL 120
           A+G NSC+++YDS  +  ++  N +  S    ++  L +F+++D     Q P  +   SL
Sbjct: 762 ASGCNSCEFLYDSLDSDERAGANHQTAS--EKIVHKLNDFVSRD---ATQPP--KFQPSL 814

Query: 121 LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175
           LS SLSMALCYIQR  +  +   +PR+LCLQ SPD P QY+AIMNAIFSAQRSMV
Sbjct: 815 LSASLSMALCYIQRALKEKIGPSKPRMLCLQASPDAPHQYIAIMNAIFSAQRSMV 869


>gi|115387571|ref|XP_001211291.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114195375|gb|EAU37075.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 380

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 102/323 (31%), Positives = 153/323 (47%), Gaps = 66/323 (20%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ V+LDTNP  W+    SL  S  + ++L FLNA L  N  N+V V+A+   +  ++Y
Sbjct: 20  SLLTVILDTNPHAWAVLEPSLPLSTAIANILVFLNAHLACNYANEVAVVASHSQTATWLY 79

Query: 72  DSSSTGNQSVG-----------NGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACS- 119
              +  + ++            NG  P   A+ +     F   +EQ+ +   E   A S 
Sbjct: 80  PRETRHDTTIASGTDRDGDIAMNGSAP---ASQVNKYRPFRIVEEQVTQHLRELMEATSG 136

Query: 120 ---------LLSGSLSMALCYIQR------------------------------------ 134
                    +++G+L++AL +I R                                    
Sbjct: 137 ADVQATTSTMMAGALTLALSHINRRSLAWSEAHGADTAGDGAGASAGGGGGGGSGAGGAG 196

Query: 135 -VFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQA 193
               +  L  +  I+ + GS D   QY+ +MN IF+ QR  +PID C L   ++ FLQQA
Sbjct: 197 RAGGAEGLQSRILIVSVSGSSDSAHQYIPMMNGIFACQRLHIPIDVCKLSG-DAVFLQQA 255

Query: 194 SYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGY 253
           S  T GV     +  GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G+
Sbjct: 256 SDATKGVFMSLAEPRGLLQYLMMAFLPDQRSRKHLVLPTRVDVDFRAACFCHRRVVDVGF 315

Query: 254 ICSVCLSIYCKHLKK--CSTCGS 274
           +CS+CLSI+C+  +   C TCG+
Sbjct: 316 VCSICLSIFCEPPENGDCLTCGT 338


>gi|332840758|ref|XP_003314057.1| PREDICTED: general transcription factor IIH subunit 3 [Pan
           troglodytes]
 gi|397481836|ref|XP_003812143.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
           [Pan paniscus]
          Length = 267

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 91/245 (37%), Positives = 142/245 (57%), Gaps = 15/245 (6%)

Query: 46  NAILTLNQLNQVVVIATGYNSCDYVY--DSSSTGNQSVGNGRMPSLCAT--------LLQ 95
           N+ L +N+ N++ VIA+      ++Y   +   G+     G +P    +        LL 
Sbjct: 5   NSHLFMNRSNKLAVIASHIQESRFLYPGKNGRLGDFFGDPGNLPEFNPSGSKDGKYELLT 64

Query: 96  NLEEFMNKD--EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQ 151
           +  E + ++  + + K + +G+   +LL+GSL+ ALCYI R+ +        + RIL ++
Sbjct: 65  SANEVIVEEIKDLMTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIK 124

Query: 152 GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLF 211
            + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L 
Sbjct: 125 AAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLL 183

Query: 212 QYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCST 271
           QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+T
Sbjct: 184 QYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTT 243

Query: 272 CGSVF 276
           C + F
Sbjct: 244 CETAF 248


>gi|431912136|gb|ELK14274.1| General transcription factor IIH subunit 3 [Pteropus alecto]
          Length = 292

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 96/285 (33%), Positives = 150/285 (52%), Gaps = 40/285 (14%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++DTNP +W   +L  SQF     +  V+   N+ L ++  N++ VIA+  
Sbjct: 5   EDELNLLVIIVDTNPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMDHSNKLAVIASHI 64

Query: 65  NSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGS 124
               ++Y           NGR+  L         EF     + GK E        LL+ +
Sbjct: 65  QESRFLYPGK--------NGRLGDLFGDPSNPPSEFNPSGSKDGKYE--------LLTAA 108

Query: 125 LSMALCYIQRVFRSG---------LLH--------PQPRILCLQGSPDGPEQYVAIMNAI 167
             + +  I+ +  +G         + H        P P  + ++ + D   QY+  MN I
Sbjct: 109 NEVIVEEIKDLMTNGPNPVADFVPVCHSQGSSFSSPHPGQV-IKAAEDSALQYMNFMNVI 167

Query: 168 FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 227
           F+AQ+  + ID+C LG+ +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ 
Sbjct: 168 FAAQKQNILIDACVLGS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQ 226

Query: 228 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 272
           L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC
Sbjct: 227 LILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTC 271


>gi|350630603|gb|EHA18975.1| hypothetical protein ASPNIDRAFT_42792 [Aspergillus niger ATCC 1015]
          Length = 380

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 104/320 (32%), Positives = 157/320 (49%), Gaps = 60/320 (18%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+  + SL  S  + ++L F+NA L  N  N+V V+++  +   ++Y
Sbjct: 20  SLLTIILDTNPHAWALLNPSLPLSTAIANILVFINAHLACNYANEVAVVSSHTSQATWLY 79

Query: 72  -------------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQ--------LGKQ 110
                        +  S G+ S+ + + P    T +     F   +EQ        + + 
Sbjct: 80  PVEKAATTTTAANNLDSDGDISMRSQQPPPSTHTNVNKYRPFRIVEEQVSTHLHNLMSQT 139

Query: 111 EPEG--RIACSLLSGSLSMALCYIQR------------VFRSGLLHP------------- 143
            PE       ++++G+L++AL +I R            +  +    P             
Sbjct: 140 TPETLQSTTSTMMAGALTLALSHINRRTLAWNEAHGGDIDTTNPNDPSSSSSSSHNTKGT 199

Query: 144 -----QPRILCLQ--GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYI 196
                Q RIL L    S     QY+ IMNAIF+ QR  +PID C +   ++ FLQQAS  
Sbjct: 200 AESTLQSRILILSVSSSTGSAHQYIPIMNAIFACQRLHIPIDVCKVSG-DAVFLQQASDA 258

Query: 197 TGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICS 256
           T GV+    +  GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS
Sbjct: 259 TKGVYMALDEPKGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDVGFVCS 318

Query: 257 VCLSIYCKHLKK--CSTCGS 274
           +CLSI+C+ L    C TCGS
Sbjct: 319 ICLSIFCEPLGDGVCLTCGS 338


>gi|119618833|gb|EAW98427.1| general transcription factor IIH, polypeptide 3, 34kDa, isoform
           CRA_b [Homo sapiens]
          Length = 274

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 89/258 (34%), Positives = 145/258 (56%), Gaps = 27/258 (10%)

Query: 46  NAILTLNQLNQVVVIATGYNSCDYVY------------------DSSSTGNQSVGNGRMP 87
           N+ L +N+ N++ VIA+      ++Y                  + + +G++      + 
Sbjct: 5   NSHLFMNRSNKLAVIASHIQESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLT 64

Query: 88  SLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--P 145
           S    +++ +++ M K +  G+         +LL+GSL+ ALCYI R+ +    + +   
Sbjct: 65  SANEVIVEEIKDLMTKSDIKGQHTE------TLLAGSLAKALCYIHRMNKEVKDNQEMKS 118

Query: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 205
           RIL ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  
Sbjct: 119 RILVIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVP 177

Query: 206 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 265
           Q+  L QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C  
Sbjct: 178 QMPSLLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNF 237

Query: 266 LKKCSTCGSVFGQAQTQS 283
              C+TC +   + +T+S
Sbjct: 238 SPICTTCDAESQEKETES 255


>gi|145254849|ref|XP_001398778.1| transcription factor TFIIH complex subunit [Aspergillus niger CBS
           513.88]
 gi|134084362|emb|CAK48702.1| unnamed protein product [Aspergillus niger]
          Length = 379

 Score =  151 bits (381), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 104/319 (32%), Positives = 156/319 (48%), Gaps = 59/319 (18%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+  + SL  S  + ++L F+NA L  N  N+V V+++  +   ++Y
Sbjct: 20  SLLTIILDTNPHAWALLNPSLPLSTAIANILVFINAHLACNYANEVAVVSSHTSQATWLY 79

Query: 72  DSS------------STGNQSVGNGRMPSLCATLLQNLEEFMNKDEQ--------LGKQE 111
                          S G+ S+ + + P    T +     F   +EQ        + +  
Sbjct: 80  PVEKATTTTTANNLDSDGDISMRSQQPPPSTHTNINKYRPFRIVEEQVSTHLHNLMSQTT 139

Query: 112 PEG--RIACSLLSGSLSMALCYIQR------------VFRSGLLHP-------------- 143
           PE       ++++G+L++AL +I R            +  +    P              
Sbjct: 140 PETLQSTTSTMMAGALTLALSHINRRTLAWNEAHGGDIDTTNPNDPSSTSSSSHNTKGTA 199

Query: 144 ----QPRILCLQ--GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 197
               Q RIL L    S     QY+ IMNAIF+ QR  +PID C +   ++ FLQQAS  T
Sbjct: 200 ESTLQSRILILSVSSSTGSAHQYIPIMNAIFACQRLHIPIDVCKVSG-DAVFLQQASDAT 258

Query: 198 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 257
            GV+    +  GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS+
Sbjct: 259 KGVYMALDEPKGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDVGFVCSI 318

Query: 258 CLSIYCKHLKK--CSTCGS 274
           CLSI+C+ L    C TCGS
Sbjct: 319 CLSIFCEPLGDGVCLTCGS 337


>gi|406863689|gb|EKD16736.1| hypothetical protein MBM_05205 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 357

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 157/303 (51%), Gaps = 48/303 (15%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           +L+VV++DTNP  W+  S +L  S+ + ++L F+NA L +N  NQV ++A+  +   ++Y
Sbjct: 20  ALLVVVIDTNPHAWALLSPTLPLSKAIANILVFINAHLAVNNANQVAIVASHCHRAAWLY 79

Query: 72  ---------------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRI 116
                          D +++ N    N   P     L++++   +N   +L     E  +
Sbjct: 80  PHPASTSSSEDVEMSDDAASSNIDNPNKYRPF---ALVEDI--LLNSLRELIATTTESDV 134

Query: 117 ACSL---LSGSLSMALCYIQRVFR--SGL------------------LHPQPRILCLQGS 153
           A +    ++G+L++AL YI +  +  SG+                  L  Q RIL +  S
Sbjct: 135 ATTTSTQMAGALTLALSYIHKATQQFSGIDTESKPATAPSTENADTILGLQSRILVVSVS 194

Query: 154 PDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQY 213
            D   QY+ IMN  F+AQR  +PID   L A ++ FLQQAS  T GV+ + +   GL QY
Sbjct: 195 GDLAHQYIPIMNTTFAAQRLRIPIDILKL-AGDTVFLQQASDATKGVYMQLRNPQGLLQY 253

Query: 214 LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--KHLKKCST 271
           L+  F  D  +R  L  P    VDFRA+CFCH+  +D+GY+CS+CLSI+C       C T
Sbjct: 254 LMMAFLPDQVARKHLVPPTQEVVDFRAACFCHRKVVDIGYVCSICLSIFCVPPEGAVCLT 313

Query: 272 CGS 274
           C +
Sbjct: 314 CSA 316


>gi|310794652|gb|EFQ30113.1| transcription factor tfb4 [Glomerella graminicola M1.001]
          Length = 365

 Score =  150 bits (380), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 100/307 (32%), Positives = 148/307 (48%), Gaps = 52/307 (16%)

Query: 14  SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ +++DTNP  W++    L  S+ + ++L F+NA L  +  NQV + A+  N   ++Y
Sbjct: 20  SLLSIVIDTNPRAWAALNDVLPLSKAIANILVFVNAHLAFSNANQVAIFASHSNRAVWLY 79

Query: 72  DS------------------------SSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQL 107
            S                        + T NQS  N + P       Q     M+   +L
Sbjct: 80  PSKPEDAGANGAPYAEDVTMTDADFFAPTPNQSSAN-KFPQFA----QIEAAVMSSMRKL 134

Query: 108 GKQEPEGRIACSL--LSGSLSMALCYIQ----------------RVFRSGLLHPQPRILC 149
                E  +AC+   LSG+L++AL +I                 R   S L   + RI  
Sbjct: 135 VDATTEADLACTTTQLSGALTLALAHINKTSLSLNETNKAPDVARSTSSALSRLRARIFV 194

Query: 150 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 209
           L  S   P QY++ MNA+F+A  S +PID+  L   ++ FLQQASYIT G   +     G
Sbjct: 195 LSVSDSEPVQYISTMNAVFAAAHSQIPIDTLALSG-DATFLQQASYITDGTFMQAASPRG 253

Query: 210 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLK 267
           L  YL+  +  D  +R+ L  P    VDFRA+CFCH   +D G++CS+CLSI+C      
Sbjct: 254 LLSYLMFAYSADAEARSSLIPPTHHTVDFRAACFCHGRVVDTGFVCSICLSIFCDVPDGS 313

Query: 268 KCSTCGS 274
           +C TCG+
Sbjct: 314 ECLTCGT 320


>gi|255953875|ref|XP_002567690.1| Pc21g06450 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211589401|emb|CAP95542.1| Pc21g06450 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 374

 Score =  150 bits (380), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 105/320 (32%), Positives = 158/320 (49%), Gaps = 64/320 (20%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ +++DTNP  W+    SL  S+ + ++L F+NA L  N  N+V V+A+      ++Y
Sbjct: 20  SLLTIVIDTNPHAWAVLEDSLPLSKAIANILVFINAHLACNYANEVAVVASHSQKAAWLY 79

Query: 72  DSSST---------GNQSVGNGRMPSLCAT------------LLQNLEEFMNKDEQLGKQ 110
            S S          G+ ++      +   T            + +NL+E M  D   G  
Sbjct: 80  PSHSEPQYRPADHDGDVTMNGTSDTTTPKTNKYRPFRIVEEQVTRNLKELM--DSTTG-D 136

Query: 111 EPEGRIACSLLSGSLSMALCYI-QRVFRSGLLHP-------------------------- 143
           +  G ++ ++L+G+L++AL +I +R       H                           
Sbjct: 137 DLRGNMS-TMLAGALTLALSHINRRTLAWAEEHGGANTEDAAADGNGGGTTSTNRYSASN 195

Query: 144 -----QPRILCLQ--GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYI 196
                Q RIL +   GS D   QY+ +MN+IF+ QR  +PID C L   ++ FLQQAS  
Sbjct: 196 EDERLQSRILVVSVSGSTDAAHQYIPVMNSIFACQRLSIPIDVCKLSG-DAVFLQQASDA 254

Query: 197 TGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICS 256
           T GV+    +  GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS
Sbjct: 255 TKGVYMALAEPRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDVGFVCS 314

Query: 257 VCLSIYCKHLK--KCSTCGS 274
           +CLSI+C+      C TCGS
Sbjct: 315 ICLSIFCEPPPGGDCMTCGS 334


>gi|428673524|ref|NP_001258796.1| general transcription factor IIH subunit 3 isoform c [Homo sapiens]
 gi|194389784|dbj|BAG60408.1| unnamed protein product [Homo sapiens]
          Length = 267

 Score =  150 bits (379), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 91/245 (37%), Positives = 142/245 (57%), Gaps = 15/245 (6%)

Query: 46  NAILTLNQLNQVVVIATGYNSCDYVY--DSSSTGNQSVGNGRMPSLCAT--------LLQ 95
           N+ L +N+ N++ VIA+      ++Y   +   G+     G  P    +        LL 
Sbjct: 5   NSHLFMNRSNKLAVIASHIQESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLT 64

Query: 96  NLEEFMNKD--EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQ 151
           +  E + ++  + + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++
Sbjct: 65  SANEVIVEEIKDLMTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIK 124

Query: 152 GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLF 211
            + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L 
Sbjct: 125 AAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLL 183

Query: 212 QYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCST 271
           QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+T
Sbjct: 184 QYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTT 243

Query: 272 CGSVF 276
           C + F
Sbjct: 244 CETAF 248


>gi|402888057|ref|XP_003907393.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
           [Papio anubis]
 gi|426374590|ref|XP_004054153.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
           [Gorilla gorilla gorilla]
          Length = 267

 Score =  150 bits (379), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 91/245 (37%), Positives = 141/245 (57%), Gaps = 15/245 (6%)

Query: 46  NAILTLNQLNQVVVIATGYNSCDYVY--DSSSTGNQSVGNGRMPSLCAT--------LLQ 95
           N+ L +N+ N++ VIA+      ++Y   +   G+     G  P    +        LL 
Sbjct: 5   NSHLFMNRSNKLAVIASHIQESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLT 64

Query: 96  NLEEFMNKD--EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQ 151
           +  E + ++  + + K + +G+   +LL+GSL+ ALCYI R+ +        + RIL ++
Sbjct: 65  SANEVIVEEIKDLMTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIK 124

Query: 152 GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLF 211
            + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L 
Sbjct: 125 AAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLL 183

Query: 212 QYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCST 271
           QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+T
Sbjct: 184 QYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTT 243

Query: 272 CGSVF 276
           C + F
Sbjct: 244 CETAF 248


>gi|70997489|ref|XP_753492.1| transcription factor TFIIH subunit Tfb4 [Aspergillus fumigatus
           Af293]
 gi|66851128|gb|EAL91454.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
           fumigatus Af293]
 gi|159126779|gb|EDP51895.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
           fumigatus A1163]
          Length = 387

 Score =  150 bits (379), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 102/325 (31%), Positives = 153/325 (47%), Gaps = 65/325 (20%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+    SL  S  + ++L F+NA L  N  N+V V+A+      ++Y
Sbjct: 20  SLLTIVLDTNPHAWALLEDSLPLSTAIANILVFVNAHLACNYANEVAVVASHSQKATWLY 79

Query: 72  ----------DSSSTGNQSVGNGRMPSLCATL--------LQNLEEFMNKDEQLGKQEPE 113
                      +S TG    G+  M    A          +     F   +EQ+ +   E
Sbjct: 80  PCETKEDNGKSTSKTGRDEDGDVAMNGSGAGAGAGFAAAQVNKYRPFRIVEEQVTRNLRE 139

Query: 114 ----------GRIACSLLSGSLSMALCYIQR--------------------------VFR 137
                          ++++G+L++AL +I R                            R
Sbjct: 140 LMDSTSGADVAATTSTMMAGALTLALSHINRRSIAWADAHGGTAAGPPGAVEAGSSGAGR 199

Query: 138 SGL------LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQ 191
           +G       L  +  I+ + GS D   QY+ IMN IF+ QR  +PID C L   ++ FLQ
Sbjct: 200 AGADTGAEGLQSRILIISVSGSTDSAHQYIPIMNCIFACQRLHIPIDVCKLSG-DAVFLQ 258

Query: 192 QASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDM 251
           QAS  T GV+    +  GL QYL+  F  D  SR  L +P  V VDFRA+CFCH+  +D+
Sbjct: 259 QASDATKGVYMSLSEPRGLLQYLMMAFLPDQRSRRHLVIPTRVDVDFRAACFCHRRVVDI 318

Query: 252 GYICSVCLSIYCKHLKK--CSTCGS 274
           G++CS+CLSI+C+  +   C TCG+
Sbjct: 319 GFVCSICLSIFCEPPENGDCLTCGT 343


>gi|119479015|ref|XP_001259536.1| transcription factor TFIIH subunit Tfb4, putative [Neosartorya
           fischeri NRRL 181]
 gi|119407690|gb|EAW17639.1| transcription factor TFIIH subunit Tfb4, putative [Neosartorya
           fischeri NRRL 181]
          Length = 385

 Score =  150 bits (379), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 102/325 (31%), Positives = 153/325 (47%), Gaps = 65/325 (20%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+    SL  S  + ++L F+NA L  N  N+V V+A+      ++Y
Sbjct: 20  SLLTIVLDTNPHAWALLKDSLPLSTAIANILVFVNAHLACNYANEVAVVASHSQKATWLY 79

Query: 72  ----------DSSSTGNQSVGNGRMPSLCATL--------LQNLEEFMNKDEQLGKQEPE 113
                      +S TG    G+  M    A          +     F   +EQ+ +   E
Sbjct: 80  PRETKEDNGKSNSKTGRDEDGDVAMNGSGAGAGAGSAAAQVNKYRPFRIVEEQVTRNLRE 139

Query: 114 ----------GRIACSLLSGSLSMALCYIQR--------------------------VFR 137
                          ++++G+L++AL +I R                            R
Sbjct: 140 LMDSTSGADVAATTSTMMAGALTLALSHINRRSIAWADAHGGTAAGPPGAVEAGSSGAGR 199

Query: 138 SGL------LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQ 191
           +G       L  +  I+ + GS D   QY+ IMN IF+ QR  +PID C L   ++ FLQ
Sbjct: 200 AGTDTGAEGLQSRILIISVSGSTDSAHQYIPIMNCIFACQRLHIPIDVCKLSG-DAVFLQ 258

Query: 192 QASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDM 251
           QAS  T GV+    +  GL QYL+  F  D  SR  L +P  V VDFRA+CFCH+  +D+
Sbjct: 259 QASDATKGVYMSLSEPRGLLQYLMMAFLPDQRSRRHLVIPTRVDVDFRAACFCHRRVVDI 318

Query: 252 GYICSVCLSIYCKHLKK--CSTCGS 274
           G++CS+CLSI+C+  +   C TCG+
Sbjct: 319 GFVCSICLSIFCEPPENGDCLTCGT 343


>gi|187608008|ref|NP_001120625.1| general transcription factor IIH, polypeptide 3, 34kDa [Xenopus
           (Silurana) tropicalis]
 gi|171846721|gb|AAI61775.1| gtf2h3 protein [Xenopus (Silurana) tropicalis]
          Length = 200

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/183 (42%), Positives = 115/183 (62%), Gaps = 8/183 (4%)

Query: 96  NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGS 153
           ++++ M K  Q+  Q+ E     ++L+GSL+ ALCYI ++ +        + RIL ++ +
Sbjct: 5   DIKDLMTKTGQINGQQTE-----TVLAGSLAKALCYINKIAKDTKAGQEVKSRILVIKAA 59

Query: 154 PDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQY 213
            D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q + L QY
Sbjct: 60  EDSALQYMNFMNVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGIYLKLPQANSLLQY 118

Query: 214 LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCG 273
           LL +F  D   R+ L L  PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC 
Sbjct: 119 LLWVFLPDPDQRSHLNLLPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCE 178

Query: 274 SVF 276
           + F
Sbjct: 179 TAF 181


>gi|441630270|ref|XP_004089523.1| PREDICTED: general transcription factor IIH subunit 3 isoform 3
           [Nomascus leucogenys]
          Length = 267

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 91/245 (37%), Positives = 141/245 (57%), Gaps = 15/245 (6%)

Query: 46  NAILTLNQLNQVVVIATGYNSCDYVY--DSSSTGNQSVGNGRMPSLCAT--------LLQ 95
           N+ L +N+ N++ VIA+      ++Y   +   G+     G  P    +        LL 
Sbjct: 5   NSHLFMNRSNKLAVIASHIQESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLT 64

Query: 96  NLEEFMNKD--EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQ 151
           +  E + ++  + + K + +G+   +LL+GSL+ ALCYI R+ +        + RIL ++
Sbjct: 65  SANEVIVEEIKDLMTKGDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIK 124

Query: 152 GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLF 211
            + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L 
Sbjct: 125 AAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLL 183

Query: 212 QYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCST 271
           QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+T
Sbjct: 184 QYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTT 243

Query: 272 CGSVF 276
           C + F
Sbjct: 244 CETAF 248


>gi|342873217|gb|EGU75428.1| hypothetical protein FOXB_14053 [Fusarium oxysporum Fo5176]
          Length = 358

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 97/303 (32%), Positives = 153/303 (50%), Gaps = 49/303 (16%)

Query: 14  SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ V+LDTNP  W+S  + L  S+ + ++L F+NA L  +  NQV +IA   +  +++Y
Sbjct: 20  SLLTVVLDTNPRAWASLNNVLPLSRAIANILVFVNAHLAFSNANQVALIAAHVDRAEWLY 79

Query: 72  -----------------DSSSTGNQSVGNGRMPSLC---ATLLQNLEEFMNKDEQLGKQE 111
                            D+S T  Q+  N + P      A +L  + + M++      +E
Sbjct: 80  PTPPKPSRDASGDVAMNDASQTQTQTSAN-KFPQFAQIEAAVLAAIRKLMDQ-----TKE 133

Query: 112 PEGRIACSLLSGSLSMALCYIQRVFRS-------------GLLHPQP-----RILCLQGS 153
            +     + +SG+L++ALC+I +  ++             G  +  P     RIL +  S
Sbjct: 134 EDLSATTTQISGALTLALCHINKAAQALCAPTANLEDSHKGSSNTAPPTVRGRILVISVS 193

Query: 154 PDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQY 213
              P QY+  MNA+F+A  + V ID+  L  + + FLQQA Y TGG +       GL  Y
Sbjct: 194 DSEPSQYIPTMNAVFAAGHTQVAIDTLSLTGEPT-FLQQACYNTGGTYLAATHPQGLLNY 252

Query: 214 LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLK--KCST 271
           L+     D  +R  L  P    VDFRA+CFCH   +D G++CS+CLSI+C+  +  +C T
Sbjct: 253 LMFGLIADTEAREALIAPTHDTVDFRAACFCHGRVVDTGFVCSICLSIFCETPENSECFT 312

Query: 272 CGS 274
           CG+
Sbjct: 313 CGT 315


>gi|21410252|gb|AAH31030.1| Unknown (protein for IMAGE:4724313), partial [Homo sapiens]
          Length = 235

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 78/187 (41%), Positives = 117/187 (62%), Gaps = 9/187 (4%)

Query: 92  TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILC 149
            +++ +++ M K +  G+         +LL+GSL+ ALCYI R+ +    + +   RIL 
Sbjct: 37  VIVEEIKDLMTKSDIKGQHTE------TLLAGSLAKALCYIHRMNKEVKDNQEMKSRILV 90

Query: 150 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 209
           ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  
Sbjct: 91  IKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPS 149

Query: 210 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 269
           L QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C
Sbjct: 150 LLQYLLWVFLPDQDQRSRLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPIC 209

Query: 270 STCGSVF 276
           +TC + F
Sbjct: 210 TTCETAF 216


>gi|384494657|gb|EIE85148.1| hypothetical protein RO3G_09858 [Rhizopus delemar RA 99-880]
          Length = 268

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 98/286 (34%), Positives = 144/286 (50%), Gaps = 59/286 (20%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSS-----LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGY 64
            DD +L+V+++DTNPF W+ S+     LS    L  +L ++NA L L   N + VIA+  
Sbjct: 8   EDDSNLLVIIIDTNPFAWNDSAKAELPLSLDDALNQILIYINAHLALKHNNNIAVIASHV 67

Query: 65  NSCDYVY---DSSSTGNQSVGNGRMPSL-------CATLLQNLEEFMNKDEQLGKQEPEG 114
               ++Y   D+      +  N +  ++          ++ NL+  ++  +     +  G
Sbjct: 68  GHSQFLYPLPDNQEPAEGANFNKKDANVYPYFQFVTDQVVSNLQRLLSNTDTSFLTKDMG 127

Query: 115 RIACSLLSGSLSMALCYIQRVFR---SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ 171
               S ++G+LSMALCYI R+ +    G +  +PRIL L  SPD   QY+ +MN IFSAQ
Sbjct: 128 PTT-STITGALSMALCYINRITKLDDEGYI--KPRILVLSVSPDSAFQYIPLMNCIFSAQ 184

Query: 172 RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 231
           ++      CY   Q+ AFL                              D +SRNFL LP
Sbjct: 185 KA------CY---QHMAFL-----------------------------PDRYSRNFLNLP 206

Query: 232 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFG 277
               VDFRA+CFCHK  +D+G++CSVCLSI+CK    CSTC + F 
Sbjct: 207 TQDQVDFRAACFCHKQIVDIGFVCSVCLSIFCKWSPVCSTCKTKFA 252


>gi|358389021|gb|EHK26614.1| hypothetical protein TRIVIDRAFT_69532 [Trichoderma virens Gv29-8]
          Length = 363

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 101/307 (32%), Positives = 153/307 (49%), Gaps = 45/307 (14%)

Query: 14  SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W++  SSLS +Q ++++L F+NA L  +  NQV VIA   N   ++Y
Sbjct: 20  SLLSIVLDTNPRAWAALDSSLSLAQAISNILVFVNAHLAFSNTNQVAVIAAHVNRAVWLY 79

Query: 72  ------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQE-----------PEG 114
                  +++T   + G+ +M  + A    +     NK  Q  + E            E 
Sbjct: 80  PAPQNPSATTTTKDNSGDVQMQDVSAET-NSSSASANKYPQFAQIESSVFSSIQTLMAET 138

Query: 115 RI-----ACSLLSGSLSMALCYIQRVFRS--------------GLLHPQP---RILCLQG 152
            +       + LSG+L++ALC I +  +S                  P P   RI+ +  
Sbjct: 139 TVQDLDQVTTQLSGALTLALCRINKTAQSLSSSDTTISNAAPVNSTAPPPVKGRIVVISV 198

Query: 153 SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 212
           S   P QY+  MNA+F+A  + V ID+  L A +S FLQQA + T G+  K     GL  
Sbjct: 199 SDSEPSQYIPTMNAVFAAAHNQVAIDTIAL-AGDSTFLQQACFNTNGIFLKASNPQGLLT 257

Query: 213 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLK--KCS 270
           YL+     D  +R  L  P    VDFR +CFCH   +D G++CSVCLSI+C+  +  +C 
Sbjct: 258 YLMFGLIPDTEARGSLITPTHDTVDFRTACFCHGKVVDTGFVCSVCLSIFCEPPENAECL 317

Query: 271 TCGSVFG 277
           TCG++  
Sbjct: 318 TCGTILA 324


>gi|321470747|gb|EFX81722.1| hypothetical protein DAPPUDRAFT_224195 [Daphnia pulex]
          Length = 301

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 93/279 (33%), Positives = 143/279 (51%), Gaps = 19/279 (6%)

Query: 12  DVSLVVVLLDTNPF-------FWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGY 64
           +  LV +++D  P           + S   ++    ++ F NA L     N++ VIA   
Sbjct: 9   ETPLVAIVIDATPAKKFASQAIIDNGSTMMNRIFDALIGFGNAHLMQGSHNKLAVIACNS 68

Query: 65  NSCDYVY-----DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACS 119
            S  ++Y     DS S  N+         + + +   L E + KD      + +   + S
Sbjct: 69  KSTKFLYPSENGDSKSYHNRPGQYDMFAKVDSDIRHGLSELILKDSI----DHDSTTSDS 124

Query: 120 LLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPI 177
           LL G+++ ALC+I ++ R   L  Q  PRIL + GS D   QY+  MN  F+AQ+  V I
Sbjct: 125 LLGGAMARALCHIHKIQRELSLSHQLKPRILVVSGSSDSALQYMTFMNVFFTAQKENVVI 184

Query: 178 DSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVD 237
           D C + + +S  LQQ   ITGG + +   + GL +YLL +F      R+ + LP P  VD
Sbjct: 185 DCCMMDS-DSGLLQQGCDITGGQYLRIPTVVGLLEYLLWVFLPGPSCRSKIVLPPPTKVD 243

Query: 238 FRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +RA+CFCH   +D+G++CSVCLSI+CK    C+TC + F
Sbjct: 244 YRAACFCHHKLVDIGWVCSVCLSIFCKFSIICTTCNTEF 282


>gi|320582712|gb|EFW96929.1| Subunit of TFIIH complex [Ogataea parapolymorpha DL-1]
          Length = 341

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 99/294 (33%), Positives = 156/294 (53%), Gaps = 22/294 (7%)

Query: 5   PSKLYSDDVSLVVVLLDTNPFFW----SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVV- 59
           P KL  +  SL+ V++D NP  W       S+S  Q ++ ++  LN+ L+LN  N V + 
Sbjct: 16  PHKLVDETPSLLAVVVDINPLEWRKLEKDRSVSLKQVVSSIIVMLNSHLSLNSSNHVALY 75

Query: 60  IATGYN---SCDYVYDSSSTGNQSVGNGRMPS----LCATLLQNLEEFMNK-----DEQL 107
           +A  Y+      Y    SST  + +    M      +  +++  +E  +N+      + +
Sbjct: 76  LANSYSHGAQLIYPLTESSTSTRFLHTPGMYRQFRFIDGSIVDKIEHLLNEQPDTLSQLI 135

Query: 108 GKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQGSPDGPEQYVAIMNA 166
           G+++ +  I  +L +G++SMALC+I ++ +S  L+  + R+L +  S D    YV++MN 
Sbjct: 136 GREQGKQHIKGTL-AGAMSMALCHINKIQQSDELNALKSRLLVVSISDDSTLPYVSVMNT 194

Query: 167 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 226
           IF++Q+  + +D C LG  +S FLQQA+  T GV+    Q +GL QYL T    D   R 
Sbjct: 195 IFASQKMKISVDVCKLGP-SSTFLQQAADATNGVYIYITQPEGLIQYLTTALFIDPMLRP 253

Query: 227 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQ 278
            + LP    +DFRASCF     ID+GY+CSVCL I       +KC TC S F  
Sbjct: 254 IIVLPTDESIDFRASCFITNKVIDVGYVCSVCLCILSVIPEDEKCPTCHSKFDH 307


>gi|432090055|gb|ELK23655.1| General transcription factor IIH subunit 3 [Myotis davidii]
          Length = 247

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 78/188 (41%), Positives = 117/188 (62%), Gaps = 11/188 (5%)

Query: 92  TLLQNLEEFMNKDEQLGKQ-EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRIL 148
            +++ +++ M K +  G+  EP       LL+GSL+ ALCYI R+ +    + +   RIL
Sbjct: 49  VIVEEIKDLMTKSDIKGQHTEP-------LLAGSLAKALCYIHRMHKEVKDNQEMKSRIL 101

Query: 149 CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 208
            ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+ 
Sbjct: 102 VIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMP 160

Query: 209 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 268
            L QYLL +F  D   R+ L LP P+ VD+RA+C CH+N I++GY+CSVCLSI+C     
Sbjct: 161 SLLQYLLWVFLPDQDQRSQLTLPPPIHVDYRAACLCHRNLIEIGYVCSVCLSIFCNFSPI 220

Query: 269 CSTCGSVF 276
           C+TC + F
Sbjct: 221 CTTCETAF 228


>gi|242764012|ref|XP_002340688.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218723884|gb|EED23301.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 380

 Score =  147 bits (370), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 109/321 (33%), Positives = 156/321 (48%), Gaps = 61/321 (19%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LD NP  W+     L FS+ L ++L F+NA L  N  N+V V+A+      ++Y
Sbjct: 20  SLLAIVLDVNPHAWALLEDQLPFSKALANLLVFINAHLAFNYTNEVAVVASHSQRAAWLY 79

Query: 72  DS------SSTGNQSV-GNGRMPSLCA-------TLLQN--------LEE--FMNKDEQL 107
            +      +ST  Q   G+  M    A       T  QN        +EE    N  E L
Sbjct: 80  PTHSQQQKTSTNRQDADGDIEMNGQEAHNIHSSRTSTQNNMYRPFRVVEEQVLENVRELL 139

Query: 108 GKQEPEGRIACS-LLSGSLSMALCYIQRVFRSGL-------------------------- 140
              +     A S +L+G+L++AL +I R   +                            
Sbjct: 140 ASTDASDVSATSTMLAGALTLALSHINRRTMTWTETHGNSSVDTANAATGSSSAAAVSGG 199

Query: 141 ---LHPQPRILCLQ--GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASY 195
              L  Q RIL +    S D   QY+ IMN+IF+ QR  +PID C L   ++ FLQQAS 
Sbjct: 200 NVSLGLQSRILIVSVSSSTDSAHQYIPIMNSIFACQRLHIPIDVCKLSG-DAVFLQQASD 258

Query: 196 ITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYIC 255
            T GV+    +  GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +++G++C
Sbjct: 259 ATRGVYMSLTEPRGLLQYLMMAFLPDQRSRKHLVLPSRVDVDFRAACFCHRRVVNIGFVC 318

Query: 256 SVCLSIYCKHLKK--CSTCGS 274
           S+CLSI+C+  +   C TCG+
Sbjct: 319 SICLSIFCEPPENGDCLTCGT 339


>gi|402224045|gb|EJU04108.1| transcription factor Tfb4 [Dacryopinax sp. DJM-731 SS1]
          Length = 305

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 96/275 (34%), Positives = 129/275 (46%), Gaps = 21/275 (7%)

Query: 18  VLLDTNPFFW-----SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYD 72
           ++LD  P  W     S+  L    FL   L F+NA L L   N V V          +Y 
Sbjct: 9   LILDLTPVPWDALSHSTPPLRLKDFLAQALVFINAHLALRNENSVSVFGALPGRSMILYP 68

Query: 73  SSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGK--QEPEGRIACSLLSGSLSMALC 130
            + +                  Q  E FM   EQ  +  QE E  +  +L+ G+L+ +LC
Sbjct: 69  PAGSTQVEADENTF----HVFGQVDEGFMRSVEQEVEDLQEVEEDVPPALV-GALTKSLC 123

Query: 131 YIQRVFRSGLLHP---------QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCY 181
           YI R+                   RIL    SPD    Y+  MN IFSAQ+  V ID C 
Sbjct: 124 YINRLSNPPPPSNPSEAPPPPPHSRILLFSVSPDASVAYIPFMNCIFSAQKLKVAIDVCK 183

Query: 182 LGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRAS 241
           L      FL QA ++TGG      Q + L QYL   F ++  +R  L +P    VDFRA+
Sbjct: 184 LDESEVIFLHQACHLTGGAFVPITQREALLQYLSMCFLSETETRKTLAVPTLDKVDFRAA 243

Query: 242 CFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           CFCHK  +D+GY+CSVCLSI+C+ + +CSTC + F
Sbjct: 244 CFCHKEIVDIGYVCSVCLSIFCQPVLECSTCRTKF 278


>gi|154310839|ref|XP_001554750.1| hypothetical protein BC1G_06398 [Botryotinia fuckeliana B05.10]
          Length = 356

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 101/301 (33%), Positives = 153/301 (50%), Gaps = 46/301 (15%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           S +V++LDTNP  W   SS+L  S+ + ++L F+NA L +N  NQV +IA+  +   ++Y
Sbjct: 20  SCLVIILDTNPHAWEILSSTLPLSKAIANLLVFVNAHLAVNNSNQVAIIASHCHRAVWLY 79

Query: 72  DSSSTGNQSVGNG----RMPSLCA------TLLQNLEEFMNKDEQLGKQEPEGRIACSL- 120
            +  + ++    G      P   A       L++N    +N    L +   E  ++ +  
Sbjct: 80  PAPPSASEDTEMGGSTTHGPPDDANKYRQFALIEN--TLLNSVRSLIESTTEQEVSTTTT 137

Query: 121 --LSGSLSMALCYIQRVF-----------------------RSGLLHPQPRILCLQGSPD 155
             ++G+L++AL YI +                          +GL   Q RIL L  S D
Sbjct: 138 TQMAGALTLALSYINKQLTAYSDVSAADSKPTPSDITPSDSTTGL---QSRILVLSVSGD 194

Query: 156 GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL 215
              QY+ IMN  F+AQR  +PID   L A ++ FLQQAS  T G++   +   GL QYL+
Sbjct: 195 LAHQYIPIMNTTFAAQRMRIPIDILKL-AGDTVFLQQASDTTRGIYMHLKNPQGLLQYLM 253

Query: 216 TIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCG 273
             F  D  SR  L  P    VDFRA+CFCH+  +D+G++CS+CLSI+C   +   C TC 
Sbjct: 254 MAFLPDQTSRKDLIAPTQEVVDFRAACFCHRKVVDVGFVCSICLSIFCSPPEGAICLTCS 313

Query: 274 S 274
           +
Sbjct: 314 T 314


>gi|347440963|emb|CCD33884.1| similar to RNA polymerase II transcription factor B subunit 4
           [Botryotinia fuckeliana]
          Length = 358

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 101/301 (33%), Positives = 153/301 (50%), Gaps = 46/301 (15%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           S +V++LDTNP  W   SS+L  S+ + ++L F+NA L +N  NQV +IA+  +   ++Y
Sbjct: 22  SCLVIILDTNPHAWEILSSTLPLSKAIANLLVFVNAHLAVNNSNQVAIIASHCHRAVWLY 81

Query: 72  DSSSTGNQSVGNG----RMPSLCA------TLLQNLEEFMNKDEQLGKQEPEGRIACSL- 120
            +  + ++    G      P   A       L++N    +N    L +   E  ++ +  
Sbjct: 82  PAPPSASEDTEMGGSTTHGPPDDANKYRQFALIEN--TLLNSVRSLIESTTEQEVSTTTT 139

Query: 121 --LSGSLSMALCYIQRVF-----------------------RSGLLHPQPRILCLQGSPD 155
             ++G+L++AL YI +                          +GL   Q RIL L  S D
Sbjct: 140 TQMAGALTLALSYINKQLTAYSDVSAADSKPTPSDITPSDSTTGL---QSRILVLSVSGD 196

Query: 156 GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL 215
              QY+ IMN  F+AQR  +PID   L A ++ FLQQAS  T G++   +   GL QYL+
Sbjct: 197 LAHQYIPIMNTTFAAQRMRIPIDILKL-AGDTVFLQQASDTTRGIYMHLKNPQGLLQYLM 255

Query: 216 TIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCG 273
             F  D  SR  L  P    VDFRA+CFCH+  +D+G++CS+CLSI+C   +   C TC 
Sbjct: 256 MAFLPDQTSRKDLIAPTQEVVDFRAACFCHRKVVDVGFVCSICLSIFCSPPEGAICLTCS 315

Query: 274 S 274
           +
Sbjct: 316 T 316


>gi|449546921|gb|EMD37890.1| hypothetical protein CERSUDRAFT_136815 [Ceriporiopsis subvermispora
           B]
          Length = 331

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 99/289 (34%), Positives = 148/289 (51%), Gaps = 29/289 (10%)

Query: 11  DDVSLVVVLLDTNPFFW---SSSS----LSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           D  S + V++D +P  W   S SS    L    FL+ +LAF N+ +     N + V    
Sbjct: 2   DKASHLSVVIDLSPTQWHLCSQSSNPHPLPLQLFLSQLLAFFNSHIACKDENSLAVFGAF 61

Query: 64  YNSCDYVYDSSSTGNQ-----SVGNGRMP--SLCATLLQNLEEFMNKDEQLGKQEPEGRI 116
                 +Y S++   +     +  N   P   L +T++ ++ + +   E L  + P G  
Sbjct: 62  PGKSILLYSSTTIAAEEHSVNADANTYRPFKILDSTIMDSIRDQLEALELLEAEAPIG-- 119

Query: 117 ACSLLSGSLSMALCYIQRVF-------RSG--LLHPQPRILCLQGSPDGPEQYVAIMNAI 167
               L G+++ +LCYI R+        R+G   +   PRIL    SPD    Y+ IMN+I
Sbjct: 120 ----LVGAITKSLCYINRLTIPSSSTTRAGEQAVPLDPRILIFSVSPDQSSSYIPIMNSI 175

Query: 168 FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 227
           FSAQ+  V ID C +   +S FLQQA+++TGG +    + D L QYL   F      R  
Sbjct: 176 FSAQKLKVTIDVCKIYGSDSVFLQQAAHLTGGSYLHLDRPDALLQYLTMSFLPPPGIRQL 235

Query: 228 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           L +P    +DFRA+CFCHK  +D+G++CSVCLSI+C  +  CSTC + F
Sbjct: 236 LSVPTQDKIDFRAACFCHKRIVDIGFVCSVCLSIFCNPVPVCSTCRTKF 284


>gi|340515579|gb|EGR45832.1| hypothetical protein TRIREDRAFT_67476 [Trichoderma reesei QM6a]
          Length = 362

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 155/315 (49%), Gaps = 62/315 (19%)

Query: 14  SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W++  SSLS +Q ++++L F+NA L  +  NQV VIA   N   ++Y
Sbjct: 20  SLLSIVLDTNPRAWAALDSSLSLAQAISNILVFVNAHLAFSNANQVAVIAAHVNRAVWLY 79

Query: 72  D-----SSSTGNQSVGNGRM-----------PS---------LCATLLQNLEEFMNKD-- 104
                 SS+T   + G+ +M           PS         + +++  +++  M +   
Sbjct: 80  PALQHPSSTTVKDNSGDVQMHDASAETSLPPPSANKYPQFAQIESSVFSSIQTLMAETTV 139

Query: 105 ---EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--------------GLLHPQP-- 145
              +QL  Q          LSG+L++ALC I +  ++                  P P  
Sbjct: 140 QDLDQLTTQ----------LSGALTLALCRINKAAQALSSSDTTLSNAAPVNATAPPPVK 189

Query: 146 -RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 204
            RI+ +  S   P QY+  MNA+F+A  + V ID+  L A +S FLQQA + T G+  K 
Sbjct: 190 GRIVVVSVSDSDPSQYIPTMNAVFAAAHNQVAIDTISL-AGDSTFLQQACFNTNGIFLKA 248

Query: 205 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 264
               GL  YL+     D  +R  L  P    VDFR +CFCH   +D GY+CSVCLSI+C 
Sbjct: 249 ANPRGLLTYLMFGLIPDTEARASLITPTHDTVDFRTACFCHGKVVDTGYVCSVCLSIFCT 308

Query: 265 HLK--KCSTCGSVFG 277
                +C TCG+V  
Sbjct: 309 PPDNAECLTCGTVLA 323


>gi|302899112|ref|XP_003047982.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256728914|gb|EEU42269.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 355

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 93/301 (30%), Positives = 150/301 (49%), Gaps = 37/301 (12%)

Query: 9   YSDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS 66
           + D  SL+ ++LDTNP  W++  ++L  S+ + ++L F+NA L  +  NQV +IA   + 
Sbjct: 15  HEDTPSLLTIVLDTNPRAWAAIGNALPLSRAIANILVFVNAHLAFSNANQVALIAAHVDR 74

Query: 67  CDYVYDSSSTGNQSV-GNGRMPSLCATLLQNLEEF----------MNKDEQLGKQEPEGR 115
             ++Y +     +   G+  M     +   +  +F          ++   +L  Q  E  
Sbjct: 75  AQWLYPTPPKPTRDASGDVSMKDAAPSQTSSANKFPQFAQIEAAVLDSVRKLMDQTTEAD 134

Query: 116 IACSL--LSGSLSMALCYIQRVFRSGLLHP------------------QPRILCLQGSPD 155
           +A +   LSG+L++ALC+I +  ++ L  P                  + RIL +  S  
Sbjct: 135 LATTTTQLSGALTLALCHINKAAQA-LCSPTANLEDNHKATTTAPPTVRGRILVISVSDS 193

Query: 156 GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL 215
            P QY+  MNA+F+A  + V ID+  L   +  FLQQA + TGG +       GL  YL+
Sbjct: 194 EPSQYIPTMNAVFAAAHAQVAIDTLSLTG-DPTFLQQACFNTGGTYLAATHPQGLLTYLM 252

Query: 216 TIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCG 273
                D  +R  L  P    VDFRA+CFCH   +D G++CS+CLSI+C+     +C TCG
Sbjct: 253 FGLIADTEAREALVAPTHDTVDFRAACFCHGRVVDTGFVCSICLSIFCELPENSECLTCG 312

Query: 274 S 274
           +
Sbjct: 313 T 313


>gi|156034244|ref|XP_001585541.1| hypothetical protein SS1G_13425 [Sclerotinia sclerotiorum 1980]
 gi|154698828|gb|EDN98566.1| hypothetical protein SS1G_13425 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 356

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 100/303 (33%), Positives = 154/303 (50%), Gaps = 50/303 (16%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           S +V++LDTNP  W+  SS+L  S+ + ++L F+NA L +N  NQV +IA+  +   ++Y
Sbjct: 20  SCLVIILDTNPHAWALLSSTLPLSKAVANLLVFINAHLAVNNANQVAIIASHCHRAVWLY 79

Query: 72  DSSSTGNQSVGNG--------------RMPSLCA-TLLQNLEEFMNKDEQLGKQEPEGRI 116
            +  + ++ +  G              R  +L   TLL +L   ++       ++     
Sbjct: 80  PAPPSASEDIEMGGSTTHGPPNDANKYRQFALIENTLLNSLRNLISST----TEQEVSTT 135

Query: 117 ACSLLSGSLSMALCYIQRVFRS-----------------------GLLHPQPRILCLQGS 153
             + ++G+L++AL YI +   +                       GL   Q RIL L  S
Sbjct: 136 TTTQMAGALTLALSYINKQSLAYNDVSASDSKPTTSDVNTSDSSIGL---QSRILVLSVS 192

Query: 154 PDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQY 213
            D   QY+ IMN  F+AQR  +PID   L A ++ FLQQAS  T G++   +   GL QY
Sbjct: 193 GDLAHQYIPIMNTTFAAQRMRIPIDILKL-AGDTVFLQQASDTTRGIYMHLKNPQGLLQY 251

Query: 214 LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CST 271
           L+  F  D  SR  L  P    VDFRA+CFCH+  +D+G++CS+CLSI+C   +   C T
Sbjct: 252 LMIAFLPDQTSRKDLIAPTQEVVDFRAACFCHRKVVDVGFVCSICLSIFCSPPEGAICLT 311

Query: 272 CGS 274
           C +
Sbjct: 312 CST 314


>gi|428174859|gb|EKX43752.1| hypothetical protein GUITHDRAFT_87651 [Guillardia theta CCMP2712]
          Length = 343

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 101/316 (31%), Positives = 159/316 (50%), Gaps = 36/316 (11%)

Query: 11  DDV-SLVVVLLDTNPFFWSSSSL----SFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
           DD+ S ++V+LD N   W+ S+     +  + L  +L FL A   L+  N V +IA    
Sbjct: 22  DDLPSSLIVILDLNQASWTESTAKGGPTVIEALESLLTFLRAFFMLHGGNTVTIIAAHPY 81

Query: 66  SCDYVYDS------SSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQE-PEGRIAC 118
             +++Y S       + G+Q  G   M  +   L + +       +    QE  +  + C
Sbjct: 82  GSEFLYQSPALQRSKAHGHQQSGALEMDEIDHRLHEKIVSLSKSLKSKSSQENSDSSMKC 141

Query: 119 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 178
           SLLS + ++A C + R+    LL   PR+L L  S + P QY+ IMN IFS Q++ V ID
Sbjct: 142 SLLSSAFALAACTVNRISSHKLLRMLPRVLVLTASTELPSQYIPIMNCIFSMQKANVLID 201

Query: 179 SCYLGAQNSAFLQQASYITGGVHHKPQ------QLDGLFQYLLTIFGTDLHSR------- 225
           +  +   + +FLQQA+ IT GV+ +P+          L  +++++F  D H+R       
Sbjct: 202 ALVVSDGDCSFLQQAADITNGVYLRPELSKVGSSPGALLSWMISVFLGDKHTRFLNADSQ 261

Query: 226 ---NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK-HLKKCSTCGSVFGQAQT 281
                L+LP+   VD+R+SCF     +D+G++CSVCLSI+ K     C TC S     + 
Sbjct: 262 SRSTLLRLPQAQDVDYRSSCFKTGQLVDVGFVCSVCLSIFAKCDFFICPTCDS-----KI 316

Query: 282 QSDEP--SATNRKRKT 295
             D P  +A N+KRK 
Sbjct: 317 ALDRPAGAAVNKKRKV 332


>gi|391332492|ref|XP_003740668.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Metaseiulus occidentalis]
          Length = 297

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 93/269 (34%), Positives = 141/269 (52%), Gaps = 6/269 (2%)

Query: 10  SDDVS-LVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
            D+VS L+ V+LDT+         +  Q L  V+ F NA L  +  N + V+A   ++  
Sbjct: 10  EDEVSELLAVVLDTSFNSALIKEKNVGQLLDAVVIFSNAHLMNSSKNTLAVVACHPHTAK 69

Query: 69  YVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACS-LLSGSLSM 127
            +Y        S+ N R P     +L +  + +    +      +G  +   LL+G +++
Sbjct: 70  MIY---PVKKDSIQNIRPPDAQMEILAHATQVIRDGIKEVVTSWQGVYSIEPLLTGGMAL 126

Query: 128 ALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNS 187
           ALCYI +   + +     RIL L  S +   QY+  MN  F+AQR  + +DSC +   + 
Sbjct: 127 ALCYIAKRMANTVGKLNGRILVLSSSGESAHQYLNFMNVFFAAQRKAIVVDSCVI-QNDC 185

Query: 188 AFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKN 247
             LQQ + ITGG + K   + GL +YLL +F      R  L LPKP  VD+R +CFCH+N
Sbjct: 186 GLLQQGADITGGNYLKAPSIAGLLEYLLWVFLPPKSLRAKLVLPKPDVVDYRTACFCHRN 245

Query: 248 TIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            I++GY+CSVCLSI+C+    CSTC + F
Sbjct: 246 LIEVGYVCSVCLSIFCQFAPICSTCQTHF 274


>gi|346324613|gb|EGX94210.1| transcription factor TFIIH subunit Tfb4 [Cordyceps militaris CM01]
          Length = 366

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 103/311 (33%), Positives = 151/311 (48%), Gaps = 56/311 (18%)

Query: 14  SLVVVLLDTNPFFWS-----SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           SL+ V++DTNP  W+     S  LS +  + ++L F+NA L  +  NQV VIA   N   
Sbjct: 20  SLLSVIIDTNPRAWAVLDDGSLPLSLTAAVANLLLFINAHLAFSGANQVAVIAAHVNRAV 79

Query: 69  YVYDSSSTG----NQSVGNG------------------RMPSLC---ATLLQNLEEFMNK 103
           ++Y SS +     + +  +G                  + P      AT+L +L++ M +
Sbjct: 80  WLYPSSHSPKPIQDNTSADGDVAMADAPAAATTPSSANKYPQFAQIEATVLGSLKKLMAE 139

Query: 104 DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR---------------SGLLHPQP--- 145
                  E +     + LSG+L++ALC+  +  R               SG   P P   
Sbjct: 140 -----TTERDLDSTTTQLSGALTLALCHTNKASRALGASDTAIHEAAQTSGTTPPSPMKG 194

Query: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 205
           RIL L  S     QY+  MNA+F+A    V ID+  L A ++ FLQQA + TGGV  +  
Sbjct: 195 RILVLSVSDSESSQYIPTMNAVFAAAHGRVAIDALAL-AGDATFLQQACFNTGGVFLRAA 253

Query: 206 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK- 264
              GL  YL+     D  +R  L  P    VDFRA+CFCH N +D G++CS+CLSI+C+ 
Sbjct: 254 NPQGLLAYLMFGLVADTEAREALVAPTHDTVDFRAACFCHGNVVDTGFVCSICLSIFCEL 313

Query: 265 -HLKKCSTCGS 274
               +C TCG+
Sbjct: 314 PDNAECFTCGT 324


>gi|121713686|ref|XP_001274454.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
           clavatus NRRL 1]
 gi|119402607|gb|EAW13028.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
           clavatus NRRL 1]
          Length = 386

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 101/326 (30%), Positives = 157/326 (48%), Gaps = 66/326 (20%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+    SL  S  + ++L F+NA L  N  N+V V+A+  +   ++Y
Sbjct: 20  SLLTIVLDTNPHAWALLEPSLPLSTAIANILVFINAHLACNYANEVAVVASHTHKATWLY 79

Query: 72  ----------------DSSSTGNQSVGNGRMPSLCATLLQN-------LEEFMNKDEQ-- 106
                            + + G+ ++ +    S  A    N       +EE ++++ +  
Sbjct: 80  PCETQHPRTTTASAPASADNDGDVAMDDSSGSSGAADAHVNKYRPFRIVEEQVSRNLRRL 139

Query: 107 LGKQEPEGRIA--CSLLSGSLSMALCYIQR------------------------------ 134
           LG        A   ++++G+L++AL +I R                              
Sbjct: 140 LGATTATDVQASTTTMMAGALTLALSHINRRSLAWAEAHGGAAATGSAGAAEGAPGGGGR 199

Query: 135 ----VFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFL 190
                  +  L  +  I+ +  S D   QY+ IMNAIF+ QR  VPID C L   ++ FL
Sbjct: 200 ADGEGGGAEGLQSRILIISVSSSTDSAHQYIPIMNAIFACQRLHVPIDVCKLSG-DAVFL 258

Query: 191 QQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTID 250
           QQAS  T GV+    +  GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D
Sbjct: 259 QQASDATKGVYMSLAEPRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVD 318

Query: 251 MGYICSVCLSIYCKHLKK--CSTCGS 274
           +G++CS+CLSI+C+  +   C TCG+
Sbjct: 319 IGFVCSICLSIFCEPPENGGCLTCGT 344


>gi|452820306|gb|EME27350.1| transcription initiation factor TFIIH subunit H3 isoform 2
           [Galdieria sulphuraria]
 gi|452820307|gb|EME27351.1| transcription initiation factor TFIIH subunit H3 isoform 1
           [Galdieria sulphuraria]
          Length = 328

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 100/299 (33%), Positives = 149/299 (49%), Gaps = 37/299 (12%)

Query: 10  SDDVSLVVVLLDTNP--------FFWSSSSLS--------FSQFLTHVLAFLNAILTLNQ 53
           SD  +L+ VLLD +P        F  S S+ S           FL  ++ FL+  L  N 
Sbjct: 33  SDKSNLLCVLLDLSPNLFDRLDKFVPSQSAKSVDLIEADALETFLLQLITFLHVFLLSNF 92

Query: 54  LNQVVVIATGYNSCDYVYDSSSTGNQSVGNG------------RMPSLCATLLQNLEEFM 101
            N++V+I     +   VY+       S  +             R+  L +  L+N+  F+
Sbjct: 93  ANRLVIIVYNEYNIHVVYEDKGLNTLSDPSNSASFLLTEERQERLTRLDSEFLKNVASFI 152

Query: 102 NKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYV 161
            +       E  G    +LLS SLS ALCY+ R   S     + RILCL  S D P+QY+
Sbjct: 153 -ESYPYNSCEENG----TLLSASLSFALCYLHRRILSE--RYEARILCLHASSDVPKQYI 205

Query: 162 AIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG--LFQYLLTIFG 219
            +MN+IF AQR  + ID C L  + S+ LQQA+++TGG++ KP+Q +   L  YL +++ 
Sbjct: 206 GVMNSIFCAQRWSILIDVCNLHEEESSLLQQAAHLTGGIYFKPKQSEAIPLSNYLCSLYF 265

Query: 220 TDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQ 278
               +R +L++P    VD  A CF     I MGYICSVCL+ + +    C+ CG+ F +
Sbjct: 266 PSRDARKYLKIPVLNSVDCSALCFESGEKIHMGYICSVCLATFGRKRLSCNICGAEFAE 324


>gi|412993660|emb|CCO14171.1| predicted protein [Bathycoccus prasinos]
          Length = 370

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 69/167 (41%), Positives = 105/167 (62%), Gaps = 11/167 (6%)

Query: 121 LSGSLSMALCYIQRVFRSGLLHP--QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 178
           ++G++S+AL Y  R+ R   L    + R+L +Q S D  EQYV +MNA+F+AQ S + ID
Sbjct: 203 IAGAISLALTYANRIERDPSLRGRVKSRVLVVQASEDDAEQYVPMMNAMFAAQSSNILID 262

Query: 179 SCYLGAQNSAFLQQASYITGGVHHKPQQ---------LDGLFQYLLTIFGTDLHSRNFLQ 229
              L  + S+FLQQA++++GG++ KP++          + L +  L  F  D ++R FL+
Sbjct: 263 CLCLNEKESSFLQQAAFVSGGIYAKPEKSKKGGGEETREDLLERALCSFLPDRYTRRFLR 322

Query: 230 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            P+   +DFR+SCFCH N I  G++CSVCLS++CK    C TCG+ F
Sbjct: 323 PPRAGSIDFRSSCFCHGNRISKGFVCSVCLSVFCKKEGACQTCGAAF 369


>gi|400593294|gb|EJP61268.1| transcription factor tfb4 [Beauveria bassiana ARSEF 2860]
          Length = 373

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 107/318 (33%), Positives = 151/318 (47%), Gaps = 63/318 (19%)

Query: 14  SLVVVLLDTNPFFWS-----SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           SL+ V++DTNP  W+        LS    + ++L F+NA L  +  NQV VIA   N   
Sbjct: 20  SLLSVIIDTNPRAWAVLDDGDLPLSLPTAVANLLLFINAHLAFSGANQVAVIAAHVNRAV 79

Query: 69  YVYD------------SSSTGNQSVGN----------GRMPS----------LCATLLQN 96
           ++Y             +S   N S G+          G  PS          + AT+L +
Sbjct: 80  WLYPSSQQQQQNLLSNTSPRPNPSDGDVDMADAPSATGTTPSSANKYPPFAHIEATVLAS 139

Query: 97  LEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR---------------SGLL 141
           L++ M +       E +     + LSG+L++ALC+  +  R               SG  
Sbjct: 140 LKQLMAE-----TTERDLDSTTTQLSGALTLALCHNNKAGRALSASDTALLDAAQASGAT 194

Query: 142 HPQP---RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 198
            P P   RIL L  S     QY+ IMNA+F+A  S V ID+  L A ++ FLQQA + TG
Sbjct: 195 APPPMKGRILVLSVSDSESAQYIPIMNAVFAAAHSHVAIDTLAL-AGDATFLQQACFGTG 253

Query: 199 GVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC 258
           GV  +     GL  YL+     D  +R  L  P    VDFRA+CFCH N +D G++CS+C
Sbjct: 254 GVFLRAANPQGLLAYLMFGMIADSEAREALVAPAHDTVDFRAACFCHGNVVDTGFVCSIC 313

Query: 259 LSIYCK--HLKKCSTCGS 274
           LSI+C      +C TCG+
Sbjct: 314 LSIFCDLPENAECLTCGT 331


>gi|295670289|ref|XP_002795692.1| RNA polymerase II transcription factor B subunit 4
           [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226284777|gb|EEH40343.1| RNA polymerase II transcription factor B subunit 4
           [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 422

 Score =  143 bits (360), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 108/353 (30%), Positives = 163/353 (46%), Gaps = 88/353 (24%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ V+LDTNP+ W+   S+L  S  + ++L F+NA L  N  N+V V+A+  N   ++Y
Sbjct: 33  SLLTVILDTNPYAWALLESTLPLSAAIANLLVFINAHLACNYANKVAVVASHCNHAAWLY 92

Query: 72  DSSSTGNQS-------------VGNGRMPSL---------CATLLQNLEEFMNKDEQLGK 109
            + ST + S               NG+ PS            + L     F+  +EQL +
Sbjct: 93  PTPSTPSISRSKAIQLDADGDITMNGQ-PSTEPERENQQQSQSPLNKYRPFLLVEEQLTR 151

Query: 110 Q-------EPEGRIACS---LLSGSLSMALCYIQR-----VFRSGLLHPQPR-------- 146
                    P   +  +   +L+G+L++AL +I R         G  +P P         
Sbjct: 152 NLHCILSATPPSAVTSTTSTMLAGALTLALSHINRESIAYAEAHGAANPDPTSSTTGPAT 211

Query: 147 -----------------------------------ILCLQGSPDGPEQYVAIMNAIFSAQ 171
                                              I+ L  +    + Y+ IMN+IF+ Q
Sbjct: 212 STASGLPPPPNSTTTDPASAVANHLNPTALQSRILIISLSNTTHSAQHYIPIMNSIFACQ 271

Query: 172 RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ-QLDGLFQYLLTIFGTDLHSRNFLQL 230
           R  +PID   L A ++AFLQQAS  TGG++        G  QYL+  F  D  +R  L L
Sbjct: 272 RLHIPIDIIKL-AGDAAFLQQASDATGGIYIPINTHPAGFLQYLMLGFLPDQRARAHLIL 330

Query: 231 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLK---KCSTCGSVFGQAQ 280
           P  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ L    +C TCG+  G A+
Sbjct: 331 PSRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPLGGEGECLTCGTRLGMAE 383


>gi|336371329|gb|EGN99668.1| hypothetical protein SERLA73DRAFT_53296 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 297

 Score =  143 bits (360), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 97/291 (33%), Positives = 141/291 (48%), Gaps = 39/291 (13%)

Query: 18  VLLDTNPFFWSSSS-------LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYV 70
           V+LD +P  W  S+       LS   FL+ +LAFLNA +     N + V          +
Sbjct: 9   VILDLSPTQWHLSAQSSNQYPLSIHSFLSQLLAFLNAHIASKHENTLAVFGAFPGKSVML 68

Query: 71  YDSSSTGNQSVGNGRMPSLCATLLQN---LEEFMNKDEQLGKQEPEGRIACSLLSGSLSM 127
           Y S+      V      S  +  + +   +   MN+ + LG  + E   A   L G+L+ 
Sbjct: 69  YSSTDPVADDVPPADANSYSSFKVVDFTIVRSIMNEFDALGNLDEEPPCA---LVGALTK 125

Query: 128 ALC--------YIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 179
           ALC        +         + P PRIL L  SPD    Y+ IMN+IFSAQ+  V ID 
Sbjct: 126 ALCCKSISAHFHSPNAVEDPAVLPDPRILILSVSPDLSTSYIPIMNSIFSAQKLKVTIDV 185

Query: 180 CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFR 239
           C +   ++ FLQQA+++TGG +   ++ D L QYL   F +    R  L +PK   +DFR
Sbjct: 186 CKVYGPDAVFLQQAAHLTGGSYIFLERRDALLQYLTMSFLSPPALRQVLAVPKQDKIDFR 245

Query: 240 ASCFCHKNTIDMGYICSVCLS------------------IYCKHLKKCSTC 272
           A+CFCHKN +D+G++CSVCLS                  ++C+ +  CSTC
Sbjct: 246 AACFCHKNIVDVGFVCSVCLSSNHILFYVVKLWFTYMCTVFCQPVPVCSTC 296


>gi|225717796|gb|ACO14744.1| General transcription factor IIH subunit 3 [Caligus clemensi]
          Length = 302

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 105/298 (35%), Positives = 152/298 (51%), Gaps = 24/298 (8%)

Query: 12  DVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
           +  L+++++D NP            +  L  +L F NA L L+  N +  I +  +   +
Sbjct: 14  EAQLLILVMDMNPNQRLLLEEPTRLTSVLDSLLCFSNAHLLLHPSNALAAIGSLSSGSYF 73

Query: 70  VY------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSG 123
           +Y      D+S    Q  G   + +L  T ++N      K  QL + E  G    S LSG
Sbjct: 74  LYPSSCIDDTSPELRQLDGQYELFTLVETSVRN------KFSQLLEAETSGS-EDSPLSG 126

Query: 124 SLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG 183
           SL+MAL YI R  R      + RIL L  S D   QY+  MN  F+AQ+  V +D+C L 
Sbjct: 127 SLAMALAYINRK-RKESPELRARILILTASGDTASQYMNYMNVFFTAQKLNVLLDACMLQ 185

Query: 184 AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCF 243
           + +S  LQQ + ITGG++     L  L Q+LL IF      R  L LP  V VDFRA+CF
Sbjct: 186 S-DSPLLQQGADITGGLYFNVPDLAALLQFLLWIFLPSAEIRPQLGLPTGVKVDFRAACF 244

Query: 244 CHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF-------GQAQTQSDEPSATNRKRK 294
           CH+  + +G++CSVCLSI+CK    C+TC +VF       G+ +    + S++  KRK
Sbjct: 245 CHRQLVGVGFVCSVCLSIFCKFSPICTTCQTVFRVPAPLLGKKKKSIAQGSSSLLKRK 302


>gi|326437202|gb|EGD82772.1| hypothetical protein PTSG_03422 [Salpingoeca sp. ATCC 50818]
          Length = 311

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 109/338 (32%), Positives = 155/338 (45%), Gaps = 92/338 (27%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL V+++D +P  W   S   S  + +  +  FLNA L+++  N + VI        ++Y
Sbjct: 3   SLFVLIVDLHPARWKQLSEDASLDEAMNQICVFLNAQLSVHVSNTIAVIGAAPKKSMFLY 62

Query: 72  DSSSTGNQSVGNGRMPS----------LCATLLQNLEEFMNKDEQLGKQEPEGRIACSLL 121
            S+          +MPS          + AT+L N  +  N  EQ G++ P      S L
Sbjct: 63  PSAV---------KMPSPDDRQELFSEMNATILANATKLFN--EQSGERHP------SDL 105

Query: 122 SGSLSMALCYIQRVFRSGLLHPQ------------------------------------- 144
           +G+LS ALC+I R  R   L PQ                                     
Sbjct: 106 AGALSKALCFINRTVRGTSLQPQHTTASSSVTSRFIAGDKATPASSSSSSSSSAPKAAGR 165

Query: 145 ----------PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQAS 194
                     PRI  +  SPD   +Y A MN IF+AQ++ VPID C +G + +A LQQA+
Sbjct: 166 DRVSSGARVTPRIGVVSVSPDSSVEYFAAMNCIFAAQKNDVPIDVCAMG-EEAAILQQAA 224

Query: 195 YITGG----VHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTID 250
           ++T G    VHH  Q L  L    +T   +D H    L  P    ++F ASCFCH  T+D
Sbjct: 225 HLTNGTYVKVHHPSQLLQSLMMGFVT--SSDAH---VLVRPVQRRINFDASCFCHGRTVD 279

Query: 251 MGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSA 288
           +GY+C+VCLS+YC+ L  C  CG+ F      SD+P A
Sbjct: 280 IGYVCTVCLSVYCERLASCRICGAEF------SDKPGA 311


>gi|212529242|ref|XP_002144778.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
           marneffei ATCC 18224]
 gi|210074176|gb|EEA28263.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
           marneffei ATCC 18224]
          Length = 380

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 100/321 (31%), Positives = 149/321 (46%), Gaps = 61/321 (19%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LD NP  W+     L FS+ L ++L F+NA L  N  N+V VIA+      ++Y
Sbjct: 20  SLLAIVLDINPHAWALLEDQLPFSKALANILVFINAHLACNYTNEVAVIASHSQRAAWLY 79

Query: 72  ----------------DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPE-- 113
                           D     N    +    S  +T       F   +EQ+ +   E  
Sbjct: 80  PAHNQQQKPSTNRLDADGDIDMNGEEAHNSRASRSSTQSNMYRPFRLVEEQVMENVRELL 139

Query: 114 -------GRIACSLLSGSLSMALCYIQRVFRSGL-------------------------- 140
                       ++++G+L++AL +I R   +                            
Sbjct: 140 ASTNASDVTATSTMMAGALTLALSHINRRTMTWTETHGNSNIETAGASASSSATAAVSGG 199

Query: 141 ---LHPQPRILCLQ--GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASY 195
              L  Q RIL +    S D   QY+ IMN+IF+ QR  +PID C L   ++ FLQQAS 
Sbjct: 200 NLSLGLQSRILIVSVSSSTDSAHQYIPIMNSIFACQRLHIPIDVCKLSG-DAVFLQQASD 258

Query: 196 ITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYIC 255
            T GV+    +  GL QYL+  F  D  SR  L +P  V VDFRA+CFCH+  +++G++C
Sbjct: 259 ATRGVYMSLSEPRGLLQYLMMAFLPDQRSRKHLIVPSRVDVDFRAACFCHRRVVNIGFVC 318

Query: 256 SVCLSIYCKHLKK--CSTCGS 274
           S+CLSI+C+  +   C TCG+
Sbjct: 319 SICLSIFCEPPENGDCLTCGT 339


>gi|224156833|ref|XP_002337765.1| predicted protein [Populus trichocarpa]
 gi|222869675|gb|EEF06806.1| predicted protein [Populus trichocarpa]
          Length = 77

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 65/77 (84%), Positives = 71/77 (92%)

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MNAIFSAQRSMVPIDSCY+GA NSAFLQQASYITGGV+ KPQ LDGLFQYL T+F TDLH
Sbjct: 1   MNAIFSAQRSMVPIDSCYVGAHNSAFLQQASYITGGVYVKPQHLDGLFQYLTTVFATDLH 60

Query: 224 SRNFLQLPKPVGVDFRA 240
           SR+F+QLP+P GVDFRA
Sbjct: 61  SRSFIQLPRPAGVDFRA 77


>gi|259489666|tpe|CBF90125.1| TPA: transcription factor TFIIH complex subunit (Eurofung)
           [Aspergillus nidulans FGSC A4]
          Length = 392

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 100/334 (29%), Positives = 154/334 (46%), Gaps = 74/334 (22%)

Query: 14  SLVVVLLDTNPFFWS------SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           SL+ V++DTNP  W+       + L FS  L ++L FLNA L  N  N+V V+A+  +  
Sbjct: 20  SLLTVIIDTNPHAWALLEQNEKNQLPFSTALANILVFLNAHLACNYANEVAVVASHTHKA 79

Query: 68  DYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEP----EGRI------- 116
            ++Y S ++   S  +    ++      +      K  Q+ K  P    E ++       
Sbjct: 80  AWLYPSPNSPTTSADSDGDVTMSNNNHMSTGGGSGKPSQVNKYRPFRIVEEQVTSNLRHL 139

Query: 117 ------------ACSLLSGSLSMALCYIQR----------------VFRSGL-------- 140
                       A ++++G+L++AL +I R                  RSG         
Sbjct: 140 VDSTNRDDLDSGASTMMAGALTLALSHINRRTIAWAEAHGGGDASSFSRSGDPDTSTTSM 199

Query: 141 ------------------LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL 182
                             L  +  I+ +  +     QY+ IMN IF+ QR  +PID C L
Sbjct: 200 IARRSTTTTATNDGAAEGLQSRILIISVSSATGSAHQYIPIMNGIFACQRLHIPIDVCKL 259

Query: 183 GAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASC 242
              ++ FLQQAS  T G++    +  GL QYL+  F  D  SR  L LP  V VDFRA+C
Sbjct: 260 SG-DAVFLQQASDATKGIYMSLSEPRGLLQYLMMAFLPDQRSRKHLILPTRVDVDFRAAC 318

Query: 243 FCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGS 274
           FCH+  +D+G++CS+CLSI+C+  +   C TCG+
Sbjct: 319 FCHRRVVDIGFVCSICLSIFCEPPENGDCLTCGT 352


>gi|440633398|gb|ELR03317.1| hypothetical protein GMDG_06064 [Geomyces destructans 20631-21]
          Length = 363

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 99/308 (32%), Positives = 150/308 (48%), Gaps = 52/308 (16%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+    +L  S+ + +++ F+NA L  N  NQV V+A+  +   ++Y
Sbjct: 20  SLLTIVLDTNPHAWALLHDTLPLSKAVANLMVFINAHLAFNNANQVAVVASHSHRAAWLY 79

Query: 72  ----DSSSTGNQSV---GNGRMPS-------------LCATLLQNLEEFMNKDEQLGKQE 111
               ++   G  +V   G  RM               +   LL +L E ++        E
Sbjct: 80  PRPPNARGAGEDTVMTDGYDRMTKAPNDANKYRPFSLIEHDLLTSLRELIDST----TTE 135

Query: 112 PEGRIACSLLSGSLSMALCYIQR---------VFRSGLLHPQP--------------RIL 148
                  + ++G+L++AL Y  +         V  SG  HP                RIL
Sbjct: 136 DVSSTTTTQMAGALTLALSYSNKATVTYNESGVSSSGKSHPDGGENPTDDVPRGLLCRIL 195

Query: 149 CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 208
            +  S D   QY+ IMN  F+AQR  +PID   L   ++ FLQQAS  T G + + +   
Sbjct: 196 VISVSGDLAHQYIPIMNTTFAAQRLRIPIDILKLSG-DTVFLQQASDATNGTYMQLRNPQ 254

Query: 209 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 268
           GL Q L+  F  D  +R  L  P    VDFRA+CFCH+N +D+G++CS+CLSI+C   + 
Sbjct: 255 GLLQSLMLGFLPDQSARTHLISPTQEVVDFRAACFCHRNVVDVGFVCSICLSIFCSPPEG 314

Query: 269 --CSTCGS 274
             C TCG+
Sbjct: 315 AICLTCGT 322


>gi|452978955|gb|EME78718.1| hypothetical protein MYCFIDRAFT_124295, partial [Pseudocercospora
           fijiensis CIRAD86]
          Length = 339

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 100/334 (29%), Positives = 155/334 (46%), Gaps = 80/334 (23%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           +L+ V+LDTNP  W+    SLS SQ + ++L F+NA + +N  N+V V+A+     +++Y
Sbjct: 4   ALLTVILDTNPHAWALLEHSLSLSQLVVNLLVFVNAHIAINPANRVAVVASHSECAEWLY 63

Query: 72  DSSSTGNQSVGNGR-------------------------------MPSLCATLLQNLEEF 100
            + +       NG                                   +   +  NL   
Sbjct: 64  PTPARAQSHAANGNGNGTHVEMADGDPAPRGGPVQAPDDANKYRPFAHIEHAITTNLRRL 123

Query: 101 MNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQR---VFRSG------------------ 139
           M+       Q        ++++G+L+ AL YI +      SG                  
Sbjct: 124 MSSTSAEALQS----TPATMMAGALTRALAYISKQSQTLPSGAASSQQFNYSDPNSVAGG 179

Query: 140 ------------LLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNS 187
                       +L    RIL L  S D  +QY+ IMN+IF+ QR  +PID   L A ++
Sbjct: 180 NEASSGTSQGNNMLGLTSRILILSVSGDLADQYIPIMNSIFACQRLSIPIDILKL-AGDT 238

Query: 188 AFLQQASYITGGVH----HKPQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVGVDFRASC 242
            FLQQA+  TGG++    H+ +   G  QYL+  +  D  +R  L +P +  GVDFRA+C
Sbjct: 239 VFLQQAADATGGIYMALDHRTRA--GFLQYLMFAYLPDEAARKHLIVPGEGEGVDFRAAC 296

Query: 243 FCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 274
           FCH+  +D+G++CS+CLSI+C+  H   C TCGS
Sbjct: 297 FCHRRVVDIGFVCSICLSIFCEPLHDSTCLTCGS 330


>gi|254567625|ref|XP_002490923.1| Subunit of TFIIH complex, involved in transcription initiation,
           similar to 34 kDa subunit of human T [Komagataella
           pastoris GS115]
 gi|238030720|emb|CAY68643.1| Subunit of TFIIH complex, involved in transcription initiation,
           similar to 34 kDa subunit of human T [Komagataella
           pastoris GS115]
          Length = 328

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 103/289 (35%), Positives = 149/289 (51%), Gaps = 33/289 (11%)

Query: 14  SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVI-ATGYNS---- 66
           SL+ V++DTNP  W+     +   +    VL  LNA L+LN  N+V V+ ++  NS    
Sbjct: 23  SLLAVVIDTNPLEWTKLKGVICLKELCKSVLVLLNAHLSLNSGNRVAVLTSSSLNSGPKF 82

Query: 67  -----CDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLG--KQEPEGRIACS 119
                 D  Y+   +   S  + +   +   ++Q L+  ++ +E L    QE +G IA  
Sbjct: 83  LYPDPNDKTYEKRESLLSSDIHRQFKFVDQKIIQELQILLD-NEPLNPENQELKGSIA-- 139

Query: 120 LLSGSLSMALCYIQRVFRSGLLHPQ-------PRILCLQGSPDGPEQYVAIMNAIFSAQR 172
              G++SMAL YI R+     + P+        ++L +  S D  +QYV  MN IFSAQ+
Sbjct: 140 ---GAMSMALSYINRLTN---IEPESTGNTLRAKMLVISISDDSTQQYVPFMNCIFSAQK 193

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
             V ID C +G  +S+FLQQAS +T GV+   + + GL QYL T    D   R  + LP 
Sbjct: 194 MKVSIDVCKMGP-DSSFLQQASDVTNGVYMLIKNVHGLIQYLTTALFIDPSLRPIMVLPT 252

Query: 233 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGSVFGQA 279
              +DFRASCF     ID+GY+CSVCL I+    +   C TC S F  A
Sbjct: 253 NSDLDFRASCFVTNKVIDIGYVCSVCLCIFSIIPRNNVCPTCQSKFDDA 301


>gi|449295631|gb|EMC91652.1| hypothetical protein BAUCODRAFT_300256 [Baudoinia compniacensis
           UAMH 10762]
          Length = 386

 Score =  140 bits (354), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 103/329 (31%), Positives = 153/329 (46%), Gaps = 73/329 (22%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ +++DTNP  W+    +L  ++ +  +L F+NA L +N  NQV VIA+     +++Y
Sbjct: 20  SLLTIIIDTNPHAWALLDDTLPLTKAVASLLVFINAHLAINYTNQVAVIASHSERAEWLY 79

Query: 72  DS-----SSTGNQSVGNGR-----------------------MPSLCATLLQNLEEFMNK 103
            +     +   N ++ NGR                          L  TL+ NL + MN 
Sbjct: 80  PTDVHPDNVQSNGALANGRDYEDLMDRGGPVQPPDDANKYRPFAQLEHTLMYNLRKMMNT 139

Query: 104 DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS------------------------- 138
                  +       ++L+G+L+ AL YI +   S                         
Sbjct: 140 T----TTDAINSSPATMLAGALTRALAYISKQTASLPTATSSTQFNYSDPSAMAGGNELT 195

Query: 139 -------GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQ 191
                    L    RIL +  S D  ++Y+ IMN IF+ QR  VPID   L A +  FLQ
Sbjct: 196 TDPTKSANALGLSSRILIVSVSGDLADRYIEIMNCIFACQRMSVPIDVLKL-AGDPVFLQ 254

Query: 192 QASYITGGVH---HKPQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVGVDFRASCFCHKN 247
           QA+  TGG++          GL QYL+  +  D  +RN L  P +  GVDFRA+CFCHK 
Sbjct: 255 QAADTTGGIYMSLSTSTARAGLLQYLMFAYLMDQTARNHLIAPGEGEGVDFRAACFCHKR 314

Query: 248 TIDMGYICSVCLSIYCKHLKK--CSTCGS 274
            +D+G++CS+CLSI+C+ L    C  CGS
Sbjct: 315 VVDIGFVCSICLSIFCEPLPDGTCLLCGS 343


>gi|358366716|dbj|GAA83336.1| transcription factor TFIIH complex subunit [Aspergillus kawachii
           IFO 4308]
          Length = 386

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 102/326 (31%), Positives = 153/326 (46%), Gaps = 66/326 (20%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+  S +L  S  + ++L F+NA L  N  N+V V+++  +   ++Y
Sbjct: 20  SLLTIILDTNPHAWALLSPTLPLSTAIANILVFINAHLASNYANEVAVVSSHTSQATWLY 79

Query: 72  ----DSSSTGNQSVGNGRMPSLCATL------------LQNLEEFMNKDEQLG------- 108
                 +S  N    +G +P                  +     F   +EQ+        
Sbjct: 80  PVEKSPNSNSNNPDPDGDIPMPTQQPQPPSNTTTTTTNINKYRPFRIVEEQVSSHLHNLM 139

Query: 109 -KQEPEG--RIACSLLSGSLSMALCYIQR------------VFRSGLLHP---------- 143
            +  P        ++++G+L++AL +I R            +  S    P          
Sbjct: 140 TQTTPSTLQSTTSTMMAGALTLALSHINRRTLAWNEAHGGDIDTSNPNDPSSTTSTSTSS 199

Query: 144 -----------QPRILCLQ--GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFL 190
                      Q RIL L    S     QY+ IMNAIF+ QR  +PID C +   ++ FL
Sbjct: 200 SNTKGTAESTLQSRILILSVSSSTGSAHQYIPIMNAIFACQRLHIPIDVCKVSG-DAVFL 258

Query: 191 QQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTID 250
           QQAS  T GV+    +  GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D
Sbjct: 259 QQASDATKGVYMALDEPKGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVD 318

Query: 251 MGYICSVCLSIYCKHLKK--CSTCGS 274
           +G++CS+CLSI+C+ L    C TCG+
Sbjct: 319 VGFVCSICLSIFCEPLGDGVCLTCGT 344


>gi|425772016|gb|EKV10443.1| Transcription factor TFIIH subunit Tfb4, putative [Penicillium
           digitatum Pd1]
 gi|425777277|gb|EKV15458.1| Transcription factor TFIIH subunit Tfb4, putative [Penicillium
           digitatum PHI26]
          Length = 387

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 104/333 (31%), Positives = 153/333 (45%), Gaps = 77/333 (23%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA---------- 61
           SL+ +++DTNP  W+    SL  S+ + ++L F+NA L  N  N+V V+A          
Sbjct: 20  SLLTIVIDTNPHAWAILEHSLPLSKAIANILVFINAHLACNYANEVAVVASHCQKAAWLY 79

Query: 62  ---------TGYNSCDYVYDSSSTGNQSVGNGRMP--SLCATLLQNLEEFMNKDEQLGKQ 110
                    T  +  D   + +S       N   P   +   + +NL+E M  D   G  
Sbjct: 80  PSHNAPRNRTADHDSDVAMNGASDTQPPETNKYRPFRIVEEQVTRNLKELM--DSTTG-D 136

Query: 111 EPEGRIACSLLSGSLSMALCYI-QRVFRSGLLHP-------------------------- 143
           +  G ++ ++L+G+L++AL +  +R       H                           
Sbjct: 137 DLRGNMS-TMLAGALTLALSHTNRRALAWAEEHGGTNNEDAAADGNGNGNGNGNGNGHGG 195

Query: 144 ------------------QPRILCLQ--GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG 183
                             Q RIL +   GS D   QY+ IMN+IF+ QR  +PID C L 
Sbjct: 196 GTSASVNRYSASNEDERLQSRILVISVSGSTDAAHQYIPIMNSIFACQRLNIPIDVCKLS 255

Query: 184 AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCF 243
             ++ FLQQAS  T GV+    +  GL QYL+  F  D  SR  L LP  V VDFRA+CF
Sbjct: 256 G-DAVFLQQASDATKGVYMALAEPRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACF 314

Query: 244 CHKNTIDMGYICSVCLSIYCKHL--KKCSTCGS 274
           CH+  +D+G++CS+CLSI+C       C TCG+
Sbjct: 315 CHRRVVDVGFVCSICLSIFCGPPPGSDCMTCGT 347


>gi|346972888|gb|EGY16340.1| RNA polymerase II transcription factor B subunit 4 [Verticillium
           dahliae VdLs.17]
          Length = 398

 Score =  140 bits (353), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 157/324 (48%), Gaps = 62/324 (19%)

Query: 11  DDVSLVVVLLDTNPFFWSSSS--LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           +  SL  V+LD NP  WS+ S  L  S+ L++VL +LNA L  +  NQV VIA+  N   
Sbjct: 17  ETASLSTVILDVNPRAWSALSDVLPISKALSNVLIYLNANLAFSNTNQVAVIASHTNRAV 76

Query: 69  YVYDS--------SSTGNQSVGNG-----RMPSLCA-TLLQNLEEFMNKD-EQLGKQEPE 113
           ++Y +        S+ G+ ++ +      ++PS    T    +E+ + +  ++L  +  E
Sbjct: 77  WLYPTKPEPSNVPSTDGDVAMSDAPPKKSKVPSANKFTQFAQVEDAVLRSLKKLLDETTE 136

Query: 114 GRIACSL--LSGSLSMALCYIQRV-------FRSGLLHP--------QPRILCLQGSPDG 156
             + C+   LSG+L++AL +I +V         S    P           IL +  S   
Sbjct: 137 ADLDCTTTQLSGALTLALAHINKVNLSYTASLSSDTAKPMTTAAPGLSSHILVVSVSDSD 196

Query: 157 PEQYVAIMNAIFSAQRSMVPIDSCYLG---------------------AQNSAFLQQASY 195
           P QY++ MNA+F+A  + +PID   L                      A +S+FLQQAS+
Sbjct: 197 PSQYISTMNAVFAAAHASIPIDVLSLSPATAKQTTATQSTDPNDPHKPASSSSFLQQASH 256

Query: 196 ITGGVHHKPQQLDGLFQYLLTIFG---TDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMG 252
           IT G   +P    GL  YL   FG    D  +R  L  P    VDFRA+CFCH   ID G
Sbjct: 257 ITNGTFLEPPSPLGLLAYL--TFGLAVADTRARAALVPPTHHTVDFRAACFCHGRVIDTG 314

Query: 253 YICSVCLSIYCK--HLKKCSTCGS 274
           ++CSVCLSI+C       C TCG+
Sbjct: 315 FVCSVCLSIFCSVPEGADCLTCGT 338


>gi|322709828|gb|EFZ01403.1| transcription factor TFIIH subunit Tfb4, putative [Metarhizium
           anisopliae ARSEF 23]
          Length = 358

 Score =  140 bits (353), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 95/302 (31%), Positives = 150/302 (49%), Gaps = 47/302 (15%)

Query: 14  SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ +++DTNP  W++  + LS S+ + ++L F+NA L  + +NQV ++A   N   ++Y
Sbjct: 20  SLLSIVIDTNPRAWAALDNVLSISKAIANILVFVNAHLAFSNVNQVAIVAAHVNRAVWLY 79

Query: 72  ------------------DSSSTGNQSVGNG--RMPSLCATLLQNLEEFMNKDEQLGKQE 111
                             D SS   ++  N   +   +   +L ++++ M++       E
Sbjct: 80  PTPPQTSPKDTSDDVHMNDVSSESPRNSANKYPQFAQIETAVLGSMQKLMSE-----TTE 134

Query: 112 PEGRIACSLLSGSLSMALCYIQRVFR------SGLLH-PQP----------RILCLQGSP 154
            +     + LSG+L++ALC+I +  +      S L    QP          RIL +  S 
Sbjct: 135 QDLEATTTQLSGALTLALCHINKASQALSGASSSLAEVAQPSANAAPPMKGRILVVSVSD 194

Query: 155 DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYL 214
             P QY+  MNA+F+A  + V ID+  L   N  FLQQA + T G+        GL  YL
Sbjct: 195 SEPTQYIPTMNAVFAAAHAQVAIDTLSLSG-NPTFLQQACFNTNGIFLAAANPKGLLTYL 253

Query: 215 LTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTC 272
           +     D  +R  L  P    VDFRA+CFCH   +D G+ICS+CLSI+C+     +C TC
Sbjct: 254 MFGLIADTEARQSLITPTHDTVDFRAACFCHGKVVDRGFICSICLSIFCELPENAECLTC 313

Query: 273 GS 274
           G+
Sbjct: 314 GT 315


>gi|226294041|gb|EEH49461.1| RNA polymerase II transcription factor B subunit 4
           [Paracoccidioides brasiliensis Pb18]
          Length = 408

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 106/353 (30%), Positives = 161/353 (45%), Gaps = 88/353 (24%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ V+LDTNP+ W+   S+L  S  + ++L F+NA L  N  N+V V+A+  N   ++Y
Sbjct: 19  SLLTVILDTNPYAWALLESTLPLSAAIANLLVFINAHLACNYANKVAVVASHCNHAAWLY 78

Query: 72  DSSSTGNQS-------------VGNGRMPSL---------CATLLQNLEEFMNKDEQLGK 109
            + ST + S               NG+ PS            +       F+  +EQL +
Sbjct: 79  PTPSTPSISQSKAIQLDADGDITMNGQ-PSTEPEPENQQQSQSPFNKYRPFLLVEEQLTR 137

Query: 110 Q-------EPEGRIACS---LLSGSLSMALCYIQR-----VFRSGLLHPQPR-------- 146
                    P   +  +   +L+G+L++AL +I R         G  +P P         
Sbjct: 138 NLHCLLSATPPSAVTSTTSTMLAGALTLALSHINRESIAYAEAHGAANPDPTSSTTGPAT 197

Query: 147 -----------------------------------ILCLQGSPDGPEQYVAIMNAIFSAQ 171
                                              I+ L  +    + Y+ IMN+IF+ Q
Sbjct: 198 STASGLPPPPNSTTADPTSAAANPLNPTALQSRILIISLSNNTHSAQHYIPIMNSIFACQ 257

Query: 172 RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ-QLDGLFQYLLTIFGTDLHSRNFLQL 230
           R  +PID   L A ++AFLQQAS  TGG++        G  QYL+  F  D  +R  L L
Sbjct: 258 RLHIPIDIIKL-AGDAAFLQQASDATGGIYIPINAHPAGFLQYLMLGFLPDQRARAHLIL 316

Query: 231 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL---KKCSTCGSVFGQAQ 280
           P  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ L    +C TCG+  G  +
Sbjct: 317 PSRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPLGGEGECLTCGTRLGMVE 369


>gi|302413233|ref|XP_003004449.1| RNA polymerase II transcription factor B subunit 4 [Verticillium
           albo-atrum VaMs.102]
 gi|261357025|gb|EEY19453.1| RNA polymerase II transcription factor B subunit 4 [Verticillium
           albo-atrum VaMs.102]
          Length = 384

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 157/324 (48%), Gaps = 62/324 (19%)

Query: 11  DDVSLVVVLLDTNPFFWSSSS--LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           +  SL  V+LD NP  WS+ S  L  S+ L++VL +LNA L  +  NQV VIA+  N   
Sbjct: 17  ETASLCTVILDVNPRAWSALSDVLPISKALSNVLIYLNANLAFSNTNQVAVIASHTNRAV 76

Query: 69  YVYDS--------SSTGNQSVGNG-----RMPSLCA-TLLQNLEEFMNKD-EQLGKQEPE 113
           ++Y +        S+ G+ ++ +      ++PS    T    +E+ + +  ++L  +  E
Sbjct: 77  WLYPTKSEPSNTPSTDGDVAMSDAPPKKAKVPSANKFTQFAQVEDAVLRSLKKLLDETTE 136

Query: 114 GRIACSL--LSGSLSMALCYIQRV-------FRSGLLHP--------QPRILCLQGSPDG 156
             + C+   LSG+L++AL +I +V         S    P           IL +  S   
Sbjct: 137 ADLDCTTTQLSGALTLALAHINKVNLSYTASLSSDTAKPMTTAAPGLSSHILVVSVSDSD 196

Query: 157 PEQYVAIMNAIFSAQRSMVPIDSCYLG---------------------AQNSAFLQQASY 195
           P QY++ MNA+F+A  + +PID   L                      A +S+FLQQAS+
Sbjct: 197 PSQYISTMNAVFAAAHASIPIDVLSLSPATAKQTAAAQTTDPNDPHKSASSSSFLQQASH 256

Query: 196 ITGGVHHKPQQLDGLFQYLLTIFG---TDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMG 252
           IT G   +P    GL  YL   FG    D  +R  L  P    VDFRA+CFCH   ID G
Sbjct: 257 ITNGTFLEPPSPLGLLAYL--TFGLAVADTRARAALVPPTHHTVDFRAACFCHGRVIDTG 314

Query: 253 YICSVCLSIYCK--HLKKCSTCGS 274
           ++CSVCLSI+C       C TCG+
Sbjct: 315 FVCSVCLSIFCSVPEGADCLTCGT 338


>gi|410976500|ref|XP_003994658.1| PREDICTED: general transcription factor IIH subunit 3 [Felis catus]
          Length = 407

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 102/336 (30%), Positives = 160/336 (47%), Gaps = 74/336 (22%)

Query: 8   LYSDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIAT 62
           L  D+++L+V+++DTNP +W    L  SQF     +  V+   N+ L +N+ N++ VIA+
Sbjct: 60  LVEDELNLLVIIVDTNPIWWGKQGLKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIAS 119

Query: 63  GYNSCDYVYDSSS---------TGNQSV-------GNGRMPSLCA---TLLQNLEEFMNK 103
                 ++Y   +          GN S         +G+     A    + + +++ M K
Sbjct: 120 HIQESRFLYPGKNGRLGDFFGDPGNPSSEFSPSGSKDGKYELFTAANEVIAEEIKDLMTK 179

Query: 104 DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS-----------------------GL 140
            +  G+         +LL+GSL+ ALCYI R+ +                         +
Sbjct: 180 SDIKGQHTE------TLLAGSLAKALCYIHRMNKEVKDNQEMKSRILFQAPNAPMSMPNI 233

Query: 141 LHPQPRILC-----------LQGSPDGPEQYVAIMNAIFSA------QRSM---VPIDSC 180
           LH      C           L+G+ D    Y     ++  A       RS+   + ID+C
Sbjct: 234 LHVGAAACCSERPWALGTGRLEGAGDSTAFYRLCDRSLAKAWVLQLQMRSIYLNILIDAC 293

Query: 181 YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 240
            L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ L LP P+ VD+RA
Sbjct: 294 VLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLILPPPIHVDYRA 352

Query: 241 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 353 ACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 388


>gi|67515713|ref|XP_657742.1| hypothetical protein AN0138.2 [Aspergillus nidulans FGSC A4]
 gi|40746160|gb|EAA65316.1| hypothetical protein AN0138.2 [Aspergillus nidulans FGSC A4]
          Length = 384

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 98/330 (29%), Positives = 151/330 (45%), Gaps = 74/330 (22%)

Query: 18  VLLDTNPFFWS------SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           V++DTNP  W+       + L FS  L ++L FLNA L  N  N+V V+A+  +   ++Y
Sbjct: 16  VIIDTNPHAWALLEQNEKNQLPFSTALANILVFLNAHLACNYANEVAVVASHTHKAAWLY 75

Query: 72  DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEP----EGRI----------- 116
            S ++   S  +    ++      +      K  Q+ K  P    E ++           
Sbjct: 76  PSPNSPTTSADSDGDVTMSNNNHMSTGGGSGKPSQVNKYRPFRIVEEQVTSNLRHLVDST 135

Query: 117 --------ACSLLSGSLSMALCYIQR----------------VFRSGL------------ 140
                   A ++++G+L++AL +I R                  RSG             
Sbjct: 136 NRDDLDSGASTMMAGALTLALSHINRRTIAWAEAHGGGDASSFSRSGDPDTSTTSMIARR 195

Query: 141 --------------LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQN 186
                         L  +  I+ +  +     QY+ IMN IF+ QR  +PID C L   +
Sbjct: 196 STTTTATNDGAAEGLQSRILIISVSSATGSAHQYIPIMNGIFACQRLHIPIDVCKLSG-D 254

Query: 187 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHK 246
           + FLQQAS  T G++    +  GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+
Sbjct: 255 AVFLQQASDATKGIYMSLSEPRGLLQYLMMAFLPDQRSRKHLILPTRVDVDFRAACFCHR 314

Query: 247 NTIDMGYICSVCLSIYCKHLKK--CSTCGS 274
             +D+G++CS+CLSI+C+  +   C TCG+
Sbjct: 315 RVVDIGFVCSICLSIFCEPPENGDCLTCGT 344


>gi|46138695|ref|XP_391038.1| hypothetical protein FG10862.1 [Gibberella zeae PH-1]
 gi|408390605|gb|EKJ69997.1| hypothetical protein FPSE_09842 [Fusarium pseudograminearum CS3096]
          Length = 355

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 94/298 (31%), Positives = 150/298 (50%), Gaps = 35/298 (11%)

Query: 11  DDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           D  SL+ ++LDTNP  W+S  S L  S+ + ++L F+NA L  +  N+V +IA   +  +
Sbjct: 17  DTPSLLSIVLDTNPRAWASLNSLLPLSRAIANILVFVNAHLAFSNANRVALIAAHVDRAE 76

Query: 69  YVYDSSSTGNQSV-GNGRM--PSLCATLLQNLEEFMNKD-------EQLGKQEPEGRIAC 118
           ++Y +    ++   G+  M   S   T      EF   +        +L  Q  E  ++ 
Sbjct: 77  WLYPTPPKPSRDASGDVEMNDASQTQTSANKFPEFAQIETAVLSGIRKLMDQTKEADLSA 136

Query: 119 SL--LSGSLSMALCYIQRVFRS-------------GLLHPQP-----RILCLQGSPDGPE 158
           +   +SG+L++ALC+I +  ++             G     P     RIL +  S   P 
Sbjct: 137 TTTQISGALTLALCHINKAAQALCSPTANLEDSHKGSSSTSPPTVRGRILVISVSDSEPS 196

Query: 159 QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIF 218
           QY+  MNA+F+A  + V ID+  L  + + FL+QA Y TGG +       GL  YL+   
Sbjct: 197 QYIPTMNAVFAAAHTQVAIDTLSLTGEPT-FLEQACYNTGGTYLAATHPQGLLNYLMFGL 255

Query: 219 GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLK--KCSTCGS 274
            +D  +R  L       VDFRA+CFCH   +D G++CS+CLSI+C+  +  +C TCG+
Sbjct: 256 MSDTEAREALINSVHDTVDFRAACFCHGRVVDTGFVCSICLSIFCETPENSECFTCGT 313


>gi|258567552|ref|XP_002584520.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237905966|gb|EEP80367.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 398

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 97/329 (29%), Positives = 152/329 (46%), Gaps = 69/329 (20%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+    +L  S  + ++L F+NA L  N  N+V V+A+      ++Y
Sbjct: 20  SLLTIVLDTNPHAWALLEETLPLSTAVANLLVFINAHLACNYANKVAVVASHSQEARWLY 79

Query: 72  DSSSTGNQSVGNGRMPSLCATLLQN----------LEEFMNKDEQLGKQEPE-------- 113
            +S+T   +  NG   +   T + +             F   +EQL +   +        
Sbjct: 80  PASTTSPSNSQNGSTDTDGDTNMSSSNPPPTQSNKYRPFRIVEEQLTRNLKDLLSTTSPA 139

Query: 114 --GRIACSLLSGSLSMALCYIQR-----------------------VFRSGLLHPQP--- 145
                  ++++G+L++AL +I R                          +  L P P   
Sbjct: 140 SLSSTTSTMMAGALTLALSHINRETIVYAETHGASSAKLDADPSNPAPTASALPPPPGST 199

Query: 146 ----------------RILCLQ--GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNS 187
                           RIL +    +     QY+ IMN+IF+ QR  +PID C L   ++
Sbjct: 200 FDPSSNTNRNLSGLQSRILIISVSSATGSAHQYIPIMNSIFACQRLHIPIDICKLSG-DA 258

Query: 188 AFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKN 247
            FLQQA   T G++       G  QYL+  F  D  SR+ L LP  V VDFRA+CFCH+ 
Sbjct: 259 VFLQQACDATRGIYVPVDHPLGFLQYLMVAFLPDQRSRSHLILPTRVDVDFRAACFCHRK 318

Query: 248 TIDMGYICSVCLSIYCKHLK--KCSTCGS 274
            +D+G++CS+CLSI+C+  +   C TCG+
Sbjct: 319 VVDVGFVCSICLSIFCEPPEGADCLTCGT 347


>gi|443727474|gb|ELU14215.1| hypothetical protein CAPTEDRAFT_169977 [Capitella teleta]
          Length = 299

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 100/285 (35%), Positives = 158/285 (55%), Gaps = 30/285 (10%)

Query: 14  SLVVVLLDTNPFFWSSSSLS------FSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           S +++++DT+P +W    L        S+ L  ++ F +A L L+  NQ+VV+A   N  
Sbjct: 8   SCLILVIDTSPVWWGKKLLKQEYGRILSESLDALMVFCSAHLMLHHQNQLVVLAAHTNCS 67

Query: 68  DYVY------DSSSTGN-----QSVGNGRMPSLCATLLQN-LEEFMNKDEQLGKQEPEGR 115
             ++      D++  G+     QS G   + +   +++++ L + +  D   G  E    
Sbjct: 68  QVIFPSPNAHDAAMNGDAAFRGQSDGKYEVFAEVQSIIKDQLTDLILHD---GGGEVHSD 124

Query: 116 IACSLLSGSLSMALCYIQRVFRSGLLHP---QPRILCLQGSPDGPEQYVAIMNAIFSAQR 172
                L+G+L+ ALCY+ ++ + GL      + R+L +QGS D P QY+ +MN IF+AQ+
Sbjct: 125 CL---LAGALARALCYVNKINK-GLPDGDAMRSRVLVVQGSEDNPLQYMDLMNIIFTAQK 180

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH-SRNFLQLP 231
             V ID+C +  ++S  LQQ   ITGG + K  Q   L QYLL +F  +   +R  L LP
Sbjct: 181 QNVIIDACIMD-EDSPILQQVCDITGGFYLKTPQPSALLQYLLWVFLLEPGGAREKLNLP 239

Query: 232 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
             V VD+RA+CFCH+N I+ GY+CSVCLS++C     CSTC + F
Sbjct: 240 AKVHVDYRAACFCHRNLINTGYVCSVCLSVFCSFSPICSTCQTTF 284


>gi|322698540|gb|EFY90309.1| transcription factor TFIIH subunit Tfb4, putative [Metarhizium
           acridum CQMa 102]
          Length = 358

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 96/302 (31%), Positives = 148/302 (49%), Gaps = 47/302 (15%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ +++DTNP  W+     L  S+ + ++L F+NA L  + +NQV V+A   N   ++Y
Sbjct: 20  SLLSIVIDTNPRAWAVLDDVLPISKAIANILVFVNAHLAFSNVNQVAVVAAHVNRAVWLY 79

Query: 72  ---------DSS-----------STGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQE 111
                    D+S           S  N +    +   +   +L ++++ M++       E
Sbjct: 80  PTPPQTSPKDTSGDVHMIDVPGDSPRNSANKYPQFAQIETAVLASMQKLMSE-----TTE 134

Query: 112 PEGRIACSLLSGSLSMALCYIQRVFR-----SGLLH--PQP----------RILCLQGSP 154
            +     + LSG+L++ALC+I +  +     SG L    QP          RIL +  S 
Sbjct: 135 QDLEATTTQLSGALTLALCHINKASQALSGASGSLAEVAQPSANAALPMKGRILVVSVSD 194

Query: 155 DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYL 214
             P QY+ IMNA+F+A  + V ID+  L   N  FLQQA + T G         GL  YL
Sbjct: 195 SEPSQYIPIMNAVFAAAHAQVAIDTLCLSG-NPTFLQQACFNTNGTFLAAANPKGLLAYL 253

Query: 215 LTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTC 272
           +     D  +R  L  P    VDFRA+CFCH   +D G++CS+CLSI+C+     +C TC
Sbjct: 254 MFGLIADTEARQSLITPTHDTVDFRAACFCHGKVVDRGFVCSICLSIFCELPENAECLTC 313

Query: 273 GS 274
           G+
Sbjct: 314 GT 315


>gi|358396042|gb|EHK45429.1| hypothetical protein TRIATDRAFT_221909 [Trichoderma atroviride IMI
           206040]
          Length = 367

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 100/310 (32%), Positives = 152/310 (49%), Gaps = 47/310 (15%)

Query: 14  SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ +++DTNP  W++  S LS SQ ++++L F+NA L  +  NQV VIA   N   ++Y
Sbjct: 20  SLLSIVIDTNPRAWAALESRLSLSQAISNILVFVNAHLAFSNTNQVAVIAAHVNRAVWLY 79

Query: 72  DSSSTGNQSV--------GNGRMPSLCA-TLLQNLEEFMNKDEQLGKQE----------- 111
            +++   Q          G+ +M  + A T   +     NK  Q  + E           
Sbjct: 80  PAAAAAAQKPLAAARDHSGDVQMQDVSAETNNSSSSPSANKYPQFAQIESSVFSSIQSLM 139

Query: 112 PEGRI-----ACSLLSGSLSMALCYIQRVFRS--------------GLLHPQP---RILC 149
            E  +       + LSG+L++ALC I +  ++                  P P   RI+ 
Sbjct: 140 AETTVQDLDQVTTQLSGALTLALCRINKASQALSASDTTLSNAAPVNSAAPPPVKSRIVV 199

Query: 150 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 209
           +  S   P QY+  MNA+F+A  + V ID+  L A +S FLQQA + T G+  K     G
Sbjct: 200 ISVSDSEPSQYIPTMNAVFAAAHNQVAIDTIAL-AGDSTFLQQACFNTNGIFLKASNPQG 258

Query: 210 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLK-- 267
           L  YL+     D  +R  +  P    VDFR +CFCH   +D G++CSVCLSI+C+  +  
Sbjct: 259 LLTYLMFGLIPDTEARESIITPAHDTVDFRTACFCHGRVVDTGFVCSVCLSIFCEPPENA 318

Query: 268 KCSTCGSVFG 277
           +C TCG+V  
Sbjct: 319 ECLTCGTVLA 328


>gi|344300009|gb|EGW30349.1| hypothetical protein SPAPADRAFT_63205 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 367

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 104/354 (29%), Positives = 160/354 (45%), Gaps = 65/354 (18%)

Query: 3   SAPSKLYSDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           ++ S+  SD+ SL+ ++LD +P  W      ++  +   ++L FLNA L LN  NQV  I
Sbjct: 13  TSSSEAISDEPSLLTIILDISPLGWYKLRDLMTLEEMTKNLLVFLNAHLALNNSNQVAFI 72

Query: 61  ATGYNSCDYVY-------------DSSSTGNQSVGNG---RMPSLCATLLQNLEEFMNKD 104
           A+      ++Y             D +++    V  G   +   +   +L+ L EF+N  
Sbjct: 73  ASSPKGSRFLYPNPAKSYQEVKVSDGATSARDFVNKGMYRQFRIVDKAVLEELNEFLN-- 130

Query: 105 EQLGKQEPEGRIA--CSLLSGSLSMALCYIQRVF-------------------------- 136
            ++ K + +  +A   S LSG+LSMAL Y  R+                           
Sbjct: 131 -EINKDQDDTMLADPASKLSGALSMALTYTNRMLTLDQSITTTTASAIASTTSSSASAAS 189

Query: 137 -----------RSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-A 184
                       S     + RIL +  + D   +Y+ IMN IF+AQ+  + ID   LG  
Sbjct: 190 STATGAGGASSGSSSTSMKSRILVVSANDDNDVKYIPIMNTIFAAQKMKLSIDIIKLGDE 249

Query: 185 QNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFC 244
           +NS++LQQ S  T G++       G+ Q L T +  +   R F+ LP    V++RASCF 
Sbjct: 250 ENSSYLQQPSDATNGIYLHLTDPRGIIQVLSTAYFIEPSLRPFIILPTNSNVNYRASCFI 309

Query: 245 HKNTIDMGYICSVCLSIY--CKHLKKCSTCGSVFGQAQTQ--SDEPSATNRKRK 294
               +D+GY+CSVCL I       KKC TC S F +      + EP    +KRK
Sbjct: 310 TGKAVDVGYVCSVCLCIMSIIPDSKKCPTCNSQFDERIIHQLTKEPEVVKKKRK 363


>gi|398392135|ref|XP_003849527.1| hypothetical protein MYCGRDRAFT_75897 [Zymoseptoria tritici IPO323]
 gi|339469404|gb|EGP84503.1| hypothetical protein MYCGRDRAFT_75897 [Zymoseptoria tritici IPO323]
          Length = 386

 Score =  137 bits (344), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 102/331 (30%), Positives = 164/331 (49%), Gaps = 65/331 (19%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+    SLS S+ + ++L F+NA + +N  N+V V+A+     +++Y
Sbjct: 20  SLLTIILDTNPHAWALLEDSLSLSKVVVNLLVFVNAHIAINHANRVAVLASHSERAEWLY 79

Query: 72  DSSSTGNQSVGNG---RMPSLCATL------------------LQNLEEFMNKDEQLGKQ 110
            ++ + +    NG    M    AT+                   Q      N   +L ++
Sbjct: 80  PTAPSSHSHRTNGDDVEMADSTATMNGGANHPPPDDPNKYRPFAQIEHALANNLRKLIER 139

Query: 111 EPEGRIACS---LLSGSLSMALCYIQRVFRS----------------------------- 138
                ++ +   +++G+L+MAL YI +   +                             
Sbjct: 140 TSTSSLSATPATMMAGALTMALSYISKQSAALPTTTSSAQFNYSDPNAGPAGGNDLAADN 199

Query: 139 ----GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQAS 194
                LL    RIL L  S D  +QY+ IMN+IF+ QR  +PID   L A ++ FLQQA+
Sbjct: 200 KGPNNLLGLTSRILILSVSGDLADQYIPIMNSIFACQRLSIPIDILKL-AGDTVFLQQAA 258

Query: 195 YITGGVHH--KPQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVGVDFRASCFCHKNTIDM 251
             TGG++     Q   G  QYL+  +  D  +RN L +P +  GVDFRA+CFCH+  +D+
Sbjct: 259 DATGGIYMALDEQSRAGFLQYLMFAYLPDQTARNHLIMPGESEGVDFRAACFCHRRVVDI 318

Query: 252 GYICSVCLSIYCKHLK--KCSTCGSVFGQAQ 280
           G++CS+CLSI+C+ ++  +C TCGS    A 
Sbjct: 319 GFVCSICLSIFCEPMQDGQCLTCGSHLSMAN 349


>gi|255545470|ref|XP_002513795.1| hypothetical protein RCOM_1031970 [Ricinus communis]
 gi|223546881|gb|EEF48378.1| hypothetical protein RCOM_1031970 [Ricinus communis]
          Length = 159

 Score =  137 bits (344), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 83/201 (41%), Positives = 106/201 (52%), Gaps = 51/201 (25%)

Query: 41  VLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEF 100
           VLAFLN+IL  NQL+QV+VI +GYNSCDYVYDSS   N+S  +G++P+L + LLQ LE+F
Sbjct: 5   VLAFLNSILLFNQLSQVLVIGSGYNSCDYVYDSSLIANRSSEDGKIPTLYSELLQKLEDF 64

Query: 101 MNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQY 160
           + +DE+LGK E         L  +L +       ++   LL P                 
Sbjct: 65  IIRDEKLGKGE-------EFLKETLPLHCSQDPCLWHFALLIP----------------- 100

Query: 161 VAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGT 220
               + +       VPIDSCY+G  NSAFLQQ                       T+  T
Sbjct: 101 ----DTMVRKNEGKVPIDSCYVGGHNSAFLQQ-----------------------TVLAT 133

Query: 221 DLHSRNFLQLPKPVGVDFRAS 241
           DLHSRN LQLP+P  VDFRAS
Sbjct: 134 DLHSRNCLQLPRPADVDFRAS 154


>gi|226468218|emb|CAX69786.1| putative general transcription factor IIH [Schistosoma japonicum]
          Length = 317

 Score =  137 bits (344), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 96/314 (30%), Positives = 152/314 (48%), Gaps = 50/314 (15%)

Query: 8   LYSDDVSLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
           + S+  SL++++LD  P  W   +S       +   L F N+ L L+  N+V VI    +
Sbjct: 1   MSSEVRSLLLLILDMTPASWGFCTSDFGLPNCIEAALGFANSHLMLSSFNEVAVIGVTPS 60

Query: 66  SCDYVYDS-SSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGS 124
              ++Y S S T   +  +G+  +L  + + N    ++ D          +I   +L+G+
Sbjct: 61  QIKFIYPSHSETLVGASNDGQNDAL--SCMNNTVRQLSLDLVTSCSSTSTQI---VLAGA 115

Query: 125 LSMALCYIQR--------------------VFRSGLLHPQPRILCLQGSPDGPEQYVAIM 164
           +   LCY  R                     F  G      RIL ++ S D   QY+++M
Sbjct: 116 IIKGLCYYLRRCRELQPSTRHLDDPSETTMEFSEGNDKISARILIIKASDDSTSQYLSLM 175

Query: 165 NAIFSAQRSMVPIDSCYL----------------GAQNSAFLQQASYITGGVHHKPQQLD 208
           N++F+AQ+  VPID+C L                   +S+ LQQA+ +TGG++ +  ++ 
Sbjct: 176 NSVFTAQKLHVPIDTCVLPLPRQLDNLNGRRSISSPGHSSLLQQAADLTGGIYLQIPRVS 235

Query: 209 GLFQYLLTIFGTDLHSRNFLQLPKP------VGVDFRASCFCHKNTIDMGYICSVCLSIY 262
           GL QYLL++F      R  L LP         GVDFRA+CFCHK  ID+GY+CS+CLS++
Sbjct: 236 GLLQYLLSVFLPSSKMRTSLILPDSRSSGLSFGVDFRAACFCHKRLIDIGYVCSICLSVF 295

Query: 263 CKHLKKCSTCGSVF 276
           C+    C TC + F
Sbjct: 296 CEFNPICPTCNTPF 309


>gi|303314233|ref|XP_003067125.1| Transcription factor Tfb4 family protein [Coccidioides posadasii
           C735 delta SOWgp]
 gi|240106793|gb|EER24980.1| Transcription factor Tfb4 family protein [Coccidioides posadasii
           C735 delta SOWgp]
 gi|320037386|gb|EFW19323.1| transcription factor TFIIH subunit Tfb4 [Coccidioides posadasii
           str. Silveira]
          Length = 395

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 99/335 (29%), Positives = 150/335 (44%), Gaps = 81/335 (24%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+    +L  S  + ++L F+NA L  N  N+V V+A+      ++Y
Sbjct: 17  SLLTIVLDTNPHAWALLEDTLPLSTAVANLLVFVNAHLACNYANKVAVVASHSQEARWLY 76

Query: 72  D-----------------------SSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQ-- 106
                                   SS+TG     N   P       + +EE + ++ +  
Sbjct: 77  PTPTTASSNTNQDEPADTDGDVAMSSTTGESPASNKYRP------FRIVEEQLTRNLKAL 130

Query: 107 LGKQEPEG--RIACSLLSGSLSMALCYIQRV------------FRSGLLHPQPRILCL-- 150
           L    P        ++++G+L++AL +I R               S   +P P    L  
Sbjct: 131 LATTSPASVSSATSTMMAGALTLALSHINRETIAYAETHGTSRLDSDPSNPTPTATGLPP 190

Query: 151 -------------QGSPDG----------------PEQYVAIMNAIFSAQRSMVPIDSCY 181
                        + SP G                  QY+ IMN+IF+ QR  +PID C 
Sbjct: 191 PPGSHPDPTSQSSRSSPTGLQSRILIISVSSATGSAHQYIPIMNSIFACQRLHIPIDICK 250

Query: 182 LGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRAS 241
           L   ++ FLQQA   T G++       G  QYL+  F  D  SR  L LP  V VDFRA+
Sbjct: 251 LSG-DAVFLQQACDATRGIYVPVDHPRGFLQYLMVAFLPDQRSRRHLILPTRVDVDFRAA 309

Query: 242 CFCHKNTIDMGYICSVCLSIYCKHLK--KCSTCGS 274
           CFCH+  +D+G++CS+CLSI+C+  +   C TCG+
Sbjct: 310 CFCHRKVVDVGFVCSICLSIFCEPPEGADCLTCGT 344


>gi|226467818|emb|CAX69785.1| putative general transcription factor IIH [Schistosoma japonicum]
          Length = 317

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 95/314 (30%), Positives = 152/314 (48%), Gaps = 50/314 (15%)

Query: 8   LYSDDVSLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
           + S+  SL++++LD  P  W   +S       +   L F N+ L L+  N+V VI    +
Sbjct: 1   MSSEVRSLLLLILDMTPASWGFCTSDFGLPNCIEAALGFANSHLMLSSFNEVAVIGVTPS 60

Query: 66  SCDYVYDS-SSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGS 124
              ++Y S S T   +  +G+  +L  + + N    ++ D          +I   +L+G+
Sbjct: 61  QIKFIYPSHSETLVGASNDGQNDAL--SCMNNTVRQLSLDLVTSCSSTSTQI---VLAGA 115

Query: 125 LSMALCYIQR--------------------VFRSGLLHPQPRILCLQGSPDGPEQYVAIM 164
           +   LCY  R                     F  G      RIL ++ S D   QY+++M
Sbjct: 116 IIKGLCYYLRRCRELQPSTRHLDDPSETTMEFSEGNDKISARILIIKASDDSTSQYLSLM 175

Query: 165 NAIFSAQRSMVPIDSCYL----------------GAQNSAFLQQASYITGGVHHKPQQLD 208
           N++F+AQ+  VPID+C L                   +S+ LQQA+ +TGG++ +  ++ 
Sbjct: 176 NSVFTAQKLHVPIDTCVLPLPRQLDNLNGRRSISSPGHSSLLQQAADLTGGIYLQIPRVS 235

Query: 209 GLFQYLLTIFGTDLHSRNFLQLPKP------VGVDFRASCFCHKNTIDMGYICSVCLSIY 262
           GL QYLL++F      R  L LP         GVDFRA+CFCH+  ID+GY+CS+CLS++
Sbjct: 236 GLLQYLLSVFLPSSKMRTSLILPDSRSSGLSFGVDFRAACFCHRRLIDIGYVCSICLSVF 295

Query: 263 CKHLKKCSTCGSVF 276
           C+    C TC + F
Sbjct: 296 CEFNPICPTCNTPF 309


>gi|225712364|gb|ACO12028.1| General transcription factor IIH subunit 3 [Lepeophtheirus
           salmonis]
 gi|290562788|gb|ADD38789.1| General transcription factor IIH subunit 3 [Lepeophtheirus
           salmonis]
          Length = 305

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 99/299 (33%), Positives = 152/299 (50%), Gaps = 29/299 (9%)

Query: 15  LVVVLLDTNP----FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYV 70
           L+++++D NP    F    ++L+    L ++L F NA L L+  N +  + +  +   ++
Sbjct: 17  LLIIVMDMNPNQKLFLHDPTALT--SVLDNLLCFANAHLMLHPSNALATLGSMASGSYFL 74

Query: 71  Y---DSSSTGN---QSVGNGRMPSLCATLLQN-LEEFMNKDEQLGKQEPEGRIACSLLSG 123
           Y   D  +  N   Q  G   + +L  T +++   E +  +  +  Q        S LSG
Sbjct: 75  YPPPDDPAADNEIRQLDGQYELFTLVETTVRSKFIELLQSEAGISSQTD------SPLSG 128

Query: 124 SLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG 183
           SL+MAL YI R  R   +    RIL +  S D   QY+  MN  F+AQ+  V +D+C L 
Sbjct: 129 SLAMALSYINRR-RKENMDLSARILVITASGDTASQYMNYMNVFFTAQKLDVLLDTCMLQ 187

Query: 184 AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCF 243
             +S  LQQ + ITGG++        L Q+LL IF      R  L LP    VDFRA+CF
Sbjct: 188 I-DSPLLQQGADITGGMYFNVPDNAALLQFLLWIFLPSAEMRPQLGLPSANKVDFRAACF 246

Query: 244 CHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF--------GQAQTQSDEPSATNRKRK 294
           CH+  +D+G++CSVCLSI+CK    C+TC ++F        G+ +      S++  KRK
Sbjct: 247 CHRQLVDIGFVCSVCLSIFCKFSPICTTCQTIFRVPPPLMGGKKKKSHAHGSSSVLKRK 305


>gi|320167472|gb|EFW44371.1| general transcription factor IIH [Capsaspora owczarzaki ATCC 30864]
          Length = 381

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)

Query: 110 QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMN 165
           + P      S L  SLS+ALC+I +  R+  L P    + RIL +  + D P  YV +MN
Sbjct: 201 KPPMSSAMTSQLVSSLSVALCHINKTRRA--LPPGVVCRSRILVVHAAEDVPAHYVQLMN 258

Query: 166 AIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSR 225
            IF+AQ+  V ID+  L   +S FLQQA  +T G + K Q  D L Q L+     D  +R
Sbjct: 259 TIFAAQKLGVIIDALILTPDDSGFLQQAVDLTKGAYIKLQSNDDLVQTLMMFGLADRAAR 318

Query: 226 NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
             L LP+P  VD+R++CFCH+  +D+G++CSVCLSI+C     C TCG+ F
Sbjct: 319 ELLILPQPTVVDYRSACFCHRRIVDVGFVCSVCLSIFCDQRGLCPTCGTRF 369


>gi|240274744|gb|EER38259.1| RNA polymerase II transcription factor B subunit 4 [Ajellomyces
           capsulatus H143]
          Length = 412

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 108/346 (31%), Positives = 159/346 (45%), Gaps = 86/346 (24%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+    SL  S  + ++L F+NA L  N  N+V V+A+  +   ++Y
Sbjct: 20  SLLTIILDTNPHAWARLEPSLPLSAAIANLLVFINAHLACNYANKVAVVASHCHHAAWLY 79

Query: 72  DSSSTGNQSVGNGRMPSLCATLLQNL----------------EEFMNK-------DEQLG 108
            + ++   S      PS  A    ++                EE +NK       +EQL 
Sbjct: 80  PTPASPKSSDSQTDAPSKDAPPDGDINMNGQQSSGQHQQAEQEEPLNKYRPFRLVEEQLT 139

Query: 109 KQ-------EPEGRIACS---LLSGSLSMALCYIQR-----------------------V 135
           +             IA +   L++G+L++AL +I R                        
Sbjct: 140 RNLHTLLSTTTAADIASTTSTLIAGALTLALSHINRETIAYAETHGTARPDGSTSGPANS 199

Query: 136 FRSGLLHP------------------QPRILC--LQGSPDGPEQYVAIMNAIFSAQRSMV 175
             SGL  P                  Q RIL   L  +      Y+ IMN+IF+ QR  +
Sbjct: 200 TASGLPPPPGTTASATTSHLPNPSALQSRILIISLSSTTHSAHHYIPIMNSIFACQRLHI 259

Query: 176 PIDSCYLGAQNSAFLQQASYITGGVH-HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 234
           PID   L A ++AFLQQAS  T G++        G  QYL+  F  D  +R+ L LP  V
Sbjct: 260 PIDILKL-AGDAAFLQQASDATRGIYIPATSHPAGFLQYLMLGFLPDQRARSHLVLPSRV 318

Query: 235 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHL------KKCSTCGS 274
            VDFRA+CFCH+  +D+G++CS+CLSI+C+ L       +C TCGS
Sbjct: 319 DVDFRAACFCHRRVVDVGFVCSICLSIFCEPLAGVDGVAECLTCGS 364


>gi|154286010|ref|XP_001543800.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407441|gb|EDN02982.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 413

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 103/347 (29%), Positives = 157/347 (45%), Gaps = 87/347 (25%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+    SL  S  + ++L F+NA L  N  N+V V+A+  +   ++Y
Sbjct: 20  SLLTIILDTNPHAWARLEPSLPLSAAIANLLVFINAHLACNYANKVAVVASHCHHAAWLY 79

Query: 72  DSSSTGNQSVGNGRMPSLCAT----LLQNLEEFMNKDEQLGKQEPEGR------------ 115
            + ++   S      PS  A     +  N ++   + +Q  ++EP  +            
Sbjct: 80  PTPASPKSSDSQTDAPSKDAPPDGDISMNDQQSAGQHQQAEQEEPLNKYRPFRLVEEQLT 139

Query: 116 -----------------IACSLLSGSLSMALCYIQR------------------------ 134
                               +L++G+L++AL +I R                        
Sbjct: 140 RNLHTLLSTTTAADIASTTSTLIAGALTLALSHINRETIAYAETHGTARPDGSSTSGPAN 199

Query: 135 VFRSGLLHP------------------QPRILC--LQGSPDGPEQYVAIMNAIFSAQRSM 174
              SGL  P                  Q RIL   L  +      Y+ IMN+IF+ QR  
Sbjct: 200 STASGLPPPPGTTASATTSHLPNPSALQSRILIISLSSTTHSAHHYIPIMNSIFACQRLH 259

Query: 175 VPIDSCYLGAQNSAFLQQASYITGGVH-HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKP 233
           +PID   L A ++AFLQQAS  T G++        G  QYL+  F  D  +R+ L LP  
Sbjct: 260 IPIDILKL-AGDAAFLQQASDATRGIYIPATSHPAGFLQYLMLGFLPDQRARSHLVLPSR 318

Query: 234 VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL------KKCSTCGS 274
           V VDFRA+CFCH+  +D+G++CS+CLSI+C+ L       +C TCGS
Sbjct: 319 VDVDFRAACFCHRRVVDVGFVCSICLSIFCEPLGGVDGVAECLTCGS 365


>gi|325094096|gb|EGC47406.1| RNA polymerase II transcription factor B subunit 4 [Ajellomyces
           capsulatus H88]
          Length = 412

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 108/346 (31%), Positives = 159/346 (45%), Gaps = 86/346 (24%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+    SL  S  + ++L F+NA L  N  N+V V+A+  +   ++Y
Sbjct: 20  SLLTIILDTNPHAWARLEPSLPLSAAIANLLVFINAHLACNYANKVAVVASHCHHAAWLY 79

Query: 72  DSSSTGNQSVGNGRMPSLCATLLQNL----------------EEFMNK-------DEQLG 108
            + ++   S      PS  A    ++                EE +NK       +EQL 
Sbjct: 80  PTPASPKSSDSQTDAPSKDAPPDGDINMNGQQSSGQHQQAEQEEPLNKYRPFRLVEEQLT 139

Query: 109 KQ-------EPEGRIACS---LLSGSLSMALCYIQR-----------------------V 135
           +             IA +   L++G+L++AL +I R                        
Sbjct: 140 RNLHTLLSTTTAADIASTTSTLIAGALTLALSHINRETIAYAETHGTARPDGSTSGPANS 199

Query: 136 FRSGLLHP------------------QPRILC--LQGSPDGPEQYVAIMNAIFSAQRSMV 175
             SGL  P                  Q RIL   L  +      Y+ IMN+IF+ QR  +
Sbjct: 200 TASGLPPPPGTTASATTSHLPNPSALQSRILIISLSSTTHSAHHYIPIMNSIFACQRLHI 259

Query: 176 PIDSCYLGAQNSAFLQQASYITGGVH-HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 234
           PID   L A ++AFLQQAS  T G++        G  QYL+  F  D  +R+ L LP  V
Sbjct: 260 PIDILKL-AGDAAFLQQASDATRGIYIPATPHPAGFLQYLMLGFLPDQRARSHLVLPSRV 318

Query: 235 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHL------KKCSTCGS 274
            VDFRA+CFCH+  +D+G++CS+CLSI+C+ L       +C TCGS
Sbjct: 319 DVDFRAACFCHRRVVDVGFVCSICLSIFCEPLAGVDGVAECLTCGS 364


>gi|255070041|ref|XP_002507102.1| RNA polymerase II transcription factor B subunit 4 [Micromonas sp.
           RCC299]
 gi|226522377|gb|ACO68360.1| RNA polymerase II transcription factor B subunit 4 [Micromonas sp.
           RCC299]
          Length = 291

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 90/254 (35%), Positives = 130/254 (51%), Gaps = 6/254 (2%)

Query: 28  SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMP 87
           S   L     L  ++ F++  L LN  N +VV+A     C Y+Y+S  +  +     + P
Sbjct: 35  SDDGLGIMHTLRDLVLFISCYLALNLQNDLVVLALHNGDCHYLYESPKSKLRRGVRVQPP 94

Query: 88  --SLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP 145
              +C  ++  +    +   Q    +     + S L+  LSMALC+IQR+  +G      
Sbjct: 95  VSDVCKEIITRIFRISSSPLQSPSGDASSSGSESPLAAGLSMALCHIQRL-GTGSFRGLR 153

Query: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 205
           RI CL  SP    QY+ +MN IF+AQ++ V IDS  L   +S  L+QA+ +T G+H K Q
Sbjct: 154 RIFCLLRSPVSQRQYIPMMNVIFAAQQAFVTIDSYSLCDMHSDILEQAANMTNGLHRKLQ 213

Query: 206 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 265
           +   L Q+LLT        R+ L   K  GVDF+  CFCHK T+  G++CS CLSI C  
Sbjct: 214 KDGELGQHLLTFSSWSQTCRHSLLPGKQCGVDFKTFCFCHKKTLQRGFVCSACLSISCVA 273

Query: 266 LK---KCSTCGSVF 276
           LK    C TC + F
Sbjct: 274 LKTGGDCMTCAANF 287


>gi|225558318|gb|EEH06602.1| RNA polymerase II transcription factor B subunit 4 [Ajellomyces
           capsulatus G186AR]
          Length = 412

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 108/346 (31%), Positives = 159/346 (45%), Gaps = 86/346 (24%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+    SL  S  + ++L F+NA L  N  N+V V+A+  +   ++Y
Sbjct: 20  SLLTIILDTNPHAWARLEPSLPLSAAIANLLVFINAHLACNYANKVAVVASHCHHAAWLY 79

Query: 72  DSSSTGNQSVGNGRMPSLCATLLQNL----------------EEFMNK-------DEQLG 108
            + ++   S      PS  A    ++                EE +NK       +EQL 
Sbjct: 80  PTPASPKSSDSQTDPPSKDAPPDGDINMNGQQSAGQYQQAEQEESLNKYRPFRLVEEQLT 139

Query: 109 KQ-------EPEGRIACS---LLSGSLSMALCYIQR-----------------------V 135
           +             IA +   L++G+L++AL +I R                        
Sbjct: 140 RNLHTLLSTTTAADIASTTSTLIAGALTLALSHINRETIAYAETHGTARPDGSTSGPANS 199

Query: 136 FRSGLLHP------------------QPRILC--LQGSPDGPEQYVAIMNAIFSAQRSMV 175
             SGL  P                  Q RIL   L  +      Y+ IMN+IF+ QR  +
Sbjct: 200 TASGLPPPPGTTASATTSHLPNPSALQSRILIISLSSTTHSAHHYIPIMNSIFACQRLHI 259

Query: 176 PIDSCYLGAQNSAFLQQASYITGGVH-HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 234
           PID   L A ++AFLQQAS  T G++        G  QYL+  F  D  +R+ L LP  V
Sbjct: 260 PIDILKL-AGDAAFLQQASDATRGIYIPATSHPAGFLQYLMLGFLPDQRARSHLVLPSRV 318

Query: 235 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHL------KKCSTCGS 274
            VDFRA+CFCH+  +D+G++CS+CLSI+C+ L       +C TCGS
Sbjct: 319 DVDFRAACFCHRRVVDVGFVCSICLSIFCEPLAGVDGVAECLTCGS 364


>gi|119174356|ref|XP_001239539.1| hypothetical protein CIMG_09160 [Coccidioides immitis RS]
 gi|392869735|gb|EAS28255.2| transcription factor tfb4 [Coccidioides immitis RS]
          Length = 396

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 99/330 (30%), Positives = 151/330 (45%), Gaps = 70/330 (21%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W+    +L  S  + ++L F+NA L  N  N+V V+A+      ++Y
Sbjct: 17  SLLTIVLDTNPHAWALLEDTLPLSTAVANLLVFVNAHLACNYANKVAVVASHSQEARWLY 76

Query: 72  DSSSTG----NQ-------------SVGNGRMPSLCA-TLLQNLEEFMNKDEQ--LGKQE 111
            + +T     NQ             S   G  P+       + +EE + ++ +  L    
Sbjct: 77  PTPTTAPSNTNQDEPADTDGDVAMSSTTGGESPASNKYRPFRIVEEQLTRNLKALLATTS 136

Query: 112 PEG--RIACSLLSGSLSMALCYIQRV------------FRSGLLHPQPRILCL------- 150
           P        ++++G+L++AL +I R               S   +P P    L       
Sbjct: 137 PASVSSATSTMMAGALTLALSHINRETIAYAETHGTSRLDSDPSNPTPTATGLPPPPGSH 196

Query: 151 --------QGSPDG----------------PEQYVAIMNAIFSAQRSMVPIDSCYLGAQN 186
                     SP G                  QY+ IMN+IF+ QR  +PID C L   +
Sbjct: 197 PDPTSQSSHSSPTGLQSRILIISVSSATGSAHQYIPIMNSIFACQRLHIPIDICKLSG-D 255

Query: 187 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHK 246
           + FLQQA   T G++       G  QYL+  F  D  SR  L LP  V VDFRA+CFCH+
Sbjct: 256 AVFLQQACDATRGIYVPVDHPRGFLQYLMVAFLPDQRSRRHLILPTRVDVDFRAACFCHR 315

Query: 247 NTIDMGYICSVCLSIYCKHLK--KCSTCGS 274
             +D+G++CS+CLSI+C+  +   C TCG+
Sbjct: 316 KVVDVGFVCSICLSIFCEPPEGADCLTCGT 345


>gi|388582433|gb|EIM22738.1| transcription factor Tfb4 [Wallemia sebi CBS 633.66]
          Length = 272

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 75/183 (40%), Positives = 106/183 (57%), Gaps = 18/183 (9%)

Query: 117 ACSLLSGSLSMALCYIQRVF----RSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 172
           A S L+ +L+ +LC++ R      +S       RIL +  S D P +YV++MN+IF+AQ+
Sbjct: 85  ASSTLASTLAQSLCHVNRKIVQESKSTF-----RILVVAASDDVPPEYVSLMNSIFAAQK 139

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
           S + ID   +   N+ FLQQA+++TGG   +    + L QYL+  F      R  L  P+
Sbjct: 140 SKIVIDVVQIYNCNTIFLQQAAHLTGGNFIQATDPESLVQYLIMAFLPSQPLRARLVQPR 199

Query: 233 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF---------GQAQTQS 283
              VDFRA+CFCHK  +D+ YICSVCLSI+C+  ++CSTC SVF         G  QT S
Sbjct: 200 SDKVDFRAACFCHKRIVDIAYICSVCLSIFCQPPQRCSTCNSVFPAKTLQKFTGSLQTHS 259

Query: 284 DEP 286
             P
Sbjct: 260 HAP 262


>gi|119618834|gb|EAW98428.1| general transcription factor IIH, polypeptide 3, 34kDa, isoform
           CRA_c [Homo sapiens]
          Length = 169

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 65/138 (47%), Positives = 92/138 (66%), Gaps = 1/138 (0%)

Query: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 205
           RIL ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  
Sbjct: 14  RILVIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVP 72

Query: 206 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 265
           Q+  L QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C  
Sbjct: 73  QMPSLLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNF 132

Query: 266 LKKCSTCGSVFGQAQTQS 283
              C+TC +   + +T+S
Sbjct: 133 SPICTTCDAESQEKETES 150


>gi|428673526|ref|NP_001258797.1| general transcription factor IIH subunit 3 isoform d [Homo sapiens]
          Length = 162

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 64/131 (48%), Positives = 88/131 (67%), Gaps = 1/131 (0%)

Query: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 205
           RIL ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  
Sbjct: 14  RILVIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVP 72

Query: 206 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 265
           Q+  L QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C  
Sbjct: 73  QMPSLLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNF 132

Query: 266 LKKCSTCGSVF 276
              C+TC + F
Sbjct: 133 SPICTTCETAF 143


>gi|393217350|gb|EJD02839.1| Tfb4-domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 432

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 98/317 (30%), Positives = 146/317 (46%), Gaps = 61/317 (19%)

Query: 11  DDVSLVVVLLDTNPFFWSSSSL-------SFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           D+ S + ++LD  P  W  SSL       SF  FL   LAFLN  LTL   N +VV    
Sbjct: 29  DNPSHLAIILDLFPTQWHLSSLPKNAYPHSFKSFLAQTLAFLNFHLTLKHENALVVYGAF 88

Query: 64  YNSCDYVYDSSSTGNQ------SVGNGRMP------SLCATLLQNLEEFMNKDEQLGKQE 111
                 +Y S+    +      +V N  +P      ++   + + L+   + DE+  +Q 
Sbjct: 89  PGKSVLLYSSTDHKAEGADDSIAVPNTYLPFKVVDTAVTKRISEELDAMSDFDEEALRQ- 147

Query: 112 PEGRIACSLLSGSLSMALCYIQRVFRSG-------------------------------L 140
                    L G+L+ ALC     F                                  +
Sbjct: 148 ---------LVGALTKALCRTGLSFLKPCSNLDLLDERAVIDINWLVNPTATSIDSDVPV 198

Query: 141 LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR-SMVPIDSCYLGAQNSAFLQQASYITGG 199
           + P+PRI+ L  SPD    Y+ +MN+IFSAQ    V ID C + A ++ FLQQA+++TGG
Sbjct: 199 VSPEPRIVILSVSPDLSTSYIPLMNSIFSAQNLCKVAIDVCKIYAPDAVFLQQAAHLTGG 258

Query: 200 VHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCL 259
            +   ++ D L Q+LL  F      R  + +P    +DFR +CF HK  +D+G++CSV L
Sbjct: 259 SYIYLERRDALLQHLLMTFLPTPSIRQPIAVPTQDKIDFRVACFFHKTIVDIGFVCSVRL 318

Query: 260 SIYCKHLKKCSTCGSVF 276
           SI C+ +  CSTC + F
Sbjct: 319 SILCQLVPVCSTCRTKF 335


>gi|441630273|ref|XP_004089524.1| PREDICTED: general transcription factor IIH subunit 3 isoform 4
           [Nomascus leucogenys]
          Length = 162

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 64/131 (48%), Positives = 88/131 (67%), Gaps = 1/131 (0%)

Query: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 205
           RIL ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  
Sbjct: 14  RILVIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVP 72

Query: 206 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 265
           Q+  L QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C  
Sbjct: 73  QMPSLLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNF 132

Query: 266 LKKCSTCGSVF 276
              C+TC + F
Sbjct: 133 SPICTTCETAF 143


>gi|389644740|ref|XP_003720002.1| RNA polymerase II transcription factor B subunit 4 [Magnaporthe
           oryzae 70-15]
 gi|351639771|gb|EHA47635.1| RNA polymerase II transcription factor B subunit 4 [Magnaporthe
           oryzae 70-15]
 gi|440470636|gb|ELQ39698.1| RNA polymerase II transcription factor B subunit 4 [Magnaporthe
           oryzae Y34]
 gi|440480828|gb|ELQ61470.1| RNA polymerase II transcription factor B subunit 4 [Magnaporthe
           oryzae P131]
          Length = 380

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 99/329 (30%), Positives = 155/329 (47%), Gaps = 71/329 (21%)

Query: 11  DDVSLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           D  SL+VV++DTNP  W+     L  S+ + ++L F+NA L L+  + V VIA   +   
Sbjct: 17  DTPSLLVVVIDTNPRAWALLRDVLPISKAIANILVFVNAHLALSSSHSVAVIAAHTHRAV 76

Query: 69  YVY------------DSSSTGNQSVGNGRMPS------------LCATLLQNLEEFMNKD 104
           ++Y            D   T   +  + + P+            + +TLL +L + +   
Sbjct: 77  WLYPSPPKPPVRDANDVEMTDADAGKDKKQPATSSANKLPQFAHIESTLLSSLHDLI--- 133

Query: 105 EQLGKQEPEGRIACSLLSGSLSMALCYIQR----VFRSGL--LHPQP------------- 145
            Q   +        +L+SG LS+AL +I +       +G+     +P             
Sbjct: 134 -QSTTKAELASTTTTLISGGLSLALAHINKTKELAMATGIDATKAEPAAPGVSASTAGAG 192

Query: 146 ---------------RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL-GAQNSAF 189
                          RIL +  S    +QY+  MNA+F+A  + VP+D   L GA  + F
Sbjct: 193 VNGGGGSAARTVLHSRILTVSVSDSSADQYIPTMNAVFAASSAGVPLDVLALRGA--APF 250

Query: 190 LQQASYITGGVHHKPQQLDGLFQYLLTIFGTD--LHSRNFLQLPKPVGVDFRASCFCHKN 247
           LQQ ++ITGG +    +  G+  YL+T F +   + S+  L  P    VDFRA+CFCH+ 
Sbjct: 251 LQQGAFITGGTYIAATEPRGILAYLMTGFASTSGVGSQGLLLSPGSESVDFRAACFCHRR 310

Query: 248 TIDMGYICSVCLSIYCK--HLKKCSTCGS 274
            +D GY+CSVCLSI+C+     +C TCGS
Sbjct: 311 AVDTGYVCSVCLSIFCEVPSGAECLTCGS 339


>gi|189241296|ref|XP_975136.2| PREDICTED: similar to transcription factor TFIIH-subunit, putative
           [Tribolium castaneum]
 gi|270013170|gb|EFA09618.1| hypothetical protein TcasGA2_TC011739 [Tribolium castaneum]
          Length = 232

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 89/210 (42%), Positives = 123/210 (58%), Gaps = 24/210 (11%)

Query: 92  TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP------ 145
           T+ QNL + +  ++     E       SLL+G+++MALCYI R+ R+     +P      
Sbjct: 37  TIKQNLAKLLATEKSTLVTE-------SLLAGAIAMALCYIARIQRT-----KPPGCVLN 84

Query: 146 -RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 204
            RIL + GS D   QY+  MN  F+AQ+  + ID C L  Q+ + LQQ   ITGG++ K 
Sbjct: 85  SRILVVTGSGDSASQYMNYMNVFFTAQKQGIVIDVCALD-QHLSLLQQGCDITGGLYLKV 143

Query: 205 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 264
            QL GL QYLL +F  +   R  L LP PV VD+RA+CFCH+  ID+GY+CSVCLSI+CK
Sbjct: 144 PQLQGLLQYLLWVFLPEPPIRQKLVLPPPVKVDYRAACFCHRELIDIGYVCSVCLSIFCK 203

Query: 265 HLKKCSTCGSVFGQAQTQSDEPSATNRKRK 294
               C+TC +VF      + +P    +KRK
Sbjct: 204 FSPICTTCHTVFKIPGPLAVKP----KKRK 229


>gi|146416891|ref|XP_001484415.1| hypothetical protein PGUG_03796 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 378

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 98/336 (29%), Positives = 155/336 (46%), Gaps = 48/336 (14%)

Query: 7   KLYSDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGY 64
           + ++DD SL+ V+LD NP  W +  +  +  +    ++ F+NA L+LN  N+V  +    
Sbjct: 38  ETFTDDPSLLTVILDLNPLGWYNIRNRTTVKEVTKSLIVFMNAHLSLNNSNRVAFLTASP 97

Query: 65  NSCDYVY-----DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACS 119
           +   ++Y     D   T N  V  G            L E  N+ + + + E       S
Sbjct: 98  SGARFLYPNTMSDQKDTANTLVNRGMYRQFRIVDETVLLELNNEFQAVAQSELTDY--KS 155

Query: 120 LLSGSLSMALCYIQRVFR---------------------------------SGLLHPQPR 146
            +SG+LS+AL Y  R+                                   S L+  + R
Sbjct: 156 TVSGALSLALTYTHRMLTLDESISTTTASAISTSTNVASNSSAAGAGSTGASNLISMKSR 215

Query: 147 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 206
           IL +    +   +Y+ IMN+IF+AQ+  V ID   LG+++ ++LQQA+  T GV+   Q 
Sbjct: 216 ILVVSPDDNDDIRYIPIMNSIFAAQKMKVSIDVVKLGSKDVSYLQQAADATNGVYLHVQN 275

Query: 207 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY--CK 264
            +GL Q L T +  +   R  + LP    V++RASCF    +ID+G++CSVCL I     
Sbjct: 276 PEGLIQVLSTAYFIEPSLRPLVVLPTNSNVNYRASCFVTGKSIDLGFVCSVCLCIMSIIP 335

Query: 265 HLKKCSTCGSVFGQ---AQTQSDEPSATNRKRKTTD 297
            L KC TC S F     +Q +   P+   RK++  D
Sbjct: 336 DLGKCPTCQSEFDPNILSQLRR-SPAVLPRKKRKVD 370


>gi|367012173|ref|XP_003680587.1| hypothetical protein TDEL_0C04870 [Torulaspora delbrueckii]
 gi|359748246|emb|CCE91376.1| hypothetical protein TDEL_0C04870 [Torulaspora delbrueckii]
          Length = 325

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 97/303 (32%), Positives = 140/303 (46%), Gaps = 27/303 (8%)

Query: 3   SAPSKLYSDDV-SLVVVLLDTNPFFWS------SSSLSFSQFLTHVLAFLNAILTLNQLN 55
           + P   Y+++  SL+ +++DT P  W+      S + +       ++ FLNA L  N  N
Sbjct: 12  TKPKSQYAEETPSLLTLVIDTAPKLWAELDDEKSQNANIISVFESIIVFLNAHLAFNSSN 71

Query: 56  QVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQN--LEEFMNKDEQLGK---- 109
           QV VIA       Y+Y  S+ GN    +        +++ N    +F N DE L +    
Sbjct: 72  QVAVIAAHSQGIKYLYPKSNVGNNKATSSSTSGKDLSIINNDMYRQFRNVDETLVEELYK 131

Query: 110 --QEPEGRI----ACSLLSGSLSMALCYIQRVFRS-GLLHPQPRILCLQ----GSPDGPE 158
             QE + +I      S L+G +S  L YI R+ R    +  + R+L +     G  D   
Sbjct: 132 LFQEEKNQIDKVTQKSTLAGGISAGLTYINRISRELATIALKSRLLVITCGSGGGRDEIF 191

Query: 159 QYVAIMNAIFSAQRSMVPIDSCYLGA-QNSAFLQQASYITGGVHHKPQQLDGLFQYLLTI 217
           QY+ IMN IFSA +   PID   +G  + S FLQQ +  T GV+   +   GL QYL T 
Sbjct: 192 QYIPIMNCIFSATKLKCPIDVVKIGGHRESTFLQQTTDATNGVYLHLESTQGLIQYLSTA 251

Query: 218 FGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--KHLKKCSTCGSV 275
              D   R  +  P    VDFR SC+     + +GYICSVCL +        KC  C S 
Sbjct: 252 MFIDPSLRPIIVKPNQGSVDFRTSCYLTGKVVAIGYICSVCLCVLSILPPGNKCPACDSE 311

Query: 276 FGQ 278
           F +
Sbjct: 312 FDE 314


>gi|448098480|ref|XP_004198937.1| Piso0_002332 [Millerozyma farinosa CBS 7064]
 gi|359380359|emb|CCE82600.1| Piso0_002332 [Millerozyma farinosa CBS 7064]
          Length = 376

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 103/353 (29%), Positives = 159/353 (45%), Gaps = 65/353 (18%)

Query: 5   PSKLYSDDVSLVVVLLDTNPFFW--SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIAT 62
           P++   DD SL+ V+LD  P  W      ++  +    +L FLNA L+LN  NQV  I  
Sbjct: 14  PTQPTVDDPSLLTVILDLTPHGWFKIKHHITVQEVTKALLVFLNAHLSLNNSNQVAFIVC 73

Query: 63  GYNSCDYVY-----------DSSSTGNQSVGNGRMPSLCATLLQN---------LEEFMN 102
             +   ++Y           D+ S+  ++      P+L  T +           LEE   
Sbjct: 74  SPDGAKFLYPNPGKNFDGIQDNGSSKGETGNKSSNPNLVNTEMYRQFRIVDEAVLEELNE 133

Query: 103 KDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR--------------------SGLLH 142
             + L K     +   S LSG+LS+AL Y  R+                      SG  +
Sbjct: 134 VLKSLSKSNSVNKTR-STLSGALSLALTYTNRMLNLDQSISTTTASAFNATTKSTSGTSN 192

Query: 143 PQ-----------------PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ 185
                               RIL +  + D   +Y+ IMN++F+AQ+  +PID   LG+ 
Sbjct: 193 VTSSGSSASAVSTNYTSMLSRILVISANDDDDVKYIPIMNSMFAAQKMRLPIDVAKLGSI 252

Query: 186 NSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCH 245
           +S++LQQAS  T G++ + ++ +GL Q L T F  +   R  + LP    V +RASCF  
Sbjct: 253 SSSYLQQASDATNGIYLQVEKPEGLIQTLCTAFFIESSIRPLIILPSNSNVSYRASCFIT 312

Query: 246 KNTIDMGYICSVCLSIY--CKHLKKCSTCGSVFGQA---QTQSDEPSATNRKR 293
             ++D+GY+CSVCL I        KC TC S F ++   Q Q+    +T R++
Sbjct: 313 GKSVDLGYVCSVCLCIMSIIPTDAKCPTCNSHFDKSILNQLQNSVVLSTKRRK 365


>gi|254580699|ref|XP_002496335.1| ZYRO0C16016p [Zygosaccharomyces rouxii]
 gi|238939226|emb|CAR27402.1| ZYRO0C16016p [Zygosaccharomyces rouxii]
          Length = 342

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 96/297 (32%), Positives = 138/297 (46%), Gaps = 26/297 (8%)

Query: 6   SKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQ------FLTHVLAFLNAILTLNQLNQVVV 59
           S L  +  SL+ V++DT P  W+      +Q       L  +L FLNA L  N  NQV V
Sbjct: 37  SHLAEETPSLLTVIVDTAPRLWAELDNERNQTGKIIDMLKSLLVFLNAHLAFNSSNQVAV 96

Query: 60  IATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQL----------GK 109
           IA       Y+Y  SS+   +   G+  S+    +    +F N DE L           K
Sbjct: 97  IAAHSQGIKYLYPKSSSNQSNSLKGKDLSIINNDM--YRQFRNVDETLVEELYGLFQQEK 154

Query: 110 QEPEGRIACSLLSGSLSMALCYIQRVFRS-GLLHPQPRILCLQ----GSPDGPEQYVAIM 164
           ++ +     S L+G++S  L Y+ RV +    +  + R+L +     G  D   QY+ IM
Sbjct: 155 EQADEVTQKSTLAGAMSAGLTYVNRVSKELETISFKSRLLVVTCGSGGGRDEIFQYIPIM 214

Query: 165 NAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           N IFSA +   PID   +G ++ S FLQQ +  T GV+   +  +GL QYL T    D  
Sbjct: 215 NCIFSATKLKCPIDVVKIGGSKESTFLQQTTDATQGVYLHAESTEGLIQYLATAMFIDPS 274

Query: 224 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL--KKCSTCGSVFGQ 278
            R  +  P    VDFR SC+     + +GYICSVCL +        +C  C S F +
Sbjct: 275 LRQIIVKPNQGSVDFRTSCYLTGKVVAIGYICSVCLCVLSVSPPGNRCPACDSEFDE 331


>gi|190347434|gb|EDK39698.2| hypothetical protein PGUG_03796 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 378

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 97/336 (28%), Positives = 154/336 (45%), Gaps = 48/336 (14%)

Query: 7   KLYSDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGY 64
           + ++DD SL+ V+LD NP  W +  +  +  +    ++ F+NA L+LN  N+V  +    
Sbjct: 38  ETFTDDPSLLTVILDLNPLGWYNIRNRTTVKEVTKSLIVFMNAHLSLNNSNRVAFLTASP 97

Query: 65  NSCDYVY-----DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACS 119
           +   ++Y     D   T N  V  G            L E  N+ + + + E       S
Sbjct: 98  SGARFLYPNTMSDQKDTANTLVNRGMYRQFRIVDETVLSELNNEFQAVAQSELTDY--KS 155

Query: 120 LLSGSLSMALCYIQRVFR---------------------------------SGLLHPQPR 146
            +SG+LS+AL Y  R+                                   S L+  + R
Sbjct: 156 TVSGALSLALTYTHRMLTLDESISTTTASAISTSTNVASNSSAAGAGSTGASNLISMKSR 215

Query: 147 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 206
           IL +    +   +Y+ IMN+IF+AQ+  V ID   LG+++ ++LQQA+  T GV+   Q 
Sbjct: 216 ILVVSPDDNDDIRYIPIMNSIFAAQKMKVSIDVVKLGSKDVSYLQQAADATNGVYLHVQN 275

Query: 207 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY--CK 264
            +GL Q L T +  +   R  + LP    V++RASCF    +ID+G++CSVCL I     
Sbjct: 276 PEGLIQVLSTAYFIEPSLRPLVVLPTNSNVNYRASCFVTGKSIDLGFVCSVCLCIMSIIP 335

Query: 265 HLKKCSTCGSVFGQ---AQTQSDEPSATNRKRKTTD 297
              KC TC S F     +Q +   P+   RK++  D
Sbjct: 336 DSGKCPTCQSEFDPNILSQLRR-SPAVLPRKKRKVD 370


>gi|50286117|ref|XP_445487.1| hypothetical protein [Candida glabrata CBS 138]
 gi|62901307|sp|Q6FWA7.1|TFB4_CANGA RecName: Full=RNA polymerase II transcription factor B subunit 4;
           AltName: Full=RNA polymerase II transcription factor B
           34 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p34 subunit
 gi|49524792|emb|CAG58398.1| unnamed protein product [Candida glabrata]
          Length = 335

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 100/295 (33%), Positives = 141/295 (47%), Gaps = 28/295 (9%)

Query: 10  SDDV-SLVVVLLDTNPFFW------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIAT 62
           ++D+ SL+ V+LD +P  W      S    S +  L  ++ FLN+ L  N  NQV VIA 
Sbjct: 19  TEDIPSLLTVVLDISPRLWAEFDHRSGEKQSVTTVLKSLIVFLNSHLAFNSANQVAVIAA 78

Query: 63  GYNSCDYVY-DSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGK------QEPEGR 115
                 Y+Y  SS T  Q+ GN +  S+ ++ +     F N DE L +      Q  E  
Sbjct: 79  FSQGIQYLYPRSSDTSEQNAGNSKDLSIISSHM--YRRFRNVDETLIEEFYKLYQREESL 136

Query: 116 I----ACSLLSGSLSMALCYIQRV---FRSGLLHPQPRILCLQGSPDGPE--QYVAIMNA 166
           I      S LSG+++ AL Y  R+   F S  L  +  ++    S +  E  QY+ IMN 
Sbjct: 137 IDKPVQKSTLSGAMAAALTYTNRLTKEFESISLRSRLLVITCGSSREKDEIFQYIPIMNC 196

Query: 167 IFSAQRSMVPIDSCYLGA-QNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSR 225
           IFSA +   PID   +G  + S FLQQ +  T GV+   +  +G+ QYL T    D   R
Sbjct: 197 IFSATKLKCPIDVIKIGGNKQSTFLQQTTDATNGVYIHLESTNGIIQYLSTAMSIDPSLR 256

Query: 226 NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQ 278
             +  P    VDFR SC+     + +GYICSVCL +        KC  C S F +
Sbjct: 257 QIIVRPTQGSVDFRTSCYLTGKVVAIGYICSVCLCVLSIIPPGNKCPACDSQFDE 311


>gi|448102380|ref|XP_004199788.1| Piso0_002332 [Millerozyma farinosa CBS 7064]
 gi|359381210|emb|CCE81669.1| Piso0_002332 [Millerozyma farinosa CBS 7064]
          Length = 376

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 159/358 (44%), Gaps = 65/358 (18%)

Query: 5   PSKLYSDDVSLVVVLLDTNPFFW--SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIAT 62
           P++   DD SL+ V+LD  P  W      ++  +    +L FLNA L+LN  NQV  I  
Sbjct: 14  PAQPTVDDPSLLTVILDLTPHGWFKIKDHITVQEVTKALLVFLNAHLSLNNTNQVAFIVC 73

Query: 63  GYNSCDYVYDSSST---GNQSVGNGR--------MPSLCATLLQN---------LEEFMN 102
             +   ++Y +      G Q  GN +         P+L  T +           LEE   
Sbjct: 74  SPDGAKFLYPNPGKNFDGIQDNGNSKGEAGTKTPSPNLVNTEMYRQFRIVDEAVLEELNE 133

Query: 103 KDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR--------------------SGLLH 142
             + L       +   S LSG+LS+AL Y  R+                      SG  +
Sbjct: 134 VIKSLSNSNSVNKTR-STLSGALSLALTYTNRMLNLDQSISTTTASAFNATTKSTSGTTN 192

Query: 143 PQ-----------------PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ 185
                               RIL +  + D   +Y+ IMN++F+AQ+  +PID   LG+ 
Sbjct: 193 ATGSGSSASAVSTNYTSMLSRILVISANDDDDVKYIPIMNSMFAAQKMRLPIDVAKLGSV 252

Query: 186 NSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCH 245
           +S++LQQAS  T G++ + +  +GL Q L T F  +   R  + LP    V +RASCF  
Sbjct: 253 SSSYLQQASDATNGIYLQVENPEGLIQTLCTAFFIESSIRPLIILPSNSNVSYRASCFIT 312

Query: 246 KNTIDMGYICSVCLSIY--CKHLKKCSTCGSVFGQA---QTQSDEPSATNRKRKTTDA 298
             ++D+GY+CSVCL I        KC TC S F ++   Q Q+    +T R++  T+ 
Sbjct: 313 GKSVDLGYVCSVCLCIMSIIPTDAKCPTCNSHFDKSILNQLQNSVVLSTKRRKIDTNG 370


>gi|405968700|gb|EKC33746.1| Poly [ADP-ribose] polymerase 4 [Crassostrea gigas]
          Length = 3371

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 91/283 (32%), Positives = 141/283 (49%), Gaps = 55/283 (19%)

Query: 14   SLVVVLLDTNPFFW---------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGY 64
            S ++++LDTNP +W         +   ++ ++ L  V+ F ++ L +N  N++ +IA   
Sbjct: 2844 SQLIIILDTNPVWWGHKPSSHTKAQQKMTLTECLNSVMVFAHSHLMMNHSNKLAIIAAHS 2903

Query: 65   NSCDYVYDS--------SSTGNQSVGNGRMPSLCATLLQNLEEFMNK--DEQL-GKQEPE 113
            +   ++Y          S     + GN     L   + + ++E +    D  + G+  P+
Sbjct: 2904 DQSVFLYPKNDPTVHVISEADEVTEGNDGKYELFNQVDKQIKEGIKNLIDNCMNGQLYPD 2963

Query: 114  GRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRS 173
                 SL++G+L+MALC  QR       H +  IL                     + R 
Sbjct: 2964 -----SLVAGALAMALC--QRS------HEEAGIL---------------------SSRI 2989

Query: 174  MVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKP 233
            +V ID+C L  + S  LQQA  ITGG++ K  Q+ GL QYL+ +F  D   R  L LP  
Sbjct: 2990 LVVIDACVLDNE-SGLLQQACDITGGIYLKIPQMKGLLQYLMWVFLPDPLERPKLTLPPK 3048

Query: 234  VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            V VD+RA+CFCH+N ID+GY+CSVCLSI+C     C TC + F
Sbjct: 3049 VQVDYRAACFCHRNLIDIGYVCSVCLSIFCAFSPICGTCQTTF 3091


>gi|281211236|gb|EFA85402.1| general transcription factor IIH [Polysphondylium pallidum PN500]
          Length = 276

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 80/241 (33%), Positives = 128/241 (53%), Gaps = 37/241 (15%)

Query: 11  DDVSLVVVLLDTNPFFWSSSS---------------LSFSQFLTHVLAFLNAILTLNQLN 55
           DD SL+V++ D N + W   +               ++   F+ ++L F+NA L LNQ N
Sbjct: 12  DDNSLLVIVADFNIYSWGQRAQTVQSEGRDGRRFPLITLPTFIENLLVFINAYLMLNQEN 71

Query: 56  QVVVIATGYNSCDYVY----DSSSTGNQSVGN------GRMPSLCATLLQNLEEFMNKDE 105
           Q+ +I++      +VY      + T ++ +         R+ ++C   L+NL++      
Sbjct: 72  QIALISSVIGESYFVYPPQPQQALTDHRCISEQIQSSLERINTIC---LENLDK------ 122

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMN 165
               +E + +   S  S ++S+ALCYI R+ +      + RIL    SPD   QY+ +MN
Sbjct: 123 --RSEEQQQQDCTSSFSAAMSLALCYINRI-KKEFPSLRARILVFNLSPDVSTQYIPVMN 179

Query: 166 AIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSR 225
            IFSAQ+  +P+DSC L   +S FLQQAS++T G++ KPQ+ + L QYLL+ F  D  SR
Sbjct: 180 CIFSAQKQSIPVDSCVLSTTDSTFLQQASHLTNGIYLKPQRQENLSQYLLSTFLVDSFSR 239

Query: 226 N 226
           N
Sbjct: 240 N 240


>gi|448533948|ref|XP_003870733.1| TFIIH complex subunit [Candida orthopsilosis Co 90-125]
 gi|380355088|emb|CCG24605.1| TFIIH complex subunit [Candida orthopsilosis]
          Length = 408

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 98/342 (28%), Positives = 158/342 (46%), Gaps = 56/342 (16%)

Query: 10  SDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           +D+ SL+ ++LD +P  W S   S S       +L FLN  L++N  NQV  I +   + 
Sbjct: 64  NDEPSLLTIILDISPRGWYSIKDSTSLQDVAKSLLVFLNGHLSINNSNQVSFIVSSPQTS 123

Query: 68  DYVY-----DSSSTGNQSVGNG---RMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACS 119
            ++Y       +ST +  +  G   +  ++   +L+ L  F+ +  ++       R   S
Sbjct: 124 KFLYPHPSNQKTSTTDSKISPGMYRQFKNVNEVVLEELNRFIKETSEISVVSTARR--NS 181

Query: 120 LLSGSLSMALCYIQR------------------------------------VFRSGLLHP 143
            ++G++SMAL Y  R                                    +  S     
Sbjct: 182 SITGAISMALTYTYRMSILDQSIQTTTASAINTSSINAKQSDASSSSTTAGIVPSTSTSI 241

Query: 144 QPRILCLQGSPDGPE--QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 201
           + RIL +  + DG +   Y+ +MN IF+AQ+  V ID   LG +NS++LQQAS  T GV+
Sbjct: 242 KSRILIVSAN-DGDDSINYIPLMNCIFTAQKLKVSIDVAKLGFKNSSYLQQASDATNGVY 300

Query: 202 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 261
              +   G+ Q L T F  + + R ++ LP    V+++ASCF  K  +D+G++CSVCL I
Sbjct: 301 LHIEDPQGIIQVLTTAFFIEPNLRPYVILPTNTQVNYKASCFITKRPVDVGFVCSVCLCI 360

Query: 262 YCK--HLKKCSTCGSVFGQA---QTQSDEPSATNRKRKTTDA 298
             K    +KC  C S F +    +   D    T +KRK  D+
Sbjct: 361 MSKIPEDEKCPACSSEFDKRSIYELTKDPQVVTRKKRKLDDS 402


>gi|402084346|gb|EJT79364.1| RNA polymerase II transcription factor B subunit 4 [Gaeumannomyces
           graminis var. tritici R3-111a-1]
          Length = 382

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 98/325 (30%), Positives = 155/325 (47%), Gaps = 58/325 (17%)

Query: 9   YSDDV-SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
           + D++ SL+ V++DTNP  W+     L  S+ + ++L ++NA L L+ LNQV +IA   +
Sbjct: 17  HVDEIPSLLAVVVDTNPSAWALLRDVLPISKAIANILIYVNAHLALSNLNQVAIIAAHAH 76

Query: 66  SCDYVYDSSSTG---NQSVGNGRMPSLCAT---------LLQNLEEFMNKD-------EQ 106
              ++Y S         + G+  M    AT           + L +F + +       + 
Sbjct: 77  RARWLYPSPPNPRPRKDAAGDVEMADAAATSQNSRTRAAAAKKLPQFAHIESAVLDSLQA 136

Query: 107 LGKQEPEGRIA---CSLLSGSLSMALCYIQRVFRSGL---LHPQPRILCLQGSPDGPE-- 158
           L ++     +A    +L+SG+L++AL +I +V  + L      +       GS   P   
Sbjct: 137 LIRETSPAEVASTTTTLVSGALTLALAHINKVREAALGAGFAERSAAAASPGSTAIPAGA 196

Query: 159 ------------------------QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQAS 194
                                   QY+  MNA+F+A  + VP+D   L   N++FLQQ S
Sbjct: 197 GNVGAAAAVRARILVVSVSDSSASQYIPTMNAVFAASHASVPVDVVALRG-NASFLQQGS 255

Query: 195 YITGGVHHKPQQLDGLFQYLLTIFGTDLHS-RNFLQLPKPVGVDFRASCFCHKNTIDMGY 253
           YITGG     ++  GL  YL+T F     +  + L  P    VDFRA+CFCH+  +D G+
Sbjct: 256 YITGGNFIHAKEPRGLLTYLMTGFPVGGGAVSDMLIGPGTESVDFRAACFCHRKALDTGF 315

Query: 254 ICSVCLSIYCKHL--KKCSTCGSVF 276
           +CSVCLSI+C+     +C TCGS  
Sbjct: 316 VCSVCLSIFCEAPPENECLTCGSAL 340


>gi|255715715|ref|XP_002554139.1| KLTH0E15180p [Lachancea thermotolerans]
 gi|238935521|emb|CAR23702.1| KLTH0E15180p [Lachancea thermotolerans CBS 6340]
          Length = 337

 Score =  127 bits (318), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 98/311 (31%), Positives = 139/311 (44%), Gaps = 31/311 (9%)

Query: 14  SLVVVLLDTNPFFW------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           SL+ V++DT P  W      S    +    L  ++ FLN+ LT N  NQV VIA   +  
Sbjct: 24  SLLTVVVDTTPKAWAELDRESEHEGNLISALQSLIVFLNSHLTFNSANQVAVIAAHSSGI 83

Query: 68  DYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQL-----------GKQEPEGRI 116
            Y+Y  +ST +Q     R   L         +F N DE +           GK+E     
Sbjct: 84  KYLYPVTST-SQDETAARSADLSIINSDMYRQFRNVDETVLEELYKLLQEEGKREQPSGF 142

Query: 117 ACSLLSGSLSMALCYIQRVFRS-GLLHPQPRILCL----QGSPDGPEQYVAIMNAIFSAQ 171
             S LSG++S    YI R+ +       + R++ +     G  D   QY+ IMN IFSA 
Sbjct: 143 QKSTLSGAMSAGFTYINRIIKEQSSASLKARLMVVTCGSSGGKDEVFQYIPIMNCIFSAT 202

Query: 172 RSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQL 230
           +   P+D   +G +Q S FLQQA+  T GV+      +GL QYL T    D   R  +  
Sbjct: 203 KLKCPVDVVKIGGSQESTFLQQATDATNGVYLHVPSTEGLIQYLTTAMFIDPSLRPIIVK 262

Query: 231 PKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQAQTQSDE--- 285
           P    VDFR SC+     + +G++CSVCL +        +C  C S F +  T   +   
Sbjct: 263 PNQGSVDFRTSCYLTGKVVAVGFVCSVCLCVLSVIPPGNRCPACDSEFDEHVTTKLKKKP 322

Query: 286 --PSATNRKRK 294
             P    +KRK
Sbjct: 323 VVPGMIQKKRK 333


>gi|302830812|ref|XP_002946972.1| hypothetical protein VOLCADRAFT_79384 [Volvox carteri f.
           nagariensis]
 gi|300268016|gb|EFJ52198.1| hypothetical protein VOLCADRAFT_79384 [Volvox carteri f.
           nagariensis]
          Length = 141

 Score =  127 bits (318), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 61/118 (51%), Positives = 84/118 (71%)

Query: 159 QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIF 218
           QY+ +MNAIFSAQR+ V +D+  L  ++S+FLQQA+++TGG++ KP     L   LL  F
Sbjct: 2   QYIPVMNAIFSAQRAEVLLDAVVLATEDSSFLQQAAHLTGGLYFKPAGAGALLGLLLNYF 61

Query: 219 GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
             D  +R  L + + +GVDFRASCFCHK  I+ GY+CSVCLSI+C+  + CSTCG+ F
Sbjct: 62  VCDTSTRKQLDVAQELGVDFRASCFCHKYVIETGYVCSVCLSIFCQPSRACSTCGTAF 119


>gi|365982181|ref|XP_003667924.1| hypothetical protein NDAI_0A05260 [Naumovozyma dairenensis CBS 421]
 gi|343766690|emb|CCD22681.1| hypothetical protein NDAI_0A05260 [Naumovozyma dairenensis CBS 421]
          Length = 339

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 95/298 (31%), Positives = 135/298 (45%), Gaps = 25/298 (8%)

Query: 6   SKLYSDDVSLVVVLLDTNPFFWSS------SSLSFSQFLTHVLAFLNAILTLNQLNQVVV 59
           S L  +  SL+ V++D  P  W+        + S    L  ++ FLNA L  N  NQV V
Sbjct: 16  SSLSEETPSLLTVIIDITPKLWTELDQETEENGSLISVLKSLIVFLNAHLAFNSSNQVAV 75

Query: 60  IATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQ------EPE 113
           IA+      Y+Y  ++ G+    + +   L          F N DE L ++      +  
Sbjct: 76  IASHSQGIKYLYPQNTKGSDEESSSKTKDLSIINKDMYRRFRNVDETLVEELYTLFEKER 135

Query: 114 GRI----ACSLLSGSLSMALCYIQRVFRS-GLLHPQPRILCLQ-----GSPDGPEQYVAI 163
            +I      S LSG++S  L YI R+ R    +  + R++ L       S D   QY+ I
Sbjct: 136 DQIDKFKQKSTLSGAMSAGLTYINRLSRELEAISLKSRLMVLTCGGKGESKDEIFQYIPI 195

Query: 164 MNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 222
           MN IFSA +   PID   +G ++ S FLQQ +  T GV+   +   GL QYL T    D 
Sbjct: 196 MNCIFSANKIKCPIDVVKIGGSERSTFLQQTTDATNGVYLHVESSKGLIQYLSTAMFIDP 255

Query: 223 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQ 278
             R  +  P    VDFR SC+     + +GYICSVCL +        KC  C S F +
Sbjct: 256 SLRQIIVKPNHGSVDFRTSCYLTGKVVAIGYICSVCLCVLSIIPPGNKCPACDSEFDE 313


>gi|428673522|ref|NP_001258795.1| general transcription factor IIH subunit 3 isoform b [Homo sapiens]
          Length = 265

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 88/284 (30%), Positives = 137/284 (48%), Gaps = 59/284 (20%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++D NP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVY--DSSSTGNQSVGNGRMPSLCAT--------LLQNLEEFMNKD--EQLGKQEP 112
               ++Y   +   G+     G  P    +        LL +  E + ++  + + K + 
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSANEVIVEEIKDLMTKSDI 124

Query: 113 EGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 172
           +G+   +LL+GSL+ ALCYI R                             MN      +
Sbjct: 125 KGQHTETLLAGSLAKALCYIHR-----------------------------MNKEVKDNQ 155

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
            M            S  L  A  ITGG++ K  Q+  L QYLL +F  D   R+ L LP 
Sbjct: 156 EM-----------KSRIL--ACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLILPP 202

Query: 233 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 203 PVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 246


>gi|441630267|ref|XP_004089522.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
           [Nomascus leucogenys]
          Length = 265

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 88/284 (30%), Positives = 137/284 (48%), Gaps = 59/284 (20%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++D NP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVY--DSSSTGNQSVGNGRMPSLCAT--------LLQNLEEFMNKD--EQLGKQEP 112
               ++Y   +   G+     G  P    +        LL +  E + ++  + + K + 
Sbjct: 65  QESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSANEVIVEEIKDLMTKGDI 124

Query: 113 EGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 172
           +G+   +LL+GSL+ ALCYI R                             MN      +
Sbjct: 125 KGQHTETLLAGSLAKALCYIHR-----------------------------MNKEIKDNQ 155

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
            M            S  L  A  ITGG++ K  Q+  L QYLL +F  D   R+ L LP 
Sbjct: 156 EM-----------KSRIL--ACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLILPP 202

Query: 233 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 203 PVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 246


>gi|171689210|ref|XP_001909545.1| hypothetical protein [Podospora anserina S mat+]
 gi|170944567|emb|CAP70678.1| unnamed protein product [Podospora anserina S mat+]
          Length = 396

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 96/341 (28%), Positives = 151/341 (44%), Gaps = 74/341 (21%)

Query: 6   SKLYSDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           SKL  D  SL  +++DTNP  W++    L  S+ + ++  F+NA L L+  NQ+ ++A  
Sbjct: 16  SKL-EDVPSLTTIIIDTNPRAWAALGDVLPLSKAIANIQIFINAHLALSNTNQIAILAAH 74

Query: 64  YNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEF-----------MNKDEQLGKQ-E 111
            N   ++Y +           R      T L    +F            +  E +G    
Sbjct: 75  TNRAVWLYPTPPKPPSEDVEMRDAGKTDTFLNTANKFPQYAQIEHALVTSLRELIGSTIP 134

Query: 112 PEGRIACSLLSGSLSMALCYIQRV-----------------------FRSGLLHPQPRIL 148
           P+     + +SG+L++AL ++ +                          +GL+    RIL
Sbjct: 135 PDLNETTTQMSGALTLALAHMNKTALAYSASQALSNSTAGTTAPGTTASTGLVGFHGRIL 194

Query: 149 CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 208
            +  S     QY+  MNA+F+A  S + ID+  L   ++ FL+QAS+IT G   +     
Sbjct: 195 VISVSDSAASQYIPTMNAVFAASMSRIAIDTLALRG-SATFLEQASFITQGTFIQAADPQ 253

Query: 209 GLFQYLLTIFGTDLHS-----------------------RNFLQLPKPVG---------- 235
           G+ QYL+  FG    S                       R   +L KPVG          
Sbjct: 254 GILQYLMFGFGVGSASSGLSAAQNDGSGPLMGKSKTGKQREGDELRKPVGECLFTPAADS 313

Query: 236 VDFRASCFCHKNTIDMGYICSVCLSIYCKHL--KKCSTCGS 274
           VDFRA+CFCH+N +D G++CS+CLSI+C+     +C TCG+
Sbjct: 314 VDFRAACFCHRNVVDTGFVCSICLSIFCEPPPGDECLTCGN 354


>gi|444315902|ref|XP_004178608.1| hypothetical protein TBLA_0B02470 [Tetrapisispora blattae CBS 6284]
 gi|387511648|emb|CCH59089.1| hypothetical protein TBLA_0B02470 [Tetrapisispora blattae CBS 6284]
          Length = 340

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 100/311 (32%), Positives = 139/311 (44%), Gaps = 40/311 (12%)

Query: 3   SAPSKLYSDDVSLVVVLLDTNPFFWS-----------SSSLSFSQFLTHVLAFLNAILTL 51
           +A  ++  +  SL+ V++DT P  W+           +      + +  ++ FLN+ L  
Sbjct: 12  TARPQVVEETPSLLTVVIDTAPKLWTQFDKEDPDDNTNKEPQLLRVIRAIVVFLNSHLAF 71

Query: 52  NQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQ- 110
           N  NQV VIA       Y+Y      N S       +L         +F N DE L  + 
Sbjct: 72  NTNNQVAVIAAHSQGIKYLYPKEDNNNVSESTKEKSNLSIINKDMYRQFRNVDESLVDEL 131

Query: 111 ----EPEGRIA-----CSLLSGSLSMALCYIQRV---------FRSGLLHPQPRILCLQG 152
               + E +        S +SG++S AL Y+ R+         F+S LL     I C  G
Sbjct: 132 FSLFQEESKFIDKKSQKSTISGAMSCALTYVNRITKQFNDASIFKSRLL----VITCGSG 187

Query: 153 -SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA-QNSAFLQQASYITGGVHHKPQQLDGL 210
            S D   QY+ IMN IFSA +   PID   +G  + S FLQQ +  T GV+   +   GL
Sbjct: 188 RSKDEVFQYIPIMNCIFSATKMKCPIDVVKIGGHEKSTFLQQTTDATNGVYLYVENTKGL 247

Query: 211 FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC---LSIYCKHLK 267
            QYL T    D   R+ +  P    VDFR SCF     + +GYICSVC   LSI  K+  
Sbjct: 248 IQYLSTAMFIDSSLRSIIVKPNHGVVDFRTSCFLTGRVVAVGYICSVCLCVLSIVPKN-N 306

Query: 268 KCSTCGSVFGQ 278
           KC  C S F +
Sbjct: 307 KCPACDSKFDE 317


>gi|68489766|ref|XP_711270.1| hypothetical protein CaO19.11671 [Candida albicans SC5314]
 gi|68489813|ref|XP_711246.1| hypothetical protein CaO19.4194 [Candida albicans SC5314]
 gi|46432534|gb|EAK92011.1| hypothetical protein CaO19.4194 [Candida albicans SC5314]
 gi|46432559|gb|EAK92035.1| hypothetical protein CaO19.11671 [Candida albicans SC5314]
          Length = 363

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 95/337 (28%), Positives = 155/337 (45%), Gaps = 54/337 (16%)

Query: 10  SDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           +DD SL+ ++LD +   W +    +S       +L FLN  L+LN  NQV  + +     
Sbjct: 20  NDDPSLLTIILDLSMKGWYNIKEMISIQDITKSLLVFLNGHLSLNNSNQVAFLVSSTMGS 79

Query: 68  DYVY-DSSSTGNQSVG--NGRMPSLCAT-------LLQNLEEFMNKDEQLGKQEPEGRIA 117
            ++Y D +  GN +V   +   P +  +       +LQ L E++ +  +   +  +    
Sbjct: 80  KFLYPDLTMVGNPNVSEHSEHFPDMYRSFKMVDQSVLQQLNEYIEEIIKFETRNEKKGFN 139

Query: 118 CSLLSGSLSMALCYIQRVF------------------------------------RSGLL 141
           C  L+G++SMAL Y  R+                                      S   
Sbjct: 140 C--LTGAISMALTYTNRMLTLDQSITTTTAAAMTASTLESTSNNNNTSGTSGSSSTSSST 197

Query: 142 HPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 201
             + R+L +  + D   +Y+ +MN IF+AQ+  V ID   LG  NS++LQQAS  T GV+
Sbjct: 198 SMKSRVLIVSANDDDDLRYIPLMNCIFAAQKMKVSIDVAKLGHNNSSYLQQASDATKGVY 257

Query: 202 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 261
              +   G+ Q L T F  + + R ++ LP    V++RASCF    ++D+GY+CSVCL I
Sbjct: 258 LHIEDPKGIIQVLSTAFFIEPNLRPYIILPTNSNVNYRASCFITGKSVDLGYVCSVCLCI 317

Query: 262 YCK--HLKKCSTCGSVFGQA--QTQSDEPSATNRKRK 294
             +     KC  C S F +   Q  + EP   ++K++
Sbjct: 318 MSQLPQSGKCPACDSEFDKKIIQDLNREPQVISKKKR 354


>gi|241956274|ref|XP_002420857.1| RNA polymerase II transcription factor B subunit 4, putative; TFIIH
           subunit TFB4, putative [Candida dubliniensis CD36]
 gi|223644200|emb|CAX41010.1| RNA polymerase II transcription factor B subunit 4, putative
           [Candida dubliniensis CD36]
          Length = 363

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 95/339 (28%), Positives = 156/339 (46%), Gaps = 54/339 (15%)

Query: 11  DDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           DD SL+ ++LD +   W +    +S       +L FLN  L+LN  NQV  + +      
Sbjct: 21  DDPSLLTIILDLSMKGWYNIKEMISIQDITKSLLVFLNGHLSLNNSNQVAFLVSSSMGSK 80

Query: 69  YVY-DSSSTGNQSVG--NGRMPSLCAT-------LLQNLEEFMNKDEQLGKQEPEGRIAC 118
           ++Y D ++ GN +V   +   P +  +       +L+ L E++ +  +   +  +    C
Sbjct: 81  FLYPDLTTIGNPNVNEHSEHFPGMYRSFKMADQSVLRQLNEYIKEIVKFETRNEKRGFNC 140

Query: 119 SLLSGSLSMALCYIQRVF------------------------------------RSGLLH 142
             L+G++SMAL Y  R+                                      S    
Sbjct: 141 --LTGAISMALTYTNRMLTLDQSITTTTAAAMTTSTLESTSNNNSTSGTGGSSGTSSSTS 198

Query: 143 PQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHH 202
            + RIL +  + D   +Y+ +MN IF+AQ+  V ID   LG  NS++LQQAS  T GV+ 
Sbjct: 199 MKSRILIVSANDDDDLRYIPLMNCIFAAQKMKVSIDVAKLGHNNSSYLQQASDATKGVYL 258

Query: 203 KPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 262
             +   G+ Q L T F  + + R ++ LP    V++RASCF    ++D+GY+CSVCL I 
Sbjct: 259 HIEDPKGIIQVLSTAFFIEPNLRPYIILPTNSNVNYRASCFITGKSVDLGYVCSVCLCIM 318

Query: 263 CK--HLKKCSTCGSVFGQA--QTQSDEPSATNRKRKTTD 297
            +     KC  C S F +   Q  + EP   ++K++  +
Sbjct: 319 SQLPQSGKCPACDSEFDKKIIQELNREPQVISKKKRKLE 357


>gi|302680643|ref|XP_003030003.1| hypothetical protein SCHCODRAFT_30664 [Schizophyllum commune H4-8]
 gi|300103694|gb|EFI95100.1| hypothetical protein SCHCODRAFT_30664 [Schizophyllum commune H4-8]
          Length = 257

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 87/264 (32%), Positives = 128/264 (48%), Gaps = 34/264 (12%)

Query: 18  VLLDTNPFFW---SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS-CDYVYDS 73
           V+LD N   W   S+  LS +  L  +L FL+A +     N + V   G     +  +DS
Sbjct: 3   VILDLNAHQWTQLSTEELSLANALPQLLVFLHAHIAAAAENTLAVYTAGSEGPTELAFDS 62

Query: 74  SST---------GNQSVGNGRMP-SLCATLLQNLEEFMNKDEQLGK-----QEPEGRIAC 118
            S          GN+     R+  SL   +   ++   + DE + +      +P      
Sbjct: 63  LSNDVELPGALDGNKYPPFRRLDVSLTDAIQAAMDRLADADEDMKRTVGTSSDPHA---- 118

Query: 119 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 178
           +  + +L+ +LCYI R     L  P  RIL L  SPD  + YV  MNAIFSAQ+  VPID
Sbjct: 119 TSFTSALTRSLCYINR-----LQSPSSRILLLSVSPDRAQDYVPFMNAIFSAQKLKVPID 173

Query: 179 SCYLGAQNSAFLQQASYITGGVHHK------PQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
           +  L A +S FLQQA+Y+TGG + +        +  GL QYL+  F      R  + +P 
Sbjct: 174 TLQLAAHDSVFLQQATYLTGGAYVRLGDESGEGKRGGLLQYLMMCFLPPPALRQVMAVPT 233

Query: 233 PVGVDFRASCFCHKNTIDMGYICS 256
              V+ RA+CFCHK   ++G++CS
Sbjct: 234 MDQVNLRAACFCHKRMTEIGFVCS 257


>gi|294658697|ref|XP_461035.2| DEHA2F15510p [Debaryomyces hansenii CBS767]
 gi|218511849|sp|Q6BL86.2|TFB4_DEBHA RecName: Full=RNA polymerase II transcription factor B subunit 4;
           AltName: Full=RNA polymerase II transcription factor B
           34 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p34 subunit
 gi|202953319|emb|CAG89405.2| DEHA2F15510p [Debaryomyces hansenii CBS767]
          Length = 387

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/353 (29%), Positives = 154/353 (43%), Gaps = 78/353 (22%)

Query: 15  LVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY- 71
           L+ V+LD  P  W    + ++  +    +L FLNA L+LN  NQV  IA+      ++Y 
Sbjct: 24  LLTVVLDVTPQSWYKIRNQITIQEVAKSLLVFLNAHLSLNNSNQVAFIASTPQGSKFLYP 83

Query: 72  ---------DSSSTGNQS-----------VGNG---RMPSLCATLLQNLEE-FMNKDEQL 107
                     S   G  S           VG+G   +   +   +L+ L E F +  + +
Sbjct: 84  NPEKNYDEVSSKKNGEGSNLNKADSTSSLVGDGMYRQFRIVDEAVLEKLNEIFADISQNV 143

Query: 108 GKQEPEGRIACSLLSGSLSMALCYIQRVFR------------------------------ 137
            K       + S LSG+LS+AL Y  R+                                
Sbjct: 144 DKSR-----SNSTLSGALSLALTYTNRMLNLDSSISTTTASAINTTTNANSNKTSSSGTT 198

Query: 138 -----------SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQN 186
                      + L   + RIL +  + D   +Y+ IMN  F+AQ+  VPID   LG ++
Sbjct: 199 SNSMSTGGTNTTSLTSMRSRILIVSSNDDNDIKYIPIMNTTFAAQKMKVPIDVAKLGERD 258

Query: 187 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHK 246
           S++LQQAS  T GV+      +GL Q L T F  +   R+ + LP    V++RASCF   
Sbjct: 259 SSYLQQASDATNGVYLHIANPEGLIQTLSTAFFIEPSIRSLIILPTNSNVNYRASCFITG 318

Query: 247 NTIDMGYICSVCLSIYCKHLK--KCSTCGSVFGQ---AQTQSDEPSATNRKRK 294
            ++++G++CSVCL I     K  KC TCGS F +   AQ +        +KRK
Sbjct: 319 KSVNLGFVCSVCLCIMSIIPKEGKCPTCGSKFDEKILAQLRRGVVIPVKKKRK 371


>gi|71004806|ref|XP_757069.1| hypothetical protein UM00922.1 [Ustilago maydis 521]
 gi|46096873|gb|EAK82106.1| hypothetical protein UM00922.1 [Ustilago maydis 521]
          Length = 422

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 152/372 (40%), Gaps = 101/372 (27%)

Query: 15  LVVVLLDTNPFFWS--------------SSSLSFSQFLTHVLAFLNAILTLNQLNQVVV- 59
            +V++LD NP  W+              SS ++    L  ++ FLNA + +   N + + 
Sbjct: 36  FLVLVLDINPLAWTRRSESLDKGKYKQVSSDVALEDALAAIMIFLNAHMAMQHENGLAIY 95

Query: 60  IATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQN-----LEEFMNKDEQL------- 107
            A G  +   +Y ++S   ++ G G + S  +T  Q       + F   D+ L       
Sbjct: 96  AAAGTGTAQLLYSTASFSAKATGTGVVNSAASTSAQKPDANTYQHFKLVDDHLEQGIRAT 155

Query: 108 -------GKQEPEG--------RIACSLLSGS--------LSMALCYIQRV--------- 135
                   K+  E          +  + LS S        LS ALC++ R+         
Sbjct: 156 CKSMFDRAKKAQEAVHASADSEAVKHATLSRSINVGIVSALSQALCHLNRLGLSDATDAA 215

Query: 136 -----------FRSGLLHP----------------QPRILCLQGSPDGPEQYVAIMNAIF 168
                       R+G  +                 + RIL L  + D   QY+ +MN IF
Sbjct: 216 NTGNGIAATAQTRAGGANTAQGAGGSVGAGSIGSFKSRILILSVTQDASTQYIPMMNCIF 275

Query: 169 SAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK---------------PQQLDGLFQY 213
           +AQ+  + ID C L  +++ FLQQASY+T G + +                     L Q 
Sbjct: 276 AAQKKGITIDVCKLFGKDTVFLQQASYLTSGTYFRLADDEDQSKSDTASATDMRSSLVQT 335

Query: 214 LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCG 273
           LLT +      R  + LP    +DFRA+CFCH+  +D+GYICSVCLS++C     C TC 
Sbjct: 336 LLTTYLPSRSMRGVMNLPSLEKIDFRAACFCHRKIVDIGYICSVCLSLFCSPKPSCLTCR 395

Query: 274 SVFGQAQTQSDE 285
           S F +   Q  E
Sbjct: 396 SKFPKETLQRYE 407


>gi|365762534|gb|EHN04068.1| Tfb4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 338

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/303 (30%), Positives = 141/303 (46%), Gaps = 26/303 (8%)

Query: 2   ASAPSKLYSDDVSLVVVLLDTNPFFWSS------SSLSFSQFLTHVLAFLNAILTLNQLN 55
           A +  ++  +  SL+ V+++  P  W++         S  + L  ++ FLNA L  N  N
Sbjct: 12  ARSRKQVTEESPSLLTVIIEIAPKLWTTFDEEGNEKGSIIKVLEALIVFLNAHLAFNSAN 71

Query: 56  QVVVIATGYNSCDYVY-DSSSTGNQSVGNGRMPSLCATLLQNL-EEFMNKDE-------- 105
           +V VIA       Y+Y +S+S    S    +  S    +  ++   F N DE        
Sbjct: 72  KVAVIAAYSQGIKYLYPESTSALKASESENKTRSDLKIINSDMYRRFRNVDETLVEEIYK 131

Query: 106 --QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ----GSPDGPE 158
             +L K++ E     S L+G++S  L Y+ R+ +  +    + R+L L      S D   
Sbjct: 132 LFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCGSGSSKDEIF 191

Query: 159 QYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTI 217
           QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+   +  +GL QYL T 
Sbjct: 192 QYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTEGLIQYLATA 251

Query: 218 FGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSV 275
              D   R  +  P    VDFR SC+     + +G+ICSVCL +        KC  C S 
Sbjct: 252 MFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAIGFICSVCLCVLSIIPPGNKCPACDSQ 311

Query: 276 FGQ 278
           F +
Sbjct: 312 FDE 314


>gi|151942841|gb|EDN61187.1| transcription initiation factor TFIIH subunit [Saccharomyces
           cerevisiae YJM789]
 gi|259150209|emb|CAY87012.1| Tfb4p [Saccharomyces cerevisiae EC1118]
 gi|349581866|dbj|GAA27023.1| K7_Tfb4p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 338

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/303 (30%), Positives = 141/303 (46%), Gaps = 26/303 (8%)

Query: 2   ASAPSKLYSDDVSLVVVLLDTNPFFWSS------SSLSFSQFLTHVLAFLNAILTLNQLN 55
           A +  ++  +  SL+ V+++  P  W++         S  + L  ++ FLNA L  N  N
Sbjct: 12  ARSRKQVTEESPSLLTVIIEIAPKLWTTFDEEGNEKGSIIKVLEALIVFLNAHLAFNSAN 71

Query: 56  QVVVIATGYNSCDYVY-DSSSTGNQSVGNGRMPSLCATLLQNL-EEFMNKDE-------- 105
           +V VIA       Y+Y +S+S    S    +  S    +  ++   F N DE        
Sbjct: 72  KVAVIAAYSQGIKYLYPESTSALKASESENKTRSDLKIINSDMYRRFRNVDETLVEEIYK 131

Query: 106 --QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ----GSPDGPE 158
             +L K++ E     S L+G++S  L Y+ R+ +  +    + R+L L      S D   
Sbjct: 132 LFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCGSGSSKDEIF 191

Query: 159 QYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTI 217
           QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+   +  +GL QYL T 
Sbjct: 192 QYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTEGLIQYLATA 251

Query: 218 FGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSV 275
              D   R  +  P    VDFR SC+     + +G+ICSVCL +        KC  C S 
Sbjct: 252 MFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAIGFICSVCLCVLSIIPPGNKCPACDSQ 311

Query: 276 FGQ 278
           F +
Sbjct: 312 FDE 314


>gi|6325313|ref|NP_015381.1| Tfb4p [Saccharomyces cerevisiae S288c]
 gi|62901135|sp|Q12004.1|TFB4_YEAST RecName: Full=RNA polymerase II transcription factor B subunit 4;
           AltName: Full=General transcription and DNA repair
           factor IIH subunit TFB4; Short=TFIIH subunit TFB4;
           AltName: Full=RNA polymerase II transcription factor B
           34 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p34 subunit
 gi|805037|emb|CAA89174.1| unknown [Saccharomyces cerevisiae]
 gi|1314125|emb|CAA95001.1| unknown [Saccharomyces cerevisiae]
 gi|190407994|gb|EDV11259.1| RNA polymerase II transcription factor B subunit 4 [Saccharomyces
           cerevisiae RM11-1a]
 gi|256268932|gb|EEU04279.1| Tfb4p [Saccharomyces cerevisiae JAY291]
 gi|285815586|tpg|DAA11478.1| TPA: Tfb4p [Saccharomyces cerevisiae S288c]
 gi|323350201|gb|EGA84348.1| Tfb4p [Saccharomyces cerevisiae VL3]
 gi|392296067|gb|EIW07170.1| Tfb4p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 338

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/303 (30%), Positives = 141/303 (46%), Gaps = 26/303 (8%)

Query: 2   ASAPSKLYSDDVSLVVVLLDTNPFFWSS------SSLSFSQFLTHVLAFLNAILTLNQLN 55
           A +  ++  +  SL+ V+++  P  W++         S  + L  ++ FLNA L  N  N
Sbjct: 12  ARSRKQVTEESPSLLTVIIEIAPKLWTTFDEEGNEKGSIIKVLEALIVFLNAHLAFNSAN 71

Query: 56  QVVVIATGYNSCDYVY-DSSSTGNQSVGNGRMPSLCATLLQNL-EEFMNKDE-------- 105
           +V VIA       Y+Y +S+S    S    +  S    +  ++   F N DE        
Sbjct: 72  KVAVIAAYSQGIKYLYPESTSALKASESENKTRSDLKIINSDMYRRFRNVDETLVEEIYK 131

Query: 106 --QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ----GSPDGPE 158
             +L K++ E     S L+G++S  L Y+ R+ +  +    + R+L L      S D   
Sbjct: 132 LFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCGSGSSKDEIF 191

Query: 159 QYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTI 217
           QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+   +  +GL QYL T 
Sbjct: 192 QYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTEGLIQYLATA 251

Query: 218 FGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSV 275
              D   R  +  P    VDFR SC+     + +G+ICSVCL +        KC  C S 
Sbjct: 252 MFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAVGFICSVCLCVLSIIPPGNKCPACDSQ 311

Query: 276 FGQ 278
           F +
Sbjct: 312 FDE 314


>gi|323331310|gb|EGA72728.1| Tfb4p [Saccharomyces cerevisiae AWRI796]
          Length = 409

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 97/319 (30%), Positives = 145/319 (45%), Gaps = 33/319 (10%)

Query: 2   ASAPSKLYSDDVSLVVVLLDTNPFFWSS------SSLSFSQFLTHVLAFLNAILTLNQLN 55
           A +  ++  +  SL+ V+++  P  W++         S  + L  ++ FLNA L  N  N
Sbjct: 83  ARSRKQVTEESPSLLTVIIEIAPKLWTTFDEEGNEKGSIIKVLEALIVFLNAHLAFNSAN 142

Query: 56  QVVVIATGYNSCDYVY-DSSSTGNQSVGNGRMPSLCATLLQNL-EEFMNKDE-------- 105
           +V VIA       Y+Y +S+S    S    +  S    +  ++   F N DE        
Sbjct: 143 KVAVIAAYSQGIKYLYPESTSALKASESENKTRSDLKIINSDMYRRFRNVDETLVEEIYK 202

Query: 106 --QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ----GSPDGPE 158
             +L K++ E     S L+G++S  L Y+ R+ +  +    + R+L L      S D   
Sbjct: 203 LFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCGSGSSKDEIF 262

Query: 159 QYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTI 217
           QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+   +  +GL QYL T 
Sbjct: 263 QYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTEGLIQYLATA 322

Query: 218 FGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSV 275
              D   R  +  P    VDFR SC+     + +G+ICSVCL +        KC  C S 
Sbjct: 323 MFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAVGFICSVCLCVLSIIPPGNKCPACDSQ 382

Query: 276 FGQAQTQSDEPSATNRKRK 294
           F       DE      KRK
Sbjct: 383 F-------DEHVIAKLKRK 394


>gi|323335143|gb|EGA76433.1| Tfb4p [Saccharomyces cerevisiae Vin13]
          Length = 338

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 92/303 (30%), Positives = 140/303 (46%), Gaps = 26/303 (8%)

Query: 2   ASAPSKLYSDDVSLVVVLLDTNPFFWSS------SSLSFSQFLTHVLAFLNAILTLNQLN 55
           A +  ++  +  SL+ V+++  P  W++         S  + L  ++ FLNA L  N  N
Sbjct: 12  ARSRKQVTEESPSLLTVIIEIAPKLWTTFDEEGNEKGSIIKVLEALIVFLNAHLAFNSAN 71

Query: 56  QVVVIATGYNSCDYVY-DSSSTGNQSVGNGRMPSLCATLLQNL-EEFMNKDE-------- 105
           +V VIA       Y+Y +S+S    S    +  S    +  ++   F N DE        
Sbjct: 72  KVAVIAAYSQGIKYLYPESTSALKASESENKTRSDLKIINSDMYRRFRNVDETLVEEIYK 131

Query: 106 --QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ----GSPDGPE 158
             +L K++ E     S L+G++S  L Y+ R+ +  +    + R+L L      S D   
Sbjct: 132 LFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCGSGSSKDEIF 191

Query: 159 QYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTI 217
           QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+   +  +GL QYL T 
Sbjct: 192 QYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTEGLIQYLATA 251

Query: 218 FGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSV 275
              D   R  +  P    VDFR SC+     +  G+ICSVCL +        KC  C S 
Sbjct: 252 MFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAXGFICSVCLCVLSIIPPGNKCPACDSQ 311

Query: 276 FGQ 278
           F +
Sbjct: 312 FDE 314


>gi|45187550|ref|NP_983773.1| ADL323Cp [Ashbya gossypii ATCC 10895]
 gi|62901344|sp|Q75B93.1|TFB4_ASHGO RecName: Full=RNA polymerase II transcription factor B subunit 4;
           AltName: Full=RNA polymerase II transcription factor B
           34 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p34 subunit
 gi|44982288|gb|AAS51597.1| ADL323Cp [Ashbya gossypii ATCC 10895]
 gi|374106985|gb|AEY95893.1| FADL323Cp [Ashbya gossypii FDAG1]
          Length = 341

 Score =  120 bits (302), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 94/299 (31%), Positives = 131/299 (43%), Gaps = 27/299 (9%)

Query: 7   KLYSDDVSLVVVLLDTNPFFWSS------SSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           +L  +  SL+ +++DTNP  W+             Q L   + FLNA L+ N  NQV VI
Sbjct: 17  QLVEETPSLLTLVIDTNPKLWAEFDREVGKKGQLMQVLKSTIVFLNAHLSFNNSNQVSVI 76

Query: 61  ATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDE-----------QLGK 109
           A       Y+Y  +     S    +              F N DE           Q  K
Sbjct: 77  AACSRGIKYLYPQADDKEGSTKKKKSEDRSIINRNMYRGFRNVDEAVVEELYRVFQQESK 136

Query: 110 QEPEG--RIACSLLSGSLSMALCYIQRV-FRSGLLHPQPRILCL----QGSPDGPEQYVA 162
           Q  +G  +   S LSG++S  L YI R+   +  +  + R+L +      S D   QY+ 
Sbjct: 137 QLEDGVPQPFRSTLSGAMSAGLTYINRITHETEGVSLKSRLLVITCGSSASKDEVFQYIP 196

Query: 163 IMNAIFSAQRSMVPIDSCYLGA-QNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTD 221
           IMN IFSA +   PID   +G  + S FLQQA+  T G +      DGL QYL T    D
Sbjct: 197 IMNCIFSATKMKCPIDVVKVGGVKESTFLQQATDATNGNYLHVANTDGLIQYLSTAMFID 256

Query: 222 LHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQ 278
              R ++  P    VDFR SC+     + +G++CSVCL +        KC  C S F +
Sbjct: 257 PSLRQWVVKPNQSSVDFRTSCYLTGKVVAIGFVCSVCLCVLSIIPPGNKCPACDSEFDE 315


>gi|366988553|ref|XP_003674043.1| hypothetical protein NCAS_0A11040 [Naumovozyma castellii CBS 4309]
 gi|342299906|emb|CCC67662.1| hypothetical protein NCAS_0A11040 [Naumovozyma castellii CBS 4309]
          Length = 338

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 94/298 (31%), Positives = 133/298 (44%), Gaps = 26/298 (8%)

Query: 6   SKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQ------FLTHVLAFLNAILTLNQLNQVVV 59
           S +  +  SL+ V++D  P  W+      ++       L  +L FLNA L  N  NQV V
Sbjct: 16  SSITEETPSLLTVIVDITPKLWAELDEEINEKGNLINVLKSLLVFLNAHLAFNSSNQVAV 75

Query: 60  IATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQ-----EPEG 114
           IA       Y++  +S   + V +G+   L          F N DE L  +     E E 
Sbjct: 76  IAAHSQGIKYLFPKNSISVKGVNSGK-KDLSIINKDMYRRFRNVDETLVDELYALFEEEL 134

Query: 115 R-----IACSLLSGSLSMALCYIQRVFRS-GLLHPQPRILCLQ-----GSPDGPEQYVAI 163
           +        S LSG++S  L YI RV +    +  + R++ L       S D   QY+ I
Sbjct: 135 KQVNTVTQKSTLSGAMSAGLTYINRVTKELEAISLKSRLMVLTCGGKGESKDEIFQYIPI 194

Query: 164 MNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 222
           MN IFSA +   PID   +G ++ S FLQQ +  T G +   +   GL QYL T    D 
Sbjct: 195 MNCIFSATKIKCPIDVVKIGGSKESTFLQQTTDATNGAYLHVESTKGLIQYLATAMFIDP 254

Query: 223 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQ 278
             R  +  P    VDFR SC+     + +G+ICSVCL +        KC  C S F +
Sbjct: 255 SLRQIIVKPNHGSVDFRTSCYLTGKVVAVGFICSVCLCVLSIIPPGNKCPACDSEFDE 312


>gi|323346291|gb|EGA80581.1| Tfb4p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 313

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 92/301 (30%), Positives = 139/301 (46%), Gaps = 26/301 (8%)

Query: 2   ASAPSKLYSDDVSLVVVLLDTNPFFWSS------SSLSFSQFLTHVLAFLNAILTLNQLN 55
           A +  ++  +  SL+ V+++  P  W++         S  + L  ++ FLNA L  N  N
Sbjct: 12  ARSRKQVTEESPSLLTVIIEIAPKLWTTFDEEGNEKGSIIKVLEALIVFLNAHLAFNSAN 71

Query: 56  QVVVIATGYNSCDYVY-DSSSTGNQSVGNGRMPSLCATLLQNL-EEFMNKDE-------- 105
           +V VIA       Y+Y +S+S    S    +  S    +  ++   F N DE        
Sbjct: 72  KVAVIAAYSQGIKYLYPESTSALKASESENKTRSDLKIINSDMYRRFRNVDETLVEEIYK 131

Query: 106 --QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ----GSPDGPE 158
             +L K++ E     S L+G++S  L Y+ R+ +  +    + R+L L      S D   
Sbjct: 132 LFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCGSGSSKDEIF 191

Query: 159 QYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTI 217
           QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+   +  +GL QYL T 
Sbjct: 192 QYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTEGLIQYLATA 251

Query: 218 FGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSV 275
              D   R  +  P    VDFR SC+     +  G+ICSVCL +        KC  C S 
Sbjct: 252 MFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAXGFICSVCLCVLSIIPPGNKCPACDSQ 311

Query: 276 F 276
           F
Sbjct: 312 F 312


>gi|401838123|gb|EJT41898.1| TFB4-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 338

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 88/291 (30%), Positives = 133/291 (45%), Gaps = 26/291 (8%)

Query: 14  SLVVVLLDTNPFFWSS------SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           SL+ V+++  P  W++         S  + L  ++ FLNA L  N  N+V VIA      
Sbjct: 24  SLLTVIVEIAPKLWTTFDEEGNEKGSIIKVLEALIVFLNAHLAFNSANKVAVIAAHSQGI 83

Query: 68  DYVYDSSSTGNQSVGNGRMPSLCATLLQN--LEEFMNKDE----------QLGKQEPEGR 115
            Y+Y  S+T  ++  + +       ++ +     F N DE          ++ K + E  
Sbjct: 84  KYLYPESTTTTEASKSEKKSRSDLKIINSDMYRRFRNVDEALVEEIYKLFEVEKNQIEQN 143

Query: 116 IACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ----GSPDGPEQYVAIMNAIFSA 170
              S L+G++S  L Y+ R+ +  +    + R+L L      S D   QY+ IMN IFSA
Sbjct: 144 SQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCGSGSSKDEIFQYIPIMNCIFSA 203

Query: 171 QRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQ 229
            +   PID   +G ++ S FLQQ +  T GV+       GL QYL T    D   R  + 
Sbjct: 204 TKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVDSTQGLIQYLATAMFIDPSLRPIIV 263

Query: 230 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQ 278
            P    VDFR SC+     + +G+ICSVCL +        KC  C S F +
Sbjct: 264 KPNHGSVDFRTSCYLTGRVVAIGFICSVCLCVLSIIPPGNKCPACDSEFDE 314


>gi|341038752|gb|EGS23744.1| putative transcription factor [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 429

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 98/368 (26%), Positives = 157/368 (42%), Gaps = 109/368 (29%)

Query: 10  SDDV-SLVVVLLDTNPFFWSSSS--LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS 66
           +DD+ SL  +++DTNP  W++ +  L  S+ + ++L F+NA L  +  NQV +IA+  N 
Sbjct: 18  TDDIPSLRTIIIDTNPRAWAALADVLPLSKAIANILIFVNAHLAFSNSNQVAIIASHTNR 77

Query: 67  CDYVY---------DSSS---TGNQSVGNGRMPSLC---ATLLQNLEEFMNKDEQLGKQE 111
             ++Y          SSS      +S   G+ P       +LL ++   M+         
Sbjct: 78  AVWLYPQPPEPLPSGSSSHDAAARKSATIGKYPQFAQIEKSLLSSIRALMDD-----TTP 132

Query: 112 PEGRIACSLLSGSLSMALCYIQRVFRS--------------------------------- 138
            +     + +SG+L++AL +I +   S                                 
Sbjct: 133 SDLDTTTTQISGALTLALAHINKTALSLTASNTAAAAVATGHSLTAGSAASVAAKAASTS 192

Query: 139 ---GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASY 195
              GL     RIL +  S     QY+  MNA+F+A  + + ID+  L   ++ FL+QAS+
Sbjct: 193 TSAGLAGLHARILIISVSDSSAAQYIPTMNAVFAAAHARIAIDTLALRG-SATFLEQASF 251

Query: 196 ITGGVHHKPQQLDGLFQYLLTIFGTD-----------LHSRNFLQLPKPVG--------- 235
           IT G   +  +  GL QYL+  FG+               ++ +  PKP G         
Sbjct: 252 ITRGTFIRAAEPRGLLQYLMFGFGSGSAPSNPAAAAAAAEKSTIGGPKPSGAAAAQPAAG 311

Query: 236 ---------------------------VDFRASCFCHKNTIDMGYICSVCLSIYCKHLK- 267
                                      VDFRA+CFCH+N +D G++CSVCLSI+C+    
Sbjct: 312 TTKPGAAGKLPPGAAVTDLLMTPSAEAVDFRAACFCHRNVVDTGFVCSVCLSIFCEVPPG 371

Query: 268 -KCSTCGS 274
            +C TCG+
Sbjct: 372 GECMTCGT 379


>gi|365757943|gb|EHM99813.1| Tfb4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 338

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 88/291 (30%), Positives = 133/291 (45%), Gaps = 26/291 (8%)

Query: 14  SLVVVLLDTNPFFWSS------SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           SL+ V+++  P  W++         S  + L  ++ FLNA L  N  N+V VIA      
Sbjct: 24  SLLTVIVEIAPKLWTTFDEEGNEKGSIIKVLEALIVFLNAHLAFNSANKVAVIAAHSQGI 83

Query: 68  DYVYDSSSTGNQSVGNGRMPSLCATLLQN--LEEFMNKDE----------QLGKQEPEGR 115
            Y+Y  S+T  ++  + +       ++ +     F N DE          ++ K + E  
Sbjct: 84  KYLYPESTTTTEASKSEKKSRSDLKIINSDMYRRFRNVDEALVEEIYKLFEVEKNQIEQN 143

Query: 116 IACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ----GSPDGPEQYVAIMNAIFSA 170
              S L+G++S  L Y+ R+ +  +    + R+L L      S D   QY+ IMN IFSA
Sbjct: 144 SQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCGSGSSKDEIFQYIPIMNCIFSA 203

Query: 171 QRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQ 229
            +   PID   +G ++ S FLQQ +  T GV+       GL QYL T    D   R  + 
Sbjct: 204 TKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVDSTQGLIQYLATAMFIDPSLRPIIV 263

Query: 230 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQ 278
            P    VDFR SC+     + +G+ICSVCL +        KC  C S F +
Sbjct: 264 KPNHGSVDFRTSCYLTGRVVAIGFICSVCLCVLSIIPPGNKCPACDSEFDE 314


>gi|328352540|emb|CCA38939.1| RNA polymerase II transcription factor B subunit 4 [Komagataella
           pastoris CBS 7435]
          Length = 307

 Score =  120 bits (300), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 90/260 (34%), Positives = 133/260 (51%), Gaps = 31/260 (11%)

Query: 14  SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVI-ATGYNS---- 66
           SL+ V++DTNP  W+     +   +    VL  LNA L+LN  N+V V+ ++  NS    
Sbjct: 23  SLLAVVIDTNPLEWTKLKGVICLKELCKSVLVLLNAHLSLNSGNRVAVLTSSSLNSGPKF 82

Query: 67  -----CDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLG--KQEPEGRIACS 119
                 D  Y+   +   S  + +   +   ++Q L+  ++ +E L    QE +G IA  
Sbjct: 83  LYPDPNDKTYEKRESLLSSDIHRQFKFVDQKIIQELQILLD-NEPLNPENQELKGSIA-- 139

Query: 120 LLSGSLSMALCYIQRVFRSGLLHPQ-------PRILCLQGSPDGPEQYVAIMNAIFSAQR 172
              G++SMAL YI R+     + P+        ++L +  S D  +QYV  MN IFSAQ+
Sbjct: 140 ---GAMSMALSYINRLTN---IEPESTGNTLRAKMLVISISDDSTQQYVPFMNCIFSAQK 193

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
             V ID C +G  +S+FLQQAS +T GV+   + + GL QYL T    D   R  + LP 
Sbjct: 194 MKVSIDVCKMGP-DSSFLQQASDVTNGVYMLIKNVHGLIQYLTTALFIDPSLRPIMVLPT 252

Query: 233 PVGVDFRASCFCHKNTIDMG 252
              +DFRASCF     ID+G
Sbjct: 253 NSDLDFRASCFVTNKVIDIG 272


>gi|315052780|ref|XP_003175764.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
           gypseum CBS 118893]
 gi|311341079|gb|EFR00282.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
           gypseum CBS 118893]
          Length = 413

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 86/139 (61%), Gaps = 3/139 (2%)

Query: 138 SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 197
           +G L  +  I+ +  + D  +QY+ IMN+IF+ QR  +PID C L A ++ FLQQA   T
Sbjct: 225 AGALQSRILIISVSSAADSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDAT 283

Query: 198 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 257
            G++       G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+
Sbjct: 284 RGIYMSLDSPRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSI 343

Query: 258 CLSIYCKHLK--KCSTCGS 274
           CLSI+C+  +   C TCG+
Sbjct: 344 CLSIFCEPPEGANCLTCGT 362



 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 38/63 (60%), Gaps = 2/63 (3%)

Query: 11 DDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
          +D SL+ V+LDTNP  W+    +L  S  + ++L F+NA L  N  N+V V+A+  +   
Sbjct: 9  NDPSLLTVILDTNPHAWAQLQETLPLSSAVANLLVFINAHLACNYANKVAVVASHSHQAV 68

Query: 69 YVY 71
          ++Y
Sbjct: 69 WLY 71


>gi|403216997|emb|CCK71492.1| hypothetical protein KNAG_0H00770 [Kazachstania naganishii CBS
           8797]
          Length = 342

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 96/302 (31%), Positives = 138/302 (45%), Gaps = 46/302 (15%)

Query: 14  SLVVVLLDTNPFFW------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           SL+ V++D  P  W      +  + +  + L  ++ FLNA L  N  NQV +IA+     
Sbjct: 24  SLLTVVVDLTPKLWAEFDEETKEASNIIKMLRALIVFLNAHLAFNTSNQVAIIASHSQGI 83

Query: 68  DYVYDSSS----TGNQSVGN-----------GRMPSLCATLLQNLEEFMNKDEQLGKQEP 112
            Y+Y  SS    T  ++VG             R  ++  TL++ L       +++G  E 
Sbjct: 84  KYLYPRSSNRSETEAETVGKKDQFIINPGMYSRFRNVDETLVEELYTLFK--QEMG--EV 139

Query: 113 EGRIACSLLSGSLSMALCYIQRVFRSGLLHP------QPRILCLQ-GSPDGPEQ------ 159
           E     S L G++  AL Y  R     LLH       + RI+ +  G+ +G  Q      
Sbjct: 140 EKATQKSTLPGAMLAALAYTNR-----LLHDYETKSLKSRIMVMTCGNKNGTSQKEEIFQ 194

Query: 160 YVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIF 218
           Y+ IMN IFSA +   PID   +G A+ S FLQQ +  T G++      DGL QYL T  
Sbjct: 195 YIPIMNCIFSAAKLKCPIDVVKIGGARKSTFLQQTTDATNGIYLHVDSTDGLVQYLSTAM 254

Query: 219 GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVF 276
             D   R+ +  P    VDFR SCF     + +GYICSVCL +        +C  C S F
Sbjct: 255 FIDPSLRSIIVKPNQGSVDFRTSCFLTGKVVAVGYICSVCLCVLSVLPPGNRCPACESEF 314

Query: 277 GQ 278
            +
Sbjct: 315 DE 316


>gi|366998281|ref|XP_003683877.1| hypothetical protein TPHA_0A03670 [Tetrapisispora phaffii CBS 4417]
 gi|357522172|emb|CCE61443.1| hypothetical protein TPHA_0A03670 [Tetrapisispora phaffii CBS 4417]
          Length = 344

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 94/324 (29%), Positives = 151/324 (46%), Gaps = 47/324 (14%)

Query: 8   LYSDDV-SLVVVLLDTNPFFWSS------SSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
           L SD+  SL+ V++DT+P  W+          S  + L  +L FLNA L  N  N+V VI
Sbjct: 17  LNSDETPSLLTVIIDTSPRLWAELDGDMKEDGSIIKVLRSMLVFLNAHLATNNANKVAVI 76

Query: 61  ATGYNSCDYVYD--------------SSSTGNQSVGNGRM----PSLCATLLQNLEEFMN 102
           A       Y+Y               S+S  + ++ N  M     ++  +L++ + +   
Sbjct: 77  AAHSQGIKYLYPINTSHANKNKTNEISTSKKDLAIINPNMYRQFRNVDESLVEEIYKIFQ 136

Query: 103 KDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCL--------QGS 153
           K++    ++P+ +   S L+G++S  L YI R+ ++   +  + R++ +           
Sbjct: 137 KEKAEYLEKPKQK---STLAGAMSAGLTYINRIVKNEENYTLKSRLVVITCGGNAYTDNK 193

Query: 154 PDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQ 212
            +   QY+ IMN IFSA +   PID   +G +  S FLQQA+  T G++   +   GL Q
Sbjct: 194 VEEVIQYIPIMNCIFSATKMKCPIDVVKIGGSSESTFLQQATDATSGIYLYVKDYHGLIQ 253

Query: 213 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--KHLKKCS 270
           YLLT    +   R+ +  P    +DFR SC+     + +G+ICS+CL +        KC 
Sbjct: 254 YLLTAMFIEPSLRSTIVKPNSKSIDFRTSCYITGKVVAVGFICSICLCVMSILPPNNKCP 313

Query: 271 TCGSVFGQAQTQSDEPSATNRKRK 294
            C SVF       DE      KRK
Sbjct: 314 ACDSVF-------DEKVIVKLKRK 330


>gi|150865476|ref|XP_001384710.2| hypothetical protein PICST_83789 [Scheffersomyces stipitis CBS
           6054]
 gi|149386732|gb|ABN66681.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 376

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 102/357 (28%), Positives = 158/357 (44%), Gaps = 74/357 (20%)

Query: 10  SDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           +DD SL+ V+LD +P  W       S  +    +L F+NA L+LN  NQV  IA+     
Sbjct: 19  NDDPSLLTVILDVSPAGWYRIRDQTSIDELAKSLLVFMNAHLSLNNSNQVAFIASTPQKS 78

Query: 68  DYV-------YDSSSTGNQSVG-----NGRMPSLCATLLQNLEEFMNKDEQL-------- 107
            ++       YD   T + S G     + + P+L +  +    +F   DE +        
Sbjct: 79  KFLFPNPEIDYDEIRTSSSSSGSASNQHQKTPTLVSKDM--YRQFRVVDEAVLEELNVVF 136

Query: 108 -----GKQEPEGRIACSLLSGSLSMALCYIQRVFR------------------------S 138
                G Q+       S LSG+LSMAL Y  R+                          S
Sbjct: 137 DEIANGIQDINNN---STLSGALSMALTYTNRMLTLDQSISTTTASAINSTTSMGAGSGS 193

Query: 139 G--------------LLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA 184
           G              +   + RIL +  + +   +Y+ +MN+IF+AQ+    ID   LG 
Sbjct: 194 GNTATNSSTSNPSNSITSMKSRILIVTANDEDDVKYIPVMNSIFAAQKMRTSIDIAKLGF 253

Query: 185 QNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFC 244
           ++S++LQQA+  T G++       G+ Q L + F  +   R F+ LP    V++RASCF 
Sbjct: 254 EDSSYLQQAADATNGIYFHVHDPRGIVQTLTSAFFIEPSIRPFIILPTNSNVNYRASCFV 313

Query: 245 HKNTIDMGYICSVCLSIYCK--HLKKCSTCGSVFGQAQTQ--SDEPSATNRKRKTTD 297
              ++D+G++CSVCL I  K     KC  C SVF +      S  PS  ++K++  D
Sbjct: 314 TGKSVDIGFVCSVCLCIMSKIPPSGKCPACESVFDEKIIAQLSKGPSVLSKKKRKID 370


>gi|367035398|ref|XP_003666981.1| hypothetical protein MYCTH_2312237 [Myceliophthora thermophila ATCC
           42464]
 gi|347014254|gb|AEO61736.1| hypothetical protein MYCTH_2312237 [Myceliophthora thermophila ATCC
           42464]
          Length = 433

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 101/374 (27%), Positives = 155/374 (41%), Gaps = 115/374 (30%)

Query: 10  SDDV-SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS 66
           +DD+ SL  +++DTNP  W++    L  S+ + ++L F+N  L  +  NQV +IA   N 
Sbjct: 18  TDDIPSLCTIIIDTNPRAWAALNEVLPVSKAIANILVFVNTHLAFSNSNQVALIAAHSNR 77

Query: 67  CDYVY---------------DSSSTGNQSVGNGRMPSLC---ATLLQNLEEFMNK--DEQ 106
             ++Y               D+SS  +++    + P       +LL +L   ++   D  
Sbjct: 78  AVWLYPTPPKQAAPRDVDMQDASSNASKASSANKYPQFAQIEKSLLTSLRALIDDTTDAD 137

Query: 107 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS---------------------------- 138
           L    P+       +SG+L++AL +I +   S                            
Sbjct: 138 LSTTTPQ-------ISGALTLALAHINKTALSFSASATAAAAHAAATTGTAVTAGATVGP 190

Query: 139 -----------GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNS 187
                      GL     RIL +  S   P QY+  MNA+F+A  + V ID+  L   ++
Sbjct: 191 APVASTSTSSGGLAGLHARILIISVSDSAPAQYIPTMNAVFAAAHARVAIDTLALRG-SA 249

Query: 188 AFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTD-------------------------- 221
            FL+QAS+IT G   +  +  GL QYL+  FG+                           
Sbjct: 250 TFLEQASFITRGTFVRAAEPRGLLQYLMFGFGSGSAPSHPSGGADPGKGPAAPKTTAASV 309

Query: 222 -----------------LHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 264
                                + L  P    VDFRA+CFCH+N ID GY+CSVCLSI+C+
Sbjct: 310 PGGGPGGRRKTGRLGLGASVADLLLTPSADAVDFRAACFCHRNVIDTGYVCSVCLSIFCE 369

Query: 265 --HLKKCSTCGSVF 276
                +C TCG+V 
Sbjct: 370 VPPGGECLTCGTVL 383


>gi|195111388|ref|XP_002000261.1| GI10128 [Drosophila mojavensis]
 gi|193916855|gb|EDW15722.1| GI10128 [Drosophila mojavensis]
          Length = 334

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 84/301 (27%), Positives = 140/301 (46%), Gaps = 12/301 (3%)

Query: 6   SKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
           S L   ++ ++VV  D N  +       F++ +  +  F NA L  +  N +V+ +    
Sbjct: 21  SGLPEKNMLIIVVDYDANQSYVKRDLELFTKVVCSLTMFGNAYLLQSSENDLVIWSCSSY 80

Query: 66  SCDYVYDSSSTGNQSVGNGRMPSLCAT---LLQNLEEFMNKDEQLGK---QEPEGRIACS 119
           + + +Y           + ++  L          L   +++D  + K   +E E +   +
Sbjct: 81  AVNVIYPEKLIDPTKDSDSQLEELAVVESLTRMRLFNLISQDITVMKRQAKEEEEQTVTA 140

Query: 120 LLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVAI-MNAIFSAQRSMVP 176
           LL G+++M L Y+ R  R  + + H + RIL + GS + P     + MNA   A R  V 
Sbjct: 141 LLPGTVAMVLSYLSRCRREVAKVAHIRGRILIVSGSKEPPIPLATMQMNAFQVAARMGVV 200

Query: 177 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 236
           ID C L  ++S  L+ A+ ITGG +      D L   LL +F    H R  L  P     
Sbjct: 201 IDVCALELESSYMLRHAADITGGFYFSTSNFDTLSGVLLGLFLASPHVRQHLNYPVQPQA 260

Query: 237 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFG---QAQTQSDEPSATNRKR 293
           D RA CFCH   ++MG++CS CL+++CK+   C  C ++F    + Q   D  +A  R+R
Sbjct: 261 DLRAMCFCHSKLVEMGFVCSSCLTVFCKYTPICGKCETIFKPPQKLQMYIDNRAAQRRQR 320

Query: 294 K 294
           +
Sbjct: 321 E 321


>gi|50547937|ref|XP_501438.1| YALI0C04367p [Yarrowia lipolytica]
 gi|62901295|sp|Q6CD24.1|TFB4_YARLI RecName: Full=RNA polymerase II transcription factor B subunit 4;
           AltName: Full=RNA polymerase II transcription factor B
           34 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p34 subunit
 gi|49647305|emb|CAG81737.1| YALI0C04367p [Yarrowia lipolytica CLIB122]
          Length = 340

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 90/300 (30%), Positives = 141/300 (47%), Gaps = 33/300 (11%)

Query: 11  DDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCD 68
           D  SL+ +++D +   W    S +S S+ +  +L F+NA L L+  N V VI    +   
Sbjct: 21  DTPSLLSIIIDAHVPSWEEIKSQISISEAVASILVFINAHLALHNSNSVNVIGYNASGAR 80

Query: 69  YVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQ------------------ 110
            +Y   S G +S  +        ++    +    KD  + +Q                  
Sbjct: 81  ILYPPKS-GVESTRSKEREERSESVSDGEQAPSKKDHSMYRQFKTVDEVVQTELWNMLNH 139

Query: 111 ---EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQ-GSPDGPEQYVAIMNA 166
                E +   S +SG+LS+AL +I +         + RIL L  G  +   QY+  MN 
Sbjct: 140 TNYVEEEKQHNSAISGALSLALGFINKHVFVDESRMRARILLLTVGHKNETIQYIPTMNC 199

Query: 167 IFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD-----GLFQYLLTIFGT 220
           IF+AQ+  +P+D C LG   +  FLQQA   T G++    + +     GL QYLL+ F +
Sbjct: 200 IFAAQKLKIPVDVCKLGPGSDQVFLQQACDSTHGIYMDISEKNSKTPKGLVQYLLSGFIS 259

Query: 221 DLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGSVFGQ 278
           D   R  + LP    VDFRA+CF  K  +D+GY+CSVCL I  +    ++C TC + + +
Sbjct: 260 DPSLRPHIVLPTQSNVDFRAACFLTKQVVDIGYVCSVCLCIMSQIPSNRRCPTCDTTYSE 319


>gi|255727412|ref|XP_002548632.1| hypothetical protein CTRG_02929 [Candida tropicalis MYA-3404]
 gi|240134556|gb|EER34111.1| hypothetical protein CTRG_02929 [Candida tropicalis MYA-3404]
          Length = 369

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 96/340 (28%), Positives = 156/340 (45%), Gaps = 52/340 (15%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSS--LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           +DD SL+ ++LD +   W +    +S       +L FLN  L+LN  NQV  I +     
Sbjct: 20  NDDPSLLTIILDLSLKGWFNIKDLISIQDITKSLLVFLNGHLSLNNSNQVAFIVSSSKGS 79

Query: 68  DYVY-DSSSTGNQSVGN--GRMPSLCA-------TLLQNLEEFMNK---DEQLGKQEPEG 114
            ++Y D ++ GN +  +  G+ P +         ++L  L +F+ +     +  ++    
Sbjct: 80  KFLYPDLTTNGNNTTEDFGGQFPGMYRQFKMVDQSVLHQLNKFIEEVMATPEEEEENGRR 139

Query: 115 RIACSLLSGSLSMALCYIQRVF---------------------------------RSGLL 141
           R   + L+G+LSMAL Y  R+                                       
Sbjct: 140 RRTTNTLTGALSMALTYTNRMLTLDQTITTTTASAMTTSTLESTSNNNTSSSGSASVSTT 199

Query: 142 HPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 201
             + R+L +  + D   +Y+ +MN IF+AQ+  V ID   LG +NS++LQQAS  T GV+
Sbjct: 200 GIKSRVLIVSANDDDDIKYIPLMNCIFAAQKMKVSIDVAKLGHKNSSYLQQASDATRGVY 259

Query: 202 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 261
              +   G+ Q L T F  + + R ++ LP    V++RASCF    ++D+GY+CSVCL I
Sbjct: 260 LHIEDPKGIIQVLSTAFFIEPNLRPYIILPTNSNVNYRASCFLTGKSVDLGYVCSVCLCI 319

Query: 262 YCK--HLKKCSTCGSVFGQAQTQS--DEPSATNRKRKTTD 297
             K     KC  C S F +        EP   ++K++  D
Sbjct: 320 MSKLPDNGKCPACDSAFDKKIINELIREPQVISKKKRKLD 359


>gi|363751126|ref|XP_003645780.1| hypothetical protein Ecym_3480 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356889414|gb|AET38963.1| Hypothetical protein Ecym_3480 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 341

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 147/320 (45%), Gaps = 43/320 (13%)

Query: 7   KLYSDDVSLVVVLLDTNPFFWSSSSLSFSQ------FLTHVLAFLNAILTLNQLNQVVVI 60
           +L  D  SL+ +++DTNP  W+       Q       L  ++ FLNA L+ N  NQV VI
Sbjct: 17  QLVEDTPSLLTLVIDTNPKLWADLDKEVGQEGQLLQALKSIIIFLNAHLSFNNSNQVSVI 76

Query: 61  ATGYNSCDYVY-------DSSSTG----NQSVGNGRM----PSLCATLLQNLEEFMNK-- 103
           A       Y+Y       D S       ++S+ N  M     ++  T+++ L +   +  
Sbjct: 77  AAYSRGIKYLYPPADELEDMSEKKTLHEDKSIINRDMYRGFRNVDETVVEELYKLFIQEI 136

Query: 104 -DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGL-LHPQPRILCLQ-GSPDGPE-- 158
            D++  K +P      S LSG++S  L YI R+ +       + R+L +  GS  G +  
Sbjct: 137 NDQENSKIQP----IKSTLSGAMSAGLTYINRIVKENESTSLKSRLLVITCGSNAGKDEI 192

Query: 159 -QYVAIMNAIFSAQRSMVPIDSCYLGA-QNSAFLQQASYITGGVHHKPQQLDGLFQYLLT 216
            QY+ IMN IFSA +    ID   +G  + S FLQQA+  T G +   +  +GL QYL T
Sbjct: 193 FQYIPIMNCIFSATKMRCSIDVVKIGGTEESTFLQQATDATNGNYLHVENTEGLIQYLST 252

Query: 217 IFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGS 274
               D   R ++  P    VDFR SC+     + +G++CSVCL +        KC  C +
Sbjct: 253 AMFIDPSLRQWIVKPNQSAVDFRTSCYLTGTVVAIGFVCSVCLCVLSIIPPENKCPACDA 312

Query: 275 VFGQAQTQSDEPSATNRKRK 294
            F       DE      KRK
Sbjct: 313 EF-------DEKVVAKLKRK 325


>gi|326480715|gb|EGE04725.1| TFIIH basal transcription factor complex p34 subunit [Trichophyton
           equinum CBS 127.97]
          Length = 415

 Score =  117 bits (294), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 57/130 (43%), Positives = 82/130 (63%), Gaps = 3/130 (2%)

Query: 147 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 206
           I+ +  + D  +QY+ IMN+IF+ QR  +PID C L A ++ FLQQA   T G++     
Sbjct: 236 IISVSSAADSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 294

Query: 207 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 266
             G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 295 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 354

Query: 267 K--KCSTCGS 274
           +   C TCG+
Sbjct: 355 EGANCLTCGT 364



 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 41/70 (58%), Gaps = 2/70 (2%)

Query: 4  APSKLYSDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
          A S   ++D SL+ V+LDTNP  W+    +L  S  + ++L F+NA L  N  N+V V+A
Sbjct: 2  ADSGAAANDPSLLTVILDTNPHAWAQLQETLPLSSAVANLLVFINAHLACNYANKVAVVA 61

Query: 62 TGYNSCDYVY 71
          +  +   ++Y
Sbjct: 62 SHNHQAVWLY 71


>gi|326473589|gb|EGD97598.1| transcription factor TFIIH subunit Tfb4 [Trichophyton tonsurans CBS
           112818]
          Length = 415

 Score =  117 bits (294), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 57/130 (43%), Positives = 82/130 (63%), Gaps = 3/130 (2%)

Query: 147 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 206
           I+ +  + D  +QY+ IMN+IF+ QR  +PID C L A ++ FLQQA   T G++     
Sbjct: 236 IISVSSAADSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 294

Query: 207 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 266
             G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 295 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 354

Query: 267 K--KCSTCGS 274
           +   C TCG+
Sbjct: 355 EGANCLTCGT 364



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 42/70 (60%), Gaps = 2/70 (2%)

Query: 4  APSKLYSDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
          A S   ++D SL+ V+LDTNP  W+    +L  S  +T++L F+NA L  N  N+V V+A
Sbjct: 2  ADSGAAANDPSLLTVILDTNPHAWAQLQETLPLSSAVTNLLVFINAHLACNYANKVAVVA 61

Query: 62 TGYNSCDYVY 71
          +  +   ++Y
Sbjct: 62 SHNHQAVWLY 71


>gi|388851454|emb|CCF54856.1| related to TFIIH basal transcription factor complex p34 subunit
           [Ustilago hordei]
          Length = 423

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 60/149 (40%), Positives = 85/149 (57%), Gaps = 16/149 (10%)

Query: 144 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 203
           + RIL L  +PD   QY+ +MN IF+AQ+  + ID C L   ++ FLQQASY+T G + +
Sbjct: 250 KSRILILSVTPDASTQYIPMMNCIFAAQKKGITIDVCKLFGSDTVFLQQASYLTSGTYFR 309

Query: 204 ------PQQLDG----------LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKN 247
                   +++G          L Q LLT +      R  + LP    +DFRA+CFCH+ 
Sbjct: 310 LSDDSDALKVNGKEGITDMRSSLVQTLLTTYLPSRSMRGVMNLPTLEEIDFRAACFCHRK 369

Query: 248 TIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            +D+GYICSVCLS++C+    C TC S F
Sbjct: 370 IVDIGYICSVCLSLFCQPRPFCLTCRSKF 398


>gi|296815720|ref|XP_002848197.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
           otae CBS 113480]
 gi|238841222|gb|EEQ30884.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
           otae CBS 113480]
          Length = 413

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 59/138 (42%), Positives = 84/138 (60%), Gaps = 3/138 (2%)

Query: 139 GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 198
           G L  +  I+ +  + D  +QY+ IMN+IF+ QR  +PID C L A ++ FLQQA   T 
Sbjct: 226 GSLQSRILIVSVSSAADSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATR 284

Query: 199 GVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC 258
           G++       G  QYL+  F  D   R  L LP  V VDFRA+CFCH+  +D+G++CS+C
Sbjct: 285 GIYMSVDSPRGFLQYLMLAFLPDQRVRRNLVLPTRVDVDFRAACFCHRKVVDVGFVCSIC 344

Query: 259 LSIYCKHLK--KCSTCGS 274
           LSI+C+  +   C TCG+
Sbjct: 345 LSIFCEPPEGANCLTCGT 362



 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 2/70 (2%)

Query: 4  APSKLYSDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
          A S   S D SL+ V+LDTNP  W+    +L  S  + ++L F+NA L  N  N+V V+A
Sbjct: 2  AESTSASTDPSLLTVILDTNPHAWAQLQETLPLSSAVANLLVFINAHLACNYSNKVAVVA 61

Query: 62 TGYNSCDYVY 71
          +  +   ++Y
Sbjct: 62 SHSHQAVWLY 71


>gi|385301449|gb|EIF45638.1| subunit of TFIIH complex [Dekkera bruxellensis AWRI1499]
          Length = 219

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 79/185 (42%), Positives = 99/185 (53%), Gaps = 15/185 (8%)

Query: 94  LQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPRILCLQ 151
           LQNL+  + K E   K           LSG+LS AL YI R F++   H   + RI+C+ 
Sbjct: 3   LQNLKSLVGKGETHIK---------GTLSGALSQALSYINR-FQNSDEHSGMKARIVCIS 52

Query: 152 GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLF 211
            S D    YV+IMN+IF+AQ+  V +D C LG  NS FLQQAS  T GV+       GL 
Sbjct: 53  VSGDNTLPYVSIMNSIFAAQKQKVSVDVCKLGP-NSTFLQQASDSTNGVYIYIXHPLGLI 111

Query: 212 QYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKC 269
           QYL T    D   R  + LP    +DFRASCF     +D+GY+CSVCL I       + C
Sbjct: 112 QYLSTALFIDPMLRPMVVLPTNTNIDFRASCFITNKVVDIGYVCSVCLCILSVIPEDEIC 171

Query: 270 STCGS 274
            TC S
Sbjct: 172 PTCHS 176


>gi|320594208|gb|EFX06611.1| transcription factor tfiih subunit [Grosmannia clavigera kw1407]
          Length = 429

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 96/350 (27%), Positives = 146/350 (41%), Gaps = 106/350 (30%)

Query: 14  SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  W++  + L  S+ + ++L F+N+ L L+  N+V V+A+  N   ++Y
Sbjct: 20  SLLTIVLDTNPRAWAALAAVLPLSKAVANILVFVNSHLALSSTNRVAVVASHCNRAVWLY 79

Query: 72  DSSS-----------------TGNQSVGNGRMPS--------------------LCATLL 94
            + +                   +    +   PS                    +  TLL
Sbjct: 80  PTPAGSSLLPKKPPSDGLDVDMADAPASHPHRPSHPDPADQSGRSANKFPQFREIETTLL 139

Query: 95  QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVF------------------ 136
           Q++ + ++        E       +LLSG+L++AL +I +                    
Sbjct: 140 QSVHDLVSSTAAADLDE-----TTTLLSGALTLALAHINKTALTIPAAAAAANDGGSGGK 194

Query: 137 -------------------------------RSGLLHP------QPRILCLQGSPDGPEQ 159
                                          RSG   P        RIL +  S   P Q
Sbjct: 195 RSRPRPGGVLAGSATSGNAKKTGTSAGLADSRSGKTDPVDDVGLHARILVISVSDSSPAQ 254

Query: 160 YVAIMNAIFSAQRSMVPIDSCYL-GAQNSAFLQQASYITGG--VHHKPQQLDGLFQYLLT 216
           Y+A MNA+F+A  + V ID   L GA  + FLQQA+YIT G  V     Q  GL  YL+ 
Sbjct: 255 YIATMNAVFAAAHAGVAIDVLALRGA--ATFLQQAAYITRGTFVRVGADQKHGLLAYLML 312

Query: 217 IF--GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 264
            F   +       L  P    VDFRA+CFCH+  +D G++CSVCLSI+C+
Sbjct: 313 AFNAASSHDVAQHLVAPSADAVDFRAACFCHRRVVDTGFVCSVCLSIFCE 362


>gi|302656245|ref|XP_003019878.1| hypothetical protein TRV_06076 [Trichophyton verrucosum HKI 0517]
 gi|291183651|gb|EFE39254.1| hypothetical protein TRV_06076 [Trichophyton verrucosum HKI 0517]
          Length = 517

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 56/130 (43%), Positives = 82/130 (63%), Gaps = 3/130 (2%)

Query: 147 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 206
           I+ +  + +  +QY+ IMN+IF+ QR  +PID C L A ++ FLQQA   T G++     
Sbjct: 338 IISVSSAANSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 396

Query: 207 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 266
             G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 397 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 456

Query: 267 K--KCSTCGS 274
           +   C TCG+
Sbjct: 457 EGANCLTCGT 466



 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 40/68 (58%), Gaps = 2/68 (2%)

Query: 6   SKLYSDDVSLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           + L S D SL+ V+LDTNP  W+    +L  S  + ++L F+NA L  N  N+V V+A+ 
Sbjct: 107 TDLISPDPSLLTVILDTNPHAWAQLQETLPLSSAVANLLVFINAHLACNYANKVAVVASH 166

Query: 64  YNSCDYVY 71
            +   ++Y
Sbjct: 167 SHQAVWLY 174


>gi|327299656|ref|XP_003234521.1| transcription factor TFIIH subunit Tfb4 [Trichophyton rubrum CBS
           118892]
 gi|326463415|gb|EGD88868.1| transcription factor TFIIH subunit Tfb4 [Trichophyton rubrum CBS
           118892]
          Length = 413

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 56/130 (43%), Positives = 82/130 (63%), Gaps = 3/130 (2%)

Query: 147 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 206
           I+ +  + +  +QY+ IMN+IF+ QR  +PID C L A ++ FLQQA   T G++     
Sbjct: 234 IISVSSAANSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 292

Query: 207 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 266
             G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 293 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 352

Query: 267 K--KCSTCGS 274
           +   C TCG+
Sbjct: 353 EGANCLTCGT 362



 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 41/70 (58%), Gaps = 2/70 (2%)

Query: 4  APSKLYSDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
          A S   ++D SL+ V+LDTNP  W+    +L  S  + ++L F+NA L  N  N+V V+A
Sbjct: 2  ADSGAAANDPSLLTVILDTNPHAWAQLQETLPLSSAVANLLVFINAHLACNYANKVAVVA 61

Query: 62 TGYNSCDYVY 71
          +  +   ++Y
Sbjct: 62 SHSHQAVWLY 71


>gi|444724893|gb|ELW65479.1| Tectonic-2 [Tupaia chinensis]
          Length = 805

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 55/113 (48%), Positives = 78/113 (69%), Gaps = 1/113 (0%)

Query: 150 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 209
           ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  
Sbjct: 10  IKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPS 68

Query: 210 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 262
           L QYLL +F  D   R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+
Sbjct: 69  LLQYLLWVFLPDQDQRSQLTLPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIF 121


>gi|354543147|emb|CCE39865.1| hypothetical protein CPAR2_602840 [Candida parapsilosis]
          Length = 381

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 97/355 (27%), Positives = 157/355 (44%), Gaps = 74/355 (20%)

Query: 10  SDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
           +D+ SL+ ++LD +P  W +  +S+S       +L FLN  L++N  NQV  IA+  ++ 
Sbjct: 19  NDEPSLLTIILDISPKGWYNIRNSISLQDVAKSLLVFLNGHLSINNSNQVAFIASSPHTS 78

Query: 68  DYVYDSSSTGNQSVGNGR----------MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIA 117
            ++Y   +  N   G  +            ++   +LQ L++F+   ++  +Q       
Sbjct: 79  KFLYPHPTNANADSGITKSKISPGMYRQFKNVDEVVLQELDQFV---QETSQQLHNATAR 135

Query: 118 CSLLSGSLSMALCYIQRVFRSGLLHPQ--------------------------------- 144
            S ++G++SMAL Y    +R  +L                                    
Sbjct: 136 HSSITGAISMALTY---TYRMSILDQSIQTTTAAAINTSTMSSAKQSEVAAASSSTTAST 192

Query: 145 ---------------PRILCLQGSPDGPE--QYVAIMNAIFSAQRSMVPIDSCYLGAQNS 187
                           RIL +  + DG +   Y+ +MN IF+AQ+  V ID   LG +NS
Sbjct: 193 TSATAAVSSSSTSLKSRILIVSAN-DGDDSINYIPLMNCIFTAQKLKVSIDVAKLGFKNS 251

Query: 188 AFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKN 247
           ++LQQAS  T GV+   +   G+ Q L T F  +   R ++ LP    V+++ASCF  K 
Sbjct: 252 SYLQQASDATNGVYLHIEDPRGITQVLATAFFIEPSLRPYIILPTNTQVNYKASCFITKK 311

Query: 248 TIDMGYICSVCLSIYCK--HLKKCSTCGSVFGQAQTQ--SDEPS-ATNRKRKTTD 297
            +D+G++CSVCL I        KC  C S F +      +  P   T +KRK  D
Sbjct: 312 PVDVGFVCSVCLCIMSSIPDDGKCPACSSDFDKRSINELTKIPQVVTKKKRKLED 366


>gi|410082137|ref|XP_003958647.1| hypothetical protein KAFR_0H01020 [Kazachstania africana CBS 2517]
 gi|372465236|emb|CCF59512.1| hypothetical protein KAFR_0H01020 [Kazachstania africana CBS 2517]
          Length = 328

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 96/318 (30%), Positives = 138/318 (43%), Gaps = 39/318 (12%)

Query: 6   SKLYSDDVSLVVVLLDTNPFFWSS------SSLSFSQFLTHVLAFLNAILTLNQLNQVVV 59
           S +  +  SL+ V++D  P  W+       +  S    L  +L FLNA L  N  NQV V
Sbjct: 16  STVLEETPSLLAVIIDIAPKLWAEFDEEKDTEGSIINVLQALLVFLNAHLAFNTSNQVAV 75

Query: 60  IATGYNSCDYVYDSSSTGNQSVGNG-------------RMPSLCATLLQNLEEFMNKDEQ 106
           IA       Y+Y  ++     V                R  ++   L++ L     K+EQ
Sbjct: 76  IAAHSQGIKYLYPENNINKNDVLKSDKDLSIINKGMYRRFRNVDEVLVEELYSLF-KEEQ 134

Query: 107 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS-GLLHPQPRIL---CLQGSPDGPE--QY 160
               E +     S L G++S  L Y+ R+ +    L  + R++   C  G  +  E  QY
Sbjct: 135 ---NELDKITQKSTLPGAISAGLTYVNRISKEMETLSLKSRLMVLTCGSGMANKEEIFQY 191

Query: 161 VAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFG 219
           + IMN IFSA +   P+D   +G ++ S FLQQ +  T GV+   +   GL QYL T   
Sbjct: 192 IPIMNCIFSATKLKCPVDVVKIGGSKESTFLQQTTDATHGVYLHVETTKGLIQYLTTAMF 251

Query: 220 TDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFG 277
            D   R  +  P    VDFR SC+     + +G+ICSVCL +        KC  C S F 
Sbjct: 252 IDPSLRQIIVKPNQGSVDFRTSCYLTGKVVAVGFICSVCLCVLSIIPPGNKCPACDSEF- 310

Query: 278 QAQTQSDEPSATNRKRKT 295
                 DE      K+KT
Sbjct: 311 ------DERVINKLKKKT 322


>gi|443895518|dbj|GAC72864.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit TFB4 [Pseudozyma antarctica
           T-34]
          Length = 425

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 59/148 (39%), Positives = 83/148 (56%), Gaps = 15/148 (10%)

Query: 144 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH-- 201
           + RIL L  + D   QY+ +MN IF+AQ+  + ID C L   ++ FLQQASY+T G +  
Sbjct: 253 KSRILILSVTQDASTQYIPMMNCIFAAQKKGITIDVCKLLGNDTVFLQQASYLTSGTYFR 312

Query: 202 ----------HKPQQLD---GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 248
                      +   +D    L Q LLT +      R+ + LP    +DFRA+CFCH+  
Sbjct: 313 LSDEQAGPSGQEASAIDLRSALVQTLLTTYLPSRSMRSSMNLPTLEEIDFRAACFCHRKI 372

Query: 249 IDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +D+GYICSVCLS++C+    C TC S F
Sbjct: 373 VDVGYICSVCLSLFCQPRPFCLTCRSKF 400


>gi|302496747|ref|XP_003010374.1| hypothetical protein ARB_03075 [Arthroderma benhamiae CBS 112371]
 gi|291173917|gb|EFE29734.1| hypothetical protein ARB_03075 [Arthroderma benhamiae CBS 112371]
          Length = 414

 Score =  114 bits (284), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 55/130 (42%), Positives = 82/130 (63%), Gaps = 3/130 (2%)

Query: 147 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 206
           I+ +  + +  +QY+ IMN+IF+ QR  +PID C L A ++ FLQQA   T G++     
Sbjct: 235 IISVSSAANSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 293

Query: 207 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 266
             G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +++G++CS+CLSI+C+  
Sbjct: 294 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVNVGFVCSICLSIFCEPP 353

Query: 267 K--KCSTCGS 274
           +   C TCG+
Sbjct: 354 EGANCLTCGT 363



 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 41/70 (58%), Gaps = 2/70 (2%)

Query: 4  APSKLYSDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
          A S   ++D SL+ V+LDTNP  W+    +L  S  + ++L F+NA L  N  N+V V+A
Sbjct: 2  ADSGAAANDPSLLTVILDTNPHAWAQLQETLPLSSAVANLLVFINAHLACNYANKVAVVA 61

Query: 62 TGYNSCDYVY 71
          +  +   ++Y
Sbjct: 62 SHSHQAVWLY 71


>gi|50303925|ref|XP_451910.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|62901298|sp|Q6CVX9.1|TFB4_KLULA RecName: Full=RNA polymerase II transcription factor B subunit 4;
           AltName: Full=RNA polymerase II transcription factor B
           34 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p34 subunit
 gi|49641042|emb|CAH02303.1| KLLA0B08569p [Kluyveromyces lactis]
          Length = 337

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 97/299 (32%), Positives = 136/299 (45%), Gaps = 42/299 (14%)

Query: 11  DDVSLVVVLLDTNPFFW--------------SSSSLSFSQFLTHVLAFLNAILTLNQLNQ 56
           D  SL+ V++DT+   W              SS      + L  ++ FLNA L  N  NQ
Sbjct: 21  DTPSLLTVVVDTSIHSWVQLTKQQSGSGSEGSSGEKQLIEALKSIVVFLNAHLAFNSGNQ 80

Query: 57  VVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDE-------QLGK 109
           V +IA       Y+Y S+ +          PS+          F N DE       +L K
Sbjct: 81  VCLIAAHSEGMKYLYPSADSK---------PSMSMVSSDMYRGFRNVDEIVVEQWYRLFK 131

Query: 110 QEPEGRIAC----SLLSGSLSMALCYIQRVFRSGL-LHPQPRILCLQ-GSPDGPE---QY 160
           +E EG+ +     S LSG++S AL Y+ R+ +       + R+L +  G+  G +   QY
Sbjct: 132 EELEGQESKVSMKSSLSGAMSSALTYVNRILKENENTSLRSRLLVITCGTSQGKDEIFQY 191

Query: 161 VAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFG 219
           + IMN IFSA +    ID   +G    S FLQQA+  T GV+   +   GL QYL T   
Sbjct: 192 IPIMNCIFSATKMKCSIDVVKIGGGIESTFLQQATDATSGVYLHVENTRGLIQYLSTAMF 251

Query: 220 TDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVF 276
            D   RN +  P    VDFR SCF     + +G++CSVCL +       +KC  C S F
Sbjct: 252 IDPSLRNVIIKPNQGSVDFRTSCFLTGKVVAVGFVCSVCLCVLSVIPPGQKCPACDSPF 310


>gi|343426778|emb|CBQ70306.1| related to TFIIH basal transcription factor complex p34 subunit
           [Sporisorium reilianum SRZ2]
          Length = 422

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 59/148 (39%), Positives = 83/148 (56%), Gaps = 15/148 (10%)

Query: 144 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 203
           + RIL L  + D   QY+ +MN IF+AQ+  + ID C L   ++ FLQQASY+T G + +
Sbjct: 251 KSRILILSVTQDASTQYIPMMNCIFAAQKRGITIDVCKLFGSDTVFLQQASYLTSGTYFR 310

Query: 204 ----PQQLDG-----------LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 248
                +Q +G           L Q LLT +      R  + LP    +DFR +CFCH+  
Sbjct: 311 LADDHEQSNGDKAGAIDMRSSLVQTLLTTYLPSRSMRGVMNLPTLEEIDFRTACFCHRRI 370

Query: 249 IDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +D+GYICSVCLS++C+    C TC S F
Sbjct: 371 VDVGYICSVCLSLFCQPRPFCLTCRSKF 398


>gi|239611920|gb|EEQ88907.1| transcription factor TFIIH subunit Tfb4 [Ajellomyces dermatitidis
           ER-3]
          Length = 425

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 84/141 (59%), Gaps = 8/141 (5%)

Query: 147 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK-PQ 205
           I+ L  +    + Y+ IMN+IF+ QR  +PID   L A ++AFLQQAS  T G++     
Sbjct: 245 IISLSSTTHAAQHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPVTS 303

Query: 206 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 265
              G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ 
Sbjct: 304 HPAGFLQYLMLGFLPDQRARAHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 363

Query: 266 L------KKCSTCGSVFGQAQ 280
           L       +C TCGS  G  +
Sbjct: 364 LGGVDGVAECLTCGSQLGVGE 384



 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 37/60 (61%), Gaps = 2/60 (3%)

Query: 14 SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
          SL+ V+LDTNP  W+    SL FS  + ++L F+NA L  N  N+V V+A+  +   ++Y
Sbjct: 21 SLLTVILDTNPHAWARLEPSLPFSAAIANLLVFINAHLACNYANKVAVVASHCHHATWLY 80


>gi|327352824|gb|EGE81681.1| transcription factor TFIIH subunit Tfb4 [Ajellomyces dermatitidis
           ATCC 18188]
          Length = 426

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 84/141 (59%), Gaps = 8/141 (5%)

Query: 147 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK-PQ 205
           I+ L  +    + Y+ IMN+IF+ QR  +PID   L A ++AFLQQAS  T G++     
Sbjct: 246 IISLSSTTHAAQHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPVTS 304

Query: 206 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 265
              G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ 
Sbjct: 305 HPAGFLQYLMLGFLPDQRARAHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 364

Query: 266 L------KKCSTCGSVFGQAQ 280
           L       +C TCGS  G  +
Sbjct: 365 LGGVDGVAECLTCGSQLGVGE 385



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 37/60 (61%), Gaps = 2/60 (3%)

Query: 14 SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
          SL+ V+LDTNP  W+    SL FS  + ++L F+NA L  N  N+V V+A+  +   ++Y
Sbjct: 21 SLLTVILDTNPHAWARLEPSLPFSAAIANLLVFINAHLACNYANKVAVVASHCHHATWLY 80


>gi|261201796|ref|XP_002628112.1| transcription factor TFIIH subunit Tfb4 [Ajellomyces dermatitidis
           SLH14081]
 gi|239590209|gb|EEQ72790.1| transcription factor TFIIH subunit Tfb4 [Ajellomyces dermatitidis
           SLH14081]
          Length = 425

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 84/141 (59%), Gaps = 8/141 (5%)

Query: 147 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK-PQ 205
           I+ L  +    + Y+ IMN+IF+ QR  +PID   L A ++AFLQQAS  T G++     
Sbjct: 245 IISLSSTTHAAQHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPVTS 303

Query: 206 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 265
              G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ 
Sbjct: 304 HPAGFLQYLMLGFLPDQRARAHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 363

Query: 266 L------KKCSTCGSVFGQAQ 280
           L       +C TCGS  G  +
Sbjct: 364 LGGVDGVAECLTCGSQLGVGE 384



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 37/60 (61%), Gaps = 2/60 (3%)

Query: 14 SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
          SL+ V+LDTNP  W+    SL FS  + ++L F+NA L  N  N+V V+A+  +   ++Y
Sbjct: 21 SLLTVILDTNPHAWARLEPSLPFSAAIANLLVFINAHLACNYANKVAVVASHCHHATWLY 80


>gi|260939906|ref|XP_002614253.1| hypothetical protein CLUG_05739 [Clavispora lusitaniae ATCC 42720]
 gi|238852147|gb|EEQ41611.1| hypothetical protein CLUG_05739 [Clavispora lusitaniae ATCC 42720]
          Length = 332

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 90/312 (28%), Positives = 145/312 (46%), Gaps = 27/312 (8%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
           +D+ SL+ V+L+  P   ++   S    +  ++ FLNA L+LN  NQV  + + +    +
Sbjct: 18  ADNPSLLTVILELTPKSIAALDASLQDIVKALVVFLNAHLSLNNSNQVAFLVSSHVGARF 77

Query: 70  VYDSSS---TGNQSVGNGRMPSLCATLLQNLEEFMNKD-EQLGKQEPEGRIACSLLSGSL 125
           ++   S   T  + V  G M      + + +   +NK+ E+L         A S L+G+L
Sbjct: 78  LHPGVSDVETETRFVNPG-MYRQFRLVDEAVFGALNKELERLASAAKND--ARSTLAGAL 134

Query: 126 SMALCYIQRVF---RSG---------------LLHPQPRILCLQGSPDGPEQYVAIMNAI 167
           SMA+ Y  R+    +SG                     R+L +  +      Y+ IMNAI
Sbjct: 135 SMAMTYTNRMLHVDQSGSAVQVAAAGTAGAQTTAAMGARVLVVSANESDDNNYMGIMNAI 194

Query: 168 FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 227
           F+AQ+  V ID   LG +++ +L+QA+  T GV+       G+ Q L T F  +   R+ 
Sbjct: 195 FAAQKMKVAIDVAKLGRRSAPYLEQAADATQGVYLHVADPRGMVQTLATAFFVEPSLRSV 254

Query: 228 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--KHLKKCSTCGSVFGQAQTQSDE 285
           + LP    VD++ASCF     +D+GY+CSVCL I         C  C S F +       
Sbjct: 255 VILPTHGNVDYKASCFLTGRAVDVGYVCSVCLCIMSMLPDSGACPMCHSQFDEHHIARLR 314

Query: 286 PSATNRKRKTTD 297
              + RKR+  +
Sbjct: 315 RGPSVRKRRKME 326


>gi|225684398|gb|EEH22682.1| RNA polymerase II transcription factor B subunit 4
           [Paracoccidioides brasiliensis Pb03]
          Length = 382

 Score =  110 bits (276), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 97/322 (30%), Positives = 142/322 (44%), Gaps = 85/322 (26%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ V+LDTNP+ W+   S+L  S  + ++L F+NA L  N  N+V V+A+  N   ++Y
Sbjct: 19  SLLTVILDTNPYAWALLESTLPLSAAIANLLVFINAHLACNYANKVAVVASHCNHAAWLY 78

Query: 72  DSSSTGNQS-------------VGNGRMPSL---------CATLLQNLEEFMNKDEQLGK 109
            + ST + S               NG+ PS            +       F+  +EQL +
Sbjct: 79  PTPSTPSISQSKAIQLDADGDITMNGQ-PSTEPEPENQQQSQSPFNKYRPFLLVEEQLTR 137

Query: 110 Q-------EPEGRIACS---LLSGSLSMALCYIQR------------------------- 134
                    P   +  +   +L+G+L++AL +I R                         
Sbjct: 138 NLHCLLSATPPSAVTSTTSTMLAGALTLALSHINRESIAYAEAHGAANPDPTSSTTGPAT 197

Query: 135 VFRSGLLHP---------------------QPRILC--LQGSPDGPEQYVAIMNAIFSAQ 171
              SGL  P                     Q RIL   L  +    + Y+ IMN+IF+ Q
Sbjct: 198 STASGLPPPPNSTTADPTSAAANPLNPTALQSRILIISLSNNTHSAQHYIPIMNSIFACQ 257

Query: 172 RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ-QLDGLFQYLLTIFGTDLHSRNFLQL 230
           R  +PID   L A ++AFLQQAS  TGG++        G  QYL+  F  D  +R  L L
Sbjct: 258 RLHIPIDIIKL-AGDAAFLQQASDATGGIYIPINAHPAGFLQYLMLGFLPDQRARAHLIL 316

Query: 231 PKPVGVDFRASCFCHKNTIDMG 252
           P  V VDFRA+CFCH+  +D+G
Sbjct: 317 PSRVDVDFRAACFCHRKVVDVG 338


>gi|453081361|gb|EMF09410.1| transcription factor Tfb4 [Mycosphaerella populorum SO2202]
          Length = 287

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 60/134 (44%), Positives = 83/134 (61%), Gaps = 6/134 (4%)

Query: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHH--K 203
           R+L L  S D   QY+ IMN+IF+ QR  +PID   L A ++ FLQQA+  TGG++    
Sbjct: 114 RLLILSVSGDLANQYIPIMNSIFACQRLSIPIDILKL-AGDTVFLQQAADATGGIYMALD 172

Query: 204 PQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVGVDFRASCFCHKNTIDMGYICSVCLSIY 262
                G  QYL+  +  D  +R  L  P +  GVDFRA+CFCH+  +D+G++CS+CLSI+
Sbjct: 173 SNTRGGFLQYLMFAYLPDETARKHLITPGEGEGVDFRAACFCHRRVVDIGFVCSICLSIF 232

Query: 263 CK--HLKKCSTCGS 274
           C+  H   C TCGS
Sbjct: 233 CEPLHDSLCLTCGS 246


>gi|443923251|gb|ELU42522.1| Tfb4 domain-containing protein [Rhizoctonia solani AG-1 IA]
          Length = 872

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 85/297 (28%), Positives = 128/297 (43%), Gaps = 57/297 (19%)

Query: 16  VVVLLDTNPFFWSSSS---------LSFSQFLTHVLAFLNAILTLNQLNQVVVIAT--GY 64
           ++++LD +P  W  S+         +  S FL+ VL F+N  L   Q N   ++    G 
Sbjct: 573 LILILDISPVQWQLSASSTANGGQGMDLSTFLSQVLVFVNCHLGCRQENSAAILGAFPGR 632

Query: 65  NSCDYVYDSSSTGNQSVG----------------------NGRMPSLCATLLQNL-EEFM 101
           +   Y   +S+  N  +G                         + S+  +  QNL + + 
Sbjct: 633 SLTLYPAPASTQSNTGLGPESESNTYHGFRSVDDVLMERIQSELASMSESDTQNLIQAYR 692

Query: 102 NKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG--PEQ 159
           N+       +   R   S++S    +               P P I+C+    D   P  
Sbjct: 693 NQQNIRAPTKTRKRKGGSIVSLHYRL---------------PHPDIVCVARCFDTIYPIH 737

Query: 160 YVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFG 219
            + ++ A       +VPID C +   ++ FLQQASY+T G +        L Q L   F 
Sbjct: 738 ELHLLGA------KIVPIDVCRVFGPDTIFLQQASYLTRGCYLNLDSGSSLLQSLTMCFL 791

Query: 220 TDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
                RN ++LP    VDFRASCFCHK  +DMGY+CSVCLSI+CK +  C+TC   F
Sbjct: 792 PSASLRNTIELPSQGKVDFRASCFCHKEVVDMGYVCSVCLSIFCKPVVVCATCKIKF 848


>gi|344229675|gb|EGV61560.1| transcription factor Tfb4 [Candida tenuis ATCC 10573]
          Length = 381

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 87/336 (25%), Positives = 148/336 (44%), Gaps = 63/336 (18%)

Query: 6   SKLYSDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           S + +DD SL+ ++L+ +P  W +     S    +  +L FLN  L+LN  NQV  + + 
Sbjct: 15  SDVSNDDPSLLTIVLEVSPEGWFNIKDQTSIMDVVKSLLVFLNGHLSLNNSNQVAFVLSS 74

Query: 64  YNSCDYVYDSSS----------TGNQSVGNGRMPSLCATLLQNL-EEFMNKDE------- 105
                ++Y  +S          T  ++V N       + + + +  +F   DE       
Sbjct: 75  PAGSKFLYPDTSKTYEEIPLNVTSEETVENSNKSEELSFVTKGMYRQFRVVDETVLSGLS 134

Query: 106 ----QLGKQEPEGRIACSLLSGSLSMALCYIQRVFR------------------------ 137
                +   + +     S LSG+LS+AL Y  R+ +                        
Sbjct: 135 QTMSDITHAQDDKVTGSSRLSGALSLALSYTNRMMKLDQSISTTTASAISSAANISNKES 194

Query: 138 -------------SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA 184
                         G    + RIL +  + +   +Y+++M AIF AQ+  V ID   LG 
Sbjct: 195 STAGASSGTNANSYGTSRMKSRILVVTPNDNEDIKYISLMKAIFGAQKMKVAIDIAKLGR 254

Query: 185 QNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFC 244
           ++S++LQQA+  T GV+   ++  GL Q L T +  +   R  + LP    V+++ASCF 
Sbjct: 255 KDSSYLQQAADATNGVYLHIEKPLGLIQVLCTAYFIEPSIRPLMILPTNSNVNYKASCFI 314

Query: 245 HKNTIDMGYICSVCLSIYCKHLKK--CSTCGSVFGQ 278
              ++++GY+CSVCL I  +   +  C TC S F +
Sbjct: 315 TGKSVEIGYVCSVCLCIMSEIPDRMSCPTCNSHFDE 350


>gi|313239039|emb|CBY14022.1| unnamed protein product [Oikopleura dioica]
          Length = 291

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 94/300 (31%), Positives = 140/300 (46%), Gaps = 37/300 (12%)

Query: 14  SLVVVLLDTNPFFW----------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
           SL+ V+LD NP  W          +S S SF + L   L F+++ L +   N+VVV+   
Sbjct: 9   SLIAVVLDLNPLAWGFRAAQDAQRTSKSESFVKALDATLTFISSYLLMTHGNKVVVVGAH 68

Query: 64  -------YNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRI 116
                  + S  Y  +++S  N+ +   R+  LC  +L  L+E +       +  P  R 
Sbjct: 69  PLGAEILHPSDTYSGNTASMVNRQI-EARVNELCRGVL--LDETIAD----SRTSPASR- 120

Query: 117 ACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG-PEQYVAIMNAIFSAQRSMV 175
                + +LS ALC I R  RS    P  R++ ++   D    QY+ ++N  ++A+   V
Sbjct: 121 ----FNNALSKALCCINR-MRSEKDLPA-RLMVMKCGGDFLSHQYLPLINCAYAAEHLKV 174

Query: 176 PIDSCYLGAQNSA-FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 234
            ID+  L A+     LQQ S +TGGV+ +  +   L Q LLT    D  +R  L      
Sbjct: 175 SIDTVVLEAETDVPVLQQMSDVTGGVYSRISEPAILLQTLLTYHLPDDDARKMLSKVPAT 234

Query: 235 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 294
            VD RA CF  +   D+GY+CSVCLS+Y      C TC SV          P +  RKR+
Sbjct: 235 PVDSRAVCFKTRAVTDIGYVCSVCLSVYKMLTPICVTCKSVLKLPGM----PPSRKRKRE 290


>gi|406602443|emb|CCH45984.1| RNA polymerase II transcription factor B subunit 4 [Wickerhamomyces
           ciferrii]
          Length = 361

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 94/294 (31%), Positives = 131/294 (44%), Gaps = 53/294 (18%)

Query: 27  WSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY------------- 71
           W S    LS    L  ++  +NA L LN  NQ+ +IA+  +   ++Y             
Sbjct: 21  WQSIEDKLSIKSMLESLVVAMNAHLALNNSNQIAIIASHTDGAKFLYPYNNQKESTSRPK 80

Query: 72  ---------------------DSSSTGNQSVG--NGRM----PSLCATLLQNLEEFMNKD 104
                                ++S T N +    N  M      +   LL+NL    N  
Sbjct: 81  TRSDELQAEDQQGIESRSISPEASQTDNNTKHYINSSMYRQFKVVDEKLLENLYNLYN-- 138

Query: 105 EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIM 164
           E +    P+       LS SLS+ L YI ++  +  L  + +IL +  S D   +Y+ IM
Sbjct: 139 EPIPTTPPKNH-----LSSSLSLGLTYINKIQSNDSLM-KAKILIVNISQDEHLKYIPIM 192

Query: 165 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 224
           N IF+AQ+  V ID C LG  N+ FLQQAS  T GV+   + +DGL QY  T    D   
Sbjct: 193 NCIFAAQKMKVSIDVCQLGL-NATFLQQASDATNGVYLHIENMDGLIQYFTTALFIDPSI 251

Query: 225 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVF 276
           +N L  P    +DFRASCF     +D+G++CSVCL I        KC  C S F
Sbjct: 252 KNILTKPNKGDIDFRASCFLTGKIVDIGFVCSVCLCILSLIPEDNKCPACDSEF 305


>gi|159490694|ref|XP_001703308.1| TFIIH 34kDa subunit [Chlamydomonas reinhardtii]
 gi|158280232|gb|EDP05990.1| TFIIH 34kDa subunit [Chlamydomonas reinhardtii]
          Length = 169

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 48/102 (47%), Positives = 71/102 (69%)

Query: 170 AQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQ 229
           AQR+ V +D+  +  Q+S+F+QQA+++TGG++ +P +  GL   LL  F  D  +R  L 
Sbjct: 39  AQRAQVVLDAVVVAPQDSSFMQQAAHLTGGLYFRPHRGRGLLGVLLNHFVCDTATRKQLD 98

Query: 230 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCST 271
             + +GVDFRASCFCHK  I+ GY+CSVCLSI+C+  + C+T
Sbjct: 99  EAQALGVDFRASCFCHKYVIETGYVCSVCLSIFCQPSRSCTT 140


>gi|149236047|ref|XP_001523901.1| hypothetical protein LELG_04714 [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146452277|gb|EDK46533.1| hypothetical protein LELG_04714 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 444

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 61/158 (38%), Positives = 85/158 (53%), Gaps = 6/158 (3%)

Query: 146 RILCLQGSP-DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 204
           RIL +  S  D    Y+ +MN IF+AQ+  V ID   LG  +S++LQQAS  T GV+ K 
Sbjct: 259 RILIVTASDNDNHLNYIPLMNCIFTAQKLKVSIDVAKLGTTDSSYLQQASDATHGVYLKI 318

Query: 205 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--Y 262
           +   G+ Q L T F  + + R  + +P    VD++ASCF  K  +D+GY+CSVCL I  +
Sbjct: 319 EDPRGMIQVLSTAFFIEPNLRPHVIMPTDYQVDYKASCFLTKKPVDLGYVCSVCLCIMSF 378

Query: 263 CKHLKKCSTCGSVFGQAQTQS---DEPSATNRKRKTTD 297
                KC  C S F Q   Q    D   A  +K++  D
Sbjct: 379 VPDDLKCPACSSEFDQGIVQELTRDPQVAVVKKKRKLD 416



 Score = 40.4 bits (93), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 33/64 (51%), Gaps = 2/64 (3%)

Query: 10 SDDVSLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSC 67
          +DD +L+ ++LD +P  W +   + S       +L FLN  L LN  NQV  IA+     
Sbjct: 20 NDDPTLLTIVLDVSPLGWYNLRHTTSLENITKSLLVFLNGHLALNNSNQVAFIASSSTGS 79

Query: 68 DYVY 71
           ++Y
Sbjct: 80 KFLY 83


>gi|380484278|emb|CCF40099.1| RNA polymerase II transcription factor B subunit 4 [Colletotrichum
           higginsianum]
          Length = 157

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 49/113 (43%), Positives = 69/113 (61%), Gaps = 3/113 (2%)

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MNA+F+A  S +PID+  L + ++ FLQQASYIT G   +     GL  YL+  +  D  
Sbjct: 1   MNAVFAAAHSQIPIDTLAL-SGDATFLQQASYITDGTFMQAASPRGLLSYLMFAYAADAE 59

Query: 224 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 274
           +R+ L  P    V FRA+CFCH   +D G++CS+CLSI+C      +C TCG+
Sbjct: 60  ARSSLIPPTHHTVXFRAACFCHGRVVDTGFVCSICLSIFCDVPEGSECLTCGT 112


>gi|357629722|gb|EHJ78327.1| putative transcription factor TFIIH-subunit [Danaus plexippus]
          Length = 282

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 74/224 (33%), Positives = 112/224 (50%), Gaps = 18/224 (8%)

Query: 3   SAPSKLYSDDVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60
            APS     D SL+V+++DTNP   + +      +  L  V+AF N+ L     NQ+ VI
Sbjct: 5   EAPS-----DSSLLVIIVDTNPNQRYITEDPKVLTGCLDAVIAFANSHLMQKSRNQLAVI 59

Query: 61  ATGYNSCDYVYDSSSTG---NQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIA 117
              ++  +Y+Y S        Q  G   + +L    ++     M     +  Q  E R  
Sbjct: 60  GCHFHKSEYLYPSPGKPLDVRQIDGQYELFTLVEKTIK-----MRLVNLIKSQPQEERPG 114

Query: 118 CSLLSGSLSMALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175
            SLL+G+++M LC+I R+ R     L    R+L + GS D   QY+  MN  F+AQ+  V
Sbjct: 115 ESLLAGAMAMGLCFIARMRREQPPGLRMVSRMLVVTGSSDTAAQYINYMNVFFTAQKQQV 174

Query: 176 PIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFG 219
            +D C L  ++ + LQQ   ITGG++ K   L+GL QYLL I+ 
Sbjct: 175 LLDVCSLD-KHLSLLQQGCDITGGLYLKVPSLEGLLQYLLVIYA 217


>gi|219113725|ref|XP_002186446.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209583296|gb|ACI65916.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 358

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 57/137 (41%), Positives = 77/137 (56%), Gaps = 9/137 (6%)

Query: 145 PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ----NSAFLQQASYITGGV 200
           PRI  +Q S D    Y A MN  F+A +  + +D C+L A+    +SAFL+QA  +TGGV
Sbjct: 206 PRIFLIQASDDRSRDYNAFMNCAFAAVKQSIVVDGCFLSAKGNHSSSAFLEQACDLTGGV 265

Query: 201 HHKPQ---QLDG-LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICS 256
              P    Q+ G L + L ++F  D+ SR  L LP    VDFRA CF     +D+ Y+C+
Sbjct: 266 FLSPSGTAQVGGALTEVLHSVFLADVPSRCLLNLPAINKVDFRARCFETAELVDLAYVCN 325

Query: 257 VCLSIYCKHLKK-CSTC 272
            CLSI+ K  K  C TC
Sbjct: 326 QCLSIFRKQPKNYCPTC 342


>gi|378726397|gb|EHY52856.1| transcription initiation factor TFIIH subunit H3 [Exophiala
           dermatitidis NIH/UT8656]
          Length = 455

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 104/392 (26%), Positives = 153/392 (39%), Gaps = 131/392 (33%)

Query: 14  SLVVVLLDTNPFFWSSSSLSFSQF--LTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ ++LDTNP  WS  S + S    + ++L F+NA L  N  N+V VIA+  +   ++Y
Sbjct: 20  SLLAIILDTNPAAWSLLSSTLSFSSAVANLLVFINAHLAANYTNKVAVIASHCDKATWLY 79

Query: 72  ------------------------DSSSTGNQSVGNG-RMPSLCATLLQNLEEFMNK--- 103
                                   DS+     +V NG +      +   + EE  NK   
Sbjct: 80  PTPTEQEPPPPLSSSSHTNNNDLTDSTKRFKLNVNNGTKNNDSIPSSSSSAEETGNKYRP 139

Query: 104 ----DEQL--------GKQEPEG--RIACSLLSGSLSMALCYIQR---------VFRSGL 140
               +E+L            P+   R   ++++G+L++AL YI R         +  SG 
Sbjct: 140 FRLVEEELIHNLTTLLASTSPDAVSRSPTTMIAGALTLALSYINRESIAYAESVIGSSGA 199

Query: 141 LHP--------------------------QPRILCLQGSP--DGPEQYVAIMNAIFSAQR 172
                                        Q RIL +  SP  D   QY+ IMNAIF+ QR
Sbjct: 200 ADTTAAATTNTTTTTGDPNSSSSNNNTSLQSRILLVSVSPSTDLAHQYIPIMNAIFACQR 259

Query: 173 SMVPIDSCYL----------------------------------GAQNSAFLQQASYITG 198
             +PID   L                                   + ++ FLQQA+  T 
Sbjct: 260 LAIPIDILQLPLPLQTNNTTTQAQQTSTNNQTPAGDNNTTTAGNTSNSTVFLQQAADATH 319

Query: 199 GVHHKPQ-------------QLDGLFQYLLTIFGTDLHSRN-FLQLPKPVGVDFRASCFC 244
           G+    Q                    YLLT       +R   L LP  + VDFRA+CFC
Sbjct: 320 GIFIAAQLPKSQSQSQASAASSQAFLTYLLTSLLPSPSTRAAHLILPTRIDVDFRAACFC 379

Query: 245 HKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 274
           H+N + +G++CS+CLSI+C       C TCG+
Sbjct: 380 HRNVVSVGFVCSICLSIFCSVPENADCLTCGT 411


>gi|37958350|gb|AAP41125.1| ZK1128.4 [Pristionchus pacificus]
          Length = 278

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 73/260 (28%), Positives = 125/260 (48%), Gaps = 31/260 (11%)

Query: 41  VLAFLNAILTLNQLNQVVVIATGYN-SCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEE 99
           ++ F N  L+L Q NQ+++ A G   S   +YDS  + N        P   + +++ +  
Sbjct: 40  IVTFSNVHLSLAQSNQLLLYAYGNGMSKKLLYDSVRSDN--------PDSSSCIIKGIRG 91

Query: 100 FMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP-------RILCLQG 152
            + ++   G         C +L+ +L+ A+C I++    G    +        RI+ +  
Sbjct: 92  ALRENSTSGDARK-----CGVLAATLATAICQIRKAATLGTNKSEETISGQKGRIVIISL 146

Query: 153 SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 212
            PD   Q+  +MN  FSA +  + ID   LG  +SA LQQA+ IT GV     +++ L +
Sbjct: 147 LPDFGSQHSLLMNLFFSAHKHDICIDVVSLGG-SSALLQQAADITAGVFQVIHKIEDLPR 205

Query: 213 YLLT--IFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCS 270
            L+T  +  +   S +F        VD+RA+C CH+  +  G+ CSVCL++ C  +  C 
Sbjct: 206 ILMTRCLPVSIRDSTSF------TTVDYRAACVCHEKLVSNGWTCSVCLAVLCSFMAICP 259

Query: 271 TCGSVFGQAQTQSDEPSATN 290
           +CG+   Q  + S E   T+
Sbjct: 260 SCGACV-QDHSSSKEEKETD 278


>gi|312385025|gb|EFR29618.1| hypothetical protein AND_01266 [Anopheles darlingi]
          Length = 516

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 73/213 (34%), Positives = 110/213 (51%), Gaps = 17/213 (7%)

Query: 12  DVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDY 69
           D SL+V++LDTNP          + +Q L  ++AF NA L     N++ V+A  +++ ++
Sbjct: 284 DSSLLVIVLDTNPSQRIIREKPHNLTQCLDSIVAFGNAHLMQKAQNKLAVLACHHHATEF 343

Query: 70  VYDSSSTG---NQSVGNGRMPSLC-ATLLQNLEEFMNKDEQLGKQEPEGRIAC-SLLSGS 124
           +Y +        Q  G   + +L   T+ Q L   +       K  P   +   SLL+GS
Sbjct: 344 LYPTPGKPLDIRQVDGQYEVFTLVEKTIKQKLANLI-------KTAPHLTLPTESLLAGS 396

Query: 125 LSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL 182
           +SMALCYI R+ R+     +   R+L + GS +   QY+  MN  F+AQ+  V +D C L
Sbjct: 397 MSMALCYIARINRNKPPGSKINSRVLVVTGSNECASQYMTYMNVFFTAQKQGVVVDVCAL 456

Query: 183 GAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL 215
               S  LQQ   ITGG + + +QLDG  QYLL
Sbjct: 457 DKALS-LLQQGCDITGGQYLRLEQLDGFLQYLL 488


>gi|367054924|ref|XP_003657840.1| hypothetical protein THITE_2123949 [Thielavia terrestris NRRL 8126]
 gi|347005106|gb|AEO71504.1| hypothetical protein THITE_2123949 [Thielavia terrestris NRRL 8126]
          Length = 459

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 60/196 (30%), Positives = 85/196 (43%), Gaps = 61/196 (31%)

Query: 139 GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 198
           GL     RIL L  S   P QY+  MNA+F+A  + + ID+  L   ++ FL+QAS+IT 
Sbjct: 207 GLAGMHARILILSVSDSSPAQYIPTMNAVFAAAHARIAIDTLSLRG-SATFLEQASFITR 265

Query: 199 GVHHKPQQLDGLFQYLLTIFGT----------------------------------DLHS 224
           G   +  +  GL QYL+  FG+                                  +   
Sbjct: 266 GTFIRAAEPRGLLQYLMFGFGSGSAPSHPAAGGGGAGGTDAGKGPASKPKTSASTANGGG 325

Query: 225 RNFLQLPKPVG------------------------VDFRASCFCHKNTIDMGYICSVCLS 260
                 PKP G                        VDFRA+CFCH+N +D G++CS+CLS
Sbjct: 326 AAGRGRPKPGGASSSSRLGLGASVAELLVTPSADAVDFRAACFCHRNVVDTGFVCSICLS 385

Query: 261 IYCK--HLKKCSTCGS 274
           I+C+     +C TCG+
Sbjct: 386 IFCEVPDGGECLTCGT 401



 Score = 45.4 bits (106), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 41/65 (63%), Gaps = 3/65 (4%)

Query: 10 SDDV-SLVVVLLDTNPFFWSSSS--LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS 66
          +DD+ SL  +++DTNP  W++ +  L  S+ + ++L F+NA L  +  NQV +IA+  N 
Sbjct: 18 TDDIPSLCTIIIDTNPRAWAALADVLPISKAIANILIFVNAHLAFSNSNQVALIASHTNR 77

Query: 67 CDYVY 71
            ++Y
Sbjct: 78 AVWLY 82


>gi|299751088|ref|XP_001830046.2| basic transcription factor 2.35KD subunit [Coprinopsis cinerea
           okayama7#130]
 gi|298409214|gb|EAU91711.2| basic transcription factor 2.35KD subunit [Coprinopsis cinerea
           okayama7#130]
          Length = 280

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 73/218 (33%), Positives = 109/218 (50%), Gaps = 16/218 (7%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSS-----LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGY 64
           +D  +L+ V+LD +P  W  ++     ++ + FL  +LAFLNA +     N + V     
Sbjct: 2   NDHGTLLAVVLDLSPAQWHRAANDPQPITLNAFLAQLLAFLNAHIACMHENTLAVFGAFP 61

Query: 65  NSCDYVY---DSSSTGNQSVGNGRMPSLCATLLQN--LEEFMNKDEQLGKQEPEGRIACS 119
                +Y   D   T + SV     P     LL N  +     + + L     E  + C+
Sbjct: 62  GKSVMLYASTDQHQTASISVDANSYPPF--KLLDNTVVTSIAAELDALDDSHDEEEVPCA 119

Query: 120 LLSGSLSMALCYIQRVFR-SGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 176
           L+ G+L+ ALC    +   S  + PQ  PRIL L  SPD    Y+ IMNAIFSAQ+  V 
Sbjct: 120 LV-GALTKALCSTTPLPNGSQGVPPQLDPRILILSVSPDLSTSYIPIMNAIFSAQKLKVT 178

Query: 177 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYL 214
           ID+C +    + FLQQA+++TGG +   ++ D L QYL
Sbjct: 179 IDACQIYGGETVFLQQAAHLTGGSYLLLERRDALLQYL 216


>gi|195447228|ref|XP_002071121.1| GK25630 [Drosophila willistoni]
 gi|194167206|gb|EDW82107.1| GK25630 [Drosophila willistoni]
          Length = 310

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/174 (33%), Positives = 85/174 (48%), Gaps = 10/174 (5%)

Query: 97  LEEFMNKDEQ-LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS---GL-----LHPQPRI 147
           + E M  D Q +         + SLL+G +  ALCYIQR  R    GL     L  + R+
Sbjct: 134 IAELMRSDAQKVNYNSNAQNTSTSLLAGGIGKALCYIQRRRREITRGLAKGVQLRIEARV 193

Query: 148 LCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQ 206
           L +  +  G  Q +  M+    A+ S V +D C L  A+  +  +QA+ +TGG ++    
Sbjct: 194 LIVTATDPGSNQVINYMHMFNGAKSSRVALDVCILDRAETLSIWRQATALTGGFYYSTND 253

Query: 207 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS 260
              L   LL IF      R     P+   VD+R +C CHKN +D+G++CS CLS
Sbjct: 254 FTDLLTRLLGIFLMSPVERFQFNYPEQPQVDYRPNCVCHKNVLDIGFVCSACLS 307


>gi|298711517|emb|CBJ26605.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 223

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 68/218 (31%), Positives = 106/218 (48%), Gaps = 33/218 (15%)

Query: 11  DDVSLVVVLLDTNPFFWSSSSL-----------------SFSQFLTHVLAFLNAILTLNQ 53
           ++ SL++++LD +  FW+                     SF   +   L F+ + L +++
Sbjct: 8   EEPSLLLIILDASSAFWTKREAVRRRQKERPESGEVKLSSFQDTVEATLMFVESYLMMHR 67

Query: 54  LNQVVVIATGYNSCDYVYDSSSTGNQ--SVGNGRMPS------LCATLLQNLEEFMNKDE 105
            N+V  +A        ++ + +   +  ++G G + +      L   L + +E     DE
Sbjct: 68  RNEVCFVACTARESTIIFPTPAMEREQRTLGRGSVSTKEFQDCLWDGLGRVVERGHAADE 127

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMN 165
             G  E      C+L +GSLS  LCYI R  R      Q R L L GSPD PE Y ++MN
Sbjct: 128 AGGGDE-----GCAL-AGSLSKGLCYINRKMRESS-SIQARALVLSGSPDVPETYNSVMN 180

Query: 166 AIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 203
           AIFSAQ++ V +D   LG ++S FLQQA+Y+TG    K
Sbjct: 181 AIFSAQKAGVLVDCAVLG-ESSTFLQQAAYLTGKGARK 217


>gi|169607795|ref|XP_001797317.1| hypothetical protein SNOG_06960 [Phaeosphaeria nodorum SN15]
 gi|160701493|gb|EAT85611.2| hypothetical protein SNOG_06960 [Phaeosphaeria nodorum SN15]
          Length = 312

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 79/253 (31%), Positives = 123/253 (48%), Gaps = 45/253 (17%)

Query: 14  SLVVVLLDTNPFFWS--SSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ V+LDTNP  W+  SSS+S S  L ++L F+NA L     N+      G    +   
Sbjct: 20  SLLAVVLDTNPHAWAHLSSSISLSAALANILVFINAHLASGNANESRNGTNGDVEMNGAE 79

Query: 72  DSSSTGNQSVGNGRMPSLC-ATLLQNLEEFMNKDEQLGKQEPEGRIACS---LLSGSLSM 127
           D+     ++    R  +L  + +L+N  + +       ++  E  +A +   L+ G++SM
Sbjct: 80  DNKPYIAENPNKYRAFALVESAILKNFAKLL-------EETNESHLAATPTTLIGGAISM 132

Query: 128 ALCYIQRVFRSGLLHP----------------------------QPRILCLQGSPDGPEQ 159
           AL YI +   S +LH                               RIL +  S D   Q
Sbjct: 133 ALSYINK---STILHAPTGASAEITSVAAMADTDNSTHLDRIALTSRILIVSVSGDLANQ 189

Query: 160 YVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFG 219
           Y+ +MN+IF+AQR  +PID   L A ++  LQQAS  TGGV+ KP++ +GL QYL+  + 
Sbjct: 190 YIPVMNSIFAAQRKKIPIDILKL-AGDTVLLQQASDATGGVYMKPERPEGLLQYLMMAYL 248

Query: 220 TDLHSRNFLQLPK 232
            D  +R  L +P+
Sbjct: 249 PDATARKSLIIPR 261


>gi|336468231|gb|EGO56394.1| hypothetical protein NEUTE1DRAFT_147077 [Neurospora tetrasperma
           FGSC 2508]
 gi|350289522|gb|EGZ70747.1| transcription factor Tfb4 [Neurospora tetrasperma FGSC 2509]
          Length = 439

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 84/183 (45%), Gaps = 55/183 (30%)

Query: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 205
           RIL +  S   P QY+  MNA+F+A  + + ID+  L   ++ FLQQASYIT G   + Q
Sbjct: 223 RILVVSVSDSSPSQYIPTMNAVFAAAHARIAIDTLALRG-SATFLQQASYITRGTFIRAQ 281

Query: 206 QLDGLFQYLLTIFGTDL---------HSR------------------------------- 225
           +  GL QYL+  FG+           H R                               
Sbjct: 282 EPRGLLQYLMFGFGSGSAPQGLAAAGHQRSQQQHGGNKGKEDKASEKAVGAAAKGGAKHG 341

Query: 226 ------------NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCST 271
                       + L  P    VDFRA+CFCH+N ID G++CS+CLSI+C+     +C T
Sbjct: 342 GQKFLGSNASVADLLVTPSADSVDFRAACFCHRNVIDTGFVCSICLSIFCEVPEGGECLT 401

Query: 272 CGS 274
           CG+
Sbjct: 402 CGT 404



 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 46/81 (56%), Gaps = 7/81 (8%)

Query: 10 SDDV-SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS 66
          +DD+ SL  +++DTNP  W++    L  ++ L ++L F+N+ L  N  NQV +IA+  N 
Sbjct: 18 TDDIPSLYTIIIDTNPRAWAALNDVLPLTKALANILIFVNSHLAFNNSNQVALIASHTNR 77

Query: 67 CDYVYDSSSTGNQSVGNGRMP 87
            ++Y ++   N     GR P
Sbjct: 78 AVWLYPTAPDQNP----GRQP 94


>gi|116204609|ref|XP_001228115.1| hypothetical protein CHGG_10188 [Chaetomium globosum CBS 148.51]
 gi|88176316|gb|EAQ83784.1| hypothetical protein CHGG_10188 [Chaetomium globosum CBS 148.51]
          Length = 408

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 92/353 (26%), Positives = 137/353 (38%), Gaps = 93/353 (26%)

Query: 10  SDDV-SLVVVLLDTNPFFWSSSS--LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS 66
           +DD+ SL  +++DTNP  W++ +  L  S+ + ++L F+NA L  +  NQV +IA   N 
Sbjct: 18  TDDIPSLCTIVIDTNPRAWAALTDVLPISKAIANILVFVNAHLAFSNSNQVSIIAAHTNR 77

Query: 67  CDYVYDSSSTGNQS-----------------VGNGRMPSLCATLLQNLEEFMNK--DEQL 107
             ++Y +     +S                     +   +  +LL +L   +N   D  L
Sbjct: 78  AVWLYPAPPRQGRSQDVDMQNAASDSKSSSANKYPQFAQIENSLLTSLRALINDTTDADL 137

Query: 108 GKQEPEGRIACSLLSGSLSMALCYIQRVFRS----------------------------- 138
                +       +SG+L++AL +I +   S                             
Sbjct: 138 ASTTTQ-------ISGALTLALAHINKTALSFTASSTAAHAATNTGTAMTAGTAVGPNPV 190

Query: 139 --------GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFL 190
                   GL     RIL L  S   P QY+  MNA+F+A R+ + ID+  L A     L
Sbjct: 191 SSTSMNGGGLAGLHARILILSVSDSAPAQYIPTMNAVFAAARARIAIDTLSLRAAEPRGL 250

Query: 191 QQASYITGGVHHKPQQLDG--------------LFQYLLTIFGTDLHSR----------- 225
            Q      G    P    G                    T      H R           
Sbjct: 251 LQYLMFGFGSGTAPSNPSGGADSGKGPASSKTATAGGQGTTAAGGRHPRTAKLGLGASVA 310

Query: 226 NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGSVF 276
             L  P    VDFRA+CFCH+N ID G++CSVCLSI+C+     +C TCG+  
Sbjct: 311 ELLVTPSADAVDFRAACFCHRNVIDTGFVCSVCLSIFCEVPDGGECLTCGTAL 363


>gi|380477141|emb|CCF44315.1| RNA polymerase II transcription factor B subunit 4 [Colletotrichum
           higginsianum]
          Length = 255

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 72/230 (31%), Positives = 106/230 (46%), Gaps = 50/230 (21%)

Query: 14  SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY 71
           SL+ +++DTNP  W++    L  S+ + +VL F+NA L  +  NQV + A+  N   ++Y
Sbjct: 20  SLLSIVIDTNPRAWAALNDVLPLSKAIANVLVFVNAHLAFSNANQVAIFASHSNRAVWLY 79

Query: 72  ----------------DSSSTG--------NQSVGNGRMPSLCATLLQNLEEFMNKDEQL 107
                           D + TG        NQS  N + P       Q     M+   +L
Sbjct: 80  PAKPEPATANGAAASEDVAMTGVDAFAPTPNQSSAN-KFPQFA----QIESAVMSSMRKL 134

Query: 108 GKQEPEGRIACSL--LSGSLSMALCYIQ----------------RVFRSGLLHPQPRILC 149
                E  +AC+   LSG+L++AL +I                 R   S L   + RI  
Sbjct: 135 VDATTEAXLACTTTQLSGALTLALAHINKTSLSLNETNKAPDVPRSTSSALSRLRARIFI 194

Query: 150 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGG 199
           L  S   P QY++ MNA+F+A  S +PID+  L   ++ FLQQASYIT G
Sbjct: 195 LSVSDSEPVQYISTMNAVFAAAHSQIPIDTLALSG-DATFLQQASYITDG 243


>gi|85078711|ref|XP_956217.1| hypothetical protein NCU01580 [Neurospora crassa OR74A]
 gi|16416034|emb|CAB91384.2| related to basic transcription factor 2 (35 kd SU) [Neurospora
           crassa]
 gi|28917270|gb|EAA26981.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 439

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 59/183 (32%), Positives = 83/183 (45%), Gaps = 55/183 (30%)

Query: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 205
           RIL +  S   P QY+  MNA+F+A  + + ID+  L   ++ FLQQASYIT G   + Q
Sbjct: 223 RILVVSVSDSSPSQYIPTMNAVFAAAHARIAIDTLALRG-SATFLQQASYITRGTFIRAQ 281

Query: 206 QLDGLFQYLLTIF----------------------------------------------- 218
           +  GL QYL+  F                                               
Sbjct: 282 EPRGLLQYLMFGFGSGSAPQGLAAAGNQRSQQQHGGHKGKEDKASEKALGAAAKGGAKHG 341

Query: 219 -----GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCST 271
                G++    + L  P    VDFRA+CFCH+N ID G++CS+CLSI+C+     +C T
Sbjct: 342 GQKFLGSNASVADLLVTPSADSVDFRAACFCHRNVIDTGFVCSICLSIFCEVPEGGECLT 401

Query: 272 CGS 274
           CG+
Sbjct: 402 CGT 404



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 46/81 (56%), Gaps = 7/81 (8%)

Query: 10 SDDV-SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS 66
          +DD+ SL  +++DTNP  W++    L  ++ L ++L F+N+ L  N  NQV +IA+  N 
Sbjct: 18 TDDIPSLYTIIIDTNPRAWAALNDVLPLTKALANILIFVNSHLAFNNSNQVALIASHTNR 77

Query: 67 CDYVYDSSSTGNQSVGNGRMP 87
            ++Y ++   N     GR P
Sbjct: 78 AVWLYPTAPDQNP----GRHP 94


>gi|308501315|ref|XP_003112842.1| hypothetical protein CRE_25211 [Caenorhabditis remanei]
 gi|308265143|gb|EFP09096.1| hypothetical protein CRE_25211 [Caenorhabditis remanei]
          Length = 317

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 74/274 (27%), Positives = 117/274 (42%), Gaps = 38/274 (13%)

Query: 42  LAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFM 101
           ++F NA L  +  NQ++V A G N    +  SS+  +    +  + +    LL N  +  
Sbjct: 56  VSFCNAHLGQSASNQLLVFAYGRNESKNMIYSSARSDDRNASFIVVNRLKDLLTN--DSG 113

Query: 102 NKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP------------------ 143
           N D  +G            L  +L+ A C++++  R     P                  
Sbjct: 114 NSDATIGAP----------LGPALAHAFCHMKKDSRITTADPCDDSLGPAATSEQSESST 163

Query: 144 ---QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQASYITGG 199
                R + +  SP    ++ ++MN  FSA +  + +D   +G       LQQA+ ITGG
Sbjct: 164 EKVTNRAVVISMSPIMGSEHGSLMNLFFSAAKQSICVDVVSMGDDFTGGILQQAADITGG 223

Query: 200 VHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCL 259
                Q+   L + L+T   TD   R          VD+RASC CH   +  G++CSVCL
Sbjct: 224 SFLHAQKPPALLKILMTNMLTDPSYRTAFSKLSHSSVDYRASCACHHKLVSSGWVCSVCL 283

Query: 260 SIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKR 293
           S+ C++   C  C + F    T ++ P   NRKR
Sbjct: 284 SVLCQYTPICKVCRAAF----TITNIPKKPNRKR 313


>gi|341877621|gb|EGT33556.1| hypothetical protein CAEBREN_10115 [Caenorhabditis brenneri]
          Length = 311

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 73/278 (26%), Positives = 124/278 (44%), Gaps = 41/278 (14%)

Query: 42  LAFLNAILTLNQLNQVVVIATGYN-SCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEF 100
           ++F NA L  +  NQ++V A G N     +Y S+ + +++     +  L   L+   ++ 
Sbjct: 51  VSFCNAHLGQSSSNQLLVFAYGRNVEKKMIYSSARSEDRNASRIVVDRLRQLLI---DDS 107

Query: 101 MNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR----------------------S 138
            N D  +G          + L  +L+ A C++++  R                      +
Sbjct: 108 NNSDAIIG----------APLGPALAHAFCHMKKDSRVTTADPCDDSLGPASSNSEQSDA 157

Query: 139 GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQASYIT 197
               P  R + +  +     ++ ++MN  FSA + ++ +D   LG       LQQA+ IT
Sbjct: 158 SAEKPTCRAVVISITQIMGSEHGSLMNLFFSAAKQLICVDVVSLGDDFTGGILQQAADIT 217

Query: 198 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 257
           GG     Q+   L + L+T   TD   R+         VD+RASC CH+  +  G++CSV
Sbjct: 218 GGSFMHAQKPQALLKILMTNMLTDPTYRSAFSKLSHNSVDYRASCVCHQTLVSSGWVCSV 277

Query: 258 CLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRKT 295
           CLS+ C++   C  C + F    T  + P   NRKRK+
Sbjct: 278 CLSVLCQYTPICKVCKAAF----TIRNIPKKPNRKRKS 311


>gi|402595070|gb|EJW88996.1| hypothetical protein WUBG_00095 [Wuchereria bancrofti]
          Length = 274

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 66/242 (27%), Positives = 117/242 (48%), Gaps = 25/242 (10%)

Query: 41  VLAFLNAILTLNQLNQVVVIAT-GYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEE 99
           + ++  A ++L+ +N +VVI   G  S   +Y +S++ +  + +    S+   L ++   
Sbjct: 35  ITSYTTAHMSLSAMNGIVVIGVDGTLSEPTIYTTSASSDIDMSSAIKTSIQNALKKSASS 94

Query: 100 FMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQ 159
               D              +L + +++ A+C+I R F++ +     RIL +    D   +
Sbjct: 95  TNTTDS-------------ALFAPAIATAICHIFR-FKNEVDKGDGRILVINIGSDLLGE 140

Query: 160 YVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL--TI 217
           +  +MN  F+A +  + ID   +G + S  LQQAS ITGG +   ++   L +Y +  T+
Sbjct: 141 HNILMNIFFAAHKHSILIDVANIG-ETSPILQQASDITGGTYFNVKKPKQLLKYTMCFTL 199

Query: 218 FGTDLHSRNFLQLPKP---VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGS 274
               L S      P P     +D+RASC CH   + +G++CSVCLS+ C     C  C +
Sbjct: 200 GRASLRS----AFPSPSSSTSIDYRASCHCHGAPVSVGWVCSVCLSVQCHFSPICPACNT 255

Query: 275 VF 276
           VF
Sbjct: 256 VF 257


>gi|407044375|gb|EKE42557.1| hypothetical protein ENU1_017690 [Entamoeba nuttalli P19]
          Length = 263

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 72/266 (27%), Positives = 126/266 (47%), Gaps = 29/266 (10%)

Query: 18  VLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTG 77
           +LLD   F  ++  L+  +   +V    + I  L +++    I T       VY   S  
Sbjct: 19  ILLDATFFVHNNRKLNVEEIREYVNIINSFIRVLKEIHDNSSIFT------IVYAGKSVF 72

Query: 78  NQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR 137
            +    G    L  ++ + ++E +N  ++    EP        LS ++S+AL  I +   
Sbjct: 73  VKESDRGE---LTTSIAEAIKEVLN--DKFLVNEPN-------LSSAMSLALTTINK--- 117

Query: 138 SGLLHP--QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNS--AFLQQA 193
            G L P  Q +I+ +    +   +++ +MN +F+AQR    ID+  L    +   F QQA
Sbjct: 118 DGKLQPDVQKKIIVVSTPFESSPKFIPLMNCVFAAQRIKTTIDTIVLSVIETRCTFCQQA 177

Query: 194 SYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP--KPVGVDFRASCFCHKNTIDM 251
           SY+T G+++    L  LF  LL+   +D  +   +++   +   +D++  CF  K  +D 
Sbjct: 178 SYLTNGIYNTAT-LQSLFPRLLSNNTSDCDTEKIIRVTGNQQPAIDWKLWCFHCKKPLDH 236

Query: 252 GYICSVCLSIYCKHLK-KCSTCGSVF 276
           GY+CS C  IYC+  K +C TCG+ F
Sbjct: 237 GYVCSSCFGIYCEKQKGRCPTCGAYF 262


>gi|167391336|ref|XP_001739730.1| RNA polymerase II transcription factor B subunit [Entamoeba dispar
           SAW760]
 gi|165896489|gb|EDR23894.1| RNA polymerase II transcription factor B subunit, putative
           [Entamoeba dispar SAW760]
          Length = 249

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 70/266 (26%), Positives = 126/266 (47%), Gaps = 29/266 (10%)

Query: 18  VLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTG 77
           +LLD   F  ++  L+  +   +V    + I  L +++    + T       VY   S  
Sbjct: 5   ILLDATFFVHNNRKLNVEEIREYVNIINSFIRVLKEIHDNSSVFT------IVYAGKSVF 58

Query: 78  NQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR 137
            +    G    L   +++ ++E +N  ++    EP        LS ++S+AL  I +   
Sbjct: 59  VKESDRGE---LTTPIVEAIKEVLN--DKFLVNEPN-------LSSAMSLALTTINK--- 103

Query: 138 SGLLHP--QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSA--FLQQA 193
            G L P  Q +++ +    +   +++ +MN +F+AQR    ID+  L    +   F QQA
Sbjct: 104 DGKLQPDIQKKVIVISTPFESSPKFIPLMNCVFAAQRIKTTIDTIVLSIIETGCTFCQQA 163

Query: 194 SYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP--KPVGVDFRASCFCHKNTIDM 251
           SY+T G+++    L  LF  LL+   +D  +   +++   +   +D++  CF  K  +D 
Sbjct: 164 SYLTNGIYNTAP-LQSLFPRLLSNNTSDCDTEKIIRVTGNQQSAIDWKLWCFHCKKPLDH 222

Query: 252 GYICSVCLSIYC-KHLKKCSTCGSVF 276
           GY+CS C  IYC K  ++C TCG+ F
Sbjct: 223 GYVCSSCFGIYCEKQKERCPTCGAYF 248


>gi|124513782|ref|XP_001350247.1| conserved Plasmodium protein, unknown function [Plasmodium
           falciparum 3D7]
 gi|23615664|emb|CAD52656.1| conserved Plasmodium protein, unknown function [Plasmodium
           falciparum 3D7]
          Length = 326

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 74/306 (24%), Positives = 136/306 (44%), Gaps = 48/306 (15%)

Query: 16  VVVLLDTNPFFWSS-----------SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGY 64
           +++++D N   W              +L   +FL  V  F+     +    ++ +I+T  
Sbjct: 28  LILIIDVNLLIWCEGVQIKFENNVIRTLKLHEFLKSVFQFIRFYCIMCNSERICIISTCS 87

Query: 65  NSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGS 124
           N+C  +Y++  +   +  N      C+     L +F+N++++  K E       S LS +
Sbjct: 88  NNCKIIYENYISF--AKNNLTEKDFCSDTYDKLVDFINENKKEKKSE-------SALSSA 138

Query: 125 LSMALCYIQRVFRSGLLHPQPRILCLQGSPDG--PEQYVAIMNAIFSAQRSMVPIDSCYL 182
           L++ALCY  R+ R+   +   R+  L  S       QY  +MN  ++A+R+ + ID   L
Sbjct: 139 LTIALCYNNRI-RNLYENINTRLFLLDISKSHYYTNQYTQLMNIAYNAKRNNIIIDVFSL 197

Query: 183 GAQNSAFLQQASYITGGVHHKPQQL----------DGLFQYLLTIFGTDLHSRNFLQ--- 229
             + +  L+Q   IT G++                D L Q ++  F   +++R +     
Sbjct: 198 NYK-TQILEQICNITNGLYLDNTIFQSINPYGNIQDILTQTIIFWFLPSVNTRKYFSNTY 256

Query: 230 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--KHLK------KCSTCGSVFGQAQT 281
           L +   +   A C CH   ID+ YICS CL+IYC  K+L+       C+ C + F ++  
Sbjct: 257 LNEDTNI---AVCTCHNKQIDIAYICSCCLAIYCSEKNLQTNKDRLSCAICKTRFTKSLL 313

Query: 282 QSDEPS 287
           ++   S
Sbjct: 314 RNKHVS 319


>gi|67468778|ref|XP_650390.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56467011|gb|EAL45004.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
          Length = 263

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 69/269 (25%), Positives = 129/269 (47%), Gaps = 35/269 (13%)

Query: 18  VLLDTNPFFWSSSSLS---FSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSS 74
           +LLD   F  ++  L+     +++  + +F+  +  ++  + + +I         VY   
Sbjct: 19  ILLDATFFVHNNRKLNVEEIREYVNIINSFIRVLKEIHDNSSIFII---------VYAGK 69

Query: 75  STGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQR 134
           S   +    G    L  ++ + ++E +N  ++    EP        LS ++S+AL  I +
Sbjct: 70  SVFVKESDRGE---LTISIAEAIKEVLN--DKFLVNEPN-------LSSAMSLALTTINK 117

Query: 135 VFRSGLLHP--QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNS--AFL 190
               G L P  Q +I+ +    +   +++ +MN +F+AQR    ID+  L    +   F 
Sbjct: 118 ---DGKLQPDIQKKIIIVSTPFESSPKFIPLMNCVFAAQRIKTTIDTIVLSVIETRCTFC 174

Query: 191 QQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP--KPVGVDFRASCFCHKNT 248
           QQASY+T G+++    L  LF  LL+   +D  +   +++   +   +D++  CF  K  
Sbjct: 175 QQASYLTNGIYNTAT-LQSLFPRLLSNNTSDCDTEKIIRVTGNQQPAIDWKLWCFHCKKP 233

Query: 249 IDMGYICSVCLSIYCKHLK-KCSTCGSVF 276
           +D GY+CS C  IYC+  K +C TCG+ F
Sbjct: 234 LDHGYVCSSCFGIYCEKQKGRCPTCGAYF 262


>gi|123976711|ref|XP_001314576.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121897129|gb|EAY02259.1| hypothetical protein TVAG_030750 [Trichomonas vaginalis G3]
          Length = 284

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 73/288 (25%), Positives = 124/288 (43%), Gaps = 31/288 (10%)

Query: 14  SLVVVLLDTNPFFWSS-SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYD 72
             + +L+D NP  W     + F  F+ H+  +L  ++  + +    V+A   +  ++++ 
Sbjct: 7   EFLAILIDVNPLEWKEMKDIKFDGFMNHLFLYLKQMILSDCILPPCVLAYNQSHAEFIFP 66

Query: 73  SSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPE--GRIAC------SLLSGS 124
                   +     P+       N E+  N  E +  Q  E  GR A         L  +
Sbjct: 67  EPGRTKDVINQKFNPT-------NQEQIKNYFENIIIQIAEFNGRSASIPPTPHVRLDVA 119

Query: 125 LSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA 184
           LS +LC +  +    +   + RIL    SP+   QY    N +F+A+R  V ID  +L +
Sbjct: 120 LSKSLCLMNNLPNKNI---KKRILAFSVSPETGSQYD---NLLFAAKRLDVVIDVLFLNS 173

Query: 185 QNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFC 244
               FL QA+ +T G     +    L QYL ++  T +  R+ + LP P  +++ A    
Sbjct: 174 NRLCFLSQAADLTHGFSKTIKMGAALAQYLFSLPPTTV--RDLMALPAPEPLEYSAPSAD 231

Query: 245 HKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFG-------QAQTQSDE 285
               I  GYIC VCLSI  +  + C  C + +G       Q + +S+E
Sbjct: 232 THKMISRGYICPVCLSILEEACRTCPICNTRYGSVNFYINQTEFKSEE 279


>gi|241605964|ref|XP_002405599.1| RNA polymerase II transcription initiation factor, putative [Ixodes
           scapularis]
 gi|215500649|gb|EEC10143.1| RNA polymerase II transcription initiation factor, putative [Ixodes
           scapularis]
          Length = 291

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 67/206 (32%), Positives = 103/206 (50%), Gaps = 18/206 (8%)

Query: 8   LYSDDVSLVVVLLDTNP--FFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
           L  D+ SL+V ++DTNP      S     S+ L  +  F N+ L LN  N++ +IA+  +
Sbjct: 1   LAEDEGSLLVAIIDTNPCASLLESDQGIVSKLLDALSVFCNSHLMLNPCNKLAIIASHSH 60

Query: 66  SCDYVY---DSSSTGNQSVGNGR---MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACS 119
              ++Y     SS+   SV +G+      +   +   ++E +  D+       E     S
Sbjct: 61  KSTFIYPKPQESSSDTYSV-DGQYELFTEVTGAIKDGVKELVLSDDS------ESAAGES 113

Query: 120 LLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPI 177
           LL+G LS+ALCYI R+ +      +   RIL L  S +   QY+  MN  F+AQ+  V I
Sbjct: 114 LLTGGLSLALCYINRIEKESSSQNKIASRILVLSASGESASQYLNFMNVFFTAQKKNVII 173

Query: 178 DSCYLGAQNSAFLQQASYITGGVHHK 203
           D+C L  ++S  LQQ   ITGG + K
Sbjct: 174 DACVL-EKDSGLLQQGCDITGGKYMK 198


>gi|393909857|gb|EFO22116.2| basic transcription factor 2 [Loa loa]
          Length = 274

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 52/163 (31%), Positives = 88/163 (53%), Gaps = 11/163 (6%)

Query: 119 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 178
           ++ + ++++A+C+I R +++ + +   RIL +    D   ++  +MN  F+A +  + ID
Sbjct: 101 AVFAPAVAIAICHIFR-YKNEVDNGDGRILIINIGSDLRGEHSILMNIFFAAHKHNILID 159

Query: 179 SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL--TIFGTDLHSRNFLQLPKP--- 233
              +G + S  LQQAS ITGG +   ++   L +Y +  T+    L S      P P   
Sbjct: 160 VANIG-EASTILQQASDITGGTYFSVKKPKQLLKYTMCFTLGKASLRS----AFPSPSSS 214

Query: 234 VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
             VD+RASC CH   + +G++CSVCLS+ C     C  C +VF
Sbjct: 215 TSVDYRASCHCHGAPVSIGWVCSVCLSVQCHFNPICPACNTVF 257


>gi|392896378|ref|NP_499249.2| Protein ZK1128.4 [Caenorhabditis elegans]
 gi|224490453|emb|CAA87423.2| Protein ZK1128.4 [Caenorhabditis elegans]
          Length = 296

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 78/309 (25%), Positives = 135/309 (43%), Gaps = 45/309 (14%)

Query: 13  VSLVVVLLDTNPFFW-----SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN-S 66
           +S + VL++T+   W     +    +    L  +++F NA L  +  NQ++V A G N  
Sbjct: 1   MSTLSVLIETSSCSWGMLASAHGDRTIGIILRAIVSFCNAHLGQSANNQLLVFAYGRNVD 60

Query: 67  CDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLS 126
              +Y S+   +++            +++ L E ++ D  L      G    + L  +L+
Sbjct: 61  NKMIYSSTRCEDRNAS--------FLVVKRLRELLSSDA-LTNDATIG----APLGPALA 107

Query: 127 MALCYIQRVFR-------SGLLHPQP--------------RILCLQGSPDGPEQYVAIMN 165
            A C++++  R          L PQ               R + +  +P    ++ ++MN
Sbjct: 108 HAFCHMKKDSRVVTADPCDDSLGPQTTSEQSETASEKATNRAVVISITPIMGSEHGSLMN 167

Query: 166 AIFSAQRSMVPIDSCYLGAQ-NSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 224
             FSA +  + +D   +G       LQQA+ ITGG     ++   L + L+T   TD   
Sbjct: 168 LFFSAAKQSICVDVVSMGDDFTGGILQQAADITGGSFLHAKKPQTLLKILMTNMLTDPTH 227

Query: 225 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSD 284
           R          VD+RASC CH   +  G++CS+CLS+ C++   C  C + F    T ++
Sbjct: 228 RAVFSKLSHNSVDYRASCACHHQLVSSGWVCSICLSVLCQYTPICKVCKAAF----TIAN 283

Query: 285 EPSATNRKR 293
            P   NRKR
Sbjct: 284 LPIKPNRKR 292


>gi|170591206|ref|XP_001900361.1| basic transcription factor 2, 35 kD subunit [Brugia malayi]
 gi|158591973|gb|EDP30575.1| basic transcription factor 2, 35 kD subunit, putative [Brugia
           malayi]
          Length = 274

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 68/242 (28%), Positives = 116/242 (47%), Gaps = 25/242 (10%)

Query: 41  VLAFLNAILTLNQLNQVVVIAT-GYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEE 99
           + ++  A ++L+ +N +VVI   G  S   +Y +S++ +       M S   T +QN  +
Sbjct: 35  ITSYTTAHMSLSAMNGIVVIGVDGTLSEPTIYTTSASSDID-----MSSAVKTSIQNALK 89

Query: 100 FMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQ 159
                              +L + +++ A+C+I R F++ +     RIL +    D   +
Sbjct: 90  KSASSTSTTDS--------ALFAPAIATAICHIFR-FKNEVDKGDGRILIINIGSDLFGE 140

Query: 160 YVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL--TI 217
           +  +MN  F+A +  + ID   +G + S  LQQAS ITGG +   ++   L +Y +  T+
Sbjct: 141 HNILMNIFFAAHKHNILIDVANIG-ETSPILQQASDITGGTYFNVKKPKQLLKYTMCFTL 199

Query: 218 FGTDLHSRNFLQLPKP---VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGS 274
               L S      P P     +D+RASC CH   + +G++CSVCLS+ C     C  C +
Sbjct: 200 GRASLRS----AFPSPSSSTSIDYRASCHCHGAPVSVGWVCSVCLSVQCHFSPICPACNT 255

Query: 275 VF 276
           VF
Sbjct: 256 VF 257


>gi|351713028|gb|EHB15947.1| General transcription factor IIH subunit 3 [Heterocephalus glaber]
          Length = 89

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 36/70 (51%), Positives = 49/70 (70%)

Query: 207 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 266
           +  L QYLL +F +D   R  L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C   
Sbjct: 1   MASLLQYLLWVFLSDQDQRAQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFS 60

Query: 267 KKCSTCGSVF 276
             C+TC + F
Sbjct: 61  PICTTCETAF 70


>gi|268573188|ref|XP_002641571.1| Hypothetical protein CBG09872 [Caenorhabditis briggsae]
          Length = 299

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 78/284 (27%), Positives = 128/284 (45%), Gaps = 47/284 (16%)

Query: 37  FLTHVLAFLNAILTLNQLNQVVVIATGYN-SCDYVYDSSSTGNQSVGNGRMPSLCATLLQ 95
            L  +++F NA L  +  NQ++V A G N     +Y S+ + +++V +  +  L   LLQ
Sbjct: 32  ILRAIVSFCNAHLGQSASNQLLVFAFGRNVEKKLIYSSARSEDRNVSSVVVDHL-RDLLQ 90

Query: 96  NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR------------------ 137
           +  +  N D  +G   P G         +L+ A C++++  +                  
Sbjct: 91  S--DSGNSDATMG--APIG--------PALAHAFCHMKKDSQLATADICDDSLGPAASNT 138

Query: 138 ----SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQ 192
               S       R + +  +P    ++ ++MN  FSA +  + +D   +        LQQ
Sbjct: 139 EKSDSSTEKVSTRAVVISMTPIIGSEHGSLMNLFFSAAKQSICVDVVSMSDDLTGGILQQ 198

Query: 193 ASYITGG--VH-HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTI 249
           A+ ITGG  VH  KPQ    L + L+T   TD   RN         VD+RASC CH   +
Sbjct: 199 AADITGGSFVHAEKPQ---ALLKILMTNMLTDASYRNTFSKLSYCSVDYRASCACHHTLV 255

Query: 250 DMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKR 293
             G++CS+CLS+ C++   C  C + F    T ++ P   +RKR
Sbjct: 256 SSGWVCSICLSVLCQYTPICKVCKAAF----TITNLPKKPSRKR 295


>gi|224006638|ref|XP_002292279.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971921|gb|EED90254.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 409

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 64/189 (33%), Positives = 94/189 (49%), Gaps = 16/189 (8%)

Query: 100 FMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQ 159
             N    L +++ EG IA  L+ GS S      +      LL   PR L +Q S D    
Sbjct: 208 LANSSSMLQRKDDEGVIA--LMGGSGSSNFKQQELQHSKSLL--SPRALIIQSSSDRTSD 263

Query: 160 YVAIMNAIFSAQRSMVPIDSCYL------GAQNSAFLQQASYITGGVH----HKPQQLDG 209
           Y A+MN  F+A +  + ID C++       A++S +L+Q    T GV+        Q+ G
Sbjct: 264 YNALMNCAFAANKQNIVIDGCFIPSGMKDDAKSSPYLEQIVDQTRGVYLSVPSGAAQVGG 323

Query: 210 -LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 268
            L + L+++F      R  + LPK   VDFRA CF    +ID+  IC+ CLSI+ +  K+
Sbjct: 324 ALSEVLISVFLPPPLLRKEMNLPKLTKVDFRARCFETGESIDVAQICNQCLSIFKERPKE 383

Query: 269 -CSTCGSVF 276
            C TCG+V 
Sbjct: 384 FCLTCGAVV 392


>gi|156373212|ref|XP_001629427.1| predicted protein [Nematostella vectensis]
 gi|156216427|gb|EDO37364.1| predicted protein [Nematostella vectensis]
          Length = 190

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 65/210 (30%), Positives = 101/210 (48%), Gaps = 35/210 (16%)

Query: 14  SLVVVLLDTNPFFW-------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS 66
           SL+VV+ D NP  W       + + +S ++ L  ++   N+ L L   N +  +A   N 
Sbjct: 8   SLLVVIFDANPVAWGGIAVEGNETQISLTRCLESLMVLCNSYLLLKHNNLLAFVAASTNG 67

Query: 67  CDYVYDSSS-TGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSL 125
             ++Y  +S  G QS     +PS  +   +   EF   D  L + +     A S +   L
Sbjct: 68  SKFLYPKASCEGIQS-----LPSQDSKY-EKFSEF--NDTVLREIKKLVGAAQSTVGAKL 119

Query: 126 SMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ 185
                             QPRIL ++ SPD    Y+ +MN+IF+AQ+S   ID+C L  +
Sbjct: 120 ------------------QPRILIIKASPDDASHYMPVMNSIFAAQKSNTCIDACVL-YE 160

Query: 186 NSAFLQQASYITGGVHHKPQQLDGLFQYLL 215
           +S +LQQA+ ITGGV+ K  +   L QYL+
Sbjct: 161 HSGYLQQATDITGGVYLKIPETQALLQYLM 190


>gi|324508596|gb|ADY43627.1| General transcription factor IIH subunit 3 [Ascaris suum]
          Length = 272

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 70/247 (28%), Positives = 109/247 (44%), Gaps = 22/247 (8%)

Query: 34  FSQFLTHVLAFLNAILTLNQLNQVVVI---ATGYNSCDYVYDSSSTGNQSVGNGRMPSLC 90
           F   L+ + +F +A L+L+  N V ++   AT  N   Y +D       ++     P++ 
Sbjct: 28  FYSLLSAIASFASAHLSLSANNSVSILGVDATLNNPLLYAFD------LTIQIDMTPTIV 81

Query: 91  ATL-LQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILC 149
             L +  L+   N D +   Q           + + + A C+I R F+        RIL 
Sbjct: 82  ERLRMALLKSAGNSDAKCTSQ----------FAPAFATAFCHINR-FKKENDGADGRILI 130

Query: 150 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 209
           +    D   +  A+MN  FSA +  + +D   +G + S  LQQA  IT G +   ++   
Sbjct: 131 INIGSDLAREQNALMNVFFSAHKQGIVVDVANIG-EPSPVLQQACDITMGAYINVEKPKR 189

Query: 210 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 269
           L QYL+        SR+         VD+RASC CH     +  +CSVCLS+ CK    C
Sbjct: 190 LLQYLMFFALGGPDSRSKFTSSVASSVDYRASCHCHGTPASIALVCSVCLSVQCKFNPIC 249

Query: 270 STCGSVF 276
             C +VF
Sbjct: 250 PICNTVF 256


>gi|397617770|gb|EJK64598.1| hypothetical protein THAOC_14658 [Thalassiosira oceanica]
          Length = 263

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 49/155 (31%), Positives = 77/155 (49%), Gaps = 12/155 (7%)

Query: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL------GAQNSAFLQQASYITGG 199
           R L +Q SPD    Y A+MN  F++ +S V +D C++       ++ S  L+Q   +TGG
Sbjct: 107 RALVVQASPDRTSDYNALMNCAFASSKSSVVVDGCFIPSGLRDDSRTSPVLRQVVDLTGG 166

Query: 200 VH----HKPQQLDGLFQYLLTI-FGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYI 254
           V+        Q+ G    +LT  F      R  + LP+   VDFRA CF     +D+ ++
Sbjct: 167 VYLAVPSGAAQVGGALGEVLTAAFLPPPRLRGLMNLPESDDVDFRARCFATGRAVDVAWV 226

Query: 255 CSVCLSIYCKHLK-KCSTCGSVFGQAQTQSDEPSA 288
           C+ CLSI+ +  +  C TCG+     + + D   A
Sbjct: 227 CNQCLSIFAERPEGACPTCGAKVLVGEGRGDSKRA 261


>gi|406695514|gb|EKC98819.1| hypothetical protein A1Q2_06922 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 297

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 76/157 (48%), Gaps = 31/157 (19%)

Query: 146 RILCLQGSPDGPEQYVA--------------------IMNAIFSAQRSMVPIDSCYLGAQ 185
           RIL L  +P G E   A                    +MN +F+ Q++ V ID   L  +
Sbjct: 87  RILILNATPGGAEDAEATDSKTAKASGGGGGQRGYVGLMNCVFAGQKAKVAIDVLTLPPE 146

Query: 186 NSA-----FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF-LQLPKPVGVDFR 239
           ++      FLQQA+Y+T G++ +     GL QYL +++      RN    +P    VDFR
Sbjct: 147 STMTAPPIFLQQAAYLTEGIYWRWNGRGGLLQYLHSMYLPPPSLRNKPFTVPPQDAVDFR 206

Query: 240 ASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           A CFCH + +D+G++C     I+C+    CS C + F
Sbjct: 207 AVCFCHHDVVDVGFVC-----IFCEVKPICSMCKTKF 238


>gi|82541502|ref|XP_724989.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23479826|gb|EAA16554.1| hypothetical protein [Plasmodium yoelii yoelii]
          Length = 327

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 83/314 (26%), Positives = 131/314 (41%), Gaps = 48/314 (15%)

Query: 13  VSLVVVLLDTNPFFWSS-----------SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
           +S +++++D N   WS             ++  SQFL  V  F      ++   ++ +IA
Sbjct: 22  MSHLILVIDVNFLIWSQGLKIKFDKNNIKTVKLSQFLKSVFQFTRFYCFMSSSERICIIA 81

Query: 62  TGYNSCDYVYDS--SSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACS 119
           T   +   +Y++  S T N    N      C      L  F+ KD     +  E  +  S
Sbjct: 82  TCSGNSKIIYENYISYTKNNLTEND----YCTNAYDKLINFI-KDNNNNTKTYE--MVES 134

Query: 120 LLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG--PEQYVAIMNAIFSAQRSMVPI 177
            LS +L++ALCY  R+  +   +   RI  L  S       QY  +MN  ++A+R+ + I
Sbjct: 135 TLSSALAIALCYNNRIC-NLYENINSRIFLLDISKSHFYTNQYTQLMNIAYNAKRNKIII 193

Query: 178 DSCYLGAQNSAFLQQASYITGGVHHKPQQL----------DGLFQYLLTIFGTDLHSRNF 227
           D   L  + +  L+Q   IT G++                D L Q ++  F    HSR +
Sbjct: 194 DVFSLNHK-TQILEQICNITNGLYIDNSIFLSINCGDNVEDILTQTIMFWFLPSTHSRKY 252

Query: 228 LQ---LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--------KHLKKCSTCGSVF 276
                L +   +   A C CH   ID+ YICS CL+IYC        K    CS C + F
Sbjct: 253 FSNTYLNEDTNI---AVCTCHNKQIDIAYICSCCLAIYCSEKDSQTNKERISCSVCKTRF 309

Query: 277 GQAQTQSDEPSATN 290
            +A  ++   S  N
Sbjct: 310 TKALLRNKTVSDLN 323


>gi|68063783|ref|XP_673887.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56492063|emb|CAI02316.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 327

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 132/314 (42%), Gaps = 48/314 (15%)

Query: 13  VSLVVVLLDTNPFFWSS-----------SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
           +S +++++D N   WS             ++  SQFL  V  F      ++   ++ +IA
Sbjct: 22  MSHLILVIDVNFLIWSQGLKIKFDKNNIKTVKLSQFLKSVFQFARFYCFMSSSERICIIA 81

Query: 62  TGYNSCDYVYDS--SSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACS 119
           T   +   +Y++  S T N    N      C    + L  F+ KD     +  E  +  S
Sbjct: 82  TCSGNSKIIYENYISYTKNNLTEND----YCTNAYEKLINFI-KDNNNNSKTYE--MVES 134

Query: 120 LLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG--PEQYVAIMNAIFSAQRSMVPI 177
            LS +L++ALCY  R+  +   +   RI  L  S       QY  +MN  ++A+R+ + I
Sbjct: 135 TLSSALAIALCYNNRIC-NLYENINSRIFLLDISKSHFYTNQYTQLMNIAYNAKRNNIII 193

Query: 178 DSCYLGAQNSAFLQQASYITGGVHHKPQQL----------DGLFQYLLTIFGTDLHSRNF 227
           D   L  + +  L+Q   IT G++                D L Q ++  F    HSR +
Sbjct: 194 DVFSLNHK-TQILEQICNITNGLYIDNSIFLSINCADNVEDILTQTIMFWFLPSTHSRKY 252

Query: 228 LQ---LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--------KHLKKCSTCGSVF 276
                L +   +   A C CH   ID+ YICS CL+IYC        K    CS C + F
Sbjct: 253 FSNTYLNEDTNI---AVCTCHNKQIDIAYICSCCLAIYCSEKDAQTNKERISCSVCKTRF 309

Query: 277 GQAQTQSDEPSATN 290
            ++  ++   S  N
Sbjct: 310 TKSLLRNKTVSDLN 323


>gi|392576127|gb|EIW69258.1| hypothetical protein TREMEDRAFT_62112 [Tremella mesenterica DSM
           1558]
          Length = 297

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 69/248 (27%), Positives = 109/248 (43%), Gaps = 50/248 (20%)

Query: 14  SLVVVLLDTNPFFWS-----------------------SSSLSFSQFLTHVLAFLNAILT 50
           SL+++++D +P  WS                        SSL+  +F+T +L FLNA L 
Sbjct: 3   SLLILIVDLHPLSWSLLSIPPSPQPSDPTLSPIIAKSLPSSLTLEEFITQLLVFLNAHLA 62

Query: 51  LNQLNQVVVIATGYNSCDYVYDSSST----GNQSVGNGRMP--SLCATLLQNLEEFMNKD 104
               N+VVV          ++ + ST     N   GN   P   L + L Q L   M+++
Sbjct: 63  SQWGNRVVVYGVSAGKSRLLFPAPSTTRDDTNTPSGNVYEPFRLLDSGLEQGLRGMMSEE 122

Query: 105 EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQ--- 159
              G +E +G      L  +++ ALCY+ R+  S   L     R+L +  +P        
Sbjct: 123 ---GAREGKGLNEPPALVAAMTKALCYLNRLSPSTATLQTTDTRMLIINATPGSSFSESD 179

Query: 160 --------YVAIMNAIFSAQRSMVPID-----SCYLGAQNSAFLQQASYITGGVHHKPQQ 206
                   YV +MN +F+AQ+  +PID       Y       FLQQA+++TGG++ +   
Sbjct: 180 SSGGLRGGYVGLMNCVFAAQKGKIPIDVLTLPPSYTQTSPPIFLQQAAHLTGGMYWRWNG 239

Query: 207 LDGLFQYL 214
             G+ QYL
Sbjct: 240 RGGVLQYL 247


>gi|70953884|ref|XP_746016.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56526511|emb|CAH87927.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 327

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 132/314 (42%), Gaps = 48/314 (15%)

Query: 13  VSLVVVLLDTNPFFWSS-----------SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIA 61
           +S +++++D N   WS             ++  SQFL  V  F      ++   ++ +IA
Sbjct: 22  MSHLILVIDVNFLIWSQGLKIKFDKNNIKTVKLSQFLKSVFQFTRFYCFMSSSERICIIA 81

Query: 62  TGYNSCDYVYDS--SSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACS 119
           T   +   +Y++  S T N    N      C      L +F+ KD     +  E  +  S
Sbjct: 82  TCSGNSKIIYENYISYTKNNLTEND----YCTNAYDKLIKFI-KDNNNNSKTYE--MVES 134

Query: 120 LLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG--PEQYVAIMNAIFSAQRSMVPI 177
            LS +L++ALCY  R+  +   +   RI  L  S       QY  +MN  ++A+R+ + I
Sbjct: 135 TLSSALAIALCYNNRIC-NLYENINSRIFLLDISKSHFYTNQYTQLMNIAYNAKRNNIII 193

Query: 178 DSCYLGAQNSAFLQQASYITGGVHHKPQQL----------DGLFQYLLTIFGTDLHSRNF 227
           D   L  + +  L+Q   IT G++                D L Q ++  F    +SR +
Sbjct: 194 DVFSLNHK-TQILEQICNITNGLYIDNSIFLSITCGDNVEDILTQTIMFWFLPSTNSRKY 252

Query: 228 LQ---LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--------KHLKKCSTCGSVF 276
                L +   +   A C CH   ID+ YICS CL+IYC        K    CS C + F
Sbjct: 253 FSNTYLNEDTNI---AVCTCHNKQIDIAYICSCCLAIYCSEKDAQTNKERISCSICKTRF 309

Query: 277 GQAQTQSDEPSATN 290
            +A  ++   S  N
Sbjct: 310 TKALLRNKTVSDLN 323


>gi|401400847|ref|XP_003880871.1| hypothetical protein NCLIV_039130 [Neospora caninum Liverpool]
 gi|325115283|emb|CBZ50838.1| hypothetical protein NCLIV_039130 [Neospora caninum Liverpool]
          Length = 580

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 63/193 (32%), Positives = 85/193 (44%), Gaps = 28/193 (14%)

Query: 114 GRIAC---SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG--PEQYVAIMNAIF 168
           GR  C   S+L+G+LS+ALCY+ +V +     P+ R+L L GS D     QY+ +MN  F
Sbjct: 372 GRSLCGEDSMLAGALSLALCYLNKVAKRSARPPERRVLLLDGSLDRSYSSQYMPLMNLAF 431

Query: 169 SAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK-----PQQLDG----------LFQY 213
           +A +  + ID C L    S   +Q   I+ GVH K     P    G          L   
Sbjct: 432 AAAKGNIVIDCCALCPNPSTISEQLCDISRGVHLKFSQAAPASTSGNSHLDGGLALLQLL 491

Query: 214 LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--------KH 265
           L  IF +         L    G    A CFCH   +++  ICS CL+IYC        K 
Sbjct: 492 LFWIFPSVSLRPAIAALSVHRGRSNAAVCFCHHRPVEVCCICSCCLAIYCSEKDAQTGKE 551

Query: 266 LKKCSTCGSVFGQ 278
              C  C S F +
Sbjct: 552 RISCDVCKSRFSR 564


>gi|225707288|gb|ACO09490.1| TFIIH basal transcription factor complex p34 subunit [Osmerus
           mordax]
          Length = 208

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 55/194 (28%), Positives = 91/194 (46%), Gaps = 25/194 (12%)

Query: 23  NPFFWSSSS-----LSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVY------ 71
           NP +W   +      + S+ L  V+   N+ L + + N++ +IA+      ++Y      
Sbjct: 17  NPLWWGQQAQKDPEFTLSKCLDAVMVMGNSHLVMTRTNKLAIIASHCQESHFLYPSKHWK 76

Query: 72  --DSSSTGNQSVGNGRMPSLCAT---LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLS 126
             + S     + G+G+   L      +   ++  M++ E  G Q        +LL+GSL+
Sbjct: 77  IGEGSGDVQSASGDGKYELLSVANDLIADEIKNLMSRTEVKGHQTD------TLLAGSLA 130

Query: 127 MALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA 184
            ALCYI RV +        + RIL  + + D   QY+  MN IF+AQR  + ID+C L +
Sbjct: 131 KALCYIHRVTKELEAGQELKSRILVFKAAEDCARQYMNFMNVIFAAQRQNILIDACVLDS 190

Query: 185 QNSAFLQQASYITG 198
           + S  LQQ   I G
Sbjct: 191 E-SGLLQQLISICG 203


>gi|207340364|gb|EDZ68737.1| YPR056Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 230

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 62/217 (28%), Positives = 101/217 (46%), Gaps = 24/217 (11%)

Query: 2   ASAPSKLYSDDVSLVVVLLDTNPFFWSS------SSLSFSQFLTHVLAFLNAILTLNQLN 55
           A +  ++  +  SL+ V+++  P  W++         S  + L  ++ FLNA L  N  N
Sbjct: 12  ARSRKQVTEESPSLLTVIIEIAPKLWTTFDEEGNEKGSIIKVLEALIVFLNAHLAFNSAN 71

Query: 56  QVVVIATGYNSCDYVY-DSSSTGNQSVGNGRMPSLCATLLQNL-EEFMNKDE-------- 105
           +V VIA       Y+Y +S+S    S    +  S    +  ++   F N DE        
Sbjct: 72  KVAVIAAYSQGIKYLYPESTSALKASESENKTRSDLKIINSDMYRRFRNVDETLVEEIYK 131

Query: 106 --QLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ----GSPDGPE 158
             +L K++ E     S L+G++S  L Y+ R+ +  +    + R+L L      S D   
Sbjct: 132 LFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCGSGSSKDEIF 191

Query: 159 QYVAIMNAIFSAQRSMVPIDSCYL-GAQNSAFLQQAS 194
           QY+ IMN IFSA +   PID   + G++ S FLQQ +
Sbjct: 192 QYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTT 228


>gi|399218636|emb|CCF75523.1| unnamed protein product [Babesia microti strain RI]
          Length = 278

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 70/284 (24%), Positives = 122/284 (42%), Gaps = 49/284 (17%)

Query: 16  VVVLLDTNPFFWSS-----SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYV 70
           +V+++D N   WS+     + L+F QF++ +  F+ A   ++    + +IAT    C  +
Sbjct: 8   LVLVIDANKRSWSNDGVSDNKLTFIQFVSSLKRFIRAYHCMSSGRTLYIIATNTIYCKII 67

Query: 71  YDSSSTGN-----QSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSL 125
           +   +         +     M S   ++ ++L +F+++D  L    P+  ++ +L     
Sbjct: 68  FHGLTDSTLIPTTTNSNTTSMDSTLDSMEESLLKFISED--LPDANPQPTLSSNL----- 120

Query: 126 SMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ 185
                   R+     L  +  +L    +    +Q++ ++N  FSA               
Sbjct: 121 -------NRIGHKENLSREILLLDYSTNDMYLKQHIPLLNIGFSAL-------------- 159

Query: 186 NSAFLQQASYITGGVHHKPQQLDG---------LFQYLLTIFGTDLHSRNFLQLPKPVGV 236
           N   L+Q S IT G   +   + G         LFQ LL  F     S+N L  P    +
Sbjct: 160 NMKLLEQLSDITKGTFLRLMDMYGNDSINIEYALFQTLLFWFLASNDSKNILSPPSLTQI 219

Query: 237 DFRASCFCHKNTIDMGYICSVCLSIYC--KHLKKCSTCGSVFGQ 278
              A C+CH NTI++GY+CS CL++YC  K L  C  C S F +
Sbjct: 220 SNSAVCYCHNNTIEIGYLCSSCLTVYCTDKSLIMCKICSSRFAK 263


>gi|156101255|ref|XP_001616321.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805195|gb|EDL46594.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 323

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 73/301 (24%), Positives = 125/301 (41%), Gaps = 53/301 (17%)

Query: 16  VVVLLDTNPFFW-----------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGY 64
           +++++D N   W           +  +L   +FL     F+     ++   ++ +IAT  
Sbjct: 24  LILVVDVNFMIWCEGLKIKFDKNNVKTLRLPEFLKSTFHFVRFYCLMSNAERICIIATSL 83

Query: 65  NSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFM--NKDEQLGKQEPEGRIACSLLS 122
            SC  +Y++  +   +  N      C      L EF+  NK E++ +         S LS
Sbjct: 84  RSCKILYENYVS--YAKNNLTERDYCEEAYNKLIEFLSENKTEKMME---------SSLS 132

Query: 123 GSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG---PEQYVAIMNAIFSAQRSMVPIDS 179
            +L++ALCY  R+  +   + + RI  L  S +      QY  +MN  ++A+R+ + ID 
Sbjct: 133 SALAVALCYNHRIV-NMYENMKGRIFLLDISNETYVYTNQYTQLMNIAYNAKRNNIIIDV 191

Query: 180 CYLGAQNSAFLQQASYITGGVHHKPQQL----------DGLFQYLLTIFGTDLHSRNFLQ 229
             L  +    L+Q   IT G++                D L Q ++  F    ++R +  
Sbjct: 192 FSLNCK-IQLLEQICNITNGLYIDSSIFFNVNCTDNIEDVLTQTMIFWFLPSNNTRKYFS 250

Query: 230 ---LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--------KHLKKCSTCGSVFGQ 278
              L     +   A C CH   +D+ YICS CL+IYC        K    C  C + F +
Sbjct: 251 NTYLNDDTNI---AVCSCHNKQVDIAYICSCCLAIYCSEKDEKTNKERTSCFVCKTRFTK 307

Query: 279 A 279
           A
Sbjct: 308 A 308


>gi|149579196|ref|XP_001517604.1| PREDICTED: general transcription factor IIH subunit 3-like, partial
           [Ornithorhynchus anatinus]
          Length = 80

 Score = 73.9 bits (180), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 31/60 (51%), Positives = 43/60 (71%)

Query: 217 IFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +F  D   R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 2   VFLPDQDQRSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 61


>gi|298711518|emb|CBJ26606.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 117

 Score = 73.9 bits (180), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 51/79 (64%), Gaps = 1/79 (1%)

Query: 196 ITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYIC 255
           + GG++ + Q + GL QY+   F      R +L+L     VDFRA+CFCHK  +D+ Y+C
Sbjct: 15  LPGGLYLEHQGV-GLAQYMFATFLPSQSCRKYLKLRTQQSVDFRAACFCHKRVVDIAYVC 73

Query: 256 SVCLSIYCKHLKKCSTCGS 274
           SVCLS++C+    C+TCG+
Sbjct: 74  SVCLSVFCEFSPVCTTCGT 92


>gi|221502054|gb|EEE27800.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 404

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 84/196 (42%), Gaps = 29/196 (14%)

Query: 112 PEGRIAC---SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP--EQYVAIMNA 166
           P    AC   S+L+G+LS+ALC + +V +     P+ R+L L GS D     QY+ +MN 
Sbjct: 193 PRPTTACGEDSMLAGALSLALCCLNKVSKRSARTPERRVLILDGSLDRSYSSQYMPLMNL 252

Query: 167 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ----------------LDGL 210
            F+A +  + ID C L    S   +Q   I+ GVH K  Q                L  L
Sbjct: 253 AFAAAKGNIVIDCCALSTNPSTIPEQLCDISRGVHLKFAQAAPSASTSGSASLDGGLALL 312

Query: 211 FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC------- 263
              L  IF +         L    G    A CFCH   +++  ICS CL+IYC       
Sbjct: 313 QLLLFWIFPSMSLRPAIAALSVHRGRSNTAVCFCHHKPVEVCCICSCCLAIYCSEKDAQT 372

Query: 264 -KHLKKCSTCGSVFGQ 278
            K    C  C S F +
Sbjct: 373 GKERISCDVCKSRFSR 388


>gi|221057700|ref|XP_002261358.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
 gi|194247363|emb|CAQ40763.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
          Length = 323

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 72/299 (24%), Positives = 123/299 (41%), Gaps = 49/299 (16%)

Query: 16  VVVLLDTNPFFW-----------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGY 64
           +++++D N   W           +  +L   +FL     F+     ++   ++ +IAT  
Sbjct: 24  LILVVDVNFMIWCEGLKIKFDKNNVKTLGLPEFLKSTFHFVRFYCLMSSAERICIIATSL 83

Query: 65  NSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGS 124
            SC  +Y++  +   +  N      C      L EF+N+++     E       S LS +
Sbjct: 84  RSCRILYENYVS--YAKNNLTERDYCEEAYNKLIEFINENKTEKMME-------SNLSSA 134

Query: 125 LSMALCYIQRVFRSGLLHPQPRILCLQGSPDG---PEQYVAIMNAIFSAQRSMVPIDSCY 181
           L++ALCY  R+  +   + + RI  L  S        QY  +MN  ++A+R+ + ID   
Sbjct: 135 LAVALCYNHRIV-NMYENMKGRIFLLDISNQSYVYTNQYTQLMNIAYNAKRNNIIIDVFS 193

Query: 182 LGAQNSAFLQQASYITGGVHHKPQQL----------DGLFQYLLTIFGTDLHSRNFLQ-- 229
           L  +    L+Q   IT G++                D L Q ++  F    ++R +    
Sbjct: 194 LNYK-IQLLEQICNITNGLYIDTTIFFNVNCTDNIEDVLTQTMIFWFLPSNNTRKYFSNT 252

Query: 230 -LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--------KHLKKCSTCGSVFGQA 279
            L     +   A C CH   +D+ YICS CL+IYC        K    C  C + F +A
Sbjct: 253 YLNDDTNI---AVCSCHNKQVDIAYICSCCLAIYCSEKDDKTNKDRISCFVCKTRFTKA 308


>gi|389584514|dbj|GAB67246.1| hypothetical protein PCYB_112670, partial [Plasmodium cynomolgi
           strain B]
          Length = 323

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 72/299 (24%), Positives = 122/299 (40%), Gaps = 49/299 (16%)

Query: 16  VVVLLDTNPFFW-----------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGY 64
           +++++D N   W           +  +L   +FL     F+     ++   ++ +IAT  
Sbjct: 24  LILVVDVNFMIWYEGVKIKFDKNNVKTLRLPEFLKSTFHFVRFYCLMSNSERICIIATSL 83

Query: 65  NSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGS 124
            SC  +Y++  +   +  N      C      L EF+N+++     E       S LS +
Sbjct: 84  RSCKILYENYVS--YAKNNLTERDYCEEAYNKLIEFINENKTEKMME-------SSLSSA 134

Query: 125 LSMALCYIQRVFRSGLLHPQPRILCLQGSPD---GPEQYVAIMNAIFSAQRSMVPIDSCY 181
           L++ALCY  R+  +   + + RI  L  S        QY  +MN  ++A+R+ + ID   
Sbjct: 135 LAVALCYNHRIV-NMYENMKGRIFLLDISNQIYVYTNQYTQLMNIAYNAKRNDIIIDVFS 193

Query: 182 LGAQNSAFLQQASYITGGVHHKPQQL----------DGLFQYLLTIFGTDLHSRNFLQ-- 229
           L  +    L+Q   IT G++                D L Q ++  F     +R +    
Sbjct: 194 LNYK-IKLLEQICNITNGLYIDSSIFFNVNCTDNIEDVLTQTMIFWFLPSNSTRKYFSNT 252

Query: 230 -LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--------KHLKKCSTCGSVFGQA 279
            L     +   A C CH   +D+ YICS CL+IYC        K    C  C + F +A
Sbjct: 253 YLNDDTNI---AVCSCHNKQVDIAYICSCCLAIYCSEKDDKTNKDRTSCFVCKTRFTKA 308


>gi|237838949|ref|XP_002368772.1| hypothetical protein TGME49_066930 [Toxoplasma gondii ME49]
 gi|211966436|gb|EEB01632.1| hypothetical protein TGME49_066930 [Toxoplasma gondii ME49]
          Length = 547

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 61/196 (31%), Positives = 83/196 (42%), Gaps = 29/196 (14%)

Query: 112 PEGRIAC---SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP--EQYVAIMNA 166
           P    AC   S+L+G+LS+ALC + +V +     P+ R+L L GS D     QY+ +MN 
Sbjct: 336 PRPTTACGEDSMLAGALSLALCCLNKVSKRSARTPERRVLILDGSLDRSYSSQYMPLMNL 395

Query: 167 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL----------------DGL 210
            F+A +  + ID C L    S   +Q   I+ GVH K  Q                   L
Sbjct: 396 AFAAAKGNIVIDCCALSTNPSTIPEQLCDISRGVHLKFAQAAPSASTSGSASLDGGLALL 455

Query: 211 FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC------- 263
              L  IF +         L    G    A CFCH   +++  ICS CL+IYC       
Sbjct: 456 QLLLFWIFPSMSLRPAIAALSVHRGRSNTAVCFCHHKPVEVCCICSCCLAIYCSEKDAQT 515

Query: 264 -KHLKKCSTCGSVFGQ 278
            K    C  C S F +
Sbjct: 516 GKERISCDVCKSRFSR 531


>gi|403292309|ref|XP_003937192.1| PREDICTED: tectonic-2 [Saimiri boliviensis boliviensis]
          Length = 940

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 53/188 (28%), Positives = 96/188 (51%), Gaps = 26/188 (13%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++D NP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIVVDANPIWWGKQALKESQFTLSKCMDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVYDSSSTGNQSVGN-----GRMPSLCA---------TLLQNLEEFMNKD--EQLG 108
               ++Y      N  +G+     G  PS             LL +  E + ++  + + 
Sbjct: 65  QESRFLYPGK---NGRLGDFFGDPGNPPSEFNPSGSKDGKYELLTSANEVIAEEIKDLMT 121

Query: 109 KQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNA 166
           K + +G+   +LL+GSL+ ALCYI R+ +        + RIL ++ + D   QY+  MN 
Sbjct: 122 KSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFMNV 181

Query: 167 IFSAQRSM 174
           IF+AQ+ +
Sbjct: 182 IFAAQKQL 189


>gi|221481332|gb|EEE19726.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 549

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 61/196 (31%), Positives = 83/196 (42%), Gaps = 29/196 (14%)

Query: 112 PEGRIAC---SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP--EQYVAIMNA 166
           P    AC   S+L+G+LS+ALC + +V +     P+ R+L L GS D     QY+ +MN 
Sbjct: 338 PRPTTACGEDSMLAGALSLALCCLNKVSKRSARTPERRVLILDGSLDRSYSSQYMPLMNL 397

Query: 167 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL----------------DGL 210
            F+A +  + ID C L    S   +Q   I+ GVH K  Q                   L
Sbjct: 398 AFAAAKGNIVIDCCALSTNPSTIPEQLCDISRGVHLKFAQAAPSASTSGSASLDGGLALL 457

Query: 211 FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC------- 263
              L  IF +         L    G    A CFCH   +++  ICS CL+IYC       
Sbjct: 458 QLLLFWIFPSMSLRPAIAALSVHRGRSNTAVCFCHHKPVEVCCICSCCLAIYCSEKDAQT 517

Query: 264 -KHLKKCSTCGSVFGQ 278
            K    C  C S F +
Sbjct: 518 GKERISCDVCKSRFSR 533


>gi|224146905|ref|XP_002336364.1| predicted protein [Populus trichocarpa]
 gi|222834814|gb|EEE73263.1| predicted protein [Populus trichocarpa]
          Length = 80

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 35/66 (53%), Positives = 42/66 (63%)

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MNA FSA RS+VPIDSC +GA N AF QQASYITG V+ KPQQ+  +       F  ++ 
Sbjct: 1   MNATFSAPRSLVPIDSCRVGAHNPAFSQQASYITGSVYVKPQQVSLVNHRPFIFFAPEVA 60

Query: 224 SRNFLQ 229
             N L 
Sbjct: 61  VENLLH 66


>gi|224116574|ref|XP_002331930.1| predicted protein [Populus trichocarpa]
 gi|222874602|gb|EEF11733.1| predicted protein [Populus trichocarpa]
          Length = 79

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 35/66 (53%), Positives = 42/66 (63%)

Query: 164 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 223
           MNA FSA RS+VPIDSC +GA N AF QQASYITG V+ KPQQ+  +       F  ++ 
Sbjct: 1   MNATFSAPRSLVPIDSCRVGAHNPAFSQQASYITGSVYVKPQQVSLVNHRPFIFFAPEVA 60

Query: 224 SRNFLQ 229
             N L 
Sbjct: 61  VENLLH 66


>gi|294889292|ref|XP_002772747.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239877280|gb|EER04563.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 383

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 80/319 (25%), Positives = 124/319 (38%), Gaps = 58/319 (18%)

Query: 15  LVVVLLDTNPFFWSSSSLS----------------FSQFLTHVLAFLNAILTLNQLNQVV 58
           LV ++LD NP+ WSS SL+                 +  +T    F   +LT+  ++   
Sbjct: 28  LVAIVLDLNPWVWSSYSLASFKVEAPPSEKAMQEKAAGAVTLEALFSALLLTVGAISAAS 87

Query: 59  VIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCAT-LLQNLEEFMNKDEQLGKQEPEGRIA 117
                   C  +   S T  + +  G  P +  T L + +  F+    +   +      +
Sbjct: 88  TSGDSLGVC--IVGVSPTKAKILLKGSWPEMDRTALWEAMVGFIKASVKERSRPSTTTTS 145

Query: 118 CSLLSGSLSMALCYIQRVFRSGLLHPQPRILCL------QGSPDGPEQYVAIMNAIFSAQ 171
            SL++  LS+ALCY+ +     L  P     C+       G  +   + V + N  ++AQ
Sbjct: 146 SSLIAAGLSLALCYLNK----HLTIPGRARCCIVEAVSVYGEANYASEAVPLANCGWAAQ 201

Query: 172 RSMVPIDSCYLGAQNS-----------------------AFLQQASYITGGVHHKPQQ-- 206
              VPID C +    +                         L Q    T GVH   ++  
Sbjct: 202 DLRVPIDLCTVCPGRTPQSSSPSNDRSSSSSSNASPTSATLLVQLCEATRGVHIPSKRCS 261

Query: 207 -LDGLFQYLLTIFGT-DLHSRNFLQL-PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 263
            + GLFQ L+  F    L  R  L+  P    +D  + C CH   +D GY+CSVCL IYC
Sbjct: 262 TVGGLFQSLMFHFSVPSLGEREVLKTRPSARLLDMGSLCACHGLPVDRGYVCSVCLCIYC 321

Query: 264 KHLKK-CSTCGSVFGQAQT 281
                 C  CG+ F +  T
Sbjct: 322 NDSSGVCRKCGARFKRTAT 340


>gi|401888190|gb|EJT52153.1| hypothetical protein A1Q1_06259 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 323

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 71/300 (23%), Positives = 111/300 (37%), Gaps = 103/300 (34%)

Query: 30  SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSL 89
           S ++   F T ++ FLNA L     N+VVV           Y +       +   R P +
Sbjct: 15  SPITLEDFATVLMVFLNAHLASRWGNEVVV-----------YGALPGSAIDINRRRPPEV 63

Query: 90  CATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILC 149
            AT L       + ++      PE                              + RIL 
Sbjct: 64  AATSL-----VAHDNDPSKNSNPEAE----------------------------ESRILI 90

Query: 150 LQGSPDGPEQYVA--------------------IMNAIFSAQRSM-------------VP 176
           L  +P G E   A                    +MN +F+AQ+++             V 
Sbjct: 91  LNATPGGAEDAEATDSKTAKASGGGGGQRGYVGLMNCVFAAQKAVSLTAVEDELTCQKVA 150

Query: 177 IDSCYLGAQNSA-----FLQQASYITGGVHHKPQQLDGLFQYL---------------LT 216
           ID   L  +++      FLQQA+Y+T G++ +     GL QYL               + 
Sbjct: 151 IDVLTLPPESTMTAPPIFLQQAAYLTEGIYWRWNGRGGLLQYLHVSLLVPSVSADPQSMY 210

Query: 217 IFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           +    L ++ F  +P    VDFRA CFCH + +D+G++C     I+C+    CS C + F
Sbjct: 211 LPPPSLRNKPF-TVPPQDAVDFRAVCFCHHDVVDVGFVC-----IFCEVKPICSMCKTKF 264


>gi|294947738|ref|XP_002785469.1| hypothetical protein Pmar_PMAR005787 [Perkinsus marinus ATCC 50983]
 gi|239899380|gb|EER17265.1| hypothetical protein Pmar_PMAR005787 [Perkinsus marinus ATCC 50983]
          Length = 385

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 85/327 (25%), Positives = 128/327 (39%), Gaps = 72/327 (22%)

Query: 15  LVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAI-------LTLNQLNQVVVIATGYNSC 67
           LV ++LD NP+ WSS SL+   F         A+       +TL  L   +++  G  S 
Sbjct: 28  LVAIVLDLNPWVWSSYSLA--SFKVEAPPSEKAMQEKAAGAVTLEALFSALLLTVGAIS- 84

Query: 68  DYVYDSSSTGNQSVGN---GRMPSLCATLLQNLEEFMNK---------------DEQLGK 109
                ++ST   S+G    G  P+    LL+     M++                E+   
Sbjct: 85  -----AASTSGDSLGVCIVGVSPTKAKILLKGSWPEMDRTALWEAMVGFIKASVKERSRP 139

Query: 110 QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCL------QGSPDGPEQYVAI 163
                  + SL++  LS+ALCY+ +     L  P     C+       G  +   + V +
Sbjct: 140 SSTMTTTSSSLIAAGLSLALCYLNK----HLTIPGRARCCIVEAVSVYGEANYASEAVPL 195

Query: 164 MNAIFSAQRSMVPIDSCYL-----------------------GAQNSAFLQQASYITGGV 200
            N  ++AQ   VPID C +                          ++  L Q    T GV
Sbjct: 196 ANCGWAAQDLRVPIDLCTVCPGRTPQSSSPSNDRSSSSSSNASPTSATLLVQLCEATRGV 255

Query: 201 HHKPQQ---LDGLFQYLLTIFGT-DLHSRNFLQL-PKPVGVDFRASCFCHKNTIDMGYIC 255
           H   ++   + GLFQ L+  F    L  R  L+  P    +D  + C CH   +D GY+C
Sbjct: 256 HIPSKRCSTVGGLFQSLMFHFSVPSLGEREVLKTRPSARLLDMGSLCACHGLPVDRGYVC 315

Query: 256 SVCLSIYCKHLKK-CSTCGSVFGQAQT 281
           SVCL IYC      C  CG+ F +  T
Sbjct: 316 SVCLCIYCNDSSGVCRKCGARFKRTAT 342


>gi|413935398|gb|AFW69949.1| hypothetical protein ZEAMMB73_462205 [Zea mays]
          Length = 73

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 29/41 (70%), Positives = 36/41 (87%)

Query: 1  MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHV 41
          MASA SKLYSDD+SLVVV+LDTNPFFW++++L F+ F  HV
Sbjct: 1  MASAHSKLYSDDISLVVVVLDTNPFFWAAATLPFADFFAHV 41


>gi|19074510|ref|NP_586016.1| hypothetical protein ECU07_0880 [Encephalitozoon cuniculi GB-M1]
 gi|19069152|emb|CAD25620.1| hypothetical protein [Encephalitozoon cuniculi GB-M1]
 gi|449329589|gb|AGE95860.1| hypothetical protein ECU07_0880 [Encephalitozoon cuniculi]
          Length = 213

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 46/163 (28%), Positives = 80/163 (49%), Gaps = 25/163 (15%)

Query: 123 GSLSMALCYIQRVFRSGLLH--PQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180
           G L  ALC          LH   + +I+      +  ++Y+  +  +F+AQR  + + + 
Sbjct: 74  GDLGYALC----------LHRDEESQIVIFTLGRERKDEYLRYLKCMFAAQRFGIRVSAF 123

Query: 181 YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 240
            L  +N   LQ  +  TGG +   +  DG  ++LL++ GT          PKP+G  F A
Sbjct: 124 SL-FENKTILQCCAS-TGGDYSTSE--DGCLRFLLSLLGTRGE-------PKPLG--FPA 170

Query: 241 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQS 283
           +C+CH   + +G +C +CLS++C+ +  C  C S F   + +S
Sbjct: 171 TCYCHNRQVLLGLVCPICLSVFCRFVPVCKKCKSKFSFTKYES 213


>gi|303389949|ref|XP_003073206.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB4 [Encephalitozoon
           intestinalis ATCC 50506]
 gi|303302351|gb|ADM11846.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB4 [Encephalitozoon
           intestinalis ATCC 50506]
          Length = 213

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 81/161 (50%), Gaps = 21/161 (13%)

Query: 123 GSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL 182
           G L  ALC    + R G    + +I+      +  ++Y+  +  +F+AQR  V I +  L
Sbjct: 74  GDLGYALC----LHRLG----ESQIVIFTLEKERNDEYLRYLKCMFAAQRFGVRISAFSL 125

Query: 183 GAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASC 242
             +N   LQ  +  TGG +   +  DG  ++LL++ GT   S       KP+G  F A+C
Sbjct: 126 -FENKTILQCCA-ATGGGYSTSE--DGCLRFLLSLLGTKGDS-------KPLG--FPATC 172

Query: 243 FCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQS 283
           +CH   + +G +C +CLS++C+ +  C  C S F   + +S
Sbjct: 173 YCHNKQVLLGLVCPICLSVFCRFVPVCKKCKSKFSFVKHES 213


>gi|76163131|gb|AAX30917.2| SJCHGC08726 protein [Schistosoma japonicum]
          Length = 77

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 6/76 (7%)

Query: 214 LLTIFGTDLHSRNFLQLPKP------VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLK 267
           LL++F      R  L LP         GVDFRA+CFCHK  ID+GY+CS+CLS++C+   
Sbjct: 1   LLSVFLPSSKMRTSLILPDSRSSGLSFGVDFRAACFCHKRLIDIGYVCSICLSVFCEFNP 60

Query: 268 KCSTCGSVFGQAQTQS 283
            C TC + F     Q+
Sbjct: 61  ICPTCNTPFVLPTVQN 76


>gi|32399081|emb|CAD98321.1| transcription factor, possible [Cryptosporidium parvum]
          Length = 311

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 80/308 (25%), Positives = 126/308 (40%), Gaps = 46/308 (14%)

Query: 18  VLLDTNPFFW-----SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYD 72
           ++LD+   F       S S + S  L      LN+I    Q  ++  ++TG      +Y 
Sbjct: 7   IILDSRSLFRYEDTGESQSNAQSNLLVSGECVLNSIF---QFTRLYNLSTGQKKKLNIYL 63

Query: 73  SSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYI 132
             S G   + +G +  L   L    E      E++ ++   G +    +S ++S  LC I
Sbjct: 64  CDSKGENLLFDGFISGLTKEL--QFEIMPKMLEEISERISLGEMK---MSPTISKCLCKI 118

Query: 133 QRVFRSGLLH--PQPRILCLQGSPDGP--EQYVAIMNAIFSAQRSMVPIDSCYLGAQNSA 188
               +    +   + RI+ L  S       QYV+++N  F++Q+  V ID   +    S 
Sbjct: 119 NSTRKKFGYNRSEKDRIILLDASNKDEYISQYVSLLNCGFASQKLDVVIDVVSVTRNPSP 178

Query: 189 FLQQASYITGGVHHKPQQL-DGLFQYLLTIFGTDLHSRN--------FLQLP-------- 231
            L     I  G++ K  Q+ D L +  + I   D H  +        F  LP        
Sbjct: 179 LLNNLVDICNGLNLKYSQIVDSLAESNIEINKEDHHWLDQGLTPFLIFHLLPSIQAREEI 238

Query: 232 -------KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGSVFGQAQTQ 282
                  K  G+   A CFCH   I++GY+CS CLSI+C   K   CSTCG+   +   Q
Sbjct: 239 FISINKTKQTGL---AVCFCHHQKIEIGYVCSSCLSIFCSRFKAPICSTCGARLKRIPIQ 295

Query: 283 SDEPSATN 290
               S+ N
Sbjct: 296 QKTLSSLN 303


>gi|396081719|gb|AFN83334.1| RNA polymerase II transcription [Encephalitozoon romaleae SJ-2008]
          Length = 213

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 78/162 (48%), Gaps = 23/162 (14%)

Query: 123 GSLSMALCYIQRVFRSGLLHPQP-RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCY 181
           G L  ALC         L   +P +I+      +  ++Y+  +  +F+AQR  V I +  
Sbjct: 74  GDLGYALC---------LHKDEPTQIVIFTLGRERKDEYLRYLKCMFAAQRLGVRISAFS 124

Query: 182 LGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRAS 241
           L  +N   LQ     TGG +   +  D   ++LL++ GT           KP+G  F A+
Sbjct: 125 L-FENKTILQ-CCVSTGGDYSTSE--DRCLRFLLSLLGTGGSQ-------KPLG--FPAT 171

Query: 242 CFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQS 283
           C+CH   I +G +C +CLS++CK +  C  C S F  A+ +S
Sbjct: 172 CYCHNRQILLGLVCPICLSVFCKFIPVCKKCKSKFSFAKYES 213


>gi|209882598|ref|XP_002142735.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209558341|gb|EEA08386.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 310

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 84/193 (43%), Gaps = 23/193 (11%)

Query: 121 LSGSLSMALCYIQ--RVFRSGLLHPQPRILCLQGSPDGP--EQYVAIMNAIFSAQRSMVP 176
           L+  +S+ LC I   R   S     + RIL    S +     QYV ++N+ ++A +  V 
Sbjct: 110 LASGISICLCKINSDRKLHSKNRTNKDRILLFDASNEKSYINQYVLLINSSYAALKLDVI 169

Query: 177 IDSCYLGAQNSAFLQQASYITGGVH---------------HKPQQLDGLFQYLL--TIFG 219
           ID C + +  S  L     I+ G++                K    DGL  +++   +  
Sbjct: 170 IDVCSISSNPSRLLYNVVDISKGIYINYSSIIKNVSSKIDDKSHIQDGLLPFIIFHLLPS 229

Query: 220 TDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGSVFG 277
            +L + + + L         + CFCH   +++G+ICS CL+I+C   +   C+ CG+ F 
Sbjct: 230 AELRANSLINLSTKSRHSGISVCFCHYKKVEVGFICSSCLAIFCSLFRAPICAACGARFK 289

Query: 278 QAQTQSDEPSATN 290
           +   Q    S  N
Sbjct: 290 RVPIQQKPLSYLN 302


>gi|126650916|ref|XP_001388356.1| transcription factor [Cryptosporidium parvum Iowa II]
 gi|126117450|gb|EAZ51550.1| transcription factor, putative [Cryptosporidium parvum Iowa II]
          Length = 286

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 77/291 (26%), Positives = 120/291 (41%), Gaps = 46/291 (15%)

Query: 18  VLLDTNPFFW-----SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNSCDYVYD 72
           ++LD+   F       S S + S  L      LN+I    Q  ++  ++TG      +Y 
Sbjct: 7   IILDSRSLFRYEDTGESQSNAQSNLLVSGECVLNSIF---QFTRLYNLSTGQKKKLNIYL 63

Query: 73  SSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYI 132
             S G   + +G +  L   L    E      E++ ++   G +    +S ++S  LC I
Sbjct: 64  CDSKGENLLFDGFISGLTKEL--QFEIMPKMLEEISERISLGEMK---MSPTISKCLCKI 118

Query: 133 QRVFRSGLLH--PQPRILCLQGSPDGP--EQYVAIMNAIFSAQRSMVPIDSCYLGAQNSA 188
               +    +   + RI+ L  S       QYV+++N  F++Q+  V ID   +    S 
Sbjct: 119 NSTRKKFGYNRSEKDRIILLDASNKDEYISQYVSLLNCGFASQKLDVVIDVVSVTRNPSP 178

Query: 189 FLQQASYITGGVHHKPQQL-DGLFQYLLTIFGTDLHSRN--------FLQLP-------- 231
            L     I  G++ K  Q+ D L +  + I   D H  +        F  LP        
Sbjct: 179 LLNNLVDICNGLNLKYSQIVDSLAESNIEINKEDHHWLDQGLTPFLIFHLLPSIQAREEI 238

Query: 232 -------KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCG 273
                  K  G+   A CFCH   I++GY+CS CLSI+C   K   CSTCG
Sbjct: 239 FISINKTKQTGL---AVCFCHHQKIEIGYVCSSCLSIFCSRFKAPICSTCG 286


>gi|401827001|ref|XP_003887593.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB4 [Encephalitozoon hellem
           ATCC 50504]
 gi|392998599|gb|AFM98612.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB4 [Encephalitozoon hellem
           ATCC 50504]
          Length = 213

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 68/129 (52%), Gaps = 13/129 (10%)

Query: 155 DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYL 214
           +  ++Y+  +  +F+AQR  V I +  L  +N   LQ  +  TGG +   +  D   ++L
Sbjct: 98  EKKDEYLRYLKCMFAAQRLGVRISAFSL-FENKTILQCCAS-TGGDYSTSE--DKCLRFL 153

Query: 215 LTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGS 274
           L++ GT  +        KP+G  F A+C+CH   + +G +C +CLS++CK +  C  C S
Sbjct: 154 LSLLGTGGNK-------KPLG--FPAACYCHNRQVILGLVCPICLSVFCKFIPVCRKCKS 204

Query: 275 VFGQAQTQS 283
            F   + ++
Sbjct: 205 KFSFVKYEN 213


>gi|71029634|ref|XP_764460.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68351414|gb|EAN32177.1| hypothetical protein, conserved [Theileria parva]
          Length = 348

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 72/313 (23%), Positives = 125/313 (39%), Gaps = 67/313 (21%)

Query: 16  VVVLLDTNPFFWSS----------SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYN 65
           +V++LD +P  WSS            +    F T +  FL     ++  N+ ++I    +
Sbjct: 29  LVLILDLSPGRWSSGVPLRPEEPEKRIHLHNFFTVLANFLKLFGYMSVRNKTLIITANNH 88

Query: 66  SCDYVY---------DSSSTGNQSVGNG-----------RMPSLCATLLQNLEEFMNKDE 105
           S   +Y         DS    N SV              ++      +   + +F+++  
Sbjct: 89  STRILYEGFVYNDWTDSCFGVNSSVYTNSQHCDTDRTGEKVDKTLGVMWNKMIDFISE-- 146

Query: 106 QLGKQEPEGRIACSLLSGSLSMALCYIQRV-FRSGLLHPQPRILCLQGSPDGPEQYVAIM 164
             G   P+   A + L+ +++    Y+ R+  R+     +  I  +  S +   QY+ +M
Sbjct: 147 --GLTYPD---ADTQLTSAIATGFLYLNRIRMRNEGYGRKIIIFDVSTSENYKSQYIGLM 201

Query: 165 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH--------------HKPQQLDGL 210
           N  +SA    + I++  LG Q S  L+Q S IT G +              H  Q L  L
Sbjct: 202 NIAYSALAQNITINTFALG-QPSRILEQLSSITRGKYLLVSKVLNFDLNFSHINQYLSQL 260

Query: 211 --FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 268
             F YL +   ++L S N       VG+     C+CH  ++D+ Y+C  C ++YC  +  
Sbjct: 261 ITFWYLPSEGMSELLSTNLSFDFGNVGI-----CYCHYKSVDVSYLCPCCFAVYCSEIDD 315

Query: 269 -------CSTCGS 274
                  C  CGS
Sbjct: 316 KGKYRIICMVCGS 328


>gi|119618835|gb|EAW98429.1| general transcription factor IIH, polypeptide 3, 34kDa, isoform
           CRA_d [Homo sapiens]
          Length = 200

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 76/149 (51%), Gaps = 23/149 (15%)

Query: 10  SDDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIATGY 64
            D+++L+V+++D NP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+  
Sbjct: 5   EDELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHI 64

Query: 65  NSCDYVYDSSSTGNQSVGN-----GRMPSLCAT--------LLQNLEEFMNKD--EQLGK 109
               ++Y      N  +G+     G  P    +        LL +  E + ++  + + K
Sbjct: 65  QESRFLYPGK---NGRLGDFFGDPGNPPEFNPSGSKDGKYELLTSANEVIVEEIKDLMTK 121

Query: 110 QEPEGRIACSLLSGSLSMALCYIQRVFRS 138
            + +G+   +LL+GSL+ ALCYI R+ + 
Sbjct: 122 SDIKGQHTETLLAGSLAKALCYIHRMNKE 150


>gi|198471480|ref|XP_002133743.1| GA22626 [Drosophila pseudoobscura pseudoobscura]
 gi|198145935|gb|EDY72370.1| GA22626 [Drosophila pseudoobscura pseudoobscura]
          Length = 390

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 70/159 (44%), Gaps = 9/159 (5%)

Query: 119 SLLSGSLSMALCYIQRVFR-----SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRS 173
           SLL+ SL +ALCYIQ+  R     S     Q RIL + GS    + Y    N I  A  +
Sbjct: 157 SLLAESLGLALCYIQKRRRQIPNTSHGRSVQGRILIVTGSVLHEDVYKNYDNVIQEALET 216

Query: 174 MVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKP 233
            V I+ C +       LQ+A+  T G++        L   L+  F   L+    ++ P+ 
Sbjct: 217 NVTINVCAVKVPKQLPLQRATDTTDGLYFCTNDTMSLSGDLVDHF---LYPTYLIKSPRS 273

Query: 234 VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 272
             +   A C CH     +G++C  C  + CKH   C  C
Sbjct: 274 -HIAVPARCACHGFGNKIGFVCGKCKLVLCKHSPYCDDC 311


>gi|449019603|dbj|BAM83005.1| similar to TFIIH subunit TFB4 [Cyanidioschyzon merolae strain 10D]
          Length = 399

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/167 (28%), Positives = 71/167 (42%), Gaps = 29/167 (17%)

Query: 141 LHPQPRILCLQGSPDGP---EQYVAIMNAIFSAQRSMVPIDSCYLGA--------QNSAF 189
           L P+ R++ +  S  GP    + V + NA F AQ+  + +D   + A        Q  A 
Sbjct: 231 LSPRARLVVI--SAPGPFDVAESVPLWNAAFCAQQHQLLVDVVRIEATSEPGTKPQARAA 288

Query: 190 LQQASYITGGVHHKPQ-QLDGLFQYLLTIFGTDLHSRNFLQLPKPV-------------- 234
           LQQ +Y T G++ + +     LF +L T    +   R +L    P               
Sbjct: 289 LQQLAYATQGIYLECRCACAKLFTHLATYIVPNRLERRYLTRRIPELGSYASGTERCGVV 348

Query: 235 -GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQ 280
             VD RA+CF     + MGY+CS CLS+Y      C  C +    A+
Sbjct: 349 ESVDLRATCFQSGKLVHMGYVCSRCLSVYSTPQAACIVCDARLDAAE 395


>gi|403223596|dbj|BAM41726.1| uncharacterized protein TOT_040000918 [Theileria orientalis strain
           Shintoku]
          Length = 332

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/168 (27%), Positives = 78/168 (46%), Gaps = 33/168 (19%)

Query: 121 LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG-PEQYVAIMNAIFSAQRSMVPIDS 179
           L+ ++SM   Y+ R+ +S     +  I+    S +    QY+ +MN  F+A +  + I++
Sbjct: 141 LTSAISMGCLYLNRITKSKAGFGRKIIIFDVSSRENYKSQYIGLMNIAFTALKQNITINT 200

Query: 180 CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN----------FLQ 229
             LG Q S  L+Q S IT       + LD     L  IF +D+ + N          F  
Sbjct: 201 LALG-QPSRILEQLSTITNA-----KYLD-----LSKIFQSDVETSNIEQSLSQLISFWL 249

Query: 230 LP-KPVG------VDFR----ASCFCHKNTIDMGYICSVCLSIYCKHL 266
           LP + VG      + F     A C+CH  T+++ Y+C  C ++YC  +
Sbjct: 250 LPSEEVGEILSTKLSFEFGNTAICYCHYKTVEVSYLCPCCFAVYCSEV 297


>gi|328769012|gb|EGF79057.1| hypothetical protein BATDEDRAFT_90182 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 153

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 67/155 (43%), Gaps = 29/155 (18%)

Query: 41  VLAFLNAILTLNQLNQVVVIATGYNSCDYVYD----SSSTGNQSVGNGRMPSLCATLLQN 96
           ++  +NA L ++  N + VI T   S   +Y     +S +   SVG   +P +  TL   
Sbjct: 1   MMVLVNAHLAMHHHNSLAVIGTTSTSSRLLYPPPSLASVSDRDSVGAASLPGMTGTLSTA 60

Query: 97  LE-------------------EFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR 137
            E                   +F + D  + +Q         L+   +S A  +I ++  
Sbjct: 61  AESALPVGSTTLSKKPANVYKQFFDVDTHIVEQ------LRRLVLADVSAAPKHINKMQS 114

Query: 138 SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 172
           S     Q RIL L  SPDG  QY+ IMNAIF+AQ+
Sbjct: 115 STTPDRQSRILVLSVSPDGSAQYIPIMNAIFAAQK 149


>gi|312078926|ref|XP_003141952.1| basic transcription factor 2 [Loa loa]
          Length = 253

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 41/145 (28%), Positives = 71/145 (48%), Gaps = 23/145 (15%)

Query: 119 SLLSGSLSMALCYIQR-----------VFR--SGLLHPQPRILCLQGSPDGPEQYVAIMN 165
           ++ + ++++A+C+I+            +FR  + + +   RIL +    D   ++  +MN
Sbjct: 101 AVFAPAVAIAICHIKDDRAILNNISSDIFRYKNEVDNGDGRILIINIGSDLRGEHSILMN 160

Query: 166 AIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL--TIFGTDLH 223
             F+A +  + ID   +G + S  LQQAS ITGG +   ++   L +Y +  T+    L 
Sbjct: 161 IFFAAHKHNILIDVANIG-EASTILQQASDITGGTYFSVKKPKQLLKYTMCFTLGKASLR 219

Query: 224 SRNFLQLPKP---VGVDFRASCFCH 245
           S      P P     VD+RASC CH
Sbjct: 220 S----AFPSPSSSTSVDYRASCHCH 240


>gi|321265019|ref|XP_003197226.1| hypothetical protein CGB_M0140C [Cryptococcus gattii WM276]
 gi|317463705|gb|ADV25439.1| Hypothetical protein CGB_M0140C [Cryptococcus gattii WM276]
          Length = 323

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 57/224 (25%), Positives = 86/224 (38%), Gaps = 60/224 (26%)

Query: 1   MASAPSKLYSDDVSLVVVLLDTNPFFW---------------------SSSSLSFSQFLT 39
           M S PS L        +V+LDT+P  W                      SS  S  QF+T
Sbjct: 1   MPSPPSTL--------IVILDTHPLSWHLLSHLPPAPPIPDNKVLDTAKSSPTSLHQFIT 52

Query: 40  HVLAFLNAILTLNQLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPS---------LC 90
            ++ FLNA L     N+VVV          +Y  S+   +  G G  P+         L 
Sbjct: 53  ILIVFLNAHLASKWGNEVVVYTASAGKATLIYPPSNDKLRQRGEGGKPNANVYRPFQVLD 112

Query: 91  ATLLQNLEEFMNKDEQLGKQEPEGRI-ACSLLSGSLSMALCYIQRVFRSGLLHPQPRILC 149
             + + L+E + +++Q    E  G I     +  +L+ ALC+I R   S   H  P  L 
Sbjct: 113 ERIEEGLKEVVREEQQKLDTEGPGFINEPPAMVSALTKALCFINRRILSS-AHNDPTALP 171

Query: 150 LQGSPDG----------PEQ----------YVAIMNAIFSAQRS 173
               P+           P +          YV +MN +F+AQ++
Sbjct: 172 PSSDPNNNNGDTSGGLLPSKERRQQRMRGGYVGLMNCVFAAQKA 215


>gi|164662823|ref|XP_001732533.1| hypothetical protein MGL_0308 [Malassezia globosa CBS 7966]
 gi|159106436|gb|EDP45319.1| hypothetical protein MGL_0308 [Malassezia globosa CBS 7966]
          Length = 276

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/215 (25%), Positives = 84/215 (39%), Gaps = 48/215 (22%)

Query: 15  LVVVLLDTNPFFWSSSS-----------------LSFSQF---LTHVLAFLNAILTLNQL 54
            +VV+LD N + W   S                 ++FS     +  VL FLNA   +   
Sbjct: 23  FLVVILDLNAYAWQHVSETAKESEPSSTPSQQADVAFSTLKDTILSVLVFLNAYTAMQHG 82

Query: 55  NQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMP-----SLCATLLQNLEEFMNKDEQLGK 109
           N +VV      +   +Y SS   +    + R P     S C    Q  +   +    L  
Sbjct: 83  NGLVVYGAATGTARLLYSSSPHVHPDTHSRRTPQDTHVSACLPFKQMDDAVFHGMRALFD 142

Query: 110 QEPEGRIACSLLSGSLSMALCYIQRVF-----------RSGLLHPQPR----------IL 148
              +G      +  +L++ALC++ R+             SG L  + R          IL
Sbjct: 143 DARDGARGPVGMVRALALALCHMHRLSTVLSESTTTDPHSGALGARTRRGRNTSFLQRIL 202

Query: 149 CLQGSPDGPEQYVAIMNAIFSAQRSMVPID--SCY 181
            L  +PD   QYV +MN IFSAQ+  + +D  SC+
Sbjct: 203 VLSATPDVSAQYVPMMNCIFSAQKQGIQVDARSCF 237


>gi|58261942|ref|XP_568381.1| hypothetical protein CNM00130 [Cryptococcus neoformans var.
           neoformans JEC21]
 gi|57230554|gb|AAW46864.1| hypothetical protein CNM00130 [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 340

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 84/211 (39%), Gaps = 52/211 (24%)

Query: 14  SLVVVLLDTNPFFW---------------------SSSSLSFSQFLTHVLAFLNAILTLN 52
           S ++++LDT+P  W                     +SS  S  QF+T +  FLNA L   
Sbjct: 6   STLILVLDTHPLSWHLLAHLPPAPPLPDNKILDNATSSPTSLDQFITILTVFLNAHLASK 65

Query: 53  QLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLCA-----TLLQNLEEFMNK--DE 105
             N+VVV        + +Y  S+   +  G G  PS         L Q +EE + +   E
Sbjct: 66  WGNEVVVYTASAGKAELIYPPSNEKIRQRGEGAKPSANMYRPFQILDQGIEEGLKEVVRE 125

Query: 106 QLGKQEPEGR---IACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG------ 156
           + GK   EG         +  +L+ ALC+I R   S  +   P  L L   P+       
Sbjct: 126 EEGKLNTEGAGFINQPPAMVSALTKALCFINRRI-SPSVPADPTALPLSSDPNSGTSDTS 184

Query: 157 ----PEQ----------YVAIMNAIFSAQRS 173
               P +          YV +MN +F+AQ++
Sbjct: 185 GGLLPSKERRQQRMRGGYVGLMNCVFAAQKA 215


>gi|402471701|gb|EJW05288.1| hypothetical protein EDEG_00646 [Edhazardia aedis USNM 41457]
          Length = 222

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 44/167 (26%), Positives = 70/167 (41%), Gaps = 25/167 (14%)

Query: 113 EGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 172
           E ++  S L+  ++  LC I +   S     +  +L L    +   Q + +  A   A+ 
Sbjct: 68  EYKVCSSNLAQDIAYTLCLINKNRDSS--EARIFVLNLDYQDNNTIQLLKVSTA---AKN 122

Query: 173 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 232
             + IDS Y   Q S  L Q   ITGG++     +       + +FG       F +L  
Sbjct: 123 MKIRIDS-YSSVQ-SPVLSQVCNITGGIYRTSLDI------FVNLFG-------FYELGD 167

Query: 233 PVGVDF-----RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGS 274
             G D         C CH N I +G +C +CL+IYC+ +  C  C +
Sbjct: 168 ESGTDMSGCFGNVKCICHGNIITLGLVCPICLAIYCRTVPICRVCKT 214


>gi|224106784|ref|XP_002333630.1| predicted protein [Populus trichocarpa]
 gi|222837853|gb|EEE76218.1| predicted protein [Populus trichocarpa]
          Length = 125

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 39/74 (52%), Gaps = 11/74 (14%)

Query: 155 DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYL 214
           DGPEQ+VA+MNA F    S+VPIDSC           QAS +TG V+ K QQ+  +    
Sbjct: 33  DGPEQHVAVMNATFCTPCSLVPIDSC-----------QASCVTGCVYVKSQQVSLVNHRP 81

Query: 215 LTIFGTDLHSRNFL 228
              F  ++   N L
Sbjct: 82  FISFAPEVAVENLL 95


>gi|291239973|ref|XP_002739904.1| PREDICTED: general transcription factor IIH, polypeptide 3,
           34kDa-like [Saccoglossus kowalevskii]
          Length = 143

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 69/136 (50%), Gaps = 20/136 (14%)

Query: 10  SDDVSLVVVLLDTNPFFW------SSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATG 63
            ++ SL+VV++D NP +W        + L F+Q +  +L ++N+ L +++ N+V +IA+ 
Sbjct: 4   EEESSLLVVIVDVNPIWWGKMRIQDENDLKFTQCMESILVYVNSHLMMHRSNKVAIIASH 63

Query: 64  YNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIAC----- 118
            NS  ++Y +++   +   +              E F   D+ + K+  +   A      
Sbjct: 64  TNSSCFLYPTATPTTEEEEDFLRDG-------KYEHFAKIDDTIFKEVKKLMTAHTAELH 116

Query: 119 --SLLSGSLSMALCYI 132
             +LL+GSL+MALC I
Sbjct: 117 IDTLLTGSLAMALCCI 132


>gi|336271807|ref|XP_003350661.1| hypothetical protein SMAC_02333 [Sordaria macrospora k-hell]
 gi|380094822|emb|CCC07324.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 409

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 43/72 (59%), Gaps = 3/72 (4%)

Query: 10 SDDV-SLVVVLLDTNPFFWSS--SSLSFSQFLTHVLAFLNAILTLNQLNQVVVIATGYNS 66
          +DD+ SL  +++DTNP  W++    L  ++ L ++L F+N+ L  N  NQV +IA+  N 
Sbjct: 18 TDDIPSLYTIIIDTNPRAWAALNDVLPLTKALANILIFVNSHLAFNNSNQVALIASHTNR 77

Query: 67 CDYVYDSSSTGN 78
            ++Y ++   N
Sbjct: 78 AVWLYPTAPDQN 89


>gi|134118157|ref|XP_772246.1| hypothetical protein CNBM0140 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50254857|gb|EAL17599.1| hypothetical protein CNBM0140 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 385

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 70/173 (40%), Gaps = 32/173 (18%)

Query: 14  SLVVVLLDTNPFFW---------------------SSSSLSFSQFLTHVLAFLNAILTLN 52
           S ++++LDT+P  W                     +SS  S  QF+T +  FLNA L   
Sbjct: 6   STLILVLDTHPLSWHLLAHLPPAPPLPDNKILDNATSSPTSLDQFITILTVFLNAHLASK 65

Query: 53  QLNQVVVIATGYNSCDYVYDSSSTGNQSVGNGRMPSLC-----ATLLQNLEEFMNK--DE 105
             N+VVV        + +Y  S+   +  G G  PS         L Q +EE + +   E
Sbjct: 66  WGNEVVVYTASAGKAELIYPPSNEKIRQRGEGAKPSANMYRPFQILDQGIEEGLKEVVRE 125

Query: 106 QLGKQEPEGR---IACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPD 155
           + GK   EG         +  +L+ ALC+I R   S  +   P  L L   P+
Sbjct: 126 EEGKLNTEGAGFINQPPAMVSALTKALCFINRRI-SPSVPTDPTALPLSSDPN 177


>gi|160331405|ref|XP_001712410.1| hypothetical protein HAN_2g261 [Hemiselmis andersenii]
 gi|159765858|gb|ABW98085.1| hypothetical protein HAN_2g261 [Hemiselmis andersenii]
          Length = 293

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 38/139 (27%), Positives = 59/139 (42%), Gaps = 22/139 (15%)

Query: 153 SPDGPEQYVAIMNAIFS-AQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLF 211
           S    ++++A  N +F  AQ++    D+   G ++  F    S  T G++ +P  L  LF
Sbjct: 162 SSKTRKEHIAFSNQLFFLAQKNKFSFDTLLFGRKDCFFFHFLSEKTKGIYCRP--LKNLF 219

Query: 212 QY---------LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDM----GYICSVC 258
           +          LL++F T   SR F  LP      F       KNT ++     + C +C
Sbjct: 220 ELAFTEEFITILLSLFLTSPFSREFYILP------FSTKILNKKNTTNLIAKKKFSCPIC 273

Query: 259 LSIYCKHLKKCSTCGSVFG 277
            SI+      C  CG VF 
Sbjct: 274 FSIFNFLFTNCFVCGFVFS 292


>gi|387592672|gb|EIJ87696.1| hypothetical protein NEQG_02243 [Nematocida parisii ERTm3]
 gi|387595301|gb|EIJ92926.1| hypothetical protein NEPG_02325 [Nematocida parisii ERTm1]
          Length = 210

 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 50/109 (45%), Gaps = 11/109 (10%)

Query: 168 FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 227
           F+A++  + +D C +   N  F+ Q S +           + LF+YL+ +   D     F
Sbjct: 111 FTARKLNIHVD-CII--PNDGFVSQVSDVLKRPAFCISSGESLFEYLMGMLSID-----F 162

Query: 228 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            ++ +    +    C CH   I  GY+C +CL +YCK +  C  C + F
Sbjct: 163 GRISRK---NVHRFCICHNKEIQTGYLCPICLGLYCKFVPLCKHCKTRF 208


>gi|378754778|gb|EHY64807.1| hypothetical protein NERG_02210 [Nematocida sp. 1 ERTm2]
          Length = 210

 Score = 45.4 bits (106), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 49/109 (44%), Gaps = 11/109 (10%)

Query: 168 FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 227
           F+A++  + +D C +   +  F  Q S +           + LF+YL+ +   D     F
Sbjct: 111 FTAKKQNIHVD-CVI--PHDGFASQISDVLNRPAFSISSNESLFEYLMGMLSMD-----F 162

Query: 228 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
            ++ K    +    C CH   I  GY+C +CL +YCK +  C  C + F
Sbjct: 163 GRISKK---NLSRFCICHHKEIQTGYLCPICLGLYCKFVPLCKHCKTRF 208


>gi|440302599|gb|ELP94906.1| RNA polymerase II transcription factor B subunit, partial
           [Entamoeba invadens IP1]
          Length = 188

 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 6/98 (6%)

Query: 121 LSGSLSMALCYIQRVFRSG-LLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 179
           LS ++S A+C I    +S   LH   +I+ +    +    ++ +MN IF+AQ+  V ID+
Sbjct: 89  LSAAMSSAMCRINSYSKSNPTLHK--KIVAISTPLNAAPTFITVMNNIFAAQKIGVSIDT 146

Query: 180 CYLGAQNS--AFLQQASYITGGVHHKPQQLDGLFQYLL 215
             +          QQASY+T G+++    LD L   LL
Sbjct: 147 IVISDNEEPCTICQQASYLTNGIYNM-SSLDTLLPRLL 183


>gi|269860080|ref|XP_002649763.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit TFB4 [Enterocytozoon
           bieneusi H348]
 gi|220066822|gb|EED44293.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit TFB4 [Enterocytozoon
           bieneusi H348]
          Length = 202

 Score = 45.1 bits (105), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 27/45 (60%)

Query: 233 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFG 277
           P+  +F  +C+C   TI +G+ CS+CL+IYCK +  C  C   F 
Sbjct: 155 PILNNFGVNCYCCNKTIYLGFCCSICLTIYCKFIPICKKCKIKFN 199


>gi|429963037|gb|ELA42581.1| hypothetical protein VICG_00333 [Vittaforma corneae ATCC 50505]
          Length = 215

 Score = 44.3 bits (103), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 28/118 (23%), Positives = 48/118 (40%), Gaps = 16/118 (13%)

Query: 159 QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIF 218
           QY+  +  +F AQ   +PI         +  ++     +GG+  +      +FQ L    
Sbjct: 110 QYLEYLKCMFVAQHRKIPIHG--FSLHRNILVRMCCEGSGGIFLESCSFSDMFQLL---- 163

Query: 219 GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 276
           G     ++  Q+           C C  N + +G +C VCL +YCK +  C  C + F
Sbjct: 164 GNRTKKKDAYQI----------KCACCNNFVTLGLVCPVCLLVYCKFMPVCKKCKTKF 211


>gi|68065658|ref|XP_674813.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56493631|emb|CAH95047.1| hypothetical protein PB001049.00.0 [Plasmodium berghei]
          Length = 89

 Score = 42.0 bits (97), Expect = 0.32,   Method: Composition-based stats.
 Identities = 22/59 (37%), Positives = 29/59 (49%), Gaps = 8/59 (13%)

Query: 240 ASCFCHKNTIDMGYICSVCLSIYC--------KHLKKCSTCGSVFGQAQTQSDEPSATN 290
           A C CH   ID+ YICS CL+IYC        K    CS C + F ++  ++   S  N
Sbjct: 27  AVCTCHNKQIDIAYICSCCLAIYCSEKDAQTNKERISCSVCKTRFTKSLLRNKTVSDLN 85


>gi|403359548|gb|EJY79438.1| putative: similar to general transcription factor IIH, polypeptide
           3, 34kDa [Oxytricha trifallax]
 gi|403360227|gb|EJY79783.1| putative: similar to general transcription factor IIH, polypeptide
           3, 34kDa [Oxytricha trifallax]
          Length = 255

 Score = 41.6 bits (96), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 34/129 (26%), Positives = 59/129 (45%), Gaps = 28/129 (21%)

Query: 118 CSLLSGSLSMALCYIQRVFRSGLLHPQP----RILCLQGSPDGPEQYVAIMNAIFSAQRS 173
           CSL S ++  A+C+ ++      +        RI+ ++   + P QY +++NAIF+AQ+ 
Sbjct: 114 CSL-STAMMKAVCFNKKWIEDVAVKRSESFSARIMIVKFQSEDPSQYSSLVNAIFAAQKL 172

Query: 174 MVPIDSCYL--------------------GAQNSAFLQQASYITGGVHHK--PQQLDG-L 210
            + +DS  +                      + S+ LQQA+ +T G  +K     LD  L
Sbjct: 173 KISVDSLIIQYAPPQAIQEPQDGKQQPASQEEESSLLQQAALLTNGTSYKLFSNNLDSEL 232

Query: 211 FQYLLTIFG 219
           F  LL +F 
Sbjct: 233 FPTLLELFS 241


>gi|403376319|gb|EJY88136.1| putative: similar to general transcription factor IIH, polypeptide
           3, 34kDa [Oxytricha trifallax]
          Length = 230

 Score = 41.2 bits (95), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 46/97 (47%), Gaps = 23/97 (23%)

Query: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL--------------------GAQ 185
           RI+ ++   + P QY +++NAIF+AQ+  + +DS  +                      +
Sbjct: 120 RIMIVKFQSEDPSQYSSLVNAIFAAQKLKISVDSLIIQYAPPQAIQEPQDGKQQPASQEE 179

Query: 186 NSAFLQQASYITGGVHHK--PQQLDG-LFQYLLTIFG 219
            S+ LQQA+ +T G  +K     LD  LF  LL +F 
Sbjct: 180 ESSLLQQAALLTNGTSYKLFSNNLDSELFPTLLELFS 216


>gi|429965399|gb|ELA47396.1| hypothetical protein VCUG_01165 [Vavraia culicis 'floridensis']
          Length = 215

 Score = 40.8 bits (94), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 37/136 (27%), Positives = 52/136 (38%), Gaps = 26/136 (19%)

Query: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 205
           RI+ +  S +  + Y+  + + F A+R        Y    N + L          HHK  
Sbjct: 99  RIIIISLSRENNKDYLKYLKSAFVAKR--------YSDRYNISVLS---------HHKNP 141

Query: 206 QLDGLFQYLLTIFGTDLHSRNFLQL---PKPVGVDF-RASCFCHKNTIDMGYICSVCLSI 261
            L  +       F  +     FLQ+    KP  + F    C CH   I  G +C VCLSI
Sbjct: 142 ALSEI-----GCFYNNFALSTFLQILSGKKPQKIFFCSTKCSCHDREILYGLVCPVCLSI 196

Query: 262 YCKHLKKCSTCGSVFG 277
           YC  +  C  C   F 
Sbjct: 197 YCSLIPICKRCRIRFN 212


>gi|395755950|ref|XP_003780047.1| PREDICTED: general transcription factor IIH subunit 3-like,
          partial [Pongo abelii]
          Length = 61

 Score = 40.8 bits (94), Expect = 0.76,   Method: Composition-based stats.
 Identities = 19/57 (33%), Positives = 36/57 (63%), Gaps = 5/57 (8%)

Query: 11 DDVSLVVVLLDTNPFFWSSSSLSFSQF-----LTHVLAFLNAILTLNQLNQVVVIAT 62
          D+++L+V+++D NP +W   +L  SQF     +  V+   N+ L +N+ N++ VIA+
Sbjct: 1  DELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIAS 57


>gi|195389498|ref|XP_002053413.1| GJ23349 [Drosophila virilis]
 gi|194151499|gb|EDW66933.1| GJ23349 [Drosophila virilis]
          Length = 490

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 28/62 (45%)

Query: 199 GVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC 258
           G +      +GL   +L  F         L  PK   +D RA C+CH   I++G  CS C
Sbjct: 9   GFYFDTSAFEGLCMNVLCQFLASTQRVYHLNYPKQSPMDLRALCYCHNKRIEIGRACSNC 68

Query: 259 LS 260
           LS
Sbjct: 69  LS 70


>gi|84997465|ref|XP_953454.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65304450|emb|CAI76829.1| hypothetical protein, conserved [Theileria annulata]
          Length = 391

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 43/164 (26%), Positives = 64/164 (39%), Gaps = 37/164 (22%)

Query: 147 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGG------- 199
           I  +  S +   QY+ +MN  +SA    + I++  LG Q S  L+Q S IT         
Sbjct: 209 IFDISNSENYKSQYIGLMNIAYSALSQNITINTFALG-QPSRILEQLSAITNAKYLLVSK 267

Query: 200 -------VHHKPQQLDGLFQYLL--TIFGTDLHSRNFLQ-LPKPVGVDF--RASCFCHKN 247
                   +H  Q L  L   L+   I    L S    + L   +  DF     C+CH  
Sbjct: 268 LLNFDLNFNHINQYLTQLITVLILFIILFWYLPSEGMSEMLSTNLSFDFGNVGICYCHYK 327

Query: 248 TIDMGYICSVCLS----------IYC-------KHLKKCSTCGS 274
           ++D+ Y+C  C +          +YC       K+   C  CGS
Sbjct: 328 SVDVTYLCPCCFAGNFYSLIRFIVYCSEIDDKGKYRIICMVCGS 371


>gi|291000862|ref|XP_002682998.1| RNA polymerase II transcription factor TFIIH subunit [Naegleria
           gruberi]
 gi|284096626|gb|EFC50254.1| RNA polymerase II transcription factor TFIIH subunit [Naegleria
           gruberi]
          Length = 349

 Score = 39.3 bits (90), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 18/32 (56%)

Query: 242 CFCHKNTIDMGYICSVCLSIYCKHLKKCSTCG 273
           C CHK     GYIC  C S YC    +CSTCG
Sbjct: 240 CLCHKKITPGGYICPRCKSKYCDLPVECSTCG 271


>gi|413935397|gb|AFW69948.1| hypothetical protein ZEAMMB73_462205, partial [Zea mays]
          Length = 22

 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 16/22 (72%), Positives = 20/22 (90%)

Query: 251 MGYICSVCLSIYCKHLKKCSTC 272
           MGY+CSVCLSI+C++ K CSTC
Sbjct: 1   MGYVCSVCLSIFCRNQKTCSTC 22


>gi|118401911|ref|XP_001033275.1| hypothetical protein TTHERM_00420200 [Tetrahymena thermophila]
 gi|89287623|gb|EAR85612.1| hypothetical protein TTHERM_00420200 [Tetrahymena thermophila
           SB210]
          Length = 351

 Score = 38.1 bits (87), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 35/69 (50%), Gaps = 12/69 (17%)

Query: 209 GLFQYL--LTIFGTDLHSRNFLQLPK-PV-------GVDFRASCFCHKNTID--MGYICS 256
            LF Y+  L +   D+ ++   QL K PV        V FR +C CH   +D  + Y+CS
Sbjct: 270 DLFNYISILMLTSGDMLAKPEQQLTKNPVFRPAINEKVQFRVNCSCHNTPMDSDIAYVCS 329

Query: 257 VCLSIYCKH 265
            CL +YCK 
Sbjct: 330 DCLFLYCKQ 338


>gi|440492213|gb|ELQ74803.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit TFB4 [Trachipleistophora
           hominis]
          Length = 247

 Score = 37.4 bits (85), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 26/55 (47%), Gaps = 4/55 (7%)

Query: 227 FLQL---PKPVGVDF-RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFG 277
           FLQ+    KP  + F    C CH   I  G +C VCLSIYC  +  C  C   F 
Sbjct: 190 FLQILCGRKPQKIFFCSTKCNCHGREILYGLVCPVCLSIYCSLIPICKRCRIRFN 244


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.134    0.404 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,564,603,000
Number of Sequences: 23463169
Number of extensions: 185319872
Number of successful extensions: 373592
Number of sequences better than 100.0: 398
Number of HSP's better than 100.0 without gapping: 370
Number of HSP's successfully gapped in prelim test: 28
Number of HSP's that attempted gapping in prelim test: 372179
Number of HSP's gapped (non-prelim): 535
length of query: 298
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 157
effective length of database: 9,050,888,538
effective search space: 1420989500466
effective search space used: 1420989500466
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)