BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 028156
         (213 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|18394597|ref|NP_564050.1| transcription initiation factor TFIIH subunit H3 [Arabidopsis
           thaliana]
 gi|21537277|gb|AAM61618.1| unknown [Arabidopsis thaliana]
 gi|92856638|gb|ABE77412.1| At1g18340 [Arabidopsis thaliana]
 gi|332191584|gb|AEE29705.1| transcription initiation factor TFIIH subunit H3 [Arabidopsis
           thaliana]
          Length = 301

 Score =  347 bits (891), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 165/210 (78%), Positives = 185/210 (88%), Gaps = 2/210 (0%)

Query: 1   MPSLCATLLQNLEEFMNKDEQLGKQE-PEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ 59
           MP++  +LL+ LEEF+ KDE+L K+E  E RI   LLSGSLSMALCYIQRVFRSG LHPQ
Sbjct: 89  MPAIFGSLLKKLEEFVTKDEELSKEEVSEDRIPSCLLSGSLSMALCYIQRVFRSGHLHPQ 148

Query: 60  PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 119
           PRILCLQGSPDGPEQYVA+MN+IFSAQR MVPIDSCY+G QNSAFLQQASYITGGVHH P
Sbjct: 149 PRILCLQGSPDGPEQYVAVMNSIFSAQRLMVPIDSCYIGVQNSAFLQQASYITGGVHHTP 208

Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 179
           +QLDGLFQYL TIF TDLHSR F+QLPKP+GVDFRASCFCHK TIDMGYICSVCLSI+C+
Sbjct: 209 KQLDGLFQYLTTIFATDLHSRGFVQLPKPIGVDFRASCFCHKKTIDMGYICSVCLSIFCE 268

Query: 180 HLKKCSTCGSVFGQAQTQSDEPSATNRKRK 209
           H KKCSTCGSVFGQ++   D  SA+++KRK
Sbjct: 269 HHKKCSTCGSVFGQSKLD-DASSASDKKRK 297


>gi|297850232|ref|XP_002892997.1| hypothetical protein ARALYDRAFT_472054 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338839|gb|EFH69256.1| hypothetical protein ARALYDRAFT_472054 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 301

 Score =  345 bits (884), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 163/210 (77%), Positives = 185/210 (88%), Gaps = 2/210 (0%)

Query: 1   MPSLCATLLQNLEEFMNKDEQLGKQE-PEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ 59
           MP+L  +LL+ LE+F+ KDE+L ++E  E RI   LLSGSLSMALCYIQRVFRSG LHPQ
Sbjct: 89  MPALFGSLLKKLEDFVTKDEELSREEVSEDRIPSCLLSGSLSMALCYIQRVFRSGHLHPQ 148

Query: 60  PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 119
           PRILCLQGSPDGPEQYVA+MN+IFSAQR MVPIDSCY+G QNSAFLQQASYITGGVHH P
Sbjct: 149 PRILCLQGSPDGPEQYVAVMNSIFSAQRLMVPIDSCYIGVQNSAFLQQASYITGGVHHTP 208

Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 179
           +QLDGLFQYL TIF TDLHSR+F+QLPKP+GVDFRASCFCHK TIDMGYICSVCLSI+C+
Sbjct: 209 KQLDGLFQYLTTIFATDLHSRSFVQLPKPIGVDFRASCFCHKKTIDMGYICSVCLSIFCE 268

Query: 180 HLKKCSTCGSVFGQAQTQSDEPSATNRKRK 209
           H KKCSTCGSVFGQ++   D  S +++KRK
Sbjct: 269 HHKKCSTCGSVFGQSKLD-DASSVSDKKRK 297


>gi|225459534|ref|XP_002284465.1| PREDICTED: general transcription factor IIH subunit 3 [Vitis
           vinifera]
 gi|302141830|emb|CBI19033.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  337 bits (865), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 173/212 (81%), Positives = 187/212 (88%), Gaps = 2/212 (0%)

Query: 1   MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP 60
           MP+LC+ LLQ LEEF+  DE+L K+     I  SLLSGSLSMALCYIQRVFR+G LHPQP
Sbjct: 87  MPALCSNLLQKLEEFVTGDEKLSKEVLAAGIGSSLLSGSLSMALCYIQRVFRTGPLHPQP 146

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RILCLQGSPDGPEQYVA+MNAIFSAQRSMVPIDSC +GAQ+SAFLQQASYITGGV+ KPQ
Sbjct: 147 RILCLQGSPDGPEQYVAVMNAIFSAQRSMVPIDSCVIGAQHSAFLQQASYITGGVYLKPQ 206

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
           QLDGLFQYL T+F TDLHSR FLQLPKP GVDFRASCFCHKNTIDMGYICSVCLSI+CKH
Sbjct: 207 QLDGLFQYLSTVFATDLHSRRFLQLPKPAGVDFRASCFCHKNTIDMGYICSVCLSIFCKH 266

Query: 181 LKKCSTCGSVFGQAQTQSDEPSATNRKRKTTD 212
            KKCSTCGSVFGQA  QSD  SAT+RKRKT +
Sbjct: 267 HKKCSTCGSVFGQA--QSDGNSATDRKRKTPE 296


>gi|224084866|ref|XP_002307429.1| predicted protein [Populus trichocarpa]
 gi|222856878|gb|EEE94425.1| predicted protein [Populus trichocarpa]
          Length = 289

 Score =  335 bits (858), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 169/213 (79%), Positives = 185/213 (86%), Gaps = 2/213 (0%)

Query: 1   MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP 60
           MPSL + LLQ LEEFM KDE+LGK++ +  I  SLLSGSLSMALCYIQRVFRSG LHPQP
Sbjct: 79  MPSLYSNLLQKLEEFMIKDEKLGKEQSQRAIKSSLLSGSLSMALCYIQRVFRSGPLHPQP 138

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RILCLQGSPDGPEQYVA+MNAIFSAQRSMVPIDSCY+GA NSAFLQQASYITGGV+ KPQ
Sbjct: 139 RILCLQGSPDGPEQYVAVMNAIFSAQRSMVPIDSCYVGAHNSAFLQQASYITGGVYVKPQ 198

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
            LDGLFQYL T+F TDLHSR+F+QLP+P GVDFRASCFCHK TIDMGYICSVCLSI+C H
Sbjct: 199 HLDGLFQYLTTVFATDLHSRSFIQLPRPAGVDFRASCFCHKTTIDMGYICSVCLSIFCNH 258

Query: 181 LKKCSTCGSVFGQAQTQSDEPSATNRKRKTTDA 213
            KKCSTCGSVFGQA  QSD  S ++ KRK  + 
Sbjct: 259 HKKCSTCGSVFGQA--QSDTSSTSDLKRKAPET 289


>gi|449470273|ref|XP_004152842.1| PREDICTED: general transcription factor IIH subunit 3-like [Cucumis
           sativus]
          Length = 295

 Score =  330 bits (846), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 166/212 (78%), Positives = 184/212 (86%), Gaps = 2/212 (0%)

Query: 1   MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP 60
           MP+LC  LL+NLEEF+  DEQ  K++P+G    SLLSGSLSMALCYIQ+VFRSG LHPQP
Sbjct: 86  MPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSLLSGSLSMALCYIQKVFRSGSLHPQP 145

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCY+G+ NSAFLQQASYITGGV+ KPQ
Sbjct: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQ 205

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
           Q+DGLFQYL T+FGTDLHSR FLQLPK VGVDFRASCFCHK TIDMGY+CSVCLSI+CKH
Sbjct: 206 QMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKH 265

Query: 181 LKKCSTCGSVFGQAQTQSDEPSATNRKRKTTD 212
            KKCSTCGSVFG+   + D  S +  KRKT +
Sbjct: 266 HKKCSTCGSVFGETPVELD--SVSKLKRKTPE 295


>gi|357461529|ref|XP_003601046.1| General transcription factor IIH subunit [Medicago truncatula]
 gi|355490094|gb|AES71297.1| General transcription factor IIH subunit [Medicago truncatula]
          Length = 340

 Score =  298 bits (762), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 151/254 (59%), Positives = 177/254 (69%), Gaps = 47/254 (18%)

Query: 1   MPSLCATLLQNLEEFMNKDEQLGK-QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ 59
           MP++ + LL NL+EF+ KD+QL       G I  SLLSG+LSMALCYIQR FRSG +HPQ
Sbjct: 87  MPAIYSNLLHNLDEFVAKDQQLTTPHHKPGTIPSSLLSGALSMALCYIQRAFRSGPMHPQ 146

Query: 60  PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 119
           PRILCLQGS DGPEQYVAIMNAIFSAQRS+VP+DSC++G+ NSAFLQQASYITGG++++P
Sbjct: 147 PRILCLQGSTDGPEQYVAIMNAIFSAQRSVVPVDSCFIGSNNSAFLQQASYITGGIYYRP 206

Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 179
            Q++GLFQYL T+F TDLHSR FL+LPK +GVDFRASCFCHK TIDMGY+CSVCLSI+C+
Sbjct: 207 PQMEGLFQYLSTVFATDLHSREFLRLPKSLGVDFRASCFCHKQTIDMGYVCSVCLSIFCE 266

Query: 180 HLKKCSTC--------------------------------------------GSVFGQAQ 195
           H  KCSTC                                             SVFG  Q
Sbjct: 267 HHDKCSTCSFTIYDASNECKCSLFADCSNQFNLAEIFPLSAVFFNFFYSFIISSVFG--Q 324

Query: 196 TQSDEPSATNRKRK 209
           TQS+  S  NRKRK
Sbjct: 325 TQSEAASEENRKRK 338


>gi|356509424|ref|XP_003523449.1| PREDICTED: general transcription factor IIH subunit 3-like [Glycine
           max]
          Length = 294

 Score =  297 bits (761), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 152/211 (72%), Positives = 174/211 (82%), Gaps = 5/211 (2%)

Query: 1   MPSLCATLLQNLEEFMNKDEQLGKQEPE--GRIACSLLSGSLSMALCYIQRVFRSGLLHP 58
           MP+L + LL NL+EF+ +D QL        G +  SLLSGSLSMALCYIQR FRSG + P
Sbjct: 86  MPALYSNLLHNLDEFLARDRQLDADASHAPGIVPSSLLSGSLSMALCYIQRAFRSGPMPP 145

Query: 59  QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
           QPRILCLQG+ DGPEQYVAIMNAIFSAQ S VPIDSCY+G+ NSAFLQQASYITGG+++K
Sbjct: 146 QPRILCLQGAADGPEQYVAIMNAIFSAQHSTVPIDSCYIGSNNSAFLQQASYITGGIYYK 205

Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
           P QLDGL+QYL T+F TDLHSR FL+LPK VGVDFRASCFCHK TIDMGY+CSVCLSI+C
Sbjct: 206 PPQLDGLYQYLSTVFATDLHSRAFLRLPKSVGVDFRASCFCHKQTIDMGYVCSVCLSIFC 265

Query: 179 KHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 209
           +H  KCSTCGSVFGQAQ    + SA +R+RK
Sbjct: 266 EHHDKCSTCGSVFGQAQL---DASAADRQRK 293


>gi|363806998|ref|NP_001242062.1| uncharacterized protein LOC100776751 [Glycine max]
 gi|255647869|gb|ACU24393.1| unknown [Glycine max]
          Length = 294

 Score =  294 bits (752), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 150/211 (71%), Positives = 173/211 (81%), Gaps = 5/211 (2%)

Query: 1   MPSLCATLLQNLEEFMNKDEQLGKQEPE--GRIACSLLSGSLSMALCYIQRVFRSGLLHP 58
           MP+L + LL NL+EF+ +D QL        G +  SLLSGSLSMALCYIQR FRSG + P
Sbjct: 86  MPALYSNLLHNLDEFLARDRQLDADAAHAPGTVPSSLLSGSLSMALCYIQRAFRSGPMPP 145

Query: 59  QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
           QPRILCL+G  DGPEQYVAIMNAIFSAQ S VPIDSCY+G+ NSAFLQQASYITGG+++K
Sbjct: 146 QPRILCLRGVADGPEQYVAIMNAIFSAQHSTVPIDSCYIGSNNSAFLQQASYITGGIYYK 205

Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
           P QLDGL+QYL T+F TDL+SR FL+LPK VGVDFRASCFCHK TIDMGY+CSVCLSI+C
Sbjct: 206 PPQLDGLYQYLSTVFATDLYSRAFLRLPKSVGVDFRASCFCHKQTIDMGYVCSVCLSIFC 265

Query: 179 KHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 209
           +H  KCSTCGSVFGQAQ    + SA +R+RK
Sbjct: 266 EHHDKCSTCGSVFGQAQL---DASAADRQRK 293


>gi|294464556|gb|ADE77788.1| unknown [Picea sitchensis]
          Length = 306

 Score =  293 bits (749), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 138/200 (69%), Positives = 161/200 (80%), Gaps = 5/200 (2%)

Query: 8   LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 67
           ++Q LE+F+ +D+ L       +   SLLSGSLSMALCYIQR+ R G  HPQPRILCLQG
Sbjct: 111 IVQQLEDFVRRDQALNTNNNATKNIHSLLSGSLSMALCYIQRIIRGGPSHPQPRILCLQG 170

Query: 68  SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
           SPDGP+QYVAIMNAIFSAQRSMVP+DSC +G+Q+SAFLQQAS+ITGG++ KPQQL+GLFQ
Sbjct: 171 SPDGPQQYVAIMNAIFSAQRSMVPVDSCIVGSQHSAFLQQASHITGGIYLKPQQLEGLFQ 230

Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
           +L  +F TDLHSR FLQLP+P GVDFRASCFCHK TIDMGY+CSVCLSI+C+H KKCSTC
Sbjct: 231 FLAMVFATDLHSRRFLQLPRPTGVDFRASCFCHKKTIDMGYVCSVCLSIFCEHSKKCSTC 290

Query: 188 GSVFGQAQTQSDEPSATNRK 207
            S F Q       PS   RK
Sbjct: 291 ESTFNQ-----KPPSGQKRK 305


>gi|357146071|ref|XP_003573866.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Brachypodium distachyon]
          Length = 290

 Score =  288 bits (738), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 138/197 (70%), Positives = 160/197 (81%), Gaps = 5/197 (2%)

Query: 2   PSLCATLLQ---NLEEFMNKD--EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL 56
           P + AT  +    +EEF+ +D  +  G     G  A SL SG+LS+ALCYIQRVFRSG  
Sbjct: 82  PDVAATFAKASRKVEEFVKQDARDTAGNGTVAGGGAASLFSGALSLALCYIQRVFRSGTR 141

Query: 57  HPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
           HPQPRILCLQGSPDGPEQYVA+MN+IFSAQRSMVPIDSC +G Q+SAFLQQASYITGGV+
Sbjct: 142 HPQPRILCLQGSPDGPEQYVAVMNSIFSAQRSMVPIDSCIVGTQDSAFLQQASYITGGVY 201

Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 176
            KPQ+L GLFQYL  +F +DLHSR FL+LPK +GVDFRASCFCHK TIDMGY+CSVCLSI
Sbjct: 202 LKPQELSGLFQYLAAVFASDLHSRAFLRLPKTLGVDFRASCFCHKKTIDMGYVCSVCLSI 261

Query: 177 YCKHLKKCSTCGSVFGQ 193
           +CK+ KKCSTCGS F +
Sbjct: 262 FCKYHKKCSTCGSEFSR 278


>gi|212722318|ref|NP_001131186.1| uncharacterized protein LOC100192494 [Zea mays]
 gi|194688300|gb|ACF78234.1| unknown [Zea mays]
 gi|194690816|gb|ACF79492.1| unknown [Zea mays]
 gi|224028537|gb|ACN33344.1| unknown [Zea mays]
 gi|413926714|gb|AFW66646.1| hypothetical protein ZEAMMB73_940692 [Zea mays]
          Length = 291

 Score =  286 bits (733), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 134/187 (71%), Positives = 157/187 (83%), Gaps = 2/187 (1%)

Query: 10  QNLEEFMNKDEQL--GKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 67
           +  EEF+ +D +   G        A SLLSG+LS++LCYIQR+FRSG  HPQPRILCLQG
Sbjct: 93  RKAEEFIAQDARATAGNGSVASANAASLLSGALSISLCYIQRIFRSGSRHPQPRILCLQG 152

Query: 68  SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
           SPDGPEQYVA+MN+IFSAQRSMVPIDSC +G Q+SAFLQQASYITGGV+ KPQ+L+GLFQ
Sbjct: 153 SPDGPEQYVAVMNSIFSAQRSMVPIDSCIVGTQDSAFLQQASYITGGVYLKPQELNGLFQ 212

Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
           YL  +F TDLHSR FL+LPK +GVDFRASCFCHK TIDMGY+CS+CLSI+CK+ KKCSTC
Sbjct: 213 YLAAVFATDLHSRAFLRLPKTLGVDFRASCFCHKKTIDMGYVCSICLSIFCKNQKKCSTC 272

Query: 188 GSVFGQA 194
           GS F +A
Sbjct: 273 GSEFSRA 279


>gi|326513394|dbj|BAK06937.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 292

 Score =  285 bits (728), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 133/186 (71%), Positives = 155/186 (83%), Gaps = 2/186 (1%)

Query: 10  QNLEEFMNKD--EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 67
           + LEE + +D  E +G        A SL SG+LS+ALCYIQR+FRSG  HPQPRILC+QG
Sbjct: 93  RKLEELIKQDARETVGNGTVADGGAASLFSGALSLALCYIQRIFRSGSRHPQPRILCVQG 152

Query: 68  SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
           SPDGPEQYVA+MN+IFSAQRSMVPID+C +G Q+SAFLQQASYITGGV+ KPQ+L GLFQ
Sbjct: 153 SPDGPEQYVAVMNSIFSAQRSMVPIDTCIVGTQDSAFLQQASYITGGVYMKPQELSGLFQ 212

Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
           YL  +F TDLHSR FL+LPK +GVDFRASCFCHK TIDMGY+CSVCLSI+CK+ KKCSTC
Sbjct: 213 YLAAVFATDLHSRTFLRLPKTLGVDFRASCFCHKKTIDMGYVCSVCLSIFCKYHKKCSTC 272

Query: 188 GSVFGQ 193
           GS F +
Sbjct: 273 GSEFSR 278


>gi|255584434|ref|XP_002532948.1| RNA polymerase II transcription factor B subunit, putative [Ricinus
           communis]
 gi|223527277|gb|EEF29432.1| RNA polymerase II transcription factor B subunit, putative [Ricinus
           communis]
          Length = 266

 Score =  277 bits (708), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 137/172 (79%), Positives = 156/172 (90%), Gaps = 1/172 (0%)

Query: 1   MPSLCATLLQNLEEFMNKDEQLGK-QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ 59
           +P+L + LLQ LE+FM +DE+LGK +E + +IA SLLSGSLSMALCYIQR+FRSG LHPQ
Sbjct: 86  IPALYSDLLQKLEDFMIRDEKLGKGEESKRKIASSLLSGSLSMALCYIQRIFRSGPLHPQ 145

Query: 60  PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 119
           PRILCLQGSPDGPEQYVA+MN+IFSAQRSMVPIDSCY+G  NSAFLQQASYITGGV+ KP
Sbjct: 146 PRILCLQGSPDGPEQYVAVMNSIFSAQRSMVPIDSCYVGGHNSAFLQQASYITGGVYVKP 205

Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICS 171
           Q LDGLFQYL+T+F TDLHSR+FLQLP+P GVDFRASCFCHK TIDMG+ICS
Sbjct: 206 QHLDGLFQYLVTVFATDLHSRSFLQLPRPAGVDFRASCFCHKTTIDMGFICS 257


>gi|223946567|gb|ACN27367.1| unknown [Zea mays]
 gi|413926715|gb|AFW66647.1| hypothetical protein ZEAMMB73_940692 [Zea mays]
          Length = 294

 Score =  277 bits (708), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 133/190 (70%), Positives = 156/190 (82%), Gaps = 5/190 (2%)

Query: 10  QNLEEFMNKDEQL--GKQEPEGRIACSLLSGSLSMALC---YIQRVFRSGLLHPQPRILC 64
           +  EEF+ +D +   G        A SLLSG+LS++LC    IQR+FRSG  HPQPRILC
Sbjct: 93  RKAEEFIAQDARATAGNGSVASANAASLLSGALSISLCCYLDIQRIFRSGSRHPQPRILC 152

Query: 65  LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
           LQGSPDGPEQYVA+MN+IFSAQRSMVPIDSC +G Q+SAFLQQASYITGGV+ KPQ+L+G
Sbjct: 153 LQGSPDGPEQYVAVMNSIFSAQRSMVPIDSCIVGTQDSAFLQQASYITGGVYLKPQELNG 212

Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 184
           LFQYL  +F TDLHSR FL+LPK +GVDFRASCFCHK TIDMGY+CS+CLSI+CK+ KKC
Sbjct: 213 LFQYLAAVFATDLHSRAFLRLPKTLGVDFRASCFCHKKTIDMGYVCSICLSIFCKNQKKC 272

Query: 185 STCGSVFGQA 194
           STCGS F +A
Sbjct: 273 STCGSEFSRA 282


>gi|115443937|ref|NP_001045748.1| Os02g0125800 [Oryza sativa Japonica Group]
 gi|41053068|dbj|BAD08012.1| putative TFIIH basal transcription factor complex p34 subunit
           [Oryza sativa Japonica Group]
 gi|113535279|dbj|BAF07662.1| Os02g0125800 [Oryza sativa Japonica Group]
 gi|215768495|dbj|BAH00724.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 289

 Score =  267 bits (682), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 121/149 (81%), Positives = 135/149 (90%)

Query: 45  CYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAF 104
           CYIQRVFRSG  HPQPRILCLQGSPDGPEQYVA+MN+IFSAQRSMVPIDSC +G Q+SAF
Sbjct: 128 CYIQRVFRSGTRHPQPRILCLQGSPDGPEQYVAVMNSIFSAQRSMVPIDSCIVGTQDSAF 187

Query: 105 LQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTI 164
           LQQASYITGGV+ KPQ+L+GLFQYL  +F TDLHSR FL+LPK +GVDFRASCFCHK TI
Sbjct: 188 LQQASYITGGVYLKPQELNGLFQYLAAVFATDLHSRTFLRLPKTLGVDFRASCFCHKKTI 247

Query: 165 DMGYICSVCLSIYCKHLKKCSTCGSVFGQ 193
           DMGY+CSVCLSI+CK+ KKCSTCGS F +
Sbjct: 248 DMGYVCSVCLSIFCKYHKKCSTCGSEFNR 276


>gi|168000809|ref|XP_001753108.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695807|gb|EDQ82149.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 257

 Score =  265 bits (678), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 124/202 (61%), Positives = 162/202 (80%), Gaps = 6/202 (2%)

Query: 8   LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 67
           + ++++EF++K+E+  K+  +  I  S +SGSLSMALCYIQRV R  L HPQ RILCLQG
Sbjct: 58  ITKSMQEFVSKEEEASKESSQ--IQYSNISGSLSMALCYIQRVLRGPLPHPQARILCLQG 115

Query: 68  SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
           SPD P QY+++MN+IFSAQR+ V ID+C +G+Q+SAFLQQA++ITGG++ KP Q DGLF+
Sbjct: 116 SPDAPYQYISVMNSIFSAQRNGVVIDACVVGSQHSAFLQQATHITGGIYMKPPQPDGLFE 175

Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
           YL+ +F TDL SR FLQLP+PVGVDFRASCFCHK TIDMG++CSVCLSI+CKH + CSTC
Sbjct: 176 YLMMVFTTDLFSRQFLQLPRPVGVDFRASCFCHKKTIDMGFVCSVCLSIFCKHHQTCSTC 235

Query: 188 GSVFGQAQTQSDEPSATNRKRK 209
           G+ F    ++S+  + T  KRK
Sbjct: 236 GATF----SRSNSTTVTGMKRK 253


>gi|384252816|gb|EIE26291.1| transcription factor Tfb4 [Coccomyxa subellipsoidea C-169]
          Length = 305

 Score =  207 bits (526), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 92/162 (56%), Positives = 125/162 (77%), Gaps = 4/162 (2%)

Query: 34  SLLSGSLSMALCYIQRVFRSGL----LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
           ++L+G+LS ALC I +   +         QPR+LCL+GSPD  EQY+++MNAIF+AQR  
Sbjct: 126 AVLAGALSRALCLINKAQLASAGGSRTRKQPRLLCLKGSPDATEQYISVMNAIFAAQRCE 185

Query: 90  VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
           V ID+C LGA +SAFLQQA+++T G++ +P+  D L QYLLTIF  D  SR+FLQLP+P+
Sbjct: 186 VVIDACMLGASDSAFLQQAAHLTNGIYLRPKHKDALLQYLLTIFCGDAFSRSFLQLPRPI 245

Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           GVDFRA+CFCH+  ID+GY+CSVCLSI+C+ + +CSTCG+ F
Sbjct: 246 GVDFRAACFCHRRAIDLGYVCSVCLSIFCESITECSTCGTTF 287


>gi|449524543|ref|XP_004169281.1| PREDICTED: general transcription factor IIH subunit 3-like, partial
           [Cucumis sativus]
          Length = 121

 Score =  200 bits (509), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 93/118 (78%), Positives = 102/118 (86%)

Query: 90  VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
           VPIDSCY+G+ NSAFLQQASYITGGV+ KPQQ+DGLFQYL T+FGTDLHSR FLQLPK V
Sbjct: 1   VPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSV 60

Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRK 207
           GVDFRASCFCHK TIDMGY+CSVCLSI+CKH KKCSTCGSVFG+   + D  S   RK
Sbjct: 61  GVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVELDSVSKLKRK 118


>gi|218189967|gb|EEC72394.1| hypothetical protein OsI_05675 [Oryza sativa Indica Group]
 gi|222622090|gb|EEE56222.1| hypothetical protein OsJ_05207 [Oryza sativa Japonica Group]
          Length = 261

 Score =  197 bits (500), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 95/149 (63%), Positives = 107/149 (71%), Gaps = 28/149 (18%)

Query: 45  CYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAF 104
           CYIQRVFRSG  HPQPR                            VPIDSC +G Q+SAF
Sbjct: 128 CYIQRVFRSGTRHPQPR----------------------------VPIDSCIVGTQDSAF 159

Query: 105 LQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTI 164
           LQQASYITGGV+ KPQ+L+GLFQYL  +F TDLHSR FL+LPK +GVDFRASCFCHK TI
Sbjct: 160 LQQASYITGGVYLKPQELNGLFQYLAAVFATDLHSRTFLRLPKTLGVDFRASCFCHKKTI 219

Query: 165 DMGYICSVCLSIYCKHLKKCSTCGSVFGQ 193
           DMGY+CSVCLSI+CK+ KKCSTCGS F +
Sbjct: 220 DMGYVCSVCLSIFCKYHKKCSTCGSEFNR 248


>gi|330789994|ref|XP_003283083.1| hypothetical protein DICPUDRAFT_9126 [Dictyostelium purpureum]
 gi|325086950|gb|EGC40332.1| hypothetical protein DICPUDRAFT_9126 [Dictyostelium purpureum]
          Length = 282

 Score =  178 bits (452), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 89/184 (48%), Positives = 116/184 (63%), Gaps = 9/184 (4%)

Query: 9   LQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGS 68
           LQ +   +N D+Q         I  S  S +LSMALCYI R+ R    + +PRIL    S
Sbjct: 105 LQKINIEVNNDQQ--------DIINSSFSAALSMALCYINRI-RKESPNLKPRILVFNIS 155

Query: 69  PDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQY 128
           PD   QY+++MN IFSAQ+  +P+DSC L   +S FLQQAS++T G++ KP + + L QY
Sbjct: 156 PDVSSQYISVMNCIFSAQKQSIPVDSCILSQTDSTFLQQASHLTNGIYLKPHKQELLSQY 215

Query: 129 LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCG 188
           LLT F  D  SR  L  P    VD+RASCFCHK  +D+G++CSVCLSI+C H   CSTCG
Sbjct: 216 LLTTFLLDTQSRKSLAYPNLKSVDYRASCFCHKKIVDIGFVCSVCLSIFCNHFSSCSTCG 275

Query: 189 SVFG 192
           + F 
Sbjct: 276 TKFS 279


>gi|145340718|ref|XP_001415467.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575690|gb|ABO93759.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 285

 Score =  177 bits (449), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 89/181 (49%), Positives = 116/181 (64%), Gaps = 4/181 (2%)

Query: 27  PEGRIACSL-LSGSLSMALCYIQR---VFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAI 82
           P  R A S   +G+LS+ALCY  R   +  +  L  +PRILCLQ S D P  Y+++MNAI
Sbjct: 105 PSSRAAMSSPFAGALSLALCYCNRAQTLETAAGLRVRPRILCLQASQDNPTDYISMMNAI 164

Query: 83  FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 142
           FSAQR  +PID+  LG  +  FLQQA++IT G + KP    GL QYLL+    D+ SR+ 
Sbjct: 165 FSAQRQSIPIDAFALGEHDLPFLQQAAHITRGAYVKPTHGAGLLQYLLSTAALDMRSRSH 224

Query: 143 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPS 202
           L+LP   GVDFRASCFCHK  + +G++CSVCLSI+C+    C TCG+ F      +  PS
Sbjct: 225 LKLPAARGVDFRASCFCHKRPVSVGFVCSVCLSIFCERRSSCDTCGADFAADAQVTSVPS 284

Query: 203 A 203
           A
Sbjct: 285 A 285


>gi|6714290|gb|AAF25986.1|AC013354_5 F15H18.15 [Arabidopsis thaliana]
          Length = 590

 Score =  172 bits (436), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 105/215 (48%), Positives = 133/215 (61%), Gaps = 40/215 (18%)

Query: 1   MPSLCATLLQNLEEFMNKDEQLGKQE-PEGRI-ACSLLSGSLSMALCYIQRVFRSGLLHP 58
           MP++  +LL+ LEEF+ KDE+L K+E  E RI +C L  G++    C+    F +  +  
Sbjct: 97  MPAIFGSLLKKLEEFVTKDEELSKEEVSEDRIPSCLLWPGTV----CFFLHFFSTCKI-- 150

Query: 59  QPRILCLQGSPD---GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGV 115
              + CL G         +YVA+MN+IFSAQR M    + Y  A NS             
Sbjct: 151 ---LSCLSGLIKRFCHSFRYVAVMNSIFSAQRLM---RTQYSMAANS------------- 191

Query: 116 HHKPQQLDGLFQYL-LTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCL 174
                   GLF +   TIF TDLHSR F+QLPKP+GVDFRASCFCHK TIDMGYICSVCL
Sbjct: 192 --------GLFCFAQKTIFATDLHSRGFVQLPKPIGVDFRASCFCHKKTIDMGYICSVCL 243

Query: 175 SIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 209
           SI+C+H KKCSTCGSVFGQ++   D  SA+++KRK
Sbjct: 244 SIFCEHHKKCSTCGSVFGQSKLD-DASSASDKKRK 277


>gi|307105142|gb|EFN53393.1| hypothetical protein CHLNCDRAFT_13819, partial [Chlorella
           variabilis]
          Length = 135

 Score =  170 bits (431), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 78/134 (58%), Positives = 101/134 (75%), Gaps = 1/134 (0%)

Query: 59  QP-RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHH 117
           QP R+LCL  +PD P QY+++MNAIFSAQRS V ID+C LG ++S FLQQA+Y+TGGV+ 
Sbjct: 2   QPARVLCLTAAPDVPSQYISVMNAIFSAQRSGVLIDACQLGRRHSTFLQQAAYLTGGVYL 61

Query: 118 KPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
           KP +   L QYL ++F  D  +R FL++P    VDFRASCFCHK  ID+GY+CS CLSI+
Sbjct: 62  KPSKPVALVQYLNSVFAVDAATRQFLRMPGTAHVDFRASCFCHKRQIDLGYVCSACLSIF 121

Query: 178 CKHLKKCSTCGSVF 191
           C+ L  C+TCG+ F
Sbjct: 122 CEQLPACTTCGTEF 135


>gi|328874423|gb|EGG22788.1| general transcription factor IIH [Dictyostelium fasciculatum]
          Length = 304

 Score =  167 bits (422), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 76/150 (50%), Positives = 104/150 (69%), Gaps = 1/150 (0%)

Query: 42  MALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQN 101
           +ALCYI R+ +      + RIL    SPD   QY+ +MN IFSAQ+  +P+DSC L + +
Sbjct: 133 LALCYINRIKKDNST-IRSRILVFNISPDVSTQYIPVMNCIFSAQKQSIPVDSCILTSTD 191

Query: 102 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHK 161
           S FLQQAS++T G++ +P + D L QYLL+ F  D +SR FL +P    VD+RASCFCHK
Sbjct: 192 STFLQQASHLTNGIYLRPHRQDHLGQYLLSSFLIDSYSRKFLNVPTLKTVDYRASCFCHK 251

Query: 162 NTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
             +D+G++CSVCLSIYCK+   C+TCG+ F
Sbjct: 252 KIVDIGFVCSVCLSIYCKYSSSCTTCGTKF 281


>gi|348676264|gb|EGZ16082.1| hypothetical protein PHYSODRAFT_508027 [Phytophthora sojae]
          Length = 298

 Score =  164 bits (415), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 84/185 (45%), Positives = 116/185 (62%), Gaps = 11/185 (5%)

Query: 6   ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPRIL 63
           A ++Q L++    D  L   +P  R A   ++ SLS ALC+I R        P  +PRIL
Sbjct: 103 AGVMQRLQQL--SDAPLDPAKP-NRTA---IAASLSRALCFINRAINE---EPDLRPRIL 153

Query: 64  CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 123
            +Q SPD  E Y+AIMN IFSAQ+  V +D+C L  ++S+F+QQA+Y+TGG+++KP    
Sbjct: 154 VVQKSPDVSEHYIAIMNGIFSAQKKSVAVDACILANEHSSFMQQAAYLTGGIYYKPNDHS 213

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 183
           GL QYL++I+  D   R  L+LP    VDFRA CFCH+  I   Y+C VCLS++C+    
Sbjct: 214 GLLQYLISIYLPDPSMRKLLKLPSQDSVDFRAMCFCHREVISTAYVCPVCLSLFCEFRPI 273

Query: 184 CSTCG 188
           CSTCG
Sbjct: 274 CSTCG 278


>gi|301122235|ref|XP_002908844.1| general transcription factor IIH subunit, putative [Phytophthora
           infestans T30-4]
 gi|262099606|gb|EEY57658.1| general transcription factor IIH subunit, putative [Phytophthora
           infestans T30-4]
          Length = 293

 Score =  162 bits (411), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 81/185 (43%), Positives = 114/185 (61%), Gaps = 11/185 (5%)

Query: 6   ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPRIL 63
           A +LQ +++    D  L    P      + ++ SLS +LC+I R        P  +PRIL
Sbjct: 98  AGVLQRMQQL--SDAPLDPATPNQ----TAIAASLSRSLCFINRAINE---EPDLRPRIL 148

Query: 64  CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 123
            +Q SPD  E Y+AIMN IFSAQ+  V +D+C L  ++S+F+QQA+Y+TGG+++KP    
Sbjct: 149 VIQKSPDVSEHYIAIMNGIFSAQKKSVAVDACILATEHSSFMQQAAYLTGGIYYKPNDHS 208

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 183
           GL QYL++I+  D   R  L+LP    VDFRA CFCH+  I   Y+C VCLS++C+    
Sbjct: 209 GLLQYLISIYLPDPSMRKLLKLPSQDSVDFRAMCFCHREVISTAYVCPVCLSLFCEFRPI 268

Query: 184 CSTCG 188
           CSTCG
Sbjct: 269 CSTCG 273


>gi|290987006|ref|XP_002676214.1| predicted protein [Naegleria gruberi]
 gi|284089815|gb|EFC43470.1| predicted protein [Naegleria gruberi]
          Length = 346

 Score =  159 bits (401), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 80/156 (51%), Positives = 105/156 (67%), Gaps = 2/156 (1%)

Query: 36  LSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
           LSG+LSMALC+I R+ +   L      RIL  Q SPD   QY+++MNAIFSA++  + +D
Sbjct: 177 LSGALSMALCFINRLEKEKPLGMSLNSRILTFQVSPDISSQYISVMNAIFSAEKMSIMLD 236

Query: 94  SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 153
           +C L   +S FLQQA ++TGG + KPQ+ +GL QYLLTIF  +   R+++QLP    VDF
Sbjct: 237 ACVLSNDDSTFLQQACFLTGGNYLKPQRQEGLIQYLLTIFMLEKSLRSYIQLPVQNTVDF 296

Query: 154 RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGS 189
           RASCF  +  ID GY+C VCLSI+  H   CSTC S
Sbjct: 297 RASCFETRKPIDDGYVCPVCLSIFSSHKPVCSTCAS 332


>gi|328852556|gb|EGG01701.1| hypothetical protein MELLADRAFT_110826 [Melampsora larici-populina
           98AG31]
          Length = 316

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 78/177 (44%), Positives = 111/177 (62%), Gaps = 11/177 (6%)

Query: 19  DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ----PRILCLQGSPDGPEQ 74
           D+ L   +P G +       +L+ ALC+I RV +     P+    PRIL +  SPD P Q
Sbjct: 118 DQDLKSGQPTGTV------NALAKALCHINRVTKEET-RPKDSLKPRILIISISPDSPGQ 170

Query: 75  YVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFG 134
           Y+ +MN IFSAQ+S +PID C +  +++ FLQQASY+T G++++ ++   L QYL  IF 
Sbjct: 171 YIPMMNCIFSAQKSCIPIDVCKITGEDAVFLQQASYLTNGIYYRLEKPKALIQYLTMIFL 230

Query: 135 TDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
             +  R  L LP+   VD RA+CFCH+  ID+GY+CSVCLSI+C  +  CSTC + F
Sbjct: 231 PGVSIRKSLNLPQQEEVDLRAACFCHRKIIDLGYVCSVCLSIFCTPIPVCSTCRTKF 287


>gi|325190435|emb|CCA24938.1| general transcription factor IIH subunit putative [Albugo laibachii
           Nc14]
          Length = 288

 Score =  157 bits (398), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 70/145 (48%), Positives = 99/145 (68%), Gaps = 1/145 (0%)

Query: 44  LCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSA 103
           LC + RV +       PRI  +Q SPD PE Y++IMN+IFSAQ+  V ID+C L  ++S 
Sbjct: 125 LCCMNRVLKENA-QLCPRIFVIQRSPDVPEHYISIMNSIFSAQKKTVAIDACILSEEHSP 183

Query: 104 FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 163
           F+QQA+Y+TGG+++KP+ +DGL QYL+T++  D + R  L+ P    VDFRA CFCHK  
Sbjct: 184 FMQQATYLTGGIYYKPRVIDGLLQYLITLYLPDPNMRKMLRFPSQESVDFRAMCFCHKKA 243

Query: 164 IDMGYICSVCLSIYCKHLKKCSTCG 188
           +   ++C VCLS++C+    CSTCG
Sbjct: 244 VSTAFVCPVCLSLFCQFQPICSTCG 268


>gi|403163144|ref|XP_003323260.2| hypothetical protein PGTG_04797 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375163932|gb|EFP78841.2| hypothetical protein PGTG_04797 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 322

 Score =  156 bits (395), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 74/156 (47%), Positives = 103/156 (66%), Gaps = 3/156 (1%)

Query: 39  SLSMALCYIQRVFRSGLLHPQ---PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 95
           +L+ ALC+I R+ R      +    RIL L  S D P QY+ +MN IFSAQ+S +PID C
Sbjct: 135 ALAKALCHINRLARENENKKESLKARILILSVSSDAPGQYIPMMNCIFSAQKSTIPIDVC 194

Query: 96  YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 155
            +  +N+ FLQQA+++T G++++ ++   + QYL  IF   L +R FL LPK   VDFRA
Sbjct: 195 KISKENAVFLQQAAHLTEGIYYQIEKPKAILQYLTMIFLPGLAARKFLNLPKHQEVDFRA 254

Query: 156 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           +CFCH+  ID+GY+CSVCLSI+C     CSTC + F
Sbjct: 255 ACFCHREIIDVGYVCSVCLSIFCTPTPVCSTCRTKF 290


>gi|66819921|ref|XP_643618.1| general transcription factor IIH, polypeptide 3 [Dictyostelium
           discoideum AX4]
 gi|74861043|sp|Q86IB5.1|TF2H3_DICDI RecName: Full=General transcription factor IIH subunit 3; AltName:
           Full=TFIIH basal transcription factor complex subunit 3
 gi|60471553|gb|EAL69509.1| general transcription factor IIH, polypeptide 3 [Dictyostelium
           discoideum AX4]
          Length = 372

 Score =  155 bits (393), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 79/151 (52%), Positives = 102/151 (67%), Gaps = 1/151 (0%)

Query: 42  MALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQN 101
           +ALCYI R+ R      +PRIL    SPD   QY+++MN IFS+Q+  +P+DSC L   +
Sbjct: 196 IALCYINRIKRETPT-IKPRILVFNISPDVSSQYISVMNCIFSSQKQSIPVDSCILSQSD 254

Query: 102 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHK 161
           S FLQQAS++T G++ KPQ+ + L QYLLT F  D  SR  L  P    VD+RASCFCHK
Sbjct: 255 STFLQQASHLTSGIYLKPQKQELLSQYLLTTFLLDTLSRKSLAYPTLKSVDYRASCFCHK 314

Query: 162 NTIDMGYICSVCLSIYCKHLKKCSTCGSVFG 192
             +D+GY+CSVCLSI+C H   CSTCG+ F 
Sbjct: 315 RIVDIGYVCSVCLSIFCGHSSSCSTCGTKFS 345


>gi|363739821|ref|XP_001235063.2| PREDICTED: general transcription factor IIH subunit 3 [Gallus
           gallus]
          Length = 307

 Score =  154 bits (390), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 79/184 (42%), Positives = 116/184 (63%), Gaps = 9/184 (4%)

Query: 10  QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQG 67
           + +++ M K + +G+Q        +LL+GSL+ ALCYI ++ +    + +   RIL ++ 
Sbjct: 112 EEIKDLMTKTDMMGQQTE------TLLAGSLAKALCYINKISKEVKANQEMKSRILVIKA 165

Query: 68  SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
           + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K   L  L Q
Sbjct: 166 AEDSALQYMNFMNVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGIYLKVPHLPSLLQ 224

Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
           YLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     CSTC
Sbjct: 225 YLLWVFLPDQEQRSQLVLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICSTC 284

Query: 188 GSVF 191
            + F
Sbjct: 285 ETAF 288


>gi|334327155|ref|XP_001375408.2| PREDICTED: general transcription factor IIH subunit 3-like, partial
           [Monodelphis domestica]
          Length = 308

 Score =  154 bits (389), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 78/172 (45%), Positives = 112/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + EG+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 119 MTKSDMEGQHTETLLAGSLAKALCYIHRMSKEVKDNQEMKSRILVIKAAEDSALQYMNFM 178

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L A +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 179 NVIFAAQKQNIVIDACVLDA-DSGLLQQACDITGGIYLKVPQMPSLLQYLLWVFLPDQDQ 237

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 RSQLNLPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|326929605|ref|XP_003210949.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Meleagris gallopavo]
          Length = 290

 Score =  153 bits (387), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 79/184 (42%), Positives = 116/184 (63%), Gaps = 9/184 (4%)

Query: 10  QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQG 67
           + +++ M K + +G+Q        +LL+GSL+ ALCYI ++ +    + +   RIL ++ 
Sbjct: 95  EEIKDLMTKTDMMGQQTE------TLLAGSLAKALCYINKMSKEVKANQEMKSRILVIKA 148

Query: 68  SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
           + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K   L  L Q
Sbjct: 149 AEDSALQYMNFMNVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGIYLKVPHLPSLLQ 207

Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
           YLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     CSTC
Sbjct: 208 YLLWVFLPDQEQRSQLVLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICSTC 267

Query: 188 GSVF 191
            + F
Sbjct: 268 ETAF 271


>gi|307214269|gb|EFN89365.1| General transcription factor IIH subunit 3 [Harpegnathos saltator]
          Length = 292

 Score =  153 bits (387), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 82/162 (50%), Positives = 105/162 (64%), Gaps = 7/162 (4%)

Query: 34  SLLSGSLSMALCYIQRVFRSGL----LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
           SL+SG+LSMALCYI R+ R  +    LHP  RIL +  S D   QY+  MN  F+AQ+  
Sbjct: 113 SLISGALSMALCYIARLERDKVAGQKLHP--RILVITASNDSATQYMNYMNIFFTAQKMN 170

Query: 90  VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
           V +D C L  Q    LQQ   ITGG + K  QL GL QYLL +F  D + R+ L LP PV
Sbjct: 171 VILDVCSLD-QELTLLQQGCDITGGNYLKVPQLAGLLQYLLWVFLPDPNVRSKLVLPPPV 229

Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            VD+RA+CFCH+  ID+GY+CS+CLS++CK    C+TC +VF
Sbjct: 230 KVDYRAACFCHQELIDIGYVCSICLSVFCKFSPICTTCHTVF 271


>gi|115609530|ref|XP_791329.2| PREDICTED: general transcription factor IIH subunit 3-like
           [Strongylocentrotus purpuratus]
          Length = 313

 Score =  153 bits (386), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 80/191 (41%), Positives = 118/191 (61%), Gaps = 8/191 (4%)

Query: 3   SLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQP 60
           S+  T+++ L+  +++ +  G    +     +LL+GSL+MALCYI RV +   +    + 
Sbjct: 107 SINDTVMEELKNLVDESKSWGLSHTD-----TLLAGSLAMALCYIHRVXKECAVGEKIKS 161

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RIL L+ + D   QY+  MN IF+AQ+  +PID+C L  ++S+ LQQA  ITGG + K  
Sbjct: 162 RILVLKAADDSASQYMNFMNVIFTAQKQNIPIDACILD-KDSSLLQQACDITGGKYLKLL 220

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
              GL QYLL ++      R  L LP  + VD+RA+CFCH+  ID+GY+CSVCLSI+C +
Sbjct: 221 STTGLLQYLLWVYLPSPSQRESLVLPPAIHVDYRAACFCHRILIDVGYVCSVCLSIFCTY 280

Query: 181 LKKCSTCGSVF 191
              CSTC + F
Sbjct: 281 SPICSTCHTAF 291


>gi|332019667|gb|EGI60141.1| General transcription factor IIH subunit 3 [Acromyrmex echinatior]
          Length = 292

 Score =  153 bits (386), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 83/162 (51%), Positives = 104/162 (64%), Gaps = 7/162 (4%)

Query: 34  SLLSGSLSMALCYIQRVFRSGL----LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
           SL+SG+LSMALCYI R+ R  +    LHP  R+L +  S D   QY+  MN  F+AQR  
Sbjct: 113 SLISGALSMALCYIARLEREKVAGQKLHP--RMLVITASNDSATQYMNYMNIFFTAQRMN 170

Query: 90  VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
           V +D C L  Q    LQQ   ITGG + K  QL GL QYLL +F  D   R+ L LP PV
Sbjct: 171 VILDVCSLD-QELTLLQQGCDITGGNYLKVPQLAGLLQYLLWVFLPDPSVRSKLVLPPPV 229

Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            VD+RA+CFCH+  ID+GY+CS+CLSI+CK    C+TC +VF
Sbjct: 230 KVDYRAACFCHQELIDIGYVCSICLSIFCKFSPICTTCHTVF 271


>gi|395513842|ref|XP_003761131.1| PREDICTED: general transcription factor IIH subunit 3 [Sarcophilus
           harrisii]
          Length = 316

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + EG+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 126 MTKSDMEGQRTETLLAGSLAKALCYIHRMGKEVKDNQEMKSRILVIKAAEDSALQYMNFM 185

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L   +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 186 NVIFAAQKQNIVIDACVLDT-DSGLLQQACDITGGIYLKVPQMLSLLQYLLWVFLPDQDQ 244

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 245 RSQLNLPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 296


>gi|149633387|ref|XP_001507590.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Ornithorhynchus anatinus]
          Length = 311

 Score =  150 bits (380), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 80/188 (42%), Positives = 118/188 (62%), Gaps = 5/188 (2%)

Query: 8   LLQNLEEFMNKD--EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRIL 63
           LL    E + ++  + + K + +G+   +LL+GSL+ ALCYI R+ +        + RIL
Sbjct: 106 LLTTANEVITEEIKDLMTKSDMKGQHTETLLAGSLAKALCYIHRMNKEVKDSQEMRSRIL 165

Query: 64  CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 123
            ++ S D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+ 
Sbjct: 166 VIKASEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGIYLKVPQMP 224

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 183
            L QYLL +F  D   R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     
Sbjct: 225 SLLQYLLWVFLPDQDQRSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPI 284

Query: 184 CSTCGSVF 191
           C+TC + F
Sbjct: 285 CTTCETAF 292


>gi|118601766|ref|NP_001073057.1| general transcription factor IIH subunit 3 [Bos taurus]
 gi|426247186|ref|XP_004017367.1| PREDICTED: general transcription factor IIH subunit 3 [Ovis aries]
 gi|122131719|sp|Q05B56.1|TF2H3_BOVIN RecName: Full=General transcription factor IIH subunit 3; AltName:
           Full=General transcription factor IIH polypeptide 3
 gi|115545470|gb|AAI22793.1| General transcription factor IIH, polypeptide 3, 34kDa [Bos taurus]
 gi|296478493|tpg|DAA20608.1| TPA: general transcription factor IIH subunit 3 [Bos taurus]
          Length = 309

 Score =  150 bits (380), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 78/172 (45%), Positives = 112/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + EG+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 120 MTKSDIEGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 179

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 239 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|440898315|gb|ELR49840.1| General transcription factor IIH subunit 3, partial [Bos grunniens
           mutus]
          Length = 305

 Score =  150 bits (380), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 78/172 (45%), Positives = 112/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + EG+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 116 MTKSDIEGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 175

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 176 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 234

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 235 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 286


>gi|115655410|ref|XP_001194121.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Strongylocentrotus purpuratus]
          Length = 391

 Score =  150 bits (380), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 79/181 (43%), Positives = 111/181 (61%), Gaps = 3/181 (1%)

Query: 13  EEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPD 70
           +  M + + L  + P      +LL+GSL+MALCYI RV +   +    + RIL L+ + D
Sbjct: 190 DTVMEELKNLVDENPTVSHTDTLLAGSLAMALCYIHRVEKECAVGEKIKSRILVLKAADD 249

Query: 71  GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL 130
              QY+  MN IF+AQ+  +PID+C L  ++S+ LQQA  ITGG + K     GL QYLL
Sbjct: 250 SASQYMNFMNVIFTAQKQNIPIDACILD-KDSSLLQQACDITGGKYLKLLSTTGLLQYLL 308

Query: 131 TIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSV 190
            ++      R  L LP  + VD+RA+CFCH+  ID+GY+CSVCLSI+C +   CSTC + 
Sbjct: 309 WVYLPSPSQRESLVLPPAIHVDYRAACFCHRILIDVGYVCSVCLSIFCTYSPICSTCHTA 368

Query: 191 F 191
           F
Sbjct: 369 F 369


>gi|449476600|ref|XP_002189370.2| PREDICTED: general transcription factor IIH subunit 3 isoform 1
           [Taeniopygia guttata]
 gi|449476604|ref|XP_004176463.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
           [Taeniopygia guttata]
          Length = 267

 Score =  150 bits (379), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 78/187 (41%), Positives = 117/187 (62%), Gaps = 9/187 (4%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILC 64
            + + +++ M K +  G+Q        +LL+GSL+ ALCYI ++ +   ++ +   RIL 
Sbjct: 69  AIAEEIKDLMTKTDMKGQQTE------TLLAGSLAKALCYINKMGKDLKVNQEIKSRILV 122

Query: 65  LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
           ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K   +  
Sbjct: 123 IKAAEDSALQYMNFMNVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGIYLKVPHMPS 181

Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 184
           L QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C
Sbjct: 182 LLQYLLWVFLPDQEQRSQLVLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPIC 241

Query: 185 STCGSVF 191
           STC + F
Sbjct: 242 STCETAF 248


>gi|301754663|ref|XP_002913183.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Ailuropoda melanoleuca]
          Length = 309

 Score =  150 bits (379), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 113/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K E +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+ +M
Sbjct: 120 MTKSEIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNLM 179

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 239 RSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|390602568|gb|EIN11961.1| transcription factor Tfb4 [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 338

 Score =  150 bits (379), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 81/185 (43%), Positives = 111/185 (60%), Gaps = 8/185 (4%)

Query: 12  LEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR-SGLLHP----QPRILCLQ 66
           ++   N+ + LG+ E +   A   L G+L+ ALCYI R+   S    P    +PRIL L 
Sbjct: 100 IDHITNELDVLGEWEAQESTA---LVGALTKALCYINRIMNVSADNEPRSPSEPRILILS 156

Query: 67  GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLF 126
            SPD    Y+ IMN+IFSAQ+  V +D C L    S FLQQAS++TGG +   +  D L 
Sbjct: 157 VSPDQSTSYIPIMNSIFSAQKLKVTMDVCQLYGPESVFLQQASHLTGGSYIHLEHRDALL 216

Query: 127 QYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCST 186
           QYL+  F      R  L +P    +DFRA+CFCHKN +D+G++CSVCLSI+C+ +  CST
Sbjct: 217 QYLIMSFLPPPSIRRTLSVPTQDKIDFRAACFCHKNIVDVGFVCSVCLSIFCQPVPVCST 276

Query: 187 CGSVF 191
           C + F
Sbjct: 277 CRTKF 281


>gi|322795304|gb|EFZ18109.1| hypothetical protein SINV_08150 [Solenopsis invicta]
          Length = 322

 Score =  150 bits (378), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 87/185 (47%), Positives = 111/185 (60%), Gaps = 12/185 (6%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGL----LHPQPRI 62
           T+ Q L++F+N+         E     SL+SG+LSMALCYI R+ R  +    LHP  RI
Sbjct: 89  TVRQQLQQFINEIPMDAPLNTE-----SLISGALSMALCYIARLEREKVAGQKLHP--RI 141

Query: 63  LCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL 122
           L +  S D   QY+  MN  F+AQR  V +D C L  Q    LQQ   I+GG + K  QL
Sbjct: 142 LVITASNDSATQYMNYMNIFFTAQRMNVILDVCSLD-QELTLLQQGCDISGGNYLKVPQL 200

Query: 123 DGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLK 182
            GL QYLL +F  D   R  L LP PV VD+RA+CFCH+  ID+GY+CS+CLSI+CK   
Sbjct: 201 AGLLQYLLWVFLPDPSVRLKLVLPPPVKVDYRAACFCHQELIDIGYVCSICLSIFCKFSP 260

Query: 183 KCSTC 187
            C+TC
Sbjct: 261 ICTTC 265


>gi|350407575|ref|XP_003488130.1| PREDICTED: general transcription factor IIH subunit 3-like [Bombus
           impatiens]
          Length = 292

 Score =  150 bits (378), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 84/190 (44%), Positives = 117/190 (61%), Gaps = 14/190 (7%)

Query: 7   TLLQNLEEFMNK---DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PR 61
           T+ Q L++ +N+   D+ L  +        SL+SG+L+MALCY+ R+ R  +   +   R
Sbjct: 91  TIRQQLQQVINEISMDKPLNGE--------SLISGALTMALCYVARLEREKVASEKIYSR 142

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           IL +  S D   QY+  MN  F+AQ+  + +D C L  Q    LQQ   ITGG + K  Q
Sbjct: 143 ILVITASNDSATQYMNYMNIFFTAQKMGIILDVCSLD-QELTLLQQGCDITGGNYLKVPQ 201

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
           L+GL QYLL IF  D + R+ L LP PV VD+RA+CFCH+  ID+GY+CS+CLSI+CK  
Sbjct: 202 LNGLLQYLLWIFLPDPNVRSKLVLPPPVKVDYRAACFCHQELIDIGYVCSICLSIFCKFS 261

Query: 182 KKCSTCGSVF 191
             C+TC +VF
Sbjct: 262 PICTTCHTVF 271


>gi|194042856|ref|XP_001927574.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
           [Sus scrofa]
          Length = 309

 Score =  150 bits (378), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + EG+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 120 MTKSDIEGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 179

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 239 RSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|194853925|ref|XP_001968252.1| GG24611 [Drosophila erecta]
 gi|190660119|gb|EDV57311.1| GG24611 [Drosophila erecta]
          Length = 299

 Score =  150 bits (378), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 79/160 (49%), Positives = 104/160 (65%), Gaps = 3/160 (1%)

Query: 34  SLLSGSLSMALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SLL+GS+SMALCYI R+ R+    +    RIL L GS +   QY+  MN  F+AQ+  + 
Sbjct: 121 SLLAGSMSMALCYISRLQRNVAPGVKMHSRILVLTGSNECASQYMTFMNVFFTAQKLGIT 180

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
           ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F    H R+ L LP P  V
Sbjct: 181 IDTCALN-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPHIRHKLVLPPPPKV 239

Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           D+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 240 DYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|340717520|ref|XP_003397229.1| PREDICTED: general transcription factor IIH subunit 3-like [Bombus
           terrestris]
          Length = 292

 Score =  150 bits (378), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 84/190 (44%), Positives = 117/190 (61%), Gaps = 14/190 (7%)

Query: 7   TLLQNLEEFMNK---DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PR 61
           T+ Q L++ +N+   D+ L  +        SL+SG+L+MALCY+ R+ R  +   +   R
Sbjct: 91  TIRQQLQQVINEISMDKPLNGE--------SLISGALTMALCYVARLEREKVASEKIYSR 142

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           IL +  S D   QY+  MN  F+AQ+  + +D C L  Q    LQQ   ITGG + K  Q
Sbjct: 143 ILVITASNDSATQYMNYMNIFFTAQKMGIILDVCSLD-QELTLLQQGCDITGGNYLKVPQ 201

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
           L+GL QYLL IF  D + R+ L LP PV VD+RA+CFCH+  ID+GY+CS+CLSI+CK  
Sbjct: 202 LNGLLQYLLWIFLPDPNVRSKLVLPPPVKVDYRAACFCHQELIDIGYVCSICLSIFCKFS 261

Query: 182 KKCSTCGSVF 191
             C+TC +VF
Sbjct: 262 PICTTCHTVF 271


>gi|170093117|ref|XP_001877780.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164647639|gb|EDR11883.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 360

 Score =  150 bits (378), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 120/199 (60%), Gaps = 19/199 (9%)

Query: 6   ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRV-------------FR 52
           +T++  + + ++  +Q  ++ P     C+L+ G+L+ ALCY+ R+              R
Sbjct: 102 STIINRISDELDAIDQSEEEAP-----CALV-GALTKALCYVNRISLPPPSNSSTSPNTR 155

Query: 53  SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
              + P PRIL L  SPD    Y+ IMN+IFSAQ+  V ID+C +   ++ FLQQA+++T
Sbjct: 156 DSTVLPDPRILILSVSPDLSTSYIPIMNSIFSAQKLKVTIDACQVYGPDAVFLQQAAHLT 215

Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 172
           GG +   ++ D L QYL+  F +    R  L +P    +DFRA+CFCHKN ID+G++CSV
Sbjct: 216 GGSYLFLERRDALLQYLIMSFLSAPSIRQVLAVPTQDRIDFRAACFCHKNIIDIGFVCSV 275

Query: 173 CLSIYCKHLKKCSTCGSVF 191
           CLSI+C+ +  CSTC + F
Sbjct: 276 CLSIFCQPVPVCSTCRTKF 294


>gi|187608008|ref|NP_001120625.1| general transcription factor IIH, polypeptide 3, 34kDa [Xenopus
           (Silurana) tropicalis]
 gi|171846721|gb|AAI61775.1| gtf2h3 protein [Xenopus (Silurana) tropicalis]
          Length = 200

 Score =  149 bits (377), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 77/183 (42%), Positives = 115/183 (62%), Gaps = 8/183 (4%)

Query: 11  NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGS 68
           ++++ M K  Q+  Q+ E     ++L+GSL+ ALCYI ++ +        + RIL ++ +
Sbjct: 5   DIKDLMTKTGQINGQQTE-----TVLAGSLAKALCYINKIAKDTKAGQEVKSRILVIKAA 59

Query: 69  PDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQY 128
            D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q + L QY
Sbjct: 60  EDSALQYMNFMNVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGIYLKLPQANSLLQY 118

Query: 129 LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCG 188
           LL +F  D   R+ L L  PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC 
Sbjct: 119 LLWVFLPDPDQRSHLNLLPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCE 178

Query: 189 SVF 191
           + F
Sbjct: 179 TAF 181


>gi|442749163|gb|JAA66741.1| Putative rna polymer [Ixodes ricinus]
          Length = 293

 Score =  149 bits (377), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 79/174 (45%), Positives = 105/174 (60%), Gaps = 3/174 (1%)

Query: 20  EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVA 77
           E +   + E     SLL+G LS+ALCYI R+ +      +   RIL L  S +   QY+ 
Sbjct: 100 ELVLSDDSESAAGESLLTGGLSLALCYINRIEKESSSQNKIASRILVLSASGESASQYLN 159

Query: 78  IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
            MN  F+AQ+  V ID+C L  ++S  LQQ   ITGG + K     GL QYLL +F  D 
Sbjct: 160 FMNVFFTAQKKNVIIDACVL-EKDSGLLQQGCDITGGKYMKVPNHAGLLQYLLWVFLPDK 218

Query: 138 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            SRN++  P PV VD+RA+CFCH+N I++GY+CSVCLSI+C     CSTC + F
Sbjct: 219 TSRNYMVFPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCAFSPICSTCQTAF 272


>gi|110757998|ref|XP_394705.3| PREDICTED: general transcription factor IIH subunit 3 [Apis
           mellifera]
          Length = 292

 Score =  149 bits (376), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 79/160 (49%), Positives = 103/160 (64%), Gaps = 3/160 (1%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SL+SG+L+MALCYI R+ R  +   +   RIL +  S D   QY+  MN  F+AQ+  V 
Sbjct: 113 SLISGALTMALCYIARLEREKIAGEKLYSRILVITASNDSATQYMNYMNIFFTAQKMGVI 172

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
           +D C L  Q    LQQ   ITGG + K  QL+GL QYLL IF  D + R+ L  P PV V
Sbjct: 173 LDVCSLD-QELTLLQQGCDITGGNYLKVPQLNGLLQYLLWIFLPDPNVRSKLVFPPPVKV 231

Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           D+RA+CFCH+  ID+GY+CS+CLSI+CK    C+TC +VF
Sbjct: 232 DYRAACFCHQELIDIGYVCSICLSIFCKFSPICTTCHTVF 271


>gi|281343811|gb|EFB19395.1| hypothetical protein PANDA_000941 [Ailuropoda melanoleuca]
          Length = 278

 Score =  149 bits (376), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 113/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K E +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+ +M
Sbjct: 90  MTKSEIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNLM 149

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 150 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 208

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 209 RSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 260


>gi|380030634|ref|XP_003698948.1| PREDICTED: general transcription factor IIH subunit 3-like [Apis
           florea]
          Length = 292

 Score =  149 bits (376), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 79/160 (49%), Positives = 103/160 (64%), Gaps = 3/160 (1%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SL+SG+L+MALCYI R+ R  +   +   RIL +  S D   QY+  MN  F+AQ+  V 
Sbjct: 113 SLISGALTMALCYIARLEREKIAGEKLYSRILVITASNDSATQYMNYMNIFFTAQKMGVI 172

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
           +D C L  Q    LQQ   ITGG + K  QL+GL QYLL IF  D + R+ L  P PV V
Sbjct: 173 LDVCSLD-QELTLLQQGCDITGGNYLKVPQLNGLLQYLLWIFLPDPNVRSKLVFPPPVKV 231

Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           D+RA+CFCH+  ID+GY+CS+CLSI+CK    C+TC +VF
Sbjct: 232 DYRAACFCHQELIDIGYVCSICLSIFCKFSPICTTCHTVF 271


>gi|119618833|gb|EAW98427.1| general transcription factor IIH, polypeptide 3, 34kDa, isoform
           CRA_b [Homo sapiens]
          Length = 274

 Score =  149 bits (375), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 78/179 (43%), Positives = 116/179 (64%), Gaps = 3/179 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 78  MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 137

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 138 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 196

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQS 198
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC +   + +T+S
Sbjct: 197 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCDAESQEKETES 255


>gi|308798981|ref|XP_003074271.1| basal transcription factor complex subunit-related (ISS)
           [Ostreococcus tauri]
 gi|116000442|emb|CAL50122.1| basal transcription factor complex subunit-related (ISS)
           [Ostreococcus tauri]
          Length = 319

 Score =  149 bits (375), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 103/172 (59%), Gaps = 4/172 (2%)

Query: 24  KQEPEGRIACSLLSGSLSMALCYIQR---VFRSGLLHPQPRILCLQGSPDGPEQYVAIMN 80
           K + +   A S  +G+LS+ALCY  R   +  +  L  +PRILCLQ S D P  Y+ +MN
Sbjct: 141 KADSDDGPATSPFAGALSLALCYCNRAQALENAAGLRSKPRILCLQASQDNPTDYIPMMN 200

Query: 81  AIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSR 140
           AIFSAQR  +P+D+  LG  +S F+QQA++IT G + KP   DGL QYLL+    DL SR
Sbjct: 201 AIFSAQRQSIPVDAFALGEHDSPFMQQAAHITRGAYVKPTLGDGLLQYLLSTAVMDLRSR 260

Query: 141 NFLQLPKPVGVDFRASCFCHKNTIDM-GYICSVCLSIYCKHLKKCSTCGSVF 191
            FL+LP+     FRAS            ++CSVCLS++C     C TCG+ F
Sbjct: 261 AFLKLPQAKASTFRASVLLPTTARQASAFVCSVCLSVFCDGRPACDTCGTTF 312


>gi|1039318|emb|CAA82909.1| basic transcription factor 2, 35 kD subunit [Homo sapiens]
          Length = 303

 Score =  149 bits (375), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 178

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 179 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 237

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|307175391|gb|EFN65408.1| General transcription factor IIH subunit 3 [Camponotus floridanus]
          Length = 292

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 78/160 (48%), Positives = 104/160 (65%), Gaps = 3/160 (1%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SL+SG+LSMALCYI R+ R    + +   R+L +  S D   QY+  MN  F+AQR  V 
Sbjct: 113 SLISGALSMALCYIARLEREKFANQKLHSRMLVITASNDSAMQYMNYMNIFFTAQRMNVI 172

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
           +D C L  Q    LQQ   ITGG + +  QL GL QYLL +F  D   R+ L LP PV V
Sbjct: 173 LDVCSLD-QELTLLQQGCEITGGNYLQVPQLAGLLQYLLWVFLPDPSVRSKLVLPPPVKV 231

Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           ++RA+CFCH++ ID+GY+CS+CLSI+CK+   C+TC +VF
Sbjct: 232 NYRAACFCHQDLIDIGYVCSICLSIFCKYSPICTTCHTVF 271


>gi|114647703|ref|XP_001170519.1| PREDICTED: general transcription factor IIH subunit 3 isoform 3
           [Pan troglodytes]
 gi|397481834|ref|XP_003812142.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
           [Pan paniscus]
 gi|410221584|gb|JAA08011.1| general transcription factor IIH, polypeptide 3, 34kDa [Pan
           troglodytes]
 gi|410252228|gb|JAA14081.1| general transcription factor IIH, polypeptide 3, 34kDa [Pan
           troglodytes]
 gi|410294034|gb|JAA25617.1| general transcription factor IIH, polypeptide 3, 34kDa [Pan
           troglodytes]
 gi|410334639|gb|JAA36266.1| general transcription factor IIH, polypeptide 3, 34kDa [Pan
           troglodytes]
          Length = 308

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +        + RIL ++ + D   QY+  M
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 178

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 179 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 237

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|19923732|ref|NP_001507.2| general transcription factor IIH subunit 3 isoform a [Homo sapiens]
 gi|50403772|sp|Q13889.2|TF2H3_HUMAN RecName: Full=General transcription factor IIH subunit 3; AltName:
           Full=Basic transcription factor 2 34 kDa subunit;
           Short=BTF2 p34; AltName: Full=General transcription
           factor IIH polypeptide 3; AltName: Full=TFIIH basal
           transcription factor complex p34 subunit
 gi|23955948|gb|AAN40702.1| general transcription factor IIH, polypeptide 3, 34kDa [Homo
           sapiens]
 gi|40850955|gb|AAH65250.1| General transcription factor IIH, polypeptide 3, 34kDa [Homo
           sapiens]
 gi|119618832|gb|EAW98426.1| general transcription factor IIH, polypeptide 3, 34kDa, isoform
           CRA_a [Homo sapiens]
 gi|189053764|dbj|BAG36016.1| unnamed protein product [Homo sapiens]
 gi|208966364|dbj|BAG73196.1| general transcription factor IIH, polypeptide 3, 34kDa [synthetic
           construct]
          Length = 308

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 178

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 179 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 237

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|167773979|gb|ABZ92424.1| general transcription factor IIH, polypeptide 3, 34kDa [synthetic
           construct]
          Length = 308

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 178

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 179 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 237

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|34783458|gb|AAH39726.1| GTF2H3 protein, partial [Homo sapiens]
 gi|34783739|gb|AAH47868.2| GTF2H3 protein, partial [Homo sapiens]
          Length = 303

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 114 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 173

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 174 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 232

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 233 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 284


>gi|383855904|ref|XP_003703450.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Megachile rotundata]
          Length = 292

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 87/190 (45%), Positives = 115/190 (60%), Gaps = 14/190 (7%)

Query: 7   TLLQNLEEFMNK---DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PR 61
           T+ Q L++ +N+   D+ L  +        SL+SG+L+MALCYI R+ R  +   +   R
Sbjct: 91  TVRQQLQQVINEISMDKPLNGE--------SLISGALTMALCYIARLEREKVAGQKLYSR 142

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           IL +  S D   QY+  MN  F+AQ+  V +D C L  Q    LQQ   ITGG + K  Q
Sbjct: 143 ILTITASNDSATQYMNYMNIFFTAQKMGVILDVCSLD-QELTLLQQGCDITGGNYLKVPQ 201

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
           L GL QYLL IF  D   R+ L LP PV VD+RA+CFCH+  ID+GY+CSVCLSI+CK  
Sbjct: 202 LSGLLQYLLWIFLPDPSIRSKLVLPPPVKVDYRAACFCHQELIDIGYVCSVCLSIFCKFS 261

Query: 182 KKCSTCGSVF 191
             C+TC +VF
Sbjct: 262 PICTTCHTVF 271


>gi|402888055|ref|XP_003907392.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
           [Papio anubis]
 gi|426374588|ref|XP_004054152.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
           [Gorilla gorilla gorilla]
 gi|380784389|gb|AFE64070.1| general transcription factor IIH subunit 3 [Macaca mulatta]
 gi|383417813|gb|AFH32120.1| general transcription factor IIH subunit 3 [Macaca mulatta]
 gi|384946662|gb|AFI36936.1| general transcription factor IIH subunit 3 [Macaca mulatta]
          Length = 308

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +        + RIL ++ + D   QY+  M
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 178

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 179 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 237

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|195032196|ref|XP_001988455.1| GH10566 [Drosophila grimshawi]
 gi|193904455|gb|EDW03322.1| GH10566 [Drosophila grimshawi]
          Length = 300

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 78/160 (48%), Positives = 103/160 (64%), Gaps = 3/160 (1%)

Query: 34  SLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SLL+GS+SMALCYI R+ R  +  +    RIL L GS +   QY+  MN  F+AQ+  + 
Sbjct: 122 SLLAGSMSMALCYISRLQRNVAAGVKMHSRILVLTGSNECASQYMTYMNVFFTAQKLSIV 181

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
           ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP P  V
Sbjct: 182 IDACALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFMPSPQMRHKLVLPPPPKV 240

Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           D+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 241 DYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 280


>gi|395846791|ref|XP_003796077.1| PREDICTED: general transcription factor IIH subunit 3 [Otolemur
           garnettii]
          Length = 309

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 179

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 239 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|348554273|ref|XP_003462950.1| PREDICTED: general transcription factor IIH subunit 3-like [Cavia
           porcellus]
          Length = 309

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 110/172 (63%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL L+ + D   QY+  M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVLKAAEDSALQYMNFM 179

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L   +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 180 NVIFAAQKQNILIDACVLDT-DSGLLQQACDITGGLYLKVPQMTSLLQYLLWVFLPDQDQ 238

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R  L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 239 RAQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|291414274|ref|XP_002723381.1| PREDICTED: general transcription factor IIH, polypeptide 3, 34kDa
           [Oryctolagus cuniculus]
          Length = 309

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 179

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L   +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 180 NVIFAAQKQNILIDACVLDT-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 239 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCSFSPICTTCETAF 290


>gi|157132766|ref|XP_001662634.1| transcription factor TFIIH-subunit, putative [Aedes aegypti]
 gi|108871079|gb|EAT35304.1| AAEL012523-PA [Aedes aegypti]
          Length = 288

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 80/160 (50%), Positives = 103/160 (64%), Gaps = 3/160 (1%)

Query: 34  SLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SLL+GS++MALCYI RV R+    +    RIL + GS +   QY+  MN  F+AQ+  V 
Sbjct: 113 SLLAGSMAMALCYIARVHRNKPAGVKINSRILVVTGSNECASQYMTYMNVFFTAQKQNVS 172

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
           +D C L    S  LQQ   ITGG + K  QLDG  QYLL +F  +  +R  L LP PV V
Sbjct: 173 LDVCALDKPLS-LLQQGCDITGGQYLKLPQLDGFLQYLLWVFLPEPLTRCKLVLPPPVKV 231

Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           D+RA+CFCH+  ID+GY+CSVCLSI+CK    C+TC +VF
Sbjct: 232 DYRAACFCHRELIDIGYVCSVCLSIFCKFSPICTTCHTVF 271


>gi|21410252|gb|AAH31030.1| Unknown (protein for IMAGE:4724313), partial [Homo sapiens]
          Length = 235

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 78/187 (41%), Positives = 117/187 (62%), Gaps = 9/187 (4%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILC 64
            +++ +++ M K +  G+         +LL+GSL+ ALCYI R+ +    + +   RIL 
Sbjct: 37  VIVEEIKDLMTKSDIKGQHTE------TLLAGSLAKALCYIHRMNKEVKDNQEMKSRILV 90

Query: 65  LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
           ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  
Sbjct: 91  IKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPS 149

Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 184
           L QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C
Sbjct: 150 LLQYLLWVFLPDQDQRSRLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPIC 209

Query: 185 STCGSVF 191
           +TC + F
Sbjct: 210 TTCETAF 216


>gi|358060458|dbj|GAA93863.1| hypothetical protein E5Q_00509 [Mixia osmundae IAM 14324]
          Length = 331

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 75/187 (40%), Positives = 111/187 (59%), Gaps = 10/187 (5%)

Query: 14  EFMNKDEQLGKQ--------EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILC 64
            F   D Q+ K         EP+     +++S +L+  LC+I R+ R    H  +PR+L 
Sbjct: 101 RFRQVDTQVSKAVGQMMRELEPDTDAPPAIVS-ALARTLCHINRISREETKHTIKPRVLL 159

Query: 65  LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
           L  S D   QY+ +MN IFSAQ++ VPID C +   ++ FLQQA+Y++ G+++K ++  G
Sbjct: 160 LSVSHDSSSQYIPLMNCIFSAQKANVPIDVCKIYGDDAVFLQQAAYLSSGIYYKLEKRAG 219

Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 184
           L QYL+  F   + SR  L LP    VD RA+CFCH+  +D+G++CSVCLSI+C     C
Sbjct: 220 LLQYLMMTFLPGVTSRKLLNLPSQDAVDLRAACFCHQRIVDIGFVCSVCLSIFCTPRPSC 279

Query: 185 STCGSVF 191
            TC + F
Sbjct: 280 LTCRTKF 286


>gi|355564808|gb|EHH21308.1| hypothetical protein EGK_04329, partial [Macaca mulatta]
 gi|355786640|gb|EHH66823.1| hypothetical protein EGM_03880, partial [Macaca fascicularis]
          Length = 308

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +        + RIL ++ + D   QY+  M
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 178

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 179 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 237

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|344297280|ref|XP_003420327.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Loxodonta africana]
          Length = 309

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 76/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 120 MTKSDIKGQHMETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 179

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 180 NVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 239 RSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|449683803|ref|XP_002159513.2| PREDICTED: general transcription factor IIH subunit 3-like [Hydra
           magnipapillata]
          Length = 291

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 75/159 (47%), Positives = 101/159 (63%), Gaps = 2/159 (1%)

Query: 34  SLLSGSLSMALCYIQRVFRSGL-LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPI 92
           SLL+G+L+ ALCYI    R+        RIL ++GS D   QY+ +MN IF+A +  + I
Sbjct: 120 SLLAGALTKALCYIHSHDRTANGRRTNARILIIKGSSDSSSQYMTVMNCIFAASKKNIVI 179

Query: 93  DSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVD 152
           D C L   NS F+QQAS ITGGV+       G+ +YLL IF  D   R  L LP    +D
Sbjct: 180 DCCAL-QNNSGFMQQASDITGGVYFFIDDFSGMLEYLLWIFLPDPGLREKLNLPTSSQID 238

Query: 153 FRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           +RA+CFCHK  +D+G++CSVCLSIYC+ + KC+TC + F
Sbjct: 239 YRAACFCHKQLVDVGFVCSVCLSIYCQFMPKCATCQTRF 277


>gi|31088894|ref|NP_852075.1| general transcription factor IIH subunit 3 [Mus musculus]
 gi|22654149|sp|Q8VD76.1|TF2H3_MOUSE RecName: Full=General transcription factor IIH subunit 3; AltName:
           Full=Basic transcription factor 2 34 kDa subunit;
           Short=BTF2 p34; AltName: Full=General transcription
           factor IIH polypeptide 3; AltName: Full=TFIIH basal
           transcription factor complex p34 subunit
 gi|17028402|gb|AAH17515.1| General transcription factor IIH, polypeptide 3 [Mus musculus]
 gi|74228938|dbj|BAE21939.1| unnamed protein product [Mus musculus]
 gi|148687628|gb|EDL19575.1| general transcription factor IIH, polypeptide 3 [Mus musculus]
          Length = 309

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 76/172 (44%), Positives = 113/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI RV ++   + +   RIL ++ + D   QY+  M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRVNKAVKDNQEMKSRILVIKAAEDSALQYMNFM 179

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP P+ VD+RA+CFCH++ I++GY+CSVCLSI+C     C+TC + F
Sbjct: 239 RSQLILPPPIHVDYRAACFCHRSLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|332254335|ref|XP_003276282.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
           [Nomascus leucogenys]
          Length = 308

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 78/187 (41%), Positives = 116/187 (62%), Gaps = 9/187 (4%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILC 64
            +++ +++ M K +  G+         +LL+GSL+ ALCYI R+ +        + RIL 
Sbjct: 110 VIVEEIKDLMTKGDIKGQHTE------TLLAGSLAKALCYIHRMNKEIKDNQEMKSRILV 163

Query: 65  LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
           ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  
Sbjct: 164 IKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPS 222

Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 184
           L QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C
Sbjct: 223 LLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPIC 282

Query: 185 STCGSVF 191
           +TC + F
Sbjct: 283 TTCETAF 289


>gi|335775838|gb|AEH58705.1| general transcription factor IIH subunit-like protein [Equus
           caballus]
          Length = 302

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 76/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 113 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDTALQYMNFM 172

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 173 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 231

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 232 RSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 283


>gi|57105536|ref|XP_534644.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
           [Canis lupus familiaris]
          Length = 308

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 76/170 (44%), Positives = 111/170 (65%), Gaps = 3/170 (1%)

Query: 24  KQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNA 81
           K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  MN 
Sbjct: 121 KSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNV 180

Query: 82  IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 141
           IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+
Sbjct: 181 IFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRS 239

Query: 142 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 240 QLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|149720731|ref|XP_001498402.1| PREDICTED: general transcription factor IIH subunit 3-like [Equus
           caballus]
          Length = 309

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 76/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDTALQYMNFM 179

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 239 RSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|296213239|ref|XP_002753188.1| PREDICTED: general transcription factor IIH subunit 3 [Callithrix
           jacchus]
          Length = 309

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +        + RIL ++ + D   QY+  M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 179

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWMFLPDQDQ 238

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 239 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|428673524|ref|NP_001258796.1| general transcription factor IIH subunit 3 isoform c [Homo sapiens]
 gi|194389784|dbj|BAG60408.1| unnamed protein product [Homo sapiens]
          Length = 267

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 78  MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 137

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 138 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 196

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 197 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 248


>gi|417398730|gb|JAA46398.1| Putative rna polymer [Desmodus rotundus]
          Length = 308

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVVKAAEDSALQYMNFM 178

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 179 NVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGMYLKVPQVPSLLQYLLWVFLPDQDQ 237

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R  L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 238 RPQLTLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289


>gi|332840758|ref|XP_003314057.1| PREDICTED: general transcription factor IIH subunit 3 [Pan
           troglodytes]
 gi|397481836|ref|XP_003812143.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
           [Pan paniscus]
          Length = 267

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +        + RIL ++ + D   QY+  M
Sbjct: 78  MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 137

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 138 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 196

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 197 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 248


>gi|402888057|ref|XP_003907393.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
           [Papio anubis]
 gi|426374590|ref|XP_004054153.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
           [Gorilla gorilla gorilla]
          Length = 267

 Score =  147 bits (371), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +        + RIL ++ + D   QY+  M
Sbjct: 78  MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 137

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 138 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 196

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 197 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 248


>gi|297693367|ref|XP_002823992.1| PREDICTED: general transcription factor IIH subunit 3 [Pongo
           abelii]
          Length = 318

 Score =  147 bits (371), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +        + RIL ++ + D   QY+  M
Sbjct: 129 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 188

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 189 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 247

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 248 RSQLTLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 299


>gi|432873950|ref|XP_004072398.1| PREDICTED: general transcription factor IIH subunit 3-like [Oryzias
           latipes]
          Length = 300

 Score =  147 bits (370), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 82/181 (45%), Positives = 111/181 (61%), Gaps = 6/181 (3%)

Query: 13  EEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPD 70
           EE  N    L K E  G    +LL+GSL+ ALCYI RV +   +    + RIL ++ + D
Sbjct: 105 EEIRNV---LSKTEVRGNSTDTLLAGSLAKALCYIYRVSKELDVGQEIKSRILVIKAAED 161

Query: 71  GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL 130
              QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q   L QYLL
Sbjct: 162 CALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKIPQKLALTQYLL 220

Query: 131 TIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSV 190
            +F  D   R+ L LP P  VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + 
Sbjct: 221 WVFLPDSEQRSQLVLPPPAHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETA 280

Query: 191 F 191
           F
Sbjct: 281 F 281


>gi|158296528|ref|XP_316918.4| AGAP008522-PA [Anopheles gambiae str. PEST]
 gi|157014753|gb|EAA12181.5| AGAP008522-PA [Anopheles gambiae str. PEST]
          Length = 288

 Score =  147 bits (370), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 78/160 (48%), Positives = 103/160 (64%), Gaps = 3/160 (1%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SLL+GS+SMALCYI R+ R+     +   R+L + GS +   QY+  MN  F+AQ+  V 
Sbjct: 113 SLLAGSMSMALCYIARINRNKAPGSKINSRVLVVTGSNECASQYMTYMNVFFTAQKQGVV 172

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
           +D C L    S  LQQ   ITGG + + +QL+G  QYLL +F  D   R  L LP PV V
Sbjct: 173 VDVCALDKALS-LLQQGCDITGGQYLRLEQLEGFLQYLLWVFLPDPQMRCKLVLPPPVKV 231

Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           D+RA+CFCH+  ID+GY+CSVCLSI+CK    C+TC +VF
Sbjct: 232 DYRAACFCHRELIDIGYVCSVCLSIFCKFSPICTTCHTVF 271


>gi|195575703|ref|XP_002077716.1| GD22925 [Drosophila simulans]
 gi|194189725|gb|EDX03301.1| GD22925 [Drosophila simulans]
          Length = 299

 Score =  147 bits (370), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 80/162 (49%), Positives = 104/162 (64%), Gaps = 7/162 (4%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
           SLL+GS+SMALCYI R+ R+  L P      RIL L GS +   QY+  MN  F+AQ+  
Sbjct: 121 SLLAGSMSMALCYISRLQRN--LAPGVKMHSRILVLTGSNECASQYMTFMNVFFTAQKLG 178

Query: 90  VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
           + ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP P 
Sbjct: 179 ITIDTCALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPPPP 237

Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 238 KVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|441630270|ref|XP_004089523.1| PREDICTED: general transcription factor IIH subunit 3 isoform 3
           [Nomascus leucogenys]
          Length = 267

 Score =  147 bits (370), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 78/187 (41%), Positives = 116/187 (62%), Gaps = 9/187 (4%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILC 64
            +++ +++ M K +  G+         +LL+GSL+ ALCYI R+ +        + RIL 
Sbjct: 69  VIVEEIKDLMTKGDIKGQHTE------TLLAGSLAKALCYIHRMNKEIKDNQEMKSRILV 122

Query: 65  LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
           ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  
Sbjct: 123 IKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPS 181

Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 184
           L QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C
Sbjct: 182 LLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPIC 241

Query: 185 STCGSVF 191
           +TC + F
Sbjct: 242 TTCETAF 248


>gi|395332939|gb|EJF65317.1| transcription factor Tfb4 [Dichomitus squalens LYAD-421 SS1]
          Length = 324

 Score =  147 bits (370), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 113/199 (56%), Gaps = 16/199 (8%)

Query: 3   SLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVF----------R 52
           +L +T++ ++ E  N  +   ++EP G + C      L+ ALCYI R             
Sbjct: 93  TLDSTVMSSIAERFNSYDVTEEEEPIGLVGC------LTKALCYINRTISVVSASAQGTE 146

Query: 53  SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
              +   PRIL L  SPD    Y+ IMN+IFSAQ+  V ID C +    + FLQQA+++T
Sbjct: 147 GATISIDPRILVLSVSPDQSSTYIPIMNSIFSAQKLKVTIDVCKVFGAENVFLQQAAHLT 206

Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 172
           GG + + ++ D L QY++  F      R  + +P    VDFRA+CFCHK  +++G++CSV
Sbjct: 207 GGAYVQIERTDALLQYMMMSFLPPPAIRQLISVPTQDRVDFRAACFCHKRIVEIGFVCSV 266

Query: 173 CLSIYCKHLKKCSTCGSVF 191
           CLSI+C+ +  CSTC + F
Sbjct: 267 CLSIFCQPVPVCSTCRTKF 285


>gi|195350283|ref|XP_002041670.1| GM16625 [Drosophila sechellia]
 gi|194123443|gb|EDW45486.1| GM16625 [Drosophila sechellia]
          Length = 299

 Score =  147 bits (370), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 80/162 (49%), Positives = 104/162 (64%), Gaps = 7/162 (4%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
           SLL+GS+SMALCYI R+ R+  L P      RIL L GS +   QY+  MN  F+AQ+  
Sbjct: 121 SLLAGSMSMALCYISRLQRN--LAPGVKMHSRILVLTGSNECASQYMTFMNVFFTAQKLG 178

Query: 90  VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
           + ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP P 
Sbjct: 179 ITIDTCALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPPPP 237

Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 238 KVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|355693898|gb|AER99489.1| proteinral transcription factor IIH, polypeptide 3, 34kDa [Mustela
           putorius furo]
          Length = 286

 Score =  146 bits (369), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 76/168 (45%), Positives = 110/168 (65%), Gaps = 3/168 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 179

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
           R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC
Sbjct: 239 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTC 286


>gi|66730451|ref|NP_001019407.1| general transcription factor IIH subunit 3 [Rattus norvegicus]
 gi|81888006|sp|Q561R7.1|TF2H3_RAT RecName: Full=General transcription factor IIH subunit 3; AltName:
           Full=General transcription factor IIH polypeptide 3
 gi|62531219|gb|AAH93380.1| General transcription factor IIH, polypeptide 3 [Rattus norvegicus]
 gi|149063248|gb|EDM13571.1| general transcription factor IIH, polypeptide 3 [Rattus norvegicus]
          Length = 309

 Score =  146 bits (369), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 75/172 (43%), Positives = 112/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R  ++   + +   RIL ++ + D   QY+  M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRASKAVKDNQEMKSRILVIKAAEDSALQYMNFM 179

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP P+ VD+RA+CFCH++ I++GY+CSVCLSI+C     C+TC + F
Sbjct: 239 RSQLILPPPIHVDYRAACFCHRSLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|432090055|gb|ELK23655.1| General transcription factor IIH subunit 3 [Myotis davidii]
          Length = 247

 Score =  146 bits (368), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 78/188 (41%), Positives = 117/188 (62%), Gaps = 11/188 (5%)

Query: 7   TLLQNLEEFMNKDEQLGKQ-EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRIL 63
            +++ +++ M K +  G+  EP       LL+GSL+ ALCYI R+ +    + +   RIL
Sbjct: 49  VIVEEIKDLMTKSDIKGQHTEP-------LLAGSLAKALCYIHRMHKEVKDNQEMKSRIL 101

Query: 64  CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 123
            ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+ 
Sbjct: 102 VIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMP 160

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 183
            L QYLL +F  D   R+ L LP P+ VD+RA+C CH+N I++GY+CSVCLSI+C     
Sbjct: 161 SLLQYLLWVFLPDQDQRSQLTLPPPIHVDYRAACLCHRNLIEIGYVCSVCLSIFCNFSPI 220

Query: 184 CSTCGSVF 191
           C+TC + F
Sbjct: 221 CTTCETAF 228


>gi|24580793|ref|NP_608574.2| Tfb4 [Drosophila melanogaster]
 gi|22945518|gb|AAF51411.2| Tfb4 [Drosophila melanogaster]
 gi|94400646|gb|ABF17925.1| FI01003p [Drosophila melanogaster]
          Length = 299

 Score =  145 bits (367), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 79/162 (48%), Positives = 104/162 (64%), Gaps = 7/162 (4%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
           SLL+GS+SMALCYI R+ R+  L P      RIL + GS +   QY+  MN  F+AQ+  
Sbjct: 121 SLLAGSMSMALCYISRLQRN--LAPGVKMHSRILVVTGSNECASQYMTFMNVFFTAQKLG 178

Query: 90  VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
           + ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP P 
Sbjct: 179 ITIDTCALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPPPP 237

Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 238 KVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|198475301|ref|XP_001357006.2| GA18615 [Drosophila pseudoobscura pseudoobscura]
 gi|198138760|gb|EAL34072.2| GA18615 [Drosophila pseudoobscura pseudoobscura]
          Length = 298

 Score =  145 bits (367), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 77/160 (48%), Positives = 103/160 (64%), Gaps = 3/160 (1%)

Query: 34  SLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SLL+GS++MALCYI R+ R  +  +    RIL L GS +   QY+  MN  F+AQ+  + 
Sbjct: 121 SLLAGSMAMALCYISRLQRNVAAGVKMHSRILVLTGSNECSSQYMTFMNVFFTAQKLGIT 180

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
           ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP P  V
Sbjct: 181 IDTCALD-KTLSLLQQGCDITTGQFLKVTQLDGLLQYLLWVFLPAPQMRHKLVLPPPPKV 239

Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           D+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 240 DYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|195470427|ref|XP_002087508.1| GE15755 [Drosophila yakuba]
 gi|194173609|gb|EDW87220.1| GE15755 [Drosophila yakuba]
          Length = 299

 Score =  145 bits (367), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 77/160 (48%), Positives = 103/160 (64%), Gaps = 3/160 (1%)

Query: 34  SLLSGSLSMALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SLL+GS+SMALCYI R+ R+    +    RIL + GS +   QY+  MN  F+AQ+  + 
Sbjct: 121 SLLAGSMSMALCYISRLQRNVAPGVKMHSRILVITGSNECASQYMTFMNVFFTAQKLGIT 180

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
           ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP P  V
Sbjct: 181 IDTCALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPPPPKV 239

Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           D+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 240 DYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|354491442|ref|XP_003507864.1| PREDICTED: general transcription factor IIH subunit 3-like isoform
           1 [Cricetulus griseus]
          Length = 309

 Score =  145 bits (367), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 75/172 (43%), Positives = 113/172 (65%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
           + K + +G+   +LL+GSL+ ALCYI R+ ++   + +   RIL ++ + D   QY+  M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKAVKDNQEMKSRILVIKAAEDSALQYMNFM 179

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   
Sbjct: 180 NVIFAAQKQNILIDACILDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP P+ VD+RA+CFCH++ I++GY+CSVCLSI+C     C+TC + F
Sbjct: 239 RSQLILPPPIHVDYRAACFCHRSLIEIGYVCSVCLSIFCNFSPICTTCETAF 290


>gi|354491444|ref|XP_003507865.1| PREDICTED: general transcription factor IIH subunit 3-like isoform
           2 [Cricetulus griseus]
 gi|344248744|gb|EGW04848.1| General transcription factor IIH subunit 3 [Cricetulus griseus]
          Length = 313

 Score =  145 bits (367), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 75/174 (43%), Positives = 114/174 (65%), Gaps = 3/174 (1%)

Query: 20  EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVA 77
           + + K + +G+   +LL+GSL+ ALCYI R+ ++   + +   RIL ++ + D   QY+ 
Sbjct: 122 DLMTKSDIKGQHTETLLAGSLAKALCYIHRMNKAVKDNQEMKSRILVIKAAEDSALQYMN 181

Query: 78  IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
            MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D 
Sbjct: 182 FMNVIFAAQKQNILIDACILDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQ 240

Query: 138 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
             R+ L LP P+ VD+RA+CFCH++ I++GY+CSVCLSI+C     C+TC + F
Sbjct: 241 DQRSQLILPPPIHVDYRAACFCHRSLIEIGYVCSVCLSIFCNFSPICTTCETAF 294


>gi|190358598|ref|NP_001121894.1| uncharacterized protein LOC100151595 [Danio rerio]
          Length = 296

 Score =  145 bits (367), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 79/181 (43%), Positives = 110/181 (60%), Gaps = 6/181 (3%)

Query: 13  EEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPD 70
           EE  N  E   + E  G    +LL+ SL+ ALCYI R+ +        + RIL ++ + D
Sbjct: 101 EEIRNLME---RAEVSGSQTETLLAVSLARALCYINRISKDVQAGQEVKSRILVIKAAED 157

Query: 71  GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL 130
              QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ +  Q   L QYLL
Sbjct: 158 STSQYMNFMNVIFAAQKKNILIDACVLDS-DSGLLQQACDITGGLYLRVPQKVALTQYLL 216

Query: 131 TIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSV 190
            +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + 
Sbjct: 217 WVFLPDAEQRSQLLLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETA 276

Query: 191 F 191
           F
Sbjct: 277 F 277


>gi|409045010|gb|EKM54491.1| hypothetical protein PHACADRAFT_258371 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 360

 Score =  145 bits (367), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 75/190 (39%), Positives = 108/190 (56%), Gaps = 13/190 (6%)

Query: 12  LEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLH----------PQPR 61
           ++  +++ E LG+ E E  I      G+L+ A+CYI RV                 P PR
Sbjct: 99  MKHIVSEVEALGEPEVEEPIG---FVGALTKAMCYINRVTNGSTSSTSRNDDLATLPDPR 155

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           IL +  SPD    Y+ +MN IFSAQ+  V ID C +    S FLQQA+++TGG +    +
Sbjct: 156 ILVISVSPDQSSSYIPVMNLIFSAQKLKVTIDVCKIFGGESVFLQQAAHLTGGSYIDIDR 215

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
            D + QYL+  F      R  + +P    +DFRA+CFCHKN +D+G++CSVCLSI+C+ +
Sbjct: 216 PDAILQYLMMSFLPPPGLRQLISVPTQDKIDFRAACFCHKNIVDIGFVCSVCLSIFCQPV 275

Query: 182 KKCSTCGSVF 191
             CSTC + F
Sbjct: 276 PVCSTCRTKF 285


>gi|194759192|ref|XP_001961833.1| GF14734 [Drosophila ananassae]
 gi|190615530|gb|EDV31054.1| GF14734 [Drosophila ananassae]
          Length = 297

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 80/162 (49%), Positives = 106/162 (65%), Gaps = 7/162 (4%)

Query: 34  SLLSGSLSMALCYIQRVFR---SGL-LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
           SLL+GS+SMALCYI R+ R   SG+ +H   RIL L GS +   QY+  MN  F+AQ+  
Sbjct: 120 SLLAGSMSMALCYISRLQRNVTSGVKMHS--RILVLTGSNECASQYMTFMNVFFTAQKLG 177

Query: 90  VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
           + ID+C L  +  + LQQ   IT G   K  QL+GL QYLL +F      R+ L LP P 
Sbjct: 178 IVIDTCALD-KTLSLLQQGCDITSGQFLKVTQLEGLLQYLLWVFLPAPQMRHKLVLPPPP 236

Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 237 KVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 278


>gi|332375937|gb|AEE63109.1| unknown [Dendroctonus ponderosae]
          Length = 300

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 86/208 (41%), Positives = 119/208 (57%), Gaps = 20/208 (9%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-----QPR 61
           T+ QNL E +  +      E       SLL+G+++MALCYI R+ R+    P       R
Sbjct: 105 TVKQNLAELLASESSTAVTE-------SLLAGAIAMALCYIARLQRT---KPPGSKLNSR 154

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           IL + GS D   QY+  MN  F+AQ+  V +D C L  Q+ + LQQ   ITGG++ K  Q
Sbjct: 155 ILVVSGSGDSASQYMNYMNVFFTAQKLNVVLDVCALD-QHLSLLQQCCDITGGMYLKVPQ 213

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
           +  L Q+LL +F  +   R  L LP PV VD+RA+CFCH+  ID+G++CSVCLSI+CK  
Sbjct: 214 ITALLQFLLWVFLPEPPIREKLVLPPPVKVDYRAACFCHRELIDIGFVCSVCLSIFCKFS 273

Query: 182 KKCSTCGSVFGQAQTQSDEPSATNRKRK 209
             C+TC  VF      + +P    +KRK
Sbjct: 274 PICTTCHMVFKIPGPLAVKP----KKRK 297


>gi|47226419|emb|CAG08435.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 299

 Score =  145 bits (365), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 77/170 (45%), Positives = 106/170 (62%), Gaps = 3/170 (1%)

Query: 24  KQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIMNA 81
           K E  G    +LL+GSL+ ALCYI RV +   +    + RIL ++ + D   QY+  MN 
Sbjct: 114 KTEVRGNSNDTLLAGSLAKALCYINRVSKELEVGQETKSRILVIKAAEDSALQYMNFMNV 173

Query: 82  IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 141
           IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q   L QYLL +F  D   R+
Sbjct: 174 IFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKIPQKLALAQYLLWVFLPDTEQRS 232

Query: 142 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            L LP    VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 233 QLLLPPRAHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 282


>gi|195388531|ref|XP_002052933.1| GJ17829 [Drosophila virilis]
 gi|194149390|gb|EDW65088.1| GJ17829 [Drosophila virilis]
          Length = 300

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 82/178 (46%), Positives = 109/178 (61%), Gaps = 7/178 (3%)

Query: 20  EQLGK---QEPEGRIAC-SLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPE 73
           +QLG      P    +C SLL+GS+SMALCYI R+ R  +  +    RIL L GS +   
Sbjct: 104 QQLGSILMNAPRLSASCESLLAGSMSMALCYISRLQRNVAAGVKMHSRILVLTGSNECAS 163

Query: 74  QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIF 133
           QY+  MN  F+AQ+  + ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F
Sbjct: 164 QYMTYMNVFFTAQKLGIVIDTCALD-KTLSLLQQGCDITNGQFLKVTQLDGLLQYLLWVF 222

Query: 134 GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
                 R+ L LP    VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 223 LPSPQMRHKLVLPPTPKVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 280


>gi|17945214|gb|AAL48665.1| RE13149p [Drosophila melanogaster]
          Length = 299

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 78/162 (48%), Positives = 104/162 (64%), Gaps = 7/162 (4%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
           SLL+GS+SMALCYI R+ R+  L P      RIL + GS +   QY+  +N  F+AQ+  
Sbjct: 121 SLLAGSMSMALCYISRLQRN--LAPGVKMHSRILVVTGSNECASQYMTFLNVFFTAQKLG 178

Query: 90  VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
           + ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP P 
Sbjct: 179 ITIDTCALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPPPP 237

Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            VD+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 238 KVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|410922922|ref|XP_003974931.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Takifugu rubripes]
          Length = 298

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 77/170 (45%), Positives = 105/170 (61%), Gaps = 3/170 (1%)

Query: 24  KQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIMNA 81
           K E  G    +LL+GSL+ ALCYI RV +   +    + RIL ++ + D   QY+  MN 
Sbjct: 113 KTEVRGNSNDTLLAGSLAKALCYINRVSKEMEVGQETKSRILVIKAAEDCALQYMNFMNV 172

Query: 82  IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 141
           IF+AQ+  + ID+C L   +S  LQQA  ITGG++ K  Q   L QYLL +F  D   R+
Sbjct: 173 IFAAQKQNILIDACVLD-MDSGLLQQACDITGGLYLKIPQKAALAQYLLWVFLPDTEQRS 231

Query: 142 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            L LP    VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 232 QLLLPPRAHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 281


>gi|348528468|ref|XP_003451739.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Oreochromis niloticus]
          Length = 300

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 108/172 (62%), Gaps = 3/172 (1%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIM 79
           + K E  G    +LL+GSL+ ALCYI R+ +   +    + RIL ++ + D   QY+  M
Sbjct: 111 MSKIEVTGNSTDTLLAGSLAKALCYIHRLTKELEVGQEIKSRILVVKAAEDCALQYMNFM 170

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
           N IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q   L QYLL +F  D   
Sbjct: 171 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKIPQKVALAQYLLWVFLPDSEQ 229

Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R+ L LP P  VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 230 RSQLVLPPPAHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 281


>gi|170028289|ref|XP_001842028.1| TFIIH basal transcription factor complex p34 subunit [Culex
           quinquefasciatus]
 gi|167874183|gb|EDS37566.1| TFIIH basal transcription factor complex p34 subunit [Culex
           quinquefasciatus]
          Length = 292

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 86/191 (45%), Positives = 113/191 (59%), Gaps = 13/191 (6%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILC 64
           T+ Q L   +N+  +L           SLL+GS++MALCYI R+ R+    +    RIL 
Sbjct: 92  TIKQKLARMINEAPRLAAPTE------SLLAGSMAMALCYIARINRNKPAGVKINSRILV 145

Query: 65  LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
           + GS +   QY+  MN  F+AQ+  V +D C L    S  LQQ   ITGG + K  QLDG
Sbjct: 146 VTGSNECASQYMTYMNVFFTAQKQNVTLDVCALDKPLS-LLQQGCDITGGQYLKLPQLDG 204

Query: 125 LFQYLLTIFGTD--LHS--RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
             QYLL +F  D  +H   R  L LP PV VD+RA+CFCH+  ID+GY+CSVCLSI+CK 
Sbjct: 205 FLQYLLWVFLPDPQMHPQMRCKLVLPPPVKVDYRAACFCHRELIDIGYVCSVCLSIFCKF 264

Query: 181 LKKCSTCGSVF 191
              C+TC +VF
Sbjct: 265 SPICTTCHTVF 275


>gi|195159598|ref|XP_002020665.1| GL15043 [Drosophila persimilis]
 gi|194117615|gb|EDW39658.1| GL15043 [Drosophila persimilis]
          Length = 298

 Score =  144 bits (363), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/160 (47%), Positives = 102/160 (63%), Gaps = 3/160 (1%)

Query: 34  SLLSGSLSMALCYIQRV--FRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SLL+GS++MALCYI R+    +  +    RIL L GS +   QY+  MN  F+AQ+  + 
Sbjct: 121 SLLAGSMAMALCYISRLHGMLAAGVKMHSRILVLTGSNECSSQYMTFMNVFFTAQKLGIT 180

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
           ID+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP P  V
Sbjct: 181 IDTCALD-KTLSLLQQGCDITTGQFLKVTQLDGLLQYLLWVFLPAPQMRHKLVLPPPPKV 239

Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           D+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 240 DYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279


>gi|50540194|ref|NP_001002564.1| general transcription factor IIH, polypeptide 3 [Danio rerio]
 gi|49900709|gb|AAH76236.1| General transcription factor IIH, polypeptide 3 [Danio rerio]
          Length = 296

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 79/181 (43%), Positives = 109/181 (60%), Gaps = 6/181 (3%)

Query: 13  EEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPD 70
           EE  N  E   + E  G    +LL+ SL+ ALCYI R+ +        + RIL ++ + D
Sbjct: 101 EEIRNLME---RAEVSGSQTETLLAVSLARALCYINRISKDVQAGQEVKSRILVIKAAED 157

Query: 71  GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL 130
              QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ +  Q   L QYLL
Sbjct: 158 STSQYMNFMNVIFAAQKKNILIDACVLDS-DSGLLQQACDITGGLYLRVPQKVALTQYLL 216

Query: 131 TIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSV 190
             F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + 
Sbjct: 217 WAFLPDAEQRSQLLLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETA 276

Query: 191 F 191
           F
Sbjct: 277 F 277


>gi|449279312|gb|EMC86947.1| General transcription factor IIH subunit 3, partial [Columba livia]
          Length = 306

 Score =  144 bits (362), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 79/188 (42%), Positives = 112/188 (59%), Gaps = 15/188 (7%)

Query: 10  QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP------QPRIL 63
           + +++ M K E  G+Q        + L+GSL+ ALC I  +  S    P      + RIL
Sbjct: 109 EEIKDLMTKTEMKGQQTE------TQLAGSLAKALCCILLIVESS--KPLSNQEIKSRIL 160

Query: 64  CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 123
            ++ + D   QY+  MN IF+AQ+  V ID+C L + +S  LQQA  ITGG++ K   + 
Sbjct: 161 VIKAAEDSALQYMNFMNVIFAAQKQSVLIDACVLES-DSGLLQQACDITGGIYLKVLHMP 219

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 183
            L QYLL +F  D   R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     
Sbjct: 220 SLLQYLLWVFLPDQEQRSQLVLPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPI 279

Query: 184 CSTCGSVF 191
           CSTC + F
Sbjct: 280 CSTCETAF 287


>gi|426197233|gb|EKV47160.1| hypothetical protein AGABI2DRAFT_204058 [Agaricus bisporus var.
           bisporus H97]
          Length = 361

 Score =  144 bits (362), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 72/165 (43%), Positives = 103/165 (62%), Gaps = 7/165 (4%)

Query: 34  SLLSGSLSMALCYIQRVFRSGL-------LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ 86
           S L G+++ ALCY+ R+  S         + P PR+L L  SPD    Y+ IMN++FSAQ
Sbjct: 111 SALVGAITKALCYVNRLAISPSSAAPAEHVLPDPRLLILSVSPDLATSYIPIMNSVFSAQ 170

Query: 87  RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 146
           +  V ID C +   ++ FLQQA+++TGG +   ++ D L QYL+  F      R  L +P
Sbjct: 171 KLKVTIDVCQVFGPDTVFLQQAAHLTGGSYIHLERRDALLQYLIMAFLPPPPIRKVLAVP 230

Query: 147 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
               VDFRA+CFCHKN +D+G++CSVCLSI+C+ +  CSTC + F
Sbjct: 231 TQDKVDFRAACFCHKNIVDIGFVCSVCLSIFCQPVPVCSTCRTKF 275


>gi|195118052|ref|XP_002003554.1| GI21952 [Drosophila mojavensis]
 gi|193914129|gb|EDW12996.1| GI21952 [Drosophila mojavensis]
          Length = 299

 Score =  143 bits (361), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 75/160 (46%), Positives = 102/160 (63%), Gaps = 3/160 (1%)

Query: 34  SLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SLL+GS+SMALCYI R+ R+    +    RIL + GS +   QY+  MN  F+AQ+  + 
Sbjct: 122 SLLAGSMSMALCYISRLQRNVPAGVKMHSRILVITGSNECASQYMTFMNVFFTAQKLGIV 181

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
           +D+C L  +  + LQQ   IT G   K  QLDGL QYLL +F      R+ L LP    V
Sbjct: 182 VDTCALD-KTLSLLQQGCDITNGQFLKVNQLDGLLQYLLWVFLPSPQMRHKLVLPPTPKV 240

Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           D+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC ++F
Sbjct: 241 DYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 280


>gi|409080333|gb|EKM80693.1| hypothetical protein AGABI1DRAFT_119282 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 361

 Score =  143 bits (361), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 72/165 (43%), Positives = 103/165 (62%), Gaps = 7/165 (4%)

Query: 34  SLLSGSLSMALCYIQRVFRSGL-------LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ 86
           S L G+++ ALCY+ R+  S         + P PR+L L  SPD    Y+ IMN++FSAQ
Sbjct: 111 SALVGAITKALCYVNRLAISPSSAAPAEHVLPDPRLLILSVSPDLATSYIPIMNSVFSAQ 170

Query: 87  RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 146
           +  V ID C +   ++ FLQQA+++TGG +   ++ D L QYL+  F      R  L +P
Sbjct: 171 KLKVTIDVCQVFGPDTVFLQQAAHLTGGSYIHLERRDALLQYLIMAFLPPPPIRKVLAVP 230

Query: 147 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
               VDFRA+CFCHKN +D+G++CSVCLSI+C+ +  CSTC + F
Sbjct: 231 TQDKVDFRAACFCHKNIVDIGFVCSVCLSIFCQPVPVCSTCRTKF 275


>gi|389741855|gb|EIM83043.1| Tfb4-domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 400

 Score =  143 bits (361), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 78/188 (41%), Positives = 106/188 (56%), Gaps = 18/188 (9%)

Query: 12  LEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVF--------RSGLLHPQPRIL 63
           L E  NK E+    EP        L G+L+ ALCYI R+              H  PRIL
Sbjct: 110 LLEAQNKPEE----EPVA------LVGALTKALCYINRITHVPTANAPEDPAAHVDPRIL 159

Query: 64  CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 123
            L  SPD    Y+ +MN+IFSAQ+  V ID C +   ++ FLQQ +++T G +   ++ D
Sbjct: 160 ILSVSPDLSTAYIPVMNSIFSAQKLKVTIDVCKVYGPDTVFLQQTAHLTAGSYIYLERRD 219

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 183
            L QYL+  F      R  L +PK   +DFRA+CFCHKN +D+G++CSVCLSI+C  +  
Sbjct: 220 ALLQYLIMSFLPPPALRQILAVPKQDKIDFRAACFCHKNIVDIGFVCSVCLSIFCAPVPV 279

Query: 184 CSTCGSVF 191
           CSTC + F
Sbjct: 280 CSTCRTKF 287


>gi|351698565|gb|EHB01484.1| General transcription factor IIH subunit 3, partial [Heterocephalus
           glaber]
          Length = 315

 Score =  143 bits (360), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 75/166 (45%), Positives = 108/166 (65%), Gaps = 3/166 (1%)

Query: 28  EGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSA 85
           +G+   +LL+GSL+ ALCYI R+ +    + +   RIL ++ + D   QY+  MN IF+A
Sbjct: 132 KGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNVIFAA 191

Query: 86  QRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQL 145
           Q+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R  L L
Sbjct: 192 QKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMASLLQYLLWVFLPDQDQRAQLIL 250

Query: 146 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           P P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 251 PPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 296


>gi|345497965|ref|XP_001603853.2| PREDICTED: general transcription factor IIH subunit 3-like [Nasonia
           vitripennis]
          Length = 295

 Score =  143 bits (360), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 81/166 (48%), Positives = 104/166 (62%), Gaps = 9/166 (5%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SL+SG+LSMALCYI R+ R      H  PRIL +  S D   QY+  MN  F+AQ+  V 
Sbjct: 110 SLISGALSMALCYIARMEREKYAGEHLHPRILVITASHDSATQYMNYMNIFFTAQKMGVM 169

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHK-PQQ-----LDGLFQYLLTIFGTDLHSRNFLQL 145
           +D C L  + +  LQQ   +TGG + K PQQ     L GL QYLL +F  D   R+ L L
Sbjct: 170 LDVCSLDHELT-LLQQGCDLTGGNYLKIPQQSGQPPLAGLLQYLLWVFLPDPAVRSKLVL 228

Query: 146 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           P PV VD+RA+CFCH+  +D G++CSVCLSI+CK    C+TC +VF
Sbjct: 229 PPPVKVDYRAACFCHQELVDTGFVCSVCLSIFCKFSPICTTCHTVF 274


>gi|392570135|gb|EIW63308.1| transcription factor Tfb4 [Trametes versicolor FP-101664 SS1]
          Length = 329

 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 114/199 (57%), Gaps = 16/199 (8%)

Query: 3   SLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVF-RSGLLHPQ-- 59
           +L AT++++L E +       ++ P G + C      ++ ALCYI R+   +    P   
Sbjct: 93  TLDATVMKSLAERLESFGDAAEEAPIGIVGC------MTKALCYINRITSNTTAAGPNED 146

Query: 60  -------PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
                  PRIL L  SPD    Y+ IMN+IFSAQ+  V ID C +   ++ FLQQA+++T
Sbjct: 147 GPTASIDPRILVLSVSPDQSSTYIPIMNSIFSAQKLKVTIDVCKIFGADNVFLQQAAHLT 206

Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 172
           GG + +  + D L QY++  F      R  + +P    VDFRA+CFCHK  +++G++CSV
Sbjct: 207 GGAYVQVDRTDALLQYMMMSFLPPPAIRQLIAVPTQDRVDFRAACFCHKRIVEIGFVCSV 266

Query: 173 CLSIYCKHLKKCSTCGSVF 191
           CLSI+C+ +  CSTC + F
Sbjct: 267 CLSIFCQPVPVCSTCRTKF 285


>gi|412993660|emb|CCO14171.1| predicted protein [Bathycoccus prasinos]
          Length = 370

 Score =  142 bits (359), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 69/167 (41%), Positives = 105/167 (62%), Gaps = 11/167 (6%)

Query: 36  LSGSLSMALCYIQRVFRSGLLHP--QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
           ++G++S+AL Y  R+ R   L    + R+L +Q S D  EQYV +MNA+F+AQ S + ID
Sbjct: 203 IAGAISLALTYANRIERDPSLRGRVKSRVLVVQASEDDAEQYVPMMNAMFAAQSSNILID 262

Query: 94  SCYLGAQNSAFLQQASYITGGVHHKPQQ---------LDGLFQYLLTIFGTDLHSRNFLQ 144
              L  + S+FLQQA++++GG++ KP++          + L +  L  F  D ++R FL+
Sbjct: 263 CLCLNEKESSFLQQAAFVSGGIYAKPEKSKKGGGEETREDLLERALCSFLPDRYTRRFLR 322

Query: 145 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            P+   +DFR+SCFCH N I  G++CSVCLS++CK    C TCG+ F
Sbjct: 323 PPRAGSIDFRSSCFCHGNRISKGFVCSVCLSVFCKKEGACQTCGAAF 369


>gi|224156833|ref|XP_002337765.1| predicted protein [Populus trichocarpa]
 gi|222869675|gb|EEF06806.1| predicted protein [Populus trichocarpa]
          Length = 77

 Score =  142 bits (357), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 65/77 (84%), Positives = 71/77 (92%)

Query: 79  MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
           MNAIFSAQRSMVPIDSCY+GA NSAFLQQASYITGGV+ KPQ LDGLFQYL T+F TDLH
Sbjct: 1   MNAIFSAQRSMVPIDSCYVGAHNSAFLQQASYITGGVYVKPQHLDGLFQYLTTVFATDLH 60

Query: 139 SRNFLQLPKPVGVDFRA 155
           SR+F+QLP+P GVDFRA
Sbjct: 61  SRSFIQLPRPAGVDFRA 77


>gi|392593946|gb|EIW83271.1| transcription factor Tfb4 [Coniophora puteana RWD-64-598 SS2]
          Length = 327

 Score =  141 bits (356), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 74/167 (44%), Positives = 100/167 (59%), Gaps = 9/167 (5%)

Query: 33  CSLLSGSLSMALCYIQRVFRSGLLH--------PQPRILCLQGSPDGPEQYVAIMNAIFS 84
           CSL+S  L+ ALCYI R+               P PRIL L  SPD    Y+ IMN IFS
Sbjct: 117 CSLVSA-LAKALCYINRITHPSTSAGPEDPNELPDPRILILSVSPDLSSSYIPIMNCIFS 175

Query: 85  AQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQ 144
           AQ+  V ID C +    + FLQQA+++T G +   ++ D L QYL+  F      R  L 
Sbjct: 176 AQKLKVTIDVCKVYGPETVFLQQAAHLTEGSYIHLERRDALLQYLVMSFLAPPSVRGVLA 235

Query: 145 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           +P+   +DFRA+CFCHKN +D+G++CSVCLSI+C+ +  CSTC + F
Sbjct: 236 VPRQERIDFRAACFCHKNIVDIGFVCSVCLSIFCQPVPVCSTCRTKF 282


>gi|195437728|ref|XP_002066792.1| GK24367 [Drosophila willistoni]
 gi|194162877|gb|EDW77778.1| GK24367 [Drosophila willistoni]
          Length = 303

 Score =  141 bits (356), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 77/160 (48%), Positives = 100/160 (62%), Gaps = 3/160 (1%)

Query: 34  SLLSGSLSMALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SLL+GS+  ALCYI R+ R+    +    R+L L GS +   QY+  MN  F+AQ+  + 
Sbjct: 127 SLLAGSIGRALCYISRLQRNTPPGIKMHSRVLVLTGSNECSSQYMTFMNVFFTAQKLGIV 186

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
           ID+C L    S  LQQ   IT G   K  QLDGL QYLL +F      R+ L LP P  V
Sbjct: 187 IDTCSLENPVS-LLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPSPQMRHKLVLPPPPKV 245

Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           D+RASCFCH+  ID+GY+CSVCLS++CK+   C+TC +VF
Sbjct: 246 DYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTVF 285


>gi|432095027|gb|ELK26416.1| General transcription factor IIH subunit 3, partial [Myotis
           davidii]
          Length = 277

 Score =  141 bits (356), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 76/182 (41%), Positives = 112/182 (61%), Gaps = 17/182 (9%)

Query: 7   TLLQNLEEFMNKDEQLGKQ-EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCL 65
            +++ +++ M K +  G+  EP       LL+GSL+ ALCY Q +        + RIL +
Sbjct: 111 VIVEEIKDLMTKSDIKGQHTEP-------LLAGSLAKALCYNQEM--------KSRILVI 155

Query: 66  QGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGL 125
           + + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  L
Sbjct: 156 KAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSL 214

Query: 126 FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCS 185
            QYLL +F  D   R+ L LP P+ VD+RA+C CH+N I++GY+CSVCLSI+C     C+
Sbjct: 215 LQYLLWVFLPDQDQRSQLTLPPPIHVDYRAACLCHRNLIEIGYVCSVCLSIFCNFSPICT 274

Query: 186 TC 187
           TC
Sbjct: 275 TC 276


>gi|198432365|ref|XP_002120803.1| PREDICTED: similar to general transcription factor IIH, polypeptide
           3, 34kDa [Ciona intestinalis]
          Length = 309

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 71/164 (43%), Positives = 103/164 (62%), Gaps = 7/164 (4%)

Query: 34  SLLSGSLSMALCYIQR--VFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SL++GS+++ALCYI R  +   G      RIL ++ + D   QY++ MN IF+AQ+  V 
Sbjct: 126 SLIAGSMAIALCYIHRMQIESQGNTKLNARILVIKAADDAASQYMSFMNVIFAAQKERVV 185

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN----FLQLPK 147
           +D+C LG Q+S  LQQA  +T G++ K    D   QYLLT F T+   R       Q  +
Sbjct: 186 VDACILG-QDSGLLQQACDMTSGMYLKVPHQDAFLQYLLTAFLTEAELRKESLVLAQSSR 244

Query: 148 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           P  VD+RA+CFCH++ +D+G++CSVCLSI+C+    CSTC + F
Sbjct: 245 PREVDYRAACFCHRSLVDVGFVCSVCLSIFCQFSPICSTCETAF 288


>gi|393217355|gb|EJD02844.1| Tfb4-domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 378

 Score =  139 bits (351), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 68/165 (41%), Positives = 102/165 (61%), Gaps = 9/165 (5%)

Query: 36  LSGSLSMALCYIQRVFRSG---------LLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ 86
           L G+L+ ALC+I R+             ++ P+PRI+ L  SPD    Y+ +MN+IFSAQ
Sbjct: 121 LVGALTKALCHINRLVNPTATSIDSDVPVVPPEPRIVILSVSPDLSTSYIPLMNSIFSAQ 180

Query: 87  RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 146
           +  V ID C +   ++ FLQQA+++TGG +   ++ D L Q+LL  F      R  + +P
Sbjct: 181 KLKVAIDVCKIYGPDAVFLQQAAHLTGGSYIYLERRDALLQHLLMTFLPTPSIRQLIAVP 240

Query: 147 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
               +DFRA+CFCHK  +D+G++CSVCLSI+C+ +  CSTC + F
Sbjct: 241 TQDKIDFRAACFCHKTIVDIGFVCSVCLSIFCQPVPVCSTCRTKF 285


>gi|393244851|gb|EJD52362.1| transcription factor Tfb4 [Auricularia delicata TFB-10046 SS5]
          Length = 338

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 113/199 (56%), Gaps = 25/199 (12%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP------ 60
           T+++ + + ++    + ++EP        L G+L+ ALC+I R+      HP P      
Sbjct: 95  TVMKRVIDELSSLGDITEEEPTS------LVGALTKALCHINRI-----THPPPPANPSE 143

Query: 61  --------RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
                   RIL L  SPD    Y+ +MN+IFSAQ+    ID C +   ++ FLQQA+++T
Sbjct: 144 SPPTPPEARILILSVSPDLSTSYIPLMNSIFSAQKLKAIIDVCKVFGPDAVFLQQAAHLT 203

Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 172
           GG +    + + L QYL+  F +    R+ L +P    +DFRA+CFCHKN ID+GY+CSV
Sbjct: 204 GGSYVYLLEREALLQYLIMSFLSPPMVRSILSVPTQEKIDFRAACFCHKNIIDVGYVCSV 263

Query: 173 CLSIYCKHLKKCSTCGSVF 191
           CLSI+C  +  CSTC + F
Sbjct: 264 CLSIFCTPVAVCSTCRTKF 282


>gi|345564048|gb|EGX47029.1| hypothetical protein AOL_s00097g75 [Arthrobotrys oligospora ATCC
           24927]
          Length = 374

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 81/194 (41%), Positives = 109/194 (56%), Gaps = 36/194 (18%)

Query: 27  PEGRIAC---SLLSGSLSMALCYIQRVFRSGL---------------------------- 55
           PE  +AC   S L+G+LS AL YI R+   G                             
Sbjct: 136 PEAVMACPNSSYLAGALSAALTYINRICFGGTGGRSSGVPGDEKSKVNSAGGKVDETGRT 195

Query: 56  LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGV 115
           +H   +IL +  +PD   QY+ +MN+IF+AQR  VPID C L   ++ FLQQA+  TGGV
Sbjct: 196 MHA--KILVVSVTPDPANQYIPVMNSIFAAQRLKVPIDVCKL-RDSTVFLQQAADATGGV 252

Query: 116 HHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS 175
           + +P+   GL QYL+  F  D  +R  L LP  V VDFRA+CFCHK  +D+G++CSVCLS
Sbjct: 253 YMEPEHPQGLIQYLMMGFLPDHLARQSLILPTKVDVDFRAACFCHKKVLDIGFVCSVCLS 312

Query: 176 IYCKHLKK--CSTC 187
           I+C+  +   CSTC
Sbjct: 313 IFCEPPQGAVCSTC 326


>gi|451848115|gb|EMD61421.1| hypothetical protein COCSADRAFT_173760 [Cochliobolus sativus
           ND90Pr]
          Length = 377

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 83/231 (35%), Positives = 121/231 (52%), Gaps = 38/231 (16%)

Query: 6   ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQR---------------- 49
           +T+LQN  + +N+     K+        +L+ G+LS+AL +I +                
Sbjct: 126 STILQNFVKLLNET----KESHLEATPTTLIGGALSLALTHINKQTILHAPTAASAESAS 181

Query: 50  -----------VFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG 98
                      V R  L     RIL +  S D   QY+ +MN+IF+AQR  +PID   L 
Sbjct: 182 LAALADSENTHVDRVSL---TSRILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL- 237

Query: 99  AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCF 158
           A ++  LQQAS  TGGV+ KP++ +GL QYL+  F  D+ +R  L +P   GVDFRA+CF
Sbjct: 238 AGDTVLLQQASDATGGVYMKPERPEGLLQYLMMAFLPDVTARASLVVPSAGGVDFRAACF 297

Query: 159 CHKNTIDMGYICSVCLSIYCK---HLKKCSTCGSVFGQAQTQSDEPSATNR 206
           CH+  +D+G++CSVCLSI+C        C TCGS        ++ P+   R
Sbjct: 298 CHRKVVDIGFVCSVCLSIFCSPDLPDNLCLTCGSYLSLRSAMTNPPALIPR 348


>gi|451999247|gb|EMD91710.1| hypothetical protein COCHEDRAFT_1136566 [Cochliobolus
           heterostrophus C5]
          Length = 377

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 83/231 (35%), Positives = 121/231 (52%), Gaps = 38/231 (16%)

Query: 6   ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQR---------------- 49
           +T+LQN  + +N+     K+        +L+ G+LS+AL +I +                
Sbjct: 126 STILQNFVKLLNET----KESHLEATPTTLIGGALSLALTHINKQTILHAPTAASAESAS 181

Query: 50  -----------VFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG 98
                      V R  L     RIL +  S D   QY+ +MN+IF+AQR  +PID   L 
Sbjct: 182 LAALADSENTHVDRVSL---TSRILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL- 237

Query: 99  AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCF 158
           A ++  LQQAS  TGGV+ KP++ +GL QYL+  F  D+ +R  L +P   GVDFRA+CF
Sbjct: 238 AGDTVLLQQASDATGGVYMKPERPEGLLQYLMMAFLPDVTARASLVVPSAGGVDFRAACF 297

Query: 159 CHKNTIDMGYICSVCLSIYCKHL---KKCSTCGSVFGQAQTQSDEPSATNR 206
           CH+  +D+G++CSVCLSI+C        C TCGS        ++ P+   R
Sbjct: 298 CHRKVVDIGFVCSVCLSIFCSPDLPGNLCLTCGSYLSLRSAMTNPPALIPR 348


>gi|427787947|gb|JAA59425.1| Putative transcription factor tfb4 [Rhipicephalus pulchellus]
          Length = 293

 Score =  136 bits (342), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 77/174 (44%), Positives = 109/174 (62%), Gaps = 3/174 (1%)

Query: 20  EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVA 77
           E +   + E +++ +LL+G LS+ALCYI R+ +  +G      RIL L  S +   QY+ 
Sbjct: 100 ELVSSDKGESQVSETLLTGGLSLALCYINRIEKETAGQKKIASRILVLSASGESASQYLN 159

Query: 78  IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
            MN  F+AQ+  V ID+C L  ++S  LQQ   ITGG + K     GL +YLL +F  D 
Sbjct: 160 FMNVFFTAQKKNVIIDACVL-EKDSGLLQQGCDITGGKYMKVPNHTGLLEYLLWLFLPDK 218

Query: 138 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            +R++L  P PV VD+RA+CFCH+N I++GY+CSVCLSI+C     CSTC + F
Sbjct: 219 ATRDYLVFPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCAFSPICSTCQTAF 272


>gi|449546921|gb|EMD37890.1| hypothetical protein CERSUDRAFT_136815 [Ceriporiopsis subvermispora
           B]
          Length = 331

 Score =  135 bits (341), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 71/165 (43%), Positives = 98/165 (59%), Gaps = 9/165 (5%)

Query: 36  LSGSLSMALCYIQRVF-------RSG--LLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ 86
           L G+++ +LCYI R+        R+G   +   PRIL    SPD    Y+ IMN+IFSAQ
Sbjct: 120 LVGAITKSLCYINRLTIPSSSTTRAGEQAVPLDPRILIFSVSPDQSSSYIPIMNSIFSAQ 179

Query: 87  RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 146
           +  V ID C +   +S FLQQA+++TGG +    + D L QYL   F      R  L +P
Sbjct: 180 KLKVTIDVCKIYGSDSVFLQQAAHLTGGSYLHLDRPDALLQYLTMSFLPPPGIRQLLSVP 239

Query: 147 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
               +DFRA+CFCHK  +D+G++CSVCLSI+C  +  CSTC + F
Sbjct: 240 TQDKIDFRAACFCHKRIVDIGFVCSVCLSIFCNPVPVCSTCRTKF 284


>gi|213409415|ref|XP_002175478.1| RNA polymerase II transcription factor B subunit 4
           [Schizosaccharomyces japonicus yFS275]
 gi|212003525|gb|EEB09185.1| RNA polymerase II transcription factor B subunit 4
           [Schizosaccharomyces japonicus yFS275]
          Length = 295

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 68/158 (43%), Positives = 99/158 (62%), Gaps = 3/158 (1%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
           +++SG+LS AL YI R+     L  + RIL    S D   QY++ MN IF AQ+  +PI+
Sbjct: 132 TMMSGALSRALAYINRIQTESPL--RSRILIFSLSGDVALQYISTMNCIFCAQKKNIPIN 189

Query: 94  SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 153
            C L ++ + FL+QA   TGG++ K ++  GL Q+L+     D + R  + +P    VDF
Sbjct: 190 VCSL-SKETLFLEQAVDATGGIYIKVEEPKGLLQHLMMSLFPDQNLRKHINIPNQANVDF 248

Query: 154 RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           RA+CFCHK  +D+GY+CSVCLSI+C    +C+TC S F
Sbjct: 249 RATCFCHKKILDIGYVCSVCLSIFCSPRDQCTTCHSTF 286


>gi|320167472|gb|EFW44371.1| general transcription factor IIH [Capsaspora owczarzaki ATCC 30864]
          Length = 381

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)

Query: 25  QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMN 80
           + P      S L  SLS+ALC+I +  R+  L P    + RIL +  + D P  YV +MN
Sbjct: 201 KPPMSSAMTSQLVSSLSVALCHINKTRRA--LPPGVVCRSRILVVHAAEDVPAHYVQLMN 258

Query: 81  AIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSR 140
            IF+AQ+  V ID+  L   +S FLQQA  +T G + K Q  D L Q L+     D  +R
Sbjct: 259 TIFAAQKLGVIIDALILTPDDSGFLQQAVDLTKGAYIKLQSNDDLVQTLMMFGLADRAAR 318

Query: 141 NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
             L LP+P  VD+R++CFCH+  +D+G++CSVCLSI+C     C TCG+ F
Sbjct: 319 ELLILPQPTVVDYRSACFCHRRIVDVGFVCSVCLSIFCDQRGLCPTCGTRF 369


>gi|357624804|gb|EHJ75440.1| putative transcription factor TFIIH-subunit [Danaus plexippus]
          Length = 291

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 75/173 (43%), Positives = 106/173 (61%), Gaps = 4/173 (2%)

Query: 22  LGKQEP-EGRIACSLLSGSLSMALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAI 78
           L K +P E R   SLL+G+++M LC+I R+ R     L    R+L + GS D   QY+  
Sbjct: 103 LIKSQPQEERPGESLLAGAMAMGLCFIARMRREQPPGLRMVSRMLVVTGSSDTAAQYINY 162

Query: 79  MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
           MN  F+AQ+  V +D C L  ++ + LQQ   ITGG++ K   L+GL QYLL +F  +  
Sbjct: 163 MNVFFTAQKQQVLLDVCSLD-KHLSLLQQGCDITGGLYLKVPSLEGLLQYLLWVFLPEAS 221

Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            R  L LP    VD+RA+CFCH+  + +GY+CSVCLS++CK    C+TC +VF
Sbjct: 222 ERRELVLPGRGRVDYRAACFCHRELLTIGYVCSVCLSVFCKFSPICTTCHTVF 274


>gi|391332492|ref|XP_003740668.1| PREDICTED: general transcription factor IIH subunit 3-like
           [Metaseiulus occidentalis]
          Length = 297

 Score =  135 bits (339), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 70/158 (44%), Positives = 97/158 (61%), Gaps = 3/158 (1%)

Query: 35  LLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 94
           LL+G +++ALCYI +   + +     RIL L  S +   QY+  MN  F+AQR  + +DS
Sbjct: 119 LLTGGMALALCYIAKRMANTVGKLNGRILVLSSSGESAHQYLNFMNVFFAAQRKAIVVDS 178

Query: 95  CYLGAQNS-AFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 153
           C +  QN    LQQ + ITGG + K   + GL +YLL +F      R  L LPKP  VD+
Sbjct: 179 CVI--QNDCGLLQQGADITGGNYLKAPSIAGLLEYLLWVFLPPKSLRAKLVLPKPDVVDY 236

Query: 154 RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           R +CFCH+N I++GY+CSVCLSI+C+    CSTC + F
Sbjct: 237 RTACFCHRNLIEVGYVCSVCLSIFCQFAPICSTCQTHF 274


>gi|321470747|gb|EFX81722.1| hypothetical protein DAPPUDRAFT_224195 [Daphnia pulex]
          Length = 301

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 70/160 (43%), Positives = 99/160 (61%), Gaps = 3/160 (1%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           SLL G+++ ALC+I ++ R   L  Q  PRIL + GS D   QY+  MN  F+AQ+  V 
Sbjct: 124 SLLGGAMARALCHIHKIQRELSLSHQLKPRILVVSGSSDSALQYMTFMNVFFTAQKENVV 183

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
           ID C + + +S  LQQ   ITGG + +   + GL +YLL +F      R+ + LP P  V
Sbjct: 184 IDCCMMDS-DSGLLQQGCDITGGQYLRIPTVVGLLEYLLWVFLPGPSCRSKIVLPPPTKV 242

Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           D+RA+CFCH   +D+G++CSVCLSI+CK    C+TC + F
Sbjct: 243 DYRAACFCHHKLVDIGWVCSVCLSIFCKFSIICTTCNTEF 282


>gi|225717796|gb|ACO14744.1| General transcription factor IIH subunit 3 [Caligus clemensi]
          Length = 302

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 85/200 (42%), Positives = 113/200 (56%), Gaps = 10/200 (5%)

Query: 17  NKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYV 76
           NK  QL + E  G    S LSGSL+MAL YI R  R      + RIL L  S D   QY+
Sbjct: 106 NKFSQLLEAETSGS-EDSPLSGSLAMALAYINRK-RKESPELRARILILTASGDTASQYM 163

Query: 77  AIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTD 136
             MN  F+AQ+  V +D+C L + +S  LQQ + ITGG++     L  L Q+LL IF   
Sbjct: 164 NYMNVFFTAQKLNVLLDACMLQS-DSPLLQQGADITGGLYFNVPDLAALLQFLLWIFLPS 222

Query: 137 LHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF----- 191
              R  L LP  V VDFRA+CFCH+  + +G++CSVCLSI+CK    C+TC +VF     
Sbjct: 223 AEIRPQLGLPTGVKVDFRAACFCHRQLVGVGFVCSVCLSIFCKFSPICTTCQTVFRVPAP 282

Query: 192 --GQAQTQSDEPSATNRKRK 209
             G+ +    + S++  KRK
Sbjct: 283 LLGKKKKSIAQGSSSLLKRK 302


>gi|346472167|gb|AEO35928.1| hypothetical protein [Amblyomma maculatum]
          Length = 293

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 80/174 (45%), Positives = 108/174 (62%), Gaps = 3/174 (1%)

Query: 20  EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVA 77
           E +   E E  ++ +LL+G LS+ALC+I RV +  +G      RIL L  S +   QY+ 
Sbjct: 100 ELVLSDENESLVSETLLTGGLSLALCHINRVEKETAGQKKIASRILVLSASGESASQYLN 159

Query: 78  IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
            MN  F+AQ+  V ID+C L  ++S  LQQ   ITGG + K     GL QYLL +F  D 
Sbjct: 160 FMNVFFTAQKKNVIIDACVL-EKDSGLLQQGCDITGGKYMKVPNHAGLLQYLLWLFLPDK 218

Query: 138 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            SR++L  P PV VD+RA+CFCH+N I++GY+CSVCLSI+C     CSTC + F
Sbjct: 219 ASRDYLVFPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCAFSPICSTCQTAF 272


>gi|119618834|gb|EAW98428.1| general transcription factor IIH, polypeptide 3, 34kDa, isoform
           CRA_c [Homo sapiens]
          Length = 169

 Score =  133 bits (335), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 65/138 (47%), Positives = 92/138 (66%), Gaps = 1/138 (0%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RIL ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  
Sbjct: 14  RILVIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVP 72

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
           Q+  L QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C  
Sbjct: 73  QMPSLLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNF 132

Query: 181 LKKCSTCGSVFGQAQTQS 198
              C+TC +   + +T+S
Sbjct: 133 SPICTTCDAESQEKETES 150


>gi|195995769|ref|XP_002107753.1| hypothetical protein TRIADDRAFT_51576 [Trichoplax adhaerens]
 gi|190588529|gb|EDV28551.1| hypothetical protein TRIADDRAFT_51576 [Trichoplax adhaerens]
          Length = 276

 Score =  133 bits (335), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 68/133 (51%), Positives = 89/133 (66%), Gaps = 1/133 (0%)

Query: 59  QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
           + RIL ++ +PD   QY+ IMN IF+AQ+  VPID+C +  ++S FLQQA  ITGG++ K
Sbjct: 134 KSRILVIKCTPDTSFQYLPIMNCIFTAQKYNVPIDACVI-EKDSGFLQQACDITGGIYLK 192

Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
            Q   GL QYLLT F  +   R  L LP    VD+RA+CFCH+  I++GYICSVCLSI C
Sbjct: 193 LQSPSGLLQYLLTAFLPNQSMRETLILPPKAKVDYRAACFCHRELIEIGYICSVCLSISC 252

Query: 179 KHLKKCSTCGSVF 191
           K +  C TC + F
Sbjct: 253 KFVPVCPTCQTHF 265


>gi|189189624|ref|XP_001931151.1| RNA polymerase II transcription factor B subunit 4 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
 gi|187972757|gb|EDU40256.1| RNA polymerase II transcription factor B subunit 4 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
          Length = 370

 Score =  133 bits (335), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 67/149 (44%), Positives = 92/149 (61%), Gaps = 4/149 (2%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RIL +  S D   QY+ +MN+IF+AQR  +PID   L A ++  LQQAS  TGGV+ KP 
Sbjct: 194 RILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL-AGDTVLLQQASDATGGVYMKPD 252

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK- 179
           + +GL QYL+  F  D+ +R  L +P   GVDFRA+CFCH+  +D+G++CSVCLSI+C  
Sbjct: 253 RPEGLLQYLMMAFLPDVTARTSLVVPSAGGVDFRAACFCHRKVVDIGFVCSVCLSIFCNP 312

Query: 180 --HLKKCSTCGSVFGQAQTQSDEPSATNR 206
                 C TCGS        ++ P+   R
Sbjct: 313 DLSDNLCMTCGSYLSLRSAVTNPPALIPR 341


>gi|449527735|ref|XP_004170865.1| PREDICTED: general transcription factor IIH subunit 3-like [Cucumis
           sativus]
          Length = 187

 Score =  133 bits (334), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 72/90 (80%), Positives = 79/90 (87%)

Query: 1   MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP 60
           MP+LC  LL+NLEEF+  DEQ  K++P+G    SLLSGSLSMALCYIQ+VFRSG LHPQP
Sbjct: 86  MPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSLLSGSLSMALCYIQKVFRSGSLHPQP 145

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMV 90
           RILCLQGSPDGPEQYVAIMNAIFSAQRSMV
Sbjct: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175


>gi|396462438|ref|XP_003835830.1| similar to RNA polymerase II transcription factor B subunit 4
           [Leptosphaeria maculans JN3]
 gi|312212382|emb|CBX92465.1| similar to RNA polymerase II transcription factor B subunit 4
           [Leptosphaeria maculans JN3]
          Length = 375

 Score =  133 bits (334), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 67/149 (44%), Positives = 93/149 (62%), Gaps = 4/149 (2%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RIL +  S D   QY+ +MN+IF+AQR  +PID   L A ++  LQQAS  TGGV+ KP+
Sbjct: 199 RILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL-AGDTVLLQQASDATGGVYMKPE 257

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK- 179
           + +GL QYL+  F  D+ +R  L +P   GVDFRA+CFCH+  +D+G++CSVCLSI+C  
Sbjct: 258 RPEGLLQYLMMAFLPDVTARASLVVPNAGGVDFRAACFCHRRVVDIGFVCSVCLSIFCSP 317

Query: 180 --HLKKCSTCGSVFGQAQTQSDEPSATNR 206
                 C TCGS        ++ P+   R
Sbjct: 318 DLPDNLCLTCGSYLSLRSAMTNTPALIPR 346


>gi|296421687|ref|XP_002840396.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295636611|emb|CAZ84587.1| unnamed protein product [Tuber melanosporum]
          Length = 357

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 70/173 (40%), Positives = 104/173 (60%), Gaps = 18/173 (10%)

Query: 34  SLLSGSLSMALCYIQRVFR---SG------------LLHPQPRILCLQGSPDGPEQYVAI 78
           ++++G+L +AL Y+ R+ R   SG            +     RIL +  S D  EQYV +
Sbjct: 136 TMMAGALGLALGYVNRLVRVDDSGGVVGGKDGAVGEVTSMNARILVVSVSGDLAEQYVPV 195

Query: 79  MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
           MN+IF+AQR  +PID C +  + + FLQQAS  T G++ + +  + L QYL+  F  D  
Sbjct: 196 MNSIFAAQRKKIPIDICKVAGE-TVFLQQASDTTRGIYMQLEHPESLLQYLMMCFIPDTA 254

Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGS 189
           +R  L  P  V VDFRA+CFCHKN +D+G++CS+CLSI+C   +   C+TC +
Sbjct: 255 TRRHLVPPTQVNVDFRAACFCHKNVVDIGFVCSICLSIFCTPPEGAICTTCNT 307


>gi|402224045|gb|EJU04108.1| transcription factor Tfb4 [Dacryopinax sp. DJM-731 SS1]
          Length = 305

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 77/191 (40%), Positives = 103/191 (53%), Gaps = 12/191 (6%)

Query: 13  EEFMNKDEQLGK--QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP---------QPR 61
           E FM   EQ  +  QE E  +  +L+ G+L+ +LCYI R+                   R
Sbjct: 90  EGFMRSVEQEVEDLQEVEEDVPPALV-GALTKSLCYINRLSNPPPPSNPSEAPPPPPHSR 148

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           IL    SPD    Y+  MN IFSAQ+  V ID C L      FL QA ++TGG      Q
Sbjct: 149 ILLFSVSPDASVAYIPFMNCIFSAQKLKVAIDVCKLDESEVIFLHQACHLTGGAFVPITQ 208

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
            + L QYL   F ++  +R  L +P    VDFRA+CFCHK  +D+GY+CSVCLSI+C+ +
Sbjct: 209 REALLQYLSMCFLSETETRKTLAVPTLDKVDFRAACFCHKEIVDIGYVCSVCLSIFCQPV 268

Query: 182 KKCSTCGSVFG 192
            +CSTC + F 
Sbjct: 269 LECSTCRTKFS 279


>gi|388582433|gb|EIM22738.1| transcription factor Tfb4 [Wallemia sebi CBS 633.66]
          Length = 272

 Score =  132 bits (333), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 75/183 (40%), Positives = 106/183 (57%), Gaps = 18/183 (9%)

Query: 32  ACSLLSGSLSMALCYIQRVF----RSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 87
           A S L+ +L+ +LC++ R      +S       RIL +  S D P +YV++MN+IF+AQ+
Sbjct: 85  ASSTLASTLAQSLCHVNRKIVQESKSTF-----RILVVAASDDVPPEYVSLMNSIFAAQK 139

Query: 88  SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 147
           S + ID   +   N+ FLQQA+++TGG   +    + L QYL+  F      R  L  P+
Sbjct: 140 SKIVIDVVQIYNCNTIFLQQAAHLTGGNFIQATDPESLVQYLIMAFLPSQPLRARLVQPR 199

Query: 148 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF---------GQAQTQS 198
              VDFRA+CFCHK  +D+ YICSVCLSI+C+  ++CSTC SVF         G  QT S
Sbjct: 200 SDKVDFRAACFCHKRIVDIAYICSVCLSIFCQPPQRCSTCNSVFPAKTLQKFTGSLQTHS 259

Query: 199 DEP 201
             P
Sbjct: 260 HAP 262


>gi|452820306|gb|EME27350.1| transcription initiation factor TFIIH subunit H3 isoform 2
           [Galdieria sulphuraria]
 gi|452820307|gb|EME27351.1| transcription initiation factor TFIIH subunit H3 isoform 1
           [Galdieria sulphuraria]
          Length = 328

 Score =  132 bits (333), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 69/162 (42%), Positives = 99/162 (61%), Gaps = 4/162 (2%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
           +LLS SLS ALCY+ R   S     + RILCL  S D P+QY+ +MN+IF AQR  + ID
Sbjct: 165 TLLSASLSFALCYLHRRILSE--RYEARILCLHASSDVPKQYIGVMNSIFCAQRWSILID 222

Query: 94  SCYLGAQNSAFLQQASYITGGVHHKPQQLDG--LFQYLLTIFGTDLHSRNFLQLPKPVGV 151
            C L  + S+ LQQA+++TGG++ KP+Q +   L  YL +++     +R +L++P    V
Sbjct: 223 VCNLHEEESSLLQQAAHLTGGIYFKPKQSEAIPLSNYLCSLYFPSRDARKYLKIPVLNSV 282

Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQ 193
           D  A CF     I MGYICSVCL+ + +    C+ CG+ F +
Sbjct: 283 DCSALCFESGEKIHMGYICSVCLATFGRKRLSCNICGAEFAE 324


>gi|428673526|ref|NP_001258797.1| general transcription factor IIH subunit 3 isoform d [Homo sapiens]
          Length = 162

 Score =  132 bits (333), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 64/131 (48%), Positives = 88/131 (67%), Gaps = 1/131 (0%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RIL ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  
Sbjct: 14  RILVIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVP 72

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
           Q+  L QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C  
Sbjct: 73  QMPSLLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNF 132

Query: 181 LKKCSTCGSVF 191
              C+TC + F
Sbjct: 133 SPICTTCETAF 143


>gi|441630273|ref|XP_004089524.1| PREDICTED: general transcription factor IIH subunit 3 isoform 4
           [Nomascus leucogenys]
          Length = 162

 Score =  132 bits (333), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 64/131 (48%), Positives = 88/131 (67%), Gaps = 1/131 (0%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RIL ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  
Sbjct: 14  RILVIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVP 72

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
           Q+  L QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C  
Sbjct: 73  QMPSLLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNF 132

Query: 181 LKKCSTCGSVF 191
              C+TC + F
Sbjct: 133 SPICTTCETAF 143


>gi|353240140|emb|CCA72023.1| related to TFIIH basal transcription factor complex p34 subunit
           [Piriformospora indica DSM 11827]
          Length = 355

 Score =  132 bits (332), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 72/195 (36%), Positives = 112/195 (57%), Gaps = 13/195 (6%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYI-QRVFRSGLLH-------- 57
           T+   + + +N++ +   ++ E  IA   L G+L+ +LC I +R   S  L+        
Sbjct: 91  TMNSVIAQRLNEELESLPEDAEAPIA---LVGALTKSLCTINRRSHPSDTLNATSPNPSK 147

Query: 58  -PQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
            P PR+L +  SPD    Y+ IMN+IF AQ+  + ID C +   ++ FLQQA+++TGG +
Sbjct: 148 LPTPRLLVISVSPDLSASYIPIMNSIFCAQKLKLSIDVCKIFGPDNVFLQQAAHLTGGSY 207

Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 176
               + D + QYL+T F +    R  + +P    +D RA+CFCHK  +D+G+ICS+CLSI
Sbjct: 208 ISIDRRDSILQYLMTTFLSPPAVRRLMAVPTEDRIDLRAACFCHKQIVDIGFICSICLSI 267

Query: 177 YCKHLKKCSTCGSVF 191
           +CK    CSTC   F
Sbjct: 268 FCKPTPVCSTCRVKF 282


>gi|330933521|ref|XP_003304199.1| hypothetical protein PTT_16692 [Pyrenophora teres f. teres 0-1]
 gi|311319344|gb|EFQ87711.1| hypothetical protein PTT_16692 [Pyrenophora teres f. teres 0-1]
          Length = 376

 Score =  132 bits (332), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 67/149 (44%), Positives = 91/149 (61%), Gaps = 4/149 (2%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RIL +  S D   QY+ +MN+IF+AQR  +PID   L A ++  LQQAS  TGGV+ KP 
Sbjct: 200 RILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL-AGDTVLLQQASDATGGVYMKPD 258

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK- 179
           + +GL QYL+  F  D  +R  L +P   GVDFRA+CFCH+  +D+G++CSVCLSI+C  
Sbjct: 259 RPEGLLQYLMMAFLPDATARTSLVVPSAGGVDFRAACFCHRKVVDIGFVCSVCLSIFCNP 318

Query: 180 --HLKKCSTCGSVFGQAQTQSDEPSATNR 206
                 C TCGS        ++ P+   R
Sbjct: 319 DLSDNLCMTCGSYLSLRSAVTNPPALIPR 347


>gi|327276109|ref|XP_003222813.1| PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH
           subunit 3-like [Anolis carolinensis]
          Length = 360

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 76/193 (39%), Positives = 113/193 (58%), Gaps = 9/193 (4%)

Query: 1   MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR--SGLLHP 58
           + ++  T+ + +++ M K   +  Q+ E     + L   L+ AL YI RV +  S     
Sbjct: 156 LTTVNETIAEEIQDLMTK-SGIKWQQTE-----TCLYXYLTKALWYIHRVSKTESASQEI 209

Query: 59  QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
           + RIL ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K
Sbjct: 210 KSRILVIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLES-DSGLLQQACDITGGIYLK 268

Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
                 L QYLL +F  D   R+ L LP  V VD+RA+CFCH+N I++GY+CSVCLSI+C
Sbjct: 269 VPHKPSLLQYLLWVFLPDHDQRSQLILPPRVHVDYRAACFCHRNLIEVGYVCSVCLSIFC 328

Query: 179 KHLKKCSTCGSVF 191
                C+TC + F
Sbjct: 329 NFSPICTTCETAF 341


>gi|407926572|gb|EKG19539.1| Transcription factor Tfb4 [Macrophomina phaseolina MS6]
          Length = 380

 Score =  132 bits (331), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 80/206 (38%), Positives = 109/206 (52%), Gaps = 34/206 (16%)

Query: 8   LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRV----------------- 50
           LL NL   M +  ++  +        +L++G+L+ AL YI +                  
Sbjct: 131 LLANLRVLMARTSEMDLKAS----TTTLVAGALTTALAYIAKANIKSAPQVGDTQMADLA 186

Query: 51  -------FRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSA 103
                   RS L     RIL L  S D   QY+ +MNAIF+AQR  +PID   L A ++ 
Sbjct: 187 TAADPPAARSSLTS---RILVLSVSGDLASQYIPVMNAIFAAQRQRIPIDILKL-AGDTV 242

Query: 104 FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 163
           FLQQA   TGG++ KP   +GL QYL+  F  D  +R  L LP    VDFRA+CFCH+  
Sbjct: 243 FLQQACDATGGIYLKPLAPEGLLQYLMMAFLPDETARKHLVLPSAGEVDFRAACFCHRRV 302

Query: 164 IDMGYICSVCLSIYCKHL--KKCSTC 187
           +D+GY+CSVCLSI+C+    + C TC
Sbjct: 303 VDIGYVCSVCLSIFCEPPADQTCPTC 328


>gi|189241296|ref|XP_975136.2| PREDICTED: similar to transcription factor TFIIH-subunit, putative
           [Tribolium castaneum]
 gi|270013170|gb|EFA09618.1| hypothetical protein TcasGA2_TC011739 [Tribolium castaneum]
          Length = 232

 Score =  131 bits (330), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 89/207 (42%), Positives = 124/207 (59%), Gaps = 18/207 (8%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSG----LLHPQPRI 62
           T+ QNL + +  ++     E       SLL+G+++MALCYI R+ R+     +L+   RI
Sbjct: 37  TIKQNLAKLLATEKSTLVTE-------SLLAGAIAMALCYIARIQRTKPPGCVLNS--RI 87

Query: 63  LCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL 122
           L + GS D   QY+  MN  F+AQ+  + ID C L  Q+ + LQQ   ITGG++ K  QL
Sbjct: 88  LVVTGSGDSASQYMNYMNVFFTAQKQGIVIDVCALD-QHLSLLQQGCDITGGLYLKVPQL 146

Query: 123 DGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLK 182
            GL QYLL +F  +   R  L LP PV VD+RA+CFCH+  ID+GY+CSVCLSI+CK   
Sbjct: 147 QGLLQYLLWVFLPEPPIRQKLVLPPPVKVDYRAACFCHRELIDIGYVCSVCLSIFCKFSP 206

Query: 183 KCSTCGSVFGQAQTQSDEPSATNRKRK 209
            C+TC +VF      + +P    +KRK
Sbjct: 207 ICTTCHTVFKIPGPLAVKP----KKRK 229


>gi|19112324|ref|NP_595532.1| transcription factor TFIIH complex subunit Tfb4
           [Schizosaccharomyces pombe 972h-]
 gi|62901129|sp|O74366.1|TFB4_SCHPO RecName: Full=RNA polymerase II transcription factor B subunit 4;
           AltName: Full=RNA polymerase II transcription factor B
           34 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p34 subunit
 gi|3417433|emb|CAA20320.1| transcription factor TFIIH complex subunit Tfb4
           [Schizosaccharomyces pombe]
          Length = 297

 Score =  129 bits (324), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 69/184 (37%), Positives = 105/184 (57%), Gaps = 11/184 (5%)

Query: 8   LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 67
           +L  ++  M+  +++ ++        +++SG+LS AL YI +V     L  + RIL    
Sbjct: 114 VLSGMKRLMSSTDKVSRK--------TMISGALSRALAYINQVQNKNTL--RSRILIFSL 163

Query: 68  SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
           + D   QY+  MN IF AQ+  +PI+ C +    + FL+QA+  TGG++ K     GL Q
Sbjct: 164 TGDVALQYIPTMNCIFCAQKKNIPINVCNIEG-GTLFLEQAADATGGIYLKVDNPKGLLQ 222

Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
           YL+     D + R  L  P    VDFRA+CFCHK  +D+G++CSVCLSI+C+    CSTC
Sbjct: 223 YLMMSLFPDQNLRKHLNTPNQANVDFRATCFCHKKVLDIGFVCSVCLSIFCEPRVHCSTC 282

Query: 188 GSVF 191
            + F
Sbjct: 283 HTKF 286


>gi|2257568|dbj|BAA21460.1| basic transcription factor 2.35KD subunit [Schizosaccharomyces
           pombe]
          Length = 298

 Score =  129 bits (323), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 69/184 (37%), Positives = 105/184 (57%), Gaps = 11/184 (5%)

Query: 8   LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 67
           +L  ++  M+  +++ ++        +++SG+LS AL YI +V     L  + RIL    
Sbjct: 114 VLSGMKRLMSSTDKVSRK--------TMISGALSRALAYINQVQNKNTL--RSRILIFSL 163

Query: 68  SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
           + D   QY+  MN IF AQ+  +PI+ C +    + FL+QA+  TGG++ K     GL Q
Sbjct: 164 TGDVALQYIPTMNCIFCAQKKNIPINVCNIEG-GTLFLEQAADATGGIYLKVDNPKGLLQ 222

Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
           YL+     D + R  L  P    VDFRA+CFCHK  +D+G++CSVCLSI+C+    CSTC
Sbjct: 223 YLMMSLFPDQNLRKHLNTPNQANVDFRATCFCHKKVLDIGFVCSVCLSIFCEPRVHCSTC 282

Query: 188 GSVF 191
            + F
Sbjct: 283 HTKF 286


>gi|336371329|gb|EGN99668.1| hypothetical protein SERLA73DRAFT_53296 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 297

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 78/202 (38%), Positives = 109/202 (53%), Gaps = 29/202 (14%)

Query: 12  LEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALC--YIQRVFRSG------LLHPQPRIL 63
           +   MN+ + LG  + E   A   L G+L+ ALC   I   F S        + P PRIL
Sbjct: 98  VRSIMNEFDALGNLDEEPPCA---LVGALTKALCCKSISAHFHSPNAVEDPAVLPDPRIL 154

Query: 64  CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 123
            L  SPD    Y+ IMN+IFSAQ+  V ID C +   ++ FLQQA+++TGG +   ++ D
Sbjct: 155 ILSVSPDLSTSYIPIMNSIFSAQKLKVTIDVCKVYGPDAVFLQQAAHLTGGSYIFLERRD 214

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS-------- 175
            L QYL   F +    R  L +PK   +DFRA+CFCHKN +D+G++CSVCLS        
Sbjct: 215 ALLQYLTMSFLSPPALRQVLAVPKQDKIDFRAACFCHKNIVDVGFVCSVCLSSNHILFYV 274

Query: 176 ----------IYCKHLKKCSTC 187
                     ++C+ +  CSTC
Sbjct: 275 VKLWFTYMCTVFCQPVPVCSTC 296


>gi|430811432|emb|CCJ31073.1| unnamed protein product [Pneumocystis jirovecii]
 gi|430812043|emb|CCJ30521.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 284

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 70/166 (42%), Positives = 98/166 (59%), Gaps = 7/166 (4%)

Query: 11  NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQGSP 69
           NL++ ++K  +  +  P      +++ GSL MAL Y+ R  +S   +    RIL +  S 
Sbjct: 124 NLKKLIDKANEQDEDTPS-----TMIGGSLCMALAYVNRAMQSASKNRLNARILIISVSS 178

Query: 70  DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYL 129
           D   QY+ IMN IF AQ+  +PID C +G  ++ FLQQAS  T G++    +   L QYL
Sbjct: 179 DITFQYIPIMNCIFGAQKKKIPIDVCKIGG-DTVFLQQASDATKGIYLHLDKPKSLLQYL 237

Query: 130 LTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS 175
           L IF  D   R  L LP  + VDFRA+CFCHK  +D+G++CSVCLS
Sbjct: 238 LMIFLPDQKVRQHLVLPFQLNVDFRAACFCHKKILDIGFVCSVCLS 283


>gi|225712364|gb|ACO12028.1| General transcription factor IIH subunit 3 [Lepeophtheirus
           salmonis]
 gi|290562788|gb|ADD38789.1| General transcription factor IIH subunit 3 [Lepeophtheirus
           salmonis]
          Length = 305

 Score =  127 bits (320), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 71/158 (44%), Positives = 94/158 (59%), Gaps = 2/158 (1%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
           S LSGSL+MAL YI R  R   +    RIL +  S D   QY+  MN  F+AQ+  V +D
Sbjct: 124 SPLSGSLAMALSYINRR-RKENMDLSARILVITASGDTASQYMNYMNVFFTAQKLDVLLD 182

Query: 94  SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 153
           +C L   +S  LQQ + ITGG++        L Q+LL IF      R  L LP    VDF
Sbjct: 183 TCMLQI-DSPLLQQGADITGGMYFNVPDNAALLQFLLWIFLPSAEMRPQLGLPSANKVDF 241

Query: 154 RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           RA+CFCH+  +D+G++CSVCLSI+CK    C+TC ++F
Sbjct: 242 RAACFCHRQLVDIGFVCSVCLSIFCKFSPICTTCQTIF 279


>gi|428174859|gb|EKX43752.1| hypothetical protein GUITHDRAFT_87651 [Guillardia theta CCMP2712]
          Length = 343

 Score =  127 bits (318), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 73/206 (35%), Positives = 113/206 (54%), Gaps = 24/206 (11%)

Query: 24  KQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIF 83
           ++  +  + CSLLS + ++A C + R+    LL   PR+L L  S + P QY+ IMN IF
Sbjct: 132 QENSDSSMKCSLLSSAFALAACTVNRISSHKLLRMLPRVLVLTASTELPSQYIPIMNCIF 191

Query: 84  SAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ------QLDGLFQYLLTIFGTDL 137
           S Q++ V ID+  +   + +FLQQA+ IT GV+ +P+          L  +++++F  D 
Sbjct: 192 SMQKANVLIDALVVSDGDCSFLQQAADITNGVYLRPELSKVGSSPGALLSWMISVFLGDK 251

Query: 138 HSR----------NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK-HLKKCST 186
           H+R            L+LP+   VD+R+SCF     +D+G++CSVCLSI+ K     C T
Sbjct: 252 HTRFLNADSQSRSTLLRLPQAQDVDYRSSCFKTGQLVDVGFVCSVCLSIFAKCDFFICPT 311

Query: 187 CGSVFGQAQTQSDEP--SATNRKRKT 210
           C S     +   D P  +A N+KRK 
Sbjct: 312 CDS-----KIALDRPAGAAVNKKRKV 332


>gi|429851889|gb|ELA27048.1| transcription factor tfiih subunit [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 362

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 71/189 (37%), Positives = 99/189 (52%), Gaps = 21/189 (11%)

Query: 21  QLGKQEPEGRIACSL--LSGSLSMALCYIQ----------------RVFRSGLLHPQPRI 62
           ++  Q  E  ++C+   LSG+L++AL YI                 R   S L     RI
Sbjct: 131 EVINQTTEADLSCTTTQLSGALTLALSYINKASMALNETNKTPEATRATSSALTRLHARI 190

Query: 63  LCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL 122
             L  S   P QY+  MNA+F+A  S +PID+  L   ++ FLQQASYIT G   +    
Sbjct: 191 FILSVSDSEPVQYIPTMNAVFAAAHSQIPIDTLALSG-DATFLQQASYITDGTFMQAASP 249

Query: 123 DGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--H 180
            GL  YL+  +  D  +R+ L  P    VDFRA+CFCH   +D G++CS+CLSI+C    
Sbjct: 250 RGLLSYLMFAYSADAEARSSLIPPTHHTVDFRAACFCHGRVVDTGFVCSICLSIFCDVPE 309

Query: 181 LKKCSTCGS 189
             +C TCG+
Sbjct: 310 NSECLTCGT 318


>gi|302830812|ref|XP_002946972.1| hypothetical protein VOLCADRAFT_79384 [Volvox carteri f.
           nagariensis]
 gi|300268016|gb|EFJ52198.1| hypothetical protein VOLCADRAFT_79384 [Volvox carteri f.
           nagariensis]
          Length = 141

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 61/118 (51%), Positives = 84/118 (71%)

Query: 74  QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIF 133
           QY+ +MNAIFSAQR+ V +D+  L  ++S+FLQQA+++TGG++ KP     L   LL  F
Sbjct: 2   QYIPVMNAIFSAQRAEVLLDAVVLATEDSSFLQQAAHLTGGLYFKPAGAGALLGLLLNYF 61

Query: 134 GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
             D  +R  L + + +GVDFRASCFCHK  I+ GY+CSVCLSI+C+  + CSTCG+ F
Sbjct: 62  VCDTSTRKQLDVAQELGVDFRASCFCHKYVIETGYVCSVCLSIFCQPSRACSTCGTAF 119


>gi|169775241|ref|XP_001822088.1| transcription factor TFIIH complex subunit [Aspergillus oryzae
           RIB40]
 gi|83769951|dbj|BAE60086.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|391872974|gb|EIT82049.1| RNA polymerase II transcription initiation protein [Aspergillus
           oryzae 3.042]
          Length = 376

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 61/130 (46%), Positives = 84/130 (64%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ + G+ D   QY+ IMN+IF+ QR  +PID C L   ++ FLQQAS  T G++    +
Sbjct: 207 IVSVSGASDSAHQYIPIMNSIFACQRLHIPIDVCKLSG-DAVFLQQASDATKGIYMSLTE 265

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ L
Sbjct: 266 PRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDIGFVCSICLSIFCEPL 325

Query: 182 KK--CSTCGS 189
               C TCG+
Sbjct: 326 DNGTCLTCGT 335


>gi|303274264|ref|XP_003056454.1| RNA polymerase II transcription factor B subunit 4 [Micromonas
           pusilla CCMP1545]
 gi|226462538|gb|EEH59830.1| RNA polymerase II transcription factor B subunit 4 [Micromonas
           pusilla CCMP1545]
          Length = 296

 Score =  126 bits (316), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 76/187 (40%), Positives = 105/187 (56%), Gaps = 10/187 (5%)

Query: 4   LCATLLQNLEEF---MNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP 60
           LCA ++ NL      M  D  L  +          L+  +SMALC + R  R    H  P
Sbjct: 112 LCAQVISNLATLAMRMRSDFDLSTKSNMP------LTTVISMALCCVNRA-RVYKPHCSP 164

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RILC+ GS D   QY + MNAIFSA+ S V ID+C LG  +S FLQQA+ ITGG +   Q
Sbjct: 165 RILCILGSQDSSGQYFSGMNAIFSAKNSGVRIDACLLGYTDSFFLQQAAQITGGSYCMLQ 224

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
           Q +    YLL+        R  LQ P+    +++A+CFCH+  + +G++CSVCL+I+C+ 
Sbjct: 225 QGEKALYYLLSDLPGSESIRAKLQSPRKFCSEYQATCFCHEVNLQVGFVCSVCLTIFCER 284

Query: 181 LKKCSTC 187
            +KC  C
Sbjct: 285 CRKCLAC 291


>gi|238496129|ref|XP_002379300.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
           flavus NRRL3357]
 gi|220694180|gb|EED50524.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
           flavus NRRL3357]
          Length = 367

 Score =  126 bits (316), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 61/130 (46%), Positives = 84/130 (64%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ + G+ D   QY+ IMN+IF+ QR  +PID C L   ++ FLQQAS  T G++    +
Sbjct: 198 IVSVSGASDSAHQYIPIMNSIFACQRLHIPIDVCKLSG-DAVFLQQASDATKGIYMSLTE 256

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ L
Sbjct: 257 PRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDIGFVCSICLSIFCEPL 316

Query: 182 KK--CSTCGS 189
               C TCG+
Sbjct: 317 DNGTCLTCGT 326


>gi|121713686|ref|XP_001274454.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
           clavatus NRRL 1]
 gi|119402607|gb|EAW13028.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
           clavatus NRRL 1]
          Length = 386

 Score =  125 bits (315), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 63/130 (48%), Positives = 83/130 (63%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ +  S D   QY+ IMNAIF+ QR  VPID C L   ++ FLQQAS  T GV+    +
Sbjct: 216 IISVSSSTDSAHQYIPIMNAIFACQRLHVPIDVCKLSG-DAVFLQQASDATKGVYMSLAE 274

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 275 PRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDIGFVCSICLSIFCEPP 334

Query: 182 KK--CSTCGS 189
           +   C TCG+
Sbjct: 335 ENGGCLTCGT 344


>gi|443727474|gb|ELU14215.1| hypothetical protein CAPTEDRAFT_169977 [Capitella teleta]
          Length = 299

 Score =  125 bits (315), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 68/151 (45%), Positives = 94/151 (62%), Gaps = 6/151 (3%)

Query: 45  CYIQRVFRSGLLHP---QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQN 101
           CY+ ++   GL      + R+L +QGS D P QY+ +MN IF+AQ+  V ID+C +  ++
Sbjct: 136 CYVNKI-NKGLPDGDAMRSRVLVVQGSEDNPLQYMDLMNIIFTAQKQNVIIDACIMD-ED 193

Query: 102 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH-SRNFLQLPKPVGVDFRASCFCH 160
           S  LQQ   ITGG + K  Q   L QYLL +F  +   +R  L LP  V VD+RA+CFCH
Sbjct: 194 SPILQQVCDITGGFYLKTPQPSALLQYLLWVFLLEPGGAREKLNLPAKVHVDYRAACFCH 253

Query: 161 KNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           +N I+ GY+CSVCLS++C     CSTC + F
Sbjct: 254 RNLINTGYVCSVCLSVFCSFSPICSTCQTTF 284


>gi|255953875|ref|XP_002567690.1| Pc21g06450 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211589401|emb|CAP95542.1| Pc21g06450 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 374

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 65/135 (48%), Positives = 85/135 (62%), Gaps = 5/135 (3%)

Query: 59  QPRILCLQ--GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
           Q RIL +   GS D   QY+ +MN+IF+ QR  +PID C L   ++ FLQQAS  T GV+
Sbjct: 201 QSRILVVSVSGSTDAAHQYIPVMNSIFACQRLSIPIDVCKLSG-DAVFLQQASDATKGVY 259

Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 176
               +  GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI
Sbjct: 260 MALAEPRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDVGFVCSICLSI 319

Query: 177 YCKHLK--KCSTCGS 189
           +C+      C TCGS
Sbjct: 320 FCEPPPGGDCMTCGS 334


>gi|431912136|gb|ELK14274.1| General transcription factor IIH subunit 3 [Pteropus alecto]
          Length = 292

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 60/123 (48%), Positives = 84/123 (68%), Gaps = 1/123 (0%)

Query: 65  LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
           ++ + D   QY+  MN IF+AQ+  + ID+C LG+ +S  LQQA  ITGG++ K  Q+  
Sbjct: 150 IKAAEDSALQYMNFMNVIFAAQKQNILIDACVLGS-DSGLLQQACDITGGLYLKVPQMPS 208

Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 184
           L QYLL +F  D   R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C
Sbjct: 209 LLQYLLWVFLPDQDQRSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPIC 268

Query: 185 STC 187
           +TC
Sbjct: 269 TTC 271


>gi|310794652|gb|EFQ30113.1| transcription factor tfb4 [Glomerella graminicola M1.001]
          Length = 365

 Score =  125 bits (313), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 102/194 (52%), Gaps = 21/194 (10%)

Query: 16  MNKDEQLGKQEPEGRIACSL--LSGSLSMALCYIQ----------------RVFRSGLLH 57
           M+   +L     E  +AC+   LSG+L++AL +I                 R   S L  
Sbjct: 128 MSSMRKLVDATTEADLACTTTQLSGALTLALAHINKTSLSLNETNKAPDVARSTSSALSR 187

Query: 58  PQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHH 117
            + RI  L  S   P QY++ MNA+F+A  S +PID+  L   ++ FLQQASYIT G   
Sbjct: 188 LRARIFVLSVSDSEPVQYISTMNAVFAAAHSQIPIDTLALSG-DATFLQQASYITDGTFM 246

Query: 118 KPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
           +     GL  YL+  +  D  +R+ L  P    VDFRA+CFCH   +D G++CS+CLSI+
Sbjct: 247 QAASPRGLLSYLMFAYSADAEARSSLIPPTHHTVDFRAACFCHGRVVDTGFVCSICLSIF 306

Query: 178 CK--HLKKCSTCGS 189
           C      +C TCG+
Sbjct: 307 CDVPDGSECLTCGT 320


>gi|425772016|gb|EKV10443.1| Transcription factor TFIIH subunit Tfb4, putative [Penicillium
           digitatum Pd1]
 gi|425777277|gb|EKV15458.1| Transcription factor TFIIH subunit Tfb4, putative [Penicillium
           digitatum PHI26]
          Length = 387

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 65/135 (48%), Positives = 84/135 (62%), Gaps = 5/135 (3%)

Query: 59  QPRILCLQ--GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
           Q RIL +   GS D   QY+ IMN+IF+ QR  +PID C L   ++ FLQQAS  T GV+
Sbjct: 214 QSRILVISVSGSTDAAHQYIPIMNSIFACQRLNIPIDVCKLSG-DAVFLQQASDATKGVY 272

Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 176
               +  GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI
Sbjct: 273 MALAEPRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDVGFVCSICLSI 332

Query: 177 YCKHL--KKCSTCGS 189
           +C       C TCG+
Sbjct: 333 FCGPPPGSDCMTCGT 347


>gi|145254849|ref|XP_001398778.1| transcription factor TFIIH complex subunit [Aspergillus niger CBS
           513.88]
 gi|134084362|emb|CAK48702.1| unnamed protein product [Aspergillus niger]
          Length = 379

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 63/130 (48%), Positives = 82/130 (63%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           IL +  S     QY+ IMNAIF+ QR  +PID C +   ++ FLQQAS  T GV+    +
Sbjct: 209 ILSVSSSTGSAHQYIPIMNAIFACQRLHIPIDVCKVSG-DAVFLQQASDATKGVYMALDE 267

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ L
Sbjct: 268 PKGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEPL 327

Query: 182 KK--CSTCGS 189
               C TCGS
Sbjct: 328 GDGVCLTCGS 337


>gi|350630603|gb|EHA18975.1| hypothetical protein ASPNIDRAFT_42792 [Aspergillus niger ATCC 1015]
          Length = 380

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 63/130 (48%), Positives = 82/130 (63%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           IL +  S     QY+ IMNAIF+ QR  +PID C +   ++ FLQQAS  T GV+    +
Sbjct: 210 ILSVSSSTGSAHQYIPIMNAIFACQRLHIPIDVCKVSG-DAVFLQQASDATKGVYMALDE 268

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ L
Sbjct: 269 PKGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEPL 328

Query: 182 KK--CSTCGS 189
               C TCGS
Sbjct: 329 GDGVCLTCGS 338


>gi|70997489|ref|XP_753492.1| transcription factor TFIIH subunit Tfb4 [Aspergillus fumigatus
           Af293]
 gi|66851128|gb|EAL91454.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
           fumigatus Af293]
 gi|159126779|gb|EDP51895.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
           fumigatus A1163]
          Length = 387

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 61/130 (46%), Positives = 83/130 (63%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ + GS D   QY+ IMN IF+ QR  +PID C L   ++ FLQQAS  T GV+    +
Sbjct: 215 IISVSGSTDSAHQYIPIMNCIFACQRLHIPIDVCKLSG-DAVFLQQASDATKGVYMSLSE 273

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             GL QYL+  F  D  SR  L +P  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 274 PRGLLQYLMMAFLPDQRSRRHLVIPTRVDVDFRAACFCHRRVVDIGFVCSICLSIFCEPP 333

Query: 182 KK--CSTCGS 189
           +   C TCG+
Sbjct: 334 ENGDCLTCGT 343


>gi|119479015|ref|XP_001259536.1| transcription factor TFIIH subunit Tfb4, putative [Neosartorya
           fischeri NRRL 181]
 gi|119407690|gb|EAW17639.1| transcription factor TFIIH subunit Tfb4, putative [Neosartorya
           fischeri NRRL 181]
          Length = 385

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 61/130 (46%), Positives = 83/130 (63%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ + GS D   QY+ IMN IF+ QR  +PID C L   ++ FLQQAS  T GV+    +
Sbjct: 215 IISVSGSTDSAHQYIPIMNCIFACQRLHIPIDVCKLSG-DAVFLQQASDATKGVYMSLSE 273

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             GL QYL+  F  D  SR  L +P  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 274 PRGLLQYLMMAFLPDQRSRRHLVIPTRVDVDFRAACFCHRRVVDIGFVCSICLSIFCEPP 333

Query: 182 KK--CSTCGS 189
           +   C TCG+
Sbjct: 334 ENGDCLTCGT 343


>gi|358366716|dbj|GAA83336.1| transcription factor TFIIH complex subunit [Aspergillus kawachii
           IFO 4308]
          Length = 386

 Score =  123 bits (309), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 62/130 (47%), Positives = 82/130 (63%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           IL +  S     QY+ IMNAIF+ QR  +PID C +   ++ FLQQAS  T GV+    +
Sbjct: 216 ILSVSSSTGSAHQYIPIMNAIFACQRLHIPIDVCKVSG-DAVFLQQASDATKGVYMALDE 274

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ L
Sbjct: 275 PKGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEPL 334

Query: 182 KK--CSTCGS 189
               C TCG+
Sbjct: 335 GDGVCLTCGT 344


>gi|115387571|ref|XP_001211291.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114195375|gb|EAU37075.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 380

 Score =  123 bits (308), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 61/130 (46%), Positives = 82/130 (63%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ + GS D   QY+ +MN IF+ QR  +PID C L   ++ FLQQAS  T GV     +
Sbjct: 210 IVSVSGSSDSAHQYIPMMNGIFACQRLHIPIDVCKLSG-DAVFLQQASDATKGVFMSLAE 268

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 269 PRGLLQYLMMAFLPDQRSRKHLVLPTRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEPP 328

Query: 182 KK--CSTCGS 189
           +   C TCG+
Sbjct: 329 ENGDCLTCGT 338


>gi|320582712|gb|EFW96929.1| Subunit of TFIIH complex [Ogataea parapolymorpha DL-1]
          Length = 341

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 72/174 (41%), Positives = 100/174 (57%), Gaps = 9/174 (5%)

Query: 23  GKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQGSPDGPEQYVAIMNA 81
           GKQ  +G +A     G++SMALC+I ++ +S  L+  + R+L +  S D    YV++MN 
Sbjct: 140 GKQHIKGTLA-----GAMSMALCHINKIQQSDELNALKSRLLVVSISDDSTLPYVSVMNT 194

Query: 82  IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 141
           IF++Q+  + +D C LG  +S FLQQA+  T GV+    Q +GL QYL T    D   R 
Sbjct: 195 IFASQKMKISVDVCKLGP-SSTFLQQAADATNGVYIYITQPEGLIQYLTTALFIDPMLRP 253

Query: 142 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQ 193
            + LP    +DFRASCF     ID+GY+CSVCL I       +KC TC S F  
Sbjct: 254 IIVLPTDESIDFRASCFITNKVIDVGYVCSVCLCILSVIPEDEKCPTCHSKFDH 307


>gi|400593294|gb|EJP61268.1| transcription factor tfb4 [Beauveria bassiana ARSEF 2860]
          Length = 373

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 77/204 (37%), Positives = 107/204 (52%), Gaps = 26/204 (12%)

Query: 6   ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR------------- 52
           AT+L +L++ M +       E +     + LSG+L++ALC+  +  R             
Sbjct: 134 ATVLASLKQLMAE-----TTERDLDSTTTQLSGALTLALCHNNKAGRALSASDTALLDAA 188

Query: 53  --SGLLHPQP---RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQ 107
             SG   P P   RIL L  S     QY+ IMNA+F+A  S V ID+  L A ++ FLQQ
Sbjct: 189 QASGATAPPPMKGRILVLSVSDSESAQYIPIMNAVFAAAHSHVAIDTLAL-AGDATFLQQ 247

Query: 108 ASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMG 167
           A + TGGV  +     GL  YL+     D  +R  L  P    VDFRA+CFCH N +D G
Sbjct: 248 ACFGTGGVFLRAANPQGLLAYLMFGMIADSEAREALVAPAHDTVDFRAACFCHGNVVDTG 307

Query: 168 YICSVCLSIYCK--HLKKCSTCGS 189
           ++CS+CLSI+C      +C TCG+
Sbjct: 308 FVCSICLSIFCDLPENAECLTCGT 331


>gi|242764012|ref|XP_002340688.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218723884|gb|EED23301.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 380

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 63/135 (46%), Positives = 85/135 (62%), Gaps = 5/135 (3%)

Query: 59  QPRILCLQ--GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
           Q RIL +    S D   QY+ IMN+IF+ QR  +PID C L   ++ FLQQAS  T GV+
Sbjct: 206 QSRILIVSVSSSTDSAHQYIPIMNSIFACQRLHIPIDVCKLSG-DAVFLQQASDATRGVY 264

Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 176
               +  GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +++G++CS+CLSI
Sbjct: 265 MSLTEPRGLLQYLMMAFLPDQRSRKHLVLPSRVDVDFRAACFCHRRVVNIGFVCSICLSI 324

Query: 177 YCKHLKK--CSTCGS 189
           +C+  +   C TCG+
Sbjct: 325 FCEPPENGDCLTCGT 339


>gi|226468218|emb|CAX69786.1| putative general transcription factor IIH [Schistosoma japonicum]
          Length = 317

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 62/153 (40%), Positives = 91/153 (59%), Gaps = 22/153 (14%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL----------------GAQNSAF 104
           RIL ++ S D   QY+++MN++F+AQ+  VPID+C L                   +S+ 
Sbjct: 157 RILIIKASDDSTSQYLSLMNSVFTAQKLHVPIDTCVLPLPRQLDNLNGRRSISSPGHSSL 216

Query: 105 LQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKP------VGVDFRASCF 158
           LQQA+ +TGG++ +  ++ GL QYLL++F      R  L LP         GVDFRA+CF
Sbjct: 217 LQQAADLTGGIYLQIPRVSGLLQYLLSVFLPSSKMRTSLILPDSRSSGLSFGVDFRAACF 276

Query: 159 CHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           CHK  ID+GY+CS+CLS++C+    C TC + F
Sbjct: 277 CHKRLIDIGYVCSICLSVFCEFNPICPTCNTPF 309


>gi|254567625|ref|XP_002490923.1| Subunit of TFIIH complex, involved in transcription initiation,
           similar to 34 kDa subunit of human T [Komagataella
           pastoris GS115]
 gi|238030720|emb|CAY68643.1| Subunit of TFIIH complex, involved in transcription initiation,
           similar to 34 kDa subunit of human T [Komagataella
           pastoris GS115]
          Length = 328

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/179 (41%), Positives = 99/179 (55%), Gaps = 18/179 (10%)

Query: 25  QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ-------PRILCLQGSPDGPEQYVA 77
           QE +G IA     G++SMAL YI R+     + P+        ++L +  S D  +QYV 
Sbjct: 132 QELKGSIA-----GAMSMALSYINRLTN---IEPESTGNTLRAKMLVISISDDSTQQYVP 183

Query: 78  IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
            MN IFSAQ+  V ID C +G  +S+FLQQAS +T GV+   + + GL QYL T    D 
Sbjct: 184 FMNCIFSAQKMKVSIDVCKMGP-DSSFLQQASDVTNGVYMLIKNVHGLIQYLTTALFIDP 242

Query: 138 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGSVFGQA 194
             R  + LP    +DFRASCF     ID+GY+CSVCL I+    +   C TC S F  A
Sbjct: 243 SLRPIMVLPTNSDLDFRASCFVTNKVIDIGYVCSVCLCIFSIIPRNNVCPTCQSKFDDA 301


>gi|326437202|gb|EGD82772.1| hypothetical protein PTSG_03422 [Salpingoeca sp. ATCC 50818]
          Length = 311

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 88/249 (35%), Positives = 118/249 (47%), Gaps = 71/249 (28%)

Query: 6   ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ------ 59
           AT+L N  +  N  EQ G++ P      S L+G+LS ALC+I R  R   L PQ      
Sbjct: 83  ATILANATKLFN--EQSGERHP------SDLAGALSKALCFINRTVRGTSLQPQHTTASS 134

Query: 60  -----------------------------------------PRILCLQGSPDGPEQYVAI 78
                                                    PRI  +  SPD   +Y A 
Sbjct: 135 SVTSRFIAGDKATPASSSSSSSSSAPKAAGRDRVSSGARVTPRIGVVSVSPDSSVEYFAA 194

Query: 79  MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGG----VHHKPQQLDGLFQYLLTIFG 134
           MN IF+AQ++ VPID C +G + +A LQQA+++T G    VHH  Q L  L    +T   
Sbjct: 195 MNCIFAAQKNDVPIDVCAMG-EEAAILQQAAHLTNGTYVKVHHPSQLLQSLMMGFVT--S 251

Query: 135 TDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQA 194
           +D H    L  P    ++F ASCFCH  T+D+GY+C+VCLS+YC+ L  C  CG+ F   
Sbjct: 252 SDAH---VLVRPVQRRINFDASCFCHGRTVDIGYVCTVCLSVYCERLASCRICGAEF--- 305

Query: 195 QTQSDEPSA 203
              SD+P A
Sbjct: 306 ---SDKPGA 311


>gi|346324613|gb|EGX94210.1| transcription factor TFIIH subunit Tfb4 [Cordyceps militaris CM01]
          Length = 366

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/204 (36%), Positives = 106/204 (51%), Gaps = 26/204 (12%)

Query: 6   ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR------------- 52
           AT+L +L++ M +       E +     + LSG+L++ALC+  +  R             
Sbjct: 127 ATVLGSLKKLMAE-----TTERDLDSTTTQLSGALTLALCHTNKASRALGASDTAIHEAA 181

Query: 53  --SGLLHPQP---RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQ 107
             SG   P P   RIL L  S     QY+  MNA+F+A    V ID+  L A ++ FLQQ
Sbjct: 182 QTSGTTPPSPMKGRILVLSVSDSESSQYIPTMNAVFAAAHGRVAIDALAL-AGDATFLQQ 240

Query: 108 ASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMG 167
           A + TGGV  +     GL  YL+     D  +R  L  P    VDFRA+CFCH N +D G
Sbjct: 241 ACFNTGGVFLRAANPQGLLAYLMFGLVADTEAREALVAPTHDTVDFRAACFCHGNVVDTG 300

Query: 168 YICSVCLSIYCK--HLKKCSTCGS 189
           ++CS+CLSI+C+     +C TCG+
Sbjct: 301 FVCSICLSIFCELPDNAECFTCGT 324


>gi|226467818|emb|CAX69785.1| putative general transcription factor IIH [Schistosoma japonicum]
          Length = 317

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 61/153 (39%), Positives = 91/153 (59%), Gaps = 22/153 (14%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL----------------GAQNSAF 104
           RIL ++ S D   QY+++MN++F+AQ+  VPID+C L                   +S+ 
Sbjct: 157 RILIIKASDDSTSQYLSLMNSVFTAQKLHVPIDTCVLPLPRQLDNLNGRRSISSPGHSSL 216

Query: 105 LQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKP------VGVDFRASCF 158
           LQQA+ +TGG++ +  ++ GL QYLL++F      R  L LP         GVDFRA+CF
Sbjct: 217 LQQAADLTGGIYLQIPRVSGLLQYLLSVFLPSSKMRTSLILPDSRSSGLSFGVDFRAACF 276

Query: 159 CHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           CH+  ID+GY+CS+CLS++C+    C TC + F
Sbjct: 277 CHRRLIDIGYVCSICLSVFCEFNPICPTCNTPF 309


>gi|212529242|ref|XP_002144778.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
           marneffei ATCC 18224]
 gi|210074176|gb|EEA28263.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
           marneffei ATCC 18224]
          Length = 380

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 59/130 (45%), Positives = 83/130 (63%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ +  S D   QY+ IMN+IF+ QR  +PID C L   ++ FLQQAS  T GV+    +
Sbjct: 211 IVSVSSSTDSAHQYIPIMNSIFACQRLHIPIDVCKLSG-DAVFLQQASDATRGVYMSLSE 269

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             GL QYL+  F  D  SR  L +P  V VDFRA+CFCH+  +++G++CS+CLSI+C+  
Sbjct: 270 PRGLLQYLMMAFLPDQRSRKHLIVPSRVDVDFRAACFCHRRVVNIGFVCSICLSIFCEPP 329

Query: 182 KK--CSTCGS 189
           +   C TCG+
Sbjct: 330 ENGDCLTCGT 339


>gi|255070041|ref|XP_002507102.1| RNA polymerase II transcription factor B subunit 4 [Micromonas sp.
           RCC299]
 gi|226522377|gb|ACO68360.1| RNA polymerase II transcription factor B subunit 4 [Micromonas sp.
           RCC299]
          Length = 291

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 71/159 (44%), Positives = 93/159 (58%), Gaps = 4/159 (2%)

Query: 36  LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 95
           L+  LSMALC+IQR+  +G      RI CL  SP    QY+ +MN IF+AQ++ V IDS 
Sbjct: 130 LAAGLSMALCHIQRL-GTGSFRGLRRIFCLLRSPVSQRQYIPMMNVIFAAQQAFVTIDSY 188

Query: 96  YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 155
            L   +S  L+QA+ +T G+H K Q+   L Q+LLT        R+ L   K  GVDF+ 
Sbjct: 189 SLCDMHSDILEQAANMTNGLHRKLQKDGELGQHLLTFSSWSQTCRHSLLPGKQCGVDFKT 248

Query: 156 SCFCHKNTIDMGYICSVCLSIYCKHLK---KCSTCGSVF 191
            CFCHK T+  G++CS CLSI C  LK    C TC + F
Sbjct: 249 FCFCHKKTLQRGFVCSACLSISCVALKTGGDCMTCAANF 287


>gi|452838611|gb|EME40551.1| hypothetical protein DOTSEDRAFT_136966 [Dothistroma septosporum
           NZE10]
          Length = 379

 Score =  120 bits (301), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 81/221 (36%), Positives = 111/221 (50%), Gaps = 48/221 (21%)

Query: 8   LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQ------------------- 48
           L  NL + M++        P      ++L+G+L+MAL YI                    
Sbjct: 128 LTHNLRDMMDRTSPAALSTPS-----TMLAGALTMALTYISKQSAALPTANSSAQFNYSD 182

Query: 49  ---------------RVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
                          R    GL   + RIL L  S D  +QY++IMNAIF+ QR  VPID
Sbjct: 183 PSAVAGGSDLTADGSRKKHHGL---RSRILILSVSGDLADQYISIMNAIFACQRMSVPID 239

Query: 94  SCYLGAQNSAFLQQASYITGGVHH--KPQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVG 150
              L A ++ FLQQA+  TGGV+     +   G  QYL+  +  D  +R  L +P +  G
Sbjct: 240 ILKL-AGDTVFLQQAADATGGVYVALDEKTRAGFLQYLMFAYLPDQTARAHLIMPGEGEG 298

Query: 151 VDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
           VDFRA+CFCH+  +D GY+CS+CLSI+C+  H   C TCGS
Sbjct: 299 VDFRAACFCHRKVVDTGYVCSICLSIFCEPLHDGTCLTCGS 339


>gi|398392135|ref|XP_003849527.1| hypothetical protein MYCGRDRAFT_75897 [Zymoseptoria tritici IPO323]
 gi|339469404|gb|EGP84503.1| hypothetical protein MYCGRDRAFT_75897 [Zymoseptoria tritici IPO323]
          Length = 386

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 63/140 (45%), Positives = 90/140 (64%), Gaps = 6/140 (4%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHH--K 118
           RIL L  S D  +QY+ IMN+IF+ QR  +PID   L A ++ FLQQA+  TGG++    
Sbjct: 211 RILILSVSGDLADQYIPIMNSIFACQRLSIPIDILKL-AGDTVFLQQAADATGGIYMALD 269

Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
            Q   G  QYL+  +  D  +RN L +P +  GVDFRA+CFCH+  +D+G++CS+CLSI+
Sbjct: 270 EQSRAGFLQYLMFAYLPDQTARNHLIMPGESEGVDFRAACFCHRRVVDIGFVCSICLSIF 329

Query: 178 CKHLK--KCSTCGSVFGQAQ 195
           C+ ++  +C TCGS    A 
Sbjct: 330 CEPMQDGQCLTCGSHLSMAN 349


>gi|315052780|ref|XP_003175764.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
           gypseum CBS 118893]
 gi|311341079|gb|EFR00282.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
           gypseum CBS 118893]
          Length = 413

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 86/139 (61%), Gaps = 3/139 (2%)

Query: 53  SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
           +G L  +  I+ +  + D  +QY+ IMN+IF+ QR  +PID C L A ++ FLQQA   T
Sbjct: 225 AGALQSRILIISVSSAADSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDAT 283

Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 172
            G++       G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+
Sbjct: 284 RGIYMSLDSPRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSI 343

Query: 173 CLSIYCKHLK--KCSTCGS 189
           CLSI+C+  +   C TCG+
Sbjct: 344 CLSIFCEPPEGANCLTCGT 362


>gi|67515713|ref|XP_657742.1| hypothetical protein AN0138.2 [Aspergillus nidulans FGSC A4]
 gi|40746160|gb|EAA65316.1| hypothetical protein AN0138.2 [Aspergillus nidulans FGSC A4]
          Length = 384

 Score =  117 bits (294), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 58/130 (44%), Positives = 81/130 (62%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ +  +     QY+ IMN IF+ QR  +PID C L   ++ FLQQAS  T G++    +
Sbjct: 216 IISVSSATGSAHQYIPIMNGIFACQRLHIPIDVCKLSG-DAVFLQQASDATKGIYMSLSE 274

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 275 PRGLLQYLMMAFLPDQRSRKHLILPTRVDVDFRAACFCHRRVVDIGFVCSICLSIFCEPP 334

Query: 182 KK--CSTCGS 189
           +   C TCG+
Sbjct: 335 ENGDCLTCGT 344


>gi|259489666|tpe|CBF90125.1| TPA: transcription factor TFIIH complex subunit (Eurofung)
           [Aspergillus nidulans FGSC A4]
          Length = 392

 Score =  117 bits (294), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 58/130 (44%), Positives = 81/130 (62%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ +  +     QY+ IMN IF+ QR  +PID C L   ++ FLQQAS  T G++    +
Sbjct: 224 IISVSSATGSAHQYIPIMNGIFACQRLHIPIDVCKLSG-DAVFLQQASDATKGIYMSLSE 282

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             GL QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 283 PRGLLQYLMMAFLPDQRSRKHLILPTRVDVDFRAACFCHRRVVDIGFVCSICLSIFCEPP 342

Query: 182 KK--CSTCGS 189
           +   C TCG+
Sbjct: 343 ENGDCLTCGT 352


>gi|406863689|gb|EKD16736.1| hypothetical protein MBM_05205 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 357

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 71/187 (37%), Positives = 101/187 (54%), Gaps = 24/187 (12%)

Query: 15  FMNKDEQLGKQEPEGRIACSL---LSGSLSMALCYIQRVFR--SGL-------------- 55
            +N   +L     E  +A +    ++G+L++AL YI +  +  SG+              
Sbjct: 118 LLNSLRELIATTTESDVATTTSTQMAGALTLALSYIHKATQQFSGIDTESKPATAPSTEN 177

Query: 56  ----LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYI 111
               L  Q RIL +  S D   QY+ IMN  F+AQR  +PID   L A ++ FLQQAS  
Sbjct: 178 ADTILGLQSRILVVSVSGDLAHQYIPIMNTTFAAQRLRIPIDILKL-AGDTVFLQQASDA 236

Query: 112 TGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICS 171
           T GV+ + +   GL QYL+  F  D  +R  L  P    VDFRA+CFCH+  +D+GY+CS
Sbjct: 237 TKGVYMQLRNPQGLLQYLMMAFLPDQVARKHLVPPTQEVVDFRAACFCHRKVVDIGYVCS 296

Query: 172 VCLSIYC 178
           +CLSI+C
Sbjct: 297 ICLSIFC 303


>gi|385301449|gb|EIF45638.1| subunit of TFIIH complex [Dekkera bruxellensis AWRI1499]
          Length = 219

 Score =  117 bits (293), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 79/185 (42%), Positives = 99/185 (53%), Gaps = 15/185 (8%)

Query: 9   LQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPRILCLQ 66
           LQNL+  + K E   K           LSG+LS AL YI R F++   H   + RI+C+ 
Sbjct: 3   LQNLKSLVGKGETHIK---------GTLSGALSQALSYINR-FQNSDEHSGMKARIVCIS 52

Query: 67  GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLF 126
            S D    YV+IMN+IF+AQ+  V +D C LG  NS FLQQAS  T GV+       GL 
Sbjct: 53  VSGDNTLPYVSIMNSIFAAQKQKVSVDVCKLGP-NSTFLQQASDSTNGVYIYIXHPLGLI 111

Query: 127 QYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKC 184
           QYL T    D   R  + LP    +DFRASCF     +D+GY+CSVCL I       + C
Sbjct: 112 QYLSTALFIDPMLRPMVVLPTNTNIDFRASCFITNKVVDIGYVCSVCLCILSVIPEDEIC 171

Query: 185 STCGS 189
            TC S
Sbjct: 172 PTCHS 176


>gi|384494657|gb|EIE85148.1| hypothetical protein RO3G_09858 [Rhizopus delemar RA 99-880]
          Length = 268

 Score =  117 bits (293), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 69/162 (42%), Positives = 89/162 (54%), Gaps = 43/162 (26%)

Query: 34  SLLSGSLSMALCYIQRVFR---SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 90
           S ++G+LSMALCYI R+ +    G +  +PRIL L  SPD   QY+ +MN IFSAQ++  
Sbjct: 131 STITGALSMALCYINRITKLDDEGYI--KPRILVLSVSPDSAFQYIPLMNCIFSAQKA-- 186

Query: 91  PIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVG 150
               CY   Q+ AFL                              D +SRNFL LP    
Sbjct: 187 ----CY---QHMAFL-----------------------------PDRYSRNFLNLPTQDQ 210

Query: 151 VDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFG 192
           VDFRA+CFCHK  +D+G++CSVCLSI+CK    CSTC + F 
Sbjct: 211 VDFRAACFCHKQIVDIGFVCSVCLSIFCKWSPVCSTCKTKFA 252


>gi|405968700|gb|EKC33746.1| Poly [ADP-ribose] polymerase 4 [Crassostrea gigas]
          Length = 3371

 Score =  117 bits (292), Expect = 4e-24,   Method: Composition-based stats.
 Identities = 67/158 (42%), Positives = 88/158 (55%), Gaps = 30/158 (18%)

Query: 34   SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
            SL++G+L+MALC  QR       H +  IL                     + R +V ID
Sbjct: 2964 SLVAGALAMALC--QRS------HEEAGIL---------------------SSRILVVID 2994

Query: 94   SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 153
            +C L  + S  LQQA  ITGG++ K  Q+ GL QYL+ +F  D   R  L LP  V VD+
Sbjct: 2995 ACVLDNE-SGLLQQACDITGGIYLKIPQMKGLLQYLMWVFLPDPLERPKLTLPPKVQVDY 3053

Query: 154  RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            RA+CFCH+N ID+GY+CSVCLSI+C     C TC + F
Sbjct: 3054 RAACFCHRNLIDIGYVCSVCLSIFCAFSPICGTCQTTF 3091


>gi|388851454|emb|CCF54856.1| related to TFIIH basal transcription factor complex p34 subunit
           [Ustilago hordei]
          Length = 423

 Score =  116 bits (291), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 60/149 (40%), Positives = 85/149 (57%), Gaps = 16/149 (10%)

Query: 59  QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
           + RIL L  +PD   QY+ +MN IF+AQ+  + ID C L   ++ FLQQASY+T G + +
Sbjct: 250 KSRILILSVTPDASTQYIPMMNCIFAAQKKGITIDVCKLFGSDTVFLQQASYLTSGTYFR 309

Query: 119 ------PQQLDG----------LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKN 162
                   +++G          L Q LLT +      R  + LP    +DFRA+CFCH+ 
Sbjct: 310 LSDDSDALKVNGKEGITDMRSSLVQTLLTTYLPSRSMRGVMNLPTLEEIDFRAACFCHRK 369

Query: 163 TIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            +D+GYICSVCLS++C+    C TC S F
Sbjct: 370 IVDIGYICSVCLSLFCQPRPFCLTCRSKF 398


>gi|295670289|ref|XP_002795692.1| RNA polymerase II transcription factor B subunit 4
           [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226284777|gb|EEH40343.1| RNA polymerase II transcription factor B subunit 4
           [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 422

 Score =  116 bits (291), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 61/138 (44%), Positives = 86/138 (62%), Gaps = 5/138 (3%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ- 120
           I+ L  +    + Y+ IMN+IF+ QR  +PID   L A ++AFLQQAS  TGG++     
Sbjct: 247 IISLSNTTHSAQHYIPIMNSIFACQRLHIPIDIIKL-AGDAAFLQQASDATGGIYIPINT 305

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
              G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ 
Sbjct: 306 HPAGFLQYLMLGFLPDQRARAHLILPSRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEP 365

Query: 181 LK---KCSTCGSVFGQAQ 195
           L    +C TCG+  G A+
Sbjct: 366 LGGEGECLTCGTRLGMAE 383


>gi|326480715|gb|EGE04725.1| TFIIH basal transcription factor complex p34 subunit [Trichophyton
           equinum CBS 127.97]
          Length = 415

 Score =  116 bits (291), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 57/130 (43%), Positives = 82/130 (63%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ +  + D  +QY+ IMN+IF+ QR  +PID C L A ++ FLQQA   T G++     
Sbjct: 236 IISVSSAADSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 294

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 295 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 354

Query: 182 K--KCSTCGS 189
           +   C TCG+
Sbjct: 355 EGANCLTCGT 364


>gi|326473589|gb|EGD97598.1| transcription factor TFIIH subunit Tfb4 [Trichophyton tonsurans CBS
           112818]
          Length = 415

 Score =  116 bits (291), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 57/130 (43%), Positives = 82/130 (63%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ +  + D  +QY+ IMN+IF+ QR  +PID C L A ++ FLQQA   T G++     
Sbjct: 236 IISVSSAADSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 294

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 295 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 354

Query: 182 K--KCSTCGS 189
           +   C TCG+
Sbjct: 355 EGANCLTCGT 364


>gi|296815720|ref|XP_002848197.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
           otae CBS 113480]
 gi|238841222|gb|EEQ30884.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
           otae CBS 113480]
          Length = 413

 Score =  116 bits (290), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 59/138 (42%), Positives = 84/138 (60%), Gaps = 3/138 (2%)

Query: 54  GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 113
           G L  +  I+ +  + D  +QY+ IMN+IF+ QR  +PID C L A ++ FLQQA   T 
Sbjct: 226 GSLQSRILIVSVSSAADSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATR 284

Query: 114 GVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC 173
           G++       G  QYL+  F  D   R  L LP  V VDFRA+CFCH+  +D+G++CS+C
Sbjct: 285 GIYMSVDSPRGFLQYLMLAFLPDQRVRRNLVLPTRVDVDFRAACFCHRKVVDVGFVCSIC 344

Query: 174 LSIYCKHLK--KCSTCGS 189
           LSI+C+  +   C TCG+
Sbjct: 345 LSIFCEPPEGANCLTCGT 362


>gi|393217350|gb|EJD02839.1| Tfb4-domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 432

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 58/138 (42%), Positives = 86/138 (62%), Gaps = 1/138 (0%)

Query: 55  LLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR-SMVPIDSCYLGAQNSAFLQQASYITG 113
           ++ P+PRI+ L  SPD    Y+ +MN+IFSAQ    V ID C + A ++ FLQQA+++TG
Sbjct: 198 VVSPEPRIVILSVSPDLSTSYIPLMNSIFSAQNLCKVAIDVCKIYAPDAVFLQQAAHLTG 257

Query: 114 GVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC 173
           G +   ++ D L Q+LL  F      R  + +P    +DFR +CF HK  +D+G++CSV 
Sbjct: 258 GSYIYLERRDALLQHLLMTFLPTPSIRQPIAVPTQDKIDFRVACFFHKTIVDIGFVCSVR 317

Query: 174 LSIYCKHLKKCSTCGSVF 191
           LSI C+ +  CSTC + F
Sbjct: 318 LSILCQLVPVCSTCRTKF 335


>gi|444724893|gb|ELW65479.1| Tectonic-2 [Tupaia chinensis]
          Length = 805

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 55/113 (48%), Positives = 78/113 (69%), Gaps = 1/113 (0%)

Query: 65  LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
           ++ + D   QY+  MN IF+AQ+  + ID+C L + +S  LQQA  ITGG++ K  Q+  
Sbjct: 10  IKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPS 68

Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
           L QYLL +F  D   R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+
Sbjct: 69  LLQYLLWVFLPDQDQRSQLTLPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIF 121


>gi|302656245|ref|XP_003019878.1| hypothetical protein TRV_06076 [Trichophyton verrucosum HKI 0517]
 gi|291183651|gb|EFE39254.1| hypothetical protein TRV_06076 [Trichophyton verrucosum HKI 0517]
          Length = 517

 Score =  115 bits (287), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 56/130 (43%), Positives = 82/130 (63%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ +  + +  +QY+ IMN+IF+ QR  +PID C L A ++ FLQQA   T G++     
Sbjct: 338 IISVSSAANSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 396

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 397 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 456

Query: 182 K--KCSTCGS 189
           +   C TCG+
Sbjct: 457 EGANCLTCGT 466


>gi|358389021|gb|EHK26614.1| hypothetical protein TRIVIDRAFT_69532 [Trichoderma virens Gv29-8]
          Length = 363

 Score =  115 bits (287), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 66/176 (37%), Positives = 92/176 (52%), Gaps = 20/176 (11%)

Query: 36  LSGSLSMALCYIQRVFRS--------------GLLHPQP---RILCLQGSPDGPEQYVAI 78
           LSG+L++ALC I +  +S                  P P   RI+ +  S   P QY+  
Sbjct: 150 LSGALTLALCRINKTAQSLSSSDTTISNAAPVNSTAPPPVKGRIVVISVSDSEPSQYIPT 209

Query: 79  MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
           MNA+F+A  + V ID+  L A +S FLQQA + T G+  K     GL  YL+     D  
Sbjct: 210 MNAVFAAAHNQVAIDTIAL-AGDSTFLQQACFNTNGIFLKASNPQGLLTYLMFGLIPDTE 268

Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGSVFG 192
           +R  L  P    VDFR +CFCH   +D G++CSVCLSI+C+     +C TCG++  
Sbjct: 269 ARGSLITPTHDTVDFRTACFCHGKVVDTGFVCSVCLSIFCEPPENAECLTCGTILA 324


>gi|71004806|ref|XP_757069.1| hypothetical protein UM00922.1 [Ustilago maydis 521]
 gi|46096873|gb|EAK82106.1| hypothetical protein UM00922.1 [Ustilago maydis 521]
          Length = 422

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 58/148 (39%), Positives = 80/148 (54%), Gaps = 15/148 (10%)

Query: 59  QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
           + RIL L  + D   QY+ +MN IF+AQ+  + ID C L  +++ FLQQASY+T G + +
Sbjct: 251 KSRILILSVTQDASTQYIPMMNCIFAAQKKGITIDVCKLFGKDTVFLQQASYLTSGTYFR 310

Query: 119 ---------------PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 163
                                L Q LLT +      R  + LP    +DFRA+CFCH+  
Sbjct: 311 LADDEDQSKSDTASATDMRSSLVQTLLTTYLPSRSMRGVMNLPSLEKIDFRAACFCHRKI 370

Query: 164 IDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           +D+GYICSVCLS++C     C TC S F
Sbjct: 371 VDIGYICSVCLSLFCSPKPSCLTCRSKF 398


>gi|327299656|ref|XP_003234521.1| transcription factor TFIIH subunit Tfb4 [Trichophyton rubrum CBS
           118892]
 gi|326463415|gb|EGD88868.1| transcription factor TFIIH subunit Tfb4 [Trichophyton rubrum CBS
           118892]
          Length = 413

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 56/130 (43%), Positives = 82/130 (63%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ +  + +  +QY+ IMN+IF+ QR  +PID C L A ++ FLQQA   T G++     
Sbjct: 234 IISVSSAANSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 292

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 293 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 352

Query: 182 K--KCSTCGS 189
           +   C TCG+
Sbjct: 353 EGANCLTCGT 362


>gi|452978955|gb|EME78718.1| hypothetical protein MYCFIDRAFT_124295, partial [Pseudocercospora
           fijiensis CIRAD86]
          Length = 339

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 62/136 (45%), Positives = 88/136 (64%), Gaps = 10/136 (7%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH---- 116
           RIL L  S D  +QY+ IMN+IF+ QR  +PID   L A ++ FLQQA+  TGG++    
Sbjct: 198 RILILSVSGDLADQYIPIMNSIFACQRLSIPIDILKL-AGDTVFLQQAADATGGIYMALD 256

Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVGVDFRASCFCHKNTIDMGYICSVCLS 175
           H+ +   G  QYL+  +  D  +R  L +P +  GVDFRA+CFCH+  +D+G++CS+CLS
Sbjct: 257 HRTRA--GFLQYLMFAYLPDEAARKHLIVPGEGEGVDFRAACFCHRRVVDIGFVCSICLS 314

Query: 176 IYCK--HLKKCSTCGS 189
           I+C+  H   C TCGS
Sbjct: 315 IFCEPLHDSTCLTCGS 330


>gi|226294041|gb|EEH49461.1| RNA polymerase II transcription factor B subunit 4
           [Paracoccidioides brasiliensis Pb18]
          Length = 408

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 60/138 (43%), Positives = 85/138 (61%), Gaps = 5/138 (3%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ- 120
           I+ L  +    + Y+ IMN+IF+ QR  +PID   L A ++AFLQQAS  TGG++     
Sbjct: 233 IISLSNNTHSAQHYIPIMNSIFACQRLHIPIDIIKL-AGDAAFLQQASDATGGIYIPINA 291

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
              G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ 
Sbjct: 292 HPAGFLQYLMLGFLPDQRARAHLILPSRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEP 351

Query: 181 L---KKCSTCGSVFGQAQ 195
           L    +C TCG+  G  +
Sbjct: 352 LGGEGECLTCGTRLGMVE 369


>gi|340515579|gb|EGR45832.1| hypothetical protein TRIREDRAFT_67476 [Trichoderma reesei QM6a]
          Length = 362

 Score =  114 bits (284), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 67/176 (38%), Positives = 91/176 (51%), Gaps = 20/176 (11%)

Query: 36  LSGSLSMALCYIQRVFRS--------------GLLHPQP---RILCLQGSPDGPEQYVAI 78
           LSG+L++ALC I +  ++                  P P   RI+ +  S   P QY+  
Sbjct: 149 LSGALTLALCRINKAAQALSSSDTTLSNAAPVNATAPPPVKGRIVVVSVSDSDPSQYIPT 208

Query: 79  MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
           MNA+F+A  + V ID+  L A +S FLQQA + T G+  K     GL  YL+     D  
Sbjct: 209 MNAVFAAAHNQVAIDTISL-AGDSTFLQQACFNTNGIFLKAANPRGLLTYLMFGLIPDTE 267

Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGSVFG 192
           +R  L  P    VDFR +CFCH   +D GY+CSVCLSI+C      +C TCG+V  
Sbjct: 268 ARASLITPTHDTVDFRTACFCHGKVVDTGYVCSVCLSIFCTPPDNAECLTCGTVLA 323


>gi|342873217|gb|EGU75428.1| hypothetical protein FOXB_14053 [Fusarium oxysporum Fo5176]
          Length = 358

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 68/204 (33%), Positives = 105/204 (51%), Gaps = 26/204 (12%)

Query: 6   ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS------------ 53
           A +L  + + M++      +E +     + +SG+L++ALC+I +  ++            
Sbjct: 118 AAVLAAIRKLMDQ-----TKEEDLSATTTQISGALTLALCHINKAAQALCAPTANLEDSH 172

Query: 54  -GLLHPQP-----RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQ 107
            G  +  P     RIL +  S   P QY+  MNA+F+A  + V ID+  L  + + FLQQ
Sbjct: 173 KGSSNTAPPTVRGRILVISVSDSEPSQYIPTMNAVFAAGHTQVAIDTLSLTGEPT-FLQQ 231

Query: 108 ASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMG 167
           A Y TGG +       GL  YL+     D  +R  L  P    VDFRA+CFCH   +D G
Sbjct: 232 ACYNTGGTYLAATHPQGLLNYLMFGLIADTEAREALIAPTHDTVDFRAACFCHGRVVDTG 291

Query: 168 YICSVCLSIYCK--HLKKCSTCGS 189
           ++CS+CLSI+C+     +C TCG+
Sbjct: 292 FVCSICLSIFCETPENSECFTCGT 315


>gi|347440963|emb|CCD33884.1| similar to RNA polymerase II transcription factor B subunit 4
           [Botryotinia fuckeliana]
          Length = 358

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 75/208 (36%), Positives = 106/208 (50%), Gaps = 33/208 (15%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR-------------- 52
           TLL ++   +   E   +QE       + ++G+L++AL YI +                 
Sbjct: 117 TLLNSVRSLI---ESTTEQE-VSTTTTTQMAGALTLALSYINKQLTAYSDVSAADSKPTP 172

Query: 53  ---------SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSA 103
                    +GL   Q RIL L  S D   QY+ IMN  F+AQR  +PID   L A ++ 
Sbjct: 173 SDITPSDSTTGL---QSRILVLSVSGDLAHQYIPIMNTTFAAQRMRIPIDILKL-AGDTV 228

Query: 104 FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 163
           FLQQAS  T G++   +   GL QYL+  F  D  SR  L  P    VDFRA+CFCH+  
Sbjct: 229 FLQQASDTTRGIYMHLKNPQGLLQYLMMAFLPDQTSRKDLIAPTQEVVDFRAACFCHRKV 288

Query: 164 IDMGYICSVCLSIYCKHLKK--CSTCGS 189
           +D+G++CS+CLSI+C   +   C TC +
Sbjct: 289 VDVGFVCSICLSIFCSPPEGAICLTCST 316


>gi|154310839|ref|XP_001554750.1| hypothetical protein BC1G_06398 [Botryotinia fuckeliana B05.10]
          Length = 356

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 75/208 (36%), Positives = 106/208 (50%), Gaps = 33/208 (15%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR-------------- 52
           TLL ++   +   E   +QE       + ++G+L++AL YI +                 
Sbjct: 115 TLLNSVRSLI---ESTTEQE-VSTTTTTQMAGALTLALSYINKQLTAYSDVSAADSKPTP 170

Query: 53  ---------SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSA 103
                    +GL   Q RIL L  S D   QY+ IMN  F+AQR  +PID   L A ++ 
Sbjct: 171 SDITPSDSTTGL---QSRILVLSVSGDLAHQYIPIMNTTFAAQRMRIPIDILKL-AGDTV 226

Query: 104 FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 163
           FLQQAS  T G++   +   GL QYL+  F  D  SR  L  P    VDFRA+CFCH+  
Sbjct: 227 FLQQASDTTRGIYMHLKNPQGLLQYLMMAFLPDQTSRKDLIAPTQEVVDFRAACFCHRKV 286

Query: 164 IDMGYICSVCLSIYCKHLKK--CSTCGS 189
           +D+G++CS+CLSI+C   +   C TC +
Sbjct: 287 VDVGFVCSICLSIFCSPPEGAICLTCST 314


>gi|303314233|ref|XP_003067125.1| Transcription factor Tfb4 family protein [Coccidioides posadasii
           C735 delta SOWgp]
 gi|240106793|gb|EER24980.1| Transcription factor Tfb4 family protein [Coccidioides posadasii
           C735 delta SOWgp]
 gi|320037386|gb|EFW19323.1| transcription factor TFIIH subunit Tfb4 [Coccidioides posadasii
           str. Silveira]
          Length = 395

 Score =  113 bits (283), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 56/130 (43%), Positives = 79/130 (60%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ +  +     QY+ IMN+IF+ QR  +PID C L   ++ FLQQA   T G++     
Sbjct: 216 IISVSSATGSAHQYIPIMNSIFACQRLHIPIDICKLSG-DAVFLQQACDATRGIYVPVDH 274

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             G  QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 275 PRGFLQYLMVAFLPDQRSRRHLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 334

Query: 182 K--KCSTCGS 189
           +   C TCG+
Sbjct: 335 EGADCLTCGT 344


>gi|119174356|ref|XP_001239539.1| hypothetical protein CIMG_09160 [Coccidioides immitis RS]
 gi|392869735|gb|EAS28255.2| transcription factor tfb4 [Coccidioides immitis RS]
          Length = 396

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 56/130 (43%), Positives = 79/130 (60%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ +  +     QY+ IMN+IF+ QR  +PID C L   ++ FLQQA   T G++     
Sbjct: 217 IISVSSATGSAHQYIPIMNSIFACQRLHIPIDICKLSG-DAVFLQQACDATRGIYVPVDH 275

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             G  QYL+  F  D  SR  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 276 PRGFLQYLMVAFLPDQRSRRHLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 335

Query: 182 K--KCSTCGS 189
           +   C TCG+
Sbjct: 336 EGADCLTCGT 345


>gi|443895518|dbj|GAC72864.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit TFB4 [Pseudozyma antarctica
           T-34]
          Length = 425

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 59/148 (39%), Positives = 83/148 (56%), Gaps = 15/148 (10%)

Query: 59  QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH-- 116
           + RIL L  + D   QY+ +MN IF+AQ+  + ID C L   ++ FLQQASY+T G +  
Sbjct: 253 KSRILILSVTQDASTQYIPMMNCIFAAQKKGITIDVCKLLGNDTVFLQQASYLTSGTYFR 312

Query: 117 ----------HKPQQLD---GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 163
                      +   +D    L Q LLT +      R+ + LP    +DFRA+CFCH+  
Sbjct: 313 LSDEQAGPSGQEASAIDLRSALVQTLLTTYLPSRSMRSSMNLPTLEEIDFRAACFCHRKI 372

Query: 164 IDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           +D+GYICSVCLS++C+    C TC S F
Sbjct: 373 VDVGYICSVCLSLFCQPRPFCLTCRSKF 400


>gi|156034244|ref|XP_001585541.1| hypothetical protein SS1G_13425 [Sclerotinia sclerotiorum 1980]
 gi|154698828|gb|EDN98566.1| hypothetical protein SS1G_13425 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 356

 Score =  113 bits (282), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 61/133 (45%), Positives = 80/133 (60%), Gaps = 3/133 (2%)

Query: 59  QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
           Q RIL L  S D   QY+ IMN  F+AQR  +PID   L A ++ FLQQAS  T G++  
Sbjct: 183 QSRILVLSVSGDLAHQYIPIMNTTFAAQRMRIPIDILKL-AGDTVFLQQASDTTRGIYMH 241

Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
            +   GL QYL+  F  D  SR  L  P    VDFRA+CFCH+  +D+G++CS+CLSI+C
Sbjct: 242 LKNPQGLLQYLMIAFLPDQTSRKDLIAPTQEVVDFRAACFCHRKVVDVGFVCSICLSIFC 301

Query: 179 KHLKK--CSTCGS 189
              +   C TC +
Sbjct: 302 SPPEGAICLTCST 314


>gi|195111388|ref|XP_002000261.1| GI10128 [Drosophila mojavensis]
 gi|193916855|gb|EDW15722.1| GI10128 [Drosophila mojavensis]
          Length = 334

 Score =  113 bits (282), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 66/191 (34%), Positives = 99/191 (51%), Gaps = 6/191 (3%)

Query: 25  QEPEGRIACSLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVAI-MNA 81
           +E E +   +LL G+++M L Y+ R  R  + + H + RIL + GS + P     + MNA
Sbjct: 131 KEEEEQTVTALLPGTVAMVLSYLSRCRREVAKVAHIRGRILIVSGSKEPPIPLATMQMNA 190

Query: 82  IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 141
              A R  V ID C L  ++S  L+ A+ ITGG +      D L   LL +F    H R 
Sbjct: 191 FQVAARMGVVIDVCALELESSYMLRHAADITGGFYFSTSNFDTLSGVLLGLFLASPHVRQ 250

Query: 142 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFG---QAQTQS 198
            L  P     D RA CFCH   ++MG++CS CL+++CK+   C  C ++F    + Q   
Sbjct: 251 HLNYPVQPQADLRAMCFCHSKLVEMGFVCSSCLTVFCKYTPICGKCETIFKPPQKLQMYI 310

Query: 199 DEPSATNRKRK 209
           D  +A  R+R+
Sbjct: 311 DNRAAQRRQRE 321


>gi|302899112|ref|XP_003047982.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256728914|gb|EEU42269.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 355

 Score =  112 bits (281), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 68/196 (34%), Positives = 100/196 (51%), Gaps = 24/196 (12%)

Query: 16  MNKDEQLGKQEPEGRIACSL--LSGSLSMALCYIQRVFRSGLLHPQP------------- 60
           ++   +L  Q  E  +A +   LSG+L++ALC+I +  ++ L  P               
Sbjct: 120 LDSVRKLMDQTTEADLATTTTQLSGALTLALCHINKAAQA-LCSPTANLEDNHKATTTAP 178

Query: 61  -----RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGV 115
                RIL +  S   P QY+  MNA+F+A  + V ID+  L   +  FLQQA + TGG 
Sbjct: 179 PTVRGRILVISVSDSEPSQYIPTMNAVFAAAHAQVAIDTLSLTG-DPTFLQQACFNTGGT 237

Query: 116 HHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS 175
           +       GL  YL+     D  +R  L  P    VDFRA+CFCH   +D G++CS+CLS
Sbjct: 238 YLAATHPQGLLTYLMFGLIADTEAREALVAPTHDTVDFRAACFCHGRVVDTGFVCSICLS 297

Query: 176 IYCK--HLKKCSTCGS 189
           I+C+     +C TCG+
Sbjct: 298 IFCELPENSECLTCGT 313


>gi|302496747|ref|XP_003010374.1| hypothetical protein ARB_03075 [Arthroderma benhamiae CBS 112371]
 gi|291173917|gb|EFE29734.1| hypothetical protein ARB_03075 [Arthroderma benhamiae CBS 112371]
          Length = 414

 Score =  112 bits (281), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 55/130 (42%), Positives = 82/130 (63%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ +  + +  +QY+ IMN+IF+ QR  +PID C L A ++ FLQQA   T G++     
Sbjct: 235 IISVSSAANSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 293

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +++G++CS+CLSI+C+  
Sbjct: 294 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVNVGFVCSICLSIFCEPP 353

Query: 182 K--KCSTCGS 189
           +   C TCG+
Sbjct: 354 EGANCLTCGT 363


>gi|440633398|gb|ELR03317.1| hypothetical protein GMDG_06064 [Geomyces destructans 20631-21]
          Length = 363

 Score =  112 bits (281), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 73/207 (35%), Positives = 104/207 (50%), Gaps = 30/207 (14%)

Query: 8   LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQR---------VFRSGLLHP 58
           LL +L E ++        E       + ++G+L++AL Y  +         V  SG  HP
Sbjct: 121 LLTSLRELIDST----TTEDVSSTTTTQMAGALTLALSYSNKATVTYNESGVSSSGKSHP 176

Query: 59  QP--------------RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAF 104
                           RIL +  S D   QY+ IMN  F+AQR  +PID   L   ++ F
Sbjct: 177 DGGENPTDDVPRGLLCRILVISVSGDLAHQYIPIMNTTFAAQRLRIPIDILKLSG-DTVF 235

Query: 105 LQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTI 164
           LQQAS  T G + + +   GL Q L+  F  D  +R  L  P    VDFRA+CFCH+N +
Sbjct: 236 LQQASDATNGTYMQLRNPQGLLQSLMLGFLPDQSARTHLISPTQEVVDFRAACFCHRNVV 295

Query: 165 DMGYICSVCLSIYCKHLKK--CSTCGS 189
           D+G++CS+CLSI+C   +   C TCG+
Sbjct: 296 DVGFVCSICLSIFCSPPEGAICLTCGT 322


>gi|358396042|gb|EHK45429.1| hypothetical protein TRIATDRAFT_221909 [Trichoderma atroviride IMI
           206040]
          Length = 367

 Score =  112 bits (281), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 65/176 (36%), Positives = 92/176 (52%), Gaps = 20/176 (11%)

Query: 36  LSGSLSMALCYIQRVFRS--------------GLLHPQP---RILCLQGSPDGPEQYVAI 78
           LSG+L++ALC I +  ++                  P P   RI+ +  S   P QY+  
Sbjct: 154 LSGALTLALCRINKASQALSASDTTLSNAAPVNSAAPPPVKSRIVVISVSDSEPSQYIPT 213

Query: 79  MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
           MNA+F+A  + V ID+  L A +S FLQQA + T G+  K     GL  YL+     D  
Sbjct: 214 MNAVFAAAHNQVAIDTIAL-AGDSTFLQQACFNTNGIFLKASNPQGLLTYLMFGLIPDTE 272

Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGSVFG 192
           +R  +  P    VDFR +CFCH   +D G++CSVCLSI+C+     +C TCG+V  
Sbjct: 273 ARESIITPAHDTVDFRTACFCHGRVVDTGFVCSVCLSIFCEPPENAECLTCGTVLA 328


>gi|258567552|ref|XP_002584520.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237905966|gb|EEP80367.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 398

 Score =  112 bits (281), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 56/130 (43%), Positives = 80/130 (61%), Gaps = 3/130 (2%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
           I+ +  +     QY+ IMN+IF+ QR  +PID C L   ++ FLQQA   T G++     
Sbjct: 219 IISVSSATGSAHQYIPIMNSIFACQRLHIPIDICKLSG-DAVFLQQACDATRGIYVPVDH 277

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
             G  QYL+  F  D  SR+ L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+  
Sbjct: 278 PLGFLQYLMVAFLPDQRSRSHLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 337

Query: 182 K--KCSTCGS 189
           +   C TCG+
Sbjct: 338 EGADCLTCGT 347


>gi|343426778|emb|CBQ70306.1| related to TFIIH basal transcription factor complex p34 subunit
           [Sporisorium reilianum SRZ2]
          Length = 422

 Score =  112 bits (279), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 59/148 (39%), Positives = 83/148 (56%), Gaps = 15/148 (10%)

Query: 59  QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
           + RIL L  + D   QY+ +MN IF+AQ+  + ID C L   ++ FLQQASY+T G + +
Sbjct: 251 KSRILILSVTQDASTQYIPMMNCIFAAQKRGITIDVCKLFGSDTVFLQQASYLTSGTYFR 310

Query: 119 ----PQQLDG-----------LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 163
                +Q +G           L Q LLT +      R  + LP    +DFR +CFCH+  
Sbjct: 311 LADDHEQSNGDKAGAIDMRSSLVQTLLTTYLPSRSMRGVMNLPTLEEIDFRTACFCHRRI 370

Query: 164 IDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           +D+GYICSVCLS++C+    C TC S F
Sbjct: 371 VDVGYICSVCLSLFCQPRPFCLTCRSKF 398


>gi|322698540|gb|EFY90309.1| transcription factor TFIIH subunit Tfb4, putative [Metarhizium
           acridum CQMa 102]
          Length = 358

 Score =  112 bits (279), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 68/173 (39%), Positives = 92/173 (53%), Gaps = 20/173 (11%)

Query: 36  LSGSLSMALCYIQRVFR-----SGLLH--PQP----------RILCLQGSPDGPEQYVAI 78
           LSG+L++ALC+I +  +     SG L    QP          RIL +  S   P QY+ I
Sbjct: 144 LSGALTLALCHINKASQALSGASGSLAEVAQPSANAALPMKGRILVVSVSDSEPSQYIPI 203

Query: 79  MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
           MNA+F+A  + V ID+  L   N  FLQQA + T G         GL  YL+     D  
Sbjct: 204 MNAVFAAAHAQVAIDTLCLSG-NPTFLQQACFNTNGTFLAAANPKGLLAYLMFGLIADTE 262

Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
           +R  L  P    VDFRA+CFCH   +D G++CS+CLSI+C+     +C TCG+
Sbjct: 263 ARQSLITPTHDTVDFRAACFCHGKVVDRGFVCSICLSIFCELPENAECLTCGT 315


>gi|322709828|gb|EFZ01403.1| transcription factor TFIIH subunit Tfb4, putative [Metarhizium
           anisopliae ARSEF 23]
          Length = 358

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 67/173 (38%), Positives = 91/173 (52%), Gaps = 20/173 (11%)

Query: 36  LSGSLSMALCYIQRVFR------SGLLH-PQP----------RILCLQGSPDGPEQYVAI 78
           LSG+L++ALC+I +  +      S L    QP          RIL +  S   P QY+  
Sbjct: 144 LSGALTLALCHINKASQALSGASSSLAEVAQPSANAAPPMKGRILVVSVSDSEPTQYIPT 203

Query: 79  MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
           MNA+F+A  + V ID+  L   N  FLQQA + T G+        GL  YL+     D  
Sbjct: 204 MNAVFAAAHAQVAIDTLSLSG-NPTFLQQACFNTNGIFLAAANPKGLLTYLMFGLIADTE 262

Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
           +R  L  P    VDFRA+CFCH   +D G+ICS+CLSI+C+     +C TCG+
Sbjct: 263 ARQSLITPTHDTVDFRAACFCHGKVVDRGFICSICLSIFCELPENAECLTCGT 315


>gi|327352824|gb|EGE81681.1| transcription factor TFIIH subunit Tfb4 [Ajellomyces dermatitidis
           ATCC 18188]
          Length = 426

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 84/141 (59%), Gaps = 8/141 (5%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK-PQ 120
           I+ L  +    + Y+ IMN+IF+ QR  +PID   L A ++AFLQQAS  T G++     
Sbjct: 246 IISLSSTTHAAQHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPVTS 304

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
              G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ 
Sbjct: 305 HPAGFLQYLMLGFLPDQRARAHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 364

Query: 181 L------KKCSTCGSVFGQAQ 195
           L       +C TCGS  G  +
Sbjct: 365 LGGVDGVAECLTCGSQLGVGE 385


>gi|261201796|ref|XP_002628112.1| transcription factor TFIIH subunit Tfb4 [Ajellomyces dermatitidis
           SLH14081]
 gi|239590209|gb|EEQ72790.1| transcription factor TFIIH subunit Tfb4 [Ajellomyces dermatitidis
           SLH14081]
          Length = 425

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 84/141 (59%), Gaps = 8/141 (5%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK-PQ 120
           I+ L  +    + Y+ IMN+IF+ QR  +PID   L A ++AFLQQAS  T G++     
Sbjct: 245 IISLSSTTHAAQHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPVTS 303

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
              G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ 
Sbjct: 304 HPAGFLQYLMLGFLPDQRARAHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 363

Query: 181 L------KKCSTCGSVFGQAQ 195
           L       +C TCGS  G  +
Sbjct: 364 LGGVDGVAECLTCGSQLGVGE 384


>gi|449295631|gb|EMC91652.1| hypothetical protein BAUCODRAFT_300256 [Baudoinia compniacensis
           UAMH 10762]
          Length = 386

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 77/221 (34%), Positives = 107/221 (48%), Gaps = 43/221 (19%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS------------- 53
           TL+ NL + MN        +       ++L+G+L+ AL YI +   S             
Sbjct: 128 TLMYNLRKMMNTT----TTDAINSSPATMLAGALTRALAYISKQTASLPTATSSTQFNYS 183

Query: 54  -------------------GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 94
                                L    RIL +  S D  ++Y+ IMN IF+ QR  VPID 
Sbjct: 184 DPSAMAGGNELTTDPTKSANALGLSSRILIVSVSGDLADRYIEIMNCIFACQRMSVPIDV 243

Query: 95  CYLGAQNSAFLQQASYITGGVH---HKPQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVG 150
             L A +  FLQQA+  TGG++          GL QYL+  +  D  +RN L  P +  G
Sbjct: 244 LKL-AGDPVFLQQAADTTGGIYMSLSTSTARAGLLQYLMFAYLMDQTARNHLIAPGEGEG 302

Query: 151 VDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGS 189
           VDFRA+CFCHK  +D+G++CS+CLSI+C+ L    C  CGS
Sbjct: 303 VDFRAACFCHKRVVDIGFVCSICLSIFCEPLPDGTCLLCGS 343


>gi|239611920|gb|EEQ88907.1| transcription factor TFIIH subunit Tfb4 [Ajellomyces dermatitidis
           ER-3]
          Length = 425

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 84/141 (59%), Gaps = 8/141 (5%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK-PQ 120
           I+ L  +    + Y+ IMN+IF+ QR  +PID   L A ++AFLQQAS  T G++     
Sbjct: 245 IISLSSTTHAAQHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPVTS 303

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
              G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ 
Sbjct: 304 HPAGFLQYLMLGFLPDQRARAHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 363

Query: 181 L------KKCSTCGSVFGQAQ 195
           L       +C TCGS  G  +
Sbjct: 364 LGGVDGVAECLTCGSQLGVGE 384


>gi|302680643|ref|XP_003030003.1| hypothetical protein SCHCODRAFT_30664 [Schizophyllum commune H4-8]
 gi|300103694|gb|EFI95100.1| hypothetical protein SCHCODRAFT_30664 [Schizophyllum commune H4-8]
          Length = 257

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 60/142 (42%), Positives = 82/142 (57%), Gaps = 11/142 (7%)

Query: 36  LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 95
            + +L+ +LCYI R     L  P  RIL L  SPD  + YV  MNAIFSAQ+  VPID+ 
Sbjct: 121 FTSALTRSLCYINR-----LQSPSSRILLLSVSPDRAQDYVPFMNAIFSAQKLKVPIDTL 175

Query: 96  YLGAQNSAFLQQASYITGGVHHK------PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
            L A +S FLQQA+Y+TGG + +        +  GL QYL+  F      R  + +P   
Sbjct: 176 QLAAHDSVFLQQATYLTGGAYVRLGDESGEGKRGGLLQYLMMCFLPPPALRQVMAVPTMD 235

Query: 150 GVDFRASCFCHKNTIDMGYICS 171
            V+ RA+CFCHK   ++G++CS
Sbjct: 236 QVNLRAACFCHKRMTEIGFVCS 257


>gi|240274744|gb|EER38259.1| RNA polymerase II transcription factor B subunit 4 [Ajellomyces
           capsulatus H143]
          Length = 412

 Score =  110 bits (275), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 59/135 (43%), Positives = 82/135 (60%), Gaps = 8/135 (5%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH-HKPQ 120
           I+ L  +      Y+ IMN+IF+ QR  +PID   L A ++AFLQQAS  T G++     
Sbjct: 231 IISLSSTTHSAHHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPATS 289

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
              G  QYL+  F  D  +R+ L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ 
Sbjct: 290 HPAGFLQYLMLGFLPDQRARSHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 349

Query: 181 L------KKCSTCGS 189
           L       +C TCGS
Sbjct: 350 LAGVDGVAECLTCGS 364


>gi|410976500|ref|XP_003994658.1| PREDICTED: general transcription factor IIH subunit 3 [Felis catus]
          Length = 407

 Score =  110 bits (275), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 74/213 (34%), Positives = 108/213 (50%), Gaps = 44/213 (20%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS-----------------------GLLHP 58
           + K + +G+   +LL+GSL+ ALCYI R+ +                         +LH 
Sbjct: 177 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILFQAPNAPMSMPNILHV 236

Query: 59  QPRILC-----------LQGSPDGPEQYVAIMNAIFSAQ------RSM---VPIDSCYLG 98
                C           L+G+ D    Y     ++  A       RS+   + ID+C L 
Sbjct: 237 GAAACCSERPWALGTGRLEGAGDSTAFYRLCDRSLAKAWVLQLQMRSIYLNILIDACVLD 296

Query: 99  AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCF 158
           + +S  LQQA  ITGG++ K  Q+  L QYLL +F  D   R+ L LP P+ VD+RA+CF
Sbjct: 297 S-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLILPPPIHVDYRAACF 355

Query: 159 CHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           CH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 356 CHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 388


>gi|225558318|gb|EEH06602.1| RNA polymerase II transcription factor B subunit 4 [Ajellomyces
           capsulatus G186AR]
          Length = 412

 Score =  110 bits (275), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 59/135 (43%), Positives = 82/135 (60%), Gaps = 8/135 (5%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH-HKPQ 120
           I+ L  +      Y+ IMN+IF+ QR  +PID   L A ++AFLQQAS  T G++     
Sbjct: 231 IISLSSTTHSAHHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPATS 289

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
              G  QYL+  F  D  +R+ L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ 
Sbjct: 290 HPAGFLQYLMLGFLPDQRARSHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 349

Query: 181 L------KKCSTCGS 189
           L       +C TCGS
Sbjct: 350 LAGVDGVAECLTCGS 364


>gi|325094096|gb|EGC47406.1| RNA polymerase II transcription factor B subunit 4 [Ajellomyces
           capsulatus H88]
          Length = 412

 Score =  110 bits (274), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 59/135 (43%), Positives = 82/135 (60%), Gaps = 8/135 (5%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH-HKPQ 120
           I+ L  +      Y+ IMN+IF+ QR  +PID   L A ++AFLQQAS  T G++     
Sbjct: 231 IISLSSTTHSAHHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPATP 289

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
              G  QYL+  F  D  +R+ L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ 
Sbjct: 290 HPAGFLQYLMLGFLPDQRARSHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 349

Query: 181 L------KKCSTCGS 189
           L       +C TCGS
Sbjct: 350 LAGVDGVAECLTCGS 364


>gi|154286010|ref|XP_001543800.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407441|gb|EDN02982.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 413

 Score =  110 bits (274), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 59/135 (43%), Positives = 82/135 (60%), Gaps = 8/135 (5%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH-HKPQ 120
           I+ L  +      Y+ IMN+IF+ QR  +PID   L A ++AFLQQAS  T G++     
Sbjct: 232 IISLSSTTHSAHHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPATS 290

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
              G  QYL+  F  D  +R+ L LP  V VDFRA+CFCH+  +D+G++CS+CLSI+C+ 
Sbjct: 291 HPAGFLQYLMLGFLPDQRARSHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 350

Query: 181 L------KKCSTCGS 189
           L       +C TCGS
Sbjct: 351 LGGVDGVAECLTCGS 365


>gi|389644740|ref|XP_003720002.1| RNA polymerase II transcription factor B subunit 4 [Magnaporthe
           oryzae 70-15]
 gi|351639771|gb|EHA47635.1| RNA polymerase II transcription factor B subunit 4 [Magnaporthe
           oryzae 70-15]
 gi|440470636|gb|ELQ39698.1| RNA polymerase II transcription factor B subunit 4 [Magnaporthe
           oryzae Y34]
 gi|440480828|gb|ELQ61470.1| RNA polymerase II transcription factor B subunit 4 [Magnaporthe
           oryzae P131]
          Length = 380

 Score =  109 bits (273), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 58/134 (43%), Positives = 82/134 (61%), Gaps = 7/134 (5%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL-GAQNSAFLQQASYITGGVHHKP 119
           RIL +  S    +QY+  MNA+F+A  + VP+D   L GA  + FLQQ ++ITGG +   
Sbjct: 208 RILTVSVSDSSADQYIPTMNAVFAASSAGVPLDVLALRGA--APFLQQGAFITGGTYIAA 265

Query: 120 QQLDGLFQYLLTIFGTD--LHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
            +  G+  YL+T F +   + S+  L  P    VDFRA+CFCH+  +D GY+CSVCLSI+
Sbjct: 266 TEPRGILAYLMTGFASTSGVGSQGLLLSPGSESVDFRAACFCHRRAVDTGYVCSVCLSIF 325

Query: 178 CK--HLKKCSTCGS 189
           C+     +C TCGS
Sbjct: 326 CEVPSGAECLTCGS 339


>gi|453081361|gb|EMF09410.1| transcription factor Tfb4 [Mycosphaerella populorum SO2202]
          Length = 287

 Score =  109 bits (272), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 60/134 (44%), Positives = 83/134 (61%), Gaps = 6/134 (4%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHH--K 118
           R+L L  S D   QY+ IMN+IF+ QR  +PID   L A ++ FLQQA+  TGG++    
Sbjct: 114 RLLILSVSGDLANQYIPIMNSIFACQRLSIPIDILKL-AGDTVFLQQAADATGGIYMALD 172

Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
                G  QYL+  +  D  +R  L  P +  GVDFRA+CFCH+  +D+G++CS+CLSI+
Sbjct: 173 SNTRGGFLQYLMFAYLPDETARKHLITPGEGEGVDFRAACFCHRRVVDIGFVCSICLSIF 232

Query: 178 CK--HLKKCSTCGS 189
           C+  H   C TCGS
Sbjct: 233 CEPLHDSLCLTCGS 246


>gi|344300009|gb|EGW30349.1| hypothetical protein SPAPADRAFT_63205 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 367

 Score =  109 bits (272), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 78/246 (31%), Positives = 113/246 (45%), Gaps = 47/246 (19%)

Query: 8   LLQNLEEFMNKDEQLGKQEPEGRIA--CSLLSGSLSMALCYIQRVF-------------- 51
           +L+ L EF+N   ++ K + +  +A   S LSG+LSMAL Y  R+               
Sbjct: 121 VLEELNEFLN---EINKDQDDTMLADPASKLSGALSMALTYTNRMLTLDQSITTTTASAI 177

Query: 52  -----------------------RSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRS 88
                                   S     + RIL +  + D   +Y+ IMN IF+AQ+ 
Sbjct: 178 ASTTSSSASAASSTATGAGGASSGSSSTSMKSRILVVSANDDNDVKYIPIMNTIFAAQKM 237

Query: 89  MVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 147
            + ID   LG  +NS++LQQ S  T G++       G+ Q L T +  +   R F+ LP 
Sbjct: 238 KLSIDIIKLGDEENSSYLQQPSDATNGIYLHLTDPRGIIQVLSTAYFIEPSLRPFIILPT 297

Query: 148 PVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQAQTQ--SDEPSA 203
              V++RASCF     +D+GY+CSVCL I       KKC TC S F +      + EP  
Sbjct: 298 NSNVNYRASCFITGKAVDVGYVCSVCLCIMSIIPDSKKCPTCNSQFDERIIHQLTKEPEV 357

Query: 204 TNRKRK 209
             +KRK
Sbjct: 358 VKKKRK 363


>gi|346972888|gb|EGY16340.1| RNA polymerase II transcription factor B subunit 4 [Verticillium
           dahliae VdLs.17]
          Length = 398

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 74/213 (34%), Positives = 101/213 (47%), Gaps = 45/213 (21%)

Query: 20  EQLGKQEPEGRIACSL--LSGSLSMALCYIQRV-------FRSGLLHP--------QPRI 62
           ++L  +  E  + C+   LSG+L++AL +I +V         S    P           I
Sbjct: 128 KKLLDETTEADLDCTTTQLSGALTLALAHINKVNLSYTASLSSDTAKPMTTAAPGLSSHI 187

Query: 63  LCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG---------------------AQN 101
           L +  S   P QY++ MNA+F+A  + +PID   L                      A +
Sbjct: 188 LVVSVSDSDPSQYISTMNAVFAAAHASIPIDVLSLSPATAKQTTATQSTDPNDPHKPASS 247

Query: 102 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFG---TDLHSRNFLQLPKPVGVDFRASCF 158
           S+FLQQAS+IT G   +P    GL  YL   FG    D  +R  L  P    VDFRA+CF
Sbjct: 248 SSFLQQASHITNGTFLEPPSPLGLLAYL--TFGLAVADTRARAALVPPTHHTVDFRAACF 305

Query: 159 CHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
           CH   ID G++CSVCLSI+C       C TCG+
Sbjct: 306 CHGRVIDTGFVCSVCLSIFCSVPEGADCLTCGT 338


>gi|46138695|ref|XP_391038.1| hypothetical protein FG10862.1 [Gibberella zeae PH-1]
 gi|408390605|gb|EKJ69997.1| hypothetical protein FPSE_09842 [Fusarium pseudograminearum CS3096]
          Length = 355

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/174 (35%), Positives = 92/174 (52%), Gaps = 21/174 (12%)

Query: 36  LSGSLSMALCYIQRVFRS-------------GLLHPQP-----RILCLQGSPDGPEQYVA 77
           +SG+L++ALC+I +  ++             G     P     RIL +  S   P QY+ 
Sbjct: 141 ISGALTLALCHINKAAQALCSPTANLEDSHKGSSSTSPPTVRGRILVISVSDSEPSQYIP 200

Query: 78  IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
            MNA+F+A  + V ID+  L  + + FL+QA Y TGG +       GL  YL+    +D 
Sbjct: 201 TMNAVFAAAHTQVAIDTLSLTGEPT-FLEQACYNTGGTYLAATHPQGLLNYLMFGLMSDT 259

Query: 138 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
            +R  L       VDFRA+CFCH   +D G++CS+CLSI+C+     +C TCG+
Sbjct: 260 EAREALINSVHDTVDFRAACFCHGRVVDTGFVCSICLSIFCETPENSECFTCGT 313


>gi|302413233|ref|XP_003004449.1| RNA polymerase II transcription factor B subunit 4 [Verticillium
           albo-atrum VaMs.102]
 gi|261357025|gb|EEY19453.1| RNA polymerase II transcription factor B subunit 4 [Verticillium
           albo-atrum VaMs.102]
          Length = 384

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 74/213 (34%), Positives = 101/213 (47%), Gaps = 45/213 (21%)

Query: 20  EQLGKQEPEGRIACSL--LSGSLSMALCYIQRV-------FRSGLLHP--------QPRI 62
           ++L  +  E  + C+   LSG+L++AL +I +V         S    P           I
Sbjct: 128 KKLLDETTEADLDCTTTQLSGALTLALAHINKVNLSYTASLSSDTAKPMTTAAPGLSSHI 187

Query: 63  LCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG---------------------AQN 101
           L +  S   P QY++ MNA+F+A  + +PID   L                      A +
Sbjct: 188 LVVSVSDSDPSQYISTMNAVFAAAHASIPIDVLSLSPATAKQTAAAQTTDPNDPHKSASS 247

Query: 102 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFG---TDLHSRNFLQLPKPVGVDFRASCF 158
           S+FLQQAS+IT G   +P    GL  YL   FG    D  +R  L  P    VDFRA+CF
Sbjct: 248 SSFLQQASHITNGTFLEPPSPLGLLAYL--TFGLAVADTRARAALVPPTHHTVDFRAACF 305

Query: 159 CHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
           CH   ID G++CSVCLSI+C       C TCG+
Sbjct: 306 CHGRVIDTGFVCSVCLSIFCSVPEGADCLTCGT 338


>gi|294658697|ref|XP_461035.2| DEHA2F15510p [Debaryomyces hansenii CBS767]
 gi|218511849|sp|Q6BL86.2|TFB4_DEBHA RecName: Full=RNA polymerase II transcription factor B subunit 4;
           AltName: Full=RNA polymerase II transcription factor B
           34 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p34 subunit
 gi|202953319|emb|CAG89405.2| DEHA2F15510p [Debaryomyces hansenii CBS767]
          Length = 387

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 57/143 (39%), Positives = 83/143 (58%), Gaps = 2/143 (1%)

Query: 53  SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
           + L   + RIL +  + D   +Y+ IMN  F+AQ+  VPID   LG ++S++LQQAS  T
Sbjct: 210 TSLTSMRSRILIVSSNDDNDIKYIPIMNTTFAAQKMKVPIDVAKLGERDSSYLQQASDAT 269

Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 172
            GV+      +GL Q L T F  +   R+ + LP    V++RASCF    ++++G++CSV
Sbjct: 270 NGVYLHIANPEGLIQTLSTAFFIEPSIRSLIILPTNSNVNYRASCFITGKSVNLGFVCSV 329

Query: 173 CLSIYCKHLK--KCSTCGSVFGQ 193
           CL I     K  KC TCGS F +
Sbjct: 330 CLCIMSIIPKEGKCPTCGSKFDE 352


>gi|448102380|ref|XP_004199788.1| Piso0_002332 [Millerozyma farinosa CBS 7064]
 gi|359381210|emb|CCE81669.1| Piso0_002332 [Millerozyma farinosa CBS 7064]
          Length = 376

 Score =  106 bits (264), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 59/157 (37%), Positives = 91/157 (57%), Gaps = 5/157 (3%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RIL +  + D   +Y+ IMN++F+AQ+  +PID   LG+ +S++LQQAS  T G++ + +
Sbjct: 213 RILVISANDDDDVKYIPIMNSMFAAQKMRLPIDVAKLGSVSSSYLQQASDATNGIYLQVE 272

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YC 178
             +GL Q L T F  +   R  + LP    V +RASCF    ++D+GY+CSVCL I    
Sbjct: 273 NPEGLIQTLCTAFFIESSIRPLIILPSNSNVSYRASCFITGKSVDLGYVCSVCLCIMSII 332

Query: 179 KHLKKCSTCGSVFGQA---QTQSDEPSATNRKRKTTD 212
               KC TC S F ++   Q Q+    +T R++  T+
Sbjct: 333 PTDAKCPTCNSHFDKSILNQLQNSVVLSTKRRKIDTN 369


>gi|448098480|ref|XP_004198937.1| Piso0_002332 [Millerozyma farinosa CBS 7064]
 gi|359380359|emb|CCE82600.1| Piso0_002332 [Millerozyma farinosa CBS 7064]
          Length = 376

 Score =  105 bits (263), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 58/153 (37%), Positives = 90/153 (58%), Gaps = 5/153 (3%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RIL +  + D   +Y+ IMN++F+AQ+  +PID   LG+ +S++LQQAS  T G++ + +
Sbjct: 213 RILVISANDDDDVKYIPIMNSMFAAQKMRLPIDVAKLGSISSSYLQQASDATNGIYLQVE 272

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YC 178
           + +GL Q L T F  +   R  + LP    V +RASCF    ++D+GY+CSVCL I    
Sbjct: 273 KPEGLIQTLCTAFFIESSIRPLIILPSNSNVSYRASCFITGKSVDLGYVCSVCLCIMSII 332

Query: 179 KHLKKCSTCGSVFGQA---QTQSDEPSATNRKR 208
               KC TC S F ++   Q Q+    +T R++
Sbjct: 333 PTDAKCPTCNSHFDKSILNQLQNSVVLSTKRRK 365


>gi|255727412|ref|XP_002548632.1| hypothetical protein CTRG_02929 [Candida tropicalis MYA-3404]
 gi|240134556|gb|EER34111.1| hypothetical protein CTRG_02929 [Candida tropicalis MYA-3404]
          Length = 369

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 58/158 (36%), Positives = 88/158 (55%), Gaps = 4/158 (2%)

Query: 59  QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
           + R+L +  + D   +Y+ +MN IF+AQ+  V ID   LG +NS++LQQAS  T GV+  
Sbjct: 202 KSRVLIVSANDDDDIKYIPLMNCIFAAQKMKVSIDVAKLGHKNSSYLQQASDATRGVYLH 261

Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
            +   G+ Q L T F  + + R ++ LP    V++RASCF    ++D+GY+CSVCL I  
Sbjct: 262 IEDPKGIIQVLSTAFFIEPNLRPYIILPTNSNVNYRASCFLTGKSVDLGYVCSVCLCIMS 321

Query: 179 K--HLKKCSTCGSVFGQAQTQS--DEPSATNRKRKTTD 212
           K     KC  C S F +        EP   ++K++  D
Sbjct: 322 KLPDNGKCPACDSAFDKKIINELIREPQVISKKKRKLD 359


>gi|406602443|emb|CCH45984.1| RNA polymerase II transcription factor B subunit 4 [Wickerhamomyces
           ciferrii]
          Length = 361

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 61/148 (41%), Positives = 81/148 (54%), Gaps = 4/148 (2%)

Query: 46  YIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFL 105
           YI ++  +  L  + +IL +  S D   +Y+ IMN IF+AQ+  V ID C LG  N+ FL
Sbjct: 160 YINKIQSNDSLM-KAKILIVNISQDEHLKYIPIMNCIFAAQKMKVSIDVCQLGL-NATFL 217

Query: 106 QQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTID 165
           QQAS  T GV+   + +DGL QY  T    D   +N L  P    +DFRASCF     +D
Sbjct: 218 QQASDATNGVYLHIENMDGLIQYFTTALFIDPSIKNILTKPNKGDIDFRASCFLTGKIVD 277

Query: 166 MGYICSVCLSI--YCKHLKKCSTCGSVF 191
           +G++CSVCL I        KC  C S F
Sbjct: 278 IGFVCSVCLCILSLIPEDNKCPACDSEF 305


>gi|241956274|ref|XP_002420857.1| RNA polymerase II transcription factor B subunit 4, putative; TFIIH
           subunit TFB4, putative [Candida dubliniensis CD36]
 gi|223644200|emb|CAX41010.1| RNA polymerase II transcription factor B subunit 4, putative
           [Candida dubliniensis CD36]
          Length = 363

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 88/155 (56%), Gaps = 4/155 (2%)

Query: 59  QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
           + RIL +  + D   +Y+ +MN IF+AQ+  V ID   LG  NS++LQQAS  T GV+  
Sbjct: 200 KSRILIVSANDDDDLRYIPLMNCIFAAQKMKVSIDVAKLGHNNSSYLQQASDATKGVYLH 259

Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
            +   G+ Q L T F  + + R ++ LP    V++RASCF    ++D+GY+CSVCL I  
Sbjct: 260 IEDPKGIIQVLSTAFFIEPNLRPYIILPTNSNVNYRASCFITGKSVDLGYVCSVCLCIMS 319

Query: 179 K--HLKKCSTCGSVFGQA--QTQSDEPSATNRKRK 209
           +     KC  C S F +   Q  + EP   ++K++
Sbjct: 320 QLPQSGKCPACDSEFDKKIIQELNREPQVISKKKR 354


>gi|50547937|ref|XP_501438.1| YALI0C04367p [Yarrowia lipolytica]
 gi|62901295|sp|Q6CD24.1|TFB4_YARLI RecName: Full=RNA polymerase II transcription factor B subunit 4;
           AltName: Full=RNA polymerase II transcription factor B
           34 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p34 subunit
 gi|49647305|emb|CAG81737.1| YALI0C04367p [Yarrowia lipolytica CLIB122]
          Length = 340

 Score =  105 bits (261), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 64/169 (37%), Positives = 94/169 (55%), Gaps = 9/169 (5%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQ-GSPDGPEQYVAIMNAIFSAQRSMVPI 92
           S +SG+LS+AL +I +         + RIL L  G  +   QY+  MN IF+AQ+  +P+
Sbjct: 151 SAISGALSLALGFINKHVFVDESRMRARILLLTVGHKNETIQYIPTMNCIFAAQKLKIPV 210

Query: 93  DSCYLG-AQNSAFLQQASYITGGVHHKPQQLD-----GLFQYLLTIFGTDLHSRNFLQLP 146
           D C LG   +  FLQQA   T G++    + +     GL QYLL+ F +D   R  + LP
Sbjct: 211 DVCKLGPGSDQVFLQQACDSTHGIYMDISEKNSKTPKGLVQYLLSGFISDPSLRPHIVLP 270

Query: 147 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGSVFGQ 193
               VDFRA+CF  K  +D+GY+CSVCL I  +    ++C TC + + +
Sbjct: 271 TQSNVDFRAACFLTKQVVDIGYVCSVCLCIMSQIPSNRRCPTCDTTYSE 319


>gi|402084346|gb|EJT79364.1| RNA polymerase II transcription factor B subunit 4 [Gaeumannomyces
           graminis var. tritici R3-111a-1]
          Length = 382

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 54/122 (44%), Positives = 74/122 (60%), Gaps = 4/122 (3%)

Query: 74  QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIF 133
           QY+  MNA+F+A  + VP+D   L   N++FLQQ SYITGG     ++  GL  YL+T F
Sbjct: 221 QYIPTMNAVFAASHASVPVDVVALRG-NASFLQQGSYITGGNFIHAKEPRGLLTYLMTGF 279

Query: 134 GTDLHS-RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL--KKCSTCGSV 190
                +  + L  P    VDFRA+CFCH+  +D G++CSVCLSI+C+     +C TCGS 
Sbjct: 280 PVGGGAVSDMLIGPGTESVDFRAACFCHRKALDTGFVCSVCLSIFCEAPPENECLTCGSA 339

Query: 191 FG 192
             
Sbjct: 340 LA 341


>gi|159490694|ref|XP_001703308.1| TFIIH 34kDa subunit [Chlamydomonas reinhardtii]
 gi|158280232|gb|EDP05990.1| TFIIH 34kDa subunit [Chlamydomonas reinhardtii]
          Length = 169

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 48/102 (47%), Positives = 71/102 (69%)

Query: 85  AQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQ 144
           AQR+ V +D+  +  Q+S+F+QQA+++TGG++ +P +  GL   LL  F  D  +R  L 
Sbjct: 39  AQRAQVVLDAVVVAPQDSSFMQQAAHLTGGLYFRPHRGRGLLGVLLNHFVCDTATRKQLD 98

Query: 145 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCST 186
             + +GVDFRASCFCHK  I+ GY+CSVCLSI+C+  + C+T
Sbjct: 99  EAQALGVDFRASCFCHKYVIETGYVCSVCLSIFCQPSRSCTT 140


>gi|281211236|gb|EFA85402.1| general transcription factor IIH [Polysphondylium pallidum PN500]
          Length = 276

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 77/124 (62%), Gaps = 3/124 (2%)

Query: 20  EQLGK--QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVA 77
           E L K  +E + +   S  S ++S+ALCYI R+ +      + RIL    SPD   QY+ 
Sbjct: 118 ENLDKRSEEQQQQDCTSSFSAAMSLALCYINRI-KKEFPSLRARILVFNLSPDVSTQYIP 176

Query: 78  IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
           +MN IFSAQ+  +P+DSC L   +S FLQQAS++T G++ KPQ+ + L QYLL+ F  D 
Sbjct: 177 VMNCIFSAQKQSIPVDSCVLSTTDSTFLQQASHLTNGIYLKPQRQENLSQYLLSTFLVDS 236

Query: 138 HSRN 141
            SRN
Sbjct: 237 FSRN 240


>gi|68489766|ref|XP_711270.1| hypothetical protein CaO19.11671 [Candida albicans SC5314]
 gi|68489813|ref|XP_711246.1| hypothetical protein CaO19.4194 [Candida albicans SC5314]
 gi|46432534|gb|EAK92011.1| hypothetical protein CaO19.4194 [Candida albicans SC5314]
 gi|46432559|gb|EAK92035.1| hypothetical protein CaO19.11671 [Candida albicans SC5314]
          Length = 363

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 57/155 (36%), Positives = 88/155 (56%), Gaps = 4/155 (2%)

Query: 59  QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
           + R+L +  + D   +Y+ +MN IF+AQ+  V ID   LG  NS++LQQAS  T GV+  
Sbjct: 200 KSRVLIVSANDDDDLRYIPLMNCIFAAQKMKVSIDVAKLGHNNSSYLQQASDATKGVYLH 259

Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
            +   G+ Q L T F  + + R ++ LP    V++RASCF    ++D+GY+CSVCL I  
Sbjct: 260 IEDPKGIIQVLSTAFFIEPNLRPYIILPTNSNVNYRASCFITGKSVDLGYVCSVCLCIMS 319

Query: 179 K--HLKKCSTCGSVFGQA--QTQSDEPSATNRKRK 209
           +     KC  C S F +   Q  + EP   ++K++
Sbjct: 320 QLPQSGKCPACDSEFDKKIIQDLNREPQVISKKKR 354


>gi|146416891|ref|XP_001484415.1| hypothetical protein PGUG_03796 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 378

 Score =  103 bits (257), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 70/217 (32%), Positives = 105/217 (48%), Gaps = 39/217 (17%)

Query: 34  SLLSGSLSMALCYIQRVFR---------------------------------SGLLHPQP 60
           S +SG+LS+AL Y  R+                                   S L+  + 
Sbjct: 155 STVSGALSLALTYTHRMLTLDESISTTTASAISTSTNVASNSSAAGAGSTGASNLISMKS 214

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RIL +    +   +Y+ IMN+IF+AQ+  V ID   LG+++ ++LQQA+  T GV+   Q
Sbjct: 215 RILVVSPDDNDDIRYIPIMNSIFAAQKMKVSIDVVKLGSKDVSYLQQAADATNGVYLHVQ 274

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY--C 178
             +GL Q L T +  +   R  + LP    V++RASCF    +ID+G++CSVCL I    
Sbjct: 275 NPEGLIQVLSTAYFIEPSLRPLVVLPTNSNVNYRASCFVTGKSIDLGFVCSVCLCIMSII 334

Query: 179 KHLKKCSTCGSVFGQ---AQTQSDEPSATNRKRKTTD 212
             L KC TC S F     +Q +   P+   RK++  D
Sbjct: 335 PDLGKCPTCQSEFDPNILSQLRR-SPAVLPRKKRKVD 370


>gi|448533948|ref|XP_003870733.1| TFIIH complex subunit [Candida orthopsilosis Co 90-125]
 gi|380355088|emb|CCG24605.1| TFIIH complex subunit [Candida orthopsilosis]
          Length = 408

 Score =  102 bits (255), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/162 (37%), Positives = 91/162 (56%), Gaps = 8/162 (4%)

Query: 59  QPRILCLQGSPDGPE--QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
           + RIL +  + DG +   Y+ +MN IF+AQ+  V ID   LG +NS++LQQAS  T GV+
Sbjct: 242 KSRILIVSAN-DGDDSINYIPLMNCIFTAQKLKVSIDVAKLGFKNSSYLQQASDATNGVY 300

Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 176
              +   G+ Q L T F  + + R ++ LP    V+++ASCF  K  +D+G++CSVCL I
Sbjct: 301 LHIEDPQGIIQVLTTAFFIEPNLRPYVILPTNTQVNYKASCFITKRPVDVGFVCSVCLCI 360

Query: 177 YCK--HLKKCSTCGSVFGQA---QTQSDEPSATNRKRKTTDA 213
             K    +KC  C S F +    +   D    T +KRK  D+
Sbjct: 361 MSKIPEDEKCPACSSEFDKRSIYELTKDPQVVTRKKRKLDDS 402


>gi|443923251|gb|ELU42522.1| Tfb4 domain-containing protein [Rhizoctonia solani AG-1 IA]
          Length = 872

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 57/136 (41%), Positives = 75/136 (55%), Gaps = 8/136 (5%)

Query: 58  PQPRILCLQGSPDG--PEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGV 115
           P P I+C+    D   P   + ++ A       +VPID C +   ++ FLQQASY+T G 
Sbjct: 719 PHPDIVCVARCFDTIYPIHELHLLGA------KIVPIDVCRVFGPDTIFLQQASYLTRGC 772

Query: 116 HHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS 175
           +        L Q L   F      RN ++LP    VDFRASCFCHK  +DMGY+CSVCLS
Sbjct: 773 YLNLDSGSSLLQSLTMCFLPSASLRNTIELPSQGKVDFRASCFCHKEVVDMGYVCSVCLS 832

Query: 176 IYCKHLKKCSTCGSVF 191
           I+CK +  C+TC   F
Sbjct: 833 IFCKPVVVCATCKIKF 848


>gi|328352540|emb|CCA38939.1| RNA polymerase II transcription factor B subunit 4 [Komagataella
           pastoris CBS 7435]
          Length = 307

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 62/150 (41%), Positives = 83/150 (55%), Gaps = 16/150 (10%)

Query: 25  QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ-------PRILCLQGSPDGPEQYVA 77
           QE +G IA     G++SMAL YI R+     + P+        ++L +  S D  +QYV 
Sbjct: 132 QELKGSIA-----GAMSMALSYINRLTN---IEPESTGNTLRAKMLVISISDDSTQQYVP 183

Query: 78  IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
            MN IFSAQ+  V ID C +G  +S+FLQQAS +T GV+   + + GL QYL T    D 
Sbjct: 184 FMNCIFSAQKMKVSIDVCKMGP-DSSFLQQASDVTNGVYMLIKNVHGLIQYLTTALFIDP 242

Query: 138 HSRNFLQLPKPVGVDFRASCFCHKNTIDMG 167
             R  + LP    +DFRASCF     ID+G
Sbjct: 243 SLRPIMVLPTNSDLDFRASCFVTNKVIDIG 272


>gi|150865476|ref|XP_001384710.2| hypothetical protein PICST_83789 [Scheffersomyces stipitis CBS
           6054]
 gi|149386732|gb|ABN66681.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 376

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 68/221 (30%), Positives = 104/221 (47%), Gaps = 42/221 (19%)

Query: 34  SLLSGSLSMALCYIQRVFR------------------------SG--------------L 55
           S LSG+LSMAL Y  R+                          SG              +
Sbjct: 150 STLSGALSMALTYTNRMLTLDQSISTTTASAINSTTSMGAGSGSGNTATNSSTSNPSNSI 209

Query: 56  LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGV 115
              + RIL +  + +   +Y+ +MN+IF+AQ+    ID   LG ++S++LQQA+  T G+
Sbjct: 210 TSMKSRILIVTANDEDDVKYIPVMNSIFAAQKMRTSIDIAKLGFEDSSYLQQAADATNGI 269

Query: 116 HHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS 175
           +       G+ Q L + F  +   R F+ LP    V++RASCF    ++D+G++CSVCL 
Sbjct: 270 YFHVHDPRGIVQTLTSAFFIEPSIRPFIILPTNSNVNYRASCFVTGKSVDIGFVCSVCLC 329

Query: 176 IYCK--HLKKCSTCGSVFGQAQTQ--SDEPSATNRKRKTTD 212
           I  K     KC  C SVF +      S  PS  ++K++  D
Sbjct: 330 IMSKIPPSGKCPACESVFDEKIIAQLSKGPSVLSKKKRKID 370


>gi|190347434|gb|EDK39698.2| hypothetical protein PGUG_03796 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 378

 Score =  100 bits (250), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 69/217 (31%), Positives = 104/217 (47%), Gaps = 39/217 (17%)

Query: 34  SLLSGSLSMALCYIQRVFR---------------------------------SGLLHPQP 60
           S +SG+LS+AL Y  R+                                   S L+  + 
Sbjct: 155 STVSGALSLALTYTHRMLTLDESISTTTASAISTSTNVASNSSAAGAGSTGASNLISMKS 214

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RIL +    +   +Y+ IMN+IF+AQ+  V ID   LG+++ ++LQQA+  T GV+   Q
Sbjct: 215 RILVVSPDDNDDIRYIPIMNSIFAAQKMKVSIDVVKLGSKDVSYLQQAADATNGVYLHVQ 274

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YC 178
             +GL Q L T +  +   R  + LP    V++RASCF    +ID+G++CSVCL I    
Sbjct: 275 NPEGLIQVLSTAYFIEPSLRPLVVLPTNSNVNYRASCFVTGKSIDLGFVCSVCLCIMSII 334

Query: 179 KHLKKCSTCGSVFGQ---AQTQSDEPSATNRKRKTTD 212
               KC TC S F     +Q +   P+   RK++  D
Sbjct: 335 PDSGKCPTCQSEFDPNILSQLRR-SPAVLPRKKRKVD 370


>gi|171689210|ref|XP_001909545.1| hypothetical protein [Podospora anserina S mat+]
 gi|170944567|emb|CAP70678.1| unnamed protein product [Podospora anserina S mat+]
          Length = 396

 Score =  100 bits (249), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 61/172 (35%), Positives = 87/172 (50%), Gaps = 36/172 (20%)

Query: 53  SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
           +GL+    RIL +  S     QY+  MNA+F+A  S + ID+  L   ++ FL+QAS+IT
Sbjct: 184 TGLVGFHGRILVISVSDSAASQYIPTMNAVFAASMSRIAIDTLALRG-SATFLEQASFIT 242

Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHS-----------------------RNFLQLPKPV 149
            G   +     G+ QYL+  FG    S                       R   +L KPV
Sbjct: 243 QGTFIQAADPQGILQYLMFGFGVGSASSGLSAAQNDGSGPLMGKSKTGKQREGDELRKPV 302

Query: 150 G----------VDFRASCFCHKNTIDMGYICSVCLSIYCKHL--KKCSTCGS 189
           G          VDFRA+CFCH+N +D G++CS+CLSI+C+     +C TCG+
Sbjct: 303 GECLFTPAADSVDFRAACFCHRNVVDTGFVCSICLSIFCEPPPGDECLTCGN 354


>gi|320594208|gb|EFX06611.1| transcription factor tfiih subunit [Grosmannia clavigera kw1407]
          Length = 429

 Score =  100 bits (249), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 60/139 (43%), Positives = 76/139 (54%), Gaps = 13/139 (9%)

Query: 52  RSGLLHP------QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL-GAQNSAF 104
           RSG   P        RIL +  S   P QY+A MNA+F+A  + V ID   L GA  + F
Sbjct: 226 RSGKTDPVDDVGLHARILVISVSDSSPAQYIATMNAVFAAAHAGVAIDVLALRGA--ATF 283

Query: 105 LQQASYITGG--VHHKPQQLDGLFQYLLTIF--GTDLHSRNFLQLPKPVGVDFRASCFCH 160
           LQQA+YIT G  V     Q  GL  YL+  F   +       L  P    VDFRA+CFCH
Sbjct: 284 LQQAAYITRGTFVRVGADQKHGLLAYLMLAFNAASSHDVAQHLVAPSADAVDFRAACFCH 343

Query: 161 KNTIDMGYICSVCLSIYCK 179
           +  +D G++CSVCLSI+C+
Sbjct: 344 RRVVDTGFVCSVCLSIFCE 362


>gi|444315902|ref|XP_004178608.1| hypothetical protein TBLA_0B02470 [Tetrapisispora blattae CBS 6284]
 gi|387511648|emb|CCH59089.1| hypothetical protein TBLA_0B02470 [Tetrapisispora blattae CBS 6284]
          Length = 340

 Score =  100 bits (249), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 75/201 (37%), Positives = 98/201 (48%), Gaps = 30/201 (14%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRV---------FRSGLLH 57
           +L Q   +F++K  Q            S +SG++S AL Y+ R+         F+S LL 
Sbjct: 133 SLFQEESKFIDKKSQK-----------STISGAMSCALTYVNRITKQFNDASIFKSRLL- 180

Query: 58  PQPRILCLQG-SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA-QNSAFLQQASYITGGV 115
               I C  G S D   QY+ IMN IFSA +   PID   +G  + S FLQQ +  T GV
Sbjct: 181 ---VITCGSGRSKDEVFQYIPIMNCIFSATKMKCPIDVVKIGGHEKSTFLQQTTDATNGV 237

Query: 116 HHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC-- 173
           +   +   GL QYL T    D   R+ +  P    VDFR SCF     + +GYICSVC  
Sbjct: 238 YLYVENTKGLIQYLSTAMFIDSSLRSIIVKPNHGVVDFRTSCFLTGRVVAVGYICSVCLC 297

Query: 174 -LSIYCKHLKKCSTCGSVFGQ 193
            LSI  K+  KC  C S F +
Sbjct: 298 VLSIVPKN-NKCPACDSKFDE 317


>gi|367035398|ref|XP_003666981.1| hypothetical protein MYCTH_2312237 [Myceliophthora thermophila ATCC
           42464]
 gi|347014254|gb|AEO61736.1| hypothetical protein MYCTH_2312237 [Myceliophthora thermophila ATCC
           42464]
          Length = 433

 Score = 99.8 bits (247), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 62/183 (33%), Positives = 84/183 (45%), Gaps = 46/183 (25%)

Query: 54  GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 113
           GL     RIL +  S   P QY+  MNA+F+A  + V ID+  L   ++ FL+QAS+IT 
Sbjct: 202 GLAGLHARILIISVSDSAPAQYIPTMNAVFAAAHARVAIDTLALRG-SATFLEQASFITR 260

Query: 114 GVHHKPQQLDGLFQYLLTIFGTD------------------------------------- 136
           G   +  +  GL QYL+  FG+                                      
Sbjct: 261 GTFVRAAEPRGLLQYLMFGFGSGSAPSHPSGGADPGKGPAAPKTTAASVPGGGPGGRRKT 320

Query: 137 ------LHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCG 188
                     + L  P    VDFRA+CFCH+N ID GY+CSVCLSI+C+     +C TCG
Sbjct: 321 GRLGLGASVADLLLTPSADAVDFRAACFCHRNVIDTGYVCSVCLSIFCEVPPGGECLTCG 380

Query: 189 SVF 191
           +V 
Sbjct: 381 TVL 383


>gi|255715715|ref|XP_002554139.1| KLTH0E15180p [Lachancea thermotolerans]
 gi|238935521|emb|CAR23702.1| KLTH0E15180p [Lachancea thermotolerans CBS 6340]
          Length = 337

 Score = 99.8 bits (247), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 69/206 (33%), Positives = 97/206 (47%), Gaps = 15/206 (7%)

Query: 12  LEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS-GLLHPQPRILCL----Q 66
           LEE     ++ GK+E       S LSG++S    YI R+ +       + R++ +     
Sbjct: 123 LEELYKLLQEEGKREQPSGFQKSTLSGAMSAGFTYINRIIKEQSSASLKARLMVVTCGSS 182

Query: 67  GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGL 125
           G  D   QY+ IMN IFSA +   P+D   +G +Q S FLQQA+  T GV+      +GL
Sbjct: 183 GGKDEVFQYIPIMNCIFSATKLKCPVDVVKIGGSQESTFLQQATDATNGVYLHVPSTEGL 242

Query: 126 FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKK 183
            QYL T    D   R  +  P    VDFR SC+     + +G++CSVCL +        +
Sbjct: 243 IQYLTTAMFIDPSLRPIIVKPNQGSVDFRTSCYLTGKVVAVGFVCSVCLCVLSVIPPGNR 302

Query: 184 CSTCGSVFGQAQTQSDEPSATNRKRK 209
           C  C S F       DE   T  K+K
Sbjct: 303 CPACDSEF-------DEHVTTKLKKK 321


>gi|149236047|ref|XP_001523901.1| hypothetical protein LELG_04714 [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146452277|gb|EDK46533.1| hypothetical protein LELG_04714 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 444

 Score = 99.4 bits (246), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 61/158 (38%), Positives = 85/158 (53%), Gaps = 6/158 (3%)

Query: 61  RILCLQGSP-DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 119
           RIL +  S  D    Y+ +MN IF+AQ+  V ID   LG  +S++LQQAS  T GV+ K 
Sbjct: 259 RILIVTASDNDNHLNYIPLMNCIFTAQKLKVSIDVAKLGTTDSSYLQQASDATHGVYLKI 318

Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--Y 177
           +   G+ Q L T F  + + R  + +P    VD++ASCF  K  +D+GY+CSVCL I  +
Sbjct: 319 EDPRGMIQVLSTAFFIEPNLRPHVIMPTDYQVDYKASCFLTKKPVDLGYVCSVCLCIMSF 378

Query: 178 CKHLKKCSTCGSVFGQAQTQS---DEPSATNRKRKTTD 212
                KC  C S F Q   Q    D   A  +K++  D
Sbjct: 379 VPDDLKCPACSSEFDQGIVQELTRDPQVAVVKKKRKLD 416


>gi|380484278|emb|CCF40099.1| RNA polymerase II transcription factor B subunit 4 [Colletotrichum
           higginsianum]
          Length = 157

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 49/113 (43%), Positives = 69/113 (61%), Gaps = 3/113 (2%)

Query: 79  MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
           MNA+F+A  S +PID+  L + ++ FLQQASYIT G   +     GL  YL+  +  D  
Sbjct: 1   MNAVFAAAHSQIPIDTLAL-SGDATFLQQASYITDGTFMQAASPRGLLSYLMFAYAADAE 59

Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
           +R+ L  P    V FRA+CFCH   +D G++CS+CLSI+C      +C TCG+
Sbjct: 60  ARSSLIPPTHHTVXFRAACFCHGRVVDTGFVCSICLSIFCDVPEGSECLTCGT 112


>gi|365762534|gb|EHN04068.1| Tfb4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 338

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 66/192 (34%), Positives = 96/192 (50%), Gaps = 8/192 (4%)

Query: 10  QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
           + L E + K  +L K++ E     S L+G++S  L Y+ R+ +  +    + R+L L   
Sbjct: 123 ETLVEEIYKLFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182

Query: 67  --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
              S D   QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+   +  +
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTE 242

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
           GL QYL T    D   R  +  P    VDFR SC+     + +G+ICSVCL +       
Sbjct: 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAIGFICSVCLCVLSIIPPG 302

Query: 182 KKCSTCGSVFGQ 193
            KC  C S F +
Sbjct: 303 NKCPACDSQFDE 314


>gi|151942841|gb|EDN61187.1| transcription initiation factor TFIIH subunit [Saccharomyces
           cerevisiae YJM789]
 gi|259150209|emb|CAY87012.1| Tfb4p [Saccharomyces cerevisiae EC1118]
 gi|349581866|dbj|GAA27023.1| K7_Tfb4p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 338

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 66/192 (34%), Positives = 96/192 (50%), Gaps = 8/192 (4%)

Query: 10  QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
           + L E + K  +L K++ E     S L+G++S  L Y+ R+ +  +    + R+L L   
Sbjct: 123 ETLVEEIYKLFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182

Query: 67  --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
              S D   QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+   +  +
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTE 242

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
           GL QYL T    D   R  +  P    VDFR SC+     + +G+ICSVCL +       
Sbjct: 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAIGFICSVCLCVLSIIPPG 302

Query: 182 KKCSTCGSVFGQ 193
            KC  C S F +
Sbjct: 303 NKCPACDSQFDE 314


>gi|428673522|ref|NP_001258795.1| general transcription factor IIH subunit 3 isoform b [Homo sapiens]
          Length = 265

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/170 (35%), Positives = 84/170 (49%), Gaps = 42/170 (24%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNA 81
           + K + +G+   +LL+GSL+ ALCYI R+ +                 D  E    I+  
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKE--------------VKDNQEMKSRIL-- 162

Query: 82  IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 141
                                     A  ITGG++ K  Q+  L QYLL +F  D   R+
Sbjct: 163 --------------------------ACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRS 196

Query: 142 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 197 QLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 246


>gi|6325313|ref|NP_015381.1| Tfb4p [Saccharomyces cerevisiae S288c]
 gi|62901135|sp|Q12004.1|TFB4_YEAST RecName: Full=RNA polymerase II transcription factor B subunit 4;
           AltName: Full=General transcription and DNA repair
           factor IIH subunit TFB4; Short=TFIIH subunit TFB4;
           AltName: Full=RNA polymerase II transcription factor B
           34 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p34 subunit
 gi|805037|emb|CAA89174.1| unknown [Saccharomyces cerevisiae]
 gi|1314125|emb|CAA95001.1| unknown [Saccharomyces cerevisiae]
 gi|190407994|gb|EDV11259.1| RNA polymerase II transcription factor B subunit 4 [Saccharomyces
           cerevisiae RM11-1a]
 gi|256268932|gb|EEU04279.1| Tfb4p [Saccharomyces cerevisiae JAY291]
 gi|285815586|tpg|DAA11478.1| TPA: Tfb4p [Saccharomyces cerevisiae S288c]
 gi|323350201|gb|EGA84348.1| Tfb4p [Saccharomyces cerevisiae VL3]
 gi|392296067|gb|EIW07170.1| Tfb4p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 338

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 66/192 (34%), Positives = 96/192 (50%), Gaps = 8/192 (4%)

Query: 10  QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
           + L E + K  +L K++ E     S L+G++S  L Y+ R+ +  +    + R+L L   
Sbjct: 123 ETLVEEIYKLFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182

Query: 67  --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
              S D   QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+   +  +
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTE 242

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
           GL QYL T    D   R  +  P    VDFR SC+     + +G+ICSVCL +       
Sbjct: 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAVGFICSVCLCVLSIIPPG 302

Query: 182 KKCSTCGSVFGQ 193
            KC  C S F +
Sbjct: 303 NKCPACDSQFDE 314


>gi|441630267|ref|XP_004089522.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
           [Nomascus leucogenys]
          Length = 265

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 63/185 (34%), Positives = 89/185 (48%), Gaps = 48/185 (25%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQ 66
            +++ +++ M K +  G+         +LL+GSL+ ALCYI R                 
Sbjct: 110 VIVEEIKDLMTKGDIKGQHTE------TLLAGSLAKALCYIHR----------------- 146

Query: 67  GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLF 126
                       MN      + M            S  L  A  ITGG++ K  Q+  L 
Sbjct: 147 ------------MNKEIKDNQEM-----------KSRIL--ACDITGGLYLKVPQMPSLL 181

Query: 127 QYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCST 186
           QYLL +F  D   R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C     C+T
Sbjct: 182 QYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTT 241

Query: 187 CGSVF 191
           C + F
Sbjct: 242 CETAF 246


>gi|323331310|gb|EGA72728.1| Tfb4p [Saccharomyces cerevisiae AWRI796]
          Length = 409

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 66/192 (34%), Positives = 96/192 (50%), Gaps = 8/192 (4%)

Query: 10  QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
           + L E + K  +L K++ E     S L+G++S  L Y+ R+ +  +    + R+L L   
Sbjct: 194 ETLVEEIYKLFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 253

Query: 67  --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
              S D   QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+   +  +
Sbjct: 254 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTE 313

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
           GL QYL T    D   R  +  P    VDFR SC+     + +G+ICSVCL +       
Sbjct: 314 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAVGFICSVCLCVLSIIPPG 373

Query: 182 KKCSTCGSVFGQ 193
            KC  C S F +
Sbjct: 374 NKCPACDSQFDE 385


>gi|323335143|gb|EGA76433.1| Tfb4p [Saccharomyces cerevisiae Vin13]
          Length = 338

 Score = 97.4 bits (241), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 66/192 (34%), Positives = 95/192 (49%), Gaps = 8/192 (4%)

Query: 10  QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
           + L E + K  +L K++ E     S L+G++S  L Y+ R+ +  +    + R+L L   
Sbjct: 123 ETLVEEIYKLFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182

Query: 67  --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
              S D   QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+   +  +
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTE 242

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
           GL QYL T    D   R  +  P    VDFR SC+     +  G+ICSVCL +       
Sbjct: 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAXGFICSVCLCVLSIIPPG 302

Query: 182 KKCSTCGSVFGQ 193
            KC  C S F +
Sbjct: 303 NKCPACDSQFDE 314


>gi|219113725|ref|XP_002186446.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209583296|gb|ACI65916.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 358

 Score = 96.7 bits (239), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 57/137 (41%), Positives = 77/137 (56%), Gaps = 9/137 (6%)

Query: 60  PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ----NSAFLQQASYITGGV 115
           PRI  +Q S D    Y A MN  F+A +  + +D C+L A+    +SAFL+QA  +TGGV
Sbjct: 206 PRIFLIQASDDRSRDYNAFMNCAFAAVKQSIVVDGCFLSAKGNHSSSAFLEQACDLTGGV 265

Query: 116 HHKPQ---QLDG-LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICS 171
              P    Q+ G L + L ++F  D+ SR  L LP    VDFRA CF     +D+ Y+C+
Sbjct: 266 FLSPSGTAQVGGALTEVLHSVFLADVPSRCLLNLPAINKVDFRARCFETAELVDLAYVCN 325

Query: 172 VCLSIYCKHLKK-CSTC 187
            CLSI+ K  K  C TC
Sbjct: 326 QCLSIFRKQPKNYCPTC 342


>gi|323346291|gb|EGA80581.1| Tfb4p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 313

 Score = 96.7 bits (239), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 66/190 (34%), Positives = 94/190 (49%), Gaps = 8/190 (4%)

Query: 10  QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
           + L E + K  +L K++ E     S L+G++S  L Y+ R+ +  +    + R+L L   
Sbjct: 123 ETLVEEIYKLFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182

Query: 67  --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
              S D   QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+   +  +
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTE 242

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
           GL QYL T    D   R  +  P    VDFR SC+     +  G+ICSVCL +       
Sbjct: 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAXGFICSVCLCVLSIIPPG 302

Query: 182 KKCSTCGSVF 191
            KC  C S F
Sbjct: 303 NKCPACDSQF 312


>gi|254580699|ref|XP_002496335.1| ZYRO0C16016p [Zygosaccharomyces rouxii]
 gi|238939226|emb|CAR27402.1| ZYRO0C16016p [Zygosaccharomyces rouxii]
          Length = 342

 Score = 96.3 bits (238), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 66/198 (33%), Positives = 95/198 (47%), Gaps = 18/198 (9%)

Query: 14  EFMNKDEQL----------GKQEPEGRIACSLLSGSLSMALCYIQRVFRS-GLLHPQPRI 62
           +F N DE L           K++ +     S L+G++S  L Y+ RV +    +  + R+
Sbjct: 134 QFRNVDETLVEELYGLFQQEKEQADEVTQKSTLAGAMSAGLTYVNRVSKELETISFKSRL 193

Query: 63  LCLQ----GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHH 117
           L +     G  D   QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+ 
Sbjct: 194 LVVTCGSGGGRDEIFQYIPIMNCIFSATKLKCPIDVVKIGGSKESTFLQQTTDATQGVYL 253

Query: 118 KPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
             +  +GL QYL T    D   R  +  P    VDFR SC+     + +GYICSVCL + 
Sbjct: 254 HAESTEGLIQYLATAMFIDPSLRQIIVKPNQGSVDFRTSCYLTGKVVAIGYICSVCLCVL 313

Query: 178 CKHL--KKCSTCGSVFGQ 193
                  +C  C S F +
Sbjct: 314 SVSPPGNRCPACDSEFDE 331


>gi|50303925|ref|XP_451910.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|62901298|sp|Q6CVX9.1|TFB4_KLULA RecName: Full=RNA polymerase II transcription factor B subunit 4;
           AltName: Full=RNA polymerase II transcription factor B
           34 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p34 subunit
 gi|49641042|emb|CAH02303.1| KLLA0B08569p [Kluyveromyces lactis]
          Length = 337

 Score = 96.3 bits (238), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 71/196 (36%), Positives = 98/196 (50%), Gaps = 19/196 (9%)

Query: 15  FMNKDE-------QLGKQEPEGRIAC----SLLSGSLSMALCYIQRVFRSGL-LHPQPRI 62
           F N DE       +L K+E EG+ +     S LSG++S AL Y+ R+ +       + R+
Sbjct: 115 FRNVDEIVVEQWYRLFKEELEGQESKVSMKSSLSGAMSSALTYVNRILKENENTSLRSRL 174

Query: 63  LCLQ-GSPDGPE---QYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQASYITGGVHH 117
           L +  G+  G +   QY+ IMN IFSA +    ID   +G    S FLQQA+  T GV+ 
Sbjct: 175 LVITCGTSQGKDEIFQYIPIMNCIFSATKMKCSIDVVKIGGGIESTFLQQATDATSGVYL 234

Query: 118 KPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI- 176
             +   GL QYL T    D   RN +  P    VDFR SCF     + +G++CSVCL + 
Sbjct: 235 HVENTRGLIQYLSTAMFIDPSLRNVIIKPNQGSVDFRTSCFLTGKVVAVGFVCSVCLCVL 294

Query: 177 -YCKHLKKCSTCGSVF 191
                 +KC  C S F
Sbjct: 295 SVIPPGQKCPACDSPF 310


>gi|367012173|ref|XP_003680587.1| hypothetical protein TDEL_0C04870 [Torulaspora delbrueckii]
 gi|359748246|emb|CCE91376.1| hypothetical protein TDEL_0C04870 [Torulaspora delbrueckii]
          Length = 325

 Score = 95.9 bits (237), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 94/198 (47%), Gaps = 18/198 (9%)

Query: 14  EFMNKDEQLGK------QEPEGRI----ACSLLSGSLSMALCYIQRVFRS-GLLHPQPRI 62
           +F N DE L +      QE + +I      S L+G +S  L YI R+ R    +  + R+
Sbjct: 117 QFRNVDETLVEELYKLFQEEKNQIDKVTQKSTLAGGISAGLTYINRISRELATIALKSRL 176

Query: 63  LCLQ----GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA-QNSAFLQQASYITGGVHH 117
           L +     G  D   QY+ IMN IFSA +   PID   +G  + S FLQQ +  T GV+ 
Sbjct: 177 LVITCGSGGGRDEIFQYIPIMNCIFSATKLKCPIDVVKIGGHRESTFLQQTTDATNGVYL 236

Query: 118 KPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
             +   GL QYL T    D   R  +  P    VDFR SC+     + +GYICSVCL + 
Sbjct: 237 HLESTQGLIQYLSTAMFIDPSLRPIIVKPNQGSVDFRTSCYLTGKVVAIGYICSVCLCVL 296

Query: 178 C--KHLKKCSTCGSVFGQ 193
                  KC  C S F +
Sbjct: 297 SILPPGNKCPACDSEFDE 314


>gi|365982181|ref|XP_003667924.1| hypothetical protein NDAI_0A05260 [Naumovozyma dairenensis CBS 421]
 gi|343766690|emb|CCD22681.1| hypothetical protein NDAI_0A05260 [Naumovozyma dairenensis CBS 421]
          Length = 339

 Score = 95.5 bits (236), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 69/197 (35%), Positives = 97/197 (49%), Gaps = 15/197 (7%)

Query: 7   TLLQNLEEFMNKD-EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS-GLLHPQPRILC 64
           TL++ L     K+ +Q+ K + +     S LSG++S  L YI R+ R    +  + R++ 
Sbjct: 122 TLVEELYTLFEKERDQIDKFKQK-----STLSGAMSAGLTYINRLSRELEAISLKSRLMV 176

Query: 65  LQ-----GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHK 118
           L       S D   QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+  
Sbjct: 177 LTCGGKGESKDEIFQYIPIMNCIFSANKIKCPIDVVKIGGSERSTFLQQTTDATNGVYLH 236

Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI-- 176
            +   GL QYL T    D   R  +  P    VDFR SC+     + +GYICSVCL +  
Sbjct: 237 VESSKGLIQYLSTAMFIDPSLRQIIVKPNHGSVDFRTSCYLTGKVVAIGYICSVCLCVLS 296

Query: 177 YCKHLKKCSTCGSVFGQ 193
                 KC  C S F +
Sbjct: 297 IIPPGNKCPACDSEFDE 313


>gi|45187550|ref|NP_983773.1| ADL323Cp [Ashbya gossypii ATCC 10895]
 gi|62901344|sp|Q75B93.1|TFB4_ASHGO RecName: Full=RNA polymerase II transcription factor B subunit 4;
           AltName: Full=RNA polymerase II transcription factor B
           34 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p34 subunit
 gi|44982288|gb|AAS51597.1| ADL323Cp [Ashbya gossypii ATCC 10895]
 gi|374106985|gb|AEY95893.1| FADL323Cp [Ashbya gossypii FDAG1]
          Length = 341

 Score = 95.5 bits (236), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 67/192 (34%), Positives = 94/192 (48%), Gaps = 10/192 (5%)

Query: 12  LEEFMNKDEQLGKQEPEG--RIACSLLSGSLSMALCYIQRV-FRSGLLHPQPRILCL--- 65
           +EE     +Q  KQ  +G  +   S LSG++S  L YI R+   +  +  + R+L +   
Sbjct: 124 VEELYRVFQQESKQLEDGVPQPFRSTLSGAMSAGLTYINRITHETEGVSLKSRLLVITCG 183

Query: 66  -QGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
              S D   QY+ IMN IFSA +   PID   +G  + S FLQQA+  T G +      D
Sbjct: 184 SSASKDEVFQYIPIMNCIFSATKMKCPIDVVKVGGVKESTFLQQATDATNGNYLHVANTD 243

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
           GL QYL T    D   R ++  P    VDFR SC+     + +G++CSVCL +       
Sbjct: 244 GLIQYLSTAMFIDPSLRQWVVKPNQSSVDFRTSCYLTGKVVAIGFVCSVCLCVLSIIPPG 303

Query: 182 KKCSTCGSVFGQ 193
            KC  C S F +
Sbjct: 304 NKCPACDSEFDE 315


>gi|260939906|ref|XP_002614253.1| hypothetical protein CLUG_05739 [Clavispora lusitaniae ATCC 42720]
 gi|238852147|gb|EEQ41611.1| hypothetical protein CLUG_05739 [Clavispora lusitaniae ATCC 42720]
          Length = 332

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 94/198 (47%), Gaps = 20/198 (10%)

Query: 32  ACSLLSGSLSMALCYIQRVF---RSG---------------LLHPQPRILCLQGSPDGPE 73
           A S L+G+LSMA+ Y  R+    +SG                     R+L +  +     
Sbjct: 126 ARSTLAGALSMAMTYTNRMLHVDQSGSAVQVAAAGTAGAQTTAAMGARVLVVSANESDDN 185

Query: 74  QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIF 133
            Y+ IMNAIF+AQ+  V ID   LG +++ +L+QA+  T GV+       G+ Q L T F
Sbjct: 186 NYMGIMNAIFAAQKMKVAIDVAKLGRRSAPYLEQAADATQGVYLHVADPRGMVQTLATAF 245

Query: 134 GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGSVF 191
             +   R+ + LP    VD++ASCF     +D+GY+CSVCL I         C  C S F
Sbjct: 246 FVEPSLRSVVILPTHGNVDYKASCFLTGRAVDVGYVCSVCLCIMSMLPDSGACPMCHSQF 305

Query: 192 GQAQTQSDEPSATNRKRK 209
            +          + RKR+
Sbjct: 306 DEHHIARLRRGPSVRKRR 323


>gi|401838123|gb|EJT41898.1| TFB4-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 338

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 65/192 (33%), Positives = 93/192 (48%), Gaps = 8/192 (4%)

Query: 10  QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
           + L E + K  ++ K + E     S L+G++S  L Y+ R+ +  +    + R+L L   
Sbjct: 123 EALVEEIYKLFEVEKNQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182

Query: 67  --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
              S D   QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+       
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVDSTQ 242

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
           GL QYL T    D   R  +  P    VDFR SC+     + +G+ICSVCL +       
Sbjct: 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAIGFICSVCLCVLSIIPPG 302

Query: 182 KKCSTCGSVFGQ 193
            KC  C S F +
Sbjct: 303 NKCPACDSEFDE 314


>gi|365757943|gb|EHM99813.1| Tfb4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 338

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 65/192 (33%), Positives = 93/192 (48%), Gaps = 8/192 (4%)

Query: 10  QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
           + L E + K  ++ K + E     S L+G++S  L Y+ R+ +  +    + R+L L   
Sbjct: 123 EALVEEIYKLFEVEKNQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182

Query: 67  --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
              S D   QY+ IMN IFSA +   PID   +G ++ S FLQQ +  T GV+       
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVDSTQ 242

Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
           GL QYL T    D   R  +  P    VDFR SC+     + +G+ICSVCL +       
Sbjct: 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAIGFICSVCLCVLSIIPPG 302

Query: 182 KKCSTCGSVFGQ 193
            KC  C S F +
Sbjct: 303 NKCPACDSEFDE 314


>gi|50286117|ref|XP_445487.1| hypothetical protein [Candida glabrata CBS 138]
 gi|62901307|sp|Q6FWA7.1|TFB4_CANGA RecName: Full=RNA polymerase II transcription factor B subunit 4;
           AltName: Full=RNA polymerase II transcription factor B
           34 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p34 subunit
 gi|49524792|emb|CAG58398.1| unnamed protein product [Candida glabrata]
          Length = 335

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 65/195 (33%), Positives = 95/195 (48%), Gaps = 12/195 (6%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRV---FRSGLLHPQPRIL 63
           TL++   +   ++E L  +  +     S LSG+++ AL Y  R+   F S  L  +  ++
Sbjct: 121 TLIEEFYKLYQREESLIDKPVQK----STLSGAMAAALTYTNRLTKEFESISLRSRLLVI 176

Query: 64  CLQGSPDGPE--QYVAIMNAIFSAQRSMVPIDSCYLGA-QNSAFLQQASYITGGVHHKPQ 120
               S +  E  QY+ IMN IFSA +   PID   +G  + S FLQQ +  T GV+   +
Sbjct: 177 TCGSSREKDEIFQYIPIMNCIFSATKLKCPIDVIKIGGNKQSTFLQQTTDATNGVYIHLE 236

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YC 178
             +G+ QYL T    D   R  +  P    VDFR SC+     + +GYICSVCL +    
Sbjct: 237 STNGIIQYLSTAMSIDPSLRQIIVRPTQGSVDFRTSCYLTGKVVAIGYICSVCLCVLSII 296

Query: 179 KHLKKCSTCGSVFGQ 193
               KC  C S F +
Sbjct: 297 PPGNKCPACDSQFDE 311


>gi|341038752|gb|EGS23744.1| putative transcription factor [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 429

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 60/186 (32%), Positives = 86/186 (46%), Gaps = 50/186 (26%)

Query: 53  SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
           +GL     RIL +  S     QY+  MNA+F+A  + + ID+  L   ++ FL+QAS+IT
Sbjct: 195 AGLAGLHARILIISVSDSSAAQYIPTMNAVFAAAHARIAIDTLALRG-SATFLEQASFIT 253

Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHSRN-----------FLQLPKPVG----------- 150
            G   +  +  GL QYL+  FG+     N            +  PKP G           
Sbjct: 254 RGTFIRAAEPRGLLQYLMFGFGSGSAPSNPAAAAAAAEKSTIGGPKPSGAAAAQPAAGTT 313

Query: 151 -------------------------VDFRASCFCHKNTIDMGYICSVCLSIYCKHLK--K 183
                                    VDFRA+CFCH+N +D G++CSVCLSI+C+     +
Sbjct: 314 KPGAAGKLPPGAAVTDLLMTPSAEAVDFRAACFCHRNVVDTGFVCSVCLSIFCEVPPGGE 373

Query: 184 CSTCGS 189
           C TCG+
Sbjct: 374 CMTCGT 379


>gi|363751126|ref|XP_003645780.1| hypothetical protein Ecym_3480 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356889414|gb|AET38963.1| Hypothetical protein Ecym_3480 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 341

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 64/184 (34%), Positives = 89/184 (48%), Gaps = 15/184 (8%)

Query: 34  SLLSGSLSMALCYIQRVFRSGL-LHPQPRILCLQ-GSPDGPE---QYVAIMNAIFSAQRS 88
           S LSG++S  L YI R+ +       + R+L +  GS  G +   QY+ IMN IFSA + 
Sbjct: 149 STLSGAMSAGLTYINRIVKENESTSLKSRLLVITCGSNAGKDEIFQYIPIMNCIFSATKM 208

Query: 89  MVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 147
              ID   +G  + S FLQQA+  T G +   +  +GL QYL T    D   R ++  P 
Sbjct: 209 RCSIDVVKIGGTEESTFLQQATDATNGNYLHVENTEGLIQYLSTAMFIDPSLRQWIVKPN 268

Query: 148 PVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQAQTQSDEPSATN 205
              VDFR SC+     + +G++CSVCL +        KC  C + F       DE     
Sbjct: 269 QSAVDFRTSCYLTGTVVAIGFVCSVCLCVLSIIPPENKCPACDAEF-------DEKVVAK 321

Query: 206 RKRK 209
            KRK
Sbjct: 322 LKRK 325


>gi|403216997|emb|CCK71492.1| hypothetical protein KNAG_0H00770 [Kazachstania naganishii CBS
           8797]
          Length = 342

 Score = 93.6 bits (231), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 71/205 (34%), Positives = 94/205 (45%), Gaps = 31/205 (15%)

Query: 15  FMNKDEQLGKQ----------EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP------ 58
           F N DE L ++          E E     S L G++  AL Y  R     LLH       
Sbjct: 117 FRNVDETLVEELYTLFKQEMGEVEKATQKSTLPGAMLAALAYTNR-----LLHDYETKSL 171

Query: 59  QPRILCLQ-GSPDGPEQ------YVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASY 110
           + RI+ +  G+ +G  Q      Y+ IMN IFSA +   PID   +G A+ S FLQQ + 
Sbjct: 172 KSRIMVMTCGNKNGTSQKEEIFQYIPIMNCIFSAAKLKCPIDVVKIGGARKSTFLQQTTD 231

Query: 111 ITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYIC 170
            T G++      DGL QYL T    D   R+ +  P    VDFR SCF     + +GYIC
Sbjct: 232 ATNGIYLHVDSTDGLVQYLSTAMFIDPSLRSIIVKPNQGSVDFRTSCFLTGKVVAVGYIC 291

Query: 171 SVCLSI--YCKHLKKCSTCGSVFGQ 193
           SVCL +        +C  C S F +
Sbjct: 292 SVCLCVLSVLPPGNRCPACESEFDE 316


>gi|366998281|ref|XP_003683877.1| hypothetical protein TPHA_0A03670 [Tetrapisispora phaffii CBS 4417]
 gi|357522172|emb|CCE61443.1| hypothetical protein TPHA_0A03670 [Tetrapisispora phaffii CBS 4417]
          Length = 344

 Score = 93.6 bits (231), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 66/216 (30%), Positives = 107/216 (49%), Gaps = 24/216 (11%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCL 65
           +L++ + +   K++    ++P+ +   S L+G++S  L YI R+ ++   +  + R++ +
Sbjct: 126 SLVEEIYKIFQKEKAEYLEKPKQK---STLAGAMSAGLTYINRIVKNEENYTLKSRLVVI 182

Query: 66  --------QGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVH 116
                       +   QY+ IMN IFSA +   PID   +G +  S FLQQA+  T G++
Sbjct: 183 TCGGNAYTDNKVEEVIQYIPIMNCIFSATKMKCPIDVVKIGGSSESTFLQQATDATSGIY 242

Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCL-- 174
              +   GL QYLLT    +   R+ +  P    +DFR SC+     + +G+ICS+CL  
Sbjct: 243 LYVKDYHGLIQYLLTAMFIEPSLRSTIVKPNSKSIDFRTSCYITGKVVAVGFICSICLCV 302

Query: 175 -SIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 209
            SI   +  KC  C SVF       DE      KRK
Sbjct: 303 MSILPPN-NKCPACDSVF-------DEKVIVKLKRK 330


>gi|195447228|ref|XP_002071121.1| GK25630 [Drosophila willistoni]
 gi|194167206|gb|EDW82107.1| GK25630 [Drosophila willistoni]
          Length = 310

 Score = 93.2 bits (230), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 58/174 (33%), Positives = 85/174 (48%), Gaps = 10/174 (5%)

Query: 12  LEEFMNKDEQ-LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS---GL-----LHPQPRI 62
           + E M  D Q +         + SLL+G +  ALCYIQR  R    GL     L  + R+
Sbjct: 134 IAELMRSDAQKVNYNSNAQNTSTSLLAGGIGKALCYIQRRRREITRGLAKGVQLRIEARV 193

Query: 63  LCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQ 121
           L +  +  G  Q +  M+    A+ S V +D C L  A+  +  +QA+ +TGG ++    
Sbjct: 194 LIVTATDPGSNQVINYMHMFNGAKSSRVALDVCILDRAETLSIWRQATALTGGFYYSTND 253

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS 175
              L   LL IF      R     P+   VD+R +C CHKN +D+G++CS CLS
Sbjct: 254 FTDLLTRLLGIFLMSPVERFQFNYPEQPQVDYRPNCVCHKNVLDIGFVCSACLS 307


>gi|354543147|emb|CCE39865.1| hypothetical protein CPAR2_602840 [Candida parapsilosis]
          Length = 381

 Score = 92.8 bits (229), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 60/159 (37%), Positives = 86/159 (54%), Gaps = 8/159 (5%)

Query: 61  RILCLQGSPDGPE--QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
           RIL +  + DG +   Y+ +MN IF+AQ+  V ID   LG +NS++LQQAS  T GV+  
Sbjct: 209 RILIVSAN-DGDDSINYIPLMNCIFTAQKLKVSIDVAKLGFKNSSYLQQASDATNGVYLH 267

Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
            +   G+ Q L T F  +   R ++ LP    V+++ASCF  K  +D+G++CSVCL I  
Sbjct: 268 IEDPRGITQVLATAFFIEPSLRPYIILPTNTQVNYKASCFITKKPVDVGFVCSVCLCIMS 327

Query: 179 K--HLKKCSTCGSVFGQAQTQ--SDEPS-ATNRKRKTTD 212
                 KC  C S F +      +  P   T +KRK  D
Sbjct: 328 SIPDDGKCPACSSDFDKRSINELTKIPQVVTKKKRKLED 366


>gi|366988553|ref|XP_003674043.1| hypothetical protein NCAS_0A11040 [Naumovozyma castellii CBS 4309]
 gi|342299906|emb|CCC67662.1| hypothetical protein NCAS_0A11040 [Naumovozyma castellii CBS 4309]
          Length = 338

 Score = 92.8 bits (229), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 9/169 (5%)

Query: 34  SLLSGSLSMALCYIQRVFRS-GLLHPQPRILCLQ-----GSPDGPEQYVAIMNAIFSAQR 87
           S LSG++S  L YI RV +    +  + R++ L       S D   QY+ IMN IFSA +
Sbjct: 144 STLSGAMSAGLTYINRVTKELEAISLKSRLMVLTCGGKGESKDEIFQYIPIMNCIFSATK 203

Query: 88  SMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 146
              PID   +G ++ S FLQQ +  T G +   +   GL QYL T    D   R  +  P
Sbjct: 204 IKCPIDVVKIGGSKESTFLQQTTDATNGAYLHVESTKGLIQYLATAMFIDPSLRQIIVKP 263

Query: 147 KPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQ 193
               VDFR SC+     + +G+ICSVCL +        KC  C S F +
Sbjct: 264 NHGSVDFRTSCYLTGKVVAVGFICSVCLCVLSIIPPGNKCPACDSEFDE 312


>gi|302759787|ref|XP_002963316.1| hypothetical protein SELMODRAFT_438480 [Selaginella moellendorffii]
 gi|300168584|gb|EFJ35187.1| hypothetical protein SELMODRAFT_438480 [Selaginella moellendorffii]
          Length = 916

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 46/83 (55%), Positives = 58/83 (69%), Gaps = 5/83 (6%)

Query: 8   LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 67
           ++  L +F+++D     Q P  +   SLLS SLSMALCYIQR  +  +   +PR+LCLQ 
Sbjct: 792 IVHKLNDFVSRD---ATQPP--KFQPSLLSASLSMALCYIQRALKEKIGPSKPRMLCLQA 846

Query: 68  SPDGPEQYVAIMNAIFSAQRSMV 90
           SPD P QY+AIMNAIFSAQRSMV
Sbjct: 847 SPDAPHQYIAIMNAIFSAQRSMV 869


>gi|344229675|gb|EGV61560.1| transcription factor Tfb4 [Candida tenuis ATCC 10573]
          Length = 381

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 64/226 (28%), Positives = 105/226 (46%), Gaps = 42/226 (18%)

Query: 7   TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR-------------- 52
           T+L  L + M+    +   + +     S LSG+LS+AL Y  R+ +              
Sbjct: 128 TVLSGLSQTMS---DITHAQDDKVTGSSRLSGALSLALSYTNRMMKLDQSISTTTASAIS 184

Query: 53  -----------------------SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
                                   G    + RIL +  + +   +Y+++M AIF AQ+  
Sbjct: 185 SAANISNKESSTAGASSGTNANSYGTSRMKSRILVVTPNDNEDIKYISLMKAIFGAQKMK 244

Query: 90  VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
           V ID   LG ++S++LQQA+  T GV+   ++  GL Q L T +  +   R  + LP   
Sbjct: 245 VAIDIAKLGRKDSSYLQQAADATNGVYLHIEKPLGLIQVLCTAYFIEPSIRPLMILPTNS 304

Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGSVFGQ 193
            V+++ASCF    ++++GY+CSVCL I  +   +  C TC S F +
Sbjct: 305 NVNYKASCFITGKSVEIGYVCSVCLCIMSEIPDRMSCPTCNSHFDE 350


>gi|410082137|ref|XP_003958647.1| hypothetical protein KAFR_0H01020 [Kazachstania africana CBS 2517]
 gi|372465236|emb|CCF59512.1| hypothetical protein KAFR_0H01020 [Kazachstania africana CBS 2517]
          Length = 328

 Score = 90.5 bits (223), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 64/186 (34%), Positives = 89/186 (47%), Gaps = 16/186 (8%)

Query: 34  SLLSGSLSMALCYIQRVFRS-GLLHPQPRILCL---QGSPDGPE--QYVAIMNAIFSAQR 87
           S L G++S  L Y+ R+ +    L  + R++ L    G  +  E  QY+ IMN IFSA +
Sbjct: 144 STLPGAISAGLTYVNRISKEMETLSLKSRLMVLTCGSGMANKEEIFQYIPIMNCIFSATK 203

Query: 88  SMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 146
              P+D   +G ++ S FLQQ +  T GV+   +   GL QYL T    D   R  +  P
Sbjct: 204 LKCPVDVVKIGGSKESTFLQQTTDATHGVYLHVETTKGLIQYLTTAMFIDPSLRQIIVKP 263

Query: 147 KPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQAQTQSDEPSAT 204
               VDFR SC+     + +G+ICSVCL +        KC  C S F       DE    
Sbjct: 264 NQGSVDFRTSCYLTGKVVAVGFICSVCLCVLSIIPPGNKCPACDSEF-------DERVIN 316

Query: 205 NRKRKT 210
             K+KT
Sbjct: 317 KLKKKT 322


>gi|336468231|gb|EGO56394.1| hypothetical protein NEUTE1DRAFT_147077 [Neurospora tetrasperma
           FGSC 2508]
 gi|350289522|gb|EGZ70747.1| transcription factor Tfb4 [Neurospora tetrasperma FGSC 2509]
          Length = 439

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 84/183 (45%), Gaps = 55/183 (30%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RIL +  S   P QY+  MNA+F+A  + + ID+  L   ++ FLQQASYIT G   + Q
Sbjct: 223 RILVVSVSDSSPSQYIPTMNAVFAAAHARIAIDTLALRG-SATFLQQASYITRGTFIRAQ 281

Query: 121 QLDGLFQYLLTIFGTDL---------HSR------------------------------- 140
           +  GL QYL+  FG+           H R                               
Sbjct: 282 EPRGLLQYLMFGFGSGSAPQGLAAAGHQRSQQQHGGNKGKEDKASEKAVGAAAKGGAKHG 341

Query: 141 ------------NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCST 186
                       + L  P    VDFRA+CFCH+N ID G++CS+CLSI+C+     +C T
Sbjct: 342 GQKFLGSNASVADLLVTPSADSVDFRAACFCHRNVIDTGFVCSICLSIFCEVPEGGECLT 401

Query: 187 CGS 189
           CG+
Sbjct: 402 CGT 404


>gi|85078711|ref|XP_956217.1| hypothetical protein NCU01580 [Neurospora crassa OR74A]
 gi|16416034|emb|CAB91384.2| related to basic transcription factor 2 (35 kd SU) [Neurospora
           crassa]
 gi|28917270|gb|EAA26981.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 439

 Score = 89.4 bits (220), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 59/183 (32%), Positives = 83/183 (45%), Gaps = 55/183 (30%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RIL +  S   P QY+  MNA+F+A  + + ID+  L   ++ FLQQASYIT G   + Q
Sbjct: 223 RILVVSVSDSSPSQYIPTMNAVFAAAHARIAIDTLALRG-SATFLQQASYITRGTFIRAQ 281

Query: 121 QLDGLFQYLLTIF----------------------------------------------- 133
           +  GL QYL+  F                                               
Sbjct: 282 EPRGLLQYLMFGFGSGSAPQGLAAAGNQRSQQQHGGHKGKEDKASEKALGAAAKGGAKHG 341

Query: 134 -----GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCST 186
                G++    + L  P    VDFRA+CFCH+N ID G++CS+CLSI+C+     +C T
Sbjct: 342 GQKFLGSNASVADLLVTPSADSVDFRAACFCHRNVIDTGFVCSICLSIFCEVPEGGECLT 401

Query: 187 CGS 189
           CG+
Sbjct: 402 CGT 404


>gi|341877621|gb|EGT33556.1| hypothetical protein CAEBREN_10115 [Caenorhabditis brenneri]
          Length = 311

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 49/139 (35%), Positives = 73/139 (52%), Gaps = 5/139 (3%)

Query: 73  EQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQASYITGGVHHKPQQLDGLFQYLLT 131
            ++ ++MN  FSA + ++ +D   LG       LQQA+ ITGG     Q+   L + L+T
Sbjct: 177 SEHGSLMNLFFSAAKQLICVDVVSLGDDFTGGILQQAADITGGSFMHAQKPQALLKILMT 236

Query: 132 IFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
              TD   R+         VD+RASC CH+  +  G++CSVCLS+ C++   C  C + F
Sbjct: 237 NMLTDPTYRSAFSKLSHNSVDYRASCVCHQTLVSSGWVCSVCLSVLCQYTPICKVCKAAF 296

Query: 192 GQAQTQSDEPSATNRKRKT 210
               T  + P   NRKRK+
Sbjct: 297 ----TIRNIPKKPNRKRKS 311


>gi|407044375|gb|EKE42557.1| hypothetical protein ENU1_017690 [Entamoeba nuttalli P19]
          Length = 263

 Score = 85.9 bits (211), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 101/195 (51%), Gaps = 20/195 (10%)

Query: 4   LCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPR 61
           L  ++ + ++E +N  ++    EP        LS ++S+AL  I +    G L P  Q +
Sbjct: 81  LTTSIAEAIKEVLN--DKFLVNEPN-------LSSAMSLALTTINK---DGKLQPDVQKK 128

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNS--AFLQQASYITGGVHHKP 119
           I+ +    +   +++ +MN +F+AQR    ID+  L    +   F QQASY+T G+++  
Sbjct: 129 IIVVSTPFESSPKFIPLMNCVFAAQRIKTTIDTIVLSVIETRCTFCQQASYLTNGIYNTA 188

Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLP--KPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
             L  LF  LL+   +D  +   +++   +   +D++  CF  K  +D GY+CS C  IY
Sbjct: 189 T-LQSLFPRLLSNNTSDCDTEKIIRVTGNQQPAIDWKLWCFHCKKPLDHGYVCSSCFGIY 247

Query: 178 CKHLK-KCSTCGSVF 191
           C+  K +C TCG+ F
Sbjct: 248 CEKQKGRCPTCGAYF 262


>gi|225684398|gb|EEH22682.1| RNA polymerase II transcription factor B subunit 4
           [Paracoccidioides brasiliensis Pb03]
          Length = 382

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 47/107 (43%), Positives = 64/107 (59%), Gaps = 2/107 (1%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ- 120
           I+ L  +    + Y+ IMN+IF+ QR  +PID   L A ++AFLQQAS  TGG++     
Sbjct: 233 IISLSNNTHSAQHYIPIMNSIFACQRLHIPIDIIKL-AGDAAFLQQASDATGGIYIPINA 291

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMG 167
              G  QYL+  F  D  +R  L LP  V VDFRA+CFCH+  +D+G
Sbjct: 292 HPAGFLQYLMLGFLPDQRARAHLILPSRVDVDFRAACFCHRKVVDVG 338


>gi|308501315|ref|XP_003112842.1| hypothetical protein CRE_25211 [Caenorhabditis remanei]
 gi|308265143|gb|EFP09096.1| hypothetical protein CRE_25211 [Caenorhabditis remanei]
          Length = 317

 Score = 85.1 bits (209), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 74/149 (49%), Gaps = 5/149 (3%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQASYITGGVHHKP 119
           R + +  SP    ++ ++MN  FSA +  + +D   +G       LQQA+ ITGG     
Sbjct: 169 RAVVISMSPIMGSEHGSLMNLFFSAAKQSICVDVVSMGDDFTGGILQQAADITGGSFLHA 228

Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 179
           Q+   L + L+T   TD   R          VD+RASC CH   +  G++CSVCLS+ C+
Sbjct: 229 QKPPALLKILMTNMLTDPSYRTAFSKLSHSSVDYRASCACHHKLVSSGWVCSVCLSVLCQ 288

Query: 180 HLKKCSTCGSVFGQAQTQSDEPSATNRKR 208
           +   C  C + F    T ++ P   NRKR
Sbjct: 289 YTPICKVCRAAF----TITNIPKKPNRKR 313


>gi|37958350|gb|AAP41125.1| ZK1128.4 [Pristionchus pacificus]
          Length = 278

 Score = 85.1 bits (209), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 57/182 (31%), Positives = 93/182 (51%), Gaps = 17/182 (9%)

Query: 33  CSLLSGSLSMALCYIQRVFRSGLLHPQP-------RILCLQGSPDGPEQYVAIMNAIFSA 85
           C +L+ +L+ A+C I++    G    +        RI+ +   PD   Q+  +MN  FSA
Sbjct: 105 CGVLAATLATAICQIRKAATLGTNKSEETISGQKGRIVIISLLPDFGSQHSLLMNLFFSA 164

Query: 86  QRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLT--IFGTDLHSRNFL 143
            +  + ID   LG  +SA LQQA+ IT GV     +++ L + L+T  +  +   S +F 
Sbjct: 165 HKHDICIDVVSLGG-SSALLQQAADITAGVFQVIHKIEDLPRILMTRCLPVSIRDSTSF- 222

Query: 144 QLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSA 203
                  VD+RA+C CH+  +  G+ CSVCL++ C  +  C +CG+   Q  + S E   
Sbjct: 223 -----TTVDYRAACVCHEKLVSNGWTCSVCLAVLCSFMAICPSCGACV-QDHSSSKEEKE 276

Query: 204 TN 205
           T+
Sbjct: 277 TD 278


>gi|67468778|ref|XP_650390.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56467011|gb|EAL45004.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
          Length = 263

 Score = 85.1 bits (209), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 101/195 (51%), Gaps = 20/195 (10%)

Query: 4   LCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPR 61
           L  ++ + ++E +N  ++    EP        LS ++S+AL  I +    G L P  Q +
Sbjct: 81  LTISIAEAIKEVLN--DKFLVNEPN-------LSSAMSLALTTINK---DGKLQPDIQKK 128

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNS--AFLQQASYITGGVHHKP 119
           I+ +    +   +++ +MN +F+AQR    ID+  L    +   F QQASY+T G+++  
Sbjct: 129 IIIVSTPFESSPKFIPLMNCVFAAQRIKTTIDTIVLSVIETRCTFCQQASYLTNGIYNTA 188

Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLP--KPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
             L  LF  LL+   +D  +   +++   +   +D++  CF  K  +D GY+CS C  IY
Sbjct: 189 T-LQSLFPRLLSNNTSDCDTEKIIRVTGNQQPAIDWKLWCFHCKKPLDHGYVCSSCFGIY 247

Query: 178 CKHLK-KCSTCGSVF 191
           C+  K +C TCG+ F
Sbjct: 248 CEKQKGRCPTCGAYF 262


>gi|393909857|gb|EFO22116.2| basic transcription factor 2 [Loa loa]
          Length = 274

 Score = 85.1 bits (209), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 52/163 (31%), Positives = 88/163 (53%), Gaps = 11/163 (6%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
           ++ + ++++A+C+I R +++ + +   RIL +    D   ++  +MN  F+A +  + ID
Sbjct: 101 AVFAPAVAIAICHIFR-YKNEVDNGDGRILIINIGSDLRGEHSILMNIFFAAHKHNILID 159

Query: 94  SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL--TIFGTDLHSRNFLQLPKP--- 148
              +G + S  LQQAS ITGG +   ++   L +Y +  T+    L S      P P   
Sbjct: 160 VANIG-EASTILQQASDITGGTYFSVKKPKQLLKYTMCFTLGKASLRS----AFPSPSSS 214

Query: 149 VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
             VD+RASC CH   + +G++CSVCLS+ C     C  C +VF
Sbjct: 215 TSVDYRASCHCHGAPVSIGWVCSVCLSVQCHFNPICPACNTVF 257


>gi|167391336|ref|XP_001739730.1| RNA polymerase II transcription factor B subunit [Entamoeba dispar
           SAW760]
 gi|165896489|gb|EDR23894.1| RNA polymerase II transcription factor B subunit, putative
           [Entamoeba dispar SAW760]
          Length = 249

 Score = 85.1 bits (209), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 57/195 (29%), Positives = 101/195 (51%), Gaps = 20/195 (10%)

Query: 4   LCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPR 61
           L   +++ ++E +N  ++    EP        LS ++S+AL  I +    G L P  Q +
Sbjct: 67  LTTPIVEAIKEVLN--DKFLVNEPN-------LSSAMSLALTTINK---DGKLQPDIQKK 114

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNS--AFLQQASYITGGVHHKP 119
           ++ +    +   +++ +MN +F+AQR    ID+  L    +   F QQASY+T G+++  
Sbjct: 115 VIVISTPFESSPKFIPLMNCVFAAQRIKTTIDTIVLSIIETGCTFCQQASYLTNGIYNTA 174

Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLP--KPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
             L  LF  LL+   +D  +   +++   +   +D++  CF  K  +D GY+CS C  IY
Sbjct: 175 P-LQSLFPRLLSNNTSDCDTEKIIRVTGNQQSAIDWKLWCFHCKKPLDHGYVCSSCFGIY 233

Query: 178 C-KHLKKCSTCGSVF 191
           C K  ++C TCG+ F
Sbjct: 234 CEKQKERCPTCGAYF 248


>gi|402595070|gb|EJW88996.1| hypothetical protein WUBG_00095 [Wuchereria bancrofti]
          Length = 274

 Score = 84.7 bits (208), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 53/163 (32%), Positives = 86/163 (52%), Gaps = 11/163 (6%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
           +L + +++ A+C+I R F++ +     RIL +    D   ++  +MN  F+A +  + ID
Sbjct: 101 ALFAPAIATAICHIFR-FKNEVDKGDGRILVINIGSDLLGEHNILMNIFFAAHKHSILID 159

Query: 94  SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL--TIFGTDLHSRNFLQLPKP--- 148
              +G + S  LQQAS ITGG +   ++   L +Y +  T+    L S      P P   
Sbjct: 160 VANIG-ETSPILQQASDITGGTYFNVKKPKQLLKYTMCFTLGRASLRS----AFPSPSSS 214

Query: 149 VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
             +D+RASC CH   + +G++CSVCLS+ C     C  C +VF
Sbjct: 215 TSIDYRASCHCHGAPVSVGWVCSVCLSVQCHFSPICPACNTVF 257


>gi|313239039|emb|CBY14022.1| unnamed protein product [Oikopleura dioica]
          Length = 291

 Score = 84.7 bits (208), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/177 (35%), Positives = 85/177 (48%), Gaps = 6/177 (3%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
           S  + +LS ALC I R  RS    P   ++   G      QY+ ++N  ++A+   V ID
Sbjct: 119 SRFNNALSKALCCINR-MRSEKDLPARLMVMKCGGDFLSHQYLPLINCAYAAEHLKVSID 177

Query: 94  SCYLGAQNSA-FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVD 152
           +  L A+     LQQ S +TGGV+ +  +   L Q LLT    D  +R  L       VD
Sbjct: 178 TVVLEAETDVPVLQQMSDVTGGVYSRISEPAILLQTLLTYHLPDDDARKMLSKVPATPVD 237

Query: 153 FRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 209
            RA CF  +   D+GY+CSVCLS+Y      C TC SV          P +  RKR+
Sbjct: 238 SRAVCFKTRAVTDIGYVCSVCLSVYKMLTPICVTCKSVLKLPGM----PPSRKRKRE 290


>gi|351713028|gb|EHB15947.1| General transcription factor IIH subunit 3 [Heterocephalus glaber]
          Length = 89

 Score = 83.6 bits (205), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 36/70 (51%), Positives = 49/70 (70%)

Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
           +  L QYLL +F +D   R  L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C   
Sbjct: 1   MASLLQYLLWVFLSDQDQRAQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFS 60

Query: 182 KKCSTCGSVF 191
             C+TC + F
Sbjct: 61  PICTTCETAF 70


>gi|170591206|ref|XP_001900361.1| basic transcription factor 2, 35 kD subunit [Brugia malayi]
 gi|158591973|gb|EDP30575.1| basic transcription factor 2, 35 kD subunit, putative [Brugia
           malayi]
          Length = 274

 Score = 83.6 bits (205), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 53/162 (32%), Positives = 85/162 (52%), Gaps = 11/162 (6%)

Query: 35  LLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 94
           L + +++ A+C+I R F++ +     RIL +    D   ++  +MN  F+A +  + ID 
Sbjct: 102 LFAPAIATAICHIFR-FKNEVDKGDGRILIINIGSDLFGEHNILMNIFFAAHKHNILIDV 160

Query: 95  CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL--TIFGTDLHSRNFLQLPKP---V 149
             +G + S  LQQAS ITGG +   ++   L +Y +  T+    L S      P P    
Sbjct: 161 ANIG-ETSPILQQASDITGGTYFNVKKPKQLLKYTMCFTLGRASLRS----AFPSPSSST 215

Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            +D+RASC CH   + +G++CSVCLS+ C     C  C +VF
Sbjct: 216 SIDYRASCHCHGAPVSVGWVCSVCLSVQCHFSPICPACNTVF 257


>gi|224006638|ref|XP_002292279.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971921|gb|EED90254.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 409

 Score = 83.2 bits (204), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 64/189 (33%), Positives = 94/189 (49%), Gaps = 16/189 (8%)

Query: 15  FMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQ 74
             N    L +++ EG IA  L+ GS S      +      LL   PR L +Q S D    
Sbjct: 208 LANSSSMLQRKDDEGVIA--LMGGSGSSNFKQQELQHSKSLL--SPRALIIQSSSDRTSD 263

Query: 75  YVAIMNAIFSAQRSMVPIDSCYL------GAQNSAFLQQASYITGGVH----HKPQQLDG 124
           Y A+MN  F+A +  + ID C++       A++S +L+Q    T GV+        Q+ G
Sbjct: 264 YNALMNCAFAANKQNIVIDGCFIPSGMKDDAKSSPYLEQIVDQTRGVYLSVPSGAAQVGG 323

Query: 125 -LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 183
            L + L+++F      R  + LPK   VDFRA CF    +ID+  IC+ CLSI+ +  K+
Sbjct: 324 ALSEVLISVFLPPPLLRKEMNLPKLTKVDFRARCFETGESIDVAQICNQCLSIFKERPKE 383

Query: 184 -CSTCGSVF 191
            C TCG+V 
Sbjct: 384 FCLTCGAVV 392


>gi|378726397|gb|EHY52856.1| transcription initiation factor TFIIH subunit H3 [Exophiala
           dermatitidis NIH/UT8656]
          Length = 455

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 59/183 (32%), Positives = 79/183 (43%), Gaps = 52/183 (28%)

Query: 59  QPRILCLQGSP--DGPEQYVAIMNAIFSAQRSMVPIDSCYL------------------- 97
           Q RIL +  SP  D   QY+ IMNAIF+ QR  +PID   L                   
Sbjct: 229 QSRILLVSVSPSTDLAHQYIPIMNAIFACQRLAIPIDILQLPLPLQTNNTTTQAQQTSTN 288

Query: 98  ---------------GAQNSAFLQQASYITGGVHHKPQ-------------QLDGLFQYL 129
                           + ++ FLQQA+  T G+    Q                    YL
Sbjct: 289 NQTPAGDNNTTTAGNTSNSTVFLQQAADATHGIFIAAQLPKSQSQSQASAASSQAFLTYL 348

Query: 130 LTIFGTDLHSRN-FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCST 186
           LT       +R   L LP  + VDFRA+CFCH+N + +G++CS+CLSI+C       C T
Sbjct: 349 LTSLLPSPSTRAAHLILPTRIDVDFRAACFCHRNVVSVGFVCSICLSIFCSVPENADCLT 408

Query: 187 CGS 189
           CG+
Sbjct: 409 CGT 411


>gi|268573188|ref|XP_002641571.1| Hypothetical protein CBG09872 [Caenorhabditis briggsae]
          Length = 299

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 74/149 (49%), Gaps = 5/149 (3%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQASYITGGVHHKP 119
           R + +  +P    ++ ++MN  FSA +  + +D   +        LQQA+ ITGG     
Sbjct: 151 RAVVISMTPIIGSEHGSLMNLFFSAAKQSICVDVVSMSDDLTGGILQQAADITGGSFVHA 210

Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 179
           ++   L + L+T   TD   RN         VD+RASC CH   +  G++CS+CLS+ C+
Sbjct: 211 EKPQALLKILMTNMLTDASYRNTFSKLSYCSVDYRASCACHHTLVSSGWVCSICLSVLCQ 270

Query: 180 HLKKCSTCGSVFGQAQTQSDEPSATNRKR 208
           +   C  C + F    T ++ P   +RKR
Sbjct: 271 YTPICKVCKAAF----TITNLPKKPSRKR 295


>gi|397617770|gb|EJK64598.1| hypothetical protein THAOC_14658 [Thalassiosira oceanica]
          Length = 263

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 49/155 (31%), Positives = 77/155 (49%), Gaps = 12/155 (7%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL------GAQNSAFLQQASYITGG 114
           R L +Q SPD    Y A+MN  F++ +S V +D C++       ++ S  L+Q   +TGG
Sbjct: 107 RALVVQASPDRTSDYNALMNCAFASSKSSVVVDGCFIPSGLRDDSRTSPVLRQVVDLTGG 166

Query: 115 VH----HKPQQLDGLFQYLLTI-FGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYI 169
           V+        Q+ G    +LT  F      R  + LP+   VDFRA CF     +D+ ++
Sbjct: 167 VYLAVPSGAAQVGGALGEVLTAAFLPPPRLRGLMNLPESDDVDFRARCFATGRAVDVAWV 226

Query: 170 CSVCLSIYCKHLK-KCSTCGSVFGQAQTQSDEPSA 203
           C+ CLSI+ +  +  C TCG+     + + D   A
Sbjct: 227 CNQCLSIFAERPEGACPTCGAKVLVGEGRGDSKRA 261


>gi|392896378|ref|NP_499249.2| Protein ZK1128.4 [Caenorhabditis elegans]
 gi|224490453|emb|CAA87423.2| Protein ZK1128.4 [Caenorhabditis elegans]
          Length = 296

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 74/149 (49%), Gaps = 5/149 (3%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQASYITGGVHHKP 119
           R + +  +P    ++ ++MN  FSA +  + +D   +G       LQQA+ ITGG     
Sbjct: 148 RAVVISITPIMGSEHGSLMNLFFSAAKQSICVDVVSMGDDFTGGILQQAADITGGSFLHA 207

Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 179
           ++   L + L+T   TD   R          VD+RASC CH   +  G++CS+CLS+ C+
Sbjct: 208 KKPQTLLKILMTNMLTDPTHRAVFSKLSHNSVDYRASCACHHQLVSSGWVCSICLSVLCQ 267

Query: 180 HLKKCSTCGSVFGQAQTQSDEPSATNRKR 208
           +   C  C + F    T ++ P   NRKR
Sbjct: 268 YTPICKVCKAAF----TIANLPIKPNRKR 292


>gi|169607795|ref|XP_001797317.1| hypothetical protein SNOG_06960 [Phaeosphaeria nodorum SN15]
 gi|160701493|gb|EAT85611.2| hypothetical protein SNOG_06960 [Phaeosphaeria nodorum SN15]
          Length = 312

 Score = 80.5 bits (197), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 50/143 (34%), Positives = 73/143 (51%), Gaps = 32/143 (22%)

Query: 33  CSLLSGSLSMALCYIQRVFRSGLLHP----------------------------QPRILC 64
            +L+ G++SMAL YI +   S +LH                               RIL 
Sbjct: 123 TTLIGGAISMALSYINK---STILHAPTGASAEITSVAAMADTDNSTHLDRIALTSRILI 179

Query: 65  LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
           +  S D   QY+ +MN+IF+AQR  +PID   L A ++  LQQAS  TGGV+ KP++ +G
Sbjct: 180 VSVSGDLANQYIPVMNSIFAAQRKKIPIDILKL-AGDTVLLQQASDATGGVYMKPERPEG 238

Query: 125 LFQYLLTIFGTDLHSRNFLQLPK 147
           L QYL+  +  D  +R  L +P+
Sbjct: 239 LLQYLMMAYLPDATARKSLIIPR 261


>gi|406695514|gb|EKC98819.1| hypothetical protein A1Q2_06922 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 297

 Score = 80.5 bits (197), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 76/157 (48%), Gaps = 31/157 (19%)

Query: 61  RILCLQGSPDGPEQYVA--------------------IMNAIFSAQRSMVPIDSCYLGAQ 100
           RIL L  +P G E   A                    +MN +F+ Q++ V ID   L  +
Sbjct: 87  RILILNATPGGAEDAEATDSKTAKASGGGGGQRGYVGLMNCVFAGQKAKVAIDVLTLPPE 146

Query: 101 NSA-----FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF-LQLPKPVGVDFR 154
           ++      FLQQA+Y+T G++ +     GL QYL +++      RN    +P    VDFR
Sbjct: 147 STMTAPPIFLQQAAYLTEGIYWRWNGRGGLLQYLHSMYLPPPSLRNKPFTVPPQDAVDFR 206

Query: 155 ASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           A CFCH + +D+G++C     I+C+    CS C + F
Sbjct: 207 AVCFCHHDVVDVGFVC-----IFCEVKPICSMCKTKF 238


>gi|298711517|emb|CBJ26605.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 223

 Score = 80.1 bits (196), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 43/78 (55%), Positives = 53/78 (67%), Gaps = 2/78 (2%)

Query: 36  LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 95
           L+GSLS  LCYI R  R      Q R L L GSPD PE Y ++MNAIFSAQ++ V +D  
Sbjct: 137 LAGSLSKGLCYINRKMRESS-SIQARALVLSGSPDVPETYNSVMNAIFSAQKAGVLVDCA 195

Query: 96  YLGAQNSAFLQQASYITG 113
            LG ++S FLQQA+Y+TG
Sbjct: 196 VLG-ESSTFLQQAAYLTG 212


>gi|324508596|gb|ADY43627.1| General transcription factor IIH subunit 3 [Ascaris suum]
          Length = 272

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 51/153 (33%), Positives = 73/153 (47%), Gaps = 2/153 (1%)

Query: 39  SLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG 98
           + + A C+I R F+        RIL +    D   +  A+MN  FSA +  + +D   +G
Sbjct: 106 AFATAFCHINR-FKKENDGADGRILIINIGSDLAREQNALMNVFFSAHKQGIVVDVANIG 164

Query: 99  AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCF 158
            + S  LQQA  IT G +   ++   L QYL+        SR+         VD+RASC 
Sbjct: 165 -EPSPVLQQACDITMGAYINVEKPKRLLQYLMFFALGGPDSRSKFTSSVASSVDYRASCH 223

Query: 159 CHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           CH     +  +CSVCLS+ CK    C  C +VF
Sbjct: 224 CHGTPASIALVCSVCLSVQCKFNPICPICNTVF 256


>gi|357629722|gb|EHJ78327.1| putative transcription factor TFIIH-subunit [Danaus plexippus]
          Length = 282

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/116 (41%), Positives = 68/116 (58%), Gaps = 4/116 (3%)

Query: 22  LGKQEP-EGRIACSLLSGSLSMALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAI 78
           L K +P E R   SLL+G+++M LC+I R+ R     L    R+L + GS D   QY+  
Sbjct: 103 LIKSQPQEERPGESLLAGAMAMGLCFIARMRREQPPGLRMVSRMLVVTGSSDTAAQYINY 162

Query: 79  MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFG 134
           MN  F+AQ+  V +D C L  ++ + LQQ   ITGG++ K   L+GL QYLL I+ 
Sbjct: 163 MNVFFTAQKQQVLLDVCSLD-KHLSLLQQGCDITGGLYLKVPSLEGLLQYLLVIYA 217


>gi|312385025|gb|EFR29618.1| hypothetical protein AND_01266 [Anopheles darlingi]
          Length = 516

 Score = 77.0 bits (188), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 46/102 (45%), Positives = 60/102 (58%), Gaps = 9/102 (8%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHP-----QPRILCLQGSPDGPEQYVAIMNAIFSAQRS 88
           SLL+GS+SMALCYI R+ R+    P       R+L + GS +   QY+  MN  F+AQ+ 
Sbjct: 391 SLLAGSMSMALCYIARINRN---KPPGSKINSRVLVVTGSNECASQYMTYMNVFFTAQKQ 447

Query: 89  MVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL 130
            V +D C L    S  LQQ   ITGG + + +QLDG  QYLL
Sbjct: 448 GVVVDVCALDKALS-LLQQGCDITGGQYLRLEQLDGFLQYLL 488


>gi|401400847|ref|XP_003880871.1| hypothetical protein NCLIV_039130 [Neospora caninum Liverpool]
 gi|325115283|emb|CBZ50838.1| hypothetical protein NCLIV_039130 [Neospora caninum Liverpool]
          Length = 580

 Score = 76.3 bits (186), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 63/193 (32%), Positives = 85/193 (44%), Gaps = 28/193 (14%)

Query: 29  GRIAC---SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG--PEQYVAIMNAIF 83
           GR  C   S+L+G+LS+ALCY+ +V +     P+ R+L L GS D     QY+ +MN  F
Sbjct: 372 GRSLCGEDSMLAGALSLALCYLNKVAKRSARPPERRVLLLDGSLDRSYSSQYMPLMNLAF 431

Query: 84  SAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK-----PQQLDG----------LFQY 128
           +A +  + ID C L    S   +Q   I+ GVH K     P    G          L   
Sbjct: 432 AAAKGNIVIDCCALCPNPSTISEQLCDISRGVHLKFSQAAPASTSGNSHLDGGLALLQLL 491

Query: 129 LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--------KH 180
           L  IF +         L    G    A CFCH   +++  ICS CL+IYC        K 
Sbjct: 492 LFWIFPSVSLRPAIAALSVHRGRSNAAVCFCHHRPVEVCCICSCCLAIYCSEKDAQTGKE 551

Query: 181 LKKCSTCGSVFGQ 193
              C  C S F +
Sbjct: 552 RISCDVCKSRFSR 564


>gi|255545470|ref|XP_002513795.1| hypothetical protein RCOM_1031970 [Ricinus communis]
 gi|223546881|gb|EEF48378.1| hypothetical protein RCOM_1031970 [Ricinus communis]
          Length = 159

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 54/156 (34%), Positives = 70/156 (44%), Gaps = 51/156 (32%)

Query: 1   MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP 60
           +P+L + LLQ LE+F+ +DE+LGK E         L  +L +       ++   LL P  
Sbjct: 50  IPTLYSELLQKLEDFIIRDEKLGKGE-------EFLKETLPLHCSQDPCLWHFALLIPDT 102

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
            +   +G                      VPIDSCY+G  NSAFLQQ             
Sbjct: 103 MVRKNEGK---------------------VPIDSCYVGGHNSAFLQQ------------- 128

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRAS 156
                     T+  TDLHSRN LQLP+P  VDFRAS
Sbjct: 129 ----------TVLATDLHSRNCLQLPRPADVDFRAS 154


>gi|299751088|ref|XP_001830046.2| basic transcription factor 2.35KD subunit [Coprinopsis cinerea
           okayama7#130]
 gi|298409214|gb|EAU91711.2| basic transcription factor 2.35KD subunit [Coprinopsis cinerea
           okayama7#130]
          Length = 280

 Score = 74.7 bits (182), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/113 (41%), Positives = 66/113 (58%), Gaps = 4/113 (3%)

Query: 20  EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR-SGLLHPQ--PRILCLQGSPDGPEQYV 76
           + L     E  + C+L+ G+L+ ALC    +   S  + PQ  PRIL L  SPD    Y+
Sbjct: 105 DALDDSHDEEEVPCALV-GALTKALCSTTPLPNGSQGVPPQLDPRILILSVSPDLSTSYI 163

Query: 77  AIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYL 129
            IMNAIFSAQ+  V ID+C +    + FLQQA+++TGG +   ++ D L QYL
Sbjct: 164 PIMNAIFSAQKLKVTIDACQIYGGETVFLQQAAHLTGGSYLLLERRDALLQYL 216


>gi|298711518|emb|CBJ26606.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 117

 Score = 73.6 bits (179), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 34/79 (43%), Positives = 51/79 (64%), Gaps = 1/79 (1%)

Query: 111 ITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYIC 170
           + GG++ + Q + GL QY+   F      R +L+L     VDFRA+CFCHK  +D+ Y+C
Sbjct: 15  LPGGLYLEHQGV-GLAQYMFATFLPSQSCRKYLKLRTQQSVDFRAACFCHKRVVDIAYVC 73

Query: 171 SVCLSIYCKHLKKCSTCGS 189
           SVCLS++C+    C+TCG+
Sbjct: 74  SVCLSVFCEFSPVCTTCGT 92


>gi|149579196|ref|XP_001517604.1| PREDICTED: general transcription factor IIH subunit 3-like, partial
           [Ornithorhynchus anatinus]
          Length = 80

 Score = 73.6 bits (179), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 31/60 (51%), Positives = 43/60 (71%)

Query: 132 IFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           +F  D   R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C     C+TC + F
Sbjct: 2   VFLPDQDQRSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 61


>gi|221502054|gb|EEE27800.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 404

 Score = 72.8 bits (177), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 84/196 (42%), Gaps = 29/196 (14%)

Query: 27  PEGRIAC---SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP--EQYVAIMNA 81
           P    AC   S+L+G+LS+ALC + +V +     P+ R+L L GS D     QY+ +MN 
Sbjct: 193 PRPTTACGEDSMLAGALSLALCCLNKVSKRSARTPERRVLILDGSLDRSYSSQYMPLMNL 252

Query: 82  IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ----------------LDGL 125
            F+A +  + ID C L    S   +Q   I+ GVH K  Q                L  L
Sbjct: 253 AFAAAKGNIVIDCCALSTNPSTIPEQLCDISRGVHLKFAQAAPSASTSGSASLDGGLALL 312

Query: 126 FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC------- 178
              L  IF +         L    G    A CFCH   +++  ICS CL+IYC       
Sbjct: 313 QLLLFWIFPSMSLRPAIAALSVHRGRSNTAVCFCHHKPVEVCCICSCCLAIYCSEKDAQT 372

Query: 179 -KHLKKCSTCGSVFGQ 193
            K    C  C S F +
Sbjct: 373 GKERISCDVCKSRFSR 388


>gi|124513782|ref|XP_001350247.1| conserved Plasmodium protein, unknown function [Plasmodium
           falciparum 3D7]
 gi|23615664|emb|CAD52656.1| conserved Plasmodium protein, unknown function [Plasmodium
           falciparum 3D7]
          Length = 326

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 60/224 (26%), Positives = 103/224 (45%), Gaps = 35/224 (15%)

Query: 2   PSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPR 61
              C+     L +F+N++++  K E       S LS +L++ALCY  R+ R+   +   R
Sbjct: 108 KDFCSDTYDKLVDFINENKKEKKSE-------SALSSALTIALCYNNRI-RNLYENINTR 159

Query: 62  ILCLQGSPDG--PEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 119
           +  L  S       QY  +MN  ++A+R+ + ID   L  + +  L+Q   IT G++   
Sbjct: 160 LFLLDISKSHYYTNQYTQLMNIAYNAKRNNIIIDVFSLNYK-TQILEQICNITNGLYLDN 218

Query: 120 QQL----------DGLFQYLLTIFGTDLHSRNFLQ---LPKPVGVDFRASCFCHKNTIDM 166
                        D L Q ++  F   +++R +     L +   +   A C CH   ID+
Sbjct: 219 TIFQSINPYGNIQDILTQTIIFWFLPSVNTRKYFSNTYLNEDTNI---AVCTCHNKQIDI 275

Query: 167 GYICSVCLSIYC--KHLK------KCSTCGSVFGQAQTQSDEPS 202
            YICS CL+IYC  K+L+       C+ C + F ++  ++   S
Sbjct: 276 AYICSCCLAIYCSEKNLQTNKDRLSCAICKTRFTKSLLRNKHVS 319


>gi|237838949|ref|XP_002368772.1| hypothetical protein TGME49_066930 [Toxoplasma gondii ME49]
 gi|211966436|gb|EEB01632.1| hypothetical protein TGME49_066930 [Toxoplasma gondii ME49]
          Length = 547

 Score = 72.0 bits (175), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 61/196 (31%), Positives = 83/196 (42%), Gaps = 29/196 (14%)

Query: 27  PEGRIAC---SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP--EQYVAIMNA 81
           P    AC   S+L+G+LS+ALC + +V +     P+ R+L L GS D     QY+ +MN 
Sbjct: 336 PRPTTACGEDSMLAGALSLALCCLNKVSKRSARTPERRVLILDGSLDRSYSSQYMPLMNL 395

Query: 82  IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL----------------DGL 125
            F+A +  + ID C L    S   +Q   I+ GVH K  Q                   L
Sbjct: 396 AFAAAKGNIVIDCCALSTNPSTIPEQLCDISRGVHLKFAQAAPSASTSGSASLDGGLALL 455

Query: 126 FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC------- 178
              L  IF +         L    G    A CFCH   +++  ICS CL+IYC       
Sbjct: 456 QLLLFWIFPSMSLRPAIAALSVHRGRSNTAVCFCHHKPVEVCCICSCCLAIYCSEKDAQT 515

Query: 179 -KHLKKCSTCGSVFGQ 193
            K    C  C S F +
Sbjct: 516 GKERISCDVCKSRFSR 531


>gi|156373212|ref|XP_001629427.1| predicted protein [Nematostella vectensis]
 gi|156216427|gb|EDO37364.1| predicted protein [Nematostella vectensis]
          Length = 190

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 35/72 (48%), Positives = 50/72 (69%), Gaps = 1/72 (1%)

Query: 59  QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
           QPRIL ++ SPD    Y+ +MN+IF+AQ+S   ID+C L  ++S +LQQA+ ITGGV+ K
Sbjct: 120 QPRILIIKASPDDASHYMPVMNSIFAAQKSNTCIDACVL-YEHSGYLQQATDITGGVYLK 178

Query: 119 PQQLDGLFQYLL 130
             +   L QYL+
Sbjct: 179 IPETQALLQYLM 190


>gi|221481332|gb|EEE19726.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 549

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 61/196 (31%), Positives = 83/196 (42%), Gaps = 29/196 (14%)

Query: 27  PEGRIAC---SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP--EQYVAIMNA 81
           P    AC   S+L+G+LS+ALC + +V +     P+ R+L L GS D     QY+ +MN 
Sbjct: 338 PRPTTACGEDSMLAGALSLALCCLNKVSKRSARTPERRVLILDGSLDRSYSSQYMPLMNL 397

Query: 82  IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL----------------DGL 125
            F+A +  + ID C L    S   +Q   I+ GVH K  Q                   L
Sbjct: 398 AFAAAKGNIVIDCCALSTNPSTIPEQLCDISRGVHLKFAQAAPSASTSGSASLDGGLALL 457

Query: 126 FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC------- 178
              L  IF +         L    G    A CFCH   +++  ICS CL+IYC       
Sbjct: 458 QLLLFWIFPSMSLRPAIAALSVHRGRSNTAVCFCHHKPVEVCCICSCCLAIYCSEKDAQT 517

Query: 179 -KHLKKCSTCGSVFGQ 193
            K    C  C S F +
Sbjct: 518 GKERISCDVCKSRFSR 533


>gi|116204609|ref|XP_001228115.1| hypothetical protein CHGG_10188 [Chaetomium globosum CBS 148.51]
 gi|88176316|gb|EAQ83784.1| hypothetical protein CHGG_10188 [Chaetomium globosum CBS 148.51]
          Length = 408

 Score = 71.2 bits (173), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 56/185 (30%), Positives = 74/185 (40%), Gaps = 67/185 (36%)

Query: 54  GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 113
           GL     RIL L  S   P QY+  MNA+F+A R+ + ID+  L A              
Sbjct: 199 GLAGLHARILILSVSDSAPAQYIPTMNAVFAAARARIAIDTLSLRAA------------- 245

Query: 114 GVHHKPQQLDGLFQYLLTIFGTDLHSRN-------------------------------- 141
               +P+   GL QYL+  FG+     N                                
Sbjct: 246 ----EPR---GLLQYLMFGFGSGTAPSNPSGGADSGKGPASSKTATAGGQGTTAAGGRHP 298

Query: 142 -------------FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCST 186
                         L  P    VDFRA+CFCH+N ID G++CSVCLSI+C+     +C T
Sbjct: 299 RTAKLGLGASVAELLVTPSADAVDFRAACFCHRNVIDTGFVCSVCLSIFCEVPDGGECLT 358

Query: 187 CGSVF 191
           CG+  
Sbjct: 359 CGTAL 363


>gi|123976711|ref|XP_001314576.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121897129|gb|EAY02259.1| hypothetical protein TVAG_030750 [Trichomonas vaginalis G3]
          Length = 284

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 76/160 (47%), Gaps = 8/160 (5%)

Query: 36  LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 95
           L  +LS +LC +  +    +   + RIL    SP+   QY    N +F+A+R  V ID  
Sbjct: 116 LDVALSKSLCLMNNLPNKNI---KKRILAFSVSPETGSQYD---NLLFAAKRLDVVIDVL 169

Query: 96  YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 155
           +L +    FL QA+ +T G     +    L QYL ++  T +  R+ + LP P  +++ A
Sbjct: 170 FLNSNRLCFLSQAADLTHGFSKTIKMGAALAQYLFSLPPTTV--RDLMALPAPEPLEYSA 227

Query: 156 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQ 195
                   I  GYIC VCLSI  +  + C  C + +G   
Sbjct: 228 PSADTHKMISRGYICPVCLSILEEACRTCPICNTRYGSVN 267


>gi|224146905|ref|XP_002336364.1| predicted protein [Populus trichocarpa]
 gi|222834814|gb|EEE73263.1| predicted protein [Populus trichocarpa]
          Length = 80

 Score = 70.5 bits (171), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 35/66 (53%), Positives = 42/66 (63%)

Query: 79  MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
           MNA FSA RS+VPIDSC +GA N AF QQASYITG V+ KPQQ+  +       F  ++ 
Sbjct: 1   MNATFSAPRSLVPIDSCRVGAHNPAFSQQASYITGSVYVKPQQVSLVNHRPFIFFAPEVA 60

Query: 139 SRNFLQ 144
             N L 
Sbjct: 61  VENLLH 66


>gi|224116574|ref|XP_002331930.1| predicted protein [Populus trichocarpa]
 gi|222874602|gb|EEF11733.1| predicted protein [Populus trichocarpa]
          Length = 79

 Score = 70.5 bits (171), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 35/66 (53%), Positives = 42/66 (63%)

Query: 79  MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
           MNA FSA RS+VPIDSC +GA N AF QQASYITG V+ KPQQ+  +       F  ++ 
Sbjct: 1   MNATFSAPRSLVPIDSCRVGAHNPAFSQQASYITGSVYVKPQQVSLVNHRPFIFFAPEVA 60

Query: 139 SRNFLQ 144
             N L 
Sbjct: 61  VENLLH 66


>gi|399218636|emb|CCF75523.1| unnamed protein product [Babesia microti strain RI]
          Length = 278

 Score = 69.7 bits (169), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 41/115 (35%), Positives = 56/115 (48%), Gaps = 11/115 (9%)

Query: 90  VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG---------LFQYLLTIFGTDLHSR 140
           +P+ +    A N   L+Q S IT G   +   + G         LFQ LL  F     S+
Sbjct: 149 IPLLNIGFSALNMKLLEQLSDITKGTFLRLMDMYGNDSINIEYALFQTLLFWFLASNDSK 208

Query: 141 NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--KHLKKCSTCGSVFGQ 193
           N L  P    +   A C+CH NTI++GY+CS CL++YC  K L  C  C S F +
Sbjct: 209 NILSPPSLTQISNSAVCYCHNNTIEIGYLCSSCLTVYCTDKSLIMCKICSSRFAK 263


>gi|82541502|ref|XP_724989.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23479826|gb|EAA16554.1| hypothetical protein [Plasmodium yoelii yoelii]
          Length = 327

 Score = 69.7 bits (169), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 86/195 (44%), Gaps = 28/195 (14%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG--PEQYVAIMNAIFSAQRSMVP 91
           S LS +L++ALCY  R+  +   +   RI  L  S       QY  +MN  ++A+R+ + 
Sbjct: 134 STLSSALAIALCYNNRIC-NLYENINSRIFLLDISKSHFYTNQYTQLMNIAYNAKRNKII 192

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQL----------DGLFQYLLTIFGTDLHSRN 141
           ID   L  + +  L+Q   IT G++                D L Q ++  F    HSR 
Sbjct: 193 IDVFSLNHK-TQILEQICNITNGLYIDNSIFLSINCGDNVEDILTQTIMFWFLPSTHSRK 251

Query: 142 FLQ---LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--------KHLKKCSTCGSV 190
           +     L +   +   A C CH   ID+ YICS CL+IYC        K    CS C + 
Sbjct: 252 YFSNTYLNEDTNI---AVCTCHNKQIDIAYICSCCLAIYCSEKDSQTNKERISCSVCKTR 308

Query: 191 FGQAQTQSDEPSATN 205
           F +A  ++   S  N
Sbjct: 309 FTKALLRNKTVSDLN 323


>gi|68063783|ref|XP_673887.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56492063|emb|CAI02316.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 327

 Score = 68.6 bits (166), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 63/224 (28%), Positives = 96/224 (42%), Gaps = 31/224 (13%)

Query: 5   CATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILC 64
           C    + L  F+ KD     +  E  +  S LS +L++ALCY  R+  +   +   RI  
Sbjct: 108 CTNAYEKLINFI-KDNNNNSKTYE--MVESTLSSALAIALCYNNRIC-NLYENINSRIFL 163

Query: 65  LQGSPDG--PEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL 122
           L  S       QY  +MN  ++A+R+ + ID   L  + +  L+Q   IT G++      
Sbjct: 164 LDISKSHFYTNQYTQLMNIAYNAKRNNIIIDVFSLNHK-TQILEQICNITNGLYIDNSIF 222

Query: 123 ----------DGLFQYLLTIFGTDLHSRNFLQ---LPKPVGVDFRASCFCHKNTIDMGYI 169
                     D L Q ++  F    HSR +     L +   +   A C CH   ID+ YI
Sbjct: 223 LSINCADNVEDILTQTIMFWFLPSTHSRKYFSNTYLNEDTNI---AVCTCHNKQIDIAYI 279

Query: 170 CSVCLSIYC--------KHLKKCSTCGSVFGQAQTQSDEPSATN 205
           CS CL+IYC        K    CS C + F ++  ++   S  N
Sbjct: 280 CSCCLAIYCSEKDAQTNKERISCSVCKTRFTKSLLRNKTVSDLN 323


>gi|401888190|gb|EJT52153.1| hypothetical protein A1Q1_06259 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 323

 Score = 67.4 bits (163), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 51/184 (27%), Positives = 79/184 (42%), Gaps = 59/184 (32%)

Query: 61  RILCLQGSPDGPEQYVA--------------------IMNAIFSAQRSM----------- 89
           RIL L  +P G E   A                    +MN +F+AQ+++           
Sbjct: 87  RILILNATPGGAEDAEATDSKTAKASGGGGGQRGYVGLMNCVFAAQKAVSLTAVEDELTC 146

Query: 90  --VPIDSCYLGAQNSA-----FLQQASYITGGVHHKPQQLDGLFQYL------------- 129
             V ID   L  +++      FLQQA+Y+T G++ +     GL QYL             
Sbjct: 147 QKVAIDVLTLPPESTMTAPPIFLQQAAYLTEGIYWRWNGRGGLLQYLHVSLLVPSVSADP 206

Query: 130 --LTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
             + +    L ++ F  +P    VDFRA CFCH + +D+G++C     I+C+    CS C
Sbjct: 207 QSMYLPPPSLRNKPF-TVPPQDAVDFRAVCFCHHDVVDVGFVC-----IFCEVKPICSMC 260

Query: 188 GSVF 191
            + F
Sbjct: 261 KTKF 264


>gi|76163131|gb|AAX30917.2| SJCHGC08726 protein [Schistosoma japonicum]
          Length = 77

 Score = 66.2 bits (160), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 6/76 (7%)

Query: 129 LLTIFGTDLHSRNFLQLPKP------VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLK 182
           LL++F      R  L LP         GVDFRA+CFCHK  ID+GY+CS+CLS++C+   
Sbjct: 1   LLSVFLPSSKMRTSLILPDSRSSGLSFGVDFRAACFCHKRLIDIGYVCSICLSVFCEFNP 60

Query: 183 KCSTCGSVFGQAQTQS 198
            C TC + F     Q+
Sbjct: 61  ICPTCNTPFVLPTVQN 76


>gi|70953884|ref|XP_746016.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56526511|emb|CAH87927.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 327

 Score = 66.2 bits (160), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 57/195 (29%), Positives = 86/195 (44%), Gaps = 28/195 (14%)

Query: 34  SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG--PEQYVAIMNAIFSAQRSMVP 91
           S LS +L++ALCY  R+  +   +   RI  L  S       QY  +MN  ++A+R+ + 
Sbjct: 134 STLSSALAIALCYNNRIC-NLYENINSRIFLLDISKSHFYTNQYTQLMNIAYNAKRNNII 192

Query: 92  IDSCYLGAQNSAFLQQASYITGGVHHKPQQL----------DGLFQYLLTIFGTDLHSRN 141
           ID   L  + +  L+Q   IT G++                D L Q ++  F    +SR 
Sbjct: 193 IDVFSLNHK-TQILEQICNITNGLYIDNSIFLSITCGDNVEDILTQTIMFWFLPSTNSRK 251

Query: 142 FLQ---LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--------KHLKKCSTCGSV 190
           +     L +   +   A C CH   ID+ YICS CL+IYC        K    CS C + 
Sbjct: 252 YFSNTYLNEDTNI---AVCTCHNKQIDIAYICSCCLAIYCSEKDAQTNKERISCSICKTR 308

Query: 191 FGQAQTQSDEPSATN 205
           F +A  ++   S  N
Sbjct: 309 FTKALLRNKTVSDLN 323


>gi|241605964|ref|XP_002405599.1| RNA polymerase II transcription initiation factor, putative [Ixodes
           scapularis]
 gi|215500649|gb|EEC10143.1| RNA polymerase II transcription initiation factor, putative [Ixodes
           scapularis]
          Length = 291

 Score = 65.9 bits (159), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 55/101 (54%), Gaps = 3/101 (2%)

Query: 20  EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVA 77
           E +   + E     SLL+G LS+ALCYI R+ +      +   RIL L  S +   QY+ 
Sbjct: 99  ELVLSDDSESAAGESLLTGGLSLALCYINRIEKESSSQNKIASRILVLSASGESASQYLN 158

Query: 78  IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
            MN  F+AQ+  V ID+C L  ++S  LQQ   ITGG + K
Sbjct: 159 FMNVFFTAQKKNVIIDACVL-EKDSGLLQQGCDITGGKYMK 198


>gi|380477141|emb|CCF44315.1| RNA polymerase II transcription factor B subunit 4 [Colletotrichum
           higginsianum]
          Length = 255

 Score = 65.1 bits (157), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 59/117 (50%), Gaps = 19/117 (16%)

Query: 16  MNKDEQLGKQEPEGRIACSL--LSGSLSMALCYIQ----------------RVFRSGLLH 57
           M+   +L     E  +AC+   LSG+L++AL +I                 R   S L  
Sbjct: 128 MSSMRKLVDATTEAXLACTTTQLSGALTLALAHINKTSLSLNETNKAPDVPRSTSSALSR 187

Query: 58  PQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGG 114
            + RI  L  S   P QY++ MNA+F+A  S +PID+  L   ++ FLQQASYIT G
Sbjct: 188 LRARIFILSVSDSEPVQYISTMNAVFAAAHSQIPIDTLALSG-DATFLQQASYITDG 243


>gi|19074510|ref|NP_586016.1| hypothetical protein ECU07_0880 [Encephalitozoon cuniculi GB-M1]
 gi|19069152|emb|CAD25620.1| hypothetical protein [Encephalitozoon cuniculi GB-M1]
 gi|449329589|gb|AGE95860.1| hypothetical protein ECU07_0880 [Encephalitozoon cuniculi]
          Length = 213

 Score = 65.1 bits (157), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 46/163 (28%), Positives = 80/163 (49%), Gaps = 25/163 (15%)

Query: 38  GSLSMALCYIQRVFRSGLLH--PQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 95
           G L  ALC          LH   + +I+      +  ++Y+  +  +F+AQR  + + S 
Sbjct: 74  GDLGYALC----------LHRDEESQIVIFTLGRERKDEYLRYLKCMFAAQRFGIRV-SA 122

Query: 96  YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 155
           +   +N   LQ  +  TGG +   +  DG  ++LL++ GT          PKP+G  F A
Sbjct: 123 FSLFENKTILQCCAS-TGGDYSTSE--DGCLRFLLSLLGTRGE-------PKPLG--FPA 170

Query: 156 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQS 198
           +C+CH   + +G +C +CLS++C+ +  C  C S F   + +S
Sbjct: 171 TCYCHNRQVLLGLVCPICLSVFCRFVPVCKKCKSKFSFTKYES 213


>gi|303389949|ref|XP_003073206.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB4 [Encephalitozoon
           intestinalis ATCC 50506]
 gi|303302351|gb|ADM11846.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB4 [Encephalitozoon
           intestinalis ATCC 50506]
          Length = 213

 Score = 65.1 bits (157), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 81/161 (50%), Gaps = 21/161 (13%)

Query: 38  GSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL 97
           G L  ALC    + R G    + +I+      +  ++Y+  +  +F+AQR  V I +  L
Sbjct: 74  GDLGYALC----LHRLG----ESQIVIFTLEKERNDEYLRYLKCMFAAQRFGVRISAFSL 125

Query: 98  GAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASC 157
             +N   LQ  +  TGG +   +  DG  ++LL++ GT   S       KP+G  F A+C
Sbjct: 126 -FENKTILQCCA-ATGGGYSTSE--DGCLRFLLSLLGTKGDS-------KPLG--FPATC 172

Query: 158 FCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQS 198
           +CH   + +G +C +CLS++C+ +  C  C S F   + +S
Sbjct: 173 YCHNKQVLLGLVCPICLSVFCRFVPVCKKCKSKFSFVKHES 213


>gi|156101255|ref|XP_001616321.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805195|gb|EDL46594.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 323

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 87/203 (42%), Gaps = 23/203 (11%)

Query: 13  EEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG- 71
           EE  NK  +   +    ++  S LS +L++ALCY  R+  +   + + RI  L  S +  
Sbjct: 108 EEAYNKLIEFLSENKTEKMMESSLSSALAVALCYNHRIV-NMYENMKGRIFLLDISNETY 166

Query: 72  --PEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL------- 122
               QY  +MN  ++A+R+ + ID   L  +    L+Q   IT G++             
Sbjct: 167 VYTNQYTQLMNIAYNAKRNNIIIDVFSLNCK-IQLLEQICNITNGLYIDSSIFFNVNCTD 225

Query: 123 ---DGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC- 178
              D L Q ++  F    ++R +            A C CH   +D+ YICS CL+IYC 
Sbjct: 226 NIEDVLTQTMIFWFLPSNNTRKYFSNTYLNDDTNIAVCSCHNKQVDIAYICSCCLAIYCS 285

Query: 179 -------KHLKKCSTCGSVFGQA 194
                  K    C  C + F +A
Sbjct: 286 EKDEKTNKERTSCFVCKTRFTKA 308


>gi|225707288|gb|ACO09490.1| TFIIH basal transcription factor complex p34 subunit [Osmerus
           mordax]
          Length = 208

 Score = 63.9 bits (154), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 37/94 (39%), Positives = 53/94 (56%), Gaps = 3/94 (3%)

Query: 22  LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
           + + E +G    +LL+GSL+ ALCYI RV +        + RIL  + + D   QY+  M
Sbjct: 111 MSRTEVKGHQTDTLLAGSLAKALCYIHRVTKELEAGQELKSRILVFKAAEDCARQYMNFM 170

Query: 80  NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 113
           N IF+AQR  + ID+C L ++ S  LQQ   I G
Sbjct: 171 NVIFAAQRQNILIDACVLDSE-SGLLQQLISICG 203


>gi|367054924|ref|XP_003657840.1| hypothetical protein THITE_2123949 [Thielavia terrestris NRRL 8126]
 gi|347005106|gb|AEO71504.1| hypothetical protein THITE_2123949 [Thielavia terrestris NRRL 8126]
          Length = 459

 Score = 63.2 bits (152), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 25/50 (50%), Positives = 35/50 (70%), Gaps = 2/50 (4%)

Query: 142 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
            L  P    VDFRA+CFCH+N +D G++CS+CLSI+C+     +C TCG+
Sbjct: 352 LLVTPSADAVDFRAACFCHRNVVDTGFVCSICLSIFCEVPDGGECLTCGT 401



 Score = 60.5 bits (145), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 33/82 (40%), Positives = 47/82 (57%), Gaps = 1/82 (1%)

Query: 54  GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 113
           GL     RIL L  S   P QY+  MNA+F+A  + + ID+  L   ++ FL+QAS+IT 
Sbjct: 207 GLAGMHARILILSVSDSSPAQYIPTMNAVFAAAHARIAIDTLSLRG-SATFLEQASFITR 265

Query: 114 GVHHKPQQLDGLFQYLLTIFGT 135
           G   +  +  GL QYL+  FG+
Sbjct: 266 GTFIRAAEPRGLLQYLMFGFGS 287


>gi|221057700|ref|XP_002261358.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
 gi|194247363|emb|CAQ40763.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
          Length = 323

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 86/203 (42%), Gaps = 23/203 (11%)

Query: 13  EEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG- 71
           EE  NK  +   +    ++  S LS +L++ALCY  R+  +   + + RI  L  S    
Sbjct: 108 EEAYNKLIEFINENKTEKMMESNLSSALAVALCYNHRIV-NMYENMKGRIFLLDISNQSY 166

Query: 72  --PEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL------- 122
               QY  +MN  ++A+R+ + ID   L  +    L+Q   IT G++             
Sbjct: 167 VYTNQYTQLMNIAYNAKRNNIIIDVFSLNYK-IQLLEQICNITNGLYIDTTIFFNVNCTD 225

Query: 123 ---DGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC- 178
              D L Q ++  F    ++R +            A C CH   +D+ YICS CL+IYC 
Sbjct: 226 NIEDVLTQTMIFWFLPSNNTRKYFSNTYLNDDTNIAVCSCHNKQVDIAYICSCCLAIYCS 285

Query: 179 -------KHLKKCSTCGSVFGQA 194
                  K    C  C + F +A
Sbjct: 286 EKDDKTNKDRISCFVCKTRFTKA 308


>gi|32399081|emb|CAD98321.1| transcription factor, possible [Cryptosporidium parvum]
          Length = 311

 Score = 62.0 bits (149), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 53/175 (30%), Positives = 76/175 (43%), Gaps = 31/175 (17%)

Query: 59  QPRILCLQGSPDGP--EQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
           + RI+ L  S       QYV+++N  F++Q+  V ID   +    S  L     I  G++
Sbjct: 132 KDRIILLDASNKDEYISQYVSLLNCGFASQKLDVVIDVVSVTRNPSPLLNNLVDICNGLN 191

Query: 117 HKPQQL-DGLFQYLLTIFGTDLHSRN--------FLQLP---------------KPVGVD 152
            K  Q+ D L +  + I   D H  +        F  LP               K  G+ 
Sbjct: 192 LKYSQIVDSLAESNIEINKEDHHWLDQGLTPFLIFHLLPSIQAREEIFISINKTKQTGL- 250

Query: 153 FRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGSVFGQAQTQSDEPSATN 205
             A CFCH   I++GY+CS CLSI+C   K   CSTCG+   +   Q    S+ N
Sbjct: 251 --AVCFCHHQKIEIGYVCSSCLSIFCSRFKAPICSTCGARLKRIPIQQKTLSSLN 303


>gi|389584514|dbj|GAB67246.1| hypothetical protein PCYB_112670, partial [Plasmodium cynomolgi
           strain B]
          Length = 323

 Score = 61.6 bits (148), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 56/206 (27%), Positives = 87/206 (42%), Gaps = 29/206 (14%)

Query: 13  EEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPD-- 70
           EE  NK  +   +    ++  S LS +L++ALCY  R+  +   + + RI  L  S    
Sbjct: 108 EEAYNKLIEFINENKTEKMMESSLSSALAVALCYNHRIV-NMYENMKGRIFLLDISNQIY 166

Query: 71  -GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL------- 122
               QY  +MN  ++A+R+ + ID   L  +    L+Q   IT G++             
Sbjct: 167 VYTNQYTQLMNIAYNAKRNDIIIDVFSLNYK-IKLLEQICNITNGLYIDSSIFFNVNCTD 225

Query: 123 ---DGLFQYLLTIFGTDLHSRNFLQ---LPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 176
              D L Q ++  F     +R +     L     +   A C CH   +D+ YICS CL+I
Sbjct: 226 NIEDVLTQTMIFWFLPSNSTRKYFSNTYLNDDTNI---AVCSCHNKQVDIAYICSCCLAI 282

Query: 177 YC--------KHLKKCSTCGSVFGQA 194
           YC        K    C  C + F +A
Sbjct: 283 YCSEKDDKTNKDRTSCFVCKTRFTKA 308


>gi|209882598|ref|XP_002142735.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209558341|gb|EEA08386.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 310

 Score = 61.2 bits (147), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 84/193 (43%), Gaps = 23/193 (11%)

Query: 36  LSGSLSMALCYIQ--RVFRSGLLHPQPRILCLQGSPDGP--EQYVAIMNAIFSAQRSMVP 91
           L+  +S+ LC I   R   S     + RIL    S +     QYV ++N+ ++A +  V 
Sbjct: 110 LASGISICLCKINSDRKLHSKNRTNKDRILLFDASNEKSYINQYVLLINSSYAALKLDVI 169

Query: 92  IDSCYLGAQNSAFLQQASYITGGVH---------------HKPQQLDGLFQYLL--TIFG 134
           ID C + +  S  L     I+ G++                K    DGL  +++   +  
Sbjct: 170 IDVCSISSNPSRLLYNVVDISKGIYINYSSIIKNVSSKIDDKSHIQDGLLPFIIFHLLPS 229

Query: 135 TDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGSVFG 192
            +L + + + L         + CFCH   +++G+ICS CL+I+C   +   C+ CG+ F 
Sbjct: 230 AELRANSLINLSTKSRHSGISVCFCHYKKVEVGFICSSCLAIFCSLFRAPICAACGARFK 289

Query: 193 QAQTQSDEPSATN 205
           +   Q    S  N
Sbjct: 290 RVPIQQKPLSYLN 302


>gi|396081719|gb|AFN83334.1| RNA polymerase II transcription [Encephalitozoon romaleae SJ-2008]
          Length = 213

 Score = 61.2 bits (147), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/163 (30%), Positives = 77/163 (47%), Gaps = 25/163 (15%)

Query: 38  GSLSMALCYIQRVFRSGLLHPQP--RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 95
           G L  ALC          LH     +I+      +  ++Y+  +  +F+AQR  V I S 
Sbjct: 74  GDLGYALC----------LHKDEPTQIVIFTLGRERKDEYLRYLKCMFAAQRLGVRI-SA 122

Query: 96  YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 155
           +   +N   LQ     TGG +   +  D   ++LL++ GT           KP+G  F A
Sbjct: 123 FSLFENKTILQ-CCVSTGGDYSTSE--DRCLRFLLSLLGTG-------GSQKPLG--FPA 170

Query: 156 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQS 198
           +C+CH   I +G +C +CLS++CK +  C  C S F  A+ +S
Sbjct: 171 TCYCHNRQILLGLVCPICLSVFCKFIPVCKKCKSKFSFAKYES 213


>gi|126650916|ref|XP_001388356.1| transcription factor [Cryptosporidium parvum Iowa II]
 gi|126117450|gb|EAZ51550.1| transcription factor, putative [Cryptosporidium parvum Iowa II]
          Length = 286

 Score = 59.7 bits (143), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 50/158 (31%), Positives = 70/158 (44%), Gaps = 31/158 (19%)

Query: 59  QPRILCLQGSPDGP--EQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
           + RI+ L  S       QYV+++N  F++Q+  V ID   +    S  L     I  G++
Sbjct: 132 KDRIILLDASNKDEYISQYVSLLNCGFASQKLDVVIDVVSVTRNPSPLLNNLVDICNGLN 191

Query: 117 HKPQQL-DGLFQYLLTIFGTDLHSRN--------FLQLP---------------KPVGVD 152
            K  Q+ D L +  + I   D H  +        F  LP               K  G+ 
Sbjct: 192 LKYSQIVDSLAESNIEINKEDHHWLDQGLTPFLIFHLLPSIQAREEIFISINKTKQTGL- 250

Query: 153 FRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCG 188
             A CFCH   I++GY+CS CLSI+C   K   CSTCG
Sbjct: 251 --AVCFCHHQKIEIGYVCSSCLSIFCSRFKAPICSTCG 286


>gi|401827001|ref|XP_003887593.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB4 [Encephalitozoon hellem
           ATCC 50504]
 gi|392998599|gb|AFM98612.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB4 [Encephalitozoon hellem
           ATCC 50504]
          Length = 213

 Score = 59.3 bits (142), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 68/129 (52%), Gaps = 13/129 (10%)

Query: 70  DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYL 129
           +  ++Y+  +  +F+AQR  V I +  L  +N   LQ  +  TGG +   +  D   ++L
Sbjct: 98  EKKDEYLRYLKCMFAAQRLGVRISAFSL-FENKTILQCCAS-TGGDYSTSE--DKCLRFL 153

Query: 130 LTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGS 189
           L++ GT  +        KP+G  F A+C+CH   + +G +C +CLS++CK +  C  C S
Sbjct: 154 LSLLGTGGN-------KKPLG--FPAACYCHNRQVILGLVCPICLSVFCKFIPVCRKCKS 204

Query: 190 VFGQAQTQS 198
            F   + ++
Sbjct: 205 KFSFVKYEN 213


>gi|294947738|ref|XP_002785469.1| hypothetical protein Pmar_PMAR005787 [Perkinsus marinus ATCC 50983]
 gi|239899380|gb|EER17265.1| hypothetical protein Pmar_PMAR005787 [Perkinsus marinus ATCC 50983]
          Length = 385

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 56/197 (28%), Positives = 81/197 (41%), Gaps = 39/197 (19%)

Query: 35  LLSGSLSMALCYIQRVFRSGLLHPQPRILCL------QGSPDGPEQYVAIMNAIFSAQRS 88
           L++  LS+ALCY+ +     L  P     C+       G  +   + V + N  ++AQ  
Sbjct: 150 LIAAGLSLALCYLNK----HLTIPGRARCCIVEAVSVYGEANYASEAVPLANCGWAAQDL 205

Query: 89  MVPIDSCYL-----------------------GAQNSAFLQQASYITGGVHHKPQQ---L 122
            VPID C +                          ++  L Q    T GVH   ++   +
Sbjct: 206 RVPIDLCTVCPGRTPQSSSPSNDRSSSSSSNASPTSATLLVQLCEATRGVHIPSKRCSTV 265

Query: 123 DGLFQYLLTIFGT-DLHSRNFLQL-PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
            GLFQ L+  F    L  R  L+  P    +D  + C CH   +D GY+CSVCL IYC  
Sbjct: 266 GGLFQSLMFHFSVPSLGEREVLKTRPSARLLDMGSLCACHGLPVDRGYVCSVCLCIYCND 325

Query: 181 LKK-CSTCGSVFGQAQT 196
               C  CG+ F +  T
Sbjct: 326 SSGVCRKCGARFKRTAT 342


>gi|294889292|ref|XP_002772747.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239877280|gb|EER04563.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 383

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 56/197 (28%), Positives = 81/197 (41%), Gaps = 39/197 (19%)

Query: 35  LLSGSLSMALCYIQRVFRSGLLHPQPRILCL------QGSPDGPEQYVAIMNAIFSAQRS 88
           L++  LS+ALCY+ +     L  P     C+       G  +   + V + N  ++AQ  
Sbjct: 148 LIAAGLSLALCYLNK----HLTIPGRARCCIVEAVSVYGEANYASEAVPLANCGWAAQDL 203

Query: 89  MVPIDSCYL-----------------------GAQNSAFLQQASYITGGVHHKPQQ---L 122
            VPID C +                          ++  L Q    T GVH   ++   +
Sbjct: 204 RVPIDLCTVCPGRTPQSSSPSNDRSSSSSSNASPTSATLLVQLCEATRGVHIPSKRCSTV 263

Query: 123 DGLFQYLLTIFGT-DLHSRNFLQL-PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
            GLFQ L+  F    L  R  L+  P    +D  + C CH   +D GY+CSVCL IYC  
Sbjct: 264 GGLFQSLMFHFSVPSLGEREVLKTRPSARLLDMGSLCACHGLPVDRGYVCSVCLCIYCND 323

Query: 181 LKK-CSTCGSVFGQAQT 196
               C  CG+ F +  T
Sbjct: 324 SSGVCRKCGARFKRTAT 340


>gi|403223596|dbj|BAM41726.1| uncharacterized protein TOT_040000918 [Theileria orientalis strain
           Shintoku]
          Length = 332

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 47/168 (27%), Positives = 78/168 (46%), Gaps = 33/168 (19%)

Query: 36  LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG-PEQYVAIMNAIFSAQRSMVPIDS 94
           L+ ++SM   Y+ R+ +S     +  I+    S +    QY+ +MN  F+A +  + I++
Sbjct: 141 LTSAISMGCLYLNRITKSKAGFGRKIIIFDVSSRENYKSQYIGLMNIAFTALKQNITINT 200

Query: 95  CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN----------FLQ 144
             LG Q S  L+Q S IT       + LD     L  IF +D+ + N          F  
Sbjct: 201 LALG-QPSRILEQLSTITNA-----KYLD-----LSKIFQSDVETSNIEQSLSQLISFWL 249

Query: 145 LP-KPVG------VDFR----ASCFCHKNTIDMGYICSVCLSIYCKHL 181
           LP + VG      + F     A C+CH  T+++ Y+C  C ++YC  +
Sbjct: 250 LPSEEVGEILSTKLSFEFGNTAICYCHYKTVEVSYLCPCCFAVYCSEV 297


>gi|198471480|ref|XP_002133743.1| GA22626 [Drosophila pseudoobscura pseudoobscura]
 gi|198145935|gb|EDY72370.1| GA22626 [Drosophila pseudoobscura pseudoobscura]
          Length = 390

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 70/159 (44%), Gaps = 9/159 (5%)

Query: 34  SLLSGSLSMALCYIQRVFR-----SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRS 88
           SLL+ SL +ALCYIQ+  R     S     Q RIL + GS    + Y    N I  A  +
Sbjct: 157 SLLAESLGLALCYIQKRRRQIPNTSHGRSVQGRILIVTGSVLHEDVYKNYDNVIQEALET 216

Query: 89  MVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKP 148
            V I+ C +       LQ+A+  T G++        L   L+  F   L+    ++ P+ 
Sbjct: 217 NVTINVCAVKVPKQLPLQRATDTTDGLYFCTNDTMSLSGDLVDHF---LYPTYLIKSPRS 273

Query: 149 VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
             +   A C CH     +G++C  C  + CKH   C  C
Sbjct: 274 -HIAVPARCACHGFGNKIGFVCGKCKLVLCKHSPYCDDC 311


>gi|449019603|dbj|BAM83005.1| similar to TFIIH subunit TFB4 [Cyanidioschyzon merolae strain 10D]
          Length = 399

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/165 (27%), Positives = 69/165 (41%), Gaps = 25/165 (15%)

Query: 56  LHPQPRILCLQG-SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA--------QNSAFLQ 106
           L P+ R++ +    P    + V + NA F AQ+  + +D   + A        Q  A LQ
Sbjct: 231 LSPRARLVVISAPGPFDVAESVPLWNAAFCAQQHQLLVDVVRIEATSEPGTKPQARAALQ 290

Query: 107 QASYITGGVHHKPQ-QLDGLFQYLLTIFGTDLHSRNFLQLPKPV---------------G 150
           Q +Y T G++ + +     LF +L T    +   R +L    P                 
Sbjct: 291 QLAYATQGIYLECRCACAKLFTHLATYIVPNRLERRYLTRRIPELGSYASGTERCGVVES 350

Query: 151 VDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQ 195
           VD RA+CF     + MGY+CS CLS+Y      C  C +    A+
Sbjct: 351 VDLRATCFQSGKLVHMGYVCSRCLSVYSTPQAACIVCDARLDAAE 395


>gi|392576127|gb|EIW69258.1| hypothetical protein TREMEDRAFT_62112 [Tremella mesenterica DSM
           1558]
          Length = 297

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 21/144 (14%)

Query: 4   LCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSG--LLHPQPR 61
           L + L Q L   M+++   G +E +G      L  +++ ALCY+ R+  S   L     R
Sbjct: 107 LDSGLEQGLRGMMSEE---GAREGKGLNEPPALVAAMTKALCYLNRLSPSTATLQTTDTR 163

Query: 62  ILCLQGSPDGPEQ-----------YVAIMNAIFSAQRSMVPID-----SCYLGAQNSAFL 105
           +L +  +P                YV +MN +F+AQ+  +PID       Y       FL
Sbjct: 164 MLIINATPGSSFSESDSSGGLRGGYVGLMNCVFAAQKGKIPIDVLTLPPSYTQTSPPIFL 223

Query: 106 QQASYITGGVHHKPQQLDGLFQYL 129
           QQA+++TGG++ +     G+ QYL
Sbjct: 224 QQAAHLTGGMYWRWNGRGGVLQYL 247


>gi|71029634|ref|XP_764460.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68351414|gb|EAN32177.1| hypothetical protein, conserved [Theileria parva]
          Length = 348

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 48/178 (26%), Positives = 78/178 (43%), Gaps = 30/178 (16%)

Query: 36  LSGSLSMALCYIQRV-FRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 94
           L+ +++    Y+ R+  R+     +  I  +  S +   QY+ +MN  +SA    + I++
Sbjct: 157 LTSAIATGFLYLNRIRMRNEGYGRKIIIFDVSTSENYKSQYIGLMNIAYSALAQNITINT 216

Query: 95  CYLGAQNSAFLQQASYITGGVH--------------HKPQQLDGL--FQYLLTIFGTDLH 138
             LG Q S  L+Q S IT G +              H  Q L  L  F YL +   ++L 
Sbjct: 217 FALG-QPSRILEQLSSITRGKYLLVSKVLNFDLNFSHINQYLSQLITFWYLPSEGMSELL 275

Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK-------CSTCGS 189
           S N       VG+     C+CH  ++D+ Y+C  C ++YC  +         C  CGS
Sbjct: 276 STNLSFDFGNVGI-----CYCHYKSVDVSYLCPCCFAVYCSEIDDKGKYRIICMVCGS 328


>gi|312078926|ref|XP_003141952.1| basic transcription factor 2 [Loa loa]
          Length = 253

 Score = 53.5 bits (127), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 41/145 (28%), Positives = 71/145 (48%), Gaps = 23/145 (15%)

Query: 34  SLLSGSLSMALCYIQR-----------VFR--SGLLHPQPRILCLQGSPDGPEQYVAIMN 80
           ++ + ++++A+C+I+            +FR  + + +   RIL +    D   ++  +MN
Sbjct: 101 AVFAPAVAIAICHIKDDRAILNNISSDIFRYKNEVDNGDGRILIINIGSDLRGEHSILMN 160

Query: 81  AIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL--TIFGTDLH 138
             F+A +  + ID   +G + S  LQQAS ITGG +   ++   L +Y +  T+    L 
Sbjct: 161 IFFAAHKHNILIDVANIG-EASTILQQASDITGGTYFSVKKPKQLLKYTMCFTLGKASLR 219

Query: 139 SRNFLQLPKP---VGVDFRASCFCH 160
           S      P P     VD+RASC CH
Sbjct: 220 S----AFPSPSSSTSVDYRASCHCH 240


>gi|402471701|gb|EJW05288.1| hypothetical protein EDEG_00646 [Edhazardia aedis USNM 41457]
          Length = 222

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 44/167 (26%), Positives = 70/167 (41%), Gaps = 25/167 (14%)

Query: 28  EGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 87
           E ++  S L+  ++  LC I +   S     +  +L L    +   Q + +  A   A+ 
Sbjct: 68  EYKVCSSNLAQDIAYTLCLINKNRDSS--EARIFVLNLDYQDNNTIQLLKVSTA---AKN 122

Query: 88  SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 147
             + IDS Y   Q S  L Q   ITGG++     +       + +FG       F +L  
Sbjct: 123 MKIRIDS-YSSVQ-SPVLSQVCNITGGIYRTSLDI------FVNLFG-------FYELGD 167

Query: 148 PVGVDF-----RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGS 189
             G D         C CH N I +G +C +CL+IYC+ +  C  C +
Sbjct: 168 ESGTDMSGCFGNVKCICHGNIITLGLVCPICLAIYCRTVPICRVCKT 214


>gi|207340364|gb|EDZ68737.1| YPR056Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 230

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 56/106 (52%), Gaps = 6/106 (5%)

Query: 10  QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
           + L E + K  +L K++ E     S L+G++S  L Y+ R+ +  +    + R+L L   
Sbjct: 123 ETLVEEIYKLFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182

Query: 67  --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL-GAQNSAFLQQAS 109
              S D   QY+ IMN IFSA +   PID   + G++ S FLQQ +
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTT 228


>gi|224106784|ref|XP_002333630.1| predicted protein [Populus trichocarpa]
 gi|222837853|gb|EEE76218.1| predicted protein [Populus trichocarpa]
          Length = 125

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 39/74 (52%), Gaps = 11/74 (14%)

Query: 70  DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYL 129
           DGPEQ+VA+MNA F    S+VPIDSC           QAS +TG V+ K QQ+  +    
Sbjct: 33  DGPEQHVAVMNATFCTPCSLVPIDSC-----------QASCVTGCVYVKSQQVSLVNHRP 81

Query: 130 LTIFGTDLHSRNFL 143
              F  ++   N L
Sbjct: 82  FISFAPEVAVENLL 95


>gi|403292309|ref|XP_003937192.1| PREDICTED: tectonic-2 [Saimiri boliviensis boliviensis]
          Length = 940

 Score = 46.6 bits (109), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 50/86 (58%), Gaps = 4/86 (4%)

Query: 8   LLQNLEEFMNKD--EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRIL 63
           LL +  E + ++  + + K + +G+   +LL+GSL+ ALCYI R+ +        + RIL
Sbjct: 104 LLTSANEVIAEEIKDLMTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRIL 163

Query: 64  CLQGSPDGPEQYVAIMNAIFSAQRSM 89
            ++ + D   QY+  MN IF+AQ+ +
Sbjct: 164 VIKAAEDSALQYMNFMNVIFAAQKQL 189


>gi|440302599|gb|ELP94906.1| RNA polymerase II transcription factor B subunit, partial
           [Entamoeba invadens IP1]
          Length = 188

 Score = 45.8 bits (107), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 6/98 (6%)

Query: 36  LSGSLSMALCYIQRVFRSG-LLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 94
           LS ++S A+C I    +S   LH   +I+ +    +    ++ +MN IF+AQ+  V ID+
Sbjct: 89  LSAAMSSAMCRINSYSKSNPTLHK--KIVAISTPLNAAPTFITVMNNIFAAQKIGVSIDT 146

Query: 95  CYLGAQNS--AFLQQASYITGGVHHKPQQLDGLFQYLL 130
             +          QQASY+T G+++    LD L   LL
Sbjct: 147 IVISDNEEPCTICQQASYLTNGIYNM-SSLDTLLPRLL 183


>gi|160331405|ref|XP_001712410.1| hypothetical protein HAN_2g261 [Hemiselmis andersenii]
 gi|159765858|gb|ABW98085.1| hypothetical protein HAN_2g261 [Hemiselmis andersenii]
          Length = 293

 Score = 45.8 bits (107), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 38/139 (27%), Positives = 59/139 (42%), Gaps = 22/139 (15%)

Query: 68  SPDGPEQYVAIMNAIFS-AQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLF 126
           S    ++++A  N +F  AQ++    D+   G ++  F    S  T G++ +P  L  LF
Sbjct: 162 SSKTRKEHIAFSNQLFFLAQKNKFSFDTLLFGRKDCFFFHFLSEKTKGIYCRP--LKNLF 219

Query: 127 QY---------LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDM----GYICSVC 173
           +          LL++F T   SR F  LP      F       KNT ++     + C +C
Sbjct: 220 ELAFTEEFITILLSLFLTSPFSREFYILP------FSTKILNKKNTTNLIAKKKFSCPIC 273

Query: 174 LSIYCKHLKKCSTCGSVFG 192
            SI+      C  CG VF 
Sbjct: 274 FSIFNFLFTNCFVCGFVFS 292


>gi|328769012|gb|EGF79057.1| hypothetical protein BATDEDRAFT_90182 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 153

 Score = 45.4 bits (106), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 24/53 (45%), Positives = 32/53 (60%)

Query: 35  LLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 87
           L+   +S A  +I ++  S     Q RIL L  SPDG  QY+ IMNAIF+AQ+
Sbjct: 97  LVLADVSAAPKHINKMQSSTTPDRQSRILVLSVSPDGSAQYIPIMNAIFAAQK 149


>gi|387592672|gb|EIJ87696.1| hypothetical protein NEQG_02243 [Nematocida parisii ERTm3]
 gi|387595301|gb|EIJ92926.1| hypothetical protein NEPG_02325 [Nematocida parisii ERTm1]
          Length = 210

 Score = 44.3 bits (103), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 50/109 (45%), Gaps = 11/109 (10%)

Query: 83  FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 142
           F+A++  + +D C +   N  F+ Q S +           + LF+YL+ +   D     F
Sbjct: 111 FTARKLNIHVD-CII--PNDGFVSQVSDVLKRPAFCISSGESLFEYLMGMLSID-----F 162

Query: 143 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            ++ +    +    C CH   I  GY+C +CL +YCK +  C  C + F
Sbjct: 163 GRISRK---NVHRFCICHNKEIQTGYLCPICLGLYCKFVPLCKHCKTRF 208


>gi|378754778|gb|EHY64807.1| hypothetical protein NERG_02210 [Nematocida sp. 1 ERTm2]
          Length = 210

 Score = 44.3 bits (103), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 49/109 (44%), Gaps = 11/109 (10%)

Query: 83  FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 142
           F+A++  + +D C +   +  F  Q S +           + LF+YL+ +   D     F
Sbjct: 111 FTAKKQNIHVD-CVI--PHDGFASQISDVLNRPAFSISSNESLFEYLMGMLSMD-----F 162

Query: 143 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
            ++ K    +    C CH   I  GY+C +CL +YCK +  C  C + F
Sbjct: 163 GRISKK---NLSRFCICHHKEIQTGYLCPICLGLYCKFVPLCKHCKTRF 208


>gi|269860080|ref|XP_002649763.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit TFB4 [Enterocytozoon
           bieneusi H348]
 gi|220066822|gb|EED44293.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit TFB4 [Enterocytozoon
           bieneusi H348]
          Length = 202

 Score = 43.9 bits (102), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 27/45 (60%)

Query: 148 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFG 192
           P+  +F  +C+C   TI +G+ CS+CL+IYCK +  C  C   F 
Sbjct: 155 PILNNFGVNCYCCNKTIYLGFCCSICLTIYCKFIPICKKCKIKFN 199


>gi|403359548|gb|EJY79438.1| putative: similar to general transcription factor IIH, polypeptide
           3, 34kDa [Oxytricha trifallax]
 gi|403360227|gb|EJY79783.1| putative: similar to general transcription factor IIH, polypeptide
           3, 34kDa [Oxytricha trifallax]
          Length = 255

 Score = 42.4 bits (98), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 32/126 (25%), Positives = 57/126 (45%), Gaps = 27/126 (21%)

Query: 36  LSGSLSMALCYIQRVFRSGLLHPQP----RILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           LS ++  A+C+ ++      +        RI+ ++   + P QY +++NAIF+AQ+  + 
Sbjct: 116 LSTAMMKAVCFNKKWIEDVAVKRSESFSARIMIVKFQSEDPSQYSSLVNAIFAAQKLKIS 175

Query: 92  IDSCYL--------------------GAQNSAFLQQASYITGGVHHK--PQQLDG-LFQY 128
           +DS  +                      + S+ LQQA+ +T G  +K     LD  LF  
Sbjct: 176 VDSLIIQYAPPQAIQEPQDGKQQPASQEEESSLLQQAALLTNGTSYKLFSNNLDSELFPT 235

Query: 129 LLTIFG 134
           LL +F 
Sbjct: 236 LLELFS 241


>gi|429963037|gb|ELA42581.1| hypothetical protein VICG_00333 [Vittaforma corneae ATCC 50505]
          Length = 215

 Score = 42.4 bits (98), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 28/118 (23%), Positives = 48/118 (40%), Gaps = 16/118 (13%)

Query: 74  QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIF 133
           QY+  +  +F AQ   +PI         +  ++     +GG+  +      +FQ L    
Sbjct: 110 QYLEYLKCMFVAQHRKIPIHG--FSLHRNILVRMCCEGSGGIFLESCSFSDMFQLL---- 163

Query: 134 GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
           G     ++  Q+           C C  N + +G +C VCL +YCK +  C  C + F
Sbjct: 164 GNRTKKKDAYQI----------KCACCNNFVTLGLVCPVCLLVYCKFMPVCKKCKTKF 211


>gi|403376319|gb|EJY88136.1| putative: similar to general transcription factor IIH, polypeptide
           3, 34kDa [Oxytricha trifallax]
          Length = 230

 Score = 42.4 bits (98), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 32/126 (25%), Positives = 57/126 (45%), Gaps = 27/126 (21%)

Query: 36  LSGSLSMALCYIQRVFRSGLLHPQP----RILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
           LS ++  A+C+ ++      +        RI+ ++   + P QY +++NAIF+AQ+  + 
Sbjct: 91  LSTAMMKAVCFNKKWIEDVAVKRSESFSARIMIVKFQSEDPSQYSSLVNAIFAAQKLKIS 150

Query: 92  IDSCYL--------------------GAQNSAFLQQASYITGGVHHK--PQQLDG-LFQY 128
           +DS  +                      + S+ LQQA+ +T G  +K     LD  LF  
Sbjct: 151 VDSLIIQYAPPQAIQEPQDGKQQPASQEEESSLLQQAALLTNGTSYKLFSNNLDSELFPT 210

Query: 129 LLTIFG 134
           LL +F 
Sbjct: 211 LLELFS 216


>gi|68065658|ref|XP_674813.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56493631|emb|CAH95047.1| hypothetical protein PB001049.00.0 [Plasmodium berghei]
          Length = 89

 Score = 41.2 bits (95), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 38/89 (42%), Gaps = 13/89 (14%)

Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC------ 178
           +F +L +       S  +L     + V     C CH   ID+ YICS CL+IYC      
Sbjct: 2   MFWFLPSTHSRKYFSNTYLNEDTNIAV-----CTCHNKQIDIAYICSCCLAIYCSEKDAQ 56

Query: 179 --KHLKKCSTCGSVFGQAQTQSDEPSATN 205
             K    CS C + F ++  ++   S  N
Sbjct: 57  TNKERISCSVCKTRFTKSLLRNKTVSDLN 85


>gi|429965399|gb|ELA47396.1| hypothetical protein VCUG_01165 [Vavraia culicis 'floridensis']
          Length = 215

 Score = 40.0 bits (92), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 37/136 (27%), Positives = 52/136 (38%), Gaps = 26/136 (19%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
           RI+ +  S +  + Y+  + + F A+R        Y    N + L          HHK  
Sbjct: 99  RIIIISLSRENNKDYLKYLKSAFVAKR--------YSDRYNISVLS---------HHKNP 141

Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQL---PKPVGVDF-RASCFCHKNTIDMGYICSVCLSI 176
            L  +       F  +     FLQ+    KP  + F    C CH   I  G +C VCLSI
Sbjct: 142 ALSEI-----GCFYNNFALSTFLQILSGKKPQKIFFCSTKCSCHDREILYGLVCPVCLSI 196

Query: 177 YCKHLKKCSTCGSVFG 192
           YC  +  C  C   F 
Sbjct: 197 YCSLIPICKRCRIRFN 212


>gi|164662823|ref|XP_001732533.1| hypothetical protein MGL_0308 [Malassezia globosa CBS 7966]
 gi|159106436|gb|EDP45319.1| hypothetical protein MGL_0308 [Malassezia globosa CBS 7966]
          Length = 276

 Score = 39.3 bits (90), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 47/101 (46%), Gaps = 29/101 (28%)

Query: 19  DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVF-----------RSGLLHPQPR------ 61
           D + G + P G +       +L++ALC++ R+             SG L  + R      
Sbjct: 143 DARDGARGPVGMVR------ALALALCHMHRLSTVLSESTTTDPHSGALGARTRRGRNTS 196

Query: 62  ----ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID--SCY 96
               IL L  +PD   QYV +MN IFSAQ+  + +D  SC+
Sbjct: 197 FLQRILVLSATPDVSAQYVPMMNCIFSAQKQGIQVDARSCF 237


>gi|84997465|ref|XP_953454.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65304450|emb|CAI76829.1| hypothetical protein, conserved [Theileria annulata]
          Length = 391

 Score = 39.3 bits (90), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 63/164 (38%), Gaps = 37/164 (22%)

Query: 62  ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGG------- 114
           I  +  S +   QY+ +MN  +SA    + I++  LG Q S  L+Q S IT         
Sbjct: 209 IFDISNSENYKSQYIGLMNIAYSALSQNITINTFALG-QPSRILEQLSAITNAKYLLVSK 267

Query: 115 -------VHHKPQQLDGLFQYLL--TIFGTDLHSRNFLQ-LPKPVGVDF--RASCFCHKN 162
                   +H  Q L  L   L+   I    L S    + L   +  DF     C+CH  
Sbjct: 268 LLNFDLNFNHINQYLTQLITVLILFIILFWYLPSEGMSEMLSTNLSFDFGNVGICYCHYK 327

Query: 163 TIDMGYICSVCLS----------IYCKHLKK-------CSTCGS 189
           ++D+ Y+C  C +          +YC  +         C  CGS
Sbjct: 328 SVDVTYLCPCCFAGNFYSLIRFIVYCSEIDDKGKYRIICMVCGS 371


>gi|195389498|ref|XP_002053413.1| GJ23349 [Drosophila virilis]
 gi|194151499|gb|EDW66933.1| GJ23349 [Drosophila virilis]
          Length = 490

 Score = 38.9 bits (89), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 28/62 (45%)

Query: 114 GVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC 173
           G +      +GL   +L  F         L  PK   +D RA C+CH   I++G  CS C
Sbjct: 9   GFYFDTSAFEGLCMNVLCQFLASTQRVYHLNYPKQSPMDLRALCYCHNKRIEIGRACSNC 68

Query: 174 LS 175
           LS
Sbjct: 69  LS 70


>gi|413935397|gb|AFW69948.1| hypothetical protein ZEAMMB73_462205, partial [Zea mays]
          Length = 22

 Score = 38.9 bits (89), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 16/22 (72%), Positives = 20/22 (90%)

Query: 166 MGYICSVCLSIYCKHLKKCSTC 187
           MGY+CSVCLSI+C++ K CSTC
Sbjct: 1   MGYVCSVCLSIFCRNQKTCSTC 22


>gi|403347082|gb|EJY72956.1| RNA polymerase II transcription factor B subunit 4 [Oxytricha
           trifallax]
          Length = 149

 Score = 38.5 bits (88), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 39/78 (50%), Gaps = 20/78 (25%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL-------------GAQ------- 100
           RI+ ++   + P QY +++NAIF+AQ+  + +DS  +             G Q       
Sbjct: 32  RIMIVKFQSEDPTQYSSLVNAIFAAQKLKISVDSLIIQYAPPQAIQEPQDGKQQPASQDE 91

Query: 101 NSAFLQQASYITGGVHHK 118
            S+ LQQA+ +T G  +K
Sbjct: 92  ESSLLQQAALLTNGTSYK 109


>gi|336271807|ref|XP_003350661.1| hypothetical protein SMAC_02333 [Sordaria macrospora k-hell]
 gi|380094822|emb|CCC07324.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 409

 Score = 38.1 bits (87), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 30/48 (62%), Gaps = 1/48 (2%)

Query: 88  SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGT 135
           + + ID+  L   ++ FLQQASYIT G   + Q+  GL QYL+  FG+
Sbjct: 234 ARIAIDTLALRG-SATFLQQASYITRGTFIRAQEPRGLLQYLMFGFGS 280


>gi|291000862|ref|XP_002682998.1| RNA polymerase II transcription factor TFIIH subunit [Naegleria
           gruberi]
 gi|284096626|gb|EFC50254.1| RNA polymerase II transcription factor TFIIH subunit [Naegleria
           gruberi]
          Length = 349

 Score = 37.7 bits (86), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 18/32 (56%)

Query: 157 CFCHKNTIDMGYICSVCLSIYCKHLKKCSTCG 188
           C CHK     GYIC  C S YC    +CSTCG
Sbjct: 240 CLCHKKITPGGYICPRCKSKYCDLPVECSTCG 271


>gi|308159753|gb|EFO62274.1| TFIIH P34 [Giardia lamblia P15]
          Length = 279

 Score = 37.7 bits (86), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 62/148 (41%), Gaps = 22/148 (14%)

Query: 61  RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID--SCYLGAQNSAFLQQASYITGGVHH- 117
           RIL +  S     + V   +A+FSA+R  V ID  S       + FL+Q +Y+T      
Sbjct: 133 RILIITASSVQDSEVVN--SALFSAKRFNVKIDIISLTTAPDVNIFLKQVAYLTASKFSN 190

Query: 118 --KPQQLDGLFQYLLT-IF-GTDLHSRNFLQLPK--PVGV-------DFRASC-FCHKNT 163
             KP+       Y  T IF  +       LQL     VGV       DF   C  CH N 
Sbjct: 191 VPKPKTSASSVSYFTTHIFKASPAAEWRSLQLNDGASVGVYHREKRLDFMGECSVCHTN- 249

Query: 164 IDMGYICSVCLSIYCKHLKK--CSTCGS 189
           I  G  C+VC ++ C    K  C TC +
Sbjct: 250 ISHGITCAVCFTLQCDACFKGSCPTCST 277


>gi|118401911|ref|XP_001033275.1| hypothetical protein TTHERM_00420200 [Tetrahymena thermophila]
 gi|89287623|gb|EAR85612.1| hypothetical protein TTHERM_00420200 [Tetrahymena thermophila
           SB210]
          Length = 351

 Score = 37.4 bits (85), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 35/69 (50%), Gaps = 12/69 (17%)

Query: 124 GLFQYL--LTIFGTDLHSRNFLQLPK-PV-------GVDFRASCFCHKNTID--MGYICS 171
            LF Y+  L +   D+ ++   QL K PV        V FR +C CH   +D  + Y+CS
Sbjct: 270 DLFNYISILMLTSGDMLAKPEQQLTKNPVFRPAINEKVQFRVNCSCHNTPMDSDIAYVCS 329

Query: 172 VCLSIYCKH 180
            CL +YCK 
Sbjct: 330 DCLFLYCKQ 338


>gi|443898660|dbj|GAC75994.1| vacuolar sorting protein VPS36 [Pseudozyma antarctica T-34]
          Length = 804

 Score = 36.2 bits (82), Expect = 9.1,   Method: Composition-based stats.
 Identities = 16/47 (34%), Positives = 24/47 (51%)

Query: 167 GYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRKTTDA 213
           G  C+VC  +    + KC  CGS  G +  +    S+ +  RK+TDA
Sbjct: 334 GLACNVCTFLNHPSMDKCEMCGSSLGNSWPERPSTSSLSTSRKSTDA 380


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.323    0.136    0.419 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,302,014,962
Number of Sequences: 23463169
Number of extensions: 133501140
Number of successful extensions: 243393
Number of sequences better than 100.0: 393
Number of HSP's better than 100.0 without gapping: 376
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 242480
Number of HSP's gapped (non-prelim): 430
length of query: 213
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 77
effective length of database: 9,168,204,383
effective search space: 705951737491
effective search space used: 705951737491
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 73 (32.7 bits)