BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 028156
(213 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|18394597|ref|NP_564050.1| transcription initiation factor TFIIH subunit H3 [Arabidopsis
thaliana]
gi|21537277|gb|AAM61618.1| unknown [Arabidopsis thaliana]
gi|92856638|gb|ABE77412.1| At1g18340 [Arabidopsis thaliana]
gi|332191584|gb|AEE29705.1| transcription initiation factor TFIIH subunit H3 [Arabidopsis
thaliana]
Length = 301
Score = 347 bits (891), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 165/210 (78%), Positives = 185/210 (88%), Gaps = 2/210 (0%)
Query: 1 MPSLCATLLQNLEEFMNKDEQLGKQE-PEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ 59
MP++ +LL+ LEEF+ KDE+L K+E E RI LLSGSLSMALCYIQRVFRSG LHPQ
Sbjct: 89 MPAIFGSLLKKLEEFVTKDEELSKEEVSEDRIPSCLLSGSLSMALCYIQRVFRSGHLHPQ 148
Query: 60 PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 119
PRILCLQGSPDGPEQYVA+MN+IFSAQR MVPIDSCY+G QNSAFLQQASYITGGVHH P
Sbjct: 149 PRILCLQGSPDGPEQYVAVMNSIFSAQRLMVPIDSCYIGVQNSAFLQQASYITGGVHHTP 208
Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 179
+QLDGLFQYL TIF TDLHSR F+QLPKP+GVDFRASCFCHK TIDMGYICSVCLSI+C+
Sbjct: 209 KQLDGLFQYLTTIFATDLHSRGFVQLPKPIGVDFRASCFCHKKTIDMGYICSVCLSIFCE 268
Query: 180 HLKKCSTCGSVFGQAQTQSDEPSATNRKRK 209
H KKCSTCGSVFGQ++ D SA+++KRK
Sbjct: 269 HHKKCSTCGSVFGQSKLD-DASSASDKKRK 297
>gi|297850232|ref|XP_002892997.1| hypothetical protein ARALYDRAFT_472054 [Arabidopsis lyrata subsp.
lyrata]
gi|297338839|gb|EFH69256.1| hypothetical protein ARALYDRAFT_472054 [Arabidopsis lyrata subsp.
lyrata]
Length = 301
Score = 345 bits (884), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 163/210 (77%), Positives = 185/210 (88%), Gaps = 2/210 (0%)
Query: 1 MPSLCATLLQNLEEFMNKDEQLGKQE-PEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ 59
MP+L +LL+ LE+F+ KDE+L ++E E RI LLSGSLSMALCYIQRVFRSG LHPQ
Sbjct: 89 MPALFGSLLKKLEDFVTKDEELSREEVSEDRIPSCLLSGSLSMALCYIQRVFRSGHLHPQ 148
Query: 60 PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 119
PRILCLQGSPDGPEQYVA+MN+IFSAQR MVPIDSCY+G QNSAFLQQASYITGGVHH P
Sbjct: 149 PRILCLQGSPDGPEQYVAVMNSIFSAQRLMVPIDSCYIGVQNSAFLQQASYITGGVHHTP 208
Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 179
+QLDGLFQYL TIF TDLHSR+F+QLPKP+GVDFRASCFCHK TIDMGYICSVCLSI+C+
Sbjct: 209 KQLDGLFQYLTTIFATDLHSRSFVQLPKPIGVDFRASCFCHKKTIDMGYICSVCLSIFCE 268
Query: 180 HLKKCSTCGSVFGQAQTQSDEPSATNRKRK 209
H KKCSTCGSVFGQ++ D S +++KRK
Sbjct: 269 HHKKCSTCGSVFGQSKLD-DASSVSDKKRK 297
>gi|225459534|ref|XP_002284465.1| PREDICTED: general transcription factor IIH subunit 3 [Vitis
vinifera]
gi|302141830|emb|CBI19033.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 337 bits (865), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 173/212 (81%), Positives = 187/212 (88%), Gaps = 2/212 (0%)
Query: 1 MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP 60
MP+LC+ LLQ LEEF+ DE+L K+ I SLLSGSLSMALCYIQRVFR+G LHPQP
Sbjct: 87 MPALCSNLLQKLEEFVTGDEKLSKEVLAAGIGSSLLSGSLSMALCYIQRVFRTGPLHPQP 146
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RILCLQGSPDGPEQYVA+MNAIFSAQRSMVPIDSC +GAQ+SAFLQQASYITGGV+ KPQ
Sbjct: 147 RILCLQGSPDGPEQYVAVMNAIFSAQRSMVPIDSCVIGAQHSAFLQQASYITGGVYLKPQ 206
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
QLDGLFQYL T+F TDLHSR FLQLPKP GVDFRASCFCHKNTIDMGYICSVCLSI+CKH
Sbjct: 207 QLDGLFQYLSTVFATDLHSRRFLQLPKPAGVDFRASCFCHKNTIDMGYICSVCLSIFCKH 266
Query: 181 LKKCSTCGSVFGQAQTQSDEPSATNRKRKTTD 212
KKCSTCGSVFGQA QSD SAT+RKRKT +
Sbjct: 267 HKKCSTCGSVFGQA--QSDGNSATDRKRKTPE 296
>gi|224084866|ref|XP_002307429.1| predicted protein [Populus trichocarpa]
gi|222856878|gb|EEE94425.1| predicted protein [Populus trichocarpa]
Length = 289
Score = 335 bits (858), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 169/213 (79%), Positives = 185/213 (86%), Gaps = 2/213 (0%)
Query: 1 MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP 60
MPSL + LLQ LEEFM KDE+LGK++ + I SLLSGSLSMALCYIQRVFRSG LHPQP
Sbjct: 79 MPSLYSNLLQKLEEFMIKDEKLGKEQSQRAIKSSLLSGSLSMALCYIQRVFRSGPLHPQP 138
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RILCLQGSPDGPEQYVA+MNAIFSAQRSMVPIDSCY+GA NSAFLQQASYITGGV+ KPQ
Sbjct: 139 RILCLQGSPDGPEQYVAVMNAIFSAQRSMVPIDSCYVGAHNSAFLQQASYITGGVYVKPQ 198
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
LDGLFQYL T+F TDLHSR+F+QLP+P GVDFRASCFCHK TIDMGYICSVCLSI+C H
Sbjct: 199 HLDGLFQYLTTVFATDLHSRSFIQLPRPAGVDFRASCFCHKTTIDMGYICSVCLSIFCNH 258
Query: 181 LKKCSTCGSVFGQAQTQSDEPSATNRKRKTTDA 213
KKCSTCGSVFGQA QSD S ++ KRK +
Sbjct: 259 HKKCSTCGSVFGQA--QSDTSSTSDLKRKAPET 289
>gi|449470273|ref|XP_004152842.1| PREDICTED: general transcription factor IIH subunit 3-like [Cucumis
sativus]
Length = 295
Score = 330 bits (846), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 166/212 (78%), Positives = 184/212 (86%), Gaps = 2/212 (0%)
Query: 1 MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP 60
MP+LC LL+NLEEF+ DEQ K++P+G SLLSGSLSMALCYIQ+VFRSG LHPQP
Sbjct: 86 MPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSLLSGSLSMALCYIQKVFRSGSLHPQP 145
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCY+G+ NSAFLQQASYITGGV+ KPQ
Sbjct: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQ 205
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
Q+DGLFQYL T+FGTDLHSR FLQLPK VGVDFRASCFCHK TIDMGY+CSVCLSI+CKH
Sbjct: 206 QMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKH 265
Query: 181 LKKCSTCGSVFGQAQTQSDEPSATNRKRKTTD 212
KKCSTCGSVFG+ + D S + KRKT +
Sbjct: 266 HKKCSTCGSVFGETPVELD--SVSKLKRKTPE 295
>gi|357461529|ref|XP_003601046.1| General transcription factor IIH subunit [Medicago truncatula]
gi|355490094|gb|AES71297.1| General transcription factor IIH subunit [Medicago truncatula]
Length = 340
Score = 298 bits (762), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 151/254 (59%), Positives = 177/254 (69%), Gaps = 47/254 (18%)
Query: 1 MPSLCATLLQNLEEFMNKDEQLGK-QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ 59
MP++ + LL NL+EF+ KD+QL G I SLLSG+LSMALCYIQR FRSG +HPQ
Sbjct: 87 MPAIYSNLLHNLDEFVAKDQQLTTPHHKPGTIPSSLLSGALSMALCYIQRAFRSGPMHPQ 146
Query: 60 PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 119
PRILCLQGS DGPEQYVAIMNAIFSAQRS+VP+DSC++G+ NSAFLQQASYITGG++++P
Sbjct: 147 PRILCLQGSTDGPEQYVAIMNAIFSAQRSVVPVDSCFIGSNNSAFLQQASYITGGIYYRP 206
Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 179
Q++GLFQYL T+F TDLHSR FL+LPK +GVDFRASCFCHK TIDMGY+CSVCLSI+C+
Sbjct: 207 PQMEGLFQYLSTVFATDLHSREFLRLPKSLGVDFRASCFCHKQTIDMGYVCSVCLSIFCE 266
Query: 180 HLKKCSTC--------------------------------------------GSVFGQAQ 195
H KCSTC SVFG Q
Sbjct: 267 HHDKCSTCSFTIYDASNECKCSLFADCSNQFNLAEIFPLSAVFFNFFYSFIISSVFG--Q 324
Query: 196 TQSDEPSATNRKRK 209
TQS+ S NRKRK
Sbjct: 325 TQSEAASEENRKRK 338
>gi|356509424|ref|XP_003523449.1| PREDICTED: general transcription factor IIH subunit 3-like [Glycine
max]
Length = 294
Score = 297 bits (761), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 152/211 (72%), Positives = 174/211 (82%), Gaps = 5/211 (2%)
Query: 1 MPSLCATLLQNLEEFMNKDEQLGKQEPE--GRIACSLLSGSLSMALCYIQRVFRSGLLHP 58
MP+L + LL NL+EF+ +D QL G + SLLSGSLSMALCYIQR FRSG + P
Sbjct: 86 MPALYSNLLHNLDEFLARDRQLDADASHAPGIVPSSLLSGSLSMALCYIQRAFRSGPMPP 145
Query: 59 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
QPRILCLQG+ DGPEQYVAIMNAIFSAQ S VPIDSCY+G+ NSAFLQQASYITGG+++K
Sbjct: 146 QPRILCLQGAADGPEQYVAIMNAIFSAQHSTVPIDSCYIGSNNSAFLQQASYITGGIYYK 205
Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
P QLDGL+QYL T+F TDLHSR FL+LPK VGVDFRASCFCHK TIDMGY+CSVCLSI+C
Sbjct: 206 PPQLDGLYQYLSTVFATDLHSRAFLRLPKSVGVDFRASCFCHKQTIDMGYVCSVCLSIFC 265
Query: 179 KHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 209
+H KCSTCGSVFGQAQ + SA +R+RK
Sbjct: 266 EHHDKCSTCGSVFGQAQL---DASAADRQRK 293
>gi|363806998|ref|NP_001242062.1| uncharacterized protein LOC100776751 [Glycine max]
gi|255647869|gb|ACU24393.1| unknown [Glycine max]
Length = 294
Score = 294 bits (752), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 150/211 (71%), Positives = 173/211 (81%), Gaps = 5/211 (2%)
Query: 1 MPSLCATLLQNLEEFMNKDEQLGKQEPE--GRIACSLLSGSLSMALCYIQRVFRSGLLHP 58
MP+L + LL NL+EF+ +D QL G + SLLSGSLSMALCYIQR FRSG + P
Sbjct: 86 MPALYSNLLHNLDEFLARDRQLDADAAHAPGTVPSSLLSGSLSMALCYIQRAFRSGPMPP 145
Query: 59 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
QPRILCL+G DGPEQYVAIMNAIFSAQ S VPIDSCY+G+ NSAFLQQASYITGG+++K
Sbjct: 146 QPRILCLRGVADGPEQYVAIMNAIFSAQHSTVPIDSCYIGSNNSAFLQQASYITGGIYYK 205
Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
P QLDGL+QYL T+F TDL+SR FL+LPK VGVDFRASCFCHK TIDMGY+CSVCLSI+C
Sbjct: 206 PPQLDGLYQYLSTVFATDLYSRAFLRLPKSVGVDFRASCFCHKQTIDMGYVCSVCLSIFC 265
Query: 179 KHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 209
+H KCSTCGSVFGQAQ + SA +R+RK
Sbjct: 266 EHHDKCSTCGSVFGQAQL---DASAADRQRK 293
>gi|294464556|gb|ADE77788.1| unknown [Picea sitchensis]
Length = 306
Score = 293 bits (749), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 138/200 (69%), Positives = 161/200 (80%), Gaps = 5/200 (2%)
Query: 8 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 67
++Q LE+F+ +D+ L + SLLSGSLSMALCYIQR+ R G HPQPRILCLQG
Sbjct: 111 IVQQLEDFVRRDQALNTNNNATKNIHSLLSGSLSMALCYIQRIIRGGPSHPQPRILCLQG 170
Query: 68 SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
SPDGP+QYVAIMNAIFSAQRSMVP+DSC +G+Q+SAFLQQAS+ITGG++ KPQQL+GLFQ
Sbjct: 171 SPDGPQQYVAIMNAIFSAQRSMVPVDSCIVGSQHSAFLQQASHITGGIYLKPQQLEGLFQ 230
Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
+L +F TDLHSR FLQLP+P GVDFRASCFCHK TIDMGY+CSVCLSI+C+H KKCSTC
Sbjct: 231 FLAMVFATDLHSRRFLQLPRPTGVDFRASCFCHKKTIDMGYVCSVCLSIFCEHSKKCSTC 290
Query: 188 GSVFGQAQTQSDEPSATNRK 207
S F Q PS RK
Sbjct: 291 ESTFNQ-----KPPSGQKRK 305
>gi|357146071|ref|XP_003573866.1| PREDICTED: general transcription factor IIH subunit 3-like
[Brachypodium distachyon]
Length = 290
Score = 288 bits (738), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 138/197 (70%), Positives = 160/197 (81%), Gaps = 5/197 (2%)
Query: 2 PSLCATLLQ---NLEEFMNKD--EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL 56
P + AT + +EEF+ +D + G G A SL SG+LS+ALCYIQRVFRSG
Sbjct: 82 PDVAATFAKASRKVEEFVKQDARDTAGNGTVAGGGAASLFSGALSLALCYIQRVFRSGTR 141
Query: 57 HPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
HPQPRILCLQGSPDGPEQYVA+MN+IFSAQRSMVPIDSC +G Q+SAFLQQASYITGGV+
Sbjct: 142 HPQPRILCLQGSPDGPEQYVAVMNSIFSAQRSMVPIDSCIVGTQDSAFLQQASYITGGVY 201
Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 176
KPQ+L GLFQYL +F +DLHSR FL+LPK +GVDFRASCFCHK TIDMGY+CSVCLSI
Sbjct: 202 LKPQELSGLFQYLAAVFASDLHSRAFLRLPKTLGVDFRASCFCHKKTIDMGYVCSVCLSI 261
Query: 177 YCKHLKKCSTCGSVFGQ 193
+CK+ KKCSTCGS F +
Sbjct: 262 FCKYHKKCSTCGSEFSR 278
>gi|212722318|ref|NP_001131186.1| uncharacterized protein LOC100192494 [Zea mays]
gi|194688300|gb|ACF78234.1| unknown [Zea mays]
gi|194690816|gb|ACF79492.1| unknown [Zea mays]
gi|224028537|gb|ACN33344.1| unknown [Zea mays]
gi|413926714|gb|AFW66646.1| hypothetical protein ZEAMMB73_940692 [Zea mays]
Length = 291
Score = 286 bits (733), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 134/187 (71%), Positives = 157/187 (83%), Gaps = 2/187 (1%)
Query: 10 QNLEEFMNKDEQL--GKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 67
+ EEF+ +D + G A SLLSG+LS++LCYIQR+FRSG HPQPRILCLQG
Sbjct: 93 RKAEEFIAQDARATAGNGSVASANAASLLSGALSISLCYIQRIFRSGSRHPQPRILCLQG 152
Query: 68 SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
SPDGPEQYVA+MN+IFSAQRSMVPIDSC +G Q+SAFLQQASYITGGV+ KPQ+L+GLFQ
Sbjct: 153 SPDGPEQYVAVMNSIFSAQRSMVPIDSCIVGTQDSAFLQQASYITGGVYLKPQELNGLFQ 212
Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
YL +F TDLHSR FL+LPK +GVDFRASCFCHK TIDMGY+CS+CLSI+CK+ KKCSTC
Sbjct: 213 YLAAVFATDLHSRAFLRLPKTLGVDFRASCFCHKKTIDMGYVCSICLSIFCKNQKKCSTC 272
Query: 188 GSVFGQA 194
GS F +A
Sbjct: 273 GSEFSRA 279
>gi|326513394|dbj|BAK06937.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 292
Score = 285 bits (728), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 133/186 (71%), Positives = 155/186 (83%), Gaps = 2/186 (1%)
Query: 10 QNLEEFMNKD--EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 67
+ LEE + +D E +G A SL SG+LS+ALCYIQR+FRSG HPQPRILC+QG
Sbjct: 93 RKLEELIKQDARETVGNGTVADGGAASLFSGALSLALCYIQRIFRSGSRHPQPRILCVQG 152
Query: 68 SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
SPDGPEQYVA+MN+IFSAQRSMVPID+C +G Q+SAFLQQASYITGGV+ KPQ+L GLFQ
Sbjct: 153 SPDGPEQYVAVMNSIFSAQRSMVPIDTCIVGTQDSAFLQQASYITGGVYMKPQELSGLFQ 212
Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
YL +F TDLHSR FL+LPK +GVDFRASCFCHK TIDMGY+CSVCLSI+CK+ KKCSTC
Sbjct: 213 YLAAVFATDLHSRTFLRLPKTLGVDFRASCFCHKKTIDMGYVCSVCLSIFCKYHKKCSTC 272
Query: 188 GSVFGQ 193
GS F +
Sbjct: 273 GSEFSR 278
>gi|255584434|ref|XP_002532948.1| RNA polymerase II transcription factor B subunit, putative [Ricinus
communis]
gi|223527277|gb|EEF29432.1| RNA polymerase II transcription factor B subunit, putative [Ricinus
communis]
Length = 266
Score = 277 bits (708), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 137/172 (79%), Positives = 156/172 (90%), Gaps = 1/172 (0%)
Query: 1 MPSLCATLLQNLEEFMNKDEQLGK-QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ 59
+P+L + LLQ LE+FM +DE+LGK +E + +IA SLLSGSLSMALCYIQR+FRSG LHPQ
Sbjct: 86 IPALYSDLLQKLEDFMIRDEKLGKGEESKRKIASSLLSGSLSMALCYIQRIFRSGPLHPQ 145
Query: 60 PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 119
PRILCLQGSPDGPEQYVA+MN+IFSAQRSMVPIDSCY+G NSAFLQQASYITGGV+ KP
Sbjct: 146 PRILCLQGSPDGPEQYVAVMNSIFSAQRSMVPIDSCYVGGHNSAFLQQASYITGGVYVKP 205
Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICS 171
Q LDGLFQYL+T+F TDLHSR+FLQLP+P GVDFRASCFCHK TIDMG+ICS
Sbjct: 206 QHLDGLFQYLVTVFATDLHSRSFLQLPRPAGVDFRASCFCHKTTIDMGFICS 257
>gi|223946567|gb|ACN27367.1| unknown [Zea mays]
gi|413926715|gb|AFW66647.1| hypothetical protein ZEAMMB73_940692 [Zea mays]
Length = 294
Score = 277 bits (708), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 133/190 (70%), Positives = 156/190 (82%), Gaps = 5/190 (2%)
Query: 10 QNLEEFMNKDEQL--GKQEPEGRIACSLLSGSLSMALC---YIQRVFRSGLLHPQPRILC 64
+ EEF+ +D + G A SLLSG+LS++LC IQR+FRSG HPQPRILC
Sbjct: 93 RKAEEFIAQDARATAGNGSVASANAASLLSGALSISLCCYLDIQRIFRSGSRHPQPRILC 152
Query: 65 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
LQGSPDGPEQYVA+MN+IFSAQRSMVPIDSC +G Q+SAFLQQASYITGGV+ KPQ+L+G
Sbjct: 153 LQGSPDGPEQYVAVMNSIFSAQRSMVPIDSCIVGTQDSAFLQQASYITGGVYLKPQELNG 212
Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 184
LFQYL +F TDLHSR FL+LPK +GVDFRASCFCHK TIDMGY+CS+CLSI+CK+ KKC
Sbjct: 213 LFQYLAAVFATDLHSRAFLRLPKTLGVDFRASCFCHKKTIDMGYVCSICLSIFCKNQKKC 272
Query: 185 STCGSVFGQA 194
STCGS F +A
Sbjct: 273 STCGSEFSRA 282
>gi|115443937|ref|NP_001045748.1| Os02g0125800 [Oryza sativa Japonica Group]
gi|41053068|dbj|BAD08012.1| putative TFIIH basal transcription factor complex p34 subunit
[Oryza sativa Japonica Group]
gi|113535279|dbj|BAF07662.1| Os02g0125800 [Oryza sativa Japonica Group]
gi|215768495|dbj|BAH00724.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 289
Score = 267 bits (682), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 121/149 (81%), Positives = 135/149 (90%)
Query: 45 CYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAF 104
CYIQRVFRSG HPQPRILCLQGSPDGPEQYVA+MN+IFSAQRSMVPIDSC +G Q+SAF
Sbjct: 128 CYIQRVFRSGTRHPQPRILCLQGSPDGPEQYVAVMNSIFSAQRSMVPIDSCIVGTQDSAF 187
Query: 105 LQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTI 164
LQQASYITGGV+ KPQ+L+GLFQYL +F TDLHSR FL+LPK +GVDFRASCFCHK TI
Sbjct: 188 LQQASYITGGVYLKPQELNGLFQYLAAVFATDLHSRTFLRLPKTLGVDFRASCFCHKKTI 247
Query: 165 DMGYICSVCLSIYCKHLKKCSTCGSVFGQ 193
DMGY+CSVCLSI+CK+ KKCSTCGS F +
Sbjct: 248 DMGYVCSVCLSIFCKYHKKCSTCGSEFNR 276
>gi|168000809|ref|XP_001753108.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695807|gb|EDQ82149.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 257
Score = 265 bits (678), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 124/202 (61%), Positives = 162/202 (80%), Gaps = 6/202 (2%)
Query: 8 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 67
+ ++++EF++K+E+ K+ + I S +SGSLSMALCYIQRV R L HPQ RILCLQG
Sbjct: 58 ITKSMQEFVSKEEEASKESSQ--IQYSNISGSLSMALCYIQRVLRGPLPHPQARILCLQG 115
Query: 68 SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
SPD P QY+++MN+IFSAQR+ V ID+C +G+Q+SAFLQQA++ITGG++ KP Q DGLF+
Sbjct: 116 SPDAPYQYISVMNSIFSAQRNGVVIDACVVGSQHSAFLQQATHITGGIYMKPPQPDGLFE 175
Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
YL+ +F TDL SR FLQLP+PVGVDFRASCFCHK TIDMG++CSVCLSI+CKH + CSTC
Sbjct: 176 YLMMVFTTDLFSRQFLQLPRPVGVDFRASCFCHKKTIDMGFVCSVCLSIFCKHHQTCSTC 235
Query: 188 GSVFGQAQTQSDEPSATNRKRK 209
G+ F ++S+ + T KRK
Sbjct: 236 GATF----SRSNSTTVTGMKRK 253
>gi|384252816|gb|EIE26291.1| transcription factor Tfb4 [Coccomyxa subellipsoidea C-169]
Length = 305
Score = 207 bits (526), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 92/162 (56%), Positives = 125/162 (77%), Gaps = 4/162 (2%)
Query: 34 SLLSGSLSMALCYIQRVFRSGL----LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
++L+G+LS ALC I + + QPR+LCL+GSPD EQY+++MNAIF+AQR
Sbjct: 126 AVLAGALSRALCLINKAQLASAGGSRTRKQPRLLCLKGSPDATEQYISVMNAIFAAQRCE 185
Query: 90 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
V ID+C LGA +SAFLQQA+++T G++ +P+ D L QYLLTIF D SR+FLQLP+P+
Sbjct: 186 VVIDACMLGASDSAFLQQAAHLTNGIYLRPKHKDALLQYLLTIFCGDAFSRSFLQLPRPI 245
Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
GVDFRA+CFCH+ ID+GY+CSVCLSI+C+ + +CSTCG+ F
Sbjct: 246 GVDFRAACFCHRRAIDLGYVCSVCLSIFCESITECSTCGTTF 287
>gi|449524543|ref|XP_004169281.1| PREDICTED: general transcription factor IIH subunit 3-like, partial
[Cucumis sativus]
Length = 121
Score = 200 bits (509), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 93/118 (78%), Positives = 102/118 (86%)
Query: 90 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
VPIDSCY+G+ NSAFLQQASYITGGV+ KPQQ+DGLFQYL T+FGTDLHSR FLQLPK V
Sbjct: 1 VPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSV 60
Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRK 207
GVDFRASCFCHK TIDMGY+CSVCLSI+CKH KKCSTCGSVFG+ + D S RK
Sbjct: 61 GVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCGSVFGETPVELDSVSKLKRK 118
>gi|218189967|gb|EEC72394.1| hypothetical protein OsI_05675 [Oryza sativa Indica Group]
gi|222622090|gb|EEE56222.1| hypothetical protein OsJ_05207 [Oryza sativa Japonica Group]
Length = 261
Score = 197 bits (500), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 95/149 (63%), Positives = 107/149 (71%), Gaps = 28/149 (18%)
Query: 45 CYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAF 104
CYIQRVFRSG HPQPR VPIDSC +G Q+SAF
Sbjct: 128 CYIQRVFRSGTRHPQPR----------------------------VPIDSCIVGTQDSAF 159
Query: 105 LQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTI 164
LQQASYITGGV+ KPQ+L+GLFQYL +F TDLHSR FL+LPK +GVDFRASCFCHK TI
Sbjct: 160 LQQASYITGGVYLKPQELNGLFQYLAAVFATDLHSRTFLRLPKTLGVDFRASCFCHKKTI 219
Query: 165 DMGYICSVCLSIYCKHLKKCSTCGSVFGQ 193
DMGY+CSVCLSI+CK+ KKCSTCGS F +
Sbjct: 220 DMGYVCSVCLSIFCKYHKKCSTCGSEFNR 248
>gi|330789994|ref|XP_003283083.1| hypothetical protein DICPUDRAFT_9126 [Dictyostelium purpureum]
gi|325086950|gb|EGC40332.1| hypothetical protein DICPUDRAFT_9126 [Dictyostelium purpureum]
Length = 282
Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 89/184 (48%), Positives = 116/184 (63%), Gaps = 9/184 (4%)
Query: 9 LQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGS 68
LQ + +N D+Q I S S +LSMALCYI R+ R + +PRIL S
Sbjct: 105 LQKINIEVNNDQQ--------DIINSSFSAALSMALCYINRI-RKESPNLKPRILVFNIS 155
Query: 69 PDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQY 128
PD QY+++MN IFSAQ+ +P+DSC L +S FLQQAS++T G++ KP + + L QY
Sbjct: 156 PDVSSQYISVMNCIFSAQKQSIPVDSCILSQTDSTFLQQASHLTNGIYLKPHKQELLSQY 215
Query: 129 LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCG 188
LLT F D SR L P VD+RASCFCHK +D+G++CSVCLSI+C H CSTCG
Sbjct: 216 LLTTFLLDTQSRKSLAYPNLKSVDYRASCFCHKKIVDIGFVCSVCLSIFCNHFSSCSTCG 275
Query: 189 SVFG 192
+ F
Sbjct: 276 TKFS 279
>gi|145340718|ref|XP_001415467.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575690|gb|ABO93759.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 285
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 89/181 (49%), Positives = 116/181 (64%), Gaps = 4/181 (2%)
Query: 27 PEGRIACSL-LSGSLSMALCYIQR---VFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAI 82
P R A S +G+LS+ALCY R + + L +PRILCLQ S D P Y+++MNAI
Sbjct: 105 PSSRAAMSSPFAGALSLALCYCNRAQTLETAAGLRVRPRILCLQASQDNPTDYISMMNAI 164
Query: 83 FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 142
FSAQR +PID+ LG + FLQQA++IT G + KP GL QYLL+ D+ SR+
Sbjct: 165 FSAQRQSIPIDAFALGEHDLPFLQQAAHITRGAYVKPTHGAGLLQYLLSTAALDMRSRSH 224
Query: 143 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPS 202
L+LP GVDFRASCFCHK + +G++CSVCLSI+C+ C TCG+ F + PS
Sbjct: 225 LKLPAARGVDFRASCFCHKRPVSVGFVCSVCLSIFCERRSSCDTCGADFAADAQVTSVPS 284
Query: 203 A 203
A
Sbjct: 285 A 285
>gi|6714290|gb|AAF25986.1|AC013354_5 F15H18.15 [Arabidopsis thaliana]
Length = 590
Score = 172 bits (436), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 105/215 (48%), Positives = 133/215 (61%), Gaps = 40/215 (18%)
Query: 1 MPSLCATLLQNLEEFMNKDEQLGKQE-PEGRI-ACSLLSGSLSMALCYIQRVFRSGLLHP 58
MP++ +LL+ LEEF+ KDE+L K+E E RI +C L G++ C+ F + +
Sbjct: 97 MPAIFGSLLKKLEEFVTKDEELSKEEVSEDRIPSCLLWPGTV----CFFLHFFSTCKI-- 150
Query: 59 QPRILCLQGSPD---GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGV 115
+ CL G +YVA+MN+IFSAQR M + Y A NS
Sbjct: 151 ---LSCLSGLIKRFCHSFRYVAVMNSIFSAQRLM---RTQYSMAANS------------- 191
Query: 116 HHKPQQLDGLFQYL-LTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCL 174
GLF + TIF TDLHSR F+QLPKP+GVDFRASCFCHK TIDMGYICSVCL
Sbjct: 192 --------GLFCFAQKTIFATDLHSRGFVQLPKPIGVDFRASCFCHKKTIDMGYICSVCL 243
Query: 175 SIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 209
SI+C+H KKCSTCGSVFGQ++ D SA+++KRK
Sbjct: 244 SIFCEHHKKCSTCGSVFGQSKLD-DASSASDKKRK 277
>gi|307105142|gb|EFN53393.1| hypothetical protein CHLNCDRAFT_13819, partial [Chlorella
variabilis]
Length = 135
Score = 170 bits (431), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 78/134 (58%), Positives = 101/134 (75%), Gaps = 1/134 (0%)
Query: 59 QP-RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHH 117
QP R+LCL +PD P QY+++MNAIFSAQRS V ID+C LG ++S FLQQA+Y+TGGV+
Sbjct: 2 QPARVLCLTAAPDVPSQYISVMNAIFSAQRSGVLIDACQLGRRHSTFLQQAAYLTGGVYL 61
Query: 118 KPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
KP + L QYL ++F D +R FL++P VDFRASCFCHK ID+GY+CS CLSI+
Sbjct: 62 KPSKPVALVQYLNSVFAVDAATRQFLRMPGTAHVDFRASCFCHKRQIDLGYVCSACLSIF 121
Query: 178 CKHLKKCSTCGSVF 191
C+ L C+TCG+ F
Sbjct: 122 CEQLPACTTCGTEF 135
>gi|328874423|gb|EGG22788.1| general transcription factor IIH [Dictyostelium fasciculatum]
Length = 304
Score = 167 bits (422), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 76/150 (50%), Positives = 104/150 (69%), Gaps = 1/150 (0%)
Query: 42 MALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQN 101
+ALCYI R+ + + RIL SPD QY+ +MN IFSAQ+ +P+DSC L + +
Sbjct: 133 LALCYINRIKKDNST-IRSRILVFNISPDVSTQYIPVMNCIFSAQKQSIPVDSCILTSTD 191
Query: 102 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHK 161
S FLQQAS++T G++ +P + D L QYLL+ F D +SR FL +P VD+RASCFCHK
Sbjct: 192 STFLQQASHLTNGIYLRPHRQDHLGQYLLSSFLIDSYSRKFLNVPTLKTVDYRASCFCHK 251
Query: 162 NTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+D+G++CSVCLSIYCK+ C+TCG+ F
Sbjct: 252 KIVDIGFVCSVCLSIYCKYSSSCTTCGTKF 281
>gi|348676264|gb|EGZ16082.1| hypothetical protein PHYSODRAFT_508027 [Phytophthora sojae]
Length = 298
Score = 164 bits (415), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/185 (45%), Positives = 116/185 (62%), Gaps = 11/185 (5%)
Query: 6 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPRIL 63
A ++Q L++ D L +P R A ++ SLS ALC+I R P +PRIL
Sbjct: 103 AGVMQRLQQL--SDAPLDPAKP-NRTA---IAASLSRALCFINRAINE---EPDLRPRIL 153
Query: 64 CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 123
+Q SPD E Y+AIMN IFSAQ+ V +D+C L ++S+F+QQA+Y+TGG+++KP
Sbjct: 154 VVQKSPDVSEHYIAIMNGIFSAQKKSVAVDACILANEHSSFMQQAAYLTGGIYYKPNDHS 213
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 183
GL QYL++I+ D R L+LP VDFRA CFCH+ I Y+C VCLS++C+
Sbjct: 214 GLLQYLISIYLPDPSMRKLLKLPSQDSVDFRAMCFCHREVISTAYVCPVCLSLFCEFRPI 273
Query: 184 CSTCG 188
CSTCG
Sbjct: 274 CSTCG 278
>gi|301122235|ref|XP_002908844.1| general transcription factor IIH subunit, putative [Phytophthora
infestans T30-4]
gi|262099606|gb|EEY57658.1| general transcription factor IIH subunit, putative [Phytophthora
infestans T30-4]
Length = 293
Score = 162 bits (411), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 81/185 (43%), Positives = 114/185 (61%), Gaps = 11/185 (5%)
Query: 6 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPRIL 63
A +LQ +++ D L P + ++ SLS +LC+I R P +PRIL
Sbjct: 98 AGVLQRMQQL--SDAPLDPATPNQ----TAIAASLSRSLCFINRAINE---EPDLRPRIL 148
Query: 64 CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 123
+Q SPD E Y+AIMN IFSAQ+ V +D+C L ++S+F+QQA+Y+TGG+++KP
Sbjct: 149 VIQKSPDVSEHYIAIMNGIFSAQKKSVAVDACILATEHSSFMQQAAYLTGGIYYKPNDHS 208
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 183
GL QYL++I+ D R L+LP VDFRA CFCH+ I Y+C VCLS++C+
Sbjct: 209 GLLQYLISIYLPDPSMRKLLKLPSQDSVDFRAMCFCHREVISTAYVCPVCLSLFCEFRPI 268
Query: 184 CSTCG 188
CSTCG
Sbjct: 269 CSTCG 273
>gi|290987006|ref|XP_002676214.1| predicted protein [Naegleria gruberi]
gi|284089815|gb|EFC43470.1| predicted protein [Naegleria gruberi]
Length = 346
Score = 159 bits (401), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 80/156 (51%), Positives = 105/156 (67%), Gaps = 2/156 (1%)
Query: 36 LSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
LSG+LSMALC+I R+ + L RIL Q SPD QY+++MNAIFSA++ + +D
Sbjct: 177 LSGALSMALCFINRLEKEKPLGMSLNSRILTFQVSPDISSQYISVMNAIFSAEKMSIMLD 236
Query: 94 SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 153
+C L +S FLQQA ++TGG + KPQ+ +GL QYLLTIF + R+++QLP VDF
Sbjct: 237 ACVLSNDDSTFLQQACFLTGGNYLKPQRQEGLIQYLLTIFMLEKSLRSYIQLPVQNTVDF 296
Query: 154 RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGS 189
RASCF + ID GY+C VCLSI+ H CSTC S
Sbjct: 297 RASCFETRKPIDDGYVCPVCLSIFSSHKPVCSTCAS 332
>gi|328852556|gb|EGG01701.1| hypothetical protein MELLADRAFT_110826 [Melampsora larici-populina
98AG31]
Length = 316
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 78/177 (44%), Positives = 111/177 (62%), Gaps = 11/177 (6%)
Query: 19 DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ----PRILCLQGSPDGPEQ 74
D+ L +P G + +L+ ALC+I RV + P+ PRIL + SPD P Q
Sbjct: 118 DQDLKSGQPTGTV------NALAKALCHINRVTKEET-RPKDSLKPRILIISISPDSPGQ 170
Query: 75 YVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFG 134
Y+ +MN IFSAQ+S +PID C + +++ FLQQASY+T G++++ ++ L QYL IF
Sbjct: 171 YIPMMNCIFSAQKSCIPIDVCKITGEDAVFLQQASYLTNGIYYRLEKPKALIQYLTMIFL 230
Query: 135 TDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+ R L LP+ VD RA+CFCH+ ID+GY+CSVCLSI+C + CSTC + F
Sbjct: 231 PGVSIRKSLNLPQQEEVDLRAACFCHRKIIDLGYVCSVCLSIFCTPIPVCSTCRTKF 287
>gi|325190435|emb|CCA24938.1| general transcription factor IIH subunit putative [Albugo laibachii
Nc14]
Length = 288
Score = 157 bits (398), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 99/145 (68%), Gaps = 1/145 (0%)
Query: 44 LCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSA 103
LC + RV + PRI +Q SPD PE Y++IMN+IFSAQ+ V ID+C L ++S
Sbjct: 125 LCCMNRVLKENA-QLCPRIFVIQRSPDVPEHYISIMNSIFSAQKKTVAIDACILSEEHSP 183
Query: 104 FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 163
F+QQA+Y+TGG+++KP+ +DGL QYL+T++ D + R L+ P VDFRA CFCHK
Sbjct: 184 FMQQATYLTGGIYYKPRVIDGLLQYLITLYLPDPNMRKMLRFPSQESVDFRAMCFCHKKA 243
Query: 164 IDMGYICSVCLSIYCKHLKKCSTCG 188
+ ++C VCLS++C+ CSTCG
Sbjct: 244 VSTAFVCPVCLSLFCQFQPICSTCG 268
>gi|403163144|ref|XP_003323260.2| hypothetical protein PGTG_04797 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163932|gb|EFP78841.2| hypothetical protein PGTG_04797 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 322
Score = 156 bits (395), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 74/156 (47%), Positives = 103/156 (66%), Gaps = 3/156 (1%)
Query: 39 SLSMALCYIQRVFRSGLLHPQ---PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 95
+L+ ALC+I R+ R + RIL L S D P QY+ +MN IFSAQ+S +PID C
Sbjct: 135 ALAKALCHINRLARENENKKESLKARILILSVSSDAPGQYIPMMNCIFSAQKSTIPIDVC 194
Query: 96 YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 155
+ +N+ FLQQA+++T G++++ ++ + QYL IF L +R FL LPK VDFRA
Sbjct: 195 KISKENAVFLQQAAHLTEGIYYQIEKPKAILQYLTMIFLPGLAARKFLNLPKHQEVDFRA 254
Query: 156 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+CFCH+ ID+GY+CSVCLSI+C CSTC + F
Sbjct: 255 ACFCHREIIDVGYVCSVCLSIFCTPTPVCSTCRTKF 290
>gi|66819921|ref|XP_643618.1| general transcription factor IIH, polypeptide 3 [Dictyostelium
discoideum AX4]
gi|74861043|sp|Q86IB5.1|TF2H3_DICDI RecName: Full=General transcription factor IIH subunit 3; AltName:
Full=TFIIH basal transcription factor complex subunit 3
gi|60471553|gb|EAL69509.1| general transcription factor IIH, polypeptide 3 [Dictyostelium
discoideum AX4]
Length = 372
Score = 155 bits (393), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 79/151 (52%), Positives = 102/151 (67%), Gaps = 1/151 (0%)
Query: 42 MALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQN 101
+ALCYI R+ R +PRIL SPD QY+++MN IFS+Q+ +P+DSC L +
Sbjct: 196 IALCYINRIKRETPT-IKPRILVFNISPDVSSQYISVMNCIFSSQKQSIPVDSCILSQSD 254
Query: 102 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHK 161
S FLQQAS++T G++ KPQ+ + L QYLLT F D SR L P VD+RASCFCHK
Sbjct: 255 STFLQQASHLTSGIYLKPQKQELLSQYLLTTFLLDTLSRKSLAYPTLKSVDYRASCFCHK 314
Query: 162 NTIDMGYICSVCLSIYCKHLKKCSTCGSVFG 192
+D+GY+CSVCLSI+C H CSTCG+ F
Sbjct: 315 RIVDIGYVCSVCLSIFCGHSSSCSTCGTKFS 345
>gi|363739821|ref|XP_001235063.2| PREDICTED: general transcription factor IIH subunit 3 [Gallus
gallus]
Length = 307
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 79/184 (42%), Positives = 116/184 (63%), Gaps = 9/184 (4%)
Query: 10 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQG 67
+ +++ M K + +G+Q +LL+GSL+ ALCYI ++ + + + RIL ++
Sbjct: 112 EEIKDLMTKTDMMGQQTE------TLLAGSLAKALCYINKISKEVKANQEMKSRILVIKA 165
Query: 68 SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
+ D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K L L Q
Sbjct: 166 AEDSALQYMNFMNVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGIYLKVPHLPSLLQ 224
Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
YLL +F D R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C CSTC
Sbjct: 225 YLLWVFLPDQEQRSQLVLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICSTC 284
Query: 188 GSVF 191
+ F
Sbjct: 285 ETAF 288
>gi|334327155|ref|XP_001375408.2| PREDICTED: general transcription factor IIH subunit 3-like, partial
[Monodelphis domestica]
Length = 308
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 78/172 (45%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + EG+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 119 MTKSDMEGQHTETLLAGSLAKALCYIHRMSKEVKDNQEMKSRILVIKAAEDSALQYMNFM 178
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L A +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 179 NVIFAAQKQNIVIDACVLDA-DSGLLQQACDITGGIYLKVPQMPSLLQYLLWVFLPDQDQ 237
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 238 RSQLNLPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289
>gi|326929605|ref|XP_003210949.1| PREDICTED: general transcription factor IIH subunit 3-like
[Meleagris gallopavo]
Length = 290
Score = 153 bits (387), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 79/184 (42%), Positives = 116/184 (63%), Gaps = 9/184 (4%)
Query: 10 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQG 67
+ +++ M K + +G+Q +LL+GSL+ ALCYI ++ + + + RIL ++
Sbjct: 95 EEIKDLMTKTDMMGQQTE------TLLAGSLAKALCYINKMSKEVKANQEMKSRILVIKA 148
Query: 68 SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
+ D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K L L Q
Sbjct: 149 AEDSALQYMNFMNVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGIYLKVPHLPSLLQ 207
Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
YLL +F D R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C CSTC
Sbjct: 208 YLLWVFLPDQEQRSQLVLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICSTC 267
Query: 188 GSVF 191
+ F
Sbjct: 268 ETAF 271
>gi|307214269|gb|EFN89365.1| General transcription factor IIH subunit 3 [Harpegnathos saltator]
Length = 292
Score = 153 bits (387), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 82/162 (50%), Positives = 105/162 (64%), Gaps = 7/162 (4%)
Query: 34 SLLSGSLSMALCYIQRVFRSGL----LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
SL+SG+LSMALCYI R+ R + LHP RIL + S D QY+ MN F+AQ+
Sbjct: 113 SLISGALSMALCYIARLERDKVAGQKLHP--RILVITASNDSATQYMNYMNIFFTAQKMN 170
Query: 90 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
V +D C L Q LQQ ITGG + K QL GL QYLL +F D + R+ L LP PV
Sbjct: 171 VILDVCSLD-QELTLLQQGCDITGGNYLKVPQLAGLLQYLLWVFLPDPNVRSKLVLPPPV 229
Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
VD+RA+CFCH+ ID+GY+CS+CLS++CK C+TC +VF
Sbjct: 230 KVDYRAACFCHQELIDIGYVCSICLSVFCKFSPICTTCHTVF 271
>gi|115609530|ref|XP_791329.2| PREDICTED: general transcription factor IIH subunit 3-like
[Strongylocentrotus purpuratus]
Length = 313
Score = 153 bits (386), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 80/191 (41%), Positives = 118/191 (61%), Gaps = 8/191 (4%)
Query: 3 SLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQP 60
S+ T+++ L+ +++ + G + +LL+GSL+MALCYI RV + + +
Sbjct: 107 SINDTVMEELKNLVDESKSWGLSHTD-----TLLAGSLAMALCYIHRVXKECAVGEKIKS 161
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RIL L+ + D QY+ MN IF+AQ+ +PID+C L ++S+ LQQA ITGG + K
Sbjct: 162 RILVLKAADDSASQYMNFMNVIFTAQKQNIPIDACILD-KDSSLLQQACDITGGKYLKLL 220
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
GL QYLL ++ R L LP + VD+RA+CFCH+ ID+GY+CSVCLSI+C +
Sbjct: 221 STTGLLQYLLWVYLPSPSQRESLVLPPAIHVDYRAACFCHRILIDVGYVCSVCLSIFCTY 280
Query: 181 LKKCSTCGSVF 191
CSTC + F
Sbjct: 281 SPICSTCHTAF 291
>gi|332019667|gb|EGI60141.1| General transcription factor IIH subunit 3 [Acromyrmex echinatior]
Length = 292
Score = 153 bits (386), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 83/162 (51%), Positives = 104/162 (64%), Gaps = 7/162 (4%)
Query: 34 SLLSGSLSMALCYIQRVFRSGL----LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
SL+SG+LSMALCYI R+ R + LHP R+L + S D QY+ MN F+AQR
Sbjct: 113 SLISGALSMALCYIARLEREKVAGQKLHP--RMLVITASNDSATQYMNYMNIFFTAQRMN 170
Query: 90 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
V +D C L Q LQQ ITGG + K QL GL QYLL +F D R+ L LP PV
Sbjct: 171 VILDVCSLD-QELTLLQQGCDITGGNYLKVPQLAGLLQYLLWVFLPDPSVRSKLVLPPPV 229
Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
VD+RA+CFCH+ ID+GY+CS+CLSI+CK C+TC +VF
Sbjct: 230 KVDYRAACFCHQELIDIGYVCSICLSIFCKFSPICTTCHTVF 271
>gi|395513842|ref|XP_003761131.1| PREDICTED: general transcription factor IIH subunit 3 [Sarcophilus
harrisii]
Length = 316
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + EG+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 126 MTKSDMEGQRTETLLAGSLAKALCYIHRMGKEVKDNQEMKSRILVIKAAEDSALQYMNFM 185
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 186 NVIFAAQKQNIVIDACVLDT-DSGLLQQACDITGGIYLKVPQMLSLLQYLLWVFLPDQDQ 244
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 245 RSQLNLPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 296
>gi|149633387|ref|XP_001507590.1| PREDICTED: general transcription factor IIH subunit 3-like
[Ornithorhynchus anatinus]
Length = 311
Score = 150 bits (380), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 118/188 (62%), Gaps = 5/188 (2%)
Query: 8 LLQNLEEFMNKD--EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRIL 63
LL E + ++ + + K + +G+ +LL+GSL+ ALCYI R+ + + RIL
Sbjct: 106 LLTTANEVITEEIKDLMTKSDMKGQHTETLLAGSLAKALCYIHRMNKEVKDSQEMRSRIL 165
Query: 64 CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 123
++ S D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+
Sbjct: 166 VIKASEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGIYLKVPQMP 224
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 183
L QYLL +F D R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C
Sbjct: 225 SLLQYLLWVFLPDQDQRSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPI 284
Query: 184 CSTCGSVF 191
C+TC + F
Sbjct: 285 CTTCETAF 292
>gi|118601766|ref|NP_001073057.1| general transcription factor IIH subunit 3 [Bos taurus]
gi|426247186|ref|XP_004017367.1| PREDICTED: general transcription factor IIH subunit 3 [Ovis aries]
gi|122131719|sp|Q05B56.1|TF2H3_BOVIN RecName: Full=General transcription factor IIH subunit 3; AltName:
Full=General transcription factor IIH polypeptide 3
gi|115545470|gb|AAI22793.1| General transcription factor IIH, polypeptide 3, 34kDa [Bos taurus]
gi|296478493|tpg|DAA20608.1| TPA: general transcription factor IIH subunit 3 [Bos taurus]
Length = 309
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 78/172 (45%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + EG+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 120 MTKSDIEGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 179
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 239 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290
>gi|440898315|gb|ELR49840.1| General transcription factor IIH subunit 3, partial [Bos grunniens
mutus]
Length = 305
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 78/172 (45%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + EG+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 116 MTKSDIEGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 175
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 176 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 234
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 235 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 286
>gi|115655410|ref|XP_001194121.1| PREDICTED: general transcription factor IIH subunit 3-like
[Strongylocentrotus purpuratus]
Length = 391
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 79/181 (43%), Positives = 111/181 (61%), Gaps = 3/181 (1%)
Query: 13 EEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPD 70
+ M + + L + P +LL+GSL+MALCYI RV + + + RIL L+ + D
Sbjct: 190 DTVMEELKNLVDENPTVSHTDTLLAGSLAMALCYIHRVEKECAVGEKIKSRILVLKAADD 249
Query: 71 GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL 130
QY+ MN IF+AQ+ +PID+C L ++S+ LQQA ITGG + K GL QYLL
Sbjct: 250 SASQYMNFMNVIFTAQKQNIPIDACILD-KDSSLLQQACDITGGKYLKLLSTTGLLQYLL 308
Query: 131 TIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSV 190
++ R L LP + VD+RA+CFCH+ ID+GY+CSVCLSI+C + CSTC +
Sbjct: 309 WVYLPSPSQRESLVLPPAIHVDYRAACFCHRILIDVGYVCSVCLSIFCTYSPICSTCHTA 368
Query: 191 F 191
F
Sbjct: 369 F 369
>gi|449476600|ref|XP_002189370.2| PREDICTED: general transcription factor IIH subunit 3 isoform 1
[Taeniopygia guttata]
gi|449476604|ref|XP_004176463.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
[Taeniopygia guttata]
Length = 267
Score = 150 bits (379), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 78/187 (41%), Positives = 117/187 (62%), Gaps = 9/187 (4%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILC 64
+ + +++ M K + G+Q +LL+GSL+ ALCYI ++ + ++ + RIL
Sbjct: 69 AIAEEIKDLMTKTDMKGQQTE------TLLAGSLAKALCYINKMGKDLKVNQEIKSRILV 122
Query: 65 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
++ + D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K +
Sbjct: 123 IKAAEDSALQYMNFMNVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGIYLKVPHMPS 181
Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 184
L QYLL +F D R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C
Sbjct: 182 LLQYLLWVFLPDQEQRSQLVLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPIC 241
Query: 185 STCGSVF 191
STC + F
Sbjct: 242 STCETAF 248
>gi|301754663|ref|XP_002913183.1| PREDICTED: general transcription factor IIH subunit 3-like
[Ailuropoda melanoleuca]
Length = 309
Score = 150 bits (379), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 113/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K E +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ +M
Sbjct: 120 MTKSEIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNLM 179
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 239 RSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290
>gi|390602568|gb|EIN11961.1| transcription factor Tfb4 [Punctularia strigosozonata HHB-11173
SS5]
Length = 338
Score = 150 bits (379), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 81/185 (43%), Positives = 111/185 (60%), Gaps = 8/185 (4%)
Query: 12 LEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR-SGLLHP----QPRILCLQ 66
++ N+ + LG+ E + A L G+L+ ALCYI R+ S P +PRIL L
Sbjct: 100 IDHITNELDVLGEWEAQESTA---LVGALTKALCYINRIMNVSADNEPRSPSEPRILILS 156
Query: 67 GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLF 126
SPD Y+ IMN+IFSAQ+ V +D C L S FLQQAS++TGG + + D L
Sbjct: 157 VSPDQSTSYIPIMNSIFSAQKLKVTMDVCQLYGPESVFLQQASHLTGGSYIHLEHRDALL 216
Query: 127 QYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCST 186
QYL+ F R L +P +DFRA+CFCHKN +D+G++CSVCLSI+C+ + CST
Sbjct: 217 QYLIMSFLPPPSIRRTLSVPTQDKIDFRAACFCHKNIVDVGFVCSVCLSIFCQPVPVCST 276
Query: 187 CGSVF 191
C + F
Sbjct: 277 CRTKF 281
>gi|322795304|gb|EFZ18109.1| hypothetical protein SINV_08150 [Solenopsis invicta]
Length = 322
Score = 150 bits (378), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 87/185 (47%), Positives = 111/185 (60%), Gaps = 12/185 (6%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGL----LHPQPRI 62
T+ Q L++F+N+ E SL+SG+LSMALCYI R+ R + LHP RI
Sbjct: 89 TVRQQLQQFINEIPMDAPLNTE-----SLISGALSMALCYIARLEREKVAGQKLHP--RI 141
Query: 63 LCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL 122
L + S D QY+ MN F+AQR V +D C L Q LQQ I+GG + K QL
Sbjct: 142 LVITASNDSATQYMNYMNIFFTAQRMNVILDVCSLD-QELTLLQQGCDISGGNYLKVPQL 200
Query: 123 DGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLK 182
GL QYLL +F D R L LP PV VD+RA+CFCH+ ID+GY+CS+CLSI+CK
Sbjct: 201 AGLLQYLLWVFLPDPSVRLKLVLPPPVKVDYRAACFCHQELIDIGYVCSICLSIFCKFSP 260
Query: 183 KCSTC 187
C+TC
Sbjct: 261 ICTTC 265
>gi|350407575|ref|XP_003488130.1| PREDICTED: general transcription factor IIH subunit 3-like [Bombus
impatiens]
Length = 292
Score = 150 bits (378), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 84/190 (44%), Positives = 117/190 (61%), Gaps = 14/190 (7%)
Query: 7 TLLQNLEEFMNK---DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PR 61
T+ Q L++ +N+ D+ L + SL+SG+L+MALCY+ R+ R + + R
Sbjct: 91 TIRQQLQQVINEISMDKPLNGE--------SLISGALTMALCYVARLEREKVASEKIYSR 142
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
IL + S D QY+ MN F+AQ+ + +D C L Q LQQ ITGG + K Q
Sbjct: 143 ILVITASNDSATQYMNYMNIFFTAQKMGIILDVCSLD-QELTLLQQGCDITGGNYLKVPQ 201
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
L+GL QYLL IF D + R+ L LP PV VD+RA+CFCH+ ID+GY+CS+CLSI+CK
Sbjct: 202 LNGLLQYLLWIFLPDPNVRSKLVLPPPVKVDYRAACFCHQELIDIGYVCSICLSIFCKFS 261
Query: 182 KKCSTCGSVF 191
C+TC +VF
Sbjct: 262 PICTTCHTVF 271
>gi|194042856|ref|XP_001927574.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
[Sus scrofa]
Length = 309
Score = 150 bits (378), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + EG+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 120 MTKSDIEGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 179
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 239 RSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290
>gi|194853925|ref|XP_001968252.1| GG24611 [Drosophila erecta]
gi|190660119|gb|EDV57311.1| GG24611 [Drosophila erecta]
Length = 299
Score = 150 bits (378), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 79/160 (49%), Positives = 104/160 (65%), Gaps = 3/160 (1%)
Query: 34 SLLSGSLSMALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SLL+GS+SMALCYI R+ R+ + RIL L GS + QY+ MN F+AQ+ +
Sbjct: 121 SLLAGSMSMALCYISRLQRNVAPGVKMHSRILVLTGSNECASQYMTFMNVFFTAQKLGIT 180
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
ID+C L + + LQQ IT G K QLDGL QYLL +F H R+ L LP P V
Sbjct: 181 IDTCALN-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPHIRHKLVLPPPPKV 239
Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
D+RASCFCH+ ID+GY+CSVCLS++CK+ C+TC ++F
Sbjct: 240 DYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279
>gi|340717520|ref|XP_003397229.1| PREDICTED: general transcription factor IIH subunit 3-like [Bombus
terrestris]
Length = 292
Score = 150 bits (378), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 84/190 (44%), Positives = 117/190 (61%), Gaps = 14/190 (7%)
Query: 7 TLLQNLEEFMNK---DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PR 61
T+ Q L++ +N+ D+ L + SL+SG+L+MALCY+ R+ R + + R
Sbjct: 91 TIRQQLQQVINEISMDKPLNGE--------SLISGALTMALCYVARLEREKVASEKIYSR 142
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
IL + S D QY+ MN F+AQ+ + +D C L Q LQQ ITGG + K Q
Sbjct: 143 ILVITASNDSATQYMNYMNIFFTAQKMGIILDVCSLD-QELTLLQQGCDITGGNYLKVPQ 201
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
L+GL QYLL IF D + R+ L LP PV VD+RA+CFCH+ ID+GY+CS+CLSI+CK
Sbjct: 202 LNGLLQYLLWIFLPDPNVRSKLVLPPPVKVDYRAACFCHQELIDIGYVCSICLSIFCKFS 261
Query: 182 KKCSTCGSVF 191
C+TC +VF
Sbjct: 262 PICTTCHTVF 271
>gi|170093117|ref|XP_001877780.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164647639|gb|EDR11883.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 360
Score = 150 bits (378), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 78/199 (39%), Positives = 120/199 (60%), Gaps = 19/199 (9%)
Query: 6 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRV-------------FR 52
+T++ + + ++ +Q ++ P C+L+ G+L+ ALCY+ R+ R
Sbjct: 102 STIINRISDELDAIDQSEEEAP-----CALV-GALTKALCYVNRISLPPPSNSSTSPNTR 155
Query: 53 SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
+ P PRIL L SPD Y+ IMN+IFSAQ+ V ID+C + ++ FLQQA+++T
Sbjct: 156 DSTVLPDPRILILSVSPDLSTSYIPIMNSIFSAQKLKVTIDACQVYGPDAVFLQQAAHLT 215
Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 172
GG + ++ D L QYL+ F + R L +P +DFRA+CFCHKN ID+G++CSV
Sbjct: 216 GGSYLFLERRDALLQYLIMSFLSAPSIRQVLAVPTQDRIDFRAACFCHKNIIDIGFVCSV 275
Query: 173 CLSIYCKHLKKCSTCGSVF 191
CLSI+C+ + CSTC + F
Sbjct: 276 CLSIFCQPVPVCSTCRTKF 294
>gi|187608008|ref|NP_001120625.1| general transcription factor IIH, polypeptide 3, 34kDa [Xenopus
(Silurana) tropicalis]
gi|171846721|gb|AAI61775.1| gtf2h3 protein [Xenopus (Silurana) tropicalis]
Length = 200
Score = 149 bits (377), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 77/183 (42%), Positives = 115/183 (62%), Gaps = 8/183 (4%)
Query: 11 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGS 68
++++ M K Q+ Q+ E ++L+GSL+ ALCYI ++ + + RIL ++ +
Sbjct: 5 DIKDLMTKTGQINGQQTE-----TVLAGSLAKALCYINKIAKDTKAGQEVKSRILVIKAA 59
Query: 69 PDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQY 128
D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q + L QY
Sbjct: 60 EDSALQYMNFMNVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGIYLKLPQANSLLQY 118
Query: 129 LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCG 188
LL +F D R+ L L PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC
Sbjct: 119 LLWVFLPDPDQRSHLNLLPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCE 178
Query: 189 SVF 191
+ F
Sbjct: 179 TAF 181
>gi|442749163|gb|JAA66741.1| Putative rna polymer [Ixodes ricinus]
Length = 293
Score = 149 bits (377), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 79/174 (45%), Positives = 105/174 (60%), Gaps = 3/174 (1%)
Query: 20 EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVA 77
E + + E SLL+G LS+ALCYI R+ + + RIL L S + QY+
Sbjct: 100 ELVLSDDSESAAGESLLTGGLSLALCYINRIEKESSSQNKIASRILVLSASGESASQYLN 159
Query: 78 IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
MN F+AQ+ V ID+C L ++S LQQ ITGG + K GL QYLL +F D
Sbjct: 160 FMNVFFTAQKKNVIIDACVL-EKDSGLLQQGCDITGGKYMKVPNHAGLLQYLLWVFLPDK 218
Query: 138 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
SRN++ P PV VD+RA+CFCH+N I++GY+CSVCLSI+C CSTC + F
Sbjct: 219 TSRNYMVFPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCAFSPICSTCQTAF 272
>gi|110757998|ref|XP_394705.3| PREDICTED: general transcription factor IIH subunit 3 [Apis
mellifera]
Length = 292
Score = 149 bits (376), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 79/160 (49%), Positives = 103/160 (64%), Gaps = 3/160 (1%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SL+SG+L+MALCYI R+ R + + RIL + S D QY+ MN F+AQ+ V
Sbjct: 113 SLISGALTMALCYIARLEREKIAGEKLYSRILVITASNDSATQYMNYMNIFFTAQKMGVI 172
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
+D C L Q LQQ ITGG + K QL+GL QYLL IF D + R+ L P PV V
Sbjct: 173 LDVCSLD-QELTLLQQGCDITGGNYLKVPQLNGLLQYLLWIFLPDPNVRSKLVFPPPVKV 231
Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
D+RA+CFCH+ ID+GY+CS+CLSI+CK C+TC +VF
Sbjct: 232 DYRAACFCHQELIDIGYVCSICLSIFCKFSPICTTCHTVF 271
>gi|281343811|gb|EFB19395.1| hypothetical protein PANDA_000941 [Ailuropoda melanoleuca]
Length = 278
Score = 149 bits (376), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 113/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K E +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ +M
Sbjct: 90 MTKSEIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNLM 149
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 150 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 208
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 209 RSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 260
>gi|380030634|ref|XP_003698948.1| PREDICTED: general transcription factor IIH subunit 3-like [Apis
florea]
Length = 292
Score = 149 bits (376), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 79/160 (49%), Positives = 103/160 (64%), Gaps = 3/160 (1%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SL+SG+L+MALCYI R+ R + + RIL + S D QY+ MN F+AQ+ V
Sbjct: 113 SLISGALTMALCYIARLEREKIAGEKLYSRILVITASNDSATQYMNYMNIFFTAQKMGVI 172
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
+D C L Q LQQ ITGG + K QL+GL QYLL IF D + R+ L P PV V
Sbjct: 173 LDVCSLD-QELTLLQQGCDITGGNYLKVPQLNGLLQYLLWIFLPDPNVRSKLVFPPPVKV 231
Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
D+RA+CFCH+ ID+GY+CS+CLSI+CK C+TC +VF
Sbjct: 232 DYRAACFCHQELIDIGYVCSICLSIFCKFSPICTTCHTVF 271
>gi|119618833|gb|EAW98427.1| general transcription factor IIH, polypeptide 3, 34kDa, isoform
CRA_b [Homo sapiens]
Length = 274
Score = 149 bits (375), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 78/179 (43%), Positives = 116/179 (64%), Gaps = 3/179 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 78 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 137
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 138 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 196
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQS 198
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + + +T+S
Sbjct: 197 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCDAESQEKETES 255
>gi|308798981|ref|XP_003074271.1| basal transcription factor complex subunit-related (ISS)
[Ostreococcus tauri]
gi|116000442|emb|CAL50122.1| basal transcription factor complex subunit-related (ISS)
[Ostreococcus tauri]
Length = 319
Score = 149 bits (375), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 103/172 (59%), Gaps = 4/172 (2%)
Query: 24 KQEPEGRIACSLLSGSLSMALCYIQR---VFRSGLLHPQPRILCLQGSPDGPEQYVAIMN 80
K + + A S +G+LS+ALCY R + + L +PRILCLQ S D P Y+ +MN
Sbjct: 141 KADSDDGPATSPFAGALSLALCYCNRAQALENAAGLRSKPRILCLQASQDNPTDYIPMMN 200
Query: 81 AIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSR 140
AIFSAQR +P+D+ LG +S F+QQA++IT G + KP DGL QYLL+ DL SR
Sbjct: 201 AIFSAQRQSIPVDAFALGEHDSPFMQQAAHITRGAYVKPTLGDGLLQYLLSTAVMDLRSR 260
Query: 141 NFLQLPKPVGVDFRASCFCHKNTIDM-GYICSVCLSIYCKHLKKCSTCGSVF 191
FL+LP+ FRAS ++CSVCLS++C C TCG+ F
Sbjct: 261 AFLKLPQAKASTFRASVLLPTTARQASAFVCSVCLSVFCDGRPACDTCGTTF 312
>gi|1039318|emb|CAA82909.1| basic transcription factor 2, 35 kD subunit [Homo sapiens]
Length = 303
Score = 149 bits (375), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 178
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 179 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 237
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 238 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289
>gi|307175391|gb|EFN65408.1| General transcription factor IIH subunit 3 [Camponotus floridanus]
Length = 292
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 78/160 (48%), Positives = 104/160 (65%), Gaps = 3/160 (1%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SL+SG+LSMALCYI R+ R + + R+L + S D QY+ MN F+AQR V
Sbjct: 113 SLISGALSMALCYIARLEREKFANQKLHSRMLVITASNDSAMQYMNYMNIFFTAQRMNVI 172
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
+D C L Q LQQ ITGG + + QL GL QYLL +F D R+ L LP PV V
Sbjct: 173 LDVCSLD-QELTLLQQGCEITGGNYLQVPQLAGLLQYLLWVFLPDPSVRSKLVLPPPVKV 231
Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
++RA+CFCH++ ID+GY+CS+CLSI+CK+ C+TC +VF
Sbjct: 232 NYRAACFCHQDLIDIGYVCSICLSIFCKYSPICTTCHTVF 271
>gi|114647703|ref|XP_001170519.1| PREDICTED: general transcription factor IIH subunit 3 isoform 3
[Pan troglodytes]
gi|397481834|ref|XP_003812142.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
[Pan paniscus]
gi|410221584|gb|JAA08011.1| general transcription factor IIH, polypeptide 3, 34kDa [Pan
troglodytes]
gi|410252228|gb|JAA14081.1| general transcription factor IIH, polypeptide 3, 34kDa [Pan
troglodytes]
gi|410294034|gb|JAA25617.1| general transcription factor IIH, polypeptide 3, 34kDa [Pan
troglodytes]
gi|410334639|gb|JAA36266.1| general transcription factor IIH, polypeptide 3, 34kDa [Pan
troglodytes]
Length = 308
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + RIL ++ + D QY+ M
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 178
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 179 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 237
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 238 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289
>gi|19923732|ref|NP_001507.2| general transcription factor IIH subunit 3 isoform a [Homo sapiens]
gi|50403772|sp|Q13889.2|TF2H3_HUMAN RecName: Full=General transcription factor IIH subunit 3; AltName:
Full=Basic transcription factor 2 34 kDa subunit;
Short=BTF2 p34; AltName: Full=General transcription
factor IIH polypeptide 3; AltName: Full=TFIIH basal
transcription factor complex p34 subunit
gi|23955948|gb|AAN40702.1| general transcription factor IIH, polypeptide 3, 34kDa [Homo
sapiens]
gi|40850955|gb|AAH65250.1| General transcription factor IIH, polypeptide 3, 34kDa [Homo
sapiens]
gi|119618832|gb|EAW98426.1| general transcription factor IIH, polypeptide 3, 34kDa, isoform
CRA_a [Homo sapiens]
gi|189053764|dbj|BAG36016.1| unnamed protein product [Homo sapiens]
gi|208966364|dbj|BAG73196.1| general transcription factor IIH, polypeptide 3, 34kDa [synthetic
construct]
Length = 308
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 178
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 179 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 237
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 238 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289
>gi|167773979|gb|ABZ92424.1| general transcription factor IIH, polypeptide 3, 34kDa [synthetic
construct]
Length = 308
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 178
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 179 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 237
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 238 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289
>gi|34783458|gb|AAH39726.1| GTF2H3 protein, partial [Homo sapiens]
gi|34783739|gb|AAH47868.2| GTF2H3 protein, partial [Homo sapiens]
Length = 303
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 114 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 173
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 174 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 232
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 233 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 284
>gi|383855904|ref|XP_003703450.1| PREDICTED: general transcription factor IIH subunit 3-like
[Megachile rotundata]
Length = 292
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 87/190 (45%), Positives = 115/190 (60%), Gaps = 14/190 (7%)
Query: 7 TLLQNLEEFMNK---DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PR 61
T+ Q L++ +N+ D+ L + SL+SG+L+MALCYI R+ R + + R
Sbjct: 91 TVRQQLQQVINEISMDKPLNGE--------SLISGALTMALCYIARLEREKVAGQKLYSR 142
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
IL + S D QY+ MN F+AQ+ V +D C L Q LQQ ITGG + K Q
Sbjct: 143 ILTITASNDSATQYMNYMNIFFTAQKMGVILDVCSLD-QELTLLQQGCDITGGNYLKVPQ 201
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
L GL QYLL IF D R+ L LP PV VD+RA+CFCH+ ID+GY+CSVCLSI+CK
Sbjct: 202 LSGLLQYLLWIFLPDPSIRSKLVLPPPVKVDYRAACFCHQELIDIGYVCSVCLSIFCKFS 261
Query: 182 KKCSTCGSVF 191
C+TC +VF
Sbjct: 262 PICTTCHTVF 271
>gi|402888055|ref|XP_003907392.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
[Papio anubis]
gi|426374588|ref|XP_004054152.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
[Gorilla gorilla gorilla]
gi|380784389|gb|AFE64070.1| general transcription factor IIH subunit 3 [Macaca mulatta]
gi|383417813|gb|AFH32120.1| general transcription factor IIH subunit 3 [Macaca mulatta]
gi|384946662|gb|AFI36936.1| general transcription factor IIH subunit 3 [Macaca mulatta]
Length = 308
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + RIL ++ + D QY+ M
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 178
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 179 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 237
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 238 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289
>gi|195032196|ref|XP_001988455.1| GH10566 [Drosophila grimshawi]
gi|193904455|gb|EDW03322.1| GH10566 [Drosophila grimshawi]
Length = 300
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 78/160 (48%), Positives = 103/160 (64%), Gaps = 3/160 (1%)
Query: 34 SLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SLL+GS+SMALCYI R+ R + + RIL L GS + QY+ MN F+AQ+ +
Sbjct: 122 SLLAGSMSMALCYISRLQRNVAAGVKMHSRILVLTGSNECASQYMTYMNVFFTAQKLSIV 181
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
ID+C L + + LQQ IT G K QLDGL QYLL +F R+ L LP P V
Sbjct: 182 IDACALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFMPSPQMRHKLVLPPPPKV 240
Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
D+RASCFCH+ ID+GY+CSVCLS++CK+ C+TC ++F
Sbjct: 241 DYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 280
>gi|395846791|ref|XP_003796077.1| PREDICTED: general transcription factor IIH subunit 3 [Otolemur
garnettii]
Length = 309
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 179
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 239 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290
>gi|348554273|ref|XP_003462950.1| PREDICTED: general transcription factor IIH subunit 3-like [Cavia
porcellus]
Length = 309
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 110/172 (63%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + + RIL L+ + D QY+ M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVLKAAEDSALQYMNFM 179
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 180 NVIFAAQKQNILIDACVLDT-DSGLLQQACDITGGLYLKVPQMTSLLQYLLWVFLPDQDQ 238
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 239 RAQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290
>gi|291414274|ref|XP_002723381.1| PREDICTED: general transcription factor IIH, polypeptide 3, 34kDa
[Oryctolagus cuniculus]
Length = 309
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 179
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 180 NVIFAAQKQNILIDACVLDT-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 239 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCSFSPICTTCETAF 290
>gi|157132766|ref|XP_001662634.1| transcription factor TFIIH-subunit, putative [Aedes aegypti]
gi|108871079|gb|EAT35304.1| AAEL012523-PA [Aedes aegypti]
Length = 288
Score = 148 bits (374), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 80/160 (50%), Positives = 103/160 (64%), Gaps = 3/160 (1%)
Query: 34 SLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SLL+GS++MALCYI RV R+ + RIL + GS + QY+ MN F+AQ+ V
Sbjct: 113 SLLAGSMAMALCYIARVHRNKPAGVKINSRILVVTGSNECASQYMTYMNVFFTAQKQNVS 172
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
+D C L S LQQ ITGG + K QLDG QYLL +F + +R L LP PV V
Sbjct: 173 LDVCALDKPLS-LLQQGCDITGGQYLKLPQLDGFLQYLLWVFLPEPLTRCKLVLPPPVKV 231
Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
D+RA+CFCH+ ID+GY+CSVCLSI+CK C+TC +VF
Sbjct: 232 DYRAACFCHRELIDIGYVCSVCLSIFCKFSPICTTCHTVF 271
>gi|21410252|gb|AAH31030.1| Unknown (protein for IMAGE:4724313), partial [Homo sapiens]
Length = 235
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/187 (41%), Positives = 117/187 (62%), Gaps = 9/187 (4%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILC 64
+++ +++ M K + G+ +LL+GSL+ ALCYI R+ + + + RIL
Sbjct: 37 VIVEEIKDLMTKSDIKGQHTE------TLLAGSLAKALCYIHRMNKEVKDNQEMKSRILV 90
Query: 65 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
++ + D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+
Sbjct: 91 IKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPS 149
Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 184
L QYLL +F D R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C
Sbjct: 150 LLQYLLWVFLPDQDQRSRLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPIC 209
Query: 185 STCGSVF 191
+TC + F
Sbjct: 210 TTCETAF 216
>gi|358060458|dbj|GAA93863.1| hypothetical protein E5Q_00509 [Mixia osmundae IAM 14324]
Length = 331
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 75/187 (40%), Positives = 111/187 (59%), Gaps = 10/187 (5%)
Query: 14 EFMNKDEQLGKQ--------EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILC 64
F D Q+ K EP+ +++S +L+ LC+I R+ R H +PR+L
Sbjct: 101 RFRQVDTQVSKAVGQMMRELEPDTDAPPAIVS-ALARTLCHINRISREETKHTIKPRVLL 159
Query: 65 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
L S D QY+ +MN IFSAQ++ VPID C + ++ FLQQA+Y++ G+++K ++ G
Sbjct: 160 LSVSHDSSSQYIPLMNCIFSAQKANVPIDVCKIYGDDAVFLQQAAYLSSGIYYKLEKRAG 219
Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 184
L QYL+ F + SR L LP VD RA+CFCH+ +D+G++CSVCLSI+C C
Sbjct: 220 LLQYLMMTFLPGVTSRKLLNLPSQDAVDLRAACFCHQRIVDIGFVCSVCLSIFCTPRPSC 279
Query: 185 STCGSVF 191
TC + F
Sbjct: 280 LTCRTKF 286
>gi|355564808|gb|EHH21308.1| hypothetical protein EGK_04329, partial [Macaca mulatta]
gi|355786640|gb|EHH66823.1| hypothetical protein EGM_03880, partial [Macaca fascicularis]
Length = 308
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + RIL ++ + D QY+ M
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 178
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 179 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 237
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 238 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289
>gi|344297280|ref|XP_003420327.1| PREDICTED: general transcription factor IIH subunit 3-like
[Loxodonta africana]
Length = 309
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 76/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 120 MTKSDIKGQHMETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 179
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 180 NVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 239 RSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290
>gi|449683803|ref|XP_002159513.2| PREDICTED: general transcription factor IIH subunit 3-like [Hydra
magnipapillata]
Length = 291
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 75/159 (47%), Positives = 101/159 (63%), Gaps = 2/159 (1%)
Query: 34 SLLSGSLSMALCYIQRVFRSGL-LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPI 92
SLL+G+L+ ALCYI R+ RIL ++GS D QY+ +MN IF+A + + I
Sbjct: 120 SLLAGALTKALCYIHSHDRTANGRRTNARILIIKGSSDSSSQYMTVMNCIFAASKKNIVI 179
Query: 93 DSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVD 152
D C L NS F+QQAS ITGGV+ G+ +YLL IF D R L LP +D
Sbjct: 180 DCCAL-QNNSGFMQQASDITGGVYFFIDDFSGMLEYLLWIFLPDPGLREKLNLPTSSQID 238
Query: 153 FRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+RA+CFCHK +D+G++CSVCLSIYC+ + KC+TC + F
Sbjct: 239 YRAACFCHKQLVDVGFVCSVCLSIYCQFMPKCATCQTRF 277
>gi|31088894|ref|NP_852075.1| general transcription factor IIH subunit 3 [Mus musculus]
gi|22654149|sp|Q8VD76.1|TF2H3_MOUSE RecName: Full=General transcription factor IIH subunit 3; AltName:
Full=Basic transcription factor 2 34 kDa subunit;
Short=BTF2 p34; AltName: Full=General transcription
factor IIH polypeptide 3; AltName: Full=TFIIH basal
transcription factor complex p34 subunit
gi|17028402|gb|AAH17515.1| General transcription factor IIH, polypeptide 3 [Mus musculus]
gi|74228938|dbj|BAE21939.1| unnamed protein product [Mus musculus]
gi|148687628|gb|EDL19575.1| general transcription factor IIH, polypeptide 3 [Mus musculus]
Length = 309
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 76/172 (44%), Positives = 113/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI RV ++ + + RIL ++ + D QY+ M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRVNKAVKDNQEMKSRILVIKAAEDSALQYMNFM 179
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP P+ VD+RA+CFCH++ I++GY+CSVCLSI+C C+TC + F
Sbjct: 239 RSQLILPPPIHVDYRAACFCHRSLIEIGYVCSVCLSIFCNFSPICTTCETAF 290
>gi|332254335|ref|XP_003276282.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
[Nomascus leucogenys]
Length = 308
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/187 (41%), Positives = 116/187 (62%), Gaps = 9/187 (4%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILC 64
+++ +++ M K + G+ +LL+GSL+ ALCYI R+ + + RIL
Sbjct: 110 VIVEEIKDLMTKGDIKGQHTE------TLLAGSLAKALCYIHRMNKEIKDNQEMKSRILV 163
Query: 65 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
++ + D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+
Sbjct: 164 IKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPS 222
Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 184
L QYLL +F D R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C
Sbjct: 223 LLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPIC 282
Query: 185 STCGSVF 191
+TC + F
Sbjct: 283 TTCETAF 289
>gi|335775838|gb|AEH58705.1| general transcription factor IIH subunit-like protein [Equus
caballus]
Length = 302
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 76/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 113 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDTALQYMNFM 172
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 173 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 231
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 232 RSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 283
>gi|57105536|ref|XP_534644.1| PREDICTED: general transcription factor IIH subunit 3 isoform 1
[Canis lupus familiaris]
Length = 308
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 76/170 (44%), Positives = 111/170 (65%), Gaps = 3/170 (1%)
Query: 24 KQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNA 81
K + +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ MN
Sbjct: 121 KSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNV 180
Query: 82 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 141
IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D R+
Sbjct: 181 IFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRS 239
Query: 142 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 240 QLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289
>gi|149720731|ref|XP_001498402.1| PREDICTED: general transcription factor IIH subunit 3-like [Equus
caballus]
Length = 309
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 76/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDTALQYMNFM 179
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 239 RSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290
>gi|296213239|ref|XP_002753188.1| PREDICTED: general transcription factor IIH subunit 3 [Callithrix
jacchus]
Length = 309
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + RIL ++ + D QY+ M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 179
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWMFLPDQDQ 238
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 239 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 290
>gi|428673524|ref|NP_001258796.1| general transcription factor IIH subunit 3 isoform c [Homo sapiens]
gi|194389784|dbj|BAG60408.1| unnamed protein product [Homo sapiens]
Length = 267
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 78 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 137
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 138 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 196
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 197 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 248
>gi|417398730|gb|JAA46398.1| Putative rna polymer [Desmodus rotundus]
Length = 308
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVVKAAEDSALQYMNFM 178
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 179 NVIFAAQKQSILIDACVLDS-DSGLLQQACDITGGMYLKVPQVPSLLQYLLWVFLPDQDQ 237
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 238 RPQLTLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 289
>gi|332840758|ref|XP_003314057.1| PREDICTED: general transcription factor IIH subunit 3 [Pan
troglodytes]
gi|397481836|ref|XP_003812143.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
[Pan paniscus]
Length = 267
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + RIL ++ + D QY+ M
Sbjct: 78 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 137
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 138 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 196
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 197 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 248
>gi|402888057|ref|XP_003907393.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
[Papio anubis]
gi|426374590|ref|XP_004054153.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
[Gorilla gorilla gorilla]
Length = 267
Score = 147 bits (371), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + RIL ++ + D QY+ M
Sbjct: 78 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 137
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 138 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 196
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 197 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 248
>gi|297693367|ref|XP_002823992.1| PREDICTED: general transcription factor IIH subunit 3 [Pongo
abelii]
Length = 318
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 111/172 (64%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + RIL ++ + D QY+ M
Sbjct: 129 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRILVIKAAEDSALQYMNFM 188
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 189 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 247
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 248 RSQLTLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 299
>gi|432873950|ref|XP_004072398.1| PREDICTED: general transcription factor IIH subunit 3-like [Oryzias
latipes]
Length = 300
Score = 147 bits (370), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 82/181 (45%), Positives = 111/181 (61%), Gaps = 6/181 (3%)
Query: 13 EEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPD 70
EE N L K E G +LL+GSL+ ALCYI RV + + + RIL ++ + D
Sbjct: 105 EEIRNV---LSKTEVRGNSTDTLLAGSLAKALCYIYRVSKELDVGQEIKSRILVIKAAED 161
Query: 71 GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL 130
QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q L QYLL
Sbjct: 162 CALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKIPQKLALTQYLL 220
Query: 131 TIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSV 190
+F D R+ L LP P VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC +
Sbjct: 221 WVFLPDSEQRSQLVLPPPAHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETA 280
Query: 191 F 191
F
Sbjct: 281 F 281
>gi|158296528|ref|XP_316918.4| AGAP008522-PA [Anopheles gambiae str. PEST]
gi|157014753|gb|EAA12181.5| AGAP008522-PA [Anopheles gambiae str. PEST]
Length = 288
Score = 147 bits (370), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 78/160 (48%), Positives = 103/160 (64%), Gaps = 3/160 (1%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SLL+GS+SMALCYI R+ R+ + R+L + GS + QY+ MN F+AQ+ V
Sbjct: 113 SLLAGSMSMALCYIARINRNKAPGSKINSRVLVVTGSNECASQYMTYMNVFFTAQKQGVV 172
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
+D C L S LQQ ITGG + + +QL+G QYLL +F D R L LP PV V
Sbjct: 173 VDVCALDKALS-LLQQGCDITGGQYLRLEQLEGFLQYLLWVFLPDPQMRCKLVLPPPVKV 231
Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
D+RA+CFCH+ ID+GY+CSVCLSI+CK C+TC +VF
Sbjct: 232 DYRAACFCHRELIDIGYVCSVCLSIFCKFSPICTTCHTVF 271
>gi|195575703|ref|XP_002077716.1| GD22925 [Drosophila simulans]
gi|194189725|gb|EDX03301.1| GD22925 [Drosophila simulans]
Length = 299
Score = 147 bits (370), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 80/162 (49%), Positives = 104/162 (64%), Gaps = 7/162 (4%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
SLL+GS+SMALCYI R+ R+ L P RIL L GS + QY+ MN F+AQ+
Sbjct: 121 SLLAGSMSMALCYISRLQRN--LAPGVKMHSRILVLTGSNECASQYMTFMNVFFTAQKLG 178
Query: 90 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
+ ID+C L + + LQQ IT G K QLDGL QYLL +F R+ L LP P
Sbjct: 179 ITIDTCALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPPPP 237
Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
VD+RASCFCH+ ID+GY+CSVCLS++CK+ C+TC ++F
Sbjct: 238 KVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279
>gi|441630270|ref|XP_004089523.1| PREDICTED: general transcription factor IIH subunit 3 isoform 3
[Nomascus leucogenys]
Length = 267
Score = 147 bits (370), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 78/187 (41%), Positives = 116/187 (62%), Gaps = 9/187 (4%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILC 64
+++ +++ M K + G+ +LL+GSL+ ALCYI R+ + + RIL
Sbjct: 69 VIVEEIKDLMTKGDIKGQHTE------TLLAGSLAKALCYIHRMNKEIKDNQEMKSRILV 122
Query: 65 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
++ + D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+
Sbjct: 123 IKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPS 181
Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 184
L QYLL +F D R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C
Sbjct: 182 LLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPIC 241
Query: 185 STCGSVF 191
+TC + F
Sbjct: 242 TTCETAF 248
>gi|395332939|gb|EJF65317.1| transcription factor Tfb4 [Dichomitus squalens LYAD-421 SS1]
Length = 324
Score = 147 bits (370), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 113/199 (56%), Gaps = 16/199 (8%)
Query: 3 SLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVF----------R 52
+L +T++ ++ E N + ++EP G + C L+ ALCYI R
Sbjct: 93 TLDSTVMSSIAERFNSYDVTEEEEPIGLVGC------LTKALCYINRTISVVSASAQGTE 146
Query: 53 SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
+ PRIL L SPD Y+ IMN+IFSAQ+ V ID C + + FLQQA+++T
Sbjct: 147 GATISIDPRILVLSVSPDQSSTYIPIMNSIFSAQKLKVTIDVCKVFGAENVFLQQAAHLT 206
Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 172
GG + + ++ D L QY++ F R + +P VDFRA+CFCHK +++G++CSV
Sbjct: 207 GGAYVQIERTDALLQYMMMSFLPPPAIRQLISVPTQDRVDFRAACFCHKRIVEIGFVCSV 266
Query: 173 CLSIYCKHLKKCSTCGSVF 191
CLSI+C+ + CSTC + F
Sbjct: 267 CLSIFCQPVPVCSTCRTKF 285
>gi|195350283|ref|XP_002041670.1| GM16625 [Drosophila sechellia]
gi|194123443|gb|EDW45486.1| GM16625 [Drosophila sechellia]
Length = 299
Score = 147 bits (370), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 80/162 (49%), Positives = 104/162 (64%), Gaps = 7/162 (4%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
SLL+GS+SMALCYI R+ R+ L P RIL L GS + QY+ MN F+AQ+
Sbjct: 121 SLLAGSMSMALCYISRLQRN--LAPGVKMHSRILVLTGSNECASQYMTFMNVFFTAQKLG 178
Query: 90 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
+ ID+C L + + LQQ IT G K QLDGL QYLL +F R+ L LP P
Sbjct: 179 ITIDTCALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPPPP 237
Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
VD+RASCFCH+ ID+GY+CSVCLS++CK+ C+TC ++F
Sbjct: 238 KVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279
>gi|355693898|gb|AER99489.1| proteinral transcription factor IIH, polypeptide 3, 34kDa [Mustela
putorius furo]
Length = 286
Score = 146 bits (369), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 76/168 (45%), Positives = 110/168 (65%), Gaps = 3/168 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFM 179
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC
Sbjct: 239 RSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTC 286
>gi|66730451|ref|NP_001019407.1| general transcription factor IIH subunit 3 [Rattus norvegicus]
gi|81888006|sp|Q561R7.1|TF2H3_RAT RecName: Full=General transcription factor IIH subunit 3; AltName:
Full=General transcription factor IIH polypeptide 3
gi|62531219|gb|AAH93380.1| General transcription factor IIH, polypeptide 3 [Rattus norvegicus]
gi|149063248|gb|EDM13571.1| general transcription factor IIH, polypeptide 3 [Rattus norvegicus]
Length = 309
Score = 146 bits (369), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 75/172 (43%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R ++ + + RIL ++ + D QY+ M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRASKAVKDNQEMKSRILVIKAAEDSALQYMNFM 179
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 180 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP P+ VD+RA+CFCH++ I++GY+CSVCLSI+C C+TC + F
Sbjct: 239 RSQLILPPPIHVDYRAACFCHRSLIEIGYVCSVCLSIFCNFSPICTTCETAF 290
>gi|432090055|gb|ELK23655.1| General transcription factor IIH subunit 3 [Myotis davidii]
Length = 247
Score = 146 bits (368), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 78/188 (41%), Positives = 117/188 (62%), Gaps = 11/188 (5%)
Query: 7 TLLQNLEEFMNKDEQLGKQ-EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRIL 63
+++ +++ M K + G+ EP LL+GSL+ ALCYI R+ + + + RIL
Sbjct: 49 VIVEEIKDLMTKSDIKGQHTEP-------LLAGSLAKALCYIHRMHKEVKDNQEMKSRIL 101
Query: 64 CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 123
++ + D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+
Sbjct: 102 VIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMP 160
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 183
L QYLL +F D R+ L LP P+ VD+RA+C CH+N I++GY+CSVCLSI+C
Sbjct: 161 SLLQYLLWVFLPDQDQRSQLTLPPPIHVDYRAACLCHRNLIEIGYVCSVCLSIFCNFSPI 220
Query: 184 CSTCGSVF 191
C+TC + F
Sbjct: 221 CTTCETAF 228
>gi|24580793|ref|NP_608574.2| Tfb4 [Drosophila melanogaster]
gi|22945518|gb|AAF51411.2| Tfb4 [Drosophila melanogaster]
gi|94400646|gb|ABF17925.1| FI01003p [Drosophila melanogaster]
Length = 299
Score = 145 bits (367), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 79/162 (48%), Positives = 104/162 (64%), Gaps = 7/162 (4%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
SLL+GS+SMALCYI R+ R+ L P RIL + GS + QY+ MN F+AQ+
Sbjct: 121 SLLAGSMSMALCYISRLQRN--LAPGVKMHSRILVVTGSNECASQYMTFMNVFFTAQKLG 178
Query: 90 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
+ ID+C L + + LQQ IT G K QLDGL QYLL +F R+ L LP P
Sbjct: 179 ITIDTCALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPPPP 237
Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
VD+RASCFCH+ ID+GY+CSVCLS++CK+ C+TC ++F
Sbjct: 238 KVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279
>gi|198475301|ref|XP_001357006.2| GA18615 [Drosophila pseudoobscura pseudoobscura]
gi|198138760|gb|EAL34072.2| GA18615 [Drosophila pseudoobscura pseudoobscura]
Length = 298
Score = 145 bits (367), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 77/160 (48%), Positives = 103/160 (64%), Gaps = 3/160 (1%)
Query: 34 SLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SLL+GS++MALCYI R+ R + + RIL L GS + QY+ MN F+AQ+ +
Sbjct: 121 SLLAGSMAMALCYISRLQRNVAAGVKMHSRILVLTGSNECSSQYMTFMNVFFTAQKLGIT 180
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
ID+C L + + LQQ IT G K QLDGL QYLL +F R+ L LP P V
Sbjct: 181 IDTCALD-KTLSLLQQGCDITTGQFLKVTQLDGLLQYLLWVFLPAPQMRHKLVLPPPPKV 239
Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
D+RASCFCH+ ID+GY+CSVCLS++CK+ C+TC ++F
Sbjct: 240 DYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279
>gi|195470427|ref|XP_002087508.1| GE15755 [Drosophila yakuba]
gi|194173609|gb|EDW87220.1| GE15755 [Drosophila yakuba]
Length = 299
Score = 145 bits (367), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 77/160 (48%), Positives = 103/160 (64%), Gaps = 3/160 (1%)
Query: 34 SLLSGSLSMALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SLL+GS+SMALCYI R+ R+ + RIL + GS + QY+ MN F+AQ+ +
Sbjct: 121 SLLAGSMSMALCYISRLQRNVAPGVKMHSRILVITGSNECASQYMTFMNVFFTAQKLGIT 180
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
ID+C L + + LQQ IT G K QLDGL QYLL +F R+ L LP P V
Sbjct: 181 IDTCALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPPPPKV 239
Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
D+RASCFCH+ ID+GY+CSVCLS++CK+ C+TC ++F
Sbjct: 240 DYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279
>gi|354491442|ref|XP_003507864.1| PREDICTED: general transcription factor IIH subunit 3-like isoform
1 [Cricetulus griseus]
Length = 309
Score = 145 bits (367), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 75/172 (43%), Positives = 113/172 (65%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIM 79
+ K + +G+ +LL+GSL+ ALCYI R+ ++ + + RIL ++ + D QY+ M
Sbjct: 120 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKAVKDNQEMKSRILVIKAAEDSALQYMNFM 179
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 180 NVIFAAQKQNILIDACILDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQ 238
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP P+ VD+RA+CFCH++ I++GY+CSVCLSI+C C+TC + F
Sbjct: 239 RSQLILPPPIHVDYRAACFCHRSLIEIGYVCSVCLSIFCNFSPICTTCETAF 290
>gi|354491444|ref|XP_003507865.1| PREDICTED: general transcription factor IIH subunit 3-like isoform
2 [Cricetulus griseus]
gi|344248744|gb|EGW04848.1| General transcription factor IIH subunit 3 [Cricetulus griseus]
Length = 313
Score = 145 bits (367), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 75/174 (43%), Positives = 114/174 (65%), Gaps = 3/174 (1%)
Query: 20 EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVA 77
+ + K + +G+ +LL+GSL+ ALCYI R+ ++ + + RIL ++ + D QY+
Sbjct: 122 DLMTKSDIKGQHTETLLAGSLAKALCYIHRMNKAVKDNQEMKSRILVIKAAEDSALQYMN 181
Query: 78 IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D
Sbjct: 182 FMNVIFAAQKQNILIDACILDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQ 240
Query: 138 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP P+ VD+RA+CFCH++ I++GY+CSVCLSI+C C+TC + F
Sbjct: 241 DQRSQLILPPPIHVDYRAACFCHRSLIEIGYVCSVCLSIFCNFSPICTTCETAF 294
>gi|190358598|ref|NP_001121894.1| uncharacterized protein LOC100151595 [Danio rerio]
Length = 296
Score = 145 bits (367), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 79/181 (43%), Positives = 110/181 (60%), Gaps = 6/181 (3%)
Query: 13 EEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPD 70
EE N E + E G +LL+ SL+ ALCYI R+ + + RIL ++ + D
Sbjct: 101 EEIRNLME---RAEVSGSQTETLLAVSLARALCYINRISKDVQAGQEVKSRILVIKAAED 157
Query: 71 GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL 130
QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ + Q L QYLL
Sbjct: 158 STSQYMNFMNVIFAAQKKNILIDACVLDS-DSGLLQQACDITGGLYLRVPQKVALTQYLL 216
Query: 131 TIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSV 190
+F D R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC +
Sbjct: 217 WVFLPDAEQRSQLLLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETA 276
Query: 191 F 191
F
Sbjct: 277 F 277
>gi|409045010|gb|EKM54491.1| hypothetical protein PHACADRAFT_258371 [Phanerochaete carnosa
HHB-10118-sp]
Length = 360
Score = 145 bits (367), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 75/190 (39%), Positives = 108/190 (56%), Gaps = 13/190 (6%)
Query: 12 LEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLH----------PQPR 61
++ +++ E LG+ E E I G+L+ A+CYI RV P PR
Sbjct: 99 MKHIVSEVEALGEPEVEEPIG---FVGALTKAMCYINRVTNGSTSSTSRNDDLATLPDPR 155
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
IL + SPD Y+ +MN IFSAQ+ V ID C + S FLQQA+++TGG + +
Sbjct: 156 ILVISVSPDQSSSYIPVMNLIFSAQKLKVTIDVCKIFGGESVFLQQAAHLTGGSYIDIDR 215
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
D + QYL+ F R + +P +DFRA+CFCHKN +D+G++CSVCLSI+C+ +
Sbjct: 216 PDAILQYLMMSFLPPPGLRQLISVPTQDKIDFRAACFCHKNIVDIGFVCSVCLSIFCQPV 275
Query: 182 KKCSTCGSVF 191
CSTC + F
Sbjct: 276 PVCSTCRTKF 285
>gi|194759192|ref|XP_001961833.1| GF14734 [Drosophila ananassae]
gi|190615530|gb|EDV31054.1| GF14734 [Drosophila ananassae]
Length = 297
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 80/162 (49%), Positives = 106/162 (65%), Gaps = 7/162 (4%)
Query: 34 SLLSGSLSMALCYIQRVFR---SGL-LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
SLL+GS+SMALCYI R+ R SG+ +H RIL L GS + QY+ MN F+AQ+
Sbjct: 120 SLLAGSMSMALCYISRLQRNVTSGVKMHS--RILVLTGSNECASQYMTFMNVFFTAQKLG 177
Query: 90 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
+ ID+C L + + LQQ IT G K QL+GL QYLL +F R+ L LP P
Sbjct: 178 IVIDTCALD-KTLSLLQQGCDITSGQFLKVTQLEGLLQYLLWVFLPAPQMRHKLVLPPPP 236
Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
VD+RASCFCH+ ID+GY+CSVCLS++CK+ C+TC ++F
Sbjct: 237 KVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 278
>gi|332375937|gb|AEE63109.1| unknown [Dendroctonus ponderosae]
Length = 300
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 119/208 (57%), Gaps = 20/208 (9%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-----QPR 61
T+ QNL E + + E SLL+G+++MALCYI R+ R+ P R
Sbjct: 105 TVKQNLAELLASESSTAVTE-------SLLAGAIAMALCYIARLQRT---KPPGSKLNSR 154
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
IL + GS D QY+ MN F+AQ+ V +D C L Q+ + LQQ ITGG++ K Q
Sbjct: 155 ILVVSGSGDSASQYMNYMNVFFTAQKLNVVLDVCALD-QHLSLLQQCCDITGGMYLKVPQ 213
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
+ L Q+LL +F + R L LP PV VD+RA+CFCH+ ID+G++CSVCLSI+CK
Sbjct: 214 ITALLQFLLWVFLPEPPIREKLVLPPPVKVDYRAACFCHRELIDIGFVCSVCLSIFCKFS 273
Query: 182 KKCSTCGSVFGQAQTQSDEPSATNRKRK 209
C+TC VF + +P +KRK
Sbjct: 274 PICTTCHMVFKIPGPLAVKP----KKRK 297
>gi|47226419|emb|CAG08435.1| unnamed protein product [Tetraodon nigroviridis]
Length = 299
Score = 145 bits (365), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/170 (45%), Positives = 106/170 (62%), Gaps = 3/170 (1%)
Query: 24 KQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIMNA 81
K E G +LL+GSL+ ALCYI RV + + + RIL ++ + D QY+ MN
Sbjct: 114 KTEVRGNSNDTLLAGSLAKALCYINRVSKELEVGQETKSRILVIKAAEDSALQYMNFMNV 173
Query: 82 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 141
IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q L QYLL +F D R+
Sbjct: 174 IFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKIPQKLALAQYLLWVFLPDTEQRS 232
Query: 142 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
L LP VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 233 QLLLPPRAHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 282
>gi|195388531|ref|XP_002052933.1| GJ17829 [Drosophila virilis]
gi|194149390|gb|EDW65088.1| GJ17829 [Drosophila virilis]
Length = 300
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 82/178 (46%), Positives = 109/178 (61%), Gaps = 7/178 (3%)
Query: 20 EQLGK---QEPEGRIAC-SLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPE 73
+QLG P +C SLL+GS+SMALCYI R+ R + + RIL L GS +
Sbjct: 104 QQLGSILMNAPRLSASCESLLAGSMSMALCYISRLQRNVAAGVKMHSRILVLTGSNECAS 163
Query: 74 QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIF 133
QY+ MN F+AQ+ + ID+C L + + LQQ IT G K QLDGL QYLL +F
Sbjct: 164 QYMTYMNVFFTAQKLGIVIDTCALD-KTLSLLQQGCDITNGQFLKVTQLDGLLQYLLWVF 222
Query: 134 GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP VD+RASCFCH+ ID+GY+CSVCLS++CK+ C+TC ++F
Sbjct: 223 LPSPQMRHKLVLPPTPKVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 280
>gi|17945214|gb|AAL48665.1| RE13149p [Drosophila melanogaster]
Length = 299
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 78/162 (48%), Positives = 104/162 (64%), Gaps = 7/162 (4%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
SLL+GS+SMALCYI R+ R+ L P RIL + GS + QY+ +N F+AQ+
Sbjct: 121 SLLAGSMSMALCYISRLQRN--LAPGVKMHSRILVVTGSNECASQYMTFLNVFFTAQKLG 178
Query: 90 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
+ ID+C L + + LQQ IT G K QLDGL QYLL +F R+ L LP P
Sbjct: 179 ITIDTCALD-KTLSLLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPAPQIRHKLVLPPPP 237
Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
VD+RASCFCH+ ID+GY+CSVCLS++CK+ C+TC ++F
Sbjct: 238 KVDYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279
>gi|410922922|ref|XP_003974931.1| PREDICTED: general transcription factor IIH subunit 3-like
[Takifugu rubripes]
Length = 298
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/170 (45%), Positives = 105/170 (61%), Gaps = 3/170 (1%)
Query: 24 KQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIMNA 81
K E G +LL+GSL+ ALCYI RV + + + RIL ++ + D QY+ MN
Sbjct: 113 KTEVRGNSNDTLLAGSLAKALCYINRVSKEMEVGQETKSRILVIKAAEDCALQYMNFMNV 172
Query: 82 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 141
IF+AQ+ + ID+C L +S LQQA ITGG++ K Q L QYLL +F D R+
Sbjct: 173 IFAAQKQNILIDACVLD-MDSGLLQQACDITGGLYLKIPQKAALAQYLLWVFLPDTEQRS 231
Query: 142 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
L LP VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 232 QLLLPPRAHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 281
>gi|348528468|ref|XP_003451739.1| PREDICTED: general transcription factor IIH subunit 3-like
[Oreochromis niloticus]
Length = 300
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 108/172 (62%), Gaps = 3/172 (1%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIM 79
+ K E G +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ M
Sbjct: 111 MSKIEVTGNSTDTLLAGSLAKALCYIHRLTKELEVGQEIKSRILVVKAAEDCALQYMNFM 170
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHS 139
N IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q L QYLL +F D
Sbjct: 171 NVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKIPQKVALAQYLLWVFLPDSEQ 229
Query: 140 RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R+ L LP P VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 230 RSQLVLPPPAHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 281
>gi|170028289|ref|XP_001842028.1| TFIIH basal transcription factor complex p34 subunit [Culex
quinquefasciatus]
gi|167874183|gb|EDS37566.1| TFIIH basal transcription factor complex p34 subunit [Culex
quinquefasciatus]
Length = 292
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 86/191 (45%), Positives = 113/191 (59%), Gaps = 13/191 (6%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILC 64
T+ Q L +N+ +L SLL+GS++MALCYI R+ R+ + RIL
Sbjct: 92 TIKQKLARMINEAPRLAAPTE------SLLAGSMAMALCYIARINRNKPAGVKINSRILV 145
Query: 65 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
+ GS + QY+ MN F+AQ+ V +D C L S LQQ ITGG + K QLDG
Sbjct: 146 VTGSNECASQYMTYMNVFFTAQKQNVTLDVCALDKPLS-LLQQGCDITGGQYLKLPQLDG 204
Query: 125 LFQYLLTIFGTD--LHS--RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
QYLL +F D +H R L LP PV VD+RA+CFCH+ ID+GY+CSVCLSI+CK
Sbjct: 205 FLQYLLWVFLPDPQMHPQMRCKLVLPPPVKVDYRAACFCHRELIDIGYVCSVCLSIFCKF 264
Query: 181 LKKCSTCGSVF 191
C+TC +VF
Sbjct: 265 SPICTTCHTVF 275
>gi|195159598|ref|XP_002020665.1| GL15043 [Drosophila persimilis]
gi|194117615|gb|EDW39658.1| GL15043 [Drosophila persimilis]
Length = 298
Score = 144 bits (363), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/160 (47%), Positives = 102/160 (63%), Gaps = 3/160 (1%)
Query: 34 SLLSGSLSMALCYIQRV--FRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SLL+GS++MALCYI R+ + + RIL L GS + QY+ MN F+AQ+ +
Sbjct: 121 SLLAGSMAMALCYISRLHGMLAAGVKMHSRILVLTGSNECSSQYMTFMNVFFTAQKLGIT 180
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
ID+C L + + LQQ IT G K QLDGL QYLL +F R+ L LP P V
Sbjct: 181 IDTCALD-KTLSLLQQGCDITTGQFLKVTQLDGLLQYLLWVFLPAPQMRHKLVLPPPPKV 239
Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
D+RASCFCH+ ID+GY+CSVCLS++CK+ C+TC ++F
Sbjct: 240 DYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 279
>gi|50540194|ref|NP_001002564.1| general transcription factor IIH, polypeptide 3 [Danio rerio]
gi|49900709|gb|AAH76236.1| General transcription factor IIH, polypeptide 3 [Danio rerio]
Length = 296
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 79/181 (43%), Positives = 109/181 (60%), Gaps = 6/181 (3%)
Query: 13 EEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPD 70
EE N E + E G +LL+ SL+ ALCYI R+ + + RIL ++ + D
Sbjct: 101 EEIRNLME---RAEVSGSQTETLLAVSLARALCYINRISKDVQAGQEVKSRILVIKAAED 157
Query: 71 GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL 130
QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ + Q L QYLL
Sbjct: 158 STSQYMNFMNVIFAAQKKNILIDACVLDS-DSGLLQQACDITGGLYLRVPQKVALTQYLL 216
Query: 131 TIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSV 190
F D R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC +
Sbjct: 217 WAFLPDAEQRSQLLLPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETA 276
Query: 191 F 191
F
Sbjct: 277 F 277
>gi|449279312|gb|EMC86947.1| General transcription factor IIH subunit 3, partial [Columba livia]
Length = 306
Score = 144 bits (362), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 79/188 (42%), Positives = 112/188 (59%), Gaps = 15/188 (7%)
Query: 10 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP------QPRIL 63
+ +++ M K E G+Q + L+GSL+ ALC I + S P + RIL
Sbjct: 109 EEIKDLMTKTEMKGQQTE------TQLAGSLAKALCCILLIVESS--KPLSNQEIKSRIL 160
Query: 64 CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 123
++ + D QY+ MN IF+AQ+ V ID+C L + +S LQQA ITGG++ K +
Sbjct: 161 VIKAAEDSALQYMNFMNVIFAAQKQSVLIDACVLES-DSGLLQQACDITGGIYLKVLHMP 219
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 183
L QYLL +F D R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C
Sbjct: 220 SLLQYLLWVFLPDQEQRSQLVLPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPI 279
Query: 184 CSTCGSVF 191
CSTC + F
Sbjct: 280 CSTCETAF 287
>gi|426197233|gb|EKV47160.1| hypothetical protein AGABI2DRAFT_204058 [Agaricus bisporus var.
bisporus H97]
Length = 361
Score = 144 bits (362), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 72/165 (43%), Positives = 103/165 (62%), Gaps = 7/165 (4%)
Query: 34 SLLSGSLSMALCYIQRVFRSGL-------LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ 86
S L G+++ ALCY+ R+ S + P PR+L L SPD Y+ IMN++FSAQ
Sbjct: 111 SALVGAITKALCYVNRLAISPSSAAPAEHVLPDPRLLILSVSPDLATSYIPIMNSVFSAQ 170
Query: 87 RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 146
+ V ID C + ++ FLQQA+++TGG + ++ D L QYL+ F R L +P
Sbjct: 171 KLKVTIDVCQVFGPDTVFLQQAAHLTGGSYIHLERRDALLQYLIMAFLPPPPIRKVLAVP 230
Query: 147 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
VDFRA+CFCHKN +D+G++CSVCLSI+C+ + CSTC + F
Sbjct: 231 TQDKVDFRAACFCHKNIVDIGFVCSVCLSIFCQPVPVCSTCRTKF 275
>gi|195118052|ref|XP_002003554.1| GI21952 [Drosophila mojavensis]
gi|193914129|gb|EDW12996.1| GI21952 [Drosophila mojavensis]
Length = 299
Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 75/160 (46%), Positives = 102/160 (63%), Gaps = 3/160 (1%)
Query: 34 SLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SLL+GS+SMALCYI R+ R+ + RIL + GS + QY+ MN F+AQ+ +
Sbjct: 122 SLLAGSMSMALCYISRLQRNVPAGVKMHSRILVITGSNECASQYMTFMNVFFTAQKLGIV 181
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
+D+C L + + LQQ IT G K QLDGL QYLL +F R+ L LP V
Sbjct: 182 VDTCALD-KTLSLLQQGCDITNGQFLKVNQLDGLLQYLLWVFLPSPQMRHKLVLPPTPKV 240
Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
D+RASCFCH+ ID+GY+CSVCLS++CK+ C+TC ++F
Sbjct: 241 DYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTIF 280
>gi|409080333|gb|EKM80693.1| hypothetical protein AGABI1DRAFT_119282 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 361
Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 72/165 (43%), Positives = 103/165 (62%), Gaps = 7/165 (4%)
Query: 34 SLLSGSLSMALCYIQRVFRSGL-------LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ 86
S L G+++ ALCY+ R+ S + P PR+L L SPD Y+ IMN++FSAQ
Sbjct: 111 SALVGAITKALCYVNRLAISPSSAAPAEHVLPDPRLLILSVSPDLATSYIPIMNSVFSAQ 170
Query: 87 RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 146
+ V ID C + ++ FLQQA+++TGG + ++ D L QYL+ F R L +P
Sbjct: 171 KLKVTIDVCQVFGPDTVFLQQAAHLTGGSYIHLERRDALLQYLIMAFLPPPPIRKVLAVP 230
Query: 147 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
VDFRA+CFCHKN +D+G++CSVCLSI+C+ + CSTC + F
Sbjct: 231 TQDKVDFRAACFCHKNIVDIGFVCSVCLSIFCQPVPVCSTCRTKF 275
>gi|389741855|gb|EIM83043.1| Tfb4-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 400
Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 78/188 (41%), Positives = 106/188 (56%), Gaps = 18/188 (9%)
Query: 12 LEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVF--------RSGLLHPQPRIL 63
L E NK E+ EP L G+L+ ALCYI R+ H PRIL
Sbjct: 110 LLEAQNKPEE----EPVA------LVGALTKALCYINRITHVPTANAPEDPAAHVDPRIL 159
Query: 64 CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 123
L SPD Y+ +MN+IFSAQ+ V ID C + ++ FLQQ +++T G + ++ D
Sbjct: 160 ILSVSPDLSTAYIPVMNSIFSAQKLKVTIDVCKVYGPDTVFLQQTAHLTAGSYIYLERRD 219
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 183
L QYL+ F R L +PK +DFRA+CFCHKN +D+G++CSVCLSI+C +
Sbjct: 220 ALLQYLIMSFLPPPALRQILAVPKQDKIDFRAACFCHKNIVDIGFVCSVCLSIFCAPVPV 279
Query: 184 CSTCGSVF 191
CSTC + F
Sbjct: 280 CSTCRTKF 287
>gi|351698565|gb|EHB01484.1| General transcription factor IIH subunit 3, partial [Heterocephalus
glaber]
Length = 315
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 75/166 (45%), Positives = 108/166 (65%), Gaps = 3/166 (1%)
Query: 28 EGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSA 85
+G+ +LL+GSL+ ALCYI R+ + + + RIL ++ + D QY+ MN IF+A
Sbjct: 132 KGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNVIFAA 191
Query: 86 QRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQL 145
Q+ + ID+C L + +S LQQA ITGG++ K Q+ L QYLL +F D R L L
Sbjct: 192 QKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMASLLQYLLWVFLPDQDQRAQLIL 250
Query: 146 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
P P+ VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 251 PPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 296
>gi|345497965|ref|XP_001603853.2| PREDICTED: general transcription factor IIH subunit 3-like [Nasonia
vitripennis]
Length = 295
Score = 143 bits (360), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 81/166 (48%), Positives = 104/166 (62%), Gaps = 9/166 (5%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLL--HPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SL+SG+LSMALCYI R+ R H PRIL + S D QY+ MN F+AQ+ V
Sbjct: 110 SLISGALSMALCYIARMEREKYAGEHLHPRILVITASHDSATQYMNYMNIFFTAQKMGVM 169
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHK-PQQ-----LDGLFQYLLTIFGTDLHSRNFLQL 145
+D C L + + LQQ +TGG + K PQQ L GL QYLL +F D R+ L L
Sbjct: 170 LDVCSLDHELT-LLQQGCDLTGGNYLKIPQQSGQPPLAGLLQYLLWVFLPDPAVRSKLVL 228
Query: 146 PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
P PV VD+RA+CFCH+ +D G++CSVCLSI+CK C+TC +VF
Sbjct: 229 PPPVKVDYRAACFCHQELVDTGFVCSVCLSIFCKFSPICTTCHTVF 274
>gi|392570135|gb|EIW63308.1| transcription factor Tfb4 [Trametes versicolor FP-101664 SS1]
Length = 329
Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 114/199 (57%), Gaps = 16/199 (8%)
Query: 3 SLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVF-RSGLLHPQ-- 59
+L AT++++L E + ++ P G + C ++ ALCYI R+ + P
Sbjct: 93 TLDATVMKSLAERLESFGDAAEEAPIGIVGC------MTKALCYINRITSNTTAAGPNED 146
Query: 60 -------PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
PRIL L SPD Y+ IMN+IFSAQ+ V ID C + ++ FLQQA+++T
Sbjct: 147 GPTASIDPRILVLSVSPDQSSTYIPIMNSIFSAQKLKVTIDVCKIFGADNVFLQQAAHLT 206
Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 172
GG + + + D L QY++ F R + +P VDFRA+CFCHK +++G++CSV
Sbjct: 207 GGAYVQVDRTDALLQYMMMSFLPPPAIRQLIAVPTQDRVDFRAACFCHKRIVEIGFVCSV 266
Query: 173 CLSIYCKHLKKCSTCGSVF 191
CLSI+C+ + CSTC + F
Sbjct: 267 CLSIFCQPVPVCSTCRTKF 285
>gi|412993660|emb|CCO14171.1| predicted protein [Bathycoccus prasinos]
Length = 370
Score = 142 bits (359), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 69/167 (41%), Positives = 105/167 (62%), Gaps = 11/167 (6%)
Query: 36 LSGSLSMALCYIQRVFRSGLLHP--QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
++G++S+AL Y R+ R L + R+L +Q S D EQYV +MNA+F+AQ S + ID
Sbjct: 203 IAGAISLALTYANRIERDPSLRGRVKSRVLVVQASEDDAEQYVPMMNAMFAAQSSNILID 262
Query: 94 SCYLGAQNSAFLQQASYITGGVHHKPQQ---------LDGLFQYLLTIFGTDLHSRNFLQ 144
L + S+FLQQA++++GG++ KP++ + L + L F D ++R FL+
Sbjct: 263 CLCLNEKESSFLQQAAFVSGGIYAKPEKSKKGGGEETREDLLERALCSFLPDRYTRRFLR 322
Query: 145 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
P+ +DFR+SCFCH N I G++CSVCLS++CK C TCG+ F
Sbjct: 323 PPRAGSIDFRSSCFCHGNRISKGFVCSVCLSVFCKKEGACQTCGAAF 369
>gi|224156833|ref|XP_002337765.1| predicted protein [Populus trichocarpa]
gi|222869675|gb|EEF06806.1| predicted protein [Populus trichocarpa]
Length = 77
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 65/77 (84%), Positives = 71/77 (92%)
Query: 79 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
MNAIFSAQRSMVPIDSCY+GA NSAFLQQASYITGGV+ KPQ LDGLFQYL T+F TDLH
Sbjct: 1 MNAIFSAQRSMVPIDSCYVGAHNSAFLQQASYITGGVYVKPQHLDGLFQYLTTVFATDLH 60
Query: 139 SRNFLQLPKPVGVDFRA 155
SR+F+QLP+P GVDFRA
Sbjct: 61 SRSFIQLPRPAGVDFRA 77
>gi|392593946|gb|EIW83271.1| transcription factor Tfb4 [Coniophora puteana RWD-64-598 SS2]
Length = 327
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 74/167 (44%), Positives = 100/167 (59%), Gaps = 9/167 (5%)
Query: 33 CSLLSGSLSMALCYIQRVFRSGLLH--------PQPRILCLQGSPDGPEQYVAIMNAIFS 84
CSL+S L+ ALCYI R+ P PRIL L SPD Y+ IMN IFS
Sbjct: 117 CSLVSA-LAKALCYINRITHPSTSAGPEDPNELPDPRILILSVSPDLSSSYIPIMNCIFS 175
Query: 85 AQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQ 144
AQ+ V ID C + + FLQQA+++T G + ++ D L QYL+ F R L
Sbjct: 176 AQKLKVTIDVCKVYGPETVFLQQAAHLTEGSYIHLERRDALLQYLVMSFLAPPSVRGVLA 235
Query: 145 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+P+ +DFRA+CFCHKN +D+G++CSVCLSI+C+ + CSTC + F
Sbjct: 236 VPRQERIDFRAACFCHKNIVDIGFVCSVCLSIFCQPVPVCSTCRTKF 282
>gi|195437728|ref|XP_002066792.1| GK24367 [Drosophila willistoni]
gi|194162877|gb|EDW77778.1| GK24367 [Drosophila willistoni]
Length = 303
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 77/160 (48%), Positives = 100/160 (62%), Gaps = 3/160 (1%)
Query: 34 SLLSGSLSMALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SLL+GS+ ALCYI R+ R+ + R+L L GS + QY+ MN F+AQ+ +
Sbjct: 127 SLLAGSIGRALCYISRLQRNTPPGIKMHSRVLVLTGSNECSSQYMTFMNVFFTAQKLGIV 186
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
ID+C L S LQQ IT G K QLDGL QYLL +F R+ L LP P V
Sbjct: 187 IDTCSLENPVS-LLQQGCDITSGQFLKVTQLDGLLQYLLWVFLPSPQMRHKLVLPPPPKV 245
Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
D+RASCFCH+ ID+GY+CSVCLS++CK+ C+TC +VF
Sbjct: 246 DYRASCFCHRELIDIGYVCSVCLSVFCKYSPICTTCHTVF 285
>gi|432095027|gb|ELK26416.1| General transcription factor IIH subunit 3, partial [Myotis
davidii]
Length = 277
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/182 (41%), Positives = 112/182 (61%), Gaps = 17/182 (9%)
Query: 7 TLLQNLEEFMNKDEQLGKQ-EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCL 65
+++ +++ M K + G+ EP LL+GSL+ ALCY Q + + RIL +
Sbjct: 111 VIVEEIKDLMTKSDIKGQHTEP-------LLAGSLAKALCYNQEM--------KSRILVI 155
Query: 66 QGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGL 125
+ + D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+ L
Sbjct: 156 KAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSL 214
Query: 126 FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCS 185
QYLL +F D R+ L LP P+ VD+RA+C CH+N I++GY+CSVCLSI+C C+
Sbjct: 215 LQYLLWVFLPDQDQRSQLTLPPPIHVDYRAACLCHRNLIEIGYVCSVCLSIFCNFSPICT 274
Query: 186 TC 187
TC
Sbjct: 275 TC 276
>gi|198432365|ref|XP_002120803.1| PREDICTED: similar to general transcription factor IIH, polypeptide
3, 34kDa [Ciona intestinalis]
Length = 309
Score = 141 bits (355), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/164 (43%), Positives = 103/164 (62%), Gaps = 7/164 (4%)
Query: 34 SLLSGSLSMALCYIQR--VFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SL++GS+++ALCYI R + G RIL ++ + D QY++ MN IF+AQ+ V
Sbjct: 126 SLIAGSMAIALCYIHRMQIESQGNTKLNARILVIKAADDAASQYMSFMNVIFAAQKERVV 185
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN----FLQLPK 147
+D+C LG Q+S LQQA +T G++ K D QYLLT F T+ R Q +
Sbjct: 186 VDACILG-QDSGLLQQACDMTSGMYLKVPHQDAFLQYLLTAFLTEAELRKESLVLAQSSR 244
Query: 148 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
P VD+RA+CFCH++ +D+G++CSVCLSI+C+ CSTC + F
Sbjct: 245 PREVDYRAACFCHRSLVDVGFVCSVCLSIFCQFSPICSTCETAF 288
>gi|393217355|gb|EJD02844.1| Tfb4-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 378
Score = 139 bits (351), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 68/165 (41%), Positives = 102/165 (61%), Gaps = 9/165 (5%)
Query: 36 LSGSLSMALCYIQRVFRSG---------LLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ 86
L G+L+ ALC+I R+ ++ P+PRI+ L SPD Y+ +MN+IFSAQ
Sbjct: 121 LVGALTKALCHINRLVNPTATSIDSDVPVVPPEPRIVILSVSPDLSTSYIPLMNSIFSAQ 180
Query: 87 RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 146
+ V ID C + ++ FLQQA+++TGG + ++ D L Q+LL F R + +P
Sbjct: 181 KLKVAIDVCKIYGPDAVFLQQAAHLTGGSYIYLERRDALLQHLLMTFLPTPSIRQLIAVP 240
Query: 147 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+DFRA+CFCHK +D+G++CSVCLSI+C+ + CSTC + F
Sbjct: 241 TQDKIDFRAACFCHKTIVDIGFVCSVCLSIFCQPVPVCSTCRTKF 285
>gi|393244851|gb|EJD52362.1| transcription factor Tfb4 [Auricularia delicata TFB-10046 SS5]
Length = 338
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 113/199 (56%), Gaps = 25/199 (12%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP------ 60
T+++ + + ++ + ++EP L G+L+ ALC+I R+ HP P
Sbjct: 95 TVMKRVIDELSSLGDITEEEPTS------LVGALTKALCHINRI-----THPPPPANPSE 143
Query: 61 --------RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
RIL L SPD Y+ +MN+IFSAQ+ ID C + ++ FLQQA+++T
Sbjct: 144 SPPTPPEARILILSVSPDLSTSYIPLMNSIFSAQKLKAIIDVCKVFGPDAVFLQQAAHLT 203
Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 172
GG + + + L QYL+ F + R+ L +P +DFRA+CFCHKN ID+GY+CSV
Sbjct: 204 GGSYVYLLEREALLQYLIMSFLSPPMVRSILSVPTQEKIDFRAACFCHKNIIDVGYVCSV 263
Query: 173 CLSIYCKHLKKCSTCGSVF 191
CLSI+C + CSTC + F
Sbjct: 264 CLSIFCTPVAVCSTCRTKF 282
>gi|345564048|gb|EGX47029.1| hypothetical protein AOL_s00097g75 [Arthrobotrys oligospora ATCC
24927]
Length = 374
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 81/194 (41%), Positives = 109/194 (56%), Gaps = 36/194 (18%)
Query: 27 PEGRIAC---SLLSGSLSMALCYIQRVFRSGL---------------------------- 55
PE +AC S L+G+LS AL YI R+ G
Sbjct: 136 PEAVMACPNSSYLAGALSAALTYINRICFGGTGGRSSGVPGDEKSKVNSAGGKVDETGRT 195
Query: 56 LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGV 115
+H +IL + +PD QY+ +MN+IF+AQR VPID C L ++ FLQQA+ TGGV
Sbjct: 196 MHA--KILVVSVTPDPANQYIPVMNSIFAAQRLKVPIDVCKL-RDSTVFLQQAADATGGV 252
Query: 116 HHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS 175
+ +P+ GL QYL+ F D +R L LP V VDFRA+CFCHK +D+G++CSVCLS
Sbjct: 253 YMEPEHPQGLIQYLMMGFLPDHLARQSLILPTKVDVDFRAACFCHKKVLDIGFVCSVCLS 312
Query: 176 IYCKHLKK--CSTC 187
I+C+ + CSTC
Sbjct: 313 IFCEPPQGAVCSTC 326
>gi|451848115|gb|EMD61421.1| hypothetical protein COCSADRAFT_173760 [Cochliobolus sativus
ND90Pr]
Length = 377
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 83/231 (35%), Positives = 121/231 (52%), Gaps = 38/231 (16%)
Query: 6 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQR---------------- 49
+T+LQN + +N+ K+ +L+ G+LS+AL +I +
Sbjct: 126 STILQNFVKLLNET----KESHLEATPTTLIGGALSLALTHINKQTILHAPTAASAESAS 181
Query: 50 -----------VFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG 98
V R L RIL + S D QY+ +MN+IF+AQR +PID L
Sbjct: 182 LAALADSENTHVDRVSL---TSRILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL- 237
Query: 99 AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCF 158
A ++ LQQAS TGGV+ KP++ +GL QYL+ F D+ +R L +P GVDFRA+CF
Sbjct: 238 AGDTVLLQQASDATGGVYMKPERPEGLLQYLMMAFLPDVTARASLVVPSAGGVDFRAACF 297
Query: 159 CHKNTIDMGYICSVCLSIYCK---HLKKCSTCGSVFGQAQTQSDEPSATNR 206
CH+ +D+G++CSVCLSI+C C TCGS ++ P+ R
Sbjct: 298 CHRKVVDIGFVCSVCLSIFCSPDLPDNLCLTCGSYLSLRSAMTNPPALIPR 348
>gi|451999247|gb|EMD91710.1| hypothetical protein COCHEDRAFT_1136566 [Cochliobolus
heterostrophus C5]
Length = 377
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 83/231 (35%), Positives = 121/231 (52%), Gaps = 38/231 (16%)
Query: 6 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQR---------------- 49
+T+LQN + +N+ K+ +L+ G+LS+AL +I +
Sbjct: 126 STILQNFVKLLNET----KESHLEATPTTLIGGALSLALTHINKQTILHAPTAASAESAS 181
Query: 50 -----------VFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG 98
V R L RIL + S D QY+ +MN+IF+AQR +PID L
Sbjct: 182 LAALADSENTHVDRVSL---TSRILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL- 237
Query: 99 AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCF 158
A ++ LQQAS TGGV+ KP++ +GL QYL+ F D+ +R L +P GVDFRA+CF
Sbjct: 238 AGDTVLLQQASDATGGVYMKPERPEGLLQYLMMAFLPDVTARASLVVPSAGGVDFRAACF 297
Query: 159 CHKNTIDMGYICSVCLSIYCKHL---KKCSTCGSVFGQAQTQSDEPSATNR 206
CH+ +D+G++CSVCLSI+C C TCGS ++ P+ R
Sbjct: 298 CHRKVVDIGFVCSVCLSIFCSPDLPGNLCLTCGSYLSLRSAMTNPPALIPR 348
>gi|427787947|gb|JAA59425.1| Putative transcription factor tfb4 [Rhipicephalus pulchellus]
Length = 293
Score = 136 bits (342), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 77/174 (44%), Positives = 109/174 (62%), Gaps = 3/174 (1%)
Query: 20 EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVA 77
E + + E +++ +LL+G LS+ALCYI R+ + +G RIL L S + QY+
Sbjct: 100 ELVSSDKGESQVSETLLTGGLSLALCYINRIEKETAGQKKIASRILVLSASGESASQYLN 159
Query: 78 IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
MN F+AQ+ V ID+C L ++S LQQ ITGG + K GL +YLL +F D
Sbjct: 160 FMNVFFTAQKKNVIIDACVL-EKDSGLLQQGCDITGGKYMKVPNHTGLLEYLLWLFLPDK 218
Query: 138 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+R++L P PV VD+RA+CFCH+N I++GY+CSVCLSI+C CSTC + F
Sbjct: 219 ATRDYLVFPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCAFSPICSTCQTAF 272
>gi|449546921|gb|EMD37890.1| hypothetical protein CERSUDRAFT_136815 [Ceriporiopsis subvermispora
B]
Length = 331
Score = 135 bits (341), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 71/165 (43%), Positives = 98/165 (59%), Gaps = 9/165 (5%)
Query: 36 LSGSLSMALCYIQRVF-------RSG--LLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQ 86
L G+++ +LCYI R+ R+G + PRIL SPD Y+ IMN+IFSAQ
Sbjct: 120 LVGAITKSLCYINRLTIPSSSTTRAGEQAVPLDPRILIFSVSPDQSSSYIPIMNSIFSAQ 179
Query: 87 RSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 146
+ V ID C + +S FLQQA+++TGG + + D L QYL F R L +P
Sbjct: 180 KLKVTIDVCKIYGSDSVFLQQAAHLTGGSYLHLDRPDALLQYLTMSFLPPPGIRQLLSVP 239
Query: 147 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+DFRA+CFCHK +D+G++CSVCLSI+C + CSTC + F
Sbjct: 240 TQDKIDFRAACFCHKRIVDIGFVCSVCLSIFCNPVPVCSTCRTKF 284
>gi|213409415|ref|XP_002175478.1| RNA polymerase II transcription factor B subunit 4
[Schizosaccharomyces japonicus yFS275]
gi|212003525|gb|EEB09185.1| RNA polymerase II transcription factor B subunit 4
[Schizosaccharomyces japonicus yFS275]
Length = 295
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 68/158 (43%), Positives = 99/158 (62%), Gaps = 3/158 (1%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
+++SG+LS AL YI R+ L + RIL S D QY++ MN IF AQ+ +PI+
Sbjct: 132 TMMSGALSRALAYINRIQTESPL--RSRILIFSLSGDVALQYISTMNCIFCAQKKNIPIN 189
Query: 94 SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 153
C L ++ + FL+QA TGG++ K ++ GL Q+L+ D + R + +P VDF
Sbjct: 190 VCSL-SKETLFLEQAVDATGGIYIKVEEPKGLLQHLMMSLFPDQNLRKHINIPNQANVDF 248
Query: 154 RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
RA+CFCHK +D+GY+CSVCLSI+C +C+TC S F
Sbjct: 249 RATCFCHKKILDIGYVCSVCLSIFCSPRDQCTTCHSTF 286
>gi|320167472|gb|EFW44371.1| general transcription factor IIH [Capsaspora owczarzaki ATCC 30864]
Length = 381
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)
Query: 25 QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP----QPRILCLQGSPDGPEQYVAIMN 80
+ P S L SLS+ALC+I + R+ L P + RIL + + D P YV +MN
Sbjct: 201 KPPMSSAMTSQLVSSLSVALCHINKTRRA--LPPGVVCRSRILVVHAAEDVPAHYVQLMN 258
Query: 81 AIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSR 140
IF+AQ+ V ID+ L +S FLQQA +T G + K Q D L Q L+ D +R
Sbjct: 259 TIFAAQKLGVIIDALILTPDDSGFLQQAVDLTKGAYIKLQSNDDLVQTLMMFGLADRAAR 318
Query: 141 NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
L LP+P VD+R++CFCH+ +D+G++CSVCLSI+C C TCG+ F
Sbjct: 319 ELLILPQPTVVDYRSACFCHRRIVDVGFVCSVCLSIFCDQRGLCPTCGTRF 369
>gi|357624804|gb|EHJ75440.1| putative transcription factor TFIIH-subunit [Danaus plexippus]
Length = 291
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/173 (43%), Positives = 106/173 (61%), Gaps = 4/173 (2%)
Query: 22 LGKQEP-EGRIACSLLSGSLSMALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAI 78
L K +P E R SLL+G+++M LC+I R+ R L R+L + GS D QY+
Sbjct: 103 LIKSQPQEERPGESLLAGAMAMGLCFIARMRREQPPGLRMVSRMLVVTGSSDTAAQYINY 162
Query: 79 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
MN F+AQ+ V +D C L ++ + LQQ ITGG++ K L+GL QYLL +F +
Sbjct: 163 MNVFFTAQKQQVLLDVCSLD-KHLSLLQQGCDITGGLYLKVPSLEGLLQYLLWVFLPEAS 221
Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R L LP VD+RA+CFCH+ + +GY+CSVCLS++CK C+TC +VF
Sbjct: 222 ERRELVLPGRGRVDYRAACFCHRELLTIGYVCSVCLSVFCKFSPICTTCHTVF 274
>gi|391332492|ref|XP_003740668.1| PREDICTED: general transcription factor IIH subunit 3-like
[Metaseiulus occidentalis]
Length = 297
Score = 135 bits (339), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/158 (44%), Positives = 97/158 (61%), Gaps = 3/158 (1%)
Query: 35 LLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 94
LL+G +++ALCYI + + + RIL L S + QY+ MN F+AQR + +DS
Sbjct: 119 LLTGGMALALCYIAKRMANTVGKLNGRILVLSSSGESAHQYLNFMNVFFAAQRKAIVVDS 178
Query: 95 CYLGAQNS-AFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 153
C + QN LQQ + ITGG + K + GL +YLL +F R L LPKP VD+
Sbjct: 179 CVI--QNDCGLLQQGADITGGNYLKAPSIAGLLEYLLWVFLPPKSLRAKLVLPKPDVVDY 236
Query: 154 RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
R +CFCH+N I++GY+CSVCLSI+C+ CSTC + F
Sbjct: 237 RTACFCHRNLIEVGYVCSVCLSIFCQFAPICSTCQTHF 274
>gi|321470747|gb|EFX81722.1| hypothetical protein DAPPUDRAFT_224195 [Daphnia pulex]
Length = 301
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 99/160 (61%), Gaps = 3/160 (1%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
SLL G+++ ALC+I ++ R L Q PRIL + GS D QY+ MN F+AQ+ V
Sbjct: 124 SLLGGAMARALCHIHKIQRELSLSHQLKPRILVVSGSSDSALQYMTFMNVFFTAQKENVV 183
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGV 151
ID C + + +S LQQ ITGG + + + GL +YLL +F R+ + LP P V
Sbjct: 184 IDCCMMDS-DSGLLQQGCDITGGQYLRIPTVVGLLEYLLWVFLPGPSCRSKIVLPPPTKV 242
Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
D+RA+CFCH +D+G++CSVCLSI+CK C+TC + F
Sbjct: 243 DYRAACFCHHKLVDIGWVCSVCLSIFCKFSIICTTCNTEF 282
>gi|225717796|gb|ACO14744.1| General transcription factor IIH subunit 3 [Caligus clemensi]
Length = 302
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 85/200 (42%), Positives = 113/200 (56%), Gaps = 10/200 (5%)
Query: 17 NKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYV 76
NK QL + E G S LSGSL+MAL YI R R + RIL L S D QY+
Sbjct: 106 NKFSQLLEAETSGS-EDSPLSGSLAMALAYINRK-RKESPELRARILILTASGDTASQYM 163
Query: 77 AIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTD 136
MN F+AQ+ V +D+C L + +S LQQ + ITGG++ L L Q+LL IF
Sbjct: 164 NYMNVFFTAQKLNVLLDACMLQS-DSPLLQQGADITGGLYFNVPDLAALLQFLLWIFLPS 222
Query: 137 LHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF----- 191
R L LP V VDFRA+CFCH+ + +G++CSVCLSI+CK C+TC +VF
Sbjct: 223 AEIRPQLGLPTGVKVDFRAACFCHRQLVGVGFVCSVCLSIFCKFSPICTTCQTVFRVPAP 282
Query: 192 --GQAQTQSDEPSATNRKRK 209
G+ + + S++ KRK
Sbjct: 283 LLGKKKKSIAQGSSSLLKRK 302
>gi|346472167|gb|AEO35928.1| hypothetical protein [Amblyomma maculatum]
Length = 293
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 80/174 (45%), Positives = 108/174 (62%), Gaps = 3/174 (1%)
Query: 20 EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVA 77
E + E E ++ +LL+G LS+ALC+I RV + +G RIL L S + QY+
Sbjct: 100 ELVLSDENESLVSETLLTGGLSLALCHINRVEKETAGQKKIASRILVLSASGESASQYLN 159
Query: 78 IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
MN F+AQ+ V ID+C L ++S LQQ ITGG + K GL QYLL +F D
Sbjct: 160 FMNVFFTAQKKNVIIDACVL-EKDSGLLQQGCDITGGKYMKVPNHAGLLQYLLWLFLPDK 218
Query: 138 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
SR++L P PV VD+RA+CFCH+N I++GY+CSVCLSI+C CSTC + F
Sbjct: 219 ASRDYLVFPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCAFSPICSTCQTAF 272
>gi|119618834|gb|EAW98428.1| general transcription factor IIH, polypeptide 3, 34kDa, isoform
CRA_c [Homo sapiens]
Length = 169
Score = 133 bits (335), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 65/138 (47%), Positives = 92/138 (66%), Gaps = 1/138 (0%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RIL ++ + D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K
Sbjct: 14 RILVIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVP 72
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
Q+ L QYLL +F D R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C
Sbjct: 73 QMPSLLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNF 132
Query: 181 LKKCSTCGSVFGQAQTQS 198
C+TC + + +T+S
Sbjct: 133 SPICTTCDAESQEKETES 150
>gi|195995769|ref|XP_002107753.1| hypothetical protein TRIADDRAFT_51576 [Trichoplax adhaerens]
gi|190588529|gb|EDV28551.1| hypothetical protein TRIADDRAFT_51576 [Trichoplax adhaerens]
Length = 276
Score = 133 bits (335), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 68/133 (51%), Positives = 89/133 (66%), Gaps = 1/133 (0%)
Query: 59 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
+ RIL ++ +PD QY+ IMN IF+AQ+ VPID+C + ++S FLQQA ITGG++ K
Sbjct: 134 KSRILVIKCTPDTSFQYLPIMNCIFTAQKYNVPIDACVI-EKDSGFLQQACDITGGIYLK 192
Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
Q GL QYLLT F + R L LP VD+RA+CFCH+ I++GYICSVCLSI C
Sbjct: 193 LQSPSGLLQYLLTAFLPNQSMRETLILPPKAKVDYRAACFCHRELIEIGYICSVCLSISC 252
Query: 179 KHLKKCSTCGSVF 191
K + C TC + F
Sbjct: 253 KFVPVCPTCQTHF 265
>gi|189189624|ref|XP_001931151.1| RNA polymerase II transcription factor B subunit 4 [Pyrenophora
tritici-repentis Pt-1C-BFP]
gi|187972757|gb|EDU40256.1| RNA polymerase II transcription factor B subunit 4 [Pyrenophora
tritici-repentis Pt-1C-BFP]
Length = 370
Score = 133 bits (335), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 67/149 (44%), Positives = 92/149 (61%), Gaps = 4/149 (2%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RIL + S D QY+ +MN+IF+AQR +PID L A ++ LQQAS TGGV+ KP
Sbjct: 194 RILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL-AGDTVLLQQASDATGGVYMKPD 252
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK- 179
+ +GL QYL+ F D+ +R L +P GVDFRA+CFCH+ +D+G++CSVCLSI+C
Sbjct: 253 RPEGLLQYLMMAFLPDVTARTSLVVPSAGGVDFRAACFCHRKVVDIGFVCSVCLSIFCNP 312
Query: 180 --HLKKCSTCGSVFGQAQTQSDEPSATNR 206
C TCGS ++ P+ R
Sbjct: 313 DLSDNLCMTCGSYLSLRSAVTNPPALIPR 341
>gi|449527735|ref|XP_004170865.1| PREDICTED: general transcription factor IIH subunit 3-like [Cucumis
sativus]
Length = 187
Score = 133 bits (334), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 72/90 (80%), Positives = 79/90 (87%)
Query: 1 MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP 60
MP+LC LL+NLEEF+ DEQ K++P+G SLLSGSLSMALCYIQ+VFRSG LHPQP
Sbjct: 86 MPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSLLSGSLSMALCYIQKVFRSGSLHPQP 145
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMV 90
RILCLQGSPDGPEQYVAIMNAIFSAQRSMV
Sbjct: 146 RILCLQGSPDGPEQYVAIMNAIFSAQRSMV 175
>gi|396462438|ref|XP_003835830.1| similar to RNA polymerase II transcription factor B subunit 4
[Leptosphaeria maculans JN3]
gi|312212382|emb|CBX92465.1| similar to RNA polymerase II transcription factor B subunit 4
[Leptosphaeria maculans JN3]
Length = 375
Score = 133 bits (334), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 67/149 (44%), Positives = 93/149 (62%), Gaps = 4/149 (2%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RIL + S D QY+ +MN+IF+AQR +PID L A ++ LQQAS TGGV+ KP+
Sbjct: 199 RILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL-AGDTVLLQQASDATGGVYMKPE 257
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK- 179
+ +GL QYL+ F D+ +R L +P GVDFRA+CFCH+ +D+G++CSVCLSI+C
Sbjct: 258 RPEGLLQYLMMAFLPDVTARASLVVPNAGGVDFRAACFCHRRVVDIGFVCSVCLSIFCSP 317
Query: 180 --HLKKCSTCGSVFGQAQTQSDEPSATNR 206
C TCGS ++ P+ R
Sbjct: 318 DLPDNLCLTCGSYLSLRSAMTNTPALIPR 346
>gi|296421687|ref|XP_002840396.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295636611|emb|CAZ84587.1| unnamed protein product [Tuber melanosporum]
Length = 357
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 70/173 (40%), Positives = 104/173 (60%), Gaps = 18/173 (10%)
Query: 34 SLLSGSLSMALCYIQRVFR---SG------------LLHPQPRILCLQGSPDGPEQYVAI 78
++++G+L +AL Y+ R+ R SG + RIL + S D EQYV +
Sbjct: 136 TMMAGALGLALGYVNRLVRVDDSGGVVGGKDGAVGEVTSMNARILVVSVSGDLAEQYVPV 195
Query: 79 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
MN+IF+AQR +PID C + + + FLQQAS T G++ + + + L QYL+ F D
Sbjct: 196 MNSIFAAQRKKIPIDICKVAGE-TVFLQQASDTTRGIYMQLEHPESLLQYLMMCFIPDTA 254
Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGS 189
+R L P V VDFRA+CFCHKN +D+G++CS+CLSI+C + C+TC +
Sbjct: 255 TRRHLVPPTQVNVDFRAACFCHKNVVDIGFVCSICLSIFCTPPEGAICTTCNT 307
>gi|402224045|gb|EJU04108.1| transcription factor Tfb4 [Dacryopinax sp. DJM-731 SS1]
Length = 305
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 77/191 (40%), Positives = 103/191 (53%), Gaps = 12/191 (6%)
Query: 13 EEFMNKDEQLGK--QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP---------QPR 61
E FM EQ + QE E + +L+ G+L+ +LCYI R+ R
Sbjct: 90 EGFMRSVEQEVEDLQEVEEDVPPALV-GALTKSLCYINRLSNPPPPSNPSEAPPPPPHSR 148
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
IL SPD Y+ MN IFSAQ+ V ID C L FL QA ++TGG Q
Sbjct: 149 ILLFSVSPDASVAYIPFMNCIFSAQKLKVAIDVCKLDESEVIFLHQACHLTGGAFVPITQ 208
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
+ L QYL F ++ +R L +P VDFRA+CFCHK +D+GY+CSVCLSI+C+ +
Sbjct: 209 REALLQYLSMCFLSETETRKTLAVPTLDKVDFRAACFCHKEIVDIGYVCSVCLSIFCQPV 268
Query: 182 KKCSTCGSVFG 192
+CSTC + F
Sbjct: 269 LECSTCRTKFS 279
>gi|388582433|gb|EIM22738.1| transcription factor Tfb4 [Wallemia sebi CBS 633.66]
Length = 272
Score = 132 bits (333), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 75/183 (40%), Positives = 106/183 (57%), Gaps = 18/183 (9%)
Query: 32 ACSLLSGSLSMALCYIQRVF----RSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 87
A S L+ +L+ +LC++ R +S RIL + S D P +YV++MN+IF+AQ+
Sbjct: 85 ASSTLASTLAQSLCHVNRKIVQESKSTF-----RILVVAASDDVPPEYVSLMNSIFAAQK 139
Query: 88 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 147
S + ID + N+ FLQQA+++TGG + + L QYL+ F R L P+
Sbjct: 140 SKIVIDVVQIYNCNTIFLQQAAHLTGGNFIQATDPESLVQYLIMAFLPSQPLRARLVQPR 199
Query: 148 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF---------GQAQTQS 198
VDFRA+CFCHK +D+ YICSVCLSI+C+ ++CSTC SVF G QT S
Sbjct: 200 SDKVDFRAACFCHKRIVDIAYICSVCLSIFCQPPQRCSTCNSVFPAKTLQKFTGSLQTHS 259
Query: 199 DEP 201
P
Sbjct: 260 HAP 262
>gi|452820306|gb|EME27350.1| transcription initiation factor TFIIH subunit H3 isoform 2
[Galdieria sulphuraria]
gi|452820307|gb|EME27351.1| transcription initiation factor TFIIH subunit H3 isoform 1
[Galdieria sulphuraria]
Length = 328
Score = 132 bits (333), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 69/162 (42%), Positives = 99/162 (61%), Gaps = 4/162 (2%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
+LLS SLS ALCY+ R S + RILCL S D P+QY+ +MN+IF AQR + ID
Sbjct: 165 TLLSASLSFALCYLHRRILSE--RYEARILCLHASSDVPKQYIGVMNSIFCAQRWSILID 222
Query: 94 SCYLGAQNSAFLQQASYITGGVHHKPQQLDG--LFQYLLTIFGTDLHSRNFLQLPKPVGV 151
C L + S+ LQQA+++TGG++ KP+Q + L YL +++ +R +L++P V
Sbjct: 223 VCNLHEEESSLLQQAAHLTGGIYFKPKQSEAIPLSNYLCSLYFPSRDARKYLKIPVLNSV 282
Query: 152 DFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQ 193
D A CF I MGYICSVCL+ + + C+ CG+ F +
Sbjct: 283 DCSALCFESGEKIHMGYICSVCLATFGRKRLSCNICGAEFAE 324
>gi|428673526|ref|NP_001258797.1| general transcription factor IIH subunit 3 isoform d [Homo sapiens]
Length = 162
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 64/131 (48%), Positives = 88/131 (67%), Gaps = 1/131 (0%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RIL ++ + D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K
Sbjct: 14 RILVIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVP 72
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
Q+ L QYLL +F D R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C
Sbjct: 73 QMPSLLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNF 132
Query: 181 LKKCSTCGSVF 191
C+TC + F
Sbjct: 133 SPICTTCETAF 143
>gi|441630273|ref|XP_004089524.1| PREDICTED: general transcription factor IIH subunit 3 isoform 4
[Nomascus leucogenys]
Length = 162
Score = 132 bits (333), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 64/131 (48%), Positives = 88/131 (67%), Gaps = 1/131 (0%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RIL ++ + D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K
Sbjct: 14 RILVIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVP 72
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
Q+ L QYLL +F D R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C
Sbjct: 73 QMPSLLQYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNF 132
Query: 181 LKKCSTCGSVF 191
C+TC + F
Sbjct: 133 SPICTTCETAF 143
>gi|353240140|emb|CCA72023.1| related to TFIIH basal transcription factor complex p34 subunit
[Piriformospora indica DSM 11827]
Length = 355
Score = 132 bits (332), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 72/195 (36%), Positives = 112/195 (57%), Gaps = 13/195 (6%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYI-QRVFRSGLLH-------- 57
T+ + + +N++ + ++ E IA L G+L+ +LC I +R S L+
Sbjct: 91 TMNSVIAQRLNEELESLPEDAEAPIA---LVGALTKSLCTINRRSHPSDTLNATSPNPSK 147
Query: 58 -PQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
P PR+L + SPD Y+ IMN+IF AQ+ + ID C + ++ FLQQA+++TGG +
Sbjct: 148 LPTPRLLVISVSPDLSASYIPIMNSIFCAQKLKLSIDVCKIFGPDNVFLQQAAHLTGGSY 207
Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 176
+ D + QYL+T F + R + +P +D RA+CFCHK +D+G+ICS+CLSI
Sbjct: 208 ISIDRRDSILQYLMTTFLSPPAVRRLMAVPTEDRIDLRAACFCHKQIVDIGFICSICLSI 267
Query: 177 YCKHLKKCSTCGSVF 191
+CK CSTC F
Sbjct: 268 FCKPTPVCSTCRVKF 282
>gi|330933521|ref|XP_003304199.1| hypothetical protein PTT_16692 [Pyrenophora teres f. teres 0-1]
gi|311319344|gb|EFQ87711.1| hypothetical protein PTT_16692 [Pyrenophora teres f. teres 0-1]
Length = 376
Score = 132 bits (332), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 67/149 (44%), Positives = 91/149 (61%), Gaps = 4/149 (2%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RIL + S D QY+ +MN+IF+AQR +PID L A ++ LQQAS TGGV+ KP
Sbjct: 200 RILIVSVSGDLANQYIPVMNSIFAAQRKRIPIDILKL-AGDTVLLQQASDATGGVYMKPD 258
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK- 179
+ +GL QYL+ F D +R L +P GVDFRA+CFCH+ +D+G++CSVCLSI+C
Sbjct: 259 RPEGLLQYLMMAFLPDATARTSLVVPSAGGVDFRAACFCHRKVVDIGFVCSVCLSIFCNP 318
Query: 180 --HLKKCSTCGSVFGQAQTQSDEPSATNR 206
C TCGS ++ P+ R
Sbjct: 319 DLSDNLCMTCGSYLSLRSAVTNPPALIPR 347
>gi|327276109|ref|XP_003222813.1| PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH
subunit 3-like [Anolis carolinensis]
Length = 360
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 76/193 (39%), Positives = 113/193 (58%), Gaps = 9/193 (4%)
Query: 1 MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR--SGLLHP 58
+ ++ T+ + +++ M K + Q+ E + L L+ AL YI RV + S
Sbjct: 156 LTTVNETIAEEIQDLMTK-SGIKWQQTE-----TCLYXYLTKALWYIHRVSKTESASQEI 209
Query: 59 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
+ RIL ++ + D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K
Sbjct: 210 KSRILVIKAAEDSALQYMNFMNVIFAAQKQNILIDACVLES-DSGLLQQACDITGGIYLK 268
Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
L QYLL +F D R+ L LP V VD+RA+CFCH+N I++GY+CSVCLSI+C
Sbjct: 269 VPHKPSLLQYLLWVFLPDHDQRSQLILPPRVHVDYRAACFCHRNLIEVGYVCSVCLSIFC 328
Query: 179 KHLKKCSTCGSVF 191
C+TC + F
Sbjct: 329 NFSPICTTCETAF 341
>gi|407926572|gb|EKG19539.1| Transcription factor Tfb4 [Macrophomina phaseolina MS6]
Length = 380
Score = 132 bits (331), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 80/206 (38%), Positives = 109/206 (52%), Gaps = 34/206 (16%)
Query: 8 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRV----------------- 50
LL NL M + ++ + +L++G+L+ AL YI +
Sbjct: 131 LLANLRVLMARTSEMDLKAS----TTTLVAGALTTALAYIAKANIKSAPQVGDTQMADLA 186
Query: 51 -------FRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSA 103
RS L RIL L S D QY+ +MNAIF+AQR +PID L A ++
Sbjct: 187 TAADPPAARSSLTS---RILVLSVSGDLASQYIPVMNAIFAAQRQRIPIDILKL-AGDTV 242
Query: 104 FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 163
FLQQA TGG++ KP +GL QYL+ F D +R L LP VDFRA+CFCH+
Sbjct: 243 FLQQACDATGGIYLKPLAPEGLLQYLMMAFLPDETARKHLVLPSAGEVDFRAACFCHRRV 302
Query: 164 IDMGYICSVCLSIYCKHL--KKCSTC 187
+D+GY+CSVCLSI+C+ + C TC
Sbjct: 303 VDIGYVCSVCLSIFCEPPADQTCPTC 328
>gi|189241296|ref|XP_975136.2| PREDICTED: similar to transcription factor TFIIH-subunit, putative
[Tribolium castaneum]
gi|270013170|gb|EFA09618.1| hypothetical protein TcasGA2_TC011739 [Tribolium castaneum]
Length = 232
Score = 131 bits (330), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 89/207 (42%), Positives = 124/207 (59%), Gaps = 18/207 (8%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSG----LLHPQPRI 62
T+ QNL + + ++ E SLL+G+++MALCYI R+ R+ +L+ RI
Sbjct: 37 TIKQNLAKLLATEKSTLVTE-------SLLAGAIAMALCYIARIQRTKPPGCVLNS--RI 87
Query: 63 LCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL 122
L + GS D QY+ MN F+AQ+ + ID C L Q+ + LQQ ITGG++ K QL
Sbjct: 88 LVVTGSGDSASQYMNYMNVFFTAQKQGIVIDVCALD-QHLSLLQQGCDITGGLYLKVPQL 146
Query: 123 DGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLK 182
GL QYLL +F + R L LP PV VD+RA+CFCH+ ID+GY+CSVCLSI+CK
Sbjct: 147 QGLLQYLLWVFLPEPPIRQKLVLPPPVKVDYRAACFCHRELIDIGYVCSVCLSIFCKFSP 206
Query: 183 KCSTCGSVFGQAQTQSDEPSATNRKRK 209
C+TC +VF + +P +KRK
Sbjct: 207 ICTTCHTVFKIPGPLAVKP----KKRK 229
>gi|19112324|ref|NP_595532.1| transcription factor TFIIH complex subunit Tfb4
[Schizosaccharomyces pombe 972h-]
gi|62901129|sp|O74366.1|TFB4_SCHPO RecName: Full=RNA polymerase II transcription factor B subunit 4;
AltName: Full=RNA polymerase II transcription factor B
34 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p34 subunit
gi|3417433|emb|CAA20320.1| transcription factor TFIIH complex subunit Tfb4
[Schizosaccharomyces pombe]
Length = 297
Score = 129 bits (324), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 69/184 (37%), Positives = 105/184 (57%), Gaps = 11/184 (5%)
Query: 8 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 67
+L ++ M+ +++ ++ +++SG+LS AL YI +V L + RIL
Sbjct: 114 VLSGMKRLMSSTDKVSRK--------TMISGALSRALAYINQVQNKNTL--RSRILIFSL 163
Query: 68 SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
+ D QY+ MN IF AQ+ +PI+ C + + FL+QA+ TGG++ K GL Q
Sbjct: 164 TGDVALQYIPTMNCIFCAQKKNIPINVCNIEG-GTLFLEQAADATGGIYLKVDNPKGLLQ 222
Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
YL+ D + R L P VDFRA+CFCHK +D+G++CSVCLSI+C+ CSTC
Sbjct: 223 YLMMSLFPDQNLRKHLNTPNQANVDFRATCFCHKKVLDIGFVCSVCLSIFCEPRVHCSTC 282
Query: 188 GSVF 191
+ F
Sbjct: 283 HTKF 286
>gi|2257568|dbj|BAA21460.1| basic transcription factor 2.35KD subunit [Schizosaccharomyces
pombe]
Length = 298
Score = 129 bits (323), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 69/184 (37%), Positives = 105/184 (57%), Gaps = 11/184 (5%)
Query: 8 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 67
+L ++ M+ +++ ++ +++SG+LS AL YI +V L + RIL
Sbjct: 114 VLSGMKRLMSSTDKVSRK--------TMISGALSRALAYINQVQNKNTL--RSRILIFSL 163
Query: 68 SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQ 127
+ D QY+ MN IF AQ+ +PI+ C + + FL+QA+ TGG++ K GL Q
Sbjct: 164 TGDVALQYIPTMNCIFCAQKKNIPINVCNIEG-GTLFLEQAADATGGIYLKVDNPKGLLQ 222
Query: 128 YLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
YL+ D + R L P VDFRA+CFCHK +D+G++CSVCLSI+C+ CSTC
Sbjct: 223 YLMMSLFPDQNLRKHLNTPNQANVDFRATCFCHKKVLDIGFVCSVCLSIFCEPRVHCSTC 282
Query: 188 GSVF 191
+ F
Sbjct: 283 HTKF 286
>gi|336371329|gb|EGN99668.1| hypothetical protein SERLA73DRAFT_53296 [Serpula lacrymans var.
lacrymans S7.3]
Length = 297
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 78/202 (38%), Positives = 109/202 (53%), Gaps = 29/202 (14%)
Query: 12 LEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALC--YIQRVFRSG------LLHPQPRIL 63
+ MN+ + LG + E A L G+L+ ALC I F S + P PRIL
Sbjct: 98 VRSIMNEFDALGNLDEEPPCA---LVGALTKALCCKSISAHFHSPNAVEDPAVLPDPRIL 154
Query: 64 CLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLD 123
L SPD Y+ IMN+IFSAQ+ V ID C + ++ FLQQA+++TGG + ++ D
Sbjct: 155 ILSVSPDLSTSYIPIMNSIFSAQKLKVTIDVCKVYGPDAVFLQQAAHLTGGSYIFLERRD 214
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS-------- 175
L QYL F + R L +PK +DFRA+CFCHKN +D+G++CSVCLS
Sbjct: 215 ALLQYLTMSFLSPPALRQVLAVPKQDKIDFRAACFCHKNIVDVGFVCSVCLSSNHILFYV 274
Query: 176 ----------IYCKHLKKCSTC 187
++C+ + CSTC
Sbjct: 275 VKLWFTYMCTVFCQPVPVCSTC 296
>gi|430811432|emb|CCJ31073.1| unnamed protein product [Pneumocystis jirovecii]
gi|430812043|emb|CCJ30521.1| unnamed protein product [Pneumocystis jirovecii]
Length = 284
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 70/166 (42%), Positives = 98/166 (59%), Gaps = 7/166 (4%)
Query: 11 NLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQGSP 69
NL++ ++K + + P +++ GSL MAL Y+ R +S + RIL + S
Sbjct: 124 NLKKLIDKANEQDEDTPS-----TMIGGSLCMALAYVNRAMQSASKNRLNARILIISVSS 178
Query: 70 DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYL 129
D QY+ IMN IF AQ+ +PID C +G ++ FLQQAS T G++ + L QYL
Sbjct: 179 DITFQYIPIMNCIFGAQKKKIPIDVCKIGG-DTVFLQQASDATKGIYLHLDKPKSLLQYL 237
Query: 130 LTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS 175
L IF D R L LP + VDFRA+CFCHK +D+G++CSVCLS
Sbjct: 238 LMIFLPDQKVRQHLVLPFQLNVDFRAACFCHKKILDIGFVCSVCLS 283
>gi|225712364|gb|ACO12028.1| General transcription factor IIH subunit 3 [Lepeophtheirus
salmonis]
gi|290562788|gb|ADD38789.1| General transcription factor IIH subunit 3 [Lepeophtheirus
salmonis]
Length = 305
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/158 (44%), Positives = 94/158 (59%), Gaps = 2/158 (1%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
S LSGSL+MAL YI R R + RIL + S D QY+ MN F+AQ+ V +D
Sbjct: 124 SPLSGSLAMALSYINRR-RKENMDLSARILVITASGDTASQYMNYMNVFFTAQKLDVLLD 182
Query: 94 SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 153
+C L +S LQQ + ITGG++ L Q+LL IF R L LP VDF
Sbjct: 183 TCMLQI-DSPLLQQGADITGGMYFNVPDNAALLQFLLWIFLPSAEMRPQLGLPSANKVDF 241
Query: 154 RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
RA+CFCH+ +D+G++CSVCLSI+CK C+TC ++F
Sbjct: 242 RAACFCHRQLVDIGFVCSVCLSIFCKFSPICTTCQTIF 279
>gi|428174859|gb|EKX43752.1| hypothetical protein GUITHDRAFT_87651 [Guillardia theta CCMP2712]
Length = 343
Score = 127 bits (318), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 113/206 (54%), Gaps = 24/206 (11%)
Query: 24 KQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIF 83
++ + + CSLLS + ++A C + R+ LL PR+L L S + P QY+ IMN IF
Sbjct: 132 QENSDSSMKCSLLSSAFALAACTVNRISSHKLLRMLPRVLVLTASTELPSQYIPIMNCIF 191
Query: 84 SAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ------QLDGLFQYLLTIFGTDL 137
S Q++ V ID+ + + +FLQQA+ IT GV+ +P+ L +++++F D
Sbjct: 192 SMQKANVLIDALVVSDGDCSFLQQAADITNGVYLRPELSKVGSSPGALLSWMISVFLGDK 251
Query: 138 HSR----------NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK-HLKKCST 186
H+R L+LP+ VD+R+SCF +D+G++CSVCLSI+ K C T
Sbjct: 252 HTRFLNADSQSRSTLLRLPQAQDVDYRSSCFKTGQLVDVGFVCSVCLSIFAKCDFFICPT 311
Query: 187 CGSVFGQAQTQSDEP--SATNRKRKT 210
C S + D P +A N+KRK
Sbjct: 312 CDS-----KIALDRPAGAAVNKKRKV 332
>gi|429851889|gb|ELA27048.1| transcription factor tfiih subunit [Colletotrichum gloeosporioides
Nara gc5]
Length = 362
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 71/189 (37%), Positives = 99/189 (52%), Gaps = 21/189 (11%)
Query: 21 QLGKQEPEGRIACSL--LSGSLSMALCYIQ----------------RVFRSGLLHPQPRI 62
++ Q E ++C+ LSG+L++AL YI R S L RI
Sbjct: 131 EVINQTTEADLSCTTTQLSGALTLALSYINKASMALNETNKTPEATRATSSALTRLHARI 190
Query: 63 LCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL 122
L S P QY+ MNA+F+A S +PID+ L ++ FLQQASYIT G +
Sbjct: 191 FILSVSDSEPVQYIPTMNAVFAAAHSQIPIDTLALSG-DATFLQQASYITDGTFMQAASP 249
Query: 123 DGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--H 180
GL YL+ + D +R+ L P VDFRA+CFCH +D G++CS+CLSI+C
Sbjct: 250 RGLLSYLMFAYSADAEARSSLIPPTHHTVDFRAACFCHGRVVDTGFVCSICLSIFCDVPE 309
Query: 181 LKKCSTCGS 189
+C TCG+
Sbjct: 310 NSECLTCGT 318
>gi|302830812|ref|XP_002946972.1| hypothetical protein VOLCADRAFT_79384 [Volvox carteri f.
nagariensis]
gi|300268016|gb|EFJ52198.1| hypothetical protein VOLCADRAFT_79384 [Volvox carteri f.
nagariensis]
Length = 141
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 61/118 (51%), Positives = 84/118 (71%)
Query: 74 QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIF 133
QY+ +MNAIFSAQR+ V +D+ L ++S+FLQQA+++TGG++ KP L LL F
Sbjct: 2 QYIPVMNAIFSAQRAEVLLDAVVLATEDSSFLQQAAHLTGGLYFKPAGAGALLGLLLNYF 61
Query: 134 GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
D +R L + + +GVDFRASCFCHK I+ GY+CSVCLSI+C+ + CSTCG+ F
Sbjct: 62 VCDTSTRKQLDVAQELGVDFRASCFCHKYVIETGYVCSVCLSIFCQPSRACSTCGTAF 119
>gi|169775241|ref|XP_001822088.1| transcription factor TFIIH complex subunit [Aspergillus oryzae
RIB40]
gi|83769951|dbj|BAE60086.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|391872974|gb|EIT82049.1| RNA polymerase II transcription initiation protein [Aspergillus
oryzae 3.042]
Length = 376
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 61/130 (46%), Positives = 84/130 (64%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + G+ D QY+ IMN+IF+ QR +PID C L ++ FLQQAS T G++ +
Sbjct: 207 IVSVSGASDSAHQYIPIMNSIFACQRLHIPIDVCKLSG-DAVFLQQASDATKGIYMSLTE 265
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
GL QYL+ F D SR L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+ L
Sbjct: 266 PRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDIGFVCSICLSIFCEPL 325
Query: 182 KK--CSTCGS 189
C TCG+
Sbjct: 326 DNGTCLTCGT 335
>gi|303274264|ref|XP_003056454.1| RNA polymerase II transcription factor B subunit 4 [Micromonas
pusilla CCMP1545]
gi|226462538|gb|EEH59830.1| RNA polymerase II transcription factor B subunit 4 [Micromonas
pusilla CCMP1545]
Length = 296
Score = 126 bits (316), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 76/187 (40%), Positives = 105/187 (56%), Gaps = 10/187 (5%)
Query: 4 LCATLLQNLEEF---MNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP 60
LCA ++ NL M D L + L+ +SMALC + R R H P
Sbjct: 112 LCAQVISNLATLAMRMRSDFDLSTKSNMP------LTTVISMALCCVNRA-RVYKPHCSP 164
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RILC+ GS D QY + MNAIFSA+ S V ID+C LG +S FLQQA+ ITGG + Q
Sbjct: 165 RILCILGSQDSSGQYFSGMNAIFSAKNSGVRIDACLLGYTDSFFLQQAAQITGGSYCMLQ 224
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
Q + YLL+ R LQ P+ +++A+CFCH+ + +G++CSVCL+I+C+
Sbjct: 225 QGEKALYYLLSDLPGSESIRAKLQSPRKFCSEYQATCFCHEVNLQVGFVCSVCLTIFCER 284
Query: 181 LKKCSTC 187
+KC C
Sbjct: 285 CRKCLAC 291
>gi|238496129|ref|XP_002379300.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
flavus NRRL3357]
gi|220694180|gb|EED50524.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
flavus NRRL3357]
Length = 367
Score = 126 bits (316), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 61/130 (46%), Positives = 84/130 (64%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + G+ D QY+ IMN+IF+ QR +PID C L ++ FLQQAS T G++ +
Sbjct: 198 IVSVSGASDSAHQYIPIMNSIFACQRLHIPIDVCKLSG-DAVFLQQASDATKGIYMSLTE 256
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
GL QYL+ F D SR L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+ L
Sbjct: 257 PRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDIGFVCSICLSIFCEPL 316
Query: 182 KK--CSTCGS 189
C TCG+
Sbjct: 317 DNGTCLTCGT 326
>gi|121713686|ref|XP_001274454.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
clavatus NRRL 1]
gi|119402607|gb|EAW13028.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
clavatus NRRL 1]
Length = 386
Score = 125 bits (315), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 63/130 (48%), Positives = 83/130 (63%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + S D QY+ IMNAIF+ QR VPID C L ++ FLQQAS T GV+ +
Sbjct: 216 IISVSSSTDSAHQYIPIMNAIFACQRLHVPIDVCKLSG-DAVFLQQASDATKGVYMSLAE 274
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
GL QYL+ F D SR L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 275 PRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDIGFVCSICLSIFCEPP 334
Query: 182 KK--CSTCGS 189
+ C TCG+
Sbjct: 335 ENGGCLTCGT 344
>gi|443727474|gb|ELU14215.1| hypothetical protein CAPTEDRAFT_169977 [Capitella teleta]
Length = 299
Score = 125 bits (315), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 68/151 (45%), Positives = 94/151 (62%), Gaps = 6/151 (3%)
Query: 45 CYIQRVFRSGLLHP---QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQN 101
CY+ ++ GL + R+L +QGS D P QY+ +MN IF+AQ+ V ID+C + ++
Sbjct: 136 CYVNKI-NKGLPDGDAMRSRVLVVQGSEDNPLQYMDLMNIIFTAQKQNVIIDACIMD-ED 193
Query: 102 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH-SRNFLQLPKPVGVDFRASCFCH 160
S LQQ ITGG + K Q L QYLL +F + +R L LP V VD+RA+CFCH
Sbjct: 194 SPILQQVCDITGGFYLKTPQPSALLQYLLWVFLLEPGGAREKLNLPAKVHVDYRAACFCH 253
Query: 161 KNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+N I+ GY+CSVCLS++C CSTC + F
Sbjct: 254 RNLINTGYVCSVCLSVFCSFSPICSTCQTTF 284
>gi|255953875|ref|XP_002567690.1| Pc21g06450 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211589401|emb|CAP95542.1| Pc21g06450 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 374
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 65/135 (48%), Positives = 85/135 (62%), Gaps = 5/135 (3%)
Query: 59 QPRILCLQ--GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
Q RIL + GS D QY+ +MN+IF+ QR +PID C L ++ FLQQAS T GV+
Sbjct: 201 QSRILVVSVSGSTDAAHQYIPVMNSIFACQRLSIPIDVCKLSG-DAVFLQQASDATKGVY 259
Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 176
+ GL QYL+ F D SR L LP V VDFRA+CFCH+ +D+G++CS+CLSI
Sbjct: 260 MALAEPRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDVGFVCSICLSI 319
Query: 177 YCKHLK--KCSTCGS 189
+C+ C TCGS
Sbjct: 320 FCEPPPGGDCMTCGS 334
>gi|431912136|gb|ELK14274.1| General transcription factor IIH subunit 3 [Pteropus alecto]
Length = 292
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 60/123 (48%), Positives = 84/123 (68%), Gaps = 1/123 (0%)
Query: 65 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
++ + D QY+ MN IF+AQ+ + ID+C LG+ +S LQQA ITGG++ K Q+
Sbjct: 150 IKAAEDSALQYMNFMNVIFAAQKQNILIDACVLGS-DSGLLQQACDITGGLYLKVPQMPS 208
Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKC 184
L QYLL +F D R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C C
Sbjct: 209 LLQYLLWVFLPDQDQRSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPIC 268
Query: 185 STC 187
+TC
Sbjct: 269 TTC 271
>gi|310794652|gb|EFQ30113.1| transcription factor tfb4 [Glomerella graminicola M1.001]
Length = 365
Score = 125 bits (313), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 102/194 (52%), Gaps = 21/194 (10%)
Query: 16 MNKDEQLGKQEPEGRIACSL--LSGSLSMALCYIQ----------------RVFRSGLLH 57
M+ +L E +AC+ LSG+L++AL +I R S L
Sbjct: 128 MSSMRKLVDATTEADLACTTTQLSGALTLALAHINKTSLSLNETNKAPDVARSTSSALSR 187
Query: 58 PQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHH 117
+ RI L S P QY++ MNA+F+A S +PID+ L ++ FLQQASYIT G
Sbjct: 188 LRARIFVLSVSDSEPVQYISTMNAVFAAAHSQIPIDTLALSG-DATFLQQASYITDGTFM 246
Query: 118 KPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
+ GL YL+ + D +R+ L P VDFRA+CFCH +D G++CS+CLSI+
Sbjct: 247 QAASPRGLLSYLMFAYSADAEARSSLIPPTHHTVDFRAACFCHGRVVDTGFVCSICLSIF 306
Query: 178 CK--HLKKCSTCGS 189
C +C TCG+
Sbjct: 307 CDVPDGSECLTCGT 320
>gi|425772016|gb|EKV10443.1| Transcription factor TFIIH subunit Tfb4, putative [Penicillium
digitatum Pd1]
gi|425777277|gb|EKV15458.1| Transcription factor TFIIH subunit Tfb4, putative [Penicillium
digitatum PHI26]
Length = 387
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/135 (48%), Positives = 84/135 (62%), Gaps = 5/135 (3%)
Query: 59 QPRILCLQ--GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
Q RIL + GS D QY+ IMN+IF+ QR +PID C L ++ FLQQAS T GV+
Sbjct: 214 QSRILVISVSGSTDAAHQYIPIMNSIFACQRLNIPIDVCKLSG-DAVFLQQASDATKGVY 272
Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 176
+ GL QYL+ F D SR L LP V VDFRA+CFCH+ +D+G++CS+CLSI
Sbjct: 273 MALAEPRGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDVGFVCSICLSI 332
Query: 177 YCKHL--KKCSTCGS 189
+C C TCG+
Sbjct: 333 FCGPPPGSDCMTCGT 347
>gi|145254849|ref|XP_001398778.1| transcription factor TFIIH complex subunit [Aspergillus niger CBS
513.88]
gi|134084362|emb|CAK48702.1| unnamed protein product [Aspergillus niger]
Length = 379
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/130 (48%), Positives = 82/130 (63%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
IL + S QY+ IMNAIF+ QR +PID C + ++ FLQQAS T GV+ +
Sbjct: 209 ILSVSSSTGSAHQYIPIMNAIFACQRLHIPIDVCKVSG-DAVFLQQASDATKGVYMALDE 267
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
GL QYL+ F D SR L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+ L
Sbjct: 268 PKGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEPL 327
Query: 182 KK--CSTCGS 189
C TCGS
Sbjct: 328 GDGVCLTCGS 337
>gi|350630603|gb|EHA18975.1| hypothetical protein ASPNIDRAFT_42792 [Aspergillus niger ATCC 1015]
Length = 380
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/130 (48%), Positives = 82/130 (63%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
IL + S QY+ IMNAIF+ QR +PID C + ++ FLQQAS T GV+ +
Sbjct: 210 ILSVSSSTGSAHQYIPIMNAIFACQRLHIPIDVCKVSG-DAVFLQQASDATKGVYMALDE 268
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
GL QYL+ F D SR L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+ L
Sbjct: 269 PKGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEPL 328
Query: 182 KK--CSTCGS 189
C TCGS
Sbjct: 329 GDGVCLTCGS 338
>gi|70997489|ref|XP_753492.1| transcription factor TFIIH subunit Tfb4 [Aspergillus fumigatus
Af293]
gi|66851128|gb|EAL91454.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
fumigatus Af293]
gi|159126779|gb|EDP51895.1| transcription factor TFIIH subunit Tfb4, putative [Aspergillus
fumigatus A1163]
Length = 387
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 61/130 (46%), Positives = 83/130 (63%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + GS D QY+ IMN IF+ QR +PID C L ++ FLQQAS T GV+ +
Sbjct: 215 IISVSGSTDSAHQYIPIMNCIFACQRLHIPIDVCKLSG-DAVFLQQASDATKGVYMSLSE 273
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
GL QYL+ F D SR L +P V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 274 PRGLLQYLMMAFLPDQRSRRHLVIPTRVDVDFRAACFCHRRVVDIGFVCSICLSIFCEPP 333
Query: 182 KK--CSTCGS 189
+ C TCG+
Sbjct: 334 ENGDCLTCGT 343
>gi|119479015|ref|XP_001259536.1| transcription factor TFIIH subunit Tfb4, putative [Neosartorya
fischeri NRRL 181]
gi|119407690|gb|EAW17639.1| transcription factor TFIIH subunit Tfb4, putative [Neosartorya
fischeri NRRL 181]
Length = 385
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 61/130 (46%), Positives = 83/130 (63%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + GS D QY+ IMN IF+ QR +PID C L ++ FLQQAS T GV+ +
Sbjct: 215 IISVSGSTDSAHQYIPIMNCIFACQRLHIPIDVCKLSG-DAVFLQQASDATKGVYMSLSE 273
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
GL QYL+ F D SR L +P V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 274 PRGLLQYLMMAFLPDQRSRRHLVIPTRVDVDFRAACFCHRRVVDIGFVCSICLSIFCEPP 333
Query: 182 KK--CSTCGS 189
+ C TCG+
Sbjct: 334 ENGDCLTCGT 343
>gi|358366716|dbj|GAA83336.1| transcription factor TFIIH complex subunit [Aspergillus kawachii
IFO 4308]
Length = 386
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 62/130 (47%), Positives = 82/130 (63%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
IL + S QY+ IMNAIF+ QR +PID C + ++ FLQQAS T GV+ +
Sbjct: 216 ILSVSSSTGSAHQYIPIMNAIFACQRLHIPIDVCKVSG-DAVFLQQASDATKGVYMALDE 274
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
GL QYL+ F D SR L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+ L
Sbjct: 275 PKGLLQYLMMAFLPDQRSRRHLVLPTRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEPL 334
Query: 182 KK--CSTCGS 189
C TCG+
Sbjct: 335 GDGVCLTCGT 344
>gi|115387571|ref|XP_001211291.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114195375|gb|EAU37075.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 380
Score = 123 bits (308), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 61/130 (46%), Positives = 82/130 (63%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + GS D QY+ +MN IF+ QR +PID C L ++ FLQQAS T GV +
Sbjct: 210 IVSVSGSSDSAHQYIPMMNGIFACQRLHIPIDVCKLSG-DAVFLQQASDATKGVFMSLAE 268
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
GL QYL+ F D SR L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 269 PRGLLQYLMMAFLPDQRSRKHLVLPTRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEPP 328
Query: 182 KK--CSTCGS 189
+ C TCG+
Sbjct: 329 ENGDCLTCGT 338
>gi|320582712|gb|EFW96929.1| Subunit of TFIIH complex [Ogataea parapolymorpha DL-1]
Length = 341
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 72/174 (41%), Positives = 100/174 (57%), Gaps = 9/174 (5%)
Query: 23 GKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQGSPDGPEQYVAIMNA 81
GKQ +G +A G++SMALC+I ++ +S L+ + R+L + S D YV++MN
Sbjct: 140 GKQHIKGTLA-----GAMSMALCHINKIQQSDELNALKSRLLVVSISDDSTLPYVSVMNT 194
Query: 82 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 141
IF++Q+ + +D C LG +S FLQQA+ T GV+ Q +GL QYL T D R
Sbjct: 195 IFASQKMKISVDVCKLGP-SSTFLQQAADATNGVYIYITQPEGLIQYLTTALFIDPMLRP 253
Query: 142 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQ 193
+ LP +DFRASCF ID+GY+CSVCL I +KC TC S F
Sbjct: 254 IIVLPTDESIDFRASCFITNKVIDVGYVCSVCLCILSVIPEDEKCPTCHSKFDH 307
>gi|400593294|gb|EJP61268.1| transcription factor tfb4 [Beauveria bassiana ARSEF 2860]
Length = 373
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/204 (37%), Positives = 107/204 (52%), Gaps = 26/204 (12%)
Query: 6 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR------------- 52
AT+L +L++ M + E + + LSG+L++ALC+ + R
Sbjct: 134 ATVLASLKQLMAE-----TTERDLDSTTTQLSGALTLALCHNNKAGRALSASDTALLDAA 188
Query: 53 --SGLLHPQP---RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQ 107
SG P P RIL L S QY+ IMNA+F+A S V ID+ L A ++ FLQQ
Sbjct: 189 QASGATAPPPMKGRILVLSVSDSESAQYIPIMNAVFAAAHSHVAIDTLAL-AGDATFLQQ 247
Query: 108 ASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMG 167
A + TGGV + GL YL+ D +R L P VDFRA+CFCH N +D G
Sbjct: 248 ACFGTGGVFLRAANPQGLLAYLMFGMIADSEAREALVAPAHDTVDFRAACFCHGNVVDTG 307
Query: 168 YICSVCLSIYCK--HLKKCSTCGS 189
++CS+CLSI+C +C TCG+
Sbjct: 308 FVCSICLSIFCDLPENAECLTCGT 331
>gi|242764012|ref|XP_002340688.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
stipitatus ATCC 10500]
gi|218723884|gb|EED23301.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
stipitatus ATCC 10500]
Length = 380
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/135 (46%), Positives = 85/135 (62%), Gaps = 5/135 (3%)
Query: 59 QPRILCLQ--GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
Q RIL + S D QY+ IMN+IF+ QR +PID C L ++ FLQQAS T GV+
Sbjct: 206 QSRILIVSVSSSTDSAHQYIPIMNSIFACQRLHIPIDVCKLSG-DAVFLQQASDATRGVY 264
Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 176
+ GL QYL+ F D SR L LP V VDFRA+CFCH+ +++G++CS+CLSI
Sbjct: 265 MSLTEPRGLLQYLMMAFLPDQRSRKHLVLPSRVDVDFRAACFCHRRVVNIGFVCSICLSI 324
Query: 177 YCKHLKK--CSTCGS 189
+C+ + C TCG+
Sbjct: 325 FCEPPENGDCLTCGT 339
>gi|226468218|emb|CAX69786.1| putative general transcription factor IIH [Schistosoma japonicum]
Length = 317
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 91/153 (59%), Gaps = 22/153 (14%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL----------------GAQNSAF 104
RIL ++ S D QY+++MN++F+AQ+ VPID+C L +S+
Sbjct: 157 RILIIKASDDSTSQYLSLMNSVFTAQKLHVPIDTCVLPLPRQLDNLNGRRSISSPGHSSL 216
Query: 105 LQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKP------VGVDFRASCF 158
LQQA+ +TGG++ + ++ GL QYLL++F R L LP GVDFRA+CF
Sbjct: 217 LQQAADLTGGIYLQIPRVSGLLQYLLSVFLPSSKMRTSLILPDSRSSGLSFGVDFRAACF 276
Query: 159 CHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
CHK ID+GY+CS+CLS++C+ C TC + F
Sbjct: 277 CHKRLIDIGYVCSICLSVFCEFNPICPTCNTPF 309
>gi|254567625|ref|XP_002490923.1| Subunit of TFIIH complex, involved in transcription initiation,
similar to 34 kDa subunit of human T [Komagataella
pastoris GS115]
gi|238030720|emb|CAY68643.1| Subunit of TFIIH complex, involved in transcription initiation,
similar to 34 kDa subunit of human T [Komagataella
pastoris GS115]
Length = 328
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/179 (41%), Positives = 99/179 (55%), Gaps = 18/179 (10%)
Query: 25 QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ-------PRILCLQGSPDGPEQYVA 77
QE +G IA G++SMAL YI R+ + P+ ++L + S D +QYV
Sbjct: 132 QELKGSIA-----GAMSMALSYINRLTN---IEPESTGNTLRAKMLVISISDDSTQQYVP 183
Query: 78 IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
MN IFSAQ+ V ID C +G +S+FLQQAS +T GV+ + + GL QYL T D
Sbjct: 184 FMNCIFSAQKMKVSIDVCKMGP-DSSFLQQASDVTNGVYMLIKNVHGLIQYLTTALFIDP 242
Query: 138 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGSVFGQA 194
R + LP +DFRASCF ID+GY+CSVCL I+ + C TC S F A
Sbjct: 243 SLRPIMVLPTNSDLDFRASCFVTNKVIDIGYVCSVCLCIFSIIPRNNVCPTCQSKFDDA 301
>gi|326437202|gb|EGD82772.1| hypothetical protein PTSG_03422 [Salpingoeca sp. ATCC 50818]
Length = 311
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 88/249 (35%), Positives = 118/249 (47%), Gaps = 71/249 (28%)
Query: 6 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ------ 59
AT+L N + N EQ G++ P S L+G+LS ALC+I R R L PQ
Sbjct: 83 ATILANATKLFN--EQSGERHP------SDLAGALSKALCFINRTVRGTSLQPQHTTASS 134
Query: 60 -----------------------------------------PRILCLQGSPDGPEQYVAI 78
PRI + SPD +Y A
Sbjct: 135 SVTSRFIAGDKATPASSSSSSSSSAPKAAGRDRVSSGARVTPRIGVVSVSPDSSVEYFAA 194
Query: 79 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGG----VHHKPQQLDGLFQYLLTIFG 134
MN IF+AQ++ VPID C +G + +A LQQA+++T G VHH Q L L +T
Sbjct: 195 MNCIFAAQKNDVPIDVCAMG-EEAAILQQAAHLTNGTYVKVHHPSQLLQSLMMGFVT--S 251
Query: 135 TDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQA 194
+D H L P ++F ASCFCH T+D+GY+C+VCLS+YC+ L C CG+ F
Sbjct: 252 SDAH---VLVRPVQRRINFDASCFCHGRTVDIGYVCTVCLSVYCERLASCRICGAEF--- 305
Query: 195 QTQSDEPSA 203
SD+P A
Sbjct: 306 ---SDKPGA 311
>gi|346324613|gb|EGX94210.1| transcription factor TFIIH subunit Tfb4 [Cordyceps militaris CM01]
Length = 366
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/204 (36%), Positives = 106/204 (51%), Gaps = 26/204 (12%)
Query: 6 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR------------- 52
AT+L +L++ M + E + + LSG+L++ALC+ + R
Sbjct: 127 ATVLGSLKKLMAE-----TTERDLDSTTTQLSGALTLALCHTNKASRALGASDTAIHEAA 181
Query: 53 --SGLLHPQP---RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQ 107
SG P P RIL L S QY+ MNA+F+A V ID+ L A ++ FLQQ
Sbjct: 182 QTSGTTPPSPMKGRILVLSVSDSESSQYIPTMNAVFAAAHGRVAIDALAL-AGDATFLQQ 240
Query: 108 ASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMG 167
A + TGGV + GL YL+ D +R L P VDFRA+CFCH N +D G
Sbjct: 241 ACFNTGGVFLRAANPQGLLAYLMFGLVADTEAREALVAPTHDTVDFRAACFCHGNVVDTG 300
Query: 168 YICSVCLSIYCK--HLKKCSTCGS 189
++CS+CLSI+C+ +C TCG+
Sbjct: 301 FVCSICLSIFCELPDNAECFTCGT 324
>gi|226467818|emb|CAX69785.1| putative general transcription factor IIH [Schistosoma japonicum]
Length = 317
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 91/153 (59%), Gaps = 22/153 (14%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL----------------GAQNSAF 104
RIL ++ S D QY+++MN++F+AQ+ VPID+C L +S+
Sbjct: 157 RILIIKASDDSTSQYLSLMNSVFTAQKLHVPIDTCVLPLPRQLDNLNGRRSISSPGHSSL 216
Query: 105 LQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKP------VGVDFRASCF 158
LQQA+ +TGG++ + ++ GL QYLL++F R L LP GVDFRA+CF
Sbjct: 217 LQQAADLTGGIYLQIPRVSGLLQYLLSVFLPSSKMRTSLILPDSRSSGLSFGVDFRAACF 276
Query: 159 CHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
CH+ ID+GY+CS+CLS++C+ C TC + F
Sbjct: 277 CHRRLIDIGYVCSICLSVFCEFNPICPTCNTPF 309
>gi|212529242|ref|XP_002144778.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
marneffei ATCC 18224]
gi|210074176|gb|EEA28263.1| transcription factor TFIIH subunit Tfb4, putative [Talaromyces
marneffei ATCC 18224]
Length = 380
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 59/130 (45%), Positives = 83/130 (63%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + S D QY+ IMN+IF+ QR +PID C L ++ FLQQAS T GV+ +
Sbjct: 211 IVSVSSSTDSAHQYIPIMNSIFACQRLHIPIDVCKLSG-DAVFLQQASDATRGVYMSLSE 269
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
GL QYL+ F D SR L +P V VDFRA+CFCH+ +++G++CS+CLSI+C+
Sbjct: 270 PRGLLQYLMMAFLPDQRSRKHLIVPSRVDVDFRAACFCHRRVVNIGFVCSICLSIFCEPP 329
Query: 182 KK--CSTCGS 189
+ C TCG+
Sbjct: 330 ENGDCLTCGT 339
>gi|255070041|ref|XP_002507102.1| RNA polymerase II transcription factor B subunit 4 [Micromonas sp.
RCC299]
gi|226522377|gb|ACO68360.1| RNA polymerase II transcription factor B subunit 4 [Micromonas sp.
RCC299]
Length = 291
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 71/159 (44%), Positives = 93/159 (58%), Gaps = 4/159 (2%)
Query: 36 LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 95
L+ LSMALC+IQR+ +G RI CL SP QY+ +MN IF+AQ++ V IDS
Sbjct: 130 LAAGLSMALCHIQRL-GTGSFRGLRRIFCLLRSPVSQRQYIPMMNVIFAAQQAFVTIDSY 188
Query: 96 YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 155
L +S L+QA+ +T G+H K Q+ L Q+LLT R+ L K GVDF+
Sbjct: 189 SLCDMHSDILEQAANMTNGLHRKLQKDGELGQHLLTFSSWSQTCRHSLLPGKQCGVDFKT 248
Query: 156 SCFCHKNTIDMGYICSVCLSIYCKHLK---KCSTCGSVF 191
CFCHK T+ G++CS CLSI C LK C TC + F
Sbjct: 249 FCFCHKKTLQRGFVCSACLSISCVALKTGGDCMTCAANF 287
>gi|452838611|gb|EME40551.1| hypothetical protein DOTSEDRAFT_136966 [Dothistroma septosporum
NZE10]
Length = 379
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 81/221 (36%), Positives = 111/221 (50%), Gaps = 48/221 (21%)
Query: 8 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQ------------------- 48
L NL + M++ P ++L+G+L+MAL YI
Sbjct: 128 LTHNLRDMMDRTSPAALSTPS-----TMLAGALTMALTYISKQSAALPTANSSAQFNYSD 182
Query: 49 ---------------RVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
R GL + RIL L S D +QY++IMNAIF+ QR VPID
Sbjct: 183 PSAVAGGSDLTADGSRKKHHGL---RSRILILSVSGDLADQYISIMNAIFACQRMSVPID 239
Query: 94 SCYLGAQNSAFLQQASYITGGVHH--KPQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVG 150
L A ++ FLQQA+ TGGV+ + G QYL+ + D +R L +P + G
Sbjct: 240 ILKL-AGDTVFLQQAADATGGVYVALDEKTRAGFLQYLMFAYLPDQTARAHLIMPGEGEG 298
Query: 151 VDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
VDFRA+CFCH+ +D GY+CS+CLSI+C+ H C TCGS
Sbjct: 299 VDFRAACFCHRKVVDTGYVCSICLSIFCEPLHDGTCLTCGS 339
>gi|398392135|ref|XP_003849527.1| hypothetical protein MYCGRDRAFT_75897 [Zymoseptoria tritici IPO323]
gi|339469404|gb|EGP84503.1| hypothetical protein MYCGRDRAFT_75897 [Zymoseptoria tritici IPO323]
Length = 386
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/140 (45%), Positives = 90/140 (64%), Gaps = 6/140 (4%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHH--K 118
RIL L S D +QY+ IMN+IF+ QR +PID L A ++ FLQQA+ TGG++
Sbjct: 211 RILILSVSGDLADQYIPIMNSIFACQRLSIPIDILKL-AGDTVFLQQAADATGGIYMALD 269
Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
Q G QYL+ + D +RN L +P + GVDFRA+CFCH+ +D+G++CS+CLSI+
Sbjct: 270 EQSRAGFLQYLMFAYLPDQTARNHLIMPGESEGVDFRAACFCHRRVVDIGFVCSICLSIF 329
Query: 178 CKHLK--KCSTCGSVFGQAQ 195
C+ ++ +C TCGS A
Sbjct: 330 CEPMQDGQCLTCGSHLSMAN 349
>gi|315052780|ref|XP_003175764.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
gypseum CBS 118893]
gi|311341079|gb|EFR00282.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
gypseum CBS 118893]
Length = 413
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/139 (42%), Positives = 86/139 (61%), Gaps = 3/139 (2%)
Query: 53 SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
+G L + I+ + + D +QY+ IMN+IF+ QR +PID C L A ++ FLQQA T
Sbjct: 225 AGALQSRILIISVSSAADSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDAT 283
Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 172
G++ G QYL+ F D +R L LP V VDFRA+CFCH+ +D+G++CS+
Sbjct: 284 RGIYMSLDSPRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSI 343
Query: 173 CLSIYCKHLK--KCSTCGS 189
CLSI+C+ + C TCG+
Sbjct: 344 CLSIFCEPPEGANCLTCGT 362
>gi|67515713|ref|XP_657742.1| hypothetical protein AN0138.2 [Aspergillus nidulans FGSC A4]
gi|40746160|gb|EAA65316.1| hypothetical protein AN0138.2 [Aspergillus nidulans FGSC A4]
Length = 384
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 58/130 (44%), Positives = 81/130 (62%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + + QY+ IMN IF+ QR +PID C L ++ FLQQAS T G++ +
Sbjct: 216 IISVSSATGSAHQYIPIMNGIFACQRLHIPIDVCKLSG-DAVFLQQASDATKGIYMSLSE 274
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
GL QYL+ F D SR L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 275 PRGLLQYLMMAFLPDQRSRKHLILPTRVDVDFRAACFCHRRVVDIGFVCSICLSIFCEPP 334
Query: 182 KK--CSTCGS 189
+ C TCG+
Sbjct: 335 ENGDCLTCGT 344
>gi|259489666|tpe|CBF90125.1| TPA: transcription factor TFIIH complex subunit (Eurofung)
[Aspergillus nidulans FGSC A4]
Length = 392
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 58/130 (44%), Positives = 81/130 (62%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + + QY+ IMN IF+ QR +PID C L ++ FLQQAS T G++ +
Sbjct: 224 IISVSSATGSAHQYIPIMNGIFACQRLHIPIDVCKLSG-DAVFLQQASDATKGIYMSLSE 282
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
GL QYL+ F D SR L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 283 PRGLLQYLMMAFLPDQRSRKHLILPTRVDVDFRAACFCHRRVVDIGFVCSICLSIFCEPP 342
Query: 182 KK--CSTCGS 189
+ C TCG+
Sbjct: 343 ENGDCLTCGT 352
>gi|406863689|gb|EKD16736.1| hypothetical protein MBM_05205 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 357
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 71/187 (37%), Positives = 101/187 (54%), Gaps = 24/187 (12%)
Query: 15 FMNKDEQLGKQEPEGRIACSL---LSGSLSMALCYIQRVFR--SGL-------------- 55
+N +L E +A + ++G+L++AL YI + + SG+
Sbjct: 118 LLNSLRELIATTTESDVATTTSTQMAGALTLALSYIHKATQQFSGIDTESKPATAPSTEN 177
Query: 56 ----LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYI 111
L Q RIL + S D QY+ IMN F+AQR +PID L A ++ FLQQAS
Sbjct: 178 ADTILGLQSRILVVSVSGDLAHQYIPIMNTTFAAQRLRIPIDILKL-AGDTVFLQQASDA 236
Query: 112 TGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICS 171
T GV+ + + GL QYL+ F D +R L P VDFRA+CFCH+ +D+GY+CS
Sbjct: 237 TKGVYMQLRNPQGLLQYLMMAFLPDQVARKHLVPPTQEVVDFRAACFCHRKVVDIGYVCS 296
Query: 172 VCLSIYC 178
+CLSI+C
Sbjct: 297 ICLSIFC 303
>gi|385301449|gb|EIF45638.1| subunit of TFIIH complex [Dekkera bruxellensis AWRI1499]
Length = 219
Score = 117 bits (293), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 79/185 (42%), Positives = 99/185 (53%), Gaps = 15/185 (8%)
Query: 9 LQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPRILCLQ 66
LQNL+ + K E K LSG+LS AL YI R F++ H + RI+C+
Sbjct: 3 LQNLKSLVGKGETHIK---------GTLSGALSQALSYINR-FQNSDEHSGMKARIVCIS 52
Query: 67 GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLF 126
S D YV+IMN+IF+AQ+ V +D C LG NS FLQQAS T GV+ GL
Sbjct: 53 VSGDNTLPYVSIMNSIFAAQKQKVSVDVCKLGP-NSTFLQQASDSTNGVYIYIXHPLGLI 111
Query: 127 QYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKC 184
QYL T D R + LP +DFRASCF +D+GY+CSVCL I + C
Sbjct: 112 QYLSTALFIDPMLRPMVVLPTNTNIDFRASCFITNKVVDIGYVCSVCLCILSVIPEDEIC 171
Query: 185 STCGS 189
TC S
Sbjct: 172 PTCHS 176
>gi|384494657|gb|EIE85148.1| hypothetical protein RO3G_09858 [Rhizopus delemar RA 99-880]
Length = 268
Score = 117 bits (293), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 69/162 (42%), Positives = 89/162 (54%), Gaps = 43/162 (26%)
Query: 34 SLLSGSLSMALCYIQRVFR---SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV 90
S ++G+LSMALCYI R+ + G + +PRIL L SPD QY+ +MN IFSAQ++
Sbjct: 131 STITGALSMALCYINRITKLDDEGYI--KPRILVLSVSPDSAFQYIPLMNCIFSAQKA-- 186
Query: 91 PIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVG 150
CY Q+ AFL D +SRNFL LP
Sbjct: 187 ----CY---QHMAFL-----------------------------PDRYSRNFLNLPTQDQ 210
Query: 151 VDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFG 192
VDFRA+CFCHK +D+G++CSVCLSI+CK CSTC + F
Sbjct: 211 VDFRAACFCHKQIVDIGFVCSVCLSIFCKWSPVCSTCKTKFA 252
>gi|405968700|gb|EKC33746.1| Poly [ADP-ribose] polymerase 4 [Crassostrea gigas]
Length = 3371
Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats.
Identities = 67/158 (42%), Positives = 88/158 (55%), Gaps = 30/158 (18%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
SL++G+L+MALC QR H + IL + R +V ID
Sbjct: 2964 SLVAGALAMALC--QRS------HEEAGIL---------------------SSRILVVID 2994
Query: 94 SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDF 153
+C L + S LQQA ITGG++ K Q+ GL QYL+ +F D R L LP V VD+
Sbjct: 2995 ACVLDNE-SGLLQQACDITGGIYLKIPQMKGLLQYLMWVFLPDPLERPKLTLPPKVQVDY 3053
Query: 154 RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
RA+CFCH+N ID+GY+CSVCLSI+C C TC + F
Sbjct: 3054 RAACFCHRNLIDIGYVCSVCLSIFCAFSPICGTCQTTF 3091
>gi|388851454|emb|CCF54856.1| related to TFIIH basal transcription factor complex p34 subunit
[Ustilago hordei]
Length = 423
Score = 116 bits (291), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 60/149 (40%), Positives = 85/149 (57%), Gaps = 16/149 (10%)
Query: 59 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
+ RIL L +PD QY+ +MN IF+AQ+ + ID C L ++ FLQQASY+T G + +
Sbjct: 250 KSRILILSVTPDASTQYIPMMNCIFAAQKKGITIDVCKLFGSDTVFLQQASYLTSGTYFR 309
Query: 119 ------PQQLDG----------LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKN 162
+++G L Q LLT + R + LP +DFRA+CFCH+
Sbjct: 310 LSDDSDALKVNGKEGITDMRSSLVQTLLTTYLPSRSMRGVMNLPTLEEIDFRAACFCHRK 369
Query: 163 TIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+D+GYICSVCLS++C+ C TC S F
Sbjct: 370 IVDIGYICSVCLSLFCQPRPFCLTCRSKF 398
>gi|295670289|ref|XP_002795692.1| RNA polymerase II transcription factor B subunit 4
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226284777|gb|EEH40343.1| RNA polymerase II transcription factor B subunit 4
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 422
Score = 116 bits (291), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 61/138 (44%), Positives = 86/138 (62%), Gaps = 5/138 (3%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ- 120
I+ L + + Y+ IMN+IF+ QR +PID L A ++AFLQQAS TGG++
Sbjct: 247 IISLSNTTHSAQHYIPIMNSIFACQRLHIPIDIIKL-AGDAAFLQQASDATGGIYIPINT 305
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
G QYL+ F D +R L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 306 HPAGFLQYLMLGFLPDQRARAHLILPSRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEP 365
Query: 181 LK---KCSTCGSVFGQAQ 195
L +C TCG+ G A+
Sbjct: 366 LGGEGECLTCGTRLGMAE 383
>gi|326480715|gb|EGE04725.1| TFIIH basal transcription factor complex p34 subunit [Trichophyton
equinum CBS 127.97]
Length = 415
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 57/130 (43%), Positives = 82/130 (63%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + + D +QY+ IMN+IF+ QR +PID C L A ++ FLQQA T G++
Sbjct: 236 IISVSSAADSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 294
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
G QYL+ F D +R L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 295 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 354
Query: 182 K--KCSTCGS 189
+ C TCG+
Sbjct: 355 EGANCLTCGT 364
>gi|326473589|gb|EGD97598.1| transcription factor TFIIH subunit Tfb4 [Trichophyton tonsurans CBS
112818]
Length = 415
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 57/130 (43%), Positives = 82/130 (63%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + + D +QY+ IMN+IF+ QR +PID C L A ++ FLQQA T G++
Sbjct: 236 IISVSSAADSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 294
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
G QYL+ F D +R L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 295 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 354
Query: 182 K--KCSTCGS 189
+ C TCG+
Sbjct: 355 EGANCLTCGT 364
>gi|296815720|ref|XP_002848197.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
otae CBS 113480]
gi|238841222|gb|EEQ30884.1| TFIIH basal transcription factor complex p34 subunit [Arthroderma
otae CBS 113480]
Length = 413
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 59/138 (42%), Positives = 84/138 (60%), Gaps = 3/138 (2%)
Query: 54 GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 113
G L + I+ + + D +QY+ IMN+IF+ QR +PID C L A ++ FLQQA T
Sbjct: 226 GSLQSRILIVSVSSAADSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATR 284
Query: 114 GVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC 173
G++ G QYL+ F D R L LP V VDFRA+CFCH+ +D+G++CS+C
Sbjct: 285 GIYMSVDSPRGFLQYLMLAFLPDQRVRRNLVLPTRVDVDFRAACFCHRKVVDVGFVCSIC 344
Query: 174 LSIYCKHLK--KCSTCGS 189
LSI+C+ + C TCG+
Sbjct: 345 LSIFCEPPEGANCLTCGT 362
>gi|393217350|gb|EJD02839.1| Tfb4-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 432
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 58/138 (42%), Positives = 86/138 (62%), Gaps = 1/138 (0%)
Query: 55 LLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR-SMVPIDSCYLGAQNSAFLQQASYITG 113
++ P+PRI+ L SPD Y+ +MN+IFSAQ V ID C + A ++ FLQQA+++TG
Sbjct: 198 VVSPEPRIVILSVSPDLSTSYIPLMNSIFSAQNLCKVAIDVCKIYAPDAVFLQQAAHLTG 257
Query: 114 GVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC 173
G + ++ D L Q+LL F R + +P +DFR +CF HK +D+G++CSV
Sbjct: 258 GSYIYLERRDALLQHLLMTFLPTPSIRQPIAVPTQDKIDFRVACFFHKTIVDIGFVCSVR 317
Query: 174 LSIYCKHLKKCSTCGSVF 191
LSI C+ + CSTC + F
Sbjct: 318 LSILCQLVPVCSTCRTKF 335
>gi|444724893|gb|ELW65479.1| Tectonic-2 [Tupaia chinensis]
Length = 805
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 55/113 (48%), Positives = 78/113 (69%), Gaps = 1/113 (0%)
Query: 65 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
++ + D QY+ MN IF+AQ+ + ID+C L + +S LQQA ITGG++ K Q+
Sbjct: 10 IKAAEDSALQYMNFMNVIFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPS 68
Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
L QYLL +F D R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+
Sbjct: 69 LLQYLLWVFLPDQDQRSQLTLPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIF 121
>gi|302656245|ref|XP_003019878.1| hypothetical protein TRV_06076 [Trichophyton verrucosum HKI 0517]
gi|291183651|gb|EFE39254.1| hypothetical protein TRV_06076 [Trichophyton verrucosum HKI 0517]
Length = 517
Score = 115 bits (287), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 82/130 (63%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + + + +QY+ IMN+IF+ QR +PID C L A ++ FLQQA T G++
Sbjct: 338 IISVSSAANSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 396
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
G QYL+ F D +R L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 397 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 456
Query: 182 K--KCSTCGS 189
+ C TCG+
Sbjct: 457 EGANCLTCGT 466
>gi|358389021|gb|EHK26614.1| hypothetical protein TRIVIDRAFT_69532 [Trichoderma virens Gv29-8]
Length = 363
Score = 115 bits (287), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/176 (37%), Positives = 92/176 (52%), Gaps = 20/176 (11%)
Query: 36 LSGSLSMALCYIQRVFRS--------------GLLHPQP---RILCLQGSPDGPEQYVAI 78
LSG+L++ALC I + +S P P RI+ + S P QY+
Sbjct: 150 LSGALTLALCRINKTAQSLSSSDTTISNAAPVNSTAPPPVKGRIVVISVSDSEPSQYIPT 209
Query: 79 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
MNA+F+A + V ID+ L A +S FLQQA + T G+ K GL YL+ D
Sbjct: 210 MNAVFAAAHNQVAIDTIAL-AGDSTFLQQACFNTNGIFLKASNPQGLLTYLMFGLIPDTE 268
Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGSVFG 192
+R L P VDFR +CFCH +D G++CSVCLSI+C+ +C TCG++
Sbjct: 269 ARGSLITPTHDTVDFRTACFCHGKVVDTGFVCSVCLSIFCEPPENAECLTCGTILA 324
>gi|71004806|ref|XP_757069.1| hypothetical protein UM00922.1 [Ustilago maydis 521]
gi|46096873|gb|EAK82106.1| hypothetical protein UM00922.1 [Ustilago maydis 521]
Length = 422
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 58/148 (39%), Positives = 80/148 (54%), Gaps = 15/148 (10%)
Query: 59 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
+ RIL L + D QY+ +MN IF+AQ+ + ID C L +++ FLQQASY+T G + +
Sbjct: 251 KSRILILSVTQDASTQYIPMMNCIFAAQKKGITIDVCKLFGKDTVFLQQASYLTSGTYFR 310
Query: 119 ---------------PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 163
L Q LLT + R + LP +DFRA+CFCH+
Sbjct: 311 LADDEDQSKSDTASATDMRSSLVQTLLTTYLPSRSMRGVMNLPSLEKIDFRAACFCHRKI 370
Query: 164 IDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+D+GYICSVCLS++C C TC S F
Sbjct: 371 VDIGYICSVCLSLFCSPKPSCLTCRSKF 398
>gi|327299656|ref|XP_003234521.1| transcription factor TFIIH subunit Tfb4 [Trichophyton rubrum CBS
118892]
gi|326463415|gb|EGD88868.1| transcription factor TFIIH subunit Tfb4 [Trichophyton rubrum CBS
118892]
Length = 413
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 82/130 (63%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + + + +QY+ IMN+IF+ QR +PID C L A ++ FLQQA T G++
Sbjct: 234 IISVSSAANSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 292
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
G QYL+ F D +R L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 293 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 352
Query: 182 K--KCSTCGS 189
+ C TCG+
Sbjct: 353 EGANCLTCGT 362
>gi|452978955|gb|EME78718.1| hypothetical protein MYCFIDRAFT_124295, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 339
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/136 (45%), Positives = 88/136 (64%), Gaps = 10/136 (7%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH---- 116
RIL L S D +QY+ IMN+IF+ QR +PID L A ++ FLQQA+ TGG++
Sbjct: 198 RILILSVSGDLADQYIPIMNSIFACQRLSIPIDILKL-AGDTVFLQQAADATGGIYMALD 256
Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVGVDFRASCFCHKNTIDMGYICSVCLS 175
H+ + G QYL+ + D +R L +P + GVDFRA+CFCH+ +D+G++CS+CLS
Sbjct: 257 HRTRA--GFLQYLMFAYLPDEAARKHLIVPGEGEGVDFRAACFCHRRVVDIGFVCSICLS 314
Query: 176 IYCK--HLKKCSTCGS 189
I+C+ H C TCGS
Sbjct: 315 IFCEPLHDSTCLTCGS 330
>gi|226294041|gb|EEH49461.1| RNA polymerase II transcription factor B subunit 4
[Paracoccidioides brasiliensis Pb18]
Length = 408
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 60/138 (43%), Positives = 85/138 (61%), Gaps = 5/138 (3%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ- 120
I+ L + + Y+ IMN+IF+ QR +PID L A ++AFLQQAS TGG++
Sbjct: 233 IISLSNNTHSAQHYIPIMNSIFACQRLHIPIDIIKL-AGDAAFLQQASDATGGIYIPINA 291
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
G QYL+ F D +R L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 292 HPAGFLQYLMLGFLPDQRARAHLILPSRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEP 351
Query: 181 L---KKCSTCGSVFGQAQ 195
L +C TCG+ G +
Sbjct: 352 LGGEGECLTCGTRLGMVE 369
>gi|340515579|gb|EGR45832.1| hypothetical protein TRIREDRAFT_67476 [Trichoderma reesei QM6a]
Length = 362
Score = 114 bits (284), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 67/176 (38%), Positives = 91/176 (51%), Gaps = 20/176 (11%)
Query: 36 LSGSLSMALCYIQRVFRS--------------GLLHPQP---RILCLQGSPDGPEQYVAI 78
LSG+L++ALC I + ++ P P RI+ + S P QY+
Sbjct: 149 LSGALTLALCRINKAAQALSSSDTTLSNAAPVNATAPPPVKGRIVVVSVSDSDPSQYIPT 208
Query: 79 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
MNA+F+A + V ID+ L A +S FLQQA + T G+ K GL YL+ D
Sbjct: 209 MNAVFAAAHNQVAIDTISL-AGDSTFLQQACFNTNGIFLKAANPRGLLTYLMFGLIPDTE 267
Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGSVFG 192
+R L P VDFR +CFCH +D GY+CSVCLSI+C +C TCG+V
Sbjct: 268 ARASLITPTHDTVDFRTACFCHGKVVDTGYVCSVCLSIFCTPPDNAECLTCGTVLA 323
>gi|342873217|gb|EGU75428.1| hypothetical protein FOXB_14053 [Fusarium oxysporum Fo5176]
Length = 358
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 68/204 (33%), Positives = 105/204 (51%), Gaps = 26/204 (12%)
Query: 6 ATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS------------ 53
A +L + + M++ +E + + +SG+L++ALC+I + ++
Sbjct: 118 AAVLAAIRKLMDQ-----TKEEDLSATTTQISGALTLALCHINKAAQALCAPTANLEDSH 172
Query: 54 -GLLHPQP-----RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQ 107
G + P RIL + S P QY+ MNA+F+A + V ID+ L + + FLQQ
Sbjct: 173 KGSSNTAPPTVRGRILVISVSDSEPSQYIPTMNAVFAAGHTQVAIDTLSLTGEPT-FLQQ 231
Query: 108 ASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMG 167
A Y TGG + GL YL+ D +R L P VDFRA+CFCH +D G
Sbjct: 232 ACYNTGGTYLAATHPQGLLNYLMFGLIADTEAREALIAPTHDTVDFRAACFCHGRVVDTG 291
Query: 168 YICSVCLSIYCK--HLKKCSTCGS 189
++CS+CLSI+C+ +C TCG+
Sbjct: 292 FVCSICLSIFCETPENSECFTCGT 315
>gi|347440963|emb|CCD33884.1| similar to RNA polymerase II transcription factor B subunit 4
[Botryotinia fuckeliana]
Length = 358
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 75/208 (36%), Positives = 106/208 (50%), Gaps = 33/208 (15%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR-------------- 52
TLL ++ + E +QE + ++G+L++AL YI +
Sbjct: 117 TLLNSVRSLI---ESTTEQE-VSTTTTTQMAGALTLALSYINKQLTAYSDVSAADSKPTP 172
Query: 53 ---------SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSA 103
+GL Q RIL L S D QY+ IMN F+AQR +PID L A ++
Sbjct: 173 SDITPSDSTTGL---QSRILVLSVSGDLAHQYIPIMNTTFAAQRMRIPIDILKL-AGDTV 228
Query: 104 FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 163
FLQQAS T G++ + GL QYL+ F D SR L P VDFRA+CFCH+
Sbjct: 229 FLQQASDTTRGIYMHLKNPQGLLQYLMMAFLPDQTSRKDLIAPTQEVVDFRAACFCHRKV 288
Query: 164 IDMGYICSVCLSIYCKHLKK--CSTCGS 189
+D+G++CS+CLSI+C + C TC +
Sbjct: 289 VDVGFVCSICLSIFCSPPEGAICLTCST 316
>gi|154310839|ref|XP_001554750.1| hypothetical protein BC1G_06398 [Botryotinia fuckeliana B05.10]
Length = 356
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 75/208 (36%), Positives = 106/208 (50%), Gaps = 33/208 (15%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR-------------- 52
TLL ++ + E +QE + ++G+L++AL YI +
Sbjct: 115 TLLNSVRSLI---ESTTEQE-VSTTTTTQMAGALTLALSYINKQLTAYSDVSAADSKPTP 170
Query: 53 ---------SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSA 103
+GL Q RIL L S D QY+ IMN F+AQR +PID L A ++
Sbjct: 171 SDITPSDSTTGL---QSRILVLSVSGDLAHQYIPIMNTTFAAQRMRIPIDILKL-AGDTV 226
Query: 104 FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 163
FLQQAS T G++ + GL QYL+ F D SR L P VDFRA+CFCH+
Sbjct: 227 FLQQASDTTRGIYMHLKNPQGLLQYLMMAFLPDQTSRKDLIAPTQEVVDFRAACFCHRKV 286
Query: 164 IDMGYICSVCLSIYCKHLKK--CSTCGS 189
+D+G++CS+CLSI+C + C TC +
Sbjct: 287 VDVGFVCSICLSIFCSPPEGAICLTCST 314
>gi|303314233|ref|XP_003067125.1| Transcription factor Tfb4 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240106793|gb|EER24980.1| Transcription factor Tfb4 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320037386|gb|EFW19323.1| transcription factor TFIIH subunit Tfb4 [Coccidioides posadasii
str. Silveira]
Length = 395
Score = 113 bits (283), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 79/130 (60%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + + QY+ IMN+IF+ QR +PID C L ++ FLQQA T G++
Sbjct: 216 IISVSSATGSAHQYIPIMNSIFACQRLHIPIDICKLSG-DAVFLQQACDATRGIYVPVDH 274
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
G QYL+ F D SR L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 275 PRGFLQYLMVAFLPDQRSRRHLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 334
Query: 182 K--KCSTCGS 189
+ C TCG+
Sbjct: 335 EGADCLTCGT 344
>gi|119174356|ref|XP_001239539.1| hypothetical protein CIMG_09160 [Coccidioides immitis RS]
gi|392869735|gb|EAS28255.2| transcription factor tfb4 [Coccidioides immitis RS]
Length = 396
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 79/130 (60%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + + QY+ IMN+IF+ QR +PID C L ++ FLQQA T G++
Sbjct: 217 IISVSSATGSAHQYIPIMNSIFACQRLHIPIDICKLSG-DAVFLQQACDATRGIYVPVDH 275
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
G QYL+ F D SR L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 276 PRGFLQYLMVAFLPDQRSRRHLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 335
Query: 182 K--KCSTCGS 189
+ C TCG+
Sbjct: 336 EGADCLTCGT 345
>gi|443895518|dbj|GAC72864.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH, subunit TFB4 [Pseudozyma antarctica
T-34]
Length = 425
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 59/148 (39%), Positives = 83/148 (56%), Gaps = 15/148 (10%)
Query: 59 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH-- 116
+ RIL L + D QY+ +MN IF+AQ+ + ID C L ++ FLQQASY+T G +
Sbjct: 253 KSRILILSVTQDASTQYIPMMNCIFAAQKKGITIDVCKLLGNDTVFLQQASYLTSGTYFR 312
Query: 117 ----------HKPQQLD---GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 163
+ +D L Q LLT + R+ + LP +DFRA+CFCH+
Sbjct: 313 LSDEQAGPSGQEASAIDLRSALVQTLLTTYLPSRSMRSSMNLPTLEEIDFRAACFCHRKI 372
Query: 164 IDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+D+GYICSVCLS++C+ C TC S F
Sbjct: 373 VDVGYICSVCLSLFCQPRPFCLTCRSKF 400
>gi|156034244|ref|XP_001585541.1| hypothetical protein SS1G_13425 [Sclerotinia sclerotiorum 1980]
gi|154698828|gb|EDN98566.1| hypothetical protein SS1G_13425 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 356
Score = 113 bits (282), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 61/133 (45%), Positives = 80/133 (60%), Gaps = 3/133 (2%)
Query: 59 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
Q RIL L S D QY+ IMN F+AQR +PID L A ++ FLQQAS T G++
Sbjct: 183 QSRILVLSVSGDLAHQYIPIMNTTFAAQRMRIPIDILKL-AGDTVFLQQASDTTRGIYMH 241
Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
+ GL QYL+ F D SR L P VDFRA+CFCH+ +D+G++CS+CLSI+C
Sbjct: 242 LKNPQGLLQYLMIAFLPDQTSRKDLIAPTQEVVDFRAACFCHRKVVDVGFVCSICLSIFC 301
Query: 179 KHLKK--CSTCGS 189
+ C TC +
Sbjct: 302 SPPEGAICLTCST 314
>gi|195111388|ref|XP_002000261.1| GI10128 [Drosophila mojavensis]
gi|193916855|gb|EDW15722.1| GI10128 [Drosophila mojavensis]
Length = 334
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 99/191 (51%), Gaps = 6/191 (3%)
Query: 25 QEPEGRIACSLLSGSLSMALCYIQRVFR--SGLLHPQPRILCLQGSPDGPEQYVAI-MNA 81
+E E + +LL G+++M L Y+ R R + + H + RIL + GS + P + MNA
Sbjct: 131 KEEEEQTVTALLPGTVAMVLSYLSRCRREVAKVAHIRGRILIVSGSKEPPIPLATMQMNA 190
Query: 82 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 141
A R V ID C L ++S L+ A+ ITGG + D L LL +F H R
Sbjct: 191 FQVAARMGVVIDVCALELESSYMLRHAADITGGFYFSTSNFDTLSGVLLGLFLASPHVRQ 250
Query: 142 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFG---QAQTQS 198
L P D RA CFCH ++MG++CS CL+++CK+ C C ++F + Q
Sbjct: 251 HLNYPVQPQADLRAMCFCHSKLVEMGFVCSSCLTVFCKYTPICGKCETIFKPPQKLQMYI 310
Query: 199 DEPSATNRKRK 209
D +A R+R+
Sbjct: 311 DNRAAQRRQRE 321
>gi|302899112|ref|XP_003047982.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256728914|gb|EEU42269.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 355
Score = 112 bits (281), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 68/196 (34%), Positives = 100/196 (51%), Gaps = 24/196 (12%)
Query: 16 MNKDEQLGKQEPEGRIACSL--LSGSLSMALCYIQRVFRSGLLHPQP------------- 60
++ +L Q E +A + LSG+L++ALC+I + ++ L P
Sbjct: 120 LDSVRKLMDQTTEADLATTTTQLSGALTLALCHINKAAQA-LCSPTANLEDNHKATTTAP 178
Query: 61 -----RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGV 115
RIL + S P QY+ MNA+F+A + V ID+ L + FLQQA + TGG
Sbjct: 179 PTVRGRILVISVSDSEPSQYIPTMNAVFAAAHAQVAIDTLSLTG-DPTFLQQACFNTGGT 237
Query: 116 HHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS 175
+ GL YL+ D +R L P VDFRA+CFCH +D G++CS+CLS
Sbjct: 238 YLAATHPQGLLTYLMFGLIADTEAREALVAPTHDTVDFRAACFCHGRVVDTGFVCSICLS 297
Query: 176 IYCK--HLKKCSTCGS 189
I+C+ +C TCG+
Sbjct: 298 IFCELPENSECLTCGT 313
>gi|302496747|ref|XP_003010374.1| hypothetical protein ARB_03075 [Arthroderma benhamiae CBS 112371]
gi|291173917|gb|EFE29734.1| hypothetical protein ARB_03075 [Arthroderma benhamiae CBS 112371]
Length = 414
Score = 112 bits (281), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 55/130 (42%), Positives = 82/130 (63%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + + + +QY+ IMN+IF+ QR +PID C L A ++ FLQQA T G++
Sbjct: 235 IISVSSAANSAQQYIPIMNSIFACQRLHIPIDICKL-AGDAVFLQQACDATRGIYMSLDS 293
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
G QYL+ F D +R L LP V VDFRA+CFCH+ +++G++CS+CLSI+C+
Sbjct: 294 PRGFLQYLMLAFLPDQRARRNLILPTRVDVDFRAACFCHRKVVNVGFVCSICLSIFCEPP 353
Query: 182 K--KCSTCGS 189
+ C TCG+
Sbjct: 354 EGANCLTCGT 363
>gi|440633398|gb|ELR03317.1| hypothetical protein GMDG_06064 [Geomyces destructans 20631-21]
Length = 363
Score = 112 bits (281), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 73/207 (35%), Positives = 104/207 (50%), Gaps = 30/207 (14%)
Query: 8 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQR---------VFRSGLLHP 58
LL +L E ++ E + ++G+L++AL Y + V SG HP
Sbjct: 121 LLTSLRELIDST----TTEDVSSTTTTQMAGALTLALSYSNKATVTYNESGVSSSGKSHP 176
Query: 59 QP--------------RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAF 104
RIL + S D QY+ IMN F+AQR +PID L ++ F
Sbjct: 177 DGGENPTDDVPRGLLCRILVISVSGDLAHQYIPIMNTTFAAQRLRIPIDILKLSG-DTVF 235
Query: 105 LQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTI 164
LQQAS T G + + + GL Q L+ F D +R L P VDFRA+CFCH+N +
Sbjct: 236 LQQASDATNGTYMQLRNPQGLLQSLMLGFLPDQSARTHLISPTQEVVDFRAACFCHRNVV 295
Query: 165 DMGYICSVCLSIYCKHLKK--CSTCGS 189
D+G++CS+CLSI+C + C TCG+
Sbjct: 296 DVGFVCSICLSIFCSPPEGAICLTCGT 322
>gi|358396042|gb|EHK45429.1| hypothetical protein TRIATDRAFT_221909 [Trichoderma atroviride IMI
206040]
Length = 367
Score = 112 bits (281), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 92/176 (52%), Gaps = 20/176 (11%)
Query: 36 LSGSLSMALCYIQRVFRS--------------GLLHPQP---RILCLQGSPDGPEQYVAI 78
LSG+L++ALC I + ++ P P RI+ + S P QY+
Sbjct: 154 LSGALTLALCRINKASQALSASDTTLSNAAPVNSAAPPPVKSRIVVISVSDSEPSQYIPT 213
Query: 79 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
MNA+F+A + V ID+ L A +S FLQQA + T G+ K GL YL+ D
Sbjct: 214 MNAVFAAAHNQVAIDTIAL-AGDSTFLQQACFNTNGIFLKASNPQGLLTYLMFGLIPDTE 272
Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGSVFG 192
+R + P VDFR +CFCH +D G++CSVCLSI+C+ +C TCG+V
Sbjct: 273 ARESIITPAHDTVDFRTACFCHGRVVDTGFVCSVCLSIFCEPPENAECLTCGTVLA 328
>gi|258567552|ref|XP_002584520.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237905966|gb|EEP80367.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 398
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 56/130 (43%), Positives = 80/130 (61%), Gaps = 3/130 (2%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ 121
I+ + + QY+ IMN+IF+ QR +PID C L ++ FLQQA T G++
Sbjct: 219 IISVSSATGSAHQYIPIMNSIFACQRLHIPIDICKLSG-DAVFLQQACDATRGIYVPVDH 277
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
G QYL+ F D SR+ L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 278 PLGFLQYLMVAFLPDQRSRSHLILPTRVDVDFRAACFCHRKVVDVGFVCSICLSIFCEPP 337
Query: 182 K--KCSTCGS 189
+ C TCG+
Sbjct: 338 EGADCLTCGT 347
>gi|343426778|emb|CBQ70306.1| related to TFIIH basal transcription factor complex p34 subunit
[Sporisorium reilianum SRZ2]
Length = 422
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 59/148 (39%), Positives = 83/148 (56%), Gaps = 15/148 (10%)
Query: 59 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
+ RIL L + D QY+ +MN IF+AQ+ + ID C L ++ FLQQASY+T G + +
Sbjct: 251 KSRILILSVTQDASTQYIPMMNCIFAAQKRGITIDVCKLFGSDTVFLQQASYLTSGTYFR 310
Query: 119 ----PQQLDG-----------LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNT 163
+Q +G L Q LLT + R + LP +DFR +CFCH+
Sbjct: 311 LADDHEQSNGDKAGAIDMRSSLVQTLLTTYLPSRSMRGVMNLPTLEEIDFRTACFCHRRI 370
Query: 164 IDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+D+GYICSVCLS++C+ C TC S F
Sbjct: 371 VDVGYICSVCLSLFCQPRPFCLTCRSKF 398
>gi|322698540|gb|EFY90309.1| transcription factor TFIIH subunit Tfb4, putative [Metarhizium
acridum CQMa 102]
Length = 358
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 68/173 (39%), Positives = 92/173 (53%), Gaps = 20/173 (11%)
Query: 36 LSGSLSMALCYIQRVFR-----SGLLH--PQP----------RILCLQGSPDGPEQYVAI 78
LSG+L++ALC+I + + SG L QP RIL + S P QY+ I
Sbjct: 144 LSGALTLALCHINKASQALSGASGSLAEVAQPSANAALPMKGRILVVSVSDSEPSQYIPI 203
Query: 79 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
MNA+F+A + V ID+ L N FLQQA + T G GL YL+ D
Sbjct: 204 MNAVFAAAHAQVAIDTLCLSG-NPTFLQQACFNTNGTFLAAANPKGLLAYLMFGLIADTE 262
Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
+R L P VDFRA+CFCH +D G++CS+CLSI+C+ +C TCG+
Sbjct: 263 ARQSLITPTHDTVDFRAACFCHGKVVDRGFVCSICLSIFCELPENAECLTCGT 315
>gi|322709828|gb|EFZ01403.1| transcription factor TFIIH subunit Tfb4, putative [Metarhizium
anisopliae ARSEF 23]
Length = 358
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/173 (38%), Positives = 91/173 (52%), Gaps = 20/173 (11%)
Query: 36 LSGSLSMALCYIQRVFR------SGLLH-PQP----------RILCLQGSPDGPEQYVAI 78
LSG+L++ALC+I + + S L QP RIL + S P QY+
Sbjct: 144 LSGALTLALCHINKASQALSGASSSLAEVAQPSANAAPPMKGRILVVSVSDSEPTQYIPT 203
Query: 79 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
MNA+F+A + V ID+ L N FLQQA + T G+ GL YL+ D
Sbjct: 204 MNAVFAAAHAQVAIDTLSLSG-NPTFLQQACFNTNGIFLAAANPKGLLTYLMFGLIADTE 262
Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
+R L P VDFRA+CFCH +D G+ICS+CLSI+C+ +C TCG+
Sbjct: 263 ARQSLITPTHDTVDFRAACFCHGKVVDRGFICSICLSIFCELPENAECLTCGT 315
>gi|327352824|gb|EGE81681.1| transcription factor TFIIH subunit Tfb4 [Ajellomyces dermatitidis
ATCC 18188]
Length = 426
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/141 (42%), Positives = 84/141 (59%), Gaps = 8/141 (5%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK-PQ 120
I+ L + + Y+ IMN+IF+ QR +PID L A ++AFLQQAS T G++
Sbjct: 246 IISLSSTTHAAQHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPVTS 304
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
G QYL+ F D +R L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 305 HPAGFLQYLMLGFLPDQRARAHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 364
Query: 181 L------KKCSTCGSVFGQAQ 195
L +C TCGS G +
Sbjct: 365 LGGVDGVAECLTCGSQLGVGE 385
>gi|261201796|ref|XP_002628112.1| transcription factor TFIIH subunit Tfb4 [Ajellomyces dermatitidis
SLH14081]
gi|239590209|gb|EEQ72790.1| transcription factor TFIIH subunit Tfb4 [Ajellomyces dermatitidis
SLH14081]
Length = 425
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/141 (42%), Positives = 84/141 (59%), Gaps = 8/141 (5%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK-PQ 120
I+ L + + Y+ IMN+IF+ QR +PID L A ++AFLQQAS T G++
Sbjct: 245 IISLSSTTHAAQHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPVTS 303
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
G QYL+ F D +R L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 304 HPAGFLQYLMLGFLPDQRARAHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 363
Query: 181 L------KKCSTCGSVFGQAQ 195
L +C TCGS G +
Sbjct: 364 LGGVDGVAECLTCGSQLGVGE 384
>gi|449295631|gb|EMC91652.1| hypothetical protein BAUCODRAFT_300256 [Baudoinia compniacensis
UAMH 10762]
Length = 386
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 107/221 (48%), Gaps = 43/221 (19%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS------------- 53
TL+ NL + MN + ++L+G+L+ AL YI + S
Sbjct: 128 TLMYNLRKMMNTT----TTDAINSSPATMLAGALTRALAYISKQTASLPTATSSTQFNYS 183
Query: 54 -------------------GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 94
L RIL + S D ++Y+ IMN IF+ QR VPID
Sbjct: 184 DPSAMAGGNELTTDPTKSANALGLSSRILIVSVSGDLADRYIEIMNCIFACQRMSVPIDV 243
Query: 95 CYLGAQNSAFLQQASYITGGVH---HKPQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVG 150
L A + FLQQA+ TGG++ GL QYL+ + D +RN L P + G
Sbjct: 244 LKL-AGDPVFLQQAADTTGGIYMSLSTSTARAGLLQYLMFAYLMDQTARNHLIAPGEGEG 302
Query: 151 VDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGS 189
VDFRA+CFCHK +D+G++CS+CLSI+C+ L C CGS
Sbjct: 303 VDFRAACFCHKRVVDIGFVCSICLSIFCEPLPDGTCLLCGS 343
>gi|239611920|gb|EEQ88907.1| transcription factor TFIIH subunit Tfb4 [Ajellomyces dermatitidis
ER-3]
Length = 425
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/141 (42%), Positives = 84/141 (59%), Gaps = 8/141 (5%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK-PQ 120
I+ L + + Y+ IMN+IF+ QR +PID L A ++AFLQQAS T G++
Sbjct: 245 IISLSSTTHAAQHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPVTS 303
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
G QYL+ F D +R L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 304 HPAGFLQYLMLGFLPDQRARAHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 363
Query: 181 L------KKCSTCGSVFGQAQ 195
L +C TCGS G +
Sbjct: 364 LGGVDGVAECLTCGSQLGVGE 384
>gi|302680643|ref|XP_003030003.1| hypothetical protein SCHCODRAFT_30664 [Schizophyllum commune H4-8]
gi|300103694|gb|EFI95100.1| hypothetical protein SCHCODRAFT_30664 [Schizophyllum commune H4-8]
Length = 257
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 82/142 (57%), Gaps = 11/142 (7%)
Query: 36 LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 95
+ +L+ +LCYI R L P RIL L SPD + YV MNAIFSAQ+ VPID+
Sbjct: 121 FTSALTRSLCYINR-----LQSPSSRILLLSVSPDRAQDYVPFMNAIFSAQKLKVPIDTL 175
Query: 96 YLGAQNSAFLQQASYITGGVHHK------PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
L A +S FLQQA+Y+TGG + + + GL QYL+ F R + +P
Sbjct: 176 QLAAHDSVFLQQATYLTGGAYVRLGDESGEGKRGGLLQYLMMCFLPPPALRQVMAVPTMD 235
Query: 150 GVDFRASCFCHKNTIDMGYICS 171
V+ RA+CFCHK ++G++CS
Sbjct: 236 QVNLRAACFCHKRMTEIGFVCS 257
>gi|240274744|gb|EER38259.1| RNA polymerase II transcription factor B subunit 4 [Ajellomyces
capsulatus H143]
Length = 412
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 59/135 (43%), Positives = 82/135 (60%), Gaps = 8/135 (5%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH-HKPQ 120
I+ L + Y+ IMN+IF+ QR +PID L A ++AFLQQAS T G++
Sbjct: 231 IISLSSTTHSAHHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPATS 289
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
G QYL+ F D +R+ L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 290 HPAGFLQYLMLGFLPDQRARSHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 349
Query: 181 L------KKCSTCGS 189
L +C TCGS
Sbjct: 350 LAGVDGVAECLTCGS 364
>gi|410976500|ref|XP_003994658.1| PREDICTED: general transcription factor IIH subunit 3 [Felis catus]
Length = 407
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 74/213 (34%), Positives = 108/213 (50%), Gaps = 44/213 (20%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS-----------------------GLLHP 58
+ K + +G+ +LL+GSL+ ALCYI R+ + +LH
Sbjct: 177 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILFQAPNAPMSMPNILHV 236
Query: 59 QPRILC-----------LQGSPDGPEQYVAIMNAIFSAQ------RSM---VPIDSCYLG 98
C L+G+ D Y ++ A RS+ + ID+C L
Sbjct: 237 GAAACCSERPWALGTGRLEGAGDSTAFYRLCDRSLAKAWVLQLQMRSIYLNILIDACVLD 296
Query: 99 AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCF 158
+ +S LQQA ITGG++ K Q+ L QYLL +F D R+ L LP P+ VD+RA+CF
Sbjct: 297 S-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQLILPPPIHVDYRAACF 355
Query: 159 CHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
CH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 356 CHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 388
>gi|225558318|gb|EEH06602.1| RNA polymerase II transcription factor B subunit 4 [Ajellomyces
capsulatus G186AR]
Length = 412
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 59/135 (43%), Positives = 82/135 (60%), Gaps = 8/135 (5%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH-HKPQ 120
I+ L + Y+ IMN+IF+ QR +PID L A ++AFLQQAS T G++
Sbjct: 231 IISLSSTTHSAHHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPATS 289
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
G QYL+ F D +R+ L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 290 HPAGFLQYLMLGFLPDQRARSHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 349
Query: 181 L------KKCSTCGS 189
L +C TCGS
Sbjct: 350 LAGVDGVAECLTCGS 364
>gi|325094096|gb|EGC47406.1| RNA polymerase II transcription factor B subunit 4 [Ajellomyces
capsulatus H88]
Length = 412
Score = 110 bits (274), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 59/135 (43%), Positives = 82/135 (60%), Gaps = 8/135 (5%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH-HKPQ 120
I+ L + Y+ IMN+IF+ QR +PID L A ++AFLQQAS T G++
Sbjct: 231 IISLSSTTHSAHHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPATP 289
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
G QYL+ F D +R+ L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 290 HPAGFLQYLMLGFLPDQRARSHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 349
Query: 181 L------KKCSTCGS 189
L +C TCGS
Sbjct: 350 LAGVDGVAECLTCGS 364
>gi|154286010|ref|XP_001543800.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407441|gb|EDN02982.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 413
Score = 110 bits (274), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 59/135 (43%), Positives = 82/135 (60%), Gaps = 8/135 (5%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH-HKPQ 120
I+ L + Y+ IMN+IF+ QR +PID L A ++AFLQQAS T G++
Sbjct: 232 IISLSSTTHSAHHYIPIMNSIFACQRLHIPIDILKL-AGDAAFLQQASDATRGIYIPATS 290
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
G QYL+ F D +R+ L LP V VDFRA+CFCH+ +D+G++CS+CLSI+C+
Sbjct: 291 HPAGFLQYLMLGFLPDQRARSHLVLPSRVDVDFRAACFCHRRVVDVGFVCSICLSIFCEP 350
Query: 181 L------KKCSTCGS 189
L +C TCGS
Sbjct: 351 LGGVDGVAECLTCGS 365
>gi|389644740|ref|XP_003720002.1| RNA polymerase II transcription factor B subunit 4 [Magnaporthe
oryzae 70-15]
gi|351639771|gb|EHA47635.1| RNA polymerase II transcription factor B subunit 4 [Magnaporthe
oryzae 70-15]
gi|440470636|gb|ELQ39698.1| RNA polymerase II transcription factor B subunit 4 [Magnaporthe
oryzae Y34]
gi|440480828|gb|ELQ61470.1| RNA polymerase II transcription factor B subunit 4 [Magnaporthe
oryzae P131]
Length = 380
Score = 109 bits (273), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 82/134 (61%), Gaps = 7/134 (5%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL-GAQNSAFLQQASYITGGVHHKP 119
RIL + S +QY+ MNA+F+A + VP+D L GA + FLQQ ++ITGG +
Sbjct: 208 RILTVSVSDSSADQYIPTMNAVFAASSAGVPLDVLALRGA--APFLQQGAFITGGTYIAA 265
Query: 120 QQLDGLFQYLLTIFGTD--LHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
+ G+ YL+T F + + S+ L P VDFRA+CFCH+ +D GY+CSVCLSI+
Sbjct: 266 TEPRGILAYLMTGFASTSGVGSQGLLLSPGSESVDFRAACFCHRRAVDTGYVCSVCLSIF 325
Query: 178 CK--HLKKCSTCGS 189
C+ +C TCGS
Sbjct: 326 CEVPSGAECLTCGS 339
>gi|453081361|gb|EMF09410.1| transcription factor Tfb4 [Mycosphaerella populorum SO2202]
Length = 287
Score = 109 bits (272), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 60/134 (44%), Positives = 83/134 (61%), Gaps = 6/134 (4%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHH--K 118
R+L L S D QY+ IMN+IF+ QR +PID L A ++ FLQQA+ TGG++
Sbjct: 114 RLLILSVSGDLANQYIPIMNSIFACQRLSIPIDILKL-AGDTVFLQQAADATGGIYMALD 172
Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLP-KPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
G QYL+ + D +R L P + GVDFRA+CFCH+ +D+G++CS+CLSI+
Sbjct: 173 SNTRGGFLQYLMFAYLPDETARKHLITPGEGEGVDFRAACFCHRRVVDIGFVCSICLSIF 232
Query: 178 CK--HLKKCSTCGS 189
C+ H C TCGS
Sbjct: 233 CEPLHDSLCLTCGS 246
>gi|344300009|gb|EGW30349.1| hypothetical protein SPAPADRAFT_63205 [Spathaspora passalidarum
NRRL Y-27907]
Length = 367
Score = 109 bits (272), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 78/246 (31%), Positives = 113/246 (45%), Gaps = 47/246 (19%)
Query: 8 LLQNLEEFMNKDEQLGKQEPEGRIA--CSLLSGSLSMALCYIQRVF-------------- 51
+L+ L EF+N ++ K + + +A S LSG+LSMAL Y R+
Sbjct: 121 VLEELNEFLN---EINKDQDDTMLADPASKLSGALSMALTYTNRMLTLDQSITTTTASAI 177
Query: 52 -----------------------RSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRS 88
S + RIL + + D +Y+ IMN IF+AQ+
Sbjct: 178 ASTTSSSASAASSTATGAGGASSGSSSTSMKSRILVVSANDDNDVKYIPIMNTIFAAQKM 237
Query: 89 MVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 147
+ ID LG +NS++LQQ S T G++ G+ Q L T + + R F+ LP
Sbjct: 238 KLSIDIIKLGDEENSSYLQQPSDATNGIYLHLTDPRGIIQVLSTAYFIEPSLRPFIILPT 297
Query: 148 PVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQAQTQ--SDEPSA 203
V++RASCF +D+GY+CSVCL I KKC TC S F + + EP
Sbjct: 298 NSNVNYRASCFITGKAVDVGYVCSVCLCIMSIIPDSKKCPTCNSQFDERIIHQLTKEPEV 357
Query: 204 TNRKRK 209
+KRK
Sbjct: 358 VKKKRK 363
>gi|346972888|gb|EGY16340.1| RNA polymerase II transcription factor B subunit 4 [Verticillium
dahliae VdLs.17]
Length = 398
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 74/213 (34%), Positives = 101/213 (47%), Gaps = 45/213 (21%)
Query: 20 EQLGKQEPEGRIACSL--LSGSLSMALCYIQRV-------FRSGLLHP--------QPRI 62
++L + E + C+ LSG+L++AL +I +V S P I
Sbjct: 128 KKLLDETTEADLDCTTTQLSGALTLALAHINKVNLSYTASLSSDTAKPMTTAAPGLSSHI 187
Query: 63 LCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG---------------------AQN 101
L + S P QY++ MNA+F+A + +PID L A +
Sbjct: 188 LVVSVSDSDPSQYISTMNAVFAAAHASIPIDVLSLSPATAKQTTATQSTDPNDPHKPASS 247
Query: 102 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFG---TDLHSRNFLQLPKPVGVDFRASCF 158
S+FLQQAS+IT G +P GL YL FG D +R L P VDFRA+CF
Sbjct: 248 SSFLQQASHITNGTFLEPPSPLGLLAYL--TFGLAVADTRARAALVPPTHHTVDFRAACF 305
Query: 159 CHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
CH ID G++CSVCLSI+C C TCG+
Sbjct: 306 CHGRVIDTGFVCSVCLSIFCSVPEGADCLTCGT 338
>gi|46138695|ref|XP_391038.1| hypothetical protein FG10862.1 [Gibberella zeae PH-1]
gi|408390605|gb|EKJ69997.1| hypothetical protein FPSE_09842 [Fusarium pseudograminearum CS3096]
Length = 355
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/174 (35%), Positives = 92/174 (52%), Gaps = 21/174 (12%)
Query: 36 LSGSLSMALCYIQRVFRS-------------GLLHPQP-----RILCLQGSPDGPEQYVA 77
+SG+L++ALC+I + ++ G P RIL + S P QY+
Sbjct: 141 ISGALTLALCHINKAAQALCSPTANLEDSHKGSSSTSPPTVRGRILVISVSDSEPSQYIP 200
Query: 78 IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
MNA+F+A + V ID+ L + + FL+QA Y TGG + GL YL+ +D
Sbjct: 201 TMNAVFAAAHTQVAIDTLSLTGEPT-FLEQACYNTGGTYLAATHPQGLLNYLMFGLMSDT 259
Query: 138 HSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
+R L VDFRA+CFCH +D G++CS+CLSI+C+ +C TCG+
Sbjct: 260 EAREALINSVHDTVDFRAACFCHGRVVDTGFVCSICLSIFCETPENSECFTCGT 313
>gi|302413233|ref|XP_003004449.1| RNA polymerase II transcription factor B subunit 4 [Verticillium
albo-atrum VaMs.102]
gi|261357025|gb|EEY19453.1| RNA polymerase II transcription factor B subunit 4 [Verticillium
albo-atrum VaMs.102]
Length = 384
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 74/213 (34%), Positives = 101/213 (47%), Gaps = 45/213 (21%)
Query: 20 EQLGKQEPEGRIACSL--LSGSLSMALCYIQRV-------FRSGLLHP--------QPRI 62
++L + E + C+ LSG+L++AL +I +V S P I
Sbjct: 128 KKLLDETTEADLDCTTTQLSGALTLALAHINKVNLSYTASLSSDTAKPMTTAAPGLSSHI 187
Query: 63 LCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG---------------------AQN 101
L + S P QY++ MNA+F+A + +PID L A +
Sbjct: 188 LVVSVSDSDPSQYISTMNAVFAAAHASIPIDVLSLSPATAKQTAAAQTTDPNDPHKSASS 247
Query: 102 SAFLQQASYITGGVHHKPQQLDGLFQYLLTIFG---TDLHSRNFLQLPKPVGVDFRASCF 158
S+FLQQAS+IT G +P GL YL FG D +R L P VDFRA+CF
Sbjct: 248 SSFLQQASHITNGTFLEPPSPLGLLAYL--TFGLAVADTRARAALVPPTHHTVDFRAACF 305
Query: 159 CHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
CH ID G++CSVCLSI+C C TCG+
Sbjct: 306 CHGRVIDTGFVCSVCLSIFCSVPEGADCLTCGT 338
>gi|294658697|ref|XP_461035.2| DEHA2F15510p [Debaryomyces hansenii CBS767]
gi|218511849|sp|Q6BL86.2|TFB4_DEBHA RecName: Full=RNA polymerase II transcription factor B subunit 4;
AltName: Full=RNA polymerase II transcription factor B
34 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p34 subunit
gi|202953319|emb|CAG89405.2| DEHA2F15510p [Debaryomyces hansenii CBS767]
Length = 387
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 57/143 (39%), Positives = 83/143 (58%), Gaps = 2/143 (1%)
Query: 53 SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
+ L + RIL + + D +Y+ IMN F+AQ+ VPID LG ++S++LQQAS T
Sbjct: 210 TSLTSMRSRILIVSSNDDNDIKYIPIMNTTFAAQKMKVPIDVAKLGERDSSYLQQASDAT 269
Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSV 172
GV+ +GL Q L T F + R+ + LP V++RASCF ++++G++CSV
Sbjct: 270 NGVYLHIANPEGLIQTLSTAFFIEPSIRSLIILPTNSNVNYRASCFITGKSVNLGFVCSV 329
Query: 173 CLSIYCKHLK--KCSTCGSVFGQ 193
CL I K KC TCGS F +
Sbjct: 330 CLCIMSIIPKEGKCPTCGSKFDE 352
>gi|448102380|ref|XP_004199788.1| Piso0_002332 [Millerozyma farinosa CBS 7064]
gi|359381210|emb|CCE81669.1| Piso0_002332 [Millerozyma farinosa CBS 7064]
Length = 376
Score = 106 bits (264), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 59/157 (37%), Positives = 91/157 (57%), Gaps = 5/157 (3%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RIL + + D +Y+ IMN++F+AQ+ +PID LG+ +S++LQQAS T G++ + +
Sbjct: 213 RILVISANDDDDVKYIPIMNSMFAAQKMRLPIDVAKLGSVSSSYLQQASDATNGIYLQVE 272
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YC 178
+GL Q L T F + R + LP V +RASCF ++D+GY+CSVCL I
Sbjct: 273 NPEGLIQTLCTAFFIESSIRPLIILPSNSNVSYRASCFITGKSVDLGYVCSVCLCIMSII 332
Query: 179 KHLKKCSTCGSVFGQA---QTQSDEPSATNRKRKTTD 212
KC TC S F ++ Q Q+ +T R++ T+
Sbjct: 333 PTDAKCPTCNSHFDKSILNQLQNSVVLSTKRRKIDTN 369
>gi|448098480|ref|XP_004198937.1| Piso0_002332 [Millerozyma farinosa CBS 7064]
gi|359380359|emb|CCE82600.1| Piso0_002332 [Millerozyma farinosa CBS 7064]
Length = 376
Score = 105 bits (263), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 58/153 (37%), Positives = 90/153 (58%), Gaps = 5/153 (3%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RIL + + D +Y+ IMN++F+AQ+ +PID LG+ +S++LQQAS T G++ + +
Sbjct: 213 RILVISANDDDDVKYIPIMNSMFAAQKMRLPIDVAKLGSISSSYLQQASDATNGIYLQVE 272
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YC 178
+ +GL Q L T F + R + LP V +RASCF ++D+GY+CSVCL I
Sbjct: 273 KPEGLIQTLCTAFFIESSIRPLIILPSNSNVSYRASCFITGKSVDLGYVCSVCLCIMSII 332
Query: 179 KHLKKCSTCGSVFGQA---QTQSDEPSATNRKR 208
KC TC S F ++ Q Q+ +T R++
Sbjct: 333 PTDAKCPTCNSHFDKSILNQLQNSVVLSTKRRK 365
>gi|255727412|ref|XP_002548632.1| hypothetical protein CTRG_02929 [Candida tropicalis MYA-3404]
gi|240134556|gb|EER34111.1| hypothetical protein CTRG_02929 [Candida tropicalis MYA-3404]
Length = 369
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 88/158 (55%), Gaps = 4/158 (2%)
Query: 59 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
+ R+L + + D +Y+ +MN IF+AQ+ V ID LG +NS++LQQAS T GV+
Sbjct: 202 KSRVLIVSANDDDDIKYIPLMNCIFAAQKMKVSIDVAKLGHKNSSYLQQASDATRGVYLH 261
Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
+ G+ Q L T F + + R ++ LP V++RASCF ++D+GY+CSVCL I
Sbjct: 262 IEDPKGIIQVLSTAFFIEPNLRPYIILPTNSNVNYRASCFLTGKSVDLGYVCSVCLCIMS 321
Query: 179 K--HLKKCSTCGSVFGQAQTQS--DEPSATNRKRKTTD 212
K KC C S F + EP ++K++ D
Sbjct: 322 KLPDNGKCPACDSAFDKKIINELIREPQVISKKKRKLD 359
>gi|406602443|emb|CCH45984.1| RNA polymerase II transcription factor B subunit 4 [Wickerhamomyces
ciferrii]
Length = 361
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/148 (41%), Positives = 81/148 (54%), Gaps = 4/148 (2%)
Query: 46 YIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFL 105
YI ++ + L + +IL + S D +Y+ IMN IF+AQ+ V ID C LG N+ FL
Sbjct: 160 YINKIQSNDSLM-KAKILIVNISQDEHLKYIPIMNCIFAAQKMKVSIDVCQLGL-NATFL 217
Query: 106 QQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTID 165
QQAS T GV+ + +DGL QY T D +N L P +DFRASCF +D
Sbjct: 218 QQASDATNGVYLHIENMDGLIQYFTTALFIDPSIKNILTKPNKGDIDFRASCFLTGKIVD 277
Query: 166 MGYICSVCLSI--YCKHLKKCSTCGSVF 191
+G++CSVCL I KC C S F
Sbjct: 278 IGFVCSVCLCILSLIPEDNKCPACDSEF 305
>gi|241956274|ref|XP_002420857.1| RNA polymerase II transcription factor B subunit 4, putative; TFIIH
subunit TFB4, putative [Candida dubliniensis CD36]
gi|223644200|emb|CAX41010.1| RNA polymerase II transcription factor B subunit 4, putative
[Candida dubliniensis CD36]
Length = 363
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 88/155 (56%), Gaps = 4/155 (2%)
Query: 59 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
+ RIL + + D +Y+ +MN IF+AQ+ V ID LG NS++LQQAS T GV+
Sbjct: 200 KSRILIVSANDDDDLRYIPLMNCIFAAQKMKVSIDVAKLGHNNSSYLQQASDATKGVYLH 259
Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
+ G+ Q L T F + + R ++ LP V++RASCF ++D+GY+CSVCL I
Sbjct: 260 IEDPKGIIQVLSTAFFIEPNLRPYIILPTNSNVNYRASCFITGKSVDLGYVCSVCLCIMS 319
Query: 179 K--HLKKCSTCGSVFGQA--QTQSDEPSATNRKRK 209
+ KC C S F + Q + EP ++K++
Sbjct: 320 QLPQSGKCPACDSEFDKKIIQELNREPQVISKKKR 354
>gi|50547937|ref|XP_501438.1| YALI0C04367p [Yarrowia lipolytica]
gi|62901295|sp|Q6CD24.1|TFB4_YARLI RecName: Full=RNA polymerase II transcription factor B subunit 4;
AltName: Full=RNA polymerase II transcription factor B
34 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p34 subunit
gi|49647305|emb|CAG81737.1| YALI0C04367p [Yarrowia lipolytica CLIB122]
Length = 340
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/169 (37%), Positives = 94/169 (55%), Gaps = 9/169 (5%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQ-GSPDGPEQYVAIMNAIFSAQRSMVPI 92
S +SG+LS+AL +I + + RIL L G + QY+ MN IF+AQ+ +P+
Sbjct: 151 SAISGALSLALGFINKHVFVDESRMRARILLLTVGHKNETIQYIPTMNCIFAAQKLKIPV 210
Query: 93 DSCYLG-AQNSAFLQQASYITGGVHHKPQQLD-----GLFQYLLTIFGTDLHSRNFLQLP 146
D C LG + FLQQA T G++ + + GL QYLL+ F +D R + LP
Sbjct: 211 DVCKLGPGSDQVFLQQACDSTHGIYMDISEKNSKTPKGLVQYLLSGFISDPSLRPHIVLP 270
Query: 147 KPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGSVFGQ 193
VDFRA+CF K +D+GY+CSVCL I + ++C TC + + +
Sbjct: 271 TQSNVDFRAACFLTKQVVDIGYVCSVCLCIMSQIPSNRRCPTCDTTYSE 319
>gi|402084346|gb|EJT79364.1| RNA polymerase II transcription factor B subunit 4 [Gaeumannomyces
graminis var. tritici R3-111a-1]
Length = 382
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/122 (44%), Positives = 74/122 (60%), Gaps = 4/122 (3%)
Query: 74 QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIF 133
QY+ MNA+F+A + VP+D L N++FLQQ SYITGG ++ GL YL+T F
Sbjct: 221 QYIPTMNAVFAASHASVPVDVVALRG-NASFLQQGSYITGGNFIHAKEPRGLLTYLMTGF 279
Query: 134 GTDLHS-RNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL--KKCSTCGSV 190
+ + L P VDFRA+CFCH+ +D G++CSVCLSI+C+ +C TCGS
Sbjct: 280 PVGGGAVSDMLIGPGTESVDFRAACFCHRKALDTGFVCSVCLSIFCEAPPENECLTCGSA 339
Query: 191 FG 192
Sbjct: 340 LA 341
>gi|159490694|ref|XP_001703308.1| TFIIH 34kDa subunit [Chlamydomonas reinhardtii]
gi|158280232|gb|EDP05990.1| TFIIH 34kDa subunit [Chlamydomonas reinhardtii]
Length = 169
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/102 (47%), Positives = 71/102 (69%)
Query: 85 AQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQ 144
AQR+ V +D+ + Q+S+F+QQA+++TGG++ +P + GL LL F D +R L
Sbjct: 39 AQRAQVVLDAVVVAPQDSSFMQQAAHLTGGLYFRPHRGRGLLGVLLNHFVCDTATRKQLD 98
Query: 145 LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCST 186
+ +GVDFRASCFCHK I+ GY+CSVCLSI+C+ + C+T
Sbjct: 99 EAQALGVDFRASCFCHKYVIETGYVCSVCLSIFCQPSRSCTT 140
>gi|281211236|gb|EFA85402.1| general transcription factor IIH [Polysphondylium pallidum PN500]
Length = 276
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 77/124 (62%), Gaps = 3/124 (2%)
Query: 20 EQLGK--QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVA 77
E L K +E + + S S ++S+ALCYI R+ + + RIL SPD QY+
Sbjct: 118 ENLDKRSEEQQQQDCTSSFSAAMSLALCYINRI-KKEFPSLRARILVFNLSPDVSTQYIP 176
Query: 78 IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
+MN IFSAQ+ +P+DSC L +S FLQQAS++T G++ KPQ+ + L QYLL+ F D
Sbjct: 177 VMNCIFSAQKQSIPVDSCVLSTTDSTFLQQASHLTNGIYLKPQRQENLSQYLLSTFLVDS 236
Query: 138 HSRN 141
SRN
Sbjct: 237 FSRN 240
>gi|68489766|ref|XP_711270.1| hypothetical protein CaO19.11671 [Candida albicans SC5314]
gi|68489813|ref|XP_711246.1| hypothetical protein CaO19.4194 [Candida albicans SC5314]
gi|46432534|gb|EAK92011.1| hypothetical protein CaO19.4194 [Candida albicans SC5314]
gi|46432559|gb|EAK92035.1| hypothetical protein CaO19.11671 [Candida albicans SC5314]
Length = 363
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 88/155 (56%), Gaps = 4/155 (2%)
Query: 59 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
+ R+L + + D +Y+ +MN IF+AQ+ V ID LG NS++LQQAS T GV+
Sbjct: 200 KSRVLIVSANDDDDLRYIPLMNCIFAAQKMKVSIDVAKLGHNNSSYLQQASDATKGVYLH 259
Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
+ G+ Q L T F + + R ++ LP V++RASCF ++D+GY+CSVCL I
Sbjct: 260 IEDPKGIIQVLSTAFFIEPNLRPYIILPTNSNVNYRASCFITGKSVDLGYVCSVCLCIMS 319
Query: 179 K--HLKKCSTCGSVFGQA--QTQSDEPSATNRKRK 209
+ KC C S F + Q + EP ++K++
Sbjct: 320 QLPQSGKCPACDSEFDKKIIQDLNREPQVISKKKR 354
>gi|146416891|ref|XP_001484415.1| hypothetical protein PGUG_03796 [Meyerozyma guilliermondii ATCC
6260]
Length = 378
Score = 103 bits (257), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 70/217 (32%), Positives = 105/217 (48%), Gaps = 39/217 (17%)
Query: 34 SLLSGSLSMALCYIQRVFR---------------------------------SGLLHPQP 60
S +SG+LS+AL Y R+ S L+ +
Sbjct: 155 STVSGALSLALTYTHRMLTLDESISTTTASAISTSTNVASNSSAAGAGSTGASNLISMKS 214
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RIL + + +Y+ IMN+IF+AQ+ V ID LG+++ ++LQQA+ T GV+ Q
Sbjct: 215 RILVVSPDDNDDIRYIPIMNSIFAAQKMKVSIDVVKLGSKDVSYLQQAADATNGVYLHVQ 274
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY--C 178
+GL Q L T + + R + LP V++RASCF +ID+G++CSVCL I
Sbjct: 275 NPEGLIQVLSTAYFIEPSLRPLVVLPTNSNVNYRASCFVTGKSIDLGFVCSVCLCIMSII 334
Query: 179 KHLKKCSTCGSVFGQ---AQTQSDEPSATNRKRKTTD 212
L KC TC S F +Q + P+ RK++ D
Sbjct: 335 PDLGKCPTCQSEFDPNILSQLRR-SPAVLPRKKRKVD 370
>gi|448533948|ref|XP_003870733.1| TFIIH complex subunit [Candida orthopsilosis Co 90-125]
gi|380355088|emb|CCG24605.1| TFIIH complex subunit [Candida orthopsilosis]
Length = 408
Score = 102 bits (255), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/162 (37%), Positives = 91/162 (56%), Gaps = 8/162 (4%)
Query: 59 QPRILCLQGSPDGPE--QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
+ RIL + + DG + Y+ +MN IF+AQ+ V ID LG +NS++LQQAS T GV+
Sbjct: 242 KSRILIVSAN-DGDDSINYIPLMNCIFTAQKLKVSIDVAKLGFKNSSYLQQASDATNGVY 300
Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 176
+ G+ Q L T F + + R ++ LP V+++ASCF K +D+G++CSVCL I
Sbjct: 301 LHIEDPQGIIQVLTTAFFIEPNLRPYVILPTNTQVNYKASCFITKRPVDVGFVCSVCLCI 360
Query: 177 YCK--HLKKCSTCGSVFGQA---QTQSDEPSATNRKRKTTDA 213
K +KC C S F + + D T +KRK D+
Sbjct: 361 MSKIPEDEKCPACSSEFDKRSIYELTKDPQVVTRKKRKLDDS 402
>gi|443923251|gb|ELU42522.1| Tfb4 domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 872
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/136 (41%), Positives = 75/136 (55%), Gaps = 8/136 (5%)
Query: 58 PQPRILCLQGSPDG--PEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGV 115
P P I+C+ D P + ++ A +VPID C + ++ FLQQASY+T G
Sbjct: 719 PHPDIVCVARCFDTIYPIHELHLLGA------KIVPIDVCRVFGPDTIFLQQASYLTRGC 772
Query: 116 HHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS 175
+ L Q L F RN ++LP VDFRASCFCHK +DMGY+CSVCLS
Sbjct: 773 YLNLDSGSSLLQSLTMCFLPSASLRNTIELPSQGKVDFRASCFCHKEVVDMGYVCSVCLS 832
Query: 176 IYCKHLKKCSTCGSVF 191
I+CK + C+TC F
Sbjct: 833 IFCKPVVVCATCKIKF 848
>gi|328352540|emb|CCA38939.1| RNA polymerase II transcription factor B subunit 4 [Komagataella
pastoris CBS 7435]
Length = 307
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 83/150 (55%), Gaps = 16/150 (10%)
Query: 25 QEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ-------PRILCLQGSPDGPEQYVA 77
QE +G IA G++SMAL YI R+ + P+ ++L + S D +QYV
Sbjct: 132 QELKGSIA-----GAMSMALSYINRLTN---IEPESTGNTLRAKMLVISISDDSTQQYVP 183
Query: 78 IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDL 137
MN IFSAQ+ V ID C +G +S+FLQQAS +T GV+ + + GL QYL T D
Sbjct: 184 FMNCIFSAQKMKVSIDVCKMGP-DSSFLQQASDVTNGVYMLIKNVHGLIQYLTTALFIDP 242
Query: 138 HSRNFLQLPKPVGVDFRASCFCHKNTIDMG 167
R + LP +DFRASCF ID+G
Sbjct: 243 SLRPIMVLPTNSDLDFRASCFVTNKVIDIG 272
>gi|150865476|ref|XP_001384710.2| hypothetical protein PICST_83789 [Scheffersomyces stipitis CBS
6054]
gi|149386732|gb|ABN66681.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 376
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 68/221 (30%), Positives = 104/221 (47%), Gaps = 42/221 (19%)
Query: 34 SLLSGSLSMALCYIQRVFR------------------------SG--------------L 55
S LSG+LSMAL Y R+ SG +
Sbjct: 150 STLSGALSMALTYTNRMLTLDQSISTTTASAINSTTSMGAGSGSGNTATNSSTSNPSNSI 209
Query: 56 LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGV 115
+ RIL + + + +Y+ +MN+IF+AQ+ ID LG ++S++LQQA+ T G+
Sbjct: 210 TSMKSRILIVTANDEDDVKYIPVMNSIFAAQKMRTSIDIAKLGFEDSSYLQQAADATNGI 269
Query: 116 HHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS 175
+ G+ Q L + F + R F+ LP V++RASCF ++D+G++CSVCL
Sbjct: 270 YFHVHDPRGIVQTLTSAFFIEPSIRPFIILPTNSNVNYRASCFVTGKSVDIGFVCSVCLC 329
Query: 176 IYCK--HLKKCSTCGSVFGQAQTQ--SDEPSATNRKRKTTD 212
I K KC C SVF + S PS ++K++ D
Sbjct: 330 IMSKIPPSGKCPACESVFDEKIIAQLSKGPSVLSKKKRKID 370
>gi|190347434|gb|EDK39698.2| hypothetical protein PGUG_03796 [Meyerozyma guilliermondii ATCC
6260]
Length = 378
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 69/217 (31%), Positives = 104/217 (47%), Gaps = 39/217 (17%)
Query: 34 SLLSGSLSMALCYIQRVFR---------------------------------SGLLHPQP 60
S +SG+LS+AL Y R+ S L+ +
Sbjct: 155 STVSGALSLALTYTHRMLTLDESISTTTASAISTSTNVASNSSAAGAGSTGASNLISMKS 214
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RIL + + +Y+ IMN+IF+AQ+ V ID LG+++ ++LQQA+ T GV+ Q
Sbjct: 215 RILVVSPDDNDDIRYIPIMNSIFAAQKMKVSIDVVKLGSKDVSYLQQAADATNGVYLHVQ 274
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YC 178
+GL Q L T + + R + LP V++RASCF +ID+G++CSVCL I
Sbjct: 275 NPEGLIQVLSTAYFIEPSLRPLVVLPTNSNVNYRASCFVTGKSIDLGFVCSVCLCIMSII 334
Query: 179 KHLKKCSTCGSVFGQ---AQTQSDEPSATNRKRKTTD 212
KC TC S F +Q + P+ RK++ D
Sbjct: 335 PDSGKCPTCQSEFDPNILSQLRR-SPAVLPRKKRKVD 370
>gi|171689210|ref|XP_001909545.1| hypothetical protein [Podospora anserina S mat+]
gi|170944567|emb|CAP70678.1| unnamed protein product [Podospora anserina S mat+]
Length = 396
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/172 (35%), Positives = 87/172 (50%), Gaps = 36/172 (20%)
Query: 53 SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
+GL+ RIL + S QY+ MNA+F+A S + ID+ L ++ FL+QAS+IT
Sbjct: 184 TGLVGFHGRILVISVSDSAASQYIPTMNAVFAASMSRIAIDTLALRG-SATFLEQASFIT 242
Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHS-----------------------RNFLQLPKPV 149
G + G+ QYL+ FG S R +L KPV
Sbjct: 243 QGTFIQAADPQGILQYLMFGFGVGSASSGLSAAQNDGSGPLMGKSKTGKQREGDELRKPV 302
Query: 150 G----------VDFRASCFCHKNTIDMGYICSVCLSIYCKHL--KKCSTCGS 189
G VDFRA+CFCH+N +D G++CS+CLSI+C+ +C TCG+
Sbjct: 303 GECLFTPAADSVDFRAACFCHRNVVDTGFVCSICLSIFCEPPPGDECLTCGN 354
>gi|320594208|gb|EFX06611.1| transcription factor tfiih subunit [Grosmannia clavigera kw1407]
Length = 429
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 60/139 (43%), Positives = 76/139 (54%), Gaps = 13/139 (9%)
Query: 52 RSGLLHP------QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL-GAQNSAF 104
RSG P RIL + S P QY+A MNA+F+A + V ID L GA + F
Sbjct: 226 RSGKTDPVDDVGLHARILVISVSDSSPAQYIATMNAVFAAAHAGVAIDVLALRGA--ATF 283
Query: 105 LQQASYITGG--VHHKPQQLDGLFQYLLTIF--GTDLHSRNFLQLPKPVGVDFRASCFCH 160
LQQA+YIT G V Q GL YL+ F + L P VDFRA+CFCH
Sbjct: 284 LQQAAYITRGTFVRVGADQKHGLLAYLMLAFNAASSHDVAQHLVAPSADAVDFRAACFCH 343
Query: 161 KNTIDMGYICSVCLSIYCK 179
+ +D G++CSVCLSI+C+
Sbjct: 344 RRVVDTGFVCSVCLSIFCE 362
>gi|444315902|ref|XP_004178608.1| hypothetical protein TBLA_0B02470 [Tetrapisispora blattae CBS 6284]
gi|387511648|emb|CCH59089.1| hypothetical protein TBLA_0B02470 [Tetrapisispora blattae CBS 6284]
Length = 340
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 75/201 (37%), Positives = 98/201 (48%), Gaps = 30/201 (14%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRV---------FRSGLLH 57
+L Q +F++K Q S +SG++S AL Y+ R+ F+S LL
Sbjct: 133 SLFQEESKFIDKKSQK-----------STISGAMSCALTYVNRITKQFNDASIFKSRLL- 180
Query: 58 PQPRILCLQG-SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA-QNSAFLQQASYITGGV 115
I C G S D QY+ IMN IFSA + PID +G + S FLQQ + T GV
Sbjct: 181 ---VITCGSGRSKDEVFQYIPIMNCIFSATKMKCPIDVVKIGGHEKSTFLQQTTDATNGV 237
Query: 116 HHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC-- 173
+ + GL QYL T D R+ + P VDFR SCF + +GYICSVC
Sbjct: 238 YLYVENTKGLIQYLSTAMFIDSSLRSIIVKPNHGVVDFRTSCFLTGRVVAVGYICSVCLC 297
Query: 174 -LSIYCKHLKKCSTCGSVFGQ 193
LSI K+ KC C S F +
Sbjct: 298 VLSIVPKN-NKCPACDSKFDE 317
>gi|367035398|ref|XP_003666981.1| hypothetical protein MYCTH_2312237 [Myceliophthora thermophila ATCC
42464]
gi|347014254|gb|AEO61736.1| hypothetical protein MYCTH_2312237 [Myceliophthora thermophila ATCC
42464]
Length = 433
Score = 99.8 bits (247), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 84/183 (45%), Gaps = 46/183 (25%)
Query: 54 GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 113
GL RIL + S P QY+ MNA+F+A + V ID+ L ++ FL+QAS+IT
Sbjct: 202 GLAGLHARILIISVSDSAPAQYIPTMNAVFAAAHARVAIDTLALRG-SATFLEQASFITR 260
Query: 114 GVHHKPQQLDGLFQYLLTIFGTD------------------------------------- 136
G + + GL QYL+ FG+
Sbjct: 261 GTFVRAAEPRGLLQYLMFGFGSGSAPSHPSGGADPGKGPAAPKTTAASVPGGGPGGRRKT 320
Query: 137 ------LHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCG 188
+ L P VDFRA+CFCH+N ID GY+CSVCLSI+C+ +C TCG
Sbjct: 321 GRLGLGASVADLLLTPSADAVDFRAACFCHRNVIDTGYVCSVCLSIFCEVPPGGECLTCG 380
Query: 189 SVF 191
+V
Sbjct: 381 TVL 383
>gi|255715715|ref|XP_002554139.1| KLTH0E15180p [Lachancea thermotolerans]
gi|238935521|emb|CAR23702.1| KLTH0E15180p [Lachancea thermotolerans CBS 6340]
Length = 337
Score = 99.8 bits (247), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 69/206 (33%), Positives = 97/206 (47%), Gaps = 15/206 (7%)
Query: 12 LEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS-GLLHPQPRILCL----Q 66
LEE ++ GK+E S LSG++S YI R+ + + R++ +
Sbjct: 123 LEELYKLLQEEGKREQPSGFQKSTLSGAMSAGFTYINRIIKEQSSASLKARLMVVTCGSS 182
Query: 67 GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGL 125
G D QY+ IMN IFSA + P+D +G +Q S FLQQA+ T GV+ +GL
Sbjct: 183 GGKDEVFQYIPIMNCIFSATKLKCPVDVVKIGGSQESTFLQQATDATNGVYLHVPSTEGL 242
Query: 126 FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKK 183
QYL T D R + P VDFR SC+ + +G++CSVCL + +
Sbjct: 243 IQYLTTAMFIDPSLRPIIVKPNQGSVDFRTSCYLTGKVVAVGFVCSVCLCVLSVIPPGNR 302
Query: 184 CSTCGSVFGQAQTQSDEPSATNRKRK 209
C C S F DE T K+K
Sbjct: 303 CPACDSEF-------DEHVTTKLKKK 321
>gi|149236047|ref|XP_001523901.1| hypothetical protein LELG_04714 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146452277|gb|EDK46533.1| hypothetical protein LELG_04714 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 444
Score = 99.4 bits (246), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 61/158 (38%), Positives = 85/158 (53%), Gaps = 6/158 (3%)
Query: 61 RILCLQGSP-DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 119
RIL + S D Y+ +MN IF+AQ+ V ID LG +S++LQQAS T GV+ K
Sbjct: 259 RILIVTASDNDNHLNYIPLMNCIFTAQKLKVSIDVAKLGTTDSSYLQQASDATHGVYLKI 318
Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--Y 177
+ G+ Q L T F + + R + +P VD++ASCF K +D+GY+CSVCL I +
Sbjct: 319 EDPRGMIQVLSTAFFIEPNLRPHVIMPTDYQVDYKASCFLTKKPVDLGYVCSVCLCIMSF 378
Query: 178 CKHLKKCSTCGSVFGQAQTQS---DEPSATNRKRKTTD 212
KC C S F Q Q D A +K++ D
Sbjct: 379 VPDDLKCPACSSEFDQGIVQELTRDPQVAVVKKKRKLD 416
>gi|380484278|emb|CCF40099.1| RNA polymerase II transcription factor B subunit 4 [Colletotrichum
higginsianum]
Length = 157
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/113 (43%), Positives = 69/113 (61%), Gaps = 3/113 (2%)
Query: 79 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
MNA+F+A S +PID+ L + ++ FLQQASYIT G + GL YL+ + D
Sbjct: 1 MNAVFAAAHSQIPIDTLAL-SGDATFLQQASYITDGTFMQAASPRGLLSYLMFAYAADAE 59
Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
+R+ L P V FRA+CFCH +D G++CS+CLSI+C +C TCG+
Sbjct: 60 ARSSLIPPTHHTVXFRAACFCHGRVVDTGFVCSICLSIFCDVPEGSECLTCGT 112
>gi|365762534|gb|EHN04068.1| Tfb4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 338
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 96/192 (50%), Gaps = 8/192 (4%)
Query: 10 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
+ L E + K +L K++ E S L+G++S L Y+ R+ + + + R+L L
Sbjct: 123 ETLVEEIYKLFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182
Query: 67 --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
S D QY+ IMN IFSA + PID +G ++ S FLQQ + T GV+ + +
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTE 242
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
GL QYL T D R + P VDFR SC+ + +G+ICSVCL +
Sbjct: 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAIGFICSVCLCVLSIIPPG 302
Query: 182 KKCSTCGSVFGQ 193
KC C S F +
Sbjct: 303 NKCPACDSQFDE 314
>gi|151942841|gb|EDN61187.1| transcription initiation factor TFIIH subunit [Saccharomyces
cerevisiae YJM789]
gi|259150209|emb|CAY87012.1| Tfb4p [Saccharomyces cerevisiae EC1118]
gi|349581866|dbj|GAA27023.1| K7_Tfb4p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 338
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 96/192 (50%), Gaps = 8/192 (4%)
Query: 10 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
+ L E + K +L K++ E S L+G++S L Y+ R+ + + + R+L L
Sbjct: 123 ETLVEEIYKLFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182
Query: 67 --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
S D QY+ IMN IFSA + PID +G ++ S FLQQ + T GV+ + +
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTE 242
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
GL QYL T D R + P VDFR SC+ + +G+ICSVCL +
Sbjct: 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAIGFICSVCLCVLSIIPPG 302
Query: 182 KKCSTCGSVFGQ 193
KC C S F +
Sbjct: 303 NKCPACDSQFDE 314
>gi|428673522|ref|NP_001258795.1| general transcription factor IIH subunit 3 isoform b [Homo sapiens]
Length = 265
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/170 (35%), Positives = 84/170 (49%), Gaps = 42/170 (24%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNA 81
+ K + +G+ +LL+GSL+ ALCYI R+ + D E I+
Sbjct: 119 MTKSDIKGQHTETLLAGSLAKALCYIHRMNKE--------------VKDNQEMKSRIL-- 162
Query: 82 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN 141
A ITGG++ K Q+ L QYLL +F D R+
Sbjct: 163 --------------------------ACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRS 196
Query: 142 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 197 QLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 246
>gi|6325313|ref|NP_015381.1| Tfb4p [Saccharomyces cerevisiae S288c]
gi|62901135|sp|Q12004.1|TFB4_YEAST RecName: Full=RNA polymerase II transcription factor B subunit 4;
AltName: Full=General transcription and DNA repair
factor IIH subunit TFB4; Short=TFIIH subunit TFB4;
AltName: Full=RNA polymerase II transcription factor B
34 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p34 subunit
gi|805037|emb|CAA89174.1| unknown [Saccharomyces cerevisiae]
gi|1314125|emb|CAA95001.1| unknown [Saccharomyces cerevisiae]
gi|190407994|gb|EDV11259.1| RNA polymerase II transcription factor B subunit 4 [Saccharomyces
cerevisiae RM11-1a]
gi|256268932|gb|EEU04279.1| Tfb4p [Saccharomyces cerevisiae JAY291]
gi|285815586|tpg|DAA11478.1| TPA: Tfb4p [Saccharomyces cerevisiae S288c]
gi|323350201|gb|EGA84348.1| Tfb4p [Saccharomyces cerevisiae VL3]
gi|392296067|gb|EIW07170.1| Tfb4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 338
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 96/192 (50%), Gaps = 8/192 (4%)
Query: 10 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
+ L E + K +L K++ E S L+G++S L Y+ R+ + + + R+L L
Sbjct: 123 ETLVEEIYKLFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182
Query: 67 --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
S D QY+ IMN IFSA + PID +G ++ S FLQQ + T GV+ + +
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTE 242
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
GL QYL T D R + P VDFR SC+ + +G+ICSVCL +
Sbjct: 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAVGFICSVCLCVLSIIPPG 302
Query: 182 KKCSTCGSVFGQ 193
KC C S F +
Sbjct: 303 NKCPACDSQFDE 314
>gi|441630267|ref|XP_004089522.1| PREDICTED: general transcription factor IIH subunit 3 isoform 2
[Nomascus leucogenys]
Length = 265
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 63/185 (34%), Positives = 89/185 (48%), Gaps = 48/185 (25%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQ 66
+++ +++ M K + G+ +LL+GSL+ ALCYI R
Sbjct: 110 VIVEEIKDLMTKGDIKGQHTE------TLLAGSLAKALCYIHR----------------- 146
Query: 67 GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLF 126
MN + M S L A ITGG++ K Q+ L
Sbjct: 147 ------------MNKEIKDNQEM-----------KSRIL--ACDITGGLYLKVPQMPSLL 181
Query: 127 QYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCST 186
QYLL +F D R+ L LP PV VD+RA+CFCH+N I++GY+CSVCLSI+C C+T
Sbjct: 182 QYLLWVFLPDQDQRSQLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTT 241
Query: 187 CGSVF 191
C + F
Sbjct: 242 CETAF 246
>gi|323331310|gb|EGA72728.1| Tfb4p [Saccharomyces cerevisiae AWRI796]
Length = 409
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 96/192 (50%), Gaps = 8/192 (4%)
Query: 10 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
+ L E + K +L K++ E S L+G++S L Y+ R+ + + + R+L L
Sbjct: 194 ETLVEEIYKLFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 253
Query: 67 --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
S D QY+ IMN IFSA + PID +G ++ S FLQQ + T GV+ + +
Sbjct: 254 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTE 313
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
GL QYL T D R + P VDFR SC+ + +G+ICSVCL +
Sbjct: 314 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAVGFICSVCLCVLSIIPPG 373
Query: 182 KKCSTCGSVFGQ 193
KC C S F +
Sbjct: 374 NKCPACDSQFDE 385
>gi|323335143|gb|EGA76433.1| Tfb4p [Saccharomyces cerevisiae Vin13]
Length = 338
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 95/192 (49%), Gaps = 8/192 (4%)
Query: 10 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
+ L E + K +L K++ E S L+G++S L Y+ R+ + + + R+L L
Sbjct: 123 ETLVEEIYKLFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182
Query: 67 --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
S D QY+ IMN IFSA + PID +G ++ S FLQQ + T GV+ + +
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTE 242
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
GL QYL T D R + P VDFR SC+ + G+ICSVCL +
Sbjct: 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAXGFICSVCLCVLSIIPPG 302
Query: 182 KKCSTCGSVFGQ 193
KC C S F +
Sbjct: 303 NKCPACDSQFDE 314
>gi|219113725|ref|XP_002186446.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209583296|gb|ACI65916.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 358
Score = 96.7 bits (239), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 77/137 (56%), Gaps = 9/137 (6%)
Query: 60 PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ----NSAFLQQASYITGGV 115
PRI +Q S D Y A MN F+A + + +D C+L A+ +SAFL+QA +TGGV
Sbjct: 206 PRIFLIQASDDRSRDYNAFMNCAFAAVKQSIVVDGCFLSAKGNHSSSAFLEQACDLTGGV 265
Query: 116 HHKPQ---QLDG-LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICS 171
P Q+ G L + L ++F D+ SR L LP VDFRA CF +D+ Y+C+
Sbjct: 266 FLSPSGTAQVGGALTEVLHSVFLADVPSRCLLNLPAINKVDFRARCFETAELVDLAYVCN 325
Query: 172 VCLSIYCKHLKK-CSTC 187
CLSI+ K K C TC
Sbjct: 326 QCLSIFRKQPKNYCPTC 342
>gi|323346291|gb|EGA80581.1| Tfb4p [Saccharomyces cerevisiae Lalvin QA23]
Length = 313
Score = 96.7 bits (239), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 94/190 (49%), Gaps = 8/190 (4%)
Query: 10 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
+ L E + K +L K++ E S L+G++S L Y+ R+ + + + R+L L
Sbjct: 123 ETLVEEIYKLFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182
Query: 67 --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
S D QY+ IMN IFSA + PID +G ++ S FLQQ + T GV+ + +
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVESTE 242
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
GL QYL T D R + P VDFR SC+ + G+ICSVCL +
Sbjct: 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAXGFICSVCLCVLSIIPPG 302
Query: 182 KKCSTCGSVF 191
KC C S F
Sbjct: 303 NKCPACDSQF 312
>gi|254580699|ref|XP_002496335.1| ZYRO0C16016p [Zygosaccharomyces rouxii]
gi|238939226|emb|CAR27402.1| ZYRO0C16016p [Zygosaccharomyces rouxii]
Length = 342
Score = 96.3 bits (238), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 66/198 (33%), Positives = 95/198 (47%), Gaps = 18/198 (9%)
Query: 14 EFMNKDEQL----------GKQEPEGRIACSLLSGSLSMALCYIQRVFRS-GLLHPQPRI 62
+F N DE L K++ + S L+G++S L Y+ RV + + + R+
Sbjct: 134 QFRNVDETLVEELYGLFQQEKEQADEVTQKSTLAGAMSAGLTYVNRVSKELETISFKSRL 193
Query: 63 LCLQ----GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHH 117
L + G D QY+ IMN IFSA + PID +G ++ S FLQQ + T GV+
Sbjct: 194 LVVTCGSGGGRDEIFQYIPIMNCIFSATKLKCPIDVVKIGGSKESTFLQQTTDATQGVYL 253
Query: 118 KPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
+ +GL QYL T D R + P VDFR SC+ + +GYICSVCL +
Sbjct: 254 HAESTEGLIQYLATAMFIDPSLRQIIVKPNQGSVDFRTSCYLTGKVVAIGYICSVCLCVL 313
Query: 178 CKHL--KKCSTCGSVFGQ 193
+C C S F +
Sbjct: 314 SVSPPGNRCPACDSEFDE 331
>gi|50303925|ref|XP_451910.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|62901298|sp|Q6CVX9.1|TFB4_KLULA RecName: Full=RNA polymerase II transcription factor B subunit 4;
AltName: Full=RNA polymerase II transcription factor B
34 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p34 subunit
gi|49641042|emb|CAH02303.1| KLLA0B08569p [Kluyveromyces lactis]
Length = 337
Score = 96.3 bits (238), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 71/196 (36%), Positives = 98/196 (50%), Gaps = 19/196 (9%)
Query: 15 FMNKDE-------QLGKQEPEGRIAC----SLLSGSLSMALCYIQRVFRSGL-LHPQPRI 62
F N DE +L K+E EG+ + S LSG++S AL Y+ R+ + + R+
Sbjct: 115 FRNVDEIVVEQWYRLFKEELEGQESKVSMKSSLSGAMSSALTYVNRILKENENTSLRSRL 174
Query: 63 LCLQ-GSPDGPE---QYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQASYITGGVHH 117
L + G+ G + QY+ IMN IFSA + ID +G S FLQQA+ T GV+
Sbjct: 175 LVITCGTSQGKDEIFQYIPIMNCIFSATKMKCSIDVVKIGGGIESTFLQQATDATSGVYL 234
Query: 118 KPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI- 176
+ GL QYL T D RN + P VDFR SCF + +G++CSVCL +
Sbjct: 235 HVENTRGLIQYLSTAMFIDPSLRNVIIKPNQGSVDFRTSCFLTGKVVAVGFVCSVCLCVL 294
Query: 177 -YCKHLKKCSTCGSVF 191
+KC C S F
Sbjct: 295 SVIPPGQKCPACDSPF 310
>gi|367012173|ref|XP_003680587.1| hypothetical protein TDEL_0C04870 [Torulaspora delbrueckii]
gi|359748246|emb|CCE91376.1| hypothetical protein TDEL_0C04870 [Torulaspora delbrueckii]
Length = 325
Score = 95.9 bits (237), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 70/198 (35%), Positives = 94/198 (47%), Gaps = 18/198 (9%)
Query: 14 EFMNKDEQLGK------QEPEGRI----ACSLLSGSLSMALCYIQRVFRS-GLLHPQPRI 62
+F N DE L + QE + +I S L+G +S L YI R+ R + + R+
Sbjct: 117 QFRNVDETLVEELYKLFQEEKNQIDKVTQKSTLAGGISAGLTYINRISRELATIALKSRL 176
Query: 63 LCLQ----GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA-QNSAFLQQASYITGGVHH 117
L + G D QY+ IMN IFSA + PID +G + S FLQQ + T GV+
Sbjct: 177 LVITCGSGGGRDEIFQYIPIMNCIFSATKLKCPIDVVKIGGHRESTFLQQTTDATNGVYL 236
Query: 118 KPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
+ GL QYL T D R + P VDFR SC+ + +GYICSVCL +
Sbjct: 237 HLESTQGLIQYLSTAMFIDPSLRPIIVKPNQGSVDFRTSCYLTGKVVAIGYICSVCLCVL 296
Query: 178 C--KHLKKCSTCGSVFGQ 193
KC C S F +
Sbjct: 297 SILPPGNKCPACDSEFDE 314
>gi|365982181|ref|XP_003667924.1| hypothetical protein NDAI_0A05260 [Naumovozyma dairenensis CBS 421]
gi|343766690|emb|CCD22681.1| hypothetical protein NDAI_0A05260 [Naumovozyma dairenensis CBS 421]
Length = 339
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 69/197 (35%), Positives = 97/197 (49%), Gaps = 15/197 (7%)
Query: 7 TLLQNLEEFMNKD-EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS-GLLHPQPRILC 64
TL++ L K+ +Q+ K + + S LSG++S L YI R+ R + + R++
Sbjct: 122 TLVEELYTLFEKERDQIDKFKQK-----STLSGAMSAGLTYINRLSRELEAISLKSRLMV 176
Query: 65 LQ-----GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHK 118
L S D QY+ IMN IFSA + PID +G ++ S FLQQ + T GV+
Sbjct: 177 LTCGGKGESKDEIFQYIPIMNCIFSANKIKCPIDVVKIGGSERSTFLQQTTDATNGVYLH 236
Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI-- 176
+ GL QYL T D R + P VDFR SC+ + +GYICSVCL +
Sbjct: 237 VESSKGLIQYLSTAMFIDPSLRQIIVKPNHGSVDFRTSCYLTGKVVAIGYICSVCLCVLS 296
Query: 177 YCKHLKKCSTCGSVFGQ 193
KC C S F +
Sbjct: 297 IIPPGNKCPACDSEFDE 313
>gi|45187550|ref|NP_983773.1| ADL323Cp [Ashbya gossypii ATCC 10895]
gi|62901344|sp|Q75B93.1|TFB4_ASHGO RecName: Full=RNA polymerase II transcription factor B subunit 4;
AltName: Full=RNA polymerase II transcription factor B
34 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p34 subunit
gi|44982288|gb|AAS51597.1| ADL323Cp [Ashbya gossypii ATCC 10895]
gi|374106985|gb|AEY95893.1| FADL323Cp [Ashbya gossypii FDAG1]
Length = 341
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 67/192 (34%), Positives = 94/192 (48%), Gaps = 10/192 (5%)
Query: 12 LEEFMNKDEQLGKQEPEG--RIACSLLSGSLSMALCYIQRV-FRSGLLHPQPRILCL--- 65
+EE +Q KQ +G + S LSG++S L YI R+ + + + R+L +
Sbjct: 124 VEELYRVFQQESKQLEDGVPQPFRSTLSGAMSAGLTYINRITHETEGVSLKSRLLVITCG 183
Query: 66 -QGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
S D QY+ IMN IFSA + PID +G + S FLQQA+ T G + D
Sbjct: 184 SSASKDEVFQYIPIMNCIFSATKMKCPIDVVKVGGVKESTFLQQATDATNGNYLHVANTD 243
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
GL QYL T D R ++ P VDFR SC+ + +G++CSVCL +
Sbjct: 244 GLIQYLSTAMFIDPSLRQWVVKPNQSSVDFRTSCYLTGKVVAIGFVCSVCLCVLSIIPPG 303
Query: 182 KKCSTCGSVFGQ 193
KC C S F +
Sbjct: 304 NKCPACDSEFDE 315
>gi|260939906|ref|XP_002614253.1| hypothetical protein CLUG_05739 [Clavispora lusitaniae ATCC 42720]
gi|238852147|gb|EEQ41611.1| hypothetical protein CLUG_05739 [Clavispora lusitaniae ATCC 42720]
Length = 332
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 94/198 (47%), Gaps = 20/198 (10%)
Query: 32 ACSLLSGSLSMALCYIQRVF---RSG---------------LLHPQPRILCLQGSPDGPE 73
A S L+G+LSMA+ Y R+ +SG R+L + +
Sbjct: 126 ARSTLAGALSMAMTYTNRMLHVDQSGSAVQVAAAGTAGAQTTAAMGARVLVVSANESDDN 185
Query: 74 QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIF 133
Y+ IMNAIF+AQ+ V ID LG +++ +L+QA+ T GV+ G+ Q L T F
Sbjct: 186 NYMGIMNAIFAAQKMKVAIDVAKLGRRSAPYLEQAADATQGVYLHVADPRGMVQTLATAF 245
Query: 134 GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGSVF 191
+ R+ + LP VD++ASCF +D+GY+CSVCL I C C S F
Sbjct: 246 FVEPSLRSVVILPTHGNVDYKASCFLTGRAVDVGYVCSVCLCIMSMLPDSGACPMCHSQF 305
Query: 192 GQAQTQSDEPSATNRKRK 209
+ + RKR+
Sbjct: 306 DEHHIARLRRGPSVRKRR 323
>gi|401838123|gb|EJT41898.1| TFB4-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 338
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/192 (33%), Positives = 93/192 (48%), Gaps = 8/192 (4%)
Query: 10 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
+ L E + K ++ K + E S L+G++S L Y+ R+ + + + R+L L
Sbjct: 123 EALVEEIYKLFEVEKNQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182
Query: 67 --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
S D QY+ IMN IFSA + PID +G ++ S FLQQ + T GV+
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVDSTQ 242
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
GL QYL T D R + P VDFR SC+ + +G+ICSVCL +
Sbjct: 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAIGFICSVCLCVLSIIPPG 302
Query: 182 KKCSTCGSVFGQ 193
KC C S F +
Sbjct: 303 NKCPACDSEFDE 314
>gi|365757943|gb|EHM99813.1| Tfb4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 338
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 65/192 (33%), Positives = 93/192 (48%), Gaps = 8/192 (4%)
Query: 10 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
+ L E + K ++ K + E S L+G++S L Y+ R+ + + + R+L L
Sbjct: 123 EALVEEIYKLFEVEKNQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182
Query: 67 --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLD 123
S D QY+ IMN IFSA + PID +G ++ S FLQQ + T GV+
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTTDATNGVYLHVDSTQ 242
Query: 124 GLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHL 181
GL QYL T D R + P VDFR SC+ + +G+ICSVCL +
Sbjct: 243 GLIQYLATAMFIDPSLRPIIVKPNHGSVDFRTSCYLTGRVVAIGFICSVCLCVLSIIPPG 302
Query: 182 KKCSTCGSVFGQ 193
KC C S F +
Sbjct: 303 NKCPACDSEFDE 314
>gi|50286117|ref|XP_445487.1| hypothetical protein [Candida glabrata CBS 138]
gi|62901307|sp|Q6FWA7.1|TFB4_CANGA RecName: Full=RNA polymerase II transcription factor B subunit 4;
AltName: Full=RNA polymerase II transcription factor B
34 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p34 subunit
gi|49524792|emb|CAG58398.1| unnamed protein product [Candida glabrata]
Length = 335
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 95/195 (48%), Gaps = 12/195 (6%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRV---FRSGLLHPQPRIL 63
TL++ + ++E L + + S LSG+++ AL Y R+ F S L + ++
Sbjct: 121 TLIEEFYKLYQREESLIDKPVQK----STLSGAMAAALTYTNRLTKEFESISLRSRLLVI 176
Query: 64 CLQGSPDGPE--QYVAIMNAIFSAQRSMVPIDSCYLGA-QNSAFLQQASYITGGVHHKPQ 120
S + E QY+ IMN IFSA + PID +G + S FLQQ + T GV+ +
Sbjct: 177 TCGSSREKDEIFQYIPIMNCIFSATKLKCPIDVIKIGGNKQSTFLQQTTDATNGVYIHLE 236
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSI--YC 178
+G+ QYL T D R + P VDFR SC+ + +GYICSVCL +
Sbjct: 237 STNGIIQYLSTAMSIDPSLRQIIVRPTQGSVDFRTSCYLTGKVVAIGYICSVCLCVLSII 296
Query: 179 KHLKKCSTCGSVFGQ 193
KC C S F +
Sbjct: 297 PPGNKCPACDSQFDE 311
>gi|341038752|gb|EGS23744.1| putative transcription factor [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 429
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 86/186 (46%), Gaps = 50/186 (26%)
Query: 53 SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYIT 112
+GL RIL + S QY+ MNA+F+A + + ID+ L ++ FL+QAS+IT
Sbjct: 195 AGLAGLHARILIISVSDSSAAQYIPTMNAVFAAAHARIAIDTLALRG-SATFLEQASFIT 253
Query: 113 GGVHHKPQQLDGLFQYLLTIFGTDLHSRN-----------FLQLPKPVG----------- 150
G + + GL QYL+ FG+ N + PKP G
Sbjct: 254 RGTFIRAAEPRGLLQYLMFGFGSGSAPSNPAAAAAAAEKSTIGGPKPSGAAAAQPAAGTT 313
Query: 151 -------------------------VDFRASCFCHKNTIDMGYICSVCLSIYCKHLK--K 183
VDFRA+CFCH+N +D G++CSVCLSI+C+ +
Sbjct: 314 KPGAAGKLPPGAAVTDLLMTPSAEAVDFRAACFCHRNVVDTGFVCSVCLSIFCEVPPGGE 373
Query: 184 CSTCGS 189
C TCG+
Sbjct: 374 CMTCGT 379
>gi|363751126|ref|XP_003645780.1| hypothetical protein Ecym_3480 [Eremothecium cymbalariae
DBVPG#7215]
gi|356889414|gb|AET38963.1| Hypothetical protein Ecym_3480 [Eremothecium cymbalariae
DBVPG#7215]
Length = 341
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 89/184 (48%), Gaps = 15/184 (8%)
Query: 34 SLLSGSLSMALCYIQRVFRSGL-LHPQPRILCLQ-GSPDGPE---QYVAIMNAIFSAQRS 88
S LSG++S L YI R+ + + R+L + GS G + QY+ IMN IFSA +
Sbjct: 149 STLSGAMSAGLTYINRIVKENESTSLKSRLLVITCGSNAGKDEIFQYIPIMNCIFSATKM 208
Query: 89 MVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 147
ID +G + S FLQQA+ T G + + +GL QYL T D R ++ P
Sbjct: 209 RCSIDVVKIGGTEESTFLQQATDATNGNYLHVENTEGLIQYLSTAMFIDPSLRQWIVKPN 268
Query: 148 PVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQAQTQSDEPSATN 205
VDFR SC+ + +G++CSVCL + KC C + F DE
Sbjct: 269 QSAVDFRTSCYLTGTVVAIGFVCSVCLCVLSIIPPENKCPACDAEF-------DEKVVAK 321
Query: 206 RKRK 209
KRK
Sbjct: 322 LKRK 325
>gi|403216997|emb|CCK71492.1| hypothetical protein KNAG_0H00770 [Kazachstania naganishii CBS
8797]
Length = 342
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 71/205 (34%), Positives = 94/205 (45%), Gaps = 31/205 (15%)
Query: 15 FMNKDEQLGKQ----------EPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP------ 58
F N DE L ++ E E S L G++ AL Y R LLH
Sbjct: 117 FRNVDETLVEELYTLFKQEMGEVEKATQKSTLPGAMLAALAYTNR-----LLHDYETKSL 171
Query: 59 QPRILCLQ-GSPDGPEQ------YVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASY 110
+ RI+ + G+ +G Q Y+ IMN IFSA + PID +G A+ S FLQQ +
Sbjct: 172 KSRIMVMTCGNKNGTSQKEEIFQYIPIMNCIFSAAKLKCPIDVVKIGGARKSTFLQQTTD 231
Query: 111 ITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYIC 170
T G++ DGL QYL T D R+ + P VDFR SCF + +GYIC
Sbjct: 232 ATNGIYLHVDSTDGLVQYLSTAMFIDPSLRSIIVKPNQGSVDFRTSCFLTGKVVAVGYIC 291
Query: 171 SVCLSI--YCKHLKKCSTCGSVFGQ 193
SVCL + +C C S F +
Sbjct: 292 SVCLCVLSVLPPGNRCPACESEFDE 316
>gi|366998281|ref|XP_003683877.1| hypothetical protein TPHA_0A03670 [Tetrapisispora phaffii CBS 4417]
gi|357522172|emb|CCE61443.1| hypothetical protein TPHA_0A03670 [Tetrapisispora phaffii CBS 4417]
Length = 344
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 66/216 (30%), Positives = 107/216 (49%), Gaps = 24/216 (11%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCL 65
+L++ + + K++ ++P+ + S L+G++S L YI R+ ++ + + R++ +
Sbjct: 126 SLVEEIYKIFQKEKAEYLEKPKQK---STLAGAMSAGLTYINRIVKNEENYTLKSRLVVI 182
Query: 66 --------QGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVH 116
+ QY+ IMN IFSA + PID +G + S FLQQA+ T G++
Sbjct: 183 TCGGNAYTDNKVEEVIQYIPIMNCIFSATKMKCPIDVVKIGGSSESTFLQQATDATSGIY 242
Query: 117 HKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCL-- 174
+ GL QYLLT + R+ + P +DFR SC+ + +G+ICS+CL
Sbjct: 243 LYVKDYHGLIQYLLTAMFIEPSLRSTIVKPNSKSIDFRTSCYITGKVVAVGFICSICLCV 302
Query: 175 -SIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 209
SI + KC C SVF DE KRK
Sbjct: 303 MSILPPN-NKCPACDSVF-------DEKVIVKLKRK 330
>gi|195447228|ref|XP_002071121.1| GK25630 [Drosophila willistoni]
gi|194167206|gb|EDW82107.1| GK25630 [Drosophila willistoni]
Length = 310
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 58/174 (33%), Positives = 85/174 (48%), Gaps = 10/174 (5%)
Query: 12 LEEFMNKDEQ-LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS---GL-----LHPQPRI 62
+ E M D Q + + SLL+G + ALCYIQR R GL L + R+
Sbjct: 134 IAELMRSDAQKVNYNSNAQNTSTSLLAGGIGKALCYIQRRRREITRGLAKGVQLRIEARV 193
Query: 63 LCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQ 121
L + + G Q + M+ A+ S V +D C L A+ + +QA+ +TGG ++
Sbjct: 194 LIVTATDPGSNQVINYMHMFNGAKSSRVALDVCILDRAETLSIWRQATALTGGFYYSTND 253
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLS 175
L LL IF R P+ VD+R +C CHKN +D+G++CS CLS
Sbjct: 254 FTDLLTRLLGIFLMSPVERFQFNYPEQPQVDYRPNCVCHKNVLDIGFVCSACLS 307
>gi|354543147|emb|CCE39865.1| hypothetical protein CPAR2_602840 [Candida parapsilosis]
Length = 381
Score = 92.8 bits (229), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 86/159 (54%), Gaps = 8/159 (5%)
Query: 61 RILCLQGSPDGPE--QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
RIL + + DG + Y+ +MN IF+AQ+ V ID LG +NS++LQQAS T GV+
Sbjct: 209 RILIVSAN-DGDDSINYIPLMNCIFTAQKLKVSIDVAKLGFKNSSYLQQASDATNGVYLH 267
Query: 119 PQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC 178
+ G+ Q L T F + R ++ LP V+++ASCF K +D+G++CSVCL I
Sbjct: 268 IEDPRGITQVLATAFFIEPSLRPYIILPTNTQVNYKASCFITKKPVDVGFVCSVCLCIMS 327
Query: 179 K--HLKKCSTCGSVFGQAQTQ--SDEPS-ATNRKRKTTD 212
KC C S F + + P T +KRK D
Sbjct: 328 SIPDDGKCPACSSDFDKRSINELTKIPQVVTKKKRKLED 366
>gi|366988553|ref|XP_003674043.1| hypothetical protein NCAS_0A11040 [Naumovozyma castellii CBS 4309]
gi|342299906|emb|CCC67662.1| hypothetical protein NCAS_0A11040 [Naumovozyma castellii CBS 4309]
Length = 338
Score = 92.8 bits (229), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 9/169 (5%)
Query: 34 SLLSGSLSMALCYIQRVFRS-GLLHPQPRILCLQ-----GSPDGPEQYVAIMNAIFSAQR 87
S LSG++S L YI RV + + + R++ L S D QY+ IMN IFSA +
Sbjct: 144 STLSGAMSAGLTYINRVTKELEAISLKSRLMVLTCGGKGESKDEIFQYIPIMNCIFSATK 203
Query: 88 SMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 146
PID +G ++ S FLQQ + T G + + GL QYL T D R + P
Sbjct: 204 IKCPIDVVKIGGSKESTFLQQTTDATNGAYLHVESTKGLIQYLATAMFIDPSLRQIIVKP 263
Query: 147 KPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQ 193
VDFR SC+ + +G+ICSVCL + KC C S F +
Sbjct: 264 NHGSVDFRTSCYLTGKVVAVGFICSVCLCVLSIIPPGNKCPACDSEFDE 312
>gi|302759787|ref|XP_002963316.1| hypothetical protein SELMODRAFT_438480 [Selaginella moellendorffii]
gi|300168584|gb|EFJ35187.1| hypothetical protein SELMODRAFT_438480 [Selaginella moellendorffii]
Length = 916
Score = 91.3 bits (225), Expect = 2e-16, Method: Composition-based stats.
Identities = 46/83 (55%), Positives = 58/83 (69%), Gaps = 5/83 (6%)
Query: 8 LLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQG 67
++ L +F+++D Q P + SLLS SLSMALCYIQR + + +PR+LCLQ
Sbjct: 792 IVHKLNDFVSRD---ATQPP--KFQPSLLSASLSMALCYIQRALKEKIGPSKPRMLCLQA 846
Query: 68 SPDGPEQYVAIMNAIFSAQRSMV 90
SPD P QY+AIMNAIFSAQRSMV
Sbjct: 847 SPDAPHQYIAIMNAIFSAQRSMV 869
>gi|344229675|gb|EGV61560.1| transcription factor Tfb4 [Candida tenuis ATCC 10573]
Length = 381
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 64/226 (28%), Positives = 105/226 (46%), Gaps = 42/226 (18%)
Query: 7 TLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR-------------- 52
T+L L + M+ + + + S LSG+LS+AL Y R+ +
Sbjct: 128 TVLSGLSQTMS---DITHAQDDKVTGSSRLSGALSLALSYTNRMMKLDQSISTTTASAIS 184
Query: 53 -----------------------SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSM 89
G + RIL + + + +Y+++M AIF AQ+
Sbjct: 185 SAANISNKESSTAGASSGTNANSYGTSRMKSRILVVTPNDNEDIKYISLMKAIFGAQKMK 244
Query: 90 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPV 149
V ID LG ++S++LQQA+ T GV+ ++ GL Q L T + + R + LP
Sbjct: 245 VAIDIAKLGRKDSSYLQQAADATNGVYLHIEKPLGLIQVLCTAYFIEPSIRPLMILPTNS 304
Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGSVFGQ 193
V+++ASCF ++++GY+CSVCL I + + C TC S F +
Sbjct: 305 NVNYKASCFITGKSVEIGYVCSVCLCIMSEIPDRMSCPTCNSHFDE 350
>gi|410082137|ref|XP_003958647.1| hypothetical protein KAFR_0H01020 [Kazachstania africana CBS 2517]
gi|372465236|emb|CCF59512.1| hypothetical protein KAFR_0H01020 [Kazachstania africana CBS 2517]
Length = 328
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 64/186 (34%), Positives = 89/186 (47%), Gaps = 16/186 (8%)
Query: 34 SLLSGSLSMALCYIQRVFRS-GLLHPQPRILCL---QGSPDGPE--QYVAIMNAIFSAQR 87
S L G++S L Y+ R+ + L + R++ L G + E QY+ IMN IFSA +
Sbjct: 144 STLPGAISAGLTYVNRISKEMETLSLKSRLMVLTCGSGMANKEEIFQYIPIMNCIFSATK 203
Query: 88 SMVPIDSCYLG-AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLP 146
P+D +G ++ S FLQQ + T GV+ + GL QYL T D R + P
Sbjct: 204 LKCPVDVVKIGGSKESTFLQQTTDATHGVYLHVETTKGLIQYLTTAMFIDPSLRQIIVKP 263
Query: 147 KPVGVDFRASCFCHKNTIDMGYICSVCLSI--YCKHLKKCSTCGSVFGQAQTQSDEPSAT 204
VDFR SC+ + +G+ICSVCL + KC C S F DE
Sbjct: 264 NQGSVDFRTSCYLTGKVVAVGFICSVCLCVLSIIPPGNKCPACDSEF-------DERVIN 316
Query: 205 NRKRKT 210
K+KT
Sbjct: 317 KLKKKT 322
>gi|336468231|gb|EGO56394.1| hypothetical protein NEUTE1DRAFT_147077 [Neurospora tetrasperma
FGSC 2508]
gi|350289522|gb|EGZ70747.1| transcription factor Tfb4 [Neurospora tetrasperma FGSC 2509]
Length = 439
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 84/183 (45%), Gaps = 55/183 (30%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RIL + S P QY+ MNA+F+A + + ID+ L ++ FLQQASYIT G + Q
Sbjct: 223 RILVVSVSDSSPSQYIPTMNAVFAAAHARIAIDTLALRG-SATFLQQASYITRGTFIRAQ 281
Query: 121 QLDGLFQYLLTIFGTDL---------HSR------------------------------- 140
+ GL QYL+ FG+ H R
Sbjct: 282 EPRGLLQYLMFGFGSGSAPQGLAAAGHQRSQQQHGGNKGKEDKASEKAVGAAAKGGAKHG 341
Query: 141 ------------NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCST 186
+ L P VDFRA+CFCH+N ID G++CS+CLSI+C+ +C T
Sbjct: 342 GQKFLGSNASVADLLVTPSADSVDFRAACFCHRNVIDTGFVCSICLSIFCEVPEGGECLT 401
Query: 187 CGS 189
CG+
Sbjct: 402 CGT 404
>gi|85078711|ref|XP_956217.1| hypothetical protein NCU01580 [Neurospora crassa OR74A]
gi|16416034|emb|CAB91384.2| related to basic transcription factor 2 (35 kd SU) [Neurospora
crassa]
gi|28917270|gb|EAA26981.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 439
Score = 89.4 bits (220), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 83/183 (45%), Gaps = 55/183 (30%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RIL + S P QY+ MNA+F+A + + ID+ L ++ FLQQASYIT G + Q
Sbjct: 223 RILVVSVSDSSPSQYIPTMNAVFAAAHARIAIDTLALRG-SATFLQQASYITRGTFIRAQ 281
Query: 121 QLDGLFQYLLTIF----------------------------------------------- 133
+ GL QYL+ F
Sbjct: 282 EPRGLLQYLMFGFGSGSAPQGLAAAGNQRSQQQHGGHKGKEDKASEKALGAAAKGGAKHG 341
Query: 134 -----GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCST 186
G++ + L P VDFRA+CFCH+N ID G++CS+CLSI+C+ +C T
Sbjct: 342 GQKFLGSNASVADLLVTPSADSVDFRAACFCHRNVIDTGFVCSICLSIFCEVPEGGECLT 401
Query: 187 CGS 189
CG+
Sbjct: 402 CGT 404
>gi|341877621|gb|EGT33556.1| hypothetical protein CAEBREN_10115 [Caenorhabditis brenneri]
Length = 311
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/139 (35%), Positives = 73/139 (52%), Gaps = 5/139 (3%)
Query: 73 EQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQASYITGGVHHKPQQLDGLFQYLLT 131
++ ++MN FSA + ++ +D LG LQQA+ ITGG Q+ L + L+T
Sbjct: 177 SEHGSLMNLFFSAAKQLICVDVVSLGDDFTGGILQQAADITGGSFMHAQKPQALLKILMT 236
Query: 132 IFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
TD R+ VD+RASC CH+ + G++CSVCLS+ C++ C C + F
Sbjct: 237 NMLTDPTYRSAFSKLSHNSVDYRASCVCHQTLVSSGWVCSVCLSVLCQYTPICKVCKAAF 296
Query: 192 GQAQTQSDEPSATNRKRKT 210
T + P NRKRK+
Sbjct: 297 ----TIRNIPKKPNRKRKS 311
>gi|407044375|gb|EKE42557.1| hypothetical protein ENU1_017690 [Entamoeba nuttalli P19]
Length = 263
Score = 85.9 bits (211), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 101/195 (51%), Gaps = 20/195 (10%)
Query: 4 LCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPR 61
L ++ + ++E +N ++ EP LS ++S+AL I + G L P Q +
Sbjct: 81 LTTSIAEAIKEVLN--DKFLVNEPN-------LSSAMSLALTTINK---DGKLQPDVQKK 128
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNS--AFLQQASYITGGVHHKP 119
I+ + + +++ +MN +F+AQR ID+ L + F QQASY+T G+++
Sbjct: 129 IIVVSTPFESSPKFIPLMNCVFAAQRIKTTIDTIVLSVIETRCTFCQQASYLTNGIYNTA 188
Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLP--KPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
L LF LL+ +D + +++ + +D++ CF K +D GY+CS C IY
Sbjct: 189 T-LQSLFPRLLSNNTSDCDTEKIIRVTGNQQPAIDWKLWCFHCKKPLDHGYVCSSCFGIY 247
Query: 178 CKHLK-KCSTCGSVF 191
C+ K +C TCG+ F
Sbjct: 248 CEKQKGRCPTCGAYF 262
>gi|225684398|gb|EEH22682.1| RNA polymerase II transcription factor B subunit 4
[Paracoccidioides brasiliensis Pb03]
Length = 382
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/107 (43%), Positives = 64/107 (59%), Gaps = 2/107 (1%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ- 120
I+ L + + Y+ IMN+IF+ QR +PID L A ++AFLQQAS TGG++
Sbjct: 233 IISLSNNTHSAQHYIPIMNSIFACQRLHIPIDIIKL-AGDAAFLQQASDATGGIYIPINA 291
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMG 167
G QYL+ F D +R L LP V VDFRA+CFCH+ +D+G
Sbjct: 292 HPAGFLQYLMLGFLPDQRARAHLILPSRVDVDFRAACFCHRKVVDVG 338
>gi|308501315|ref|XP_003112842.1| hypothetical protein CRE_25211 [Caenorhabditis remanei]
gi|308265143|gb|EFP09096.1| hypothetical protein CRE_25211 [Caenorhabditis remanei]
Length = 317
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 74/149 (49%), Gaps = 5/149 (3%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQASYITGGVHHKP 119
R + + SP ++ ++MN FSA + + +D +G LQQA+ ITGG
Sbjct: 169 RAVVISMSPIMGSEHGSLMNLFFSAAKQSICVDVVSMGDDFTGGILQQAADITGGSFLHA 228
Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 179
Q+ L + L+T TD R VD+RASC CH + G++CSVCLS+ C+
Sbjct: 229 QKPPALLKILMTNMLTDPSYRTAFSKLSHSSVDYRASCACHHKLVSSGWVCSVCLSVLCQ 288
Query: 180 HLKKCSTCGSVFGQAQTQSDEPSATNRKR 208
+ C C + F T ++ P NRKR
Sbjct: 289 YTPICKVCRAAF----TITNIPKKPNRKR 313
>gi|37958350|gb|AAP41125.1| ZK1128.4 [Pristionchus pacificus]
Length = 278
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/182 (31%), Positives = 93/182 (51%), Gaps = 17/182 (9%)
Query: 33 CSLLSGSLSMALCYIQRVFRSGLLHPQP-------RILCLQGSPDGPEQYVAIMNAIFSA 85
C +L+ +L+ A+C I++ G + RI+ + PD Q+ +MN FSA
Sbjct: 105 CGVLAATLATAICQIRKAATLGTNKSEETISGQKGRIVIISLLPDFGSQHSLLMNLFFSA 164
Query: 86 QRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLT--IFGTDLHSRNFL 143
+ + ID LG +SA LQQA+ IT GV +++ L + L+T + + S +F
Sbjct: 165 HKHDICIDVVSLGG-SSALLQQAADITAGVFQVIHKIEDLPRILMTRCLPVSIRDSTSF- 222
Query: 144 QLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSA 203
VD+RA+C CH+ + G+ CSVCL++ C + C +CG+ Q + S E
Sbjct: 223 -----TTVDYRAACVCHEKLVSNGWTCSVCLAVLCSFMAICPSCGACV-QDHSSSKEEKE 276
Query: 204 TN 205
T+
Sbjct: 277 TD 278
>gi|67468778|ref|XP_650390.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56467011|gb|EAL45004.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 263
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 101/195 (51%), Gaps = 20/195 (10%)
Query: 4 LCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPR 61
L ++ + ++E +N ++ EP LS ++S+AL I + G L P Q +
Sbjct: 81 LTISIAEAIKEVLN--DKFLVNEPN-------LSSAMSLALTTINK---DGKLQPDIQKK 128
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNS--AFLQQASYITGGVHHKP 119
I+ + + +++ +MN +F+AQR ID+ L + F QQASY+T G+++
Sbjct: 129 IIIVSTPFESSPKFIPLMNCVFAAQRIKTTIDTIVLSVIETRCTFCQQASYLTNGIYNTA 188
Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLP--KPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
L LF LL+ +D + +++ + +D++ CF K +D GY+CS C IY
Sbjct: 189 T-LQSLFPRLLSNNTSDCDTEKIIRVTGNQQPAIDWKLWCFHCKKPLDHGYVCSSCFGIY 247
Query: 178 CKHLK-KCSTCGSVF 191
C+ K +C TCG+ F
Sbjct: 248 CEKQKGRCPTCGAYF 262
>gi|393909857|gb|EFO22116.2| basic transcription factor 2 [Loa loa]
Length = 274
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/163 (31%), Positives = 88/163 (53%), Gaps = 11/163 (6%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
++ + ++++A+C+I R +++ + + RIL + D ++ +MN F+A + + ID
Sbjct: 101 AVFAPAVAIAICHIFR-YKNEVDNGDGRILIINIGSDLRGEHSILMNIFFAAHKHNILID 159
Query: 94 SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL--TIFGTDLHSRNFLQLPKP--- 148
+G + S LQQAS ITGG + ++ L +Y + T+ L S P P
Sbjct: 160 VANIG-EASTILQQASDITGGTYFSVKKPKQLLKYTMCFTLGKASLRS----AFPSPSSS 214
Query: 149 VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
VD+RASC CH + +G++CSVCLS+ C C C +VF
Sbjct: 215 TSVDYRASCHCHGAPVSIGWVCSVCLSVQCHFNPICPACNTVF 257
>gi|167391336|ref|XP_001739730.1| RNA polymerase II transcription factor B subunit [Entamoeba dispar
SAW760]
gi|165896489|gb|EDR23894.1| RNA polymerase II transcription factor B subunit, putative
[Entamoeba dispar SAW760]
Length = 249
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/195 (29%), Positives = 101/195 (51%), Gaps = 20/195 (10%)
Query: 4 LCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP--QPR 61
L +++ ++E +N ++ EP LS ++S+AL I + G L P Q +
Sbjct: 67 LTTPIVEAIKEVLN--DKFLVNEPN-------LSSAMSLALTTINK---DGKLQPDIQKK 114
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNS--AFLQQASYITGGVHHKP 119
++ + + +++ +MN +F+AQR ID+ L + F QQASY+T G+++
Sbjct: 115 VIVISTPFESSPKFIPLMNCVFAAQRIKTTIDTIVLSIIETGCTFCQQASYLTNGIYNTA 174
Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLP--KPVGVDFRASCFCHKNTIDMGYICSVCLSIY 177
L LF LL+ +D + +++ + +D++ CF K +D GY+CS C IY
Sbjct: 175 P-LQSLFPRLLSNNTSDCDTEKIIRVTGNQQSAIDWKLWCFHCKKPLDHGYVCSSCFGIY 233
Query: 178 C-KHLKKCSTCGSVF 191
C K ++C TCG+ F
Sbjct: 234 CEKQKERCPTCGAYF 248
>gi|402595070|gb|EJW88996.1| hypothetical protein WUBG_00095 [Wuchereria bancrofti]
Length = 274
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/163 (32%), Positives = 86/163 (52%), Gaps = 11/163 (6%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
+L + +++ A+C+I R F++ + RIL + D ++ +MN F+A + + ID
Sbjct: 101 ALFAPAIATAICHIFR-FKNEVDKGDGRILVINIGSDLLGEHNILMNIFFAAHKHSILID 159
Query: 94 SCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL--TIFGTDLHSRNFLQLPKP--- 148
+G + S LQQAS ITGG + ++ L +Y + T+ L S P P
Sbjct: 160 VANIG-ETSPILQQASDITGGTYFNVKKPKQLLKYTMCFTLGRASLRS----AFPSPSSS 214
Query: 149 VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+D+RASC CH + +G++CSVCLS+ C C C +VF
Sbjct: 215 TSIDYRASCHCHGAPVSVGWVCSVCLSVQCHFSPICPACNTVF 257
>gi|313239039|emb|CBY14022.1| unnamed protein product [Oikopleura dioica]
Length = 291
Score = 84.7 bits (208), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 85/177 (48%), Gaps = 6/177 (3%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID 93
S + +LS ALC I R RS P ++ G QY+ ++N ++A+ V ID
Sbjct: 119 SRFNNALSKALCCINR-MRSEKDLPARLMVMKCGGDFLSHQYLPLINCAYAAEHLKVSID 177
Query: 94 SCYLGAQNSA-FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVD 152
+ L A+ LQQ S +TGGV+ + + L Q LLT D +R L VD
Sbjct: 178 TVVLEAETDVPVLQQMSDVTGGVYSRISEPAILLQTLLTYHLPDDDARKMLSKVPATPVD 237
Query: 153 FRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRK 209
RA CF + D+GY+CSVCLS+Y C TC SV P + RKR+
Sbjct: 238 SRAVCFKTRAVTDIGYVCSVCLSVYKMLTPICVTCKSVLKLPGM----PPSRKRKRE 290
>gi|351713028|gb|EHB15947.1| General transcription factor IIH subunit 3 [Heterocephalus glaber]
Length = 89
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 36/70 (51%), Positives = 49/70 (70%)
Query: 122 LDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHL 181
+ L QYLL +F +D R L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C
Sbjct: 1 MASLLQYLLWVFLSDQDQRAQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFS 60
Query: 182 KKCSTCGSVF 191
C+TC + F
Sbjct: 61 PICTTCETAF 70
>gi|170591206|ref|XP_001900361.1| basic transcription factor 2, 35 kD subunit [Brugia malayi]
gi|158591973|gb|EDP30575.1| basic transcription factor 2, 35 kD subunit, putative [Brugia
malayi]
Length = 274
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 53/162 (32%), Positives = 85/162 (52%), Gaps = 11/162 (6%)
Query: 35 LLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 94
L + +++ A+C+I R F++ + RIL + D ++ +MN F+A + + ID
Sbjct: 102 LFAPAIATAICHIFR-FKNEVDKGDGRILIINIGSDLFGEHNILMNIFFAAHKHNILIDV 160
Query: 95 CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL--TIFGTDLHSRNFLQLPKP---V 149
+G + S LQQAS ITGG + ++ L +Y + T+ L S P P
Sbjct: 161 ANIG-ETSPILQQASDITGGTYFNVKKPKQLLKYTMCFTLGRASLRS----AFPSPSSST 215
Query: 150 GVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+D+RASC CH + +G++CSVCLS+ C C C +VF
Sbjct: 216 SIDYRASCHCHGAPVSVGWVCSVCLSVQCHFSPICPACNTVF 257
>gi|224006638|ref|XP_002292279.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971921|gb|EED90254.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 409
Score = 83.2 bits (204), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 64/189 (33%), Positives = 94/189 (49%), Gaps = 16/189 (8%)
Query: 15 FMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQ 74
N L +++ EG IA L+ GS S + LL PR L +Q S D
Sbjct: 208 LANSSSMLQRKDDEGVIA--LMGGSGSSNFKQQELQHSKSLL--SPRALIIQSSSDRTSD 263
Query: 75 YVAIMNAIFSAQRSMVPIDSCYL------GAQNSAFLQQASYITGGVH----HKPQQLDG 124
Y A+MN F+A + + ID C++ A++S +L+Q T GV+ Q+ G
Sbjct: 264 YNALMNCAFAANKQNIVIDGCFIPSGMKDDAKSSPYLEQIVDQTRGVYLSVPSGAAQVGG 323
Query: 125 -LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK 183
L + L+++F R + LPK VDFRA CF +ID+ IC+ CLSI+ + K+
Sbjct: 324 ALSEVLISVFLPPPLLRKEMNLPKLTKVDFRARCFETGESIDVAQICNQCLSIFKERPKE 383
Query: 184 -CSTCGSVF 191
C TCG+V
Sbjct: 384 FCLTCGAVV 392
>gi|378726397|gb|EHY52856.1| transcription initiation factor TFIIH subunit H3 [Exophiala
dermatitidis NIH/UT8656]
Length = 455
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 79/183 (43%), Gaps = 52/183 (28%)
Query: 59 QPRILCLQGSP--DGPEQYVAIMNAIFSAQRSMVPIDSCYL------------------- 97
Q RIL + SP D QY+ IMNAIF+ QR +PID L
Sbjct: 229 QSRILLVSVSPSTDLAHQYIPIMNAIFACQRLAIPIDILQLPLPLQTNNTTTQAQQTSTN 288
Query: 98 ---------------GAQNSAFLQQASYITGGVHHKPQ-------------QLDGLFQYL 129
+ ++ FLQQA+ T G+ Q YL
Sbjct: 289 NQTPAGDNNTTTAGNTSNSTVFLQQAADATHGIFIAAQLPKSQSQSQASAASSQAFLTYL 348
Query: 130 LTIFGTDLHSRN-FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCST 186
LT +R L LP + VDFRA+CFCH+N + +G++CS+CLSI+C C T
Sbjct: 349 LTSLLPSPSTRAAHLILPTRIDVDFRAACFCHRNVVSVGFVCSICLSIFCSVPENADCLT 408
Query: 187 CGS 189
CG+
Sbjct: 409 CGT 411
>gi|268573188|ref|XP_002641571.1| Hypothetical protein CBG09872 [Caenorhabditis briggsae]
Length = 299
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 74/149 (49%), Gaps = 5/149 (3%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQASYITGGVHHKP 119
R + + +P ++ ++MN FSA + + +D + LQQA+ ITGG
Sbjct: 151 RAVVISMTPIIGSEHGSLMNLFFSAAKQSICVDVVSMSDDLTGGILQQAADITGGSFVHA 210
Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 179
++ L + L+T TD RN VD+RASC CH + G++CS+CLS+ C+
Sbjct: 211 EKPQALLKILMTNMLTDASYRNTFSKLSYCSVDYRASCACHHTLVSSGWVCSICLSVLCQ 270
Query: 180 HLKKCSTCGSVFGQAQTQSDEPSATNRKR 208
+ C C + F T ++ P +RKR
Sbjct: 271 YTPICKVCKAAF----TITNLPKKPSRKR 295
>gi|397617770|gb|EJK64598.1| hypothetical protein THAOC_14658 [Thalassiosira oceanica]
Length = 263
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 77/155 (49%), Gaps = 12/155 (7%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL------GAQNSAFLQQASYITGG 114
R L +Q SPD Y A+MN F++ +S V +D C++ ++ S L+Q +TGG
Sbjct: 107 RALVVQASPDRTSDYNALMNCAFASSKSSVVVDGCFIPSGLRDDSRTSPVLRQVVDLTGG 166
Query: 115 VH----HKPQQLDGLFQYLLTI-FGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYI 169
V+ Q+ G +LT F R + LP+ VDFRA CF +D+ ++
Sbjct: 167 VYLAVPSGAAQVGGALGEVLTAAFLPPPRLRGLMNLPESDDVDFRARCFATGRAVDVAWV 226
Query: 170 CSVCLSIYCKHLK-KCSTCGSVFGQAQTQSDEPSA 203
C+ CLSI+ + + C TCG+ + + D A
Sbjct: 227 CNQCLSIFAERPEGACPTCGAKVLVGEGRGDSKRA 261
>gi|392896378|ref|NP_499249.2| Protein ZK1128.4 [Caenorhabditis elegans]
gi|224490453|emb|CAA87423.2| Protein ZK1128.4 [Caenorhabditis elegans]
Length = 296
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 74/149 (49%), Gaps = 5/149 (3%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQ-NSAFLQQASYITGGVHHKP 119
R + + +P ++ ++MN FSA + + +D +G LQQA+ ITGG
Sbjct: 148 RAVVISITPIMGSEHGSLMNLFFSAAKQSICVDVVSMGDDFTGGILQQAADITGGSFLHA 207
Query: 120 QQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK 179
++ L + L+T TD R VD+RASC CH + G++CS+CLS+ C+
Sbjct: 208 KKPQTLLKILMTNMLTDPTHRAVFSKLSHNSVDYRASCACHHQLVSSGWVCSICLSVLCQ 267
Query: 180 HLKKCSTCGSVFGQAQTQSDEPSATNRKR 208
+ C C + F T ++ P NRKR
Sbjct: 268 YTPICKVCKAAF----TIANLPIKPNRKR 292
>gi|169607795|ref|XP_001797317.1| hypothetical protein SNOG_06960 [Phaeosphaeria nodorum SN15]
gi|160701493|gb|EAT85611.2| hypothetical protein SNOG_06960 [Phaeosphaeria nodorum SN15]
Length = 312
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 73/143 (51%), Gaps = 32/143 (22%)
Query: 33 CSLLSGSLSMALCYIQRVFRSGLLHP----------------------------QPRILC 64
+L+ G++SMAL YI + S +LH RIL
Sbjct: 123 TTLIGGAISMALSYINK---STILHAPTGASAEITSVAAMADTDNSTHLDRIALTSRILI 179
Query: 65 LQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG 124
+ S D QY+ +MN+IF+AQR +PID L A ++ LQQAS TGGV+ KP++ +G
Sbjct: 180 VSVSGDLANQYIPVMNSIFAAQRKKIPIDILKL-AGDTVLLQQASDATGGVYMKPERPEG 238
Query: 125 LFQYLLTIFGTDLHSRNFLQLPK 147
L QYL+ + D +R L +P+
Sbjct: 239 LLQYLMMAYLPDATARKSLIIPR 261
>gi|406695514|gb|EKC98819.1| hypothetical protein A1Q2_06922 [Trichosporon asahii var. asahii
CBS 8904]
Length = 297
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 76/157 (48%), Gaps = 31/157 (19%)
Query: 61 RILCLQGSPDGPEQYVA--------------------IMNAIFSAQRSMVPIDSCYLGAQ 100
RIL L +P G E A +MN +F+ Q++ V ID L +
Sbjct: 87 RILILNATPGGAEDAEATDSKTAKASGGGGGQRGYVGLMNCVFAGQKAKVAIDVLTLPPE 146
Query: 101 NSA-----FLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF-LQLPKPVGVDFR 154
++ FLQQA+Y+T G++ + GL QYL +++ RN +P VDFR
Sbjct: 147 STMTAPPIFLQQAAYLTEGIYWRWNGRGGLLQYLHSMYLPPPSLRNKPFTVPPQDAVDFR 206
Query: 155 ASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
A CFCH + +D+G++C I+C+ CS C + F
Sbjct: 207 AVCFCHHDVVDVGFVC-----IFCEVKPICSMCKTKF 238
>gi|298711517|emb|CBJ26605.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 223
Score = 80.1 bits (196), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 43/78 (55%), Positives = 53/78 (67%), Gaps = 2/78 (2%)
Query: 36 LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 95
L+GSLS LCYI R R Q R L L GSPD PE Y ++MNAIFSAQ++ V +D
Sbjct: 137 LAGSLSKGLCYINRKMRESS-SIQARALVLSGSPDVPETYNSVMNAIFSAQKAGVLVDCA 195
Query: 96 YLGAQNSAFLQQASYITG 113
LG ++S FLQQA+Y+TG
Sbjct: 196 VLG-ESSTFLQQAAYLTG 212
>gi|324508596|gb|ADY43627.1| General transcription factor IIH subunit 3 [Ascaris suum]
Length = 272
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/153 (33%), Positives = 73/153 (47%), Gaps = 2/153 (1%)
Query: 39 SLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLG 98
+ + A C+I R F+ RIL + D + A+MN FSA + + +D +G
Sbjct: 106 AFATAFCHINR-FKKENDGADGRILIINIGSDLAREQNALMNVFFSAHKQGIVVDVANIG 164
Query: 99 AQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCF 158
+ S LQQA IT G + ++ L QYL+ SR+ VD+RASC
Sbjct: 165 -EPSPVLQQACDITMGAYINVEKPKRLLQYLMFFALGGPDSRSKFTSSVASSVDYRASCH 223
Query: 159 CHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
CH + +CSVCLS+ CK C C +VF
Sbjct: 224 CHGTPASIALVCSVCLSVQCKFNPICPICNTVF 256
>gi|357629722|gb|EHJ78327.1| putative transcription factor TFIIH-subunit [Danaus plexippus]
Length = 282
Score = 78.2 bits (191), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 68/116 (58%), Gaps = 4/116 (3%)
Query: 22 LGKQEP-EGRIACSLLSGSLSMALCYIQRVFRSGL--LHPQPRILCLQGSPDGPEQYVAI 78
L K +P E R SLL+G+++M LC+I R+ R L R+L + GS D QY+
Sbjct: 103 LIKSQPQEERPGESLLAGAMAMGLCFIARMRREQPPGLRMVSRMLVVTGSSDTAAQYINY 162
Query: 79 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFG 134
MN F+AQ+ V +D C L ++ + LQQ ITGG++ K L+GL QYLL I+
Sbjct: 163 MNVFFTAQKQQVLLDVCSLD-KHLSLLQQGCDITGGLYLKVPSLEGLLQYLLVIYA 217
>gi|312385025|gb|EFR29618.1| hypothetical protein AND_01266 [Anopheles darlingi]
Length = 516
Score = 77.0 bits (188), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 46/102 (45%), Positives = 60/102 (58%), Gaps = 9/102 (8%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHP-----QPRILCLQGSPDGPEQYVAIMNAIFSAQRS 88
SLL+GS+SMALCYI R+ R+ P R+L + GS + QY+ MN F+AQ+
Sbjct: 391 SLLAGSMSMALCYIARINRN---KPPGSKINSRVLVVTGSNECASQYMTYMNVFFTAQKQ 447
Query: 89 MVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL 130
V +D C L S LQQ ITGG + + +QLDG QYLL
Sbjct: 448 GVVVDVCALDKALS-LLQQGCDITGGQYLRLEQLDGFLQYLL 488
>gi|401400847|ref|XP_003880871.1| hypothetical protein NCLIV_039130 [Neospora caninum Liverpool]
gi|325115283|emb|CBZ50838.1| hypothetical protein NCLIV_039130 [Neospora caninum Liverpool]
Length = 580
Score = 76.3 bits (186), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 85/193 (44%), Gaps = 28/193 (14%)
Query: 29 GRIAC---SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG--PEQYVAIMNAIF 83
GR C S+L+G+LS+ALCY+ +V + P+ R+L L GS D QY+ +MN F
Sbjct: 372 GRSLCGEDSMLAGALSLALCYLNKVAKRSARPPERRVLLLDGSLDRSYSSQYMPLMNLAF 431
Query: 84 SAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK-----PQQLDG----------LFQY 128
+A + + ID C L S +Q I+ GVH K P G L
Sbjct: 432 AAAKGNIVIDCCALCPNPSTISEQLCDISRGVHLKFSQAAPASTSGNSHLDGGLALLQLL 491
Query: 129 LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--------KH 180
L IF + L G A CFCH +++ ICS CL+IYC K
Sbjct: 492 LFWIFPSVSLRPAIAALSVHRGRSNAAVCFCHHRPVEVCCICSCCLAIYCSEKDAQTGKE 551
Query: 181 LKKCSTCGSVFGQ 193
C C S F +
Sbjct: 552 RISCDVCKSRFSR 564
>gi|255545470|ref|XP_002513795.1| hypothetical protein RCOM_1031970 [Ricinus communis]
gi|223546881|gb|EEF48378.1| hypothetical protein RCOM_1031970 [Ricinus communis]
Length = 159
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 70/156 (44%), Gaps = 51/156 (32%)
Query: 1 MPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQP 60
+P+L + LLQ LE+F+ +DE+LGK E L +L + ++ LL P
Sbjct: 50 IPTLYSELLQKLEDFIIRDEKLGKGE-------EFLKETLPLHCSQDPCLWHFALLIPDT 102
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
+ +G VPIDSCY+G NSAFLQQ
Sbjct: 103 MVRKNEGK---------------------VPIDSCYVGGHNSAFLQQ------------- 128
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRAS 156
T+ TDLHSRN LQLP+P VDFRAS
Sbjct: 129 ----------TVLATDLHSRNCLQLPRPADVDFRAS 154
>gi|299751088|ref|XP_001830046.2| basic transcription factor 2.35KD subunit [Coprinopsis cinerea
okayama7#130]
gi|298409214|gb|EAU91711.2| basic transcription factor 2.35KD subunit [Coprinopsis cinerea
okayama7#130]
Length = 280
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 66/113 (58%), Gaps = 4/113 (3%)
Query: 20 EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFR-SGLLHPQ--PRILCLQGSPDGPEQYV 76
+ L E + C+L+ G+L+ ALC + S + PQ PRIL L SPD Y+
Sbjct: 105 DALDDSHDEEEVPCALV-GALTKALCSTTPLPNGSQGVPPQLDPRILILSVSPDLSTSYI 163
Query: 77 AIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYL 129
IMNAIFSAQ+ V ID+C + + FLQQA+++TGG + ++ D L QYL
Sbjct: 164 PIMNAIFSAQKLKVTIDACQIYGGETVFLQQAAHLTGGSYLLLERRDALLQYL 216
>gi|298711518|emb|CBJ26606.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 117
Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 51/79 (64%), Gaps = 1/79 (1%)
Query: 111 ITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYIC 170
+ GG++ + Q + GL QY+ F R +L+L VDFRA+CFCHK +D+ Y+C
Sbjct: 15 LPGGLYLEHQGV-GLAQYMFATFLPSQSCRKYLKLRTQQSVDFRAACFCHKRVVDIAYVC 73
Query: 171 SVCLSIYCKHLKKCSTCGS 189
SVCLS++C+ C+TCG+
Sbjct: 74 SVCLSVFCEFSPVCTTCGT 92
>gi|149579196|ref|XP_001517604.1| PREDICTED: general transcription factor IIH subunit 3-like, partial
[Ornithorhynchus anatinus]
Length = 80
Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 31/60 (51%), Positives = 43/60 (71%)
Query: 132 IFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
+F D R+ L LP P+ VD+RA+CFCH+N I++GY+CSVCLSI+C C+TC + F
Sbjct: 2 VFLPDQDQRSQLILPPPIHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTCETAF 61
>gi|221502054|gb|EEE27800.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 404
Score = 72.8 bits (177), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 84/196 (42%), Gaps = 29/196 (14%)
Query: 27 PEGRIAC---SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP--EQYVAIMNA 81
P AC S+L+G+LS+ALC + +V + P+ R+L L GS D QY+ +MN
Sbjct: 193 PRPTTACGEDSMLAGALSLALCCLNKVSKRSARTPERRVLILDGSLDRSYSSQYMPLMNL 252
Query: 82 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQ----------------LDGL 125
F+A + + ID C L S +Q I+ GVH K Q L L
Sbjct: 253 AFAAAKGNIVIDCCALSTNPSTIPEQLCDISRGVHLKFAQAAPSASTSGSASLDGGLALL 312
Query: 126 FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC------- 178
L IF + L G A CFCH +++ ICS CL+IYC
Sbjct: 313 QLLLFWIFPSMSLRPAIAALSVHRGRSNTAVCFCHHKPVEVCCICSCCLAIYCSEKDAQT 372
Query: 179 -KHLKKCSTCGSVFGQ 193
K C C S F +
Sbjct: 373 GKERISCDVCKSRFSR 388
>gi|124513782|ref|XP_001350247.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
gi|23615664|emb|CAD52656.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
Length = 326
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 103/224 (45%), Gaps = 35/224 (15%)
Query: 2 PSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPR 61
C+ L +F+N++++ K E S LS +L++ALCY R+ R+ + R
Sbjct: 108 KDFCSDTYDKLVDFINENKKEKKSE-------SALSSALTIALCYNNRI-RNLYENINTR 159
Query: 62 ILCLQGSPDG--PEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKP 119
+ L S QY +MN ++A+R+ + ID L + + L+Q IT G++
Sbjct: 160 LFLLDISKSHYYTNQYTQLMNIAYNAKRNNIIIDVFSLNYK-TQILEQICNITNGLYLDN 218
Query: 120 QQL----------DGLFQYLLTIFGTDLHSRNFLQ---LPKPVGVDFRASCFCHKNTIDM 166
D L Q ++ F +++R + L + + A C CH ID+
Sbjct: 219 TIFQSINPYGNIQDILTQTIIFWFLPSVNTRKYFSNTYLNEDTNI---AVCTCHNKQIDI 275
Query: 167 GYICSVCLSIYC--KHLK------KCSTCGSVFGQAQTQSDEPS 202
YICS CL+IYC K+L+ C+ C + F ++ ++ S
Sbjct: 276 AYICSCCLAIYCSEKNLQTNKDRLSCAICKTRFTKSLLRNKHVS 319
>gi|237838949|ref|XP_002368772.1| hypothetical protein TGME49_066930 [Toxoplasma gondii ME49]
gi|211966436|gb|EEB01632.1| hypothetical protein TGME49_066930 [Toxoplasma gondii ME49]
Length = 547
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 61/196 (31%), Positives = 83/196 (42%), Gaps = 29/196 (14%)
Query: 27 PEGRIAC---SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP--EQYVAIMNA 81
P AC S+L+G+LS+ALC + +V + P+ R+L L GS D QY+ +MN
Sbjct: 336 PRPTTACGEDSMLAGALSLALCCLNKVSKRSARTPERRVLILDGSLDRSYSSQYMPLMNL 395
Query: 82 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL----------------DGL 125
F+A + + ID C L S +Q I+ GVH K Q L
Sbjct: 396 AFAAAKGNIVIDCCALSTNPSTIPEQLCDISRGVHLKFAQAAPSASTSGSASLDGGLALL 455
Query: 126 FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC------- 178
L IF + L G A CFCH +++ ICS CL+IYC
Sbjct: 456 QLLLFWIFPSMSLRPAIAALSVHRGRSNTAVCFCHHKPVEVCCICSCCLAIYCSEKDAQT 515
Query: 179 -KHLKKCSTCGSVFGQ 193
K C C S F +
Sbjct: 516 GKERISCDVCKSRFSR 531
>gi|156373212|ref|XP_001629427.1| predicted protein [Nematostella vectensis]
gi|156216427|gb|EDO37364.1| predicted protein [Nematostella vectensis]
Length = 190
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 35/72 (48%), Positives = 50/72 (69%), Gaps = 1/72 (1%)
Query: 59 QPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
QPRIL ++ SPD Y+ +MN+IF+AQ+S ID+C L ++S +LQQA+ ITGGV+ K
Sbjct: 120 QPRILIIKASPDDASHYMPVMNSIFAAQKSNTCIDACVL-YEHSGYLQQATDITGGVYLK 178
Query: 119 PQQLDGLFQYLL 130
+ L QYL+
Sbjct: 179 IPETQALLQYLM 190
>gi|221481332|gb|EEE19726.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 549
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/196 (31%), Positives = 83/196 (42%), Gaps = 29/196 (14%)
Query: 27 PEGRIAC---SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGP--EQYVAIMNA 81
P AC S+L+G+LS+ALC + +V + P+ R+L L GS D QY+ +MN
Sbjct: 338 PRPTTACGEDSMLAGALSLALCCLNKVSKRSARTPERRVLILDGSLDRSYSSQYMPLMNL 397
Query: 82 IFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL----------------DGL 125
F+A + + ID C L S +Q I+ GVH K Q L
Sbjct: 398 AFAAAKGNIVIDCCALSTNPSTIPEQLCDISRGVHLKFAQAAPSASTSGSASLDGGLALL 457
Query: 126 FQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC------- 178
L IF + L G A CFCH +++ ICS CL+IYC
Sbjct: 458 QLLLFWIFPSMSLRPAIAALSVHRGRSNTAVCFCHHKPVEVCCICSCCLAIYCSEKDAQT 517
Query: 179 -KHLKKCSTCGSVFGQ 193
K C C S F +
Sbjct: 518 GKERISCDVCKSRFSR 533
>gi|116204609|ref|XP_001228115.1| hypothetical protein CHGG_10188 [Chaetomium globosum CBS 148.51]
gi|88176316|gb|EAQ83784.1| hypothetical protein CHGG_10188 [Chaetomium globosum CBS 148.51]
Length = 408
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 74/185 (40%), Gaps = 67/185 (36%)
Query: 54 GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 113
GL RIL L S P QY+ MNA+F+A R+ + ID+ L A
Sbjct: 199 GLAGLHARILILSVSDSAPAQYIPTMNAVFAAARARIAIDTLSLRAA------------- 245
Query: 114 GVHHKPQQLDGLFQYLLTIFGTDLHSRN-------------------------------- 141
+P+ GL QYL+ FG+ N
Sbjct: 246 ----EPR---GLLQYLMFGFGSGTAPSNPSGGADSGKGPASSKTATAGGQGTTAAGGRHP 298
Query: 142 -------------FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCST 186
L P VDFRA+CFCH+N ID G++CSVCLSI+C+ +C T
Sbjct: 299 RTAKLGLGASVAELLVTPSADAVDFRAACFCHRNVIDTGFVCSVCLSIFCEVPDGGECLT 358
Query: 187 CGSVF 191
CG+
Sbjct: 359 CGTAL 363
>gi|123976711|ref|XP_001314576.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121897129|gb|EAY02259.1| hypothetical protein TVAG_030750 [Trichomonas vaginalis G3]
Length = 284
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 76/160 (47%), Gaps = 8/160 (5%)
Query: 36 LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 95
L +LS +LC + + + + RIL SP+ QY N +F+A+R V ID
Sbjct: 116 LDVALSKSLCLMNNLPNKNI---KKRILAFSVSPETGSQYD---NLLFAAKRLDVVIDVL 169
Query: 96 YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 155
+L + FL QA+ +T G + L QYL ++ T + R+ + LP P +++ A
Sbjct: 170 FLNSNRLCFLSQAADLTHGFSKTIKMGAALAQYLFSLPPTTV--RDLMALPAPEPLEYSA 227
Query: 156 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQ 195
I GYIC VCLSI + + C C + +G
Sbjct: 228 PSADTHKMISRGYICPVCLSILEEACRTCPICNTRYGSVN 267
>gi|224146905|ref|XP_002336364.1| predicted protein [Populus trichocarpa]
gi|222834814|gb|EEE73263.1| predicted protein [Populus trichocarpa]
Length = 80
Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 35/66 (53%), Positives = 42/66 (63%)
Query: 79 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
MNA FSA RS+VPIDSC +GA N AF QQASYITG V+ KPQQ+ + F ++
Sbjct: 1 MNATFSAPRSLVPIDSCRVGAHNPAFSQQASYITGSVYVKPQQVSLVNHRPFIFFAPEVA 60
Query: 139 SRNFLQ 144
N L
Sbjct: 61 VENLLH 66
>gi|224116574|ref|XP_002331930.1| predicted protein [Populus trichocarpa]
gi|222874602|gb|EEF11733.1| predicted protein [Populus trichocarpa]
Length = 79
Score = 70.5 bits (171), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 35/66 (53%), Positives = 42/66 (63%)
Query: 79 MNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLH 138
MNA FSA RS+VPIDSC +GA N AF QQASYITG V+ KPQQ+ + F ++
Sbjct: 1 MNATFSAPRSLVPIDSCRVGAHNPAFSQQASYITGSVYVKPQQVSLVNHRPFIFFAPEVA 60
Query: 139 SRNFLQ 144
N L
Sbjct: 61 VENLLH 66
>gi|399218636|emb|CCF75523.1| unnamed protein product [Babesia microti strain RI]
Length = 278
Score = 69.7 bits (169), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 56/115 (48%), Gaps = 11/115 (9%)
Query: 90 VPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDG---------LFQYLLTIFGTDLHSR 140
+P+ + A N L+Q S IT G + + G LFQ LL F S+
Sbjct: 149 IPLLNIGFSALNMKLLEQLSDITKGTFLRLMDMYGNDSINIEYALFQTLLFWFLASNDSK 208
Query: 141 NFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--KHLKKCSTCGSVFGQ 193
N L P + A C+CH NTI++GY+CS CL++YC K L C C S F +
Sbjct: 209 NILSPPSLTQISNSAVCYCHNNTIEIGYLCSSCLTVYCTDKSLIMCKICSSRFAK 263
>gi|82541502|ref|XP_724989.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23479826|gb|EAA16554.1| hypothetical protein [Plasmodium yoelii yoelii]
Length = 327
Score = 69.7 bits (169), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 86/195 (44%), Gaps = 28/195 (14%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG--PEQYVAIMNAIFSAQRSMVP 91
S LS +L++ALCY R+ + + RI L S QY +MN ++A+R+ +
Sbjct: 134 STLSSALAIALCYNNRIC-NLYENINSRIFLLDISKSHFYTNQYTQLMNIAYNAKRNKII 192
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQL----------DGLFQYLLTIFGTDLHSRN 141
ID L + + L+Q IT G++ D L Q ++ F HSR
Sbjct: 193 IDVFSLNHK-TQILEQICNITNGLYIDNSIFLSINCGDNVEDILTQTIMFWFLPSTHSRK 251
Query: 142 FLQ---LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--------KHLKKCSTCGSV 190
+ L + + A C CH ID+ YICS CL+IYC K CS C +
Sbjct: 252 YFSNTYLNEDTNI---AVCTCHNKQIDIAYICSCCLAIYCSEKDSQTNKERISCSVCKTR 308
Query: 191 FGQAQTQSDEPSATN 205
F +A ++ S N
Sbjct: 309 FTKALLRNKTVSDLN 323
>gi|68063783|ref|XP_673887.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56492063|emb|CAI02316.1| conserved hypothetical protein [Plasmodium berghei]
Length = 327
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 96/224 (42%), Gaps = 31/224 (13%)
Query: 5 CATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILC 64
C + L F+ KD + E + S LS +L++ALCY R+ + + RI
Sbjct: 108 CTNAYEKLINFI-KDNNNNSKTYE--MVESTLSSALAIALCYNNRIC-NLYENINSRIFL 163
Query: 65 LQGSPDG--PEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL 122
L S QY +MN ++A+R+ + ID L + + L+Q IT G++
Sbjct: 164 LDISKSHFYTNQYTQLMNIAYNAKRNNIIIDVFSLNHK-TQILEQICNITNGLYIDNSIF 222
Query: 123 ----------DGLFQYLLTIFGTDLHSRNFLQ---LPKPVGVDFRASCFCHKNTIDMGYI 169
D L Q ++ F HSR + L + + A C CH ID+ YI
Sbjct: 223 LSINCADNVEDILTQTIMFWFLPSTHSRKYFSNTYLNEDTNI---AVCTCHNKQIDIAYI 279
Query: 170 CSVCLSIYC--------KHLKKCSTCGSVFGQAQTQSDEPSATN 205
CS CL+IYC K CS C + F ++ ++ S N
Sbjct: 280 CSCCLAIYCSEKDAQTNKERISCSVCKTRFTKSLLRNKTVSDLN 323
>gi|401888190|gb|EJT52153.1| hypothetical protein A1Q1_06259 [Trichosporon asahii var. asahii
CBS 2479]
Length = 323
Score = 67.4 bits (163), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 79/184 (42%), Gaps = 59/184 (32%)
Query: 61 RILCLQGSPDGPEQYVA--------------------IMNAIFSAQRSM----------- 89
RIL L +P G E A +MN +F+AQ+++
Sbjct: 87 RILILNATPGGAEDAEATDSKTAKASGGGGGQRGYVGLMNCVFAAQKAVSLTAVEDELTC 146
Query: 90 --VPIDSCYLGAQNSA-----FLQQASYITGGVHHKPQQLDGLFQYL------------- 129
V ID L +++ FLQQA+Y+T G++ + GL QYL
Sbjct: 147 QKVAIDVLTLPPESTMTAPPIFLQQAAYLTEGIYWRWNGRGGLLQYLHVSLLVPSVSADP 206
Query: 130 --LTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
+ + L ++ F +P VDFRA CFCH + +D+G++C I+C+ CS C
Sbjct: 207 QSMYLPPPSLRNKPF-TVPPQDAVDFRAVCFCHHDVVDVGFVC-----IFCEVKPICSMC 260
Query: 188 GSVF 191
+ F
Sbjct: 261 KTKF 264
>gi|76163131|gb|AAX30917.2| SJCHGC08726 protein [Schistosoma japonicum]
Length = 77
Score = 66.2 bits (160), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 6/76 (7%)
Query: 129 LLTIFGTDLHSRNFLQLPKP------VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLK 182
LL++F R L LP GVDFRA+CFCHK ID+GY+CS+CLS++C+
Sbjct: 1 LLSVFLPSSKMRTSLILPDSRSSGLSFGVDFRAACFCHKRLIDIGYVCSICLSVFCEFNP 60
Query: 183 KCSTCGSVFGQAQTQS 198
C TC + F Q+
Sbjct: 61 ICPTCNTPFVLPTVQN 76
>gi|70953884|ref|XP_746016.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56526511|emb|CAH87927.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 327
Score = 66.2 bits (160), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 57/195 (29%), Positives = 86/195 (44%), Gaps = 28/195 (14%)
Query: 34 SLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG--PEQYVAIMNAIFSAQRSMVP 91
S LS +L++ALCY R+ + + RI L S QY +MN ++A+R+ +
Sbjct: 134 STLSSALAIALCYNNRIC-NLYENINSRIFLLDISKSHFYTNQYTQLMNIAYNAKRNNII 192
Query: 92 IDSCYLGAQNSAFLQQASYITGGVHHKPQQL----------DGLFQYLLTIFGTDLHSRN 141
ID L + + L+Q IT G++ D L Q ++ F +SR
Sbjct: 193 IDVFSLNHK-TQILEQICNITNGLYIDNSIFLSITCGDNVEDILTQTIMFWFLPSTNSRK 251
Query: 142 FLQ---LPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC--------KHLKKCSTCGSV 190
+ L + + A C CH ID+ YICS CL+IYC K CS C +
Sbjct: 252 YFSNTYLNEDTNI---AVCTCHNKQIDIAYICSCCLAIYCSEKDAQTNKERISCSICKTR 308
Query: 191 FGQAQTQSDEPSATN 205
F +A ++ S N
Sbjct: 309 FTKALLRNKTVSDLN 323
>gi|241605964|ref|XP_002405599.1| RNA polymerase II transcription initiation factor, putative [Ixodes
scapularis]
gi|215500649|gb|EEC10143.1| RNA polymerase II transcription initiation factor, putative [Ixodes
scapularis]
Length = 291
Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 55/101 (54%), Gaps = 3/101 (2%)
Query: 20 EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQ--PRILCLQGSPDGPEQYVA 77
E + + E SLL+G LS+ALCYI R+ + + RIL L S + QY+
Sbjct: 99 ELVLSDDSESAAGESLLTGGLSLALCYINRIEKESSSQNKIASRILVLSASGESASQYLN 158
Query: 78 IMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHK 118
MN F+AQ+ V ID+C L ++S LQQ ITGG + K
Sbjct: 159 FMNVFFTAQKKNVIIDACVL-EKDSGLLQQGCDITGGKYMK 198
>gi|380477141|emb|CCF44315.1| RNA polymerase II transcription factor B subunit 4 [Colletotrichum
higginsianum]
Length = 255
Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 59/117 (50%), Gaps = 19/117 (16%)
Query: 16 MNKDEQLGKQEPEGRIACSL--LSGSLSMALCYIQ----------------RVFRSGLLH 57
M+ +L E +AC+ LSG+L++AL +I R S L
Sbjct: 128 MSSMRKLVDATTEAXLACTTTQLSGALTLALAHINKTSLSLNETNKAPDVPRSTSSALSR 187
Query: 58 PQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGG 114
+ RI L S P QY++ MNA+F+A S +PID+ L ++ FLQQASYIT G
Sbjct: 188 LRARIFILSVSDSEPVQYISTMNAVFAAAHSQIPIDTLALSG-DATFLQQASYITDG 243
>gi|19074510|ref|NP_586016.1| hypothetical protein ECU07_0880 [Encephalitozoon cuniculi GB-M1]
gi|19069152|emb|CAD25620.1| hypothetical protein [Encephalitozoon cuniculi GB-M1]
gi|449329589|gb|AGE95860.1| hypothetical protein ECU07_0880 [Encephalitozoon cuniculi]
Length = 213
Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 80/163 (49%), Gaps = 25/163 (15%)
Query: 38 GSLSMALCYIQRVFRSGLLH--PQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 95
G L ALC LH + +I+ + ++Y+ + +F+AQR + + S
Sbjct: 74 GDLGYALC----------LHRDEESQIVIFTLGRERKDEYLRYLKCMFAAQRFGIRV-SA 122
Query: 96 YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 155
+ +N LQ + TGG + + DG ++LL++ GT PKP+G F A
Sbjct: 123 FSLFENKTILQCCAS-TGGDYSTSE--DGCLRFLLSLLGTRGE-------PKPLG--FPA 170
Query: 156 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQS 198
+C+CH + +G +C +CLS++C+ + C C S F + +S
Sbjct: 171 TCYCHNRQVLLGLVCPICLSVFCRFVPVCKKCKSKFSFTKYES 213
>gi|303389949|ref|XP_003073206.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB4 [Encephalitozoon
intestinalis ATCC 50506]
gi|303302351|gb|ADM11846.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB4 [Encephalitozoon
intestinalis ATCC 50506]
Length = 213
Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 81/161 (50%), Gaps = 21/161 (13%)
Query: 38 GSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL 97
G L ALC + R G + +I+ + ++Y+ + +F+AQR V I + L
Sbjct: 74 GDLGYALC----LHRLG----ESQIVIFTLEKERNDEYLRYLKCMFAAQRFGVRISAFSL 125
Query: 98 GAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASC 157
+N LQ + TGG + + DG ++LL++ GT S KP+G F A+C
Sbjct: 126 -FENKTILQCCA-ATGGGYSTSE--DGCLRFLLSLLGTKGDS-------KPLG--FPATC 172
Query: 158 FCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQS 198
+CH + +G +C +CLS++C+ + C C S F + +S
Sbjct: 173 YCHNKQVLLGLVCPICLSVFCRFVPVCKKCKSKFSFVKHES 213
>gi|156101255|ref|XP_001616321.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805195|gb|EDL46594.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 323
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 87/203 (42%), Gaps = 23/203 (11%)
Query: 13 EEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG- 71
EE NK + + ++ S LS +L++ALCY R+ + + + RI L S +
Sbjct: 108 EEAYNKLIEFLSENKTEKMMESSLSSALAVALCYNHRIV-NMYENMKGRIFLLDISNETY 166
Query: 72 --PEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL------- 122
QY +MN ++A+R+ + ID L + L+Q IT G++
Sbjct: 167 VYTNQYTQLMNIAYNAKRNNIIIDVFSLNCK-IQLLEQICNITNGLYIDSSIFFNVNCTD 225
Query: 123 ---DGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC- 178
D L Q ++ F ++R + A C CH +D+ YICS CL+IYC
Sbjct: 226 NIEDVLTQTMIFWFLPSNNTRKYFSNTYLNDDTNIAVCSCHNKQVDIAYICSCCLAIYCS 285
Query: 179 -------KHLKKCSTCGSVFGQA 194
K C C + F +A
Sbjct: 286 EKDEKTNKERTSCFVCKTRFTKA 308
>gi|225707288|gb|ACO09490.1| TFIIH basal transcription factor complex p34 subunit [Osmerus
mordax]
Length = 208
Score = 63.9 bits (154), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 37/94 (39%), Positives = 53/94 (56%), Gaps = 3/94 (3%)
Query: 22 LGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRILCLQGSPDGPEQYVAIM 79
+ + E +G +LL+GSL+ ALCYI RV + + RIL + + D QY+ M
Sbjct: 111 MSRTEVKGHQTDTLLAGSLAKALCYIHRVTKELEAGQELKSRILVFKAAEDCARQYMNFM 170
Query: 80 NAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 113
N IF+AQR + ID+C L ++ S LQQ I G
Sbjct: 171 NVIFAAQRQNILIDACVLDSE-SGLLQQLISICG 203
>gi|367054924|ref|XP_003657840.1| hypothetical protein THITE_2123949 [Thielavia terrestris NRRL 8126]
gi|347005106|gb|AEO71504.1| hypothetical protein THITE_2123949 [Thielavia terrestris NRRL 8126]
Length = 459
Score = 63.2 bits (152), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 35/50 (70%), Gaps = 2/50 (4%)
Query: 142 FLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCK--HLKKCSTCGS 189
L P VDFRA+CFCH+N +D G++CS+CLSI+C+ +C TCG+
Sbjct: 352 LLVTPSADAVDFRAACFCHRNVVDTGFVCSICLSIFCEVPDGGECLTCGT 401
Score = 60.5 bits (145), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 47/82 (57%), Gaps = 1/82 (1%)
Query: 54 GLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITG 113
GL RIL L S P QY+ MNA+F+A + + ID+ L ++ FL+QAS+IT
Sbjct: 207 GLAGMHARILILSVSDSSPAQYIPTMNAVFAAAHARIAIDTLSLRG-SATFLEQASFITR 265
Query: 114 GVHHKPQQLDGLFQYLLTIFGT 135
G + + GL QYL+ FG+
Sbjct: 266 GTFIRAAEPRGLLQYLMFGFGS 287
>gi|221057700|ref|XP_002261358.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
gi|194247363|emb|CAQ40763.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
Length = 323
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 86/203 (42%), Gaps = 23/203 (11%)
Query: 13 EEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG- 71
EE NK + + ++ S LS +L++ALCY R+ + + + RI L S
Sbjct: 108 EEAYNKLIEFINENKTEKMMESNLSSALAVALCYNHRIV-NMYENMKGRIFLLDISNQSY 166
Query: 72 --PEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL------- 122
QY +MN ++A+R+ + ID L + L+Q IT G++
Sbjct: 167 VYTNQYTQLMNIAYNAKRNNIIIDVFSLNYK-IQLLEQICNITNGLYIDTTIFFNVNCTD 225
Query: 123 ---DGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC- 178
D L Q ++ F ++R + A C CH +D+ YICS CL+IYC
Sbjct: 226 NIEDVLTQTMIFWFLPSNNTRKYFSNTYLNDDTNIAVCSCHNKQVDIAYICSCCLAIYCS 285
Query: 179 -------KHLKKCSTCGSVFGQA 194
K C C + F +A
Sbjct: 286 EKDDKTNKDRISCFVCKTRFTKA 308
>gi|32399081|emb|CAD98321.1| transcription factor, possible [Cryptosporidium parvum]
Length = 311
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 76/175 (43%), Gaps = 31/175 (17%)
Query: 59 QPRILCLQGSPDGP--EQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
+ RI+ L S QYV+++N F++Q+ V ID + S L I G++
Sbjct: 132 KDRIILLDASNKDEYISQYVSLLNCGFASQKLDVVIDVVSVTRNPSPLLNNLVDICNGLN 191
Query: 117 HKPQQL-DGLFQYLLTIFGTDLHSRN--------FLQLP---------------KPVGVD 152
K Q+ D L + + I D H + F LP K G+
Sbjct: 192 LKYSQIVDSLAESNIEINKEDHHWLDQGLTPFLIFHLLPSIQAREEIFISINKTKQTGL- 250
Query: 153 FRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGSVFGQAQTQSDEPSATN 205
A CFCH I++GY+CS CLSI+C K CSTCG+ + Q S+ N
Sbjct: 251 --AVCFCHHQKIEIGYVCSSCLSIFCSRFKAPICSTCGARLKRIPIQQKTLSSLN 303
>gi|389584514|dbj|GAB67246.1| hypothetical protein PCYB_112670, partial [Plasmodium cynomolgi
strain B]
Length = 323
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 87/206 (42%), Gaps = 29/206 (14%)
Query: 13 EEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPD-- 70
EE NK + + ++ S LS +L++ALCY R+ + + + RI L S
Sbjct: 108 EEAYNKLIEFINENKTEKMMESSLSSALAVALCYNHRIV-NMYENMKGRIFLLDISNQIY 166
Query: 71 -GPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQL------- 122
QY +MN ++A+R+ + ID L + L+Q IT G++
Sbjct: 167 VYTNQYTQLMNIAYNAKRNDIIIDVFSLNYK-IKLLEQICNITNGLYIDSSIFFNVNCTD 225
Query: 123 ---DGLFQYLLTIFGTDLHSRNFLQ---LPKPVGVDFRASCFCHKNTIDMGYICSVCLSI 176
D L Q ++ F +R + L + A C CH +D+ YICS CL+I
Sbjct: 226 NIEDVLTQTMIFWFLPSNSTRKYFSNTYLNDDTNI---AVCSCHNKQVDIAYICSCCLAI 282
Query: 177 YC--------KHLKKCSTCGSVFGQA 194
YC K C C + F +A
Sbjct: 283 YCSEKDDKTNKDRTSCFVCKTRFTKA 308
>gi|209882598|ref|XP_002142735.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209558341|gb|EEA08386.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 310
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 84/193 (43%), Gaps = 23/193 (11%)
Query: 36 LSGSLSMALCYIQ--RVFRSGLLHPQPRILCLQGSPDGP--EQYVAIMNAIFSAQRSMVP 91
L+ +S+ LC I R S + RIL S + QYV ++N+ ++A + V
Sbjct: 110 LASGISICLCKINSDRKLHSKNRTNKDRILLFDASNEKSYINQYVLLINSSYAALKLDVI 169
Query: 92 IDSCYLGAQNSAFLQQASYITGGVH---------------HKPQQLDGLFQYLL--TIFG 134
ID C + + S L I+ G++ K DGL +++ +
Sbjct: 170 IDVCSISSNPSRLLYNVVDISKGIYINYSSIIKNVSSKIDDKSHIQDGLLPFIIFHLLPS 229
Query: 135 TDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCGSVFG 192
+L + + + L + CFCH +++G+ICS CL+I+C + C+ CG+ F
Sbjct: 230 AELRANSLINLSTKSRHSGISVCFCHYKKVEVGFICSSCLAIFCSLFRAPICAACGARFK 289
Query: 193 QAQTQSDEPSATN 205
+ Q S N
Sbjct: 290 RVPIQQKPLSYLN 302
>gi|396081719|gb|AFN83334.1| RNA polymerase II transcription [Encephalitozoon romaleae SJ-2008]
Length = 213
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 77/163 (47%), Gaps = 25/163 (15%)
Query: 38 GSLSMALCYIQRVFRSGLLHPQP--RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 95
G L ALC LH +I+ + ++Y+ + +F+AQR V I S
Sbjct: 74 GDLGYALC----------LHKDEPTQIVIFTLGRERKDEYLRYLKCMFAAQRLGVRI-SA 122
Query: 96 YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 155
+ +N LQ TGG + + D ++LL++ GT KP+G F A
Sbjct: 123 FSLFENKTILQ-CCVSTGGDYSTSE--DRCLRFLLSLLGTG-------GSQKPLG--FPA 170
Query: 156 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQTQS 198
+C+CH I +G +C +CLS++CK + C C S F A+ +S
Sbjct: 171 TCYCHNRQILLGLVCPICLSVFCKFIPVCKKCKSKFSFAKYES 213
>gi|126650916|ref|XP_001388356.1| transcription factor [Cryptosporidium parvum Iowa II]
gi|126117450|gb|EAZ51550.1| transcription factor, putative [Cryptosporidium parvum Iowa II]
Length = 286
Score = 59.7 bits (143), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 70/158 (44%), Gaps = 31/158 (19%)
Query: 59 QPRILCLQGSPDGP--EQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVH 116
+ RI+ L S QYV+++N F++Q+ V ID + S L I G++
Sbjct: 132 KDRIILLDASNKDEYISQYVSLLNCGFASQKLDVVIDVVSVTRNPSPLLNNLVDICNGLN 191
Query: 117 HKPQQL-DGLFQYLLTIFGTDLHSRN--------FLQLP---------------KPVGVD 152
K Q+ D L + + I D H + F LP K G+
Sbjct: 192 LKYSQIVDSLAESNIEINKEDHHWLDQGLTPFLIFHLLPSIQAREEIFISINKTKQTGL- 250
Query: 153 FRASCFCHKNTIDMGYICSVCLSIYCKHLKK--CSTCG 188
A CFCH I++GY+CS CLSI+C K CSTCG
Sbjct: 251 --AVCFCHHQKIEIGYVCSSCLSIFCSRFKAPICSTCG 286
>gi|401827001|ref|XP_003887593.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB4 [Encephalitozoon hellem
ATCC 50504]
gi|392998599|gb|AFM98612.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB4 [Encephalitozoon hellem
ATCC 50504]
Length = 213
Score = 59.3 bits (142), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 68/129 (52%), Gaps = 13/129 (10%)
Query: 70 DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYL 129
+ ++Y+ + +F+AQR V I + L +N LQ + TGG + + D ++L
Sbjct: 98 EKKDEYLRYLKCMFAAQRLGVRISAFSL-FENKTILQCCAS-TGGDYSTSE--DKCLRFL 153
Query: 130 LTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGS 189
L++ GT + KP+G F A+C+CH + +G +C +CLS++CK + C C S
Sbjct: 154 LSLLGTGGN-------KKPLG--FPAACYCHNRQVILGLVCPICLSVFCKFIPVCRKCKS 204
Query: 190 VFGQAQTQS 198
F + ++
Sbjct: 205 KFSFVKYEN 213
>gi|294947738|ref|XP_002785469.1| hypothetical protein Pmar_PMAR005787 [Perkinsus marinus ATCC 50983]
gi|239899380|gb|EER17265.1| hypothetical protein Pmar_PMAR005787 [Perkinsus marinus ATCC 50983]
Length = 385
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 56/197 (28%), Positives = 81/197 (41%), Gaps = 39/197 (19%)
Query: 35 LLSGSLSMALCYIQRVFRSGLLHPQPRILCL------QGSPDGPEQYVAIMNAIFSAQRS 88
L++ LS+ALCY+ + L P C+ G + + V + N ++AQ
Sbjct: 150 LIAAGLSLALCYLNK----HLTIPGRARCCIVEAVSVYGEANYASEAVPLANCGWAAQDL 205
Query: 89 MVPIDSCYL-----------------------GAQNSAFLQQASYITGGVHHKPQQ---L 122
VPID C + ++ L Q T GVH ++ +
Sbjct: 206 RVPIDLCTVCPGRTPQSSSPSNDRSSSSSSNASPTSATLLVQLCEATRGVHIPSKRCSTV 265
Query: 123 DGLFQYLLTIFGT-DLHSRNFLQL-PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
GLFQ L+ F L R L+ P +D + C CH +D GY+CSVCL IYC
Sbjct: 266 GGLFQSLMFHFSVPSLGEREVLKTRPSARLLDMGSLCACHGLPVDRGYVCSVCLCIYCND 325
Query: 181 LKK-CSTCGSVFGQAQT 196
C CG+ F + T
Sbjct: 326 SSGVCRKCGARFKRTAT 342
>gi|294889292|ref|XP_002772747.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239877280|gb|EER04563.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 383
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 56/197 (28%), Positives = 81/197 (41%), Gaps = 39/197 (19%)
Query: 35 LLSGSLSMALCYIQRVFRSGLLHPQPRILCL------QGSPDGPEQYVAIMNAIFSAQRS 88
L++ LS+ALCY+ + L P C+ G + + V + N ++AQ
Sbjct: 148 LIAAGLSLALCYLNK----HLTIPGRARCCIVEAVSVYGEANYASEAVPLANCGWAAQDL 203
Query: 89 MVPIDSCYL-----------------------GAQNSAFLQQASYITGGVHHKPQQ---L 122
VPID C + ++ L Q T GVH ++ +
Sbjct: 204 RVPIDLCTVCPGRTPQSSSPSNDRSSSSSSNASPTSATLLVQLCEATRGVHIPSKRCSTV 263
Query: 123 DGLFQYLLTIFGT-DLHSRNFLQL-PKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKH 180
GLFQ L+ F L R L+ P +D + C CH +D GY+CSVCL IYC
Sbjct: 264 GGLFQSLMFHFSVPSLGEREVLKTRPSARLLDMGSLCACHGLPVDRGYVCSVCLCIYCND 323
Query: 181 LKK-CSTCGSVFGQAQT 196
C CG+ F + T
Sbjct: 324 SSGVCRKCGARFKRTAT 340
>gi|403223596|dbj|BAM41726.1| uncharacterized protein TOT_040000918 [Theileria orientalis strain
Shintoku]
Length = 332
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 78/168 (46%), Gaps = 33/168 (19%)
Query: 36 LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDG-PEQYVAIMNAIFSAQRSMVPIDS 94
L+ ++SM Y+ R+ +S + I+ S + QY+ +MN F+A + + I++
Sbjct: 141 LTSAISMGCLYLNRITKSKAGFGRKIIIFDVSSRENYKSQYIGLMNIAFTALKQNITINT 200
Query: 95 CYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRN----------FLQ 144
LG Q S L+Q S IT + LD L IF +D+ + N F
Sbjct: 201 LALG-QPSRILEQLSTITNA-----KYLD-----LSKIFQSDVETSNIEQSLSQLISFWL 249
Query: 145 LP-KPVG------VDFR----ASCFCHKNTIDMGYICSVCLSIYCKHL 181
LP + VG + F A C+CH T+++ Y+C C ++YC +
Sbjct: 250 LPSEEVGEILSTKLSFEFGNTAICYCHYKTVEVSYLCPCCFAVYCSEV 297
>gi|198471480|ref|XP_002133743.1| GA22626 [Drosophila pseudoobscura pseudoobscura]
gi|198145935|gb|EDY72370.1| GA22626 [Drosophila pseudoobscura pseudoobscura]
Length = 390
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 70/159 (44%), Gaps = 9/159 (5%)
Query: 34 SLLSGSLSMALCYIQRVFR-----SGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRS 88
SLL+ SL +ALCYIQ+ R S Q RIL + GS + Y N I A +
Sbjct: 157 SLLAESLGLALCYIQKRRRQIPNTSHGRSVQGRILIVTGSVLHEDVYKNYDNVIQEALET 216
Query: 89 MVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKP 148
V I+ C + LQ+A+ T G++ L L+ F L+ ++ P+
Sbjct: 217 NVTINVCAVKVPKQLPLQRATDTTDGLYFCTNDTMSLSGDLVDHF---LYPTYLIKSPRS 273
Query: 149 VGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTC 187
+ A C CH +G++C C + CKH C C
Sbjct: 274 -HIAVPARCACHGFGNKIGFVCGKCKLVLCKHSPYCDDC 311
>gi|449019603|dbj|BAM83005.1| similar to TFIIH subunit TFB4 [Cyanidioschyzon merolae strain 10D]
Length = 399
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 69/165 (41%), Gaps = 25/165 (15%)
Query: 56 LHPQPRILCLQG-SPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGA--------QNSAFLQ 106
L P+ R++ + P + V + NA F AQ+ + +D + A Q A LQ
Sbjct: 231 LSPRARLVVISAPGPFDVAESVPLWNAAFCAQQHQLLVDVVRIEATSEPGTKPQARAALQ 290
Query: 107 QASYITGGVHHKPQ-QLDGLFQYLLTIFGTDLHSRNFLQLPKPV---------------G 150
Q +Y T G++ + + LF +L T + R +L P
Sbjct: 291 QLAYATQGIYLECRCACAKLFTHLATYIVPNRLERRYLTRRIPELGSYASGTERCGVVES 350
Query: 151 VDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFGQAQ 195
VD RA+CF + MGY+CS CLS+Y C C + A+
Sbjct: 351 VDLRATCFQSGKLVHMGYVCSRCLSVYSTPQAACIVCDARLDAAE 395
>gi|392576127|gb|EIW69258.1| hypothetical protein TREMEDRAFT_62112 [Tremella mesenterica DSM
1558]
Length = 297
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 21/144 (14%)
Query: 4 LCATLLQNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSG--LLHPQPR 61
L + L Q L M+++ G +E +G L +++ ALCY+ R+ S L R
Sbjct: 107 LDSGLEQGLRGMMSEE---GAREGKGLNEPPALVAAMTKALCYLNRLSPSTATLQTTDTR 163
Query: 62 ILCLQGSPDGPEQ-----------YVAIMNAIFSAQRSMVPID-----SCYLGAQNSAFL 105
+L + +P YV +MN +F+AQ+ +PID Y FL
Sbjct: 164 MLIINATPGSSFSESDSSGGLRGGYVGLMNCVFAAQKGKIPIDVLTLPPSYTQTSPPIFL 223
Query: 106 QQASYITGGVHHKPQQLDGLFQYL 129
QQA+++TGG++ + G+ QYL
Sbjct: 224 QQAAHLTGGMYWRWNGRGGVLQYL 247
>gi|71029634|ref|XP_764460.1| hypothetical protein [Theileria parva strain Muguga]
gi|68351414|gb|EAN32177.1| hypothetical protein, conserved [Theileria parva]
Length = 348
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 78/178 (43%), Gaps = 30/178 (16%)
Query: 36 LSGSLSMALCYIQRV-FRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 94
L+ +++ Y+ R+ R+ + I + S + QY+ +MN +SA + I++
Sbjct: 157 LTSAIATGFLYLNRIRMRNEGYGRKIIIFDVSTSENYKSQYIGLMNIAYSALAQNITINT 216
Query: 95 CYLGAQNSAFLQQASYITGGVH--------------HKPQQLDGL--FQYLLTIFGTDLH 138
LG Q S L+Q S IT G + H Q L L F YL + ++L
Sbjct: 217 FALG-QPSRILEQLSSITRGKYLLVSKVLNFDLNFSHINQYLSQLITFWYLPSEGMSELL 275
Query: 139 SRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKK-------CSTCGS 189
S N VG+ C+CH ++D+ Y+C C ++YC + C CGS
Sbjct: 276 STNLSFDFGNVGI-----CYCHYKSVDVSYLCPCCFAVYCSEIDDKGKYRIICMVCGS 328
>gi|312078926|ref|XP_003141952.1| basic transcription factor 2 [Loa loa]
Length = 253
Score = 53.5 bits (127), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 71/145 (48%), Gaps = 23/145 (15%)
Query: 34 SLLSGSLSMALCYIQR-----------VFR--SGLLHPQPRILCLQGSPDGPEQYVAIMN 80
++ + ++++A+C+I+ +FR + + + RIL + D ++ +MN
Sbjct: 101 AVFAPAVAIAICHIKDDRAILNNISSDIFRYKNEVDNGDGRILIINIGSDLRGEHSILMN 160
Query: 81 AIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLL--TIFGTDLH 138
F+A + + ID +G + S LQQAS ITGG + ++ L +Y + T+ L
Sbjct: 161 IFFAAHKHNILIDVANIG-EASTILQQASDITGGTYFSVKKPKQLLKYTMCFTLGKASLR 219
Query: 139 SRNFLQLPKP---VGVDFRASCFCH 160
S P P VD+RASC CH
Sbjct: 220 S----AFPSPSSSTSVDYRASCHCH 240
>gi|402471701|gb|EJW05288.1| hypothetical protein EDEG_00646 [Edhazardia aedis USNM 41457]
Length = 222
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/167 (26%), Positives = 70/167 (41%), Gaps = 25/167 (14%)
Query: 28 EGRIACSLLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 87
E ++ S L+ ++ LC I + S + +L L + Q + + A A+
Sbjct: 68 EYKVCSSNLAQDIAYTLCLINKNRDSS--EARIFVLNLDYQDNNTIQLLKVSTA---AKN 122
Query: 88 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPK 147
+ IDS Y Q S L Q ITGG++ + + +FG F +L
Sbjct: 123 MKIRIDS-YSSVQ-SPVLSQVCNITGGIYRTSLDI------FVNLFG-------FYELGD 167
Query: 148 PVGVDF-----RASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGS 189
G D C CH N I +G +C +CL+IYC+ + C C +
Sbjct: 168 ESGTDMSGCFGNVKCICHGNIITLGLVCPICLAIYCRTVPICRVCKT 214
>gi|207340364|gb|EDZ68737.1| YPR056Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 230
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 56/106 (52%), Gaps = 6/106 (5%)
Query: 10 QNLEEFMNKDEQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRSGLLHP-QPRILCLQ-- 66
+ L E + K +L K++ E S L+G++S L Y+ R+ + + + R+L L
Sbjct: 123 ETLVEEIYKLFELEKKQIEQNSQRSTLAGAMSAGLTYVNRISKESVTTSLKSRLLVLTCG 182
Query: 67 --GSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL-GAQNSAFLQQAS 109
S D QY+ IMN IFSA + PID + G++ S FLQQ +
Sbjct: 183 SGSSKDEIFQYIPIMNCIFSATKMKCPIDVVKIGGSKESTFLQQTT 228
>gi|224106784|ref|XP_002333630.1| predicted protein [Populus trichocarpa]
gi|222837853|gb|EEE76218.1| predicted protein [Populus trichocarpa]
Length = 125
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 39/74 (52%), Gaps = 11/74 (14%)
Query: 70 DGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYL 129
DGPEQ+VA+MNA F S+VPIDSC QAS +TG V+ K QQ+ +
Sbjct: 33 DGPEQHVAVMNATFCTPCSLVPIDSC-----------QASCVTGCVYVKSQQVSLVNHRP 81
Query: 130 LTIFGTDLHSRNFL 143
F ++ N L
Sbjct: 82 FISFAPEVAVENLL 95
>gi|403292309|ref|XP_003937192.1| PREDICTED: tectonic-2 [Saimiri boliviensis boliviensis]
Length = 940
Score = 46.6 bits (109), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 50/86 (58%), Gaps = 4/86 (4%)
Query: 8 LLQNLEEFMNKD--EQLGKQEPEGRIACSLLSGSLSMALCYIQRVFRS--GLLHPQPRIL 63
LL + E + ++ + + K + +G+ +LL+GSL+ ALCYI R+ + + RIL
Sbjct: 104 LLTSANEVIAEEIKDLMTKSDIKGQHTETLLAGSLAKALCYIHRMNKEIKDNQEMKSRIL 163
Query: 64 CLQGSPDGPEQYVAIMNAIFSAQRSM 89
++ + D QY+ MN IF+AQ+ +
Sbjct: 164 VIKAAEDSALQYMNFMNVIFAAQKQL 189
>gi|440302599|gb|ELP94906.1| RNA polymerase II transcription factor B subunit, partial
[Entamoeba invadens IP1]
Length = 188
Score = 45.8 bits (107), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 6/98 (6%)
Query: 36 LSGSLSMALCYIQRVFRSG-LLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 94
LS ++S A+C I +S LH +I+ + + ++ +MN IF+AQ+ V ID+
Sbjct: 89 LSAAMSSAMCRINSYSKSNPTLHK--KIVAISTPLNAAPTFITVMNNIFAAQKIGVSIDT 146
Query: 95 CYLGAQNS--AFLQQASYITGGVHHKPQQLDGLFQYLL 130
+ QQASY+T G+++ LD L LL
Sbjct: 147 IVISDNEEPCTICQQASYLTNGIYNM-SSLDTLLPRLL 183
>gi|160331405|ref|XP_001712410.1| hypothetical protein HAN_2g261 [Hemiselmis andersenii]
gi|159765858|gb|ABW98085.1| hypothetical protein HAN_2g261 [Hemiselmis andersenii]
Length = 293
Score = 45.8 bits (107), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 59/139 (42%), Gaps = 22/139 (15%)
Query: 68 SPDGPEQYVAIMNAIFS-AQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLF 126
S ++++A N +F AQ++ D+ G ++ F S T G++ +P L LF
Sbjct: 162 SSKTRKEHIAFSNQLFFLAQKNKFSFDTLLFGRKDCFFFHFLSEKTKGIYCRP--LKNLF 219
Query: 127 QY---------LLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDM----GYICSVC 173
+ LL++F T SR F LP F KNT ++ + C +C
Sbjct: 220 ELAFTEEFITILLSLFLTSPFSREFYILP------FSTKILNKKNTTNLIAKKKFSCPIC 273
Query: 174 LSIYCKHLKKCSTCGSVFG 192
SI+ C CG VF
Sbjct: 274 FSIFNFLFTNCFVCGFVFS 292
>gi|328769012|gb|EGF79057.1| hypothetical protein BATDEDRAFT_90182 [Batrachochytrium
dendrobatidis JAM81]
Length = 153
Score = 45.4 bits (106), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 32/53 (60%)
Query: 35 LLSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR 87
L+ +S A +I ++ S Q RIL L SPDG QY+ IMNAIF+AQ+
Sbjct: 97 LVLADVSAAPKHINKMQSSTTPDRQSRILVLSVSPDGSAQYIPIMNAIFAAQK 149
>gi|387592672|gb|EIJ87696.1| hypothetical protein NEQG_02243 [Nematocida parisii ERTm3]
gi|387595301|gb|EIJ92926.1| hypothetical protein NEPG_02325 [Nematocida parisii ERTm1]
Length = 210
Score = 44.3 bits (103), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 50/109 (45%), Gaps = 11/109 (10%)
Query: 83 FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 142
F+A++ + +D C + N F+ Q S + + LF+YL+ + D F
Sbjct: 111 FTARKLNIHVD-CII--PNDGFVSQVSDVLKRPAFCISSGESLFEYLMGMLSID-----F 162
Query: 143 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
++ + + C CH I GY+C +CL +YCK + C C + F
Sbjct: 163 GRISRK---NVHRFCICHNKEIQTGYLCPICLGLYCKFVPLCKHCKTRF 208
>gi|378754778|gb|EHY64807.1| hypothetical protein NERG_02210 [Nematocida sp. 1 ERTm2]
Length = 210
Score = 44.3 bits (103), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 49/109 (44%), Gaps = 11/109 (10%)
Query: 83 FSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNF 142
F+A++ + +D C + + F Q S + + LF+YL+ + D F
Sbjct: 111 FTAKKQNIHVD-CVI--PHDGFASQISDVLNRPAFSISSNESLFEYLMGMLSMD-----F 162
Query: 143 LQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
++ K + C CH I GY+C +CL +YCK + C C + F
Sbjct: 163 GRISKK---NLSRFCICHHKEIQTGYLCPICLGLYCKFVPLCKHCKTRF 208
>gi|269860080|ref|XP_002649763.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH, subunit TFB4 [Enterocytozoon
bieneusi H348]
gi|220066822|gb|EED44293.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH, subunit TFB4 [Enterocytozoon
bieneusi H348]
Length = 202
Score = 43.9 bits (102), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 27/45 (60%)
Query: 148 PVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVFG 192
P+ +F +C+C TI +G+ CS+CL+IYCK + C C F
Sbjct: 155 PILNNFGVNCYCCNKTIYLGFCCSICLTIYCKFIPICKKCKIKFN 199
>gi|403359548|gb|EJY79438.1| putative: similar to general transcription factor IIH, polypeptide
3, 34kDa [Oxytricha trifallax]
gi|403360227|gb|EJY79783.1| putative: similar to general transcription factor IIH, polypeptide
3, 34kDa [Oxytricha trifallax]
Length = 255
Score = 42.4 bits (98), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 57/126 (45%), Gaps = 27/126 (21%)
Query: 36 LSGSLSMALCYIQRVFRSGLLHPQP----RILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
LS ++ A+C+ ++ + RI+ ++ + P QY +++NAIF+AQ+ +
Sbjct: 116 LSTAMMKAVCFNKKWIEDVAVKRSESFSARIMIVKFQSEDPSQYSSLVNAIFAAQKLKIS 175
Query: 92 IDSCYL--------------------GAQNSAFLQQASYITGGVHHK--PQQLDG-LFQY 128
+DS + + S+ LQQA+ +T G +K LD LF
Sbjct: 176 VDSLIIQYAPPQAIQEPQDGKQQPASQEEESSLLQQAALLTNGTSYKLFSNNLDSELFPT 235
Query: 129 LLTIFG 134
LL +F
Sbjct: 236 LLELFS 241
>gi|429963037|gb|ELA42581.1| hypothetical protein VICG_00333 [Vittaforma corneae ATCC 50505]
Length = 215
Score = 42.4 bits (98), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 28/118 (23%), Positives = 48/118 (40%), Gaps = 16/118 (13%)
Query: 74 QYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIF 133
QY+ + +F AQ +PI + ++ +GG+ + +FQ L
Sbjct: 110 QYLEYLKCMFVAQHRKIPIHG--FSLHRNILVRMCCEGSGGIFLESCSFSDMFQLL---- 163
Query: 134 GTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCGSVF 191
G ++ Q+ C C N + +G +C VCL +YCK + C C + F
Sbjct: 164 GNRTKKKDAYQI----------KCACCNNFVTLGLVCPVCLLVYCKFMPVCKKCKTKF 211
>gi|403376319|gb|EJY88136.1| putative: similar to general transcription factor IIH, polypeptide
3, 34kDa [Oxytricha trifallax]
Length = 230
Score = 42.4 bits (98), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 57/126 (45%), Gaps = 27/126 (21%)
Query: 36 LSGSLSMALCYIQRVFRSGLLHPQP----RILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 91
LS ++ A+C+ ++ + RI+ ++ + P QY +++NAIF+AQ+ +
Sbjct: 91 LSTAMMKAVCFNKKWIEDVAVKRSESFSARIMIVKFQSEDPSQYSSLVNAIFAAQKLKIS 150
Query: 92 IDSCYL--------------------GAQNSAFLQQASYITGGVHHK--PQQLDG-LFQY 128
+DS + + S+ LQQA+ +T G +K LD LF
Sbjct: 151 VDSLIIQYAPPQAIQEPQDGKQQPASQEEESSLLQQAALLTNGTSYKLFSNNLDSELFPT 210
Query: 129 LLTIFG 134
LL +F
Sbjct: 211 LLELFS 216
>gi|68065658|ref|XP_674813.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56493631|emb|CAH95047.1| hypothetical protein PB001049.00.0 [Plasmodium berghei]
Length = 89
Score = 41.2 bits (95), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 38/89 (42%), Gaps = 13/89 (14%)
Query: 125 LFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVCLSIYC------ 178
+F +L + S +L + V C CH ID+ YICS CL+IYC
Sbjct: 2 MFWFLPSTHSRKYFSNTYLNEDTNIAV-----CTCHNKQIDIAYICSCCLAIYCSEKDAQ 56
Query: 179 --KHLKKCSTCGSVFGQAQTQSDEPSATN 205
K CS C + F ++ ++ S N
Sbjct: 57 TNKERISCSVCKTRFTKSLLRNKTVSDLN 85
>gi|429965399|gb|ELA47396.1| hypothetical protein VCUG_01165 [Vavraia culicis 'floridensis']
Length = 215
Score = 40.0 bits (92), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 52/136 (38%), Gaps = 26/136 (19%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQ 120
RI+ + S + + Y+ + + F A+R Y N + L HHK
Sbjct: 99 RIIIISLSRENNKDYLKYLKSAFVAKR--------YSDRYNISVLS---------HHKNP 141
Query: 121 QLDGLFQYLLTIFGTDLHSRNFLQL---PKPVGVDF-RASCFCHKNTIDMGYICSVCLSI 176
L + F + FLQ+ KP + F C CH I G +C VCLSI
Sbjct: 142 ALSEI-----GCFYNNFALSTFLQILSGKKPQKIFFCSTKCSCHDREILYGLVCPVCLSI 196
Query: 177 YCKHLKKCSTCGSVFG 192
YC + C C F
Sbjct: 197 YCSLIPICKRCRIRFN 212
>gi|164662823|ref|XP_001732533.1| hypothetical protein MGL_0308 [Malassezia globosa CBS 7966]
gi|159106436|gb|EDP45319.1| hypothetical protein MGL_0308 [Malassezia globosa CBS 7966]
Length = 276
Score = 39.3 bits (90), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 47/101 (46%), Gaps = 29/101 (28%)
Query: 19 DEQLGKQEPEGRIACSLLSGSLSMALCYIQRVF-----------RSGLLHPQPR------ 61
D + G + P G + +L++ALC++ R+ SG L + R
Sbjct: 143 DARDGARGPVGMVR------ALALALCHMHRLSTVLSESTTTDPHSGALGARTRRGRNTS 196
Query: 62 ----ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID--SCY 96
IL L +PD QYV +MN IFSAQ+ + +D SC+
Sbjct: 197 FLQRILVLSATPDVSAQYVPMMNCIFSAQKQGIQVDARSCF 237
>gi|84997465|ref|XP_953454.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65304450|emb|CAI76829.1| hypothetical protein, conserved [Theileria annulata]
Length = 391
Score = 39.3 bits (90), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 63/164 (38%), Gaps = 37/164 (22%)
Query: 62 ILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYLGAQNSAFLQQASYITGG------- 114
I + S + QY+ +MN +SA + I++ LG Q S L+Q S IT
Sbjct: 209 IFDISNSENYKSQYIGLMNIAYSALSQNITINTFALG-QPSRILEQLSAITNAKYLLVSK 267
Query: 115 -------VHHKPQQLDGLFQYLL--TIFGTDLHSRNFLQ-LPKPVGVDF--RASCFCHKN 162
+H Q L L L+ I L S + L + DF C+CH
Sbjct: 268 LLNFDLNFNHINQYLTQLITVLILFIILFWYLPSEGMSEMLSTNLSFDFGNVGICYCHYK 327
Query: 163 TIDMGYICSVCLS----------IYCKHLKK-------CSTCGS 189
++D+ Y+C C + +YC + C CGS
Sbjct: 328 SVDVTYLCPCCFAGNFYSLIRFIVYCSEIDDKGKYRIICMVCGS 371
>gi|195389498|ref|XP_002053413.1| GJ23349 [Drosophila virilis]
gi|194151499|gb|EDW66933.1| GJ23349 [Drosophila virilis]
Length = 490
Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 28/62 (45%)
Query: 114 GVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRASCFCHKNTIDMGYICSVC 173
G + +GL +L F L PK +D RA C+CH I++G CS C
Sbjct: 9 GFYFDTSAFEGLCMNVLCQFLASTQRVYHLNYPKQSPMDLRALCYCHNKRIEIGRACSNC 68
Query: 174 LS 175
LS
Sbjct: 69 LS 70
>gi|413935397|gb|AFW69948.1| hypothetical protein ZEAMMB73_462205, partial [Zea mays]
Length = 22
Score = 38.9 bits (89), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 16/22 (72%), Positives = 20/22 (90%)
Query: 166 MGYICSVCLSIYCKHLKKCSTC 187
MGY+CSVCLSI+C++ K CSTC
Sbjct: 1 MGYVCSVCLSIFCRNQKTCSTC 22
>gi|403347082|gb|EJY72956.1| RNA polymerase II transcription factor B subunit 4 [Oxytricha
trifallax]
Length = 149
Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 39/78 (50%), Gaps = 20/78 (25%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYL-------------GAQ------- 100
RI+ ++ + P QY +++NAIF+AQ+ + +DS + G Q
Sbjct: 32 RIMIVKFQSEDPTQYSSLVNAIFAAQKLKISVDSLIIQYAPPQAIQEPQDGKQQPASQDE 91
Query: 101 NSAFLQQASYITGGVHHK 118
S+ LQQA+ +T G +K
Sbjct: 92 ESSLLQQAALLTNGTSYK 109
>gi|336271807|ref|XP_003350661.1| hypothetical protein SMAC_02333 [Sordaria macrospora k-hell]
gi|380094822|emb|CCC07324.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 409
Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 30/48 (62%), Gaps = 1/48 (2%)
Query: 88 SMVPIDSCYLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGT 135
+ + ID+ L ++ FLQQASYIT G + Q+ GL QYL+ FG+
Sbjct: 234 ARIAIDTLALRG-SATFLQQASYITRGTFIRAQEPRGLLQYLMFGFGS 280
>gi|291000862|ref|XP_002682998.1| RNA polymerase II transcription factor TFIIH subunit [Naegleria
gruberi]
gi|284096626|gb|EFC50254.1| RNA polymerase II transcription factor TFIIH subunit [Naegleria
gruberi]
Length = 349
Score = 37.7 bits (86), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 18/32 (56%)
Query: 157 CFCHKNTIDMGYICSVCLSIYCKHLKKCSTCG 188
C CHK GYIC C S YC +CSTCG
Sbjct: 240 CLCHKKITPGGYICPRCKSKYCDLPVECSTCG 271
>gi|308159753|gb|EFO62274.1| TFIIH P34 [Giardia lamblia P15]
Length = 279
Score = 37.7 bits (86), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 62/148 (41%), Gaps = 22/148 (14%)
Query: 61 RILCLQGSPDGPEQYVAIMNAIFSAQRSMVPID--SCYLGAQNSAFLQQASYITGGVHH- 117
RIL + S + V +A+FSA+R V ID S + FL+Q +Y+T
Sbjct: 133 RILIITASSVQDSEVVN--SALFSAKRFNVKIDIISLTTAPDVNIFLKQVAYLTASKFSN 190
Query: 118 --KPQQLDGLFQYLLT-IF-GTDLHSRNFLQLPK--PVGV-------DFRASC-FCHKNT 163
KP+ Y T IF + LQL VGV DF C CH N
Sbjct: 191 VPKPKTSASSVSYFTTHIFKASPAAEWRSLQLNDGASVGVYHREKRLDFMGECSVCHTN- 249
Query: 164 IDMGYICSVCLSIYCKHLKK--CSTCGS 189
I G C+VC ++ C K C TC +
Sbjct: 250 ISHGITCAVCFTLQCDACFKGSCPTCST 277
>gi|118401911|ref|XP_001033275.1| hypothetical protein TTHERM_00420200 [Tetrahymena thermophila]
gi|89287623|gb|EAR85612.1| hypothetical protein TTHERM_00420200 [Tetrahymena thermophila
SB210]
Length = 351
Score = 37.4 bits (85), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 35/69 (50%), Gaps = 12/69 (17%)
Query: 124 GLFQYL--LTIFGTDLHSRNFLQLPK-PV-------GVDFRASCFCHKNTID--MGYICS 171
LF Y+ L + D+ ++ QL K PV V FR +C CH +D + Y+CS
Sbjct: 270 DLFNYISILMLTSGDMLAKPEQQLTKNPVFRPAINEKVQFRVNCSCHNTPMDSDIAYVCS 329
Query: 172 VCLSIYCKH 180
CL +YCK
Sbjct: 330 DCLFLYCKQ 338
>gi|443898660|dbj|GAC75994.1| vacuolar sorting protein VPS36 [Pseudozyma antarctica T-34]
Length = 804
Score = 36.2 bits (82), Expect = 9.1, Method: Composition-based stats.
Identities = 16/47 (34%), Positives = 24/47 (51%)
Query: 167 GYICSVCLSIYCKHLKKCSTCGSVFGQAQTQSDEPSATNRKRKTTDA 213
G C+VC + + KC CGS G + + S+ + RK+TDA
Sbjct: 334 GLACNVCTFLNHPSMDKCEMCGSSLGNSWPERPSTSSLSTSRKSTDA 380
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.136 0.419
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,302,014,962
Number of Sequences: 23463169
Number of extensions: 133501140
Number of successful extensions: 243393
Number of sequences better than 100.0: 393
Number of HSP's better than 100.0 without gapping: 376
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 242480
Number of HSP's gapped (non-prelim): 430
length of query: 213
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 77
effective length of database: 9,168,204,383
effective search space: 705951737491
effective search space used: 705951737491
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 73 (32.7 bits)