BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy13810
(386 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|350413829|ref|XP_003490125.1| PREDICTED: general transcription factor IIH subunit 4-like [Bombus
impatiens]
Length = 467
Score = 540 bits (1392), Expect = e-151, Method: Compositional matrix adjust.
Identities = 268/418 (64%), Positives = 317/418 (75%), Gaps = 35/418 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL IAKHYV+RLLFVEQPVPQAVIASW SK H +EH++ L EL
Sbjct: 49 ELPVIAKHYVMRLLFVEQPVPQAVIASWCSKLHFEEHQKVVLILNELNVWKEASIPGGLP 108
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWILNTTFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168
Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
Q EGISADAVR L +E +PVIT+ GFQFLLLET +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEAR 228
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
GLDLVECLTFLFQL FST G DYSTEGMS+ L FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLVECLTFLFQLNFSTLGKDYSTEGMSEGLSTFLQHLREFGLVYQRKRKAGRFYPTRL 288
Query: 213 ALNMATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
ALN+AT R+P G+++VETN+RVYAYT+SNL+VAL+GLFCE++YRFPNL V +
Sbjct: 289 ALNIATGQNKPLSRDPEKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSI 348
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHA---LPSVSCP-LPPVVSDQIRLWEGERERLTTS 325
LTRDSVR AL+SGITAAQI+G+L+ HA + P LPP + DQI+LWE ER R S
Sbjct: 349 LTRDSVRQALKSGITAAQIVGYLQQHAHGKMIEAGPPVLPPTIVDQIKLWENERNRFIFS 408
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
EGVLYSQF+SQ DFE LRD+A GVL WQNE+KRT+VVTK GH D+KKFW + KGS
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHALSTGVLIWQNERKRTIVVTKAGHDDVKKFWKRYSKGS 466
>gi|242021693|ref|XP_002431278.1| TFIIH basal transcription factor complex p52 subunit, putative
[Pediculus humanus corporis]
gi|212516535|gb|EEB18540.1| TFIIH basal transcription factor complex p52 subunit, putative
[Pediculus humanus corporis]
Length = 467
Score = 540 bits (1390), Expect = e-151, Method: Compositional matrix adjust.
Identities = 262/417 (62%), Positives = 326/417 (78%), Gaps = 34/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +IA+HYV RLLFVEQPVPQAVIASW+S+TH+KEH AT LT+LR
Sbjct: 50 ELPQIARHYVSRLLFVEQPVPQAVIASWISQTHSKEHLTATKVLTDLRVWQEAAIPGGLP 109
Query: 49 ---------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
+N K++LLGGG+PW+MSA+LE D+K RD+ +L+ YA+ERWEC+L +MVGSQ
Sbjct: 110 GWILNSVFKKNMKISLLGGGRPWTMSAQLEPDTKPRDIPYLDNYAMERWECVLHYMVGSQ 169
Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
EGISADAVR L +E + VIT++GFQFLLL+T +QVWYF+L+YL+TV ++
Sbjct: 170 AQEGISADAVRILLHANLMKRDEEDGSCVITREGFQFLLLDTASQVWYFMLQYLDTVSAR 229
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
LDLVECLTFLFQL FST G DYST GMS+ L VFLQHLREFGL+YQRKR+ GRFYPT+L
Sbjct: 230 NLDLVECLTFLFQLSFSTLGKDYSTIGMSEGLLVFLQHLREFGLIYQRKRRGGRFYPTRL 289
Query: 213 ALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
ALN+A +L+Q+ + G++I+ETN+RVYAYTDSNL+VAL+GLFCE++YRFPNL+VG++
Sbjct: 290 ALNIACGENKSLQQMNKEGYIIIETNYRVYAYTDSNLQVALLGLFCEMLYRFPNLSVGLI 349
Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALP---SVSCP-LPPVVSDQIRLWEGERERLTTSE 326
TRDSVR A +SGITA QI+GFLRLHA P +V P LPP V DQI+LWE E RL S+
Sbjct: 350 TRDSVRQAFKSGITAEQIVGFLRLHAHPRMIAVGPPTLPPTVVDQIKLWENELNRLVYSD 409
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLYSQF+SQADFE LRD A +LGVL W+N+KKRT+VVTK GH D+KKFW ++ K +
Sbjct: 410 GVLYSQFLSQADFEALRDRANELGVLVWENDKKRTMVVTKSGHDDVKKFWKSYSKAN 466
>gi|340710070|ref|XP_003393621.1| PREDICTED: general transcription factor IIH subunit 4-like [Bombus
terrestris]
Length = 467
Score = 539 bits (1389), Expect = e-151, Method: Compositional matrix adjust.
Identities = 267/418 (63%), Positives = 317/418 (75%), Gaps = 35/418 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL IAKHYV+RLLFVEQPVPQAVIASW SK H +EH++ L EL
Sbjct: 49 ELPVIAKHYVMRLLFVEQPVPQAVIASWCSKLHFEEHQKVVLILNELNVWKEASIPGGLP 108
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWILNTTFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168
Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
Q EGISADAVR L +E +PVIT+ GFQFLLLET +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEAR 228
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
GLDLVECLTFLFQL FST G DYSTEGMS+ L FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLVECLTFLFQLNFSTLGKDYSTEGMSEGLSTFLQHLREFGLVYQRKRKAGRFYPTRL 288
Query: 213 ALNMATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
ALN+AT R+P G+++VETN+RVYAYT+SNL+VAL+GLFCE++YRFPNL V +
Sbjct: 289 ALNIATGQNKPLSRDPEKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSI 348
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHA---LPSVSCP-LPPVVSDQIRLWEGERERLTTS 325
LTRDSVR AL+SGITAAQI+G+L+ HA + P LPP + DQI+LWE ER R +
Sbjct: 349 LTRDSVRQALKSGITAAQIVGYLQQHAHGKMIEAGPPVLPPTIVDQIKLWENERNRFIFN 408
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
EGVLYSQF+SQ DFE LRD+A GVL WQNE+KRT+VVTK GH D+KKFW + KGS
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHALSTGVLIWQNERKRTIVVTKAGHDDVKKFWKRYSKGS 466
>gi|345492819|ref|XP_001602702.2| PREDICTED: general transcription factor IIH subunit 4-like [Nasonia
vitripennis]
Length = 428
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 265/419 (63%), Positives = 315/419 (75%), Gaps = 35/419 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+HYV+RLLFVEQPVPQAVIASW SK H + H + EL
Sbjct: 10 ELPEIARHYVMRLLFVEQPVPQAVIASWCSKLHVENHLNVVQVMNELNIWKEAAIPGGLP 69
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
R+N K+ LLGGG PW+MS +LE+DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 70 GWILNATFRKNLKIVLLGGGAPWTMSKQLEIDSKPRDIAFLDSYALERWECVLHYMVGSQ 129
Query: 100 QTEGISADAVRTLFNEGAE-------NPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
Q EGISADAVR L + G +PVIT+ GFQFLLL+T AQVWYFIL+YL+TVE++
Sbjct: 130 QQEGISADAVRILLHAGLMKRDEEDGSPVITQAGFQFLLLDTSAQVWYFILQYLDTVEAR 189
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
GLDLVECLTFLFQL FST G DYST+GMSD L +FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 190 GLDLVECLTFLFQLNFSTLGKDYSTQGMSDGLLMFLQHLREFGLVYQRKRKAGRFYPTRL 249
Query: 213 ALNMAT---RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
ALN+AT + + + + F+IVETN+RVYAYT+SNL+VALIGLFCEL+YRFPNL V +
Sbjct: 250 ALNIATGQDKPISRDLEKERFVIVETNYRVYAYTNSNLQVALIGLFCELLYRFPNLVVAI 309
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSVSCP----LPPVVSDQIRLWEGERERLTTS 325
LTRDSVRAAL+SGITA QI+G+L HA + P LPP + DQI+LWE ER R S
Sbjct: 310 LTRDSVRAALKSGITAVQIVGYLNQHAHNKMIDPGPPTLPPTIVDQIKLWENERNRFIFS 369
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQ 384
EGVLYSQF SQ DFE LRD+A LGV+ WQ+++KRT+VVTK GH D+KKFW + KG+
Sbjct: 370 EGVLYSQFHSQIDFEVLRDHAVSLGVMIWQSDRKRTMVVTKTGHDDVKKFWKRYSKGAN 428
>gi|383852611|ref|XP_003701820.1| PREDICTED: general transcription factor IIH subunit 4-like
[Megachile rotundata]
Length = 467
Score = 537 bits (1384), Expect = e-150, Method: Compositional matrix adjust.
Identities = 266/418 (63%), Positives = 317/418 (75%), Gaps = 35/418 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL IAKHYV+RLLFVEQPVPQAVIASW SK H +EH++ L EL
Sbjct: 49 ELPVIAKHYVMRLLFVEQPVPQAVIASWCSKLHFEEHQKVVSILNELNVWKEASIPGGLP 108
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWILNTTFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168
Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
Q EGISADAVR L +E +PVIT+ GFQFLLLET +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEAR 228
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
GLDLVECLTFLFQL FST G DYSTEGMS+ L FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLVECLTFLFQLNFSTLGKDYSTEGMSEGLLTFLQHLREFGLVYQRKRKAGRFYPTRL 288
Query: 213 ALNMATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
ALN+AT R+P G+++VETN+RVYAYT+SNL+VAL+GLFCE++YRFPNL V +
Sbjct: 289 ALNIATGQNKPLSRDPDKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSI 348
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHA---LPSVSCP-LPPVVSDQIRLWEGERERLTTS 325
LTRDSVR AL+SGITA+QI+G+L+ HA + P LPP + DQI+LWE ER R S
Sbjct: 349 LTRDSVRQALKSGITASQIVGYLQQHAHSKMIEAGPPVLPPTIVDQIKLWENERNRFIFS 408
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
EGVLYSQF+SQ DFE LRD+A GVL WQ+E+KRT+VVTK GH D+KKFW + KGS
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHALSTGVLIWQSERKRTMVVTKAGHDDVKKFWKRYSKGS 466
>gi|48102983|ref|XP_395476.1| PREDICTED: general transcription factor IIH subunit 4 [Apis
mellifera]
Length = 467
Score = 536 bits (1380), Expect = e-150, Method: Compositional matrix adjust.
Identities = 266/418 (63%), Positives = 316/418 (75%), Gaps = 35/418 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL IAKHYV+RLLFVEQPVPQAVIASW SK H +EH++ L EL
Sbjct: 49 ELPVIAKHYVMRLLFVEQPVPQAVIASWCSKLHFEEHQKVVLVLNELNVWNEASIPGGLP 108
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWILNTTFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168
Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
Q EGISADAVR L +E +PVIT+ GFQFLLLET +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEAR 228
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
GLDLVECLTFLFQL FST G DYSTEGMS+ L FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLVECLTFLFQLNFSTLGKDYSTEGMSEGLLTFLQHLREFGLVYQRKRKAGRFYPTRL 288
Query: 213 ALNMATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
ALN+AT R+P G+++VETN+RVYAYT+SNL+VAL+GLFCE++YRFPNL V +
Sbjct: 289 ALNIATGQNKPLSRDPEKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSI 348
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHA---LPSVSCP-LPPVVSDQIRLWEGERERLTTS 325
LTRDSVR AL+SGITA QI+G+L+ HA + P LPP + DQI+LWE ER R S
Sbjct: 349 LTRDSVRQALKSGITATQIVGYLQQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFIFS 408
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
EGVLYSQF+SQ DFE LRD+A GVL WQ+E+KRT+VVTK GH D+KKFW + KGS
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHALSTGVLIWQSERKRTMVVTKAGHDDVKKFWKRYSKGS 466
>gi|380014187|ref|XP_003691121.1| PREDICTED: general transcription factor IIH subunit 4-like [Apis
florea]
Length = 467
Score = 536 bits (1380), Expect = e-150, Method: Compositional matrix adjust.
Identities = 266/418 (63%), Positives = 316/418 (75%), Gaps = 35/418 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL IAKHYV+RLLFVEQPVPQAVIASW SK H +EH++ L EL
Sbjct: 49 ELPVIAKHYVMRLLFVEQPVPQAVIASWCSKLHFEEHQKVVLILNELNVWNEASIPGGLP 108
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWILNTTFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168
Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
Q EGISADAVR L +E +PVIT+ GFQFLLLET +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEAR 228
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
GLDLVECLTFLFQL FST G DYSTEGMS+ L FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLVECLTFLFQLNFSTLGKDYSTEGMSEGLLTFLQHLREFGLVYQRKRKAGRFYPTRL 288
Query: 213 ALNMATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
ALN+AT R+P G+++VETN+RVYAYT+SNL+VAL+GLFCE++YRFPNL V +
Sbjct: 289 ALNIATGQNKPLSRDPEKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSI 348
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHA---LPSVSCP-LPPVVSDQIRLWEGERERLTTS 325
LTRDSVR AL+SGITA QI+G+L+ HA + P LPP + DQI+LWE ER R S
Sbjct: 349 LTRDSVRQALKSGITATQIVGYLQQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFIFS 408
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
EGVLYSQF+SQ DFE LRD+A GVL WQ+E+KRT+VVTK GH D+KKFW + KGS
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHALSTGVLIWQSERKRTMVVTKAGHDDVKKFWKRYSKGS 466
>gi|307212060|gb|EFN87943.1| General transcription factor IIH subunit 4 [Harpegnathos saltator]
Length = 467
Score = 530 bits (1364), Expect = e-148, Method: Compositional matrix adjust.
Identities = 262/418 (62%), Positives = 315/418 (75%), Gaps = 35/418 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL IAKHYV+RLLFVEQPVPQAVIASW SK + +EH++ L EL
Sbjct: 49 ELPIIAKHYVMRLLFVEQPVPQAVIASWCSKLYFEEHQKVVQILNELYVWKEASIPGGLP 108
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
++N K+ LLGGG+PW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWILNNTFKKNLKIVLLGGGRPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168
Query: 100 QTEGISADAVRTLFNEGAE-------NPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
Q EGISADAVR L + G +P+IT+ GFQFLLL+T +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDETDGSPIITQAGFQFLLLDTASQVWYFILQYLDTIEAR 228
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
GLDL+ECLTFLFQL FST G DYSTEGMS+ L FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLIECLTFLFQLNFSTLGKDYSTEGMSEGLLTFLQHLREFGLVYQRKRKAGRFYPTRL 288
Query: 213 ALNMATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
ALN+AT T R+ G+++VETN+RVYAYT+SNL+VAL+GLFCE++YRFPNL V +
Sbjct: 289 ALNIATGETKPLTRDTDKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMIYRFPNLVVSI 348
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHA----LPSVSCPLPPVVSDQIRLWEGERERLTTS 325
LTRDSVR AL+SGITA QI+G+L+ HA + + LPP + DQI LWE ER R S
Sbjct: 349 LTRDSVRQALKSGITATQIVGYLQQHAHSKTIEAGPPILPPTIVDQIMLWENERNRFLFS 408
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
EGVLYSQF+SQ DFE LRD+A GVL WQNE+KRT+VVTK GH DIKKFW + KGS
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHALTTGVLIWQNERKRTIVVTKAGHDDIKKFWKRYSKGS 466
>gi|307177222|gb|EFN66420.1| General transcription factor IIH subunit 4 [Camponotus floridanus]
Length = 467
Score = 526 bits (1354), Expect = e-147, Method: Compositional matrix adjust.
Identities = 258/418 (61%), Positives = 316/418 (75%), Gaps = 35/418 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL IAKHYV+RLLFVEQPVPQAVIASW SK + +EH++ L +L
Sbjct: 49 ELPIIAKHYVMRLLFVEQPVPQAVIASWCSKLYFEEHQKVVQVLNDLYVWKEASIPGGLP 108
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWILNNIFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168
Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
Q EGISADAVR L +E +P+IT+ GFQFLLL+T +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDEADGSPIITQAGFQFLLLDTASQVWYFILQYLDTIEAR 228
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
GLDLVECLTFLFQL FST G DYSTEGMS+ L FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLVECLTFLFQLNFSTLGKDYSTEGMSEGLLTFLQHLREFGLVYQRKRKAGRFYPTRL 288
Query: 213 ALNMAT---RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
ALN+AT + K + G+++VETN+RVYAYT+SNL+VAL+GLFCE++YRFPN+ V +
Sbjct: 289 ALNIATGENKPLAKDTDKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNVVVSI 348
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCP-LPPVVSDQIRLWEGERERLTTS 325
LTRDS+R AL+SGITA+QI+G+L+ H + P LPP + DQI+LWE ER R S
Sbjct: 349 LTRDSIRQALKSGITASQIVGYLQQHVHSKMIERGPPILPPTIVDQIKLWENERNRFLFS 408
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
EGVLYSQF+SQ DFE LRD+A GVL WQ+E+KRT+VVTK GH D+KKFW + KGS
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHALSTGVLIWQSERKRTMVVTKTGHDDVKKFWKRYSKGS 466
>gi|91094191|ref|XP_971121.1| PREDICTED: similar to TFIIH basal transcription factor complex p52
subunit [Tribolium castaneum]
gi|270010854|gb|EFA07302.1| hypothetical protein TcasGA2_TC015892 [Tribolium castaneum]
Length = 472
Score = 522 bits (1344), Expect = e-145, Method: Compositional matrix adjust.
Identities = 261/421 (61%), Positives = 316/421 (75%), Gaps = 38/421 (9%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL E+A+ YVIR+LFVEQPVPQAV+ASW ++ +KEH + L+EL
Sbjct: 52 ELPELARQYVIRILFVEQPVPQAVVASWGAQAFSKEHVYVSKVLSELSVWQEAAIPGGLL 111
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
++N K+ALLGGGKPWSMS+ LEVDSKARD+ FL+ Y+LERWEC+L +MVGSQ
Sbjct: 112 GWILSVTFKKNLKIALLGGGKPWSMSSALEVDSKARDVSFLDAYSLERWECVLHYMVGSQ 171
Query: 100 QTEGISADAVRTLFNEGA-------ENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
Q EGISADAVR L + G +PVIT+ GFQFLLL+ AQVW+F+L+YL+TVE +
Sbjct: 172 QQEGISADAVRILLHAGLMKRDEEDGSPVITRQGFQFLLLDRQAQVWHFLLQYLDTVEQR 231
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
GL LVECLTFLFQL FST G DYSTEGMS L +FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 232 GLSLVECLTFLFQLSFSTLGKDYSTEGMSPGLLIFLQHLREFGLVYQRKRKAGRFYPTRL 291
Query: 213 ALNM-ATRGTLKQIRE----PGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTV 267
ALN+ ++G +I E G++IVETN+RVYAYTDSNL+VALIGLF ELMYRFPNL V
Sbjct: 292 ALNITCSQGPETRILEEDTPKGYIIVETNYRVYAYTDSNLQVALIGLFTELMYRFPNLVV 351
Query: 268 GVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-----SCPLPPVVSDQIRLWEGERERL 322
GV+TRDS+R AL+ GITA QIIG+L+ HA P + PLPP V DQI+LWE ER RL
Sbjct: 352 GVITRDSIRQALKGGITADQIIGYLKQHAHPQMLEGEAKHPLPPTVVDQIKLWEIERNRL 411
Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
T SEGVLYSQF+SQADF L++YAQ G L W N++KRT+++ K H D+KKFW + KG
Sbjct: 412 TYSEGVLYSQFLSQADFNILKEYAQSNGHLIWCNKEKRTLIINKSAHDDVKKFWKRYSKG 471
Query: 383 S 383
+
Sbjct: 472 N 472
>gi|332030486|gb|EGI70174.1| General transcription factor IIH subunit 4 [Acromyrmex echinatior]
Length = 550
Score = 501 bits (1291), Expect = e-139, Method: Compositional matrix adjust.
Identities = 251/416 (60%), Positives = 310/416 (74%), Gaps = 36/416 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL IAKHYV+RLLFVEQPVPQAVIASW SK + +EH++ L EL
Sbjct: 49 ELPVIAKHYVMRLLFVEQPVPQAVIASWCSKLYFEEHQKVVQVLNELYVWKEASIPGGLP 108
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWTLNSTFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168
Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
Q EGISADAVR L +E +P+IT+ GFQFLLL+T +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDEADGSPIITQAGFQFLLLDTASQVWYFILQYLDTIEAR 228
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
GLDLVECLTFLFQL FST G DYSTEGMS+ L FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLVECLTFLFQLNFSTLGKDYSTEGMSEGLLTFLQHLREFGLVYQRKRKAGRFYPTRL 288
Query: 213 ALNMATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
ALN+AT T R+ G++IVETN+RVYAYT+SNL+VAL+GLFCE++YRFPN+ V +
Sbjct: 289 ALNIATGETKPLTRDTDKEGYIIVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNVVVSI 348
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHA---LPSVSCP-LPPVVSDQIRLWEGERERLTTS 325
LTRDSVR AL+SGITA+QI+G+LR HA + P LPP + DQI+LWE ER R S
Sbjct: 349 LTRDSVRQALKSGITASQIVGYLRQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFLFS 408
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVV-TKQGHSDIKKFWVNHQ 380
EGVLYSQF+SQ DFE LRD+A GVL WQ+E++ T++ T+ S+ ++ + +Q
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHAVSTGVLIWQSERRSTLLADTRNIVSEYRRLFGKYQ 464
>gi|442757633|gb|JAA70975.1| Putative rna polymer [Ixodes ricinus]
Length = 459
Score = 495 bits (1275), Expect = e-137, Method: Compositional matrix adjust.
Identities = 236/415 (56%), Positives = 305/415 (73%), Gaps = 34/415 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL I++HY++RL+FV+QPVPQAV++SW + + KEH E+ + LT L
Sbjct: 43 ELPIISRHYIMRLMFVDQPVPQAVVSSWNEQKYVKEHLESLEALTALHIWADSSLPGGLP 102
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
R+N ++ALLGGG+PW++ + LE D RD FL++YA+ERWEC+L FMVG
Sbjct: 103 GWSLSVVFRKNIQIALLGGGQPWAVYSTLEKDKHGRDAQFLDRYAMERWECVLHFMVGCH 162
Query: 100 QTEGISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
EGISADAVR L + G P+IT +GFQFLL++T +QVW+F+L+YL+T+ES
Sbjct: 163 TKEGISADAVRILLHAGLMKSEEEEGSAPLITMEGFQFLLMDTASQVWHFVLQYLDTLES 222
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+GL+LVECLTFLFQL F T G DYSTEGMS+ L VFLQHLREFGLVYQRKR++GRFYPT+
Sbjct: 223 RGLNLVECLTFLFQLSFLTLGKDYSTEGMSESLLVFLQHLREFGLVYQRKRRSGRFYPTR 282
Query: 212 LALNMATRGTLKQIR--EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+A+ +R E G+++VETN+RVYAYTDS L+VAL+ LFCELMYR PNL VGV
Sbjct: 283 LAINLASGLKETNLRSYESGYIVVETNYRVYAYTDSQLQVALLALFCELMYRLPNLVVGV 342
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
LTR+SVR ALRSGIT+ QII FL++HA P S +PP + DQ+RLWE ER+R E
Sbjct: 343 LTRESVRQALRSGITSNQIIKFLQMHAHPEAQKQSPVIPPTIMDQLRLWELERDRFNFRE 402
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
GVLYSQF+SQ+DF+ LR+YA DLGVL W N KR +VV + GH ++K+FW H++
Sbjct: 403 GVLYSQFISQSDFQLLRNYASDLGVLIWDNPSKRVMVVNRNGHDEVKRFWKRHRQ 457
>gi|157113167|ref|XP_001651923.1| TFIIH basal transcription factor complex p52 subunit [Aedes
aegypti]
gi|108877858|gb|EAT42083.1| AAEL006356-PA [Aedes aegypti]
Length = 487
Score = 495 bits (1274), Expect = e-137, Method: Compositional matrix adjust.
Identities = 243/431 (56%), Positives = 308/431 (71%), Gaps = 49/431 (11%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+ +VIR+LFVEQ +PQAV++SW S+ +AKE+ + LTEL
Sbjct: 55 ELPEIARQFVIRILFVEQAIPQAVVSSWGSQVYAKENVLVSRVLTELGVWRTAGLTGGLE 114
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
++N K+ALLGGG+PWSMS L+ D K+RD++FL+ YA+ RW C+L +MVG+
Sbjct: 115 AWELCPTFKKNLKIALLGGGRPWSMSNALDPDQKSRDIEFLDAYAMSRWRCVLHYMVGAG 174
Query: 99 ----QQTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLE 147
+ EGIS DAVR L +E +PVIT+ GFQFLLL+T AQVW+F+L+YL+
Sbjct: 175 SSKGMEGEGISPDAVRILLHANLMKRDETDGSPVITRQGFQFLLLDTQAQVWHFMLQYLD 234
Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
T E++GLDL ECL+ LFQL FST G DYS+EG+S L FLQHLREFGLVYQRKRK GRF
Sbjct: 235 TCEARGLDLAECLSMLFQLSFSTLGRDYSSEGLSQGLLTFLQHLREFGLVYQRKRKEGRF 294
Query: 208 YPTKLALNMATRGTLKQIRE-------PGFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
YPT+LA N+ ++ ++ I+E G++IVETN+RVYAYTDSNL+VAL+GLF EL+Y
Sbjct: 295 YPTRLAHNITSKNAVQTIQEDGSSVQDKGYIIVETNYRVYAYTDSNLQVALLGLFTELLY 354
Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---------SCPLPPVVSDQ 311
RFPNL VGVLTRDSVR A R GITA QII +L HA P++ PLPP V DQ
Sbjct: 355 RFPNLVVGVLTRDSVRQAFRGGITAEQIISYLEQHAHPTMLNMEQAINSKSPLPPTVVDQ 414
Query: 312 IRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSD 371
I+LWE ER R T +EGV+Y+QF+SQ DF LRDYAQ +GV+TWQNE+ RT+VVTK GH D
Sbjct: 415 IKLWENERNRFTYTEGVVYNQFLSQGDFNTLRDYAQSIGVMTWQNERTRTMVVTKNGHDD 474
Query: 372 IKKFWVNHQKG 382
+K+FW + KG
Sbjct: 475 VKRFWKRYSKG 485
>gi|170039509|ref|XP_001847575.1| TFIIH basal transcription factor complex p52 subunit [Culex
quinquefasciatus]
gi|167863052|gb|EDS26435.1| TFIIH basal transcription factor complex p52 subunit [Culex
quinquefasciatus]
Length = 492
Score = 493 bits (1268), Expect = e-137, Method: Compositional matrix adjust.
Identities = 242/432 (56%), Positives = 305/432 (70%), Gaps = 49/432 (11%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+ +VIR+LFVEQP+P AV++SW S+ +AKE+ + LTEL
Sbjct: 60 ELPEIARQFVIRILFVEQPIPHAVVSSWGSQIYAKENTAVSKVLTELGVWRSTGLTGGLE 119
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
++N K+ALLGGG+PWSMS L+ D K+RD++FL+ YA+ RW C+L +MVG+
Sbjct: 120 GWELCSTFKKNLKIALLGGGRPWSMSNALDPDQKSRDIEFLDGYAMSRWRCVLHYMVGAG 179
Query: 99 ----QQTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLE 147
+ EGIS DAVR L +E +PVIT+ GFQFLLL+T AQVW+F+L+YL+
Sbjct: 180 SSKGMEGEGISPDAVRILLHANLMKRDETDGSPVITRQGFQFLLLDTQAQVWHFMLQYLD 239
Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
T E++GLDL ECL+ LFQL FST G DYS+EG+S L FLQHLREFGLVYQRKRK GRF
Sbjct: 240 TCEARGLDLAECLSMLFQLSFSTLGRDYSSEGLSQGLLTFLQHLREFGLVYQRKRKEGRF 299
Query: 208 YPTKLALNMATRGTLKQIRE-------PGFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
YPT+LA N+ +R + I+E G+++VETN+RVYAYTDSNL+VAL+GLF EL+Y
Sbjct: 300 YPTRLAHNITSRNAVPTIQEDGSAAQDKGYIVVETNYRVYAYTDSNLQVALLGLFTELLY 359
Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---------SCPLPPVVSDQ 311
RFPNL VGVLTRDSVR A R GITA QII +L HA P++ LPP V DQ
Sbjct: 360 RFPNLVVGVLTRDSVRQAFRGGITADQIISYLEQHAHPTMHNVEQTINTKSALPPTVVDQ 419
Query: 312 IRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSD 371
I+LWE ER R T +EGV+Y+QF+SQ DF LRDYAQ +GV+ WQNE+ RT+VVTK GH D
Sbjct: 420 IKLWENERNRFTYTEGVVYNQFLSQGDFNTLRDYAQSIGVMIWQNERTRTMVVTKNGHDD 479
Query: 372 IKKFWVNHQKGS 383
+KKFW + KG
Sbjct: 480 VKKFWKRYSKGG 491
>gi|31201525|ref|XP_309710.1| AGAP010984-PA [Anopheles gambiae str. PEST]
gi|21293295|gb|EAA05440.1| AGAP010984-PA [Anopheles gambiae str. PEST]
Length = 501
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 244/435 (56%), Positives = 308/435 (70%), Gaps = 53/435 (12%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+ +VIR+LFVEQP+PQAV++SW ++ +AKE+ + LTEL
Sbjct: 65 ELPEIARQFVIRILFVEQPIPQAVVSSWATQVYAKENTSVSQVLTELGVWRSAAYPGGLA 124
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
++N K+ALLGGG+PWSMS L+ D K+RD+DFL+ YA+ RW C+L +MVG+
Sbjct: 125 AWELCPTFKKNLKIALLGGGRPWSMSNALDPDQKSRDIDFLDTYAMSRWRCVLHYMVGAG 184
Query: 99 ----QQTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLE 147
+ EGIS DAVR L +E +PVIT+ GFQFLLL+T AQVW+F+L+YL+
Sbjct: 185 SSKGMEGEGISPDAVRILLHANLMKRDESDGSPVITRQGFQFLLLDTQAQVWHFMLQYLD 244
Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
T E++GL+L ECL+ LFQL FST G DYS+EG+S L FLQHLREFGLVYQRKRK GRF
Sbjct: 245 TCEARGLNLPECLSMLFQLSFSTLGRDYSSEGLSPGLLTFLQHLREFGLVYQRKRKEGRF 304
Query: 208 YPTKLALNMATRG-----TLKQIREP------GFLIVETNFRVYAYTDSNLKVALIGLFC 256
YPT+LA N+ ++ TL Q +E G++IVETN+RVYAYTDSNL+VAL+GLF
Sbjct: 305 YPTRLAHNITSKNATHATTLAQDQESNATKDKGYIIVETNYRVYAYTDSNLQVALLGLFT 364
Query: 257 ELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---------SCPLPPV 307
EL+YRFPNL VGVL+RDSVR A R GITA QII +L HA P++ LPP
Sbjct: 365 ELLYRFPNLVVGVLSRDSVRQAFRGGITAEQIISYLEQHAHPTMLTVEQAINSKSSLPPT 424
Query: 308 VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQ 367
V DQI+LWE ER R T +EGV+Y+QF+SQADF LRDYAQ +GV+ WQNE+ RT+VVTK
Sbjct: 425 VVDQIKLWENERNRFTYTEGVVYNQFLSQADFITLRDYAQSIGVMIWQNERIRTMVVTKN 484
Query: 368 GHSDIKKFWVNHQKG 382
GH D+KKFW + KG
Sbjct: 485 GHDDVKKFWKRYSKG 499
>gi|427785675|gb|JAA58289.1| Putative rna polymer [Rhipicephalus pulchellus]
Length = 457
Score = 490 bits (1262), Expect = e-136, Method: Compositional matrix adjust.
Identities = 237/416 (56%), Positives = 306/416 (73%), Gaps = 36/416 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL +I++HY++RLLFVEQPVPQAV++SW + + K+H EA + LT L
Sbjct: 41 ELPKISRHYIMRLLFVEQPVPQAVVSSWNEQKYVKDHLEALEALTALHIWTDASLPGGLP 100
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
R+N ++ALLGGGKPW++ + LE D RD FL+QYA ERWEC+L FMVG
Sbjct: 101 GWSLSAVFRKNIQIALLGGGKPWAVYSALEKDKHGRDAAFLDQYAAERWECVLHFMVGCH 160
Query: 100 QTEGISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
TEGISADAVR L + G +P+IT +GFQFLL++T +QVW+F+L+YL+T+ES
Sbjct: 161 TTEGISADAVRILLHAGLMKSDEGEGSSPLITMEGFQFLLMDTASQVWHFVLQYLDTLES 220
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+GL+LVECLTFLFQL F T G DYSTEGMS+ L VFLQHLREFGLVYQRKR++GRFYPT+
Sbjct: 221 RGLNLVECLTFLFQLSFLTLGKDYSTEGMSESLLVFLQHLREFGLVYQRKRRSGRFYPTR 280
Query: 212 LALNMAT---RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
LA+N+A+ TL+ E G+++VETN+RVYAYT+S L+VAL+ LFCEL+YRFPNL V
Sbjct: 281 LAINLASGLKETTLRSF-EAGYIMVETNYRVYAYTNSQLQVALLALFCELLYRFPNLVVA 339
Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTS 325
LTR+SVR ALRSGIT+ QII FLR++A P + +P + DQ+RLWE ER+R
Sbjct: 340 RLTRESVRQALRSGITSNQIIKFLRMYAHPEALKQTPVIPATIMDQLRLWEMERDRFVFR 399
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
EGVLYSQF+SQ+DF+ LR+YA +LGVL W N KR +VV + GH ++KKFW H++
Sbjct: 400 EGVLYSQFISQSDFQLLRNYASELGVLIWDNPSKRVMVVNRNGHDEVKKFWKRHRQ 455
>gi|357618296|gb|EHJ71332.1| putative TFIIH basal transcription factor complex p52 subunit
[Danaus plexippus]
Length = 603
Score = 486 bits (1251), Expect = e-135, Method: Compositional matrix adjust.
Identities = 241/425 (56%), Positives = 295/425 (69%), Gaps = 42/425 (9%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL E+A+H+VIRLLFVEQPVPQAV+ASW+++THAKE +A + L+EL
Sbjct: 179 ELPELARHFVIRLLFVEQPVPQAVVASWVTQTHAKEQHKACEALSELSVWQEAPIPGGLP 238
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
++N KVALLGGG+PWSMS+ LE D KARD+ FL+ YALERWEC+L +MVGS
Sbjct: 239 GWMLSQSFKKNLKVALLGGGRPWSMSSSLEPDGKARDVSFLDAYALERWECVLHYMVGST 298
Query: 100 QTEGISADAVRTLFNEGAEN-------PVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
QTEGISADAVR L G N VIT+ GFQFLLL T QVW F+ YL T E +
Sbjct: 299 QTEGISADAVRILLQAGLMNRDAEDGTAVITRAGFQFLLLSTAKQVWLFLQHYLHTAEKR 358
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
L ECL FL+QL FST G DYSTEGMS+ + VFLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 359 SLSAAECLAFLYQLSFSTLGKDYSTEGMSNNMLVFLQHLREFGLVYQRKRKAGRFYPTRL 418
Query: 213 ALNMAT----RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
ALN+ L+ G++I ETN+RVYAYT S L+VAL+GLF EL+YRFPN+ VG
Sbjct: 419 ALNITCVKDGVAPLQTAASSGYIIAETNYRVYAYTTSALQVALLGLFTELVYRFPNVVVG 478
Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVSDQIRLWEGE 318
VLTR+SVRAALR GI+A QII +L H+ P + S LPP V DQIRLWE E
Sbjct: 479 VLTRESVRAALRGGISAQQIITYLEQHSHPQMLKSDQGGIRSSSSLPPTVLDQIRLWESE 538
Query: 319 RERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVN 378
R R T +EGV+Y+QF+SQA+F LRDY + G L W ++ RT+VV + H D+K++W
Sbjct: 539 RNRFTYTEGVVYNQFLSQAEFNVLRDYGRSSGALVWAADRTRTMVVARAAHDDVKRYWKR 598
Query: 379 HQKGS 383
+ K +
Sbjct: 599 YSKAT 603
>gi|346464903|gb|AEO32296.1| hypothetical protein [Amblyomma maculatum]
Length = 436
Score = 469 bits (1206), Expect = e-129, Method: Compositional matrix adjust.
Identities = 230/395 (58%), Positives = 290/395 (73%), Gaps = 34/395 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL I++ Y++RLLFVEQPVPQAVI+SW + + K+H EA + LT L
Sbjct: 42 ELPIISRQYIMRLLFVEQPVPQAVISSWNEQKYVKDHLEALEALTALHIWTDASLPGGLP 101
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
R+N ++ALLGGGKPW++ LE D RD FL+QYA+ERWEC+L FMVG
Sbjct: 102 GWSLSAVFRKNIQIALLGGGKPWAVYNPLEKDKHGRDATFLDQYAVERWECVLHFMVGCH 161
Query: 100 QTEGISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
TEGISADAVR L + G +P+IT +GFQFLL++TP+QVW+F+L+YL+T+ES
Sbjct: 162 TTEGISADAVRILLHAGLMKSEEGEGSSPLITMEGFQFLLMDTPSQVWHFVLQYLDTIES 221
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+GL+LVECLTFLFQL F T G DYSTEGMS+ L VFLQHLREFGLVYQRKR++GRFYPT+
Sbjct: 222 RGLNLVECLTFLFQLSFLTLGKDYSTEGMSESLLVFLQHLREFGLVYQRKRRSGRFYPTR 281
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+A+ R T + E G+++VETN+RVYAYT+S L+VAL+ LFCEL+YRFPNL V
Sbjct: 282 LAINLASGLRETNLRSYEAGYIMVETNYRVYAYTNSQLQVALLALFCELLYRFPNLVVAR 341
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
LTR+SVR ALRSGIT+ QII FLR++A P + +P + DQ+RLWE ER+R E
Sbjct: 342 LTRESVRQALRSGITSNQIIKFLRMYAHPEALKQTPVIPATIMDQLRLWELERDRFVYRE 401
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
GVLYSQF+SQ+DF+ LR+YA DLGVL W N KR
Sbjct: 402 GVLYSQFISQSDFQLLRNYASDLGVLIWDNPSKRV 436
>gi|195019521|ref|XP_001984999.1| GH16811 [Drosophila grimshawi]
gi|193898481|gb|EDV97347.1| GH16811 [Drosophila grimshawi]
Length = 493
Score = 456 bits (1174), Expect = e-126, Method: Compositional matrix adjust.
Identities = 229/432 (53%), Positives = 298/432 (68%), Gaps = 52/432 (12%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+ +VIR+LFV+QPVPQAV++SW ++ AKE EAT LT L
Sbjct: 60 ELPEIARQFVIRILFVDQPVPQAVVSSWGAQRIAKEQMEATSCLTALNVWRVTAIPGGLA 119
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+++ + LLGGGKPW+M+ L+ DSK RD+ FL+ YA+ RW C+L +MVG+
Sbjct: 120 AWELSPTFKKSVRQVLLGGGKPWAMTNTLDKDSKPRDIAFLDTYAMARWRCVLHYMVGTG 179
Query: 99 -----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLE 147
E IS DAVR L + + + IT+ GFQFLLL+T AQVW+F+L+YLE
Sbjct: 180 SRNGGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLE 239
Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
T E +G L ECL+ LFQL FST G DYS+EGMS ++ +FLQHLREFGLV+QRKRK GRF
Sbjct: 240 TCEERGFSLPECLSMLFQLSFSTLGRDYSSEGMSHQMLMFLQHLREFGLVFQRKRKEGRF 299
Query: 208 YPTKLALNM--------ATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
YPT+LALN+ +T ++++E G+++VETN+RVYAYTDS L+VA++GLF EL+
Sbjct: 300 YPTRLALNVTNKDAAQASTSADDERMQERGYIVVETNYRVYAYTDSQLQVAVLGLFTELL 359
Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVS 309
YRFPNL VGVLTRDSVR ALR GITA QI+ +L +A P++ SC LPP V
Sbjct: 360 YRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMKLVESAIQSKSC-LPPTVV 418
Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
DQI+LWE ER R T +EGV+Y+QF+SQ DF LRDYAQ + +L WQNE+ RT+VV K GH
Sbjct: 419 DQIKLWELERNRFTYTEGVVYNQFLSQTDFVTLRDYAQSINMLVWQNERTRTMVVQKNGH 478
Query: 370 SDIKKFWVNHQK 381
D+K++W + K
Sbjct: 479 DDVKRYWKKYSK 490
>gi|194749905|ref|XP_001957376.1| GF10392 [Drosophila ananassae]
gi|190624658|gb|EDV40182.1| GF10392 [Drosophila ananassae]
Length = 497
Score = 456 bits (1174), Expect = e-126, Method: Compositional matrix adjust.
Identities = 230/432 (53%), Positives = 300/432 (69%), Gaps = 52/432 (12%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+ ++IR+LFV+QPVPQAV++SW ++ AKE +AT LT L
Sbjct: 64 ELPEIARQFIIRILFVDQPVPQAVVSSWGAQRCAKEQTDATSCLTALNVWRVTAIPGGLS 123
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+++ + LLGGGKPW M+ LE DSK RD++FL+ YA+ RW C+L +MVG+
Sbjct: 124 AWELSPTFKKSVRQVLLGGGKPWPMTNTLEKDSKPRDINFLDTYAMSRWRCVLHYMVGTG 183
Query: 99 ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
+TE IS DAVR L + + + IT+ GFQFLLL+T AQVW+F+L YL+T
Sbjct: 184 SRNGAETEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLHYLDT 243
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
E +G+ L ECL+ LFQL FST G DYS+EGM+ ++ FLQHLREFGLVYQRKRK GRFY
Sbjct: 244 CEERGMPLPECLSMLFQLSFSTLGRDYSSEGMNKQMLDFLQHLREFGLVYQRKRKEGRFY 303
Query: 209 PTKLALNMATR-----GTL----KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
PT+LALN+ ++ TL + ++E G+++VETN+RVYAYTDS L+VA++GLF EL+
Sbjct: 304 PTRLALNVTSKEAAVAATLTTDEEGVQESGYIVVETNYRVYAYTDSPLQVAVLGLFTELL 363
Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVS 309
YRFPNL VGVLTRDSVR ALR GITA QI+ +L +A P++ SC LPP V
Sbjct: 364 YRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMRLVESAIQSKSC-LPPTVV 422
Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
DQI+LWE ER R T +EGV+Y+QF+SQ DF LRDYAQ + VL WQNE+ RT+VV K GH
Sbjct: 423 DQIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQSIHVLVWQNERTRTMVVQKNGH 482
Query: 370 SDIKKFWVNHQK 381
D+K++W + K
Sbjct: 483 DDVKRYWKKYSK 494
>gi|195378984|ref|XP_002048261.1| GJ11446 [Drosophila virilis]
gi|194155419|gb|EDW70603.1| GJ11446 [Drosophila virilis]
Length = 492
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 226/431 (52%), Positives = 298/431 (69%), Gaps = 51/431 (11%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+ +VIR+LF++QPVPQAV+ SW ++ AKE EAT LT L
Sbjct: 60 ELPEIARQFVIRILFIDQPVPQAVVTSWGAQRVAKEQMEATSCLTALCVWRVTAIPGGLA 119
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+++ + LLGGGKPW+M+ L+ DSK RD+ FL+ YA+ RW C+L +MVG+
Sbjct: 120 AWELSPTFKKSVRQVLLGGGKPWAMTNTLDRDSKPRDIAFLDSYAMSRWRCVLHYMVGTG 179
Query: 99 ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
E IS DAVR L + + + IT+ GFQFLLL+T AQVW+F+L+YLET
Sbjct: 180 SRGGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLET 239
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
E +G+ L ECL+ LFQL FST G DYS+EGM++++ FLQHLREFGLV+QRKRK GRFY
Sbjct: 240 CEERGICLPECLSMLFQLSFSTLGRDYSSEGMNNQMLTFLQHLREFGLVFQRKRKEGRFY 299
Query: 209 PTKLALNM--------ATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
PT+LALN+ +T ++++E G+++VETN+RVYAYTDS L+VA++GLF EL+Y
Sbjct: 300 PTRLALNVTNKDAAQASTSADEERMQERGYIVVETNYRVYAYTDSQLQVAVLGLFTELLY 359
Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVSD 310
RFPNL VGVLTRDSVR ALR GITA QI+ +L +A P++ SC LPP + D
Sbjct: 360 RFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMKLVESAIQSKSC-LPPTIVD 418
Query: 311 QIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHS 370
QI+LWE ER R T +EGV+Y+QF+SQ DF LRDYAQ + +L WQNE+ RT+VV K GH
Sbjct: 419 QIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQSINMLVWQNERTRTMVVQKNGHD 478
Query: 371 DIKKFWVNHQK 381
D+K++W + K
Sbjct: 479 DVKRYWKKYSK 489
>gi|195162678|ref|XP_002022181.1| GL25529 [Drosophila persimilis]
gi|198464237|ref|XP_001353141.2| GA20571 [Drosophila pseudoobscura pseudoobscura]
gi|194104142|gb|EDW26185.1| GL25529 [Drosophila persimilis]
gi|198149631|gb|EAL30642.2| GA20571 [Drosophila pseudoobscura pseudoobscura]
Length = 494
Score = 454 bits (1167), Expect = e-125, Method: Compositional matrix adjust.
Identities = 228/432 (52%), Positives = 300/432 (69%), Gaps = 51/432 (11%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL E+A+ ++IR+LFV+QPVPQAV+ SW ++ AKE EAT L++L
Sbjct: 62 ELPELARQFIIRILFVDQPVPQAVVTSWGAQRFAKEQTEATSCLSDLNVWRVTAIPGGLS 121
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+++ + LLGGGKPW M+ LE DSK RD+ FL+ YA+ RW C+L +MVG+
Sbjct: 122 AWELSPTFKKSVRQVLLGGGKPWPMTNTLEKDSKPRDIAFLDSYAMSRWRCVLHYMVGTG 181
Query: 99 ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
E IS DAVR L + + + IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 182 NRNGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDT 241
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
E +G+ L ECL+ LFQL FST G DYS+EGM+ ++ FLQHLREFGLV+QRKRK GRFY
Sbjct: 242 CEERGVSLPECLSMLFQLSFSTLGRDYSSEGMNSQMLSFLQHLREFGLVFQRKRKEGRFY 301
Query: 209 PTKLALNM-----ATRGTL---KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
PT+LALN+ AT T+ + +++ G+++VETN+RVYAYTDS L+VA++GLF EL+Y
Sbjct: 302 PTRLALNVTSKEAATTMTVSDEEAMQDSGYIVVETNYRVYAYTDSPLQVAVLGLFTELLY 361
Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVSD 310
RFPNL VGVLTRDSVR ALR GITA QI+ +L+ +A P++ SC LPP V D
Sbjct: 362 RFPNLVVGVLTRDSVRQALRGGITAEQIVSYLQQYAHPNMKLVESAIQSKSC-LPPTVVD 420
Query: 311 QIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHS 370
QI+LWE ER R T +EGV+Y+QF+SQ DF LRDYAQ + VL WQNE+ RT+VV K GH
Sbjct: 421 QIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQSIHVLVWQNERTRTMVVQKHGHD 480
Query: 371 DIKKFWVNHQKG 382
D+K++W + KG
Sbjct: 481 DVKRYWKKYSKG 492
>gi|195126587|ref|XP_002007752.1| GI12214 [Drosophila mojavensis]
gi|193919361|gb|EDW18228.1| GI12214 [Drosophila mojavensis]
Length = 492
Score = 454 bits (1167), Expect = e-125, Method: Compositional matrix adjust.
Identities = 227/433 (52%), Positives = 300/433 (69%), Gaps = 51/433 (11%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+ +VIR+LFV+QPVPQAV++SW ++ AKE EAT+ L+ L
Sbjct: 60 ELPEIARQFVIRILFVDQPVPQAVVSSWGAQRVAKEQLEATNCLSALSVWRVTAIPGGLA 119
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+++ + LLGGGKPW+++ LE DSK RD+ FL+ YA+ RW C+L +MVG+
Sbjct: 120 AWELSPTFKKSVRQVLLGGGKPWAITNSLERDSKPRDVAFLDAYAMSRWRCVLHYMVGTG 179
Query: 99 ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
+TE IS DAVR L + + E IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 180 NRSNTETEAISPDAVRILLHANLMKRDEREGITITRQGFQFLLLDTRAQVWHFMLQYLDT 239
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
+ +GL L ECL+ LFQL FST G DYS+EGM+ ++ +FLQHLREFGLV+QRKRK RFY
Sbjct: 240 CQERGLVLAECLSMLFQLSFSTLGRDYSSEGMNKQMLMFLQHLREFGLVFQRKRKEHRFY 299
Query: 209 PTKLALNMATRGTL--------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
PT+LALN+ + ++++E G+++VETN+RVYAYTDS L+VA++GLF EL+Y
Sbjct: 300 PTRLALNVTNKEAAEASISVDEERMQERGYIVVETNYRVYAYTDSQLQVAVLGLFTELLY 359
Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVSD 310
RFPNL VGVLTRDSVR ALR GITA QI+ +L +A P++ SC LPP V D
Sbjct: 360 RFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMKLVESAIQSKSC-LPPTVVD 418
Query: 311 QIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHS 370
QI+LWE ER R T +EGV+Y+QF+SQ DF LRDYAQ VL WQNE+ RT+VV K GH
Sbjct: 419 QIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQSQNVLVWQNERTRTMVVQKNGHD 478
Query: 371 DIKKFWVNHQKGS 383
D+K++W + K +
Sbjct: 479 DVKRYWKKYSKSA 491
>gi|195441209|ref|XP_002068409.1| GK20438 [Drosophila willistoni]
gi|194164494|gb|EDW79395.1| GK20438 [Drosophila willistoni]
Length = 512
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 226/440 (51%), Positives = 299/440 (67%), Gaps = 58/440 (13%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
+L EIA+ ++IR+LFV+QPVPQAV++SW ++ AKE +EAT LT L
Sbjct: 73 DLPEIARQFIIRILFVDQPVPQAVVSSWGAQRCAKEQQEATSCLTSLNVWRVTAIPGGLA 132
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+++ + LLGGGKPW M+ LE DSK RD+ FL+ YA+ RW C+L +MVG+
Sbjct: 133 AWELSPTFKKSVRQILLGGGKPWPMTNTLEKDSKPRDIAFLDSYAMSRWRCVLHYMVGTG 192
Query: 99 ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
E IS DAVR L + + + IT+ GFQFLLL+T AQVW+F+++YL+T
Sbjct: 193 NRNGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMMQYLDT 252
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
E +G+ L ECL+ LFQL FST G DYS+EGMS ++ FLQHLREFGLV+QRKRK GRFY
Sbjct: 253 CEERGISLPECLSMLFQLSFSTLGRDYSSEGMSHQMLAFLQHLREFGLVFQRKRKEGRFY 312
Query: 209 PTKLALNMATRGTL---------------KQIREPGFLIVETNFRVYAYTDSNLKVALIG 253
PT+LALN+ ++ + +++++ G+++VETN+RVYAYTDS L+VA++G
Sbjct: 313 PTRLALNVTSKESADATTAIVTMSASEEEERMQDRGYIVVETNYRVYAYTDSPLQVAVLG 372
Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCP 303
LF EL+YRFPNL VGVLTRDSVR ALR GITA QII +L +A P++ SC
Sbjct: 373 LFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIISYLEQYAHPNMKLVESAINSKSC- 431
Query: 304 LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVV 363
LPP V DQI+LWE ER R T +EGV+Y+QF+S DF LRDYAQ + VL WQNEK RT+V
Sbjct: 432 LPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSTTDFVTLRDYAQSIQVLVWQNEKTRTMV 491
Query: 364 VTKQGHSDIKKFWVNHQKGS 383
V K GH D+K++W + K +
Sbjct: 492 VQKNGHDDVKRYWKKYSKSA 511
>gi|194872905|ref|XP_001973103.1| GG13537 [Drosophila erecta]
gi|190654886|gb|EDV52129.1| GG13537 [Drosophila erecta]
Length = 499
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 224/432 (51%), Positives = 293/432 (67%), Gaps = 52/432 (12%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+ ++IR+LFV+QPVPQAV+ SW ++ AKE EAT LT L
Sbjct: 66 ELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLS 125
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+++ + LLGGGKPW M+ LE DSK RD+ FL+ YA+ RW C+L +MVG+
Sbjct: 126 AWELSPTFKKSVRQVLLGGGKPWPMTNTLEKDSKPRDIAFLDTYAMSRWRCVLHYMVGTG 185
Query: 99 ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
E IS DAVR L + + + IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 186 NRNGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDT 245
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
E +G+ L ECL+ LFQL FST G DYS+EGM+ ++ FLQHLREFGLV+QRKRK GRFY
Sbjct: 246 CEERGISLPECLSMLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLVFQRKRKEGRFY 305
Query: 209 PTKLALNMATRGTL---------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
PT+LALN+ ++ + ++ G+++VETN+RVYAYTDS L+VA++GLF EL+
Sbjct: 306 PTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELL 365
Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVS 309
YRFPNL VGVLTRDSVR ALR GITA QI+ +L +A P++ SC LPP V
Sbjct: 366 YRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMRMVESAIQSKSC-LPPTVV 424
Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
DQI+LWE ER R T +EGV+Y+QF+S DF LRDYAQ + +L WQNE+ RT+VV K GH
Sbjct: 425 DQIKLWELERNRFTYTEGVVYNQFLSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNGH 484
Query: 370 SDIKKFWVNHQK 381
D+K++W + K
Sbjct: 485 DDVKRYWKKYSK 496
>gi|260810774|ref|XP_002600123.1| hypothetical protein BRAFLDRAFT_57175 [Branchiostoma floridae]
gi|229285409|gb|EEN56135.1| hypothetical protein BRAFLDRAFT_57175 [Branchiostoma floridae]
Length = 457
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 223/416 (53%), Positives = 295/416 (70%), Gaps = 34/416 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL+++AKHYV+RLLF+EQ VPQAV+ SW++ + EH A L+ELR
Sbjct: 42 ELTDLAKHYVMRLLFLEQAVPQAVVGSWVTSGNHSEHLRAVSQLSELRVWHEHQISGGLA 101
Query: 49 ---------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
N + ALLGGGKPW +L D A+D+ L+ Y+LERWE +L F+VGS
Sbjct: 102 GWILSPTFRANLRTALLGGGKPWFSVVQLPPDKHAKDVAALDTYSLERWEVLLNFIVGSG 161
Query: 100 QTEGISADAVRTLFN-------EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
+ + +S D + L EG+ +P IT GFQFLL++TP+QVWY IL+YL+T++S+
Sbjct: 162 EAQ-VSKDIMEILIKSGLMKSEEGSLHPTITPAGFQFLLMDTPSQVWYIILQYLDTMQSR 220
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
GL+LVE L FLFQ+ FST G DY TEGMSD +Q FLQHLRE GLV QRKRK+GRFYPT+L
Sbjct: 221 GLNLVEALQFLFQISFSTLGKDYPTEGMSDSMQQFLQHLRELGLVNQRKRKSGRFYPTRL 280
Query: 213 ALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
A+++A+ K + G+L+VE+N+R+YAYTDS L+VALIGLF E++YRFPN+ V L
Sbjct: 281 AIHLASGISDVEKDFHKEGYLVVESNYRIYAYTDSELQVALIGLFSEILYRFPNMVVANL 340
Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPL-PPVVSDQIRLWEGERERLTTSEG 327
TRDSV+ A+ GITA QI+ FLR++A P P+ PP +SDQIRLWE ER+RLT +EG
Sbjct: 341 TRDSVQEAVVRGITADQILHFLRVNAHPKALHRVPIVPPTISDQIRLWEMERDRLTFTEG 400
Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
VLY+QF+SQ DFE LR+YA+DLGVL W+N KR +VV+K GH D+K++W +KG+
Sbjct: 401 VLYNQFLSQPDFEMLRNYAKDLGVLLWENNPKRLMVVSKAGHDDVKRYWKTIRKGN 456
>gi|443688956|gb|ELT91478.1| hypothetical protein CAPTEDRAFT_18660 [Capitella teleta]
Length = 463
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 228/415 (54%), Positives = 293/415 (70%), Gaps = 35/415 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL ++KHYV+R+LFVEQPV +V++SW++ ++ EH A +L++L
Sbjct: 47 ELPSLSKHYVLRILFVEQPVSHSVVSSWVNSSNQTEHTAAVKSLSDLCVWQDHCLPGGLP 106
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
R N KVALLGGG PW+ S L D ++D+ FL QY++ERWEC+L +M GS+
Sbjct: 107 GYLLSDVFRTNLKVALLGGGSPWAGSGALGDDKHSKDVAFLEQYSMERWECVLHYMAGSK 166
Query: 100 Q-TEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ G+S D V L + G + +P IT GFQFLLL+T QVW+F+++YL TVES
Sbjct: 167 EGVSGVSRDVVHILLHSGLMKTEQSSPDPCITPAGFQFLLLDTSTQVWFFMIEYLNTVES 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+ +DLVECL+FLFQL FST G DYSTE M+ Q FLQHLREFGLVYQRKR + RFYPT+
Sbjct: 227 RKMDLVECLSFLFQLSFSTLGKDYSTESMTSNQQRFLQHLREFGLVYQRKRSSQRFYPTR 286
Query: 212 LALNMAT---RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
LA+N+AT G + E GF++VETN+RVYAYT+S+LKVALI LFC +MYRFPN+ VG
Sbjct: 287 LAVNLATGSKGGDSETASEDGFIVVETNYRVYAYTNSSLKVALISLFCSMMYRFPNMAVG 346
Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTS 325
V++RDSVR AL GITA QII FLR HA P LPP VSDQIRLWE ER+R +
Sbjct: 347 VVSRDSVREALSRGITAEQIINFLRNHAHPHTQKRKPILPPTVSDQIRLWELERDRFHFN 406
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQ 380
EGVLY+QF+SQ DFE LRDYA+DLGVL+++N +R +VVT+ GH D+K+FW H+
Sbjct: 407 EGVLYNQFLSQTDFEVLRDYAKDLGVLSYENIPRRLMVVTRAGHDDVKRFWKRHK 461
>gi|195477975|ref|XP_002086439.1| GE22863 [Drosophila yakuba]
gi|194186229|gb|EDW99840.1| GE22863 [Drosophila yakuba]
Length = 499
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 224/432 (51%), Positives = 293/432 (67%), Gaps = 52/432 (12%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+ ++IR+LFV+QPVPQAV+ SW ++ AKE EAT LT L
Sbjct: 66 ELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLS 125
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+++ + LLGGGKPW M+ LE DSK RD+ FL+ YA+ RW C+L +MVG+
Sbjct: 126 AWELSPTFKKSVRQVLLGGGKPWPMTNTLEKDSKPRDIAFLDTYAMSRWRCVLHYMVGTG 185
Query: 99 ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
E IS DAVR L + + + IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 186 NRNGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDT 245
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
E +G+ L ECL+ LFQL FST G DYS+EGM+ ++ FLQHLREFGLV+QRKRK GRFY
Sbjct: 246 CEERGISLPECLSMLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLVFQRKRKEGRFY 305
Query: 209 PTKLALNMATRGTL---------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
PT+LALN+ ++ + ++ G+++VETN+RVYAYTDS L+VA++GLF EL+
Sbjct: 306 PTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELL 365
Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVS 309
YRFPNL VGVLTRDSVR ALR GITA QI+ +L +A P++ SC LPP V
Sbjct: 366 YRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMRLVESAIQSKSC-LPPTVV 424
Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
DQI+LWE ER R T +EGV+Y+QF+S DF LRDYAQ + +L WQNE+ RT+VV K GH
Sbjct: 425 DQIKLWELERNRFTYTEGVVYNQFLSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNGH 484
Query: 370 SDIKKFWVNHQK 381
D+K++W + K
Sbjct: 485 DDVKRYWKKYSK 496
>gi|195327811|ref|XP_002030611.1| GM24480 [Drosophila sechellia]
gi|194119554|gb|EDW41597.1| GM24480 [Drosophila sechellia]
Length = 499
Score = 447 bits (1150), Expect = e-123, Method: Compositional matrix adjust.
Identities = 224/432 (51%), Positives = 293/432 (67%), Gaps = 52/432 (12%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+ ++IR+LFV+QPVPQAV+ SW ++ AKE EAT LT L
Sbjct: 66 ELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLS 125
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+++ + LLGGGKPW M+ L+ DSK RD+ FL+ YA+ RW C+L +MVG+
Sbjct: 126 AWELSPTFKKSVRQVLLGGGKPWPMTNTLDKDSKPRDIAFLDTYAMSRWRCVLHYMVGTG 185
Query: 99 ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
E IS DAVR L + + + IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 186 NRNGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDT 245
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
E +G+ L ECL+ LFQL FST G DYS+EGM+ ++ FLQHLREFGLV+QRKRK GRFY
Sbjct: 246 CEERGISLPECLSMLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLVFQRKRKEGRFY 305
Query: 209 PTKLALNMATRGTL---------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
PT+LALN+ ++ + ++ G+++VETN+RVYAYTDS L+VA++GLF EL+
Sbjct: 306 PTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELL 365
Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVS 309
YRFPNL VGVLTRDSVR ALR GITA QI+ +L +A P++ SC LPP V
Sbjct: 366 YRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMRMVESAIHSKSC-LPPTVV 424
Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
DQI+LWE ER R T +EGVLY+QF+S DF LRDYAQ + +L WQNE+ RT+VV K GH
Sbjct: 425 DQIKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNGH 484
Query: 370 SDIKKFWVNHQK 381
D+K++W + K
Sbjct: 485 DDVKRYWKKYSK 496
>gi|21357821|ref|NP_648780.1| marionette [Drosophila melanogaster]
gi|7294263|gb|AAF49614.1| marionette [Drosophila melanogaster]
gi|220942562|gb|ACL83824.1| Tfb2-PA [synthetic construct]
Length = 499
Score = 447 bits (1150), Expect = e-123, Method: Compositional matrix adjust.
Identities = 224/432 (51%), Positives = 293/432 (67%), Gaps = 52/432 (12%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+ ++IR+LFV+QPVPQAV+ SW ++ AKE EAT LT L
Sbjct: 66 ELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLS 125
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+++ + LLGGGKPW M+ L+ DSK RD+ FL+ YA+ RW C+L +MVG+
Sbjct: 126 AWELSPTFKKSVRQVLLGGGKPWPMTNTLDKDSKPRDIAFLDTYAMSRWRCVLHYMVGTG 185
Query: 99 ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
E IS DAVR L + + + IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 186 NRNGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDT 245
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
E +G+ L ECL+ LFQL FST G DYS+EGM+ ++ FLQHLREFGLV+QRKRK GRFY
Sbjct: 246 CEERGISLPECLSMLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLVFQRKRKEGRFY 305
Query: 209 PTKLALNMATRGTL---------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
PT+LALN+ ++ + ++ G+++VETN+RVYAYTDS L+VA++GLF EL+
Sbjct: 306 PTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELL 365
Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVS 309
YRFPNL VGVLTRDSVR ALR GITA QI+ +L +A P++ SC LPP V
Sbjct: 366 YRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMRMVESAIHSKSC-LPPTVV 424
Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
DQI+LWE ER R T +EGVLY+QF+S DF LRDYAQ + +L WQNE+ RT+VV K GH
Sbjct: 425 DQIKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNGH 484
Query: 370 SDIKKFWVNHQK 381
D+K++W + K
Sbjct: 485 DDVKRYWKKYSK 496
>gi|321470926|gb|EFX81900.1| hypothetical protein DAPPUDRAFT_210868 [Daphnia pulex]
Length = 463
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 224/423 (52%), Positives = 293/423 (69%), Gaps = 44/423 (10%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL E+++ Y++R+LFV+Q VP+A++ SW+S A+E ++ LT+LR
Sbjct: 41 ELPELSRIYIMRILFVDQAVPKAIMGSWVSPNSARELEDIVKLLTDLRVWQEVEMQGGLK 100
Query: 49 ---------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
RN K ALLGGG WSM + D KAR + L++YA+ RWEC+L FMVGS
Sbjct: 101 GWLLNPTFRRNLKGALLGGGNEWSMKPNTDADPKARGIAVLDEYAMGRWECVLHFMVGSH 160
Query: 100 QTEGISADAVRTLFNEG------AENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESK 152
Q E IS+DA++ L + G EN + ITKDGFQFLL++T AQVWYF+L+YL+T S+
Sbjct: 161 QHEAISSDALQILQHAGLMKKEPGENQLSITKDGFQFLLMDTSAQVWYFLLQYLDTANSR 220
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
LDL++CL FLFQL FST G DYST+ M+D LQ FLQHLREFGLVYQRKRK GRFYPT+L
Sbjct: 221 NLDLIDCLGFLFQLSFSTLGQDYSTDSMNDGLQKFLQHLREFGLVYQRKRKDGRFYPTRL 280
Query: 213 ALNMAT---RGTLKQI----REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
AL++A + L + + G+++VETN+RVYAYTDS+L+VALI LFCEL+YRFPNL
Sbjct: 281 ALDIAAGPKKSMLNSLNASSQTTGYIVVETNYRVYAYTDSSLQVALIALFCELIYRFPNL 340
Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHAL------PSVSCPLPPVVSDQIRLWEGER 319
VG++TR+SVR AL+ GITA QI+ FLR HA P V LP ++DQI+LW ER
Sbjct: 341 VVGIITRESVREALKRGITADQIVSFLRQHAHSECYKEPPV---LPSTIADQIKLWAIER 397
Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNH 379
+R +GVLY+QF SQADFE LR+YAQ+ G+L W R +VVTK+GH +++KFW H
Sbjct: 398 DRFLYKDGVLYNQFDSQADFEILRNYAQERGLLVWHTLNGRKMVVTKEGHDEVRKFWKRH 457
Query: 380 QKG 382
+
Sbjct: 458 SRN 460
>gi|195590491|ref|XP_002084979.1| GD12553 [Drosophila simulans]
gi|194196988|gb|EDX10564.1| GD12553 [Drosophila simulans]
Length = 500
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 224/433 (51%), Positives = 295/433 (68%), Gaps = 53/433 (12%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+ ++IR+LFV+QPVPQAV+ SW ++ AKE EAT LT L
Sbjct: 66 ELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLS 125
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+++ + LLGGGKPW M+ L+ DSK RD+ FL+ YA+ RW C+L +MVG+
Sbjct: 126 AWELSPTFKKSVRQVLLGGGKPWPMTNTLDKDSKPRDIAFLDTYAMSRWRCVLHYMVGTG 185
Query: 99 ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
E IS DAVR L + + + IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 186 NRSGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDT 245
Query: 149 VESKGLDLVECLTFLFQL-KFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
E +G+ L ECL+ LFQL +FST G DYS+EGM++++ FLQHLREFGLV+QRKRK GRF
Sbjct: 246 CEERGISLPECLSMLFQLTQFSTLGRDYSSEGMNNQMLTFLQHLREFGLVFQRKRKEGRF 305
Query: 208 YPTKLALNMATRGTL---------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCEL 258
YPT+LALN+ ++ + ++ G+++VETN+RVYAYTDS L+VA++GLF EL
Sbjct: 306 YPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTEL 365
Query: 259 MYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVV 308
+YRFPNL VGVLTRDSVR ALR GITA QI+ +L +A P++ SC LPP V
Sbjct: 366 LYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMRMVESAIHSKSC-LPPTV 424
Query: 309 SDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
DQI+LWE ER R T +EGVLY+QF+S DF LRDYAQ + +L WQNE+ RT+VV K G
Sbjct: 425 VDQIKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNG 484
Query: 369 HSDIKKFWVNHQK 381
H D+K++W + K
Sbjct: 485 HDDVKRYWKKYSK 497
>gi|193633122|ref|XP_001945987.1| PREDICTED: general transcription factor IIH subunit 4-like
[Acyrthosiphon pisum]
Length = 471
Score = 443 bits (1139), Expect = e-122, Method: Compositional matrix adjust.
Identities = 211/412 (51%), Positives = 285/412 (69%), Gaps = 32/412 (7%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +I + + +R+LF+EQPVPQ+V++SW+ +++E E+ + T L
Sbjct: 58 LPDITQQFTLRILFIEQPVPQSVLSSWVPANYSRELDESIEVATNLHIWKLTSVSGGLKG 117
Query: 48 -------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
++ KVAL+GGG+ ++ + D KARD+DFL+ YA ERWECIL +MVGS+
Sbjct: 118 WILNSTFKKKLKVALMGGGRSTVPNSDMTADPKARDIDFLDSYAYERWECILHYMVGSKH 177
Query: 101 TEGISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
EGIS+DAVR L N G ++PVIT GFQFLLL+ QVWYF+L+Y+ETVES+GL
Sbjct: 178 -EGISSDAVRVLLNAGLMVRDTDDSPVITSTGFQFLLLDMATQVWYFMLRYMETVESRGL 236
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
DL +CLTFLFQ+ T G DY T+ MS+ LQ FLQHLREFGLVYQRKRKAGRFYPT+L +
Sbjct: 237 DLAQCLTFLFQIHLGTLGWDYITDEMSENLQAFLQHLREFGLVYQRKRKAGRFYPTRLVI 296
Query: 215 NMA---TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLT 271
M +R + + + +++VETNFR+YA TDS+LKVAL+ LF ++YRFPN++ G+LT
Sbjct: 297 EMGQGNSRTSERMKNKERYIVVETNFRIYAMTDSDLKVALVALFTHMLYRFPNMSAGILT 356
Query: 272 RDSVRAALRSGITAAQIIGFLRLHALPSV-SCPLPPVVSDQIRLWEGERERLTTSEGVLY 330
RDSVR ALRSGITAAQI+ FL +H P + C +P V DQI LWE ER RLT ++GVLY
Sbjct: 357 RDSVRTALRSGITAAQIVRFLTVHTHPQMQECGMPQTVIDQIYLWENERNRLTYTDGVLY 416
Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
S + D+E +++YA D+G L W +E++R +VV+ GH D++KFW K
Sbjct: 417 SNINTPNDYETIKNYAADIGALVWCDERRRNIVVSTDGHDDVRKFWKKQPKS 468
>gi|147899535|ref|NP_001085110.1| general transcription factor IIH, polypeptide 4, 52kDa [Xenopus
laevis]
gi|47939830|gb|AAH72322.1| MGC83106 protein [Xenopus laevis]
Length = 455
Score = 432 bits (1112), Expect = e-119, Method: Compositional matrix adjust.
Identities = 219/416 (52%), Positives = 291/416 (69%), Gaps = 34/416 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK+YV+R+LF+EQP+PQA +A W+ K + EH+E T LT LR
Sbjct: 41 ELPGLAKNYVMRMLFLEQPLPQAAVALWVKKENFNEHEENTQVLTGLRLWHTQQLPGGLQ 100
Query: 49 ---------RNAKVALLGGGKPWSMSAKL-EVDSKARDLDFLNQYALERWECILRFMVGS 98
N K+ALLGGGK W+ L D +D+ L++YA ERWE IL FMVGS
Sbjct: 101 GLILNPIFRENLKIALLGGGKAWADDGCLLGPDKHGKDVVSLDKYAEERWEVILHFMVGS 160
Query: 99 QQTEGISADAVRTLF------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
+ +S D + L NE E P I+ GFQFLLL+TP+Q+WYF+L+YL++ ES+
Sbjct: 161 P-SAAVSQDLAQLLIQAGLMKNESGEAPCISSAGFQFLLLDTPSQLWYFMLQYLKSAESR 219
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
G++LVE L+F+FQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+L
Sbjct: 220 GMNLVEILSFMFQLSFSTLGKDYSVEGMSDSLLTFLQHLREFGLVFQRKRKSRRYYPTRL 279
Query: 213 ALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
A+N+A+ G++ + GF++VETN+R+YAYTDS L++ALI LF E++YRFPNL V +
Sbjct: 280 AINLASGISGSVVDSHKQGFIVVETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQV 339
Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEG 327
TR++V+ A+ +GITA QII FLR A P + + LPP ++DQIRLWE ER+RL SEG
Sbjct: 340 TRENVQQAIGNGITAEQIIHFLRTRAHPVMLQQNPALPPTITDQIRLWELERDRLRFSEG 399
Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
VLY+QF+SQ DFE LR+YA+DLGVL ++N KR +VVT GHSD+K+FW ++ S
Sbjct: 400 VLYNQFLSQVDFELLRNYARDLGVLVFENPAKRVMVVTPGGHSDVKRFWKRQKQSS 455
>gi|62751795|ref|NP_001015845.1| general transcription factor IIH, polypeptide 4, 52kDa [Xenopus
(Silurana) tropicalis]
gi|58475911|gb|AAH90134.1| general transcription factor II H, polypeptide 4 [Xenopus
(Silurana) tropicalis]
gi|89272862|emb|CAJ82116.1| transcription factor tfb2 [Xenopus (Silurana) tropicalis]
gi|114107989|gb|AAI22900.1| gtf2h4 protein [Xenopus (Silurana) tropicalis]
Length = 455
Score = 432 bits (1111), Expect = e-118, Method: Compositional matrix adjust.
Identities = 220/416 (52%), Positives = 291/416 (69%), Gaps = 34/416 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK+YV+R+LF+EQP+PQA +A W+ K + KEH+E T LT LR
Sbjct: 41 ELPGLAKNYVMRMLFLEQPLPQAAVALWVKKENFKEHEENTQVLTGLRLWHTQQLPGGLQ 100
Query: 49 ---------RNAKVALLGGGKPWSMSAKL-EVDSKARDLDFLNQYALERWECILRFMVGS 98
N K+ALLGGGKPW+ L D +D+ L++YA ERWE IL FMVGS
Sbjct: 101 GLILNPIFRENLKIALLGGGKPWADDGCLLGPDKHGKDVVSLDKYAEERWEVILHFMVGS 160
Query: 99 QQTEGISADAVRTLF------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
+ +S D + L +E E P I+ GFQFLLL+TP+Q+WYF+L+YL++ ES+
Sbjct: 161 P-SAAVSQDLAQLLIQAGLMKSESGEAPCISSAGFQFLLLDTPSQLWYFMLQYLKSAESR 219
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
G+ LVE L+F+FQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+L
Sbjct: 220 GMILVEILSFMFQLSFSTLGKDYSVEGMSDSLLTFLQHLREFGLVFQRKRKSRRYYPTRL 279
Query: 213 ALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
A+N+A+ G++ + GF++VETN+R+YAYTDS L++ALI LF E++YRFPNL V +
Sbjct: 280 AINLASGISGSVVDSHKQGFIVVETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQV 339
Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEG 327
TR++V+ A+ +GITA QII FLR A P + + LPP ++DQIRLWE ER+RL SEG
Sbjct: 340 TRENVQQAIGNGITAEQIIHFLRTRAHPVMLQQNPVLPPTITDQIRLWELERDRLRFSEG 399
Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
VLY+QF+SQ DFE LR+YA+DLGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 400 VLYNQFLSQVDFELLRNYARDLGVLVFENPAKRVMVVTPGGHSDVKRFWKRQKHSS 455
>gi|327266348|ref|XP_003217968.1| PREDICTED: general transcription factor IIH subunit 4-like [Anolis
carolinensis]
Length = 460
Score = 431 bits (1107), Expect = e-118, Method: Compositional matrix adjust.
Identities = 221/417 (52%), Positives = 291/417 (69%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK+YV+R+L +EQP+PQA +ASW+ K + KE +E++D L LR
Sbjct: 45 ELPGLAKNYVMRMLLLEQPLPQAAVASWVKKEYTKEQEESSDILLGLRLWHKQLLPGGLQ 104
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
N +VALLGGGK WS +++L D ARD+ L++YA ERWE IL FMVGS
Sbjct: 105 GIVLNPIFKENLRVALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVILDFMVGS 164
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+T +Q+WYF+L+YL++ E+
Sbjct: 165 P-SAAVSQDLAQLLTEAGLMKSSEPGEPPCITSSGFQFLLLDTSSQLWYFMLQYLQSAET 223
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMS+ L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 224 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSESLLNFLQHLREFGLVFQRKRKSRRYYPTR 283
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G R GF+IVETN+R+YAYTDS L++ALI LF E++YRFPNL V
Sbjct: 284 LAINLSSGISGITIDTRNQGFIIVETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQ 343
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ +GITA QII FLR A P + + LPP ++DQIRLWE ER+RL SE
Sbjct: 344 VTRESVQQAIANGITADQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFSE 403
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE LRD+A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 404 GVLYNQFLSQVDFELLRDHARELGVLIFENPSKRLMVVTPAGHSDVKRFWKRQKHSS 460
>gi|47059175|ref|NP_997666.1| general transcription factor II H, polypeptide 4 [Rattus
norvegicus]
gi|46237651|emb|CAE84027.1| general transcription factor II H, polypeptide 4 [Rattus
norvegicus]
gi|117558335|gb|AAI27470.1| General transcription factor II H, polypeptide 4 [Rattus
norvegicus]
Length = 463
Score = 426 bits (1095), Expect = e-117, Method: Compositional matrix adjust.
Identities = 216/417 (51%), Positives = 292/417 (70%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPVFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463
>gi|395831929|ref|XP_003789035.1| PREDICTED: general transcription factor IIH subunit 4 [Otolemur
garnettii]
Length = 463
Score = 426 bits (1095), Expect = e-117, Method: Compositional matrix adjust.
Identities = 216/417 (51%), Positives = 292/417 (70%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463
>gi|4504201|ref|NP_001508.1| general transcription factor IIH subunit 4 [Homo sapiens]
gi|113865855|ref|NP_001038968.1| general transcription factor IIH subunit 4 [Pan troglodytes]
gi|114050799|ref|NP_001040607.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|297677636|ref|XP_002816695.1| PREDICTED: general transcription factor IIH subunit 4 isoform 2
[Pongo abelii]
gi|332245940|ref|XP_003272109.1| PREDICTED: general transcription factor IIH subunit 4 [Nomascus
leucogenys]
gi|426352309|ref|XP_004043656.1| PREDICTED: general transcription factor IIH subunit 4 [Gorilla
gorilla gorilla]
gi|17380328|sp|Q92759.1|TF2H4_HUMAN RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=Basic transcription factor 2 52 kDa subunit;
Short=BTF2 p52; AltName: Full=General transcription
factor IIH polypeptide 4; AltName: Full=TFIIH basal
transcription factor complex p52 subunit
gi|38503276|sp|P60027.1|TF2H4_PANTR RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=Basic transcription factor 2 52 kDa subunit;
Short=BTF2 p52; AltName: Full=General transcription
factor IIH polypeptide 4; AltName: Full=TFIIH basal
transcription factor complex p52 subunit
gi|1514597|emb|CAA68870.1| transcription factor TFIIH [Homo sapiens]
gi|13436278|gb|AAH04935.1| General transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|15277224|dbj|BAB63317.1| Transcription factor II H [Homo sapiens]
gi|16740884|gb|AAH16302.1| General transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|21655317|gb|AAM64222.1| general transcription factor IIH, polypeptide 4 (52kD subunit)
[Homo sapiens]
gi|27544403|dbj|BAC54936.1| transcription factor II H [Homo sapiens]
gi|30583481|gb|AAP35985.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|32127781|dbj|BAC78171.1| transcription factor II H [Pan troglodytes]
gi|55700798|dbj|BAD69753.1| general transcription factor IIH, polypeptide 4, 52kDa [Macaca
mulatta]
gi|60655839|gb|AAX32483.1| general transcription factor IIH polypeptide 4 [synthetic
construct]
gi|60655841|gb|AAX32484.1| general transcription factor IIH polypeptide 4 [synthetic
construct]
gi|86197966|dbj|BAE78622.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|90960934|dbj|BAE92819.1| general transcription factor IIH, polypeptide 4 [Pan troglodytes]
gi|90960936|dbj|BAE92820.1| general transcription factor IIH, polypeptide 4 [Pan troglodytes]
gi|114306784|dbj|BAF31271.1| TFIIH protein [Homo sapiens]
gi|119623752|gb|EAX03347.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|123992979|gb|ABM84091.1| general transcription factor IIH, polypeptide 4, 52kDa [synthetic
construct]
gi|123999907|gb|ABM87462.1| general transcription factor IIH, polypeptide 4, 52kDa [synthetic
construct]
gi|355561507|gb|EHH18139.1| General transcription factor IIH polypeptide 4, partial [Macaca
mulatta]
gi|355748409|gb|EHH52892.1| General transcription factor IIH polypeptide 4 [Macaca
fascicularis]
gi|380812138|gb|AFE77944.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|380812140|gb|AFE77945.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|383408563|gb|AFH27495.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|410214852|gb|JAA04645.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
gi|410250640|gb|JAA13287.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
gi|410290944|gb|JAA24072.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
gi|410352807|gb|JAA43007.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
Length = 462
Score = 426 bits (1095), Expect = e-117, Method: Compositional matrix adjust.
Identities = 216/417 (51%), Positives = 292/417 (70%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 47 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 106
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 107 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 166
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 167 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 225
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 226 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 285
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 286 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 345
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 346 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 405
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 406 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 462
>gi|410958682|ref|XP_003985944.1| PREDICTED: general transcription factor IIH subunit 4 [Felis catus]
Length = 463
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 217/417 (52%), Positives = 291/417 (69%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + S LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQSPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463
>gi|6754094|ref|NP_034494.1| general transcription factor IIH subunit 4 [Mus musculus]
gi|18202124|sp|O70422.1|TF2H4_MOUSE RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=Basic transcription factor 2 52 kDa subunit;
Short=BTF2 p52; AltName: Full=General transcription
factor IIH polypeptide 4; AltName: Full=TFIIH basal
transcription factor complex p52 subunit
gi|2997755|gb|AAC08594.1| TFIIH transcription/DNA repair factor p52 subunit [Mus musculus]
gi|15215027|gb|AAH12638.1| Gtf2h4 protein [Mus musculus]
gi|26348003|dbj|BAC37650.1| unnamed protein product [Mus musculus]
gi|74194835|dbj|BAE26009.1| unnamed protein product [Mus musculus]
gi|74222199|dbj|BAE26910.1| unnamed protein product [Mus musculus]
Length = 463
Score = 426 bits (1094), Expect = e-116, Method: Compositional matrix adjust.
Identities = 216/417 (51%), Positives = 292/417 (70%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPVFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQNPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463
>gi|344307658|ref|XP_003422497.1| PREDICTED: general transcription factor IIH subunit 4 [Loxodonta
africana]
Length = 463
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 217/417 (52%), Positives = 291/417 (69%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G +PGF+IVETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIIVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463
>gi|281337500|gb|EFB13084.1| hypothetical protein PANDA_018954 [Ailuropoda melanoleuca]
Length = 463
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 216/417 (51%), Positives = 291/417 (69%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463
>gi|338718559|ref|XP_003363845.1| PREDICTED: general transcription factor IIH subunit 4 [Equus
caballus]
Length = 463
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 216/417 (51%), Positives = 291/417 (69%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPVFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463
>gi|426250586|ref|XP_004019016.1| PREDICTED: general transcription factor IIH subunit 4 [Ovis aries]
Length = 463
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 216/417 (51%), Positives = 291/417 (69%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHNS 463
>gi|178056954|ref|NP_001116592.1| general transcription factor IIH subunit 4 [Sus scrofa]
gi|41529162|dbj|BAD08424.1| general transcription factor IIH, polypeptide 4 [Sus scrofa]
gi|47496805|dbj|BAD08426.2| general transcription factor IIH, polypeptide 4 [Sus scrofa]
Length = 463
Score = 424 bits (1089), Expect = e-116, Method: Compositional matrix adjust.
Identities = 215/417 (51%), Positives = 291/417 (69%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLRE GLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLRELGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463
>gi|359320969|ref|XP_003639474.1| PREDICTED: general transcription factor IIH subunit 4-like [Canis
lupus familiaris]
Length = 463
Score = 423 bits (1088), Expect = e-116, Method: Compositional matrix adjust.
Identities = 217/417 (52%), Positives = 291/417 (69%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPFLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G +PGF+IVETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIIVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463
>gi|440896165|gb|ELR48176.1| General transcription factor IIH subunit 4 [Bos grunniens mutus]
Length = 463
Score = 423 bits (1087), Expect = e-116, Method: Compositional matrix adjust.
Identities = 216/417 (51%), Positives = 291/417 (69%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVLSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHNS 463
>gi|155371845|ref|NP_001094527.1| general transcription factor IIH subunit 4 [Bos taurus]
gi|148878127|gb|AAI46241.1| GTF2H4 protein [Bos taurus]
gi|296474219|tpg|DAA16334.1| TPA: general transcription factor IIH, polypeptide 4, 52kDa [Bos
taurus]
Length = 463
Score = 423 bits (1087), Expect = e-116, Method: Compositional matrix adjust.
Identities = 216/417 (51%), Positives = 291/417 (69%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVLSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLMFENSAKRLMVVTPAGHSDVKRFWKRQKHNS 463
>gi|62896773|dbj|BAD96327.1| general transcription factor IIH, polypeptide 4, 52kDa variant
[Homo sapiens]
Length = 462
Score = 423 bits (1087), Expect = e-116, Method: Compositional matrix adjust.
Identities = 215/417 (51%), Positives = 291/417 (69%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 47 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 106
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 107 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 166
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 167 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 225
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 226 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 285
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 286 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 345
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE R+RL +E
Sbjct: 346 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELGRDRLRFTE 405
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 406 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 462
>gi|41053973|ref|NP_956221.1| general transcription factor IIH subunit 4 [Danio rerio]
gi|39794736|gb|AAH64301.1| General transcription factor IIH, polypeptide 4 [Danio rerio]
Length = 466
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 215/428 (50%), Positives = 284/428 (66%), Gaps = 46/428 (10%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK+YV+R+LF++ P+PQA +A W+ K K+H + LT LR
Sbjct: 40 ELPPLAKNYVMRMLFLDHPLPQAAVALWVKKGSQKDHDQCVSVLTGLRLWHSQHLQGGLQ 99
Query: 49 ---------RNAKVALLGGGKPWS-MSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
N ++ALLGGGKPW+ A L D ARD++ L++YA+ERWE IL FMVGS
Sbjct: 100 GFVLNPVFKDNLRIALLGGGKPWADEGANLGPDRHARDVESLDRYAMERWEVILHFMVGS 159
Query: 99 QQTEGISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
+ +S D + L G E P IT GFQFLLL+T +Q+WYF L+YL+T +S+
Sbjct: 160 P-SAAVSQDLAQLLIQAGLMKSETGEAPCITSAGFQFLLLDTASQLWYFTLQYLKTAQSR 218
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
G+DLVE L+FLFQL FST G DYS EGMS+ L FLQHLREFGLV+QRKRK+ R+YPT+L
Sbjct: 219 GMDLVEILSFLFQLSFSTLGRDYSVEGMSESLLTFLQHLREFGLVFQRKRKSRRYYPTRL 278
Query: 213 ALNMATR--------------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCEL 258
A+ +A G + + GF++VETN+R+YAYT+S L++AL+ LF E+
Sbjct: 279 AITLAAGVTANPASGSASSALGAIPGTGDTGFIVVETNYRIYAYTNSELQIALVALFSEM 338
Query: 259 MYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLW 315
+YRFPNL V +TR+SV+ A+ +GITA QII FLR A P + + LPP ++DQIRLW
Sbjct: 339 LYRFPNLVVAQVTRESVQQAISNGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLW 398
Query: 316 EGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
E E++RL +EGVLY+QF+SQADFE LRD AQ LGVL WQN R +VVT GHS++K+F
Sbjct: 399 ELEKDRLQFTEGVLYNQFLSQADFEVLRDRAQGLGVLVWQNPAHRVMVVTPHGHSEVKRF 458
Query: 376 WVNHQKGS 383
W + S
Sbjct: 459 WKRQKSHS 466
>gi|417401351|gb|JAA47565.1| Putative rna polymer [Desmodus rotundus]
Length = 463
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 214/417 (51%), Positives = 287/417 (68%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK +V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKSWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSSLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D RD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHTRDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 PDA-AVSQDLAQLLIQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GH D+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHGDVKRFWKRQKHSS 463
>gi|351713119|gb|EHB16038.1| General transcription factor IIH subunit 4 [Heterocephalus glaber]
Length = 463
Score = 420 bits (1080), Expect = e-115, Method: Compositional matrix adjust.
Identities = 217/417 (52%), Positives = 290/417 (69%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGRDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNM--ATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+ G + +PGF++VETN+R+YAYT+S L+VALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGSVHQPGFIVVETNYRLYAYTESELQVALIALFSEMLYRFPNVVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIGSGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463
>gi|74143475|dbj|BAE28812.1| unnamed protein product [Mus musculus]
Length = 463
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 213/417 (51%), Positives = 289/417 (69%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPVFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+ PAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDMPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ + SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQVIASGITAQQIIHFLRTRAHPVMLKQNPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ FE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVVFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463
>gi|156378083|ref|XP_001630974.1| predicted protein [Nematostella vectensis]
gi|156218005|gb|EDO38911.1| predicted protein [Nematostella vectensis]
Length = 471
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 211/422 (50%), Positives = 285/422 (67%), Gaps = 41/422 (9%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL E+AKH+V+R LF +QPVP++++++W+ + K + + +TL +LR
Sbjct: 50 ELPELAKHFVMRTLFADQPVPESIVSTWVKSAYYKYNIGSLNTLKQLRIWREVPSGVHKR 109
Query: 49 --------RNAKVALLGGGKPWSMSAK-LEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
N K AL GGGK W S + L D R+ DFL++YA+ERWE +L FM GS
Sbjct: 110 YEMNATFRTNMKAALCGGGKSWMGSTQHLGPDKHTREPDFLDKYAIERWESVLHFMTGST 169
Query: 100 QTE----GISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLET 148
+ G+S D V+ L G NP+I+ GFQFLLL+ P+QVWYF+L+ LET
Sbjct: 170 EMADNAGGVSQDVVKVLVLSGLMKCESPGSNPIISPAGFQFLLLDRPSQVWYFMLQCLET 229
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
VE++G+DLVECL+ LFQL FS+ G DY T+G++D FLQ LRE GLV+QRKRK+ R+Y
Sbjct: 230 VEARGMDLVECLSLLFQLSFSSPGKDYPTDGLTDSQMKFLQQLREIGLVFQRKRKSRRYY 289
Query: 209 PTKLALNMATRG-----TLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFP 263
PTKL++N+ G T QI E GF+I+ETN+RVYAYT+S L+V+LIGLFCE++ RFP
Sbjct: 290 PTKLSVNLTAAGKGINTTDSQI-EAGFIIIETNYRVYAYTESCLQVSLIGLFCEILCRFP 348
Query: 264 NLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERE 320
NL V LTR+S + AL SGI+A QI+ FL+ A P + + +P +SDQ+RLWE ER
Sbjct: 349 NLCVASLTRESCQQALASGISAEQILNFLQTRAHPEMLKRTPIIPSTISDQVRLWEMERS 408
Query: 321 RLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQ 380
R+ +EGVLY+QF+SQADFE LR YA+DLGVL W N KR VVV++ GH D+K+FW +
Sbjct: 409 RMKFTEGVLYNQFLSQADFEMLRKYAEDLGVLIWANSTKRVVVVSRSGHDDVKRFWKRQR 468
Query: 381 KG 382
+G
Sbjct: 469 QG 470
>gi|344252968|gb|EGW09072.1| General transcription factor IIH subunit 4 [Cricetulus griseus]
Length = 463
Score = 417 bits (1072), Expect = e-114, Method: Compositional matrix adjust.
Identities = 215/417 (51%), Positives = 289/417 (69%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+T AQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTSAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNM--ATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+ G + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGSVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463
>gi|391335225|ref|XP_003741996.1| PREDICTED: general transcription factor IIH subunit 4-like
[Metaseiulus occidentalis]
Length = 457
Score = 416 bits (1069), Expect = e-114, Method: Compositional matrix adjust.
Identities = 214/418 (51%), Positives = 283/418 (67%), Gaps = 40/418 (9%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +A+ YVIRLL + Q VPQAVI SW A++H + D L LR
Sbjct: 39 LPPLARQYVIRLLHIRQAVPQAVINSWYDPEAARDHTSSEDALLSLRLWHETTLPGGLPG 98
Query: 49 --------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
+ AL GGG PW + +L+ D RD FL+QYA ERW+C+L +MVGS+
Sbjct: 99 WQLNPTFQEMLRQALAGGGDPWIVYGELDKDKHGRDTKFLDQYAQERWDCVLHYMVGSEV 158
Query: 101 TEGISADAVRTLFNEG-----AENP----VITKDGFQFLLLETPAQVWYFILKYLETVES 151
GIS DAVR L + G ++P +IT +GFQFLL++T QVWYFIL+YL TVE+
Sbjct: 159 ESGISQDAVRILLHSGLMKKEDDSPNSKNLITMEGFQFLLMDTEDQVWYFILQYLSTVET 218
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+ LV+CL F+FQL F T G DYS +G+S+ L VFLQHLREFGLVYQRKRK+GRFYPT+
Sbjct: 219 RGVSLVDCLQFIFQLSFLTLGKDYSIKGVSENLLVFLQHLREFGLVYQRKRKSGRFYPTR 278
Query: 212 LALNMATRGTLKQIR----EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTV 267
LA+ +A+ LK+++ E G++IVETN+RVYAYTDS L+VAL+ LFCEL+YRFPNL V
Sbjct: 279 LAIGLAS--GLKELQATKDEQGYIIVETNYRVYAYTDSPLQVALLSLFCELLYRFPNLLV 336
Query: 268 GVLTRDSVRAALRSGITAAQIIGFLRLHA----LPSVSCPLPPVVSDQIRLWEGERERLT 323
VLTR+SVR AL+ GIT+ QI FL+ + L +P V+DQ+RLWE ER+R
Sbjct: 337 AVLTRESVRQALKGGITSNQITHFLKSRSHRVVLEREEGIIPMTVTDQLRLWELERDRFK 396
Query: 324 TSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
E VLYSQF + DFE LR+YA+D+GVL ++N +KR +VV+K G +++++FW H+K
Sbjct: 397 FGESVLYSQFQTTGDFEMLRNYARDIGVLKYENPQKRFLVVSKSGDAEVRQFWKRHKK 454
>gi|348542188|ref|XP_003458568.1| PREDICTED: general transcription factor IIH subunit 4 [Oreochromis
niloticus]
Length = 461
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 212/417 (50%), Positives = 280/417 (67%), Gaps = 43/417 (10%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK+YV+R+LF++QP+PQA +A W+ K K+H E L LR
Sbjct: 40 ELPSLAKNYVMRMLFLDQPLPQAAVALWVKKDSQKDHDECISVLAGLRLWHSQQLQGGLQ 99
Query: 49 ---------RNAKVALLGGGKPWS-MSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
N K ALLGGG+ W+ + L D ARD++ L++YA+ERWE IL+FMVGS
Sbjct: 100 GYILNPVFKDNLKTALLGGGRAWADEGSTLGPDRHARDIESLDRYAMERWEIILQFMVGS 159
Query: 99 QQTEGISADAVRTLF------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
+S D + L +E E P IT GFQFLLL+T +Q+WYF L+YL+T +S+
Sbjct: 160 --PSAVSQDLAQLLVQAGLMKSEAGEAPYITSAGFQFLLLDTASQLWYFTLQYLKTAQSR 217
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
G+DLVE L+FLFQL FS+ G DYS EGMS+ L FLQHLREFGLV+QRKRK+ R+YPT+L
Sbjct: 218 GMDLVEILSFLFQLSFSSLGRDYSVEGMSESLLTFLQHLREFGLVFQRKRKSRRYYPTRL 277
Query: 213 ALNMATRGTLKQIR----------EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRF 262
A+ +A T + GF++VETN+R+YAYT+S L++AL+ LF E++YRF
Sbjct: 278 AITLAAGVTSSSSVSNLSSSPGTGDAGFIVVETNYRIYAYTNSELQIALVALFSEMLYRF 337
Query: 263 PNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGER 319
PN+ V LTR+SV+ A+ +GITA QII FLR A P + + LPP ++DQIRLWE ER
Sbjct: 338 PNVVVAQLTRESVQQAIANGITAQQIIHFLRTRAHPVMLTQTPVLPPTITDQIRLWELER 397
Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
+RL +EGVLY+QF+SQADFE LRD AQ LG L WQ+ R +VVT QGHS++K+FW
Sbjct: 398 DRLQFTEGVLYNQFLSQADFEVLRDRAQGLGCLVWQDVAHRVMVVTPQGHSEVKRFW 454
>gi|339521903|gb|AEJ84116.1| general transcription factor IIH subunit 4 [Capra hircus]
Length = 463
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 212/417 (50%), Positives = 286/417 (68%), Gaps = 35/417 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHAQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+S D + L G E P IT GFQFLLL+TPA +WYF+L+YL+T +S
Sbjct: 168 PSV-AVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAHLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G +PGF++VETN+R+YAYT+ L++A I LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTEPELQIARIALFSEMLYRFPNMVVPQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
+TR SV+ A+ SGITA QI+ FLR A P + + LPP ++DQIRLWE ER+RL +E
Sbjct: 347 VTRGSVQQAIASGITAQQIVHFLRTRADPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
GVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKDNS 463
>gi|432883135|ref|XP_004074222.1| PREDICTED: general transcription factor IIH subunit 4-like [Oryzias
latipes]
Length = 460
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 216/420 (51%), Positives = 281/420 (66%), Gaps = 50/420 (11%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK+YV+R+LF+EQP+PQA +A W+ K ++H E LT LR
Sbjct: 40 ELPLLAKNYVMRMLFLEQPLPQAAVALWVRKDGQRDHDECVSVLTGLRLWHSQQLQGGLQ 99
Query: 49 ---------RNAKVALLGGGKPWSMSAK-LEVDSKARDLDFLNQYALERWECILRFMVGS 98
N ++ALLGGGK W+ L D ARD++ L++YA+ERWE IL+FMVGS
Sbjct: 100 GYILNPVFKDNLRIALLGGGKVWADEGIILGPDRHARDIESLDRYAMERWEVILQFMVGS 159
Query: 99 QQTEGISADAVRTLF------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
+S D + L +E E P IT GFQFLLL+T +Q+WY L+YL T +S+
Sbjct: 160 --PSAVSQDLAQLLVQAGLMKSEAGEAPYITSAGFQFLLLDTASQLWYLTLQYLNTAQSR 217
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
G++LVE L+FLFQL FST G DYS EGMS+ L FLQHLREFGLV+QRKRK+ R+YPT+L
Sbjct: 218 GMELVEILSFLFQLSFSTLGRDYSVEGMSESLLTFLQHLREFGLVFQRKRKSRRYYPTRL 277
Query: 213 ALNMA-------------TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
A+ +A T GT + GF++VETN+R+YAYT+S L++AL+ LF E++
Sbjct: 278 AITLAAGVSSNSPSNMTNTPGT----GDAGFIVVETNYRIYAYTNSELQIALVALFSEML 333
Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWE 316
YRFPN+ V LTR+SV+ A+ +GITA QII FLR A P + S LPP ++DQIRLWE
Sbjct: 334 YRFPNVVVAHLTRESVQQAIANGITAQQIIHFLRTRAHPVMLKQSPALPPTITDQIRLWE 393
Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
ER+RL +EGVLY+QF+SQADFE LRD AQ LG L WQ+ R +VVT QGHS++KKFW
Sbjct: 394 LERDRLQFTEGVLYNQFLSQADFEVLRDRAQGLGCLVWQDASHRAMVVTPQGHSEVKKFW 453
>gi|410911638|ref|XP_003969297.1| PREDICTED: general transcription factor IIH subunit 4-like
[Takifugu rubripes]
Length = 465
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 211/420 (50%), Positives = 279/420 (66%), Gaps = 45/420 (10%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF++QP+PQA ++ W++K + K+H E L LR
Sbjct: 40 ELPSLAKNFVMRMLFLDQPLPQAAVSLWVNKDNQKDHDECVSVLAGLRLWHSQHLQGGLQ 99
Query: 49 ---------RNAKVALLGGGKPWS-MSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
N K+ALLGGG+ W+ L D ARD+ L++YA ERWE IL FMVGS
Sbjct: 100 GYILNPVFKDNLKIALLGGGRAWADEGGTLGPDRHARDIGSLDRYATERWEVILHFMVGS 159
Query: 99 QQTEGISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
+S D + L + G E P IT GFQFLLL+T +Q+WYF L+YL+T +S+
Sbjct: 160 P-CAAVSQDLAQLLVHAGLMKSEAGEPPYITSAGFQFLLLDTASQLWYFTLQYLKTAQSR 218
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+L
Sbjct: 219 GMDLVEILSFLFQLSFSTLGRDYSVEGMSDSLLTFLQHLREFGLVFQRKRKSRRYYPTRL 278
Query: 213 ALNMATRGTLKQIREP-------------GFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
A+ +AT + + P GF++VETN+R+YAYT+S L++AL+ LF E++
Sbjct: 279 AITLATGDSSSSLHTPTASLASTPGSGDSGFIVVETNYRIYAYTNSELQIALVALFSEML 338
Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCP-LPPVVSDQIRLWE 316
YRFPN+ V +TR+SV+ A+ +GITA QII FLR A P + P LPP ++DQIRLWE
Sbjct: 339 YRFPNVVVAQVTRESVQQAIANGITAQQIIHFLRTRAHPVMLRQTPFLPPTITDQIRLWE 398
Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
ER+RL +EGVLY+QF+SQ DFE LRD A+ LG L WQ+ R +VVT GHS++KKFW
Sbjct: 399 LERDRLQFTEGVLYNQFLSQTDFEVLRDRAKSLGCLVWQDAAHRVMVVTLWGHSEVKKFW 458
>gi|72074363|ref|XP_796111.1| PREDICTED: general transcription factor IIH subunit 4
[Strongylocentrotus purpuratus]
Length = 469
Score = 409 bits (1052), Expect = e-112, Method: Compositional matrix adjust.
Identities = 201/419 (47%), Positives = 286/419 (68%), Gaps = 37/419 (8%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL + KH ++R+LF++ P+PQA + SW+ ++ A TLT+LR
Sbjct: 48 ELPVLGKHCIMRVLFIDTPIPQAAVTSWIQSNQQEQLHAALKTLTDLRLFRDQSLPGGLP 107
Query: 49 ---------RNAKVALLGGGKPWSMSAKLEVDSK---ARDLDFLNQYALERWECILRFMV 96
N K AL+GGGKPW++S K K ++ L++Y+ ERWEC+L F+V
Sbjct: 108 GWLLNLTFRTNLKTALIGGGKPWAISGKGGKGGKDKKVKETATLDKYSSERWECVLHFLV 167
Query: 97 GSQQT-EGISADAVRTL-------FNEGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
GS + + +S D L E PVIT GFQFLLL+TP+QVW+F+L+YLET
Sbjct: 168 GSSKAVDTLSRDIATVLTHSGLMRLGESGGTPVITPSGFQFLLLDTPSQVWFFMLQYLET 227
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
+++GLD+V+ L+FLFQL FST G DYS+EGM+++ FLQHLRE GLV+QRKRK+ R+Y
Sbjct: 228 SQARGLDIVDALSFLFQLSFSTLGKDYSSEGMTEQQLHFLQHLRELGLVFQRKRKSMRYY 287
Query: 209 PTKLALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLT 266
PT+LA+N+A+ K + GF++VETNFRVYAYT+S+L+V ++GLFC +MYRFPNL+
Sbjct: 288 PTRLAINLASGVSSMAKDDHKDGFIVVETNFRVYAYTESDLQVEILGLFCSMMYRFPNLS 347
Query: 267 VGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPL-PPVVSDQIRLWEGERERLT 323
V LTR+SV+ A+ +GITA QI+ FLR HA P++ P+ PP +SDQ+RLWE ER+RL+
Sbjct: 348 VAALTRESVQLAISNGITAEQILSFLRTHAHPNMRLKTPIVPPTISDQVRLWELERDRLS 407
Query: 324 TSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
++G++Y++F+S DFE LRDYA+DLGVL W + +R ++V+ GH +KK+W +KG
Sbjct: 408 FTQGIIYNEFLSLHDFEVLRDYAKDLGVLIWDSTARRIMIVSPAGHDSVKKYWKRLKKG 466
>gi|431907067|gb|ELK11185.1| General transcription factor IIH subunit 4 [Pteropus alecto]
Length = 447
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 208/414 (50%), Positives = 283/414 (68%), Gaps = 45/414 (10%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF++QP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLDQPLPQAAVALWVKKEFSKAQEESTGLLSSLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVL 329
+TR+SV+ A+ SGITA Q + LPP ++DQIRLWE ER+RL +EGVL
Sbjct: 347 VTRESVQQAIASGITAQQ-------------TPVLPPTITDQIRLWELERDRLRFTEGVL 393
Query: 330 YSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
Y+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 394 YNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 447
>gi|334323685|ref|XP_003340422.1| PREDICTED: valyl-tRNA synthetase, mitochondrial-like [Monodelphis
domestica]
Length = 1641
Score = 403 bits (1035), Expect = e-110, Method: Compositional matrix adjust.
Identities = 217/441 (49%), Positives = 289/441 (65%), Gaps = 66/441 (14%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N +VALLGGGK WS +++L D ARD+ L++YA ERWE IL FMVGS
Sbjct: 108 GLILNPVFRQNLRVALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVILHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSSEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAY--------------------------- 242
LA+N+++ G +PGF++VETN+R+YAY
Sbjct: 287 LAINLSSGVTGAGGTAHQPGFIVVETNYRLYAYTGEVGGMRMGIGMLEGRNCGFFGIILF 346
Query: 243 ----TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP 298
T+S L++ALI LF E++YRFPN+ V +TR+SV+ A+ +GITA QII FL+ A P
Sbjct: 347 FSLATESELQIALIALFSEMLYRFPNMVVAQITRESVQQAIANGITAQQIIHFLKTRAHP 406
Query: 299 SV--SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQ 355
+ P LPP ++DQIRLWE ER+RL SEGVLY+QF+SQ DFE L +A++LGVL ++
Sbjct: 407 VMLKQIPVLPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLLSHARELGVLVFE 466
Query: 356 NEKKRTVVVTKQGHSDIKKFW 376
N KR +VVT GHSD+K+FW
Sbjct: 467 NTAKRLMVVTPAGHSDVKRFW 487
>gi|395533817|ref|XP_003768949.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Sarcophilus harrisii]
Length = 1638
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 216/451 (47%), Positives = 291/451 (64%), Gaps = 69/451 (15%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 81 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 140
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N +VALLGGGK WS +++L D ARD+ L++YA ERWE IL FMVGS
Sbjct: 141 GFILNPVFRQNLRVALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVILHFMVGS 200
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 201 P-SAAVSQDLAQLLSQAGLMKSSEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 259
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 260 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 319
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAY--------------------------- 242
LA+N+++ G +PGF++VETN+R+YAY
Sbjct: 320 LAINLSSGVTGAGGTAHQPGFIVVETNYRLYAYTGEVGGIKMRIEIRGGGIGGELNLLWS 379
Query: 243 -------TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLH 295
T+S L++ALI LF E++YRFPN+ V +TR+SV+ A+ +GITA QII F++
Sbjct: 380 SIVFSLETESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIANGITAQQIIHFMKTR 439
Query: 296 ALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
A P + + LPP ++DQIRLWE ER+RL SEGVLY+QF+SQ DFE L +A++LGVL
Sbjct: 440 AHPVMLKQTPVLPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLLSHARELGVL 499
Query: 353 TWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
++N KR +VVT GHSD+K+FW + S
Sbjct: 500 VFENTAKRLMVVTPAGHSDVKRFWKRQKHSS 530
>gi|402866404|ref|XP_003897374.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial,
partial [Papio anubis]
Length = 1647
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 213/448 (47%), Positives = 290/448 (64%), Gaps = 68/448 (15%)
Query: 5 IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
+ K++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 97 LGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLIL 156
Query: 49 -----RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS +
Sbjct: 157 NPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSP-SA 215
Query: 103 GISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
+S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S+G+D
Sbjct: 216 AVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMD 275
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
LVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+LA+N
Sbjct: 276 LVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAIN 335
Query: 216 MAT--RGTLKQIREPGFLIVETNFRVYAYT------------------------------ 243
+++ G + +PGF++VETN+R+YAYT
Sbjct: 336 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEVGQSAPEEKKEGLEGVWVWGWPPHPLSI 395
Query: 244 ---DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
+S L++ALI LF E++YRFPN+ V +TR+SV+ A+ SGITA QII FLR A P +
Sbjct: 396 PGSESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVM 455
Query: 301 ---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNE 357
+ LPP ++DQIRLWE ER+RL +EGVLY+QF+SQ DFE L +A++LGVL ++N
Sbjct: 456 LKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENS 515
Query: 358 KKRTVVVTKQGHSDIKKFWVNHQKGSQH 385
KR +VVT GHSD+K+FW + S+
Sbjct: 516 AKRLMVVTPAGHSDVKRFWKRQKHSSRQ 543
>gi|62088538|dbj|BAD92716.1| VARS2L protein variant [Homo sapiens]
Length = 1653
Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust.
Identities = 213/452 (47%), Positives = 290/452 (64%), Gaps = 73/452 (16%)
Query: 5 IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
+ K++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 98 LGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLIL 157
Query: 49 -----RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS +
Sbjct: 158 NPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSP-SA 216
Query: 103 GISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
+S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S+G+D
Sbjct: 217 AVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMD 276
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
LVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+LA+N
Sbjct: 277 LVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAIN 336
Query: 216 MAT--RGTLKQIREPGFLIVETNFRVYAYT------------------------------ 243
+++ G + +PGF++VETN+R+YAYT
Sbjct: 337 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEAGQRAPGRGGWGHLKEGLEGVWVWGWPP 396
Query: 244 --------DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLH 295
+S L++ALI LF E++YRFPN+ V +TR+SV+ A+ SGITA QII FLR
Sbjct: 397 LPLSIPGSESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTR 456
Query: 296 ALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
A P + + LPP ++DQIRLWE ER+RL +EGVLY+QF+SQ DFE L +A++LGVL
Sbjct: 457 AHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVL 516
Query: 353 TWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQ 384
++N KR +VVT GHSD+K+FW + S+
Sbjct: 517 VFENSAKRLMVVTPAGHSDVKRFWKRQKHSSR 548
>gi|403308524|ref|XP_003944708.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Saimiri boliviensis boliviensis]
Length = 1752
Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust.
Identities = 213/451 (47%), Positives = 289/451 (64%), Gaps = 73/451 (16%)
Query: 5 IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
+ K++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 196 LGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLIL 255
Query: 49 -----RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS +
Sbjct: 256 NPVFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSP-SA 314
Query: 103 GISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
+S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S+G+D
Sbjct: 315 AVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMD 374
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
LVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+LA+N
Sbjct: 375 LVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAIN 434
Query: 216 MAT--RGTLKQIREPGFLIVETNFRVYAYT------------------------------ 243
+++ G + +PGF++VETN+R+YAYT
Sbjct: 435 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEVGQRAPGRAGWGRGNARLEGVWVWGWPP 494
Query: 244 --------DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLH 295
+S L++ALI LF E++YRFPN+ V +TR+SV+ A+ SGITA QII FLR
Sbjct: 495 HPLSIPGSESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTR 554
Query: 296 ALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
A P + + LPP ++DQIRLWE ER+RL +EGVLY+QF+SQ DFE L +A++LGVL
Sbjct: 555 AHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVL 614
Query: 353 TWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
++N KR +VVT GHSD+K+FW + S
Sbjct: 615 VFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 645
>gi|301786953|ref|XP_002928911.1| PREDICTED: LOW QUALITY PROTEIN: valyl-tRNA synthetase,
mitochondrial-like [Ailuropoda melanoleuca]
Length = 1637
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 215/459 (46%), Positives = 291/459 (63%), Gaps = 77/459 (16%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
+L +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 71 QLPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 130
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 131 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 190
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 191 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 249
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 250 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 309
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYT-------------------------- 243
LA+N+++ G +PGF++VETN+R+YAYT
Sbjct: 310 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTGEARQRAPNSRXGVGTFLVTLEGLEG 369
Query: 244 ----------------DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQ 287
+S L++ALI LF E++YRFPN+ V +TR+SV+ A+ SGITA Q
Sbjct: 370 VWVWGXPSSSLPIPGPESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQ 429
Query: 288 IIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRD 344
II FLR A P + + LPP ++DQIRLWE ER+RL +EGVLY+QF+SQ DFE L
Sbjct: 430 IIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLA 489
Query: 345 YAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
+A++LGVL ++N KR +VVT GHSD+K+FW + S
Sbjct: 490 HARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 528
>gi|390461382|ref|XP_002746360.2| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Callithrix jacchus]
Length = 1675
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 213/452 (47%), Positives = 289/452 (63%), Gaps = 74/452 (16%)
Query: 5 IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
+ K++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 118 LGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLIL 177
Query: 49 -----RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS +
Sbjct: 178 NPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSP-SA 236
Query: 103 GISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
+S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S+G+D
Sbjct: 237 AVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMD 296
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
LVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+LA+N
Sbjct: 297 LVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAIN 356
Query: 216 MAT--RGTLKQIREPGFLIVETNFRVYAYT------------------------------ 243
+++ G + +PGF++VETN+R+YAYT
Sbjct: 357 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEAGQRAPGRAGWGXHLKERFEGVWVWGWP 416
Query: 244 ---------DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRL 294
+S L++ALI LF E++YRFPN+ V +TR+SV+ A+ SGITA QII FLR
Sbjct: 417 PHPLSIPGSESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRT 476
Query: 295 HALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGV 351
A P + + LPP ++DQIRLWE ER+RL +EGVLY+QF+SQ DFE L +A++LGV
Sbjct: 477 RAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGV 536
Query: 352 LTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
L ++N KR +VVT GHSD+K+FW + S
Sbjct: 537 LVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 568
>gi|193654859|ref|XP_001944632.1| PREDICTED: general transcription factor IIH subunit 4-like
[Acyrthosiphon pisum]
Length = 470
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/413 (47%), Positives = 275/413 (66%), Gaps = 33/413 (7%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +IA+ + +R+LF+EQPVP + ++SW+ +++E +E+ + L
Sbjct: 58 LPDIAQVFTLRILFIEQPVPLSALSSWVPAKYSRELEESIEVAINLHIWKLTSVSGGLKG 117
Query: 48 -------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
++ KVA++ GG+ ++ E KARD+D L+ YA ERWECIL +MVGS+
Sbjct: 118 WILNSTFKKKLKVAIMSGGRT-TVPNSDETALKARDIDILDSYAYERWECILYYMVGSKH 176
Query: 101 TEGISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
+GIS+DAVR L N G +PVIT G+QFLLL+ QVWYF+L Y+ETVES+GL
Sbjct: 177 -KGISSDAVRVLLNAGLMVRDTDNSPVITSTGYQFLLLDMATQVWYFMLHYMETVESRGL 235
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
DL + L FLFQ+ ST G DY T+GM + LQ FLQHL+EFGLVYQRK KAGRF PT+L +
Sbjct: 236 DLAQYLIFLFQIHLSTLGWDYITDGMPENLQTFLQHLQEFGLVYQRKLKAGRFCPTRLVI 295
Query: 215 NMA---TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLT 271
M + + + ++ +++VETNFR+YA TDS+LKVAL+ LF ++YRFPN++ G+LT
Sbjct: 296 EMGRENSHTSKRMKKKERYIVVETNFRIYAMTDSDLKVALVALFTHMLYRFPNMSAGILT 355
Query: 272 RDSVRAALRSGITAAQIIGFLRLHALPSV-SCPLPPVVSDQIRLWEGERERLTTSEGVLY 330
RDSV+ ALR+GITAAQI+ FL +H P + C +P V DQI LWE ER RLT +GVLY
Sbjct: 356 RDSVQTALRNGITAAQIVRFLTVHTHPQMQECGMPQTVIDQIYLWEYERNRLTYRDGVLY 415
Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
S + D+E +++YA D+G L W +E++R ++V+ GH+D+ FW K +
Sbjct: 416 SDINTPNDYEAIKNYAADIGALLWCDERQRNIIVSTDGHNDVTTFWKKQPKSN 468
>gi|397471696|ref|XP_003807420.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Pan paniscus]
Length = 1673
Score = 397 bits (1019), Expect = e-108, Method: Compositional matrix adjust.
Identities = 213/459 (46%), Positives = 290/459 (63%), Gaps = 80/459 (17%)
Query: 5 IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
+ K++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 111 LGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLIL 170
Query: 49 -----RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS +
Sbjct: 171 NPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSP-SA 229
Query: 103 GISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
+S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S+G+D
Sbjct: 230 AVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMD 289
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
LVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+LA+N
Sbjct: 290 LVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAIN 349
Query: 216 MAT--RGTLKQIREPGFLIVETNFRVYAYT------------------------------ 243
+++ G + +PGF++VETN+R+YAYT
Sbjct: 350 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEAGQRAPGRGGWGALXHFSXHLKEGLEGV 409
Query: 244 ---------------DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
+S L++ALI LF E++YRFPN+ V +TR+SV+ A+ SGITA QI
Sbjct: 410 WVWGCPPHPLSIPGSESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 469
Query: 289 IGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDY 345
I FLR A P + + LPP ++DQIRLWE ER+RL +EGVLY+QF+SQ DFE L +
Sbjct: 470 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 529
Query: 346 AQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQ 384
A++LGVL ++N KR +VVT GHSD+K+FW + S+
Sbjct: 530 ARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSSR 568
>gi|16198089|gb|AAL13841.1| LD30622p [Drosophila melanogaster]
Length = 350
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 188/349 (53%), Positives = 245/349 (70%), Gaps = 31/349 (8%)
Query: 64 MSAKLEVDSKARDLDFLNQYALERWECILRFMVGS-----QQTEGISADAVRTLFN---- 114
M+ L+ DSK RD+ FL+ YA+ RW C+L +MVG+ E IS DAVR L +
Sbjct: 1 MTNTLDKDSKPRDIAFLDTYAMSRWRCVLHYMVGTGNRNGTDAEAISPDAVRILLHANLM 60
Query: 115 --EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFG 172
+ + IT+ GFQFLLL+T AQVW+F+L+YL+T E +G+ L ECL+ LFQL FST G
Sbjct: 61 KRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDTCEERGISLPECLSMLFQLSFSTLG 120
Query: 173 TDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTL---------K 223
DYS+EGM+ ++ FLQHLREFGLV+QRKRK GRFYPT+LALN+ ++ +
Sbjct: 121 RDYSSEGMNSQMLTFLQHLREFGLVFQRKRKEGRFYPTRLALNVTSKEAAATASVAMDEE 180
Query: 224 QIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGI 283
++ G+++VETN+RVYAYTDS L+VA++GLF EL+YRFPNL VGVLTRDSVR ALR GI
Sbjct: 181 ATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGI 240
Query: 284 TAAQIIGFLRLHALPSV----------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
TA QI+ +L +A P++ SC LPP V DQI+LWE ER R T +EGVLY+QF
Sbjct: 241 TAEQIVSYLEQYAHPNMRMVESAIHSKSC-LPPTVVDQIKLWELERNRFTYTEGVLYNQF 299
Query: 334 VSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
+S DF LRDYAQ + +L WQNE+ RT+VV K GH D+K++W + K
Sbjct: 300 LSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSKS 348
>gi|198428509|ref|XP_002131018.1| PREDICTED: similar to General transcription factor II H,
polypeptide 4 [Ciona intestinalis]
Length = 463
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/421 (44%), Positives = 274/421 (65%), Gaps = 39/421 (9%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL E+AK+Y++R+LFVEQP+ +A +++W+ ++H EA ++ LR
Sbjct: 42 ELPELAKYYIMRILFVEQPISKAAVSAWVKVNAKQDHNEAVKSMCSLRVWMESNLQGSAS 101
Query: 49 ----------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
RN + AL+GGG+PWS +A L D +D++ L++YA ERWE +L ++VGS
Sbjct: 102 TAFIMSSIFRRNLQKALVGGGEPWSSTAHLGPDKHGKDIESLDKYASERWEMLLHYLVGS 161
Query: 99 QQTEGISADAVRTLFNE----------GAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
+ IS D ++ L N+ G +IT GFQFLLL+ +QVWYF+L+YL+
Sbjct: 162 ETNSTISQD-IKDLINQAGLMKQGSEGGRSYSIITAKGFQFLLLDNASQVWYFVLEYLDW 220
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
V +G++LV L F+F+L FS+ G D T+G D + + LQH RE GLV QRKRK+ RFY
Sbjct: 221 VRERGMNLVPILRFIFELSFSSPGKDLPTDGRDDHVLICLQHFREMGLVMQRKRKSRRFY 280
Query: 209 PTKLALNMATRGT---LKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
PT LA+N+A R GF++ ETNFRVYAYTDS L+ +++ LFC+++YRFPN+
Sbjct: 281 PTYLAINLANRANASVTNVTNSRGFILAETNFRVYAYTDSELQYSILSLFCKMLYRFPNV 340
Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLP---PVVSDQIRLWEGERERL 322
V LTR S+++A+ +GITA QI+ +++ +A P + P P ++DQ+RLW ER+RL
Sbjct: 341 CVLQLTRKSIQSAVVNGITADQILHYIKANAHPDMLKNDPIIAPTLADQVRLWAMERDRL 400
Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
T +GVLY+QF++Q DFE LR+YA++LG L W+N KR +VVTK GH +K++W ++K
Sbjct: 401 TYRDGVLYNQFLAQKDFEVLRNYAKELGALIWENSSKRYMVVTKDGHDQVKRYWKKYKKE 460
Query: 383 S 383
S
Sbjct: 461 S 461
>gi|195999048|ref|XP_002109392.1| hypothetical protein TRIADDRAFT_20428 [Trichoplax adhaerens]
gi|190587516|gb|EDV27558.1| hypothetical protein TRIADDRAFT_20428 [Trichoplax adhaerens]
Length = 452
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 196/417 (47%), Positives = 270/417 (64%), Gaps = 41/417 (9%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
ELS++AKHYV+R LF Q +P + I++W++ H KEH+ A + LTE+R
Sbjct: 40 ELSQLAKHYVMRTLFTIQAIPTSTISAWVNSQHQKEHQAALNQLTEIRIWQSHTLPGGMQ 99
Query: 49 ---------RNAKVALLGGGKPW-----SMSAKLEVDSKARDLDFLNQYALERWECILRF 94
N K AL GGG+ W S + K + S+A DL + ERWEC+L +
Sbjct: 100 GIKLNESFRANLKTALCGGGESWYGPQLSKADKYQFTSEALDLK-----SKERWECVLHY 154
Query: 95 MVGSQQTEGISADAVRTLFNEG-AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
+VG+ + +S + L N G IT GFQFLLL+ P+QVWY +L+YL++ ES G
Sbjct: 155 LVGAGEDLALSGEIQSLLNNCGLVSKNGITAAGFQFLLLDRPSQVWYILLQYLDSAESLG 214
Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
+DLVE L+FLFQL +STFG +YSTEG++ LQH+RE GLV QRKRK +FYPT+LA
Sbjct: 215 MDLVEILSFLFQLSYSTFGQNYSTEGLTQTQLTCLQHMREIGLVVQRKRKDMKFYPTQLA 274
Query: 214 LNMAT---RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
+N+A+ + L + G++IVETN+R+YAYT+S L +AL+ LFCE+MYR P L VG++
Sbjct: 275 INLASGAKQEELDHSKSSGYIIVETNYRLYAYTESPLDIALVALFCEMMYRLPGLCVGLI 334
Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALP----SVSCPL-PPVVSDQIRLWEGERERLTTS 325
TR+SV+ A +GITA +II F+R HA P V+ P+ P + DQ+ LWE ER RL+ S
Sbjct: 335 TRESVQQAFTNGITANKIINFIRTHAHPEAKKKVTTPIVPSTIIDQLYLWELERGRLSCS 394
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWV-NHQK 381
+GVLY+Q +S +DFE LR YA D+GVL W + KR +VV + GH+ +K FW N QK
Sbjct: 395 DGVLYNQILSSSDFEALRKYADDMGVLLWASPAKRLLVVNRDGHNHVKHFWKRNKQK 451
>gi|221120480|ref|XP_002159945.1| PREDICTED: general transcription factor IIH subunit 4-like [Hydra
magnipapillata]
Length = 458
Score = 350 bits (899), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 186/411 (45%), Positives = 256/411 (62%), Gaps = 38/411 (9%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L E+AKHY++R LF Q + A + SW K AK+H EA + L L
Sbjct: 44 LPELAKHYIMRTLFTNQSLSDAFVLSWCKKEWAKDHIEAVNKLKGLHIWVSFEIGTPMLR 103
Query: 48 -------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
+ N ++ L GGG + K + ARD+ FL+ Y+ ERWECIL +M GSQ
Sbjct: 104 YEFNITFQTNLRIGLCGGGPVQATPLKAVEEKHARDVAFLDNYSKERWECILYYMTGSQV 163
Query: 101 T--EGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
T G+S D R L N G + IT GFQFLLL+T +QVWYF++++L ++E
Sbjct: 164 TGNAGVSQDVARVLINAGLLIFDHQEQATCITSSGFQFLLLDTSSQVWYFMVQHLNSMEP 223
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+ LV+CL+FLFQ FS G DY +++ FLQ LRE GL +QRKRK+ RFYPT+
Sbjct: 224 E--ILVQCLSFLFQTSFSVLGKDYPVNDLTEAQFSFLQLLREIGLAFQRKRKSKRFYPTR 281
Query: 212 LALNMA---TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
LA+N+ T + ++ GFL+VETN+RVYAYTDS L +AL+ LF ++ RFP+ TV
Sbjct: 282 LAINLGSAVTGNSDSSSQQQGFLVVETNYRVYAYTDSVLHIALLSLFTDIKARFPSFTVA 341
Query: 269 VLTRDSVRAALRSGITAAQIIGFL--RLHALPSVSCPL-PPVVSDQIRLWEGERERLTTS 325
+L+R+SV+ AL GI+A QII FL R H+ + S P+ ++DQI+LWE ER+RL S
Sbjct: 342 LLSRESVQQALACGISAEQIIDFLKTRAHSQMTTSSPIIASTITDQIKLWEMERDRLRYS 401
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
+GVLY+QF+SQ+DFE LR +A + L W N +KR +VV+K GH D+KK+W
Sbjct: 402 QGVLYNQFLSQSDFEMLRKFADEKNHLLWANNQKRLMVVSKSGHEDVKKYW 452
>gi|340371925|ref|XP_003384495.1| PREDICTED: general transcription factor IIH subunit 4-like
[Amphimedon queenslandica]
Length = 446
Score = 350 bits (898), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 180/414 (43%), Positives = 262/414 (63%), Gaps = 35/414 (8%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW--------------LSKTHAKE-------HKEA 40
LSE+ + Y++R+LF+E+P+P A I SW L+K H E ++
Sbjct: 36 LSELGQLYIMRVLFIERPLPLADINSWALPGNNLHAETLSLLNKLHIFEPVLIRTGQQQG 95
Query: 41 TDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
+ N ++ + GGG+ W++ D +RD+ +LN YA ++WE IL FMVG
Sbjct: 96 YQLNQSYQTNLRILICGGGQLWAVPGHRGDDKHSRDIPYLNDYADKKWENILNFMVGG-- 153
Query: 101 TEGISADAVRTLFNEG------AENP-VITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
+E + ++ V+ L G + +P IT GFQFLL++T +Q+W FIL+YL T + +G
Sbjct: 154 SEKLDSEIVQVLQYAGLMSQASSRHPSFITTLGFQFLLMDTQSQIWQFILQYLNTAQDRG 213
Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
+DL++CL F+FQL FST G DYST+G+++ F+ HL +FGLVYQRK + R+YPTKL
Sbjct: 214 MDLIDCLKFIFQLSFSTLGKDYSTKGLTESQLTFMYHLCQFGLVYQRKSSSKRYYPTKLV 273
Query: 214 LNMATRGTL-KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
+ + T+ + GF+IVETN+RV AYTDS L +A + LFC++ YRFPN+ VG +TR
Sbjct: 274 IQLTAGETIGMNTSQAGFIIVETNYRVIAYTDSKLHIATLALFCQMQYRFPNVAVGTITR 333
Query: 273 DSVRAALRSGITAAQIIGFLRLHA---LPSVSCPLPPVVSDQIRLWEGERERLTTSEGVL 329
+S++ AL GI A QII FL HA + S + LPP V+DQI+LWE ER RL+ EG+L
Sbjct: 334 ESIQQALVHGIKADQIISFLTQHAHHNMLSKAHILPPTVTDQIKLWEIERNRLSFQEGIL 393
Query: 330 YSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
YS+F+S D+EK++ YA+DLGVL W N ++R +V+ H +I+ FW QK S
Sbjct: 394 YSEFLSVTDYEKVKKYAEDLGVLLWSNRQRRLMVIHPDRHDEIRHFW-KRQKSS 446
>gi|194388768|dbj|BAG60352.1| unnamed protein product [Homo sapiens]
Length = 386
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 170/329 (51%), Positives = 229/329 (69%), Gaps = 32/329 (9%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 53 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 112
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 113 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 172
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 173 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 231
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 232 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 291
Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
LA+N+++ G + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 292 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 351
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALP 298
+TR+SV+ A+ SGITA QII FLR A P
Sbjct: 352 VTRESVQQAIASGITAQQIIHFLRTRAHP 380
>gi|405952750|gb|EKC20525.1| General transcription factor IIH subunit 4 [Crassostrea gigas]
Length = 380
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 177/374 (47%), Positives = 226/374 (60%), Gaps = 73/374 (19%)
Query: 35 KEHKEATDTLTELR---------------------RNAKVALLGGGKPWSMSA-KLEVDS 72
+EHK A L+ELR N KVALLGGG + L D
Sbjct: 40 REHKFAAKVLSELRVWHEQQMQGGLLGWVLNGTFRSNMKVALLGGGMDSQFTGGPLPPDK 99
Query: 73 KARDLDFLNQYALERWECILRFMVGSQQ-TEGISADAVRTLFNEGAEN-------PVITK 124
A+D+ FL++YALERWEC+L FMVGS + TEG+S D + L N G P IT
Sbjct: 100 HAKDVPFLDKYALERWECVLHFMVGSTEGTEGVSKDIIDVLLNAGLMTMDGVDPMPSITP 159
Query: 125 DGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKL 184
GFQFLL++ +QVWYF+L+YL+TVE++G+DL++CL+FLFQL FST G
Sbjct: 160 AGFQFLLMDIGSQVWYFMLQYLDTVEARGMDLIDCLSFLFQLSFSTLG------------ 207
Query: 185 QVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT-RGTLKQI-REPGFLIVETNFRVYAY 242
KRK R+YPT+LA+N+A + + + G+L+VETN+RVYAY
Sbjct: 208 ----------------KRKIQRYYPTRLAINLAAGQSDFTSVGKNTGYLMVETNYRVYAY 251
Query: 243 TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC 302
TDS L+VAL+ LFCE++YRFP +VG LTR SVR AL GITA Q +
Sbjct: 252 TDSPLQVALVALFCEMLYRFPTFSVGNLTRVSVRDALIRGITADQ-------------TP 298
Query: 303 PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTV 362
+P V+DQ+RLWE ER+R EGVLY QF+SQ DFE LRDYA+DLGVL W N KR +
Sbjct: 299 VIPSTVTDQVRLWELERDRFKFMEGVLYDQFLSQNDFELLRDYAKDLGVLLWDNAIKRVM 358
Query: 363 VVTKQGHSDIKKFW 376
VVTK GH D+K++W
Sbjct: 359 VVTKGGHDDVKRYW 372
>gi|353231086|emb|CCD77504.1| putative tfiih, polypeptide [Schistosoma mansoni]
Length = 486
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 184/453 (40%), Positives = 254/453 (56%), Gaps = 73/453 (16%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL E+AKH+V+RLLF+EQP+P+++++ W+ K + ++ LT+LR
Sbjct: 37 ELPELAKHFVMRLLFIEQPIPKSIVSGWVEKGSSALLNDSCKALTDLRIWHSTDSNVSRG 96
Query: 49 ---------RNAKVALLGGGKPWSMSAKLEV---DSKARDLDFLNQYALERWECILRFMV 96
+ +++L GGGKP + L V D ++ +DFL YA ERW+ IL FMV
Sbjct: 97 SWSLNKRYQESIRISLFGGGKP--LLGDLGVVTNDKYSKSVDFLKSYAAERWDAILHFMV 154
Query: 97 GSQQTE-GISADAVRTLFN----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVE 150
GS+ E G V L N EG + P+ ITK GF FLL+ QV FIL Y + ++
Sbjct: 155 GSESAEVGSVVKDVLLLSNLMKCEGNDCPIGITKHGFHFLLMSRQFQVLVFILHYFDYLK 214
Query: 151 SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT 210
+LV L F+FQL F Y E +S Q LQH+RE GL YQRKR A RFY T
Sbjct: 215 ENSKNLVGALQFVFQLSFLCPTKSYPVEALSTAQQEVLQHMRELGLAYQRKRTAPRFYVT 274
Query: 211 KLALNMA-------------------------------------TRGTLKQIREPGFLIV 233
LAL+ A ++ ++ + G++++
Sbjct: 275 PLALDFAGGHTTFLESKSGGWGPQSGVIPTGVSKSDPTDSNKMFSQISVSNSSDVGYILL 334
Query: 234 ETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLR 293
ETNFR+YAYTDS L+ AL+ LF ++ RFPNL V +TRDSVR AL GITA QI+ FL
Sbjct: 335 ETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGITANQILSFLT 394
Query: 294 LHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLG 350
+A P + + LPP ++DQIRLWE ER+R EG LY QF DFE +RDYA+ +G
Sbjct: 395 ANAHPDMLLQNPILPPTLTDQIRLWELERDRFLFQEGCLYEQFSRNTDFEMVRDYAKSIG 454
Query: 351 VLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
VL W+N ++R +VV+K GH D++KFW H++ S
Sbjct: 455 VLLWENPERRLMVVSKAGHEDVRKFW-KHKRPS 486
>gi|256086358|ref|XP_002579367.1| tfiih polypeptide [Schistosoma mansoni]
Length = 486
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 184/453 (40%), Positives = 253/453 (55%), Gaps = 73/453 (16%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL E+AKH+V+RLLF+EQP+P+++++ W+ K ++ LT+LR
Sbjct: 37 ELPELAKHFVMRLLFIEQPIPKSIVSGWVEKGSIALLNDSCKALTDLRIWHSTDSNVSRG 96
Query: 49 ---------RNAKVALLGGGKPWSMSAKLEV---DSKARDLDFLNQYALERWECILRFMV 96
+ +++L GGGKP + L V D ++ +DFL YA ERW+ IL FMV
Sbjct: 97 SWSLNKRYQESIRISLFGGGKP--LLGDLGVVTNDKYSKSVDFLKSYAAERWDAILHFMV 154
Query: 97 GSQQTE-GISADAVRTLFN----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVE 150
GS+ E G V L N EG + P+ ITK GF FLL+ QV FIL Y + ++
Sbjct: 155 GSESAEVGSVVKDVLLLSNLMKCEGNDCPIGITKHGFHFLLMSRQFQVLVFILHYFDYLK 214
Query: 151 SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT 210
+LV L F+FQL F Y E +S Q LQH+RE GL YQRKR A RFY T
Sbjct: 215 ENSKNLVGALQFVFQLSFLCPTKSYPVEALSTAQQEVLQHMRELGLAYQRKRTAPRFYVT 274
Query: 211 KLALNMA-------------------------------------TRGTLKQIREPGFLIV 233
LAL+ A ++ ++ + G++++
Sbjct: 275 PLALDFAGGHTTFLESKSGGWGPQSGVIPTGVSKSDPTDSNKMFSQISVSNSSDVGYILL 334
Query: 234 ETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLR 293
ETNFR+YAYTDS L+ AL+ LF ++ RFPNL V +TRDSVR AL GITA QI+ FL
Sbjct: 335 ETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGITANQILSFLT 394
Query: 294 LHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLG 350
+A P + + LPP ++DQIRLWE ER+R EG LY QF DFE +RDYA+ +G
Sbjct: 395 ANAHPDMLLQNPILPPTLTDQIRLWELERDRFLFQEGCLYEQFSRNTDFEMVRDYAKSIG 454
Query: 351 VLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
VL W+N ++R +VV+K GH D++KFW H++ S
Sbjct: 455 VLLWENPERRLMVVSKAGHEDVRKFW-KHKRPS 486
>gi|320167307|gb|EFW44206.1| transcription factor tfb2 [Capsaspora owczarzaki ATCC 30864]
Length = 406
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 165/354 (46%), Positives = 227/354 (64%), Gaps = 20/354 (5%)
Query: 47 LRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
R + AL GGGK W+ + LE D+ A+D+ L QY+ RWE +L+FM + TEG++
Sbjct: 33 FRVGLRTALSGGGKSWAGADPLEPDAHAKDISELEQYSRARWEAVLQFMAAAN-TEGVTQ 91
Query: 107 DAVRTLFN----------EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
+ V L + EGA PVIT GFQFLL + QVWYF+ +YL +++ +G D
Sbjct: 92 EVVNVLIDAELISLSTTGEGAGRPVITNKGFQFLLQDVATQVWYFLTQYLNSLQKRGADP 151
Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM 216
VE L LF+L FST G DY EG+++ LQHLRE GLV++RKR + RFYPT LA+N+
Sbjct: 152 VEALALLFRLSFSTVGMDYPVEGLTEGQLDLLQHLREIGLVFRRKRTSRRFYPTPLAINL 211
Query: 217 ATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
A+ G+ K + G+++VETNFR+YAYTDS L++AL+ LF +L YRFPN+ G+L+RD
Sbjct: 212 AS-GSAKNLDAADVKGYIVVETNFRIYAYTDSPLQLALLSLFVDLRYRFPNMVCGLLSRD 270
Query: 274 SVRAALRSGITAAQIIGFLRLHALP---SVSCPLPPVVSDQIRLWEGERERLTTSEGVLY 330
SVR AL G+TA Q+I FLR HA P S + LP +SDQ+RLWE ER RL VLY
Sbjct: 271 SVRKALVKGLTAEQMIRFLRTHAHPQMRSRTPVLPETISDQLRLWELERNRLRVLPAVLY 330
Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQ 384
+F +Q + + L Y +DLGV + + +VVT +GH IK FW H++ ++
Sbjct: 331 ERFSNQREHDLLHHYGRDLGVELAHSSQH--IVVTFEGHEQIKTFWQAHRQQAE 382
>gi|444727354|gb|ELW67853.1| Valyl-tRNA synthetase, mitochondrial [Tupaia chinensis]
Length = 1382
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 178/407 (43%), Positives = 245/407 (60%), Gaps = 82/407 (20%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGK 60
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T LL G +
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEEST------------GLLSGLR 95
Query: 61 PWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISADAVRTLFNEGAENP 120
W + + G G+ + +F +
Sbjct: 96 IWHT----------------------------QLLPG-----GLQGLILNPIFRQNLRIA 122
Query: 121 VITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGT------- 173
++ QFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE L+FLFQL FST G
Sbjct: 123 LLEIVA-QFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKRRGERGR 181
Query: 174 ------------------------DYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYP 209
DYS EGMSD L FLQHLREFGLV+QRKRK+ R+YP
Sbjct: 182 VHTVMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYP 241
Query: 210 TKLALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTV 267
T+LA+N+++ G +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 242 TRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVV 301
Query: 268 GVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTT 324
+TR+SV+ A+ SGITA QII FLR A P + + LPP ++DQIRLWE ER+RL
Sbjct: 302 AQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRF 361
Query: 325 SEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSD 371
+EGVLY+QF+SQ DFE L +A++LGVL ++N KR +VVT GHSD
Sbjct: 362 TEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSD 408
>gi|313224499|emb|CBY20289.1| unnamed protein product [Oikopleura dioica]
Length = 411
Score = 287 bits (735), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 157/407 (38%), Positives = 237/407 (58%), Gaps = 28/407 (6%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +IAKH +IR+LF+ Q + + +I SW++ H ++ + A + +T LR
Sbjct: 6 ELPQIAKHIIIRILFINQQIAKPLIESWVADEHREKFEVAMEIITGLRIWENTNDGIAFN 65
Query: 49 -RNA-----KVALLGGGKPWSMSAK-LEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
N+ + AL GGG+ W + + L D A+ ++ L+ Y ERW+ IL F+ Q
Sbjct: 66 LNNSFRTYLQEALFGGGETWRPAVETLGADKNAKTVEQLDTYTKERWDQILSFLT---QE 122
Query: 102 EGISADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLT 161
+G ++ V +L + FQFLLL+ +QVWY +++YL V+ GL LV L
Sbjct: 123 QGKLSEEVISLLKYAGLCDANGEKRFQFLLLDRSSQVWYLLVQYLGYVQKLGLSLVNVLA 182
Query: 162 FLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT 221
F+ QL + +FGTDY + ++++ +QH RE GL+++RK K RFYPT+LA +++ G
Sbjct: 183 FVLQLGYCSFGTDYPCDNSNNEISRVIQHFREMGLIFKRKSKEQRFYPTRLAQSISIAGG 242
Query: 222 LKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAA 278
K E F++VETN+R+YAYTDS L ALI LF E+ YRFP + V ++RDS++ +
Sbjct: 243 KKASSEDVQEQFILVETNYRIYAYTDSELHYALISLFAEVQYRFPYMIVAQMSRDSIQQS 302
Query: 279 LRSGITAAQIIGFLRLHALPSVSCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVS 335
GI+A QI+ +LR A P +P VV D I LW ERERL ++G+LY QF+
Sbjct: 303 ADYGISAEQILNYLRSSAHPIARKNKHWVPQVVEDNIHLWCKERERLKFNDGLLYHQFLD 362
Query: 336 QADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
Q FE L+ YAQD+ L W N+++R +VV H IK ++ + G
Sbjct: 363 QEAFEMLKSYAQDIRALVWANDERRFMVVAPWSHDQIKSYYKQIKDG 409
>gi|313245784|emb|CBY34779.1| unnamed protein product [Oikopleura dioica]
Length = 456
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 157/407 (38%), Positives = 237/407 (58%), Gaps = 28/407 (6%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +IAKH +IR+LF+ Q + + +I SW++ H ++ + A + +T LR
Sbjct: 51 ELPQIAKHIIIRILFINQQIAKPLIESWVADEHREKFEVAMEIITGLRIWENTNDGIAFN 110
Query: 49 -RNA-----KVALLGGGKPWSMSAK-LEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
N+ + AL GGG+ W + + L D A+ ++ L+ Y ERW+ IL F+ Q
Sbjct: 111 LNNSFRTYLQEALFGGGETWRPAVETLGADKNAKTVEQLDTYTKERWDQILSFLT---QE 167
Query: 102 EGISADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLT 161
+G ++ V +L + FQFLLL+ +QVWY +++YL V+ GL LV L
Sbjct: 168 QGKLSEEVISLLKYAGLCDANGEKRFQFLLLDRSSQVWYLLVQYLGYVQKLGLSLVNVLA 227
Query: 162 FLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT 221
F+ QL + +FGTDY + ++++ +QH RE GL+++RK K RFYPT+LA +++ G
Sbjct: 228 FVLQLGYCSFGTDYPCDNSNNEISRVIQHFREMGLIFKRKSKEQRFYPTRLAQSISIAGG 287
Query: 222 LKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAA 278
K E F++VETN+R+YAYTDS L ALI LF E+ YRFP + V ++RDS++ +
Sbjct: 288 KKASSEDVQEQFILVETNYRIYAYTDSELHYALISLFAEVQYRFPYMIVAQMSRDSIQQS 347
Query: 279 LRSGITAAQIIGFLRLHALPSVSCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVS 335
GI+A QI+ +LR A P +P VV D I LW ERERL ++G+LY QF+
Sbjct: 348 ADYGISAEQILNYLRSSAHPIARKNKHWVPQVVEDNIHLWCKERERLKFNDGLLYHQFLD 407
Query: 336 QADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
Q FE L+ YAQD+ L W N+++R +VV H IK ++ + G
Sbjct: 408 QEAFEMLKSYAQDIRALVWANDERRFMVVAPWSHDQIKSYYKQIKDG 454
>gi|393221241|gb|EJD06726.1| transcription factor Tfb2 [Fomitiporia mediterranea MF3/22]
Length = 464
Score = 284 bits (726), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 166/413 (40%), Positives = 237/413 (57%), Gaps = 40/413 (9%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L+ + + V+ LL++E +P A ++SW+++ K + A TL+ L
Sbjct: 43 LAPMERQLVMNLLWLESSIPTATMSSWINRKGIKLYDSALSTLSRLHIIGNPSQKLNFNP 102
Query: 48 --RRNAKVALLGGGKPWSMSAKLEVDSKAR----DLDFLNQYALERWECILRFMV--GSQ 99
+ + + A+ GGG S VD + + L+ L+ YA+ERWE IL +MV G+
Sbjct: 103 TFKASLRQAITGGGTHRSFGVP-AVDGEEKRSSVTLEMLDAYAVERWETILHYMVSSGTG 161
Query: 100 QTEGISADAV------RTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
Q + V L + N IT GFQFLL AQ+W +L+YL E +
Sbjct: 162 QMPARPSQGVLFLLERSGLMSGNGGNMKITSFGFQFLLHSPHAQLWELLLQYLHMAEERQ 221
Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
+DL+E L FLF L G YSTE + + + L+ LR++GL++Q+K + RFYPT+LA
Sbjct: 222 MDLIEVLGFLFMLSTMELGRGYSTENLGETQKAMLEDLRDYGLIWQKKASSRRFYPTRLA 281
Query: 214 LNMA-------TRGTL-KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
+ T G+ + GF+I+ETN+R+YAYTD+ L+ A++ LF L YRFPNL
Sbjct: 282 TTLTSSLPPLPTSGSGGANAQAQGFIILETNYRIYAYTDNPLQTAVLNLFVTLKYRFPNL 341
Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERERL 322
VG LTRDSVR AL +GITA QII +L HA P + + PL PV V DQIRLWE ER RL
Sbjct: 342 VVGSLTRDSVRKALANGITADQIIKYLTTHAHPQMRKNDPLIPVTVQDQIRLWELERNRL 401
Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ EG LY+ F SQAD+E + YA+ LGV+ W+N KR + +GH++I+ F
Sbjct: 402 KSQEGYLYTAFASQADYEFVLKYAKQLGVVLWENAAKRCFFGSLEGHANIRGF 454
>gi|302684027|ref|XP_003031694.1| hypothetical protein SCHCODRAFT_15746 [Schizophyllum commune H4-8]
gi|300105387|gb|EFI96791.1| hypothetical protein SCHCODRAFT_15746 [Schizophyllum commune H4-8]
Length = 476
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 170/428 (39%), Positives = 237/428 (55%), Gaps = 55/428 (12%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L + + V+ LL++E VP + + +W+ + H K ++ A +TL +L
Sbjct: 42 LGPLERQIVMNLLWLESAVPVSTMGAWVVREHQKIYEAALETLGKLHIVPTSPVKLAVHP 101
Query: 48 --RRNAKVALLGGGKPWSMSAKLEVDSK--ARDLDFLNQYALERWECILRFMV--GSQQT 101
+ + + AL GGG S E D K A D+ L+ YALERWE IL FMV G+ Q
Sbjct: 102 TFKSSLRQALTGGGTQGSFGIPAEPDGKQQAPDVGKLDGYALERWETILHFMVSSGTGQE 161
Query: 102 EGISADAVRTLFNEGA------ENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
A V L E IT GFQFLLL AQ+W ++ YL E+ +D
Sbjct: 162 SAQPAAGVTYLLKESGLMESRGGTLRITSAGFQFLLLTPHAQLWELLIHYLRLQEANNID 221
Query: 156 LVECLTFLFQLKFSTFGT------------DYSTEGMSDKLQVFLQHLREFGLVYQRKRK 203
LVE L+FLF + + FG +YSTE +S V + LR +GL++QRK+
Sbjct: 222 LVEVLSFLFMVSTTEFGQGNLEMLTGDLHQEYSTENLSTSQLVVMDDLRHYGLLWQRKKN 281
Query: 204 AGRFYPTKLALNMATR--------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLF 255
RF PT+LA + + G +E GF+++ETN+RVYAYTD+ L+ A++ LF
Sbjct: 282 YPRFSPTRLATTLTSSAPPLPTSSGPTSGPKE-GFIVLETNYRVYAYTDNALQTAVLNLF 340
Query: 256 CELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS-------VSCPLPPV- 307
YRFPNL VG +TR+SV+ A+ +GITA QII +L HA P V+ PL PV
Sbjct: 341 ITPKYRFPNLVVGSITRESVKRAMSNGITADQIISYLITHAHPQMHKNVCLVANPLLPVT 400
Query: 308 VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQ 367
V DQIRLWE ER R + EG LY++F SQAD+E + +YA+ LGV+ W+N KR ++
Sbjct: 401 VQDQIRLWELERNRTKSQEGFLYTEFASQADYEYVLNYAKQLGVVLWENHAKRCFFAAQE 460
Query: 368 GHSDIKKF 375
GH++IK F
Sbjct: 461 GHANIKSF 468
>gi|195477962|ref|XP_002086438.1| GE22864 [Drosophila yakuba]
gi|194186228|gb|EDW99839.1| GE22864 [Drosophila yakuba]
Length = 241
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 136/240 (56%), Positives = 174/240 (72%), Gaps = 20/240 (8%)
Query: 162 FLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT 221
LFQL FST G DYS+EGM+ ++ FLQHLREFGLV+QRKRK GRFYPT+LALN+ ++
Sbjct: 1 MLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLVFQRKRKEGRFYPTRLALNVTSKEA 60
Query: 222 L---------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
+ ++ G+++VETN+RVYAYTDS L+VA++GLF EL+YRFPNL VGVLTR
Sbjct: 61 AATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTR 120
Query: 273 DSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVSDQIRLWEGERERL 322
DSVR ALR GITA QI+ +L +A P++ SC LPP V DQI+LWE ER R
Sbjct: 121 DSVRQALRGGITAEQIVSYLEQYAHPNMRLVESAIQSKSC-LPPTVVDQIKLWELERNRF 179
Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
T +EGV+Y+QF+S DF LRDYAQ + +L WQNE+ RT+VV K GH D+K++W + K
Sbjct: 180 TYTEGVVYNQFLSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSKS 239
>gi|432098035|gb|ELK27922.1| General transcription factor IIH subunit 4 [Myotis davidii]
Length = 377
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 147/331 (44%), Positives = 205/331 (61%), Gaps = 67/331 (20%)
Query: 49 RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISAD 107
+N + ALLGGGK WS +++L D ARD+ L++YA ERWE +L FM GS
Sbjct: 104 QNLRTALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMAGSPSAA----- 158
Query: 108 AVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLK 167
FLLL+TPAQ+WYF+L++L+T +S+G+DLVE L+FLFQL
Sbjct: 159 ---------------------FLLLDTPAQLWYFMLQHLQTAQSRGMDLVEILSFLFQLS 197
Query: 168 FSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT--RGTLKQI 225
FS+ G DYS EGMSD REFGLV+QRKRK+ R+YPT+LA+++++ G
Sbjct: 198 FSSLGKDYSVEGMSDS-------RREFGLVFQRKRKSRRYYPTRLAISLSSGVSGAGGTA 250
Query: 226 REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITA 285
+PGF++VETN+++YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 251 HQPGFIVVETNYQLYAYTESELQIALIALFSEMLYRFPNMVV------------------ 292
Query: 286 AQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDY 345
+ + LPP ++DQIRLWE ER+RL +EGVLY+QF+SQ DF+ L
Sbjct: 293 -------------AQTPVLPPTITDQIRLWELERDRLLFTEGVLYNQFLSQVDFDLLLAR 339
Query: 346 AQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
A +LG L ++N K+ +VVT GH ++K+FW
Sbjct: 340 AWELGALVFENMAKQLMVVTPAGHREVKRFW 370
>gi|336370852|gb|EGN99192.1| hypothetical protein SERLA73DRAFT_90464 [Serpula lacrymans var.
lacrymans S7.3]
gi|336383609|gb|EGO24758.1| hypothetical protein SERLADRAFT_361686 [Serpula lacrymans var.
lacrymans S7.9]
Length = 469
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 163/418 (38%), Positives = 235/418 (56%), Gaps = 40/418 (9%)
Query: 5 IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL----RRNAKVAL----- 55
+ + ++ LL++E +P A +++W+ + K + A TL L + N K+AL
Sbjct: 52 LERQIIMNLLWLESAIPIATMSAWVIREGKKLYDAALTTLANLHILPQSNVKLALNPTFK 111
Query: 56 -------LGGGKPWSMSAKLEVDSK--ARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
GGG S E D K ++ L+ YALERWE IL +MV S Q +
Sbjct: 112 ASFRQAITGGGNSSSFGVPAEKDDKRSTNTVETLDAYALERWETILHYMVSSGQGSLPTK 171
Query: 107 DAVRTLF----------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
+ L+ N G+ IT GFQFLL Q+W +L+YL E + +DL
Sbjct: 172 PSQGVLYLLQRSGLMTQNHGSAL-QITSAGFQFLLHTPHDQLWDLLLQYLHMAEERQMDL 230
Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM 216
VE L+FLF L G +YSTEG+S+ + L+ LR++GLV+QRK + RF PT+LA +
Sbjct: 231 VEVLSFLFMLSTMDLGREYSTEGLSETQKAMLEDLRDYGLVWQRKATSKRFSPTRLATTL 290
Query: 217 ATR--------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
+ GT + GF+++ETN+R+YAYTD L+ A++ LF + YRFPNL VG
Sbjct: 291 TSSCPPLPTSTGTSGGPQSQGFIVLETNYRIYAYTDKPLQTAVLNLFITMKYRFPNLVVG 350
Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERERLTTS 325
+LTR+SV+ AL +GI+A QII +L HA P + + PL PV V DQIRLWE ER RL +
Sbjct: 351 MLTRESVKKALSNGISAEQIISYLTTHAHPQMRKNNPLLPVTVQDQIRLWELERNRLKSE 410
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
EG LY F S AD+E + +YA+ L V+ W+N +R + GH++I+ F G+
Sbjct: 411 EGYLYKDFGSHADYEYVLNYAKQLDVVLWENTSRRCFFGSLDGHTNIRGFIERRTNGA 468
>gi|328766914|gb|EGF76966.1| hypothetical protein BATDEDRAFT_92242 [Batrachochytrium
dendrobatidis JAM81]
Length = 471
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 157/419 (37%), Positives = 226/419 (53%), Gaps = 45/419 (10%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L ++AKH ++RLL+ V + I +W + ++ + A L +L
Sbjct: 42 LPDLAKHTILRLLYTSMTVKISDIETWETHEYSGQFSHAVTGLAKLHILSIKNEGSHKSV 101
Query: 48 ------RRNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGSQQ 100
+++ AL+G G S E K R + FL YA E WE +L ++VG+
Sbjct: 102 HINSVFQKSIHNALVGSGSHTSFGMPCETIDKHRPSIAFLETYAKECWESVLHYLVGTPS 161
Query: 101 TEGISADAVRTLFNEGAENPV------------ITKDGFQFLLLETPAQVWYFILKYLET 148
+ A V+ L G P+ IT GFQFLL + Q+W F+L+YLE
Sbjct: 162 DKRPKA-IVKLLEKSGLMAPISSTNAAHNGDLRITSKGFQFLLQDVNVQIWAFLLQYLEM 220
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
E ++LVE L F FQL G DYS E ++ + L L+ GL+YQRK+K+ RFY
Sbjct: 221 AEQLNMELVEVLNFFFQLGSLELGQDYSVEVLTPTQKHMLDDLKHLGLIYQRKKKSTRFY 280
Query: 209 PTKLALNMATRGTLKQIREP--------GFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
PT LA ++ + + + P GF+I+ETN+RVYAYTDS L++A++ LF E+
Sbjct: 281 PTHLATSLTSGASAGAVLSPRSLEADSDGFIIIETNYRVYAYTDSPLQIAVLSLFIEMRV 340
Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEG 317
RF N+ +G++TRDSVR AL GI+A QII +L HA P + S LP + DQ+RLWE
Sbjct: 341 RFANMVIGIITRDSVREALAKGISAEQIIAYLTTHAHPEMKKGSPVLPTTIVDQVRLWEM 400
Query: 318 ERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
ER RL S G LY F + ++ ++ YA DLG W ++ KR VVV+ +GH IK F+
Sbjct: 401 ERNRLRISRGHLYQMFSGEQEYREILKYATDLGYELWHSDSKRLVVVSAEGHEHIKVFF 459
>gi|213402795|ref|XP_002172170.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces japonicus yFS275]
gi|212000217|gb|EEB05877.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces japonicus yFS275]
Length = 444
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 165/406 (40%), Positives = 228/406 (56%), Gaps = 34/406 (8%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
L IA+ YV+ +LF PV W + E+ + L +
Sbjct: 36 LPIIARQYVMAMLFNPSPVALNDFDLWTKLSSKVYQTESFNKLVRMHIFQVDGQNVILNS 95
Query: 46 ELRRNAKVALLGGGKPWSMSAK-LEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGI 104
E R+ AL GGG S ++ D + D++FL++YA + WE IL FMVG+ + I
Sbjct: 96 EFRQQFITALTGGGTHNSFGVPCIDEDRQRVDIEFLDKYATDTWETILHFMVGTS-VKKI 154
Query: 105 SADAVRTLFNEGA-----ENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVE 158
+ V +L G +N V IT GFQFLL + Q+W +L+YL+ E +D V+
Sbjct: 155 PGEGVLSLLRRGGLMTGTKNQVRITHSGFQFLLQDINTQIWTLLLEYLKLSEDTHMDPVQ 214
Query: 159 CLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT 218
L FLF L G DYS ++D Q+ L+ LRE+GLVYQ+K + RFYPT+LA ++ T
Sbjct: 215 VLHFLFMLGSLELGRDYSVHFLTDTQQIMLEDLREYGLVYQKKSSSRRFYPTRLATSLTT 274
Query: 219 ------RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
+G + GF+IVETN+R+YAYTDS L++A++GLF L RF NL VGV+TR
Sbjct: 275 EYHSPVKGAGSDAGK-GFIIVETNYRLYAYTDSPLQIAILGLFTNLRARFSNLIVGVITR 333
Query: 273 DSVRAALRSGITAAQIIGFLRLHALPSV--SCP-LPPVVSDQIRLWEGERERLTTSEGVL 329
DS+R AL+SGITA QII +L HA P + P LPP + DQI LWE ER RL + G+L
Sbjct: 334 DSIRRALQSGITAEQIITYLTTHAHPQMRKEIPLLPPTLVDQIYLWELERNRLRATPGIL 393
Query: 330 YSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ F++ DFEK YA++LGVL W + KR +T G + F
Sbjct: 394 FRDFLTDKDFEKAVQYAKELGVLVWDSSLKRMFFITNAGAQPMINF 439
>gi|388857860|emb|CCF48525.1| probable TFB2-TFIIH subunit (transcription/repair factor) [Ustilago
hordei]
Length = 492
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 158/416 (37%), Positives = 230/416 (55%), Gaps = 42/416 (10%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +A+ ++ +LF++ P+ +W+ K K+ A D L+ L
Sbjct: 71 LPMMARQLIMHMLFLDVPLAADDFIAWIKKEARKDFDAAVDKLSRLSIVQLKSTGGKQIL 130
Query: 49 -------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQQ 100
+ AL GGGK S + D A D+ FL+QYA +WE IL +MVGS+
Sbjct: 131 LLNAVFTDGMRRALTGGGKHRSFGVPCDTEDKNAVDVAFLDQYARNKWETILHYMVGSEN 190
Query: 101 TEGISADAVRTLFNEGAENP-----------VITKDGFQFLLLETPAQVWYFILKYLETV 149
+ + L P IT GFQFLL + Q+W +L+YL+
Sbjct: 191 SSTPREPVLYLLRRSNLMQPRPSSGSGSGGLNITSRGFQFLLEDVNTQLWDLLLQYLDMA 250
Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYP 209
E + +DLVE L FLF L G DYSTE + + L+ R++GLVYQRK + RFYP
Sbjct: 251 EERNMDLVEVLAFLFMLGSLELGRDYSTEELPETQLHMLEDFRDYGLVYQRKASSRRFYP 310
Query: 210 TKLALNMATRGTLKQI-------REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRF 262
T+LA + + + + E G++I+ETN+R+YAYT + L+VA++ LF + RF
Sbjct: 311 TRLATTLTSSAAIPLLSSNGAEQEERGYIILETNYRLYAYTSNPLRVAVLSLFVTIKARF 370
Query: 263 PNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGER 319
PNL VG +TRDSV++AL +GITA QII +L HA + + PL PV VSDQIRLWE E+
Sbjct: 371 PNLVVGSITRDSVKSALANGITAEQIITYLTHHAHVQMHRNDPLLPVTVSDQIRLWEREK 430
Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
R+ + G L++ F SQ DFE++R+YA+ LGVL W++E KR V + G+ ++ +
Sbjct: 431 NRVVQNLGSLFTDFTSQFDFEEVRNYAKQLGVLVWEDESKRRFFVDEAGNEPVRDY 486
>gi|71024427|ref|XP_762443.1| hypothetical protein UM06296.1 [Ustilago maydis 521]
gi|46097692|gb|EAK82925.1| hypothetical protein UM06296.1 [Ustilago maydis 521]
Length = 491
Score = 275 bits (702), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 159/415 (38%), Positives = 229/415 (55%), Gaps = 41/415 (9%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +A+ ++ +LF++ P+ +W+ K E A D L+ L
Sbjct: 71 LPMMARQLIMHMLFLDVPLAADDFLAWIKKEVKNEFDAAVDKLSRLSIIQLKASAGKQML 130
Query: 49 -------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQQ 100
+ AL GGGK S + D A D+ FL+QYA +WE IL +MVGS
Sbjct: 131 LLNAGFTEGMRRALTGGGKHRSFGVPCDTEDKNAVDVAFLDQYARTKWETILHYMVGSDN 190
Query: 101 TEGISADAVRTLFNEGAENP----------VITKDGFQFLLLETPAQVWYFILKYLETVE 150
+ + L P IT GFQFLL + Q+W +L+YL+ E
Sbjct: 191 SSTPREPVLYLLRRSNLMQPRTASSSNGGLNITSRGFQFLLEDVNTQLWDLLLQYLDMAE 250
Query: 151 SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT 210
+ +DLVE L FLF L G DYSTE + + L+ R++GLVYQRK + RFYPT
Sbjct: 251 ERNMDLVEVLAFLFMLGSLELGRDYSTEELPETQLHMLEDFRDYGLVYQRKASSRRFYPT 310
Query: 211 KLALNMATRGTLKQI-------REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFP 263
+LA + + + + E G++I+ETN+R+YAYT + L+VA++ LF + RFP
Sbjct: 311 RLATTLTSSAAVPLLSSNGTEQEERGYIILETNYRLYAYTSNPLRVAVLSLFVTIKARFP 370
Query: 264 NLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERE 320
NL VG +TRDSV++AL +GITA QII +L HA + + PL PV VSDQIRLWE E+
Sbjct: 371 NLVVGSITRDSVKSALANGITAEQIITYLTHHAHLQMHRNDPLLPVTVSDQIRLWEREKN 430
Query: 321 RLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
R+ + G L++ F SQ DFE++R+YA+ LGVL WQ+E+KR V + G+ ++ +
Sbjct: 431 RVQQNLGSLFTDFTSQFDFEEVRNYAKQLGVLVWQDEEKRRFFVDEAGNEPVRDY 485
>gi|390594544|gb|EIN03954.1| transcription factor Tfb2 [Punctularia strigosozonata HHB-11173
SS5]
Length = 467
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 162/416 (38%), Positives = 234/416 (56%), Gaps = 47/416 (11%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKT-------HA------KEHKEATDTLTELR 48
L + + V+ LL++E VP + W+ HA + H + +T L+
Sbjct: 48 LRPLDQQLVMNLLWLESAVPAQTMQQWIVPGGNNRKLYHASLEILSRLHILSGNTQVRLQ 107
Query: 49 RNA------KVALLGG---------GKPWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
N+ + A+ GG G P AK +D D L+ +ALERWE IL
Sbjct: 108 LNSTWKASLRAAITGGTVATSSGSFGIPAERGAKDSMDK-----DGLDAFALERWETILH 162
Query: 94 FMVGSQQTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYL 146
FMV S + LF G NP IT GFQFLL E AQ+W +L+YL
Sbjct: 163 FMVSSGTAHNPPRPSAGVLFLLQRSGLMGGGGNPQITSAGFQFLLHEPHAQLWELLLQYL 222
Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGR 206
E + +DLVE L+F+F L + G +YSTE +SD + L+ LR++GL++Q+K + R
Sbjct: 223 RMAEERQMDLVEVLSFIFMLSTTELGREYSTENLSDTQKAMLEDLRDYGLIWQQKPTSKR 282
Query: 207 FYPTKLALNMATRG----TLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRF 262
F PT+LA + + T GF+++ETN+R+YAYTD+ L+ A++ LF L RF
Sbjct: 283 FSPTRLATTLTSSSPPLPTSGSGVSEGFIVLETNYRLYAYTDNPLQTAVLALFTTLRSRF 342
Query: 263 PNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGER 319
PNL VG +TR+SV+ AL+SGI+A QII +L HA P + + PL PV V DQIRLWE E+
Sbjct: 343 PNLVVGQITRESVKRALQSGISAEQIISYLSTHAHPQMRKNNPLIPVTVQDQIRLWELEK 402
Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
RL + EG LY++F SQ D+E + +YA++L V+ W+N KR + +GH++I+ F
Sbjct: 403 NRLKSQEGFLYTEFASQGDYEYVLNYARELNVVLWENPVKRCFFGSMEGHANIRGF 458
>gi|343428936|emb|CBQ72481.1| probable TFB2-TFIIH subunit (transcription/repair factor)
[Sporisorium reilianum SRZ2]
Length = 493
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 157/417 (37%), Positives = 229/417 (54%), Gaps = 43/417 (10%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +A+ ++ +LF++ P+ +W+ K K+ A D L+ L
Sbjct: 71 LPMMARQLIMHMLFLDVPLAADDFMAWIKKEARKDFDAAVDKLSRLSIVQLKTTAAKQIL 130
Query: 49 -------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQQ 100
+ AL GGGK S + D A D+ FL++YA +WE IL +MVGS
Sbjct: 131 LLNAVFTDGMRRALTGGGKHRSFGVPCDTEDKNAVDVAFLDEYARTKWETILHYMVGSDN 190
Query: 101 TEGISADAVRTLFNEGAENP------------VITKDGFQFLLLETPAQVWYFILKYLET 148
+ + L P IT GFQFLL + Q+W +L+YL+
Sbjct: 191 SSTPREPVLYLLRRSNLMQPRSTSTSSSSGGLNITSRGFQFLLEDVNTQLWDLLLQYLDM 250
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
E + +DLVE L FLF L G DYSTE + + L+ R++GLVYQRK + RFY
Sbjct: 251 AEERNMDLVEVLAFLFMLGSLELGRDYSTEELPETQLHMLEDFRDYGLVYQRKASSRRFY 310
Query: 209 PTKLALNMATRGTLKQI-------REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYR 261
PT+LA + + + + E G++I+ETN+R+YAYT + L+VA++ LF + R
Sbjct: 311 PTRLATTLTSSAAVPLLSSNGTEQEERGYIILETNYRLYAYTSNRLRVAVLSLFVTIKAR 370
Query: 262 FPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGE 318
FPNL VG +TRDSV++AL +GITA QII +L HA + + PL PV VSDQIRLWE E
Sbjct: 371 FPNLVVGSITRDSVKSALANGITAEQIITYLTHHAHVQMHRNDPLLPVTVSDQIRLWERE 430
Query: 319 RERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ R+ + G L++ F SQ DFE++R+YA+ LGVL W++E KR V + G+ ++ +
Sbjct: 431 KNRVQQNLGSLFTDFTSQFDFEEVRNYAKQLGVLVWEDEGKRRFFVDEAGNVPVRDY 487
>gi|392594388|gb|EIW83712.1| transcription factor Tfb2 [Coniophora puteana RWD-64-598 SS2]
Length = 486
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 161/428 (37%), Positives = 239/428 (55%), Gaps = 54/428 (12%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR----RNAKVAL-- 55
L+ +++ V+ LL+++ PV +A+W+ K + EA +TLT L AK+AL
Sbjct: 50 LAPLSRQLVMNLLWLDSPVLAGTMAAWVVPDGKKLYDEALNTLTRLHILQSSGAKLALNP 109
Query: 56 ------------LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ--- 100
G + + A+ + A ++ L+ YA++RWE IL +MV S Q
Sbjct: 110 TFKASMRQAITCSGTTGSFGVPAQRDDRRDAPSVETLDGYAVQRWETILHYMVSSGQGQY 169
Query: 101 ----TEGISADAVRT--LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
T+G+ R+ + + IT GFQFLL Q+W +L+YL VE + +
Sbjct: 170 PTKPTQGVLYLLQRSGLMASYHGSTLQITSSGFQFLLYSPHDQLWDLLLQYLHMVEERQM 229
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
DLVE L+F F L G +YSTE +S+ + L+ LR++GLV+QRK + RF PT+LA
Sbjct: 230 DLVEVLSFFFMLSTMELGREYSTEPLSETQKAMLEDLRDYGLVWQRKASSKRFSPTRLAT 289
Query: 215 NMATRG----TLKQIREPG--------------------FLIVETNFRVYAYTDSNLKVA 250
+ + T R PG F+++ETN+RVYAYTD+ L++A
Sbjct: 290 TLTSVSPSLPTAGGSRNPGSAPGASSVANSTGINTNDHGFIVLETNYRVYAYTDNPLQIA 349
Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV- 307
++ LF L YRFPNL VG+LTR+SVR AL +GI+A Q+I +L HA P + + PL PV
Sbjct: 350 VLNLFITLKYRFPNLVVGMLTRESVRRALGNGISAEQVISYLTAHAHPQMRKNNPLLPVT 409
Query: 308 VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQ 367
V DQ+RLWE ER RL + +G LY F +QAD+E + YA++L V+ W+N KR T
Sbjct: 410 VQDQVRLWELERNRLKSDDGFLYKDFATQADYEYVLAYAKELDVVLWENTSKRCFFGTLD 469
Query: 368 GHSDIKKF 375
GH++++ F
Sbjct: 470 GHTNVRGF 477
>gi|170120252|ref|XP_001891159.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164633568|gb|EDQ98188.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 475
Score = 267 bits (683), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 160/417 (38%), Positives = 239/417 (57%), Gaps = 43/417 (10%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L I++ ++ LL++E +P + + +W+++ + EA TLT+L
Sbjct: 51 LDPISRQLIMNLLWLESAIPASTMQAWVTREGKPVYTEALATLTKLHIFPNSAIKLAVVP 110
Query: 48 --RRNAKVALLGGGKPWSMSAKLEVDSKARDLD--FLNQYALERWECILRFMVGSQQTEG 103
+ + + A+ GGG S E D K D+D FL+ +ALERWE IL +MV S +
Sbjct: 111 TFKTSFRQAITGGGTSGSFGVPSEKDDKRSDVDIEFLDVFALERWETILHYMVSSGSGQN 170
Query: 104 ISADAVRTLF--------NEGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
+ +VR LF + P+ IT GFQFLL Q+W +L+YL E + +
Sbjct: 171 PTKPSVRVLFLLQRSGLMTTVGQGPLQITSAGFQFLLHSPHEQLWELLLQYLHLTEERQM 230
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
DLV+ L FLF L G +YSTE +S L+ L ++GL+++RK + F+PT+LA
Sbjct: 231 DLVDVLGFLFMLSTMELGREYSTERLSKTQTAMLEDLVDYGLIWRRKPSSKGFHPTRLAT 290
Query: 215 NMATRGTL-------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTV 267
+ + + + GF+++ETN+R+YAYTD+ L+ A++ LF L YRFPNL V
Sbjct: 291 TLTSSSPPLPSSIGNRSGPQEGFIVLETNYRIYAYTDNPLQTAILNLFVSLKYRFPNLVV 350
Query: 268 GVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------SC-PLPPV-VSDQIRLWEGE 318
G +TRDSV+ AL +GI+A QII +L HA P + SC PL PV V DQIRLWE E
Sbjct: 351 GAITRDSVKKALLNGISADQIISYLMSHAHPQMRKNVRLRSCNPLLPVTVQDQIRLWELE 410
Query: 319 RERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ RL + EG LY+ F SQAD++ + +YA++LGV+ W+N KR + GH++I+ F
Sbjct: 411 KNRLKSQEGYLYTSFASQADYDFVLNYAKELGVVLWENPVKRCFFGSLDGHANIRGF 467
>gi|392564310|gb|EIW57488.1| transcription factor Tfb2 [Trametes versicolor FP-101664 SS1]
Length = 477
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 157/414 (37%), Positives = 236/414 (57%), Gaps = 42/414 (10%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L + + V+ LL+++ P+P + +A+W+++ K +++A TL L
Sbjct: 56 LRPLERQLVMNLLWLDLPIPPSTMAAWVTREGKKPYQDALSTLARLHILPPSTSKHSLNP 115
Query: 48 --RRNAKVALLGGGKPWSMSAKLEVDSK--ARDLDFLNQYALERWECILRFMVGSQQTEG 103
+ + A+ GGG S +E D K ++D L+ YALERWE IL +MV S E
Sbjct: 116 TFKSGLRQAITGGGTSGSFGVLIENDDKNPPPEIDALDGYALERWETILYYMVSSGTGEF 175
Query: 104 ISADAVRTLF-----------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
S + L+ + GA IT GFQFLL Q+W +L+YL+ E +
Sbjct: 176 PSQPSKGVLYLLERSGLMARVHSGALQ--ITSGGFQFLLHPPHVQLWELLLQYLQMAEER 233
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
+DLVE ++FL L + G +YSTE +S + L LR++GL+ QR + RF PT+L
Sbjct: 234 QMDLVEVISFLLMLSTTELGKNYSTENLSPTQKTMLDDLRDYGLIKQRTPTSRRFSPTRL 293
Query: 213 ALNMATR--------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
A + + G+ GF+++ETN+R+YAYTD+ L++A++ LF L RFPN
Sbjct: 294 ATTLTSSSPPLPTSAGSGDGSHAQGFIVLETNYRLYAYTDNPLQIAVLNLFVTLKSRFPN 353
Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERER 321
L +G +TRDSV+ AL SGITA QII +L HA P + + PL PV V DQIRLWE E+ R
Sbjct: 354 LVIGAITRDSVKKALASGITADQIISYLVTHAHPQMRKNQPLLPVTVQDQIRLWELEKNR 413
Query: 322 LTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ + EG LY+ F SQAD+E + +YA+ L V+ W+N +R + +GH++I+ F
Sbjct: 414 MKSQEGYLYTAFASQADYEYVLNYAKQLDVVLWENASRRCFFGSVEGHANIRGF 467
>gi|19113355|ref|NP_596563.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces pombe 972h-]
gi|62901133|sp|P87303.2|TFB2_SCHPO RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|3080538|emb|CAA18666.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces pombe]
Length = 447
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 157/398 (39%), Positives = 222/398 (55%), Gaps = 31/398 (7%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
L +A+ YV+ +LF PV + W+ + E+ + L +
Sbjct: 36 LPILARQYVMSMLFNPMPVALSDFDLWIKLSSKVYQSESFNKLVRMHIFQFDGQYITLNS 95
Query: 46 ELRRNAKVALLGGGKPWSMSAKL-EVDSKARDLDFLNQYALERWECILRFMVGSQQT--- 101
E R+ AL GGG S + D +DFL+ YA E WE IL FMVG+ +
Sbjct: 96 EFRKQFITALTGGGNHNSFGVPCTDEDKHLVTVDFLDAYAKETWETILHFMVGTPEAKFP 155
Query: 102 -EGISADAVRTLFNEGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
EG+ + R G +N + IT+ GFQFLL + AQ+W +L YL+ E +D V+
Sbjct: 156 GEGVLSLLKRGGLMSGPKNQLRITRAGFQFLLQDINAQIWTLLLDYLKLSEDTHMDPVQV 215
Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR 219
L FLF L G YS + ++D Q+ L+ LRE+GLVYQRK + RFYPT+LA + T
Sbjct: 216 LHFLFMLGSLDLGRAYSVDFLTDTQQIMLEDLREYGLVYQRKITSKRFYPTRLATGLTTD 275
Query: 220 ------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
+ + GF+IVETN+R+YAYT S L++A+IGLF L RF NL VGV+TRD
Sbjct: 276 YRSLHGKQSENDDDKGFIIVETNYRLYAYTSSPLQIAIIGLFANLRARFSNLVVGVITRD 335
Query: 274 SVRAALRSGITAAQIIGFLRLHALPSV--SCP-LPPVVSDQIRLWEGERERLTTSEGVLY 330
S+R AL +GI A QII +L HA P + + P LPP + DQI LWE E+ RL + G+L+
Sbjct: 336 SIRRALMNGIAAEQIITYLTTHAHPQMRSNVPLLPPTLVDQIYLWELEKNRLRATPGILF 395
Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
F++ + F++ +YA++LGVL W + KR +T G
Sbjct: 396 RDFLTDSGFDQAVEYAKELGVLVWDSSLKRMFFITTTG 433
>gi|443894209|dbj|GAC71558.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH, subunit TFB2 [Pseudozyma antarctica
T-34]
Length = 491
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 153/414 (36%), Positives = 225/414 (54%), Gaps = 40/414 (9%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +A+ ++ +LF+ P+ +WL K E A L L
Sbjct: 72 LPMLARQLIMHMLFLHAPLAADDFLAWLRKDVRNEFDVAVAKLARLSIVQLKPASAKQIL 131
Query: 49 -------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQQ 100
+ AL GGG S + D A D+ FL+ YA +WE IL +MVGS +
Sbjct: 132 LLNAVFTEGMRRALTGGGSHRSFGVPCDTEDKNAVDIAFLDDYARTKWETILHYMVGSDK 191
Query: 101 TEGIS---------ADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ ++ +++ + IT GFQFLL + Q+W+ +L+YL+ E
Sbjct: 192 SSTPREPVLYLLRRSNLMQSRSAAATASLNITSRGFQFLLEDVNTQLWHLLLQYLDMAEE 251
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+ +DLVE L FLF L G DYSTE + + L+ R++GLVYQRK + RFYPT+
Sbjct: 252 RNMDLVEVLAFLFMLGSLELGRDYSTEELPETQLHMLEDFRDYGLVYQRKASSRRFYPTR 311
Query: 212 LALNMATRGTL-------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
LA + + + E G++I+ETN+R+YAYT + L+VA++ LF + RFPN
Sbjct: 312 LATTLTSSAAAPLLSTNGSEPEERGYIILETNYRLYAYTSNPLRVAVLSLFVTIKARFPN 371
Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERER 321
L VG +TRDSV++AL +GITA QII +L HA + + PL PV VSDQIRLWE E+ R
Sbjct: 372 LLVGSITRDSVKSALANGITAEQIITYLTHHAHLQMHRNDPLLPVTVSDQIRLWEREKNR 431
Query: 322 LTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ + G L++ F SQ DF ++R+YA LGVL W +E KR + V + G+ ++ +
Sbjct: 432 VQQNLGSLFTDFTSQPDFAEVRNYASQLGVLVWHDEPKRRLFVDEAGNEPVRDY 485
>gi|393246879|gb|EJD54387.1| transcription factor Tfb2 [Auricularia delicata TFB-10046 SS5]
Length = 480
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 157/427 (36%), Positives = 233/427 (54%), Gaps = 57/427 (13%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
LS + + V+ LL+++ + A + W+++ + +K + L +L
Sbjct: 45 LSPVERQIVMTLLWLDNNIENATLTPWVTRDGLEIYKASVMELCKLHIVPATSSTPNLLP 104
Query: 48 ---------RRNAKVALLGGG------KPWSMSAKLEVDSKARDLDFLNQYALERWECIL 92
+ N ++AL+GGG KP SA+L + + L+ YA+ERWE IL
Sbjct: 105 TTLDLNGVFKANLRLALVGGGDHNSFGKPVKRSAELPPVT----VSALDAYAVERWETIL 160
Query: 93 RFMV--GSQQTEGISADAVRTLFNEGA----------ENPVITKDGFQFLLLETPAQVWY 140
FMV G+ Q+ + AV L + + IT GFQFLL AQ+W
Sbjct: 161 HFMVSSGTDQSPATPSGAVCNLLRKSGLMIRLDPRSDSSMKITSRGFQFLLSSPHAQLWE 220
Query: 141 FILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQR 200
+L YLE E +GL L+E ++FLF L G +YST+ ++ L L ++GL+YQR
Sbjct: 221 LLLHYLELAEERGLGLMEVVSFLFMLSTMELGQEYSTDNLTKDQATVLGELLDYGLIYQR 280
Query: 201 KRK--AGRFYPTKLALNMATR-------GTLKQIREPGFLIVETNFRVYAYTDSNLKVAL 251
+ RF+PT+LA + + + GF+I+ETN+R+YAYTD+ L++A+
Sbjct: 281 ALPGMSKRFFPTRLATTLMSSLPELPRTAGVASATSGGFIILETNYRLYAYTDNPLQIAV 340
Query: 252 IGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-V 308
+ LF RFPNL VG++TRDSV+ AL +GITA QII +L HA P + + PL PV V
Sbjct: 341 LNLFVSFKSRFPNLVVGMVTRDSVKKALANGITAEQIITYLSAHAHPQMRKNNPLLPVTV 400
Query: 309 SDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
DQ+RLWE E+ R+ EG LY F SQADFE + DYA+ LGV+ W+N ++R G
Sbjct: 401 QDQVRLWELEKNRVKAEEGFLYMDFTSQADFELVLDYARKLGVVIWENGRQRMFFGKADG 460
Query: 369 HSDIKKF 375
H++I+ F
Sbjct: 461 HNNIRTF 467
>gi|389749775|gb|EIM90946.1| transcription factor Tfb2 [Stereum hirsutum FP-91666 SS1]
Length = 494
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 166/425 (39%), Positives = 234/425 (55%), Gaps = 51/425 (12%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR-RNAKVALLGGGK 60
L+ + + V+ LL++E +P ++SW+ K + K EA LT L A LG
Sbjct: 49 LAPVERQLVMNLLWLESAIPTNTMSSWVVKENKKFFDEALAQLTRLHILPNPSAKLGLNP 108
Query: 61 PWSMSAK-----------LEVDSKARD------LDFLNQYALERWECILRFMVGS----- 98
+ S + V ++A D ++ L+ YALERWE IL FMV S
Sbjct: 109 TFKTSLRHAITGGGTSGSFGVPAEADDKRGPPSVEILDTYALERWETILHFMVSSGTGQM 168
Query: 99 --QQTEGISADAVRTLFNEGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
+ ++G+ R+ + + IT GFQFLL AQ+W +L+YL VE + +D
Sbjct: 169 PTKPSQGVLYLLQRSKLMAPTQGTLQITSSGFQFLLHTPHAQLWELLLQYLAMVEERQMD 228
Query: 156 LVECLTFLFQLKFSTFGT-----DYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT 210
LVE L+FLF L G DYS E +SD + L+ LR++GL++QRK + RF PT
Sbjct: 229 LVEVLSFLFMLSTMELGRVRQFHDYSAESLSDTQKAMLEDLRDYGLIWQRKATSRRFSPT 288
Query: 211 KLALNMATR----------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
+LA + + GT + GF+I+ETN+R+YAYTD+ L+ A++ LF +
Sbjct: 289 RLATTLTSSLPPLPTASGTGTSGTGQTQGFIILETNYRLYAYTDNPLQTAVLNLFVTMKS 348
Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVS---------CPLPPV-VSD 310
RFPNL VG +TRDSVR AL +GITA QII +L +A P + PL PV V D
Sbjct: 349 RFPNLVVGSITRDSVRKALSNGITADQIISYLTTYAHPQMRKNVSRCRSFTPLLPVTVQD 408
Query: 311 QIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHS 370
QIRLWE ER RL + EG LY+ F SQAD+E + YA+ L V+ W+N KR T +GH+
Sbjct: 409 QIRLWELERNRLKSQEGFLYTAFASQADYEYVLTYAKQLDVVLWENPTKRCFFGTMEGHA 468
Query: 371 DIKKF 375
+I+ F
Sbjct: 469 NIRGF 473
>gi|409077631|gb|EKM77996.1| hypothetical protein AGABI1DRAFT_114851 [Agaricus bisporus var.
burnettii JB137-S8]
gi|426198975|gb|EKV48900.1| hypothetical protein AGABI2DRAFT_191077 [Agaricus bisporus var.
bisporus H97]
Length = 467
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 161/410 (39%), Positives = 234/410 (57%), Gaps = 36/410 (8%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L+ + + V+ LL++E + + + +W+ + K ++EA L+ L
Sbjct: 48 LAPLERQIVMNLLWLESAIATSTMVAWVQRDGRKLYEEALGNLSRLHILPNSAAKLALNF 107
Query: 48 --RRNAKVALLGGGKPWSMSAKLEVDSKA-RDLDFLNQYALERWECILRFMVGS---QQT 101
+ + + AL GGG S E D + + L+ +ALERWE IL +MV S QQ
Sbjct: 108 TFKTSLRQALTGGGTSGSFGVPAEKDEHSGPSIKTLDSFALERWETILHYMVSSGTGQQP 167
Query: 102 EGISADAV-----RTLFNEGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
S + +L + P+ IT GFQFLL AQ+W +L+YL + + +D
Sbjct: 168 AKPSQGVLFLLQRSSLMSSFHGGPLQITSSGFQFLLHSPHAQLWDLLLQYLHLAQERQMD 227
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
LVE L FLF L G +Y TE +S V L+ LR++GL++Q K K+ RF PT+L
Sbjct: 228 LVEVLGFLFMLSTMELGREYLTENLSTTQGVLLEDLRDYGLIWQSKPKSRRFSPTRLGTT 287
Query: 216 MATRG-----TLKQIREP--GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
+ + T+ P GF+++ETN+R+YAYTD+ L+ A++ LF L YRFPNL VG
Sbjct: 288 LTSSSPPLPTTISASSGPLDGFIVLETNYRIYAYTDNPLQTAVLNLFVSLKYRFPNLVVG 347
Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERERLTTS 325
++TRDSVR AL +GITA QII +L HA P + + PL PV V DQIRLWE E+ RL +S
Sbjct: 348 MITRDSVRRALMNGITADQIISYLITHAHPQMRKNNPLLPVTVQDQIRLWELEKNRLKSS 407
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
EG LY+ F SQAD++ + +YA+ L V+ W+N KR + +GH IK+F
Sbjct: 408 EGYLYTAFTSQADYDLVLNYAKQLDVVLWENAAKRCFFGSLEGHLRIKEF 457
>gi|32487401|emb|CAE05735.1| OSJNBb0017I01.15 [Oryza sativa Japonica Group]
Length = 451
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 151/413 (36%), Positives = 232/413 (56%), Gaps = 45/413 (10%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK Y +++L+V PV A + W+ +A +H+ A D L +LR
Sbjct: 40 LPPLAKKYALQMLYVSAPVAAAAMEEWVLDEYAAKHRVAIDRLLQLRVFVEVRDRRKEVS 99
Query: 49 --------RNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILRF 94
N + L+ GG P S++A+L L L +ALE+WEC L
Sbjct: 100 YKMNQKFQGNMQKYLVDGGSLPREPIPSSVTARLPT------LAELESFALEQWECFLLQ 153
Query: 95 MVGSQQTE---GISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYL 146
++ S Q E S+ +RT L + E P +T++GFQFLL+ET AQ+WY + +Y+
Sbjct: 154 LINSSQVERGTSFSSSMMRTFQRGLLSSRDGEAPRLTENGFQFLLMETNAQLWYIMREYI 213
Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAG 205
+ E +G+D E ++FL +L F T G YS ++D + ++ L E GLV Q+ RK
Sbjct: 214 SSAEERGVDPTELISFLLELSFHTLGEAYSLNTLTDVQRNAIRDLAELGLVKLQQGRKDS 273
Query: 206 RFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
F PTKLA N++ + + GF++VETNFR+YAY+ S L ++ LF + Y+ PNL
Sbjct: 274 WFIPTKLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSRLHCEILRLFSRVEYQLPNL 333
Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERL 322
VG +T++S+ A +GITA QII FL+ +A P V+ +P V V+DQIRLWE +R R+
Sbjct: 334 IVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIPAVPENVTDQIRLWETDRNRV 393
Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ LY F S+ F++ DYA+D G L W++ KK ++V + HS++++F
Sbjct: 394 DMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRLIVRVEFHSEMREF 446
>gi|409048320|gb|EKM57798.1| hypothetical protein PHACADRAFT_171015 [Phanerochaete carnosa
HHB-10118-sp]
Length = 472
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 150/413 (36%), Positives = 226/413 (54%), Gaps = 39/413 (9%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKP 61
L + + V+ LL++E +P + +A+W+ K ++ + L+ L + A+ P
Sbjct: 51 LGPLERQIVMNLLWLESSIPASTMATWVPHEVRKTYENSLSILSRLHILPQSAVKLALNP 110
Query: 62 ------------------WSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEG 103
+ + A + + ++ L+ YALERWE IL +MV S
Sbjct: 111 TFKTSLRHAIIGGGSSGSFGVPATRDEKHSSPSIEVLDAYALERWETILHYMVSSGIATR 170
Query: 104 ISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLV 157
S + L G + IT GFQFLL AQ+W +L+YL VE + +DLV
Sbjct: 171 PSQGVLFLLQRSGLMASIHGGSLQITSLGFQFLLHSPHAQLWELLLQYLHMVEERQMDLV 230
Query: 158 ECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM- 216
E L+FLF L G +YS E +S L+ LR++G+++QR+ + RF PT+LA +
Sbjct: 231 EVLSFLFMLSTMELGREYSVEHLSPTQTAMLEDLRDYGIIWQRRATSKRFCPTRLATTLT 290
Query: 217 -------ATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
A G GF+I+ETN+R+YAYTD+ L++A++ LF L R+PNL VG
Sbjct: 291 SSSPPLPAAGGVSASAHGQGFIILETNYRLYAYTDNPLQIAVLNLFVTLKSRYPNLVVGA 350
Query: 270 LTRDSVRAALRSGITA----AQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERERL 322
+TR+SV+ AL +GITA QII +L HA P + + PL PV V DQIRLWE E+ R+
Sbjct: 351 ITRESVKKALTNGITADQASRQIISYLTTHAHPQMRKNKPLLPVTVQDQIRLWELEKNRM 410
Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ EG LY+ F SQAD+E + +YA+ LGV+ W++ +R T +GH +I+ F
Sbjct: 411 KSQEGYLYTAFASQADYEYVLNYAKQLGVVLWESSGRRCFFGTLEGHPNIRGF 463
>gi|395332344|gb|EJF64723.1| transcription factor Tfb2 [Dichomitus squalens LYAD-421 SS1]
Length = 506
Score = 253 bits (647), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 157/441 (35%), Positives = 229/441 (51%), Gaps = 68/441 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKP 61
L + + V+ LL++E P+P A +A+W+++ + ++EA TL L + + G P
Sbjct: 58 LQPLERQIVMNLLWLETPIPGATMAAWVTREGKRPYQEALSTLQSLHILPQQST-RPGPP 116
Query: 62 WSMSAKLEVDSKAR---------------------------DLDFLNQYALERWECILRF 94
+ L KA D++ L+ +ALERWE IL +
Sbjct: 117 QPVKHNLNPTFKASLRQAITGGGTSGSFGILAENDDKHPPPDIEALDAFALERWETILYY 176
Query: 95 MV--GSQQTEGISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKY 145
MV G+ Q + V L IT GFQFLL Q+W +L+Y
Sbjct: 177 MVSSGTGQFPAKPSQGVLYLLQRSGLMATVHGATLQITSAGFQFLLHPPHEQLWELLLQY 236
Query: 146 LETVESKGLDLVECLTFLFQLKFSTFGT--------------------DYSTEGMSDKLQ 185
L+ E + +DLVE L+FL L + G YSTE +S +
Sbjct: 237 LQMAEERQMDLVEVLSFLLMLSTTELGKVGVHTIQFIAQVTQAIPHAQSYSTENLSATQK 296
Query: 186 VFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR--------GTLKQIREPGFLIVETNF 237
L+ LR++GL+ QR + +F PT+LA + + GT GF+++ETN+
Sbjct: 297 AMLEDLRDYGLIRQRTATSRKFSPTRLATTLTSSSPPLPTSAGTGDGSHVQGFIVLETNY 356
Query: 238 RVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
R+YAYTD+ L++A++ LF L YRFPNL VG +TRDSV+ AL +GITA QII +L HA
Sbjct: 357 RLYAYTDNPLQIAVLNLFVTLKYRFPNLVVGQITRDSVKKALANGITADQIISYLITHAH 416
Query: 298 PSV--SCPLPPV-VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
P + + PL PV V DQIRLWE ER R+ + EG LY+ F S AD+E + DYA+ LGV+ W
Sbjct: 417 PQMRKNKPLLPVTVQDQIRLWELERNRVKSQEGYLYTAFASHADYEWVLDYAKKLGVVLW 476
Query: 355 QNEKKRTVVVTKQGHSDIKKF 375
+N KR + +GH +I+ +
Sbjct: 477 ENPSKRCFFGSVEGHMNIRGY 497
>gi|358333989|dbj|GAA34589.2| transcription initiation factor TFIIH subunit 4 [Clonorchis
sinensis]
Length = 456
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 160/408 (39%), Positives = 215/408 (52%), Gaps = 84/408 (20%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR----------RN 50
EL E+AKH+V RLLFVEQP+P+++++ W+ K ++A L++LR R
Sbjct: 37 ELPELAKHFVARLLFVEQPIPKSIVSGWVEKGSQALLQDACRALSDLRIWHSVDGNVARG 96
Query: 51 A-----------KVALLGGGKPWSMS-AKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
A +++L GGGKP VD ++D+DFL YA ERW+ +L FMVGS
Sbjct: 97 AWCLSKKYQESLRISLFGGGKPLLGDLGSTAVDKYSKDVDFLETYAAERWDALLHFMVGS 156
Query: 99 QQTE-GISADAVRTLFN--EGA--ENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESK 152
+ +E G V L N +G + PV ITK GF FLL+ P QV F+L Y + ++S
Sbjct: 157 ESSEVGSVVQDVLLLSNLMKGGTPDTPVGITKHGFHFLLMSRPDQVRLFLLHYFDYLKS- 215
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
Y + ++ Q LQ +RE GL YQRKR A RFY T L
Sbjct: 216 ----------------------YPVDALTAPQQEVLQQMRELGLAYQRKRTAPRFYVTPL 253
Query: 213 ALNMA------------------------------TRGTLKQIREPGFLIVETNFRVYAY 242
A ++ + T + G++++ETNFR+YAY
Sbjct: 254 ATVLSGSRCHQPAMSSGSVLSAIPTGVSHLEGTDSVQPTASSTSDVGYILLETNFRLYAY 313
Query: 243 TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-- 300
TDS L+ AL+ LF ++ RFPNL V +TRDSVR AL GITA QII FL +A P +
Sbjct: 314 TDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGITADQIISFLTTNAHPDMLR 373
Query: 301 SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQ 347
P LPP + DQIRLWE ER+R EG LY QF ADFE +RD+A+
Sbjct: 374 EPPILPPTLVDQIRLWELERDRFVFQEGCLYEQFSKSADFEMVRDFAK 421
>gi|218195833|gb|EEC78260.1| hypothetical protein OsI_17944 [Oryza sativa Indica Group]
Length = 459
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 150/421 (35%), Positives = 232/421 (55%), Gaps = 53/421 (12%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK Y +++L+V PV A + W+ +A +H+ A D L +LR
Sbjct: 40 LPPLAKKYALQMLYVSAPVAAAAMEEWVLDEYAAKHRVAIDRLLQLRVFVEVRDRRKEVS 99
Query: 49 --------RNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILRF 94
N + L+ GG P S++A+L L L +ALE+WEC L
Sbjct: 100 YKMNQKFQGNMQKYLVDGGSLPREPIPSSVTARLPT------LAELESFALEQWECFLLQ 153
Query: 95 MVGSQQTE---GISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYL 146
++ S Q E S+ +RT L + E P +T++GFQFLL+ET AQ+WY + +Y+
Sbjct: 154 LINSSQVERGTSFSSSMMRTFQRGLLSSRDGEAPRLTENGFQFLLMETNAQLWYIMREYI 213
Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK----- 201
+ E +G+D E ++FL +L F T G YS ++D + ++ L E GLV ++
Sbjct: 214 SSAEERGVDPTELISFLLELSFHTLGEAYSLNTLTDVQRNAIRDLAELGLVKLQQMSRVK 273
Query: 202 ----RKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCE 257
RK F PTKLA N++ + + GF++VETNFR+YAY+ S L ++ LF
Sbjct: 274 PWQGRKDSWFIPTKLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSRLHCEILRLFSR 333
Query: 258 LMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRL 314
+ Y+ PNL VG +T++S+ A +GITA QII FL+ +A P V+ +P V V+DQIRL
Sbjct: 334 VEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIPAVPENVTDQIRL 393
Query: 315 WEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKK 374
WE +R R+ + LY F S+ F++ DYA+D G L W++ KK ++V + HS++++
Sbjct: 394 WETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRLIVRVEFHSEMRE 453
Query: 375 F 375
F
Sbjct: 454 F 454
>gi|242074798|ref|XP_002447335.1| hypothetical protein SORBIDRAFT_06g033130 [Sorghum bicolor]
gi|241938518|gb|EES11663.1| hypothetical protein SORBIDRAFT_06g033130 [Sorghum bicolor]
Length = 451
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 145/413 (35%), Positives = 229/413 (55%), Gaps = 45/413 (10%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK Y +++L+V PV A + W+ +A +HK A D L +LR
Sbjct: 40 LPPLAKKYALQMLYVLAPVTAAAMEEWVLNEYAAKHKVAIDKLLQLRVFVEVRDRRRDVS 99
Query: 49 --------RNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILRF 94
N + L+ GG P S++ +L + L YAL++WEC L
Sbjct: 100 YKMNQKFQGNMQKYLVDGGSLPREPLPLSVTGRLPTPAD------LEAYALDQWECFLLQ 153
Query: 95 MVGSQQTE---GISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYL 146
++ S Q E S+ ++T L + E +T++GFQFLL+ET AQ+WY + +Y+
Sbjct: 154 LINSSQVEKGTSFSSSMMKTFQRGLLSSRDGEASKLTENGFQFLLMETNAQLWYIMREYI 213
Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGR 206
+ E +G+D E ++FL +L F G YS ++D ++ ++ L E GLV Q++ + R
Sbjct: 214 SSAEERGVDPTELISFLLELSFHKLGAAYSLNTLTDVQRIAIRDLAELGLVKQQQGRTDR 273
Query: 207 -FYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
F PT+LA N++ + + GF++VETNFR+YAY+ S L ++ LF + Y+ PNL
Sbjct: 274 WFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSKLHCEILRLFSRVEYQLPNL 333
Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERL 322
VG +T++S+ A +GITA QII FLR +A P V+ +P V V+DQIRLWE +R R+
Sbjct: 334 IVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIPTVPENVTDQIRLWETDRNRV 393
Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
LY F S+ FE+ DYA+D G L W++ K+ ++V + H ++++F
Sbjct: 394 EMIPSHLYEDFPSKEWFEQCCDYARDNGYLLWEDSKRMRLIVRGEFHPEMREF 446
>gi|357166758|ref|XP_003580836.1| PREDICTED: general transcription factor IIH subunit 4-like
[Brachypodium distachyon]
Length = 451
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 143/404 (35%), Positives = 225/404 (55%), Gaps = 33/404 (8%)
Query: 5 IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
+AK Y +++L+V P+P A + W+ +A +HK A D L +LR
Sbjct: 43 LAKKYALQMLYVSAPMPAAAMEEWVLDEYASKHKVAIDRLLQLRVFVEVRDRRKEVSYKM 102
Query: 49 -----RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE- 102
N + L+ GG V + L L YALE+WEC L ++ S Q E
Sbjct: 103 NDKFQGNMQKYLVDGGSLPREPISSSVTGRLPTLTELENYALEQWECFLLQLINSSQVER 162
Query: 103 --GISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
S+ ++T L + + P ++++GFQFLL+ET AQ+WY + +Y+ + E +G+D
Sbjct: 163 GTSFSSSMMKTFQRGLLSSRDGDAPKLSENGFQFLLMETNAQLWYIMREYISSAEERGVD 222
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTKLAL 214
E ++FL +L F T G YS ++D ++ ++ L E GLV Q+ RK F PTKLA
Sbjct: 223 PTELISFLLELSFHTLGAAYSFNTLTDVQRIAIRDLAELGLVKVQQGRKDSWFIPTKLAT 282
Query: 215 NMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDS 274
N+++ + + G ++VETNFR+YAY+ S L ++ LF + Y+ PNL VG +T++S
Sbjct: 283 NLSSSLSDSSASKEGIVVVETNFRLYAYSASRLHCEILRLFSRVEYQLPNLIVGAITKES 342
Query: 275 VRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGVLYS 331
+ A +GITA QII FL+ +A P V+ +P V V+DQIRLWE +R R+ +Y
Sbjct: 343 LYGAFDNGITAEQIISFLKQNAHPRVADKIPVVPENVTDQIRLWETDRNRVDMVLSHVYE 402
Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
F S+ FE+ D A+D G L W++ KK ++V + H ++++F
Sbjct: 403 DFPSKDMFEQCCDLARDNGFLLWEDSKKMRLIVRVEFHQEMREF 446
>gi|326435722|gb|EGD81292.1| hypothetical protein PTSG_11329 [Salpingoeca sp. ATCC 50818]
Length = 545
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 139/396 (35%), Positives = 229/396 (57%), Gaps = 40/396 (10%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L ++K V+RLL++ + +A++++W+ A+ E+ + L +L+
Sbjct: 42 LDGLSKQMVLRLLYLPSALSKAIVSTWIVPEQARLWTESLEQLQKLQILAVSGDGAVAVA 101
Query: 49 ---RNAKVALL-GGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGI 104
+N +A L G GK + + A DFL QYA E+WE +L +MVGS + +
Sbjct: 102 QPFKNGLIAALQGTGKERCQAP--DPGKHAASKDFLQQYADEKWEDVLDYMVGS---DSL 156
Query: 105 SADAVR--------TLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
D V+ + E ++ I+ GFQFLL + Q+W++++K+L+T +++G+D+
Sbjct: 157 EEDTVKQALYQSHLMTWREDVQDFDISSLGFQFLLQDRRTQLWFYVIKFLQTCQNEGIDI 216
Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM 216
+ L F+F++ ++TFG Y TE ++ + L+H + GL Y RK+K+ ++YPT+LAL +
Sbjct: 217 EDALGFVFRVAYTTFGNYYPTEVLNKTEHLVLRHFSKIGLCYHRKKKSKQYYPTRLALAL 276
Query: 217 ATRGTL--KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDS 274
T L K GFL++ETNFR+YAYT S+LK++L+ LF + +Y+ P + V +TR S
Sbjct: 277 RTSSALVAKDAMRCGFLVIETNFRIYAYTKSDLKISLLKLFAKPLYKLPTMLVASITRSS 336
Query: 275 VRAALRSGITAAQIIGFLRLHALP-------SVSCPLPPVVSDQIRLWEGERERLTTSEG 327
VR A+ GITA QI+ FL++H SV P+PP VSDQI LWE E RL+ +EG
Sbjct: 337 VRQAMAQGITARQILHFLKVHCSSRMRHQPHSVFSPVPPTVSDQILLWEQETSRLSKTEG 396
Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVV 363
VL+ + + + F KL + ++ W +E KR ++
Sbjct: 397 VLF-RHLPKPIFPKLLAFVKERQACIWYDETKRAII 431
>gi|402224480|gb|EJU04542.1| transcription factor Tfb2 [Dacryopinax sp. DJM-731 SS1]
Length = 466
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 145/370 (39%), Positives = 214/370 (57%), Gaps = 29/370 (7%)
Query: 34 AKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDL---DFLNQYALERWEC 90
AKE L + + AL GGG+ S ++ +++ + L+ YA ERWE
Sbjct: 91 AKEANGRILLLPAFQSGLRRALTGGGEHRSFGVPCP-GARGKEVMATEQLDDYASERWES 149
Query: 91 ILRFMVGSQQTEG-ISADAVRTLF---NEGAENPV---------ITKDGFQFLLLETPAQ 137
IL FMV S + G + ++ +F G P+ IT GFQFLL ++ Q
Sbjct: 150 ILHFMVSSGSSAGRVPPPSIAVIFLLRRSGLMVPIGAERHPESRITSKGFQFLLEDSHTQ 209
Query: 138 VWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV 197
+W +L+YL E +G DLVE + FLF L G +YSTE +S+ V LQ ++GL+
Sbjct: 210 LWELLLQYLAMSEDQGRDLVEVIGFLFMLGSMQLGQEYSTENLSETQDVMLQDFLDYGLI 269
Query: 198 YQRK-RKAGRFYPTKLALNMATRGTL--------KQIREPGFLIVETNFRVYAYTDSNLK 248
Y+R RFYPT+LA + + +L K GF+I+ETN+RVYAYT++ L+
Sbjct: 270 YRRNPDDHTRFYPTRLATTLTSTSSLAFTSSKHEKAASSEGFIILETNYRVYAYTENPLQ 329
Query: 249 VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPP 306
+A++ LF L RF NL +G LTR+S++AAL +GITA QII +L +HA P + + P+ P
Sbjct: 330 IAVLNLFVALHSRFENLVIGRLTRESIKAALANGITADQIISYLTVHAHPMMHKNNPVLP 389
Query: 307 V-VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVT 365
V V DQIRLW+ E+ RL + G LY F SQ DF+ + +YA+ L V+ W+N +KR + V
Sbjct: 390 VTVQDQIRLWQLEKNRLKSENGYLYEDFNSQGDFDLVLNYAKQLDVVLWENREKRKMFVR 449
Query: 366 KQGHSDIKKF 375
+ GH ++++F
Sbjct: 450 EDGHENVREF 459
>gi|299750117|ref|XP_001836551.2| transcription factor TFIIH complex subunit Tfb2 [Coprinopsis
cinerea okayama7#130]
gi|298408751|gb|EAU85259.2| transcription factor TFIIH complex subunit Tfb2 [Coprinopsis
cinerea okayama7#130]
Length = 403
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 195/314 (62%), Gaps = 18/314 (5%)
Query: 80 LNQYALERWECILRFMV--GSQQTEGISADAVR------TLFNEGAENPVITKDGFQFLL 131
L+ YALERWE IL +MV GS Q + VR L NE + I+ GFQFLL
Sbjct: 81 LDAYALERWETILHYMVSSGSGQRPTEPSPGVRFLLKTSGLMNEIHDALHISSAGFQFLL 140
Query: 132 LETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHL 191
Q+W+ +L+YL+ E + +DLV+ L+F F L G +YS + ++ L+ L
Sbjct: 141 HSPHEQLWHLLLQYLQLAEQRRMDLVDVLSFFFMLSTMELGREYSVQNLTKTQSAMLEDL 200
Query: 192 REFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR-------EPGFLIVETNFRVYAYTD 244
R++GL++QRK+ + RF PT+L+ + + + GF+I+ETN+R+YAYTD
Sbjct: 201 RDYGLIWQRKQTSKRFSPTRLSTTLTSSSPPLPSTSGASSGPQEGFIILETNYRIYAYTD 260
Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SC 302
+ L+ A++ LF L YRFPNL VG +TR+SV+ AL +GI+A QII +L HA P++ +
Sbjct: 261 NPLQTAVLSLFASLKYRFPNLVVGSITRESVKKALLNGISADQIISYLITHAHPNMRKNN 320
Query: 303 PLPPV-VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
PL PV V DQIRLWE E+ RL + EG LY+ F SQAD+E + YA++L V+ W+N KR
Sbjct: 321 PLLPVTVQDQIRLWELEKNRLKSREGFLYTAFASQADYELVLQYAKELDVVLWENASKRC 380
Query: 362 VVVTKQGHSDIKKF 375
+ +GH++IK F
Sbjct: 381 FFGSLEGHANIKGF 394
>gi|414584751|tpg|DAA35322.1| TPA: hypothetical protein ZEAMMB73_386928 [Zea mays]
Length = 451
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 145/413 (35%), Positives = 228/413 (55%), Gaps = 45/413 (10%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK Y +++L+V PV A + W+ +A +HK A D L +LR
Sbjct: 40 LPPLAKKYALQMLYVLAPVTAAAMEEWVLDEYAAKHKVAIDKLLQLRVFVEVRDRRRDVS 99
Query: 49 --------RNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILRF 94
N + L+ GG P S++ +L + L YAL++WEC L
Sbjct: 100 YKMNQKFQGNMQKYLVDGGSLPREPLPLSVTGRLPTPAD------LEAYALDQWECFLLQ 153
Query: 95 MVGSQQTE---GISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYL 146
++ S Q E S+ ++T L + E +T++GFQFLL+ET AQ+WY + +Y+
Sbjct: 154 LINSSQVEKGSSFSSSMMKTFQRGLLSSRDGEASKLTENGFQFLLMETNAQLWYIMREYI 213
Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK-RKAG 205
+ E +G+D E ++FL +L F G YS ++D ++ ++ L E GLV Q++ RK
Sbjct: 214 SSAEERGVDPTELISFLLELSFHKLGAAYSLNTLTDVQRIAIRDLAELGLVKQQQGRKDS 273
Query: 206 RFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
F PT+LA N++ + + GF++VETNFR+YAY+ S L ++ LF + Y+ PNL
Sbjct: 274 WFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSKLHCEILRLFARVEYQLPNL 333
Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERL 322
VG +T++S+ A +GITA QII FLR +A P V+ +P V V+DQIRLWE + R+
Sbjct: 334 IVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIPAVPENVTDQIRLWETDLNRV 393
Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
LY F S+ FE+ DYA+D G L W++ K+ ++V + H ++++F
Sbjct: 394 EMIPSHLYEDFPSKEWFEQCCDYARDNGYLLWEDPKRMRLIVRGEFHPEMREF 446
>gi|449540295|gb|EMD31288.1| hypothetical protein CERSUDRAFT_119839 [Ceriporiopsis subvermispora
B]
Length = 412
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 148/403 (36%), Positives = 223/403 (55%), Gaps = 38/403 (9%)
Query: 11 IRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR----RNAKVALLGGGKP----- 61
+ LL++E +P + +A+W+++ + + TL L K+AL K
Sbjct: 1 MNLLWLESAIPVSTMAAWVTREGRTTYHASLSTLARLHILPTSATKLALNPTFKSSLRQA 60
Query: 62 ---------WSMSAKLEVDSKARDLDFLNQYALERWECILRFMV--GSQQTEGISADAVR 110
+ + A + + ++ L+ YALERWE IL +MV G+ Q +DAV
Sbjct: 61 ITGGGSSGSFGVPAPYDEINPGPSIETLDSYALERWETILYYMVSSGNGQYPTKPSDAVL 120
Query: 111 TLFN-EGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFL 163
L G V IT GFQFLL Q+W +L+YL E + +DLV+ L FL
Sbjct: 121 YLLKRSGLMTSVRGAALQITSSGFQFLLHPPHEQLWELLLQYLHLTEERQMDLVDVLGFL 180
Query: 164 FQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT-- 221
L G +YST+G+S + L+ LR++GL++QR + RF PT+LA +
Sbjct: 181 LMLSTMELGREYSTDGLSPTQKAMLEDLRDYGLLWQRSATSQRFSPTRLATTLTASSNPL 240
Query: 222 ------LKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSV 275
+ GF+++ETN+RVYAYTD+ L++A++ LF + RFPNL +G +TR+SV
Sbjct: 241 PTSSSASADSQSQGFIVLETNYRVYAYTDNPLQIAVLNLFVTMKSRFPNLVIGAVTRESV 300
Query: 276 RAALRSGITAAQIIGFLRLHALPSVS--CPLPPV-VSDQIRLWEGERERLTTSEGVLYSQ 332
+ AL +GITA QII +L HA P + PL PV V DQIRLWE E+ R+ + EG LY+
Sbjct: 301 KKALANGITADQIISYLTAHAHPQMRRYKPLLPVTVQDQIRLWELEKNRVKSQEGYLYTA 360
Query: 333 FVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
F SQAD+E + +YA+ L V+ W++ +R + +GHS+I+ F
Sbjct: 361 FASQADYEYVLNYAKQLDVVIWESPTRRCFFGSLEGHSNIRGF 403
>gi|149031822|gb|EDL86757.1| rCG41794 [Rattus norvegicus]
Length = 209
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 118/209 (56%), Positives = 159/209 (76%), Gaps = 5/209 (2%)
Query: 180 MSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT--RGTLKQIREPGFLIVETNF 237
MSD L FLQHLREFGLV+QRKRK+ R+YPT+LA+N+++ G + +PGF++VETN+
Sbjct: 1 MSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNY 60
Query: 238 RVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
R+YAYT+S L++ALI LF E++YRFPN+ V +TR+SV+ A+ SGITA QII FLR A
Sbjct: 61 RLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAH 120
Query: 298 PSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
P + + LPP ++DQIRLWE ER+RL +EGVLY+QF+SQ DFE L +A++LGVL +
Sbjct: 121 PVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVF 180
Query: 355 QNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
+N KR +VVT GHSD+K+FW + S
Sbjct: 181 ENSAKRLMVVTPAGHSDVKRFWKRQKHSS 209
>gi|451847848|gb|EMD61155.1| hypothetical protein COCSADRAFT_97372 [Cochliobolus sativus ND90Pr]
Length = 482
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 152/424 (35%), Positives = 227/424 (53%), Gaps = 62/424 (14%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK V+ +L++ P P A + +W T KE + AT TL L
Sbjct: 39 LPHLAKSIVMAMLYMPTPFPAADLDTWFKPTARKEKERATFTLDRLHIITSARQDNGTLS 98
Query: 48 -------RRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+R+ + A+ G G + + + A E K ++FL++Y+ +WE IL ++V
Sbjct: 99 WTLNPGFQRSLRNAIEGSGTHRSFGVPATKEESGKRVSIEFLDEYSRSQWEGILYYLVSG 158
Query: 99 QQTEGISADAV---------RTLFNEG------AENPVITKDGFQFLLLETPAQVWYFIL 143
G+S D++ + L + G +P ITKDGF F+L ET AQVW ++
Sbjct: 159 --AAGLSKDSISRAEVGPGTKKLLHTGDLVRTIHGSPRITKDGFSFVLQETNAQVWSLLI 216
Query: 144 KYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRK 203
YL+ G+ E L+FLF L G DYST +S L+ L GLVY+ +R
Sbjct: 217 VYLKMTNELGMSETEVLSFLFMLGSLELGQDYSTSTLSATQLQMLEDLSAMGLVYRSERN 276
Query: 204 AGRFYPTKLALNM------ATRGTLKQIREP-------------GFLIVETNFRVYAYTD 244
A FYPT+LA + A + K I + GF+I+ETN+R+YAYT+
Sbjct: 277 ARTFYPTRLATTLTSDSGSAMSASSKDIAQASTSTTGPPTAANKGFIIIETNYRLYAYTN 336
Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SC 302
S +++A++ LF +L +RFPNL G LT++SV A+++GIT+AQII +L +A P + +
Sbjct: 337 SLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQAGITSAQIISYLTTYAHPQMQKTV 396
Query: 303 P-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
P +PP V DQIRLWE E ER+ T+ G L +F S A++ + +YA LGVL WQN+ R
Sbjct: 397 PYIPPTVMDQIRLWEYEGERVETTTGYLMREFGSDAEYRDVLNYASALGVLVWQNDANRC 456
Query: 362 VVVT 365
V+
Sbjct: 457 FFVS 460
>gi|326510735|dbj|BAJ91715.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 451
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 143/404 (35%), Positives = 226/404 (55%), Gaps = 33/404 (8%)
Query: 5 IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
+AK YVI++L+V P+P A + W+ +A +HK A D L +LR
Sbjct: 43 LAKKYVIQMLYVSAPMPAAAMQEWVLDEYASKHKVAIDRLLQLRVFVEVRDRRKEVSYKM 102
Query: 49 -----RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEG 103
N + L+ GG V ++ L L YALE+WEC L ++ S Q E
Sbjct: 103 NNKFQANMQKYLVSGGCLPREPLPFNVTARLPTLVELENYALEQWECFLLQLINSSQVEK 162
Query: 104 ---ISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
S+ ++T L + E ++++GFQFLL+ET AQ+WY + +Y+ + E +G+D
Sbjct: 163 GTTFSSSMMKTFQRGLLSSRDGEAAKLSENGFQFLLMETNAQLWYIMREYISSAEERGVD 222
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTKLAL 214
+ ++FL +L F T G YS +++ ++ + L E GLV Q+ RK F PTKLA
Sbjct: 223 PTDLISFLLELSFHTQGAAYSLSTLTEVQRIAVMDLMELGLVKLQQGRKDSWFIPTKLAT 282
Query: 215 NMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDS 274
N+++ + + G ++VETNFR+YAY+ S L ++ LF + Y+ PNL VG +T++S
Sbjct: 283 NLSSSLSDSAASKEGIVVVETNFRLYAYSASKLHCEILRLFSRVEYQLPNLIVGAITKES 342
Query: 275 VRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGVLYS 331
+ A +GITA QII FL+ +A P V +P V V+DQIRLWE +R R+ +Y
Sbjct: 343 LYGAFDNGITAEQIISFLQQNAHPRVIDKIPIVPENVTDQIRLWENDRNRVEMILSHVYE 402
Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
F S+ FE+ D+A+D G L W++ KK ++V+ + H ++++F
Sbjct: 403 DFPSKDMFEQCCDHARDNGYLLWEDAKKMRLIVSGEFHQEMREF 446
>gi|403417620|emb|CCM04320.1| predicted protein [Fibroporia radiculosa]
Length = 472
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 148/412 (35%), Positives = 230/412 (55%), Gaps = 38/412 (9%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L + + V+ LL++E VP + + +W+++ + ++ TL L
Sbjct: 52 LQPLQRQLVMNLLWLETAVPVSTMFAWVTREGRSVYSDSLSTLARLHILPSSPVKLALNP 111
Query: 48 --RRNAKVALLGGGKPWSMSAKLEVDSK--ARDLDFLNQYALERWECILRFMVGSQQTEG 103
+ + + A+ GGG S ++ D + A +++ L+ YALERWE IL +MV S
Sbjct: 112 TFKASLRQAVTGGGSSRSFGVPVDPDRQYPAPNINTLDAYALERWETILHYMVSSGTGHY 171
Query: 104 ISADAVRTLF---------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
+ + LF N IT GFQFLL AQ+W +++YL VE + +
Sbjct: 172 PTRPSQEVLFLLQRSGLMTNAHGGALQITSSGFQFLLHSPHAQLWELLVQYLYMVEERQM 231
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
DLV+ L+FL L G +YSTE ++ L LR +GLV+Q+ + RF P++LA
Sbjct: 232 DLVDVLSFLLMLSTMELGREYSTENLTATQTAMLTDLRNYGLVWQQSPSSRRFSPSRLAT 291
Query: 215 NMATRGTLKQIRE--------PGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLT 266
+ + + + GF+++ETN+R+YAYTD+ L++A++ LF L RFPNL
Sbjct: 292 TLTSSSSPLPTSQNSGTGPHGEGFIVLETNYRIYAYTDNPLQIAVLNLFVTLKSRFPNLV 351
Query: 267 VGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERERLT 323
+G +TR+SV+ AL +GITA QII +L HA P + + PL PV V DQIRLWE ER R+
Sbjct: 352 IGAITRESVKKALANGITADQIISYLTAHAHPQMHKNKPLLPVTVQDQIRLWELERNRVK 411
Query: 324 TSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ EG LY+ F SQAD+E + +YA+ L V+ W+N +R+ + GH++I+ F
Sbjct: 412 SEEGYLYTSFASQADYEYVLNYAKQLDVVLWENPSRRSFFGSLDGHANIRGF 463
>gi|50557260|ref|XP_506038.1| YALI0F30173p [Yarrowia lipolytica]
gi|62901293|sp|Q6BZX4.1|TFB2_YARLI RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|49651908|emb|CAG78851.1| YALI0F30173p [Yarrowia lipolytica CLIB122]
Length = 467
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 137/355 (38%), Positives = 205/355 (57%), Gaps = 27/355 (7%)
Query: 48 RRNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGSQQTEGISA 106
++N + AL+GG + S + + K + D+ FL+ +A+ +WE IL FMVG+ S
Sbjct: 100 KKNLRAALVGGDQNISFGVPCDTEDKHKVDVAFLDAHAVSQWEMILHFMVGTSIGRTPSD 159
Query: 107 DAVRTLFNEGAENPV-----ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLT 161
+ L + G P IT GFQFLL + AQ+W +L+YL E +D V+ L
Sbjct: 160 GVLNLLKHSGLMEPERGGLRITNAGFQFLLQDVNAQIWTLLLQYLNMSEYLQMDPVDVLN 219
Query: 162 FLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA-------- 213
F+F L G DYS +S+ + L+ LR++G+VYQRK + RFYPT+LA
Sbjct: 220 FIFMLGSLELGQDYSLSALSETQKHMLEDLRDYGIVYQRKASSRRFYPTRLATTLTSETA 279
Query: 214 --------LNMATRGTLKQ--IREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFP 263
+ AT+ T+ + GF+I+ETNFR+YAYT+S L++A++ LF L RF
Sbjct: 280 ALRTASQSMEAATQDTISSSVAADSGFIILETNFRLYAYTESPLQIAVLNLFVNLKTRFA 339
Query: 264 NLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP---SVSCPLPPVVSDQIRLWEGERE 320
N+ G + RDSVR AL +GITA QII +L +HA P + LPP V DQI+LW+ E +
Sbjct: 340 NMVTGQINRDSVRFALSNGITAEQIITYLSVHAHPRMKGMEHVLPPTVVDQIKLWQLEMD 399
Query: 321 RLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
R+ ++G L+S+F + +++ + YA++LGVL ++N KR V T G I +F
Sbjct: 400 RIRATDGYLFSEFKNFDEYKDVSTYAKELGVLLYENPGKRKFVSTLAGSQQIVEF 454
>gi|339237219|ref|XP_003380164.1| general transcription factor IIH subunit 4 [Trichinella spiralis]
gi|316977049|gb|EFV60220.1| general transcription factor IIH subunit 4 [Trichinella spiralis]
Length = 674
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 147/405 (36%), Positives = 214/405 (52%), Gaps = 33/405 (8%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +A+ V+RL V A I W+ E+ L +L
Sbjct: 40 LPSVAQQIVMRLSLVSSGTTIADIEGWMVDEKKDILHESLKYLRQLHILQECNLSSIESV 99
Query: 49 -------RNAKVALLGGGKPWSMSAKLEVDSK-ARDLDFLNQYALERWECILRFMV--GS 98
+N ++ALL K + VD K + L+ YA ERWE +L+++ +
Sbjct: 100 VLNRVFAKNLRLALLC--KDAICFKTVTVDPKHQKSFADLDSYASERWESVLKYLALPSA 157
Query: 99 QQTEGISADAVRTLFNEGA-----ENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
Q + +S + R L + G +T DGFQF+L + Q+W ++L YL +E KG
Sbjct: 158 QSEKSVSVETKRVLQDSGLIQLCDSKMQLTSDGFQFILYDRRQQLWTYLLHYLAQLEKKG 217
Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
+ +C+ + Q + YSTE +++ F+QHLRE GLV+QRKR AG FY T L
Sbjct: 218 SPVHDCIMLILQACLGSHRAAYSTENLTEAALNFIQHLREIGLVHQRKRSAGWFYYTPLI 277
Query: 214 LNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
+ + + GFLIVETNFRVY YTDS L +A++ FCE +YRFPNL +L R+
Sbjct: 278 SVLTGLKSSSSSSKEGFLIVETNFRVYCYTDSVLDLAIVSTFCEPLYRFPNLVACILNRE 337
Query: 274 SVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLY 330
SVR A + I+A QII +L +A ++ + +P V+DQI+LWE ER+R GV+Y
Sbjct: 338 SVRRAFQVNISAEQIIQYLFSNAHKNMQKQTPTIPSTVTDQIKLWEMERDRFKFDPGVMY 397
Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
S F S D+ +RDYA+DLGVL ++E R +VV+ GH +F
Sbjct: 398 SNFFSDTDYITIRDYAKDLGVLLCEHEANRALVVSADGHEQSNQF 442
>gi|345569654|gb|EGX52519.1| hypothetical protein AOL_s00043g13 [Arthrobotrys oligospora ATCC
24927]
Length = 452
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 149/416 (35%), Positives = 223/416 (53%), Gaps = 52/416 (12%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK +V+ +LF PVP + + +++ + +EA L +L
Sbjct: 38 LPNLAKTFVMAMLFNPDPVPVSSLEAFVRPDQNRLKEEAFGKLRQLYIVTERQGTFALDA 97
Query: 48 --RRNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVG--SQQTE 102
+ + AL GGG P S + + K R D+ FL++YA E+WE IL FMVG +Q ++
Sbjct: 98 TFKVELRKALTGGGSPGSFGLPCDTEDKNRVDISFLDRYAAEQWESILHFMVGNETQSSQ 157
Query: 103 GISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
S VR L + A VIT+ GF FLL E +QVW +L YL+ E +D
Sbjct: 158 RPSEGVVRLLLHGKLMEQNRASGTVITQLGFSFLLQEVNSQVWTLLLLYLDMAEQLRMDP 217
Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM 216
V+ L FLF L G DY + ++ + L+ LR++G+VYQRK + RFYPT+LA +
Sbjct: 218 VDILHFLFLLGSLELGQDYDMKSLTPTQKGMLEDLRDYGIVYQRKSSSRRFYPTRLATGL 277
Query: 217 ATRGTLKQIREP--------------GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRF 262
+ ++ +R P GFLI+ETN+ S L++A++ LF L+ RF
Sbjct: 278 TSE--IRSLRSPAATLANATPGNDENGFLIIETNYH------SPLQIAVLNLFARLISRF 329
Query: 263 PNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGER 319
PNL +TR+S+R A+ GITA QII ++ +A P + S LPP V DQIRLW+ E
Sbjct: 330 PNLVTARITRESIRQAIGKGITAEQIIDYMTANAHPQLRKNSPVLPPTVVDQIRLWQIEG 389
Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
ER+ + G L+ +F + A+F + YA D+GVL W ++ K + V+K H I +
Sbjct: 390 ERMKVTSGYLFKEFATNAEFNAVCKYADDIGVLVWIDKNKGMLFVSK--HEQIADY 443
>gi|189205068|ref|XP_001938869.1| RNA polymerase II transcription factor B subunit 2 [Pyrenophora
tritici-repentis Pt-1C-BFP]
gi|187985968|gb|EDU51456.1| RNA polymerase II transcription factor B subunit 2 [Pyrenophora
tritici-repentis Pt-1C-BFP]
Length = 482
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 150/425 (35%), Positives = 223/425 (52%), Gaps = 62/425 (14%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK V+ +L++ P P A + +W + KE + AT TL L
Sbjct: 39 LPHLAKSIVMAMLYMPSPFPAADLDAWFKPSARKEKERATFTLDRLHIITSSRQDNGTLS 98
Query: 48 -------RRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+R+ + A+ G G + + + A E K ++FL++Y+ +WE IL ++V
Sbjct: 99 WTLSPGFQRSLRNAIEGSGTHRSFGVPATKEESGKRVSVEFLDEYSRAQWEGILYYLVSG 158
Query: 99 QQTEGISADAV---------RTLFNEG------AENPVITKDGFQFLLLETPAQVWYFIL 143
G+ D + +TL N G +P ITKDGF F+L ET AQVW ++
Sbjct: 159 --AAGLGKDNISRAEVSPSTKTLLNTGDLVRTIHGSPRITKDGFSFVLQETNAQVWSLLI 216
Query: 144 KYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRK 203
YL+ G+ E L FLF L G DYST +S L L GL+Y+ +
Sbjct: 217 IYLKVTNELGMSETEVLAFLFMLGSLELGQDYSTSTLSPTQLRMLDDLSSMGLIYRSDKN 276
Query: 204 AGRFYPTKLALNM---------ATRGTLKQIRE----------PGFLIVETNFRVYAYTD 244
A FYPT+LA + A+ + Q + GF+I+ETN+R+YAYT+
Sbjct: 277 ARTFYPTRLATTLTSDSGSAMSASSNDIAQAGQGNAGPSATANKGFIIIETNYRLYAYTN 336
Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SC 302
S +++A++ LF +L +RFPNL G LT++SV A++SGIT+AQII +L +A P + +
Sbjct: 337 SLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQSGITSAQIISYLTTYAHPQMQKTV 396
Query: 303 P-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
P +PP V DQIRLWE E ER+ T+ G L +F S A++ + YA LGVL WQN+ R
Sbjct: 397 PYIPPTVMDQIRLWEYEGERVETTPGYLMREFSSDAEYRDVMGYASALGVLVWQNDGARC 456
Query: 362 VVVTK 366
V++
Sbjct: 457 FFVSQ 461
>gi|451996931|gb|EMD89397.1| hypothetical protein COCHEDRAFT_1141584 [Cochliobolus
heterostrophus C5]
Length = 482
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 149/424 (35%), Positives = 224/424 (52%), Gaps = 62/424 (14%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK V+ +L++ P P A + +W T KE + AT TL L
Sbjct: 39 LPHLAKSIVMAMLYMPTPFPAADLDTWFKPTARKEKERATFTLDRLHIITSARQDNGTLS 98
Query: 48 -------RRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+R+ + A+ G G + + + A E ++FL++Y+ +WE IL ++V
Sbjct: 99 WTLNPGFQRSLRNAIEGSGTHRSFGVPATKEESGNRVSIEFLDEYSRSQWEGILYYLVSG 158
Query: 99 QQTEGISADAV---------RTLFNEG------AENPVITKDGFQFLLLETPAQVWYFIL 143
G+S D++ + L + G +P ITKDGF F+L ET AQVW ++
Sbjct: 159 --AAGLSKDSISRAEVGPGTKKLLHTGDLVRTIHGSPRITKDGFSFVLQETNAQVWSLLI 216
Query: 144 KYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRK 203
YL+ G+ E L+FLF L G DYST +S L+ L GLVY+ R
Sbjct: 217 VYLKMTNELGMSETEVLSFLFMLGSLELGQDYSTSTLSATQLQMLEDLSAMGLVYRSDRN 276
Query: 204 AGRFYPTKLALNMAT--------------RGTLKQIREP-----GFLIVETNFRVYAYTD 244
A FYPT+LA + + + + P GF+I+ETN+R+YAYT+
Sbjct: 277 ARTFYPTRLATTLTSDSGSAMSTSSNDIAQASTSTTGPPTAANKGFIIIETNYRLYAYTN 336
Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SC 302
S +++A++ LF +L +RFPNL G LT++SV A+++GIT+AQII +L +A P + +
Sbjct: 337 SLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQAGITSAQIISYLTTYAHPQMQKTV 396
Query: 303 P-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
P +PP V DQIRLWE E ER+ T+ G L +F S A++ + +YA LGVL WQN+ R
Sbjct: 397 PYIPPTVMDQIRLWEYEGERVETTTGYLMREFGSDAEYRDVLNYASALGVLVWQNDANRC 456
Query: 362 VVVT 365
V+
Sbjct: 457 FFVS 460
>gi|357123176|ref|XP_003563288.1| PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH
subunit 4-like [Brachypodium distachyon]
Length = 452
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 142/405 (35%), Positives = 223/405 (55%), Gaps = 34/405 (8%)
Query: 5 IAKHYVIRLLFVEQPVPQAVIASW-LSKTHAKEHKEATDTLTELR--------------- 48
+ K Y +++L+V P+P A + W L +A +HK A D L +LR
Sbjct: 43 LPKKYTLQMLYVSAPMPAAAMEEWVLLDEYASKHKVAIDRLLQLRVFVEVRDXRKEVSYK 102
Query: 49 ------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
N + L+ GG V + L L YALE+WEC L ++ S Q E
Sbjct: 103 MNDKFQGNMQKYLVDGGSLPREPIPSSVTGRLPTLTELENYALEQWECFLLQLINSSQVE 162
Query: 103 ---GISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
S+ ++T L + + P ++++GFQFLL+ET Q+WY + +Y+ + E +G+
Sbjct: 163 RGTSFSSSMMKTFQRGLLSSRDGDAPKLSENGFQFLLMETNVQLWYIMREYISSAEERGV 222
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTKLA 213
D E ++FL +L F T G YS ++D ++ ++ L E GLV Q+ RK F PTKLA
Sbjct: 223 DPTELISFLLELSFHTLGAAYSFNTLTDVQRIAIRDLAELGLVKVQQGRKDSWFIPTKLA 282
Query: 214 LNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
N+++ + + G ++VETNFR+YAY+ S L ++ LF + Y+ PNL VG +T++
Sbjct: 283 TNLSSSLSDSSASKEGIVVVETNFRLYAYSASRLHCEILRLFSRVEYQLPNLIVGAITKE 342
Query: 274 SVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGVLY 330
S+ A +GITA QII FL+ +A P V+ +P V V+DQIRLWE +R R+ +Y
Sbjct: 343 SLYGAFDNGITAEQIISFLKQNAHPXVADKIPVVPENVTDQIRLWETDRNRVDMVLSHVY 402
Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
F S+ FE+ D A+D G L W++ KK ++V + H ++++F
Sbjct: 403 EDFPSKDLFEQCCDLARDNGFLLWEDSKKMRLIVRVEFHQEMREF 447
>gi|330918142|ref|XP_003298106.1| hypothetical protein PTT_08708 [Pyrenophora teres f. teres 0-1]
gi|311328891|gb|EFQ93800.1| hypothetical protein PTT_08708 [Pyrenophora teres f. teres 0-1]
Length = 482
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 149/425 (35%), Positives = 222/425 (52%), Gaps = 62/425 (14%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK V+ +L++ P P A + +W + KE + AT TL L
Sbjct: 39 LPHLAKSIVMAMLYMPSPFPAADLDAWFKPSARKEKERATFTLDRLHIITSVRQDNGTLS 98
Query: 48 -------RRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+R+ + A+ G G + + + A E K ++FL++Y+ +WE IL ++V
Sbjct: 99 WTLSPGFQRSLRNAIEGSGTHRSFGVPATKEESGKRVSVEFLDEYSRSQWEGILYYLVSG 158
Query: 99 QQTEGISADAV---------RTLFNEG------AENPVITKDGFQFLLLETPAQVWYFIL 143
G+ D + +TL N G +P ITKDGF F+L ET AQVW ++
Sbjct: 159 --AAGLGKDNISRAEVSPSTKTLLNTGDLVRTIHGSPRITKDGFSFVLQETNAQVWSLLI 216
Query: 144 KYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRK 203
YL+ G+ E L FLF L G DYST +S L L GL+Y+ +
Sbjct: 217 IYLKVTNELGMSETEVLAFLFMLGSLELGQDYSTSTLSPTQLRMLDDLSSMGLIYRSDKN 276
Query: 204 AGRFYPTKLALNM---------ATRGTLKQIRE----------PGFLIVETNFRVYAYTD 244
A FYPT+LA + A+ + Q + GF+I+ETN+R+YAYT+
Sbjct: 277 ARTFYPTRLATTLTSDSGSAMSASSNDIAQAGQGNAGPSAAANKGFIIIETNYRLYAYTN 336
Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SC 302
S +++A++ LF +L +RFPNL G LT++SV A++SGIT+AQII +L +A P + +
Sbjct: 337 SLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQSGITSAQIISYLTTYAHPQMQKTV 396
Query: 303 P-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
P +PP V DQIRLWE E ER+ T+ G L +F S ++ + YA LGVL WQN+ R
Sbjct: 397 PYIPPTVMDQIRLWEYEGERVETTPGYLMREFSSDTEYRDVMGYASALGVLVWQNDAARC 456
Query: 362 VVVTK 366
V++
Sbjct: 457 FFVSQ 461
>gi|308452336|ref|XP_003089005.1| hypothetical protein CRE_21072 [Caenorhabditis remanei]
gi|308243592|gb|EFO87544.1| hypothetical protein CRE_21072 [Caenorhabditis remanei]
Length = 468
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 142/358 (39%), Positives = 211/358 (58%), Gaps = 30/358 (8%)
Query: 54 ALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV--GSQQTEGISADAVRT 111
A+LG S+ LE + + R + + A+ERW+CILR++ + T+ +S + R
Sbjct: 114 AMLGAASISSLV--LETNDEKRRGKEVEKKAVERWDCILRYLALPSEENTQAVS-ETTRN 170
Query: 112 LFNE-----GAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQL 166
LF + G IT GFQFLLL Q+W ++++YL+ S+G D+VE + L Q+
Sbjct: 171 LFKKANFTSGDSRIEITTSGFQFLLLSPVKQMWTYVIEYLKLEMSQGQDIVEVIEPLIQI 230
Query: 167 ------KFSTFGTDYSTE-GMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR 219
F Y + S Q L HLRE G+++ RKRK G F+ T L ++AT
Sbjct: 231 VLLANRGFKAEKECYQIDPSWSIPQQELLNHLRELGVIFIRKRKDGVFFLTHLLTHLATN 290
Query: 220 GTLKQIR----EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSV 275
T+ G +IVETNFRVYAYT S L++A+I LF E+ YRF +++VG++TR+SV
Sbjct: 291 ETIDDTSAERVSNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESV 350
Query: 276 RAALRSGITAAQIIGFLRLHALP-------SVSCPLPPVVSDQIRLWEGERERLTTSEGV 328
R AL+ GITAAQII FLR +A P +++C LP V+DQIRLWE ER R+ +
Sbjct: 351 RGALQHGITAAQIISFLRANAHPQCIATSGAINC-LPITVADQIRLWEDERRRMDLKDAY 409
Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQHS 386
+YS F S+ +F+ + DYAQ+ G+L W N +++ V+V ++GH +++ W KG + S
Sbjct: 410 IYSHFESEDEFQGVCDYAQERGILLWANAQQKLVIVNEEGHEYVRQ-WYKRSKGGETS 466
>gi|308460924|ref|XP_003092760.1| hypothetical protein CRE_20979 [Caenorhabditis remanei]
gi|308252519|gb|EFO96471.1| hypothetical protein CRE_20979 [Caenorhabditis remanei]
Length = 470
Score = 241 bits (616), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 142/358 (39%), Positives = 211/358 (58%), Gaps = 30/358 (8%)
Query: 54 ALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV--GSQQTEGISADAVRT 111
A+LG S+ LE + + R + + A+ERW+CILR++ + T+ +S + R
Sbjct: 117 AMLGAASISSLV--LETNDEKRRGKEVEKKAVERWDCILRYLALPSEENTQAVS-ETTRN 173
Query: 112 LFNE-----GAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQL 166
LF + G IT GFQFLLL Q+W ++++YL+ S+G D+VE + L Q+
Sbjct: 174 LFKKANFTSGDSRIEITTSGFQFLLLSPVKQMWTYVIEYLKLEMSQGQDIVEVIEPLIQI 233
Query: 167 ------KFSTFGTDYSTE-GMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR 219
F Y + S Q L HLRE G+++ RKRK G F+ T L ++AT
Sbjct: 234 VLLANRGFKAEKECYQIDPSWSIPQQELLNHLRELGVIFIRKRKDGVFFLTHLLTHLATN 293
Query: 220 GTLKQIR----EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSV 275
T+ G +IVETNFRVYAYT S L++A+I LF E+ YRF +++VG++TR+SV
Sbjct: 294 ETIDDTSAERVSNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESV 353
Query: 276 RAALRSGITAAQIIGFLRLHALP-------SVSCPLPPVVSDQIRLWEGERERLTTSEGV 328
R AL+ GITAAQII FLR +A P +++C LP V+DQIRLWE ER R+ +
Sbjct: 354 RGALQHGITAAQIISFLRANAHPQCIATSGAINC-LPITVADQIRLWEDERRRMDLKDAY 412
Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQHS 386
+YS F S+ +F+ + DYAQ+ G+L W N +++ V+V ++GH +++ W KG + S
Sbjct: 413 IYSHFESEDEFQGVCDYAQERGILLWANAQQKLVIVNEEGHEYVRQ-WYKRSKGGETS 469
>gi|357461579|ref|XP_003601071.1| General transcription factor IIH subunit [Medicago truncatula]
gi|355490119|gb|AES71322.1| General transcription factor IIH subunit [Medicago truncatula]
Length = 452
Score = 241 bits (614), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 144/408 (35%), Positives = 222/408 (54%), Gaps = 34/408 (8%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------RNAK 52
L +AK YV++LL ++ PVP ++A W+ +HK A D L +LR +N K
Sbjct: 40 LPPLAKKYVLQLLHIDGPVPAKLLAEWVLPDGLSKHKVAIDRLVQLRVFVEAFDNRKNEK 99
Query: 53 V-------------ALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
L+ GG S + + L+ L YALE+WEC L ++
Sbjct: 100 TYQVNSTYQKSLQNLLVHGGTLPRESMPSNITVRLPTLEDLETYALEQWECFLLQLISPS 159
Query: 99 --QQTEGISADAV-----RTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+T IS+ + R L E P +T+ GFQFLL++T AQ+WY I +Y+ E
Sbjct: 160 HVDKTLNISSSLMKVFQRRLLSQRDREAPKLTESGFQFLLMDTNAQLWYIIREYITNSEE 219
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPT 210
+G+D + ++F+ +L F G Y+ +++ + ++ L + GLV Q+ RK F PT
Sbjct: 220 RGVDAGDLISFMLELSFHVIGEAYNINTLTEFQRNIIKDLADLGLVKLQQGRKESWFIPT 279
Query: 211 KLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
KLA N++ T R+ GF++VETNFRVYAY+ S L ++ LF + Y+ PNL VG +
Sbjct: 280 KLATNLSVSMTESSSRKEGFVVVETNFRVYAYSTSKLHCEILRLFSRVEYQLPNLIVGAI 339
Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEG 327
T++S+ A +GITA QI+ FLR +A P V+ +P V V+DQIRLWE + R+ +E
Sbjct: 340 TKESLYNAFDNGITADQIVSFLRQNAHPRVAQRVPAVPENVTDQIRLWESDLNRVEMTEA 399
Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
Y +F S+ FE D A++ L W++ KK +VV + H+ ++ F
Sbjct: 400 YYYDEFPSRDVFEGACDCAREWSGLLWEDSKKMHLVVKSEVHTYVRDF 447
>gi|145248676|ref|XP_001400677.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus niger CBS 513.88]
gi|134081344|emb|CAK41847.1| unnamed protein product [Aspergillus niger]
Length = 478
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 152/437 (34%), Positives = 223/437 (51%), Gaps = 65/437 (14%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR-----------RN 50
L ++AK +V+ LL+++ P+P A + +W+ KE A L L R
Sbjct: 39 LPDLAKCFVMALLYLKDPLPAADLEAWVKSESLKERDNALSILGRLHILSNTVTADNVRA 98
Query: 51 AKV----------ALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGSQ 99
KV AL G + S +V + + + L++YA +WE +L +MVG+
Sbjct: 99 YKVTDPFATSLRHALTGADQTQSFGVLSQVPAGSEVSIPELDEYARRQWEGVLGYMVGTG 158
Query: 100 QTEGIS---ADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETVE 150
G + V+ L G + IT+DGF F+L + QVW+ ++ Y+E+
Sbjct: 159 GMGGPQVTLSKGVKELLQAGHLVEIRDRRVEITQDGFAFVLQDVSTQVWHILILYVESAS 218
Query: 151 SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT 210
+ +D VE L+F+F L G Y +S K L L +FG+VYQ A RFYPT
Sbjct: 219 AIKMDSVEVLSFVFLLSSLELGKSYEKNHLSSKQLRMLTDLADFGIVYQESPDAPRFYPT 278
Query: 211 KLALNMATRGT------LKQIREP----------GFLIVETNFRVYAYTDSNLKVALIGL 254
+LA + + + L + P GF+I+ETN+RVYAYT S L+++LI L
Sbjct: 279 RLATTLTSDSSALSNPLLGSLSGPAGEASSKAGTGFIIIETNYRVYAYTSSPLQISLIAL 338
Query: 255 FCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------------S 301
F L YRFPNL G +TR SVR A+ GITA QII +L HA P +
Sbjct: 339 FTTLKYRFPNLITGKITRQSVRRAVEMGITADQIISYLSTHAHPQMRKHNVSRSTSNQAG 398
Query: 302 CP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
P LPP V DQIRLW+ ER+R+ + G L+ FVS A++E YA+++GVL W++++
Sbjct: 399 IPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKDFVSLAEYEAPCRYAEEIGVLAWKSDR 458
Query: 359 KRTVVVTKQGHSDIKKF 375
KR VT+ H + F
Sbjct: 459 KRMFFVTR--HEQVAAF 473
>gi|121714341|ref|XP_001274781.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus clavatus NRRL 1]
gi|119402935|gb|EAW13355.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus clavatus NRRL 1]
Length = 478
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 151/438 (34%), Positives = 224/438 (51%), Gaps = 66/438 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEA----------TDTLT------ 45
L ++AK +V+ LL+++ P+P A + +W+ KE A T+T+T
Sbjct: 38 LPDLAKCFVMALLYLKDPLPAADLETWVKPDSLKERDNALSTLGRLHILTNTMTADNVRA 97
Query: 46 -----ELRRNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGS- 98
E + + AL G S V D A + L++Y+ +WE +L +MVG+
Sbjct: 98 YMVTKEFASSLRQALTGAEHTQSFGVVSNVSDEAAVSIADLDEYSRRQWEGVLGYMVGTG 157
Query: 99 ---QQTEGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETV 149
Q + + V+ L G + ITKDGF F+L + QVW+ ++ Y+E+
Sbjct: 158 GLGMQRDVNLSKGVKQLLQAGHLVEIRDRRVEITKDGFAFVLQDVGTQVWHILILYVESA 217
Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYP 209
E+ G+D VE L F+F L G Y + M+ L L +FG+VYQ +A FYP
Sbjct: 218 EAIGMDSVEVLKFVFFLSSLELGRSYEKKQMTSNQLRTLTDLADFGIVYQESPEATHFYP 277
Query: 210 TKLALNMAT----------------RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIG 253
T+LA + + GT GF+I+ETN+R+YAYT S L+++LI
Sbjct: 278 TRLATTLTSDSSALSNPIAGSLSGPTGTASNKAGSGFIIIETNYRLYAYTSSPLQISLIA 337
Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------- 300
LF L YRFPNL G +TR S+R A+ GITA QII +L HA P +
Sbjct: 338 LFTTLKYRFPNLITGKITRQSIRRAVEMGITADQIISYLSTHAHPQMRKHNASRSTSNQA 397
Query: 301 SCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNE 357
P LPP V DQIRLW+ ER+R+ + G L+ FVS A++E YA+++GVL W+++
Sbjct: 398 GMPVSVLPPTVVDQIRLWQLERDRVKATPGFLFKDFVSLAEYEAPCRYAEEIGVLVWKSD 457
Query: 358 KKRTVVVTKQGHSDIKKF 375
+KR VT+ H + F
Sbjct: 458 RKRMFFVTR--HEQVAAF 473
>gi|222629781|gb|EEE61913.1| hypothetical protein OsJ_16644 [Oryza sativa Japonica Group]
Length = 427
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 144/405 (35%), Positives = 222/405 (54%), Gaps = 53/405 (13%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK Y +++L+V PV A + W+ +A +H+ A D L +LR
Sbjct: 40 LPPLAKKYALQMLYVSAPVAAAAMEEWVLDEYAAKHRVAIDRLLQLRVFVEVRDRRKEVS 99
Query: 49 --------RNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILRF 94
N + L+ GG P S++A+L L L +ALE+WE +R
Sbjct: 100 YKMNQKFQGNMQKYLVDGGSLPREPIPSSVTARLPT------LAELESFALEQWE--VRM 151
Query: 95 MVGSQQTEGISADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
+G E P +T++GFQFLL+ET AQ+WY + +Y+ + E +G+
Sbjct: 152 TKAYVCIDG--------------EAPRLTENGFQFLLMETNAQLWYIMREYISSAEERGV 197
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTKLA 213
D E ++FL +L F T G YS ++D + ++ L E GLV Q+ RK F PTKLA
Sbjct: 198 DPTELISFLLELSFHTLGEAYSLNTLTDVQRNAIRDLAELGLVKLQQGRKDSWFIPTKLA 257
Query: 214 LNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
N++ + + GF++VETNFR+YAY+ S L ++ LF + Y+ PNL VG +T++
Sbjct: 258 TNLSASLSDSSSNKEGFVVVETNFRMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKE 317
Query: 274 SVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGVLY 330
S+ A +GITA QII FL+ +A P V+ +P V V+DQIRLWE +R R+ + LY
Sbjct: 318 SLYGAFENGITAEQIISFLQQNAHPRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLY 377
Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
F S+ F++ DYA+D G L W++ KK ++V + HS++++F
Sbjct: 378 EDFPSKDMFDQCCDYARDHGCLLWEDAKKMRLIVRVEFHSEMREF 422
>gi|444320663|ref|XP_004180988.1| hypothetical protein TBLA_0E04140 [Tetrapisispora blattae CBS 6284]
gi|387514031|emb|CCH61469.1| hypothetical protein TBLA_0E04140 [Tetrapisispora blattae CBS 6284]
Length = 518
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 143/455 (31%), Positives = 225/455 (49%), Gaps = 84/455 (18%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L ++AK +++ ++F E V + W+ + +EA ++ L
Sbjct: 39 LPQLAKFFIMSMVFNENEVMLRDLDRWVKSNGKLQFQEAIKSMKSLHLLVPGKNSGAVMI 98
Query: 48 ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
R + K AL GG S ++ D A + L +Y+ ++WE IL FMVG+
Sbjct: 99 NLNSTFRSSLKNALTGGEINNSFGVIVDNDRDAVKVSMLEEYSTKKWETILHFMVGTPLA 158
Query: 102 EGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
S + + L NE IT +GFQFLL E+ +Q+W +L+YL+ E+ +
Sbjct: 159 TTPSENVLNLLKHSKLMEENEETNEFKITNEGFQFLLQESNSQIWTLLLQYLKLTETLHM 218
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
D V+ L F+F L FG YS +S+ ++ L+ +R++GL++Q+ A FYPT +A
Sbjct: 219 DPVDVLNFIFMLGALEFGKAYSISSLSETQKIMLKDMRDYGLIFQKNSNANVFYPTSMAT 278
Query: 215 NMATR------------------------------------GTLKQIREPGFLIVETNFR 238
+ + G+ Q G LI+ETNF+
Sbjct: 279 MLTSDARNVRTASGAIDHILQRSQETNKNSNNDDDEDVDQVGSNTQATADGALIIETNFK 338
Query: 239 VYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP 298
+Y+Y++S L++A++ LF L RF N+ G +TRDS+R ALR+GITA QII +L H+ P
Sbjct: 339 LYSYSNSPLQIAVLSLFVHLKTRFSNMVTGQITRDSIRRALRNGITAEQIIAYLLTHSHP 398
Query: 299 SVS-------------CP--------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQA 337
+ P LPP V DQI+LW+ E +R+ + EG LYS F +
Sbjct: 399 QMRRLAEGNLEKKLELDPNAKDSLQILPPTVVDQIKLWQLELDRIISHEGSLYSDFENNQ 458
Query: 338 DFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
++ L YA+D+GVL W+N+KKR V K+G+S +
Sbjct: 459 EYNLLSTYAEDIGVLLWKNDKKRKFFVLKEGNSQV 493
>gi|358370616|dbj|GAA87227.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus kawachii IFO 4308]
Length = 478
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 154/440 (35%), Positives = 223/440 (50%), Gaps = 71/440 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR-----------RN 50
L ++AK +V+ LL+++ P+P A + +W+ KE A L L R
Sbjct: 39 LPDLAKCFVMALLYLKDPLPAADLEAWVKSESLKERDNALSILGRLHILSNTVTADNVRA 98
Query: 51 AKV----------ALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGSQ 99
KV AL G + S V + + + L++YA +WE +L +MVG
Sbjct: 99 YKVTDPFATSLRHALTGADQTQSFGVLSHVPAGSEVSISDLDEYARRQWEGVLGYMVG-- 156
Query: 100 QTEGISA------DAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLE 147
T G+ A V+ L G + IT+DGF F+L + QVW+ ++ Y+E
Sbjct: 157 -TGGMGAPQVTLSKGVKELLQAGHLVEIRDRRVEITQDGFAFVLQDVSTQVWHILILYVE 215
Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
+ + +D VE L+F+F L G Y +S K L L +FG+VYQ A RF
Sbjct: 216 SASAIKMDSVEVLSFVFLLSSLELGKSYEKNHLSSKQLRMLTDLADFGIVYQESPDAPRF 275
Query: 208 YPTKLALNMATRGT------LKQIREP----------GFLIVETNFRVYAYTDSNLKVAL 251
YPT+LA + + + L + P GF+I+ETN+RVYAYT S L+++L
Sbjct: 276 YPTRLATTLTSDSSALSNPLLGSLSGPAGEASSKAGTGFIIIETNYRVYAYTSSPLQISL 335
Query: 252 IGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------- 300
I LF L YRFPNL G +TR SVR A+ GITA QII +L HA P +
Sbjct: 336 IALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQIISYLSTHAHPQMRKHNVSRSTSN 395
Query: 301 --SCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQ 355
P LPP V DQIRLW+ ER+R+ + G L+ FVS A++E YA+++GVL W+
Sbjct: 396 QAGIPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKDFVSLAEYEAPCRYAEEIGVLAWK 455
Query: 356 NEKKRTVVVTKQGHSDIKKF 375
+++KR VT+ H + F
Sbjct: 456 SDRKRMFFVTR--HEQVAAF 473
>gi|350639203|gb|EHA27557.1| transcription factor Tfb2 [Aspergillus niger ATCC 1015]
Length = 478
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 153/438 (34%), Positives = 225/438 (51%), Gaps = 67/438 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR-----------RN 50
L ++AK +V+ LL+++ P+P A + +W+ KE A L L R
Sbjct: 39 LPDLAKCFVMALLYLKDPLPAADLEAWVKSESLKERDNALSILGRLHILSNTVTADNVRA 98
Query: 51 AKV----------ALLGGGKPWSMS--AKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
KV AL G + S +++ V S+ + L++YA +WE +L +MVG+
Sbjct: 99 YKVTDPFATSLRHALTGADQTQSFGVLSQVPVGSEV-SIPELDEYARRQWEGVLGYMVGT 157
Query: 99 QQTEGIS---ADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETV 149
G + V+ L G + IT+DGF F+L + QVW+ ++ Y+E+
Sbjct: 158 GGMGGPQVTLSKGVKELLQAGHLVEIRDRRVEITQDGFAFVLQDVSTQVWHILILYVESA 217
Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYP 209
+ +D VE L+F+F L G Y +S K L L +FG+VYQ A RFYP
Sbjct: 218 SAIKMDSVEVLSFVFLLSSLELGKSYEKNHLSSKQLRMLTDLADFGIVYQESPDAPRFYP 277
Query: 210 TKLALNMATRGT------LKQIREP----------GFLIVETNFRVYAYTDSNLKVALIG 253
T+LA + + + L + P GF+I+ETN+RVYAYT S L+++LI
Sbjct: 278 TRLATTLTSDSSALSNPLLGSLSGPAGEASSKAGTGFIIIETNYRVYAYTSSPLQISLIA 337
Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------- 300
LF L YRFPNL G +TR SVR A+ GITA QII +L HA P +
Sbjct: 338 LFTTLKYRFPNLITGKITRQSVRRAVEMGITADQIISYLSTHAHPQMRKHNVSRSTSNQA 397
Query: 301 SCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNE 357
P LPP V DQIRLW+ ER+R+ + G L+ FVS A++E YA+++GVL W+++
Sbjct: 398 GIPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKDFVSLAEYEAPCRYAEEIGVLAWKSD 457
Query: 358 KKRTVVVTKQGHSDIKKF 375
+KR VT+ H + F
Sbjct: 458 RKRMFFVTR--HEQVAAF 473
>gi|255721741|ref|XP_002545805.1| TFIIH basal transcription factor complex subunit 2 [Candida
tropicalis MYA-3404]
gi|240136294|gb|EER35847.1| TFIIH basal transcription factor complex subunit 2 [Candida
tropicalis MYA-3404]
Length = 494
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 150/450 (33%), Positives = 221/450 (49%), Gaps = 77/450 (17%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
LS +AK Y++ ++F E+P+ + W + K EA L L
Sbjct: 39 LSPMAKFYIMSMIFNEKPIALRDLDKWCKPSARKLEFEALKRLESLHLIEYDSKGSHIRL 98
Query: 48 ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
R+N + L G P + S D D+ FL+ +A +WE IL FMVG++ T
Sbjct: 99 HSTFRKNFRDCLTGSQNPNAFGSISTTADKYKVDIAFLDNFASSKWETILHFMVGTEAT- 157
Query: 103 GISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
+D+V +L EG N + IT GFQFLL + AQ+W +L+YL + +D
Sbjct: 158 ATPSDSVLSLLKLGGLMEGPNNKLRITNSGFQFLLQDVNAQIWTLLLQYLNLTQELNMDP 217
Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN- 215
V+ L F+F L G Y+ +S+ L L+++GLVYQR +GRFYPT+LA
Sbjct: 218 VDVLNFIFVLGSLELGKSYAVSTLSETQVSMLADLKDYGLVYQRSDTSGRFYPTRLATTL 277
Query: 216 -----------MATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
MA +Q+ +I+ETNF++YAYT S L++A++ LF +L RF N
Sbjct: 278 TSDSAALKTPAMAMDEEEQQVVTKESIIIETNFKLYAYTKSPLEIAILNLFVQLKTRFAN 337
Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------------------ 300
+ G +TR+S+R AL +GITA QII FL HA +
Sbjct: 338 MVCGQITRESIRNALYNGITADQIIKFLETHAHSQMRILAKEKLDKKIEFDTSHNINTAG 397
Query: 301 SCP---------------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDY 345
P LPP V DQI+LW+ E +R+ T +G L+ F +Q +++ L +Y
Sbjct: 398 GAPQSKTDGSISQHKLEILPPNVVDQIKLWQLELDRIQTFDGYLFKDFANQQEYDILSNY 457
Query: 346 AQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
A +LGVL W ++ KR VTK G + + F
Sbjct: 458 ASELGVLIWADKVKRKFFVTKDGMTQVADF 487
>gi|261187837|ref|XP_002620336.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Ajellomyces dermatitidis SLH14081]
gi|239593453|gb|EEQ76034.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Ajellomyces dermatitidis SLH14081]
gi|239608433|gb|EEQ85420.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Ajellomyces dermatitidis ER-3]
gi|327356043|gb|EGE84900.1| RNA pol II transcription initiation subunit [Ajellomyces
dermatitidis ATCC 18188]
Length = 485
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 157/435 (36%), Positives = 218/435 (50%), Gaps = 70/435 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK +V+ LL+++ P+P A + W+ +E A L L
Sbjct: 38 LPHLAKCFVMALLYLKDPLPAADLELWVKPESKRERDNALSILGRLHILSNTTTSDNVRA 97
Query: 49 --------RNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+ + AL GG K S L D + L+ YA +WE +L +MVG+
Sbjct: 98 YTVTDPFASSLRQALTGGNKQQSFGVPSLTADENPMTVALLDDYARSQWEGVLGYMVGTS 157
Query: 99 ----QQTEGISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLET 148
QQ +S V+ L E + V ITKDGF F+L + QVW+ ++ Y+E
Sbjct: 158 ALGVQQAVTLS-KGVKQLLQACHLVEIRDRRVEITKDGFAFVLQDLNTQVWHILILYVEN 216
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGR 206
E G+D VE L+FLF L G Y + ++ L L +FG+VYQ ++A R
Sbjct: 217 AEQIGMDSVEVLSFLFVLSSLELGQSYEKKHLTSTQLKTLADLTDFGIVYQHPPHQEATR 276
Query: 207 FYPTKLA-----------------LNMATRGTLKQIREPG--FLIVETNFRVYAYTDSNL 247
FYPT+LA L + G EPG F+I+ETN+R+YAYT S L
Sbjct: 277 FYPTRLATTLTSDSITLGSSISSSLTAPSGGPAAVSNEPGTGFIIIETNYRLYAYTSSPL 336
Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP--------- 298
+++LI LF L YRFPNL G LTR S+R A+ GITA QII +L HA P
Sbjct: 337 QISLISLFTTLKYRFPNLITGKLTRQSIRRAVEMGITADQIISYLTTHAHPQMRKYHASK 396
Query: 299 ------SVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
V LPP V DQIRLW+ ER+R+ + G L+ FVS A+FE YA+++GVL
Sbjct: 397 SGANPAGVPAVLPPTVVDQIRLWQLERDRIKATPGFLFKDFVSLAEFEGPCRYAEEIGVL 456
Query: 353 TWQNEKKRTVVVTKQ 367
W++EK+R VT+
Sbjct: 457 LWKSEKRRMFFVTRH 471
>gi|407927340|gb|EKG20235.1| Transcription factor Tfb2 [Macrophomina phaseolina MS6]
Length = 466
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 141/414 (34%), Positives = 217/414 (52%), Gaps = 49/414 (11%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK V+ +L++ P P + +W+ + E +A L L
Sbjct: 38 LPHLAKMIVMAMLYMPGPFPATELDTWIRDGSSGEKDKAISVLESLHIITQAQETVGARA 97
Query: 48 -------RRNAKVALLGGGKPWSMSAK-LEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
+ + + AL GGG S + SK ++ L++ A +WE IL++MVGS
Sbjct: 98 YRISSGFQTSLRQALTGGGNHRSFGVPCMNTGSKRVSIEALDEIARAKWETILQYMVGST 157
Query: 100 QTE-GISAD---AVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETV 149
T G+ ++ +TL G P IT+ GF FLL E AQVW ++ YLE
Sbjct: 158 GTALGVGSEITTGTKTLLEVGQFIEMRGGRPSITQGGFSFLLQEVNAQVWSLLIVYLENS 217
Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYP 209
+D V+ L+FLF L G DYST+ ++ Q L L +FG+VY+ K+ RFYP
Sbjct: 218 HRLNMDNVDVLSFLFMLGSLELGQDYSTDSLTATQQHMLDDLNDFGIVYRSPPKSNRFYP 277
Query: 210 TKLALNMATRGTLKQIREP--------------GFLIVETNFRVYAYTDSNLKVALIGLF 255
T+LA + + + G++I+ETN+R+YAYT S L++A++ LF
Sbjct: 278 TRLATTLTSDAAALNNSDSFSSALNSTSASGSKGYIIIETNYRLYAYTSSPLQIAVLALF 337
Query: 256 CELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQI 312
+L RFPNL G LT++SV+ A+ GITA QII +L +A P + + LPP V DQI
Sbjct: 338 TDLETRFPNLVSGKLTKESVQRAIGLGITADQIISYLDTYAHPQMQRKTPVLPPTVVDQI 397
Query: 313 RLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
RLW+ E +R+ +EG L+ F +++ L+ YA +GVL W N+++R + +T+
Sbjct: 398 RLWQIEGDRMRATEGFLFKDFRDWDEYQDLKKYADSIGVLVWSNDQRRLMFLTR 451
>gi|255556021|ref|XP_002519045.1| tfiih, polypeptide, putative [Ricinus communis]
gi|223541708|gb|EEF43256.1| tfiih, polypeptide, putative [Ricinus communis]
Length = 451
Score = 237 bits (605), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 137/407 (33%), Positives = 224/407 (55%), Gaps = 33/407 (8%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK YV+++L+++ PV + WL + +H+ A D LT+LR
Sbjct: 40 LPPLAKKYVVQILYIDGPVADKTLEEWLLPDGSSKHRVAIDRLTQLRIFTEIVDRKKEIS 99
Query: 49 --------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
N + L+ GG + + L+ L+ YAL +WEC L ++ S
Sbjct: 100 YKLNPTFQTNLQKHLIDGGVLPGEPLASNIAVRLPTLEELDTYALGQWECFLLHLINSGH 159
Query: 101 TEGIS--ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
E + + ++ +F G E P +T+ GFQFLL++T AQ+WY I +Y+ E +
Sbjct: 160 AERSTNFSSSMMKIFQRGLLTQRDKEAPRLTESGFQFLLMDTNAQLWYIIREYISNSEER 219
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTK 211
GLD + ++FL +L F G Y+ +++ + ++ L + GLV Q+ RK F PTK
Sbjct: 220 GLDSADLISFLLELSFHITGEAYNMIMLTEFQRNMIKDLADLGLVKLQQGRKESWFIPTK 279
Query: 212 LALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLT 271
LA N++ T R+ GF++VETNFR+YAY+ S L ++ LF + Y+ PNL VG +T
Sbjct: 280 LATNLSMSLTDSSSRKQGFVVVETNFRMYAYSTSKLHCEIMRLFSRVEYQLPNLVVGAMT 339
Query: 272 RDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGV 328
++S+ +A +GITA QII FL+ +A P V+ +P V V+DQIRLWE + R+ +
Sbjct: 340 KESLYSAFENGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWESDMNRVEMTPAH 399
Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
LY +F S+ FE ++A+D L W++ K+ +VV + H +++++
Sbjct: 400 LYDEFPSRDVFEAACNFARDWNGLLWEDSKRMRMVVKAEIHLNMREY 446
>gi|312069992|ref|XP_003137940.1| transcription factor Tfb2 family protein [Loa loa]
Length = 470
Score = 237 bits (605), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 143/366 (39%), Positives = 211/366 (57%), Gaps = 41/366 (11%)
Query: 48 RRNAKVALLGGGKPWSMSAKLEVDSKAR---DLDFLNQYALERWECILRFM-VGSQQTE- 102
R K +G K +SA + D K R + D L + A+ERWECIL ++ + SQ++E
Sbjct: 100 RSYLKATRIGLCKASELSAVTDCDEKTRKSANKD-LGKKAVERWECILHYLALPSQKSEQ 158
Query: 103 GISADAVRTLFNEG---AENP----VITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
G+S R + G +EN IT GFQFLLL Q+W ++L Y E G+D
Sbjct: 159 GVSGTTKRLFRSAGLTSSENDDGDIEITSAGFQFLLLNRSEQIWTYLLYYFHMQEVAGID 218
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQV----------FLQHLREFGLVYQRKRKAG 205
+++ L FLF++ T G+D S + FL HLRE GLV+ RKRK G
Sbjct: 219 IIKELDFLFRV---TNGSDKERPAGSRAFLIDECWPEPTKDFLMHLRELGLVFIRKRKDG 275
Query: 206 RFYPTKL-----ALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
F+ T L +++ A+ ++ + GF+IVETN+RVYAYTDSNL++A++ F E++Y
Sbjct: 276 FFFLTPLLNHLTSISYASETGIENRNQNGFVIVETNYRVYAYTDSNLQLAILSTFTEMLY 335
Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC----------PLPPVVSD 310
RF +++VGVL+R++VR A + GITAAQII FLR +A P+ +P V+D
Sbjct: 336 RFNDMSVGVLSREAVRRAFQVGITAAQIIAFLRANAHPTTYAVTTEYGGMIQSVPITVAD 395
Query: 311 QIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHS 370
QIRLWE ER RL +YS F S+ ++ L+ Y G+L W ++ +R +V+T++GH
Sbjct: 396 QIRLWEDERRRLMFCNAAVYSSFESEKEYFGLKGYVSSQGILLWCDDVQRLMVITEEGHE 455
Query: 371 DIKKFW 376
+K +W
Sbjct: 456 SVKSWW 461
>gi|254584224|ref|XP_002497680.1| ZYRO0F11088p [Zygosaccharomyces rouxii]
gi|238940573|emb|CAR28747.1| ZYRO0F11088p [Zygosaccharomyces rouxii]
Length = 509
Score = 237 bits (605), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 149/460 (32%), Positives = 231/460 (50%), Gaps = 90/460 (19%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L ++AK +++ ++F E + + W+ + + ++A ++ L
Sbjct: 40 LPQLAKFFIMSMVFNESAISLRDLDRWVKSSGKLQFQDAIKSMKSLHLLIPVRSNGPMLI 99
Query: 48 ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
R + K AL GG S +E+D+ D L++Y+ ++WE IL FMVG+
Sbjct: 100 DLNHTFRESFKNALTGGEVSNSFGIVMELDNDTVTTDMLDKYSADKWETILHFMVGTSLA 159
Query: 102 EGISADAVRTLFNEGA----ENPV----ITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
I ++ V L N IT +GFQFLL E +Q+W +L+YL+ E
Sbjct: 160 H-IPSENVLNLLKHSKLMEESNTTGEFKITNEGFQFLLQEVNSQLWALLLQYLKMTEMLQ 218
Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
+D V+ L F+F L F YST G+S+ +V L+ +R++GLV+Q A FYPT+LA
Sbjct: 219 MDPVDILNFIFMLGALEFSKPYSTNGLSETQKVMLKDMRDYGLVFQMNSNANVFYPTRLA 278
Query: 214 LNMAT-----RGT-------LKQIREP----------------------------GFLIV 233
+ + RG LKQ +E G LIV
Sbjct: 279 SMLTSDPKSIRGASGAMDSVLKQNKEDVLNKSGGAGANADDDADEIGSEGQNMQDGALIV 338
Query: 234 ETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLR 293
ETNF++Y+Y++S L++A++ LF + RF N+ G LTR+S+R AL +GITA QII +L
Sbjct: 339 ETNFKLYSYSNSPLQIAILSLFVHMKSRFSNMVTGQLTRESIRQALLNGITADQIIAYLE 398
Query: 294 LHALPSV------------------SCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQ 332
HA P + P LPP V DQI+LW+ E +R+ + EG LYS
Sbjct: 399 THAHPQMRRMAEEQLEKRLELDPNSKDPLQILPPTVVDQIKLWQLELDRIISFEGSLYSD 458
Query: 333 FVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
F + +F L YAQD+GVL W++++K+ V+K+G+S +
Sbjct: 459 FENHQEFTLLSSYAQDIGVLIWKDDRKKKFFVSKEGNSQV 498
>gi|358346685|ref|XP_003637396.1| General transcription factor IIH subunit [Medicago truncatula]
gi|355503331|gb|AES84534.1| General transcription factor IIH subunit [Medicago truncatula]
Length = 451
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 140/407 (34%), Positives = 220/407 (54%), Gaps = 33/407 (8%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK YV++L ++ PVP ++A W+ +HK A D L +L
Sbjct: 40 LPPLAKKYVLQLFHIDGPVPAKLLAEWVLPDGLSKHKVAIDRLVQLGVFVEALDSKNGKT 99
Query: 48 -------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS-- 98
+R+ + L+ GG S + + L+ L YALE+WEC L ++
Sbjct: 100 YKVNPTYQRSLQKLLVHGGTLPRESMPSNITVRLPTLEDLETYALEQWECFLLQLISPSH 159
Query: 99 -QQTEGISADAV-----RTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
+T IS+ + R L E P +T+ GFQFLL++T AQ+WY I +Y+ E +
Sbjct: 160 VDKTLNISSSLMKVFQRRLLSQRDREAPKLTESGFQFLLMDTNAQLWYIIREYITNSEER 219
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTK 211
G+D + ++F+ +L F G Y+ +++ + ++ L + GLV Q+ RK F PTK
Sbjct: 220 GVDAGDLISFMLELSFHVIGEAYNINTLTEFQRNIIKDLADLGLVKLQQGRKESWFIPTK 279
Query: 212 LALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLT 271
LA N++ T R+ GF++VETNFRVYAY+ S L ++ LF + Y+ PNL VG +T
Sbjct: 280 LATNLSVSMTESSSRKEGFVVVETNFRVYAYSTSKLHCEILRLFSRVEYQLPNLIVGAIT 339
Query: 272 RDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGV 328
++S+ A +GITA QI+ F R +A P V+ +P V V+DQIRLWE + R+ +E
Sbjct: 340 KESLYNAFDNGITADQIVSFFRQNAHPRVAQRIPAVPENVTDQIRLWESDLNRVEMTEAY 399
Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
Y +F S+ FE D A++ L W++ KK +VV + H+ ++ F
Sbjct: 400 YYDEFPSRDVFEGACDCAREWSGLLWEDSKKMHLVVKSEVHTYVRDF 446
>gi|169784197|ref|XP_001826560.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus oryzae RIB40]
gi|238508739|ref|XP_002385554.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus flavus NRRL3357]
gi|83775305|dbj|BAE65427.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220688446|gb|EED44799.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus flavus NRRL3357]
gi|391868531|gb|EIT77745.1| RNA polymerase II transcription initiation protein [Aspergillus
oryzae 3.042]
Length = 480
Score = 237 bits (604), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 153/439 (34%), Positives = 222/439 (50%), Gaps = 68/439 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR-----------RN 50
L ++AK +V+ LL+++ P+P + +W+ KE A L L R
Sbjct: 40 LPDLAKCFVMALLYLKDPLPATDLEAWVKAESLKERDNALSILGRLHILSNTVITDNVRA 99
Query: 51 AKV----------ALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ 99
KV AL G K S + D A + L++YA +WE +L +MVG+
Sbjct: 100 YKVTDPFASSLRQALTGAEKTQSFGVLWHISDGAAVSIAELDEYARRQWEGVLGYMVGTS 159
Query: 100 QTEGISADA-----VRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
GI DA V+ L G + IT+DGF F+L + QVW+ ++ Y+E+
Sbjct: 160 GL-GIQRDANLSKGVKQLLQAGHLVEIRDRRVEITQDGFAFVLQDVSTQVWHILMLYVES 218
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
++ G+ VE L+F+F L G Y + ++ L L +FG+VYQ A FY
Sbjct: 219 ADAIGMSSVEVLSFVFLLSSLELGKAYDKKHLTSNQLRTLTDLADFGIVYQDSPDASHFY 278
Query: 209 PTKLALNMAT----------------RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALI 252
PT+LA + + GT GF+IVETN+R+YAYT S L+++LI
Sbjct: 279 PTRLATTLTSDSSALSNPISGSLSGPSGTGSNKAGSGFIIVETNYRLYAYTSSPLQISLI 338
Query: 253 GLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------ 300
LF L YRFPNL G +TR SVR A+ GITA QII +L HA P +
Sbjct: 339 ALFTTLKYRFPNLITGKITRQSVRRAIEMGITADQIISYLATHAHPQMRKHNVSRSTSNQ 398
Query: 301 -SCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQN 356
P LPP V+DQIRLW+ ER+R+ + G L+ FVS A++E YA+++GVL W++
Sbjct: 399 AGIPPSVLPPTVTDQIRLWQLERDRVKATPGFLFKDFVSLAEYEAPCRYAEEIGVLVWKS 458
Query: 357 EKKRTVVVTKQGHSDIKKF 375
++KR VT+ H + F
Sbjct: 459 DRKRMFFVTR--HEQVAAF 475
>gi|406604717|emb|CCH43852.1| RNA polymerase II transcription factor B subunit 2 [Wickerhamomyces
ciferrii]
Length = 488
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 143/441 (32%), Positives = 223/441 (50%), Gaps = 67/441 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK +++ ++F E+P SW+ + EA ++ L
Sbjct: 39 LPGLAKFFIMSMVFNEKPQAVRDFDSWVKPNGKMDLVEALRSIKSLHLLKEANGIAIMNP 98
Query: 48 --RRNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGSQQTEGI 104
R++ + L G S + D + ++FL+QYA +WE IL FMVG++ +
Sbjct: 99 VFRKSFRNVLTGNEIGNSFGVLCDTDDTHKVSVEFLDQYAANKWETILHFMVGTELEQSP 158
Query: 105 SADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVE 158
S + L + G ++ +IT +GFQFLL + AQ+W +L+YL ES +D V+
Sbjct: 159 SPGVLSLLTHSGLMEGKSVKDMLITNEGFQFLLQDVNAQLWTLLLQYLRMAESLQMDPVD 218
Query: 159 CLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL------ 212
L F+F L G DYS +S+ L LR++GL+YQRK + RFYPT+L
Sbjct: 219 VLNFIFMLGSLELGKDYSLSALSETQVNMLGDLRDYGLIYQRKSTSRRFYPTRLATTLTS 278
Query: 213 ----------ALNMATRG------TLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFC 256
A+N T I G +I+ETNF+VYAYT+S L++A++ LF
Sbjct: 279 DTTSLRSASSAMNKVIENAKDSAYTPTNIENSGTIIIETNFKVYAYTNSPLQIAILNLFV 338
Query: 257 ELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCP------------- 303
L RF NL G +TR+S+R AL SGIT+ QII +L HA P +
Sbjct: 339 HLKARFSNLVTGQITRESIRKALVSGITSEQIISYLESHAHPQLRRAAEEELNKKNGFES 398
Query: 304 ---------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
L P ++DQI+LW+ E +R+ + +G L++ F S +++ L Y++++GVL W
Sbjct: 399 NNHGEKIQILQPTIADQIKLWQLELDRIMSFDGYLFTDFSSDQEYQVLSKYSEEIGVLLW 458
Query: 355 QNEKKRTVVVTKQGHSDIKKF 375
+ K+ VTK+G+S + F
Sbjct: 459 NDSSKKKFFVTKEGNSQVIAF 479
>gi|324512881|gb|ADY45318.1| General transcription factor IIH subunit 4 [Ascaris suum]
Length = 481
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 152/440 (34%), Positives = 241/440 (54%), Gaps = 67/440 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAK---------------EHKEATDTLTE 46
L ++++ ++++L++++ V ++ SW+ H+ E KE TL
Sbjct: 37 LPDVSQQCILKVLWLQEGVQPSLWQSWVKHEHSSLVENHFDLLRRLGIIESKEQI-TLNP 95
Query: 47 LRRNA--KVALLGGGKPWSMSAKLEVDSKARDL--DFLNQYALERWECILRFM-VGSQQT 101
+ R + + +G + + A ++D K+R L + A ERWECIL ++ + SQ++
Sbjct: 96 IFRKSYIRAVQMGLYRASQLKAMTDLDEKSRKSASKDLGKKATERWECILHYLALPSQKS 155
Query: 102 EGISADAVRTLF--------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
E + A + LF E + IT GFQFLLL Q+W +IL YL ES G
Sbjct: 156 EQGVSGATKQLFRAAGLTSGGESEGDMEITSAGFQFLLLNQTEQIWMYILHYLRLEESMG 215
Query: 154 LDLVECLTFLFQL---------------------KFSTFGTDYSTEGMSDKLQVFLQHLR 192
+++ L FL +L K +T+ + E S+ + FL HLR
Sbjct: 216 KNVMAELDFLLKLTLCVDHTHTNGRLRVVDGDREKKATWRAFFIDESWSETITNFLMHLR 275
Query: 193 EFGLVYQRKRKAGRFYPT----KLALNMATRGTLKQIR-EPGFLIVETNFRVYAYTDSNL 247
E GLV+ RKRK G F+ T L ++ ++ + + R G++IVETN+RVYAYTDS+L
Sbjct: 276 ELGLVFIRKRKDGYFFITPLFAHLTVSSSSETAVAEKRTHSGYIIVETNYRVYAYTDSSL 335
Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS-------- 299
++A++ F E++YRF +++VG+LTRDSVR AL+ GITA+QII FLR +A P
Sbjct: 336 QLAILSTFTEMLYRFNDMSVGMLTRDSVRRALQVGITASQIISFLRANAHPETVAAANAS 395
Query: 300 ---VSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQN 356
+ C +P V+DQIRLWE ER RL + LYS F S+ ++ +++Y + +L W +
Sbjct: 396 PGVIHC-VPVTVADQIRLWEDERHRLVFFDSALYSTFESEREYVGVKEYTRSQDILLWFD 454
Query: 357 EKKRTVVVTKQGHSDIKKFW 376
+ +R VVVT++GH +K +W
Sbjct: 455 DVQRLVVVTEEGHDMVKAWW 474
>gi|393910938|gb|EFO26137.2| transcription factor Tfb2 family protein [Loa loa]
Length = 486
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 147/377 (38%), Positives = 213/377 (56%), Gaps = 50/377 (13%)
Query: 48 RRNAKVALLGGGKPWSMSAKLEVDSKAR---DLDFLNQYALERWECILRFM-VGSQQTE- 102
R K +G K +SA + D K R + D L + A+ERWECIL ++ + SQ++E
Sbjct: 103 RSYLKATRIGLCKASELSAVTDCDEKTRKSANKD-LGKKAVERWECILHYLALPSQKSEQ 161
Query: 103 GISADAVRTLFNEG---AENP----VITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
G+S R + G +EN IT GFQFLLL Q+W ++L Y E G+D
Sbjct: 162 GVSGTTKRLFRSAGLTSSENDDGDIEITSAGFQFLLLNRSEQIWTYLLYYFHMQEVAGID 221
Query: 156 LVECLTFLFQLKF---STFGTDYSTEGMSDK------------------LQVFLQHLREF 194
+++ L FLF+L T G T G SDK + FL HLRE
Sbjct: 222 IIKELDFLFRLTLYSGDTSGGTRVTNG-SDKERPAGSRAFLIDECWPEPTKDFLMHLREL 280
Query: 195 GLVYQRKRKAGRFYPTKL-----ALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKV 249
GLV+ RKRK G F+ T L +++ A+ ++ + GF+IVETN+RVYAYTDSNL++
Sbjct: 281 GLVFIRKRKDGFFFLTPLLNHLTSISYASETGIENRNQNGFVIVETNYRVYAYTDSNLQL 340
Query: 250 ALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC------- 302
A++ F E++YRF +++VGVL+R++VR A + GITAAQII FLR +A P+
Sbjct: 341 AILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQIIAFLRANAHPTTYAVTTEYGG 400
Query: 303 ---PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKK 359
+P V+DQIRLWE ER RL +YS F S+ ++ L+ Y G+L W ++ +
Sbjct: 401 MIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFESEKEYFGLKGYVSSQGILLWCDDVQ 460
Query: 360 RTVVVTKQGHSDIKKFW 376
R +V+T++GH +K +W
Sbjct: 461 RLMVITEEGHESVKSWW 477
>gi|393910939|gb|EJD76097.1| transcription factor Tfb2 family protein, variant [Loa loa]
Length = 389
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 147/377 (38%), Positives = 213/377 (56%), Gaps = 50/377 (13%)
Query: 48 RRNAKVALLGGGKPWSMSAKLEVDSKAR---DLDFLNQYALERWECILRFM-VGSQQTE- 102
R K +G K +SA + D K R + D L + A+ERWECIL ++ + SQ++E
Sbjct: 6 RSYLKATRIGLCKASELSAVTDCDEKTRKSANKD-LGKKAVERWECILHYLALPSQKSEQ 64
Query: 103 GISADAVRTLFNEG---AENP----VITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
G+S R + G +EN IT GFQFLLL Q+W ++L Y E G+D
Sbjct: 65 GVSGTTKRLFRSAGLTSSENDDGDIEITSAGFQFLLLNRSEQIWTYLLYYFHMQEVAGID 124
Query: 156 LVECLTFLFQLKF---STFGTDYSTEGMSDK------------------LQVFLQHLREF 194
+++ L FLF+L T G T G SDK + FL HLRE
Sbjct: 125 IIKELDFLFRLTLYSGDTSGGTRVTNG-SDKERPAGSRAFLIDECWPEPTKDFLMHLREL 183
Query: 195 GLVYQRKRKAGRFYPTKL-----ALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKV 249
GLV+ RKRK G F+ T L +++ A+ ++ + GF+IVETN+RVYAYTDSNL++
Sbjct: 184 GLVFIRKRKDGFFFLTPLLNHLTSISYASETGIENRNQNGFVIVETNYRVYAYTDSNLQL 243
Query: 250 ALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC------- 302
A++ F E++YRF +++VGVL+R++VR A + GITAAQII FLR +A P+
Sbjct: 244 AILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQIIAFLRANAHPTTYAVTTEYGG 303
Query: 303 ---PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKK 359
+P V+DQIRLWE ER RL +YS F S+ ++ L+ Y G+L W ++ +
Sbjct: 304 MIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFESEKEYFGLKGYVSSQGILLWCDDVQ 363
Query: 360 RTVVVTKQGHSDIKKFW 376
R +V+T++GH +K +W
Sbjct: 364 RLMVITEEGHESVKSWW 380
>gi|344303892|gb|EGW34141.1| RNA polymerase II transcription factor B subunit 2 [Spathaspora
passalidarum NRRL Y-27907]
Length = 519
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 153/471 (32%), Positives = 222/471 (47%), Gaps = 98/471 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
LS +AK +++ ++F E+PV + W + K EA L L
Sbjct: 40 LSPMAKFFIMSMIFNEKPVALRDLDKWCMPSARKLEYEALKQLESLHLIEYDSKASHIRL 99
Query: 48 ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
R+N + L G P + S VD D+ FL+ +A +WE IL FMVG++ T
Sbjct: 100 HSTFRQNFRDCLTGSQDPTAFGSISTTVDKHQVDVAFLDSFASSKWETILHFMVGTEST- 158
Query: 103 GISADAVRTLF---------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
+++V +L GA N IT GFQFLL + AQ+W +L+YL +
Sbjct: 159 ATPSNSVLSLLRLGGLMEGPGRGASNLKITNTGFQFLLQDVNAQIWTLLLQYLNLTQELN 218
Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
+D V+ L F+F L G Y+ +S+ L L+++GLVY+R + RFYPT+LA
Sbjct: 219 MDPVDVLNFIFVLGSLELGKSYAVSSLSETQVSMLADLKDYGLVYRRTDTSSRFYPTRLA 278
Query: 214 LNM-----------------------ATRGTLKQIREPGFL----IVETNFRVYAYTDSN 246
+ T + Q PG + I+ETNF++YAYT+S
Sbjct: 279 TTLTSDSSALKTPAMAVEQALDSVEGGTDNEVSQTAPPGNIQGTVIIETNFKLYAYTNSP 338
Query: 247 LKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------ 300
L++A++ LF L RF N+ G +TR+S+R AL +GITA QII FL HA P +
Sbjct: 339 LEIAILNLFVHLRTRFANMVCGQITRESIRNALYNGITADQIIRFLETHAHPQMKVLAKE 398
Query: 301 ------------------SCP------------------LPPVVSDQIRLWEGERERLTT 324
P LPP V DQI+LW+ E +R+ T
Sbjct: 399 KLDKKIEFDTSHNINTAGGAPQSQALRADGSVAQHKLEILPPNVVDQIKLWQLELDRIQT 458
Query: 325 SEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+G L+ F +Q +F+KL YA +LGVL W ++ KR VTK G S + F
Sbjct: 459 FDGYLFKDFPNQQEFDKLSSYASELGVLIWSDKIKRKFFVTKDGMSQVADF 509
>gi|224077128|ref|XP_002305144.1| predicted protein [Populus trichocarpa]
gi|222848108|gb|EEE85655.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 140/409 (34%), Positives = 227/409 (55%), Gaps = 39/409 (9%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK YV+++LF + PV ++ W+ +H+ A D L +LR
Sbjct: 40 LPPLAKKYVVQILFFDGPVSGKLLEQWVLADGVTKHRVAIDRLIQLRIFIEVSDKKRESS 99
Query: 49 --------RNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
N + L GG +M+A +++ S L+ L+ YALE+WEC L ++ S
Sbjct: 100 YKLNQTFQANLRKHLTNGGVLPRETMAAVVKLPS----LEELDTYALEQWECFLLLLISS 155
Query: 99 QQTEGISA--DAVRTLFNEGA------ENPVITKDGFQFLLLETPAQVWYFILKYLETVE 150
Q E ++ ++ +F G + P +T+ GFQFLL++T AQ+WY I +Y+ E
Sbjct: 156 GQAEKPTSLSSSMMKIFQRGLLSQRDRDAPRLTEGGFQFLLMDTNAQLWYIIREYITNSE 215
Query: 151 SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYP 209
+G + + ++FL +L F G Y+ +++ + ++ L E GLV Q+ RK F P
Sbjct: 216 ERGTEPADLISFLLELSFHVTGEAYNMNTLTEIQRNTIKDLAELGLVKLQQGRKESWFIP 275
Query: 210 TKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
TKLA N++ T R+ G+++VETNFR+YAY+ S L ++ LF ++ Y+ PNL VG
Sbjct: 276 TKLATNLSVSLTDSSSRKQGYVVVETNFRLYAYSSSKLHCEILRLFSKIEYQLPNLIVGA 335
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSE 326
+T++S+ A +GIT+ QII FL+ +A P V+ LP V V+DQIRLWE + R+ +
Sbjct: 336 ITKESLYTAFENGITSDQIISFLQQNAHPRVAERLPSVPENVTDQIRLWEADLNRVEITP 395
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
Y +F S+ FE D+A++ L W++ KK VVV + H ++++F
Sbjct: 396 SHFYDEFPSRDTFEAACDFAREWNGLLWEDSKKMRVVVKAEIHMNMREF 444
>gi|388581217|gb|EIM21527.1| transcription factor Tfb2 [Wallemia sebi CBS 633.66]
Length = 449
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 143/373 (38%), Positives = 214/373 (57%), Gaps = 31/373 (8%)
Query: 36 EHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFM 95
++ + TL E +++ +L G + S+S +S ++ L+ +A ++WE IL +M
Sbjct: 76 DNSNGSITLNEQFKHSLKQILIGREIESLS-----NSITTTIEQLDTFATQQWELILHYM 130
Query: 96 VGSQQTEGISADAVRTLFNEGAENPV--------ITKDGFQFLLLETPAQVWYFILKYLE 147
VGS S + LF N + I+ GFQFLL + Q+W +L+YL
Sbjct: 131 VGSYN----STPSKGVLFLLEKSNLIYTANKQHRISSKGFQFLLEDVQVQLWQLLLEYLS 186
Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
E + +DLVE L+FLF L G DYS E +++ Q L LR++GLV+QRK + RF
Sbjct: 187 LSEIRQMDLVEVLSFLFVLGTLELGKDYSIESLTNTQQAMLSDLRDYGLVWQRKTSSKRF 246
Query: 208 YPTKLALNMATRG-----------TLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFC 256
YPT+LA + + F+I+ETN+R+YAYT + L+++++ LF
Sbjct: 247 YPTRLATTLTSSAPPLLPTTESSSFTTSSDNKRFIILETNYRLYAYTSNPLQISILNLFV 306
Query: 257 ELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIR 313
L R+PNL +GV+TRDS+R+AL +GITA QIIG+L HA + + PL PV VSDQIR
Sbjct: 307 TLKARYPNLVIGVITRDSIRSALSNGITAEQIIGYLTSHAHTQMHRNNPLLPVTVSDQIR 366
Query: 314 LWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIK 373
LWE E+ RL +GVLY++F SQ D+E L +YA+ + W N+ KR VT +GH ++
Sbjct: 367 LWELEKNRLKADDGVLYAEFRSQPDYEILLNYAKSYDCVLWSNDIKRMFFVTLEGHQIVR 426
Query: 374 KFWVNHQKGSQHS 386
+F G+ +S
Sbjct: 427 EFVRRRIPGNNNS 439
>gi|268534706|ref|XP_002632484.1| Hypothetical protein CBG13719 [Caenorhabditis briggsae]
Length = 482
Score = 234 bits (598), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 141/355 (39%), Positives = 205/355 (57%), Gaps = 30/355 (8%)
Query: 54 ALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV--GSQQTEGISADAVRT 111
A+LG S+ LE + + R + + A+ERW+CILR++ + TE +S D +
Sbjct: 121 AMLGAASISSLV--LETNDEKRRGKDVEKKAVERWDCILRYLALPSEENTEAVS-DTTKD 177
Query: 112 LFNE-----GAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQL 166
LF + G IT GFQFLLL Q+W ++++YL+ SKG D+VE + L Q+
Sbjct: 178 LFRKANFTSGESRIEITTFGFQFLLLSPVKQMWTYVIEYLKLEISKGNDIVEVIEPLIQI 237
Query: 167 ------KFSTFGTDYST-EGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR 219
F Y E + L HLRE G+++ RKRK G F+ T L ++AT
Sbjct: 238 VLLANRGFKAEKECYQIDEQWTQPQHDLLNHLRELGVIFIRKRKDGVFFLTHLLTHLATN 297
Query: 220 GTLKQIREP----GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSV 275
T+ G +IVETNFRVYAYT S L++A+I LF E+ YRF +++VG++TR+SV
Sbjct: 298 ETIDDTSSEKASNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFTDMSVGMITRESV 357
Query: 276 RAALRSGITAAQIIGFLRLHALPS-------VSCPLPPVVSDQIRLWEGERERLTTSEGV 328
R AL+ GITAAQII FLR +A P V+C LP V+DQIRLWE ER R+ +
Sbjct: 358 RGALQHGITAAQIISFLRANAHPQCVATSGPVNC-LPITVADQIRLWEDERRRMDLKDAY 416
Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
+YS F S ++ + YAQ+ G+L W N +++ V+V ++GH +++ W KG+
Sbjct: 417 IYSHFESDDEYHGVVRYAQERGILLWANPQQKLVIVNEEGHEAVRQ-WYKRSKGA 470
>gi|403217068|emb|CCK71563.1| hypothetical protein KNAG_0H01490 [Kazachstania naganishii CBS
8797]
Length = 519
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 143/467 (30%), Positives = 231/467 (49%), Gaps = 96/467 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L ++AK +++ ++F + V + W+ + +EA ++ L
Sbjct: 41 LPQLAKFFIMSMVFNDNDVSLRDLDRWVKANGKIQFQEAIKSMKSLHLIIPTKGAGQLMI 100
Query: 48 ------RRNAKVALLGGGKPWSMSAKLEVDSKAR----DLDFLNQYALERWECILRFMVG 97
R + + AL GG S + D R ++ L++Y+ +WE IL FMVG
Sbjct: 101 GLNPTFRDSFRNALTGGEVSNSFGTVVNDDEDLRTGVLNMRLLDEYSANKWETILHFMVG 160
Query: 98 SQQTEGISADAVRTLFN-------EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVE 150
+ +E S++ + L + E IT +GFQFLL E +Q+W +L+YL+ E
Sbjct: 161 TPMSEVPSSNVLNLLKHSKLMEESEATGAFKITNEGFQFLLQEINSQIWTLLLQYLKMSE 220
Query: 151 SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT 210
S +D V+ L F+F L G YSTEG+S+ ++ ++ +R++G+V+Q+ + FYPT
Sbjct: 221 SLQMDAVDVLNFIFMLGALEVGKAYSTEGLSETQRIMIKDMRDYGIVFQKNSTSNVFYPT 280
Query: 211 KLALNMATR--------------------------------------------GTLKQIR 226
LA+ + + GT Q
Sbjct: 281 SLAIMLTSDSKTVLRTASGAMNTVLSQNKEDTSSTNKSATDKANNDTAELDQVGTQSQEV 340
Query: 227 EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAA 286
+ G LIVETNF+VY+Y++S L++A++ LF L RF N+ G +T+ S+ AL +GITA
Sbjct: 341 QDGVLIVETNFKVYSYSNSPLQIAILSLFVHLKSRFINMVTGQITKRSIINALDNGITAE 400
Query: 287 QIIGFLRLHALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTS 325
QII +L HA P + +C LPP V DQI+LW+ E++R+
Sbjct: 401 QIIAYLETHAHPQMRRLAEAQLEKKRELDQNCKDPLQILPPTVVDQIKLWQLEKDRVMAF 460
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
EG LYS F S A++ L+ YAQD+GV+ W +KKR + V+++G+S +
Sbjct: 461 EGSLYSDFESNAEYNILKKYAQDIGVMLWSEDKKRKLFVSQEGNSQV 507
>gi|356521347|ref|XP_003529318.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine
max]
Length = 451
Score = 234 bits (596), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 136/404 (33%), Positives = 218/404 (53%), Gaps = 33/404 (8%)
Query: 5 IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
+AK YVI++L ++ PV ++ W+ +H+ A D L +LR
Sbjct: 43 LAKKYVIQMLHIDVPVAAKLLEEWVLPAGVSKHRVAIDRLVQLRVFLEAVDRKNEKTYKV 102
Query: 49 -----RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE- 102
R+ + L+ GG S + + L+ L YALE+WEC L ++ Q E
Sbjct: 103 NPTFQRSLQKLLVQGGTLPRESMPSNITVRLPTLENLEAYALEQWECFLLQLISPSQVEK 162
Query: 103 --GISADAV-----RTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
IS+ + R L + E P +T+ GFQFLL++T AQ+WY I +Y+ E +G+D
Sbjct: 163 PLNISSSLMKVFQRRILSHRDKEAPKLTESGFQFLLMDTNAQLWYIIREYISNSEDRGVD 222
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTKLAL 214
+ ++F+ +L F G YS + ++ + + L + GLV Q+ RK F PTKLA
Sbjct: 223 AADLISFMLELSFHVIGEAYSVDTLTSFQRTIINDLADLGLVKIQQGRKGSWFIPTKLAT 282
Query: 215 NMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDS 274
N++ R+ GF++VETNFRVYAY+ S L ++ LF + Y+ PNL VG +T++S
Sbjct: 283 NLSMSLADSSSRKQGFVVVETNFRVYAYSTSKLHCEILRLFSRVEYQLPNLIVGAITKES 342
Query: 275 VRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGVLYS 331
+ +A +GITA QI+ FL+ +A P V+ +P V V++QIRLWE + R+ ++ Y
Sbjct: 343 LYSAYENGITAEQIVTFLQQNAHPRVAERVPSVPENVTEQIRLWEADLNRVEMTDTYYYD 402
Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+F S+ FE D A++ L W++ KK +VV + H ++ +
Sbjct: 403 EFPSRDVFEGACDCAREWNGLLWEDSKKMHMVVKTEVHPYVRDY 446
>gi|296422658|ref|XP_002840876.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637102|emb|CAZ85067.1| unnamed protein product [Tuber melanosporum]
Length = 460
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 143/408 (35%), Positives = 217/408 (53%), Gaps = 47/408 (11%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRR-------NAKVA 54
L +AK V LL+ E PV + + + ++E D ++LRR N V+
Sbjct: 37 LPNLAKTIVTTLLYNETPV---TLGDLEALVRPESYREREDAFSKLRRLHIITENNLDVS 93
Query: 55 L----------LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGI 104
L G + + E D K + FL+ YA ++E IL +MVG+ E
Sbjct: 94 LDPVFKKNFRLAGDHHSFGVPCNAE-DKKKITVQFLDDYATRQFEAILHYMVGTHN-EVK 151
Query: 105 SADAVRTLFNEGA--------ENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
+ V TL +G + ITK+GF FLL + Q+W +++YLE E G++
Sbjct: 152 PSKGVITLLTKGGLMERYGAGSSATITKNGFSFLLQDGNPQIWALLIQYLEMSEELGMEQ 211
Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM 216
+ L FL L G YS +++ ++ L LR++G+VYQRK + RFYPT+LA +
Sbjct: 212 TDVLHFLLMLGSLELGQAYSVNTLTETQKLMLADLRDYGIVYQRKSSSDRFYPTRLATTL 271
Query: 217 ATR-GTLKQIRE-------------PGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRF 262
+ G L+ GF+I+ETN+RVYAYTDS L++A++ LF +L R+
Sbjct: 272 TSESGGLRSASASMSSAMAKDAEEGKGFIILETNYRVYAYTDSPLQIAVLNLFVKLSTRY 331
Query: 263 PNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVS---CPLPPVVSDQIRLWEGER 319
PNL G ++R S++ A++ GITA Q+I +L HA P + LPP V DQIRLW+ E
Sbjct: 332 PNLVSGRISRRSIQEAIKMGITADQVIDYLSAHAHPQMRKSLVTLPPTVVDQIRLWQIEG 391
Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQ 367
ER+ T+ G L+ F S +FE++ YA++LGVL W+ +R + VT+
Sbjct: 392 ERMRTTTGFLFKDFHSTQEFEEVAKYAEELGVLKWKKPARRCMFVTRH 439
>gi|241958748|ref|XP_002422093.1| TFIIH subunit, putative; tanscription initiation factor IIB,
putative [Candida dubliniensis CD36]
gi|223645438|emb|CAX40094.1| TFIIH subunit, putative [Candida dubliniensis CD36]
Length = 494
Score = 233 bits (595), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 151/450 (33%), Positives = 224/450 (49%), Gaps = 77/450 (17%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
LS +AK Y++ ++F E+P+ + W + K EA L L
Sbjct: 39 LSPMAKFYIMSMIFNEKPIALRDLDKWCKPSARKLEFEALKRLESLHLIEYDSKGSHIRL 98
Query: 48 ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
R+N + L G P + S VD D+ FL+ +A ++WE IL FMVG++ T
Sbjct: 99 HSIFRKNFRDCLTGSQNPNAFGSISTTVDKHKVDIPFLDLFASQKWETILHFMVGTEST- 157
Query: 103 GISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
+D+V +L EG N + IT GFQFLL + AQ+W +L+YL + +D
Sbjct: 158 ATPSDSVLSLLKLGGLMEGPNNDLRITNSGFQFLLQDVNAQIWTLLLQYLNLTQELNMDP 217
Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL-- 214
V+ L F+F L G YS +S+ L L+++GLVYQR +GRFYPT+LA
Sbjct: 218 VDVLNFIFILGSLELGKSYSVSSLSETQVSMLADLKDYGLVYQRSDTSGRFYPTRLATTL 277
Query: 215 ----------NMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
+MA +Q+ +I+ETNF++YAYT S L++A++ LF RF N
Sbjct: 278 TSDSAALKTPSMAMDEEEQQVVSKESIIIETNFKIYAYTKSPLEIAILNLFVHFKTRFAN 337
Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------------------ 300
+ G +TR+S+R AL +GITA QII FL HA P +
Sbjct: 338 MVCGQITRESIRNALYNGITADQIIKFLETHAHPQMRILAKEKLDKKIEFDTSHNINTAG 397
Query: 301 SCP---------------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDY 345
P LPP V DQI+LW+ E +R+ T +G L+ F +Q +++ L +Y
Sbjct: 398 GAPQSKTDGMISQHKLEILPPNVVDQIKLWQLELDRIQTFDGYLFKDFSNQQEYDILSNY 457
Query: 346 AQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
A +LGVL W ++ K+ VTK+G + + F
Sbjct: 458 ASELGVLIWADKVKKKFFVTKEGMTQVADF 487
>gi|297800414|ref|XP_002868091.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp.
lyrata]
gi|297313927|gb|EFH44350.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 136/418 (32%), Positives = 230/418 (55%), Gaps = 43/418 (10%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK YV+++L++ PVP A++ W+ A +H+ A D L +LR
Sbjct: 40 LPPLAKKYVLQMLYINVPVPAAMMEEWVLADGASKHRVAIDRLIQLRIFSETSDRKRGIS 99
Query: 49 --------RNAKVALLGGG----KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV 96
N + ++ GG +P +++ S L L YAL++WEC L ++
Sbjct: 100 YSLNPTFQNNLQKHIISGGVLPREPMHSDNAIKLPS----LQELETYALKQWECFLLQLI 155
Query: 97 GSQQTE---GISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLE 147
S Q E GIS+ +R +F G + P +T+ GFQFLL++T AQ+WY I +Y+
Sbjct: 156 NSGQGEKLTGISSSMMR-IFQRGLLSQRDKDGPRLTESGFQFLLMDTNAQLWYIIREYIS 214
Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGR 206
E + ++ + ++FL +L F G Y++ +++ L+ L + GLV Q+ RK
Sbjct: 215 NAEERDVEPADLISFLLELSFHVTGEAYNSNTLTEVQNNTLKDLADLGLVKLQQGRKDSW 274
Query: 207 FYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLT 266
F PTKLA N++ R+ GF+++ETNFR+YAY+ S L+ ++ LF + Y+ PNL
Sbjct: 275 FIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIEYQLPNLI 334
Query: 267 VGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLT 323
+T++S+ A +GIT+ QII FL+ ++ P + +P + V+DQIRLWE + +R+
Sbjct: 335 ACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWETDLKRIE 394
Query: 324 TSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
++ Y +F S+ FE D+A++ G L W++ K+ +VV + H+ +++F N K
Sbjct: 395 MTQAHFYDEFPSKDVFEAACDFAREWGGLLWEDSKRMRLVVKSEVHNQMREFLHNQTK 452
>gi|147854406|emb|CAN81292.1| hypothetical protein VITISV_005315 [Vitis vinifera]
Length = 451
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 137/407 (33%), Positives = 222/407 (54%), Gaps = 33/407 (8%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK Y++++L+++ PV + W+ +H+ A D L +LR
Sbjct: 40 LPPLAKKYILQMLYIDVPVTAKAMEEWVLADGFSKHRVAIDRLIQLRVFTETSDRKKETS 99
Query: 49 --------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
N + L+ GG + + LD L YAL +WEC L ++ S Q
Sbjct: 100 YRLNPTFQTNLQKHLIYGGVLPREPMPSNITVRLPSLDDLEAYALGQWECFLLQLISSTQ 159
Query: 101 TEGIS--ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
TE ++ + ++ +F G E P +T+ GFQFLL++T AQ+WY + +Y+ E +
Sbjct: 160 TEKLTNFSSSLMKVFQRGLLTQREKEAPRLTESGFQFLLMDTNAQLWYIMREYISNSEER 219
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTK 211
G+D + ++FL +L F G Y+ +++ + ++ L + GLV Q+ RK F PTK
Sbjct: 220 GVDPADLISFLLELSFHVTGEAYNINTLTEFQRNTIKDLVDLGLVKLQQGRKESWFIPTK 279
Query: 212 LALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLT 271
LA N++ + R+ GF++VETNFR+YAY+ S L ++ LF + Y+ PNL VG +T
Sbjct: 280 LATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSSKLHCEILRLFSRVEYQLPNLIVGAIT 339
Query: 272 RDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGV 328
++S+ A +GITA QII FL+ +A P V+ P V V+DQIRLWE + R+ T
Sbjct: 340 KESLYNAFENGITAEQIISFLQQNAHPRVAERTPAVPENVTDQIRLWETDLNRVETMPSH 399
Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
LY +F S+ FE D+A++ G L W++ KK +VV + H ++++
Sbjct: 400 LYDEFPSRDVFEAACDFAREYGGLLWEDSKKMRLVVKAEIHLHMREY 446
>gi|347830304|emb|CCD46001.1| similar to RNA polymerase II transcription factor B subunit 2
[Botryotinia fuckeliana]
Length = 471
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 143/421 (33%), Positives = 222/421 (52%), Gaps = 57/421 (13%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK +V+ LL++ +P+P + W+ K+ +A L+ L
Sbjct: 38 LPHLAKTFVMALLYMPKPLPLIALDQWVQSKSKKQKDQALSLLSRLHIVDITALSREDPQ 97
Query: 49 ---------RNAKVALLGGGKPWSMSAKLEVDSKAR--DLDFLNQYALERWECILRFMV- 96
+ ++AL GGG S D A D+DFL+ +A +WE IL +MV
Sbjct: 98 TVALTKNFGASLRLALTGGGNHQSFGVP-SSDHIAPHVDIDFLDTHARMQWEGILHYMVN 156
Query: 97 ----GSQQTEGISADAVRTLFNEG-----AENPVITKDGFQFLLLETPAQVWYFILKYLE 147
GS + A++V+ L + G IT+ GF FLL E AQVW +L ++E
Sbjct: 157 TVTSGSGKDGNGPANSVKALLDAGKLVTRGRGIGITQAGFSFLLQEANAQVWTLLLLWIE 216
Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
ES G+D V+ L+FLF L G YST+ +++ + L +L + GL+Y +F
Sbjct: 217 NAESMGMDSVDVLSFLFMLGSLELGRAYSTKTLTEAQKGMLANLIDLGLIYLPPSAPTQF 276
Query: 208 YPTKLALNMATRGT-LKQI------------REPGFLIVETNFRVYAYTDSNLKVALIGL 254
+PT+LA + + + L+ + + GF+I+ETN+R+YAYT+S L++A++ L
Sbjct: 277 FPTRLATTLTSDASALRTVAAGFDAASKSAASQKGFIIIETNYRLYAYTNSPLQIAVLSL 336
Query: 255 FCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP--------SVSCP-LP 305
F +L R+PN+ G ++RDS+R A+ GIT+ QII +L HA P S P LP
Sbjct: 337 FTKLNTRYPNMVSGRVSRDSIRTAIAHGITSDQIITYLSTHAHPQLVKASSASHGGPVLP 396
Query: 306 PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVT 365
P V DQIRLW+ E ER+ G L F +Q ++E YA+++GVL W+++ KR VT
Sbjct: 397 PTVVDQIRLWQLENERMKAVPGFLMKDFETQKEYEGCAKYAEEVGVLVWKSDAKRMFFVT 456
Query: 366 K 366
+
Sbjct: 457 R 457
>gi|328873270|gb|EGG21637.1| general transcription factor IIH [Dictyostelium fasciculatum]
Length = 453
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 145/413 (35%), Positives = 224/413 (54%), Gaps = 44/413 (10%)
Query: 6 AKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR-----RNAKVALLGGGK 60
AK Y++RLL V+ Q+V W +++ + HK+A L +L+ N K +
Sbjct: 36 AKQYLMRLLVVDSISMQSVY-QWTTQSSIQAHKDAIRKLLDLKILTRVPNPKGVVSQDQL 94
Query: 61 PWSMSAKLEVDSKARDLDF-----------------------LNQYALERWECILRFMV- 96
+ K + DLD L++Y+ +WE +L ++
Sbjct: 95 FLNQRFKDNIKDSLTDLDSVVLKSVKDNQSAQQHIKPMTVEELDKYSKGQWEKVLYYLSD 154
Query: 97 -GSQQTEGISADAVRT-LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
G E +S + + L + + IT +GF+F+L + Q+W I+ YL T+ES+G
Sbjct: 155 EGKAPPELVSDLLLSSNLTKKNGTSIAITSEGFKFMLKDIYTQIWTLIIVYLNTLESRGR 214
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
E L FLF+L F T G Y ++ + L L+EFGLVY R K+ FYPT+LA+
Sbjct: 215 PRKEALIFLFKLSFLTLGKSYYLADLNPNQRDMLFDLKEFGLVYVRSEKSDVFYPTRLAI 274
Query: 215 NMATRGTLKQI-----------REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFP 263
++AT T+ + ++ G+LI+ETNFR+YAYT S+L+++LI LF +++YR P
Sbjct: 275 SLATGRTVSLMNDLAQEISTSQKDQGYLILETNFRIYAYTQSSLQISLISLFVKMLYRLP 334
Query: 264 NLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS-VSCPLPPVVSDQIRLWEGERERL 322
NL VG+LTR+SVR+AL GITA QI+ F++ +A P+ V P VV +QIRLWE ER R+
Sbjct: 335 NLAVGILTRESVRSALLHGITADQIVDFVKQNAHPNMVISGYPEVVFEQIRLWESERNRI 394
Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
T + VL+ F + F K +A+D L W +E K+ +VV +G I+ +
Sbjct: 395 TYKKAVLFDSFPNAESFSKTVTFAKDQYFLVWFDETKKMLVVNDEGSEPIRNY 447
>gi|225555033|gb|EEH03326.1| RNA pol II transcription initiation subunit [Ajellomyces capsulatus
G186AR]
Length = 485
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 151/434 (34%), Positives = 215/434 (49%), Gaps = 68/434 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK +V+ LL+++ P+P A + W+ +E A L+ L
Sbjct: 38 LPYLAKCFVMALLYLKDPLPAADLELWVKTESKRERDNALSILSRLHILSNTTTSDNVRA 97
Query: 49 --------RNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
+ + AL GG K S + D + L+ YA +WE +L +MVG+
Sbjct: 98 YTVTDPFSSSLRQALTGGDKQQSFGVPSVTDDENPMTVAQLDHYARSQWEGVLGYMVGTS 157
Query: 100 ----QTEGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETV 149
Q + V+ L V ITKDGF F+L + QVW+ ++ Y+E
Sbjct: 158 ALGVQQAATLSKGVKQLLQACHLVEVRDRRVEITKDGFAFVLQDLNTQVWHILILYVENA 217
Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGRF 207
E G+D +E L+FLF L G Y + ++ L L +FG+VYQ ++A RF
Sbjct: 218 EQIGMDSIEVLSFLFVLSSLELGQSYEKKHLTSTQLKTLADLTDFGIVYQHSPHQEATRF 277
Query: 208 YPTKLALNMATRGTL-----------------KQIREP--GFLIVETNFRVYAYTDSNLK 248
YPT+LA + + EP GF+I+ETN+R+YAYT S L+
Sbjct: 278 YPTRLATTLTSDSITLGSSISSSLTAPNGVPSASSNEPSTGFIIIETNYRLYAYTSSPLQ 337
Query: 249 VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP---------- 298
++LI LF L YRFPNL G LTR SVR A+ GITA QII +L HA P
Sbjct: 338 ISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITADQIISYLTTHAHPQMRKYHATKA 397
Query: 299 ---SVSCP--LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
V P LPP V DQIRLW+ ER+R+ + G L+ FV+ A+FE YA+++GVL
Sbjct: 398 GANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFKDFVNLAEFEGPCRYAEEIGVLV 457
Query: 354 WQNEKKRTVVVTKQ 367
W++E+KR VT+
Sbjct: 458 WKSERKRMFFVTRH 471
>gi|240279262|gb|EER42767.1| TFIIH basal transcription factor complex p52 subunit [Ajellomyces
capsulatus H143]
Length = 485
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 155/435 (35%), Positives = 216/435 (49%), Gaps = 70/435 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK +V+ LL+++ P+P A + W+ +E A L L
Sbjct: 38 LPYLAKCFVMALLYLKDPLPAADLELWVKAESKRERDNALSILGRLHILSNTTTSDNVRA 97
Query: 49 --------RNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+ + AL GG K S L D + L+ YA +WE +L +MVG+
Sbjct: 98 YTVTDPFSSSLRQALTGGDKQQSFGVPSLTDDENPMTVAQLDHYARSQWEGVLGYMVGTS 157
Query: 99 ----QQTEGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
QQ +S V+ L V ITKDGF F+L + QVW+ ++ Y+E
Sbjct: 158 ALGVQQAVTLS-KGVKQLLQACHLVEVRDRRVEITKDGFAFVLQDLNTQVWHILILYVEN 216
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGR 206
E G+D +E L+FLF L G Y + ++ L L +FG+VYQ ++A R
Sbjct: 217 AEQIGMDSIEVLSFLFVLSSLELGQSYEKKHLTSTQLKTLADLTDFGIVYQHSPHQEATR 276
Query: 207 FYPTKLALNMATRGTLKQI-----------------REP--GFLIVETNFRVYAYTDSNL 247
FYPT+LA + + EP GF+I+ETN+R+YAYT S L
Sbjct: 277 FYPTRLATTLTSDSITLSSSISSSLTAPNLVPSASSNEPSTGFIIIETNYRLYAYTSSPL 336
Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP--------- 298
+++LI LF L YRFPNL G LTR SVR A+ GITA QII +L HA P
Sbjct: 337 QISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITADQIISYLTTHAHPQMRKYHATK 396
Query: 299 ----SVSCP--LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
V P LPP V DQIRLW+ ER+R+ + G L+ FVS A+FE YA+++GVL
Sbjct: 397 SGANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFKDFVSLAEFEGPCRYAEEIGVL 456
Query: 353 TWQNEKKRTVVVTKQ 367
W++E+KR VT+
Sbjct: 457 VWKSERKRMFFVTRH 471
>gi|325089533|gb|EGC42843.1| TFIIH basal transcription factor complex p52 subunit [Ajellomyces
capsulatus H88]
Length = 485
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 155/435 (35%), Positives = 216/435 (49%), Gaps = 70/435 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK +V+ LL+++ P+P A + W+ +E A L L
Sbjct: 38 LPYLAKCFVMALLYLKDPLPAADLELWVKAESKRERDNALSILGRLHILSNTTTSDNVRA 97
Query: 49 --------RNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+ + AL GG K S L D + L+ YA +WE +L +MVG+
Sbjct: 98 YTVTDPFSSSLRQALTGGDKQQSFGVPSLTDDENPMTVAQLDHYARSQWEGVLGYMVGTS 157
Query: 99 ----QQTEGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
QQ +S V+ L V ITKDGF F+L + QVW+ ++ Y+E
Sbjct: 158 ALGVQQAVTLS-KGVKQLLQACHLVEVRDRRVEITKDGFAFVLQDLNTQVWHILILYVEN 216
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGR 206
E G+D +E L+FLF L G Y + ++ L L +FG+VYQ ++A R
Sbjct: 217 AEQIGMDSIEVLSFLFVLSSLELGQSYEKKHLTSTQLKTLADLTDFGIVYQHSPHQEATR 276
Query: 207 FYPTKLALNMATRGTLKQI-----------------REP--GFLIVETNFRVYAYTDSNL 247
FYPT+LA + + EP GF+I+ETN+R+YAYT S L
Sbjct: 277 FYPTRLATTLTSDSITLSSSISSSLTAPNLVPSASSNEPSTGFIIIETNYRLYAYTSSPL 336
Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP--------- 298
+++LI LF L YRFPNL G LTR SVR A+ GITA QII +L HA P
Sbjct: 337 QISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITADQIISYLTTHAHPQMRKYHATK 396
Query: 299 ----SVSCP--LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
V P LPP V DQIRLW+ ER+R+ + G L+ FVS A+FE YA+++GVL
Sbjct: 397 SGANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFKDFVSLAEFEGPCRYAEEIGVL 456
Query: 353 TWQNEKKRTVVVTKQ 367
W++E+KR VT+
Sbjct: 457 VWKSERKRMFFVTRH 471
>gi|119486893|ref|XP_001262366.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Neosartorya fischeri NRRL 181]
gi|119410523|gb|EAW20469.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Neosartorya fischeri NRRL 181]
Length = 468
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 144/433 (33%), Positives = 225/433 (51%), Gaps = 55/433 (12%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTE----------LRRNA 51
L ++AK +V+ LL+++ P+P A + +W+ K H + T ++ +
Sbjct: 38 LPDLAKCFVMALLYLKDPLPAADLEAWVRPDSLKLHIMTSTTTSDNVKAYTVTDPFASSL 97
Query: 52 KVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ----QTEGISA 106
+ AL G + S ++ D +A + L++YA +WE +L +MVG+ Q + +
Sbjct: 98 RQALTGAEQTQSFGVLSQISDEEAVSIADLDEYARRQWEGVLGYMVGTSGLGMQRDVSLS 157
Query: 107 DAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECL 160
V+ L G + IT+DGF F+L + QVW+ ++ Y+E+ + G+D VE L
Sbjct: 158 KGVKELLQAGHLVEIRDRRVEITQDGFAFVLQDVGTQVWHILVLYVESAAAIGMDSVEVL 217
Query: 161 TFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT-- 218
+F+F L G Y + ++ L L +FG+VYQ A FYPT+LA + +
Sbjct: 218 SFVFFLSSLELGKSYEKKQLTSNQLRTLTDLADFGIVYQETPDATHFYPTRLATTLTSDS 277
Query: 219 --------------RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
GT GF+I+ETN+R+YAYT S L+++LI LF L YRFPN
Sbjct: 278 STLSNPLAGSLSGPTGTSSSKAGSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPN 337
Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------------SCP---LPPVV 308
L G +TR SVR A+ GITA QII +L HA P + P LPP V
Sbjct: 338 LITGKITRQSVRRAVEMGITADQIISYLSTHAHPQMRKHNVSRSTSNQAGMPVSVLPPTV 397
Query: 309 SDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
DQIRLW+ ER+R+ + G L+ F + A++E YA+++GVL W++++KR VT+
Sbjct: 398 VDQIRLWQLERDRVKATHGFLFRDFNTLAEYEAPCRYAEEIGVLVWKSDRKRMFFVTR-- 455
Query: 369 HSDIKKFWVNHQK 381
H + F + +K
Sbjct: 456 HHQVAAFLKSRRK 468
>gi|50310215|ref|XP_455127.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|62901296|sp|Q6CLR2.1|TFB2_KLULA RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|49644263|emb|CAG97834.1| KLLA0F01056p [Kluyveromyces lactis]
Length = 496
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 143/451 (31%), Positives = 221/451 (49%), Gaps = 77/451 (17%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
LS +AK +++ +LF + V + W+ + + + ++ L
Sbjct: 39 LSPMAKFFIMSMLFQDHDVSLRDLDKWVKPDAKYQLQYSIKSMKSLNLIIEGESKQPLLI 98
Query: 48 ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
+++ K L GG S + D+ L+QY+ E+WE IL +MVG+ T
Sbjct: 99 RLNPIFKKSFKNVLTGGEINNSFGDVADDDTNPVSTATLDQYSAEKWETILHYMVGTPNT 158
Query: 102 EGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
+ L + G IT GFQFLL + AQ+W +L+YL+ ES +D
Sbjct: 159 NTPGGKVLDLLQHSGLMEEAEYGELKITNQGFQFLLQDVNAQMWTLLLQYLKMAESLQMD 218
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA-- 213
V+ L F+F L G Y + +S+ + LQ +R++GL+YQ + +FYPT+LA
Sbjct: 219 PVDVLNFIFMLGALQLGKAYKCDQLSNTQRTMLQDMRDYGLIYQNQSDYAKFYPTRLATL 278
Query: 214 ------------------LNMATRGT----------LKQIREPGFLIVETNFRVYAYTDS 245
LN A T + + G LI+ETNF++Y+Y++S
Sbjct: 279 LTSDTKAFRSASVALDSVLNKANETTAVEGDSGQDETTERTQDGALIIETNFKLYSYSNS 338
Query: 246 NLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVS---- 301
L++A++ LF L RF N+ G LTR+SVR AL +GITA QII +L HA P +
Sbjct: 339 PLQIAILSLFVHLKSRFANMVTGQLTRESVRNALLNGITAEQIIAYLETHAHPRMRRLAE 398
Query: 302 -----------------CPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRD 344
LPP V DQIRLW+ E +R+ + +G LY+ F S +++ + D
Sbjct: 399 ENLSKKLELDPTVKETLQVLPPTVVDQIRLWQLELDRIISYDGYLYTDFESYQEYQTVAD 458
Query: 345 YAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
YA+D+GVL WQNEKK+ V+ +G+S + F
Sbjct: 459 YAKDIGVLLWQNEKKKMFFVSTEGNSQVLDF 489
>gi|68490740|ref|XP_710812.1| hypothetical protein CaO19.13268 [Candida albicans SC5314]
gi|68490763|ref|XP_710801.1| hypothetical protein CaO19.5846 [Candida albicans SC5314]
gi|46432046|gb|EAK91553.1| hypothetical protein CaO19.5846 [Candida albicans SC5314]
gi|46432058|gb|EAK91564.1| hypothetical protein CaO19.13268 [Candida albicans SC5314]
gi|238879997|gb|EEQ43635.1| TFIIH basal transcription factor complex subunit 2 [Candida
albicans WO-1]
Length = 494
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 151/450 (33%), Positives = 223/450 (49%), Gaps = 77/450 (17%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
LS +AK Y++ ++F E+P+ + W + K EA L L
Sbjct: 39 LSPMAKFYIMSMIFNEKPIALRDLDKWCKPSARKLEFEALKRLESLHLIEYDSKGSHVRL 98
Query: 48 ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
R+N + L G P + S VD D+ FL+ +A ++WE IL FMVG++ T
Sbjct: 99 HSIFRKNFRDCLTGSQNPNAFGSISTTVDKHKVDIPFLDLFASQKWETILHFMVGTEST- 157
Query: 103 GISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
+D+V +L EG N + IT GFQFLL + AQ+W +L+YL + +D
Sbjct: 158 ATPSDSVLSLLKLGGLMEGPNNDLRITNSGFQFLLQDVNAQIWTLLLQYLNLTQELNMDP 217
Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL-- 214
V+ L F+F L G YS +S+ L L+++GLVYQR +GRFYPT+LA
Sbjct: 218 VDVLNFIFVLGSLELGKSYSVLSLSETQVSMLADLKDYGLVYQRSDTSGRFYPTRLATTL 277
Query: 215 ----------NMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
+MA +Q+ +I+ETNF++YAYT S L++A++ LF RF N
Sbjct: 278 TSDSAALKTPSMAMDEEEQQVATKESIIIETNFKIYAYTKSPLEIAILNLFVHFKTRFAN 337
Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------------------ 300
+ G +TR+S+R AL +GITA QII FL HA P +
Sbjct: 338 MVCGQITRESIRNALYNGITADQIIKFLETHAHPQMRILAKEKLDKKIEFDTSHNINTAG 397
Query: 301 SCP---------------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDY 345
P LPP V DQI+LW+ E +R+ T +G L+ F +Q +++ L +Y
Sbjct: 398 GAPQSKTDGMVSQHKLEILPPNVVDQIKLWQLELDRIQTFDGYLFKDFSNQQEYDILSNY 457
Query: 346 AQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
A +LGVL W ++ K+ VTK G + + F
Sbjct: 458 ASELGVLIWADKVKKKFFVTKDGMTQVADF 487
>gi|353240543|emb|CCA72408.1| probable TFB2-TFIIH subunit (transcription/repair factor)
[Piriformospora indica DSM 11827]
Length = 462
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 149/419 (35%), Positives = 220/419 (52%), Gaps = 50/419 (11%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR--RNAKVALL--- 56
LS + + V+ LL++E PV I +W++ ++ A +L +L+ N+K +L
Sbjct: 42 LSPLERQLVMSLLWLEGPVETQSITNWITLEGRSAYEHALKSLAKLQILPNSKEQILLQA 101
Query: 57 -----------GGGKPWSMSAKLEVDSKARDL--DFLNQYALERWECILRFMV--GSQQT 101
G G+ S A +E D+ L + L+ YA+ERW+ IL FMV G++Q
Sbjct: 102 SFKSGLRNGLTGSGQVASFGALVEPDNDMGTLPTEMLDNYAVERWDTILHFMVTSGTEQA 161
Query: 102 EGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETV-ESKG 153
++ V L NE +IT GFQFLL AQ+W FIL YL + E +
Sbjct: 162 SARPSEGVLYLLEHSGLMSNEHGRR-IITSAGFQFLLQSPHAQLWEFILSYLRMMAERED 220
Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
+D+V+ L F F L + G YST +S + + LR+FGLVY F PT+ A
Sbjct: 221 MDMVDILGFFFMLSMTQPGQHYSTHTLSPTQLIMVSDLRDFGLVYFPSDTTTSFQPTRFA 280
Query: 214 --------------LNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
L + L Q F+++ETN+ VYAYT++ L++A++ LF
Sbjct: 281 TTLTSYTSSFSDHDLTIENGADLSQ----EFVVLETNYHVYAYTNNPLQIAVLNLFVSFK 336
Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWE 316
RFPN+ +G LTRDSV+ AL +GITA QI+ +L HA P + + P+ PV V DQIRLWE
Sbjct: 337 ARFPNMIMGSLTRDSVKKALVNGITADQILSYLVTHAHPQMRKNNPIIPVTVQDQIRLWE 396
Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
ER R+ +G LY +F S D+E + YA++LGV+ W+N +R G I+ +
Sbjct: 397 LERHRVKGQDGYLYKEFASMNDYEVVVQYARELGVVLWENASRRMFFADAAGRVHIRSY 455
>gi|295662078|ref|XP_002791593.1| RNA polymerase II transcription factor B subunit 2
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226279719|gb|EEH35285.1| RNA polymerase II transcription factor B subunit 2
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 484
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 156/434 (35%), Positives = 223/434 (51%), Gaps = 69/434 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR--RNA-------- 51
L +AK++V+ LL+++ P+P A + W+ +E A L L NA
Sbjct: 38 LPYLAKYFVMALLYLKDPLPAADLELWVKPESKRERDNALSILGRLHILSNATTTDHVRA 97
Query: 52 -----------KVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+ AL GG K S + D+ + L++YA +WE +L +MVG+
Sbjct: 98 YVVTEPFASSLRQALTGGDKQQSFGVPSVTPDANLMTVAQLDEYASRQWEGVLGYMVGTS 157
Query: 99 ----QQTEGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
QQT +S V+ L V ITK+GF F+L + QVW ++ Y+E
Sbjct: 158 ALGAQQTVTLS-KGVKQLLQACHLVEVHDRRVDITKEGFAFVLQDVNTQVWNILILYVEN 216
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQR--KRKAGR 206
E GL+ VE L+FLF L G Y + ++ L L +FG+VYQ ++A R
Sbjct: 217 AEQIGLESVEVLSFLFVLSSLELGQSYEKKHLTSTQLKTLTDLTDFGIVYQHFPPQEATR 276
Query: 207 FYPTKLALNM----ATRG-------------TLKQIREP--GFLIVETNFRVYAYTDSNL 247
FYPT+L + AT G T EP GF+++ETN+R+YAYT S L
Sbjct: 277 FYPTRLVTTLTSDSATLGSSISTSLTGPSGPTAANSSEPSTGFIVIETNYRLYAYTSSPL 336
Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP--------- 298
+++LI LF L YRFPNL G +TR S+R A+ GITA QII +L HA P
Sbjct: 337 QISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITADQIISYLTTHAHPQMRKYHASK 396
Query: 299 ---SVSCP--LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
+V P LPP V DQIRLW+ ER+R+ + G L+ FVS A+F+ YA+++GVL
Sbjct: 397 SSHTVGAPAVLPPTVVDQIRLWQLERDRIKATPGFLFKDFVSLAEFDGPCRYAEEIGVLV 456
Query: 354 WQNEKKRTVVVTKQ 367
W++EK+R VT+
Sbjct: 457 WKSEKRRMFFVTRH 470
>gi|170589001|ref|XP_001899262.1| Transcription factor Tfb2 family protein [Brugia malayi]
gi|158593475|gb|EDP32070.1| Transcription factor Tfb2 family protein [Brugia malayi]
Length = 465
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 146/384 (38%), Positives = 215/384 (55%), Gaps = 54/384 (14%)
Query: 44 LTELRRNA--KVALLGGGKPWSMSAKLEVDSKAR---DLDFLNQYALERWECILRFM-VG 97
L ++ R A K +G K +S + D K R + D L + A+ERWECIL ++ +
Sbjct: 76 LNQVYRRAYLKATRIGLCKASELSVVTDCDEKTRKSANKD-LGKKAVERWECILHYLALP 134
Query: 98 SQQTE-GISADAVRTLFNEGAENP-------VITKDGFQFLLLETPAQVWYFILKYLETV 149
SQ++E G+S R + G + IT GFQFLLL P Q+W ++L Y
Sbjct: 135 SQKSEQGVSGTTKRLFRSAGLTSSGNDDGDIEITSAGFQFLLLNRPEQIWTYLLHYFHMQ 194
Query: 150 ESKGLDLVECLTFLFQLKF---STFGTDYSTEGMSDK------------------LQVFL 188
E G+D+V+ L FLF+L ST G G SDK ++ FL
Sbjct: 195 EVAGIDIVKELDFLFKLTLYSGSTCGGTRVVNG-SDKERPTGSRAFIIDENWPETIKDFL 253
Query: 189 QHLREFGLVYQRKRKAGRFYPTKL-----ALNMATRGTLKQIREPGFLIVETNFRVYAYT 243
HLRE GLV+ RKRK G F+ T L ++ + ++ + GF+IVETN+RVYAYT
Sbjct: 254 MHLRELGLVFIRKRKDGFFFLTPLLNHLTGISNTSEAGMENRNQNGFVIVETNYRVYAYT 313
Query: 244 DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCP 303
DSNL++A++ F E++YRF +++VGVL+R++VR A + GITAAQII FL +A P ++C
Sbjct: 314 DSNLQLAILSTFTEMLYRFSDMSVGVLSREAVRRAFQVGITAAQIIAFLSRNAHP-ITCA 372
Query: 304 -----------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
+P V DQI+LWE ER RLT +YS F S+ ++ L+ Y +L
Sbjct: 373 VTTEYGRMIQSVPITVVDQIQLWEDERRRLTFCSAAVYSSFESEKEYFGLKGYVISQNIL 432
Query: 353 TWQNEKKRTVVVTKQGHSDIKKFW 376
W ++ +R +V+T++GH ++K +W
Sbjct: 433 LWCHDVQRLMVITEEGHENVKVWW 456
>gi|449433857|ref|XP_004134713.1| PREDICTED: general transcription factor IIH subunit 4-like [Cucumis
sativus]
Length = 451
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 136/406 (33%), Positives = 219/406 (53%), Gaps = 33/406 (8%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK +V+++L+++ PV + W+ ++K A D L +LR
Sbjct: 40 LPPLAKKFVLQMLYIDGPVSAKSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETT 99
Query: 49 --------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
N + L+ G + + L+ L YAL++WEC L ++ S Q
Sbjct: 100 YRLNPTFQANLQKLLIHGEVLAREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQ 159
Query: 101 TEGIS--ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
E S + +V +F +G E P +T+ GFQFLL+ET AQ+WY I +Y+ E +
Sbjct: 160 AEKPSNISSSVMKVFQKGLLSQRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEER 219
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTK 211
G+D + ++FL +L F G Y + +SD+ + ++ L + GLV Q+ RK F PTK
Sbjct: 220 GVDPADLISFLLELSFHVTGEAYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWFIPTK 279
Query: 212 LALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLT 271
LA N++ R+ GF++VETNFR+YAY+ S L ++ LF + Y+ PNL VG +T
Sbjct: 280 LATNLSMSLADSSSRKLGFVVVETNFRMYAYSTSKLHCEILRLFSRIEYQLPNLIVGAIT 339
Query: 272 RDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGV 328
++S+ A ++GITA QI+ FL+ +A P V+ +P V V+DQIRLWE + R+ +
Sbjct: 340 KESLYNAFKNGITAEQIVTFLQQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAH 399
Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKK 374
Y +F S+ FE DYA++ L W++ K +VV H+ +++
Sbjct: 400 FYDEFPSREVFEAACDYAREWNGLLWEDSKNLRLVVKADIHTHMRE 445
>gi|341886092|gb|EGT42027.1| hypothetical protein CAEBREN_00374 [Caenorhabditis brenneri]
Length = 460
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 139/357 (38%), Positives = 206/357 (57%), Gaps = 36/357 (10%)
Query: 54 ALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV--GSQQTEGISADAVRT 111
A+LG S+ LE + + R + + A+ERW+CILR++ + T+ +S + R
Sbjct: 105 AMLGAASISSLV--LETNEEKRRGKEVEKKAVERWDCILRYLALPSEENTQAVS-ETTRN 161
Query: 112 LF-----NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQL 166
LF G IT GFQFLLL Q+W ++++YL+ S+G D+VE + L Q+
Sbjct: 162 LFKMANFTSGDSRIEITTSGFQFLLLSPVKQMWTYVIEYLKLEMSQGHDIVEVIEPLIQI 221
Query: 167 KF----------STFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM 216
+ D S G +L L HLRE G+++ RKRK G F+ T+L ++
Sbjct: 222 VLLANRGYKAEKECYQIDPSWTGPQQEL---LNHLRELGVIFIRKRKDGVFFLTQLLTHL 278
Query: 217 ATRGTLKQIR----EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
AT T+ G +IVETNFRVYAYT S L++A+I LF E+ YRF +++VG++TR
Sbjct: 279 ATNETIDDTSAEKVSNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITR 338
Query: 273 DSVRAALRSGITAAQIIGFLRLHALPS-------VSCPLPPVVSDQIRLWEGERERLTTS 325
+SVR AL+ GITAAQII FLR +A P V+C LP V+DQIRLWE ER R+
Sbjct: 339 ESVRGALQHGITAAQIISFLRANAHPQCIATSGPVNC-LPITVADQIRLWEDERRRMDLK 397
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
+ +YS F S +F + DYA++ +L W + +++ V+V ++GH +++ W QK
Sbjct: 398 DSYIYSNFESDEEFNGVCDYAKERKILLWADYQRKLVIVNEEGHEYVRQ-WYKQQKS 453
>gi|402592023|gb|EJW85952.1| transcription factor Tfb2 family protein [Wuchereria bancrofti]
Length = 470
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 136/358 (37%), Positives = 207/358 (57%), Gaps = 41/358 (11%)
Query: 56 LGGGKPWSMSAKLEVDSKAR---DLDFLNQYALERWECILRFM-VGSQQTE-GISADAVR 110
+G GK +S + D K R + D L + A+ERWECIL ++ + SQ++E G+S R
Sbjct: 108 IGLGKASELSVVKDCDEKTRKSANKD-LGKKAVERWECILHYLALPSQKSEQGVSGTTKR 166
Query: 111 TLFNEGAENP-------VITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFL 163
+ G + IT GFQFLLL P Q+W ++L Y E G+D+++ L FL
Sbjct: 167 LFRSAGLTSSGNDDGDIEITSAGFQFLLLNRPEQIWTYLLHYFHMQEVAGIDIIKELDFL 226
Query: 164 FQLKFST----------FGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL- 212
F++ + F D E + ++ FL HLRE GLV+ RKRK G F+ T L
Sbjct: 227 FKVVNDSDKERPAGSRAFVID---ENWPETIKDFLIHLRELGLVFIRKRKDGVFFLTPLL 283
Query: 213 ----ALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
++ + +++ + GF+IVETN+RVYAYTDSNL++A++ F E++YRF +++VG
Sbjct: 284 NHLTGISNTSEASMENRNQHGFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFNDMSVG 343
Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC----------PLPPVVSDQIRLWEGE 318
VL+R++VR A + GITAAQII FL +A P +P V+DQI+LWE E
Sbjct: 344 VLSREAVRRAFQVGITAAQIIAFLSTNAHPITYAVTTEYGRMIQSVPITVADQIQLWEDE 403
Query: 319 RERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
R RLT +YS F S+ ++ L+ Y +L W ++ +R +V+T++GH ++K +W
Sbjct: 404 RRRLTFCNATVYSSFESEKEYFGLKGYVISQNILLWCHDVQRLMVITEEGHENVKAWW 461
>gi|403163300|ref|XP_003323393.2| hypothetical protein PGTG_04930 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163998|gb|EFP78974.2| hypothetical protein PGTG_04930 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 887
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 145/429 (33%), Positives = 219/429 (51%), Gaps = 56/429 (13%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHK-EATDTLTELRR----------- 49
L IA+H V+ LL+ + + ++ + W H++ + D+LT+L R
Sbjct: 76 LPLIARHLVLNLLWSDHSIDKSQLELWADINHSESRRIHVEDSLTKLFRLNIVEQSSLDS 135
Query: 50 ------------------------NAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYA 84
N + AL GGG S S A +D L++Y
Sbjct: 136 SSSSFDDPRNHHHLTLTLNRSFQQNFRRALTGGGNHCSFGVPCNTSSNAVVSIDELDRYG 195
Query: 85 LERWECILRFMVGSQQTEGISADAVRTLFNEG---AENP------VITKDGFQFLLLETP 135
E+WE IL +MVGS+ S + + L G + +P IT GF FLL +
Sbjct: 196 TEKWETILHYMVGSRLPTKPSHNILSLLGQSGLMTSSDPRSLQSLKITSKGFGFLLEDVN 255
Query: 136 AQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFG 195
Q+W +L+YL+ E GLD+V+ L LF L G +YS + LQ L ++G
Sbjct: 256 TQLWDILLQYLKMTEVNGLDVVDVLACLFMLGSLELGQEYSFSNWTPTQTQVLQDLVDYG 315
Query: 196 LVYQRKRKAGRFYPTKLALNMATRGTL-----KQIREPGFLIVETNFRVYAYTDSNLKVA 250
LV RFYPT+LA + + + E GFL++ETN+R+YAYT + L++A
Sbjct: 316 LVL--VSAPDRFYPTRLATTLTSTAPPLVSAERAQEEHGFLVLETNYRIYAYTSNPLQIA 373
Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPV 307
++ LF L YRFPNL VG +TR+S+++AL +GITA Q+I +L HA P + LPP
Sbjct: 374 VLNLFLSLRYRFPNLVVGAVTRESIKSALSNGITADQVIMYLHTHAHPQMRKLEPLLPPT 433
Query: 308 VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQ 367
V DQIRLWE E+ R+ EG LY F S A+++ + Y++ LG++ W++ R + V
Sbjct: 434 VVDQIRLWELEKNRIRAQEGYLYEDFKSAAEYDSVIQYSRKLGIVLWEHAGLRKLFVGYD 493
Query: 368 GHSDIKKFW 376
GH +++F+
Sbjct: 494 GHLTLREFF 502
>gi|150863860|ref|XP_001382481.2| RNA polymerase II transcription factor B subunit 2 (RNA polymerase
II transcription factor B p52 subunit) (RNA polymerase
II transcription factor B 52 kDa subunit)
[Scheffersomyces stipitis CBS 6054]
gi|149385116|gb|ABN64452.2| RNA polymerase II transcription factor B subunit 2 (RNA polymerase
II transcription factor B p52 subunit) (RNA polymerase
II transcription factor B 52 kDa subunit)
[Scheffersomyces stipitis CBS 6054]
Length = 515
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 152/468 (32%), Positives = 227/468 (48%), Gaps = 96/468 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHK-EATDTLTEL------------- 47
L+ +AK Y++ +LF ++PV + W K HAK+ + ++ LT L
Sbjct: 40 LTPMAKFYIMTMLFNDRPVALRDLDRW-CKPHAKKLQFDSLKRLTSLHLIEEDRGGNHIR 98
Query: 48 -----RRNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGSQQT 101
R+N + L G + + K R D+ FL+ +A +WE IL FMVG++ T
Sbjct: 99 LNPTFRQNFRNCLTGSQQSNAFGNLCTTTDKHRVDISFLDNFASSKWETILHFMVGTEAT 158
Query: 102 EGISADAVRTLFNEGA---------ENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
+V +L G +N IT GFQFLL + AQ+W +L+YL +
Sbjct: 159 -ATPTTSVLSLLKSGGLMEGPGNSPQNLKITNTGFQFLLQDVNAQIWTLLLQYLNLTQEL 217
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
+D V+ L F+F L G Y +S L LR++GLVYQR +GRFYPT+L
Sbjct: 218 HMDPVDVLNFIFVLGSLELGKSYLVSSLSATQVSMLADLRDYGLVYQRTDSSGRFYPTRL 277
Query: 213 --------------------ALNMATRGTLKQIREP----GFLIVETNFRVYAYTDSNLK 248
AL + G +QI P G +I+ETNF++YAYT+S L+
Sbjct: 278 ATTLTSDSAALKTPSMAIEQALGASDGGEEQQIDTPTGNQGTVIIETNFKLYAYTNSPLE 337
Query: 249 VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------- 300
+A++ LF L RF N+ G +TR+S+R+AL +GITA QII FL HA P +
Sbjct: 338 IAILNLFVHLKSRFTNMVCGQITRESIRSALYNGITADQIIKFLETHAHPQMKLLAKEKL 397
Query: 301 ----------------SCP-----------------LPPVVSDQIRLWEGERERLTTSEG 327
P LPP V DQI+LW+ E +R+ T +G
Sbjct: 398 DKKIEFDTSHNINTAGGAPQSKVSGEGNVAQHKLEILPPTVVDQIKLWQLELDRIQTFDG 457
Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
L+ F +Q +F+ L +YA ++GVL W ++ K+ VT +G + + +F
Sbjct: 458 YLFKDFSNQQEFDTLSNYASEVGVLVWSDKIKKKFFVTAEGMTQVAEF 505
>gi|169610043|ref|XP_001798440.1| hypothetical protein SNOG_08115 [Phaeosphaeria nodorum SN15]
gi|160701967|gb|EAT84391.2| hypothetical protein SNOG_08115 [Phaeosphaeria nodorum SN15]
Length = 426
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 141/409 (34%), Positives = 214/409 (52%), Gaps = 56/409 (13%)
Query: 11 IRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL---------------------RR 49
+ +L++ P A + +W + E +A D L L +R
Sbjct: 1 MAMLYMPSSFPAADLEAWFKPSARSERVQALDILERLHILGSKREDDKSLSYSLIPGFQR 60
Query: 50 NAKVALLGGGKPWSM---SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
+ + A+ G G + ++K E K ++FL+ +A E+WE IL FMV G
Sbjct: 61 SLRHAIEGSGTHRTFGVPASKAERGEKRLSIEFLDAHAREQWEGILFFMVSG--AAGFQP 118
Query: 107 DAVRTLFNEGAE--------------NPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
+V+ G + P ITK+GF F+L ET AQVW ++ YL+ V+
Sbjct: 119 GSVKMDVGPGTKKLLHAGDLVRTVHGTPRITKEGFSFVLQETNAQVWNLLIVYLKMVDDL 178
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
G+ E L+FLF L G DYST +S L L G+VY+ + + FYPT+L
Sbjct: 179 GMSETEVLSFLFMLGSLELGQDYSTSTLSPSQLHMLDDLSAMGIVYRSSKDSPTFYPTRL 238
Query: 213 ALNMAT-RGTL------------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
A + + G L + + GF+IVETN+R+YAYT+S +++A++ LF +L
Sbjct: 239 ATTLTSDSGALPGSDVGTTEKPDSKAQNKGFIIVETNYRLYAYTNSLIQIAILSLFTKLQ 298
Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWE 316
+RFPNL G LT++SV A+++GIT+AQII +L +A P + + PLPP V DQIRLWE
Sbjct: 299 HRFPNLVSGKLTKESVHRAVQAGITSAQIISYLTTYAHPQMQKSNPPLPPTVMDQIRLWE 358
Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVT 365
E ER+ + G L +F S++++ + YAQ LGVL W+N+ R V+
Sbjct: 359 YEGERVEVTTGYLMREFGSESEYRDVMGYAQALGVLVWKNDANRCFFVS 407
>gi|356548733|ref|XP_003542754.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine
max]
Length = 451
Score = 230 bits (587), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 136/404 (33%), Positives = 216/404 (53%), Gaps = 33/404 (8%)
Query: 5 IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
+AK YVI++L ++ V ++ W+ +H+ A D L +LR
Sbjct: 43 LAKKYVIQMLHIDVSVAAKLLEEWVLPGGVSKHRVAIDRLVQLRVFLEAVDRKNEKTYKV 102
Query: 49 -----RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE- 102
R+ + L+ GG S + + L+ L YALE+WEC L ++ Q E
Sbjct: 103 NPTFQRSLQKLLVQGGTLPRESMPSNITVRLPTLENLEAYALEQWECFLLQLISPSQVEK 162
Query: 103 --GISADAV-----RTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
IS+ + R L E P +T+ GFQFLL++T AQ+WY I +Y+ E +G+D
Sbjct: 163 PLNISSSLMKVFQRRILSYRDKEAPKLTESGFQFLLMDTNAQLWYIIREYISNSEDRGVD 222
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTKLAL 214
+ ++F+ +L F G YS + ++ + + L + GLV Q+ RK F PTKLA
Sbjct: 223 AADLISFMLELSFHDIGEAYSVDTLTSFQRTIINDLADLGLVKIQQGRKGSWFIPTKLAT 282
Query: 215 NMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDS 274
N++ R+ GF++VETNFRVYAY+ S L ++ LF + Y+ PNL VG +T++S
Sbjct: 283 NLSMSLADSSSRKQGFVVVETNFRVYAYSTSKLHCEILRLFSRVEYQLPNLIVGAITKES 342
Query: 275 VRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGVLYS 331
+ +A +GITA QII FL+ +A P V+ +P V V++QIRLWE + R+ ++ Y
Sbjct: 343 LYSAFENGITAEQIITFLQQNAHPRVAERIPSVPENVTEQIRLWEADLNRVEMTDAYYYD 402
Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+F S+ FE D A++ L W++ KK +VV + H ++ +
Sbjct: 403 EFPSRDVFEGACDCAREWNGLLWEDSKKMHMVVKTEVHQYVRDY 446
>gi|367009216|ref|XP_003679109.1| hypothetical protein TDEL_0A05660 [Torulaspora delbrueckii]
gi|359746766|emb|CCE89898.1| hypothetical protein TDEL_0A05660 [Torulaspora delbrueckii]
Length = 513
Score = 230 bits (586), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 141/459 (30%), Positives = 232/459 (50%), Gaps = 88/459 (19%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L ++AK +++ ++F E + + W+ + + ++A ++ L
Sbjct: 40 LPQMAKFFIMSMVFNENEISLRDLDRWVKASGKNQFQDAIKSMKSLHLLIPVRTNGPMMI 99
Query: 48 ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
R + + AL GG S ++ + + + L+ Y+ ++WE IL FMVG+
Sbjct: 100 NLNQTFRESFRNALTGGEVNNSFGIVVDEPNDVVNTEMLDAYSADKWETILHFMVGTSLA 159
Query: 102 EGISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
S + + L + V IT +GFQFLL E +Q+W +L+YL+ E+ +
Sbjct: 160 STPSGNVLNLLKHSKLMEEVSTTGEFKITNEGFQFLLQEVNSQIWALLLQYLKMTETLQM 219
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
D VE L F+F L FG YS +G+SD ++ + +R++GLV+Q+ + FYPT+LA
Sbjct: 220 DPVEVLNFIFMLGALEFGKPYSMDGLSDTQKLMSKDMRDYGLVFQKNSNSKVFYPTRLAT 279
Query: 215 NMAT-----RGT-------LKQIRE----------------------------PGFLIVE 234
+ + RG LKQ ++ G LIVE
Sbjct: 280 MLTSDAKSIRGASGAMDSVLKQNKDEASNKAASSNADADSDDDEVGINGQPIQDGALIVE 339
Query: 235 TNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRL 294
TNF++Y+Y++S L++A++ LF L RF N+ G +TR+S+R AL +GITA QII +L+
Sbjct: 340 TNFKLYSYSNSPLQIAILSLFVHLKSRFTNMVTGQITRESIRRALINGITADQIIAYLQT 399
Query: 295 HALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
HA P + +C LPP V DQI+LW+ E +R+ + +G LYS
Sbjct: 400 HAHPQMRRLAEEKLEKKLELDANCKDTLQILPPTVVDQIKLWQLELDRIISYDGSLYSDI 459
Query: 334 VSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
S ++ L YAQD+GVL W++++KR V+K+G+S +
Sbjct: 460 DSHQEYILLSTYAQDIGVLLWKDDRKRIFFVSKEGNSQV 498
>gi|294660113|ref|XP_002777731.1| DEHA2G23254p [Debaryomyces hansenii CBS767]
gi|218511847|sp|Q6BGW8.2|TFB2_DEBHA RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|199434471|emb|CAR66042.1| DEHA2G23254p [Debaryomyces hansenii CBS767]
Length = 515
Score = 230 bits (586), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 152/475 (32%), Positives = 220/475 (46%), Gaps = 95/475 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L IAK Y++ +LF E PV + W T K ++ L L
Sbjct: 40 LPPIAKFYIMTMLFYETPVALKDLDKWCKPTSRKFQFDSLKRLRALHLIEEDSSGSHIRL 99
Query: 48 ----RRNAKVALLGG-GKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
R+N + L G + + D ++ + FL+ +AL++WE IL FMVG++ +
Sbjct: 100 HPTFRQNFRDCLTGSQARNAFGNLSTTPDEQSVSIKFLDSFALQKWESILHFMVGTELSV 159
Query: 103 GISADAVRTL--------FNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
S + L N+G IT GFQFLL + AQ+W +L+YL + +
Sbjct: 160 TPSNSVLSLLKSGGLMEGTNKGGNRLNITNSGFQFLLQDINAQIWTLLLQYLNLTQDLNM 219
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
D V+ L F+F L G Y+ +S+ L LR++GL+YQR + RFYPT+LA
Sbjct: 220 DPVDVLNFIFILGSLELGKGYTVSSLSETQISMLADLRDYGLIYQRSENSERFYPTRLAT 279
Query: 215 ------------NMATRGTLKQIREP------------GFLIVETNFRVYAYTDSNLKVA 250
+MA L+ E G +I+ETNF++YAYT+S L++A
Sbjct: 280 TLTSDSAALKSPSMAMEQALESTTETEEQQNLASNSNQGTIILETNFKIYAYTNSPLEIA 339
Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---------- 300
++ LF L RF N+ G +TR+S+R AL +GITA QII FL HA +
Sbjct: 340 ILNLFVHLKTRFSNMVCGQITRESIRNALYNGITADQIIKFLETHAHSQMKILAKERLDK 399
Query: 301 --------------SCP--------------------LPPVVSDQIRLWEGERERLTTSE 326
P LPP V DQI+LW+ E +R+ T E
Sbjct: 400 KIEFDTSNNINTAGGAPQSQMMTNENGTTVAQHKLEVLPPTVVDQIKLWQLELDRIQTFE 459
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
G L+ F SQ +FE L +YA ++GVL W N+ K VTK G S + F +Q+
Sbjct: 460 GYLFKDFASQQEFEALSNYASEIGVLLWSNKSKMKFFVTKDGISQVADFASRNQR 514
>gi|225682227|gb|EEH20511.1| RNA polymerase II transcription factor B subunit 2
[Paracoccidioides brasiliensis Pb03]
gi|226289599|gb|EEH45083.1| TFIIH basal transcription factor complex p52 subunit
[Paracoccidioides brasiliensis Pb18]
Length = 484
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 155/434 (35%), Positives = 222/434 (51%), Gaps = 69/434 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR--RNA-------- 51
L +AK +V+ LL+++ P+P A + W+ +E A L L NA
Sbjct: 38 LPYLAKCFVMALLYLKDPLPAADLELWVKSESKRERDNALSILGRLHILSNATTTDHVRA 97
Query: 52 -----------KVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+ AL GG K S + D+ + L++YA +WE +L +MVG+
Sbjct: 98 YVVTEPFASSLRQALTGGDKQQSFGVPSVTPDANLMTVAQLDEYASRQWEGVLGYMVGTS 157
Query: 99 ----QQTEGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
QQT +S V+ L V ITK+GF F+L + QVW ++ Y+E
Sbjct: 158 VLGAQQTVTLS-KGVKQLLQACHLVEVHDRRVDITKEGFAFVLQDVNTQVWNILILYVEN 216
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQR--KRKAGR 206
E GL+ VE L+FLF L G Y + ++ L L +FG+VYQ ++A R
Sbjct: 217 AEQIGLESVEVLSFLFVLSSLELGQSYEKKHLTSTQLKTLTDLTDFGIVYQHFPPQEATR 276
Query: 207 FYPTKLALNM----ATRGT-------------LKQIREP--GFLIVETNFRVYAYTDSNL 247
FYPT+L + AT G+ EP GF+++ETN+R+YAYT S L
Sbjct: 277 FYPTRLVTTLTSDSATLGSSISTSLTGPSGPAAANSSEPSTGFIVIETNYRLYAYTSSPL 336
Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP--------- 298
+++LI LF L YRFPNL G +TR S+R A+ GITA QII +L HA P
Sbjct: 337 QISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITADQIISYLTTHAHPQMRKYHASK 396
Query: 299 ---SVSCP--LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
+V P LPP V DQIRLW+ ER+R+ + G L+ FVS A+F+ YA+++GVL
Sbjct: 397 SSHTVGAPAVLPPTVVDQIRLWQLERDRIKATPGFLFKDFVSLAEFDGPCRYAEEIGVLV 456
Query: 354 WQNEKKRTVVVTKQ 367
W++EK+R VT+
Sbjct: 457 WKSEKRRMFFVTRH 470
>gi|366997767|ref|XP_003683620.1| hypothetical protein TPHA_0A01010 [Tetrapisispora phaffii CBS 4417]
gi|357521915|emb|CCE61186.1| hypothetical protein TPHA_0A01010 [Tetrapisispora phaffii CBS 4417]
Length = 504
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 140/458 (30%), Positives = 230/458 (50%), Gaps = 85/458 (18%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK +++ ++F + + + W++ + ++A ++ L
Sbjct: 39 LPHMAKFFIMSMVFNDNEISIRDLDRWVNSNGKLQFQDAIKSMKSLHILIPGKEQGSMFV 98
Query: 48 ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
R + + AL GG S ++ D L +YA ++WE IL FMVG+
Sbjct: 99 NLNSTFRTSFRNALTGGEVNNSFGNIIDKIDDLVDSSVLLEYAKDKWETILHFMVGTPLI 158
Query: 102 EGISADAVRTLFN--------EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
+ + +D V L E + IT DGFQFLL + +Q+W +L+YL+ E+
Sbjct: 159 K-MPSDNVLNLLKHSKLMEELESSGEFKITNDGFQFLLQDVNSQIWTLLLQYLKIAEALH 217
Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
++ V+ L F+F L FG YS EG+SD + L+ +R++GL++Q+ A FYPT LA
Sbjct: 218 MNPVDILNFIFMLGSLEFGKAYSVEGLSDTQKKMLKDMRDYGLIFQKTSNATVFYPTTLA 277
Query: 214 LNMATRGT------------LKQIREP-----------------------GFLIVETNFR 238
+ + + L Q +E G L+VETNF+
Sbjct: 278 TQLTSDSSTVRSASGAIDSLLSQTKEANNNDSQNLGTDTDDLVGNQSVEVGALVVETNFK 337
Query: 239 VYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP 298
+Y+Y++S L++A++ LF L RF N+ G +TR+S+R AL +G+TA QII +L+ HA P
Sbjct: 338 LYSYSNSPLQIAILSLFVHLRTRFSNMVTGQITRESIRNALANGVTADQIIAYLQSHAHP 397
Query: 299 SVS-------------CP--------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQA 337
+ P LPP V DQI+LW+ E +R+ + +G LYS F +
Sbjct: 398 QMRRLAETTLKKKLELDPNTKDQLQILPPTVVDQIKLWQLELDRILSYDGSLYSDFDNNQ 457
Query: 338 DFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
++ L +YA+D+GVL W+N++KR V+K+G+S + F
Sbjct: 458 EYNMLYNYAKDIGVLLWKNDRKRKFFVSKEGNSQVLDF 495
>gi|448100135|ref|XP_004199281.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
gi|359380703|emb|CCE82944.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
Length = 509
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 147/466 (31%), Positives = 218/466 (46%), Gaps = 92/466 (19%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK Y++ +LF E P+ ++ W + T K EA L L
Sbjct: 40 LPPLAKFYIMSMLFYEHPIALKDMSKWCNPTAKKIQFEALKRLRSLHLIEEHNKGSHLKL 99
Query: 48 ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
R+N + L G + + D D+ FL+ +A ++WE IL FMVG++ +
Sbjct: 100 HQTFRKNFRDCLTGSQTSNAFGNLCTSPDKNHIDVKFLDMFASQKWESILHFMVGTELSV 159
Query: 103 GISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
S + L + G A N IT GFQFLL + Q+W +L+YL + +
Sbjct: 160 APSKSVLSLLKSGGLMEGPGTSASNLKITNKGFQFLLQDVNVQIWTLLLQYLSLTQDLNM 219
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
D V+ L F+F L G Y+ +S+ L L+++GL+YQ RFYPT+LA
Sbjct: 220 DPVDVLNFIFVLGCLELGKSYTVSSLSETQISMLADLKDYGLIYQNHNDKERFYPTRLAT 279
Query: 215 NMATRGT--------LKQIREP---------------GFLIVETNFRVYAYTDSNLKVAL 251
+ + L Q+ + G +I+ETNF++YAYT+S L++A+
Sbjct: 280 TLTSEAAALRTPSMALNQVVDTANSEATEEQMFSNTGGSIILETNFKIYAYTNSPLEIAI 339
Query: 252 IGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHA--------------- 296
+ LF L RFPN+ G +TR+S+R AL +GITA QII FL HA
Sbjct: 340 LNLFVHLKTRFPNMVTGQITRESIRKALFNGITADQIIKFLESHAHSQMKQMAKENLDKK 399
Query: 297 ----------------------LPSVSC-----PLPPVVSDQIRLWEGERERLTTSEGVL 329
S S LPP V DQI+LW+ E +R+ T EG L
Sbjct: 400 IEFDANNNINTATDSQQLSVGEFGSASSYHKLEILPPTVVDQIKLWQLELDRIQTFEGYL 459
Query: 330 YSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ F SQ +++ LR+YA +LGVL W ++ ++ VTK G S + F
Sbjct: 460 FKDFSSQQEYDALRNYASELGVLLWSDKNRKKFFVTKDGMSQVADF 505
>gi|115388809|ref|XP_001211910.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114195994|gb|EAU37694.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 460
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 143/420 (34%), Positives = 216/420 (51%), Gaps = 48/420 (11%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAK---EHKEATDTLTELRRNAKVALLGG 58
L ++AK +V+ LL+++ P+P + W+ K ++ A + + AL G
Sbjct: 38 LPDLAKCFVMALLYLKDPLPAKDLELWVKSDSLKTLADNVRAYIVTNPFATSLRQALTGT 97
Query: 59 GKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ----QTEGISADAVRTLF 113
+ S ++ D + L++YA +WE +L +MVG+ Q + + V+ L
Sbjct: 98 EETQSFGVLSQLPDHPPVTIADLDEYARRQWEGVLGYMVGTSGLGIQRDVTLSKGVKQLL 157
Query: 114 NEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLK 167
G + IT+DGF F+L + QVW+ ++ Y+E+ E+ G+D VE L+F+F L
Sbjct: 158 QAGHLVEIRDRRVEITQDGFAFVLQDVSTQVWHILILYVESAEAIGMDSVEVLSFVFLLS 217
Query: 168 FSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR-------- 219
G Y + ++ L L +FG+VYQ A FYPT+LA + +
Sbjct: 218 SLELGKSYEKQHLTSNQLRTLTDLADFGIVYQHSPDATHFYPTRLATTLTSDASSLSNPI 277
Query: 220 --------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLT 271
GT GF+IVETN+R+YAYT S L+++LI LF L YRFPNL G +T
Sbjct: 278 SGSLSGPDGTSSAKAGSGFIIVETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKIT 337
Query: 272 RDSVRAALRSGITAAQIIGFLRLHALPSV-------------SCP---LPPVVSDQIRLW 315
R SVR A+ GITA QII +L HA P + P LPP V DQIRLW
Sbjct: 338 RQSVRRAIEMGITADQIISYLATHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLW 397
Query: 316 EGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ ER+R+ + G L+ F S A++E YA+++GVL W++ +KR VT+ H + F
Sbjct: 398 QLERDRVKATAGFLFKDFASLAEYEDPCRYAEEIGVLVWKSNRKRMFFVTR--HEQVAAF 455
>gi|392901992|ref|NP_001255864.1| Protein Y73F8A.24, isoform b [Caenorhabditis elegans]
gi|290457493|emb|CBK19514.1| Protein Y73F8A.24, isoform b [Caenorhabditis elegans]
Length = 431
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 137/371 (36%), Positives = 215/371 (57%), Gaps = 35/371 (9%)
Query: 37 HKEATDTLTELRRNAK-----VALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECI 91
H++ TD + + K A+LG + S+ LE + R + + A+ERW+CI
Sbjct: 45 HRQTTDGKLNIDTDYKRSYMYAAMLGAAQISSLV--LEPNEGNRRGKDVEKKAVERWDCI 102
Query: 92 LRFMV--GSQQTEGISADAVRTLFNE-----GAENPV-ITKDGFQFLLLETPAQVWYFIL 143
LR++ + T+ +S + R LF + G + + IT GFQFLLL Q+W +++
Sbjct: 103 LRYLALPSEENTQAVS-ETTRNLFKKANFTSGGDTQIEITTFGFQFLLLSPVKQMWTYVI 161
Query: 144 KYLETVESKGLDLVECLTFLFQL------KFSTFGTDYSTEGM-SDKLQVFLQHLREFGL 196
+YL+ S+G D+VE + L Q+ F Y + ++ L HLRE G+
Sbjct: 162 EYLKLEMSQGQDIVEVIEPLIQIVLLANRGFKAERECYQIDANWTEPQNELLNHLRELGV 221
Query: 197 VYQRKRKAGRFYPTKLALNMATRGTLKQIR----EPGFLIVETNFRVYAYTDSNLKVALI 252
++ RKRK G F+ T+L ++AT T+ + G +IVETNFRVYAYT S L++A+I
Sbjct: 222 IFIRKRKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVYAYTSSLLQLAII 281
Query: 253 GLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS-------VSCPLP 305
LF E+ YRF +++VG++TR+SVR AL+ GITAAQII FLR +A P V+C LP
Sbjct: 282 ALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGPVNC-LP 340
Query: 306 PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVT 365
V+DQIRLWE ER R+ + +YS F S+ +F + +YA+ +L W + +++ V+V
Sbjct: 341 ITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLWSDNQQKLVIVN 400
Query: 366 KQGHSDIKKFW 376
+ GH +++++
Sbjct: 401 EDGHELVRQWY 411
>gi|323302721|gb|EGA56527.1| Tfb2p [Saccharomyces cerevisiae FostersB]
Length = 505
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 149/467 (31%), Positives = 233/467 (49%), Gaps = 94/467 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
L +AK +++ ++F E VP + W++ + + A ++
Sbjct: 39 LPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMKSLHLLIPNKSSGTLMI 98
Query: 46 ELRRNAKVAL---LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
L K++L L GG+ + S + V+ LD L++Y+ +WE IL FMVG+ +
Sbjct: 99 NLNPTFKISLRNALTGGEVQN-SFGVVVEENVVSLDLLDEYSANKWETILHFMVGTPLAK 157
Query: 103 GISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
S + L + V IT +GFQFLL E +Q+W +L+YL+ +E+ +D
Sbjct: 158 IPSEKVLNLLKHSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKMIETSKMD 217
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
LV+ L F+F L G Y + +S+ ++ LQ +R++GLV+Q+ FYPTKLAL
Sbjct: 218 LVDVLHFIFMLGALEVGKAYKIDALSETQRIMLQDMRDYGLVFQKHSNDSIFYPTKLALM 277
Query: 216 M------------ATRGTLKQIRE----------------------------------PG 229
+ A L+Q RE G
Sbjct: 278 LTSDTKTIRSASNAMDSVLRQNREEPSVNEDGANGKSTTDITTSDDLNKAGLKNQDIPDG 337
Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQII 289
LIVETNF++Y+Y++S L++A++ LF L RF N+ +G +TR+S+R AL +GITA QII
Sbjct: 338 SLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQII 397
Query: 290 GFLRLHALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGV 328
+L HA P + +C LPP V DQIRLW+ E +R+ T EG
Sbjct: 398 AYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGS 457
Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
LYS F + ++ L YAQD+GVL W+++KK+ ++K+G+S + F
Sbjct: 458 LYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDF 504
>gi|396459783|ref|XP_003834504.1| similar to RNA polymerase II transcription factor B subunit 2
[Leptosphaeria maculans JN3]
gi|312211053|emb|CBX91139.1| similar to RNA polymerase II transcription factor B subunit 2
[Leptosphaeria maculans JN3]
Length = 483
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 147/428 (34%), Positives = 217/428 (50%), Gaps = 66/428 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK V+ LL++ +P + +W T KE ++A L L
Sbjct: 39 LPHLAKSIVMALLYMPTSLPTTDLDAWFKSTARKEKEQALFILDCLHIITQQRNDDGKTS 98
Query: 48 --------RRNAKVALLGGGKPWSM---SAKLEVDSKARDLDFLNQYALERWECILRFMV 96
+R+ + AL G G S + + E K ++FL++Y+ +WE IL ++V
Sbjct: 99 SWALTPGFQRSLRNALEGAGSHCSFGVPATREESGGKKVSVEFLDEYSRGQWEGILYYLV 158
Query: 97 GSQQTEGISADAV---------RTLFNEG------AENPVITKDGFQFLLLETPAQVWYF 141
G+ +D++ + L EG P ITKDGF F+L ET AQVW
Sbjct: 159 SG--AAGLRSDSISRAEVGPGTKKLLMEGDLVRVIHGTPRITKDGFSFVLQETNAQVWSL 216
Query: 142 ILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK 201
++ YL+ V G+ E L+FLF L G DYS +SD L L GLVY+
Sbjct: 217 LIVYLKMVHELGMSETEVLSFLFMLGSLDLGRDYSISTLSDTQLQMLDDLSAMGLVYRTS 276
Query: 202 RKAGRFYPTKLAL--------------------NMATRGTLKQIREPGFLIVETNFRVYA 241
+ A FYPT+LA N GTL + GF+I+ETN+R+YA
Sbjct: 277 KDARTFYPTRLATTLTSDSGSAMSASSNDIAQANQGNAGTLATANK-GFIIIETNYRLYA 335
Query: 242 YTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVS 301
YT+S +++A++ LF +L +RFPNL G LT++SV A+ +GIT+AQII +L +A P +
Sbjct: 336 YTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVSAGITSAQIISYLSTYAHPQMQ 395
Query: 302 ---CPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
+PP V DQIRLWE E ER+ + G L F S A++ + YA LGVL W+ ++
Sbjct: 396 KNVSFIPPTVMDQIRLWEYEGERVEVTPGFLMKDFGSDAEYRDVLGYADALGVLVWRRDE 455
Query: 359 KRTVVVTK 366
+R V++
Sbjct: 456 QRCFFVSQ 463
>gi|6325135|ref|NP_015203.1| Tfb2p [Saccharomyces cerevisiae S288c]
gi|62901134|sp|Q02939.1|TFB2_YEAST RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=General transcription and DNA repair
factor IIH subunit TFB2; Short=TFIIH subunit TFB2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|1163092|gb|AAB68240.1| Tfb2p: TFIIH subunit [Saccharomyces cerevisiae]
gi|1778059|gb|AAB40628.1| transcription/repair factor TFIIH subunit Tfb2, partial
[Saccharomyces cerevisiae]
gi|151942676|gb|EDN61022.1| transcription initiation factor TFIIH subunit [Saccharomyces
cerevisiae YJM789]
gi|190407835|gb|EDV11100.1| RNA polymerase II transcription factor B subunit 2 [Saccharomyces
cerevisiae RM11-1a]
gi|256274256|gb|EEU09164.1| Tfb2p [Saccharomyces cerevisiae JAY291]
gi|259150035|emb|CAY86838.1| Tfb2p [Saccharomyces cerevisiae EC1118]
gi|285815419|tpg|DAA11311.1| TPA: Tfb2p [Saccharomyces cerevisiae S288c]
gi|323331200|gb|EGA72618.1| Tfb2p [Saccharomyces cerevisiae AWRI796]
gi|323335024|gb|EGA76314.1| Tfb2p [Saccharomyces cerevisiae Vin13]
gi|323346175|gb|EGA80465.1| Tfb2p [Saccharomyces cerevisiae Lalvin QA23]
gi|323352000|gb|EGA84539.1| Tfb2p [Saccharomyces cerevisiae VL3]
gi|365762780|gb|EHN04313.1| Tfb2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392295887|gb|EIW06990.1| Tfb2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 513
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 149/467 (31%), Positives = 233/467 (49%), Gaps = 94/467 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
L +AK +++ ++F E VP + W++ + + A ++
Sbjct: 39 LPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMKSLHLLIPNKSSGTLMI 98
Query: 46 ELRRNAKVAL---LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
L K++L L GG+ + S + V+ LD L++Y+ +WE IL FMVG+ +
Sbjct: 99 NLNPTFKISLRNALTGGEVQN-SFGVVVEENVVSLDLLDEYSANKWETILHFMVGTPLAK 157
Query: 103 GISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
S + L + V IT +GFQFLL E +Q+W +L+YL+ +E+ +D
Sbjct: 158 IPSEKVLNLLKHSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKMIETSKMD 217
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
LV+ L F+F L G Y + +S+ ++ LQ +R++GLV+Q+ FYPTKLAL
Sbjct: 218 LVDVLHFIFMLGALEVGKAYKIDALSETQRIMLQDMRDYGLVFQKHSNDSIFYPTKLALM 277
Query: 216 M------------ATRGTLKQIRE----------------------------------PG 229
+ A L+Q RE G
Sbjct: 278 LTSDTKTIRSASNAMDSVLRQNREEPSVNEDGANGKSTTDITTSDDLNKAGLKNQDIPDG 337
Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQII 289
LIVETNF++Y+Y++S L++A++ LF L RF N+ +G +TR+S+R AL +GITA QII
Sbjct: 338 SLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQII 397
Query: 290 GFLRLHALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGV 328
+L HA P + +C LPP V DQIRLW+ E +R+ T EG
Sbjct: 398 AYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGS 457
Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
LYS F + ++ L YAQD+GVL W+++KK+ ++K+G+S + F
Sbjct: 458 LYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDF 504
>gi|70982185|ref|XP_746621.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2) [Aspergillus fumigatus Af293]
gi|66844244|gb|EAL84583.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus fumigatus Af293]
gi|159122143|gb|EDP47265.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus fumigatus A1163]
Length = 479
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 145/444 (32%), Positives = 223/444 (50%), Gaps = 66/444 (14%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L ++AK +V+ LL+++ P+P A + +W+ KE A L L
Sbjct: 38 LPDLAKCFVMALLYLKDPLPAADLEAWVRPGSLKERDNALSILGRLHIMTSTTTSDNIKA 97
Query: 49 --------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ 99
+ + AL G + S ++ D +A + L++Y+ +WE +L +MVG+
Sbjct: 98 YTVTDPFASSLRQALTGAEQTQSFGVLSQISDKEAVSIADLDEYSRRQWEGVLGYMVGTS 157
Query: 100 ----QTEGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETV 149
Q + + V+ L G + IT+DGF F L + QVW+ ++ Y+E+
Sbjct: 158 GLGMQRDVSLSKGVKELLQAGHLVEIRDRRVEITQDGFAFALQDVGTQVWHILVLYVESA 217
Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYP 209
+ G+D VE L+F+F L G Y + ++ L L +FG+VYQ A FYP
Sbjct: 218 AAIGMDSVEVLSFVFFLSSLELGKSYEKKQLTSNQLRTLTDLADFGIVYQETPDATHFYP 277
Query: 210 TKLALNMAT----------------RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIG 253
T+LA + + GT GF+I+ETN+R+YAYT S L+++LI
Sbjct: 278 TRLATTLTSDSSTLSNPLAGSLSGPTGTSSSKAGSGFIIIETNYRLYAYTSSPLQISLIA 337
Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------- 300
LF L YRFPNL G +TR SVR A+ GITA QII +L HA P +
Sbjct: 338 LFTTLKYRFPNLITGKITRQSVRRAVEMGITADQIISYLSTHAHPQMRKHNVSRSTSNQA 397
Query: 301 SCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNE 357
P LPP V DQIRLW+ ER+R+ + G L+ F + A++E YA+++GVL W+++
Sbjct: 398 GMPVSVLPPTVVDQIRLWQLERDRVKATHGFLFRDFNTLAEYEAPCRYAEEIGVLVWKSD 457
Query: 358 KKRTVVVTKQGHSDIKKFWVNHQK 381
+KR VT+ H + F + +K
Sbjct: 458 RKRMFFVTR--HHQVAAFLKSRRK 479
>gi|401623322|gb|EJS41426.1| tfb2p [Saccharomyces arboricola H-6]
Length = 513
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 148/467 (31%), Positives = 233/467 (49%), Gaps = 94/467 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
L +AK +++ ++F E VP + W++ + ++A ++
Sbjct: 39 LPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQDAIKSMKSLHLLIPNKSSGTLMI 98
Query: 46 ELRRNAKVAL---LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
L K++L L GG+ + S + V+ L L++Y+ +WE IL FMVG+ +
Sbjct: 99 NLNPTFKISLRNALTGGEVRN-SFGIVVEDNVVSLKMLDEYSANKWETILHFMVGTPLAK 157
Query: 103 GISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
S + L + + IT +GFQFLL E +Q+W +L+YL+ +E+ +D
Sbjct: 158 IPSEKVLNLLKHSKLMEEINSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKVIETSKMD 217
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
LV+ L F+F L G Y + +SD ++ LQ +R++GLV+Q+ FYPTKLAL
Sbjct: 218 LVDVLHFIFMLGALEVGKAYKIDALSDTQRIMLQDMRDYGLVFQKHSNDTIFYPTKLALM 277
Query: 216 M------------ATRGTLKQIRE----------------------------------PG 229
+ A L+Q RE G
Sbjct: 278 LTSDTKTIRSASNAMDSVLRQNREEPSANEEGANGKATTDTTTSDDLNKADLKNQDIPDG 337
Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQII 289
LIVETNF++Y+Y++S L++A++ LF L RF N+ +G +TR+S+R AL +GITA QII
Sbjct: 338 SLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITAEQII 397
Query: 290 GFLRLHALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGV 328
+L HA P + +C LPP V DQIRLW+ E +R+ T EG
Sbjct: 398 AYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVMTYEGS 457
Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
LYS F + ++ L YAQD+GVL W+++KK+ ++K+G+S + F
Sbjct: 458 LYSDFETSQEYNVLNKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDF 504
>gi|156057729|ref|XP_001594788.1| hypothetical protein SS1G_04596 [Sclerotinia sclerotiorum 1980]
gi|154702381|gb|EDO02120.1| hypothetical protein SS1G_04596 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 420
Score = 228 bits (580), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 133/349 (38%), Positives = 196/349 (56%), Gaps = 35/349 (10%)
Query: 52 KVALLGGGKPWSMSAKLEVDSKAR--DLDFLNQYALERWECILRFMV-----GSQQTEGI 104
++AL GGG S D A D+DFL+ +A +WE IL +MV GS Q
Sbjct: 59 RLALTGGGNHQSFGVP-SSDHIAPHVDIDFLDVHARMQWEGILHYMVNNVAAGSGQEGSG 117
Query: 105 SADAVRTLFNEG-----AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
A +V+ L + G + IT+ GF FLL E AQVW +L ++E ES G+D V+
Sbjct: 118 PAGSVKKLLDAGKLVTRGRSVGITQAGFSFLLQEANAQVWTLLLLWIENAESMGMDSVDV 177
Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR 219
L+FLF L G YST ++D L +L +FGL+Y +F+PT+LA + +
Sbjct: 178 LSFLFMLGSLELGRAYSTTTLTDAQHNMLGNLIDFGLIYLPPSAPTQFFPTRLATTLTSD 237
Query: 220 GT-LKQI------------REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLT 266
+ L+ + + GF+I+ETN+R+YAYT+S L++A++ LF +L R+PN+
Sbjct: 238 ASALRTVSAGFDAASKSAANQKGFIIIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMV 297
Query: 267 VGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--------SCP-LPPVVSDQIRLWEG 317
G ++RDS+R A+ GIT+ QII +L HA P + P LPP V DQIRLW+
Sbjct: 298 SGRVSRDSIRTAISHGITSDQIITYLSTHAHPQLVKASTAAHGGPVLPPTVVDQIRLWQL 357
Query: 318 ERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
E ER+ G L+ +F Q ++E YA+++GVL W+N+ KR VT+
Sbjct: 358 ENERMKAVPGFLFKEFDGQKEYEGCAKYAEEVGVLVWKNDAKRMFFVTR 406
>gi|448103836|ref|XP_004200137.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
gi|359381559|emb|CCE82018.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
Length = 512
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 147/473 (31%), Positives = 218/473 (46%), Gaps = 93/473 (19%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK Y++ +LF E P+ ++ W T K EA L L
Sbjct: 40 LPPLAKFYIMSMLFYEHPIALKDMSKWCKPTAKKIQFEALKRLRSLHLIEEHNKGSHLKL 99
Query: 48 ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
R+N + L G + + D D+ FL+ +A ++WE IL FMVG++ +
Sbjct: 100 HQTFRKNFRDCLTGSQTSNAFGNLCTSPDKNHIDVKFLDMFASQKWESILHFMVGTELSV 159
Query: 103 GISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
S + L + G A N IT GFQFLL + Q+W +L+YL + +
Sbjct: 160 APSKSVLSLLKSGGLMEGPGTSASNLKITNKGFQFLLQDVNVQIWTLLLQYLSLTQDLNM 219
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
D V+ L F+F L G Y+ +S+ L L+++GL+YQ RFYPT+LA
Sbjct: 220 DPVDVLNFIFVLGCLELGKSYTVSSLSETQISMLADLKDYGLIYQNHNDKERFYPTRLAT 279
Query: 215 NMATRGT--------LKQIREP---------------GFLIVETNFRVYAYTDSNLKVAL 251
+ + L Q+ + G +I+ETNF++YAYT+S L++A+
Sbjct: 280 TLTSEAAALRTPSMALNQVVDTANSEATEEQMFSNTGGSIILETNFKIYAYTNSPLEIAI 339
Query: 252 IGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC--------- 302
+ LF L RFPN+ G +TR+S+R AL +GITA QII FL HA +
Sbjct: 340 LNLFVHLKTRFPNMVTGQITRESIRKALFNGITADQIIKFLESHAHSQMKQMAKENLDKK 399
Query: 303 ---------------------------------PLPPVVSDQIRLWEGERERLTTSEGVL 329
LPP V DQI+LW+ E +R+ T EG L
Sbjct: 400 IEFDANNNINTATDNQQLSVGEFGSANSYHKLEVLPPTVVDQIKLWQLELDRIQTFEGYL 459
Query: 330 YSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
+ F SQ +++ LR YA +LGVL W ++ ++ VTK G S + F N +G
Sbjct: 460 FKDFSSQQEYDALRSYASELGVLLWSDKNRKKFFVTKDGMSQVADF-ANKNRG 511
>gi|67541703|ref|XP_664619.1| hypothetical protein AN7015.2 [Aspergillus nidulans FGSC A4]
gi|40742471|gb|EAA61661.1| hypothetical protein AN7015.2 [Aspergillus nidulans FGSC A4]
gi|259483674|tpe|CBF79257.1| TPA: TFIIH and nucleotide excision repair factor 3 complexes
subunit (Tfb2), putative (AFU_orthologue; AFUA_4G04360)
[Aspergillus nidulans FGSC A4]
Length = 482
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 146/444 (32%), Positives = 224/444 (50%), Gaps = 73/444 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW-----LSKTHAKEHKEATDTLTELR-------- 48
L ++AK +V+ LL+++ P+P A + +W L + +E A L L
Sbjct: 37 LPDLAKCFVMALLYLKDPLPAADLETWVRSESLRLSFLRERDSALSILGRLHILTNTTTS 96
Query: 49 -------------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRF 94
+ + AL G + S + D + + L++YA +WE +L +
Sbjct: 97 GNVRAYMVTDPFAASLRQALTGAKETQSFGVLHHIPDDQTVPIHDLDEYARRQWEGVLGY 156
Query: 95 MVGSQQTEGISAD-----AVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFIL 143
MVG+ GI D V+ L G + IT+DGF F+L + QVW+ ++
Sbjct: 157 MVGTSGL-GIQRDVNLSKGVKQLLQAGHLVEIRDRRVEITQDGFAFVLQDVGTQVWHILI 215
Query: 144 KYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRK 203
Y+E+ E+ G+D VE L+F+F L G Y + ++ L L +FG+VYQ +
Sbjct: 216 LYVESAEAIGMDSVEVLSFIFLLSSLELGKSYEKKHLTSNQLRTLTDLADFGIVYQDSPE 275
Query: 204 AGRFYPTKLALNMATRGTL------KQIREP----------GFLIVETNFRVYAYTDSNL 247
A FYPT+LA + + + + +P GF+I+ETN+R+YAYT S L
Sbjct: 276 ASHFYPTRLATTLTSDSSALSNPISGALSDPDGGDSNQPGSGFIIIETNYRLYAYTSSPL 335
Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------- 300
+++LI LF L YRFPNL G +TR S+R A+ GITA QII +L HA P +
Sbjct: 336 QISLIALFTTLKYRFPNLVTGKVTRQSIRRAIEMGITADQIISYLATHAHPQMRKHNVAR 395
Query: 301 ---------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGV 351
LPP V DQIRLW+ ER+R+ + G L+ FVS A++E YA+++GV
Sbjct: 396 STSNQAGMPPSVLPPTVVDQIRLWQLERDRVKATAGFLFKDFVSLAEYEAPCRYAEEIGV 455
Query: 352 LTWQNEKKRTVVVTKQGHSDIKKF 375
L W++++KR VT+ H + F
Sbjct: 456 LIWKSDRKRMFFVTR--HEQVAAF 477
>gi|328863850|gb|EGG12949.1| hypothetical protein MELLADRAFT_46475 [Melampsora larici-populina
98AG31]
Length = 454
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 127/342 (37%), Positives = 197/342 (57%), Gaps = 17/342 (4%)
Query: 48 RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISAD 107
++N ++AL GGG S + SK DL L+QYA + WE IL FMVGS + S+
Sbjct: 106 KQNFQLALTGGGDHKSFGLPCKTSSKV-DLLTLDQYAKQNWETILHFMVGSNWSNRPSSK 164
Query: 108 AVRTLFNEG---AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLF 164
+ L G ++N IT GFQFLL + Q+W +L+YL+ E+ LD+V+ L+ LF
Sbjct: 165 IITLLTFSGLISSQNSKITSVGFQFLLDDINTQLWELLLQYLKMAEANDLDIVDVLSCLF 224
Query: 165 QLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRG---- 220
L G +YS + SD L L ++GLVY + K FYPT+L + +
Sbjct: 225 MLGSLELGKEYSMKNFSDTQVQVLNDLVDYGLVYLTQSKT--FYPTRLVTTLTSTAPPIV 282
Query: 221 ---TLKQIREPG-FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVR 276
+ + +P FLI+ETN+R+YAYT + L++A++ LF RFPNL +G +TR+S++
Sbjct: 283 SNPSDQSSSDPNEFLILETNYRIYAYTSNPLQIAILNLFISFKSRFPNLVIGSITRESIK 342
Query: 277 AALRSGITAAQIIGFLRLHA---LPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
A ++GI A QII +L H+ + + LP V DQIRLWE E+ R+ EG LY F
Sbjct: 343 MAFKNGIRADQIISYLEFHSHSQMKLLKPILPNTVVDQIRLWELEKNRVRDQEGYLYEDF 402
Query: 334 VSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
S +D+E + +Y++ + ++ W+N + + V+ GH+ +++F
Sbjct: 403 KSVSDYEIVINYSKQIDIILWENPELKKFFVSLDGHTALREF 444
>gi|392901990|ref|NP_001255863.1| Protein Y73F8A.24, isoform a [Caenorhabditis elegans]
gi|26985915|emb|CAB70230.2| Protein Y73F8A.24, isoform a [Caenorhabditis elegans]
Length = 481
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 139/377 (36%), Positives = 216/377 (57%), Gaps = 36/377 (9%)
Query: 37 HKEATDTLTELRRNAK-----VALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECI 91
H++ TD + + K A+LG + S+ LE + R + + A+ERW+CI
Sbjct: 95 HRQTTDGKLNIDTDYKRSYMYAAMLGAAQISSLV--LEPNEGNRRGKDVEKKAVERWDCI 152
Query: 92 LRFMV--GSQQTEGISADAVRTLFNE-----GAENPV-ITKDGFQFLLLETPAQVWYFIL 143
LR++ + T+ +S + R LF + G + + IT GFQFLLL Q+W +++
Sbjct: 153 LRYLALPSEENTQAVS-ETTRNLFKKANFTSGGDTQIEITTFGFQFLLLSPVKQMWTYVI 211
Query: 144 KYLETVESKGLDLVECLTFLFQL------KFSTFGTDYSTEGM-SDKLQVFLQHLREFGL 196
+YL+ S+G D+VE + L Q+ F Y + ++ L HLRE G+
Sbjct: 212 EYLKLEMSQGQDIVEVIEPLIQIVLLANRGFKAERECYQIDANWTEPQNELLNHLRELGV 271
Query: 197 VYQRKRKAGRFYPTKLALNMATRGTLKQIR----EPGFLIVETNFRVYAYTDSNLKVALI 252
++ RKRK G F+ T+L ++AT T+ + G +IVETNFRVYAYT S L++A+I
Sbjct: 272 IFIRKRKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVYAYTSSLLQLAII 331
Query: 253 GLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS-------VSCPLP 305
LF E+ YRF +++VG++TR+SVR AL+ GITAAQII FLR +A P V+C LP
Sbjct: 332 ALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGPVNC-LP 390
Query: 306 PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVT 365
V+DQIRLWE ER R+ + +YS F S+ +F + +YA+ +L W + +++ V+V
Sbjct: 391 ITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLWSDNQQKLVIVN 450
Query: 366 KQGHSDIKKFWVNHQKG 382
+ GH +++ W +K
Sbjct: 451 EDGHELVRQ-WYKREKS 466
>gi|349581695|dbj|GAA26852.1| K7_Tfb2p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 513
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 149/467 (31%), Positives = 232/467 (49%), Gaps = 94/467 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
L +AK +++ ++F E VP + W++ + + A ++
Sbjct: 39 LPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMKSLHLLIPNKSSGTLMI 98
Query: 46 ELRRNAKVAL---LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
L K++L L GG+ + S + V+ LD L++Y+ +WE IL FMVG+ +
Sbjct: 99 NLNPTFKISLRNALTGGEVQN-SFGVVVEENVVSLDLLDEYSANKWETILHFMVGTPLAK 157
Query: 103 GISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
S + L + V IT +GFQFLL E +Q+W +L+YL+ +E+ +D
Sbjct: 158 IPSEKVLNLLKHSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKMIETSKMD 217
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
LV+ L F+F L G Y + +S ++ LQ +R++GLV+Q+ FYPTKLAL
Sbjct: 218 LVDVLHFIFMLGALEVGKAYKIDALSQTQRIMLQDMRDYGLVFQKHSNDSIFYPTKLALM 277
Query: 216 M------------ATRGTLKQIRE----------------------------------PG 229
+ A L+Q RE G
Sbjct: 278 LTSDTKTIRSASNAMDSVLRQNREEPSVNEDGANGKSTTDITTSDDLNKAGLKNQDIPDG 337
Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQII 289
LIVETNF++Y+Y++S L++A++ LF L RF N+ +G +TR+S+R AL +GITA QII
Sbjct: 338 SLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQII 397
Query: 290 GFLRLHALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGV 328
+L HA P + +C LPP V DQIRLW+ E +R+ T EG
Sbjct: 398 AYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGS 457
Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
LYS F + ++ L YAQD+GVL W+++KK+ ++K+G+S + F
Sbjct: 458 LYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDF 504
>gi|453232318|ref|NP_001263825.1| Protein Y73F8A.24, isoform d [Caenorhabditis elegans]
gi|412976516|emb|CCO25635.1| Protein Y73F8A.24, isoform d [Caenorhabditis elegans]
Length = 435
Score = 227 bits (578), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 137/375 (36%), Positives = 218/375 (58%), Gaps = 39/375 (10%)
Query: 37 HKEATDTLTELRRNAK-----VALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECI 91
H++ TD + + K A+LG + S+ LE + R + + A+ERW+CI
Sbjct: 45 HRQTTDGKLNIDTDYKRSYMYAAMLGAAQISSLV--LEPNEGNRRGKDVEKKAVERWDCI 102
Query: 92 LRFMV--GSQQTEGISADAVRTLFNE-----GAENPV-ITKDGFQFLLLETPAQVWYFIL 143
LR++ + T+ +S + R LF + G + + IT GFQFLLL Q+W +++
Sbjct: 103 LRYLALPSEENTQAVS-ETTRNLFKKANFTSGGDTQIEITTFGFQFLLLSPVKQMWTYVI 161
Query: 144 KYLETVESKGLDLVECLTFLFQL-------KFSTFGTD---YSTEGM-SDKLQVFLQHLR 192
+YL+ S+G D+VE + L Q+ + + F + Y + ++ L HLR
Sbjct: 162 EYLKLEMSQGQDIVEVIEPLIQIVLLANRVQVAGFKAERECYQIDANWTEPQNELLNHLR 221
Query: 193 EFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR----EPGFLIVETNFRVYAYTDSNLK 248
E G+++ RKRK G F+ T+L ++AT T+ + G +IVETNFRVYAYT S L+
Sbjct: 222 ELGVIFIRKRKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVYAYTSSLLQ 281
Query: 249 VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS-------VS 301
+A+I LF E+ YRF +++VG++TR+SVR AL+ GITAAQII FLR +A P V+
Sbjct: 282 LAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGPVN 341
Query: 302 CPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
C LP V+DQIRLWE ER R+ + +YS F S+ +F + +YA+ +L W + +++
Sbjct: 342 C-LPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLWSDNQQKL 400
Query: 362 VVVTKQGHSDIKKFW 376
V+V + GH +++++
Sbjct: 401 VIVNEDGHELVRQWY 415
>gi|453232316|ref|NP_001263824.1| Protein Y73F8A.24, isoform c [Caenorhabditis elegans]
gi|412976517|emb|CCO25636.1| Protein Y73F8A.24, isoform c [Caenorhabditis elegans]
Length = 485
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 139/381 (36%), Positives = 219/381 (57%), Gaps = 40/381 (10%)
Query: 37 HKEATDTLTELRRNAK-----VALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECI 91
H++ TD + + K A+LG + S+ LE + R + + A+ERW+CI
Sbjct: 95 HRQTTDGKLNIDTDYKRSYMYAAMLGAAQISSLV--LEPNEGNRRGKDVEKKAVERWDCI 152
Query: 92 LRFMV--GSQQTEGISADAVRTLFNE-----GAENPV-ITKDGFQFLLLETPAQVWYFIL 143
LR++ + T+ +S + R LF + G + + IT GFQFLLL Q+W +++
Sbjct: 153 LRYLALPSEENTQAVS-ETTRNLFKKANFTSGGDTQIEITTFGFQFLLLSPVKQMWTYVI 211
Query: 144 KYLETVESKGLDLVECLTFLFQL-------KFSTFGTD---YSTEGM-SDKLQVFLQHLR 192
+YL+ S+G D+VE + L Q+ + + F + Y + ++ L HLR
Sbjct: 212 EYLKLEMSQGQDIVEVIEPLIQIVLLANRVQVAGFKAERECYQIDANWTEPQNELLNHLR 271
Query: 193 EFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR----EPGFLIVETNFRVYAYTDSNLK 248
E G+++ RKRK G F+ T+L ++AT T+ + G +IVETNFRVYAYT S L+
Sbjct: 272 ELGVIFIRKRKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVYAYTSSLLQ 331
Query: 249 VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS-------VS 301
+A+I LF E+ YRF +++VG++TR+SVR AL+ GITAAQII FLR +A P V+
Sbjct: 332 LAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGPVN 391
Query: 302 CPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
C LP V+DQIRLWE ER R+ + +YS F S+ +F + +YA+ +L W + +++
Sbjct: 392 C-LPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLWSDNQQKL 450
Query: 362 VVVTKQGHSDIKKFWVNHQKG 382
V+V + GH +++ W +K
Sbjct: 451 VIVNEDGHELVRQ-WYKREKS 470
>gi|154273196|ref|XP_001537450.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150415962|gb|EDN11306.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 455
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 152/403 (37%), Positives = 209/403 (51%), Gaps = 53/403 (13%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKP 61
L +AK +V+ LL+++ P+P A + W+ K +K TD + R A L GG K
Sbjct: 38 LPYLAKCFVMALLYLKDPLPAADLELWV-KAESKRAYTVTDPFSSSLRQA---LTGGDKQ 93
Query: 62 WSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGS-----QQTEGISADAVRTLFNE 115
S L D + + L+ YA +WE +L +MVG+ QQ +S V+ L
Sbjct: 94 QSFGVPSLTDDENSMTVAQLDYYARSQWEGVLGYMVGTSALGVQQAVTLS-KGVKQLLQA 152
Query: 116 GAENPV------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFS 169
V ITKDGF F+L + QVW+ ++ Y+E E G+D +E L+FLF L
Sbjct: 153 CHLVEVRDRRVEITKDGFAFVLQDLNTQVWHILILYVENAEQIGMDSIEVLSFLFVLSSL 212
Query: 170 TFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGRFYPTKLALNM----------- 216
G Y + ++ L L +FG+VYQ ++A RFYPT+LA +
Sbjct: 213 ELGQSYEKKHLTSTQLKTLADLTDFGIVYQHSPHQEATRFYPTRLATTLTSDSITLGSSI 272
Query: 217 -----ATRGTLKQI-REP--GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
A G EP GF+I+ETN+R+YAYT S L+++LI LF L YRFPNL G
Sbjct: 273 SSSLTAPNGVPSATSNEPSTGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITG 332
Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALP---------------SVSCPLPPVVSDQIR 313
LTR SVR A+ GITA QII +L HA P V LPP V DQIR
Sbjct: 333 KLTRQSVRRAVEMGITADQIISYLTTHAHPQMRKYHATKPGANPVGVHTVLPPTVVDQIR 392
Query: 314 LWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQN 356
LW+ ER+R+ + G L+ FVS A+FE YA+++GVL +N
Sbjct: 393 LWQLERDRIKATPGFLFKDFVSLAEFEGPCRYAEEIGVLIQRN 435
>gi|334186626|ref|NP_001190745.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
gi|332658440|gb|AEE83840.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
Length = 482
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 129/408 (31%), Positives = 225/408 (55%), Gaps = 35/408 (8%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKP 61
L +AK YV+++L+++ PVP ++ W+ +H+ A D L +LR ++++ G
Sbjct: 40 LPPLAKKYVLQMLYIDVPVPATMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTS 99
Query: 62 WSMSAKLEVD---------------------SKARDLDFLNQYALERWECILRFMVGSQQ 100
+S++ + + K L L YAL++WEC L ++ S Q
Sbjct: 100 YSLNPTFQNNLQKHIISGGVLPREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQ 159
Query: 101 TE---GISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
E GIS+ ++ +F G + P +T+ GFQFLL++T AQ+WY I +Y+ E
Sbjct: 160 GEKLTGISSSMMK-IFQRGLLSQRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEE 218
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPT 210
+ +D + ++FL +L F G Y+ +++ L+ L + GLV Q+ RK F PT
Sbjct: 219 RDVDPADLISFLLELSFHVTGQAYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPT 278
Query: 211 KLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
KLA N++ R+ GF+++ETNFR+YAY+ S L+ ++ LF + Y+ PNL +
Sbjct: 279 KLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAI 338
Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEG 327
T++S+ A +GIT+ QII FL+ ++ P + +P + V+DQIRLWE + +R+ ++
Sbjct: 339 TKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQA 398
Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
Y +F S+ FE D+A++ L W++ K+ +VV + H+ +++F
Sbjct: 399 HFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREF 446
>gi|42572935|ref|NP_974564.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
gi|332658438|gb|AEE83838.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
Length = 462
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 129/408 (31%), Positives = 225/408 (55%), Gaps = 35/408 (8%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKP 61
L +AK YV+++L+++ PVP ++ W+ +H+ A D L +LR ++++ G
Sbjct: 40 LPPLAKKYVLQMLYIDVPVPATMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTS 99
Query: 62 WSMSAKLEVD---------------------SKARDLDFLNQYALERWECILRFMVGSQQ 100
+S++ + + K L L YAL++WEC L ++ S Q
Sbjct: 100 YSLNPTFQNNLQKHIISGGVLPREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQ 159
Query: 101 TE---GISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
E GIS+ ++ +F G + P +T+ GFQFLL++T AQ+WY I +Y+ E
Sbjct: 160 GEKLTGISSSMMK-IFQRGLLSQRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEE 218
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPT 210
+ +D + ++FL +L F G Y+ +++ L+ L + GLV Q+ RK F PT
Sbjct: 219 RDVDPADLISFLLELSFHVTGQAYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPT 278
Query: 211 KLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
KLA N++ R+ GF+++ETNFR+YAY+ S L+ ++ LF + Y+ PNL +
Sbjct: 279 KLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAI 338
Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEG 327
T++S+ A +GIT+ QII FL+ ++ P + +P + V+DQIRLWE + +R+ ++
Sbjct: 339 TKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQA 398
Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
Y +F S+ FE D+A++ L W++ K+ +VV + H+ +++F
Sbjct: 399 HFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREF 446
>gi|42566894|ref|NP_193435.2| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
gi|51969678|dbj|BAD43531.1| unnamed protein product [Arabidopsis thaliana]
gi|115646777|gb|ABJ17114.1| At4g17020 [Arabidopsis thaliana]
gi|332658439|gb|AEE83839.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
Length = 452
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 129/408 (31%), Positives = 225/408 (55%), Gaps = 35/408 (8%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKP 61
L +AK YV+++L+++ PVP ++ W+ +H+ A D L +LR ++++ G
Sbjct: 40 LPPLAKKYVLQMLYIDVPVPATMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTS 99
Query: 62 WSMSAKLEVD---------------------SKARDLDFLNQYALERWECILRFMVGSQQ 100
+S++ + + K L L YAL++WEC L ++ S Q
Sbjct: 100 YSLNPTFQNNLQKHIISGGVLPREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQ 159
Query: 101 TE---GISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
E GIS+ ++ +F G + P +T+ GFQFLL++T AQ+WY I +Y+ E
Sbjct: 160 GEKLTGISSSMMK-IFQRGLLSQRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEE 218
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPT 210
+ +D + ++FL +L F G Y+ +++ L+ L + GLV Q+ RK F PT
Sbjct: 219 RDVDPADLISFLLELSFHVTGQAYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPT 278
Query: 211 KLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
KLA N++ R+ GF+++ETNFR+YAY+ S L+ ++ LF + Y+ PNL +
Sbjct: 279 KLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAI 338
Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEG 327
T++S+ A +GIT+ QII FL+ ++ P + +P + V+DQIRLWE + +R+ ++
Sbjct: 339 TKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQA 398
Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
Y +F S+ FE D+A++ L W++ K+ +VV + H+ +++F
Sbjct: 399 HFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREF 446
>gi|410082501|ref|XP_003958829.1| hypothetical protein KAFR_0H02850 [Kazachstania africana CBS 2517]
gi|372465418|emb|CCF59694.1| hypothetical protein KAFR_0H02850 [Kazachstania africana CBS 2517]
Length = 514
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 142/464 (30%), Positives = 229/464 (49%), Gaps = 93/464 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +++K +V+ ++F E V + W+ + +EA ++ L
Sbjct: 39 LPQLSKFFVMAMVFNENDVSLRDLDRWVKTNGKLQFQEAIKSMKSLHLIIPSKGNGTLMI 98
Query: 48 ------RRNAKVALLGG--GKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
+ + + AL GG + + E + L+FL++Y+ +WE IL FMVG+
Sbjct: 99 SLNPTFKESFRNALTGGEINNSFGIVVDNEAEQSRITLNFLDEYSANKWETILHFMVGTP 158
Query: 100 QTEGISADAVRTLFN----EGAENP---VITKDGFQFLLLETPAQVWYFILKYLETVESK 152
S + + L N E A+ IT +GFQFLL E +Q+W +L+YL+ ES
Sbjct: 159 LANIPSHNVLNLLKNSRLMEEADKTGTLKITNEGFQFLLQEINSQIWTLLLQYLKMSESL 218
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
+D VE L F+F L G YS +G+++ + L+ R++G+++Q+ + FYPT L
Sbjct: 219 QMDPVEVLNFIFMLGALEIGKAYSIDGLTETQKTMLRDFRDYGIIFQKNLTSKSFYPTNL 278
Query: 213 ALNM-------------ATRGTLKQIREP------------------------------G 229
A+ + A L+Q +E G
Sbjct: 279 AIMLTSDTRSILRTASGAMDSVLRQNKEDSKKQTSGDNVTEDTANDIEQIGLENQEGRDG 338
Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQII 289
LI+ETNF++Y+Y++S L++A++ LF L RF N+ G +TR+S+R AL +GITA QII
Sbjct: 339 SLIIETNFKLYSYSNSPLQIAILSLFVHLKSRFVNMVTGQITRESIRRALVNGITADQII 398
Query: 290 GFLRLHALPSV-------------SCP--------LPPVVSDQIRLWEGERERLTTSEGV 328
+L HA P + P LPP V DQI+LW+ E +R+ T +G
Sbjct: 399 AYLETHAHPRMRRLAEEKLEKKLELDPNSKDTLQILPPTVVDQIKLWQLELDRVITYDGS 458
Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
LYS F + ++ L YAQD+GVL W+++KKR V+K+G++ +
Sbjct: 459 LYSDFENSTEYNMLSKYAQDIGVLIWKDDKKRKFFVSKEGNTQV 502
>gi|51969958|dbj|BAD43671.1| unnamed protein product [Arabidopsis thaliana]
Length = 452
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 129/408 (31%), Positives = 225/408 (55%), Gaps = 35/408 (8%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKP 61
L +AK YV+++L+++ PVP ++ W+ +H+ A D L +LR ++++ G
Sbjct: 40 LPPLAKKYVLQMLYIDVPVPATMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTS 99
Query: 62 WSMSAKLEVD---------------------SKARDLDFLNQYALERWECILRFMVGSQQ 100
+S++ + + K L L YAL++WEC L ++ S Q
Sbjct: 100 YSLNPTFQNNLQKHIISGGVLPREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQ 159
Query: 101 TE---GISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
E GIS+ ++ +F G + P +T+ GFQFLL++T AQ+WY I +Y+ E
Sbjct: 160 GEKLTGISSSMMK-IFQRGLLSQRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEE 218
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPT 210
+ +D + ++FL +L F G Y+ +++ L+ L + GLV Q+ RK F PT
Sbjct: 219 RDVDPADLISFLLELSFHVTGQAYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPT 278
Query: 211 KLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
KLA N++ R+ GF+++ETNFR+YAY+ S L+ ++ LF + Y+ PNL +
Sbjct: 279 KLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAI 338
Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEG 327
T++S+ A +GIT+ QII FL+ ++ P + +P + V+DQIRLWE + +R+ ++
Sbjct: 339 TKESLYNAFGNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQA 398
Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
Y +F S+ FE D+A++ L W++ K+ +VV + H+ +++F
Sbjct: 399 HFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREF 446
>gi|344229873|gb|EGV61758.1| transcription factor Tfb2 [Candida tenuis ATCC 10573]
Length = 498
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 147/456 (32%), Positives = 219/456 (48%), Gaps = 84/456 (18%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK +++ +LF E+ V + W E+ + + L
Sbjct: 40 LPPMAKFHIMSMLFYEKNVAIKDLEKWSKPNTRHLQIESLERMRSLHLVQLSESGKYIRL 99
Query: 48 ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
++N L G P + + V+ DL FL+ +A +WE IL +MVG++ +
Sbjct: 100 HPIFKKNFADCLTGNQAPDAFGNLDTTVEKHTVDLKFLDTFASTKWETILHYMVGTELSS 159
Query: 103 GISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
S +R L + G A+ IT GFQFLL + AQ+W +L+YL+ +
Sbjct: 160 TPSRSVLRLLKSGGLMEGNSDYAQELKITNKGFQFLLQDINAQIWTLLLQYLDLTLELQM 219
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
D V+ L F+F L G YS +SD L LR+FGLVYQR + +FYPT+LA
Sbjct: 220 DAVDVLNFIFLLGSLELGKSYSVSALSDTQIEMLSDLRDFGLVYQRSATSNKFYPTRLAT 279
Query: 215 NMATRGTLKQIREPGF--------------------LIVETNFRVYAYTDSNLKVALIGL 254
+ + ++ P +I+ETNF+VY+YT S L++A++ L
Sbjct: 280 TLTSDS--NSLQTPSMAIDKANSGIDSTDSKQRQESIIIETNFKVYSYTQSPLEIAILNL 337
Query: 255 FCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP---------------- 298
F L RF N+ G +TR+S+R AL +GITA Q+I FL HA P
Sbjct: 338 FVHLRTRFANMVTGQITRESIRNALYNGITAGQVIKFLETHAHPQMKMLAQEKLDKKIEF 397
Query: 299 ------------SVSCP-------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADF 339
S P LPP V DQI+LW+ E +R+ T +G L+ +F +Q D+
Sbjct: 398 DASNNINTATGKSTDAPMQHKLEILPPNVVDQIKLWQLELDRIQTFDGYLFREFNNQIDY 457
Query: 340 EKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
E LR+YA ++GVL W ++KKRT VTK+G + F
Sbjct: 458 ESLRNYASEIGVLVWSDDKKRTCFVTKEGLKQVSNF 493
>gi|281208616|gb|EFA82792.1| general transcription factor IIH [Polysphondylium pallidum PN500]
Length = 458
Score = 224 bits (572), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 135/396 (34%), Positives = 228/396 (57%), Gaps = 30/396 (7%)
Query: 6 AKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR--------RNAKVALLG 57
K Y++++L +E + ++SW ++++ + H E+ + L +L+ N K ++
Sbjct: 61 CKQYILKMLLIES-IQVNNVSSWTAQSNKQTHLESLERLEDLKILIKQEFQSNIKESI-T 118
Query: 58 GGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGSQQTEG-ISADAVR----T 111
K +SA V+ + +D L+ Y+ +WE +L F+ +T + AD + T
Sbjct: 119 NMKNVVVSAVERVEPNLQMSVDQLDSYSKSQWEKVLYFLSDDSETPPELIADLLALSNLT 178
Query: 112 LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTF 171
N+G+ VIT +GF+FLL + Q+W I+ YL ++E++G + L FLF+L F +
Sbjct: 179 KLNDGS--MVITSEGFKFLLKDIYTQIWTLIIVYLNSLETRGKSRRDALAFLFKLSFLSL 236
Query: 172 GTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR----- 226
G+ Y +++ + L LR+FGLVY R K+ FYPT+L +++ T T+ I+
Sbjct: 237 GSAYYVNDLTEDEKSLLFDLRQFGLVYIRSEKSEIFYPTRLIISLTTGKTVTVIKDLAKE 296
Query: 227 ------EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALR 280
E G++I+ETNFR+YAYT S+L+++L+ LF +++YR PNL+VG+LTR+S+R A
Sbjct: 297 MSNTQKEQGYIILETNFRIYAYTASSLQISLLSLFVKMLYRLPNLSVGILTRESIRTAFL 356
Query: 281 SGITAAQIIGFLRLHALPSV-SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADF 339
GITA QI+ F++ + P++ P +V +QIR+WE ER R+ + VL+ F +Q F
Sbjct: 357 HGITADQIVDFIKQNGHPNMLKVGAPEIVFEQIRIWENERNRILYKKAVLFDSFPTQESF 416
Query: 340 EKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
YA+DL W +E K+ +VV+ G IK +
Sbjct: 417 NMTLQYAKDLSFYMWASEAKKVLVVSDNGFDAIKNY 452
>gi|448527133|ref|XP_003869438.1| TFIIH and nucleotide excision repair factor 3 complex subunit
[Candida orthopsilosis Co 90-125]
gi|380353791|emb|CCG23303.1| TFIIH and nucleotide excision repair factor 3 complex subunit
[Candida orthopsilosis]
Length = 500
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 146/457 (31%), Positives = 221/457 (48%), Gaps = 85/457 (18%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
LS +AK Y++ ++F E+P+ + W + K EA L L
Sbjct: 39 LSPMAKFYIMSMIFNEKPIAMRDLTKWCKPSAKKMEFEALKRLESLHLIDYDGKGTHVRL 98
Query: 48 ----RRNAKVALLGGGKP---WSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
R N + L G P S+S +VD ++ FL+++A WE IL FMVG++
Sbjct: 99 NSIFRSNFRDCLTGSQDPNAFGSISTGDDVDK--VEISFLDKFASHEWETILHFMVGTEG 156
Query: 101 TEGISADAVRTL----FNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
T S+ + L EG IT GFQFLL + AQ+W +L+YL ++
Sbjct: 157 TPTPSSSVLSLLKLGGLMEGESTLNITNTGFQFLLQDANAQIWTLLLQYLNLTSELNMNP 216
Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL-- 214
V+ L F+F L G YS +S+ L L++ GLVYQ+ + +FYPT+LA
Sbjct: 217 VDVLNFIFILGCLELGKGYSVSNLSETQVSMLADLKDLGLVYQKSDTSNKFYPTRLATTL 276
Query: 215 ----------NMATRGTLKQIREPGF-------LIVETNFRVYAYTDSNLKVALIGLFCE 257
+MA + L++ E +I+ETNF++YAYT+S L++A++ LF +
Sbjct: 277 TSDSSALKTPSMAVQQALEENEEQMMASNSRESIIIETNFKIYAYTNSPLEIAILNLFVQ 336
Query: 258 LMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------------- 300
+ RF N+ G +TR+S+R AL +GIT+ QII FL HA P +
Sbjct: 337 MKTRFSNMVCGQITRESIRNALYNGITSDQIIKFLETHAHPQMRALAKEKLDKKVEFDAS 396
Query: 301 -------SCP---------------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQAD 338
P +PP V DQI+LW+ E +R+ T EG L+ F +Q +
Sbjct: 397 HNINTAGGAPQSKTDGAISQHKLEVIPPNVVDQIKLWQLELDRIQTVEGYLFKDFANQQE 456
Query: 339 FEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
++ L +YA +LGVL W ++ KR VTK G + + F
Sbjct: 457 YDTLSNYATELGVLVWGDKVKRKFFVTKDGMAQVADF 493
>gi|119179887|ref|XP_001241468.1| hypothetical protein CIMG_08631 [Coccidioides immitis RS]
gi|392866653|gb|EAS30172.2| transcription factor Tfb2 [Coccidioides immitis RS]
Length = 487
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 153/444 (34%), Positives = 222/444 (50%), Gaps = 72/444 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW---------------------LSKTHAKEHKEA 40
L +AK +V+ LL+++ P+P A + W LS T +H A
Sbjct: 38 LPHLAKCFVMALLYLKDPLPAAELELWVKAGSKRERDNALSILSRLHILSSTTTSDHVRA 97
Query: 41 TDTLTELRRNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ 99
+ + AL GG K S + D + L+ YA +WE +L +MVG+
Sbjct: 98 YMVTNPFSSSLRQALTGGDKQHSFGVMSSMPDPHPMTVSDLDDYARRQWEGVLGYMVGTN 157
Query: 100 ----QTEGIS-ADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
Q E ++ + V++L V ITK+GF F+L + QVW+ ++ Y+E
Sbjct: 158 SLGIQRENVTLSKGVKSLLQACHLVEVRDRRVEITKEGFAFVLQDVNTQVWHILILYVEN 217
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGR 206
E+ G+D VE L+FLF L G Y + ++ L L +FG+VYQ ++ R
Sbjct: 218 AEAIGMDSVEVLSFLFLLSSLELGQSYDKKHLTSTQLRTLADLTDFGIVYQHSPASESTR 277
Query: 207 FYPTKLAL-----NMATRGTLKQIREP--------------GFLIVETNFRVYAYTDSNL 247
FYPT+LA +MA ++ P GF+I+ETN+R+YAYT S L
Sbjct: 278 FYPTRLATTLTSDSMALSSSISGNLAPAGPNINAATGAPGTGFIIIETNYRLYAYTSSPL 337
Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------- 300
+++LI LF L YRFPNL G LTR S+R A+ GITA QII +L HA P +
Sbjct: 338 QISLIALFTTLKYRFPNLVTGKLTRQSIRRAVEMGITADQIISYLTTHAHPQMRKVNASK 397
Query: 301 ---------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGV 351
+ LPP V DQIRLW+ ER+RL + G L+ F S A+FE YA+++GV
Sbjct: 398 STSTTAGLLASVLPPTVVDQIRLWQLERDRLKATPGFLFKDFASSAEFEAPCRYAEEIGV 457
Query: 352 LTWQNEKKRTVVVTKQGHSDIKKF 375
L W++EK+R VT+ H + F
Sbjct: 458 LVWKSEKRRMFFVTR--HEQVAAF 479
>gi|365986883|ref|XP_003670273.1| hypothetical protein NDAI_0E02130 [Naumovozyma dairenensis CBS 421]
gi|343769043|emb|CCD25030.1| hypothetical protein NDAI_0E02130 [Naumovozyma dairenensis CBS 421]
Length = 522
Score = 224 bits (570), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 149/471 (31%), Positives = 226/471 (47%), Gaps = 101/471 (21%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +++K +V+ ++F V + W+ + ++A ++ L
Sbjct: 39 LPQLSKFFVMSMVFNNNEVSLRDLDRWVKPDAKLQFQDAIKSMKSLHLIIPSKGNGGPLM 98
Query: 48 -------RRNAKVALLGGGKPWSMSAKLEVDSKARDLD-FLNQYALERWECILRFMVGSQ 99
R + K AL GG S + D LN+Y+ +WE IL FMVG+
Sbjct: 99 IILNPTFRESFKNALTGGQINNSFGIISDEDENGVVTSTLLNEYSANKWETILHFMVGTP 158
Query: 100 QTEGISADAVRTL----FNEGAENP---VITKDGFQFLLLETPAQVWYFILKYLETVESK 152
+ S + L E EN IT +GFQFLL E +Q+W +L+YL+ E+
Sbjct: 159 MSSIPSGSVLNLLKHTKLMEEVENTGEFKITNEGFQFLLQELNSQLWTLLLQYLKLSETL 218
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
+D V+ L F+F L G YS +G+S+ ++ L+ +R++GLV+Q+ + FYPT L
Sbjct: 219 NMDPVDVLNFIFMLGALEVGKGYSIDGLSETQKIMLKDMRDYGLVFQKVSNSKTFYPTNL 278
Query: 213 ALNMATRGTLKQIR------------------------EPGF------------------ 230
AL M T T +R EPG
Sbjct: 279 AL-MLTSDTKSIVRTASGAIESVLNENRSGSNANENGYEPGTKKKNENTIMGTIDQVGMK 337
Query: 231 --------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSG 282
LIVETNF++Y+Y++S L++A++ LF L +RF N+ G +TR+S+R AL +G
Sbjct: 338 NQDVPDGSLIVETNFKLYSYSNSPLQIAILSLFVHLKFRFVNMVTGQITRESIRRALING 397
Query: 283 ITAAQIIGFLRLHALPSV----------------SCP-----LPPVVSDQIRLWEGERER 321
ITA QII +L HA P + +C LPP V DQIRLW+ E +R
Sbjct: 398 ITAEQIIAYLETHAHPQMRRLAEEKLEKKLELDANCKDSLQILPPTVVDQIRLWQLELDR 457
Query: 322 LTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
+ EG LYS F + ++ L YAQD+GVL W+++KKR V+K+G+S +
Sbjct: 458 VIAYEGSLYSDFENNTEYTTLYKYAQDIGVLLWKDDKKRKFFVSKEGNSQV 508
>gi|406862938|gb|EKD15987.1| tfiih and nucleotide excision repair factor 3 complexes subunit
[Marssonina brunnea f. sp. 'multigermtubi' MB_m1]
Length = 421
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 139/391 (35%), Positives = 207/391 (52%), Gaps = 45/391 (11%)
Query: 27 SWLSKTH-------AKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKAR--DL 77
S LS+ H +++ + R + ++AL GGG+ S DS A D
Sbjct: 23 SLLSRLHIVTITAPSRDDPQKVSLTKNFRASLRLALTGGGEQQSFGIPSH-DSTASGVDA 81
Query: 78 DFLNQYALERWECILRFMVGS-----QQTEGISADAVRTLFNEGA-------ENPVITKD 125
+FL+ YA ERWE IL F+V S +Q +VR L G IT+
Sbjct: 82 EFLDNYARERWEDILHFVVNSVGNSMRQDGTGPPTSVRLLLEAGKLVTTGRRSGGGITQA 141
Query: 126 GFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQ 185
GF FLL E AQVW +L ++E E L+ VE L+FLF L G YST +S +
Sbjct: 142 GFSFLLQEVNAQVWALLLLWIENAEKMELESVEVLSFLFMLGSLELGKSYSTATLSASQR 201
Query: 186 VFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT-LKQIRE------------PGFLI 232
L++L +FGLVY +F+PT+L+ + + + L+ + GF+I
Sbjct: 202 AMLKYLVDFGLVYSPSSTPQQFFPTRLSTTLTSDASGLRSVSAGFDDALKSESGTKGFII 261
Query: 233 VETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFL 292
+ETN+R+YAYT+S L++A++ LF ++ R+PN+ G +TR SV A+ GIT+ QII +L
Sbjct: 262 IETNYRLYAYTNSPLQIAVLALFTKMGVRYPNMVTGRVTRKSVANAISHGITSDQIISYL 321
Query: 293 RLHALPSV--------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRD 344
R HA P + + LPP V DQIRLW+ E ER+ + G L+ +F +Q ++E
Sbjct: 322 RAHAHPQLVKAAAVNGNPVLPPTVVDQIRLWQIENERMKATAGFLFKEFETQKEYEGCAR 381
Query: 345 YAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
YA + GVL W+N+ KR VT+ H ++ +
Sbjct: 382 YADENGVLVWKNDAKRMFFVTR--HEQLRDY 410
>gi|146414792|ref|XP_001483366.1| hypothetical protein PGUG_04095 [Meyerozyma guilliermondii ATCC
6260]
Length = 498
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 147/455 (32%), Positives = 215/455 (47%), Gaps = 83/455 (18%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW---------------LSKTHAKEHKEATDTL-- 44
L+ IAK +++ ++F E+P + W L H E + L
Sbjct: 39 LTPIAKFFIMLMIFNERPQSVRDLDKWCRPQSKSLQYDALKRLKSLHLVEEDAGGNFLRL 98
Query: 45 -TELRRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
R+N + L G + + + E D + FL+ +AL++WE IL +MVG++
Sbjct: 99 HMTFRKNFRDCLTGNQEATAFGNLCTEDDGHKVTVAFLDSFALQKWETILHYMVGTELPS 158
Query: 103 GISADAVRTL----FNEG----AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
S + L EG A N IT GFQFLL + AQ+W +L+YL + +
Sbjct: 159 KPSPSVLSLLKLGGLMEGPGTDARNLKITNSGFQFLLQDMNAQIWTLLLQYLNLTQDLNM 218
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
D V+ L FLF L G Y G+SD L LR++GLVYQ ++ RFYPT+LA
Sbjct: 219 DPVDVLNFLFVLGSLELGKSYYLSGLSDTQVSMLADLRDYGLVYQYSDQSDRFYPTRLAT 278
Query: 215 NMATRGTLKQIREP--------------------GFLIVETNFRVYAYTDSNLKVALIGL 254
+ + ++ P G +I+ETNF++YAYT+S L++A++ L
Sbjct: 279 TLTSESA--SLKTPSMALDQAVDTSNGLDSGPGQGSIILETNFKLYAYTNSPLEIAILNL 336
Query: 255 FCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC------------ 302
F L RF N+ G +TR+S+R AL +GIT+ QII FL HA P +
Sbjct: 337 FVNLRTRFSNMVSGQITRESIRNALYNGITSDQIINFLETHAHPQMRALAKERLDKKVEF 396
Query: 303 ----------------------PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFE 340
LPP V DQI+LW+ E +R+ EG L+ +F SQ +F+
Sbjct: 397 DASNNINTAGGGQNSSAQHKLEILPPTVVDQIKLWQLELDRIQAFEGYLFKEFASQQEFD 456
Query: 341 KLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
L +YA D+GVL W + K VTK G S + F
Sbjct: 457 VLCNYASDVGVLIWSDRTKMRFFVTKDGISQVADF 491
>gi|303321033|ref|XP_003070511.1| Transcription factor tfb2 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240110207|gb|EER28366.1| Transcription factor tfb2 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320036051|gb|EFW17991.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Coccidioides posadasii str. Silveira]
Length = 487
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 153/444 (34%), Positives = 221/444 (49%), Gaps = 72/444 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW---------------------LSKTHAKEHKEA 40
L +AK +V+ LL+++ P+P A + W LS T +H A
Sbjct: 38 LPHLAKCFVMALLYLKDPLPAAELELWVKAGSKRERDNALSILSRLHILSSTTTSDHVRA 97
Query: 41 TDTLTELRRNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ 99
+ + AL GG K S + D + L+ YA +WE +L +MVG+
Sbjct: 98 YMVTNPFSSSLRQALTGGDKQHSFGVMSSMPDPHPMTVSDLDDYARRQWEGVLGYMVGTN 157
Query: 100 ----QTEGIS-ADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
Q E ++ + V++L V ITK+GF F+L + QVW+ ++ Y+E
Sbjct: 158 SLGIQRENVTLSKGVKSLLQACHLVEVRDRRVEITKEGFAFVLQDVNTQVWHILILYVEN 217
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGR 206
E+ G+D VE L+FLF L G Y + ++ L L +FG+VYQ ++ R
Sbjct: 218 AEAIGMDSVEVLSFLFLLSSLELGQSYDKKHLTSTQLRTLADLTDFGIVYQHSPASESTR 277
Query: 207 FYPTKLAL-----NMATRGTLKQIREP--------------GFLIVETNFRVYAYTDSNL 247
FYPT+LA +MA + P GF+I+ETN+R+YAYT S L
Sbjct: 278 FYPTRLATTLTSDSMALSSPISGNLAPAGPNINAATGAPGTGFIIIETNYRLYAYTSSPL 337
Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------- 300
+++LI LF L YRFPNL G LTR S+R A+ GITA QII +L HA P +
Sbjct: 338 QISLIALFTTLKYRFPNLVTGKLTRQSIRRAVEMGITADQIISYLTTHAHPQMRKVNASK 397
Query: 301 ---------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGV 351
+ LPP V DQIRLW+ ER+RL + G L+ F S A+FE YA+++GV
Sbjct: 398 STSTTAGLPASVLPPTVVDQIRLWQLERDRLKATPGFLFKDFASSAEFEAPCRYAEEIGV 457
Query: 352 LTWQNEKKRTVVVTKQGHSDIKKF 375
L W++EK+R VT+ H + F
Sbjct: 458 LVWKSEKRRMFFVTR--HEQVAAF 479
>gi|156839389|ref|XP_001643386.1| hypothetical protein Kpol_479p16 [Vanderwaltozyma polyspora DSM
70294]
gi|156113994|gb|EDO15528.1| hypothetical protein Kpol_479p16 [Vanderwaltozyma polyspora DSM
70294]
Length = 504
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 136/456 (29%), Positives = 224/456 (49%), Gaps = 86/456 (18%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L ++AK +++ ++F + + + W+ + + +E+ + L
Sbjct: 39 LPQLAKFFIMSMVFNDNEISLRDLDRWVKASAKVQFQESIKCMKSLNLLIPGKEPGSIMV 98
Query: 48 ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
R++ + AL GG S ++ D+ L+ Y+ ++WE IL FMVG+
Sbjct: 99 NLNPTFRKSLRNALSGGEINNSFGIVVDSDNDLVTRAMLDSYSADKWETILHFMVGTPLV 158
Query: 102 EGISADAVRTLFN--------EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
I +D V L E IT +GFQFLL E Q+W +L+YL+ E+
Sbjct: 159 -SIPSDNVLNLLKHSKLMEEVESTGEFKITNEGFQFLLQEVNLQIWTLLLQYLKMAETFR 217
Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
++ V+ L F+F L FG YSTE +S + ++ +R++GL++Q+ FYPT+LA
Sbjct: 218 MNPVDVLHFIFMLGALEFGKAYSTEELSATQKTMMKDMRDYGLIFQKNSNTNVFYPTRLA 277
Query: 214 LNMATR------------------------------------GTLKQIREPGFLIVETNF 237
+ + GT Q + G L+VETNF
Sbjct: 278 TMLTSDSRTVRNASSAMDSVLTQSKDEPSGATSGSADVDEQVGTQGQNIQDGALVVETNF 337
Query: 238 RVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
++Y+Y++S L++A++ LF L RF N+ G +TR+S+R AL +GITA QII +L HA
Sbjct: 338 KLYSYSNSPLQIAILSLFVHLKTRFSNMVTGQITRESIRNALSNGITAEQIIAYLETHAH 397
Query: 298 PSVS---------------------CPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQ 336
P + LPP V DQI+LW+ E +R+ + EG LYS F ++
Sbjct: 398 PQMRRLAEERLKKKLELDQNSKETLQILPPTVVDQIKLWQLELDRIISYEGSLYSDFDNK 457
Query: 337 ADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
+++ L YAQD+GVL W++++KR V+K+ +S +
Sbjct: 458 QEYDMLYSYAQDIGVLIWKDDRKRKFFVSKESNSQV 493
>gi|190347675|gb|EDK39997.2| hypothetical protein PGUG_04095 [Meyerozyma guilliermondii ATCC
6260]
Length = 498
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 145/455 (31%), Positives = 214/455 (47%), Gaps = 83/455 (18%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW---------------LSKTHAKEHKEATDTL-- 44
L+ IAK +++ ++F E+P + W L H E + L
Sbjct: 39 LTPIAKFFIMSMIFNERPQSVRDLDKWCRPQSKSLQYDALKRLKSLHLVEEDAGGNFLRL 98
Query: 45 -TELRRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
R+N + L G + + + E D + FL+ +A ++WE IL +MVG++
Sbjct: 99 HMTFRKNFRDCLTGNQEATAFGNLCTEDDGHKVTVAFLDSFASQKWETILHYMVGTELPS 158
Query: 103 GISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
S + L + G A N IT GFQFLL + AQ+W +L+YL + +
Sbjct: 159 KPSPSVLSLLKSGGLMEGPGTDARNLKITNSGFQFLLQDMNAQIWTLLLQYLNLTQDLNM 218
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
D V+ L FLF L G Y G+SD L LR++GLVYQ ++ RFYPT+LA
Sbjct: 219 DPVDVLNFLFVLGSLELGKSYYLSGLSDTQVSMLADLRDYGLVYQYSDQSDRFYPTRLAT 278
Query: 215 NMATRGTLKQIREP--------------------GFLIVETNFRVYAYTDSNLKVALIGL 254
+ + ++ P G +I+ETNF++YAYT+S L++A++ L
Sbjct: 279 TLTSESA--SLKTPSMALDQAVDTSNGLDSGPGQGSIILETNFKLYAYTNSPLEIAILNL 336
Query: 255 FCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC------------ 302
F L RF N+ G +TR+S+R AL +GIT+ QII FL HA P +
Sbjct: 337 FVNLRTRFSNMVSGQITRESIRNALYNGITSDQIINFLETHAHPQMRALAKERLDKKVEF 396
Query: 303 ----------------------PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFE 340
LPP V DQI+LW+ E +R+ EG L+ +F SQ +F+
Sbjct: 397 DASNNINTAGGGQNSSAQHKLEILPPTVVDQIKLWQLELDRIQAFEGYLFKEFASQQEFD 456
Query: 341 KLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
L +YA D+GVL W + K VTK G S + F
Sbjct: 457 VLCNYASDVGVLIWSDRTKMRFFVTKDGISQVADF 491
>gi|45187599|ref|NP_983822.1| ADL274Wp [Ashbya gossypii ATCC 10895]
gi|62901343|sp|Q75B51.1|TFB2_ASHGO RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|44982337|gb|AAS51646.1| ADL274Wp [Ashbya gossypii ATCC 10895]
gi|374107034|gb|AEY95942.1| FADL274Wp [Ashbya gossypii FDAG1]
Length = 514
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 137/459 (29%), Positives = 228/459 (49%), Gaps = 85/459 (18%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
LS +AK +++ ++F ++ V + W+ + +A ++ L
Sbjct: 48 LSPMAKFFIMSMIFQDEEVSLRDLDRWVKPDAKFQLHDAIKSMKLLHLITEGRSGQPLMV 107
Query: 48 ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
+ + K AL GG S +E ++ + L+ YA ++WE IL FMVG+ T
Sbjct: 108 QLNSIFKESFKNALTGGEVKNSFGNVVEEENDPVTMAMLDSYAADKWETILHFMVGTPLT 167
Query: 102 EGISADAVRTL-------FNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
+ + + L +E ++ IT +GFQFLL + AQ+W +L+YL E+ +
Sbjct: 168 KSPGKNVLSLLRHSKLMEVDESSKELKITNEGFQFLLQDANAQIWTLLLQYLTMAETFQM 227
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
D V+ L +F + G YS G+S+ + LQ LR++GLV+Q++ +FYPT+LA
Sbjct: 228 DPVDVLNLIFMIGALELGKAYSVVGLSETQKTMLQDLRDYGLVFQKQSNLSKFYPTRLAT 287
Query: 215 NM------------ATRGTLKQIRE-------------------------PGFLIVETNF 237
+ A L+Q E G LIVETNF
Sbjct: 288 MLTSDVVSIRSASGAVNSVLRQRAEGVDGKVLNGTALGDDDLQAGGEGALDGALIVETNF 347
Query: 238 RVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
++Y+Y++S L++A++ LF L RF N+ G +TR+S+R AL +GITA QII ++ HA
Sbjct: 348 KLYSYSNSPLQIAILSLFIHLKTRFQNMVTGQITRESIRRALHNGITADQIIAYMETHAH 407
Query: 298 PSV----------------SC-----PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQ 336
P + +C LPP V DQI+LW+ E +R+ + +G L+ F +
Sbjct: 408 PQMRRLAGDNLEKKLELDPNCRDTLQVLPPTVVDQIKLWQLELDRIISYDGYLFRDFDNL 467
Query: 337 ADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+++ L YA+D+GVL W ++KK+ V+K+G++ + F
Sbjct: 468 QEYQVLAQYARDIGVLLWSDDKKKMFFVSKEGNAQVIDF 506
>gi|255713706|ref|XP_002553135.1| KLTH0D09790p [Lachancea thermotolerans]
gi|238934515|emb|CAR22697.1| KLTH0D09790p [Lachancea thermotolerans CBS 6340]
Length = 506
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 138/457 (30%), Positives = 227/457 (49%), Gaps = 86/457 (18%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L ++AK +++ L+F E + W+ + EA ++ L
Sbjct: 39 LPKLAKFFIMSLVFNETDASLRDLDRWVKSGGKYQFNEAIKSMKSLHLLIEGNSGQPFMI 98
Query: 48 ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
R + + AL GG S +E +++ L+QYA +WE IL FMVG+ T
Sbjct: 99 NLNPIFRSSFRNALTGGEVNNSFGNVVEDENETVHTSILDQYAANKWETILHFMVGTPLT 158
Query: 102 EGISADAVR-----TLFNEGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
+ S + + L E + + IT +GFQFLL + AQ+W +L+YL E+ +D
Sbjct: 159 QTPSRNVLSLLQHSKLMEESSSGELQITNEGFQFLLQDANAQIWALLLQYLRLAETLQMD 218
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL--- 212
V+ L F+F L G YS +SD ++ LQ +R++GLV+Q+ +FYPT+L
Sbjct: 219 PVDVLNFIFMLGALELGKAYSDTNLSDTQKIMLQDMRDYGLVFQKASNTHKFYPTRLTAM 278
Query: 213 -------------ALN-MATRGTLKQIREP----------------------GFLIVETN 236
A+N + ++GT E G +I+ETN
Sbjct: 279 LTSDTSSIRSASGAMNSVLSQGTSSSKEEAAAVAETDEETAQVGGTTQNIPDGAVILETN 338
Query: 237 FRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHA 296
F++Y+Y++S L++A++ LF L RF N+ G +TR+S+R AL +GITA QII ++ HA
Sbjct: 339 FKLYSYSNSPLQIAVLSLFVHLKSRFSNMVTGQITRESIRRALHNGITADQIIAYMETHA 398
Query: 297 LPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGVLYSQFVS 335
P + +C LPP V DQI+LW+ E +R+ + +G L++ F +
Sbjct: 399 HPQMRRLAEQTLDKKMELDPNCNEGLQILPPTVVDQIKLWQLELDRIISYDGYLFTDFEN 458
Query: 336 QADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
++ L YA+D+GVL W ++KK+ V+++G+S +
Sbjct: 459 FQEYNMLSSYAKDIGVLLWSDDKKKRFFVSQEGNSQV 495
>gi|366991461|ref|XP_003675496.1| hypothetical protein NCAS_0C01390 [Naumovozyma castellii CBS 4309]
gi|342301361|emb|CCC69129.1| hypothetical protein NCAS_0C01390 [Naumovozyma castellii CBS 4309]
Length = 516
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 146/467 (31%), Positives = 230/467 (49%), Gaps = 96/467 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L ++AK +V+ ++F ++ V + W+ + ++A ++ L
Sbjct: 39 LPQLAKFFVMSMVFNDKEVSLRDLDRWVKSNGKMQFQDAIKSMKSLHLIIPSKSNGGPLM 98
Query: 48 -------RRNAKVALLGG--GKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
R + K AL GG + + A + D+ L L++Y+ +WE IL FMVG+
Sbjct: 99 INLNPTFRESFKNALTGGQVDNSFGIVADDDDDNGVVSLQLLDEYSANKWETILHFMVGT 158
Query: 99 QQTEGISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ SA + L + V IT +GFQFLL E +Q+W +L+YL+ E+
Sbjct: 159 PMSSIPSASVLNLLKHTRLMEEVDHTSEFKITNEGFQFLLQELNSQIWTLLLQYLKMSET 218
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+D V+ L F+F L G YS +G+S+ ++ L+ +R++GLV+Q+ + FYPT
Sbjct: 219 LKMDSVDVLNFIFMLGALEVGKGYSIDGLSETQKIMLKDMRDYGLVFQKYTNSNLFYPTN 278
Query: 212 LALNM-------------ATRGTLKQIREP------------------------------ 228
LAL + A L+ RE
Sbjct: 279 LALMLTSDTKSIVRTASGALESVLQNKREDSSKQANGEINKDLITNGDELDQVGYNPQDI 338
Query: 229 --GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAA 286
G LI+ETNF++Y+Y++S L++A + LF L RF N+ G +TR+S+R AL +GITA
Sbjct: 339 PDGSLIIETNFKLYSYSNSPLQIATLSLFVHLKSRFANMVTGQITRESIRRALINGITAD 398
Query: 287 QIIGFLRLHALPSV------------------SCP---LPPVVSDQIRLWEGERERLTTS 325
QII +L HA P + P LPP V DQI+LW+ E +R+ T
Sbjct: 399 QIIAYLETHAHPQMRRLAEERLEKKLELDPNSKDPLQVLPPTVVDQIKLWQLELDRVITY 458
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
EG LYS F + A++ L YAQD+GVL W+++KK+ V+K+G+S +
Sbjct: 459 EGSLYSDFETIAEYTTLSKYAQDIGVLLWKDDKKKKFFVSKEGNSQV 505
>gi|342319113|gb|EGU11064.1| RNA polymerase II transcription factor B subunit 2 [Rhodotorula
glutinis ATCC 204091]
Length = 496
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 131/350 (37%), Positives = 187/350 (53%), Gaps = 25/350 (7%)
Query: 47 LRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
R + ++AL GGGK S A + + FL+ +A +WE I FMVGS
Sbjct: 130 FRDSFRMALTGGGKQGSFGAPAAEQDEEVTVQFLDDHAEVQWETIQHFMVGSDAGAKKPG 189
Query: 107 DAVRTLFNEGA---------ENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLV 157
+ V +L N IT GFQFLL + Q+W +L YLE + DLV
Sbjct: 190 EKVLSLLERSGLMYSPTRSLRNMRITSKGFQFLLEDVNTQLWDLLLVYLEGSQ----DLV 245
Query: 158 ECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMA 217
E + FLF L G Y T+ +S L+ L ++GLVY +R A FYPT+LA +
Sbjct: 246 ETIGFLFMLGSLELGRAYMTDNLSQIQHGVLRDLADYGLVYLPERNAPIFYPTRLATTLT 305
Query: 218 TRG-----TLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
+ + E GF+++ETN+++YAYT + L++A++GLF L RF N G +TR
Sbjct: 306 SSAPPLVSSRHSNEEKGFIVLETNYKLYAYTSNPLQIAVLGLFAHLKTRFANFVTGHITR 365
Query: 273 DSVRAALRSGITAAQIIGFLRLHALPSVSCP-------LPPVVSDQIRLWEGERERLTTS 325
+S+R L +GITA QII +L A P + LP V DQIRLWE ER R+ T+
Sbjct: 366 ESIRRGLANGITANQIISYLASRAHPQMRAQAGSDDKLLPITVVDQIRLWEHERRRIQTT 425
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
EG LY +F S D+E + +YA+++G + + K R V VT GH +++F
Sbjct: 426 EGYLYDEFSSTHDYELVVNYAREIGSVLLELPKARKVFVTADGHQQVREF 475
>gi|154308354|ref|XP_001553513.1| hypothetical protein BC1G_08237 [Botryotinia fuckeliana B05.10]
Length = 410
Score = 221 bits (562), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 130/349 (37%), Positives = 196/349 (56%), Gaps = 35/349 (10%)
Query: 52 KVALLGGGKPWSMSAKLEVDSKAR--DLDFLNQYALERWECILRFMV-----GSQQTEGI 104
++AL GGG S D A D+DFL+ +A +WE IL +MV GS +
Sbjct: 49 RLALTGGGNHQSFGVP-SSDHIAPHVDIDFLDTHARMQWEGILHYMVNTVTSGSGKDGNG 107
Query: 105 SADAVRTLFNEG-----AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
A++V+ L + G IT+ GF FLL E AQVW +L ++E ES G+D V+
Sbjct: 108 PANSVKALLDAGKLVTRGRGIGITQAGFSFLLQEANAQVWTLLLLWIENAESMGMDSVDV 167
Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR 219
L+FLF L G YST+ +++ + L +L + GL+Y +F+PT+LA + +
Sbjct: 168 LSFLFMLGSLELGRAYSTKTLTEAQKGMLANLIDLGLIYLPPSAPTQFFPTRLATTLTSD 227
Query: 220 GT-LKQI------------REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLT 266
+ L+ + + GF+I+ETN+R+YAYT+S L++A++ LF +L R+PN+
Sbjct: 228 ASALRTVAAGFDAASKSAASQKGFIIIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMV 287
Query: 267 VGVLTRDSVRAALRSGITAAQIIGFLRLHALP--------SVSCP-LPPVVSDQIRLWEG 317
G ++RDS+R A+ GIT+ QII +L HA P S P LPP V DQIRLW+
Sbjct: 288 SGRVSRDSIRTAIAHGITSDQIITYLSTHAHPQLVKASSASHGGPVLPPTVVDQIRLWQL 347
Query: 318 ERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
E ER+ G L F +Q ++E YA+++GVL W+++ KR VT+
Sbjct: 348 ENERMKAVPGFLMKDFETQKEYEGCAKYAEEVGVLVWKSDAKRMFFVTR 396
>gi|255936123|ref|XP_002559088.1| Pc13g06540 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211583708|emb|CAP91723.1| Pc13g06540 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 483
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 148/448 (33%), Positives = 223/448 (49%), Gaps = 68/448 (15%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR--------RNAK 52
+L +AK +V+ LL+++ P+P A + W++ KE A L L N +
Sbjct: 40 DLVNVAKCFVMALLYLKDPLPTADLEVWVNSESKKERDNALSILGRLHILSTTLTGDNVR 99
Query: 53 VALLGGGKPWSMSAKLEVDSKARDLDF----------------LNQYALERWECILRFMV 96
++ P++ S + + F L++YA +WE ++ +MV
Sbjct: 100 AYMIT--NPFAASLRQALTGAEDSHSFGVFSQAPEHTLTSITDLDEYARRQWEGVMGYMV 157
Query: 97 GSQQTEGIS-----ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKY 145
G+ G + V+ L G ITKDGF F+L + QVW+ ++ Y
Sbjct: 158 GTSALSGQRDMVNLSKGVKQLLQAGHLVEIRGNRVDITKDGFGFVLQDVNTQVWHILILY 217
Query: 146 LETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG 205
+E+ E+ G+D VE L+FLF L G+ Y M+ L L +FG+VYQ A
Sbjct: 218 VESAEAIGMDSVEVLSFLFLLSSLELGSSYDKSHMTPNQLRTLMDLADFGIVYQEHADAT 277
Query: 206 RFYPTKLALNMATRGTLKQIREPG---------------FLIVETNFRVYAYTDSNLKVA 250
RFYPT+LA + + + G F+I+ETN+R+YAYT S L+++
Sbjct: 278 RFYPTRLATTLTSDSSALSNPVTGSLTGPAGPSGGSGSGFIIIETNYRLYAYTSSPLQIS 337
Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP-----SVSCP-- 303
LI LF L YRFPNL G +TR SVR A+ GITA QII +L HA P S + P
Sbjct: 338 LISLFTNLKYRFPNLVTGKITRQSVRRAIEMGITADQIISYLLSHAHPQLRKHSAAQPNG 397
Query: 304 -------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQN 356
LPP V+DQIRLW+ ER+RL + G L+ F S A+++ YA ++GVL W++
Sbjct: 398 KGVPASVLPPTVTDQIRLWQLERDRLRATAGFLFKDFTSLAEYQAPCQYAAEIGVLVWKS 457
Query: 357 EKKRTVVVTKQGHSDIKKFWVNHQKGSQ 384
++KR VT+ H + F + + G +
Sbjct: 458 DRKRMFFVTR--HEQVAAFLRSQKVGGK 483
>gi|167997877|ref|XP_001751645.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697626|gb|EDQ83962.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 462
Score = 220 bits (561), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 144/419 (34%), Positives = 211/419 (50%), Gaps = 38/419 (9%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK YV+RLL+VE VP W +H+ A D L +LR
Sbjct: 40 LPPLAKQYVLRLLWVENAVPLKSWQEWARPEALSKHQIAIDRLEQLRVILPERSDLSNST 99
Query: 49 ---------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV--- 96
+ + AL GG P V ++ + L YA+++WE +L +V
Sbjct: 100 HYLMNPKLQKQLRQALSTGGGPPRDVVSESVAARVPNSADLENYAMKQWESVLLQLVDCA 159
Query: 97 -----GSQQTEGISADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
G + I L E+P +T GFQFLL++T +Q+W + +Y+ + E
Sbjct: 160 ADGPAGPKNPFIIKVFQRSGLLTPENESPSLTDLGFQFLLMDTNSQLWQLVREYVTSSED 219
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPT 210
+G D E + FL +L F G YS +S LQ L L GLV Q+ K + PT
Sbjct: 220 RGTDSGELVGFLLELGFHLVGEAYSVNSLSPALQKVLDELAALGLVKLQQGMKESWYIPT 279
Query: 211 KLALNMA---TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTV 267
KLA N++ + T Q E GF++VETNF+VYAYT S L+ ++ F L Y+ PNL V
Sbjct: 280 KLASNLSASLSESTDWQSSE-GFVVVETNFKVYAYTSSKLQTEILRCFTRLEYQLPNLVV 338
Query: 268 GVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTT 324
LT++SV AL SGI+A QII FLR HA P V+ +P V VSDQ+RLWE +R R+
Sbjct: 339 ATLTKESVNKALGSGISAEQIISFLRKHAHPHVAQKIPVVPETVSDQLRLWETDRNRVQF 398
Query: 325 SEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
Y F + A +E + +A+DLG L +++ + ++V H D++++ G+
Sbjct: 399 EPAYFYDDFPTMAIYEAVVAHARDLGGLLFEDASAKRLIVRSDLHEDMRQYIRKQSSGA 457
>gi|149238950|ref|XP_001525351.1| TFIIH basal transcription factor complex subunit 2 [Lodderomyces
elongisporus NRRL YB-4239]
gi|146450844|gb|EDK45100.1| TFIIH basal transcription factor complex subunit 2 [Lodderomyces
elongisporus NRRL YB-4239]
Length = 503
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 144/463 (31%), Positives = 218/463 (47%), Gaps = 93/463 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
LS +AK Y++ ++F E+PV + W K +A L L
Sbjct: 39 LSSMAKFYIMSMIFTEKPVSMKDLTKWCKPLAKKLEFDALKRLESLHLIEFDSKGTHLRL 98
Query: 48 ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
R N + L G P + S D DL FL+ +A ++WE IL FMVG++ T+
Sbjct: 99 NTIFRTNFRDCLTGSQDPNAFGSVSTTADPNKVDLAFLDSFASQKWESILHFMVGTEGTK 158
Query: 103 GISADAVRTLF---------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
S + L +G N IT GFQFLL + AQ+W +L+YL +
Sbjct: 159 PPSKSVLSLLKLGGLMEQHEEDGTLN--ITNTGFQFLLQDVNAQIWTLLLQYLNLTQELN 216
Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
+D V+ L F+F L G YS +S+ L L+++GLVYQ+ + RFYPT+LA
Sbjct: 217 MDAVDVLNFIFILGSLELGNSYSVSNLSETQVSMLADLKDYGLVYQKSDTSSRFYPTRLA 276
Query: 214 LNMATRGTLKQIREPGF----------------------LIVETNFRVYAYTDSNLKVAL 251
+ + ++ P +I+ETNF++YAYT+S L++A+
Sbjct: 277 TTLTSDSA--ALKTPSMAVEQALDETETSLASSTQTKESIIIETNFKMYAYTNSPLEIAI 334
Query: 252 IGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------- 300
+ LF + RF N+ G +TR+S+R AL +GITA QII FL HA P +
Sbjct: 335 LNLFVHMKTRFSNMICGQITRESIRNALYNGITADQIIKFLETHAHPQMRALAKERLDKK 394
Query: 301 -------------SCP---------------LPPVVSDQIRLWEGERERLTTSEGVLYSQ 332
P LPP V DQI+LW+ E +R+ T +G L+
Sbjct: 395 VEFDASHNINTAGGAPQSKTDGAISQHRLEVLPPNVVDQIKLWQLELDRIQTFDGYLFKD 454
Query: 333 FVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
F +Q +++ L +YA ++GVL W ++ K+ VTK G + + +F
Sbjct: 455 FANQQEYDMLSNYAAEIGVLIWADKVKKKFFVTKDGMTQVAEF 497
>gi|212535854|ref|XP_002148083.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
gi|210070482|gb|EEA24572.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
Length = 481
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 147/432 (34%), Positives = 222/432 (51%), Gaps = 68/432 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEA----------TDTLTELRRNA 51
L ++AK V+ +L++ P+P A + W+ KE A T+T+T + A
Sbjct: 38 LPDLAKCLVMAVLYMRDPLPAADLELWVKAESKKERDHAISILGRLHIMTNTMTSNQVRA 97
Query: 52 -----------KVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ 99
+ AL G S D A + L++YA +WE +L +MVG+
Sbjct: 98 YMVTNPFASSLRQALTGAENSQSFGVPCSTPDPNAVSIADLDEYARRQWEGVLGYMVGTS 157
Query: 100 QT---EGIS-ADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETV 149
E +S + V+ L G + IT+DGF F+L + QVW I+ Y+E+
Sbjct: 158 GPGVRETVSLSRGVKQLLQAGHLVEIHHGRVDITQDGFAFVLQDVNTQVWQIIILYVESA 217
Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS-DKLQVFLQHLREFGLVYQRK--RKAGR 206
++ ++ VE L+F+F L G Y ++ D+L+ L L +FG++YQ +A R
Sbjct: 218 KALQVNDVEVLSFIFLLSSLELGQSYEKRHLTPDQLRA-LADLTDFGIIYQHAPASEADR 276
Query: 207 FYPTKLALNMATRG--------------TLKQIREPG--FLIVETNFRVYAYTDSNLKVA 250
FYPT+LA + + T EPG F+I+ETN+R+YAYT S L+++
Sbjct: 277 FYPTRLATTLTSDSSALSNTMSSSLSAQTNAATGEPGSGFIIIETNYRLYAYTSSPLQIS 336
Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP------------ 298
LI LF L YRFPNL G +TR S+R A+ GITA QII +L HA P
Sbjct: 337 LIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQIISYLSTHAHPQMRKEGAVKSTS 396
Query: 299 -SVSCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
+ P LPP V DQIRLW+ ER+R+ + G L+ +F S A++E YA+++GVL W
Sbjct: 397 NTAGIPRSVLPPTVVDQIRLWQLERDRIKATPGFLFKEFASLAEYEGPCRYAEEIGVLVW 456
Query: 355 QNEKKRTVVVTK 366
++++KR VT+
Sbjct: 457 KSDRKRMFFVTR 468
>gi|354546081|emb|CCE42810.1| hypothetical protein CPAR2_204530 [Candida parapsilosis]
Length = 451
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 135/446 (30%), Positives = 210/446 (47%), Gaps = 85/446 (19%)
Query: 13 LLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------------RRNAKVA 54
++F E+P+P + W + + +A L L R N +
Sbjct: 1 MIFNEKPMPMRDLTKWCKPSAKRMEFDALKRLESLHLIEYDSKGTNVRLNTIFRSNFRDC 60
Query: 55 LLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISADAVRTL- 112
L G P + S + D+ + FL+++A WE IL FMVG++ T S + L
Sbjct: 61 LTGSQDPNAFGSISTDDDADKIETSFLDKFASHEWENILHFMVGTEGTPTPSRSVLSLLK 120
Query: 113 ---FNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFS 169
EG IT GFQFLL + AQ+W +L+YL ++ V+ L F+F L
Sbjct: 121 LGGLMEGESELTITNTGFQFLLQDVNAQIWTLLLQYLNLTSESNMNPVDVLNFIFILGCL 180
Query: 170 TFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPG 229
G YS +S+ L L++ GL+YQ+ + + RFYPT+LA + + + ++ P
Sbjct: 181 ELGKSYSVLSLSETQVSMLADLKDLGLIYQKSKTSSRFYPTRLATTLTSDSS--ALKTPS 238
Query: 230 F---------------------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
+I+ETNF++YAYT+S L++A++ LF ++ RF N+ G
Sbjct: 239 MAVEQALEESEGQMMTSTSRESIIIETNFKIYAYTNSPLEIAILNLFVQMKTRFSNMVCG 298
Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------------------SCP- 303
+TR+S+R AL +GIT+ QII FL HA P + P
Sbjct: 299 QITRESIRNALYNGITSDQIIKFLETHAHPQMKALAKEKLDKKVEFDASHNINTAGGAPQ 358
Query: 304 --------------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDL 349
+PP V DQI+LW+ E +R+ T EG L+ F +Q +++ L +YA +L
Sbjct: 359 SKTDGTISQHKLEVIPPNVVDQIKLWQLELDRIQTVEGYLFKDFANQQEYDMLSNYATEL 418
Query: 350 GVLTWQNEKKRTVVVTKQGHSDIKKF 375
GVL W ++ KR VTK G + + F
Sbjct: 419 GVLVWGDKAKRKFFVTKDGMAQVADF 444
>gi|327298217|ref|XP_003233802.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
Tfb2 [Trichophyton rubrum CBS 118892]
gi|326463980|gb|EGD89433.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
Tfb2 [Trichophyton rubrum CBS 118892]
Length = 484
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 147/442 (33%), Positives = 220/442 (49%), Gaps = 72/442 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW---------------------LSKTHAKEHKEA 40
L +AK +V+ LL+++ P+P + W LS T H A
Sbjct: 38 LPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILGRLHILSSTTTSNHVRA 97
Query: 41 TDTLTELRRNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
+ + AL+G + S + D + L++YA ++WE +L +MVG
Sbjct: 98 YMVTEPFSTSLRQALMGEDQQGSFGVISVTPDKYPVSVADLDEYARQQWEAVLGYMVGMS 157
Query: 100 QTEG-----ISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLET 148
G + V+ L E + V ITK+GF F+L + QVW+ ++ Y+E
Sbjct: 158 SLSGGRETITLSKGVKQLLQACHLVEIRDRRVEITKEGFAFVLQDVNTQVWHILILYVEN 217
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG--R 206
E+ G++ VE L+F+F L G Y + ++ L L +FG+VYQ +G R
Sbjct: 218 AEAIGMESVEVLSFIFLLSSLELGQSYEKKDLTPTQLRTLADLTDFGIVYQHTPASGSTR 277
Query: 207 FYPTKLAL-----NMATRGTLKQIREP-----------------GFLIVETNFRVYAYTD 244
FYPT+LA ++A G + EP GF+IVETN+R+YAYT
Sbjct: 278 FYPTRLATTLTSDSLAMSGPISG--EPTVPTTSAGTTTSGDAGTGFIIVETNYRLYAYTS 335
Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---- 300
S L+++LI LF L +RFPNL G +TR SVR A+ GITA QII +L HA P +
Sbjct: 336 SPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQIISYLTTHAHPQMRKTK 395
Query: 301 -------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
S LPP V DQIRLW+ ER+R+ + G L+ +F + A+FE YA+++GVL
Sbjct: 396 KSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFAEFEAPCKYAEEIGVLV 455
Query: 354 WQNEKKRTVVVTKQGHSDIKKF 375
W+++ +R VT+ H + F
Sbjct: 456 WKSDSRRMFFVTR--HEQVAAF 475
>gi|242794214|ref|XP_002482325.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces stipitatus ATCC 10500]
gi|218718913|gb|EED18333.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces stipitatus ATCC 10500]
Length = 481
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 148/432 (34%), Positives = 222/432 (51%), Gaps = 68/432 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEA----------TDTLTELRRNA 51
L ++AK V+ LL++ P+P A + W+ KE A T+T+T + A
Sbjct: 38 LPDLAKCLVMALLYMRDPLPAADLELWIKGESKKERDHAISILGRLHIMTNTMTSNQVRA 97
Query: 52 -----------KVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ 99
+ AL G S D + L++YA +WE +L +MVG+
Sbjct: 98 YMVTNPFASSLRQALTGAENSQSFGVPCSTPDPNPVSIADLDEYARRQWEGVLGYMVGTS 157
Query: 100 QT---EGIS-ADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETV 149
E +S + V+ L G + IT+DGF F+L + QVW I+ Y+E+
Sbjct: 158 GPGVRETVSLSRGVKQLLQAGHLVEIHHGRVDITQDGFAFVLQDVNTQVWQIIILYVESA 217
Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS-DKLQVFLQHLREFGLVYQ--RKRKAGR 206
++ ++ VE L+F+F L G Y + ++ D+L+ L L +FG+VYQ +A R
Sbjct: 218 KALQVNDVEVLSFIFLLSSLELGQSYEKKHLTPDQLRA-LTDLTDFGIVYQYPSAAEADR 276
Query: 207 FYPTKLALNMATRG--------------TLKQIREPG--FLIVETNFRVYAYTDSNLKVA 250
FYPT+LA + + T EPG F+I+ETN+R+YAYT S L+++
Sbjct: 277 FYPTRLATTLTSDSSALSNTMSSALSAQTNAATGEPGSGFIIIETNYRLYAYTSSPLQIS 336
Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP------------ 298
LI LF L YRFPNL G +TR S+R A+ GITA QII +L HA P
Sbjct: 337 LIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQIISYLSTHAHPQMRKEDAVKSTS 396
Query: 299 -SVSCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
+ P LPP V DQIRLW+ ER+R+ + G L+ +F S A++E YA+++GVL W
Sbjct: 397 NTAGLPRSVLPPTVVDQIRLWQLERDRIKATPGFLFKEFASLAEYEGPCRYAEEIGVLVW 456
Query: 355 QNEKKRTVVVTK 366
++++KR VT+
Sbjct: 457 KSDRKRMFFVTR 468
>gi|323306981|gb|EGA60265.1| Tfb2p [Saccharomyces cerevisiae FostersO]
Length = 448
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 135/388 (34%), Positives = 199/388 (51%), Gaps = 79/388 (20%)
Query: 49 RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISADA 108
RNA L GG+ + S + V+ LD L++Y+ +WE IL FMVG+ + S
Sbjct: 60 RNA----LTGGEVQN-SFGVVVEENVVSLDLLDEYSANKWETILHFMVGTPLAKIPSEKV 114
Query: 109 VRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLT 161
+ L + V IT +GFQFLL E +Q+W +L+YL+ +E+ +DLV+ L
Sbjct: 115 LNLLKHSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKMIETSKMDLVDVLH 174
Query: 162 FLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM----- 216
F+F L G Y + +S+ ++ LQ +R++GLV+Q+ FYPTKLAL +
Sbjct: 175 FIFMLGALEVGKAYKIDALSETQRIMLQDMRDYGLVFQKHSNDSIFYPTKLALMLTSDTK 234
Query: 217 -------ATRGTLKQIRE----------------------------------PGFLIVET 235
A L+Q RE G LIVET
Sbjct: 235 TIRSASNAMDSVLRQNREEPSVNEDGANGKSTTDITTSDDLNKAGLKNQDIPDGSLIVET 294
Query: 236 NFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLH 295
NF++Y+Y++S L++A++ LF L RF N+ +G +TR+S+R AL +GITA QII +L H
Sbjct: 295 NFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQIIAYLETH 354
Query: 296 ALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGVLYSQFV 334
A P + +C LPP V DQIRLW+ E +R+ T EG LYS F
Sbjct: 355 AHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGSLYSDFE 414
Query: 335 SQADFEKLRDYAQDLGVLTWQNEKKRTV 362
+ ++ L YAQD+GVL W+++ K +
Sbjct: 415 TSQEYNLLSKYAQDIGVLLWKDDXKEEI 442
>gi|50291141|ref|XP_448003.1| hypothetical protein [Candida glabrata CBS 138]
gi|62901305|sp|Q6FP41.1|TFB2_CANGA RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|49527314|emb|CAG60954.1| unnamed protein product [Candida glabrata]
Length = 504
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 143/458 (31%), Positives = 227/458 (49%), Gaps = 86/458 (18%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK +++ ++F ++ + + W+ + +EA ++ L
Sbjct: 39 LPTLAKFFIMTVIFNDKDISLRDLDRWVKSNGKLQFQEAIKSMKSLHLLIPTRVNGQLLI 98
Query: 48 ------RRNAKVALLGGGKPWSMSAKLEVD--SKARDLDFLNQYALERWECILRFMVGSQ 99
R + + AL GG S ++ D +L L++YA +WE IL FMVG+
Sbjct: 99 NLNPTFRESFRNALTGGEVNNSFGIVVDEDRLDTVVNLAVLDEYAATKWETILHFMVGTP 158
Query: 100 QTEGISADAVRTLFNEG--AENP-----VITKDGFQFLLLETPAQVWYFILKYLETVESK 152
+ S + + L + E P +IT +GFQFLL E +Q+W +L+YL+ ES
Sbjct: 159 MAKMPSENVLNLLKHSKLMEEVPDSSEFMITNEGFQFLLQEVNSQIWSLLLQYLKLAESL 218
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
+D V L F+F L G YSTE +S+ L +R++GLV+Q+ FYPT+L
Sbjct: 219 HMDPVHVLNFIFMLGALETGKAYSTENLSETQLKMLLDMRDYGLVFQKTSNPNIFYPTRL 278
Query: 213 ALNMATRGTLKQIR----------------------------------EPGFLIVETNFR 238
A M T T K +R + G LI+ETNF+
Sbjct: 279 A-QMLTSDT-KSMRTASGAMESVLNKPDDAAKSTDDKYDSLEGKAEDIQDGALIIETNFK 336
Query: 239 VYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP 298
+Y+Y +S L++A++ LF L RF N+ G +TR+S+R AL +GITA Q+I +L HA P
Sbjct: 337 LYSYCNSPLQIAILSLFVHLKSRFANMVAGQITRESIRRALINGITADQVIAYLESHAHP 396
Query: 299 SV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQA 337
+ +C LPP V DQI+LW+ E +R+ T EG LY F +
Sbjct: 397 QMRRLAEEKLQKKLELDPNCKDPLQVLPPTVVDQIKLWQLELDRVLTYEGSLYIDFDTAQ 456
Query: 338 DFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
DF L YAQD+G L W++++KR + V+++G++ + ++
Sbjct: 457 DFNMLCKYAQDIGALLWKDDRKRKLFVSREGNAQVLEY 494
>gi|326483225|gb|EGE07235.1| transcription factor Tfb2 [Trichophyton equinum CBS 127.97]
Length = 484
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 147/442 (33%), Positives = 219/442 (49%), Gaps = 72/442 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW---------------------LSKTHAKEHKEA 40
L +AK +V+ LL+++ P+P + W LS T H A
Sbjct: 38 LPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILGRLHILSSTTTSNHVRA 97
Query: 41 TDTLTELRRNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
+ + AL+G + S + D + L++YA ++WE +L +MVG
Sbjct: 98 YMVTEPFSTSLRQALMGEDQQGSFGVISVTPDKYPVSVADLDEYARQQWEAVLGYMVGMS 157
Query: 100 QTEG-----ISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLET 148
G + V+ L E + V ITK+GF F+L + QVW+ ++ Y+E
Sbjct: 158 SLSGGRETITLSKGVKQLLQACHLVEIRDRRVEITKEGFAFVLQDVNTQVWHILILYVEN 217
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG--R 206
E+ G++ VE L+F+F L G Y + ++ L L +FG+VYQ +G R
Sbjct: 218 AEAIGMESVEVLSFIFLLSSLELGQSYEKKDLTPTQLRTLADLTDFGIVYQHTPASGSAR 277
Query: 207 FYPTKLAL-----NMATRGTLKQIREP-----------------GFLIVETNFRVYAYTD 244
FYPT+LA ++A G + EP GF+IVETN R+YAYT
Sbjct: 278 FYPTRLATTLTSDSLAMSGPISG--EPAVPTTTSGTTTSGDAGTGFIIVETNHRLYAYTS 335
Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---- 300
S L+++LI LF L +RFPNL G +TR SVR A+ GITA QII +L HA P +
Sbjct: 336 SPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQIISYLTTHAHPQMRKTK 395
Query: 301 -------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
S LPP V DQIRLW+ ER+R+ + G L+ +F + A+FE YA+++GVL
Sbjct: 396 KSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFAEFEAPCKYAEEIGVLV 455
Query: 354 WQNEKKRTVVVTKQGHSDIKKF 375
W+++ +R VT+ H + F
Sbjct: 456 WKSDSRRMFFVTR--HEQVAAF 475
>gi|452977455|gb|EME77221.1| hypothetical protein MYCFIDRAFT_146475 [Pseudocercospora fijiensis
CIRAD86]
Length = 476
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 138/428 (32%), Positives = 219/428 (51%), Gaps = 51/428 (11%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTH----AKEHKEATDTLTELRRNAKVALLG 57
L +AK V+ +L++ P A + +W+ H AKE + A ++ + AL G
Sbjct: 38 LPHMAKTLVMAMLYMPTPFSVADLDTWVKPEHDSAQAKERRPALRLNPAFSKSLRQALTG 97
Query: 58 GGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGS------------QQTEGI 104
GG S D + ++FL+ +A ++WE IL ++VGS T+ +
Sbjct: 98 GGNHRSFGVPCTTPDKSSVTIEFLDTFARKQWETILYYVVGSANSALSGGMDVHNSTKKL 157
Query: 105 SADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG-LDLVECLTFL 163
+ G IT DGF FLL + AQ+W ++ YL+T E++ +D V+ L+FL
Sbjct: 158 LENGEFVQMQNGGRQRFITTDGFTFLLQDVNAQIWSLLIVYLQTSETECFMDPVDVLSFL 217
Query: 164 FQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTL- 222
F L GT YST ++ L+ L + GL++ ++ R+YPT+LA + +
Sbjct: 218 FTLGSLELGTSYSTSNLTKTQLQMLEDLSDLGLIFH-PEQSDRYYPTRLATTLTSDAPAL 276
Query: 223 ---------------------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYR 261
E GF+I+ETN+R+YAYT+S L ++++ LF L R
Sbjct: 277 LNSSHTSTTTTVSASSNDDLAASANEKGFIILETNYRLYAYTNSPLLISILSLFASLHTR 336
Query: 262 FPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------SCP-LPPVVSDQIR 313
+PNL +T+ SV+AA+ SGIT+ QII +L+ HA P + + P LPP V DQIR
Sbjct: 337 YPNLVTAKITKTSVQAAISSGITSNQIISYLQTHAHPILRRTASMHNAPILPPTVVDQIR 396
Query: 314 LWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIK 373
LW+ E ER+T+++G L + S+ D+EK YA+ LG+L K++ VT+ +K
Sbjct: 397 LWQIEGERMTSTKGYLIREVGSKEDYEKAVQYAEALGILVKDFAHKQSFFVTRMDQ--MK 454
Query: 374 KFWVNHQK 381
F+ N +
Sbjct: 455 IFFANQAQ 462
>gi|315041383|ref|XP_003170068.1| TFIIH basal transcription factor complex p52 subunit [Arthroderma
gypseum CBS 118893]
gi|311345102|gb|EFR04305.1| TFIIH basal transcription factor complex p52 subunit [Arthroderma
gypseum CBS 118893]
Length = 484
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 145/442 (32%), Positives = 220/442 (49%), Gaps = 72/442 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW---------------------LSKTHAKEHKEA 40
L +AK +V+ LL+++ P+P + W LS T + H A
Sbjct: 38 LPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILGRLHILSSTTSSSHVRA 97
Query: 41 TDTLTELRRNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
+ + AL+G + S + D + L++YA ++WE +L +MVG
Sbjct: 98 YMVTEPFSTSLRQALMGEDQKGSFGVISVTPDKYPVSVADLDEYARQQWEAVLGYMVGMS 157
Query: 100 QTEG-----ISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLET 148
G + V+ L E + V ITK+GF F+L + QVW+ ++ Y+E
Sbjct: 158 SLSGGRETITLSKGVKQLLQACHLVEIRDRRVEITKEGFAFVLQDVNTQVWHILILYVEN 217
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG--R 206
E+ G++ VE L+F+F L G Y + ++ L L +FG+VYQ +G R
Sbjct: 218 AEAIGMESVEVLSFIFLLSSLELGQSYEKKNLTPTQLRTLADLTDFGIVYQHTPASGSTR 277
Query: 207 FYPTKLAL-----NMATRGTLKQIREPG-----------------FLIVETNFRVYAYTD 244
FYPT+LA ++A G + EP F+IVETN+R+YAYT
Sbjct: 278 FYPTRLATTLTSDSLAMSGPISG--EPTVPAAAAGATTAGDAGTGFIIVETNYRLYAYTS 335
Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---- 300
S L+++LI LF L +RFPNL G +TR SVR A+ GITA QI+ +L HA P +
Sbjct: 336 SPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQIVSYLTTHAHPQMRKTK 395
Query: 301 -------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
S LPP V DQIRLW+ ER+R+ + G L+ +F + A+FE YA+++GVL
Sbjct: 396 KSTTNVSSTVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFAEFEAPCKYAEEIGVLV 455
Query: 354 WQNEKKRTVVVTKQGHSDIKKF 375
W+++ +R VT+ H + F
Sbjct: 456 WRSDSRRMFFVTR--HEQVAAF 475
>gi|358057347|dbj|GAA96696.1| hypothetical protein E5Q_03367 [Mixia osmundae IAM 14324]
Length = 467
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 139/416 (33%), Positives = 215/416 (51%), Gaps = 46/416 (11%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKE-ATDTLTEL------------- 47
L A+H ++ L+ E +PQ + ++ + + + A L L
Sbjct: 47 LPSTAQHLIVNALWQEHDIPQQDLHTYTKRGEGRRQLDTAIAALQRLHIVQLIAGEPTGN 106
Query: 48 ---------RRNAKVALLGGGKPWSMS--AKLEVDSKARDLDFLNQYALERWECILRFMV 96
R++ + AL G GK S A+ D A + L++YA ++W+ +L +V
Sbjct: 107 TVFALSHIFRKSLRKALTGAGKDSSFGWFARSSRDENALSIPQLDEYATDQWDSLLHCLV 166
Query: 97 GSQQTEGISADAVRTLFNEGA------ENPVITKDGFQFLLLETPAQVWYFILKYLETVE 150
GS++++ S + L G + IT GFQFLL + +Q+W +L YL+ E
Sbjct: 167 GSERSQQPSKAVIDLLVAAGLLSSGDRDTRRITSLGFQFLLEDVNSQLWSLLLHYLKLSE 226
Query: 151 SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT 210
G DL E + +FQ+ G YS+E ++ L+ GLVY K+G + PT
Sbjct: 227 DAGTDLKEVIALVFQIGNQELGRVYSSETLNPLQLHILKTFGGLGLVY--VYKSGDYSPT 284
Query: 211 KLALNMAT-------RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFP 263
+LA+ + + GT ++ E GFLI+ETN+RVYAYT + L++A++ LF L RFP
Sbjct: 285 RLAVTLTSGAPPLLKAGTAEE--EQGFLILETNYRVYAYTQNPLQIAVLNLFVALKSRFP 342
Query: 264 NLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERE 320
L VG++TR+S++A L +GI + QII FL HA P + LPP V DQI+LWE E+
Sbjct: 343 GLVVGMITRESIKAGLANGIKSDQIIAFLTAHAHPQMRKQEPLLPPTVVDQIKLWEREKN 402
Query: 321 RLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK-KRTVVVTKQGHSDIKKF 375
R+ T LY F SQAD++ + DYA+ +G + W E R T+ GH ++ F
Sbjct: 403 RVKTEPCFLYDDFRSQADYDLVCDYAKQIGAVLWLGEPGSRRFATTEDGHVQVRGF 458
>gi|384484453|gb|EIE76633.1| hypothetical protein RO3G_01337 [Rhizopus delemar RA 99-880]
Length = 284
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 115/279 (41%), Positives = 164/279 (58%), Gaps = 40/279 (14%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS 181
IT GFQFLL + QVW F+L+YL+ E L+L G +YS + ++
Sbjct: 13 ITNKGFQFLLQDVNTQVWAFLLQYLDMAELGSLEL---------------GENYSVDTLT 57
Query: 182 DKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT-RGTLKQIR-------------- 226
L+ LR++G+VYQRK+ + R+YPT+LA + + + L +
Sbjct: 58 QTQLQMLEDLRDYGIVYQRKKHSKRYYPTRLATTLTSGKSALATVAGKYNHMMQETNIDD 117
Query: 227 -------EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAAL 279
+ GF+I+ETN+++YAYTDS L++A++ LF +L RF N+ GV+TRDS+R AL
Sbjct: 118 TTDTESVDQGFIILETNYKLYAYTDSPLQIAVLNLFVQLQSRFRNMVTGVITRDSIRNAL 177
Query: 280 RSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQ 336
GITA QII +L+ HA P + + LP V DQIRLWE ER RL + LY +F Q
Sbjct: 178 MKGITAEQIIYYLQSHAHPQMRKETPVLPLTVVDQIRLWEMERNRLKPTPSYLYHEFNVQ 237
Query: 337 ADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
ADF+ YA+DLGVL W N +KRT+ +T+ GH ++K F
Sbjct: 238 ADFDAAEKYARDLGVLLWSNNQKRTMAITEAGHENVKGF 276
>gi|260945437|ref|XP_002617016.1| hypothetical protein CLUG_02460 [Clavispora lusitaniae ATCC 42720]
gi|238848870|gb|EEQ38334.1| hypothetical protein CLUG_02460 [Clavispora lusitaniae ATCC 42720]
Length = 511
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 143/468 (30%), Positives = 218/468 (46%), Gaps = 95/468 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK Y++ ++F ++PV + W K E+ L L
Sbjct: 39 LPPLAKFYIVSMVFTDKPVAVKDLEKWCQPQARKLQYESFKRLRALHLIEEDQSGTHVRL 98
Query: 48 ----RRNAKVALLGGGKPWSMSAKLE-VDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
R N + L G P + + D D+ FL+Q+A ++WE IL FMVG++
Sbjct: 99 HQTFRNNFRDCLTGSQAPNAFGIIVPGSDKHPIDVKFLDQFASQKWETILHFMVGTE-LP 157
Query: 103 GISADAVRTLFNEG---------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
+ + +V +L G A + IT GFQFLL + Q+W +L+YL +
Sbjct: 158 TVPSKSVLSLLKSGGLMEGHGTSAYSLKITNSGFQFLLQDVNVQIWTLLLEYLNLTQELN 217
Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
+D V+ L F+F L G Y+ +SD L LR++GLVYQR + RF+PT+LA
Sbjct: 218 MDPVDVLNFIFILGSLELGKAYAVSSLSDTQLSMLPDLRDYGLVYQRSENSTRFFPTRLA 277
Query: 214 LNMATRGT--------LKQ-IREPGF-----------------LIVETNFRVYAYTDSNL 247
+ + T L Q + +PG +I+ETNF++YAYT+S L
Sbjct: 278 TTLTSESTGLKTPSMALNQTLEDPGAKDQEHVTSEAGESQSGQIILETNFKLYAYTNSPL 337
Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCP---- 303
++A++ LF L RF N+ +G +TR+S+R AL +GITA QII FL HA +
Sbjct: 338 EIAILNLFVNLKVRFANMVIGQITRESIRKALYNGITADQIIKFLETHAHSQMRTSAKEK 397
Query: 304 ------------------------------------LPPVVSDQIRLWEGERERLTTSEG 327
LPP V DQI+LW+ E +R+ T EG
Sbjct: 398 LDKKIEFDASHNINTAGGAPQSKSSDSGVAQHRLEILPPNVVDQIKLWQLELDRIQTFEG 457
Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
L+ F +Q +++ L +YA ++GVL W ++ K VT +G + F
Sbjct: 458 YLFKDFKNQNEYDVLCNYATEVGVLLWSDKTKLRFFVTAEGMHQVADF 505
>gi|296818081|ref|XP_002849377.1| RNA polymerase II transcription factor B subunit 2 [Arthroderma
otae CBS 113480]
gi|238839830|gb|EEQ29492.1| RNA polymerase II transcription factor B subunit 2 [Arthroderma
otae CBS 113480]
Length = 484
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 144/442 (32%), Positives = 218/442 (49%), Gaps = 72/442 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW---------------------LSKTHAKEHKEA 40
L +AK +V+ LL+++ P+P + W LS T H A
Sbjct: 38 LPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILGRLHILSSTTTSNHVRA 97
Query: 41 TDTLTELRRNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
+ + AL+G + S + D + L++YA ++WE +L +MVG
Sbjct: 98 YMVTEPFSTSLRQALMGEDQRHSFGVISMTPDKHHVSVADLDEYARQQWEAVLGYMVGMS 157
Query: 100 QTEG-----ISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLET 148
G + V+ L E + V ITK+GF F+L + QVW+ ++ Y+E
Sbjct: 158 SLSGGRETITLSKGVKQLLQACHLVEIRDRRVEITKEGFAFVLQDVNTQVWHILILYVEN 217
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG--R 206
E+ ++ VE L+F+F L G Y ++ L L +FG+VYQ +G R
Sbjct: 218 AEAIHMESVEVLSFIFLLSSLELGQSYEKNDLTPTQLRTLADLTDFGIVYQHTPASGSTR 277
Query: 207 FYPTKLAL-----NMATRGTLKQIREP-----------------GFLIVETNFRVYAYTD 244
FYPT+LA ++A G + EP GF+IVETN+R+YAYT
Sbjct: 278 FYPTRLATTLTSDSLAMPGPVSG--EPVVPAPSTGPTSATDGGTGFIIVETNYRLYAYTS 335
Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---- 300
S L+++LI LF L +RFPNL G +TR SVR A+ GITA QI+ +L HA P +
Sbjct: 336 SPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQIVSYLTTHAHPQMRKTK 395
Query: 301 -------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
+ LPP V DQIRLW+ ER+R+ + G L+ +F + A+FE YA+++GVL
Sbjct: 396 KSTSNVAASILPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFAEFEAPCKYAEEIGVLV 455
Query: 354 WQNEKKRTVVVTKQGHSDIKKF 375
W+++ +R VT+ H + F
Sbjct: 456 WRSDSRRMFFVTR--HEQVAAF 475
>gi|390370355|ref|XP_783140.3| PREDICTED: general transcription factor IIH subunit 4-like, partial
[Strongylocentrotus purpuratus]
Length = 180
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 95/180 (52%), Positives = 139/180 (77%), Gaps = 5/180 (2%)
Query: 202 RKAGRFYPTKLALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
RK+ R+YPT+LA+N+A+ K + GF++VETNFR+YAYT+S+L+V ++GLFC +M
Sbjct: 1 RKSMRYYPTRLAINLASGVSSMAKDDHKDGFIVVETNFRIYAYTESDLQVEILGLFCSMM 60
Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPL-PPVVSDQIRLWE 316
YRFPNL+V LTR+SV+ A+ +GITA QI+ FLR HA P++ P+ PP +SDQ+RLWE
Sbjct: 61 YRFPNLSVAALTRESVQLAISNGITAEQILSFLRTHAHPNMRLKTPIVPPTISDQVRLWE 120
Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
ER+RL+ ++G++Y++F+S DFE LRDYA+DLGVL W + +R ++V+ GH +KK+W
Sbjct: 121 LERDRLSFTQGIIYNEFLSLHDFEVLRDYAKDLGVLIWDSTARRIMIVSPAGHDSVKKYW 180
>gi|326474968|gb|EGD98977.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
Tfb2 [Trichophyton tonsurans CBS 112818]
Length = 489
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 143/424 (33%), Positives = 211/424 (49%), Gaps = 70/424 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW---------------------LSKTHAKEHKEA 40
L +AK +V+ LL+++ P+P + W LS T H A
Sbjct: 38 LPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILGRLHILSSTTTSNHVRA 97
Query: 41 TDTLTELRRNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
+ + AL+G + S + D + LN+YA ++WE +L +MVG
Sbjct: 98 YMVTEPFSTSLRQALMGEDQQGSFGVISVTPDKYPVSVADLNEYARQQWEAVLGYMVGMS 157
Query: 100 QTEG-----ISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLET 148
G + V+ L E + V ITK+GF F+L + QVW+ ++ Y+E
Sbjct: 158 SLSGGRETITLSKGVKQLLQACHLVEIRDRRVEITKEGFAFVLQDVNTQVWHILILYVEN 217
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG--R 206
E+ G++ VE L+F+F L G Y + ++ L L +FG+VYQ +G R
Sbjct: 218 AEAIGMESVEVLSFIFLLSSLELGQSYEKKDLTPTQLRTLADLTDFGIVYQHTPASGSAR 277
Query: 207 FYPTKLAL-----NMATRGTLKQIREP-----------------GFLIVETNFRVYAYTD 244
FYPT+LA ++A G + EP GF+IVETN R+YAYT
Sbjct: 278 FYPTRLATTLTSDSLAMSGPISG--EPAVPTTTSGTTTSGDAGTGFIIVETNHRLYAYTS 335
Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---- 300
S L+++LI LF L +RFPNL G +TR SVR A+ GITA QII +L HA P +
Sbjct: 336 SPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQIISYLTTHAHPQMRKTK 395
Query: 301 -------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
S LPP V DQIRLW+ ER+R+ + G L+ +F + A+FE YA+++GVL
Sbjct: 396 KSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFAEFEAPCKYAEEIGVLV 455
Query: 354 WQNE 357
W+++
Sbjct: 456 WKSD 459
>gi|389642093|ref|XP_003718679.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae 70-15]
gi|351641232|gb|EHA49095.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae 70-15]
gi|440469172|gb|ELQ38293.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae Y34]
gi|440481925|gb|ELQ62459.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae P131]
Length = 490
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 139/434 (32%), Positives = 220/434 (50%), Gaps = 70/434 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK +V+ LL++ QP+P + SW+ K +A L +
Sbjct: 39 LPPLAKTFVMSLLYMPQPLPLTALDSWVKPEAKKNKDQALSILRSMHITTITPVTKEKPV 98
Query: 48 ---------RRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV 96
+++ ++AL GGG + + + L + + D+ FL+++A RW+ IL ++V
Sbjct: 99 QEMSLTPNFKKSLRLALEGGGSHNSFGVPSSLPIPPQV-DVAFLDKWARSRWDAILHYVV 157
Query: 97 GS-QQTEGISA------------DAVRTLFNEG-------AENPVITKDGFQFLLLETPA 136
S ++T+ + D V+TL +G +E ITK GF FLL E A
Sbjct: 158 NSVEETDSMEPSKKYNFGGSKLQDTVKTLLVQGGLVQRRSSERISITKTGFTFLLQEANA 217
Query: 137 QVWYFILKYLETVESKGLD-LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFG 195
QVW +L++L +V + V+ L+FLF L G Y T +S++ + L L +FG
Sbjct: 218 QVWTLLLQWLHSVNEDNTNRAVDMLSFLFMLGTLELGQAYDTGALSEERRNMLPDLNDFG 277
Query: 196 LVYQRKRKAGRFYPTKLA---------LNMATRGTLKQIREPG------FLIVETNFRVY 240
LVY +++PT+LA L + G E G +I+ETNFR+Y
Sbjct: 278 LVYIPPSNPDQYFPTRLATTLTSGSSALRSVSSGVAAATAEAGENNTKGAIILETNFRIY 337
Query: 241 AYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
AYT + L++A++ LF L RF + G L+R S++ A+ GITA QII +L HA +
Sbjct: 338 AYTSTPLQIAILALFANLKMRFAGMVTGQLSRHSIKRAISHGITADQIIEYLASHAHEQM 397
Query: 301 -------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
+ P LPP V DQIRLW+ E ER+ G L+ F SQA+F+ + DYA ++GVL
Sbjct: 398 HRIAAIRNKPVLPPTVVDQIRLWQLETERMQVQRGYLFKDFESQAEFKAIADYADEVGVL 457
Query: 353 TWQNEKKRTVVVTK 366
W+++ ++ +K
Sbjct: 458 IWRSDARQLFFASK 471
>gi|302495702|ref|XP_003009866.1| hypothetical protein ARB_03925 [Arthroderma benhamiae CBS 112371]
gi|291173385|gb|EFE29221.1| hypothetical protein ARB_03925 [Arthroderma benhamiae CBS 112371]
Length = 386
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 127/342 (37%), Positives = 185/342 (54%), Gaps = 50/342 (14%)
Query: 80 LNQYALERWECILRFMVGSQQTEG-----ISADAVRTLFNEGAENPV------ITKDGFQ 128
L++YA ++WE +L +MVG G + V+ L V ITK+GF
Sbjct: 40 LDEYARQQWEAVLGYMVGMSSLSGGRETITLSKGVKQLLQACHLVEVRDRRVEITKEGFA 99
Query: 129 FLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFL 188
F+L + QVW+ ++ Y+E E+ G++ VE L+F+F L G Y + ++ L
Sbjct: 100 FVLQDVNTQVWHILILYVENAEAIGMESVEVLSFIFLLSSLELGQSYEKKDLTPTQLRTL 159
Query: 189 QHLREFGLVYQRKRKAG--RFYPTKLAL-----NMATRGTLKQIREP------------- 228
L +FG+VYQ +G RFYPT+LA ++A G + EP
Sbjct: 160 ADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMSGPISG--EPAVPTTTAGTTTSG 217
Query: 229 ----GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGIT 284
GF+IVETN+R+YAYT S L+++LI LF L +RFPNL G +TR SVR A+ GIT
Sbjct: 218 DAGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGIT 277
Query: 285 AAQIIGFLRLHALPSV-----------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
A QII +L HA P + S LPP V DQIRLW+ ER+R+ + G L+ +F
Sbjct: 278 ADQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEF 337
Query: 334 VSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ A+FE YA+++GVL W+++ +R VT+ H + F
Sbjct: 338 DTFAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR--HEQVAAF 377
>gi|336466164|gb|EGO54329.1| hypothetical protein NEUTE1DRAFT_87562 [Neurospora tetrasperma FGSC
2508]
gi|350286986|gb|EGZ68233.1| transcription factor Tfb2 [Neurospora tetrasperma FGSC 2509]
Length = 475
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 137/411 (33%), Positives = 212/411 (51%), Gaps = 66/411 (16%)
Query: 34 AKEHKEATDTLTELRRNAKVALLGGGKPWS--MSAKLEVDSKARDLDFLNQYALERWECI 91
+K+ + ++ ++A GGG+ S + + L VD D+ +L++YA ++W+ I
Sbjct: 68 SKDRTQEIQITASFKKGLQLAFEGGGEDGSFGVPSSLPVDPSI-DIAYLDKYARKKWDDI 126
Query: 92 LRFMVGS----QQTEGISAD--AVRTLFNEG---AENP------VITKDGFQFLLLETPA 136
L ++V S +EG+S +V+ L G P IT+ GF FLL E+ A
Sbjct: 127 LHYVVNSLGVHGSSEGVSGPKASVKELLLAGRLVERRPDTRTGIGITQAGFTFLLQESNA 186
Query: 137 QVWYFILKYLETVE---SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLRE 193
QVW +L +LE + G+D ++ L+FLF L GT Y T +++ + L L +
Sbjct: 187 QVWTLLLLWLEAADHTADSGMDSIDMLSFLFMLSSLELGTAYDTSALTETRRNMLPSLVD 246
Query: 194 FGLVYQRKRKAGRFYPTKLALNMA-------------TRGTLKQIREPGF---------- 230
FGL+Y R +F+PT+LA + T T EPG
Sbjct: 247 FGLIY-FPRDTRQFFPTRLATTLTSTASALRTVSSAFTAATANPSGEPGTTTSTSGGPAA 305
Query: 231 -----------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAAL 279
+I+ETN+R+YAYT S L++A++ LFC+L RFPN+ G LTRDS+R A+
Sbjct: 306 TPDKTQTAAKGIIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSGRLTRDSIRQAI 365
Query: 280 RSGITAAQIIGFLRLHALPSV--------SCPLPPVVSDQIRLWEGERERLTTSEGVLYS 331
GITA QII +L HA + LPP V DQIRLW+ E ER+ T+ G L+
Sbjct: 366 GFGITADQIISYLHAHAHQQMVREATVTGKPVLPPTVVDQIRLWQLENERMQTTAGFLFK 425
Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
F S ++ L YA+++GVL W+N+K++ +K H ++ + + +KG
Sbjct: 426 DFESTEEYAALSRYAEEIGVLVWKNDKRQLFFASK--HEQLRDYLKSRKKG 474
>gi|164425579|ref|XP_960485.2| hypothetical protein NCU05523 [Neurospora crassa OR74A]
gi|157070982|gb|EAA31249.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 475
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 137/411 (33%), Positives = 212/411 (51%), Gaps = 66/411 (16%)
Query: 34 AKEHKEATDTLTELRRNAKVALLGGGKPWS--MSAKLEVDSKARDLDFLNQYALERWECI 91
+K+ + ++ ++A GGG+ S + + L VD D+ +L++YA ++W+ I
Sbjct: 68 SKDRTQEIQITASFKKGLQLAFEGGGEDGSFGVPSSLPVDPSI-DIAYLDRYARKKWDDI 126
Query: 92 LRFMVGS----QQTEGISAD--AVRTLFNEG---AENP------VITKDGFQFLLLETPA 136
L ++V S +EG+S +V+ L G P IT+ GF FLL E+ A
Sbjct: 127 LHYVVNSLGVHGSSEGVSGPKASVKELLLAGRLVERRPDTRTGIGITQAGFTFLLQESNA 186
Query: 137 QVWYFILKYLETVE---SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLRE 193
QVW +L +LE + G+D ++ L+FLF L GT Y T +++ + L L +
Sbjct: 187 QVWTLLLLWLEAADHTADSGMDSIDMLSFLFMLSSLELGTAYDTSALTETRRNMLPSLVD 246
Query: 194 FGLVYQRKRKAGRFYPTKLALNMA-------------TRGTLKQIREPGF---------- 230
FGL+Y R +F+PT+LA + T T EPG
Sbjct: 247 FGLIY-FPRDTRQFFPTRLATTLTSTASALRTVSSAFTAATANPSGEPGTTTSTSGGPAA 305
Query: 231 -----------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAAL 279
+I+ETN+R+YAYT S L++A++ LFC+L RFPN+ G LTRDS+R A+
Sbjct: 306 TPDKTQTAAKGIIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSGRLTRDSIRQAI 365
Query: 280 RSGITAAQIIGFLRLHALPSV--------SCPLPPVVSDQIRLWEGERERLTTSEGVLYS 331
GITA QII +L HA + LPP V DQIRLW+ E ER+ T+ G L+
Sbjct: 366 GFGITADQIISYLHAHAHQQMVREATVTGKPVLPPTVVDQIRLWQLENERMQTTAGFLFK 425
Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
F S ++ L YA+++GVL W+N+K++ +K H ++ + + +KG
Sbjct: 426 DFESTEEYAALSRYAEEIGVLVWKNDKRQLFFASK--HEQLRDYLKSRKKG 474
>gi|302667863|ref|XP_003025510.1| hypothetical protein TRV_00272 [Trichophyton verrucosum HKI 0517]
gi|291189624|gb|EFE44899.1| hypothetical protein TRV_00272 [Trichophyton verrucosum HKI 0517]
Length = 386
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 127/342 (37%), Positives = 185/342 (54%), Gaps = 50/342 (14%)
Query: 80 LNQYALERWECILRFMVGSQQTEG-----ISADAVRTLFNEGAENPV------ITKDGFQ 128
L++YA ++WE +L +MVG G + V+ L V ITK+GF
Sbjct: 40 LDEYARQQWEAVLGYMVGMSSLSGGRETITLSKGVKQLLQACHLVEVRDRRVEITKEGFA 99
Query: 129 FLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFL 188
F+L + QVW+ ++ Y+E E+ G++ VE L+F+F L G Y + ++ L
Sbjct: 100 FVLQDVNTQVWHILILYVENAEAIGMESVEVLSFVFLLSSLELGQSYEKKDLTPTQLRTL 159
Query: 189 QHLREFGLVYQRKRKAG--RFYPTKLAL-----NMATRGTLKQIREP------------- 228
L +FG+VYQ +G RFYPT+LA ++A G + EP
Sbjct: 160 ADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMSGPISG--EPAVPTTTAGTTTSG 217
Query: 229 ----GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGIT 284
GF+IVETN+R+YAYT S L+++LI LF L +RFPNL G +TR SVR A+ GIT
Sbjct: 218 DAGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGIT 277
Query: 285 AAQIIGFLRLHALPSV-----------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
A QII +L HA P + S LPP V DQIRLW+ ER+R+ + G L+ +F
Sbjct: 278 ADQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEF 337
Query: 334 VSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ A+FE YA+++GVL W+++ +R VT+ H + F
Sbjct: 338 DTFAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR--HEQVAAF 377
>gi|194390490|dbj|BAG62007.1| unnamed protein product [Homo sapiens]
Length = 220
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 111/220 (50%), Positives = 144/220 (65%), Gaps = 30/220 (13%)
Query: 11 IRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------------R 49
+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR +
Sbjct: 1 MRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQ 60
Query: 50 NAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISADA 108
N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS + +S D
Sbjct: 61 NLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSP-SAAVSQDL 119
Query: 109 VRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLT 161
+ L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE L+
Sbjct: 120 AQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILS 179
Query: 162 FLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK 201
FLFQL FST G DYS EGMSD L FLQHLREFGLV+QRK
Sbjct: 180 FLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRK 219
>gi|342884444|gb|EGU84659.1| hypothetical protein FOXB_04847 [Fusarium oxysporum Fo5176]
Length = 496
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 137/451 (30%), Positives = 229/451 (50%), Gaps = 80/451 (17%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
L +AK +V+R+L+ +P+ + + W+ +H ++ +A L
Sbjct: 41 LPHLAKTFVMRILYSPKPILLSDLDDWVKPSHKRQKDQALSILRVLHIVSITAPSKERPQ 100
Query: 45 -----TELRRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVG 97
T + + ++AL GGG + + + L++ + D+ FL++YA ++WE IL F+V
Sbjct: 101 EMQLTTNFKLSLRLALSGGGTHNSFGVPSTLQIPPEI-DIAFLDRYARKKWEDILHFVVS 159
Query: 98 S--QQTEGISADAVRT---------LFNEGAENPV-ITKDGFQFLLLETPAQVWYFILKY 145
S ++ G S+ ++ L + A V IT+ GF FLL E AQVW +L +
Sbjct: 160 SVGYKSAGESSGPNKSVKELLVAGRLVDRKASGSVGITQAGFTFLLQEANAQVWTLLLLW 219
Query: 146 LETVESK---GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKR 202
LE +E GL+ + L+FLF L G Y T ++++ + L L +FGL+Y
Sbjct: 220 LEAMEVNKMAGLEATDMLSFLFVLASMELGRAYDTNALTEQRKNMLPSLVDFGLIYIPNH 279
Query: 203 KAGRFYPTKLALNM----------------------------ATRG--TLKQIREPGFLI 232
K F+PT+LA + A G T + G +I
Sbjct: 280 KRSMFFPTRLATTLTSSSNSLRSISDGVAAATAAALQPGQSGAPGGSVTAANTEQRGSVI 339
Query: 233 VETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFL 292
+ETN+R+YAYT S L++A++ LF +L RFP++ G ++R S+R A++ GITA QII +L
Sbjct: 340 IETNYRIYAYTQSTLQIAVLALFTKLAMRFPDMVAGRISRQSIRQAIQFGITAEQIISYL 399
Query: 293 RLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRD 344
HA + + P LPP V DQIRLW+ E ER+ T+ G L+ F +++ +
Sbjct: 400 SAHAHDQMHRTAALNNKPVLPPTVVDQIRLWQLENERMKTTSGFLFKSFEDDREYKDIAR 459
Query: 345 YAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+A+++GVL W+N+ ++ +K H I+ +
Sbjct: 460 FAEEVGVLVWKNDARQMFFASK--HEQIRDY 488
>gi|212535856|ref|XP_002148084.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
gi|210070483|gb|EEA24573.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
Length = 422
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 125/340 (36%), Positives = 185/340 (54%), Gaps = 44/340 (12%)
Query: 71 DSKARDLDFLNQYALERWECILRFMVGSQQT---EGIS-ADAVRTLFNEGAENPV----- 121
D A + L++YA +WE +L +MVG+ E +S + V+ L G +
Sbjct: 70 DPNAVSIADLDEYARRQWEGVLGYMVGTSGPGVRETVSLSRGVKQLLQAGHLVEIHHGRV 129
Query: 122 -ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGM 180
IT+DGF F+L + QVW I+ Y+E+ ++ ++ VE L+F+F L G Y +
Sbjct: 130 DITQDGFAFVLQDVNTQVWQIIILYVESAKALQVNDVEVLSFIFLLSSLELGQSYEKRHL 189
Query: 181 SDKLQVFLQHLREFGLVYQRK--RKAGRFYPTKLALNMATRG--------------TLKQ 224
+ L L +FG++YQ +A RFYPT+LA + + T
Sbjct: 190 TPDQLRALADLTDFGIIYQHAPASEADRFYPTRLATTLTSDSSALSNTMSSSLSAQTNAA 249
Query: 225 IREPG--FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSG 282
EPG F+I+ETN+R+YAYT S L+++LI LF L YRFPNL G +TR S+R A+ G
Sbjct: 250 TGEPGSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMG 309
Query: 283 ITAAQIIGFLRLHALP-------------SVSCP---LPPVVSDQIRLWEGERERLTTSE 326
ITA QII +L HA P + P LPP V DQIRLW+ ER+R+ +
Sbjct: 310 ITADQIISYLSTHAHPQMRKEGAVKSTSNTAGIPRSVLPPTVVDQIRLWQLERDRIKATP 369
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
G L+ +F S A++E YA+++GVL W++++KR VT+
Sbjct: 370 GFLFKEFASLAEYEGPCRYAEEIGVLVWKSDRKRMFFVTR 409
>gi|340518979|gb|EGR49219.1| trancription factor tfb2-like protein [Trichoderma reesei QM6a]
Length = 480
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 147/448 (32%), Positives = 228/448 (50%), Gaps = 74/448 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW---------------LSKTH-------AKEHKE 39
L +AK +V+R+LF+ P+ A + +W L H +KE +
Sbjct: 39 LPHLAKTFVMRMLFLPHPMTLADLDAWVKPEAKRKKDQSLSILRSLHIVQISAPSKEKPQ 98
Query: 40 ATDTLTELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
++ +R+ ++AL GGG P S+ E+D L FL++YA ++WE +L
Sbjct: 99 EIQLMSNFKRSLRLALTGGGNHNSFGVPSSLMVPPEID-----LPFLDRYARKKWEDVLH 153
Query: 94 FMV-----------GSQQTEGISADAVRTLFNEGAENPV--ITKDGFQFLLLETPAQVWY 140
F+V GS +G+ + + N IT+ GF FLL E AQVW
Sbjct: 154 FVVSSVGYKSSLGEGSGPNKGVKELLIAGQLVDRRPNGSLGITQAGFTFLLQEANAQVWT 213
Query: 141 FILKYLETV---ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV 197
+L +LE + +S GLD V+ L+FLF L G Y T ++++ + L L +FGL+
Sbjct: 214 LLLLWLEVLGNNKSSGLDPVDMLSFLFMLASLELGRAYDTNALTEERRNMLPSLVDFGLI 273
Query: 198 YQRKRKAGRFYPTKLALNMATRG-TLKQIRE--------------PGFLIVETNFRVYAY 242
Y + K F+PT+LA + + G +L+ I E G +++ETN+R+YAY
Sbjct: 274 YIPQHKRSMFFPTRLATTLTSGGNSLRTISEGSLGLLGGGGTGEQAGSVVIETNYRLYAY 333
Query: 243 TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-- 300
T S L++A++ LF +L RFP++ G L+R S+R A+ GITA QII +L HA +
Sbjct: 334 TQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQAINFGITADQIISYLSAHAHEQMHR 393
Query: 301 -----SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
+ P LPP V DQIRLW+ E ER+ T+ G L+ F D+ +++++GVL W
Sbjct: 394 TAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFTDDKDYLDTARFSEEIGVLVW 453
Query: 355 QNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
+N+ R K H IK F + +K
Sbjct: 454 RNDNTRMFFANK--HEQIKDFLKSRKKA 479
>gi|46127843|ref|XP_388475.1| hypothetical protein FG08299.1 [Gibberella zeae PH-1]
gi|408390907|gb|EKJ70292.1| hypothetical protein FPSE_09509 [Fusarium pseudograminearum CS3096]
Length = 494
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 139/451 (30%), Positives = 228/451 (50%), Gaps = 80/451 (17%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
L ++AK +V+R+LF +P+ + + W+ +H ++ +A +
Sbjct: 39 LPQLAKVFVMRILFNPKPILLSDLDDWVKPSHKRQKDQALSVMRVLHIVQISTPSKERPQ 98
Query: 45 -----TELRRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVG 97
T + + ++AL GGG + + + L++ + D+DFL++YA ++WE IL F+V
Sbjct: 99 EMQLTTNFKVSLRLALSGGGSHNSFGVPSTLQIPPEI-DIDFLDRYARKKWEDILHFVVS 157
Query: 98 S--QQTEGISA---DAVRTLFNEG---AENPV----ITKDGFQFLLLETPAQVWYFILKY 145
S ++ G S+ +V+ L G P IT+ GF FLL E AQVW +L +
Sbjct: 158 SVGYKSAGESSGPNKSVKELLVAGRLVDRRPSGAIGITQAGFTFLLQEANAQVWTLLLLW 217
Query: 146 LETVESK---GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKR 202
LE ++ GL+ + L+FLF L G Y T ++++ + L L +FGL+Y
Sbjct: 218 LEAMDVNKMAGLEATDMLSFLFVLASMELGRAYDTNALTEQRKNMLPSLVDFGLIYIPNH 277
Query: 203 KAGRFYPTKLALNMATRGTLK------------------QIREPGF------------LI 232
K F+PT+LA + + Q R PG +I
Sbjct: 278 KRSMFFPTRLATTLTSSSNSLRSISDGVAAATAAALQPGQSRAPGSSATGSSNEQRGSVI 337
Query: 233 VETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFL 292
+ETN+R+YAYT S L++A++ LF +L RFP++ G ++R S+R A+ GITA QII +L
Sbjct: 338 IETNYRIYAYTQSTLQIAVLALFTKLQMRFPDMVAGRISRQSIRQAINFGITAEQIISYL 397
Query: 293 RLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRD 344
HA + + P LPP V DQIRLW+ E ER+ T+ G L+ F ++ +
Sbjct: 398 SAHAHDQMRRTASLNNKPVLPPTVVDQIRLWQLENERMKTTGGFLFKDFEDHKEYMAVAG 457
Query: 345 YAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+A+++GVL W+N+ K +K H I+ +
Sbjct: 458 FAEEVGVLVWRNDVKGMFFASK--HEQIRDY 486
>gi|254573074|ref|XP_002493646.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes
[Komagataella pastoris GS115]
gi|238033445|emb|CAY71467.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes
[Komagataella pastoris GS115]
gi|328354526|emb|CCA40923.1| RNA polymerase II transcription factor B subunit 2 [Komagataella
pastoris CBS 7435]
Length = 547
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 130/393 (33%), Positives = 198/393 (50%), Gaps = 77/393 (19%)
Query: 48 RRNAKVALLGGGKPWSMSAKLEVDSKARDLD-----FLNQYALERWECILRFMVGSQQTE 102
R+N + AL G MS L D + D++ FL++Y+L +WE IL FMVGS+ E
Sbjct: 144 RKNFRNALTGS----QMSNDLNNDIEMTDVEEVSEEFLDRYSLMKWENILHFMVGSEVPE 199
Query: 103 GISADAVRTL-----FNEGAENPVI------------------TKDGFQFLLLETPAQVW 139
S + L + G EN I T+DGFQFLL + AQ+W
Sbjct: 200 LPSIGVLSLLRHSGLMDIGEENEFIATPKRFKGDNMDLKSMKITQDGFQFLLQDINAQIW 259
Query: 140 YFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQ 199
+L+YL E ++ V+ L F+F L G Y +SD + L+ L ++GL+Y
Sbjct: 260 QLLLQYLRMSEKLMMNPVDVLNFIFMLGSLELGKQYPVTLLSDTQVIMLEDLVDYGLIY- 318
Query: 200 RKRKAGRFYPTKLALNMATRGTL---------------KQIREPGFLIVETNFRVYAYTD 244
K FYPT++A + + T ++++ G +++ETNF++Y YT
Sbjct: 319 YPEKNRYFYPTRMATTLTSEKTTFKTASMSMNQVLEGGPEVKDHGSIVLETNFKLYCYTT 378
Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC-- 302
S L++A++ LF L RF N+ G++TR+SVR ALR+GITA QII +L HA +
Sbjct: 379 SPLQIAILNLFVHLRTRFANMVTGMITRESVREALRNGITADQIIKYLETHAHSQMKKLA 438
Query: 303 ---------------------------PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVS 335
+PP V DQI+LW+ E +RL T G L+ F +
Sbjct: 439 EEKLLKKLEFDSNATETQQIGSQVKLEVIPPTVVDQIKLWQLEMDRLQTFAGFLFKDFAN 498
Query: 336 QADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
+FE+L +YA ++GV+ W+++ KR VT++G
Sbjct: 499 AQEFEQLANYADEVGVMLWRDDDKRKFFVTEEG 531
>gi|440636738|gb|ELR06657.1| hypothetical protein GMDG_00274 [Geomyces destructans 20631-21]
Length = 470
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 139/429 (32%), Positives = 210/429 (48%), Gaps = 58/429 (13%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK +V+ +L++ +P+ + + +W+ E +A L+ L
Sbjct: 38 LPHLAKCFVMGMLYMPKPLLLSELDNWVRPGGKSEKGKALAILSRLHIVTSTAASRDGPP 97
Query: 49 ---------RNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGS 98
+ ++AL GGG S + + + FL++YA +WE IL ++V S
Sbjct: 98 ALNLTHNFATSLRLALTGGGDHGSFGIPSDSTQDSNITISFLDEYATAQWEGILHYVVNS 157
Query: 99 QQTEGI-----SADAVRTLFNEGA--------ENPVITKDGFQFLLLETPAQVWYFILKY 145
+ + VR L G IT+ GF FLL E AQVW +L +
Sbjct: 158 TGDPSLLGAQYPSANVRQLLQMGTLVDSKGKYSGVGITQAGFSFLLQEANAQVWTVLLLW 217
Query: 146 LETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG 205
LE E +D V+ L+F F L G YST +S + L L +FGLVY +
Sbjct: 218 LEHAEELAMDHVDLLSFFFLLGSLELGLAYSTTTLSTTQKSMLPILADFGLVYSPPSSSS 277
Query: 206 RFYPTKLALNMATRGT----------------LKQIREPGFLIVETNFRVYAYTDSNLKV 249
F T+LA + + + GF+++ETN+R+YAYT S L++
Sbjct: 278 FFP-TRLATTLTSSASALRSIASGFSAAAASPASSGGTTGFIVIETNYRLYAYTSSPLQI 336
Query: 250 ALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPP 306
A++ LF +L R+PN+ G +TRDSVR A+ GITA QII +L HA P + S LPP
Sbjct: 337 AVLALFAKLSTRYPNMVSGRVTRDSVRGAIEHGITADQIISYLTTHAHPQLRRKSPVLPP 396
Query: 307 VVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
V DQIRLW+ E ER+ + G L+ F S A++E YA ++GVL W+++ KR VTK
Sbjct: 397 TVVDQIRLWQIENERMKATAGFLFKDFASLAEYEGCVRYADEIGVLVWRSDAKRNFFVTK 456
Query: 367 QGHSDIKKF 375
H ++ +
Sbjct: 457 --HEQLRDY 463
>gi|425777809|gb|EKV15965.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Penicillium digitatum PHI26]
gi|425782577|gb|EKV20476.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Penicillium digitatum Pd1]
Length = 382
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 127/345 (36%), Positives = 187/345 (54%), Gaps = 42/345 (12%)
Query: 80 LNQYALERWECILRFMVG----SQQTEGIS-ADAVRTLFNEG------AENPVITKDGFQ 128
L++YA +WE ++ +MVG S Q + ++ + V+ L G ITKDGF
Sbjct: 40 LDEYARRQWEGVMGYMVGTSALSGQRDAVNLSKGVKQLLQAGHLVEIRGNRVDITKDGFG 99
Query: 129 FLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFL 188
F+L + QVW+ ++ Y+E+ E+ G+D VE L+FLF L G+ Y M+ L
Sbjct: 100 FVLQDVNTQVWHILILYVESAEAIGMDSVEVLSFLFLLSSLELGSSYDKSHMTSNQLRTL 159
Query: 189 QHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPG---------------FLIV 233
L +FG+VYQ A FYPT+LA + + + G F+I+
Sbjct: 160 ADLADFGIVYQEDADATHFYPTRLATTLTSDSSALSNPVTGSLTGPVGPSGGSGSGFIII 219
Query: 234 ETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLR 293
ETN+R+YAYT S L+++LI LF L YRFPNL G +TR SVR A+ GITA QII +L
Sbjct: 220 ETNYRLYAYTSSPLQISLISLFTNLKYRFPNLVTGKITRQSVRRAIEMGITADQIISYLL 279
Query: 294 LHALPSV--------------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADF 339
HA P + + LPP V+DQIRLW+ ER+RL + G L+ F S A++
Sbjct: 280 SHAHPQLRKHSAAQSNGKGIPASVLPPTVTDQIRLWQLERDRLRATAGFLFKDFTSLAEY 339
Query: 340 EKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQ 384
+ YA ++GVL W++++KR VT+ H + F + + G +
Sbjct: 340 QAPCQYAAEIGVLVWKSDRKRMFFVTR--HEQVAAFLRSQKVGGR 382
>gi|430812024|emb|CCJ30551.1| unnamed protein product [Pneumocystis jirovecii]
Length = 319
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 172/309 (55%), Gaps = 32/309 (10%)
Query: 71 DSKARDLDFLNQYALERWECILRFMVGSQQTEGISADAVRTL-------FNEGAENPVIT 123
D K D+ FL+ YA +WE IL FMVG+ TE + D V L F + IT
Sbjct: 17 DKKHVDIPFLDNYAKNKWEMILHFMVGTG-TEKLPGDHVLNLLRYSGLMFGSRCDEMKIT 75
Query: 124 KDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSD- 182
GFQFLLL +Q+W +L YL +E +D V+ L FLF L G Y ++
Sbjct: 76 NSGFQFLLLNINSQIWTLLLHYLNMIEDLEVDPVDILQFLFMLGNLELGQSYLISSLTPI 135
Query: 183 KLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAY 242
++QV L++L+++G+VY+RK + RFYPT+LA + T + IR
Sbjct: 136 QIQV-LENLKDYGIVYRRK-SSRRFYPTRLA-TILTSNSEILIRH--------------- 177
Query: 243 TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-- 300
S L++A++ LF L RF NL VGV+TR+SVR A +GITA QII +L +A P +
Sbjct: 178 --SPLQIAVLNLFVHLYLRFSNLVVGVITRNSVRQAFMNGITAEQIISYLTSYAHPQMKK 235
Query: 301 SCPL-PPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKK 359
+ P+ PP V+DQIRLWE ER RL +EG L+ F S +DFE YA +L V+ W+ K
Sbjct: 236 NTPIIPPTVNDQIRLWEMERNRLKATEGYLFRDFNSNSDFELALKYANELNVVVWEAPSK 295
Query: 360 RTVVVTKQG 368
R V QG
Sbjct: 296 RVFFVNIQG 304
>gi|348683175|gb|EGZ22990.1| hypothetical protein PHYSODRAFT_479459 [Phytophthora sojae]
Length = 490
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 123/367 (33%), Positives = 197/367 (53%), Gaps = 52/367 (14%)
Query: 58 GGKPWS-----MSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISADAV--- 109
GG PW + E A DL+ +YA RW+ +L +MVGS + V
Sbjct: 118 GGSPWERGRLQLPKDPENTFAAADLE---RYARARWDSVLHYMVGSTAVQEPPQSVVDIL 174
Query: 110 ---RTLFNEGAENPV--ITKDGFQFLLLETPAQVWYFILKYLETVESKG-LDLVECLTFL 163
R L GA++ IT G++F+L + Q+W F+L+Y+ T+++ G L + L FL
Sbjct: 175 LRTRLLQASGADSRALHITDTGYEFMLKDIHVQMWIFMLEYIRTLDNTGTLKQEDILQFL 234
Query: 164 FQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM------- 216
FQ+ + G Y+ +++ ++ L +FGL+++++ + RFY T LA+N+
Sbjct: 235 FQISYCQTGEYYAVADLTETQRLLLGDFIDFGLLFRKRPNSDRFYTTSLAVNLIFGGSTG 294
Query: 217 -------------ATRGTLK-QIREPG-----------FLIVETNFRVYAYTDSNLKVAL 251
R LK Q+ +P ++VETNF++YAYT S L VA+
Sbjct: 295 QKRSHVSLTSSFAGVRAGLKSQVADPRQAPTADHSAQLLVVVETNFKIYAYTTSTLHVAM 354
Query: 252 IGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVV 308
+ +F +++ R PNL +G +TR+S+R+AL GI+A QI FL HA P + S +P +
Sbjct: 355 LSVFVDIVARLPNLAIGFITRESLRSALIHGISAQQIYDFLMKHAHPKMRRNSPVIPENI 414
Query: 309 SDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
+DQI LWE ER R+ EG+L+ F ++ D+E +RDYA+DL VLTW + + +T G
Sbjct: 415 ADQIYLWERERNRVQFMEGILFDGFNTKEDYESVRDYAKDLKVLTWSDPIHFRLSITTAG 474
Query: 369 HSDIKKF 375
D++ +
Sbjct: 475 IDDVRHY 481
>gi|47212202|emb|CAF90416.1| unnamed protein product [Tetraodon nigroviridis]
Length = 471
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 135/358 (37%), Positives = 176/358 (49%), Gaps = 105/358 (29%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL ++K++V+R+LF+EQP+PQA +A W+ K ++H E L LR
Sbjct: 70 ELPSLSKNFVMRMLFLEQPLPQAAVALWVGKDSQRDHDECVCVLAGLRLWHSQHLQGGLQ 129
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
N K ALLGGG+ W+ L D ARD+ L++YA ERWE IL FMVGS
Sbjct: 130 GYVLNPVFKENLKTALLGGGRAWAEEGGALGPDRHARDIGSLDRYATERWEVILHFMVGS 189
Query: 99 QQTEGISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
+S D + L + G E P IT GFQFLLL+T +Q+WYF L+YL+T +
Sbjct: 190 P-CAAVSQDLAQLLVHAGLMKSEAGEPPYITSAGFQFLLLDTASQLWYFTLQYLKTAQ-- 246
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
DYS EGMSD L FLQHLREFGLV+QRK +GR
Sbjct: 247 ---------------------DYSVEGMSDSLLTFLQHLREFGLVFQRK-ASGR------ 278
Query: 213 ALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
E++YRFPN+ V +TR
Sbjct: 279 --------------------------------------------EMLYRFPNVVVAQVTR 294
Query: 273 DSVRAALRSGITAAQIIGF-LRLHALPSVSCP-LPPVVSDQIRLWEGERERLTTSEGV 328
+SV+ A+ +GITA Q + F L P P LPP ++DQIRLWE ER+RL +E V
Sbjct: 295 ESVQQAIANGITAQQRLTFDLWSSPTPLSQTPVLPPTITDQIRLWELERDRLQFTEAV 352
>gi|414584750|tpg|DAA35321.1| TPA: hypothetical protein ZEAMMB73_386928 [Zea mays]
Length = 386
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 122/350 (34%), Positives = 191/350 (54%), Gaps = 45/350 (12%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK Y +++L+V PV A + W+ +A +HK A D L +LR
Sbjct: 40 LPPLAKKYALQMLYVLAPVTAAAMEEWVLDEYAAKHKVAIDKLLQLRVFVEVRDRRRDVS 99
Query: 49 --------RNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILRF 94
N + L+ GG P S++ +L + L YAL++WEC L
Sbjct: 100 YKMNQKFQGNMQKYLVDGGSLPREPLPLSVTGRLPTPAD------LEAYALDQWECFLLQ 153
Query: 95 MVGSQQTE---GISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYL 146
++ S Q E S+ ++T L + E +T++GFQFLL+ET AQ+WY + +Y+
Sbjct: 154 LINSSQVEKGSSFSSSMMKTFQRGLLSSRDGEASKLTENGFQFLLMETNAQLWYIMREYI 213
Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK-RKAG 205
+ E +G+D E ++FL +L F G YS ++D ++ ++ L E GLV Q++ RK
Sbjct: 214 SSAEERGVDPTELISFLLELSFHKLGAAYSLNTLTDVQRIAIRDLAELGLVKQQQGRKDS 273
Query: 206 RFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
F PT+LA N++ + + GF++VETNFR+YAY+ S L ++ LF + Y+ PNL
Sbjct: 274 WFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSKLHCEILRLFARVEYQLPNL 333
Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQI 312
VG +T++S+ A +GITA QII FLR +A P V+ +P V V+DQ+
Sbjct: 334 IVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIPAVPENVTDQV 383
>gi|325181847|emb|CCA16302.1| general transcription factor IIH subunit putative [Albugo laibachii
Nc14]
Length = 473
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 136/437 (31%), Positives = 215/437 (49%), Gaps = 66/437 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWL------------SKTHAKE----HKEATDTLT 45
L +A+ +V+RLL V + V+ WL K AK ++ + L
Sbjct: 32 LPSLAQQFVMRLLSCNTVVQEQVLREWLVTQVQSDPGTSVGKEKAKSIDEIYQHSIQKLR 91
Query: 46 ELR-----------------RNAKVALLG-GGKPWSMSAK-----LEVDSKARDLDFLNQ 82
+LR + K AL GG PW + + +EV + +L+ +
Sbjct: 92 KLRVFTQSSTDSYQLHAGFQKQLKYALCHLGGCPWEVGRRKLGKDIEVSFASTELE---R 148
Query: 83 YALERWECILRFMVGSQQTEGISADAVRTLFNEGAENPV-------ITKDGFQFLLLETP 135
YA W+ +L +MVGS V L IT G++F+L +
Sbjct: 149 YARSHWDAVLHYMVGSTSVPDPPQAVVDILLRTNLMQASVDQRALHITDKGYEFMLKDIH 208
Query: 136 AQVWYFILKYLETVESKG-LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREF 194
Q+W FIL+Y++T++ G L + L FLFQ+ + Y + ++ Q+ L F
Sbjct: 209 IQMWIFILEYIKTLDRTGALKQEDILRFLFQISYCQVNAYYPVQDLTKTQQLLLTDFNNF 268
Query: 195 GLVYQRKRKAGRFYPTKLALNMATRGTLK------QIREPG-------FLIVETNFRVYA 241
GL+Y+++ + RFY T LA+N+ GT Q+ P IVETNF+VYA
Sbjct: 269 GLLYRKRSNSDRFYTTSLAINLIFGGTTTSNYSETQVARPRMSSMSDLLTIVETNFKVYA 328
Query: 242 YTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV- 300
YT S L +A++ +F +++ R PNL +G +TR+S+R+AL GI+A QI FL HA P +
Sbjct: 329 YTTSTLHIAMLSVFVDIVARLPNLAIGFITRESLRSALVHGISAQQIYDFLLKHAHPRMK 388
Query: 301 -SCPL-PPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
+ P+ P ++DQI LWE ER R+ EG+L+ F S+ +FE++R+YAQ LTW +
Sbjct: 389 KNNPIVPENIADQIYLWEKERNRIKFLEGILFDGFNSKEEFEQVREYAQKQRFLTWTDPI 448
Query: 359 KRTVVVTKQGHSDIKKF 375
+ ++ G ++ F
Sbjct: 449 HLKLAISSNGIESVRNF 465
>gi|90076516|dbj|BAE87938.1| unnamed protein product [Macaca fascicularis]
Length = 358
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 108/223 (48%), Positives = 144/223 (64%), Gaps = 30/223 (13%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 83 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 142
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 143 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 202
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 203 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 261
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREF 194
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQHL+E
Sbjct: 262 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLQEI 304
>gi|325181222|emb|CCA15636.1| general transcription factor IIH subunit putative [Albugo laibachii
Nc14]
Length = 1033
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 136/437 (31%), Positives = 215/437 (49%), Gaps = 66/437 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWL------------SKTHAKE----HKEATDTLT 45
L +A+ +V+RLL V + V+ WL K AK ++ + L
Sbjct: 79 LPSLAQQFVMRLLSCNTVVQEQVLREWLVTQVQSDPGTSVGKEKAKSIDEIYQHSIQKLR 138
Query: 46 ELR-----------------RNAKVALLG-GGKPWSMSAK-----LEVDSKARDLDFLNQ 82
+LR + K AL GG PW + + +EV + +L+ +
Sbjct: 139 KLRVFTQSSTDSYQLHAGFQKQLKYALCHLGGCPWEVGRRKLGKDIEVSFASTELE---R 195
Query: 83 YALERWECILRFMVGSQQTEGISADAVRTLFNEGAENPV-------ITKDGFQFLLLETP 135
YA W+ +L +MVGS V L IT G++F+L +
Sbjct: 196 YARSHWDAVLHYMVGSTSVPDPPQAVVDILLRTNLMQASVDQRALHITDKGYEFMLKDIH 255
Query: 136 AQVWYFILKYLETVESKG-LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREF 194
Q+W FIL+Y++T++ G L + L FLFQ+ + Y + ++ Q+ L F
Sbjct: 256 IQMWIFILEYIKTLDRTGALKQEDILRFLFQISYCQVNAYYPVQDLTKTQQLLLTDFNNF 315
Query: 195 GLVYQRKRKAGRFYPTKLALNMATRGTLK------QIREPG-------FLIVETNFRVYA 241
GL+Y+++ + RFY T LA+N+ GT Q+ P IVETNF+VYA
Sbjct: 316 GLLYRKRSNSDRFYTTSLAINLIFGGTTTSNYSETQVARPRMSSMSDLLTIVETNFKVYA 375
Query: 242 YTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV- 300
YT S L +A++ +F +++ R PNL +G +TR+S+R+AL GI+A QI FL HA P +
Sbjct: 376 YTTSTLHIAMLSVFVDIVARLPNLAIGFITRESLRSALVHGISAQQIYDFLLKHAHPRMK 435
Query: 301 -SCPL-PPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
+ P+ P ++DQI LWE ER R+ EG+L+ F S+ +FE++R+YAQ LTW +
Sbjct: 436 KNNPIVPENIADQIYLWEKERNRIKFLEGILFDGFNSKEEFEQVREYAQKQRFLTWTDPI 495
Query: 359 KRTVVVTKQGHSDIKKF 375
+ ++ G ++ F
Sbjct: 496 HLKLAISSNGIESVRNF 512
>gi|302806060|ref|XP_002984780.1| hypothetical protein SELMODRAFT_121159 [Selaginella moellendorffii]
gi|302808297|ref|XP_002985843.1| hypothetical protein SELMODRAFT_123271 [Selaginella moellendorffii]
gi|300146350|gb|EFJ13020.1| hypothetical protein SELMODRAFT_123271 [Selaginella moellendorffii]
gi|300147366|gb|EFJ14030.1| hypothetical protein SELMODRAFT_121159 [Selaginella moellendorffii]
Length = 459
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 136/420 (32%), Positives = 212/420 (50%), Gaps = 43/420 (10%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK YV+RLL+++ VP ++ W+ +HK A D L +LR
Sbjct: 41 LPPLAKLYVLRLLYLDAAVPDKMLLEWIKGDAVAKHKNAVDRLVQLRVFLPEEKILSRKK 100
Query: 49 -----------RNAKVALLGG-GKPWS-MSAKLEVD-SKARDLDFLNQYALERWECILRF 94
++AL G G P + + V A++LD YA E+WE
Sbjct: 101 ETHYRVNPKFQEQLRLALSTGLGTPRDPLPPDIAVRMPSAKELD---DYATEKWEAWFSS 157
Query: 95 MVGSQQTEGISADA---VRTLFNEG-----AENPVITKDGFQFLLLETPAQVWYFILKYL 146
+V +G SA + LF + ++P IT GFQFLLL+ +Q+W I +Y+
Sbjct: 158 LVNFASPDGPSASTNGFIVQLFQKADLLTSGQDPKITPAGFQFLLLDRNSQLWRVIREYV 217
Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGR 206
+ E++ +D E + FL ++ F + G YS + + + + F + L G++ +K R
Sbjct: 218 QYAEARQIDTGELIRFLLEIGFYSVGEPYSMDSLPNSQRNFAEELAMLGVLQLQKGMKDR 277
Query: 207 -FYPTKLALNMATRGTLKQI-REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
F PT+LA ++ + + GF++VETNFRVYAYT S L + + +F Y PN
Sbjct: 278 WFIPTRLATGLSASLSESSAWQTEGFIMVETNFRVYAYTSSKLHIETLHVFVRTEYVLPN 337
Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERER 321
+ VG +T++SV A SGI+A QII FL+ HA P V+ +P V V DQIRLWE +R R
Sbjct: 338 ILVGSITKESVNGAFASGISADQIIKFLQQHAHPFVAQKVPSVPETVCDQIRLWESDRVR 397
Query: 322 LTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
+ Y F S + +E + +A+D L W++ ++ +VV + H I+ F N K
Sbjct: 398 VQYLPAYCYEGFPSTSVYESVVAHARDRNGLLWEDANRKMIVVGGEHHEAIRAFLQNINK 457
>gi|301098671|ref|XP_002898428.1| general transcription factor IIH subunit, putative [Phytophthora
infestans T30-4]
gi|262105199|gb|EEY63251.1| general transcription factor IIH subunit, putative [Phytophthora
infestans T30-4]
Length = 483
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 136/450 (30%), Positives = 223/450 (49%), Gaps = 75/450 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWL------SKTHAKEHKEATDTLTELR------- 48
L +A+ +V+RLL VP+ ++ W+ +K + A L LR
Sbjct: 32 LPSLAQQFVMRLLPTNSAVPRDLLEQWVVPIPGETKRMPPQFNAALQKLEGLRVFVQQDG 91
Query: 49 ---------RNAKVALLG-GGKPWS-----MSAKLEVDSKARDLDFLNQYALERWECILR 93
+ AL GG PW + E A DL+ +YA RW+ +L
Sbjct: 92 GYKPHPTFQKQLMYALSNLGGSPWERGRLHLPRDPENTFAAVDLE---RYARARWDLVLH 148
Query: 94 FMVGSQQTEGISADAVRTLFN------EGAENPV--ITKDGFQFLLLETPAQVWYFILKY 145
+MVGS + V L GA++ IT G++F+L + Q+W F+L+Y
Sbjct: 149 YMVGSTAVQEPPQSVVDILLRTKLLQASGADSRALHITDTGYEFMLKDIHVQMWIFMLEY 208
Query: 146 LETVESKG-LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKA 204
+ T+++ G L + L FLFQ+ + G Y+ +++ ++ L +FGL++++++ +
Sbjct: 209 IRTLDNTGTLKQEDILQFLFQISYCQTGGYYAVADLTETQRLLLGDFIDFGLLFRKRQNS 268
Query: 205 GRFYPTKLALNM--------------------ATRGTLK-QIREPG-----------FLI 232
RFY T LA+N+ R +K Q+ +P ++
Sbjct: 269 DRFYTTSLAVNLIFGGSTGQKRSHVSLTSSFAGVRAGMKSQVADPRQAPTVDHGARLLVV 328
Query: 233 VETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFL 292
VETNF++YAYT S L VA++ +F +++ R PNL +G +TR+S+R+AL GI+A QI FL
Sbjct: 329 VETNFKIYAYTSSTLHVAMLSVFVDIVARLPNLAIGFITRESLRSALIHGISAQQIYDFL 388
Query: 293 RLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDL 349
HA P + S +P ++DQI LWE ER R+ EG+L+ F ++ D+E +RDYA+ L
Sbjct: 389 MKHAHPKMRRNSPVIPENIADQIYLWERERNRVQFLEGILFDGFNTKEDYESVRDYAKGL 448
Query: 350 GVLTWQNEKKRTVVVTKQGHSDIKKFWVNH 379
VLTW + + + G +++ F N
Sbjct: 449 MVLTWSDPIHFRLSIATAGIDEVRHFIQNQ 478
>gi|402083674|gb|EJT78692.1| RNA polymerase II transcription factor B subunit 2 [Gaeumannomyces
graminis var. tritici R3-111a-1]
Length = 496
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 134/444 (30%), Positives = 218/444 (49%), Gaps = 80/444 (18%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLS------KTHA----------------KEHKE 39
L +AK +V+ LL++ +P+ A + +W K HA KE +
Sbjct: 38 LPSLAKTFVMALLYMPKPLSLADLDTWAKPDARKQKDHALAILRVLHIVLITAPSKEKSQ 97
Query: 40 ATDTLTELRRNAKVALLGGG---KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV 96
+++ ++AL GGG + + + L V D+ FL+++A RW+ IL ++V
Sbjct: 98 EVSLTPNFKKSLRLALEGGGTSHNTFGVPSSLPVPPNV-DIPFLDKFARTRWDAILHYVV 156
Query: 97 GSQQTEGISADAVRTLFNEGAE--------------------NPVITKDGFQFLLLETPA 136
S + + + + L + G++ + I+K GF FLL E+ A
Sbjct: 157 NSVEGDEFAQRPSKRLHSGGSKLQDTVKELLVAGGLVERRGGSISISKTGFTFLLQESNA 216
Query: 137 QVWYFILKYLETVESKGLD----LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLR 192
QVW +L++LE V + G D V+ L+FLF L G Y TE +S++ + L L
Sbjct: 217 QVWTLLLQWLEAVNAAGGDHSAMAVDMLSFLFMLGTLELGQAYDTEELSEQRRNMLPSLV 276
Query: 193 EFGLVYQRKRKAGRFYPTKLALNMA----------------------TRGTLKQIREPGF 230
+FGLVY +++PT+LA + G + G
Sbjct: 277 DFGLVYIPPGNTSQYFPTRLATTLTSGSSALRSASSALAAATAEGPNASGGGQSEAAKGS 336
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
+I+ETN+R+YAYT + L++A++GLF EL +RF + G L R+S++ A+ GITA Q+I
Sbjct: 337 IIIETNYRLYAYTSTPLQIAILGLFAELRFRFAGMVTGRLDRESIKRAISYGITADQVIE 396
Query: 291 FLRLHALPSV--------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKL 342
+L HA + LPP V DQIRLW+ E ER+ G L+ F SQA+++ L
Sbjct: 397 YLAAHAHEQMHRTATLRKKPVLPPTVVDQIRLWQLETERMKIMRGFLFRDFDSQAEYDDL 456
Query: 343 RDYAQDLGVLTWQNEKKRTVVVTK 366
YA ++GVL W+++ ++ +K
Sbjct: 457 AKYADEIGVLLWRSDARQLFFASK 480
>gi|290987279|ref|XP_002676350.1| predicted protein [Naegleria gruberi]
gi|284089952|gb|EFC43606.1| predicted protein [Naegleria gruberi]
Length = 532
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 117/323 (36%), Positives = 186/323 (57%), Gaps = 26/323 (8%)
Query: 78 DFLNQYALERWECILRFMVGSQ-QTEGISADAVRTLFN-----EGAENPVITKDGFQFLL 131
+FL +Y+ +WE +L ++VG+ ++E + V L + ++ IT +GFQFLL
Sbjct: 204 EFLQEYSKRKWENVLLYLVGTAAESEQTFSGNVNELLKYSQLIKKSDTVRITNEGFQFLL 263
Query: 132 LETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHL 191
ET QVW + YLET + E L F+F+L F G +YS++ ++ + L
Sbjct: 264 QETKVQVWKLLKHYLETSGQRNQVKNEILNFIFELGFLDVGKEYSSKDLTSTQKSLLVDF 323
Query: 192 REFGLVYQRKRKAGR-----FYPTKLA----LNMATR-------GTLKQIREPGFLIVET 235
+ G++Y + K + F+PT LA ++M+T G+ I G++IVET
Sbjct: 324 NDLGIIYLHRDKKKKIKDKYFFPTPLAKSLTVSMSTSYDLISSFGSHNSINN-GYIIVET 382
Query: 236 NFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLH 295
N+RVYAYT+S L++AL+ LF YR PN+ VG++TR ++R AL++GI+A QI+ FLRL+
Sbjct: 383 NYRVYAYTNSPLQIALLSLFIFPEYRLPNMVVGLITRSTIREALKNGISAHQILQFLRLN 442
Query: 296 ALPSVSCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
A P + +P VSDQI LWE ER R+ + V+Y +F + ++ DYA+ G L
Sbjct: 443 AHPQMRLKKPVIPDTVSDQILLWEKERNRVIKTTSVVYDKFTNVSELNATVDYARRQGAL 502
Query: 353 TWQNEKKRTVVVTKQGHSDIKKF 375
W +E+K +V ++ H +K+F
Sbjct: 503 LWHSEEKMMMVCKREFHGLMKEF 525
>gi|302917488|ref|XP_003052448.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256733388|gb|EEU46735.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 496
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 138/451 (30%), Positives = 226/451 (50%), Gaps = 80/451 (17%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
L +AK +V+R+L+ +P+ + W+ + ++ +A L
Sbjct: 41 LPHLAKTFVMRILYSPKPILLTDLDDWVKPSAKRQKDQALSILRVLYIVQITAPSKERPQ 100
Query: 45 -----TELRRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVG 97
+ + ++AL GGG + + + L+V + D+ FL++YA +WE IL F+V
Sbjct: 101 EMQLTANFKTSLRLALSGGGTHNSFGVPSTLQVPPEI-DIIFLDRYARRKWEDILHFVVS 159
Query: 98 S--QQTEGISA---DAVRTLFNEG-----AENPV--ITKDGFQFLLLETPAQVWYFILKY 145
S ++ G S+ +V+ L G N + IT+ GF FLL E AQVW +L +
Sbjct: 160 SVGYKSTGDSSGPNKSVKELLIAGRLVDRKPNGMVGITQAGFTFLLQEANAQVWTLLLLW 219
Query: 146 LETVE---SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKR 202
LE ++ GL+ + L+FLF L G Y T ++++ + L L +FGL+Y
Sbjct: 220 LEAMDVNKGAGLEATDMLSFLFVLASMELGRAYDTNALTEQRKNMLPSLVDFGLIYIPNH 279
Query: 203 KAGRFYPTKLALNMATRG-TLKQIREP-----------------------------GFLI 232
K F+PT+LA + + G +L+ I + G +I
Sbjct: 280 KRSMFFPTRLATTLTSSGNSLRSISDGVAAATAAALQPGQSGGSGGSTTGSGQEQRGSVI 339
Query: 233 VETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFL 292
+ETN+R+YAYT S L++A++ LF +L RFP++ G L+R S+R A+ GITA QII +L
Sbjct: 340 IETNYRIYAYTQSTLQIAVLALFSKLSMRFPDMVAGRLSRQSIRQAINFGITADQIISYL 399
Query: 293 RLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRD 344
HA + + P LPP V DQIRLW+ E ER+ T+ G L+ F ++ +
Sbjct: 400 AAHAHDQMHRTAALNNKPVLPPTVVDQIRLWQLENERMKTTSGFLFKDFEDHKEYMAVAG 459
Query: 345 YAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+A+++GVL W+N+ K +K H I+ +
Sbjct: 460 FAEEVGVLVWRNDVKGMFFASK--HEQIRDY 488
>gi|452839401|gb|EME41340.1| hypothetical protein DOTSEDRAFT_156149 [Dothistroma septosporum
NZE10]
Length = 497
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 134/431 (31%), Positives = 211/431 (48%), Gaps = 67/431 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEH--KEATDTLTELR----------- 48
L +AK V+ +L++ P A + +W+ + +A TL LR
Sbjct: 38 LPHLAKTLVMAMLYMPTPFAAADLDTWIRPDMPSQQARDKALSTLQRLRILFDDQHNGKP 97
Query: 49 ---------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGS 98
++ ++AL GGG S D + ++L+ +A +WE IL ++VGS
Sbjct: 98 AYKLSPAYGKSLRLALTGGGDHRSFGVPCSTPDKTSISTEYLDTFARRQWEAILYYVVGS 157
Query: 99 QQTEGISAD-----AVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKY 145
+S + ++L +G + +I+ GF FLL + AQVW ++ Y
Sbjct: 158 AN-RALSGEVDISPGTKSLLQKGEFVNVMSGGKQRMISTTGFTFLLQDVNAQVWSLLIVY 216
Query: 146 LETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG 205
LE S +D VE L+FLF L G YST ++ Q L L +FGL+Y+R +
Sbjct: 217 LEVSSSLRMDPVEVLSFLFTLGSLELGISYSTANLTPTQQQMLDDLADFGLIYRRTPDSH 276
Query: 206 RFYPTKLALNMA------TRGTLKQIR----------------EPGFLIVETNFRVYAYT 243
R+YPT+LA + T +L + E GF+I+ETN+R+YAYT
Sbjct: 277 RYYPTRLATTLTSDAPALTNNSLMKTTVMTGDPSTDLAASASDEKGFIILETNYRLYAYT 336
Query: 244 DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--- 300
S L ++++ LF L R+PNL +T+ S + A+ +GIT+ QII +L HA P +
Sbjct: 337 SSPLLISILSLFASLNTRYPNLITAKITKASTQNAIAAGITSDQIIDYLTTHAHPVLRRQ 396
Query: 301 ----SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQ 355
P LPP V DQI+LW+ E ER+ ++G L SQ +++K +YA +GVL+ +
Sbjct: 397 AAANDAPILPPTVVDQIKLWQMEGERMEATKGYLIRDIGSQEEYDKAVNYADAIGVLSKE 456
Query: 356 NEKKRTVVVTK 366
K + VT+
Sbjct: 457 FRHKGSFFVTR 467
>gi|320581720|gb|EFW95939.1| TFIIH basal transcription factor complex subunit 2 [Ogataea
parapolymorpha DL-1]
Length = 544
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 140/486 (28%), Positives = 222/486 (45%), Gaps = 115/486 (23%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW-----------LSKTHAKE---HKEATDTLTEL 47
L +AK Y++ +LF + +P + W LSK + + +A + L E
Sbjct: 47 LPSLAKFYIMTMLFQDAAIPSTDLNRWIKSSSFNGSRNLSKIYQNDSLKRLKALNLLKET 106
Query: 48 RR------------------------NAKVALLGGGKPWSMSAKLEVDS-KARDLDFLNQ 82
RR + + AL G + ++S + ++FL+
Sbjct: 107 RRIFTHPQTGQTTQLPFVSLNPTFRQSFRNALTGSRDANEVVEDNNLESIETISVEFLDT 166
Query: 83 YALERWECILRFMVGSQQTEGISADAVRTLFNEGA------------------------- 117
Y L RWE IL FMVGS+ E S + L G
Sbjct: 167 YCLHRWESILHFMVGSETKELPSVGVLTLLRYSGLMELPSDREQREHLGDDYEFQPKLQQ 226
Query: 118 -------ENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFST 170
+N +IT++GFQFLL + +Q+W +L+YL+ E ++ VE L F+F L
Sbjct: 227 SSSIQTLKNLLITQNGFQFLLQDINSQIWTLLLQYLKMSEKLMMNPVEVLNFIFMLGSLE 286
Query: 171 FGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGR-FYPTKLALNMATRGT-------- 221
G Y + +SD ++ L L ++GL+Y + + FYPT+LA + + T
Sbjct: 287 LGQGYPVDPLSDTQKIMLDDLIDYGLIYTPSKTNKKIFYPTRLATTLTSEHTNFKTSAAV 346
Query: 222 -------LKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDS 274
K G ++VETNF++Y YT S L++A++ LF L RF N+ GV+TR+S
Sbjct: 347 IDQEITNSKNSNNQGTIVVETNFKIYCYTSSPLQIAILNLFVHLKARFANMVTGVITRES 406
Query: 275 VRAALRSGITAAQIIGFLRLHALP------------------SVSCP----------LPP 306
VR AL +GITA Q+I +L HA P S+ LPP
Sbjct: 407 VRRALINGITADQMINYLGSHAHPWMVKQAEDKLARKLELESSIGNAAGQRQIDLEILPP 466
Query: 307 VVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
V DQI+LW+ E +R+ + +G LY F + +FEKL Y +++GV+ W++ ++ VT+
Sbjct: 467 TVVDQIKLWQLELDRIQSFKGYLYKDFSTDLEFEKLLTYGEEIGVIVWKDRARKRFFVTQ 526
Query: 367 QGHSDI 372
+G+ +
Sbjct: 527 EGNGQL 532
>gi|403180193|ref|XP_003338499.2| hypothetical protein PGTG_19983 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375166015|gb|EFP94080.2| hypothetical protein PGTG_19983 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 679
Score = 200 bits (509), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 106/263 (40%), Positives = 157/263 (59%), Gaps = 10/263 (3%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS 181
IT GF FLL + Q+W +L+YL+ E+ GLD+V+ L LF L G +YS +
Sbjct: 37 ITSKGFGFLLEDVNTQLWDILLQYLKMTEANGLDVVDVLACLFMLGSLELGQEYSFSNWT 96
Query: 182 DKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTL-----KQIREPGFLIVETN 236
LQ L ++GLV RFYPT+LA + + + E GFL++ETN
Sbjct: 97 PTQTQVLQDLVDYGLVL--VSAPDRFYPTRLATTLTSTAPPLVSAERAQEEHGFLVLETN 154
Query: 237 FRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHA 296
+R+YAYT + L++A++ LF L YRFPNL VG +TR+S+++AL +GITA Q+I +L HA
Sbjct: 155 YRIYAYTSNPLQIAVLNLFLSLRYRFPNLVVGAVTRESIKSALSNGITADQVIMYLHTHA 214
Query: 297 LPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
P + LPP V DQIRLWE E+ R+ EG LY F S A+++ + Y++ LG++
Sbjct: 215 HPQMRKLEPLLPPTVVDQIRLWELEKNRIRAQEGYLYEDFKSAAEYDSVIQYSRKLGIVL 274
Query: 354 WQNEKKRTVVVTKQGHSDIKKFW 376
W++ R + V GH +++F+
Sbjct: 275 WEHAGLRKLFVGYDGHLTLREFF 297
>gi|346323394|gb|EGX92992.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Cordyceps militaris CM01]
Length = 511
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 139/455 (30%), Positives = 213/455 (46%), Gaps = 103/455 (22%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
L +AK V+R+L++ P+ A + +W+ ++ + TL
Sbjct: 41 LPHLAKTIVMRMLYMSGPMSLADLDAWVKPEAKRQKDQMLSTLRSLHIVQISAPAKDKTQ 100
Query: 45 -----TELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
T R + ++AL GGG P S++ E+D+ FL++YA RW+ IL
Sbjct: 101 EVQLTTNFRTSMRLALTGGGNHNSFGVPSSLTVPPEIDAP-----FLDRYARRRWDEILH 155
Query: 94 FMVGS--------------QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLL 132
F+V S G +VR + G + + IT+ GF FLL
Sbjct: 156 FVVSSVGFKSTATTAAGGEGGGGGGPNKSVRDILIAGRLVDRRPSGSVGITQAGFTFLLQ 215
Query: 133 ETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLR 192
E AQVW +L +L+ ++ V+ L+FLF L G Y T +S++ + L L
Sbjct: 216 EANAQVWTLLLLWLDAQDADKNAAVDTLSFLFMLASLEPGRAYDTHALSEQRRDMLPALA 275
Query: 193 EFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR-------------------------- 226
+FGLVY +K F+PT+LA + T G +R
Sbjct: 276 DFGLVYIPPQKRSMFFPTRLATTL-TSGASSGLRSISDGVAAATSAARSSPSSSAHQPAS 334
Query: 227 --------------EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
+PG +++ETN+R+YAYT S L++A++ LF L RFP++ G LTR
Sbjct: 335 SSAGSGPLGGGSGDQPGAVVIETNYRLYAYTQSTLQIAVLALFARLTMRFPDMVAGRLTR 394
Query: 273 DSVRAALRSGITAAQIIGFLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTT 324
S+R A++ GITA QII +L HA + + P LPP V DQIRLW+ E ER+ T
Sbjct: 395 TSIRQAIQFGITADQIIAYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKT 454
Query: 325 SEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKK 359
+ G L+ F ++ R +A ++GVL W ++KK
Sbjct: 455 TPGFLFRDFDDPKEYHDTRRFADEIGVLVWSDDKK 489
>gi|355693901|gb|AER99490.1| proteinral transcription factor IIH, polypeptide 4, 52kDa [Mustela
putorius furo]
Length = 265
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 106/219 (48%), Positives = 141/219 (64%), Gaps = 30/219 (13%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 49 ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
+ +S D + L G E P IT GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQH 190
+G+DLVE L+FLFQL FST G DYS EGMSD L FLQH
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQH 265
>gi|171681598|ref|XP_001905742.1| hypothetical protein [Podospora anserina S mat+]
gi|170940758|emb|CAP66407.1| unnamed protein product [Podospora anserina S mat+]
Length = 540
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 135/430 (31%), Positives = 214/430 (49%), Gaps = 75/430 (17%)
Query: 25 IASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQ 82
IA T AK+ ++ T +++ ++AL GGG + + + L D K + +L+Q
Sbjct: 115 IAQLSQSTRAKKAQDIQLT-ANFKKSLRLALEGGGSHNSFGVPSTLPTDPKIH-IQYLDQ 172
Query: 83 YALERWECILRFMVGSQQTEG------------ISADAVRTLFN------EGAENPV-IT 123
+A W+ IL ++V S + AVR L EGA V I+
Sbjct: 173 WAGRIWQDILYYVVNSVPMKANESGGRHGSGGGGPKRAVRELLKMGGLVREGAGGLVQIS 232
Query: 124 KDGFQFLLLETPAQVWYFILKYLET--------------VESKGLDLVECLTFLFQLKFS 169
+ GF FLL E AQVW +L +LE + +D VE L+FLF L
Sbjct: 233 EHGFNFLLQEANAQVWTLLLLWLEAADRNKALAKEQGTDITGTAIDNVEMLSFLFMLASL 292
Query: 170 TFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR--- 226
G Y T +++ + L L +FGL+Y + + ++YPT+LA + + T++ +
Sbjct: 293 ELGRAYDTSALTETRKNMLPALADFGLIYIDRDRPQQYYPTRLATTLTSLSTMRSVSASI 352
Query: 227 -------------------------EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYR 261
E G ++VETN+R+YAYT S L++A++ LFC L R
Sbjct: 353 DAATKKTPGDAGSLGADSTPTAPADENGGIVVETNYRIYAYTSSPLQIAILKLFCRLHMR 412
Query: 262 FPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--------SCPLPPVVSDQIR 313
FPN+ LTR+SV+ A++ GITA QII +L HA P + + +PP V DQIR
Sbjct: 413 FPNMVTARLTRESVQEAIKEGITANQIIDYLVAHAHPQMRRAAAARGTTVIPPTVMDQIR 472
Query: 314 LWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIK 373
LW+ E +R+ + G + F S ++ +L +YA ++GVL W++++K T V+K I+
Sbjct: 473 LWQLESQRMQKTPGFQFKDFESVEEYRQLAEYATEIGVLVWKDDRKGTFFVSKV--EQIR 530
Query: 374 KFWVNHQKGS 383
+F +KG+
Sbjct: 531 EFLKARKKGN 540
>gi|453082965|gb|EMF11011.1| RNA polymerase II transcription factor B subunit 2 [Mycosphaerella
populorum SO2202]
Length = 507
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 140/431 (32%), Positives = 210/431 (48%), Gaps = 66/431 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTH-AKEHKE-ATDTLTELR----------- 48
L +AK V+ LL++ QP A + +W+ H + + KE A L L+
Sbjct: 38 LPHLAKTMVMALLYLPQPFAAADLDAWVKPDHDSTQAKERALSILQRLKIVYDEPYQGRP 97
Query: 49 ---------RNAKVALLGGGKPWSMSAKL-EVDSKARDLDFLNQYALERWECILRFMVGS 98
++ + AL G G S E D +D+L+ +A +WE IL ++VGS
Sbjct: 98 AYRLSPAFGKSLRSALTGSGTHNSFGVPCTEPDKHPVTVDYLDTFARTQWEAILYYVVGS 157
Query: 99 QQTEGIS----ADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYL 146
+ RTL G + IT DGF FLL + AQVW ++ YL
Sbjct: 158 ASAGLGGGVDISSGTRTLLETGKFVVVKGGGRHRAITTDGFTFLLQDVNAQVWSLLIVYL 217
Query: 147 E-TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG 205
E + + +D V+ L+FLF L G YST ++ L L FGLVY+R +
Sbjct: 218 EVSGDLLQMDSVDVLSFLFTLGSLELGVSYSTSNLTPTQLQMLDDLTNFGLVYRRSPSSD 277
Query: 206 RFYPTKLALNMATRG------------TLKQIREP----------GFLIVETNFRVYAYT 243
R+YPT+LA + + T Q +P G++I+ETN+R+YAYT
Sbjct: 278 RYYPTRLATTLTSDAPALPNNSFTTTTTTNQNADPNDPAATASEKGYIILETNYRLYAYT 337
Query: 244 DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--- 300
S L ++++ LF L R+PNL +T+ S+ A+ SGIT+ QII +L+ HA P +
Sbjct: 338 SSPLPISILSLFASLNTRYPNLITAKITKTSIHTAIASGITSDQIISYLQTHAHPILRRQ 397
Query: 301 ----SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQ 355
S P LPP V DQIRLW+ E ER+ +++G L ++ D+ K YA+ LG+L +
Sbjct: 398 AALNSAPILPPTVVDQIRLWQIEGERMKSTKGYLIRDVGTEDDYTKAVQYAEALGILVKE 457
Query: 356 NEKKRTVVVTK 366
+R VT+
Sbjct: 458 FHHRRAFFVTR 468
>gi|330797174|ref|XP_003286637.1| hypothetical protein DICPUDRAFT_31198 [Dictyostelium purpureum]
gi|325083385|gb|EGC36839.1| hypothetical protein DICPUDRAFT_31198 [Dictyostelium purpureum]
Length = 471
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 134/431 (31%), Positives = 228/431 (52%), Gaps = 62/431 (14%)
Query: 6 AKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR----------------- 48
AK Y+ ++L V+ P + W + +HKEA L +L+
Sbjct: 36 AKQYIFKMLMVD-TYPLTIAKDWSQHSSNHQHKEALKKLFDLKIILLNRGKQQSEQSIQL 94
Query: 49 -----RNAKVALLGGGKP-WSMSAKLEVDSKARDLDFLNQYALERWECILRFMV--GSQQ 100
N K AL+ K +S S+ ++ +K +++ L+ Y+ ++WE +L F+ +Q
Sbjct: 95 NPLFQENIKSALVEFDKVIFSNSSNIKDTNKIPNINDLDNYSKKQWEQVLYFLSDDAAQP 154
Query: 101 TEGISADAVRTLFNEGAENPV--ITKDGFQFLLLETPAQVWYFILKYLETVESK---GL- 154
++ IS + + ++ IT DGF+FLL + Q+W ++ YL +E K GL
Sbjct: 155 SKFISDLLLSSNLTRRDQDGSLSITSDGFKFLLKDVYTQIWTLLIVYLNDLEKKRREGLT 214
Query: 155 --DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
+ L FLF+L F G Y +++ + +L L++FGL+Y + + FYPT+L
Sbjct: 215 QGSRNDLLGFLFRLSFLQLGKGYLVGELTELQKEYLICLKQFGLIYMKSDASLFFYPTRL 274
Query: 213 ALNMATRGTLKQI-------------------------REPGFLIVETNFRVYAYTDSNL 247
+++ T TL I +E G++++ETN+R+YAYT S+L
Sbjct: 275 IISLTTGKTLSLIQTLAVDKINSSSNSANTVASSTIEKKESGYIVLETNYRLYAYTASSL 334
Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPL 304
+++L+ LF +++YR PNL VG++TR+S+R AL GITA QII F+R +A P+ P+
Sbjct: 335 QISLLSLFVKMLYRLPNLAVGIITRESIRTALIHGITADQIIDFIRHNAHPNAVNNGQPI 394
Query: 305 PPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVV 364
P VV++QI LWE ER R+T ++ VLY+ F + + +A++ W N++ +T+VV
Sbjct: 395 PDVVAEQILLWEQERNRITYTKSVLYNSFPTTDCYHATLKFAKEQDYYIWSNDQLKTLVV 454
Query: 365 TKQGHSDIKKF 375
+ G+ I+ F
Sbjct: 455 NENGNDPIRNF 465
>gi|346972761|gb|EGY16213.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
dahliae VdLs.17]
Length = 476
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 140/435 (32%), Positives = 213/435 (48%), Gaps = 67/435 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATD-TLTELR------------ 48
LS +AK +V LL++ P+P A + + + E+K A D L +LR
Sbjct: 41 LSPLAKTFVTMLLYLPGPIPIADLEARVKP----EYKRAKDHALAQLRSLHMLQMSVPTQ 96
Query: 49 -------------RNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRF 94
++ + AL G G P S S ++ + ++A ++W+ IL +
Sbjct: 97 GAPQMIQLTANFSKSYREALEGNGAPGSFGVISPHPRSPEITIESIEKHARQKWDAILHY 156
Query: 95 MVGSQQT----EGISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILK 144
+V S G ++V+ L G + IT+ GF FLL E AQVW +L
Sbjct: 157 IVNSVTPGFVDSGGPKNSVKELLLAGQLVKRQGGSTTITQAGFTFLLQEENAQVWTLLLL 216
Query: 145 YLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKA 204
+L+ + + + L+FLFQL Y TE +S + Q L L +FGLVY K
Sbjct: 217 WLDATDQQTAKHTDMLSFLFQLSILELDEVYDTEALSKERQDMLPSLMDFGLVYIPSHKP 276
Query: 205 GRFYPTKLA-----LNMATR---GTLKQIREP--------GFLIVETNFRVYAYTDSNLK 248
+++ T A ++ ATR + + +P +IVETN+RVYAYT S L+
Sbjct: 277 FQYFVTPQARALTNVSTATRTLTDGMNALSQPQGGPGDQTNGIIVETNYRVYAYTTSQLQ 336
Query: 249 VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------- 300
+A++ LFC L +FP + G LTR SVR A+ GITA QII +L HA +
Sbjct: 337 IAVLSLFCHLSVKFPGMVSGRLTRQSVRQAIDFGITADQIISYLAAHAHEQMHRQAAADG 396
Query: 301 SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKR 360
LPP V DQIRLW+ E ER+ T+ G L+ F A++ + +A ++GVL W+N++KR
Sbjct: 397 KPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFDDDAEYRDIARFADEIGVLAWRNDRKR 456
Query: 361 TVVVTKQGHSDIKKF 375
+K H I+ +
Sbjct: 457 MFFASK--HEQIRDY 469
>gi|302405781|ref|XP_003000727.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
albo-atrum VaMs.102]
gi|261360684|gb|EEY23112.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
albo-atrum VaMs.102]
Length = 476
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 143/435 (32%), Positives = 214/435 (49%), Gaps = 67/435 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATD-TLTELR------------ 48
LS +AK +V LL++ P+P A + + + E+K A D L +LR
Sbjct: 41 LSPLAKTFVTMLLYLPGPIPIADLEARVKP----EYKRAKDHALAQLRSLHMLQMSVPTQ 96
Query: 49 -------------RNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRF 94
++ + AL G G P S S ++ + ++A ++W+ IL +
Sbjct: 97 GAPQMIQLTANFSKSYREALEGNGAPGSFGVISPHPRSPEITIESIEKHARQKWDAILHY 156
Query: 95 MVGSQQT----EGISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILK 144
+V S G ++V+ L G + IT+ GF FLL E AQVW +L
Sbjct: 157 IVNSVTPGFVDSGGPKNSVKELLLAGQLVKRQGGSTTITQAGFTFLLQEENAQVWTLLLL 216
Query: 145 YLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKA 204
+L+ + + + L+FLFQL Y TE +S + Q L L +FGLVY K
Sbjct: 217 WLDATDQQTAKHTDMLSFLFQLSILELDEVYDTEALSKERQDMLPSLMDFGLVYIPSHKP 276
Query: 205 GRFYPTKLA-----LNMATR------GTLKQIRE-PG----FLIVETNFRVYAYTDSNLK 248
+++ T A ++ ATR L Q + PG +IVETN+RVYAYT S L+
Sbjct: 277 FQYFVTPQARALTNVSTATRTLTDGMNALSQSQGGPGDQTNGIIVETNYRVYAYTTSQLQ 336
Query: 249 VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------- 300
+A++ LFC L +FP + G LTR SVR A+ GITA QII +L HA +
Sbjct: 337 IAVLSLFCHLSVKFPGMVSGRLTRQSVRQAIDFGITADQIISYLAAHAHEQMHRQAAADG 396
Query: 301 SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKR 360
LPP V DQIRLW+ E ER+ T+ G L+ F A++ + +A ++GVL W+N++KR
Sbjct: 397 KPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFDDDAEYRDIARFADEIGVLAWRNDRKR 456
Query: 361 TVVVTKQGHSDIKKF 375
+K H I+ +
Sbjct: 457 MFFASK--HEQIRDY 469
>gi|384246869|gb|EIE20357.1| Tfb2-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 451
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 136/415 (32%), Positives = 214/415 (51%), Gaps = 41/415 (9%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L+ +AK YVIR+LF++ PV ++V+ SW++++ +H A + L L
Sbjct: 32 LTSLAKQYVIRMLFLDVPVTKSVMDSWIAQSSKSKHATAINRLKGLQLLLPSNVASNGQQ 91
Query: 48 ---------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
R + A+ GGK V +A + L+ YA ++WE I ++
Sbjct: 92 AAYQLHPVFREQLRWAVSTGGKVGIDEVPAGVLQQAPSREMLSSYAQQQWEVITVPLLNQ 151
Query: 99 QQTE---GISADAVRTLF-----NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVE 150
+ G S +++TL + AE+ I + GFQFLL +T +Q+W + Y+ + E
Sbjct: 152 LKCTCGLGSSFASLQTLMLYLVGSICAESRSIEEQGFQFLLSDTYSQLWRLLRAYIASGE 211
Query: 151 SK-GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY- 208
+ G L L FL QL F G+ ++ G+ D + + + GL+ K G +
Sbjct: 212 ERSGAPLGTILNFLLQLGFREVGSPFALSGLDDSQRHIAADMAQLGLLMPFTAKDGSVWL 271
Query: 209 -PTKLALNMA--TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
PT+LAL +A + G + GF++VETN+RVYAYT S L+ AL+ LF PNL
Sbjct: 272 APTRLALALAGGSSGQAQHDVTDGFVVVETNYRVYAYTSSLLQTALLRLFTRCECILPNL 331
Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC---PLPPVVSDQIRLWEGERERL 322
VGVLTR+SV AL G++A QI+ +LR HA P V+ +P VV+DQ+RLW+ + R+
Sbjct: 332 FVGVLTRESVTGALACGLSADQIVLYLRQHAHPHVASRTPVVPEVVADQVRLWQADTMRV 391
Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT--VVVTKQGHSDIKKF 375
+ VLY F S F+ A+ LGV W++ K + V + GH ++++
Sbjct: 392 RHNRAVLYDDFPSAQVFQLSAQKARTLGVWLWEDPKAGMGRLAVQEAGHDAMREY 446
>gi|449296697|gb|EMC92716.1| hypothetical protein BAUCODRAFT_259506 [Baudoinia compniacensis
UAMH 10762]
Length = 489
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 133/413 (32%), Positives = 197/413 (47%), Gaps = 62/413 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHA--KEHKEATDTLTELR----------- 48
L +AK V+ +L++ +P A + W+ + A L LR
Sbjct: 39 LPHLAKTLVMAMLYMPKPFAVADVDVWIKPDAEGLQARDRALSILQRLRILFDTRDEKDG 98
Query: 49 -----------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMV 96
R+ ++AL GGG S D + +L+ +A ++WE IL ++V
Sbjct: 99 RPAYKLSDGFARSLRMALTGGGNHRSFGVPSNAPDKQPVSGSYLDAFARQQWEAILYYVV 158
Query: 97 GSQQTE-GISADAV----RTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKY 145
GS G S DA+ + L +G IT+ GF FLL E AQ+W ++ Y
Sbjct: 159 GSAHAGLGGSTDAISAGTKQLLQKGEFVSVRGREAHITQRGFTFLLQEINAQIWTLLIVY 218
Query: 146 LETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG 205
LE + +D V+ L+FLF L G YST ++ Q L L +FGLVY+R
Sbjct: 219 LEVSGALHMDPVDVLSFLFTLGSLELGISYSTTNLTPTQQQMLDDLSDFGLVYRRSNDEA 278
Query: 206 RFYPTKLALNMATRGTL-----------------------KQIREPGFLIVETNFRVYAY 242
R+YPT+LA + + E G++I+ETN+R+YAY
Sbjct: 279 RYYPTRLATTLTSDAPALPNTSLTSTTSTTTLASSSTDPSASANEKGYIILETNYRLYAY 338
Query: 243 TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-- 300
T S L + ++ LF L R+PNL LT+ S+++A+ SGIT+ QII +L HA P +
Sbjct: 339 TSSPLPITILSLFATLKTRYPNLITAKLTKSSIQSAIASGITSDQIITYLTTHAHPILRR 398
Query: 301 -SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
+ LPP V DQIRLW+ E ER+T +G L + +++K YA LGVL
Sbjct: 399 QNPVLPPTVVDQIRLWQIEGERMTAWKGFLIRDVGTAEEYDKAVQYADALGVL 451
>gi|336270172|ref|XP_003349845.1| hypothetical protein SMAC_00733 [Sordaria macrospora k-hell]
gi|380095234|emb|CCC06707.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 496
Score = 197 bits (500), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 147/471 (31%), Positives = 223/471 (47%), Gaps = 103/471 (21%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWL-----SKTH-------------------AKEH 37
LS AK V LL++ P P + W K H K+
Sbjct: 38 LSHHAKVIVRGLLYMPDPFPLTSLDIWFPPESGGKIHRDRTIAALRSLHVLQVSPPGKDR 97
Query: 38 KEATDTLTELRRNAKVALLGGGKPWS--MSAKLEVDSKARDLDFLNQYALERWECILRFM 95
+ T ++ ++A GGG+ S + + L VD D+ +L++YA ++W+ IL ++
Sbjct: 98 TQEIQITTSFKKGLQLAFEGGGEDGSFGVPSSLPVDPTI-DIGYLDKYARKKWDDILHYV 156
Query: 96 VGS----QQTEGISAD--AVRTLFNEG---AENP------VITKDGFQFLLLETPAQVWY 140
V S +EG+S +V+ L G P IT+ GF FLL E+ AQVW
Sbjct: 157 VNSLGVQSSSEGVSGPKASVKELLLAGRLVERRPDTRTGIGITQAGFTFLLQESNAQVWT 216
Query: 141 FILKYLETVE---SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV 197
+L +LE + G+D ++ L+FLF L GT Y T +++ + L L +FGL+
Sbjct: 217 LLLLWLEAADHAADSGMDSIDMLSFLFMLSSLELGTAYDTSALTETRRNMLPSLVDFGLI 276
Query: 198 YQRKRKAGRFYPTKLALNMA-------------TRGTLKQIREPGF-------------- 230
Y R +F+PT+LA + T T EPG
Sbjct: 277 Y-FPRDTRQFFPTRLATTLTSTASALRTVSSAFTAATANPSGEPGTTTSTSGGPAGATPA 335
Query: 231 --------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRS- 281
+I+ETN+R+YAYT S L++A++ LFC+L RFPN+ V AALR+
Sbjct: 336 DKTGTSKGIIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNM---------VSAALRAI 386
Query: 282 --GITAAQIIGFLRLHALPSV--------SCPLPPVVSDQIRLWEGERERLTTSEGVLYS 331
GITA QII +L HA + LPP V DQIRLW+ E ER+ T+ G L+
Sbjct: 387 GFGITADQIISYLHAHAHQQMVREATVTGKTVLPPTVVDQIRLWQLENERMQTTAGFLFK 446
Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
F S ++ L YA+++GVL W+N+K++ +K H ++ + + +KG
Sbjct: 447 DFESTEEYAALSRYAEEIGVLVWKNDKRQLFFASK--HEQLRDYLKSRKKG 495
>gi|358385986|gb|EHK23582.1| hypothetical protein TRIVIDRAFT_36912 [Trichoderma virens Gv29-8]
Length = 492
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 145/453 (32%), Positives = 223/453 (49%), Gaps = 86/453 (18%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW---------------LSKTH-------AKEHKE 39
L +AK +V+R+LF+ P+ + +W L H +KE +
Sbjct: 39 LPHLAKTFVMRMLFLPHPMTLTDLDAWVKPEAKRKKDQSLSILRSLHIVQISAPSKEKPQ 98
Query: 40 ATDTLTELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
++ +++ ++AL GGG P S+ E+D L FL++YA ++WE +L
Sbjct: 99 EIQLMSNFKKSLRLALTGGGNHNSFGVPSSLLVPPEID-----LPFLDRYARKKWEDVLH 153
Query: 94 FMVGSQQTEGISADA------VRTLFNEG---AENPV----ITKDGFQFLLLETPAQVWY 140
F+V S + D V+ L G P IT+ GF FLL E AQVW
Sbjct: 154 FVVSSVGYKSSVGDGSGPNKGVKELLIAGRLVDRRPNGSLGITQAGFTFLLQEANAQVWT 213
Query: 141 FILKYLETV---ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV 197
+L +LE + + GLD V+ L+FLF L G Y T ++++ + L L +FGL+
Sbjct: 214 LLLLWLEVLGNNKGSGLDPVDMLSFLFMLASLELGRAYDTNALTEERRNMLPSLVDFGLI 273
Query: 198 YQRKRKAGRFYPTKLALNMATRG-TLKQIRE--------------------------PGF 230
Y + K F+PT+LA + + G +L+ I E G
Sbjct: 274 YIPQHKRSMFFPTRLATTLTSGGNSLRTISEGVTAATQSAQTSQQALGPLGGSGGEQAGS 333
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
+++ETN+R+YAYT S L++A++ LF +L RFP++ G L+R S+R A+ GITA QII
Sbjct: 334 VVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQAINFGITADQIIS 393
Query: 291 FLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKL 342
+L HA + + P LPP V DQIRLW+ E ER+ T+ G L+ F D+
Sbjct: 394 YLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFTDDKDYLDT 453
Query: 343 RDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+++++GVL W+N+ R K H IK F
Sbjct: 454 ARFSEEIGVLVWKNDHARMFFANK--HEQIKDF 484
>gi|115461368|ref|NP_001054284.1| Os04g0680000 [Oryza sativa Japonica Group]
gi|113565855|dbj|BAF16198.1| Os04g0680000 [Oryza sativa Japonica Group]
Length = 444
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 136/437 (31%), Positives = 214/437 (48%), Gaps = 92/437 (21%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRR------------ 49
L +AK Y +++L+V PV A + W+ +A +H+ A D L +LR
Sbjct: 40 LPPLAKKYALQMLYVSAPVAAAAMEEWVLDEYAAKHRVAIDRLLQLRVFVEVRDRRKEVS 99
Query: 50 ---------NAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWE----- 89
N + L+ GG P S++A+L L L +ALE+WE
Sbjct: 100 YKMNQKFQGNMQKYLVDGGSLPREPIPSSVTARLPT------LAELESFALEQWEVRMTK 153
Query: 90 ---CILRFMVGSQQTEGISADAVRTLFNE----GAENPVIT---KDGFQ----------- 128
CI + + + S L+ + A + +I+ +D FQ
Sbjct: 154 AYVCI-SYCLNATNVHSPSKGTHSFLYGKIVLLAAIDQLISSRERDKFQLIHDENFSARS 212
Query: 129 --FLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQV 186
F L+ET AQ+WY + +Y+ + E +G+D E ++FL +L F T G YS ++D +
Sbjct: 213 SEFKLMETNAQLWYIMREYISSAEERGVDPTELISFLLELSFHTLGEAYSLNTLTDVQRN 272
Query: 187 FLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSN 246
++ L E GLV L+Q GF++VETNFR+YAY+ S
Sbjct: 273 AIRDLAELGLV-----------------------KLQQ----GFVVVETNFRMYAYSTSR 305
Query: 247 LKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPP 306
L ++ LF + Y+ PNL VG +T++S+ A +GITA QII FL+ +A P V+ +P
Sbjct: 306 LHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIPA 365
Query: 307 V---VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVV 363
V V+DQIRLWE +R R+ + LY F S+ F++ DYA+D G L W++ KK ++
Sbjct: 366 VPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRLI 425
Query: 364 VTKQGHSDIKKFWVNHQ 380
V + HS++ + V +
Sbjct: 426 VRVEFHSEMHDYSVQER 442
>gi|66800623|ref|XP_629237.1| general transcription factor IIH, polypeptide 4 [Dictyostelium
discoideum AX4]
gi|74850716|sp|Q54C29.1|TF2H4_DICDI RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=TFIIH basal transcription factor complex subunit 4
gi|60462610|gb|EAL60813.1| general transcription factor IIH, polypeptide 4 [Dictyostelium
discoideum AX4]
Length = 483
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 114/335 (34%), Positives = 200/335 (59%), Gaps = 21/335 (6%)
Query: 62 WSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS--QQTEGISADAVRT-LFNEGAE 118
+S ++ ++ + K +D L+ Y+ +WE +L F+ Q ++ IS + + L + +
Sbjct: 143 FSNNSSIKDNHKPPSIDDLDSYSKSQWEKVLYFLSDDTVQPSKLISELLLSSNLTKQEGD 202
Query: 119 NPVITKDGFQFLLLETPAQVWYFILKYLETVESK----GLDLVECLTFLFQLKFSTFGTD 174
IT +GF+FLL + Q+W ++ YL+ +E K + L+FLF+L F G
Sbjct: 203 GLSITSEGFKFLLKDVYTQIWTLLIVYLDDLEKKKGKGSGSRNDLLSFLFRLSFLNLGRG 262
Query: 175 YSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQI--------- 225
Y +S++ + +L L++FGL+Y R + FYPT+L +++ T TL I
Sbjct: 263 YLVSELSEQQKEYLFALKQFGLIYMRTDSSILFYPTRLIISLTTGKTLSLIQSISSERTQ 322
Query: 226 --REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGI 283
+E G++++ETN+R+YAYT S+L+++L+ LF +++YR PNL VG++TR+S+R AL GI
Sbjct: 323 TQKEQGYIVLETNYRLYAYTSSSLQISLLSLFVKMLYRLPNLAVGIITRESIRTALIHGI 382
Query: 284 TAAQIIGFLRLHALPSVSC---PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFE 340
TA QII F+R ++ P+ + P+P VV++QI LWE ER R+T ++ VLY+ F + +
Sbjct: 383 TADQIIDFVRHNSHPNAANSGQPIPDVVAEQILLWEAERNRITYTKSVLYNSFPTNDCYI 442
Query: 341 KLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+A++ W ++ +T+VV ++G+ I+ F
Sbjct: 443 ATLKFAKEQDYYIWSHDPLKTLVVKEEGNDPIRNF 477
>gi|358394641|gb|EHK44034.1| hypothetical protein TRIATDRAFT_222970 [Trichoderma atroviride IMI
206040]
Length = 491
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 143/455 (31%), Positives = 222/455 (48%), Gaps = 91/455 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLS----------------------KTHAKEHKE 39
L +AK +V+R+LF+ P+ + W+ +KE +
Sbjct: 39 LPHLAKTFVMRMLFMPHPMTLNDLDVWVKPEAKRKKDQSLSILRGLHIVQISAPSKEKPQ 98
Query: 40 ATDTLTELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
++ +R+ ++AL GGG P S+ E+D L FL++YA ++W+ +L
Sbjct: 99 EIQLMSNFKRSLRLALTGGGNHNSFGVPSSLLIPPEID-----LPFLDRYARKKWDDVLH 153
Query: 94 FMV----------GSQQTEG-----ISADAVRTLFNEGAENPVITKDGFQFLLLETPAQV 138
F+V GS +G I+ V N G IT+ GF FLL E AQV
Sbjct: 154 FVVSSVGYKSVGDGSGPNKGVKELLIAGRLVDRRPNGGLG---ITQAGFTFLLQEANAQV 210
Query: 139 WYFILKYLETV---ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFG 195
W +L +L+ + + GLD V+ L+FLF L G Y T ++++ + L L +FG
Sbjct: 211 WTLLLLWLDVLGNNKGSGLDPVDMLSFLFMLASLELGRAYDTNALTEERRNMLPSLVDFG 270
Query: 196 LVYQRKRKAGRFYPTKLALNMATRG-TLKQIRE--------------------------P 228
L+Y + K F+PT+LA + + G +L+ I E
Sbjct: 271 LIYIPQHKRSMFFPTRLATTLTSGGNSLRTISEGVTAATQSAQTSQQSLGPLGGGGEQQS 330
Query: 229 GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
G +++ETN+R+YAYT S L++A++ LF +L RFP++ G L+R S+R A+ GITA QI
Sbjct: 331 GSVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQAINFGITADQI 390
Query: 289 IGFLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFE 340
I +L HA + + P LPP V DQIRLW+ E ER+ T+ G L+ F D+
Sbjct: 391 ISYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFTDDKDYL 450
Query: 341 KLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+++++GVL W+N+ R K H IK F
Sbjct: 451 DTARFSEEIGVLVWRNDHARMFFANK--HEQIKDF 483
>gi|322697444|gb|EFY89224.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Metarhizium acridum CQMa 102]
Length = 541
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 133/402 (33%), Positives = 201/402 (50%), Gaps = 82/402 (20%)
Query: 34 AKEHKEATDTLTELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALER 87
+KE + T + + ++AL GGG+ P ++ E+D+ FL++YA +R
Sbjct: 122 SKEKPQEMQLTTNFKNSLRLALTGGGEHNSFGVPSTLLVPPEIDTA-----FLDRYARKR 176
Query: 88 WECILRFMV---GSQQTEGISA--DAVRTLFNEG---AENPV----ITKDGFQFLLLETP 135
WE IL F+V G + T +S +V+ L G P IT+ GF FLL E
Sbjct: 177 WEDILHFVVSSVGYKSTGDVSGPNKSVKELLIAGRLVDRRPNGSIGITQAGFTFLLQEPN 236
Query: 136 AQVWYFILKYLETVESK---GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLR 192
AQVW +L +LE E+ GL+ V+ L+FLF L G Y T ++++ + L L
Sbjct: 237 AQVWTLLLLWLEASETNKAAGLETVDMLSFLFVLASLELGRAYDTNALTEQRRNMLPSLL 296
Query: 193 EFGLVYQRKRKAGRFYPTKLALNMATRG-TLKQIRE------------------------ 227
+FGL+Y + K F+PT+LA + + G +L+ I E
Sbjct: 297 DFGLIYIPQHKRSMFFPTRLATALTSGGGSLRTISEGVAAATAAAGPSSHEGVAAATAAA 356
Query: 228 -----------------------PGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
G +IVETN+R+YAYT S L++A++ LFC+L RFP+
Sbjct: 357 GPSSQSGSGSGGAGGPLGPSADQKGSVIVETNYRIYAYTQSTLQIAVLALFCKLNMRFPD 416
Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------SCP-LPPVVSDQIRLWE 316
+ G +TR S+ A+ GITA QII +L HA + + P LPP V DQIRLW+
Sbjct: 417 MVAGRVTRTSIHQAINFGITADQIISYLAAHAHEQMHRTAALANKPILPPTVVDQIRLWQ 476
Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
E ER+ T+ G L+ F +F +A+++GVL W+++K
Sbjct: 477 LENERMKTTGGFLFRDFEDHKEFLDTARFAEEIGVLVWRSDK 518
>gi|392594381|gb|EIW83705.1| transcription factor Tfb2 [Coniophora puteana RWD-64-598 SS2]
Length = 393
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 124/338 (36%), Positives = 180/338 (53%), Gaps = 51/338 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR----RNAKVAL-- 55
L+ ++K V+ LL+++ PV +A+W+ K + EA +TL L AK+AL
Sbjct: 50 LAPLSKQLVMNLLWLDSPVLAGTMAAWVVLDGKKLYDEALNTLMRLHILQSSGAKLALNP 109
Query: 56 ----------LGGGKPWSMSAKLEVDSK--ARDLDFLNQYALERWECILRFMVGSQQ--- 100
G S + D K A ++ L+ YA++RWE IL +MV S Q
Sbjct: 110 TFKASMRQAITCSGTTGSFGVPAQRDDKRDAPSVETLDGYAVQRWETILHYMVSSGQGQY 169
Query: 101 ----TEGISADAVRT--LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
T+G+ R+ + + IT GFQFLL Q+W +L+YL VE + +
Sbjct: 170 PTKPTQGVLYLLQRSGLMASYHGSTLQITSSGFQFLLYSPRDQLWDLLLQYLHMVEERQM 229
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
DLVE L+F F L G +YSTE +S+ + L+ LR++GLV+QRK + RF PT+LA
Sbjct: 230 DLVEVLSFFFMLSTMELGREYSTEPLSETQKAMLEDLRDYGLVWQRKASSKRFSPTRLAT 289
Query: 215 NMATRG----TLKQIREP--------------------GFLIVETNFRVYAYTDSNLKVA 250
+ + T R P GF+++ETN+RVYAYTD+ L++A
Sbjct: 290 TLTSVSPSLPTAGGSRNPGSAPGASSVANSTGINTNDHGFIVLETNYRVYAYTDNPLQIA 349
Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
++ LF L YRFPNL VG+LTR+SVR AL +GI+A QI
Sbjct: 350 VLNLFITLKYRFPNLVVGMLTRESVRRALGNGISAEQI 387
>gi|90399251|emb|CAJ86205.1| B0213E10.4 [Oryza sativa Indica Group]
Length = 470
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 102/261 (39%), Positives = 156/261 (59%), Gaps = 30/261 (11%)
Query: 118 ENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYST 177
E P +T++GFQFLL+ET AQ+WY + +Y+ + E +G+D E ++FL +L F T G YS
Sbjct: 232 EAPRLTENGFQFLLMETNAQLWYIMREYISSAEERGVDPTELISFLLELSFHTLGEAYSL 291
Query: 178 EGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNF 237
++D + ++ L E GLV L+Q GF++VETNF
Sbjct: 292 NTLTDVQRNAIRDLAELGLV-----------------------KLQQ----GFVVVETNF 324
Query: 238 RVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
R+YAY+ S L ++ LF + Y+ PNL VG +T++S+ A +GITA QII FL+ +A
Sbjct: 325 RMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAH 384
Query: 298 PSVSCPLPPV---VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
P V+ +P V V+DQIRLWE +R R+ + LY F S+ F++ DYA+D G L W
Sbjct: 385 PRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLW 444
Query: 355 QNEKKRTVVVTKQGHSDIKKF 375
++ KK ++V + HS++++F
Sbjct: 445 EDAKKMRLIVRVEFHSEMREF 465
>gi|378729002|gb|EHY55461.1| transcription initiation factor TFIIH subunit H4 [Exophiala
dermatitidis NIH/UT8656]
Length = 508
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 141/455 (30%), Positives = 209/455 (45%), Gaps = 91/455 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
L+ +AK +V+ L+++ PVP + ++ T E + A D L
Sbjct: 38 LTSLAKSFVMMLVYMPAPVPVKQLELFVKDTSRGEREYAIDLLHRYHIFKDVTYNGAKAY 97
Query: 45 ---TELRRNAKVALLGGGKPWS---MSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
+ ++ + AL G G S ++ + D K + L++YA +RWE IL +MVGS
Sbjct: 98 ALTPDFAKSLRRALTGAGDSRSFGQVATNVPEDQKV-TIAQLDEYARQRWEGILGYMVGS 156
Query: 99 -----------QQTEGISADAVRTLFNEG-----------AENPVITKDGFQFLLLETPA 136
Q + V L G + ITK+GF F+L +
Sbjct: 157 SSILLESGTTENQASIQPSPGVIELLKAGHLIELTGTYSRGQAAKITKEGFAFVLQDINT 216
Query: 137 QVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGL 196
Q+W + Y++ E +D V+ L+FLF + G YST + + L L FG+
Sbjct: 217 QIWALLFLYVDNAEVFEMDKVDVLSFLFFVSSLELGLAYSTAPLDETQSRCLSDLVSFGI 276
Query: 197 VYQRKRKAGR----FYPTKLAL-------------NMATRGTLKQIREPG---------- 229
VYQ + G FYPT+LA N +L
Sbjct: 277 VYQPLLEDGTPVDYFYPTRLATTLTSDSSTTLSATNTTLASSLSTKSAHSSSSSSAGAGK 336
Query: 230 -FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
F+IVETN+R+YAYT S L++AL+ LF L R PNL G +++ SV+ A+++GITA QI
Sbjct: 337 GFIIVETNYRLYAYTSSPLQIALLSLFVNLRSRHPNLVTGKMSKSSVQRAIQAGITADQI 396
Query: 289 IGFLRLHALPS-----------------VSCPLPPVVSDQIRLWEGERERLTTSEGVLYS 331
I +L HA P V LP + DQI LW+ ER+R+TT+ G L
Sbjct: 397 ISYLTSHAHPQMRRHAQAEQARNADPNRVVPILPATILDQIHLWQLERDRMTTTPGFLLK 456
Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
F +QAD+E YA ++GVL W+N+KKR V +
Sbjct: 457 DFPNQADYEAPCRYADEIGVLVWKNDKKRMFFVNR 491
>gi|116201905|ref|XP_001226764.1| hypothetical protein CHGG_08837 [Chaetomium globosum CBS 148.51]
gi|88177355|gb|EAQ84823.1| hypothetical protein CHGG_08837 [Chaetomium globosum CBS 148.51]
Length = 503
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 145/449 (32%), Positives = 215/449 (47%), Gaps = 84/449 (18%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASW---------------LSKTH-------AKEHKE 39
L +AK +V LL++ P+ A + W L H K+ +
Sbjct: 40 LPPLAKVFVQALLYMPSPLLLAELDVWVRAEGKMQRDRALSILRSLHIIQITPPGKDKPQ 99
Query: 40 ATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGS 98
T + + ++AL GGG S + R D+ L+ YA ++WE IL ++V S
Sbjct: 100 EMQLTTNFKNSLRLALEGGGSHNSFGVPSTLPVAPRIDIAHLDGYARKKWEDILHYVVNS 159
Query: 99 --QQTEGISA-------DAVRTLF---------NEGAENPVITKDGFQFLLLETPAQVWY 140
Q+EG +V+ L ++ IT+ GF FLL E AQVW
Sbjct: 160 VPAQSEGGGGVSGGGPKTSVKELLLIGRLVERRHDTQAGVGITQAGFTFLLQEANAQVWT 219
Query: 141 FILKYLETV-----ESKG---LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLR 192
+L +LE E+KG D ++ L+FLF L G Y+T +++ + L HL
Sbjct: 220 LLLLWLEAADRAKGEAKGAANTDSIDMLSFLFMLASLELGRAYNTGALTETRRNMLPHLV 279
Query: 193 EFGLVYQRKRKAGRFYPTKLA----------------LNMATR---------GTLKQIR- 226
+FGL++ + +++PT+LA N AT GT +
Sbjct: 280 DFGLIFIPREDTRQYFPTRLATTLTSSASSLRSVSSGFNAATANNPGDTSSLGTTPETSA 339
Query: 227 -EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITA 285
G +I+ETN+R+YAYT S L++A++GLF +L RF + G LTRDS+R A+ GITA
Sbjct: 340 ASKGSVIIETNYRLYAYTSSPLQIAVLGLFTQLNMRFAGMATGRLTRDSIRRAIGFGITA 399
Query: 286 AQIIGFLRLH-------ALPSVSCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQA 337
QII +L H A + P LPP V DQIRLW+ E ER+ TS G L+ F S
Sbjct: 400 DQIISYLASHAHDQMVRAAAATGRPVLPPTVVDQIRLWQLENERMRTSAGFLFKDFDSLE 459
Query: 338 DFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
++ L YA ++GVL W+++KKR +K
Sbjct: 460 EYVSLSSYADEVGVLVWRSDKKRMFYASK 488
>gi|322711210|gb|EFZ02784.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Metarhizium anisopliae ARSEF 23]
Length = 540
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 132/398 (33%), Positives = 201/398 (50%), Gaps = 78/398 (19%)
Query: 34 AKEHKEATDTLTELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALER 87
+KE + T + + ++AL GGG+ P ++ E+D+ FL++YA +R
Sbjct: 125 SKEKPQEMQLTTNFKNSLRLALTGGGEHNSFGVPSTLLVPPEIDTP-----FLDRYARKR 179
Query: 88 WECILRFMV---GSQQTEGISA--DAVRTLFNEG---AENPV----ITKDGFQFLLLETP 135
WE IL F+V G + T +S +V+ L G P IT+ GF FLL E
Sbjct: 180 WEDILHFVVSSVGYKSTGDVSGPNKSVKELLIAGRLVDRRPNGTIGITQAGFTFLLQEPN 239
Query: 136 AQVWYFILKYLETVESK---GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLR 192
AQVW +L +LE E+ GL+ V+ L+FLF L G Y T ++++ + L L
Sbjct: 240 AQVWTLLLLWLEASETNKAAGLETVDMLSFLFVLASLELGRAYDTNALTEQRKNMLPSLL 299
Query: 193 EFGLVYQRKRKAGRFYPTKLALNMATRGT-LKQIRE------------------------ 227
+FGL+Y + K F+PT+LA + + G+ L+ I E
Sbjct: 300 DFGLIYIPQHKRSMFFPTRLATTLTSGGSSLRTISEGVAAATAAATTTAPAGGPSSQSGS 359
Query: 228 ---------------PGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
G +IVETN+R+YAYT S L++A++ LFC+L RFP++ G +TR
Sbjct: 360 GSGGTGGPLGPSADQKGSVIVETNYRIYAYTQSTLQIAVLALFCKLNMRFPDMVAGRVTR 419
Query: 273 DSVRAALRSGITAAQIIGFLRLHALPSV-------SCP-LPPVVSDQIRL----WEGERE 320
S+ A+ GI+A QII +L HA + + P LPP V DQIRL W+ E E
Sbjct: 420 TSIHQAINFGISADQIISYLAAHAHEQMHRSAALANKPILPPTVVDQIRLWQLEWQLENE 479
Query: 321 RLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
R+ T+ G L+ F +F +A+++GVL W+++K
Sbjct: 480 RMKTTGGFLFRDFEDHKEFLDTARFAEEIGVLVWRSDK 517
>gi|400597688|gb|EJP65418.1| RNA polymerase II transcription factor B subunit 2 [Beauveria
bassiana ARSEF 2860]
Length = 507
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 136/452 (30%), Positives = 210/452 (46%), Gaps = 103/452 (22%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L ++AK +V+R+L++ P+ + SW+ ++ + TL L
Sbjct: 41 LPQLAKTFVMRMLYMPNPMSLNDLDSWVKPEAKRQKDQMLSTLRSLHIVQISAPSKDKTQ 100
Query: 48 --------RRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
R + ++AL GGG P +++ E+D L FL++YA RW+ IL
Sbjct: 101 EVQLTANFRTSMRLALTGGGNHGSFGVPSTLAVPPEID-----LPFLDRYARRRWDEILH 155
Query: 94 FMVGSQQTEGISADAVRTLFNEGAENPV-------------------ITKDGFQFLLLET 134
F+V S G +G V IT+ GF FLL E
Sbjct: 156 FVVSS---VGFKNAGGGGAAADGPNKSVRDILVAGRLVDRRPNGSIGITQAGFTFLLQEA 212
Query: 135 PAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREF 194
AQVW +L +L+ +++ V+ L+FLF L G Y T +++ + L L +F
Sbjct: 213 NAQVWTLLLLWLDALDADKSAAVDMLSFLFVLASLELGRAYDTNALTEHRRDMLPALADF 272
Query: 195 GLVYQRKRKAGRFYPTKLA--LNMATRGTLKQIREP------------------------ 228
GLVY +K F+PT+LA L +L+ I E
Sbjct: 273 GLVYIPPQKRSMFFPTRLATTLTSGASSSLRSISEGVVAATSAALSASSSSSTHQDQNHQ 332
Query: 229 --------------GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDS 274
G +++ETN+R+YAYT S L++A++ LF L RFP++ G +TR S
Sbjct: 333 SRAGPLGAGAGDQIGSIVIETNYRLYAYTQSTLQIAVLALFSRLTMRFPDMVAGRITRTS 392
Query: 275 VRAALRSGITAAQIIGFLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSE 326
+R A++ GITA QII +L HA + + P LPP V DQIRLW+ E ER+ T+
Sbjct: 393 IRQAIQFGITADQIIAYLAAHAHEQMHRSAALAARPVLPPTVVDQIRLWQLENERMKTTG 452
Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
G L+ F Q +++ R +A ++GVL W ++K
Sbjct: 453 GFLFRDFDDQKEYQDTRRFADEIGVLVWSSDK 484
>gi|367040709|ref|XP_003650735.1| hypothetical protein THITE_2110518 [Thielavia terrestris NRRL 8126]
gi|346997996|gb|AEO64399.1| hypothetical protein THITE_2110518 [Thielavia terrestris NRRL 8126]
Length = 513
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 138/461 (29%), Positives = 218/461 (47%), Gaps = 98/461 (21%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
L +AK +V LL++ P+ + + W+ + AK H++ ++
Sbjct: 40 LPPLAKIFVQALLYMPSPLLLSELDVWV-RPEAKMHRDRALSILRSLHIVQISPPGKDKP 98
Query: 45 ------TELRRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV 96
T + + ++AL GGG + + + L VD + D+ L+QYA ++WE IL ++V
Sbjct: 99 QEIQLTTNFKNSLRLALEGGGTHNSFGVPSTLPVDPRI-DIAHLDQYARKKWEDILHYVV 157
Query: 97 GSQQTEG----------ISADAVRTLFNEG---AENP------VITKDGFQFLLLETPAQ 137
S +V+ L G P IT+ GF FLL E AQ
Sbjct: 158 NSVPVHAEGGGGGGSGGGPKASVKDLLLAGRLVERRPDTRGGVAITQAGFTFLLQEANAQ 217
Query: 138 VWYFILKYLETVE-----------------SKG-LDLVECLTFLFQLKFSTFGTDYSTEG 179
VW +L +LE + S+G D ++ L+FLF L G Y T+
Sbjct: 218 VWTLLLLWLEAADHSKAAAAAAGDHAKGAASQGKPDSIDMLSFLFMLASLELGRAYDTDS 277
Query: 180 MSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGF--------- 230
+++ + L L +FGL+Y + +++PT+LA + + + + GF
Sbjct: 278 LTETRRNMLPALVDFGLIYIPREDTRQYFPTRLATTLTSSASALRSVSSGFSAAAANNPG 337
Query: 231 -----------------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
LI+ETN+R+YAYT S L++A++ LF +L RF + G LTRD
Sbjct: 338 DAASLGMTPDTTPTKGSLIIETNYRLYAYTSSPLQIAVLALFTQLNMRFAGMVTGRLTRD 397
Query: 274 SVRAALRSGITAAQIIGFLRLH-------ALPSVSCP-LPPVVSDQIRLWEGERERLTTS 325
S+R A+ GITA QII +L H A + P LPP V DQIRLW+ E ER+ T+
Sbjct: 398 SIRRAISFGITADQIISYLASHAHEQMTRAAAAAGRPVLPPTVVDQIRLWQLENERMRTT 457
Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
G L+ F S ++ L YA+++GVL W++++KR +K
Sbjct: 458 AGFLFKDFDSPEEYVSLSGYAEEIGVLVWKSDRKRVFFASK 498
>gi|340939354|gb|EGS19976.1| RNA polymerase II transcription factor B subunit 2-like protein
[Chaetomium thermophilum var. thermophilum DSM 1495]
Length = 514
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 140/459 (30%), Positives = 216/459 (47%), Gaps = 95/459 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWL---SKTH-------------------AKEHKE 39
L +AK +V LL++ QP+ + + W+ +K H K+ +
Sbjct: 42 LPPLAKVFVQALLYMPQPMLLSDLDVWVRPEAKMHRDRALSILRSLHIVQITPPGKDRPQ 101
Query: 40 ATDTLTELRRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVG 97
T R + ++AL GG + + + L VD + D+ FL+ YA ++WE IL ++V
Sbjct: 102 EVQLTTNFRNSLRLALEGGAAHNSFGVPSSLPVDPRI-DIAFLDNYARKKWEDILHYVVS 160
Query: 98 SQQTEGISAD----------AVRTLFNEG---AENPV------ITKDGFQFLLLETPAQV 138
S G + +V+ L G P IT+ GF FLL E AQV
Sbjct: 161 SVPVHGDAGPGGMGGGGPKASVKDLLLAGRLVERRPDTKTGIGITQAGFTFLLQEANAQV 220
Query: 139 WYFILKYLET-------------VESKGL-----DLVECLTFLFQLKFSTFGTDYSTEGM 180
W +L +LE V+ K D +E L+FLF L G Y T+ +
Sbjct: 221 WTLLLLWLEAADQAKAAAAAASGVDPKNAPPTKPDSIEMLSFLFMLASLELGRAYDTDAL 280
Query: 181 SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGF---------- 230
S+ + L L +FGL+Y + +++PT+LA + + + + GF
Sbjct: 281 SETRRNMLPALVDFGLIYIPREDTRQYFPTRLATTLTSSASALRSVSSGFTAATNNTAND 340
Query: 231 ---------------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSV 275
+I+ETN+R+YAYT S L++A++ LF L RF + G LTR+S+
Sbjct: 341 ASSLGGADPSAHKGSIIIETNYRLYAYTSSPLQIAVLALFTHLNMRFAGMVTGRLTRESI 400
Query: 276 RAALRSGITAAQIIGFLRLH-------ALPSVSCP-LPPVVSDQIRLWEGERERLTTSEG 327
R A+ GITA QII +L H A + P LPP V DQIRLW+ E ER+ TS G
Sbjct: 401 RRAISFGITADQIISYLASHAHEQMVRAAAAAGRPVLPPTVVDQIRLWQLENERMRTSPG 460
Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
L+ F + ++ L YA+++GVL W++++KR +K
Sbjct: 461 FLFKDFENVEEYMALAGYAEEIGVLVWRSDRKRMFFASK 499
>gi|367030175|ref|XP_003664371.1| hypothetical protein MYCTH_2307126 [Myceliophthora thermophila ATCC
42464]
gi|347011641|gb|AEO59126.1| hypothetical protein MYCTH_2307126 [Myceliophthora thermophila ATCC
42464]
Length = 512
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 128/405 (31%), Positives = 196/405 (48%), Gaps = 73/405 (18%)
Query: 34 AKEHKEATDTLTELRRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECI 91
K+ + T R + ++AL GGG + + + L VD + D+ +L+ YA +WE I
Sbjct: 94 GKDKPQEIQLTTNFRNSLRLALEGGGTHNSFGVPSTLPVDPRI-DIAYLDSYARRKWEDI 152
Query: 92 LRFMVGSQQTEGISAD---------AVRTLF---------NEGAENPVITKDGFQFLLLE 133
L ++V S +G +V+ L N+ IT+ GF FLL E
Sbjct: 153 LHYVVNSVPAQGEGGGGGSVGGPKASVKDLLLIGRLVERRNDTKAGVGITQAGFTFLLQE 212
Query: 134 TPAQVWYFILKYLETVE-----------------SKGLDLVECLTFLFQLKFSTFGTDYS 176
AQVW +L +LE + + D ++ L+FLF L G Y
Sbjct: 213 ANAQVWTLLLLWLEAADRAKGAAEQGKGGGAAAAAAKADSIDMLSFLFMLASLELGRAYD 272
Query: 177 TEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGF------ 230
T+ +S+ + L L +FGL+Y + +++PT+LA + + + + GF
Sbjct: 273 TDALSETRRNMLPALVDFGLIYIPREDTRQYFPTRLATTLTSSASSLRSVSSGFSAAAAN 332
Query: 231 ---------------------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
+I+ETN+R+YAYT S L++A++ LF +L RF + G
Sbjct: 333 NPGDASSLGTTPETSAASKGSVIIETNYRLYAYTSSPLQIAVLALFTQLNMRFAGMATGR 392
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLH-------ALPSVSCP-LPPVVSDQIRLWEGERER 321
LTRDS+R A+ GITA QII +L H A + P LPP V DQIRLW+ E ER
Sbjct: 393 LTRDSIRRAIGFGITADQIISYLASHAHEQMVRAAAATGRPVLPPTVVDQIRLWQLENER 452
Query: 322 LTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
+ TS G L+ F S ++ L YA+++GVL W+++KKR +K
Sbjct: 453 MRTSAGFLFKDFDSLDEYLSLSAYAEEIGVLVWKSDKKRMFYASK 497
>gi|440800489|gb|ELR21525.1| transcription factor tfb2 subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 465
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 181/303 (59%), Gaps = 13/303 (4%)
Query: 88 WECILRFMVGSQQTEGISADAVRTLFNEGA--------ENPVITKDGFQFLLLETPAQVW 139
W+ I F++GS + S D V +L G ++ I GF FLL + QVW
Sbjct: 152 WDTIFLFIMGSTVIQPPS-DRVVSLLTRGEFMVVHEEDQSIRIADKGFPFLLKDLRTQVW 210
Query: 140 YFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQ 199
+L YL +++ + ++ + L+FLF+L F T G Y + ++ LQ L++ G++Y+
Sbjct: 211 TLLLLYLRSLQEEKANVHDVLSFLFRLSFLTVGEGYQMDDLAFSESGLLQDLQDLGIIYR 270
Query: 200 RKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
+ + + YPT+LA+ +++ K+ +E G++IV T++R+YAYT S +K+ L+ LF ++
Sbjct: 271 KHKDSKWLYPTQLAIGLSSTEAAKRDQE-GWIIVGTDYRIYAYTSSPVKLLLLSLFTQIE 329
Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCP-LPPVVSDQIRLWE 316
Y+ PN+ +G+L R+++R A++ GI+A QI+ FL +A P + + P +P ++DQ+RLWE
Sbjct: 330 YQLPNMVMGILLRENIRQAVQVGISANQILQFLETNAHPQMKQNTPIIPESIADQLRLWE 389
Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
E RL+ S G Y F S A F+K YA+D+G L + + KR + V++ GH ++++
Sbjct: 390 AEDRRLSLSSGYFYDDFASLAAFKKAEKYARDVGALIYSDATKRFLFVSEPGHQLLRRYV 449
Query: 377 VNH 379
H
Sbjct: 450 KQH 452
>gi|440791255|gb|ELR12502.1| TFIIH basal transcription factor complex subunit 2, putative
[Acanthamoeba castellanii str. Neff]
Length = 304
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 97/261 (37%), Positives = 165/261 (63%), Gaps = 4/261 (1%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS 181
I GF FLL + QVW +L YL +++ + ++ + L+FLF+L F T G Y + ++
Sbjct: 34 IADKGFPFLLKDLRTQVWILLLLYLRSLQEEKANVHDVLSFLFRLSFLTVGEGYQMDDLA 93
Query: 182 DKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYA 241
LQ L++ G++Y++ R + YPT+LA+ +++ K+ +E G++IV T++R+YA
Sbjct: 94 FSESGLLQDLQDLGIIYRKHRDSKWLYPTQLAIGLSSTEAAKRDQE-GWIIVGTDYRIYA 152
Query: 242 YTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV- 300
YT S +K+ L+ LF ++ Y+ PN+ +G+L R+++R A++ GI+A QI+ FL +A P +
Sbjct: 153 YTSSPVKLLLLSLFTQIEYQLPNMVMGILLRENIRQAVQVGISANQILQFLETNAHPQMK 212
Query: 301 -SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
+ P +P ++DQ+RLWE E RL+ S G Y F S A F+K YA+D+G L + +
Sbjct: 213 QNTPIIPESIADQLRLWEAEDRRLSLSPGYFYDDFASLAAFKKAEKYARDVGALLYSDAA 272
Query: 359 KRTVVVTKQGHSDIKKFWVNH 379
KR + VT+QGH ++++ H
Sbjct: 273 KRFLFVTEQGHQLLRRYVKQH 293
>gi|320587960|gb|EFX00435.1| tfiih and nucleotide excision repair factor 3 complexes subunit
[Grosmannia clavigera kw1407]
Length = 518
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 143/480 (29%), Positives = 223/480 (46%), Gaps = 100/480 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
L +AK +V LL+++ P+ + + + +E +A L L
Sbjct: 41 LPSLAKSFVRMLLYMDGPLLLDTLDTMVVPEAKRERDQALSILRALHIVQITTAMRDRPQ 100
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGS 98
+ + ++AL GGG S E A D FL+++A +WE IL ++V S
Sbjct: 101 EIMLTANFKNSFRLALEGGGSQNSFGVPSEKPVSAETDTAFLDRFARRKWEDILHYVVNS 160
Query: 99 --------QQTEGISADAVRTLFNEG---AENPV------ITKDGFQFLLLETPAQVWYF 141
++G + V+ L G P IT+ GF FLL AQVW
Sbjct: 161 VGLPSGPGHDSQG-PKNTVKELLLAGHLVESRPGRPGGVGITQTGFTFLLQAANAQVWTL 219
Query: 142 ILKYLETVESK-----GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGL 196
+L++LE V+ + +D V+ L+FLF L G Y T +++ + L L +FGL
Sbjct: 220 LLQWLEAVDQQRQAGGAVDSVDMLSFLFMLSTLELGRAYDTNALTEARRNMLPSLMDFGL 279
Query: 197 VY-QRKRKAGRFYPTKLALNMATRGT--LKQIREP------------------------- 228
+Y +A +++PT+LA + + G L+ +
Sbjct: 280 IYIPPHDRASQYFPTRLATTLTSSGAAALRSLSTGFAAASANSTAAAGLPGTGTGSNNNH 339
Query: 229 ----------------GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
G +I+ETNFR+YAYT S L++A++ LF +L RFP + G LTR
Sbjct: 340 VGAASTANSAATTEGRGSVILETNFRLYAYTASPLQIAVLALFSKLSQRFPGMVAGKLTR 399
Query: 273 DSVRAALRSGITAAQIIGFL------RLHALPSVS--CPLPPVVSDQIRLWEGERERLTT 324
DSVR A+ GIT+ QII +L ++H + + LPP V DQIRLWE E ER+ +
Sbjct: 400 DSVRRAISYGITSDQIISYLAANAHEQMHKYAAANHRPVLPPTVVDQIRLWELENERMKS 459
Query: 325 SEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQ 384
+G L+ F S +++ L YA ++GVL ++N+KKR VTK H + + +K Q
Sbjct: 460 HKGFLFKDFDSAKEYDMLAQYADEIGVLVYRNDKKRHFFVTK--HEQLVTYLKARKKADQ 517
>gi|85857462|gb|ABC86267.1| RE43632p [Drosophila melanogaster]
Length = 280
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 94/204 (46%), Positives = 127/204 (62%), Gaps = 32/204 (15%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+ ++IR+LFV+QPVPQAV+ SW ++ AKE EAT LT L
Sbjct: 70 ELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLS 129
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
+++ + LLGGGKPW M+ L+ DSK RD+ FL+ YA+ RW C+L +MVG+
Sbjct: 130 AWELSPTFKKSVRQVLLGGGKPWPMTNTLDKDSKPRDIAFLDTYAMSRWRCVLHYMVGTG 189
Query: 99 ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
E IS DAVR L + + + IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 190 NRNGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDT 249
Query: 149 VESKGLDLVECLTFLFQLKFSTFG 172
E +G+ L ECL+ LFQL FST+
Sbjct: 250 CEERGISLPECLSMLFQLSFSTWA 273
>gi|310799661|gb|EFQ34554.1| transcription factor tfb2 [Glomerella graminicola M1.001]
Length = 485
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 135/443 (30%), Positives = 215/443 (48%), Gaps = 76/443 (17%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
+ +AK V+ +L++ +P+ + W+ ++ +A TL
Sbjct: 41 IPPLAKTIVMAILYMPKPMLLEDLDVWVKPESRRQKDQAISTLRSLHILQITVPSKERPQ 100
Query: 45 -----TELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
T + + ++AL GGG P S+ A E+ + FL++YA +WE IL
Sbjct: 101 EMQLTTNFKNSLRLALEGGGTHNSFGVPSSLPAPPEIT-----VPFLDRYARRKWEDILH 155
Query: 94 FMVGSQQTEGIS----ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFIL 143
++V + G +V+ L G IT+ GF FLL E AQVW +L
Sbjct: 156 YIVNTVNPGGADLGGPKSSVKNLLVAGQLVRRQGSAVGITQAGFTFLLQEANAQVWTLLL 215
Query: 144 KYLETVE----SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQ 199
+LE + + G++ + L+FLF L G Y T +++ + L L +FGL+Y
Sbjct: 216 LWLEATDHAEDAAGMESTDMLSFLFLLASLELGRPYDTNALTEARRNMLPSLLDFGLIYI 275
Query: 200 RKRKAGRFYPTKLALNMA-------------------TRGTLKQIREPGFLIVETNFRVY 240
K +++PT+LA + + + IR F + ETN+RVY
Sbjct: 276 PSHKPQQYFPTRLATTLTSSSSALSYAHCRDSETFFQSYASHLMIRVRPFRLSETNYRVY 335
Query: 241 AYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
AY + L++A++ LFC+L RF ++ G LTR+S+R A+ GITA QII +L HA +
Sbjct: 336 AYGQTPLQIAVLSLFCKLKLRFADMVSGRLTRNSIRNAVERGITADQIISYLAAHAHEQM 395
Query: 301 -------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
S P LPP V DQIRLW+ E ER+TT+ G L+ F S +++ + YA ++GVL
Sbjct: 396 HRMAAARSRPILPPTVIDQIRLWQLETERMTTTSGFLFRDFDSPREYDDIAGYAAEIGVL 455
Query: 353 TWQNEKKRTVVVTKQGHSDIKKF 375
W+N+K +K H I+ +
Sbjct: 456 VWRNDKLGMFFASK--HEQIRDY 476
>gi|398406290|ref|XP_003854611.1| hypothetical protein MYCGRDRAFT_69317 [Zymoseptoria tritici IPO323]
gi|339474494|gb|EGP89587.1| hypothetical protein MYCGRDRAFT_69317 [Zymoseptoria tritici IPO323]
Length = 493
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 131/430 (30%), Positives = 208/430 (48%), Gaps = 67/430 (15%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKT--HAKEHKEATDTLTELR----------- 48
L +AK V+ +L++ P+ A + +W ++E + + L L+
Sbjct: 39 LPHLAKTVVMSMLYLPGPMRAADLEAWFKSDVQSSQERERSVSVLVRLKVLVEEQAGGGA 98
Query: 49 -----------RNAKVALLG-------GGKPWSMSAKLEVDSKARDLDFLNQYALERWEC 90
+ + AL G G + ++ E D ++ FL+ +A +WE
Sbjct: 99 GLQYRLSEAFAKGLRCALTGAVGKGDAGKGSFGVACSAEKDEDV-NVKFLDDFARRQWEA 157
Query: 91 ILRFMVGSQQTEGISAD-----AVRTLFNEG--------AENPVITKDGFQFLLLETPAQ 137
IL ++VGS + G+S D + L +G + +IT+ GF FLL E AQ
Sbjct: 158 ILYYVVGSANS-GLSGDLEISAGTKRLLQQGNFVQIKSGGKQRLITQAGFTFLLEEVNAQ 216
Query: 138 VWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV 197
VW + YL E +D V+ L+FLF L G YST ++ Q L L++FGL+
Sbjct: 217 VWSLAVVYLRVCEDLQMDPVDVLSFLFTLGSLELGIPYSTSNLTLTQQHMLDDLQDFGLI 276
Query: 198 YQRKRKAGRFYPTKLALNMATRG-------------TLKQIREPGFLIVETNFRVYAYTD 244
Y+ A +YPT+LA + + + E G++I+ETN+R+YAYT
Sbjct: 277 YRSSPTATAYYPTRLATTLTSDAPALPNNTSSTTTTAATESNEKGYIILETNYRLYAYTS 336
Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS----- 299
S L+++++ LF L R+PNL +T+ S+ +A+ IT+ QII +L HA P
Sbjct: 337 SPLQISILSLFSTLHTRYPNLITAKITKSSIASAISQSITSDQIITYLSTHAHPVLRRQA 396
Query: 300 --VSCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQN 356
+S P LPP V DQIRLW+ E ER+TT +G ++ K +YA+ LGVL +
Sbjct: 397 AIMSAPILPPTVVDQIRLWQIEGERMTTWKGFYIRDVGGWEEYVKAVEYAEALGVLRQRF 456
Query: 357 EKKRTVVVTK 366
+ KR+ V +
Sbjct: 457 DAKRSFFVGR 466
>gi|298705308|emb|CBJ48998.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 534
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 129/415 (31%), Positives = 196/415 (47%), Gaps = 61/415 (14%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L ++AK +V+RLLFVE PV + ++ SW+ K + + H + L LR
Sbjct: 66 LPQLAKQHVMRLLFVEGPVGKGMLKSWVKKEYQRVHAASVRKLVSLRVLLPSAGGQEYQL 125
Query: 49 -----RNAKVALLGGGK-PWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
N + AL K PW L D + + + +W+ +L F+VGS + E
Sbjct: 126 NPPFRENLQRALCASDKTPWQGDVPLSADRSPPTVAKIERQMHSQWQDVLYFLVGSDE-E 184
Query: 103 GIS---ADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETV-ES 151
G S A V+ + G PV IT G++F+L E Q W + + +
Sbjct: 185 GRSTPPARVVKFMEESGLMRPVRGRGALRITDKGYEFMLKEAHVQAWMVVHALINGYGRT 244
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
+ E L FLFQL + G Y ++ + Q E GL+++RK K+ RFYPT
Sbjct: 245 QPGCRDELLAFLFQLSYCKVGDAYPLGALTQTQRDLAQDFVELGLLFKRKAKSTRFYPTS 304
Query: 212 LALNM-------------ATRGTLKQIREPGFL-------------IVETNFRVYAYTDS 245
+A+N+ GT ++ + G L IVETNF+V AYT S
Sbjct: 305 IAVNLIFGSSPSGDAGGAGGGGTQRKPQPAGGLGGGKREDDTSIHIIVETNFQVIAYTRS 364
Query: 246 NLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SC 302
L A++ LF EL PN VG +TR+S+R AL +GI Q++ FL+ HA P V +
Sbjct: 365 KLHFAMLSLFLELRALLPNAIVGAITRESMRKALSTGIKGRQVLDFLKWHAHPVVRRRTP 424
Query: 303 PLPPVVSDQIRLWEGERERLTTSEGVLYS-QFVSQADFEKLRDYAQDLGVLTWQN 356
+P ++DQ+ LWE ER+R+ +GVL + S+ F + ++A L W +
Sbjct: 425 VVPENIADQVLLWERERDRMEHRDGVLVDVSYASRDAFRGMTEFANAKQGLLWSS 479
>gi|76154655|gb|AAX26095.2| SJCHGC05044 protein [Schistosoma japonicum]
Length = 214
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 85/158 (53%), Positives = 114/158 (72%), Gaps = 4/158 (2%)
Query: 229 GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
G++++ETNFR+YAYTDS L+ AL+ LF ++ RFPNL V +TRDSVR AL GITA QI
Sbjct: 58 GYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGITANQI 117
Query: 289 IGFLRLHALPSV--SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDY 345
+ FL +A P + S P LPP ++DQIRLWE ER+R EG LY F DFE +RDY
Sbjct: 118 LSFLTSNAHPDMLLSNPILPPTLTDQIRLWELERDRFVFQEGCLYEHFSRNTDFELVRDY 177
Query: 346 AQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
A+++GVL W+N ++R +VV+K GH D++KFW H++ S
Sbjct: 178 AKNIGVLLWENPERRLMVVSKSGHEDVRKFW-KHKRPS 214
>gi|365758016|gb|EHM99881.1| Tfb2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 419
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 182/371 (49%), Gaps = 73/371 (19%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
L +AK +++ ++F E VP + W++ + ++A ++
Sbjct: 39 LPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQDAIKSMKSLHLLIPDKSSGTLMI 98
Query: 46 ELRRNAKVAL---LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
L K++L L GG+ + S + V+ LD L++Y+ +WE IL FMVG+ +
Sbjct: 99 NLNPTFKISLRNALTGGEVQN-SFGVVVEDNVISLDLLDEYSANKWETILHFMVGTPLAK 157
Query: 103 GISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
S + L + V IT +GFQFLL E +Q+W +L+YL+ +E+ +D
Sbjct: 158 IPSEKVLNLLKHSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKMIETSKMD 217
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
LV+ L F+F L G Y + +S+ ++ LQ +R++GLV+Q+ FYPTKLAL
Sbjct: 218 LVDVLHFIFMLGALEVGKAYKIDALSETQRIMLQDMRDYGLVFQKHLNDSIFYPTKLALM 277
Query: 216 M------------ATRGTLKQIRE----------------------------------PG 229
+ A L+Q RE G
Sbjct: 278 LTSDTKTIISASNAMDSVLRQNREEPSVNEDGANGKSTTDTTASDDLNKAGSKNQDIPDG 337
Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQII 289
LIVETNF++Y+Y++S L++A++ LF L RF N+ +G +TR+S+R AL +GITA QII
Sbjct: 338 SLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRNALTNGITADQII 397
Query: 290 GFLRLHALPSV 300
+L HA P +
Sbjct: 398 AYLETHAHPQM 408
>gi|158302621|ref|XP_561433.2| Anopheles gambiae str. PEST AGAP012880-PA [Anopheles gambiae str.
PEST]
gi|157021114|gb|EAL42401.2| AGAP012880-PA [Anopheles gambiae str. PEST]
Length = 244
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 87/180 (48%), Positives = 119/180 (66%), Gaps = 33/180 (18%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL EIA+ +VIR+LFVEQP+PQAV++SW ++ +AKE+ + LTEL
Sbjct: 65 ELPEIARQFVIRILFVEQPIPQAVVSSWATQVYAKENTSVSQVLTELGVWRSAAYPGGLA 124
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
++N K+ALLGGG+PWSMS L+ D K+RD+DFL+ YA+ RW C+L +MVG+
Sbjct: 125 AWELCPTFKKNLKIALLGGGRPWSMSNALDPDQKSRDIDFLDTYAMSRWRCVLHYMVGAG 184
Query: 99 ----QQTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLE 147
+ EGIS DAVR L +E +PVIT+ GFQFLLL+T AQVW+F+L+YL+
Sbjct: 185 SSKGMEGEGISPDAVRILLHANLMKRDESDGSPVITRQGFQFLLLDTQAQVWHFMLQYLD 244
>gi|401887601|gb|EJT51582.1| hypothetical protein A1Q1_07170 [Trichosporon asahii var. asahii
CBS 2479]
gi|406699646|gb|EKD02845.1| hypothetical protein A1Q2_02789 [Trichosporon asahii var. asahii
CBS 8904]
Length = 451
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 176/356 (49%), Gaps = 41/356 (11%)
Query: 33 HAKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECIL 92
H K DT RNA L G + + K + A + L Y WE IL
Sbjct: 62 HQKFQWSLNDTFKRGMRNALTGL-GTSNSFGVPYKRQARDPAPTAEELIDYGENTWESIL 120
Query: 93 RFMVGS---QQTEGISADAVRTL----------FNEGAENP-----VITKDGFQFLLLET 134
++MV S + G ++V +L F++ NP IT GFQFLL +
Sbjct: 121 KYMVSSGLGARVTGRPRNSVLSLLHASGLMVDQFDDTGRNPNYDRLTITSQGFQFLLEDR 180
Query: 135 PAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVF------L 188
Q+W ++ YL + +G + L+ F L G DYS S Q F L
Sbjct: 181 QTQLWQILMFYLARKQRRGEAAADILSLFFSLGCMQLGQDYSA---SKSFQNFPQGQQCL 237
Query: 189 QHLREFGLVYQRK----RKAGRFYPTKLALNMATRGTL----KQIREPGFLIVETNFRVY 240
L ++G +Y+R+ RK +F+PT LA ++ + T + FLI+ETN+RVY
Sbjct: 238 DDLEQYGFIYRRQEVDGRKTDQFFPTHLATSLCSGDTSVNRSATADDKRFLILETNYRVY 297
Query: 241 AYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
AYT + L++A++ LF + R+PNL VG L R V+AA+ GI+A QII +L HA P +
Sbjct: 298 AYTSNELEIAILNLFVNITIRYPNLIVGHLDRRHVKAAMDKGISAYQIISYLTTHAHPQM 357
Query: 301 SCPLPPV----VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
PP+ V+DQ+ LW+ ER R+ + V++ +F S+ FE RD A+ +G L
Sbjct: 358 YNHPPPLLHPSVTDQLHLWDRERNRVQAQDTVMF-EFYSKELFETCRDEAERMGAL 412
>gi|241654873|ref|XP_002411342.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
gi|215503972|gb|EEC13466.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
Length = 218
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 82/176 (46%), Positives = 115/176 (65%), Gaps = 29/176 (16%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL I++HY++RL+FV+QPVPQAV++SW + + KEH E+ + LT L
Sbjct: 43 ELPIISRHYIMRLMFVDQPVPQAVVSSWNEQKYVKEHLESLEALTALHIWADSSLPGGLP 102
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
R+N ++ALLGGG+PW++ + LE D RD FL++YA+ERWEC+L FMVG
Sbjct: 103 GWSLSGVFRKNIQIALLGGGQPWAVYSTLEKDKHGRDAQFLDRYAMERWECVLHFMVGCH 162
Query: 100 QTEGISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLE 147
EGISADAVR L + G P+IT +GFQFLL++T +QVW+F+L+YL+
Sbjct: 163 TKEGISADAVRILLHAGLMKSEEEEGSAPLITMEGFQFLLMDTASQVWHFVLQYLD 218
>gi|452824378|gb|EME31381.1| transcription initiation factor TFIIH subunit H4 [Galdieria
sulphuraria]
Length = 740
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 107/299 (35%), Positives = 155/299 (51%), Gaps = 34/299 (11%)
Query: 76 DLDFLNQYALERWECILRFMVGSQQTEG-ISADAVRTLFNEGAENPV-----ITKDGFQF 129
D+ FL+QY+ W IL+F+VG + E S + L G V IT GFQF
Sbjct: 141 DVTFLSQYSRHCWNKILQFLVGLEPLESEPSVRIIEALVESGLIEQVEDGFRITTVGFQF 200
Query: 130 LLLETPAQVWYFILKYLETVESK---GLDLV------ECLTFLFQLKFSTFGTDYSTEGM 180
LL +T QVW+ + L + E+ G L E + FLF+L F G Y + +
Sbjct: 201 LLKDTREQVWFILENILFSNETNLKIGTKLQKRVMNEEMVEFLFELSFCASGVAYHWKTL 260
Query: 181 SDKLQVFLQHLREFGLVY----QRKRKAGRFYPTKLALNMATRGTL-------------- 222
S Q+++ +L + GL+Y K FY T L + + TR
Sbjct: 261 SKTQQIYIPYLADLGLIYLHISSSLEKDSYFYVTPLGVTL-TRSICWLGDKNNYLLLEDW 319
Query: 223 KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSG 282
+ +IV+TNFRVY YT+ +++L+ LF + +YR PN+ VGV+TRDS+R AL +G
Sbjct: 320 SNVDNDCRMIVQTNFRVYVYTNCTFQISLLSLFIQFLYRLPNMAVGVITRDSIRTALNNG 379
Query: 283 ITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEK 341
ITA Q+I +L+ H P++ LP + DQIRLWE +R R+TT +L F + FEK
Sbjct: 380 ITAQQMISYLQNHMHPNMKGKLPITIIDQIRLWEAQRFRVTTKHALLLDHFDNMTCFEK 438
>gi|255079788|ref|XP_002503474.1| predicted protein [Micromonas sp. RCC299]
gi|226518741|gb|ACO64732.1| predicted protein [Micromonas sp. RCC299]
Length = 458
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 127/426 (29%), Positives = 194/426 (45%), Gaps = 62/426 (14%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWL--SKTHAKEHKEATDTLTELRRNAKVALLGGG 59
L +AKHYV+RLL+V++ + + + +W+ K H H + L LR V + G
Sbjct: 34 LPSLAKHYVMRLLYVDEGIAREEMDAWVRPGKEHRDRHARSMLALKRLR----VLVPAGD 89
Query: 60 KPWSMSAKL-------------------------EVDSKARDLDFLN----------QYA 84
+ + K EV++ D + ++A
Sbjct: 90 QEYVADGKELVRLNRRFVKGVRAQIETCFAPEGDEVEATVAPDDAMGGKRPSPEKIEEFA 149
Query: 85 LERWECILRFMVGSQQTEGISADAVRTLFNEG--AENPV------ITKDGFQFLLLETPA 136
RWE +L + G + +G D G AE IT+ GF+FLL
Sbjct: 150 KGRWEALLMTLTGRDRRKGPGLDVAALFRGAGLVAEKSNKSNGWGITEKGFRFLLSTARE 209
Query: 137 QVWYFILKYLETVESKG---LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLRE 193
Q+W + +Y+ + G L + F+ +L F G Y + + + + L
Sbjct: 210 QIWTLLTEYVRQYSAPGDRTLVAPAVIGFMLRLTFQAVGQPYRVDDLPSAQRAIAEDLAH 269
Query: 194 FGLVY--QRKRKAGRFYPTKL-ALNMATRGTLKQIREPG-FLIVETNFRVYAYTDSNLKV 249
GL+Y K G + PT+L A G +PG +I ETNFRVYAYT S+++
Sbjct: 270 LGLLYLFAGPGKEGYYVPTQLTAGKDVADGDASLGGDPGGHIIAETNFRVYAYTFSDVEC 329
Query: 250 ALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-----SCPL 304
++ LF YR PNL VG+LTR++V AL +G+ A QII +++ HA P+ +
Sbjct: 330 EILRLFTRPDYRLPNLYVGMLTREAVHEALDTGVAAEQIIKYIKSHAHPNARKTTNGSGI 389
Query: 305 PPVVSDQIRLWEGERERLTTSEGVLYSQFVSQAD-FEKLRDYAQDLGVLTWQNEKKRTVV 363
PP V+DQI LW ER R+ ++E VLY F + D + A D GVL W+N ++ +
Sbjct: 390 PPNVADQIMLWAMERRRVRSAECVLYCDFPTGTDEYAAAVKAASDAGVLLWENREQMKLA 449
Query: 364 VTKQGH 369
V K GH
Sbjct: 450 VAKSGH 455
>gi|321265552|ref|XP_003197492.1| subunit of TFIIH and nucleotide excision repair factor 3 complexes
[Cryptococcus gattii WM276]
gi|317463972|gb|ADV25705.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes,
putative [Cryptococcus gattii WM276]
Length = 481
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 119/380 (31%), Positives = 179/380 (47%), Gaps = 36/380 (9%)
Query: 29 LSKTHAKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDF-----LNQY 83
L H K+ K +R + AL G G S E +A DLD L Y
Sbjct: 93 LHPMHYKKQKMQWSLNDSFKRGLRNALTGLGASNSFGVPFE-RHQATDLDLPSKDELVAY 151
Query: 84 ALERWECILRFMVGS----------QQTEGISADAVRTLFNEGAE----NP-----VITK 124
E +E IL++MV S Q E + L + + NP IT
Sbjct: 152 GEETFESILKYMVSSGLGTEFSGSRPQPEVLQLLHASGLMTDPGDMSRRNPNIYRLTITS 211
Query: 125 DGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKL 184
GFQFLL + Q+W ++ YL E+ E L+ F L G DYS
Sbjct: 212 KGFQFLLEKRQTQLWEILMYYLSAKEANSERSSEVLSMFFSLGCMQLGQDYSASNSFPHA 271
Query: 185 QVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR-----EPGFLIVETNFRV 239
Q L L ++G +Y+ + +F+PT LA ++ + G I+ + FLI+ETN+++
Sbjct: 272 QEALNDLAQYGFIYKPSPDSDQFWPTHLATSLCS-GDASAIQSQSADDKRFLILETNYKI 330
Query: 240 YAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS 299
YAYT + L++A++ LF ++ R+PNL VG L R V+AA+ GI+A QII +L HA P
Sbjct: 331 YAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISARQIIAYLSSHAHPQ 390
Query: 300 V-SCPLP---PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQ 355
+ + P P P + DQ+ LW+ ER RL T E V+Y +F S+ F+ + A+ L
Sbjct: 391 MYNSPPPLLHPTIVDQLHLWDRERNRLQTEETVMY-EFFSKELFDDTVNEAKANAALQHA 449
Query: 356 NEKKRTVVVTKQGHSDIKKF 375
++ + + I +F
Sbjct: 450 ATSQKLLFIEPHTKPAITEF 469
>gi|134118870|ref|XP_771938.1| hypothetical protein CNBN1180 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50254542|gb|EAL17291.1| hypothetical protein CNBN1180 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 481
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 116/384 (30%), Positives = 181/384 (47%), Gaps = 44/384 (11%)
Query: 29 LSKTHAKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQ-----Y 83
L H K+ + ++ + AL G G S E +A DLD ++ Y
Sbjct: 93 LHPMHYKKQRMQWSLNDSFKKGLRNALTGLGTSNSFGVPFE-RHQATDLDLPSEDELVAY 151
Query: 84 ALERWECILRFMVGSQ------------------QTEGISADAVRTLFNEGAENP----- 120
+ +E IL++MV S T G+ D+V + NP
Sbjct: 152 GEDTFESILKYMVSSGLGTEFSGSRPQPEVLQLLHTSGLMTDSV----DMNGRNPNINRL 207
Query: 121 VITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGM 180
IT GFQFLL + Q+W ++ YL E+ E L+ F L G DYS
Sbjct: 208 TITSKGFQFLLEKRQTQLWEILMYYLSAKEANSERSSEVLSMFFSLGCMQLGQDYSASNS 267
Query: 181 SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR-----EPGFLIVET 235
Q L L ++G +Y+ + +F+PT LA ++ + G I+ + FLI+ET
Sbjct: 268 FPGAQEALDDLAQYGFIYKSSPDSDQFWPTHLATSLCS-GDASAIQSQSADDKRFLILET 326
Query: 236 NFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLH 295
N+++YAYT + L++A++ LF ++ R+PNL VG L R V+AA+ GI+A QII +L H
Sbjct: 327 NYKIYAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISARQIIAYLSSH 386
Query: 296 ALPSV-SCPLP---PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGV 351
A P + + P P P + DQ+ LW+ ER RL T E V+Y +F S+ F+ + A+
Sbjct: 387 AHPQMYNSPPPLLHPTIVDQLYLWDRERNRLQTEETVMY-EFFSKELFDDTVNEAKANAA 445
Query: 352 LTWQNEKKRTVVVTKQGHSDIKKF 375
L ++ + + I +F
Sbjct: 446 LQHAATSQKLLFIEPHTKPAITEF 469
>gi|58262306|ref|XP_568563.1| hypothetical protein CNN01190 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57230737|gb|AAW47046.1| hypothetical protein CNN01190 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 481
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 112/349 (32%), Positives = 170/349 (48%), Gaps = 44/349 (12%)
Query: 29 LSKTHAKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQ-----Y 83
L H K+ + ++ + AL G G S E +A DLD ++ Y
Sbjct: 93 LHPMHYKKQRMQWSLNDSFKKGLRNALTGLGTSNSFGVPFE-RHQATDLDLPSEDELVAY 151
Query: 84 ALERWECILRFMVGSQ------------------QTEGISADAVRTLFNEGAENP----- 120
+ +E IL++MV S T G+ D+V + NP
Sbjct: 152 GEDTFESILKYMVSSGLGTEFSGSRPQPEVLQLLHTSGLMTDSV----DMNGRNPNINRL 207
Query: 121 VITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGM 180
IT GFQFLL + Q+W ++ YL E+ E L+ F L G DYS
Sbjct: 208 TITSKGFQFLLEKRQTQLWEILMYYLSAKEANSERSSEVLSMFFSLGCMQLGQDYSASNS 267
Query: 181 SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR-----EPGFLIVET 235
Q L L ++G +Y+ + +F+PT LA ++ + G I+ + FLI+ET
Sbjct: 268 FPGAQEALDDLAQYGFIYKSSPDSDQFWPTHLATSLCS-GDASAIQSQSADDKRFLILET 326
Query: 236 NFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLH 295
N+++YAYT + L++A++ LF ++ R+PNL VG L R V+AA+ GI+A QII +L H
Sbjct: 327 NYKIYAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISARQIIAYLSSH 386
Query: 296 ALPSV-SCPLP---PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFE 340
A P + + P P P + DQ+ LW+ ER RL T E V+Y +F S+ F+
Sbjct: 387 AHPQMYNSPPPLLHPTIVDQLYLWDRERNRLQTEETVMY-EFFSKELFD 434
>gi|405123868|gb|AFR98631.1| transcription factor TFIIH complex subunit Tfb2 [Cryptococcus
neoformans var. grubii H99]
Length = 481
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 116/380 (30%), Positives = 180/380 (47%), Gaps = 36/380 (9%)
Query: 29 LSKTHAKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQ-----Y 83
L H K+ K ++ + AL G G S E +A DLD ++ Y
Sbjct: 93 LHPMHYKKQKMQWSMNDSFKKGLRNALTGLGTSNSFGVPFE-RHQATDLDLPSEDELVAY 151
Query: 84 ALERWECILRFMVGS----------QQTEGISADAVRTLFNEGAE----NP-----VITK 124
+ +E IL++MV S Q E + L + + NP IT
Sbjct: 152 GEDTFESILKYMVSSGLGTEFSGSRPQPEVLQLLHASGLMTDPGDMTGRNPNINRLTITS 211
Query: 125 DGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKL 184
GFQFLL + Q+W ++ YL E+ E L+ F L G DYS
Sbjct: 212 KGFQFLLEKRQTQLWEILMYYLSAKEANSERSSEVLSMFFSLGCMQLGQDYSASKSFPHA 271
Query: 185 QVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR-----EPGFLIVETNFRV 239
Q L L ++G +Y+ + +F+PT LA ++ + G I+ + FLI+ETN+++
Sbjct: 272 QEALDDLAQYGFIYKSSPGSDQFWPTHLATSLCS-GDASAIQSQSADDKRFLILETNYKI 330
Query: 240 YAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS 299
YAYT + L++A++ LF ++ R+PNL VG L R V+AA+ GI+A QII +L HA P
Sbjct: 331 YAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISAGQIIAYLSSHAHPQ 390
Query: 300 V-SCPLP---PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQ 355
+ + P P P + DQ+ LW+ ER RL T E V+Y +F S+ F+ + A+ L
Sbjct: 391 MYNSPPPLLHPTIVDQLHLWDRERNRLQTEETVMY-EFFSKELFDDTVNEAKANAALQHA 449
Query: 356 NEKKRTVVVTKQGHSDIKKF 375
++ + + I +F
Sbjct: 450 ATSQKLLFIEPHTKPAITEF 469
>gi|355693906|gb|AER99491.1| proteinral transcription factor IIH, polypeptide 4, 52kDa [Mustela
putorius furo]
Length = 143
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 78/143 (54%), Positives = 105/143 (73%), Gaps = 3/143 (2%)
Query: 244 DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--- 300
+S L++ALI LF E++YRFPN+ V +TR+SV+ A+ SGITA QII FLR A P +
Sbjct: 1 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 60
Query: 301 SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKR 360
+ LPP ++DQIRLWE ER+RL +EGVLY+QF+SQ DFE L +A++LGVL ++N KR
Sbjct: 61 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 120
Query: 361 TVVVTKQGHSDIKKFWVNHQKGS 383
+VVT GHSD+K+FW + S
Sbjct: 121 LMVVTPAGHSDVKRFWKRQKHSS 143
>gi|303272589|ref|XP_003055656.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463630|gb|EEH60908.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 476
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 127/454 (27%), Positives = 199/454 (43%), Gaps = 91/454 (20%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKV----ALLG 57
L E+AKHY +RL++VE VP + W++ H E+ + ELR + G
Sbjct: 39 LPELAKHYCMRLMYVEGGVPSIDMDEWVTDLGRDAHAESVRRMRELRVMLPLEDVTGAAG 98
Query: 58 GGKPWSMSAKLE-------------------VDSKARDL-------DFLNQYALERWECI 91
G+ ++ K + + A DL + L YA RWE +
Sbjct: 99 DGESLGLNPKFQRGMRSIMEGGGFGDGFDDDDAATASDLGSALPSPEDLESYAKGRWEAL 158
Query: 92 LRFMVGSQ-------------------QTEGISADAVRTLFNEGAENPVITKDGFQFLLL 132
L + G+ + G+ DA + E +T+ GF+FLL
Sbjct: 159 LLTLTGASDAFAAAGANAADLDVGALFRAAGLIGDASK------GEKEGVTEAGFKFLLS 212
Query: 133 ETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLR 192
Q+W LD L+FL +L F G YST+G+ + ++ +
Sbjct: 213 TAREQIWAL------------LDAPAVLSFLLKLTFQAPGVAYSTDGLPASQKGVVRDVA 260
Query: 193 EFGLVYQRKRKA-GRFYPTKLALNMATRGTLKQIREPG------------------FLIV 233
+ GL+Y G + PT L+ ++ G + G +IV
Sbjct: 261 KLGLLYPLAAAGKGYYVPTSLSSGLSGGGGGDDDGDGGVGGGGKKSGDGGGVGARGHIIV 320
Query: 234 ETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLR 293
ETNFRVYAYT S ++V ++ LF Y+ PNL VG++TR++V ALR GI+A QI+ +LR
Sbjct: 321 ETNFRVYAYTSSAVEVEILRLFTRPDYKLPNLYVGMMTREAVVTALRGGISAEQIVSYLR 380
Query: 294 LHALP----SVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQAD-FEKLRDYAQD 348
HA P + +P V DQIRLW + R+ + VLY F + FEK+ + A++
Sbjct: 381 KHAHPQARKTPGPAIPATVCDQIRLWSKDENRVKYTPCVLYCDFPTGTGMFEKVAEIAKE 440
Query: 349 LGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
G+ W + + V ++GH +K + + G
Sbjct: 441 RGLYLWGDPVGLKLAVREEGHESMKDVFKKIRAG 474
>gi|145351329|ref|XP_001420034.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580267|gb|ABO98327.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 495
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 127/433 (29%), Positives = 188/433 (43%), Gaps = 66/433 (15%)
Query: 5 IAKHYVIRLLFVEQPVPQAVIASWLSK--THAKEHKEATDTLTELR-------------- 48
IAK ++RL +V+ A W + A+ + A L LR
Sbjct: 52 IAKQIILRLAYVDDGADARECAKWCGDGASDARALERAMARLARLRVTTALGEGEDARVV 111
Query: 49 ------RNAKVAL------LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV 96
N + AL L G V +K D LN YA +WE +L +
Sbjct: 112 ANEAFANNLRRALERGFVNLDDGDGADAREDAGVAAKVPDRATLNAYAKAKWEDLLLTLT 171
Query: 97 GSQQTEGISADAVRTLFNEGA-----------------------ENPVITKDGFQFLLLE 133
G+ V+ + A +N +T +GF FLL
Sbjct: 172 GASNAFSRPGAKVKGRLDAQALFRAAGLTTATTAALKKGGQKRLKNAGVTAEGFAFLLQT 231
Query: 134 TPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEG-MSDKLQVFLQHLR 192
Q+W + +Y+ +S+ ++FL +L F G Y+ G +SD Q + L
Sbjct: 232 AQEQIWVLLTQYIR--DSQKTSATSAISFLLRLTFQEPGRAYAMTGVLSDTEQDVVLDLT 289
Query: 193 EFGLVYQRKRKAGRFY-PTKLALNMA-----TRGTLKQIRE--PGFLIVETNFRVYAYTD 244
GL+Y + K +Y PT LA ++ G K G +IVETN+RVYAYT
Sbjct: 290 HLGLLYTFEVKKKFYYVPTLLASGLSGGFTGDDGDTKAAAAAAEGHIIVETNYRVYAYTS 349
Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---S 301
S +++ ++ LF YR PNL VG+LTR+SV+ ALR+G+ A QI+G++R HA V
Sbjct: 350 SAVEMEILRLFTRADYRLPNLYVGMLTRESVQNALRAGVDAEQIVGYIRAHAHKQVRRKK 409
Query: 302 CPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADF-EKLRDYAQDLGVLTWQNEKKR 360
+P V DQIRLW + ER+ E VLY F F + A+ G L W+++ R
Sbjct: 410 PSVPSTVCDQIRLWARDMERMEAEECVLYCDFPQAGTFYGAVVSEAEKRGALLWRDDASR 469
Query: 361 TVVVTKQGHSDIK 373
+ V H ++K
Sbjct: 470 RLTVRASAHDEMK 482
>gi|223993063|ref|XP_002286215.1| transcription factor of TFIIH family [Thalassiosira pseudonana
CCMP1335]
gi|220977530|gb|EED95856.1| transcription factor of TFIIH family [Thalassiosira pseudonana
CCMP1335]
Length = 461
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 114/345 (33%), Positives = 169/345 (48%), Gaps = 52/345 (15%)
Query: 60 KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE-----GISADA---VRT 111
PW + + +D + D L Y W+ +L F+VGS +T+ GI V+
Sbjct: 119 SPWPIITQDHIDEFSMDGGDLETYTQSSWDSVLHFLVGSDETDRGSGYGIEEPPEAMVQF 178
Query: 112 LF-------------NEGAENP-VITKDGFQFLLLETPAQVWYFILKYLETV---ESKGL 154
L N+ + P VIT G++F+L +T AQVW F+L+YL ++ E K
Sbjct: 179 LMRIGLMQVDPDFKGNDKSRAPLVITSKGYEFMLRDTNAQVWQFVLQYLNSMAHHEQKDF 238
Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDK-LQVFLQHLREFGLVYQRKRKAGR--FYPTK 211
+E L+FL L G Y + + K + ++ FGL++ R AG+ FYPT+
Sbjct: 239 IRMEALSFLICLGSCRVGEGYQSSVLGSKSARALMRDFSRFGLLFV-CRVAGKNSFYPTR 297
Query: 212 LALNMA----------------TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLF 255
A+N+ T R ++V+TNF+V AYT S L ++ +GLF
Sbjct: 298 AAVNLVASSEKAGRQASDLMGQTLAAPVPSRSHLAVVVQTNFQVVAYTKSKLHISTLGLF 357
Query: 256 CEL--MYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP---SVSCPL-PPVVS 309
C++ R PN+ +TRDS+R+A R G+TA QI+ FL +HA P S P+ P V
Sbjct: 358 CDVSSFRRLPNVIFFHITRDSIRSAFRLGVTADQILRFLHVHAHPMLRSGDRPMVPSNVV 417
Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
DQI LW+ ER R+ E V Q A+F + YA D+ L W
Sbjct: 418 DQILLWDRERHRVVMDE-VCVHQCRDAAEFTAVSQYASDVDALAW 461
>gi|159474228|ref|XP_001695231.1| predicted protein [Chlamydomonas reinhardtii]
gi|158276165|gb|EDP01939.1| predicted protein [Chlamydomonas reinhardtii]
Length = 435
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 120/400 (30%), Positives = 180/400 (45%), Gaps = 42/400 (10%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEAT------DTLTELR------- 48
L +A+ YV+RLLFV P P + SW +K A HK A D LTE R
Sbjct: 32 LPPLAQVYVMRLLFVPSPFPADFVDSWAAKGAASGHKAALSALRGLDVLTEERGRSRARA 91
Query: 49 ---RNAKVALLGGGKPWSMSAKLEVDSKAR------DLDFLNQYALERWECILRFMVGSQ 99
R + G P +S + D A L+ L ++A+ +WE + +++GS
Sbjct: 92 GQGRGRRATGQGPKGPVELSQLMRGDVPAAAASCCPSLEELGEWAVGQWEALQLYLLGSS 151
Query: 100 QTEG-------ISADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVES- 151
++ + A G + +T+ F +LL Q+W + +Y+ E+
Sbjct: 152 RSPPQLPQLLRVRARTCEDGAAAGRRSLAVTQRSFHWLLQPGDRQLWAVLREYISGAEAA 211
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGL--VYQRKRKAGRFYP 209
G +L L+FL QL F G + + Q HL + G+ V+Q A
Sbjct: 212 SGDELAHTLSFLLQLGFRRVGQPCAWAELRPPEQRMAAHLAQLGVLAVFQGSAGAAGSGG 271
Query: 210 TKLALNMATRGTLKQIREP-------GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRF 262
A G F+IVE+N+RVY YT S + +A++ LF +
Sbjct: 272 ALAARGGGAGGAAAAAAMDGGGAGSDAFVIVESNYRVYVYTRSPVTIAVLELFVKREALL 331
Query: 263 PNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVS--CP-LPPVVSDQIRLWEGER 319
PNL VG + RDS+ AAL GITA +++ +L PS++ CP +P VVSDQIRLWE
Sbjct: 332 PNLFVGAIRRDSILAALARGITADELVAYLSARPHPSIAARCPVVPEVVSDQIRLWEASM 391
Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKK 359
RL VLY S+ +E+ A+ G L W++ +
Sbjct: 392 NRLQAHPAVLYENMESRQLYERAVAAARAAGTLQWEDGAR 431
>gi|392574410|gb|EIW67546.1| hypothetical protein TREMEDRAFT_33390 [Tremella mesenterica DSM
1558]
Length = 504
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 112/381 (29%), Positives = 175/381 (45%), Gaps = 51/381 (13%)
Query: 47 LRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
+R + AL G G P+ +S E S+++ L Q+ WE IL++MV S
Sbjct: 124 FKRGLRNALTGSGTSNSFGLPYPLSPTDEYPSESQ----LIQFGDRTWESILKYMVSSGL 179
Query: 101 TEGIS-----ADAVRTLFNEGA--------------ENPVITKDGFQFLLLETPAQVWYF 141
G +R L + G + IT GFQFLL + Q+W
Sbjct: 180 DRGFPVARPETSVLRLLHSSGLMADLSESNGRQQSLDRMTITSQGFQFLLEDRQTQLWQI 239
Query: 142 ILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQV-FLQHLREFGLVYQR 200
++ YL ++E + L+ F L YS S+ Q+ L L+ +GL+Y+
Sbjct: 240 LVFYLTSIEVSCISSAPVLSLFFSLGCMQITQSYSLSTFSNPTQLQALDDLQSYGLIYRP 299
Query: 201 KR----KAGRFYPTKLAL-----NMATRGTLKQIR-------EPGFLIVETNFRVYAYTD 244
KA F+PT L+ N A L + FLI+ETN+++YAYT+
Sbjct: 300 ATPSGAKADYFFPTHLSTSLCSGNSALSAALNTTEIDTSHEDDKKFLILETNYKIYAYTN 359
Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPL 304
+ L++A++ LF ++ ++ NL VG L R V+AA+ GI+A QII +L+ HA P +
Sbjct: 360 NELEIAILNLFMDIKVQYRNLVVGKLDRAHVKAAMEKGISAYQIIAYLQSHAHPQMYSSP 419
Query: 305 PPV----VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKR 360
PP+ + DQ+ LW+ ER RL T E ++ +F S+ +E A+ L K +
Sbjct: 420 PPILHTTIVDQLHLWDKERNRLRTEESEMF-EFFSKDLYEDTEAEAKRYDGLLLAVPKDK 478
Query: 361 TVVVTKQGHSDIKKFWVNHQK 381
+ V IK F Q+
Sbjct: 479 LLFVNPSVKDAIKDFVKGQQR 499
>gi|164660182|ref|XP_001731214.1| hypothetical protein MGL_1397 [Malassezia globosa CBS 7966]
gi|159105114|gb|EDP44000.1| hypothetical protein MGL_1397 [Malassezia globosa CBS 7966]
Length = 231
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 78/184 (42%), Positives = 110/184 (59%), Gaps = 5/184 (2%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS 181
IT GFQFLL + +Q+W +L YL + + +DLVE L F F + G Y + G S
Sbjct: 36 ITSLGFQFLLQDVNSQLWALLLHYLSMADERNMDLVEVLAFFFTVGGLELGRAYESRGFS 95
Query: 182 DKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT-----LKQIREPGFLIVETN 236
L+ L ++GLVY+ + A F+PT+LA + + + L E G+LI+ETN
Sbjct: 96 QTQLQTLEELSDYGLVYRPSKSAKYFFPTRLASTLTSTASPLLSRLNDQEEQGYLILETN 155
Query: 237 FRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHA 296
+RVYAYT + L++A++ LF L R PNL +G LTR SV++AL GITA QII +L HA
Sbjct: 156 YRVYAYTANLLRIAILNLFVTLKSRLPNLVIGQLTRHSVKSALNKGITADQIITYLTHHA 215
Query: 297 LPSV 300
P +
Sbjct: 216 HPQM 219
>gi|307106040|gb|EFN54287.1| hypothetical protein CHLNCDRAFT_25123 [Chlorella variabilis]
Length = 458
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 102/322 (31%), Positives = 154/322 (47%), Gaps = 24/322 (7%)
Query: 80 LNQYALERWECILRFMVGSQQTEGIS---------------ADAVRTLFNEGAENPVITK 124
L+ YA +WE +L ++V T ++ A A + +E IT+
Sbjct: 133 LDAYARRQWEALLLYLVNGNGTPPMAPPVLHATPIDIPSLLAAAGLMVKDECTLEQKITE 192
Query: 125 DGFQFLLLETPAQVWYFILKYLETVESKG-LDLVECLTFLFQLKFSTFGTDYSTEGMSDK 183
GFQFLL +Q+W + +YL + + G DL + FL +L + +
Sbjct: 193 HGFQFLLANLYSQLWSAVRQYLTLLNTAGGADLAVAINFLLRLGLQGAAAAMAHSQLDSA 252
Query: 184 LQVFLQHLREFGLVYQRKRKAGR-FYPTKLALNMATRGTLKQIR---EPGFLIVETNFRV 239
+ H+ + GL+ +PT+LA +A G + E G++IVE+NFRV
Sbjct: 253 ERTIAAHMCQLGLLMPVPGANELWLHPTRLAAVLAGGGRAGEAAVAPEEGYVIVESNFRV 312
Query: 240 YAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS 299
YAYT S ++VA++ +F PNL VG +TR+S AL +GI A Q++ FLR HA P
Sbjct: 313 YAYTTSAVQVAVLRVFVRCDALLPNLFVGTITRESATNALDTGIAADQVVAFLRQHAHPR 372
Query: 300 VSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKK 359
+ P VV+DQIRLW E +RL LY +F S+ + +A+ L L + E +
Sbjct: 373 AAAKTPTVVTDQIRLWAQELKRLQEKNATLYDKFESKELYVGAVAHARQLNALLYSCEDR 432
Query: 360 RTVVVTKQGHSDIKKFWVNHQK 381
R +VV H V H K
Sbjct: 433 RQLVVESAFHG----LMVGHLK 450
>gi|303388087|ref|XP_003072278.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon
intestinalis ATCC 50506]
gi|303301417|gb|ADM10918.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon
intestinalis ATCC 50506]
Length = 414
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 99/302 (32%), Positives = 154/302 (50%), Gaps = 36/302 (11%)
Query: 87 RWECILRFMVGSQQTEGISADAVRTLFNEGA---ENPVITKDGFQFLLLETPAQVWYFIL 143
R+E ILR +V + I VR + G + IT GF+FLL Q+W +L
Sbjct: 131 RFETILRSIVNKESK--IREFGVREVLMFGGLLGRDENITNRGFEFLLKTKKEQLWCLVL 188
Query: 144 ----KYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQ 199
+L++VE + V L +F+L GT Y DK L++L G++
Sbjct: 189 LSLKYFLKSVEEE----VSTLEAIFELSTKNVGTVYRHSSSMDKR--LLKYLEVLGIL-- 240
Query: 200 RKRKAGRFYPTKLALNMATRGTLKQIREPG------FLIVETNFRVYAYTDSNLKVALIG 253
+ Y N+A R + Q+ E F+IVETN ++YAYT+S + ++I
Sbjct: 241 ------KLYGE----NLAIRRSFVQLFEASERNRREFIIVETNNKIYAYTNSEYEKSVIH 290
Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIR 313
LFC + + PNLT G +T +SV AA GIT QII FL + P LPP + +QI
Sbjct: 291 LFCNVSFNLPNLTKGSITEESVNAAFDKGITGRQIIHFLEASSKPG---SLPPAIINQII 347
Query: 314 LWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIK 373
+WE +R R+ + G LYS F++ +D++K+ ++ + L + +R +VV +GH +K
Sbjct: 348 IWESKRNRIFMAPGYLYSNFLNLSDYQKVLEFCSERNYLIESDIDRRMIVVNPKGHVFVK 407
Query: 374 KF 375
+F
Sbjct: 408 EF 409
>gi|412992641|emb|CCO18621.1| predicted protein [Bathycoccus prasinos]
Length = 542
Score = 144 bits (363), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 163/371 (43%), Gaps = 71/371 (19%)
Query: 76 DLDFLNQYALERWECILRFMVGSQQTE---------------GISADA---VRTLFNEG- 116
D FL+ YA RWE +L + E G A +R LF
Sbjct: 171 DAGFLDHYAKSRWETVLLELTSEASVEKKRKKKKMMKKKGDGGAGAQHALHLRKLFYAAR 230
Query: 117 --AENPVITKDGFQFLLLETPAQVWYFILKYL---------------------ETVESKG 153
++ IT +GF FLL T Q+W + +Y + +E+ G
Sbjct: 231 LITKDGRITSEGFSFLLGATSEQIWILLARYARGGDFEKKMKNKTTNEENGEGDKMETNG 290
Query: 154 LD------------LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK 201
D + FL +L F G YS +S+ + HL G++Y+ +
Sbjct: 291 EDEKQQHKLKSDESSAAAMAFLVRLSFQHPGRKYSKANLSEAERRVASHLSALGVLYENE 350
Query: 202 RKAGR---FYPTKLALNMATRGTLKQIREP-----GFLIVETNFRVYAYTDSNLKVALIG 253
+ PT L+ +++ T + G +IVETNFRVYAYT S L+ ++
Sbjct: 351 DDENDNNWYVPTVLSAGLSSVSTTSSAKSALARIDGHIIVETNFRVYAYTHSELETEVLR 410
Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC-------PLPP 306
LF Y+ PN VG++TRDS+ A+R+GI+ QI+ +L+ A P + +PP
Sbjct: 411 LFTRPDYKLPNAYVGMITRDSILDAMRAGISPDQIVNYLQTRAHPRCTIGKRPNHPAVPP 470
Query: 307 VVSDQIRLWEGERERLTTSEGVLYSQFVSQAD-FEKLRDYAQDLGV-LTWQNEKKRTVVV 364
V DQIRLW + R+ + V+Y+ F Q + F+ + A+++G + W +E KR V
Sbjct: 471 TVCDQIRLWARDLYRVKADDCVMYTDFPMQGNQFQDAVNNARNVGAQILWMDENKRRFAV 530
Query: 365 TKQGHSDIKKF 375
H +K F
Sbjct: 531 DADSHERLKVF 541
>gi|323455511|gb|EGB11379.1| hypothetical protein AURANDRAFT_52511 [Aureococcus anophagefferens]
Length = 291
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 93/266 (34%), Positives = 144/266 (54%), Gaps = 21/266 (7%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETV-ESKGLDLVECLTFLFQLKFSTFGTDYSTEGM 180
+T G FLL E QVW + +Y+ ++KG E + + L ++T G Y+ +
Sbjct: 12 VTGRGVDFLLKERHEQVWALVDEYIRAAGDAKG----EVVALVLTLAYATPGEGYAIHEL 67
Query: 181 SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMA-----TRGTLK---QIREPGFLI 232
S+ + L L GLVY+R + RFYPT L +++A + G + R P +I
Sbjct: 68 SEAQKAALDVLFALGLVYRRNASSSRFYPTTLGVDVAFGARRSAGGARAGGDFRRPVDVI 127
Query: 233 VETNFRVYAYTD-----SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQ 287
V+TNF+V AYTD S L +A + LF EL R PNL VG ++RD+++ + GI Q
Sbjct: 128 VQTNFQVLAYTDAGVNTSTLVLATLNLFAELTTRLPNLVVGTISRDAIKRCVDRGIRVPQ 187
Query: 288 IIGFLRLHALPSV-SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYA 346
I+ FLR HA P++ + +P V+DQ+ LW GE R+ ++GVL + S A ++ D A
Sbjct: 188 IVKFLRAHAHPAMKASGVPQNVTDQMALWAGEGNRVAFTDGVLVA-LDSDALYDAALDAA 246
Query: 347 QDLGV-LTWQNEKKRTVVVTKQGHSD 371
+DL + W++ +KR + V D
Sbjct: 247 RDLDARVHWKSARKRCLFVDADAEPD 272
>gi|322785901|gb|EFZ12520.1| hypothetical protein SINV_14207 [Solenopsis invicta]
Length = 170
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 69/124 (55%), Positives = 84/124 (67%), Gaps = 21/124 (16%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL IAK+YV+RLLFVEQPVPQAVIASW SK + +EH++ L EL
Sbjct: 47 ELPVIAKNYVMRLLFVEQPVPQAVIASWCSKLYFEEHQKVVQVLNELYVWKEASIPGGLP 106
Query: 48 --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 107 GWILNNTFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 166
Query: 100 QTEG 103
Q EG
Sbjct: 167 QQEG 170
>gi|167535774|ref|XP_001749560.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771952|gb|EDQ85611.1| predicted protein [Monosiga brevicollis MX1]
Length = 417
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 84/235 (35%), Positives = 129/235 (54%), Gaps = 18/235 (7%)
Query: 79 FLNQYALERWECILRFMVGSQQTEGISADAV----RTLFNEGAENPV-ITKDGFQFLLLE 133
FL +++ WE IL +M + + + + + + + N+ + P+ +T +GFQFLL +
Sbjct: 128 FLEEHSRVSWEAILSYMTTNAELKNSTMILILQEAQLIVND--QPPISVTPEGFQFLLQD 185
Query: 134 TPAQVWYFILKYL------ETVESKGLDLVECLTFLFQLKFSTFGT----DYSTEGMSDK 183
QVW I+ +L E + +GL + E L LF++ ++S E ++
Sbjct: 186 RANQVWLLIVTFLKLLSREEPIPGEGLPMHEALVLLFRIGLGRLNEVSVLEFSAENLTTM 245
Query: 184 LQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR-EPGFLIVETNFRVYAY 242
FL+ L G+VY R RK+ RF+ T A+ + L + E GFL+ ETNF VYAY
Sbjct: 246 QITFLRQLHYMGVVYHRNRKSKRFFLTPYAVMLYHNAELALSQTETGFLLAETNFHVYAY 305
Query: 243 TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
TDS +KVAL+ F L YR P +T ++TR SVR AL GITA QI+ F++ +L
Sbjct: 306 TDSAVKVALLSKFATLTYRLPIMTTAIITRTSVRRALSQGITADQILRFMQRSSL 360
>gi|207340606|gb|EDZ68904.1| YPL122Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 226
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 72/168 (42%), Positives = 106/168 (63%), Gaps = 21/168 (12%)
Query: 229 GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
G LIVETNF++Y+Y++S L++A++ LF L RF N+ +G +TR+S+R AL +GITA QI
Sbjct: 50 GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQI 109
Query: 289 IGFLRLHALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEG 327
I +L HA P + +C LPP V DQIRLW+ E +R+ T EG
Sbjct: 110 IAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEG 169
Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
LYS F + ++ L YAQD+GVL W+++KK+ ++K+G+S + F
Sbjct: 170 SLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDF 217
>gi|401825125|ref|XP_003886658.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon hellem
ATCC 50504]
gi|395459803|gb|AFM97677.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon hellem
ATCC 50504]
Length = 414
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 84/259 (32%), Positives = 133/259 (51%), Gaps = 21/259 (8%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYS-TEGM 180
IT GF+FLL Q+W +L L+ + + L LF+L G Y T+ M
Sbjct: 167 ITNRGFEFLLKTKKEQLWCLVLLSLKYFLGSVDEEIHALEALFELSTKAVGIAYHRTDRM 226
Query: 181 SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT----RGTLKQIREPGFLIVETN 236
+L L++L G++ + Y LA+ + + + RE F+IVETN
Sbjct: 227 DARL---LKYLEALGIL--------KLYKESLAIGKSFVQLFEASERNRRE--FIIVETN 273
Query: 237 FRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHA 296
++YAYT+S + ++I LFC + PNL G +T +SV A GIT QII FL
Sbjct: 274 NKIYAYTNSEYEKSVIHLFCSVSVNLPNLIKGSITEESVNVAFDKGITGKQIIHFLEASV 333
Query: 297 LPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQN 356
P LPP +S QI +WE +R R+ + G +YS F++ +D++++ ++ + L +
Sbjct: 334 KPG---SLPPAISSQIMIWESKRNRIFMAPGYIYSNFLNLSDYQRVLEFCTERSYLIESD 390
Query: 357 EKKRTVVVTKQGHSDIKKF 375
KR +VV +GH +K+F
Sbjct: 391 VDKRMIVVKVEGHELVKEF 409
>gi|361130709|gb|EHL02459.1| putative RNA polymerase II transcription factor B subunit 2 [Glarea
lozoyensis 74030]
Length = 210
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/202 (37%), Positives = 115/202 (56%), Gaps = 28/202 (13%)
Query: 187 FLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT-LKQIRE------------PGFLIV 233
L++L +FGLVY +F+PT+LA + + + L+ I GF+I+
Sbjct: 1 MLENLVDFGLVYIPPSVKDQFFPTRLATTLTSDASALRSITAGFESALSTGSGTAGFIII 60
Query: 234 ETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLR 293
ETN+ S L++A++ LF +L R+PN+ G +TR+SV A+ SGI++ QII +L
Sbjct: 61 ETNYH------SPLQIAVLALFTKLTTRYPNMVTGRVTRESVYRAVSSGISSDQIITYLS 114
Query: 294 LHALPSV--------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRD 344
HA P + P LPP V DQIRLW+ E ER+ T+ G L+ F S +++KL
Sbjct: 115 THAHPELLKASAAKGGGPVLPPTVVDQIRLWQIENERMKTTYGFLFKDFESTDEYKKLLK 174
Query: 345 YAQDLGVLTWQNEKKRTVVVTK 366
YA ++GVLTW ++ + V+K
Sbjct: 175 YADEIGVLTWASKTEEKFFVSK 196
>gi|85691005|ref|XP_965902.1| hypothetical protein ECU01_0670 [Encephalitozoon cuniculi GB-M1]
gi|19068469|emb|CAD24937.1| hypothetical protein [Encephalitozoon cuniculi GB-M1]
gi|449329766|gb|AGE96035.1| hypothetical protein ECU01_0670 [Encephalitozoon cuniculi]
Length = 414
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 84/256 (32%), Positives = 129/256 (50%), Gaps = 15/256 (5%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS 181
IT GF+FLL Q+W +L L+ + + L LF+L GT Y
Sbjct: 167 ITNRGFEFLLKTKKEQLWCLVLLSLKYFLGSVEEEIAVLEALFELSARAVGTVYRQVDQM 226
Query: 182 DKLQVFLQHLREFGLV--YQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRV 239
D+ ++L G++ Y++ GR + + R RE F+IVETN ++
Sbjct: 227 DRR--LFKYLEALGILRLYEKGLAIGRSFVQLFEASERNR------RE--FIIVETNNKI 276
Query: 240 YAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS 299
YAYT+S + ++I LFC + PNL G++T +SV A GIT QII FL
Sbjct: 277 YAYTNSEYEKSVIHLFCNVTVNLPNLIKGIITEESVNVAFDKGITGKQIIHFLEASVRQG 336
Query: 300 VSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKK 359
LPP + +QI +WE +R R+ G LYS F++ +D++K+ ++ L + K
Sbjct: 337 ---SLPPAIRNQIVIWESKRNRIFMVPGYLYSNFLNLSDYQKVLEFCIRGNHLVESDVDK 393
Query: 360 RTVVVTKQGHSDIKKF 375
R +VV +GHS +K+F
Sbjct: 394 RMIVVKLEGHSLVKEF 409
>gi|396080769|gb|AFN82390.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon
romaleae SJ-2008]
Length = 414
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 90/296 (30%), Positives = 147/296 (49%), Gaps = 24/296 (8%)
Query: 87 RWECILRFMVGSQQTEGISADAVRTLFNEGA---ENPVITKDGFQFLLLETPAQVWYFIL 143
++E ILR +V + I A + + G ++ IT GF+FLL Q+W +L
Sbjct: 131 KFEAILRSIVNREYR--IQAFGIGEVLMFGGLLRKDRNITNRGFEFLLKTKKEQLWCLVL 188
Query: 144 ----KYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQ 199
+L +VE + + L LF+L T T + +G D L++L G++
Sbjct: 189 LSLKYFLRSVEEE----IYALEALFELSTKTIETAHHQDGRMDGR--LLKYLEALGIL-- 240
Query: 200 RKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
R G + + + RE F+IVETN ++YAYT+S + ++I LFC +
Sbjct: 241 --RVCGERLAIGRSFVQLFEASERNRRE--FIIVETNNKIYAYTNSEYEKSVIHLFCNVS 296
Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGER 319
PNL G +T +SV A GIT QII FL P LPP ++ QI +WE +R
Sbjct: 297 VSLPNLIKGSITEESVNVAFDKGITGKQIIHFLEASVKPG---SLPPAITSQIMIWESKR 353
Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
R+ G +YS F++ +D++++ ++ + L + KR +VV +GH +++F
Sbjct: 354 NRIFMVPGYIYSNFLNLSDYQRVLEFCTERSYLIESDVGKRMIVVKLEGHELVREF 409
>gi|397564488|gb|EJK44232.1| hypothetical protein THAOC_37249 [Thalassiosira oceanica]
Length = 971
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/318 (32%), Positives = 152/318 (47%), Gaps = 51/318 (16%)
Query: 80 LNQYALERWECILRFMVGSQQTE-----GISADAVR-----TLFNEGAENP--------- 120
L + +W+ +L F+VGS ++ GI V T E+P
Sbjct: 225 LEAHTQSQWDSVLHFLVGSDDSDRKGVNGIEEPNVAMVNFLTRIGLMQEDPDFTGKDRSR 284
Query: 121 ---VITKDGFQFLLLETPAQVWYFILKYLETVESKGL-DLV--ECLTFLFQLKFSTFGTD 174
VIT G++F+L +T AQVW F+L+YL ++ L D + E L+FL L G
Sbjct: 285 APLVITSKGYEFMLRDTNAQVWQFVLQYLNSMAHHDLKDTIRKEALSFLICLGSCRIGEG 344
Query: 175 YSTEGMSDK-LQVFLQHLREFGLVYQRKRKAGR--FYPTKLALNMATRGTLKQIREPGFL 231
Y + + K +V ++ FGL++ R AG+ FYPT++A+N+ R+ L
Sbjct: 345 YFSSVLGSKSARVLMKDFARFGLLFV-CRVAGKTAFYPTRVAVNLVASNEKGGSRQSDAL 403
Query: 232 I--------VETNFRVYAYTDSNLKVALIGLFCELMY-RFPNLTVGVLTRDSVRAALRSG 282
+ +E T S+L V + Y R PN+ LTRDS+++A R G
Sbjct: 404 LPSVAATRSLEEAMNAPDPTRSHLAV--------ISYKRLPNVVFFHLTRDSIKSAFRLG 455
Query: 283 ITAAQIIGFLRLHALP---SVSCPLPPV-VSDQIRLWEGERERLTTSEGVLYSQFVSQAD 338
+TA QI+ FL++HA P S + P+PP V DQI LW+ ER R+ E V Q A+
Sbjct: 456 VTADQILRFLQVHAHPMMRSGNQPMPPANVRDQILLWDRERRRVVMDE-VWVHQCRDDAE 514
Query: 339 FEKLRDYAQDLGVLTWQN 356
F + YA D L W N
Sbjct: 515 FSAVGMYASDSEALAWGN 532
>gi|324515605|gb|ADY46257.1| General transcription factor IIH subunit 4 [Ascaris suum]
Length = 245
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 83/237 (35%), Positives = 126/237 (53%), Gaps = 37/237 (15%)
Query: 64 MSAKLEVDSKARDLDF--LNQYALERWECILRFM-VGSQQTEGISADAVRTLF------- 113
+ A ++D K+R L + A ERWECIL ++ + SQ++E + A + LF
Sbjct: 9 LKAMTDLDEKSRKSASKDLGKKATERWECILHYLALPSQKSEQGVSGATKQLFRAAGLTS 68
Query: 114 -NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQL------ 166
E + IT GFQFLLL Q+W +IL YL ES G +++ L FL +L
Sbjct: 69 GGESEGDMEITSAGFQFLLLNQTEQIWMYILHYLRLEESMGKNVMAELDFLLKLTLCVDH 128
Query: 167 ---------------KFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT- 210
K +T+ + E S+ + FL LRE GLV+ RKRK G F+ T
Sbjct: 129 THTNGRLRVVDGDREKKATWRAFFIDESWSETITNFLMQLRELGLVFIRKRKDGYFFITP 188
Query: 211 ---KLALNMATRGTLKQIR-EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFP 263
L ++ ++ + + R G++IVETN+RVYAYTDS+L++A++ F E++YR+
Sbjct: 189 LFAHLTVSSSSETAVAEKRTHSGYIIVETNYRVYAYTDSSLQLAILSTFTEMLYRWA 245
>gi|440493551|gb|ELQ76006.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH, subunit TFB2 [Trachipleistophora
hominis]
Length = 426
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 84/281 (29%), Positives = 145/281 (51%), Gaps = 32/281 (11%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTF---LFQ----LKFS---TF 171
IT GF+FLL Q+W+ IL + + +D + + F +++ LK + F
Sbjct: 157 ITHSGFEFLLKSRKEQMWFLILNGILMLGKNYVDWLTMIVFEMGMYERHCWLKINESKIF 216
Query: 172 GT------DYSTEGMSDK-----LQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRG 220
GT + + + +SDK L L++L G++ + PT + +
Sbjct: 217 GTARKHNKNTNIDPVSDKEAHSVLVRLLKYLSYLGII---QYSNDMVKPTAEYFLLFSSA 273
Query: 221 TLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALR 280
+ + FLIVETN+++YAYT S+ ++++I LF +++ PNL +T +SV AA
Sbjct: 274 AI----QDSFLIVETNYKLYAYTTSSHELSIIKLFSQIVRELPNLVTAHITEESVNAAFL 329
Query: 281 SGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFE 340
GIT QI+ +L LPPVV +QI +WE +R+R+ + V+YS F++ ++E
Sbjct: 330 KGITGQQIVDYLT----EKSKSELPPVVLEQILIWERQRDRMKCIDAVIYSHFMTYNEYE 385
Query: 341 KLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
Y ++ G L +E +R +VV + H+++K F N+ K
Sbjct: 386 ITYRYCREKGALIDHDEFRRLLVVRLECHNEVKNFIKNNIK 426
>gi|422294120|gb|EKU21420.1| transcription initiation factor TFIIH subunit 4, partial
[Nannochloropsis gaditana CCMP526]
Length = 213
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 74/212 (34%), Positives = 111/212 (52%), Gaps = 29/212 (13%)
Query: 172 GTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR----- 226
G Y +S Q L+ FGL+YQ R + FYPT +A+N+ GT+++ R
Sbjct: 1 GRPYPVRALSPTQQALLEKFISFGLIYQDDRHSRHFYPTAVAVNLIFGGTVQEERLRRGH 60
Query: 227 -----------------EPGFL--IVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTV 267
+P L IVETN+++ AYT S+L V ++ +F ++ R PN+ +
Sbjct: 61 GHVGPEGRENTKELRLVDPSQLAVIVETNYQLVAYTSSSLHVEMLRIFTDVRCRLPNVVI 120
Query: 268 GVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCP----LPPVVSDQIRLWEGERERLT 323
G +TR SVR A+ SGITAA I+ FL+ H +V LP V QI LW ER R+
Sbjct: 121 GFITRASVRRAMASGITAATILSFLKTHTHVAVRAGKGRLLPENVEAQIELWHQERSRVK 180
Query: 324 TSEGVLYS-QFVSQADFEKLRDYAQDLGVLTW 354
E ++ + +FE++R YA++L V+ W
Sbjct: 181 FEEVMMIDLSSLMVEEFEEVRTYAENLAVVCW 212
>gi|429965574|gb|ELA47571.1| hypothetical protein VCUG_00894 [Vavraia culicis 'floridensis']
Length = 424
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 137/273 (50%), Gaps = 30/273 (10%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTF------------LFQLKFS 169
IT GF+FLL Q+W+ IL + + +D + + F + + K
Sbjct: 157 ITHSGFEFLLKNRKEQMWFLILNGILMLGKSYVDWLIMIVFEMGMYERYRWLKVNEAKIF 216
Query: 170 TFGTDYSTEGMS-------DKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTL 222
++S S D L FL++L G++ + PT+ + + +
Sbjct: 217 CDVCEHSNNTGSVGGGEEGDILFRFLKYLSYLGII---QHSNDMVKPTEEYFLLFSSAS- 272
Query: 223 KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSG 282
+ FL+VETN+++YAYT S+ ++++I LFC+++ PNL V +T +SV AA G
Sbjct: 273 ---TQDSFLVVETNYKLYAYTTSSHELSIIKLFCQIIRELPNLVVAHITEESVNAAFVKG 329
Query: 283 ITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKL 342
IT QI+ +L LP VV +QI +WE +R+R+ + V+YS F++ ++E
Sbjct: 330 ITGQQIVDYLN----EKSRSELPVVVLEQILIWERKRDRMKCMDAVIYSHFMTYGEYEIT 385
Query: 343 RDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
Y ++ G L +E +R +VV + H+D+K F
Sbjct: 386 YKYCKEKGALVDYDEFRRLLVVKLEYHNDVKNF 418
>gi|449017614|dbj|BAM81016.1| similar to TFIIH subunit TFB2 [Cyanidioschyzon merolae strain 10D]
Length = 693
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 64/157 (40%), Positives = 96/157 (61%), Gaps = 7/157 (4%)
Query: 228 PGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQ 287
P +IVETNFR+YAY S+ +VAL+ LF ++YR P + +GV+TRDSVR AL+ GITA Q
Sbjct: 535 PFQIIVETNFRLYAYAASSFQVALLSLFTRILYRMPGVAIGVITRDSVRRALKCGITAKQ 594
Query: 288 IIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQAD----FEKLR 343
++ FL +H++ S P V DQI LWE ER+R+ GVL F +D F++L+
Sbjct: 595 LLHFLGIHSMEGKSVPF--NVHDQILLWELERKRIQAYPGVLLEGFEPTSDGRAFFDQLQ 652
Query: 344 DYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQ 380
+Y +LG W + ++ +V +++ W+ Q
Sbjct: 653 EYTVELGAQQWCDRVRQLLVADASSFERLRE-WIRRQ 688
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 60/145 (41%), Gaps = 26/145 (17%)
Query: 80 LNQYALERWECILRFMVG--------SQQT----------EGISADAVRTLFNEGAENPV 121
L +Y WE +L F++G Q T E S + + L + G V
Sbjct: 231 LREYTRHAWEHVLHFVIGIPVAEPEIRQPTGPEHTPEHVMEPPSDEVIEALLSLGVSEQV 290
Query: 122 ------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDY 175
IT+ GF +LLL+ AQ+W + Y+E + D +E LF+L FS G Y
Sbjct: 291 GSDGMRITERGFSYLLLDIHAQLWALLEAYMENWDEARGDRLELFDLLFRLGFSVPGHIY 350
Query: 176 S--TEGMSDKLQVFLQHLREFGLVY 198
+S L L E GLVY
Sbjct: 351 DGGDPSLSAAQCRMLGFLAEIGLVY 375
>gi|428181353|gb|EKX50217.1| TFB2 transcription factor B2 nucleotide excision repair [Guillardia
theta CCMP2712]
Length = 463
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 92/297 (30%), Positives = 134/297 (45%), Gaps = 38/297 (12%)
Query: 61 PWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVG---------------SQQTEGI- 104
PWS ++ DS + L Q+A RWE IL +MV ++ GI
Sbjct: 123 PWS-KGRIMGDSPDTTAEELQQWAQNRWESILHYMVSPGLDPMKRPEPGWKPTKWVRGIL 181
Query: 105 -SADAVRTLFNEGAEN--PVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLT 161
D F +N P IT GFQFLL Q+ ++ L S+ DL + L+
Sbjct: 182 VGEDKNGNSFMMSLQNLAPKITAFGFQFLLRPLRNQILQMLMHVLTKKGSENKDLQDSLS 241
Query: 162 FLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT 221
+FQL F T G Y + +S Q + L + G VY + F+PT L +N+ T
Sbjct: 242 CIFQLSFCTLGAGYRLDLLSRSNQELITSLHDLGAVYFKPNDRQYFFPTPLIVNLCTESE 301
Query: 222 LKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRS 281
+ P T+ V + ++ LF +Y+FP++ + V+TR+S+R AL
Sbjct: 302 VPS--GPS-----TDAGVADEFSAEPAAGILRLFTRPIYKFPHMIIAVITRESIRNAL-- 352
Query: 282 GITAAQIIGFLRLHALPSV--SCPL-PPVVSDQIRLWEGERERLTTSEGVLYSQFVS 335
II +LR+HA P + P+ P VV+DQI WE ER R+ V Y F S
Sbjct: 353 ------IIEYLRMHAHPQCLENWPIVPEVVTDQICFWEQERCRIRAEPAVAYHNFFS 403
>gi|449521313|ref|XP_004167674.1| PREDICTED: RNA polymerase II transcription factor B subunit 2-like,
partial [Cucumis sativus]
Length = 296
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 75/246 (30%), Positives = 123/246 (50%), Gaps = 30/246 (12%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK +V+++L+++ PV + W+ ++K A D L +LR
Sbjct: 40 LPPLAKKFVLQMLYIDGPVSAKSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETT 99
Query: 49 --------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
N + L+ G + + L+ L YAL++WEC L ++ S Q
Sbjct: 100 YRLNPTFQANLQKLLIHGEVLAREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQ 159
Query: 101 TEGIS--ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
E S + +V +F +G E P +T+ GFQFLL+ET AQ+WY I +Y+ E +
Sbjct: 160 AEKPSNISSSVMKVFQKGLLSQRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEER 219
Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTK 211
G+D + ++FL +L F G Y + +SD+ + ++ L + GLV Q+ RK F PTK
Sbjct: 220 GVDPADLISFLLELSFHVTGEAYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWFIPTK 279
Query: 212 LALNMA 217
LA N++
Sbjct: 280 LATNLS 285
>gi|308460920|ref|XP_003092758.1| hypothetical protein CRE_20976 [Caenorhabditis remanei]
gi|308252517|gb|EFO96469.1| hypothetical protein CRE_20976 [Caenorhabditis remanei]
Length = 128
Score = 115 bits (289), Expect = 3e-23, Method: Composition-based stats.
Identities = 56/127 (44%), Positives = 86/127 (67%), Gaps = 9/127 (7%)
Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP-------SVSCPLPPVVSDQIRLWEG 317
++VG++TR+SVR AL+ GITAAQII FLR +A P +++C LP V+DQIRLWE
Sbjct: 1 MSVGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGAINC-LPITVADQIRLWED 59
Query: 318 ERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWV 377
ER R+ + +YS F S+ +F+ + DYAQ+ G+L W N +++ V+V ++GH +++ W
Sbjct: 60 ERRRMDLKDAYIYSHFESEDEFQGVCDYAQERGILLWANAQQKLVIVNEEGHEYVRQ-WY 118
Query: 378 NHQKGSQ 384
KG +
Sbjct: 119 KRSKGGE 125
>gi|300707677|ref|XP_002996037.1| hypothetical protein NCER_100934 [Nosema ceranae BRL01]
gi|239605297|gb|EEQ82366.1| hypothetical protein NCER_100934 [Nosema ceranae BRL01]
Length = 403
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 81/298 (27%), Positives = 151/298 (50%), Gaps = 22/298 (7%)
Query: 87 RWECILRFMVGSQQTEGISADAVRTL-FNEGAENPV-ITKDGFQFLLLETPAQVWYFILK 144
++E IL+F+V + V L F+ +N IT GF+FLL Q W+ I+
Sbjct: 121 KFEEILKFLVTKNNVNH-APGVVNVLKFSNLIDNNYDITNKGFEFLLKPRYDQYWFLIIA 179
Query: 145 YLE--TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKR 202
L+ V+ ++++ F+ ++ S Y + D + F L GL+
Sbjct: 180 ALKFYCVD----EILQISNFMSIMELSNMLPIYKYKLKKDVNKKFYDFLSYLGLIKLEND 235
Query: 203 KAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRF 262
++ N+ + K++R F+++ETNF++YAYT S ++++I LF + +
Sbjct: 236 ILTIYH------NLFVKSDTKKLR---FILLETNFKLYAYTSSVYEMSIIQLFSNIYLKM 286
Query: 263 PNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERL 322
PNL +T +S+ A G+T+ QII FL+ ++L +P + QI +WE +R+R+
Sbjct: 287 PNLIKASITEESLSNAFSKGVTSQQIINFLKSYSLFE---DIPVAIISQIIIWETKRKRI 343
Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQ 380
G LYS F++ D++K+ + + ++++KR +V+ K H+++ K +V Q
Sbjct: 344 KIFPGYLYSNFLNLIDYQKVVKFCLSNDCIIEKDDEKRMIVI-KPDHNEVVKKFVKQQ 400
>gi|403348380|gb|EJY73625.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Oxytricha trifallax]
Length = 532
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/254 (31%), Positives = 131/254 (51%), Gaps = 32/254 (12%)
Query: 67 KLEVDSKARDLDFLNQYALERWECILRFMVG---SQQTEGIS-----ADAVRTLFNEGAE 118
+L+ K ++ L+ + +WE I +FM+ QQ S A F+ A+
Sbjct: 142 RLKEKKKIKNQAQLDTASQTKWENISKFMLNIIPPQQNTPKSLINLLQQARYIEFDVNAQ 201
Query: 119 NPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTE 178
+T +GFQ++L++ +QV +L Y++T E+ G ++VE L F+F L T+ + +
Sbjct: 202 TYQLTVNGFQYILMDIASQVQNILLNYIQTAETSGRNVVEVLGFIFNLTL----TEPNYD 257
Query: 179 GMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL--ALNMATRGTLKQI----------- 225
G ++ + L+ + GLV +R+ +F T L + + G L++
Sbjct: 258 G-NENVNSILRDFDDMGLVGMERRQ--KFVITSLLQSFLQSQTGQLQKSSDSSSSNVIDA 314
Query: 226 ----REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRS 281
+ FLIVETNF+VYAYT S+L AL+ F + FPNL VG LTR S++ A +
Sbjct: 315 QRSQSQDKFLIVETNFKVYAYTSSDLYRALLRQFIRVECIFPNLVVGTLTRKSLQKAFQR 374
Query: 282 GITAAQIIGFLRLH 295
GI++ QI+ FL H
Sbjct: 375 GISSGQILSFLESH 388
>gi|123488965|ref|XP_001325283.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121908180|gb|EAY13060.1| hypothetical protein TVAG_212420 [Trichomonas vaginalis G3]
Length = 460
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 105/417 (25%), Positives = 177/417 (42%), Gaps = 67/417 (16%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEA-------------TDTLTELR 48
L +++ + RLL++E+ P++ + W K+ + + A TDT++E
Sbjct: 33 LPPLSQQILFRLLYIERGFPRSEVEKWSPKSKSNDLINAIKKLIKAHLVYIVTDTISENS 92
Query: 49 RNAKV--------ALLGG-----------GKPWSMSAKL--------EVD-SKARDLD-- 78
N+ + LLG P++ + E+D SK +D
Sbjct: 93 YNSSIIINKRVREVLLGTYVEDKKAFVRQKNPFAPDDQPDENFVPFDELDQSKFPTIDEK 152
Query: 79 FLNQYALERWECILRFMVG-----SQQTEGISADAVRTLFNEGAENPVITKDGFQFLLLE 133
L+ ++ + E IL FM+ + + I +DA F + G +FLLL
Sbjct: 153 ILDDFSNYQLESILSFMLKLRDNIDNEAKKILSDAKLMEFGGN-----LCPKGHRFLLLS 207
Query: 134 TPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLRE 193
Q+W + YL+ + DL L FL ++ Y ++ + L +
Sbjct: 208 PKEQIWRIVKCYLKFTK----DLHSSLRFLLKIGSMELSKGYPITSLTPTQKELLSPFKT 263
Query: 194 FGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIG 253
GLVY FYPTK LN + + Q G+++++TNF++ A+ S L AL+
Sbjct: 264 IGLVY---IDGDYFYPTKSILNFFGKSNIFQTE--GWMLIDTNFKITAFPKSPLHTALLK 318
Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-SCPLPPVVSDQI 312
F + Y FP ++ +S R AL G T IIGFL+ + + S +P V Q
Sbjct: 319 KFANVTYEFPGFASAFISPNSFREALNQGTTLDDIIGFLKSNLSHKIGSGQIPSAVMKQF 378
Query: 313 RLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDL-GVLTWQNEKKRT---VVVT 365
+W +RERLT + + Q+ + D A+ L G + EKK + +V+T
Sbjct: 379 YVWRDQRERLTVTHECIMRQYTNPNDANLAAQCAKQLAGYVYGPAEKKESHYWIVIT 435
>gi|429962184|gb|ELA41728.1| hypothetical protein VICG_01232 [Vittaforma corneae ATCC 50505]
Length = 449
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 77/283 (27%), Positives = 127/283 (44%), Gaps = 29/283 (10%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS 181
IT GF+FLLL Q+W+ I+ ++ L+ E L ++ Y S
Sbjct: 173 ITNKGFEFLLLARKDQLWFLIVNAIKYHSRDSLEESEMFISLAEILMKRRCGPYICSSFS 232
Query: 182 ------DKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIRE-------- 227
D + V ++ LV+ + LA + T I +
Sbjct: 233 SWHSFLDSIGVLFVISQDNNLVFYVNNSV-LYDKVPLADSAMTHHGTAGIDDIYDIDYTK 291
Query: 228 PG---------FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAA 278
P F+++ETNF++YAYT + +++ LF + +Y FPNL +S+ +A
Sbjct: 292 PNIPINKASNKFIVLETNFKIYAYTSTAYDKSVLSLFSKTVYVFPNLIKACFDEESLLSA 351
Query: 279 LRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQAD 338
GITA QII +L+ H S +P + +QI +WE + R+ G LY F+ +D
Sbjct: 352 FNKGITAKQIIKYLQEH-----SEEVPKNIVNQISIWEHRQHRIRARNGYLYHDFIHLSD 406
Query: 339 FEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
F ++ Y + G L +++E KR +V ++ H +K F QK
Sbjct: 407 FHRVLRYVESKGGLIYRDEVKRMIVGEERIHESVKNFIKEMQK 449
>gi|403221352|dbj|BAM39485.1| transcription factor IIH subunit [Theileria orientalis strain
Shintoku]
Length = 634
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/207 (32%), Positives = 107/207 (51%), Gaps = 27/207 (13%)
Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
+ E + L L S G +STE ++ ++ L E G+VY + A RFY L+
Sbjct: 338 VTESVELLLALSQSKCGDCFSTENLTKTQMRLVRLLNELGIVYYKN--ANRFYIYDLSYI 395
Query: 216 MATRG----------TLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
+ K +IV++NF+VY YT S L+++++ CEL R PNL
Sbjct: 396 IGKSNINVDLFKEFDVNKTAGNESRIIVQSNFKVYVYTASPLQISVLSHLCELQARTPNL 455
Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFL---RLHALPSVSC----------PLPPVVSDQI 312
VGVLTR+SV++A +SGIT+ +II FL +L + +C +P V Q+
Sbjct: 456 VVGVLTRESVQSAFKSGITSNEIIRFLSPMKLSLSYAGTCNSFMDSFANYKIPENVCRQL 515
Query: 313 RLWEGERERLTTSEGVLYSQFVSQADF 339
++WE ER+R+ S +L+ ++ ADF
Sbjct: 516 KMWESERDRIELSPAILFKRW--DADF 540
>gi|402465832|gb|EJW01474.1| hypothetical protein EDEG_03937 [Edhazardia aedis USNM 41457]
Length = 465
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 86/340 (25%), Positives = 153/340 (45%), Gaps = 35/340 (10%)
Query: 58 GGKPWSMSAKLEVDSKARDLDFLNQY--------ALERWECILRFMVGSQQT----EGIS 105
G + ++MS +++ + + N Y + +++E IL +V Q+T E +
Sbjct: 138 GTENYNMSPVKSIENIKKTVTIFNDYESKNTSNKSFQKYEEILHLIVDQQETPKNEEILK 197
Query: 106 ADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFIL---KYLET---VESKGLDLVEC 159
+ N+ E IT GF FLL Q+W ++ KYL++ +E K L+
Sbjct: 198 LLTFSRILNKNLE---ITNTGFDFLLKTRKEQLWILLITTIKYLKSTMKIEEKDFILL-- 252
Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQR-KRKAGRFYPTKLALNMAT 218
+F+L Y +Q + ++ + GL+ K + L +N+
Sbjct: 253 ---IFELCEKRPYKSYKIINCPSSIQTCINYMSKLGLLENSIKNNTVILSISPLFINLFE 309
Query: 219 RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAA 278
+ F+ +ETNF++YAYT S +++ LF ++ + PNL ++ DSV A
Sbjct: 310 ENL---VLSESFMYIETNFKLYAYTTSKYDFSILSLFSKISCKLPNLISAIINEDSVNTA 366
Query: 279 LRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQAD 338
I+A QI +L+ S +P V +Q+ +WE +R R+ TSE L+ F++ D
Sbjct: 367 FDKKISAKQISYYLK-----SKGKNVPKNVVEQVYIWESKRNRIKTSECTLFKGFLNLMD 421
Query: 339 FEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVN 378
F+K D ++ L E KR + V + K++ N
Sbjct: 422 FKKAVDVCKEKHWLVDVYEDKRFIFVNNLYAEEFKEYIKN 461
>gi|84998132|ref|XP_953787.1| TFIIH subunit, Tf2b [Theileria annulata]
gi|65304784|emb|CAI73109.1| TFIIH subunit, Tf2b homologue, putative [Theileria annulata]
Length = 645
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 110/213 (51%), Gaps = 27/213 (12%)
Query: 146 LETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG 205
LE E+ + E + L L ++ G +STEG++ L+ L E G+VY + K
Sbjct: 356 LELKEASMKTVTESVELLLSLSQASCGDCFSTEGLTKTQIRLLRLLNELGIVYYKNPK-- 413
Query: 206 RFYPTKLAL---NMATRGTLKQIREPGF---------LIVETNFRVYAYTDSNLKVALIG 253
+FY L+ T L ++ +IV++NF+VY YT S L+++++
Sbjct: 414 KFYLYDLSYIVGKTNTNSVLPNSKDLDISIKAGNDSRIIVQSNFKVYVYTASPLQISVLS 473
Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL-------------PSV 300
CEL R PNL VGVLTR+SV++A +SGIT+ +II FL + + S
Sbjct: 474 HLCELQARTPNLVVGVLTRESVQSAFKSGITSKEIIRFLSPNGMNSSIGSQENTLLNSSF 533
Query: 301 SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
+ +P V Q+++WE ER+R+ +++ ++
Sbjct: 534 TYSIPENVCRQLKMWESERDRIELCPSIVFKRW 566
>gi|359497087|ref|XP_002267711.2| PREDICTED: general transcription factor IIH subunit 4-like [Vitis
vinifera]
Length = 238
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 73/238 (30%), Positives = 116/238 (48%), Gaps = 44/238 (18%)
Query: 142 ILKYLET---VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV- 197
IL+YL+ V+ +G+D + ++FL +L F G Y+ +++ + ++ L + GLV
Sbjct: 36 ILRYLDLLNLVQERGVDPADLISFLLELSFHVTGEAYNINTLTEFQRNTIKDLVDLGLVK 95
Query: 198 YQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCE 257
Q+ RK F PTKLA N++ + R+ GF++VETNFR+YAY+ S L CE
Sbjct: 96 LQQGRKESWFIPTKLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSSKLH-------CE 148
Query: 258 LMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEG 317
+ LRL + SVS P IRLWE
Sbjct: 149 I---------------------------------LRLFSRSSVSTVATPCHGYLIRLWET 175
Query: 318 ERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ R+ T LY +F S+ FE D+A++ G L W++ KK +VV + H ++++
Sbjct: 176 DLNRVETMPSHLYDEFPSRDVFEAACDFAREYGGLLWEDSKKMRLVVKAEIHLHMREY 233
>gi|296084734|emb|CBI25875.3| unnamed protein product [Vitis vinifera]
Length = 258
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 72/252 (28%), Positives = 125/252 (49%), Gaps = 28/252 (11%)
Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPT 210
+G+D + ++FL +L F G Y+ +++ + ++ L + GLV Q+ RK F PT
Sbjct: 2 RGVDPADLISFLLELSFHVTGEAYNINTLTEFQRNTIKDLVDLGLVKLQQGRKESWFIPT 61
Query: 211 KLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLF---------CELMYR 261
KLA N++ + R+ GF++VETNFR+YAY+ S L ++ LF C L
Sbjct: 62 KLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSSKLHCEILRLFSRYASMLFDCLLCIC 121
Query: 262 FPN-----LTVGVLTRDSVRAALRSGI----------TAAQIIGFLRLHALPSVSCPLPP 306
+ + L V ++ + +R I A G+L + + ++ +P
Sbjct: 122 YSSQKGTLLDVSLIMYECLRTFSLGKIIFWFFSSVSTVATPCHGYLVIFTISCLAEKIPC 181
Query: 307 VV---SDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVV 363
+ QIRLWE + R+ T LY +F S+ FE D+A++ G L W++ KK +V
Sbjct: 182 LQFTNMSQIRLWETDLNRVETMPSHLYDEFPSRDVFEAACDFAREYGGLLWEDSKKMRLV 241
Query: 364 VTKQGHSDIKKF 375
V + H ++++
Sbjct: 242 VKAEIHLHMREY 253
>gi|71033737|ref|XP_766510.1| hypothetical protein [Theileria parva strain Muguga]
gi|68353467|gb|EAN34227.1| hypothetical protein, conserved [Theileria parva]
Length = 359
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 65/216 (30%), Positives = 109/216 (50%), Gaps = 33/216 (15%)
Query: 146 LETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG 205
LE E+ + E + L L ++ G +STEG++ L+ L E G+VY + K
Sbjct: 69 LELKEASFKTVTESVELLLSLSQASCGDCFSTEGLTKTQLRLLRLLNELGIVYYKNPKKF 128
Query: 206 RFY---------------PTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVA 250
Y PT L+++ + +IV++NF+VY YT S L+++
Sbjct: 129 YLYDLSYIVGKTNTNTGLPTSKDLDLSIKAG-----NDSRIIVQSNFKVYVYTASPLQIS 183
Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS----------- 299
++ CEL R PNL VGVLTR+SV++A +SGIT+ +II FL + + S
Sbjct: 184 VLSHLCELQARTPNLVVGVLTRESVQSAFKSGITSKEIIRFLSPNGMNSSIGNQENTLLN 243
Query: 300 --VSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
+ +P V Q+++WE ER+R+ +++ ++
Sbjct: 244 STFTYSIPENVCRQLKMWESERDRIELCPSIVFKRW 279
>gi|387592902|gb|EIJ87926.1| hypothetical protein NEQG_01998 [Nematocida parisii ERTm3]
Length = 398
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 79/258 (30%), Positives = 126/258 (48%), Gaps = 24/258 (9%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYL----ETVESKGLDLVECLTFLFQLKFSTFGTDYST 177
+T GF FLL Q+W +L ++ ET E++ L L E L + T+ D S
Sbjct: 155 LTHKGFNFLLTGKKRQLWTLLLAHIQEDPETAENEILVLCE---LLVKDPKRTYAVDRSQ 211
Query: 178 EGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNF 237
+ S L +F GL++ K +F PT L G K FL++E+NF
Sbjct: 212 K--SKLLGLF----ESLGLIFFEK-GLVKFSPTFSLLFDDEEGAEK------FLVLESNF 258
Query: 238 RVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
R+Y Y++ L +I LF FPN+ V ++ DS+R AL GITA QI +L +++
Sbjct: 259 RLYIYSNRPLDTFIISLFSIKSREFPNMMVAMINEDSIRQALMHGITAGQIRVYLNQNSM 318
Query: 298 PSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNE 357
++ V +QIRLWE R+ + E ++S F++ DF + Y ++ +
Sbjct: 319 YEIN----ENVIEQIRLWEKRMNRIHSWESYIFSNFLNYKDFLLVESYCENNNIDHRSYR 374
Query: 358 KKRTVVVTKQGHSDIKKF 375
+KR +VV + + +K F
Sbjct: 375 EKRMLVVGIENYESVKSF 392
>gi|387595518|gb|EIJ93142.1| hypothetical protein NEPG_02098 [Nematocida parisii ERTm1]
Length = 398
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 79/258 (30%), Positives = 126/258 (48%), Gaps = 24/258 (9%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYL----ETVESKGLDLVECLTFLFQLKFSTFGTDYST 177
+T GF FLL Q+W +L ++ ET E++ L L E L + T+ D S
Sbjct: 155 LTHKGFNFLLTGKKRQLWTLLLAHIQEDPETAENEILVLCE---LLVKDPKRTYAVDRSQ 211
Query: 178 EGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNF 237
+ S L +F GL++ K +F PT L G K FL++E+NF
Sbjct: 212 K--SKLLGLF----ESLGLIFFEK-GLVKFSPTFSLLFDDEEGAEK------FLVLESNF 258
Query: 238 RVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
R+Y Y++ L +I LF FPN+ V ++ DS+R AL GITA QI +L +++
Sbjct: 259 RLYIYSNRPLDTFIISLFSIKSREFPNMIVAMINEDSIRQALMHGITAGQIRVYLNQNSM 318
Query: 298 PSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNE 357
++ V +QIRLWE R+ + E ++S F++ DF + Y ++ +
Sbjct: 319 YEIN----ENVIEQIRLWEKRMNRIHSWESYIFSNFLNYKDFLLVESYCENNNIDHRSYR 374
Query: 358 KKRTVVVTKQGHSDIKKF 375
+KR +VV + + +K F
Sbjct: 375 EKRMLVVGIENYESVKSF 392
>gi|124806778|ref|XP_001350829.1| transcription factor Tfb2, putative [Plasmodium falciparum 3D7]
gi|23496958|gb|AAN36509.1| transcription factor Tfb2, putative [Plasmodium falciparum 3D7]
Length = 945
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 60/156 (38%), Positives = 87/156 (55%), Gaps = 11/156 (7%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
LIV++NF+VY YT S LK+ ++ CEL R PN+ VG+LTR SV A S ITA QII
Sbjct: 766 LIVQSNFKVYLYTSSLLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 825
Query: 291 FLRLHALP---SVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQ 347
FL ++ P + +P V Q++LWE ER RLT + +++ F + DF Y Q
Sbjct: 826 FLESYSHPGKNNFKSSIPMNVITQLKLWESERHRLTLEDAIVFKSF--EKDFMP-HLYQQ 882
Query: 348 DLGVLTWQNEKKRTVVVTKQGHSDIKKF--WVNHQK 381
++ W N K + T ++ K+F W+ +K
Sbjct: 883 ---IVIWANSKNYLLYYTPWPKNNTKEFDLWIKAEK 915
>gi|389586531|dbj|GAB69260.1| hypothetical protein PCYB_146880 [Plasmodium cynomolgi strain B]
Length = 913
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 50/106 (47%), Positives = 67/106 (63%), Gaps = 3/106 (2%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
LIV++NF+VY YT+S LK+ ++ CEL R PN+ VG+LTR SV A S ITA QII
Sbjct: 733 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 792
Query: 291 FLRLHALPSVS---CPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
FL +A P S +P V Q++LWE ER RLT + +++ F
Sbjct: 793 FLESYAHPGRSNFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 838
>gi|221061889|ref|XP_002262514.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
gi|193811664|emb|CAQ42392.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
Length = 943
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 50/106 (47%), Positives = 67/106 (63%), Gaps = 3/106 (2%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
LIV++NF+VY YT+S LK+ ++ CEL R PN+ VG+LTR SV A S ITA QII
Sbjct: 763 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 822
Query: 291 FLRLHALPSVS---CPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
FL +A P S +P V Q++LWE ER RLT + +++ F
Sbjct: 823 FLESYAHPGRSNFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 868
>gi|412992644|emb|CCO18624.1| predicted protein [Bathycoccus prasinos]
Length = 464
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 77/276 (27%), Positives = 117/276 (42%), Gaps = 62/276 (22%)
Query: 76 DLDFLNQYALERWECILRFMVGSQQTE---------------GISADA---VRTLFNEG- 116
D FL+ YA RWE +L + E G A +R LF
Sbjct: 186 DAGFLDHYAKSRWETVLLELTSEASVEKKRKKKKMMKKKGDGGAGAQHALHLRKLFYAAR 245
Query: 117 --AENPVITKDGFQFLLLETPAQVWYFILKYL---------------------ETVESKG 153
++ IT +GF FLL T Q+W + +Y + +E+ G
Sbjct: 246 LITKDGRITSEGFSFLLGATSEQIWILLARYARGGDFEKKMKNKTTNEENGEGDKMETNG 305
Query: 154 LD------------LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK 201
D + FL +L F G YS +S+ + HL G++Y+ +
Sbjct: 306 EDEKQQHKLKSDESSAAAMAFLVRLSFQHPGRKYSKANLSEAERRVASHLSALGVLYENE 365
Query: 202 RKAGR---FYPTKLALNMATRGTLKQIREP-----GFLIVETNFRVYAYTDSNLKVALIG 253
+ PT L+ +++ T + G +IVETNFRVYAYT S L+ ++
Sbjct: 366 DDENDNNWYVPTVLSAGLSSVSTTSSAKSALARIDGHIIVETNFRVYAYTHSELETEVLR 425
Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQII 289
LF Y+ PN VG++TRDS+ A+R+GI+ QI+
Sbjct: 426 LFTRPDYKLPNAYVGMITRDSILDAMRAGISPDQIV 461
>gi|156095867|ref|XP_001613968.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148802842|gb|EDL44241.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 960
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/106 (47%), Positives = 67/106 (63%), Gaps = 3/106 (2%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
LIV++NF+VY YT+S LK+ ++ CEL R PN+ VG+LTR SV A S ITA QII
Sbjct: 780 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 839
Query: 291 FLRLHALPSVS---CPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
FL +A P S +P V Q++LWE ER RLT + +++ F
Sbjct: 840 FLESYAHPGRSTFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 885
>gi|241725008|ref|XP_002412209.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
gi|215505416|gb|EEC14910.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
Length = 75
Score = 99.8 bits (247), Expect = 2e-18, Method: Composition-based stats.
Identities = 42/72 (58%), Positives = 55/72 (76%)
Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
DQ+RLWE ER+R EGVLYSQF+SQ+DF+ LR+YA DLGVL W N KR +VV + GH
Sbjct: 2 DQLRLWELERDRFNFREGVLYSQFISQSDFQLLRNYASDLGVLIWDNPSKRVMVVNRNGH 61
Query: 370 SDIKKFWVNHQK 381
++K+FW H++
Sbjct: 62 DEVKRFWKRHRQ 73
>gi|378754852|gb|EHY64880.1| hypothetical protein NERG_01936 [Nematocida sp. 1 ERTm2]
Length = 396
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 130/272 (47%), Gaps = 27/272 (9%)
Query: 110 RTLFNEGAENPVITKDG-----FQFLLLETPAQVWYFILKYL-ETVESKGLDLVECLTFL 163
R + N G +++KDG F FLL Q+W +L ++ E + ++ +++ FL
Sbjct: 140 RLMLNTG----IVSKDGLTHKGFNFLLTGRKNQMWTLVLAHIQEDLATRQEEVLVMCEFL 195
Query: 164 FQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLK 223
+ + D S S L +F GL+ +R RF T L G K
Sbjct: 196 VKDPKRMYAIDVSYR--SKMLDLF----ESLGLI-TFERGLVRFSSTFSLLFDDEEGGEK 248
Query: 224 QIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGI 283
FL +E+NFR+Y Y++ L V +I LF + FPN+ V ++ DS+R AL GI
Sbjct: 249 ------FLTLESNFRLYIYSNRPLDVFIISLFSIKIREFPNMIVAMINEDSIRQALTYGI 302
Query: 284 TAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLR 343
TA QI +L +++ ++ V +QIRLWE R+ E ++S F++ DF +
Sbjct: 303 TAGQIRVYLNQNSMHKIN----ENVLEQIRLWEKRMNRIHAWESYIFSNFLNYKDFLLVE 358
Query: 344 DYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
Y ++ V KR +VV + + ++K F
Sbjct: 359 SYCENNNVEHRSYRDKRVLVVGVENYDNVKSF 390
>gi|428673252|gb|EKX74165.1| conserved hypothetical protein [Babesia equi]
Length = 668
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 70/223 (31%), Positives = 111/223 (49%), Gaps = 28/223 (12%)
Query: 172 GTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY--------------PTKLALNMA 217
G +ST+ ++ L+ L E G+VY + K G FY P+ L++
Sbjct: 422 GDSFSTKSLTKSQFRILRLLYELGIVYYKSIK-GPFYVLDLSFIVGPKNLVPSNSPLSVH 480
Query: 218 TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
T + P +IV++NF+VY YT +NL+ ++ + CE+ R PN+ VGVLTR+S +
Sbjct: 481 TSISFTSEYLPSKIIVQSNFKVYVYTVNNLQFDILNILCEVQARTPNMVVGVLTRESAQR 540
Query: 278 ALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQA 337
A +SGIT+ +II F + + P V Q+R+WE ER R+ S +L + +
Sbjct: 541 AFKSGITSHEIIRFFS----STNTSTFPENVIRQLRMWEAERNRVELSPAIL----IKRW 592
Query: 338 DFEKLRDYAQDLGVLTWQNEKKRTVVVTK---QGHSDIKKFWV 377
D E L D Q + W K+ + TK HS+ + W+
Sbjct: 593 DREFLPDLFQ--RTVRWAQSKRYELFHTKWPQDPHSEEYQEWL 633
>gi|221482097|gb|EEE20458.1| tfiih, polypeptide, putative [Toxoplasma gondii GT1]
Length = 861
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 74/114 (64%), Gaps = 8/114 (7%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
++V++NF+VY YT S L+++++ CEL R PNL VG+LTR SV AA +SGITA QII
Sbjct: 675 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 734
Query: 291 FLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQ 336
FL HA P V + P LP V+ Q+R+WE ER RL+ VL ++ +Q
Sbjct: 735 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPAVLLKKWDAQ 788
>gi|401410644|ref|XP_003884770.1| hypothetical protein NCLIV_051680 [Neospora caninum Liverpool]
gi|325119188|emb|CBZ54742.1| hypothetical protein NCLIV_051680 [Neospora caninum Liverpool]
Length = 868
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 74/114 (64%), Gaps = 8/114 (7%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
++V++NF+VY YT S L+++++ CEL R PNL VG+LTR SV AA +SGITA QII
Sbjct: 682 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 741
Query: 291 FLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQ 336
FL HA P V + P LP V+ Q+R+WE ER RL+ VL ++ +Q
Sbjct: 742 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPAVLLKKWDAQ 795
>gi|429854513|gb|ELA29524.1| tfiih and nucleotide excision repair factor 3 complexes subunit
[Colletotrichum gloeosporioides Nara gc5]
Length = 509
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 85/155 (54%), Gaps = 33/155 (21%)
Query: 229 GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
G +IVETN+RVYAY + L++A++ LFC+L RF ++ G LTR+S+R A+ GITA QI
Sbjct: 371 GSIIVETNYRVYAYGQTPLQIAVLSLFCKLKLRFADMVSGRLTRNSIRNAVERGITADQI 430
Query: 289 IGFLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFE 340
I +L HA + + P LPP V DQIRL ++E
Sbjct: 431 ISYLASHAHEQMHRFAAVRNKPVLPPTVIDQIRL-----------------------EYE 467
Query: 341 KLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ YA ++GVLTW+N+K +K H I+ +
Sbjct: 468 DIAGYAAEIGVLTWRNDKLGMFFASK--HEQIRDY 500
>gi|221502519|gb|EEE28246.1| tfiih, polypeptide, putative [Toxoplasma gondii VEG]
Length = 836
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 74/114 (64%), Gaps = 8/114 (7%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
++V++NF+VY YT S L+++++ CEL R PNL VG+LTR SV AA +SGITA QII
Sbjct: 650 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 709
Query: 291 FLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQ 336
FL HA P V + P LP V+ Q+R+WE ER RL+ VL ++ +Q
Sbjct: 710 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPAVLLKKWDAQ 763
>gi|237842971|ref|XP_002370783.1| hypothetical protein TGME49_014520 [Toxoplasma gondii ME49]
gi|211968447|gb|EEB03643.1| hypothetical protein TGME49_014520 [Toxoplasma gondii ME49]
Length = 836
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 74/114 (64%), Gaps = 8/114 (7%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
++V++NF+VY YT S L+++++ CEL R PNL VG+LTR SV AA +SGITA QII
Sbjct: 650 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 709
Query: 291 FLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQ 336
FL HA P V + P LP V+ Q+R+WE ER RL+ VL ++ +Q
Sbjct: 710 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPAVLLKKWDAQ 763
>gi|149031821|gb|EDL86756.1| rCG41767 [Rattus norvegicus]
Length = 187
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 55/138 (39%), Positives = 81/138 (58%), Gaps = 23/138 (16%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ L
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107
Query: 48 --------RRNAKVALLGGGKPWS-MSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
R+N ++ALLGGGK WS +++L D ARD+ L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPVFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167
Query: 99 QQTEGISADAVRTLFNEG 116
+ +S D + L G
Sbjct: 168 -PSAAVSQDLAQLLSQAG 184
>gi|82704497|ref|XP_726580.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23482045|gb|EAA18145.1| Drosophila melanogaster LD30622p-related [Plasmodium yoelii yoelii]
Length = 926
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/106 (44%), Positives = 66/106 (62%), Gaps = 3/106 (2%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
LI+++NF+VY YT S LK+ ++ CEL R PN+ VG+LTR SV A S ITA QII
Sbjct: 746 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 805
Query: 291 FLRLHALP---SVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
FL ++ P + +P V Q++LWE ER RLT + +++ F
Sbjct: 806 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLTLEDSIVFKNF 851
>gi|70950974|ref|XP_744765.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56524852|emb|CAH80923.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 922
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/106 (44%), Positives = 66/106 (62%), Gaps = 3/106 (2%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
LI+++NF+VY YT S LK+ ++ CEL R PN+ VG+LTR SV A S ITA QII
Sbjct: 742 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 801
Query: 291 FLRLHALP---SVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
FL ++ P + +P V Q++LWE ER RLT + +++ F
Sbjct: 802 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLTLEDSIVFKNF 847
>gi|68070707|ref|XP_677265.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56497310|emb|CAH94094.1| conserved hypothetical protein [Plasmodium berghei]
Length = 929
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 65/106 (61%), Gaps = 3/106 (2%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
LI+++NF+VY YT S LK+ ++ CEL R PN+ VG+LTR SV A S ITA QII
Sbjct: 749 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 808
Query: 291 FLRLHALP---SVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
FL ++ P + +P V Q++LWE ER RL + +++ F
Sbjct: 809 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLILEDSIVFKNF 854
>gi|296084771|emb|CBI25914.3| unnamed protein product [Vitis vinifera]
Length = 225
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 97/200 (48%), Gaps = 29/200 (14%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK Y++++L+++ PV + W+ +H+ A D L +LR
Sbjct: 9 LPPLAKKYILQMLYIDVPVTAKAMEEWVLADGFSKHRVAIDRLIQLRVFTETSDRKKETS 68
Query: 49 --------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
N + L+ GG + + LD L YAL +WEC L ++ S Q
Sbjct: 69 YRLNPTFQTNLQKHLIYGGVLPREPMPSNITVRLPSLDDLEAYALGQWECFLLQLISSTQ 128
Query: 101 TEGIS--ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
TE ++ + ++ +F G E P +T+ GFQFLL++T AQ+WY + +Y+ E +
Sbjct: 129 TEKLTNFSSSMMKVFQRGLLTQREKEAPRLTESGFQFLLMDTNAQLWYIMREYISNSEER 188
Query: 153 GLDLVECLTFLFQLKFSTFG 172
G+D + ++FL +L F G
Sbjct: 189 GVDPADLISFLLELSFHVTG 208
>gi|359497414|ref|XP_003635508.1| PREDICTED: general transcription factor IIH subunit 4-like, partial
[Vitis vinifera]
Length = 209
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 97/200 (48%), Gaps = 29/200 (14%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK Y++++L+++ PV + W+ +H+ A D L +LR
Sbjct: 9 LPPLAKKYILQMLYIDVPVTAKAMEEWVLADGFSKHRVAIDRLIQLRVFTETSDRKKETS 68
Query: 49 --------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
N + L+ GG + + LD L YAL +WEC L ++ S Q
Sbjct: 69 YRLNPTFQTNLQKHLIYGGVLPREPMPSNITVRLPSLDDLEAYALGQWECFLLQLISSTQ 128
Query: 101 TEGIS--ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
TE ++ + ++ +F G E P +T+ GFQFLL++T AQ+WY + +Y+ E +
Sbjct: 129 TEKLTNFSSSMMKVFQRGLLTQREKEAPRLTESGFQFLLMDTNAQLWYIMREYISNSEER 188
Query: 153 GLDLVECLTFLFQLKFSTFG 172
G+D + ++FL +L F G
Sbjct: 189 GVDPADLISFLLELSFHVTG 208
>gi|399219100|emb|CCF75987.1| unnamed protein product [Babesia microti strain RI]
Length = 612
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/106 (44%), Positives = 64/106 (60%), Gaps = 7/106 (6%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
+IV++NF+VY YT S L++ ++ CEL R PNL +GVLTR S AA R+GITA QI
Sbjct: 432 IIVQSNFKVYVYTASPLQINVLAHLCELQSRTPNLVIGVLTRASAHAAFRAGITAKQICQ 491
Query: 291 FLRLHALPSV-------SCPLPPVVSDQIRLWEGERERLTTSEGVL 329
FL H+ P + LP V Q+ +WE ER R++ + VL
Sbjct: 492 FLETHSHPILLQNVREGGSYLPNNVVTQLNMWEAERNRISLEQCVL 537
>gi|258577869|ref|XP_002543116.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237903382|gb|EEP77783.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 311
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 75/252 (29%), Positives = 118/252 (46%), Gaps = 35/252 (13%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
L +AK +V+ LL+++ P+P A + W+ +E A L+ L
Sbjct: 38 LPHLAKCFVMALLYLKDPLPAAELELWVKTESKRERDNALSILSRLHILSSTTTANHVRA 97
Query: 49 --------RNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
+ + AL GG K S D + + L+ YA +WE +L +MVG+
Sbjct: 98 FMVTNPFSSSLRQALTGGDKQQSFGVISTTPDPQPMTIADLDDYARRQWEGVLGYMVGTN 157
Query: 100 ----QTEGIS-ADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
Q E ++ + V++L V ITK+GF F+L + QVW+ ++ Y+E
Sbjct: 158 TLGIQRESVTLSKGVKSLLQACHLVEVRDRRVEITKEGFAFVLQDVNTQVWHILVLYVEN 217
Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGR 206
E+ G+D VE L+FLF L G Y + +S L L +FG+VYQ +A R
Sbjct: 218 AEAIGMDSVEVLSFLFLLSSLELGQSYEKQHLSSTQLRTLADLTDFGIVYQHSPASEATR 277
Query: 207 FYPTKLALNMAT 218
FYPT+LA + +
Sbjct: 278 FYPTRLATTLTS 289
>gi|47194594|emb|CAF93785.1| unnamed protein product [Tetraodon nigroviridis]
Length = 123
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/96 (48%), Positives = 57/96 (59%), Gaps = 23/96 (23%)
Query: 304 LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYA----------------- 346
LPP ++DQIRLWE ER+RL +EGVLY+QF+SQ DFE LRD A
Sbjct: 24 LPPTITDQIRLWELERDRLQFTEGVLYNQFLSQTDFEVLRDRAQVGLFHCCYTIRQKVHQ 83
Query: 347 ------QDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
Q LG L WQ+ R +VVT GHS++KKFW
Sbjct: 84 QLSPVLQSLGCLLWQDAAHRVMVVTLWGHSEVKKFW 119
>gi|380476902|emb|CCF44454.1| RNA polymerase II transcription factor B subunit 2 [Colletotrichum
higginsianum]
Length = 126
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 70/119 (58%), Gaps = 10/119 (8%)
Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------SCP-LPPVVSDQIRLWE 316
+ G LTR+S+R A+ GITA QII +L HA + S P LPP V DQIRLW+
Sbjct: 1 MVSGRLTRNSIRNAVERGITADQIISYLAAHAHEQMHRMAAVRSRPVLPPTVVDQIRLWQ 60
Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
E ER+TT+ G L+ F S ++E + YA ++GVL W+N+K +K H I+ +
Sbjct: 61 LETERMTTTSGFLFRDFDSPKEYEVIAGYASEIGVLVWRNDKLGMFFASK--HEQIRDY 117
>gi|337743331|gb|AEI73162.1| GTF2H4 [Kryptolebias marmoratus]
Length = 82
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/81 (54%), Positives = 55/81 (67%), Gaps = 11/81 (13%)
Query: 174 DYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR------- 226
DYS EGMS+ L FLQHLREFGLV+QRKRK+ R+YPT+LA+ +A T
Sbjct: 2 DYSVEGMSESLLTFLQHLREFGLVFQRKRKSRRYYPTRLAITLAAGVTSNSSSSSNLSST 61
Query: 227 ----EPGFLIVETNFRVYAYT 243
+ GF++VETN+RVYAYT
Sbjct: 62 QGAGDAGFIVVETNYRVYAYT 82
>gi|60603004|gb|AAX27959.1| unknown [Schistosoma japonicum]
Length = 100
Score = 86.3 bits (212), Expect = 2e-14, Method: Composition-based stats.
Identities = 41/72 (56%), Positives = 54/72 (75%)
Query: 229 GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
G++++ETNFR+YAYTDS L+ AL+ LF ++ RFPNL V +TRDSVR AL GITA QI
Sbjct: 14 GYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGITANQI 73
Query: 289 IGFLRLHALPSV 300
+ FL +A P +
Sbjct: 74 LSFLTSNAHPDM 85
>gi|399949686|gb|AFP65344.1| tfIIB-like protein [Chroomonas mesostigmatica CCMP1168]
Length = 464
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 65/100 (65%), Gaps = 3/100 (3%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
+I+E+NFR+Y Y +++L L+ +F E++Y+ PNL VG +T S+ A SG T+ IIG
Sbjct: 329 IIIESNFRIYVYKNAHLGNDLLLVFSEILYQLPNLFVGEITEKSILKAFNSGSTSRNIIG 388
Query: 291 FLRLHALPSVSCP-LPPVVSDQIRLWEGERERLTTSEGVL 329
FL+ + P CP +P V++QI+ WE ++ + +E ++
Sbjct: 389 FLKNNLHP--ICPWIPSAVTNQIKAWEFQKTEIKITESLM 426
>gi|156086630|ref|XP_001610724.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797977|gb|EDO07156.1| conserved hypothetical protein [Babesia bovis]
Length = 586
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/184 (28%), Positives = 89/184 (48%), Gaps = 27/184 (14%)
Query: 171 FGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA---------------LN 215
+G + + + Q L+ L E GLVY + Y T L+ ++
Sbjct: 327 YGDTFRLDNPTLSQQRILRFLIELGLVYA-DSDGSKIYVTNLSFLVDVKHANLVEEHQIS 385
Query: 216 MATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSV 275
M+ G + ++V++NF++YAY S L++ ++ CEL + PNL +GVLTR S+
Sbjct: 386 MSVCGN-----KGSKMVVQSNFKIYAYIPSALQMNVLNHICELQAKTPNLVIGVLTRSSL 440
Query: 276 RAALRSGITAAQIIGFLRLHA------LPSVSCPLPPVVSDQIRLWEGERERLTTSEGVL 329
+ A +SGITA Q+I F + +P V Q+++WE ER RL V+
Sbjct: 441 QTAFKSGITADQLICFFESKGQYDDIQINRNVMNVPENVRRQLKMWEAERNRLELLNAVV 500
Query: 330 YSQF 333
+ ++
Sbjct: 501 FKRW 504
>gi|160331881|ref|XP_001712647.1| hypothetical protein HAN_3g523 [Hemiselmis andersenii]
gi|159766096|gb|ABW98322.1| hypothetical protein HAN_3g523 [Hemiselmis andersenii]
Length = 432
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 62/107 (57%), Gaps = 1/107 (0%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
+I+E+N+R+Y Y ++ L +F E++Y PN VG +T S+ AL+SGIT I+G
Sbjct: 300 IIIESNYRIYVYQKNSNNNQLFLIFSEILYILPNFFVGEITETSISRALKSGITIQNILG 359
Query: 291 FLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQA 337
F+R L V +P + +Q RLWE +++++ ++ V+Q
Sbjct: 360 FIR-ENLHCVCRSIPSTILNQFRLWEFQKKKIKIENCFVFFDMVTQG 405
>gi|443919931|gb|ELU39967.1| Tfb2 domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 302
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 74/155 (47%), Gaps = 13/155 (8%)
Query: 72 SKARDLDFLNQYALERWECILRFMVGSQQTEGISADAVRTLF--------NEGAENPVIT 123
S ++ L++ ALE WE IL +MV + + L G IT
Sbjct: 133 SSQMSIEQLDRKALEHWETILHYMVDTNSSAQRPGGGALHLLVAGGWLEEGRGGGGHEIT 192
Query: 124 KDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDK 183
GFQFLL AQ+W +L+YL +++ +D+ E L+FLF L G +YS + +S
Sbjct: 193 STGFQFLLQSPRAQLWDILLQYLHMSDARRMDIAEVLSFLFMLSLMKLGQEYSCDNLSPT 252
Query: 184 LQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT 218
+ L+++G+ + FYPT+LA + +
Sbjct: 253 QNAMMTDLKDYGINANKS-----FYPTRLATTLTS 282
>gi|167395228|ref|XP_001741280.1| TFIIH basal transcription factor complex p52 subunit [Entamoeba
dispar SAW760]
gi|165894177|gb|EDR22239.1| TFIIH basal transcription factor complex p52 subunit, putative
[Entamoeba dispar SAW760]
Length = 489
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 83/348 (23%), Positives = 148/348 (42%), Gaps = 37/348 (10%)
Query: 45 TELRRNAKVALLGGGKPWSMSAKL-EVDSKARDL---DFLNQYALERWECILRFMVGSQQ 100
+ + N K+ L+G +P + K+ EV+ K + L + ++ +E + ++
Sbjct: 101 NDFQNNFKMNLIGSMEP---AYKIKEVNEKVSQIIKEKCLKENCIKTFEYFMNKLLQFNS 157
Query: 101 TEGISADAVRTLFNE----GAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
IS + +R +F + E IT+ G+QFL ET Q+W +L + ++ + +
Sbjct: 158 QPNISNNPLR-IFKDLELVKEETRQITRKGYQFLFQETKTQLWVIMLSIIGLIQRR---I 213
Query: 157 VECLTFLFQLKFSTFGTDYSTEGMS-----DKLQVFLQHLREFGL-VYQRKRKAGRFYPT 210
+ +F++ + Y+ + D LQ+F + G+ VY +++ P
Sbjct: 214 SPFINDVFEMTYLKENIIYNCDRFKKVYGPDPLQLF----NDLGIIVYYKEQNVMAITPL 269
Query: 211 KLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
L +++P I E N+ VY YT+S +V L LF ++ NL VG L
Sbjct: 270 MSLLRSNANIPSDLVKKPK-TITEINYSVYIYTESQFQVDLYRLFIRKNFQLANLWVGKL 328
Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGV-- 328
SV A GIT+ +I FL+ + LP + QI LW+ E R V
Sbjct: 329 NHQSVTEAFAKGITSEMLINFLQPN--------LPRTIQKQIDLWKKEINRFKEEHVVRY 380
Query: 329 -LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
Y + + F +++ + L + E R + V Q I+ F
Sbjct: 381 KFYDDAIGRQLFIIVKNESDRLKATILEKEDIRLIFVKYQYGETIRSF 428
>gi|67483069|ref|XP_656826.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56474050|gb|EAL51440.1| hypothetical protein EHI_192400 [Entamoeba histolytica HM-1:IMSS]
gi|449706224|gb|EMD46115.1| tfiih basal transcription factor complex p52 subunit, putative
[Entamoeba histolytica KU27]
Length = 484
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 83/345 (24%), Positives = 145/345 (42%), Gaps = 31/345 (8%)
Query: 45 TELRRNAKVALLGGGKP-WSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEG 103
+ + N K+ L G +P + + E SK L + ++ +E + ++
Sbjct: 101 NDFQSNFKMNLTGSMEPAYKIKEVNEKVSKIIQEKCLKENCIKTFEYFMNKLLQFNSQPN 160
Query: 104 ISADAVRTLFNE----GAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
IS + +R +F + E IT+ G+QFL ET Q+W +L + ++ + +
Sbjct: 161 ISNNPLR-IFKDLELVKEETRQITRKGYQFLFQETKTQLWVIMLSIIGLIQRR---ISPF 216
Query: 160 LTFLFQLKFSTFGTDYSTEGMS-----DKLQVFLQHLREFGL-VYQRKRKAGRFYPTKLA 213
+ +F++ + Y+ + D LQ+F + G+ VY +++ P
Sbjct: 217 INDVFEMTYLKEHIIYNCDHFKKVYGPDPLQLF----NDLGIIVYYKEQNVMAITPLMSL 272
Query: 214 LNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
L +++P I E N+ VY YT+S +V L LF ++ NL VG L
Sbjct: 273 LRSNANIPSDLVKKPK-TITEINYSVYIYTESQFQVDLYRLFIRKNFQLANLWVGKLNHQ 331
Query: 274 SVRAALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGV---LY 330
SV A GIT+ +I FL+ P+ LP + QI LW+ E R V Y
Sbjct: 332 SVTEAFAKGITSEMLINFLQ----PN----LPRTIQKQIDLWKKEINRFKEEHVVRYRFY 383
Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
+ + F +++ + L + E R + V Q I+ F
Sbjct: 384 DDAIGRQLFVIVKNESDRLKATILEKEDIRLIFVKYQYGETIRSF 428
>gi|66358470|ref|XP_626413.1| possible transcription factor TFIIH [Cryptosporidium parvum Iowa
II]
gi|46227864|gb|EAK88784.1| possible transcription factor TFIIH [Cryptosporidium parvum Iowa
II]
Length = 709
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 51/193 (26%), Positives = 92/193 (47%), Gaps = 27/193 (13%)
Query: 210 TKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
+ + +N ++ R +IV++NFR+Y YT S L+ ++ C++ R PN+ G+
Sbjct: 507 SNVKINEPNCLSMNSTRLEAGIIVQSNFRIYCYTASPLQAKILRHLCQVKVRGPNIICGI 566
Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV--------SCPLPPVVSDQIRLWEGERER 321
LTR + +A G++A QI+ F +A P + + +P V Q++LWE ++ R
Sbjct: 567 LTRKGLLSAYSMGVSAEQILRFFSSNAHPIILRRFMLEGTSIIPVNVETQLKLWEKDKNR 626
Query: 322 LTTSEGVLYSQF-VSQADFEKLRD---YAQDLGVLTWQN---------------EKKRTV 362
L S +S + S D + R YA+ +L + + +KK +
Sbjct: 627 LKISHASTFSDWGASPNDIQLFRQTILYARSKDILLYNSPIELTEKELNLNVELQKKIIL 686
Query: 363 VVTKQGHSDIKKF 375
V+ ++ DIK F
Sbjct: 687 VIKQEYEDDIKTF 699
>gi|207340600|gb|EDZ68899.1| YPL122Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 226
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 91/189 (48%), Gaps = 27/189 (14%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
L +AK +++ ++F E VP + W++ + + A ++
Sbjct: 39 LPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMKSLHLLIPNKSSGTLMI 98
Query: 46 ELRRNAKVAL---LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
L K++L L GG+ + S + V+ LD L++Y+ +WE IL FMVG+ +
Sbjct: 99 NLNPTFKISLRNALTGGEVQN-SFGVVVEENVVSLDLLDEYSANKWETILHFMVGTPLAK 157
Query: 103 GISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
S + L + V IT +GFQFLL E +Q+W +L+YL+ +E+ +D
Sbjct: 158 IPSEKVLNLLKHSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKMIETSKMD 217
Query: 156 LVECLTFLF 164
LV+ L F+F
Sbjct: 218 LVDVLHFIF 226
>gi|330038479|ref|XP_003239608.1| TATA binding protein of transcription factor IIB-like protein
[Cryptomonas paramecium]
gi|327206532|gb|AEA38710.1| TATA binding protein of transcription factor IIB-like protein
[Cryptomonas paramecium]
Length = 467
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 61/101 (60%), Gaps = 5/101 (4%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
+I+E+N+RVYAY ++ ++ FC+L+Y P L VG + + S+ A+ GI A II
Sbjct: 319 MIIESNYRVYAYNNNTFLNKILIQFCDLIYNLPGLFVGEINKTSIHKAIHKGINAKNIIS 378
Query: 291 FL--RLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVL 329
F+ H + SC P++ +QIR+WE ++ + +T + V+
Sbjct: 379 FILKNSHYIHQNSC--NPII-NQIRIWEFQQRKTSTHDCVM 416
>gi|269861689|ref|XP_002650538.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
gi|220065990|gb|EED43520.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
Length = 291
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 56/257 (21%), Positives = 108/257 (42%), Gaps = 41/257 (15%)
Query: 119 NPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTE 178
N IT G +FLL+ Q+WY I++ ++ S+
Sbjct: 69 NHEITTKGLEFLLMNKNEQIWYLIVE--------------------DIRLSS-------- 100
Query: 179 GMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPG------FLI 232
+ KL + Q L ++ K+ + N+ + K I F+
Sbjct: 101 HLHKKLYSYAQVLNGHKII--DKKILNFLVEINVIDNIVDQNLYKYILYKNNTEIQKFIY 158
Query: 233 VETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFL 292
+ETNF++Y+YT++ A++ LF +++ P + +L + L GI QI+ ++
Sbjct: 159 LETNFKLYSYTNNLCDRAILDLFSNIVFEVPGMIKSILDEKKIMNTLDRGIKIQQIVEYI 218
Query: 293 RLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
R H + + L I +W+ +R R+ G LYS F + ++ + + + L
Sbjct: 219 RRHTINNCDQILH-----MIEIWDKQRNRINKQIGYLYSDFTNYNEYRSVLEQIKTDTDL 273
Query: 353 TWQNEKKRTVVVTKQGH 369
++NE++R + V + H
Sbjct: 274 LYKNEEERILFVKNKLH 290
>gi|380492949|emb|CCF34234.1| RNA polymerase II transcription factor B subunit 2 [Colletotrichum
higginsianum]
Length = 343
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 66/262 (25%), Positives = 116/262 (44%), Gaps = 47/262 (17%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
+ +AK V+ +L++ +P+ + W+ ++ +A TL
Sbjct: 41 IPPLAKTIVMAILYMPKPMLLEDLDVWVKPESRRQKDQAISTLRSLHILQITVPSKERPQ 100
Query: 45 -----TELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
T + + ++AL GGG P S+ E+ + FL++YA +WE IL
Sbjct: 101 EMQLTTNFKSSLRLALEGGGTHNSFGVPSSLPVPPEIT-----VPFLDRYARRKWEDILH 155
Query: 94 FMVGSQQTEGI----SADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFIL 143
++V + G +V+ L G IT+ GF FLL E AQVW +L
Sbjct: 156 YIVNTVNPGGADLGGPKSSVKNLLVAGQLVRRQGSAVGITQAGFTFLLQEANAQVWTLLL 215
Query: 144 KYLETVE----SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQ 199
+LE + + G++ + L+FLF L G Y T +++ + L L +FGL+Y
Sbjct: 216 LWLEATDHAEDAAGMESTDMLSFLFLLASLELGRAYDTNALTEARRNMLPSLLDFGLIYI 275
Query: 200 RKRKAGRFYPTKLALNMATRGT 221
K +++PT+LA + + +
Sbjct: 276 PSHKPQQYFPTRLATTLTSSSS 297
>gi|269860316|ref|XP_002649880.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
gi|220066720|gb|EED44193.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
Length = 369
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/258 (21%), Positives = 108/258 (41%), Gaps = 41/258 (15%)
Query: 118 ENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYST 177
N IT G +FLL+ Q+WY I++ ++ S+
Sbjct: 146 NNHEITTKGLEFLLMNKNEQIWYLIVE--------------------DIRLSS------- 178
Query: 178 EGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPG------FL 231
+ KL + Q L ++ K+ + N+ + K I F+
Sbjct: 179 -HLHKKLYSYAQVLNGHKII--DKKILNFLVEINVIDNIVDQNLYKYILYKNNTEIQKFI 235
Query: 232 IVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGF 291
+ETNF++Y+YT++ A++ LF +++ P + +L + L GI QI+ +
Sbjct: 236 YLETNFKLYSYTNNLCDRAILDLFSNIVFEVPGMIKSILDEKKIMNTLDRGIKIQQIVEY 295
Query: 292 LRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGV 351
+R H + + L I +W+ +R R+ G LYS F + ++ + + +
Sbjct: 296 IRRHTINNCDQILH-----MIEIWDKQRNRINKQIGYLYSDFTNYNEYRSVLEQIKTDTD 350
Query: 352 LTWQNEKKRTVVVTKQGH 369
L ++NE++R + V + H
Sbjct: 351 LLYKNEEERILFVKNKLH 368
>gi|238570605|ref|XP_002386883.1| hypothetical protein MPER_14683 [Moniliophthora perniciosa FA553]
gi|215440101|gb|EEB87813.1| hypothetical protein MPER_14683 [Moniliophthora perniciosa FA553]
Length = 69
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 35/66 (53%), Positives = 48/66 (72%), Gaps = 5/66 (7%)
Query: 227 EPGFLIVETNFRVYAYTDS-----NLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRS 281
+ GF+++ETN+RVYAYT S L+ A++ LF L YRFPNL VG +TR+SV+ AL +
Sbjct: 4 QQGFIVLETNYRVYAYTGSFSLHNPLQTAILHLFVSLKYRFPNLVVGSITRESVKKALSN 63
Query: 282 GITAAQ 287
GI+A Q
Sbjct: 64 GISADQ 69
>gi|209880858|ref|XP_002141868.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209557474|gb|EEA07519.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 794
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 61/109 (55%), Gaps = 8/109 (7%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
+IV++NFRVY YT S L+ ++ C++ R PN+ GVLTR + +A G+ A QI+
Sbjct: 606 IIVQSNFRVYCYTASPLQAKILRHLCQVKVRGPNIICGVLTRRGLLSAYSMGVKAYQILR 665
Query: 291 FLRLHALPSV--------SCPLPPVVSDQIRLWEGERERLTTSEGVLYS 331
F +A P + + +P V Q++LWE + RL ++ L+S
Sbjct: 666 FFISNAHPIILKKHITDGTSIIPISVETQLKLWENDHNRLEINKVSLFS 714
>gi|440295536|gb|ELP88449.1| TFIIH basal transcription factor complex p52 subunit, putative
[Entamoeba invadens IP1]
Length = 531
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 115/264 (43%), Gaps = 23/264 (8%)
Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKF----STFGTDYST 177
IT+ G+QFL ET Q+W +L + K + + + F+L + + + D T
Sbjct: 181 ITRKGYQFLFQETKTQIWILMLSIFGPAQRKR-RMGKLINEFFELTYLAPETCYNADPFT 239
Query: 178 EGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETN 236
+ S + + G++ Y +++ A P L + +P I ETN
Sbjct: 240 KIYSAAPLILFDMM---GIIAYSKEKNALVVTPLMSLLRSNALVPSDLVTKPR-TITETN 295
Query: 237 FRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHA 296
+ VY YT+S +V L LF + NL VG +T D+V A GIT ++ FL+ +
Sbjct: 296 YTVYIYTESFFQVKLYSLFIRQNLQLTNLCVGRITYDTVTEAFLKGITNEMLVNFLQPN- 354
Query: 297 LPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEK-----LRDYAQDLGV 351
LP + QI LW+ E RL V + + + + +K + A+ +
Sbjct: 355 -------LPKNIQAQIDLWKRELNRLKEVRAVKFRFYEPELEVQKELYHLTKSEAEKMKG 407
Query: 352 LTWQNEKKRTVVVTKQGHSDIKKF 375
+ + E++ T+ V IK+F
Sbjct: 408 VVFYKEEELTLFVRYDVAEKIKEF 431
>gi|294950275|ref|XP_002786548.1| tfiih, polypeptide, putative [Perkinsus marinus ATCC 50983]
gi|239900840|gb|EER18344.1| tfiih, polypeptide, putative [Perkinsus marinus ATCC 50983]
Length = 580
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 105/470 (22%), Positives = 170/470 (36%), Gaps = 150/470 (31%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGK- 60
L+ I K ++ RL+ V +P+ Q VI W+S H +H + L++L + + + G+
Sbjct: 67 LNGIGKVFINRLILVHRPIQQGVIRHWISSLHDSDHSKYRAALSQLTKLHILLMDPKGQH 126
Query: 61 --------------------------PWSMSAKLEVDSKARD-------------LDFLN 81
PW + + +VD D ++ L+
Sbjct: 127 LRMSSNFRQRLHEVFLKNTSDDHQSPPWEILEEAKVDVSKVDGVLTNGGAPRDAGMNRLD 186
Query: 82 QYALERWECILRFMVGSQQTEGISA---DAVRTLFNEGAE---------NPVITKDG--F 127
+YA RWE +L MV +G++A DA +L EGA + T G F
Sbjct: 187 EYAALRWEYVLEAMV-----DGMTAAVGDA--SLRPEGARQGLSGIVGMSSTSTSTGQLF 239
Query: 128 QFLLLETPAQVWYFILKYLETVESK--------GLDLVECLT---------FLFQLKFST 170
QF+L +Q W + + + +E + + +V LT L +
Sbjct: 240 QFVLASRTSQYWELVRRAFDMIERQHGIQSLLAPIRMVVALTRSIKHEPERLLPGTRIRV 299
Query: 171 FGTDYSTEGMSDKLQVFLQHLREFGLVY--------------QRKRKAGRFYPTKLA--- 213
+T G + L+ L + G+V QR+ Y L
Sbjct: 300 NPQCMATPGAKGAVTTLLETLADLGIVVPATYDDELRKVAALQRQYGESVKYELPLVSSD 359
Query: 214 ------------------LNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLF 255
+ M+ T + L V++NF V AYT S+L + L+G F
Sbjct: 360 SRMVVSSLVYVMQASRSKMMMSVDDTTAITNKSCRLFVDSNFAVTAYTTSSLDLRLLGTF 419
Query: 256 CELMYRF-------PN---LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC--- 302
+L + PN +G LT+ SV++A + G+T+ II +L+ H P +
Sbjct: 420 VQLQRQLGDGREYDPNDFGCVLGTLTQSSVQSAAQRGVTSEYIISYLKSHVDPRAAHMGS 479
Query: 303 ----------------------PLPPVVSDQIRLWEGE--RERLTTSEGV 328
+P V QI LWE E RL GV
Sbjct: 480 QGGRSSAATANTGAARGEKFIDGIPANVVTQITLWEREAIHNRLRIDPGV 529
>gi|197107449|pdb|3DOM|A Chain A, Crystal Structure Of The Complex Between Tfb5 And The
C-Terminal Domain Of Tfb2
gi|197107451|pdb|3DOM|C Chain C, Crystal Structure Of The Complex Between Tfb5 And The
C-Terminal Domain Of Tfb2
Length = 108
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 34/72 (47%), Positives = 50/72 (69%)
Query: 304 LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVV 363
LPP V DQIRLW+ E +R+ T EG LYS F + ++ L YAQD+GVL W+++KK+
Sbjct: 28 LPPTVVDQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFF 87
Query: 364 VTKQGHSDIKKF 375
++K+G+S + F
Sbjct: 88 ISKEGNSQVLDF 99
>gi|197107383|pdb|3DGP|A Chain A, Crystal Structure Of The Complex Between Tfb5 And The
C-Terminal Domain Of Tfb2
Length = 80
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/70 (45%), Positives = 48/70 (68%)
Query: 306 PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVT 365
P V DQIRLW+ E +R+ T EG LYS F + ++ L YAQD+GVL W+++KK+ ++
Sbjct: 2 PTVVDQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFIS 61
Query: 366 KQGHSDIKKF 375
K+G+S + F
Sbjct: 62 KEGNSQVLDF 71
>gi|145509120|ref|XP_001440504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407721|emb|CAK73107.1| unnamed protein product [Paramecium tetraurelia]
Length = 400
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 84/185 (45%), Gaps = 32/185 (17%)
Query: 126 GFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQ 185
GF F+L T Q+ + Y+ S + + CL+ L +K + + +
Sbjct: 177 GFSFILSSTHKQINQILKYYVTKQSSNVVRFILCLSVLDPMKIYQMPNEEWQKNV----- 231
Query: 186 VFLQHLREFGLVY---QRKRKAGRFY-----PTKLALNMATRGTLKQIREPGFLIVETNF 237
++ L+EFGL + Q+ R F+ P +++ + +IVE NF
Sbjct: 232 --IKDLQEFGLTHYQDQQMRITFLFWNFLYEPPSVSIGIQCN-----------IIVEANF 278
Query: 238 RVYAYTDS------NLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGF 291
R+YAY +S + L+ LF E+ RF L + L+ S+R A+R + A QII F
Sbjct: 279 RIYAYLNSGDQQEEEILCNLLNLFSEIKKRFKILIIADLSESSIRKAVRENLQAKQIIQF 338
Query: 292 LRLHA 296
L +++
Sbjct: 339 LEMNS 343
>gi|401419872|ref|XP_003874425.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322490661|emb|CBZ25923.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 559
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/71 (49%), Positives = 45/71 (63%), Gaps = 1/71 (1%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFP-NLTVGVLTRDSVRAALRSGITAAQII 289
+I ETNFR+YAYTD + ++ F EL NL +TRDS AA+R GITAAQ++
Sbjct: 359 IITETNFRLYAYTDDADLLNILNQFAELEEIVNGNLHCYRVTRDSFAAAMRKGITAAQVL 418
Query: 290 GFLRLHALPSV 300
FL L A PS+
Sbjct: 419 RFLSLRAHPSM 429
>gi|76156573|gb|AAX27758.2| SJCHGC02916 protein [Schistosoma japonicum]
Length = 141
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 54/105 (51%), Gaps = 22/105 (20%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
EL E+AKH V+RLLF+EQP+P+++++ W+ K + ++ LT LR
Sbjct: 37 ELPELAKHIVMRLLFIEQPIPKSIVSGWVEKGSSALLNDSCSALTVLRIWHSTDTNVSGG 96
Query: 49 ---------RNAKVALLGGGKPWSMSAKLEVDSK-ARDLDFLNQY 83
+ +++L GGGKP + + K ++ +DFL Y
Sbjct: 97 SWSLNKKYQESIRISLFGGGKPLLGDLGIVTNDKYSKSVDFLKSY 141
>gi|146103275|ref|XP_001469523.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134073893|emb|CAM72632.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 557
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 45/71 (63%), Gaps = 1/71 (1%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFP-NLTVGVLTRDSVRAALRSGITAAQII 289
+I ETNFR+YAYTD + ++ F +L NL +TRDS +A+R GITAAQ++
Sbjct: 357 IITETNFRLYAYTDDADLLNILNQFAKLEEIVNGNLHCYRVTRDSFASAMRKGITAAQVL 416
Query: 290 GFLRLHALPSV 300
FL L A PS+
Sbjct: 417 RFLSLRAHPSM 427
>gi|398024082|ref|XP_003865202.1| hypothetical protein, conserved [Leishmania donovani]
gi|322503439|emb|CBZ38524.1| hypothetical protein, conserved [Leishmania donovani]
Length = 557
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 45/71 (63%), Gaps = 1/71 (1%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFP-NLTVGVLTRDSVRAALRSGITAAQII 289
+I ETNFR+YAYTD + ++ F +L NL +TRDS +A+R GITAAQ++
Sbjct: 357 IITETNFRLYAYTDDADLLNILNQFAKLEEIVNGNLHCYRVTRDSFASAMRKGITAAQVL 416
Query: 290 GFLRLHALPSV 300
FL L A PS+
Sbjct: 417 RFLSLRAHPSM 427
>gi|154345556|ref|XP_001568715.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134066057|emb|CAM43844.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 561
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 45/71 (63%), Gaps = 1/71 (1%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFP-NLTVGVLTRDSVRAALRSGITAAQII 289
+I ETNFR+YAYT+ + + ++ F EL NL +TRDS AA+R GITA Q++
Sbjct: 362 IITETNFRLYAYTEDSDLLNILHQFAELEEVVNGNLHCYRVTRDSFAAAMRKGITATQVL 421
Query: 290 GFLRLHALPSV 300
FL L A PS+
Sbjct: 422 RFLSLRAHPSM 432
>gi|261332554|emb|CBH15549.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 500
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 120/316 (37%), Gaps = 63/316 (19%)
Query: 96 VGSQQTEGISAD-AVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFIL----KYLETVE 150
+G ++ E I+A L +E V+T +G F + Q W + + L +
Sbjct: 156 LGGRRNEPIAALLQFSRLIPPPSEERVVTSEGLAFCMQPLQQQWWTLVSVVLDRALAIMS 215
Query: 151 SKGLD---LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVF--LQHLREFGLVYQ---RKR 202
KG+ L + L LF L S + Y D L+ F L L E GLVY
Sbjct: 216 GKGVSRATLWQLLAVLFALNTSDYV--YLFPSKDDDLEAFQLLARLSEVGLVYPLICNGE 273
Query: 203 KAGRFYP--------------TKLALNMATRGTLKQIR--EPGFLIVETNFRVYAYTDSN 246
K P AL TRG ++R + +I ETNFR+YAYT +
Sbjct: 274 KCFVLSPHFHHAICWSSTPSLCTAALLDDTRGPSSRLRREDEDTIITETNFRLYAYTKNP 333
Query: 247 LKVALIGLFCELMYRFPNLTVGV-LTRDSVRAALRSGITAAQIIGFLRLHALPSV----- 300
+ ++ F + V +TR S +AL GI A I+ FL + A PS+
Sbjct: 334 DMLRILDQFAVKEVDVVGMVVCYRVTRASFASALAKGIGANHILQFLTVKAHPSMIKQSN 393
Query: 301 ------SCPLPPVVS--------------------DQIRLWEGERERLTTSEGVLYSQFV 334
SCP+ P S DQ+ WE E RL V+ + +
Sbjct: 394 SEAGDPSCPVLPAASAGFGNTSEYRQGNIIPQSFCDQLFTWERECRRLIFRHDVVLLKNI 453
Query: 335 SQADFEKLRDYAQDLG 350
+ E + + + G
Sbjct: 454 GKEQMEIILNCLSNSG 469
>gi|302831319|ref|XP_002947225.1| hypothetical protein VOLCADRAFT_35893 [Volvox carteri f.
nagariensis]
gi|300267632|gb|EFJ51815.1| hypothetical protein VOLCADRAFT_35893 [Volvox carteri f.
nagariensis]
Length = 90
Score = 55.8 bits (133), Expect = 3e-05, Method: Composition-based stats.
Identities = 30/90 (33%), Positives = 45/90 (50%), Gaps = 3/90 (3%)
Query: 282 GITAAQIIGFLRLHALPSVSCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQAD 338
GI A +++ +L PS++ +P VVSDQIRLWE RL VLY S+
Sbjct: 1 GIAADELVSYLAARPHPSIASRTPVVPEVVSDQIRLWEASMNRLRADSVVLYENLASREL 60
Query: 339 FEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
FE+ +++ G L W++ + V G
Sbjct: 61 FERALAFSRSSGTLLWEDSGQMRFVALDAG 90
>gi|71747422|ref|XP_822766.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70832434|gb|EAN77938.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 500
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 120/316 (37%), Gaps = 63/316 (19%)
Query: 96 VGSQQTEGISAD-AVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFIL----KYLETVE 150
+G ++ E I+A L +E V+T +G F + Q W + + L +
Sbjct: 156 LGGRRNEPIAALLQFSRLIPPPSEERVVTSEGLAFCMQPLQQQWWTLVSVVLDRALAIMS 215
Query: 151 SKGLD---LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVF--LQHLREFGLVYQ---RKR 202
KG+ L + L LF L S + Y D L+ F L L E GLVY
Sbjct: 216 GKGVSRATLWQLLAVLFALNTSDYV--YLFPSKDDDLEAFQLLARLSEVGLVYPLICNGE 273
Query: 203 KAGRFYP--------------TKLALNMATRGTLKQIR--EPGFLIVETNFRVYAYTDSN 246
K P AL TRG ++R + +I ETNFR+YAYT +
Sbjct: 274 KCFVLSPHFHHAICWSSTPSLCTAALLDDTRGPSSRLRREDEDTIITETNFRLYAYTKNP 333
Query: 247 LKVALIGLFCELMYRFPNLTVGV-LTRDSVRAALRSGITAAQIIGFLRLHALPSV----- 300
+ ++ F + V +TR S +AL GI A I+ FL + A PS+
Sbjct: 334 DMLRILDQFAVKEVDVVGMVVCYRVTRASFASALAKGIGANHILQFLTVKAHPSMIKQSN 393
Query: 301 ------SCPLPPVVS--------------------DQIRLWEGERERLTTSEGVLYSQFV 334
SCP+ P S DQ+ WE E RL V+ + +
Sbjct: 394 SEAGDPSCPVLPAASVGFGNTSEYRQGNIIPQSFCDQLFTWERECRRLIFRHDVVLLKNI 453
Query: 335 SQADFEKLRDYAQDLG 350
+ E + + + G
Sbjct: 454 GKEQMEIILNCLSNSG 469
>gi|146161819|ref|XP_001033510.2| hypothetical protein TTHERM_00313290 [Tetrahymena thermophila]
gi|146142827|gb|EAR85847.2| hypothetical protein TTHERM_00313290 [Tetrahymena thermophila
SB210]
Length = 484
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 61/134 (45%), Gaps = 22/134 (16%)
Query: 187 FLQHLREFGLVYQRKRKAGR--FYPTKLALNMATRG-TLKQIREPGFLIVETNFRVYAYT 243
FL L G+V++++ R FY T + R LK + + +ETNF VYAY
Sbjct: 277 FLNDLHSLGMVFKKEILKDRIAFYITPVLWQFCYRKIDLKTLNAK--ITIETNFNVYAYI 334
Query: 244 D-----------SNLK------VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAA 286
D LK L+ FC+L Y+FP+L VG L+ + ++G+TA
Sbjct: 335 DQEDPKEIKDSKDTLKDQTEHIEKLLASFCDLHYKFPHLIVGQLSEQKTKDQFKNGLTAR 394
Query: 287 QIIGFLRLHALPSV 300
+I F + P +
Sbjct: 395 LLIQFFNKTSDPQM 408
>gi|148691278|gb|EDL23225.1| general transcription factor II H, polypeptide 4 [Mus musculus]
Length = 116
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 36/48 (75%)
Query: 1 ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR 48
EL +AK++V+R+LF+EQP+PQA +A W+ K +K +E+T L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLR 95
>gi|157876614|ref|XP_001686652.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68129727|emb|CAJ09033.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 626
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/71 (45%), Positives = 44/71 (61%), Gaps = 1/71 (1%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFP-NLTVGVLTRDSVRAALRSGITAAQII 289
+I ETNFR++AYT+ + ++ F EL NL +TR S AA+R GITAAQ++
Sbjct: 426 IITETNFRLFAYTNDADLLNILNQFAELEEIVNGNLHCYRVTRASFAAAMRKGITAAQVL 485
Query: 290 GFLRLHALPSV 300
FL L A PS+
Sbjct: 486 RFLSLRAHPSM 496
>gi|340057132|emb|CCC51474.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 502
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 84/341 (24%), Positives = 130/341 (38%), Gaps = 71/341 (20%)
Query: 82 QYALERWECILRF-----MVGSQQTEGISAD---AVRTLFNEGAENP----VITKDGFQF 129
Q AL R +LR + GS + G +++ +V F+ P V+T +G F
Sbjct: 131 QNALSRSRNMLRMVTACALAGSLEPLGTNSEQPISVMLQFSNLIPPPPEVGVVTPEGLNF 190
Query: 130 LLLETPAQVWYFILKYLETV----ESKGL---DLVECLTFLFQLKFSTFGTDYSTEGMSD 182
+ Q W + L+ V ++G+ +L + L LF L S F +
Sbjct: 191 CMQPIQQQWWTLVSVALDRVLAVAATQGVLRAELWQLLAVLFALDTSEFVYPFPDREKDF 250
Query: 183 KLQVFLQHLREFGLVYQRKRKAGRFYP-----------------TKLALNMATRGTLKQI 225
L L E GLVY R + +AL +G ++
Sbjct: 251 AAFQLLARLSEVGLVYPLLCDGQRCFVLSPQFHHAVCWNSTAPLCTVALLDDDKGGSYRV 310
Query: 226 R--EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV-LTRDSVRAALRSG 282
R + +I ETNFR+YAYT + +A++ F + V +TR S AALR G
Sbjct: 311 RREDEDTIITETNFRLYAYTRNKHLLAILEQFAIKEAEVDGMIVCFRVTRASFAAALRKG 370
Query: 283 ITAAQIIGFLRLHALPSV-----SCP---------------------------LPPVVSD 310
I A I+ FLR+ A S+ +C +P D
Sbjct: 371 IGAQHIVQFLRVKAHSSMLKHQLTCDPRDSAGLAATSSRVTPTDTPWTHADKIIPRSFCD 430
Query: 311 QIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGV 351
Q+ WE E RLT + ++ + VS + E + D G+
Sbjct: 431 QLFTWERECRRLTFNRALVLLRNVSSSQQELILDCLGHYGL 471
>gi|162605818|ref|XP_001713424.1| hypothetical protein GTHECHR3089 [Guillardia theta]
gi|13794356|gb|AAK39733.1|AF083031_90 hypothetical protein [Guillardia theta]
Length = 180
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/149 (21%), Positives = 65/149 (43%), Gaps = 14/149 (9%)
Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
+I+E NFR+Y + + + + +Y P VG + + R + G+T+ I+
Sbjct: 45 IIIEVNFRIYIFKYFDFSNSFFKFISKKIYSLPYFYVGEINENYSRRLFQLGVTSDNILK 104
Query: 291 FLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLG 350
F++ L +P D++R+WE ++ G L +R+Y ++
Sbjct: 105 FIK-KNLHYTCNKIPSTFEDRLRIWEISFKKKYFFGGFL------------MRNYNSNI- 150
Query: 351 VLTWQNEKKRTVVVTKQGHSDIKKFWVNH 379
+ + K + V TK+ + I K +N+
Sbjct: 151 FFSLKKTKSISYVKTKKHLAIISKIPINY 179
>gi|340508252|gb|EGR33999.1| hypothetical protein IMG5_027880 [Ichthyophthirius multifiliis]
Length = 454
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 85/197 (43%), Gaps = 40/197 (20%)
Query: 128 QFLLLETPAQVWYFILKY------LETVESKGLDLVECLTFLFQLK-----FSTFGTDYS 176
+F+L +T AQ+ + Y +E +E + D++ + L L F + ++Y+
Sbjct: 189 EFILQDTNAQIHQILFYYCQVFAQMENIEEE--DIINFIITLSNLDVNKTYFFNYRSEYA 246
Query: 177 TEGMSDKLQVF----------LQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT-LKQI 225
+ D+ F L ++EF L Q+K F T L R +K I
Sbjct: 247 KKQNIDQKFTFELKILSDLAKLGMIKEFDLGQQKKV----FGITPLIWQFCYRSIDIKTI 302
Query: 226 REPGFLIVETNFRVYAYTDSNLKVA----------LIGLFCELMYRFPNLTVGVLTRDSV 275
+IVETNF +YAY D N + L+ F ++ Y FP+L V LT +
Sbjct: 303 NAK--IIVETNFNLYAYLDYNPQNKTFSESKYIRDLLKKFSKIHYTFPHLIVAQLTEAKM 360
Query: 276 RAALRSGITAAQIIGFL 292
+ A GIT+ +I F
Sbjct: 361 KQAFNQGITSKLLIEFF 377
>gi|342184190|emb|CCC93671.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 499
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 68/235 (28%), Positives = 93/235 (39%), Gaps = 32/235 (13%)
Query: 96 VGSQQTEGISAD-AVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETV----E 150
+G ++ E ISA L +E IT +G F + Q W + L+ V
Sbjct: 155 LGERRGEPISAMLHFSRLIPPPSEAARITCEGLSFCMQSLQQQWWTLVSVALDRVLTLTS 214
Query: 151 SKGLD---LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVF--LQHLREFGLVYQ---RKR 202
KG+ L + L LF L S + Y + L+ F L L E GLVY R
Sbjct: 215 GKGVTRATLWQLLAVLFALDASDYV--YPFPSKEEDLEAFHLLARLSEVGLVYPLLCNGR 272
Query: 203 KAGRFYP--------------TKLALNMATRGTLKQIR--EPGFLIVETNFRVYAYTDSN 246
K P AL T +R + +I ETNFR+YAYT +
Sbjct: 273 KCFVLSPHFHHAICWSSVPSLCAAALLDNPNQTPCPLRREDEDTIITETNFRLYAYTRNP 332
Query: 247 LKVALIGLFCELMYRFPNLTVGV-LTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
+ ++ F + V +TR S ALR GI A I+ FL L A PS+
Sbjct: 333 DLLGILDQFAVKEIDIDGIIVCYRVTRSSFALALRKGIDAKHILQFLTLKAHPSM 387
>gi|430746993|ref|YP_007206122.1| hypothetical protein Sinac_6345 [Singulisphaera acidiphila DSM
18658]
gi|430018713|gb|AGA30427.1| hypothetical protein Sinac_6345 [Singulisphaera acidiphila DSM
18658]
Length = 790
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 55/118 (46%), Gaps = 15/118 (12%)
Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLT-VGV-----LTRDSVRAALRSGI 283
FL V+ NF + AY L LIG F RF + VG LT +SV L G+
Sbjct: 450 FLYVQPNFEIIAYR-QGLTPPLIGQFS----RFALWSQVGAALELKLTPESVYRGLEGGL 504
Query: 284 TAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEK 341
T +++ L H+ LP VS+ +R W G RER+T +F ++ D EK
Sbjct: 505 TPPEMLDRLARHS----QRVLPAGVSEAVRTWAGRRERITYYASATLVEFATKEDLEK 558
>gi|407397449|gb|EKF27748.1| DNA repair and transcription factor protein [Trypanosoma cruzi
marinkellei]
Length = 501
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 69/298 (23%), Positives = 108/298 (36%), Gaps = 59/298 (19%)
Query: 112 LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETV----ESKGL---DLVECLTFLF 164
L +E IT +G F + Q W + L+ V G+ +L + L LF
Sbjct: 173 LVPASSETREITSEGLSFCMQPLQQQWWTLVSVALDRVLILTNKSGVARAELWQLLAVLF 232
Query: 165 QLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY---------------- 208
L S F + + L L E GLVY R +
Sbjct: 233 ALDTSQFVYQFPEKEKDLAAFQILARLSEVGLVYPLISGGHRCFVLSPHFHHAVCWSSTT 292
Query: 209 PTKLALNMATRG-TLKQIR--EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
P A + + G +L ++R + +I E NFR+YAYT + + ++ F E +
Sbjct: 293 PLCTAALLDSEGNSLGRVRREDEDTIITEANFRLYAYTKNPDLLNILNQFAERDAEIDQM 352
Query: 266 TVGV-LTRDSVRAALRSGITAAQIIGFLRLHALPS------------------------- 299
+TR + AAL+ GI ++ I+ FL + A PS
Sbjct: 353 IACYRVTRKTFAAALKRGIGSSHILQFLAVKAHPSMLRNHRDGDSKKTSGLSVLGAGTRF 412
Query: 300 -------VSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLG 350
+ +P DQ+ WE E RL S ++ + VS E L ++ G
Sbjct: 413 KNTSDIRIDGIIPQSFCDQLMTWEKECRRLIFSRNLVLLRNVSAMQQELLLNFLSSGG 470
>gi|71650598|ref|XP_813994.1| DNA repair and transcription factor protein [Trypanosoma cruzi
strain CL Brener]
gi|70878928|gb|EAN92143.1| DNA repair and transcription factor protein, putative [Trypanosoma
cruzi]
Length = 501
Score = 44.7 bits (104), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 89/216 (41%), Gaps = 27/216 (12%)
Query: 112 LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETV----ESKGL---DLVECLTFLF 164
L +E IT +G F + Q W + L+ + ++ G+ +L + L LF
Sbjct: 173 LVPTSSETREITSEGLSFCMQPLQQQWWTLVSVALDRILVLTKNSGVTRAELWQLLAVLF 232
Query: 165 QLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY---------------- 208
L S F + + L L E GLVY R +
Sbjct: 233 ALNTSQFVYPFPEKEKDLAAFQILARLSEVGLVYPLISGGRRCFALSPHFHHAVCWSSTA 292
Query: 209 PTKLALNMATRG-TLKQIR--EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
P A + + G ++ ++R + +I E NFR+YAYT + + ++ F E +
Sbjct: 293 PLCTAALLDSGGNSMGRVRREDEDTIITEANFRLYAYTKNPDLLNILNQFAERDAEVDQM 352
Query: 266 TVGV-LTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
+TR + AAL+ GI ++ I+ FL + A PS+
Sbjct: 353 IACYRVTRRTFAAALKRGIGSSHILQFLAVKAHPSM 388
>gi|71425201|ref|XP_813043.1| DNA repair and transcription factor protein [Trypanosoma cruzi
strain CL Brener]
gi|70877891|gb|EAN91192.1| DNA repair and transcription factor protein, putative [Trypanosoma
cruzi]
Length = 501
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 89/216 (41%), Gaps = 27/216 (12%)
Query: 112 LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETV----ESKGL---DLVECLTFLF 164
L +E IT +G F + Q W + L+ + ++ G+ +L + L LF
Sbjct: 173 LVPTSSETREITSEGLSFCMQPLQQQWWTLVSVALDRILVLTKNSGVTRAELWQLLAVLF 232
Query: 165 QLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY---------------- 208
L S F + + L L E GLVY R +
Sbjct: 233 ALDTSQFVYPFPEKEKDLAAFQILARLSEVGLVYPLISGGRRCFALSPHFHHAVCWSSTA 292
Query: 209 PTKLALNMATRG-TLKQIR--EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
P A + + G ++ ++R + +I E NFR+YAYT + + ++ F E +
Sbjct: 293 PLCTAALLDSGGNSMGRVRREDEDTIITEANFRLYAYTRNPDLLNILNQFAERDAEVDQM 352
Query: 266 TVGV-LTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
+TR + AAL+ GI ++ I+ FL + A PS+
Sbjct: 353 IACYRVTRRTFAAALKRGIGSSHILQFLAVKAHPSM 388
>gi|407835017|gb|EKF99108.1| DNA repair and transcription factor protein, putative [Trypanosoma
cruzi]
Length = 501
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 88/216 (40%), Gaps = 27/216 (12%)
Query: 112 LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETV----ESKGL---DLVECLTFLF 164
L +E IT +G F + Q W + L+ + ++ G+ +L + L LF
Sbjct: 173 LVPTSSETREITSEGLSFCMQPLQQQWWTLVSVALDRILVLTKNSGVTRAELWQLLAVLF 232
Query: 165 QLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY---------------- 208
L S F + + L L E GLVY R +
Sbjct: 233 ALDTSQFVYPFPEKEKDLAAFQILARLSEVGLVYPLISGGRRCFALSPHFHHAVCWSSTA 292
Query: 209 PTKLALNMATRG-TLKQIR--EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
P A + + G ++ ++R + +I E NFR+YAYT + + ++ F E +
Sbjct: 293 PLCTAALLDSGGDSMGRVRREDEDTIITEANFRLYAYTRNPDLLNILNQFAERDAEVDQM 352
Query: 266 TVGV-LTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
+TR + AAL+ GI + I+ FL + A PS+
Sbjct: 353 IACYRVTRRTFAAALKRGIGSLHILQFLAVKAHPSM 388
>gi|300122496|emb|CBK23066.2| unnamed protein product [Blastocystis hominis]
Length = 339
Score = 42.0 bits (97), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 40/155 (25%), Positives = 68/155 (43%), Gaps = 23/155 (14%)
Query: 121 VITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGM 180
IT G+QFL+ Q W+ + ++ + + + L+FLF L+ Y +
Sbjct: 182 AITATGYQFLMKSYRDQAWWLLHPHITSANA-----LPALSFLFSLRHMIPCRAYDVAAL 236
Query: 181 SDKLQVFLQHLREFGLVYQRKRKAGR-FYPTKLALNM-----ATRGT-LKQIREPGFL-- 231
S+ + L+ R G+VY R F+ T L + A+RG ++ GF+
Sbjct: 237 SEPQRQLLEIARIAGVVYTPTTAGKRVFFATPLGAALFTQEGASRGEGGRKSGGEGFMKA 296
Query: 232 ------IVETNFRVYAY---TDSNLKVALIGLFCE 257
++E NF VYA+ ++ L V L+ F E
Sbjct: 297 EGSMRVVIEANFHVYAFGLLAENRLHVQLLRQFLE 331
>gi|2245059|emb|CAB10482.1| transcription factor like protein [Arabidopsis thaliana]
gi|7268453|emb|CAB80973.1| transcription factor like protein [Arabidopsis thaliana]
Length = 101
Score = 41.2 bits (95), Expect = 0.86, Method: Composition-based stats.
Identities = 16/47 (34%), Positives = 29/47 (61%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR 48
L +AK YV+++L+++ PVP ++ W+ +H+ A D L +LR
Sbjct: 40 LPPLAKKYVLQMLYIDVPVPATMMEEWVLADGTSKHRVAIDRLIQLR 86
>gi|320103368|ref|YP_004178959.1| hypothetical protein Isop_1828 [Isosphaera pallida ATCC 43644]
gi|319750650|gb|ADV62410.1| hypothetical protein Isop_1828 [Isosphaera pallida ATCC 43644]
Length = 961
Score = 41.2 bits (95), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 49/115 (42%), Gaps = 9/115 (7%)
Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV---LTRDSVRAALRSGITAA 286
FL ++ NF + AY L IG L R+ L +T DS+ L G++
Sbjct: 496 FLNIQPNFEMIAYR-QGLNPGAIGKLT-LFGRWTKLGGAAELKVTEDSLTLGLHGGLSVD 553
Query: 287 QIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEK 341
I+ LR H+ PL P + +R W RER+T +F + D E+
Sbjct: 554 AILERLRRHS----PRPLAPALERTLRGWSARRERITLHTAATLLEFATPEDLEE 604
>gi|90398991|emb|CAJ86263.1| H0801D08.21 [Oryza sativa Indica Group]
Length = 119
Score = 40.0 bits (92), Expect = 1.8, Method: Composition-based stats.
Identities = 17/47 (36%), Positives = 28/47 (59%)
Query: 2 LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR 48
L +AK Y +++L+V PV A + W+ +A +H+ A D L +LR
Sbjct: 40 LPPLAKKYALQMLYVSAPVAAAAMEEWVLDEYAAKHRVAIDRLLQLR 86
>gi|393786717|ref|ZP_10374849.1| hypothetical protein HMPREF1068_01129 [Bacteroides nordii CL02T12C05]
gi|392657952|gb|EIY51582.1| hypothetical protein HMPREF1068_01129 [Bacteroides nordii CL02T12C05]
Length = 1885
Score = 38.9 bits (89), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 69/148 (46%), Gaps = 24/148 (16%)
Query: 10 VIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLE 69
+ RL ++QP SW + T+ + EL NA++A+L G KP LE
Sbjct: 1433 LTRLKELQQP-----DGSWAWYKGMNGSRYITNYIVEL--NARLAMLTG-KP------LE 1478
Query: 70 VDS---KARDLDFLNQYALERWECILRFMVGSQQTEGISADAVRTLFNEGAENPVITKDG 126
D+ + L FLN+ ALE ++ I + + +G+SA A++ ++ +I G
Sbjct: 1479 GDALVMQRTALSFLNKEALEEYKNIRKAEKNGDKVKGLSASALKYIY-------LIAISG 1531
Query: 127 FQFLLLETPAQVWYFILKYLETVESKGL 154
Q L E YF+ K ET+ ++ +
Sbjct: 1532 EQIPLAENQTAYNYFLSKVTETLTTRSM 1559
>gi|359320939|ref|XP_003431754.2| PREDICTED: uncharacterized protein LOC612235 [Canis lupus
familiaris]
Length = 1766
Score = 38.1 bits (87), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 31/50 (62%)
Query: 318 ERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQ 367
ERE LT GVL S F D E+LRD AQ++ +L ++K +++ + +Q
Sbjct: 133 EREELTPQLGVLLSHFNIPYDLEELRDAAQEIELLNLVSQKFQSIFMEQQ 182
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.136 0.401
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,805,996,948
Number of Sequences: 23463169
Number of extensions: 231505562
Number of successful extensions: 513480
Number of sequences better than 100.0: 415
Number of HSP's better than 100.0 without gapping: 398
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 511611
Number of HSP's gapped (non-prelim): 551
length of query: 386
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 242
effective length of database: 8,980,499,031
effective search space: 2173280765502
effective search space used: 2173280765502
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)